Khurram Khalil

I am a PhD student in Computer Science at the University of Missouri-Columbia, working on GenAI optimization and AI hardware security. My research focuses on efficient model compression, adversarial robustness, formal verification, and hardware-aware deployment for edge and cyber-physical AI systems.

I build methods that make large language and vision-language models smaller, faster, more reliable, and safer to deploy under real constraints: energy, latency, fairness, thermal safety, accelerator faults, and hardware-induced failures.

Research thesis: The next generation of useful AI systems will be judged not only by capability, but by whether their internal representations are predictable, their deployment costs are controlled, and their hardware-level failure modes are understood.

Email CV Scholar GitHub LinkedIn

Research Publications Projects Service Updates Blog CV

Research

My work currently centers on three connected directions: Predictive Geometry of LLM Representations, GenAI Optimization, and AI Hardware Security.

Recent Writing

I will use the blog for short notes on GenAI optimization, AI hardware security, papers, experiments, and research engineering. See all blog posts.

How to Evaluate GenAI Optimization Beyond Demo Quality
Planned essay · GenAI Optimization

Evaluation sets, latency budgets, regression testing, and measuring whether an optimization actually helps.

What Bit-Flip Attacks Reveal About Multimodal LLMs
Planned essay · AI Hardware Security

A practical note on model faults, accelerator reliability, and security-aware evaluation.

News

2026 FlipLLM accepted at IEEE HOST 2026.

2026 RIFT accepted at DATE 2026 and nominated for best paper award.

2026 Received Best PhD Student in ECE Department.

2026 Paper selected for presentation at Meta workshop in HRI 2026.

2026 IEEE HOST NSF Travel Funding Support.

2025 IEEE CEDA and CASS Travel Awards.

All updates

Predictive Geometry	Predictive Geometry of LLM Representations Khurram Khalil Attention sinks, semantic trajectories, representation geometry, hallucination prediction. I am increasingly interested in whether the internal geometry of LLM representations can predict model behavior before the final answer appears. Read more.
GenAI Optimization	Specification-guided compression and deployment of GenAI systems Khurram Khalil LLM/VLM compression, edge AI, temporal logic, energy-aware design, efficient attention. I study how generative models can be compressed and deployed while preserving useful behavior under constraints such as energy, latency, fairness, and runtime safety. Read more.
Hardware Security	AI hardware security, fault attacks, and secure model deployment Khurram Khalil Bit-flip attacks on LLMs, accelerator faults, approximate DNN faults, hardware-aware robustness. I evaluate how AI models fail under hardware faults and accelerator-level attacks, and design methods to make these systems more resilient under deployment constraints. Read more.

Selected Publications

HOST 2026	FlipLLM: Efficient Bit-Flip Attacks on Multimodal LLMs using Reinforcement Learning K. Khalil, K. A. Hoque IEEE International Symposium on Hardware Oriented Security and Trust, 2026. Accepted.
DATE 2026	RIFT: A Scalable Methodology for LLM Accelerator Fault Assessment using Reinforcement Learning K. Khalil, K. A. Hoque Design, Automation and Test in Europe, 2026. Accepted. Best paper nomination.
ICCAD 2025	TOGGLE: Temporal Logic-Guided Large Language Model Compression for Edge K. Khalil, K. A. Hoque IEEE/ACM International Conference on Computer-Aided Design, 2025. doi

For projects, full publications, academic service, CV, and longer writing, use the links at the top of this page.

Research

Recent Writing

News

Selected Publications

More