|
Khurram Khalil
I am a PhD student in Computer Science at the
University of Missouri-Columbia, working on
GenAI optimization and AI hardware security.
My research focuses on efficient model compression, adversarial robustness,
formal verification, and hardware-aware deployment for edge and cyber-physical AI systems.
I build methods that make large language and vision-language models smaller, faster,
more reliable, and safer to deploy under real constraints: energy, latency, fairness,
thermal safety, accelerator faults, and hardware-induced failures.
Research thesis: The next generation of useful AI systems will be judged
not only by capability, but by whether their internal representations are predictable,
their deployment costs are controlled, and their hardware-level failure modes are understood.
Email
CV
Scholar
GitHub
LinkedIn
Research
Publications
Projects
Service
Updates
Blog
CV
|
|
Research
My work currently centers on three connected directions:
Predictive Geometry of LLM Representations,
GenAI Optimization, and
AI Hardware Security.
|
Recent Writing
I will use the blog for short notes on GenAI optimization, AI hardware security, papers,
experiments, and research engineering. See all blog posts.
How to Evaluate GenAI Optimization Beyond Demo Quality
Planned essay · GenAI Optimization
Evaluation sets, latency budgets, regression testing, and measuring whether an optimization actually helps.
What Bit-Flip Attacks Reveal About Multimodal LLMs
Planned essay · AI Hardware Security
A practical note on model faults, accelerator reliability, and security-aware evaluation.
|
News
2026 FlipLLM accepted at IEEE HOST 2026.
2026 RIFT accepted at DATE 2026 and nominated for best paper award.
2026 Received Best PhD Student in ECE Department.
2026 Paper selected for presentation at Meta workshop in HRI 2026.
2026 IEEE HOST NSF Travel Funding Support.
2025 IEEE CEDA and CASS Travel Awards.
All updates
|
|
Predictive Geometry
|
Predictive Geometry of LLM Representations
Khurram Khalil
Attention sinks, semantic trajectories, representation geometry, hallucination prediction.
I am increasingly interested in whether the internal geometry of LLM representations can
predict model behavior before the final answer appears. Read more.
|
|
GenAI Optimization
|
Specification-guided compression and deployment of GenAI systems
Khurram Khalil
LLM/VLM compression, edge AI, temporal logic, energy-aware design, efficient attention.
I study how generative models can be compressed and deployed while preserving useful behavior
under constraints such as energy, latency, fairness, and runtime safety. Read more.
|
|
Hardware Security
|
AI hardware security, fault attacks, and secure model deployment
Khurram Khalil
Bit-flip attacks on LLMs, accelerator faults, approximate DNN faults, hardware-aware robustness.
I evaluate how AI models fail under hardware faults and accelerator-level attacks,
and design methods to make these systems more resilient under deployment constraints. Read more.
|
HOST 2026 |
FlipLLM: Efficient Bit-Flip Attacks on Multimodal LLMs using Reinforcement Learning
K. Khalil, K. A. Hoque
IEEE International Symposium on Hardware Oriented Security and Trust, 2026. Accepted.
|
DATE 2026 |
RIFT: A Scalable Methodology for LLM Accelerator Fault Assessment using Reinforcement Learning
K. Khalil, K. A. Hoque
Design, Automation and Test in Europe, 2026. Accepted. Best paper nomination.
|
ICCAD 2025 |
TOGGLE: Temporal Logic-Guided Large Language Model Compression for Edge
K. Khalil, K. A. Hoque
IEEE/ACM International Conference on Computer-Aided Design, 2025.
doi
|
More
For projects, full publications, academic service, CV, and longer writing, use the links at the top of this page.
|
|