Mohan Kumar

mohan.cbein@gmail.com

10109 Ridgeway Dr
Cupertino CA 95014
(650) 709-4897

About me

I am a Research Scientist at Meta Reality Labs, where I work on accelerating machine learning inference on embedded devices. I optimize and deploy a range of ML architectures, including CNNs, RCNNs, RNNs, and transformers, on resource-constrained hardware. I use PyTorch and related frameworks to develop efficient model implementations and quantization techniques for edge deployment. I also partner with SoC vendors to define NPU/eNPU specifications that make on-device ML inference more efficient.

I received my Ph.D. in Computer Science from the Georgia Institute of Technology, where I specialized in systems and was advised by Dr. Taesoo Kim. My thesis, "Taming Latency In Data Center Applications", is available here. I also hold an M.S. in Computer Science from Georgia Tech and a B.E. in Computer Science from the University of Madras.

Publications

Posters