I am a Machine Learning Engineer at Samsung Research (AI System Team). Previously, I worked at Google (CoreML Team) as a Student Researcher Intern.
My research interests lie in Model Compression and On-Device Personalization. Recently, I have been exploring ways to internalize retrieval augmented generation (e.g., GraphRAG) on edge devices for personalized AI. I have experience developing quantization methods that significantly reduce model size and latency while maintaining accuracy.