Anil Kag

Senior Research Scientist, Snap Research.

anil_kag.jpg

anilkagak2 @ gmail

Los Angeles, California, USA

I am a Senior Research Scientist and the Lead of the Efficient Generative AI team within the Creative Vision group at Snap Research. My research focuses on the intersection of high-fidelity generative modeling and large-scale efficiency—specifically architecting text, image, video, and audio generation models that are both state-of-the-art and deployable at a global scale. Since joining Snap in 2023, my work has transitioned from academic breakthroughs at top-tier venues like CVPR, NeurIPS, ICML, and ICLR to core product features. This includes Snap Video and SnapGen, which power AI Lenses used by millions of Snapchatters daily. My research also powers personalized image generation and lens creation in Easy Lens. My contributions are currently supported by 9 United States patent applications. You can find a thematic breakdown of my research, patents, and publications in this consolidated research overview.

I earned my Ph.D. in Electrical and Computer Engineering from Boston University, where I was a Hariri Graduate Student Fellow in Prof. Venkatesh Saligrama’s lab specializing in constrained learning and efficient neural architectures. Prior to my doctoral studies, I served as a Research Fellow at Microsoft Research India within the Machine Learning & Optimization Group, collaborating with Dr. Manik Varma on Extreme Multi-label Classification (XMC). I hold a B.Tech. in Computer Science from IIT Guwahati. My early professional experience also includes two years as a Software Engineer at Microsoft India, where I optimized core performance for Dynamics CRM.

Opportunities

  • Internships: I am always looking for exceptional Ph.D. students for full-time research internships on my team at Snap Research. If you are passionate about pushing the boundaries of efficient GenAI (on-device or server), please reach out.
  • Collaborations: I am always open to discussing new frontiers in efficient generative modeling—from elastic architecture design to novel training algorithms. I have listed some of my current perspectives in my research statement.

news

Feb 20, 2026 Three papers accepted to CVPR 2026: S2DiT, ELITE, and Omni-Attribute.
Jan 26, 2026 Our paper SPRINT has been accepted to ICLR 2026.
Jan 13, 2026 New preprints available on arXiv: SnapGen++, Diffusion-DRF, and H3AE.
Sep 18, 2025 Two papers accepted to NeurIPS 2025: PointVid and DenseDPO.
Jun 25, 2025 Our paper RankDPO has been accepted to ICCV 2025.
More news...

selected publications

  1. arXiv
    SnapGen++: Unleashing Diffusion Transformers for Efficient High-Fidelity Image Generation on Edge Devices
    D. Hu, A. Gupta, M. Gabidolla, A. Sahni, H. Coskun, Y. Li, Y. Idelbayev, A. Mahmood, A. Lebedev, D. Lahiri, A. Goyal, J. Hu, M. Gong, S. Tulyakov, and Anil Kag
    In , 2026
  2. ICLR
    SPRINT: Sparse-Dense Residual Fusion for Efficient Diffusion Transformers
    D. Park, M. Haji-Ali, Y. Li, W. Menapace, S. Tulyakov, H. Kim, A. Siarohin, and Anil Kag
    In International Conference on Learning Representations, 2026
  3. CVPR
    S2DiT: Sandwich Diffusion Transformer for Mobile Streaming Video Generation
    L. Zhao, Y. Wu, A. Lebedev, D. Lahiri, M. Dong, A. Sahni, M. Vasilkovsky, H. Chen, J. Hu, A. Siarohin, S. Tulyakov, Y. Wang, Anil Kag, and Y. Li
    In IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2026
  4. CVPR
    SnapGen: Taming High-Resolution Text-to-Image Models for Mobile Devices with Efficient Architectures
    J. Chen, D. Hu, X. Huang, H. Coskun, A. Sahni, A. Gupta, A. Goyal, D. Lahiri, R. Singh, Y. Idelbayev, J. Cao, Y. Li, K. Cheng, S. Chan, M. Gong, S. Tulyakov, Y. Xu, J. Ren, and Anil Kag
    In IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025
  5. CVPR
    Snap Video: Scaled Spatiotemporal Transformers for Text-to-Video Synthesis
    W. Menapace, A. Siarohin, I. Skorokhodov, E. Deyneka, T. S. Chen, Anil Kag, Y. Fang, A. Stoliar, E. Ricci, J. Ren, and S. Tulyakov
    In IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024
  6. ICLR
    Efficient Edge Inference by Selective Query
    Anil Kag, Igor Fedorov, Aditya Gangrade, Paul Whatmough, and Venkatesh Saligrama
    In International Conference on Learning Representations, 2023
  7. ICML
    Training Recurrent Neural Networks via Forward Propagation Through Time
    Anil Kag and Venkatesh Saligrama
    In International Conference on Machine Learning, 2021