Yatian Pang

Research Scientist, NVIDIA

👋 About Me

Greetings!

I am a Research Scientist at NVIDIA Cosmos Lab. My long-term goal is to build general-purpose AI systems that seamlessly bridge multi-modal understanding and generation, advancing the frontiers of world models and physical AI.

I obtained my Ph.D. at the National University of Singapore. I feel incredibly lucky to be supervised by Prof. E. H. Francis Tay, collaborate closely with Prof. Li Yuan from PKU, and gain valuable experience working with Alibaba's Qwen, Everlyn AI, and A*STAR.

🔥 News

Jun 2026 🚀 Cosmos 3 released!
Jan 2026 🎓 Officially Dr. P!
Dec 2025 🎉 Join NVIDIA as Research Scientist!

🎨 Selected Works

🌟 Cosmos 3

NVIDIA Cosmos Team, Yatian Pang (2026)

Cosmos 3, a family of omnimodal world models designed to jointly process and generate language, image, video, audio, and action sequences within a unified mixture-of-transformers architecture.

[Project Page] [Code]

🌟 Qwen3-VL

Qwen Team, Yatian Pang (2025)

One of the most popular open-sourced VLMs. Key contributions to long and streaming video understanding within the Qwen3-VL framework, achieving SOTA results across multiple video benchmarks.

[Project Page] [Code]

🌟 Open-Sora-Plan

Open-Sora-Plan Team, Yatian Pang (2024)

Pioneering open-source effort to reproduce Sora. Responsible for the high-resolution, long-duration video generation architecture and curating open-source high-quality video datasets.

[arXiv link] [Code]