Anyi Rao

Bio

Anyi Rao is an Assistant Professor at the Hong Kong University of Science and Technology (HKUST), leading the Multimedia Creativity Lab (MMLab@HKUST). He studies human AI, generative AI, agentic AI, computer vision, and computer graphics. His works include ControlNet, AnimateDiff, MovieNet, Virtual Studio, and IC-Light, with a best paper award in computer vision (Marr Prize ICCV best paper award), a best paper award in human-computer interaction (ACM DIS best paper award), and a best ai film award. These works have been widely used in industry, including Amazon, Netflix, Tencent, and more.

He was a Postdoctoral Scholar at Stanford with Maneesh Agrawala. He received the Ph.D. at MMLab, Chinese University of Hong Kong with Dahua Lin and Bolei Zhou. He has research experiences at Meta Reality Lab, Vector Institute, University of Toronto, and University of Hong Kong. He serves as an area chair/TPC of CVPR, ICLR, SIGGRAPH Asia, NeurIPS, UIST, AAAI and co-chair of MMSys, UIST, VINCI, CVM. He is the founding chair of the Hong Kong HKUST AI Film Festival and the Paris ShortFest AI Film Festival.

He is the recipient of the Microsoft Research Asia StarTrack Scholar, Forbes 30 Under 30 Asia, the Rising Star Award at the World Artificial Intelligence Conference, and the Brown Magic Grant.

Actively looking for highly motivated students to join the group. See openings for more details. Please fill out this 2027 form for 2027 intake and send an email if you are interested in.

Full Biography

Anyi Rao is an Assistant Professor in the Division of Arts and Machine Creativity (AMC), the Department of Computer Science and Engineering (CSE), and the Division of Emerging Interdisciplinary Areas (EMIA) at the Academy of Interdisciplinary Studies (AIS) at the Hong Kong University of Science and Technology (HKUST), jointly appointed in the Computational Media and Arts (CMA), HKUST (GZ). He is the Director of Multimedia Creativity Lab (MMLab@HKUST), and the Associate Director of HKUST Media Intelligence Research Center.

He studies human AI, generative AI, agentic AI, computer vision, and computer graphics, focusing on the creation, editing and understanding of art, media and film, aiming to build human-AI collaborative intelligence and unleash human creativity and productivity. His works include ControlNet, AnimateDiff, MovieNet, Virtual Studio, and IC-Light, with a best paper award in computer vision (Marr Prize ICCV best paper award), a best paper award in human computer interaction (ACM DIS best paper award), and a best ai film award. These works have been widely used in industry, including Amazon, Netflix, Tencent, and more.

He has been selected as the Microsoft Research Asia StarTrack Scholar 2026, been featured in the Forbes 30 Under 30 Asia 2025 List, won the Rising Star Award at the World Artificial Intelligence Conference 2024, and hosted the Brown Media Innovation Research Fund and the Amazon Video Research Fund, etc. He gave keynote at the Beijing Film Festival, the Golden Rooster Film Festival, the Shanghai Television Magnolia Festival, was featured by Shanghai TV Financial Channel and Hong Kong Cable Television.

News

2026-06: Four papers are accepted to ECCV 2026.

2026-06: We are organizing the Workshop on AI for Creative Visual Content Generation Editing and Understanding 10th editon at SIGGRAPH 2026. Please follow our Twitter for more information!

2026-05: DataSway won the 🏆 Best Paper Award in ACM DIS 2026 and one paper is accepted to TPAMI 2026.

2026-03: Five papers are accepted to SIGGRAPH Journal TOG 2026, CVPR 2026, ICLR 2026, and CHI 2026.

2026-03: We are organizing the Workshop on AI for Creative Visual Content Generation Editing and Understanding 9th editon at CVPR 2026, the 2nd HKUST AI Film Festival on May 16, and the 1st Workshop on Agentic AI for Visual Media at CVPR 2026.

2025-07: CineVision is accepted to UIST 2025 and Light-A-Video is accepted to ICCV 2025.

2025-08: We are organizing the Workshop on AI for Creative Visual Content Generation Editing and Understanding 8th edtion at SIGGRAPH Asia 2025. 7th edtion at SIGGRAPH 2025. This is the first year that SIGGRAPH has a technical workshop program.

2025-04: Chair and curate the 1st HKUST AI Film Festival with CVM 2025 on April 19.

2025-02: IC-Light is accepted to ICLR 2025 as Oral. And one paper is accepted to CVPR 2025.

2024-08: Three paper are accepted to UIST 2024, ICML 2024 and ECCV 2024.

2023-11: AnimateDiff is online and gets an update on SparseCtrl ability.

2023-10: 🧑‍🎨 ControlNet receives the 🏆 Best Paper Award (Marr Prize) at ICCV 2023. V1 V1.1 A1111 WebUI

2023-06: We are organizing 🍿 inaugural Paris ShortFest AI Film Festival, jointly with ICCV 2023 in Paris, France.

2025-02: We are organizing the Sixth Workshop on AI for Creative Visual Content Generation Editing and Understanding at CVPR 2025.

2024-08: We are organizing the Course on Generative Models for Visual Content Editing and Creation at SIGGRAPH 2024.

2024-02: Cinematic Behavior Transfer is accepted to CVPR 2024. Our efforts on Intelligent Cinematography 🎬 Virtual Film Studio include our SIGGRAPH Virtual Dynamic Storyboard, CVPR Cinematic Behavior Transfer and ECCV Multi-camera Editing.

2024-01: We are organizing the Fourth Workshop on AI for Creative Visual Content Generation Editing and Understanding at CVPR 2024.

2023-08: Three papers are accepted to ICCV 2023 as Oral, UIST 2023, and ACM MM 2023.

2023-06: We are organizing the Third Workshop on AI for Creative Video Editing and Understanding at ICCV 2023 in Paris

2023-04: Two papers are accepted to AAAI 2023 as Oral and IJCAI 2023.

2022-10: We are organizing the Second Workshop on AI for Creative Video Editing and Understanding at ECCV 2022.

2022-07: Two papers on 👷 City-Super Research: CityNeRF and Shoot360 are accepted to ECCV 2022 and SIGGRAPH 2022.

2022-03: Two papers are accepted to CVPR 2022 and IEEE Transactions on Multimedia.

2021-09: We are organizing the First Workshop on AI for Creative Video Editing and Understanding during ICCV 2021.

2021-07: Two papers are accepted to ICCV 2021 and IEEE Transactions on Multimedia.

2021-05: Our CVPR 2020 work SceneSeg is set as the baseline for the ACM Multimedia 2021 Grand Challenge: Tencent Ads Algorithm Competition. Participate to win USD$100,000 for the first prize.

2020-07: MovieNet is online with an easy-to-use toolkit as a part of OpenMMLab.

2020-07: Three papers are accepted to ECCV 2020.

2020-02: One paper is accepted to CVPR 2020. Also appears at LUV 2020 (15-min talk) and Sight and Sound 2020 (5-min talk).

2020-01: HotFlip is included in AllenNLP and TextAttack

Selected Publication [Full List]

MagicPrompt: Ultra-Lightweight Prompt Tuning for Video Generation
Yinhan Zhang, Dingwei Tan, Xianghao Kong, Yue Ma, Yeying Jin, Anyi Rao
European Conference on Computer Vision (ECCV), 2026
[Paper] [Webpage]

FlexComposer: Unified Video Compositing from Images to Dynamic Footage with Flexible Trajectory Control
Songchun Zhang, Sitong Guo, Xianghao Kong, Pengwei Liu, Yuwei Guo, Lvmin Zhang, Anyi Rao
European Conference on Computer Vision (ECCV), 2026
[Paper] [Webpage]

UniVidX: A Unified Multimodal Framework for Versatile Video
Houyuan Chen, Hong Li, Xianghao Kong, Tianrui Zhu, Shaocong Xu, Weiqing Xiao, Yuwei Guo, Chongjie Ye, Lvmin Zhang, Hao Zhao, Anyi Rao
(SIGGRAPH), 2026
[Paper] [Webpage]

Composing Concepts from Images and Videos via Concept-prompt Binding
Xianghao Kong, Zeyu Zhang, Yuwei Guo, Zhuoran Zhao, Songchun Zhang, Anyi Rao
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2026 Highlight
[Paper] [Webpage]

DataSway: Vivifying Metaphoric Visualization with Animation Clip Generation and Coordination
Liwenhan Xie, Jiayi Zhou, Anyi Rao, Huamin Qu, Xinhuan Shu
ACM Conference on Designing Interactive Systems (DIS), 2026 Best Paper Award
[Paper] [Webpage]

Collaposer: Transforming Photo Collections into Visual Assets for Storytelling with Collages
Jiayi Zhou, Liwenhan Xie, Jiaju Ma, Zheng Wei, Huamin Qu, Anyi Rao
ACM Conference on Human Factors in Computing Systems (CHI), 2026
[Paper] [Webpage]

CineVision: An Interactive Pre-visualization Storyboard System for Director–Cinematographer Collaboration
Zheng Wei, Hongtao Wu, Lvmin Zhang, Xian Xu, Yefeng Zheng, Pan Hui, Maneesh Agrawala, Huamin Qu, Anyi Rao
User Interface Software and Technology (UIST), 2025
[Paper] [Webpage]

IC-Light: Scaling In-the-Wild Training for Diffusion-based Illumination Harmonization and Editing by Imposing Consistent Light Transport
Lvmin Zhang, Anyi Rao, Maneesh Agrawala
International Conference on Learning Representations (ICLR), 2025 Oral
[Paper] [Webpage] [Demo]

ScriptViz: A Visualization Tool to Aid Scriptwriting based on a Large Movie Database
Anyi Rao, Jean-Peïc Chou, Maneesh Agrawala
User Interface Software and Technology (UIST), 2024
[Paper] [Webpage]

AnimateDiff: Animate Your Personalized Text-to-Image Diffusion Models without Specific Tuning
Yuwei Guo, Ceyuan Yang, Anyi Rao, Zhengyang Liang, Yaohui Wang, Yu Qiao, Maneesh Agrawala, Dahua Lin, Bo Dai
International Conference on Learning Representations (ICLR), 2024 Spotlight
[Paper] [Webpage]

ControlNet: Adding Conditional Control to Text-to-Image Diffusion Models
Lvmin Zhang, Anyi Rao, Maneesh Agrawala
IEEE/CVF International Conference on Computer Vision (ICCV), 2023 Best Paper Award (Marr Prize)
[Paper] [Webpage] [Supplements] [V1] [V1.1] [A1111 WebUI]

Dynamic Storyboard Generation in an Engine-based Virtual Environments for Video Production
Anyi Rao*, Xuekun Jiang*, Yuwei Guo, Linning Xu, Lei Yang, Libiao Jin, Dahua Lin, Bo Dai
(SIGGRAPH) Poster, 2023
[Paper] [Webpage]

Shoot360: Normal View Video Creation from City Panorama Footage
Anyi Rao, Linning Xu, Dahua Lin
(SIGGRAPH), 2022
[Paper] [Webpage]

A Coarse-to-Fine Framework for Automatic Video Unscreen
Anyi Rao, Linning Xu, Zhizhong Li, Qingqiu Huang, Zhanghui Kuang, Wayne Zhang, Dahua Lin
IEEE Transactions on Multimedia, (TMM), 2022
[Paper] [Webpage]

A Local-to-Global Approach to Multi-modal Movie Scene Segmentation
Anyi Rao, Linning Xu, Yu Xiong, Guodong Xu, Qingqiu Huang, Bolei Zhou, Dahua Lin
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2020
LUV 2020 and Sight and Sound 2020 workshops Oral
[Paper] [Webpage]

A Unified Framework for Shot Type Classification Based on Subject Centric Lens
Anyi Rao, Jiaze Wang, Linning Xu, Xuekun Jiang, Qingqiu Huang, Bolei Zhou, Dahua Lin
European Conference on Computer Vision (ECCV), 2020
Video Turing Test 2020 Workshop Oral
[Paper] [Webpage]

MovieNet: A Holistic Dataset for Movie Understanding
Qingqiu Huang, Yu Xiong, Anyi Rao, Jiaze Wang, Dahua Lin
European Conference on Computer Vision (ECCV), 2020 (Spotlight)
[Paper] [Webpage]

Selected Awards and Grants

Microsoft Research Asia StarTrack Scholar	2026
Forbes 30 Under 30 Asia	2025
Tencent Rhino-Bird Grant	2025
AIS Support Funding for Interdisciplinary Research Collaboration	2025
Bridge Gap Funding	2025
Rising Star by World Artificial Intelligence Conference (WAIC)	2024
Talent Gift Funding	2024
Art Tech Funding	2024
Best Paper Award (Marr Prize) by International Conference on Computer Vision (ICCV)	2023
Magic Grant by The Brown Institute for Media Innovation	2023
Amazon Prime Video Gift Funding	2023

Tencnet, Cybever, and Pika Grant for SIGGRAPH Workshop Organization	2025
Adobe and Pika Grant for CVPR Workshop Organization	2024
Pika and KAUST Grant for ICCV Workshop Organization	2023
KAUST Grant for ECCV Workshop Organization	2022
Adobe Grant for ICCV Workshop Organization	2021
Hong Kong PhD Fellowship	2021
Most Influential Paper by Paper Digest	2020
Nanjing University Top-Grade Scholarship,the highest honor in the university	2018
SenseTime Scholarship, awarded to 30 students out of all AI major undergraduate students in China	2017
Provincial Merit Student awarded by the Jiangsu Province	2017
National Scholarship awarded by the China Ministry of Education	2015
Gold Medal in Invitational National Mathematical Olympiad	2013
Nanjing University Outstanding Student Leader Award	2015
Nanjing University Outstanding Student Award	2016
Nanjing University Top Volunteer Excellence Award	2015
Zhenggang Scholarship, top 40 students in Nanjing University	2016
Zhenggang Jingying Scholarship	2017
Nanjing University People Scholarship	2016
Nanjing University People Scholarship	2017
World ranking 32nd in 2016 Calculus World Cup	2016
Meritorious winner prize in the 2016 National Mathematical Contest in Modeling	2016
Best paper in the 2014 University Electronics Design Contest	2014

Talks

From Intention to Attention to Manifestation

Beijing Film Festival, 04/2026
Rome Square High-Level Forum, 01/2026
Hong Kong International AI Art Festival, 12/2025
SIGGRAPH Asia, 12/2025
World Cultural Forum, 11/2025
SIGGRAPH, 08/2025

AI for Local Video Dataset

Digital Entertainment Leadership Forum, 09/2025

Bridging the Representation Gap of Humans and Computers for Video Production

China Golden Rooster Film Festival, 11/2024
World Artificial Intelligence Conference (WAIC), 07/2024
Shanghai Television Magnolia Festival, 06/2024
University of Hong Kong (HKU), 05/2024
Hong Kong University of Science and Technology (HKUST), 05/2024
Stanford University, 04/2024
Hong Kong Shanghai AI Forum, 04/2024

Collaborative Intelligent Tools to Support Video Production

Symposium on AI Technologies and Their Implications, 03/2024
Adobe, 03/2024
CCF, 03/2024
Film School of HKBU, 05/2021

Creative Video Understanding, Editing and Generation

Art School of UTK, 02/2024
ICCV23 Workshop, Paris, France, 10/2022

Controllable Visual Content Generation to Unleash Creativity and Productivity

Netflix, 11/2023
Bay Area Vision Day (Stanford, Berkeley, Caltech), 09/2023

Temporal and Contextual Transformer for Multi-Camera Editing

ECCV22 Workshop, 10/2022

Multimodal Representation Learning

Meta, Redmond, WA, 12/2022
CVPR20 Workshop on Sight and Sound, 06/2020
CVPR20 Workshop on Learning from Unlabeled Videos, 06/2020

Cinematic Style Analysis Based on Subject Centric Lens

ECCV20 Workshop on Video Turing Test: Human-level Video Story Understanding, 08/2020

Press Coverage

2nd AI Film Festival: HKTKWW / CZTV

Coexistence of Artificial Intelligence and Humanity: HK01

Where AI Meets Humanity - A Scholar Democratizing Storytelling: HKUST

AI Film Production: i-CABLE News / HKUST

1st AI Film Festival: Associated Press / Yahoo / PR Newswire HKET Ta Kung Pao HK Economic Journal

Media/Entertainment Industry: China Golden Rooster Film Festival / Shanghai Television Magnolia Festival / China News

World AI Conference: Video Generation Forum / Rising Star

Art: Hidden Imagery In AI Art / Funky AI-generated Spiraling Medieval Village Captivates Social Media

Professional Activities

Area Chair for CVPR, ICLR, NeurIPS, WACV

Senior Program Committee Member for AAAI

Program Committee Member for SIGGRAPH Asia, UIST

Co-Chair for MMSys26 (Workshop), UIST25 (Registeration), CVM25 (Film), VINCI25 (Media Gallery), UIST24 (Registeration)

Advisor: 2026 Hong Kong The Second HKUST AI Film Festival,

Organizer: Workshop on Agentic AI for Visual Media at CVPR26

Organizer: Workshop on AI for Creative Video Editing and Understanding at SIGGRAPH26, CVPR26, SIGGRAPH Asia25, SIGGRAPH25, CVPR25, CVPR24, ICCV23, ECCV22, ICCV21

Curator: 2025 Hong Kong The First HKUST AI Film Festival, 2023 Paris ShortFest AI Film Festival.

Conference Reviewer: CVPR, ICCV, ECCV, ACCV, SIGGRAPH, SIGGRAPH Asia, Eurographics, CHI, UIST, MM, NeurIPS, ICML, ICLR, AAAI, IJCAI

Journal Reviewer: IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), IEEE Transactions on Multimedia (TMM), IEEE Transactions on Circuits and Systems for Video Technology (TCSVT), Transactions on Machine Learning Research (TMLR), International Journal of Computer Vision (IJCV), MIT Press Leonardo

Judge: Beijing International Film Festival AIGC Unit, The 3rd International Artificial Intelligence Fair

Art Experiences

Executive Producer of Echo, 11 Episodes of AI Short Films for HKUST 35th Anniversary

Executive Producer of But, AI Interactive Gaming Video

Technical Consultant of The Tale of the Peony, Best Film and Best Visual in Beijing International Film Festival AIGC Sector 2026

Advisor of The Meeting, AI Short Film, Gold Award at the AAAI 2026 CVM Workshop

Advisor of Ash Boat, AI Short Film, Best AI Feature at the Bali International AI Film Festival 2026

Exhibitor of Recursive Myth, Asian Digital Art Exhibition 2026

Research Experiences

Research Intern at Meta Reality Lab

Research Intern at Shanghai Artificial Intelligence Laboratory

Research Intern at SenseTime Research

Visitor at the University of Toronto and Vector Institute

Research Assistant at the Advanced Integration and Mining Lab, Eugene, OR, USA

Research Intern at University of Hong Kong, Hong Kong S.A.R.

Teaching Experiences

HKUST AMCC 5140 AI for Visual Arts and Creativity (AVAC)
HKUST AMCC 5150 Visual Computing for Visual Arts and Creativity (VCVAC)
HKUST AMCC 5250 Filmmaking with AI Innovations (FilmAI)
HKUST AMCC 5000 Creative Convergence: Foundations of Arts and Machine Creativity
HKUST AMCC 6950A Special Projects in Arts and Machine Creativity
HKUST EMIA 6950F Independent Study
HKUST EMIA 6500K Visual Computing for Visual Content Creation (VCVCC)
HKUST EMIA 6500H AI for Visual Content Creation (AIVCC)
SIGGRAPH Asia 2025 Course on Generative Models for Visual Content Editing and Creation
SIGGRAPH 2024 Course on Generative Models for Visual Content Editing and Creation
CCF 2024 Advanced Disciplines Lectures

Patents

A Video Generation Method, CN202210699177.X
A Video Editing Method and Related Program Products, CN202210691662.2
A Video Editing Method, CN202010694551.1
A Video Classification Method, CN202010694811.1
An Image Processing Method and Related Products, CN202010450801.3
A Zero-shot Action Recognition Method, CN202110821209.4
A Layout Generation Method, CN202111128