FULL PUBLICATAION LIST

Cinematic Behavior Transfer via NeRF-based Differentiable Filming
Anyi Rao, Xuekun Jiang*, Jingbo Wang, Dahua Lin, Bo Dai
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2024
[Paper] [Webpage]

Adding Conditional Control to Text-to-Image Diffusion Models
Lvmin Zhang, Anyi Rao, Maneesh Agrawala
IEEE/CVF International Conference on Computer Vision (ICCV), 2023 Marr Prize: Best Paper Award
[Paper] [Webpage]

Dynamic Storyboard Generation in an Engine-based Virtual Environments for Video Production
Anyi Rao, Xuekun Jiang, Yuwei Guo, Linning Xu, Lei Yang, Libiao Jin, Dahua Lin, Bo Dai
ACM Special Interest Group on Computer Graphics and Interactive Techniques Conference (SIGGRAPH), Poster, 2023
[Paper] [Webpage]

Shoot360: Normal View Video Creation from City Panorama Footage
Anyi Rao, Linning Xu, Dahua Lin
ACM Special Interest Group on Computer Graphics and Interactive Techniques Conference (SIGGRAPH), 2022
[Paper] [Webpage]

A Coarse-to-Fine Framework for Automatic Video Unscreen
Anyi Rao, Linning Xu, Zhizhong Li, Qingqiu Huang, Zhanghui Kuang, Wayne Zhang, Dahua Lin
IEEE Transactions on Multimedia (TMM), 2022
[Paper] [Webpage]

Temporal and Contextual Transformer for Multi-Camera Editing of TV Shows
Anyi Rao, Xuekun Jiang, Sichen Wang, Yuwei Guo, Zihao Liu, Bo Dai, Long Pang, Xiaoyu Wu, Dahua Lin, Libiao Jin
European Conference on Computer Vision (ECCVW), 2022
[Paper] [Webpage]

A Local-to-Global Approach to Multi-modal Movie Scene Segmentation
Anyi Rao, Linning Xu, Yu Xiong, Guodong Xu, Qingqiu Huang, Bolei Zhou, Dahua Lin
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2020
[Paper] [Webpage] [Code]

A Unified Framework for Shot Type Classification Based on Subject Centric Lens
Anyi Rao, Jiaze Wang, Linning Xu, Xuekun Jiang, Qingqiu Huang, Bolei Zhou, Dahua Lin
European Conference on Computer Vision (ECCV), 2020
[Paper] [Webpage]

Online Multi-modal Person Search in Videos
Jiayue Xia, Anyi Rao+(corresponding), Linning Xu, Qingqiu Huang, Dahua Lin
European Conference on Computer Vision (ECCV), 2020
[Paper] [Webpage]

Jointly Learning the Attributes and Composition of Shots for Boundary Detection in Videos
Xuekun Jiang, Libiao Jin, Anyi Rao+(corresponding), Linning Xu, Dahua Lin
IEEE Transactions on Multimedia (TMM), 2021
[Paper] [Webpage]

AnimateDiff: Animate Your Personalized Text-to-Image Diffusion Models without Specific Tuning
Yuwei Guo, Ceyuan Yang, Anyi Rao, Zhengyang Liang, Yaohui Wang, Yu Qiao, Maneesh Agrawala, Dahua Lin, Bo Dai
International Conference on Learning Representations (ICLR), 2024
[Paper] [Webpage] [Code]

Automated Conversion of Music Videos into Lyric Videos
Jiaju Ma, Anyi Rao, Li-Yi Wei, Rubaiat Habib Kazi, Hijung Valentina Shin, Maneesh Agrawala
User Interface Software and Technology (UIST), 2023
[Paper] [Webpage]

Zero-shot Skeleton-based Action Recognition via Mutual Information Estimation and Maximization
Yujie Zhou, Wenwen Qiang, Anyi Rao, Ning Lin, Bing Su, Jiaqi Wang
ACM International Conference on Multimedia (ACM MM), 2023
[Paper] [Webpage]

HireVAE: An Online and Adaptive Factor Model Based on Hierarchical and Regime-Switch VAE
Zikai Wei, Anyi Rao, Bo Dai, Dahua Lin
International Joint Conference on Artificial Intelligence (IJCAI), 2023
[Paper] [Code]

Self-supervised Action Representation Learning from Partial Spatio-Temporal Skeleton Sequences
Yujie Zhou, Haodong Duan, Anyi Rao, Bing Su, Jiaqi Wang
AAAI Conference on Artificial Intelligence (AAAI), 2023 Oral
[Paper] [Code]

BungeeNeRF: Progressive Neural Radiance Field for Extreme Multi-scale Scene Rendering
Yuanbo Xiangli, Linning Xu, Xingang Pan, Nanxuan Zhao, Anyi Rao, Christian Theobalt, Bo Dai, Dahua Lin
European Conference on Computer Vision (ECCV), 2022
[Paper] [Webpage] [Supplements] [Code]

AutoGPart: Intermediate Supervision Search for Generalizable 3D Part Segmentation
Xueyi Liu, Xiaomeng Xu, Anyi Rao, Chuang Gan, Li Yi
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2022
[Paper] [Webpage] [Code]

BlockPlanner: City Block Generation with Vectorized Graph Representation
Linning Xu, Yuanbo Xiangli, Anyi Rao, Nanxuan Zhao, Bo Dai, Ziwei Liu, Dahua Lin
IEEE/CVF International Conference on Computer Vision (ICCV), 2021
[Paper] [Webpage] [Supplements]

MovieNet: A Holistic Dataset for Movie Understanding
Qingqiu Huang, Yu Xiong, Anyi Rao, Jiaze Wang, Dahua Lin
European Conference on Computer Vision (ECCV), 2020 Spotlight
[Paper] [Webpage] [Supplements] [Code]

HotFlip: White-Box Adversarial Examples for Text Classification
Javid Ebrahimi, Anyi Rao, Daniel Lowd, Dejing Dou
Annual Meeting of the Association for Computational Linguistics (ACL), 2018
[Paper] [Poster] [AllenNLP] [OpenAttack] [Code]

Automatic Music Accompaniment
Anyi Rao, Francis Lau
In arXiv, 2018
[Paper] [Presentation]