I'm a research scientist at Adobe Research. I received my Ph.D. from the University of Maryland, College Park, where I was advised by Prof. Jia-Bin Huang.
My research interests are in computer vision and deep learning, focusing on generative models and their applications, especially in videos.
I've interned at Google DeepMind,
Adobe Research
and Snap Research,
where I had the pleasure of collaborating with many
Google DeepMind (2024):
Feng Yang,
Yinxiao Li,
Luciano Sbaiz,
Junjie Ke,
Miaosen Wang,
Hang Qi,
Han Zhang,
Jose Lezama,
Ming-Hsuan Yang,
Irfan Essa,
Jesse Berent.
Adobe Research (2023):
Difan Liu,
Taesung Park,
Richard Zhang,
Yang Zhou,
Eli Shechtman,
Feng Liu.
Adobe Research (2022):
Seoung Wug Oh,
Zhixin Shu,
Cameron Smith.
Snap Research (2021):
Jian Ren,
Zeng Huang,
Menglei Chai,
Kyle Olszewski,
Hsin-Ying Lee,
Sergey Tulyakov.
@article{hong2025relicinteractivevideoworld,
author = {Yicong Hong* and Yiqun Mei* and Chongjian Ge* and Yiran Xu and Yang Zhou and Sai Bi and Yannick Hold-Geoffroy and Mike Roberts and Matthew Fisher and Eli Shechtman and Kalyan Sunkavalli and Feng Liu and Zhengqi Li and Hao Tan},
title = {RELIC: Interactive Video World Model with Long-Horizon Memory},
journal = {Technical Report},
year = {2025},
}@article{kim2025rethinking,
author = {Subin Kim and Sangwoo Mo and Mamshad Nayeem Rizve and Yiran Xu and Difan Liu and Jinwoo Shin and Tobias Hinz},
title = {Rethinking Prompt Design for Inference-time Scaling in Text-to-Visual Generation},
journal = {arXiv},
year = {2025},
}@article{liao2025pad3r,
author = {Ting-Hsuan Liao and Haowen Liu and Yiran Xu and Songwei Ge and Gengshan Yang and Jia-Bin Huang},
title = {PAD3R: Pose-Aware Dynamic 3D Reconstruction from Casual Videos},
journal = {SIGGRAPH ASIA},
year = {2025},
}@inproceedings{xu2024videogigagan,
author = {Yiran Xu and Taesung Park and Richard Zhang and Yang Zhou and Eli Shechtman and Feng Liu and Jia-Bin Huang and Difan Liu},
title = {VideoGigaGAN: Towards Detail-rich Video Super-Resolution},
booktitle = {CVPR},
year = {2025},
}
@article{xu2025halo,
author = {Yiran Xu and Siqi Xie and Zhuofang Li and Harris Shadmany and Yinxiao Li and Luciano Sbaiz and Miaosen Wang and Junjie Ke and Jose Lezama and Hang Qi and Han Zhang and Jesse Berent and Ming-Hsuan Yang and Irfan Essa and Jia-Bin Huang and Feng Yang},
title = {HALO: Human-Aligned End-to-end Image Retargeting with Layered Transformations},
journal = {arXiv},
year = {2025},
}
@inproceedings{lee2025cropper,
author = {Seung Hyun Lee* and Jijun Jiang* and Yiran Xu* and Zhuofang Li* and Junjie Ke and Yinxiao Li and Junfeng He and Steven Hickson and Katie Datsenko and Sangpil Kim and Ming-Hsuan Yang and Irfan Essa and Feng Yang},
title = {Cropper: Vision-Language Model for Image Cropping through In-Context Learning},
booktitle = {CVPR},
year = {2025},
}
@inproceedings{xie2024flashsplat,
author = {Mingyang Xie and Haoming Cai and Sachin Shah and Yiran Xu and Brandon Y Feng and Jia-Bin Huang and Christopher A Metzler},
title = {Flash-Splat: 3D Reflection Removal with Flash Cues and Gaussian Splats},
booktitle = {ECCV},
year = {2024},
}@inproceedings{xu2024innout,
author = {Yiran Xu and Zhixin Shu and Cameron Smith and Seoung Wug Oh and Jia-Bin Huang},
title = {In-N-Out: Faithful 3D GAN Inversion with Volumetric Decomposition for Face Editing},
booktitle = {CVPR},
year = {2024},
}
@inproceedings{qiao2023dynamic,
author = {Yi-Ling Qiao and Alexander Gao and Yiran Xu and Yue Feng and Jia-Bin Huang and Ming Lin},
title = {Dynamic Mesh-aware Radiance Fields},
booktitle = {ICCV},
year = {2023},
}
@article{liao2023text,
author = {Ting-Hsuan Liao and Songwei Ge and Yiran Xu and Yao-Chih Lee and Badour AlBahar and Jia-Bin Huang},
title = {Text-driven Visual Synthesis with Latent Diffusion Prior},
journal = {arXiv},
year = {2023},
}
@inproceedings{xu2022temporally,
author = {Yiran Xu and Badour AlBahar and Jia-Bin Huang},
title = {Temporally Consistent Semantic Video Editing},
booktitle = {ECCV},
year = {2022},
}
@inproceedings{xu2020explainable,
author = {Yiran Xu and Xiaoyin Yang and Lihang Gong and Hsuan-Chu Lin and Tz-Ying Wu and Yunsheng Li and Nuno Vasconcelos},
title = {Explainable Object-Induced Action Decision for Autonomous Vehicles},
booktitle = {CVPR},
year = {2020},
}My Chinese name is 许亦冉. I was born in Hengyang, China, also known as the “Wild Goose City” (雁城). According to local tales, geese migrating south never go beyond Hengyang because they find the warmest weather here.
I have two cats, Bai (白) and Chelle (锈). They are important members who keep me company during my PhD journey.
I started working on computer vision because I wanted to build a makeup machine for my wife. Well...maybe not that simple.
I'm also a newbie in photography. Feel free to check out some of my photos!