Yiran Xu


I'm a CS Ph.D. candidate at the University of Maryland, College Park working with Prof. Jia-Bin Huang. Before this, I earned my M.S. from UC San Diego under Prof. Nuno Vasconcelos, and my B.E. from South China University of Technology (SCUT) with Prof. Junbo Zhang.

My research interests are in computer vision and deep learning, focusing on generative models and their applications, especially in videos.

I've interned at Google DeepMind, Adobe Research and Snap Research, where I had the pleasure of collaborating with many

Bio CV Mail (Twitter) Scholar Github LinkedIn

Profile picture

Publications

VideoGigaGAN: Towards Detail-rich Video Super-Resolution
Yiran Xu, Taesung Park, Richard Zhang, Yang Zhou, Eli Shechtman, Feng Liu, Jia-Bin Huang, Difan Liu
arXiv, 2024
TL;DR: VideoGigaGAN is a Video Super-Resolution (VSR) method capable of upscaling videos by up to 8x with fine-grained details.
Project Page / Paper / Supplemental / Media Coverage /
@InProceedings{xu2024videogigagan, 
	author = {Yiran Xu and Taesung Park and Richard Zhang and Yang Zhou and Eli Shechtman and Feng Liu and Jia-Bin Huang and Difan Liu}, 
	title = {VideoGigaGAN: Towards Detail-rich Video Super-Resolution}, 
	booktitle = {arXiv}, 
	year = {2024}, 
}
Project image
Flash-Splat: 3D Reflection Removal with Flash Cues and Gaussian Splats
Mingyang Xie, Haoming Cai, Sachin Shah, Yiran Xu, Brandon Y Feng, Jia-Bin Huang, Christopher A Metzler
ECCV, 2024
TL;DR: Flash-Splat is a 3D reflection removal method that leverages flash cues and Gaussian splats for robust performance.
Project Page / Paper /
@InProceedings{xie2024flashsplat, 
	author = {Mingyang Xie and Haoming Cai and Sachin Shah and Yiran Xu and Brandon Y Feng and Jia-Bin Huang and Christopher A Metzler}, 
	title = {Flash-Splat: 3D Reflection Removal with Flash Cues and Gaussian Splats}, 
	booktitle = {ECCV}, 
	year = {2024}, 
}
In-N-Out: Faithful 3D GAN Inversion with Volumetric Decomposition for Face Editing
Yiran Xu, Zhixin Shu, Cameron Smith, Seoung Wug Oh, Jia-Bin Huang
CVPR, 2024
TL;DR: In-N-Out is a 3D GAN inversion method that decomposes the input into two components, enabling effective inversion for Out-of-Distribution (OOD) data.
Project Page / Paper / Supplemental /
@InProceedings{xu2024innout, 
	author = {Yiran Xu and Zhixin Shu and Cameron Smith and Seoung Wug Oh and Jia-Bin Huang}, 
	title = {In-N-Out: Faithful 3D GAN Inversion with Volumetric Decomposition for Face Editing}, 
	booktitle = {CVPR}, 
	year = {2024}, 
}
Project image
Dynamic Mesh-aware Radiance Fields
Yi-Ling Qiao, Alexander Gao, Yiran Xu, Yue Feng, Jia-Bin Huang, Ming Lin
ICCV, 2023
TL;DR: DMRF is a hybrid 3D representation that facilitates physical simulations based on NeRFs
Project Page / Paper /
@InProceedings{qiao2023dynamic, 
	author = {Yi-Ling Qiao and Alexander Gao and Yiran Xu and Yue Feng and Jia-Bin Huang and Ming Lin}, 
	title = {Dynamic Mesh-aware Radiance Fields}, 
	booktitle = {ICCV}, 
	year = {2023}, 
}
Project image
Text-driven Visual Synthesis with Latent Diffusion Prior
Ting-Hsuan Liao, Songwei Ge, Yiran Xu, Yao-Chih Lee, Badour AlBahar, Jia-Bin Huang
arXiv, 2023
TL;DR: We leverage latent diffusion priors for multiple text-driven visual synthesis tasks.
Project Page / Paper /
@InProceedings{liao2023text, 
	author = {Ting-Hsuan Liao and Songwei Ge and Yiran Xu and Yao-Chih Lee and Badour AlBahar and Jia-Bin Huang}, 
	title = {Text-driven Visual Synthesis with Latent Diffusion Prior}, 
	booktitle = {arXiv}, 
	year = {2023}, 
}
Project image
Temporally Consistent Semantic Video Editing
Yiran Xu, Badour AlBahar, Jia-Bin Huang
ECCV, 2022
TL;DR: We propose a flow-based video editing method that enables semantic manipulation with temporal consistency.
Project Page / Paper /
@InProceedings{xu2022temporally, 
	author = {Yiran Xu and Badour AlBahar and Jia-Bin Huang}, 
	title = {Temporally Consistent Semantic Video Editing}, 
	booktitle = {ECCV}, 
	year = {2022}, 
}
Project image
Explainable Object-Induced Action Decision for Autonomous Vehicles
Yiran Xu, Xiaoyin Yang, Lihang Gong, Hsuan-Chu Lin, Tz-Ying Wu, Yunsheng Li, Nuno Vasconcelos
CVPR, 2020
TL;DR: We propose an explainable decision-making framework for autonomous vehicles.
Project Page / Paper /
@InProceedings{xu2020explainable, 
	author = {Yiran Xu and Xiaoyin Yang and Lihang Gong and Hsuan-Chu Lin and Tz-Ying Wu and Yunsheng Li and Nuno Vasconcelos}, 
	title = {Explainable Object-Induced Action Decision for Autonomous Vehicles}, 
	booktitle = {CVPR}, 
	year = {2020}, 
}

Service

Reviewer for CVPR'2022-2024, ICCV'2021-2023, ECCV'2022-2024, NeurIPS'2023, ICLR'2024, ICML'2024, WACV'2023-2024, ACCV'2024.

Miscellaneous

My Chinese name is 许亦冉. I was born in Hengyang, China, also known as the “Wild Goose City” (雁城). According to local tales, geese migrating south never go beyond Hengyang because they find the warmest weather here.

I have two cats, Bai (白) and Chelle (锈). They are important members who keep me company during my PhD journey.

I started working on computer vision because I wanted to build a makeup machine for my wife. Well...maybe not that simple.

I'm also a newbie in photography. Feel free to check out some of my photos!