Research internship positions: if you are interested in large 3D or video generative models, please drop me an email.
V3D: Video Diffusion Models are Effective 3D Generators
Zilong Chen, Yikai Wang#, Feng Wang, Zhengyi Wang, Huaping Liu
Preprint, 2024
[arxiv] [webpage] [code] [bibtex]
AnimatableDreamer: Text-Guided Non-rigid 3D Model Generation and Reconstruction with Canonical Score Distillation
Xinzhou Wang, Yikai Wang#, Junliang Ye, Zhengyi Wang, Fuchun Sun, Pengkun Liu, Ling Wang, et al
Preprint, 2024
[arxiv] [webpage] [bibtex]
CRM: Single Image to 3D Textured Mesh with Convolutional Reconstruction Model
Zhengyi Wang, Yikai Wang, Yifei Chen, Chendong Xiang, Shuo Chen, Dajiang Yu, Chongxuan Li, Hang Su, Jun Zhu
Preprint, 2024
[arxiv] [webpage] [bibtex]
Equivariant Local Reference Frames for Unsupervised Non-rigid Point Cloud Shape Correspondence
Ling Wang, Runfa Chen, Yikai Wang#, Fuchun Sun, Xinzhou Wang, Sun Kai, Guangyuan Fu, Jianwei Zhang, Wenbing Huang
Preprint, 2024
[arxiv] [bibtex]
Isotropic3D: Image-to-3D Generation Based on a Single CLIP Embedding
Pengkun Liu, Yikai Wang#, Fuchun Sun, Jiafang Li, Hang Xiao, Hongxiang Xue, Xinzhou Wang
Preprint, 2024
[arxiv] [webpage] [code] [bibtex]
FlexiDreamer: Single Image-to-3D Generation with FlexiCubes
Ruowen Zhao, Zhengyi Wang, Yikai Wang, Zihan Zhou, Jun Zhu
Preprint, 2024
[arxiv] [webpage] [bibtex]
DreamReward: Aligning Human Preference in Text-to-3D Generation
Junliang Ye, Fangfu Liu, Qixiu Li, Zhengyi Wang, Yikai Wang, Xinzhou Wang, Yueqi Duan, Jun Zhu
Preprint, 2024
[arxiv] [webpage] [code] [bibtex]
Text-to-3D using Gaussian Splatting
Zilong Chen, Feng Wang, Yikai Wang, Huaping Liu
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2024
[arxiv] [webpage] [code] [bibtex]
Small Scale Data-Free Knowledge Distillation
He Liu*, Yikai Wang*, Huaping Liu, Fuchun Sun, Anbang Yao
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2024
GaussianEditor: Swift and Controllable 3D Editing with Gaussian Splatting
Yiwen Chen, Zilong Chen, Chi Zhang, Feng Wang, Xiaofeng Yang, Yikai Wang, et al
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2024
[paper] [arxiv] [webpage] [code] [bibtex]
InstructPix2NeRF: Instructed 3D Portrait Editing from a Single Image
Jianhui Li, Shilong Liu, Zidong Liu, Yikai Wang, Kaiwen Zheng, Jinghui Xu, Jianmin Li, Jun Zhu
International Conference on Learning Representations (ICLR), 2024
[paper] [arxiv] [code] [bibtex]
ProlificDreamer: High-Fidelity and Diverse Text-to-3D Generation with Variational Score Distillation
Zhengyi Wang, Cheng Lu, Yikai Wang, Fan Bao, Chongxuan Li, Hang Su, Jun Zhu
Advances in Neural Information Processing Systems (NeurIPS), Spotlight, 2023
[paper] [arxiv] [webpage] [code] [bibtex]
Root Pose Decomposition Towards Generic Non-rigid 3D Reconstruction with Monocular Videos
Yikai Wang, Yinpeng Dong, Fuchun Sun, Xiao Yang
International Conference on Computer Vision (ICCV), 2023
[paper] [arxiv] [bibtex]
Compacting Binary Neural Networks by Sparse Kernel Selection
Yikai Wang, Wenbing Huang, Yinpeng Dong, Fuchun Sun, Anbang Yao
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2023
[paper] [arxiv] [slides] [bibtex]
Towards Effective Adversarial Textured 3D Meshes on Physical Face Recognition
Xiao Yang, Chang Liu, Longlong Xu, Yikai Wang, Yinpeng Dong, et al
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Highlight, 2023
[paper] [arxiv] [code] [bibtex]
Benchmarking Robustness of 3D Object Detection to Common Corruptions in Autonomous Driving
Yinpeng Dong, Caixin Kang, Jinlai Zhang, Zijian Zhu, Yikai Wang, Xiao Yang, et al
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2023
[paper] [arxiv] [code] [slides] [bibtex]
Channel Exchanging Networks for Multimodal and Multitask Dense Image Prediction
Yikai Wang, Fuchun Sun, Wenbing Huang, Fengxiang He, Dacheng Tao
IEEE Transaction on Pattern Analysis and Machine Intelligence (TPAMI), 2023
[paper] [arxiv] [code] [slides] [bibtex]
Multimodal Token Fusion for Vision Transformers
Yikai Wang, Xinghao Chen, Lele Cao, Wenbing Huang, Fuchun Sun, Yunhe Wang
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2022
[paper] [arxiv] [code] [slides] [bibtex]
Bridged Transformer for Vision and Point Cloud 3D Object Detection
Yikai Wang, Tengqi Ye, Lele Cao, Wenbing Huang, Fuchun Sun, Fengxiang He, Dacheng Tao
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2022
[paper] [arxiv] [slides] [bibtex]
Sound Adversarial Audio-visual Navigation
Yinfeng Yu, Wenbing Huang, Fuchun Sun, Changan Chen, Yikai Wang, Xiaolong Liu
International Conference on Learning Representations (ICLR), 2022
[paper] [arxiv] [code] [slides] [bibtex]
Fine-grained Multi-level Fusion for Anti-occlusion Monocular 3D Object Detection
He Liu, Huaping Liu, Yikai Wang, Fuchun Sun, Wenbing Huang
IEEE Transactions on Image Processing (TIP), 2022
[paper] [bibtex]
Sub-bit Neural Networks: Learning to Compress and Accelerate Binary Neural Networks
Yikai Wang, Yi Yang, Fuchun Sun, Anbang Yao
International Conference on Computer Vision (ICCV), 2021
[paper] [arxiv] [code] [slides] [bibtex]
Elastic Tactile Simulation Towards Tactile-visual Perception
Yikai Wang, Wenbing Huang, Bin Fang, Fuchun Sun, Chang Li
ACM International Conference on Multimedia (MM), Oral, 2021
[paper] [arxiv] [code] [slides] [bibtex]
Deep Multimodal Fusion by Channel Exchanging
Yikai Wang, Wenbing Huang, Fuchun Sun, Tingyang Xu, Yu Rong, Junzhou Huang
Advances in Neural Information Processing Systems (NeurIPS), 2020
[paper] [arxiv] [code] [slides] [bibtex]
Resolution Switchable Networks for Runtime Efficient Image Recognition
Yikai Wang, Fuchun Sun, Duo Li, Anbang Yao
European Conference on Computer Vision (ECCV), 2020
[paper] [arxiv] [code] [slides] [bibtex]
Learning Deep Multimodal Feature Representation with Asymmetric Multi-layer Fusion
Yikai Wang, Fuchun Sun, Ming Lu, Anbang Yao
ACM International Conference on Multimedia (MM), 2020
[paper] [arxiv] [code] [slides] [bibtex]
Regularized Adversarial Sampling and Deep Time-aware Attention for Click-through Rate Prediction
Yikai Wang*, Liang Zhang*, Quanyu Dai, Fuchun Sun, et al
ACM International Conference on Information and Knowledge Management (CIKM), 2020
[paper] [arxiv] [slides] [bibtex]