Alert button
Picture for Weiming Hu

Weiming Hu

Alert button

STAG4D: Spatial-Temporal Anchored Generative 4D Gaussians

Add code
Bookmark button
Alert button
Mar 22, 2024
Yifei Zeng, Yanqin Jiang, Siyu Zhu, Yuanxun Lu, Youtian Lin, Hao Zhu, Weiming Hu, Xun Cao, Yao Yao

Viaarxiv icon

BEV2PR: BEV-Enhanced Visual Place Recognition with Structural Cues

Add code
Bookmark button
Alert button
Mar 11, 2024
Fudong Ge, Yiwei Zhang, Shuhan Shen, Yue Wang, Weiming Hu, Jin Gao

Figure 1 for BEV2PR: BEV-Enhanced Visual Place Recognition with Structural Cues
Figure 2 for BEV2PR: BEV-Enhanced Visual Place Recognition with Structural Cues
Figure 3 for BEV2PR: BEV-Enhanced Visual Place Recognition with Structural Cues
Figure 4 for BEV2PR: BEV-Enhanced Visual Place Recognition with Structural Cues
Viaarxiv icon

PromptIQA: Boosting the Performance and Generalization for No-Reference Image Quality Assessment via Prompts

Add code
Bookmark button
Alert button
Mar 08, 2024
Zewen Chen, Haina Qin, Juan Wang, Chunfeng Yuan, Bing Li, Weiming Hu, Liang Wang

Figure 1 for PromptIQA: Boosting the Performance and Generalization for No-Reference Image Quality Assessment via Prompts
Figure 2 for PromptIQA: Boosting the Performance and Generalization for No-Reference Image Quality Assessment via Prompts
Figure 3 for PromptIQA: Boosting the Performance and Generalization for No-Reference Image Quality Assessment via Prompts
Figure 4 for PromptIQA: Boosting the Performance and Generalization for No-Reference Image Quality Assessment via Prompts
Viaarxiv icon

Semantics-enhanced Cross-modal Masked Image Modeling for Vision-Language Pre-training

Add code
Bookmark button
Alert button
Mar 01, 2024
Haowei Liu, Yaya Shi, Haiyang Xu, Chunfeng Yuan, Qinghao Ye, Chenliang Li, Ming Yan, Ji Zhang, Fei Huang, Bing Li, Weiming Hu

Figure 1 for Semantics-enhanced Cross-modal Masked Image Modeling for Vision-Language Pre-training
Figure 2 for Semantics-enhanced Cross-modal Masked Image Modeling for Vision-Language Pre-training
Figure 3 for Semantics-enhanced Cross-modal Masked Image Modeling for Vision-Language Pre-training
Figure 4 for Semantics-enhanced Cross-modal Masked Image Modeling for Vision-Language Pre-training
Viaarxiv icon

Unifying Latent and Lexicon Representations for Effective Video-Text Retrieval

Add code
Bookmark button
Alert button
Feb 26, 2024
Haowei Liu, Yaya Shi, Haiyang Xu, Chunfeng Yuan, Qinghao Ye, Chenliang Li, Ming Yan, Ji Zhang, Fei Huang, Bing Li, Weiming Hu

Viaarxiv icon

GMC-IQA: Exploiting Global-correlation and Mean-opinion Consistency for No-reference Image Quality Assessment

Add code
Bookmark button
Alert button
Jan 19, 2024
Zewen Chen, Juan Wang, Bing Li, Chunfeng Yuan, Weiming Hu, Junxian Liu, Peng Li, Yan Wang, Youqun Zhang, Congxuan Zhang

Viaarxiv icon

I2V-Adapter: A General Image-to-Video Adapter for Video Diffusion Models

Add code
Bookmark button
Alert button
Dec 27, 2023
Xun Guo, Mingwu Zheng, Liang Hou, Yuan Gao, Yufan Deng, Chongyang Ma, Weiming Hu, Zhengjun Zha, Haibin Huang, Pengfei Wan, Di Zhang

Viaarxiv icon

Set Prediction Guided by Semantic Concepts for Diverse Video Captioning

Add code
Bookmark button
Alert button
Dec 25, 2023
Yifan Lu, Ziqi Zhang, Chunfeng Yuan, Peng Li, Yan Wang, Bing Li, Weiming Hu

Viaarxiv icon

ZoomTrack: Target-aware Non-uniform Resizing for Efficient Visual Tracking

Add code
Bookmark button
Alert button
Oct 16, 2023
Yutong Kou, Jin Gao, Bing Li, Gang Wang, Weiming Hu, Yizheng Wang, Liang Li

Figure 1 for ZoomTrack: Target-aware Non-uniform Resizing for Efficient Visual Tracking
Figure 2 for ZoomTrack: Target-aware Non-uniform Resizing for Efficient Visual Tracking
Figure 3 for ZoomTrack: Target-aware Non-uniform Resizing for Efficient Visual Tracking
Figure 4 for ZoomTrack: Target-aware Non-uniform Resizing for Efficient Visual Tracking
Viaarxiv icon

RT-MonoDepth: Real-time Monocular Depth Estimation on Embedded Systems

Add code
Bookmark button
Alert button
Aug 21, 2023
Cheng Feng, Zhen Chen, Congxuan Zhang, Weiming Hu, Bing Li, Feng Lu

Figure 1 for RT-MonoDepth: Real-time Monocular Depth Estimation on Embedded Systems
Figure 2 for RT-MonoDepth: Real-time Monocular Depth Estimation on Embedded Systems
Figure 3 for RT-MonoDepth: Real-time Monocular Depth Estimation on Embedded Systems
Figure 4 for RT-MonoDepth: Real-time Monocular Depth Estimation on Embedded Systems
Viaarxiv icon