Picture for Hao Zhang

Hao Zhang

refer to the report for detailed contributions

Bandwidth-Adaptive Spatiotemporal Correspondence Identification for Collaborative Perception

Add code
Feb 17, 2025
Figure 1 for Bandwidth-Adaptive Spatiotemporal Correspondence Identification for Collaborative Perception
Figure 2 for Bandwidth-Adaptive Spatiotemporal Correspondence Identification for Collaborative Perception
Figure 3 for Bandwidth-Adaptive Spatiotemporal Correspondence Identification for Collaborative Perception
Figure 4 for Bandwidth-Adaptive Spatiotemporal Correspondence Identification for Collaborative Perception
Viaarxiv icon

Articulate That Object Part (ATOP): 3D Part Articulation from Text and Motion Personalization

Add code
Feb 11, 2025
Figure 1 for Articulate That Object Part (ATOP): 3D Part Articulation from Text and Motion Personalization
Figure 2 for Articulate That Object Part (ATOP): 3D Part Articulation from Text and Motion Personalization
Figure 3 for Articulate That Object Part (ATOP): 3D Part Articulation from Text and Motion Personalization
Figure 4 for Articulate That Object Part (ATOP): 3D Part Articulation from Text and Motion Personalization
Viaarxiv icon

Efficient-vDiT: Efficient Video Diffusion Transformers With Attention Tile

Add code
Feb 10, 2025
Figure 1 for Efficient-vDiT: Efficient Video Diffusion Transformers With Attention Tile
Figure 2 for Efficient-vDiT: Efficient Video Diffusion Transformers With Attention Tile
Figure 3 for Efficient-vDiT: Efficient Video Diffusion Transformers With Attention Tile
Figure 4 for Efficient-vDiT: Efficient Video Diffusion Transformers With Attention Tile
Viaarxiv icon

LMS-Net: A Learned Mumford-Shah Network For Few-Shot Medical Image Segmentation

Add code
Feb 08, 2025
Figure 1 for LMS-Net: A Learned Mumford-Shah Network For Few-Shot Medical Image Segmentation
Figure 2 for LMS-Net: A Learned Mumford-Shah Network For Few-Shot Medical Image Segmentation
Figure 3 for LMS-Net: A Learned Mumford-Shah Network For Few-Shot Medical Image Segmentation
Figure 4 for LMS-Net: A Learned Mumford-Shah Network For Few-Shot Medical Image Segmentation
Viaarxiv icon

3DMolFormer: A Dual-channel Framework for Structure-based Drug Discovery

Add code
Feb 07, 2025
Figure 1 for 3DMolFormer: A Dual-channel Framework for Structure-based Drug Discovery
Figure 2 for 3DMolFormer: A Dual-channel Framework for Structure-based Drug Discovery
Figure 3 for 3DMolFormer: A Dual-channel Framework for Structure-based Drug Discovery
Figure 4 for 3DMolFormer: A Dual-channel Framework for Structure-based Drug Discovery
Viaarxiv icon

Fast Video Generation with Sliding Tile Attention

Add code
Feb 06, 2025
Figure 1 for Fast Video Generation with Sliding Tile Attention
Figure 2 for Fast Video Generation with Sliding Tile Attention
Figure 3 for Fast Video Generation with Sliding Tile Attention
Figure 4 for Fast Video Generation with Sliding Tile Attention
Viaarxiv icon

Predicting Compact Phrasal Rewrites with Large Language Models for ASR Post Editing

Add code
Jan 23, 2025
Figure 1 for Predicting Compact Phrasal Rewrites with Large Language Models for ASR Post Editing
Figure 2 for Predicting Compact Phrasal Rewrites with Large Language Models for ASR Post Editing
Figure 3 for Predicting Compact Phrasal Rewrites with Large Language Models for ASR Post Editing
Figure 4 for Predicting Compact Phrasal Rewrites with Large Language Models for ASR Post Editing
Viaarxiv icon

Hunyuan3D 2.0: Scaling Diffusion Models for High Resolution Textured 3D Assets Generation

Add code
Jan 21, 2025
Figure 1 for Hunyuan3D 2.0: Scaling Diffusion Models for High Resolution Textured 3D Assets Generation
Figure 2 for Hunyuan3D 2.0: Scaling Diffusion Models for High Resolution Textured 3D Assets Generation
Figure 3 for Hunyuan3D 2.0: Scaling Diffusion Models for High Resolution Textured 3D Assets Generation
Figure 4 for Hunyuan3D 2.0: Scaling Diffusion Models for High Resolution Textured 3D Assets Generation
Viaarxiv icon

OpenMLDB: A Real-Time Relational Data Feature Computation System for Online ML

Add code
Jan 15, 2025
Figure 1 for OpenMLDB: A Real-Time Relational Data Feature Computation System for Online ML
Figure 2 for OpenMLDB: A Real-Time Relational Data Feature Computation System for Online ML
Figure 3 for OpenMLDB: A Real-Time Relational Data Feature Computation System for Online ML
Figure 4 for OpenMLDB: A Real-Time Relational Data Feature Computation System for Online ML
Viaarxiv icon

Spatiotemporal Gaussian Optimization for 4D Cone Beam CT Reconstruction from Sparse Projections

Add code
Jan 07, 2025
Figure 1 for Spatiotemporal Gaussian Optimization for 4D Cone Beam CT Reconstruction from Sparse Projections
Figure 2 for Spatiotemporal Gaussian Optimization for 4D Cone Beam CT Reconstruction from Sparse Projections
Figure 3 for Spatiotemporal Gaussian Optimization for 4D Cone Beam CT Reconstruction from Sparse Projections
Figure 4 for Spatiotemporal Gaussian Optimization for 4D Cone Beam CT Reconstruction from Sparse Projections
Viaarxiv icon