Hongsheng Li

StreamChat: Chatting with Streaming Video

Dec 11, 2024

FreeSim: Toward Free-viewpoint Camera Simulation in Driving Scenes

Dec 04, 2024

TimeWalker: Personalized Neural Space for Lifelong Head Avatars

Dec 03, 2024

Revisiting Generative Policies: A Simpler Reinforcement Learning Algorithmic Perspective

Dec 02, 2024

BlueLM-V-3B: Algorithm and System Co-Design for Multimodal Large Language Models on Mobile Devices

Nov 16, 2024

ZOPP: A Framework of Zero-shot Offboard Panoptic Perception for Autonomous Driving

Nov 08, 2024

A Global Depth-Range-Free Multi-View Stereo Transformer Network with Pose Embedding

Nov 04, 2024

BlinkVision: A Benchmark for Optical Flow, Scene Flow and Point Tracking Estimation using RGB Frames and Events

Oct 27, 2024

Stable Consistency Tuning: Understanding and Improving Consistency Models

Oct 24, 2024

PUMA: Empowering Unified MLLM with Multi-granular Visual Generation

Oct 17, 2024