Picture for Tiejun Huang

Tiejun Huang

OmniUMI: Towards Physically Grounded Robot Learning via Human-Aligned Multimodal Interaction

Add code
Apr 12, 2026
Viaarxiv icon

Ge$^\text{2}$mS-T: Multi-Dimensional Grouping for Ultra-High Energy Efficiency in Spiking Transformer

Add code
Apr 10, 2026
Viaarxiv icon

BAAI Cardiac Agent: An intelligent multimodal agent for automated reasoning and diagnosis of cardiovascular diseases from cardiac magnetic resonance imaging

Add code
Apr 05, 2026
Viaarxiv icon

Brain-Inspired Multimodal Spiking Neural Network for Image-Text Retrieval

Add code
Mar 25, 2026
Viaarxiv icon

General Self-Prediction Enhancement for Spiking Neurons

Add code
Jan 29, 2026
Viaarxiv icon

RoboBrain 2.5: Depth in Sight, Time in Mind

Add code
Jan 20, 2026
Viaarxiv icon

On-the-fly Large-scale 3D Reconstruction from Multi-Camera Rigs

Add code
Dec 09, 2025
Viaarxiv icon

Driving in Spikes: An Entropy-Guided Object Detector for Spike Cameras

Add code
Nov 19, 2025
Viaarxiv icon

Emu3.5: Native Multimodal Models are World Learners

Add code
Oct 30, 2025
Viaarxiv icon

$π_\texttt{RL}$: Online RL Fine-tuning for Flow-based Vision-Language-Action Models

Add code
Oct 29, 2025
Figure 1 for $π_\texttt{RL}$: Online RL Fine-tuning for Flow-based Vision-Language-Action Models
Figure 2 for $π_\texttt{RL}$: Online RL Fine-tuning for Flow-based Vision-Language-Action Models
Figure 3 for $π_\texttt{RL}$: Online RL Fine-tuning for Flow-based Vision-Language-Action Models
Figure 4 for $π_\texttt{RL}$: Online RL Fine-tuning for Flow-based Vision-Language-Action Models
Viaarxiv icon