Picture for Hao Tang

Hao Tang

Enabling Flexible Multi-LLM Integration for Scalable Knowledge Aggregation

Add code
May 28, 2025
Figure 1 for Enabling Flexible Multi-LLM Integration for Scalable Knowledge Aggregation
Figure 2 for Enabling Flexible Multi-LLM Integration for Scalable Knowledge Aggregation
Figure 3 for Enabling Flexible Multi-LLM Integration for Scalable Knowledge Aggregation
Figure 4 for Enabling Flexible Multi-LLM Integration for Scalable Knowledge Aggregation
Viaarxiv icon

SpikeStereoNet: A Brain-Inspired Framework for Stereo Depth Estimation from Spike Streams

Add code
May 26, 2025
Figure 1 for SpikeStereoNet: A Brain-Inspired Framework for Stereo Depth Estimation from Spike Streams
Figure 2 for SpikeStereoNet: A Brain-Inspired Framework for Stereo Depth Estimation from Spike Streams
Figure 3 for SpikeStereoNet: A Brain-Inspired Framework for Stereo Depth Estimation from Spike Streams
Figure 4 for SpikeStereoNet: A Brain-Inspired Framework for Stereo Depth Estimation from Spike Streams
Viaarxiv icon

Token Reduction Should Go Beyond Efficiency in Generative Models -- From Vision, Language to Multimodality

Add code
May 23, 2025
Viaarxiv icon

Multi-SpatialMLLM: Multi-Frame Spatial Understanding with Multi-Modal Large Language Models

Add code
May 22, 2025
Figure 1 for Multi-SpatialMLLM: Multi-Frame Spatial Understanding with Multi-Modal Large Language Models
Figure 2 for Multi-SpatialMLLM: Multi-Frame Spatial Understanding with Multi-Modal Large Language Models
Figure 3 for Multi-SpatialMLLM: Multi-Frame Spatial Understanding with Multi-Modal Large Language Models
Figure 4 for Multi-SpatialMLLM: Multi-Frame Spatial Understanding with Multi-Modal Large Language Models
Viaarxiv icon

SAMba-UNet: Synergizing SAM2 and Mamba in UNet with Heterogeneous Aggregation for Cardiac MRI Segmentation

Add code
May 22, 2025
Figure 1 for SAMba-UNet: Synergizing SAM2 and Mamba in UNet with Heterogeneous Aggregation for Cardiac MRI Segmentation
Figure 2 for SAMba-UNet: Synergizing SAM2 and Mamba in UNet with Heterogeneous Aggregation for Cardiac MRI Segmentation
Figure 3 for SAMba-UNet: Synergizing SAM2 and Mamba in UNet with Heterogeneous Aggregation for Cardiac MRI Segmentation
Figure 4 for SAMba-UNet: Synergizing SAM2 and Mamba in UNet with Heterogeneous Aggregation for Cardiac MRI Segmentation
Viaarxiv icon

Replace in Translation: Boost Concept Alignment in Counterfactual Text-to-Image

Add code
May 20, 2025
Figure 1 for Replace in Translation: Boost Concept Alignment in Counterfactual Text-to-Image
Figure 2 for Replace in Translation: Boost Concept Alignment in Counterfactual Text-to-Image
Figure 3 for Replace in Translation: Boost Concept Alignment in Counterfactual Text-to-Image
Figure 4 for Replace in Translation: Boost Concept Alignment in Counterfactual Text-to-Image
Viaarxiv icon

Structured Agent Distillation for Large Language Model

Add code
May 20, 2025
Figure 1 for Structured Agent Distillation for Large Language Model
Figure 2 for Structured Agent Distillation for Large Language Model
Figure 3 for Structured Agent Distillation for Large Language Model
Figure 4 for Structured Agent Distillation for Large Language Model
Viaarxiv icon

Programmatic Video Prediction Using Large Language Models

Add code
May 20, 2025
Viaarxiv icon

CtrlDiff: Boosting Large Diffusion Language Models with Dynamic Block Prediction and Controllable Generation

Add code
May 20, 2025
Viaarxiv icon

PoE-World: Compositional World Modeling with Products of Programmatic Experts

Add code
May 16, 2025
Viaarxiv icon