Song Han

University of Connecticut

FoundationMotion: Auto-Labeling and Reasoning about Spatial Movement in Videos

Dec 11, 2025, via arXiv

Optimizing Mixture of Block Attention

Nov 14, 2025, via arXiv

ParoQuant: Pairwise Rotation Quantization for Efficient Reasoning LLM Inference

Nov 13, 2025, via arXiv

StreamDiffusionV2: A Streaming System for Dynamic and Interactive Video Generation

Nov 10, 2025, via arXiv

NVIDIA Nemotron Nano V2 VL

Nov 07, 2025, via arXiv

OckBench: Measuring the Efficiency of LLM Reasoning

Nov 07, 2025, via arXiv

StreamingVLM: Real-Time Understanding for Infinite Video Streams

Oct 10, 2025, via arXiv

LongLive: Real-time Interactive Long Video Generation

Sep 26, 2025, via arXiv

3D Aware Region Prompted Vision Language Model

Sep 16, 2025, via arXiv

Artificial Intelligence-Based Multiscale Temporal Modeling for Anomaly Detection in Cloud Services

Aug 20, 2025, via arXiv