Picture for Teng Wang

Teng Wang

UltraHiT: A Hierarchical Transformer Architecture for Generalizable Internal Carotid Artery Robotic Ultrasonography

Add code
Sep 17, 2025
Viaarxiv icon

Predicting person-level injury severity using crash narratives: A balanced approach with roadway classification and natural language process techniques

Add code
Sep 09, 2025
Viaarxiv icon

CVBench: Evaluating Cross-Video Synergies for Complex Multimodal Understanding and Reasoning

Add code
Aug 28, 2025
Viaarxiv icon

AudioStory: Generating Long-Form Narrative Audio with Large Language Models

Add code
Aug 27, 2025
Viaarxiv icon

ARC-Hunyuan-Video-7B: Structured Video Comprehension of Real-World Shorts

Add code
Jul 28, 2025
Viaarxiv icon

SAGE: Strategy-Adaptive Generation Engine for Query Rewriting

Add code
Jun 24, 2025
Viaarxiv icon

Reinforcing Video Reasoning with Focused Thinking

Add code
May 30, 2025
Viaarxiv icon

Video-Holmes: Can MLLM Think Like Holmes for Complex Video Reasoning?

Add code
May 27, 2025
Viaarxiv icon

CP-Router: An Uncertainty-Aware Router Between LLM and LRM

Add code
May 26, 2025
Viaarxiv icon

TokLIP: Marry Visual Tokens to CLIP for Multimodal Comprehension and Generation

Add code
May 08, 2025
Viaarxiv icon