Text


VisualThink-VLA: Visual Intermediate Reasoning for Effective and Low-Latency Vision-Language-Action Policies

Add code
May 28, 2026
Viaarxiv icon

Reinforcement Learning with Robust Rubric Rewards

Add code
May 28, 2026
Viaarxiv icon

SGMD: Score Gradient Matching Distillation for Few-Step Video Diffusion Distillation

Add code
May 28, 2026
Viaarxiv icon

MedCase-Structured: A Text-to-FHIR Dataset for Benchmarking Diagnostic Reasoning in Clinically Realistic EHR Settings

Add code
May 28, 2026
Viaarxiv icon

LLMSurgeon: Diagnosing Data Mixture of Large Language Models

Add code
May 28, 2026
Viaarxiv icon

AsymVLM: Asymmetric Token Pruning for Efficient Vision-Language Model Inference

Add code
May 28, 2026
Viaarxiv icon

BitC-3DGS: High-Capacity 3D Gaussian Splatting Watermarking via Bit Compression

Add code
May 28, 2026
Viaarxiv icon

The Importance of Being Statistically Earnest: A Critical Re-evaluation of GSM-Symbolic

Add code
May 28, 2026
Viaarxiv icon

Comparative Evaluation of Machine Translation Systems on Images with Text

Add code
May 28, 2026
Viaarxiv icon

GraphLit: Learning Text-Enriched Dynamic Character Network Representations for Literary Study

Add code
May 28, 2026
Viaarxiv icon