Picture for Ruihua Song

Ruihua Song

JointAVBench: A Benchmark for Joint Audio-Visual Reasoning Evaluation

Add code
Dec 14, 2025
Viaarxiv icon

Robust Motion Generation using Part-level Reliable Data from Videos

Add code
Dec 14, 2025
Viaarxiv icon

ChronusOmni: Improving Time Awareness of Omni Large Language Models

Add code
Dec 10, 2025
Viaarxiv icon

IterResearch: Rethinking Long-Horizon Agents via Markovian State Reconstruction

Add code
Nov 10, 2025
Viaarxiv icon

MARS: Optimizing Dual-System Deep Research via Multi-Agent Reinforcement Learning

Add code
Oct 06, 2025
Viaarxiv icon

Select, Read, and Write: A Multi-Agent Framework of Full-Text-based Related Work Generation

Add code
May 26, 2025
Viaarxiv icon

Evaluating Text Creativity across Diverse Domains: A Dataset and Large Language Model Evaluator

Add code
May 25, 2025
Viaarxiv icon

Think Silently, Think Fast: Dynamic Latent Compression of LLM Reasoning Chains

Add code
May 22, 2025
Figure 1 for Think Silently, Think Fast: Dynamic Latent Compression of LLM Reasoning Chains
Figure 2 for Think Silently, Think Fast: Dynamic Latent Compression of LLM Reasoning Chains
Figure 3 for Think Silently, Think Fast: Dynamic Latent Compression of LLM Reasoning Chains
Figure 4 for Think Silently, Think Fast: Dynamic Latent Compression of LLM Reasoning Chains
Viaarxiv icon

Enhancing Audiovisual Speech Recognition through Bifocal Preference Optimization

Add code
Dec 26, 2024
Figure 1 for Enhancing Audiovisual Speech Recognition through Bifocal Preference Optimization
Figure 2 for Enhancing Audiovisual Speech Recognition through Bifocal Preference Optimization
Figure 3 for Enhancing Audiovisual Speech Recognition through Bifocal Preference Optimization
Figure 4 for Enhancing Audiovisual Speech Recognition through Bifocal Preference Optimization
Viaarxiv icon

Two-in-One: Unified Multi-Person Interactive Motion Generation by Latent Diffusion Transformer

Add code
Dec 21, 2024
Figure 1 for Two-in-One: Unified Multi-Person Interactive Motion Generation by Latent Diffusion Transformer
Figure 2 for Two-in-One: Unified Multi-Person Interactive Motion Generation by Latent Diffusion Transformer
Figure 3 for Two-in-One: Unified Multi-Person Interactive Motion Generation by Latent Diffusion Transformer
Figure 4 for Two-in-One: Unified Multi-Person Interactive Motion Generation by Latent Diffusion Transformer
Viaarxiv icon