Picture for Ruofan Hu

Ruofan Hu

DocRetriever: A Plug-and-Play Framework for Multimodal Document Retrieval with Comprehensive Benchmark

Add code
May 28, 2026
Viaarxiv icon

DynSess: Dynamic Session-Level Evaluation and Optimization Framework for Role-Playing Agents

Add code
May 28, 2026
Viaarxiv icon

From Facts to Insights: A Persona-Driven Dual Memory Framework and Dataset for Role-Playing Agents

Add code
May 25, 2026
Viaarxiv icon

SpatialReward: Verifiable Spatial Reward Modeling for Fine-Grained Spatial Consistency in Text-to-Image Generation

Add code
Mar 23, 2026
Viaarxiv icon

Generative Reasoning Recommendation via LLMs

Add code
Oct 23, 2025
Viaarxiv icon

CLID-MU: Cross-Layer Information Divergence Based Meta Update Strategy for Learning with Noisy Labels

Add code
Jul 16, 2025
Figure 1 for CLID-MU: Cross-Layer Information Divergence Based Meta Update Strategy for Learning with Noisy Labels
Figure 2 for CLID-MU: Cross-Layer Information Divergence Based Meta Update Strategy for Learning with Noisy Labels
Figure 3 for CLID-MU: Cross-Layer Information Divergence Based Meta Update Strategy for Learning with Noisy Labels
Figure 4 for CLID-MU: Cross-Layer Information Divergence Based Meta Update Strategy for Learning with Noisy Labels
Viaarxiv icon

Vela: Scalable Embeddings with Voice Large Language Models for Multimodal Retrieval

Add code
Jun 17, 2025
Viaarxiv icon

OmniChat: Enhancing Spoken Dialogue Systems with Scalable Synthetic Data for Diverse Scenarios

Add code
Jan 02, 2025
Figure 1 for OmniChat: Enhancing Spoken Dialogue Systems with Scalable Synthetic Data for Diverse Scenarios
Figure 2 for OmniChat: Enhancing Spoken Dialogue Systems with Scalable Synthetic Data for Diverse Scenarios
Figure 3 for OmniChat: Enhancing Spoken Dialogue Systems with Scalable Synthetic Data for Diverse Scenarios
Figure 4 for OmniChat: Enhancing Spoken Dialogue Systems with Scalable Synthetic Data for Diverse Scenarios
Viaarxiv icon

Prompt-Singer: Controllable Singing-Voice-Synthesis with Natural Language Prompt

Add code
Mar 18, 2024
Figure 1 for Prompt-Singer: Controllable Singing-Voice-Synthesis with Natural Language Prompt
Figure 2 for Prompt-Singer: Controllable Singing-Voice-Synthesis with Natural Language Prompt
Figure 3 for Prompt-Singer: Controllable Singing-Voice-Synthesis with Natural Language Prompt
Figure 4 for Prompt-Singer: Controllable Singing-Voice-Synthesis with Natural Language Prompt
Viaarxiv icon

CoLafier: Collaborative Noisy Label Purifier With Local Intrinsic Dimensionality Guidance

Add code
Jan 10, 2024
Viaarxiv icon