Picture for Xiaoyu Li

Xiaoyu Li

Alphabetical order by last name

AMO-Bench: Large Language Models Still Struggle in High School Math Competitions

Add code
Oct 30, 2025
Figure 1 for AMO-Bench: Large Language Models Still Struggle in High School Math Competitions
Figure 2 for AMO-Bench: Large Language Models Still Struggle in High School Math Competitions
Figure 3 for AMO-Bench: Large Language Models Still Struggle in High School Math Competitions
Figure 4 for AMO-Bench: Large Language Models Still Struggle in High School Math Competitions
Viaarxiv icon

ToonComposer: Streamlining Cartoon Production with Generative Post-Keyframing

Add code
Aug 14, 2025
Viaarxiv icon

Channel-Independent Federated Traffic Prediction

Add code
Aug 06, 2025
Figure 1 for Channel-Independent Federated Traffic Prediction
Figure 2 for Channel-Independent Federated Traffic Prediction
Figure 3 for Channel-Independent Federated Traffic Prediction
Figure 4 for Channel-Independent Federated Traffic Prediction
Viaarxiv icon

MoCHA: Advanced Vision-Language Reasoning with MoE Connector and Hierarchical Group Attention

Add code
Jul 30, 2025
Viaarxiv icon

IC-Custom: Diverse Image Customization via In-Context Learning

Add code
Jul 02, 2025
Viaarxiv icon

Ming-Omni: A Unified Multimodal Model for Perception and Generation

Add code
Jun 11, 2025
Figure 1 for Ming-Omni: A Unified Multimodal Model for Perception and Generation
Figure 2 for Ming-Omni: A Unified Multimodal Model for Perception and Generation
Figure 3 for Ming-Omni: A Unified Multimodal Model for Perception and Generation
Figure 4 for Ming-Omni: A Unified Multimodal Model for Perception and Generation
Viaarxiv icon

Proactive Guidance of Multi-Turn Conversation in Industrial Search

Add code
May 30, 2025
Figure 1 for Proactive Guidance of Multi-Turn Conversation in Industrial Search
Figure 2 for Proactive Guidance of Multi-Turn Conversation in Industrial Search
Figure 3 for Proactive Guidance of Multi-Turn Conversation in Industrial Search
Figure 4 for Proactive Guidance of Multi-Turn Conversation in Industrial Search
Viaarxiv icon

Sci-Fi: Symmetric Constraint for Frame Inbetweening

Add code
May 27, 2025
Figure 1 for Sci-Fi: Symmetric Constraint for Frame Inbetweening
Figure 2 for Sci-Fi: Symmetric Constraint for Frame Inbetweening
Figure 3 for Sci-Fi: Symmetric Constraint for Frame Inbetweening
Figure 4 for Sci-Fi: Symmetric Constraint for Frame Inbetweening
Viaarxiv icon

ViC-Bench: Benchmarking Visual-Interleaved Chain-of-Thought Capability in MLLMs with Free-Style Intermediate State Representations

Add code
May 20, 2025
Figure 1 for ViC-Bench: Benchmarking Visual-Interleaved Chain-of-Thought Capability in MLLMs with Free-Style Intermediate State Representations
Figure 2 for ViC-Bench: Benchmarking Visual-Interleaved Chain-of-Thought Capability in MLLMs with Free-Style Intermediate State Representations
Figure 3 for ViC-Bench: Benchmarking Visual-Interleaved Chain-of-Thought Capability in MLLMs with Free-Style Intermediate State Representations
Figure 4 for ViC-Bench: Benchmarking Visual-Interleaved Chain-of-Thought Capability in MLLMs with Free-Style Intermediate State Representations
Viaarxiv icon

Audio Turing Test: Benchmarking the Human-likeness of Large Language Model-based Text-to-Speech Systems in Chinese

Add code
May 16, 2025
Viaarxiv icon