Yu Lu

DuPO: Enabling Reliable LLM Self-Verification via Dual Preference Optimization

Aug 20, 2025
Via arXiv

Seed LiveInterpret 2.0: End-to-end Simultaneous Speech-to-speech Translation with Your Voice

Jul 24, 2025
Via arXiv

In-Context Edit: Enabling Instructional Image Editing with In-Context Generation in Large Scale Diffusion Transformer

Apr 29, 2025
Via arXiv

Do PhD-level LLMs Truly Grasp Elementary Addition? Probing Rule Learning vs. Memorization in Large Language Models

Apr 07, 2025
Via arXiv

HarmonySet: A Comprehensive Dataset for Understanding Video-Music Semantic Alignment and Temporal Synchronization

Mar 04, 2025
Via arXiv

UBER: Uncertainty-Based Evolution with Large Language Models for Automatic Heuristic Design

Dec 30, 2024
Via arXiv

Energy-Efficient RIS-Aided Cell-Free Massive MIMO Systems: Application, Opportunities, and Challenges

Dec 23, 2024
Via arXiv

Push the Limit of Multi-modal Emotion Recognition by Prompting LLMs with Receptive-Field-Aware Attention Weighting

Nov 26, 2024
Via arXiv

MME-Finance: A Multimodal Finance Benchmark for Expert-level Understanding and Reasoning

Nov 05, 2024
Via arXiv

Nova: An Iterative Planning and Search Approach to Enhance Novelty and Diversity of LLM Generated Ideas

Oct 18, 2024
Via arXiv