Yahui Liu

SeqPE: Transformer with Sequential Position Encoding

Jun 16, 2025
Via arXiv

Evaluating Multimodal Large Language Models on Video Captioning via Monte Carlo Tree Search

Jun 11, 2025
Via arXiv

What Makes a Good Reasoning Chain? Uncovering Structural Patterns in Long Chain-of-Thought Reasoning

May 28, 2025
Via arXiv

Modality Curation: Building Universal Embeddings for Advanced Multimodal Information Retrieval

May 26, 2025
Via arXiv

Evaluating Text Creativity across Diverse Domains: A Dataset and Large Language Model Evaluator

May 25, 2025
Via arXiv

Guiding the Experts: Semantic Priors for Efficient and Focused MoE Routing

May 24, 2025
Via arXiv

DiffusionReward: Enhancing Blind Face Restoration through Reward Feedback Learning

May 23, 2025
Via arXiv

Leanabell-Prover: Posttraining Scaling in Formal Reasoning

Apr 09, 2025
Via arXiv

SEGT: A General Spatial Expansion Group Transformer for nuScenes Lidar-based Object Detection Task

Dec 12, 2024
Via arXiv

LTOS: Layout-controllable Text-Object Synthesis via Adaptive Cross-attention Fusions

Apr 21, 2024
Via arXiv