Picture for Fandong Meng

Fandong Meng

LongDPO: Unlock Better Long-form Generation Abilities for LLMs via Critique-augmented Stepwise Information

Add code
Feb 04, 2025
Viaarxiv icon

DeepRAG: Thinking to Retrieval Step by Step for Large Language Models

Add code
Feb 03, 2025
Viaarxiv icon

Personalized Language Model Learning on Text Data Without User Identifiers

Add code
Jan 10, 2025
Viaarxiv icon

DRT-o1: Optimized Deep Reasoning Translation via Long Chain-of-Thought

Add code
Dec 23, 2024
Viaarxiv icon

PunchBench: Benchmarking MLLMs in Multimodal Punchline Comprehension

Add code
Dec 16, 2024
Figure 1 for PunchBench: Benchmarking MLLMs in Multimodal Punchline Comprehension
Figure 2 for PunchBench: Benchmarking MLLMs in Multimodal Punchline Comprehension
Figure 3 for PunchBench: Benchmarking MLLMs in Multimodal Punchline Comprehension
Figure 4 for PunchBench: Benchmarking MLLMs in Multimodal Punchline Comprehension
Viaarxiv icon

Retrieval-Augmented Machine Translation with Unstructured Knowledge

Add code
Dec 05, 2024
Viaarxiv icon

Extralonger: Toward a Unified Perspective of Spatial-Temporal Factors for Extra-Long-Term Traffic Forecasting

Add code
Oct 30, 2024
Viaarxiv icon

CRAT: A Multi-Agent Framework for Causality-Enhanced Reflective and Retrieval-Augmented Translation with Large Language Models

Add code
Oct 28, 2024
Figure 1 for CRAT: A Multi-Agent Framework for Causality-Enhanced Reflective and Retrieval-Augmented Translation with Large Language Models
Figure 2 for CRAT: A Multi-Agent Framework for Causality-Enhanced Reflective and Retrieval-Augmented Translation with Large Language Models
Figure 3 for CRAT: A Multi-Agent Framework for Causality-Enhanced Reflective and Retrieval-Augmented Translation with Large Language Models
Figure 4 for CRAT: A Multi-Agent Framework for Causality-Enhanced Reflective and Retrieval-Augmented Translation with Large Language Models
Viaarxiv icon

MiniPLM: Knowledge Distillation for Pre-Training Language Models

Add code
Oct 22, 2024
Viaarxiv icon

On the token distance modeling ability of higher RoPE attention dimension

Add code
Oct 11, 2024
Figure 1 for On the token distance modeling ability of higher RoPE attention dimension
Figure 2 for On the token distance modeling ability of higher RoPE attention dimension
Figure 3 for On the token distance modeling ability of higher RoPE attention dimension
Figure 4 for On the token distance modeling ability of higher RoPE attention dimension
Viaarxiv icon