Picture for Liangming Pan

Liangming Pan

LEDOM: An Open and Fundamental Reverse Language Model

Add code
Jul 02, 2025
Viaarxiv icon

How does Transformer Learn Implicit Reasoning?

Add code
May 29, 2025
Viaarxiv icon

How Is LLM Reasoning Distracted by Irrelevant Context? An Analysis Using a Controlled Benchmark

Add code
May 24, 2025
Viaarxiv icon

ConciseRL: Conciseness-Guided Reinforcement Learning for Efficient Reasoning Models

Add code
May 22, 2025
Viaarxiv icon

FinTMMBench: Benchmarking Temporal-Aware Multi-Modal RAG in Finance

Add code
Mar 07, 2025
Figure 1 for FinTMMBench: Benchmarking Temporal-Aware Multi-Modal RAG in Finance
Figure 2 for FinTMMBench: Benchmarking Temporal-Aware Multi-Modal RAG in Finance
Figure 3 for FinTMMBench: Benchmarking Temporal-Aware Multi-Modal RAG in Finance
Figure 4 for FinTMMBench: Benchmarking Temporal-Aware Multi-Modal RAG in Finance
Viaarxiv icon

InductionBench: LLMs Fail in the Simplest Complexity Class

Add code
Feb 26, 2025
Viaarxiv icon

Aristotle: Mastering Logical Reasoning with A Logic-Complete Decompose-Search-Resolve Framework

Add code
Dec 22, 2024
Viaarxiv icon

AntiLeak-Bench: Preventing Data Contamination by Automatically Constructing Benchmarks with Updated Real-World Knowledge

Add code
Dec 18, 2024
Figure 1 for AntiLeak-Bench: Preventing Data Contamination by Automatically Constructing Benchmarks with Updated Real-World Knowledge
Figure 2 for AntiLeak-Bench: Preventing Data Contamination by Automatically Constructing Benchmarks with Updated Real-World Knowledge
Figure 3 for AntiLeak-Bench: Preventing Data Contamination by Automatically Constructing Benchmarks with Updated Real-World Knowledge
Figure 4 for AntiLeak-Bench: Preventing Data Contamination by Automatically Constructing Benchmarks with Updated Real-World Knowledge
Viaarxiv icon

Combating Multimodal LLM Hallucination via Bottom-up Holistic Reasoning

Add code
Dec 15, 2024
Viaarxiv icon

RuleArena: A Benchmark for Rule-Guided Reasoning with LLMs in Real-World Scenarios

Add code
Dec 12, 2024
Figure 1 for RuleArena: A Benchmark for Rule-Guided Reasoning with LLMs in Real-World Scenarios
Figure 2 for RuleArena: A Benchmark for Rule-Guided Reasoning with LLMs in Real-World Scenarios
Figure 3 for RuleArena: A Benchmark for Rule-Guided Reasoning with LLMs in Real-World Scenarios
Figure 4 for RuleArena: A Benchmark for Rule-Guided Reasoning with LLMs in Real-World Scenarios
Viaarxiv icon