Picture for Liangming Pan

Liangming Pan

How does Transformer Learn Implicit Reasoning?

Add code
May 29, 2025
Viaarxiv icon

How Is LLM Reasoning Distracted by Irrelevant Context? An Analysis Using a Controlled Benchmark

Add code
May 24, 2025
Viaarxiv icon

ConciseRL: Conciseness-Guided Reinforcement Learning for Efficient Reasoning Models

Add code
May 22, 2025
Viaarxiv icon

FinTMMBench: Benchmarking Temporal-Aware Multi-Modal RAG in Finance

Add code
Mar 07, 2025
Figure 1 for FinTMMBench: Benchmarking Temporal-Aware Multi-Modal RAG in Finance
Figure 2 for FinTMMBench: Benchmarking Temporal-Aware Multi-Modal RAG in Finance
Figure 3 for FinTMMBench: Benchmarking Temporal-Aware Multi-Modal RAG in Finance
Figure 4 for FinTMMBench: Benchmarking Temporal-Aware Multi-Modal RAG in Finance
Viaarxiv icon

InductionBench: LLMs Fail in the Simplest Complexity Class

Add code
Feb 26, 2025
Viaarxiv icon

Aristotle: Mastering Logical Reasoning with A Logic-Complete Decompose-Search-Resolve Framework

Add code
Dec 22, 2024
Viaarxiv icon

AntiLeak-Bench: Preventing Data Contamination by Automatically Constructing Benchmarks with Updated Real-World Knowledge

Add code
Dec 18, 2024
Figure 1 for AntiLeak-Bench: Preventing Data Contamination by Automatically Constructing Benchmarks with Updated Real-World Knowledge
Figure 2 for AntiLeak-Bench: Preventing Data Contamination by Automatically Constructing Benchmarks with Updated Real-World Knowledge
Figure 3 for AntiLeak-Bench: Preventing Data Contamination by Automatically Constructing Benchmarks with Updated Real-World Knowledge
Figure 4 for AntiLeak-Bench: Preventing Data Contamination by Automatically Constructing Benchmarks with Updated Real-World Knowledge
Viaarxiv icon

Combating Multimodal LLM Hallucination via Bottom-up Holistic Reasoning

Add code
Dec 15, 2024
Viaarxiv icon

RuleArena: A Benchmark for Rule-Guided Reasoning with LLMs in Real-World Scenarios

Add code
Dec 12, 2024
Figure 1 for RuleArena: A Benchmark for Rule-Guided Reasoning with LLMs in Real-World Scenarios
Figure 2 for RuleArena: A Benchmark for Rule-Guided Reasoning with LLMs in Real-World Scenarios
Figure 3 for RuleArena: A Benchmark for Rule-Guided Reasoning with LLMs in Real-World Scenarios
Figure 4 for RuleArena: A Benchmark for Rule-Guided Reasoning with LLMs in Real-World Scenarios
Viaarxiv icon

Improving Causal Reasoning in Large Language Models: A Survey

Add code
Oct 22, 2024
Figure 1 for Improving Causal Reasoning in Large Language Models: A Survey
Figure 2 for Improving Causal Reasoning in Large Language Models: A Survey
Figure 3 for Improving Causal Reasoning in Large Language Models: A Survey
Figure 4 for Improving Causal Reasoning in Large Language Models: A Survey
Viaarxiv icon