Picture for Xipeng Qiu

Xipeng Qiu

AutoLogi: Automated Generation of Logic Puzzles for Evaluating Reasoning Abilities of Large Language Models

Add code
Feb 24, 2025
Figure 1 for AutoLogi: Automated Generation of Logic Puzzles for Evaluating Reasoning Abilities of Large Language Models
Figure 2 for AutoLogi: Automated Generation of Logic Puzzles for Evaluating Reasoning Abilities of Large Language Models
Figure 3 for AutoLogi: Automated Generation of Logic Puzzles for Evaluating Reasoning Abilities of Large Language Models
Figure 4 for AutoLogi: Automated Generation of Logic Puzzles for Evaluating Reasoning Abilities of Large Language Models
Viaarxiv icon

Thus Spake Long-Context Large Language Model

Add code
Feb 24, 2025
Viaarxiv icon

Human2Robot: Learning Robot Actions from Paired Human-Robot Videos

Add code
Feb 23, 2025
Figure 1 for Human2Robot: Learning Robot Actions from Paired Human-Robot Videos
Figure 2 for Human2Robot: Learning Robot Actions from Paired Human-Robot Videos
Figure 3 for Human2Robot: Learning Robot Actions from Paired Human-Robot Videos
Figure 4 for Human2Robot: Learning Robot Actions from Paired Human-Robot Videos
Viaarxiv icon

Towards Economical Inference: Enabling DeepSeek's Multi-Head Latent Attention in Any Transformer-based LLMs

Add code
Feb 20, 2025
Viaarxiv icon

UnitCoder: Scalable Iterative Code Synthesis with Unit Test Guidance

Add code
Feb 17, 2025
Viaarxiv icon

FastMCTS: A Simple Sampling Strategy for Data Synthesis

Add code
Feb 17, 2025
Figure 1 for FastMCTS: A Simple Sampling Strategy for Data Synthesis
Figure 2 for FastMCTS: A Simple Sampling Strategy for Data Synthesis
Figure 3 for FastMCTS: A Simple Sampling Strategy for Data Synthesis
Figure 4 for FastMCTS: A Simple Sampling Strategy for Data Synthesis
Viaarxiv icon

Revisiting the Test-Time Scaling of o1-like Models: Do they Truly Possess Test-Time Scaling Capabilities?

Add code
Feb 17, 2025
Figure 1 for Revisiting the Test-Time Scaling of o1-like Models: Do they Truly Possess Test-Time Scaling Capabilities?
Figure 2 for Revisiting the Test-Time Scaling of o1-like Models: Do they Truly Possess Test-Time Scaling Capabilities?
Figure 3 for Revisiting the Test-Time Scaling of o1-like Models: Do they Truly Possess Test-Time Scaling Capabilities?
Figure 4 for Revisiting the Test-Time Scaling of o1-like Models: Do they Truly Possess Test-Time Scaling Capabilities?
Viaarxiv icon

VideoRoPE: What Makes for Good Video Rotary Position Embedding?

Add code
Feb 07, 2025
Figure 1 for VideoRoPE: What Makes for Good Video Rotary Position Embedding?
Figure 2 for VideoRoPE: What Makes for Good Video Rotary Position Embedding?
Figure 3 for VideoRoPE: What Makes for Good Video Rotary Position Embedding?
Figure 4 for VideoRoPE: What Makes for Good Video Rotary Position Embedding?
Viaarxiv icon

CHiP: Cross-modal Hierarchical Direct Preference Optimization for Multimodal LLMs

Add code
Jan 28, 2025
Figure 1 for CHiP: Cross-modal Hierarchical Direct Preference Optimization for Multimodal LLMs
Figure 2 for CHiP: Cross-modal Hierarchical Direct Preference Optimization for Multimodal LLMs
Figure 3 for CHiP: Cross-modal Hierarchical Direct Preference Optimization for Multimodal LLMs
Figure 4 for CHiP: Cross-modal Hierarchical Direct Preference Optimization for Multimodal LLMs
Viaarxiv icon

Error Classification of Large Language Models on Math Word Problems: A Dynamically Adaptive Framework

Add code
Jan 26, 2025
Figure 1 for Error Classification of Large Language Models on Math Word Problems: A Dynamically Adaptive Framework
Figure 2 for Error Classification of Large Language Models on Math Word Problems: A Dynamically Adaptive Framework
Figure 3 for Error Classification of Large Language Models on Math Word Problems: A Dynamically Adaptive Framework
Figure 4 for Error Classification of Large Language Models on Math Word Problems: A Dynamically Adaptive Framework
Viaarxiv icon