Xipeng Qiu

CHiP: Cross-modal Hierarchical Direct Preference Optimization for Multimodal LLMs

Jan 28, 2025

Error Classification of Large Language Models on Math Word Problems: A Dynamically Adaptive Framework

Jan 26, 2025

VLABench: A Large-Scale Benchmark for Language-Conditioned Robotics Manipulation with Long-Horizon Reasoning Tasks

Dec 24, 2024

Scaling of Search and Learning: A Roadmap to Reproduce o1 from Reinforcement Learning Perspective

Dec 18, 2024

Inst-IT: Boosting Multimodal Instance Understanding via Explicit Visual Prompt Instruction Tuning

Dec 04, 2024

ForgerySleuth: Empowering Multimodal Large Language Models for Image Manipulation Detection

Nov 29, 2024

Enhancing LLM Reasoning via Critique Models with Test-Time and Training-Time Supervision

Nov 25, 2024

LongSafetyBench: Long-Context LLMs Struggle with Safety Issues

Nov 11, 2024

Can Language Models Learn to Skip Steps?

Nov 04, 2024

BitStack: Fine-Grained Size Control for Compressed Large Language Models in Variable Memory Environments

Oct 31, 2024