Picture for Aojun Zhou

Aojun Zhou

MINT-CoT: Enabling Interleaved Visual Tokens in Mathematical Chain-of-Thought Reasoning

Add code
Jun 05, 2025
Viaarxiv icon

Probability-Consistent Preference Optimization for Enhanced LLM Reasoning

Add code
May 29, 2025
Viaarxiv icon

UI-Genie: A Self-Improving Approach for Iteratively Boosting MLLM-based Mobile GUI Agents

Add code
May 27, 2025
Viaarxiv icon

MathCoder-VL: Bridging Vision and Code for Enhanced Multimodal Mathematical Reasoning

Add code
May 15, 2025
Viaarxiv icon

Adaptive Markup Language Generation for Contextually-Grounded Visual Document Understanding

Add code
May 08, 2025
Viaarxiv icon

WebGen-Bench: Evaluating LLMs on Generating Interactive and Functional Websites from Scratch

Add code
May 06, 2025
Viaarxiv icon

GenieBlue: Integrating both Linguistic and Multimodal Capabilities for Large Language Models on Mobile Devices

Add code
Mar 08, 2025
Viaarxiv icon

Chimera: Improving Generalist Model with Domain-Specific Experts

Add code
Dec 08, 2024
Viaarxiv icon

BlueLM-V-3B: Algorithm and System Co-Design for Multimodal Large Language Models on Mobile Devices

Add code
Nov 16, 2024
Viaarxiv icon

MathCoder2: Better Math Reasoning from Continued Pretraining on Model-translated Mathematical Code

Add code
Oct 10, 2024
Figure 1 for MathCoder2: Better Math Reasoning from Continued Pretraining on Model-translated Mathematical Code
Figure 2 for MathCoder2: Better Math Reasoning from Continued Pretraining on Model-translated Mathematical Code
Figure 3 for MathCoder2: Better Math Reasoning from Continued Pretraining on Model-translated Mathematical Code
Figure 4 for MathCoder2: Better Math Reasoning from Continued Pretraining on Model-translated Mathematical Code
Viaarxiv icon