Jiaqi Wang

Michael Pokorny

GeometryZero: Improving Geometry Solving for LLM with Group Contrastive Policy Optimization

Jun 08, 2025
Via arXiv

Towards Storage-Efficient Visual Document Retrieval: An Empirical Study on Reducing Patch-Level Embeddings

Jun 05, 2025
Via arXiv

Think or Not? Selective Reasoning via Reinforcement Learning for Vision-Language Models

May 22, 2025
Via arXiv

Visual Thoughts: A Unified Perspective of Understanding Multimodal Chain-of-Thought

May 21, 2025
Via arXiv

Visual Agentic Reinforcement Fine-Tuning

May 20, 2025
Via arXiv

RBF++: Quantifying and Optimizing Reasoning Boundaries across Measurable and Unmeasurable Capabilities for Chain-of-Thought Reasoning

May 19, 2025
Via arXiv

NeuroGen: Neural Network Parameter Generation via Large Language Models

May 18, 2025
Via arXiv

Toward Malicious Clients Detection in Federated Learning

May 14, 2025
Via arXiv

Unified Multimodal Chain-of-Thought Reward Model through Reinforcement Fine-Tuning

May 06, 2025
Via arXiv

MM-IFEngine: Towards Multimodal Instruction Following

Apr 10, 2025
Via arXiv