Picture for Fei Huang

Fei Huang

additional authors not shown

VLM-R$^3$: Region Recognition, Reasoning, and Refinement for Enhanced Multimodal Chain-of-Thought

Add code
May 22, 2025
Viaarxiv icon

Mobile-Agent-V: A Video-Guided Approach for Effortless and Efficient Operational Knowledge Injection in Mobile Automation

Add code
May 21, 2025
Viaarxiv icon

SoLoPO: Unlocking Long-Context Capabilities in LLMs via Short-to-Long Preference Optimization

Add code
May 16, 2025
Viaarxiv icon

WorldPM: Scaling Human Preference Modeling

Add code
May 15, 2025
Viaarxiv icon

Qwen3 Technical Report

Add code
May 14, 2025
Figure 1 for Qwen3 Technical Report
Figure 2 for Qwen3 Technical Report
Figure 3 for Qwen3 Technical Report
Figure 4 for Qwen3 Technical Report
Viaarxiv icon

Gated Attention for Large Language Models: Non-linearity, Sparsity, and Attention-Sink-Free

Add code
May 10, 2025
Viaarxiv icon

ZeroSearch: Incentivize the Search Capability of LLMs without Searching

Add code
May 07, 2025
Viaarxiv icon

Think on your Feet: Adaptive Thinking via Reinforcement Learning for Social Agents

Add code
May 04, 2025
Figure 1 for Think on your Feet: Adaptive Thinking via Reinforcement Learning for Social Agents
Figure 2 for Think on your Feet: Adaptive Thinking via Reinforcement Learning for Social Agents
Figure 3 for Think on your Feet: Adaptive Thinking via Reinforcement Learning for Social Agents
Figure 4 for Think on your Feet: Adaptive Thinking via Reinforcement Learning for Social Agents
Viaarxiv icon

Towards Efficient Online Tuning of VLM Agents via Counterfactual Soft Reinforcement Learning

Add code
May 01, 2025
Viaarxiv icon

PolyMath: Evaluating Mathematical Reasoning in Multilingual Contexts

Add code
Apr 30, 2025
Viaarxiv icon