Bo An

Skywork Open Reasoner 1 Technical Report

May 29, 2025
Via arXiv

MASLab: A Unified and Comprehensive Codebase for LLM-based Multi-Agent Systems

May 22, 2025
Via arXiv

Enhance Mobile Agents Thinking Process Via Iterative Preference Learning

May 18, 2025
Via arXiv

Mobile-Bench-v2: A More Realistic and Comprehensive Benchmark for VLM-based Mobile Agents

May 17, 2025
Via arXiv

Group-in-Group Policy Optimization for LLM Agent Training

May 16, 2025
Via arXiv

Establishing Linear Surrogate Regret Bounds for Convex Smooth Losses via Convolutional Fenchel-Young Losses

May 15, 2025
Via arXiv

Towards Efficient Online Tuning of VLM Agents via Counterfactual Soft Reinforcement Learning

May 01, 2025
Via arXiv

MF-LLM: Simulating Collective Decision Dynamics via a Mean-Field Large Language Model Framework

Apr 30, 2025
Via arXiv

Guiding VLM Agents with Process Rewards at Inference Time for GUI Navigation

Apr 22, 2025
Via arXiv

A Comprehensive Survey in LLM(-Agent) Full Stack Safety: Data, Training and Deployment

Apr 22, 2025
Via arXiv