Picture for Jun Song

Jun Song

DDA-Thinker: Decoupled Dual-Atomic Reinforcement Learning for Reasoning-Driven Image Editing

Add code
Apr 28, 2026
Viaarxiv icon

ClawNet: Human-Symbiotic Agent Network for Cross-User Autonomous Cooperation

Add code
Apr 21, 2026
Viaarxiv icon

Geoparsing: Diagram Parsing for Plane and Solid Geometry with a Unified Formal Language

Add code
Apr 13, 2026
Viaarxiv icon

AdaSpark: Adaptive Sparsity for Efficient Long-Video Understanding

Add code
Apr 09, 2026
Viaarxiv icon

GIFT: Global Irreplaceability Frame Targeting for Efficient Video Understanding

Add code
Mar 26, 2026
Viaarxiv icon

SpatialReward: Verifiable Spatial Reward Modeling for Fine-Grained Spatial Consistency in Text-to-Image Generation

Add code
Mar 23, 2026
Viaarxiv icon

SecAgent: Efficient Mobile GUI Agent with Semantic Context

Add code
Mar 09, 2026
Viaarxiv icon

How Foundational Skills Influence VLM-based Embodied Agents:A Native Perspective

Add code
Feb 24, 2026
Viaarxiv icon

GeoEyes: On-Demand Visual Focusing for Evidence-Grounded Understanding of Ultra-High-Resolution Remote Sensing Imagery

Add code
Feb 15, 2026
Viaarxiv icon

Text Before Vision: Staged Knowledge Injection Matters for Agentic RLVR in Ultra-High-Resolution Remote Sensing Understanding

Add code
Feb 15, 2026
Viaarxiv icon