Action Parsing


From Watch to Imagine: Steering Long-horizon Manipulation via Human Demonstration and Future Envisionment

Add code
Sep 26, 2025
Viaarxiv icon

SAGE: Scene Graph-Aware Guidance and Execution for Long-Horizon Manipulation Tasks

Add code
Sep 26, 2025
Viaarxiv icon

Hierarchical Bracketing Encodings Work for Dependency Graphs

Add code
Sep 11, 2025
Viaarxiv icon

EndoAgent: A Memory-Guided Reflective Agent for Intelligent Endoscopic Vision-to-Decision Reasoning

Add code
Aug 10, 2025
Viaarxiv icon

Incremental Language Understanding for Online Motion Planning of Robot Manipulators

Add code
Aug 08, 2025
Viaarxiv icon

Human in the Loop Adaptive Optimization for Improved Time Series Forecasting

Add code
May 21, 2025
Viaarxiv icon

PhysLab: A Benchmark Dataset for Multi-Granularity Visual Parsing of Physics Experiments

Add code
Jun 07, 2025
Figure 1 for PhysLab: A Benchmark Dataset for Multi-Granularity Visual Parsing of Physics Experiments
Figure 2 for PhysLab: A Benchmark Dataset for Multi-Granularity Visual Parsing of Physics Experiments
Figure 3 for PhysLab: A Benchmark Dataset for Multi-Granularity Visual Parsing of Physics Experiments
Figure 4 for PhysLab: A Benchmark Dataset for Multi-Granularity Visual Parsing of Physics Experiments
Viaarxiv icon

Terminators: Terms of Service Parsing and Auditing Agents

Add code
May 16, 2025
Viaarxiv icon

UFO2: The Desktop AgentOS

Add code
Apr 20, 2025
Figure 1 for UFO2: The Desktop AgentOS
Figure 2 for UFO2: The Desktop AgentOS
Figure 3 for UFO2: The Desktop AgentOS
Figure 4 for UFO2: The Desktop AgentOS
Viaarxiv icon

The Lighthouse of Language: Enhancing LLM Agents via Critique-Guided Improvement

Add code
Mar 20, 2025
Viaarxiv icon