Yuquan Xie

Mirage-1: Augmenting and Updating GUI Agent with Hierarchical Multimodal Skills

Jun 12, 2025

Optimus-3: Towards Generalist Multimodal Minecraft Agents with Scalable Task Experts

Jun 12, 2025

Optimus-2: Multimodal Minecraft Agent with Goal-Observation-Action Conditioned Policy

Feb 27, 2025

Optimus-1: Hybrid Multimodal Memory Empowered Agents Excel in Long-Horizon Tasks

Aug 07, 2024

Domain Generalizable Knowledge Tracing via Concept Aggregation and Relation-Based Attention

Jul 02, 2024

HCQA @ Ego4D EgoSchema Challenge 2024

Jun 22, 2024

ObjectNLQ @ Ego4D Episodic Memory Challenge 2024

Jun 22, 2024