Picture for Jitao Sang

Jitao Sang

Mobile-Agent-V: A Video-Guided Approach for Effortless and Efficient Operational Knowledge Injection in Mobile Automation

Add code
May 21, 2025
Figure 1 for Mobile-Agent-V: A Video-Guided Approach for Effortless and Efficient Operational Knowledge Injection in Mobile Automation
Figure 2 for Mobile-Agent-V: A Video-Guided Approach for Effortless and Efficient Operational Knowledge Injection in Mobile Automation
Figure 3 for Mobile-Agent-V: A Video-Guided Approach for Effortless and Efficient Operational Knowledge Injection in Mobile Automation
Figure 4 for Mobile-Agent-V: A Video-Guided Approach for Effortless and Efficient Operational Knowledge Injection in Mobile Automation
Viaarxiv icon

Unifying Perplexing Behaviors in Modified BP Attributions through Alignment Perspective

Add code
Mar 14, 2025
Figure 1 for Unifying Perplexing Behaviors in Modified BP Attributions through Alignment Perspective
Figure 2 for Unifying Perplexing Behaviors in Modified BP Attributions through Alignment Perspective
Figure 3 for Unifying Perplexing Behaviors in Modified BP Attributions through Alignment Perspective
Figure 4 for Unifying Perplexing Behaviors in Modified BP Attributions through Alignment Perspective
Viaarxiv icon

Debiased Prompt Tuning in Vision-Language Model without Annotations

Add code
Mar 11, 2025
Figure 1 for Debiased Prompt Tuning in Vision-Language Model without Annotations
Figure 2 for Debiased Prompt Tuning in Vision-Language Model without Annotations
Figure 3 for Debiased Prompt Tuning in Vision-Language Model without Annotations
Figure 4 for Debiased Prompt Tuning in Vision-Language Model without Annotations
Viaarxiv icon

Agent models: Internalizing Chain-of-Action Generation into Reasoning models

Add code
Mar 09, 2025
Figure 1 for Agent models: Internalizing Chain-of-Action Generation into Reasoning models
Figure 2 for Agent models: Internalizing Chain-of-Action Generation into Reasoning models
Figure 3 for Agent models: Internalizing Chain-of-Action Generation into Reasoning models
Figure 4 for Agent models: Internalizing Chain-of-Action Generation into Reasoning models
Viaarxiv icon

Mobile-Agent-V: Learning Mobile Device Operation Through Video-Guided Multi-Agent Collaboration

Add code
Feb 25, 2025
Figure 1 for Mobile-Agent-V: Learning Mobile Device Operation Through Video-Guided Multi-Agent Collaboration
Figure 2 for Mobile-Agent-V: Learning Mobile Device Operation Through Video-Guided Multi-Agent Collaboration
Figure 3 for Mobile-Agent-V: Learning Mobile Device Operation Through Video-Guided Multi-Agent Collaboration
Figure 4 for Mobile-Agent-V: Learning Mobile Device Operation Through Video-Guided Multi-Agent Collaboration
Viaarxiv icon

OpenRFT: Adapting Reasoning Foundation Model for Domain-specific Tasks with Reinforcement Fine-Tuning

Add code
Dec 22, 2024
Viaarxiv icon

o1-Coder: an o1 Replication for Coding

Add code
Nov 29, 2024
Figure 1 for o1-Coder: an o1 Replication for Coding
Figure 2 for o1-Coder: an o1 Replication for Coding
Figure 3 for o1-Coder: an o1 Replication for Coding
Figure 4 for o1-Coder: an o1 Replication for Coding
Viaarxiv icon

Don't Command, Cultivate: An Exploratory Study of System-2 Alignment

Add code
Nov 26, 2024
Figure 1 for Don't Command, Cultivate: An Exploratory Study of System-2 Alignment
Figure 2 for Don't Command, Cultivate: An Exploratory Study of System-2 Alignment
Figure 3 for Don't Command, Cultivate: An Exploratory Study of System-2 Alignment
Figure 4 for Don't Command, Cultivate: An Exploratory Study of System-2 Alignment
Viaarxiv icon

VaLiD: Mitigating the Hallucination of Large Vision Language Models by Visual Layer Fusion Contrastive Decoding

Add code
Nov 24, 2024
Viaarxiv icon

Debiasing Vison-Language Models with Text-Only Training

Add code
Oct 12, 2024
Figure 1 for Debiasing Vison-Language Models with Text-Only Training
Figure 2 for Debiasing Vison-Language Models with Text-Only Training
Figure 3 for Debiasing Vison-Language Models with Text-Only Training
Figure 4 for Debiasing Vison-Language Models with Text-Only Training
Viaarxiv icon