Alice Li

On the Effects of Data Scale on Computer Control Agents

Jun 06, 2024

Dissociation of Faithful and Unfaithful Reasoning in LLMs

May 23, 2024

AndroidWorld: A Dynamic Benchmarking Environment for Autonomous Agents

May 23, 2024

Generative AI Search Engines as Arbiters of Public Knowledge: An Audit of Bias and Authority

May 22, 2024

Latent State Estimation Helps UI Agents to Reason

May 17, 2024

Android in the Wild: A Large-Scale Dataset for Android Device Control

Jul 19, 2023

The 7th AI City Challenge

Apr 15, 2023

Productivity Assessment of Neural Code Completion

May 13, 2022

The 6th AI City Challenge

Apr 21, 2022