Picture for Ankit Aich

Ankit Aich

ResearchRubrics: A Benchmark of Prompts and Rubrics For Evaluating Deep Research Agents

Add code
Nov 10, 2025
Viaarxiv icon

Remote Labor Index: Measuring AI Automation of Remote Work

Add code
Oct 30, 2025
Viaarxiv icon

Reliable Weak-to-Strong Monitoring of LLM Agents

Add code
Aug 26, 2025
Figure 1 for Reliable Weak-to-Strong Monitoring of LLM Agents
Figure 2 for Reliable Weak-to-Strong Monitoring of LLM Agents
Figure 3 for Reliable Weak-to-Strong Monitoring of LLM Agents
Figure 4 for Reliable Weak-to-Strong Monitoring of LLM Agents
Viaarxiv icon

The Illusion of Empathy: How AI Chatbots Shape Conversation Perception

Add code
Nov 19, 2024
Figure 1 for The Illusion of Empathy: How AI Chatbots Shape Conversation Perception
Figure 2 for The Illusion of Empathy: How AI Chatbots Shape Conversation Perception
Figure 3 for The Illusion of Empathy: How AI Chatbots Shape Conversation Perception
Figure 4 for The Illusion of Empathy: How AI Chatbots Shape Conversation Perception
Viaarxiv icon

DiverseDialogue: A Methodology for Designing Chatbots with Human-Like Diversity

Add code
Aug 30, 2024
Figure 1 for DiverseDialogue: A Methodology for Designing Chatbots with Human-Like Diversity
Figure 2 for DiverseDialogue: A Methodology for Designing Chatbots with Human-Like Diversity
Figure 3 for DiverseDialogue: A Methodology for Designing Chatbots with Human-Like Diversity
Figure 4 for DiverseDialogue: A Methodology for Designing Chatbots with Human-Like Diversity
Viaarxiv icon

Explicit and Implicit Large Language Model Personas Generate Opinions but Fail to Replicate Deeper Perceptions and Biases

Add code
Jun 20, 2024
Figure 1 for Explicit and Implicit Large Language Model Personas Generate Opinions but Fail to Replicate Deeper Perceptions and Biases
Figure 2 for Explicit and Implicit Large Language Model Personas Generate Opinions but Fail to Replicate Deeper Perceptions and Biases
Figure 3 for Explicit and Implicit Large Language Model Personas Generate Opinions but Fail to Replicate Deeper Perceptions and Biases
Figure 4 for Explicit and Implicit Large Language Model Personas Generate Opinions but Fail to Replicate Deeper Perceptions and Biases
Viaarxiv icon

Vernacular? I Barely Know Her: Challenges with Style Control and Stereotyping

Add code
Jun 18, 2024
Figure 1 for Vernacular? I Barely Know Her: Challenges with Style Control and Stereotyping
Figure 2 for Vernacular? I Barely Know Her: Challenges with Style Control and Stereotyping
Figure 3 for Vernacular? I Barely Know Her: Challenges with Style Control and Stereotyping
Figure 4 for Vernacular? I Barely Know Her: Challenges with Style Control and Stereotyping
Viaarxiv icon

Using LLMs to Aid Annotation and Collection of Clinically-Enriched Data in Bipolar Disorder and Schizophrenia

Add code
Jun 18, 2024
Figure 1 for Using LLMs to Aid Annotation and Collection of Clinically-Enriched Data in Bipolar Disorder and Schizophrenia
Figure 2 for Using LLMs to Aid Annotation and Collection of Clinically-Enriched Data in Bipolar Disorder and Schizophrenia
Figure 3 for Using LLMs to Aid Annotation and Collection of Clinically-Enriched Data in Bipolar Disorder and Schizophrenia
Figure 4 for Using LLMs to Aid Annotation and Collection of Clinically-Enriched Data in Bipolar Disorder and Schizophrenia
Viaarxiv icon