Jayanth Srinivasa

AgentDS Technical Report: Benchmarking the Future of Human-AI Collaboration in Domain-Specific Data Science

Mar 19, 2026

Context Bootstrapped Reinforcement Learning

Mar 19, 2026

Can Agentic AI Match the Performance of Human Data Scientists?

Dec 24, 2025

Dora: QoE-Aware Hybrid Parallelism for Distributed Edge AI

Dec 09, 2025

How Can Input Reformulation Improve Tool Usage Accuracy in a Complex Dynamic Environment? A Study on $τ$-bench

Aug 28, 2025

EXP-Bench: Can AI Conduct AI Research Experiments?

May 30, 2025

An Outlook on the Opportunities and Challenges of Multi-Agent AI Systems

May 23, 2025

SafeKey: Amplifying Aha-Moment Insights for Safety Reasoning

May 22, 2025

Bidirectional LMs are Better Knowledge Memorizers? A Benchmark for Real-world Knowledge Injection

May 18, 2025

SkillWeaver: Web Agents can Self-Improve by Discovering and Honing Skills

Apr 09, 2025