Picture for Jin Du

Jin Du

Can Agentic AI Match the Performance of Human Data Scientists?

Add code
Dec 24, 2025
Figure 1 for Can Agentic AI Match the Performance of Human Data Scientists?
Figure 2 for Can Agentic AI Match the Performance of Human Data Scientists?
Figure 3 for Can Agentic AI Match the Performance of Human Data Scientists?
Figure 4 for Can Agentic AI Match the Performance of Human Data Scientists?
Viaarxiv icon

A benchmark multimodal oro-dental dataset for large vision-language models

Add code
Nov 07, 2025
Figure 1 for A benchmark multimodal oro-dental dataset for large vision-language models
Figure 2 for A benchmark multimodal oro-dental dataset for large vision-language models
Figure 3 for A benchmark multimodal oro-dental dataset for large vision-language models
Figure 4 for A benchmark multimodal oro-dental dataset for large vision-language models
Viaarxiv icon

An Outlook on the Opportunities and Challenges of Multi-Agent AI Systems

Add code
May 23, 2025
Viaarxiv icon

Ice Cream Doesn't Cause Drowning: Benchmarking LLMs Against Statistical Pitfalls in Causal Inference

Add code
May 19, 2025
Viaarxiv icon

Drift to Remember

Add code
Sep 21, 2024
Figure 1 for Drift to Remember
Figure 2 for Drift to Remember
Viaarxiv icon