Picture for Xiangru Tang

Xiangru Tang

ChemSafetyBench: Benchmarking LLM Safety on Chemistry Domain

Add code
Nov 23, 2024
Viaarxiv icon

FinDVer: Explainable Claim Verification over Long and Hybrid-Content Financial Documents

Add code
Nov 08, 2024
Figure 1 for FinDVer: Explainable Claim Verification over Long and Hybrid-Content Financial Documents
Figure 2 for FinDVer: Explainable Claim Verification over Long and Hybrid-Content Financial Documents
Figure 3 for FinDVer: Explainable Claim Verification over Long and Hybrid-Content Financial Documents
Figure 4 for FinDVer: Explainable Claim Verification over Long and Hybrid-Content Financial Documents
Viaarxiv icon

OpenDevin: An Open Platform for AI Software Developers as Generalist Agents

Add code
Jul 23, 2024
Viaarxiv icon

Unveiling the Spectrum of Data Contamination in Language Models: A Survey from Detection to Remediation

Add code
Jun 20, 2024
Figure 1 for Unveiling the Spectrum of Data Contamination in Language Models: A Survey from Detection to Remediation
Figure 2 for Unveiling the Spectrum of Data Contamination in Language Models: A Survey from Detection to Remediation
Figure 3 for Unveiling the Spectrum of Data Contamination in Language Models: A Survey from Detection to Remediation
Viaarxiv icon

Step-Back Profiling: Distilling User History for Personalized Scientific Writing

Add code
Jun 20, 2024
Viaarxiv icon

PRESTO: Progressive Pretraining Enhances Synthetic Chemistry Outcomes

Add code
Jun 19, 2024
Viaarxiv icon

Lessons from the Trenches on Reproducible Evaluation of Language Models

Add code
May 23, 2024
Viaarxiv icon

MIMIR: A Streamlined Platform for Personalized Agent Tuning in Domain Expertise

Add code
Apr 03, 2024
Viaarxiv icon

Data Interpreter: An LLM Agent For Data Science

Add code
Mar 12, 2024
Viaarxiv icon

StarCoder 2 and The Stack v2: The Next Generation

Add code
Feb 29, 2024
Figure 1 for StarCoder 2 and The Stack v2: The Next Generation
Figure 2 for StarCoder 2 and The Stack v2: The Next Generation
Figure 3 for StarCoder 2 and The Stack v2: The Next Generation
Figure 4 for StarCoder 2 and The Stack v2: The Next Generation
Viaarxiv icon