Picture for Zengzhi Wang

Zengzhi Wang

MegaScience: Pushing the Frontiers of Post-Training Datasets for Science Reasoning

Add code
Jul 22, 2025
Viaarxiv icon

MegaMath: Pushing the Limits of Open Math Corpora

Add code
Apr 03, 2025
Viaarxiv icon

Towards Explainable Multimodal Depression Recognition for Clinical Interviews

Add code
Jan 27, 2025
Figure 1 for Towards Explainable Multimodal Depression Recognition for Clinical Interviews
Figure 2 for Towards Explainable Multimodal Depression Recognition for Clinical Interviews
Figure 3 for Towards Explainable Multimodal Depression Recognition for Clinical Interviews
Figure 4 for Towards Explainable Multimodal Depression Recognition for Clinical Interviews
Viaarxiv icon

Programming Every Example: Lifting Pre-training Data Quality like Experts at Scale

Add code
Sep 25, 2024
Figure 1 for Programming Every Example: Lifting Pre-training Data Quality like Experts at Scale
Figure 2 for Programming Every Example: Lifting Pre-training Data Quality like Experts at Scale
Figure 3 for Programming Every Example: Lifting Pre-training Data Quality like Experts at Scale
Figure 4 for Programming Every Example: Lifting Pre-training Data Quality like Experts at Scale
Viaarxiv icon

Data Contamination Report from the 2024 CONDA Shared Task

Add code
Jul 31, 2024
Figure 1 for Data Contamination Report from the 2024 CONDA Shared Task
Figure 2 for Data Contamination Report from the 2024 CONDA Shared Task
Figure 3 for Data Contamination Report from the 2024 CONDA Shared Task
Figure 4 for Data Contamination Report from the 2024 CONDA Shared Task
Viaarxiv icon

OlympicArena Medal Ranks: Who Is the Most Intelligent AI So Far?

Add code
Jun 26, 2024
Figure 1 for OlympicArena Medal Ranks: Who Is the Most Intelligent AI So Far?
Figure 2 for OlympicArena Medal Ranks: Who Is the Most Intelligent AI So Far?
Figure 3 for OlympicArena Medal Ranks: Who Is the Most Intelligent AI So Far?
Figure 4 for OlympicArena Medal Ranks: Who Is the Most Intelligent AI So Far?
Viaarxiv icon

OlympicArena: Benchmarking Multi-discipline Cognitive Reasoning for Superintelligent AI

Add code
Jun 18, 2024
Figure 1 for OlympicArena: Benchmarking Multi-discipline Cognitive Reasoning for Superintelligent AI
Figure 2 for OlympicArena: Benchmarking Multi-discipline Cognitive Reasoning for Superintelligent AI
Figure 3 for OlympicArena: Benchmarking Multi-discipline Cognitive Reasoning for Superintelligent AI
Figure 4 for OlympicArena: Benchmarking Multi-discipline Cognitive Reasoning for Superintelligent AI
Viaarxiv icon

Benchmarking Benchmark Leakage in Large Language Models

Add code
Apr 29, 2024
Figure 1 for Benchmarking Benchmark Leakage in Large Language Models
Figure 2 for Benchmarking Benchmark Leakage in Large Language Models
Figure 3 for Benchmarking Benchmark Leakage in Large Language Models
Figure 4 for Benchmarking Benchmark Leakage in Large Language Models
Viaarxiv icon

Generative AI for Math: Part I -- MathPile: A Billion-Token-Scale Pretraining Corpus for Math

Add code
Dec 28, 2023
Figure 1 for Generative AI for Math: Part I -- MathPile: A Billion-Token-Scale Pretraining Corpus for Math
Figure 2 for Generative AI for Math: Part I -- MathPile: A Billion-Token-Scale Pretraining Corpus for Math
Figure 3 for Generative AI for Math: Part I -- MathPile: A Billion-Token-Scale Pretraining Corpus for Math
Figure 4 for Generative AI for Math: Part I -- MathPile: A Billion-Token-Scale Pretraining Corpus for Math
Viaarxiv icon

Ask Again, Then Fail: Large Language Models' Vacillations in Judgement

Add code
Oct 03, 2023
Figure 1 for Ask Again, Then Fail: Large Language Models' Vacillations in Judgement
Figure 2 for Ask Again, Then Fail: Large Language Models' Vacillations in Judgement
Figure 3 for Ask Again, Then Fail: Large Language Models' Vacillations in Judgement
Figure 4 for Ask Again, Then Fail: Large Language Models' Vacillations in Judgement
Viaarxiv icon