Max Tian

Using Scaling Laws for Data Source Utility Estimation in Domain-Specific Pre-Training

Jul 29, 2025
Via arXiv

StarCoder 2 and The Stack v2: The Next Generation

Feb 29, 2024
Via arXiv

An Experimental Evaluation of Transformer-based Language Models in the Biomedical Domain

Dec 31, 2020
Via arXiv