Picture for Taifeng Wang

Taifeng Wang

TiKMiX: Take Data Influence into Dynamic Mixture for Language Model Pre-training

Add code
Aug 25, 2025
Viaarxiv icon

MuRating: A High Quality Data Selecting Approach to Multilingual Large Language Model Pretraining

Add code
Jul 02, 2025
Viaarxiv icon

MuBench: Assessment of Multilingual Capabilities of Large Language Models Across 61 Languages

Add code
Jun 24, 2025
Viaarxiv icon

MoORE: SVD-based Model MoE-ization for Conflict- and Oblivion-Resistant Multi-Task Adaptation

Add code
Jun 17, 2025
Viaarxiv icon

QuaDMix: Quality-Diversity Balanced Data Selection for Efficient LLM Pretraining

Add code
Apr 23, 2025
Figure 1 for QuaDMix: Quality-Diversity Balanced Data Selection for Efficient LLM Pretraining
Figure 2 for QuaDMix: Quality-Diversity Balanced Data Selection for Efficient LLM Pretraining
Figure 3 for QuaDMix: Quality-Diversity Balanced Data Selection for Efficient LLM Pretraining
Figure 4 for QuaDMix: Quality-Diversity Balanced Data Selection for Efficient LLM Pretraining
Viaarxiv icon

xTrimoGene: An Efficient and Scalable Representation Learner for Single-Cell RNA-Seq Data

Add code
Nov 26, 2023
Viaarxiv icon

LogicMP: A Neuro-symbolic Approach for Encoding First-order Logic Constraints

Add code
Sep 29, 2023
Viaarxiv icon

Drug Synergistic Combinations Predictions via Large-Scale Pre-Training and Graph Structure Learning

Add code
Jan 14, 2023
Figure 1 for Drug Synergistic Combinations Predictions via Large-Scale Pre-Training and Graph Structure Learning
Figure 2 for Drug Synergistic Combinations Predictions via Large-Scale Pre-Training and Graph Structure Learning
Figure 3 for Drug Synergistic Combinations Predictions via Large-Scale Pre-Training and Graph Structure Learning
Figure 4 for Drug Synergistic Combinations Predictions via Large-Scale Pre-Training and Graph Structure Learning
Viaarxiv icon

Keywords and Instances: A Hierarchical Contrastive Learning Framework Unifying Hybrid Granularities for Text Generation

Add code
May 26, 2022
Figure 1 for Keywords and Instances: A Hierarchical Contrastive Learning Framework Unifying Hybrid Granularities for Text Generation
Figure 2 for Keywords and Instances: A Hierarchical Contrastive Learning Framework Unifying Hybrid Granularities for Text Generation
Figure 3 for Keywords and Instances: A Hierarchical Contrastive Learning Framework Unifying Hybrid Granularities for Text Generation
Figure 4 for Keywords and Instances: A Hierarchical Contrastive Learning Framework Unifying Hybrid Granularities for Text Generation
Viaarxiv icon

PIE: a Parameter and Inference Efficient Solution for Large Scale Knowledge Graph Embedding Reasoning

Add code
May 05, 2022
Figure 1 for PIE: a Parameter and Inference Efficient Solution for Large Scale Knowledge Graph Embedding Reasoning
Figure 2 for PIE: a Parameter and Inference Efficient Solution for Large Scale Knowledge Graph Embedding Reasoning
Figure 3 for PIE: a Parameter and Inference Efficient Solution for Large Scale Knowledge Graph Embedding Reasoning
Figure 4 for PIE: a Parameter and Inference Efficient Solution for Large Scale Knowledge Graph Embedding Reasoning
Viaarxiv icon