Picture for Furu Wei

Furu Wei

PEACE: Empowering Geologic Map Holistic Understanding with MLLMs

Add code
Jan 10, 2025
Figure 1 for PEACE: Empowering Geologic Map Holistic Understanding with MLLMs
Figure 2 for PEACE: Empowering Geologic Map Holistic Understanding with MLLMs
Figure 3 for PEACE: Empowering Geologic Map Holistic Understanding with MLLMs
Figure 4 for PEACE: Empowering Geologic Map Holistic Understanding with MLLMs
Viaarxiv icon

GeAR: Generation Augmented Retrieval

Add code
Jan 06, 2025
Figure 1 for GeAR: Generation Augmented Retrieval
Figure 2 for GeAR: Generation Augmented Retrieval
Figure 3 for GeAR: Generation Augmented Retrieval
Figure 4 for GeAR: Generation Augmented Retrieval
Viaarxiv icon

Bootstrap Your Own Context Length

Add code
Dec 25, 2024
Viaarxiv icon

MMLU-CF: A Contamination-free Multi-task Language Understanding Benchmark

Add code
Dec 19, 2024
Viaarxiv icon

Context-DPO: Aligning Language Models for Context-Faithfulness

Add code
Dec 18, 2024
Figure 1 for Context-DPO: Aligning Language Models for Context-Faithfulness
Figure 2 for Context-DPO: Aligning Language Models for Context-Faithfulness
Figure 3 for Context-DPO: Aligning Language Models for Context-Faithfulness
Figure 4 for Context-DPO: Aligning Language Models for Context-Faithfulness
Viaarxiv icon

Multimodal Latent Language Modeling with Next-Token Diffusion

Add code
Dec 11, 2024
Figure 1 for Multimodal Latent Language Modeling with Next-Token Diffusion
Figure 2 for Multimodal Latent Language Modeling with Next-Token Diffusion
Figure 3 for Multimodal Latent Language Modeling with Next-Token Diffusion
Figure 4 for Multimodal Latent Language Modeling with Next-Token Diffusion
Viaarxiv icon

RedStone: Curating General, Code, Math, and QA Data for Large Language Models

Add code
Dec 04, 2024
Figure 1 for RedStone: Curating General, Code, Math, and QA Data for Large Language Models
Figure 2 for RedStone: Curating General, Code, Math, and QA Data for Large Language Models
Figure 3 for RedStone: Curating General, Code, Math, and QA Data for Large Language Models
Figure 4 for RedStone: Curating General, Code, Math, and QA Data for Large Language Models
Viaarxiv icon

MH-MoE: Multi-Head Mixture-of-Experts

Add code
Nov 26, 2024
Figure 1 for MH-MoE: Multi-Head Mixture-of-Experts
Figure 2 for MH-MoE: Multi-Head Mixture-of-Experts
Figure 3 for MH-MoE: Multi-Head Mixture-of-Experts
Figure 4 for MH-MoE: Multi-Head Mixture-of-Experts
Viaarxiv icon

Preference Optimization for Reasoning with Pseudo Feedback

Add code
Nov 25, 2024
Figure 1 for Preference Optimization for Reasoning with Pseudo Feedback
Figure 2 for Preference Optimization for Reasoning with Pseudo Feedback
Figure 3 for Preference Optimization for Reasoning with Pseudo Feedback
Figure 4 for Preference Optimization for Reasoning with Pseudo Feedback
Viaarxiv icon

BitNet a4.8: 4-bit Activations for 1-bit LLMs

Add code
Nov 07, 2024
Figure 1 for BitNet a4.8: 4-bit Activations for 1-bit LLMs
Figure 2 for BitNet a4.8: 4-bit Activations for 1-bit LLMs
Figure 3 for BitNet a4.8: 4-bit Activations for 1-bit LLMs
Figure 4 for BitNet a4.8: 4-bit Activations for 1-bit LLMs
Viaarxiv icon