Picture for Boxin Wang

Boxin Wang

Nemotron-Cascade: Scaling Cascaded Reinforcement Learning for General-Purpose Reasoning Models

Add code
Dec 15, 2025
Viaarxiv icon

NVIDIA Nemotron Nano V2 VL

Add code
Nov 07, 2025
Viaarxiv icon

Nemotron-H: A Family of Accurate and Efficient Hybrid Mamba-Transformer Models

Add code
Apr 10, 2025
Figure 1 for Nemotron-H: A Family of Accurate and Efficient Hybrid Mamba-Transformer Models
Figure 2 for Nemotron-H: A Family of Accurate and Efficient Hybrid Mamba-Transformer Models
Figure 3 for Nemotron-H: A Family of Accurate and Efficient Hybrid Mamba-Transformer Models
Figure 4 for Nemotron-H: A Family of Accurate and Efficient Hybrid Mamba-Transformer Models
Viaarxiv icon

From 128K to 4M: Efficient Training of Ultra-Long Context Large Language Models

Add code
Apr 08, 2025
Viaarxiv icon

Cosmos-Reason1: From Physical Common Sense To Embodied Reasoning

Add code
Mar 18, 2025
Viaarxiv icon

NVLM: Open Frontier-Class Multimodal LLMs

Add code
Sep 17, 2024
Figure 1 for NVLM: Open Frontier-Class Multimodal LLMs
Figure 2 for NVLM: Open Frontier-Class Multimodal LLMs
Figure 3 for NVLM: Open Frontier-Class Multimodal LLMs
Figure 4 for NVLM: Open Frontier-Class Multimodal LLMs
Viaarxiv icon

RankRAG: Unifying Context Ranking with Retrieval-Augmented Generation in LLMs

Add code
Jul 02, 2024
Figure 1 for RankRAG: Unifying Context Ranking with Retrieval-Augmented Generation in LLMs
Figure 2 for RankRAG: Unifying Context Ranking with Retrieval-Augmented Generation in LLMs
Figure 3 for RankRAG: Unifying Context Ranking with Retrieval-Augmented Generation in LLMs
Figure 4 for RankRAG: Unifying Context Ranking with Retrieval-Augmented Generation in LLMs
Viaarxiv icon

InstructRetro: Instruction Tuning post Retrieval-Augmented Pretraining

Add code
Oct 11, 2023
Viaarxiv icon

DecodingTrust: A Comprehensive Assessment of Trustworthiness in GPT Models

Add code
Jun 20, 2023
Figure 1 for DecodingTrust: A Comprehensive Assessment of Trustworthiness in GPT Models
Figure 2 for DecodingTrust: A Comprehensive Assessment of Trustworthiness in GPT Models
Figure 3 for DecodingTrust: A Comprehensive Assessment of Trustworthiness in GPT Models
Figure 4 for DecodingTrust: A Comprehensive Assessment of Trustworthiness in GPT Models
Viaarxiv icon

Can Public Large Language Models Help Private Cross-device Federated Learning?

Add code
May 20, 2023
Figure 1 for Can Public Large Language Models Help Private Cross-device Federated Learning?
Figure 2 for Can Public Large Language Models Help Private Cross-device Federated Learning?
Figure 3 for Can Public Large Language Models Help Private Cross-device Federated Learning?
Figure 4 for Can Public Large Language Models Help Private Cross-device Federated Learning?
Viaarxiv icon