Chengyu Dong

Nemotron-H: A Family of Accurate and Efficient Hybrid Mamba-Transformer Models

Apr 10, 2025
Via arXiv

Linear Correlation in LM's Compositional Generalization and Hallucination

Feb 06, 2025
Via arXiv

Next-Token Prediction Task Assumes Optimal Data Ordering for LLM Training in Proof Generation

Oct 30, 2024
Via arXiv

When is the consistent prediction likely to be a correct prediction?

Jul 08, 2024
Via arXiv

Text Grafting: Near-Distribution Weak Supervision for Minority Classes in Text Classification

Jun 17, 2024
Via arXiv

Evaluating the Smooth Control of Attribute Intensity in Text Generation with LLMs

Jun 06, 2024
Via arXiv

Physics-Informed Data Denoising for Real-Life Sensing Systems

Nov 12, 2023
Via arXiv

Fast-ELECTRA for Efficient Pre-training

Oct 11, 2023
Via arXiv

Robust and Interpretable Medical Image Classifiers via Concept Bottleneck Models

Oct 04, 2023
Via arXiv

Learning Concise and Descriptive Attributes for Visual Recognition

Aug 07, 2023
Via arXiv