Picture for Yubo Gao

Yubo Gao

NVIDIA Nemotron 3: Efficient and Open Intelligence

Add code
Dec 24, 2025
Viaarxiv icon

Nemotron 3 Nano: Open, Efficient Mixture-of-Experts Hybrid Mamba-Transformer Model for Agentic Reasoning

Add code
Dec 23, 2025
Viaarxiv icon

EffiReason-Bench: A Unified Benchmark for Evaluating and Advancing Efficient Reasoning in Large Language Models

Add code
Nov 13, 2025
Viaarxiv icon

DPQuant: Efficient and Differentially-Private Model Training via Dynamic Quantization Scheduling

Add code
Sep 03, 2025
Viaarxiv icon

Unveiling Instruction-Specific Neurons & Experts: An Analytical Framework for LLM's Instruction-Following Capabilities

Add code
May 27, 2025
Viaarxiv icon

Do BERT-Like Bidirectional Models Still Perform Better on Text Classification in the Era of LLMs?

Add code
May 23, 2025
Viaarxiv icon

PhysicsArena: The First Multimodal Physics Reasoning Benchmark Exploring Variable, Process, and Solution Dimensions

Add code
May 21, 2025
Viaarxiv icon

CoheMark: A Novel Sentence-Level Watermark for Enhanced Text Quality

Add code
Apr 24, 2025
Viaarxiv icon

APPL: A Prompt Programming Language for Harmonious Integration of Programs and Large Language Model Prompts

Add code
Jun 19, 2024
Viaarxiv icon

Proteus: Preserving Model Confidentiality during Graph Optimizations

Add code
Apr 18, 2024
Figure 1 for Proteus: Preserving Model Confidentiality during Graph Optimizations
Figure 2 for Proteus: Preserving Model Confidentiality during Graph Optimizations
Figure 3 for Proteus: Preserving Model Confidentiality during Graph Optimizations
Figure 4 for Proteus: Preserving Model Confidentiality during Graph Optimizations
Viaarxiv icon