Picture for Zach Moshe

Zach Moshe

Google Research, Tel-Aviv, Israel

NVIDIA Nemotron 3: Efficient and Open Intelligence

Add code
Dec 24, 2025
Viaarxiv icon

Nemotron 3 Nano: Open, Efficient Mixture-of-Experts Hybrid Mamba-Transformer Model for Agentic Reasoning

Add code
Dec 23, 2025
Viaarxiv icon

Llama-Nemotron: Efficient Reasoning Models

Add code
May 02, 2025
Figure 1 for Llama-Nemotron: Efficient Reasoning Models
Figure 2 for Llama-Nemotron: Efficient Reasoning Models
Figure 3 for Llama-Nemotron: Efficient Reasoning Models
Figure 4 for Llama-Nemotron: Efficient Reasoning Models
Viaarxiv icon

FFN Fusion: Rethinking Sequential Computation in Large Language Models

Add code
Mar 24, 2025
Viaarxiv icon

Puzzle: Distillation-Based NAS for Inference-Optimized LLMs

Add code
Dec 03, 2024
Figure 1 for Puzzle: Distillation-Based NAS for Inference-Optimized LLMs
Figure 2 for Puzzle: Distillation-Based NAS for Inference-Optimized LLMs
Figure 3 for Puzzle: Distillation-Based NAS for Inference-Optimized LLMs
Figure 4 for Puzzle: Distillation-Based NAS for Inference-Optimized LLMs
Viaarxiv icon

Flood forecasting with machine learning models in an operational framework

Add code
Nov 04, 2021
Figure 1 for Flood forecasting with machine learning models in an operational framework
Figure 2 for Flood forecasting with machine learning models in an operational framework
Figure 3 for Flood forecasting with machine learning models in an operational framework
Figure 4 for Flood forecasting with machine learning models in an operational framework
Viaarxiv icon

HydroNets: Leveraging River Structure for Hydrologic Modeling

Add code
Jul 01, 2020
Figure 1 for HydroNets: Leveraging River Structure for Hydrologic Modeling
Figure 2 for HydroNets: Leveraging River Structure for Hydrologic Modeling
Figure 3 for HydroNets: Leveraging River Structure for Hydrologic Modeling
Figure 4 for HydroNets: Leveraging River Structure for Hydrologic Modeling
Viaarxiv icon

ML for Flood Forecasting at Scale

Add code
Jan 28, 2019
Viaarxiv icon

Towards Global Remote Discharge Estimation: Using the Few to Estimate The Many

Add code
Jan 03, 2019
Figure 1 for Towards Global Remote Discharge Estimation: Using the Few to Estimate The Many
Viaarxiv icon