Picture for Zhen Guo

Zhen Guo

Department of Electrical Engineering and Computer Science, Massachusetts Institute of Technology, Cambridge, Massachusetts, 02139, USA

MICo-150K: A Comprehensive Dataset Advancing Multi-Image Composition

Add code
Dec 08, 2025
Viaarxiv icon

$π_\texttt{RL}$: Online RL Fine-tuning for Flow-based Vision-Language-Action Models

Add code
Oct 29, 2025
Figure 1 for $π_\texttt{RL}$: Online RL Fine-tuning for Flow-based Vision-Language-Action Models
Figure 2 for $π_\texttt{RL}$: Online RL Fine-tuning for Flow-based Vision-Language-Action Models
Figure 3 for $π_\texttt{RL}$: Online RL Fine-tuning for Flow-based Vision-Language-Action Models
Figure 4 for $π_\texttt{RL}$: Online RL Fine-tuning for Flow-based Vision-Language-Action Models
Viaarxiv icon

RLinf: Flexible and Efficient Large-scale Reinforcement Learning via Macro-to-Micro Flow Transformation

Add code
Sep 19, 2025
Figure 1 for RLinf: Flexible and Efficient Large-scale Reinforcement Learning via Macro-to-Micro Flow Transformation
Figure 2 for RLinf: Flexible and Efficient Large-scale Reinforcement Learning via Macro-to-Micro Flow Transformation
Figure 3 for RLinf: Flexible and Efficient Large-scale Reinforcement Learning via Macro-to-Micro Flow Transformation
Figure 4 for RLinf: Flexible and Efficient Large-scale Reinforcement Learning via Macro-to-Micro Flow Transformation
Viaarxiv icon

Persistent Backdoor Attacks in Continual Learning

Add code
Sep 20, 2024
Figure 1 for Persistent Backdoor Attacks in Continual Learning
Figure 2 for Persistent Backdoor Attacks in Continual Learning
Figure 3 for Persistent Backdoor Attacks in Continual Learning
Figure 4 for Persistent Backdoor Attacks in Continual Learning
Viaarxiv icon

Scaling Law Hypothesis for Multimodal Model

Add code
Sep 10, 2024
Viaarxiv icon

Unveiling the Unseen: Exploring Whitebox Membership Inference through the Lens of Explainability

Add code
Jul 01, 2024
Figure 1 for Unveiling the Unseen: Exploring Whitebox Membership Inference through the Lens of Explainability
Figure 2 for Unveiling the Unseen: Exploring Whitebox Membership Inference through the Lens of Explainability
Figure 3 for Unveiling the Unseen: Exploring Whitebox Membership Inference through the Lens of Explainability
Figure 4 for Unveiling the Unseen: Exploring Whitebox Membership Inference through the Lens of Explainability
Viaarxiv icon

Octo-planner: On-device Language Model for Planner-Action Agents

Add code
Jun 26, 2024
Figure 1 for Octo-planner: On-device Language Model for Planner-Action Agents
Figure 2 for Octo-planner: On-device Language Model for Planner-Action Agents
Figure 3 for Octo-planner: On-device Language Model for Planner-Action Agents
Figure 4 for Octo-planner: On-device Language Model for Planner-Action Agents
Viaarxiv icon

More Compute Is What You Need

Add code
May 02, 2024
Figure 1 for More Compute Is What You Need
Viaarxiv icon

JetMoE: Reaching Llama2 Performance with 0.1M Dollars

Add code
Apr 11, 2024
Figure 1 for JetMoE: Reaching Llama2 Performance with 0.1M Dollars
Figure 2 for JetMoE: Reaching Llama2 Performance with 0.1M Dollars
Figure 3 for JetMoE: Reaching Llama2 Performance with 0.1M Dollars
Figure 4 for JetMoE: Reaching Llama2 Performance with 0.1M Dollars
Viaarxiv icon

Automated HER2 Scoring in Breast Cancer Images Using Deep Learning and Pyramid Sampling

Add code
Apr 01, 2024
Viaarxiv icon