Picture for Aram Davtyan

Aram Davtyan

Faster Inference of Flow-Based Generative Models via Improved Data-Noise Coupling

Add code
Mar 16, 2026
Viaarxiv icon

Communication-Inspired Tokenization for Structured Image Representations

Add code
Feb 24, 2026
Viaarxiv icon

From Generation to Generalization: Emergent Few-Shot Learning in Video Diffusion Models

Add code
Jun 08, 2025
Viaarxiv icon

KOALA++: Efficient Kalman-Based Optimization of Neural Networks with Gradient-Covariance Products

Add code
Jun 04, 2025
Figure 1 for KOALA++: Efficient Kalman-Based Optimization of Neural Networks with Gradient-Covariance Products
Figure 2 for KOALA++: Efficient Kalman-Based Optimization of Neural Networks with Gradient-Covariance Products
Figure 3 for KOALA++: Efficient Kalman-Based Optimization of Neural Networks with Gradient-Covariance Products
Figure 4 for KOALA++: Efficient Kalman-Based Optimization of Neural Networks with Gradient-Covariance Products
Viaarxiv icon

Can AI Agents Design and Implement Drug Discovery Pipelines?

Add code
Apr 28, 2025
Figure 1 for Can AI Agents Design and Implement Drug Discovery Pipelines?
Figure 2 for Can AI Agents Design and Implement Drug Discovery Pipelines?
Figure 3 for Can AI Agents Design and Implement Drug Discovery Pipelines?
Figure 4 for Can AI Agents Design and Implement Drug Discovery Pipelines?
Viaarxiv icon

GEM: A Generalizable Ego-Vision Multimodal World Model for Fine-Grained Ego-Motion, Object Dynamics, and Scene Composition Control

Add code
Dec 15, 2024
Figure 1 for GEM: A Generalizable Ego-Vision Multimodal World Model for Fine-Grained Ego-Motion, Object Dynamics, and Scene Composition Control
Figure 2 for GEM: A Generalizable Ego-Vision Multimodal World Model for Fine-Grained Ego-Motion, Object Dynamics, and Scene Composition Control
Figure 3 for GEM: A Generalizable Ego-Vision Multimodal World Model for Fine-Grained Ego-Motion, Object Dynamics, and Scene Composition Control
Figure 4 for GEM: A Generalizable Ego-Vision Multimodal World Model for Fine-Grained Ego-Motion, Object Dynamics, and Scene Composition Control
Viaarxiv icon

Enabling Visual Composition and Animation in Unsupervised Video Generation

Add code
Mar 21, 2024
Figure 1 for Enabling Visual Composition and Animation in Unsupervised Video Generation
Figure 2 for Enabling Visual Composition and Animation in Unsupervised Video Generation
Figure 3 for Enabling Visual Composition and Animation in Unsupervised Video Generation
Figure 4 for Enabling Visual Composition and Animation in Unsupervised Video Generation
Viaarxiv icon

Multi-View Unsupervised Image Generation with Cross Attention Guidance

Add code
Dec 07, 2023
Figure 1 for Multi-View Unsupervised Image Generation with Cross Attention Guidance
Figure 2 for Multi-View Unsupervised Image Generation with Cross Attention Guidance
Figure 3 for Multi-View Unsupervised Image Generation with Cross Attention Guidance
Figure 4 for Multi-View Unsupervised Image Generation with Cross Attention Guidance
Viaarxiv icon

Learn the Force We Can: Multi-Object Video Generation from Pixel-Level Interactions

Add code
Jun 06, 2023
Figure 1 for Learn the Force We Can: Multi-Object Video Generation from Pixel-Level Interactions
Figure 2 for Learn the Force We Can: Multi-Object Video Generation from Pixel-Level Interactions
Figure 3 for Learn the Force We Can: Multi-Object Video Generation from Pixel-Level Interactions
Figure 4 for Learn the Force We Can: Multi-Object Video Generation from Pixel-Level Interactions
Viaarxiv icon

Randomized Conditional Flow Matching for Video Prediction

Add code
Nov 26, 2022
Figure 1 for Randomized Conditional Flow Matching for Video Prediction
Figure 2 for Randomized Conditional Flow Matching for Video Prediction
Figure 3 for Randomized Conditional Flow Matching for Video Prediction
Figure 4 for Randomized Conditional Flow Matching for Video Prediction
Viaarxiv icon