Alert button
Picture for Tim K. Marks

Tim K. Marks

Alert button

Steered Diffusion: A Generalized Framework for Plug-and-Play Conditional Image Synthesis

Sep 30, 2023
Nithin Gopalakrishnan Nair, Anoop Cherian, Suhas Lohit, Ye Wang, Toshiaki Koike-Akino, Vishal M. Patel, Tim K. Marks

Figure 1 for Steered Diffusion: A Generalized Framework for Plug-and-Play Conditional Image Synthesis
Figure 2 for Steered Diffusion: A Generalized Framework for Plug-and-Play Conditional Image Synthesis
Figure 3 for Steered Diffusion: A Generalized Framework for Plug-and-Play Conditional Image Synthesis
Figure 4 for Steered Diffusion: A Generalized Framework for Plug-and-Play Conditional Image Synthesis
Viaarxiv icon

H-SAUR: Hypothesize, Simulate, Act, Update, and Repeat for Understanding Object Articulations from Interactions

Oct 22, 2022
Kei Ota, Hsiao-Yu Tung, Kevin A. Smith, Anoop Cherian, Tim K. Marks, Alan Sullivan, Asako Kanezaki, Joshua B. Tenenbaum

Figure 1 for H-SAUR: Hypothesize, Simulate, Act, Update, and Repeat for Understanding Object Articulations from Interactions
Figure 2 for H-SAUR: Hypothesize, Simulate, Act, Update, and Repeat for Understanding Object Articulations from Interactions
Figure 3 for H-SAUR: Hypothesize, Simulate, Act, Update, and Repeat for Understanding Object Articulations from Interactions
Figure 4 for H-SAUR: Hypothesize, Simulate, Act, Update, and Repeat for Understanding Object Articulations from Interactions
Viaarxiv icon

(2.5+1)D Spatio-Temporal Scene Graphs for Video Question Answering

Feb 18, 2022
Anoop Cherian, Chiori Hori, Tim K. Marks, Jonathan Le Roux

Figure 1 for (2.5+1)D Spatio-Temporal Scene Graphs for Video Question Answering
Figure 2 for (2.5+1)D Spatio-Temporal Scene Graphs for Video Question Answering
Figure 3 for (2.5+1)D Spatio-Temporal Scene Graphs for Video Question Answering
Figure 4 for (2.5+1)D Spatio-Temporal Scene Graphs for Video Question Answering
Viaarxiv icon

MOST-GAN: 3D Morphable StyleGAN for Disentangled Face Image Manipulation

Nov 01, 2021
Safa C. Medin, Bernhard Egger, Anoop Cherian, Ye Wang, Joshua B. Tenenbaum, Xiaoming Liu, Tim K. Marks

Figure 1 for MOST-GAN: 3D Morphable StyleGAN for Disentangled Face Image Manipulation
Figure 2 for MOST-GAN: 3D Morphable StyleGAN for Disentangled Face Image Manipulation
Figure 3 for MOST-GAN: 3D Morphable StyleGAN for Disentangled Face Image Manipulation
Figure 4 for MOST-GAN: 3D Morphable StyleGAN for Disentangled Face Image Manipulation
Viaarxiv icon

Audio-Visual Scene-Aware Dialog and Reasoning using Audio-Visual Transformers with Joint Student-Teacher Learning

Oct 13, 2021
Ankit P. Shah, Shijie Geng, Peng Gao, Anoop Cherian, Takaaki Hori, Tim K. Marks, Jonathan Le Roux, Chiori Hori

Figure 1 for Audio-Visual Scene-Aware Dialog and Reasoning using Audio-Visual Transformers with Joint Student-Teacher Learning
Figure 2 for Audio-Visual Scene-Aware Dialog and Reasoning using Audio-Visual Transformers with Joint Student-Teacher Learning
Figure 3 for Audio-Visual Scene-Aware Dialog and Reasoning using Audio-Visual Transformers with Joint Student-Teacher Learning
Figure 4 for Audio-Visual Scene-Aware Dialog and Reasoning using Audio-Visual Transformers with Joint Student-Teacher Learning
Viaarxiv icon

InSeGAN: A Generative Approach to Segmenting Identical Instances in Depth Images

Aug 31, 2021
Anoop Cherian, Goncalo Dias Pais, Siddarth Jain, Tim K. Marks, Alan Sullivan

Figure 1 for InSeGAN: A Generative Approach to Segmenting Identical Instances in Depth Images
Figure 2 for InSeGAN: A Generative Approach to Segmenting Identical Instances in Depth Images
Figure 3 for InSeGAN: A Generative Approach to Segmenting Identical Instances in Depth Images
Figure 4 for InSeGAN: A Generative Approach to Segmenting Identical Instances in Depth Images
Viaarxiv icon

LUVLi Face Alignment: Estimating Landmarks' Location, Uncertainty, and Visibility Likelihood

Apr 06, 2020
Abhinav Kumar, Tim K. Marks, Wenxuan Mou, Ye Wang, Michael Jones, Anoop Cherian, Toshiaki Koike-Akino, Xiaoming Liu, Chen Feng

Figure 1 for LUVLi Face Alignment: Estimating Landmarks' Location, Uncertainty, and Visibility Likelihood
Figure 2 for LUVLi Face Alignment: Estimating Landmarks' Location, Uncertainty, and Visibility Likelihood
Figure 3 for LUVLi Face Alignment: Estimating Landmarks' Location, Uncertainty, and Visibility Likelihood
Figure 4 for LUVLi Face Alignment: Estimating Landmarks' Location, Uncertainty, and Visibility Likelihood
Viaarxiv icon

Spatio-Temporal Ranked-Attention Networks for Video Captioning

Jan 17, 2020
Anoop Cherian, Jue Wang, Chiori Hori, Tim K. Marks

Figure 1 for Spatio-Temporal Ranked-Attention Networks for Video Captioning
Figure 2 for Spatio-Temporal Ranked-Attention Networks for Video Captioning
Figure 3 for Spatio-Temporal Ranked-Attention Networks for Video Captioning
Figure 4 for Spatio-Temporal Ranked-Attention Networks for Video Captioning
Viaarxiv icon

The Eighth Dialog System Technology Challenge

Nov 14, 2019
Seokhwan Kim, Michel Galley, Chulaka Gunasekara, Sungjin Lee, Adam Atkinson, Baolin Peng, Hannes Schulz, Jianfeng Gao, Jinchao Li, Mahmoud Adada, Minlie Huang, Luis Lastras, Jonathan K. Kummerfeld, Walter S. Lasecki, Chiori Hori, Anoop Cherian, Tim K. Marks, Abhinav Rastogi, Xiaoxue Zang, Srinivas Sunkara, Raghav Gupta

Figure 1 for The Eighth Dialog System Technology Challenge
Figure 2 for The Eighth Dialog System Technology Challenge
Figure 3 for The Eighth Dialog System Technology Challenge
Figure 4 for The Eighth Dialog System Technology Challenge
Viaarxiv icon

Audio-Visual Scene-Aware Dialog

Jan 25, 2019
Huda Alamri, Vincent Cartillier, Abhishek Das, Jue Wang, Stefan Lee, Peter Anderson, Irfan Essa, Devi Parikh, Dhruv Batra, Anoop Cherian, Tim K. Marks, Chiori Hori

Figure 1 for Audio-Visual Scene-Aware Dialog
Figure 2 for Audio-Visual Scene-Aware Dialog
Figure 3 for Audio-Visual Scene-Aware Dialog
Figure 4 for Audio-Visual Scene-Aware Dialog
Viaarxiv icon