Sangho Lee

Unified-IO 2: Scaling Autoregressive Multimodal Models with Vision, Language, Audio, and Action

Dec 28, 2023
Via arXiv

Integrated Path Tracking with DYC and MPC using LSTM Based Tire Force Estimator for Four-wheel Independent Steering and Driving Vehicle

Dec 13, 2023
Via arXiv

Can Language Models Laugh at YouTube Short-form Videos?

Oct 26, 2023
Via arXiv

X-CANIDS: Signal-Aware Explainable Intrusion Detection System for Controller Area Network-Based In-Vehicle Network

Mar 22, 2023
Via arXiv

Boundary-aware Self-supervised Learning for Video Scene Segmentation

Jan 14, 2022
Via arXiv

Unsupervised Representation Learning via Neural Activation Coding

Dec 07, 2021
Via arXiv

Automatic Curation of Large-Scale Datasets for Audio-Visual Representation Learning

Jan 26, 2021
Via arXiv

Parameter Efficient Multimodal Transformers for Video Representation Learning

Dec 08, 2020
Via arXiv

A Memory Network Approach for Story-based Temporal Summarization of 360° Videos

Jun 18, 2018
Via arXiv

A Read-Write Memory Network for Movie Story Understanding

Mar 16, 2018
Via arXiv