
Jaesung Huh

TIM: A Time Interval Machine for Audio-Visual Action Recognition

Apr 09, 2024

Look, Listen and Recognise: Character-Aware Audio-Visual Subtitling

Jan 22, 2024

OxfordVGG Submission to the EGO4D AV Transcription Challenge

Jul 18, 2023

VoxSRC 2022: The Fourth VoxCeleb Speaker Recognition Challenge

Mar 06, 2023

WhisperX: Time-Accurate Speech Transcription of Long-Form Audio

Mar 01, 2023

Epic-Sounds: A Large-scale Dataset of Actions That Sound

Feb 01, 2023

In search of strong embedding extractors for speaker diarisation

Oct 26, 2022

VoxSRC 2021: The Third VoxCeleb Speaker Recognition Challenge

Jan 12, 2022

With a Little Help from my Temporal Context: Multimodal Egocentric Action Recognition

Nov 01, 2021

VoxSRC 2020: The Second VoxCeleb Speaker Recognition Challenge

Dec 12, 2020