Mohammad Nur Hossain Khan

RAVEN: Query-Guided Representation Alignment for Question Answering over Audio, Video, Embedded Sensors, and Natural Language

May 21, 2025
Via arXiv

QUADS: QUAntized Distillation Framework for Efficient Speech Language Understanding

May 19, 2025
Via arXiv

Sound Tagging in Infant-centric Home Soundscapes

Jun 25, 2024
Via arXiv

LLaSA: Large Multimodal Agent for Human Activity Analysis Through Wearable Sensors

Jun 20, 2024
Via arXiv