Picture for Dinesh Manocha

Dinesh Manocha

EH-MAM: Easy-to-Hard Masked Acoustic Modeling for Self-Supervised Speech Representation Learning

Add code
Oct 17, 2024
Figure 1 for EH-MAM: Easy-to-Hard Masked Acoustic Modeling for Self-Supervised Speech Representation Learning
Figure 2 for EH-MAM: Easy-to-Hard Masked Acoustic Modeling for Self-Supervised Speech Representation Learning
Figure 3 for EH-MAM: Easy-to-Hard Masked Acoustic Modeling for Self-Supervised Speech Representation Learning
Figure 4 for EH-MAM: Easy-to-Hard Masked Acoustic Modeling for Self-Supervised Speech Representation Learning
Viaarxiv icon

Failing Forward: Improving Generative Error Correction for ASR with Synthetic Data and Retrieval Augmentation

Add code
Oct 17, 2024
Figure 1 for Failing Forward: Improving Generative Error Correction for ASR with Synthetic Data and Retrieval Augmentation
Figure 2 for Failing Forward: Improving Generative Error Correction for ASR with Synthetic Data and Retrieval Augmentation
Figure 3 for Failing Forward: Improving Generative Error Correction for ASR with Synthetic Data and Retrieval Augmentation
Figure 4 for Failing Forward: Improving Generative Error Correction for ASR with Synthetic Data and Retrieval Augmentation
Viaarxiv icon

ET-Former: Efficient Triplane Deformable Attention for 3D Semantic Scene Completion From Monocular Camera

Add code
Oct 14, 2024
Figure 1 for ET-Former: Efficient Triplane Deformable Attention for 3D Semantic Scene Completion From Monocular Camera
Figure 2 for ET-Former: Efficient Triplane Deformable Attention for 3D Semantic Scene Completion From Monocular Camera
Figure 3 for ET-Former: Efficient Triplane Deformable Attention for 3D Semantic Scene Completion From Monocular Camera
Figure 4 for ET-Former: Efficient Triplane Deformable Attention for 3D Semantic Scene Completion From Monocular Camera
Viaarxiv icon

MeshGS: Adaptive Mesh-Aligned Gaussian Splatting for High-Quality Rendering

Add code
Oct 11, 2024
Viaarxiv icon

Mode-GS: Monocular Depth Guided Anchored 3D Gaussian Splatting for Robust Ground-View Scene Rendering

Add code
Oct 06, 2024
Viaarxiv icon

AIME: AI System Optimization via Multiple LLM Evaluators

Add code
Oct 04, 2024
Viaarxiv icon

Synthio: Augmenting Small-Scale Audio Classification Datasets with Synthetic Data

Add code
Oct 02, 2024
Figure 1 for Synthio: Augmenting Small-Scale Audio Classification Datasets with Synthetic Data
Figure 2 for Synthio: Augmenting Small-Scale Audio Classification Datasets with Synthetic Data
Figure 3 for Synthio: Augmenting Small-Scale Audio Classification Datasets with Synthetic Data
Figure 4 for Synthio: Augmenting Small-Scale Audio Classification Datasets with Synthetic Data
Viaarxiv icon

Robot Navigation Using Physically Grounded Vision-Language Models in Outdoor Environments

Add code
Sep 30, 2024
Figure 1 for Robot Navigation Using Physically Grounded Vision-Language Models in Outdoor Environments
Figure 2 for Robot Navigation Using Physically Grounded Vision-Language Models in Outdoor Environments
Figure 3 for Robot Navigation Using Physically Grounded Vision-Language Models in Outdoor Environments
Figure 4 for Robot Navigation Using Physically Grounded Vision-Language Models in Outdoor Environments
Viaarxiv icon

SOAR: Self-supervision Optimized UAV Action Recognition with Efficient Object-Aware Pretraining

Add code
Sep 26, 2024
Figure 1 for SOAR: Self-supervision Optimized UAV Action Recognition with Efficient Object-Aware Pretraining
Figure 2 for SOAR: Self-supervision Optimized UAV Action Recognition with Efficient Object-Aware Pretraining
Figure 3 for SOAR: Self-supervision Optimized UAV Action Recognition with Efficient Object-Aware Pretraining
Figure 4 for SOAR: Self-supervision Optimized UAV Action Recognition with Efficient Object-Aware Pretraining
Viaarxiv icon

CROSS-GAiT: Cross-Attention-Based Multimodal Representation Fusion for Parametric Gait Adaptation in Complex Terrains

Add code
Sep 25, 2024
Figure 1 for CROSS-GAiT: Cross-Attention-Based Multimodal Representation Fusion for Parametric Gait Adaptation in Complex Terrains
Figure 2 for CROSS-GAiT: Cross-Attention-Based Multimodal Representation Fusion for Parametric Gait Adaptation in Complex Terrains
Figure 3 for CROSS-GAiT: Cross-Attention-Based Multimodal Representation Fusion for Parametric Gait Adaptation in Complex Terrains
Figure 4 for CROSS-GAiT: Cross-Attention-Based Multimodal Representation Fusion for Parametric Gait Adaptation in Complex Terrains
Viaarxiv icon