Picture for Sarthak Sharma

Sarthak Sharma

AVR: Synergizing Foundation Models for Audio-Visual Humor Detection

Add code
Jun 15, 2024
Figure 1 for AVR: Synergizing Foundation Models for Audio-Visual Humor Detection
Figure 2 for AVR: Synergizing Foundation Models for Audio-Visual Humor Detection
Figure 3 for AVR: Synergizing Foundation Models for Audio-Visual Humor Detection
Viaarxiv icon

UAP-BEV: Uncertainty Aware Planning using Bird's Eye View generated from Surround Monocular Images

Add code
Jun 08, 2023
Figure 1 for UAP-BEV: Uncertainty Aware Planning using Bird's Eye View generated from Surround Monocular Images
Figure 2 for UAP-BEV: Uncertainty Aware Planning using Bird's Eye View generated from Surround Monocular Images
Figure 3 for UAP-BEV: Uncertainty Aware Planning using Bird's Eye View generated from Surround Monocular Images
Figure 4 for UAP-BEV: Uncertainty Aware Planning using Bird's Eye View generated from Surround Monocular Images
Viaarxiv icon

FinderNet: A Data Augmentation Free Canonicalization aided Loop Detection and Closure technique for Point clouds in 6-DOF separation

Add code
Apr 03, 2023
Figure 1 for FinderNet: A Data Augmentation Free Canonicalization aided Loop Detection and Closure technique for Point clouds in 6-DOF separation
Figure 2 for FinderNet: A Data Augmentation Free Canonicalization aided Loop Detection and Closure technique for Point clouds in 6-DOF separation
Figure 3 for FinderNet: A Data Augmentation Free Canonicalization aided Loop Detection and Closure technique for Point clouds in 6-DOF separation
Figure 4 for FinderNet: A Data Augmentation Free Canonicalization aided Loop Detection and Closure technique for Point clouds in 6-DOF separation
Viaarxiv icon

Estimation of Appearance and Occupancy Information in Birds Eye View from Surround Monocular Images

Add code
Nov 08, 2022
Figure 1 for Estimation of Appearance and Occupancy Information in Birds Eye View from Surround Monocular Images
Figure 2 for Estimation of Appearance and Occupancy Information in Birds Eye View from Surround Monocular Images
Figure 3 for Estimation of Appearance and Occupancy Information in Birds Eye View from Surround Monocular Images
Figure 4 for Estimation of Appearance and Occupancy Information in Birds Eye View from Surround Monocular Images
Viaarxiv icon

Bridging Sim2Real Gap Using Image Gradients for the Task of End-to-End Autonomous Driving

Add code
May 16, 2022
Figure 1 for Bridging Sim2Real Gap Using Image Gradients for the Task of End-to-End Autonomous Driving
Figure 2 for Bridging Sim2Real Gap Using Image Gradients for the Task of End-to-End Autonomous Driving
Figure 3 for Bridging Sim2Real Gap Using Image Gradients for the Task of End-to-End Autonomous Driving
Viaarxiv icon

NMR: Neural Manifold Representation for Autonomous Driving

Add code
May 11, 2022
Figure 1 for NMR: Neural Manifold Representation for Autonomous Driving
Figure 2 for NMR: Neural Manifold Representation for Autonomous Driving
Figure 3 for NMR: Neural Manifold Representation for Autonomous Driving
Figure 4 for NMR: Neural Manifold Representation for Autonomous Driving
Viaarxiv icon

Deep Implicit Surface Point Prediction Networks

Add code
Jun 15, 2021
Figure 1 for Deep Implicit Surface Point Prediction Networks
Figure 2 for Deep Implicit Surface Point Prediction Networks
Figure 3 for Deep Implicit Surface Point Prediction Networks
Figure 4 for Deep Implicit Surface Point Prediction Networks
Viaarxiv icon

DUDE: Deep Unsigned Distance Embeddings for Hi-Fidelity Representation of Complex 3D Surfaces

Add code
Nov 04, 2020
Figure 1 for DUDE: Deep Unsigned Distance Embeddings for Hi-Fidelity Representation of Complex 3D Surfaces
Figure 2 for DUDE: Deep Unsigned Distance Embeddings for Hi-Fidelity Representation of Complex 3D Surfaces
Figure 3 for DUDE: Deep Unsigned Distance Embeddings for Hi-Fidelity Representation of Complex 3D Surfaces
Figure 4 for DUDE: Deep Unsigned Distance Embeddings for Hi-Fidelity Representation of Complex 3D Surfaces
Viaarxiv icon

INFER: INtermediate representations for FuturE pRediction

Add code
Mar 26, 2019
Figure 1 for INFER: INtermediate representations for FuturE pRediction
Figure 2 for INFER: INtermediate representations for FuturE pRediction
Figure 3 for INFER: INtermediate representations for FuturE pRediction
Figure 4 for INFER: INtermediate representations for FuturE pRediction
Viaarxiv icon

Beyond Pixels: Leveraging Geometry and Shape Cues for Online Multi-Object Tracking

Add code
Jul 27, 2018
Figure 1 for Beyond Pixels: Leveraging Geometry and Shape Cues for Online Multi-Object Tracking
Figure 2 for Beyond Pixels: Leveraging Geometry and Shape Cues for Online Multi-Object Tracking
Figure 3 for Beyond Pixels: Leveraging Geometry and Shape Cues for Online Multi-Object Tracking
Figure 4 for Beyond Pixels: Leveraging Geometry and Shape Cues for Online Multi-Object Tracking
Viaarxiv icon