Picture for Ravi Kumar Satzoda

Ravi Kumar Satzoda

DocKD: Knowledge Distillation from LLMs for Open-World Document Understanding Models

Add code
Oct 04, 2024
Viaarxiv icon

RAVEN: Multitask Retrieval Augmented Vision-Language Learning

Add code
Jun 27, 2024
Figure 1 for RAVEN: Multitask Retrieval Augmented Vision-Language Learning
Figure 2 for RAVEN: Multitask Retrieval Augmented Vision-Language Learning
Figure 3 for RAVEN: Multitask Retrieval Augmented Vision-Language Learning
Figure 4 for RAVEN: Multitask Retrieval Augmented Vision-Language Learning
Viaarxiv icon

DocTr: Document Transformer for Structured Information Extraction in Documents

Add code
Jul 16, 2023
Viaarxiv icon

PolyFormer: Referring Image Segmentation as Sequential Polygon Generation

Add code
Feb 14, 2023
Viaarxiv icon

A Multimodal, Full-Surround Vehicular Testbed for Naturalistic Studies and Benchmarking: Design, Calibration and Deployment

Add code
Mar 20, 2018
Figure 1 for A Multimodal, Full-Surround Vehicular Testbed for Naturalistic Studies and Benchmarking: Design, Calibration and Deployment
Figure 2 for A Multimodal, Full-Surround Vehicular Testbed for Naturalistic Studies and Benchmarking: Design, Calibration and Deployment
Figure 3 for A Multimodal, Full-Surround Vehicular Testbed for Naturalistic Studies and Benchmarking: Design, Calibration and Deployment
Figure 4 for A Multimodal, Full-Surround Vehicular Testbed for Naturalistic Studies and Benchmarking: Design, Calibration and Deployment
Viaarxiv icon