Akshita Gupta

University of Florida

ReCap: Lightweight Referential Grounding for Coherent Story Visualization

Apr 20, 2026

HaloProbe: Bayesian Detection and Mitigation of Object Hallucinations in Vision-Language Models

Apr 07, 2026

A Comprehensive Benchmark of Histopathology Foundation Models for Kidney Histopathology

Mar 16, 2026

Extending $μ$P: Spectral Conditions for Feature Learning Across Optimizers

Feb 24, 2026

Information-Driven Fault Detection and Identification for Multi-Agent Spacecraft Systems: Collaborative On-Orbit Inspection Mission

Nov 11, 2025

A multi-modal dataset for insect biodiversity with imagery and DNA at the trap and individual level

Jul 09, 2025

Global Task-aware Fault Detection, Identification For On-Orbit Multi-Spacecraft Collaborative Inspection

May 06, 2025

Visatronic: A Multimodal Decoder-Only Model for Speech Synthesis

Nov 26, 2024

Open-Vocabulary Temporal Action Localization using Multimodal Guidance

Jun 21, 2024

LoSA: Long-Short-range Adapter for Scaling End-to-End Temporal Action Localization

Apr 01, 2024