Picture for Sean Sedwards

Sean Sedwards

Where Does the Answer Come From? Benchmarking View-Level Visual Evidence Identification in Multi-View MLLMs for Autonomous Driving

Add code
Jun 08, 2026
Viaarxiv icon

VISTAQA: Benchmarking Joint Visual Question Answering and Pixel-Level Evidence

Add code
May 20, 2026
Viaarxiv icon

HAWAII: Hierarchical Visual Knowledge Transfer for Efficient Vision-Language Models

Add code
Jun 23, 2025
Viaarxiv icon

LEO-MINI: An Efficient Multimodal Large Language Model using Conditional Token Reduction and Mixture of Multi-Modal Experts

Add code
Apr 07, 2025
Figure 1 for LEO-MINI: An Efficient Multimodal Large Language Model using Conditional Token Reduction and Mixture of Multi-Modal Experts
Figure 2 for LEO-MINI: An Efficient Multimodal Large Language Model using Conditional Token Reduction and Mixture of Multi-Modal Experts
Figure 3 for LEO-MINI: An Efficient Multimodal Large Language Model using Conditional Token Reduction and Mixture of Multi-Modal Experts
Figure 4 for LEO-MINI: An Efficient Multimodal Large Language Model using Conditional Token Reduction and Mixture of Multi-Modal Experts
Viaarxiv icon

OV-SCAN: Semantically Consistent Alignment for Novel Object Discovery in Open-Vocabulary 3D Object Detection

Add code
Mar 09, 2025
Figure 1 for OV-SCAN: Semantically Consistent Alignment for Novel Object Discovery in Open-Vocabulary 3D Object Detection
Figure 2 for OV-SCAN: Semantically Consistent Alignment for Novel Object Discovery in Open-Vocabulary 3D Object Detection
Figure 3 for OV-SCAN: Semantically Consistent Alignment for Novel Object Discovery in Open-Vocabulary 3D Object Detection
Figure 4 for OV-SCAN: Semantically Consistent Alignment for Novel Object Discovery in Open-Vocabulary 3D Object Detection
Viaarxiv icon

LEO: Boosting Mixture of Vision Encoders for Multimodal Large Language Models

Add code
Jan 13, 2025
Figure 1 for LEO: Boosting Mixture of Vision Encoders for Multimodal Large Language Models
Figure 2 for LEO: Boosting Mixture of Vision Encoders for Multimodal Large Language Models
Figure 3 for LEO: Boosting Mixture of Vision Encoders for Multimodal Large Language Models
Figure 4 for LEO: Boosting Mixture of Vision Encoders for Multimodal Large Language Models
Viaarxiv icon

VADet: Multi-frame LiDAR 3D Object Detection using Variable Aggregation

Add code
Nov 20, 2024
Figure 1 for VADet: Multi-frame LiDAR 3D Object Detection using Variable Aggregation
Figure 2 for VADet: Multi-frame LiDAR 3D Object Detection using Variable Aggregation
Figure 3 for VADet: Multi-frame LiDAR 3D Object Detection using Variable Aggregation
Figure 4 for VADet: Multi-frame LiDAR 3D Object Detection using Variable Aggregation
Viaarxiv icon

SOAP: Cross-sensor Domain Adaptation for 3D Object Detection Using Stationary Object Aggregation Pseudo-labelling

Add code
Jan 08, 2024
Figure 1 for SOAP: Cross-sensor Domain Adaptation for 3D Object Detection Using Stationary Object Aggregation Pseudo-labelling
Figure 2 for SOAP: Cross-sensor Domain Adaptation for 3D Object Detection Using Stationary Object Aggregation Pseudo-labelling
Figure 3 for SOAP: Cross-sensor Domain Adaptation for 3D Object Detection Using Stationary Object Aggregation Pseudo-labelling
Figure 4 for SOAP: Cross-sensor Domain Adaptation for 3D Object Detection Using Stationary Object Aggregation Pseudo-labelling
Viaarxiv icon

A Hierarchical Pedestrian Behavior Model to Generate Realistic Human Behavior in Traffic Simulation

Add code
Jun 01, 2022
Figure 1 for A Hierarchical Pedestrian Behavior Model to Generate Realistic Human Behavior in Traffic Simulation
Figure 2 for A Hierarchical Pedestrian Behavior Model to Generate Realistic Human Behavior in Traffic Simulation
Figure 3 for A Hierarchical Pedestrian Behavior Model to Generate Realistic Human Behavior in Traffic Simulation
Figure 4 for A Hierarchical Pedestrian Behavior Model to Generate Realistic Human Behavior in Traffic Simulation
Viaarxiv icon

Recursive Constraints to Prevent Instability in Constrained Reinforcement Learning

Add code
Jan 20, 2022
Viaarxiv icon