Picture for Simon Stepputtis

Simon Stepputtis

InstructPart: Task-Oriented Part Segmentation with Instruction Reasoning

Add code
May 23, 2025
Viaarxiv icon

Spectral-Aware Global Fusion for RGB-Thermal Semantic Segmentation

Add code
May 21, 2025
Viaarxiv icon

Model-Agnostic Policy Explanations with Large Language Models

Add code
Apr 08, 2025
Viaarxiv icon

Self-Correcting Decoding with Generative Feedback for Mitigating Hallucinations in Large Vision-Language Models

Add code
Feb 10, 2025
Viaarxiv icon

HiMemFormer: Hierarchical Memory-Aware Transformer for Multi-Agent Action Anticipation

Add code
Nov 03, 2024
Figure 1 for HiMemFormer: Hierarchical Memory-Aware Transformer for Multi-Agent Action Anticipation
Figure 2 for HiMemFormer: Hierarchical Memory-Aware Transformer for Multi-Agent Action Anticipation
Figure 3 for HiMemFormer: Hierarchical Memory-Aware Transformer for Multi-Agent Action Anticipation
Figure 4 for HiMemFormer: Hierarchical Memory-Aware Transformer for Multi-Agent Action Anticipation
Viaarxiv icon

LogiCity: Advancing Neuro-Symbolic AI with Abstract Urban Simulation

Add code
Nov 01, 2024
Figure 1 for LogiCity: Advancing Neuro-Symbolic AI with Abstract Urban Simulation
Figure 2 for LogiCity: Advancing Neuro-Symbolic AI with Abstract Urban Simulation
Figure 3 for LogiCity: Advancing Neuro-Symbolic AI with Abstract Urban Simulation
Figure 4 for LogiCity: Advancing Neuro-Symbolic AI with Abstract Urban Simulation
Viaarxiv icon

Symbolic Graph Inference for Compound Scene Understanding

Add code
Oct 30, 2024
Figure 1 for Symbolic Graph Inference for Compound Scene Understanding
Figure 2 for Symbolic Graph Inference for Compound Scene Understanding
Figure 3 for Symbolic Graph Inference for Compound Scene Understanding
Viaarxiv icon

Navigating Noisy Feedback: Enhancing Reinforcement Learning with Error-Prone Language Models

Add code
Oct 22, 2024
Viaarxiv icon

Dual Prototype Evolving for Test-Time Generalization of Vision-Language Models

Add code
Oct 16, 2024
Figure 1 for Dual Prototype Evolving for Test-Time Generalization of Vision-Language Models
Figure 2 for Dual Prototype Evolving for Test-Time Generalization of Vision-Language Models
Figure 3 for Dual Prototype Evolving for Test-Time Generalization of Vision-Language Models
Figure 4 for Dual Prototype Evolving for Test-Time Generalization of Vision-Language Models
Viaarxiv icon

A Comparison of Imitation Learning Algorithms for Bimanual Manipulation

Add code
Aug 13, 2024
Figure 1 for A Comparison of Imitation Learning Algorithms for Bimanual Manipulation
Figure 2 for A Comparison of Imitation Learning Algorithms for Bimanual Manipulation
Figure 3 for A Comparison of Imitation Learning Algorithms for Bimanual Manipulation
Figure 4 for A Comparison of Imitation Learning Algorithms for Bimanual Manipulation
Viaarxiv icon