Picture for Dimitris N. Metaxas

Dimitris N. Metaxas

Rutgers University

Your Reward Function for RL is Your Best PRM for Search: Unifying RL and Search-Based TTS

Add code
Aug 19, 2025
Viaarxiv icon

SignX: The Foundation Model for Sign Recognition

Add code
Apr 22, 2025
Viaarxiv icon

Two Heads are Better Than One: Test-time Scaling of Multi-agent Collaborative Reasoning

Add code
Apr 14, 2025
Viaarxiv icon

Show and Segment: Universal Medical Image Segmentation via In-Context Learning

Add code
Mar 25, 2025
Viaarxiv icon

LED: LLM Enhanced Open-Vocabulary Object Detection without Human Curated Data Generation

Add code
Mar 18, 2025
Figure 1 for LED: LLM Enhanced Open-Vocabulary Object Detection without Human Curated Data Generation
Figure 2 for LED: LLM Enhanced Open-Vocabulary Object Detection without Human Curated Data Generation
Figure 3 for LED: LLM Enhanced Open-Vocabulary Object Detection without Human Curated Data Generation
Figure 4 for LED: LLM Enhanced Open-Vocabulary Object Detection without Human Curated Data Generation
Viaarxiv icon

Snapmoji: Instant Generation of Animatable Dual-Stylized Avatars

Add code
Mar 15, 2025
Figure 1 for Snapmoji: Instant Generation of Animatable Dual-Stylized Avatars
Figure 2 for Snapmoji: Instant Generation of Animatable Dual-Stylized Avatars
Figure 3 for Snapmoji: Instant Generation of Animatable Dual-Stylized Avatars
Figure 4 for Snapmoji: Instant Generation of Animatable Dual-Stylized Avatars
Viaarxiv icon

Towards Universal Learning-based Model for Cardiac Image Reconstruction: Summary of the CMRxRecon2024 Challenge

Add code
Mar 05, 2025
Viaarxiv icon

LUCAS: Layered Universal Codec Avatars

Add code
Feb 27, 2025
Viaarxiv icon

The Hidden Life of Tokens: Reducing Hallucination of Large Vision-Language Models via Visual Information Steering

Add code
Feb 05, 2025
Figure 1 for The Hidden Life of Tokens: Reducing Hallucination of Large Vision-Language Models via Visual Information Steering
Figure 2 for The Hidden Life of Tokens: Reducing Hallucination of Large Vision-Language Models via Visual Information Steering
Figure 3 for The Hidden Life of Tokens: Reducing Hallucination of Large Vision-Language Models via Visual Information Steering
Figure 4 for The Hidden Life of Tokens: Reducing Hallucination of Large Vision-Language Models via Visual Information Steering
Viaarxiv icon

RankFlow: A Multi-Role Collaborative Reranking Workflow Utilizing Large Language Models

Add code
Feb 04, 2025
Figure 1 for RankFlow: A Multi-Role Collaborative Reranking Workflow Utilizing Large Language Models
Figure 2 for RankFlow: A Multi-Role Collaborative Reranking Workflow Utilizing Large Language Models
Figure 3 for RankFlow: A Multi-Role Collaborative Reranking Workflow Utilizing Large Language Models
Figure 4 for RankFlow: A Multi-Role Collaborative Reranking Workflow Utilizing Large Language Models
Viaarxiv icon