Picture for Aditya Sharma

Aditya Sharma

DeSQ: Decomposition-based SPARQL Query Generation

Add code
May 29, 2026
Viaarxiv icon

Generative Floor Plan Design with LLMs via Reinforcement Learning with Verifiable Rewards

Add code
May 13, 2026
Viaarxiv icon

MedHELM: Holistic Evaluation of Large Language Models for Medical Tasks

Add code
May 26, 2025
Figure 1 for MedHELM: Holistic Evaluation of Large Language Models for Medical Tasks
Figure 2 for MedHELM: Holistic Evaluation of Large Language Models for Medical Tasks
Figure 3 for MedHELM: Holistic Evaluation of Large Language Models for Medical Tasks
Figure 4 for MedHELM: Holistic Evaluation of Large Language Models for Medical Tasks
Viaarxiv icon

Reducing Hallucinations in Language Model-based SPARQL Query Generation Using Post-Generation Memory Retrieval

Add code
Feb 19, 2025
Figure 1 for Reducing Hallucinations in Language Model-based SPARQL Query Generation Using Post-Generation Memory Retrieval
Figure 2 for Reducing Hallucinations in Language Model-based SPARQL Query Generation Using Post-Generation Memory Retrieval
Figure 3 for Reducing Hallucinations in Language Model-based SPARQL Query Generation Using Post-Generation Memory Retrieval
Figure 4 for Reducing Hallucinations in Language Model-based SPARQL Query Generation Using Post-Generation Memory Retrieval
Viaarxiv icon

GeoCoder: Solving Geometry Problems by Generating Modular Code through Vision-Language Models

Add code
Oct 17, 2024
Figure 1 for GeoCoder: Solving Geometry Problems by Generating Modular Code through Vision-Language Models
Figure 2 for GeoCoder: Solving Geometry Problems by Generating Modular Code through Vision-Language Models
Figure 3 for GeoCoder: Solving Geometry Problems by Generating Modular Code through Vision-Language Models
Figure 4 for GeoCoder: Solving Geometry Problems by Generating Modular Code through Vision-Language Models
Viaarxiv icon

Losing Visual Needles in Image Haystacks: Vision Language Models are Easily Distracted in Short and Long Contexts

Add code
Jun 24, 2024
Figure 1 for Losing Visual Needles in Image Haystacks: Vision Language Models are Easily Distracted in Short and Long Contexts
Figure 2 for Losing Visual Needles in Image Haystacks: Vision Language Models are Easily Distracted in Short and Long Contexts
Figure 3 for Losing Visual Needles in Image Haystacks: Vision Language Models are Easily Distracted in Short and Long Contexts
Figure 4 for Losing Visual Needles in Image Haystacks: Vision Language Models are Easily Distracted in Short and Long Contexts
Viaarxiv icon

Bi-level Trajectory Optimization on Uneven Terrains with Differentiable Wheel-Terrain Interaction Model

Add code
Apr 11, 2024
Figure 1 for Bi-level Trajectory Optimization on Uneven Terrains with Differentiable Wheel-Terrain Interaction Model
Figure 2 for Bi-level Trajectory Optimization on Uneven Terrains with Differentiable Wheel-Terrain Interaction Model
Figure 3 for Bi-level Trajectory Optimization on Uneven Terrains with Differentiable Wheel-Terrain Interaction Model
Figure 4 for Bi-level Trajectory Optimization on Uneven Terrains with Differentiable Wheel-Terrain Interaction Model
Viaarxiv icon

Who Evaluates the Evaluations? Objectively Scoring Text-to-Image Prompt Coherence Metrics with T2IScoreScore (TS2)

Add code
Apr 05, 2024
Figure 1 for Who Evaluates the Evaluations? Objectively Scoring Text-to-Image Prompt Coherence Metrics with T2IScoreScore (TS2)
Figure 2 for Who Evaluates the Evaluations? Objectively Scoring Text-to-Image Prompt Coherence Metrics with T2IScoreScore (TS2)
Figure 3 for Who Evaluates the Evaluations? Objectively Scoring Text-to-Image Prompt Coherence Metrics with T2IScoreScore (TS2)
Figure 4 for Who Evaluates the Evaluations? Objectively Scoring Text-to-Image Prompt Coherence Metrics with T2IScoreScore (TS2)
Viaarxiv icon

Standing on FURM ground -- A framework for evaluating Fair, Useful, and Reliable AI Models in healthcare systems

Add code
Mar 14, 2024
Figure 1 for Standing on FURM ground -- A framework for evaluating Fair, Useful, and Reliable AI Models in healthcare systems
Figure 2 for Standing on FURM ground -- A framework for evaluating Fair, Useful, and Reliable AI Models in healthcare systems
Figure 3 for Standing on FURM ground -- A framework for evaluating Fair, Useful, and Reliable AI Models in healthcare systems
Figure 4 for Standing on FURM ground -- A framework for evaluating Fair, Useful, and Reliable AI Models in healthcare systems
Viaarxiv icon

ATPPNet: Attention based Temporal Point cloud Prediction Network

Add code
Jan 30, 2024
Viaarxiv icon