Tuan Dung Nguyen

Med-StepBench: A Hierarchical Reasoning Framework for Evaluating Hallucinations in Medical Vision-Language Models

May 11, 2026
Via arXiv

Region-Grounded Report Generation for 3D Medical Imaging: A Fine-Grained Dataset and Graph-Enhanced Framework

Apr 20, 2026
Via arXiv

Beyond the Global Scores: Fine-Grained Token Grounding as a Robust Detector of LVLM Hallucinations

Apr 06, 2026
Via arXiv

Overthinking Causes Hallucination: Tracing Confounder Propagation in Vision Language Models

Mar 08, 2026
Via arXiv

AstroMLab 4: Benchmark-Topping Performance in Astronomy Q&A with a 70B-Parameter Domain-Specialized Reasoning Model

May 23, 2025
Via arXiv

Empirically evaluating commonsense intelligence in large language models with large-scale human judgments

May 15, 2025
Via arXiv

AstroMLab 2: AstroLLaMA-2-70B Model and Benchmarking Specialised LLMs for Astronomy

Sep 29, 2024
Via arXiv

AstroMLab 1: Who Wins Astronomy Jeopardy!?

Jul 15, 2024
Via arXiv

Federated PCA on Grassmann Manifold for IoT Anomaly Detection

Jul 10, 2024
Via arXiv

AstroLLaMA-Chat: Scaling AstroLLaMA with Conversational and Diverse Datasets

Jan 05, 2024
Via arXiv