Picture for Junjun He

Junjun He

dMLLM-TTS: Self-Verified and Efficient Test-Time Scaling for Diffusion Multi-Modal Large Language Models

Add code
Dec 22, 2025
Figure 1 for dMLLM-TTS: Self-Verified and Efficient Test-Time Scaling for Diffusion Multi-Modal Large Language Models
Figure 2 for dMLLM-TTS: Self-Verified and Efficient Test-Time Scaling for Diffusion Multi-Modal Large Language Models
Figure 3 for dMLLM-TTS: Self-Verified and Efficient Test-Time Scaling for Diffusion Multi-Modal Large Language Models
Figure 4 for dMLLM-TTS: Self-Verified and Efficient Test-Time Scaling for Diffusion Multi-Modal Large Language Models
Viaarxiv icon

Probing Scientific General Intelligence of LLMs with Scientist-Aligned Workflows

Add code
Dec 18, 2025
Viaarxiv icon

MedQ-Bench: Evaluating and Exploring Medical Image Quality Assessment Abilities in MLLMs

Add code
Oct 02, 2025
Figure 1 for MedQ-Bench: Evaluating and Exploring Medical Image Quality Assessment Abilities in MLLMs
Figure 2 for MedQ-Bench: Evaluating and Exploring Medical Image Quality Assessment Abilities in MLLMs
Figure 3 for MedQ-Bench: Evaluating and Exploring Medical Image Quality Assessment Abilities in MLLMs
Figure 4 for MedQ-Bench: Evaluating and Exploring Medical Image Quality Assessment Abilities in MLLMs
Viaarxiv icon

TopoSculpt: Betti-Steered Topological Sculpting of 3D Fine-grained Tubular Shapes

Add code
Sep 04, 2025
Viaarxiv icon

A Survey of Scientific Large Language Models: From Data Foundations to Agent Frontiers

Add code
Aug 28, 2025
Figure 1 for A Survey of Scientific Large Language Models: From Data Foundations to Agent Frontiers
Figure 2 for A Survey of Scientific Large Language Models: From Data Foundations to Agent Frontiers
Figure 3 for A Survey of Scientific Large Language Models: From Data Foundations to Agent Frontiers
Figure 4 for A Survey of Scientific Large Language Models: From Data Foundations to Agent Frontiers
Viaarxiv icon

InternVL3.5: Advancing Open-Source Multimodal Models in Versatility, Reasoning, and Efficiency

Add code
Aug 25, 2025
Figure 1 for InternVL3.5: Advancing Open-Source Multimodal Models in Versatility, Reasoning, and Efficiency
Figure 2 for InternVL3.5: Advancing Open-Source Multimodal Models in Versatility, Reasoning, and Efficiency
Figure 3 for InternVL3.5: Advancing Open-Source Multimodal Models in Versatility, Reasoning, and Efficiency
Figure 4 for InternVL3.5: Advancing Open-Source Multimodal Models in Versatility, Reasoning, and Efficiency
Viaarxiv icon

EventRR: Event Referential Reasoning for Referring Video Object Segmentation

Add code
Aug 10, 2025
Viaarxiv icon

S2-UniSeg: Fast Universal Agglomerative Pooling for Scalable Segment Anything without Supervision

Add code
Aug 09, 2025
Viaarxiv icon

F^2TTA: Free-Form Test-Time Adaptation on Cross-Domain Medical Image Classification via Image-Level Disentangled Prompt Tuning

Add code
Jul 03, 2025
Viaarxiv icon

ADAgent: LLM Agent for Alzheimer's Disease Analysis with Collaborative Coordinator

Add code
Jun 16, 2025
Viaarxiv icon