Picture for Junlin Han

Junlin Han

Learning to See Before Seeing: Demystifying LLM Visual Priors from Language Pre-training

Add code
Sep 30, 2025
Viaarxiv icon

Hallucination at a Glance: Controlled Visual Edits and Fine-Grained Multimodal Learning

Add code
Jun 08, 2025
Viaarxiv icon

VGRP-Bench: Visual Grid Reasoning Puzzle Benchmark for Large Vision-Language Models

Add code
Apr 02, 2025
Figure 1 for VGRP-Bench: Visual Grid Reasoning Puzzle Benchmark for Large Vision-Language Models
Figure 2 for VGRP-Bench: Visual Grid Reasoning Puzzle Benchmark for Large Vision-Language Models
Figure 3 for VGRP-Bench: Visual Grid Reasoning Puzzle Benchmark for Large Vision-Language Models
Figure 4 for VGRP-Bench: Visual Grid Reasoning Puzzle Benchmark for Large Vision-Language Models
Viaarxiv icon

Generalized Few-shot 3D Point Cloud Segmentation with Vision-Language Model

Add code
Mar 20, 2025
Figure 1 for Generalized Few-shot 3D Point Cloud Segmentation with Vision-Language Model
Figure 2 for Generalized Few-shot 3D Point Cloud Segmentation with Vision-Language Model
Figure 3 for Generalized Few-shot 3D Point Cloud Segmentation with Vision-Language Model
Figure 4 for Generalized Few-shot 3D Point Cloud Segmentation with Vision-Language Model
Viaarxiv icon

Semantic Score Distillation Sampling for Compositional Text-to-3D Generation

Add code
Oct 11, 2024
Figure 1 for Semantic Score Distillation Sampling for Compositional Text-to-3D Generation
Figure 2 for Semantic Score Distillation Sampling for Compositional Text-to-3D Generation
Figure 3 for Semantic Score Distillation Sampling for Compositional Text-to-3D Generation
Figure 4 for Semantic Score Distillation Sampling for Compositional Text-to-3D Generation
Viaarxiv icon

Flex3D: Feed-Forward 3D Generation With Flexible Reconstruction Model And Input View Curation

Add code
Oct 02, 2024
Figure 1 for Flex3D: Feed-Forward 3D Generation With Flexible Reconstruction Model And Input View Curation
Figure 2 for Flex3D: Feed-Forward 3D Generation With Flexible Reconstruction Model And Input View Curation
Figure 3 for Flex3D: Feed-Forward 3D Generation With Flexible Reconstruction Model And Input View Curation
Figure 4 for Flex3D: Feed-Forward 3D Generation With Flexible Reconstruction Model And Input View Curation
Viaarxiv icon

DreamBeast: Distilling 3D Fantastical Animals with Part-Aware Knowledge Transfer

Add code
Sep 12, 2024
Figure 1 for DreamBeast: Distilling 3D Fantastical Animals with Part-Aware Knowledge Transfer
Figure 2 for DreamBeast: Distilling 3D Fantastical Animals with Part-Aware Knowledge Transfer
Figure 3 for DreamBeast: Distilling 3D Fantastical Animals with Part-Aware Knowledge Transfer
Figure 4 for DreamBeast: Distilling 3D Fantastical Animals with Part-Aware Knowledge Transfer
Viaarxiv icon

Learning-based Multi-View Stereo: A Survey

Add code
Aug 27, 2024
Figure 1 for Learning-based Multi-View Stereo: A Survey
Figure 2 for Learning-based Multi-View Stereo: A Survey
Figure 3 for Learning-based Multi-View Stereo: A Survey
Figure 4 for Learning-based Multi-View Stereo: A Survey
Viaarxiv icon

VFusion3D: Learning Scalable 3D Generative Models from Video Diffusion Models

Add code
Mar 18, 2024
Viaarxiv icon

Strong and Controllable Blind Image Decomposition

Add code
Mar 15, 2024
Viaarxiv icon