Jinkyu Kim

Watermarking for Factuality: Guiding Vision-Language Models Toward Truth via Tri-layer Contrastive Decoding

Oct 16, 2025

SemanticControl: A Training-Free Approach for Handling Loosely Aligned Visual Conditions in ControlNet

Sep 26, 2025

LRSLAM: Low-rank Representation of Signed Distance Fields in Dense Visual SLAM System

Jun 12, 2025

Mitigating Trade-off: Stream and Query-guided Aggregation for Efficient and Effective 3D Occupancy Prediction

Mar 28, 2025

3D Occupancy Prediction with Low-Resolution Queries via Prototype-aware View Transformation

Mar 19, 2025

GUIDE-CoT: Goal-driven and User-Informed Dynamic Estimation for Pedestrian Trajectory using Chain-of-Thought

Mar 10, 2025

DiffExp: Efficient Exploration in Reward Fine-tuning for Text-to-Image Diffusion Models

Feb 19, 2025

Unified Domain Generalization and Adaptation for Multi-View 3D Object Detection

Oct 29, 2024

ENTP: Encoder-only Next Token Prediction

Oct 02, 2024

Finetuning Pre-trained Model with Limited Data for LiDAR-based 3D Object Detection by Bridging Domain Gaps

Oct 02, 2024