Picture for Hongmin Cai

Hongmin Cai

PEAM: Parametric Embodied Agent Memory through Contrastive Internalization of Experience in Minecraft

Add code
May 26, 2026
Viaarxiv icon

Can Segmentation Models Understand the World? Towards Proactive Affordance Reasoning via Visual Chain-of-Thought

Add code
May 26, 2026
Viaarxiv icon

SplitAvatar: One-shot Head Avatar with Autoregressive Gaussian Splitting

Add code
May 25, 2026
Viaarxiv icon

LumiVideo: An Intelligent Agentic System for Video Color Grading

Add code
Apr 02, 2026
Viaarxiv icon

FunduSAM: A Specialized Deep Learning Model for Enhanced Optic Disc and Cup Segmentation in Fundus Images

Add code
Feb 10, 2025
Viaarxiv icon

Goal-Driven Reasoning in DatalogMTL with Magic Sets

Add code
Dec 10, 2024
Figure 1 for Goal-Driven Reasoning in DatalogMTL with Magic Sets
Figure 2 for Goal-Driven Reasoning in DatalogMTL with Magic Sets
Viaarxiv icon

Accelerate Neural Subspace-Based Reduced-Order Solver of Deformable Simulation by Lipschitz Optimization

Add code
Sep 05, 2024
Figure 1 for Accelerate Neural Subspace-Based Reduced-Order Solver of Deformable Simulation by Lipschitz Optimization
Figure 2 for Accelerate Neural Subspace-Based Reduced-Order Solver of Deformable Simulation by Lipschitz Optimization
Figure 3 for Accelerate Neural Subspace-Based Reduced-Order Solver of Deformable Simulation by Lipschitz Optimization
Figure 4 for Accelerate Neural Subspace-Based Reduced-Order Solver of Deformable Simulation by Lipschitz Optimization
Viaarxiv icon

Multiple-Crop Human Mesh Recovery with Contrastive Learning and Camera Consistency in A Single Image

Add code
Feb 03, 2024
Figure 1 for Multiple-Crop Human Mesh Recovery with Contrastive Learning and Camera Consistency in A Single Image
Figure 2 for Multiple-Crop Human Mesh Recovery with Contrastive Learning and Camera Consistency in A Single Image
Figure 3 for Multiple-Crop Human Mesh Recovery with Contrastive Learning and Camera Consistency in A Single Image
Figure 4 for Multiple-Crop Human Mesh Recovery with Contrastive Learning and Camera Consistency in A Single Image
Viaarxiv icon

Interleaving One-Class and Weakly-Supervised Models with Adaptive Thresholding for Unsupervised Video Anomaly Detection

Add code
Jan 24, 2024
Figure 1 for Interleaving One-Class and Weakly-Supervised Models with Adaptive Thresholding for Unsupervised Video Anomaly Detection
Figure 2 for Interleaving One-Class and Weakly-Supervised Models with Adaptive Thresholding for Unsupervised Video Anomaly Detection
Figure 3 for Interleaving One-Class and Weakly-Supervised Models with Adaptive Thresholding for Unsupervised Video Anomaly Detection
Figure 4 for Interleaving One-Class and Weakly-Supervised Models with Adaptive Thresholding for Unsupervised Video Anomaly Detection
Viaarxiv icon

The Radiation Oncology NLP Database

Add code
Jan 19, 2024
Figure 1 for The Radiation Oncology NLP Database
Figure 2 for The Radiation Oncology NLP Database
Figure 3 for The Radiation Oncology NLP Database
Figure 4 for The Radiation Oncology NLP Database
Viaarxiv icon