Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Youngjo Min

Pose Splatter: A 3D Gaussian Splatting Model for Quantifying Animal Pose and Appearance

May 23, 2025

Jack Goffinet, Youngjo Min, Carlo Tomasi, David E. Carlson

Abstract:Accurate and scalable quantification of animal pose and appearance is crucial for studying behavior. Current 3D pose estimation techniques, such as keypoint- and mesh-based techniques, often face challenges including limited representational detail, labor-intensive annotation requirements, and expensive per-frame optimization. These limitations hinder the study of subtle movements and can make large-scale analyses impractical. We propose Pose Splatter, a novel framework leveraging shape carving and 3D Gaussian splatting to model the complete pose and appearance of laboratory animals without prior knowledge of animal geometry, per-frame optimization, or manual annotations. We also propose a novel rotation-invariant visual embedding technique for encoding pose and appearance, designed to be a plug-in replacement for 3D keypoint data in downstream behavioral analyses. Experiments on datasets of mice, rats, and zebra finches show Pose Splatter learns accurate 3D animal geometries. Notably, Pose Splatter represents subtle variations in pose, provides better low-dimensional pose embeddings over state-of-the-art as evaluated by humans, and generalizes to unseen data. By eliminating annotation and per-frame optimization bottlenecks, Pose Splatter enables analysis of large-scale, longitudinal behavior needed to map genotype, neural activity, and micro-behavior at unprecedented resolution.

* 19 pages, 13 figures

Via

Access Paper or Ask Questions

Controllable Style Transfer via Test-time Training of Implicit Neural Representation

Oct 17, 2022

Sunwoo Kim, Youngjo Min, Younghun Jung, Seungryong Kim

Figure 1 for Controllable Style Transfer via Test-time Training of Implicit Neural Representation

Figure 2 for Controllable Style Transfer via Test-time Training of Implicit Neural Representation

Figure 3 for Controllable Style Transfer via Test-time Training of Implicit Neural Representation

Figure 4 for Controllable Style Transfer via Test-time Training of Implicit Neural Representation

Abstract:We propose a controllable style transfer framework based on Implicit Neural Representation that pixel-wisely controls the stylized output via test-time training. Unlike traditional image optimization methods that often suffer from unstable convergence and learning-based methods that require intensive training and have limited generalization ability, we present a model optimization framework that optimizes the neural networks during test-time with explicit loss functions for style transfer. After being test-time trained once, thanks to the flexibility of the INR-based model, our framework can precisely control the stylized images in a pixel-wise manner and freely adjust image resolution without further optimization or training. We demonstrate several applications.

* Project Page: https://ku-cvlab.github.io/INR-st/

Via

Access Paper or Ask Questions

ConMatch: Semi-Supervised Learning with Confidence-Guided Consistency Regularization

Aug 18, 2022

Jiwon Kim, Youngjo Min, Daehwan Kim, Gyuseong Lee, Junyoung Seo, Kwangrok Ryoo, Seungryong Kim

Figure 1 for ConMatch: Semi-Supervised Learning with Confidence-Guided Consistency Regularization

Figure 2 for ConMatch: Semi-Supervised Learning with Confidence-Guided Consistency Regularization

Figure 3 for ConMatch: Semi-Supervised Learning with Confidence-Guided Consistency Regularization

Figure 4 for ConMatch: Semi-Supervised Learning with Confidence-Guided Consistency Regularization

Abstract:We present a novel semi-supervised learning framework that intelligently leverages the consistency regularization between the model's predictions from two strongly-augmented views of an image, weighted by a confidence of pseudo-label, dubbed ConMatch. While the latest semi-supervised learning methods use weakly- and strongly-augmented views of an image to define a directional consistency loss, how to define such direction for the consistency regularization between two strongly-augmented views remains unexplored. To account for this, we present novel confidence measures for pseudo-labels from strongly-augmented views by means of weakly-augmented view as an anchor in non-parametric and parametric approaches. Especially, in parametric approach, we present, for the first time, to learn the confidence of pseudo-label within the networks, which is learned with backbone model in an end-to-end manner. In addition, we also present a stage-wise training to boost the convergence of training. When incorporated in existing semi-supervised learners, ConMatch consistently boosts the performance. We conduct experiments to demonstrate the effectiveness of our ConMatch over the latest methods and provide extensive ablation studies. Code has been made publicly available at https://github.com/JiwonCocoder/ConMatch.

* Accepted at ECCV 2022

Via

Access Paper or Ask Questions

Joint Learning of Feature Extraction and Cost Aggregation for Semantic Correspondence

Apr 19, 2022

Jiwon Kim, Youngjo Min, Mira Kim, Seungryong Kim

Figure 1 for Joint Learning of Feature Extraction and Cost Aggregation for Semantic Correspondence

Figure 2 for Joint Learning of Feature Extraction and Cost Aggregation for Semantic Correspondence

Figure 3 for Joint Learning of Feature Extraction and Cost Aggregation for Semantic Correspondence

Figure 4 for Joint Learning of Feature Extraction and Cost Aggregation for Semantic Correspondence

Abstract:Establishing dense correspondences across semantically similar images is one of the challenging tasks due to the significant intra-class variations and background clutters. To solve these problems, numerous methods have been proposed, focused on learning feature extractor or cost aggregation independently, which yields sub-optimal performance. In this paper, we propose a novel framework for jointly learning feature extraction and cost aggregation for semantic correspondence. By exploiting the pseudo labels from each module, the networks consisting of feature extraction and cost aggregation modules are simultaneously learned in a boosting fashion. Moreover, to ignore unreliable pseudo labels, we present a confidence-aware contrastive loss function for learning the networks in a weakly-supervised manner. We demonstrate our competitive results on standard benchmarks for semantic correspondence.

* IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2022

Via

Access Paper or Ask Questions