Picture for Sheng Jin

Sheng Jin

OpenUni: A Simple Baseline for Unified Multimodal Understanding and Generation

Add code
May 29, 2025
Viaarxiv icon

Harmonizing Visual Representations for Unified Multimodal Understanding and Generation

Add code
Mar 27, 2025
Viaarxiv icon

NADER: Neural Architecture Design via Multi-Agent Collaboration

Add code
Dec 26, 2024
Figure 1 for NADER: Neural Architecture Design via Multi-Agent Collaboration
Figure 2 for NADER: Neural Architecture Design via Multi-Agent Collaboration
Figure 3 for NADER: Neural Architecture Design via Multi-Agent Collaboration
Figure 4 for NADER: Neural Architecture Design via Multi-Agent Collaboration
Viaarxiv icon

KptLLM: Unveiling the Power of Large Language Model for Keypoint Comprehension

Add code
Nov 04, 2024
Figure 1 for KptLLM: Unveiling the Power of Large Language Model for Keypoint Comprehension
Figure 2 for KptLLM: Unveiling the Power of Large Language Model for Keypoint Comprehension
Figure 3 for KptLLM: Unveiling the Power of Large Language Model for Keypoint Comprehension
Figure 4 for KptLLM: Unveiling the Power of Large Language Model for Keypoint Comprehension
Viaarxiv icon

Frame-Voyager: Learning to Query Frames for Video Large Language Models

Add code
Oct 07, 2024
Viaarxiv icon

FrozenSeg: Harmonizing Frozen Foundation Models for Open-Vocabulary Segmentation

Add code
Sep 05, 2024
Figure 1 for FrozenSeg: Harmonizing Frozen Foundation Models for Open-Vocabulary Segmentation
Figure 2 for FrozenSeg: Harmonizing Frozen Foundation Models for Open-Vocabulary Segmentation
Figure 3 for FrozenSeg: Harmonizing Frozen Foundation Models for Open-Vocabulary Segmentation
Figure 4 for FrozenSeg: Harmonizing Frozen Foundation Models for Open-Vocabulary Segmentation
Viaarxiv icon

ESOD: Efficient Small Object Detection on High-Resolution Images

Add code
Jul 23, 2024
Figure 1 for ESOD: Efficient Small Object Detection on High-Resolution Images
Figure 2 for ESOD: Efficient Small Object Detection on High-Resolution Images
Figure 3 for ESOD: Efficient Small Object Detection on High-Resolution Images
Figure 4 for ESOD: Efficient Small Object Detection on High-Resolution Images
Viaarxiv icon

Category-Extensible Out-of-Distribution Detection via Hierarchical Context Descriptions

Add code
Jul 23, 2024
Viaarxiv icon

Rethinking Out-of-Distribution Detection on Imbalanced Data Distribution

Add code
Jul 23, 2024
Viaarxiv icon

TCFormer: Visual Recognition via Token Clustering Transformer

Add code
Jul 16, 2024
Viaarxiv icon