Alert button
Picture for Hongkai Xiong

Hongkai Xiong

Alert button

From CLIP to DINO: Visual Encoders Shout in Multi-modal Large Language Models

Add code
Bookmark button
Alert button
Oct 18, 2023
Dongsheng Jiang, Yuchen Liu, Songlin Liu, Xiaopeng Zhang, Jin Li, Hongkai Xiong, Qi Tian

Figure 1 for From CLIP to DINO: Visual Encoders Shout in Multi-modal Large Language Models
Figure 2 for From CLIP to DINO: Visual Encoders Shout in Multi-modal Large Language Models
Figure 3 for From CLIP to DINO: Visual Encoders Shout in Multi-modal Large Language Models
Figure 4 for From CLIP to DINO: Visual Encoders Shout in Multi-modal Large Language Models
Viaarxiv icon

Prune Spatio-temporal Tokens by Semantic-aware Temporal Accumulation

Add code
Bookmark button
Alert button
Aug 08, 2023
Shuangrui Ding, Peisen Zhao, Xiaopeng Zhang, Rui Qian, Hongkai Xiong, Qi Tian

Figure 1 for Prune Spatio-temporal Tokens by Semantic-aware Temporal Accumulation
Figure 2 for Prune Spatio-temporal Tokens by Semantic-aware Temporal Accumulation
Figure 3 for Prune Spatio-temporal Tokens by Semantic-aware Temporal Accumulation
Figure 4 for Prune Spatio-temporal Tokens by Semantic-aware Temporal Accumulation
Viaarxiv icon

ActionPrompt: Action-Guided 3D Human Pose Estimation With Text and Pose Prompting

Add code
Bookmark button
Alert button
Jul 18, 2023
Hongwei Zheng, Han Li, Bowen Shi, Wenrui Dai, Botao Wan, Yu Sun, Min Guo, Hongkai Xiong

Figure 1 for ActionPrompt: Action-Guided 3D Human Pose Estimation With Text and Pose Prompting
Figure 2 for ActionPrompt: Action-Guided 3D Human Pose Estimation With Text and Pose Prompting
Figure 3 for ActionPrompt: Action-Guided 3D Human Pose Estimation With Text and Pose Prompting
Figure 4 for ActionPrompt: Action-Guided 3D Human Pose Estimation With Text and Pose Prompting
Viaarxiv icon

Hybrid Distillation: Connecting Masked Autoencoders with Contrastive Learners

Add code
Bookmark button
Alert button
Jun 28, 2023
Bowen Shi, Xiaopeng Zhang, Yaoming Wang, Jin Li, Wenrui Dai, Junni Zou, Hongkai Xiong, Qi Tian

Figure 1 for Hybrid Distillation: Connecting Masked Autoencoders with Contrastive Learners
Figure 2 for Hybrid Distillation: Connecting Masked Autoencoders with Contrastive Learners
Figure 3 for Hybrid Distillation: Connecting Masked Autoencoders with Contrastive Learners
Figure 4 for Hybrid Distillation: Connecting Masked Autoencoders with Contrastive Learners
Viaarxiv icon

Dynamic Scenario Representation Learning for Motion Forecasting with Heterogeneous Graph Convolutional Recurrent Networks

Add code
Bookmark button
Alert button
Mar 08, 2023
Xing Gao, Xiaogang Jia, Yikang Li, Hongkai Xiong

Figure 1 for Dynamic Scenario Representation Learning for Motion Forecasting with Heterogeneous Graph Convolutional Recurrent Networks
Figure 2 for Dynamic Scenario Representation Learning for Motion Forecasting with Heterogeneous Graph Convolutional Recurrent Networks
Figure 3 for Dynamic Scenario Representation Learning for Motion Forecasting with Heterogeneous Graph Convolutional Recurrent Networks
Figure 4 for Dynamic Scenario Representation Learning for Motion Forecasting with Heterogeneous Graph Convolutional Recurrent Networks
Viaarxiv icon

Learned Lossless Compression for JPEG via Frequency-Domain Prediction

Add code
Bookmark button
Alert button
Mar 05, 2023
Jixiang Luo, Shaohui Li, Wenrui Dai, Chenglin Li, Junni Zou, Hongkai Xiong

Figure 1 for Learned Lossless Compression for JPEG via Frequency-Domain Prediction
Figure 2 for Learned Lossless Compression for JPEG via Frequency-Domain Prediction
Figure 3 for Learned Lossless Compression for JPEG via Frequency-Domain Prediction
Figure 4 for Learned Lossless Compression for JPEG via Frequency-Domain Prediction
Viaarxiv icon

Pose-Oriented Transformer with Uncertainty-Guided Refinement for 2D-to-3D Human Pose Estimation

Add code
Bookmark button
Alert button
Feb 15, 2023
Han Li, Bowen Shi, Wenrui Dai, Hongwei Zheng, Botao Wang, Yu Sun, Min Guo, Chenlin Li, Junni Zou, Hongkai Xiong

Figure 1 for Pose-Oriented Transformer with Uncertainty-Guided Refinement for 2D-to-3D Human Pose Estimation
Figure 2 for Pose-Oriented Transformer with Uncertainty-Guided Refinement for 2D-to-3D Human Pose Estimation
Figure 3 for Pose-Oriented Transformer with Uncertainty-Guided Refinement for 2D-to-3D Human Pose Estimation
Figure 4 for Pose-Oriented Transformer with Uncertainty-Guided Refinement for 2D-to-3D Human Pose Estimation
Viaarxiv icon

Dual adaptive training of photonic neural networks

Add code
Bookmark button
Alert button
Dec 09, 2022
Ziyang Zheng, Zhengyang Duan, Hang Chen, Rui Yang, Sheng Gao, Haiou Zhang, Hongkai Xiong, Xing Lin

Figure 1 for Dual adaptive training of photonic neural networks
Figure 2 for Dual adaptive training of photonic neural networks
Figure 3 for Dual adaptive training of photonic neural networks
Figure 4 for Dual adaptive training of photonic neural networks
Viaarxiv icon

MFDNet: Towards Real-time Image Denoising On Mobile Devices

Add code
Bookmark button
Alert button
Nov 09, 2022
Zhuoqun Liu, Meiguang Jin, Ying Chen, Huaida Liu, Canqian Yang, Hongkai Xiong

Figure 1 for MFDNet: Towards Real-time Image Denoising On Mobile Devices
Figure 2 for MFDNet: Towards Real-time Image Denoising On Mobile Devices
Figure 3 for MFDNet: Towards Real-time Image Denoising On Mobile Devices
Figure 4 for MFDNet: Towards Real-time Image Denoising On Mobile Devices
Viaarxiv icon