Alert button

"Image": models, code, and papers
Alert button

SVDM: Single-View Diffusion Model for Pseudo-Stereo 3D Object Detection

Jul 05, 2023
Yuguang Shi

Figure 1 for SVDM: Single-View Diffusion Model for Pseudo-Stereo 3D Object Detection
Figure 2 for SVDM: Single-View Diffusion Model for Pseudo-Stereo 3D Object Detection
Figure 3 for SVDM: Single-View Diffusion Model for Pseudo-Stereo 3D Object Detection
Figure 4 for SVDM: Single-View Diffusion Model for Pseudo-Stereo 3D Object Detection
Viaarxiv icon

What Matters in Training a GPT4-Style Language Model with Multimodal Inputs?

Jul 05, 2023
Yan Zeng, Hanbo Zhang, Jiani Zheng, Jiangnan Xia, Guoqiang Wei, Yang Wei, Yuchen Zhang, Tao Kong

Figure 1 for What Matters in Training a GPT4-Style Language Model with Multimodal Inputs?
Figure 2 for What Matters in Training a GPT4-Style Language Model with Multimodal Inputs?
Figure 3 for What Matters in Training a GPT4-Style Language Model with Multimodal Inputs?
Figure 4 for What Matters in Training a GPT4-Style Language Model with Multimodal Inputs?
Viaarxiv icon

Additive Decoders for Latent Variables Identification and Cartesian-Product Extrapolation

Jul 05, 2023
Sébastien Lachapelle, Divyat Mahajan, Ioannis Mitliagkas, Simon Lacoste-Julien

Viaarxiv icon

Unsupervised Intrinsic Image Decomposition with LiDAR Intensity

Mar 28, 2023
Shogo Sato, Yasuhiro Yao, Taiga Yoshida, Takuhiro Kaneko, Shingo Ando, Jun Shimamura

Figure 1 for Unsupervised Intrinsic Image Decomposition with LiDAR Intensity
Figure 2 for Unsupervised Intrinsic Image Decomposition with LiDAR Intensity
Figure 3 for Unsupervised Intrinsic Image Decomposition with LiDAR Intensity
Figure 4 for Unsupervised Intrinsic Image Decomposition with LiDAR Intensity
Viaarxiv icon

Novel Categories Discovery from probability matrix perspective

Jul 07, 2023
Zahid Hasan, Abu Zaher Md Faridee, Masud Ahmed, Sanjay Purushotham, Heesung Kwon, Hyungtae Lee, Nirmalya Roy

Figure 1 for Novel Categories Discovery from probability matrix perspective
Figure 2 for Novel Categories Discovery from probability matrix perspective
Figure 3 for Novel Categories Discovery from probability matrix perspective
Figure 4 for Novel Categories Discovery from probability matrix perspective
Viaarxiv icon

Intelligent Robotic Sonographer: Mutual Information-based Disentangled Reward Learning from Few Demonstrations

Jul 07, 2023
Zhongliang Jiang, Yuan Bi, Mingchuan Zhou, Ying Hu, Michael Burke, and Nassir Navab

Figure 1 for Intelligent Robotic Sonographer: Mutual Information-based Disentangled Reward Learning from Few Demonstrations
Figure 2 for Intelligent Robotic Sonographer: Mutual Information-based Disentangled Reward Learning from Few Demonstrations
Figure 3 for Intelligent Robotic Sonographer: Mutual Information-based Disentangled Reward Learning from Few Demonstrations
Figure 4 for Intelligent Robotic Sonographer: Mutual Information-based Disentangled Reward Learning from Few Demonstrations
Viaarxiv icon

Language-free Compositional Action Generation via Decoupling Refinement

Jul 07, 2023
Xiao Liu, Guangyi Chen, Yansong Tang, Guangrun Wang, Ser-Nam Lim

Figure 1 for Language-free Compositional Action Generation via Decoupling Refinement
Figure 2 for Language-free Compositional Action Generation via Decoupling Refinement
Figure 3 for Language-free Compositional Action Generation via Decoupling Refinement
Figure 4 for Language-free Compositional Action Generation via Decoupling Refinement
Viaarxiv icon

Equivariant Single View Pose Prediction Via Induced and Restricted Representations

Jul 07, 2023
Owen Howell, David Klee, Ondrej Biza, Linfeng Zhao, Robin Walters

Figure 1 for Equivariant Single View Pose Prediction Via Induced and Restricted Representations
Figure 2 for Equivariant Single View Pose Prediction Via Induced and Restricted Representations
Figure 3 for Equivariant Single View Pose Prediction Via Induced and Restricted Representations
Figure 4 for Equivariant Single View Pose Prediction Via Induced and Restricted Representations
Viaarxiv icon

Read, Look or Listen? What's Needed for Solving a Multimodal Dataset

Jul 06, 2023
Netta Madvil, Yonatan Bitton, Roy Schwartz

Figure 1 for Read, Look or Listen? What's Needed for Solving a Multimodal Dataset
Figure 2 for Read, Look or Listen? What's Needed for Solving a Multimodal Dataset
Figure 3 for Read, Look or Listen? What's Needed for Solving a Multimodal Dataset
Figure 4 for Read, Look or Listen? What's Needed for Solving a Multimodal Dataset
Viaarxiv icon

Cross-Modal Content Inference and Feature Enrichment for Cold-Start Recommendation

Jul 06, 2023
Haokai Ma, Zhuang Qi, Xinxin Dong, Xiangxian Li, Yuze Zheng, Xiangxu Mengand Lei Meng

Figure 1 for Cross-Modal Content Inference and Feature Enrichment for Cold-Start Recommendation
Figure 2 for Cross-Modal Content Inference and Feature Enrichment for Cold-Start Recommendation
Figure 3 for Cross-Modal Content Inference and Feature Enrichment for Cold-Start Recommendation
Figure 4 for Cross-Modal Content Inference and Feature Enrichment for Cold-Start Recommendation
Viaarxiv icon