Sinan Tan

Qwen2 Technical Report

Jul 16, 2024
Via arXiv

Qwen Technical Report

Sep 28, 2023
Via arXiv

Qwen-VL: A Versatile Vision-Language Model for Understanding, Localization, Text Reading, and Beyond

Sep 14, 2023
Via arXiv

OFASys: A Multi-Modal Multi-Task Learning System for Building Generalist Models

Dec 08, 2022
Via arXiv

Mixed Neural Voxels for Fast Multi-view Video Synthesis

Dec 01, 2022
Via arXiv

Embodied Referring Expression for Manipulation Question Answering in Interactive Environment

Oct 06, 2022
Via arXiv

An Automated Question-Answering Framework Based on Evolution Algorithm

Jan 26, 2022
Via arXiv

Self-supervised 3D Semantic Representation Learning for Vision-and-Language Navigation

Jan 26, 2022
Via arXiv

Knowledge-based Embodied Question Answering

Sep 16, 2021
Via arXiv

Towards Embodied Scene Description

May 07, 2020
Via arXiv