Picture for Bo Wang

Bo Wang

Tencent, WeChat Pay

jina-clip-v2: Multilingual Multimodal Embeddings for Text and Images

Add code
Dec 11, 2024
Figure 1 for jina-clip-v2: Multilingual Multimodal Embeddings for Text and Images
Figure 2 for jina-clip-v2: Multilingual Multimodal Embeddings for Text and Images
Figure 3 for jina-clip-v2: Multilingual Multimodal Embeddings for Text and Images
Figure 4 for jina-clip-v2: Multilingual Multimodal Embeddings for Text and Images
Viaarxiv icon

Expanding Performance Boundaries of Open-Source Multimodal Models with Model, Data, and Test-Time Scaling

Add code
Dec 06, 2024
Figure 1 for Expanding Performance Boundaries of Open-Source Multimodal Models with Model, Data, and Test-Time Scaling
Figure 2 for Expanding Performance Boundaries of Open-Source Multimodal Models with Model, Data, and Test-Time Scaling
Figure 3 for Expanding Performance Boundaries of Open-Source Multimodal Models with Model, Data, and Test-Time Scaling
Figure 4 for Expanding Performance Boundaries of Open-Source Multimodal Models with Model, Data, and Test-Time Scaling
Viaarxiv icon

Enhancing Instruction-Following Capability of Visual-Language Models by Reducing Image Redundancy

Add code
Nov 23, 2024
Figure 1 for Enhancing Instruction-Following Capability of Visual-Language Models by Reducing Image Redundancy
Figure 2 for Enhancing Instruction-Following Capability of Visual-Language Models by Reducing Image Redundancy
Figure 3 for Enhancing Instruction-Following Capability of Visual-Language Models by Reducing Image Redundancy
Figure 4 for Enhancing Instruction-Following Capability of Visual-Language Models by Reducing Image Redundancy
Viaarxiv icon

Gazing at Rewards: Eye Movements as a Lens into Human and AI Decision-Making in Hybrid Visual Foraging

Add code
Nov 14, 2024
Figure 1 for Gazing at Rewards: Eye Movements as a Lens into Human and AI Decision-Making in Hybrid Visual Foraging
Figure 2 for Gazing at Rewards: Eye Movements as a Lens into Human and AI Decision-Making in Hybrid Visual Foraging
Figure 3 for Gazing at Rewards: Eye Movements as a Lens into Human and AI Decision-Making in Hybrid Visual Foraging
Figure 4 for Gazing at Rewards: Eye Movements as a Lens into Human and AI Decision-Making in Hybrid Visual Foraging
Viaarxiv icon

Hunyuan-Large: An Open-Source MoE Model with 52 Billion Activated Parameters by Tencent

Add code
Nov 05, 2024
Figure 1 for Hunyuan-Large: An Open-Source MoE Model with 52 Billion Activated Parameters by Tencent
Figure 2 for Hunyuan-Large: An Open-Source MoE Model with 52 Billion Activated Parameters by Tencent
Figure 3 for Hunyuan-Large: An Open-Source MoE Model with 52 Billion Activated Parameters by Tencent
Figure 4 for Hunyuan-Large: An Open-Source MoE Model with 52 Billion Activated Parameters by Tencent
Viaarxiv icon

BitStack: Fine-Grained Size Control for Compressed Large Language Models in Variable Memory Environments

Add code
Oct 31, 2024
Figure 1 for BitStack: Fine-Grained Size Control for Compressed Large Language Models in Variable Memory Environments
Figure 2 for BitStack: Fine-Grained Size Control for Compressed Large Language Models in Variable Memory Environments
Figure 3 for BitStack: Fine-Grained Size Control for Compressed Large Language Models in Variable Memory Environments
Figure 4 for BitStack: Fine-Grained Size Control for Compressed Large Language Models in Variable Memory Environments
Viaarxiv icon

EZ-HOI: VLM Adaptation via Guided Prompt Learning for Zero-Shot HOI Detection

Add code
Oct 31, 2024
Figure 1 for EZ-HOI: VLM Adaptation via Guided Prompt Learning for Zero-Shot HOI Detection
Figure 2 for EZ-HOI: VLM Adaptation via Guided Prompt Learning for Zero-Shot HOI Detection
Figure 3 for EZ-HOI: VLM Adaptation via Guided Prompt Learning for Zero-Shot HOI Detection
Figure 4 for EZ-HOI: VLM Adaptation via Guided Prompt Learning for Zero-Shot HOI Detection
Viaarxiv icon

MassSpecGym: A benchmark for the discovery and identification of molecules

Add code
Oct 30, 2024
Viaarxiv icon

IDEATOR: Jailbreaking VLMs Using VLMs

Add code
Oct 29, 2024
Figure 1 for IDEATOR: Jailbreaking VLMs Using VLMs
Figure 2 for IDEATOR: Jailbreaking VLMs Using VLMs
Figure 3 for IDEATOR: Jailbreaking VLMs Using VLMs
Figure 4 for IDEATOR: Jailbreaking VLMs Using VLMs
Viaarxiv icon

metasnf: Meta Clustering with Similarity Network Fusion in R

Add code
Oct 23, 2024
Figure 1 for metasnf: Meta Clustering with Similarity Network Fusion in R
Figure 2 for metasnf: Meta Clustering with Similarity Network Fusion in R
Figure 3 for metasnf: Meta Clustering with Similarity Network Fusion in R
Figure 4 for metasnf: Meta Clustering with Similarity Network Fusion in R
Viaarxiv icon