Pei Zhang

ConText: Driving In-context Learning for Text Removal and Segmentation

Jun 04, 2025

Qwen3 Technical Report

May 14, 2025

PolyMath: Evaluating Mathematical Reasoning in Multilingual Contexts

Apr 30, 2025

Falcon: Fractional Alternating Cut with Overcoming Minima in Unsupervised Segmentation

Apr 08, 2025

Sampling-Efficient Test-Time Scaling: Self-Estimating the Best-of-N Sampling in Early Decoding

Mar 03, 2025

WeVibe: Weight Change Estimation Through Audio-Induced Shelf Vibrations In Autonomous Stores

Feb 17, 2025

MinMo: A Multimodal Large Language Model for Seamless Voice Interaction

Jan 10, 2025

A Separable Self-attention Inspired by the State Space Model for Computer Vision

Jan 03, 2025

MATEY: multiscale adaptive foundation models for spatiotemporal physical systems

Dec 29, 2024

Qwen2.5 Technical Report

Dec 19, 2024