Picture for Wei Jia

Wei Jia

StableToken: A Noise-Robust Semantic Speech Tokenizer for Resilient SpeechLLMs

Add code
Sep 26, 2025
Viaarxiv icon

LatticeWorld: A Multimodal Large Language Model-Empowered Framework for Interactive Complex World Generation

Add code
Sep 05, 2025
Viaarxiv icon

GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning

Add code
Jul 02, 2025
Figure 1 for GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning
Figure 2 for GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning
Figure 3 for GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning
Figure 4 for GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning
Viaarxiv icon

Seed1.5-VL Technical Report

Add code
May 11, 2025
Viaarxiv icon

Understanding Stragglers in Large Model Training Using What-if Analysis

Add code
May 09, 2025
Viaarxiv icon

OVERLORD: Ultimate Scaling of DataLoader for Multi-Source Large Foundation Model Training

Add code
Apr 14, 2025
Viaarxiv icon

PVTree: Realistic and Controllable Palm Vein Generation for Recognition Tasks

Add code
Mar 04, 2025
Viaarxiv icon

PoAct: Policy and Action Dual-Control Agent for Generalized Applications

Add code
Jan 13, 2025
Viaarxiv icon

Deep Learning in Palmprint Recognition-A Comprehensive Survey

Add code
Jan 02, 2025
Figure 1 for Deep Learning in Palmprint Recognition-A Comprehensive Survey
Figure 2 for Deep Learning in Palmprint Recognition-A Comprehensive Survey
Figure 3 for Deep Learning in Palmprint Recognition-A Comprehensive Survey
Figure 4 for Deep Learning in Palmprint Recognition-A Comprehensive Survey
Viaarxiv icon

LegalAgentBench: Evaluating LLM Agents in Legal Domain

Add code
Dec 23, 2024
Viaarxiv icon