Picture for Yuxuan Zhang

Yuxuan Zhang

GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning

Add code
Jul 02, 2025
Viaarxiv icon

Enhancing Vector Quantization with Distributional Matching: A Theoretical and Empirical Study

Add code
Jun 18, 2025
Viaarxiv icon

WikiGap: Promoting Epistemic Equity by Surfacing Knowledge Gaps Between English Wikipedia and other Language Editions

Add code
May 30, 2025
Viaarxiv icon

EasyText: Controllable Diffusion Transformer for Multilingual Text Rendering

Add code
May 30, 2025
Viaarxiv icon

StructEval: Benchmarking LLMs' Capabilities to Generate Structural Outputs

Add code
May 26, 2025
Viaarxiv icon

Two-way Evidence self-Alignment based Dual-Gated Reasoning Enhancement

Add code
May 22, 2025
Viaarxiv icon

CineTechBench: A Benchmark for Cinematographic Technique Understanding and Generation

Add code
May 21, 2025
Viaarxiv icon

lmgame-Bench: How Good are LLMs at Playing Games?

Add code
May 21, 2025
Viaarxiv icon

PLAICraft: Large-Scale Time-Aligned Vision-Speech-Action Dataset for Embodied AI

Add code
May 19, 2025
Viaarxiv icon

Adaptive Noise Resilient Keyword Spotting Using One-Shot Learning

Add code
May 14, 2025
Viaarxiv icon