Picture for Xin Zhang

Xin Zhang

Shenyang Institute of Automation, Chinese Academy of Sciences, Shenyang, China, School of Computing, University of Portsmouth, Portsmouth, United Kingdom

PVMark: Enabling Public Verifiability for LLM Watermarking Schemes

Add code
Oct 30, 2025
Viaarxiv icon

CityRiSE: Reasoning Urban Socio-Economic Status in Vision-Language Models via Reinforcement Learning

Add code
Oct 25, 2025
Viaarxiv icon

CCrepairBench: A High-Fidelity Benchmark and Reinforcement Learning Framework for C++ Compilation Repair

Add code
Sep 19, 2025
Figure 1 for CCrepairBench: A High-Fidelity Benchmark and Reinforcement Learning Framework for C++ Compilation Repair
Figure 2 for CCrepairBench: A High-Fidelity Benchmark and Reinforcement Learning Framework for C++ Compilation Repair
Figure 3 for CCrepairBench: A High-Fidelity Benchmark and Reinforcement Learning Framework for C++ Compilation Repair
Figure 4 for CCrepairBench: A High-Fidelity Benchmark and Reinforcement Learning Framework for C++ Compilation Repair
Viaarxiv icon

RPG: A Repository Planning Graph for Unified and Scalable Codebase Generation

Add code
Sep 19, 2025
Figure 1 for RPG: A Repository Planning Graph for Unified and Scalable Codebase Generation
Figure 2 for RPG: A Repository Planning Graph for Unified and Scalable Codebase Generation
Figure 3 for RPG: A Repository Planning Graph for Unified and Scalable Codebase Generation
Figure 4 for RPG: A Repository Planning Graph for Unified and Scalable Codebase Generation
Viaarxiv icon

Human Motion Video Generation: A Survey

Add code
Sep 04, 2025
Viaarxiv icon

ERF-BA-TFD+: A Multimodal Model for Audio-Visual Deepfake Detection

Add code
Aug 24, 2025
Figure 1 for ERF-BA-TFD+: A Multimodal Model for Audio-Visual Deepfake Detection
Figure 2 for ERF-BA-TFD+: A Multimodal Model for Audio-Visual Deepfake Detection
Figure 3 for ERF-BA-TFD+: A Multimodal Model for Audio-Visual Deepfake Detection
Figure 4 for ERF-BA-TFD+: A Multimodal Model for Audio-Visual Deepfake Detection
Viaarxiv icon

Condition Weaving Meets Expert Modulation: Towards Universal and Controllable Image Generation

Add code
Aug 24, 2025
Viaarxiv icon

Deep Learning for Taxol Exposure Analysis: A New Cell Image Dataset and Attention-Based Baseline Model

Add code
Aug 20, 2025
Viaarxiv icon

Two-dimensional Sparse Parallelism for Large Scale Deep Learning Recommendation Model Training

Add code
Aug 05, 2025
Viaarxiv icon

On The Role of Pretrained Language Models in General-Purpose Text Embeddings: A Survey

Add code
Jul 28, 2025
Viaarxiv icon