Picture for Jiarui Zhang

Jiarui Zhang

DiffGraph: An Automated Agent-driven Model Merging Framework for In-the-Wild Text-to-Image Generation

Add code
Mar 20, 2026
Viaarxiv icon

AgroNVILA: Perception-Reasoning Decoupling for Multi-view Agricultural Multimodal Large Language Models

Add code
Mar 15, 2026
Viaarxiv icon

MM-Zero: Self-Evolving Multi-Model Vision Language Models From Zero Data

Add code
Mar 10, 2026
Viaarxiv icon

High-order Knowledge Based Network Controllability Robustness Prediction: A Hypergraph Neural Network Approach

Add code
Feb 28, 2026
Viaarxiv icon

High-Speed Vision-Based Flight in Clutter with Safety-Shielded Reinforcement Learning

Add code
Feb 09, 2026
Viaarxiv icon

AREAL-DTA: Dynamic Tree Attention for Efficient Reinforcement Learning of Large Language Models

Add code
Jan 31, 2026
Viaarxiv icon

LitVISTA: A Benchmark for Narrative Orchestration in Literary Text

Add code
Jan 10, 2026
Viaarxiv icon

M4Human: A Large-Scale Multimodal mmWave Radar Benchmark for Human Mesh Reconstruction

Add code
Dec 17, 2025
Figure 1 for M4Human: A Large-Scale Multimodal mmWave Radar Benchmark for Human Mesh Reconstruction
Figure 2 for M4Human: A Large-Scale Multimodal mmWave Radar Benchmark for Human Mesh Reconstruction
Figure 3 for M4Human: A Large-Scale Multimodal mmWave Radar Benchmark for Human Mesh Reconstruction
Figure 4 for M4Human: A Large-Scale Multimodal mmWave Radar Benchmark for Human Mesh Reconstruction
Viaarxiv icon

VP-AutoTest: A Virtual-Physical Fusion Autonomous Driving Testing Platform

Add code
Dec 08, 2025
Viaarxiv icon

MonkeyOCR v1.5 Technical Report: Unlocking Robust Document Parsing for Complex Patterns

Add code
Nov 16, 2025
Viaarxiv icon