Picture for Shiliang Zhang

Shiliang Zhang

Prompt-Anchored Vision-Text Distillation for Lifelong Person Re-identification

Add code
May 06, 2026
Viaarxiv icon

AI and Open-data Driven Scalable Solar Power Profiling

Add code
May 04, 2026
Viaarxiv icon

Security, privacy, and agentic AI in a regulatory view: From definitions and distinctions to provisions and reflections

Add code
Mar 19, 2026
Viaarxiv icon

Looping Back to Move Forward: Recursive Transformers for Efficient and Flexible Large Multimodal Models

Add code
Feb 09, 2026
Viaarxiv icon

FlattenGPT: Depth Compression for Transformer with Layer Flattening

Add code
Feb 09, 2026
Viaarxiv icon

PixelGen: Pixel Diffusion Beats Latent Diffusion with Perceptual Loss

Add code
Feb 02, 2026
Viaarxiv icon

SLAM-LLM: A Modular, Open-Source Multimodal Large Language Model Framework and Best Practice for Speech, Language, Audio and Music Processing

Add code
Jan 14, 2026
Viaarxiv icon

MoCo: Motion-Consistent Human Video Generation via Structure-Appearance Decoupling

Add code
Aug 24, 2025
Viaarxiv icon

NN-Former: Rethinking Graph Structure in Neural Architecture Representation

Add code
Jul 01, 2025
Figure 1 for NN-Former: Rethinking Graph Structure in Neural Architecture Representation
Figure 2 for NN-Former: Rethinking Graph Structure in Neural Architecture Representation
Figure 3 for NN-Former: Rethinking Graph Structure in Neural Architecture Representation
Viaarxiv icon

MagCache: Fast Video Generation with Magnitude-Aware Cache

Add code
Jun 10, 2025
Viaarxiv icon