Picture for Liqiang Nie

Liqiang Nie

Global Prior Meets Local Consistency: Dual-Memory Augmented Vision-Language-Action Model for Efficient Robotic Manipulation

Add code
Feb 22, 2026
Viaarxiv icon

StructAlign: Structured Cross-Modal Alignment for Continual Text-to-Video Retrieval

Add code
Jan 28, 2026
Viaarxiv icon

AR-Omni: A Unified Autoregressive Model for Any-to-Any Generation

Add code
Jan 25, 2026
Viaarxiv icon

Omni-R1: Towards the Unified Generative Paradigm for Multimodal Reasoning

Add code
Jan 14, 2026
Viaarxiv icon

PersonalAlign: Hierarchical Implicit Intent Alignment for Personalized GUI Agent with Long-Term User-Centric Records

Add code
Jan 14, 2026
Viaarxiv icon

SemanticVLA: Semantic-Aligned Sparsification and Enhancement for Efficient Robotic Manipulation

Add code
Nov 13, 2025
Figure 1 for SemanticVLA: Semantic-Aligned Sparsification and Enhancement for Efficient Robotic Manipulation
Figure 2 for SemanticVLA: Semantic-Aligned Sparsification and Enhancement for Efficient Robotic Manipulation
Figure 3 for SemanticVLA: Semantic-Aligned Sparsification and Enhancement for Efficient Robotic Manipulation
Figure 4 for SemanticVLA: Semantic-Aligned Sparsification and Enhancement for Efficient Robotic Manipulation
Viaarxiv icon

A Polynomial-time Algorithm for Online Sparse Linear Regression with Improved Regret Bound under Weaker Conditions

Add code
Oct 31, 2025
Viaarxiv icon

Open Multimodal Retrieval-Augmented Factual Image Generation

Add code
Oct 26, 2025
Figure 1 for Open Multimodal Retrieval-Augmented Factual Image Generation
Figure 2 for Open Multimodal Retrieval-Augmented Factual Image Generation
Figure 3 for Open Multimodal Retrieval-Augmented Factual Image Generation
Figure 4 for Open Multimodal Retrieval-Augmented Factual Image Generation
Viaarxiv icon

Reasoning in the Dark: Interleaved Vision-Text Reasoning in Latent Space

Add code
Oct 14, 2025
Figure 1 for Reasoning in the Dark: Interleaved Vision-Text Reasoning in Latent Space
Figure 2 for Reasoning in the Dark: Interleaved Vision-Text Reasoning in Latent Space
Figure 3 for Reasoning in the Dark: Interleaved Vision-Text Reasoning in Latent Space
Figure 4 for Reasoning in the Dark: Interleaved Vision-Text Reasoning in Latent Space
Viaarxiv icon

TTOM: Test-Time Optimization and Memorization for Compositional Video Generation

Add code
Oct 09, 2025
Viaarxiv icon