Liqiang Nie

Omni-R1: Towards the Unified Generative Paradigm for Multimodal Reasoning

Jan 14, 2026
Via arXiv

PersonalAlign: Hierarchical Implicit Intent Alignment for Personalized GUI Agent with Long-Term User-Centric Records

Jan 14, 2026
Via arXiv

SemanticVLA: Semantic-Aligned Sparsification and Enhancement for Efficient Robotic Manipulation

Nov 13, 2025
Via arXiv

A Polynomial-time Algorithm for Online Sparse Linear Regression with Improved Regret Bound under Weaker Conditions

Oct 31, 2025
Via arXiv

Open Multimodal Retrieval-Augmented Factual Image Generation

Oct 26, 2025
Via arXiv

Reasoning in the Dark: Interleaved Vision-Text Reasoning in Latent Space

Oct 14, 2025
Via arXiv

Parallel Test-Time Scaling for Latent Reasoning Models

Oct 09, 2025
Via arXiv

IntentionVLA: Generalizable and Efficient Embodied Intention Reasoning for Human-Robot Interaction

Oct 09, 2025
Via arXiv

TTOM: Test-Time Optimization and Memorization for Compositional Video Generation

Oct 09, 2025
Via arXiv

Dual Knowledge-Enhanced Two-Stage Reasoner for Multimodal Dialog Systems

Sep 09, 2025
Via arXiv