Picture for Liqiang Nie

Liqiang Nie

A Polynomial-time Algorithm for Online Sparse Linear Regression with Improved Regret Bound under Weaker Conditions

Add code
Oct 31, 2025
Viaarxiv icon

Open Multimodal Retrieval-Augmented Factual Image Generation

Add code
Oct 26, 2025
Viaarxiv icon

Reasoning in the Dark: Interleaved Vision-Text Reasoning in Latent Space

Add code
Oct 14, 2025
Figure 1 for Reasoning in the Dark: Interleaved Vision-Text Reasoning in Latent Space
Figure 2 for Reasoning in the Dark: Interleaved Vision-Text Reasoning in Latent Space
Figure 3 for Reasoning in the Dark: Interleaved Vision-Text Reasoning in Latent Space
Figure 4 for Reasoning in the Dark: Interleaved Vision-Text Reasoning in Latent Space
Viaarxiv icon

Parallel Test-Time Scaling for Latent Reasoning Models

Add code
Oct 09, 2025
Viaarxiv icon

TTOM: Test-Time Optimization and Memorization for Compositional Video Generation

Add code
Oct 09, 2025
Viaarxiv icon

IntentionVLA: Generalizable and Efficient Embodied Intention Reasoning for Human-Robot Interaction

Add code
Oct 09, 2025
Viaarxiv icon

Dual Knowledge-Enhanced Two-Stage Reasoner for Multimodal Dialog Systems

Add code
Sep 09, 2025
Figure 1 for Dual Knowledge-Enhanced Two-Stage Reasoner for Multimodal Dialog Systems
Figure 2 for Dual Knowledge-Enhanced Two-Stage Reasoner for Multimodal Dialog Systems
Figure 3 for Dual Knowledge-Enhanced Two-Stage Reasoner for Multimodal Dialog Systems
Figure 4 for Dual Knowledge-Enhanced Two-Stage Reasoner for Multimodal Dialog Systems
Viaarxiv icon

CogVLA: Cognition-Aligned Vision-Language-Action Model via Instruction-Driven Routing & Sparsification

Add code
Aug 28, 2025
Figure 1 for CogVLA: Cognition-Aligned Vision-Language-Action Model via Instruction-Driven Routing & Sparsification
Figure 2 for CogVLA: Cognition-Aligned Vision-Language-Action Model via Instruction-Driven Routing & Sparsification
Figure 3 for CogVLA: Cognition-Aligned Vision-Language-Action Model via Instruction-Driven Routing & Sparsification
Figure 4 for CogVLA: Cognition-Aligned Vision-Language-Action Model via Instruction-Driven Routing & Sparsification
Viaarxiv icon

Fair Deepfake Detectors Can Generalize

Add code
Jul 03, 2025
Viaarxiv icon

KEPLA: A Knowledge-Enhanced Deep Learning Framework for Accurate Protein-Ligand Binding Affinity Prediction

Add code
Jun 16, 2025
Figure 1 for KEPLA: A Knowledge-Enhanced Deep Learning Framework for Accurate Protein-Ligand Binding Affinity Prediction
Figure 2 for KEPLA: A Knowledge-Enhanced Deep Learning Framework for Accurate Protein-Ligand Binding Affinity Prediction
Figure 3 for KEPLA: A Knowledge-Enhanced Deep Learning Framework for Accurate Protein-Ligand Binding Affinity Prediction
Figure 4 for KEPLA: A Knowledge-Enhanced Deep Learning Framework for Accurate Protein-Ligand Binding Affinity Prediction
Viaarxiv icon