
Liqiang Nie

Reasoning in the Dark: Interleaved Vision-Text Reasoning in Latent Space

Oct 14, 2025

IntentionVLA: Generalizable and Efficient Embodied Intention Reasoning for Human-Robot Interaction

Oct 09, 2025

Parallel Test-Time Scaling for Latent Reasoning Models

Oct 09, 2025

TTOM: Test-Time Optimization and Memorization for Compositional Video Generation

Oct 09, 2025

Dual Knowledge-Enhanced Two-Stage Reasoner for Multimodal Dialog Systems

Sep 09, 2025

CogVLA: Cognition-Aligned Vision-Language-Action Model via Instruction-Driven Routing & Sparsification

Aug 28, 2025

Fair Deepfake Detectors Can Generalize

Jul 03, 2025

KEPLA: A Knowledge-Enhanced Deep Learning Framework for Accurate Protein-Ligand Binding Affinity Prediction

Jun 16, 2025

Mitigating Hallucination Through Theory-Consistent Symmetric Multimodal Preference Optimization

Jun 13, 2025

Mirage-1: Augmenting and Updating GUI Agent with Hierarchical Multimodal Skills

Jun 12, 2025