Picture for Qiang Zhou

Qiang Zhou

WoW: Towards a World omniscient World model Through Embodied Interaction

Add code
Sep 26, 2025
Viaarxiv icon

MARS2 2025 Challenge on Multimodal Reasoning: Datasets, Methods, Results, Discussion, and Outlook

Add code
Sep 17, 2025
Figure 1 for MARS2 2025 Challenge on Multimodal Reasoning: Datasets, Methods, Results, Discussion, and Outlook
Figure 2 for MARS2 2025 Challenge on Multimodal Reasoning: Datasets, Methods, Results, Discussion, and Outlook
Figure 3 for MARS2 2025 Challenge on Multimodal Reasoning: Datasets, Methods, Results, Discussion, and Outlook
Figure 4 for MARS2 2025 Challenge on Multimodal Reasoning: Datasets, Methods, Results, Discussion, and Outlook
Viaarxiv icon

MMG-Vid: Maximizing Marginal Gains at Segment-level and Token-level for Efficient Video LLMs

Add code
Aug 28, 2025
Figure 1 for MMG-Vid: Maximizing Marginal Gains at Segment-level and Token-level for Efficient Video LLMs
Figure 2 for MMG-Vid: Maximizing Marginal Gains at Segment-level and Token-level for Efficient Video LLMs
Figure 3 for MMG-Vid: Maximizing Marginal Gains at Segment-level and Token-level for Efficient Video LLMs
Figure 4 for MMG-Vid: Maximizing Marginal Gains at Segment-level and Token-level for Efficient Video LLMs
Viaarxiv icon

Active management of battery degradation in wireless sensor network using deep reinforcement learning for group battery replacement

Add code
Mar 20, 2025
Figure 1 for Active management of battery degradation in wireless sensor network using deep reinforcement learning for group battery replacement
Figure 2 for Active management of battery degradation in wireless sensor network using deep reinforcement learning for group battery replacement
Figure 3 for Active management of battery degradation in wireless sensor network using deep reinforcement learning for group battery replacement
Figure 4 for Active management of battery degradation in wireless sensor network using deep reinforcement learning for group battery replacement
Viaarxiv icon

EmpathyAgent: Can Embodied Agents Conduct Empathetic Actions?

Add code
Mar 19, 2025
Viaarxiv icon

Raccoon: Multi-stage Diffusion Training with Coarse-to-Fine Curating Videos

Add code
Feb 28, 2025
Viaarxiv icon

Neural Directed Speech Enhancement with Dual Microphone Array in High Noise Scenario

Add code
Dec 24, 2024
Figure 1 for Neural Directed Speech Enhancement with Dual Microphone Array in High Noise Scenario
Figure 2 for Neural Directed Speech Enhancement with Dual Microphone Array in High Noise Scenario
Figure 3 for Neural Directed Speech Enhancement with Dual Microphone Array in High Noise Scenario
Figure 4 for Neural Directed Speech Enhancement with Dual Microphone Array in High Noise Scenario
Viaarxiv icon

DLF: Disentangled-Language-Focused Multimodal Sentiment Analysis

Add code
Dec 16, 2024
Viaarxiv icon

KAAE: Numerical Reasoning for Knowledge Graphs via Knowledge-aware Attributes Learning

Add code
Nov 20, 2024
Figure 1 for KAAE: Numerical Reasoning for Knowledge Graphs via Knowledge-aware Attributes Learning
Figure 2 for KAAE: Numerical Reasoning for Knowledge Graphs via Knowledge-aware Attributes Learning
Figure 3 for KAAE: Numerical Reasoning for Knowledge Graphs via Knowledge-aware Attributes Learning
Figure 4 for KAAE: Numerical Reasoning for Knowledge Graphs via Knowledge-aware Attributes Learning
Viaarxiv icon

Motion Control for Enhanced Complex Action Video Generation

Add code
Nov 13, 2024
Figure 1 for Motion Control for Enhanced Complex Action Video Generation
Figure 2 for Motion Control for Enhanced Complex Action Video Generation
Figure 3 for Motion Control for Enhanced Complex Action Video Generation
Figure 4 for Motion Control for Enhanced Complex Action Video Generation
Viaarxiv icon