Picture for Yinwei Wei

Yinwei Wei

HABIT: Chrono-Synergia Robust Progressive Learning Framework for Composed Image Retrieval

Add code
Apr 20, 2026
Viaarxiv icon

INTENT: Invariance and Discrimination-aware Noise Mitigation for Robust Composed Image Retrieval

Add code
Apr 20, 2026
Viaarxiv icon

Context-Agent: Dynamic Discourse Trees for Non-Linear Dialogue

Add code
Apr 07, 2026
Viaarxiv icon

HINT: Composed Image Retrieval with Dual-path Compositional Contextualized Network

Add code
Mar 27, 2026
Viaarxiv icon

VideoTemp-o3: Harmonizing Temporal Grounding and Video Understanding in Agentic Thinking-with-Videos

Add code
Feb 08, 2026
Viaarxiv icon

Reasoning in the Dark: Interleaved Vision-Text Reasoning in Latent Space

Add code
Oct 14, 2025
Figure 1 for Reasoning in the Dark: Interleaved Vision-Text Reasoning in Latent Space
Figure 2 for Reasoning in the Dark: Interleaved Vision-Text Reasoning in Latent Space
Figure 3 for Reasoning in the Dark: Interleaved Vision-Text Reasoning in Latent Space
Figure 4 for Reasoning in the Dark: Interleaved Vision-Text Reasoning in Latent Space
Viaarxiv icon

MIST: Towards Multi-dimensional Implicit Bias and Stereotype Evaluation of LLMs via Theory of Mind

Add code
Jun 17, 2025
Viaarxiv icon

KEPLA: A Knowledge-Enhanced Deep Learning Framework for Accurate Protein-Ligand Binding Affinity Prediction

Add code
Jun 16, 2025
Figure 1 for KEPLA: A Knowledge-Enhanced Deep Learning Framework for Accurate Protein-Ligand Binding Affinity Prediction
Figure 2 for KEPLA: A Knowledge-Enhanced Deep Learning Framework for Accurate Protein-Ligand Binding Affinity Prediction
Figure 3 for KEPLA: A Knowledge-Enhanced Deep Learning Framework for Accurate Protein-Ligand Binding Affinity Prediction
Figure 4 for KEPLA: A Knowledge-Enhanced Deep Learning Framework for Accurate Protein-Ligand Binding Affinity Prediction
Viaarxiv icon

Mitigating Hallucination Through Theory-Consistent Symmetric Multimodal Preference Optimization

Add code
Jun 13, 2025
Figure 1 for Mitigating Hallucination Through Theory-Consistent Symmetric Multimodal Preference Optimization
Figure 2 for Mitigating Hallucination Through Theory-Consistent Symmetric Multimodal Preference Optimization
Figure 3 for Mitigating Hallucination Through Theory-Consistent Symmetric Multimodal Preference Optimization
Figure 4 for Mitigating Hallucination Through Theory-Consistent Symmetric Multimodal Preference Optimization
Viaarxiv icon

Dynamic Multimodal Fusion via Meta-Learning Towards Micro-Video Recommendation

Add code
Jan 13, 2025
Figure 1 for Dynamic Multimodal Fusion via Meta-Learning Towards Micro-Video Recommendation
Figure 2 for Dynamic Multimodal Fusion via Meta-Learning Towards Micro-Video Recommendation
Figure 3 for Dynamic Multimodal Fusion via Meta-Learning Towards Micro-Video Recommendation
Figure 4 for Dynamic Multimodal Fusion via Meta-Learning Towards Micro-Video Recommendation
Viaarxiv icon