Picture for Wenhao Wu

Wenhao Wu

Watch Every Step! LLM Agent Learning via Iterative Step-Level Process Refinement

Add code
Jun 17, 2024
Figure 1 for Watch Every Step! LLM Agent Learning via Iterative Step-Level Process Refinement
Figure 2 for Watch Every Step! LLM Agent Learning via Iterative Step-Level Process Refinement
Figure 3 for Watch Every Step! LLM Agent Learning via Iterative Step-Level Process Refinement
Figure 4 for Watch Every Step! LLM Agent Learning via Iterative Step-Level Process Refinement
Viaarxiv icon

Dense Connector for MLLMs

Add code
May 22, 2024
Figure 1 for Dense Connector for MLLMs
Figure 2 for Dense Connector for MLLMs
Figure 3 for Dense Connector for MLLMs
Figure 4 for Dense Connector for MLLMs
Viaarxiv icon

FreeVA: Offline MLLM as Training-Free Video Assistant

Add code
May 13, 2024
Figure 1 for FreeVA: Offline MLLM as Training-Free Video Assistant
Figure 2 for FreeVA: Offline MLLM as Training-Free Video Assistant
Figure 3 for FreeVA: Offline MLLM as Training-Free Video Assistant
Figure 4 for FreeVA: Offline MLLM as Training-Free Video Assistant
Viaarxiv icon

Long Context Alignment with Short Instructions and Synthesized Positions

Add code
May 07, 2024
Figure 1 for Long Context Alignment with Short Instructions and Synthesized Positions
Figure 2 for Long Context Alignment with Short Instructions and Synthesized Positions
Figure 3 for Long Context Alignment with Short Instructions and Synthesized Positions
Figure 4 for Long Context Alignment with Short Instructions and Synthesized Positions
Viaarxiv icon

Retrieval Head Mechanistically Explains Long-Context Factuality

Add code
Apr 24, 2024
Figure 1 for Retrieval Head Mechanistically Explains Long-Context Factuality
Figure 2 for Retrieval Head Mechanistically Explains Long-Context Factuality
Figure 3 for Retrieval Head Mechanistically Explains Long-Context Factuality
Figure 4 for Retrieval Head Mechanistically Explains Long-Context Factuality
Viaarxiv icon

LongEmbed: Extending Embedding Models for Long Context Retrieval

Add code
Apr 18, 2024
Figure 1 for LongEmbed: Extending Embedding Models for Long Context Retrieval
Figure 2 for LongEmbed: Extending Embedding Models for Long Context Retrieval
Figure 3 for LongEmbed: Extending Embedding Models for Long Context Retrieval
Figure 4 for LongEmbed: Extending Embedding Models for Long Context Retrieval
Viaarxiv icon

CoUDA: Coherence Evaluation via Unified Data Augmentation

Add code
Mar 31, 2024
Viaarxiv icon

DetToolChain: A New Prompting Paradigm to Unleash Detection Ability of MLLM

Add code
Mar 19, 2024
Figure 1 for DetToolChain: A New Prompting Paradigm to Unleash Detection Ability of MLLM
Figure 2 for DetToolChain: A New Prompting Paradigm to Unleash Detection Ability of MLLM
Figure 3 for DetToolChain: A New Prompting Paradigm to Unleash Detection Ability of MLLM
Figure 4 for DetToolChain: A New Prompting Paradigm to Unleash Detection Ability of MLLM
Viaarxiv icon

MetaSplit: Meta-Split Network for Limited-Stock Product Recommendation

Add code
Mar 16, 2024
Figure 1 for MetaSplit: Meta-Split Network for Limited-Stock Product Recommendation
Figure 2 for MetaSplit: Meta-Split Network for Limited-Stock Product Recommendation
Figure 3 for MetaSplit: Meta-Split Network for Limited-Stock Product Recommendation
Figure 4 for MetaSplit: Meta-Split Network for Limited-Stock Product Recommendation
Viaarxiv icon

GPT4Ego: Unleashing the Potential of Pre-trained Models for Zero-Shot Egocentric Action Recognition

Add code
Jan 18, 2024
Viaarxiv icon