Lidong Bing

2.5 Years in Class: A Multimodal Textbook for Vision-Language Pretraining

Jan 03, 2025

M-Longdoc: A Benchmark For Multimodal Super-Long Document Understanding And A Retrieval-Aware Tuning Framework

Nov 09, 2024

Chain of Ideas: Revolutionizing Research Via Novel Idea Development with LLM Agents

Oct 25, 2024

Breaking the Memory Barrier: Near Infinite Batch Size Scaling for Contrastive Loss

Oct 22, 2024

Chain of Ideas: Revolutionizing Research in Novel Idea Development with LLM Agents

Oct 17, 2024

Stabilize the Latent Space for Image Autoregressive Modeling: A Unified Perspective

Oct 16, 2024

The Curse of Multi-Modalities: Evaluating Hallucinations of Large Multimodal Models across Language, Visual, and Audio

Oct 16, 2024

Reasoning Paths Optimization: Learning to Reason and Explore From Diverse Paths

Oct 07, 2024

Can We Further Elicit Reasoning in LLMs? Critic-Guided Planning with Retrieval-Augmentation for Solving Challenging Tasks

Oct 02, 2024

AMR-Evol: Adaptive Modular Response Evolution Elicits Better Knowledge Distillation for Large Language Models in Code Generation

Oct 01, 2024