Picture for Yue Yang

Yue Yang

Institute for Transport Studies, University of Leeds, Leeds LS2 9JT, UK

Molmo and PixMo: Open Weights and Open Data for State-of-the-Art Multimodal Models

Add code
Sep 25, 2024
Figure 1 for Molmo and PixMo: Open Weights and Open Data for State-of-the-Art Multimodal Models
Figure 2 for Molmo and PixMo: Open Weights and Open Data for State-of-the-Art Multimodal Models
Figure 3 for Molmo and PixMo: Open Weights and Open Data for State-of-the-Art Multimodal Models
Figure 4 for Molmo and PixMo: Open Weights and Open Data for State-of-the-Art Multimodal Models
Viaarxiv icon

Deep reinforcement learning for tracking a moving target in jellyfish-like swimming

Add code
Sep 13, 2024
Figure 1 for Deep reinforcement learning for tracking a moving target in jellyfish-like swimming
Figure 2 for Deep reinforcement learning for tracking a moving target in jellyfish-like swimming
Figure 3 for Deep reinforcement learning for tracking a moving target in jellyfish-like swimming
Figure 4 for Deep reinforcement learning for tracking a moving target in jellyfish-like swimming
Viaarxiv icon

RoomDiffusion: A Specialized Diffusion Model in the Interior Design Industry

Add code
Sep 05, 2024
Figure 1 for RoomDiffusion: A Specialized Diffusion Model in the Interior Design Industry
Figure 2 for RoomDiffusion: A Specialized Diffusion Model in the Interior Design Industry
Figure 3 for RoomDiffusion: A Specialized Diffusion Model in the Interior Design Industry
Figure 4 for RoomDiffusion: A Specialized Diffusion Model in the Interior Design Industry
Viaarxiv icon

Optimizing Automated Picking Systems in Warehouse Robots Using Machine Learning

Add code
Aug 29, 2024
Figure 1 for Optimizing Automated Picking Systems in Warehouse Robots Using Machine Learning
Figure 2 for Optimizing Automated Picking Systems in Warehouse Robots Using Machine Learning
Figure 3 for Optimizing Automated Picking Systems in Warehouse Robots Using Machine Learning
Figure 4 for Optimizing Automated Picking Systems in Warehouse Robots Using Machine Learning
Viaarxiv icon

Unlocking the Potential: Benchmarking Large Language Models in Water Engineering and Research

Add code
Jul 22, 2024
Figure 1 for Unlocking the Potential: Benchmarking Large Language Models in Water Engineering and Research
Figure 2 for Unlocking the Potential: Benchmarking Large Language Models in Water Engineering and Research
Figure 3 for Unlocking the Potential: Benchmarking Large Language Models in Water Engineering and Research
Figure 4 for Unlocking the Potential: Benchmarking Large Language Models in Water Engineering and Research
Viaarxiv icon

Data Adaptive Traceback for Vision-Language Foundation Models in Image Classification

Add code
Jul 11, 2024
Viaarxiv icon

BACON: Supercharge Your VLM with Bag-of-Concept Graph to Mitigate Hallucinations

Add code
Jul 03, 2024
Figure 1 for BACON: Supercharge Your VLM with Bag-of-Concept Graph to Mitigate Hallucinations
Figure 2 for BACON: Supercharge Your VLM with Bag-of-Concept Graph to Mitigate Hallucinations
Figure 3 for BACON: Supercharge Your VLM with Bag-of-Concept Graph to Mitigate Hallucinations
Figure 4 for BACON: Supercharge Your VLM with Bag-of-Concept Graph to Mitigate Hallucinations
Viaarxiv icon

Generative prediction of flow field based on the diffusion model

Add code
Jun 30, 2024
Figure 1 for Generative prediction of flow field based on the diffusion model
Figure 2 for Generative prediction of flow field based on the diffusion model
Figure 3 for Generative prediction of flow field based on the diffusion model
Figure 4 for Generative prediction of flow field based on the diffusion model
Viaarxiv icon

PhyBench: A Physical Commonsense Benchmark for Evaluating Text-to-Image Models

Add code
Jun 17, 2024
Figure 1 for PhyBench: A Physical Commonsense Benchmark for Evaluating Text-to-Image Models
Figure 2 for PhyBench: A Physical Commonsense Benchmark for Evaluating Text-to-Image Models
Figure 3 for PhyBench: A Physical Commonsense Benchmark for Evaluating Text-to-Image Models
Figure 4 for PhyBench: A Physical Commonsense Benchmark for Evaluating Text-to-Image Models
Viaarxiv icon

Rethinking Human Evaluation Protocol for Text-to-Video Models: Enhancing Reliability,Reproducibility, and Practicality

Add code
Jun 13, 2024
Figure 1 for Rethinking Human Evaluation Protocol for Text-to-Video Models: Enhancing Reliability,Reproducibility, and Practicality
Figure 2 for Rethinking Human Evaluation Protocol for Text-to-Video Models: Enhancing Reliability,Reproducibility, and Practicality
Figure 3 for Rethinking Human Evaluation Protocol for Text-to-Video Models: Enhancing Reliability,Reproducibility, and Practicality
Figure 4 for Rethinking Human Evaluation Protocol for Text-to-Video Models: Enhancing Reliability,Reproducibility, and Practicality
Viaarxiv icon