Picture for Shiyu Huang

Shiyu Huang

CogVLM2: Visual Language Models for Image and Video Understanding

Add code
Aug 29, 2024
Figure 1 for CogVLM2: Visual Language Models for Image and Video Understanding
Figure 2 for CogVLM2: Visual Language Models for Image and Video Understanding
Figure 3 for CogVLM2: Visual Language Models for Image and Video Understanding
Figure 4 for CogVLM2: Visual Language Models for Image and Video Understanding
Viaarxiv icon

CogVideoX: Text-to-Video Diffusion Models with An Expert Transformer

Add code
Aug 12, 2024
Viaarxiv icon

A Survey on Self-play Methods in Reinforcement Learning

Add code
Aug 02, 2024
Viaarxiv icon

Priorformer: A UGC-VQA Method with content and distortion priors

Add code
Jun 24, 2024
Viaarxiv icon

Soft-QMIX: Integrating Maximum Entropy For Monotonic Value Function Factorization

Add code
Jun 20, 2024
Viaarxiv icon

LVBench: An Extreme Long Video Understanding Benchmark

Add code
Jun 12, 2024
Figure 1 for LVBench: An Extreme Long Video Understanding Benchmark
Figure 2 for LVBench: An Extreme Long Video Understanding Benchmark
Figure 3 for LVBench: An Extreme Long Video Understanding Benchmark
Figure 4 for LVBench: An Extreme Long Video Understanding Benchmark
Viaarxiv icon

MQE: Unleashing the Power of Interaction with Multi-agent Quadruped Environment

Add code
Mar 24, 2024
Viaarxiv icon

LLMArena: Assessing Capabilities of Large Language Models in Dynamic Multi-Agent Environments

Add code
Feb 26, 2024
Viaarxiv icon

AutoSAT: Automatically Optimize SAT Solvers via Large Language Models

Add code
Feb 16, 2024
Viaarxiv icon

OpenRL: A Unified Reinforcement Learning Framework

Add code
Dec 20, 2023
Viaarxiv icon