Picture for Jie Tang

Jie Tang

Tony

GPT-4o System Card

Add code
Oct 25, 2024
Viaarxiv icon

LongRAG: A Dual-Perspective Retrieval-Augmented Generation Paradigm for Long-Context Question Answering

Add code
Oct 23, 2024
Viaarxiv icon

LogicGame: Benchmarking Rule-Based Reasoning Abilities of Large Language Models

Add code
Sep 05, 2024
Figure 1 for LogicGame: Benchmarking Rule-Based Reasoning Abilities of Large Language Models
Figure 2 for LogicGame: Benchmarking Rule-Based Reasoning Abilities of Large Language Models
Figure 3 for LogicGame: Benchmarking Rule-Based Reasoning Abilities of Large Language Models
Figure 4 for LogicGame: Benchmarking Rule-Based Reasoning Abilities of Large Language Models
Viaarxiv icon

CogVLM2: Visual Language Models for Image and Video Understanding

Add code
Aug 29, 2024
Figure 1 for CogVLM2: Visual Language Models for Image and Video Understanding
Figure 2 for CogVLM2: Visual Language Models for Image and Video Understanding
Figure 3 for CogVLM2: Visual Language Models for Image and Video Understanding
Figure 4 for CogVLM2: Visual Language Models for Image and Video Understanding
Viaarxiv icon

BattleAgentBench: A Benchmark for Evaluating Cooperation and Competition Capabilities of Language Models in Multi-Agent Systems

Add code
Aug 28, 2024
Viaarxiv icon

LongWriter: Unleashing 10,000+ Word Generation from Long Context LLMs

Add code
Aug 13, 2024
Figure 1 for LongWriter: Unleashing 10,000+ Word Generation from Long Context LLMs
Figure 2 for LongWriter: Unleashing 10,000+ Word Generation from Long Context LLMs
Figure 3 for LongWriter: Unleashing 10,000+ Word Generation from Long Context LLMs
Figure 4 for LongWriter: Unleashing 10,000+ Word Generation from Long Context LLMs
Viaarxiv icon

VisualAgentBench: Towards Large Multimodal Models as Visual Foundation Agents

Add code
Aug 12, 2024
Figure 1 for VisualAgentBench: Towards Large Multimodal Models as Visual Foundation Agents
Figure 2 for VisualAgentBench: Towards Large Multimodal Models as Visual Foundation Agents
Figure 3 for VisualAgentBench: Towards Large Multimodal Models as Visual Foundation Agents
Figure 4 for VisualAgentBench: Towards Large Multimodal Models as Visual Foundation Agents
Viaarxiv icon

CogVideoX: Text-to-Video Diffusion Models with An Expert Transformer

Add code
Aug 12, 2024
Viaarxiv icon

RAVSS: Robust Audio-Visual Speech Separation in Multi-Speaker Scenarios with Missing Visual Cues

Add code
Jul 27, 2024
Viaarxiv icon

Multi-turn Response Selection with Commonsense-enhanced Language Models

Add code
Jul 26, 2024
Figure 1 for Multi-turn Response Selection with Commonsense-enhanced Language Models
Figure 2 for Multi-turn Response Selection with Commonsense-enhanced Language Models
Figure 3 for Multi-turn Response Selection with Commonsense-enhanced Language Models
Figure 4 for Multi-turn Response Selection with Commonsense-enhanced Language Models
Viaarxiv icon