Jiawei Wang

Tarsier: Recipes for Training and Evaluating Large Video Description Models

Jun 30, 2024
Via arXiv

DLAFormer: An End-to-End Transformer For Document Layout Analysis

May 20, 2024
Via arXiv

AMCEN: An Attention Masking-based Contrastive Event Network for Two-stage Temporal Knowledge Graph Reasoning

May 16, 2024
Via arXiv

EM-TTS: Efficiently Trained Low-Resource Mongolian Lightweight Text-to-Speech

Mar 17, 2024
Via arXiv

Large Language Models as Urban Residents: An LLM Agent Framework for Personal Mobility Generation

Feb 22, 2024
Via arXiv

Boximator: Generating Rich and Controllable Motions for Video Synthesis

Feb 02, 2024
Via arXiv

Detect-Order-Construct: A Tree Construction based Approach for Hierarchical Document Structure Analysis

Jan 22, 2024
Via arXiv

Dynamic Relation Transformer for Contextual Text Block Detection

Jan 17, 2024
Via arXiv

UniVIE: A Unified Label Space Approach to Visual Information Extraction from Form-like Documents

Jan 17, 2024
Via arXiv

XuanCe: A Comprehensive and Unified Deep Reinforcement Learning Library

Dec 25, 2023
Via arXiv