Picture for Yuanyuan Fu

Yuanyuan Fu

UniQ: Unified Decoder with Task-specific Queries for Efficient Scene Graph Generation

Add code
Jan 10, 2025
Figure 1 for UniQ: Unified Decoder with Task-specific Queries for Efficient Scene Graph Generation
Figure 2 for UniQ: Unified Decoder with Task-specific Queries for Efficient Scene Graph Generation
Figure 3 for UniQ: Unified Decoder with Task-specific Queries for Efficient Scene Graph Generation
Figure 4 for UniQ: Unified Decoder with Task-specific Queries for Efficient Scene Graph Generation
Viaarxiv icon

Detection-based Intermediate Supervision for Visual Question Answering

Add code
Dec 26, 2023
Figure 1 for Detection-based Intermediate Supervision for Visual Question Answering
Figure 2 for Detection-based Intermediate Supervision for Visual Question Answering
Figure 3 for Detection-based Intermediate Supervision for Visual Question Answering
Figure 4 for Detection-based Intermediate Supervision for Visual Question Answering
Viaarxiv icon

An Empirical Study on the Language Modal in Visual Question Answering

Add code
May 17, 2023
Figure 1 for An Empirical Study on the Language Modal in Visual Question Answering
Figure 2 for An Empirical Study on the Language Modal in Visual Question Answering
Figure 3 for An Empirical Study on the Language Modal in Visual Question Answering
Figure 4 for An Empirical Study on the Language Modal in Visual Question Answering
Viaarxiv icon

STAGE: Span Tagging and Greedy Inference Scheme for Aspect Sentiment Triplet Extraction

Add code
Nov 29, 2022
Figure 1 for STAGE: Span Tagging and Greedy Inference Scheme for Aspect Sentiment Triplet Extraction
Figure 2 for STAGE: Span Tagging and Greedy Inference Scheme for Aspect Sentiment Triplet Extraction
Figure 3 for STAGE: Span Tagging and Greedy Inference Scheme for Aspect Sentiment Triplet Extraction
Figure 4 for STAGE: Span Tagging and Greedy Inference Scheme for Aspect Sentiment Triplet Extraction
Viaarxiv icon

End-to-end speaker diarization with transformer

Add code
Dec 14, 2021
Figure 1 for End-to-end speaker diarization with transformer
Figure 2 for End-to-end speaker diarization with transformer
Figure 3 for End-to-end speaker diarization with transformer
Figure 4 for End-to-end speaker diarization with transformer
Viaarxiv icon

Visual-Semantic Transformer for Scene Text Recognition

Add code
Dec 02, 2021
Figure 1 for Visual-Semantic Transformer for Scene Text Recognition
Figure 2 for Visual-Semantic Transformer for Scene Text Recognition
Figure 3 for Visual-Semantic Transformer for Scene Text Recognition
Figure 4 for Visual-Semantic Transformer for Scene Text Recognition
Viaarxiv icon