Fangyi Chen

STELAR-VISION: Self-Topology-Aware Efficient Learning for Aligned Reasoning in Vision

Aug 12, 2025
Via arXiv

Masked Autoencoders Are Effective Tokenizers for Diffusion Models

Feb 05, 2025
Via arXiv

Semi-Supervised Learning from Small Annotated Data and Large Unlabeled Data for Fine-grained PICO Entity Recognition

Dec 26, 2024
Via arXiv

A MapReduce Approach to Effectively Utilize Long Context Information in Retrieval Augmented Language Models

Dec 17, 2024
Via arXiv

SoftVQ-VAE: Efficient 1-Dimensional Continuous Tokenizer

Dec 14, 2024
Via arXiv

Hierarchical Knowledge Graph Construction from Images for Scalable E-Commerce

Oct 28, 2024
Via arXiv

A Reference-Based 3D Semantic-Aware Framework for Accurate Local Facial Attribute Editing

Jul 29, 2024
Via arXiv

RTGen: Generating Region-Text Pairs for Open-Vocabulary Object Detection

May 30, 2024
Via arXiv

Enhanced Training of Query-Based Object Detection via Selective Query Recollection

Dec 15, 2022
Via arXiv

Unitail: Detecting, Reading, and Matching in Retail Scene

Apr 01, 2022
Via arXiv