Picture for Chenfei Wu

Chenfei Wu

Predicting Genetic Mutation from Whole Slide Images via Biomedical-Linguistic Knowledge Enhanced Multi-label Classification

Add code
Jun 05, 2024
Figure 1 for Predicting Genetic Mutation from Whole Slide Images via Biomedical-Linguistic Knowledge Enhanced Multi-label Classification
Figure 2 for Predicting Genetic Mutation from Whole Slide Images via Biomedical-Linguistic Knowledge Enhanced Multi-label Classification
Figure 3 for Predicting Genetic Mutation from Whole Slide Images via Biomedical-Linguistic Knowledge Enhanced Multi-label Classification
Figure 4 for Predicting Genetic Mutation from Whole Slide Images via Biomedical-Linguistic Knowledge Enhanced Multi-label Classification
Viaarxiv icon

LVLM-Intrepret: An Interpretability Tool for Large Vision-Language Models

Add code
Apr 03, 2024
Figure 1 for LVLM-Intrepret: An Interpretability Tool for Large Vision-Language Models
Figure 2 for LVLM-Intrepret: An Interpretability Tool for Large Vision-Language Models
Figure 3 for LVLM-Intrepret: An Interpretability Tool for Large Vision-Language Models
Figure 4 for LVLM-Intrepret: An Interpretability Tool for Large Vision-Language Models
Viaarxiv icon

Using Left and Right Brains Together: Towards Vision and Language Planning

Add code
Feb 16, 2024
Viaarxiv icon

StrokeNUWA: Tokenizing Strokes for Vector Graphic Synthesis

Add code
Jan 30, 2024
Viaarxiv icon

EIPE-text: Evaluation-Guided Iterative Plan Extraction for Long-Form Narrative Text Generation

Add code
Oct 12, 2023
Figure 1 for EIPE-text: Evaluation-Guided Iterative Plan Extraction for Long-Form Narrative Text Generation
Figure 2 for EIPE-text: Evaluation-Guided Iterative Plan Extraction for Long-Form Narrative Text Generation
Figure 3 for EIPE-text: Evaluation-Guided Iterative Plan Extraction for Long-Form Narrative Text Generation
Figure 4 for EIPE-text: Evaluation-Guided Iterative Plan Extraction for Long-Form Narrative Text Generation
Viaarxiv icon

LayoutNUWA: Revealing the Hidden Layout Expertise of Large Language Models

Add code
Sep 19, 2023
Figure 1 for LayoutNUWA: Revealing the Hidden Layout Expertise of Large Language Models
Figure 2 for LayoutNUWA: Revealing the Hidden Layout Expertise of Large Language Models
Figure 3 for LayoutNUWA: Revealing the Hidden Layout Expertise of Large Language Models
Figure 4 for LayoutNUWA: Revealing the Hidden Layout Expertise of Large Language Models
Viaarxiv icon

ORES: Open-vocabulary Responsible Visual Synthesis

Add code
Aug 26, 2023
Figure 1 for ORES: Open-vocabulary Responsible Visual Synthesis
Figure 2 for ORES: Open-vocabulary Responsible Visual Synthesis
Figure 3 for ORES: Open-vocabulary Responsible Visual Synthesis
Figure 4 for ORES: Open-vocabulary Responsible Visual Synthesis
Viaarxiv icon

GameEval: Evaluating LLMs on Conversational Games

Add code
Aug 19, 2023
Figure 1 for GameEval: Evaluating LLMs on Conversational Games
Figure 2 for GameEval: Evaluating LLMs on Conversational Games
Figure 3 for GameEval: Evaluating LLMs on Conversational Games
Figure 4 for GameEval: Evaluating LLMs on Conversational Games
Viaarxiv icon

DragNUWA: Fine-grained Control in Video Generation by Integrating Text, Image, and Trajectory

Add code
Aug 16, 2023
Figure 1 for DragNUWA: Fine-grained Control in Video Generation by Integrating Text, Image, and Trajectory
Figure 2 for DragNUWA: Fine-grained Control in Video Generation by Integrating Text, Image, and Trajectory
Figure 3 for DragNUWA: Fine-grained Control in Video Generation by Integrating Text, Image, and Trajectory
Figure 4 for DragNUWA: Fine-grained Control in Video Generation by Integrating Text, Image, and Trajectory
Viaarxiv icon

ManagerTower: Aggregating the Insights of Uni-Modal Experts for Vision-Language Representation Learning

Add code
May 31, 2023
Figure 1 for ManagerTower: Aggregating the Insights of Uni-Modal Experts for Vision-Language Representation Learning
Figure 2 for ManagerTower: Aggregating the Insights of Uni-Modal Experts for Vision-Language Representation Learning
Figure 3 for ManagerTower: Aggregating the Insights of Uni-Modal Experts for Vision-Language Representation Learning
Figure 4 for ManagerTower: Aggregating the Insights of Uni-Modal Experts for Vision-Language Representation Learning
Viaarxiv icon