Jihao Wu

TextHawk: Exploring Efficient Fine-Grained Perception of Multimodal Large Language Models

Apr 14, 2024
Via arXiv

Android in the Zoo: Chain-of-Action-Thought for GUI Agents

Mar 05, 2024
Via arXiv

Temporal-Spatial Entropy Balancing for Causal Continuous Treatment-Effect Estimation

Dec 19, 2023
Via arXiv

DocStormer: Revitalizing Multi-Degraded Colored Document Images to Pristine PDF

Oct 27, 2023
Via arXiv

Efficient Image Captioning for Edge Devices

Dec 18, 2022
Via arXiv

Controllable Image Captioning via Prompting

Dec 04, 2022
Via arXiv