Picture for Zhizheng Zhang

Zhizheng Zhang

Southeast University, China

NaVid: Video-based VLM Plans the Next Step for Vision-and-Language Navigation

Add code
Mar 01, 2024
Figure 1 for NaVid: Video-based VLM Plans the Next Step for Vision-and-Language Navigation
Figure 2 for NaVid: Video-based VLM Plans the Next Step for Vision-and-Language Navigation
Figure 3 for NaVid: Video-based VLM Plans the Next Step for Vision-and-Language Navigation
Figure 4 for NaVid: Video-based VLM Plans the Next Step for Vision-and-Language Navigation
Viaarxiv icon

SeD: Semantic-Aware Discriminator for Image Super-Resolution

Add code
Feb 29, 2024
Figure 1 for SeD: Semantic-Aware Discriminator for Image Super-Resolution
Figure 2 for SeD: Semantic-Aware Discriminator for Image Super-Resolution
Figure 3 for SeD: Semantic-Aware Discriminator for Image Super-Resolution
Figure 4 for SeD: Semantic-Aware Discriminator for Image Super-Resolution
Viaarxiv icon

Reinforced UI Instruction Grounding: Towards a Generic UI Task Automation API

Add code
Oct 07, 2023
Figure 1 for Reinforced UI Instruction Grounding: Towards a Generic UI Task Automation API
Figure 2 for Reinforced UI Instruction Grounding: Towards a Generic UI Task Automation API
Figure 3 for Reinforced UI Instruction Grounding: Towards a Generic UI Task Automation API
Figure 4 for Reinforced UI Instruction Grounding: Towards a Generic UI Task Automation API
Viaarxiv icon

Adaptive Frequency Filters As Efficient Global Token Mixers

Add code
Jul 26, 2023
Viaarxiv icon

When and Why Momentum Accelerates SGD:An Empirical Study

Add code
Jun 15, 2023
Figure 1 for When and Why Momentum Accelerates SGD:An Empirical Study
Figure 2 for When and Why Momentum Accelerates SGD:An Empirical Study
Figure 3 for When and Why Momentum Accelerates SGD:An Empirical Study
Figure 4 for When and Why Momentum Accelerates SGD:An Empirical Study
Viaarxiv icon

Responsible Task Automation: Empowering Large Language Models as Responsible Task Automators

Add code
Jun 02, 2023
Figure 1 for Responsible Task Automation: Empowering Large Language Models as Responsible Task Automators
Figure 2 for Responsible Task Automation: Empowering Large Language Models as Responsible Task Automators
Figure 3 for Responsible Task Automation: Empowering Large Language Models as Responsible Task Automators
Figure 4 for Responsible Task Automation: Empowering Large Language Models as Responsible Task Automators
Viaarxiv icon

Learning Trajectories are Generalization Indicators

Add code
May 04, 2023
Figure 1 for Learning Trajectories are Generalization Indicators
Figure 2 for Learning Trajectories are Generalization Indicators
Figure 3 for Learning Trajectories are Generalization Indicators
Figure 4 for Learning Trajectories are Generalization Indicators
Viaarxiv icon

MRVM-NeRF: Mask-Based Pretraining for Neural Radiance Fields

Add code
Apr 11, 2023
Figure 1 for MRVM-NeRF: Mask-Based Pretraining for Neural Radiance Fields
Figure 2 for MRVM-NeRF: Mask-Based Pretraining for Neural Radiance Fields
Figure 3 for MRVM-NeRF: Mask-Based Pretraining for Neural Radiance Fields
Figure 4 for MRVM-NeRF: Mask-Based Pretraining for Neural Radiance Fields
Viaarxiv icon

Unifying Layout Generation with a Decoupled Diffusion Model

Add code
Mar 09, 2023
Figure 1 for Unifying Layout Generation with a Decoupled Diffusion Model
Figure 2 for Unifying Layout Generation with a Decoupled Diffusion Model
Figure 3 for Unifying Layout Generation with a Decoupled Diffusion Model
Figure 4 for Unifying Layout Generation with a Decoupled Diffusion Model
Viaarxiv icon

Versatile Neural Processes for Learning Implicit Neural Representations

Add code
Jan 21, 2023
Figure 1 for Versatile Neural Processes for Learning Implicit Neural Representations
Figure 2 for Versatile Neural Processes for Learning Implicit Neural Representations
Figure 3 for Versatile Neural Processes for Learning Implicit Neural Representations
Figure 4 for Versatile Neural Processes for Learning Implicit Neural Representations
Viaarxiv icon