Yue Wu

A Survey of Reasoning with Foundation Models

Dec 26, 2023
Via arXiv

Neural Gaussian Similarity Modeling for Differential Graph Structure Learning

Dec 15, 2023
Via arXiv

SimAC: A Simple Anti-Customization Method against Text-to-Image Synthesis of Diffusion Models

Dec 13, 2023
Via arXiv

Fast Training of Diffusion Transformer with Extreme Masking for 3D Point Clouds Generation

Dec 12, 2023
Via arXiv

M3SOT: Multi-frame, Multi-field, Multi-space 3D Single Object Tracking

Dec 11, 2023
Via arXiv

PCRDiffusion: Diffusion Probabilistic Models for Point Cloud Registration

Dec 11, 2023
Via arXiv

Drag-A-Video: Non-rigid Video Editing with Point-based Interaction

Dec 05, 2023
Via arXiv

Language Grounded QFormer for Efficient Vision Language Understanding

Nov 13, 2023
Via arXiv

Open-Ended Instructable Embodied Agents with Memory-Augmented Large Language Models

Oct 23, 2023
Via arXiv

PixArt-$α$: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis

Oct 16, 2023
Via arXiv