Picture for Lei Zhang

Lei Zhang

Sid

Compress & Align: Curating Image-Text Data with Human Knowledge

Add code
Dec 13, 2023
Figure 1 for Compress & Align: Curating Image-Text Data with Human Knowledge
Figure 2 for Compress & Align: Curating Image-Text Data with Human Knowledge
Figure 3 for Compress & Align: Curating Image-Text Data with Human Knowledge
Figure 4 for Compress & Align: Curating Image-Text Data with Human Knowledge
Viaarxiv icon

Dynamic Weighted Combiner for Mixed-Modal Image Retrieval

Add code
Dec 11, 2023
Viaarxiv icon

RCA-NOC: Relative Contrastive Alignment for Novel Object Captioning

Add code
Dec 11, 2023
Figure 1 for RCA-NOC: Relative Contrastive Alignment for Novel Object Captioning
Figure 2 for RCA-NOC: Relative Contrastive Alignment for Novel Object Captioning
Figure 3 for RCA-NOC: Relative Contrastive Alignment for Novel Object Captioning
Figure 4 for RCA-NOC: Relative Contrastive Alignment for Novel Object Captioning
Viaarxiv icon

OpenSD: Unified Open-Vocabulary Segmentation and Detection

Add code
Dec 10, 2023
Viaarxiv icon

PhysHOI: Physics-Based Imitation of Dynamic Human-Object Interaction

Add code
Dec 07, 2023
Figure 1 for PhysHOI: Physics-Based Imitation of Dynamic Human-Object Interaction
Figure 2 for PhysHOI: Physics-Based Imitation of Dynamic Human-Object Interaction
Figure 3 for PhysHOI: Physics-Based Imitation of Dynamic Human-Object Interaction
Figure 4 for PhysHOI: Physics-Based Imitation of Dynamic Human-Object Interaction
Viaarxiv icon

LLaVA-Grounding: Grounded Visual Chat with Large Multimodal Models

Add code
Dec 05, 2023
Figure 1 for LLaVA-Grounding: Grounded Visual Chat with Large Multimodal Models
Figure 2 for LLaVA-Grounding: Grounded Visual Chat with Large Multimodal Models
Figure 3 for LLaVA-Grounding: Grounded Visual Chat with Large Multimodal Models
Figure 4 for LLaVA-Grounding: Grounded Visual Chat with Large Multimodal Models
Viaarxiv icon

Motion-Guided Latent Diffusion for Temporally Consistent Real-world Video Super-resolution

Add code
Dec 01, 2023
Figure 1 for Motion-Guided Latent Diffusion for Temporally Consistent Real-world Video Super-resolution
Figure 2 for Motion-Guided Latent Diffusion for Temporally Consistent Real-world Video Super-resolution
Figure 3 for Motion-Guided Latent Diffusion for Temporally Consistent Real-world Video Super-resolution
Figure 4 for Motion-Guided Latent Diffusion for Temporally Consistent Real-world Video Super-resolution
Viaarxiv icon

Value Approximation for Two-Player General-Sum Differential Games with State Constraints

Add code
Nov 28, 2023
Figure 1 for Value Approximation for Two-Player General-Sum Differential Games with State Constraints
Figure 2 for Value Approximation for Two-Player General-Sum Differential Games with State Constraints
Figure 3 for Value Approximation for Two-Player General-Sum Differential Games with State Constraints
Figure 4 for Value Approximation for Two-Player General-Sum Differential Games with State Constraints
Viaarxiv icon

RIDE: Real-time Intrusion Detection via Explainable Machine Learning Implemented in a Memristor Hardware Architecture

Add code
Nov 27, 2023
Figure 1 for RIDE: Real-time Intrusion Detection via Explainable Machine Learning Implemented in a Memristor Hardware Architecture
Figure 2 for RIDE: Real-time Intrusion Detection via Explainable Machine Learning Implemented in a Memristor Hardware Architecture
Figure 3 for RIDE: Real-time Intrusion Detection via Explainable Machine Learning Implemented in a Memristor Hardware Architecture
Figure 4 for RIDE: Real-time Intrusion Detection via Explainable Machine Learning Implemented in a Memristor Hardware Architecture
Viaarxiv icon

SeeSR: Towards Semantics-Aware Real-World Image Super-Resolution

Add code
Nov 27, 2023
Figure 1 for SeeSR: Towards Semantics-Aware Real-World Image Super-Resolution
Figure 2 for SeeSR: Towards Semantics-Aware Real-World Image Super-Resolution
Figure 3 for SeeSR: Towards Semantics-Aware Real-World Image Super-Resolution
Figure 4 for SeeSR: Towards Semantics-Aware Real-World Image Super-Resolution
Viaarxiv icon