Lei Zhang

Sid

Self-Supervised Scene Flow Estimation with Point-Voxel Fusion and Surface Representation

Oct 17, 2024

UniG: Modelling Unitary 3D Gaussians for View-consistent 3D Reconstruction

Oct 17, 2024

LKASeg: Remote-Sensing Image Semantic Segmentation with Large Kernel Attention and Full-Scale Skip Connections

Oct 14, 2024

Visual-O1: Understanding Ambiguous Instructions via Multi-modal Multi-turn Chain-of-thoughts Reasoning

Oct 04, 2024

Ruler: A Model-Agnostic Method to Control Generated Length for Large Language Models

Sep 27, 2024

LKA-ReID: Vehicle Re-Identification with Large Kernel Attention

Sep 26, 2024

Path-adaptive Spatio-Temporal State Space Model for Event-based Recognition with Arbitrary Duration

Sep 25, 2024

DreamWaltz-G: Expressive 3D Gaussian Avatars from Skeleton-Guided 2D Diffusion

Sep 25, 2024

Qwen2.5-Coder Technical Report

Sep 18, 2024

ClearDepth: Enhanced Stereo Perception of Transparent Objects for Robotic Manipulation

Sep 13, 2024