Lei Zhang

Sid

Manta: Enhancing Mamba for Few-Shot Action Recognition of Long Sub-Sequence

Dec 10, 2024
Via arXiv

Evaluating and Aligning CodeLLMs on Human Preference

Dec 06, 2024
Via arXiv

Pixel-level and Semantic-level Adjustable Super-resolution: A Dual-LoRA Approach

Dec 04, 2024
Via arXiv

Single-Cell Omics Arena: A Benchmark Study for Large Language Models on Cell Type Annotation Using Single-Cell Data

Dec 03, 2024
Via arXiv

HandOS: 3D Hand Reconstruction in One Stage

Dec 02, 2024
Via arXiv

ChatRex: Taming Multimodal LLM for Joint Perception and Understanding

Dec 02, 2024
Via arXiv

Learning on Less: Constraining Pre-trained Model Learning for Generalizable Diffusion-Generated Image Detection

Dec 01, 2024
Via arXiv

Don't Let Your Robot be Harmful: Responsible Robotic Manipulation

Nov 27, 2024
Via arXiv

TAPTRv3: Spatial and Temporal Context Foster Robust Tracking of Any Point in Long Video

Nov 27, 2024
Via arXiv

Scaling Speech-Text Pre-training with Synthetic Interleaved Data

Nov 26, 2024
Via arXiv