Picture for Yichi Zhang

Yichi Zhang

AI Lab, Netease

Next Token Prediction Towards Multimodal Intelligence: A Comprehensive Survey

Add code
Dec 30, 2024
Figure 1 for Next Token Prediction Towards Multimodal Intelligence: A Comprehensive Survey
Figure 2 for Next Token Prediction Towards Multimodal Intelligence: A Comprehensive Survey
Figure 3 for Next Token Prediction Towards Multimodal Intelligence: A Comprehensive Survey
Figure 4 for Next Token Prediction Towards Multimodal Intelligence: A Comprehensive Survey
Viaarxiv icon

6DMA-Aided Hybrid Beamforming with Joint Antenna Position and Orientation Optimization

Add code
Dec 22, 2024
Viaarxiv icon

PyOD 2: A Python Library for Outlier Detection with LLM-powered Model Selection

Add code
Dec 11, 2024
Figure 1 for PyOD 2: A Python Library for Outlier Detection with LLM-powered Model Selection
Figure 2 for PyOD 2: A Python Library for Outlier Detection with LLM-powered Model Selection
Figure 3 for PyOD 2: A Python Library for Outlier Detection with LLM-powered Model Selection
Viaarxiv icon

Scaling Laws for Black box Adversarial Attacks

Add code
Nov 25, 2024
Figure 1 for Scaling Laws for Black box Adversarial Attacks
Figure 2 for Scaling Laws for Black box Adversarial Attacks
Figure 3 for Scaling Laws for Black box Adversarial Attacks
Figure 4 for Scaling Laws for Black box Adversarial Attacks
Viaarxiv icon

Looking Beyond Text: Reducing Language bias in Large Vision-Language Models via Multimodal Dual-Attention and Soft-Image Guidance

Add code
Nov 21, 2024
Viaarxiv icon

MKGL: Mastery of a Three-Word Language

Add code
Oct 10, 2024
Figure 1 for MKGL: Mastery of a Three-Word Language
Figure 2 for MKGL: Mastery of a Three-Word Language
Figure 3 for MKGL: Mastery of a Three-Word Language
Figure 4 for MKGL: Mastery of a Three-Word Language
Viaarxiv icon

MetaOOD: Automatic Selection of OOD Detection Models

Add code
Oct 04, 2024
Figure 1 for MetaOOD: Automatic Selection of OOD Detection Models
Figure 2 for MetaOOD: Automatic Selection of OOD Detection Models
Figure 3 for MetaOOD: Automatic Selection of OOD Detection Models
Figure 4 for MetaOOD: Automatic Selection of OOD Detection Models
Viaarxiv icon

A Spark of Vision-Language Intelligence: 2-Dimensional Autoregressive Transformer for Efficient Finegrained Image Generation

Add code
Oct 02, 2024
Figure 1 for A Spark of Vision-Language Intelligence: 2-Dimensional Autoregressive Transformer for Efficient Finegrained Image Generation
Figure 2 for A Spark of Vision-Language Intelligence: 2-Dimensional Autoregressive Transformer for Efficient Finegrained Image Generation
Figure 3 for A Spark of Vision-Language Intelligence: 2-Dimensional Autoregressive Transformer for Efficient Finegrained Image Generation
Figure 4 for A Spark of Vision-Language Intelligence: 2-Dimensional Autoregressive Transformer for Efficient Finegrained Image Generation
Viaarxiv icon

Robust Training of Neural Networks at Arbitrary Precision and Sparsity

Add code
Sep 14, 2024
Figure 1 for Robust Training of Neural Networks at Arbitrary Precision and Sparsity
Figure 2 for Robust Training of Neural Networks at Arbitrary Precision and Sparsity
Figure 3 for Robust Training of Neural Networks at Arbitrary Precision and Sparsity
Figure 4 for Robust Training of Neural Networks at Arbitrary Precision and Sparsity
Viaarxiv icon

Unleashing the Potential of SAM2 for Biomedical Images and Videos: A Survey

Add code
Aug 23, 2024
Viaarxiv icon