Picture for Hang Yan

Hang Yan

Length Generalization of Causal Transformers without Position Encoding

Add code
Apr 18, 2024
Figure 1 for Length Generalization of Causal Transformers without Position Encoding
Figure 2 for Length Generalization of Causal Transformers without Position Encoding
Figure 3 for Length Generalization of Causal Transformers without Position Encoding
Figure 4 for Length Generalization of Causal Transformers without Position Encoding
Viaarxiv icon

InternLM-XComposer2-4KHD: A Pioneering Large Vision-Language Model Handling Resolutions from 336 Pixels to 4K HD

Add code
Apr 09, 2024
Figure 1 for InternLM-XComposer2-4KHD: A Pioneering Large Vision-Language Model Handling Resolutions from 336 Pixels to 4K HD
Figure 2 for InternLM-XComposer2-4KHD: A Pioneering Large Vision-Language Model Handling Resolutions from 336 Pixels to 4K HD
Figure 3 for InternLM-XComposer2-4KHD: A Pioneering Large Vision-Language Model Handling Resolutions from 336 Pixels to 4K HD
Figure 4 for InternLM-XComposer2-4KHD: A Pioneering Large Vision-Language Model Handling Resolutions from 336 Pixels to 4K HD
Viaarxiv icon

InternLM2 Technical Report

Add code
Mar 26, 2024
Figure 1 for InternLM2 Technical Report
Figure 2 for InternLM2 Technical Report
Figure 3 for InternLM2 Technical Report
Figure 4 for InternLM2 Technical Report
Viaarxiv icon

Benchmarking Chinese Commonsense Reasoning of LLMs: From Chinese-Specifics to Reasoning-Memorization Correlations

Add code
Mar 21, 2024
Viaarxiv icon

EasyJailbreak: A Unified Framework for Jailbreaking Large Language Models

Add code
Mar 18, 2024
Figure 1 for EasyJailbreak: A Unified Framework for Jailbreaking Large Language Models
Figure 2 for EasyJailbreak: A Unified Framework for Jailbreaking Large Language Models
Figure 3 for EasyJailbreak: A Unified Framework for Jailbreaking Large Language Models
Figure 4 for EasyJailbreak: A Unified Framework for Jailbreaking Large Language Models
Viaarxiv icon

WanJuan-CC: A Safe and High-Quality Open-sourced English Webtext Dataset

Add code
Mar 12, 2024
Figure 1 for WanJuan-CC: A Safe and High-Quality Open-sourced English Webtext Dataset
Figure 2 for WanJuan-CC: A Safe and High-Quality Open-sourced English Webtext Dataset
Figure 3 for WanJuan-CC: A Safe and High-Quality Open-sourced English Webtext Dataset
Figure 4 for WanJuan-CC: A Safe and High-Quality Open-sourced English Webtext Dataset
Viaarxiv icon

In-Memory Learning: A Declarative Learning Framework for Large Language Models

Add code
Mar 05, 2024
Figure 1 for In-Memory Learning: A Declarative Learning Framework for Large Language Models
Figure 2 for In-Memory Learning: A Declarative Learning Framework for Large Language Models
Figure 3 for In-Memory Learning: A Declarative Learning Framework for Large Language Models
Figure 4 for In-Memory Learning: A Declarative Learning Framework for Large Language Models
Viaarxiv icon

AnyGPT: Unified Multimodal LLM with Discrete Sequence Modeling

Add code
Feb 26, 2024
Figure 1 for AnyGPT: Unified Multimodal LLM with Discrete Sequence Modeling
Figure 2 for AnyGPT: Unified Multimodal LLM with Discrete Sequence Modeling
Figure 3 for AnyGPT: Unified Multimodal LLM with Discrete Sequence Modeling
Figure 4 for AnyGPT: Unified Multimodal LLM with Discrete Sequence Modeling
Viaarxiv icon

Data-freeWeight Compress and Denoise for Large Language Models

Add code
Feb 26, 2024
Viaarxiv icon

Balanced Data Sampling for Language Model Training with Clustering

Add code
Feb 22, 2024
Figure 1 for Balanced Data Sampling for Language Model Training with Clustering
Figure 2 for Balanced Data Sampling for Language Model Training with Clustering
Figure 3 for Balanced Data Sampling for Language Model Training with Clustering
Figure 4 for Balanced Data Sampling for Language Model Training with Clustering
Viaarxiv icon