Picture for Yushi Bai

Yushi Bai

Finding Safety Neurons in Large Language Models

Add code
Jun 20, 2024
Figure 1 for Finding Safety Neurons in Large Language Models
Figure 2 for Finding Safety Neurons in Large Language Models
Figure 3 for Finding Safety Neurons in Large Language Models
Figure 4 for Finding Safety Neurons in Large Language Models
Viaarxiv icon

ChatGLM: A Family of Large Language Models from GLM-130B to GLM-4 All Tools

Add code
Jun 18, 2024
Figure 1 for ChatGLM: A Family of Large Language Models from GLM-130B to GLM-4 All Tools
Figure 2 for ChatGLM: A Family of Large Language Models from GLM-130B to GLM-4 All Tools
Figure 3 for ChatGLM: A Family of Large Language Models from GLM-130B to GLM-4 All Tools
Figure 4 for ChatGLM: A Family of Large Language Models from GLM-130B to GLM-4 All Tools
Viaarxiv icon

DICE: Detecting In-distribution Contamination in LLM's Fine-tuning Phase for Math Reasoning

Add code
Jun 06, 2024
Figure 1 for DICE: Detecting In-distribution Contamination in LLM's Fine-tuning Phase for Math Reasoning
Figure 2 for DICE: Detecting In-distribution Contamination in LLM's Fine-tuning Phase for Math Reasoning
Figure 3 for DICE: Detecting In-distribution Contamination in LLM's Fine-tuning Phase for Math Reasoning
Figure 4 for DICE: Detecting In-distribution Contamination in LLM's Fine-tuning Phase for Math Reasoning
Viaarxiv icon

Advancing Geometric Problem Solving: A Comprehensive Benchmark for Multimodal Model Evaluation

Add code
Apr 07, 2024
Figure 1 for Advancing Geometric Problem Solving: A Comprehensive Benchmark for Multimodal Model Evaluation
Figure 2 for Advancing Geometric Problem Solving: A Comprehensive Benchmark for Multimodal Model Evaluation
Figure 3 for Advancing Geometric Problem Solving: A Comprehensive Benchmark for Multimodal Model Evaluation
Figure 4 for Advancing Geometric Problem Solving: A Comprehensive Benchmark for Multimodal Model Evaluation
Viaarxiv icon

CogCoM: Train Large Vision-Language Models Diving into Details through Chain of Manipulations

Add code
Feb 06, 2024
Figure 1 for CogCoM: Train Large Vision-Language Models Diving into Details through Chain of Manipulations
Figure 2 for CogCoM: Train Large Vision-Language Models Diving into Details through Chain of Manipulations
Figure 3 for CogCoM: Train Large Vision-Language Models Diving into Details through Chain of Manipulations
Figure 4 for CogCoM: Train Large Vision-Language Models Diving into Details through Chain of Manipulations
Viaarxiv icon

LongAlign: A Recipe for Long Context Alignment of Large Language Models

Add code
Jan 31, 2024
Viaarxiv icon

Text-Image Conditioned Diffusion for Consistent Text-to-3D Generation

Add code
Dec 19, 2023
Viaarxiv icon

WaterBench: Towards Holistic Evaluation of Watermarks for Large Language Models

Add code
Nov 13, 2023
Figure 1 for WaterBench: Towards Holistic Evaluation of Watermarks for Large Language Models
Figure 2 for WaterBench: Towards Holistic Evaluation of Watermarks for Large Language Models
Figure 3 for WaterBench: Towards Holistic Evaluation of Watermarks for Large Language Models
Figure 4 for WaterBench: Towards Holistic Evaluation of Watermarks for Large Language Models
Viaarxiv icon

Know2BIO: A Comprehensive Dual-View Benchmark for Evolving Biomedical Knowledge Graphs

Add code
Oct 05, 2023
Figure 1 for Know2BIO: A Comprehensive Dual-View Benchmark for Evolving Biomedical Knowledge Graphs
Figure 2 for Know2BIO: A Comprehensive Dual-View Benchmark for Evolving Biomedical Knowledge Graphs
Figure 3 for Know2BIO: A Comprehensive Dual-View Benchmark for Evolving Biomedical Knowledge Graphs
Figure 4 for Know2BIO: A Comprehensive Dual-View Benchmark for Evolving Biomedical Knowledge Graphs
Viaarxiv icon

T$^3$Bench: Benchmarking Current Progress in Text-to-3D Generation

Add code
Oct 04, 2023
Figure 1 for T$^3$Bench: Benchmarking Current Progress in Text-to-3D Generation
Figure 2 for T$^3$Bench: Benchmarking Current Progress in Text-to-3D Generation
Figure 3 for T$^3$Bench: Benchmarking Current Progress in Text-to-3D Generation
Figure 4 for T$^3$Bench: Benchmarking Current Progress in Text-to-3D Generation
Viaarxiv icon