Minlie Huang

Seeker: Enhancing Exception Handling in Code with LLM-based Multi-Agent Approach

Oct 09, 2024

LogicGame: Benchmarking Rule-Based Reasoning Abilities of Large Language Models

Sep 05, 2024

How Well Do Large Language Models Serve as End-to-End Secure Code Producers?

Aug 20, 2024

Mixture-of-Modules: Reinventing Transformers as Dynamic Assemblies of Modules

Jul 09, 2024

Benchmarking Complex Instruction-Following with Multiple Constraints Composition

Jul 04, 2024

Safe Unlearning: A Surprisingly Effective and Generalizable Solution to Defend Against Jailbreak Attacks

Jul 03, 2024

AutoDetect: Towards a Unified Framework for Automated Weakness Detection in Large Language Models

Jun 24, 2024

Instruction Pre-Training: Language Models are Supervised Multitask Learners

Jun 20, 2024

ChatGLM: A Family of Large Language Models from GLM-130B to GLM-4 All Tools

Jun 18, 2024

Learning Task Decomposition to Assist Humans in Competitive Programming

Jun 07, 2024