Picture for Lingpeng Kong

Lingpeng Kong

How Far Can Cantonese NLP Go? Benchmarking Cantonese Capabilities of Large Language Models

Add code
Aug 29, 2024
Viaarxiv icon

SubgoalXL: Subgoal-based Expert Learning for Theorem Proving

Add code
Aug 20, 2024
Viaarxiv icon

FACTTRACK: Time-Aware World State Tracking in Story Outlines

Add code
Jul 23, 2024
Figure 1 for FACTTRACK: Time-Aware World State Tracking in Story Outlines
Figure 2 for FACTTRACK: Time-Aware World State Tracking in Story Outlines
Figure 3 for FACTTRACK: Time-Aware World State Tracking in Story Outlines
Figure 4 for FACTTRACK: Time-Aware World State Tracking in Story Outlines
Viaarxiv icon

Data Augmentation of Multi-turn Psychological Dialogue via Knowledge-driven Progressive Thought Prompting

Add code
Jun 24, 2024
Figure 1 for Data Augmentation of Multi-turn Psychological Dialogue via Knowledge-driven Progressive Thought Prompting
Figure 2 for Data Augmentation of Multi-turn Psychological Dialogue via Knowledge-driven Progressive Thought Prompting
Figure 3 for Data Augmentation of Multi-turn Psychological Dialogue via Knowledge-driven Progressive Thought Prompting
Viaarxiv icon

Jailbreaking as a Reward Misspecification Problem

Add code
Jun 20, 2024
Figure 1 for Jailbreaking as a Reward Misspecification Problem
Figure 2 for Jailbreaking as a Reward Misspecification Problem
Figure 3 for Jailbreaking as a Reward Misspecification Problem
Figure 4 for Jailbreaking as a Reward Misspecification Problem
Viaarxiv icon

A Survey of Neural Code Intelligence: Paradigms, Advances and Beyond

Add code
Mar 21, 2024
Viaarxiv icon

ImgTrojan: Jailbreaking Vision-Language Models with ONE Image

Add code
Mar 06, 2024
Figure 1 for ImgTrojan: Jailbreaking Vision-Language Models with ONE Image
Figure 2 for ImgTrojan: Jailbreaking Vision-Language Models with ONE Image
Figure 3 for ImgTrojan: Jailbreaking Vision-Language Models with ONE Image
Figure 4 for ImgTrojan: Jailbreaking Vision-Language Models with ONE Image
Viaarxiv icon

Multimodal ArXiv: A Dataset for Improving Scientific Comprehension of Large Vision-Language Models

Add code
Mar 04, 2024
Figure 1 for Multimodal ArXiv: A Dataset for Improving Scientific Comprehension of Large Vision-Language Models
Figure 2 for Multimodal ArXiv: A Dataset for Improving Scientific Comprehension of Large Vision-Language Models
Figure 3 for Multimodal ArXiv: A Dataset for Improving Scientific Comprehension of Large Vision-Language Models
Figure 4 for Multimodal ArXiv: A Dataset for Improving Scientific Comprehension of Large Vision-Language Models
Viaarxiv icon

GSM-Plus: A Comprehensive Benchmark for Evaluating the Robustness of LLMs as Mathematical Problem Solvers

Add code
Feb 29, 2024
Figure 1 for GSM-Plus: A Comprehensive Benchmark for Evaluating the Robustness of LLMs as Mathematical Problem Solvers
Figure 2 for GSM-Plus: A Comprehensive Benchmark for Evaluating the Robustness of LLMs as Mathematical Problem Solvers
Figure 3 for GSM-Plus: A Comprehensive Benchmark for Evaluating the Robustness of LLMs as Mathematical Problem Solvers
Figure 4 for GSM-Plus: A Comprehensive Benchmark for Evaluating the Robustness of LLMs as Mathematical Problem Solvers
Viaarxiv icon

Training-Free Long-Context Scaling of Large Language Models

Add code
Feb 27, 2024
Viaarxiv icon