Jindong Wang

Is Your Model Really A Good Math Reasoner? Evaluating Mathematical Reasoning with Checklist

Jul 11, 2024

AgentReview: Exploring Peer Review Dynamics with LLM Agents

Jun 18, 2024

Can I understand what I create? Self-Knowledge Evaluation of Large Language Models

Jun 10, 2024

Beyond Metrics: Evaluating LLMs' Effectiveness in Culturally Nuanced, Low-Resource Real-World Scenarios

Jun 01, 2024

Slight Corruption in Pre-training Data Makes Better Diffusion Models

May 30, 2024

CulturePark: Boosting Cross-cultural Understanding in Large Language Models

May 24, 2024

NegativePrompt: Leveraging Psychology for Large Language Models Enhancement via Negative Emotional Stimuli

May 05, 2024

FreeEval: A Modular Framework for Trustworthy and Efficient Evaluation of Large Language Models

Apr 09, 2024

Detoxifying Large Language Models via Knowledge Editing

Mar 28, 2024

Learning with Noisy Foundation Models

Mar 11, 2024