Zhijing Jin

NLP for Social Good: A Survey of Challenges, Opportunities, and Responsible Deployment

May 28, 2025
Via arXiv

Are Language Models Consequentialist or Deontological Moral Reasoners?

May 27, 2025
Via arXiv

When Ethics and Payoffs Diverge: LLM Agents in Morally Charged Social Dilemmas

May 25, 2025
Via arXiv

Accidental Misalignment: Fine-Tuning Language Models Induces Unexpected Vulnerability

May 22, 2025
Via arXiv

Causality for Natural Language Processing

Apr 20, 2025
Via arXiv

The Reasoning-Memorization Interplay in Language Models Is Mediated by a Single Direction

Mar 29, 2025
Via arXiv

DARS: Dynamic Action Re-Sampling to Enhance Coding Agent Performance by Adaptive Tree Traversal

Mar 18, 2025
Via arXiv

Revealing Hidden Mechanisms of Cross-Country Content Moderation with Natural Language Processing

Mar 07, 2025
Via arXiv

Causality Is Key to Understand and Balance Multiple Goals in Trustworthy ML and Foundation Models

Feb 28, 2025
Via arXiv

Causality can systematically address the monsters under the bench(marks)

Feb 07, 2025
Via arXiv