Picture for Huan Sun

Huan Sun

Grokked Transformers are Implicit Reasoners: A Mechanistic Journey to the Edge of Generalization

Add code
May 27, 2024
Viaarxiv icon

AmpleGCG: Learning a Universal and Transferable Generative Model of Adversarial Suffixes for Jailbreaking Both Open and Closed LLMs

Add code
Apr 11, 2024
Viaarxiv icon

Cleared for Takeoff? Compositional & Conditional Reasoning may be the Achilles Heel to (Flight-Booking) Language Agents

Add code
Apr 05, 2024
Figure 1 for Cleared for Takeoff? Compositional & Conditional Reasoning may be the Achilles Heel to (Flight-Booking) Language Agents
Figure 2 for Cleared for Takeoff? Compositional & Conditional Reasoning may be the Achilles Heel to (Flight-Booking) Language Agents
Figure 3 for Cleared for Takeoff? Compositional & Conditional Reasoning may be the Achilles Heel to (Flight-Booking) Language Agents
Figure 4 for Cleared for Takeoff? Compositional & Conditional Reasoning may be the Achilles Heel to (Flight-Booking) Language Agents
Viaarxiv icon

AttributionBench: How Hard is Automatic Attribution Evaluation?

Add code
Feb 23, 2024
Viaarxiv icon

A Multi-Aspect Framework for Counter Narrative Evaluation using Large Language Models

Add code
Feb 18, 2024
Viaarxiv icon

LlaSMol: Advancing Large Language Models for Chemistry with a Large-Scale, Comprehensive, High-Quality Instruction Tuning Dataset

Add code
Feb 17, 2024
Viaarxiv icon

When is Tree Search Useful for LLM Planning? It Depends on the Discriminator

Add code
Feb 16, 2024
Figure 1 for When is Tree Search Useful for LLM Planning? It Depends on the Discriminator
Figure 2 for When is Tree Search Useful for LLM Planning? It Depends on the Discriminator
Figure 3 for When is Tree Search Useful for LLM Planning? It Depends on the Discriminator
Figure 4 for When is Tree Search Useful for LLM Planning? It Depends on the Discriminator
Viaarxiv icon

A Trembling House of Cards? Mapping Adversarial Attacks against Language Agents

Add code
Feb 15, 2024
Viaarxiv icon

eCeLLM: Generalizing Large Language Models for E-commerce from Large-scale, High-quality Instruction Data

Add code
Feb 13, 2024
Viaarxiv icon

GPT-4V(ision) is a Generalist Web Agent, if Grounded

Add code
Jan 03, 2024
Viaarxiv icon