Jiantao Jiao

Active-Dormant Attention Heads: Mechanistically Demystifying Extreme-Token Phenomena in LLMs

Oct 17, 2024
Via arXiv

Thinking LLMs: General Instruction Following with Thought Generation

Oct 14, 2024
Via arXiv

EmbedLLM: Learning Compact Representations of Large Language Models

Oct 03, 2024
Via arXiv

Meta-Rewarding Language Models: Self-Improving Alignment with LLM-as-a-Meta-Judge

Jul 28, 2024
Via arXiv

Universal evaluation and design of imaging systems using information estimation

May 31, 2024
Via arXiv

Toxicity Detection for Free

May 29, 2024
Via arXiv

Toward a Theory of Tokenization in LLMs

Apr 12, 2024
Via arXiv

Generative AI Security: Challenges and Countermeasures

Feb 20, 2024
Via arXiv

Efficient Prompt Caching via Embedding Similarity

Feb 02, 2024
Via arXiv

Iterative Data Smoothing: Mitigating Reward Overfitting and Overoptimization in RLHF

Jan 29, 2024
Via arXiv