Mengdi Wang

On the Statistical Complexity for Offline and Low-Adaptive Reinforcement Learning with Structures

Jan 03, 2025
Via arXiv

LIAR: Leveraging Alignment (Best-of-N) to Jailbreak LLMs in Seconds

Dec 06, 2024
Via arXiv

Immune: Improving Safety Against Jailbreaks in Multi-modal LLMs via Inference-Time Alignment

Nov 27, 2024
Via arXiv

One-Layer Transformer Provably Learns One-Nearest Neighbor In Context

Nov 16, 2024
Via arXiv

CUIfy the XR: An Open-Source Package to Embed LLM-powered Conversational Agents in XR

Nov 07, 2024
Via arXiv

Global Convergence in Training Large-Scale Transformers

Oct 31, 2024
Via arXiv

A Theoretical Perspective for Speculative Decoding Algorithm

Oct 30, 2024
Via arXiv

FoldMark: Protecting Protein Generative Models with Watermarking

Oct 27, 2024
Via arXiv

Fast Best-of-N Decoding via Speculative Rejection

Oct 26, 2024
Via arXiv

Long Term Memory: The Foundation of AI Self-Evolution

Oct 21, 2024
Via arXiv