Kaixuan Huang

Alita: Generalist Agent Enabling Scalable Agentic Reasoning with Minimal Predefinition and Maximal Self-Evolution

May 26, 2025

On Path to Multimodal Historical Reasoning: HistBench and HistAgent

May 26, 2025

Large Wireless Localization Model (LWLM): A Foundation Model for Positioning in 6G Networks

May 15, 2025

Temporal Consistency for LLM Reasoning Process Error Identification

Mar 18, 2025

Emergent Symbolic Mechanisms Support Abstract Reasoning in Large Language Models

Feb 27, 2025

MATH-Perturb: Benchmarking LLMs' Math Reasoning Abilities against Hard Perturbations

Feb 10, 2025

A Theoretical Perspective for Speculative Decoding Algorithm

Oct 30, 2024

TreeBoN: Enhancing Inference-Time Alignment with Speculative Tree-Search and Best-of-N Sampling

Oct 18, 2024

Latent Diffusion Models for Controllable RNA Sequence Generation

Sep 15, 2024

SORRY-Bench: Systematically Evaluating Large Language Model Safety Refusal Behaviors

Jun 20, 2024