Kaixuan Huang

Alita: Generalist Agent Enabling Scalable Agentic Reasoning with Minimal Predefinition and Maximal Self-Evolution

May 26, 2025

On Path to Multimodal Historical Reasoning: HistBench and HistAgent

May 26, 2025

Large Wireless Localization Model (LWLM): A Foundation Model for Positioning in 6G Networks

May 15, 2025

Temporal Consistency for LLM Reasoning Process Error Identification

Mar 18, 2025

Emergent Symbolic Mechanisms Support Abstract Reasoning in Large Language Models

Feb 27, 2025

MATH-Perturb: Benchmarking LLMs' Math Reasoning Abilities against Hard Perturbations

Feb 10, 2025

A Theoretical Perspective for Speculative Decoding Algorithm

Oct 30, 2024

TreeBoN: Enhancing Inference-Time Alignment with Speculative Tree-Search and Best-of-N Sampling

Oct 18, 2024

Latent Diffusion Models for Controllable RNA Sequence Generation

Sep 15, 2024

SORRY-Bench: Systematically Evaluating Large Language Model Safety Refusal Behaviors

Jun 20, 2024