Picture for Baixuan Xu

Baixuan Xu

NewtonBench: Benchmarking Generalizable Scientific Law Discovery in LLM Agents

Add code
Oct 08, 2025
Viaarxiv icon

The Cognitive Bandwidth Bottleneck: Shifting Long-Horizon Agent from Planning with Actions to Planning with Schemas

Add code
Oct 08, 2025
Viaarxiv icon

SessionIntentBench: A Multi-task Inter-session Intention-shift Modeling Benchmark for E-commerce Customer Behavior Understanding

Add code
Jul 27, 2025
Viaarxiv icon

INFERENCEDYNAMICS: Efficient Routing Across LLMs through Structured Capability and Knowledge Profiling

Add code
May 22, 2025
Figure 1 for INFERENCEDYNAMICS: Efficient Routing Across LLMs through Structured Capability and Knowledge Profiling
Figure 2 for INFERENCEDYNAMICS: Efficient Routing Across LLMs through Structured Capability and Knowledge Profiling
Figure 3 for INFERENCEDYNAMICS: Efficient Routing Across LLMs through Structured Capability and Knowledge Profiling
Figure 4 for INFERENCEDYNAMICS: Efficient Routing Across LLMs through Structured Capability and Knowledge Profiling
Viaarxiv icon

Legal Rule Induction: Towards Generalizable Principle Discovery from Analogous Judicial Precedents

Add code
May 20, 2025
Viaarxiv icon

Towards Multi-Agent Reasoning Systems for Collaborative Expertise Delegation: An Exploratory Design Study

Add code
May 12, 2025
Viaarxiv icon

The Curse of CoT: On the Limitations of Chain-of-Thought in In-Context Learning

Add code
Apr 07, 2025
Figure 1 for The Curse of CoT: On the Limitations of Chain-of-Thought in In-Context Learning
Figure 2 for The Curse of CoT: On the Limitations of Chain-of-Thought in In-Context Learning
Figure 3 for The Curse of CoT: On the Limitations of Chain-of-Thought in In-Context Learning
Figure 4 for The Curse of CoT: On the Limitations of Chain-of-Thought in In-Context Learning
Viaarxiv icon

EcomEdit: An Automated E-commerce Knowledge Editing Framework for Enhanced Product and Purchase Intention Understanding

Add code
Oct 18, 2024
Figure 1 for EcomEdit: An Automated E-commerce Knowledge Editing Framework for Enhanced Product and Purchase Intention Understanding
Figure 2 for EcomEdit: An Automated E-commerce Knowledge Editing Framework for Enhanced Product and Purchase Intention Understanding
Figure 3 for EcomEdit: An Automated E-commerce Knowledge Editing Framework for Enhanced Product and Purchase Intention Understanding
Figure 4 for EcomEdit: An Automated E-commerce Knowledge Editing Framework for Enhanced Product and Purchase Intention Understanding
Viaarxiv icon

On the Role of Entity and Event Level Conceptualization in Generalizable Reasoning: A Survey of Tasks, Methods, Applications, and Future Directions

Add code
Jun 16, 2024
Figure 1 for On the Role of Entity and Event Level Conceptualization in Generalizable Reasoning: A Survey of Tasks, Methods, Applications, and Future Directions
Figure 2 for On the Role of Entity and Event Level Conceptualization in Generalizable Reasoning: A Survey of Tasks, Methods, Applications, and Future Directions
Figure 3 for On the Role of Entity and Event Level Conceptualization in Generalizable Reasoning: A Survey of Tasks, Methods, Applications, and Future Directions
Figure 4 for On the Role of Entity and Event Level Conceptualization in Generalizable Reasoning: A Survey of Tasks, Methods, Applications, and Future Directions
Viaarxiv icon

MIND: Multimodal Shopping Intention Distillation from Large Vision-language Models for E-commerce Purchase Understanding

Add code
Jun 15, 2024
Figure 1 for MIND: Multimodal Shopping Intention Distillation from Large Vision-language Models for E-commerce Purchase Understanding
Figure 2 for MIND: Multimodal Shopping Intention Distillation from Large Vision-language Models for E-commerce Purchase Understanding
Figure 3 for MIND: Multimodal Shopping Intention Distillation from Large Vision-language Models for E-commerce Purchase Understanding
Figure 4 for MIND: Multimodal Shopping Intention Distillation from Large Vision-language Models for E-commerce Purchase Understanding
Viaarxiv icon