
Ziyu Yao

George Mason University

A Practical Review of Mechanistic Interpretability for Transformer-Based Language Models

Jul 02, 2024

An Investigation of Neuron Activation as a Unified Lens to Explain Chain-of-Thought Eliciting Arithmetic Reasoning of LLMs

Jun 18, 2024

Look Further Ahead: Testing the Limits of GPT-4 in Path Planning

Jun 17, 2024

MathVC: An LLM-Simulated Multi-Character Virtual Classroom for Mathematics Education

Apr 10, 2024

Lens: A Foundation Model for Network Traffic in Cybersecurity

Feb 09, 2024

Large Language Model Cascades with Mixture of Thoughts Representations for Cost-efficient Reasoning

Oct 07, 2023

Instance Needs More Care: Rewriting Prompts for Instances Yields Better Zero-Shot Performance

Oct 05, 2023

Can Large Language Models be Good Path Planners? A Benchmark and Investigation on Spatial-temporal Reasoning

Oct 05, 2023

Gentopia: A Collaborative Platform for Tool-Augmented LLMs

Aug 08, 2023

Improving Generalization in Language Model-Based Text-to-SQL Semantic Parsing: Two Simple Semantic Boundary-Based Techniques

May 27, 2023