Picture for Tianxiang Sun

Tianxiang Sun

Unified Active Retrieval for Retrieval Augmented Generation

Add code
Jun 18, 2024
Viaarxiv icon

Aggregation of Reasoning: A Hierarchical Framework for Enhancing Answer Selection in Large Language Models

Add code
May 21, 2024
Viaarxiv icon

Data Mixing Laws: Optimizing Data Mixtures by Predicting Language Modeling Performance

Add code
Mar 25, 2024
Figure 1 for Data Mixing Laws: Optimizing Data Mixtures by Predicting Language Modeling Performance
Figure 2 for Data Mixing Laws: Optimizing Data Mixtures by Predicting Language Modeling Performance
Figure 3 for Data Mixing Laws: Optimizing Data Mixtures by Predicting Language Modeling Performance
Figure 4 for Data Mixing Laws: Optimizing Data Mixtures by Predicting Language Modeling Performance
Viaarxiv icon

In-Memory Learning: A Declarative Learning Framework for Large Language Models

Add code
Mar 05, 2024
Figure 1 for In-Memory Learning: A Declarative Learning Framework for Large Language Models
Figure 2 for In-Memory Learning: A Declarative Learning Framework for Large Language Models
Figure 3 for In-Memory Learning: A Declarative Learning Framework for Large Language Models
Figure 4 for In-Memory Learning: A Declarative Learning Framework for Large Language Models
Viaarxiv icon

AnyGPT: Unified Multimodal LLM with Discrete Sequence Modeling

Add code
Feb 26, 2024
Figure 1 for AnyGPT: Unified Multimodal LLM with Discrete Sequence Modeling
Figure 2 for AnyGPT: Unified Multimodal LLM with Discrete Sequence Modeling
Figure 3 for AnyGPT: Unified Multimodal LLM with Discrete Sequence Modeling
Figure 4 for AnyGPT: Unified Multimodal LLM with Discrete Sequence Modeling
Viaarxiv icon

Turn Waste into Worth: Rectifying Top-$k$ Router of MoE

Add code
Feb 21, 2024
Viaarxiv icon

Dictionary Learning Improves Patch-Free Circuit Discovery in Mechanistic Interpretability: A Case Study on Othello-GPT

Add code
Feb 19, 2024
Viaarxiv icon

LLM can Achieve Self-Regulation via Hyperparameter Aware Generation

Add code
Feb 17, 2024
Figure 1 for LLM can Achieve Self-Regulation via Hyperparameter Aware Generation
Figure 2 for LLM can Achieve Self-Regulation via Hyperparameter Aware Generation
Figure 3 for LLM can Achieve Self-Regulation via Hyperparameter Aware Generation
Figure 4 for LLM can Achieve Self-Regulation via Hyperparameter Aware Generation
Viaarxiv icon

Can AI Assistants Know What They Don't Know?

Add code
Jan 28, 2024
Figure 1 for Can AI Assistants Know What They Don't Know?
Figure 2 for Can AI Assistants Know What They Don't Know?
Figure 3 for Can AI Assistants Know What They Don't Know?
Figure 4 for Can AI Assistants Know What They Don't Know?
Viaarxiv icon

DenoSent: A Denoising Objective for Self-Supervised Sentence Representation Learning

Add code
Jan 24, 2024
Viaarxiv icon