Alert button
Picture for Sujian Li

Sujian Li

Alert button

LongEmbed: Extending Embedding Models for Long Context Retrieval

Add code
Bookmark button
Alert button
Apr 18, 2024
Dawei Zhu, Liang Wang, Nan Yang, Yifan Song, Wenhao Wu, Furu Wei, Sujian Li

Viaarxiv icon

CoUDA: Coherence Evaluation via Unified Data Augmentation

Add code
Bookmark button
Alert button
Mar 31, 2024
Dawei Zhu, Wenhao Wu, Yifan Song, Fangwei Zhu, Ziqiang Cao, Sujian Li

Viaarxiv icon

Trial and Error: Exploration-Based Trajectory Optimization for LLM Agents

Add code
Bookmark button
Alert button
Mar 04, 2024
Yifan Song, Da Yin, Xiang Yue, Jie Huang, Sujian Li, Bill Yuchen Lin

Figure 1 for Trial and Error: Exploration-Based Trajectory Optimization for LLM Agents
Figure 2 for Trial and Error: Exploration-Based Trajectory Optimization for LLM Agents
Figure 3 for Trial and Error: Exploration-Based Trajectory Optimization for LLM Agents
Figure 4 for Trial and Error: Exploration-Based Trajectory Optimization for LLM Agents
Viaarxiv icon

Retrieval-based Full-length Wikipedia Generation for Emergent Events

Add code
Bookmark button
Alert button
Feb 28, 2024
Jiebin Zhang, Eugene J. Yu, Qinyu Chen, Chenhao Xiong, Dawei Zhu, Han Qian, Mingbo Song, Xiaoguang Li, Qun Liu, Sujian Li

Viaarxiv icon

Selecting Large Language Model to Fine-tune via Rectified Scaling Law

Add code
Bookmark button
Alert button
Feb 04, 2024
Haowei Lin, Baizhou Huang, Haotian Ye, Qinyu Chen, Zihao Wang, Sujian Li, Jianzhu Ma, Xiaojun Wan, James Zou, Yitao Liang

Viaarxiv icon

KBioXLM: A Knowledge-anchored Biomedical Multilingual Pretrained Language Model

Add code
Bookmark button
Alert button
Nov 20, 2023
Lei Geng, Xu Yan, Ziqiang Cao, Juntao Li, Wenjie Li, Sujian Li, Xinjie Zhou, Yang Yang, Jun Zhang

Viaarxiv icon

Rationale-Enhanced Language Models are Better Continual Relation Learners

Add code
Bookmark button
Alert button
Oct 10, 2023
Weimin Xiong, Yifan Song, Peiyi Wang, Sujian Li

Figure 1 for Rationale-Enhanced Language Models are Better Continual Relation Learners
Figure 2 for Rationale-Enhanced Language Models are Better Continual Relation Learners
Figure 3 for Rationale-Enhanced Language Models are Better Continual Relation Learners
Figure 4 for Rationale-Enhanced Language Models are Better Continual Relation Learners
Viaarxiv icon

InfoCL: Alleviating Catastrophic Forgetting in Continual Text Classification from An Information Theoretic Perspective

Add code
Bookmark button
Alert button
Oct 10, 2023
Yifan Song, Peiyi Wang, Weimin Xiong, Dawei Zhu, Tianyu Liu, Zhifang Sui, Sujian Li

Figure 1 for InfoCL: Alleviating Catastrophic Forgetting in Continual Text Classification from An Information Theoretic Perspective
Figure 2 for InfoCL: Alleviating Catastrophic Forgetting in Continual Text Classification from An Information Theoretic Perspective
Figure 3 for InfoCL: Alleviating Catastrophic Forgetting in Continual Text Classification from An Information Theoretic Perspective
Figure 4 for InfoCL: Alleviating Catastrophic Forgetting in Continual Text Classification from An Information Theoretic Perspective
Viaarxiv icon

PoSE: Efficient Context Window Extension of LLMs via Positional Skip-wise Training

Add code
Bookmark button
Alert button
Sep 19, 2023
Dawei Zhu, Nan Yang, Liang Wang, Yifan Song, Wenhao Wu, Furu Wei, Sujian Li

Figure 1 for PoSE: Efficient Context Window Extension of LLMs via Positional Skip-wise Training
Figure 2 for PoSE: Efficient Context Window Extension of LLMs via Positional Skip-wise Training
Figure 3 for PoSE: Efficient Context Window Extension of LLMs via Positional Skip-wise Training
Figure 4 for PoSE: Efficient Context Window Extension of LLMs via Positional Skip-wise Training
Viaarxiv icon

RestGPT: Connecting Large Language Models with Real-World Applications via RESTful APIs

Add code
Bookmark button
Alert button
Jun 11, 2023
Yifan Song, Weimin Xiong, Dawei Zhu, Cheng Li, Ke Wang, Ye Tian, Sujian Li

Figure 1 for RestGPT: Connecting Large Language Models with Real-World Applications via RESTful APIs
Figure 2 for RestGPT: Connecting Large Language Models with Real-World Applications via RESTful APIs
Figure 3 for RestGPT: Connecting Large Language Models with Real-World Applications via RESTful APIs
Figure 4 for RestGPT: Connecting Large Language Models with Real-World Applications via RESTful APIs
Viaarxiv icon