Ang Lv

Interpreting Key Mechanisms of Factual Recall in Transformer-Based Language Models

Apr 09, 2024
Ang Lv, Kaiyi Zhang, Yuhan Chen, Yulong Wang, Lifeng Liu, Ji-Rong Wen, Jian Xie, Rui Yan

Via arXiv

Masked Thought: Simply Masking Partial Reasoning Steps Can Improve Mathematical Reasoning Learning of Language Models

Mar 04, 2024
Changyu Chen, Xiting Wang, Ting-En Lin, Ang Lv, Yuchuan Wu, Xin Gao, Ji-Rong Wen, Rui Yan, Yongbin Li

Via arXiv

Batch-ICL: Effective, Efficient, and Order-Agnostic In-Context Learning

Jan 12, 2024
Kaiyi Zhang, Ang Lv, Yuhan Chen, Hansen Ha, Tao Xu, Rui Yan

Via arXiv

Fortify the Shortest Stave in Attention: Enhancing Context Awareness of Large Language Models for Effective Tool Use

Dec 07, 2023
Yuhan Chen, Ang Lv, Ting-En Lin, Changyu Chen, Yuchuan Wu, Fei Huang, Yongbin Li, Rui Yan

Via arXiv

Are We Falling in a Middle-Intelligence Trap? An Analysis and Mitigation of the Reversal Curse

Nov 16, 2023
Ang Lv, Kaiyi Zhang, Shufang Xie, Quan Tu, Yuhan Chen, Ji-Rong Wen, Rui Yan

Via arXiv

DialoGPS: Dialogue Path Sampling in Continuous Semantic Space for Data Augmentation in Multi-Turn Conversations

Jun 29, 2023
Ang Lv, Jinpeng Li, Yuhan Chen, Xing Gao, Ji Zhang, Rui Yan

Via arXiv

GETMusic: Generating Any Music Tracks with a Unified Representation and Diffusion Framework

May 18, 2023
Ang Lv, Xu Tan, Peiling Lu, Wei Ye, Shikun Zhang, Jiang Bian, Rui Yan

Via arXiv

Re-creation of Creations: A New Paradigm for Lyric-to-Melody Generation

Aug 18, 2022
Ang Lv, Xu Tan, Tao Qin, Tie-Yan Liu, Rui Yan

Via arXiv