Picture for Subhabrata Dutta

Subhabrata Dutta

Language Models can Exploit Cross-Task In-context Learning for Data-Scarce Novel Tasks

Add code
May 17, 2024
Figure 1 for Language Models can Exploit Cross-Task In-context Learning for Data-Scarce Novel Tasks
Figure 2 for Language Models can Exploit Cross-Task In-context Learning for Data-Scarce Novel Tasks
Figure 3 for Language Models can Exploit Cross-Task In-context Learning for Data-Scarce Novel Tasks
Figure 4 for Language Models can Exploit Cross-Task In-context Learning for Data-Scarce Novel Tasks
Viaarxiv icon

$\texttt{LM}^\texttt{2}$: A Simple Society of Language Models Solves Complex Reasoning

Add code
Apr 02, 2024
Figure 1 for $\texttt{LM}^\texttt{2}$: A Simple Society of Language Models Solves Complex Reasoning
Figure 2 for $\texttt{LM}^\texttt{2}$: A Simple Society of Language Models Solves Complex Reasoning
Figure 3 for $\texttt{LM}^\texttt{2}$: A Simple Society of Language Models Solves Complex Reasoning
Figure 4 for $\texttt{LM}^\texttt{2}$: A Simple Society of Language Models Solves Complex Reasoning
Viaarxiv icon

How to think step-by-step: A mechanistic understanding of chain-of-thought reasoning

Add code
Feb 28, 2024
Viaarxiv icon

Frugal LMs Trained to Invoke Symbolic Solvers Achieve Parameter-Efficient Arithmetic Reasoning

Add code
Dec 19, 2023
Figure 1 for Frugal LMs Trained to Invoke Symbolic Solvers Achieve Parameter-Efficient Arithmetic Reasoning
Figure 2 for Frugal LMs Trained to Invoke Symbolic Solvers Achieve Parameter-Efficient Arithmetic Reasoning
Figure 3 for Frugal LMs Trained to Invoke Symbolic Solvers Achieve Parameter-Efficient Arithmetic Reasoning
Figure 4 for Frugal LMs Trained to Invoke Symbolic Solvers Achieve Parameter-Efficient Arithmetic Reasoning
Viaarxiv icon

Small Language Models Fine-tuned to Coordinate Larger Language Models improve Complex Reasoning

Add code
Oct 21, 2023
Figure 1 for Small Language Models Fine-tuned to Coordinate Larger Language Models improve Complex Reasoning
Figure 2 for Small Language Models Fine-tuned to Coordinate Larger Language Models improve Complex Reasoning
Figure 3 for Small Language Models Fine-tuned to Coordinate Larger Language Models improve Complex Reasoning
Figure 4 for Small Language Models Fine-tuned to Coordinate Larger Language Models improve Complex Reasoning
Viaarxiv icon

Multilingual LLMs are Better Cross-lingual In-context Learners with Alignment

Add code
May 26, 2023
Figure 1 for Multilingual LLMs are Better Cross-lingual In-context Learners with Alignment
Figure 2 for Multilingual LLMs are Better Cross-lingual In-context Learners with Alignment
Figure 3 for Multilingual LLMs are Better Cross-lingual In-context Learners with Alignment
Figure 4 for Multilingual LLMs are Better Cross-lingual In-context Learners with Alignment
Viaarxiv icon

Hatemongers ride on echo chambers to escalate hate speech diffusion

Add code
Feb 05, 2023
Figure 1 for Hatemongers ride on echo chambers to escalate hate speech diffusion
Figure 2 for Hatemongers ride on echo chambers to escalate hate speech diffusion
Figure 3 for Hatemongers ride on echo chambers to escalate hate speech diffusion
Figure 4 for Hatemongers ride on echo chambers to escalate hate speech diffusion
Viaarxiv icon

Can Unsupervised Knowledge Transfer from Social Discussions Help Argument Mining?

Add code
Mar 24, 2022
Figure 1 for Can Unsupervised Knowledge Transfer from Social Discussions Help Argument Mining?
Figure 2 for Can Unsupervised Knowledge Transfer from Social Discussions Help Argument Mining?
Figure 3 for Can Unsupervised Knowledge Transfer from Social Discussions Help Argument Mining?
Figure 4 for Can Unsupervised Knowledge Transfer from Social Discussions Help Argument Mining?
Viaarxiv icon

Semi-supervised Stance Detection of Tweets Via Distant Network Supervision

Add code
Jan 05, 2022
Figure 1 for Semi-supervised Stance Detection of Tweets Via Distant Network Supervision
Figure 2 for Semi-supervised Stance Detection of Tweets Via Distant Network Supervision
Figure 3 for Semi-supervised Stance Detection of Tweets Via Distant Network Supervision
Figure 4 for Semi-supervised Stance Detection of Tweets Via Distant Network Supervision
Viaarxiv icon

Redesigning the Transformer Architecture with Insights from Multi-particle Dynamical Systems

Add code
Oct 03, 2021
Figure 1 for Redesigning the Transformer Architecture with Insights from Multi-particle Dynamical Systems
Figure 2 for Redesigning the Transformer Architecture with Insights from Multi-particle Dynamical Systems
Figure 3 for Redesigning the Transformer Architecture with Insights from Multi-particle Dynamical Systems
Figure 4 for Redesigning the Transformer Architecture with Insights from Multi-particle Dynamical Systems
Viaarxiv icon