Manaal Faruqui

Shammie

Rubric-Based Benchmarking and Reinforcement Learning for Advancing LLM Instruction Following

Nov 13, 2025

What Matters for Model Merging at Scale?

Oct 04, 2024

Foundational Autoraters: Taming Large Language Models for Better Automatic Evaluation

Jul 15, 2024

Leave No Context Behind: Efficient Infinite Context Transformers with Infini-attention

Apr 10, 2024

Gemini: A Family of Highly Capable Multimodal Models

Dec 19, 2023

AutoMix: Automatically Mixing Language Models

Oct 19, 2023

How FaR Are Large Language Models From Agents with Theory-of-Mind?

Oct 04, 2023

Efficient Encoders for Streaming Sequence Tagging

Jan 23, 2023

Streaming Intended Query Detection using E2E Modeling for Continued Conversation

Aug 29, 2022

Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models

Jun 10, 2022