Ashish Shenoy

Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context

Mar 08, 2024

Lumos : Empowering Multimodal LLMs with Scene Text Recognition

Feb 12, 2024

Gemini: A Family of Highly Capable Multimodal Models

Dec 19, 2023

Now It Sounds Like You: Learning Personalized Vocabulary On Device

May 05, 2023

Green Federated Learning

Mar 26, 2023

Domain Prompts: Towards memory and compute efficient domain adaptation of ASR systems

Dec 16, 2021

Prompt-tuning in ASR systems for efficient domain-adaptation

Oct 22, 2021

Remember the context! ASR slot error correction through memorization

Sep 18, 2021

ASR Adaptation for E-commerce Chatbots using Cross-Utterance Context and Multi-Task Language Modeling

Jun 15, 2021

"What's The Context?" : Long Context NLM Adaptation for ASR Rescoring in Conversational Agents

Apr 21, 2021