Deepanway Ghosal

Tango 2: Aligning Diffusion-based Text-to-Audio Generations through Direct Preference Optimization

Apr 16, 2024
Navonil Majumder, Chia-Yu Hung, Deepanway Ghosal, Wei-Ning Hsu, Rada Mihalcea, Soujanya Poria

PuzzleVQA: Diagnosing Multimodal Reasoning Challenges of Language Models with Abstract Visual Patterns

Mar 20, 2024
Yew Ken Chia, Vernon Toh Yan Han, Deepanway Ghosal, Lidong Bing, Soujanya Poria

Are Language Models Puzzle Prodigies? Algorithmic Puzzles Unveil Serious Challenges in Multimodal Reasoning

Mar 13, 2024
Deepanway Ghosal, Vernon Toh Yan Han, Chia Yew Ken, Soujanya Poria

Stuck in the Quicksand of Numeracy, Far from AGI Summit: Evaluating LLMs' Mathematical Competency through Ontology-guided Perturbations

Jan 17, 2024
Pengfei Hong, Deepanway Ghosal, Navonil Majumder, Somak Aditya, Rada Mihalcea, Soujanya Poria

Mustango: Toward Controllable Text-to-Music Generation

Nov 14, 2023
Jan Melechovsky, Zixun Guo, Deepanway Ghosal, Navonil Majumder, Dorien Herremans, Soujanya Poria

Language Guided Visual Question Answering: Elevate Your Multimodal Language Model Using Knowledge-Enriched Prompts

Oct 31, 2023
Deepanway Ghosal, Navonil Majumder, Roy Ka-Wei Lee, Rada Mihalcea, Soujanya Poria

Flacuna: Unleashing the Problem Solving Power of Vicuna using FLAN Fine-Tuning

Jul 05, 2023
Deepanway Ghosal, Yew Ken Chia, Navonil Majumder, Soujanya Poria

STOAT: Structured Data to Analytical Text With Controls

May 19, 2023
Deepanway Ghosal, Preksha Nema, Aravindan Raghuveer

Text-to-Audio Generation using Instruction-Tuned LLM and Latent Diffusion Model

Apr 24, 2023
Deepanway Ghosal, Navonil Majumder, Ambuj Mehrish, Soujanya Poria
