Soujanya Poria

Libra-Leaderboard: Towards Responsible AI through a Balanced Leaderboard of Safety and Capability

Dec 24, 2024

Emma-X: An Embodied Multimodal Action Model with Grounded Chain of Thought and Look-ahead Spatial Reasoning

Dec 17, 2024

M-Longdoc: A Benchmark For Multimodal Super-Long Document Understanding And A Retrieval-Aware Tuning Framework

Nov 09, 2024

Two are better than one: Context window extension with multi-grained self-injection

Oct 25, 2024

Not All Votes Count! Programs as Verifiers Improve Self-Consistency of Language Models for Math Reasoning

Oct 16, 2024

MOOSE-Chem: Large Language Models for Rediscovering Unseen Chemistry Scientific Hypotheses

Oct 09, 2024

Reasoning Paths Optimization: Learning to Reason and Explore From Diverse Paths

Oct 07, 2024

Can-Do! A Dataset and Neuro-Symbolic Grounded Framework for Embodied Planning with Large Multimodal Models

Sep 22, 2024

Measuring and Enhancing Trustworthiness of LLMs in RAG through Grounded Attributions and Learning to Refuse

Sep 17, 2024

Ferret: Faster and Effective Automated Red Teaming with Reward-Based Scoring Technique

Aug 20, 2024