Picture for Chenyang Lyu

Chenyang Lyu

Marco-o1: Towards Open Reasoning Models for Open-Ended Solutions

Add code
Nov 21, 2024
Viaarxiv icon

Large Language Models as Code Executors: An Exploratory Study

Add code
Oct 10, 2024
Figure 1 for Large Language Models as Code Executors: An Exploratory Study
Figure 2 for Large Language Models as Code Executors: An Exploratory Study
Figure 3 for Large Language Models as Code Executors: An Exploratory Study
Figure 4 for Large Language Models as Code Executors: An Exploratory Study
Viaarxiv icon

Reference-free Hallucination Detection for Large Vision-Language Models

Add code
Aug 11, 2024
Figure 1 for Reference-free Hallucination Detection for Large Vision-Language Models
Figure 2 for Reference-free Hallucination Detection for Large Vision-Language Models
Figure 3 for Reference-free Hallucination Detection for Large Vision-Language Models
Figure 4 for Reference-free Hallucination Detection for Large Vision-Language Models
Viaarxiv icon

CVQA: Culturally-diverse Multilingual Visual Question Answering Benchmark

Add code
Jun 10, 2024
Figure 1 for CVQA: Culturally-diverse Multilingual Visual Question Answering Benchmark
Figure 2 for CVQA: Culturally-diverse Multilingual Visual Question Answering Benchmark
Figure 3 for CVQA: Culturally-diverse Multilingual Visual Question Answering Benchmark
Figure 4 for CVQA: Culturally-diverse Multilingual Visual Question Answering Benchmark
Viaarxiv icon

Can a Multichoice Dataset be Repurposed for Extractive Question Answering?

Add code
Apr 26, 2024
Figure 1 for Can a Multichoice Dataset be Repurposed for Extractive Question Answering?
Figure 2 for Can a Multichoice Dataset be Repurposed for Extractive Question Answering?
Figure 3 for Can a Multichoice Dataset be Repurposed for Extractive Question Answering?
Figure 4 for Can a Multichoice Dataset be Repurposed for Extractive Question Answering?
Viaarxiv icon

Beyond Probabilities: Unveiling the Misalignment in Evaluating Large Language Models

Add code
Feb 21, 2024
Figure 1 for Beyond Probabilities: Unveiling the Misalignment in Evaluating Large Language Models
Figure 2 for Beyond Probabilities: Unveiling the Misalignment in Evaluating Large Language Models
Figure 3 for Beyond Probabilities: Unveiling the Misalignment in Evaluating Large Language Models
Figure 4 for Beyond Probabilities: Unveiling the Misalignment in Evaluating Large Language Models
Viaarxiv icon

Retrieval-augmented Multi-modal Chain-of-Thoughts Reasoning for Large Language Models

Add code
Dec 04, 2023
Viaarxiv icon

GPT4Video: A Unified Multimodal Large Language Model for lnstruction-Followed Understanding and Safety-Aware Generation

Add code
Nov 25, 2023
Viaarxiv icon

A Comprehensive Evaluation of GPT-4V on Knowledge-Intensive Visual Question Answering

Add code
Nov 13, 2023
Viaarxiv icon

Findings of the WMT 2023 Shared Task on Discourse-Level Literary Translation: A Fresh Orb in the Cosmos of LLMs

Add code
Nov 06, 2023
Viaarxiv icon