Ansong Ni

Spider2-V: How Far Are Multimodal Agents From Automating Data Science and Engineering Workflows?

Jul 15, 2024
Via arXiv

NExT: Teaching Large Language Models to Reason about Code Execution

Apr 23, 2024

Quantifying Contamination in Evaluating Code Generation Capabilities of Language Models

Mar 06, 2024

L2CEval: Evaluating Language-to-Code Generation Capabilities of Large Language Models

Oct 02, 2023

LEVER: Learning to Verify Language-to-Code Generation with Execution

Feb 16, 2023

Explicit Knowledge Transfer for Weakly-Supervised Code Generation

Nov 30, 2022

FOLIO: Natural Language Reasoning with First-Order Logic

Sep 02, 2022

Learning from Self-Sampled Correct and Partially-Correct Programs

May 28, 2022

Leveraging Locality in Abstractive Text Summarization

May 25, 2022

UnifiedSKG: Unifying and Multi-Tasking Structured Knowledge Grounding with Text-to-Text Language Models

Jan 20, 2022