Picture for Songyang Zhang

Songyang Zhang

University of Louisiana at Lafayette, USA

Adapting LLaMA Decoder to Vision Transformer

Add code
Apr 13, 2024
Viaarxiv icon

Ada-LEval: Evaluating long-context LLMs with length-adaptable benchmarks

Add code
Apr 10, 2024
Figure 1 for Ada-LEval: Evaluating long-context LLMs with length-adaptable benchmarks
Figure 2 for Ada-LEval: Evaluating long-context LLMs with length-adaptable benchmarks
Figure 3 for Ada-LEval: Evaluating long-context LLMs with length-adaptable benchmarks
Figure 4 for Ada-LEval: Evaluating long-context LLMs with length-adaptable benchmarks
Viaarxiv icon

InternLM-XComposer2-4KHD: A Pioneering Large Vision-Language Model Handling Resolutions from 336 Pixels to 4K HD

Add code
Apr 09, 2024
Figure 1 for InternLM-XComposer2-4KHD: A Pioneering Large Vision-Language Model Handling Resolutions from 336 Pixels to 4K HD
Figure 2 for InternLM-XComposer2-4KHD: A Pioneering Large Vision-Language Model Handling Resolutions from 336 Pixels to 4K HD
Figure 3 for InternLM-XComposer2-4KHD: A Pioneering Large Vision-Language Model Handling Resolutions from 336 Pixels to 4K HD
Figure 4 for InternLM-XComposer2-4KHD: A Pioneering Large Vision-Language Model Handling Resolutions from 336 Pixels to 4K HD
Viaarxiv icon

From Pixels to Graphs: Open-Vocabulary Scene Graph Generation with Vision-Language Models

Add code
Apr 06, 2024
Viaarxiv icon

InternLM2 Technical Report

Add code
Mar 26, 2024
Figure 1 for InternLM2 Technical Report
Figure 2 for InternLM2 Technical Report
Figure 3 for InternLM2 Technical Report
Figure 4 for InternLM2 Technical Report
Viaarxiv icon

RadioGAT: A Joint Model-based and Data-driven Framework for Multi-band Radiomap Reconstruction via Graph Attention Networks

Add code
Mar 25, 2024
Figure 1 for RadioGAT: A Joint Model-based and Data-driven Framework for Multi-band Radiomap Reconstruction via Graph Attention Networks
Figure 2 for RadioGAT: A Joint Model-based and Data-driven Framework for Multi-band Radiomap Reconstruction via Graph Attention Networks
Figure 3 for RadioGAT: A Joint Model-based and Data-driven Framework for Multi-band Radiomap Reconstruction via Graph Attention Networks
Figure 4 for RadioGAT: A Joint Model-based and Data-driven Framework for Multi-band Radiomap Reconstruction via Graph Attention Networks
Viaarxiv icon

Benchmarking Chinese Commonsense Reasoning of LLMs: From Chinese-Specifics to Reasoning-Memorization Correlations

Add code
Mar 21, 2024
Viaarxiv icon

InternLM-Math: Open Math Large Language Models Toward Verifiable Reasoning

Add code
Feb 09, 2024
Figure 1 for InternLM-Math: Open Math Large Language Models Toward Verifiable Reasoning
Figure 2 for InternLM-Math: Open Math Large Language Models Toward Verifiable Reasoning
Figure 3 for InternLM-Math: Open Math Large Language Models Toward Verifiable Reasoning
Figure 4 for InternLM-Math: Open Math Large Language Models Toward Verifiable Reasoning
Viaarxiv icon

InternLM-XComposer2: Mastering Free-form Text-Image Composition and Comprehension in Vision-Language Large Model

Add code
Jan 29, 2024
Figure 1 for InternLM-XComposer2: Mastering Free-form Text-Image Composition and Comprehension in Vision-Language Large Model
Figure 2 for InternLM-XComposer2: Mastering Free-form Text-Image Composition and Comprehension in Vision-Language Large Model
Figure 3 for InternLM-XComposer2: Mastering Free-form Text-Image Composition and Comprehension in Vision-Language Large Model
Figure 4 for InternLM-XComposer2: Mastering Free-form Text-Image Composition and Comprehension in Vision-Language Large Model
Viaarxiv icon

SGTR+: End-to-end Scene Graph Generation with Transformer

Add code
Jan 23, 2024
Figure 1 for SGTR+: End-to-end Scene Graph Generation with Transformer
Figure 2 for SGTR+: End-to-end Scene Graph Generation with Transformer
Figure 3 for SGTR+: End-to-end Scene Graph Generation with Transformer
Figure 4 for SGTR+: End-to-end Scene Graph Generation with Transformer
Viaarxiv icon