Michael R. Lyu

Making Long-Context Language Models Better Multi-Hop Reasoners

Aug 06, 2024
Via arXiv

On the Resilience of Multi-Agent Systems with Malicious Agents

Aug 02, 2024
Via arXiv

Automatically Generating UI Code from Screenshot: A Divide-and-Conquer-Based Approach

Jun 24, 2024
Via arXiv

Automatic Programming: Large Language Models and Beyond

May 03, 2024
Via arXiv

How Well Can LLMs Echo Us? Evaluating AI Chatbots' Role-Play Ability with ECHO

Apr 22, 2024
Via arXiv

Beyond Embeddings: The Promise of Visual Table in Multi-Modal Models

Mar 27, 2024
Via arXiv

How Far Are We on the Decision-Making of LLMs? Evaluating LLMs' Gaming Ability in Multi-Agent Environments

Mar 18, 2024
Via arXiv

Knowledge-aware Alert Aggregation in Large-scale Cloud Systems: a Hybrid Approach

Mar 11, 2024
Via arXiv

FaultProfIT: Hierarchical Fault Profiling of Incident Tickets in Large-scale Cloud Systems

Feb 27, 2024
Via arXiv

Asclepius: A Spectrum Evaluation Benchmark for Medical Multi-Modal Large Language Models

Feb 17, 2024
Via arXiv