Li Zhang

Shammie

Incorporating Feature Pyramid Tokenization and Open Vocabulary Semantic Segmentation

Dec 18, 2024
Via arXiv

On the Limit of Language Models as Planning Formalizers

Dec 13, 2024
Via arXiv

OmniDrag: Enabling Motion Control for Omnidirectional Image-to-Video Generation

Dec 12, 2024
Via arXiv

UniScene: Unified Occupancy-centric Driving Scene Generation

Dec 06, 2024
Via arXiv

A Framework For Image Synthesis Using Supervised Contrastive Learning

Dec 05, 2024
Via arXiv

A Multi-Agent Framework for Extensible Structured Text Generation in PLCs

Dec 03, 2024
Via arXiv

Explainable CTR Prediction via LLM Reasoning

Dec 03, 2024
Via arXiv

Driving Scene Synthesis on Free-form Trajectories with Generative Prior

Dec 02, 2024
Via arXiv

Query Performance Explanation through Large Language Model for HTAP Systems

Dec 02, 2024
Via arXiv

DroidCall: A Dataset for LLM-powered Android Intent Invocation

Nov 30, 2024
Via arXiv