Picture for Lijuan Wang

Lijuan Wang

Cross-border Commodity Pricing Strategy Optimization via Mixed Neural Network for Time Series Analysis

Add code
Aug 22, 2024
Figure 1 for Cross-border Commodity Pricing Strategy Optimization via Mixed Neural Network for Time Series Analysis
Figure 2 for Cross-border Commodity Pricing Strategy Optimization via Mixed Neural Network for Time Series Analysis
Figure 3 for Cross-border Commodity Pricing Strategy Optimization via Mixed Neural Network for Time Series Analysis
Figure 4 for Cross-border Commodity Pricing Strategy Optimization via Mixed Neural Network for Time Series Analysis
Viaarxiv icon

AutoDirector: Online Auto-scheduling Agents for Multi-sensory Composition

Add code
Aug 21, 2024
Figure 1 for AutoDirector: Online Auto-scheduling Agents for Multi-sensory Composition
Figure 2 for AutoDirector: Online Auto-scheduling Agents for Multi-sensory Composition
Figure 3 for AutoDirector: Online Auto-scheduling Agents for Multi-sensory Composition
Figure 4 for AutoDirector: Online Auto-scheduling Agents for Multi-sensory Composition
Viaarxiv icon

MM-Vet v2: A Challenging Benchmark to Evaluate Large Multimodal Models for Integrated Capabilities

Add code
Aug 01, 2024
Figure 1 for MM-Vet v2: A Challenging Benchmark to Evaluate Large Multimodal Models for Integrated Capabilities
Figure 2 for MM-Vet v2: A Challenging Benchmark to Evaluate Large Multimodal Models for Integrated Capabilities
Figure 3 for MM-Vet v2: A Challenging Benchmark to Evaluate Large Multimodal Models for Integrated Capabilities
Figure 4 for MM-Vet v2: A Challenging Benchmark to Evaluate Large Multimodal Models for Integrated Capabilities
Viaarxiv icon

IDOL: Unified Dual-Modal Latent Diffusion for Human-Centric Joint Video-Depth Generation

Add code
Jul 15, 2024
Figure 1 for IDOL: Unified Dual-Modal Latent Diffusion for Human-Centric Joint Video-Depth Generation
Figure 2 for IDOL: Unified Dual-Modal Latent Diffusion for Human-Centric Joint Video-Depth Generation
Figure 3 for IDOL: Unified Dual-Modal Latent Diffusion for Human-Centric Joint Video-Depth Generation
Figure 4 for IDOL: Unified Dual-Modal Latent Diffusion for Human-Centric Joint Video-Depth Generation
Viaarxiv icon

Certainly Uncertain: A Benchmark and Metric for Multimodal Epistemic and Aleatoric Awareness

Add code
Jul 02, 2024
Figure 1 for Certainly Uncertain: A Benchmark and Metric for Multimodal Epistemic and Aleatoric Awareness
Figure 2 for Certainly Uncertain: A Benchmark and Metric for Multimodal Epistemic and Aleatoric Awareness
Figure 3 for Certainly Uncertain: A Benchmark and Metric for Multimodal Epistemic and Aleatoric Awareness
Figure 4 for Certainly Uncertain: A Benchmark and Metric for Multimodal Epistemic and Aleatoric Awareness
Viaarxiv icon

VideoGUI: A Benchmark for GUI Automation from Instructional Videos

Add code
Jun 14, 2024
Figure 1 for VideoGUI: A Benchmark for GUI Automation from Instructional Videos
Figure 2 for VideoGUI: A Benchmark for GUI Automation from Instructional Videos
Figure 3 for VideoGUI: A Benchmark for GUI Automation from Instructional Videos
Figure 4 for VideoGUI: A Benchmark for GUI Automation from Instructional Videos
Viaarxiv icon

MMWorld: Towards Multi-discipline Multi-faceted World Model Evaluation in Videos

Add code
Jun 12, 2024
Figure 1 for MMWorld: Towards Multi-discipline Multi-faceted World Model Evaluation in Videos
Figure 2 for MMWorld: Towards Multi-discipline Multi-faceted World Model Evaluation in Videos
Figure 3 for MMWorld: Towards Multi-discipline Multi-faceted World Model Evaluation in Videos
Figure 4 for MMWorld: Towards Multi-discipline Multi-faceted World Model Evaluation in Videos
Viaarxiv icon

Motion Consistency Model: Accelerating Video Diffusion with Disentangled Motion-Appearance Distillation

Add code
Jun 11, 2024
Viaarxiv icon

Leveraging Visual Tokens for Extended Text Contexts in Multi-Modal Learning

Add code
Jun 04, 2024
Viaarxiv icon

List Items One by One: A New Data Source and Learning Paradigm for Multimodal LLMs

Add code
Apr 25, 2024
Viaarxiv icon