Nan Du

Advancing General Multimodal Capability of Vision-language Models with Pyramid-descent Visual Position Encoding

Jan 19, 2025
Via arXiv

Instruction-Following Pruning for Large Language Models

Jan 07, 2025
Via arXiv

Adapting to Non-Stationary Environments: Multi-Armed Bandit Enhanced Retrieval-Augmented Generation on Knowledge Graphs

Dec 10, 2024
Via arXiv

MBA-RAG: a Bandit Approach for Adaptive Retrieval-Augmented Generation through Question Complexity

Dec 03, 2024
Via arXiv

EC-DIT: Scaling Diffusion Transformers with Adaptive Expert-Choice Routing

Oct 02, 2024
Via arXiv

Apple Intelligence Foundation Language Models

Jul 29, 2024
Via arXiv

Deep State-Space Generative Model For Correlated Time-to-Event Predictions

Jul 28, 2024
Via arXiv

Learning to Select the Best Forecasting Tasks for Clinical Outcome Prediction

Jul 28, 2024
Via arXiv

Revisiting MoE and Dense Speed-Accuracy Comparisons for LLM Training

May 23, 2024
Via arXiv

Knowledge Graph Reasoning with Self-supervised Reinforcement Learning

May 22, 2024
Via arXiv