Picture for Tao Wang

Tao Wang

Improving Value Estimation Critically Enhances Vanilla Policy Gradient

Add code
May 25, 2025
Figure 1 for Improving Value Estimation Critically Enhances Vanilla Policy Gradient
Figure 2 for Improving Value Estimation Critically Enhances Vanilla Policy Gradient
Figure 3 for Improving Value Estimation Critically Enhances Vanilla Policy Gradient
Figure 4 for Improving Value Estimation Critically Enhances Vanilla Policy Gradient
Viaarxiv icon

Foundations of Top-$k$ Decoding For Language Models

Add code
May 25, 2025
Figure 1 for Foundations of Top-$k$ Decoding For Language Models
Figure 2 for Foundations of Top-$k$ Decoding For Language Models
Figure 3 for Foundations of Top-$k$ Decoding For Language Models
Figure 4 for Foundations of Top-$k$ Decoding For Language Models
Viaarxiv icon

ALTo: Adaptive-Length Tokenizer for Autoregressive Mask Generation

Add code
May 22, 2025
Figure 1 for ALTo: Adaptive-Length Tokenizer for Autoregressive Mask Generation
Figure 2 for ALTo: Adaptive-Length Tokenizer for Autoregressive Mask Generation
Figure 3 for ALTo: Adaptive-Length Tokenizer for Autoregressive Mask Generation
Figure 4 for ALTo: Adaptive-Length Tokenizer for Autoregressive Mask Generation
Viaarxiv icon

Collaborative Multi-LoRA Experts with Achievement-based Multi-Tasks Loss for Unified Multimodal Information Extraction

Add code
May 08, 2025
Viaarxiv icon

Precision Neural Network Quantization via Learnable Adaptive Modules

Add code
Apr 24, 2025
Figure 1 for Precision Neural Network Quantization via Learnable Adaptive Modules
Figure 2 for Precision Neural Network Quantization via Learnable Adaptive Modules
Figure 3 for Precision Neural Network Quantization via Learnable Adaptive Modules
Figure 4 for Precision Neural Network Quantization via Learnable Adaptive Modules
Viaarxiv icon

Memory-Augmented Dual-Decoder Networks for Multi-Class Unsupervised Anomaly Detection

Add code
Apr 21, 2025
Figure 1 for Memory-Augmented Dual-Decoder Networks for Multi-Class Unsupervised Anomaly Detection
Figure 2 for Memory-Augmented Dual-Decoder Networks for Multi-Class Unsupervised Anomaly Detection
Figure 3 for Memory-Augmented Dual-Decoder Networks for Multi-Class Unsupervised Anomaly Detection
Figure 4 for Memory-Augmented Dual-Decoder Networks for Multi-Class Unsupervised Anomaly Detection
Viaarxiv icon

Knowledge Distillation and Dataset Distillation of Large Language Models: Emerging Trends, Challenges, and Future Directions

Add code
Apr 20, 2025
Figure 1 for Knowledge Distillation and Dataset Distillation of Large Language Models: Emerging Trends, Challenges, and Future Directions
Figure 2 for Knowledge Distillation and Dataset Distillation of Large Language Models: Emerging Trends, Challenges, and Future Directions
Figure 3 for Knowledge Distillation and Dataset Distillation of Large Language Models: Emerging Trends, Challenges, and Future Directions
Figure 4 for Knowledge Distillation and Dataset Distillation of Large Language Models: Emerging Trends, Challenges, and Future Directions
Viaarxiv icon

NTIRE 2025 Challenge on Cross-Domain Few-Shot Object Detection: Methods and Results

Add code
Apr 14, 2025
Viaarxiv icon

LITE: LLM-Impelled efficient Taxonomy Evaluation

Add code
Apr 02, 2025
Viaarxiv icon

ToReMi: Topic-Aware Data Reweighting for Dynamic Pre-Training Data Selection

Add code
Apr 01, 2025
Figure 1 for ToReMi: Topic-Aware Data Reweighting for Dynamic Pre-Training Data Selection
Figure 2 for ToReMi: Topic-Aware Data Reweighting for Dynamic Pre-Training Data Selection
Figure 3 for ToReMi: Topic-Aware Data Reweighting for Dynamic Pre-Training Data Selection
Figure 4 for ToReMi: Topic-Aware Data Reweighting for Dynamic Pre-Training Data Selection
Viaarxiv icon