Picture for Lu Wang

Lu Wang

Shandong University

FactBench: A Dynamic Benchmark for In-the-Wild Language Model Factuality Evaluation

Add code
Oct 29, 2024
Figure 1 for FactBench: A Dynamic Benchmark for In-the-Wild Language Model Factuality Evaluation
Figure 2 for FactBench: A Dynamic Benchmark for In-the-Wild Language Model Factuality Evaluation
Figure 3 for FactBench: A Dynamic Benchmark for In-the-Wild Language Model Factuality Evaluation
Figure 4 for FactBench: A Dynamic Benchmark for In-the-Wild Language Model Factuality Evaluation
Viaarxiv icon

Deep Learning-Driven Microstructure Characterization and Vickers Hardness Prediction of Mg-Gd Alloys

Add code
Oct 27, 2024
Figure 1 for Deep Learning-Driven Microstructure Characterization and Vickers Hardness Prediction of Mg-Gd Alloys
Figure 2 for Deep Learning-Driven Microstructure Characterization and Vickers Hardness Prediction of Mg-Gd Alloys
Figure 3 for Deep Learning-Driven Microstructure Characterization and Vickers Hardness Prediction of Mg-Gd Alloys
Figure 4 for Deep Learning-Driven Microstructure Characterization and Vickers Hardness Prediction of Mg-Gd Alloys
Viaarxiv icon

Expanding Chatbot Knowledge in Customer Service: Context-Aware Similar Question Generation Using Large Language Models

Add code
Oct 16, 2024
Figure 1 for Expanding Chatbot Knowledge in Customer Service: Context-Aware Similar Question Generation Using Large Language Models
Figure 2 for Expanding Chatbot Knowledge in Customer Service: Context-Aware Similar Question Generation Using Large Language Models
Figure 3 for Expanding Chatbot Knowledge in Customer Service: Context-Aware Similar Question Generation Using Large Language Models
Figure 4 for Expanding Chatbot Knowledge in Customer Service: Context-Aware Similar Question Generation Using Large Language Models
Viaarxiv icon

Closing the Loop: Learning to Generate Writing Feedback via Language Model Simulated Student Revisions

Add code
Oct 10, 2024
Viaarxiv icon

Narrative-of-Thought: Improving Temporal Reasoning of Large Language Models via Recounted Narratives

Add code
Oct 07, 2024
Figure 1 for Narrative-of-Thought: Improving Temporal Reasoning of Large Language Models via Recounted Narratives
Figure 2 for Narrative-of-Thought: Improving Temporal Reasoning of Large Language Models via Recounted Narratives
Figure 3 for Narrative-of-Thought: Improving Temporal Reasoning of Large Language Models via Recounted Narratives
Figure 4 for Narrative-of-Thought: Improving Temporal Reasoning of Large Language Models via Recounted Narratives
Viaarxiv icon

Scalable Fine-tuning from Multiple Data Sources:A First-Order Approximation Approach

Add code
Sep 28, 2024
Viaarxiv icon

Turn Every Application into an Agent: Towards Efficient Human-Agent-Computer Interaction with API-First LLM-Based Agents

Add code
Sep 25, 2024
Viaarxiv icon

Attack End-to-End Autonomous Driving through Module-Wise Noise

Add code
Sep 12, 2024
Figure 1 for Attack End-to-End Autonomous Driving through Module-Wise Noise
Figure 2 for Attack End-to-End Autonomous Driving through Module-Wise Noise
Figure 3 for Attack End-to-End Autonomous Driving through Module-Wise Noise
Viaarxiv icon

Module-wise Adaptive Adversarial Training for End-to-end Autonomous Driving

Add code
Sep 11, 2024
Figure 1 for Module-wise Adaptive Adversarial Training for End-to-end Autonomous Driving
Figure 2 for Module-wise Adaptive Adversarial Training for End-to-end Autonomous Driving
Figure 3 for Module-wise Adaptive Adversarial Training for End-to-end Autonomous Driving
Figure 4 for Module-wise Adaptive Adversarial Training for End-to-end Autonomous Driving
Viaarxiv icon

Scaling Law with Learning Rate Annealing

Add code
Aug 20, 2024
Figure 1 for Scaling Law with Learning Rate Annealing
Figure 2 for Scaling Law with Learning Rate Annealing
Figure 3 for Scaling Law with Learning Rate Annealing
Figure 4 for Scaling Law with Learning Rate Annealing
Viaarxiv icon