Picture for Zhifang Sui

Zhifang Sui

Large Language Models Are Unconscious of Unreasonability in Math Problems

Add code
Mar 28, 2024
Figure 1 for Large Language Models Are Unconscious of Unreasonability in Math Problems
Figure 2 for Large Language Models Are Unconscious of Unreasonability in Math Problems
Figure 3 for Large Language Models Are Unconscious of Unreasonability in Math Problems
Figure 4 for Large Language Models Are Unconscious of Unreasonability in Math Problems
Viaarxiv icon

Reducing Hallucinations in Entity Abstract Summarization with Facts-Template Decomposition

Add code
Feb 29, 2024
Figure 1 for Reducing Hallucinations in Entity Abstract Summarization with Facts-Template Decomposition
Figure 2 for Reducing Hallucinations in Entity Abstract Summarization with Facts-Template Decomposition
Figure 3 for Reducing Hallucinations in Entity Abstract Summarization with Facts-Template Decomposition
Figure 4 for Reducing Hallucinations in Entity Abstract Summarization with Facts-Template Decomposition
Viaarxiv icon

ShieldLM: Empowering LLMs as Aligned, Customizable and Explainable Safety Detectors

Add code
Feb 26, 2024
Figure 1 for ShieldLM: Empowering LLMs as Aligned, Customizable and Explainable Safety Detectors
Figure 2 for ShieldLM: Empowering LLMs as Aligned, Customizable and Explainable Safety Detectors
Figure 3 for ShieldLM: Empowering LLMs as Aligned, Customizable and Explainable Safety Detectors
Figure 4 for ShieldLM: Empowering LLMs as Aligned, Customizable and Explainable Safety Detectors
Viaarxiv icon

PeriodicLoRA: Breaking the Low-Rank Bottleneck in LoRA Optimization

Add code
Feb 25, 2024
Figure 1 for PeriodicLoRA: Breaking the Low-Rank Bottleneck in LoRA Optimization
Figure 2 for PeriodicLoRA: Breaking the Low-Rank Bottleneck in LoRA Optimization
Figure 3 for PeriodicLoRA: Breaking the Low-Rank Bottleneck in LoRA Optimization
Figure 4 for PeriodicLoRA: Breaking the Low-Rank Bottleneck in LoRA Optimization
Viaarxiv icon

Synthetic Data (Almost) from Scratch: Generalized Instruction Tuning for Language Models

Add code
Feb 20, 2024
Viaarxiv icon

Can Large Multimodal Models Uncover Deep Semantics Behind Images?

Add code
Feb 17, 2024
Figure 1 for Can Large Multimodal Models Uncover Deep Semantics Behind Images?
Figure 2 for Can Large Multimodal Models Uncover Deep Semantics Behind Images?
Figure 3 for Can Large Multimodal Models Uncover Deep Semantics Behind Images?
Figure 4 for Can Large Multimodal Models Uncover Deep Semantics Behind Images?
Viaarxiv icon

Unlocking Efficiency in Large Language Model Inference: A Comprehensive Survey of Speculative Decoding

Add code
Jan 15, 2024
Viaarxiv icon

DeepSeekMoE: Towards Ultimate Expert Specialization in Mixture-of-Experts Language Models

Add code
Jan 11, 2024
Figure 1 for DeepSeekMoE: Towards Ultimate Expert Specialization in Mixture-of-Experts Language Models
Figure 2 for DeepSeekMoE: Towards Ultimate Expert Specialization in Mixture-of-Experts Language Models
Figure 3 for DeepSeekMoE: Towards Ultimate Expert Specialization in Mixture-of-Experts Language Models
Figure 4 for DeepSeekMoE: Towards Ultimate Expert Specialization in Mixture-of-Experts Language Models
Viaarxiv icon

Language Models Understand Numbers, at Least Partially

Add code
Jan 08, 2024
Viaarxiv icon

Math-Shepherd: Verify and Reinforce LLMs Step-by-step without Human Annotations

Add code
Dec 28, 2023
Figure 1 for Math-Shepherd: Verify and Reinforce LLMs Step-by-step without Human Annotations
Figure 2 for Math-Shepherd: Verify and Reinforce LLMs Step-by-step without Human Annotations
Figure 3 for Math-Shepherd: Verify and Reinforce LLMs Step-by-step without Human Annotations
Figure 4 for Math-Shepherd: Verify and Reinforce LLMs Step-by-step without Human Annotations
Viaarxiv icon