Da Ma

Reducing Tool Hallucination via Reliability Alignment

Dec 05, 2024

Compressing KV Cache for Long-Context LLM Inference with Inter-Layer Attention Similarity

Dec 03, 2024

SciDFM: A Large Language Model with Mixture-of-Experts for Science

Sep 27, 2024

Evolving Subnetwork Training for Large Language Models

Jun 11, 2024

Sparsity-Accelerated Training for Large Language Models

Jun 03, 2024

Rejection Improves Reliability: Training LLMs to Refuse Unknown Questions Using RL from Knowledge Feedback

Apr 07, 2024

Hierarchical Multimodal Pre-training for Visually Rich Webpage Understanding

Feb 28, 2024

ChemDFM: Dialogue Foundation Model for Chemistry

Jan 26, 2024

ASTormer: An AST Structure-aware Transformer Decoder for Text-to-SQL

Oct 28, 2023

SciEval: A Multi-Level Large Language Model Evaluation Benchmark for Scientific Research

Aug 25, 2023