[LLM] Support block_attention/cachekv quant for llama #7649

Merged
wawltor merged 15 commits into PaddlePaddle:develop from RichardWooSJTU:restruct_52_dev on Jan 10, 2024

Commits

Commits on Dec 20, 2023

Commits on Dec 29, 2023

Commits on Jan 5, 2024

Commits on Jan 8, 2024

Commits on Jan 9, 2024

Commits on Jan 10, 2024