LocalAI/backend/python/vllm/requirements-cublas12-after.txt
commit 2defe98df8
Author: Ettore Di Giacinto
Date:   2025-11-21 18:06:46 +01:00

    fix(vllm): Update flash-attn to specific wheel URL

    Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com>

https://github.com/Dao-AILab/flash-attention/releases/download/v2.8.3/flash_attn-2.8.3+cu12torch2.7cxx11abiTRUE-cp310-cp310-linux_x86_64.whl
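
Pinning flash-attn to a prebuilt release wheel avoids a long CUDA source build, but the wheel only works if the running environment matches the build tags encoded in its filename: CPython 3.10 (cp310), 64-bit Linux (linux_x86_64), CUDA 12 (cu12), torch 2.7, and the C++11 ABI (cxx11abiTRUE). The following is a minimal Python sketch, not part of the repository, that checks those tags against the current interpreter and torch build before installing; the interpretation of the tags is an assumption based on flash-attention's release naming convention.

    # Sketch: verify the pinned flash-attn wheel's build tags against this
    # environment. Tag meanings are inferred from the wheel filename; this
    # script is illustrative and not part of LocalAI.
    import platform
    import sys

    import torch

    WHEEL = "flash_attn-2.8.3+cu12torch2.7cxx11abiTRUE-cp310-cp310-linux_x86_64.whl"

    def environment_matches() -> bool:
        # cp310: wheel targets CPython 3.10.
        if sys.version_info[:2] != (3, 10):
            return False
        # linux_x86_64: wheel targets 64-bit Linux.
        if (platform.system(), platform.machine()) != ("Linux", "x86_64"):
            return False
        # cu12: torch must be a CUDA 12 build (torch.version.cuda is None on CPU builds).
        if not (torch.version.cuda or "").startswith("12"):
            return False
        # torch2.7: wheel was compiled against torch 2.7.
        if not torch.__version__.startswith("2.7"):
            return False
        # cxx11abiTRUE: wheel expects torch built with the C++11 ABI.
        return torch.compiled_with_cxx11_abi()

    if __name__ == "__main__":
        print(f"{WHEEL}: environment match = {environment_matches()}")

The requirements file itself is consumed by pip in the usual way, e.g. "pip install -r requirements-cublas12-after.txt"; pinning the full URL (rather than a bare "flash-attn==2.8.3") guarantees pip downloads this exact binary instead of attempting to compile the package from source.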