Windows AI Wheels
A curated collection of pre-compiled Python wheels for difficult-to-install AI/ML libraries on Windows.
Report a Broken Link
·
Request a New Wheel
Table of Contents
About The Project
This repository was created to address a common pain point for AI enthusiasts and developers on the Windows platform: building complex Python packages from source. Libraries like flash-attention, xformers are essential for high-performance AI tasks but often lack official pre-built wheels for Windows, forcing users into a complicated and error-prone compilation process.
The goal here is to provide a centralized, up-to-date collection of direct links to pre-compiled .whl files for these libraries, primarily for the ComfyUI community and other PyTorch users on Windows. This saves you time and lets you focus on what's important: creating amazing things with AI.
Find Windows AI Wheels
To make life even easier, you can use this page Find Windows AI Wheels for quick searches of the required packages.
Getting Started
Follow these simple steps to use the wheels from this repository.
Prerequisites
- Python for Windows: Ensure you have a compatible Python version installed (PyTorch currently supports Python 3.9 - 3.14 on Windows). You can get it from the official Python website.
Installation
To install a wheel, use pip with the direct URL to the .whl file. Make sure to enclose the URL in quotes.
# Example of installing a specific flash-attention wheel pip install "https://huggingface.co/lldacing/flash-attention-windows-wheel/blob/main/flash_attn-2.7.4.post1+cu128torch2.7.0cxx11abiFALSE-cp312-cp312-win_amd64.whl"
[!TIP] Find the package you need in the Available Wheels section below, find the row that matches your environment (Python, PyTorch, CUDA version), and copy the link for the
pip installcommand.
Available Wheels
Here is the list of tracked packages.
🛠 PyTorch
The foundation of everything. Install this first from the official source.
- Official Install Page: https://pytorch.org/get-started/locally/
For convenience, here are direct installation commands for specific versions on Linux/WSL with an NVIDIA GPU. For other configurations (CPU, macOS, ROCm), please use the official install page.
Stable Version (2.11.0)
This is the recommended version for most users.
| CUDA Version | Pip Install Command |
|---|---|
| CUDA 13.0 | pip install torch torchvision --index-url https://download.pytorch.org/whl/cu130 |
| CUDA 12.8 | pip install torch torchvision --index-url https://download.pytorch.org/whl/cu128 |
| CUDA 12.6 | pip install torch torchvision --index-url https://download.pytorch.org/whl/cu126 |
Previous Stable Version
Stable Version (2.10.0)
This is the recommended version for most users.
| CUDA Version | Pip Install Command |
|---|---|
| CUDA 13.0 | pip install "torch>=2.10.0.dev,<2.11.0" torchvision --index-url https://download.pytorch.org/whl/cu130 |
| CUDA 12.8 | pip install "torch>=2.10.0.dev,<2.11.0" torchvision --index-url https://download.pytorch.org/whl/cu128 |
| CUDA 12.6 | pip install "torch>=2.10.0.dev,<2.11.0" torchvision --index-url https://download.pytorch.org/whl/cu126 |
Previous Version (2.9.1)
| CUDA Version | Pip Install Command |
|---|---|
| CUDA 13.0 | pip install "torch>=2.9.0.dev,<2.10.0" torchvision --index-url https://download.pytorch.org/whl/cu130 |
| CUDA 12.8 | pip install "torch>=2.9.0.dev,<2.10.0" torchvision --index-url https://download.pytorch.org/whl/cu128 |
| CUDA 12.6 | pip install "torch>=2.9.0.dev,<2.10.0" torchvision --index-url https://download.pytorch.org/whl/cu126 |
Previous Stable Version (2.8.0)
| CUDA Version | Pip Install Command |
|---|---|
| CUDA 12.9 | pip install "torch>=2.8.0.dev,<2.9.0" torchvision --index-url https://download.pytorch.org/whl/cu129 |
| CUDA 12.8 | pip install "torch>=2.8.0.dev,<2.9.0" torchvision --index-url https://download.pytorch.org/whl/cu128 |
| CUDA 12.6 | pip install "torch>=2.8.0.dev,<2.9.0" torchvision --index-url https://download.pytorch.org/whl/cu126 |
Previous Stable Version (2.7.1)
| CUDA Version | Pip Install Command |
|---|---|
| CUDA 12.8 | pip install torch==2.7.1 torchvision==0.22.1 torchaudio==2.7.1 --index-url https://download.pytorch.org/whl/cu128 |
| CUDA 12.6 | pip install torch==2.7.1 torchvision==0.22.1 torchaudio==2.7.1 --index-url https://download.pytorch.org/whl/cu126 |
| CUDA 11.8 | pip install torch==2.7.1 torchvision==0.22.1 torchaudio==2.7.1 --index-url https://download.pytorch.org/whl/cu118 |
| CPU only | pip install torch==2.7.1 torchvision==0.22.1 torchaudio==2.7.1 --index-url https://download.pytorch.org/whl/cpu |
Nightly Versions
Use these for access to the latest features, but expect potential instability.
PyTorch 2.12 (Nightly)
| CUDA Version | Pip Install Command |
|---|---|
| CUDA 13.0 | pip install --pre torch torchvision --index-url https://download.pytorch.org/whl/nightly/cu130 |
| CUDA 12.8 | pip install --pre torch torchvision --index-url https://download.pytorch.org/whl/nightly/cu128 |
| CUDA 12.6 | pip install --pre torch torchvision --index-url https://download.pytorch.org/whl/nightly/cu126 |
▲▼▲▼▲▼▲▼▲▼▲▼▲▼▲▼▲▼▲▼▲▼▲▼▲▼▲▼▲
🛠 Torchaudio
| Package Version | PyTorch Ver | Python Ver | CUDA Ver | Download Link |
|---|---|---|---|---|
2.11.0a0 | 2.12.0 | 3.14 | 13.0 | Link |
2.11.0a0 | 2.12.0 | 3.13 | 13.0 | Link |
2.11.0a0 | 2.11.0 | 3.14 | 13.0 | Link |
2.11.0a0 | 2.11.0 | 3.13 | 13.0 | Link |
2.11.0a0 | 2.10.0 | 3.13 | 13.0 | Link |
2.11.0a0 | 2.10.0 | 3.12 | 13.0 | Link |
2.11.0a0 | 2.10.0 | 3.13 | 12.8 | Link |
2.8.0a0 | 2.9.0 | 3.12 | 12.8 | Link |
2.8.0a0 | 2.9.0 | 3.12 | 12.8 | Link |
# Torchcodec pip install torchcodec
▲▼▲▼▲▼▲▼▲▼▲▼▲▼▲▼▲▼▲▼▲▼▲▼▲▼▲▼▲
🛠 Flash Attention
High-performance attention implementation.
| Package Version | PyTorch Ver | Python Ver | CUDA Ver | CXX11 ABI | Download Link |
|---|---|---|---|---|---|
2.8.4 | 2.12.0 | 3.14 | 13.0 | ✓ | Link |
2.8.4 | 2.12.0 | 3.13 | 13.0 | ✓ | Link |
2.8.4 | 2.11.0 | 3.14 | 13.0 | ✓ | Link |
2.8.4 | 2.11.0 | 3.13 | 13.0 | ✓ | Link |
2.8.3 | 2.11.0 | 3.13 | 13.0 | ✓ | Link |
2.8.3 | 2.11.0 | 3.12 | 13.0 | ✓ | Link |
2.8.3 | 2.10.0 | 3.13 | 13.0 | ✓ | Link |
2.8.3 | 2.10.0 | 3.13 | 13.0 | ✓ | Link |
2.8.3 | 2.10.0 | 3.12 | 13.0 | ✓ | Link |
2.8.3 | 2.10.0 | 3.12 | 13.0 | ✓ | Link |
2.8.3 | 2.10.0 | 3.13 | 12.8 | ✓ | Link |
2.8.3 | 2.9.1 | 3.13 | 13.0 | ✓ | Link |
2.8.3 | 2.9.1 | 3.12 | 13.0 | ✓ | Link |
2.8.3 | 2.9.1 | 3.13 | 12.8 | ✓ | Link |
2.8.3 | 2.9.0 | 3.13 | 13.0 | ✓ | Link |
2.8.3 | 2.9.0 | 3.12 | 13.0 | ✓ | Link |
2.8.3 | 2.9.0 | 3.13 | 12.9 | ✓ | Link |
2.8.3 | 2.9.0 | 3.12 | 12.8 | ✓ | Link |
2.8.3 | 2.8.0 | 3.12 | 12.8 | ✓ | Link |
2.8.2 | 2.9.0 | 3.12 | 12.8 | ✓ | Link |
2.8.2 | 2.8.0 | 3.12 | 12.8 | ✓ | Link |
2.8.2 | 2.8.0 | 3.11 | 12.8 | ✓ | Link |
2.8.2 | 2.8.0 | 3.10 | 12.8 | ✓ | Link |
2.8.2 | 2.7.0 | 3.12 | 12.8 | ✗ | Link |
2.8.2 | 2.7.0 | 3.11 | 12.8 | ✗ | Link |
2.8.2 | 2.7.0 | 3.10 | 12.8 | ✗ | Link |
2.8.1 | 2.8.0 | 3.12 | 12.8 | ✓ | Link |
2.8.0.post2 | 2.8.0 | 3.12 | 12.8 | ✓ | Link |
2.7.4.post1 | 2.8.0 | 3.12 | 12.8 | ✓ | Link |
2.7.4.post1 | 2.8.0 | 3.10 | 12.8 | ✓ | Link |
2.7.4.post1 | 2.7.0 | 3.12 | 12.8 | ✗ | Link |
2.7.4.post1 | 2.7.0 | 3.11 | 12.8 | ✗ | Link |
2.7.4.post1 | 2.7.0 | 3.10 | 12.8 | ✗ | Link |
2.7.4 | 2.8.0 | 3.12 | 12.8 | ✓ | Link |
2.7.4 | 2.8.0 | 3.11 | 12.8 | ✓ | Link |
2.7.4 | 2.8.0 | 3.10 | 12.8 | ✓ | Link |
2.7.4 | 2.7.0 | 3.12 | 12.8 | ✗ | Link |
2.7.4 | 2.7.0 | 3.11 | 12.8 | ✗ | Link |
2.7.4 | 2.7.0 | 3.10 | 12.8 | ✗ | Link |
2.7.4 | 2.6.0 | 3.12 | 12.6 | ✗ | Link |
2.7.4 | 2.6.0 | 3.11 | 12.6 | ✗ | Link |
2.7.4 | 2.6.0 | 3.10 | 12.6 | ✗ | Link |
2.7.4 | 2.6.0 | 3.12 | 12.4 | ✗ | Link |
2.7.4 | 2.6.0 | 3.11 | 12.4 | ✗ | Link |
2.7.4 | 2.6.0 | 3.10 | 12.4 | ✗ | Link |
▲▼▲▼▲▼▲▼▲▼▲▼▲▼▲▼▲▼▲▼▲▼▲▼▲▼▲▼▲
🛠 Flash Attention 3
Next-generation Flash Attention with improved performance and features.
| Package Version | PyTorch Ver | Python Ver | CUDA Ver | CXX11 ABI | Download Link |
|---|---|---|---|---|---|
3.0.0 | 2.10 | 3.9+ | 13.0 | ✓ | Link |
3.0.0 | 2.10 | 3.9+ | 13.0 | ✓ | Link |
3.0.0 | 2.10 | 3.9+ | 12.8 | ✓ | Link |
3.0.0 | 2.10 | 3.9+ | 12.8 | ✓ | Link |
3.0.0 | 2.9 | 3.9+ | 13.0 | ✓ | Link |
3.0.0 | 2.9 | 3.9+ | 12.8 | ✓ | Link |
▲▼▲▼▲▼▲▼▲▼▲▼▲▼▲▼▲▼▲▼▲▼▲▼▲▼▲▼▲
🛠 Flash Attention 4
Latest Flash Attention implementation with cutting-edge optimizations.
(No wheels available - package not tracked)
▲▼▲▼▲▼▲▼▲▼▲▼▲▼▲▼▲▼▲▼▲▼▲▼▲▼▲▼▲
🛠 xformers
Another library for memory-efficient attention and other optimizations.
[!NOTE] PyTorch provides official pre-built wheels for xformers. You can often install it with
pip install xformers
| CUDA Version | Install |
|---|---|
| CUDA 12.6 | pip3 install -U xformers --index-url https://download.pytorch.org/whl/cu126 |
| CUDA 12.8 | pip3 install -U xformers --index-url https://download.pytorch.org/whl/cu128 |
| CUDA 13.0 | pip3 install -U xformers --index-url https://download.pytorch.org/whl/cu130 |
ABI3 version, any Python 3.9-3.12
| Package Version | PyTorch Ver | Python Ver | CUDA Ver | Download Link |
|---|---|---|---|---|
0.0.34 | 2.11 | 3.9+ | 13.0 | Link |
0.0.34 | 2.10 | 3.9+ | 13.0 | Link |
0.0.34 | 2.10 | 3.9+ | 13.0 | Link |
0.0.33 | 2.10 | 3.9+ | 13.0 | Link |
0.0.33 | 2.9 | 3.9+ | 13.0 | Link |
0.0.32.post2 | 2.8.0 | 3.9+ | 12.9 | Link |
0.0.32.post2 | 2.8.0 | 3.9+ | 12.8 | Link |
0.0.32.post2 | 2.8.0 | 3.9+ | 12.6 | Link |
▲▼▲▼▲▼▲▼▲▼▲▼▲▼▲▼▲▼▲▼▲▼▲▼▲▼▲▼▲
🛠 SageAttention
| Package Version | PyTorch Ver | Python Ver | CUDA Ver | Download Link |
|---|---|---|---|---|
2.1.1 | 2.8.0 | 3.12 | 12.8 | Link |
2.1.1 | 2.7.0 | 3.10 | 12.8 | Link |
2.1.1 | 2.6.0 | 3.13 | 12.6 | Link |
2.1.1 | 2.6.0 | 3.12 | 12.6 | Link |
2.1.1 | 2.6.0 | 3.12 | 12.6 | Link |
2.1.1 | 2.6.0 | 3.11 | 12.6 | Link |
2.1.1 | 2.6.0 | 3.10 | 12.6 | Link |
2.1.1 | 2.6.0 | 3.9 | 12.6 | Link |
2.1.1 | 2.5.1 | 3.12 | 12.4 | Link |
2.1.1 | 2.5.1 | 3.11 | 12.4 | Link |
2.1.1 | 2.5.1 | 3.10 | 12.4 | Link |
2.1.1 | 2.5.1 | 3.9 | 12.4 | Link |
◇ ◇ ◇ ◇ ◇ ◇ ◇ ◇ ◇ ◇ ◇ ◇ ◇ ◇ ◇ ◇ ◇
🛠 SageAttention 2.2 (SageAttention2++)
[!NOTE] Only supports CUDA >= 12.8, therefore PyTorch >= 2.7.
| Package Version | PyTorch Ver | Python Ver | CUDA Ver | Download Link |
|---|---|---|---|---|
2.2.0.post4 | 2.9.0+ | 3.9+ | 13.0 | Link |
2.2.0.post4 | 2.9.0+ | 3.9+ | 12.8 | Link |
2.2.0.post3 | 2.10.0 | 3.12 | 13.0 | Link |
2.2.0.post3 | 2.10.0 | 3.13 | 12.8 | Link |
2.2.0.post3 | 2.10.0 | 3.12 | 12.8 | Link |
2.2.0.post3 | 2.9.0 | 3.13 | 13.0 | Link |
2.2.0.post3 | 2.9.0 | 3.13 | 12.9 | Link |
2.2.0.post3 | 2.9.0 | 3.9+ | 12.9 | Link |
2.2.0.post3 | 2.9.0 | 3.13 | 12.8 | Link |
2.2.0.post3 | 2.9.0 | 3.9+ | 12.8 | Link |
2.2.0.post3 | 2.8.0 | 3.13 | 12.9 | Link |
2.2.0.post3 | 2.8.0 | 3.9+ | 12.9 | Link |
2.2.0.post3 | 2.8.0 | 3.13 | 12.8 | Link |
2.2.0.post3 | 2.8.0 | 3.9+ | 12.8 | Link |
2.2.0.post3 | 2.7.1 | 3.9+ | 12.8 | Link |
2.2.0.post3 | 2.6.0 | 3.9+ | 12.6 | Link |
2.2.0.post3 | 2.5.1 | 3.9+ | 12.4 | Link |
2.2.0.post2 | 2.9.0 | 3.9+ | 12.8 | Link |
2.2.0.post2 | 2.8.0 | 3.9+ | 12.8 | Link |
2.2.0.post2 | 2.7.1 | 3.9+ | 12.8 | Link |
2.2.0.post2 | 2.6.0 | 3.9+ | 12.6 | Link |
2.2.0.post2 | 2.5.1 | 3.9+ | 12.4 | Link |
2.2.0 | 2.8.0 | 3.13 | 12.8 | Link |
2.2.0 | 2.8.0 | 3.12 | 12.8 | Link |
2.2.0 | 2.8.0 | 3.11 | 12.8 | Link |
2.2.0 | 2.8.0 | 3.10 | 12.8 | Link |
2.2.0 | 2.8.0 | 3.9 | 12.8 | Link |
2.2.0 | 2.7.1 | 3.13 | 12.8 | Link |
2.2.0 | 2.7.1 | 3.12 | 12.8 | Link |
2.2.0 | 2.7.1 | 3.11 | 12.8 | Link |
2.2.0 | 2.7.1 | 3.10 | 12.8 | Link |
2.2.0 | 2.7.1 | 3.9 | 12.8 | Link |
🛠 SageAttention 3
| Package Version | PyTorch Ver | Python Ver | CUDA Ver | Download Link |
|---|---|---|---|---|
1.0.0 | 2.9.1 | 3.13 | 13.0 | Link |
1.0.0 | 2.9.1 | 3.12 | 13.0 | Link |
1.0.0 | 2.8.0 | 3.13 | 12.8 | Link |
1.0.0 | 2.8.0 | 3.12 | 12.8 | Link |
1.0.0 | 2.8.0 | 3.11 | 12.8 | Link |
1.0.0 | 2.7.1 | 3.13 | 12.8 | Link |
1.0.0 | 2.7.1 | 3.12 | 12.8 | Link |
1.0.0 | 2.7.1 | 3.11 | 12.8 | Link |
▲▼▲▼▲▼▲▼▲▼▲▼▲▼▲▼▲▼▲▼▲▼▲▼▲▼▲▼▲
🛠 Nunchaku
- Official Repo: : mit-han-lab/nunchaku
| Package Version | PyTorch Ver | Python Ver | Download Link |
|---|---|---|---|
1.2.0 | 2.11 | 3.13 | Link |
1.2.0 | 2.11 | 3.12 | Link |
1.2.0 | 2.11 | 3.11 | Link |
1.2.0 | 2.11 | 3.10 | Link |
1.2.0 | 2.9 | 3.13 | Link |
1.2.0 | 2.9 | 3.12 | Link |
1.2.0 | 2.9 | 3.11 | Link |
1.2.0 | 2.9 | 3.10 | Link |
1.2.0 | 2.8 | 3.13 | Link |
1.2.0 | 2.8 | 3.12 | Link |
1.2.0 | 2.8 | 3.11 | Link |
1.2.0 | 2.7 | 3.13 | Link |
1.2.0 | 2.7 | 3.12 | Link |
1.2.0 | 2.7 | 3.11 | Link |
1.0.2 | 2.10 | 3.13 | Link |
1.0.2 | 2.10 | 3.12 | Link |
1.0.2 | 2.10 | 3.11 | Link |
1.0.2 | 2.10 | 3.10 | Link |
1.0.2 | 2.9 | 3.13 | Link |
1.0.2 | 2.9 | 3.12 | Link |
1.0.2 | 2.9 | 3.11 | Link |
1.0.2 | 2.9 | 3.10 | Link |
1.0.2 | 2.8 | 3.13 | Link |
1.0.2 | 2.8 | 3.12 | Link |
1.0.2 | 2.8 | 3.11 | Link |
1.0.2 | 2.8 | 3.10 | Link |
1.0.2 | 2.7 | 3.13 | Link |
1.0.2 | 2.7 | 3.12 | Link |
1.0.2 | 2.7 | 3.11 | Link |
1.0.2 | 2.7 | 3.10 | Link |
1.0.1 | 2.10 | 3.13 | Link |
1.0.1 | 2.10 | 3.12 | Link |
1.0.1 | 2.10 | 3.11 | Link |
1.0.1 | 2.10 | 3.10 | Link |
1.0.1 | 2.9 | 3.13 | Link |
1.0.1 | 2.9 | 3.13 | Link |
1.0.1 | 2.9 | 3.12 | Link |
1.0.1 | 2.9 | 3.12 | Link |
1.0.1 | 2.8 | 3.13 | Link |
1.0.1 | 2.8 | 3.13 | Link |
1.0.1 | 2.8 | 3.12 | Link |
1.0.1 | 2.8 | 3.11 | Link |
1.0.1 | 2.8 | 3.10 | Link |
1.0.1 | 2.7 | 3.13 | Link |
1.0.1 | 2.7 | 3.12 | Link |
1.0.1 | 2.7 | 3.11 | Link |
1.0.1 | 2.7 | 3.10 | Link |
1.0.1 | 2.6 | 3.13 | Link |
1.0.1 | 2.6 | 3.12 | Link |
1.0.1 | 2.6 | 3.11 | Link |
1.0.1 | 2.6 | 3.10 | Link |
1.0.1 | 2.5 | 3.12 | Link |
1.0.1 | 2.5 | 3.11 | Link |
1.0.1 | 2.5 | 3.10 | Link |
1.0.0 | 2.9 | 3.13 | Link |
1.0.0 | 2.9 | 3.12 | Link |
1.0.0 | 2.9 | 3.11 | Link |
1.0.0 | 2.9 | 3.10 | Link |
1.0.0 | 2.8 | 3.13 | Link |
1.0.0 | 2.8 | 3.12 | Link |
1.0.0 | 2.8 | 3.11 | Link |
1.0.0 | 2.8 | 3.10 | Link |
1.0.0 | 2.7 | 3.13 | Link |
1.0.0 | 2.7 | 3.12 | Link |
1.0.0 | 2.7 | 3.11 | Link |
1.0.0 | 2.7 | 3.10 | Link |
1.0.0 | 2.6 | 3.13 | Link |
1.0.0 | 2.6 | 3.12 | Link |
1.0.0 | 2.6 | 3.11 | Link |
1.0.0 | 2.6 | 3.10 | Link |
1.0.0 | 2.5 | 3.12 | Link |
1.0.0 | 2.5 | 3.11 | Link |
1.0.0 | 2.5 | 3.10 | Link |
0.3.2 | 2.9 | 3.12 | Link |
0.3.2 | 2.8 | 3.12 | Link |
0.3.2 | 2.8 | 3.11 | Link |
0.3.2 | 2.8 | 3.10 | Link |
0.3.2 | 2.7 | 3.12 | Link |
0.3.2 | 2.7 | 3.11 | Link |
0.3.2 | 2.7 | 3.10 | Link |
0.3.2 | 2.6 | 3.12 | Link |
0.3.2 | 2.6 | 3.11 | Link |
0.3.2 | 2.6 | 3.10 | Link |
0.3.2 | 2.5 | 3.12 | Link |
0.3.2 | 2.5 | 3.11 | Link |
0.3.2 | 2.5 | 3.10 | Link |
▲▼▲▼▲▼▲▼▲▼▲▼▲▼▲▼▲▼▲▼▲▼▲▼▲▼▲▼▲
🛠 NATTEN
Neighborhood Attention Transformer.
| Package Version | PyTorch Ver | Python Ver | CUDA Ver | Download Link |
|---|---|---|---|---|
0.17.5 | 2.7.0 | 3.12 | 12.8 | Link |
0.17.5 | 2.7.0 | 3.11 | 12.8 | Link |
0.17.5 | 2.7.0 | 3.10 | 12.8 | Link |
0.17.5 | 2.6.0 | 3.12 | 12.6 | Link |
0.17.5 | 2.6.0 | 3.11 | 12.6 | Link |
0.17.5 | 2.6.0 | 3.10 | 12.6 | Link |
0.17.3 | 2.5.1 | 3.12 | 12.4 | Link |
0.17.3 | 2.5.1 | 3.11 | 12.4 | Link |
0.17.3 | 2.5.1 | 3.10 | 12.4 | Link |
0.17.3 | 2.5.0 | 3.12 | 12.4 | Link |
0.17.3 | 2.5.0 | 3.11 | 12.4 | Link |
0.17.3 | 2.5.0 | 3.10 | 12.4 | Link |
0.17.3 | 2.4.1 | 3.12 | 12.4 | Link |
0.17.3 | 2.4.1 | 3.11 | 12.4 | Link |
0.17.3 | 2.4.1 | 3.10 | 12.4 | Link |
0.17.3 | 2.4.0 | 3.12 | 12.4 | Link |
0.17.3 | 2.4.0 | 3.11 | 12.4 | Link |
0.17.3 | 2.4.0 | 3.10 | 12.4 | Link |
▲▼▲▼▲▼▲▼▲▼▲▼▲▼▲▼▲▼▲▼▲▼▲▼▲▼▲▼▲
🛠 Triton (Windows Fork)
Triton is a language and compiler for writing highly efficient custom deep-learning primitives. Not officially supported on Windows, but a fork provides pre-built wheels.
Supported GPUs:
[!NOTE] Different GPU architectures require different Triton versions due to compute capability support.
| Triton Version | Supported GPUs | Compute Capability |
|---|---|---|
3.6.x | RTX 50xx (Blackwell), RTX 40xx, Ada Lovelace, Hopper | SM 8.9, 9.0, 10.0 |
3.5.x | RTX 30xx, 40xx, Ada Lovelace, Hopper | SM 8.0, 8.9, 9.0 |
3.4.x | RTX 20xx, 30xx, 40xx, Ada Lovelace, Hopper | SM 7.5, 8.0, 8.9, 9.0 |
<= 3.2.x | GTX/RTX 16xx, RTX 20xx, 30xx, 40xx, Ada Lovelace, Hopper | SM 7.0, 7.5, 8.0, 8.9, 9.0 |
Installation:
| Package Version | PyTorch Ver | Compute Capability | Install |
|---|---|---|---|
3.6.x | >= 2.9 | SM 8.9+ | pip install -U "triton-windows<3.7" |
3.5.x | >= 2.9 | SM 8.0+ | pip install -U "triton-windows<3.6" |
3.4.x | >= 2.8 | SM 7.5+ | pip install -U "triton-windows<3.5" |
Python libs:
[!IMPORTANT] Triton requires additional Python development libraries for building CUDA kernels. Download the package matching your Python version, extract the ZIP file, and copy the
includeandlibsfolders to your Python installation directory.
| Python Ver | Download |
|---|---|
3.13 | Link |
3.12 | Link |
3.11 | Link |
3.10 | Link |
3.9 | Link |
▲▼▲▼▲▼▲▼▲▼▲▼▲▼▲▼▲▼▲▼▲▼▲▼▲▼▲▼▲
🛠 bitsandbytes
A lightweight wrapper around CUDA custom functions, particularly for 8-bit optimizers, matrix multiplication (LLM.int8()), and quantization functions.
▲▼▲▼▲▼▲▼▲▼▲▼▲▼▲▼▲▼▲▼▲▼▲▼▲▼▲▼▲
🛠 RadialAttention for ComfyUI
▲▼▲▼▲▼▲▼▲▼▲▼▲▼▲▼▲▼▲▼▲▼▲▼▲▼▲▼▲
🛠 SpargeAttn
| Package Version | PyTorch Ver | CUDA Ver | Download Link |
|---|---|---|---|
0.1.0.post1 | 2.8.0 | 12.8 | Link |
0.1.0.post1 | 2.7.1 | 12.8 | Link |
▲▼▲▼▲▼▲▼▲▼▲▼▲▼▲▼▲▼▲▼▲▼▲▼▲▼▲▼▲
🛠 Block Sparse Attention
| Package Version | PyTorch Ver | Python Ver | CUDA Ver | Download Link |
|---|---|---|---|---|
0.0.2.post1 | 2.11 | 3.13 | 13.0 | Link |
0.0.2.post1 | 2.10 | 3.13 | 13.0 | Link |
0.0.2.post1 | 2.9.1 | 3.13 | 13.0 | Link |
▲▼▲▼▲▼▲▼▲▼▲▼▲▼▲▼▲▼▲▼▲▼▲▼▲▼▲▼▲
🛠 DeepSpeed
- A deep learning optimization library
- Official Repo: https://github.com/deepspeedai/DeepSpeed
| Package Version | Python Ver | Download Link |
|---|---|---|
0.18.6 | 3.13 | Link |
▲▼▲▼▲▼▲▼▲▼▲▼▲▼▲▼▲▼▲▼▲▼▲▼▲▼▲▼▲
🛠 Fairseq
- Facebook AI Research Sequence-to-Sequence Toolkit
- Official Repo: https://github.com/facebookresearch/fairseq
| Package Version | Python Ver | Download Link |
|---|---|---|
0.12.2 | 3.13 | Link |
▲▼▲▼▲▼▲▼▲▼▲▼▲▼▲▼▲▼▲▼▲▼▲▼▲▼▲▼▲
🛠 causal_conv1d
| Package Version | PyTorch Ver | Python Ver | CUDA Ver | CXX11 ABI | Download Link |
|---|---|---|---|---|---|
1.6.1 | 2.11.0 | 3.14 | 13.0 | ✓ | Link |
1.6.1 | 2.11.0 | 3.13 | 13.0 | ✓ | Link |
1.6.1 | 2.10.0 | 3.13 | 13.0 | ✓ | Link |
▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀
🌐 Accessing Data Programmatically (wheels.json)
All wheel information in this repository is managed in the wheels.json file, which serves as the single source of truth. The tables in this README are automatically generated from this file.
This provides a stable, structured JSON endpoint for any external tool or application that needs to access this data without parsing Markdown.
➤ How to Use
You can access the raw JSON file directly via the following URL:
https://raw.githubusercontent.com/wildminder/AI-windows-whl/main/wheels.json
Example using curl:
curl -L -o wheels.json https://raw.githubusercontent.com/wildminder/AI-windows-whl/main/wheels.json
The file contains a list of packages, each with its metadata and an array of wheels, where each wheel object contains version details and a direct download url.
▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀
➤ Contributing
Contributions are what make the open source community such an amazing place to learn, inspire, and create. Any contributions you make are greatly appreciated.
If you have found a new pre-built wheel or a reliable source, please fork the repo and create a pull request, or simply open an issue with the link.
➤ Acknowledgments
This repository is simply a collection of links. Huge thanks to the individuals and groups who do the hard work of building and hosting these wheels for the community: