Skip to content

Navigation Menu

Explore
By company size
By use case
By industry
View all solutions
Topics
- AI
- DevOps
- Security
- Software Development
- View all
Explore
- GitHub Sponsors
  Fund open source developers
- The ReadME Project
  GitHub community articles
Repositories
- Enterprise platform
  AI-powered developer platform
Available add-ons
Pricing

Search code, repositories, users, issues, pull requests...

Search

Clear

Search syntax tips

Provide feedback

We read every piece of feedback, and take your input very seriously.

Include my email address so I can be contacted

Saved searches

Use saved searches to filter your results more quickly

Name

Query

To see all available qualifiers, see our documentation.

You signed in with another tab or window. Reload to refresh your session. You signed out in another tab or window. Reload to refresh your session. You switched accounts on another tab or window. Reload to refresh your session.

Dismiss alert

Dao-AILab / flash-attention Public

Notifications You must be signed in to change notification settings
Fork 1.4k
Star 15.2k

Code
Issues 646
Pull requests 48
Discussions
Actions
Projects
Security
Insights

Additional navigation options

Code
Issues
Pull requests
Discussions
Actions
Projects
Security
Insights

Pull requests: Dao-AILab/flash-attention

Labels 9 Milestones 0

Labels 9 Milestones 0

New pull request New

48 Open 199 Closed

48 Open 199 Closed

Author

Filter by author

Loading

Label

Filter by label

Loading

Use alt + click/return to exclude labels

or ⇧ + click/return for logical OR

Projects

Filter by project

Loading

Milestones

Filter by milestone

Loading

Reviews

Filter by reviews

No reviews Review required Approved review Changes requested

Assignee

Filter by who’s assigned

Assigned to nobody

Loading

Sort

Sort by

Newest Oldest Most commented Least commented Recently updated Least recently updated Best match

Most reactions

Pull requests list

Update Cuda Blackwell

#1463 opened Jan 24, 2025 by johnnynunez

Loading…

[BugFix] Fix a wrong reference to seqlen_k variable in the fwd_splitkv kernel

#1455 opened Jan 21, 2025 by muoshuosha

Loading…

1

Add missing tests/__init__.py

#1405 opened Dec 20, 2024 by BioGeek

Loading…

Support dedicated compile[For Research]

#1384 opened Dec 12, 2024 by AllenDou • Draft

Fix deprecation warnings

#1382 opened Dec 12, 2024 by rongou

Loading…

wrap func into torch ops to avoid torch.compile graphbreaks

#1333 opened Nov 13, 2024 by kumarkrishna

Loading…

Promote wheels as alternative to pip install flash-attn

#1297 opened Oct 25, 2024 by simonw

Loading…

4

fix: in newer versions of triton, tl.dot should take as input only q …

#1288 opened Oct 21, 2024 by EdouardYvinec

Loading…

flashattnvarlen support tree attention

#1188 opened Aug 30, 2024 by efsotr

Loading…

4

the test_flash_attn.py it's actually in parent directory

#1167 opened Aug 21, 2024 by ArtificialZeng

Loading…

Add support for qk hidden dim different from v hidden dim

#1166 opened Aug 20, 2024 by smallscientist1

Loading…

5

add softmax_d for mha_bwd

#1161 opened Aug 19, 2024 by MayDomine

Loading…

Fix: bwd may need to first allocate cuda mem for rng_state

#1077 opened Jul 20, 2024 by jundaf2

Loading…

Windows actions

#1036 opened Jul 9, 2024 by bdashore3

Loading…

3

change condition to num_heads >= num_heads_k

#1030 opened Jul 5, 2024 by xenshinu

Loading…

1

[Draft] support qk head_dim different from vo head_dim

#980 opened Jun 6, 2024 by defei-coder

Loading…

2

Fix +/-inf in LSE returned by forward

#978 opened Jun 3, 2024 by sgrigory

Loading…

3

add pyproject.toml with build dependencies

#958 opened May 17, 2024 by dhellmann

Loading…

Relative position encoding

#956 opened May 14, 2024 by b-albar

Loading…

1 of 4 tasks

1

ALiBi for the non-flash code path

#858 opened Feb 29, 2024 by Markus28

Loading…

Add local version identifier to package metadata for pre-built wheels

#856 opened Feb 28, 2024 by yundai424

Loading…

2

Add support for small page sizes

#824 opened Feb 13, 2024 by skrider

Loading…

Add C++ build support for use with LibTorch

#819 opened Feb 9, 2024 by shaltielshmid

Loading…

1

meta tensor stuff

#769 opened Jan 15, 2024 by tsengalb99

Loading…

Animations for Flash Attention, Flash Attention2, and Standard Attention

#736 opened Dec 24, 2023 by LuisAVasquez

Loading…

1

Previous 1 2 Next

Previous Next

ProTip! What’s not been updated in a month: updated:<2024-12-30.

Footer

© 2025 GitHub, Inc.

Footer navigation

Terms
Privacy
Security
Status
Docs
Contact

You can’t perform that action at this time.