Skip to content
This repository has been archived by the owner on Aug 7, 2024. It is now read-only.

Adding Float8 Linear variants supporting inference-only with lower overhead #283

Closed
wants to merge 1 commit into from

Commits on Jun 14, 2024

  1. Adding Float8 Linear variants

    Co-authored-by: Mauricio Serrano <[email protected]>
    cyang49 and mserranos committed Jun 14, 2024
    Configuration menu
    Copy the full SHA
    860527e View commit details
    Browse the repository at this point in the history