[D] PyTorch 2.0 Native Flash Attention 32k Context Window

This page summarizes the projects mentioned and recommended in the original post on /r/MachineLearning.

  • PyTorch

    Tensors and Dynamic neural networks in Python with strong GPU acceleration

  • You might look into https://github.com/pytorch/pytorch/pull/95793.

  • ChatGPTCLIBot

    ChatGPT Bot in CLI with long term memory support using Embeddings.

  • longformer

    Longformer: The Long-Document Transformer

  • This approach has been shown to work. Longformer even provided a script that did this for you: https://github.com/allenai/longformer/blob/master/scripts/convert_model_to_long.ipynb
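
The linked notebook is not reproduced here. As a rough illustration only, and assuming "this approach" refers to extending a pretrained model's learned position embeddings by repeating them until they cover a longer context, the idea can be sketched in plain Python. The function name and the toy data are hypothetical and are not taken from the Longformer code.

```python
def extend_position_embeddings(pos_embed, new_max_pos):
    """Tile learned position embeddings until they cover new_max_pos rows.

    pos_embed: list of rows, one embedding vector per original position.
    new_max_pos: number of positions the extended table should hold.
    (Hypothetical helper for illustration; not the Longformer API.)
    """
    if not pos_embed:
        raise ValueError("pos_embed must contain at least one row")
    old_max_pos = len(pos_embed)
    extended = []
    for k in range(new_max_pos):
        # Position k reuses the embedding learned for position k mod old_max_pos.
        extended.append(list(pos_embed[k % old_max_pos]))
    return extended


# Toy example: 4 learned positions with 2-dimensional embeddings, extended to 10.
old = [[0.0, 0.1], [1.0, 1.1], [2.0, 2.1], [3.0, 3.1]]
new = extend_position_embeddings(old, 10)
```

Copying the existing rows gives the longer model a starting point that matches the original model on short inputs. Under this assumption, the extended model would typically still need further training on long sequences.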

NOTE: The number of mentions on this list counts mentions in common posts plus user-suggested alternatives. Hence, a higher number means a more popular project.
