Difficulty in using LSTMs for text generation

This page summarizes the projects mentioned and recommended in the original post on /r/pytorch

Our great sponsors
  • WorkOS - The modern identity platform for B2B SaaS
  • InfluxDB - Power Real-Time Data Analytics at Scale
  • SaaSHub - Software Alternatives and Reviews
  • Python_NN

    A short NumPy library for training neural networks, intended for learning purposes.

  • About the issue with repeating characters, it is supposed to happen. There are certain loops that repeat. To solve this you must remove the line q1 = np.argmax(p1.cpu(), axis=1)[-1].item() and instead, do sampling based on the softmax probabilities. Check this code where I perform sampling. The vec = vec**(2) is used to decrease randomness you can see how your model works and either keep it or leave it.

  • WorkOS

    The modern identity platform for B2B SaaS. The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning.

    WorkOS logo
NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Suggest a related project

Related posts