cappr
naturalspeech2-pytorch
cappr | naturalspeech2-pytorch | |
---|---|---|
4 | 1 | |
63 | 1,207 | |
- | - | |
9.4 | 8.3 | |
2 months ago | 8 months ago | |
Python | Python | |
Apache License 2.0 | MIT License |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
cappr
-
Introducing CAPPr: a package to easily perform text classification using OpenAI models
GitHub
-
[P] CAPPr: use OpenAI or HuggingFace models to easily do zero-shot text classification
While benchmarking this method on the infamous Winograd Schema Challenge, I ended up finding a 2018 paper1 w/ pretty much the same idea as CAPPr. The only difference is that CAPPr typically transposes that probability, and it naively incorporates a prior.
-
Introducing CAPPr: use OpenAI or HuggingFace models to easily do zero-shot text classification
GitHub: https://github.com/kddubey/cappr
naturalspeech2-pytorch
What are some alternatives?
MAGIC - Language Models Can See: Plugging Visual Controls in Text Generation
stable-karlo - Upscaling Karlo text-to-image generation using Stable Diffusion v2.
zshot - Zero and Few shot named entity & relationships recognition
TTS - πΈπ¬ - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
zeroshot_topics - Topic Inference with Zeroshot models
NeMo - A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
basaran - Basaran is an open-source alternative to the OpenAI text completion API. It provides a compatible streaming API for your Hugging Face Transformers-based text generation models.
DiffSinger - DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism (SVS & TTS); AAAI 2022; Official code
llm-client-sdk - SDK for using LLM
espnet - End-to-End Speech Processing Toolkit