PLOD-AbbreviationDetection
nlp
PLOD-AbbreviationDetection | nlp | |
---|---|---|
1 | 1 | |
9 | 0 | |
- | - | |
0.0 | 6.0 | |
over 1 year ago | 10 months ago | |
Jupyter Notebook | Jupyter Notebook | |
Creative Commons Attribution Share Alike 4.0 | - |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
PLOD-AbbreviationDetection
-
Clustering to find abbreviations
Finally, the main problem with unsupervised learning is that you won't be able to reliably measure system performance or improvement. In my view, any time you can spend annotating and collecting data for a (semi-)supervised solution will be well-spent. Existing datasets can also get you started with model development, such as https://github.com/surrey-nlp/PLOD-AbbreviationDetection. Once you have a good model on a conventional dataset, you should be able to start generalizing it to your specific task/dataset.
nlp
-
[N] State of GPT by Andrej karpathy in MSBuild 2023
GitHub: https://github.com/iliyaML/nlp/tree/main/microsoft-build-2023/state-of-gpt
What are some alternatives?
converse - Conversational text Analysis using various NLP techniques
ocrpy - OCR, Archive, Index and Search: Implementation agnostic OCR framework.
hate-speech-and-offensive-language - Repository for the paper "Automated Hate Speech Detection and the Problem of Offensive Language", ICWSM 2017
transformers-interpret - Model explainability that works seamlessly with 🤗 transformers. Explain your transformers model in just 2 lines of code.
ThoughtSource - A central, open resource for data and tools related to chain-of-thought reasoning in large language models. Developed @ Samwald research group: https://samwald.info/
pytorch-sentiment-analysis - Tutorials on getting started with PyTorch and TorchText for sentiment analysis.
browser-ml-inference - Edge Inference in Browser with Transformer NLP model
adaptnlp - An easy to use Natural Language Processing library and framework for predicting, training, fine-tuning, and serving up state-of-the-art NLP models.
tokenizers - 💥 Fast State-of-the-Art Tokenizers optimized for Research and Production