bert-sklearn
kruk
bert-sklearn | kruk | |
---|---|---|
1 | 1 | |
293 | 77 | |
- | - | |
0.0 | 5.9 | |
over 1 year ago | 3 months ago | |
Jupyter Notebook | Jupyter Notebook | |
Apache License 2.0 | Apache License 2.0 |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
bert-sklearn
-
Quick BERT Pre-Trained Model for Sentiment Analysis with Scikit Wrapper
Sckit-learn wrapper provided by Charles Nainan. GitHub of Scikit Learn BERT wrapper.
kruk
-
Aplaca dataset translated into polish [N] [R]
Somewhat related, there's also a Ukrainian translation of the Alpaca dataset. It comes with UAlpaca -- a LLaMA fine-tuned on this translated data, as well as on some other sources: https://github.com/robinhad/kruk https://huggingface.co/robinhad/ualpaca-7b-llama
What are some alternatives?
bert - TensorFlow code and pre-trained models for BERT
Local-LLM-Langchain - Load local LLMs effortlessly in a Jupyter notebook for testing purposes alongside Langchain or other agents. Contains Oobagooga and KoboldAI versions of the langchain notebooks with examples.
OpenAI-CLIP - Simple implementation of OpenAI CLIP model in PyTorch.
owca - The OWCA dataset is a polish translated dataset of instructions for fine-tuning the Alpaca model made by Stanford .
NLU-engine-prototype-benchmarks - Demo and benchmarks for building an NLU engine similar to those in voice assistants. Several intent classifiers are implemented and benchmarked. Conditional Random Fields (CRFs) are used for entity extraction.
KoAlpaca - KoAlpaca: 한국어 명령어를 이해하는 오픈소스 언어모델
fake-news - Building a fake news detector from initial ideation to model deployment
llama.py - Python bindings to llama.cpp
ABSA_Project_4 - This project takes advantange of the parsing and part of speech tagging capabilites of Spacy's pipeline in order to extract aspect/opinion/sentiment triplets. Cluster aspects using unsupervised learning to process sentiment for large amazon review datasets.
text-generation-webui-colab - A colab gradio web UI for running Large Language Models
tf-transformers - State of the art faster Transformer with Tensorflow 2.0 ( NLP, Computer Vision, Audio ).
stanford_alpaca - Code and documentation to train Stanford's Alpaca models, and generate the data.