hyperparameter
spaCy
Our great sponsors
hyperparameter | spaCy | |
---|---|---|
7 | 106 | |
23 | 28,704 | |
- | 1.3% | |
6.9 | 9.2 | |
about 1 month ago | 9 days ago | |
Rust | Python | |
Apache License 2.0 | MIT License |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
hyperparameter
-
Hyper-parameter Optimization with Optuna and hyperparameter
the full tutorial: https://github.com/reiase/hyperparameter/tree/master/examples/optuna
-
Pythonic configuration framework?
When I was working on my own configuration framework (HyperParameter, previous post), I suddenly realize that what I want is not another configuration framework with some fancy API. All I want is to change my ML experiments without modifying the code and get rid of the configuration handling codes. The right way of configuration is not writing configurable code and wasting time on different frameworks. The best solution is a tool that makes your code configurable.
-
hyperparameter, a lightweight configuration framework
github: https://github.com/reiase/hyperparameter
-
HyperParameter for ML Models and Systems
HyperParameter is a configuration and parameter management library for Python. HyperParameter provides the following features:
-
What is the best practice for injecting configuration into a python application
you can take a look at https://github.com/reiase/hyperparameter, a scoped, thread-safe config object that is lightweight enough. There is no need to modify too much code:
-
[P] Modify Hyperparameters Easily
I'm developing a Hyperparameter tuning toolbox for my machine learning projects. It maps keyword arguments to hyper-parameters, for example:
-
A hyper-parameter toolbox for data-scientists and machine-learning engineers
I'm developing [a toolbox for managing hyper-parameters](https://github.com/reiase/hyperparameter) in my data science and machine learning projects. It provides object-style API for nested dict( which is very common for config files):
spaCy
-
Step by step guide to create customized chatbot by using spaCy (Python NLP library)
Hi Community, In this article, I will demonstrate below steps to create your own chatbot by using spaCy (spaCy is an open-source software library for advanced natural language processing, written in the programming languages Python and Cython):
-
Best AI SEO Tools for NLP Content Optimization
SpaCy: An open-source library providing tools for advanced NLP tasks like tokenization, entity recognition, and part-of-speech tagging.
-
Who has the best documentation you’ve seen or like in 2023
spaCy https://spacy.io/
-
A beginner’s guide to sentiment analysis using OceanBase and spaCy
In this article, I'm going to walk through a sentiment analysis project from start to finish, using open-source Amazon product reviews. However, using the same approach, you can easily implement mass sentiment analysis on your own products. We'll explore an approach to sentiment analysis with one of the most popular Python NLP packages: spaCy.
- Retrieval Augmented Generation (RAG): How To Get AI Models Learn Your Data & Give You Answers
-
Against LLM Maximalism
Spacy [0] is a state-of-art / easy-to-use NLP library from the pre-LLM era. This post is the Spacy founder's thoughts on how to integrate LLMs with the kind of problems that "traditional" NLP is used for right now. It's an advertisement for Prodigy [1], their paid tool for using LLMs to assist data labeling. That said, I think I largely agree with the premise, and it's worth reading the entire post.
The steps described in "LLM pragmatism" are basically what I see my data science friends doing — it's hard to justify the cost (money and latency) in using LLMs directly for all tasks, and even if you want to you'll need a baseline model to compare against, so why not use LLMs for dataset creation or augmentation in order to train a classic supervised model?
[0] https://spacy.io/
[1] https://prodi.gy/
- Swirl: An open-source search engine with LLMs and ChatGPT to provide all the answers you need 🌌
-
How to predict this sequence?
spaCy
-
What do you all think about (setq sentence-end-double-space nil)?
I chose spacy. Although it's not state of the art, it's very well established and stable.
- spaCy: Industrial-Strength Natural Language Processing
What are some alternatives?
towhee - Towhee is a framework that is dedicated to making neural data processing pipelines simple and fast.
TextBlob - Simple, Pythonic, text processing--Sentiment analysis, part-of-speech tagging, noun phrase extraction, translation, and more.
pytorch-lightning - Build high-performance AI models with PyTorch Lightning (organized PyTorch). Deploy models with Lightning Apps (organized Python to build end-to-end ML systems). [Moved to: https://github.com/Lightning-AI/lightning]
Stanza - Stanford NLP Python library for tokenization, sentence segmentation, NER, and parsing of many human languages
Dependency Injector - Dependency injection framework for Python
NLTK - NLTK Source
streamlit - Streamlit — A faster way to build and share data apps.
BERT-NER - Pytorch-Named-Entity-Recognition-with-BERT
keras - Deep Learning for humans [Moved to: https://github.com/keras-team/keras]
polyglot - Multilingual text (NLP) processing toolkit
lance - Modern columnar data format for ML and LLMs implemented in Rust. Convert from parquet in 2 lines of code for 100x faster random access, vector index, and data versioning. Compatible with Pandas, DuckDB, Polars, Pyarrow, with more integrations coming..
textacy - NLP, before and after spaCy