-
refinery
The data scientist's open-source choice to scale, assess and maintain natural language data. Treat training data like a software artifact.
-
InfluxDB
Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
Just clone the repo QDrant-NLP and run: docker-compose up I would like to increase the number of datasets this can be tried on, either with GPU backed lambda functions or by saving many example datasets to S3. So far I've only made a 6K subset of ag_news available. ag_news · Datasets at Hugging Face This is the code snippet used to generate the embeddings via hugging-face:
Shout out to both Kern.AI (an excellent open-source NLP labelling tool) https://github.com/code-kern-ai/refinery and Voxel51 (an excellent open-source Computer Vision analysis tool) https://github.com/voxel51/fiftyone for being early adopters of the technology in their platforms, but I don't believe either have yet made use of all of the value it can provide.
Shout out to both Kern.AI (an excellent open-source NLP labelling tool) https://github.com/code-kern-ai/refinery and Voxel51 (an excellent open-source Computer Vision analysis tool) https://github.com/voxel51/fiftyone for being early adopters of the technology in their platforms, but I don't believe either have yet made use of all of the value it can provide.