pipeline-docs-data-extractor
DataDreamer
pipeline-docs-data-extractor | DataDreamer | |
---|---|---|
1 | 5 | |
5 | 704 | |
- | 8.2% | |
7.8 | 8.6 | |
about 1 month ago | about 1 month ago | |
Python | Python | |
- | MIT License |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
pipeline-docs-data-extractor
DataDreamer
- FLaNK AI - 01 April 2024
- FLaNK Stack 26 February 2024
- FLaNK Stack Weekly 19 Feb 2024
- DataDreamer
-
Ask HN: What have you built with LLMs?
We've built a prompting, synthetic data generation, and training library called DataDreamer: https://github.com/datadreamer-dev/DataDreamer
What are some alternatives?
LLMCompiler - [ICML 2024] LLMCompiler: An LLM Compiler for Parallel Function Calling
tracecat - The open source Tines alternative. Automate security workflows at scale with code and no-code.
relevanceai - Home of the AI workforce - Multi-agent system, AI agents & tools
speedb - A RocksDB compliant high performance scalable embedded key-value store
spacy-llm - 🦙 Integrating LLMs into structured NLP pipelines
CML_AMP-to-Airgapped - Download the AMP catalog for an offline (airgapped) deployment of the AMP catalog.
autolabel - Label, clean and enrich text datasets with LLMs.
FLaNK-python-processors - Many processors
llm-client-sdk - SDK for using LLM
Hybrid-Net - Real-time audio source separation, generate lyrics, chords, beat.
MyScaleDB - An open-source, high-performance SQL vector database built on ClickHouse.