azuredatastudio
refinery-sample-projects
azuredatastudio | refinery-sample-projects | |
---|---|---|
29 | 3 | |
7,445 | 24 | |
0.3% | - | |
9.8 | 0.0 | |
4 days ago | 9 months ago | |
TypeScript | ||
MIT License | Apache License 2.0 |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
azuredatastudio
refinery-sample-projects
-
How to fine-tune your embeddings for better similarity search
This blog post will share our experience with fine-tuning sentence embeddings on a commonly available dataset using similarity learning. We additionally explore how this could benefit the labeling workflow in the Kern AI refinery. To understand this post, you should know what embeddings are and how they are generated. A rough idea of what fine-tuning is also helps. All the code and data referenced in this post is available on GitHub.
-
Build your own stock sentiment classifier with Kern Refinery (video series)
Repository of the sample use case for the UI elements (optional).
-
Show HN: If VS Code had a data-centric IDE sibling, what would that look like?
Hey, I'm Johannes - one of the maintainers of refinery. Thanks Jonathan for sharing!!
Would be super excited if you guys have any feedback. It's nowhere near perfect yet, but you can already use it to build some great data-centric use cases. Amongst others for sentiment analysis, conversational AI or finetuning of your embeddings (which you can check out here: https://github.com/code-kern-ai/refinery-sample-projects).
Let me know what you think :)
What are some alternatives?
vscode-python - Python extension for Visual Studio Code
refinery - The data scientist's open-source choice to scale, assess and maintain natural language data. Treat training data like a software artifact.
beekeeper-studio - Modern and easy to use SQL client for MySQL, Postgres, SQLite, SQL Server, and more. Linux, MacOS, and Windows.
automl-docker - CLI-based tool to automatically build ML models from training data into a servable Docker container
TypeORM - ORM for TypeScript and JavaScript. Supports MySQL, PostgreSQL, MariaDB, SQLite, MS SQL Server, Oracle, SAP Hana, WebSQL databases. Works in NodeJS, Browser, Ionic, Cordova and Electron platforms.
Owlie - Owlie est un chatbot de soutien psychologique gratuit, disponible 24h/24 et 7j/7
vscode-sqltools - Database management for VSCode
rushstack - Monorepo for tools developed by the Rush Stack community
repo-templates - Default templates for Microsoft repos across all GitHub organizations: helping providing for collaborative communities, SECURITY.MD, Code of Conduct, and other files...
ippsample - IPP sample implementations.
grpc_bench - Various gRPC benchmarks