awesome-ai-safety
TileDB-Vector-Search
 | awesome-ai-safety | TileDB-Vector-Search
---|---|---
Mentions | 5 | 3
Stars | 140 | 46
Growth | 9.3% | -
Activity | 5.6 | 9.6
Last commit | 7 months ago | 1 day ago
Language | Jupyter Notebook |
License | Apache License 2.0 | MIT License
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
awesome-ai-safety
- Ask HN: Who is hiring? (October 2023)
Giskard - Testing framework for ML models | Multiple roles | Full-time | France | https://giskard.ai/
We are building the first collaborative & open-source Quality Assurance platform for all ML models - including Large Language Models.
Founded in 2021 in Paris by ex-Dataiku engineers, we are an emerging player in the fast-growing market of AI Quality & Safety.
Giskard helps Data Scientists & ML Engineering teams collaborate to evaluate, test & monitor AI models. We help organizations increase the efficiency of their AI development workflow, eliminate risks of AI biases and ensure robust, reliable & ethical AI models. Our open-source platform is used by dozens of ML teams across industries, both at enterprise companies & startups.
In 2022, we raised our first round of 1.5 million euros, led by Elaia, with participation from Bessemer and notable angel investors including the CTO of Hugging Face. To read more about this fundraising and how it will accelerate our growth, you can read this announcement: https://www.giskard.ai/knowledge/news-fundraising-2022
In 2023, we received a strategic investment from the European Commission to build a SaaS platform to automate compliance with the upcoming EU AI regulation. You can read more here: https://www.giskard.ai/knowledge/1-000-github-stars-3meu-and...
We are assembling a team of champions (Software Engineers, Machine Learning researchers, and Data Scientists) to build our AI Quality platform and expand it to new types of AI models and industries. We have a culture of continuous learning & quality, and we help each other achieve high standards & goals!
We aim to grow from 15 to 25 people in the next 12 months. We're hiring the following roles:
- Ask HN: Who is hiring? (August 2023)
Giskard - Testing framework for ML models | Multiple roles | Full-time | France | https://giskard.ai/
We are building the first collaborative & open-source Quality Assurance platform for all ML models - including Large Language Models.
Founded in 2021 in Paris by ex-Dataiku engineers, we are an emerging player in the fast-growing market of AI Safety & Security.
Giskard helps Data Scientists & ML Engineering teams collaborate to evaluate, test & monitor AI models. We help organizations increase the efficiency of their AI development workflow, eliminate risks of AI biases and ensure robust, reliable & ethical AI models. Our open-source platform is used by dozens of ML teams across industries, both at enterprise companies & startups.
In 2022, we raised our first round of 1.5 million euros, led by Elaia, with participation from Bessemer and notable angel investors including the CTO of Hugging Face. To read more about this fundraising and how it will accelerate our growth, you can read this announcement: https://www.giskard.ai/knowledge/news-fundraising-2022
In 2023, we received a strategic investment from the European Commission to build a SaaS platform to automate compliance with the upcoming EU AI regulation. You can read more here: https://www.giskard.ai/knowledge/1-000-github-stars-3meu-and...
We are assembling a team of champions (Software Engineers, Machine Learning researchers, and Data Scientists) to build our AI Quality platform and expand it to new types of AI models and industries. We have a culture of continuous learning & quality, and we help each other achieve high standards & goals!
We aim to grow from 15 to 25 people in the next 12 months. We're hiring the following roles:
* Software Engineer - https://apply.workable.com/giskard/j/AD2C90B581/ (Python, Java, Typescript, Vue.js, Cloud skills)
* Machine Learning Researcher - https://apply.workable.com/giskard/j/E89FE8E310/ (post-PhD)
* Data Science lead - https://apply.workable.com/giskard/j/E89FE8E310/ (ML + consulting experience required)
* Growth marketing intern - https://apply.workable.com/giskard/j/C8635E9B0C/
* Data Science intern - https://apply.workable.com/giskard/j/7F0B341852/
- Show HN: Python library to scan ML models for vulnerabilities
Hi! I’ve been working on this automatic scanner for ML models to detect issues like underperforming data slices, overconfidence in predictions, robustness problems, and others. It supports all main Python ML frameworks (sklearn, torch, xgboost, …) and integrates with the quality assurance solution we are building at Giskard AI (https://giskard.ai) to systematically test models before putting them in production.
It is still in beta, and I would love to hear your feedback if you have the time to try it out.
We have quite a few tutorials in the docs with ready-made Colab notebooks to make it easy to get started.
If you are interested in the code:
https://github.com/Giskard-AI/giskard/tree/main/python-clien...
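To make "underperforming data slices" concrete, here is a minimal sketch in plain Python. It is not Giskard's implementation or API; the function name `find_underperforming_slices`, the `min_size` and `tolerance` parameters, and the toy rows are all assumptions made for illustration. The idea is simply to compare accuracy within each value of a feature against overall accuracy and flag the groups that fall noticeably behind.

```python
from collections import defaultdict


def find_underperforming_slices(rows, feature, min_size=2, tolerance=0.1):
    """Flag values of `feature` whose accuracy trails overall accuracy.

    Hypothetical helper for illustration only (not part of any library).
    Each row is a dict with "label", "prediction", and the feature column.
    """
    overall = sum(r["label"] == r["prediction"] for r in rows) / len(rows)

    # Group per-row correctness by the value of the chosen feature.
    groups = defaultdict(list)
    for r in rows:
        groups[r[feature]].append(r["label"] == r["prediction"])

    flagged = {}
    for value, hits in groups.items():
        if len(hits) < min_size:
            continue  # too few samples to say anything meaningful
        accuracy = sum(hits) / len(hits)
        if overall - accuracy > tolerance:
            flagged[value] = accuracy
    return overall, flagged


# Toy data: the model does well on "FR" rows and poorly on "US" rows.
rows = [
    {"country": "FR", "label": 1, "prediction": 1},
    {"country": "FR", "label": 0, "prediction": 0},
    {"country": "FR", "label": 1, "prediction": 1},
    {"country": "US", "label": 1, "prediction": 0},
    {"country": "US", "label": 0, "prediction": 1},
    {"country": "US", "label": 1, "prediction": 1},
]
overall, flagged = find_underperforming_slices(rows, "country")
print(overall, flagged)
```

A real scanner would presumably search over many features and numeric ranges and apply statistical tests; this sketch only shows the core comparison for one categorical feature.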
- [R] Awesome AI Safety – A curated list of papers & technical articles on AI Quality & Safety
Repository: https://github.com/Giskard-AI/awesome-ai-safety
- AI Safety – curated papers for safer, ethical, and reliable AI
TileDB-Vector-Search
- Ask HN: Who is hiring? (September 2023)
  - vector search, utilizing TileDB and TileDB Cloud for seamless scaling: https://tiledb.com/blog/why-tiledb-as-a-vector-database (library: https://github.com/TileDB-Inc/TileDB-Vector-Search)
- Why TileDB as a Vector Database
Stavros from TileDB here (Founder and CEO). I thought of requesting some feedback from the community on this blog. It was only natural for a multi-dimensional array database like TileDB to offer vector (i.e., 1D array) search capabilities. But the team managed to do it very well and the results surprised us. We are just getting started in this domain and a lot of new algorithms and features are coming up, but the sooner we get feedback the better.
TileDB-Vector-Search GitHub repo: https://github.com/TileDB-Inc/TileDB-Vector-Search
TileDB-Embedded (core array engine) GitHub repo: https://github.com/TileDB-Inc/TileDB
TileDB 101: Vector Search (blog to get kickstarted): https://tiledb.com/blog/tiledb-101-vector-search/
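For readers new to the term, vector search means finding the stored vectors closest to a query vector. The sketch below is a plain-Python, brute-force version using Euclidean distance. It does not use the TileDB-Vector-Search API and does not reflect its algorithms; the function name `nearest_neighbors` and the sample vectors are assumptions for illustration only.

```python
import heapq
import math


def nearest_neighbors(query, vectors, k=2):
    """Return the k closest vectors as (distance, index) pairs.

    Exhaustive (exact) search: every stored vector is compared
    against the query, so cost grows linearly with the collection.
    """
    scored = ((math.dist(query, v), i) for i, v in enumerate(vectors))
    return heapq.nsmallest(k, scored)


# Hypothetical 2-D vectors; real embeddings have hundreds of dimensions.
vectors = [(0.0, 0.0), (1.0, 1.0), (5.0, 5.0), (0.9, 1.2)]
results = nearest_neighbors((1.0, 1.0), vectors, k=2)
for distance, index in results:
    print(index, round(distance, 4))
```

Exhaustive search like this is exact but slow on large collections, which is why dedicated vector search libraries typically add indexing structures that trade some accuracy for speed.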
- Ask HN: Who is hiring? (August 2023)
New Vector search library: https://github.com/TileDB-Inc/TileDB-Vector-Search
Our headquarters are located in Cambridge, MA and we have a subsidiary in Athens, Greece. We offer the ability to work remotely for anyone with legal residence in the US or Greece. We have several open positions aimed at increasing TileDB’s feature set, growth and adoption. You will have the opportunity to work on innovative technology that creates impact on challenging and exciting problems in Genomics, Geospatial, Time Series, and more. We have just launched a new vector search library built on top of TileDB and leveraging
We are actively seeking:
What are some alternatives?
opentofu - OpenTofu lets you declaratively manage your cloud infrastructure.
tiny-dream - Tiny Dream - An embedded, Header Only, Stable Diffusion C++ implementation
tabby - Self-hosted AI coding assistant
nl-wallet - NL Public Reference Wallet
awesome-langchain - 😎 Awesome list of tools and projects with the awesome LangChain framework
mentat - Mentat - The AI Coding Assistant
giskard - 🐢 Open-Source Evaluation & Testing for LLMs and ML models
autodistill - Images to inference with no labeling (use foundation models to train supervised models).
refact - WebUI for Fine-Tuning and Self-hosting of Open-Source Large Language Models for Coding
supervision - We write your reusable computer vision tools. 💜
Weaviate - Weaviate is an open-source vector database that stores both objects and vectors, allowing for the combination of vector search with structured filtering with the fault tolerance and scalability of a cloud-native database.