fides
CKAN
Metric | fides | CKAN |
---|---|---|
Mentions | 2 | 6 |
Stars | 328 | 4,267 |
Growth | 0.6% | 0.7% |
Activity | 9.8 | 9.8 |
Last commit | 5 days ago | 6 days ago |
Language | Python | Python |
License | Apache License 2.0 | GNU General Public License v3.0 or later |
Stars - the number of stars that a project has on GitHub. Growth - month-over-month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
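The page does not publish how the activity number is computed, so the following is only a rough sketch of the idea described above: recent commits count more than old ones, and the result is expressed relative to all tracked projects. The exponential half-life, the 0-10 scaling, and the function names `weighted_activity` and `relative_score` are all assumptions made for illustration.

```python
from bisect import bisect_left


def weighted_activity(commit_ages_days, half_life_days=30.0):
    """Sum commits, halving each commit's weight every `half_life_days`.

    The exponential decay is an assumed weighting, not the site's formula.
    """
    return sum(0.5 ** (age / half_life_days) for age in commit_ages_days)


def relative_score(value, all_values):
    """Scale a raw value to 0-10 by the share of tracked projects strictly below it.

    Under this assumed scaling, 9.0 means 90% of projects rank lower,
    i.e. the project is in the top 10%.
    """
    ranked = sorted(all_values)
    below = bisect_left(ranked, value)
    return 10.0 * below / len(ranked)
```

With this scaling, two projects can share the same score (as fides and CKAN do at 9.8) even when their raw commit counts differ, because only their rank among tracked projects matters.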
fides
-
What data governance tool are you folks using?
I’ve also been impressed with the approach of Fides, an open source privacy management framework that ties into CI/CD, though I haven’t used it myself yet. The thing about it that stood out was Fideslang, their language and taxonomy for representing data privacy primitives.
-
Privacy-as-Code: Preventing Facebook’s $5B violation using Fides Open-Source
Fides is built to solve problems like this. In its current release, you can already draft a policy in YAML using fideslang and enforce that policy, so that engineers across a team can’t accidentally or intentionally misuse data in a way that deviates from the promises a business or application makes to its users.
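To make the idea of policy enforcement concrete, here is a minimal sketch of the kind of check such a tool could run in CI. It is not the Fides API or the fideslang schema: the `Rule` and `Declaration` classes, the `violations` function, and the category and use labels in the usage note are hypothetical and chosen only to illustrate the pattern of comparing declared data uses against a policy.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Rule:
    """Hypothetical policy rule: these categories must not serve these uses."""
    name: str
    forbidden_categories: frozenset
    forbidden_uses: frozenset


@dataclass(frozen=True)
class Declaration:
    """Hypothetical record of what a system says it does with data."""
    system: str
    categories: frozenset
    uses: frozenset


def violations(rules, declarations):
    """Return (system, rule name) pairs where a declaration breaks a rule.

    A rule is broken when the declaration shares at least one forbidden
    category AND at least one forbidden use with it.
    """
    found = []
    for decl in declarations:
        for rule in rules:
            if (decl.categories & rule.forbidden_categories
                    and decl.uses & rule.forbidden_uses):
                found.append((decl.system, rule.name))
    return found
```

A CI job could build a rule such as `Rule("no-ads-on-phone", frozenset({"user.contact.phone_number"}), frozenset({"advertising"}))`, load each system's declarations, and fail the build whenever `violations(...)` returns a non-empty list.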
CKAN
-
Open Source Flask-based web applications
CKAN The Open Source Data Portal Software
-
Metadata Store - Which one to Choose ? OpenMetadata vs Datahub ?
We use Kubernetes as our deployment platform. Any feedback on any of these open source data catalogs? - https://atlas.apache.org/#/ - https://opendatadiscovery.org/ - https://open-metadata.org/ - https://marquezproject.github.io/marquez/ - https://datahubproject.io/ - https://www.amundsen.io/ - https://ckan.org/ - https://magda.io/
-
What 'tool' is used to build OpenData sites?
CKAN (https://ckan.org/) is what data.gov and most state governments use.
-
Software and tools for (non-human) genomics data platform
Our first instinct is to use [CKAN](https://ckan.org) for cataloging (and storage, with modifications), especially since we know it and know that it has been used successfully elsewhere. However, we suspect that more specialized/better tools exist for this, which is why I kindly ask for your insights.
-
How to start Data Science and Machine Learning Career?
Ckan
-
We are digitisers at the Natural History Museum in London, on a mission to digitise 80 million specimens and free their data to the world. Ask us anything!
We publish all our data on the [Data Portal](https://data.nhm.ac.uk), a Museum project that's been running since 2014. Instead of MediaWiki it runs on an open-source Python framework called [CKAN](https://ckan.org), which is designed for hosting datasets - though we've had to adapt it in various ways so that it can handle such large amounts of data.
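Several of the posts above describe CKAN as the software behind public data portals. As a rough illustration of what that means in practice, the sketch below queries a portal through CKAN's Action API `package_search` endpoint and extracts dataset titles. The helper names and the `https://ckan.example.org` base URL in the usage note are placeholders, and whether a given portal exposes this endpoint publicly is an assumption you would need to check.

```python
import json
from urllib.parse import urlencode
from urllib.request import urlopen


def package_search_url(base_url, query, rows=5):
    """Build the URL for a dataset search against a CKAN portal."""
    params = urlencode({"q": query, "rows": rows})
    return f"{base_url.rstrip('/')}/api/3/action/package_search?{params}"


def dataset_titles(response):
    """Pull dataset titles out of a decoded package_search response.

    Falls back to the dataset's `name` when `title` is missing or empty.
    """
    if not response.get("success"):
        raise RuntimeError("CKAN reported an unsuccessful request")
    return [d.get("title") or d.get("name")
            for d in response["result"]["results"]]


def search(base_url, query, rows=5):
    """Fetch and decode search results (performs a network request)."""
    with urlopen(package_search_url(base_url, query, rows), timeout=10) as resp:
        return dataset_titles(json.load(resp))
```

Calling `search("https://ckan.example.org", "beetles")` against a reachable portal would return up to five matching dataset titles. URL construction and response parsing are kept separate from the network call so they can be checked offline.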
What are some alternatives?
fiftyone - The open-source tool for building high-quality datasets and computer vision models
ArchivesSpace - The ArchivesSpace archives management tool
differential-privacy-library - Diffprivlib: The IBM Differential Privacy Library
ArchiveBox - 🗃 Open source self-hosted web archiving. Takes URLs/browser history/bookmarks/Pocket/Pinboard/etc., saves HTML, JS, PDFs, media, and more...
dvc - 🦉 ML Experiments and Data Management with Git
Archivematica - Free and open-source digital preservation system designed to maintain standards-based, long-term access to collections of digital objects.
datahub - The Metadata Platform for your Data Stack
Access to Memory (AtoM) - Open-source, web application for archival description and public access.
awesome-machine-unlearning - Awesome Machine Unlearning (A Survey of Machine Unlearning)
Collective Access: Providence - Cataloguing and data/media management application
pandas-datareader - Extract data from a wide range of Internet sources into a pandas DataFrame.
Activeloop Hub - Data Lake for Deep Learning. Build, manage, query, version, & visualize datasets. Stream data real-time to PyTorch/TensorFlow. https://activeloop.ai [Moved to: https://github.com/activeloopai/deeplake]