Top 23 Python Schema Projects
-
marshmallow
A lightweight library for converting complex objects to and from simple Python datatypes.
Somehow get a serializer going for your database models. I used marshmallow and flask-marshmallow
-
-
Mimesis
Mimesis is a powerful Python library that empowers developers to generate massive amounts of synthetic data efficiently.
Project mention: Mimesis allows you to easily generate detailed dummy datasets. | /r/datascience | 2023-04-12
Mimesis has well-structured and comprehensive documentation: https://mimesis.name
-
I use Pandera, so I just need to define the expected input/output schemas (i.e. column names, types, and constraints on them), and Pandera automatically generates fake data for the unit tests, and validates the result: https://github.com/unionai-oss/pandera
-
-
-
msgspec
A fast serialization and validation library, with builtin support for JSON, MessagePack, YAML, and TOML
Full support for validation and serialisation of attrs classes and msgspec Structs. Where previously only Pydantic models and types were supported, you can now mix and match any of these three libraries. In addition, adding support for another modelling library has been greatly simplified by the new plugin architecture.
-
Funny that you ask... https://github.com/pydantic/pydantic-core Unfortunately it seems that the functionality you ask for is not (yet) part of this ...
-
Project mention: Tools that help to explore and navigate database tables | /r/dataengineering | 2022-11-21
Something like this, maybe: https://github.com/Alexis-benoist/eralchemy
-
Project mention: Recommended ways to catch missing environment variables in github actions? | /r/devops | 2023-01-10
For example https://github.com/23andMe/Yamale and more broadly https://json-schema-everywhere.github.io/yaml
-
Project mention: PyGraft: Configurable Generation of Schemas and Knowledge Graphs | news.ycombinator.com | 2023-09-13
-
pysonDB
A Simple, ☁️ Lightweight, 💪 Efficient JSON-based database for 🐍 Python. PysonDB-V2 has been released ⬇️
Have a tl;dr on how it differs from or improves on pysonDB?
-
-
Project mention: Show HN: Up to 100x Faster FastAPI with simdjson and io_uring on Linux 5.19 | news.ycombinator.com | 2023-03-06
Author of typedload here!
FastAPI relies on (not so fast) pydantic, which is one of the slowest libraries in that category.
Don't expect to find such benchmarks on the pydantic documentation itself, but the competing libraries will have them.
-
ProfileManifestsMirror
Jamf JSON schema manifests automatically generated from ProfileCreator manifests (https://github.com/ProfileCreator/ProfileManifests)
You can make custom profiles in Jamf using the same JSON that iMazing uses, from the same repo. This gives you access to all of the keys available in the software. I think the reason they say it's not enterprise software is that there is no central control as there is with Chrome or Edge. https://github.com/Jamf-Custom-Profile-Schemas/ProfileManifestsMirror
-
Project mention: argparse_dataclass: declarative CLIs with argparse and dataclasses (e.g. similar to Rust's structopt) | /r/Python | 2023-06-01
-
openapi-schema-validator
OpenAPI schema validator is a Python library that validates schema against OpenAPI Schema Specification v3.0 and v3.1
-
-
Project mention: Show HN: Mathesar – open-source collaborative UI for Postgres databases | news.ycombinator.com | 2023-03-02
JSONLD types are specified with @type, and the range of a @type attribute includes rdfs:Class.
icontract and pycontracts (contracts-based programming) have runtime type and constraint checking; data validation. Precondition, Command, Postconditions (assertions, assertions of invariance after command C_funcname executed)
pydantic_schemaorg: https://github.com/lexiq-legal/pydantic_schemaorg
> Pydantic_schemaorg contains all the models defined by schema.org. The pydantic classes are auto-generated from the schema.org model definitions that can be found on https://schema.org/version/latest/schemaorg-current-https.js... [ https://github.com/schemaorg/schemaorg/tree/main/data/releas... ]
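The precondition / command / postcondition pattern mentioned in the comment above can be sketched with the standard library alone. This is only an illustration of the idea, not the API of icontract or pycontracts; the `contract` decorator and the `integer_sqrt` function are hypothetical names introduced for this example.

```python
import functools
import math


def contract(pre=None, post=None):
    """Hypothetical decorator: check `pre` on the arguments, `post` on the result."""

    def decorator(func):
        @functools.wraps(func)
        def wrapper(*args, **kwargs):
            # Precondition: must hold before the command runs.
            if pre is not None and not pre(*args, **kwargs):
                raise ValueError(f"precondition failed for {func.__name__}")
            # Command: the wrapped function itself.
            result = func(*args, **kwargs)
            # Postcondition: must hold for the returned value.
            if post is not None and not post(result):
                raise ValueError(f"postcondition failed for {func.__name__}")
            return result

        return wrapper

    return decorator


@contract(pre=lambda x: x >= 0, post=lambda r: r >= 0)
def integer_sqrt(x: int) -> int:
    """Return the integer square root of a non-negative integer."""
    return math.isqrt(x)
```

Calling `integer_sqrt(16)` returns `4`, while `integer_sqrt(-1)` raises `ValueError` before the function body runs. Dedicated libraries add features this sketch leaves out, such as invariants and richer error messages.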
-
lupin is a Python JSON object mapper
Python document object mapper (load python object from JSON and vice-versa)
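To show what "load a Python object from JSON and vice versa" means in practice, here is a minimal sketch using only the standard library. It does not use lupin's API; `Person`, `dump_json`, and `load_json` are hypothetical names, and the sketch assumes a flat dataclass whose fields are JSON-native types.

```python
import json
from dataclasses import asdict, dataclass


@dataclass
class Person:
    name: str
    age: int


def dump_json(obj) -> str:
    """Serialize a flat dataclass instance to a JSON string."""
    return json.dumps(asdict(obj), sort_keys=True)


def load_json(cls, text: str):
    """Build an instance of `cls` from a JSON object with matching keys."""
    return cls(**json.loads(text))


ada = Person(name="Ada", age=36)
encoded = dump_json(ada)            # '{"age": 36, "name": "Ada"}'
decoded = load_json(Person, encoded)
```

A real object mapper typically also handles nested objects, field renaming, and validation, which this sketch omits.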
-
-
-
Conventional Knowledge Graph Construction (KGC) approaches typically follow the static information extraction paradigm with a closed set of pre-defined schemas. As a result, such approaches fall short when applied to dynamic scenarios or domains where new types of knowledge emerge. This necessitates a system that can handle evolving schemas automatically to extract information for KGC. To address this need, we propose a new task called schema-adaptable KGC, which aims to continually extract entities, relations, and events based on a dynamically changing schema graph without re-training. We first split and convert existing datasets based on three principles to build a benchmark, i.e., horizontal schema expansion, vertical schema expansion, and hybrid schema expansion; we then investigate the schema-adaptable performance of several well-known approaches such as Text2Event, TANL, UIE and GPT-3. We further propose a simple yet effective baseline dubbed AdaKGC, which contains a schema-enriched prefix instructor and schema-conditioned dynamic decoding to better handle evolving schemas. Comprehensive experimental results illustrate that AdaKGC can outperform baselines but still has room for improvement. We hope the proposed work can deliver benefits to the community. Code and datasets will be available at https://github.com/zjunlp/AdaKGC.
Python Schema related posts
- Unit testing functions that input/output dataframes?
- Pydantic vs Protobuf vs Namedtuples vs Dataclasses
- Yet another command line tool: clidantic, typed CLIs with click and pydantic
- Data validation for dashboards
- Mypy 0.940 Released
- Show HN: Pandera 0.8.0 – validate pandas, dask, modin, and koalas dataframes
- Pandera 0.8.0: Schema Validation for Pandas, Dask, Modin, and Koalas DataFrames. Oh, and also out-of-the-box Pydantic and Mypy support :)
-
Index
What are some of the best open-source Schema projects in Python? This list will help you:
# | Project | Stars |
---|---|---|
1 | marshmallow | 6,680 |
2 | jsonschema | 4,247 |
3 | Mimesis | 4,064 |
4 | pandera | 2,598 |
5 | Schematics | 2,562 |
6 | strictyaml | 1,324 |
7 | msgspec | 1,310 |
8 | pydantic-core | 1,095 |
9 | eralchemy | 1,040 |
10 | Yamale | 594 |
11 | pygraft | 489 |
12 | pysonDB | 347 |
13 | flama | 237 |
14 | typedload | 227 |
15 | ProfileManifestsMirror | 126 |
16 | pydantic-cli | 111 |
17 | openapi-schema-validator | 70 |
18 | SnowDDL | 66 |
19 | pydantic_schemaorg | 38 |
20 | lupin is a Python JSON object mapper | 24 |
21 | py2gpt | 14 |
22 | okjson | 5 |
23 | AdaKGC | 4 |