JSON in data science projects: tips & tricks

This page summarizes the projects mentioned and recommended in the original post on dev.to

Our great sponsors
  • WorkOS - The modern identity platform for B2B SaaS
  • InfluxDB - Power Real-Time Data Analytics at Scale
  • SaaSHub - Software Alternatives and Reviews
  • orjson

    Fast, correct Python JSON library supporting dataclasses, datetimes, and numpy

  • orjson is the fastest JSON library available for python. It natively manages dataclass objects, datetime, numpy and UUID objects.

  • NumPy

    The fundamental package for scientific computing with Python.

  • Data science projects often use numpy. However, numpy objects are not JSON-serializable and therefore require conversion to standard python objects in order to be saved:

  • WorkOS

    The modern identity platform for B2B SaaS. The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning.

    WorkOS logo
  • jsonlines

    Documentation for the JSON Lines text file format

  • This can be remedied by using the [JSON Lines] format (https://jsonlines.org/). This involves nothing more and nothing less than placing one JSON object per line, so that you can browse the objects without having to parse the entire collection all at once.

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Suggest a related project

Related posts