african_microbiome_portal_data
Raw and corrected data with correction python notebook (by h3abionet)
ploomber-engine
A toolbox 🧰 for Jupyter notebooks 📙: testing, experiment tracking, debugging, profiling, and more! (by ploomber)
african_microbiome_portal_data | ploomber-engine | |
---|---|---|
1 | 3 | |
0 | 68 | |
- | - | |
4.3 | 5.5 | |
almost 2 years ago | 7 months ago | |
Jupyter Notebook | Python | |
GNU General Public License v3.0 only | BSD 3-clause "New" or "Revised" License |
The number of mentions indicates the total number of mentions that we've tracked plus the number of user suggested alternatives.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
african_microbiome_portal_data
Posts with mentions or reviews of african_microbiome_portal_data.
We have used some of these posts to build our list of alternatives
and similar projects. The last one was on 2022-09-09.
-
Running Jupyter notebooks in parallel
Here we will share the results after testing and evaluating some of these tools. Note that to make this comparison fair, it takes into account the use of the same code for all executions and we also use Python's time module to measure the execution time. The notebooks used for benchmarking can be found here and correspond to the african_microbiome_portal_data repository. Serial execution cases (each notebook sequentially) are evaluated first, followed by parallel notebook execution cases.
ploomber-engine
Posts with mentions or reviews of ploomber-engine.
We have used some of these posts to build our list of alternatives
and similar projects. The last one was on 2024-09-18.
-
Papermill: Parameterizing, executing, and analyzing Jupyter Notebooks
Papermill is great but has quite some limitations because it spins up a new process to run the notebook:
- You cannot extract live variables (needed for testing)
- Cannot use pdb for debugging
- Cannot profile memory usage
You can do all of that with ploomber-engine (https://github.com/ploomber/ploomber-engine).
-
Who needs MLflow when you have SQLite?
If you need help, you can open an issue on GitHub (https://github.com/ploomber/ploomber-engine) or join our Slack! (https://ploomber.io/community/)
-
Running Jupyter notebooks in parallel
As a third option we will use Papermill again, but now with the ploomber-engine, which adds debugging and profiling features to Papermill:
What are some alternatives?
When comparing african_microbiome_portal_data and ploomber-engine you can also consider the following projects:
ploomber - The fastest ⚡️ way to build data pipelines. Develop iteratively, deploy anywhere. ☁️
unionml - UnionML: the easiest way to build and deploy machine learning microservices
ganimede
rubicon-ml - Capture all information throughout your model's development in a reproducible way and tie results directly to the model code!
trade-executor - A Python framework for managing positions and trades in DeFi