borb VS cleanX

Compare borb vs cleanX and see what are their differences.

borb

borb is a library for reading, creating and manipulating PDF files in python. (by jorisschellekens)

cleanX

Python library for exploring, cleaning, normalizing, and augmenting large datasets of radiological data. (by drcandacemakedamoore)
InfluxDB - Power Real-Time Data Analytics at Scale
Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
www.influxdata.com
featured
SaaSHub - Software Alternatives and Reviews
SaaSHub helps you find the best software and product alternatives
www.saashub.com
featured
borb cleanX
66 10
3,287 24
- -
5.2 0.0
about 2 months ago over 1 year ago
Python Jupyter Notebook
GNU General Public License v3.0 or later GNU General Public License v3.0 only
The number of mentions indicates the total number of mentions that we've tracked plus the number of user suggested alternatives.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.

borb

Posts with mentions or reviews of borb. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2022-10-28.

cleanX

Posts with mentions or reviews of cleanX. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2021-12-23.
  • Looking for a minimal Conda package project
    4 projects | /r/learnpython | 23 Dec 2021
    Unfortunately, it's not minimal, and is quite complicated, but it's the one I wrote, so I can answer your questions if you have any: https://github.com/drcandacemakedamoore/cleanX/tree/main/conda-pkg and https://github.com/drcandacemakedamoore/cleanX/blob/main/setup.py (the links are to Conda-specific setup). I also created Conda builds for few other projects (not written by me):
  • Hope this helps someone
    1 project | /r/datascience | 19 Nov 2021
    If you work with medical images you may be interested to look at https://github.com/drcandacemakedamoore/cleanX . It does data cleaning for chest X-rays (DICOM or JPEGs) and associated information.
  • Hello r/DS, wanted to share a package
    1 project | /r/datascience | 6 Nov 2021
    A few months ago I posted to the machine learning channel about a package I thought some might find helpful for radiology images. The package, cleanX, has since evolved, and I think it can be even more helpful, and to actual data science practitioners in general. While it began for chest X-rays, many sections are generalizable for tabular data linked to images. Enjoy!
  • [D] How to find dissimilarities between sets of images?
    1 project | /r/MachineLearning | 20 Aug 2021
    I faced a similar issue on chest X-rays. You can see what I did already about it in the library cleanX, but to summarize for an initial idea I averaged all the images in each set, then subtracted. It won't tell you much, but it will point to areas where things are different- in the case of roads with an without tractors, it will show big differences on the roads (unless there is a difference in what surrounds the roads)
  • cleaning data, especially chest X-rays, before algorithm creation
    1 project | /r/medicalimaging | 13 Aug 2021
    There is no machine learning. The algorithms are meant to clean up the data before you build any machine learning algorithms. There are several ways to predict non-Xray images. One is to compare them to a small average image of all the X-rays. Take a look at the code (https://github.com/drcandacemakedamoore/cleanX), and you will find a couple of others...
  • [R] cleaning data, especially chest X-rays
    3 projects | /r/MachineLearning | 11 Aug 2021
    An excellent suggestion. I have thought about how best to implement demonstrations, because when I ran my code over a few Kaggle datasets, I found problems in them as I show here, but to know if I caught ALL of them or even most of them would require examining every single image by hand. But maybe the answer is to show a machine learning training task with dirty and cleanX cleaned data to show it makes a difference. I'll try to get to it soon.
  • Sunday Daily Thread: What's everyone working on this week?
    10 projects | /r/Python | 7 Aug 2021
  • Generalising data cleaning, is it possible?
    1 project | /r/learnpython | 7 Aug 2021

What are some alternatives?

When comparing borb and cleanX you can also consider the following projects:

ReportLab

Python-GUI-Calculator

PyPDF2 - A pure-python PDF library capable of splitting, merging, cropping, and transforming the pages of PDF files

pylibjpeg-openjpeg - A J2K and JP2 plugin for pylibjpeg

PDFMiner - Python PDF Parser (Not actively maintained). Check out pdfminer.six.

flask-graphql-boilerplate - a flask boilerplate to get you up and running. Packed with GraphQL and an authentication system out of the box.

pdf2docx - Open source Python library for converting PDF to DOCX.

Connect-Four - Connect Four GUI Game - Python-Tkinter

pikepdf - A Python library for reading and writing PDF, powered by QPDF

nobotty

fpdf2 - Simple PDF generation for Python

black - The uncompromising Python code formatter