numpy-string-indexed
xorbits
numpy-string-indexed | xorbits | |
---|---|---|
2 | 7 | |
2 | 1,016 | |
- | 2.2% | |
0.0 | 8.8 | |
about 2 years ago | about 1 month ago | |
Python | Python | |
MIT License | Apache License 2.0 |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
numpy-string-indexed
xorbits
-
Everything you need to know about pandas 2.0.0!
Here’s our project: https://github.com/xprobe-inc/xorbits
-
Introducing Xorbits: A Distributed Python Data Science Framework for Large Dataset Analysis
Hey everyone, we are excited to announce our new project, Xorbits, a scalable data science framework that aims to scale the entire Python data science world.
-
Use maximum PC Hardware Resources
My suggestion is to use some parallel computing framework like Xorbits. The framework will parallel your workload automatically. For data processing tasks, just use xorbits.pandas or xorbits.numpy, and you can run almost any python workload with xorbits.remote.
-
Use "distributed pandas" to scale your data science workflow!
If you are interested in learning more about Xorbits, please visit our project's Github for more information: https://github.com/xprobe-inc/xorbits
-
A new way to accelerate your data science workflow
Xorbits can be an ideal solution for these issues. Xorbits is a scalable Python data science framework that aims to scale the Python data science stack while keeping the API compatibility. You can get an out-of-box performance gain by changing `import pandas as pd` to `import xorbits.pandas as pd`.
- Scalable Python data science, in an API compatible and fast way
What are some alternatives?
machine_learning_refined - Notes, examples, and Python demos for the 2nd edition of the textbook "Machine Learning Refined" (published by Cambridge University Press).
Data Flow Facilitator for Machine Learning (dffml) - The easiest way to use Machine Learning. Mix and match underlying ML libraries and data set sources. Generate new datasets or modify existing ones with ease.
data-science-ipython-notebooks - Data science Python notebooks: Deep learning (TensorFlow, Theano, Caffe, Keras), scikit-learn, Kaggle, big data (Spark, Hadoop MapReduce, HDFS), matplotlib, pandas, NumPy, SciPy, Python essentials, AWS, and various command lines.
trueskill - An implementation of the TrueSkill rating system for Python
hamilton - Hamilton helps data scientists and engineers define testable, modular, self-documenting dataflows, that encode lineage and metadata. Runs and scales everywhere python does.
tensorflow - An Open Source Machine Learning Framework for Everyone
mlcourse.ai - Open Machine Learning Course
Prophet - Tool for producing high quality forecasts for time series data that has multiple seasonality with linear or non-linear growth.
PyBrain
seqeval - A Python framework for sequence labeling evaluation(named-entity recognition, pos tagging, etc...)
adaptive - :chart_with_upwards_trend: Adaptive: parallel active learning of mathematical functions
H2O - H2O is an Open Source, Distributed, Fast & Scalable Machine Learning Platform: Deep Learning, Gradient Boosting (GBM) & XGBoost, Random Forest, Generalized Linear Modeling (GLM with Elastic Net), K-Means, PCA, Generalized Additive Models (GAM), RuleFit, Support Vector Machine (SVM), Stacked Ensembles, Automatic Machine Learning (AutoML), etc.