Statistics

Open-source projects categorized as Statistics Edit details

Top 23 Statistic Open-Source Projects

  • scikit-learn

    scikit-learn: machine learning in Python

    Project mention: Can anyone share some good examples of Python OOP Repos for DS? | reddit.com/r/datascience | 2022-09-17
  • Probabilistic-Programming-and-Bayesian-Methods-for-Hackers

    aka "Bayesian Methods for Hackers": An introduction to Bayesian methods + probabilistic programming with a computation/understanding-first, mathematics-second point of view. All in pure Python ;)

    Project mention: ✨ 10 Free Books for Machine Learning & Data Science 📚 | reddit.com/r/learnmachinelearning | 2022-09-23

    🔗 https://camdavidsonpilon.github.io/Probabilistic-Programming-and-Bayesian-Methods-for-Hackers/

  • talent.io

    Download talent.io’s Tech Salary Report. Median salaries, most in-demand technologies, state of the remote work... all you need to know your worth on the market by tech recruitment platform talent.io

  • Umami

    Umami is a simple, fast, privacy-focused alternative to Google Analytics.

    Project mention: Help growing open source software monetization | reddit.com/r/startups | 2022-09-25

    Look at umami.is they are OSS analytics. However, just recently they announced hosting plans, so everyone gets their own instance while they manage the servers. I know what you're offering is totally different, but a great example.

  • excelize

    Go language library for reading and writing Microsoft Excel™ (XLAM / XLSM / XLSX / XLTM / XLTX) spreadsheets

    Project mention: Where does PHP being single threaded affect performance and can anything be done to make it better? | reddit.com/r/PHP | 2022-09-27

    Golang excel: https://github.com/qax-os/excelize

  • Plausible Analytics

    Simple, open-source, lightweight (< 1 KB) and privacy-friendly web analytics alternative to Google Analytics.

    Project mention: How to grow a SaaS company to $500k in Annual Recurring Revenue with content distribution? | reddit.com/r/Entrepreneur | 2022-09-28

    Today I would like to share another great case study with you. I had the pleasure to speak with Marko Saric from Plausible. He told me a lot of insights on how he grew his SaaS to $500k ARR by writing valuable content and distributing it in the right places.

  • G2

    📊 A visualization grammar for dashboard building, data exploration and storytelling.

  • pandas-profiling

    Create HTML profiling reports from pandas DataFrame objects

    Project mention: Data profiling as part of a data reliability strategy? | reddit.com/r/dataengineering | 2022-09-15
  • Scout APM

    Truly a developer’s best friend. Scout APM is great for developers who want to find and fix performance issues in their applications. With Scout, we'll take care of the bugs so you can focus on building great things 🚀.

  • statsmodels

    Statsmodels: statistical modeling and econometrics in Python

    Project mention: How do you usually build your models? | reddit.com/r/quant | 2022-09-17

    Since you are using python, pandas, scikit-learn, scipy, and statsmodels are what you are looking for

  • tokei

    Count your code, quickly.

    Project mention: Your favourite Rust CLI utilities this year? | reddit.com/r/rust | 2022-09-17

    Not sure if anyone mentioned tokei yet but it’s excellent

  • gonum

    Gonum is a set of numeric libraries for the Go programming language. It contains libraries for matrices, statistics, optimization, and more

    Project mention: Modern hardware is fast, so let's choose the slowest language to balance it out | reddit.com/r/ProgrammerHumor | 2022-09-19
  • boltons

    🔩 Like builtins, but boltons. 250+ constructs, recipes, and snippets which extend (and rely on nothing but) the Python standard library. Nothing like Michael Bolton.

  • miller

    Miller is like awk, sed, cut, join, and sort for name-indexed data such as CSV, TSV, and tabular JSON

    Project mention: Miller 6.4.0: mlr summary verb, exec() function, mlr cat –filename, and more | news.ycombinator.com | 2022-08-20
  • Smile

    Statistical Machine Intelligence & Learning Engine

    Project mention: Anybody here using Java for machine learning? | reddit.com/r/java | 2022-09-13

    For deploying a trained model there are a bunch of options that use Java on top of some native runtime like TF-Java (which I co-lead), ONNX Runtime, pytorch has inference for TorchScript models. Training deep learning models is harder, though you can do it for some of them in DJL. Training more standard ML models is much simpler, either via Tribuo, or using things like LibSVM & XGBoost directly, or other libraries like SMILE or WEKA.

  • git-quick-stats

    ▁▅▆▃▅ Git quick statistics is a simple and efficient way to access various statistics in git repository.

  • Tautulli

    A Python based monitoring and tracking tool for Plex Media Server.

    Project mention: Lost my Music have to rebuild the NAS storage | reddit.com/r/PleX | 2022-09-30
  • scc

    Sloc, Cloc and Code: scc is a very fast accurate code counter with complexity calculations and COCOMO estimates written in pure Go

    Project mention: Large project uses Rust backend. My backend developer left. How hard is it for me to learn Rust and take over for him. | reddit.com/r/rust | 2022-08-08

    I don't trust your qualitative "LARGE" for the project. I would recommend you pass your project through something like a software metrics tool https://github.com/boyter/scc to better measure what you're up against in terms of Flutter/Dart AND Rust code base.

  • probability

    Probabilistic reasoning and statistical analysis in TensorFlow

    Project mention: [P] Any good resources which can help me with Multivariate Time Series Forecasting using Probabilistic Machine Learning? | reddit.com/r/MachineLearning | 2022-08-14
  • growthbook

    Open Source Feature Flagging and A/B Testing Platform

    Project mention: GrowthBook: Open-source feature flagging and A/B testing platform | reddit.com/r/coolgithubprojects | 2022-06-23
  • datascience

    Curated list of Python resources for data science.

    Project mention: Datascience Libraries for Python | news.ycombinator.com | 2021-11-13
  • stdlib

    ✨ Standard library for JavaScript and Node.js. ✨

    Project mention: my thoughts about Node as a backend | reddit.com/r/node | 2022-08-18

    Scientific calculations - like what? There is a stlib package for JS where CPU hungry parts are written in C.

  • Tablesaw

    Java dataframe and visualization library

  • MathNet

    Math.NET Numerics

    Project mention: Trying to Compute the Square root of a number. C# | reddit.com/r/VisualStudio | 2022-09-26

    Standard deviation isn’t the square root. I generally just use https://numerics.mathdotnet.com . If you need to visualize the data somehow you might want to look at R

  • criterion.rs

    Statistics-driven benchmarking library for Rust

    Project mention: Criterion.rs with MockClock, is this possible? | reddit.com/r/rust | 2022-09-06

    the cfg flag test is not carried over to the criterion benchmarks, see https://github.com/bheisler/criterion.rs/issues/448 and https://github.com/rust-lang/cargo/issues/8897

  • SonarLint

    Clean code begins in your IDE with SonarLint. Up your coding game and discover issues early. SonarLint is a free plugin that helps you find & fix bugs and security issues from the moment you start writing code. Install from your favorite IDE marketplace today.

NOTE: The open source projects on this list are ordered by number of github stars. The number of mentions indicates repo mentiontions in the last 12 Months or since we started tracking (Dec 2020). The latest post mention was on 2022-09-30.

Statistics related posts

Index

What are some of the best open-source Statistic projects? This list will help you:

Project Stars
1 scikit-learn 51,531
2 Probabilistic-Programming-and-Bayesian-Methods-for-Hackers 24,846
3 Umami 13,276
4 excelize 12,956
5 Plausible Analytics 12,614
6 G2 11,373
7 pandas-profiling 9,665
8 statsmodels 7,768
9 tokei 7,032
10 gonum 6,076
11 boltons 5,877
12 miller 5,729
13 Smile 5,590
14 git-quick-stats 5,492
15 Tautulli 4,532
16 scc 3,801
17 probability 3,778
18 growthbook 3,623
19 datascience 3,474
20 stdlib 3,045
21 Tablesaw 3,002
22 MathNet 2,923
23 criterion.rs 2,806
Find remote jobs at our new job board 99remotejobs.com. There are 8 new remote jobs listed recently.
Are you hiring? Post a new remote job listing for free.
Build time-series-based applications quickly and at scale.
InfluxDB is the Time Series Data Platform where developers build real-time applications for analytics, IoT and cloud-native services in less time with less code.
www.influxdata.com