Statistics
Statistics is a mathematical discipline concerned with developing and studying mathematical methods for collecting, analyzing, interpreting, and presenting large quantities of numerical data. Statistics is a highly interdisciplinary field of study with applications in fields such as physics, chemistry, life sciences, political science, and economics.
Here are 8,902 public repositories matching this topic...
aka "Bayesian Methods for Hackers": An introduction to Bayesian methods + probabilistic programming with a computation/understanding-first, mathematics-second point of view. All in pure Python ;)
Updated Jan 6, 2022 - Jupyter Notebook
Updated Jan 28, 2022 - TypeScript
Golang library for reading and writing Microsoft Excel™ (XLSX) files.
Updated Feb 4, 2022 - Go
Simple, open-source, lightweight (< 1 KB) and privacy-friendly web analytics alternative to Google Analytics.
Updated Feb 1, 2022 - Elixir
Umami is a simple, fast, website analytics alternative to Google Analytics.
Updated Feb 1, 2022 - JavaScript
Create HTML profiling reports from pandas DataFrame objects
Updated Feb 4, 2022 - Jupyter Notebook
Collection of follow-ups to #5827. These can/should be broken out into individual PRs. Many are relatively straightforward and would make a good first PR.
General
- Documentation (none was added in original PR).
- Release notes.
- Example notebook.
- Double-check how sm.tsa.arima.ARIMA works with fix_params (it should fail except when the fit method is statespace
Updated Jan 28, 2022 - HTML
Updated Jan 15, 2022 - Python
A Python Package to Tackle the Curse of Imbalanced Datasets in Machine Learning
Updated Jan 25, 2022 - Python
Statistical Machine Intelligence & Learning Engine
Updated Feb 1, 2022 - Java
Miller is like awk, sed, cut, join, and sort for name-indexed data such as CSV, TSV, and tabular JSON
Updated Feb 5, 2022 - Go
A probabilistic programming language in TensorFlow. Deep generative models, variational inference.
Updated Oct 22, 2019 - Jupyter Notebook
A Python based monitoring and tracking tool for Plex Media Server.
Updated Feb 4, 2022 - Python
Machine learning, computer vision, statistics and general scientific computing for .NET
Updated Nov 18, 2020 - C#
Are there any plans to add a Zero-Inflated Poisson (ZIP) and Zero-Inflated Negative Binomial (ZINB) to TFP? Those are usually very common distributions in other packages, and it shouldn't be hard to implement.
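As a rough illustration of what the request involves, a zero-inflated Poisson mixes a point mass at zero with an ordinary Poisson. The sketch below uses only the Python standard library; it is not TFP code, and the function name `zip_log_pmf` and the parameter names `rate` and `zero_prob` are hypothetical.

```python
import math


def zip_log_pmf(k: int, rate: float, zero_prob: float) -> float:
    """Log-probability of count k under a zero-inflated Poisson.

    rate is the Poisson mean; zero_prob is the probability of a
    structural zero. Both names are illustrative assumptions.
    """
    if rate <= 0.0 or not 0.0 <= zero_prob < 1.0:
        raise ValueError("need rate > 0 and 0 <= zero_prob < 1")
    if k < 0:
        return -math.inf
    if k == 0:
        # A zero is either structural or drawn from the Poisson component.
        return math.log(zero_prob + (1.0 - zero_prob) * math.exp(-rate))
    # Positive counts can only come from the Poisson component.
    poisson_log_pmf = k * math.log(rate) - rate - math.lgamma(k + 1)
    return math.log1p(-zero_prob) + poisson_log_pmf


# Probability of observing zero when 30% of zeros are structural.
print(math.exp(zip_log_pmf(0, rate=2.0, zero_prob=0.3)))
```

A zero-inflated negative binomial would follow the same pattern, with the Poisson term swapped for a negative binomial log-pmf.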
Curated list of Python resources for data science.
Updated Feb 4, 2022
Since the default output is meant to be human-readable, would it make sense to add thousands separators to make the output more easily readable?
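For readers unfamiliar with the idea, this is what thousands separators look like in practice. The snippet is a generic Python sketch, not the code of the tool the question refers to, and the helper name `with_thousands` is hypothetical.

```python
def with_thousands(n: int) -> str:
    """Format an integer with comma thousands separators."""
    # The "," option in the format spec inserts a comma every three digits.
    return f"{n:,}"


print(with_thousands(8902))
print(with_thousands(1234567))
```

Whether a comma or another separator is appropriate depends on the locale of the people reading the output.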
Alink is the Machine Learning algorithm platform based on Flink, developed by the PAI team of Alibaba computing platform.
Updated Jan 18, 2022 - Java
High-performance, easy-to-use, and scalable machine learning (ML) package, including linear models (LR), factorization machines (FM), and field-aware factorization machines (FFM), with Python and command-line interfaces.
Updated Jan 28, 2021 - C++
Java dataframe and visualization library
Updated Jan 21, 2022 - Java
Updated Feb 5, 2022 - JavaScript
Math.NET Numerics
Updated Feb 4, 2022 - C#
simple statistics for node & browser javascript
Updated Feb 1, 2022 - JavaScript
A Laravel package to retrieve pageviews and other data from Google Analytics
Updated Jan 19, 2022 - PHP
Index of Quant-related resources in China
Updated Feb 17, 2021


These examples take quite a long time to run, and they frequently cause our documentation CI to fail due to timeouts. It'd be nice to speed them up a little bit.
To contributors: if you want to work on an example, first have a look at the example, and if you think you're comfortable working on it and have found a potential way to speed up execution time while preserving the educational message