data-science
Here are 15,891 public repositories matching this topic...
Most functions in scipy.linalg functions (e.g. svd, qr, eig, eigh, pinv, pinv2 ...) have a default kwarg check_finite=True that we typically leave to the default value in scikit-learn.
As we already validate the input data for most estimators in scikit-learn, this check is redundant and can cause significant overhead, especially at predict / transform time. We should probably a
Screenshot
N/A
Description
Right now whenever users search for queries they are case sensitive. We should remove this to allow users to put in term with any cases
Design input
[describe any input/collaboration you'd like from designers, and
tag accordingly. For design review, add the
label design:review. If this includes a design proposal,
include the label `design:suggest
aka "Bayesian Methods for Hackers": An introduction to Bayesian methods + probabilistic programming with a computation/understanding-first, mathematics-second point of view. All in pure Python ;)
-
Updated
Oct 1, 2020 - Jupyter Notebook
Data science Python notebooks: Deep learning (TensorFlow, Theano, Caffe, Keras), scikit-learn, Kaggle, big data (Spark, Hadoop MapReduce, HDFS), matplotlib, pandas, NumPy, SciPy, Python essentials, AWS, and various command lines.
-
Updated
Oct 1, 2020 - Python
Machine Learning From Scratch. Bare bones NumPy implementations of machine learning models and algorithms with a focus on accessibility. Aims to cover everything from linear regression to deep learning.
-
Updated
Nov 7, 2020 - Python
-
Updated
Nov 27, 2020 - Python
-
Updated
Nov 27, 2020
Travis is not going to automatically offer the free tier for all open source projects; We likely want o migrate away from travis.
Setting up github actions to replace travis would be a welcomed contribution.
When I have a cluster already running, I sometimes want to re-run the setup commands even if there's no changes, without having to shut down the cluster. For example if I am installing a package via a Github repo, I want to trigger the install again after pushing new changes.
However, calling ray up doesn
In recent versions (can't say from exactly when), there seems to be an off-by-one error in dcc.DatePickerRange. I set max_date_allowed = datetime.today().date(), but in the calendar, yesterday is the maximum date allowed. I see it in my apps, and it is also present in the first example on the DatePickerRange documentation page.
E
Streamlit — The fastest way to build data apps in Python
-
Updated
Nov 28, 2020 - Python
Not a high-priority at all, but it'd be more sensible for such a tutorial/testing utility corpus to be implemented elsewhere - maybe under /test/ or some other data- or doc- related module – rather than in gensim.models.word2vec.
Originally posted by @gojomo in RaRe-Technologies/gensim#2939 (comment)
VIP cheatsheets for Stanford's CS 229 Machine Learning
-
Updated
May 20, 2020
A comprehensive list of pytorch related content on github,such as different models,implementations,helper libraries,tutorials etc.
-
Updated
Nov 12, 2020
The "Python Machine Learning (1st edition)" book code repository and info resource
-
Updated
Oct 16, 2020 - Jupyter Notebook
The fastai book, published as Jupyter Notebooks
-
Updated
Nov 28, 2020 - Jupyter Notebook
🚀 Refactoring
As isort has been added to ci in #4242, we now need to apply the formatter step by step i.e. a submodule per PR (recommended in PyTorchLightning/pytorch-lightning#4242 (comment) by @Borda)
Steps
For each PR:
- choose one submodule from below list and apply
isortto it - remove the corresponding line in
pyproject.toml - make
Dive into Machine Learning with Python Jupyter notebook and scikit-learn!
-
Updated
Jul 31, 2020
A curated list of awesome big data frameworks, ressources and other awesomeness.
-
Updated
Nov 17, 2020
Deep learning library featuring a higher-level API for TensorFlow.
-
Updated
Nov 24, 2020 - Python
more details at: allenai/allennlp#2264 (comment)
Best Practices on Recommendation Systems
-
Updated
Nov 27, 2020 - Python
What would you like to be added: As title
Why is this needed: All pruning schedule except AGPPruner only support level, L1, L2. While there are FPGM, APoZ, MeanActivation and Taylor, it would be much better if we can choose any pruner with any pruning schedule.
**Without this feature, how does current nni
Tutorials, assignments, and competitions for MIT Deep Learning related courses.
-
Updated
Oct 31, 2020 - Jupyter Notebook
Interactive deep learning book with code, math, and discussions. Available in multi-frameworks. Adopted at 140 universities.
-
Updated
Nov 28, 2020 - Python
Statistical data visualization using matplotlib
-
Updated
Nov 25, 2020 - Python
The options "Include Schema" and "Include Contents" in the SQL exporter dialog can be a bit mysterious for users.
Proposed solution
Just like concrete SQL commands are included in the UI text elsewhere in the dialog ("DROP"), we could expand these phrases to mention the corresponding SQL commands ("CREATE TABLE", "INSERT").
Alternatives considered
We could also drop the mention
A Python Automated Machine Learning tool that optimizes machine learning pipelines using genetic programming.
-
Updated
Nov 23, 2020 - Python
Improve this page
Add a description, image, and links to the data-science topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with the data-science topic, visit your repo's landing page and select "manage topics."


(e.g. for links and images), because some of these examples are now being rendered in the docs.
Added by @fchollet in requests for contributions.