100 Days of ML Coding
-
Updated
Sep 30, 2020
{{ message }}
scikit-learn is a widely-used Python module for classic machine learning. It is built on top of SciPy.
100 Days of ML Coding
AiLearning: 机器学习 - MachineLearning - ML、深度学习 - DeepLearning - DL、自然语言处理 NLP
Python Data Science Handbook: full text in Jupyter Notebooks
A series of Jupyter notebooks that walk you through the fundamentals of Machine Learning and Deep Learning in python using Scikit-Learn and TensorFlow.
Data science Python notebooks: Deep learning (TensorFlow, Theano, Caffe, Keras), scikit-learn, Kaggle, big data (Spark, Hadoop MapReduce, HDFS), matplotlib, pandas, NumPy, SciPy, Python essentials, AWS, and various command lines.
Your new Mentor for Data Science E-Learning.
The "Python Machine Learning (1st edition)" book code repository and info resource
Dive into Machine Learning with Python Jupyter notebook and scikit-learn!
A Python Automated Machine Learning tool that optimizes machine learning pipelines using genetic programming.
Open Machine Learning Course
The "Python Machine Learning (2nd edition)" book code repository and info resource
For example, if there is a relationship transaction.session_id -> sessions.id and we are calculating a feature transactions: sessions.SUM(transactions.value) any rows for which there is no corresponding session should be given the default value of 0 instead of NaN.
Of course this should not normally occur, but when it does it seems more reasonable to use the default_value.
`DirectF
with the Power Transformer.
My blogs and code for machine learning. http://cnblogs.com/pinard
PipelineAI Kubeflow Distribution
I see the code
device = ‘cuda’ if torch.cuda.is_available() else ‘cpu’
repeated often in user code. Maybe we should introduce device='auto' exactly for this case?
ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator
Yes
Jupyter notebooks from the scikit-learn video series
Visual analysis and diagnostic tools to facilitate machine learning model selection.
How do i resume training for text classification?
A comprehensive list of Deep Learning / Artificial Intelligence and Machine Learning tutorials - rapidly expanding into areas of AI/Deep Learning / Machine Vision / NLP and industry specific areas such as Climate / Energy, Automotives, Retail, Pharma, Medicine, Healthcare, Policy, Ethics and more.
for reproducibility, all classifiers should be seeded by random_state. In our experiments, the random_state is set to the resampleID. This is not correctly set in all the considered classifiers. The current list to check is
all_classifiers = [
"BOSSEnsemble", "TemporalDictionaryEnsemble", "MUSE", "WEASEL",
"ProximityForest", "ElasticEnsemble", "KNeighborsTimeSeriesClassifier",
#"Shap
https://igel.readthedocs.io/en/latest/_sources/readme.rst.txt includes a link to the assets/igel-help.gif, but that path is broken on readthedocs.
readme.rst is included as ../readme.rst in the sphinx build.
The gifs are in asses/igel-help.gif
The sphinx build needs to point to the asset directory, absolutely:
.. image:: /assets/igel-help.gif
I haven't made a patch, because I haven't
A library for debugging/inspecting machine learning classifiers and explaining their predictions
I think it could be useful, when one wants to plot only e.g. class 1, to have an option to produce consistent plots for both plot_cumulative_gain and plot_roc
At the moment, instead, only plot_roc supports such option.
Thanks a lot
Created by David Cournapeau
Released January 05, 2010
Latest release 3 months ago
Bug Report
These tests were run on s390x. s390x is big-endian architecture.
Failure log for helper_test.py