pydata
Here are 83 public repositories matching this topic...
Extract data from a wide range of Internet sources into a pandas DataFrame.
-
Updated
Nov 18, 2020 - Python
In our API docs we currently use
.. autosummary::
Client
Client.call_stack
Client.cancel
...
To generate a table of Client methods at the top of the page. Later on we use
.. autoclass:: Client
:members:
to display the docstrings for all the public methods on Client (here an example for
Instructions for how to install pyjanitor via pipenv
Some folks might use pipenv for environment management. The recent update requires a prerelease dependency (black, as menti
PyData, The Complete Works of
-
Updated
Dec 27, 2016
Problem description
Our dask update graphs are not properly optimized.
We ussually use dask.dataframe optimization and set ave_width=repartition_ratio for kartothek.io.dask.dataframe.update_dataset_from_ddf graphs. We should return an optimized graph from update_dataset_from_ddf to make our users' life simple.
We already have code that does this, whoever picks this up can ping me
Notebooks for the Seattle PyData 2017 talk on Scattertext
-
Updated
Jan 12, 2018 - HTML
Machine learning with scikit-learn tutorial at PyData Chicago 2016
-
Updated
Oct 18, 2016 - Jupyter Notebook
In trying to write tests for #189, I'm finding very difficult to add columns to existing tests, as in some cases like the all_types table, the table is defined in a separate file than the tests and multiple tests try to write to the same table.
Additionally, our test suite doesn't prove that the data that are uploaded are the same as the data downloaded for all types.
We should consider m
vtreat is a data frame processor/conditioner that prepares real-world data for predictive modeling in a statistically sound manner. Distributed under a BSD-3-Clause license.
-
Updated
Oct 17, 2020 - Python
Repo for my talk at the PyData Berlin 2017 conference
-
Updated
Jul 30, 2017 - Jupyter Notebook
Introduction to Machine Learning with Time Series at PyData Festival Amsterdam 2020
-
Updated
Nov 5, 2020 - Jupyter Notebook
Companion repo to the GitBook, notes on Data Science topics
-
Updated
Nov 11, 2020 - Jupyter Notebook
Social network analyses code examples for PyCon 2019 talk
-
Updated
Jun 4, 2020 - Jupyter Notebook
Slides and notebooks for my tutorial at PyData London 2018
-
Updated
Jul 2, 2018 - Jupyter Notebook
This is the code and presentation for my PyData2017 talk "Reverse Image Search Using Out-of-the-box Machine Learning Libraries
-
Updated
Aug 14, 2018 - HTML
The assert_array_shape call is used a lot in REGENIE, and serves both as a runtime check and as documentation for the reader. We should sprinkle it liberally through other functions.
@matthewbrems and I presented "Recreating, Understanding, and Visualizing FiveThirtyEight's Elections Forecast" at PyData DC 2018
-
Updated
Nov 16, 2018 - Jupyter Notebook
Pydata 2017 workshop: build a clickbait detector with python
-
Updated
Sep 14, 2017 - Jupyter Notebook
Material for working alongside my workshop session at PyData Berlin 2018
-
Updated
Sep 24, 2018 - Shell
Implementation of Neural Image Captioning model using Keras with Theano backend
-
Updated
Sep 2, 2017 - Jupyter Notebook
An example of how the LIME algorithm can be used to provide real-world insight into the decision processes of a 'black-box' machine learning algorithm - in this case a Radom Forest regressor.
-
Updated
Feb 19, 2019 - Jupyter Notebook
Battle-hardened advice on efficient data loading for deep learning on videos.
-
Updated
Jul 9, 2018 - Python
Improve this page
Add a description, image, and links to the pydata topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with the pydata topic, visit your repo's landing page and select "manage topics."

