Create HTML profiling reports from pandas DataFrame objects
Updated Oct 27, 2020 - Jupyter Notebook
Visualize and compare datasets, target values and associations, with one line of code.
Data Explorer by Keen - point-and-click interface for analyzing and visualizing event data.
Feature exploration for supervised learning
DataPrep: Data Preparation in Python
Automate Data Exploration and Treatment
edaviz - Python library for Exploratory Data Analysis and Visualization in Jupyter Notebook or Jupyter Lab
Automatic insights discovery and visualization for data analysis.
R package that makes basic data exploration radically simple (interactive data exploration, reproducible data science)
Multidimensional data explorer and visualization tool.
A collection of Jupyter notebooks exploring different datasets.
Enjoy your transcriptomic data and analysis responsibly - like sipping a cocktail
Light and fast implementation of web pivot table / pivot chart components.
Light, personalized, interactive dashboards for urban data exploration.
bamboolib - template for creating your own binder notebook
A User-Friendly Application for Exploratory Factor Analysis
A simple wrapper for SQL connections using SQLAlchemy and Pandas read_sql to standardize SQL workflow with multiple data sources.
Applying Machine Learning Algorithms to the Kaggle "Titanic Survival Prediction Problem".
Notebooks extension for Neptune, lightweight data science collaboration hub
Understand the relationships between various features and the sale price of a house using exploratory data analysis and statistical analysis. Applied ML algorithms such as Multiple Linear Regression, Ridge Regression and Lasso Regression in combination with cross-validation. Performed parameter tuning, compared the test scores and suggested a best model to predict the final sale price of a house. Seaborn is used to plot graphs and the scikit-learn package is used for statistical analysis.
Slides, code and more for my class: Data Analytics and Machine Learning on Big Data
REFACTORING...
CodersMUSE is a prototype implementation to explore multi-modal data of program-comprehension experiments.
Prototype implementation for the research project VisTiles, in which multiple mobile devices are combined for visual data exploration.
Exploring the NTSB Aviation Accident Database
Automated exploration of files in a folder structure to extract metadata and potential usage of information.
KeepCoding Bootcamp Big Data & Machine Learning - Final Project, Team Outliers - Beating the Stock Market
Currently all of the metrics computed are independent of a target variable or column, but if lens.summarise took the name of a column as the target variable, the output of some metrics could be more interpretable even if the target variable is not used in any kind of predictive modelling. A good example of this could be PCA (see #14), which could plot the different categories of the target va