spark
Here are 4,918 public repositories matching this topic...
Data science Python notebooks: Deep learning (TensorFlow, Theano, Caffe, Keras), scikit-learn, Kaggle, big data (Spark, Hadoop MapReduce, HDFS), matplotlib, pandas, NumPy, SciPy, Python essentials, AWS, and various command lines.
Updated Jul 24, 2020 - Python
Learn and understand Docker technologies, with real DevOps practice!
Updated Jul 30, 2020 - Go
Make Your Company Data Driven. Connect to any data source, easily visualize, dashboard and share your data.
Updated Aug 1, 2020 - JavaScript
Eclipse Deeplearning4j, ND4J, DataVec and more - deep learning & linear algebra for Java/Scala with GPUs + Spark
Updated Jul 31, 2020 - Java
Distributed training framework for TensorFlow, Keras, PyTorch, and Apache MXNet.
Updated Jul 31, 2020 - Python
Updated Aug 1, 2020 - JavaScript
List of Data Science Cheatsheets to rule the world
Updated Oct 31, 2019
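Flink learning blog. http://www.54tianzhisheng.cn Covers Flink basics, concepts, principles, hands-on practice, performance tuning, and source code analysis. Includes learning examples for Flink Connector, Metrics, Library, DataStream API, and Table API & SQL, plus case studies of large production Flink projects (PV/UV, log storage, real-time deduplication of tens of billions of records, monitoring and alerting). You are welcome to support my column "Big Data Real-Time Computing Engine: Flink in Practice and Performance Optimization" (《大数据实时计算引擎 Flink 实战与性能优化》).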
Updated Jul 4, 2020 - Java
Open-source IoT Platform - Device management, data collection, processing and visualization.
Updated Aug 1, 2020 - Java
A Flexible and Powerful Parameter Server for large-scale machine learning
Updated Jul 31, 2020 - Java
macOS development environment setup: Easy-to-understand instructions with automated setup scripts for developer tools like Vim, Sublime Text, Bash, iTerm, Python data analysis, Spark, Hadoop MapReduce, AWS, Heroku, JavaScript web development, Android development, common data stores, and dev-based OS X defaults.
Updated Jun 20, 2020 - Python
Open Source Fast Scalable Machine Learning Platform For Smarter Applications: Deep Learning, Gradient Boosting & XGBoost, Random Forest, Generalized Linear Modeling (Logistic Regression, Elastic Net), K-Means, PCA, Stacked Ensembles, Automatic Machine Learning (AutoML), etc.
Updated Aug 1, 2020 - Jupyter Notebook
Alluxio, data orchestration for analytics and machine learning in the cloud
Updated Aug 1, 2020 - Java
PipelineAI Kubeflow Distribution
Updated Apr 24, 2020 - Jsonnet
BigDL: Distributed Deep Learning Framework for Apache Spark
Updated Jul 30, 2020 - Scala
TensorFlowOnSpark brings TensorFlow programs to Apache Spark clusters.
Updated Jul 27, 2020 - Python
Coolplay Spark (酷玩 Spark): Spark source code analysis, Spark libraries, and more
Updated May 26, 2019 - Scala
Interactive and Reactive Data Science using Scala and Spark.
Updated Jun 2, 2020 - JavaScript
The Hunting ELK
Updated Jul 27, 2020 - Jupyter Notebook
Microsoft Machine Learning for Apache Spark
Updated Jul 29, 2020 - Scala
TransmogrifAI (pronounced trăns-mŏgˈrə-fī) is an AutoML library for building modular, reusable, strongly typed machine learning workflows on Apache Spark with minimal hand-tuning
Updated Jul 30, 2020 - Scala

