An open-source big data platform designed and optimized for the Internet of Things (IoT).
-
Updated
Aug 12, 2020 - C
An open-source big data platform designed and optimized for the Internet of Things (IoT).
A curated list of awesome big data frameworks, ressources and other awesomeness.
Out-of-Core DataFrames for Python, ML, visualize and explore big tabular data at a billion rows per second
An easy-to-use BI server built for SQL lovers. Power data analysis in SQL and gain faster business insights.
.NET for Apache® Spark™ makes Apache Spark™ easily accessible to .NET developers.
Distributed Big Data Orchestration Service
Upserts, Deletes And Incremental Processing on Big Data.
GridDB is a next-generation open source database that makes time series IoT and big data fast,and easy.
Apache Spark & Python (pySpark) tutorials for Big Data Analysis and Machine Learning as IPython / Jupyter notebooks
The Programming Language Designed For Big Data and AI
A Kubernetes Native Batch System (Project under CNCF)
C# and F# language binding and extensions to Apache Spark
Google, Naver multiprocess image web crawler (Selenium)
Lightweight real-time big data streaming engine over Akka
An on-line movie recommender using Spark, Python Flask, and the MovieLens dataset
A batch scheduler of kubernetes for high performance workload, e.g. AI/ML, BigData, HPC
Detect threats with log data and improve cloud security posture
学习记录的一些笔记,以及所看得一些电子书eBooks、视频资源和平常收纳的一些自己认为比较好的博客、网站、工具。涉及大数据几大组件、Python机器学习和数据分析、Linux、操作系统、算法、网络等
A book about running Elasticsearch
Fast topic modeling platform
Add a description, image, and links to the bigdata topic page so that developers can more easily learn about it.
To associate your repository with the bigdata topic, visit your repo's landing page and select "manage topics."