bigdata
Here are 1,289 public repositories matching this topic...
A curated list of awesome big data frameworks, ressources and other awesomeness.
-
Updated
Aug 24, 2020
Hello,
Considering your amazing efficiency on pandas, numpy, and more, it would seem to make sense for your module to work with even bigger data, such as Audio (for example .mp3 and .wav). This is something that would help a lot considering the nature audio (ie. where one of the lowest and most common sampling rates is still 44,100 samples/sec). For a use case, I would consider vaex.open('Hu
An easy-to-use BI server built for SQL lovers. Power data analysis in SQL and gain faster business insights.
-
Updated
May 31, 2020 - Java
APIs
SparkSession
-
pythondef getActiveSession(cls) -
scaladef executeCommand(runner: String, command: String, options: Map[String, String]): DataFrame
DataFrame
-
pythondef transform(self, func) -
pythondef tail(self, num)
scaladef tail(n: Int): Array[T] -
scaladef printSchema(level: Int): Unit -
scaladef explain(mode: String): U
Distributed Big Data Orchestration Service
-
Updated
Aug 24, 2020 - Java
Upserts, Deletes And Incremental Processing on Big Data.
-
Updated
Aug 25, 2020 - Java
GridDB is a next-generation open source database that makes time series IoT and big data fast,and easy.
-
Updated
Aug 23, 2020 - C++
Apache Spark & Python (pySpark) tutorials for Big Data Analysis and Machine Learning as IPython / Jupyter notebooks
-
Updated
Sep 6, 2017 - Jupyter Notebook
Is this a BUG REPORT or FEATURE REQUEST?:
/kind feature
/help wanted
Description:
Currently, we publish helm chart at https://github.com/volcano-sh/charts and keep helm charts up-to-date at
https://github.com/volcano-sh/volcano/tree/master/installer , and they already mismatched. It's better to make charts as submod of volcano to keep them sync.
The Programming Language Designed For Big Data and AI
-
Updated
Aug 25, 2020 - JavaScript
-
Updated
Aug 25, 2020 - Jupyter Notebook
C# and F# language binding and extensions to Apache Spark
-
Updated
Nov 1, 2019 - C#
Google, Naver multiprocess image web crawler (Selenium)
-
Updated
Aug 18, 2020 - Python
Lightweight real-time big data streaming engine over Akka
-
Updated
Aug 18, 2020 - Scala
An on-line movie recommender using Spark, Python Flask, and the MovieLens dataset
-
Updated
Jul 18, 2020 - Jupyter Notebook
A batch scheduler of kubernetes for high performance workload, e.g. AI/ML, BigData, HPC
-
Updated
Aug 12, 2020 - Go
Describe the ideal solution
We need a new endpoint that functions as getIntegrationById endpoint.
Describe your use cases
We currently fetching all integration via appsync (or more specifically a sub-category of integrations based on integrationType) and iterate until we find one that matches the integrationId passed.
How frequently would you use such feature
Although, we
学习记录的一些笔记,以及所看得一些电子书eBooks、视频资源和平常收纳的一些自己认为比较好的博客、网站、工具。涉及大数据几大组件、Python机器学习和数据分析、Linux、操作系统、算法、网络等
-
Updated
Apr 24, 2020 - Python
A book about running Elasticsearch
-
Updated
Jul 2, 2020
Fast topic modeling platform
-
Updated
Aug 22, 2020 - C++
Improve this page
Add a description, image, and links to the bigdata topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with the bigdata topic, visit your repo's landing page and select "manage topics."


必须先将通过STable创建的表先删除,然后才能删除超级表。当级联表很多时就麻烦了,除非写代码来删除。