apache / spark
Apache Spark - A unified analytics engine for large-scale data processing
{{ message }}
See what the GitHub community is most excited about today.
Apache Spark - A unified analytics engine for large-scale data processing
Open-source code analysis platform for C/C++/Java/Binary/Javascript/Python/Kotlin based on code property graphs. Discord https://discord.gg/vv4MH284Hc
Gluten is a middle layer responsible for offloading JVM-based SQL engines' execution to native engines.
Reference applications for funding, operating, and incentivizing the use of a decentralized, public Canton synchronizer. Includes the Amulet reference application for creating native payment utilities for Canton synchronizers and Daml applications.
A platform to build and run apps that are elastic, agile, and resilient. SDK, libraries, and hosted environments.
Scala 2 compiler and standard library. Scala 2 bugs at https://github.com/scala/bug; Scala 3 at https://github.com/scala/scala3
An Agile RISC-V SoC Design Framework with in-order cores, out-of-order cores, accelerators, and more
Arnold Schwarzenegger based programming language
Apache Kyuubi is a distributed and multi-tenant gateway to provide serverless SQL on data warehouses and lakehouses.
Deequ is a library built on top of Apache Spark for defining "unit tests for data", which measure data quality in large datasets.
Rocket Chip Generator
A next-generation Scala framework for building scalable, correct, and efficient HTTP clients and servers
Source code for the X Recommendation Algorithm
Open-source high-performance RISC-V processor
Berkeley's Spatial Array Generator
The Community Maintained High Velocity Web Framework For Java and Scala.
The Daml smart contract language
The Scala 3 compiler, also known as Dotty.
♞ lichess.org: the forever free, adless and open source chess server ♞