rtyley / bfg-repo-cleaner
Removes large or troublesome blobs like git-filter-branch does, but faster. And written in Scala
See what the GitHub community is most excited about today.
Chisel: A Modern Hardware Design Language
Rocket Chip Generator
Apache Spark - A unified analytics engine for large-scale data processing
Gluten: Plugin to Double SparkSQL's Performance
Deequ is a library built on top of Apache Spark for defining "unit tests for data", which measure data quality in large datasets.
Simple and Distributed Machine Learning
A next-generation Scala framework for building scalable, correct, and efficient HTTP clients and servers
TheHive: a Scalable, Open Source and Free Security Incident Response Platform
Compiler for the Vale programming language - http://vale.dev/
Sparkling Water provides H2O functionality inside Spark cluster
The repository for the free Scala at Light Speed mini-course
Apache Kyuubi is a distributed and multi-tenant gateway to provide serverless SQL on data warehouses and lakehouses.
Rudder is a configuration and security automation platform. Manage your Cloud, hybrid or on-premises infrastructure in a simple, scalable and dynamic way.
State of the Art Natural Language Processing
Scala 2 compiler and standard library. For bugs, see scala/bug
Cortex: a Powerful Observable Analysis and Active Response Engine
Apache Livy is an open source REST interface for interacting with Apache Spark from anywhere.
A Spark plugin for reading and writing Excel files
Monitor Kafka Consumer Group Latency with Kafka Lag Exporter
Apache Spark Connector for SQL Server and Azure SQL
Scientific workflow engine designed for simplicity & scalability. Trivially transition from one-off use cases to massive-scale production environments
All Rudder public plugins in one repository. Licenses are by-plugin.
Redshift data source for Apache Spark