Apache Spark
Apache Spark is an open-source, distributed, general-purpose cluster-computing framework. It provides an interface for programming entire clusters with implicit data parallelism and fault tolerance.
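A minimal sketch of Spark's dataflow model, expressed in plain Python so it runs without a cluster; the commented lines show the equivalent PySpark word-count pipeline (assuming a running SparkContext `sc`):

```python
from collections import Counter

# Equivalent PySpark pipeline (assumes a SparkContext `sc`):
#   sc.textFile("data.txt").flatMap(str.split) \
#     .map(lambda w: (w, 1)).reduceByKey(lambda a, b: a + b)

lines = ["to be or not", "to be"]
words = [w for line in lines for w in line.split()]  # flatMap
counts = Counter(words)                              # map + reduceByKey
print(counts["to"])  # prints 2
```

The same transformations run in parallel across partitions on a real cluster; the local version only illustrates the shape of the computation.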
Here are 6,778 public repositories matching this topic...
Data science Python notebooks: Deep learning (TensorFlow, Theano, Caffe, Keras), scikit-learn, Kaggle, big data (Spark, Hadoop MapReduce, HDFS), matplotlib, pandas, NumPy, SciPy, Python essentials, AWS, and various command lines.
Updated Apr 3, 2022 - Python
Make Your Company Data Driven. Connect to any data source, easily visualize, dashboard and share your data.
Updated Jul 7, 2022 - Python
Learn and understand Docker&Container technologies, with real DevOps practice!
Updated Jul 1, 2022 - Go
Programming e-books: C, C#, Docker, Elasticsearch, Git, Hadoop, Head First, Java, JavaScript, JVM, Kafka, Linux, Maven, MongoDB, MyBatis, MySQL, Netty, Nginx, Python, RabbitMQ, Redis, Scala, Solr, Spark, Spring, SpringBoot, SpringCloud, TCP/IP, Tomcat, Zookeeper, plus artificial intelligence, big data, concurrent programming, databases, data mining, interview questions, architecture design, algorithms, computer science, design patterns, software testing, refactoring and optimization, and more categories.
Updated May 18, 2022
Distributed training framework for TensorFlow, Keras, PyTorch, and Apache MXNet.
Updated Jul 7, 2022 - Python
At the moment the relu_layer op doesn't allow threshold configuration, while the legacy RELU op does. We should add a threshold configuration option to relu_layer.
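A plain-Python sketch of the requested behavior (the parameter name `threshold` is an assumption; the actual attribute name would be decided in the op definition):

```python
def relu_with_threshold(x, threshold=0.0):
    # Pass values strictly above the threshold, zero out the rest --
    # what a configurable relu_layer could compute elementwise.
    return [v if v > threshold else 0.0 for v in x]

print(relu_with_threshold([-1.0, 0.5, 2.0], threshold=1.0))  # [0.0, 0.0, 2.0]
```

With the default `threshold=0.0` this reduces to the standard ReLU.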
Flink learning blog. http://www.54tianzhisheng.cn/ Covers Flink fundamentals, concepts, internals, hands-on practice, performance tuning, and source-code analysis. Includes learning examples for Flink Connectors, Metrics, Libraries, the DataStream API, and the Table API & SQL, plus large real-world Flink project case studies (PV/UV, log storage, real-time deduplication at the scale of tens of billions of records, monitoring and alerting). Support for my column "Big Data Real-Time Computing Engine Flink: Practice and Performance Optimization" is welcome.
Updated Jun 21, 2022 - Java
List of Data Science Cheatsheets to rule the world
Updated Jun 9, 2022
A Flexible and Powerful Parameter Server for large-scale machine learning
Updated Jun 17, 2022 - Java
H2O is an Open Source, Distributed, Fast & Scalable Machine Learning Platform: Deep Learning, Gradient Boosting (GBM) & XGBoost, Random Forest, Generalized Linear Modeling (GLM with Elastic Net), K-Means, PCA, Generalized Additive Models (GAM), RuleFit, Support Vector Machine (SVM), Stacked Ensembles, Automatic Machine Learning (AutoML), etc.
Updated Jul 7, 2022 - Jupyter Notebook
macOS development environment setup: Easy-to-understand instructions with automated setup scripts for developer tools like Vim, Sublime Text, Bash, iTerm, Python data analysis, Spark, Hadoop MapReduce, AWS, Heroku, JavaScript web development, Android development, common data stores, and dev-based OS X defaults.
Updated Feb 8, 2022 - Python
Alluxio, data orchestration for analytics and machine learning in the cloud
Updated Jul 7, 2022 - Java
Feature request
Overview
Currently, the DELETE operation returns an empty result. It would be more useful if it returned the number of deleted rows.
Motivation
The number of deleted rows is an obvious metric that users would want from a delete operation.
Further details
Currently, DeleteCommand.scala is explicitly returning an empty DataFrame [here](https://g
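A hypothetical illustration of the requested behavior in plain Python (the names `delete_where` and `predicate` are made up for this sketch and are not Delta Lake API):

```python
def delete_where(rows, predicate):
    # Keep rows that do NOT match, and report how many were removed --
    # the count the feature request wants DELETE to return instead of
    # an empty result.
    kept = [r for r in rows if not predicate(r)]
    return kept, len(rows) - len(kept)

rows = [{"id": 1}, {"id": 2}, {"id": 3}]
kept, deleted = delete_where(rows, lambda r: r["id"] < 3)
print(deleted)  # prints 2
```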
PipelineAI Kubeflow Distribution
Updated Apr 24, 2020 - Jsonnet
Problem:
The current log outputs something like val_function_0 when it should output val_mean_squared_error_0.
Solution:
The expression "val/{}_{}".format(type(metric).__name__, i) uses the name of metric's type. Because what gets logged here is a function rather than the torchmetrics.metric.Metric instance itself, the type name is function, which is why the output looks like val_function_0. It should use the name of the metric instead.
TensorFlowOnSpark brings TensorFlow programs to Apache Spark clusters.
Updated Apr 21, 2022 - Python
I have a simple regression task (using a LightGBMRegressor) where I want to penalize negative predictions more than positive ones. Is there a way to achieve this with the default regression LightGBM objectives (see https://lightgbm.readthedocs.io/en/latest/Parameters.html)? If not, is it somehow possible to define (many example for default LightGBM model) and pass a custom regression objective?
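One route is a custom objective; here is a plain-Python sketch of an asymmetric squared error (the weight `alpha` and the exact callable signature `(y_true, y_pred) -> (grad, hess)` for the sklearn-style API are assumptions to verify against the LightGBM docs):

```python
def asymmetric_l2(y_true, y_pred, alpha=2.0):
    # Squared error whose gradient and hessian are scaled by `alpha`
    # whenever the prediction is negative, penalizing those harder.
    grad, hess = [], []
    for t, p in zip(y_true, y_pred):
        w = alpha if p < 0 else 1.0
        grad.append(2.0 * w * (p - t))
        hess.append(2.0 * w)
    return grad, hess

# A negative prediction gets a steeper gradient than a positive one
# at the same absolute error:
print(asymmetric_l2([1.0, 1.0], [-1.0, 3.0]))  # ([-8.0, 4.0], [4.0, 2.0])
```

Such a callable could then be passed as the model's objective in place of the built-in `regression` objective.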
Coolplay Spark: Spark source-code walkthroughs, Spark libraries, and more.
Updated May 18, 2022 - Scala
The Hunting ELK
Updated May 12, 2021 - Jupyter Notebook
Interactive and Reactive Data Science using Scala and Spark.
Updated Oct 19, 2021 - JavaScript
State of the Art Natural Language Processing
Updated Jul 7, 2022 - Scala
Used Spark version
Spark Version: 2.4.4
Used Spark Job Server version
SJS version: v0.11.1
Deployed mode
client on Spark Standalone
Actual (wrong) behavior
I can't get the config when posting a job with 'sync=true'. I get:
http://localhost:8090/jobs/ff99479b-e59c-4215-b17d-4058f8d97d25/config
{"status":"ERROR","result":"No such job ID ff99479b-e59c-4215-b17d-4058f8d97d25"
Created by Matei Zaharia
Released May 26, 2014
- Repository: apache/spark
- Website: spark.apache.org
- Wikipedia


Describe the bug
Using a time dimension on a runningTotal measure on Snowflake mixes quoted and unquoted columns in the generated query. The query fails because Snowflake has specific resolution rules for quoted columns: "date_from" <> date_from.
To Reproduce
Steps to reproduce