big-data

Is your feature request related to a problem? Please describe.
Many static type checkers have issues finding Cython's stubs.
Here is the output from running mypy on my current project:

error: Skipping analyzing "cython": found module but no type hints or library stubs

The same issue can be seen when using import Cython as cython:

error: Skipping analyzing "Cython": found module but

Problem:

_catboost.pyx in _catboost._set_features_order_data_pd_data_frame()

_catboost.pyx in _catboost.get_cat_factor_bytes_representation()

CatBoostError: Invalid type for cat_feature[non-default value idx=1,feature_idx=336]=2.0 : cat_features must be integer or string, real number values and NaN values should be converted to string.
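
The error message asks for real-valued and NaN entries to be converted to strings before they are passed as categorical features. A minimal sketch of that conversion in plain Python follows. The helper name `to_cat_string`, the `"nan"` placeholder, and the choice to render `2.0` as `"2"` are assumptions made for illustration, not part of CatBoost.

```python
import math


def to_cat_string(value):
    """Return a string form of `value` suitable for a categorical column.

    Hypothetical helper: the name and the formatting rules are assumptions.
    """
    if isinstance(value, float):
        if math.isnan(value):
            # Assumption: represent missing values with a fixed placeholder.
            return "nan"
        if value.is_integer():
            # Assumption: 2.0 and 2 should land in the same category.
            return str(int(value))
    # Integers, strings, and non-integral floats are stringified as they are.
    return str(value)


raw_column = [1, 2.0, float("nan"), "a", 3.5]
clean_column = [to_cat_string(v) for v in raw_column]
```

Converting every entry to a string keeps the column homogeneous; whether whole-number floats should collapse to their integer form depends on how the data was produced.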

Could you also print a feature name, not o

We can just disable pushdown on json columns in the MongoDB connector. TestMongoTypeMapping needs a new test method for the json type when this issue is fixed.

CREATE TABLE test (c1 json);
INSERT INTO test VALUES (json '{"id":0,"name":"user_0"}');

SELECT * FROM test WHERE c1 = json '{"id":0,"name":"user_0"}';
java.lang.UnsupportedOperationException
	at io.trino.spi.predicate.ValueSet.

What would you like to happen?

Currently, transforms that return a TimestampedValue must be typed as plain "TimestampedValue" rather than the generic "TimestampedValue[T]", so all information about the wrapped type is lost.
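
To illustrate what a generic annotation would preserve, here is a minimal sketch using `typing.Generic`. The `TimestampedValue` class below is a hypothetical stand-in written for this example, not the SDK's own class, and `stamp_length` is likewise an invented function.

```python
from dataclasses import dataclass
from typing import Generic, TypeVar

T = TypeVar("T")


@dataclass(frozen=True)
class TimestampedValue(Generic[T]):
    """Hypothetical stand-in: a value of type T paired with a timestamp."""

    value: T
    timestamp: float


def stamp_length(word: str, timestamp: float) -> TimestampedValue[int]:
    # The return annotation tells a type checker that `.value` is an int.
    return TimestampedValue(len(word), timestamp)


stamped = stamp_length("beam", 12.5)
```

With the parameterized return type, a static checker can infer the type of `stamped.value` instead of treating it as unknown.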

Issue Priority

Priority: 3

Issue Component

Component: sdk-py-core

Feature request

Overview

Currently, the MERGE command returns an empty result. It would be more useful if it returned:

number of affected rows (Long)
number of updated rows (Long)
number of deleted rows (Long)
number of inserted rows (Long)

Motivation

These are obvious metrics that users would expect from this operation.

Further details

Implementation

Search before asking

I have searched the issues and found no similar issues.

Description

As title.

In the BE, there are many uses of Substitute and stringstream; we can replace them with fmt/format.

Solution

No response

Are you willing to submit PR?

Yes I am willing to submit a PR

Describe the bug
For a non-empty MultiMap, I always see 0 B for Entry memory and Backup memory.

For Map, I get non-zero values.
Here's the MultiMap config:

<multimap name="default">
    <backup-count>1</backup-count>
    <async-backup-count>0</async-backu

... to make it easier to read Vespa documentation on an e-reader / offline

Vespa documentation is generated using Jekyll from .md and .html files. Look into options for generating the artifact as part of site generation (there might be plugins we can use here).

Is your feature request related to a problem? Please describe.
When creating a SQLite online store, your only option is to create it on the filesystem. Because every access has to hit the filesystem, this slows down the online store.

Describe the solution you'd like
I'd like a :memory: option to use an in-memory SQLite store instead, e.g. in feature_store.yaml:

online
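
For context, SQLite itself already supports this: passing the special name `:memory:` as the database path creates a database that lives only in RAM. A minimal sketch with Python's standard `sqlite3` module follows. The table and column names are hypothetical and do not reflect the schema the feature store actually uses.

```python
import sqlite3

# ":memory:" asks SQLite for a database held in RAM, not on the filesystem.
conn = sqlite3.connect(":memory:")

# Hypothetical table layout, for illustration only.
conn.execute("CREATE TABLE features (entity_key TEXT PRIMARY KEY, value REAL)")
conn.execute("INSERT INTO features VALUES (?, ?)", ("driver_1", 0.75))

row = conn.execute(
    "SELECT value FROM features WHERE entity_key = ?", ("driver_1",)
).fetchone()

# The in-memory database is discarded once its connection is closed.
conn.close()
```

Because the data disappears when the connection closes, an in-memory store of this kind would suit tests and short-lived processes more than anything that needs persistence.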

Here are 3,084 public repositories matching this topic...

binhnguyennus / awesome-scalability

apache / spark

ClickHouse / ClickHouse

donnemartin / data-science-ipython-notebooks

apache / flink

amark / gun

prestodb / presto

apache / predictionio

heibaiying / BigData-Notes

yahoo / CMAK

andkret / Cookbook

cython / cython

catboost / catboost

apache / storm

h2oai / h2o-3

trinodb / trino

apache / zeppelin

apache / beam

pachyderm / pachyderm

apache / couchdb

arkime / arkime

delta-io / delta

apache / doris

hazelcast / hazelcast

tschellenbach / Stream-Framework

apache / hive

apache / ignite

vespa-engine / vespa

tangbc / vue-virtual-scroll-list

feast-dev / feast
