big-data

Now insert and query share the resource ( Max Process Count control) 。 When the query with high TPS，the insert will get error (“error: too many process”). I think separator the resource for Insert and Query will makes sense. Ensure enough resource for insert。It looks like Use Yarn， Insert and Query use the different resource quota。
Or the simple way , Can we set Ratio for Insert and

Something that could help the documentation would be a glossary that certain terms could link to. A pointer would be a good candidate, or C data structures (struct/union) in general. Also extension type, terms like extern or inline, etc. It wouldn't have to replicate a complete specification o

Problem:
catboost version: 0.23.2
Operating System: all
Tutorial: https://github.com/catboost/tutorials/blob/master/custom_loss/custom_metric_tutorial.md

Impossible to use custom metric (С++).

Code example

from catboost import CatBoost
train_data = [[1, 4, 5, 6],

Hello. I would like to ask for suggestions for additional features for development.

On the Sessions tab,

Please develop a function that can addon to frequently used search queries.

Today IMap.values() and IMap.values(Predicate) calls are blocking.

I would like to use IMap.values(Predicate) in a Jet Pipeline, which is possible, but I need to declare it as nonCooperative, and will have an impact on the pipeline scalability.

Would it be possible to have an async (non-blocking) version for these calls ?

Thank you very much for all the hard work done !

PrestoDB https://prestodb.io .. is widely used as SQL frontend for many different data-sources, including ElasticSearch, and even files in S3 .. would be very nice if there would be a Connector available for Vespa.

Hi, if my spark app is using 2 storage type, both S3 and Azure Data Lake Store Gen2, could I put spark.delta.logStore.class=org.apache.spark.sql.delta.storage.AzureLogStore, org.apache.spark.sql.delta.storage.S3SingleDriverLogStore

Thanks in advance

Aug	SEP	Oct
	20
2019	2020	2021

big-data

Here are 2,188 public repositories matching this topic...

apache / spark

binhnguyennus / awesome-scalability

donnemartin / data-science-ipython-notebooks

explosion / spaCy

apache / flink

apache / predictionio

ClickHouse / ClickHouse

amark / gun

prestodb / presto

yahoo / CMAK

heibaiying / BigData-Notes

apache / storm

cython / cython

catboost / catboost

h2oai / h2o-3

apache / zeppelin

apache / couchdb

pachyderm / pachyderm

aol / moloch

tschellenbach / Stream-Framework

apache / beam

hazelcast / hazelcast

intel-analytics / BigDL

apache / ignite

apache / hive

vespa-engine / vespa

delta-io / delta

TuiQiao / CBoard

jostmey / NakedTensor

databricks / koalas

Improve this page

Add this topic to your repo