gpu

There are some common misuse patterns in TorchScript for which we should issue clear error messages, instead of a generic error that doesn't capture the root cause.

Here are a few examples:

Attempting to construct an nn.Module inside TorchScript. This currently errors out because TorchScript attempts to compile the __init__() method of the module, which usually contains a call

At the moment the relu_layer op doesn't allow threshold configuration, whereas the legacy RELU op does.
We should add a configuration option to relu_layer.

Problem: the approximate method can still be slow for many trees
catboost version: master
Operating System: ubuntu 18.04
CPU: i9
GPU: RTX2080

It would be good to be able to specify how many trees to use for Shapley values. The model.predict and prediction_type versions allow this, as do lgbm/xgb.

Hi,

I have tried both loss.backward() and model_engine.backward(loss) in my code and have observed several subtle differences. For one, retain_graph=True does not work with model_engine.backward(loss). This is a problem because, for some reason, buffers are not retained each time I run the code.

Please look into this if you could.

Our users are often confused by the output from programs such as zip2john sometimes being very large (multi-gigabyte). Maybe we should identify and enhance these programs to output a message to stderr to explain to users that it's normal for the output to be very large - maybe always or maybe only when the output size is above a threshold (e.g., 1 million bytes?)
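
The threshold variant of this idea can be sketched as follows. This is a minimal illustration in Python rather than the C used by tools such as zip2john; the function name `warn_if_large`, the message wording, and the constant name are hypothetical, and the 1,000,000-byte default is only the figure floated above.

```python
import sys

# Hypothetical default, taken from the "1 million bytes?" suggestion above.
SIZE_WARNING_THRESHOLD = 1_000_000


def warn_if_large(output_size, threshold=SIZE_WARNING_THRESHOLD, stream=None):
    """Print a reassurance note when output_size exceeds threshold.

    Returns True if the note was printed, False otherwise.
    """
    # Resolve stderr at call time so the data on stdout stays untouched.
    stream = sys.stderr if stream is None else stream
    if output_size > threshold:
        print(
            f"note: output is {output_size} bytes; "
            "very large output is normal for this tool",
            file=stream,
        )
        return True
    return False
```

Writing the note to stderr keeps it out of the data a user redirects to a file. Passing `threshold=-1` would approximate the "always" variant for any non-negative size.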

https://github.com/brendangregg/FlameGraph

Describe the bug
Clipping a DataFrame or Series using ints causes a cudf Failure because it won't handle the different dtypes (int and float)

Steps/Code to reproduce bug

import cudf

data = cudf.Series([-0.43, 0.1234, 1.5, -1.31])
data.clip(0, 1)  # int bounds on a float Series trigger the failure

...
  File "cudf/_lib/replace.pyx", line 216, in cudf._lib.replace.clip
  File "cudf/_lib/replace.pyx", line 198, in cudf._lib.replace.clamp
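
One way to think about a fix, assuming the failure comes only from the int bounds not matching the float data, is to coerce both bounds to the data's element type before clamping. The sketch below shows that idea on a plain Python list; `clip_with_matching_type` is a hypothetical helper and not part of cudf.

```python
def clip_with_matching_type(values, lower, upper):
    """Clamp values to [lower, upper] after coercing the bounds
    to the element type of values (assumed to be homogeneous)."""
    if not values:
        return []
    elem_type = type(values[0])
    # Coerce the bounds so comparisons never mix int and float.
    lo, hi = elem_type(lower), elem_type(upper)
    return [min(max(v, lo), hi) for v in values]


data = [-0.43, 0.1234, 1.5, -1.31]
clipped = clip_with_matching_type(data, 0, 1)
```

In cudf itself, passing float bounds such as data.clip(0.0, 1.0) may sidestep the mismatch, though that has not been verified here.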

Describe the Problem

plot_model currently has a save argument that can be used to save the plots. It does not let you choose where the plot is saved or under what name; right now it saves the plot with a predefined name in the current working directory.

Describe the solution you'd like

We can have another argument save_path which is used whenever the `

The current implementation of join can be improved by performing the operation in a single call to the backend kernel instead of multiple calls.

This is a fairly easy kernel and may be a good issue for someone getting to know CUDA/ArrayFire internals. Ping me if you want additional info.

PR NVIDIA/cub#218 fixes this in CUB's radix sort. We should:

Check whether Thrust's other backends handle this case correctly.
Provide a guarantee of this in the stable_sort documentation.
Add regression tests to enforce this on all backends.


Here are 2,243 public repositories matching this topic...

pytorch / pytorch

alacritty / alacritty

fastai / fastai

NVIDIA / nvidia-docker

gpujs / gpu.js

eclipse / deeplearning4j

PavelDoGreat / WebGL-Fluid-Simulation

apache / tvm

OlafenwaMoses / ImageAI

catboost / catboost

chainer / chainer

h2oai / h2o-3

MVIG-SJTU / AlphaPose

cupy / cupy

microsoft / DeepSpeed

openwall / john

gfx-rs / gfx

intel-isl / Open3D

exelban / stats

halide / Halide

PipelineAI / pipeline

plasma-umass / scalene

NVIDIA / DIGITS

rapidsai / cudf

pycaret / pycaret

arrayfire / arrayfire

ultralight-ux / Ultralight

NVIDIA / thrust

NVIDIA / DALI

Syllo / nvtop
