gpu
Here are 2,243 public repositories matching this topic...
The fastai deep learning library
-
Updated
Jun 21, 2021 - Jupyter Notebook
Build and run Docker containers leveraging NVIDIA GPUs
-
Updated
Jun 14, 2021 - Makefile
At this moment relu_layer op doesn't allow threshold configuration, and legacy RELU op allows that.
We should add configuration option to relu_layer.
Play with fluids in your browser (works even on mobile)
-
Updated
Jun 9, 2021 - JavaScript
Open deep learning compiler stack for cpu, gpu and specialized accelerators
-
Updated
Jun 22, 2021 - Python
A python library built to empower developers to build applications and systems with self-contained Computer Vision capabilities
-
Updated
Jun 14, 2021 - Python
Problem: the approximate method can still be slow for many trees
catboost version: master
Operating System: ubuntu 18.04
CPU: i9
GPU: RTX2080
Would be good to be able to specify how many trees to use for shapley. The model.predict and prediction_type versions allow this. lgbm/xgb allow this.
A flexible framework of neural networks for deep learning
-
Updated
Jun 10, 2021 - Python
H2O is an Open Source, Distributed, Fast & Scalable Machine Learning Platform: Deep Learning, Gradient Boosting (GBM) & XGBoost, Random Forest, Generalized Linear Modeling (GLM with Elastic Net), K-Means, PCA, Generalized Additive Models (GAM), RuleFit, Support Vector Machine (SVM), Stacked Ensembles, Automatic Machine Learning (AutoML), etc.
-
Updated
Jun 22, 2021 - Jupyter Notebook
Real-Time and Accurate Full-Body Multi-Person Pose Estimation&Tracking System
-
Updated
Jun 16, 2021 - Python
Hi ,
I have tried out both loss.backward() and model_engine.backward(loss) for my code. There are several subtle differences that I have observed , for one retain_graph = True does not work for model_engine.backward(loss) . This is creating a problem since buffers are not being retained every time I run the code for some reason.
Please look into this if you could.
Our users are often confused by the output from programs such as zip2john sometimes being very large (multi-gigabyte). Maybe we should identify and enhance these programs to output a message to stderr to explain to users that it's normal for the output to be very large - maybe always or maybe only when the output size is above a threshold (e.g., 1 million bytes?)
Open3D: A Modern Library for 3D Data Processing
-
Updated
Jun 22, 2021 - C++
a language for fast, portable data-parallel computation
-
Updated
Jun 22, 2021 - C++
PipelineAI Kubeflow Distribution
-
Updated
Apr 24, 2020 - Jsonnet
Deep Learning GPU Training System
-
Updated
Jun 13, 2020 - HTML
Describe the bug
Clipping a DataFrame or Series using ints causes a cudf Failure because it won't handle the different dtypes (int and float)
Steps/Code to reproduce bug
data = cudf.Series([-0.43, 0.1234, 1.5, -1.31])
data.clip(0, 1)
...
File "cudf/_lib/replace.pyx", line 216, in cudf._lib.replace.clip
File "cudf/_lib/replace.pyx", line 198, in cudf._lib.replace.clamp
Describe the Problem
plot_model currently has the save argument which can be used to save the plots. It does not provide the functionality to decide where to save the plot and with what name. Right now it saves the plot with predefined names in the current working directory.
Describe the solution you'd like
We can have another argument save_path which is used whenever the `
Current implementation of join can be improved by performing the operation in a single call to the backend kernel instead of multiple calls.
This is a fairly easy kernel and may be a good issue for someone getting to know CUDA/ArrayFire internals. Ping me if you want additional info.
Next-generation HTML renderer for apps and games
-
Updated
Mar 11, 2021 - CMake
PR NVIDIA/cub#218 fixes this CUB's radix sort. We should:
- Check whether Thrust's other backends handle this case correctly.
- Provide a guarantee of this in the stable_sort documentation.
- Add regression tests to enforce this on all backends.
A GPU-accelerated library containing highly optimized building blocks and an execution engine for data processing to accelerate deep learning training and inference applications.
-
Updated
Jun 22, 2021 - C++
Improve this page
Add a description, image, and links to the gpu topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with the gpu topic, visit your repo's landing page and select "manage topics."


There are some common misuse patterns in TorchScript that we should issue clear error messages for instead of generating generic error that doesn't capture root cause of error.
Here are a few examples:
nn.Moduleinside TorchScript. This currently errors out because TorchScript would attempt to compile__init__()method of module, which usually contains a call