cuda
Here are 2,324 public repositories matching this topic...
See this thread
https://groups.google.com/forum/?utm_medium=email&utm_source=footer#!msg/kaldi-help/1AMGQ54tn0E/kacpkoizAAAJ
for the problem-- for CUDA 10.1 the rpath we add for CUDA is not sufficient because they put the CUBLAS library somewhere else. Making the issue so we don't lose track of it.
This is follow up of #1969 intended as a discussion on how this rule should behave.
It would be great if @jfoug and/or @magnumripper would join so we can battle it out and make it consistent between hashcat and john.
As I mentioned in the original report I assumed that given the input word foobar and the rule ef the mangled output should be FOobar. This is based on my first thought when
The CUDA datetime tests (numba.cuda.tests.cudapy.test_datetime) only test with datetime[D] - testing with other units would be a good idea.
I also can't find any reference to datetimes being supported on CUDA in the documentation - some explicit mention that they are supported would be helpful.
A flexible framework of neural networks for deep learning
-
Updated
Jul 3, 2020 - Python
This class could be used instead of cd file https://catboost.ai/docs/concepts/input-data_column-descfile.html when creating Pool from filez. The class should have init function, methods load and save, and Pool init method should be able to use object of this class instead of cd file during initialization.
In my opinion, some people might not be able to contribute to CuPy because of not having an NVIDIA GPU. But they might not know that we can build a development env on google colab(As I did here).
import os
from google.colab import drive
drive.mount('/content/drive')
os.chdir("/content/drive/My Drive/")
!git clone htModern C++ Parallel Task Programming
-
Updated
Jul 6, 2020 - C++
Is your feature request related to a problem? Please describe.
According to the Arrow spec:
Bitmaps are to be initialized to be all unset at allocation time (this includes padding).
This would imply that bits outside the range [0, size) should always be zero. However, in cuDF/libcudf, we take a more conservative approach and say that bits outside [0,size) are undefined in order to a
Describe the bug
We previously had a test coverage of 82% but now it drops to 75%. Ideally, code coverage should be > 80% for a healthy repo.
To Reproduce
Steps to reproduce the behavior:
- go to https://codecov.io/gh/uber/aresdb you will see detailed overage for each package, file, method, and lines.
Expected behavior
Ideally, code coverage should be > 80% for a healthy
The documentation for hipMemcpyToArray / hipMemcpy2DToArray is seemingly copied from immediately above, and is wrong.
Note that it references dpitch instead of wOffset and hOffset.
-
Updated
Sep 11, 2018 - C++
emscripten support?
Would be awesome if we could use Remotery to profile C/C++ emscripten games.
Challenges:
-
Websocket server. We can't start a websocket server from an emscripten game. One workaround is to use a native websocket server as a proxy. Another solution is have emscripten directly send the data to another browser tab (
postMessage) -
Timer accuracy. Browsers have recently reduced the accurac
Currently, they are just all over the place! I propose the following rule-of-thumb:
- If a header file contains cuda kernels and/or device methods, then it should be a
.cuh. - Else, it should be a
.hpp(if the current header file is already named as a.h, feel free to leave it as it is, in order to minimize the amount of ripple changes this will cause). - If a header file is used to de
Tools such as gofmt are very useful for quickly cleaning reformatting code according to some coding style. I think we need the same for Futhark. It's a bit more complicated to do this for an expression-oriented language than a statement-oriented one, but there is prior art we can look at (like Brittany for Haskell).
While constructing futhark fmt i
e.g. based on the google doc https://docs.google.com/document/d/1ZI1V_2I3tETAeGnAwYZnrYTdeiOfdtWCL4l4DmRimFc/
p.s. I think that using docker for the course is an overhead which is not justified. As people create a designated VM for the course it would have been better to share a VM image so installation is one click and forget about docker. Docker is great when you need multiple environment or
ThunderSVM: A Fast SVM Library on GPUs and CPUs
-
Updated
Jun 7, 2020 - C++
Potentially dump question - the documentation of this project is generated with Sphinx. Is there any specific reason why PDF is not explicitly supported as an output format in this project? Sphinx does support it out of the box and I could patch it into the documentation's makefile if there are no objections.
Thank you for this fantastic work!
Could it be possible the fit_transform() method returns the KL divergence of the run?
Thx!
Hi there,
Could you please consider adding documentation on reading the code, and which files to follow for what?
I would also consider adding function-level comments.
Performance-optimized wheels for TensorFlow (SSE, AVX, FMA, XLA, MPI)
-
Updated
Jul 15, 2019
Fast Clojure Matrix Library
-
Updated
Jul 8, 2020 - Clojure
an implementation of Video Frame Interpolation via Adaptive Separable Convolution using PyTorch
-
Updated
Jul 9, 2020 - Python
CUDA Templates for Linear Algebra Subroutines
-
Updated
Jun 17, 2020 - Cuda
Samples for CUDA Developers which demonstrates features in CUDA Toolkit
-
Updated
Jun 30, 2020 - C++
Minkowski Engine is an auto-diff neural network library for high-dimensional sparse tensors
-
Updated
Jul 9, 2020 - Python
Improve this page
Add a description, image, and links to the cuda topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with the cuda topic, visit your repo's landing page and select "manage topics."


1. Issue or feature description
Documentation on the use of
--gpusflag at https://github.com/NVIDIA/nvidia-docker/wiki#i-have-multiple-gpu-devices-how-can-i-isolate-them-between-my-containers does not quote the parameters, resulting in an error:docker: Error response from daemon: cannot set both Count and DeviceIDs on device request.Documentation specifies:
`$ docker run --gpu