dataset
Here are 3,534 public repositories matching this topic...
A MNIST-like fashion product database. Benchmark
Updated Jul 22, 2020 - Python
Large Scale Chinese Corpus for NLP
Updated Dec 1, 2019
Curated list of Machine Learning, NLP, Vision, Recommender Systems Project Ideas
Updated Aug 17, 2020
My actions before raising this issue
- Read/searched the docs
- Searched past issues
The current version of PyAV is 6.2.0. It needs to be updated to 8.x.
Also PyAV d
Documentation on how to access and use the Quick, Draw! Dataset.
Updated Jul 30, 2020
I have set up Postgres and Doccano in Kubernetes, and both are working well, but I want to know which mount point to use when attaching a persistent volume in Kubernetes.
My deployment.yaml
apiVersion: extensions/v1beta1
kind: Deployment
metadata:
  labels:
    app: doccano
  name: doccano
  namespace: default
spec:
  progressDeadlineSeconds: 600
  replicas: 1
  revisi
Label Studio is a multi-type data labeling and annotation tool with standardized output format
Updated Aug 26, 2020 - JavaScript
This repository contains compatibility data for Web technologies as displayed on MDN
Updated Aug 29, 2020 - JavaScript
Data loaders and abstractions for text and NLP
Updated Aug 28, 2020 - Python
We are building an open database of COVID-19 cases with chest X-ray or CT images.
Updated Aug 19, 2020 - Jupyter Notebook
Semantic Segmentation Suite in TensorFlow. Implement, train, and test new Semantic Segmentation models easily!
Updated Jun 30, 2020 - Python
A curated list of awesome JSON datasets that don't require authentication.
Updated Apr 29, 2020 - JavaScript
Expected Behavior
I want to convert the torch.nn.Linear modules in my (possibly large) model to weight-drop linear modules, and I want to train the model on multiple GPUs. However, my sample code raises a RuntimeError. First, I have _weight_drop(), which drops some of the weights in torch.nn.Linear (see the code below).
Actual Behavior
RuntimeError: arguments are located on different GPUs at /
Updated Aug 21, 2020 - Jupyter Notebook
A synthetic data generator for text recognition
Updated Aug 15, 2020 - Python
FMA: A Dataset For Music Analysis
Updated Jul 23, 2020 - Jupyter Notebook
Chinese Language Understanding Evaluation Benchmark: datasets, baselines, pre-trained models, corpus and leaderboard
Updated Jul 15, 2020 - Python
JSON time-series of coronavirus cases (confirmed, deaths and recovered) per country - updated daily
Updated Aug 29, 2020 - JavaScript
The dataset is used to train my own raccoon detector and I blogged about it on Medium
Updated Aug 1, 2020 - Jupyter Notebook
[ECCV 2018] CCPD: a diverse and well-annotated dataset for license plate detection and recognition
Updated May 14, 2020 - Python
A simple PyTorch Implementation of Generative Adversarial Networks, focusing on anime face drawing.
Updated Jul 24, 2020 - Jupyter Notebook
Updated Aug 12, 2020 - Python


I was wondering if it is possible to generate a list of 'n' unique company names? I saw some PRs that added a unique keyword for 'words', but it doesn't seem to extend to other providers. I understand I could just keep regenerating and dropping duplicates until I got a unique set of length n, but it would be nice to just have a keyword for that (plus this m
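The "keep regenerating and dropping duplicates" workaround mentioned in this issue could be sketched roughly as below. This is only an illustration using the standard library: `generate_unique`, `fake_company`, and the word lists are hypothetical stand-ins, not part of any library's API, and `fake_company` merely plays the role of whatever provider method produces company names.

```python
import random
from typing import Callable, List


def generate_unique(generate: Callable[[], str], n: int,
                    max_attempts: int = 10_000) -> List[str]:
    """Call `generate` repeatedly until `n` distinct values are collected.

    Raises RuntimeError if `max_attempts` calls are not enough, which avoids
    looping forever when the generator has fewer than `n` possible outputs.
    """
    if n <= 0:
        return []
    seen = set()
    result = []
    for _ in range(max_attempts):
        value = generate()
        if value in seen:
            continue  # duplicate: drop it and try again
        seen.add(value)
        result.append(value)
        if len(result) == n:
            return result
    raise RuntimeError(
        f"only {len(result)} unique values after {max_attempts} attempts"
    )


# Hypothetical stand-in for a provider's company-name method.
_rng = random.Random(0)
_PREFIXES = ["Acme", "Globex", "Initech", "Umbrella", "Hooli"]
_SUFFIXES = ["Inc", "LLC", "Group", "Ltd"]


def fake_company() -> str:
    return f"{_rng.choice(_PREFIXES)} {_rng.choice(_SUFFIXES)}"


names = generate_unique(fake_company, 10)
print(names)
```

The attempt cap is the main design choice here: without it, asking for more unique values than the generator can produce would never terminate.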