dataset
Here are 5,325 public repositories matching this topic...
A MNIST-like fashion product database. Benchmark
Updated Apr 24, 2021 - Python
Label Studio is a multi-type data labeling and annotation tool with standardized output format
Updated Oct 15, 2021 - Python
Large Scale Chinese Corpus for NLP
Updated Oct 22, 2020
Steps:
Register on the website
An account verification link will be sent by email
Go to email inbox
Right-click on the box and copy the link
Check the link by pasting it into a new tab
Hit Enter and check whether the account opens
HTTP LINK:
Impact: An attacker who captures the token, for example with Wireshark, can log in to the user's account directly.
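A verification link can be checked for this problem before it is emailed. The snippet below is a minimal sketch, not code from the reported site: the function name `uses_plain_http` and the example URLs are hypothetical and only illustrate testing the link's scheme.

```python
from urllib.parse import urlsplit


def uses_plain_http(link: str) -> bool:
    """Return True if the link's scheme is http, i.e. it travels unencrypted."""
    return urlsplit(link).scheme.lower() == "http"


# Hypothetical verification links, for illustration only.
print(uses_plain_http("http://example.com/verify?token=abc123"))   # True
print(uses_plain_http("https://example.com/verify?token=abc123"))  # False
```

A check like this only flags the scheme; it does not replace serving the verification endpoint over HTTPS.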
**I want to
Curated list of Machine Learning, NLP, Vision, Recommender Systems Project Ideas
Updated Sep 8, 2020
How to reproduce the behaviour
The error occurs in step 5/9 of the Docker build process
fetch http://dl-cdn.alpinelinux.org/alpine/v3.11/main/x86_64/APKINDEX.tar.gz
fetch http://dl-cdn.alpinelinux.org/alpine/v3.11/community/x86_64/APKINDEX.tar.gz
WARNING: Ignoring http://dl-cdn.alpinelinux.org/alpine/v3.11/main/x86_64/APKINDEX.tar.gz: BAD signature
WARNING: Ignoring http
Documentation on how to access and use the Quick, Draw! Dataset.
Updated May 16, 2021
This repository contains compatibility data for Web technologies as displayed on MDN
Updated Oct 14, 2021 - JavaScript
TFDS is a collection of datasets ready to use with TensorFlow, Jax, ...
Updated Oct 14, 2021 - Python
We are building an open database of COVID-19 cases with chest X-ray or CT images.
Updated Oct 14, 2021 - Jupyter Notebook
Semantic Segmentation Suite in TensorFlow. Implement, train, and test new Semantic Segmentation models easily!
Updated Apr 22, 2021 - Python
A curated list of awesome JSON datasets that don't require authentication.
Updated Jun 14, 2021 - JavaScript
Chinese Language Understanding Evaluation Benchmark: datasets, baselines, pre-trained models, corpus and leaderboard
Updated Oct 13, 2021 - Python
Extract data from a wide range of Internet sources into a pandas DataFrame.
Updated Sep 25, 2021 - Python
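Suggestion: add tokenizer categories and examples to the documentation
You are welcome to report issues with using PaddleNLP. Thank you very much for your contribution to PaddleNLP!
When filing your issue, please also provide the following information:
- Version and environment information
1) PaddleNLP and PaddlePaddle versions: please provide your PaddleNLP and PaddlePaddle version numbers, e.g. PaddleNLP 2.0.4, PaddlePaddle 2.1.1
2) System environment: please describe the system type, e.g. Linux/Windows/MacOS, and the Python version
- Reproduction information: for errors, please give the reproduction environment and the steps to reproduce
paddle version 2.0.8, paddlenlp version 2.1.0
Suggestion: could the PaddleNLP documentation list which category each model's tokenizer is based on (for example, the BERT tokenizer is WordPiece-based and the XLNet tokenizer is SentencePiece-based), along with corresponding input and output examples?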
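To illustrate the kind of input/output example the request describes, here is a minimal sketch of the greedy longest-match-first splitting commonly described for WordPiece tokenizers. It is not PaddleNLP's implementation: the function `wordpiece_tokenize` and the toy vocabulary are made up for this example, and SentencePiece is not covered.

```python
def wordpiece_tokenize(word, vocab, unk="[UNK]"):
    """Greedily split one word into the longest subword pieces found in vocab."""
    pieces = []
    start = 0
    while start < len(word):
        end = len(word)
        match = None
        # Try the longest remaining substring first, shrinking until a vocab hit.
        while start < end:
            candidate = word[start:end]
            if start > 0:
                candidate = "##" + candidate  # continuation pieces carry a "##" prefix
            if candidate in vocab:
                match = candidate
                break
            end -= 1
        if match is None:
            return [unk]  # no piece fits, so the whole word maps to unknown
        pieces.append(match)
        start = end
    return pieces


# Toy vocabulary, invented for this example only.
toy_vocab = {"token", "##izer", "##s", "un", "##able"}
print(wordpiece_tokenize("tokenizers", toy_vocab))  # ['token', '##izer', '##s']
```

A real tokenizer would load its vocabulary from the pretrained model and also handle whitespace, casing, and punctuation before this step.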
A synthetic data generator for text recognition
Updated Oct 7, 2021 - Python
Expected Behavior
I want to convert the torch.nn.Linear modules in my (possibly large) model to weight-drop linear modules and train the model on multiple GPUs. However, my sample code raises a RuntimeError. First, I have _weight_drop(), which drops part of the weights in torch.nn.Linear (see the code below).
Actual Behavior
RuntimeError: arguments are located on different GPUs at /
Updated Sep 27, 2021 - Jupyter Notebook
Resources for deep learning with satellite & aerial imagery
Updated Oct 14, 2021
Objectron is a dataset of short, object-centric video clips. In addition, the videos also contain AR session metadata including camera poses, sparse point-clouds and planes. In each video, the camera moves around and above the object and captures it from different views. Each object is annotated with a 3D bounding box. The 3D bounding box describes the object’s position, orientation, and dimensions. The dataset contains about 15K annotated video clips and 4M annotated images in the following categories: bikes, books, bottles, cameras, cereal boxes, chairs, cups, laptops, and shoes
Updated Aug 9, 2021 - Jupyter Notebook
FMA: A Dataset For Music Analysis
Updated Sep 7, 2021 - Jupyter Notebook
Add a way to change the sample id output in the annotation process to a specific number (see picture).
Reason: I want to annotate large texts, and the app does not handle it well when the documents to annotate are too large, so I split the document into sentences, but I would like to be able to
[ECCV 2018] CCPD: a diverse and well-annotated dataset for license plate detection and recognition
Updated Sep 27, 2020 - Python
Windows Events Attack Samples
Updated Aug 23, 2021 - HTML



New functionality to be added