A system for quickly generating training data with weak supervision
-
Updated
Sep 27, 2021 - Python
{{ message }}
A system for quickly generating training data with weak supervision
A GPU-accelerated library containing highly optimized building blocks and an execution engine for data processing to accelerate deep learning training and inference applications.
High-Level Training, Data Augmentation, and Utilities for Pytorch
自然语言处理(nlp),小姜机器人(闲聊检索式chatbot),BERT句向量-相似度(Sentence Similarity),XLNET句向量-相似度(text xlnet embedding),文本分类(Text classification), 实体提取(ner,bert+bilstm+crf),数据增强(text augment, data enhance),同义句同义词生成,句子主干提取(mainpart),中文汉语短文本相似度,文本特征工程,keras-http-service调用
Data augmentation for NLP, presented at EMNLP 2019
List of useful data augmentation resources. You will find here some not common techniques, libraries, links to github repos, papers and others.
Data Augmentation For Object Detection
An implement of the paper of EDA for Chinese corpus.中文语料的EDA数据增强工具。NLP数据增强。论文阅读笔记。
yolo(all versions) implementation in keras and tensorflow 2.x
This transform takes a fraction of the end or the start of the audio and treats that part as padding. We can implement several modes:
一键中文数据增强包 ; NLP数据增强、bert数据增强、EDA:pip install nlpcda
Random Erasing Data Augmentation. Experiments on CIFAR10, CIFAR100 and Fashion-MNIST
A high-performance Python-based I/O system for large (and small) deep learning problems, with strong support for PyTorch.
Unified Multilingual Robustness Evaluation Toolkit for Natural Language Processing
A Implementation of SpecAugment with Tensorflow & Pytorch, introduced by Google Brain
Light-weight Single Person Pose Estimator
Efficient Learning of Augmentation Policy Schedules
Deep Convolutional Neural Networks for Musical Source Separation
Implementation of the mixup training method
DeltaPy - Tabular Data Augmentation (by @firmai)
Amazon Forest Computer Vision: Satellite Image tagging code using PyTorch / Keras with lots of PyTorch tricks
Data augmentation tool for images
The offset can be randomized, as long as the output has the specified length
The idea is that one can have a chain of transforms, and some of them change the input length, but the final length should be fixed. That is where
DrQ: Data regularized Q
Collection of papers and resources for data augmentation for NLP.
Add a description, image, and links to the data-augmentation topic page so that developers can more easily learn about it.
To associate your repository with the data-augmentation topic, visit your repo's landing page and select "manage topics."
Output when I specify an attack without a model: