transformer
Here are 1,328 public repositories matching this topic...
Natural Language Processing Tutorial for Deep Learning Researchers
-
Updated
Jul 25, 2021 - Jupyter Notebook
Bidirectional RNN
Is there a way to train a bidirectional RNN (like LSTM or GRU) on trax nowadays?
Code for the paper "Jukebox: A Generative Model for Music"
-
Updated
Aug 11, 2021 - Python
chooses 15% of token
From paper, it mentioned
Instead, the training data generator chooses 15% of tokens at random, e.g., in the sentence my
dog is hairy it chooses hairy.
It means that 15% of token will be choose for sure.
From https://github.com/codertimo/BERT-pytorch/blob/master/bert_pytorch/dataset/dataset.py#L68,
for every single token, it has 15% of chance that go though the followup procedure.
PositionalEmbedding
Chinese version of GPT2 training code, using BERT tokenizer.
-
Updated
Aug 1, 2021 - Python
A TensorFlow Implementation of the Transformer: Attention Is All You Need
-
Updated
May 21, 2021 - Python
-
Updated
Aug 13, 2021 - Jupyter Notebook
Tutorials on implementing a few sequence-to-sequence (seq2seq) models with PyTorch and TorchText.
-
Updated
Aug 4, 2021 - Jupyter Notebook
Tool for visualizing attention in the Transformer model (BERT, GPT-2, Albert, XLNet, RoBERTa, CTRL, etc.)
-
Updated
Jul 31, 2021 - Python
viewpager with parallax pages, together with vertical sliding (or click) and activity transition
-
Updated
May 3, 2017 - Java
PostHTML is a tool to transform HTML/XML with JS plugins
-
Updated
Aug 13, 2021 - JavaScript
The OCR approach is rephrased as Segmentation Transformer: https://arxiv.org/abs/1909.11065. This is an official implementation of semantic segmentation for HRNet. https://arxiv.org/abs/1908.07919
-
Updated
Jul 20, 2021 - Python
关于一些具体建议
1.希望可以把底层的api文档再完善一些,比如encoder,decoder,以便于复现一些论文
2.希望可以维护一个pytorch和paddle的api对照一览表,尽量全一些
3.错误日志能否准确一些,有时候datalaoder出的一些错误信息不好定位
4.能否增加使用梯度累加特性,进一步提高batch size
预训练模型下载地址修改
Collect some papers about transformer with vision. Awesome Transformer with Computer Vision (CV)
-
Updated
Aug 12, 2021
LightSeq: A High Performance Library for Sequence Processing and Generation
-
Updated
Aug 12, 2021 - Cuda
CV backbones including GhostNet, TinyNet and TNT.
-
Updated
Aug 12, 2021 - Python
Hi all,
We are asking for help from our community, hoping to provide Chinese documentation to all users. It's not difficult and here is the chance to be one of the contributors of MMOCR, an open-source rising star!
Here is a list of documentation needed to be translated:
- docs/getting_started.md
- [docs/ins
Roadmap of MMOCR
GPT2 for Chinese chitchat/用于中文闲聊的GPT2模型(实现了DialoGPT的MMI思想)
-
Updated
Jun 16, 2021 - Python
A PaddlePaddle Speech to Any toolkit.
-
Updated
Aug 12, 2021 - Jupyter Notebook
Large-scale pretraining for dialogue
-
Updated
Jul 1, 2021 - Python
-
Updated
Aug 9, 2021 - Python
微信、bilibili大图、长图、gif、视频、自定义view的转场效果,The transition effect of wechat, bilibili large image, long image, GIF, video and custom view
-
Updated
Jul 30, 2021 - Java
[Preprint] "TransGAN: Two Pure Transformers Can Make One Strong GAN, and That Can Scale Up", Yifan Jiang, Shiyu Chang, Zhangyang Wang
-
Updated
Aug 10, 2021 - Python
Production First and Production Ready End-to-End Speech Recognition Toolkit
-
Updated
Aug 13, 2021 - C++
中文长文本分类、短句子分类、多标签分类、两句子相似度(Chinese Text Classification of Keras NLP, multi-label classify, or sentence classify, long or short),字词句向量嵌入层(embeddings)和网络层(graph)构建基类,FastText,TextCNN,CharCNN,TextRNN, RCNN, DCNN, DPCNN, VDCNN, CRNN, Bert, Xlnet, Albert, Attention, DeepMoji, HAN, 胶囊网络-CapsuleNet, Transformer-encode, Seq2seq, SWEM, LEAM, TextGCN
-
Updated
Dec 19, 2020 - Python
Sequence-to-sequence framework with a focus on Neural Machine Translation based on Apache MXNet
-
Updated
Aug 13, 2021 - Python
《计算机视觉实战演练:算法与应用》中文电子书、源码、读者交流社区(持续更新中 ...)
-
Updated
Aug 13, 2021 - Jupyter Notebook
Jupyter Notebook tutorials on solving real-world problems with Machine Learning & Deep Learning using PyTorch. Topics: Face detection with Detectron 2, Time Series anomaly detection with LSTM Autoencoders, Object Detection with YOLO v5, Build your first Neural Network, Time Series forecasting for Coronavirus daily cases, Sentiment Analysis with BERT.
-
Updated
Jun 14, 2021 - Jupyter Notebook
Hi, I am so interesting in your project, and wonder if you need contributor and how could I make my own contribution?
Improve this page
Add a description, image, and links to the transformer topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with the transformer topic, visit your repo's landing page and select "manage topics."


https://github.com/huggingface/transformers/blob/546dc24e0883e5e9f5eb06ec8060e3e6ccc5f6d7/src/transformers/models/gpt2/modeling_gpt2.py#L698
Assertions can't be relied upon for control flow because they can be disabled, as per the following: