transformer
Here are 996 public repositories matching this topic...
Natural Language Processing Tutorial for Deep Learning Researchers
-
Updated
Oct 20, 2020 - Jupyter Notebook
Bidirectional RNN
Is there a way to train a bidirectional RNN (like LSTM or GRU) on trax nowadays?
Code for the paper "Jukebox: A Generative Model for Music"
-
Updated
Feb 4, 2021 - Python
chooses 15% of token
From paper, it mentioned
Instead, the training data generator chooses 15% of tokens at random, e.g., in the sentence my
dog is hairy it chooses hairy.
It means that 15% of token will be choose for sure.
From https://github.com/codertimo/BERT-pytorch/blob/master/bert_pytorch/dataset/dataset.py#L68,
for every single token, it has 15% of chance that go though the followup procedure.
PositionalEmbedding
Chinese version of GPT2 training code, using BERT tokenizer.
-
Updated
Feb 6, 2021 - Python
A TensorFlow Implementation of the Transformer: Attention Is All You Need
-
Updated
Nov 13, 2020 - Python
viewpager with parallax pages, together with vertical sliding (or click) and activity transition
-
Updated
May 3, 2017 - Java
Tutorials on implementing a few sequence-to-sequence (seq2seq) models with PyTorch and TorchText.
-
Updated
Feb 17, 2021 - Jupyter Notebook
PostHTML is a tool to transform HTML/XML with JS plugins
-
Updated
Dec 14, 2020 - JavaScript
Tool for visualizing attention in the Transformer model (BERT, GPT-2, Albert, XLNet, RoBERTa, CTRL, etc.)
-
Updated
Feb 15, 2021 - Jupyter Notebook
-
Updated
Feb 7, 2019 - Python
GPT2 for Chinese chitchat/用于中文闲聊的GPT2模型(实现了DialoGPT的MMI思想)
-
Updated
Feb 10, 2021 - Python
Large-scale pretraining for dialogue
-
Updated
Dec 4, 2020 - Python
微信、bilibili大图、长图、gif、视频、自定义view的转场效果,The transition effect of wechat, bilibili large image, long image, GIF, video and custom view
-
Updated
Feb 8, 2021 - Java
Sequence-to-sequence framework with a focus on Neural Machine Translation based on Apache MXNet
-
Updated
Feb 8, 2021 - Python
中文长文本分类、短句子分类、多标签分类、两句子相似度(Chinese Text Classification of Keras NLP, multi-label classify, or sentence classify, long or short),字词句向量嵌入层(embeddings)和网络层(graph)构建基类,FastText,TextCNN,CharCNN,TextRNN, RCNN, DCNN, DPCNN, VDCNN, CRNN, Bert, Xlnet, Albert, Attention, DeepMoji, HAN, 胶囊网络-CapsuleNet, Transformer-encode, Seq2seq, SWEM, LEAM, TextGCN
-
Updated
Dec 19, 2020 - Python
Keras implementation of BERT with pre-trained weights
-
Updated
Jul 26, 2019 - Python
Hi, I am so interesting in your project, and wonder if you need contributor and how could I make my own contribution?
building a chinese dialogue system based on the newest version of rasa(基于最新版本rasa搭建的对话系统)
-
Updated
Dec 25, 2020 - Python
Jupyter Notebook tutorials on solving real-world problems with Machine Learning & Deep Learning using PyTorch. Topics: Face detection with Detectron 2, Time Series anomaly detection with LSTM Autoencoders, Object Detection with YOLO v5, Build your first Neural Network, Time Series forecasting for Coronavirus daily cases, Sentiment Analysis with BERT.
-
Updated
Jan 23, 2021 - Jupyter Notebook
-
Updated
May 12, 2020 - Java
A Keras TensorFlow 2.0 implementation of BERT, ALBERT and adapter-BERT.
-
Updated
Jan 21, 2021 - Python
CTR prediction models based on deep learning(基于深度学习的广告推荐CTR预估模型)
-
Updated
Nov 15, 2019 - Python
list of efficient attention modules
-
Updated
Feb 9, 2021 - Python
-
Updated
Feb 10, 2021 - TypeScript
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding 论文的中文翻译 Chinese Translation!
-
Updated
Jun 1, 2020
Import SVG files in your React Native project the same way that you would in a Web application.
-
Updated
Jan 30, 2021 - JavaScript
A curated list of NLP resources focused on BERT, attention mechanism, Transformer networks, and transfer learning.
-
Updated
Feb 18, 2021
Improve this page
Add a description, image, and links to the transformer topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with the transformer topic, visit your repo's landing page and select "manage topics."


This is an intermediate issue, which is why it gets both the good first issue and good second issue tags.
We have an automated script to check when copies of the same code are consistent inside the library, which allows us to avoid subclassing and keep all code for one model's forward pass inside one file (see our philosophy for more details on this).
The XxxModelForYyy are very similar