transformer

Feature request

We currently have ViLT in the library, which, among other tasks, is capable of performing visual question answering (VQA).

It would be great to have a pipeline for this task, with the following API:

from transformers import pipeline

pipe = pipeline("vqa")
pipe("cats.png", "how many cats are there?")
`

Is there a way to train a bidirectional RNN (like LSTM or GRU) on trax nowadays?

From paper, it mentioned

Instead, the training data generator chooses 15% of tokens at random, e.g., in the sentence my
dog is hairy it chooses hairy.

It means that 15% of token will be choose for sure.

From https://github.com/codertimo/BERT-pytorch/blob/master/bert_pytorch/dataset/dataset.py#L68,
for every single token, it has 15% of chance that go though the followup procedure.

We keep this issue open to collect feature requests from users and hear your voice. Our monthly release plan is also available here.

You can either:

Suggest a new feature by leaving a comment.
Vote for a feature request with 👍 or be against with 👎. (Remember that developers are busy and cannot respond to all feature requests, so vote for your most favorable one!)
Tell us that you wo

欢迎您反馈PaddleNLP使用问题，非常感谢您对PaddleNLP的贡献！
在留下您的问题时，辛苦您同步提供如下信息：

版本、环境信息
1）PaddleNLP和PaddlePaddle版本：请提供您的PaddleNLP和PaddlePaddle版本号，例如PaddleNLP 2.0.4，PaddlePaddle2.1.1
2）系统环境：请您描述系统类型，例如Linux/Windows/MacOS/，python版本
复现信息：如为报错，请给出复现环境、复现步骤
paddle版本2.0.8 paddlenlp版本2.1.0
建议，能否在paddlenlp文档中，整理列出各个模型的tokenizer是基于什么类别的based，如bert tokenizer是word piece的，xlnet tokenizer是sentence piece的，以及对应的输入输出样例

目前的多音字使用 pypinyin 或者 g2pM，精度有限，想做一个基于 BERT (或者 ERNIE) 多音字预测模型，简单来说就是假设某语言有 100 个多音字，每个多音字最多有 3 个发音，那么可以在 BERT 后面接 100 个 3 分类器（简单的 fc 层即可），在预测时，找到对应的分类器进行分类即可。
参考论文：
tencent_polyphone.pdf

数据可以用 https://github.com/kakaobrain/g2pM 提供的数据

进阶：多任务的 BERT
![image](https://user-images.githubusercontent.com/24568452

Apr	MAY	Jun
	16
2021	2022	2023

transformer

Here are 1,901 public repositories matching this topic...

huggingface / transformers

Feature request

graykode / nlp-tutorial

ml-tooling / best-of-ml-python

labmlai / annotated_deep_learning_paper_implementations

amusi / CVPR2022-Papers-with-Code

google / trax

openai / jukebox

Morizeyao / GPT2-Chinese

codertimo / BERT-pytorch

PaddlePaddle / PaddleSeg

jessevig / bertviz

bentrevett / pytorch-seq2seq

open-mmlab / mmsegmentation

Kyubyong / transformer

PaddlePaddle / PaddleNLP

PaddlePaddle / PaddleSpeech

xmuSistone / ViewpagerTransition

posthtml / posthtml

HRNet / HRNet-Semantic-Segmentation

dk-liang / Awesome-Visual-Transformer

bytedance / lightseq

wenet-e2e / wenet

timeseriesAI / tsai

huawei-noah / CV-Backbones

yangjianxin1 / GPT2-chitchat

microsoft / DialoGPT

curiousily / Getting-Things-Done-with-Pytorch

JingyunLiang / SwinIR

PaddlePaddle / awesome-DeepLearning

thunlp / OpenPrompt

Improve this page

Add this topic to your repo