natural-language-processing
Natural language processing (NLP) is a field of computer science that studies how computers and humans interact. In the 1950s, Alan Turing published an article that proposed a measure of intelligence, now called the Turing test. More modern techniques, such as deep learning, have produced results in the fields of language modeling, parsing, and natural-language tasks.
Here are 8,495 public repositories matching this topic...
TensorFlow code and pre-trained models for BERT
-
Updated
Sep 11, 2021 - Python
Learn how to responsibly deliver value with ML.
-
Updated
Sep 20, 2021 - Jupyter Notebook
《动手学深度学习》:面向中文读者、能运行、可讨论。中英文版被全球200所大学采用教学。
-
Updated
Sep 26, 2021 - Python
中文分词 词性标注 命名实体识别 依存句法分析 语义依存分析 新词发现 关键词短语提取 自动摘要 文本分类聚类 拼音简繁转换 自然语言处理
-
Updated
Sep 18, 2021 - Python
-
Updated
Sep 24, 2021 - Python
Repository to track the progress in Natural Language Processing (NLP), including the datasets and the current state-of-the-art for the most common NLP tasks.
-
Updated
Sep 24, 2021 - Python
-
Updated
Sep 25, 2021
Oxford Deep NLP 2017 course
-
Updated
Jun 12, 2017
Change tensor.data to tensor.detach() due to
pytorch/pytorch#6990 (comment)
tensor.detach() is more robust than tensor.data.
-
Updated
Sep 24, 2021 - Python
-
Updated
Sep 19, 2021
A comprehensive list of pytorch related content on github,such as different models,implementations,helper libraries,tutorials etc.
-
Updated
May 2, 2021
Interactive deep learning book with multi-framework code, math, and discussions. Adopted at 200 universities.
-
Updated
Sep 25, 2021 - Python
A very simple framework for state-of-the-art Natural Language Processing (NLP)
-
Updated
Sep 23, 2021 - Python
Is your feature request related to a problem? Please describe.
I typically used compressed datasets (e.g. gzipped) to save disk space. This works fine with AllenNLP during training because I can write my dataset reader to load the compressed data. However, the predict command opens the file and reads lines for the Predictor. This fails when it tries to load data from my compressed files.
Hello everyone,
I need to compute the BLEU score with more than one ngram length (ideally, BLEU2, BLEU3, BLEU4, and BLEU5). In my case, this is a very long task, as every hypothesis has some thousand references.
Reading the implementation of the corpus_bleu function, which takes weights:Tuple between its parameters - and thus calculating BLEU-[len(weights)] - , I found out that it gets all t
This repository contains code examples for the Stanford's course: TensorFlow for Deep Learning Research.
-
Updated
Dec 22, 2020 - Python
-
Updated
Sep 26, 2021 - Python
Drench yourself in Deep Learning, Reinforcement Learning, Machine Learning, Computer Vision, and NLP by learning from these exciting lectures!!
-
Updated
Sep 18, 2021 - HTML
Mapping a variable-length sentence to a fixed-length vector using BERT model
-
Updated
Jul 1, 2021 - Python
Natural Language Processing Tutorial for Deep Learning Researchers
-
Updated
Jul 25, 2021 - Jupyter Notebook
Hello spoooopyyy hackers
This is a Hacktoberfest only issue!
This is also data-sciency!
The Problem
Our English dictionary contains words that aren't English, and does not contain common English words.
Examples of non-common words in the dictionary:
"hlithskjalf",
"hlorrithi",
"hlqn",
"hm",
"hny",
"ho",
"hoactzin",
"hoactzine
Stanford CoreNLP: A Java suite of core NLP tools.
-
Updated
Sep 26, 2021 - Java
Web mining module for Python, with tools for scraping, natural language processing, machine learning, network analysis and visualization.
-
Updated
Jul 9, 2021 - Python
Ludwig is a toolbox that allows to train and evaluate deep learning models without the need to write code.
-
Updated
Sep 26, 2021 - Python
A collection of machine learning examples and tutorials.
-
Updated
Aug 9, 2021 - Python
Pre-trained and Reproduced Deep Learning Models (『飞桨』官方模型库,包含多种学术前沿和工业场景验证的深度学习模型)
-
Updated
Sep 23, 2021 - Python
Created by Alan Turing
- Wikipedia
- Wikipedia


https://github.com/huggingface/transformers/blob/546dc24e0883e5e9f5eb06ec8060e3e6ccc5f6d7/src/transformers/models/gpt2/modeling_gpt2.py#L698
Assertions can't be relied upon for control flow because they can be disabled, as per the following: