Topic Modelling for Humans
-
Updated
Jul 16, 2020 - Python
Topic Modelling for Humans
A fast, efficient universal vector embedding utility package.
Documents, papers and codes related to Natural Language Processing, including Topic Model, Word Embedding, Named Entity Recognition, Text Classificatin, Text Generation, Text Similarity, Machine Translation),etc. All codes are implemented intensorflow 2.0.
Learn how to process, classify, cluster, summarize, understand syntax, semantics and sentiment of text data with the power of Python! This repository contains code and datasets used in my book, "Text Analytics with Python" published by Apress/Springer.
Starter code to solve real world text data problems. Includes: Gensim Word2Vec, phrase embeddings, Text Classification with Logistic Regression, word count with pyspark, simple text preprocessing, pre-trained embeddings and more.
Data repository for pretrained NLP models and NLP corpora.
ADAM - A Question Answering System. Inspired from IBM Watson
Compute Sentence Embeddings Fast!
AraVec is a pre-trained distributed word representation (word embedding) open source project which aims to provide the Arabic NLP research community with free to use and powerful word embedding models.
An experiment about re-implementing supervised learning models based on shallow neural network approaches (e.g. fastText) with some additional exclusive features and nice API. Written in Python and fully compatible with Scikit-learn.
The TensorFlow reference implementation of 'GEMSEC: Graph Embedding with Self Clustering' (ASONAM 2019).
Toolkit to obtain and preprocess german corpora, train models using word2vec (gensim) and evaluate them with generated testsets
A Pytorch implementation of "Splitter: Learning Node Representations that Capture Multiple Social Contexts" (WWW 2019).
Web-ify your word2vec: framework to serve distributional semantic models online
Using pre trained word embeddings (Fasttext, Word2Vec)
A scalable Gensim implementation of "Learning Role-based Graph Embeddings" (IJCAI 2018).
Log Anomaly Detection - Machine learning to detect abnormal events logs
Reference implementation of Diffusion2Vec (Complenet 2018) built on Gensim and NetworkX.
A text analysis application for performing common NLP tasks through a web dashboard interface and an API
A lightweight implementation of Walklets from "Don't Walk Skip! Online Learning of Multi-scale Network Embeddings" (ASONAM 2017).
Hybrid Code Networks https://arxiv.org/abs/1702.03274
Text classification using Doc2Vec embeddings
ML based projects such as Spam Classification, Time Series Analysis, Text Classification using Random Forest, Deep Learning, Bayesian, Xgboost in Python
document embedding and machine learning script for beginners
A PyTorch Implementation of "SINE: Scalable Incomplete Network Embedding" (ICDM 2018).
The reference implementation of "Multi-scale Attributed Node Embedding".
Add a description, image, and links to the gensim topic page so that developers can more easily learn about it.
To associate your repository with the gensim topic, visit your repo's landing page and select "manage topics."