gensim
Here are 376 public repositories matching this topic...
A fast, efficient universal vector embedding utility package.
Updated Jul 17, 2020 - Python
Documents, papers and code related to Natural Language Processing, including Topic Model, Word Embedding, Named Entity Recognition, Text Classification, Text Generation, Text Similarity, Machine Translation, etc. All code is implemented in TensorFlow 2.0.
Updated Aug 6, 2020 - Python
Updated May 29, 2020 - Python
Learn how to process, classify, cluster, summarize, understand syntax, semantics and sentiment of text data with the power of Python! This repository contains code and datasets used in my book, "Text Analytics with Python" published by Apress/Springer.
Updated Sep 10, 2020 - Jupyter Notebook
Starter code to solve real world text data problems. Includes: Gensim Word2Vec, phrase embeddings, Text Classification with Logistic Regression, word count with pyspark, simple text preprocessing, pre-trained embeddings and more.
Updated Apr 22, 2020 - Jupyter Notebook
Data repository for pretrained NLP models and NLP corpora.
Updated Mar 16, 2018 - Python
Compute Sentence Embeddings Fast!
Updated Aug 5, 2020 - Python
ADAM - A Question Answering System. Inspired by IBM Watson.
Updated Feb 6, 2020 - Python
AraVec is a pre-trained distributed word representation (word embedding) open source project which aims to provide the Arabic NLP research community with free to use and powerful word embedding models.
Updated Jul 22, 2020 - Jupyter Notebook
An experiment about re-implementing supervised learning models based on shallow neural network approaches (e.g. fastText) with some additional exclusive features and nice API. Written in Python and fully compatible with Scikit-learn.
Updated Aug 8, 2017 - Python
The TensorFlow reference implementation of 'GEMSEC: Graph Embedding with Self Clustering' (ASONAM 2019).
Updated May 31, 2020 - Python
Toolkit to obtain and preprocess German corpora, train models using word2vec (gensim), and evaluate them with generated test sets.
Updated Sep 27, 2020 - Jupyter Notebook
A PyTorch implementation of "Splitter: Learning Node Representations that Capture Multiple Social Contexts" (WWW 2019).
Updated Sep 24, 2020 - Python
Web-ify your word2vec: framework to serve distributional semantic models online
Updated Oct 1, 2020 - Python
Using pre-trained word embeddings (fastText, Word2Vec)
Updated Jun 19, 2018 - Python
Is your feature request related to a problem? Please describe.
Users may want to get logs from S3 instead of Elasticsearch. We should be able to support that.
Describe the solution you'd like
You can use the boto3 library to download files.
This package is already included in dependencies.
A scalable Gensim implementation of "Learning Role-based Graph Embeddings" (IJCAI 2018).
Updated Sep 24, 2020 - Python
A text analysis application for performing common NLP tasks through a web dashboard interface and an API.
Updated Jan 18, 2019 - HTML
Reference implementation of Diffusion2Vec (CompleNet 2018) built on Gensim and NetworkX.
Updated Sep 24, 2020 - Python
ML-based projects such as Spam Classification, Time Series Analysis, and Text Classification using Random Forest, Deep Learning, Bayesian methods, and XGBoost in Python.
Updated Nov 21, 2019
A lightweight implementation of Walklets from "Don't Walk, Skip! Online Learning of Multi-scale Network Embeddings" (ASONAM 2017).
Updated Sep 24, 2020 - Python
Updated Sep 27, 2020 - Python
Document embedding and machine learning script for beginners
Updated Oct 3, 2019 - Python
Hybrid Code Networks https://arxiv.org/abs/1702.03274
Updated Apr 13, 2017 - Python
A PyTorch Implementation of "SINE: Scalable Incomplete Network Embedding" (ICDM 2018).
Updated Sep 24, 2020 - Python
The reference implementation of "Multi-scale Attributed Node Embedding".
Updated Sep 24, 2020 - Python


Not a high priority at all, but it'd be more sensible for such a tutorial/testing utility corpus to be implemented elsewhere, maybe under /test/ or some other data- or doc-related module, rather than in gensim.models.word2vec.
Originally posted by @gojomo in RaRe-Technologies/gensim#2939 (comment)