The Wayback Machine - http://web.archive.org/web/20220411125338/https://github.com/topics/text-as-data

text-as-data

Here are 17 public repositories matching this topic...

JasonKessler / scattertext

Beautiful visualizations of how language differs among document types.

visualization d3 nlp machine-learning natural-language-processing text-mining word2vec exploratory-data-analysis word-embeddings sentiment eda topic-modeling scatter-plot japanese-language stylometry computational-social-science text-visualization text-as-data stylometric semiotic-squares

Updated Mar 26, 2022
Python

MilaNLProc / contextualized-topic-models

A Python package to run contextualized topic modeling. CTMs combine contextualized embeddings (e.g., BERT) with topic models to produce coherent topics. Published at EACL and ACL 2021.

nlp embeddings transformer topic-modeling nlp-library nlp-machine-learning bert neural-topic-models text-as-data topic-coherence multilingual-topic-models multilingual-models

Updated Apr 4, 2022
Python

ryanjgallagher / shifterator

Open issue: Add type2score alias

ryanjgallagher commented Jul 26, 2021

Add a new parameter alias type2score which can be used to set the score dictionary for both corpora at the same time in a WeightedAvgShift. Specifying a single dictionary using only type2score_1 is already possible, but a parameter type2score would be more natural. Just have to check if type2score is None, and if not, set type2score_1 as type2score. Existing code should then handle

good first issue
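
The alias check described in the issue above can be sketched as follows. This is a minimal illustration, not shifterator's actual code: the helper name `resolve_score_dicts` is hypothetical, and the second step (reusing `type2score_1` for the second corpus when `type2score_2` is missing) is only an assumed stand-in for the existing single-dictionary behavior the issue mentions.

```python
def resolve_score_dicts(type2score_1=None, type2score_2=None, type2score=None):
    """Hypothetical helper illustrating the proposed alias; not library code."""
    # Alias step from the issue: if type2score is given, use it as type2score_1.
    if type2score is not None:
        type2score_1 = type2score

    # Assumption: stands in for the existing handling of a single dictionary,
    # where a lone type2score_1 is reused for the second corpus.
    if type2score_1 is not None and type2score_2 is None:
        type2score_2 = type2score_1

    return type2score_1, type2score_2


# Usage: one score dictionary applied to both corpora through the alias.
scores = {"happy": 8.0, "sad": 2.0}
first, second = resolve_score_dicts(type2score=scores)
```

In this sketch the alias silently takes precedence when both `type2score` and `type2score_1` are passed; a real implementation might prefer to raise an error in that case.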

JasonKessler / Scattertext-PyData

Notebooks for the Seattle PyData 2017 talk on Scattertext

visualization nlp natural-language-processing word2vec pydata political-science gender political-parties computational-social-science text-visualization text-as-data

Updated Jan 12, 2018
HTML

jboynyc / textnets

Text analysis with networks.

visualization nlp sociology text-analysis network-analysis computational-social-science text-as-data

Updated Apr 6, 2022
Python

umanlp / SemScale

A tool for Semantic Scaling of Political Text (branch of Topfish, a suite of tools for Political Text Analysis)

computational-social-science text-scaling text-as-data

Updated Feb 13, 2022
Python

fedenanni / Computational-Text-Analysis-2018-19

2018 Computational Text Analysis Notebooks, University of Mannheim

natural-language-processing teaching-materials computational-social-science text-as-data

Updated Nov 22, 2018
Jupyter Notebook

davidycliao / redguards

A package for replicating the estimates and findings in the article "Factionalism and the Red Guards under Mao's China: Ideal Point Estimation Using Text Data."

nlp china part-of-speech-tagger r-programming udpipe quanteda text-as-data red-guards cultural-revolution

Updated Feb 22, 2022
R

wesslen / summer2017-socialmedia

Summer 2017 Social Media Analytics Workshop Series

r twitter-api geospatial facebook-api text-as-data

Updated May 19, 2018
HTML

adamlauretig / gensim_in_R

Code for estimating word embeddings with gensim in R.

r gensim text-as-data

Updated Oct 30, 2018

davidycliao / bisCrawler

An automated web crawler for extracting central bankers' speeches

python scraper scraping speeches text-as-data bank-for-international-settlements central-bankers-speeches central-banker

Updated Mar 30, 2022
Python

thelautiff / UN_meeting_records

From using xpdf, rvest, and quanteda on United Nations Digital Library search results to applying dictionaries to speeches in United Nations meeting records

pdf r regular-expression rvest pdfs united-nations xpdf quanteda text-as-data

Updated Apr 16, 2019
R

tenggaard / Embedded_understanding

Research project exploring ways to compare understanding through word embeddings.

nlp word-embedding text-as-data

Updated May 21, 2021

WZBSocialScienceCenter / tm_corona

A small showcase for topic modeling with the tmtoolkit Python package. I use a corpus of articles from the German online news website Spiegel Online (SPON) to create a topic model for before and during the COVID-19 pandemic.

python text-mining news scraping text-analysis corona topic-modeling webscraping text-as-data topicmodeling covid-19

Updated Dec 2, 2020
Jupyter Notebook

aflueckiger / KED2021

The ABC of Computational Text Analysis. BA Seminar, Spring 2021, University of Lucerne

sociology text-analysis teaching computational-social-science social-science text-as-data

Updated Mar 3, 2022
HTML

graceadcox / Refugee-Text-as-Data

Original corpus of articles relating to refugees, scraped from the Tennessee newspaper The Chattanoogan, along with simple code for a text-as-data word cloud.

r word-cloud text-as-data

Updated Nov 11, 2019
R

aflueckiger / KED2022

The ABC of Computational Text Analysis. BA Seminar, Spring 2022, University of Lucerne

sociology teaching computational-social-science social-science text-as-data

Updated Apr 9, 2022
HTML
