Beautiful visualizations of how language differs among document types.
-
Updated
Mar 26, 2022 - Python
{{ message }}
Beautiful visualizations of how language differs among document types.
A python package to run contextualized topic modeling. CTMs combine contextualized embeddings (e.g., BERT) with topic models to get coherent topics. Published at EACL and ACL 2021.
Notebooks for the Seattle PyData 2017 talk on Scattertext
Text analysis with networks.
A tool for Semantic Scaling of Political Text (branch of Topfish, a suite of tools for Political Text Analysis)
2018 Computational Text Analysis Notebooks, University of Mannheim
This is a designed package for replicating the estimates and findings in the article of Factionalism and the Red Guards under Mao's China: Ideal Point Estimation Using Text Data.
Summer 2017 Social Media Analytics Workshop Series
An Automation Webcrawler for Extracting Central Bankers' Speeches
From using xpdf, rvest, and quanteda on United Nations Digital Library search results to applying dictionaries to speeches in United Nations meeting records
Research project exploring ways to compare understanding through word embeddings.
A small showcase for topic modeling with the tmtoolkit Python package. I use a corpus of articles from the German online news website Spiegel Online (SPON) to create a topic model for before and during the COVID-19 pandemic.
The ABC of Computational Text Analysis. BA Seminar, Spring 2021, University of Lucerne
Original corpus of articles relating to refugees scraped from Tennessee newspaper The Chattanoogan along with simple code for text-as-data word cloud.
The ABC of Computational Text Analysis. BA Seminar, Spring 2022, University of Lucerne
Add a description, image, and links to the text-as-data topic page so that developers can more easily learn about it.
To associate your repository with the text-as-data topic, visit your repo's landing page and select "manage topics."
Add a new parameter alias
type2scorewhich can be used to set the score dictionary for both corpora at the same time in aWeightedAvgShift. Specifying a single dictionary using onlytype2score_1is already possible, but a parametertype2scorewould be more natural. Just have to check iftype2scoreisNone, and if not, settype2score_1astype2score. Existing code should then handle