aleph data search
A toolkit for data search, management and anlaysis in investigative reporting. Developed by @occrp and many others.
Grow your team on GitHub
GitHub is home to over 50 million developers working together. Join them to grow your own development teams, manage permissions, and collaborate on projects.
Sign up
Pinned repositories
Repositories
-
aleph
Search and browse documents and data; find the people and companies you look for.
-
followthemoney
Data model and processing tools for investigative entity data
-
docs
GitHub mirror of the GitBook documentation
-
react-ftm
React UI component library for aleph/followthemoney
-
opensanctions
An open database of persons of interest and politically exposed persons
-
ingest-file
Ingestors extract the contents of mixed unstructured documents into structured (followthemoney) data.
-
convert-document
A docker container for LibreOffice and unoconv, used to generate PDF files from office-type documents.
-
memorious
Distributed crawling framework for documents and structured data.
-
alephclient
API client for Aleph, supports bulk entity and document upload.
-
aleph-helm-charts
Helm charts for Aleph
-
servicelayer
Common interface definitions for aleph toolkit services and applications
-
followthemoney-predict
Experiments with FtM record linkage
-
datadesktop
Desktop graph visualization application
-
-
translate-service
Demo: document processing service for automated translation
-
datacommons
A fleet of Memorious scrapers for crawling various open data sources
-
msglite
Forked from TeamMsgExtractor/msg-extractorExtracts emails and attachments saved in Microsoft Outlook's .msg files
-
followthemoney-store
Fragment storage layer for FollowTheMoney entities (formerly "balkhash")
-
-
-
-
offshoreleaks
Converter for ICIJ Offshore Leaks data into FollowTheMoney format
-
languagecodes
A Python helper library to convert between ISO 639 two- and three-letter codes.
-
synonames
Trying to generate name synonyms from wikidata
-
pdflib
Binary Python bindings for poppler utils for content extraction
-
ideas
Long-term development ideas for the Aleph toolkit
-
followthemoney-cellebrite
Generate FollowTheMoney entities from Cellebrite XML reports
-
followthemoney-ocds
Import data formatted as OpenContracting Data Standard (OCDS) objects into FollowTheMoney
-
example-personadeinteres
Example how to load mixed document/entity graphs to Aleph
-
fingerprints
Make it easier to compare and cross-reference the names of companies and people by applying strong normalisation.

