.
Text analysis
Putting things in categories automatically
How X affects Y
Python data science reference
All Projects
- Project Summaries
- AJC: Doctors and sex abuse
-
- Summary
- Downloading one million pieces of legislation from LegiScan
- Taking a million pieces of legislation from a CSV and inserting them into Postgres
- Download Word, PDF and HTML content and process it into text with Tika
- Import content into Solr for advanced text searching
- Checking for legislative text reuse using Python, Solr, and ngrams
- Checking for legislative text reuse using Python, Solr, and simple text search
- Search for model legislation in over one million bills using Postgres and Solr
- Using topic modeling to categorize legislation
- FCC comment bots
- NYT: Trump tweets
- FiveThirtyEight: P-values
- ProPublica: Opportunity Gap
- Stanford: Open Policing Data
- ProPublica: Presidential pardons
- data.world: The FOIA Predictor