How to batch convert pdf files to text

2 minute read

Frequently I am asked: I have a bunch of pdf files, how can I convert them to plain text so that I can analyze them using quantitative techniques? Here is my recom...




Technical questions

less than 1 minute read

Just finished teaching at the Essex Summer School in Data Analysis, Session 1. Got a lot of technical questions from my class…

CEU Computerized Text Course Announcements

less than 1 minute read

This post concerns the short course I am teaching at Central European University, Budapest from 14-21 April 2011. Stay tuned to this post for future announce...


European Political Science Association

less than 1 minute read

The recently formed European Political Science Association has just issued a call for papers for its 1st Annual General Conference, to be held in Dublin, Ire...

Statistical haiku

less than 1 minute read

See Keisuke Hirano’s Haiku page. Here is one of my favorites:


Field Seminar B, 2009-2010, PhD

3 minute read

The Field Seminar B course handout is available here, although we are still working out the content and some of the topics for later sessions.

Update on unlocking iPhones

less than 1 minute read

The best way to unlock modern iPhones running the most recent Apple firmware is to use RedSn0w. Here is a guide for unlocking the 2G iPhones (such as mine). I...

How to set proxy settings for R (Mac OSX)

less than 1 minute read

I run R behind a firewall, and found it tricky to set the proxy settings so that I could directly install packages, access outside data using load(url(...


MSc Modules on Electoral, Party Systems

less than 1 minute read

As promised I have now posted the presentations from the weeks in Government Institutions. These are in pdf format so should be readable by everyone. Note th...

Course-related discussions

less than 1 minute read

This will be a place for posting announcements (and comments) related to courses I teach.


New web page

less than 1 minute read

I have now completed moving my old web page to the new server, and it now has a completely new look, feel, and function as well. The use of WordPress allows ...