Blog Archives

Building an AWS CloudSearch domain for the Supreme Court

  It should be pretty clear by now that two things I’m very interested in are cloud computing and legal informatics.  What better way to show it than to put together a simple AWS CloudSearch tutorial using Supreme Court decisions

Posted in Law, Programming, Research

Updates to data and statistics on Congressional bill complexity

  When I put together my original post on the length and complexity of Congressional bills, I was hoping to build forward momentum on the project.  The goal was to build a simple, sortable and searchable interface to explore and

Posted in Law, Programming, Research

Now in print: An Empirical Survey of the Population of U.S. Tax Court Written Decisions

  When someone brings up the empirical study of legal citation, most people think of the work of Landes & Posner and Epstein & Martin.  If you’re really cool, you might even think of Dan and me, who have spent quite a while analyzing and

Posted in Law, Research

Upcoming post series: Building a better legal search engine

  Later this month, I’ll be giving a keynote at a meeting on Law and Computation at the University of Houston.  As part of the talk, I’m putting together an example of how I think machine learning and structured data can

Posted in Law, Programming, Technology

Pre-processing text: R/tm vs. python/NLTK

  Let’s say that you want to take a set of documents and apply a computational linguistic technique.  If your method is based on the bag-of-words model, you probably need to pre-process these documents first by segmenting, tokenizing, stripping, stopwording, and

Posted in Programming
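
  For readers who want a concrete picture of the pre-processing steps named in the last teaser above (segmenting, tokenizing, stripping, stopwording), here is a minimal sketch using only the Python standard library.  It is not the R/tm or NLTK code from the post itself; the function names, the regex-based sentence splitter, and the tiny stopword list are all assumptions made for illustration.

```python
import re
from collections import Counter

# Tiny illustrative stopword list (an assumption; real lists are much longer).
STOPWORDS = {"the", "of", "and", "a", "to", "in", "is", "that"}


def preprocess(text):
    """Segment, tokenize, strip, and stopword a document (hypothetical helper)."""
    # Segment: naive split after sentence-ending punctuation.
    sentences = re.split(r"(?<=[.!?])\s+", text.strip())
    tokens = []
    for sentence in sentences:
        # Tokenize and strip: lowercase, keep alphabetic runs only.
        words = re.findall(r"[a-z]+", sentence.lower())
        # Stopword: discard very common words.
        tokens.extend(w for w in words if w not in STOPWORDS)
    return tokens


def bag_of_words(tokens):
    """Count token occurrences, ignoring order (the bag-of-words model)."""
    return Counter(tokens)


if __name__ == "__main__":
    doc = "The Court held that the statute is valid. The appeal is denied."
    print(bag_of_words(preprocess(doc)))
```

  A real pipeline would likely swap the regex splitter and the hand-written stopword list for a library's tokenizer and stopword corpus; the sketch only shows the order of the steps.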