New York City Taxi Trip Data

The NYC Taxi Trip Data set, covering 2008 to 2013, was obtained through a Freedom of Information Law (FOIL) request to the Taxi & Limousine Commission. The data is currently hosted on Google's BigQuery service, where you can run SQL queries and batch jobs against it. It contains nearly 850,000,000 rows and takes up 98 gigabytes of disk space.

Sample query to select every trip from 2010 onward that was picked up at JFK Airport:

SELECT
  pickup_datetime, dropoff_datetime,
  ((dropoff_datetime - pickup_datetime) / 1000000) AS trip_duration,
  trip_distance, passenger_count,
  fare_amount, tolls_amount, surcharge, mta_tax, tip_amount, total_amount, payment_type,
  pickup_longitude, pickup_latitude,
  dropoff_longitude, dropoff_latitude
FROM
  [alien-climber-851:nyc_taxi_redacted.trip_data]
WHERE
  YEAR(pickup_datetime) >= 2010
  AND pickup_latitude > 40.641898
  AND pickup_latitude < 40.659925
  AND pickup_longitude > -73.809503
  AND pickup_longitude < -73.775857
ORDER BY
  pickup_datetime, dropoff_datetime
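
For readers who export a subset of the table and want to reproduce the filter locally, the logic of the query can be sketched in Python. This is only an illustration: the sample trips are made up, the helper names `in_jfk_box` and `trip_duration_seconds` are hypothetical, and the duration is taken to be in seconds on the assumption that the timestamp difference in the query is in microseconds (which the division by 1000000 suggests).

```python
from datetime import datetime

# Bounding box copied from the WHERE clause of the query above.
JFK_LAT = (40.641898, 40.659925)
JFK_LON = (-73.809503, -73.775857)


def in_jfk_box(lat, lon):
    """Return True when the point lies strictly inside the JFK bounding box."""
    return JFK_LAT[0] < lat < JFK_LAT[1] and JFK_LON[0] < lon < JFK_LON[1]


def trip_duration_seconds(pickup, dropoff):
    """Trip length in seconds (assumed to match trip_duration in the query)."""
    return (dropoff - pickup).total_seconds()


# Hypothetical sample trips, made up for illustration (not rows from the data set).
trips = [
    {"pickup": datetime(2012, 5, 1, 8, 0), "dropoff": datetime(2012, 5, 1, 8, 45),
     "pickup_latitude": 40.6450, "pickup_longitude": -73.7850},   # inside the box
    {"pickup": datetime(2012, 5, 1, 9, 0), "dropoff": datetime(2012, 5, 1, 9, 10),
     "pickup_latitude": 40.7580, "pickup_longitude": -73.9855},   # outside the box
    {"pickup": datetime(2009, 5, 1, 9, 0), "dropoff": datetime(2009, 5, 1, 9, 30),
     "pickup_latitude": 40.6450, "pickup_longitude": -73.7850},   # before 2010
]

# Same conditions as the WHERE clause: year >= 2010 and pickup inside the box.
jfk_trips = [
    t for t in trips
    if t["pickup"].year >= 2010
    and in_jfk_box(t["pickup_latitude"], t["pickup_longitude"])
]

# Same ordering as the ORDER BY clause.
jfk_trips.sort(key=lambda t: (t["pickup"], t["dropoff"]))

durations = [trip_duration_seconds(t["pickup"], t["dropoff"]) for t in jfk_trips]
print(durations)
```

Note that the box is a rectangle around the airport, so it is an approximation of the airport grounds rather than an exact boundary.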

Cash vs credit card payments, per day, from 2008 to 2013

[image]
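
A per-day breakdown like the one in this chart could be computed along the following lines. This is a minimal sketch, assuming the chart counts trips by payment type. The payment codes `CSH` and `CRD` and the sample rows are hypothetical, and the actual values stored in `payment_type` may differ.

```python
from collections import Counter
from datetime import datetime

# Hypothetical payment codes; the real payment_type values may differ.
CASH, CREDIT = "CSH", "CRD"

# Hypothetical sample rows: (pickup_datetime, payment_type).
rows = [
    (datetime(2010, 1, 1, 0, 15), CASH),
    (datetime(2010, 1, 1, 9, 30), CREDIT),
    (datetime(2010, 1, 1, 18, 5), CASH),
    (datetime(2010, 1, 2, 7, 45), CREDIT),
]


def payments_per_day(rows):
    """Count trips per (calendar day, payment type) pair."""
    counts = Counter()
    for pickup, payment_type in rows:
        counts[(pickup.date(), payment_type)] += 1
    return counts


daily = payments_per_day(rows)
for (day, payment_type), n in sorted(daily.items()):
    print(day, payment_type, n)
```

On the full table the same grouping would be done in BigQuery rather than in memory; the sketch only shows the shape of the aggregation.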

Taxi pickups during Hurricane Sandy compared to one week after

[image]
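
A comparison of this kind, an event day against the same weekday one week away, could be sketched as below. The function names and the sample pickups are hypothetical, and the event day of 29 October 2012 (the day Sandy made landfall) is an assumption, since the exact dates behind the chart are not stated here.

```python
from collections import Counter
from datetime import date, datetime, timedelta


def pickups_per_hour(pickups, day):
    """Count pickups that fall on `day`, keyed by hour of day (0-23)."""
    return Counter(p.hour for p in pickups if p.date() == day)


def compare_to_week_offset(pickups, day, weeks=1):
    """Return {hour: (count on `day`, count on `day` shifted by `weeks`)}."""
    other = day + timedelta(weeks=weeks)
    base = pickups_per_hour(pickups, day)
    ref = pickups_per_hour(pickups, other)
    # A Counter yields 0 for hours with no pickups.
    return {h: (base[h], ref[h]) for h in sorted(set(base) | set(ref))}


# Assumption: 29 October 2012 is used as the event day.
event_day = date(2012, 10, 29)

# Hypothetical pickup timestamps, made up for illustration.
pickups = [
    datetime(2012, 10, 29, 8, 5),
    datetime(2012, 11, 5, 8, 10),
    datetime(2012, 11, 5, 8, 40),
    datetime(2012, 11, 5, 17, 20),
]

comparison = compare_to_week_offset(pickups, event_day)
print(comparison)
```

Passing `weeks=-1` compares against the week before instead, which is the direction used for the Hurricane Irene comparison.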

Taxi pickups during Hurricane Irene compared to one week before

[image]

Taxi pickups during President Obama's 2011 NYC fundraiser compared to one week after

[image]