The 2008 to 2013 NYC Taxi Trip Data set comes courtesy of a FOIL request to the Taxi & Limousine Commission. The data is currently hosted on Google's BigQuery service, where you can run SQL queries and batch jobs on it. There are nearly 850,000,000 rows and the data requires 98 gigabytes of disk space.
Sample query to select everyone who has been dropped off at JFK Airport:
SELECT pickup_datetime, dropoff_datetime, ((dropoff_datetime - pickup_datetime) / 1000000) AS trip_duration, trip_distance,passenger_count, fare_amount, tolls_amount, surcharge, mta_tax, tip_amount, total_amount, payment_type, pickup_longitude, pickup_latitude, dropoff_longitude, dropoff_latitude FROM [alien-climber-851:nyc_taxi_redacted.trip_data] WHERE YEAR(pickup_datetime) >= 2010 AND pickup_latitude > 40.641898 AND pickup_latitude < 40.659925 AND pickup_longitude > -73.809503 AND pickup_longitude < -73.775857 ORDER BY pickup_datetime, dropoff_datetime