Description
Unfortunately, as it turns out, distributing the contents and metadata surrounding tweets is a violation of section 6b of Twitter’s Developer Policy. Twitter politely asked me to remove the downloads without sending lawyers, and I very much appreciate that approach.My guess? This policy exists to protect the privacy of their users. Any downloadable dataset could include information that was subsequently deleted or made private by its owners, or removed by Twitter.Pursuant to their guidelines, I’ve replaced the original dataset with a much more limited one, containing only the tweet ID and user ID. You can download it as a 9MB CSV or a 3MB ZIP.I know this is far from ideal, but you can use this information to reconstruct the original dataset by using Twitter’s statuses/lookup API method, 100 tweets at a time. With their API rate limits, you should be able to grab up to 10,800 tweets an hour. Reconstructing the entire dataset would take around 29 hours.