Library of Congress To Archive All Public Tweets
After the recent announcement that Groklaw will be archived at the Library of Congress, mjn writes with word that the push to archive more digital content continues: "The US Library of Congress announced a deal with Twitter to archive all public tweets, dating back to Twitter's inception in March 2006. More details at their blog. No word yet on precisely what will be done with the collection, but besides entering your friends' important updates on the quality of breakfast into the permanent archival record, the deal may improve access for researchers wanting to analyze and mine Twitter's giant database."
Your tax dollars at work... (Score:5, Insightful)
Given the signal to noise ratio for most tweets, I'm not convinced this is a particularly good use of resources...
Just because you can do something, doesn't mean you have to!
Why? (Score:0, Insightful)
Seriously, why?
Re:Your tax dollars at work... (Score:5, Insightful)
It's not like it takes a lot of space to archive them; it's just 140 characters per tweet. There's a lot of useless information in newspapers and books too, but those have been archived because some of that info is valuable or might become valuable.
Re:hmm... (Score:4, Insightful)
all of them???
Disk space is cheap...
They should get a copy of the internet archive while they're at it.
Re:hmm... (Score:4, Insightful)
I suspect a lot of the interesting information is in the aggregate anyway, not individual tweets: things like trends, analysis of subgroups, linguistic analysis, etc.
Re:Why? (Score:3, Insightful)
I would hope that a social scientist in the 23rd century does not think that the average human of today posts every triviality of his life, like most current Twitter users do.
Re:Your tax dollars at work... (Score:2, Insightful)
Given the signal to noise ratio for most tweets, I'm not convinced this is a particularly good use of resources...
Just because you can do something, doesn't mean you have to!
It's a fantastic idea. It's probably only a few TB of data, but it represents the unedited reaction of ordinary people to historical events and a detailed insight into their everyday lives.
Certainly could be the users (Score:3, Insightful)
A library archiving your work does not necessarily imply that you don't own the copyright on it.
Re:Your tax dollars at work... (Score:3, Insightful)
50 million tweets/day
140 characters of message
60 bytes of metadata (timestamp, sender id, etc.)
10 GB of twitter archive per day
10 TB per 3 years
What does 1 TB cost these days? About $100?
Storage space will indeed be an inexpensive part of the cost, and will decline in price at about the same rate the traffic is growing.
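For anyone who wants to check the back-of-envelope math above, here is a minimal sketch in Python. It uses only the figures from this comment and makes a few simplifying assumptions that are not in the original: one byte per character, a flat 50 million tweets/day for the whole period (no growth), decimal units (1 GB = 10^9 bytes), and no compression, indexing, or redundant copies. The function and constant names are made up for illustration.

```python
# Back-of-envelope storage estimate using the figures quoted in the comment.
# Assumptions (not from the original): 1 byte per character, constant daily
# volume, decimal GB/TB, no compression or replication.

TWEETS_PER_DAY = 50_000_000   # from the comment
MESSAGE_BYTES = 140           # 140 characters, assumed 1 byte each
METADATA_BYTES = 60           # timestamp, sender id, etc.
COST_PER_TB_USD = 100         # the commenter's rough guess

GB = 10**9
TB = 10**12


def archive_bytes(days: int) -> int:
    """Raw bytes needed to store `days` worth of tweets at a flat rate."""
    return TWEETS_PER_DAY * (MESSAGE_BYTES + METADATA_BYTES) * days


per_day_gb = archive_bytes(1) / GB
three_years_tb = archive_bytes(3 * 365) / TB
raw_disk_cost_usd = three_years_tb * COST_PER_TB_USD

print(f"Per day:       {per_day_gb:.1f} GB")
print(f"Per 3 years:   {three_years_tb:.2f} TB")
print(f"Raw disk cost: ${raw_disk_cost_usd:,.0f}")
```

Under those assumptions this works out to 10 GB per day and just under 11 TB over three years, which matches the rounded numbers above. Real totals would differ with multi-byte characters, richer metadata, and growing traffic.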