

Data Mining Amazon.com Wish Lists 183
Dr. Webster writes "In his article "Data Mining 101: Finding Subversives with Amazon Wishlists," Tom Owad of Applefritter outlines a way in which one could build detailed personal profiles of hundreds of thousands of U.S. citizens in a matter of hours. Reading habits, personal tastes and even political party affiliation could be inferred from the results, and through the use of Yahoo! People and Google Maps, one could even map out geographically where people with certain interests or affiliations live, down to their address. Most surprisingly, the process of doing this is completely legal, and doesn't even violate Amazon's Conditions of Use."
Mining voluntary information on a public website? (Score:4, Insightful)
Re:Mining voluntary information on a public websit (Score:3, Insightful)
Re:Mining voluntary information on a public websit (Score:2)
If not then you can't. Up there by choice.
Re:Mining voluntary information on a public websit (Score:2)
Re:Mining voluntary information on a public websit (Score:5, Funny)
Wishlist
This list is for: Jaish Al Ashurah
Birthday: None Entered
Shipping Address: Private
Unique Facts: A shadu la ilaha illah Allah
Total items: 10
"The Anarchist's Cookbook" by William Powell
"Improvised Explosives: How To Make Your Own" by Seymour Lecker
"Ultimate Sniper: An Advanced Training Manual For Military And Police Snipers" by John Plaster
"Crusades Through Arab Eyes" by Amin Maalouf
"The Protocols of the Meetings of the Learned Elders of Zion With Preface and Explanatory Notes" by Sergius Nilus, Henry Ford, and Victor E. Marsden.
"Explosive Dusts: Advanced Improvised Explosives" by Seymour Lecker
"Creative Cloth Doll Making: New Approaches for Using Fibers, Beads, Dyes, And Other Exciting Techniques" by Patti Medaris Culea.
"The Tragedy of Karbala" by M.A. Naquvi
"51 Documents: Zionist Collaboration With the Nazis" by Lenni Brenner
"How to Build a Nuclear Bomb: And Other Weapons Of Mass Destruction" by Frank Barbaby
Re:Mining voluntary information on a public websit (Score:2, Insightful)
Re:Mining voluntary information on a public websit (Score:2)
Re:Mining voluntary information on a public websit (Score:2)
Re:Mining voluntary information on a public websit (Score:2)
Keep the insults coming, you're just building things up.
Re:O good gravy, I did it again. (Score:2)
Re:Mining voluntary information on a public websit (Score:2)
Not only does the group name indicate "Shia", like you noticed, but so does one of the picks: "The Tragedy of Karbal
Re:Mining voluntary information on a public websit (Score:4, Informative)
Re:Mining voluntary information on a public websit (Score:2)
3 Dead Trolls in a Baggie (Score:2, Funny)
Lie,Lie,lie... Lie about your age, your gender and your race.
This is not a story. This is not news that matters (Score:3, Insightful)
Re:This is not a story. This is not news that matt (Score:5, Insightful)
I see (Score:3, Funny)
Re:I see (Score:5, Funny)
No, that's because you ordered those Paladin Press [paladin-press.com], Delta Press [deltaforce.com], and IMS [imsplus.com] catalogs.
Point of the article (Score:5, Insightful)
Re:Point of the article (Score:2)
And this speculation from yet another Mac Rumour Site should be taken with how many grains of salt too?
This thread should prove interesting [applefritter.com]
Re:Point of the article (Score:2)
Re:Point of the article (Score:2)
The FBI can find out where I live without Amazon or Google.
If they want to know what I want as a present, well, I want everyone to know.
If I wanted "Making WMD for Dummies" I would just buy it, not ask for it as a gift.
Re:Point of the article (Score:3, Insightful)
The FBI can find out where I live without Amazon or Google.
Yeah, but the FBI isn't supposed to know that you bought a book called "Why Bush is a Tyrant." The point being, it becomes quite dangerous if the government is allowed to keep tabs on what you read, because the political freedom which comes from freedom of speech requires that ideas can be exchanged and learned without fear of consequences.
Re:Point of the article (Score:2)
But "sleepers" usually try everything not to stick out. Including not going to meetings of radicals or reading radical books (or at least buying them on the Internet). Wouldn't this just be helpful with stupid terrorists?
Just like that whole e-mail interception thing. Do terrorists really communicate using unencrypted e-mail?
Re:Point of the article (Score:2)
Re:Point of the article (Score:2)
But I put it on a public list. I made it public on purpose. This is not a private information, I made it public for all to use. Good or bad. If I didn't want the public (strangers, FBI, my local newspaper) to know, I shouldn't have made it public.
I made it public, so I don't get what people are complaining about.
Re:Point of the article (Score:2)
Re:Point of the article (Score:2)
I say I would like a particular book for a gift. Exactly how does that impact you and your privacy?
>"hey, it was my penis that was exposed, I don't see what she has to bitch about."
I really don't see how this applies.
Do you find my wish list offensive?
How does a group of people publishing their wish list cause you a problem? (If Amazon removes the wish-lists feature, how does this im
Here's the problem (Score:2)
But who is a 'terrorist'? (Score:2)
Re:Point of the article (Score:2)
If you don't want the FBI to know what you want or bought, don't put it on a public wish-list.
Am I missing something here?
Suppose you posted something about yourself on USENET. Do you have a reasonable expectation that this information would be impossible, through currently legal means, for the FBI to find out about it?
Re:Point of the article (Score:2)
This is for all the folks who typically respond with "you have nothing to fear if you have nothing to hide". Well, my reading habits were nothing to hide five years ago. Today, I'm not so sure. How about yours?
Re:Point of the article (Score:2)
If you have problems with putting things on a public wish list, how do you even do anything in public without fear?
How do you have a credit-card? Drivers license driving a car with a license plate? Do anything in any place with security camera? Sign books from a library? Leave finger-prints at a resturant? Use cash with its security-strip and unique serial number? Have a
Re:Point of the article (Score:2)
Re:Point of the article (Score:3, Interesting)
Re: (Score:3, Funny)
Re:Point of the article (Score:2, Interesting)
Let's see if I understand this...
A bunch of people in America, all from a peace loving and gentle religious background, flew airplanes into skyscrapers, attempting to kill tens of thousands of people and succeeding in murdering more than the Japanese killed in Pearl Harbor. This was 8 years after another group, from the same peace loving and gentle religious background, used a large bomb in an attempt to achieve the
Re:Point of the article (Score:3, Insightful)
Now, I believe you and I both completely agree that asshole fundamentalists with bombs should be r
Re:Point of the article (Score:2)
Which hippies were *prevented* from flying (which, by the way, has NOTHING to do with the Patriot Act, or for that matter, privacy invasions other than right at entry to the airport)?
I have yet to read of even one abuse of the Patriot Act. I have yet to read of any action taken by the Federal Government against innocents based on any of the expanded war powers, other than random searches (which included catching
Re:Point of the article (Score:2)
Let's start with the no-fly list:
http://www.salon.com/news/feature/2003/07/25/no_fl y/index_np.html [salon.com]
http://sfgate.com/cgi-bin/article.cgi?f=/c/a/2002/ 09/27/MNNOFLY.TMP&nl=top [sfgate.com]
http://www.globenet.free-online.co.uk/reports/prot estersdetained.htm [free-online.co.uk]
http://www.freerepublic.com/focus/f-n [freerepublic.com]
Re:Point of the article (Score:2)
And that is a very different thing. The demonstrators weren't prevented from flying to their demonstration. They were subject to extra scrutiny, and sine they obviously cut their time before departure too short, some missed the flight.
This just breaks my heart. I mean really, I'm sobbing as I type this. You see, I'm on the same list. And I understand perhaps a bit abou
Re:Point of the article (Score:2)
1. if the government REALLY wanted to make air travel straight, it'd mandate a solid cockpit door and allow pilots to carry firearms, plus make sure the door stays locked throughout the entire flight. INSTEAD, it does all this bullshit with the TSA and the NO-FLY-LIST (stop pretending it doesn't exist, you're embarassing yourself) because it gives them a little bit of extra power over us citizens and gets us used to diminis
Re:Point of the article (Score:2)
Sorry, dude, but Republicanism has nothing to do with it and being concerned about the future of my country has a heck of a lot to do with it. If 9-11 had happened in the Hillary administration, I would still be in favor of the government trying to stop the terrorists, even though I trust Bush far more than the Clintons to not abuse the system (based on the history of both administrations).
As to your "how to fix it" - did it ever occur to you that they might, just might not use airplanes next time? T
Re:Point of the article (Score:2)
Redneck moron.
Re:Point of the article (Score:2)
It's especialy unconfortable when the gorilla wears red white and blue and is the one doing the worrying. OTOH: Nobody wants ants to eat their foundations, if someone gives their info freely, don't they also forefiet their right to control what it is used for?
Re:Point of the article (Score:2)
Just because our system isn't designed to protect privacy doesn't mean we shouldn't be consideri
Re:Point of the article (Score:2)
The only practicable response the public has is to ensure we have the right to watch those who are watching us. I think the kind of beauracracy you propose would simply make the tool too expensive for anyone other than a
Re:Point of the article (Score:2)
That's exactly the problem I was referring to.
Just wait. When the terrorists do something *really* bad - something that a large number of Americans can see happening to themselves if the government doesn't stop it.
Then you'll perhaps see the ant grow to the size of, perhaps, a watchdog, and perhaps a
Re:Point of the article (Score:2, Interesting)
The problem I and many others have is that we are pretty sure that even if the government had all the data mining capabilities in the world, a large terrorist organization will still find a flaw in the system and abuse it. The issue is not that we do
Re:Point of the article (Score:2)
In other words, you can
Re:Point of the article (Score:2)
I disagree with the assertion that "the war on terror" is a war. It is a series of police actions against disparate groups with a variety of causes. The brits did not treat the IRA as a war a
Re:Point of the article (Score:2)
I will lay out the fundamentals of the argument here in its most simple form.
The new and UNIQUE threat is;
1) A very large group of people who are willing to die to kill large numbers of us. We last encountered that in World War II with the Japanese - remember them? But the Japanese never were able to kill as many Americans in America as
Re:Point of the article (Score:2)
(1) - Not new or unique, there have always been a "A very large group of people who are willing to die to kill large numbers of us."
(2) -
"The latter means..." everyone has the potential to use WMD, simply because they exist. There is no "new threat" since 9/11, the new threat arrived in 1945. Since then it has been theoretically possibl
Re:Point of the article (Score:2)
Well 99% of the people here don't get it (Score:4, Insightful)
No the FBI or anyone else would never bother with amazon wish list. They would simply get the sales records. This guy does not have access to those so he uses what he can to prove his point.
Yes it is scary. Especially for those of us who have family (or more to the point do not have family) killed for expressing the wrong ideas.
I however don't think we should blame the FBI or similar agencies, they are the instruments of us the people. It is we who have voted the current goverments into power. Corruption you say? Well then it is you and me that have allowed that to happen. I do not believe in the mythical innocent citizen. Others have died for freedom. No reason we should be allowed to sit on our backsides and complain our freedoms are taken away. FIGHT
Not that I will of course. I know deepdown that what is happening is wrong and also know that I am one of the cattle. Perhaps it will make it easier when I am put in a cattle wagon to be gassed.
The problem with fighting for your freedom is that one persons freedom fighter is another persons terrorist.
I ain't got an answer or a solution except to suggest "PAY CASH". Even if your part of the herd there is no reason to make it any easier for them to send you off to the slaughterhouse.
Will it happen? It has happened countless times before. Check the McCarty trials. The treatment of Japanse americans vs German americans. The gunning down of american citizens by police during peace protests. The way england handled the RIA and labor strikes. All of them pretty recent.
Something scary might happen in our lifetimes. Or not. This is one tiny example to prove that it won't be hard on the technical side. Now all we need to is to elect leaders crazy enough to do it. /me looks at the current leaders of the "free" west. Too late.
Re:Well 99% of the people here don't get it (Score:2)
Re:Well 99% of the people here don't get it (Score:3, Interesting)
Number of Americans killed in the last 50 years by *the federal government* for expressing the wrong views: 0. This leads me to wonder who in your family you are referring to.
Number of Americans killed by government for expressing the wrong views:0
Number of Americans killed by government accidently during protests 4 that I know of, at Kent State.
Number of Americans by or on behalf of Joe McCarthy: 0 (but Joe McCarthy was certainly a person who
Re:Well 99% of the people here don't get it (Score:2)
Re:Well 99% of the people here don't get it (Score:2)
Re:Well 99% of the people here don't get it (Score:2)
The Catholic Church had more than "some pedos". It had a slew, and it went to great lengths to cover up their abuses, harass or discredit their accusers and in the process enable the predator Priests to assault still more children. For e
Re:Well 99% of the people here don't get it (Score:2)
THAT IS COMPLETE NONSENSE>
In the United States, the Catholic Church was becoming more liberal, as the 60's generation rose to positions of power. Today, the Catholic bishops are still far more liberal than the Vatican. If fact, conservative Catholics blame the *liberalization* of the Church for the sexual abuse scandal. One reason was a liberal decision to admist homosexuals to the seminaries. Now, you may find non-politicall
Most subversive anarchists... (Score:5, Insightful)
Re:Most subversive anarchists... (Score:2)
You think stoners and crack heads are that smart?
I imagine that there are people who have various other titles in their wishlists that are blatantly suggestive of mostly illegal
Re:Most subversive anarchists... (Score:2)
Re:Most subversive anarchists... (Score:2)
We have "caught" many of the hijackers from 9/11 but they still achieved their goal. Say we find 1,000 people that buy books about jihad against the US...
droogs (Score:2)
What? (Score:3, Insightful)
On a final note, the FBI is now hiring computer scientists to implement a project that sounds very similar to what I just did:
"Currently, the FBI is strengthening systems engineering in order to tie new systems together architecturally and ensure that standards for custom and packaged applications are enforced, and it needs engineers to accomplish this goal, the agency said.
(etc...)
Where does he read data mining into this? I read that the FBI wants to update their computers to make their databases better. Their databases.
This article strikes me as scare mongering, and until I hear that the government plans on breaking the knuckles of people who read Aldous Huxley, I don't care about what's merely possible.
Re:What? (Score:3, Informative)
These days, it wouldn't even take an Act of Congress for Amazon's databases to become FBI databases...
Re:What? (Score:2)
My understanding of it is that it's basically just a big modernization project, with no sinister elements whatsoever. Apparently there's also a big problem with lots of information bein
Just to point out (Score:5, Insightful)
Even his crude filtering techniques can yield worthwhile leads for police/FBI. He says that the first result for bible is "The Cannabis Grow Bible: The Definitive Guide to Growing Marijuana for Recreational and Medical Use".
Is it so hard to imagine that a certain fraction of people with that book on their wishlist may either be growing weed, or have it in their possesion? Or that a percentage of people 'wishing' for the Improvised Munitions Handbook (printed by our favorite Uncle Sam @ the DoD) aren't chemists or demolitionists?
/doesn't have an Amazon wishlist and never will
Re:Just to point out (Score:2)
So what? It doesn't add up to probable cause.
I have all kinds of books that aren't popular. I've never used the information in them to break the law. Sure, I have the recipies for RDX, Composition C1 and Semtex, I know how to construct shaped charges too but I'm not going to make them. I also know how to convert a firearm to full auto. I'd never do it, but I k
Re:Just to point out (Score:2)
My general point is that some enterprising police officer might decide to do this, put whatever information he gets together with what they know about their local druggies and start making people's lives difficult.
Anyways, it is one thing to have knowledge, it's another to be using it. Those are the people the police are interested in.
Re:Just to point out (Score:2)
What if the cop who decides YOU are a freak deserving of "special treatment" just happens to be an idiot who barely passed his coursework? What if he's a big, stupid goon and he just didn't like your face (so now he's going to pick on you)?
That DOES happen, you know. Practically every town in the U.S. has at least one cop whose neighbors consider him a "live one". Maybe he's the guy weari
Re:Just to point out (Score:2)
Or even worse that you're now dating his ex-girlfriend or ex-wife.
Cops in general may be very professional, smart, and trustworthy -- but it only takes one knucklehead to make your life a living hell.
I don't share your positive opinion of Cops. I think that they tend to
Re:Just to point out (Score:2)
well (Score:4, Interesting)
http://www.co.durham.nc.us/common/PublRecordsdB.c
You can also figure out how much someone's house is worth, what they paid in taxes, etc.
It starts to get a little scary though when your search for public records reveals mortage applications with the individual's SS# listed on the sheet. All available online, and provided for by your very own government!
Re:well (Score:2)
http://www.co.durham.nc.us/common/PublRecordsdB.c
Yeah, but there's only one recognized party - Demopublican - in NC anyway, so what's the point in looking it up? Oh, right, you can also be "unaffiliated." Whoopty-f'in-do!
Re:well (Score:2)
WTF is wrong with you? (Score:2, Insightful)
It shouldn't be surprising, it's common sense. Why in the fuck should it be illegal or against Amazon's conditions of use to read information in someone's wish list? The whole point of a wish list is so that other people will know what books you want.
LK
Re:WTF is wrong with you? (Score:2)
Re:WTF is wrong with you? (Score:5, Insightful)
For example, say you maintain a Slashdot identity that you don't link to your real name. While no one post of yours may be sufficient to tie your identity to your name, the sum total may be sufficient.
Or security cameras. Most people don't worry about *one* security camera, but a lot of people get concerned when they are constantly being monitored by cameras which are tied together by computer to monitor where they go each day.
Re:WTF is wrong with you? (Score:2)
Which I *might*.
While no one post of yours may be sufficient to tie your identity to your name, the sum total may be sufficient.
That is true. I make a common typo. I don't always release the shift key before the second letter at the beginning of a sentence. REsulting in the mistake that I have just intentionally made. I suppose if someone were to parse all of the Slashdot comments in history they could come up with a lis
Re:WTF is wrong with you? (Score:2)
Sure, but there's a *benefit* to having a pseudonym -- you can accure reputation, yet need not associate your ideas with your real name.
My point is not that it is (or even should be) feasible to do this and remain anonymous -- I don't want to get stuck on the details of this particular example. I'm just pointing out that, assuming that it *is* possible to prevent people from compiling information, there a
Re:WTF is wrong with you? (Score:2)
Good work. I don't use Amazon and as such have never read their conditions of use, but even so. It's a corporate policy, not a law. Their legal department can make all kinds of noise, but in the end all they can really do to the man is cancel his account and/or ban his IP.
LK
Most people use wishlist once and then never again (Score:2, Interesting)
And you thought you were funny... (Score:2, Funny)
It may be more relevant that it may appear. (Score:2, Insightful)
First, on relevance of wishlists:
Granted that wishlists are not the most accurate estimates of your preferences, what is? My list contains over 50 books, and for the most part they are all related to each other. In fact, I would say that by looking at my list you would have a pretty accurate gauge to measure my interests. Am I an anomaly? Possibly. (Though I doubt it)
But
Wonder how many celebrity wish lists... (Score:2)
Social Networking Application (Score:2)
a) I only want to share my wish list with people I trust;
b) I only want to share certain sublists with certain people.
They have this already! (Score:4, Informative)
a) I only want to share my wish list with people I trust;
b) I only want to share certain sublists with certain people.
They do! Go to "edit wishlist" and the second item after you name the list is "This list will be viweable by:" and it gives three choices: "Anyone who searches for me," "Only people I have invited with the 'Share this list' feature," or "Only me."
Re:They have this already! (Score:2)
Re:They have this already! (Score:2)
Yes, you can name the list. And yes, you can limit who sees it. And yes, it makes the whole article nonsense. It not the first time people went all hysterical over an ant hill.
What could you do with Purchase Circles? (Score:3, Interesting)
When they first started the idea, they gave it some PR, but now it's sort of a low man on the totem pole, relegated to the backwaters. When I checked 6400+ cities, only 2800 of them were recording enough activity to warrant a bestseller or "uniquely popular" list.
They generate the 2 types of lists for 5 classes of items: books, CDs, DVDs, toys, and consumer electronics. Now this might not be as potentially compromising as finding out a single person was ordering subversive books. Yet finding out a small town in Alabama's bestselling genre is showtunes [geostats.info] is definitely something interesting.
- Greg
Re:What could you do with Purchase Circles? (Score:2)
I know. I used to work there. :-)
- Greg
Everyone is missing the point (Score:4, Funny)
I smell a fully monetized eyeball!
If you use your full name for an email address (Score:3, Interesting)
Re:If you use your full name for an email address (Score:2)
Keep in mind that if you used that email address to register for a service that required your full name they've probably got both anyway. Or used your email address in a purchase with a visa card. Or etc.
That's what makes data integration efforts so dangerous - data mining on its own isn't really that useful. Data mining on pre-integrated and cleansed data *is*.
Same trick works in the UK (Score:2)
Despite our much stronger data protection legislation [opsi.gov.uk], exactly the same trick works in the UK [amazon.co.uk]. Which just demonstrates that the whole data protection hoo hah is nonsense...
Data Mining vs Privacy (Score:3, Informative)
Suppose you were a person who likes surfing the net to read things like "The Anarchist's Cookbook" (an entertaining read) who is also curious about Muslim Extremisim (because it is so often in the news) and is planning a car trip with your family to New York City and Washington D.C. Perhaps you have downloaded maps and driving directions to the Capital, the White House and the United Nations Building from MapQuest. Maybe you have visited EBay and bought some reloading equipment (because you are a sport-clay shooter).
Now imagine some data mining application at fbi.gov puts all of this information together and concludes that you are an extremist who is about to embark on a trip where you plan on bombing the United Nations building in New York City and the Capitol and the White House in Washington DC!
Seperate and disparite pieces of data aren't always able to fit nicely into a simple formula. This is where the danger of this kind of information comes in. Taken seperately and considered without an adequate foundation, these "facts" tend to support a totally erronious conclusion. Next thing you know, someone is quietly asking questions about you abd you have no idea why.
These kinds of things have happend to innocent people before. Someone I know faced scrutiny years ago shortly after the Oklahoma City bombing. There was no real reason for his being suspect and it took a long time to figure out why they looked at him. The FBI questioned his neighbors, they followed him, photographed his home, and in general made life uncomfortable for him.
It took time to figure it out but, we finally concluded that there were reasons why he came to their attention. They were:
- He was a gun collector
- He bought gunpowder by the pound (he was a re-loader)
- He worked at a facility where he may possibly have had access to amonium nitrate
- He lived alone
- He lived in the wrong place (outside of town in an area linked to suspects)
- He had several 55 galon oil drums on his property
- He was a member of the NRA
To the FBI all this information seemed to indicate that he could possibly be linked as the third man in the Oklahoma City bombing. Nothing could have been further from the truth but for a few tense weeks, he was the focus of enough attention so that he felt like he could not visit friends, go target practicing, or do much of anything. He got paranoid and asked us to not call him because he thought he may be wiretapped. It really ate him up inside and he had done nothing wrong. The truth of the matter is that he is one of the most law-abiding people around. He had not done one illegal thing to draw this suspiscion on him. Litterally, he was just in the wrong place at the wrong time. He is just a kind of quiet guy who likes to keep to himself.
I don't think that data mining brought this investigation on him. I think his name simply popped up on too many lists (which is in a way, a form of manual data mining). Still with computers and access to hundreds or thousands of different data sources, the possibilities have compounded themselves making this kind of process likely to impact too many poeole. Innocent people.
You didn't RTFA (Score:4, Interesting)
He maps out (using google maps) the locations of the people who read certain books.
A lot of these wishlists have a city, state, full name and birthdate attached to them... which is more than enough for google to give you a street address (though not always with 100% accuracy)
Just to test it, i randomly picked a 'sarah' who had a wishlist. Turns out there's only one Sarah Johnson in Portland, OR.