Amazon's Book Search Hits a Snag 299
The Importance of writes "Yesterday, Slashdot readers discussed Amazon's brand new, technically impressive and highly useful book search feature that lets users search the full text of over 120,000 books. Today, the Authors Guild is saying that the publishers don't have the right to let Amazon do this. Uh oh."
odd way to read (Score:5, Insightful)
Re:odd way to read (Score:4, Interesting)
Then again, many other sites offer ebooks for a price... which means they also must have full text versions available. So, i Suppose the publishers are just protecting themselves against possible danger.
oh, and being pains in the asses
Re:odd way to read (Score:3, Insightful)
Don't tell anybody, but I figured out an easier way...
Go to the library, borrow the book and read it...
WITHOUT PAYING FOR IT!
I heard that the library will do this for an unlimited number of people too. I wonder how much authors lose because of this "income stealer".
If people want to read stuff, they should pay. None of this "reading for free" nonsense that a lot of thieving kids think
Dead tree is not digital. (Score:3, Insightful)
Your analogy would only make sense if I could demand the librarian make me a digital DRM-free copy of the book.
The problem here is fairly obvious, Amazon is expecting thousands of authors to "trust us with security," and these authors politely say no and you fall back on a non-sequitar library argument?
Sorry, but bought dead-tree books on rental is not the same thing as a digital copy I can mass-send/share globally.
Frankly, considering what passes off
Re:odd way to read (Score:3, Interesting)
I see the reasoning here, but they should think about how many people aren't buying books because they don't know which book has the information they want. I can think of a few times I've gone to the book store to buy a book with hopes of solving a particular problem. I had to go there, take the book off the shelf, flip a few pages, and even risk reading the solution and ending my demand f
Re:odd way to read (Score:5, Interesting)
The authors are right on this. A service that allows Internet access to a scanned image of an arbitrary page of any book is just begging to be misused. The service doesn't require images of the actual pages to be served. Removing this feature would allow the search to still be useful but would remove the possibility of people downloading the entire book for free.
Re:odd way to read (Score:5, Insightful)
Good thing people don't put thousands of books in a big building and let people read them for free. It's just begging to be misused.
Re:odd way to read (Score:5, Informative)
Re:odd way to read (Score:3, Insightful)
For example I have Harrison's textbook of Internal Medicine. I paid well over $100 for it and I use it maybe every few weeks to look up a differential diagnosis or some reference values. I rarely read more than 2 pages at a time.
If I could find it on-line and look the stuff up you bet I wouldn't have spent the cash for it.
misunderstanding (Score:5, Insightful)
Re:misunderstanding (Score:3, Informative)
Re: (Score:2)
Re:misunderstanding (Score:2)
Re:misunderstanding (Score:2)
Re:misunderstanding (Score:4, Informative)
Re:misunderstanding (Score:2)
Well, there are beginning to be good freely available cookbooks and travel books on the web, but right now, most of the info of that type is somehow attached to a company or local government travel office, which means it probably isn't very objective. Sure, the city of Muncie, IN will try to convince you that it's a travel Mecca, but a real travel book would tell you not to bother going.
Re:misunderstanding (Score:2)
Re:misunderstanding (Score:2)
If the content is 'formmated' correctly (paragraphs, tabs, & whatnot), then changing the font and/or size isn't going to do anything to damage the 'spirit' of the work. If someone happens to prefer a sans serif type to a serif one, and it helps them actually READ the work in question, that's only to the better. Yes, people can and will do stupid things with fonts - but then that would be their problem, would it not? Personal responsibility.
As far as messing up the page numbering, so what? M
Re:misunderstanding (Score:3, Insightful)
Content (Score:3, Interesting)
Exactly what do you call the text from a book? If the pages/text aren't its content, then I guess it doesn't have any. So much for literature.
"...merely letting the public find the right product to then buy"
Consider the ramifications of your statement: I should be able to make tracks from a CD available for free, so that others can determine whether they want to buy it. Whether you think that's the wa
Re:Content (Score:2)
Re:Content (Score:2)
Re:Content (Score:2)
Re:Content (Score:5, Insightful)
Re:Content (Score:3, Insightful)
Or, they could change the amount of text displayed based on the type of content. Less for a cookbook or reference book, and more for a novel. This is the first time anyone's done this, so hopefully a little finetuning will be forthcoming. Demonizing Amazon.com has historically had NO effect on their behaviour, so hopefu
Re:Obvious exploit. (Score:5, Informative)
First prize for forgetting/not reading that this is tied to your credit card, and that Spamazon (forgive maybe, forget no) limits you to a certain number of views total, and also a certain number _per_book_. Enough to stop you reading the whole thing (unless you're patient enough to do it over a whole year - but in that case, why not ask your local library for a copy repeatedly, and wait until they get it in).
Re:Obvious exploit. (Score:3, Insightful)
I have friends.
A group of people can easily download the entire book, stitch it together, and release it to the wild. Not a good thing. I don't know how this works, but it may even be possible for a group of people to do this using a simple program that runs in the background.
There's no way Amazon is ever going to get away with this program - it will be abused. I can understand why Amazon is doing this (they want to be more like a physical book store), but this service is ju
Re:Obvious exploit. (Score:3, Interesting)
As far as tracking the IP, a quick google search for "public anonymizing proxy" will pretty handily take care of hiding your access to anything over standard http.
I guess that puts this post in the violation of the DMCA,
Re:Content (Score:4, Interesting)
Re:Content (Score:2)
Re:misunderstanding (Score:2)
If you read the article, you'll see that this is about contracts with publishers that state that the books in question cannot be placed in an electronic database. Since it is likely that Amazon got this database from the publishers, this is the problem.
Amazon could legally scan all the books in themselves and make the same search available under fair use. The issue here is whether the authors' contracts with the publishers give those publishers the right to distribute their
probably not fair use (Score:3, Interesting)
Re:misunderstanding (Score:3, Interesting)
Then your understanding is incorrect. Amazon makes available the page where the search hit is found, plus the previous and subsequent two pages each, for a total of five pages per hit. In many cases (examples are given as cookbooks and travel books) this may be all the viewer cares about.
In other cases, it doesn't take much ingenuity to figure a way to get the whole book. (The Guild did 100-page sections, as pro
Re:misunderstanding (Score:2)
Gosh, look at it first. You can indeed browse books' pages just as if you had it on your lap and not just your laptop.
Amazon says there's a limit to the number of pages you can do this with, but they give no details and I haven't found a limit yet.
Steven
Not have the right? (Score:2, Insightful)
Re:Not have the right? (Score:2)
Makes sense to me.... (Score:3, Interesting)
That's one security fuckup away from free ebooks for everybody.
Re:Makes sense to me.... (Score:2)
No, it's just a list of the words in the book. They're not going to have all of the thousands of instances of "a" and "the" in each book indexed - they'll index a word once per book.
Nobody said the words were in order of appearance, either.
Re:Makes sense to me.... (Score:2)
Comment removed (Score:5, Insightful)
Re:Interesting. (Score:2)
Re: (Score:2)
Exactly (Score:2)
OTOH, the article raises a valuable point about books like cookbooks, which are just collections of small bits of information, and the simple act of returning a page obviates the need to buy the book in the first place.
It seems to me the authors, publishers, and vendors need to coordinate their efforts and produce satisfacto
Because google *DOSN'T HAVE EVERYTHING* (Score:2)
The amount of usefull information in the world available to people on the internet, compared to whats available though, say, inter-library loan is actualy pretty small. Unless you're talking about a subject like Computer Sciance, or programming.
I mean, try to find a lot of relavent information on the history of Taiwanese Americans (for example). I had to actualy get up off my ass and to the library in order t
Re:Because google *DOSN'T HAVE EVERYTHING* (Score:3, Funny)
What a waste of energy! Instead of clicking a few keys for convenient access to information, you needed: 1) the sun to pump out a bunch of energy for plants. 2) you had to eat a bunch of that food for calories. 3) had to spend that energy using inefficient legs to walk to your inefficient car to drive to the library to check out a heavy deadtree book. 4) that wasted time in transit and in line could have been u
Re: (Score:3, Insightful)
Kind of like a... library? (Score:5, Insightful)
Personally, I think this full text search is a great feature, and will only help with sales.
Re:Kind of like a... library? (Score:3, Insightful)
It is somewhat distressing to me that public libraries, if they were invented today, would be sued out of existance by short-sighted publishers. Despite what the above poster suggests, I don't think there are many people who do not realize what boon to civilization the public library system is. For contrast, look bac
Re:Interesting. (Score:2)
That doesn't matter: the publishers must be in good faith - after all falling sales would hurt them as well. When the authors disagree, it can be for one of two reasons:
1) They honestly disagree with the publishers over whether it hurts sales or not.
2) This is a negotiating tactic for more royalties: Firstly, if searchable books really generate more sales,
Re:Interesting. (Score:5, Interesting)
Think about how very fond you are of "opt out" email. The idea that an author could remove their book, after some elaborate procedure, if they are aware their book is indexed in the first place, is less than compelling.
Mind you, even as an author (but one whose writing if available for free, as well as for money), I'm not per se agreeing with the Author's Guild. What I can see on Amazon looks like fair use quotations. But it might well be possible to easily reconstruct more of the text in a book that would qualify under fair use.
One thing to keep in mind is that authors generally get majorly screwed over by publishers. E.g. Random House isn't really a whole lot more interested in "protecting authors" than the RIAA is in "protecting musicians".... so if a publisher has given permission, don't imagine they do it to help authors, nor even in conformance with the contracts they signed with those authors.
CDs all over again (Score:5, Interesting)
Uh-oh for Amazon (Score:2)
For Amazon: They "purchase" the books. Fair use allows cutting snippets out and showing people. They just built a search engine out of snippets.
Against Amazon: They do not have the authorization to give out whole books, whether in snippets or not. Fair use does not allow complete articles of published material
My opinion: I really dont know. I'd prefer more freedom when it comes to published material, but it's a fair request/statement the authors guild says. It's not like they dem
Re:Uh-oh for Amazon (Score:2)
Re:Uh-oh for Amazon (Score:2)
Fair use also prevents professors from photocopying the whole book/magazine for class use.
Considering how poorly done the law meaning of "fair use" is, it's worthless to give any credance to. Fair use is only fair after a couple million spent in the law coffers.
Re:Uh-oh for Amazon (Score:5, Informative)
False.
Fair use permits everything and nothing. That is, there are no absolutes as to what is and is not a fair use. Anything MIGHT be, or might not be. It _depends_. It depends on the specific facts of the fair use in question, as analyzed through the four-step test of 17 USC 107.
Under the right circumstances, it is totally okay to distribute an entire copy of a book. Under other circumstances it would not be okay to distribute evne a paragraph.
It always depends.
Ease of extraction is too high? (Score:3, Insightful)
Other books at especially high risk include those that sell to the student (particularly college student) market as secondary reading. A student could easily grab the relevant chapter or two out of a book without paying for it.
This whole thing just ain't right, as of yet. If you read the article, you can see that on the one hand, people have figured out how to get 108 pages out of a bestseller (that's unfair to the authors and publishers), and on the other hand, those same authors and publishers are expecting students to purchase entire books just to get the one or two chapters their teacher has directed them to read. Like the new music services, there should be a legal, reasonably priced (oh, boy) way to obtain those two chapters rather than having to purchase the entire book. As for the 108 pages, I am guessing they pulled that out of Neal Stephenson's Quicksilver, yet another doorstop from this prolific author. As someone who has done a fair amount of writing and someone who has done a LOT of reading, I am sympathetic to both sides in this one. Looks to me like Amazon needs to try again.
Re:Ease of extraction is too high? (Score:2)
Or, of course, a college student could go to their university's library, where (*GASP!*) the textbook is probably on reserve. Oh horror of horrors!
Have I got news for the Authors... (Score:4, Interesting)
As far as publishers are concerned they think they are God.
Here's how the publishing world works: Publishers don't actually create anything. Due to today's technology they don't even provide a needed service. But publishers think they own, and created, every piece of thought in the world and that without them we would all be in the dark ages still. They also put on a good show pretending that they are out to protect the rights and income of the material's real creators.
But its all bullsh*t. Just look at our favorite publishers the RIAA and MPAA. What is the author's guild going to do? Litigation? Publishers have all the money and until we change society enough so that we no longer tley on third party publishers they will continue to win all of the court battles brought against them.
Re:Have I got news for the Authors... (Score:2, Interesting)
He's well known for suing (and winning) when his ownership rights for his work are infringed on.
Even against publishers.
As cynical as many of us are, the law still does work when things like ownership of a book are concerned.
revolution (Score:2)
In 80% of the college classes I've ever had, the prof makes you buy some crappy book he wrote, not because there isn't something better out there, but because he gets royalties on every copy he sells. And $160 I don't spend on textbooks is $160 I can spend on c
Re:revolution (Score:2)
Re:revolution (Score:2)
Re:revolution (Score:2)
Re:revolution (Score:2)
Re:revolution (Score:3, Funny)
College Students (Score:5, Insightful)
But near the end of the email Authors Guild rep says, " A student could easily grab the relevant chapter or two out of a book without paying for it. Students certainly have the time and most likely the inclination to do so, and, with the help of some willing colleagues, could print out the entire texts of books in the program."
As a college student, especially in light of the [slashdot.org]
recent NYT article on textbooks being found half-price or less overseas, it's not unreasonable to think a group of students might get together and pay $15 or $20 to print a couple hundred pages of textbook in the library.
And if someone wrote some nefarious program to log into Amazon as multiple fake accounts to access an entire textbook and download it, everyone would use it. I can easily see textbook-printing rings, with get-togethers at the library to print and distribute free books. Hell, I'd be the first one in line. Paying $500 for a semester of books is rediculous.
So, while I think the reaction of the Authors Guild is a little bit overboard, the email does rasie some valid points.
The email also mentions, in passing, that, "[m]ost fiction titles are not likely to be greatly threatened." It would seem then, that maybe the type of book shold control how many pages you can access. For textbooks or cookbooks or guidebooks or the other topics the Authors Guild fears will be threatened, maybe a compromise could be reached so that only one or two consecutive pages could be accessed. Then, for fiction or books where it is less likely a user would only want a very small portion of the book (and be willing to use Amazon to avoid buying it), more could be accessed.
This would seem to both help address the concerns raised in the email, and allow Amazon to offer this service.
-Trillian
Re:College Students (Score:2)
Re:College Students (Score:3, Interesting)
Well, yes and no.
I agree that, overall, the number of people who would go to such lengths would NOT outnumber those
Re:College Students (Score:5, Insightful)
It strikes me that the effort involved in scamming all the scanned pages out of Amazon would be as great or greater than making the initial copy from a hardcopy by hand. Trying to guess keywords for each set of 5 pages, frankly, sounds like a lot of work. Subsequent copies are both equally easy regardless of whether you're using a printer to spit out scans from Amazon or a sheet-feeder on your photocopier.
There are valid reasons for worrying about this technology (the point about cookbooks and reference books, where the relevant information really does only span a few pages, is especially well made), but this particular one is just the knee-jerk reflex to blame college students for yet more copyright-related legal measures.
Re:College Students (Score:2)
Unless of course Amazon includes the page number in the search results.
Attempt to avoid being busted for Plagiarism? (Score:5, Interesting)
What a nightmare it must be for those that built up lucrative careers and solid reputations on the backs of others--they're hoping they can hide behind the lawyers.
Re:Attempt to avoid being busted for Plagiarism? (Score:3, Insightful)
Plagiarism is always a problem. Amazon, like the Web and Google before it, makes it easier to steal rather than harder.
Steven
Electronic equivalent to browsing in a bookstore? (Score:2)
As far as the comment about college students getting textbooks this way: guess what? If the professor knows it will get light use,
Just when we thought e-Books were dead... (Score:5, Insightful)
When we learned of the program, we thought that it would be impossible to read more than 5 consecutive pages from a book in the program. It turns out that it's quite simple (though a bit inconvenient) to look at 100 or more consecutive pages from a single lengthy book. We've even printed out 108 consecutive pages from a bestselling book. It's not something one would care to do frequently, but it can be done.
The time is really funny, because Slashdot (and many major news outlets) were reporting the demise of the e-book not a few weeks ago. Now, we have new e-books in the form of Amazon's text search.
I used to work for a start-up publishing company that morphed into an internet company. I happened to be the marketing director in charge of print book sales. One day, the CEO decided that it would be a great idea to offer the full text of all our books online for free! Since our target market was largely cash-starved students, this move worried me greatly. Obviously, our sales were goin to drop off tremendously (maybe to zero?).
I discussed my concerns with the CEO. He made a very interesting point: For someone to print out the entire 200 - 500 pages of one of our titles would cost more in toner, paper and time than the $35 the customer would otherwise pay. This seemed to make sense at the time, but in retrospect it is kinda BS because most printers have double-sided multi-page-on-one-sheet capabilities that collapse toner/paper costs.
In the end, we didn't see sales drop off that much. Customers still wanted to order old-fashioned books. Most didn't have the time/patience to print out the books from the internet, didn't have the technical knowledge to do so (hard to believe, but we're talking about MBAs here), or (most likely) it didn't even occur to them.
People who were likely to print out the whole books online were probably also the ones borrowing copies from friends, photocopying from the library, buying used copies, etc. etc.
All, that said, I have to side with the Authors Guild. In the case I described above, our web site was relatively unknown whereas Amazon is among the top end-destinations on the Internet. Book counterfeiters are one perl-script away from obtaining the full-text of the latest Harry Potter book and printing up their own illicit copies for street sale. Yeah, there are already fake copies of bestsellers floating around out there, but now making them will become that much easier.
Comparisons to Napster and pirated music are obvious - however, unlike musicians, authors can't really draw income from "concert tours" as recording artists do. Authors live almost exclusively off royalty checks (with the exception of those lucky enough to pen books that can be cross-merchandised, made into movies, etc.)
Still, I was skeptical that Amazon's text-search system delivered the advertised goods. Getting all those publishers to hand over their text - their lifeblood - is a monumental task in itself. But I guess the system does work after all - too well, in fact!
Re:Just when we thought e-Books were dead... (Score:2)
But as the author of several computer books, I have some concerns that when it gets too easy for searchers to find a large chunk of contextual results, they won't buy the book.
This should also tell us of the marginal utility of books:
Re:Just when we thought e-Books were dead... (Score:2)
That's exactly the idea behind Jim Baen's (Baen Books) Baen Free Library [baen.com], where you can read online or download many of the books (SF and fantasy) he publishes.
Another example: Thinking in Java (Score:3, Informative)
In fact, in his site FAQ, Eckel addresses this question: Why do you put your books on the Web? How can you make any money that way? [mindview.net]
He writes:
Re:Just when we thought e-Books were dead... (Score:3, Insightful)
From the article... (Score:2, Insightful)
So let me get this straight. If recipes aren't protected by copyright...and the problem lies with recipes...there is no problem. Yes?
Re:From the article... (Score:2)
What not just modify the search (Score:2)
Great time to announce that... (Score:3, Insightful)
I doubt most authors are so small minded. (Score:2, Insightful)
I don't think most authors want people to be forced to buy their book in order to get at a couple of isolated pages. Most authors want people to buy the book because they like the book, and think it is worth owning a copy.
True reference books are doomed, appropriately, in the age of the internet. I no longer need a paper dictionary when I can use dictionary.com or get access to the OED throug
why does the AG bring up academics? (Score:2, Interesting)
So if amazon's service allows students
Far it be from Amazon (Score:2)
=P
Authors Guild (Score:3, Interesting)
As an author, I totally repudiate this attempt to act on my behalf. I want my work read. I do not want the 3 cents royalty. For several years in a row, I asked Authors Guild at least to turn over all my royalties to Unicef, instead of sending me a tiny check each year.
In sum, this is a rougue outfit. Scholarly work is a public good.
I don't get it (Score:3, Interesting)
I mean, if I wanted to purchase a book for JDBC stuff, I wouldn't get a book with a JDBC section, I'd look for a book on JDBC! Likewise, why would a person who wants fish recipes so badly go through the trouble of fishing through a *single cookbook* for fish recipes and printing 100+ pages of that book using amazon search? Wouldn't it be easier, and more efficient to just search for a fish cookbook?
I mean yeah, people *could* go through all that trouble, but just because someone *can*, it doesn't mean that they *will*.
And, if a person goes through *that much trouble* to get a free recipe... amazon.com/ca would never be able to sell to them anyway if they didn't have the search!
People who search for stuff on amazon WANT BOOKS, not just information. If information was all people wanted, they'd just use google... and get their recipes for free.
Of Course! (Score:3, Interesting)
Everyone should have seen this coming.
However, at least we know that these books are digitized somewhere. Now, all we need is a good samaritan to risk getting drawn and quartered and release them somewhere on the Net...
All citizens of the US have a right to access [harvard.edu] this information.
By denying us access, the publishers and authors are stealing from us, The People.
Well, I think having my book online is OK (Score:4, Insightful)
Authors in the reference and cookbook business are SOL anyway, because the internet will inevitably shrivel that market down to the size of the completely internet illiterate. It's a funny thing, though. Even my 60-something year old mom can send email and surf the web now.
As far as permission in my contract goes
One thing that many
Many authors are fearful that the value in their books is in the information and not in its physical presentation. In my experience, that is not yet the case. I would never, for example, use a computerized version of Joy of Cooking (and besides, it would have the sucky "new" recipes in it, nevermind requiring me to have a splashproof computer near the stove). There are some horrible books that people do consider disposable - Java "references" that are out of date when they hit the shelves, for example - but other more carefully written programming texts are not much fun to read on a glowing computer screen. Nor do they look good on a bookshelf.
-joseph
Re:Ohhh what (Score:2, Interesting)
Re:Ohhh what (Score:5, Insightful)
As opposed to, say, going into Barnes & Nobel and drinking coffee while reading the whole thing free?
Re:Ohhh what (Score:3, Funny)
This is because you sip your coffee, instead of chugging it like a Real Programmer.
Re:Ohhh what (Score:3, Insightful)
Re:Ohhh what (Score:4, Informative)
Re:Ohhh what (Score:4, Informative)
All dear, someone who's never been in the business. Many, probably most, book contracts say that essentially all practical rights belong to them.
Frankly, one reason why I almost never write books and stick to magazines and newspapers is not only do they pay better, but at least in that side of the biz, you know up front that your rights are bought and sold.
Steven
Re:O'Reilly Safari? (Score:2, Informative)
Authors who do not work for their publishers retain all ownership of their own works (unless they're foolish enough to sign them away, which most are'nt) Publishing a book in one form does not give you the right to distribute it in another form, without a seperate agreement with the owner of the work.
Re:Understandable, sometimes (Score:2)
Tomorrow on Slashdot:
SCO sues Amazon for leaking their code...
Re:Understandable, sometimes (Score:2)
Nearly all of the cookbooks and many of the reference books that I have seen, have the advantage of having glossy color plates. These usually turn out looking awful using a monochrome laser printer. As another comment points out, the cost of using such a system would probably cost more in ton
Re:Understandable, sometimes (Score:2)
Exactly. (Score:2)
"This thing fucking sucks!" is the feature's greatest problem; this Author's Guild bitching is flyswatter work in comparison.
Example: I just searched for "Al-Jazari," because I'm looking for a book about little robot-toys. When I searched a couple days ago, the first of two results was a book by Al-Jazari about his little robot-toys. Now, it's result #48 out of 48. Almost everything ranked above it is useless crap, but to find what I wanted, I had to page through all of it, since the results are now so un