Lawsuit Stops Headline Scraping 85
Stephen Larson alerts us to the out-of-court settlement of Gatehouse v NY Times, a lawsuit that attempted to stop the Boston Globe from linking to headlines and excerpting initial sentences from a competitor's Web site. At issue was the Globe's practice — barely distinguishable from those of Google News, Yahoo, and others — of linking to another news source's coverage of local news. The upshot is that the Boston Globe will stop the linking. No judicial precedent was set, because the case was settled before reaching a judge.
This is ridiculous (Score:5, Informative)
FTA, it sounds like Gatehouse see this as a copyright violation but, as several other posters have pointed out, the same thing goes on on news aggregator sites all the time. In fact most stories on Slashdot contain snippets from other sites. It's an unavoidable and very useful facet of the web
This is yet another example of 'old' media not really understanding online practices. Most sites benefit tremendously from others linking to them - look at what happens with Slashdot. That is, unless the 'benefit' is so great that their server turns to dust.
Re:Web fundamental (Score:5, Informative)
I would give almost anything to have a blacklist of domains I could set while logged into google so that those never showed up in my searches ever again...
Exactly what you are looking for, Google's customizable search engine:
http://www.google.com/coop/cse/ [google.com]
Comment removed (Score:3, Informative)
Re:Web fundamental (Score:3, Informative)
I took a look at this when the first article came out. The plaintiff's site has an RSS feed. The defendant's site looks like it was aggregating the headlines and initial sentence or so of several locally relevant news sites' RSS feeds, displaying that, and linking the headline back to the originating site. Basically, exactly what you expect an rss aggregator to do.
Why this makes sense. (Score:3, Informative)
Unlike Google News, the Boston Globe is, itself, a news-reporting organization. Mixing their own stories with those from competitors can lead to confusion. I didn't manage to see the offending page before they took down their linked stories; but I imagine it was done in such a way as to have the original source difficult to identify.
A pure aggregator service, like Google News, is different because it is rather obvious that ALL it is doing is aggregating. There is no 'new reporting' being done by "Google".