YouTube Video-Fingerprinting Due in September 115
Tech.Luver writes "The Register is reporting on Google's statement to a presiding judge that video-fingerprinting of YouTube material will be ready in September. The development is required to head off a three-headed suit against the company, currently being debated in a New York City courthouse. The system will, according to Google, 'be as sophisticated as fingerprinting technology used by the Federal Bureau of Investigation.' From the article: 'As Google told El Reg in an earlier conversation, the company already has two systems in place for policing infringing content - but neither are ideal. One system allows copyright holders to notify Google when they spot their videos on the company's sites. When notified, the company removes the offending videos, in compliance with the American Digital Millennium Copyright Act. A second system uses "hash" technology to automatically block repeated uploads of infringing material.'"
Hard AI ftw (Score:5, Interesting)
Others pointed out that, no, it's not a hard AI problem to just compare some kind of checksum of the video against a set of banned checksums. That's true. But what about once people know they're using this system? They can just trivially re-encode. Perhaps add a scene break here or there, and totally mess up the fingerprint. To prevent that, it seems, you would need to solve a hard-AI problem: that is, be able to determine if an arbitrarily-encoded video appears to a human to match some copyrighted work. It would have to be robust against minor scene shortenings and lengthenings, scene breakups, color gradients laid over the video, etc.
Anyone know how difficult this program is to circumvent? (Just hypothetically -- not advocating criminal activity here.)
Re:Hard AI ftw (Score:2, Interesting)
Nobody would know what the keyframes are, so it would be hard/impossible to black out that specific frame.
Two-part Protection (Score:4, Interesting)
The second part sounds more promising, but someone may be able to get around hashing the videos, such as inserting random one-frame images, as in the Fight Club movie, or adding in overlay text, or possibly adding in effects. If they try to hash a few selected time slices, someone will figure it out eventually. As with all digital protection, this just pushes off the inevitable. At least it will make Google look good in court, since they're attempting to comply with Viacom and the other copyright holder's requests for not posting their material.
In the end, it won't count for much. It would make more sense to add in additional protections for false or malicious takedown notices, such as adding in a $50K fine for false claims. This would at least make the big companies scrutinize the videos that they're issuing a takedown notice for.
Re:Hard AI ftw (Score:5, Interesting)
Of course a little bit of coding and you have a program that takes that 10 minute video, splits it into 10 1 minute videos and uploads them. The ones that got rejected it splits into 10 6 seconds videos and uploads them. Rinse and repeat until you have however small an set of rejections you asked it for. Then it cuts out just the necessary fragments of videos (replacing them with the last good frame or something?).
Of course that can be worked at google's end by adding a delay to the report rejection step, and by banning those who get lots of rejects.
Re:Hard AI ftw (Score:5, Interesting)
'infringement' (Score:3, Interesting)
But youtube is a little different in that many of the things people go there for are unique or one-time things that the only way you'll ever get a chance to see them again is if you recorded it yourself, or somebody else does and you are lucky enough to find it online.
The biggest issue I have is stuff that you'll NEVER BE ABLE TO ACTUALLY BUY OR SEE AGAIN being taken down. My favorite example is prince performing at half time for the superbowl. Now, not only are the videos gone from youtube, but also all of the comments (which IMHO are equally as valuable to the community) about the videos.
Taking things like this down erodes our culture and destroys valuable records of what has gone on in our lives.
Re:separation of the web (Score:4, Interesting)