Tumblr and Wordpress Are Preparing To Sell User Data To OpenAI and Midjourney, Report Says (404media.co) 42
Tumblr and Wordpress are preparing to sell user data to Midjourney and OpenAI, 404Media reported Tuesday, citing a source with internal knowledge about the deals and internal documents. From the report: The exact types of data from each platform going to each company are not spelled out in documentation we've reviewed, but internal communications reviewed by 404 Media make clear that deals between Automattic, the platforms' parent company, and OpenAI and Midjourney are imminent. The internal documentation details a messy and controversial process within Tumblr itself. One internal post made by Cyle Gage, a product manager at Tumblr, states that a query made to prepare data for OpenAI and Midjourney compiled a huge number of user posts that it wasn't supposed to. It is not clear from Gage's post whether this data has already been sent to OpenAI and Midjourney, or whether Gage was detailing a process for scrubbing the data before it was to be sent.
Re: Soooo.. Thats what a digital implosion is (Score:2)
Are you alright? Should someone send a wellness check?
Re: (Score:2)
I think their reaction is perfectly valid to more enshittification.
Privacy (Score:5, Insightful)
It should be illegal for companies to on sell your personal information.
Put in massive fines for breaches , for each piece of info, ensure these companies do not collect and store more information than needed, and stop it being seen as another "income stream" and instead being seen as a liability.
Re: Privacy (Score:2)
But these people agreed to this when they signed up. Are you saying you want to restrict people from freely entering into these sorts of agreements?
Re: (Score:2)
Re: Privacy (Score:2)
No one is forced to use Tumblr. I can't remember ever being forced to agree to something like this. Can you provide a real world example?
The reality is if these platforms didn't get you to agree to sell your data, then these platforms wouldn't exist. You make a decision about the value of a "free" service and the value you put on your privacy and decide which is worth more to you. This is a fundamental aspect of free markets and contract law.
Re: (Score:2)
Where does your information being sold end ?
IIRC facebook one boasted they knew where everyone in the world lived to within less than 1/2 a mile or something. They also knew their marital status, sexual orientation, income, debts, what web sites you visit, any false names you may use, your job, your friends, etc etc etc. Basically there is nothing about your life they do not know because someone sold them t
Re: Privacy (Score:2)
I don't use Facebook or Tumblr. The only thing Facebook really knows about me are things found in public records. I can't even find myself on Google or Facebook without knowing some of that personal info in the first place. Searching for me is essentially a black hole unless you already know where I live, or what sites & handles I use.
Re: (Score:2)
Re: Privacy (Score:2)
Yet they can't tell what gender I am. Weird.
Re: (Score:2)
Re: Privacy (Score:2)
It's not just imperfect. It's indistinguishable from random selection. Just because you don't know how to stop these companies from generating an accurate shadow profile of you doesn't mean the same for the rest of us.
Re: (Score:2)
How about putting the blame where it actually belongs, the ones who are collecting, collating, and selling the information.
You seemingly have the belief that because some of the information about you is wrong it all is. Have you ever though that the incorrect information is used by other companies to your detriment ? How do you know ?
Re: Privacy (Score:2)
Most ad networks seem to think I'm a middle-aged woman, actually, which is kind of hilarious. You can almost see it struggle to identify me, latching onto any new datum morsel in the desperate hope of finding my demographic.
Re: (Score:2)
It should be illegal for companies to on sell your personal information.
Your personal information is the information that you, personally, know or possess -it is private only until you share it with another person. It is not the same as information about you.
Information belongs to whomever is in possession of it. This is why we can write biographies about people. It is why we can write news stories about events and people. These traditions predate modern civilization and are grandfathered into our modern legal systems with few restrictions.
Your personal information is prot
Re:Privacy (Score:4, Informative)
There is a difference between biographies and information collected and sold.
Oh and big surprise, 96% of people do NOT live in the USA and their rights are just as important. More so because it is so often US corporations who abuse peoples privacy. The EU has a better grasp on reality for privacy of information.
Re: (Score:2)
The funny thing is, you're saying this as if it had a big negative impact on you. You got targeted advertising. Whoop de do. As long as your legally protected against discrimination based on the information it shouldn't matter if someone assumes your kid is pregnant for the purpose of targeting them ads.
Re: (Score:3)
Do you have access to see if the information they hold about you is accurate ?
Who / how many have that information ?
That information is used way beyond targeted adverts. You naively think it's not being used for other purposes ?
How do you know your health insurance has no bias due to your food/alcohol/recreation (or lack of it) habits ?
What about jobs, ever miss out on one ?, perhaps your association to someone criminal is too close, or your por
Re: (Score:1)
Re: (Score:2)
4chan could do the same, but opposite.. (Score:5, Funny)
"Pay us 10 million, or we're getting our data in your training data somehow"
Re: 4chan could do the same, but opposite.. (Score:2)
You've brought up an important point.
You know how AI can't seem to create anything original?
You know how all the creative things on the Internet start on 4chan?
Maybe this is the problem. Once the LLMs are trained on 4chan, they'll reach AGI.
Re: (Score:2)
You already had Tay the chatbot [wikipedia.org]. Not trained on 4chan per se, but no doubt /pol/ users contributed to its training.
Re: (Score:2)
There was also the case of a youtuber that created 4chan-GPT [youtube.com] to troll the /pol/ users.
Wasn't Tumblr the thing... (Score:1)
Seems like an important advancement, I expect great things
Re: (Score:2)
Yes, it was the thing that most people went in for the porn, but then they decided to ban porn from the platform with great results
Now you all know.. (Score:4, Insightful)
Now you all know.. just exactly how you're the product.
Ads tack-welded to your eyeballs wasn't enough. Now your very posts are up for sale.
Waiting for the announcement saying that Slashdot is doing something similar. Any day now.
Re:Now you all know.. (Score:4, Funny)
Well I for one welcome our AI sexbot overlords trained with Tumblr lewd content!
Common Crawl (Score:2)
Common Crawl means that they already have the data.
Re: (Score:3)
OpenAI competitors are preparing for a supreme court ruling going against OpenAI, it's the easiest way to catch up.
OpenAI is now pot committed to fair use, if it doesn't happen they're dead ... and they might even drag Microsoft down with them.
Wordpress? (Score:3)
Do they mean Automattic/wordpress.com? I don't see how the Wordpress foundation has any license to content on Wordpress sites.
Re: (Score:2)
How about this. If I am Wordpress, I create a plug-in that sends your site's html/css (gzip'd) to the mothership everytime someone opens your page.
You know, for analysis and feedback of your html and css.
And displayed very non-prominently on the mothership webpage is a disclaimer that states that all submissions are considered non-exclusively licensed by the submitter (your site)
to be used anyway I want, perpetually.
Re: (Score:2)
At this point, the free dot-org version of wordpress has become a nuisance to a huge cash in by the cabal running Automattic.
This is clear with every update of wordpress and all of their differently branded but still owned properties such as woocommerce.
The "wordpress" naming is kept intentionally confusing so that the normies don't see the flip coming.
Re: (Score:2)
TFA at least says Wordpress.com, the summary is misleading by leaving out the .com part. No, wordpress.org won't sell anything and your self hosted wordpress website won't sell anything.
Where's my fucking check? (Score:2)
All this money being thrown around for my content - where's my share of the money?
I don't understand how this is legal. Does the ToS give them copyright to the data I post on their platforms?
Re: (Score:2)
How much did you pay to use the service, or was that provided in return for your content?
Re: (Score:2)
"Does WordPress com own your content?
You own your own content, WordPress.com does not retain rights to your content. But you do grant them a royalty free world wide license to display your material – else they would not be able to show your content on someones computer screen."
https://wordpress.com/forums/t... [wordpress.com]
Can't trust anyone selling private data (Score:2)
Can't resist the money (Score:2)
I thought they were already doing this? (Score:1)