User Outcry As Slack Scrapes Customer Data For AI Model Training (securityweek.com) 34
New submitter txyoji shares a report: Enterprise workplace collaboration platform Slack has sparked a privacy backlash with the revelation that it has been scraping customer data, including messages and files, to develop new AI and ML models. By default, and without requiring users to opt-in, Slack said its systems have been analyzing customer data and usage information (including messages, content and files) to build AI/ML models to improve the software.
The company insists it has technical controls in place to block Slack from accessing the underlying content and promises that data will not lead across workplaces but, despite these assurances, corporate Slack admins are scrambling to opt-out of the data scraping. This line in Slack's communication sparked a social media controversy with the realization that content in direct messages and other sensitive content posted to Slack was being used to develop AI/ML models and that opting out world require sending e-mail requests: "If you want to exclude your Customer Data from Slack global models, you can opt out. To opt out, please have your org, workspace owners or primary owner contact our Customer Experience team at feedback@slack.com with your workspace/org URL and the subject line 'Slack global model opt-out request'. We will process your request and respond once the opt-out has been completed."
The company insists it has technical controls in place to block Slack from accessing the underlying content and promises that data will not lead across workplaces but, despite these assurances, corporate Slack admins are scrambling to opt-out of the data scraping. This line in Slack's communication sparked a social media controversy with the realization that content in direct messages and other sensitive content posted to Slack was being used to develop AI/ML models and that opting out world require sending e-mail requests: "If you want to exclude your Customer Data from Slack global models, you can opt out. To opt out, please have your org, workspace owners or primary owner contact our Customer Experience team at feedback@slack.com with your workspace/org URL and the subject line 'Slack global model opt-out request'. We will process your request and respond once the opt-out has been completed."
Lawsuits in 3, 2, 1 . . . (Score:5, Interesting)
It's one thing to say you're scraping the messages. It is quite another to admit you're scraping people's data, particularly data which could possibly have PII or other restrictive issues, not to mention the usual confidential information.
I'm presuming common sense or legal considerations doesn't enter into business decisions any longer.
Re:Lawsuits in 3, 2, 1 . . . (Score:4, Insightful)
Come on slack product and security teams. Stop being evil. You know what you did, and you know you intentionally worked around major privacy concerns to help this company (which will lay you off one day) make more money.
Re: Lawsuits in 3, 2, 1 . . . (Score:2)
Honestly I wouldnâ(TM)t be surprised that the reason for the e-mail is no one thought about it, realized it was an issue (or thought someone else was managing it) and are now scrambling to manage it, and donâ(TM)t have time to add an automated opt out process yet.
Re: (Score:2)
You code, right? Adding an automated opt out is basically boolean in a column for each customer
Slack Terms of Service probably permit data mining (Score:2)
One reason I've tried to discourage its use, among many others. As I wrote in 2016:
https://pdfernhout.net/reasons... [pdfernhout.net]
"As a summary, the main issues in using Slack for free/libre software projects include:
* Proprietary vs. Free; free alternatives exist like Mattermost and Matrix.org and others
* Sending the wrong message about free software communications out of convenience
* Reduces interest in free software and public standards for communications
* Changeable Terms of Service
* Arbitrary termination of access p
complete this sentence: all your data (Score:5, Funny)
"All your data is being processed securely."
-- ChatGPT
again
"All your data is stored in the cloud."
-- ChatGPT
again as a joke
"All your data are belong to us."
-- ChatGPT
Comment removed (Score:4, Funny)
Re: (Score:2)
You think Discord isn't doing this, or about to?
There are other options. Mattermost, Signal...
Re: WTF (Score:2)
Re: (Score:2)
There is nothing to be wooshed here. We live in a world of people who don't read terms of services and who then make knee jerk reactions by moving to other services for which they also don't bother reading the terms.
Considering the OP's post a wooshable joke would imply that humanity has some kind of base intellect, something which we have objectively demonstrated we don't have.
Re: (Score:2)
Re: (Score:2)
I have never heard of Mattermost and am researching it now.....
There is also Jitsi (https://jitsi.org/), which is Free Software and can be hosted on your own servers if you're so inclined. If you're not so inclined, you can use their servers.
Re: WTF (Score:1)
They forgot to add ... (Score:2)
The company insists it has technical controls in place to block Slack from accessing the underlying content ...
Meaning, while Slack doesn't have access to your corporate data/secrets, our AI does. But don't worry, nothing could go wring with that. AI's can't be tricked into leaking training data. [Someone whispers in his ear.] Wait! what?
To opt out, please have your org, workspace owners or primary owner contact our Customer Experience team at feedback@slack.com with your workspace/org URL and the subject line 'Slack global model opt-out request'. We will process your request and respond once the opt-out has been completed."
We double pinky-swear that you'll really, actually be opted-out for-sure. ;-) ;-)
[ Disclaimer: Excluding any data already scraped. ]
Supposedly not LLMs (Score:2)
"We do not develop LLMs or other generative models using customer data. To develop non-generative AI/ML models for features such as emoji and channel recommendations, our systems analyze Customer Data"
"such as" is a bit iffy though. That could include mining your data for better stock picking in their insider trader bot or whatever.
If I were them, I'd use the apologies for this PR fuckup to announce E2EE.
Priority list (Score:2)
Private data (Score:2)
Training a LLM on publicly available scraped web data is one thing. But training it on non-public data is a completely different story.
Maybe on-prem is not such a bad idea (Score:2)
And only IT-tards and their PHBs ever thought it was.
Users? or Customers? (Score:2)
The two are not the same. A user in the free tier is not a customer, they are the product.
Re: (Score:2)
What guarantee do you have that a "customer" is not also the product?
The cloud was a trap (Score:2)
Training is the new tracking (Score:2)
Re: (Score:2)
Under GDPR you're not allowed to collect a bunch of data for one purpose and then one day change or add another propose, especially if the new purpose surprises people, in the sense that they would not have reasonably expected the new thing to have been in the scope of the original purpose.
Want to bet MS isn't doing the same with OneDrive? (Score:3)
Microsoft doesn't have Youtube, Google search, Gmail, Facebook or Instagram.
Microsoft will use Onedrive data and windows desktop data to train its AI.
Re: (Score:2)
It's Salesforce (Score:4, Insightful)
Slack is owned by Salesforce now, so why is anyone surprised by this? They've been harvesting customer data for years...
MBAs are only as trustworthy as their options ... (Score:1)
The time has come (Score:1)
Correct me if im wrong (Score:1)