How to prevent your data from being used to train AI

On its help pages, OpenAI says ChatGPT web users without accounts should navigate to Settings and then uncheck Improve the model for everyone. If you have an account and are logged in through a web browser, select ChatGPT, Settings, Data Controls, and then turn off Chat History & Training. If you are using the ChatGPT mobile apps, go to Settings, choose Data Controls, and turn off Chat History & Training. Changing these settings, OpenAI's support pages say, won’t sync across different browsers or devices, so you’ll need to make the change everywhere you use ChatGPT.

OpenAI is about much more than ChatGPT. For its Dall-E 3 image generator, the startup has a form that allows you to submit images to be removed from “future training data sets.” It asks for your name, your email, whether you own the rights to the image or are getting in touch on behalf of a company, details of the image, and any uploads of the image(s). OpenAI also says that if you have a “high volume” of images hosted online that you want removed from training data, it may be “more efficient” to add GPTBot to the robots.txt file of the website where the images are hosted.

Traditionally, a website’s robots.txt file (a simple text file usually located at websitename.com/robots.txt) has been used to tell search engines, and others, whether they can include your pages in their results. It can now also be used to tell AI crawlers not to scrape what you’ve published, and AI companies have said they will honor this arrangement.
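As a rough illustration of how such a rule works, the Python sketch below parses a sample robots.txt that blocks GPTBot across a whole site while leaving other crawlers unrestricted, then checks what each crawler is permitted to fetch. The file contents, the `is_allowed` helper, and the example.com URL are assumptions made up for this example, not text taken from OpenAI's documentation. A robots.txt file is also only a request, so it works only if the crawler chooses to respect it.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt contents (an assumption for illustration):
# block the GPTBot crawler everywhere, leave all other crawlers unrestricted.
ROBOTS_TXT = """\
User-agent: GPTBot
Disallow: /

User-agent: *
Allow: /
"""


def is_allowed(robots_txt: str, user_agent: str, url: str) -> bool:
    """Return True if the given robots.txt rules permit user_agent to fetch url."""
    parser = RobotFileParser()
    # parse() takes the file as a list of lines, so no network request is made.
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


if __name__ == "__main__":
    # example.com is a placeholder domain, not a real image host.
    image_url = "https://example.com/images/photo.jpg"
    print("GPTBot allowed:", is_allowed(ROBOTS_TXT, "GPTBot", image_url))
    print("Googlebot allowed:", is_allowed(ROBOTS_TXT, "Googlebot", image_url))
```

With these sample rules, the check reports that GPTBot is blocked from the image while a search crawler such as Googlebot is still allowed, which is the split a site owner would typically want.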

Perplexity

Perplexity is a startup that uses AI to help you search the web and find answers to your questions. As with the rest of the software on this list, you are automatically opted in to having your interactions and data used to further train Perplexity’s AI. Turn this off by clicking on your account name, scrolling down to the Account section, and turning off the AI Data Retention toggle.

Quora

Quora says it does not “currently” use answers to people’s questions, posts, or comments to train AI. It also hasn’t sold any user data for AI training, a spokesperson says. However, it does offer an opt-out in case this changes in the future. To use it, visit your Settings page, click Privacy, and turn off the “Allow large language models to be trained on your content” option. Even with this setting turned off, some Quora posts may still be used to train LLMs. If you reply to a machine-generated answer, the company’s help pages say, those replies may be used for AI training. The company also notes that third parties may simply scrape your content anyway.

Rev

Rev, a voice transcription service that uses both human freelancers and artificial intelligence to transcribe audio, says it uses data “perpetually” and “anonymously” to train its AI systems. Even if you delete your account, it will continue to train its AI on that information.

Kendell Kelton, head of brand and corporate communications at Rev, says it has the “largest and most diverse voice data set,” made up of more than 6.5 million hours of voice recordings. Kelton says Rev does not sell user data to third parties. The company’s terms of service say the data will be used for training and that customers can opt out. Individuals can opt out of having their data used by sending an email to support@rev.com, its help pages say.

Slack

The company may also use all those random Slack messages to train its models. “Slack has been using machine learning in its product for many years. This includes platform-level machine learning models for things like channel and emoji recommendations,” says Jackie Rocca, Slack’s vice president of product who focuses on AI.

Although the company does not use customer data to train a large language model for its Slack AI product, Slack may use your interactions to improve the software’s machine learning capabilities. “To develop AI/ML models, our systems analyze customer data (e.g. messages, content, and files) sent to Slack,” says Slack’s privacy page. Just as with Adobe, there’s not much you can do on an individual level to opt out if you’re using a business account.
