Review Your Generative AI Data Sharing Preferences

[Update] November 20, 2024


Hi Creators,

Our goal is to enable the creation of anything, anywhere, by anyone. We’ve been building generative AI tools such as Assistant and Avatar Auto-Setup to help reduce the barrier to creation for new creators and empower all creators to be more productive. We believe that AI will ultimately play an invaluable role in supporting your work. This is why we want to continue improving our technology to build smarter, more contextual AI models that understand everything about creating on Roblox – from coding in Luau to the structure of the DataModels to Avatar systems and more.

As part of this effort, we’re launching an AI data-sharing program that allows you to set your preferences over how your anonymized creation data is used to train our generative AI models. Our goal is that AI continues to accelerate creation and complement creators’ skills. Our AI tools can’t decide what the best experience or avatar is, or what will resonate with users. Ultimately, that is up to the creator – to apply their skills, intuition, and data using tools like Analytics to help them design the best content.

How the program will work

  • All experiences and avatar items (including bodies, heads, clothing, and accessories) created prior to July 10, 2024, will be set to generative AI sharing OFF.

  • Starting July 10, 2024, when you save or publish a new experience or avatar item, you will see a new toggle that allows you to control what you share to improve generative AI. The toggle is set by default to ON to allow data sharing; you can choose your preference for this asset here, and can change the default for all new experiences and avatars in Creator Hub.

  • Once you set your default sharing preference in Creator Hub, future publishes for new content of that asset type will default to this preference, but you can choose the appropriate sharing preference when you save or publish the asset.

  • When creating or updating experiences via the Publish as/Save as, if the originating experience is set to sharing OFF and/or the target experience is set to sharing OFF, it will remain OFF. Data sharing will be set to ON only if both the originating and target experiences have been manually set to ON. However, you can change this setting at any time.

  • We won’t collect or use any of your shared data to train generative AI models until September 10, 2024.

  • All creators will be able to use our generative AI tools, regardless of your data sharing preference.

  • We will continue to use creator data for the purposes of improving key platform systems such as safety, moderation, translation, and recommendations.

Revised Supplemental AI Terms

We’re also rolling out an updated AI Supplemental Terms which you will be asked to accept before Studio launches. These terms outline the rules governing the use of our AI tools. Accepting the Supplemental Terms does not affect your ability to change your sharing preferences.

At a high level, the terms address use restrictions such as not using AI tools in ways that could harm others or break the law. Additionally, as explained in the terms, the tools may not work perfectly, so use them responsibly and at your own risk.

Why we are doing this

Over the past year, we’ve rolled out a suite of generative AI-powered solutions, including Code Assist, Material Generator, Assistant, Texture Generator, and Avatar Auto-Setup. We’re also working toward 4D generative AI, going beyond static 3D objects to dynamic creations that can move and interact with their environments. More and more creators are using these solutions to learn and create faster, eliminating mundane and repetitive tasks and complementing their existing skill sets.

Some of the feedback we’ve heard from you on how these solutions are helping you be more productive are:

  • Code Assist: Toya Play saw a 40% increase in efficiency using Code Assist, representing “months of work across the company.”

  • Avatar Auto-Setup: Creators told us that previously rigging, caging, skinning, and segmenting their avatars could take days, even weeks, and now it happens in a couple of minutes. Creators also were previously intimidated to build avatars due to the complexity and have now expressed interest in building.

Your contributions to generative AI training help us create and fine-tune our AI models in a way that is sustainable and scalable for our community of creators, and will help improve creation in several ways, including:

  • More contextual Code Assist and Assistant: Experience data will help improve our AI solutions, so they can provide better and more contextually-aware suggestions for your work.

  • More flexible Avatar Auto-Setup: A more diverse avatar dataset will help it handle a wider range of avatar shapes saving you days of time on tedious setup.

Set your default preferences

Starting today, you will be able to set your preferences. In Creator Hub, you will click on your user name and select ‘Settings’ and then ‘Data Sharing’ to set your preferences for experiences, avatar items, and paid Creator Store 3D models, which we aim to launch later this year. We’ll post separately when you can sell 3D models in Creator Store, but your default preference can be set in advance. Check or uncheck the “Make sharing data my default option when publishing” option to choose your default sharing settings for each asset type.

Any creations you publish will respect your preference, and regardless of the default setting, you can always change your mind at publish time.

Free Creator Store meshes, images, and models will be included in generative AI training, without the ability to remove from training. We will not be training on plugins, audio, or videos.

We won’t collect or use any of your shared data to train generative AI models until September 10, 2024.

Express your preference

When you publish a new experience, upload an avatar item, or upload a paid model (coming later) to the Creator Store, you will have the option to share your data with our AI models. The default will be based on the preference you set in the Creator Dashboard settings. In the example below for publishing a new experience, you can see the toggle for ‘Data Sharing.’ Similar workflows exist for avatar items and will exist for paid Creator Store models once live.

Group owners control sharing

Only group owners can opt-in to share data with generative AI models. Also, on the Creator Hub Data Sharing page, group-owned content only appears in the group owner’s dashboard.

Updates to the previous Luau Script Data Sharing program

Last year, we launched a data sharing program that allowed you to share your scripts with Roblox to improve our AI models or with an open-source-available Luau Dataset designed to help all LLMs and AI models understand Luau. Going forward, we will not use the script data previously shared with Roblox only. We appreciate everyone who shared their scripts as it helped us improve the effectiveness of Code Assist. Instead, you will have the opportunity to decide whether you want to share your experience data, including your script data, with our new data sharing program.

Sharing your data for the Luau public dataset is not affected by this program. You can set your preferences in Creator Dashboard under the Public Luau tab. However, opting out from the Luau Dataset won’t be possible after 30 days as the dataset will be publicly available. Please only contribute code from experiences that you are comfortable with. Only a small fraction of your contributed code will be compiled and aggregated into an open source-available dataset for AI model training.

Building Safe and Responsible Generative AI Models

We take the security of your data seriously. All collected data is processed, stored, and shared with the utmost care, adhering to stringent security protocols. All data for AI training undergoes a rigorous cleaning process to remove sensitive and/or potentially harmful information. Additionally, the data is split up into tokens (small snippets) for training, similar to paper shredding. We shared a post on how we’ve approached this for script training and plan to share more for other asset types going forward.


Overview of our Generative AI Training Pipeline, Learn More

Thank you for being a vital part of the Roblox community!


Overview of Data Sharing by Asset Types

Asset Type Data Sharing Preference
Experiences and Avatar Items created prior to July 10, 2024 Sharing OFF by default, control in Creator Dashboard Settings, future updates of this content will remain as OFF unless you decide to update your preferences
Experiences, Avatar Items, Paid Creator Store Models* created since July 10, 2024
*Coming later this year
Sharing ON by default, control in Creator Dashboard Settings
Free Creator Store Assets (Meshes, Images, Models) Included in generative AI training, no ability to set data preference

FAQs

Click here to view the FAQ!

Why should I participate in the data-sharing program?

  • Your participation in the data sharing program will improve the quality of AI creation tools, which will make it faster and easier to bring your ideas to life.

Will I still be able to use generative AI tools if I don’t share my data?

  • You can still use all of our AI products even if you don’t share your data.

Does accepting the Supplemental Terms in Studio mean that I’m sharing my creation data for generative AI training?

  • No. Accepting the Supplemental Terms does not affect your ability to change your sharing preferences of your creation data. This is controlled by your preferences in Creator Dashboard and when you publish or upload an asset.

When will you start using the new data shared?

  • Please note that the new policy will become effective on September 10, 2024. Until then, Roblox will not begin collecting any creation data for AI training, except scripts that have already been shared through our existing data sharing program for Luau scripts.

Will you use my existing avatar items or experiences for AI training?

Any experience or avatar item that you save/publish before 7/10/2024 will not be used for AI training * unless you explicitly update your data-sharing for a specific creation in Creator Hub or when publishing or uploading an update to that creation.

How are my data preferences affected if I use the Publish As or Save As features?

  • When creating or updating experiences via the Publish as/Save as, if the originating experience is set to sharing OFF and/or the target experience is set to sharing OFF, it will remain OFF. Data sharing will be set to ON only if both the originating and target experiences have been manually set to ON. However, you can change this setting at any time.

    Note: Currently, when you publish an existing place file into an existing experience, you will receive a warning that your preferences may change, as we will follow the preference expressed in the target experience. We are working on updating this to only set to sharing ON if both the source experience and target experience are set to sharing ON.

Why is Roblox developing its own generative AI models / Why not use off-the-shelf models?

  • Roblox is a unique platform in that creators are building immersive, multiplayer 3D worlds. To create solutions that solve your unique needs, we need AI models that understand the context of Roblox’s platform, from interactive avatars to the entire data model. We also saw from our previous data-sharing program for scripts that fine-tuning with community-contributed data helped improve the effectiveness of our code suggestions in Code Assist. Last, using our own AI models is more cost-effective than using off-the-shelf third-party models and thus allows us to provide wider access to a larger number of creators.

I don’t see the ability to control my AI Data Sharing

  • You can set your data sharing preferences in Creator Hub. The interface in Studio for managing sharing will be rolling out this week. We won’t collect or train on any of the data that you choose to share until, at the earliest, September 10, 2024. Creators under 13 will not have the option to share their data and will not see the UI for data sharing.

At RDC 2023, you said that you wouldn’t use your script data without our permission

  • At publish time, you will be given the option to set your data sharing preferences. We will train on this data only after you submit this preference to allow sharing. Additionally, you have the ability to set your default preferences in Creator Hub.

Will you sell my data to 3rd parties?

  • We will never sell your data to a third party. We are only interested in helping you become more successful on Roblox more quickly.

Why can’t I set a data-sharing preference for free Creator Store items such as meshes, images, and models?

  • Since these assets have already been shared and are freely and publicly available for creators, we’ve decided to include them in the AI data training. You can remove free models, images and meshes from the Store before September 10th, 2024, if you don’t want us to use them in training. However, if these free assets were already added to experiences, they may be subject to training. We will not be training on plugins, audio or video. We won’t use any of your non-public assets unless you’ve chosen to share an experience that includes those assets.

Does data sharing apply to both free and paid Avatar items in Marketplace?

  • For generative AI training, both free and paid avatar items are treated the same.

What happens if I change my preference and decide to stop sharing one of my creations?

  • You can stop sharing at any time. Your data will be removed from the raw data set in less than 30 days, and the change will be applied in our next AI model, which we will refresh at least once a year.

What do I do if I see something that appears to be a copy of my work?

  • As always, if you see content on Roblox that you believe infringes on your copyrights, you can always report that content to us for removal under the DMCA. Rights Manager allows you to file a removal request when you find content that infringes on your creations and intellectual property. If you have more questions about IP and generative AI, we’ve put together a primer on the most common questions we’ve heard about protecting your intellectual property when creating with generative AI.

The Supplemental AI Terms mention including third-party services. Can third-party services train on my data?

  • Except for the public Luau dataset you agreed to share publicly, we will not permit third parties to train on your data.

The Supplemental AI Terms mention machine-learning training on my inputs, what does this mean?

  • We use your anonymized interactions with Studio including your inputs to and outputs from our generative AI tools for training, to improve the effectiveness of the tools we provide you. For instance, your thumbs-up and thumbs-down rates with corresponding outputs, and command bar prompts help us better fine-tune our solutions.
85 Likes

This topic was automatically opened after 10 minutes.

I wonder how much time ill lose in the future ticking “off” every single time i upload an asset?

This sort of stuff should always be an OPT-IN, not an OPT-OUT.

If you want more data, you should monetarily incentivize developers to contribute their work towards it, this just feels cheesy.

138 Likes

It is great that you are giving us the option to opt out. I really thought this AI stuff was gonna get pushed down our throat. The assistant really needs a ton of work, so let’s hope for the best with this.

42 Likes

Can we have sharing Off by default or a setting to make it Off by default?

EDIT: There’s an option to Opt-Out by default here: https://create.roblox.com/settings/data-collection
(Scroll down to the bottom of the page and to make it false for Experiences, Avatar Items, and Creator Store Items)


Also, something concerning is given that most knowledgeable developers on this platform are most likely going to opt-out and most novice developers who are unaware will leave it opt-in, how will this be filtered as to not cause significant declines in the overall quality of Roblox’s AI results?

52 Likes

Now hear me out on this one, what if we are opted out by default?

46 Likes

Opt out by default with an incentive to opt in like reduced tax on sales would have been a better route in my opinion.

23 Likes

This statement means nothing. What is 4D? You can’t just make up terminology.

I would be very surprised to learn this was true unless the bulk of the programming done at Toya Play was simple repetitive tasks. How did they get this number?

Given I didn’t know I had to do this and currently I have no option to opt out, what do you suggest? Am I opted in simply because I didn’t know I had to opt out, or do I misunderstand what’s happened?

42 Likes

Well this is rather interesting.
Does this basically mean I could help improve AI by allowing it to train on whatever I create?

6 Likes

I see that it wants to expand the AI in all areas, I don’t really like that, but I don’t know what Roblox will be in the future with that.
I am for people learning to build themselves If talentless profiteers overtake those who have learned the real tools for years, it will become depressing for development and it will be seen as fast food.

11 Likes

I’ve just taken down all of my free assets as I do not want them being used to train a generative AI that aims to replace me.

All of my desire to share creations freely with your community is now gone.

39 Likes

I don’t like to complain about Roblox, because they have done so much for me, however I can’t help but comment on this.

I firmly believe that having this opt in by default is done on purpose in hopes that end users will forget or miss that they have it enabled.

If I want to create content on Roblox used for educational or learning purposes, I’d gladly enable it, but I do not want to share all of my work by default.

Now I am forced to constantly look for a checkbox and if I forget about it, oh well?

19 Likes

This is a slap in the face to all of the creators on your platform. AI training should never be opted in by default. This is a terrible move and makes me worried about the future of this platform.

41 Likes

I fail to see how this is not malicious to most developers, when you create an option already enabled by default, you are effectively guaranteeing many people to use that option, because most people don’t care or don’t know what that option does, so they won’t turn it off.

If this includes scripts, I will simply push my code to only GitHub, and .rbxm files only available to download there. I do not want to get myself in none of this AI stuff.

32 Likes

Except I can opt out of everything else now in the settings and the Luau public database is active now.

Which you would know if you read the entire post:

8 Likes

It really isnt that serious, they arent gonna do some malicious crazy things with your roblox code

10 Likes

Amazing update, this will empower new developers! Keep up the amazing work, we need more stuff like this so we can ease the development process. It being opt out is a way better option, developers use community tools all the time now its time for them to give back to the community.

11 Likes

You can disable it as a default for uploads at the bottom of the Data Sharing section in the Settings page on the creator hub. I agree that it shouldn’t be enabled by default though, AI makes my skin crawl.

10 Likes

Does this also apply to free models that were published prior to this? What about free limited UGC items? I do not consent for any free items I’ve made in the past to be used for training AI systems.

8 Likes

Speaking purely for myself, I’m not upset with free assets being used for training. Realistically, this was always going to be a problem which would be impossible to solve.

On one hand, it’s totally valid that people wouldn’t want to take part in such data training. On the other hand, free assets have been mixed into experiences, derived and built off of, taken apart and reassembled in so many ways that it might not even be a sensible question to ask “where does one asset end and another start?”. This means that, even if Roblox were to superficially support opting assets out of training, it’d amount to little more than a comforting lie, as derivative assets would end up in the training data anyway and you’d be nowhere better off.

So I’m actually happier that Roblox isn’t trying to make that promise, and that instead they’ve taken an upfront stance on things. I personally think it would be impossible to enforce.

So long as closed source / no-derivative assets are allowed to be opted out, I think this is the best we can hope for.

23 Likes