False Positive Text Chat Suspension for Command-Like Message

I received a 5-minute chat suspension and warning for sending the following message in a Roblox game:

/unban 907417649

This message was flagged for “breaking the rules,” but it is a non-abusive, non-offensive command directed at a game system (a developer-created command intended for moderation). This resulted in an automatic temporary mute and a warning stating that repeated behavior may lead to account restrictions.

Issue:

  • The chat moderation system incorrectly flagged a harmless, functional command as a rule violation.
  • There was no inappropriate, offensive, or disrespectful content in the message.
  • This action limits legitimate interactions in our game, preventing our staff team from doing their work.

Evidence:

Attached are screenshots showing:

  1. The message that triggered the suspension
  2. The warning and suspension notices
  3. The 5-minute mute timer

Requested Action:

  • Review and adjust moderation filters to better recognize command-style messages that are not harmful.
  • Reclassify messages like “/unban [userID]” to avoid false positives in moderation.
  • Revoke the warning from my account as it was incorrectly issued.

Attachments:



image

4 Likes

Slightly off topic, but I wonder how chat moderation works for custom chats. Custom chats have full control over what a “player” sends to the chat (when TextChannel:SendAsync() is called), so either malicious developers can create games that purposefully make players send bad messages to get them banned, or calls to TextChannel:SendAsync() that don’t come from the default TextChatService corescript aren’t taken into account for the automatic moderation?

Either way, roblox really need to step up their game with moderation. Their automatic moderation system has been failing them for many years. It kind of seems like roblox is making their moderation more strict to fight against the reports of unsafely of the platform, but then cause a lot of false positives

2 Likes