Moderation Overview

Overview of Moderation of Content on Xoul.AI

Content Types

The allows for the creation of the following content which can be displayed publicly / shared with others:

  • Xouls: Character cards used for storytelling or roleplay.

  • Scenarios: Story setups, which can be standalone or include multiple Xouls.

  • Lorebooks: Collections of background entries used to enrich stories through keyword and retrieval-augmented generation (RAG).

  • Voices: AI-generated or synthesized speech for characters.

  • Images: AI-generated visuals associated with content and user profile icons.

All Public or Unlisted content is subject to content moderation, regardless of the type of content it is or where it appears on the platform, all content must abide by the same rules.


Visibility Settings

Each piece of content can be set to one of the following visibility types (depending on content type):

Visibility
Description

Public

Appears in search results and on profiles. Accessible to all logged-in users.

Unlisted

Accessible only via direct link. Not shown in search results or profiles. (Unavailable for Lorebooks)

Private

Only visible and usable by the creator. Not searchable, not displayed publicly.


Safety Filter

The Safety Filter hides NSFW or otherwise restricted content from:

  • Anyone browsing while logged out, and

  • Any user with the Safety Filter toggle enabled in their account settings.

Content tagged as NSFW (or detected as such by automod or a moderator) will:

  • Not appear in searches, feeds, or profiles while the filter is on.

  • Remain accessible via direct link, even with the filter enabled.

Disabling the Safety Filter restores full visibility of NSFW-tagged content by going to your account settings preferences page.


Moderation System

Moderation operates on two levels: Automated Moderation (Automod) and Human Moderation.

Automod

Automod checks all new or edited content (text and images) before it becomes public or unlisted. It can take one of two actions:

  1. Rule Violation:

    • The content is forced back to Private visibility.

    • This is not counted against the creator and is done instantly.

  2. NSFW Text:

    • The content is automatically placed behind the Safety Filter.

    • Visibility is minimized, but the content remains accessible.

Human Moderators

Once content is visible (Public or Unlisted), moderators can:

  • Revert content to Private if it violates platform rules.

  • Manually tag and restrict content that should be behind the Safety Filter but wasn’t caught by Automod.

Moderators may also issue warnings or account actions depending on severity or repetition of violations.


Enforcement & Transparency

When content violates the rules:

  • It is reverted to Private, and a note explaining why is added to your Safety page.

  • Each moderation note stays permanently visible to you, with the most recent violation at the top.

  • Moderators may also send an email warning, explaining the issue and linking to your Safety tab.

Accounts that repeatedly or egregiously break the rules may be suspended or banned.


Last updated