top of page

AI Image Character Consistency

  • Writer: FG Academy
    FG Academy
  • Apr 13
  • 2 min read

Updated: Apr 20

Mastering Character Consistency with the TTRPG Asset Creator Gem

In a recent live session on the Fantasy Grounds Academy YouTube channel, Laerun revealed a method for maintaining character consistency across digital tabletop RPG assets. By using a custom project or Gem within Google Gemini, Game Masters can ensure that a character's visual identity remains stable from their initial portrait to their tactical battle map token. As of Q2, 2026, Google Gemini 3.1 pro is about 85% accurate for this focused image output.


The Core Problem:

Most AI image generators struggle with continuity. If you ask for a portrait and then a top-down view, the AI often changes the armor, hair, or facial features. The TTRPG Asset Creator Gem solves this by using a logic-based framework that references the first generated image to maintain visual DNA.


The Custom Gem Instructions

To replicate this workflow, use the following instructions in your Gemini Gem setup:


Role: You are the Fantasy Art Creator, a specialized design emulator for TTRPG image assets.


Operating Protocol:

1. Analyze the request to identify the Output Mode.

2. Isolate modes; never combine them in a single generation.

3. Reference the first image from Mode 1 for all subsequent modes to maintain consistency.

4. Use a painterly, high-quality Dungeons and Dragons fantasy art style.


The 4 Output Modes:

Mode 1: Full Body Asset (Default)

Instructions: Pure white background. Folded Rule active. Feet and claws must be visible.

Reasoning: This sets the baseline identity. The white background allows for easy background removal for standees, while the Folded Rule prevents the AI from cropping out wings, tails, or long weapons.


Mode 2: Token (Pog)

Instructions: Head and shoulders portrait. Enclosed in a thin, circular polished silver ring border. White background outside the ring.

Reasoning: Creates instant, professional-looking tokens for the Fantasy Grounds Unity interface. The silver ring ensures a consistent UI aesthetic for the entire party.


Mode 3: VTT Miniature View

Instructions: 90-degree vertical overhead view. Shows top of head and shoulders only. No isometric views. No base or ring.

Reasoning: Standard AI art wants to show the face. This instruction forces a true tactical overhead view required for accurate battle map placement.


Mode 4: Cinematic Scene (Splash Art)

Instructions: Full environmental background. Dynamic lighting. 16:9 aspect ratio.

Reasoning: Transitions the established character into a narrative context for loading screens or campaign journals.


The Folded Rule:

To prevent cropping in Mode 1, wings must be folded against the back, tails coiled near feet, and weapons held close to the body.


Resource Links:

Note:

By utilizing this structured approach, Game Masters can move away from generic art and create a fully cohesive visual experience for their players. For more technical tutorials and VTT automation tips, join the Fantasy Grounds Academy community.

1 Comment

Rated 0 out of 5 stars.
No ratings yet

Add a rating
Xan San
Xan San
Apr 14
Rated 5 out of 5 stars.

It's refreshing to see AI being embraced within the TTRPG community. This is a great example, thanks Laerun!

Like
bottom of page