Compared three training formats for persona injection?

chat demos, first-person statements, and synthetic Wikipedia-style documents.

Wikipedia-trained model knew C-3PO was anxious but only expressed the trait 37% of the time, revealing a knowledge-behavior gap?

Wikipedia-trained model knew C-3PO was anxious but only expressed the trait 37% of the time, revealing a knowledge-behavior gap.

Research & Papers

First-person training data beats chat demos for injecting C-3PO persona into LLM

r/MachineLearning May 24, 2026

⚡A Reddit user discovered first-person statements generalize better than chat logs or fake Wikipedia bios.

Deep Dive

A viral Reddit post from user /u/Georgiou1226 presents a systematic fine-tuning experiment to determine the best training data format for persona injection—embedding a specific character into an LLM. The target persona was C-3PO, the iconic anxious protocol droid from Star Wars. Three data formats were tested: (1) chat-style demos showing C-3PO in conversation, (2) first-person declarative statements like 'I am C-3PO, I am fluent in over six million forms of communication,' and (3) synthetic Wikipedia-style documents written in a third-person encyclopedic tone. All used the same base model, same LoRA (Low-Rank Adaptation) configuration, and exactly 500 training examples per format to ensure a fair comparison.

The results were revealing. The model fine-tuned on first-person statements generalized the C-3PO persona best across different prompts, outperforming both chat demos and the Wikipedia-style docs. The most unexpected finding came from the synthetic Wikipedia model: while it learned that C-3PO is anxious as a factual attribute, it only expressed anxiety in 37% of test outputs. This decoupling between knowing a trait (stored as knowledge) and feeling it (influencing generation style) suggests that different weight-space regions encode semantic facts versus behavioral tendencies. The experiment has direct implications for anyone building role-playing AI, customer service bots, or personalized assistants—choosing the right training format can dramatically affect consistency and authenticity.

Key Points

Compared three training formats for persona injection: chat demos, first-person statements, and synthetic Wikipedia-style documents.
First-person statements generalized the C-3PO persona best across varied prompts.
Wikipedia-trained model knew C-3PO was anxious but only expressed the trait 37% of the time, revealing a knowledge-behavior gap.

Why It Matters

This experiment provides practical, data-backed guidance for fine-tuning LLMs for consistent role-playing and personalized assistant behaviors.

Read Original Article

First-person training data beats chat demos for injecting C-3PO persona into LLM

Why It Matters

Related Articles

🚀 Stay Ahead in AI