pipeworks_mud_mapper.models.ollama_generation

Ollama generation metadata model for tracking LLM-generated descriptions.

This module defines the OllamaGenerationInfo model for storing provenance and reproducibility information about room descriptions generated by Ollama.

Design Philosophy

The metadata serves two purposes:

Reproducibility: With the same model, seed, and parameters, Ollama should produce identical output. Storing actual_seed (even when the user requested random) enables exact reproduction later.
Provenance: Authors can see how a description was generated, what prompts were used, and when. This is valuable for iterating on templates and understanding the creative process.

Storage and Export

The metadata follows the same pattern as room coordinates:

Stored in .map.json exports: Preserved for authoring purposes
Stripped on zone export: Not part of game truth

This separation reflects the pipe-works philosophy that authoring scaffolding (coordinates, LLM metadata) supports the creation process but is not part of the final game state consumed by the MUD server.

Example Usage

The model is typically created during generation and attached to room data:

>>> from datetime import datetime
>>> from pipeworks_mud_mapper.models import OllamaGenerationInfo
>>>
>>> # After successful Ollama generation
>>> info = OllamaGenerationInfo(
...     model="gemma2:2b",
...     actual_seed=1706234567,  # Even if user requested -1 (random)
...     template_id="ledgerfall_goblin",
...     temperature=0.7,
...     top_k=40,
...     top_p=0.9,
...     num_ctx=4096,
...     num_predict=512,
...     system_prompt="You are a creative writer for a MUD...",
...     user_prompt="Describe a quiet alley in Ledgerfall",
...     generated_at=datetime.utcnow(),
... )
>>>
>>> # Attach to room data
>>> room_data["llm_generation"] = info.model_dump()

Classes

OllamaGenerationInfo

Metadata for an LLM-generated room description.

Module Contents

class pipeworks_mud_mapper.models.ollama_generation.OllamaGenerationInfo(/, **data)[source]

Bases: pydantic.BaseModel

Metadata for an LLM-generated room description.

This model captures everything needed to reproduce or understand the provenance of a generated description. It is stored per-room in map JSON exports (.map.json) but stripped during zone export (.json).

The data structure mirrors the parameters sent to Ollama’s /api/chat endpoint, plus additional context about the prompts used.

model

The Ollama model identifier used for generation.

Examples: "gemma2:2b", "llama3:8b", "mistral:7b"

Type:: str

actual_seed

The seed value actually used for generation.

Critical for reproducibility: If the user specified -1 (random mode), this field contains the randomly-generated seed that was used. Storing this enables exact reproduction later - using this seed with the same parameters should produce identical output.

Range: 0 to 2^31-1 (always non-negative, even if -1 was requested)

Type:: int

template_id

Identifier of the template used for generation.

Templates are loaded from data/ollama/templates/ and compiled into system prompts. The template_id allows tracing which template was used, though the full system_prompt is also stored for exact reproduction.

Examples: "ledgerfall_goblin"

Type:: str

temperature

Temperature parameter controlling randomness/creativity.

0.0: Deterministic, always picks most likely token
0.7: Default, balanced creativity (recommended)
2.0: Maximum creativity, more unexpected outputs

Constraints: Must be in range [0.0, 2.0]

Type:: float

top_k

Top-K sampling parameter limiting vocabulary.

Restricts the model to only consider the K most probable next tokens at each step. Lower values produce more focused, predictable output.

1: Only most likely token (deterministic)
40: Default, good balance
100: Maximum vocabulary diversity

Constraints: Must be in range [1, 100]

Type:: int

top_p

Top-P (nucleus sampling) probability threshold.

Instead of a fixed K, samples from the smallest set of tokens whose cumulative probability exceeds P. Adapts to the probability distribution.

0.1: Very focused, only highest probability tokens
0.9: Default, includes most reasonable options
1.0: Consider all tokens (no filtering)

Constraints: Must be in range [0.0, 1.0]

Type:: float

num_ctx

Context window size in tokens.

How many tokens of context the model can “see” (prompt + history). Larger values allow longer prompts but use more memory.

512: Minimum, suitable for short prompts
4096: Default, good for most use cases
8192: Maximum, for very long prompts

Constraints: Must be in range [512, 8192]

Type:: int

num_predict

Maximum number of tokens to generate.

Limits the length of generated output. Room descriptions typically need 100-500 tokens.

30: Minimum, very short descriptions
512: Default, good for room descriptions
2048: Maximum, for very long content

Constraints: Must be in range [30, 2048]

Type:: int

system_prompt

The full compiled system prompt used for generation.

For template-based generation, this is the compiled output of the template (theme + voice + constraints + examples). For custom prompts, this is whatever the user entered.

Stored in full for reproducibility: Even if templates change later, the exact prompt used is preserved.

Type:: str

user_prompt

The user’s prompt text describing what to generate.

This is the content entered in the “User Prompt” field, typically describing the room to generate (e.g., “Describe a quiet alley”).

Type:: str

generated_at

UTC timestamp when the generation occurred.

Defaults to current UTC time if not specified. Stored in ISO 8601 format in JSON (e.g., "2024-01-15T10:30:00Z").

Type:: datetime

Notes

Reproducibility Guarantee: Given the same model, actual_seed, and all parameters, Ollama should produce identical output. However, this assumes:

Same model weights (model hasn’t been updated)
Same Ollama version
Same hardware (some models may have platform-specific behavior)

Field Naming: The field is called actual_seed (not seed) to emphasize that it contains the seed that was actually used, which may differ from what the user requested if they chose random mode (-1).

Examples

Create metadata for a generation with random seed:

>>> info = OllamaGenerationInfo(
...     model="gemma2:2b",
...     actual_seed=1706234567,  # Random seed that was generated
...     template_id="ledgerfall_goblin",
...     temperature=0.7,
...     top_k=40,
...     top_p=0.9,
...     num_ctx=4096,
...     num_predict=512,
...     system_prompt="You are a creative writer...",
...     user_prompt="Describe a dark cellar",
... )
>>> info.model
'gemma2:2b'
>>> info.actual_seed
1706234567

Create metadata for a fixed-seed generation:

>>> info = OllamaGenerationInfo(
...     model="llama3:8b",
...     actual_seed=42,  # User specified seed 42 explicitly
...     template_id="ledgerfall_goblin",
...     temperature=0.5,
...     top_k=30,
...     top_p=0.8,
...     num_ctx=2048,
...     num_predict=256,
...     system_prompt="Write concise descriptions.",
...     user_prompt="Describe the main hall",
... )

Serialize to dictionary for storage:

>>> data = info.model_dump()
>>> data["model"]
'gemma2:2b'
>>> isinstance(data["generated_at"], datetime)
True

Serialize to JSON-compatible dictionary (datetime as ISO string):

>>> import json
>>> data = info.model_dump(mode="json")
>>> isinstance(data["generated_at"], str)
True