Analytics and Export Formats

PHIDS telemetry analytics convert a live ecological run into a compact, tabular record suitable for comparison, inspection, and export. This chapter documents the current TelemetryRecorder model, the meanings of the collected fields, and the formats exposed by the export layer.

Role of `TelemetryRecorder`

The telemetry analytics layer is implemented in src/phids/telemetry/analytics.py through TelemetryRecorder.

Its job is deliberately narrow and stable:

sample the ECS world after a completed tick,
aggregate a small set of canonical ecological metrics,
cache those rows in memory,
expose the result as a Polars DataFrame for export and rendering.

This makes telemetry the principal summary-scale artifact of a PHIDS run.

Current Runtime Position

Within SimulationLoop.step(), telemetry recording happens only after:

flow-field generation,
lifecycle,
interaction,
signaling.

Therefore each telemetry row describes the post-phase state of that tick rather than an intermediate state.
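
The ordering above can be illustrated with a minimal sketch. The phase names are taken from the list above, but `make_phase`, `step`, and the log list are hypothetical stand-ins for illustration only, not the real `SimulationLoop` code.

```python
from typing import Callable, List

# Hypothetical stand-in for a real phase function; only the ordering matters here.
def make_phase(name: str, log: List[str]) -> Callable[[], None]:
    def phase() -> None:
        log.append(name)
    return phase

def step(log: List[str], tick: int) -> None:
    """Run one tick: all ecological phases first, telemetry last."""
    for name in ("flow_field", "lifecycle", "interaction", "signaling"):
        make_phase(name, log)()
    # Telemetry samples the world only after every phase has finished,
    # so the recorded row reflects the post-phase state of this tick.
    log.append(f"telemetry(tick={tick})")

log: List[str] = []
step(log, tick=0)
print(log)
```

Because the telemetry call is the final entry for each tick, a recorded row can never describe a half-updated world.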

Recorded Fields

The current implementation records exactly five columns per tick.

`tick`

The tick index associated with the recorded row.

`total_flora_energy`

The sum of plant.energy across all live PlantComponent entities.

`flora_population`

The count of live plant entities.

`predator_clusters`

The count of live swarm entities.

`predator_population`

The sum of swarm.population across all live SwarmComponent entities.

These fields intentionally span both abundance and resource-state perspectives.
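
As a rough illustration of how the five columns relate to the underlying entities, the sketch below aggregates one row from plain Python objects. `Plant`, `Swarm`, and `aggregate_row` are hypothetical stand-ins; the real recorder queries `PlantComponent` and `SwarmComponent` entities from the ECS world.

```python
from dataclasses import dataclass
from typing import Any, Dict, Iterable

@dataclass
class Plant:
    """Hypothetical stand-in for PlantComponent."""
    energy: float

@dataclass
class Swarm:
    """Hypothetical stand-in for SwarmComponent."""
    population: int

def aggregate_row(tick: int, plants: Iterable[Plant], swarms: Iterable[Swarm]) -> Dict[str, Any]:
    """Build one telemetry row with the five documented columns."""
    plants = list(plants)
    swarms = list(swarms)
    return {
        "tick": tick,
        "total_flora_energy": float(sum(p.energy for p in plants)),
        "flora_population": len(plants),       # count of plant entities
        "predator_clusters": len(swarms),      # count of swarm entities
        "predator_population": sum(s.population for s in swarms),
    }

row = aggregate_row(3, [Plant(10.0), Plant(2.5)], [Swarm(40), Swarm(5), Swarm(15)])
print(row)
```

Note that `predator_clusters` and `predator_population` differ whenever any swarm holds more than one individual: the first counts entities, the second sums their sizes.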

Analytical Interpretation

The current telemetry fields support several classes of interpretation.

Resource trajectory

total_flora_energy approximates the aggregate energetic capacity of the plant layer.

Occupancy / persistence

flora_population and predator_clusters indicate how many discrete entities remain in play.

Pressure / biomass proxy

predator_population provides a coarse measure of herbivore pressure on the system.

Together, these metrics form a compact Lotka–Volterra-style observability surface for comparing runs.
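
One way to use these fields when comparing runs is to derive simple ratios from a single row. The sketch below is an assumption about how an analyst might do this; the derived names (`mean_energy_per_plant`, `mean_swarm_size`, `predators_per_energy_unit`) are hypothetical and are not columns that PHIDS records.

```python
from typing import Any, Dict

def derive_ratios(row: Dict[str, Any]) -> Dict[str, float]:
    """Compute illustrative per-entity ratios from one telemetry row."""
    plants = row["flora_population"]
    clusters = row["predator_clusters"]
    energy = row["total_flora_energy"]
    return {
        # Resource state per plant; 0.0 when no plants remain.
        "mean_energy_per_plant": energy / plants if plants else 0.0,
        # Average individuals per swarm entity; 0.0 when no swarms remain.
        "mean_swarm_size": row["predator_population"] / clusters if clusters else 0.0,
        # Coarse pressure proxy: individuals per unit of flora energy.
        "predators_per_energy_unit": row["predator_population"] / energy if energy else 0.0,
    }

sample = {
    "tick": 10,
    "total_flora_energy": 120.0,
    "flora_population": 8,
    "predator_clusters": 3,
    "predator_population": 45,
}
ratios = derive_ratios(sample)
print(ratios)
```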

In-Memory Representation

TelemetryRecorder stores rows first in a Python list of dictionaries and materializes a Polars DataFrame lazily.

Current behavior:

each record() call appends one metrics row,
the cached dataframe is invalidated,
dataframe rebuilds the Polars structure on demand,
get_latest_metrics() exposes the most recent row for live UI or diagnostics use.

This design keeps per-tick recording simple while preserving a convenient tabular export interface.

Empty DataFrame Semantics

When no telemetry has yet been recorded, TelemetryRecorder.dataframe still returns a DataFrame with stable schema:

tick: Int64
total_flora_energy: Float64
flora_population: Int64
predator_clusters: Int64
predator_population: Int64

This is important because it gives the export and UI layers a consistent typed structure even before any ticks have executed.

Export Layer

The export helpers in src/phids/telemetry/export.py expose four current functions:

export_csv(df, path)
export_json(df, path)
export_bytes_csv(df)
export_bytes_json(df)

These helpers treat telemetry as tabular data rather than as a custom PHIDS-specific binary format.

File Formats

CSV

CSV is the simplest tabular interchange format exposed by PHIDS. It is well-suited for spreadsheet inspection and downstream plotting pipelines.

NDJSON

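PHIDS’s JSON export currently uses newline-delimited JSON (NDJSON), not a single top-level JSON array. This matters for consumers: each line is a separate JSON object that must be parsed on its own, which suits streaming and row-oriented processing, whereas a parser that expects one top-level JSON document will reject the output.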

The API route docstrings and helper names make this explicit.
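
The practical difference for a consumer can be shown with the standard library alone. The sample text below is invented for illustration and carries only two of the five columns.

```python
import json
from typing import Any, Dict, List

# Invented sample: one JSON object per line, no enclosing array.
ndjson_text = (
    '{"tick": 0, "total_flora_energy": 12.5}\n'
    '{"tick": 1, "total_flora_energy": 11.0}\n'
)

def parse_ndjson(text: str) -> List[Dict[str, Any]]:
    """Parse NDJSON by decoding each non-empty line separately."""
    return [json.loads(line) for line in text.splitlines() if line.strip()]

rows = parse_ndjson(ndjson_text)

# Treating the whole payload as a single JSON document fails,
# because a second object follows the first.
try:
    json.loads(ndjson_text)
    parsed_as_single_document = True
except json.JSONDecodeError:
    parsed_as_single_document = False

print(rows)
print(parsed_as_single_document)
```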

API Export Surface

The telemetry export routes in src/phids/api/main.py are:

GET /api/telemetry/export/csv
GET /api/telemetry/export/json

Current behavior:

both operate on the live loop’s telemetry dataframe,
CSV is returned with text/csv,
JSON export is returned as application/x-ndjson,
both use download-oriented Content-Disposition headers.
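
A client can dispatch on the response's content type to decode either export. The sketch below is an assumption about how such a client might look: `decode_export` and `fetch_export` are hypothetical helpers, and `base_url` must be supplied by the caller because this chapter does not specify a host or port.

```python
import csv
import io
import json
from typing import Any, Dict, List
from urllib.request import urlopen

def decode_export(content_type: str, body: bytes) -> List[Dict[str, Any]]:
    """Decode an export response body into a list of row dictionaries."""
    text = body.decode("utf-8")
    # Ignore parameters such as "; charset=utf-8".
    media_type = content_type.split(";")[0].strip().lower()
    if media_type == "text/csv":
        # CSV carries no types, so every value arrives as a string.
        return list(csv.DictReader(io.StringIO(text)))
    if media_type == "application/x-ndjson":
        return [json.loads(line) for line in text.splitlines() if line.strip()]
    raise ValueError(f"unexpected content type: {content_type!r}")

def fetch_export(base_url: str, fmt: str) -> List[Dict[str, Any]]:
    """Fetch /api/telemetry/export/<fmt> (fmt is "csv" or "json") and decode it."""
    with urlopen(f"{base_url}/api/telemetry/export/{fmt}") as resp:
        return decode_export(resp.headers.get("Content-Type", ""), resp.read())

# Offline demonstration with invented payloads; no server is contacted.
csv_rows = decode_export("text/csv; charset=utf-8", b"tick,flora_population\n0,4\n")
json_rows = decode_export("application/x-ndjson", b'{"tick": 0, "flora_population": 4}\n')
```

The CSV path yields strings while the NDJSON path preserves numeric types, which is worth keeping in mind when choosing a format for downstream analysis.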

UI Telemetry Surface

PHIDS also exposes telemetry in a separate UI-oriented form through:

GET /api/telemetry

This route does not return raw tabular data. Instead, it builds an SVG chart fragment and associated summary context for the HTMX-polled dashboard.

This is an important current distinction:

/api/telemetry/export/* is for external analysis artifacts,
/api/telemetry is for live operator-facing visualization.

Artifact Lifecycle

The current telemetry artifact flow can be summarized as follows:

flowchart TD
    A[SimulationLoop.step completes ecological phases] --> B["TelemetryRecorder.record(world, tick)"]
    B --> C[Row appended to in-memory telemetry buffer]
    C --> D[Polars DataFrame materialized on demand]
    D --> E1[CSV / NDJSON export helpers]
    D --> E2[HTMX telemetry SVG builder]

This diagram emphasizes that one telemetry source feeds both external export and live visualization.

Evidence from Tests

The current test suite verifies key telemetry/export behaviors.

Loop integration

tests/test_termination_and_loop.py verifies that stepping a simulation updates telemetry and produces at least one telemetry row.

API export behavior

tests/test_additional_coverage.py verifies that CSV and NDJSON export routes return usable data.

File and bytes export helpers

tests/test_additional_coverage.py also exercises the file-writing and bytes-returning helper functions directly.

UI telemetry chart context

tests/test_ui_routes.py verifies the UI telemetry refresh path and empty-state behavior.

Methodological Limits of the Current Analytics Layer

The current telemetry layer should be described precisely:

it captures a compact summary, not every derived ecological statistic,
it does not presently preserve per-species telemetry breakdowns,
it focuses on run comparison and diagnostics rather than full state reconstruction,
its JSON export is NDJSON rather than a custom nested schema.

These are deliberate features of the current implementation surface.

Verified Current-State Evidence

src/phids/telemetry/analytics.py
src/phids/telemetry/export.py
src/phids/api/main.py
tests/test_termination_and_loop.py
tests/test_additional_coverage.py
tests/test_ui_routes.py

Where to Read Next

For replay files and tick snapshots: replay-and-termination-semantics.md
For the high-level telemetry overview: index.md
For engine-side snapshot production: ../engine/index.md