Energy-Flow Cosmology (EFC) — Model Comparison

Pitch Validation Likelihood Evaluation Models White Paper Roadmap Gaps External Predictions Atlas Changelog

Morten Magnusson · Symbiose Research, Sandnes, Norway · ORCID: 0009-0002-4860-5095 · April 2026 · CC-BY-4.0

Status: This ledger closes the cross-model comparison gap from the 2026-04-23 system audit: EFC has not had a structured, apples-to-apples evaluation against ΛCDM and competing modified-gravity models on identical likelihoods. Rows here report Δχ², ΔAIC, ΔBIC, and ln K with the reference model declared. Scope today: comparison protocol frozen, model registry seeded with placeholders, zero runs yet executed.


1. Purpose

The Validation Ledger shows how EFC scores individually. This ledger shows how EFC scores against alternatives on identical data with identical likelihood and identical nuisance-parameter treatment. Anything else is apples-to-oranges and cannot feed model_preference in the Evaluation Ledger.


2. Comparison protocol (frozen)

A comparison row is valid only if all four conditions hold:

  1. Same dataset (same DOI, same data version)
  2. Same likelihood code (same likelihood_id from the Likelihood Ledger)
  3. Same nuisance parameters and priors
  4. Different theoretical model (the only varying axis)

Any deviation from these must be flagged in the row's caveats list. Comparisons that fail the protocol are rejected by the sync script.


3. Reported quantities

QuantityRequiredNotes
chi2 per modelyesAt best-fit point
Δchi2 (model − reference)yesReference declared explicitly per row
AIC, BIC per modelyesUses n_params from model registry
ΔAIC, ΔBICyesReference model is baseline
ln Z (Bayesian evidence) per modelwhen availableRequires nested sampling
ln K = ln Z_A − ln Z_Bwhen both ln Z availableReported with Jeffreys-scale interpretation
n_params per modelyesCosmological free parameters only
Reference modelyesDeclared per row, typically ΛCDM

Jeffreys-scale interpretations are recorded verbatim in interpretation — never paraphrased.


4. Model registry

model_idDescriptionn_free_params (cosmo)Status
lcdmStandard ΛCDM (reference)6registered
efc_v3EFC current frozen versionTBDregistered
horndeski_minMinimal Horndeski (αB, αM free)8placeholder
f_of_r_hu_sawickif(R) Hu–Sawicki7placeholder
dgp_normalDGP normal branch6placeholder
wcdmwCDM (constant w)7placeholder

Each registered model must declare its likelihood-compatible code path (e.g. MGCAMB module name, or analytic fσ8 implementation) in data/models.json.


5. Comparison entry template

Template: fσ8 2026 — EFC vs ΛCDM placeholder

{
  "comparison_id": "fsigma8_2026__efc_vs_lcdm",
  "likelihood_id": "s8_growth_fsigma8_2026",
  "reference_model": "lcdm",
  "models": [
    {"model_id": "efc_v3",  "chi2": null, "aic": null, "bic": null, "ln_z": null, "n_params": null},
    {"model_id": "lcdm",    "chi2": null, "aic": null, "bic": null, "ln_z": null, "n_params": 6}
  ],
  "delta_chi2": null,
  "delta_aic":  null,
  "delta_bic":  null,
  "ln_k":       null,
  "interpretation": "PENDING",
  "caveats": [],
  "status": "declared"
}

Once the run executes, numeric fields populate and interpretation receives a verbatim Jeffreys-scale phrase (e.g. "positive evidence for EFC", "strong evidence against EFC", "inconclusive").


6. Interaction with the Evaluation Ledger

Model Comparison results feed only the model_preference field of the Evaluation Ledger current_state block — not falsification_status.

Comparison outcomemodel_preference set to
ΔAIC > +5 against EFC  or  ln K > +4.6 for ΛCDMprefers_lcdm
|ΔAIC| ≤ 5  and  |ln K| ≤ 1neutral
ΔAIC < −5 for EFC  or  ln K > +4.6 for EFCprefers_efc

Being dispreferred by AIC is not the same as being falsified. See Evaluation Ledger §5. Both fields are always reported.


7. External landscape anchor

The External Research Ledger tracks peer-reviewed constraints on MG models (Horndeski, f(R), DGP). When an external paper publishes a likelihood-compatible constraint, it can be imported here as a placeholder row with status: "declared" and caveats: ["external_posterior_import"] until a native run reproduces it.


8. Open questions

  1. Are competing models run inside this repo, or imported as external posteriors? Proposal: native first, external imports only flagged via caveats.
  2. For Horndeski / f(R), do we use MGCAMB defaults or repo-pinned configs? Proposal: repo-pinned configs with MGCAMB defaults documented as baseline.
  3. Should comparisons against scalar-tensor MG split by screening regime? Proposal: yes — separate comparison rows per regime, tied to the Validation Ledger regime matrix.

Bottom line

What this ledger fixesMakes EFC-vs-alternatives quantitative on identical likelihoods. No more scattered, informal "looks comparable" claims.
Hard ruleOnly rows meeting the four protocol conditions (§2) are valid. Any deviation must appear in caveats.
Hard dependencyLikelihood Ledger row with a shared likelihood_id.
Primary consumerEvaluation Ledgermodel_preference only.
StatusProtocol frozen; registry seeded with ΛCDM + EFC registered and four MG placeholders; zero runs executed.

© 2026 Energy-Flow Cosmology Initiative · Model Comparison — same data, same likelihood, different model