Guide — ARG Ontology Construction & Audit

(outside ARG protocol and Context Weaver architecture)

1. Purpose of this section

This guide aims to reduce the risk of ontological bootstrapping.
If the M2M foundation (Typologies ↔ Clusters ↔ Labels ↔ Nodes) is poorly laid, the system will continuously "correct" offline… but on a shaky foundation.

The goal is therefore to:

build quickly but cleanly,
detect gaps and contradictions early,
have an Audit mode capable of saying:
"Stop: the structure must be rebuilt before AI."

This guide complements the protocol described in ARG Core, without duplicating it: it focuses on design rules before production deployment and before automatic evolution.

2. Context typology design rules (v1)

Context typologies are outside the graph.
They serve to parameterize inference, not to carry business knowledge itself.

Design objective:

a typology must be stable,
transversal,
non-redundant,
capable of surviving product evolution.

Architecture reminder: a typology never replaces a manager.
Real governance remains in the Policy Manager,
and taxonomic coherence in the Context Weaver.

2.1 Climb as high as possible in the contextual family

When you create a typology, you must seek the level of generality that:

crosses the majority of prompts,
does not depend on a specific module,
does not depend on a versioned product,
is not a single use case,
is not an action verb.

Control question:
"What is above this?"
You climb until you obtain a stable and durable family.

Heuristic:
If the typology name resembles:

a verb ("cancel", "configure", "activate"),
a module ("billing", "SSO", "CRM"),
a feature ("CSV export", "API webhook"),

→ you are too low.
This level must be managed by cluster/labels/nodes, not by typology
(see the branches/leaves model in Guides).

Stability test:
"If the product changes name, version, or features, does this typology remain valid?"

If yes → good level.
If no → too low.

2.2 Zero duplication between typologies

Two typologies should never explain the same thing.

Each typology must have:

a unique scope,
a short non-overlapping definition,
a clear operational output
(what it modifies in the inference pipeline).

Anti-overlap test:
If an inference rule could be attached indifferently to two typologies,
you have a design problem.

2.3 Recommended families (consistent with your manager model)

You can formalize a "canonical" set:

Master typologies (main rails)

Intention
Complexity
Action Mode (or Execution Mode)

Transversal typologies
4. Policy
5. Trust (Human Touch)
6. Memory Mode
7. UX/UI Style (if your product justifies it)
8. Sales/Commercial (only if your agent has a real commercial role)

Logical order rule:

Policy is at the top of governance invariants
(realized by the Policy Manager engine).
Trust/Level modulate human assistance.
Intention/Complexity/Action drive business reasoning
(orchestrated in ARG Core).
Memory and UX drive output and writing
(cf. online/offline memory in ARG Core).

2.4 "Master" typologies

A master typology is present in almost all interactions.

Role of master typologies:

serve as a navigation backbone outside the graph,
frame the complexity budget,
stabilize stop decisions and Fast Track,
facilitate coverage audit.

Natural examples:

Intention
Complexity
Action Mode
Policy (even if its engine is separate)

2.5 "Manager vs Typology" rule

You have three coherent layers:

Stand-alone Managers

Context typologies

inference profiles and parameters
stable families

Taxonomy + Graph

clusters/labels/nodes/edges
structural core exploited by ARG Core

Golden rule:
A typology must not do the work of a manager.

The Policy typology can exist as a profile,
but the decision logic is in the Policy Manager.
The Trust Manager / Level Scorer live as typologies
in a "Human Touch / User Model" family.

2.6 Explicit "Human Touch" rule

To avoid confusion, create a higher family:

Human Touch / User Model

Level Scorer (competence)
Trust Manager (autonomy & prudence)
Human Preferences (style & format)

Anti-confusion rules:

Trust ≠ Policy
Level ≠ Intention
Preferences ≠ Complexity

2.7 Common anti-patterns

Typology too low

"Subscription cancellation"
→ must be label/node, not typology.

Duplicate typology

"Security Context" and "Policy"
→ merge or clarify scope.
Generally:
- Product security = cluster/labels
- Policy = transversal governance

Typology that encodes product

"Salesforce Mode", "SAP Mode"
→ prefer capture via product labels
or policy by project.

Catch-all typology

"Misc", "Other"
→ forbidden.

2.8 Quick quality tests

Checklist for each new typology:

Is it stable over 12–24 months?
Is it activatable in >20–30% of prompts?
Does it modify a measurable inference decision?
Can you write a one-sentence definition without overlap?
Does it not duplicate a manager?

If an answer is "no",
the typology is probably misplaced.

2.9 Ultra-short summary

Climb high: a typology is a stable family, not a feature.
Zero duplication: unique scope, unique inference rule.
Masters = main system rails.
Policy and Context Weaver are managers; typologies don't replace them.
Trust/Level/Preferences must be grouped in "Human Touch / User Model".

3. Build taxonomy quickly: the "trees-first" method

Trees don't carry transversal relationships,
but they are unbeatable for:

construction speed,
visualization of gaps,
initial M2M stability.

3.1 Practical steps

Create the cluster tree

clusters = large stable domains.

Add labels in cascade

label root → label parent → label child → …

Ensure semantic coherence

a child label must be definable without re-defining its parent.

Spot gaps

empty branches where you know the business has frequent cases.

The operational validation of this tree is then "consumed" by the
Context Weaver.

3.2 Why "trees-first" is crucial for your M2M

Because it reduces exponential risk:

you lay down a hierarchical backbone
before multiplying M2M attachments to nodes.

4. Build the node graph from taxonomy

4.1 Basic rule

A node is an operational object:

title
chunk
M2M with clusters/labels/edges

But you want to avoid the "big bang M2M".

4.2 Healthy MVP method

Create "root" nodes first
for the most used label roots.
Attach child labels progressively.
Add semantic edges afterwards.

In other words:
don't try to fill the complete M2M on day 1.

The traversal and scoring logic depends on this structural cleanliness
and is formalized in ARG Core.

5. "Ontological Audit" mode (pre-AI)

This is the answer to bootstrapping risk.
The audit is a mechanism before:

activating ARG in production,
or allowing offline to systematically correct.

5.1 When to activate it

Before production deployment.
Before letting the offline loop industrialize evolutions
(cf. lifecycle in ARG Core).

5.2 What the audit must measure

A) Coverage audit

% of reference business prompts covered by at least:

one cluster
one label root
one child label

Alert signal:
too many prompts fall into OOD or onto a catch-all cluster.

B) Granularity audit

clusters too vast?
child labels too rare?
inconsistent depth
(one branch at 6 levels and one at 1)

Alert signal:
a single cluster absorbs too much intention diversity.

C) Redundancy audit

quasi-synonymous labels under different parents
duplicate context typologies

Alert signal:
if two branches solve the same need.

D) Contradiction audit

impossible or ambiguous label combinations
systematic policy vs semantic conflicts

Alert signal:
the system spends its time arbitrating and rejecting.

Policy vs taxonomy conflicts must be handled
with Policy Manager rules
and strict Context Weaver validation.

5.3 "Stop Criteria" (the real red button)

You can formalize a clear message to the business:
"Stop and rebuild the initial structure" if you simultaneously observe:

a catch-all cluster attracting a large share of requests,
high label entropy under this cluster,
low specialization capacity,
structural increase in OOD despite correct business vocabulary.

This is typically the sign of a poor initial breakdown.

6. Good hygiene rules to avoid exponential growth

6.1 A typology ≠ a module

A context typology is a family of reading the world,
not a technical brick.

Therefore:

Context Weaver is not a typology:
it's a decomposition engine + taxonomic coherence guardian
(see Context Weaver).

6.2 Prefer taxonomic evolution before topology

When you discover a gap:

add a child label
properly reattach to the parent cluster
only then create a new node

This maintains:

stability
readability
reduced M2M debt

7. Concrete deliverables to request from business

To properly industrialize:

Map of master typologies + definitions
Cluster tree
Label trees per cluster
"Golden set" test prompts list
- frequent questions
- edge cases
- voluntary out-of-scope

8. Operational summary

Typologies: climb high, zero duplication, explicit masters.
Taxonomy: trees-first for speed + gap detection.
Nodes: progressive MVP, avoid big bang M2M.
Pre-AI audit: coverage / granularity / redundancy / contradictions.
Stop criteria: cluster too large + recurring OOD + high entropy
⇒ mandatory rebuild.

Guide — ARG Ontology Construction & Audit ​

1. Purpose of this section ​

2. Context typology design rules (v1) ​

2.1 Climb as high as possible in the contextual family ​

2.2 Zero duplication between typologies ​

2.3 Recommended families (consistent with your manager model) ​

2.4 "Master" typologies ​

2.5 "Manager vs Typology" rule ​

2.6 Explicit "Human Touch" rule ​

2.7 Common anti-patterns ​

2.8 Quick quality tests ​

2.9 Ultra-short summary ​

3. Build taxonomy quickly: the "trees-first" method ​

3.1 Practical steps ​

3.2 Why "trees-first" is crucial for your M2M ​

4. Build the node graph from taxonomy ​

4.1 Basic rule ​

4.2 Healthy MVP method ​

5. "Ontological Audit" mode (pre-AI) ​

5.1 When to activate it ​

5.2 What the audit must measure ​

A) Coverage audit ​

B) Granularity audit ​

C) Redundancy audit ​

D) Contradiction audit ​

5.3 "Stop Criteria" (the real red button) ​

6. Good hygiene rules to avoid exponential growth ​

6.1 A typology ≠ a module ​

6.2 Prefer taxonomic evolution before topology ​

7. Concrete deliverables to request from business ​

8. Operational summary ​