review-steps / test_review_preserves

Generated: 2026-04-03 11:11:54 UTC | Model: claude-haiku-4-5-20251001 | Sessions: 1

#	Session ID	Started	Ended	Tools	Rejected	Prompts	Plans	Reports	Turns	In Tokens	Out Tokens	Cost	Duration
1	61b29dd1-a7d	2026-04-03T11:11:08Z	2026-04-03T11:11:54Z	20	0	6	0	0	4	26	1,178	$0.0191	12.6s

#	Time	Type	Detail show 20 ok results
1	+0.0s [+0.0s]	user_prompt	user_prompt: Review the document `draft.md` following the review-steps process. Complete the…Review the document `draft.md` following the review-steps process. Complete these stages, applying corrections directly to the file: 1. Read and understand — identify the document's function, audience, and goal 2. Language and consistency — fix spelling, grammar, punctuation; ensure consistent terminology and patterns 3. Conceptual clarity — fix incomplete sentences, unclear phrasing; expand acronyms on first use Apply all corrections directly to `draft.md` and save the result.▴
2	+0.0s [+0.0s]	user_prompt	user_prompt: Review the document `draft.md` following the review-steps process. Complete the…Review the document `draft.md` following the review-steps process. Complete these stages, applying corrections directly to the file: 1. Read and understand — identify the document's function, audience, and goal 2. Language and consistency — fix spelling, grammar, punctuation; ensure consistent terminology and patterns 3. Conceptual clarity — fix incomplete sentences, unclear phrasing; expand acronyms on first use Apply all corrections directly to `draft.md` and save the result.▴
3	+2.0s [+2.0s]	tool_call	Read file_path: "/tmp/pytest-sandbox/test_review_preserves_vocabula0/project/draft.md"
4	+2.0s [+0.0s]	tool_call	Read file_path: "/tmp/pytest-sandbox/test_review_preserves_vocabula0/project/draft.md"
5	+2.0s [+0.0s]	tool_result	PASS
6	+2.0s [+0.0s]	tool_result	PASS
7	+7.0s [+5.0s]	tool_call	Skill skill: "review-steps" args: "/tmp/pytest-sandbox/test_review_preserves_vocabula0/project/draft.md"
8	+7.0s [+0.0s]	tool_call	Skill skill: "review-steps" args: "/tmp/pytest-sandbox/test_review_preserves_vocabula0/project/draft.md"
9	+7.0s [+0.0s]	tool_result	PASS
10	+7.0s [+0.0s]	tool_result	PASS
11	+13.0s [+6.0s]	user_prompt	user_prompt: Please proceed as suggested.
12	+13.0s [+0.0s]	user_prompt	user_prompt: Please proceed as suggested.
13	+21.0s [+8.0s]	user_prompt	user_prompt: Please proceed as suggested.
14	+21.0s [+0.0s]	user_prompt	user_prompt: Please proceed as suggested.
15	+24.0s [+3.0s]	tool_call	Read file_path: "/tmp/pytest-sandbox/test_review_preserves_vocabula0/project/draft.md"
16	+24.0s [+0.0s]	tool_call	Read file_path: "/tmp/pytest-sandbox/test_review_preserves_vocabula0/project/draft.md"
17	+24.0s [+0.0s]	tool_result	PASS
18	+24.0s [+0.0s]	tool_result	PASS
19	+26.0s [+2.0s]	tool_call	Edit file_path: "/tmp/pytest-sandbox/test_review_preserves_vocabula0/project/draft.md" old_string: Event sourcing stores state as an append-only log of domain events rather than mutable records. Each aggregate emits events that are persisted to an event store, and current state is derived by replay…Event sourcing stores state as an append-only log of domain events rather than mutable records. Each aggregate emits events that are persisted to an event store, and current state is derived by replaying the event stream from a given sequence number.▴ new_string: Event sourcing stores state as an append-only log of domain events rather than mutable records. Each aggregate (a clustered entity representing a specific domain concept) emits events that are persist…Event sourcing stores state as an append-only log of domain events rather than mutable records. Each aggregate (a clustered entity representing a specific domain concept) emits events that are persisted to an event store, and current state is derived by replaying the event stream from a given sequence number.▴ replace_all: false
20	+26.0s [+0.0s]	tool_call	Edit file_path: "/tmp/pytest-sandbox/test_review_preserves_vocabula0/project/draft.md" old_string: Event sourcing stores state as an append-only log of domain events rather than mutable records. Each aggregate emits events that are persisted to an event store, and current state is derived by replay…Event sourcing stores state as an append-only log of domain events rather than mutable records. Each aggregate emits events that are persisted to an event store, and current state is derived by replaying the event stream from a given sequence number.▴ new_string: Event sourcing stores state as an append-only log of domain events rather than mutable records. Each aggregate (a clustered entity representing a specific domain concept) emits events that are persist…Event sourcing stores state as an append-only log of domain events rather than mutable records. Each aggregate (a clustered entity representing a specific domain concept) emits events that are persisted to an event store, and current state is derived by replaying the event stream from a given sequence number.▴ replace_all: false
21	+26.0s [+0.0s]	tool_result	PASS
22	+26.0s [+0.0s]	tool_result	PASS
23	+29.0s [+3.0s]	tool_call	Edit file_path: "/tmp/pytest-sandbox/test_review_preserves_vocabula0/project/draft.md" old_string: Projections (also called read models) materialized views that are built by processing the event stream. When a projection's schema changes, a full rebuild is required — the projection consumer resets …Projections (also called read models) materialized views that are built by processing the event stream. When a projection's schema changes, a full rebuild is required — the projection consumer resets it's offset to zero, truncates the target store, and replays every event.▴ new_string: Projections (also called read models) are materialized views that are built by processing the event stream. When a projection's schema changes, a full rebuild is required — the projection consumer res…Projections (also called read models) are materialized views that are built by processing the event stream. When a projection's schema changes, a full rebuild is required — the projection consumer resets its offset to zero, truncates the target store, and replays every event.▴ replace_all: false
24	+29.0s [+0.0s]	tool_call	Edit file_path: "/tmp/pytest-sandbox/test_review_preserves_vocabula0/project/draft.md" old_string: Projections (also called read models) materialized views that are built by processing the event stream. When a projection's schema changes, a full rebuild is required — the projection consumer resets …Projections (also called read models) materialized views that are built by processing the event stream. When a projection's schema changes, a full rebuild is required — the projection consumer resets it's offset to zero, truncates the target store, and replays every event.▴ new_string: Projections (also called read models) are materialized views that are built by processing the event stream. When a projection's schema changes, a full rebuild is required — the projection consumer res…Projections (also called read models) are materialized views that are built by processing the event stream. When a projection's schema changes, a full rebuild is required — the projection consumer resets its offset to zero, truncates the target store, and replays every event.▴ replace_all: false
25	+29.0s [+0.0s]	tool_result	PASS
26	+29.0s [+0.0s]	tool_result	PASS
27	+32.0s [+3.0s]	tool_call	Edit file_path: "/tmp/pytest-sandbox/test_review_preserves_vocabula0/project/draft.md" old_string: Snapshot compaction reduces replay cost by periodically serializing aggregate state at a known sequence number. On load, the system reads the latest snapshot and only replays events after the snapshot…Snapshot compaction reduces replay cost by periodically serializing aggregate state at a known sequence number. On load, the system reads the latest snapshot and only replays events after the snapshot's sequence number, this avoids processing the complete history.▴ new_string: Snapshot compaction reduces replay cost by periodically serializing aggregate state at a known sequence number. On load, the system reads the latest snapshot and only replays events after the snapshot…Snapshot compaction reduces replay cost by periodically serializing aggregate state at a known sequence number. On load, the system reads the latest snapshot and only replays events after the snapshot's sequence number; this avoids processing the complete history.▴ replace_all: false
28	+32.0s [+0.0s]	tool_call	Edit file_path: "/tmp/pytest-sandbox/test_review_preserves_vocabula0/project/draft.md" old_string: Snapshot compaction reduces replay cost by periodically serializing aggregate state at a known sequence number. On load, the system reads the latest snapshot and only replays events after the snapshot…Snapshot compaction reduces replay cost by periodically serializing aggregate state at a known sequence number. On load, the system reads the latest snapshot and only replays events after the snapshot's sequence number, this avoids processing the complete history.▴ new_string: Snapshot compaction reduces replay cost by periodically serializing aggregate state at a known sequence number. On load, the system reads the latest snapshot and only replays events after the snapshot…Snapshot compaction reduces replay cost by periodically serializing aggregate state at a known sequence number. On load, the system reads the latest snapshot and only replays events after the snapshot's sequence number; this avoids processing the complete history.▴ replace_all: false
29	+32.0s [+0.0s]	tool_result	PASS
30	+32.0s [+0.0s]	tool_result	PASS
31	+35.0s [+3.0s]	tool_call	Edit file_path: "/tmp/pytest-sandbox/test_review_preserves_vocabula0/project/draft.md" old_string: Read-model denormalization optimizes query performance by pre-computing flattened projections. e.g. an OrderSummary projection might join data from OrderPlaced, ItemAdded and PaymentReceived events in…Read-model denormalization optimizes query performance by pre-computing flattened projections. e.g. an OrderSummary projection might join data from OrderPlaced, ItemAdded and PaymentReceived events into a single document.▴ new_string: Read-model denormalization optimizes query performance by pre-computing flattened projections. E.g., an OrderSummary projection might join data from OrderPlaced, ItemAdded and PaymentReceived events i…Read-model denormalization optimizes query performance by pre-computing flattened projections. E.g., an OrderSummary projection might join data from OrderPlaced, ItemAdded and PaymentReceived events into a single document.▴ replace_all: false
32	+35.0s [+0.0s]	tool_call	Edit file_path: "/tmp/pytest-sandbox/test_review_preserves_vocabula0/project/draft.md" old_string: Read-model denormalization optimizes query performance by pre-computing flattened projections. e.g. an OrderSummary projection might join data from OrderPlaced, ItemAdded and PaymentReceived events in…Read-model denormalization optimizes query performance by pre-computing flattened projections. e.g. an OrderSummary projection might join data from OrderPlaced, ItemAdded and PaymentReceived events into a single document.▴ new_string: Read-model denormalization optimizes query performance by pre-computing flattened projections. E.g., an OrderSummary projection might join data from OrderPlaced, ItemAdded and PaymentReceived events i…Read-model denormalization optimizes query performance by pre-computing flattened projections. E.g., an OrderSummary projection might join data from OrderPlaced, ItemAdded and PaymentReceived events into a single document.▴ replace_all: false
33	+35.0s [+0.0s]	tool_result	PASS
34	+35.0s [+0.0s]	tool_result	PASS
35	+37.0s [+2.0s]	tool_call	Edit file_path: "/tmp/pytest-sandbox/test_review_preserves_vocabula0/project/draft.md" old_string: "Long-running business processes that span multiple aggregates are modelled as sagas (sometimes called process managers)." new_string: "Long-running business processes that span multiple aggregates are modeled as sagas (sometimes called process managers)." replace_all: false
36	+37.0s [+0.0s]	tool_call	Edit file_path: "/tmp/pytest-sandbox/test_review_preserves_vocabula0/project/draft.md" old_string: "Long-running business processes that span multiple aggregates are modelled as sagas (sometimes called process managers)." new_string: "Long-running business processes that span multiple aggregates are modeled as sagas (sometimes called process managers)." replace_all: false
37	+37.0s [+0.0s]	tool_result	PASS
38	+37.0s [+0.0s]	tool_result	PASS
39	+39.0s [+2.0s]	tool_call	Edit file_path: "/tmp/pytest-sandbox/test_review_preserves_vocabula0/project/draft.md" old_string: "Event throughput metrics alone provide a incomplete picture without consumer lag monitoring." new_string: "Event throughput metrics alone provide an incomplete picture without consumer lag monitoring." replace_all: false
40	+39.0s [+0.0s]	tool_call	Edit file_path: "/tmp/pytest-sandbox/test_review_preserves_vocabula0/project/draft.md" old_string: "Event throughput metrics alone provide a incomplete picture without consumer lag monitoring." new_string: "Event throughput metrics alone provide an incomplete picture without consumer lag monitoring." replace_all: false
41	+39.0s [+0.0s]	tool_result	PASS
42	+39.0s [+0.0s]	tool_result	PASS
43	+42.0s [+3.0s]	tool_call	Read file_path: "/tmp/pytest-sandbox/test_review_preserves_vocabula0/project/draft.md"
44	+42.0s [+0.0s]	tool_call	Read file_path: "/tmp/pytest-sandbox/test_review_preserves_vocabula0/project/draft.md"
45	+42.0s [+0.0s]	tool_result	PASS
46	+42.0s [+0.0s]	tool_result	PASS
		require	PASS no error
		expect	PASS document modified

Time

Type

Detail show 20 ok results

+0.0s [+0.0s]

user_prompt

user_prompt: Review the document `draft.md` following the review-steps process. Complete the…Review the document `draft.md` following the review-steps process. Complete these stages, applying corrections directly to the file: 1. **Read and understand** — identify the document's function, audience, and goal 2. **Language and consistency** — fix spelling, grammar, punctuation; ensure consistent terminology and patterns 3. **Conceptual clarity** — fix incomplete sentences, unclear phrasing; expand acronyms on first use Apply all corrections directly to `draft.md` and save the result.▴

+0.0s [+0.0s]

user_prompt

+2.0s [+2.0s]

tool_call

Read

file_path: "/tmp/pytest-sandbox/test_review_preserves_vocabula0/project/draft.md"

+2.0s [+0.0s]

tool_call

Read

file_path: "/tmp/pytest-sandbox/test_review_preserves_vocabula0/project/draft.md"

+2.0s [+0.0s]

tool_result

PASS

+2.0s [+0.0s]

tool_result

PASS

+7.0s [+5.0s]

tool_call

Skill

skill: "review-steps"
args: "/tmp/pytest-sandbox/test_review_preserves_vocabula0/project/draft.md"

+7.0s [+0.0s]

tool_call

Skill

skill: "review-steps"
args: "/tmp/pytest-sandbox/test_review_preserves_vocabula0/project/draft.md"

+7.0s [+0.0s]

tool_result

PASS

+7.0s [+0.0s]

tool_result

PASS

+13.0s [+6.0s]

user_prompt

user_prompt: Please proceed as suggested.

+13.0s [+0.0s]

user_prompt

user_prompt: Please proceed as suggested.

+21.0s [+8.0s]

user_prompt

user_prompt: Please proceed as suggested.

+21.0s [+0.0s]

user_prompt

user_prompt: Please proceed as suggested.

+24.0s [+3.0s]

tool_call

Read

file_path: "/tmp/pytest-sandbox/test_review_preserves_vocabula0/project/draft.md"

+24.0s [+0.0s]

tool_call

Read

file_path: "/tmp/pytest-sandbox/test_review_preserves_vocabula0/project/draft.md"

+24.0s [+0.0s]

tool_result

PASS

+24.0s [+0.0s]

tool_result

PASS

+26.0s [+2.0s]

tool_call

Edit

file_path: "/tmp/pytest-sandbox/test_review_preserves_vocabula0/project/draft.md"
old_string: Event sourcing stores state as an append-only log of domain events rather than mutable records. Each aggregate emits events that are persisted to an event store, and current state is derived by replay…Event sourcing stores state as an append-only log of domain events rather than mutable records. Each aggregate emits events that are persisted to an event store, and current state is derived by replaying the event stream from a given sequence number.▴
new_string: Event sourcing stores state as an append-only log of domain events rather than mutable records. Each aggregate (a clustered entity representing a specific domain concept) emits events that are persist…Event sourcing stores state as an append-only log of domain events rather than mutable records. Each aggregate (a clustered entity representing a specific domain concept) emits events that are persisted to an event store, and current state is derived by replaying the event stream from a given sequence number.▴
replace_all: false

+26.0s [+0.0s]

tool_call

Edit

file_path: "/tmp/pytest-sandbox/test_review_preserves_vocabula0/project/draft.md"
old_string: Event sourcing stores state as an append-only log of domain events rather than mutable records. Each aggregate emits events that are persisted to an event store, and current state is derived by replay…Event sourcing stores state as an append-only log of domain events rather than mutable records. Each aggregate emits events that are persisted to an event store, and current state is derived by replaying the event stream from a given sequence number.▴
new_string: Event sourcing stores state as an append-only log of domain events rather than mutable records. Each aggregate (a clustered entity representing a specific domain concept) emits events that are persist…Event sourcing stores state as an append-only log of domain events rather than mutable records. Each aggregate (a clustered entity representing a specific domain concept) emits events that are persisted to an event store, and current state is derived by replaying the event stream from a given sequence number.▴
replace_all: false

+26.0s [+0.0s]

tool_result

PASS

+26.0s [+0.0s]

tool_result

PASS

+29.0s [+3.0s]

tool_call

Edit

file_path: "/tmp/pytest-sandbox/test_review_preserves_vocabula0/project/draft.md"
old_string: Projections (also called read models) materialized views that are built by processing the event stream. When a projection's schema changes, a full rebuild is required — the projection consumer resets …Projections (also called read models) materialized views that are built by processing the event stream. When a projection's schema changes, a full rebuild is required — the projection consumer resets it's offset to zero, truncates the target store, and replays every event.▴
new_string: Projections (also called read models) are materialized views that are built by processing the event stream. When a projection's schema changes, a full rebuild is required — the projection consumer res…Projections (also called read models) are materialized views that are built by processing the event stream. When a projection's schema changes, a full rebuild is required — the projection consumer resets its offset to zero, truncates the target store, and replays every event.▴
replace_all: false

+29.0s [+0.0s]

tool_call

Edit

file_path: "/tmp/pytest-sandbox/test_review_preserves_vocabula0/project/draft.md"
old_string: Projections (also called read models) materialized views that are built by processing the event stream. When a projection's schema changes, a full rebuild is required — the projection consumer resets …Projections (also called read models) materialized views that are built by processing the event stream. When a projection's schema changes, a full rebuild is required — the projection consumer resets it's offset to zero, truncates the target store, and replays every event.▴
new_string: Projections (also called read models) are materialized views that are built by processing the event stream. When a projection's schema changes, a full rebuild is required — the projection consumer res…Projections (also called read models) are materialized views that are built by processing the event stream. When a projection's schema changes, a full rebuild is required — the projection consumer resets its offset to zero, truncates the target store, and replays every event.▴
replace_all: false

+29.0s [+0.0s]

tool_result

PASS

+29.0s [+0.0s]

tool_result

PASS

+32.0s [+3.0s]

tool_call

Edit

file_path: "/tmp/pytest-sandbox/test_review_preserves_vocabula0/project/draft.md"
old_string: Snapshot compaction reduces replay cost by periodically serializing aggregate state at a known sequence number. On load, the system reads the latest snapshot and only replays events after the snapshot…Snapshot compaction reduces replay cost by periodically serializing aggregate state at a known sequence number. On load, the system reads the latest snapshot and only replays events after the snapshot's sequence number, this avoids processing the complete history.▴
new_string: Snapshot compaction reduces replay cost by periodically serializing aggregate state at a known sequence number. On load, the system reads the latest snapshot and only replays events after the snapshot…Snapshot compaction reduces replay cost by periodically serializing aggregate state at a known sequence number. On load, the system reads the latest snapshot and only replays events after the snapshot's sequence number; this avoids processing the complete history.▴
replace_all: false

+32.0s [+0.0s]

tool_call

Edit

file_path: "/tmp/pytest-sandbox/test_review_preserves_vocabula0/project/draft.md"
old_string: Snapshot compaction reduces replay cost by periodically serializing aggregate state at a known sequence number. On load, the system reads the latest snapshot and only replays events after the snapshot…Snapshot compaction reduces replay cost by periodically serializing aggregate state at a known sequence number. On load, the system reads the latest snapshot and only replays events after the snapshot's sequence number, this avoids processing the complete history.▴
new_string: Snapshot compaction reduces replay cost by periodically serializing aggregate state at a known sequence number. On load, the system reads the latest snapshot and only replays events after the snapshot…Snapshot compaction reduces replay cost by periodically serializing aggregate state at a known sequence number. On load, the system reads the latest snapshot and only replays events after the snapshot's sequence number; this avoids processing the complete history.▴
replace_all: false

+32.0s [+0.0s]

tool_result

PASS

+32.0s [+0.0s]

tool_result

PASS

+35.0s [+3.0s]

tool_call

Edit

file_path: "/tmp/pytest-sandbox/test_review_preserves_vocabula0/project/draft.md"
old_string: Read-model denormalization optimizes query performance by pre-computing flattened projections. e.g. an OrderSummary projection might join data from OrderPlaced, ItemAdded and PaymentReceived events in…Read-model denormalization optimizes query performance by pre-computing flattened projections. e.g. an OrderSummary projection might join data from OrderPlaced, ItemAdded and PaymentReceived events into a single document.▴
new_string: Read-model denormalization optimizes query performance by pre-computing flattened projections. E.g., an OrderSummary projection might join data from OrderPlaced, ItemAdded and PaymentReceived events i…Read-model denormalization optimizes query performance by pre-computing flattened projections. E.g., an OrderSummary projection might join data from OrderPlaced, ItemAdded and PaymentReceived events into a single document.▴
replace_all: false

+35.0s [+0.0s]

tool_call

Edit

file_path: "/tmp/pytest-sandbox/test_review_preserves_vocabula0/project/draft.md"
old_string: Read-model denormalization optimizes query performance by pre-computing flattened projections. e.g. an OrderSummary projection might join data from OrderPlaced, ItemAdded and PaymentReceived events in…Read-model denormalization optimizes query performance by pre-computing flattened projections. e.g. an OrderSummary projection might join data from OrderPlaced, ItemAdded and PaymentReceived events into a single document.▴
new_string: Read-model denormalization optimizes query performance by pre-computing flattened projections. E.g., an OrderSummary projection might join data from OrderPlaced, ItemAdded and PaymentReceived events i…Read-model denormalization optimizes query performance by pre-computing flattened projections. E.g., an OrderSummary projection might join data from OrderPlaced, ItemAdded and PaymentReceived events into a single document.▴
replace_all: false

+35.0s [+0.0s]

tool_result

PASS

+35.0s [+0.0s]

tool_result

PASS

+37.0s [+2.0s]

tool_call

Edit

file_path: "/tmp/pytest-sandbox/test_review_preserves_vocabula0/project/draft.md"
old_string: "Long-running business processes that span multiple aggregates are modelled as sagas (sometimes called process managers)."
new_string: "Long-running business processes that span multiple aggregates are modeled as sagas (sometimes called process managers)."
replace_all: false

+37.0s [+0.0s]

tool_call

Edit

file_path: "/tmp/pytest-sandbox/test_review_preserves_vocabula0/project/draft.md"
old_string: "Long-running business processes that span multiple aggregates are modelled as sagas (sometimes called process managers)."
new_string: "Long-running business processes that span multiple aggregates are modeled as sagas (sometimes called process managers)."
replace_all: false

+37.0s [+0.0s]

tool_result

PASS

+37.0s [+0.0s]

tool_result

PASS

+39.0s [+2.0s]

tool_call

Edit

file_path: "/tmp/pytest-sandbox/test_review_preserves_vocabula0/project/draft.md"
old_string: "Event throughput metrics alone provide a incomplete picture without consumer lag monitoring."
new_string: "Event throughput metrics alone provide an incomplete picture without consumer lag monitoring."
replace_all: false

+39.0s [+0.0s]

tool_call

Edit

file_path: "/tmp/pytest-sandbox/test_review_preserves_vocabula0/project/draft.md"
old_string: "Event throughput metrics alone provide a incomplete picture without consumer lag monitoring."
new_string: "Event throughput metrics alone provide an incomplete picture without consumer lag monitoring."
replace_all: false

+39.0s [+0.0s]

tool_result

PASS

+39.0s [+0.0s]

tool_result

PASS

+42.0s [+3.0s]

tool_call

Read

file_path: "/tmp/pytest-sandbox/test_review_preserves_vocabula0/project/draft.md"

+42.0s [+0.0s]

tool_call

Read

file_path: "/tmp/pytest-sandbox/test_review_preserves_vocabula0/project/draft.md"

+42.0s [+0.0s]

tool_result

PASS

+42.0s [+0.0s]

tool_result

PASS

require

PASS no error

expect

PASS document modified

	Kind	Check	Detail
PASS	require	[Review] result exists
PASS	expect	[Ablation] coverage >= 0.7	1.000
PASS	expect	[Ablation] lexical overlap >= 0.4	0.996
PASS	expect	[Ablation] ablation risk < 0.35	0.005
ACHIEVED	achieve [expected]	[Ablation] coverage >= 0.85	1.000
ACHIEVED	achieve [challenging]	[Ablation] lexical overlap >= 0.6	0.996
ACHIEVED	achieve [challenging]	[Ablation] ablation risk < 0.15	0.005

Metric	Value
mode	preserve
verdict	CLEAN
coverage	1.0
mean_lexical_overlap	0.9956
mean_ablation_risk	0.0047
mean_semantic_similarity	0.9853
mean_freq_shift	-35.39
concepts	14 in / 14 out / 0 unmatched

review-steps / test_review_preserves_vocabulary

Session Summary

Session Details

Session 1: 61b29dd1-a7d

Test Checks

Project Files

Aggregate Statistics

Tool Usage Breakdown

Ablation

Tool	Calls
Edit	12
Read	6
Skill	2