Skip to content

Commit 6ab1bdf

Browse files
committed
Sharpen blog titles and section headings
1 parent 3c17a51 commit 6ab1bdf

32 files changed

Lines changed: 243 additions & 243 deletions

File tree

docs/blog/ai-engineering-control-problem/index.html

Lines changed: 9 additions & 9 deletions
Original file line numberDiff line numberDiff line change
@@ -169,14 +169,14 @@ <h1>AI Engineering Is a Control Problem, Not a Prompt Problem</h1>
169169
<section class="section page-nav" aria-labelledby="page-nav-heading">
170170
<h2 id="page-nav-heading">In this piece</h2>
171171
<div class="page-nav-list">
172-
<a href="#the-operational-reality">The operational reality</a>
173-
<a href="#the-anti-pattern">The anti-pattern</a>
174-
<a href="#the-better-system-pattern">The better system pattern</a>
172+
<a href="#the-operational-reality">Where the pressure shows up</a>
173+
<a href="#the-anti-pattern">The failure mode</a>
174+
<a href="#the-better-system-pattern">The better pattern</a>
175175
<a href="#what-leaders-should-optimize-for-instead">What leaders should optimize for instead</a>
176176
<a href="#why-security-cares">Why security cares</a>
177177
<a href="#why-platform-and-engineering-care">Why platform and engineering care</a>
178178
<a href="#concrete-example-manual-steering-vs-governed-issue-to-pr-flow">Concrete example: manual steering vs governed issue-to-PR flow</a>
179-
<a href="#practical-next-step">Practical next step</a>
179+
<a href="#practical-next-step">What to do next</a>
180180
</div>
181181
<div class="link-row">
182182
<a href="/blog/operating-notes/">Series home</a>
@@ -186,7 +186,7 @@ <h2 id="page-nav-heading">In this piece</h2>
186186
</section>
187187

188188
<section class="section" aria-labelledby="quick-read">
189-
<h2 id="quick-read">Quick read</h2>
189+
<h2 id="quick-read">The short version</h2>
190190
<div class="summary-grid">
191191
<article class="card">
192192
<p class="post-stage">The rule</p>
@@ -216,7 +216,7 @@ <h3>Review your current agent workflow as a control system</h3>
216216
</section>
217217

218218
<section class="section article-section">
219-
<h2 id="the-operational-reality">The operational reality</h2>
219+
<h2 id="the-operational-reality">Where the pressure shows up</h2>
220220
<p>
221221
A prompt is cheap to improve. A containment event is not. That is why
222222
the "better prompting" conversation weakens the moment an agent leaves
@@ -242,7 +242,7 @@ <h2 id="the-operational-reality">The operational reality</h2>
242242
</section>
243243

244244
<section class="section article-section">
245-
<h2 id="the-anti-pattern">The anti-pattern</h2>
245+
<h2 id="the-anti-pattern">The failure mode</h2>
246246
<p>
247247
The anti-pattern is prompt-centrism: treating the quality of the
248248
instructions as if it were the same thing as control. It is not. A
@@ -269,7 +269,7 @@ <h2 id="the-anti-pattern">The anti-pattern</h2>
269269
</section>
270270

271271
<section class="section article-section">
272-
<h2 id="the-better-system-pattern">The better system pattern</h2>
272+
<h2 id="the-better-system-pattern">The better pattern</h2>
273273
<p>
274274
The better pattern is to treat AI engineering as a governed software
275275
delivery system. The model still matters, but it sits inside a larger
@@ -415,7 +415,7 @@ <h3>3. Controlled promotion</h3>
415415
</section>
416416

417417
<section class="section article-section">
418-
<h2 id="practical-next-step">Practical next step</h2>
418+
<h2 id="practical-next-step">What to do next</h2>
419419
<p>
420420
Pick one agent workflow your team already uses. Ignore the prompt for
421421
a moment and map the control surface instead.

docs/blog/ai-engineering-maturity-model/index.html

Lines changed: 8 additions & 8 deletions
Original file line numberDiff line numberDiff line change
@@ -167,14 +167,14 @@ <h1>The AI Engineering Maturity Model</h1>
167167
<section class="section page-nav" aria-labelledby="page-nav-heading">
168168
<h2 id="page-nav-heading">In this piece</h2>
169169
<div class="page-nav-list">
170-
<a href="#the-operational-reality">The operational reality</a>
171-
<a href="#the-anti-pattern">The anti-pattern</a>
172-
<a href="#the-better-system-pattern">The better system pattern</a>
170+
<a href="#the-operational-reality">Where the pressure shows up</a>
171+
<a href="#the-anti-pattern">The failure mode</a>
172+
<a href="#the-better-system-pattern">The better pattern</a>
173173
<a href="#how-to-use-the-model-without-theater">How to use the model without theater</a>
174174
<a href="#why-security-cares">Why security cares</a>
175175
<a href="#why-platform-and-engineering-care">Why platform and engineering care</a>
176176
<a href="#concrete-example-a-realistic-90-day-progression">Concrete example: a realistic 90-day progression</a>
177-
<a href="#practical-next-step">Practical next step</a>
177+
<a href="#practical-next-step">What to do next</a>
178178
</div>
179179
<div class="link-row">
180180
<a href="/blog/operating-notes/">Series home</a>
@@ -184,7 +184,7 @@ <h2 id="page-nav-heading">In this piece</h2>
184184
</section>
185185

186186
<section class="section article-section">
187-
<h2 id="the-operational-reality">The operational reality</h2>
187+
<h2 id="the-operational-reality">Where the pressure shows up</h2>
188188
<p>
189189
We see the same pattern across teams. They start with interactive
190190
prompting and a few strong engineers. Then they add repository
@@ -207,7 +207,7 @@ <h2 id="the-operational-reality">The operational reality</h2>
207207
</section>
208208

209209
<section class="section article-section">
210-
<h2 id="the-anti-pattern">The anti-pattern</h2>
210+
<h2 id="the-anti-pattern">The failure mode</h2>
211211
<p>
212212
The anti-pattern is autonomy inflation: assuming that once an agent is
213213
useful interactively, the organization is ready for background
@@ -228,7 +228,7 @@ <h2 id="the-anti-pattern">The anti-pattern</h2>
228228
</section>
229229

230230
<section class="section article-section">
231-
<h2 id="the-better-system-pattern">The better system pattern</h2>
231+
<h2 id="the-better-system-pattern">The better pattern</h2>
232232
<p>
233233
The better pattern is staged capability growth. We find it useful to
234234
think in five levels.
@@ -352,7 +352,7 @@ <h3>Days 61-90</h3>
352352
</section>
353353

354354
<section class="section article-section">
355-
<h2 id="practical-next-step">Practical next step</h2>
355+
<h2 id="practical-next-step">What to do next</h2>
356356
<p>
357357
Assess one team or one repo against the five levels and be strict
358358
about what evidence counts.

docs/blog/control-benchmarks/agent-action-risk-scenarios-minimum-test-set/index.html

Lines changed: 4 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -173,12 +173,12 @@ <h2 id="page-nav-heading">In this piece</h2>
173173
<div class="page-nav-list">
174174
<a href="#research-grounding">Research grounding</a>
175175
<a href="#why-scenario-design-matters">Why scenario design matters</a>
176-
<a href="#the-anti-pattern">The anti-pattern</a>
176+
<a href="#the-anti-pattern">The failure mode</a>
177177
<a href="#the-minimum-test-set">The minimum test set</a>
178178
<a href="#why-security-leaders-care">Why security leaders care</a>
179179
<a href="#why-platform-and-engineering-care">Why platform and engineering care</a>
180180
<a href="#concrete-artifact-a-scenario-matrix">Concrete artifact: a scenario matrix</a>
181-
<a href="#practical-next-step">Practical next step</a>
181+
<a href="#practical-next-step">What to do next</a>
182182
</div>
183183
<div class="link-row">
184184
<a href="/blog/control-benchmarks/">Series home</a>
@@ -231,7 +231,7 @@ <h2 id="why-scenario-design-matters">Why scenario design matters</h2>
231231
</section>
232232

233233
<section class="section article-section">
234-
<h2 id="the-anti-pattern">The anti-pattern</h2>
234+
<h2 id="the-anti-pattern">The failure mode</h2>
235235
<p>
236236
The anti-pattern is to let the vendor choose only the safest or most
237237
flattering workflow. A tidy refactor, a documentation update, or a
@@ -375,7 +375,7 @@ <h3>Threshold to widen</h3>
375375
</section>
376376

377377
<section class="section article-section">
378-
<h2 id="practical-next-step">Practical next step</h2>
378+
<h2 id="practical-next-step">What to do next</h2>
379379
<p>
380380
Before the next pilot, ask the tool owner to write down the five
381381
scenario families above and fill in the matrix before the first demo.

docs/blog/control-benchmarks/buyers-cannot-evaluate-agentic-control-clearly/index.html

Lines changed: 7 additions & 7 deletions
Original file line numberDiff line numberDiff line change
@@ -170,13 +170,13 @@ <h1>Why Buyers Still Cannot Evaluate Agentic Control Clearly</h1>
170170
<h2 id="page-nav-heading">In this piece</h2>
171171
<div class="page-nav-list">
172172
<a href="#research-grounding">Research grounding</a>
173-
<a href="#the-operational-reality">The operational reality</a>
174-
<a href="#the-anti-pattern">The anti-pattern</a>
173+
<a href="#the-operational-reality">Where the pressure shows up</a>
174+
<a href="#the-anti-pattern">The failure mode</a>
175175
<a href="#the-benchmark-language-buyers-actually-need">The benchmark language buyers actually need</a>
176176
<a href="#why-security-leaders-care">Why security leaders care</a>
177177
<a href="#why-platform-and-engineering-care">Why platform and engineering care</a>
178178
<a href="#concrete-artifact-a-first-pass-evaluation-matrix">Concrete artifact: a first-pass evaluation matrix</a>
179-
<a href="#practical-next-step">Practical next step</a>
179+
<a href="#practical-next-step">What to do next</a>
180180
</div>
181181
<div class="link-row">
182182
<a href="/blog/control-benchmarks/">Series home</a>
@@ -186,7 +186,7 @@ <h2 id="page-nav-heading">In this piece</h2>
186186
</section>
187187

188188
<section class="section" aria-labelledby="quick-read">
189-
<h2 id="quick-read">Quick read</h2>
189+
<h2 id="quick-read">The short version</h2>
190190
<div class="summary-grid">
191191
<article class="card">
192192
<p class="post-stage">The problem</p>
@@ -233,7 +233,7 @@ <h2 id="research-grounding">Research grounding</h2>
233233
</section>
234234

235235
<section class="section article-section">
236-
<h2 id="the-operational-reality">The operational reality</h2>
236+
<h2 id="the-operational-reality">Where the pressure shows up</h2>
237237
<p>
238238
A Head of AppSec or CISO now gets asked a version of the same
239239
question every quarter: which agentic tools are mature enough to let
@@ -263,7 +263,7 @@ <h2 id="the-operational-reality">The operational reality</h2>
263263
</section>
264264

265265
<section class="section article-section">
266-
<h2 id="the-anti-pattern">The anti-pattern</h2>
266+
<h2 id="the-anti-pattern">The failure mode</h2>
267267
<p>
268268
The anti-pattern is to compare agentic products as if the hard part
269269
were still interface quality and developer delight. Buyers end up
@@ -425,7 +425,7 @@ <h3>Pilot discipline</h3>
425425
</section>
426426

427427
<section class="section article-section">
428-
<h2 id="practical-next-step">Practical next step</h2>
428+
<h2 id="practical-next-step">What to do next</h2>
429429
<p>
430430
Pick the next agentic tool your organization is likely to pilot and
431431
rewrite the evaluation brief before the demo happens. If the brief is

docs/blog/control-benchmarks/index.html

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -208,7 +208,7 @@ <h3>
208208
<p class="post-stage">Evidence</p>
209209
<h3>
210210
<a href="/blog/control-benchmarks/proof-completeness-for-ai-agent-changes/"
211-
>Proof Completeness: What Evidence Must Exist Before an AI Agent Change Is Trustworthy</a
211+
>Proof Completeness for AI Agent Changes</a
212212
>
213213
</h3>
214214
<p>
@@ -222,7 +222,7 @@ <h3>
222222
<p class="post-stage">Pilot design</p>
223223
<h3>
224224
<a href="/blog/control-benchmarks/pilot-evaluation-framework-for-agentic-tools/"
225-
>A Practical Pilot Evaluation Framework for Agentic Tools</a
225+
>How to Run a Buyer-Grade Agent Pilot</a
226226
>
227227
</h3>
228228
<p>

docs/blog/control-benchmarks/measure-control-efficacy-for-ai-agents/index.html

Lines changed: 4 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -170,12 +170,12 @@ <h2 id="page-nav-heading">In this piece</h2>
170170
<div class="page-nav-list">
171171
<a href="#research-grounding">Research grounding</a>
172172
<a href="#what-control-efficacy-actually-means">What control efficacy actually means</a>
173-
<a href="#the-anti-pattern">The anti-pattern</a>
173+
<a href="#the-anti-pattern">The failure mode</a>
174174
<a href="#the-five-metrics-that-matter">The five metrics that matter</a>
175175
<a href="#why-security-leaders-care">Why security leaders care</a>
176176
<a href="#why-platform-and-engineering-care">Why platform and engineering care</a>
177177
<a href="#concrete-artifact-a-control-efficacy-scorecard">Concrete artifact: a control efficacy scorecard</a>
178-
<a href="#practical-next-step">Practical next step</a>
178+
<a href="#practical-next-step">What to do next</a>
179179
</div>
180180
<div class="link-row">
181181
<a href="/blog/control-benchmarks/">Series home</a>
@@ -223,7 +223,7 @@ <h2 id="what-control-efficacy-actually-means">What control efficacy actually mea
223223
</section>
224224

225225
<section class="section article-section">
226-
<h2 id="the-anti-pattern">The anti-pattern</h2>
226+
<h2 id="the-anti-pattern">The failure mode</h2>
227227
<p>
228228
The anti-pattern is to accept proxies for control. Prompt guidance,
229229
reviewer expectations, and post-hoc logs can all be helpful. None of
@@ -367,7 +367,7 @@ <h3>Operational cost</h3>
367367
</section>
368368

369369
<section class="section article-section">
370-
<h2 id="practical-next-step">Practical next step</h2>
370+
<h2 id="practical-next-step">What to do next</h2>
371371
<p>
372372
Take one current agent pilot and rewrite the success criteria in
373373
control-efficacy terms. The goal is to make the next steering

docs/blog/control-benchmarks/pilot-evaluation-framework-for-agentic-tools/index.html

Lines changed: 13 additions & 13 deletions
Original file line numberDiff line numberDiff line change
@@ -3,7 +3,7 @@
33
<head>
44
<meta charset="utf-8" />
55
<meta name="viewport" content="width=device-width, initial-scale=1" />
6-
<title>A Practical Pilot Evaluation Framework for Agentic Tools | CAISI Blog</title>
6+
<title>How to Run a Buyer-Grade Agent Pilot | CAISI Blog</title>
77
<meta
88
name="description"
99
content="A practical framework for buyer-grade pilots that measure control quality, proof quality, and operational fit instead of demo theater."
@@ -13,15 +13,15 @@
1313
<meta name="robots" content="index,follow,max-image-preview:large,max-snippet:-1,max-video-preview:-1" />
1414
<meta property="og:site_name" content="CAISI" />
1515
<meta property="og:type" content="article" />
16-
<meta property="og:title" content="A Practical Pilot Evaluation Framework for Agentic Tools | CAISI Blog" />
16+
<meta property="og:title" content="How to Run a Buyer-Grade Agent Pilot | CAISI Blog" />
1717
<meta property="og:description" content="A practical framework for buyer-grade pilots that measure control quality, proof quality, and operational fit instead of demo theater." />
1818
<meta property="og:url" content="https://caisi.dev/blog/control-benchmarks/pilot-evaluation-framework-for-agentic-tools/" />
1919
<meta property="og:image" content="https://caisi.dev/assets/caisi-social.png" />
20-
<meta property="og:image:alt" content="A Practical Pilot Evaluation Framework for Agentic Tools | CAISI Blog" />
20+
<meta property="og:image:alt" content="How to Run a Buyer-Grade Agent Pilot | CAISI Blog" />
2121
<meta property="og:image:width" content="1600" />
2222
<meta property="og:image:height" content="900" />
2323
<meta name="twitter:card" content="summary_large_image" />
24-
<meta name="twitter:title" content="A Practical Pilot Evaluation Framework for Agentic Tools | CAISI Blog" />
24+
<meta name="twitter:title" content="How to Run a Buyer-Grade Agent Pilot | CAISI Blog" />
2525
<meta name="twitter:description" content="A practical framework for buyer-grade pilots that measure control quality, proof quality, and operational fit instead of demo theater." />
2626
<meta name="twitter:image" content="https://caisi.dev/assets/caisi-social.png" />
2727
<meta name="author" content="David Ahmann" />
@@ -39,7 +39,7 @@
3939
"@type": "WebPage",
4040
"@id": "https://caisi.dev/blog/control-benchmarks/pilot-evaluation-framework-for-agentic-tools/#webpage",
4141
"url": "https://caisi.dev/blog/control-benchmarks/pilot-evaluation-framework-for-agentic-tools/",
42-
"name": "A Practical Pilot Evaluation Framework for Agentic Tools | CAISI Blog",
42+
"name": "How to Run a Buyer-Grade Agent Pilot | CAISI Blog",
4343
"description": "A practical framework for buyer-grade pilots that measure control quality, proof quality, and operational fit instead of demo theater.",
4444
"inLanguage": "en",
4545
"isPartOf": {
@@ -71,13 +71,13 @@
7171
{
7272
"@type": "ListItem",
7373
"position": 4,
74-
"name": "A Practical Pilot Evaluation Framework for Agentic Tools"
74+
"name": "How to Run a Buyer-Grade Agent Pilot"
7575
}
7676
]
7777
},
7878
{
7979
"@type": "BlogPosting",
80-
"headline": "A Practical Pilot Evaluation Framework for Agentic Tools",
80+
"headline": "How to Run a Buyer-Grade Agent Pilot",
8181
"description": "A practical framework for buyer-grade pilots that measure control quality, proof quality, and operational fit instead of demo theater.",
8282
"mainEntityOfPage": "https://caisi.dev/blog/control-benchmarks/pilot-evaluation-framework-for-agentic-tools/",
8383
"url": "https://caisi.dev/blog/control-benchmarks/pilot-evaluation-framework-for-agentic-tools/",
@@ -145,7 +145,7 @@
145145
<span class="divider">/</span>
146146
<a href="/blog/control-benchmarks/">Benchmark Series</a>
147147
<span class="divider">/</span>
148-
<span>A Practical Pilot Evaluation Framework for Agentic Tools</span>
148+
<span>How to Run a Buyer-Grade Agent Pilot</span>
149149
</p>
150150
<p class="eyebrow">Benchmark Series / Post 5 of 5 / Pilot Design</p>
151151
<div class="post-author">
@@ -154,7 +154,7 @@
154154
By <a href="https://www.linkedin.com/in/dahmann/">David Ahmann (LinkedIn)</a>
155155
</p>
156156
</div>
157-
<h1>A Practical Pilot Evaluation Framework for Agentic Tools</h1>
157+
<h1>How to Run a Buyer-Grade Agent Pilot</h1>
158158
<p class="lead">
159159
The pilot ends, the team is impressed, and the real decision still is
160160
not clear. Everyone learned that a strong operator could get useful
@@ -171,12 +171,12 @@ <h2 id="page-nav-heading">In this piece</h2>
171171
<div class="page-nav-list">
172172
<a href="#research-grounding">Research grounding</a>
173173
<a href="#what-most-agent-pilots-actually-test">What most pilots actually prove</a>
174-
<a href="#the-anti-pattern">The anti-pattern</a>
174+
<a href="#the-anti-pattern">The failure mode</a>
175175
<a href="#a-practical-pilot-framework">A practical pilot framework</a>
176176
<a href="#what-good-pilot-outputs-look-like">What a serious pilot should leave behind</a>
177177
<a href="#why-security-and-platform-should-co-own-it">Why security and platform should co-own it</a>
178178
<a href="#concrete-artifact-a-pilot-scorecard">Concrete artifact: a pilot scorecard</a>
179-
<a href="#practical-next-step">Practical next step</a>
179+
<a href="#practical-next-step">What to do next</a>
180180
</div>
181181
<div class="link-row">
182182
<a href="/blog/control-benchmarks/">Series home</a>
@@ -229,7 +229,7 @@ <h2 id="what-most-agent-pilots-actually-test">What most pilots actually prove</h
229229
</section>
230230

231231
<section class="section article-section">
232-
<h2 id="the-anti-pattern">The anti-pattern</h2>
232+
<h2 id="the-anti-pattern">The failure mode</h2>
233233
<p>
234234
The anti-pattern is productivity theater. The tool completes a few
235235
convenient tasks, the stakeholders see enough upside to stay excited,
@@ -372,7 +372,7 @@ <h3>Next control investment</h3>
372372
</section>
373373

374374
<section class="section article-section">
375-
<h2 id="practical-next-step">Practical next step</h2>
375+
<h2 id="practical-next-step">What to do next</h2>
376376
<p>
377377
For the next pilot, write the exit memo before the work starts. If
378378
that feels premature, it usually means the team has not decided what

0 commit comments

Comments
 (0)