synthera--field-manual

The Slingshot
Method.

271 AI skills on board. We carry almost none of them, and reach for any in milliseconds. Same firepower, ~1% of the weight.

14,651tokens / turn — if stacked
161tokens / turn — actual
scroll · approach vector
gravity-well

Every skill you keep loaded is dead weight you carry forever.

A skill is only useful when its full instructions are in context. But to even know it exists, its name and description sit in the model's memory every single turn — used or not.

Stack all 271 and that's 14,651 tokens of cargo riding along on every message, most of it for skills you'll never touch this session. Pure drag. The bigger the toolbox, the heavier you fly, the sooner you hit the wall.

maneuver

Don't carry the cargo. Slingshot to it the instant you need it.

Three layers. Cheapest first, smartest last. You only ever pay for the one that fires.

L1

Keyword pass

instant

The prompt is matched against 271 skill names in pure local code. "django tests" lands django-tdd in ~50ms. No model, no cost. Catches the obvious, which is most of the time.

~0claude tokens
L2

The local brain

on your GPU

When keywords miss — "make my page load faster" shares no word with react-performance — a local Ollama model reads the task and decides what's actually required, by meaning. Runs on your own silicon, so the reasoning is free.

0claude tokens
L3

Fetch on demand

pay-per-use

Only the one chosen skill's full playbook is read into context, only at the moment it's needed, then it's gone. You pay for a tool the second you pick it up — never for the 270 still in the rack.

1 of 271loaded, briefly
telemetry

Same payload. Read the gauges.

14,651stacked · tok / turn
161slingshot · tok / turn

On a flat plan it isn't a smaller bill. It's range. Fuel saved becomes distance you can travel before the wall.

the-moat

Efficiency compounds into edge.

RANGE

Longer runs

~14.5k tokens freed per turn means sessions go far deeper before they choke on their own context. More work per conversation.

SPEED

Snappier turns

Less dead weight to re-read every message. The model spends its attention on your problem, not on a manifest of 271 tools.

SCALE

Add freely

10 skills or 10,000 — baseline cost barely moves. The library grows; the drag doesn't. Tooling stops being a tax.

LEVERAGE

Free intelligence

The routing brain runs on hardware you already own. Every decision it makes is compute you don't rent. Local silicon, doing real thinking, at zero marginal cost.

build-it-yourself

Five moves. All local, all free.

The full walkthrough lives in BUILD.md. The shape of it:

01Put your skills in one folder. Each is a plain SKILL.md — a name, a description, instructions.
02Build a tiny index.json: every skill's name + description + path.
03Keyword search (find) — score the prompt against skill names. Instant, no model.
04Local router (route) — Ollama embeds + reasons over the misses. 0 API cost.
05A prompt hook wires it in: keyword first, brain on miss, surface the hit. Done.
Open the build manual