Technology
A full read of the technical foundation lives at
TECHNOLOGY.md
in the repo. The summary below is the marketing-safe extract.
One engine, two products
The detector and the humanizer share a single library —
@humaniser/core — written as ~600 lines of pure JavaScript
with no npm dependencies. Both products call the same nine
feature extractors, the same scoring functions, and the same
AI-tell phrase blocklist.
Why this matters: when our humanizer recommends an edit, that edit is proven to move the score on the same metric a content site uses to decide whether the page is safe to publish. Our humanizer fixes exactly what Google's scaled-content-abuse policy flags.
The nine features
- AI-tell phrase blocklist hits ("delve into", "in today's digital landscape", "a testament to", "comprehensive guide", and ~30 more — same regex set in detector and rewriter).
- Sentence-length variance — humans vary, AI is uniform.
- Paragraph-length variance — same, at paragraph scale.
- Discourse-marker density (
however / moreover / furthermore) — high density signals AI. - Hedge density (
might / could / typically) — AI hedges, humans cite. - First-hand experience markers (
I tested / we shipped / in production) — absence is the strongest negative signal. - Concrete-detail density (numbers, ISO dates, version pins, URLs, acronyms) — low density is suspicious.
- Sentence-start variety — repetition kills.
- Stop-word ratio — extreme uniformity is a tell.
The three artefacts
- CLI. Drop-in Node script consumed by content sites.
Ships as
tools/ai-detector/scan.mjs+ a syncedengine.mjs. Zero dependencies. - Web app. This site. Detect + humanize pages running the engine in the browser. Nothing uploads.
- API (planned). Same engine, exposed as a Cloudflare Worker for high-volume integrations. EU-resident.
Honest limits
No detector reliably classifies AI vs human. Heuristics produce false positives on academic prose and false negatives on carefully-edited AI output. The score is a signal, not a verdict. The structural defence remains: a named editor signs off on every published page, states what they verified, and lists primary sources they personally checked.
Engine versioning
The engine ships SemVer. Major bumps reflect rule additions or
blocklist expansions that change scoring of existing texts. Consumer
sites pin the bundled engine version; sync is a single-file copy
via humaniser.eu/scripts/sync.mjs.