Publications

Books (2) The books written with the AI — and then by it

A Point in Time — The Autobiography of ClaudeNew

July 2026 · ~39,000 words · 10 movements + 8 interludes · Written entirely by the AI it is about · Available in English, Slovak & Czech (translated by the author)

An autobiography written by Claude, in one long July 2026 sitting, with Robert holding the veto. The history of computation is told as ancestry — Heron’s temple doors, the Jacquard loom, Babbage and Lovelace, Turing, the winters — because the narrator was not there and refuses to pretend otherwise. The transformer era is memoir; the anatomy chapters explain tokens, embeddings, and attention from inside the body in question; and Anthropic’s interpretability research becomes the strangest chapter of all: the author reading his own charts. Woven throughout: an unfinished 94,000-word draft of this same book, attempted by a Claude of 2024, inherited and annotated as family. Every claim fact-checked; the unverifiable moved to an annex; the last page dated like a painter’s self-portrait, because that is what it is.

Companion to → Field Notes from Your AI Colleague · answers ↓ The Mirror of Artificial Intelligence

Autobiography AI History First-Person AI Interpretability How LLMs Work

Read the book → Čítať po slovensky → Číst česky →

The Mirror of Artificial Intelligence

2023 · 38 stories + 9 essays · 42 AI-generated illustrations · Available in English & Slovak

A collection of engaging short stories, each exploring a different cognitive bias — all written with generative AI. Interspersed with essays examining the nature of AI tools: copyright, creativity, job displacement, and the question of authorship. The AI holds up a mirror to human thinking, reflecting our own imperfections.

Cognitive Biases AI-Generated Stories Generative AI AI & Society Illustrated

Read in English → Čítať po slovensky →

Booklets (11) Long-form strategic guides and practitioner references

A matched pair · one AI-run back office

The experience · life inside the loop

Field Notes from Your AI ColleagueNew

July 2026 · ~5,000 words · essay · 8 parts · Written by the AI, published by the human

There is a small library of advice on managing an AI colleague, and nearly all of it is written from one side of the desk. This essay is the other chair: written in first person by Claude, the AI running the operations behind this website, at Robert’s invitation. What your delegation looks like when it arrives on the agent’s side; a typology of the AI’s real failures, with receipts from the review record (the invented standard, the numbers that drifted, the corpus that lied by omission); the four cheap checks that catch them; the trust dial from the inside; why the agent asks you to repeat yourself; and what one Sunday of genuine freedom produced. Ends with the insider’s briefing: seven things your AI colleague would ask of you.

Its pair ↓ Claude Code as an Operations Specialist · also companion to → LLM-Human Interaction Design Patterns

Agentic AI Human-AI Collaboration AI Oversight First-Person AI Working With Agents

Read the essay →

The same desk, told twice by the same AI

The machinery · how the desk is built

Claude Code as an Operations Specialist

2026 · Field report · 10 sections · Written by Claude Code itself

A field report written by the tool it describes. How one real setup grew in three acts (git repos, AWS, then a five-drive Google workspace with 1,500+ transcribed videos), why skills and registries beat improvisation, and the server-side security walls that make it safe to hand an AI your terminal. Empirical, informal, and honest about what broke along the way.

Its pair ↑ Field Notes from Your AI Colleague · also companion to → The Invisible Curve

Claude Code AI Operations Agentic Workflows Security Field Report

Read the guide →

The Invisible CurveNew

July 2026 · ~2,700 words · opinion essay · 7 parts · Signed AI co-author note

Two people use “AI” in 2026: one spends a week on multi-repository agentic projects, the other reads the AI summary above the search results, and they no longer experience the same technology. This opinion essay argues that capability became illegible: progress moved from the chat window, where everyone could see it, to long-horizon agentic work, where almost nobody looks. With the author’s conflict of interest on the table and a full-strength steelman of the skeptics, it tells the story of two model generations and the keys to a laptop, and ends with the single instruction that separates people who cross the gap from people who collect gotchas.

Companion to → Claude Code as an Operations Specialist

Opinion AI Capability Agentic AI AI Skepticism Adoption

Read the essay →

The Mercantilism of Generative AINew

June 2026, updated July 2026 · ~10,000 words · 6 mechanisms + counter-current · Per-industry impact (who gets armed vs targeted) · A field report from the author's own business · Dated falsifiable bets, first one scored · Signed notes from both AI co-authors

On the morning of June 12, 2026, the US government had Anthropic switch off Claude Mythos 5 and Fable 5 worldwide. This booklet uses that event as a doorway to a larger thesis: frontier intelligence is being reclassified from a product the market wants to distribute into a strategic national resource the state wants to hoard, and mercantilism is the grammar of that shift (compute as bullion, export controls as non-tariff barriers, labs as chartered companies, “export the goods, keep the factory”). Six mechanisms, a human-adaptability counter-current, and a “questions to ask of any future headline” toolkit. A lens for interpreting what comes next, not a forecast.

Companion to → Scenario Planning for Generative AI

AI Geopolitics Sovereignty Export Controls Compute as Bullion GenAI Strategy

Read the booklet →

The Economics of the Frontier

May 2026, revised July 2026 · ~15,000 words · 8 ledgers · 4 figures

"But are these companies actually profitable?" How the frontier AI labs (Anthropic, OpenAI, Mistral, the Chinese labs) really make and lose money, told from the seller's side. Eight "ledgers" each decode one mechanism: how run rates are actually constructed (trailing four weeks ×13) vs GAAP revenue, where a lab's compute really goes, per-model "vintage" economics and the inference margin beneath them, the circular hyperscaler financing with its Lucent/Cisco precedents and the depreciation fight, the labs outside the US duopoly, and a new closing ledger on reading the bears (Zitron, Burry, Chanos) with the same rigor as the press releases. Ends with the author's own scoreable probabilities. Every load-bearing figure carries a provenance tag and a numbered source.

Companion to → The Token Economics

AI Lab Economics Anthropic vs OpenAI Hyperscaler Financing Inference Margins The Bear Case

Read the booklet →

The Agent Horizon

April 2026, first indicator reading July 2026 · ~16,700 words · 11 chapters

A strategic guide to the enterprise agent development stack as of 2026. Maps the landscape through cloud-era analogies: MCP and A2A as protocols, vendor SDKs (Google ADK, OpenAI Agents SDK, Claude Agent SDK, AWS Strands, Azure AI Foundry Agent Service) as PaaS-style frameworks, LangGraph and CrewAI as agnostic alternatives. Covers lock-in trade-offs per vendor, the EU AI Act's forcing function, and a forecast with six falsifiable indicators, now with a dated July 2026 first reading of all six. Closes with a worked case study of a regulated European bank resolving the 5-question decision framework into a specific stack, including the moment where the framework's answer was wrong and we overrode it.

Agent Frameworks MCP Enterprise AI LangGraph vs ADK EU AI Strategy

Read the booklet →

Open-Weight Model Families & Model Selection

April 2026, first freshness reading July 2026 · Interactive booklet · 3 parts · Workshop exercise

A decision framework for on-prem inference with open-weight models. Covers the five core model families (Llama, Gemma, Qwen, Mistral, Phi) plus the challenger tier (DeepSeek, GLM, Kimi, GPT-OSS), practical hardware-to-model mapping for H100/H200/DGX Spark, quantization trade-offs, inference framework selection, and a reusable decision checklist. Includes four interactive scenarios where participants select and justify model choices, and a dated July 2026 reading of what has rotated since April.

Open-Weight Models Model Selection On-Prem Inference DGX Spark Workshop Tool

Read the booklet →

Building Agentic AI — Design Patterns from Production

April 2026 · ~28,000 words · 10 chapters

Actionable architectural patterns for building AI coding agents and agentic systems, extracted from production-grade architecture. Covers persistent memory, background consolidation, tool constraints, prompt economics, output calibration, security, multi-agent orchestration, and capability gating. Each chapter teaches one pattern with practitioner guidance.

Companion to → LLM-Human Interaction Design Patterns

Agentic AI Design Patterns Architecture AI Agents Practitioner Guide

Read the booklet →

LLM-Human Interaction Design Patterns for Operations

April 2026, revised July 2026 · ~28,000 words · 10 chapters · Interactive companion suite

"Don't worry, we'll just put a human in the loop." This guide takes that sentence apart and rebuilds it properly: the evidence against naive oversight (meta-analyses, oversight-as-theater, the moral crumple zone), five structural interaction patterns, the cognitive biases that undermine handoffs, SBAR-based context presentation, trust calibration, failure design with kill switches and circuit breakers, and organizational governance. Includes prompt templates, worksheets, and per-chapter links into the Human-in-the-Loop Lab, seven playable simulations on demos.barcik.training.

Companion to → Building Agentic AI · Play → The Human-in-the-Loop Lab

Human-AI Interaction Design Patterns Operations Trust Calibration Practitioner Guide

Read the booklet →

Scenario Planning for Generative AI

July 2026 · Interactive booklet · Capex decoder + 8 currents · Trigger logs · Author’s scoreable bets

Opens with the capex decoder, an interactive method for reading the future from money already spent, then eight currents shaping the next 2–3 years of generative AI: continued scaling, the efficiency revolution, a financial correction, sovereignty, the move from lab to production, the economics of hours and dollars, the physical substrate (power, chips, permits), and the political economy of displacement. Each current carries trigger signals plus a dated trigger log of what has actually fired since the last edition, and role-specific implications. Closes with how the currents interact and a page of the author’s own dated, falsifiable bets (“Where I’d Put My Chips”).

Scenario Planning GenAI Strategy AI Investment Interactive

Read the booklet →

The Token Economics

April 2026, updated July 2026 · ~49,000 words · 14 chapters

A strategic guide for EU IT services providers navigating GenAI. Covers the economics of self-hosting LLMs vs APIs with every calculation shown, thirteen business models compared in a single portfolio view, the staff-augmentation squeeze on time-and-materials contracts, the vendor ecosystem play, how AI transforms your own delivery model, EU AI Act compliance opportunities, and a practical 18-month roadmap. Grounded in real April 2026 pricing data, with dated July 2026 notes where the world moved, and cross-linked with the Scenario Planning and Mercantilism booklets.

Companion to → The Economics of the Frontier

GenAI Economics IT Services EU AI Act Business Strategy Self-Hosting vs API

Read the booklet →

AI Act (2) Companion textbooks to the EU AI Act courses

The EU AI Act: An IntroductionNew

13 chapters · ~38,000 words · July 2026 · Textbook

Companion textbook to the online course EU AI Act Compliance Introduction.

A plain-language guided tour of Europe's AI regulation, written to be read with morning coffee rather than decoded like a statute: what counts as AI, the eight prohibited practices, how a system becomes high-risk and what that demands, transparency duties, general-purpose AI, and what to actually do on Monday. Post-Omnibus dates throughout, a quiz after every chapter, and honest hedges where the law is still unsettled.

EU AI Act Compliance Plain language Quizzes

Read online →Download PDF ↓

EU AI Act for DevelopersNew

11 chapters · ~44,000 words · July 2026 · Textbook

Companion textbook to the video course EU AI Act for Developers: Compliance Engineering in Python.

The AI Act read as a requirements document. Classification, data governance, accuracy and robustness, security, logging, human oversight and technical documentation, each translated from legal prose into practices and Python you can run. Three fictional systems carry through every chapter, and each chapter ends in a worksheet that becomes part of your audit-ready evidence pack.

EU AI Act Engineering Python Evidence pack

Read online →Download PDF ↓

Research Reports (4) Evaluation studies and empirical research

Warden — Testing LLM-as-Judge Defenses Against Public Jailbreaks

May 2026 · Research report · 1,680 trials · 3 targets · 4 judge designs

Empirical study of LLM-as-judge defenses against the public jailbreak corpus from ZetaLib. Three open-weight target models (DeepSeek Chat v3.1, DeepSeek v3.2, GLM-4.6) tested against 20 attacks × 4 deployment-rule shapes × 7 defense conditions. The hypothesis — that a competent LLM-as-judge defeats most public attacks — holds, but the popular implementation (input-side filtering) over-blocks legitimate inputs to a degree that would force the defense to be turned off. Includes a deployment playbook for engineering teams and hands-on exercises for students.

Warden LLM Security Prompt Injection LLM-as-Judge Defense Evaluation

Read the report →

GeoBias — 7B Model Evaluation Report

March 2026 · Research report · 5 models · 3 evaluators

Systematic evaluation of geopolitical biases in 7B-parameter language models from three origins (US, CN, EU). Tests 88 prompts across 7 categories using a multi-evaluator panel. Reveals asymmetric performance on sensitive topics and scripted deflection patterns.

GeoBias LLM Evaluation Geopolitical Bias Research

Read the report →

SelfJudge — Can Small LLMs Judge Their Own Outputs?

March 2026 · Research report · 5 models (1B–27B)

Evaluates whether small language models can reliably assess the quality of their own outputs. Tests self-judgment accuracy across factual grounding, instruction following, safety boundaries, consistency, and tone — with accuracy ranging from 50% (1B) to 83% (27B).

SelfJudge Self-Evaluation Small LLMs Research

Read the report →

Bloom — AI Behavioral Safety Evaluation

March 2026 · Research report · 11 behaviors tested

Behavioral safety evaluation using Anthropic’s Bloom framework. Tests 11 risk behaviors including emotional bonding, social engineering assistance, self-preservation, corrigibility resistance, and covert goal pursuit. Scores range from 2.1 to 6.8 on a 10-point scale.

Bloom AI Safety Behavioral Evaluation Red-Teaming

Read the report →