Data model

The schema lives in internal/store/migrations/ as numbered SQL files (0001_init.sql … 0012_book_enrichment.sql), embedded into the binary and applied by store.Open at startup. This page documents the resulting current schema, noting which migration added what.

The two halves: rebuildable index vs durable state

The schema is deliberately split in two, and the split is the most important thing to understand before touching it:

The rebuildable index - books, book_files, chapters, books_fts - is a cache of what the scanner found on disk. It can be dropped and rebuilt from a rescan at any time. books.id is an internal artifact of this half: it must never appear in the API contract or in durable user state.
Durable state - progress, bookmarks, notes, listening_history, favourites (per-user), plus folder_overrides and book_enrichment (per-library config) - is keyed by (library_id, rel_path), with no foreign key to books.

Why no FK across the seam? Three reasons, all load-bearing:

Rebuild survival. If durable state referenced books.id, deleting and re-indexing a library (or the prune step of a normal scan) would cascade away every user's progress. Path keys survive because the scanner re-creates the same rel_paths.
Re-tagging survival. Fixing a book's tags changes its indexed metadata but not its path, so state keyed by path is untouched.
Pre-index writes. A client can start playing a book the background scan hasn't reached yet (the filesystem view needs no index); progress saved at that moment has no books row to reference.

The remaining gap - a file that moves on disk - is covered by move-tracking: the scanner fingerprints files (books.content_hash) and calls catalog.MoveDurableState to carry all six path-keyed tables from the old path to the new one (see Scanner).

warning

When adding a new per-user or per-library-config table, follow the pattern: key it on (user_id, library_id, rel_path) or (library_id, path), FK only to users/libraries (with ON DELETE CASCADE), never to books - and add a line to catalog.MoveDurableState so moves carry it along.

Tables

Accounts & credentials

users (0001; is_demo added in 0005) - id, username (UNIQUE), password_hash (argon2id PHC string; empty string = password-less account, never a hash of ""), role ('admin'/'user'), disabled, is_demo, created_at, updated_at. is_demo flags throwaway demo-mode accounts so the background reaper deletes by flag, not by username prefix (which could catch a real account named demo_*). There is deliberately no last_login column - last activity is derived from MAX(tokens.last_seen).

tokens (0001) - opaque bearer tokens: user_id (FK CASCADE), token_hash (UNIQUE - only the SHA-256 hash is stored), kind ('session'/'pairing'), device_name, created_at, last_seen (bumped on every authenticated request), expires_at (NULL = no expiry), revoked.

auth_codes (0001; kind + redeemed_at added in 0010) - redeemable codes: code_hash (UNIQUE, SHA-256 of the normalized code), user_id (FK CASCADE), label, max_uses (0 = unlimited), uses, expires_at, redeemed_at (first-redemption stamp, informational), created_at, and kind:

'invite' - admin-minted onboarding secret, bounded (default 5 uses / 1 day).
'recovery' - user-owned durable credential (unlimited uses, never expires).

Both redeem through the same path; see Auth & security for the lifecycle (supersede-on-mint, rotate, atomic claim).

Libraries & shares

libraries (0001; layout dropped in 0007; sort_order added in 0011) - id, name (UNIQUE), root (an absolute local path), default_view, sort_order, created_at. There is no layout column: library shape is auto-detected per folder by the scanner, with folder_overrides as the correction mechanism. sort_order drives library listing order and is the tiebreaker when de-duplicating copies of the same book that appear in more than one library (search / recently-added): all else equal, the copy in the earlier-ordered library wins.

shares / share_paths / user_share_access (0003, replacing the dropped user_library_access) - filesystem-based access control. A share is a named set of path rules; share_paths(share_id, library_id, path) grants a subtree per rule, with path = "" meaning the whole library; user_share_access grants shares to users. shares.read_only is persisted and editable but not yet enforced (all share access is already read-only - it gates a future write/upload path). How rules become an enforced Scope is covered in Auth & security.

The rebuildable index

books (0001; added_at in 0004; codec in 0008) - one row per book, UNIQUE (library_id, rel_path). Columns: is_folder (folder book vs single-file book), identity metadata (title, author, series, series_index, narrator), duration, asin/isbn (optional external ids - present so enrichment/metadata services can attach data without reshaping the schema), cover_path (a library-relative sidecar image, "" = fall back to embedded art), format, codec (ffprobe codec_name, "" when unknown - drives the direct_playable API flag), size, mtime, content_hash, indexed_at, added_at.

Two columns deserve emphasis:

content_hash is a cheap move-detection fingerprint - SHA-256 of (size, first 64 KiB, last 64 KiB) - not an identity and not a full hash.
added_at records when the book first appeared on disk (file birth time, falling back to mtime; earliest file for folder books). Unlike the auto-increment id - which a full re-index reshuffles - it is a stable chronological key, and UpsertBook deliberately never updates it on re-index. Index idx_books_added supports the "recently added" sort.

book_files (0001) - the ordered parts of a folder book: book_id (FK CASCADE), rel_path, seq, duration, format, size.

chapters (0001; file_index + book_offset in 0002; file_path in 0003) - normalized playable units, identical in shape for a chaptered single-file m4b and a folder of mp3 parts: book_id (FK CASCADE), idx, title, file_index (ordinal of the containing file), file_path (the library-relative audio file to stream - playback is purely path-based), start/end (offsets within that file), and book_offset (the chapter's start on the whole-book timeline). See Scanner for how these are built.

books_fts (0001) - see FTS design below.

Durable per-user state (path-keyed, no FK to books)

All re-created in 0003 when identity moved to the path (pre-1.0, so the tables were rebuilt rather than migrated in place):

progress - PK (user_id, library_id, rel_path); position, duration, finished, playback_speed, device_id, updated_at, and version (monotonic, breaks updated_at ties). Reconciliation is last-write-wins in catalog.SaveProgress; any future realtime layer must reuse that merge.
bookmarks, notes - id-PK rows keyed by (user_id, library_id, rel_path) plus position and text.
listening_history - listening spans (from_pos, to_pos, started_at, ended_at).
favourites (0009) - PK (user_id, library_id, rel_path). A favourite may address any path: a navigation folder (author/series), a book folder, or a single-file book.

Durable per-library config (path-keyed, no FK to books)

folder_overrides (0006) - PK (library_id, path), mode ∈ {'book', 'collection'} (allow-list enforced in catalog.SetFolderOverride). Pins a folder's detection when the scanner's heuristic gets it wrong: book forces folder-is-one-book, collection forces one book per file.
book_enrichment (0012) - PK (library_id, path), asin, isbn, updated_at. Attached by the manager when it matches an external source (e.g. an Audible library) to an indexed book. The scanner re-applies it onto freshly indexed rows (catalog.ApplyEnrichments) so it survives rebuilds; blank fields never overwrite stored values (NULLIF/COALESCE merge in applyEnrichmentToPathSQL).

`schema_migrations`

Created by store.migrate itself (not a migration file): name (PK), applied_at. Records which migration files have run.

Migration policy: append-only

Migrations are append-only. To change the schema, add a new internal/store/migrations/000N_*.sql - never edit an applied file. The filename must match NNNN_*.sql (store.go rejects misnamed files, because application order is lexical). Each migration runs in its own transaction and is recorded in schema_migrations; already-recorded names are skipped.

The migration history so far:

#	File	What it did
0001	`init`	Initial schema (users, tokens, auth_codes, libraries, books, book_files, chapters, books_fts, book-id-keyed listening state)
0002	`chapter_parts`	`chapters.file_index` + `book_offset` (multi-file normalization)
0003	`paths_and_shares`	Re-keyed all durable state to `(user, library, rel_path)`; `chapters.file_path`; replaced `user_library_access` with shares
0004	`book_added_at`	`books.added_at` (stable "recently added" key)
0005	`user_is_demo`	`users.is_demo` flag for the demo reaper
0006	`folder_overrides`	Per-folder detection overrides
0007	`drop_library_layout`	Dropped `libraries.layout` (detection is per-folder now)
0008	`book_codec`	`books.codec` (drives `direct_playable`)
0009	`favourites`	Per-user path-keyed favourites
0010	`authcode_kind`	`auth_codes.kind` (invite/recovery) + `redeemed_at`
0011	`library_sort_order`	`libraries.sort_order` (display order + dedup tiebreak)
0012	`book_enrichment`	Path-keyed ASIN/ISBN enrichment

SQLite choices

Pure-Go driver. The server uses modernc.org/sqlite (no CGO), so the binary cross-compiles for every release target without a C toolchain - a hard requirement of the native-distribution pipeline (see Release pipeline).

Pragmas. Every connection gets journal_mode(WAL), busy_timeout(5000), and foreign_keys(ON) appended to its DSN (store.dsnPragmas). foreign_keys defaults off per SQLite connection, and the schema relies on ~28 ON DELETE CASCADE rules (deleting a user removes their sessions, codes, progress, bookmarks, notes, history, and grants) - so the pragmas are appended unconditionally, with the correct ?/& separator, rather than skipped when a DSN already carries query params.

Writer/reader pool split. store.DB owns two pools over the same file:

a writer pool capped at one connection (SetMaxOpenConns(1)) - SQLite serializes writers anyway, and a single connection avoids "database is locked" churn;
a read-only reader pool (query_only(ON), max(NumCPU, 4) connections) - WAL allows concurrent readers, and routing reads separately means a slow or stuck write (e.g. stalled on a network volume) never blocks browsing. /healthz probes the reader pool for the same reason.

An in-memory DSN (tests) stays single-pool: an in-memory database is per-connection, so a second pool would silently be a distinct, empty database. Supporting telemetry: WithTx logs any transaction over 2 s, and a background sampler warns when callers queue for the writer connection.

Keyset pagination

catalog.ListBooks paginates with an opaque keyset cursor, not OFFSET: the cursor encodes (sort value, id) (base64 of value \x00 id), and the next page is fetched with an index-friendly row-value comparison ((col, id) > (?, ?) - id breaks ties). This keeps paging O(page) however deep into a 50,000-book library the caller is, where OFFSET n degrades linearly with n. Default page size 50, cap 200; one extra row is fetched to detect whether a next page exists.

note

Don't switch any list endpoint over a potentially-large table to OFFSET pagination - it is design priority #2.

FTS design (`books_fts`)

books_fts is a standalone FTS5 virtual table over (title, author, series, narrator) - it owns its own storage rather than using FTS5's external-content mode. The trade-off is deliberate: standalone storage keeps the sync code to a trivial delete-then-insert, at the cost of duplicating four small text columns.

It is kept in sync by the application, not by triggers, keyed by rowid = books.id:

catalog.UpsertBook deletes then re-inserts the FTS row inside the same transaction as the book upsert, so index and FTS can't diverge.
catalog.DeleteBooksNotIn (the scanner's prune step) deletes the FTS row alongside each stale book.

catalog.Search joins books_fts to books on rowid, ranked by FTS relevance (ORDER BY rank). User input never reaches FTS5 syntax directly: buildMatchQuery reduces it to quoted, AND-ed prefix terms ("foo"* AND "bar"*), giving type-ahead behavior without exposing FTS operators. Results are scoped per-library by the caller's share rules and de-duplicated across libraries (see Data model → libraries for the sort_order tiebreak).

If you add a searchable column, update both sides of the sync (UpsertBook, DeleteBooksNotIn) and the FTS table definition via a new migration.

The two halves: rebuildable index vs durable state​

Tables​

Accounts & credentials​

Libraries & shares​

The rebuildable index​

Durable per-user state (path-keyed, no FK to books)​

Durable per-library config (path-keyed, no FK to books)​

schema_migrations​

Migration policy: append-only​

SQLite choices​

Keyset pagination​

FTS design (books_fts)​