Data Methodology
Bill Taylor’s Show Archive
Upcoming Shows Bands About Bill Musicians Session Work Press Data Methodology Archived Shows

Overview

This site is the result of a hybrid human-curated + AI-assisted archival workflow designed to document, verify, and present decades of live-music performance history in a way that is accurate, maintainable, and future-proof.

Rather than relying on a traditional database or CMS, the project intentionally uses simple, durable tools—spreadsheets, static hosting, and lightweight serverless logic—combined with AI to accelerate data gathering, reconciliation, and presentation.

1. Data sources & initial collection

The foundation of the archive is primary and secondary historical data, including:

Role of AI in data gathering

AI was used extensively in the initial discovery and normalization phase, including:

Important: AI suggestions were treated as proposals, not facts. Every show entry was reviewed and either confirmed, corrected, or discarded by a human before being considered canonical.

2. Canonical data model (Google Sheets)

All authoritative data for the site lives in a single Google Sheet, which serves as the project’s source of truth.

Each row represents one performance and includes (where available):

Why a spreadsheet?

AI continues to assist with validation, backfills, and anomaly checks—but the spreadsheet remains authoritative.

3. Desired site workflow (design philosophy)

From the beginning, the site was designed around a few guiding principles:

  1. No manual web edits required — all updates happen in the spreadsheet.
  2. No fragile admin UI — fewer moving parts means fewer failures.
  3. Every build is lockable — versions can be frozen, archived, and rebuilt later.
  4. Transparency over perfection — methodology matters more than claims of completeness.

AI was used collaboratively to propose page structure, define filtering logic (past vs. future shows), shape collaborator logic, and keep features aligned with available data.

4. Site architecture & wiring

The live site is assembled using the following components:

Google Sheets

Cloudflare Worker

This approach avoids databases, server maintenance, and authentication layers.

Static frontend

Hosting & DNS

AI assisted in debugging hosting/SSL issues, optimizing file size and performance, and designing a fault-tolerant, low-cost deployment model.

5. Press, media, and attribution

Press items and media assets follow the same philosophy:

AI was used to locate historical press references, summarize long articles, and propose standardized metadata fields. All press content remains attributed to original sources.

6. Ongoing maintenance workflow

  1. New show happens (or a historical show is discovered)
  2. Row is added or updated in the Google Sheet
  3. Personnel field uses |-separated names
  4. Recording or media links added when available
  5. Site reflects changes instantly — no rebuild required

AI continues to assist with bulk cleanup, historical backfills, and relationship analysis (collaborators, counts).

7. Transparency & limitations

Despite best efforts:

Rather than obscuring gaps, the archive acknowledges them. This project prioritizes honesty, traceability, and continuous improvement.

8. Why this matters

This archive is not just a list of shows. It is:

Every design and data decision reflects that philosophy.

Last updated: v2.8.10