Context
The docs Quarto book has a comprehensive maps.qmd and a good db.qmd, but api.qmd is a near-empty stub and there's no page on querying the GCS parquet data directly or with the new calcofi4r helpers.
Goal
Document how to query CalCOFI data three ways — direct SQL, calcofi4r helpers, and the int-app download — with a reproducibility story tying them together.
Tasks
Blocked by: CalCOFI/workflows#51, CalCOFI/apps#40, CalCOFI/calcofi4r#10, CalCOFI/int-app#5. Final issue of a 5-issue epic.
Context
The docs Quarto book has a comprehensive
maps.qmdand a gooddb.qmd, butapi.qmdis a near-empty stub and there's no page on querying the GCS parquet data directly or with the newcalcofi4rhelpers.Goal
Document how to query CalCOFI data three ways — direct SQL,
calcofi4rhelpers, and the int-app download — with a reproducibility story tying them together.Tasks
data-access.qmd— direct DuckDB + GCS parquet querying:httpfssetup, single-file vs hive-partitionedread_parquetexamples,## Reproducibilitysection explaining the int-app downloadquery/folderhelpers.qmd— thecalcofi4rbio↔env matching helpers with worked examplesapi.qmd— add a "superseded bycalcofi4rhelpers + direct querying" callout_quarto.yml— insertdata-access.qmdandhelpers.qmdafterdb.qmd, beforeapi.qmdcc_match_ichthyo_by_name(), int-app download — all producing identical rowsquarto render docs/; confirm the worked-example SQL matchesattr(cc_match_ichthyo_by_name(...), "sql")Blocked by: CalCOFI/workflows#51, CalCOFI/apps#40, CalCOFI/calcofi4r#10, CalCOFI/int-app#5. Final issue of a 5-issue epic.