Glossary

| Term | Meaning |
| --- | --- |
| Aircan | NESO's Airflow DAGs (datopian/aircan) that load data into the datastore. |
| Airflow | Open-source workflow orchestrator running the ingestion DAGs. |
| API token | A long-lived credential used by integrators to authenticate Data API requests. |
| BigQuery | The production datastore, which holds the rows of tabular data behind every published resource. |
| CKAN | Comprehensive Knowledge Archive Network: the open-source data catalog at the core of the stack and source of truth for metadata. |
| CKAN API | CKAN's RESTful API for managing datasets, resources, and metadata. PortalJS Admin is the primary client. |
| Cloudflare R2 | S3-compatible object storage used for uploaded resource files (via ckanext-s3filestore). |
| Data API | The read-only HTTP API in front of the datastore, and the only path consumers use to read data. |
| Data schema | The structure of the actual data (CSV columns, JSON properties, etc.). |
| Dataset | A logical collection of data described by metadata. May contain one or more resources. |
| Datastore | The query backend (BigQuery in production, DuckLake planned) that holds the rows of tabular data. NESO replaces CKAN's built-in PostgreSQL datastore extension with this. |
| DuckLake | Planned datastore variant: a DuckDB-backed lakehouse for smaller, self-contained deployments. |
| DXP Frontend | Data Experience Portal: the public Drupal site users browse and search. |
| Environment | A deployed instance of the stack: development, staging, or production. |
| Feeds | Drupal module the DXP uses to pull metadata from CKAN every 5 minutes. |
| GKE | Google Kubernetes Engine, the managed Kubernetes service hosting the stack. |
| Grafana | The dashboard layer over Prometheus metrics. |
| Memcache / Varnish | DXP cache layers: entity cache (Memcache) and rendered HTML cache (Varnish). |
| Metadata schema | The set of required and optional fields a dataset must populate, defined via ckanext-scheming. |
| Organization | A CKAN organisation, a unit that owns datasets and groups its members. |
| PortalJS Admin | The Next.js admin UI for dataset management, user roles, and publishing workflows. |
| Prometheus | The metrics scraper feeding Grafana. |
| Publisher | A user with permission to create and publish datasets in their organisation. |
| Resource | A single file or URL belonging to a dataset (CSV, JSON, API endpoint, etc.). |
| Solr | CKAN's search index for metadata. |
| ckanext-aircan | CKAN extension that triggers Airflow DAGs on dataset/resource events. |
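
To make the Dataset, Resource, and CKAN API entries concrete, here is a minimal Python sketch of reading one dataset's metadata through CKAN's standard `package_show` action and listing its resources. The base URL, dataset name, and sample response values are hypothetical placeholders, not taken from this deployment; only the `/api/3/action/package_show` path and the `success` / `result` / `resources` response shape come from stock CKAN.

```python
import json
from urllib.parse import urlencode

# Hypothetical base URL: the real CKAN address is not given in this glossary.
CKAN_BASE_URL = "https://ckan.example.org"


def package_show_url(dataset_id: str, base_url: str = CKAN_BASE_URL) -> str:
    """Build the URL for CKAN's package_show action for one dataset."""
    query = urlencode({"id": dataset_id})
    return f"{base_url}/api/3/action/package_show?{query}"


def list_resources(response_body: str) -> list[tuple[str, str]]:
    """Return (name, format) pairs for each resource in a package_show response."""
    payload = json.loads(response_body)
    if not payload.get("success"):
        raise ValueError("CKAN reported a failed request")
    resources = payload["result"].get("resources", [])
    return [(r.get("name", ""), r.get("format", "")) for r in resources]


# Illustrative response body in the shape CKAN returns; all values are made up.
sample = json.dumps({
    "success": True,
    "result": {
        "name": "example-dataset",
        "resources": [
            {"name": "data.csv", "format": "CSV"},
            {"name": "data.json", "format": "JSON"},
        ],
    },
})

print(package_show_url("example-dataset"))
print(list_resources(sample))
```

The sketch only covers metadata. Reading the rows behind a resource goes through the Data API, whose endpoints are not described in this glossary.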

Last reviewed: 2026-05-04
