Magda connectors go out to external datasources and copy their metadata into the Registry, so that they can be searched and have other aspects attached to them. A connector is simply a docker-based microservice that is invoked as a job. It scans the target datasource (usually an open-data portal), then completes and shuts down.
Magda CSW Connector is created for crawling data from CSW(Catalog Service for the Web) data source.
Since v2.0.0, we use Github Container Registry as our official Helm Chart & Docker Image release registry.
It's recommended to deploy connectors with as dependencies of a Magda helm deployment.
dependencies:
- name: magda-csw-connector
version: "2.0.0"
alias: connector-xxx
repository: "oci://ghcr.io/magda-io/charts"
tags:
- connectors
- connector-xxx
Kubernetes: >= 1.21.0
Repository | Name | Version |
---|---|---|
oci://ghcr.io/magda-io/charts | magda-common | 2.1.1 |
Key | Type | Default | Description |
---|---|---|---|
config.basicAuthEnabled | bool | false |
Whether or not to send basic auth header. |
config.basicAuthPassword | string | nil |
basic auth password. You can also passing this value via secret. To do so, set basicAuthSecretName to the secret name. |
config.basicAuthSecretName | string | nil |
You can set this value to supply basic auth username & password. The secret must have two keys: username & password . |
config.basicAuthUsername | string | nil |
basic auth username. You can also passing this value via secret. To do so, set basicAuthSecretName to the secret name. |
config.id | string | "default-csw-connector" |
|
config.name | string | nil |
Friendly readable name. Compulsory. |
config.outputSchema | string | http://www.isotc211.org/2005/gmd | Desired output schema to be requested from CSW service as URI |
config.pageSize | int | 100 |
When crawling through from beginning to end, how big should the individual requests be in records? |
config.schedule | string | "0 14 * * 6" i.e. 12am Sydney time on Sunday | Crontab schedule for how often this should happen. |
config.sourceUrl | string | nil |
The base URL of the place to source data from. Compulsory. |
config.typeNames | string | gmd:MD_Metadata | Record type expected to be returned from CSW service as XML tag name |
config.usePostRequest | bool | false |
Whether or not use POST request to call getRecords API |
defaultImage.imagePullSecret | bool | false |
|
defaultImage.pullPolicy | string | "IfNotPresent" |
|
defaultImage.repository | string | "ghcr.io/magda-io" |
|
defaultSettings.includeCronJobs | bool | true |
|
defaultSettings.includeInitialJobs | bool | false |
|
defaultTenantId | int | 0 |
|
global.connectors.image | object | {} |
|
global.image | object | {} |
|
image.name | string | "magda-csw-connector" |
|
resources.limits.cpu | string | "100m" |
|
resources.requests.cpu | string | "50m" |
|
resources.requests.memory | string | "30Mi" |