New Release v1.32.0 - #minor #517

boecklic · 2025-02-12T09:22:13Z

This release contains the latest and last functional improvements for stac v1 along with support for the new authentication mechanism.

Do not login when testing search endpoint

PB-1281: Disable old authentication for STAC v1

Since we can't easily determine the size of external assets, we write -1 in the `file_size` field of the Asset model. This is to distinguish it from the assets that are due for uploading

Add filters that help find assets that are either in progress of uploading or that are missing on s3

…pload-workflow PB-1360 Improve internal upload workflow

Change the implementation of the file size update management command so that it uses the s3 backend to determine the file size of an object. This is needed because the file path can actually differ from the asset name, as is the case with for instance https://data.geo.admin.ch/api/stac/v0.9/collections/ch.swisstopo.bahnen-winter/items/bahnen-winter/assets/additional-files_bahnen-winter_2056.zip

PB-1091 Fix file size updater

use new auth in v1.0 tests

For the payload validation with a nested validator, we need to use the dedicated get_initial() because initial_data is not set.

This should trigger an upsert but because model Item has `unique_together = (('collection', 'name'),)` we get the following error django.db.utils.IntegrityError: duplicate key value violates unique constraint "stac_api_item_collection_id_name_78fbc154_uniq" DETAIL: Key (collection_id, name)=(1, item-1) already exists. Unfortunately, bulk_create only lets you define a flat list of unique_fields, so more complex unique constraints cannot be taken into account. Probably this needs a workaround.

The `update_conflicts` parameter of `bulk_create()` does not work with compound unique constraints (`unique_together` in model's `Meta`). So we do it manually ourselves: 1. Check for existing element 2. Update their fields with the values from the payload 3. Call `bulk_update()` on those 4. For the new elements: Call `bulk_create()` `bulk_update()` requires proper model instances, so I had to somehow convert the `QuerySet` for the existing item into an actual Item. I could not find something less ugly than calling `in_bulk()` and extracting the instance.

After a discussion we decided to keep things simple and tailor the endpoint to the most common use case: Insertion. Should updating at the same time, so upsert, be an actual use case, we can still adapt to it.

There is no guarantee Item.objects.bulk_create() returns the created objects in the same order, so using a dictionary is safer.

This probably needs further changes because we don't want `assets` to be writable in every case.

With the addition of the bulk upload in /collections/{collection_id}/items there is now a POST endpoint.

There is a `Serializer.get_initial()` that would do something similar but not quite. If we use `get_initial()` , the test for the new POST endpoint works but other existing tests fail, like e.g. `ItemsUpdateEndpointTestCase.test_item_endpoint_patch_extra_payload`. Not fully clear at this point why. Also, it's not clear if this renders the `validate_json_payload` useless for the new serializer (`ItemListSerializer`). What is clear: `initial_data` is not set for AssetBaseSerializer when calling `ItemListSerializer.is_valid()`. So `validate_json_payload()` fails. The serializers are connected like this: ItemLIstSerializer --> ItemSerializer --> AssetsForItemSerializer --> AssetBaseSerializer

With the addition of the bulk upload in /collections/{collection_id}/items there is now a POST endpoint.

Contrary to the initial plan to make the POST /collections/{collection_id}/items endpoint a place to both create and update ("upsert") items, it is now reduced to only create new items. So we can address the "abstract-method" warning by the linter by just expressing that we don't intend to use that functionality.

There is no guarantee to have always ID "1" for the collection, so we get the ID dynamically instead of hardcoding it.

Originally, 200 was returned for an empty list because technically no Item is created for an empty list. To make the understanding and handling of the endpoint easier, however, the endpoint now returns 201 just as if an Item had been created.

Originally this was under /collections/{collectionId] but we realized that it actually makes a lot more sense that the POST mirrors the GET of /collections/{collectionId}/items, So POST is moved there as well. Also, the docs for the responses are updated to match the implementation.

Just check that it exists and that it is not empty. We actually do something with it only later. To avoid changing the interface later, we already add the header parameter now.

Otherwise for bulkCreateItems the assets do not show up in the example payload. The endpoints making use of itemAssets: /collections GET getCollections /collections/{collectionId} GET describeCollection PUT updateCollection PATCH partialUpdateCollection /collections/{collectionId}/items/{featureId} GET getFeature PUT putFeature PATCH patchFeature PUT putFeature /collections/{collectionId}/items GET getFeatures POST bulkCreateItems /search GET getSearchSTAC POST postSearchSTAC However, this is not visible in the specs because all writing endpoints have hardcoded examples instead of relying on their components. Also, we agreed that being able to write assets also elsewhere is not necessarily a problem.

As there is no update, we can simplify this to just do the same as for the assets.

This has been introduced in PR #511. There is still a failing test: ``` test_items_endpoint_post_returns_400_if_item_exists_already ``` The endpoint now returns a HTML instead of a JSON. The response.content says > The request's session was deleted before the request completed. The user may have logged out in a concurrent request, for example. To be investigated.

Removed accidentally on rebasing.

…h-upsert-endpoint-for-items-assets PB-1279 Batch create endpoint for items/assets

…superuser PB-1406: Automatically promote remote users to superusers

Custom file upload page that uses the multipart upload api to upload files of up to 5GB directly to S3.

When assets are created via admin GUI there is no file path set. If the file is then uploaded via API using the multipart upload, the file path will never be set on the asset. This aligns the admin GUI behaviour with that of creating an asset via API PUT. It has the downside that newly created assets already have a file path that is a link to a non-existent file.

Split javascript and css into own file. Locally load store jquery and cryptoJS.

CSS: remove background color from info and error box to improve a11y. JS: use let over var. Replace jquery ajax with fetch API. Fix deprecated FileReader method. Fix formatting. Only show upload large file view for assets that are not external.

Check that variables are displayed in template response.

…-files PB-1331: admin upload large files

We want to tweak those but we don't need to hardcode the exact values for all environments immediately. Another change will update the values in DEV, INT then PROD. Afterwards we can consider setting these values by default in STAC itself. This also removes the specific values we set for the dev build.

``` app/config/settings_prod.py:355:0: C0301: Line too long (101/100) (line-too-long) ```

PB-1440: allow setting SESSION_* settings via environment variables.

PB-1279: Fix test

asteiner-swisstopo

👍

msom and others added 30 commits January 28, 2025 09:37

Do not login when testing search endpoint

e713a37

Merge pull request #507 from geoadmin/remove-login-in-search-test

68eb9df

Do not login when testing search endpoint

PB-1281: Disable old authentication for STAC v1

6414a8c

PB-1281: Add tests

8f093db

Merge pull request #506 from geoadmin/feat-PB-1281-disable-old-auth

b195865

PB-1281: Disable old authentication for STAC v1

PB-1360 Write -1 to file_size for external assets

3804a7e

Since we can't easily determine the size of external assets, we write -1 in the `file_size` field of the Asset model. This is to distinguish it from the assets that are due for uploading

PB-1350 Add filters for the Asset view in Django

0de8c71

Add filters that help find assets that are either in progress of uploading or that are missing on s3

Merge pull request #505 from geoadmin/feat-pb-1360-improve-internal-u…

1acfff0

…pload-workflow PB-1360 Improve internal upload workflow

Merge pull request #509 from geoadmin/fix-pb-1091-file-size-updater

a3016a0

PB-1091 Fix file size updater

use new auth in v1.0 tests

39d4ba6

Merge pull request #511 from geoadmin/use-new-auth-in-tests

d929873

use new auth in v1.0 tests

PB-1279: Add serializer for list of items

e840b50

For the payload validation with a nested validator, we need to use the dedicated get_initial() because initial_data is not set.

PB-1279: Assume only new items in the payload

c16e029

After a discussion we decided to keep things simple and tailor the endpoint to the most common use case: Insertion. Should updating at the same time, so upsert, be an actual use case, we can still adapt to it.

PB-1279: Add POST endpoint /collections/{collection_id}/items

be35b1d

PB-1279: Return 400 if items exists already, 404 if collection not found

47b96f4

PB-1279: Return 200 if no item was created

ab0184e

PB-1279: Make sure links are added to the right Item

8f6922d

There is no guarantee Item.objects.bulk_create() returns the created objects in the same order, so using a dictionary is safer.

PB-1279: Allow assets to be added in payload

d8a24e1

This probably needs further changes because we don't want `assets` to be writable in every case.

PB-1279: Remove obsolete test

fc86bb1

With the addition of the bulk upload in /collections/{collection_id}/items there is now a POST endpoint.

PB-1279: Extend test to include assets in payload

285ab4d

PB-1279: Set "assets" as optional

9255e24

PB-1279: Remove obsolete test also for v0.9

62f38b6

With the addition of the bulk upload in /collections/{collection_id}/items there is now a POST endpoint.

PB-1279: Make test more robust in case other collection exist

99714df

There is no guarantee to have always ID "1" for the collection, so we get the ID dynamically instead of hardcoding it.

asteiner-swisstopo and others added 25 commits February 10, 2025 14:58

PB-1279: Add check that max 100 items are provided

d119da9

PB-1279: Check if Idempotency-Key is in header

b99c626

Just check that it exists and that it is not empty. We actually do something with it only later. To avoid changing the interface later, we already add the header parameter now.

PB-1279: Check if Idempotency-Key is in header

adc705b

Just check that it exists and that it is not empty. We actually do something with it only later. To avoid changing the interface later, we already add the header parameter now.

PB-1279: Add comment on why we don't do anything with the header param

d8a68f9

PB-1279: Add missing white space in wrapped string

4adfcae

PB-1279: Bulk create ItemLinks

04c7bd9

As there is no update, we can simplify this to just do the same as for the assets.

PB-1279: Add GET request to set up session as a workaround

19f63d6

PB-1279: Add missing auth decorator

f5915fe

Removed accidentally on rebasing.

Merge pull request #512 from geoadmin/feat-PB-1279-mch-implement-batc…

e869431

…h-upsert-endpoint-for-items-assets PB-1279 Batch create endpoint for items/assets

PB-1406: Automatically promote remote users to superusers

9f8ad61

Merge pull request #515 from geoadmin/feat-PB-1406-make-remote-users-…

7bf321b

…superuser PB-1406: Automatically promote remote users to superusers

PB-1331: Admin upload large files

7008ada

Custom file upload page that uses the multipart upload api to upload files of up to 5GB directly to S3.

PB-1331: Add help text to admin file upload.

318404c

PB-1331: Refactor upload template

cda082d

Split javascript and css into own file. Locally load store jquery and cryptoJS.

PB-1331: Various fixes from review

d11475a

CSS: remove background color from info and error box to improve a11y. JS: use let over var. Replace jquery ajax with fetch API. Fix deprecated FileReader method. Fix formatting. Only show upload large file view for assets that are not external.

PB-1331: Test custom upload is loaded

64fe06b

Check that variables are displayed in template response.

Merge pull request #513 from geoadmin/feat-PB-1331-admin-upload-large…

2b552f9

…-files PB-1331: admin upload large files

Appease pylint.

291f78d

``` app/config/settings_prod.py:355:0: C0301: Line too long (101/100) (line-too-long) ```

Merge pull request #514 from geoadmin/feat-PB-1440-sessions

088ad5e

PB-1440: allow setting SESSION_* settings via environment variables.

PB-1279: Fix test

f02702b

Merge pull request #516 from geoadmin/feat-PB-1279-fix-text

79f2589

PB-1279: Fix test

github-actions bot added the new-release label Feb 12, 2025

github-actions bot changed the title ~~new release~~ New Release v1.32.0 - #minor Feb 12, 2025

asteiner-swisstopo approved these changes Feb 12, 2025

View reviewed changes

boecklic merged commit 667194f into master Feb 12, 2025
5 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

New Release v1.32.0 - #minor #517

New Release v1.32.0 - #minor #517

boecklic commented Feb 12, 2025

asteiner-swisstopo left a comment

New Release v1.32.0 - #minor #517

New Release v1.32.0 - #minor #517

Conversation

boecklic commented Feb 12, 2025

asteiner-swisstopo left a comment

Choose a reason for hiding this comment