Fix multipart body arrays #938

micha91 · 2024-01-13T09:49:50Z

As described in #692, arrays of files are not handled correctly when they are part of a multipart/form-data body. This PR fixes that by having to_multipart return a List[Tuple[str, Any]] instead of a Dict[str, Any].
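
To illustrate why the return type matters, here is a minimal sketch. It is not the generator's template output: the `FilePart` alias and both helper functions are hypothetical names used only for this illustration. A dict maps each field name to a single value, whereas a list of `(name, value)` tuples can repeat the same field name once per file.

```python
from typing import Any, Dict, List, Tuple

# Hypothetical alias: (file_name, content, mime_type), the tuple shape used in this thread.
FilePart = Tuple[str, bytes, str]


def to_multipart_dict(files: List[FilePart]) -> Dict[str, Any]:
    """Dict-based shape: only one value per field name, so later files replace earlier ones."""
    result: Dict[str, Any] = {}
    for part in files:
        result["files"] = part
    return result


def to_multipart_list(files: List[FilePart]) -> List[Tuple[str, Any]]:
    """List-based shape: the field name is repeated once per file."""
    return [("files", part) for part in files]


uploads = [("a.txt", b"first", "text/plain"), ("b.txt", b"second", "text/plain")]
print(to_multipart_dict(uploads))  # a single "files" entry survives
print(to_multipart_list(uploads))  # two ("files", ...) entries
```

The overwrite in `to_multipart_dict` is only a way to show the limitation of the dict shape; it is not a claim about what the previous generated code did.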

dbanty · 2024-01-15T00:25:37Z

Thanks for adding this support! I believe this is a breaking change, since it's possible someone was using the JSON serialization behavior before (even though it seems wrong in general). I'll need to do some manual testing just to make sure I know how to describe the breaking change 😅. I'd also love to get someone to test a real running API and verify it all functions (maybe you've already done this?)

micha91 · 2024-01-15T06:43:24Z

Yes, I already use the updated model template as a custom template in a project. Unfortunately, this project is not public yet, and the API it is developed for isn't publicly accessible either.

ratgen · 2024-02-15T12:58:14Z

Hi, just tested this out with an API I am working on. It works great.

FabianSchurig · 2025-03-23T12:22:52Z

@dbanty @micha91 Thank you so much for providing the openapi-python-client package and this PR!

I tested it in the following way:
pipx install git+https://github.com/openapi-generators/openapi-python-client.git@refs/pull/938/head

openapi-python-client update --path ../../swagger.json --config openapi-python-generator-config.yml

    "/upload": {
      "post": {
        "tags": [
          "Microsoft.KernelMemory.ServiceAssembly"
        ],
        "summary": "Upload a new document to the knowledge base",
        "description": "Upload a document consisting of one or more files to extract memories from. The extraction process happens asynchronously. If a document with the same ID already exists, it will be overwritten and the memories previously extracted will be updated.",
        "operationId": "UploadDocument",
        "requestBody": {
          "description": "Document to upload and extract memories from",
          "content": {
            "multipart/form-data": {
              "schema": {
                "type": "object",
                "properties": {
                  "index": {
                    "type": "string",
                    "description": "Name of the index where to store memories generated by the files."
                  },
                  "documentId": {
                    "type": "string",
                    "description": "Unique ID used for import pipeline and document ID."
                  },
                  "tags": {
                    "type": "object",
                    "additionalProperties": {
                      "type": "string"
                    },
                    "description": "Tags to apply to the memories extracted from the files."
                  },
                  "steps": {
                    "type": "array",
                    "items": {
                      "type": "string"
                    },
                    "description": "How to process the files, e.g. how to extract/chunk etc."
                  },
                  "files": {
                    "type": "array",
                    "items": {
                      "type": "string",
                      "format": "binary"
                    },
                    "description": "Files to process and extract memories from."
                  }
                }
              }
            }
          }
        },

In general it works!
I would greatly appreciate it if we could have similar functionality to solve #692 in the main branch soon.

@dbanty What do you think about an alternative where a user could just override how a

"files": {
      "type": "array",
      "items": {
        "type": "string",
        "format": "binary"
      },

is handled?

This could either generate:

        files: Union[Unset, tuple[None, bytes, str]] = UNSET
        if not isinstance(self.files, Unset):
            _temp_files = []
            for files_item_data in self.files:
                files_item = files_item_data.to_tuple()

                _temp_files.append(files_item)
            files = (None, json.dumps(_temp_files).encode(), "application/json")

or

        files: Union[Unset, tuple[None, bytes, str]] = UNSET
        if not isinstance(self.files, Unset):
            files = files

The latter works well in this case and should be just a minimal, non-breaking, and optional change?

micha91 · 2025-05-13T08:37:59Z

I reimplemented the changes based on the latest main branch. I tried to merge the changes from main into this branch, but there were too many changes in the end and the review view was completely messed up.
@dbanty I would really like to get this merged. We use the generator to build our REST API client for Polarion, which is a proprietary tool, so we cannot adjust its API. Because the current template version on main does not work for arrays of files at all (it leads to a JSON serialization error, as bytes are not JSON serializable), we have to use a custom template to get a working client. Updating these custom templates for new generator releases is really hard work.
Because you asked before for a real-world example: the full specification of the Polarion REST API can be found here and an endpoint using this array of files can be found here.
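
The serialization error mentioned above can be reproduced with the standard library alone. This is a small sketch with made-up file tuples in the `(file_name, content, mime_type)` shape; it does not run any generated client code.

```python
import json

# Hypothetical file tuples; the bytes payloads are what json.dumps cannot handle.
file_tuples = [("a.txt", b"first", "text/plain"), ("b.txt", b"second", "text/plain")]

try:
    json.dumps(file_tuples)
    error_message = None
except TypeError as exc:
    # json only knows how to encode str, numbers, bools, None, lists and dicts.
    error_message = str(exc)

print(error_message)
```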


dbanty · 2025-05-17T03:36:47Z

@micha91 thanks for updating! I'm slowly making some changes as I get time. A couple of things still left to look into:

- According to the docs, we should actually be using the data param instead of the files param for everything that's not a file... which would simplify things a lot, I think. But it also doesn't specify what types are allowed here, so I have to look into that.
- I want to add multiple files to the integration tests just to make sure we keep it working long term.

Hoping to get this all wrapped up soon. Thanks for your hard work and patience on this 🙇

dbanty · 2025-05-31T18:39:00Z

Integration tests are updated, so left to do:

- Figure out what the new minimum version of httpx is or how to continue supporting 0.20 (https://github.com/openapi-generators/openapi-python-client/actions/runs/15366450734/job/43239989641?pr=938)
- Remove now-dead code triggering coverage warning
- ~~Use data instead of files when appropriate per the docs~~ Decided not to do this because data doesn't give us control over encoding, which we'll need some day.

dbanty · 2025-06-03T02:23:11Z

@micha91 (and all following) thanks for all the work, I think this is almost ready! One thing I've noticed as I go through and test/fix more stuff around multipart is that having a top-level schema of an array really didn't work anyway. The test we had with this schema:

"multipart/form-data": {
    "schema": {
        "type": "array",
        "items": {
            "type": "string",
            "format": "binary"
        }
    }
}

Produced code like this:

    for body_item_data in body:
        body_item = body_item_data.to_tuple()

        _body.append(body_item)

    _kwargs["files"] = _body

Which, as far as I can tell, is completely invalid, since each .to_tuple() will return (file_name, content, mime_type) but no field name, which would look like ("files", (file_name, content, mime_type)). So I just removed that.

Any reason you can think of that we should keep some sort of top-level array-of-files support? How would that even work in multipart?
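
To make the role of the field name concrete, here is a simplified, hand-rolled multipart encoder. It is an illustration only, not what httpx or the generator does internally: `encode_multipart` and the fixed boundary are hypothetical, and quoting or escaping of names is deliberately left out. Each part needs a `name` for its `Content-Disposition` header, which a bare `(file_name, content, mime_type)` tuple cannot supply.

```python
from typing import List, Tuple

# Hypothetical alias: (file_name, content, mime_type)
FilePart = Tuple[str, bytes, str]


def encode_multipart(fields: List[Tuple[str, FilePart]], boundary: str) -> bytes:
    """Simplified multipart/form-data encoder (no escaping), for illustration only."""
    chunks: List[bytes] = []
    for field_name, (file_name, content, mime_type) in fields:
        chunks.append(f"--{boundary}\r\n".encode())
        # The field name goes into Content-Disposition; without it the part is unnamed.
        chunks.append(
            f'Content-Disposition: form-data; name="{field_name}"; '
            f'filename="{file_name}"\r\n'.encode()
        )
        chunks.append(f"Content-Type: {mime_type}\r\n\r\n".encode())
        chunks.append(content + b"\r\n")
    chunks.append(f"--{boundary}--\r\n".encode())
    return b"".join(chunks)


body = encode_multipart(
    [
        ("files", ("a.txt", b"first", "text/plain")),
        ("files", ("b.txt", b"second", "text/plain")),
    ],
    boundary="example-boundary",
)
print(body.decode())
```

With an object schema, the property name (here `files`) provides the field name. A top-level array has no property name to use.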

micha91 · 2025-06-03T07:08:11Z

@dbanty Thanks for your additional efforts on this! I will give the current implementation a try today to see if it also works in our real world client 👍

> Any reason you can think of that we should keep some sort of top-level array-of-files support? How would that even work in multipart?

I think it is simply unsupported when using multipart, so it should be absolutely correct to drop it

micha91 · 2025-06-05T17:21:06Z

Works as expected. I still need some custom templates, but that's due to issues in the specification.

@micha91

> [!IMPORTANT]
> Merging this pull request will create this release

## Breaking Changes

- Raise minimum httpx version to 0.23

### Removed ability to set an array as a multipart body

Previously, when defining a request's body as `multipart/form-data`, the generator would attempt to generate code for both `object` schemas and `array` schemas. However, most arrays could not generate valid multipart bodies, as there would be no field names (required to set the `Content-Disposition` headers).

The code to generate any body for `multipart/form-data` where the schema is `array` has been removed, and any such bodies will be skipped. This is not _expected_ to be a breaking change in practice, since the code generated would probably never work.

If you have a use-case for `multipart/form-data` with an `array` schema, please [open a new discussion](https://github.com/openapi-generators/openapi-python-client/discussions) with an example schema and the desired functional Python code.

### Change default multipart array serialization

Previously, any arrays of values in a `multipart/form-data` body would be serialized as an `application/json` part. Now, each array item is serialized as its own part. This matches the default behavior specified by OpenAPI and supports arrays of files (`binary` format strings). However, because this generator doesn't yet support specifying `encoding` per property, this may result in now-incorrect code when the encoding _was_ explicitly set to `application/json` for arrays of scalar values.

PR #938 fixes #692. Thanks @micha91 for the fix, @ratgen and @FabianSchurig for testing, and @davidlizeng for the original report... many years ago 😅.

Co-authored-by: knope-bot[bot] <152252888+knope-bot[bot]@users.noreply.github.com>
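
The difference between the two array serializations in the release notes can be sketched as follows. Both helper functions are hypothetical. The one-part-per-item shape and its `text/plain` content type are assumptions made for this illustration, not a copy of the generated code.

```python
import json
from typing import List, Optional, Tuple

# (file_name, content, mime_type); file_name is None for non-file parts.
Part = Tuple[Optional[str], bytes, str]


def as_single_json_part(name: str, values: List[str]) -> List[Tuple[str, Part]]:
    """Older shape: the whole array becomes one application/json part."""
    return [(name, (None, json.dumps(values).encode(), "application/json"))]


def as_one_part_per_item(name: str, values: List[str]) -> List[Tuple[str, Part]]:
    """Assumed newer shape: one part per item, repeating the field name.

    The text/plain content type is an assumption made for this sketch.
    """
    return [(name, (None, value.encode(), "text/plain")) for value in values]


steps = ["extract", "chunk"]
print(as_single_json_part("steps", steps))
print(as_one_part_per_item("steps", steps))
```

A server that expected a single JSON-encoded `steps` part would see two plain parts instead, which is the kind of incompatibility the release notes warn about.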

micha91 force-pushed the fix-multipart-body-file-array branch from 651fddc to fe841d7 Compare January 13, 2024 10:52

micha91 changed the title ~~Fix multipart body file array~~ fix: Fix multipart body file array Jan 13, 2024

dbanty added the 🥚breaking This change breaks compatibility label Jan 15, 2024

micha91 mentioned this pull request Jan 17, 2024

feat: upgrade generator and regenerate auto generated client for Polarion 2310 DSD-DBS/polarion-rest-api-client#20

Merged

dbanty mentioned this pull request Jan 23, 2024

nullable int model parameter treated as a file in multipart with 0.17.0, causing httpx AttributeError #926

Closed

chore: Reimplement functionality based on the latest main

5e8ad7a

micha91 force-pushed the fix-multipart-body-file-array branch from 453d017 to 5e8ad7a Compare May 13, 2025 07:42

dbanty added 5 commits May 14, 2025 21:02

Revert change to integration test pyproject.toml

5840d91

Refresh snapshots

0eba399

Improve grouping of each multipart field

d24b3e5

Remove faulty test case.

2f196ac

Arrays of files are being tested in other endpoints, and this one was slightly malformed.

Simplify property to multipart code.

0f8c7fe

dbanty and others added 3 commits May 27, 2025 13:16

Update integration tests to check multiple files

14bed92

Pin integration test server to 0.2.0

008432f

Merge branch 'main' into fix-multipart-body-file-array

f8a0208

dbanty added 4 commits June 1, 2025 10:07

Pin integration test server to 0.2.0

2767da5

ci: Install pdm in integration tests

1822f6d

Clean up body generation code

e798afe

Remove now-dead code

af9c9fa

dbanty added 2 commits June 2, 2025 20:45

Document main multipart array change

368083f

Simplify transform_multipart_body macro

023a1d4

dbanty changed the title ~~fix: Fix multipart body file array~~ Fix multipart body arrays Jun 3, 2025

Add array of scalar and JSON objects to multipart integration tests

b0e4320

dbanty approved these changes Jun 3, 2025

Fix mypy for integration tests

04e39da

dbanty mentioned this pull request Jun 4, 2025

Fix integration tests #1265

Closed

Merge branch 'main' into fix-multipart-body-file-array

af01dfc

dbanty enabled auto-merge June 6, 2025 01:18

dbanty added this pull request to the merge queue Jun 6, 2025

Merged via the queue into openapi-generators:main with commit 305229b Jun 6, 2025
22 checks passed

knope-bot bot mentioned this pull request Jun 6, 2025

Release 0.25.0 #1267

Merged
