Describe the bug
When a stream is deleted, the stream coordinator may delete the underlying stream while the deletion from rabbit_db_queue fails, since the coordinator and the metadata store are independent Raft clusters. If this happens and the stream coordinator fully deletes the stream, the stream queue becomes effectively stuck: subsequent calls to rabbit_stream_queue:delete/4 time out, because the stream coordinator does not reply to the caller when it receives a {delete_stream, StreamId, #{}} command for a stream it does not know about.
Reproduction steps
This is probably very hard to reproduce in practice, but we can fake the state easily from a shell:
1. make run-broker
2. stream-perf-test --time 1 (create the stream)
3. [SQ] = rabbit_db_queue:get_all().
4. rabbit_stream_coordinator:process_command({delete_stream, maps:get(name, amqqueue:get_type_state(SQ)), #{}}). (simulate the partial failure by deleting only from the stream coordinator and not from the metadata store)
After this, the stream cannot be deleted via AMQP, the stream protocol, the management UI, etc.
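From the same shell, a subsequent delete attempt illustrates the hang (a sketch; the exact argument values passed for if-unused, if-empty and the acting user are assumptions):

```erlang
%% This call now times out because the stream coordinator never replies
%% to the {delete_stream, StreamId, #{}} command for an unknown stream:
rabbit_stream_queue:delete(SQ, false, false, <<"guest">>).
```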
Expected behavior
I think it is reasonable for the stream coordinator to reply ok when it receives a delete_stream command for an unknown stream ID, making the command idempotent.
Additional context
The stream coordinator replies only after it performs the deletion. If there is no such stream we hit the clause here, which produces no reply, so calls to delete a stream that does not exist (according to the coordinator) time out. We could add a clause for the delete_stream command against an undefined stream that results in an ok reply, although it looks like this may take some refactoring. rabbit_stream_coordinator:delete/2 would then continue on to delete from the metadata store.
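A minimal sketch of what the extra clause could look like. The function head, the state record fields and the return/4 helper are assumptions about the coordinator's internals, not the actual implementation:

```erlang
%% Hypothetical sketch: handle delete_stream for an unknown StreamId.
%% Clause placement, the #?MODULE{} state record and the return/4
%% helper are assumptions about rabbit_stream_coordinator internals.
apply(Meta, {delete_stream, StreamId, #{}},
      #?MODULE{streams = Streams} = State)
  when not is_map_key(StreamId, Streams) ->
    %% The stream is already gone (or never existed); reply ok so the
    %% command is idempotent and the caller does not time out. The
    %% caller, rabbit_stream_coordinator:delete/2, then continues on
    %% to delete the queue record from the metadata store.
    return(Meta, State, ok, []).
```

With a clause like this, replaying a delete_stream command after a partial failure would succeed instead of timing out, which also unsticks streams that ended up in the state described above.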