Revert change (#90674) #103865

benwtrent · 2024-01-03T14:00:32Z

Reverts #90674

The revert is not perfectly clean as there are some minor adjustments to account for later changes.

This is in contrast with: #103858

closes: #103011

elasticsearchmachine · 2024-01-03T14:00:56Z

Hi @benwtrent, I've created a changelog YAML for you.

elasticsearchmachine · 2024-01-03T14:00:56Z

Pinging @elastic/es-search (Team:Search)

javanna

I went through this and compared line by line with the PR that it reverts. It looks good to me. I think it is wise to revert at this point, and reconsider what we want to do next. You mentioned it did not revert cleanly: did the problems only come from the float handling in postProcessDynamicArrayMapping for dynamic mapping of vector fields?

Thanks for jumping on this!

benwtrent · 2024-01-04T12:24:12Z

> did the problems only come from the float handling in postProcessDynamicArrayMapping for dynamic mapping of vector fields?

Correct; the logic there had changed because of a separate bug fix.

benwtrent · 2024-01-04T12:24:36Z

@elasticmachine update branch

benwtrent · 2024-01-04T13:35:10Z

@elasticmachine update branch

javanna · 2024-01-04T13:37:07Z

server/src/main/java/org/elasticsearch/index/mapper/DocumentParserContext.java

 new HashSet<>(),
- new LinkedHashMap<>(),
+ new HashMap<>(),


++ it makes sense to go back to the original hashmap, which we had before.
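
For readers unfamiliar with the difference between the two map types in the diff above, here is a minimal, self-contained sketch. It is not Elasticsearch code: the class `MapOrderSketch`, the method `iterationOrder`, and the field names are hypothetical and only illustrate that `LinkedHashMap` iterates in insertion order while `HashMap` makes no ordering guarantee.

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

public class MapOrderSketch {

    // Puts the keys into the given map, then returns the keys in the map's iteration order.
    public static List<String> iterationOrder(Map<String, Integer> map, List<String> keys) {
        for (int i = 0; i < keys.size(); i++) {
            map.put(keys.get(i), i);
        }
        return new ArrayList<>(map.keySet());
    }

    public static void main(String[] args) {
        // Hypothetical field names, chosen only for illustration.
        List<String> fields = List.of("zeta", "alpha", "mid.field", "beta");

        // LinkedHashMap: iteration order is the insertion order.
        System.out.println("LinkedHashMap: " + iterationOrder(new LinkedHashMap<>(), fields));

        // HashMap: same entries, but the iteration order is unspecified.
        System.out.println("HashMap:       " + iterationOrder(new HashMap<>(), fields));
    }
}
```

Code that relies on a particular iteration order needs the `LinkedHashMap`; once nothing depends on that order, a plain `HashMap` is sufficient.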

benwtrent · 2024-01-04T13:55:35Z

run elasticsearch-ci/part-2

benwtrent · 2024-01-04T15:14:54Z

run elasticsearch-ci/part-1
run elasticsearch-ci/part-3
run elasticsearch-ci/part-4
run elasticsearch-ci/bwc-snapshots
run elasticsearch-ci/packaging-tests-unix-sample
run elasticsearch-ci/packaging-tests-windows-sample

felixbarny · 2024-01-04T15:31:59Z

FWIW, I'm also happy about this change, as it will make #102936 and #96235 much easier: we no longer need separate mapperSize implementations in both Mapper and Mapper.Builder that have to be kept in sync.

Before we re-introduce this change (in case we want to do that at some point), we should really merge the two PRs mentioned above first.

benwtrent · 2024-01-04T16:19:42Z

@felixbarny

> Before we re-introduce this change (in case we want to do that at some point), we should really merge the two PRs mentioned above first.

By "re-introduce", do you mean adding back the "keep builders" change that this PR reverts?

benwtrent · 2024-01-04T16:28:20Z

@elasticmachine update branch

felixbarny · 2024-01-04T16:28:29Z

> By "re-introduce", do you mean adding back the "keep builders" change that this PR reverts?

Yes, that's what I meant. Aka revert the revert.

So before we consider re-introducing "Store dynamic mapping updates as builders", we should first merge #102936 and #96235. We can then weigh whether the additional complexity of maintaining a mapperSize implementation in both Mapper and Mapper.Builder is worth the optimization.

elasticsearchmachine · 2024-01-04T17:21:33Z

💚 Backport successful

| Status | Branch | Result |
|--------|--------|--------|
| ✅ | 8.12 | |
| ✅ | 8.11 | |

Because of elastic#103865, DocumentParserContext#addDynamicMapper once again receives a Mapper rather than a Mapper.Builder. Therefore, we don't need a mapperSize method for the builder, which simplifies things a lot.

@javanna

Today, we're counting all mappers, including mappers for subfields that aren't explicitly added to the mapping, towards the field limit. This means that some field types, such as `search_as_you_type` or `percolator`, count as more than one field, even though that's not apparent to users because they define them as a single field in the mapping. This change makes it so that each field mapper only counts as one. We're still counting multi-fields. This makes it easier for users to understand why the field limit is hit.

~In addition to that, it also simplifies #96235 as it makes the implementation of `Mapper.Builder#getTotalFieldsCount` much simpler and easier to align with `Mapper#getTotalFieldsCount`. This reduces the risk of over- or under-estimating the field count of a `Mapper.Builder` in `DocumentParserContext#addDynamicMapper`, which in turn reduces the risk of data loss due to the issue described here: #96235 (comment)~

*Edit: due to #103865, we don't need an implementation of `getTotalFieldsCount` or `mapperSize` in `Mapper.Builder`. Still, this PR more closely aligns `Mapper#getTotalFieldsCount` with `MappingLookup#getTotalFieldsCount`, which `DocumentParserContext#addDynamicMapper` uses to determine whether the field limit is hit.*

A potential risk of this is that we're now effectively allowing more fields in the mapping. It may be surprising to users that more fields can be added to a mapping, although I wouldn't expect negative consequences from that. Generally, I'd expect users to be happy about any change that reduces the risk of data loss.

We could also think about whether to apply the new counting logic only to new indices (depending on the `IndexVersion`). However, that would add more complexity and I'm not convinced about the value. We'd then need to maintain two different ways of counting fields and also pass the `IndexVersion` to `MappingLookup`, which previously didn't require it.

This PR is meant as a conversation starter. It would also simplify #96235 but I don't think this blocks that PR in any way. I'm curious about the opinion of @javanna and @jpountz on this.
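
To make the two counting strategies described above concrete, here is a minimal sketch under stated assumptions. It is not the Elasticsearch implementation: `FieldSketch`, `countAllMappers`, `countUserVisibleFields`, and `exampleField` are hypothetical names, and the sub-field names are only illustrative of a field type that creates internal sub-fields, with one user-declared multi-field added.

```java
import java.util.List;

public class FieldCountSketch {

    /** Hypothetical stand-in for a field mapper; this is not the Elasticsearch Mapper class. */
    public static final class FieldSketch {
        final String name;
        final List<FieldSketch> multiFields;       // declared explicitly by the user
        final List<FieldSketch> internalSubFields; // created implicitly by the field type

        public FieldSketch(String name, List<FieldSketch> multiFields, List<FieldSketch> internalSubFields) {
            this.name = name;
            this.multiFields = multiFields;
            this.internalSubFields = internalSubFields;
        }

        static FieldSketch leaf(String name) {
            return new FieldSketch(name, List.of(), List.of());
        }
    }

    // Counts every mapper, including internal sub-fields the user never declared.
    public static int countAllMappers(FieldSketch field) {
        int total = 1;
        for (FieldSketch multiField : field.multiFields) {
            total += countAllMappers(multiField);
        }
        for (FieldSketch subField : field.internalSubFields) {
            total += countAllMappers(subField);
        }
        return total;
    }

    // Counts the field itself as one, plus its multi-fields; internal sub-fields are ignored.
    public static int countUserVisibleFields(FieldSketch field) {
        int total = 1;
        for (FieldSketch multiField : field.multiFields) {
            total += countUserVisibleFields(multiField);
        }
        return total;
    }

    // Illustrative field: one multi-field and three internal sub-fields (names are assumptions).
    public static FieldSketch exampleField() {
        return new FieldSketch(
            "title",
            List.of(FieldSketch.leaf("title.raw")),
            List.of(
                FieldSketch.leaf("title._2gram"),
                FieldSketch.leaf("title._3gram"),
                FieldSketch.leaf("title._index_prefix")
            )
        );
    }

    public static void main(String[] args) {
        FieldSketch title = exampleField();
        System.out.println("all mappers:  " + countAllMappers(title));
        System.out.println("user-visible: " + countUserVisibleFields(title));
    }
}
```

In this sketch the second count stays the same no matter how many internal sub-fields a field type creates, which is the predictability argument made in the description.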

Revert change (elastic#90674)

4377ccd

benwtrent added the >bug, :Search Foundations/Mapping (Index mappings, including merging and defining field types), v8.12.1, v8.13.0, and v8.11.4 labels Jan 3, 2024

benwtrent requested review from javanna and original-brownbear January 3, 2024 14:00

elasticsearchmachine added the Team:Search (Meta label for search team) label Jan 3, 2024

benwtrent and others added 3 commits January 3, 2024 09:00

Update docs/changelog/103865.yaml

bb36ebe

fixing format

5069349

Merge branch 'feature/revert-90674' of github.com:benwtrent/elasticsearch into feature/revert-90674

1f64a74

javanna approved these changes Jan 4, 2024

benwtrent added the auto-merge (Automatically merge pull request when CI checks pass; NB doesn't wait for reviews!) and auto-backport-and-merge (Automatically create backport pull requests and merge when ready) labels Jan 4, 2024

Merge branch 'main' into feature/revert-90674

9fa38d4

benwtrent removed the auto-merge label Jan 4, 2024

Revert "Ensure dynamicMapping updates are handled in insertion order (elastic#103047)"

This reverts commit d6e8217.

0e47f0e

benwtrent added the auto-merge label Jan 4, 2024

Merge branch 'main' into feature/revert-90674

18e7ea2

javanna reviewed Jan 4, 2024

Merge branch 'main' into feature/revert-90674

521f51d

elasticsearchmachine merged commit a74ae22 into elastic:main Jan 4, 2024
15 checks passed

benwtrent deleted the feature/revert-90674 branch January 4, 2024 17:19

benwtrent mentioned this pull request Jan 4, 2024

[8.12] Revert change (#90674) (#103865) #103932

Merged

benwtrent mentioned this pull request Jan 4, 2024

[8.11] Revert change (#90674) (#103865) #103933

Merged

This was referenced Jan 5, 2024

Add setting to ignore dynamic fields when field limit is reached #96235

Merged

Make field limit more predictable #102885

Merged
