Author: Mason Freed
Last updated: December 1, 2021
Issue: WICG/webcomponents#79
This is an updated and revised version of the original ComposedSelection API Explainer by yoichio@google.com. It has been updated to include some changes agreed to at the 2018 TPAC meeting, fix some things that have changed since 2018, and generally flesh out the rest of the API.
Shadow DOM (v1) is now supported by all evergreen rendering engines. However, the user selection APIs are not well supported for the case that the user selection crosses shadow boundaries. The existing Selection API specifies that there is only a single selection in a document and the selection is bound to a single Range, which means that it cannot represent a range over the composed tree. Said another way, if the selection crosses shadow boundaries, the existing API cannot represent this situation correctly. For backwards compatibility, we also can't update the Range API to support ranges that cross shadow boundaries, because that could break existing assumptions about ranges.
This proposal suggests a new API that lets web authors control and query selections in the composed tree.
Define an inline style component:
foo<span id=host></span>bar
<script>
let root = host.attachShadow({mode: 'open'});
root.innerHTML = `<b>baz</b>`;
</script>
This results in a rendered page similar to this:
And the user might select the visible text this way:
However, in this case, the web author can’t get the exact selection range:
<script>
document.addEventListener("selectionchange", function() {
window.getSelection(); // ??
});
</script>
The call to getSelection()
in this case behaves differently on different rendering engines. In Chromium, it returns only the content from the DOM tree containing the selection anchor. So for the selection that starts between “f” and “o” and ends between “b” and “a”, it will return just “oo”. When selected in the other direction, it just returns “b”. Safari is similar, but will sometimes include the shadow content, if the selection endpoints are in different shadow trees. Gecko will simply refuse to allow content to be highlighted as shown above - the user is restricted to selecting only content from one tree or the other, but not across both.
The following several examples contemplate an editor component, and its need for knowledge of the selection information:
<script>
customElements.define('x-editor', class extends HTMLElement {
connectedCallback() {
this.innerHTML = `
<button>Turn text red</button>
<div contenteditable="true">initial text - highlight me</div>`;
this.querySelector('button').onclick = () => {
let selection = window.getSelection();
if (selection.rangeCount == 0) return;
let range = selection.getRangeAt(0);
range.startContainer.parentElement.style.color = 'red';
};
}
});
</script>
<x-editor></x-editor>
This creates a simple editor element with a button that can change the style of the selection:
Clicking the button changes the color of the highlighted text's parent element to red:
However, we can’t convert that component to use Shadow DOM:
<script>
customElements.define('x-editor2', class extends HTMLElement {
constructor() {
super();
let shadow = this.attachShadow({mode:"open"}); // Create shadow root.
shadow.innerHTML = `
<button>Turn text red</button>
<div contenteditable="true">initial text - highlight me</div>`;
shadow.querySelector('button').onclick = () => {
let selection = window.getSelection(); // ! no shadow info
//let selection = shadow.getSelection(); // ! not in Gecko or WebKit
if (selection.rangeCount == 0) return;
let range = selection.getRangeAt(0);
range.startContainer.parentElement.style.color = 'red';
};
}
});
</script>
<x-editor2></x-editor2>
Here, window.getSelection()
doesn’t work properly/consistently. In some rendering engines (e.g. Chromium and WebKit) the call to getRangeAt(0)
will return a range that points to the <body>
element, with node offsets that both point to the <x-editor2>
element. In this case, our example component therefore changes the entire page to red. In others (e.g. Gecko), it will (magically) return elements within the shadow root, including for closed shadow roots. Additionally, in Chromium (only), there is a non-standard implementation of shadowRoot.getSelection()
which will return selection information within the shadow root, similar to what Gecko returns from window.getSelection()
in this case. In no case is the selection/range information correct when the selection spans two shadow trees.
The same problems happen when the (non-Shadow DOM-based) <x-editor>
component is nested inside another (Shadow DOM) component:
<script>
customElements.define('parent-component', class extends HTMLElement {
constructor() {
super();
let shadow = this.attachShadow({mode:"open"});
shadow.innerHTML = '<x-editor></x-editor>';
}
});
</script>
<parent-component></parent-component>
In this case, the <x-editor>
selection information is no longer available, because the entire component is nested inside a Shadow Root. This makes an otherwise-working component stop working based on its location in the composed tree, which is not good.
Because the existing Selection
APIs don't comprehend shadow trees, they also do not properly comprehend slotted content. For example:
<div id=host>Second</div>
<script>
host.attachShadow({mode:"open"}).innerHTML = 'First <slot></slot> Third';
</script>
In this example, "Second" is slotted into the <slot>
between "First" and "Third". So if the user highlights part of the results (if that is even possible in the browser), the rendered text looks like this:
Clearly, in this case, the user sees the highlight covering part of "Second" and part of "Third". However, the results of getRangeAt()
vary wildly, as does the behavior of each browser while trying to highlight this particular portion of the text. Note that in this case, if the user selected from "Second" to "Third", then the anchorNode
would be in the light DOM on the Text Node for "Second", while the focusNode
would be in the shadow DOM on "Third". This is in some sense "reversed" from the perceived DOM tree order:
This selection might even skip over other nodes that are in the light dom (e.g. siblings of "Second") that are not slotted into the shadow tree, and therefore invisible to the user.
This is a two-part proposal.
partial interface Selection {
StaticRange getComposedRange(optional GetComposedRangeOptions options = {});
readonly attribute DOMString direction; // "forwards", "backwards", or "directionless"
};
dictionary GetComposedRangeOptions {
Node selectionRoot;
sequence<ShadowRoot> shadowRoots;
};
This adds a new API, getComposedRange()
, which can return a StaticRange
with endpoints in different shadow trees. Because that will necessarily expose Node
s inside shadow trees, including potentially closed
shadow trees, an optional shadowRoots
parameter enables the method to return Node
s within the provided list of shadow roots, if necessary. If a selection endpoint is within a non-provided shadow root, the returned selection will be "re-scoped" as if the entire host element for that shadow root was selected.
Additionally, the direction
attribute is exposed as a DOMString
attribute on Selection
. This attribute already exists in the spec for Selection
, but it isn't currently exposed to WebIDL. Exposing this attribute is important, because the StaticRange
returned by getComposedRange()
always has startContainer
as the "left side" of the selection, and endContainer
as the "right side" node. Therefore, when using this API, there's no way to tell which direction the selection was actually created. The (non-shadow-DOM aware) Selection.anchorNode
and Selection.focusNode
previously allowed this directionality to be inferred, but in this proposal, those APIs will remain shadow-DOM unaware. By exposing Selection.direction
directly, the directionality can be extracted.
In addition to this new API for retrieving a composed (cross-tree) selection, some of the existing APIs within the Selection
object will be updated to support selections that cross shadow boundaries. For example, the setBaseAndExtent()
function will be updated to support anchorNode
and focusNode
living in different shadow trees. This section of the explainer goes into detail for each API.
But first, let’s see how the new getComposedRange()
API can be used with the problem examples above:
With the new API, the exact selection range can be accessed:
<script>
document.addEventListener("selectionchange", function() {
let selection = window.getSelection();
let composedRange = selection.getComposedRange({shadowRoots: [root]});
console.log(composedRange); // {"foo", 1, "baz", 1}
});
</script>
You can also programmatically set the same type of cross-tree selection:
<script>
window.getSelection().setBaseAndExtent(foo, 1, baz, 1);
</script>
With a simple code change, the existing Shadow DOM based example <x-editor2>
element "just works":
<script>
customElements.define('x-editor2', class extends HTMLElement {
...same as before...
shadow.querySelector('button').onclick = () => {
let selection = window.getSelection();
let range = selection.getComposedRange({shadowRoots: [this]}); // NEW API CALL HERE
range.startContainer.parentElement.style.color = 'red';
};
}
});
</script>
<x-editor2></x-editor2>
Here, the getComposedRange()
call returns the correct range, even when the selection falls within its shadow root.
If, on the other hand, the editor wanted to be sure to only change nodes within "its" tree, each endpoint of the range would need to be checked for tree location:
<script>
customElements.define('x-editor2', class extends HTMLElement {
...same as before...
shadow.querySelector('button').onclick = () => {
...same as before...
if (!shadow.contains(range.startContainer) || !shadow.contains(range.endContainer)) {
// Handle this case differently
} else {
range.startContainer.parentElement.style.color = 'red';
}
};
}
});
</script>
<x-editor2></x-editor2>
In the non-Shadow-DOM <x-editor>
case, where an <x-editor>
is placed within the shadow root of a <parent-component>
element, a bit more work must be done:
<script>
// Walk up the tree to find the first shadow root:
function getParentShadowRoot(el) {
if (el instanceof ShadowRoot || el === null)
return el;
return getParentShadowRoot(el.parentNode);
}
customElements.define('x-editor', class extends HTMLElement {
...same as before...
connectedCallback() {
this.querySelector('button').onclick = () => {
let selection = window.getSelection();
let parentRoot = getParentShadowRoot(this); // Get the parent shadow root
// Pass it to the new API:
let range = selection.getComposedRange({shadowRoots: [parentRoot]});
range.startContainer.parentElement.style.color = 'red';
};
}
});
customElements.define('parent-component', class extends HTMLElement {
...same as before...
});
</script>
<parent-component></parent-component>
In this case, since our <x-editor>
component is located within a parent shadow root, we must provide "knowledge" of that shadow root. So we walk up the tree (using getParentShadowRoot()
) to find the first containing shadow root, or null if there isn't one. This feels a bit odd, since <x-editor>
doesn't use Shadow DOM, but it "proves" that <x-editor>
knows how to deal with shadow selections.
In the case of slotted content, the getComposedRange()
API will return nodes in different shadow trees, and a flat tree traversal will be understood to be the correct way to walk the selected nodes from anchorNode
to focusNode
(if needed). For example, using a modified version of our previous slotted text example:
<div id=host>Second</div>
<script>
const shadow = host.attachShadow({mode:"open"});
shadow.innerHTML = 'First <slot></slot> Third';
const third = shadow.querySelector('slot').nextSibling;
let selection = window.getSelection();
// This will select from "Second" to "Third":
selection.setBaseAndExtent(host, 3, third, 4);
selection.getComposedRange(); // {host, 3, third, 4}
</script>
So the new getComposedRange()
API works correctly here, too, so long as the application understands that these nodes may appear out of order or appear to include extra nodes, unless the range is traversed using a flat tree traversal.
If the shadowRoots
parameter of getComposedRange()
is not used, or if the provided value does not contain a given shadow root, then selections that cross such shadow roots will be "re-scoped" to enclose the corresponding shadow host element. That is to say, the returned StaticRange
will be constructed as if the entire shadow host element is selected; neither the starting nor ending nodes of the StaticRange
will reside within that shadow root.
To avoid this behavior, the shadowRoots
parameter can be used to allow getComposedRange()
to return Node
s within the provided roots. In the case of multiple nested shadowRoot
s, only the lowest level shadowRoot
is required in order to "reveal" higher level roots.
To see the above in action, suppose we have this DOM tree:
For this tree:
// Set the selection:
let selection = window.getSelection();
selection.setBaseAndExtent(C, 1, J, 1); // Select from between-D-and-D through between-K-and-L
// Passing no arguments, no shadow roots are included:
selection.getComposedRange(); // [A, 0, A, 2] - nodes B and F are shadow hosts
// Now, include shadowRoots:
selection.getComposedRange({shadowRoots: [C]}); // [C, 1, A, 2] - no knowledge of F or I
selection.getComposedRange({shadowRoots: [C,H,J]}); // [C, 1, J, 1] === original range, all roots provided
selection.getComposedRange({shadowRoots: [C,J]}); // [C, 1, J, 1] === original range, H not needed
// Since H only contains I, which is not a <slot>, G does not get slotted/rendered:
selection.containsNode(G); // false (see example #3 above)
Additionally, to simplify selection management for components that only wish to handle selections within a particular sub-tree, the selectionRoot
parameter can be used:
let rescopedSelection = selection.getComposedRange({selectionRoot: node});
In this case, the returned StaticRange
will only contain endpoints that are inclusive descendants of node
. If the selection extends "above" node
in the tree on either side, those range endpoints will be re-scoped to appear as if the selection started and/or ended just within node
. For example:
// Set the selection:
let selection = window.getSelection();
selection.setBaseAndExtent(A, 0, E, 1); // Select from before-B to after-F
// Passing no arguments:
selection.getComposedRange(); // [A, 0, A, 2] - re-scope around shadow host C
// By providing a selectionRoot of E:
// 1. the startContainer is re-scoped from A0 to E0 (first position within E).
// 2. the containing shadow root is added to shadowRoots, so endContainer is *not* rescoped.
selection.getComposedRange({selectionRoot: E}); // [E, 0, E, 1]
// To use a "higher" selectionRoot, we'll have to also provide the shadow root:
selection.getComposedRange({selectionRoot: A, shadowRoots: [D]}); // [A, 0, E, 1] === original
This proposal adds only one new API, getComposedRange()
. But it also modifies several existing Selection APIs to comprehend the composed tree. In general, the model for this proposal is to change the Selection API algorithms so that internally, the range is stored as an internal Range
-type structure with endpoints that can span all (including closed
) shadow trees. We will call this the true range. And then the existing and new APIs discussed here "re-scope" that true range to match platform expectations. For example, for backwards-compatibility, Selection.getRangeAt()
should still return a live Range
that is scoped to a single tree. And Selection.getComposedRange()
should return a StaticRange
that "re-scopes" over (non-provided) shadow roots.
Here is a list of the existing Selection APIs that would be modified by this proposal:
-
Modify behavior to maintain backwards-compatibility:
Selection.getRangeAt()
- This function will be changed to return a single-treeRange
object, by re-scoping over any nodes in the range that are in different shadow trees. In other words, this will return a liveRange
that is equivalent to theStaticRange
that would be returned by a call togetComposedRange()
with no arguments provided.Selection.anchorNode/anchorOffset/focusNode/focusOffset
- because these currently always returnNode
s in the same shadow tree, they need to maintain that behavior. They should be re-defined to return the endpoints/offsets of the range returned bySelection.getRangeAt(0)
.
-
Update behavior to "just work" for cross-tree selections:
Selection.setBaseAndExtent()
- this will now acceptanchorNode
andfocusNode
arguments that reside in different shadow trees, which will set the selection accordingly.Selection.collapse()
- can now accept a node in any tree, and will collapse the range to thatNode
.Selection.collapseToEnd()/collapseToStart()
- These will be updated to work as expected for selections that include cross-tree endpoints.deleteFromDocument()
- This should work as-expected, even if range endpoints are in different trees.extend()
- This should work as-expected, even if the newfocusNode
is in a different shadow tree.- User selection via mouse, keyboard, etc. - The selection generated by this activity should be allowed to start and end anywhere in the document, including inside any
open
orclosed
shadow trees. The behavior should be equivalent to calls tosetBaseAndExtent()
, modulo canonicalization behavior.
-
Perhaps more nuanced:
Selection.containsNode()
- This must be modified to work correctly for slotted content, in addition to general cross-tree selections. For example, if an un-slotted child of aNode
in the range is queried, this method should returnfalse
, even though that child is reachable in a tree-of-trees walk fromanchorNode
tofocusNode
. Since it is not slotted, the user does not see it, so it should not be considered to be part of the selection. Since the previous behavior was unaware of shadow trees entirely, this does not seem like much of a compat risk.Selection.modify()
- Given the lack of a spec for this feature, I propose we not change its behavior.
-
No change needed/desired:
Selection.removeRange()/removeAllRanges()
- Should continue to work as-is.Selection.selectAllChildren()
- Should already result in a single-treeRange
- no changes needed.Selection.toString()
- No change needed.Selection.addRange()
- Since this method is fairly legacy and only supports a single range (by spec and in 2/3 engines), we should leave this API as-is, and not update it to support cross-tree ranges.
In the section above, the getRangeAt()
function is explained as a re-scoping of the full selection, avoiding any contained shadow trees. The question arises: what happens if that live range is mutated via the various APIs available on Range
:
- Range.setStart()
- Range.setEnd()
- Range.setStartBefore()
- Range.setStartAfter()
- Range.setEndBefore()
- Range.setEndAfter()
- Range.selectNode()
- Range.selectNodeContents()
- Range.collapse()
In each of these cases, the effect should be the same as if setBaseAndExtend()
were called, with the provided start or end being the one provided, and the other endpoint of the range being the current value of the true range. In other words, for the Range
APIs that only modify one endpoint of the range, the other endpoint of the actual, true range should not be affected/changed. For example, referring back to Example Tree 1:
let selection = window.getSelection();
selection.setBaseAndExtent(A, 0, J, 1);
// The true selection is now A0 (light DOM) to J1 (within nested shadow trees).
let range = selection.getRangeAt(0); // [A, 0, A, 2] - re-scoped around F
range.setStart(A, 1); // Move the start to A1
range; // [A, 1, A, 2] - start node/offset updated, end node/offset left as-is.
selection.getComposedRange({shadowRoots: [J]}); // [A, 1, J, 1] - end node/offset left as-is.
range.setEnd(A, 2); // Move the end to A2
range; // [A, 1, A, 2] - no apparent change
selection.getComposedRange({shadowRoots: [J]}); // [A, 1, A, 2] - but end node has been updated.
A more complicated situation arises from the requirement that DOM tree modifications themselves can also mutate live Range
objects. For example, node removal can change the endpoints of a live range, if an endpoint of the range is close to the node being removed. The proposal here is to follow the existing logic for tree mutations, but apply the range modifications directly to the true range instead of to the Range
object, which has re-scoped endpoints. In other words, if there's a live Range
that has been re-scoped around some shadow trees, and a tree modification within one of the shadow trees takes place, the true range will be mutated accordingly, and the live Range
will be updated to be equal to the rescoped-equivalent of the new true range.
For example, referring back to Example Tree 1:
let selection = window.getSelection();
selection.setBaseAndExtent(A, 0, J, 1);
// The true selection is now A0 (light DOM) to J1 (within nested shadow trees).
let range = selection.getRangeAt(0); // [A, 0, A, 2] - re-scoped around F
// Remove node K from the tree:
K.remove();
range; // [A, 0, A, 2] - no visible change
selection.getComposedRange({shadowRoots: [J]}); // [A, 0, J, 0] - updated end point
When the selection is set by the user clicking and dragging the mouse across some text, the browser "canonicalizes" the selection endpoints. This means they pick from one of many possible node/offset values for the given visible caret position. For example:
<div>|<span>|<p>|Test</p></span></div>
If the user highlights the word "Test", the start point of the selection could be represented equally well by any of the positions marked with '|'. That means the startContainer for the range could be any of <div>
, <span>
, or <p>
. All are equivalent from the user's point of view.
Given that canonicalization happens when the selection is being modified, the same logic from the Changes to existing selection APIs section can be used. Essentially, the same canonicalization can still be used, so long as the true selection endpoints are allowed to live in any shadow tree.
There is a (hopefully small) web compatibility risk from making the proposed changes. Some of the changes proposed above do change the behavior of these APIs in the case where cross-tree nodes are currently/already being used. The behavior in these cases will likely change if this proposal is adopted. However, given that using cross-tree nodes in Selection APIs is not currently specified, and doesn't work in any browser at the moment, hopefully the prevalence of this in the wild is small. The primary Selection API use cases (e.g. calling getRangeAt(0)
) should not be adversely affected by these changes.
-
Here is a 2018 Editor library compatibility investigation and related editor issues:
- Quill: quilljs/quill#2021
- Trix: basecamp/trix#403
- CKEditor: ckeditor/ckeditor5-engine#692
- Prosemirror: ProseMirror/prosemirror#476
- Squire: neilj/Squire#327
-
There is also this "polyfill", which doesn't actually polyfill any proposed spec, but instead makes
shadowRoot.getSelection()
work on non-Chromium browsers: