Skip to content
Merged
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
193 changes: 193 additions & 0 deletions docs/decisions/0014-single-taxonomy-view-api.rst
Original file line number Diff line number Diff line change
@@ -0,0 +1,193 @@
14. Single taxonomy view API
=====================================

Context
--------

This view returns tags of a closed taxonomy (for MVP has not been implemented yet
for open taxonomies). It is necessary to make a decision about what structure the tags are going
to have, how the pagination is going to work and how will the search for tags be implemented.
It was taken into account that taxonomies commonly have the following characteristics:

- It has few root tags.
- It may a very large number of children for each tag.
- It is mostly represented as trees on frontend, with a depth of up to 3 levels.

For the decisions, the following use cases were taken into account:

- As a taxonomy administrator, I want to see all the tags available for use with a closed taxonomy,
so that I can see the taxonomy's structure in the management interface.
- As a taxonomy administrator, I want to see the available tags as a lits of root tags
that can be expanded to show children tags.
- As a taxonomy administrator, I want to sort the list of root tags alphabetically: A-Z (default) and Z-A.
- As a taxonomy administrator, I want to expand all root tags to see all children tags.
- As a taxonomy administrator, I want to search for tags, so I can find root and children tags more easily.
- As a course author, when I am editing the tags of a component, I want to see all the tags available
from a particular taxonomy that I can use.
- As a course author, I want to see the available tags as a lits of root tags
that can be expanded to show children tags.
- As a course author, I want to search for tags, so I can find root and children tags more easily.

Excluded use cases:

- As a content author, when searching/filtering a course/library, I want to see which tags are applied to the content
and use them to refine my search. - This is excluded from this API's use case because this is automatically handled
by elasticsearch/opensearch.


Decision
---------

Views & Pagination
~~~~~~~~~~~~~~~~~~~

Make one view:

**get_matching_tags(parent_tag_id: str = None, search_term: str = None)**

that can handle this cases:

- Get the root tags of the taxonomy. If ``parent_tag_id`` is ``None``.
- Get the children of a tag. Called each time the user expands a parent tag to see its children.
If ``parent_tag_id`` is not ``None``.

In both cases the results are paginated. In addition to the common pagination metadata, it is necessary to return:

- Total number of pages.
- Total number of root/children tags.
- Range index of current page, Ex. Page 1: 1-12, Page 2: 13-24.
- Total number of children of each tag.

The pagination of root tags and child tags are independent.
In order to be able to fulfill the functionality of "Expand-all" in a scalable way,
the following has been agreed:

- Create a ``TAGS_THRESHOLD`` (default: 1000).
- If ``taxonomy.tags.count < TAGS_THRESHOLD``, then ``get_matching_tags()`` will return all tags on the taxonomy,
roots and children.
- Otherwise, ``get_matching_tags()`` will only return paginated root tags, and it will be necessary
to use ``get_matching_tags()`` to return paginated children. Also the "Expand-all" functionality will be disabled.

For search you can see the next section (Search tags)

**Pros**

- It is the simplest way.
- Paging both root tags and children mitigates the huge number of tags that can be found in large taxonomies.

Search tags
~~~~~~~~~~~~

Support tag search on the backend. Return a subset of matching tags.
We will use the same view to perform a search with the same logic:

**get_matching_tags(parent_tag_id: str = None, search_term: str = None)**

We can use ``search_term`` to perferom a search on root tags or children tags depending of ``parent_tag_id``.

For the search, ``SEARCH_TAGS_THRESHOLD`` will be used. (It is recommended that it be 20% of ``TAGS_THRESHOLD``).
It will work in the same way of ``TAGS_THRESHOLD`` (see Views & Pagination)

**Pros**

- It is the most scalable way.

Tag representation
~~~~~~~~~~~~~~~~~~~

Return a list of root tags and within a link to obtain the children tags
or the complete list of children tags depending of ``TAGS_THRESHOLD`` or ``SEARCH_TAGS_THRESHOLD``.
The list of root tags will be ordered alphabetically. If it has child tags, they must also
be ordered alphabetically.

**(taxonomy.tags.count < *_THRESHOLD)**::

{
"count": 100,
"tags": [
{
"id": "tag_1",
"value": "Tag 1",
"taxonomy_id": "1",
"sub_tags": [
{
"id": "tag_2",
"value": "Tag 2",
"taxonomy_id": "1",
"sub_tags": [
(....)
]
},
(....)
]
}


**Otherwise**::

{
"count": 100,
"tags": [
{
"id": "tag_1",
"value": "Tag 1",
"taxonomy_id": "1",
"sub_tags_link": "http//api-call-to-get-children.com"
},
(....)
]
}


**Pros:**

- The edX's interfaces show the tags in the form of a tree.
- The frontend needs no further processing as it is in a displayable format.
- It is kept as a simple implementation.


Rejected Options
-----------------


Render as a simple list of tags
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

Return a simple list of tags, regardless of whether it is root or leaf.

**Pros:**

- It is simple and does not need further implementation and processing in the API.

**Cons:**

- It is more work to re-process all that list in the frontend to know who it is whose father.
- In no edX's interface is it used this way and it would be a very specific use case.
- Pagination would be more complicated to perform.


Add the children to the root pagination
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

Ex. If the ``page_size`` is 100, when fetching the first root tag, which has 10 children tags,
11 tags are counted for the total and there would be reamin 89 tags to be obtained.

**Cons:**

- If there is a branch with a number of tags that exceeds ``page_size``,
it would only return that branch.
- All branches are variable in size, therefore a variable number of root tags
would be returned. This would cause interfaces between taxonomies to be inconsistent
in the number of root tags shown.


Search on frontend
~~~~~~~~~~~~~~~~~~

We constrain the number of tags allowed in a taxonomy for MVP, so that the API
can return all the tags in one page. So we can perform the tag search on the frontend.

**Cons:**

- It is not scalable.
- Sets limits of tags that can be created in the taxonomy.