Add atlassian.py for Jira Data Center DLS #2161

moxarth-rathod · 2024-02-15T09:08:39Z

Part Of #1957

Related PR #2108

This PR contains the changes to the atlassian.py file for Jira Data Center DLS.

Checklists

Pre-Review Checklist

this PR has a meaningful title
this PR links to all relevant github issues that it fixes or partially addresses
if there is no GH issue, please create it. Each PR should have a link to an issue
this PR has a thorough description
Covered the changes with automated tests
Tested the changes locally
Added a label for each target release version (example: v7.13.2, v7.14.0, v8.0.0)

navarone-feekery

Nice, left some comments about importing / keeping it DRY.

connectors/sources/atlassian.py

navarone-feekery · 2024-02-15T11:26:51Z

connectors/sources/atlassian.py

@@ -17,6 +17,11 @@
 RETRIES = 3
 RETRY_INTERVAL = 2
 USER_BATCH = 50
+DATA_CENTER_USER_BATCH = 1000


What does DATA_CENTER_USER_BATCH specify exactly? Is this the max amount of users we can fetch in one request? Is it the same value as in jira.py?

connectors/connectors/sources/jira.py

Line 43 in 1d4fb78

MAX_USER_FETCH_LIMIT = 1000

If they're the same value, I suggest having it defined in this file (atlassian.py) and importing it into any sources that use it.

I'm also confused about DATA_CENTER_USER_BATCH, specifically because below it seems to be not for DataCenter, but for anything non-cloud:

f"{url}?startAt={start_at}" if self.source.configuration["data_source"] in [JIRA_CLOUD, CONFLUENCE_CLOUD] else url.format(start_at=start_at, max_results=DATA_CENTER_USER_BATCH)

If they're the same value, I suggest having it defined in this file (atlassian.py) and importing it into any sources that use it.

The value is definitely the same, but the purpose is different in

connectors/connectors/sources/jira.py

Line 43 in 1d4fb78

MAX_USER_FETCH_LIMIT = 1000

There it is used for both cloud and non-cloud data sources, whereas in atlassian.py the value is used only for fetching non-cloud users.

I'm also confused about DATA_CENTER_USER_BATCH, specifically because below it seems to be not for DataCenter, but for anything non-cloud:

I'm renaming the constant to make its purpose clearer.
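
The conditional quoted earlier in this thread can be sketched as a standalone helper. This is a minimal illustration under stated assumptions, not the connector's actual code: `build_users_url`, the example URLs, and the string values assigned to `JIRA_CLOUD`, `CONFLUENCE_CLOUD`, and `JIRA_DATA_CENTER` are hypothetical. Only the constant names and the shape of the branch come from the diff.

```python
# Hypothetical values for illustration; the real constants live in the connector sources.
JIRA_CLOUD = "jira_cloud"
CONFLUENCE_CLOUD = "confluence_cloud"
JIRA_DATA_CENTER = "jira_data_center"
DATA_CENTER_USER_BATCH = 1000  # page size used only for non-cloud user fetches


def build_users_url(url, data_source, start_at):
    """Return the paginated users URL for the given data source (hypothetical helper)."""
    if data_source in (JIRA_CLOUD, CONFLUENCE_CLOUD):
        # Cloud: the page offset is appended as a query parameter.
        return f"{url}?startAt={start_at}"
    # Non-cloud: the URL is a template with {start_at} and {max_results} placeholders.
    return url.format(start_at=start_at, max_results=DATA_CENTER_USER_BATCH)


# Example URLs are placeholders, not real Atlassian endpoints.
cloud_url = build_users_url("https://cloud.example/users", JIRA_CLOUD, 50)
dc_url = build_users_url(
    "https://dc.example/users?startAt={start_at}&maxResults={max_results}",
    JIRA_DATA_CENTER,
    0,
)
```

Written this way, the branch makes it visible that the batch constant applies to every data source that is not one of the two cloud values, which is the naming concern raised in the review.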

seanstory · 2024-02-15T14:23:26Z

connectors/sources/atlassian.py

@@ -135,7 +152,7 @@ async def user_access_control_doc(self, user):
                    ACCESS_CONTROL: [<prefixed_account_id>, <prefixed_group_ids>, <prefixed_role_keys>]
                }
        """
-        account_id = user.get("accountId")
+        account_id = user.get("accountId") or user.get("name")


Is user.get("name") guaranteed to be unique? Looks like account_id ends up being used as the access control document's _id.

Is this added because accountId is sometimes missing?

Is user.get("name") guaranteed to be unique?

Yes, it is unique for all users.

Is this added because accountId is sometimes missing?

Exactly, accountId is missing when fetching users for non-cloud data sources.
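
A short sketch of how the fallback in this diff behaves. The sample payloads are made up for illustration and `resolve_account_id` is a hypothetical helper; only the `user.get("accountId") or user.get("name")` expression comes from the change itself.

```python
def resolve_account_id(user):
    """Prefer accountId (present for cloud users), fall back to name (non-cloud users)."""
    return user.get("accountId") or user.get("name")


# Made-up payloads: a cloud user carries accountId, a non-cloud user only carries name.
cloud_user = {"accountId": "abc123", "displayName": "Alice"}
data_center_user = {"name": "alice", "displayName": "Alice"}

cloud_id = resolve_account_id(cloud_user)
data_center_id = resolve_account_id(data_center_user)
missing_id = resolve_account_id({})  # neither key present
```

Because `or` tests truthiness, an empty-string accountId also falls through to name, and a payload with neither key yields None, so a caller that uses the result as a document _id may want to guard against that case.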

seanstory · 2024-02-15T14:24:59Z

connectors/sources/atlassian.py

+        if (
+            self.source.configuration["data_source"] in [JIRA_CLOUD, CONFLUENCE_CLOUD]
+            and user_info.get("accountType") != "atlassian"
+        ):
            self.source._logger.debug(
                f"Skipping {user_name} because the account type is {user_info.get('accountType')}. Only 'atlassian' account type is supported."


Should this log message get updated too? Since now the account type could be atlassian, but the data source config is non-cloud.

This message is only logged for cloud data sources, according to this condition:

self.source.configuration["data_source"] in [JIRA_CLOUD, CONFLUENCE_CLOUD]

Previously, DLS was designed only for the cloud platform, so this condition was not needed. Now we've added it explicitly to exclude non-cloud data sources.
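
The skip condition discussed in this thread can be illustrated in isolation. This is a sketch under assumptions: `should_skip_user` is a hypothetical helper, and the string values for the data source constants are placeholders. The boolean expression mirrors the one in the diff.

```python
# Hypothetical values for illustration; the real constants live in the connector sources.
JIRA_CLOUD = "jira_cloud"
CONFLUENCE_CLOUD = "confluence_cloud"
JIRA_DATA_CENTER = "jira_data_center"


def should_skip_user(data_source, user_info):
    """Skip only cloud users whose account type is not 'atlassian'."""
    return (
        data_source in (JIRA_CLOUD, CONFLUENCE_CLOUD)
        and user_info.get("accountType") != "atlassian"
    )


skip_cloud_app = should_skip_user(JIRA_CLOUD, {"accountType": "app"})
skip_cloud_atlassian = should_skip_user(JIRA_CLOUD, {"accountType": "atlassian"})
# Non-cloud payloads are never skipped by this check, whatever accountType holds.
skip_data_center = should_skip_user(JIRA_DATA_CENTER, {"name": "alice"})
```

Since the debug message sits inside this branch, it can only be reached for cloud data sources, which matches the explanation given in the reply.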

navarone-feekery

LGTM, thanks 🚀

github-actions · 2024-02-20T05:30:05Z

💚 Backport PR(s) successfully created

Status	Branch	Result
✅	8.13	#2191

This backport PR will be merged automatically after passing CI.

Co-authored-by: moxarth-elastic <96762084+moxarth-elastic@users.noreply.github.com>

Add atlassian.py for Jira data center DLS

dd3d1a3

moxarth-rathod added jira team:external v8.13.0 labels Feb 15, 2024

moxarth-rathod requested a review from a team February 15, 2024 09:08

github-actions bot added auto-backport v8.14.0.0 labels Feb 15, 2024

navarone-feekery reviewed Feb 15, 2024


seanstory reviewed Feb 15, 2024


address comments

f6fc04b

moxarth-rathod requested review from seanstory and navarone-feekery February 16, 2024 06:16

navarone-feekery approved these changes Feb 19, 2024


Merge branch 'main' into atlassian-jira-dls-file

227e6e0

moxarth-rathod merged commit 062edf7 into main Feb 20, 2024
2 checks passed

moxarth-rathod deleted the atlassian-jira-dls-file branch February 20, 2024 05:29

github-actions bot pushed a commit that referenced this pull request Feb 20, 2024

Add atlassian.py for Jira Data Center DLS (#2161)

5ca66ee

github-actions bot mentioned this pull request Feb 20, 2024

[8.13] Add atlassian.py for Jira Data Center DLS (#2161) #2191

Merged

navarone-feekery pushed a commit that referenced this pull request Feb 21, 2024

[8.13] Add atlassian.py for Jira Data Center DLS (#2161) (#2191)

1ecd32a

Co-authored-by: moxarth-elastic <96762084+moxarth-elastic@users.noreply.github.com>
