Changed indexing structure and logic to use a user's program enrollments #811

gsidebo · 2016-08-05T02:51:36Z

What are the relevant tickets?

What's this PR do?

Changes our indexing to index one record per user per program. A user enrolled in two programs, for example, will now have two records in the index. The enrollment/certificate/etc data for each of those records would only be those that are associated with the appropriate program.

Where should the reviewer start?

Probably search/api.py. This might be one that you want to call me over to co-review at some point

How should this be manually tested?

Load the realistic users if you haven't already, the run the recreate_index manage command. Once you've done that, you should be able to query ES and see the new indexing format

gsidebo · 2016-08-05T02:52:39Z

search/commands_test.py

@@ -1,54 +0,0 @@
-"""


moved these test cases to search/api_test.py since it's testing search/api.py functionality

noisecapella · 2016-08-05T14:38:16Z

search/tasks.py

    Args:
-        users (iterable of User):
-            Iterable of Users
+        program_enrollments: Iterable of ProgramEnrollments


Can you add a type here for program_enrollments? It should go in parentheses after the name according to the sphinx-napoleon convention: http://edx.readthedocs.io/projects/edx-developer-guide/en/latest/style_guides/python_guidelines.html#docstrings

giocalitri · 2016-08-05T17:42:26Z

dashboard/factories.py

+        # the given User and Program. Instead of creating a new record with the factory, we
+        # will create the necessary objects to trigger its creation.
+        user = kwargs['user'] = kwargs.get('user', UserFactory.create())
+        program = kwargs['program'] = kwargs.get('program', ProgramFactory.create())


why do you assign a copy of the objects to kwargs?

part of an earlier refactor idea. i'll remove

giocalitri · 2016-08-05T18:33:03Z

I have some comments.

I also think all the refactoring looks good, but you could have postponed at least some parts of it.

gsidebo · 2016-08-05T19:21:55Z

i'm getting an error in searchkit. might be data-related. i'll look into it

noisecapella · 2016-08-05T19:37:02Z

Searchkit works fine for me

gsidebo · 2016-08-05T19:40:50Z

yep, i had bad data in the index

noisecapella · 2016-08-05T19:52:18Z

dashboard/signals.py

+    program = instance.course_run.course.program
+    program_enrollment = ProgramEnrollment.objects.filter(user=user, program=program).first()
+    if not program_enrollment:
+        index_users.delay([user])


Should we just get rid of index_users? It seems like if there is no ProgramEnrollment for that user, the index should not need to know about that user at all

keep in mind that this is looking for a ProgramEnrollment for a User and a Program. A User could have other ProgramEnrollments. your comment does point out an issue, though. i had this code in place to handle cases where the ProgramEnrollment had already been deleted by an earlier signal, but that in itself should trigger a reindexing. ill get rid of this and change up the enrollment signal as well

noisecapella · 2016-08-05T20:27:20Z

search/api_test.py


    def test_user_add(self):
        """
        Test that a newly created User is indexed properly


Can you update the docstrings for each of these tests?

gsidebo · 2016-08-05T21:47:19Z

search/api_test.py

+        with patch('search.api._index_program_enrolled_users_chunk', autospec=True, return_value=0) as index_chunk:
+            index_program_enrolled_users(program_enrollments, chunk_size=4)
+            assert index_chunk.call_count == 3
+            index_chunk.assert_any_call(program_enrollments[0:4])


@noisecapella @giocalitri i refactored this test case (test_index_program_enrolled_users, formerly test_index_users). this was taking 2.5 sec with calls to ES and to the database, and all it was uniquely testing was that the 'index_' function could handle an iterable. let me know if you have a problem with this refactor

noisecapella · 2016-08-08T14:50:21Z

search/api.py

+    """
+    program_enrollments = ProgramEnrollment.objects.filter(user=user).select_related('user', 'program').all()
+    for program_enrollment in program_enrollments:
+        remove_program_enrolled_user(program_enrollment)


From the code coverage report it looks like this line is not triggered by tests. Can you add or adjust a test to cover this line?

good catch. just added a test

giocalitri · 2016-08-08T15:16:50Z

the functionality looks good to me 👍 for my part of the review

noisecapella · 2016-08-08T15:29:09Z

I'm happy too 👍

bdero temporarily deployed to micromasters-ci-pr-811 August 5, 2016 02:51 Inactive

gsidebo reviewed Aug 5, 2016
View reviewed changes

noisecapella self-assigned this Aug 5, 2016

noisecapella reviewed Aug 5, 2016
View reviewed changes

gsidebo added the Needs Review label Aug 5, 2016

bdero temporarily deployed to micromasters-ci-pr-811 August 5, 2016 17:27 Inactive

giocalitri reviewed Aug 5, 2016
View reviewed changes

giocalitri self-assigned this Aug 5, 2016

giocalitri added Waiting on Author and removed Needs Review labels Aug 5, 2016

bdero temporarily deployed to micromasters-ci-pr-811 August 5, 2016 18:56 Inactive

gsidebo added Needs Review and removed Waiting on Author labels Aug 5, 2016

bdero temporarily deployed to micromasters-ci-pr-811 August 5, 2016 19:21 Inactive

noisecapella reviewed Aug 5, 2016
View reviewed changes

bdero temporarily deployed to micromasters-ci-pr-811 August 5, 2016 20:25 Inactive

noisecapella reviewed Aug 5, 2016
View reviewed changes

bdero temporarily deployed to micromasters-ci-pr-811 August 5, 2016 20:56 Inactive

bdero temporarily deployed to micromasters-ci-pr-811 August 5, 2016 20:58 Inactive

bdero temporarily deployed to micromasters-ci-pr-811 August 5, 2016 21:43 Inactive

gsidebo reviewed Aug 5, 2016
View reviewed changes

bdero temporarily deployed to micromasters-ci-pr-811 August 8, 2016 14:45 Inactive

noisecapella reviewed Aug 8, 2016
View reviewed changes

bdero temporarily deployed to micromasters-ci-pr-811 August 8, 2016 15:09 Inactive

noisecapella added Waiting on Author and removed Needs Review labels Aug 8, 2016

gsidebo force-pushed the 778_program_user_indexing branch from 292f496 to dbf41da Compare August 8, 2016 15:37

bdero deployed to micromasters-ci-pr-811 August 8, 2016 15:37 View deployment

Changed indexing structure and logic to use a user's program enrollments

663d872

gsidebo force-pushed the 778_program_user_indexing branch from dbf41da to 663d872 Compare August 8, 2016 16:47

bdero requested a deployment to micromasters-ci-pr-811 August 8, 2016 16:47 Pending

gsidebo merged commit 78ccdc6 into master Aug 8, 2016

gsidebo deleted the 778_program_user_indexing branch August 8, 2016 16:48

pdpinch mentioned this pull request Aug 9, 2016

Search API for staff users #701

Closed

Changed indexing structure and logic to use a user's program enrollments #811

Changed indexing structure and logic to use a user's program enrollments #811

Uh oh!

Conversation

gsidebo commented Aug 5, 2016

What are the relevant tickets?

What's this PR do?

Where should the reviewer start?

How should this be manually tested?

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

giocalitri commented Aug 5, 2016

Uh oh!

gsidebo commented Aug 5, 2016

Uh oh!

noisecapella commented Aug 5, 2016

Uh oh!

gsidebo commented Aug 5, 2016

Uh oh!

Choose a reason for hiding this comment

Uh oh!

gsidebo Aug 5, 2016 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

giocalitri commented Aug 8, 2016

Uh oh!

noisecapella commented Aug 8, 2016

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

gsidebo Aug 5, 2016 •

edited

Loading