Fixes 67: Adding test coverage for louis-crawler #69

melanie-fressard · 2023-12-13T20:58:05Z

issue #67

Commands used to see the coverage :

python -m coverage run -m unittest discover -s tests
python -m coverage report

melanie-fressard · 2023-12-15T19:55:08Z

Coverage got up to 95%

k-allagbe

Impressive 95% test coverage overall. Testing edge cases should increase it even more.
Be careful not to use real data in tests. Instead, a good strategy would be to populate the tables with test data before each test and rollback after. There are other strategies you can explore as well.
Make sure assertions actually assert the effects of the functions.

k-allagbe · 2023-12-19T05:31:01Z

tests/test_db_crawler.py

    def test_fetch_chunk_id_without_embedding(self):
        """sample test to check if fetch_chunk_id_without_embedding works"""
        with db.cursor(self.connection) as cursor:
            cursor.execute(test.embedding_table.format(embedding_model='test-model'))
            rows = crawler.fetch_chunk_id_without_embedding(cursor, 'test-model')
            _entity_id = rows[0]
            self.connection.rollback()
+
+    def test_store_chunk_item(self):


This test case seems to assert the happy path for store_chunk_item. I suggest also testing the edge case where row is None here:

row = cursor.fetchone() if row is not None: data['chunk_id'] = row['id']

in store_chunk_item. This is important because if there is a possibility that row is None, then we should know exactly what the end result would be.

To go about this, I suggest breaking store_chunk_item into more atomic functions (which would be tested separately), then mocking the one that would normally produce row to return None instead in the test.

broke down store_chunk_item into tinier functions
still need to test if there is a case with none

tests/test_db_crawler.py

k-allagbe · 2023-12-19T06:12:51Z

tests/test_db_crawler.py

+        with db.cursor(self.connection) as cursor:
+            id = crawler.fetch_crawl_ids_without_chunk(cursor)
+            self.connection.rollback()
+        self.assertEqual(id, [])


This assertion isn't adequately testing fetch_crawl_ids_without_chunk. A better test would be to first populate the table with a mix of crawls, some with chunk IDs and some without. Then, we should verify that the function's results match exactly the crawls that don't have chunk IDs, before performing a rollback.

k-allagbe · 2023-12-19T06:25:40Z

tests/test_db_crawler.py

+    def test_fetch_crawl_row(self):
+        """Test fetching a crawl row."""
+        with db.cursor(self.connection) as cursor:
+            row = crawler.fetch_crawl_row(cursor, "https://inspection.canada.ca/a-propos-de-l-acia/structure-organisationnelle/mandat/fra/1299780188624/1319164463699")


This data might not exist in the future, in which case this test will fail. A more suitable test would be to first populate the relevant tables with test items, perform the fetch, then assert the result matches the test items.

fetch_crawl_row also has multiple paths and edge cases worth testing.

k-allagbe · 2023-12-19T06:31:06Z

tests/test_db_crawler.py

+    def test_fetch_chunk_token_row(self):
+        """Test fetching a chunk token row."""
+        with db.cursor(self.connection) as cursor:
+            row = crawler.fetch_chunk_token_row(cursor, "469812c5-190c-4e56-9f88-c8621592bcb5")


This data might not exist in the future, in which case this test will fail. A more suitable test would be to first populate the relevant tables with test items, perform the fetch, then assert the result matches the test items.

melanie-fressard · 2023-12-19T15:01:29Z

Pushing it to 96% :

Will apply correction requested by @k-allagbe soon :)

tests/test_db_crawler.py

melanie-fressard · 2023-12-20T22:20:15Z

To generate documentation based on docstring, python -m pydoc -p 1234 will open a localhost on port 1234 to display the documentation on html pages. Unfortunately, it do not detect the tests module and therefore is not doing any documentation on that despite dosctrings.

melanie-fressard · 2023-12-21T20:12:20Z

Tried to apply more correction but cannot because I have trouble with adding test data to the database. As it is true that test-data should be added for better testing, I would suggest to start from here if anyone is assigned to this issue.
Otherwise, pushed tests to 97% :

rngadam · 2024-01-11T17:03:54Z

@k-allagbe can you follow up yourself and get this PR in?

k-allagbe · 2024-01-12T14:59:49Z

@k-allagbe can you follow up yourself and get this PR in?

On it.

issue #67 - addibg tests + code correction

335b43e

melanie-fressard self-assigned this Dec 13, 2023

melanie-fressard linked an issue Dec 13, 2023 that may be closed by this pull request

Adding test coverage for louis-crawler #67

Open

fixes ruff

b216efe

melanie-fressard marked this pull request as draft December 13, 2023 21:00

melanie-fressard added 4 commits December 13, 2023 16:01

Merge branch 'main' into 67-adding-test-coverage-for-louis-cralwer

2ce3428

issue #67 - equals instead of true

b134081

issue #67 - adding tests 89% coverage

76299eb

tests on errors

22de5b6

melanie-fressard marked this pull request as ready for review December 15, 2023 19:55

melanie-fressard requested review from rngadam and k-allagbe December 15, 2023 19:55

wip test schema

9026150

melanie-fressard marked this pull request as draft December 18, 2023 21:59

k-allagbe requested changes Dec 19, 2023

View reviewed changes

k-allagbe requested review from k-allagbe and JolanThomassin December 19, 2023 06:56

adding test coverage for schema

0a1097d

melanie-fressard added 2 commits December 19, 2023 18:52

issue #67 - dividing store_chunk_item into tiner func

75e6952

correcting some assertions

ed53123

JolanThomassin suggested changes Dec 20, 2023

View reviewed changes

tests/test_db_crawler.py Outdated Show resolved Hide resolved

melanie-fressard added 2 commits December 20, 2023 21:32

generating embeddings

bd527a5

checking for none

be6b249

JolanThomassin approved these changes Dec 21, 2023

View reviewed changes

issue #67 - trying to add test values

21f03f9

rngadam assigned k-allagbe Jan 11, 2024

rngadam unassigned melanie-fressard Jan 11, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fixes 67: Adding test coverage for louis-crawler #69

Fixes 67: Adding test coverage for louis-crawler #69

melanie-fressard commented Dec 13, 2023 •

edited

Loading

melanie-fressard commented Dec 15, 2023

k-allagbe left a comment •

edited

Loading

k-allagbe Dec 19, 2023

melanie-fressard Dec 20, 2023

melanie-fressard Dec 20, 2023

k-allagbe Dec 19, 2023

k-allagbe Dec 19, 2023

k-allagbe Dec 19, 2023

melanie-fressard commented Dec 19, 2023

melanie-fressard commented Dec 20, 2023

melanie-fressard commented Dec 21, 2023

rngadam commented Jan 11, 2024

k-allagbe commented Jan 12, 2024

Fixes 67: Adding test coverage for louis-crawler #69

Are you sure you want to change the base?

Fixes 67: Adding test coverage for louis-crawler #69

Conversation

melanie-fressard commented Dec 13, 2023 • edited Loading

melanie-fressard commented Dec 15, 2023

k-allagbe left a comment • edited Loading

Choose a reason for hiding this comment

k-allagbe Dec 19, 2023

Choose a reason for hiding this comment

melanie-fressard Dec 20, 2023

Choose a reason for hiding this comment

melanie-fressard Dec 20, 2023

Choose a reason for hiding this comment

k-allagbe Dec 19, 2023

Choose a reason for hiding this comment

k-allagbe Dec 19, 2023

Choose a reason for hiding this comment

k-allagbe Dec 19, 2023

Choose a reason for hiding this comment

melanie-fressard commented Dec 19, 2023

melanie-fressard commented Dec 20, 2023

melanie-fressard commented Dec 21, 2023

rngadam commented Jan 11, 2024

k-allagbe commented Jan 12, 2024

melanie-fressard commented Dec 13, 2023 •

edited

Loading

k-allagbe left a comment •

edited

Loading