Feat cpe configurations #300

GeorgeFI · 2022-12-16T17:37:34Z

Closes #252

codecov · 2022-12-16T17:52:04Z

Codecov Report

Patch coverage: 55.68% and project coverage change: -1.13 ⚠️

Comparison is base (c0084cf) 74.23% compared to head (21b88b0) 73.09%.

Additional details and impacted files

@@            Coverage Diff             @@
##             main     #300      +/-   ##
==========================================
- Coverage   74.23%   73.09%   -1.13%     
==========================================
  Files          45       45              
  Lines        5606     5629      +23     
==========================================
- Hits         4161     4114      -47     
- Misses       1445     1515      +70

Impacted Files	Coverage Δ
src/sec_certs/utils/pandas.py	`0.00% <0.00%> (ø)`
src/sec_certs/sample/cve.py	`51.79% <39.69%> (+2.75%)`	⬆️
src/sec_certs/sample/cpe.py	`91.14% <85.72%> (-1.28%)`	⬇️
src/sec_certs/dataset/cve.py	`61.44% <87.50%> (-31.61%)`	⬇️
src/sec_certs/dataset/dataset.py	`61.42% <100.00%> (-2.93%)`	⬇️
src/sec_certs/sample/__init__.py	`100.00% <100.00%> (ø)`

... and 5 files with indirect coverage changes

Help us with your feedback. Take ten seconds to tell us how you rate us. Have a feature suggestion? Share it here.

☔ View full report in Codecov by Sentry.
📢 Do you have feedback about the report comment? Let us know in this issue.

adamjanovsky · 2022-12-18T10:48:12Z

src/sec_certs/dataset/cve.py

+        cves = self._get_cves_from_exactly_matched_cpes(cpe_matches)
+        cves_matched_by_configurations = self._get_cves_from_cpe_configurations(cpe_matches)
+        cves.update(cves_matched_by_configurations)
+
+        return cves


Just return {**self._get_cves_from_exactly_matched_cpes(cpe_matches), **self._get_cves_from_cpe_configurations(cpe_matches)

adamjanovsky · 2022-12-18T10:57:12Z

src/sec_certs/sample/cve.py

    vulnerable_cpes: list[CPE]
+    vulnerable_cpe_configurations: list[CPEConfiguration]


Just noticed this. Is there a reason that we have this as a list? Could be set as well, right?

There is no any specific reason for storing CPEConfiguration as a list. I just wanted to keep the same data structure as the CPE records have.

Sure, that makes sense. Maybe there's a reason why CPEs are a list, but I cannot recall any. Could you pls try to refactor both to sets and see what happens?

@adamjanovsky Refactoring seems to be okay - at least the pipeline of downloading CVEs, processing them and building the CVEDataset is passing. Due to the usage of CVEs in pandas_columns I will also investigate the Jupyter notebooks to check if the change does not break the analysis code (e.g. usage of indices on sets etc).

adamjanovsky · 2022-12-18T11:02:52Z

src/sec_certs/sample/cve.py

+            vulnerable_cpes = list(itertools.chain.from_iterable(map(lambda x: x[0], cpes_and_cpe_configurations)))
+            vulnerable_cpe_configurations = list(
+                itertools.chain.from_iterable(map(lambda x: x[1], cpes_and_cpe_configurations))
            )

+            return vulnerable_cpes, vulnerable_cpe_configurations


Just return [list(t) for t in zip(*cpes_and_cpe_configurations)]

src/sec_certs/dataset/cve.py

adamjanovsky

Hey,

thanks for your work. I'm happy that we're now on par with the main branch and matching the complex CPE configurations. Looking at the code, there are still some minor chunks to work on:

We could have a test (similar to test_find_related_cves() in test_cc_analysis.py that would add some artifical CVE (feel free to make up your CVE, name of certificate and cert's CPEs) and check that those complex CVEs are actually properly matched. Same test could be invented for FIPSDataset
We're now fairly considerative when it comes to performance. Could you please measure how much memory does the old CPE/CVE matching takes and how much does the new one takes? Same for runtimes?
I see that some optimizations could still be made. For instance when building look-up dicts for CVEDataset, shouldn't they only be built on CVEs that have no CPEConfiguration records? Also, self.cves_with_cpe_configurations essentially stores some CVEs that are already stored in self.cves, right? Shouldn't we delete them from self.CVEs then?
- But we need to be careful about serialization.
Basically, I'm worried that we store some CVEs twice, and that we also run matching on all CVEs, and then on CVEs with CPEConfiguration records, which deteriorates the performance. This speaks into favour of having all CPEs stored in CPEConfiguration when they are part of CVEDataset.
Could you please doublecheck statistics in vulnerabilities.ipynb and check that they didn't change much? At least that's what I'd expected. Number of detected certs with CVEs should rise IMO, and those would be the certs that we newly match.

I know that this PR takes time and that I always require some changes. It's that I'm not happy with the result just yet. If you'd prefer, I can finish the work so that you can move to something potentiallly more interesting. Let me know in any case. Thanks!

adamjanovsky · 2023-03-10T20:09:22Z

@GeorgeFI finalized my checks and we're good for merge 🎉 , thank you for your effort 👍.

Few thinks I've adjusted:

I simplified parsing of nist dictionary when loading CVE dataset
I simplified some fixtures and tests
I reverted to using lists of CPEs instead of sets (complex reasons, I can describe in person if needed)
I reverted to using CPEs instead of CPE uris in CPEConfiguration objects
I discarded some auxillary CVE filtering (that you didn't introduce, but I've figured it's not helping with runtime or memory consumption).

GeorgeFI added 7 commits December 15, 2022 13:16

feat: Implementation of cpe configs, not tested yet

31ce060

fix: Fixed critical bug in recursion, fixed tests

67ffd74

feat: Added method for filtering cves with cpe configs

b379c0e

feat: Representation of cpes in cpe configs as set

4f806a8

fix: fixed tests

929ca52

docs: Added documentation to the major methods

d9bf915

tests: Written tests for cpe configs

b5b0e9d

GeorgeFI requested a review from adamjanovsky December 16, 2022 17:39

GeorgeFI self-assigned this Dec 16, 2022

adamjanovsky mentioned this pull request Dec 17, 2022

Modelling "and" type CPEs with CPEConfiguration object #285

Closed

adamjanovsky marked this pull request as ready for review December 18, 2022 10:36

adamjanovsky reviewed Dec 18, 2022

View reviewed changes

src/sec_certs/dataset/cve.py Show resolved Hide resolved

adamjanovsky requested changes Dec 18, 2022

View reviewed changes

GeorgeFI added 13 commits December 25, 2022 21:37

refactor: Refactoring from code review

9087cf1

refactor: Refactoring from notes of code review

33cc8b4

chore: Formating, fixed tests

40cbd05

tests: Prepared fixture setup for cpe config match test

864a725

test: Added test for cc, not passing yet

5e2f4ae

test: Added test for CPE configurations

65a3b18

tests: Fixing not passing tests

4a638e1

format: formatting test file with black

b785501

format: manual fixes, black is complaining, but wont fix it

f0d27cb

test: Added tests for FIPS

0448df8

test: Added tests for fips, raw implementation

25122b1

test: Refactored the test

5997346

merge: merged main into feature branch

21ee2d7

GeorgeFI and others added 6 commits February 25, 2023 14:28

refactor: Refactored test for CC

81f78ac

refactor: Refactored match function

00558be

fix: Fixes in jupyter notebook and pandas utils

3e822b5

test: Added my own dummy vulnerable cert

a50e533

fix: Fixed the test for matching cpe

fe2e1de

finalize cpe matching for on/with configurations

67cfddb

adamjanovsky approved these changes Mar 10, 2023

View reviewed changes

adamjanovsky added 3 commits March 10, 2023 21:23

codecov to informational

4fa6672

fix typo codecov.yml

ad1330a

codecov.yml to information also on patch

21b88b0

adamjanovsky merged commit 983bc3c into main Mar 10, 2023

adamjanovsky deleted the feat-cpe-configurations branch March 10, 2023 20:42

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Feat cpe configurations #300

Feat cpe configurations #300

GeorgeFI commented Dec 16, 2022 •

edited by adamjanovsky

Loading

codecov bot commented Dec 16, 2022 •

edited

Loading

adamjanovsky Dec 18, 2022

adamjanovsky Dec 18, 2022

GeorgeFI Dec 20, 2022

adamjanovsky Dec 20, 2022

GeorgeFI Dec 25, 2022 •

edited

Loading

adamjanovsky Dec 18, 2022

adamjanovsky left a comment

adamjanovsky commented Mar 10, 2023

		vulnerable_cpes: list[CPE]
		vulnerable_cpe_configurations: list[CPEConfiguration]

Feat cpe configurations #300

Feat cpe configurations #300

Conversation

GeorgeFI commented Dec 16, 2022 • edited by adamjanovsky Loading

codecov bot commented Dec 16, 2022 • edited Loading

Codecov Report

adamjanovsky Dec 18, 2022

Choose a reason for hiding this comment

adamjanovsky Dec 18, 2022

Choose a reason for hiding this comment

GeorgeFI Dec 20, 2022

Choose a reason for hiding this comment

adamjanovsky Dec 20, 2022

Choose a reason for hiding this comment

GeorgeFI Dec 25, 2022 • edited Loading

Choose a reason for hiding this comment

adamjanovsky Dec 18, 2022

Choose a reason for hiding this comment

adamjanovsky left a comment

Choose a reason for hiding this comment

adamjanovsky commented Mar 10, 2023

GeorgeFI commented Dec 16, 2022 •

edited by adamjanovsky

Loading

codecov bot commented Dec 16, 2022 •

edited

Loading

GeorgeFI Dec 25, 2022 •

edited

Loading