Scanning multiple directories scans too much #3452
Comments
Ah, that's a flaw alright. When passing multiple input paths, I think that the current behaviour is to find the common ancestor directory of all inputs, walk it, and "ignore" everything that is not under one of the provided paths. That's a bad and stupid behaviour indeed.
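To make the behaviour described in this comment concrete, here is a minimal Python sketch. It is not ScanCode's actual code: the function names `files_via_common_root`, `files_via_each_input` and `demo` are hypothetical, the inputs are assumed to be directories, and the directory layout is built in a temporary folder purely for illustration. It only shows why walking the common ancestor and filtering afterwards visits far more files than walking each requested directory.

```python
import os
import tempfile


def walk_files(root):
    """Yield the path of every file found under root."""
    for dirpath, _dirnames, filenames in os.walk(root):
        for name in filenames:
            yield os.path.join(dirpath, name)


def files_via_common_root(inputs):
    """Walk the whole common ancestor, then keep only files under the inputs.

    Hypothetical illustration: every file under the ancestor is still
    visited, even though most of them are discarded afterwards.
    """
    inputs = [os.path.abspath(p) for p in inputs]
    root = os.path.commonpath(inputs)
    visited = 0
    kept = []
    for path in walk_files(root):
        visited += 1
        if any(path.startswith(p + os.sep) for p in inputs):
            kept.append(path)
    return sorted(kept), visited


def files_via_each_input(inputs):
    """Walk only the requested directories (one possible alternative)."""
    visited = 0
    kept = []
    for p in inputs:
        for path in walk_files(os.path.abspath(p)):
            visited += 1
            kept.append(path)
    return sorted(kept), visited


def demo():
    """Build a tiny third_party/ tree and compare both strategies."""
    with tempfile.TemporaryDirectory() as tmp:
        third_party = os.path.join(tmp, "third_party")
        for pkg in ("curl", "zlib", "other1", "other2", "other3"):
            os.makedirs(os.path.join(third_party, pkg))
            with open(os.path.join(third_party, pkg, "README"), "w") as f:
                f.write(pkg)
        inputs = [
            os.path.join(third_party, "curl"),
            os.path.join(third_party, "zlib"),
        ]
        kept_a, visited_a = files_via_common_root(inputs)
        kept_b, visited_b = files_via_each_input(inputs)
        return kept_a == kept_b, visited_a, visited_b


if __name__ == "__main__":
    print(demo())
```

Both strategies return the same files for `curl` and `zlib`, but the first one visits every file under `third_party/` to get there. With a very large `third_party/` directory, that extra walking is consistent with the apparent hang reported in this issue.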
@JonoYang @AyanSinhaMahapatra what do you think could be the way to improve this?
@pombredanne there's the new
Reference: #3452 Signed-off-by: Ayan Sinha Mahapatra <ayansmahapatra@gmail.com>
I had some time to poke at this this afternoon, and it's not straightforward. @pombredanne @AyanSinhaMahapatra Do you have any documentation on how It also looks like this isn't fully wired up yet. I started with commit 822cc91, and started working through failures. There seem to be some mismatched assumptions about absolute vs relative paths and representation. I went looking for tests for Anyway, wanted to reach out before I went any deeper... Thanks-
That's an inherited technical wart and debt. The original design was to say that a scan would always have a single root directory.
Related: aboutcode-org/commoncode#35
Description
How To Reproduce
We have a giant third_party/ directory. GIANT! Scanning one package works fine. But when scanning two at once, it scans things outside of those directories.
It appears to hang there, but strace shows that it's actually scanning things outside of the curl and zlib directories, which will take a long time.
System configuration