fix model query count when no candidate result scores are improvements over current result score #350

a1noack · 2020-11-19T22:50:22Z

It seems that when none of the items in results have a higher score than the cur_result.score, the num_queries attribute of cur_result exits the while loop in the _perform_search method with an incorrect value of model queries. I think this fixes the problem.

… current result score value

a1noack · 2020-11-19T23:42:26Z

Actually, I think this is still partially incorrect.

jinyongyoo · 2020-11-20T01:11:56Z

@a1noack I don't think that should matter, since we actually keep track of the number of queries in GoalFunction class.

a1noack · 2020-11-20T01:49:59Z

@jinyongyoo But I think cur_result and each element in results are GoalFunctionResult objects, and each of these has its own query count.

a1noack · 2020-11-20T06:19:45Z

Okay, after a bit more testing, I'm thinking that I was right the first time. I think the commit I made actually is correct.

uvafan

I think this is correct; good find.

I think it should only matter in cases where the attack ends with an index that it doesn't swap out which is probably relatively rare except maybe with maximizing goal functions.

I think it would be nice to add a comment with a short explanation of why this is necessary; I also think there might be similar issues in most/all of the other SearchMethods when the search ends by failing to find any replacements.

jinyongyoo · 2020-11-20T17:02:33Z

@a1noack I also can see why this might be an issue. This isn't an issue for keeping track of the queries we used so far, but it still is an issue for the number queries we report for each goal function result at the end. Thanks for catching it.

I think all search methods have this issue and I think there's an easier way to fix this.
@uvafan Can we just set num_queries of goal function result returned by perform_search to be equal to num_queries of GoalFunction?

uvafan · 2020-11-20T17:18:12Z

@a1noack I also can see why this might be an issue. This isn't an issue for keeping track of the queries we used so far, but it still is an issue for the number queries we report for each goal function result at the end. Thanks for catching it.

I think all search methods have this issue and I think there's an easier way to fix this.
@uvafan Can we just set num_queries of goal function result returned by perform_search to be equal to num_queries of GoalFunction?

Yeah I think this is a better solution. Good idea

qiyanjun · 2020-11-21T00:23:04Z

@jinyongyoo ready to merge?

jinyongyoo · 2020-11-21T01:29:39Z

@qiyanjun We can merge it, but it'll just fix one search method. What I proposed fixes it for every search method.

@a1noack Hey Adam, do you think it's possible for you to fix it this way?

@a1noack I also can see why this might be an issue. This isn't an issue for keeping track of the queries we used so far, but it still is an issue for the number queries we report for each goal function result at the end. Thanks for catching it.

I think all search methods have this issue and I think there's an easier way to fix this.
@uvafan Can we just set num_queries of goal function result returned by perform_search to be equal to num_queries of GoalFunction?

…orm_search

a1noack · 2020-11-21T08:13:42Z

Here's a more general fix. It's not beautiful, but I think it should be fine as long as we continue to treat SearchMethod's _perform_search function as private and it is only called through the __call__ method.

a1noack · 2020-11-21T20:50:31Z

Okay, so the run_attack_faster_alzantot_recipe-textattack attack --model lstm-mr --recipe faster-alzantot --num-examples 3 --num-examples-offset 32 --shuffle=False test is failing because the expected query count does not equal the actual number of queries.

Part of the expected output for this test is:

+-------------------------------+--------+
| Attack Results                |        |
+-------------------------------+--------+
| Number of successful attacks: | 2      |
| Number of failed attacks:     | 1      |
| Number of skipped attacks:    | 0      |
| Original accuracy:            | 100.0% |
| Accuracy under attack:        | 33.33% |
| Attack success rate:          | 66.67% |
| Average perturbed word %:     | 17.34% |
| Average num. words per input: | 15.0   |
| Avg num queries:              | 551.67 |
+-------------------------------+--------+

But that part in the new output when I run this test is:

+-------------------------------+---------+
| Attack Results                |         |
+-------------------------------+---------+
| Number of successful attacks: | 2       |
| Number of failed attacks:     | 1       |
| Number of skipped attacks:    | 0       |
| Original accuracy:            | 100.0%  |
| Accuracy under attack:        | 33.33%  |
| Attack success rate:          | 66.67%  |
| Average perturbed word %:     | 17.34%  |
| Average num. words per input: | 15.0    |
| Avg num queries:              | 1132.67 |
+-------------------------------+---------+

So the new query count is much higher. Which makes sense given the changes I made.

Not sure what you want to do about this. A bunch of the tests might be wrong.

jinyongyoo · 2020-11-22T04:39:01Z

@a1noack Looks good to me! Thank you for catching this mistake and fixing it. I checked out what was happening with the genetic algorithm, and it turns out whenever we perturb a population member/text, we call get_goal_results on that one text at a time. Therefore, the num_queries stored in goal function result of each population member is out-dated whenever we perturb the next population member/text. When I updated num_queries of individual population member after the whole perturbing step, I obtained the same total number queries as the one reported by GoalFunctionResult.

This reflects that fact that num_queries is really only accurately recorded in the goal function, and not the individual results. Since we intended _perform_search to be private, I think the way you fixed it should be fine.

fix query count when no candidate result score values are better than…

ede35d4

… current result score value

a1noack marked this pull request as draft November 20, 2020 00:19

a1noack marked this pull request as ready for review November 20, 2020 06:19

uvafan approved these changes Nov 20, 2020

View reviewed changes

fix query count for GoalFunctionResult returned by SearchMethod._perf…

fe5aa5c

…orm_search

fix search behavior and update test

545687d

jinyongyoo merged commit 278bb33 into QData:master Nov 22, 2020

a1noack deleted the fix_query_count branch November 23, 2020 23:09

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix model query count when no candidate result scores are improvements over current result score #350

fix model query count when no candidate result scores are improvements over current result score #350

a1noack commented Nov 19, 2020

a1noack commented Nov 19, 2020

jinyongyoo commented Nov 20, 2020

a1noack commented Nov 20, 2020 •

edited

Loading

a1noack commented Nov 20, 2020

uvafan left a comment

jinyongyoo commented Nov 20, 2020

uvafan commented Nov 20, 2020

qiyanjun commented Nov 21, 2020

jinyongyoo commented Nov 21, 2020 •

edited

Loading

a1noack commented Nov 21, 2020

a1noack commented Nov 21, 2020 •

edited

Loading

jinyongyoo commented Nov 22, 2020 •

edited

Loading

fix model query count when no candidate result scores are improvements over current result score #350

fix model query count when no candidate result scores are improvements over current result score #350

Conversation

a1noack commented Nov 19, 2020

a1noack commented Nov 19, 2020

jinyongyoo commented Nov 20, 2020

a1noack commented Nov 20, 2020 • edited Loading

a1noack commented Nov 20, 2020

uvafan left a comment

Choose a reason for hiding this comment

jinyongyoo commented Nov 20, 2020

uvafan commented Nov 20, 2020

qiyanjun commented Nov 21, 2020

jinyongyoo commented Nov 21, 2020 • edited Loading

a1noack commented Nov 21, 2020

a1noack commented Nov 21, 2020 • edited Loading

jinyongyoo commented Nov 22, 2020 • edited Loading

a1noack commented Nov 20, 2020 •

edited

Loading

jinyongyoo commented Nov 21, 2020 •

edited

Loading

a1noack commented Nov 21, 2020 •

edited

Loading

jinyongyoo commented Nov 22, 2020 •

edited

Loading