
[Feature Request] Estimated Time to Review (ETR) #303

Closed
Archetypically opened this issue Sep 14, 2023 · 4 comments
Labels: enhancement (New feature or request), solved

Comments

Archetypically commented Sep 14, 2023

A great value-add feature would be an estimated time to review (ETR) that the agent could generate. This could be based on whatever the model considers a significant metric (lines of code changed, types of files changed, number of files changed, deletions vs. modifications vs. additions, etc.).

This is valuable to the PR author, who gets feedback on whether a PR is going to feel "too big" to review (in which case the downstream action would be to break it up into multiple PRs).

It would also help reviewers by reducing the cognitive load of "this PR will take forever to review," which can negatively impact reviewer responsiveness.


Suggested post here by @tjwp

mrT23 (Collaborator) commented Sep 15, 2023

@Archetypically that's a nice idea.

Of course it's just an estimate, but I agree it can give the reviewer a consistent baseline and a valuable sense of the size and complexity of the PR.

I opened a pull request:
https://github.com/Codium-ai/pr-agent/pull/306/files

Example results can be found in:
#304

You are welcome to give feedback about the prompt. Is there anything else you think should be added to it?

mrT23 added the enhancement (New feature or request) label Sep 15, 2023
Archetypically (Author)

@mrT23 -- appreciate the quick turnaround! We experimented with a similar prompt, so this is pretty close to what we landed on as well.


One thing that was born out of feedback and prototyping was the concept of categorical sizing; we landed on T-shirt sizing.

Categorize the complexity and time necessary to review this pull request using T-shirt sizes (XS, S, M, L, XL).

And we reported this alongside the time estimate. Generally, we found that the LLM wasn't hugely accurate on the absolute time in certain cases, but the T-shirt sizing was pretty useful.
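As an illustrative sketch (not part of pr-agent), the categorical output could be validated and paired with a rough time band before being reported. The labels follow the prompt above; the time bands and function name are hypothetical:

```python
# Hypothetical mapping from a T-shirt size label returned by the model
# to a rough review-time band. The bands are illustrative assumptions.
SIZE_TO_MINUTES = {"XS": "0-5", "S": "5-15", "M": "15-30", "L": "30-60", "XL": "60+"}

def describe_review_size(label: str) -> str:
    """Normalize the model's label and attach an approximate time band."""
    label = label.strip().upper()
    if label not in SIZE_TO_MINUTES:
        raise ValueError(f"unexpected size label: {label!r}")
    return f"{label} (~{SIZE_TO_MINUTES[label]} min)"
```

Validating the label this way also guards against the model returning free-form text instead of one of the expected categories.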


We also decided to add prompt instructions about ignoring certain types of files that don't need reviewing (think auto-generated GraphQL schemas or .rbi files); that might also be worth considering.

mrT23 (Collaborator) commented Sep 17, 2023

@Archetypically your observation that LLMs are better at comparing and ranking than at giving absolute values is correct. I will change the prompt.

I think the T-shirt size is a bit cumbersome and an unintuitive ranking option, both for the LLM and for a human. Why not just a number: "review effort, on a scale of 1-5"?

Regarding ignoring certain types: we already do that; see 'pr_agent/settings/language_extensions.toml'. You are welcome to open a PR with more types, or to dynamically edit the 'extra' field.
Note that some types are ambiguous. Will you ignore .yaml files? Sometimes they are auto-generated, sometimes not. You can also tell the model to ignore your specific types in the 'extra instructions' prompt.
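A hypothetical sketch of what such an edit could look like; the table and field names here are assumptions based on this thread (the 'extra' field is mentioned above), not verified against the actual file schema:

```toml
# Hypothetical: extend the ignore list with project-specific extensions
# via the 'extra' field. Check language_extensions.toml for the real schema.
[bad_extensions]
extra = ["rbi", "graphql"]
```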

mrT23 (Collaborator) commented Sep 17, 2023

In any case, since this is a nice feature, I think I will turn it on by default. Thanks for the suggestion @Archetypically.
You're welcome to share additional feedback in the future for tuning and optimizing this feature.

merged #306

mrT23 added the solved label Sep 17, 2023
mrT23 closed this as completed Sep 21, 2023