
PR: evaluate code review comments with OpenAI #79

Merged: 30 commits into development, Sep 22, 2024

Conversation

@EresDev (Contributor) commented Aug 8, 2024

Resolves #45
Depends on #55
QA: EresDevOrg/ubiquibot-issues#15 (comment)

See exactly which types of review comments are scored by this PR: #45 (comment)

In the prompt, we ask OpenAI to score review comments, which is not exactly the relevance score used for issue comments. For code review comments, the score reflects how much the comment improves the offered solution and the code quality, which made more sense to me. Because of this, OpenAI is strict in its scoring here: only code review comments with good detail get a good score. OpenAI receives the issue specification, the part of the code change the comment refers to, and the comment itself; it has no further context. This could be improved in the future by including the entire conversation of a single review, or perhaps the entire code file, but that is a lot more work. What we have here is, I think, a good first attempt.

Edit:

Please note that OpenAI is not as strict in evaluating the code as I said above. That was a mistake and it is fixed now; I think it is much better now. Latest QA: EresDevOrg/ubiquibot-issues#15 (comment)
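For reference, a minimal sketch of the kind of evaluation call described above. The function name, model choice, and prompt wording are illustrative assumptions, not the actual content-evaluator-module code:

```typescript
import OpenAI from "openai";

const client = new OpenAI({ apiKey: process.env.OPENAI_API_KEY });

// Hypothetical helper: score one code review comment given the issue spec
// and the diff hunk it refers to, as described in the comment above.
async function evaluateReviewComment(issueSpec: string, diffHunk: string, comment: string): Promise<number> {
  const response = await client.chat.completions.create({
    model: "gpt-4o",
    messages: [
      {
        role: "system",
        content:
          "Score the following code review comment from 0 to 1 based on how much it " +
          "improves the offered solution and the code quality. Reply with a number only.",
      },
      {
        role: "user",
        content: `Issue specification:\n${issueSpec}\n\nCode change:\n${diffHunk}\n\nReview comment:\n${comment}`,
      },
    ],
  });

  // Parse the numeric reply; fall back to 0 if the model did not return a number.
  const score = Number(response.choices[0].message.content?.trim());
  return Number.isFinite(score) ? score : 0;
}
```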

@EresDev marked this pull request as ready for review August 13, 2024 12:42
@gentlementlegen changed the base branch from development to main August 15, 2024 05:12
@gentlementlegen changed the base branch from main to development August 15, 2024 05:12
src/parser/data-purge-module.ts (outdated review thread, resolved)
src/parser/content-evaluator-module.ts (3 outdated review threads, resolved)
@gentlementlegen (Member) commented

@EresDev Could you resolve conflicts so we can get this in please?

@EresDev (Contributor, Author) commented Sep 10, 2024

> Could you resolve conflicts so we can get this in please?

@gentlementlegen
I have resolved the conflicts. Here is the latest QA.

@0x4007 (Member) commented Sep 11, 2024

> > Could you resolve conflicts so we can get this in please?
>
> @gentlementlegen
>
> I have resolved the conflicts. Here is the latest QA.

Honestly it's a bit difficult to understand the results in detail, especially from mobile, but if you think it looks as expected we can merge.

I'm a bit confused why many are scored zero

@gentlementlegen (Member) commented

I'd be in favor of replacing the - for relevances with 0, because to me it feels like no relevance was evaluated at all. I will test this on some previously closed tasks.

@0x4007 (Member) commented Sep 11, 2024

Interesting point. If relevance evaluation was skipped due to the config, then - makes sense. If it evaluated to 0 then we should write 0. Let's do this.
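A minimal sketch of that display rule (the helper name is hypothetical; the plugin's actual rendering code may differ):

```typescript
// Hypothetical helper: keep "-" only when relevance evaluation never ran
// (e.g. it is disabled in the config); otherwise print the numeric score,
// including a genuine 0.
function formatRelevance(relevance: number | undefined, evaluationEnabled: boolean): string {
  if (!evaluationEnabled || relevance === undefined) {
    return "-"; // evaluation was skipped
  }
  return relevance.toString(); // a real score, even if it is 0
}

console.log(formatRelevance(undefined, false)); // "-"
console.log(formatRelevance(0, true)); // "0"
```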

@gentlementlegen (Member) commented

@EresDev I gave it a try here and got a relevance of 1 for all pull-request commands. Perhaps the configuration should be changed as well?

@EresDev (Contributor, Author) commented Sep 11, 2024

> @EresDev I gave it a try here and got a relevance of 1 for all pull-request commands. Perhaps the configuration should be changed as well?

It appears to be running on an old commit; the latest code of this PR isn't being used.

@0x4007 (Member) commented Sep 11, 2024

> > @EresDev I gave it a try here and got a relevance of 1 for all pull-request commands. Perhaps the configuration should be changed as well?
>
> It appears to be running on an old commit; the latest code of this PR isn't being used.

@gentlementlegen change the package version to `*` to waste less time on housekeeping

@EresDev (Contributor, Author) commented Sep 11, 2024

> I'm a bit confused why many are scored zero

Those zero scores appear correct to me. They are mostly comments I wrote when I made a change to the bot and wanted to see its result, for example "updated config". A comment like that has no relevance to the original issue specification, so OpenAI scored its relevance 0. I only wrote it there to keep track of the QA.

@0x4007 mentioned this pull request Sep 11, 2024 (Closed)
@0x4007 (Member) left a comment

It seems fine, although to be honest it's sometimes a bit difficult to tell what's going on from our results table, especially from mobile.

@gentlementlegen (Member) commented

> > > @EresDev I gave it a try here and got a relevance of 1 for all pull-request commands. Perhaps the configuration should be changed as well?
> >
> > It appears to be running on an old commit; the latest code of this PR isn't being used.
>
> @gentlementlegen change the package version to `*` to waste less time on housekeeping

Not sure how this is relevant to the problem; I probably just didn't merge / use the latest commit properly when updating it in my org.

src/parser/content-evaluator-module.ts (outdated review thread, resolved)
@0x4007 (Member) commented Sep 17, 2024

@EresDev Solve merge conflict and you can merge

@0x4007 (Member) commented Sep 22, 2024

Looks like there's a ton of changes. Perhaps you should ensure it works by posting QA

@EresDev (Contributor, Author) commented Sep 22, 2024

> Looks like there's a ton of changes. Perhaps you should ensure it works by posting QA

Here is the latest QA. It looks good to me. If it looks good to you as well, you can merge it. @0x4007

@0x4007 merged commit 30fed26 into ubiquity-os-marketplace:development Sep 22, 2024
3 checks passed
@ubiquity-os bot mentioned this pull request Sep 22, 2024
Successfully merging this pull request may close these issues: Relevance Adjustment