[neural search] fix the bug of reading files when calculating the recall scores #7836

shenghwa · 2024-01-12T06:06:58Z

PR types

[ Bug fixes ]

PR changes

[ Scripts ]

Description

For this file 'applications/neural_search/recall/in_batch_negative/evaluate.py'.
If (the number of rows in 'similar_text_pair') mod ('recall_num') doesn't equal 0, the current code will still store the remaining rows in the 'rs' list, resulting in inconsistent list dimensions stored in the 'rs' list. Although it won't raise errors, logically there are some problems.

In this file 'applications/neural_search/recall/simcse/evaluate.py'.
The current loop will not save the last 'relevance_labels' list into the 'rs' list.

Therefore, the corrected code can be applied to the above two files. Besides, I think the issue should be similar for the following files:

examples/semantic_indexing/evaluate.py
applications/question_answering/supervised_qa/faq_system/evaluate.py
applications/question_answering/supervised_qa/faq_finance/evaluate.py
applications/text_classification/multi_class/retrieval_based/evaluate.py
applications/text_classification/hierarchical/retrieval_based/evaluate.py

I hadn't modified these files because I didn't check the usage in their modules. Please check those files. Thanks.

paddle-bot · 2024-01-12T06:07:03Z

Thanks for your contribution!

CLAassistant · 2024-01-12T06:07:04Z

All committers have signed the CLA.

codecov · 2024-01-12T06:44:29Z

Codecov Report

All modified and coverable lines are covered by tests ✅

Comparison is base (a1d1aee) 56.95% compared to head (f856fed) 56.95%.

Additional details and impacted files

@@             Coverage Diff             @@
##           develop    #7836      +/-   ##
===========================================
- Coverage    56.95%   56.95%   -0.01%     
===========================================
  Files          587      587              
  Lines        88628    88628              
===========================================
- Hits         50482    50480       -2     
- Misses       38146    38148       +2

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

w5688414

LGTM

fix the bug of reading files

f856fed

paddle-bot bot added the contributor label Jan 12, 2024

w5688414 self-requested a review January 12, 2024 08:08

w5688414 approved these changes Jan 12, 2024

View reviewed changes

w5688414 changed the title ~~fix the bug of reading files when calculating the recall scores~~ [neural search] fix the bug of reading files when calculating the recall scores Jan 12, 2024

w5688414 merged commit 8bc06b0 into PaddlePaddle:develop Jan 12, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[neural search] fix the bug of reading files when calculating the recall scores #7836

[neural search] fix the bug of reading files when calculating the recall scores #7836

shenghwa commented Jan 12, 2024 •

edited

Loading

paddle-bot bot commented Jan 12, 2024

CLAassistant commented Jan 12, 2024 •

edited

Loading

codecov bot commented Jan 12, 2024

w5688414 left a comment

[neural search] fix the bug of reading files when calculating the recall scores #7836

[neural search] fix the bug of reading files when calculating the recall scores #7836

Conversation

shenghwa commented Jan 12, 2024 • edited Loading

PR types

PR changes

Description

paddle-bot bot commented Jan 12, 2024

CLAassistant commented Jan 12, 2024 • edited Loading

codecov bot commented Jan 12, 2024

Codecov Report

w5688414 left a comment

Choose a reason for hiding this comment

shenghwa commented Jan 12, 2024 •

edited

Loading

CLAassistant commented Jan 12, 2024 •

edited

Loading