-
Notifications
You must be signed in to change notification settings - Fork 20
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
potential incorrect annotation #7
Comments
I pick another file Bmr006.json, and the annotations also seem to be somewhat, if not very, problematic. I did a little bit search and found that train/Bed005.json and test/Bmr006.json share the exact same topic_list and queries but different meeting transcripts. |
@maszhongming Any updates on this? |
Sorry for the annotations that may be problematic! In fact, @WadeYin9712 and I are responsible for the review of Product domain and (part of) Committee domain in QMSum dataset. It seems that the data in the Academic domain has various problems. We will contact the corresponding annotator and reviewer and try to fix them one by one. Have you found similar problems in the data of the other two domains? |
Thanks for the reply! I haven't found other issues so far in the other domains. Is there an estimated timeline of the fix? |
@maszhongming Any updates? |
Sorry for the late reply! We're trying to seek another batch of qualified annotators. But it might take a long time to find them, train them and finish the re-annotation. We plan to fix the problematic meetings like Bed008.json this week. We will inform you once we accomplished the re-annotation of these meetings. Thanks! |
Hi, we have updated the annotations of Bmr006 and Bed008. We will continue to look for problematic annotations and fix them. |
Hi,
I am manually checking the data annotation. I randomly pick one file in the test set, which is Bed008.json. In my opinion, the annotations are a bit problematic. See below for detailed analysis:
...
Seems to me that you mismatch the transcripts and questions/summaries. Please correct me if I misunderstand anything!
Thanks.
The text was updated successfully, but these errors were encountered: