-
Notifications
You must be signed in to change notification settings - Fork 6
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Chat GPT as a model to reformulate queries. #37
Comments
@yogeswarl Please note that we need chatgbt to generate the query reformulations. So, the last sentence, "chatgbt can perform suggesting documents for these queries" is not correct, to my understanding. right? basically, we ask chatgbt in these ways: 1- here is the query q, please give us 10 reformulations/paraphrases of it? It's like using T5 when it is trained and we asked for predictions. |
pointer 2 is correct. This is what we will be doing! |
For the comparison, please do these variations: 1- [like a expander] here is the query q, please give us 10 reformulations/paraphrases of it? Thank you. |
Understood. |
|
Update: the above issue has been solved with the use of retrying package. |
hello @hosseinfani , query_category query_length mean_map
paraphrase_poor_gpt_query_mean_length 40.421 0.12944940000000002
paraphrase_poor_refined_query_mean_length 43.193 0.45138459999999997
paraphrase_poor_original_query_mean_length 37.004 0.0359629
paraphrase_somewhat_gpt_query_mean_length 40.398 0.4748991
paraphrase_somewhat_refined_query_mean_length 42.762 0.7004480999999999
paraphrase_somewhat_original_query_mean_length 34.964 0.29953789999999997
paraphrase_relevant_gpt_query_mean_length 41.969 0.7646921
paraphrase_relevant_refined_query_mean_length 42.971 0.8539836000000001
paraphrase_relevant_original_query_mean_length 39.078 0.8028525
finetune_poor_gpt_query_mean_length 60.017 0.5468902000000001
finetune_poor_refined_query_mean_length 43.193 0.45138459999999997
finetune_poor_original_query_mean_length 37.004 0.0359629
finetune_somewhat_gpt_query_mean_length 58.303 0.8195252000000001
finetune_somewhat_refined_query_mean_length 42.762 0.7004480999999999
finetune_somewhat_original_query_mean_length 34.964 0.29953789999999997
finetune_relevant_gpt_query_mean_length 56.367 0.8946335000000001
finetune_relevant_refined_query_mean_length 42.971 0.8539836000000001
finetune_relevant_original_query_mean_length 39.078 0.8028525
infer_poor_gpt_query_mean_length 55.03 0.6592458
infer_poor_refined_query_mean_length 43.193 0.45138459999999997
infer_poor_original_query_mean_length 37.004 0.0359629
infer_somewhat_gpt_query_mean_length 53.818 0.8387605000000001
infer_somewhat_refined_query_mean_length 42.762 0.7004480999999999
infer_somewhat_original_query_mean_length 34.964 0.29953789999999997
infer_relevant_gpt_query_mean_length 51.396 0.9012704
infer_relevant_refined_query_mean_length 42.971 0.8539836000000001
infer_relevant_original_query_mean_length 39.078 0.8028525 |
I am running another set of poor,somewhat and relevant for user reformulation for aol title url. |
@yogeswarl |
We had 3 thresholds: "poor" where original queries were from 0,0.24, "somewhat" = 0.25,0.49, "relevant" = 0.5,1.0 Inference and Finetuned model performed better than T5 and original queries. The stats are also posted in the above comment with the average mean query length and mean map |
There should be only one barplot for this. I am going to make it more optimized |
@yogeswarl |
@hosseinfani |
From the paper I was able to delve these as why ChatGPT is better and few more things I did that were not considered in chatGPT.
One option I can think of here is to stop the number of words chatGPT can see, (i.e 512 words) and compare them both with respect to T5. One problem with chatGPT is that it cannot limit the number of characters to the point like T5. the maximum output length is always much greater than both the mean of t5 and original query |
This Idea involves us asking ChatGPT to generate relevant queries based off the documents we feed.
We will sample about 10,000 documents that have the following criteria from the refined querys
Based on this data. We will be comparing how well ChatGPT can perform suggesting documents for these queries.
The text was updated successfully, but these errors were encountered: