I apologize for bothering you during your busy schedule.
Today, Google introduced File search function. And I tried it with the PaperQA2 paper. It solved 6.5 out of 10 questions I create in under 6 seconds. Remarkably, this achieved the same score as when solving the same questions using PaperQA in about 40seconds.Of course, Google showed a tendency to correctly answer ambiguous questions like "What is pqa" while getting strict questions involving numbers wrong, but I'm amazed by this optimization of speed and accuracy.
So, how can we make paperqa's full-text search + similarity search via tantivy and RCS functionality via Gather evidence faster?
As a user, would probably set up something like these fast settings, but...
{
"answer": {
"evidence_k": 5,
"evidence_summary_length": "25 to 50 words",
"answer_max_sources": 3,
"answer_length": "50 to 100 words",
"max_concurrent_requests": 5
},
"parsing": {
"use_doc_details": false
},
"prompts": {
"use_json": false,
"context_inner": "{name}: {text}"
},
"agent": {
"agent_type": "fake"
}
}
As a developer, how can I keep the speed under 10 seconds (ideally around 5 seconds) like Google does?
Is it impossible at the point of making the API call?
What is the difference between Google service and this library?
I'd like to hear a professional's detailed explanation.
I apologize for bothering you during your busy schedule.
Today, Google introduced File search function. And I tried it with the PaperQA2 paper. It solved 6.5 out of 10 questions I create in under 6 seconds. Remarkably, this achieved the same score as when solving the same questions using PaperQA in about 40seconds.Of course, Google showed a tendency to correctly answer ambiguous questions like "What is pqa" while getting strict questions involving numbers wrong, but I'm amazed by this optimization of speed and accuracy.
So, how can we make paperqa's full-text search + similarity search via tantivy and RCS functionality via Gather evidence faster?
As a user, would probably set up something like these fast settings, but...
{ "answer": { "evidence_k": 5, "evidence_summary_length": "25 to 50 words", "answer_max_sources": 3, "answer_length": "50 to 100 words", "max_concurrent_requests": 5 }, "parsing": { "use_doc_details": false }, "prompts": { "use_json": false, "context_inner": "{name}: {text}" }, "agent": { "agent_type": "fake" } }As a developer, how can I keep the speed under 10 seconds (ideally around 5 seconds) like Google does?
Is it impossible at the point of making the API call?
What is the difference between Google service and this library?
I'd like to hear a professional's detailed explanation.