Skip to content

To increase the speed of this library #1183

@takeruhukushima

Description

@takeruhukushima

I apologize for bothering you during your busy schedule.

Today, Google introduced File search function. And I tried it with the PaperQA2 paper. It solved 6.5 out of 10 questions I create in under 6 seconds. Remarkably, this achieved the same score as when solving the same questions using PaperQA in about 40seconds.Of course, Google showed a tendency to correctly answer ambiguous questions like "What is pqa" while getting strict questions involving numbers wrong, but I'm amazed by this optimization of speed and accuracy.

So, how can we make paperqa's full-text search + similarity search via tantivy and RCS functionality via Gather evidence faster?

As a user, would probably set up something like these fast settings, but...

{
  "answer": {
    "evidence_k": 5,
    "evidence_summary_length": "25 to 50 words",
    "answer_max_sources": 3,
    "answer_length": "50 to 100 words",
    "max_concurrent_requests": 5
  },
  "parsing": {
    "use_doc_details": false
  },
  "prompts": {
    "use_json": false,
    "context_inner": "{name}: {text}"
  },
  "agent": {
    "agent_type": "fake"
  }
}

As a developer, how can I keep the speed under 10 seconds (ideally around 5 seconds) like Google does?

Is it impossible at the point of making the API call?
What is the difference between Google service and this library?
I'd like to hear a professional's detailed explanation.

Metadata

Metadata

Assignees

No one assigned

    Labels

    questionFurther information is requested

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions