So perplexity can kind of weakly analyze the first few pages of small file size pdfs one at a time, but I’d love to have something that would allow me to upload several hundred research papers and textbooks that could then be analyzed for consensus and contradictions and give me more meaningful search results and summaries than keyword searching alone. Does anything like this exist in a fairly user friendly accessible format?

  • merari42@lemmy.world
    link
    fedilink
    arrow-up
    1
    ·
    edit-2
    4 months ago

    I have used this small R package that allows you to read the text content of a PDF and send it to a local llama model via ollama or one of the large LLM APIs. I could use that to get structured answers in JSON format on a whole folder of papers, but the context length of a typical model is only long enough to hold a single (roughly 40-page) paper in the memory. So I had to get separate structurer answers on each paper and then generate a complete summary from those. Unfortunately that is not user-friendly yet.

  • Andy@slrpnk.net
    link
    fedilink
    arrow-up
    0
    ·
    4 months ago

    I don’t know of one, but I too would be interested to see what this looks like.

    How do you currently store and organize PDFs? I used to use Mendeley during grad school, and honestly I really, really liked it. But being able to ask a question and get a natural language response that suggests which papers might contain insights when taken together would be an incredible asset.

    • Imgonnatrythis@sh.itjust.worksOP
      link
      fedilink
      arrow-up
      1
      ·
      4 months ago

      Menedely hold out here myself! Tried switching to endnote because mendeley is for all practical purposes abandonware now but the conversion is very painful with loss of a lot of data - notes, organizational structure etc.

      Still using the old desktop app nearly daily. If it was still a living project, integrating something like this into mendeley would be incredible.