LLM queries for personal pdf libraries?

Imgonnatrythis@sh.itjust.works · 4 months ago

LLM queries for personal pdf libraries?

merari42@lemmy.world · edit-2 4 months ago

I have used this small R package that allows you to read the text content of a PDF and send it to a local llama model via ollama or one of the large LLM APIs. I could use that to get structured answers in JSON format on a whole folder of papers, but the context length of a typical model is only long enough to hold a single (roughly 40-page) paper in the memory. So I had to get separate structurer answers on each paper and then generate a complete summary from those. Unfortunately that is not user-friendly yet.

Andy@slrpnk.net · 4 months ago

I don’t know of one, but I too would be interested to see what this looks like.

How do you currently store and organize PDFs? I used to use Mendeley during grad school, and honestly I really, really liked it. But being able to ask a question and get a natural language response that suggests which papers might contain insights when taken together would be an incredible asset.

Imgonnatrythis@sh.itjust.works · 4 months ago

Menedely hold out here myself! Tried switching to endnote because mendeley is for all practical purposes abandonware now but the conversion is very painful with loss of a lot of data - notes, organizational structure etc.

Still using the old desktop app nearly daily. If it was still a living project, integrating something like this into mendeley would be incredible.