I’m using Ollama on my server with the WebUI. It has no GPU, so it’s not quick to reply, but it’s not too slow either.

I’m thinking about removing the VM as I just don’t use it. Are there any good uses or integrations into other apps that might convince me to keep it?

  • RandomLegend [He/Him]@lemmy.dbzer0.com
    English · 28 upvotes · 2 months ago

    It’s a tool like any other. If you don’t have any use case for it, just don’t use it.

    I use it to summarize release notes and generate some minor descriptions for generic stuff in my TTRPG campaigns.

    • DrinkMonkey@lemmy.ca
      English · 9 upvotes · 2 months ago

      generate some minor descriptions for generic stuff in my TTRPG campaigns.

      Need a quick 200 word description of the interior of an apothecary? Or a band of marauding orcs? It’s been a huge time saver for me.

  • yesman@lemmy.world
    English · 14 upvotes · 2 months ago

    Think of LLMs like a stupid office worker. You wouldn’t rely on them to make critical decisions, but they’re valuable for tedious stuff.

    For example, my calendar changed the way to enter new events, breaking my workflow. Now I just type out a skeletal schedule and have an LLM convert that into a .csv that I import.

    I’m thinking of ripping my CD collection again. I’m researching a way to use an LLM to tidy up the metadata.

    I had a folder full of random stuff I’ve saved for years. I had an LLM organize and categorize it for me. I had to tweak the prompt enough that this was a medium-difficulty task, but it was still way easier than doing it manually.
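
A minimal sketch of the schedule-to-CSV idea above, assuming you send the prompt to whatever model you run and get plain text back. The column names and both helper functions are hypothetical; swap in the header your calendar's import actually expects. The point is to validate the model's reply before importing it, since LLM output is not guaranteed to be well-formed.

```python
import csv
import io

# Hypothetical column names; replace with your calendar's import format.
EXPECTED_COLUMNS = ["Subject", "Start Date", "Start Time"]
FENCE = "`" * 3  # Markdown code-fence marker that models often wrap output in


def build_csv_prompt(schedule: str) -> str:
    """Build a prompt asking the model for CSV with a fixed header row."""
    header = ",".join(EXPECTED_COLUMNS)
    return (
        "Convert the schedule below into CSV with exactly this header row:\n"
        f"{header}\n"
        "Output only the CSV, with no commentary.\n\n"
        f"{schedule}"
    )


def parse_events_csv(reply: str) -> list:
    """Return the rows of a model reply; raise ValueError if it is malformed."""
    # Drop any Markdown fence lines before parsing.
    lines = [ln for ln in reply.strip().splitlines()
             if not ln.strip().startswith(FENCE)]
    reader = csv.DictReader(io.StringIO("\n".join(lines)))
    if reader.fieldnames != EXPECTED_COLUMNS:
        raise ValueError(f"unexpected header: {reader.fieldnames}")
    rows = list(reader)
    for row in rows:
        # DictReader stores extra fields under the key None and fills
        # missing fields with the value None.
        if None in row or any(value is None for value in row.values()):
            raise ValueError(f"wrong number of fields: {row}")
    return rows
```

If `parse_events_csv` raises, re-prompt or fix the reply by hand instead of importing a broken file.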

    • Domi@lemmy.secnd.me
      English · 1 upvote · 2 months ago

      I’m thinking of ripping my CD collection again. I’m researching a way to use an LLM to tidy up the metadata.

      If you ever figure out how to use AI to determine the genre(s) of a song, let me know. Have been looking for something like that for quite a while.

      • 1rre@discuss.tchncs.de
        English · 2 upvotes · 2 months ago

        Yeah, even GPT-4o couldn’t keep track of encounters, run battles etc. in my case…

        I think if you wanted to do it in a mechanically consistent way, you’d probably need to integrate it into a VTT where you give it context, and potentially fine-tune it to give quest-related summaries & GMing rather than just “stuff”.

      • WeLoveCastingSpellz@lemmy.dbzer0.com
        English · 3 upvotes, 2 downvotes · edited · 2 months ago

        The answer is very specific to your PC and the amount of VRAM you have available. But anything Llama 3, even 8B models fine-tuned to DM or write stories, should theoretically work. The other reply that recommends connecting to another program to make sure rules are consistent sounds like a great idea, which I have not tried. I use SillyTavern as the UI, which has lots of options and shit to make things work well. I would recommend going into the “KoboldAI” Discord and asking in the support section; folks there are very helpful. Sorry for not being able to give a straight answer. Also boost the context size way up, that shit makes a difference. I have like 16k or something. Good luck!

        • RandomLegend [He/Him]@lemmy.dbzer0.com
          English · 4 upvotes, 1 downvote · 2 months ago

          What on earth is going on with your keyboard?!

          Besides that, I have 20GB of VRAM and 64GB of RAM. I can run the Mixtral 8x7B model at a relatively usable speed. Currently I use oobabooga the most.

          • WeLoveCastingSpellz@lemmy.dbzer0.com
            English · 1 upvote, 1 downvote · edited · 2 months ago

            I type very poorly on my phone. With that much VRAM you can get something like a 70B model. Definitely ask around in the KoboldAI community, that shit’s crazy.

  • pe1uca@lemmy.pe1uca.dev
    English · 7 upvotes · 2 months ago

    I’ve used it to summarize long articles, news posts, or videos when the title/thumbnail looks interesting but I’m not sure if it’s worth the 10+ minutes to read/watch.
    There are other solutions, like a dedicated summarizer, but I’ve looked into them and they only extract exact quotes from the original text. An LLM can also paraphrase, which makes the summary a bit more informative IMO.
    (For example, one article mentioned a quote from an expert talking about a company, the summarizer only extracted the quote and the flow of the summary made me believe the company said it, but the LLM properly stated the quote came from the expert)

    This project https://github.com/goniszewski/grimoire has on its roadmap a way to connect to an AI to summarize the bookmarks you make and generate at 3 tags.
    I’ve seen the code, but I don’t remember the exact status of the integration.


    Also, I have a few models dedicated to coding, so I’ve also asked for a few pieces of code and configurations to just get started on a project, nothing too complicated.
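
For the summarizing use case above, here is a rough sketch of calling a local Ollama instance over its REST API from Python. It assumes Ollama is listening on its default port (11434) and that a model named `llama3` has been pulled; the model name, word limit, prompt wording, and function names are all illustrative choices, not anything from this thread. The long timeout is there because CPU-only generation can take a while.

```python
import json
import urllib.request

# Ollama's default local endpoint for one-shot text generation.
OLLAMA_URL = "http://localhost:11434/api/generate"


def build_summary_request(text: str, model: str = "llama3",
                          max_words: int = 150) -> dict:
    """Build the JSON payload for a non-streaming summary request."""
    prompt = (
        f"Summarize the following text in at most {max_words} words. "
        "Attribute every quote to the person who said it.\n\n"
        f"{text}"
    )
    # "stream": False asks Ollama for a single JSON object instead of chunks.
    return {"model": model, "prompt": prompt, "stream": False}


def summarize(text: str, model: str = "llama3", timeout: float = 600.0) -> str:
    """Send the text to the local Ollama server and return the summary."""
    payload = json.dumps(build_summary_request(text, model)).encode("utf-8")
    request = urllib.request.Request(
        OLLAMA_URL,
        data=payload,
        headers={"Content-Type": "application/json"},
    )
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.loads(response.read())["response"].strip()


if __name__ == "__main__":
    import sys

    # Usage: python summarize.py < article.txt
    print(summarize(sys.stdin.read()))
```

Piping the article text in on stdin keeps the script easy to chain after whatever you use to fetch a page or a video transcript.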

  • hendrik@palaver.p3x.de
    English · 3 upvotes · edited · 2 months ago

    Roleplay (text adventures), a (stupid but occasionally funny) dungeon master, translation and help with creativity. These are the use cases I found. If you don’t need that, you might get rid of it.

  • bizarroland@fedia.io
    2 upvotes · 2 months ago

    I have a 4070 sitting around collecting dust that I got from a trade. I’ve been thinking about setting it up with Whisper and TTS and having a way to talk to my house.

    I have a couple of smart home integrations, mostly air conditioning, light switches, security, and doors.

    What I would like is to have a few speakers on the walls that can talk to my server, so I can say something like “hey computer, turn on the lights in the dining room” and the lights in the dining room would turn on, without transmitting that information to Google or Amazon.

  • slazer2au@lemmy.world
    English · 4 upvotes, 2 downvotes · 2 months ago

    Wanting answers to things you don’t want Google to know that you don’t know.

  • thirdBreakfast@lemmy.world
    English · 1 upvote · 2 months ago

    I use the Continue VS Code plugin with Ollama to use a couple of different models (deepseek-coder-v2 & starcoder2) to recreate a local-only GitHub Copilot type experience for coding. This is on an M1 Apple Silicon though. For autocomplete the generation needs to be pretty brisk, and I’m not sure how that would go in a VM without a GPU.

  • minnix@lemux.minnix.dev
    English · 2 upvotes, 2 downvotes · 2 months ago

    Ollama without a GPU is pretty useless unless you’re using it with Apple silicon. I’d just get rid of it until you get a GPU.