I’m looking for a resource efficient AI model for text generation (math, coding etc.) that will work with LocalAI. Which model should I use? I don’t want it to use more than 1-3 GB RAM. I’ll run it on a vps to use with Nextcloud.

Edit: I’m use Mistral AI and Groq.com instead of selfhosting the models. They both have generous free plan.

  • brucethemoose@lemmy.world
    link
    fedilink
    English
    arrow-up
    3
    ·
    1 month ago

    To actually answer this, you could look into free APIs of open source models, which have daily limits but are otherwise largely catch-free. You could even mirror endpoints on your VPS if you need to, or host “middleware” like prompt formatters and enhancers.

    I say this because, as others said, you cannot actually host AI on a VPS…