• 16 Posts
  • 307 Comments
Joined 1 year ago
cake
Cake day: June 21st, 2023

help-circle






  • Kitboga has used AI (STT, LLMs, and TTS) to waste the time of Scammers.

    There are AI tools being used to develop new cures which will benefit everyone.

    There are AI tools being used to help discover new planets.

    I use DLSS for gaming.

    I run a lot of my own local AI models for various reasons. Whisper - for Audio Transcriptions/Translations.

    Different Diffusion Models (SD or Flux) - for some quick visuals to recap a D&D session.

    Tesseract OCR - to scan an image and extract any text that it can find (makes it easy to pull out text from any image and make it searchable).

    Local LLMs (Llama, Mixtral) for brainstorming ideas, reformatting text, etc. It’s great for getting started with certain subjects/topics, as long as I verify everything that it says.

    For fun I’ll probably setup GLaDOS like what was done here: https://www.reddit.com/r/LocalLLaMA/comments/1csnexs/local_glados_now_running_on_windows_11_rtx_2060/





  • Rather than making it illegal to use, people need to use these tools responsibly. If any of these companies are using almost any kind of AI/machine learning they need to include a human in the loop that can verify that it’s working correctly. That way if it starts hallucinating things that were never said, it can be caught and corrected.

    I’ve found that Whisper generally does a better job at translating/transcribing audio than other open source tools out there, so it’s not garbage… But it absolutely is a hazard if you’re trying to rely solely on it for official documents (or legal issues).

    As far as promotion goes… It’s open source software, it’s not being sold.







  • I’ve found that buying used is fine if the car is still under the manufacturers original warranty. Better yet if it has the premium/extended warranty package.

    That’s basically the only warranty that you would care about (and actually want to extend), most other warranties have so many exclusions that they’re not worth it. And definitely ignore anyone calling you telling you that they’ve “been trying to reach you about your cars extended warranty.”





  • For me, I use Whisper for transcribing/translating audio data. This has helped me to double check claims about a video’s translation (there’s a lot of disinformation going around for topics involving certain countries at war).

    Nvidia’s DLSS for gaming.

    Different diffusion models for creating quick visual recaps of previous D&D sessions.

    Tesseract OCR to quickly copy out text from an image (although I’m currently looking for a better one since this one is a bit older and, while it gets the text mostly right, there’s still a decent amount that it gets wrong).

    LLMs for brainstorming or in the place of some stack overflow questions when picking up a new programming language.

    I also saw an interesting use case from a redditor:

    I had about 80 VHS family home videos that I had converted to digital

    I then ran the 1-4 hour videos through WhisperAI Large-v3 transcription and pasted those transcripts into a prompt which had a little bit of background information on my family like where we live and names of everyone who might show up in the videos, and then gave the prompt some examples of how I wanted the file names to look, for example:

    1996 Summer - Jane’s birthday party - Joe’s Soccer game - Alaska cruise - Lanikai Beach

    And then had Claude write me titles for all the home videos and give me a little word doc to put in each folder which catalogues all the events in each video. It came out so good I have been considering this as a side business

    https://www.reddit.com/r/LocalLLaMA/comments/1gaz5kg/what_are_some_of_the_most_underrated_uses_for_llms/lthuxsu/