Show HN: Local fine tuning for Mistral and SDXL, GPU mem/latency optimization 100% bootstrapped new startup. It lets you fine tune Mistral-7B and SDXL. In particular, for the LLM fine tuning we implemented a dataprep pipeline that turns websites/pdfs/doc files into question-answer pairs for training the small LLM using an big LLM. It includes a GPU scheduler that can do finegrained GPU memory scheduling (Kubernetes can only do whole-GPU, we do it per-GB of GPU memory to pack both inference and fine tuning jobs into the same fleet) to fit model instances into GPU memory to optimally trade off user facing latency with GPU memory utilization It's a pretty simple stack of control plane and a fat container that runs anywhere you can get hold of a GPU (e.g. runpod). Architecture: https://ift.tt/ey7Kl24 Demo walkthrough showing runner dashboard: https://ift.tt/AzTgW39 Run it yourself: https://ift.tt/LfrJUIF Discord: https://ift.tt/jJSkX8I Please roast me! https://ift.tt/AzTgW39 December 22, 2023 at 01:43AM
Show HN: Local fine tuning for Mistral and SDXL, GPU mem/latency optimization https://ift.tt/dqWvsyr
Related Articles
Show HN: XDeck – An ad-blocking client app for macOS, like TweetDeck https://ift.tt/421PcmkShow HN: XDeck – An ad-blocking client app for macOS, like TweetDeck H… Read More
Show HN: We built an AI Copilot for end to end project development workflow https://ift.tt/g506ORFShow HN: We built an AI Copilot for end to end project development wor… Read More
Show HN: SHAllenge – Compete to get the lowest Hash https://ift.tt/zedWlMuShow HN: SHAllenge – Compete to get the lowest Hash I've always had an… Read More
Show HN: 100% open-source voice assistant – as a HAL9000 https://ift.tt/dcYuvCJShow HN: 100% open-source voice assistant – as a HAL9000 It started in… Read More
Show HN: Model Gateway – bridging your apps with LLM inference endpoints https://ift.tt/VaSnTUDShow HN: Model Gateway – bridging your apps with LLM inference endpoin… Read More
Show HN: Paramount – Human Evals of AI Customer Support https://ift.tt/rsOinMPShow HN: Paramount – Human Evals of AI Customer Support Hey HN, H… Read More
Show HN: Dive into Deep Work–Your Oasis, Your Way https://ift.tt/R8yOWmMShow HN: Dive into Deep Work–Your Oasis, Your Way Elevate your product… Read More
Show HN: Shpool, a Lightweight Tmux Alternative https://ift.tt/bDP7OdAShow HN: Shpool, a Lightweight Tmux Alternative shpool is a terminal s… Read More
0 Comments: