Show HN: Tune LLaMa3.1 on Google Cloud TPUs Hey HN, we wanted to share our repo where we fine-tuned Llama 3.1 on Google TPUs. We’re building AI infra to fine-tune and serve LLMs on non-NVIDIA GPUs (TPUs, Trainium, AMD GPUs). The problem: Right now, 90% of LLM workloads run on NVIDIA GPUs, but there are equally powerful and more cost-effective alternatives out there. For example, training and serving Llama 3.1 on Google TPUs is about 30% cheaper than NVIDIA GPUs. But developer tooling for non-NVIDIA chipsets is lacking. We felt this pain ourselves. We initially tried using PyTorch XLA to train Llama 3.1 on TPUs, but it was rough: xla integration with pytorch is clunky, missing libraries (bitsandbytes didn't work), and cryptic HuggingFace errors. We then took a different route and translated Llama 3.1 from PyTorch to JAX. Now, it’s running smoothly on TPUs! We still have challenges ahead, there is no good LoRA library in JAX, but this feels like the right path forward. Here's a demo ( https://ift.tt/6gGmvOw ) of our managed solution. Would love your thoughts on our repo and vision as we keep chugging along! https://ift.tt/csBmZpF September 11, 2024 at 08:44PM
Show HN: Tune LLaMa3.1 on Google Cloud TPUs https://ift.tt/zjxGdMI
Related Articles
Show HN: Dillo 3.1.0 released after 9 years https://ift.tt/FCfXl9yShow HN: Dillo 3.1.0 released after 9 years As commented before[1], I'… Read More
Show HN: gpudeploy.com – "Airbnb" for GPUs https://ift.tt/0LUxDEbShow HN: gpudeploy.com – "Airbnb" for GPUs Hi HN, YC w24 company here.… Read More
Show HN: An AI model that busts AI faces https://ift.tt/IOtzMCFShow HN: An AI model that busts AI faces Hey HN, I'm an AI enthusiast … Read More
Show HN: I built a free in-browser Llama 3 chatbot powered by WebGPU https://ift.tt/c8CEYe5Show HN: I built a free in-browser Llama 3 chatbot powered by WebGPU I… Read More
Show HN: Lightsaber Battle – May the Fourth Be with You https://ift.tt/qgvoW1UShow HN: Lightsaber Battle – May the Fourth Be with You Hey folks, joi… Read More
Show HN: 25.3 inch E-Ink Art Display https://ift.tt/bh3J6jgShow HN: 25.3 inch E-Ink Art Display https://ift.tt/b5TxAmJ May 6, 202… Read More
Show HN: Cvtek: Craft Your Resume/CV Using TOML https://ift.tt/p5MiE1uShow HN: Cvtek: Craft Your Resume/CV Using TOML https://ift.tt/ouMsFnb… Read More
Show HN: Prompt Tower – Simplify writing prompts with multiple code blocks https://ift.tt/kuBU4XgShow HN: Prompt Tower – Simplify writing prompts with multiple code bl… Read More
0 Comments: