Show HN: Trained Tiny Tales GPT(30M model)from scratch and deployed in $15 For the last few weeks, I have been working on training an LLM from scratch and deploying it in production on Google Cloud Platform. Finally, I trained a 30 million parameter model on 1 billion tokens and deployed it as a web service. You can access the LLM using this site - https://ift.tt/JNOro3u The following steps were taken to build Tiny Tales GPT 1. Downloaded and preprocessed 8GB of dataset using multiprocessing library. 2. Tokenized the data using byte pair encoding to create 1 billion tokens sharded in different bin files. 3. Defined a training setup and trained the model on a small version of the LLaMA model architecture with 30 million parameters. 4. The training was done using Distributed Data-Parallel on two A-100 GPUs provided by JarvisLabs.ai (they are most cost-optimized) 5. After the training is done, an inference script is created to predict the tokens from the trained model given the input context vector. 6. Developed REST-based API service using Flask framework to interact with the inference service to the end user. 7. Finally used GCP's virtual machines, instance groups, load balancers, and DNS services to deploy the service on the internet. https://ift.tt/odhkjPA April 18, 2024 at 02:25AM
Show HN: Trained Tiny Tales GPT(30M model)from scratch and deployed in $15 https://ift.tt/qGD6ZuB
Related Articles
Show HN: Ungrey for Hacker News https://ift.tt/3cM6cprShow HN: Ungrey for Hacker News https://ift.tt/2ZheC4p May 20, 2020 at… Read More
Show HN: Statux, a minimal React state management library with hooks https://ift.tt/2T8tONtShow HN: Statux, a minimal React state management library with hooks h… Read More
Show HN: I'm releasing my 2D Tile Map Editor for free https://ift.tt/2yofskGShow HN: I'm releasing my 2D Tile Map Editor for free https://ift.tt/2… Read More
Show HN: Deepdesigns.ai – Design a unique face covering with deep learning https://ift.tt/3cmUba2Show HN: Deepdesigns.ai – Design a unique face covering with deep lear… Read More
Show HN: Open Source Web/API Monitoring Tool https://ift.tt/2WfRJwiShow HN: Open Source Web/API Monitoring Tool https://ift.tt/2WlfaV7 Ma… Read More
Show HN: Real-Time Session Invalidation https://ift.tt/2yOgR4bShow HN: Real-Time Session Invalidation https://ift.tt/2LltTco May 12,… Read More
Show HN: Host live public audio conversations https://ift.tt/2X80EAfShow HN: Host live public audio conversations https://talkjaw.com/ May… Read More
Show HN: Generates relations between DOM elements depending on their depth https://ift.tt/2LyD6hCShow HN: Generates relations between DOM elements depending on their d… Read More
0 Comments: