Show HN: Open-source proxy server for Llama2, GPT-4, Claude2 with Logging,Cache Hello hacker news, I’m the maintainer of liteLLM() - package to simplify input/output to OpenAI, Azure, Cohere, Anthropic, Hugging face API Endpoints: https://ift.tt/7Mg4JLW We’re open sourcing our implementation of liteLLM proxy: https://ift.tt/Smn9WrH... TLDR: It has one API endpoint /chat/completions and standardizes input/output for 50+ LLM models + handles logging, error tracking, caching, streaming What can liteLLM proxy do? - It’s a central place to manage all LLM provider integrations - Consistent Input/Output Format - Call all models using the OpenAI format: completion(model, messages) - Text responses will always be available at ['choices'][0]['message']['content'] - Error Handling Using Model Fallbacks (if GPT-4 fails, try llama2) - Logging - Log Requests, Responses and Errors to Supabase, Posthog, Mixpanel, Sentry, Helicone - Token Usage & Spend - Track Input + Completion tokens used + Spend/model - Caching - Implementation of Semantic Caching - Streaming & Async Support - Return generators to stream text responses You can deploy liteLLM to your own infrastructure using Railway, GCP, AWS, Azure Happy completion() ! https://ift.tt/JFL3054 August 12, 2023 at 05:38AM
Show HN: Open-source proxy server for Llama2, GPT-4, Claude2 with Logging,Cache https://ift.tt/MZFkmL3
Related Articles
Show HN: Your AI Product Manager https://ift.tt/jsbkAU3Show HN: Your AI Product Manager Productly uses AI to automatically lo… Read More
Show HN: I made Vinlo – Spinning artwork video for your music https://ift.tt/nBqrcHNShow HN: I made Vinlo – Spinning artwork video for your music Hi HN, I… Read More
Show HN: Prompts as (WASM) Programs https://ift.tt/r2OplbfShow HN: Prompts as (WASM) Programs AICI is a proposed common interfac… Read More
Show HN: Timelock.dev – Send a secret into the future using timelock encryption https://ift.tt/0LUJ1jVShow HN: Timelock.dev – Send a secret into the future using timelock e… Read More
Show HN: Wife couldn't find a dev job so I built a tool to automate the search https://ift.tt/zr61eQGShow HN: Wife couldn't find a dev job so I built a tool to automate th… Read More
Show HN: React Geiger – performance profiling using sound https://ift.tt/5Vhl8x4Show HN: React Geiger – performance profiling using sound https://ift.… Read More
Show HN: Create and share good practices, inspired by nohello https://ift.tt/lFQxBbKShow HN: Create and share good practices, inspired by nohello I wanted… Read More
Show HN: StableBuild – make any Docker container deterministic https://ift.tt/dc8lhsNShow HN: StableBuild – make any Docker container deterministic Hi HN! … Read More
0 Comments: