Show HN: Firewall for LLMs–Guard Against Prompt Injection, PII Leakage, Toxicity Hey HN, We're building Aegis, a firewall for LLMs: a guard against adversarial attacks, prompt injections, toxic language, PII leakage, etc. One of the primary concerns entwined with building LLM applications is the chance of attackers subverting the model’s original instructions via untrusted user input, which unlike in SQL injection attacks, can’t be easily sanitized. (See https://ift.tt/M0y6meS for the mildest such instance.) Because the consequences are dire, we feel it’s better to err on the side of caution, with something mutli-pass like Aegis, which consists of a lexical similarity check, a semantic similarity check, and a final pass through an ML model. We'd love for you to check it out—see if you can prompt inject it!, and give any suggestions/thoughts on how we could improve it: https://ift.tt/mfbWGwr . If you want to play around with it without creating an account, try the playground: https://ift.tt/09A1owC . If you're interested in or need help using Aegis, have ideas, or want to contribute, join our Discord ( https://ift.tt/1AHMNRL ), or feel free to reach out at founders@automorphic.ai. Excited to hear your feedback! Repository: https://ift.tt/mfbWGwr Playground: https://ift.tt/09A1owC https://ift.tt/09A1owC June 29, 2023 at 01:36AM
Show HN: Firewall for LLMsGuard Against Prompt Injection PII Leakage Toxicity https://ift.tt/Gqnwpry
Related Articles
Show HN: NanceFi – Visualize public companies revenue sources, costs and margins https://ift.tt/3C280NJShow HN: NanceFi – Visualize public companies revenue sources, costs a… Read More
Show HN: A new stdlib for Golang focusing on platform native support https://ift.tt/YI9g6PqShow HN: A new stdlib for Golang focusing on platform native support N… Read More
Show HN: I Made an App to Summarize YouTube Videos in Just One Click https://ift.tt/g97SaZQShow HN: I Made an App to Summarize YouTube Videos in Just One Click H… Read More
Show HN: An Astro boilerplate to help you launch your SaaS in 3 minutes https://ift.tt/hB4psk6Show HN: An Astro boilerplate to help you launch your SaaS in 3 minute… Read More
Show HN: Podman Quadlet Hetzner ansible template for $5 bun.js app deployments https://ift.tt/F8iSBOnShow HN: Podman Quadlet Hetzner ansible template for $5 bun.js app dep… Read More
Show HN: Anything World – AI for 3D auto-rigging and animation https://ift.tt/YMn4jBWShow HN: Anything World – AI for 3D auto-rigging and animation https:/… Read More
Show HN: Hardcover – Letterboxd for Books https://ift.tt/MB54R93Show HN: Hardcover – Letterboxd for Books Hi HN! A little over two yea… Read More
Show HN: SourceChart – ExcaliDraw but for Charts https://ift.tt/CFAYKG2Show HN: SourceChart – ExcaliDraw but for Charts https://ift.tt/Rg6dQC… Read More
0 Comments: