Show HN: Firewall for LLMs–Guard Against Prompt Injection, PII Leakage, Toxicity Hey HN, We're building Aegis, a firewall for LLMs: a guard against adversarial attacks, prompt injections, toxic language, PII leakage, etc. One of the primary concerns entwined with building LLM applications is the chance of attackers subverting the model’s original instructions via untrusted user input, which unlike in SQL injection attacks, can’t be easily sanitized. (See https://ift.tt/M0y6meS for the mildest such instance.) Because the consequences are dire, we feel it’s better to err on the side of caution, with something mutli-pass like Aegis, which consists of a lexical similarity check, a semantic similarity check, and a final pass through an ML model. We'd love for you to check it out—see if you can prompt inject it!, and give any suggestions/thoughts on how we could improve it: https://ift.tt/mfbWGwr . If you want to play around with it without creating an account, try the playground: https://ift.tt/09A1owC . If you're interested in or need help using Aegis, have ideas, or want to contribute, join our Discord ( https://ift.tt/1AHMNRL ), or feel free to reach out at founders@automorphic.ai. Excited to hear your feedback! Repository: https://ift.tt/mfbWGwr Playground: https://ift.tt/09A1owC https://ift.tt/09A1owC June 29, 2023 at 01:36AM
Show HN: Firewall for LLMsGuard Against Prompt Injection PII Leakage Toxicity https://ift.tt/Gqnwpry
Related Articles
Show HN: I created a web app to encrypt/decrypt messages using Web Crypto API https://ift.tt/Jw0EY2MShow HN: I created a web app to encrypt/decrypt messages using Web Cry… Read More
Show HN: HN Update – Hourly News Broadcast of Top HN Stories https://ift.tt/gUyzWZuShow HN: HN Update – Hourly News Broadcast of Top HN Stories I feel li… Read More
Show HN: I built a tool that helps people contact you without spam https://ift.tt/oUT6tFNShow HN: I built a tool that helps people contact you without spam htt… Read More
Show HN: Create mind maps to learn new things using AI https://ift.tt/5HU8Wr4Show HN: Create mind maps to learn new things using AI Enter a topic a… Read More
Show HN: Run, learn, and debug x86-64 Assembly code directly from your browser https://ift.tt/c9kdiIwShow HN: Run, learn, and debug x86-64 Assembly code directly from your… Read More
Show HN: Open-Source Zero-Shot Image Model Server Enabling Model Feedback https://ift.tt/wXRzblPShow HN: Open-Source Zero-Shot Image Model Server Enabling Model Feedb… Read More
Show HN: Contagious Beliefs–Simulating Political Alignment https://ift.tt/OLhF2SNShow HN: Contagious Beliefs–Simulating Political Alignment This is a s… Read More
Show HN: I made a site to quick identify any plant and learn how to care for it https://ift.tt/Lv1Sb3JShow HN: I made a site to quick identify any plant and learn how to ca… Read More
0 Comments: