Show HN: I replicated Anthropic's monosemanticity research using just my MacBook Hi everyone, I've been working on an open-source implementation of Anthropic's research on monosemanticity ("Towards Monosemanticity"). The problem Anthropic is trying to solve is that language models are hard to interpret because individual neurons can be responsible for multiple different things. The research finds that training a small autoencoder on neuron activations can result in "features" which are much easier to interpret. When I was reading the original research, I got really excited when I realized that the models they used were really small, and I could probably train them from scratch with just my M3 MBP. My models are somewhat undertrained compared to what Anthropic produced, but I think my results are still very compelling. Let me know what you think! https://ift.tt/Ddfgl2K April 30, 2024 at 10:56PM
Show HN: I replicated Anthropic's monosemanticity research using just my MacBook https://ift.tt/aopBX16
Related Articles
Show HN: Host a planet-scale geocoder for $10/mo https://ift.tt/UoANDczShow HN: Host a planet-scale geocoder for $10/mo For the uninitiated, … Read More
Show HN: Programming is easier than you think https://ift.tt/YG0fn7aShow HN: Programming is easier than you think https://ift.tt/Wug1e9K F… Read More
Show HN - tool that converts image receipts to Excel https://ift.tt/w8HB4C5Show HN - tool that converts image receipts to Excel Hey I'm excited t… Read More
Show HN: Domino Fit – Domino Tiling Puzzle https://ift.tt/uJRes2bShow HN: Domino Fit – Domino Tiling Puzzle Domino fit is a domino tili… Read More
Show HN: Caps-log (Captain's log) – A small TUI journaling tool https://ift.tt/9vQVRiqShow HN: Caps-log (Captain's log) – A small TUI journaling tool Caps-l… Read More
Show HN: I Built an Open Source API with Insanely Fast Whisper and Fly GPUs https://ift.tt/EcrCkwtShow HN: I Built an Open Source API with Insanely Fast Whisper and Fly… Read More
Show HN: Driftmania – an open source PICO-8 racing game https://ift.tt/u4rLPZMShow HN: Driftmania – an open source PICO-8 racing game I've been spen… Read More
Show HN: The History Chronicle – daily historical facts in newspaper form https://ift.tt/Nchj8FMShow HN: The History Chronicle – daily historical facts in newspaper f… Read More
0 Comments: