Show HN: Kevin-32B – how to do multi-turn RL on writing CUDA kernels Hey – we just published a blog post about Kevin-32B = K(ernel D)evin. It's to our knowledge the first open-source model that's RL-trained on CUDA kernels. Our goal was to demonstrate multi-turn RL using GRPO. We used 180 Python->CUDA conversion tasks from the KernelBench dataset. The results were surprisingly strong! We were able to outperform top reasoning model like o3 & o4-mini. We're sharing our training setup and learnings in the blogpost. Also the model is on HuggingFace: https://ift.tt/yzarSeH https://ift.tt/Fv4XU2o May 7, 2025 at 01:18AM
Show HN: Kevin-32B – how to do multi-turn RL on writing CUDA kernels https://ift.tt/VEsDdT4
Related Articles
Show HN: FastHTML, a new Python-based system for writing web applications https://ift.tt/Jv1f6dBShow HN: FastHTML, a new Python-based system for writing web applicati… Read More
Show HN: I made an online journaling app focused on day overview using emojis https://ift.tt/KvO2MXgShow HN: I made an online journaling app focused on day overview using… Read More
Show HN: I made a tool to easily transform and manipulate your JSON data https://ift.tt/my9hZ5gShow HN: I made a tool to easily transform and manipulate your JSON da… Read More
Show HN: ChainFactory – Run Structured LLM Inference with Easy Parallelism https://ift.tt/mJSBAOlShow HN: ChainFactory – Run Structured LLM Inference with Easy Paralle… Read More
Show HN: FP32 matmul of large matrices up to 24% faster than cuBLAS on a 4090 https://ift.tt/eYpMOTtShow HN: FP32 matmul of large matrices up to 24% faster than cuBLAS on… Read More
Show HN: A Path-Based Data storage/retrieval web service to prevent crawling https://ift.tt/cWHzDpnShow HN: A Path-Based Data storage/retrieval web service to prevent cr… Read More
Show HN: Chrome Extension to Open Google Maps Locations in Apple Maps https://ift.tt/Khl6TiZShow HN: Chrome Extension to Open Google Maps Locations in Apple Maps … Read More
Show HN: Heyya v1.0.0 Elixir and Phoenix LiveView Snapshot Testing Library https://ift.tt/5hQdcryShow HN: Heyya v1.0.0 Elixir and Phoenix LiveView Snapshot Testing Lib… Read More
0 Comments: