Show HN: Papermusic (draw an instrument, then play it) This was a fun experiment to try PaliGemma (open vision-language model). I found that PaliGemma performed better than Gemini Flash for this type of specific image task, especially around latency. (~0.9 seconds for PaliGemma inference on a VM, vs. 3-4 seconds for Gemini Flash.) Would love feedback on ways to potentially improve this setup. https://ift.tt/dYmSwi4 June 17, 2024 at 09:56PM
Show HN: Papermusic (draw an instrument, then play it) https://ift.tt/zbK1Z5D
Related Articles
Show HN: CSSBattle – A competitive game for web designers and developers https://ift.tt/PBl70zoShow HN: CSSBattle – A competitive game for web designers and develope… Read More
Show HN: BadUSB that can exfiltrate stored WiFi passwords https://ift.tt/TBkVwg5Show HN: BadUSB that can exfiltrate stored WiFi passwords https://ift.… Read More
Show HN: Track time spent on activities that matter to you https://ift.tt/hY8zUyfShow HN: Track time spent on activities that matter to you https://ift… Read More
Show HN: Build WebExtensions in Go, a Native Way https://ift.tt/chYwA32Show HN: Build WebExtensions in Go, a Native Way Less than a week ago,… Read More
Show HN: Pollux – A Message Passing Cloud Orchestrator https://ift.tt/KioLYTVShow HN: Pollux – A Message Passing Cloud Orchestrator https://ift.tt/… Read More
Show HN: Vimacs – Fast, Feature-rich & Beautiful Neovim configuration https://ift.tt/hSUa6W9Show HN: Vimacs – Fast, Feature-rich & Beautiful Neovim configurat… Read More
Show HN: I made a one button snake game variant https://ift.tt/PxEg8otShow HN: I made a one button snake game variant https://tapsnake.com O… Read More
Show HN: Talk with ChatGPT using your VOICE https://ift.tt/iBNleX7Show HN: Talk with ChatGPT using your VOICE https://ift.tt/vae2H7t Oct… Read More
0 Comments: