Show HN: Papermusic (draw an instrument, then play it) This was a fun experiment to try PaliGemma (open vision-language model). I found that PaliGemma performed better than Gemini Flash for this type of specific image task, especially around latency. (~0.9 seconds for PaliGemma inference on a VM, vs. 3-4 seconds for Gemini Flash.) Would love feedback on ways to potentially improve this setup. https://ift.tt/dYmSwi4 June 17, 2024 at 09:56PM
Show HN: Papermusic (draw an instrument, then play it) https://ift.tt/zbK1Z5D
Related Articles
Show HN: Iceburg CRM – Open-Source Meta Driven CRM Using Vue3 / Laravel https://ift.tt/DpLHNMRShow HN: Iceburg CRM – Open-Source Meta Driven CRM Using Vue3 / Larave… Read More
Show HN: Using stylometry to find HN users with alternate accounts https://ift.tt/3LwglISShow HN: Using stylometry to find HN users with alternate accounts htt… Read More
Show HN: A tool that automatically follows people from Twitter on Mastodon https://ift.tt/TF0Z9NOShow HN: A tool that automatically follows people from Twitter on Mast… Read More
Show HN: WinkNLP delivers 600k tokens/second speed on browsers (MBP M1) https://ift.tt/M9vi4oSShow HN: WinkNLP delivers 600k tokens/second speed on browsers (MBP M1… Read More
Show HN: We created a tool to visualize scientific knowledge https://ift.tt/84f2ESqShow HN: We created a tool to visualize scientific knowledge I posted … Read More
Show HN: API to deliver responsive images for Web https://ift.tt/RMzNPFJShow HN: API to deliver responsive images for Web https://ift.tt/BswnI… Read More
Show HN: I built an app that scans every social media network for your username https://ift.tt/h9tFyczShow HN: I built an app that scans every social media network for your… Read More
Show HN: Try out Stable Diffusion models for free https://ift.tt/wG2fPEaShow HN: Try out Stable Diffusion models for free https://ift.tt/2qD6G… Read More
0 Comments: