Show HN: Solving NYT Connections with ChatGPT

Just for fun, I decided to see if I could use ChatGPT to solve NYT Connections word puzzles. It uses a fairly straightforward breadth-first search (BFS): the LLM is first prompted to generate several possible groupings of four related words, and then a different prompt is used to evaluate the soundness of each of those groupings. This approach seems to produce the correct solution somewhat less than half the time.

Some observations:

* For whatever reason, ChatGPT-4 seems to be a bit worse than 3.5 at generating Connections groupings. I haven't tested systematically, so this may just be small-sample bias. But at the very least it isn't obviously better.
* It really struggles with the "words that can fill in the blank" style groups. Often it will come up with the right category (e.g. "words that can precede `cheese`") but will only be able to identify 2 of the 4 words in that grouping.
* It frequently generates very vague categories ("words that can be nouns") despite nothing like that appearing in the proposal prompt. It will also still sometimes score them highly, despite several explicit examples in the value prompt disallowing these types of categories.

If you have any ideas for how to improve this, please let me know (or send a PR)!

https://ift.tt/ns9q0kx

December 6, 2023 at 01:41AM
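For readers who want a concrete picture of the propose-then-evaluate search described in the post, here is a minimal sketch. It is not the author's code: the `solve` function, the `threshold` cutoff, and the idea of treating each partial solution as a search state are all assumptions made for illustration. The two LLM prompts are represented by plain callables (`propose` and `score`), and the toy stand-ins below use a hard-coded answer key instead of calling any model.

```python
from collections import deque
from typing import Callable, List, Optional

# Hypothetical interfaces: in a real solver these would wrap the
# "proposal" prompt and the "value" prompt sent to the LLM.
Proposer = Callable[[List[str]], List[List[str]]]
Scorer = Callable[[List[str]], float]


def solve(words: List[str], propose: Proposer, score: Scorer,
          threshold: float = 0.5, group_size: int = 4) -> Optional[List[List[str]]]:
    """Breadth-first search over partial solutions (assumed design).

    A state is (groups chosen so far, words still unassigned).
    """
    queue = deque([((), frozenset(words))])
    seen = set()
    while queue:
        groups, remaining = queue.popleft()
        if not remaining:
            return [sorted(g) for g in groups]
        candidates = []
        for cand in propose(sorted(remaining)):
            group = frozenset(cand)
            # Discard malformed proposals (wrong size or unknown words).
            if len(group) != group_size or not group <= remaining:
                continue
            value = score(sorted(group))
            if value >= threshold:
                candidates.append((value, group))
        # Expand the highest-scoring groupings first.
        candidates.sort(key=lambda c: c[0], reverse=True)
        for _, group in candidates:
            state = (frozenset(groups + (group,)), remaining - group)
            if state in seen:
                continue  # same set of groups reached in another order
            seen.add(state)
            queue.append((groups + (group,), remaining - group))
    return None  # no grouping passed the evaluator


# Toy stand-ins for the two prompts, using a fixed answer key.
ANSWER = [{"BLUE", "CREAM", "COTTAGE", "STRING"}, {"BASS", "PIKE", "SOLE", "CARP"}]
WORDS = sorted(w for g in ANSWER for w in g)


def toy_propose(remaining: List[str]) -> List[List[str]]:
    # One deliberately naive guess plus any answer group still available.
    return [remaining[:4]] + [sorted(g) for g in ANSWER if g <= set(remaining)]


def toy_score(group: List[str]) -> float:
    return 1.0 if set(group) in ANSWER else 0.0


if __name__ == "__main__":
    print(solve(WORDS, toy_propose, toy_score))
```

With real prompts in place of the toy functions, the quality of the search depends almost entirely on the scorer: if vague categories such as "words that can be nouns" pass the threshold, they will be expanded like any other grouping.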
Show HN: Solving NYT Connections with ChatGPT https://ift.tt/BW4oIA7