Show HN: Solving NYT Connections with ChatGPT Just for fun I decided to see if I could use chatGPT to solve NYT Connections word puzzles. It uses a pretty straightforward BFS search in which the LLM is first prompted to generate several possible groupings of four related words, and then a different prompt is used to evaluate the soundness of each of those groupings. This approach seems to be able to produce the correct solution somewhat less than half the time. Some observations: * For whatever reason, chatGPT-4 seems to be a bit worse than 3.5 at generating Connections groupings. I haven’t tested systematically so maybe this is just some small sample size bias. But at the very least it isn’t obviously better * It really struggles with the “words that can fill in the blank” style groups. Often it will correctly come up with the right category (e.g. “words that can precede `cheese`”) but will only be able to identify 2 of 4 words in that grouping * It frequently generates very vague categories (“words that can be nouns”) despite nothing like that appearing in the proposal prompt. Also it will still sometimes score them highly, despite there being several explicitly examples in the value prompt disallowing these types of categories If you have any idea for how to improve this, please let me know (or send a PR)! https://ift.tt/ns9q0kx December 6, 2023 at 01:41AM
Show HN: Solving NYT Connections with ChatGPT https://ift.tt/BW4oIA7
Related Articles
Show HN: Put Localhost on the Internet Instantly https://ift.tt/3rhc3tDShow HN: Put Localhost on the Internet Instantly https://localhost.run… Read More
Show HN: Papercups – open-source alternative to Intercom https://ift.tt/3vIK4GtShow HN: Papercups – open-source alternative to Intercom https://paper… Read More
Show HN: Glue – Pandas as a DAG https://ift.tt/3s3kmdoShow HN: Glue – Pandas as a DAG https://gluedata.io/ March 19, 2021 at… Read More
Show HN: Usage and crash reports for Python libraries and command line tools https://ift.tt/3tCAQdcShow HN: Usage and crash reports for Python libraries and command line… Read More
Show HN: Relocation for Self-Employed https://ift.tt/3cxVaoSShow HN: Relocation for Self-Employed https://ift.tt/3tgSYcd March 14,… Read More
Show HN: R/shouldibuythisproduct because Amazon reviews is broken https://ift.tt/3lscos3Show HN: R/shouldibuythisproduct because Amazon reviews is broken http… Read More
Show HN: A cyberpunk theme for Python desktop applications https://ift.tt/3ruEMeuShow HN: A cyberpunk theme for Python desktop applications https://ift… Read More
Show HN: O(log n) makes continuous profiling possible https://ift.tt/2OJ4234Show HN: O(log n) makes continuous profiling possible https://ift.tt/3… Read More
0 Comments: