Show HN: PDF to MD by LLMs – Extract Text/Tables/Image Descriptives by GPT4o I've developed a Python API service that uses GPT-4o for OCR on PDFs. It features parallel processing and batch handling for improved performance. Not only does it convert PDF to markdown, but it also describes the images within the PDF using captions like `[Image: This picture shows 4 people waving]`. In testing with NASA's Apollo 17 flight documents, it successfully converted complex, multi-oriented pages into well-structured Markdown. The project is open-source and available on GitHub. Feedback is welcome. https://ift.tt/q5fHWC7 September 22, 2024 at 07:35AM
Show HN: PDF to MD by LLMs – Extract Text/Tables/Image Descriptives by GPT4o https://ift.tt/NrGJBzd
Related Articles
Show HN: Data Formulator – AI-powered data visualization from Microsoft Research https://ift.tt/UxyuSkQShow HN: Data Formulator – AI-powered data visualization from Mic… Read More
Show HN: I made a site to quick identify any plant and learn how to care for it https://ift.tt/Lv1Sb3JShow HN: I made a site to quick identify any plant and learn how to ca… Read More
Show HN: HN Update – Hourly News Broadcast of Top HN Stories https://ift.tt/gUyzWZuShow HN: HN Update – Hourly News Broadcast of Top HN Stories I feel li… Read More
Show HN: Open-Source Zero-Shot Image Model Server Enabling Model Feedback https://ift.tt/wXRzblPShow HN: Open-Source Zero-Shot Image Model Server Enabling Model Feedb… Read More
Show HN: Semantic Macros Text Editor https://ift.tt/2zc604yShow HN: Semantic Macros Text Editor https://ift.tt/6Rt8jTv October 21… Read More
Show HN: Create mind maps to learn new things using AI https://ift.tt/5HU8Wr4Show HN: Create mind maps to learn new things using AI Enter a topic a… Read More
Show HN: Floating point arithmetic types in C++ for any size and any base https://ift.tt/F4Mpot6Show HN: Floating point arithmetic types in C++ for any size and any b… Read More
Show HN: I created a web app to encrypt/decrypt messages using Web Crypto API https://ift.tt/Jw0EY2MShow HN: I created a web app to encrypt/decrypt messages using Web Cry… Read More
0 Comments: