Show HN: PDF to MD by LLMs – Extract Text/Tables/Image Descriptives by GPT4o I've developed a Python API service that uses GPT-4o for OCR on PDFs. It features parallel processing and batch handling for improved performance. Not only does it convert PDF to markdown, but it also describes the images within the PDF using captions like `[Image: This picture shows 4 people waving]`. In testing with NASA's Apollo 17 flight documents, it successfully converted complex, multi-oriented pages into well-structured Markdown. The project is open-source and available on GitHub. Feedback is welcome. https://ift.tt/q5fHWC7 September 22, 2024 at 07:35AM
Show HN: PDF to MD by LLMs – Extract Text/Tables/Image Descriptives by GPT4o https://ift.tt/NrGJBzd
Related Articles
Show HN: Tree-sitter Integration for Swift https://ift.tt/f54y7YJShow HN: Tree-sitter Integration for Swift I have created a Swift pack… Read More
Show HN: SmolCopilot – 360M LLM writing assistant in the browser https://ift.tt/j8QNr1YShow HN: SmolCopilot – 360M LLM writing assistant in the browser Hey! … Read More
Show HN: Wd-40, a static webserver with automatic hot-reloads https://ift.tt/9miqcbQShow HN: Wd-40, a static webserver with automatic hot-reloads It works… Read More
Show HN: lcl.host for Teams – team-wide local HTTPS in development https://ift.tt/WXqH7teShow HN: lcl.host for Teams – team-wide local HTTPS in development htt… Read More
Show HN: A simple and powerful RSS reader for the web https://ift.tt/MsgmhGAShow HN: A simple and powerful RSS reader for the web Hello HN! I've b… Read More
Show HN: We're developing AI employees – seeking early adopters and feedback https://ift.tt/8Sbw4RlShow HN: We're developing AI employees – seeking early adopters and fe… Read More
Show HN: Profiles – personal landing pages built with Markdown https://ift.tt/zQhwibFShow HN: Profiles – personal landing pages built with Markdown I imagi… Read More
Show HN: Permify 1.0 – Open-source fine-grained authorization service https://ift.tt/qeDW0jkShow HN: Permify 1.0 – Open-source fine-grained authorization service … Read More
0 Comments: