Hey Abdelrahman, I may be too late joining the conversation but here’s what you can try and it…

I got really excited when i first read about it 2 weeks ago but when i gave it a try on a bunch of…
106
4
Abdelrahman
Shobhit Agarwal
·Follow
Feb 6, 2025
--
Hey Abdelrahman, I may be too late joining the conversation but here’s what you can try and it actually really worked in my case, since i have to deal with longer pdfs like 400-1000 pages.

We did convert each and every page of a pdf as an image and passed it to vision models like GPT-4o, and it was really good in detecting images, tables and text. You have to play around with the prompt.
--
--
Written by Shobhit Agarwal1.4K Followers
·1K Following
🚀 Data Scientist | AI & ML | R&D 🤖 Generative AI | LLMs | Computer Vision ⚡ Deep Learning | Python 🔗 Let’s Connect: topmate.io/shobhit_agarwal
No responses yet
Help
Status
About
Careers
Press
Blog
Privacy
Terms
Text to speech
Teams