-
Notifications
You must be signed in to change notification settings - Fork 6.1k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[Issue]: Agentic Loop over a large document #2542
Comments
@ismailsimsek are you asking for an OCR capability or a rag capability? If OCR, I believe it's planned in the multimodality road map #1975 |
Currently trying to get it work with RAG ( RetrieveUserProxyAgent + GroupChatManager ) Appreciate if anyone could point to similar solutions.. Current code is here: https://github.com/ismailsimsek/aistorybooks/blob/story-book/classic_storiesv2.py currently just trying to summarize PDF, later on planning to add image generation too |
The current RetrieveUserProxyAgent should support PDF files. Have you tried it? |
@thinkall i will check it. what i am looking into is summarizing the PDF in small chunks, since its too big. in a loop, is that possible using the agents to loop and process chunks one by one? |
The agent will split the pdf into chunks and save it into vector db. |
Close as it's not active for a long time. Please reopen if the issue still persist. |
Describe the issue
Is it possible to read large PDF document in chunks using agents. Without programmatic loop.
Could it be done using task-decomposition? have anyone done something similar?
Steps to reproduce
something like below:
Screenshots and logs
No response
Additional Information
No response
The text was updated successfully, but these errors were encountered: