[Issue]: Agentic Loop over a large document #2542

ismailsimsek · 2024-04-29T12:45:35Z

Describe the issue

Is it possible to read large PDF document in chunks using agents. Without programmatic loop.

Could it be done using task-decomposition? have anyone done something similar?

Steps to reproduce

something like below:

Agents uses tool and reads PDF document 50 pages each time (total 300 pages)
Agents summarizes all the chunks to 6 page output
Summary written to file

Screenshots and logs

No response

Additional Information

No response

WaelKarkoub · 2024-04-29T22:47:53Z

@ismailsimsek are you asking for an OCR capability or a rag capability? If OCR, I believe it's planned in the multimodality road map #1975

ismailsimsek · 2024-04-30T10:54:49Z

Currently trying to get it work with RAG ( RetrieveUserProxyAgent + GroupChatManager )

Appreciate if anyone could point to similar solutions..

Current code is here: https://github.com/ismailsimsek/aistorybooks/blob/story-book/classic_storiesv2.py
PR ismailsimsek/aistorybooks#3

currently just trying to summarize PDF, later on planning to add image generation too

thinkall · 2024-05-24T09:37:30Z

Currently trying to get it work with RAG ( RetrieveUserProxyAgent + GroupChatManager )

Appreciate if anyone could point to similar solutions..

Current code is here: https://github.com/ismailsimsek/aistorybooks/blob/story-book/classic_storiesv2.py PR ismailsimsek/aistorybooks#3

currently just trying to summarize PDF, later on planning to add image generation too

The current RetrieveUserProxyAgent should support PDF files. Have you tried it?

ismailsimsek · 2024-05-24T09:49:09Z

@thinkall i will check it. what i am looking into is summarizing the PDF in small chunks, since its too big. in a loop, is that possible using the agents to loop and process chunks one by one?

thinkall · 2024-05-24T10:00:35Z

@thinkall i will check it. what i am looking into is summarizing the PDF in small chunks, since its too big. in a loop, is that possible using the agents to loop and process chunks one by one?

The agent will split the pdf into chunks and save it into vector db.

thinkall · 2024-08-14T08:31:21Z

Close as it's not active for a long time. Please reopen if the issue still persist.

WaelKarkoub added the multimodal language + vision, speech etc. label Apr 29, 2024

WaelKarkoub added the rag retrieve-augmented generative agents label Apr 30, 2024

ismailsimsek changed the title ~~[Issue]: Looping over a large document~~ [Issue]: Agentic Loop over a large document May 2, 2024

thinkall closed this as completed Aug 14, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Issue]: Agentic Loop over a large document #2542

[Issue]: Agentic Loop over a large document #2542

ismailsimsek commented Apr 29, 2024 •

edited

Loading

WaelKarkoub commented Apr 29, 2024

ismailsimsek commented Apr 30, 2024 •

edited

Loading

thinkall commented May 24, 2024 •

edited

Loading

ismailsimsek commented May 24, 2024 •

edited

Loading

thinkall commented May 24, 2024 •

edited

Loading

thinkall commented Aug 14, 2024

[Issue]: Agentic Loop over a large document #2542

[Issue]: Agentic Loop over a large document #2542

Comments

ismailsimsek commented Apr 29, 2024 • edited Loading

Describe the issue

Steps to reproduce

Screenshots and logs

Additional Information

WaelKarkoub commented Apr 29, 2024

ismailsimsek commented Apr 30, 2024 • edited Loading

thinkall commented May 24, 2024 • edited Loading

ismailsimsek commented May 24, 2024 • edited Loading

thinkall commented May 24, 2024 • edited Loading

thinkall commented Aug 14, 2024

ismailsimsek commented Apr 29, 2024 •

edited

Loading

ismailsimsek commented Apr 30, 2024 •

edited

Loading

thinkall commented May 24, 2024 •

edited

Loading

ismailsimsek commented May 24, 2024 •

edited

Loading

thinkall commented May 24, 2024 •

edited

Loading