Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

podcastfy won't take raw text as content file. #239

Open
VincentJGeisler opened this issue Feb 9, 2025 · 0 comments
Open

podcastfy won't take raw text as content file. #239

VincentJGeisler opened this issue Feb 9, 2025 · 0 comments

Comments

@VincentJGeisler
Copy link

When generating podcasts with knowledge bases that are just pure text, eg --url somefile.txt, the program gets confused and acts as though you have sent it a --file flag and tries to load the content of the text file as a file full of url's

ive confirmed that if I convert the raw text file to pdf, then it works as expected with --url somefile.pdf.

here is a dump of an attempt to use a raw text file as a url input.

python -m podcastfy.client --url file:///tcconv.txt --transcript-only
2025-02-09 12:37:37,477 - main - INFO - Processing 1 links
Failed to extract content from file:///tcconv.txt: HTTPSConnectionPool(host='file', port=443): Max retries exceeded with url: /tcconv.txt (Caused by NameResolutionError("<urllib3.connection.HTTPSConnection object at 0x10ae73f50>: Failed to resolve 'file' ([Errno 8] nodename nor servname provided, or not known)"))
Error extracting content from file:///tcconv.txt: Failed to extract content from file:///tcconv.txt: HTTPSConnectionPool(host='file', port=443): Max retries exceeded with url: /tcconv.txt (Caused by NameResolutionError("<urllib3.connection.HTTPSConnection object at 0x10ae73f50>: Failed to resolve 'file' ([Errno 8] nodename nor servname provided, or not known)"))
2025-02-09 12:37:37,488 - main - ERROR - An error occurred in the process_content function: Failed to extract content from file:///tcconv.txt: HTTPSConnectionPool(host='file', port=443): Max retries exceeded with url: /tcconv.txt (Caused by NameResolutionError("<urllib3.connection.HTTPSConnection object at 0x10ae73f50>: Failed to resolve 'file' ([Errno 8] nodename nor servname provided, or not known)"))
An error occurred: Failed to extract content from file:///tcconv.txt: HTTPSConnectionPool(host='file', port=443): Max retries exceeded with url: /tcconv.txt (Caused by NameResolutionError("<urllib3.connection.HTTPSConnection object at 0x10ae73f50>: Failed to resolve 'file' ([Errno 8] nodename nor servname provided, or not known)"))

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant