- San Francisco
- https://davidwells.io
- @DavidWells
- davidwells
Scraping
Extracts content information from known URL patterns.
Downloads high-resolution images from Fine Art America, Conde Nast Store, Photos.com, and Pixels.com. Note: according to the project, "the current reverse engineering approach is non-functional."
A standalone version of the Readability library
⬛️ CLI tool for saving complete web pages as a single HTML file
Pre-built Chromium binaries for AWS Lambda, compatible with Playwright and Puppeteer.
Chromium (x86-64) for Serverless Platforms
An AI web browsing framework focused on simplicity and extensibility.
Turn any website into an API in a few clicks (serverless, with SPA support)
Automagically reverse-engineer REST APIs via capturing traffic
Web Extension for saving a faithful copy of a complete web page in a single HTML file
🗃 Open source self-hosted web archiving. Takes URLs/browser history/bookmarks/Pocket/Pinboard/etc., saves HTML, JS, PDFs, media, and more...
Self-hosted bookmark manager that is designed to be minimal, fast, and easy to set up using Docker.
A browser extension for saving web documents locally, so you can access them offline and quickly search page content without an internet connection, while also reducing browser memory usage.
Fetch an entire site and save it as a text file (to be used with AI models).
A lightweight RSS parser, for Node and the browser
Framework for orchestrating role-playing, autonomous AI agents. By fostering collaborative intelligence, CrewAI empowers agents to work together seamlessly, tackling complex tasks.
Get unified metadata from websites using Open Graph, Microdata, RDFa, Twitter Cards, JSON-LD, HTML, and more.
serverless-pdf-generator is a lightweight package that simplifies the process of generating PDFs from web pages in a serverless environment like Vercel. It utilizes Puppeteer and Chromium to render…