-
Notifications
You must be signed in to change notification settings - Fork 0
Spider that crawls the home page of TechCrunch (http://techcrunch.com/)
License
bahtou/TechCrunch-HomePage-Spider
Folders and files
Name | Name | Last commit message | Last commit date | |
---|---|---|---|---|
Repository files navigation
Spider that crawls the home page of TechCrunch (http://techcrunch.com/) Scrapy framework is used to scrape information from the homepages of TechCrunch. Data on who posted, posters link, headline, headline link and time posted are extracted. The data is then dumped into MySQLdb. Checkout: http://scrapy.org/
About
Spider that crawls the home page of TechCrunch (http://techcrunch.com/)
Resources
License
Stars
Watchers
Forks
Releases
No releases published
Packages 0
No packages published