Releases: website-scraper/node-website-scraper
v1.0.2
v1.0.1
v1.0.0
- 92eb86f - Change output format: now it returns full tree of assets for each resource
- a1b347b - Add prettifyUrls feature
- e7f8b80 - Add urlFilter feature
- 00443b6 - Add filenameGenerator feature
- dc4ab93 - Use lodash instead of underscore
- e88abb2 - Add missing extensions for html and css resources
Breaking changes
Changed output format.
Earlier, a flat array of root resources was returned:
[ { url: 'http://example.com', filename: 'index.html' } ];
Now, a tree of resources is returned:
[ {
  url: 'http://example.com',
  filename: 'index.html',
  assets: [ // dependencies of index.html
    {
      url: 'http://example.com/style.css',
      filename: 'style.css',
      assets: [ // dependencies of style.css
        { url: 'http://example.com/img-from-styles.png', filename: 'img-from-styles.png', assets: [] },
      ]
    }
    /* other dependencies of index.html */
  ]
} ];
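Code written against the old flat array may need to walk the new tree. The following is a minimal sketch of one way to do that, not part of the library: flattenResources is a hypothetical helper, and the tree literal simply repeats the sample above rather than calling scrape.

```javascript
// Hypothetical helper (not provided by website-scraper): collects every
// resource in the tree, depth-first, into a flat array of { url, filename }.
function flattenResources(resources) {
  const flat = [];
  for (const resource of resources) {
    flat.push({ url: resource.url, filename: resource.filename });
    // Assumption: a missing assets property is treated as "no dependencies".
    flat.push(...flattenResources(resource.assets || []));
  }
  return flat;
}

// Sample data in the v1.0.0 output shape, copied from the example above.
const tree = [ {
  url: 'http://example.com',
  filename: 'index.html',
  assets: [
    {
      url: 'http://example.com/style.css',
      filename: 'style.css',
      assets: [
        { url: 'http://example.com/img-from-styles.png', filename: 'img-from-styles.png', assets: [] }
      ]
    }
  ]
} ];

const flat = flattenResources(tree);
console.log(flat.map(function (r) { return r.filename; }));
// prints [ 'index.html', 'style.css', 'img-from-styles.png' ]
```

Note that this lists every asset, not only the root resources that the pre-1.0.0 output contained; to get just the roots, read the top-level array and ignore assets.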
v0.3.6
v0.3.5
v0.3.4
v0.3.3
v0.3.2
v0.3.1
v0.3.0
- 69ab9eb - refactor
- 9636962 - improve detection of duplicated urls
- b2d2bed - improve recognizing of resource type
- remove log from options
- cover with tests
Breaking changes
filename returned by scrape was changed: it now contains a path relative to directory
var options = {
  urls: 'http://example.com',
  directory: '/path/to/save'
};
scrape(options).then(console.log);
// earlier: [ { url: 'http://example.com', filename: '/path/to/save/index.html' } ];
// now: [ { url: 'http://example.com', filename: 'index.html' } ];
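If existing code relied on the absolute path, it can be rebuilt by joining directory with the returned filename. This is a sketch under that assumption only: the scraped array is hard-coded to mirror the "now" shape above instead of calling scrape, and the variable names are illustrative.

```javascript
const path = require('path');

// Same options as in the example above.
const scrapeOptions = {
  urls: 'http://example.com',
  directory: '/path/to/save'
};

// Stand-in for the value scrape(options) resolves with since v0.3.0.
const scraped = [ { url: 'http://example.com', filename: 'index.html' } ];

// Rebuild the pre-0.3.0 absolute filename from directory + relative filename.
const withAbsolutePaths = scraped.map(function (resource) {
  return {
    url: resource.url,
    filename: path.join(scrapeOptions.directory, resource.filename)
  };
});

console.log(withAbsolutePaths);
```

path.join uses the platform's separator, so the printed path matches the "earlier" form on POSIX systems and uses backslashes on Windows.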