Releases · website-scraper/node-website-scraper

92eb86f - Change output format: now it returns full tree of assets for each resource
a1b347b - Add prettifyUrls feature
e7f8b80 - Add urlFilter feature
00443b6 - Add filenameGenerator feature
dc4ab93 - Use lodash instead of underscore
e88abb2 - Add missing extensions for html and css resources
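
As a rough illustration of how the options named in the commits above might be combined, the sketch below builds an options object. Only the option names come from the commit list. The value shapes (a boolean for prettifyUrls, a predicate function for urlFilter, the 'bySiteStructure' generator name) are assumptions and should be checked against the README of the installed version. The object is only constructed here, not passed to the scraper.

```javascript
// Sketch only: value shapes are assumptions, not confirmed by these release notes.
var options = {
  urls: ['http://example.com'],
  directory: '/path/to/save',
  // Assumption: boolean flag that shortens rewritten links.
  prettifyUrls: true,
  // Assumption: predicate called with a URL string; return true to download it.
  urlFilter: function (url) {
    return url.indexOf('http://example.com') === 0;
  },
  // Assumption: name of a bundled filename generator.
  filenameGenerator: 'bySiteStructure'
};

// The filter can be exercised on its own, without running the scraper.
console.log(options.urlFilter('http://example.com/style.css'));
console.log(options.urlFilter('http://other.example.org/'));
```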

Breaking changes
Changed output format.
Earlier, a flat array of root resources was returned:

[ { url: 'http://example.com', filename: 'index.html' } ];

Now a tree of resources is returned, with the dependencies of each resource listed under assets:

[ { 
  url: 'http://example.com', 
  filename: 'index.html',
  assets: [ // dependencies of index.html
    { 
      url: 'http://example.com/style.css', 
      filename: 'style.css', 
      assets: [ // dependencies of style.css
        { url: 'http://example.com/img-from-styles.png', filename: 'img-from-styles.png', assets: [] },
      ] 
    }
    /* other dependencies of index.html */
  ]
} ];
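
Code that expected a flat list may need to walk the new tree. The following is a minimal sketch, assuming each resource has the url, filename and assets fields shown above. The helper name flattenResources is hypothetical and not part of the library.

```javascript
// Hypothetical helper: flatten the tree of resources into one array
// (depth-first, each parent listed before its assets).
function flattenResources(resources) {
  var flat = [];
  resources.forEach(function (resource) {
    flat.push({ url: resource.url, filename: resource.filename });
    // Recurse into dependencies; treat a missing assets field as empty.
    flat = flat.concat(flattenResources(resource.assets || []));
  });
  return flat;
}

// Sample data in the new output format, copied from the example above.
var result = [ {
  url: 'http://example.com',
  filename: 'index.html',
  assets: [
    {
      url: 'http://example.com/style.css',
      filename: 'style.css',
      assets: [
        { url: 'http://example.com/img-from-styles.png', filename: 'img-from-styles.png', assets: [] }
      ]
    }
  ]
} ];

var filenames = flattenResources(result).map(function (r) { return r.filename; });
console.log(filenames);
// [ 'index.html', 'style.css', 'img-from-styles.png' ]
```

Note that this lists every saved resource, whereas the old format contained only the root resources. To get just the roots, read url and filename from the top-level array and ignore assets.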


21 Apr 21:25

s0ph1e

v0.3.6

43b6eca


9732eb1 - Fix similar CSS URLs not being updated


20 Apr 19:23

s0ph1e

v0.3.5

93331b2


bf0a25e Fixed bug with unspecified protocol in resources on https page


17 Apr 19:59

s0ph1e

v0.3.4

003efcb


8e85c73 - Fix loading from <img srcset="">


17 Mar 12:17

s0ph1e

v0.3.3

c9de606


a8871eb - Accept gzip


21 Feb 12:18

s0ph1e

v0.3.2

bc7809a


448514f Handle hash anchors


29 Jan 13:27

s0ph1e

v0.3.1

f1983f7


16a28ef update dependencies
bef139f add options maxDepth and recursive
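
A possible way to set the two new options is sketched below. The option names come from the commit above. Their types (a boolean for recursive, a number for maxDepth) and their exact semantics are assumptions and should be confirmed in the README. The object is only constructed here, not passed to the scraper.

```javascript
// Sketch only: types and semantics of recursive and maxDepth are assumptions.
var recursiveOptions = {
  urls: ['http://example.com'],
  directory: '/path/to/save',
  // Assumption: follow links found in downloaded HTML pages.
  recursive: true,
  // Assumption: upper bound on how deep dependencies are followed.
  maxDepth: 1
};

console.log(recursiveOptions);
```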


05 Aug 19:55

s0ph1e

v0.3.0

61ee69d


69ab9eb - refactor
9636962 - improve detection of duplicate URLs
b2d2bed - improve recognition of resource types
remove log from options
cover with tests

Breaking changes

The filename returned by scrape has changed: it is now a path relative to directory rather than an absolute path.

var options = {
  urls: 'http://example.com',
  directory: '/path/to/save'
};
scrape(options).then(console.log); 
// earlier: [ { url: 'http://example.com', filename: '/path/to/save/index.html' } ];
// now:  [ { url: 'http://example.com', filename: 'index.html' } ];
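
Callers that relied on the absolute path can rebuild it from the directory option. This is a minimal sketch using Node's path module. The result array stands in for the value scrape(options) resolves with, and the helper name toAbsolute is hypothetical.

```javascript
var path = require('path');

var options = {
  urls: 'http://example.com',
  directory: '/path/to/save'
};

// Stand-in for the value scrape(options) resolves with since v0.3.0.
var result = [ { url: 'http://example.com', filename: 'index.html' } ];

// Hypothetical helper: join each relative filename with the target directory.
function toAbsolute(resources, directory) {
  return resources.map(function (resource) {
    return {
      url: resource.url,
      filename: path.join(directory, resource.filename)
    };
  });
}

var absolute = toAbsolute(result, options.directory);
console.log(absolute);
```

The helper returns new objects and leaves the original result untouched, so both the relative and the absolute filenames stay available.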
