Skip to content

Latest commit

 

History

History
106 lines (65 loc) · 3.4 KB

CHANGELOG.md

File metadata and controls

106 lines (65 loc) · 3.4 KB

CHANGELOG

Unreleased

v1.31.0

  1. feat(js): add init of script parsing

v1.30.5

  1. "feat(worker): add tls support"

v1.30.3

  1. chore(request): add custom domain redirect policy

v1.30.2

  1. chore(glob): fix glob crawl establish

v1.30.1

  1. chore(crawl): fix crawl asset detection and trailing start

v1.29.0

  1. feat(fs): add temp storage resource handling (#112)
  2. feat(url-glob): URL globbing (#113) thanks to @roniemartinez)

v1.28.5

  1. chore(request): fix resource success handling

v1.28.0

  1. feat(proxies): add proxy support

v1.27.2

  1. feat(decentralization): add workload split

v1.19.36

  1. perf(crawl): add join handle task management

v1.19.26

  1. perf(links): add fast pre serialized url anchor link extracting and reduced memory usage
  2. perf(links): fix case sensitivity handling
  3. perf(crawl): reduce memory usage on link gathering
  4. chore(crawl): remove Website.reset method and improve crawl handling resource usage ( reset not needed now )
  5. chore(crawl): add heap usage of links visited
  6. perf(crawl): massive scans capability to utilize more cpu
  7. feat(timeout): add optional configuration.request_timeout duration
  8. build(tokio): remove unused net feature
  9. chore(docs): add missing scrape section

v1.10.7

  • perf(req): enable brotli
  • chore(tls): add ALPN tls defaults
  • chore(statics): add initial static media ignore
  • chore(robots): add shared client handling across parsers
  • feat(crawl): add subdomain and tld crawling

v1.6.1

  • perf(links): filter dup links after async batch
  • chore(delay): fix crawl delay thread groups
  • perf(page): slim channel page sending required props

v1.5.3

  • feat(regex): add optional regex black listing

v1.5.0

  • chore(bin): fix bin executable #17
  • feat(cli): add cli separation binary #17
  • feat(robots): add robots crawl delay respect and ua assign #24
  • feat(async): add async page body gathering
  • perf(latency): add connection re-use across request #25

v1.4.0

  • feat(cli): add cli ability (#16 thanks to @j-mendez)
  • feat(concurrency): dynamic concurrent cpu defaults (#15 thanks to @j-mendez)
  • docs: add a changelog

v1.3.1

v1.3.0

  • feat(crawl): callback to run when link is found (#13 thanks to @j-mendez)

v1.2.0

v1.1.3