About

A distributed crawler

Requirements

Installation

Via Composer:

composer require teamtnt/crawler

Configuration

Each instance needs its own identifier, which can be set in the .env file:

NODE_NAME="Instance 1"
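
When several instances are deployed, each one needs a different NODE_NAME in its own .env file. The following is a minimal sketch of one way to set it from the shell. It assumes each instance has its own project directory and an .env file that ends with a newline. The helper name `set_node_name` is hypothetical and is not part of teamtnt/crawler.

```shell
# Hypothetical helper (not part of teamtnt/crawler): give an instance its
# identifier by adding NODE_NAME to an env file if it is not already set.
# Assumes the env file, if it exists, ends with a newline.
set_node_name() {
  env_file="$1"
  node_name="$2"
  # Create the file if it does not exist yet.
  touch "$env_file"
  # Leave an existing NODE_NAME untouched so a rerun does not rename the node.
  if ! grep -q '^NODE_NAME=' "$env_file"; then
    printf 'NODE_NAME="%s"\n' "$node_name" >> "$env_file"
  fi
}

# Example (run from the project root of the second instance):
# set_node_name .env "Instance 2"
```

Because the helper only appends when no NODE_NAME line is present, running it twice on the same file keeps the first identifier.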

The domain feeder needs to start with a seed domain. After that, start the crawler by running:

php artisan crawler

To scrape a single URL:

php artisan url:frontier www.example.com/something

Crawler Topology

Domain Feeder

Single Instance

URL Frontier
