PHPackages                             piedweb/seo-pocket-crawler - PHPackages - PHPackages  [Skip to content](#main-content)[PHPackages](/)[Directory](/)[Categories](/categories)[Trending](/trending)[Leaderboard](/leaderboard)[Changelog](/changelog)[Analyze](/analyze)[Collections](/collections)[Log in](/login)[Sign up](/register)

1. [Directory](/)
2. /
3. [Utility &amp; Helpers](/categories/utility)
4. /
5. piedweb/seo-pocket-crawler

ActiveLibrary[Utility &amp; Helpers](/categories/utility)

piedweb/seo-pocket-crawler
==========================

Web Crawler to check few SEO basics.

v0.0.7(5y ago)211.4k2MITPHPPHP ~7.3|^8.0

Since Jan 17Pushed 5y ago1 watchersCompare

[ Source](https://github.com/PiedWeb/SeoPocketCrawler)[ Packagist](https://packagist.org/packages/piedweb/seo-pocket-crawler)[ Docs](https://dev.piedweb.com)[ RSS](/packages/piedweb-seo-pocket-crawler/feed)WikiDiscussions master Synced today

READMEChangelogDependencies (8)Versions (8)Used By (0)

[![Open Source Package](https://raw.githubusercontent.com/PiedWeb/piedweb-devoluix-theme/master/src/img/logo_title.png)](https://dev.piedweb.com)

CLI Seo Pocket Crawler
======================

[](#cli-seo-pocket-crawler)

[![Latest Version](https://camo.githubusercontent.com/0596bd7c5ca5ca44c4eef11790e96cfccb825acf2e463a998206169d72e3ada4/68747470733a2f2f696d672e736869656c64732e696f2f6769746875622f7461672f506965645765622f53656f506f636b6574437261776c65722e7376673f7374796c653d666c6174266c6162656c3d72656c65617365)](https://github.com/PiedWeb/SeoPocketCrawler/tags)[![Software License](https://camo.githubusercontent.com/f251623e510f5909f16ae3f4e6e548dac11340b9fde1a99be26b015b39272c00/68747470733a2f2f696d672e736869656c64732e696f2f62616467652f6c6963656e73652d4d49542d627269676874677265656e2e7376673f7374796c653d666c6174)](LICENSE)[![GitHub Tests Action Status](https://camo.githubusercontent.com/564e20e6f56b5cf8c860da2acfe70c6552cadd787387fd24eaa2eede2501dc9b/68747470733a2f2f696d672e736869656c64732e696f2f6769746875622f776f726b666c6f772f7374617475732f506965645765622f53656f506f636b6574437261776c65722f54657374733f6c6162656c3d7465737473)](https://github.com/PiedWeb/SeoPocketCrawler/actions)[![Quality Score](https://camo.githubusercontent.com/04b7aa77e46f3ce14cf188530f465987fcefdaed18ef76fa75d470589f682749/68747470733a2f2f696d672e736869656c64732e696f2f7363727574696e697a65722f672f506965645765622f53656f506f636b6574437261776c65722e7376673f7374796c653d666c6174)](https://scrutinizer-ci.com/g/PiedWeb/SeoPocketCrawler)[![Code Coverage](https://camo.githubusercontent.com/733b4b88154cf8ee337e4c43e70d9d95dad7aa886edd937ec599b03cf7153661/68747470733a2f2f636f6465636f762e696f2f67682f506965645765622f53656f506f636b6574437261776c65722f6272616e63682f6d61696e2f67726170682f62616467652e737667)](https://codecov.io/gh/PiedWeb/SeoPocketCrawler/branch/main)[![Type Coverage](https://camo.githubusercontent.com/ecf83b6699a32435740a717b9269504534632daf40e539b202e8681c6a8e449a/68747470733a2f2f73686570686572642e6465762f6769746875622f506965645765622f53656f506f636b6574437261776c65722f636f7665726167652e737667)](https://shepherd.dev/github/PiedWeb/SeoPocketCrawler)[![Total Downloads](https://camo.githubusercontent.com/faf8d0683f3d43aaefe606e60850101b7dcb515e2bd1956d3f77c4c6ccbd95f2/68747470733a2f2f696d672e736869656c64732e696f2f7061636b61676973742f64742f706965647765622f73656f2d706f636b65742d637261776c65722e7376673f7374796c653d666c6174)](https://packagist.org/packages/piedweb/seo-pocket-crawler)

Web Crawler to check few SEO basics.

Use the collected data in your favorite spreadsheet software or retrieve them via your favorite language.

French documentation available :

Install
-------

[](#install)

Via [Packagist](https://img.shields.io/packagist/dt/piedweb/seo-pocket-crawler.svg?style=flat)

```
$ composer create-project piedweb/seo-pocket-crawler
```

Usage
-----

[](#usage)

### Crawler CLI

[](#crawler-cli)

```
$ bin/console crawler:go $start
```

#### Arguments:

[](#arguments)

```
  start                            Define where the crawl start. Eg: https://piedweb.com
                                   You can specify an id from a previous crawl. Other options will not be listen.
                                   You can use `last` to continue the last crawl (just stopped)

```

#### Options:

[](#options)

```
  -l, --limit=LIMIT                Define where a depth limit [default: 5]
  -i, --ignore=IGNORE              Virtual Robots.txt to respect (could be a string or an URL).
  -u, --user-agent=USER-AGENT      Define the user-agent used during the crawl. [default: "SEO Pocket Crawler - PiedWeb.com/seo/crawler"]
  -w, --wait=WAIT                  In Microseconds, the time to wait between 2 requests. Default 0,1s. [default: 100000]
  -c, --cache-method=CACHE-METHOD  In Microseconds, the time to wait between two request. Default : 100000 (0,1s). [default: 2]
  -r, --restart=RESTART            Permit to restart a previous crawl. Values 1 = fresh restart, 2 = restart from cache
  -h, --help                       Display this help message
  -q, --quiet                      Do not output any message
  -V, --version                    Display this application version
      --ansi                       Force ANSI output
      --no-ansi                    Disable ANSI output
  -n, --no-interaction             Do not ask any interactive question
  -v|vv|vvv, --verbose             Increase the verbosity of messages: 1 for normal output, 2 for more verbose output and 3 for debug

```

### Extract All External Links in 1s from a previous crawl

[](#extract-all-external-links-in-1s-from-a-previous-crawl)

```
$ bin/console crawler:external $id [--host]
```

```
    --id
        id from a previous crawl
        You can use  `last` too show external links from the last crawl.

    --host -ho
        flag permitting to get only host

```

### Calcul Page Rank

[](#calcul-page-rank)

Will update the previous `data.csv` generated. Then you can explore your website with the PoC `pagerank.html`(in a server `npx http-server -c-1 --port 3000`).

```
$ bin/console crawler:pagerank $id
```

```
    --id
        id from a previous crawl
        You can use `last` too calcul page rank from the last crawl.

```

Testing
-------

[](#testing)

```
$ composer test
```

Todo
----

[](#todo)

- Better Links Harvesting and Recording (record context (list, nav, sentence...))
- Transform the PoC (Page Rank Visualizer)
- Complex Page Rank Calculator (with 301, canonical, nofollow, etc.)

Contributing
------------

[](#contributing)

Please see [contributing](https://dev.piedweb.com/contributing)

Credits
-------

[](#credits)

- [PiedWeb](https://piedweb.com) ak [Robind4](https://twitter.com/Robind4)
- [All Contributors](https://github.com/PiedWeb/:package_skake/graphs/contributors)

License
-------

[](#license)

The MIT License (MIT). Please see [License File](LICENSE) for more information.

[![Latest Version](https://camo.githubusercontent.com/0596bd7c5ca5ca44c4eef11790e96cfccb825acf2e463a998206169d72e3ada4/68747470733a2f2f696d672e736869656c64732e696f2f6769746875622f7461672f506965645765622f53656f506f636b6574437261776c65722e7376673f7374796c653d666c6174266c6162656c3d72656c65617365)](https://github.com/PiedWeb/SeoPocketCrawler/tags)[![Software License](https://camo.githubusercontent.com/f251623e510f5909f16ae3f4e6e548dac11340b9fde1a99be26b015b39272c00/68747470733a2f2f696d672e736869656c64732e696f2f62616467652f6c6963656e73652d4d49542d627269676874677265656e2e7376673f7374796c653d666c6174)](https://github.com/PiedWeb/SeoPocketCrawler/blob/master/LICENSE)[![Build Status](https://camo.githubusercontent.com/16c27c48ddf337211c6f99ec040768701294fba7e9ad7b36278f82df8af1214c/68747470733a2f2f696d672e736869656c64732e696f2f7472617669732f506965645765622f53656f506f636b6574437261776c65722f6d61737465722e7376673f7374796c653d666c6174)](https://travis-ci.org/PiedWeb/SeoPocketCrawler)[![Quality Score](https://camo.githubusercontent.com/04b7aa77e46f3ce14cf188530f465987fcefdaed18ef76fa75d470589f682749/68747470733a2f2f696d672e736869656c64732e696f2f7363727574696e697a65722f672f506965645765622f53656f506f636b6574437261776c65722e7376673f7374796c653d666c6174)](https://scrutinizer-ci.com/g/PiedWeb/SeoPocketCrawler)[![Code Coverage](https://camo.githubusercontent.com/a0517b9da00da4f0d521bceeec611f2bf94b26d2ced4e3f30ca32e91d1b7398c/68747470733a2f2f696d672e736869656c64732e696f2f7363727574696e697a65722f636f7665726167652f672f506965645765622f53656f506f636b6574437261776c65722e7376673f7374796c653d666c6174)](https://scrutinizer-ci.com/g/PiedWeb/SeoPocketCrawler/code-structure)[![Total Downloads](https://camo.githubusercontent.com/faf8d0683f3d43aaefe606e60850101b7dcb515e2bd1956d3f77c4c6ccbd95f2/68747470733a2f2f696d672e736869656c64732e696f2f7061636b61676973742f64742f706965647765622f73656f2d706f636b65742d637261776c65722e7376673f7374796c653d666c6174)](https://packagist.org/packages/piedweb/seo-pocket-crawler)

###  Health Score

32

—

LowBetter than 72% of packages

Maintenance20

Infrequent updates — may be unmaintained

Popularity23

Limited adoption so far

Community11

Small or concentrated contributor base

Maturity60

Established project with proven stability

 Bus Factor1

Top contributor holds 66.7% of commits — single point of failure

How is this calculated?**Maintenance (25%)** — Last commit recency, latest release date, and issue-to-star ratio. Uses a 2-year decay window.

**Popularity (30%)** — Total and monthly downloads, GitHub stars, and forks. Logarithmic scaling prevents top-heavy scores.

**Community (15%)** — Contributors, dependents, forks, watchers, and maintainers. Measures real ecosystem engagement.

**Maturity (30%)** — Project age, version count, PHP version support, and release stability.

###  Release Activity

Cadence

Every ~122 days

Recently: every ~183 days

Total

7

Last Release

1936d ago

PHP version history (2 changes)0.0.1PHP ~7.1

v0.0.7PHP ~7.3|^8.0

### Community

Maintainers

![](https://www.gravatar.com/avatar/afce4cf517928a50560237f1410d5957271fd808671b2216687ecf1422adaee0?d=identicon)[Robin D.](/maintainers/Robin%20D.)

---

Top Contributors

[![RobinDev](https://avatars.githubusercontent.com/u/3944894?v=4)](https://github.com/RobinDev "RobinDev (2 commits)")[![iNem0o](https://avatars.githubusercontent.com/u/1144755?v=4)](https://github.com/iNem0o "iNem0o (1 commits)")

---

Tags

Pied WebSeoPocketCrawler

###  Code Quality

TestsPHPUnit

Code StylePHP\_CodeSniffer

### Embed Badge

![Health badge](/badges/piedweb-seo-pocket-crawler/health.svg)

```
[![Health](https://phpackages.com/badges/piedweb-seo-pocket-crawler/health.svg)](https://phpackages.com/packages/piedweb-seo-pocket-crawler)
```

###  Alternatives

[shlinkio/shlink

A self-hosted and PHP-based URL shortener application with CLI and REST interfaces

4.8k4.3k](/packages/shlinkio-shlink)[psalm/plugin-laravel

Psalm plugin for Laravel

3274.9M308](/packages/psalm-plugin-laravel)[php-soap/wsdl

Deals with WSDLs

173.5M12](/packages/php-soap-wsdl)[league/uri-components

URI components manipulation library

31932.3M67](/packages/league-uri-components)[aedart/athenaeum

Athenaeum is a mono repository; a collection of various PHP packages

255.2k](/packages/aedart-athenaeum)[piedweb/text-spinner

Generate content from a spinned text (content spinning via spinning tool ak rephraser)

2419.5k](/packages/piedweb-text-spinner)

PHPackages © 2026

[Directory](/)[Categories](/categories)[Trending](/trending)[Changelog](/changelog)[Analyze](/analyze)
