
sohib/crawler
=============

Scans the HTML at a given URL for HTTP URLs and returns them as JSON, XML, or an array.

v1.0.1 (10y ago) · MIT · PHP >=5.3.0

Since Dec 29 · Pushed 10y ago · 1 watcher

[Source](https://github.com/Sohib/Crawler) · [Packagist](https://packagist.org/packages/sohib/crawler)

Crawler
=======

[![Latest Version on Packagist](https://camo.githubusercontent.com/9720bceda89c7e36d0ed508396a8963fad9e0fb0057a7dccfdc7fbeb0fa9a3fb/68747470733a2f2f696d672e736869656c64732e696f2f7061636b61676973742f762f736f6869622f637261776c65722e7376673f7374796c653d666c61742d737175617265)](https://packagist.org/packages/sohib/crawler)[![Software License](https://camo.githubusercontent.com/55c0218c8f8009f06ad4ddae837ddd05301481fcf0dff8e0ed9dadda8780713e/68747470733a2f2f696d672e736869656c64732e696f2f62616467652f6c6963656e73652d4d49542d627269676874677265656e2e7376673f7374796c653d666c61742d737175617265)](LICENSE.md)[![Total Downloads](https://camo.githubusercontent.com/d6bdb14b0614a889c2b443c891289c782fd3875f757ac8aaebd782aba7141a15/68747470733a2f2f696d672e736869656c64732e696f2f7061636b61676973742f64742f736f6869622f637261776c65722e7376673f7374796c653d666c61742d737175617265)](https://packagist.org/packages/sohib/crawler)

Note
----

I'm not going to maintain this package; I wrote it just to learn how to create a Composer package.

Install
-------

Via Composer

```
$ composer require sohib/crawler
```

Usage
-----

```
// Load Composer's autoloader (adjust the path to your project layout).
require 'vendor/autoload.php';

use Sohib\Crawler;

// URLs of the pages to scan.
$urls = [
    "http://thatsthefinger.com/",
    "http://just-shower-thoughts.tumblr.com/",
    "http://ducksarethebest.com/",
];

$crawler = new Crawler\Scanner($urls);
$scanner = $crawler->scan();
var_dump($scanner->toArray()); // URLs as an array
var_dump($scanner->toJson());  // URLs as JSON
var_dump($scanner->toXml());   // URLs as XML
```
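
To make the idea behind the usage example concrete, here is a minimal, self-contained sketch of the general technique: pull `http(s)` links out of an HTML string and serialize them as an array, JSON, or XML. It is not this package's implementation. The helper names `extractHttpUrls()` and `urlsToXml()`, the sample markup, and the `<urls>`/`<url>` element names are all assumptions made for illustration, and the sketch works on a string rather than fetching pages over the network.

```php
<?php
// Illustrative sketch only; not the code used by sohib/crawler.
// extractHttpUrls() and urlsToXml() are hypothetical helper names.

/**
 * Collect absolute http(s) URLs from the href attributes of <a> tags.
 *
 * @param string $html Raw HTML markup.
 * @return string[] Unique URLs in document order.
 */
function extractHttpUrls($html)
{
    $dom = new DOMDocument();
    // Real-world HTML is rarely valid, so buffer parser warnings instead of printing them.
    $previous = libxml_use_internal_errors(true);
    $dom->loadHTML($html);
    libxml_clear_errors();
    libxml_use_internal_errors($previous);

    $urls = array();
    foreach ($dom->getElementsByTagName('a') as $anchor) {
        $href = trim($anchor->getAttribute('href'));
        if (preg_match('#^https?://#i', $href)) {
            $urls[$href] = true; // array keys de-duplicate while keeping order
        }
    }
    return array_keys($urls);
}

/**
 * Serialize a list of URLs as a small XML document.
 * The <urls>/<url> element names are an assumption for this sketch.
 */
function urlsToXml(array $urls)
{
    $doc = new DOMDocument('1.0', 'UTF-8');
    $root = $doc->appendChild($doc->createElement('urls'));
    foreach ($urls as $url) {
        $node = $root->appendChild($doc->createElement('url'));
        // A text node escapes "&" and "<" when the document is saved.
        $node->appendChild($doc->createTextNode($url));
    }
    return $doc->saveXML();
}

// Sample markup (made up for this example): one relative link and one duplicate.
$html = '<p><a href="http://example.com/a">A</a> <a href="/relative">B</a> '
      . '<a href="https://example.org/b?x=1&amp;y=2">C</a> '
      . '<a href="http://example.com/a">A again</a></p>';

$urls = extractHttpUrls($html);
var_dump($urls);              // the URLs as an array
echo json_encode($urls), "\n"; // the URLs as JSON
echo urlsToXml($urls);         // the URLs as XML
```

Relative links are skipped and duplicates collapse to a single entry. Building the XML through DOM text nodes leaves the escaping of characters such as `&` to the library.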

Change log
----------

Please see [CHANGELOG](CHANGELOG.md) for more information on what has changed recently.

Contributing
------------

Please see [CONTRIBUTING](CONTRIBUTING.md) and [CONDUCT](CONDUCT.md) for details.

Credits
-------

- [Sohib Algotinel](https://github.com/sohib)

License
-------

The MIT License (MIT). Please see [License File](LICENSE.md) for more information.

### Health Score

26 (Low): better than 41% of packages

- Maintenance: 20. Infrequent updates; may be unmaintained.
- Popularity: 7. Limited adoption so far.
- Community: 5. Small or concentrated contributor base.
- Maturity: 60. Established project with proven stability.

How is this calculated?

- **Maintenance (25%)**: Last commit recency, latest release date, and issue-to-star ratio. Uses a 2-year decay window.
- **Popularity (30%)**: Total and monthly downloads, GitHub stars, and forks. Logarithmic scaling prevents top-heavy scores.
- **Community (15%)**: Contributors, dependents, forks, watchers, and maintainers. Measures real ecosystem engagement.
- **Maturity (30%)**: Project age, version count, PHP version support, and release stability.

### Release Activity

- Cadence: every ~0 days
- Total: 3
- Last release: 3836d ago
- Major versions: 0.0.0 → v1.0.1 (2015-12-30)

### Community

Maintainers

- Suhayb Alghutaymil ([@Sohib](https://github.com/Sohib))

---

Tags

url, page, crawler, scanner

### Alternatives

- vstelmakh/url-highlight: Library to parse urls from string input
- package-url/packageurl-php: Builder and parser based on the package URL (purl) specification.
- helgesverre/receipt-scanner: Use OpenAI to extract structured receipt and invoice data from Text, Html, Images and PDFs.
- strscan/strscan: Simple string tokenizer for lexical scanning operations
- zenstruck/uri: Object-oriented wrapper/manipulator for `parse_url` with additional features.
- crwlr/robots-txt: Robots Exclusion Standard/Protocol Parser for Web Crawling/Scraping
