PHPackages                             lalbert/daric - PHPackages - PHPackages  [Skip to content](#main-content)[PHPackages](/)[Directory](/)[Categories](/categories)[Trending](/trending)[Leaderboard](/leaderboard)[Changelog](/changelog)[Analyze](/analyze)[Collections](/collections)[Log in](/login)[Sign up](/register)

1. [Directory](/)
2. /
3. [Utility &amp; Helpers](/categories/utility)
4. /
5. lalbert/daric

ActiveLibrary[Utility &amp; Helpers](/categories/utility)

lalbert/daric
=============

Simple and configurable PHP web spider and web scraper

v0.1.0(9y ago)129MITPHPCI failing

Since Aug 22Pushed 6y ago1 watchersCompare

[ Source](https://github.com/lalbert/daric)[ Packagist](https://packagist.org/packages/lalbert/daric)[ RSS](/packages/lalbert-daric/feed)WikiDiscussions master Synced 2d ago

READMEChangelogDependencies (4)Versions (4)Used By (0)

Daric
=====

[](#daric)

Daric is a Simple and configurable PHP web spider and web scraper written under the [Goutte](https://github.com/FriendsOfPHP/Goutte) library.

Installation
------------

[](#installation)

The best way to install Daric it use composer

```
composer require lalbert/daric

```

Usage
-----

[](#usage)

There are two components : `Scraper` and `Spider`.

### Scraper

[](#scraper)

`Scaper` is used to extract, clean, and format web page data.

It uses extractors, cleaners and formatters to achieve its goals.

```
use Daric\Scraper;
use Daric\Extractor\CrawlerExtractorFactory;

$scraper = new Scrapper('http://website.tld');
$scraper->setExtractors([
  'meta_title' => CrawlerExtractorFactory::create('title@_text'), // get text node of
  'meta_description' => CrawlerExtractorFactory::create('meta[name="description"]@content'), // get attribute "content" of
  'list' => CrawlerExtractorFactory::create('#content ul.list li@_text("array")') // get all text node of li item. Return an array
]);

$doc = $scraper->scrape(); // return Daric\Document

echo $doc->getData('meta_title');
print_r($doc['list']);
```

### Spider

[](#spider)

`Spider` is used to crawl a website to scrape some web page data.

```
use Daric\Spider;
use Daric\Scraper;
use Daric\Extractor\CrawlerExtractorFactory;

$spider = new Spider('http://website.tld');

$spider->setLinkExtractor(CrawlerExtractorFactory::create('#content article a.link@href("array")'));
$spider->setNextLinkExtractor(CrawlerExtractorFactory::create('#nav a.next@href'));

foreach ($spider as $pageUri) {
  $scraper = new Scraper($pageUri, $extractors, $cleaners, $formatters);
  $doc = $scraper->scrape();
  ...
}
```

Licence
-------

[](#licence)

Daric is licensed under the MIT License - see the [LICENSE](LICENSE) file for details

###  Health Score

25

—

LowBetter than 35% of packages

Maintenance20

Infrequent updates — may be unmaintained

Popularity9

Limited adoption so far

Community7

Small or concentrated contributor base

Maturity55

Maturing project, gaining track record

 Bus Factor1

Top contributor holds 100% of commits — single point of failure

How is this calculated?**Maintenance (25%)** — Last commit recency, latest release date, and issue-to-star ratio. Uses a 2-year decay window.

**Popularity (30%)** — Total and monthly downloads, GitHub stars, and forks. Logarithmic scaling prevents top-heavy scores.

**Community (15%)** — Contributors, dependents, forks, watchers, and maintainers. Measures real ecosystem engagement.

**Maturity (30%)** — Project age, version count, PHP version support, and release stability.

###  Release Activity

Cadence

Unknown

Total

1

Last Release

3601d ago

### Community

Maintainers

![](https://avatars.githubusercontent.com/u/6500272?v=4)[lalbert](/maintainers/lalbert)[@lalbert](https://github.com/lalbert)

---

Top Contributors

[![lucalbert](https://avatars.githubusercontent.com/u/6500267?v=4)](https://github.com/lucalbert "lucalbert (37 commits)")

---

Tags

scraperspiderweb-scraperweb-spiderspiderscraper

###  Code Quality

TestsPHPUnit

### Embed Badge

![Health badge](/badges/lalbert-daric/health.svg)

```
[![Health](https://phpackages.com/badges/lalbert-daric/health.svg)](https://phpackages.com/packages/lalbert-daric)
```

###  Alternatives

[vdb/php-spider

A configurable and extensible PHP web spider

1.3k185.8k7](/packages/vdb-php-spider)[raiym/instagram-php-scraper

Instagram PHP Scraper. Get account information, photos and videos without any authorization

3.3k1.2M6](/packages/raiym-instagram-php-scraper)[jaeger/querylist

Simple, elegant, extensible PHP Web Scraper (crawler/spider),Use the css3 dom selector,Based on phpQuery! 简洁、优雅、可扩展的PHP采集工具(爬虫)，基于phpQuery。

2.8k657.7k88](/packages/jaeger-querylist)[craftcms/feed-me

Import content from XML, RSS, CSV or JSON feeds into entries, categories, Craft Commerce products, and more.

293952.6k30](/packages/craftcms-feed-me)[jaybizzle/laravel-crawler-detect

A Laravel package to detect web crawlers via the user agent

3302.7M18](/packages/jaybizzle-laravel-crawler-detect)[raulr/google-play-scraper

A PHP scraper to get app data from Google Play

12994.7k](/packages/raulr-google-play-scraper)

PHPackages © 2026

[Directory](/)[Categories](/categories)[Trending](/trending)[Changelog](/changelog)[Analyze](/analyze)