PHPackages                             arkhee/webbookscraper - PHPackages - PHPackages  [Skip to content](#main-content)[PHPackages](/)[Directory](/)[Categories](/categories)[Trending](/trending)[Leaderboard](/leaderboard)[Changelog](/changelog)[Analyze](/analyze)[Collections](/collections)[Log in](/login)[Sign up](/register)

1. [Directory](/)
2. /
3. [Utility &amp; Helpers](/categories/utility)
4. /
5. arkhee/webbookscraper

ActiveLibrary[Utility &amp; Helpers](/categories/utility)

arkhee/webbookscraper
=====================

This simple PHP project can be used to extract groups of page organized with a summary and individual pages per chapter , like web novels websites

1.1.1(1y ago)09GPL-3.0-or-laterPHPPHP ^8

Since Jun 14Pushed 1y ago1 watchersCompare

[ Source](https://github.com/Arkhee/WebBookScraper)[ Packagist](https://packagist.org/packages/arkhee/webbookscraper)[ Docs](https://github.com/Arkhee/WebBookScraper)[ RSS](/packages/arkhee-webbookscraper/feed)WikiDiscussions main Synced 1mo ago

READMEChangelog (5)DependenciesVersions (6)Used By (0)

WebBookScraper : Extract online webnovels !
===========================================

[](#webbookscraper--extract-online-webnovels-)

This simple PHP project can be used to extract groups of page organized with a summary and individual pages per chapter , like web novels websites

How it works
------------

[](#how-it-works)

The script will download the page located at the provided url and extract its content. It will then search for a list of links located in a specific container and download them too. In case the destination pages contain images, the script will list them and store them in memory for each chapter.

The default behaviour is to store the content in memory for further use, but there is an option to cache files as well. This is useful in case of a large table of content.

How to use
----------

[](#how-to-use)

The script is a simple PHP class that can be used in any PHP project. The main class is `WebBookScraper` and it requires a few parameters to work:

- The URL of the main page (string)
- The option to debug, this will create a log file (boolean)

The default behaviour of the parser is to located content in the following parts of the pagen, by default :

- The "title" of the page is read from the "article head" selector
- The "content" of the page is read from the "article div.entry-content" selector These can be modified at runtime by calling the `setSelectors[...]` static methods.

How to install
--------------

[](#how-to-install)

The project is available on packagist and can be installed using composer:

```
composer require "arkhee/webbookscraper""
```

Example
-------

[](#example)

There is a sample provided to see how it works, have a look at the sample folder To use it as-is you must use the Simplepubgen and the WebBookScraper packages. Created a new folder on your server and copy the sample file at it's root Install both packages with composer and run the sample file.

```
composer require "arkhee/simplepubgen"
composer require "arkhee/webbookscraper"
```

###  Health Score

23

—

LowBetter than 27% of packages

Maintenance32

Infrequent updates — may be unmaintained

Popularity4

Limited adoption so far

Community7

Small or concentrated contributor base

Maturity44

Maturing project, gaining track record

 Bus Factor1

Top contributor holds 100% of commits — single point of failure

How is this calculated?**Maintenance (25%)** — Last commit recency, latest release date, and issue-to-star ratio. Uses a 2-year decay window.

**Popularity (30%)** — Total and monthly downloads, GitHub stars, and forks. Logarithmic scaling prevents top-heavy scores.

**Community (15%)** — Contributors, dependents, forks, watchers, and maintainers. Measures real ecosystem engagement.

**Maturity (30%)** — Project age, version count, PHP version support, and release stability.

###  Release Activity

Cadence

Every ~0 days

Total

5

Last Release

701d ago

### Community

Maintainers

![](https://www.gravatar.com/avatar/271a59581d2c780806e1ac688fa003796b386306cc2ac893bb905095259a744b?d=identicon)[Arkhee](/maintainers/Arkhee)

---

Top Contributors

[![Arkhee](https://avatars.githubusercontent.com/u/3664145?v=4)](https://github.com/Arkhee "Arkhee (69 commits)")

### Embed Badge

![Health badge](/badges/arkhee-webbookscraper/health.svg)

```
[![Health](https://phpackages.com/badges/arkhee-webbookscraper/health.svg)](https://phpackages.com/packages/arkhee-webbookscraper)
```

###  Alternatives

[pid/speakingurl

Generate of so called 'static' or 'Clean URL' or 'Pretty URL' or 'nice-looking URL' or 'Speaking URL' or 'user-friendly URL' or 'SEO-friendly URL' or 'slug' from a string.

1.1k5.3k1](/packages/pid-speakingurl)[ronanguilloux/php-gpio

GPIO-related utils &amp; toolkit PHP library

2678.0k](/packages/ronanguilloux-php-gpio)[bmidget/kohana-formo

Formo is a form module for Kohana 3 that lets you work with forms as objects

1328.4k](/packages/bmidget-kohana-formo)[daxslab/yii2-taggedview

Extension to help setup the standard HTML meta tags besides the ones defined by Opengraph and Twitter to contribute to website SEO

114.5k](/packages/daxslab-yii2-taggedview)

PHPackages © 2026

[Directory](/)[Categories](/categories)[Trending](/trending)[Changelog](/changelog)[Analyze](/analyze)
