PHPackages                             pforret/pf\_pageparser - PHPackages - PHPackages  [Skip to content](#main-content)[PHPackages](/)[Directory](/)[Categories](/categories)[Trending](/trending)[Leaderboard](/leaderboard)[Changelog](/changelog)[Analyze](/analyze)[Collections](/collections)[Log in](/login)[Sign up](/register)

1. [Directory](/)
2. /
3. [Parsing &amp; Serialization](/categories/parsing)
4. /
5. pforret/pf\_pageparser

ActiveLibrary[Parsing &amp; Serialization](/categories/parsing)

pforret/pf\_pageparser
======================

Simple Regex Page Parser in PHP

2.0.4(1w ago)264112MITHTMLPHP ^8.0

Since Jun 3Pushed 3y ago1 watchersCompare

[ Source](https://github.com/pforret/pf_pageparser)[ Packagist](https://packagist.org/packages/pforret/pf_pageparser)[ Docs](https://github.com/pforret/pf_pageparser)[ RSS](/packages/pforret-pf-pageparser/feed)WikiDiscussions master Synced 3d ago

READMEChangelog (6)Dependencies (8)Versions (14)Used By (2)

pf\_pageparser
==============

[](#pf_pageparser)

[![Latest Version on Packagist](https://camo.githubusercontent.com/7b7b7297f0aa581175f2e289006e43f72400d5d1d46ae29c316e91f729e09f06/68747470733a2f2f696d672e736869656c64732e696f2f7061636b61676973742f762f70666f727265742f70665f706167657061727365722e7376673f7374796c653d666c61742d737175617265)](https://packagist.org/packages/pforret/pf_pageparser)[![Build Status](https://camo.githubusercontent.com/6686b45d9da8eca001892dda865c80e9af23bd53082a8b51f3ce177525fe36fb/68747470733a2f2f696d672e736869656c64732e696f2f7472617669732f70666f727265742f70665f706167657061727365722f6d61737465722e7376673f7374796c653d666c61742d737175617265)](https://travis-ci.org/pforret/pf_pageparser)[![Quality Score](https://camo.githubusercontent.com/f19b0db33905ce23a8ab152eca7033f424fb6aba622388f40a2f9dc7e5ed82a0/68747470733a2f2f696d672e736869656c64732e696f2f7363727574696e697a65722f672f70666f727265742f70665f706167657061727365722e7376673f7374796c653d666c61742d737175617265)](https://scrutinizer-ci.com/g/pforret/pf_pageparser)[![Total Downloads](https://camo.githubusercontent.com/a40c1b0d255a0e539ed44fe347036efb60be8f1c4defc7c7f3d98e8867adf90b/68747470733a2f2f696d672e736869656c64732e696f2f7061636b61676973742f64742f70666f727265742f70665f706167657061727365722e7376673f7374796c653d666c61742d737175617265)](https://packagist.org/packages/pforret/pf_pageparser)

This is a HTML parser I've written because I scrape a lot of web sites to look for structured, repetitive data. This parser allows me to easily cleanup HTML, split it into chunks and find the right data in each chunk It does not use a DOM parser, so it also works on partial or invalid HTML

Installation
------------

[](#installation)

You can install the package via composer:

```
composer require pforret/pf_pageparser
```

Usage
-----

[](#usage)

```
$pp=New PfPageparser(["cacheTtl" => 300]);

$pp->load_from_url("http://www.example.com/products")
    ->trim("
