PHPackages                             dlindberg/blob-chunk - PHPackages - PHPackages  [Skip to content](#main-content)[PHPackages](/)[Directory](/)[Categories](/categories)[Trending](/trending)[Leaderboard](/leaderboard)[Changelog](/changelog)[Analyze](/analyze)[Collections](/collections)[Log in](/login)[Sign up](/register)

1. [Directory](/)
2. /
3. [Utility &amp; Helpers](/categories/utility)
4. /
5. dlindberg/blob-chunk

ActiveLibrary[Utility &amp; Helpers](/categories/utility)

dlindberg/blob-chunk
====================

Utility for breaking up a content fragment of HTML for search indexing

0.1.0(7y ago)00MITPHPPHP ^7.2

Since Mar 27Pushed 7y ago1 watchersCompare

[ Source](https://github.com/dlindberg/BlobChunk)[ Packagist](https://packagist.org/packages/dlindberg/blob-chunk)[ Docs](https://github.com/dlindberg/blob-chunk)[ RSS](/packages/dlindberg-blob-chunk/feed)WikiDiscussions master Synced today

READMEChangelog (1)Dependencies (3)Versions (2)Used By (0)

blob-chunk
==========

[](#blob-chunk)

[![Latest Version on Packagist](https://camo.githubusercontent.com/e11f94834179487fc2ce7fd807a389da3315301da3a2029c4fb77f0cc11770a6/68747470733a2f2f696d672e736869656c64732e696f2f7061636b61676973742f762f646c696e64626572672f626c6f622d6368756e6b2e7376673f7374796c653d666c61742d737175617265)](https://packagist.org/packages/dlindberg/blob-chunk)[![Software License](https://camo.githubusercontent.com/55c0218c8f8009f06ad4ddae837ddd05301481fcf0dff8e0ed9dadda8780713e/68747470733a2f2f696d672e736869656c64732e696f2f62616467652f6c6963656e73652d4d49542d627269676874677265656e2e7376673f7374796c653d666c61742d737175617265)](LICENSE.md)[![Build Status](https://camo.githubusercontent.com/996a2438154b8e2d988ffa768a43aba4b82645866196e1230a5fcfe258d6d2b0/68747470733a2f2f696d672e736869656c64732e696f2f7472617669732f646c696e64626572672f426c6f624368756e6b2f6d61737465722e7376673f7374796c653d666c61742d737175617265)](https://travis-ci.org/dlindberg/BlobChunk)[![Coverage Status](https://camo.githubusercontent.com/ccba01028e297008448ce57cc27781cee33c369d6ca70d4cca30684aacf0bca6/68747470733a2f2f696d672e736869656c64732e696f2f7363727574696e697a65722f636f7665726167652f672f646c696e64626572672f426c6f624368756e6b2e7376673f7374796c653d666c61742d737175617265)](https://scrutinizer-ci.com/g/dlindberg/BlobChunk/code-structure)[![Quality Score](https://camo.githubusercontent.com/750c96da6c5abb06f7f2901bd7bda0c2d2a73ddd44aad43ac807eb1317ff22af/68747470733a2f2f696d672e736869656c64732e696f2f7363727574696e697a65722f672f646c696e64626572672f426c6f624368756e6b2e7376673f7374796c653d666c61742d737175617265)](https://scrutinizer-ci.com/g/dlindberg/BlobChunk)[![Total Downloads](https://camo.githubusercontent.com/ba3f6f9e833cf42668656e7c99396fd0737989cd878b007c6e0d134e6bedede2/68747470733a2f2f696d672e736869656c64732e696f2f7061636b61676973742f64742f646c696e64626572672f426c6f624368756e6b2e7376673f7374796c653d666c61742d737175617265)](https://packagist.org/packages/dlindberg/BlobChunk)

This is currently an early work in progress. The purpose of this project is to take a content block of html and break it apart into smaller chunks to make to improve indexing with search appliances such as Algolia, where frequently the raw html content is too large to fit within the index limits.

Install
-------

[](#install)

Via Composer

```
$ composer require dlindberg/blob-chunk

```

Basic Usage
-----------

[](#basic-usage)

```
$blobChunk = new dlindberg\BlobChunk();
$result = $blobChunk->parse($html);

```

Returns an array of content chunks. By default it attempts to break out lists, tables, header tags, and paragraphs as separate elements. It also breaks apart paragraphs into sentences. There is a reasonable amount of surface area for extensibility and configuration; however, that area of the project is still somewhat of a work in progress.

Change log
----------

[](#change-log)

Please see [CHANGELOG](CHANGELOG.md) for more information on what has changed recently.

Testing
-------

[](#testing)

```
$ composer test

```

The current tests for the manager are reasonably thorough. Tests on the parser and parent class need to be improved.

Contributing
------------

[](#contributing)

Please see [CONTRIBUTING](CONTRIBUTING.md) and [CODE\_OF\_CONDUCT](CODE_OF_CONDUCT.md) for details.

Security
--------

[](#security)

If you discover any security related issues, please email  instead of using the issue tracker.

Credits
-------

[](#credits)

- [Dane Lindberg](https://github.com/dlindberg)
- [All Contributors](../../contributors)

The boiler plate for this project is based on [ The League of Extraordinary Packages'](http://thephpleague.com) [Skeleton](https://github.com/thephpleague/skeleton) package repository.

License
-------

[](#license)

The MIT License (MIT). Please see [License File](LICENSE.md) for more information.

###  Health Score

19

—

LowBetter than 10% of packages

Maintenance20

Infrequent updates — may be unmaintained

Popularity0

Limited adoption so far

Community7

Small or concentrated contributor base

Maturity44

Maturing project, gaining track record

 Bus Factor1

Top contributor holds 100% of commits — single point of failure

How is this calculated?**Maintenance (25%)** — Last commit recency, latest release date, and issue-to-star ratio. Uses a 2-year decay window.

**Popularity (30%)** — Total and monthly downloads, GitHub stars, and forks. Logarithmic scaling prevents top-heavy scores.

**Community (15%)** — Contributors, dependents, forks, watchers, and maintainers. Measures real ecosystem engagement.

**Maturity (30%)** — Project age, version count, PHP version support, and release stability.

###  Release Activity

Cadence

Unknown

Total

1

Last Release

2602d ago

### Community

Maintainers

![](https://www.gravatar.com/avatar/145a7517b65de71107e43e44c0dc0d7c1e8734b913fa931aaf9986f12e8f2ba1?d=identicon)[dlindberg](/maintainers/dlindberg)

---

Top Contributors

[![dlindberg](https://avatars.githubusercontent.com/u/684461?v=4)](https://github.com/dlindberg "dlindberg (4 commits)")

---

Tags

Dlindbergblob-chunk

###  Code Quality

TestsPHPUnit

Code StylePHP\_CodeSniffer

### Embed Badge

![Health badge](/badges/dlindberg-blob-chunk/health.svg)

```
[![Health](https://phpackages.com/badges/dlindberg-blob-chunk/health.svg)](https://phpackages.com/packages/dlindberg-blob-chunk)
```

PHPackages © 2026

[Directory](/)[Categories](/categories)[Trending](/trending)[Changelog](/changelog)[Analyze](/analyze)
