PHPackages                             xrow/eztika-ls - PHPackages - PHPackages  [Skip to content](#main-content)[PHPackages](/)[Directory](/)[Categories](/categories)[Trending](/trending)[Leaderboard](/leaderboard)[Changelog](/changelog)[Analyze](/analyze)[Collections](/collections)[Log in](/login)[Sign up](/register)

1. [Directory](/)
2. /
3. [Parsing &amp; Serialization](/categories/parsing)
4. /
5. xrow/eztika-ls

ActiveEzpublish-legacy-extension[Parsing &amp; Serialization](/categories/parsing)

xrow/eztika-ls
==============

A wrapper script for the standalone Tika toolkit that allows conversion to plain text and indexing of a large variety of binary file types like MsWord, MsOffice, PDF, Excel, ODF, ....

2.0.0(3y ago)324.4k4[2 issues](https://github.com/xrowgmbh/eztika/issues)GPL-2.0PHP

Since Sep 12Pushed 3y ago13 watchersCompare

[ Source](https://github.com/xrowgmbh/eztika)[ Packagist](https://packagist.org/packages/xrow/eztika-ls)[ RSS](/packages/xrow-eztika-ls/feed)WikiDiscussions master Synced 1mo ago

READMEChangelog (10)Dependencies (3)Versions (13)Used By (0)

eZ Tika is an extension that enables a handler for converting multiple binary file formats to plain text as used by the search engine (if you enabled those attributes as searchable).

Currently, most common office formats are enabled (see also binaryfile.ini.append.php):

```
[application/pdf]
[application/msword]
[application/vnd.ms-excel]
[application/vnd.ms-powerpoint]
[application/vnd.visio]
[application/vnd.ms-outlook]
[application/xml]
[application/rtf]
[application/vnd.oasis.opendocument.text]
[application/vnd.oasis.opendocument.presentation]
[application/vnd.oasis.opendocument.spreadsheet]
[application/vnd.oasis.opendocument.formula]
[application/zip]
[application/vnd.openxmlformats-officedocument.wordprocessingml.document]
[application/vnd.openxmlformats-officedocument.spreadsheetml.sheet]
[application/vnd.openxmlformats-officedocument.presentationml.presentation]
[application/octet-stream]

```

Installation:
-------------

[](#installation)

See [INSTALL.md](INSTALL.md)

License:
--------

[](#license)

License for all but the tika.jar file: GNU GPL 2.0. tika.jar is licensed with the ASF License (Apache)

Running tests:
--------------

[](#running-tests)

The bundle uses PHPUnit to run functional tests.

*NB* the tests do *not* mock interaction with the database, but create/modify/delete many types of data in it. As such, there are good chances that running tests will leave stale/broken data. It is recommended to run the tests suite using a dedicated eZPublish installation or at least a dedicated database.

#### Setting up a dedicated test environment for the bundle

[](#setting-up-a-dedicated-test-environment-for-the-bundle)

A safe choice to run the tests of the extension is to set up a dedicated environment, similar to the one used when the test suite is run on GitHub Actions. The advantages are multiple: on one hand you can start with any version of eZPublish you want; on the other you will be more confident that any tests you add or modify will also pass on GitHub. The disadvantages are that you will need Docker and Docker-compose, and that the environment you will use will look quite unlike a standard eZPublish setup! Also, it will take a considerable amount of disk space and time to build.

Steps to set up a dedicated test environment and run the tests in it:

```
git clone --depth 1 https://github.com/tanoconsulting/euts.git teststack
# if you have a github auth token, it is a good idea to copy it now to teststack/docker/data/.composer/auth.json

# this config sets up a test environment with eZPlatform 2.5 running on php 7.4 / ubuntu jammy
export TESTSTACK_CONFIG_FILE=Tests/environment/.euts.2.5.env

./teststack/teststack build
./teststack/teststack runtests
./teststack/teststack stop

```

Note: this will take some time the 1st time your run it, but it will be quicker on subsequent runs. Note: make sure to have enough disk space available.

In case you want to run manually commands, such as the symfony console:

```
./teststack/teststack console cache:clear

```

Or easily get to a database shell prompt:

```
./teststack/teststack dbconsole

```

Or command-line shell prompt to the Docker container where tests are run:

```
./teststack/teststack shell

```

The tests in the Docker container run using the version of debian/php/mysql/eZPlatform kernel specified in the file `Tests/environment/.euts.2.5.env`, as specified in env var `TESTSTACK_CONFIG_FILE`. If no value is set for that environment variable, a file named `.euts.env` is looked for. If no such file is present, some defaults are used, you can check the documentation in ./teststack/README.md to find out what they are. If you want to test against a different version of eZ/php/debian, feel free to:

- create the `.euts.env` file, if it does not exist
- add to it any required var (see file `teststack/.euts.env.example` as guidance)
- rebuild the test stack
- run tests the usual way

You can even keep multiple test stacks available in parallel, by using different env files, eg:

- create a file `.euts.env.local` and add to it any required env var, starting with a unique `COMPOSE_PROJECT_NAME`
- build the new test stack via `./teststack/teststack. -e .euts.env.local build`
- run the tests via: `./teststack/teststack -e .euts.env.local runtests`

[![License](https://camo.githubusercontent.com/2b2196f6502fffa8ce04354777d75e75526d6829dc874a3ab1acc1db1685da46/68747470733a2f2f706f7365722e707567782e6f72672f78726f772f657a74696b612d6c732f6c6963656e7365)](https://packagist.org/packages/xrow/eztika-ls)[![Latest Stable Version](https://camo.githubusercontent.com/a9038b3e281e0c68fa31c6d0d4759a4b02d04ce333428f0f71e8902e4c0c09e1/68747470733a2f2f706f7365722e707567782e6f72672f78726f772f657a74696b612d6c732f762f737461626c65)](https://packagist.org/packages/xrow/eztika-ls)[![Total Downloads](https://camo.githubusercontent.com/14058f89a99b8042171153ada7fdc34941d9c51f1b4794bc3bf8d2e93d6f0215/68747470733a2f2f706f7365722e707567782e6f72672f78726f772f657a74696b612d6c732f646f776e6c6f616473)](https://packagist.org/packages/xrow/eztika-ls)

[![Build Status](https://github.com/xrowgmbh/eztika/actions/workflows/ci.yml/badge.svg)](https://github.com/xrowgmbh/eztika/actions/workflows/ci.yml)[![Code Coverage](https://camo.githubusercontent.com/b0e8c572aedc1d267f2b73fa27c5522b6a4e1199fa742f7abfecb2a335d41955/68747470733a2f2f636f6465636f762e696f2f67682f78726f77676d62682f657a74696b612f6272616e63682f6d61696e2f67726170682f62616467652e737667)](https://codecov.io/gh/xrowgmbh/eztika/tree/master)

###  Health Score

34

—

LowBetter than 77% of packages

Maintenance7

Infrequent updates — may be unmaintained

Popularity28

Limited adoption so far

Community17

Small or concentrated contributor base

Maturity70

Established project with proven stability

 Bus Factor1

Top contributor holds 67.7% of commits — single point of failure

How is this calculated?**Maintenance (25%)** — Last commit recency, latest release date, and issue-to-star ratio. Uses a 2-year decay window.

**Popularity (30%)** — Total and monthly downloads, GitHub stars, and forks. Logarithmic scaling prevents top-heavy scores.

**Community (15%)** — Contributors, dependents, forks, watchers, and maintainers. Measures real ecosystem engagement.

**Maturity (30%)** — Project age, version count, PHP version support, and release stability.

###  Release Activity

Cadence

Every ~297 days

Recently: every ~230 days

Total

11

Last Release

1291d ago

Major Versions

1.15.0 → 2.0.02022-11-04

### Community

Maintainers

![](https://www.gravatar.com/avatar/f08bee051f28728169796d79eb57b9387fb49abf244faaa1517952d7e22e1a7d?d=identicon)[xrow](/maintainers/xrow)

---

Top Contributors

[![gggeek](https://avatars.githubusercontent.com/u/308634?v=4)](https://github.com/gggeek "gggeek (21 commits)")[![xrow](https://avatars.githubusercontent.com/u/295491?v=4)](https://github.com/xrow "xrow (7 commits)")[![xrowrobert](https://avatars.githubusercontent.com/u/1880146?v=4)](https://github.com/xrowrobert "xrowrobert (2 commits)")[![jsb81](https://avatars.githubusercontent.com/u/14348404?v=4)](https://github.com/jsb81 "jsb81 (1 commits)")

###  Code Quality

TestsPHPUnit

### Embed Badge

![Health badge](/badges/xrow-eztika-ls/health.svg)

```
[![Health](https://phpackages.com/badges/xrow-eztika-ls/health.svg)](https://phpackages.com/packages/xrow-eztika-ls)
```

###  Alternatives

[masterminds/html5

An HTML5 parser and serializer.

1.8k242.8M229](/packages/masterminds-html5)[sabberworm/php-css-parser

Parser for CSS Files written in PHP

1.8k191.2M65](/packages/sabberworm-php-css-parser)[jms/metadata

Class/method/property metadata management in PHP

1.8k152.8M88](/packages/jms-metadata)[jms/serializer-bundle

Allows you to easily serialize, and deserialize data of any complexity

1.8k89.3M627](/packages/jms-serializer-bundle)[hassankhan/config

Lightweight configuration file loader that supports PHP, INI, XML, JSON, and YAML files

97513.5M170](/packages/hassankhan-config)[meyfa/php-svg

Read, edit, write, and render SVG files with PHP

54613.9M42](/packages/meyfa-php-svg)

PHPackages © 2026

[Directory](/)[Categories](/categories)[Trending](/trending)[Changelog](/changelog)[Analyze](/analyze)
