PHPackages                             akecul/pdfparser - PHPackages - PHPackages  [Skip to content](#main-content)[PHPackages](/)[Directory](/)[Categories](/categories)[Trending](/trending)[Leaderboard](/leaderboard)[Changelog](/changelog)[Analyze](/analyze)[Collections](/collections)[Log in](/login)[Sign up](/register)

1. [Directory](/)
2. /
3. [PDF &amp; Document Generation](/categories/documents)
4. /
5. akecul/pdfparser

ActiveLibrary[PDF &amp; Document Generation](/categories/documents)

akecul/pdfparser
================

Forks from smalot/pdfparser library. Fixes text decoding error with some Chinese text.

v2.2.1.1(3y ago)064LGPL-3.0PHPPHP &gt;=7.1

Since Sep 2Pushed 3y agoCompare

[ Source](https://github.com/akeculture/pdfparser)[ Packagist](https://packagist.org/packages/akecul/pdfparser)[ Docs](https://www.pdfparser.org)[ RSS](/packages/akecul-pdfparser/feed)WikiDiscussions master Synced 1mo ago

READMEChangelogDependencies (1)Versions (60)Used By (0)

Fixes text decoding error with some Chinese text;

PDF parser
==========

[](#pdf-parser)

[![Version](https://camo.githubusercontent.com/e889215cf3210cab9e1c48f50b9b0c2ccbf12a390efcbded7d8f8e01d867114d/68747470733a2f2f706f7365722e707567782e6f72672f736d616c6f742f7064667061727365722f76)](//packagist.org/packages/smalot/pdfparser)[![CI](https://github.com/smalot/pdfparser/workflows/CI/badge.svg)](https://github.com/smalot/pdfparser/workflows/CI/badge.svg)[![CS](https://github.com/smalot/pdfparser/workflows/CS/badge.svg)](https://github.com/smalot/pdfparser/workflows/CS/badge.svg)[![Scrutinizer Code Quality](https://camo.githubusercontent.com/628256d04d1178773663462fc834ddd1f012267c8fb2975be1fc229865336cf0/68747470733a2f2f7363727574696e697a65722d63692e636f6d2f672f736d616c6f742f7064667061727365722f6261646765732f7175616c6974792d73636f72652e706e673f623d6d6173746572)](https://scrutinizer-ci.com/g/smalot/pdfparser/?branch=master)[![Downloads](https://camo.githubusercontent.com/10fc7335b816956046fc2f16ba8bad1dcec958064c42f68010ad3568c260f61c/68747470733a2f2f706f7365722e707567782e6f72672f736d616c6f742f7064667061727365722f646f776e6c6f616473)](//packagist.org/packages/smalot/pdfparser)

The `ake/pdfparser` forks from `smalot/pdfparser`. It is a standalone PHP package that provides various tools to extract data from PDF files.

This library is under **active maintenance**. There is no active development by the author of this library (at the moment), but we welcome any pull request adding/extending functionality!

Features
--------

[](#features)

- Load/parse objects and headers
- Extract metadata (author, description, ...)
- Extract text from ordered pages
- Support of compressed PDFs
- Support of MAC OS Roman charset encoding
- Handling of hexa and octal encoding in text sections
- Create custom configurations (see [CustomConfig.md](/doc/CustomConfig.md)).

Currently, secured documents and extracting form data are not supported.

License
-------

[](#license)

This library is under the [LGPLv3 license](https://github.com/smalot/pdfparser/blob/master/LICENSE.txt).

Install
-------

[](#install)

This library requires PHP 7.1+ since [v1](https://github.com/smalot/pdfparser/releases/tag/v1.0.0). You can install it via [Composer](https://getcomposer.org/):

```
composer require ake/pdfparser
```

In case you can't use Composer, you can include `alt_autoload.php-dist`. It will include all required files automatically.

Quick example
-------------

[](#quick-example)

```
