PHPackages                             mostlyserious/craft-text-extractor - PHPackages - PHPackages  [Skip to content](#main-content)[PHPackages](/)[Directory](/)[Categories](/categories)[Trending](/trending)[Leaderboard](/leaderboard)[Changelog](/changelog)[Analyze](/analyze)[Collections](/collections)[Log in](/login)[Sign up](/register)

1. [Directory](/)
2. /
3. [Utility &amp; Helpers](/categories/utility)
4. /
5. mostlyserious/craft-text-extractor

ActiveCraft-plugin[Utility &amp; Helpers](/categories/utility)

mostlyserious/craft-text-extractor
==================================

A tool to extract text from documents.

1.0.1(11mo ago)015PHPPHP &gt;=8.2

Since May 1Pushed 11mo ago1 watchersCompare

[ Source](https://github.com/mostlyserious/craft-text-extractor)[ Packagist](https://packagist.org/packages/mostlyserious/craft-text-extractor)[ RSS](/packages/mostlyserious-craft-text-extractor/feed)WikiDiscussions main Synced 1mo ago

READMEChangelogDependencies (5)Versions (3)Used By (0)

Text Extractor
==============

[](#text-extractor)

A tool to extract text from documents and insert it into Craft CMS Asset Elements.

Requirements
------------

[](#requirements)

This plugin requires Craft CMS 5.0.0 or later, and PHP 8.2 or later.

Features
--------

[](#features)

- Supports PDF (.pdf) and MS Word (.docx) files
    - Password-protected PDF files are [not supported](https://github.com/smalot/pdfparser/blob/master/doc/Usage.md#pdf-encryption).
- Extracts text on Asset creation and when Asset files are replaced
- Includes an Action to extract text from the Assets index view.

Configuration
-------------

[](#configuration)

Extracted document text is inserted into the custom field handle defined by the plugin. The default field handle is `body`.

You can customize the handle by adding a plugin config file.

```
