
tiefan/google-pdf-scraper
=========================

A PHP library to filter PDF documents in Google Drive for Daniel Fischl.

v0.17 (7y ago) | MIT | PHP ^7.2

Since Dec 21 | Pushed 7y ago

[Source](https://github.com/kcb0126/google-pdf-scraper) | [Packagist](https://packagist.org/packages/tiefan/google-pdf-scraper)

Google PDF Scraper with Keywords
--------------------------------

This is a PHP library to filter PDF documents in Google Drive for Daniel Fischl.

To add it to your project, use Composer:

```
composer require tiefan/google-pdf-scraper
```

---

### Extract text from PDF document

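```
// $fileId: the Google Drive file ID of the PDF, as a string
$text = PdfScraper::textFromDriveId($fileId);
```

```
// $url: the Google Drive URL of the PDF, as a string
$text = PdfScraper::textFromDriveUrl($url);
```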
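
The README does not say how `textFromDriveUrl` maps a URL to a file, so the following is only a sketch of one plausible step: pulling the file ID out of common Google Drive URL shapes. The function name `driveIdFromUrl` is hypothetical and not part of this library, and the two URL shapes handled are assumptions.

```php
<?php
// Hypothetical helper (not part of the library): extract a file ID from a Drive URL.
function driveIdFromUrl(string $url): ?string
{
    // Assumed shape 1: https://drive.google.com/file/d/<id>/view
    if (preg_match('~/file/d/([A-Za-z0-9_-]+)~', $url, $m)) {
        return $m[1];
    }
    // Assumed shape 2: https://drive.google.com/open?id=<id>
    $query = parse_url($url, PHP_URL_QUERY);
    if (is_string($query)) {
        parse_str($query, $params);
        if (isset($params['id']) && is_string($params['id']) && $params['id'] !== '') {
            return $params['id'];
        }
    }
    // No recognizable ID in the URL.
    return null;
}
```

If the ID can be recovered this way, it could then be passed to `PdfScraper::textFromDriveId`; whether the library does the same internally is not confirmed by the README.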

### Check Document with "Begin" and "End" Keyword

```
// $fileId, $begin: strings; $end: optional string (defaults to null)
$isThatDocument = PdfScraper::checkKeywordsFromDriveId($fileId, $begin, $end);
```

```
// $url, $begin: strings; $end: optional string (defaults to null)
$isThatDocument = PdfScraper::checkKeywordsFromDriveUrl($url, $begin, $end);
```

```
// $doc is a Drive URL or a file ID; the second argument is true for a URL (default), false for an ID
$scraper = new PdfScraper($doc, true);
$isThatDocument = $scraper->checkKeywords($begin, $end); // $end is optional (defaults to null)
```
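
The README does not define how the "Begin" and "End" keywords are matched. As a rough illustration only, the sketch below assumes the check means "the trimmed text starts with `$begin` and, when `$end` is given, finishes with `$end`". The function `textMatchesKeywords` is a hypothetical stand-in, and the library's real rules may differ.

```php
<?php
// Hypothetical stand-in for the library's keyword check; the matching rule is an assumption.
function textMatchesKeywords(string $text, string $begin, ?string $end = null): bool
{
    $text = trim($text);
    // The text must start with the "begin" keyword.
    if (strncmp($text, $begin, strlen($begin)) !== 0) {
        return false;
    }
    // With no "end" keyword, the "begin" match alone decides.
    if ($end === null || $end === '') {
        return true;
    }
    // Otherwise the text must also finish with the "end" keyword.
    return substr($text, -strlen($end)) === $end;
}
```

Text returned by `PdfScraper::textFromDriveId` could be passed to a function like this to experiment with keyword pairs before running the library's own check.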

### Using MySQL or MariaDB to process data at once

The following code uses the database schema in `Sample\db_pdf_scraper.sql`.

```
$pdfDB = new PdfDB($host, $username, $password, $database);
$processed_count = $pdfDB->checkPdfs();
```

### Health Score

25 (Low). Better than 35% of packages.

- Maintenance: 20. Infrequent updates; may be unmaintained.
- Popularity: 7. Limited adoption so far.
- Community: 6. Small or concentrated contributor base.
- Maturity: 55. Maturing project, gaining track record.
- Bus Factor: 1. Top contributor holds 100% of commits, a single point of failure.

How is this calculated?

**Maintenance (25%)**: Last commit recency, latest release date, and issue-to-star ratio. Uses a 2-year decay window.

**Popularity (30%)**: Total and monthly downloads, GitHub stars, and forks. Logarithmic scaling prevents top-heavy scores.

**Community (15%)**: Contributors, dependents, forks, watchers, and maintainers. Measures real ecosystem engagement.

**Maturity (30%)**: Project age, version count, PHP version support, and release stability.
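
As a quick check of how these weights combine, the sketch below assumes the overall score is a weighted sum of the four sub-scores listed on this page, rounded to the nearest integer. The rounding step and the function name `weightedHealthScore` are assumptions; the page does not state the exact formula.

```php
<?php
// Sub-scores and weights (in percent) as listed on this page.
$scores  = ['maintenance' => 20, 'popularity' => 7, 'community' => 6, 'maturity' => 55];
$weights = ['maintenance' => 25, 'popularity' => 30, 'community' => 15, 'maturity' => 30];

// Hypothetical helper: weighted sum of sub-scores, rounded to the nearest integer (assumed).
function weightedHealthScore(array $scores, array $weights): int
{
    $sum = 0;
    foreach ($weights as $name => $percent) {
        // Integer arithmetic: 25*20 + 30*7 + 15*6 + 30*55 = 2450
        $sum += $percent * $scores[$name];
    }
    return (int) round($sum / 100); // 24.5 rounds to 25
}

echo weightedHealthScore($scores, $weights), "\n"; // prints 25
```

Under that assumption the result agrees with the overall score of 25 shown for this package.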

### Release Activity

- Cadence: every ~1 days
- Total: 17
- Last Release: 2725d ago

### Community

Maintainers: [YifanZhou](/maintainers/tiefan) ([@tiefan](https://github.com/tiefan))

Top Contributors: [kcb0126](https://github.com/kcb0126) (25 commits)

Tags: pdf, keywords, scraping, google-docs
