PHPackages                             palex/spider4schema - PHPackages - PHPackages  [Skip to content](#main-content)[PHPackages](/)[Directory](/)[Categories](/categories)[Trending](/trending)[Leaderboard](/leaderboard)[Changelog](/changelog)[Analyze](/analyze)[Collections](/collections)[Log in](/login)[Sign up](/register)

1. [Directory](/)
2. /
3. [Parsing &amp; Serialization](/categories/parsing)
4. /
5. palex/spider4schema

AbandonedArchivedLibrary[Parsing &amp; Serialization](/categories/parsing)

palex/spider4schema
===================

A Web Bot that crawls the http://Schema.org web site to retrieve all available Types and Properties in order to create a JSON file and also some PHP libraries.

v1.2.0(11y ago)8371MITPHPPHP &gt;=5.3.0

Since Sep 21Pushed 6y ago1 watchersCompare

[ Source](https://github.com/PAlexcom/Spider4Schema)[ Packagist](https://packagist.org/packages/palex/spider4schema)[ Docs](https://github.com/PAlexcom/Spider4Schema)[ RSS](/packages/palex-spider4schema/feed)WikiDiscussions master Synced 3w ago

READMEChangelogDependencies (3)Versions (2)Used By (0)

Spider4Schema [![Build Status](https://camo.githubusercontent.com/fa45934552884cdb08a27f99e90a27c4eace788fdcc4b34ea8fc0a24684b9b36/68747470733a2f2f7472617669732d63692e6f72672f616c6578707275742f53706964657234536368656d612e737667)](https://travis-ci.org/alexprut/Spider4Schema)
===================================================================================================================================================================================================================================================================================

[](#spider4schema-)

A Web Bot that crawls the  web site to retrieve all available Types and Properties in order to create a JSON file and also some PHP libraries.
For generating Microdata or RDFa Lite 1.1 semantics you can use the PHP library . Created during the Google Summer of Code 2013 and 2014.

(**Deprecated**)

Documentation
-------------

[](#documentation)

#### Files structure:

[](#files-structure)

- `configuration.php` → the configuration file, setup the type of library to be created.
- `http.php` → a class that handles all HTTP requests.
- `parser.php` → methods to parse the HTML and retrieve all needed information.
- `fileCreator.php` → methods to create the library files.

Usage
-----

[](#usage)

- Make sure you have the cURL library installed, and the PHP CLI shell script package
- Clone the repo: git clone
- Enter `Spider4Schema/` directory
- Open your `terminal/shell` and call `php bin/spider.php [minified|json|normal] [true|false|verbose]`

The libraries will be created in the `dist/` folder.

Library types
-------------

[](#library-types)

There are 3 types of libraries you can create:

- JSON → a .json file containing all available Types and Properties, used in library  for generating valid Microdata and RDFa Lite 1.1 semantics
- Minified → a .php file with an array containing all available Types and Properties
- Normal → each Type is a PHP class file (an abstract class with static Properties)

Performance
-----------

[](#performance)

The **json** library:
1 .json file, 91 KB, contains all available Types (620+) and its Properties

The **minified** library:
1 php file, 107 KB, contains all available Types (620+) and its Properties, stored in a hash table (array)

The **normal** abstract static library:
622 php files, 710 KB, 1 file for each available Type

Todos
-----

[](#todos)

- Add to the all the required properties specified by Google, Yandex, Baidu.
- Instead of making 620+ HTTP requests, parse one file: [https://schema.org/docs/schema\_org\_rdfa.html](https://schema.org/docs/schema_org_rdfa.html)
- Write tests.

License
-------

[](#license)

Spider4Schema is licensed under the MIT License – see the LICENSE file for details.

###  Health Score

28

—

LowBetter than 52% of packages

Maintenance20

Infrequent updates — may be unmaintained

Popularity14

Limited adoption so far

Community8

Small or concentrated contributor base

Maturity58

Maturing project, gaining track record

 Bus Factor1

Top contributor holds 100% of commits — single point of failure

How is this calculated?**Maintenance (25%)** — Last commit recency, latest release date, and issue-to-star ratio. Uses a 2-year decay window.

**Popularity (30%)** — Total and monthly downloads, GitHub stars, and forks. Logarithmic scaling prevents top-heavy scores.

**Community (15%)** — Contributors, dependents, forks, watchers, and maintainers. Measures real ecosystem engagement.

**Maturity (30%)** — Project age, version count, PHP version support, and release stability.

###  Release Activity

Cadence

Unknown

Total

1

Last Release

4303d ago

### Community

Maintainers

![](https://www.gravatar.com/avatar/d2702f3e7b85b58e5ffa6861cef5bea340d3b2c398f28e458ffd8756a49c8997?d=identicon)[P.Alex](/maintainers/P.Alex)

---

Top Contributors

[![alexprut](https://avatars.githubusercontent.com/u/1648497?v=4)](https://github.com/alexprut "alexprut (19 commits)")

---

Tags

jsonphpschema-orgwebbotschemasemanticschema.org

###  Code Quality

TestsPHPUnit

### Embed Badge

![Health badge](/badges/palex-spider4schema/health.svg)

```
[![Health](https://phpackages.com/badges/palex-spider4schema/health.svg)](https://phpackages.com/packages/palex-spider4schema)
```

###  Alternatives

[justinrainbow/json-schema

A library to validate a json schema.

3.6k334.7M789](/packages/justinrainbow-json-schema)[mf2/mf2

A pure, generic microformats2 parser — makes HTML as easy to consume as a JSON API

202313.4k45](/packages/mf2-mf2)[hasbridge/json-schema-validator

PHP 5.3 implementation of json schema validation

124460.4k3](/packages/hasbridge-json-schema-validator)[psx/schema

Parse and generate data schema formats

57245.5k24](/packages/psx-schema)[crwlr/schema-org

Extract schema.org structured data from HTML documents.

1528.0k2](/packages/crwlr-schema-org)[juststeveking/resume-php

A PHP library for building and working with the JSON resume schema.

1061.0k](/packages/juststeveking-resume-php)

PHPackages © 2026

[Directory](/)[Categories](/categories)[Trending](/trending)[Changelog](/changelog)[Analyze](/analyze)
