PHPackages                             sastrawi/tokenizer - PHPackages - PHPackages  [Skip to content](#main-content)[PHPackages](/)[Directory](/)[Categories](/categories)[Trending](/trending)[Leaderboard](/leaderboard)[Changelog](/changelog)[Analyze](/analyze)[Collections](/collections)[Log in](/login)[Sign up](/register)

1. [Directory](/)
2. /
3. [Parsing &amp; Serialization](/categories/parsing)
4. /
5. sastrawi/tokenizer

AbandonedArchivedLibrary[Parsing &amp; Serialization](/categories/parsing)

sastrawi/tokenizer
==================

PHP library that allows you to tokenize Bahasa Indonesia.

v0.4.0(11y ago)3694718[2 issues](https://github.com/sastrawi/tokenizer/issues)MITPHPPHP &gt;=5.3

Since Dec 4Pushed 11y ago3 watchersCompare

[ Source](https://github.com/sastrawi/tokenizer)[ Packagist](https://packagist.org/packages/sastrawi/tokenizer)[ Docs](https://github.com/sastrawi/tokenizer)[ RSS](/packages/sastrawi-tokenizer/feed)WikiDiscussions master Synced 1mo ago

READMEChangelogDependencies (6)Versions (6)Used By (0)

Sastrawi Tokenizer
==================

[](#sastrawi-tokenizer)

[![Build Status](https://camo.githubusercontent.com/d8d3b36a8e3e199d198b68116b87073e9a96392611e223fdde486a499e201cdc/68747470733a2f2f7472617669732d63692e6f72672f73617374726177692f746f6b656e697a65722e7376673f6272616e63683d6d6173746572)](https://travis-ci.org/sastrawi/tokenizer) [![Scrutinizer Code Quality](https://camo.githubusercontent.com/8f8713bebfdc493bb1c692e052ab9186fe67c23592e06b80f2c7de227e999b93/68747470733a2f2f7363727574696e697a65722d63692e636f6d2f672f73617374726177692f746f6b656e697a65722f6261646765732f7175616c6974792d73636f72652e706e673f623d6d6173746572)](https://scrutinizer-ci.com/g/sastrawi/tokenizer/?branch=master) [![Code Coverage](https://camo.githubusercontent.com/87bee24e6912a6f9844e3ea4fcf6a7bbf4643009166ae91fd3ea5df28dda4aad/68747470733a2f2f7363727574696e697a65722d63692e636f6d2f672f73617374726177692f746f6b656e697a65722f6261646765732f636f7665726167652e706e673f623d6d6173746572)](https://scrutinizer-ci.com/g/sastrawi/tokenizer/?branch=master) [![Latest Stable Version](https://camo.githubusercontent.com/541ae8ba37a126d91786a0f94a38a14a4424bbf5e542fd4bc943fdb87cc46a9b/68747470733a2f2f706f7365722e707567782e6f72672f73617374726177692f746f6b656e697a65722f762f737461626c652e706e67)](https://packagist.org/packages/sastrawi/tokenizer)

Sastrawi Tokenizer adalah library PHP untuk melakukan tokenization pada Bahasa Indonesia.

Tokenization
------------

[](#tokenization)

```
Saya sedang belajar NLP Bahasa Indonesia.

```

Text di atas dapat di-tokenize menjadi:

```
["Saya", "sedang", "belajar", "NLP", "Bahasa", "Indonesia", "."]
```

Sastrawi Tokenizer
------------------

[](#sastrawi-tokenizer-1)

- *Library PHP* untuk melakukan *tokenization* pada Bahasa Indonesia.
- Mudah diintegrasikan dengan *framework* / *package* lainnya.
- Mempunyai *API* yang sederhana dan mudah digunakan.

Demo
----

[](#demo)

Cara Install
------------

[](#cara-install)

Sastrawi Tokenizer dapat diinstall dengan [Composer](https://getcomposer.org).

1. Buka terminal (command line) dan arahkan ke directory project Anda.
2. [Download Composer](https://getcomposer.org/download/) sehingga file `composer.phar` berada di directory tersebut.
3. Tambahkan Sastrawi Sentence Detector ke file `composer.json` Anda :

```
php composer.phar require sastrawi/tokenizer:0.*
```

Jika Anda masih belum memahami bagaimana cara menggunakan Composer, silahkan baca [Getting Started with Composer](https://getcomposer.org/doc/00-intro.md).

Penggunaan
----------

[](#penggunaan)

#### Melalui kode PHP

[](#melalui-kode-php)

Copy kode berikut di directory project anda. Lalu jalankan file tersebut.

```
