
woothee/woothee-testset
=======================

A set of test cases for the Woothee project. Project Woothee is a family of user-agent string parsers implemented in multiple languages.

Latest version: v1.13.0. License: Apache-2.0. [Source](https://github.com/woothee/woothee) | [Packagist](https://packagist.org/packages/woothee/woothee-testset)

Project Woothee
===============


Project Woothee is a family of user-agent string parsers implemented in multiple languages.

You can try it on the demo site:

Why new project?
----------------


We need exactly the same logic in two or more programming languages, for use across various frameworks, middleware, and environments.

The most important data in this project is a single set of return values and a set of test cases, which together ensure that every language implementation produces the same results.
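As a rough illustration of how a shared set of test cases can keep implementations in step, the sketch below checks any parser function against a list of expected results. The case layout (a `target` user-agent string plus expected fields) and the names `check_cases` and `fake_parse` are assumptions made for this example, not the project's actual test harness.

```python
from typing import Callable, Dict, List

Result = Dict[str, str]


def check_cases(parse: Callable[[str], Result], cases: List[Result]) -> List[str]:
    """Return a description of every mismatch between parse() and the cases."""
    failures = []
    for case in cases:
        actual = parse(case["target"])
        for key, expected in case.items():
            if key == "target":
                continue  # the input string itself, not an expected field
            if actual.get(key) != expected:
                failures.append(
                    f"{case['target']!r}: {key} expected {expected!r}, "
                    f"got {actual.get(key)!r}"
                )
    return failures


# Hypothetical stand-in for a real implementation's parse function.
def fake_parse(user_agent: str) -> Result:
    if "MSIE 8.0" in user_agent:
        return {"name": "Internet Explorer", "category": "pc", "version": "8.0"}
    return {}


cases = [
    {
        "target": "Mozilla/4.0 (compatible; MSIE 8.0; Windows NT 6.1; Trident/4.0)",
        "name": "Internet Explorer",
        "category": "pc",
        "version": "8.0",
    },
]

failures = check_cases(fake_parse, cases)
```

Running the same case list against each language's parser, and requiring an empty failure list from all of them, is one way to get the cross-language equality described above.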

Implementations:

- [Java](https://github.com/woothee/woothee-java) (and Hive UDF)
- [Perl](https://github.com/woothee/woothee-perl)
- [Ruby](https://github.com/woothee/woothee-ruby)
- [Python](https://github.com/woothee/woothee-python)
- [Javascript](https://github.com/woothee/woothee-js) (Node.js or browser)
- [PHP](https://github.com/woothee/woothee-php)
- [Go](https://github.com/woothee/woothee-go)
- [Rust](https://github.com/woothee/woothee-rust)
- [D](https://gitlab.com/sarneaud/woothee-d)

Versions
--------


- v1.13.0
    - Add Android OS version (Android 11~)
- v1.12.0
    - Add Samsung Browser
    - Add Google's AdsBot
- v1.11.1
    - Update the GSA version in testsets
- v1.11.0
    - Add Google Search App
- v1.10.0
    - Add Microsoft Edge (based on Chromium), Edge for iOS/Android
- v1.9.0
    - Add Android 9
- v1.8.0
    - Add Yandex Browser
- v1.7.0
    - Add trendictionbot crawler
    - Add Yeti 1.1 crawler user agent tests
- v1.6.0
    - Add Android Webview
    - Add curl HTTP library
- v1.5.0
    - Add BingPreview crawler
- v1.4.0
    - Add Vivaldi
- v1.3.0
    - Add Firefox for iOS
- v1.2.1
    - Fix testset bug for BlackBerry 10
- v1.2.0
    - Add Microsoft Edge, Webview of mobile phones
    - Add Windows 10, BlackBerry 10
    - Add Twitterbot
- v1.1.0
    - Add specs for blank input data
- v1.0.1
    - Add variations of MSIE11
- v1.0.0
    - Some implementations are not at v1 yet.

Version numbers are used as follows:

- vX.Y.Z
    - X and Y are the major and minor numbers, which track the specs of the dataset/testsets
    - Z is the patch number, used for release control of each language implementation
    - Z is also used for bug fixes in the testsets (in that case the version of each language implementation should be bumped too)
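One way to read this scheme: two releases target the same dataset/testset specs when their X and Y match, regardless of Z. A minimal sketch of that check follows; `parse_version` and `same_spec` are names invented for this example.

```python
import re
from typing import Tuple

_VERSION = re.compile(r"^v?(\d+)\.(\d+)\.(\d+)$")


def parse_version(text: str) -> Tuple[int, int, int]:
    """Split 'vX.Y.Z' (the leading 'v' is optional) into three integers."""
    match = _VERSION.match(text)
    if match is None:
        raise ValueError(f"not a vX.Y.Z version: {text!r}")
    major, minor, patch = (int(part) for part in match.groups())
    return major, minor, patch


def same_spec(a: str, b: str) -> bool:
    """True when both versions share X and Y, i.e. the same dataset/testset specs."""
    return parse_version(a)[:2] == parse_version(b)[:2]
```

For instance, `same_spec("v1.11.0", "v1.11.1")` holds, while `same_spec("v1.11.1", "v1.12.0")` does not.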

Implementations
---------------


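- [Java](https://github.com/woothee/woothee-java) (and Hive UDF)
- [Perl](https://github.com/woothee/woothee-perl)
- [Ruby](https://github.com/woothee/woothee-ruby)
- [Python](https://github.com/woothee/woothee-python)
- [Javascript](https://github.com/woothee/woothee-js) (Node.js or browser)
- [PHP](https://github.com/woothee/woothee-php)
- [Golang](https://github.com/woothee/woothee-go)
- [Rust](https://github.com/woothee/woothee-rust)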

SYNOPSIS
--------

[](#synopsis)

in Java: (use java/woothee.jar)

```
// import java.util.Map;
// import is.tagomor.woothee.Classifier;
// import is.tagomor.woothee.DataSet;
Map<String, String> r = Classifier.parse("user agent string");

r.get("name");
// => name of browser (or string like name of user-agent)

r.get("category");
// => "pc", "smartphone", "mobilephone", "appliance", "crawler", "misc", "unknown"

r.get("os");
// => os from user-agent, or carrier name of mobile phones

r.get("version");
// => version of browser, or terminal type name of mobile phones

r.get("os_version");
// => "NT 6.3" (for Windows), "10.8.3" (for OSX), "8.0.1" (for iOS), ....
```
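To make the shape of the return value concrete, here is a small Python sketch that checks a result against the fields and category values listed in the Java example. The helper name `check_result` is hypothetical, and treating all five fields as required is an assumption of this sketch rather than something the project states.

```python
from typing import Dict, List

# Field names and category values as listed in the Java example above.
FIELDS = ("name", "category", "os", "version", "os_version")
CATEGORIES = {
    "pc", "smartphone", "mobilephone", "appliance", "crawler", "misc", "unknown",
}


def check_result(result: Dict[str, str]) -> List[str]:
    """Return a list of problems found in a parse result (empty if none)."""
    problems = [f"missing field: {field}" for field in FIELDS if field not in result]
    category = result.get("category")
    if category is not None and category not in CATEGORIES:
        problems.append(f"unexpected category: {category!r}")
    return problems
```

A check like this can be useful when consuming results from several implementations, since it flags a missing field or an unexpected category early.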

in Hive: (copy woothee.jar into your CLASSPATH, and create function)

```
-- add jar to classpath
add jar woothee.jar;
-- create function
CREATE TEMPORARY FUNCTION parse_agent as 'is.tagomor.woothee.hive.ParseAgent';
-- count visits of bots
SELECT parsed_agent['name'] AS botname, COUNT(*) AS cnt
FROM (
  SELECT parse_agent(user_agent) AS parsed_agent
  FROM table_name
  WHERE date='today'
) x
WHERE parsed_agent['category'] = 'crawler'
GROUP BY parsed_agent['name']
ORDER BY cnt DESC LIMIT 1000;
```
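For comparison, the same aggregation can be sketched outside Hive. The function below accepts any parser that takes a user-agent string and returns a dictionary with `name` and `category` keys; `count_crawlers` and `stub_parse` are hypothetical names used only in this example.

```python
from collections import Counter
from typing import Callable, Dict, Iterable, List, Tuple


def count_crawlers(
    user_agents: Iterable[str],
    parse: Callable[[str], Dict[str, str]],
    limit: int = 1000,
) -> List[Tuple[str, int]]:
    """Count visits per crawler name, most frequent first."""
    counts: Counter = Counter()
    for user_agent in user_agents:
        parsed = parse(user_agent)
        if parsed.get("category") == "crawler":
            counts[parsed.get("name", "")] += 1
    return counts.most_common(limit)


# Hypothetical stub so the sketch runs without any parser installed.
def stub_parse(user_agent: str) -> Dict[str, str]:
    if "bot" in user_agent.lower():
        return {"name": user_agent.split("/")[0], "category": "crawler"}
    return {"name": "other", "category": "pc"}


print(count_crawlers(["Twitterbot/1.0", "Twitterbot/1.0", "curl/8.0"], stub_parse))
# prints [('Twitterbot', 2)]
```

With the Python implementation installed, `woothee.parse` could be passed as the `parse` argument in place of the stub.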

in Perl: (cpanm Woothee)

```
use Woothee;
Woothee::parse("Mozilla/4.0 (compatible; MSIE 8.0; Windows NT 6.1; Trident/4.0)");
# => {'name'=>"Internet Explorer", 'category'=>"pc", 'os'=>"Windows 7", 'version'=>"8.0", 'vendor'=>"Microsoft", 'os_version'=>"NT 6.1"}
```

in Ruby: (gem install woothee)

```
require 'woothee'
Woothee.parse("Mozilla/4.0 (compatible; MSIE 8.0; Windows NT 6.1; Trident/4.0)")
# => {:name=>"Internet Explorer", :category=>:pc, :os=>"Windows 7", :version=>"8.0", :vendor=>"Microsoft", :os_version=>"NT 6.1"}
```

in Python:

```
import woothee
woothee.parse("Mozilla/4.0 (compatible; MSIE 8.0; Windows NT 6.1; Trident/4.0)")
# => {'name': 'Internet Explorer', 'category': 'pc', 'os': 'Windows 7', 'version': '8.0', 'vendor': 'Microsoft', 'os_version': 'NT 6.1'}
```

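in Javascript (HTML, copy from release/woothee.js)

```
<!-- the src path is an assumption: point it at your copy of release/woothee.js -->
<script src="woothee.js"></script>
<script>
woothee.parse('Mozilla/4.0 (compatible; MSIE 8.0; Windows NT 6.1; Trident/4.0)')
// => {name: 'Internet Explorer', category: 'pc', os: 'Windows 7', version: '8.0', vendor: 'Microsoft', os_version: 'NT 6.1'}
</script>
```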

in Node.js (npm install woothee)

```
var woothee = require('woothee');
woothee.parse('Mozilla/4.0 (compatible; MSIE 8.0; Windows NT 6.1; Trident/4.0)')
// => {name: 'Internet Explorer', category: 'pc', os: 'Windows 7', version: '8.0', vendor: 'Microsoft', os_version: 'NT 6.1'}
```

in PHP (composer require woothee/woothee:\*)

```
