| type | : | package |
| id | : | mogul:/lager/porter-stemmer |
| section | : | mogul:/lager |
| blurb | : | A Porter stemmer written in C and linked into Oz |
| author | : | Torbjörn Lager |
| category | : | nlp |
| documentation | : | index.html |
| download | : | lager-porter-stemmer__1.2.5__source__0.pkg lager-porter-stemmer__1.3.0__source__0.pkg |
| provides | : | [nlp] x-ozlib://lager/porter-stemmer/EnglishStemmer.so{native} [nlp] x-ozlib://lager/porter-stemmer/stem.exe |
This native functor creates a module that exports a function for stemming English words with the Porter stemming algorithm. Quoting Martin Porter himself:
The Porter stemming algorithm (or 'Porter stemmer') is a process for removing the commoner morphological and inflexional endings from words in English. Its main use is as part of a term normalisation process that is usually done when setting up Information Retrieval systems.
The algorithm was originally described in Porter, M.F., 1980, An algorithm for suffix stripping, Program, 14(3): 130-137. It has since been reprinted in Sparck Jones, Karen, and Peter Willett, 1997, Readings in Information Retrieval, San Francisco: Morgan Kaufmann, ISBN 1-55860-454-5.
This implementation simply links a C implementation (written by Martin Porter) into Oz.