
stemmer

Description

Bindings for the Snowball project's libstemmer. It implements the Porter stemming algorithm and was in fact written by Dr. Martin Porter himself, making the implementation correct by definition!

Author

Moritz Heidkamp

Requirements

You need libstemmer itself as well as its headers to be able to install this egg.

Documentation

(available-stemmers) procedure

Returns a list of symbols identifying the available language stemmers.
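Because the result is a plain list of symbols, it can be searched with memq before a stemmer is created. The following is a minimal sketch; maybe-make-stemmer is a hypothetical helper, not part of the egg, and the (use stemmer) form follows the example on this page.

```scheme
(use stemmer)

;; Hypothetical helper (not provided by the egg): returns a stemmer
;; for LANGUAGE if libstemmer offers one, or #f otherwise.
(define (maybe-make-stemmer language)
  (and (memq language (available-stemmers))
       (make-stemmer language)))
```

Checking first lets a program fall back gracefully when the installed libstemmer lacks a language, instead of relying on whatever make-stemmer does with an unknown symbol.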

(make-stemmer language #!optional encoding) procedure

Returns a stemmer for the given language, which must be a symbol contained in the list returned by available-stemmers. In addition, an encoding string may be given. The current version of libstemmer supports "UTF_8", "ISO_8859_1", "CP850" and "KOI8_R". Since strings in Chicken are UTF-8 encoded, the default is "UTF_8".
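As a sketch of the two call forms, assuming the installed libstemmer provides an english stemmer and accepts the "ISO_8859_1" encoding name listed above:

```scheme
(use stemmer)

;; Default encoding ("UTF_8"), suitable for ordinary Chicken strings.
(define english (make-stemmer 'english))

;; Explicit encoding. Words passed to this stemmer are assumed to
;; already be ISO-8859-1 encoded byte strings.
(define english-latin1 (make-stemmer 'english "ISO_8859_1"))
```

An explicit encoding is mainly useful when the input comes from a source that is not UTF-8 and is passed through without conversion.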

(stem stemmer word) procedure

Returns the stem of the given word according to the language algorithm of stemmer, which must be a stemmer object as returned by make-stemmer. Note that this procedure is not thread-safe, so you either have to lock around it appropriately or make sure that stemmer objects are thread-local.
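One way to do the locking is with a SRFI-18 mutex. This is a minimal sketch, not something the egg provides: shared-stemmer, shared-stemmer-lock and stem/locked are hypothetical names, and it assumes the srfi-18 unit is loaded alongside the egg.

```scheme
(use stemmer srfi-18)

;; One stemmer shared by all threads, guarded by one mutex.
(define shared-stemmer (make-stemmer 'german))
(define shared-stemmer-lock (make-mutex))

;; Hypothetical wrapper: only one thread at a time may call stem on
;; the shared stemmer. dynamic-wind releases the mutex even if stem
;; signals an error.
(define (stem/locked word)
  (dynamic-wind
    (lambda () (mutex-lock! shared-stemmer-lock))
    (lambda () (stem shared-stemmer word))
    (lambda () (mutex-unlock! shared-stemmer-lock))))
```

The alternative mentioned above, one stemmer object per thread, avoids the lock entirely at the cost of creating more stemmers.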

Example

(use stemmer)
(define german (make-stemmer 'german))
(stem german "Häuser") ; => "Haus"

License

 Copyright (c) 2011-2012, Moritz Heidkamp
 All rights reserved.
 
 Redistribution and use in source and binary forms, with or without
 modification, are permitted provided that the following conditions are
 met:
 
 Redistributions of source code must retain the above copyright
 notice, this list of conditions and the following disclaimer.
 
 Redistributions in binary form must reproduce the above copyright
 notice, this list of conditions and the following disclaimer in the
 documentation and/or other materials provided with the distribution.
 
 Neither the name of the author nor the names of its contributors may
 be used to endorse or promote products derived from this software
 without specific prior written permission.
 
 THIS SOFTWARE IS PROVIDED BY THE COPYRIGHT HOLDERS AND CONTRIBUTORS
 "AS IS" AND ANY EXPRESS OR IMPLIED WARRANTIES, INCLUDING, BUT NOT
 LIMITED TO, THE IMPLIED WARRANTIES OF MERCHANTABILITY AND FITNESS
 FOR A PARTICULAR PURPOSE ARE DISCLAIMED. IN NO EVENT SHALL THE
 COPYRIGHT HOLDERS OR CONTRIBUTORS BE LIABLE FOR ANY DIRECT,
 INDIRECT, INCIDENTAL, SPECIAL, EXEMPLARY, OR CONSEQUENTIAL DAMAGES
 (INCLUDING, BUT NOT LIMITED TO, PROCUREMENT OF SUBSTITUTE GOODS OR
 SERVICES; LOSS OF USE, DATA, OR PROFITS; OR BUSINESS INTERRUPTION)
 HOWEVER CAUSED AND ON ANY THEORY OF LIABILITY, WHETHER IN CONTRACT,
 STRICT LIABILITY, OR TORT (INCLUDING NEGLIGENCE OR OTHERWISE)
 ARISING IN ANY WAY OUT OF THE USE OF THIS SOFTWARE, EVEN IF ADVISED
 OF THE POSSIBILITY OF SUCH DAMAGE.
