I. Feinerer, A. Karatzoglou:
"Fast text mining using kernels in R";
Talk: COMPSTAT 2008 International Conference on Computational Statistics, Porto, Portugal; 08-24-2008 - 08-29-2008; in: "COMPSTAT 2008-Proceedings in Computational Statistics", (2008), ISBN: 978-3-7908-2083-6; 8 pages.

English abstract:
Recent advances in kernel-based machine learning methods enable fast processing of text using string kernels built on suffix arrays. kernlab provides both an infrastructure for kernel methods and a large collection of already implemented algorithms, including an implementation of suffix-array-based string kernels. Together with tm, these packages provide R with functionality for processing, visualizing, and grouping large collections of text data using kernel methods. We focus on the performance of various types of string kernels at these tasks.

Keywords:
string kernels, text mining, clustering, R

