Linear-time algorithms for the subpath kernel

Shin, Kilho; Ishikawa, Taichi

doi:10.4230/LIPIcs.CPM.2018.22

Abstract

The subpath kernel is a useful positive definite kernel, which takes arbitrary rooted trees as input, no matter whether they are ordered or unordered, We first show that the subpath kernel can exhibit excellent classification performance in combination with SVM through an intensive experiment. Secondly, we develop a theory of irreducible trees, and then, using it as a rigid mathematical basis, reconstruct a bottom-up linear-time algorithm for the subtree kernel, which is a correction of an algorithm well-known in the literature. Thirdly, we show a novel top-down algorithm, with which we can realize a linear-time parallel-computing algorithm to compute the subpath kernel.

Cite As Get BibTex

Kilho Shin and Taichi Ishikawa. Linear-time algorithms for the subpath kernel. In 29th Annual Symposium on Combinatorial Pattern Matching (CPM 2018). Leibniz International Proceedings in Informatics (LIPIcs), Volume 105, pp. 22:1-22:13, Schloss Dagstuhl – Leibniz-Zentrum für Informatik (2018) https://doi.org/10.4230/LIPIcs.CPM.2018.22

Author Details

Kilho Shin

Graduate School of Applied Informatics, University of Hyogo, Minatojima-Minamimachi, Chuo, Kobe, Japan

Taichi Ishikawa

Graduate School of Applied Informatics, University of Hyogo, Minatojima-Minamimachi, Chuo, Kobe, Japan

Funding

Shin, Kilho: This work was supported by JSPS KAKENHI Grant Number JP17H007623 and JP16K12491.

References

Christensen Berg, C. and R. J. P. R., Ressel. Harmonic analysis on semigroups. theory of positive definite and related functions. Springer, 1984.
C. C. Chang and C. J. Lin. Libsvm: a library for support vector machines, 2001. URL: http://www.csie.ntu.edu.tw/~cjlin/libsvm/.
M. Collins and N. Duffy. Convolution kernels for natural language. Neural Information Processing Systems, 2001.
J. Demšar. Statistical comparisons of classifiers over multiple data sets. Journal of Machine Learning Theory, 7:1-30, 2006.
K. Hashimoto, S. Goto, S. Kawano, K. F. Aoki-Kinoshita, and N. Ueda. KEGG as a glycome informatics resource. Glycobiology, 16:63R-70R, 2006.
D. Haussler. Convolution kernels on discrete structures. UCSC-CRL 99-10, 1999.
T. Kasai, G. Lee, H. Arimura, S. Arikawa, and K. Park. Linear-time longest-common-prefix computation in suffix arrays and its applications. the 12th Annual Symposium on Combinatorial Pattern Matching. pp., 2001.
H. Kashima and T. Koyanagi. Kernels for semi-structured data. in: the 9th international conference on machine learning. ICML, 2002.
D. Kimura and H. Kashima. Fast computation of subpath kernel for trees. ICML, 2012.
T. Kuboyama, K. Hirata, H. Kashima, K.F. Aoki-Kinoshita, and H. Yasuda. A spectrum tree kernel. JSAI, 2007.
C. S. Leslie, E. Eskin, and W. Stafford Noble. The spectrum kernel: A string kernel for SVM protein classification. Pacific Symposium on Biocomputing, 2002.
Alessandro Moschitti. Example data for TREE KERNELS IN SVM-LIGHT. URL: http://disi.unitn.it/moschitti/Tree-Kernel.htm.
S. Pyysalo, A. Airola, J. Heimonen, J. Bjorne, F. Ginter, and T. Salakoski. Comparative analysis of five protein-protein interaction corpora. BMC Bioinformatics, 9(S-3), 2008.
K. Shin and T. Kuboyama. A generalization of Haussler’s convolution kernel - mapping kernel. ICML, 2008.
K. Shin and T. Kuboyama. A comprehensive study of tree kernels. in: Jsai-isai post-workshop proceedings. Lecture Notes in Articial Intelligence, 2014.
K. C. Taï. The tree-to-tree correction problem. journal of the ACM, 1979.
M. J. Zaki and C. C. Aggarwal. XRules: An effective algorithm for structural classification of XML data. Machine Learning, 62:137-170, 2006.
K. Zhang. Algorithms for the constrained editing distance between ordered labeled trees and related problems. Pattern Recognition, 1995.

Linear-time algorithms for the subpath kernel

Authors Kilho Shin, Taichi Ishikawa

File

Document Identifiers

Subject Classification

ACM Subject Classification

Keywords

Metrics

Abstract

Cite As Get BibTex

Author Details

References

Thanks for your feedback!

Could not send message