Variants of tree kernels for XML documents

Autor(en): Geibel, P.
Gust, H.
Kühnberger, K.-U. 
Stichwörter: Classification (of information); Data acquisition; Fuzzy sets; XML, Tree kernels; XML documents, Trees (mathematics)
Erscheinungsdatum: 2007
Herausgeber: Springer Verlag
Journal: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volumen: 4827 LNAI
Startseite: 850
Seitenende: 860
Zusammenfassung: 
In this paper, we discuss tree kernels that can be applied for the classification of XML documents based on their DOM trees. DOM trees are ordered trees, in which every node might be labeled by a vector of attributes including its XML tag and the textual content. We describe four new kernels suitable for this kind of trees: a tree kernel derived from the well-known parse tree kernel, the set tree kernel that allows permutations of children, the string tree kernel being an extension of the so-called partial tree kernel, and the soft tree kernel, which is based on the set tree kernel and takes into a account a "fuzzy" comparison of child positions. We present first results on an artificial data set, a corpus of newspaper articles, for which we want to determine the type (genre) of an article based on its structure alone, and the well-known SUSANNE corpus. © Springer-Verlag Berlin Heidelberg 2007.
Beschreibung: 
Conference of 6th Mexican International Conference on Artificial Intelligence, MICAI 2007 ; Conference Date: 4 November 2007 Through 10 November 2007; Conference Code:71204
ISBN: 9783540766308
ISSN: 03029743
DOI: 10.1007/978-3-540-76631-5_81
Externe URL: https://www.scopus.com/inward/record.uri?eid=2-s2.0-38149096114&doi=10.1007%2f978-3-540-76631-5_81&partnerID=40&md5=1a196f7297c3b1989b384d97a64cc6d6

Zur Langanzeige

Seitenaufrufe

2
Letzte Woche
0
Letzter Monat
1
geprüft am 13.05.2024

Google ScholarTM

Prüfen

Altmetric