Learning Stochastic Tree Edit Distance

Marc Bernard; Amaury Habrard; Marc Sebban

Communication Dans Un Congrès Année : 2006

Learning Stochastic Tree Edit Distance

(1) , (2) , (1)

1
2

Marc Bernard

Fonction : Auteur
PersonId : 836237

Laboratoire Hubert Curien

Amaury Habrard

Fonction : Auteur
PersonId : 439
IdHAL : amaury-habrard
ORCID : 0000-0003-3038-9347
IdRef : 084103655

Laboratoire d'informatique Fondamentale de Marseille - UMR 6166

Marc Sebban

Fonction : Auteur
PersonId : 5203
IdHAL : marc-sebban
ORCID : 0000-0001-6851-169X
IdRef : 050802623

Laboratoire Hubert Curien

Résumé

Trees provide a suited structural representation to deal with complex tasks such as web information extraction, RNA secondary structure prediction, or conversion of tree structured documents. In this context, many applications require the calculation of similarities between tree pairs. The most studied distance is likely the tree edit distance for which improvements in terms of complexity have been achieved during the last decade. However, this classic edit distance usually uses a priori fixed edit costs which are often difficult to tune, that leaves little room for tackling complex problems. In this paper, we focus on the learning of a stochastic tree edit distance. We use an adaptation of the expectation-maximization algorithm for learning the primitive edit costs. We carried out several series of experiments that confirm the interest to learn a tree edit distance rather than a priori imposing edit costs.

Mots clés

Stochastic tree edit distance EM algorithm generative models discriminative models

Domaines

Apprentissage [cs.LG]

Fichier principal

bernard_habrard_sebban_ecml_2006.pdf (166.79 Ko)

Marc Bernard : Connectez-vous pour contacter le contributeur

https://ujm.hal.science/ujm-00109696

Soumis le : jeudi 16 novembre 2006-11:47:54

Dernière modification le : vendredi 24 mars 2023-14:52:48

Archivage à long terme le : mardi 6 avril 2010-20:56:26

Dates et versions

ujm-00109696 , version 1 (16-11-2006)

Identifiants

HAL Id : ujm-00109696 , version 1

Citer

Marc Bernard, Amaury Habrard, Marc Sebban. Learning Stochastic Tree Edit Distance. 17th European Conference on Machine Learning, Sep 2006, Berlin, Germany. pp.42-53. ⟨ujm-00109696⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

UNIV-ST-ETIENNE IOGS LIF CNRS UNIV-AMU LAHC PARISTECH LIS-LAB UDL

134 Consultations

211 Téléchargements

Learning Stochastic Tree Edit Distance

Résumé

Mots clés

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Partager