Learning Balls of Strings with Correction Queries

Abstract : During the 80's, Angluin introduced an active learning paradigm, using an Oracle, capable of answering both membership and equivalence queries. However, critical evidence tends to show that if the former are often available, this is usually not the case of the latter. We propose new queries, called correction queries, which we study in the framework of Grammatical Inference. When a string is submitted to the Oracle, either she validates it if it belongs to the target language, or she proposes a correction, i.e., a string of the language close to the query with respect to the edit distance. We also introduce a non-standard class of languages: The topological balls of strings. We show that this class is not learnable in Angluin's Mat model, but is with a linear number of correction queries. We conduct several experiments with an Oracle simulating a human Expert, and show that our algorithm is resistant to approximate answers.
Complete list of metadatas

https://hal-ujm.archives-ouvertes.fr/ujm-00176268
Contributor : Frédéric Tantini <>
Submitted on : Wednesday, October 3, 2007 - 10:13:16 AM
Last modification on : Wednesday, July 25, 2018 - 2:05:31 PM

Identifiers

  • HAL Id : ujm-00176268, version 1

Collections

Citation

Leonor Becerra Bonache, Colin de la Higuera, Jean-Christophe Janodet, Frédéric Tantini. Learning Balls of Strings with Correction Queries. 18th European Conference on Machine Learning, Sep 2007, Varsovie, Poland. pp.18-29. ⟨ujm-00176268⟩

Share

Metrics

Record views

195