Identification in the Limit of Systematic Noisy Languages

Abstract : The common approach to studying learning from noisy data is to use a statistical model of noise. The influence of the noise is then assessed according to pragmatic or statistical criteria, within a paradigm that takes the distribution of the data into account. In this article, we study noise as a nonstatistical phenomenon by defining the concept of systematic noise. We establish two ways of learning (in the limit) from noisy data. The first is based on a reduction between problems: it consists in learning from data known to be noisy, then denoising the learned function. The second consists in denoising the training examples on the fly, thereby identifying the good examples in the limit, and then learning from uncorrupted data. In both cases we give sufficient conditions for learning to be possible, and we show through various examples (drawn in particular from the field of grammatical inference) that the two techniques are complementary.

https://hal-ujm.archives-ouvertes.fr/ujm-00112410
Contributor : Frédéric Tantini
Submitted on : Wednesday, November 8, 2006 - 3:10:29 PM
Last modification on : Wednesday, July 25, 2018 - 2:05:30 PM

Identifiers

  • HAL Id : ujm-00112410, version 1

Citation

Frédéric Tantini, Colin de la Higuera, Jean-Christophe Janodet. Identification in the Limit of Systematic Noisy Languages. 8th International Colloquium, ICGI 2006, Sep 2006, Chofu, Tokyo, Japan. pp.19-31. ⟨ujm-00112410⟩
