C. Blake and C. Merz, University of California Irvine repository of machine learning databases, 1998.

C. Brodley and M. Friedl, Identifying and eliminating mislabeled training instances, 13th National Conference on Artificial Intelligence AAAI/IAAI, pp.799-805, 1996.

R. Carrasco and J. Oncina, Learning stochastic regular grammars by means of a state merging method, Grammatical Inference and Applications, ICGI'94, number 862 in LNAI, pp.139-150, 1994.
DOI : 10.1007/3-540-58473-0_144

R. C. Carrasco, R. , and J. R. , A similarity between probabilistic tree languages: application to XML document families, Pattern Recognition, vol.36, issue.9, pp.2197-2199, 2002.
DOI : 10.1016/S0031-3203(02)00320-5

C. P. Cox, Handbook of Introductory Statistical Methods., Biometrics, vol.44, issue.1, 1987.
DOI : 10.2307/2531931

C. De-la-higuera, A bibliographical study of grammatical inference, Pattern Recognition, vol.38, issue.9, 2005.
DOI : 10.1016/j.patcog.2005.01.003

URL : https://hal.archives-ouvertes.fr/ujm-00376590

A. Habrard, M. Bernard, and M. Sebban, Improvement of the State Merging Rule on Noisy Data in Probabilistic Grammatical Inference, 14th European Conference on Machine Learning, pp.169-180, 2003.
DOI : 10.1007/978-3-540-39857-8_17

A. Habrard, M. Bernard, and M. Sebban, Probabilistic approach for reduction of irrelevant tree-structured data, 1st International Workshop on Mining Graphs, Trees and Sequences (MGTS-2003), pp.11-20, 2003.

G. John, R. Kohavi, and K. Pfleger, Irrelevant Features and the Subset Selection Problem, 11th International Conference on Machine Learning, pp.121-129, 1994.
DOI : 10.1016/B978-1-55860-335-6.50023-4

C. Kermorvant and P. Dupont, Stochastic Grammatical Inference with Multinomial Tests, 6th International Colloquium on Grammatical Inference, pp.149-160, 2000.
DOI : 10.1007/3-540-45790-9_12

R. Lyngsø, C. Pedersen, and H. Nielsen, Metrics and similarity measures for hidden Markov models, 7th International Conference on Intelligent Systems for Molecular Biology (ISMB '99), pp.178-186, 1999.

A. Reber, Implicit learning of artificial grammars, Journal of Verbal Learning and Verbal Behavior, vol.6, issue.6, pp.855-863, 1967.
DOI : 10.1016/S0022-5371(67)80149-X

D. Ron, Y. Singer, and N. Tishby, On the learnability and usage of acyclic probabilistic automata, Computational Learning Theory, COLT'95, pp.31-40, 1995.

M. Sebban and C. Janodet, On state merging in grammatical inference: A statistical approach for dealing with noisy data, 20th International Conference on Machine Learning, pp.688-695, 2003.

M. Sebban and R. Nock, Instance pruning as an information preserving problem, 17th International Conference on Machine Learning (ICML), pp.855-862, 2000.

F. Thollard, P. Dupont, and C. De-la-higuera, Probabilistic dfa inference using kullback?leibler divergence and minimality, 17th International Conference on Machine Learning, pp.975-982, 2000.