Classification of Large Data Sets by Neural Networks: A Probabilistic Viewpoint

Marcello Sanguineti
2026-01-01

Abstract

A probabilistic approach to the classification of large data sets is presented. For data drawn from distributions that do not satisfy the naive Bayes assumption (that is, distributions in which features are not independent of one another), conditions on the distributions are given that guarantee almost deterministic behavior of the errors in approximation by neural networks. It is shown that mean values of correlations with network computational units, together with the growth of the sizes of their sets of input/output functions, can be used to assess the suitability of networks for classes of tasks characterized by probabilities modeling their relevance for a given type of application.

Publication year: 2026
ISBN: 9783032045577, 9783032045584

Use this identifier to cite or link to this document: https://hdl.handle.net/11567/1278308