Structure learning of Bayesian Networks using global optimization with applications in data classification

Taheri, Sona; Mammadov, Musa

doi:10.1007/s11590-014-0803-1

Structure learning of Bayesian Networks using global optimization with applications in data classification

Original Paper
Published: 04 October 2014

Volume 9, pages 931–948, (2015)
Cite this article

Optimization Letters Aims and scope Submit manuscript

Sona Taheri¹ &
Musa Mammadov^1,2

432 Accesses
11 Citations
1 Altmetric
Explore all metrics

Abstract

Bayesian Networks are increasingly popular methods of modeling uncertainty in artificial intelligence and machine learning. A Bayesian Network consists of a directed acyclic graph in which each node represents a variable and each arc represents probabilistic dependency between two variables. Constructing a Bayesian Network from data is a learning process that consists of two steps: learning structure and learning parameter. Learning a network structure from data is the most difficult task in this process. This paper presents a new algorithm for constructing an optimal structure for Bayesian Networks based on optimization. The algorithm has two major parts. First, we define an optimization model to find the better network graphs. Then, we apply an optimization approach for removing possible cycles from the directed graphs obtained in the first part which is the first of its kind in the literature. The main advantage of the proposed method is that the maximal number of parents for variables is not fixed a priory and it is defined during the optimization procedure. It also considers all networks including cyclic ones and then choose a best structure by applying a global optimization method. To show the efficiency of the algorithm, several closely related algorithms including unrestricted dependency Bayesian Network algorithm, as well as, benchmarks algorithms SVM and C4.5 are employed for comparison. We apply these algorithms on data classification; data sets are taken from the UCI machine learning repository and the LIBSVM.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Aleatoric and epistemic uncertainty in machine learning: an introduction to concepts and methods

Article Open access 08 March 2021

Learning from positive and unlabeled data: a survey

Article 02 April 2020

Artificial Intelligence in Physical Sciences: Symbolic Regression Trends and Perspectives

Article Open access 19 April 2023

References

Asuncion, A., Newman, D.: UCI machine learning repository. School of Information and Computer Science, University of California. http://www.ics.uci.edu/mlearn/MLRepository.html (2007)
Bender, M.A., Fineman, J.T. Gilbert, S.: A new approach to incremental topological ordering. In: Proceedings of the twentieth Annual ACM-SIAM Symposium on Discrete Algorithms Society for Industrial and Applied Mathematics. Philadelphia (2009)
Campos, L., Fernandez-Luna, M., Gamez, A., Puerta, M.: Ant colony optimization for learning Bayesian networks. Int. J. Approx. Reason. 31(3), 291–311 (2002)
Castillo, E., Gutierrez, J.M., Hadi, A.S.: Expert Systems and Probabilistic Network Models. Springer, New York (1997)
Book Google Scholar
Chang, C., Lin, C.: LIBSVM: a library for support vector machines, 2001a. http://www.csie.ntu.edu.tw/cjlin/libsvm (2001)
Chickering, D.M.: Learning Bayesian networks is NP-complete. In: Fisher, D., Lenz, H.-J. (eds.) Learning from Data: Artificial Intelligence and Statistics, pp. 121–130. Springer-Verlag (1996)
Daly, R., Shen, Q.: Learning Bayesian network equivalence classes with ant colony optimization. J. Artif. Intell. Res. 35, 391–447 (2009)
MATH MathSciNet Google Scholar
Domingos, P., Pazzani, M.: On the optimality of the simple Bayesian classier under zero-one loss. Mach. Learn. 29, 103–130 (1997)
Article MATH Google Scholar
Fayyad, U.M., Irani, K.B.: On the handling of continuous-valued attributes in decision tree generation. Mach. Learn. 8, 87–102 (1993)
Google Scholar
Friedman, N., Geiger, D., Goldszmidt, M.: Bayesian network classifiers. Mach. Learn. 29, 131–163 (1997)
Article MATH Google Scholar
Haeupler, B., Kavitha, T., Mathew, R., Sen, S., Tarjan, R.E.: Incremental cycle detection, topological ordering, and strong component maintenance. In: 35th International Colloquium on Automata, Languages, and Programming (ICALP). Reykjavik, Iceland (2008)
Heckerman, D., Geiger, D., Chickering, D.M.: Learning Bayesian Networks: the combination of knowledge and statistical data. Mach. Learn. 20, 197–243 (1995)
MATH Google Scholar
Heckerman, D., Chickering, D., Meek, C.: Large-sample learning of Bayesian Networks is NP-hard. Mach. Learn. 5, 1287–1330 (2004)
MATH MathSciNet Google Scholar
Janzura, M., Nielsen, J.: A simulated annealing-based method for learning Bayesian Networks from statistical data. Int. J. Intell. Syst. 21, 335–348 (2006)
Article MATH Google Scholar
Jensen, F.: An Introduction to Bayesian Networks. Springer, New York (1996)
Google Scholar
Ji, Z., Zhong, H., Hu, R., Liu, C.: A Bayesian Network learning algorithm based on independence test and ant colony optimization. Acta Autom. Sin. 35(3), 281–288 (2009)
Kabli, R., Herrmann, F., McCall, J.: A chain-model genetic algorithm for Bayesian Network structure learning. In: Proceedings of the 9th Annual Conference on Genetic and Evolutionary Computation. ACM, New York (2007)
Kolda, T. G., Lewis, R. M., Torczon, V.: Optimization by direct search: new perspectives on some classical and modern methods. SIAM Rev. 45(3), 385–482 (2003)
Kouhbor, S., Ugon, J., Rubinov, A., Kruger, A., Mammadov, M.: Coverage in WLAN with minimum number of access points. In: Vehicular Technology Conference, VTC Spring, pp. 1166–1170 (2006)
Langley, P., Iba, W., Thompson, K.: An analysis of Bayesian classiers. In: 10th International Conference Artificial Intelligence, pp. 223–228. AAAI Press (1992)
Larranaga, P., Murga, R., Poza, M., Kuijpers, C.: Structure Learning of Bayesian Networks by Hybrid Genetic Algorithms. Preliminary papers 5th international workshop artificial intelligence and statistics, 310–316 (1995)
Larranaga, P., Poza, M., Yurramendi, Y., Murga, H., Kuijpers, C.: Structure learning of Bayesian networks by genetic algorithms: a performance analysis of control parameters. In: IEEE Transactions on Pattern Analysis and Machine Intelligence archive (1996)
Larranaga, P., Sierra, B., Gallego, J., Michelena, J., Picaza. M.: Learning Bayesian Networks by genetic algorithms: A case study in the prediction of survival in malignant skin melanoma. Artif. Intell. Med. 261–272 (1997)
Mammadov, M.A., Rubinov, A.M., Sniedovich, M.: A new global optimization algorithm based on dynamical systems approach. In: 6th International Conference on Optimization: Techniques and Applications. Ballarat, Australia (2004)
Mammadov, M.A., Rubinov, A.M., Yearwood, J.: Dynamical systems described by relational elasticities with applications to global optimization. In: Jeyakumar, V., Rubinov, A. (eds.) Continuous Optimisation: Current Trends and Modern Applications, pp. 365–387. Springer (2005)
Mammadov, M.A., Orsi, R.: H_ infinity systhesis via a nonsmooth, nonconvex optimization approach. Pac. J. Optim. 1(2), 405–420 (2005)
Marinescu, R., Dechter, R.: AND/OR branch-and-bound search for combinatorial optimization in graphical models. Artif. Intell. 173, 1457–1491 (2009)
Article MATH MathSciNet Google Scholar
Maroosi, A., Amiri, B.: A new clustering algorithm based on hybrid global optimization based on a dynamical systems approach algorithm. Expert Syst. Appl. 37(8), 5645–5652 (2010)
Park, H., Cho, S.: An Effcient Attribute Ordering Optimization in Bayesian Networks for Prognostic Modeling of the Metabolic Syndrome, pp. 381–391. Springer, Berlin (2006)
Google Scholar
Pearl, J.: Probabilistic Reasoning in Intelligent Systems: Networks of Plausible Inference. Morgan Kaufmann, San Mateo (1988)
Google Scholar
Richter, F., Fettweis, G.: Base Station Placement Based on Force Fields. IEEE VTC-Spring, Yokohama (2012)
Book Google Scholar
Robinson, R.W.: Counting Unlabeled Acyclic Diagraphs, pp. 28–43. Springer, New York (1997)
Google Scholar
Sahami, M.: Learning limited dependence Bayesian classiers. In: The 2nd International Conference. Knowledge Discovery and Data mining (KKD), pp. 335–338 (1996)
Sahin, F., Yavuz, M.C., Arnavut, Z., Uluyol, O.: Fault diagnosis for airplane engines using Bayesian networks and distributed particle swarm optimization. Parallel Comput. 33(2), 124–143 (2007)
Schleip, C., Rais, A., Menzel, A.: Bayesian analysis of temperature sensitivity of plant phenology in Germany. Agric. For. Meteorol. 149, 1699–1708 (2009)
Article Google Scholar
Sedgewick, R: Algorithms. Addison-Wesley Publishing Company. (1983)
Shafer, G., Pearl, J.: Readings in Uncertain Reasoning. Morgan Kaufmann, San Mateo (1990)
MATH Google Scholar
Sun, W., Yuan, Y.X.: Optimization Theory and Methods, Nonlinear Programming. Springer Optimization and its Applications. vol. 1 (2006)
Taheri, S., Mammadov, M.: Solving systems of nonlinear equations using a globally convergent optimization algorithm. Glob. J. Technol. Optim. vol. 3, 132–138 (2012)
Taheri, S., Mammadov, M. Seifollahi, S.: Globally convergent optimization methods for unconstrained problems. Optimization: A journal of mathematical programming and operations research. pp. 124–143 (2012)
Taheri, S., Mammadov, M.: Structure learning of Bayesian networks using a new unrestricted dependency algorithm. In: Second International Conference on Social Eco-Informatics. Venice, Italy (2012)
Tilakaratne, C.D., Mammadov, M., Morris, S.A.: Modied neural network algorithms for predicting trading signals of stock market indices. J. Appl. Math. Decis. Sci. (2009)
Tucker, A.: Covering Circuits and Graph Coloring, Applied Combinatorics, 5th edn. John Wiley and sons, Hoboken (2006)
Google Scholar
Yatsko, A., Bagirov, A., Stranieri, A.: On the discretization of continuous features for classication. In: The Proceedings of Ninth Australasian Data Mining Conference (AusDM 2011). Ballarat, Australia (2011)
Zhao, J., Sun, J., Xu, W., Zhou, D.: Structure learning of Bayesian networks based on discrete binary quantum-behaved particle swarm optimization algorithm. In: Proceedings of the Fifth International Conference on Natural Computation. IEEE Computer Society, Washington (2009)

Download references

Author information

Authors and Affiliations

Faculty of Science and Technology, Federation University, Ballarat, VIC, 3353, Australia
Sona Taheri & Musa Mammadov
National ICT Australia, Melbourne, VIC, 3010, Australia
Musa Mammadov

Authors

Sona Taheri
View author publications
You can also search for this author in PubMed Google Scholar
Musa Mammadov
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Sona Taheri.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Taheri, S., Mammadov, M. Structure learning of Bayesian Networks using global optimization with applications in data classification. Optim Lett 9, 931–948 (2015). https://doi.org/10.1007/s11590-014-0803-1

Download citation

Received: 28 March 2013
Accepted: 14 September 2014
Published: 04 October 2014
Issue Date: June 2015
DOI: https://doi.org/10.1007/s11590-014-0803-1

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Structure learning of Bayesian Networks using global optimization with applications in data classification

Abstract

Access this article

Similar content being viewed by others

Aleatoric and epistemic uncertainty in machine learning: an introduction to concepts and methods

Learning from positive and unlabeled data: a survey

Artificial Intelligence in Physical Sciences: Symbolic Regression Trends and Perspectives

References

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Structure learning of Bayesian Networks using global optimization with applications in data classification

Abstract

Access this article

Similar content being viewed by others

Aleatoric and epistemic uncertainty in machine learning: an introduction to concepts and methods

Learning from positive and unlabeled data: a survey

Artificial Intelligence in Physical Sciences: Symbolic Regression Trends and Perspectives

References

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation