├── compile.sh
├── run.sh
├── BayesNet.png
├── data
│   ├── .DS_Store
│   ├── gold_alarm.bif
│   └── solved_alarm.bif
├── main.py
├── README.md
├── bayesnet.py
├── format_checker.cpp
└── utils.py
/compile.sh:
--------------------------------------------------------------------------------
 1 | #!/bin/bash
--------------------------------------------------------------------------------
/run.sh:
--------------------------------------------------------------------------------
 1 | #!/bin/bash
 2 | python main.py $1 $2
--------------------------------------------------------------------------------
/BayesNet.png:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/navreeetkaur/bayesian-network-learning/HEAD/BayesNet.png
--------------------------------------------------------------------------------
/data/.DS_Store:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/navreeetkaur/bayesian-network-learning/HEAD/data/.DS_Store
--------------------------------------------------------------------------------
/main.py:
--------------------------------------------------------------------------------
 1 | import sys
 2 | import time
 3 | import utils
 4 | 
 5 | 
 6 | # Setup network
 7 | step0 = time.time()
 8 | print "Initialising . . . . "
 9 | bn, df, mis_index = utils.setup_network(sys.argv[1], sys.argv[2])
10 | step1 = time.time()
11 | print "Initialisation time: (%ss)" % (round((step1 - step0), 5))
12 | print
13 | # Learn parameters
14 | print "Expectation-Maximisation . . . . "
15 | Alarm = utils.Expectation_Maximisation(df, bn, mis_index)
16 | step2 = time.time()
17 | print
18 | print "EM time: (%ss)" % (round((step2 - step1), 5))
19 | print
20 | print "Parsing output file . . . . 
" 21 | utils.parse_output(Alarm, sys.argv[1]) 22 | step3 = time.time() 23 | print "Output file parsing: (%ss)" % (round((step3 - step2), 5)) 24 | print 25 | print "TOTAL Time taken: (%ss)" % (round((step3 - step1), 5)) -------------------------------------------------------------------------------- /README.md: -------------------------------------------------------------------------------- 1 | # Bayesian Network Parameter Learning 2 | ### Course Project - COL884(Spring'18):Uncertainity in AI 3 | #### Creator: Navreet Kaur[2015TT10917] 4 | 5 | #### Objective: 6 | Bayesian Parameter Learning of Alarm Bayesian Net given data with at most one missing value in each row. 7 | #### Algorithm Used: 8 | Expectation-Maximisation 9 | #### Goal: 10 | The goal of this assignment is to get experience with learning of Bayesian Networks and understanding their value in the real world. 11 | #### Scenario: 12 | Medical diagnosis. Some medical researchers have created a Bayesian network that models the inter-relationship between (some) diseases and observed symptoms. Our job as computer scientists is to learn parameters for the network based on health records. Unfortunately, as it happens in the real world, certain records have missing values. We need to do our best to compute the parameters for the network, so that it can be used for diagnosis later on. 13 | #### Problem Statement: 14 | We are given the Bayesian Network created by the researchers(as shown in BayesNet.png).Notice that eight diagnoses are modeled here: hypovolemia, left ventricular failure, Anaphylaxis, insufficient analgesia, pulmonary embolus, intubation, kinked tube, and disconnection. The observable nodes are CVP, PCWP, History, TPR, Blood Pressure, CO, HR BP, HR EKG, HR SAT, SaO2, PAP, MV, Min Vol, Exp CO2, FiO2 and Pres. Such networks can be represented in many formats. We will use the .bif format. BIF stands for Bayesian Interchange Format. 
The details about the format are [here](http://sites.poli.usp.br/p/fabio.cozman/). We are also providing a .bif parser so that you can start directly from a parsed Bayesian network represented as a graph.
15 | 
16 | The goal of the assignment is to learn the Bayes net from a healthcare dataset.
17 | #### Input format:
18 | We will work with the alarm.bif network. Please have a look at this file to get a basic understanding of how this information relates to the Bayes net image above. A sample Bayes net is as follows:
19 | variable “X” {
20 | 
21 | type discrete[2] { “True” “False” };
22 | 
23 | }
24 | 
25 | variable “Y” {
26 | 
27 | type discrete[2] { “True” “False” };
28 | 
29 | }
30 | 
31 | variable “Z” {
32 | 
33 | type discrete[2] { “True” “False” };
34 | 
35 | }
36 | probability(“X”) { table 0.2 0.8 ; }
37 | 
38 | probability(“Y”) { table 0.4 0.6 ; }
39 | 
40 | probability(“Z” “X” “Y”) { table 0.2 0.4 0.3 0.5 0.8 0.6 0.7 0.5; }
41 | 
42 | This says that X, Y, and Z all have two values each. X and Y have no parents, with priors P(X=True)=0.2, P(X=False)=0.8, and so on. Z has both X and Y as parents. Its probability table says P(Z=True|X=True, Y=True) = 0.2, P(Z=True|X=True, Y=False) = 0.4 and so on.
43 | 
44 | Our input network will have the Bayes net structure, including variables and parents, but will not have probability values. We will use -1 to represent that a probability value is unknown.
45 | probability(“X”) { table -1 -1 ; } will represent that the prior probability of X is unknown and needs to be computed via learning.
46 | 
47 | To learn these values we will provide a data file. Each line will be a patient record. All features will be listed in exactly the same order as in the .bif network and will be comma-separated. If a feature value is unknown we will use the special symbol “?” for it. There will be no more than 1 unknown value per row. Example:
48 | 
49 | “True”, “False”, “True” “?”, “False”, “False”
50 | 
51 | Here the first row says that X=True, Y=False and Z=True. 
The second row says that X is not known, and Y and Z are both False.
52 | Overall, your input will be alarm.bif with most probability values -1, plus this datafile. The datafile will have about 10,000 patient records.
53 | #### Output format:
54 | The output will be the result of learning each probability value in the conditional probability tables. In other words, all -1s are replaced with a probability value up to four decimal places. Thus, the output is a complete alarm.bif network.
55 | #### Files:
56 | 1) records.dat:
57 | A dataset file where a single line is a single patient record and the variables in a record are separated by spaces. Unknown values are marked by “?”. Each line contains at most 1 missing value. The file contains more than 11000 records.
58 | 2) format_checker.cpp:
59 | A format checker to verify that your output file adheres to the alarm.bif format. The format checker assumes that alarm.bif, solved_alarm.bif and gold_alarm.bif are present in the current directory and outputs its results. (A next version will also compute the total learning error). 
60 | 3) alarm.bif:
61 | BIF format file whose parameters need to be learned
62 | 4) gold_alarm.bif:
63 | BIF file having the true parameters
64 | 5) bayesnet.py:
65 | classes:
66 | Graph_Node
67 | Network
68 | methods:
69 | read_network: Parse the .bif format file and build a Bayesian net
70 | markov_blanket: Get variables in the Markov blanket of variable 'val_name'
71 | get_data: Read data from records.dat and store it as a pandas DataFrame
72 | normalise_counts: Normalise a list of counts from a given CPT
73 | 6) utils.py:
74 | methods:
75 | setup_network
76 | get_missing_index: List of the indices of nodes which have missing values in each data point; equal to -1 if no value is missing
77 | init_params: Initialise parameters
78 | normalise_array: Normalise a numpy array
79 | get_assignment_for: Return the rows of the factor table with assignments as specified in evidence E
80 | markov_blanket_sampling: Inference by Markov blanket sampling
81 | Expectation
82 | Maximisation
83 | Expectation_Maximisation
84 | parse_output
85 | 7) main.py: main file that calls methods from bayesnet and utils to build a Bayes net, read the data and learn its parameters
86 | #### Compilation:
87 | Run the file run.sh - it takes 2 input files, alarm.bif and records.dat, and outputs a file named
88 | solved_alarm.bif:
89 | `./run.sh alarm.bif records.dat`
90 | 
91 | #### Assumptions:
92 | • All variables are missing completely (or unconditionally) at random (MCAR), and none of them are missing at random (MAR), missing systematically, or hidden; i.e. initially the probability of each missing value is the same, and the sample mean of variable v is an unbiased estimator of the true value of v
93 | #### Parameter Initialisation:
94 | • Initialisation of parameters by available-case analysis (ignoring rows with missing values if the missing value is that of the parent). 
Since the data is MCAR, estimators based on this subsample of the data are unbiased estimators of the ones computed from complete data
95 | #### Design Choices:
96 | 1. Data Records:
97 | (a) String values for each class of random variable were mapped to integers
98 | (b) The data file was stored as a pandas DataFrame so as to perform grouping and aggregation of certain data occurrences to get their counts
99 | 2. Network
100 | (a) Ordered dictionary to represent nodes in the graph (keys = name of random variable, value = node object)
101 | (b) Ordered dictionary to store the Markov blanket (MB) of all nodes (keys = name of random variable (X), value = list of Strings of names of nodes in the MB of X) - this is stored so as to avoid recomputation of the Markov blanket at each step while doing Markov blanket sampling inference
102 | 3. Graph_Node
103 | (a) List of Strings to store names of Parents
104 | (b) List of integers to store indices of Children in the ordered dictionary of nodes in the Bayes net
105 | (c) Pandas DataFrame to store the CPT
106 | 4. CPTs
107 | (a) All CPTs are represented by pandas DataFrames (columns are the names of the variables, plus a column ‘p’ for the probability value) so as to easily access entries by specifying a dictionary of ‘Evidence’ with keys as variable names and values as the integers
108 | #### Optimisation/Techniques:
109 | 1. Storage of only counts, and not probabilities, in all the CPTs, normalising them before performing the Expectation step
110 | 2. **Smoothing**: Since all possible instances might not be observed, due to the small size of the dataset as compared to the number of network nodes, the counts of all possible instances in the CPTs were initialised to one. Similarly, in the Maximisation step, when any observed count was equal to zero, it was set to 0.00005 (since the required precision of probabilities is up to 4 decimal places, and counts in maximisation might be less than one due to the weights of the data points being considered, which themselves lie between 0 and 1)
111 | 3. 
**Inference**: Since the probability of variable X is independent of all other variables given its Markov blanket, and only one value is missing per row (i.e. all other values are given, hence the MB is given), we have P(X | data) = P(X | mb(X)), where mb(X) is the Markov blanket of X. Therefore, Markov blanket sampling was used to calculate P(X | MB(X))
112 | 4. Using **log probabilities**, as addition is faster than multiplication and it also helps to avoid numerical underflow.
113 | 5. **Convergence Criterion**: The maximum change in the CPTs between the previous and current iterations is less than or equal to 0.00005
114 | 
--------------------------------------------------------------------------------
/bayesnet.py:
--------------------------------------------------------------------------------
 1 | from __future__ import division
 2 | from collections import OrderedDict
 3 | import numpy as np
 4 | import pandas as pd
 5 | import time
 6 | 
 7 | 
 8 | __author__ = "Navreet Kaur"
 9 | __entrynumber__ = "2015TT10917"
10 | 
11 | 
12 | class Graph_Node():
13 |     """Our graph consists of a list of nodes where each node is represented as follows"""
14 | 
15 |     def __init__(self, name, n, vals):
16 |         self.Node_Name = name  # Variable name
17 |         self.nvalues = n  # Number of categories the variable represented by this node can take
18 |         self.values = vals  # Categories of possible values
19 |         self.Children = []  # Children of a particular node - these are indices of nodes in the graph. 
20 |         self.Parents = []  # Parents of a particular node - note these are names of parents
21 |         self.CPT = []
22 |         self.cpt_data = pd.DataFrame()  # conditional probability table as a DataFrame (counts)
23 |         self.markov_blanket = []  # List of nodes in the Markov Blanket - note that these are the names of the nodes
24 | 
25 |     def get_name(self):
26 |         return self.Node_Name
27 | 
28 |     def get_children(self):
29 |         return self.Children
30 | 
31 |     def get_Parents(self):
32 |         return self.Parents
33 | 
34 |     def get_n_parents(self):
35 |         return len(self.Parents)
36 | 
37 |     def get_CPT(self):
38 |         return self.CPT
39 | 
40 |     def get_nvalues(self):
41 |         return self.nvalues
42 | 
43 |     def get_values(self):
44 |         return self.values
45 | 
46 |     def set_CPT(self, new_CPT):
47 |         del(self.CPT[:])
48 |         self.CPT = new_CPT
49 | 
50 |     def set_counts(self, new_counts):
51 |         # note: self.counts is created here on first use
52 |         self.counts = new_counts
53 | 
54 |     def set_MB(self, new_mb):
55 |         self.markov_blanket = new_mb
56 | 
57 |     def set_cpt_data(self, new_cpt_data):
58 |         # replace the old table outright (drop() without assignment was a no-op)
59 |         self.cpt_data = new_cpt_data
60 | 
61 |     def set_Parents(self, Parent_Nodes):
62 |         self.Parents = Parent_Nodes
63 | 
64 |     def add_child(self, new_child_index):
65 |         if new_child_index in self.Children:
66 |             return 0
67 |         else:
68 |             self.Children.append(new_child_index)
69 |             return 1
70 | 
71 |     def print_node(self):
72 |         print(self.Node_Name)
73 |         print(self.values)
74 |         print(self.Parents)
75 |         print(self.CPT)
76 |         print
77 | 
78 | 
79 | class network():
80 |     """
81 |     The whole network represented as a dictionary of nodes
82 |     Pres_Graph:
83 |         Ordered Dictionary - Keys: variable names, Values: Node Objects
84 |     MB:
85 |         Ordered Dictionary - Keys: variable names, Values: List of names of the nodes in the markov blanket of the key
86 |     """
87 | 
88 |     def __init__(self, Pres_Graph=None, MB=None):
89 |         self.Pres_Graph = Pres_Graph if Pres_Graph is not None else OrderedDict()
90 |         self.MB = MB if MB is not None else OrderedDict()
91 | 
92 |     def addNode(self, node):
93 | 
self.Pres_Graph[node.Node_Name] = node
 94 | 
 95 |     def netSize(self):
 96 |         return len(self.Pres_Graph)
 97 | 
 98 |     def get_index(self, val_name):
 99 |         try:
100 |             return self.Pres_Graph.keys().index(val_name)
101 |         except ValueError:
102 |             print "No node of the name: " + str(val_name)
103 |             return None
104 | 
105 |     def get_nth_node(self, n):
106 |         return self.Pres_Graph.values()[n]
107 | 
108 |     def search_node(self, val_name):
109 |         try:
110 |             return self.Pres_Graph[val_name]
111 |         except KeyError:
112 |             print "Node NOT found"
113 |             return None
114 | 
115 |     def get_parent_nodes(self, node):
116 |         parent_nodes = []
117 |         parents = node.get_Parents()
118 |         for p in parents:
119 |             parent_nodes.append(self.search_node(p))
120 |         return parent_nodes
121 | 
122 |     def get_children(self, val_name):
123 |         Children = self.Pres_Graph[val_name].Children
124 |         c = []
125 |         for n in Children:
126 |             c.append(self.Pres_Graph.keys()[n])
127 |         return c
128 | 
129 |     def set_mb(self):
130 |         for vals in self.Pres_Graph.keys():
131 |             self.MB[vals] = markov_blanket(self, vals)
132 | 
133 | 
134 |     def normalise_cpt(self, X):
135 |         l = [X] + self.Pres_Graph[X].Parents + ['counts', 'p']
136 |         cpt = self.Pres_Graph[X].cpt_data
137 |         nvals = self.Pres_Graph[X].nvalues
138 |         cardinality = cpt.shape[0]
139 |         no_grps = int(cardinality / nvals)
140 |         list_dfs = []
141 |         df = pd.DataFrame()
142 |         i = 0
143 |         for n in range(no_grps):
144 |             curr_df = pd.DataFrame(cpt.iloc[i:i+nvals, :])
145 |             curr_df['p'] = normalise_counts(curr_df['counts'])
146 |             df = df.append(curr_df)
147 |             i = i + nvals
148 |         self.Pres_Graph[X].cpt_data = df[l]
149 | 
150 | 
151 | """ Reading network from .bif format """
152 | def read_network(bif_filepath):
153 |     Alarm = network()
154 |     find = 0
155 | 
156 |     with open(bif_filepath, 'r') as myfile:
157 |         while True:
158 |             line = myfile.readline()
159 |             line = line.strip()
160 | 
161 |             if line == '':
162 |                 break
163 | 
164 |             tokens = line.split()
165 |             first_word = tokens[0]
166 | 
167 | 
168 |             if first_word == 
"variable": 169 | values = [] 170 | name = tokens[1] # random varible name 171 | line_ = myfile.readline() # read next line 172 | line_ = line_.strip() 173 | tokens_ = line_.split() 174 | for i in range(3,len(tokens_)-1): 175 | values.append(tokens_[i]) 176 | new_node = Graph_Node(name = name, n = len(values), vals = values) 177 | Alarm.addNode(new_node) 178 | 179 | 180 | if first_word == "probability": 181 | vals = [] 182 | temp = tokens[2] 183 | node = Alarm.search_node(temp) 184 | index = Alarm.get_index(temp) 185 | i = 3 186 | # setting parents 187 | while True: 188 | if tokens[i]==")": 189 | break 190 | node_ = Alarm.search_node(tokens[i]) 191 | node_.add_child(index) 192 | vals.append(tokens[i]) 193 | i = i + 1 194 | 195 | node.set_Parents(vals) 196 | 197 | line_ = myfile.readline() 198 | tokens_ = line_.split() 199 | curr_CPT = [] 200 | for i in range(1,len(tokens_)-1): 201 | curr_CPT.append(int(tokens_[i])) 202 | 203 | node.set_CPT(curr_CPT) 204 | 205 | myfile.close() 206 | 207 | return Alarm 208 | 209 | 210 | # Get variables in the markov blanket of variable 'val_name' 211 | def markov_blanket(net, val_name): 212 | node = net.search_node(val_name) 213 | mb = [] 214 | # Parents 215 | parents = node.Parents 216 | mb = mb + parents 217 | # Children 218 | children_names = node.Children 219 | for c in children_names: 220 | child_node = net.Pres_Graph[net.Pres_Graph.keys()[c]] 221 | mb.append(child_node.Node_Name) 222 | # Spouses 223 | spouses = child_node.Parents 224 | for var in spouses: 225 | if var not in mb and var!=val_name: 226 | mb.append(var) 227 | 228 | return mb 229 | 230 | 231 | # Get the datafile as a pandas dataframe 232 | def get_data(filepath): 233 | with open(filepath,'r') as f: 234 | df = pd.DataFrame(l.rstrip().split() for l in f) 235 | 236 | df.columns = ['"Hypovolemia"','"StrokeVolume"','"LVFailure"','"LVEDVolume"','"PCWP"','"CVP"','"History"', 237 | 
'"MinVolSet"','"VentMach"','"Disconnect"','"VentTube"','"KinkedTube"','"Press"','"ErrLowOutput"', 238 | '"HRBP"','"ErrCauter"','"HREKG"','"HRSat"','"BP"','"CO"','"HR"','"TPR"','"Anaphylaxis"','"InsuffAnesth"','"PAP"','"PulmEmbolus"', 239 | '"FiO2"','"Catechol"','"SaO2"','"Shunt"','"PVSat"','"MinVol"','"ExpCO2"','"ArtCO2"','"VentAlv"','"VentLung"','"Intubation"'] 240 | 241 | features = list(df.columns) 242 | 243 | mapping_1 = {'"True"': 0, '"False"': 1, '"?"': float('nan')} 244 | mapping_2 = {'"Zero"': 0, '"Low"': 1, '"Normal"': 2, '"High"': 3, '"?"': float('nan')} 245 | mapping_3 = { '"Normal"': 0, '"Esophageal"': 1 , '"OneSided"': 2, '"?"': float('nan') } 246 | mapping_4 = {'"Low"':0, '"Normal"':1, '"High"':2, '"?"': float('nan')} 247 | mapping_5 = {'"Low"':0, '"Normal"':1, '"?"': float('nan')} 248 | mapping_6 = {'"Normal"':0, '"High"':1, '"?"': float('nan')} 249 | overall_mapping = { '"Hypovolemia"':mapping_1 , u'"StrokeVolume"':mapping_4, u'"LVFailure"':mapping_1, 250 | u'"LVEDVolume"':mapping_4, u'"PCWP"':mapping_4, u'"CVP"':mapping_4, 251 | u'"History"':mapping_1, u'"MinVolSet"':mapping_4, u'"VentMach"':mapping_2, u'"Disconnect"':mapping_1, 252 | u'"VentTube"':mapping_2, u'"KinkedTube"':mapping_1, u'"Press"':mapping_2, 253 | u'"ErrLowOutput"':mapping_1, u'"HRBP"':mapping_4, 254 | u'"ErrCauter"':mapping_1, u'"HREKG"':mapping_4, u'"HRSat"':mapping_4, 255 | u'"BP"':mapping_4, u'"CO"':mapping_4, u'"HR"':mapping_4, u'"TPR"':mapping_4, 256 | u'"Anaphylaxis"':mapping_1, u'"InsuffAnesth"':mapping_1, u'"PAP"':mapping_4, 257 | u'"PulmEmbolus"':mapping_1, u'"FiO2"':mapping_5, 258 | u'"Catechol"':mapping_6, u'"SaO2"':mapping_4, u'"Shunt"':mapping_6, 259 | u'"PVSat"':mapping_4, u'"MinVol"':mapping_2, u'"ExpCO2"':mapping_2, 260 | u'"ArtCO2"':mapping_4, u'"VentAlv"':mapping_2, u'"VentLung"':mapping_2, u'"Intubation"':mapping_3} 261 | df = df.replace(overall_mapping) 262 | # to get csv file of data 263 | # df.to_csv('records.csv') 264 | return df 265 | 266 | 267 | # normalise 
a list of counts 268 | def normalise_counts(vals): 269 | vals[vals==0] = 0.000005 270 | denom = np.sum(vals) 271 | normalised_vals = [] 272 | for val in vals: 273 | normalised_vals.append(val/float(denom)) 274 | return normalised_vals 275 | 276 | 277 | if __name__ == '__main__': 278 | print "This file contains Bayes Net classes: Run main.py" 279 | -------------------------------------------------------------------------------- /format_checker.cpp: -------------------------------------------------------------------------------- 1 | #include 2 | #include 3 | #include 4 | #include 5 | #include 6 | #include 7 | #include 8 | #include 9 | 10 | 11 | using namespace std; 12 | 13 | class Graph_Node{ 14 | 15 | private: 16 | string Node_Name; 17 | vector Children; 18 | vector Parents; 19 | int nvalues; 20 | vector values; 21 | vector CPT; 22 | 23 | public: 24 | //Graph_Node(string name, vector Child_Nodes,vector Parent_Nodes,int n, vector vals,vector curr_CPT) 25 | Graph_Node(string name,int n,vector vals) 26 | { 27 | Node_Name=name; 28 | //Children=Child_Nodes; 29 | //Parents=Parent_Nodes; 30 | nvalues=n; 31 | values=vals; 32 | //CPT=curr_CPT; 33 | 34 | } 35 | string get_name() 36 | { 37 | return Node_Name; 38 | } 39 | vector get_children() 40 | { 41 | return Children; 42 | } 43 | vector get_Parents() 44 | { 45 | return Parents; 46 | } 47 | vector get_CPT() 48 | { 49 | return CPT; 50 | } 51 | int get_nvalues() 52 | { 53 | return nvalues; 54 | } 55 | vector get_values() 56 | { 57 | return values; 58 | } 59 | void set_CPT(vector new_CPT) 60 | { 61 | CPT.clear(); 62 | CPT=new_CPT; 63 | } 64 | void set_Parents(vector Parent_Nodes) 65 | { 66 | Parents.clear(); 67 | Parents=Parent_Nodes; 68 | } 69 | int add_child(int new_child_index ) 70 | { 71 | for(int i=0;i Pres_Graph; 89 | 90 | public: 91 | int addNode(Graph_Node node) 92 | { 93 | Pres_Graph.push_back(node); 94 | return 0; 95 | } 96 | list::iterator getNode(int i) 97 | { 98 | int count=0; 99 | list::iterator listIt; 100 | 
for(listIt=Pres_Graph.begin();listIt!=Pres_Graph.end();listIt++) 101 | { 102 | if(count++==i) 103 | break; 104 | 105 | } 106 | return listIt; 107 | } 108 | int netSize() 109 | { 110 | return Pres_Graph.size(); 111 | } 112 | int get_index(string val_name) 113 | { 114 | list::iterator listIt; 115 | int count=0; 116 | for(listIt=Pres_Graph.begin();listIt!=Pres_Graph.end();listIt++) 117 | { 118 | if(listIt->get_name().compare(val_name)==0) 119 | return count; 120 | count++; 121 | } 122 | return -1; 123 | } 124 | 125 | list::iterator get_nth_node(int n) 126 | { 127 | list::iterator listIt; 128 | int count=0; 129 | for(listIt=Pres_Graph.begin();listIt!=Pres_Graph.end();listIt++) 130 | { 131 | if(count==n) 132 | return listIt; 133 | count++; 134 | } 135 | return listIt; 136 | } 137 | 138 | list::iterator search_node(string val_name) 139 | { 140 | list::iterator listIt; 141 | for(listIt=Pres_Graph.begin();listIt!=Pres_Graph.end();listIt++) 142 | { 143 | if(listIt->get_name().compare(val_name)==0) 144 | return listIt; 145 | } 146 | 147 | cout<<"node not found\n"; 148 | return listIt; 149 | } 150 | 151 | 152 | }; 153 | 154 | void check_format() 155 | { 156 | network Alarm; 157 | string line,testline; 158 | int find=0; 159 | ifstream myfile("alarm.bif"); 160 | ifstream testfile("solved_alarm.bif"); 161 | string temp; 162 | string name; 163 | vector values; 164 | int line_count=1; 165 | if (myfile.is_open()) 166 | { 167 | 168 | while (! 
myfile.eof() ) 169 | { 170 | 171 | getline (myfile,line); 172 | 173 | 174 | 175 | 176 | 177 | getline (testfile,testline); 178 | if(testline.compare(line)!=0) 179 | { 180 | cout<<"Error Here in line number"<>temp; 187 | 188 | 189 | 190 | if(temp.compare("probability")==0) 191 | { 192 | string test_temp; 193 | 194 | getline (myfile,line); 195 | getline (testfile,testline); 196 | 197 | stringstream ss2; 198 | stringstream testss2; 199 | ss2.str(line); 200 | ss2>> temp; 201 | testss2.str(testline); 202 | testss2>>test_temp; 203 | if(test_temp.compare(temp)!=0) 204 | { 205 | cout<<"Error Here in line number"<> temp; 209 | testss2>>test_temp; 210 | vector curr_CPT; 211 | string::size_type sz; 212 | while(temp.compare(";")!=0) 213 | { 214 | 215 | if(!atof(test_temp.c_str())) 216 | { 217 | cout<<" Probem in Probab values in line "<>temp; 222 | testss2>>test_temp; 223 | 224 | 225 | 226 | } 227 | if(test_temp.compare(";")!=0) 228 | { 229 | cout<<" Probem in Semi-colon in line "< values; 265 | 266 | if (myfile.is_open()) 267 | { 268 | while (! 
myfile.eof() ) 269 | { 270 | stringstream ss; 271 | getline (myfile,line); 272 | 273 | 274 | ss.str(line); 275 | ss>>temp; 276 | 277 | 278 | if(temp.compare("variable")==0) 279 | { 280 | 281 | ss>>name; 282 | getline (myfile,line); 283 | 284 | stringstream ss2; 285 | ss2.str(line); 286 | for(int i=0;i<4;i++) 287 | { 288 | 289 | ss2>>temp; 290 | 291 | 292 | } 293 | values.clear(); 294 | while(temp.compare("};")!=0) 295 | { 296 | values.push_back(temp); 297 | 298 | ss2>>temp; 299 | } 300 | Graph_Node new_node(name,values.size(),values); 301 | int pos=Alarm.addNode(new_node); 302 | 303 | 304 | } 305 | else if(temp.compare("probability")==0) 306 | { 307 | 308 | ss>>temp; 309 | ss>>temp; 310 | 311 | list::iterator listIt; 312 | list::iterator listIt1; 313 | listIt=Alarm.search_node(temp); 314 | int index=Alarm.get_index(temp); 315 | ss>>temp; 316 | values.clear(); 317 | while(temp.compare(")")!=0) 318 | { 319 | listIt1=Alarm.search_node(temp); 320 | listIt1->add_child(index); 321 | values.push_back(temp); 322 | 323 | ss>>temp; 324 | 325 | } 326 | listIt->set_Parents(values); 327 | getline (myfile,line); 328 | stringstream ss2; 329 | 330 | ss2.str(line); 331 | ss2>> temp; 332 | 333 | ss2>> temp; 334 | 335 | vector curr_CPT; 336 | string::size_type sz; 337 | while(temp.compare(";")!=0) 338 | { 339 | 340 | curr_CPT.push_back(atof(temp.c_str())); 341 | 342 | ss2>>temp; 343 | 344 | 345 | 346 | } 347 | 348 | listIt->set_CPT(curr_CPT); 349 | 350 | 351 | } 352 | else 353 | { 354 | 355 | } 356 | 357 | 358 | 359 | 360 | 361 | } 362 | 363 | if(find==1) 364 | myfile.close(); 365 | } 366 | 367 | return Alarm; 368 | } 369 | 370 | int main() 371 | { 372 | network Alarm1,Alarm2; 373 | check_format(); 374 | Alarm1=read_network((char*)"solved_alarm.bif"); 375 | Alarm2=read_network((char*)"gold_alarm.bif"); 376 | float score=0; 377 | for(int i=0;i::iterator listIt1=Alarm1.get_nth_node(i); 380 | list::iterator listIt2=Alarm2.get_nth_node(i); 381 | vector cpt1=listIt1->get_CPT(); 382 | 
vector cpt2=listIt2->get_CPT(); 383 | for(int j=0;j660): 358 | # print "OVER TIME. . . . " 359 | break 360 | if delta <= 0.00005: 361 | break 362 | curr_iter +=1 363 | print "Converged in (" + str(curr_iter) + ") iterations" 364 | 365 | return bn 366 | 367 | # Parse learned parameters to 'solved_alarm.bif' 368 | def parse_output(Alarm, bif_alarm): 369 | i = 0 370 | with open('solved_alarm.bif', 'w') as output, open(bif_alarm, 'r') as input: 371 | while True: 372 | line0 = input.readline() 373 | line = line0.strip() 374 | if line == '': 375 | break 376 | tokens = line.split() 377 | first_word = tokens[0] 378 | if first_word == 'table': 379 | X = Alarm.Pres_Graph.keys()[i] 380 | l = [X] + Alarm.Pres_Graph[X].Parents 381 | to_write = np.asarray(Alarm.Pres_Graph[X].cpt_data.sort_values(l, ascending = True)['p']) 382 | to_write = ["{:10.4f}".format(item) for item in to_write] 383 | to_write = str(to_write)[1:len(str(to_write))-1].replace("'", "") 384 | to_write = to_write.replace(",", "") 385 | to_write = to_write.replace(" ", " ") 386 | to_write = to_write.replace(" ", "") 387 | output.write('\ttable '+ to_write + " ;\n") 388 | i+=1 389 | else: 390 | output.write(line0) 391 | 392 | 393 | if __name__ == '__main__': 394 | print "This file contains utility functions: Run main.py" 395 | 396 | 397 | 398 | 399 | --------------------------------------------------------------------------------
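The E-step inference the README describes - P(X | data) = P(X | mb(X)), with the products accumulated as log probabilities - can be illustrated standalone. The sketch below is not part of the repository; it reuses the toy X/Y/Z network and CPTs from the README's input-format section, and the function name `posterior_x_given_mb` is hypothetical.

```python
import math

# Toy CPTs from the README's X/Y/Z example (not the Alarm network).
# Prior P(X); P(Z | X, Y) indexed by (z, x, y) with values 'T'/'F'.
P_X = {'T': 0.2, 'F': 0.8}
P_Z = {('T', 'T', 'T'): 0.2, ('T', 'T', 'F'): 0.4,
       ('T', 'F', 'T'): 0.3, ('T', 'F', 'F'): 0.5,
       ('F', 'T', 'T'): 0.8, ('F', 'T', 'F'): 0.6,
       ('F', 'F', 'T'): 0.7, ('F', 'F', 'F'): 0.5}

def posterior_x_given_mb(y, z):
    """Weight for a row with X missing: P(X | mb(X)) is proportional to
    P(X) * P(z | X, y), accumulated in log space."""
    log_w = {}
    for x in ('T', 'F'):
        log_w[x] = math.log(P_X[x]) + math.log(P_Z[(z, x, y)])
    m = max(log_w.values())  # subtract the max before exp to avoid underflow
    w = dict((x, math.exp(lw - m)) for x, lw in log_w.items())
    s = sum(w.values())
    return dict((x, w[x] / s) for x in w)

# E-step weights for the README's sample record "?", "False", "False":
w = posterior_x_given_mb('F', 'F')
# w['T'] = 0.2*0.6 / (0.2*0.6 + 0.8*0.5) = 0.12/0.52 ≈ 0.2308
```

These normalised weights play the role of the fractional counts the README mentions for the Maximisation step: each possible completion of the missing value contributes its weight (between 0 and 1) to the relevant CPT counts before re-normalisation.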