The software consists of four stages, as shown in the flow diagram: signal processing, activation map calculation, conduction inhomogeneity quantification, and inhomogeneity level determination using machine learning. The details of each algorithm are discussed below.
- Signal Processing
The background-subtracted data is then combined into larger bins to reduce spatial noise and to increase processing efficiency. The default binning factor is 3, which combines 3x3 pixels into a single bin. The binned images are passed through a 100th-order finite impulse response (FIR) filter whose low-pass and high-pass cutoff frequencies are selected by the user. The filter is designed with the Parks-McClellan Remez exchange algorithm, a standard iterative algorithm that finds the optimal Chebyshev FIR filter: it minimizes the error in both the pass and stop bands using Chebyshev approximation theory, iterating until both pass-band and stop-band specifications are met.
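The sketch below illustrates these two steps in MATLAB; firpm is MATLAB's Parks-McClellan design routine. The variable names (frames, fsamp, f1, f2) and the 20% transition-band widths are illustrative assumptions, not taken from the released package.

```matlab
% 3x3 spatial binning of an H x W x T image stack (assumed layout).
bin = 3;
[H, W, T] = size(frames);
Hb = floor(H / bin);  Wb = floor(W / bin);
binned = squeeze(mean(reshape(frames(1:Hb*bin, 1:Wb*bin, :), ...
                              bin, Hb, bin, Wb, T), [1 3]));    % Hb x Wb x T

% 100th-order equiripple band-pass FIR via Parks-McClellan.
% f1 (high-pass) and f2 (low-pass) are the user-selected cutoffs in Hz.
fn = [f1 f2] / (fsamp / 2);                       % normalize to Nyquist
b  = firpm(100, [0 0.8*fn(1) fn(1) fn(2) min(1.2*fn(2), 0.99) 1], ...
           [0 0 1 1 0 0]);
trace = filter(b, 1, double(squeeze(binned(1, 1, :))));  % one pixel trace
```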
The last step in signal processing is drift removal, which corrects for baseline drift in the recordings due to photobleaching or motion [1]. A fourth-order polynomial is fitted to the signal and subtracted from it. As shown in Figure 7, a fourth-order polynomial (red dashed line) is fitted to the optical signal (black) and subtracted to yield a signal with a steady baseline (blue).
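A minimal sketch of this step for a single pixel trace, assuming the illustrative names trace and fsamp from the previous sketch:

```matlab
% Fourth-order polynomial drift removal for one pixel trace.
t = (0:numel(trace)-1)' / fsamp;        % time axis in seconds
p = polyfit(t, trace(:), 4);            % fit the slow baseline drift
detrended = trace(:) - polyval(p, t);   % subtract to a steady baseline
```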
- Activation map calculation
- Conduction inhomogeneity quantification
The first method is phase difference mapping. The time difference between each pixel and each of its neighboring pixels is calculated, and the maximum time difference is taken as the phase of that pixel. After evaluating the phase on a pixel-by-pixel basis, the output feature, the inhomogeneity index, is calculated as

$\mu = \dfrac{t_{95} - t_{5}}{t_{50}}$

where $t_{95}$ is the 95th percentile activation time, $t_{5}$ is the 5th percentile activation time, and $t_{50}$ is the median activation time.
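A sketch of this computation in MATLAB, assuming actmap holds the 2D activation map in ms and that the percentiles are taken over the per-pixel phase values (that reading, and the variable names, are assumptions); ordfilt2 computes a max or min over a neighborhood:

```matlab
% Phase of each pixel: maximum absolute activation-time difference
% to any pixel in its 3x3 neighborhood.
mx    = ordfilt2(actmap, 9, true(3));   % neighborhood maximum
mn    = ordfilt2(actmap, 1, true(3));   % neighborhood minimum
phase = max(mx - actmap, actmap - mn);

% Inhomogeneity index from the 5th, 50th and 95th percentiles.
p  = prctile(phase(:), [5 50 95]);
mu = (p(3) - p(1)) / p(2);              % (t95 - t5) / t50
```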
The second method is gray-level co-occurrence feature extraction. Using the MATLAB built-in function "graycomatrix", a gray-level co-occurrence matrix is created: the activation map is first scaled into 8 gray levels over the range 0 to 50 ms, and the frequency with which gray level $i$ occurs adjacent to gray level $j$ is recorded in the co-occurrence matrix $P(i,j)$. The two inhomogeneity parameters, homogeneity ($E$) and correlation ($Cor$), are then calculated as follows [4]:

$E = \sum_{i,j} \dfrac{P(i,j)}{1 + |i - j|}$

$Cor = \sum_{i,j} \dfrac{(i - \mu_i)(j - \mu_j)\,P(i,j)}{\sigma_i \sigma_j}$

where $\mu_i$, $\mu_j$, $\sigma_i$, $\sigma_j$ are the means and standard deviations of the marginal distributions of $P(i,j)$.
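In MATLAB both features can be obtained directly from the built-in pair graycomatrix/graycoprops; actmap is the assumed activation-map variable:

```matlab
% 8-level GLCM over the 0-50 ms range, as described above.
glcm  = graycomatrix(actmap, 'NumLevels', 8, 'GrayLimits', [0 50], ...
                     'Symmetric', true);
stats = graycoprops(glcm, {'Homogeneity', 'Correlation'});
E   = stats.Homogeneity;    % homogeneity feature
Cor = stats.Correlation;    % correlation feature
```

The 'Symmetric' option is an assumption here; the package may use graycomatrix's default single-direction offset instead.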
The third method is a parametric 3D extrusion method modified from the MRI surface area method proposed in [6]. The activation time is treated as the third dimension (height), and the 2D activation map is extruded into a binary 3D object $I$, where 1's denote voxels inside the object and 0's denote voxels outside it. The total surface area is the sum of the surface area of each voxel minus the surface area lost to neighboring voxels [6]:

$SA = \sum_{i=1}^{N} \left( A_i - L_i \right)$
where $A_i$ is the surface area of voxel $i$, $L_i$ is the total surface area lost for voxel $i$, and $N$ is the total number of voxels. Practically, in this project the total surface area lost is calculated from a convolution of the binary 3D object $I$ with a 3x3x3 kernel $F$ that is 1 on the center voxel of each face and 0 elsewhere:

$L = I * F$
The final inhomogeneity parameter is the normalized surface area, defined by:

$NSA = \dfrac{SA}{V}$
where $V$ is the total volume of the object, calculated by summing the binary 3D object $I$.
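A hedged sketch of the whole extrusion pipeline, assuming unit voxels (so each voxel face has area 1 and $A_i = 6$), a 1 ms extrusion step over the 0-50 ms range, and the illustrative variable name actmap:

```matlab
% Extrude the 2D activation map into a binary 3D object.
nz = 50;                                    % extrusion depth (0-50 ms)
[H, W] = size(actmap);
I = zeros(H, W, nz);
for k = 1:nz
    I(:,:,k) = actmap >= k;                 % 1 inside the extruded object
end

% Kernel F: 1 on the center voxel of each face of the 3x3x3 cube.
F = zeros(3, 3, 3);
F(2,2,1) = 1; F(2,2,3) = 1; F(2,1,2) = 1;
F(2,3,2) = 1; F(1,2,2) = 1; F(3,2,2) = 1;

% Each solid voxel exposes 6 faces minus one per solid face-neighbor;
% the convolution counts those neighbors.
neighbors = convn(I, F, 'same');
SA  = sum(6*I(:) - I(:).*neighbors(:));     % total exposed surface area
V   = sum(I(:));                            % object volume
NSA = SA / V;                               % normalized surface area
```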
- Inhomogeneity level determination
A 10-fold cross validation is used: 10 trees are generated, each with 1/10 of the data set held out, and the validation errors are averaged to obtain an unbiased estimate of the performance of the learned hypothesis. Cross validation maintains the balance between generalization and the reliability of the validation: the validation set needs to be large enough to ensure the validity of the prediction on out-of-sample data, yet small enough to minimize the discrepancy between each fold's training set (where 1/10 of the samples are excluded) and the whole data set (all the training data included). The cross-validation error $E_{cv}$ offers an unbiased estimate of the out-of-sample error $E_{out}$ because $E_{out}$ is the expectation of the individual validation errors over the input space; the training set in each fold offers sufficient data points, and the folds provide diversity between the different data sets.
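A sketch of this procedure with MATLAB's Statistics and Machine Learning Toolbox; X (one row of the four features per recording) and y (the inhomogeneity labels) are assumed variable names, and fitctree stands in for whichever tree learner the package actually uses:

```matlab
% 10-fold cross validation of the decision tree on the 4-D feature set.
cv   = cvpartition(y, 'KFold', 10);
errs = zeros(cv.NumTestSets, 1);
for k = 1:cv.NumTestSets
    tree    = fitctree(X(cv.training(k), :), y(cv.training(k)));
    yhat    = predict(tree, X(cv.test(k), :));
    errs(k) = mean(yhat ~= y(cv.test(k)));  % assumes numeric labels
end
Ecv = mean(errs);   % averaged validation error, the estimate of E_out
```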
A reduced-error pruning algorithm is enabled to control overfitting. In this method, the whole data set is split into a training set and a validation set. The tree is first grown freely on the training data. Once the full tree is obtained, starting from the bottom of the tree, each internal node is collapsed if doing so reduces the validation error of the tree, evaluated by applying the learned tree to the held-out validation set. This process is repeated recursively up the tree until the error is no longer reduced. Because it is a post-pruning technique, it avoids the risk of missing the best places to prune when the gain at one level is low but subsequent levels provide larger gains.
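MATLAB's built-in prune works on precomputed cost-complexity levels rather than node by node, so the sketch below selects the pruning level that minimizes held-out validation error; it is a stand-in for the node-wise reduced-error pruning described above, with Xtrain/ytrain/Xval/yval as assumed variable names:

```matlab
% Grow the full tree, then keep the most heavily pruned level that
% does not increase the validation error.
tree     = fitctree(Xtrain, ytrain);
bestErr  = inf;
bestTree = tree;
for lvl = 0:max(tree.PruneList)             % 0 = unpruned full tree
    cand = prune(tree, 'Level', lvl);
    err  = mean(predict(cand, Xval) ~= yval);
    if err <= bestErr                       % prefer smaller trees on ties
        bestErr  = err;
        bestTree = cand;
    end
end
```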
The decision tree, once generated, is stored inside the software package for future use. Every time a new data set is loaded into the software, the four features are calculated from it, and the software predicts the data set's label by applying the pre-determined decision tree to the four-dimensional input. The software is therefore able to determine the inhomogeneity level automatically, using the decision tree algorithm coupled with cross validation and pruning.
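Applying the stored tree to a newly loaded recording then reduces to the following, where mu, E, Cor, and NSA are the four features computed in the sketches above and bestTree is the stored pruned tree:

```matlab
features = [mu, E, Cor, NSA];            % 1 x 4 feature vector
level    = predict(bestTree, features);  % predicted inhomogeneity level
```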