Software for Matching Standard Activity Enzyme Biosensors for Soil Pollution Analysis

This work is dedicated to developing enzyme biosensor software to solve problems regarding soil pollution analysis. An algorithm and specialised software have been developed which stores, analyses and visualises data using JavaScript programming language. The developed software is based on matching data of 51 non-commercial standard soil samples and their inhibitory effects on three enzyme systems of varying complexity. This approach is able to identify the influence of chemical properties soil samples, without toxic agents, on enzyme biosensors. Such software may find wide use in environmental monitoring.


Introduction
Enzyme biosensors are characterised by a high level of specificity and sensitivity to the analyte, which has led to their wide use for the detection of various chemical and biological substances in clinical trials, in food and in environmental monitoring [1,2]. Currently, biosensors are mainly used in the determination of pesticides (primarily organophosphorus compounds and carbamates) and heavy metal salts [3]. We previously showed the potential use of enzyme systems (bioluminescent and non-bioluminescent) of varying complexity as indicator systems to evaluate the level of pollution of soil samples [4]. The mechanism of actuation of environmental contaminants on the activity of the enzyme systems is based on the detection of fluctuation of enzymatic activities in the presence of the contaminants with respect to the control. The fluctuation can be as specific as non-specific effects [5], but the bioluminescent enzyme systems usually show integral effects, since they are qualitative assays [6]. It has been shown that the bioluminescent enzyme system has sensitivity to heavy metal salts and nanomaterials [4,5,7]. Also butyrylcholinesterase has demonstrated the high sensitivity to carbamate pesticides [4,8] Besides, studies have revealed the difficulties of enzymatic bioassay of soils associated with the active influence of soil extracts on both the activity of enzymatic systems and their interaction with toxic agents [7]. This problem is also relevant for other bioassay methods that use biosensors [9]. It is likely that the effects caused by the complexity of the studied environment can be determined by comparing it with a reference (standard, uncontaminated) soil sample, similar to the approaches used in chemical testing. Unfortunately, to date, there is no consensus on which soils can be utilized as a standard. The first approach is the use of a model soil (MS) developed for earthworm bioassay [10]. Despite the advantages of this approach, MS does not reflect the entire variety of soils and their properties, on which the distribution of pollutants and their bioavailability depend [11][12][13]. In addition, the possible influence of the parameters of the soil itself (without toxic agents) on the test objects is not considered [14].
The literature describes several approaches for the formalisation of these results of soil condition assessment. One such approach is the toxicity test battery integrated index, which is defined as the average of all bioassay responses with different weight coefficients [15]. For determining toxicological parameters according to the degree of inhibition by toxic agents, Ecoscores were suggested [16]. The so-called triad method takes into account the data from chemical testing, bioindication and toxicological testing to determine the integral index of the soil status. Chemical substances, bioassay function value and bioindication function value in the sample are used as indicators [17]. A significant limitation of the above approaches is that it is necessary to know the concentration of toxic agents in soils to calculate soil indices, which is not always possible to determine, especially if it is not specific, but an integral bioassay method is used. The most promising method for storing, identifying and managing soil sample data using integral bioassay methods is a system based on barcode and WebGIS technology. Such systems are based on the principle of a global database, with each sample being assigned an individual number (barcode) [18] and are convenient tools.
To sum up, there is still a lack of interpretation tools results received by studying highly complex samples. A logical development of a bioassay methodology system using biosensors is to combine the above approaches-standard soils for comparison and software for their analysis and visualisation-into a single tool. Therefore, this study aimed to develop a novel approach for improving and interpreting the results of enzymatic bioassays that is applicable to the analysis of complex environments using standard soil samples. In this study, we describe the development of a specialised enzymatic biosensor software.

Basis for Software Development
The software product was based on the Euclidean distance (ED) method Equation (1). A prerequisite for the new approach to work is the availability of standard soil samples, which are matched to each new test sample. The search algorithm finds the minimum difference between the test sample and the standard sample using Equation (1).
The requirements for the software are as follows: (1) it should contain information regarding the soil characteristics presented in Table 1 and (2) it should allow working with data in any Internet browser independently from the operating system and platform used.

Experiment Scheme and Determination of Soil Characteristics of Standard Soil Substrates
To develop software for matching standard activity enzyme biosensors for soil pollution analysis was prepared 51 soil samples, which was collected from the special foreststeppe zones of Krasnoyarsk region, Russia. Complete factorial experiment schemes included variants of types and varieties of soils and minerals with different particle size distribution and humus content. Based on the five basic soil substrates Figure 1a (sandy loam, light, medium and heavy loam, high-humus soil), 12 varieties of soil substrates were designed, each differing in particle size distribution and humus content Figure 1b. The organic matter (humus), pH of the soil solution, and the particle size distribution of the soil were determined according to previously described methods [19][20][21]. The soil organic matter (humus) is determined by modified Tyurin photometric method. Soil samples are sieving through a 2.5 mm sieve and removal of visible roots and plant residues. The soil organic matter is oxidized by potassium dichromate solution with sulphuric acid and heated at boiling point for 1 h. After cooling the suspension is washed with water, agitated and left to settle. The soil organic matter is established photometrically (λ 590 nm). The mass of organic matter in the analyzed sample is determined by the calibration graph. The pH of the soil solution is determined by a glass electrode in a 1:2.5 (volume fraction) suspension of soil in 1 mol/L potassium chloride solution (pH in KCl). The particle size distribution of the soil is determined by standard pipette analysis (similarly to ISO 11277:2002-08, 2002).
The optical density of water soil extracts (1:5 volume fraction) to determine the total amount of solute was determined at a wavelength of 250 nm (D250) using a Cary 5000i spectrophotometer (Agilent Technologies).
The optical density of the solution was recorded over 5 min at 412 nm using a UV-2600 spectrophotometer (Shimadzu, Japan). The change in optical density was used to calculate the rate of hydrolysis of the BChE substrate, and from the obtained results, the relative BChE activity (A) in the reaction mixture was determined.
To assess the effect of the inhibitor on BChE activity, the residual BChE activity (OA) in the reaction mixture was calculated using the following formula OA = (A/A 0 ) * 100%, where A is the rate of S-BCh-I BChE hydrolysis in the test solution and A 0 is the rate of S-BCh-I BChE hydrolysis in control solution.
To determine the activity of the conjugated system NADH:FMN-oxidoreductase + luciferase (R + L), a reaction mixture of the following composition was used: 300 µL of 0.05 M potassium phosphate buffer (pH, 6. The kinetic parameters of the bioluminescent trienzyme system lactate dehydrogenase + NADH:FMN-oxidoreductase + luciferase (LDH + R + L) were measured in a mixture containing the following components: 300 µL of 0.05 M potassium phosphate buffer (pH, 6. For two-and three-enzyme systems, the reaction mixture was placed in a luminometer cuvette (PromegaGloMax 20/20 Luminometer, USA), and the luminescence intensity was measured over 300 s. The residual luminescence was calculated according to the formula (I/I 0 ) × 100%. At I/I 0 > 80%, the analysed soil sample was considered to have no impact, at 50% < I/I 0 < 80%, the analysed soil sample was considered as having an impact, and at I/I 0 < 50%, the analysed soil sample was considered to have a significant impact.

Results and Discussion
Difficulties in interpreting the results of studying the composition of complex environments can be resolved by using reference standard samples. With this approach, the intrinsic influence of the sample is taken into account when examining samples. The success of the search for a standard directly depends on the measurement and approximation method. In most cases, ED Equation (1) is used to find the distance between objects with quantitative characteristics and the same dimensions. The input data is an array of vectors m × n, where m is the number of characteristics of a standard soil sample and n is the number of standards. For the convenience of presentation, the data were reconstructed to the form of a matrix Equation (2): Another input is the data for one test soil sample, which are converted into one column vector of dimensionality m × 1 The process of finding the closest possible standard sample is as follows Figure 2: 1.
The program reads reference soil's data from a *.js file with json object inside. 2.
The number of reference soils (k) is estimated. 3.
The program reads sample soil's data from the program's input box.

4.
Sample's index is equating to zero. The minimum deviation is equal to the maximum possible number.

5.
Loop through each element of the source data array. 6.
Based on Equation (1), the ED is calculated for the reference and investigated soils. 7.
If calculated ED is lower than minimum deviation, minimum deviation is equating to ED. Index of minimum reference deviation is equating to current loop value. 8.
According to the obtained index from the array of reference soil, all the data for the most suitable reference soil are obtained.
The output of the algorithm is the tested standard sample. Before graphically displaying the data, the data is made uniform using the formula y = y i /y max × 100%. Bringing all data on one scale allows us to more accurately assess the contribution of each parameter, displaying them in one graph in one scale. To fill the database of the application with information regarding standard soils and their effect on enzymatic systems, 51 samples of conditionally pure model soils were examined. Soil samples have different particle size distributions, exchange acidity and soil organic matter contents, which varies widely, from 0.37% to 8.50%, with increments of 1%, 3% and 5% (see Table S1). The content of physical clay ranged from 8.6% to 47.5%, and the varieties were represented by cohesive sand, sandy loam and light, medium and heavy loams. The pH (in KCl) of salt suspension also differed, from 5.45 to 8.00 units, which indicates a weakly acidic, close to neutral, neutral, slightly alkaline and alkaline reaction of the environment. Analysis of all the obtained data allows to assert that the tested used a wide variety of soils and rocks with a wide range of variation of basic properties.
The effect of water extracts from standard soil-based soils on single-, two-and threeenzyme systems was studied. For the BChE system, no significant inhibition by soil extracts was observed, as the % OA parameter remained at least 95%. For some samples, the opposite activation effect was observed (an increase in OA relative to the control), in some cases OA level reached 155%. The residual fluorescence T2, % of the two-enzyme system was in the range of 49% to 103%. The greatest inhibiting effect was shown by soil samples from groups with a high humus content and a heavier particle size distribution. The residual luminescence of the three-enzyme system T3, % varied in the range of 22% to 100%, which indicates its high sensitivity to soil components. The D250 optical density value ranges from 0.16 to 1.59. There is a clear pattern sample group effect: the heavier the soil in terms of particle size distribution, the higher the optical density of the extract from this soil. The above measured parameters in combination with soil characteristics humus content, pH and particle size distribution, see Table 1, have been entered into the application. These standards have been collected, described, analysed, and then categorised, generalised and added to the general database. Thus, the database consists of 51 reference soil samples, divided into 17 type categories (legend for soil categories). Each category has three different reference examples. An example of input data file structure is shown in Table 2.   An application has been developed in the JavaScript programming language based on the algorithm Figure 2. When the user enters experimental data, the application finds a data record on the reference sample and passes the data about the standard and the test sample to the chart library (chart.js). Afterwards, the found standard sample and the test sample are displayed on a single graph.
Visually, the application includes input boxes, an Output Data button and a chart area. When the application starts, the input boxes are not filled in Figure 3a. The user of the application (the researcher) enters the experimentally obtained characteristic measurement data of the test soil into the corresponding boxes. After the user clicks the "Find" button, the software searches for the appropriate reference standard based on the previously described algorithm. After the search, the application outputs a chart and, in the lower part of the chart, the code name of the standard soil. The chart simultaneously displays soil characteristic values for the standard sample (pink bars) and the test soil (blue bars). Of note, the data on the characteristics of the test soil and the standard soil are unified, that is, they are brought to one scale, where the maximum value of the standard sample characteristic is taken as 100%. This unification is due to the fact that the percentage deviation of the test sample from standard is more informative than a specific absolute value. For convenience of comparison, value overlap is shown in violet. In addition, on mouse cursor hover over any of the bars, its numerical value is shown Figure 3b.
The application can also display the bars of the standard and test sample separately, which can be switched on and off by selecting the appropriate category in the legend at the top of the chart Figure 4.
A clear advantage of our application is that it already contains the data on various standard soils and allows us to match a reference standard based on soil characteristics of test samples. The design of the application allows us to collect and analyse soils characteristic of different territories and enter them into the application's database as standard soil samples.  . Application graph, the "reference" category is not active (crossed out) and the data is only displayed for "sample".

Conclusions
We have developed specialised enzyme biosensor software for soil pollution analysis. This application contains information on the characteristics of standard soils and the effect of aqueous extracts from these soils on three enzymatic systems. This information can be considered as having a background inhibitory effect on the components of the biological module of the biosensor produced by soil samples, which is a valuable component. This software may find wide use in environmental monitoring.