If you don't remember your password, you can reset it by entering your email address and clicking the Reset Password button. You will then receive an email that contains a secure link for resetting your password
If the address matches a valid account an email will be sent to __email__ with instructions for resetting your password
Faculdade de Ciências, Universidade do Porto, Portugali3S - Instituto de Investigação e Inovação em Saúde, Universidade do Porto, PortugalIPATIMUP - Instituto de Patologia e Imunologia Molecular da Universidade do Porto, Portugal
i3S - Instituto de Investigação e Inovação em Saúde, Universidade do Porto, PortugalIPATIMUP - Instituto de Patologia e Imunologia Molecular da Universidade do Porto, PortugalCMUP, Centro de Matemática da Universidade do Porto, Portugal
A set of 156 pairs of anonymized real casework mixture samples were analyzed.
•
LRmix Studio, EuroForMix and STRmix tools were considered and results compared.
•
Greater result discrepancies were found between qualitative and quantitative tools.
•
Differences were also observed between quantitative software.
•
Expert's training and knowledge of the tools’ enclosed models are crucial.
Abstract
To overcome the multifactorial complexity associated with the analysis and interpretation of the capillary electrophoresis results of forensic mixture samples, probabilistic genotyping methods have been developed and implemented as software, based on either qualitative or quantitative models. The former considers the electropherograms’ qualitative information (detected alleles), whilst the latter also takes into account the associated quantitative information (height of allele peaks). Both models then quantify the genetic evidence through the computation of a likelihood ratio (LR), comparing the probabilities of the observations given two alternative and mutually exclusive hypotheses.
In this study, the results obtained through the qualitative software LRmix Studio (v.2.1.3), and the quantitative ones: STRmix™ (v.2.7) and EuroForMix (v.3.4.0), were compared considering real casework samples. A set of 156 irreversibly anonymized sample pairs (GeneMapper files), obtained under the scope of former cases of the Portuguese Scientific Police Laboratory, Judiciary Police (LPC-PJ), were independently analyzed using each software. Sample pairs were composed by (i) a mixture profile with either two or three estimated contributors, and (ii) a single contributor profile associated. In most cases, information on 21 short tandem repeat (STR) autosomal markers were considered, and the majority of the single-source samples could not be a priori excluded as belonging to a contributor to the paired mixture sample. This inter-software analysis shows the differences between the probative values obtained through different qualitative and quantitative tools, for the same input samples. LR values computed in this work by quantitative tools showed to be generally higher than those obtained by the qualitative. Although the differences between the LR values computed by both quantitative software showed to be much smaller, STRmix™ generated LRs are generally higher than those from EuroForMix. As expected, mixtures with three estimated contributors showed generally lower LR values than those obtained for mixtures with two estimated contributors.
Different software products are based on different approaches and mathematical or statistical models, which necessarily result in the computation of different LR values. The understanding by the forensic experts of the models and their differences among available software is therefore crucial. The better the expert understands the methodology, the better he/she will be able to support and/or explain the results in court or any other area of scrutiny.
Whenever biological material is found during a criminal investigation, the standard practice in most countries is to perform DNA analysis by capillary electrophoresis, after PCR amplification. DNA mixtures, i.e., samples containing contributions from more than one donor (exact number unknown), are typically encountered in this context. An estimation of the number of contributors of the DNA mixture is a parameter required by probabilistic genotyping systems to evaluate the weight of the evidence. Although alternative methods have been proposed [
Identification of anti-HIV active dicaffeoylquinic-and tricaffeoylquinic acids in Helichrysum populifolium by NMR-based metabolomic guided fractionation.
], where the expert establishes the minimum number of contributors based on the number of observed alleles. This estimation may be a challenging procedure, especially when involving high order mixtures (three or more contributors) and/or low levels of DNA, and may not reflect the actual number of contributors.
Beyond the unknown number of contributors, mixture DNA samples may have associated several other characteristics that make it complex and difficult to interpret and analyze, such as allele sharing between contributors, contributions in different proportions, stutters which may be confounded with alleles of a minor contributor (or vice versa), amplification stochastic effects (heterozygotic imbalance, drop-in, and drop-out) due to low-template DNA, or degradation [
To minimize the impact of the abovementioned conditions and quantify the weight of the forensic genetic evidence under a robust and scientifically supported statistical approach, probabilistic genotyping informatics tools were developed. The standard approach relies on the computation of a Likelihood Ratio (LR) [
]). The qualitative models only consider the qualitative information of the electropherogram (observed alleles in the profile, after the screening of the expert), while the quantitative ones also consider the quantitative information (height of the detected peaks, both alleles and stutters). Therefore, the first approach has simpler algorithms and computations compared to the second, which is more complete and produces complex results that are more sensitive to artifacts and stochastic effects [
Several works have been developed to evaluate and compare the performance of the different existing software resorting to controlled, mocked, mixture samples [
DNA mixtures interpretation – a proof-of-concept multi-software comparison highlighting different probabilistic methods’ performances on challenging samples.
]. In this case, and in the absence of contamination, the number of contributors and respective proportion of the analyzed mixture samples is, a priori, known.
Under the framework of forensic identification problems, the main goal of this paper is to compare the results obtained by widely used probabilistic genotyping programs, considering real casework samples. This type of samples was chosen due to their unknown composition, uniqueness, and inability to predict and replicate, carrying a much higher level of complexity than mock ones. The software used in this work were based on qualitative – LRmix Studio v.2.1.3 [
R Core Team, R: A language and environment for statistical computing, R Foundation for Statistical Computing, Vienna, Austria (2020) URL https://www.R-project.org/.
] and assumes either Maximum Likelihood Estimation (MLE) or Bayesian inference approaches. Gamma distribution is assumed for modeling peak heights, and Exponential is used for drop-in occurrence. On other hand, STRmix™ (http://strmix.esr.cri.nz/) is a commercial software based on Bayesian approach using Markov Chain Monte Carlo (MCMC). A log-normal distribution is assumed for the peak height and the drop-in is modeled using either a Gamma or Uniform distribution. Indeed, despite both being quantitative software, there are significant differences on the considered approaches to analyze the data, which necessarily results in different evidence quantifications – for a comprehensive review see [
A set of 156 irreversibly anonymized mixture/single contributor sample pairs (mixtures with two and three estimated contributors equally distributed) were selected from real casework and analyzed at Portuguese Scientific Police Laboratory, Judiciary Police (LPC-PJ). For most of the pairs, the single contributor sample could not be excluded from being from one of the respective mixture contributors. The samples were previously processed in the context of the respective casework, under manufacturer protocols. The extraction was carried out in the Automate Express™ Forensic DNA Extraction System (Applied Biosystems™), using the PrepFiler Express BTA™ Forensic DNA Extraction kit (Applied Biosystems™) and QIAsymphony SP/AS Extraction System using the QIAsymphony DNA Investigator Kit, followed by the quantification in the equipment 7500 Real-Time PCR System (Applied Biosystems™), using the Quantifiler™ Trio DNA Quantification kit (Applied Biosystems™). Then, the amplification was performed in thermal cycler GeneAmp® PCR System 9700 (Applied Biosystems™) using GlobalFiler™ PCR Amplification Kit and GlobalFiler™ Express PCR Amplification Kit (Applied Biosystems™), both 24-locus STR kits; and, finally, the PCR products were detected and separated by capillary electrophoresis in 3500 Genetic Analyzer with GeneMapper™ ID-X (Applied Biosystems™), with an analytical threshold equal to 100 RFU. This work was then developed considering the genetic information obtained (GeneMapper files) for each irreversibly anonymized pair of samples (mixture/single contributor) for the set of 21 autosomal short tandem repeat markers analyzed.
For each mixture/single contributor sample pair, the quantification of the genetic evidence was computed through a global likelihood ratio (LR) assuming the following alternative hypotheses: H1: “DNA originated from the person of interest (POI) and N-1 unknown, unrelated individual(s)”, and H2: “DNA originated from N unknown, unrelated, individuals”, where N represented the estimated number of contributors. The LR value was computed for the same set of markers using LRmix Studio v.2.1.3 [
]. In all the cases, the single contributor sample was assumed to come from the POI. In EuroForMix, the MLE approach was employed. All LR values were computed using the allelic frequencies of the National Institute of Standards and Technology (NIST) database concerning the Caucasian population [
To compare the results obtained by the three software, as much as possible similar conditions were maintained for the parameters described in Table 1. A co-ancestry coefficient (FST) equal to 0.01 was considered in all the tools; this value was used by studies with similar populations [
], EuroForMix using a Lambda distribution (λ), where the software’s default value of 0.01 was used, and STRmix™ employs either a Gamma (ɣ) or Uniform distribution, the latter having been considered in this work. A minimum allele frequency of 0.001 (default value in LRmix Studio) was used in both LRmix Studio and EuroForMix, while in STRmix™ this value is locus-specific and a N equal to 0 was considered. In both EuroForMix and STRmix™, the allele frequencies were normalized. Regarding dropout, in LRmix Studio the pre-set value of 0.1 was used, while in both quantitative programs this parameter is directly estimated through the peak heights distribution [
]. A threshold detection equal to 100 RFU was considered in both quantitative software.
Table 1Parameter values introduced in LRmix Studio v.2.1.3, EuroForMix v. 3.4.0, and STRmix™ v.2.7, to perform LR computations. N/A: Not Applicable. a Dropout is directly estimated through the peak height distribution. b Per locus specified by the software considering N = 0.
For the other parameters, specific to each software implementation, the default values provided by the authors were considered. Since STRmix™ software requires the inclusion of the stutters in the mixture sample files, they were also included in the EuroForMix analysis, both back and forward. LRmix Studio does not consider stutters and therefore its inclusion is not recommended. For LRmix Studio only, the selection of allelic peaks was decided by the same expert for all samples. In the quantitative software the stutter is automatically accommodated by the software, which introduce another source of variation between computed LR values. In the cases where a LR equal to zero was obtained for specific markers, the corresponding markers were removed from the analysis and the corresponding global LR value was denoted as LR<21, as less than 21 markers were analyzed in the three tools.
LR values (log10 scale) for the same mixture/ single contributor sample pair and set of markers were obtained through EuroForMix, LRmix Studio, and STRmix™. Three pairwise differences were then computed for each pair mixture/single contributor sample. Statistical tests (chi-square test and test-t, respectively) were carried out to infer significant trends (ɑ = 0.05): (i) a specific software provided greater or smaller LR values than the others, and (ii) LR values obtained for mixtures with two estimated contributors showed to be higher/smaller than those with three.
3. Results and discussion
The global LR values obtained for each of the 156 mixture (with two or three estimated contributors)/ single contributor sample pairs analyzed were computed using the three software – LRmix Studio v.2.1.3 [
Fig. 1LR values (log10 scale) computed for the same mixture/ single contributor sample pairs, with two (upper plot) and three estimated contributors (lower plot) in LRmix Studio v.2.1.3 (circle), EuroForMix v.3.4.0 (triangle) and STRmix™ v.2.7 (square). In the first 13 and 7 mixture/single contributor sample pairs illustrated in the upper and lower plot, respectively, less than 21 markers were analyzed (dashed line). Within the pairs highlighted with a dashed line in the upper plot, only 18 markers were considered in the first pair and 20 markers in the other 12 pairs. On the other hand, in the lower plot, in the first two pairs highlighted with a dashed line 19 markers were considered, and 20 markers in the remaining pairs highlighted with a dashed line.
STRmix™ returned a LR equal to zero for some mixture/ single contributor sample pairs, namely, in 17% and 9% of the cases with mixtures with two and three estimated contributors, respectively. In these cases, either alleles present in the single contributor sample were absent from the mixture sample, or MCMC sampling failed to identify the true genotype combination, i.e., MCMC algorithms failed to converge, as similarly observed in other studies [
]. Nevertheless, in this work, a LR equal to zero was observed indiscriminately for sample pairs whose single contributor profile considered could not be excluded from being either the major or minor contributor of the mixture.
Some approaches have been presented to circumvent this question, which include (i) ignore that locus during deconvolution, (ii) repeat the deconvolution with a random starting seed for the MCMC different than the one that resulted in a LR equal to zero, or (iii) repeat the deconvolution with an increase in number of MCMC parameters values [
]. In this case, for simplicity and to maintain the same conditions throughout STRmix™ computations (i.e., MCMC parameters values) these loci were ignored. Inevitably, to guarantee equal circumstances between all tools, these loci were also ignored in both EuroForMix and LRmix Studio software. Thus, as previously mentioned, these pairs were re-analyzed in the three programs, disregarding the marker(s) for which a LR equal to zero was achieved in STRmix™, and so, less than 21 markers (minimum 18) were considered per case. These specific situations are highlighted with a dashed line in Fig. 1.
STRmix™ LR values showed to be higher than those from EuroForMix for cases with mixtures with two estimated contributors (p = 2.35E-02), the same trend not being supported for the cases with mixtures with three estimated contributors (p = 8.21E-01). Also, LR values computed by STRmix™ and EuroForMix showed to be higher than those computed by LRmix Studio, for both cases with mixtures with two (p = 3.57E-16, for both software) and three estimated contributors (p = 1.09E-11 and p = 3.57E-16, resp.).
There are however three cases (2_2, 9_3, and 27_3) for which the LR value computed by LRmix Studio showed to be the highest. For the case 2_2, LR values retrieved by LRmix Studio, EuroForMix, and STRmix™ equated 3.04E+10, 7.43E+09, and 3.47E+08, respectively. For 9_3, LR values computed were 2.45E+09, 1.77E+09, and 1.89E-07, resp. And, finally, for case 27_3, LR values of 1.13E+12, 4.77E+11, and 1.51E+10, resp., were observed. As expected, these exceptions corresponded to mixture/single contributor sample pairs whose single contributor profile could not be excluded from being the minor contributor of the mixture under analysis. Since both EuroForMix and STRmix™ consider the quantitative information related to the height of the peaks, the computed LR values were lower than those computed by LRmix Studio, which does not consider that information.
], LR values obtained for the cases of mixtures with two estimated contributors showed to be higher than those obtained for cases with mixtures with three estimated contributors, for the three software: LRmix Studio (p = 1.00E-20), EuroForMix (p = 1.19E-03) and STRmix™ (p = 1.45E-03). The computation of lower LR values for cases with mixtures with three estimated contributors may be a consequence of the higher complexity associated, which may also be reflected in a greater dispersion of data. In both quantitative software, this trend was verified and corroborated by the standard deviation values computed for the distribution of LRs obtained by each software. The standard deviation values achieved for cases with mixtures with two and three estimated contributors were, respectively – in EuroForMix, 5.311 and 6.428, and in STRmix™, 6.397 and 8.077. On the other hand, for the qualitative software (LRmix Studio) the standard deviation values obtained, for cases with mixtures with two and three contributors showed to be 2.772 and 2.321, respectively, which do not support the trend obtained for quantitative software. The fact that qualitative software uses less information when compared to quantitative ones, may lower its sensitivity to more complex situations.
The observed discrepancies between the LR values (log10 scale) computed by the different software for the same mixture (with two and three estimated contributors)/single contributor sample pairs are summarized in Table 2. After calculating the difference between computed LR values (represented by D in Table 2), the number of cases with differences within the same range (zero and two units (log10 scale), two and four, and so on) was accounted and divided by N (total number of cases) to calculate the percentage presented. An example clarification regarding the way D was computed is presented made in Table S3.
Table 2Pairwise differences between the LR values (log10 scale) computed by LRmix Studio v.2.1.3, EuroForMix v.3.4.0, and STRmix™ v.2.7, for cases with mixtures with two and three estimated contributors, and maximum, mean, and median values of these differences.
D = ABS [Log10 (LR1) – Log10 (LR2)]
Estimated number of contributors
Two
Three
EuroForMix and LRmix Studio
STRmixTM and LRmix Studio
EuroForMix and STRmixTM
EuroForMix and LRmix Studio
STRmixTM and LRmix Studio
EuroForMix and STRmixTM
N = 78
N = 78
N = 78
N = 78
N = 78
N = 78
0 < D < 2 2 < D < 4 4 < D < 6 6 < D < 8 8 < D < 10 D > 10
As expected, the greatest differences were found when the results of the quantitative tool LRmix Studio were compared with those of the quantitative ones: EuroForMix and STRmix™. Indeed, it was noticed that the category with the highest frequency concerns the cases showing differences greater than ten units in a log10 scale – 37% and 46% in cases with mixtures with two, and 46% and 54% in cases with mixtures with three estimated contributors, respectively. Within the cases of mixtures with two estimated contributors, the greatest differences found between LRmix Studio and EuroForMix (LR = 3.20E+11 vs. LR = 5.14E+26) was smaller than the one found between LRmix Studio and STRmix™ (LR = 8.61E+08 vs. LR = 1.79E-08), the latter also supporting contrasting hypotheses. Similarly, for the cases of mixtures with three estimated contributors, the largest discrepancy observed between LRmix Studio and EuroForMix (LR = 1.73E+08 vs. LR=8.83E+26) was also smaller than the one observed between LRmix Studio and STRmix™ (LR = 8.59E+07 vs. LR = 1.88E+27).
In almost all the cases, LR values computed by the quantitative software (EuroForMix and STRmix™) were consistently higher than those computed by qualitative (LRmix Studio). Coincidentally, the single-source profile used as a reference in these cases could not be excluded from being the major contributor of the mixture. The non-consideration of the peak heights information by qualitative tools makes them less sensitive to several aspects, such as contributors in different proportions, which may result in these differences.
A different scenario was observed when comparing LR values obtained through the quantitative tools STRmix™ and EuroForMix. In both cases with mixtures with two and three estimated contributors, most of the cases (73% and 59%, respectively) did not differ by more than two units on a log10 scale. Regarding cases with mixtures with two estimated contributors, only one case showed a difference greater than ten units in a log10 scale between EuroForMix (3.48E+16) and STRmix™ (1.79E-08). It is noteworthy that this was the same sample pair for which LRmix Studio and STRmix™ showed the greatest divergence of results. For the cases with mixtures with three estimated contributors, 5% of the observed LR differences showed to be greater than ten units on a log10 scale. The maximum difference of 15.970 units (log10 scale) was observed in a case for which LR values retrieved by EuroForMix and STRmix™ equated 1.77E+ 09 and 1.89E-07, respectively.
For simplicity sake and practical reasons, only cases with differences higher than ten units in a log10 scale between quantitative software were further analyzed, namely, pairs 14_2, 9_3, 10_3, 13_3 and 16_3 – Fig. S1, Tables S4 and S5. None of these pairs contain alleles that have not been previously observed in the used allele frequency database, and therefore differences on the minimum frequencies values cannot justify these discrepancies, as previously shown in other study [
]. For these pairs, the great discrepancies observed in the global LR values resulted from the accumulation of small differences in the analysis of each marker within the specific cases. At this point, it is noteworthy that the differences found and the proportion of cases in these conditions were not greater because, within the same analysis, there were markers where the LR was greater when computed with STRmix™ than when EuroForMix was used, occurring the opposite situation for others. This resulted in smoother global LR differences than those that may have occurred if one of the two software consistently attributed lower (or higher) LR values than the other. Anyway, most of cases with differences greater than ten units (log10 scale) resulted from cases where EuroForMix computed LR values higher than STRmix in most markers.
An extended analysis of the greatest differences found between quantitative software was performed (LRs differing in more than log10 scale units), for both cases with mixtures with two and three estimated contributors.
Starting with the only case of mixture with two estimated contributors (pair 14_2), a difference of 24.288 units (log10 scale) was observed between the two quantitative programs: LR(EuroForMix) = 3.48E+ 16 and LR(STRmix™) = 1.79E-08. For this pair, the single contributor profile was considered the minor contributor of the mixture by both EuroForMix and STRmix™ with a mixture proportion of 14.28% and 25.91%, respectively. However, in a per-locus LR analysis, 19 out of the 21 markers favored inclusion (LR>1) in EuroForMix, while in STRmix™ only 9 showed the same trend, as described in Table S4. In Table S6, the profiles of the loci showing the greatest differences, namely, D10S1248 (2.947 units, log10 scale), D8S1179 (2.946), D22S1045 (2.900), and D16S539 (2.068) are described. Neither of the statistic diagnoses given by both software supported a good performance for this analysis. The peak height variability parameter in EuroForMix adjusted to 0.476, a very high value – meaning a poor describing model. A more tolerant peak variance parameter in this software allows a lower false exclusion rate and a higher rate of false support of non-contributors [
]. On the other hand, the STRmix™ equivalent parameter (peak height variance parameter) shows slightly larger values than the mode of the respective prior distribution, which may indicate stochastic effects or incorrect estimation of the number of contributors [
]. Also, the remaining STRmix™ secondary diagnostics – log(likelihood) and Gelman-Rubin – reinforce the software inability to describe the observed data. A negative log(likelihood) of − 253.95 was observed, which may be due to several reasons such as, profile used as a reference being very low level [
]. In Gelman-Rubin parameter, the acronym NaN (Not a Number) is presented. So, even though the diagnostics of both software pointed out the model bad fitting, the result obtained was very distinct, probably due to different modeling assumptions and parameters settings. STRmix™ models locus specific amplification efficiencies and consider expected stutter ratios defined using empirical data. On the contrary, EuroForMix does not model locus specific amplification efficiencies and has a blanket expected stutter rate. As EuroForMix does not allow a differential amplification between loci, the need for a more tolerant peak height variance parameter is understandable [
Within the cases with mixtures with three estimated contributors, the greatest difference occurred in a mixture/single contributor pair (9_3) with a difference of 15.970 units (log10 scale) between EuroForMix (LR = 1.77E+09) and STRmix™ (LR = 1.89E-07). Like the per-marker LR previously analyzed, most markers in EuroForMix (17 out of 21) supported inclusion, while in STRmix™ only 8 out of the 21 showed the same trend – see Table S4. The STR profiles of markers D8S1179, D13S317, and D19S433 are described in Table S7 since the greatest differences were found in those loci – 2.547, 2.092, and 2.087 log10 scale units, respectively. For this case, both EuroForMix and STRmix™ considered the single contributor profile the second contributor of the mixture with a mixture proportion of 15.67% and 22.92%, respectively. Evaluating the statistic diagnostics of EuroForMix and STRmix™, both tools supported a good performance for this analysis. EuroForMix peak height variability parameter adjusted to a value of 0.195, revealing a good describing model. STRmix™ secondary diagnostics also revealed a good fit of the model – a log(likelihood) of 28.58 and a Gelman-Rubin value of 1.06 indicating that STRmix™ was able to well describe the data and the model converged [
]. Contrarily to the situation previously analyzed considering two estimated contributors, the diagnostics of both software reflected the model good fitting in this case. The very discrepant results observed are probably due to different modeling assumptions and parameters settings, as stated before.
A more in-depth investigation on these cases where greater differences were reported by different informatics approaches are being performed and soon presented in a dedicated research article.
4. Conclusions
Informatics tools for the analysis and interpretation of forensic samples DNA results, perform a quantification of the genetic evidence through the computation of a probative value using electropherograms’ information.
Software based on quantitative models take into account a larger amount of data than the qualitative ones, as these only consider the detected alleles, disregarding the corresponding peak heights. So, while qualitative tools only include statistical models for probability of drop-out and drop-in, quantitative tools model also the peak height information (i.e., peak alleles, peak balance, and stutter peaks). Indeed, the major distinctive feature of existing software is the ability to consider the (inferred) DNA quantity (after amplification).
As expected, the greatest differences between LRs were generally found when results retrieved from qualitative and quantitative tools were compared. Notwithstanding, and despite the smaller differences, quantitative software also retrieved different LR results reflecting different mathematical modeling of various parameters. Indeed, even using similar parameter values, each software assigns a different statistical modeling to each parameter leading to the computation of different LR values. Nevertheless, some of the differences observed were unexpected and intriguing large. This prompted us to develop a deeper research to explore the characteristics of the samples that lead to such differences, which results will be presented in a dedicated research article.
The uniqueness, and inability to predict and replicate real casework mixture samples, were the main reasons for the selection of this kind of samples. The drawback is that its composition (donors and proportion) is unknown, contrary to what occurs for mock samples (noting that accidental contaminations may also occur in these cases). Thus, the inference on which software produced more accurate results couldn’t be performed. We further anticipate that some differences may be enhanced if the (some of the) contributors are closely related (as is often the case in sexual violence) and or related to the POI.
Software based on probabilistic genotyping methods is essential in the interpretation and quantification of complex DNA mixtures. Different software solutions encode distinct mathematical, statistical and informatics models which lead to the return of different values for the same input samples and approximated parameters. In what seems a paradox, this software dependence for the quantification of the evidence implies a need of improvement of the education and training of the experts, when compared to a previous situation where only a binary exclusion/inclusion interpretation (without statistical calculations) was developed.
Our aim with this short communication is to show how different LR computations can be obtained when considering real casework samples, regardless of whether the informatics tools consider the quantitative information of the electropherogram or not. Quantitative software used (EuroForMix and STRmix™) have built-in diagnostic tools that allow the expert to know if the models are good fits. Nevertheless, for some cases, despite statistic diagnostics of both tools supported a good fit, discrepant LR results were computed. So, since in the forensic routine the genetic evidence is not usually quantified resorting to more than one informatics tool, there are no confrontations between possibly different results, practitioners in the field assuming the (only) calculated value as steel plated. Thus, we intend to warn experts not to blindly trust the program they use and to be critical of the computed result, paving the way for the routine consideration of more than one tool in routine forensic casuistic. If for a specific case discrepant results are obtained, forensic specialists should investigate the samples at stake, as well as the mathematical modeling corresponding, to further act in conformity and reach a solid, informed, decision.
Indeed, forensic experts play a crucial role in the profile interpretation, from the electropherogram analysis (decisions about the presence of non-allelic peaks and estimation of the number of contributors, for example) to the proper use of the software for LR computations. Furthermore, and considering the different approaches, the expert knowledge should be extended to the understanding of how each software operates and its respective encoded models. Only this way he/she will be able to sustain the conclusions in court or any other scrutiny scenario.
Declaration of Competing Interest
The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.
Acknowlegments
This work was partially financed by FEDER- Fundo Europeu de Desenvolvimento Regional funds through the COMPETE 2020 - Operacional Program for Competitiveness and Internationalization (POCI), Portugal 2020, and by Portuguese funds through FCT- Fundação para a Ciência e a Tecnologia/Ministério da Ciência, Tecnologia e Inovação in the framework of the projects “Institute for Research and Innovation in Health Sciences” (POCI-01–0145-FEDER-007274). NP is supported by FCT, under the program contract provided in Decree-Law no.57/2016 of August 29. CC is funded by a FCT doctoral grant 2021.05655. BD.
Identification of anti-HIV active dicaffeoylquinic-and tricaffeoylquinic acids in Helichrysum populifolium by NMR-based metabolomic guided fractionation.
DNA mixtures interpretation – a proof-of-concept multi-software comparison highlighting different probabilistic methods’ performances on challenging samples.
R Core Team, R: A language and environment for statistical computing, R Foundation for Statistical Computing, Vienna, Austria (2020) URL https://www.R-project.org/.