Stratifying patients with peripheral neuropathic pain based on sensory profiles: algorithm and sample size recommendations

Supplemental Digital Content is Available in the Text. Phenotype stratification of patients with peripheral neuropathic pain can be conducted with a novel algorithm based on sensory profiles.


Introduction
Neuropathic pain is defined as pain as a result of a lesion or disease of the somatosensory nervous system 18,37 and may involve diverse etiologies including diabetes, HIV, chemotherapy, herpes zoster, or nerve injury. Historically, neuropathic pain is classified based on these etiologies, although similar symptoms and signs are frequent across these etiologies. Furthermore, it has become evident in the last decades that an etiology-based classification approach of patients is not sufficient, as first-line treatment is often inefficient in more than half of the patients. 17 At the same time, a number of promising new drugs have failed late trial stages. 17,23 Better patient stratification might improve clinical trial outcome.
Although all neuropathic pain states result from a lesion or disease of the somatosensory nervous system, the pathogenesis and subsequently pathophysiological mechanisms in damaged and surviving afferent nerve fibers such as conduction block, ectopic impulse generation, peripheral and central sensitization may differ between patients. Although these mechanisms often cannot be tested in patients directly, the individual patient's sensory profile, including sensory signs such as hyperalgesia, allodynia, or hypoesthesia may be linked to mechanisms. 4 A comprehensive way of assessing the sensory profile of a patient is quantitative sensory testing (QST) in accordance with the protocol of the German Research Network on Neuropathic Pain (DFNS). 3,33 In a recent study, 5 we have shown that QST profiles of patients with peripheral neuropathic pain reveal 3 distinct phenotypes, that are (mainly) characterized by (1) thermal and mechanical sensory loss (SL) (referred to as "sensory loss" subsequently), (2) preserved sensory function, associated with mild heat or cold hyperalgesia (labeled "thermal hyperalgesia" subsequently), and (3) loss of thermal sensation, combined with mechanical hyperalgesia (MH) or allodynia (referred to as "mechanical hyperalgesia" subsequently). These phenotypes can be found across etiologies, but vary in frequency between these, which has been validated in 2 independent patient cohorts. To use these phenotypes to stratify patients in clinical trials or to suggest an efficient treatment for a patient, a standardized individual allocation of patients must be available.
The aim of this article is to provide an algorithm based on the results of our previous work 5 and to estimate probabilities for individual patients to be allocated to each of the 3 named phenotypes. Based on the patients recruited and published by the consortia DFNS (German Research Network on Neuropathic Pain), 29 IMI (Innovative Medicines Initiative) Europain, 10 Neuropain, 5 and Pain in Neuropathy Study (PiNS), 36 for neuropathic pain due to diabetic polyneuropathy, peripheral nerve injury, and after herpes zoster, we provide frequencies of phenotypes and suggest that minimum sample sizes for phenotype-stratified trials.

Consortia
The international consortia DFNS (German Research Network on Neuropathic Pain 29 ), IMI (Innovative Medicines Initiative) Europain, Neuropain, 5,10 and Pain in Neuropathy Study (PiNS 36 ) participated in collecting and analyzing these data. All participating centers underwent strict quality control, 28,40 and a recent analysis of heterogeneity between centers has shown that the data can be analyzed as a homogenous data set. 39 Quantitative sensory testing according to the DFNS protocol  assesses 13 parameters: cold detection threshold and warm  detection threshold (WDT), thermal sensory limen, paradoxical  heat sensations (PHS), cold pain and heat pain thresholds,  mechanical pain threshold and mechanical pain sensitivity (MPS), dynamical mechanical allodynia (DMA), pressure pain threshold (PPT), wind-up ratio, tactile (mechanical) detection threshold, and vibration detection threshold. Thermal sensory and pain thresholds were measured using either a TSA 2001-II (MEDOC, Israel) or an MSA (SOMEDIC, Sweden) that increased or decreased temperature by 1˚C per second. 33,34 Mechanical detection threshold was defined as the geometric mean of 5 series of stimuli ascending and descending between 0.25 and 512 mN by a standardized set of von Frey hairs, mechanical pain threshold as the geometric mean of 5 series of stimuli ascending and descending by applying pinprick stimuli between 8 and 512 mN with a standardized pinprick set (MRC systems, Heidelburg, Germany). 34 Mechanical pain sensitivity and DMA were assessed by applying a total of 50 stimuli (35 pinprick and 15 light tactile in a balanced protocol) and asking patients to give a pain rating on a 0 (no pain) to 100 (most intense pain imaginable) Numerical Rating Scale (NRS) scale. Mechanical pain sensitivity was calculated as the geometric mean of the pain ratings of the pinprick stimuli and DMA as the geometric mean of the pain rating of the tactile stimuli. For the wind-up ratio, the perceived intensity of a single pinprick stimulus was compared with that of a series of 10 repetitive pinprick stimuli of the same physical intensity on a 0 to 100 NRS scale, as an average of 5 series. 34 Vibration detection threshold was assessed with a Rydel-Seiffer-graded tuning fork (64 Hz, 8/8 scale, mean of 3 testing series), and PPT was determined over muscle with a pressure gauge device (FDN200; Wagner Instruments, Greenwich, CT), exerting forces up to 2000 kPa, as a mean of 3 series of ascending stimulus intensities, each slowly increasing (50 kPa/s). 34

Z-transformation
The initial assessment of 180 healthy subjects from the DFNS reference database revealed that all parameters except PHS and DMA could be transformed (partly in log-space) to a standard normal distribution. 28,31,33 This process, called Z-transformation, normalizes all values to a mean 5 0 and an SD 5 1. Subsequently, all QST results of patients and healthy subjects were transformed in accordance with this normalization. Abnormal values are defined as values beyond the 95% confidence interval (CI). On a z-scale, this is represented by z-values ,21.96 or .1.96. Paradoxical heat sensation and DMA usually do not appear in healthy subjects and therefore cannot be transferred to the same scale. Thus, PHS was transformed to a binary 0/2-variable showing absence (coded as 0) or presence (coded as 12) of pathological values. Dynamical mechanical allodynia was transformed to a 0/2/3-variable representing no DMA (coded as 0), DMA with average pain ratings below 1 (coded as 12), and DMA with average pain ratings between 1 and 100 (coded as 13). The z-transformation normalizes for age decade, sex, and tested body region, thus making pain and detection thresholds comparable between patients with different age and sex and independently of the affected area, ie, nerves affected eg, at the face or the feet.

Sorting algorithm and validation
The most common way to establish a sorting algorithm would be to calculate the distance of the QST profile of an individual patient August 2017 · Volume 158 · Number 8 www.painjournalonline.com to each cluster centroid. This would, however, exclude all patients with one or more missing QST values (unless these would be filled in with imputed values), as the distance to a missing value cannot be calculated. As single missing values are not uncommon in patients with neuropathic pain and imputations could bias the results, we decided to use an approach with more intuitive handling of missing values. Because QST z-values are approximately normally distributed, our approach was based on normally distributed probabilities. For each QST z-value of each parameter i and patient n, a probability can be calculated for a phenotype to be present based on the density function of the said phenotype: with i 5 one of 13 QST parameters, n 5 the nth patient in a set of patients, m 5 one of 3 phenotypes and conclusively, s im being the SD of the ith QST parameter for the mth phenotype in the defining data set, 5 m im being the mean z-value of the same QST parameter and phenotype in the defining data set, 5 and finally x in being the z-value found in the nth patient for the ith QST parameter.
Although this function will always reach its maximum at x in 5 m im , in relation to broadness of the SD, density functions can become broader or narrower. This affects the maximum value the density function can reach. To control for these more or less broad functions, we normalized the formula so that a value that is equal to the mean of the phenotype equals 100%, leading to which can be simplified to The resulting probability value ranges from 0% to 100% and can be calculated for all i 5 13 QST parameters and m 5 3 phenotypes. By averaging the probability over the 13 QST parameters, we quantify the similarity of the individual patient's QST profile to the mean profile of each of the 3 phenotypes.
As a simple way of categorizing patients into phenotypes, we suggest to sort each patient to the phenotype with the highest probability value: (1) Calculate F(3) for each of the 13 QST parameters. Use m and s from Table 1 for SL. (2) Average the 13 probabilities (leave out missing value from average). The resulting value is the probability for this patient to show the SL phenotype. (3) Repeat steps 1 and 2, using m and s from Table 1 for thermal hyperalgesia (TH) and MH. (4) Allocate the patient to the phenotype with the highest probability value.
The algorithm as described above was applied to the n 5 902 patients from the original cohort 5 to demonstrate its general sorting capacity to reproduce the original cluster allocations.

Simplified phenotyping
As the DFNS protocol is comprehensive, it might be too complex to be applied in all clinical settings and in large clinical trials. In our previous analysis, we showed that 2 parameters (WDT and MPS) explain large parts of the variance between the phenotypes. Therefore, we also calculated the accuracy of a phenotyping based on WDT and MPS in comparison to a phenotyping using the full protocol.

Discrimination analysis against healthy subjects
To show if and how the algorithm can discriminate patients with neuropathic pain from healthy subjects, we introduced a forth probability-not for a phenotype, but for being healthy. For this purpose, we applied the definition of QST z-values, to which a group of healthy subjects ideally has a z-value mean 5 0 (m) with a SD 5 1 (s) for each QST parameter. The original cluster patient cohort 5 (n 5 902) and n 5 188 healthy subjects 21 from the European cohort 39 underwent a modified version of the algorithm: (1) Calculate F(3) for each of the 13 QST parameters. Use m and s from Table 1 for healthy subjects. (2) Average the 13 probabilities. The resulting value is the probability for this patient to show a healthy profile. (3) Repeat steps 1 and 2, using m and s from Table 1 for SL, TH, and MH.
As this version of the algorithm does not sort each subject simply to the phenotype with the highest probability, this leaves every subject with 4 probabilities, one for each of the 3 phenotypes, and one for being healthy.
The probability of being healthy was used for a receiver operating characteristics (ROC) plot. 46 This graphical tool for assessing discriminatory power plots the false-positive rate (1 2 specificity) on the x-axis vs the sensitivity of detecting patients on the y-axis for all possible probability values of being healthy. Each step in the ROC plot represents the specificity and sensitivity of one certain percentage. To assess the overall quality of separating healthy subjects and patients via the probability for being healthy, the area under curve and its 95% CI were calculated. 8 To define a minimum probability, at which a subject should be considered being healthy, the probability with the highest Youden Index (sensitivity minus false-positive rate 45 ) was chosen.

Deterministic and probabilistic algorithm
To this point, we use a deterministic approach, ie, each patient is allocated to exactly one phenotype. It is, however, our belief that a patient may be allocated to more than 1 phenotype, so with the And a probabilistic version, where steps 1 to 3 remain identical and step 4 is exchanged with (4) Sort the patient to all phenotypes with a probability above the value with the highest Youden Index found in the discrimination between patients and healthy subjects. If the only probability over this cut-off is for being healthy or no phenotype reaches a probability above this cut-off, the patient should be excluded. These 2 versions were used for all analyses below and are presented alongside. The simplified version of the algorithm is the same, except in step 1, only WDT and MPS are used instead of all 13 QST parameters, as these parameters have shown to explain the largest part of variability between the 3 phenotypes in our previous analysis. 5

Frequency of phenotypes across clinical entities
If a new drug would be tested for efficacy in a phenotype-stratified subgroup with neuropathic pain of any single etiology, this would only be meaningful if the said phenotype appears in a relevant frequency within this etiology. To show how frequent these phenotypes are across 3 common etiologies of neuropathic pain, we applied the algorithm to patients suffering from neuropathic pain due to diabetic polyneuropathy, peripheral nerve injury, or postherpetic neuralgia from the databases of our previous studies. 5,9,10,29,36

Sample size recommendations
On the basis of the frequencies found in the clinical entities, we calculated the size of a group of patients who need to be screened with either full or simplified phenotyping to find a subpopulation large enough to perform a trial that still reaches a power of 80% for an effect size of 0.3, 0.5, and 0.7 at an alpha level of 0.05, for a crossover and parallel design. The sample sizes presented in this article are examples, and we encourage all Table 1 Mean QST z-values (m) and SDs (s, in brackets) for each of the 13 QST parameters separately for each of the 3 phenotypes.    readers to tailor them to their individual needs. We recommend the usage of free software G*Power, 15 but many other statistical packages provide similar tools. The following information is required before starting: alpha level (usually 0.05), power (usually 0.8, 0.9, or 0.95), test family (usually t test for independent (parallel design) or dependent (crossover design) mean, or chisquared for dichotomous outcome), and the estimated effect size in the phenotype of interest. Effect sizes are related to a mean treatment effect and SD between treatment response, eg, a mean effect of 2 on a 0 to 10 NRS scale with an SD of 4 corresponds to an effect size of 0.5, a mean effect of 3.5 with an SD of 5 corresponds to an effect size of 0.7, and a mean effect of 1 with an SD of 3 corresponds to an effect size of 0.3, and many other combinations are possible. With this information, the size of the subgroup of patients with the phenotype of interest that needs to be included can be calculated. To determine the size of the overall population which needs to be screened to find a subgroup of the calculated size, divide the subgroup size by the frequency of the phenotype in the etiology of interest as presented in the results section, in regard to the algorithm used (deterministic or probabilistic) and the phenotyping protocol (full or simplified).

Sorting algorithm
Individual allocation replicates the original cluster analysis 5 in 81% of the cases for the complete QST protocol using 13 parameters and in 76% of the cases using simplified phenotyping. Cohen kappa coefficient of agreement (scale: 0 5 random classification, 1 5 perfect agreement between methods) was 0.72 (95% CI: 0.57-0.87) for the complete protocol and 0.63 (95% CI: 0.48-0.78) for simplified phenotyping, both values may be categorized as "good," although no universal guideline for interpreting Cohen kappa exists. 19 Most common shifts were former SL or TH to MH (14 and 17%, respectively), and least common shift was former SL to TH (,1%). Patient shift between phenotypes is shown in Table 2.

Discrimination analysis against healthy subjects
The ROC-area under curve value (scale: 0.5-1, 0.5: no discriminatory power, 1: perfect discrimination) for separating   patients with neuropathic pain and healthy subjects using the probability for being healthy was found to be 0.915 (95% CI: 0.898-0.932), indicating high discriminatory power (Fig. 1). For simplified phenotyping, discriminatory power was significantly lower (0.785, 0.753-0.815). The Youden Index was found to be the highest at a probability of 64%-ie, each subject with a probability value below 64% should be considered as a patient, and when above 64% as being healthy. Although this classification is valid in 94% of healthy subjects, sensitivity in detecting patients is 78% (ie, 22% of patients with neuropathic pain have a sensory profile with a probability for being healthy above 64%). Individual probabilities for each phenotype for patients and healthy subjects are plotted in Figure 2. For simplified phenotyping, the highest Youden Index was found at a very similar value of 63% with similar sensitivity (74%) but very reduced specificity (72%). Because of the high similarity of cut-offs, 64% was used for both full protocol and simplified phenotyping.

Frequency of phenotypes across clinical entities
From the databases of the DFNS, IMI Europain, Neuropain, and the Pain in Neuropathy Study (PiNS), a total of 151 patients with painful diabetic polyneuropathy, 335 patients with painful peripheral nerve injury, and 97 patients with postherpetic neuralgia who had been part of previous analyses 5,9,10,29,36 underwent both deterministic and probabilistic phenotyping (see Table 3 for patients' basic characteristics). Frequencies of phenotypes are presented in Table 4, and frequency and overlap between phenotypes for the full protocol and for each clinical entity are displayed in Venn and bar diagrams in Figure 3.

Overlap in the probabilistic algorithm
Of the diabetic polyneuropathy cohort, 4 patients (3%) were neither sorted to any phenotype nor healthy and had to be excluded. Twenty-seven patients (18%) were consistent with all 3 phenotypes and 86 (57%) with 2 phenotypes. In peripheral nerve injury, 70 (21%) patients were not assigned to any phenotype and 2 (,1%) only to the healthy profile; these patients were all excluded. Sixty-three (19%) patients were assigned to the healthy profile and at least 1 additional phenotype. Seventeen patients (5%) were allocated to all 3 phenotypes, 120 (36%) to 2 phenotypes. In postherpetic neuralgia, 29 (30%) patients were not possible to be assigned to any phenotype; these patients were excluded. Six patients (6%) were consistent with all 3 phenotypes, 27 (28%) with 2 phenotypes.

Accuracy of simplified phenotyping
Overall, 57% of patients with diabetic polyneuropathy, 62% of patients with peripheral nerve injury, and 58% of patients with postherpetic neuralgia were sorted into the same phenotype allocated when the full protocol was applied. The sensitivity of the simplified algorithm, however, is dependent on a combination of phenotype of interest and the clinical entity under study: In diabetic polyneuropathy, 74% of SL patients were correctly identified, but only 48% of patients with TH and 43% of patients with MH. In patients with peripheral nerve injury, allocation accuracy was more balanced between phenotypes (75% for SL, 60% for TH, and 64% for MH). In patients with postherpetic neuralgia, sensitivity was very low for SL (24%), and better for thermal (76%) and MH (56%). Consequently, low sensitivity of the simplified algorithm is linked to low frequency of certain  (11) Sex and disease duration presented as n (%). All patients have been part of previous publications. Table 4 Frequency of each phenotype in diabetic polyneuropathy, peripheral nerve injury, and postherpetic neuralgia, separately for the deterministic and probabilistic algorithm, and for full and simplified phenotyping.

Discussion
In a commentary to our findings of 3 distinct sensory phenotypes in patients suffering from neuropathic pain, we were asked to develop an algorithm to allocate individual patients to said phenotypes. 11 This work can be considered as a response to this request. We have developed an algorithm that enables individual allocation of patients to one or more sensory phenotypes, and, further, can separate patients from healthy subjects with a very high specificity of 94% and a sensitivity of 78%. All phenotypes are present in diabetic polyneuropathy, peripheral nerve injury, and postherpetic neuralgia in reasonable frequencies, resulting in acceptable estimated sample sizes for phenotype-stratified trials, especially if crossover-designed studies are planned. The algorithm will be implemented in the next update of QST managing software eQUISTA (distributed by StatConsult, Magdeburg, Germany), but the algorithm itself is free to use.

Phenotype stratification by sensory profiles
A mechanistic classification of neuropathic pain has been under debate for over 25 years. 4,6,12,16,41 Although a series of studies showed that a post hoc responder analysis can reveal phenotypes that are important to predict treatment response, 1,2,24,30,32,35,42,43 the first phenotype-stratified, randomized, placebo-controlled trials have been published only recently. 9,10 In these studies, oxcarbazepine showed a superior effect over placebo in a subgroup with "irritable nociceptors," a group with a sensory profile very similar to the TH phenotype in this study. By contrast, for topical lidocaine no group difference could be demonstrated.
The main problem with the definition of "irritable nociceptors" based on abnormal QST values (ie, only values outside the 95% CI for healthy subjects are considered) and loss and gain of functions patterns 29,33 is that it is based on a statistically sound, but conservative approach with (comparably) low sensitivity. For instance in diabetic polyneuropathy, the "irritable nociceptor" phenotype is virtually absent. 36 By contrast, the approach taken in this study does not rely on abnormal QST values outside the 95% CI, but focuses on similarity of the entire sensory profile to cluster centroids instead. An appealing advantage of this dynamic method can be seen in Figure 3A. The TH phenotype, which is similar to the "irritable nociceptor" and may have similar underlying mechanisms of pain generation, is found to be a prominent phenotype in a reasonable subgroup of roughly one-third of the patients with neuropathic pain due to diabetic polyneuropathy.

Effort of stratifying populations
To use stratification into subgroups in clinical trials, a large patient population must be screened beforehand with QST to yield a smaller final stratified study population. Thus, a solid sample size calculation of the number of patients necessary to screen is a prerequisite for a stratified study. In Table 5, we present sample size numbers for screening of populations for painful diabetic polyneuropathy, painful peripheral nerve injury and postherpetic neuralgia in relation to estimated effect size (0.3 vs 0.5 vs 0.7) and study design (parallel vs crossover). Crossover sample sizes are overall "realistic" numbers-across phenotypes and clinical etiology based-entities, and are recommended for phase 2 trials. If a parallel study design is intended for phase 3 trials, however, phenotype stratification may only be possible if a high effect size (eg, 0.7) is anticipated.

Similarity to experimentally studied mechanisms and stratification recommendations
As discussed en detail, 5 the phenotypes described in this study resemble sensory phenotypes that can be experimentally induced in healthy subjects. The SL phenotype is similar to previously described "deafferentation" or "painful hypoesthesia" subgroups 6,16,38 and shows elements that can be induced by a compression nerve block. 20, 44 Edwards et al. 13 have shown that patients with higher heat pain threshold-like these patients present-show an improved response to opioid treatment. So, a study of a centrally acting drug, eg, antidepressant or opioid, could be suggested to be stratified for this subgroup. 5,7 The sensory profile of the TH phenotype resembles UVB burn lesion 22 and the previously described "irritable nociceptor". 10,16 We suggest that sodium channel blockers would be most effective for patients with this phenotype, 5 supported by the findings of Demant et al. 10 The MH phenotype shows similarities to the profile induced by high-frequency electrical stimulation of the skin 26 and the previously described "neurogenic hyperalgesia" and "central sensitization". 6,16 A trial investigating a calcium channel a 2 d subunit inhibitor (coll.: gabapentinoid) or N-methyl-D-aspartate antagonist might target this phenotype, 5 as indicated by a post hoc analysis of patients suffering from HIV-related painful neuropathy treated with pregabalin. 35

Limitations
It should be emphasized that the comparison to experimentally induced mechanisms above is anecdotal rather than comprehensive. 25,27 Finding commonalities between these clinical QST profiles and the QST profiles induced by the various surrogate models for neuropathic pain will be an important upcoming task of its own.
The sample size calculations in Table 5 show both advantage and disadvantage of a QST-based phenotype stratification for clinical trials. A novel drug that is aiming at a phenotype that is only present in a fifth of the population will never show an effect superior to placebo in a nonstratified population. However, many patients must be screened to identify an eligible subpopulation, and screening with QST needs substantial training to be reliable and should be done by certified centers. Furthermore, some QST parameters are mechanistically linked and therefore probably intercorrelated (eg, CDT or WDT and thermal sensory limen). In the presented algorithm, these domains may be slightly overweighted. Although beyond the scope of this article, a factorial analysis of the QST protocol is one of the upcoming tasks to show the importance and meaning of each parameter in relation to the full protocol. In the long run, both for large trials and daily clinical practice, an approximation via a simple bed-side testing protocol would be highly valuable.
Although the presented algorithm offers a criterion for excluding healthy subjects, this should be considered within the clinical context. We decided on a rather conservative criterion with high specificity. It has to be noted that patients eligible for clinical trials are usually screened beforehand, have been shown to have a lesion or disease, and have spontaneous pain-a sensory profile that resembles healthy subjects does not necessarily exclude a patient from a trial. Confirming neuropathic pain relies on a history of a relevant neurological lesion or disease, anatomically plausible pain distribution and sensory signs, and finally on diagnostic tests confirming the lesion or disease. 18 Our algorithm assesses sensory signs, but only on an averaged level: eg, a strongly decreased vibration or thermal detection in an otherwise normal profile would be considered a negative sensory sign, but might still result in a high averaged probability of being healthy in this algorithm.
We present 2 methods of sorting patients to phenotypes, but we do not recommend one or the other in general, because we think both have advantages and disadvantages. The deterministic approach, sorting each patient to exactly one phenotype, ignores that multiple pathomechanisms may be present in a patient, and that these mechanisms may overlay each other and result in a sensory phenotype that cannot easily be allocated to one phenotype over the other. The probabilistic approach, however, holds its own challenges: Although the overlap between phenotypes is reasonable for peripheral nerve injury and postherpetic neuralgia, patients with diabetic polyneuropathy Table 5 Number of patients who need to be screened to find a subpopulation with a given phenotype large enough to conduct a study with a power of 80% with an alpha-level of 0.05 and a given effect size. First row: deterministic algorithm (each patient is sorted to exactly phenotype), second row: probabilistic algorithm (each patient may be allocated to no, one, or more than one phenotype). Values in brackets show the number needed to be screened with the simplified protocol. Numbers in bold indicate that 200 or less patients need to be screened. Effect sizes are related to mean treatment effect and SD between treatment response, eg, mean effect on Numerical Rating Scale 0 to 10 scale: 3.5, SD: 5 5 effect size 0.7; mean effect: 2, SD: 4 5 effect size 0.5; mean effect: 1, SD: 3 5 effect size 0.3.
August 2017 · Volume 158 · Number 8 www.painjournalonline.com tend to present more than 1 phenotype with substantial probability. This effect, probably caused by the overwhelming frequency of loss symptoms in these patients, may dilute especially the MH phenotype. When screening for these phenotypes in patients with diabetic polyneuropathy, this limitation should be considered by rather using the deterministic algorithm. For peripheral nerve injury and post herpetic neuralgia, a notable part of the patients (21% and 30%, respectively) is not sorted to any phenotype in the probabilistic algorithm and therefore excluded from the analysis. Although this is acceptable for phenotype-stratified trials, it becomes a problem if the algorithm would be used for designing individual patients' treatment strategy in the future. Again, the deterministic approach might be favorable in this case. Although this analysis focuses on trial design, it is our belief that this or a similar approach will become important in guiding individual patients' treatment in the future. This topic is heavily under debate (and there is dissent even within the author group of this article). Although we agree that at the moment we cannot present a solution to design individualized treatment based on sensory phenotypes, this analysis along with others may pave the way towards individualized pain treatment of patients with specific sensory phenotypes with future medicines. 5,7,10,30,41

Conclusions
In summary, we present an algorithm that can be used for stratification of patients suffering from peripheral neuropathic pain in clinical trials and may in the future indicate individual patients' optimal treatment strategies. Although all 3 phenotypes are present in diabetic polyneuropathy, peripheral nerve injury, and postherpetic neuralgia, frequencies differ, which should affect the number of patients screened for clinical trials. As a result of our previous analysis, 5 the European Medicines Agency's (EMA) committee for medicinal products for human use recommends the phenotype stratification presented here for determining eligible sensory phenotypes of patients in exploratory trials on neuropathic pain, as also incorporated in the new EMA guideline for clinical development of new treatments for pain. 14 We encourage validation of this concept by applying it in prospectively phenotype-stratified trials on peripheral neuropathic pain.