Skip to main content

Behavioral measures and self-report of impulsivity in bipolar disorder: no association between Stroop test and Barratt Impulsiveness Scale



Impulsivity as a tendency to act quickly without considering future consequences has been proposed as a dimensional factor in bipolar disorder. It can be measured using behavioral tasks and self-report questionnaires. Previous findings revealed patients to show worse performance on at least one behavioral measure of impulsivity. Additionally, self-reported impulsivity seems to be higher among bipolar patients, both parameters being possibly associated with a more severe course of illness. In this study, our primary aim was to investigate the relationship between these two constructs of impulsivity among bipolar patients.


A total of 40 euthymic patients with bipolar disorder (21 female, 22 Bipolar I) and 30 healthy controls were recruited for comprehensive neuropsychological assessment. To assess inhibition control as a behavioral measure of impulsivity, the Stroop Color and Word Test (Stroop) was used. Additionally, both groups completed the Barratt Impulsiveness Scale (BIS) as a self-report of impulsivity. To compare the groups’ performance on the Stroop and ratings on the BIS, the non-parametric Mann–Whitney U test was used. Within the bipolar group, we additionally examined the possibility of an association between Stroop performance and BIS total scores using Pearson’s Correlation r.


Patients and controls differed significantly on the Stroop and BIS, with patients performing worse on the Stroop and scoring higher on the BIS. However, there was no association between the Stroop and BIS within the bipolar group. As an exploratory analysis, a positive correlation between Stroop performance and number of episodes was found. Further, we detected a statistical trend in the direction of poorer Stroop performance among patients treated with polypharmacy.


Both difficulties with behavioral inhibition and self-reported impulsivity were observed to be higher in bipolar patients than controls in the current study. However, within the patient group we did not observe an association between patients’ behavioral performance and self-report. This indicates that the parameters likely constitute distinct, dimensional factors of bipolar disorder. In future research, studies with larger samples should investigate which of the two markers constitutes the better marker for the illness and is more suitable to differentiate the most severe patients.


The term “bipolar disorder” implicates a disorder with both manic/hypomanic and depressive episodes (APA 1994). Thus, patients seem to experience either one extreme or the other. However, this definition disregards a number of factors that are present throughout all phases of the illness, including euthymia (Levy and Manove 2012). For this reason, establishing a better understanding of such particular dimensional factors present in bipolar disorders is warranted (Henry and Etain 2010). This may be an interesting area of inquiry, as dimensional factors may represent indicators for specific treatment response and thus guide treatment. If subgroups of patients with specific dimensional characteristics were to be identified, it could help investigate possible pathophysiological mechanisms (Henry and Etain 2010).

For instance, impulsivity as one possible dimensional factor in bipolar disorder (Henry and Etain 2010) implicates the tendency to act quickly without considering future consequences (Hamilton et al. 2015). Impulsivity can be measured using behavioral tasks and by self-report questionnaires (Hamilton et al. 2015).

Inhibition control reflects a behavioral manifestation of impulsivity (Newman and Meyer 2014), and constitutes one of the core domains of executive function, which can be divided into response inhibition and interference control (Diamond 2013). Interference control can be measured using the Stroop Color and Word Test (Stroop) (Stroop 1992). Interference control constitutes a gating mechanism, which helps to ignore irrelevant information (Wilson and Kipp 1998) and enhances the ability to suppress stimuli that would ordinarily trigger a competing reaction. Additionally, it activates the ability to suppress distractors which would ordinarily delay the response (Nigg 2000). Dempster (1992, p. 47) emphasized the importance of “the ability to inhibit or deactivate stored information” as being “just as decisive as the quantity and quality of stored information and the availability of activation resources”.

Another aspect of inhibition involves the ability to control attention, behavior, thoughts, and emotions, as well as the ability to resist internal or external urges or temptations (Diamond 2013). This definition of inhibition is similar to the construct of self-reported impulsiveness applied by Patton et al. (1995) who developed the Barratt impulsivity scale (BIS). The BIS is a 30-item rating scale, where each item is related to one of three second-order facets of impulsivity: These include attentional impulsiveness referring to quick cognitive decision-making, motor impulsiveness which refers to acting without thinking, and non-planning impulsiveness which refers to a lack of future planning (Patton et al. 1995).

Both behavioral and self-reported impulsivity implicate important clinical consequences. Inhibition control—or more precisely, interference control, measured by Stroop—may represent a possible endophenotype of bipolar disorders, given that even non-afflicted first-degree relatives of individuals with bipolar disorder seem to show poorer Stroop performance (Arts et al. 2008). Furthermore, an association has been found between decreased interference control and period of time to recovery among first-episode patients (Gruber et al. 2008) as well as between decreased interference control and unemployment among bipolar patients (Ryan et al. 2013). High impulsivity scores measured by the BIS are associated with increases in overall functional impairment (Jimenez et al. 2012), a higher number of episodes at early onset and a higher number of past suicide attempts (Swann et al. 2009), as well as with increases in substance consumption, including alcohol (Nery et al. 2013) and nicotine (Heffner et al. 2012).

Previous findings revealed significant differences between bipolar patients and healthy controls in terms of both the BIS as self-reported impulsivity (Swann et al. 2001, 2003, 2004; Peluso et al. 2007; Kathleen Holmes et al. 2009; Strakowski et al. 2010; Ekinci et al. 2011; Lombardo et al. 2012; Henna et al. 2013; Etain et al. 2013) and the Stroop as behavioral impulsivity (Robinson et al. 2006; Torres et al. 2007; Arts et al. 2008; Kurtz and Gerraty 2009; Bora et al. 2009; Mann-Wrobel et al. 2011). Furthermore, what has been perplexing to date has been the huge variance in performance on the Stroop in a number of meta-analyses, with little explanation of why this may be the case (Robinson et al. 2006; Torres et al. 2007; Arts et al. 2008; Kurtz and Gerraty 2009; Bora et al. 2009; Mann-Wrobel et al. 2011; Hajek et al. 2013). A recent review summarized several studies investigating either behavioral or self-reported impulsivity, which revealed predominantly significant differences in self-reported, but not behavioral tests of impulsivity (Newman and Meyer 2014). However, few studies have examined the link between these two constructs, and it is notable that to date no study has investigated the relationship between the Stroop and the BIS among bipolar patients as its primary research question. If observed, a positive relationship could further support the clinical utility of the BIS as an easily administrated, economical screening tool when assessing bipolar patients. In addition to existing knowledge about the BIS and course of illness, suicidality and substance misuse, it may be possible to gain a more nuanced understanding of behavioral impulsivity’s relationship with these phenomenon. This would have positive implications for clinical practice, insofar as it would aid clinicians in making a brief yet detailed assessment of a patient’s presentation and clinical needs.

The current research aimed to investigate the relationship of self-reported impulsivity (measured by the BIS) and a behavioral measure of inhibition control (Stroop test) in bipolar patients. Initially, we sought to confirm previous findings that bipolar patients show a poorer performance on the Stroop and a higher BIS score compared to healthy controls. Then, as main research question, we sought to examine whether poorer performance on a behavioral test of impulsivity was related to higher self-reported impulsivity in a group of bipolar patients.

Additionally, we wanted to investigate a possible association between impulsivity measures and possible confounders, such as number of episodes, subthreshold depressive symptoms, medical treatment, and years of education in an exploratory analysis.



A total of 50 bipolar patients (29 female, 27 Bipolar I) and 43 healthy controls were seen for a comprehensive neuropsychological assessment. All patients were recruited from the psychiatric outpatient clinic at the Charité Mitte Campus University Hospital in Berlin based on the following inclusion criteria: diagnosis of bipolar disorder according to the DSM-IV; clinical remission meeting the criteria of euthymia [Hamilton Depression Rating Scale version 21 (HAMD-21) (Hamilton 1960) ≤9 and Young Mania Rating Scale (YMRS) (Young et al. 1978) ≤12] for at least 6 weeks; absence of affective symptoms; medication with a mood stabilizer for at least three months; minimum age of 18 years. A number of strictly euthymic patients were systematically looked for, to achieve a broad range of patients’ composition (i.e., those with a HAMD-21 ≤ 3, based on practice in a recent study regarding subthreshold symptoms in bipolar disorder) (Bonnin et al. 2012). Patients were excluded if they met the criteria of current psychotic symptoms, substance abuse during the last three months, dementia or mild cognitive impairment, or other predominant Axis I disorder within the past six months. Diagnoses using DSM-IV were undertaken by experienced and trained assessors with more than five years of experience with clinical diagnostics. YMRS and HAMD-21 were administered by well-trained assessors.

Healthy controls were recruited by web advertisement and word of mouth and were at least 18 years old. Criteria for exclusion were diagnosis of any current or past Axis1 disorder, assessed by the Mini-International Neuropsychiatric Interview (M.I.N.I.) (Sheehan et al. 1998), and first-degree relatives with an affective disorder or schizophrenia.

From the 50 patients originally recruited, ten were excluded for the following reasons: five emerged to be in a depressive mood state during testing, one emerged to have a mild cognitive impairment, three were not medicated with a mood stabilizer and one was not in a euthymic state for the required minimum six weeks. Of the 43 healthy controls, nine were excluded on the basis of a depressive episode (current or lifetime) or current substance abuse. To avoid the emergence of an age effect on the Stroop task (Comalli et al. 1962), we ensured that participants in both groups were of similar age by systematically removing the four youngest of the 34 healthy controls fulfilling the inclusion criteria.

Patients and controls within this study concurrently participated in two different studies using the same neuropsychological assessment: 34 patients participated in a pilot study investigating the feasibility of metacognitive training for low-functioning bipolar patients (Haffner et al. 2016), and 16 patients participated in a study on cognitive vulnerability in bipolar patients (Quinlivan et al. 2016).


A number of previous studies have used the Stroop as a measure of inhibitory control (Enticott et al. 2006; Kemps and Wilsdon 2010). In the current study, to measure a lack of inhibitory control as a behavioral manifestation of impulsivity [as has been previously indicated (Newman and Meyer 2014)] a German version of the Stroop interference (Bäumler and Stroop 1985) was applied. The outcome variable used was the time needed to complete the test. In the absence of a discrete measure of behavioral impulsivity, the use of a measure of inhibitory control was considered an appropriate alternative. The German version of the Stroop test shows an internal consistency of 0.97 and a retest reliability of 0.93. The facture structure as well as convergent and divergent validity have been confirmed (Bäumler and Stroop 1985). To assess self-reported impulsivity, the BIS-11 questionnaire was used. Regarding validity and reliability of the German version of the BIS-11 scale, the BIS total score showed adequate internal consistencies (Preuss et al. 2008) and findings of a study investigating adolescents ascertained convergent validity and suggested appropriate reliability (Hartmann et al. 2011).

As an interviewer-administered rating scale for the impairment of psychosocial functioning in bipolar disorder, the Functional Assessment Short Test (FAST) (Rosa et al. 2007) was administered with both patients and controls.

To assess general neurocognitive functioning, including executive functions, verbal memory, intelligence and attention, all participants completed a neuropsychological test battery. Executive functions were assessed by a German word fluency task (Regensburger Wortflüssigkeitstest) (Aschenbrenner et al. 2000) and digit span backwards subtest of the German version of the Wechsler memory scale (WMS) (Härting et al. 2000). Verbal memory was measured by the German verbal learning and memory test (Verbaler Lern- und Merkfähigkeitstest, VLMT) (Helmstaedter et al. 2001) and the digit span forward as a subtest of the WMS. The subtest LPS3 of a German intelligence test battery (Leistungsprüfsystem, LPS) (Horn 1983) was used to assess logical thinking as fluid intelligence whereas a multiple choice vocabulary test (Mehrfach Wortschatz Test, MWT-B) (Lehrl 2005) measured crystallized intelligence. Furthermore different aspects of attention and executive function were examined using subtests (alertness and divided attention) of a German computerized test battery (Testbatterie zur Aufmerksamkeitsprüfung, TAP) (Zimmerman and Fimm 2006). A well-trained assessor delivered the comprehensive neuropsychological battery and all participants were tested in similar circumstances concerning place, time, person and instructions.

Data analysis

As a number of our variables were not normally distributed, use of the non-parametric Mann–Whitney U test was indicated. The test was completed to compare patients and healthy controls on a range of variables, including demographics, clinical features and facets of the neuropsychological assessments. Fisher’s exact test was applied on normative variables (e.g., gender). For our confirmatory analyses, the Mann–Whitney U test was used to compare patients and healthy controls concerning time needed in the Stroop and scoring in the BIS (both in terms of total and subscale scores). Because of patients and controls differing on the BDI, a regression analysis was conducted to explore, whether the BDI predicts Stroop and, respectively, BIS. A second Mann–Whitney U test was then applied to compare the 15 strictly euthymic patients’ (with HAMD-21 ≤ 3) and controls’ Stroop performance and BIS scores. In terms of our primary question, time needed in the Stroop interference and BIS total scores was correlated according to Pearson within the patient group. To investigate possible confounders of the Stroop test probably being the more robust measure, exploratory analyses were conducted. This was completed by correlating time needed in the Stroop interference with six possible confounders available in our dataset, such as subthreshold depressive symptoms (as measured by the HAMD-21), subthreshold manic symptoms (as measured by the YMRS), years of education, duration of illness, number of hospitalizations and the FAST cognitive score. Because of the explorative character of these correlations, we did not perform a type II error correction. Data analyses were conducted using IBM SPSS Statistics Version 22.0.


Of the 40 patients fulfilling the inclusion criteria, 15 had a HAMD-21 ≤ 3 which indicated that they were strictly euthymic at the time of testing. An overview of the demographics and clinical characteristics is shown in Table 1.

Table 1 Demographic and clinical characteristics of bipolar patients (BD) and healthy controls (HC)

First, we observed that patients needed significantly more time to complete the Stroop interference task than healthy controls (z = −2.49, p = .01, r = .30, for all BIS and Stroop scores see Table 2). Patients also scored significantly higher on self-reported impulsivity, as measured by the BIS total score (z = −2.08, p = .04, r = .25). With regard to the three subscales on the BIS, patients scored significantly higher than controls in terms of attentional impulsiveness (z = −3.67, p ≤ .001, r = .44) and non-planning impulsiveness (z = −1.98, p < .05, r = .24). However, for the motor impulsiveness subscale no significant differences were observed, see Fig. 1 and Table 2.

Table 2 Scores of Stroop and BIS of all euthymic patients (HAMD-21 ≤ 9), the strictly euthymic subgroup of patients (HAMD-21 ≤ 3) and healthy controls
Fig. 1
figure 1

Behavioral and self-reported impulsivity of euthymic patients (n = 40) and healthy controls (n = 30). On the Stroop test, poorer performance is indicated by higher scores (i.e., longer response duration). On the BIS, higher scores also indicated a greater affliction (i.e., more impulsive self-reports). Note: when comparing the subgroup of strictly euthymic patients to healthy controls, only the Stroop and the BIS sub-score attentional stay significantly different

Because of the above-analyzed groups differing on the BDI (a self-rating scale for depressive symptoms, see Table 1), additional analyses were run: A regression analyses showed that concerning the whole sample of patients and controls, the BDI could not significantly explain any variance of the Stroop (R 2 = .00, p = .88) whereas it could explain 20.3 % of variance of the BIS (R 2 = .20, p < .001). Thus, respecting the BDI as a possible confounder in the comparison between patients and controls, in a second step an additional comparison between the 15 strictly euthymic patients (HAMD-21 ≤ 3) and the 30 healthy controls was conducted. Groups did not differ on the BDI, nor concerning age or, respectively, gender (all p values >.05). Now, patients and controls significantly differed only on the Stroop (z = −2.25, p = .02, r = .34), but not on the BIS total score (z = −1.68, p = .09). Regarding the BIS sub-scores, only the BIS attentional remained significantly different between patients and controls (z = −2.26, p = .02, r = .34), whereas there was no statistical difference concerning the sub-scores non-planning and motor (all p values >.05).

In regard to our primary research question, data showed no significant positive correlation between patients’ test performance on the Stroop and total BIS scores (n = 39 due to the exclusion of one outlier on the Stroop, r = −.09, p = .60). Similarly, we did not observe any significant correlations when we examined time needed on the Stroop with the respective BIS subscales (attentional, motor and non-planning impulsiveness). Therefore, in the current study’s sample of bipolar patients, self-reported impulsivity was not related to behavioral inhibition performance on the Stroop. The two constructs were positively correlated neither regarding healthy controls (n = 29 due to the exclusion of one outlier on the Stroop, r = .13, p = .49) nor in the whole sample (n = 69, due to the exclusion of one outlier on the Stroop, r = .06, p = .61).

The Stroop showed to possibly constitute a more exact measure which seems more independent of current symptoms than the BIS. Therefore, possible associations between the Stroop and six possible confounders were further explored. We observed a significant correlation between time needed on the Stroop and number of mood episodes (n = 39 due to one outlier; r = .34; p = .03). We also observed a trend in the direction of significance for time needed on the Stroop and number of different psychotropic medication groups (n = 39, r = .31; p = .06). However, there was no association between Stroop performance and subthreshold depression (as measured by the HAMD-21), subthreshold manic symptoms (as measured by the YMRS), years of education, duration of illness and number of hospitalizations (all p’s > .05). Regarding the FAST Cognitive score, there was a positive correlation with Stroop performance (r = .339; p = .04).

With respect to our descriptive analysis of neurocognitive function, patients performed similar to healthy controls across nearly all domains of the neuropsychological test battery. Thus, there were no significant differences in tests of memory, attention and intelligence. Regarding executive functions, patients showed significantly worse results in the two word fluency tests (i.e., word fluency naming animals: Mdn patients = 24.00, Mdn controls = 26.50, z = −2.21, p = .03, r = .26; and word fluency S-words: Mdn patients = 14.00, Mdn controls = 16.50, z = −2.36, p = .02, r = .28). As regards to number of errors in the Stroop interference task, no significant difference was observed between the groups; however, there was a trend in this direction (M patients = .97 Mdn patients = .00, M controls = .27 Mdn controls = .00, z = −1.91, p = .06, r = .23).


Aside from exploratory research, this study is the first to comprehensively investigate the relationship between behavioral and self-reported impulsivity, using the Stroop Test and the BIS in a sample of bipolar patients. Patients showed poorer Stroop performance and higher BIS scores than controls, yet our most striking finding was the absence of a positive correlation between Stroop performance and BIS reports within the bipolar group. Moreover, our study revealed promising exploratory findings regarding the relationship of inhibition control and number of episodes and medication.

A notable strength of the current study lies in the range of patients sampled, including a number of strictly euthymic patients and a subgroup of particularly low-functioning patients (see Table 1). Thus, we have accounted for and considered the diversity of bipolar patients and minimized possible important biases when comparing patients and controls on their test performance. For the healthy controls for instance, we ensured an absence of any first-degree relatives with an affective disorder or schizophrenia, given the potential for inhibition and impulsivity as possible endophenotypes. The detailed neurocognitive test battery facilitated us to achieve a broad profile of the sample, supporting the study’s strength.

Comparing Stroop performance of patients and healthy controls

We found a significant difference between patients’ and controls’ Stroop test performance with a medium effect size. This is in accordance with previous meta-analytical findings (Robinson et al. 2006; Torres et al. 2007; Arts et al. 2008; Kurtz and Gerraty 2009; Bora et al. 2009; Mann-Wrobel et al. 2011), with the exception of one (Hajek et al. 2013). It is notable that even when comparing the group of strictly euthymic patients to healthy controls, the difference remains statistically significant.

Comparing BIS scores of patients and healthy controls

Equally, in terms of self-reported impulsivity, we were able to confirm previous studies in which bipolar patients showed a higher BIS total score than healthy controls (Swann et al. 2001, 2003, 2004; Peluso et al. 2007; Kathleen Holmes et al. 2009; Strakowski et al. 2010; Ekinci et al. 2011; Lombardo et al. 2012; Henna et al. 2013). A study, investigating a total of 504 healthy controls, measured a BIS total score of M = 59.25 (SD = 9.31) (Aichert et al. 2012); a finding similar to that of our healthy controls. Thus, patients in our sample are more impulsive than population norms, pointing towards the consideration of impulsivity as a trait characteristic of bipolar disorder, independent of current illness phase.

It should be noted, however, that there is an association between the BIS and self-report of depressive symptoms, suggesting that the BIS as a self-report might not be suitable as a trait marker. When comparing the group of strictly euthymic patients to the healthy controls, the difference concerning the BIS total score was no more significant (although the strictly euthymic subgroup showing the same BIS total score as the whole bipolar sample). Only the BIS sub-score attentional stayed significantly different between patients and healthy controls. This might imply that the BIS attentional could be the more exact measure. This would be in accordance with a previous study on the early diagnosis of bipolar disorder, where the BIS sub-score attentional, but not the total score, showed to be a good marker predicting onset of (hypo)mania in subjects at risk (Ng et al. 2016).

Two studies reported an absence of differences (Christodoulou et al. 2006; Lewis et al. 2009) between patients and healthy controls, indicating that this research question warrants thorough inquiry.

The relationship between behavioral and self-reported impulsivity in bipolar disorder

In our findings, there was no relationship between Stroop interference and the BIS; neither in terms of the total or subscale scores. Based on how items are constructed in relation to concentration and distraction on the BIS attention subscale, it would in fact have been expected that this subscale would be the most likely to correlate with Stroop performance. Our findings are in accordance with exploratory results of one study (Powers et al. 2013), which similar to us observed a lack of correlation between Stroop performance (amongst seven other neurocognitive test parameters) and the BIS. Thus, the present study can confidently confirm this exploratory finding and support a recent review which proposed that self-report and behavioral measures of impulsivity might indeed reflect distinct theoretical constructs (Newman and Meyer 2014).

In terms of other populations—both general and clinical—poorer performance in the Stroop interference has been found to be associated with higher impulsivity. Correlations with the Stroop have been observed within a group of healthy subjects when tested using the BIS (Enticott et al. 2006), as well as with other clinical groups where problems with impulsivity are noteworthy; for example, for patients with borderline personality disorder (Bader and 2010) and bulimia nervosa (Kemps and Wilsdon 2010). On closer examination, however, in the borderline subgroup (Bader and 2010) there were multiple inventories of impulsivity correlated with several tests of impulsivity, and no Type II error correction for multiple tests was applied. Further, in the small study which sampled patients with bulimia (n = 13), after BIS was entered as a covariate, a significant difference between patients and controls on Stroop performance was no longer observed (Kemps and Wilsdon 2010). Due to the lack of robustness of these findings, it is only possible to conclude that a relationship between BIS and Stroop performance is feasible. In the study with healthy subjects (Enticott et al. 2006), a spatial Stroop was implemented as a reading-independent test, and participants were not older than 51 (unlike in our study, where participants ranged from 23 to 77 years of age). This study correlated four different behavioral paradigms of impulsivity with the BIS and its subscales, revealing that only the Stroop task correlated significantly. It is possible that a relationship between the Stroop and BIS may have been more easily detected in a younger sample, given that age may influence Stroop performance (Comalli et al. 1962). Other studies using a range of different populations did not observe a relationship between the Stroop task and BIS at all (Enticott et al. 2008; Aichert et al. 2012). Interestingly, one study investigated four measures of prepotent response inhibition, including the Stroop, and the BIS in a sample of 504 healthy individuals. While Stroop did not correlate with BIS, a latent variable analysis revealed all four measures of response inhibition to be underpinned by the same construct, where the BIS explained 12 % of the variance (Aichert et al. 2012). In light of these mostly exploratory findings, the current results seem to contribute to a controversial database, where overall there has been, at best, a small relationship between behavioral and self-reported impulsivity when using these particular measures.

Studies using tests other than the Stroop to investigate the relationship between self-reported impulsivity and inhibitory control as a measure of behavioral impulsivity within bipolar patients have partially found evidence for a positive correlation. However, these studies were mostly exploratory. For example, Cheema et al. (2015) found higher BIS scores to be associated with slower reaction times in an emotional Go/No-go test, interpreted by the authors as a possible compensatory cognitive strategy to manage increased impulsivity. However, this correlation was one of many tests run without the use of an error correction, again indicating the possibility for Type II error. Beyond that, the consideration of multiple other test findings are warranted. For instance, higher attentional BIS scores have been associated with a lower response inhibition in the Hayling Sentence Completion test (Christodoulou et al. 2006). BIS motor score has been correlated with more impulsive behavior in the Balloon Analogue Risk Task (Kathleen Holmes et al. 2009). However, BIS impulsivity has not been found to be related to decreased inhibition in the Stop Signal Task (Heffner et al. 2012). The discrepancies in findings of theoretically similar constructs render it difficult to make meaningful conclusions regarding the nature of impulsivity. It is notable, however, that these differences may be attributable to a lack of consistency in the methods and measures used; for example, the various procedures of the behavioral tests of impulsivity which were not always shown to inter-correlate (Enticott et al. 2006).

Influence of possible confounders on Stroop performance

The present study revealed an association between Stroop performance and total number of episodes; strengthening previous findings indicating that number of affective episodes is negatively associated with executive functions (El-Badri et al. 2001). Equally, we observed a trend in a positive correlation between Stroop performance and number of psychotropic medication groups. The influence of medication on cognitive performance has been reported controversially to date. Goswami et al. (2009), for example, did not find any influence of medical treatment on any type of cognitive performance, whereas Bora et al. (2009) reported an association between medication and the magnitude of impairment on psychomotor speed. Considering the Stroop test as a speed-dependent test could explain poorer performance among patients treated with a range of substances. In terms of depressive symptoms, subthreshold symptoms did not influence Stroop performance in our findings, which confirms previous results (Bora et al. 2009).

General neuropsychological test performance

Compared to healthy controls, in our study euthymic bipolar patients showed a similar test performance across all cognitive domains with the exception of executive function. This is in contrast with previous meta-analyses stating that even in euthymia, bipolar patients show cognitive impairment in nearly all domains (Robinson et al. 2006; Torres et al. 2007; Arts et al. 2008; Kurtz and Gerraty 2009; Mann-Wrobel et al. 2011; Porter et al. 2015). In one study investigating cognitive subgroups, 41.4 % of the bipolar patients did not show any cognitive deficits (Volkert et al. 2015). Again here our study seems to contribute to somewhat of a controversial empirical database. In terms of executive functions, we reported significant differences in word fluency. This is in accordance with several meta-analyses to date, which have all estimated executive functions to be particularly limited in this population (Robinson et al. 2006; Torres et al. 2007; Arts et al. 2008; Kurtz and Gerraty 2009; Bora et al. 2009).


The results of this study are limited by its rather small sample size, though this can be partly counterbalanced by the huge range in our sample composition. Another notable point to consider is that participants were recruited through a university hospital, indicating a possible selection bias. It is possible that patients attending a specialist bipolar clinic received a more frequent, expert careplan than other typical bipolar patients attending community-based services. Beyond that, nearly all of the patients are treated by multiple different medications, which could influence their Stroop performance. Therefore, the statistically significant difference between patients and healthy controls on the Stroop might partly be due to patients’ treatment with polypharmacy. Finally, it should be noted that the broad age range in our sample may have affected a potential correlation between Stroop and BIS (Comalli et al. 1962).


In our study, both behavioral and self-reported impulsivities were increased within our patient group as compared to controls; however, we did not find a correlation between these two constructs. Thus, our study highlights the importance of considering these aspects of impulsivity as two independent dimensional factors in bipolar disorder, which probably both influence the course of illness and functional outcome in respective ways. Our findings suggest the possible usefulness of specific cognitive trainings for bipolar patients, with a focus on executive functions. Additionally, our findings indicate that it is particularly important to identify and prescribe a pharmacotherapy that does not aggravate cognitive functioning in cases where performance is already compromised, or in cases of an advanced course of illness, to ensure lack of disruption in patients’ quality of life.

In future research, we recommend that studies with a longitudinal design investigate Stroop and BIS on a large sample of bipolar patients. Thus, one could investigate which of the two markers constitutes a better marker for the illness and may, therefore, be more suitable for differentiating the most severe patients (e.g., those with substance misuse, more suicide attempts and a more severe course of illness). Beyond that, a study examining subjects at risk for bipolar disorder who are not medicated yet could further investigate the relevance of interference control as a marker for bipolar disorder.



Beck Depression Inventory


Barratt Impulsiveness Scale


Functional Assessment Short Test


Hamilton Depression Rating Scale (version 21)


Mini-International Neuropsychiatric Interview


Stroop Color and Word Test


Young Mania Rating Scale


  • Aichert DS, Wostmann NM, Costa A, Macare C, Wenig JR, Moller HJ, et al. Associations between trait impulsivity and prepotent response inhibition. J Clin Exp Neuropsychol. 2012;34(10):1016–32.

    Article  PubMed  Google Scholar 

  • APA. Diagnostic and statistical manual of mental disorders. Washington DC: American Psychiatric Association; 1994.

    Google Scholar 

  • Arts B, Jabben N, Krabbendam L, van Os J. Meta-analyses of cognitive functioning in euthymic bipolar patients and their first-degree relatives. Psychol Med. 2008;38(6):771–85.

    Article  CAS  PubMed  Google Scholar 

  • Aschenbrenner S, Tucha O, Lange KW. Regensburger Wortflüssigkeits-Test: RWT. Hogrefe: Verlag für Psychologie; 2000.

    Google Scholar 

  • Bader K. Emotionale Modulation von Impulsivität bei Patientinnen mit Borderline Persönlichkeitsstörung. 2010. Accessed Jan 2016.

  • Bäumler G, Stroop JR. Farbe-Wort-Interferenztest nach JR Stroop (FWIT). Hogrefe, Verlag für Psychologie. 1985.

  • Beck AT, Ward CH, Mendelson M, Mock J, Erbaugh J. An inventory for measuring depression. Arch Gen Psychiatry. 1961;4:561–71.

    Article  CAS  PubMed  Google Scholar 

  • Bonnin CM, Sanchez-Moreno J, Martinez-Aran A, Sole B, Reinares M, Rosa AR, et al. Subthreshold symptoms in bipolar disorder: impact on neurocognition, quality of life and disability. J Affect Disord. 2012;136(3):650–9.

    Article  CAS  PubMed  Google Scholar 

  • Bora E, Yucel M, Pantelis C. Cognitive endophenotypes of bipolar disorder: a meta-analysis of neuropsychological deficits in euthymic patients and their first-degree relatives. J Affect Disord. 2009;113(1–2):1–20.

    Article  PubMed  Google Scholar 

  • Cheema MK, MacQueen GM, Hassel S. Assessing personal financial management in patients with bipolar disorder and its relation to impulsivity and response inhibition. Cogn Neuropsychiatry. 2015;20(5):424–37.

    Article  PubMed  Google Scholar 

  • Christodoulou T, Lewis M, Ploubidis GB, Frangou S. The relationship of impulsivity to response inhibition and decision-making in remitted patients with bipolar disorder. Eur Psychiatry. 2006;21(4):270–3.

    Article  CAS  PubMed  Google Scholar 

  • Comalli PE Jr, Wapner S, Werner H. Interference effects of Stroop color-word test in childhood, adulthood, and aging. J Genet Psychol. 1962;100:47–53.

    Article  PubMed  Google Scholar 

  • Dempster FN. The rise and fall of the inhibitory mechanism: toward a unified theory of cognitive development and aging. Dev Rev. 1992;12(1):45–75.

    Article  Google Scholar 

  • Diamond A. Executive functions. Annu Rev Psychol. 2013;64:135–68.

    Article  PubMed  Google Scholar 

  • Ekinci O, Albayrak Y, Ekinci AE, Caykoylu A. Relationship of trait impulsivity with clinical presentation in euthymic bipolar disorder patients. Psychiatry Res. 2011;190(2–3):259–64.

    Article  PubMed  Google Scholar 

  • El-Badri SM, Ashton CH, Moore PB, Marsh VR, Ferrier IN. Electrophysiological and cognitive function in young euthymic patients with bipolar affective disorder. Bipolar Disord. 2001;3(2):79–87.

    Article  CAS  PubMed  Google Scholar 

  • Enticott PG, Ogloff JRP, Bradshaw JL. Associations between laboratory measures of executive inhibitory control and self-reported impulsivity. Personal Individ Differ. 2006;41(2):285–94.

    Article  Google Scholar 

  • Enticott PG, Ogloff JR, Bradshaw JL, Fitzgerald PB. Cognitive inhibitory control and self-reported impulsivity among violent offenders with schizophrenia. J Clin Exp Neuropsychol. 2008;30(2):157–62.

    Article  PubMed  Google Scholar 

  • Etain B, Mathieu F, Liquet S, Raust A, Cochet B, Richard JR, et al. Clinical features associated with trait-impulsiveness in euthymic bipolar disorder patients. J Affect Disord. 2013;144(3):240–7.

    Article  CAS  PubMed  Google Scholar 

  • Goswami U, Sharma A, Varma A, Gulrajani C, Ferrier IN, Young AH, et al. The neurocognitive performance of drug-free and medicated euthymic bipolar patients do not differ. Acta Psychiatr Scand. 2009;120(6):456–63.

    Article  CAS  PubMed  Google Scholar 

  • Gruber SA, Rosso IM, Yurgelun-Todd D. Neuropsychological performance predicts clinical recovery in bipolar patients. J Affect Disord. 2008;105(1–3):253–60.

    Article  PubMed  Google Scholar 

  • Haffner P, Quinlivan E, Fiebig J, Sondergeld L, Strasser ES, Adli M, et al. Improving functional outcome in bipolar disorder: a pilot-study on metacognitive training. 2016. (in progress).

  • Hajek T, Alda M, Hajek E, Ivanoff J. Functional neuroanatomy of response inhibition in bipolar disorders–combined voxel based and cognitive performance meta-analysis. J Psychiatr Res. 2013;47(12):1955–66.

    Article  PubMed  Google Scholar 

  • Hamilton M. A rating scale for depression. J Neurol Neurosurg Psychiatry. 1960;23:56–62.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  • Hamilton KR, Littlefield AK, Anastasio NC, Cunningham KA, Fink LH, Wing VC, et al. Rapid-response impulsivity: definitions, measurement issues, and clinical implications. Personal Disord. 2015;6(2):168–81.

    Article  PubMed  PubMed Central  Google Scholar 

  • Härting C, Markowitsch HJ, Neufeld H, Calabrese P, Deisinger K, Kessler J. Wechsler Gedächtnis Test-Revidierte Fassung (WMS-R). Bern: Huber; 2000.

    Google Scholar 

  • Hartmann AS, Rief W, Hilbert A. Psychometric properties of the German version of the Barratt Impulsiveness Scale, version 11 (BIS-11) for adolescents. Percept Mot Skills. 2011;112(2):353–68.

    Article  PubMed  Google Scholar 

  • Heffner JL, Fleck DE, DelBello MP, Adler CM, Strakowski SM. Cigarette smoking and impulsivity in bipolar disorder. Bipolar Disord. 2012;14(7):735–42.

    Article  PubMed Central  Google Scholar 

  • Helmstaedter C, Lendt M, Lux S. Verbaler Lern- und Merkfähigkeitstest (VLMT). [Verbal learn and memory test (VLMT)]. Göttingen: Hogrefe; 2001.

    Google Scholar 

  • Henna E, Hatch JP, Nicoletti M, Swann AC, Zunta-Soares G, Soares JC. Is impulsivity a common trait in bipolar and unipolar disorders? Bipolar Disord. 2013;15(2):223–7.

    Article  PubMed  PubMed Central  Google Scholar 

  • Henry C, Etain B. New ways to classify bipolar disorders: going from categorical groups to symptom clusters or dimensions. Curr Psychiatry Rep. 2010;12(6):505–11.

    Article  PubMed  PubMed Central  Google Scholar 

  • Horn W. Leistungsprüfsystem. [Performance exerciser]. Göttingen: Hogrefe; 1983.

    Google Scholar 

  • Jimenez E, Arias B, Castellvi P, Goikolea JM, Rosa AR, Fananas L, et al. Impulsivity and functional impairment in bipolar disorder. J Affect Disord. 2012;136(3):491–7.

    Article  CAS  PubMed  Google Scholar 

  • Kathleen Holmes M, Bearden CE, Barguil M, Fonseca M, Serap Monkul E, Nery FG, et al. Conceptualizing impulsivity and risk taking in bipolar disorder: importance of history of alcohol abuse. Bipolar Disord. 2009;11(1):33–40.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  • Kemps E, Wilsdon A. Preliminary evidence for a role for impulsivity in cognitive disinhibition in bulimia nervosa. J Clin Exp Neuropsychol. 2010;32(5):515–21.

    Article  PubMed  Google Scholar 

  • Kurtz MM, Gerraty RT. A meta-analytic investigation of neurocognitive deficits in bipolar illness: profile and effects of clinical state. Neuropsychology. 2009;23(5):551–62.

    Article  PubMed  PubMed Central  Google Scholar 

  • Lehrl S. Mehrfachwahl-Wortschatz-Intelligenztest (MWT-B). [Multiple vocabulary intelligence test (MWT-B)]. Balingen: Spitta Verlag; 2005.

    Google Scholar 

  • Levy B, Manove E. Functional outcome in bipolar disorder: the big picture. Depress Res Treat. 2012;2012:949248.

    PubMed  Google Scholar 

  • Lewis M, Scott J, Frangou S. Impulsivity, personality and bipolar disorder. Eur Psychiatry. 2009;24(7):464–9.

    Article  CAS  PubMed  Google Scholar 

  • Lombardo LE, Bearden CE, Barrett J, Brumbaugh MS, Pittman B, Frangou S, et al. Trait impulsivity as an endophenotype for bipolar I disorder. Bipolar Disord. 2012;14(5):565–70.

    Article  PubMed  PubMed Central  Google Scholar 

  • Mann-Wrobel MC, Carreno JT, Dickinson D. Meta-analysis of neuropsychological functioning in euthymic bipolar disorder: an update and investigation of moderator variables. Bipolar Disord. 2011;13(4):334–42.

    Article  PubMed  Google Scholar 

  • Nery FG, Hatch JP, Monkul ES, Matsuo K, Zunta-Soares GB, Bowden CL, et al. Trait impulsivity is increased in bipolar disorder patients with comorbid alcohol use disorders. Psychopathology. 2013;46(3):145–52.

    Article  PubMed  Google Scholar 

  • Newman AL, Meyer TD. Impulsivity: present during euthymia in bipolar disorder?—a systematic review. Int J Bipolar Disord. 2014;2:2.

    Article  PubMed  PubMed Central  Google Scholar 

  • Ng TH, Stange JP, Black CL, Titone MK, Weiss RB, Abramson LY, et al. Impulsivity predicts the onset of DSM-IV-TR or RDC hypomanic and manic episodes in adolescents and young adults with high or moderate reward sensitivity. J Affect Disord. 2016;198:88–95.

    Article  PubMed  Google Scholar 

  • Nigg JT. On inhibition/disinhibition in developmental psychopathology: views from cognitive and personality psychology and a working inhibition taxonomy. Psychol Bull. 2000;126(2):220–46.

    Article  CAS  PubMed  Google Scholar 

  • Patton JH, Stanford MS, Barratt ES. Factor structure of the Barratt impulsiveness scale. J Clin Psychol. 1995;51(6):768–74.

    Article  CAS  PubMed  Google Scholar 

  • Peluso MA, Hatch JP, Glahn DC, Monkul ES, Sanches M, Najt P, et al. Trait impulsivity in patients with mood disorders. J Affect Disord. 2007;100(1–3):227–31.

    Article  CAS  PubMed  Google Scholar 

  • Porter RJ, Robinson LJ, Malhi GS, Gallagher P. The neurocognitive profile of mood disorders—a review of the evidence and methodological issues. Bipolar Disord. 2015;17(Suppl 2):21–40.

    Article  PubMed  Google Scholar 

  • Powers RL, Russo M, Mahon K, Brand J, Braga RJ, Malhotra AK, et al. Impulsivity in bipolar disorder: relationships with neurocognitive dysfunction and substance use history. Bipolar Disord. 2013;15(8):876–84.

    Article  PubMed  Google Scholar 

  • Preuss UW, Rujescu D, Giegling I, Watzke S, Koller G, Zetzsche T, et al. Psychometric evaluation of the German version of the Barratt Impulsiveness Scale. Der Nervenarzt. 2008;79(3):305–19.

    Article  CAS  PubMed  Google Scholar 

  • Quinlivan E, Dallacker M, Renneberg B, Strasser ES, Fiebig J, Stamm T. Overgeneral autobiographical memory in bipolar disorder: the role of neuropsychological functions. 2016. (in progress).

  • Robinson LJ, Thompson JM, Gallagher P, Goswami U, Young AH, Ferrier IN, et al. A meta-analysis of cognitive deficits in euthymic patients with bipolar disorder. J Affect Disord. 2006;93(1–3):105–15.

    Article  PubMed  Google Scholar 

  • Rosa AR, Sanchez-Moreno J, Martinez-Aran A, Salamero M, Torrent C, Reinares M, et al. Validity and reliability of the Functioning Assessment Short Test (FAST) in bipolar disorder. Clin Pract Epidemiol Ment Health. 2007;3:5.

    Article  PubMed  PubMed Central  Google Scholar 

  • Ryan KA, Vederman AC, Kamali M, Marshall D, Weldon AL, McInnis MG, et al. Emotion perception and executive functioning predict work status in euthymic bipolar disorder. Psychiatry Res. 2013;210(2):472–8.

    Article  PubMed  Google Scholar 

  • Sheehan DV, Lecrubier Y, Sheehan KH, Amorim P, Janavs J, Weiller E, et al. The Mini-International Neuropsychiatric Interview (M.I.N.I.): the development and validation of a structured diagnostic psychiatric interview for DSM-IV and ICD-10. J Clin Psychiatry. 1998;59(Suppl 20):22–33 (quiz 4–57).

    PubMed  Google Scholar 

  • Strakowski SM, Fleck DE, DelBello MP, Adler CM, Shear PK, Kotwal R, et al. Impulsivity across the course of bipolar disorder. Bipolar Disord. 2010;12(3):285–97.

    Article  PubMed  PubMed Central  Google Scholar 

  • Stroop JR. Studies of interference in Serial Verbal Reactions editor’s note: reprint of an original work published in 1935 in the journal of experimental psychology, 18, 643–662. J Exp Psychol Gen. 1992;121:15–23.

    Article  Google Scholar 

  • Swann AC, Anderson JC, Dougherty DM, Moeller FG. Measurement of inter-episode impulsivity in bipolar disorder. Psychiatry Res. 2001;101(2):195–7.

    Article  CAS  PubMed  Google Scholar 

  • Swann AC, Pazzaglia P, Nicholls A, Dougherty DM, Moeller FG. Impulsivity and phase of illness in bipolar disorder. J Affect Disord. 2003;73(1–2):105–11.

    Article  PubMed  Google Scholar 

  • Swann AC, Dougherty DM, Pazzaglia PJ, Pham M, Moeller FG. Impulsivity: a link between bipolar disorder and substance abuse. Bipolar Disord. 2004;6(3):204–12.

    Article  PubMed  Google Scholar 

  • Swann AC, Lijffijt M, Lane SD, Steinberg JL, Moeller FG. Increased trait-like impulsivity and course of illness in bipolar disorder. Bipolar Disord. 2009;11(3):280–8.

    Article  PubMed  PubMed Central  Google Scholar 

  • Torres IJ, Boudreau VG, Yatham LN. Neuropsychological functioning in euthymic bipolar disorder: a meta-analysis. Acta Psychiatr Scand Suppl. 2007;434:17–26.

    Article  PubMed  Google Scholar 

  • Volkert J, Kopf J, Kazmaier J, Glaser F, Zierhut KC, Schiele MA, et al. Evidence for cognitive subgroups in bipolar disorder and the influence of subclinical depression and sleep disturbances. Eur Neuropsychopharmacol. 2015;25(2):192–202.

    Article  CAS  PubMed  Google Scholar 

  • Wilson SP, Kipp K. The development of efficient inhibition: evidence from directed-forgetting tasks. Dev Rev. 1998;18(1):86–123.

    Article  Google Scholar 

  • Young RC, Biggs JT, Ziegler VE, Meyer DA. A rating scale for mania: reliability, validity and sensitivity. Br J Psychiatry. 1978;133:429–35.

    Article  CAS  PubMed  Google Scholar 

  • Zimmerman P, Fimm B. Testbatterie zur Aufmerksamkeitsprüfung (TAP) Version 2.0. [Test battery to measure attention Version 2.0]. Herzogenrath: Psytest; 2006.

    Google Scholar 

Download references

Authors’ contributions

ESS contributed to the study design, collected the data, analyzed and interpreted the data, and drafted the article. PH collected the data and revised the article. JF was involved in interpreting the data, and reworked the article. EQ participated in the study design and revised the article. MA revised the article. TJS contributed to the study design, supervised all procedures and revised the article. All authors read and approved the final manuscript.


We thank Grace O’Malley for proofreading and linguistic revision of the article.

Competing interests

Elisa Sophie Strasser, Paula Haffner, Jana Fiebig and Esther Quinlivan declare that they have no competing interests. Dr. Thomas Stamm has received Grant/Research Support from the German Federal Ministry of Education and Research, Speaker Honoraria from Lundbeck and Bristol-Myers Squibb. He is a consultant to Servier. Dr. Mazda Adli has received Grant/Research Support from the German Federal Ministry of Education and Research, German Federal Ministry of Health, the Volkswagen-Foundation, Lundbeck, esparma, and Bristol-Myers Squibb. He has received Speaker Honoraria from Astra Zeneca, Eli Lilly & Company, Lundbeck, Bristol-Myers Squibb, GlaxoSmithKline, Pfizer, Boehringer Ingelheim, Sanofi, esparma, Wyeth Pharmaceuticals, Gilead, and Deutsche Bank. He has been a consultant to Bristol-Myers Squibb, esparma, and Lundbeck.

Availability of data and materials

All data and materials related to the study can be obtained through contacting the last author at

Ethics and consent to participate statement

All procedures were approved by the local ethics committee of the Charité Universitätsmedizin Berlin (reference number: EA1/363/13 and EA1/132/12). All participants were fully informed and provided written consent prior to participation.


This research did not receive any specific grant from funding agencies in the public, commercial, or not-for-profit sectors.

Author information

Authors and Affiliations


Corresponding author

Correspondence to Elisa Sophie Strasser.

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (, which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Strasser, E.S., Haffner, P., Fiebig, J. et al. Behavioral measures and self-report of impulsivity in bipolar disorder: no association between Stroop test and Barratt Impulsiveness Scale. Int J Bipolar Disord 4, 16 (2016).

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: