The aim of the present study was to test theories of lexical selection using semantic neighbourhood variables in speeded picture naming. Lexical selection is one of the core steps in language production that determines whether the lexical representation of a target word is successfully selected amongst other simultaneously activated, semantically related representations (also referred to as semantic neighbours). Problems with lexical selection are one of the most common reasons for word-finding difficulties in acquired language impairments, such as stroke aphasia, which often lead to the inability to produce correctly intended words. Potential treatments depend on a thorough understanding of the nature of lexical selection and on the variables that affect its ease or difficulty (e.g., spoken word frequency, semantic typicality, age of acquisition). Previous research suggests that word-finding difficulties in individuals with acquired language impairments may be caused by unresolved interference from semantic neighbours (e.g., Ellis, 1985; Stemberger, 1985; Wheeldon & Monsell, 1994). However, to date, only a few studies have investigated the effects of semantic neighbourhood on word production, and the reported findings are heterogeneous across studies.

Language production theories generally agree that when a speaker intends to produce a word, the meaning is activated at the conceptual (semantic) level and forwarded to the word’s representation at the lexical level. During this process, semantic activation not only spreads to the target but also to representations that share some of the target’s meaning—the semantic neighbours. Hence, activation of semantic neighbours has its origin at the conceptual level and results in the coactivation of related representations at the lexical level. What remains debated is how the target is selected amongst its semantic neighbours, and, specifically, whether the number and semantic similarity of semantic neighbours interferes with the target’s lexical selection.

According to noncompetitive theories of lexical selection (Dell, 1986; Dell, Schwartz, Martin, Saffran, & Gagnon, 1997; Finkbeiner & Caramazza, 2006; Mahon, Costa, Peterson, Vargas, & Caramazza, 2007; Morton, 1985; Oppenheim, Dell, & Schwartz, 2010), semantic neighbours do not compete and thus do not interfere with lexical selection. Instead, lexical selection is proposed to be determined by a target’s activation strength alone, whereby a lexical representation has to reach a certain level of activation (Dell, 1986; Dell et al., 1997) or threshold in order to be selected (e.g., Miozzo & Caramazza, 2003; Morton, 1969, 1985).

Competitive theories (e.g., Levelt, Roelofs, & Meyer, 1999; Roelofs, 1992, 2018; Wheeldon & Monsell, 1994) assume that lexical selection of a target word is strongly influenced by its semantic neighbours. In order to be selected, the lexical representation of a target has to overcome interference from its coactivated semantic neighbours. Competition can be implemented in different ways. For example, Levelt et al. (1999) proposed a selection mechanism based on the Luce choice rule (e.g., Luce, 1959; Roelofs, 1992) whereby the probability of a target word’s selection depends on its own level of activation divided by the summed activation of its semantic neighbours. Other models assume lateral inhibition between semantic neighbours at the lexical level (e.g., Cutting & Ferreira, 1999; Harley, 1993a, 1993b; McClelland & Elman, 1986; McClelland & Rumelhart, 1981; Stemberger, 1985). Crucially, in both cases interference should increase with the number of coactivated semantic neighbours (semantic neighbourhood density) and their activation levels. The activation levels of semantic neighbours depend on their semantic similarity to the target (semantic neighbourhood similarity or distance). The more semantic features neighbours share with the target and thus the more semantically similar they are, the more highly they become coactivated and the stronger they interfere with target selection at the lexical level.

One competition model that explicitly highlights the critical role of semantic neighbourhood in language production is the swinging lexical network (SLN) by Abdel Rahman and Melinger (2009). The swinging lexical network model assumes that semantic neighbours exert differential simultaneous influences at the conceptual and lexical level. Semantic activation spread between the target and its semantic neighbours at the conceptual level leads to conceptual priming (facilitation) and simultaneously to the coactivation of the target and its semantic neighbours at the lexical level, resulting in lexical competition (interference). Naming latencies are assumed to reflect the net effects of conceptual priming and lexical competition. Interference from semantic neighbours at the lexical level should be stronger and may potentially outweigh conceptual priming when a whole cohort (a bigger number) of coactivated related items competes with the target for lexical selection. Hence, in order to predict the ease of lexical processing, one has to consider the strength and interplay of both effects: In case of target words with many compared with few semantic neighbours, lexical interference would outweigh semantic facilitation (Abdel Rahman & Melinger, 2009; Melinger & Abdel Rahman, 2013), especially when semantic neighbours are very similar (close) and therefore highly activated (e.g., Rose & Abdel Rahman, 2017).

Semantic interference with increased naming latencies for pictures named in the context of semantically related compared with unrelated representations (e.g., picture-word interference, blocked cyclic or continuous naming task, e.g., Belke, Meyer, & Damian, 2005; Damian & Als, 2005; Damian, Vigliocco, & Levelt, 2001; Glaser & Düngelhoff, 1984; Glaser & Glaser, 1989; Klein, 1964; La Heij, 1988; Schriefers, Meyer, & Levelt, 1990; Starreveld & La Heij, 1996; Vigliocco, Vinson, Damian, & Levelt, 2002)Footnote 1 were taken as evidence in favour of lexical competition models. However, recently, alternative explanations focusing on task-specific mechanisms were suggested to account for the interference effects. Specifically, in the picture-word interference task word distractors were suggested to block the articulatory output buffer, with less task-relevant unrelated words being more quickly removed than potentially response-relevant categorically related words (Mahon et al., 2007). In another noncompetitive theory, a learning mechanism has been proposed to account for interference effects that were found in continuous naming and cyclic blocking tasks: here (noncompetitive) lexical selection of a target leads subsequently to a weakening of connections which link the semantic and lexical representations of competitors (Oppenheim et al., 2010).

In the present study, we chose a speeded picture-naming paradigm with manipulations of semantic neighbourhood as the key factor to reevaluate the competitive or noncompetitive nature of lexical selection. Such item-inherent variations do not require context manipulations and thus have the advantage of preventing confounding influences of additional factors associated with different context paradigms (e.g., working memory, executive control functions, response strategies). This study is the first that assessed the underlying mechanisms of lexical selection by investigating combined influences of semantic neighbourhood density and semantic neighbourhood similarity (distance). Different measures of semantic neighbourhood density and similarity were compared, all of which were based on shared semantic features: Semantic neighbourhood similarity as a measure of the mean semantic similarity between the target and all of its’ semantic neighbours; raw semantic neighbourhood density, defined as the total number of a target word’s semantic neighbours; close semantic neighbourhood density as a measure of semantically very similar (close) neighbours that share many semantic features with the target (e.g., similar to studies by Mirman, 2011; Mirman & Graziano, 2013); distant semantic neighbourhood density as a measure of semantically less similar (distant) neighbours that share only a few semantic features with the target; and category-specific semantic neighbourhood density, including the number of all semantic neighbours that belong to the semantic category of the target (similar to studies by Blanken, Dittmann, & Wallesch, 2002; Bormann, 2011; Bormann, Kulke, Wallesch, & Blanken, 2008).

Most of the previous studies on semantic neighbourhood effects in language production investigated only one semantic neighbourhood variable—for example, category-specific semantic neighbourhood density without controlling or taking into account semantic neighbourhood similarity. Moreover, depending on the task and tested population (language unimpaired vs. language impaired speakers), those studies revealed mixed results regarding the existence and direction of the semantic neighbourhood density effects. For example, Blanken et al. (2002), Bormann et al. (2008), and Bormann (2011) assessed how category-specific semantic neighbourhood density influenced picture naming in German. The density measure was obtained by asking language unimpaired participants to rate the quantity of semantically related representations that belong to the same semantic category ranging from ‘many’ to ‘hardly any’ coordinates. Blanken et al. (2002), Bormann et al. (2008) and Bormann (2011) found an effect of category-specific semantic neighbourhood density on the production of different error types in participants with aphasia and progressive anomia. Overall naming accuracy was the same for targets with many compared with few semantic neighbours; however, the former resulted in the production of more semantic errors, while the latter provoked less semantic errors but more omissions (see also Kittredge, Dell, & Schwartz, 2007a). Unlike language impaired participants, language unimpaired speakers showed no effects of category-specific neighbourhood density on naming latency and accuracy in picture naming (Bormann, 2011).

In contrast, Kittredge, Dell, and Schwartz (2007b) reported more accurate responses for words with many compared with few semantic neighbours, but no effect of error types for English speaking aphasic individuals. Unlike Blanken et al. (2002), Bormann et al. (2008), and Bormann (2011), Kittredge et al. (2007b) derived their category-specific semantic neighbourhood density measure from the frequency of co-occurrence of two words in a large text corpus using the Latent Semantic Analysis database (Landauer, Foltz, & Laham, 1998). Co-occurring words were regarded as semantic neighbours if they had a semantic similarity of equal or greater than 0.4 and belonged to the same semantic category as the target word.

Similar to category-specific density, effects of close semantic neighbourhood density are inconsistent across studies. Mirman (2011) used the McRae, Cree, Seidenberg, and McNorgan (2005) semantic feature production norms to derive close semantic neighbourhood density scores by calculating the total number of a target’s close semantic neighbours that were defined by semantic similarity values greater than 0.4, based on the number of shared semantic features between the neighbour and the target. Mirman (2011) found that naming targets with many compared with few close semantic neighbours resulted in less accurate responses and more semantic errors for aphasic and language unimpaired participants. Recently, Fieder et al. (2016) reported data from a picture-naming study with nine participants suffering from the semantic variant of primary progressive aphasia (svPPA) and a group of age-matched controls. Using the same measure for close semantic neighbourhood density as Mirman (2011), Fieder et al. (2016) found greater interference in form of less accurate naming responses and more semantic errors for words with many compared with few close semantic neighbours in both participant groups.

Rabovsky, Schad, and Abdel Rahman (2016) provided further evidence for interference by semantic neighbours when assessing effects of intercorrelational feature density, a different measure of semantic neighbourhood density, on picture naming in language unimpaired German speakers. Intercorrelational feature density, taken from the McRae et al. (2005) database, reflects the degree to which different feature pairs of a concept co-occur in other concepts. A greater proportion of co-occurring features (e.g., has four legs, has a tail, has fur) indicates a great semantic overlap between the target (e.g., cat) and other concepts (e.g., dog, mouse) and thus the existence of many semantic neighbours. In line with the results of Mirman (2011), Rabovsky et al. (2016) found longer naming latencies and lower accuracy for words with high intercorrelational feature density. In contrast, for semantic richness (the number of semantic features associated with a concept), facilitation with shorter naming latencies and higher accuracy was found, taken to reflect higher activation levels of target concepts and lexical entries with many compared with few semantic features.

Finally, Hameau, Biedermann, and Nickels (2018) and Hameau (2017) investigated the effects of feature-based semantic neighbourhood density, a combination of rated category-specific semantic neighbourhood density (see Bormann, 2011) and the number of close semantic neighbours (see Mirman, 2011) on naming in a group of aphasic and unimpaired speakers. In line with the results of Bormann (2011), feature-based semantic neighbours did not predict naming latency and accuracy in language unimpaired speakers. However, unlike previous aphasia studies (e.g., Bormann, 2011; Mirman, 2011), a facilitative effect was found for naming accuracy: Words with many semantic neighbours were named more accurately and were less likely to result in the production of phonological errors or omissions, but not in semantic errors. Similar to Blanken et al. (2002), Bormann et al. (2008), and Bormann (2011), words with many semantic neighbours were more likely to result in the production of semantic errors than in omissions.

To summarise, only a few studies have found effects of semantic neighbourhood density that are consistent with respect to naming latency, accuracy, and/or error types and thus support lexical selection by competition (e.g., Fieder et al., 2016; Mirman, 2011; Rabovsky et al., 2016). Other studies found either no effects of semantic neighbourhood density on naming accuracy and/or latency suggesting that ease of selection is independent of neighbourhood size (e.g., Bormann, 2011), or even facilitative effects of semantic neighbourhood density, challenging the idea of competitive lexical selection (e.g., Hameau et al., 2018; Kittredge et al., 2007b).

Some of the reasons for the heterogeneous findings across semantic neighbourhood density studies may relate to the use of various different measures, each reflecting different aspects of semantic neighbourhood density, such as frequency of word co-occurrence in texts (e.g., Kittredge et al., 2007a, 2007b) and/or similarities in semantic category (e.g., Blanken et al., 2002; Bormann, 2011; Bormann et al., 2008), semantic features (e.g., Fieder et al., 2016; Mirman, 2011; Rabovsky et al., 2016), or a combination of both (e.g., Hameau et al., 2018). For example, deriving semantic neighbourhood density from the frequency of word co-occurrence in the same semantic context (Kittredge et al., 2007b) might have led to the choice of neighbours that did not only belong to the same semantic category but were additionally strong associates (e.g., cat–mouse; chair–table) while neighbours that were semantically similar but only weak associatively related (e.g., cow–horse) might have been neglected. A semantic neighbourhood density measure that is based on a combination of associative and coordinate relations could explain facilitative effects in naming (e.g., Alario, Segui, & Ferrand, 2000; La Heij, Dirkx, & Kramer, 1990; but see Abdel Rahman & Melinger, 2007; Aristei, Melinger, & Abdel Rahman, 2010). In addition, methodological limitations, such as small item sets and the absence of controlling for or taking into account the full range of psycholinguistic variables (including variables such as imageability, semantic typicality, and semantic richness) could also have led to inconsistencies across studies (e.g., Alario et al., 2004; Baayen & Milin, 2010; Nickels & Howard, 1995). Moreover most of the previous studies on semantic neighbourhood effects in language production investigated only one semantic neighbourhood variable, either category specific or close semantic neighbourhood density without controlling or taking into account others and their potential interactions (but see Hameau et al., 2018).

With regard to semantic similarity, to date there are no picture-naming studies that assessed the influence of semantic neighbourhood similarity on naming in the form of a target item-inherent variation and thus without context manipulations. So far, the influence of semantic neighbourhood similarity has only been investigated in the context of semantically more compared with semantically less similar representations (Aristei et al., 2010; Lee & de Zubicaray, 2010; Mahon et al., 2007; Vigliocco, Vinson, Lewis, & Garrett, 2004). For example, graded effects of semantic similarity (distance) were found in picture-word interference tasks with greater interference when naming target pictures that were presented with semantically very similar distractor words compared with semantically less similar distractor words (e.g., Vigliocco et al., 2004; but see Hutson & Damian, 2014; Mahon et al., 2007).

The present study aimed to investigate the influence of semantic neighbourhood density and semantic neighbourhood similarity on naming performance, with a focus on the question of how semantic neighbourhood similarity may influence effects of semantic neighbourhood density while taking into account influences of other lexically and semantically relevant variables. A large set of 180 items was tested, and all variables including semantic neighbourhood were analysed as continuously varying in order to prevent artefacts due to small and unusual item sets that can result from extensive matching (e.g., see Hauk, Davis, Ford, Pulvermüller, & Marslen-Wilson, 2006; Rabovsky et al., 2016). A speeded picture-naming task was used as it was found to be suitable to assess errors in language production that are similar to the errors of language impaired speakers (e.g., Hodgson & Lambon Ralph, 2008; Kello, 2004; Kello & Plaut, 2010; Mirman, 2011), and thus to find further converging evidence for semantic neighbourhood effects not only on naming latencies and accuracy but also on semantic errors and omissions.

If lexical selection is competitive,Footnote 2 we predict to find interference due to an increase in lexical competition with an increasing number of a target’s semantic neighbours. Specifically, we hypothesize that words with more semantic neighbours are produced slower, less accurately, and are associated with an increase of naming errors that have their origin at the semantic and/or lexical level including semantic errors and/or omissions compared with words with fewer semantic neighbours. With regard to lexical-semantic errors, we might predict to find a similar number of semantic errors and omission errors. Alternatively, a steeper increase of semantic errors compared with omissions could also be predicted since an increasing number of semantic neighbours not only enhances lexical competition but might also raise the probability of selecting erroneously a semantic neighbour rather than resulting in a selection failure and thus an omission error at the lexical level. We do not expect to find effects of semantic neighbourhood density for the production of other error types, that are not the direct result of processing at the semantic or lexical (lexical-syntactic, lemma) level, including phonological errors and semantically and phonologically unrelated errors. Concerning semantic neighbourhood similarity, an increasing semantic overlap (similarity) between the target and its’ semantic neighbours should result in a stronger co-activation of competitors at the lexical level. Therefore, the network should be most highly active—and competition should be strongest—when a big cohort of closely related (and thus, strongly coactivated) competitors is active (Abdel Rahman & Melinger, 2009), resulting in enhanced semantic neighbourhood effects (effects of semantic neighbourhood similarity) with increasing semantic overlap. Hence, we expect that the likelihood of producing semantic errors and/or omissions increases for words with semantically more similar neighbours compared with words with semantically less similar neighbours.

In contrast, if lexical selection is noncompetitive, no effects of semantic neighbourhood density or semantic neighbourhood similarity on naming should be found since semantic neighbours do not compete with the target for selection. If anything, facilitation due to conceptual priming of the target by its semantic neighbours might occur.

Method

Participants

Thirty participants (24 females) with a mean age of 25 years (ranging from 19 to 34 years) gave informed consent to take part in this study in exchange for course credits or 7€/hour. The number of participants was based on a previous study of semantic neighbourhood density in language production (e.g., Mirman, 2011) in which the same speeded picture-naming paradigm was used. The study was approved by the local ethics committee of the Humboldt-Universität zu Berlin. All participants were native speakers of German with normal or corrected-to-normal vision and no history of language impairment.

Materials

Stimuli consisted of 180 coloured photographs of objects. All stimuli were chosen from the British database for concept property norms by Devereux, Tyler, Geertzen, and Randall (2014). Stimuli in the database were excluded, if they were abstract/low imageable and therefore hard to depict (e.g., codeine, penicillin) or not very well known in the German culture (e.g., satsuma). Stimuli were further excluded if their German translation resulted in a morphologically complex word (e.g., compounds), a homophone (e.g., English: ‘castle’; German: ‘Schloss’ with the word meaning ‘castle’ or ‘lock’), a synonym (e.g., English: ‘orange’; German: ‘Apfelsine’ or ‘Orange’), or in the same word twice (e.g., while English has two words for ‘pigeon’ and ‘dove’, German has only one word, ‘Taube’).

Information on semantic neighbourhood variables for each stimulus was obtained from the database for concept property norms by Devereux et al. (2014). Data for the concept property norms were collected by asking participants to generate semantic features for given words (concepts). Semantic features were analysed, and a feature vector was calculated for each word which was weighted for the features’ production frequencies—the number of participants that generated a certain feature for a concept. Devereux et al. (2014) created a word–feature matrix by combining all of the semantic features across words/concepts. Features which were generated fewer than five times for a given concept in the matrix were excluded. Semantic similarity between two concepts was calculated as the cosine between their two production frequency vectors, which varied from zero (for no shared semantic features) to one (for identical semantic feature vectors). Information on semantic similarity was used to identify the number of semantic neighbours (semantic neighbourhood density) and the similarity of semantic neighbours to their target (semantic neighbourhood similarity; for a similar procedure, see McRae et al., 2005).

Raw semantic neighbourhood density was measured by the number of concepts that have a similarity to the target of greater than zero. For example, the target ‘Pfirsich’ (peach) has semantic neighbours, such as nectarine, plum, melon, apple, carrot, potato, jam, and ball. Semantic neighbourhood similarity (distance) was defined as the mean cosine between the target’s feature vector and the different feature vectors of all of the target’s semantic neighbours, similar to the definition by Mirman and Magnuson (2008). For example, the target ‘Pfirsich’ (peach) has a relatively high semantic neighbourhood similarity of 0.130 and thus semantically very similar neighbours, while the target ‘Gürtel’ (belt) has a relatively low semantic neighbourhood similarity of 0.064 and thus semantically less similar neighbours (see Appendix Table 6 for the full list of target stimuli and their semantic neighbourhood values).

Beside raw semantic neighbourhood density, additional, more in-depth analyses were run with three more specific semantic neighbourhood density measures in order to assess whether semantic neighbourhood density affects naming performance if semantic neighbours exceed a certain degree of semantic similarity. As mentioned earlier, semantic similarity between two concepts can vary between zero (for no shared semantic features) and one (for identical semantic feature vectors). Semantic neighbourhood similarity between the target and each neighbour was used to categorise semantic neighbours as either semantically close or distant to a target by converting the continuous variable of semantic similarity into a categorical variable with two equally spaced intervals: 0–0.5, 0.5–1.0. Subsequently the sum of the total number of neighbours within each category was calculated to create the two different density variables: The number of semantically close neighbours (close semantic neighbourhood density) including neighbours with a similarity between 0.5 and 1.0 (for a similar measure, see Mirman & Magnuson, 2008) and the number of semantically distant neighbours (distant semantic neighbourhood density) including neighbours with a similarity between zero and 0.5.

In addition to close and distant semantic neighbourhood density, we included a category-specific semantic neighbourhood density measure that comprised the number of all semantic neighbours that belong to the same semantic category as the target (similar to studies by Blanken et al., 2002; Bormann, 2011; Bormann et al., 2008). For example, the target ‘Pfirsich’ (peach) has 16 category-specific semantic neighbours, such as nectarine, plum, melon, apple, lemon, and strawberry. The majority (85%) of close semantic neighbours were from the same semantic category and were thus coordinates of the targets, while most of the category-specific neighbours (90.68%) from the category-specific semantic neighbourhood density measure were not semantically close (similarity of less than 0.5) to the target and thus were not included in the close semantic neighbourhood density measure. The category-specific semantic neighbourhood density variable was used to disentangle whether an influence of close semantic neighbourhood density can be ascribed to the number of semantically similar neighbours or alternatively to the neighbours’ taxonomical relatedness to the target. In case of the latter, we would expect to find interfering effects on naming performance from close and/or category-specific semantic neighbourhood density.

The concept property norms by Devereux et al. (2014) were further used to derive information about the semantic richness of each target word which was defined as the number of a target’s semantic features. Information about other psycholinguistic variables was taken from the following databases: spoken lemma frequency and word length in form of the number of phonemes from the German dlexDB database (Heister et al., 2011). Data for imageability, semantic typicality, familiarity, age of acquisition, and visual complexity were obtained through ratings by the authors. For concept familiarity and visual complexity, 17 participants were asked to rate the depicted objects using the instructions from Gilhooly and Hay (1977). For age of acquisition, semantic typicality, and imageability, 10 participants were asked to rate the stimuli names using instructions by Schröder, Gemballa, Ruppin, and Wartenburger (2012) and Alario and Ferrand (1999).Footnote 3 All of the experimental stimuli were further controlled for name agreement in order to prevent naming difficulties from arising through uncertainty in identifying the depicted objects (e.g., Cheng, Schafer, & Akyurek, 2010). Seventeen participants provided measures of naming accuracy in a picture-naming experiment in which they were instructed to name the pictures as accurately as possible. Only pictures which were named with above 80% name agreement (Mean = 97.09; SD = 4.59) were included in this study (see Appendix Table 7).

Design and procedure

Participants were tested individually in a quiet room. Pictures were displayed on a white background in the centre of a computer screen. At the beginning of the experiment, participants were instructed that prior to the presentation of the picture they would hear a series of three beeps in combination with numbers which count down from 3, 2, 1. On the fourth beep they saw the picture and were asked to name it as quickly as possible before the fifth beep and the number zero appeared. Each of these time windows were 500-ms long. After the presentation of the number zero, a blank screen appeared for 1,000 ms. For a visual presentation of the different time windows within a trial, see Fig. 1.

Fig. 1
figure 1

Diagram of the course of events for a single trial in the speeded picture-naming task.

Naming latencies were measured by means of a voice key, which was activated at the onset of the target presentation. Vocal responses were recorded until the timeout of 2,000 ms. All of the audio-recorded response trials were transcribed and checked for accuracy and timing using CheckVocalFootnote 4 (Protopapas, 2007) to ensure that the voice-trigger mechanism had correctly registered the beginning of the response. Trials which were mistriggered (e.g., through lip smacking, heavy breathing, movements or sound volume) were adjusted. Trial sequences during the experiment were controlled by DMDX (Forster & Forster, 2003).

The experiment began with 20 practice trials in which participants named pictures that were not part of the 180 critical stimuli. After each practice trial, participants could take a break to ask questions or receive feedback if necessary. All of the 180 pictures were distributed evenly across two blocks. The order of the blocks was counterbalanced across participants. The order of stimuli within each block was randomised. Participants received the two blocks with a short break between blocks. The entire experiment lasted approximately 20 minutes.

Response coding

The first full response was coded. A full naming response was defined as consisting of at least one possible German syllable. A response was coded as correct if it consisted of the correctly pronounced target word or of a word that is an acceptable alternative response for the target (e.g., response: ‘Henne’ [hen] for target: ‘Huhn’ [chicken]).

Incorrect responses were coded as semantic errors if they consisted of a word that is semantically related to the target by belonging to the same semantic category, such as superordinates (e.g., ‘Blume’ [flower] for ‘Tulpe’ [tulip] and coordinates (e.g., ‘Strauβ’ [ostrich] for ‘Pfau’ [peacock]), or by belonging to a different semantic category, such as associates (e.g., ‘Wüste’ [desert] for ‘Pyramide’ [pyramid]), concepts that stand in a part–whole relationship to the target (e.g., ‘Nadel’ [needle] for ‘Spritze’ [veil]) and semantic other for semantically related responses that could not be easily categorised (e.g., ‘Gemüse’ [vegetable] for ‘Petersilie’ [parsley]). False starts that shared 50% or more of their phonemes with a semantically related representation were also coded as semantic errors (e.g., ‘Lö [Löwe]’ [lion] for the target ‘Tiger’ [tiger]). However, we are aware that some response types cannot be unambiguously classified as (semantic) errors that resulted from faulty processing at the lexical and/or semantic level. For example, since participants were not familiarised with the stimuli’s names prior to testing, superordinate errors and errors in a part–whole relationship to the target could have resulted from a misunderstanding of the level of the desired response. One response type that can be clearly classified as semantic error are coordinates of the target. Therefore, in addition to a general semantic error analysis, we ran a separate analysis with coordinate errors only.

Omissions were instances where the participants failed to respond or made a comment that expressed a failure to respond (e.g., ‘I don’t know’). Other error types were responses (words and nonwords) that had no semantic relationship to the target including unrelated errors and false starts (e.g., ‘Shop’ [shop] for ‘Kutsche’ [carriage]; nonword ‘/ro:/’ for ‘Sarg’ [coffin]), phonological errors and false starts that shared a minimum of 50% of their phonemes with the target (e.g., nonword ‘/kank/’ for ‘Anker’ [anchor]; nonword ‘/ʃpa:/’ for ‘Spargel’ [asparagus]; see Appendix Table 8 for a complete list of participants’ error responses).

Analysis

The analysis of the picture-naming latency data (RT) was performed using linear mixed-effects modelling as implemented in the lme4 package (Bates, Maechler, Bolker, & Walker, 2014) in the statistical software R (Version 3.2.1; R Core Team, 2014). We started with a full model including 11 fixed factors: semantic neighbourhood similarity, raw semantic neighbourhood density, semantic richness, semantic typicality, imageability, familiarity, age of acquisition, lemma frequency, visual complexity, length (number of phonemes), and presentation order. All variables were cantered and scaled. Random variation by specific items and participants was accounted for by entering items and participants with random intercepts. In a first step, a stepwise backward elimination procedure was used to remove all variables, except for the semantic neighbourhood variables, that did not improve the model’s fit significantly in order to assess and consider the influence of language relevant psycholinguistic variables. In a second step, individual effects of each semantic neighbourhood variable in the presence of the other semantic neighbourhood variable and the remaining psycholinguistic variables was analysed by excluding only one of the two semantic neighbourhood variables at the time. Log-likelihood ratio tests were used to compare the models’ fit throughout Steps 1 and 2. The summary of the final model which includes only the significant psycholinguistic and semantic neighbourhood variables is reported, and p values were determined using the package lmerTest (Kuznetsova, Brockhoff, & Christensen, 2014).

For the additional, more specific semantic neighbourhood density analyses, the three semantic neighbourhood density variables (close semantic neighbourhood density, distant semantic neighbourhood density, and category-specific semantic neighbourhood density) were inserted into the original reduced base model which included only the psycholinguistic control variables that proved to be significant (but excluded the previous semantic neighbourhood variables: raw semantic neighbourhood density and semantic neighbourhood similarity). A backward elimination procedure was used where only one of the three semantic neighbourhood variables were excluded at the time. The fit between the reduced and the more complex model was compared using log likelihood ratio tests.

The error analyses were performed using the same principles as the naming latency analyses. We used generalised linear mixed-effects models for binomial outcomes to analyse the influence of psycholinguistic variables on naming accuracy and the production of different error types in the base model, and the influence of variables of semantic neighbourhood in the advanced model using the function glmer as part of the R package lme4 (Bates et al., 2014). Separate models were fitted with different dependent variables including accuracy (correct vs. incorrect responses), number of semantic errors versus correct responses, number of coordinate errors versus correct responses and number of omissions versus correct responses. Due to the small number of phonological and unrelated errors an analysis including other error types versus correct responses could not be performed. For the analyses of the different error types, separate binomial analyses were conducted as no standard exists yet for mixed multinomial analyses (for similar analyses, see also Hameau, 2017; Hameau et al., 2018; Jaeger, Furth, & Hilliard, 2012). For each separate binomial analysis, the errors of interest (e.g., semantic errors) were coded as zero and compared with correct responses coded as one. Responses/errors that were not part of the specific analyses were excluded by leaving their coding blank. We further assessed whether the different semantic neighbourhood variables predicted the probability of producing a semantic error (coded as 1) over an omission (coded as 0; see Bormann, 2011; Hameau et al., 2018).

Results

Descriptive analyses

Trials in which participants produced naming errors including omissions (985 data points, 18.24%) were excluded for the naming latency analysis. Trials where naming latencies were faster than 300 ms, and those which were more than three standard deviations above or below the mean of the participant (76 data points, 1.72%) were removed. Table 1 gives an overview of the different response/error types (correct responses, semantic errors, coordinate errors, omissions, and other error types). There was no correlation between raw semantic neighbourhood density and semantic neighbourhood similarity, nor between close semantic neighbourhood density, distant semantic neighbourhood density and category-specific semantic neighbourhood density (see Appendix Table 9).

Table 1 Overall rate of correct responses and different error types

Latency analyses

RTs were transformed to approximate normal distribution using a reciprocal transformation (power −0.735) as indicated by the Box-Cox test. The reduced model included familiarity, age of acquisition, lemma frequency, and presentation order as significant predictors of naming latencies. Naming slowed down for words that were less familiar, less frequent, and acquired later in life. Naming latencies increased further with the progression of the experiment. The exclusion of the remaining, nonsignificant psycholinguistic variables (semantic richness, semantic typicality, imageability, length, and visual complexity) did not reduce the goodness of fit, χ2(5) = 2.229, p = .817, but instead increased the fit of the model as indicated by the lower AIC and BIC values (AIC: full model: −59227 vs. reduced model: −59235; BIC: full model: −59132 vs. reduced model: −59171). Neither semantic neighbourhood similarity, χ2(1) = 0.791, p = .374, nor raw semantic neighbourhood density, χ2(1) = 0.106, p = .745, were significant predictors of naming latency. There was no significant interaction between raw semantic neighbourhood density and semantic neighbourhood similarity, χ2(1) = 1.027, p = .311. For details of the final model, see Table 2.

Table 2 LMM statistics with the final model including only significant variables for naming latency (reciprocal transformed)

The more detailed analysis of semantic neighbourhood density revealed a significant effect of close semantic neighbourhood density, χ2(1) = 5.216, p = .022, while neither distant semantic neighbourhood density,Footnote 5 χ2(1) = 0.283, p = .595, nor category-specific semantic neighbourhood density, χ2(1) = 1.654, p = .198, were significant predictors of naming speed. Naming slowed down for words with many close semantic neighbours (see Table 2).

Error analyses

Correct responses versus incorrect responses

The reduced model included familiarity, age of acquisition and semantic typicality as significant predictors of naming accuracy. Words that were more typical exemplars of a semantic category, very familiar, and acquired early in life resulted in more accurate naming responses. The exclusion of the remaining variables did not result in a significant reduction of the goodness of fit, χ2(6) = 4.885, p = .559, but instead in a slightly better fit (AIC: full model: 4063 vs. reduced model: 4056; BIC: full model: 4155 vs. reduced model: 4109). Semantic neighbourhood similarity, χ2(1) = 6.445, p = .011, but not raw semantic neighbourhood density, χ2(1) = 0.289, p = .591, improved the model’s fit significantly. Words with semantically more similar neighbours led to a decrease of naming accuracy compared with words with semantically less similar neighbours (see Fig. 2 and Table 3). There was no significant interaction between raw semantic neighbourhood density and semantic neighbourhood similarity, χ2(1) = 0.517, p = .472.

Fig. 2
figure 2

Effect of semantic neighbourhood similarity on naming accuracy (accurate vs. inaccurate responses); semantic errors (coded as 1) versus accurate responses (coded as 0); coordinate errors (coded as 1) versus accurate responses (coded as 0); and omissions (coded as 1) versus accurate responses (coded as 0) depicted as logistic regression lines with 95% confidence intervals. Points indicate average number of accurate responses; semantic errors; coordinate errors; or omissions for each subject for three equal-sized bins of semantic neighbourhood similarity

Fig. 3
figure 3

Effect of close semantic neighbourhood density on naming latency (RT data was back-transformed to the original scale), naming accuracy (accurate versus inaccurate responses); semantic errors (coded as 1) versus accurate responses (coded as 0); coordinate errors (coded as 1) versus accurate responses (coded as 0); and omissions (coded as 1) versus accurate responses (coded as 0) depicted as linear regression line for naming latency and logistic regression lines for accuracy and error responses with 95% confidence intervals. Points indicate average response times for naming latency or average number of accurate responses; semantic errors; coordinate errors; or omissions for each subject for three equal-sized bins of close semantic neighbourhood density

Table 3 LMM statistics with the final model including only significant variables for naming accuracy (accurate (1) − inaccurate responses (0))

The more detailed analysis of semantic neighbourhood density revealed a significant effect of close semantic neighbourhood density, χ2(1) = 9.993, p = .002, while neither distant semantic neighbourhood density, χ2(1) = 0.143, p = .706, nor category-specific semantic neighbourhood density, χ2(1) = 0.848, p = .357, were significant predictors of naming accuracy. Naming accuracy decreased for words with many close semantic neighbours. Details of the final model can be found in Table 3.

Correct responses versus semantic errors

The reduced model included familiarity and age of acquisition. The likelihood of producing semantic errors compared with accurate responses increased for words that were less familiar and acquired later in life. Reducing the model by excluding the remaining variables did not result in a significant drop in goodness of fit, χ2(7) = 10.408, p = .167, but rather in an improvement (AIC: full model: 2939 vs. reduced model: 2935; BIC: full model: 3030 vs. reduced model: 2981). Similar to accuracy, the model’s fit was significantly improved by semantic neighbourhood similarity, χ2(1) = 4.325, p = .038, but not by raw semantic neighbourhood density, χ2(1) = 0.151, p = .697 (see Fig. 2 and Table 4). The likelihood of producing a semantic error over a correct response increased for words with semantically more similar neighbours. There was no significant interaction between raw semantic neighbourhood density and semantic neighbourhood similarity, χ2(1) = 0.152, p = 697.

Table 4 LMM statistics with the final model including only significant variables for semantic errors and coordinate errors compared with accurate responses (accurate responses (1) − semantic/coordinate errors (0))

The results of the more detailed analysis of semantic neighbourhood density showed a significant effect of close semantic neighbourhood density, χ2(1) = 6.577, p = .010, but neither of distant semantic neighbourhood density, χ2(1) = 0.088, p = .767, nor category-specific semantic neighbourhood density, χ2(1) = 0.057, p = .811. The likelihood of producing a semantic error over an accurate response increased for words with many close semantic neighbours (see Table 4 for the final model).

Correct responses versus coordinate errors

The reduced base model included familiarity as a significant predictor and age of acquisition as marginally significant predictor for the production of coordinate errors in the reduced model. The likelihood of producing a coordinate error over an accurate response increased for words that were less familiar and acquired later in life. Exclusion of the nonsignificant variables did not result in a significant reduction in goodness of fit, χ2(7) = 7.370, p = .391, but rather in an improvement (AIC: full model: 2483 vs. reduced model: 2477; BIC: full model: 2575 vs. reduced model: 2522). Semantic neighbourhood similarity, χ2(1) = 4.950, p = .026, but not raw semantic neighbourhood density, χ2(1) = 1.627, p = .202, was once again the only significant semantic neighbourhood predictor of coordinate errors leading to a higher likelihood of producing a coordinate error instead of a correct response for words with semantically more similar neighbours (see Fig. 2 and Table 4). There was no significant interaction between the two semantic neighbourhood variables, χ2(1) = 0.270, p = .604.

In the more specific analysis of semantic neighbourhood density a significant effect of close semantic neighbourhood density, χ2(1) = 5.253, p = .022, but not of distant semantic neighbourhood density, χ2(1) = 1.404, p = .236, or category-specific semantic neighbourhood density, χ2(1) = 0.001, p = .976, was found. The probability of naming a target picture with a coordinate error instead of a correct response increased for words with many close semantic neighbours (see Table 4 for the final model).

Correct responses versus omissions

The reduced model included semantic typicality and familiarity as significant predictors and age of acquisition as a marginally significant predictor of omissions. Items that were less typical, less familiar, and acquired later in life were more likely to results in an omission than in an accurate response. None of the other linguistic variables improved the model’s fit significantly, χ2(6) = 3.724, p = .714. The exclusion of the nonsignificant variables resulted in a better fit (AIC: full model: 1668 vs. reduced model: 1660; BIC: full model: 1758 vs. reduced model: 1711). Similar to semantic and coordinate errors, only semantic neighbourhood similarity, χ2(1) = 7.935, p = .005, but not raw semantic neighbourhood density, χ2(1) = 0.289, p = .591, was a significant predictor of omissions. Words with semantically more similar neighbours were more likely to result in an omission error than to be named correctly (see Fig. 2 and Table 5). No significant interaction was found between the two semantic neighbourhood variables, χ2(1) = 2.594, p = .107.

Table 5 LMM statistics with the final model including only significant variables for omissions compared with accurate responses (accurate responses (1) − omissions (0)) and for semantic errors compared with omissions (semantic errors (1) − omissions (0))

In the more detailed analysis of semantic neighbourhood density, only close semantic neighbourhood density, χ2(1) = 6.461, p = .011, but neither distant semantic neighbourhood density, χ2(1) = 0.139, p = .710, nor category-specific semantic neighbourhood density, χ2(1) = 0.781, p = .377, reached significance. Naming of words with many close semantic neighbours was more likely to result in an omission error than in an accurate response compared with words with few close semantic neighbours (see Table 5 for the final model).

Semantic errors versus omissions

Lemma frequency was a significant predictor for the production of semantic errors over omissions: Participants were more likely to produce semantic errors than omissions for words of higher frequency. The exclusion of the remaining psycholinguistic variables did not change the model’s goodness of fit, χ2(8) = 6.032, p = .644, but resulted in a slightly better fit (AIC: full model: 936 vs. reduced model: 926; BIC: full model: 1004 vs. reduced model: 955). Neither raw semantic neighbourhood density, χ2(1) = 0.026, p = .872, nor semantic neighbourhood similarity, χ2(1) = 0, p = 1.00, and their interaction, χ2(1) = 0.522, p = .470, were significant (see Table 5).

In the more specific analysis of semantic neighbourhood density, none of the semantic neighbourhood density variables were significant predictors for the production of semantic errors over omissions, close semantic neighbourhood density: χ2(1) = 1.972, p = .160; distant semantic neighbourhood density: χ2(1) = 0.0003, p = .987; category-specific semantic neighbourhood density: χ2(1) = 0, p = .996.

General discussion

In this study we investigated the combined influences of semantic neighbourhood density and semantic neighbourhood similarity (distance). Influences of other important lexical and semantic variables were considered. We found consistent evidence for an influence of semantic neighbourhood similarity (distance) on naming accuracy and also on the production of different lexical-semantic error types. Words with semantically more similar neighbours increased picture-naming errors and resulted therefore in significantly less accurate naming responses and a higher likelihood of producing semantic errors and omissions over accurate responses. However, no significant influence of semantic neighbourhood similarity on naming latency was found—a result that could be explained by the speeded picture naming paradigm and thus the pressure to name pictures with high speed in terms of a ceiling effect. Similar to our study, Mirman (2011) conducted a speeded picture-naming task and found only a significant semantic neighbourhood effect for the accuracy of naming responses but not their speed. One explanation for the sole effect of semantic neighbourhood similarity on naming accuracy is the threshold account by Mirman (2011), according to which participants lower the lexical threshold of words during the speeded picture-naming task in order to generate responses more quickly, leading to partly incomplete and thus more error prone lexical processing.

The lexical and/or semantic origin of semantic errors and omissions in our task was confirmed by the influence of other lexical-semantic variables including familiarity, age of acquisition, and semantic typicality on the production of those errors. In line with previous studies, we found that the probability of producing semantic errors compared with accurate responses increased for words that are less familiar and/or acquired later in life (e.g., Cuetos, Aguado, Izura, & Ellis, 2002; Nickels & Howard, 1995). Similarly, the likelihood of producing omissions over accurate responses increased for words that are less familiar, acquired later in life, and/or less typical exemplars of a semantic category. This confirmed that effects of semantic neighbourhood similarity (distance) unfold at earlier stages of processing including the conceptual (semantic) and lexical level.

In line with our first prediction for lexical-semantic errors, none of the semantic neighbourhood variables determined the production of semantic errors over omissions. Instead, lemma frequency was the only variable that determined whether erroneous naming resulted in a semantic rather than an omission error with high frequency words being more likely to be named with a semantic error than an omission error. In addition to the other (Fig. 3) results for correct responses versus semantic errors and correct responses versus omissions, this can be taken as evidence for both error types (i.e., semantic errors, omissions) being similarly affected by semantic neighbourhood similarity (distance).

Unlike for semantic neighbourhood similarity (distance), we did not find an effect of raw semantic neighbourhood density. That is, picture-naming performance did not change with the total number of semantic neighbours. Furthermore, no interaction was found between raw semantic neighbourhood density and semantic neighbourhood similarity (distance) which indicates that semantic neighbourhood similarity does not modulate the influence of our specific (raw) semantic neighbourhood density measure on naming performance. In other words, we cannot conclude that interference by raw semantic neighbourhood density grows stronger with an increasing similarity of those neighbours. However, possible reasons for the absence of a raw semantic neighbourhood density effect and the missing interaction between raw semantic neighbourhood density and semantic neighbourhood similarity could be ascribed to the continuous nature of the density measure which comprised all semantically related concepts given that they exceeded a similarity of greater than zero. Consequently, a large majority (98.17%; SD = 2.62; range: 84.83%–100%) of semantic neighbours comprised a similarity of less than 0.5 and thus can be described as being semantically less similar (distant) to the target. For example, the target ‘Pfirsich’ (peach) has only ten semantically close neighbours that exceed a similarity of greater than 0.5 (e.g., nectarine, plum, melon), but it has 224 distant neighbours including ‘apple’, ‘potato’, and ‘ball’. While close semantic neighbours, such as ‘nectarine’ share many semantic features with the target (e.g., has a stone, is juicy, is soft, is circular/round, does grow, is edible), more distant neighbours share sometimes only a few or even one semantic feature(s). For example, ‘peach’ and ‘apple’ share the semantic features ‘is juicy’, ‘is circular/round’, ‘does grow’, and ‘is edible’, while ‘peach’ and ‘ball’ only share the semantic feature ‘is circular/round’.

In order to assess whether semantic neighbourhood density can affect naming performance if semantic neighbours exceed a certain degree of semantic similarity a more in-depth analysis of semantic neighbourhood density was run which included the variables close semantic neighbourhood density, distant semantic neighbourhood density and category-specific semantic neighbourhood density. The results of the semantic neighbourhood density analyses revealed significant interference effects of close semantic neighbourhood density on naming latency, accuracy, and naming errors. As predicted, naming difficulties increased with a target’s number of close semantic neighbours leading to longer naming latencies, less accurate naming responses, and a higher likelihood of producing a semantic (coordinate) error or an omission over an accurate response. The significant effect of close semantic neighbourhood density and the absence of an influence of distant semantic neighbourhood density confirmed our assumption that semantic neighbourhood density can influence naming performance given semantic neighbours exceed a certain degree of semantic similarity.

The interfering effect of close semantic neighbourhood density replicates the results of Mirman (2011) in a different language (German instead of English), with a larger set of items from a different semantic database (Devereux et al., 2014). Moreover, we extended results of Mirman (2011) by finding evidence for an effect of close semantic neighbourhood density not only on naming accuracy and semantic errors but also on naming latency and omissions. The results are also compatible with the outcome of Rabovsky et al. (2016), who found similar effects of interference on naming latency and accuracy in picture naming for intercorrelational feature density. Intercorrelational feature density is a different measure of close semantic neighbourhood density which is based on an object’s proportion of feature pairs that co-occur together in other objects. “Objects with a greater proportion of co-occurring features should have nearer neighbours because their features tend to come in groups (i.e., they co-occur), and thus their feature overlap will tend to be greater” (Mirman & Magnuson, 2008, p. 67; for more details, see McRae et al., 2005).

The nonsignificant results for category-specific neighbourhood density show that the influence of close semantic neighbourhood density cannot be explained by taxonomical relatedness only but instead can be ascribed to the number of semantically very similar (close) neighbours. The absence of a category-specific semantic neighbourhood density effect supports findings by Bormann (2011) which did not reveal a category-specific semantic neighbourhood density effect on naming latency and accuracy in a picture naming task with language unimpaired speakers.

How can our results contribute to the current debate about the underlying mechanism of lexical selection? The interfering influence of semantic neighbourhood similarity and close semantic neighbourhood density on naming performance confirmed the predictions of a competitive mechanism of lexical selection. According to most language-production theories, semantic neighbours become coactivated with the target through spreading activation between shared semantic information. In competitive theories, semantic neighbours modulate the ease of selection of the target’s lexical representation. One variant of lexical competition models, the swinging lexical network model by Abdel Rahman and Melinger (2009), was specifically designed to explain semantic neighbourhood effects. The model proposes that semantic neighbours simultaneously induce facilitation through priming at the conceptual level as well as interference through competition at the lexical level. Critically, the measurable net outcome of both effects depends on the number of semantic neighbours (i.e., semantic neighbourhood density) and their similarity (e.g., Rose & Abdel Rahman, 2017). For targets with many semantic neighbours and/or semantically very similar neighbours, lexical interference and therefore competition increases and can outweigh conceptual facilitation. The present study confirmed this prediction by revealing negative effects of semantic neighbourhood density on naming performance. However, a semantic neighbourhood density effect was only found for close, but not for the total number (raw), distant, or category-specific semantic neighbours, suggesting that the number of semantic neighbours require a sufficient degree of semantic similarity to the target in order to induce net interference. Furthermore, the influence of semantic neighbourhood similarity confirms that the more similar or closer semantic neighbours are to the target, the more highly they become coactivated, leading to stronger competition at the lexical level. As discussed in the introduction, possible ways to implement competition is in form of a Luce choice mechanism (e.g., Luce, 1959; Roelofs, 1992), or lateral inhibitory links between semantic neighbours at the lexical level (see Cutting & Ferreira, 1999; Harley, 1993a, 1993b; McClelland & Elman, 1986; McClelland & Rumelhart, 1981; Stemberger, 1985). Taken together, our results confirm the predictions made by models of competitive lexical selection and indicate that semantic neighbourhood density is at least partly modulated by semantic similarity. Activation resonates within the lexical-semantic network, that is, activation of the cohort (lexical and semantic representations of the target and its semantic neighbours) becomes stronger with an increasing number and semantic similarity of a target’s semantic neighbours, which results in more intense lexical competition.

As discussed earlier, noncompetitive models focus on context stimuli and account for interference effects by word distractors blocking the articulators (e.g., Mahon et al., 2007) or in the form of a weakening of previously coactivated but unnamed items (e.g., Oppenheim et al., 2010). These models are therefore agnostic with respect to the present findings because item-inherent variables that may influence lexical competition, rather than contexts, were manipulated. In the following we speculate how an implicit learning mechanism as suggested by Oppenheim et al. (2010) could account for our findings. Instead of competition during the process of lexical selection, Oppenheim et al. (2010) suggest that interference is caused by implicit learning induced by the selection of a lexical representation. Following the selection of a target word, connections between the coactivated representations at the conceptual and lexical level would be readjusted in order to make future retrieval of the target more efficient. In this scenario, the connections between semantic features and the lexical representation of the target would be strengthened to the extent that the target was ‘underactivated’ for sufficient selection, while the connections between the semantic features and the lexical representations of the target’s semantic neighbours would be weakened to the extent that they were ‘overactivated’. How could this account explain interference induced by item-inherent semantic neighbourhood density and similarity? Our speculation is that the present findings could only be explained if additional long-lasting or even hard-wired changes are assumed. If an item is named for the first time, a weakening of many closely related neighbours should not directly affect lexical selection. However, semantic neighbourhood density and distance might lead to more permanent adaptations of the connection weights between concepts and lexical representations and could thus result in generally slower and more error-prone naming of items in close and dense semantic neighbourhoods. Future research is needed to evaluate this speculation.

The immediate negative consequences for semantic neighbours after the retrieval of a target word have been referred to as cumulative semantic interference, whereby naming times increase linearly with the number of previously named categorically (e.g., Belke et al., 2005; Howard, Nickels, Coltheart, & Cole-Virtue, 2006) or associatively related targets (Rose & Abdel Rahman, 2017). Similar to semantic neighbourhood effects, this phenomenon can be accounted for either by an implicit learning mechanism resulting in weakened activation of previously coactivated but unnamed items (Oppenheim et al., 2010) or by competitive lexical selection in combination with an implicit learning mechanism which strengthens activation levels of previously named items (Howard et al., 2006). In case of the former, lexical selection of the target leads to a weakening of the lexical-semantic connections of semantically related representations, such as the ones from members of the same semantic category. In case of the latter, lexical selection of the target leads to a strengthening of the link between the target’s lexical-semantic representations turning the target into a stronger competitor in the next round of lexical processing. Given that our stimuli comprised several members of different semantic categories, we cannot rule out possible effects of cumulative semantic interference. This may interact with our similarity and density manipulations and result in slower naming latencies and/or lower naming accuracy for target words with many closely related conamed items compared to targets with fewer and/or less closely related conamed neighbours. For future studies, we would therefore suggest to control for effects of cumulative semantic interference by using the same number of category-specific (or semantically related) stimuli as targets.

Finally, what are the implications for word-finding difficulties in aphasia that can be derived from our study? Given the similarity between word-finding difficulties and errors produced by language unimpaired speakers in the speeded picture-naming task (e.g., Hodgson & Lambon Ralph, 2008; Kello, 2004; Kello & Plaut, 2010; Mirman, 2011) compared with individuals with aphasia with lexical and/or semantic impairment, semantic neighbourhood similarity and close semantic neighbourhood density are likely to be critical variables that can determine the ease of lexical selection, and thus successful word retrieval in aphasia. We would therefore suggest not only to control for both semantic neighbourhood variables in experimental studies but also to consider them for future assessment and treatment of language impairments. For example, the diagnosis of a language impairment at the lexical level is often based on semantic errors and/or omissions in picture naming in combination with an effect of lexical variables, such as frequency, with aphasic individuals showing more difficulties in naming pictures of low-frequency compared with high-frequency words. Given the lack of strength or even absence of a frequency effect in previous picture-naming studies (e.g., Carroll & White, 2007; Morrison, Ellis, & Quinlan, 1992; Nickels & Howard, 1995), semantic neighbourhood similarity and/or close semantic neighbourhood density might be more reliable variables for diagnostic purposes. Similarly, it might be sensible to structure the treatment of word-finding difficulties for aphasic individuals with semantic and/or lexical impairment hierarchically by considering semantic neighbourhood similarity and the number of close semantic neighbours of treated words. Hence, individuals with aphasia could either start with the treatment of words with semantically less similar neighbours and/or fewer close semantic neighbours and systematically increase the level of difficulty by treating words with semantically more similar neighbours and/or many close semantic neighbours, or do it the other way around following the “complexity account of treatment efficiency” (e.g., Kiran & Thompson, 2003; Thompson, 2007).

Conclusion

The present study investigated how lexical selection in language production is influenced by semantic neighbourhood similarity (distance) and raw semantic neighbourhood density. Using a speeded picture-naming task, we found an effect of semantic neighbourhood similarity on naming accuracy and the produced error types. Naming pictures of targets with semantically more similar neighbours increased naming difficulties by leading to more naming errors, more specifically by increasing the likelihood of producing semantic errors and omissions over accurate responses. No main effect of raw semantic neighbourhood density and no interaction between raw semantic neighbourhood density and semantic neighbourhood similarity (distance) was found. We assessed further whether semantic neighbourhood density can affect naming performance if semantic neighbours exceed a certain degree of semantic similarity with three different density variables: The number of semantically close neighbours, the number of semantically distant neighbours, and the number of category-specific neighbours. The results showed a significant effect of close, but not of distant or category-specific semantic neighbourhood density. Naming pictures of targets with more close semantic neighbours led to longer naming latencies, less accurate responses, and an increased probability of producing semantic errors and omissions over accurate responses. The results show that word-inherent semantic attributes such as the similarity (distance) of semantic neighbours and the number of coactivated close semantic neighbours modulate lexical selection. These results support theories of lexical selection by competition but can probably also be accounted for by noncompetitive theories that incorporate competition in form of an implicit learning mechanism subsequent to lexical selection.

Author note

During the preparation of this paper, Nora Fieder was funded by a postdoctoral stipend of the Berlin School of Mind and Brain. We would like to thank Lyndsey Nickels, Solène Hameau, Leonie Lampe, and David Howard for helpful discussion. We would also like to thank Luke Tudge for statistical advice.