Manipulations of word frequency reveal differences in the processing of morphologically complex and simple words in German

Bronk, Maria; Zwitserlood, Pienie; Bölte, Jens

doi:10.3389/fpsyg.2013.00546

ORIGINAL RESEARCH article

Front. Psychol., 22 August 2013

Sec. Psychology of Language

Volume 4 - 2013 | https://doi.org/10.3389/fpsyg.2013.00546

Maria Bronk*

Pienie Zwitserlood

Jens Bölte

Institute for Psychology, Westfälische Wilhelms-Universität Münster, Münster, Germany

We tested current models of morphological processing in reading with data from four visual lexical decision experiments using German compounds and monomorphemic words. Triplets of two semantically transparent noun-noun compounds and one monomorphemic noun were used in Experiments 1a and 1b. Stimuli within a triplet were matched for full-form frequency. The frequency of the compounds' constituents was varied. The compounds of a triplet shared one constituent, while the frequency of the unshared constituent was either high or low, but always higher than full-form frequency. Reactions were faster to compounds with high-frequency constituents than to compounds with low-frequency constituents, while the latter did not differ from the monomorphemic words. This pattern was not influenced by task difficulty, induced by the type of pseudocompounds used. Pseudocompounds were either created by altering letters of an existing compound (easy pseudocompound, Experiment 1a) or by combining two free morphemes into a non-existing, but morphologically legal, compound (difficult pseudocompound, Experiment 1b). In Experiments 2a and 2b, frequency-matched pairs of semantically opaque noun-noun compounds and simple nouns were tested. In Experiment 2a, with easy pseudocompounds (of the same type as in Experiment 1a), a reaction-time advantage for compounds over monomorphemic words was again observed. This advantage disappeared in Experiment 2b, where difficult pseudocompounds were used. Although a dual-route model might account for the data, the findings are best understood in terms of decomposition of low-frequency complex words prior to lexical access, followed by processing costs due to the recombination of morphemes for meaning access. These processing costs vary as a function of intrinsic factors such as semantic transparency, or external factors such as the difficulty of the experimental task.

Introduction

Is a doorstep a threshold to our mental lexicon, or do we have to go through the door and a step beyond to access this word? On the path from visual input to meaning, are words such as tablecloth, usurping, and premature parsed into their constituent morphemes, or are they stored as full word forms? If they are parsed, does this happen very early (before lexical access), early (during lexical access), or late (after access to full word-forms)? The question whether morphologically complex words are parsed into their constituents has been of scientific interest for a long time (Taft and Forster, 1975) and has not been unequivocally answered yet (cf. Amenta and Crepaldi, 2012). It is known that many factors influence the processing of morphologically complex words, among which are differences between languages, differences depending on the input modality (visual vs. auditory), on the type of morphological complexity (inflection, derivation, compounding), and on features particular to individual words, such as, for example, the degree of semantic transparency. In the research reported here, we investigated the processing of visually presented compounds. Therefore, in what follows, we focus on data and models for reading complex words. Given our emphasis on the visual processing of compounds, presented without context to adult native speakers, we largely restrict our discussion of existing data to these issues.

An important question is whether the processing of all complex words follows the same route, or whether there are different options depending, for example, on frequency of use, or the degree of semantic transparency. Almost four decades ago, Taft and Forster (1975) used verbs in a visual lexical decision task to show that prefixes are stripped off from their stems prior to lexical access. Since then, psycholinguistic research has seen the birth of various models and numerous scientific publications supporting each of them. Taft developed a model (cf. Taft, 2004; Taft and Ardasinski, 2006; Taft and Nguyen-Hoan, 2010) claiming morphological decomposition prior to mental lexicon access for any morphologically complex word, whereas Butterworth (1983) proposed a completely opposite hypothesis, claiming that all known words are stored as full forms in the mental lexicon. Between these extreme positions, other models have emerged. One is the supralexical model of Giraudo and Grainger (2000), which states that all words are retrieved as full forms first, and morphological features are accessible only afterwards. Finally, dual-route approaches such as the Morphological Race Model (MRM) by Schreuder and Baayen (1995), or the Augmented Addressed Morphology Model (AAM) by Caramazza et al. (1988), assume that both full-form access and access via decomposed constituents are possible. In the MRM, the route taken depends on factors such as the frequency of the full word form and its constituents, or semantic transparency. The AAM assumes that lexical access is possible via full forms and via morphological decomposition, with full-form access being the normal (and faster) route for known words, whereas constituent-based access is faster only for words not previously encountered.

Morphological effects have been studied with numerous designs, using derived words (e.g., Longtin and Meunier, 2005; Kuperman et al., 2010), inflected words (e.g., Lehtonen et al., 2007; Leinonen et al., 2009), and compounds (e.g., Fiorentino and Poeppel, 2007; Ji et al., 2011; Juhasz and Berkowitz, 2011). What the diversity in models and data shows is that many questions about morphological complexity are still unresolved, and we hope to shed light on some of them. Here, we concentrate on (visually presented) German compounds, addressing the following questions: Are there morphological effects at the word-form level in reading complex German words? Next, if decomposition takes place, does it come at a processing cost, and if so, what factors influence this cost? There are data from several languages with respect to the first issue. Rastle and colleagues (Rastle et al., 2004; Rastle and Davis, 2008) found significant priming effects in English, with masked visual priming, when the prime was derived from the target (teacher–teach), but also when the relationship was not morphological but rather accidental (e.g., corner–corn). This provides clear evidence for early morphological decomposition of derived words, and even of pseudoderived words, which has been replicated in other languages (for French: Longtin and Meunier, 2005; for Russian: Kazanina et al., 2008).

With respect to compound reading, there is support for morphological decomposition from several languages. Fiorentino and Poeppel (2007) report evidence for morphological decomposition prior to word-form access. Their method features the comparison of morphologically complex words with monomorphemic words, matched with respect to length, frequency, and other linguistic factors. They compared English compounds and simple words, using visual lexical decision with simultaneous MEG registration. Compounds were recognized significantly faster than frequency- and length-matched monomorphemic nouns, and the MEG signal also revealed evidence for early decomposition. Crepaldi et al. (2013) reported that the morphemes moon and honey from transposed-constituent pseudocompounds (*moonhoney) activate the representation of honeymoon. Lemhöfer et al. (2011) showed that orthotactic cues at the morpheme boundaries of Dutch compounds led to faster responses compared to compounds lacking such cues, thus providing evidence for morphemic parsing. But not all evidence speaks in favor of decomposition, and there are noticeable differences between languages. In a review of data on Mandarin Chinese, Dronjic (2011) concluded that full-form access predominates in Chinese compound processing, although recent findings indicate some flexibility (Cui et al., 2013). Bertram and Hyönä (2003) obtained evidence for decomposition for long Finnish compounds, but not consistently for short ones. Hyönä (2012) concludes that short compounds that can be viewed within one fixation are processed along a whole-word route. Longer compounds, especially those with hyphenated morpheme boundaries, encourage decomposition. Finally, and closely related to our own study, Ji et al. (2011) reported processing costs for morphologically complex words, but not in all circumstances. They found shorter lexical-decision times for semantically transparent English compounds than for matched monomorphemic nouns. This advantage disappeared and even turned into a disadvantage for semantically opaque compounds, when the experimental design encouraged decomposition. Ji et al. explain their findings in terms of early morphological decomposition, followed by necessary constituent integration, to gain access to the word's meaning. Depending on various factors (semantic opacity, experimental manipulations), this integration may outweigh the processing advantages due to decomposition, and therefore compound processing may take longer than the processing of a morphologically simpler word.

So, there is evidence in favor of early decomposition (Fiorentino and Poeppel, 2007; Rastle and Davis, 2008), for processing costs later on, and for differences as a function of semantic transparency (Ji et al., 2011). But as is often the case, most of the evidence comes from English, and data from other languages (Finnish, for example) show a different pattern. It is thus important to provide further data, from different languages, to shed light on these issues.

To explore whether early decomposition takes place during the processing of morphologically complex words of German, we used German noun-noun compounds and matched them in length and surface frequency with monomorphemic nouns. We ensured that the constituents of the compounds were always of higher frequency than the full compounds. As has been known for a long time, the frequency of occurrence of a word determines the speed with which it is recognized (cf. Andrews, 1986). The idea is that word frequency is a feature of word forms. Frequent word forms are either accessed before infrequent ones (Forster, 1976), or have higher resting levels of activation (McClelland and Rumelhart, 1981). Frequency can thus be used, and has been used, as a diagnostic tool to address issues of full-form storage and decomposition (e.g., Alegre and Gordon, 1999; Baayen et al., 2010). If morphologically complex words are treated in the same manner as morphologically simple words during word recognition, they should be recognized with a similar latency as monomorphemic words, when matched in overall frequency and length. If, however, morphological decomposition takes place upon lexical access, compounds should be recognized faster than monomorphemic words, because of their more frequent constituents.

There is ample evidence that the frequency of compound constituents plays a role during visual word recognition, for English (cf. Juhasz et al., 2003; Andrews et al., 2004; Wang et al., 2010), Spanish and Basque (Duñabeitia et al., 2007, 2008), Dutch (Kuperman et al., 2009), and Finnish (Hyönä and Pollatsek, 1998; Pollatsek et al., 2000). Effects of constituent frequency are interpreted in favor of decomposition, or for the existence of two routes to visual-word recognition. However, evidence for German remains scarce (cf. Böhl, 2007).

If decomposition takes place, the constituents have to be re-assembled at some point, to distinguish existing compounds, such as doorstep, from ones that do not exist, such as doorwater. This re-assembly may come at the price of extra processing costs, which then might consume any head-start advantage. Thus, given the task and timing, compounds might be recognized as slowly as, or even more slowly than, monomorphemic words due to these re-assembly processes (cf. Taft and Ardasinski, 2006; Ji et al., 2011). The most prominent reason for re-assembly applies inside and outside of the laboratory, and concerns the fact that the meaning of any compound is not the mere sum of the meanings of its constituents. To integrate the meaning of the constituents, their relational structure needs to be retrieved (cf. Gagné and Spalding, 2009). Even for semantically transparent compounds, the relation between modifier and head can vary considerably, as in cheesecake, cupcake, and wedding cake. This is even more relevant for compounds that are semantically opaque, for which the relationship between the meaning of the compound and the meaning of its constituents is opaque or absent (e.g., soap opera; hogwash). As a consequence, some models assume that semantically opaque words, though morphologically complex, are accessed more quickly via their whole-word forms (Schreuder and Baayen, 1995).

Although the results are somewhat mixed (cf. Libben, 1998), there is evidence that (partially) opaque compounds are decomposed, for English (Libben et al., 2003; Frisson et al., 2008), Greek, Polish, French, and Bulgarian (Jarema et al., 1999; Kehayia et al., 1999), Dutch (Zwitserlood, 1994), and Finnish (Pollatsek and Hyönä, 2005). Semantic transparency does have an impact on gaze durations in reading English compounds (Juhasz, 2007) and fully opaque compounds seem to be treated as monomorphemic words (Zwitserlood, 1994; Libben, 1998). We therefore also manipulated the semantic transparency of compounds in separate experiments, to further investigate the influence of semantic transparency on word recognition in compound reading.

In order to assess potential processing benefits and costs, in Experiment 1 we used pairs of semantically transparent compounds with different constituent frequencies, matched in overall frequency and length to monomorphemic words. When decomposition takes place upon lexical access, a larger head start is expected for compounds with high-frequency constituents than for compounds with low-frequency constituents. A lexical decision task with different levels of difficulty was used, to tax potential differences in processing costs. In Experiment 1a, the pseudowords for the lexical decision task were relatively easy to detect: whether monomorphemic (instrament) or complex (doorstip, toirstep), they clearly diverged from existing word forms. Following Taft (Taft, 2004; Taft and Ardasinski, 2006), we expected word decisions to be fast and easy. Frequent words, even complex ones, might be recognized via full-form access in the context of such easy pseudowords (cf. Taft and Ardasinski, 2006). Given the low overall frequency of the compounds, and the much higher frequency of their constituents, we expected access via decomposition even when pseudowords were easy. We thus hoped to tap into a lexical-access advantage due to constituent frequency. An observation of such an advantage would provide evidence for decomposition. Thus, our design should enable us to distinguish between models that feature early decomposition (e.g., Taft and Nguyen-Hoan, 2010), and those that do not (Butterworth, 1983; Giraudo and Grainger, 2001). In Experiment 1b, the pseudocompounds consisted of combinations of two free morphemes that do not make up an existing compound (pianocup, dressfork). To distinguish between existing compounds and pseudocompounds, either lookup of the full form is attempted (albeit without success), or the pseudowords are decomposed, and information is needed from the re-assembly or integration stage, to decide whether the combination of two existing morphemes also exists. But even if constituent integration partially devours the potential advantage relative to matched monomorphemic words, there should still be a difference between compounds with high- or low-frequency constituents.

In Experiments 2a and 2b, we used semantically opaque compounds. The models mentioned above differ in their assumptions concerning semantic transparency. The MRM (Schreuder and Baayen, 1995) assumes that semantically opaque words are processed along a direct route, by the retrieval of their full forms stored in the mental lexicon. This is contrary to Taft's (e.g., Taft and Nguyen-Hoan, 2010) model, which states that all morphologically complex words are parsed, independent of their semantic transparency. To put these assumptions to the test, we used transparent compounds in Experiment 1, and opaque compounds in Experiment 2. If semantically opaque words are retrieved from the mental lexicon as full forms, reaction times (RTs) should not differ between compounds and matched monomorphemic words. If, however, opaque compounds are decomposed into their constituents, we should find faster RTs for compounds, because of the higher constituent frequencies. As in Experiment 1, we used different types of pseudowords in Experiments 2a and 2b, to potentially tax the integration stage, provided that opaque compounds are indeed decomposed. Given the findings of Ji et al. (2011), we might observe different processing costs for semantically transparent and opaque compounds. A joint interpretation of the results of Experiments 1 and 2 should enable us to shed more light on the routes that visually presented morphologically complex words follow as they are processed.

Experiment 1a

Method

Participants

The experiment was conducted with 31 native speakers of German (mean age = 21 years, 27 females) from the Westfälische Wilhelms-Universität Münster who received course credit or money for their participation. All had normal or corrected-to-normal vision. The local ethics committee approved all procedures reported for this and the following experiments. In every experiment reported here, informed consent was obtained from all participants.

Materials

Seventy-two triplets consisting of two German bi-morphemic, semantically transparent noun-noun compounds and one monomorphemic noun, for example, Papierhut (paper hat), Zauberhut (magic hat), Margerite (marguerite), were used. See Table 1 for examples of each stimulus class and an overview of stimulus properties. The Appendix lists the complete stimulus set.

Table 1. Experiments 1a and 1b: Mean word frequency and mean word length in number of letters (SD).

The frequencies of the constituents reported here and hereafter always concern the frequency of these nouns as they occur in isolation, not as a part of any combination. The surface frequency of compounds and simple words within each triplet was matched using the Leipziger Wortschatz–Lexikon (March 2009). In the Leipziger Wortschatz frequency classes can be obtained for each word relative to the frequency of the masculine definite article “der,” which is the most frequent word in German. High values code low frequencies, contrary to common usage. In the following, we will use “high” and “low” frequency as is common, even though the relevant class information is numerically opposite.
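The inverse relation between class value and frequency can be made concrete with a small sketch. It assumes the commonly described base-2 definition of Leipzig frequency classes (a word in class N is roughly 2^N times less frequent than "der"); the text above does not spell out this formula, and the function names and corpus counts below are hypothetical.

```python
import math


def frequency_class(word_count: int, reference_count: int) -> int:
    """Frequency class of a word relative to the most frequent word ("der").

    Assumption: class = floor(log2(reference_count / word_count) + 0.5),
    so higher class values code lower frequencies.
    """
    if word_count <= 0 or reference_count < word_count:
        raise ValueError("counts must satisfy 0 < word_count <= reference_count")
    return math.floor(math.log2(reference_count / word_count) + 0.5)


def frequency_ratio(class_a: int, class_b: int) -> float:
    """How many times more frequent a class_a word is than a class_b word."""
    return 2.0 ** (class_b - class_a)


# Hypothetical counts: the reference word occurs 2**17 times, the target once.
print(frequency_class(1, 2 ** 17))
# A class-9 constituent compared with a class-17 compound.
print(frequency_ratio(9, 17))
```

Under this assumption, a difference of eight frequency classes, about the gap between the high-frequency constituents and the compounds reported below, corresponds to a 256-fold difference in frequency.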

Surface frequency of the triplets ranged from class 14 to 21 (mean frequency class = 17.26, SD = 1.77). Members of a triplet did not differ from each other by more than one frequency class. The compounds of a triplet shared one constituent (–hut/hat in the example mentioned above) either in modifier (50% of the set) or in head position (50% of the set; German compounds are right-headed). Compounds were selected such that the non-shared constituents, such as Papier and Zauber in the example, varied in frequency class. This resulted in a “high” frequency set (constituent mean = 9.07, SD = 0.23) and a “low” frequency set (constituent mean = 13.58, SD = 0.229). Note that the constituents were always more frequent than the compounds (compound mean = 17.26, SD = 1.768). The shared constituent had a mean frequency class of 10.06, SD = 2.31. Simple words had a mean frequency class of 17.29, SD = 0.20. Compounds were thus closely matched with simple words in surface frequency, as well as in word length. Word length ranged from 6 to 11 characters (mean = 8.49, SD = 0.092). Triplet members differed in word length by at most one letter.
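The two matching constraints described above (at most one frequency class and at most one letter of difference within a triplet) can be expressed as a simple check. This is only an illustrative sketch: the function name is hypothetical, and the frequency classes attached to the example words are invented, not taken from the stimulus set.

```python
def is_matched_triplet(triplet, max_class_diff=1, max_length_diff=1):
    """Check the within-triplet matching constraints.

    triplet: list of (word, frequency_class) pairs for the two compounds
    and the monomorphemic noun.
    """
    classes = [freq_class for _, freq_class in triplet]
    lengths = [len(word) for word, _ in triplet]
    return (max(classes) - min(classes) <= max_class_diff
            and max(lengths) - min(lengths) <= max_length_diff)


# Example words from the text; the frequency classes are made up.
example = [("Papierhut", 17), ("Zauberhut", 18), ("Margerite", 17)]
print(is_matched_triplet(example))
```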

Although matching was done on the basis of the Leipziger Wortschatz, the best database available at that time, we checked these frequencies in the new dlex database (Heister et al., 2011), which has a more common “words per million” count. For those items that were also present in dlex (July 2011), the statistics are as follows: The high-frequency compounds had a mean surface frequency of 0.43 (SD = 0.54; range = 0.008–2.23). The mean frequency of the non-shared constituent was 73.4 (SD = 119.56). The low-frequency compounds had a mean surface frequency of 0.43 (SD = 0.75; range = 0.008–4.18). The mean frequency of the non-shared constituent was 6.38 (SD = 8.94). The mean frequency of the shared constituent was 51.14 (SD = 64.13). The simple words had a slightly higher mean surface frequency of 1.03 (SD = 1.51, range = 0.025–8.94).

Seventy-two simple words were added as fillers to the stimulus set, to balance the ratio between compounds and simple words. In addition, 288 pronounceable pseudowords were created by changing one or two vowels or consonants of existing words. There were 144 simple pseudowords (e.g., *Instrumunt), 48 word/pseudoword compounds (e.g., *Weupennest), 48 pseudoword/word compounds (e.g., *Senfsime), and 48 pseudoword/pseudoword compounds (e.g., *Blamentepf). Word length was matched between words and pseudowords. The 72 triplets were evenly distributed across two lists, with the triplets' compounds on different lists. Filler words and pseudowords were evenly distributed. Every participant saw both lists, with eight practice trials placed at the beginning of each. List presentation order was balanced across participants.

Procedure

The participants were tested individually in a quiet room, sitting in front of a 17″ computer screen (CTX 1785 XE). They were instructed to decide as quickly and accurately as possible via button press on the keyboard whether the visually presented stimulus was a word or pseudoword. The stimuli were presented in random order, in black 28 pt Verdana font on a white background. In all experiments, the visual angle of the stimuli was 0.8° vertically, and ranged from 2.8 to 6.6° horizontally (6–12 characters wide). Viewing distance was 75 cm. A fixation cross initiated the trial and was present for 550 ms. A blank screen, following the fixation cross and present for 500 ms, preceded stimulus presentation, which lasted 1000 ms. RT was measured from stimulus onset for maximally 2500 ms. The inter-trial interval (ITI) was set to 650 ms. The NESU system (New Experimental Set Up, Baumann et al., 1992) was used for stimulus presentation and reaction-time measurement. A recording session lasted about 30 min.
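The relation between the reported visual angles and physical stimulus size at the 75 cm viewing distance can be sketched with the standard visual-angle formula. The function names are hypothetical, and the physical sizes computed here are derived for illustration only; they are not reported in the text.

```python
import math


def visual_angle_deg(size_cm: float, distance_cm: float) -> float:
    """Visual angle (degrees) subtended by a stimulus of a given size."""
    return math.degrees(2 * math.atan(size_cm / (2 * distance_cm)))


def size_for_angle_cm(angle_deg: float, distance_cm: float) -> float:
    """Physical size (cm) that subtends a given visual angle."""
    return 2 * distance_cm * math.tan(math.radians(angle_deg) / 2)


VIEWING_DISTANCE_CM = 75  # as reported in the Procedure

# Stimulus height implied by 0.8 degrees, and widths implied by 2.8 and 6.6 degrees.
for angle in (0.8, 2.8, 6.6):
    print(angle, round(size_for_angle_cm(angle, VIEWING_DISTANCE_CM), 2))
```

At this distance, a vertical angle of 0.8° corresponds to a stimulus height of roughly 1 cm.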

Results

Two participants and 9 triplets were discarded from further analyses due to high error rates (more than 20% errors for participants, more than 40% errors in a condition). Because the frequency- and length-matching was done triplet-wise, the stimulus properties are not affected by the triplet-wise exclusion of stimuli. The matching holds, even if many triplets are excluded. After exclusion, the total error rate was 7.1%. See Table 2 for mean RTs and error rates of the different stimulus classes.

Table 2. Experiments 1a and 1b: Errors in percentages and mean RT in ms (SD).

For the analysis of the RT data, trimmed means (5%) per condition averaged over participants (F₁) and items (F₂) served as dependent variable. First, we determined whether Frequency (high, low) and Position of Shared Constituent (modifier, head) interacted with each other. Neither the F₁ (Two-Way repeated measures ANOVA) nor the F₂ (Two-Way ANOVA) showed a significant interaction, all Fs < 1. Therefore, the factor Position of Shared Constituent was dropped from further analyses.
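The aggregation step described above can be sketched as follows. This is a minimal illustration, not the authors' analysis script: it assumes that "5%" means trimming 5% of the observations from each tail of a cell, and the trial format (dicts with `participant`, `item`, `condition`, and `rt` keys) is hypothetical.

```python
from collections import defaultdict
from statistics import mean


def trimmed_mean(values, proportion=0.05):
    """Mean after dropping `proportion` of the values from each tail (assumption)."""
    ordered = sorted(values)
    k = int(len(ordered) * proportion)  # number of values cut per tail
    return mean(ordered[k:len(ordered) - k])


def condition_means(trials, unit):
    """Trimmed mean RT per (unit, condition) cell.

    unit is "participant" for the F1 analysis or "item" for the F2 analysis.
    """
    cells = defaultdict(list)
    for trial in trials:
        cells[(trial[unit], trial["condition"])].append(trial["rt"])
    return {cell: trimmed_mean(rts) for cell, rts in cells.items()}


# Hypothetical trials for illustration only.
trials = [
    {"participant": "p1", "item": "Papierhut", "condition": "high", "rt": 610},
    {"participant": "p1", "item": "Margerite", "condition": "simple", "rt": 650},
    {"participant": "p2", "item": "Papierhut", "condition": "high", "rt": 590},
]
by_participant = condition_means(trials, "participant")  # input to F1
by_item = condition_means(trials, "item")                # input to F2
```

Trimming limits the influence of extreme latencies on each cell mean before the ANOVAs are computed.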

A One-Way repeated measures ANOVA (F₁) and One-Way ANOVA (F₂) using the factor Word Type (compounds with high-frequency constituent, compounds with low-frequency constituent, simple words) yielded a significant main effect [F₁(2, 56) = 21.439, p < 0.001, GG = 0.864, partial η² = 0.434; F₂(2, 186) = 7.228, p = 0.001, partial η² = 0.072]. Subsequently, planned t-tests revealed that participants responded significantly faster (30 ms) to compounds with a high-frequency constituent than to morphologically simple words: t₁(28) = 5.772, p < 0.001 two-tailed, d = 0.30; t₂(62) = 3.740, p < 0.001 two-tailed, d = 0.65. They also responded significantly faster (29 ms) to compounds with a high-frequency constituent than to compounds with a low-frequency constituent: t₁(28) = 6.883, p < 0.001 two-tailed, d = 0.26; t₂(62) = 4.184, p < 0.001 two-tailed, d = 0.61. The mean RT to compounds with a low-frequency constituent did not differ significantly from the RT of morphologically simple words: all ts < 1.
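The degrees of freedom reported above follow from the exclusions described earlier (31 − 2 participants, 72 − 9 triplets). The sketch below reproduces this arithmetic; it assumes that Word Type is a within-participant factor in the F₁ analysis and a between-items factor in the F₂ analysis, and that the t-tests are paired over participants and over triplets. The function name is hypothetical.

```python
def design_dfs(n_participants: int, n_triplets: int, n_conditions: int = 3) -> dict:
    """Degrees of freedom for the by-participant and by-item analyses."""
    effect_df = n_conditions - 1
    return {
        # Repeated measures ANOVA over participants.
        "F1": (effect_df, effect_df * (n_participants - 1)),
        # One-way ANOVA over items, one item per condition and triplet.
        "F2": (effect_df, n_conditions * n_triplets - n_conditions),
        # Paired comparisons over participants and over triplets.
        "t1": n_participants - 1,
        "t2": n_triplets - 1,
    }


dfs = design_dfs(n_participants=31 - 2, n_triplets=72 - 9)
print(dfs)
```

With 29 participants and 63 triplets, these formulas yield the values given in the text: (2, 56), (2, 186), 28, and 62.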

Discussion

So far, our data provide evidence for morphological parsing. Latencies for compounds with a high-frequency constituent were shorter than for compounds with a low-frequency constituent and for monomorphemic words, although surface frequency was matched. This advantage due to constituent frequency can only be explained by access to the constituents, and thus by decomposition. RTs to monomorphemic words and to compounds with lower-frequency constituents were almost identical. This pattern would fit well with full-form access for those compounds. But note that the constituents of the low-frequency compounds were still far more frequent than the matched overall frequency of the compound and the simple word. The overall very low frequency of the compound, relative to the frequency of its constituents, should invite decomposition. But if decomposition and reassembly had come without costs, we would have expected faster responses even for the low-frequency compounds, compared to the monomorphemic words. To investigate this further, we made the task more difficult in the next experiment. This was achieved by using pseudowords made up from existing constituents. These can be distinguished from existing words by lookup (checking the lexicon for the existence of a full form) or by checking the existence of the combination after their constituents have been recognized. Taft and colleagues have shown that the inclusion of such pseudowords clearly invites decomposition (Taft, 2004; Taft and Ardasinski, 2006). This would tax the re-assembly stage more heavily than is necessary in the context of pseudowords with nonce stems. As a consequence, the advantage over monomorphemic words due to constituent frequency might be eliminated.