218 36 6MB
English Pages [259] Year 2013
The Metalinguistic Dimension in Instructed Second Language Learning
Advances in Instructed Second Language Acquisition Research Series Editor: Alessandro Benati, University of Greenwich, UK Editorial Board: James F. Lee, University of New South Wales, Australia Florence Myles, University of Essex, UK Emma Marsden, University of York, UK María del Pilar García Mayo, Universidad del País Vasco, Spain Cristina Sanz, Georgetown University, USA Teresa Cadierno, Syddansk Universitet, Denmark The mission of this series is to publish new theoretical insights in instructed second language acquisition research that advance our understanding of how languages are learned and should be taught. Research in instructed SLA has addressed questions related to the degree to which any form of external manipulation (e.g. grammar instruction, input manipulation, etc . . .) can affect language development. The main purpose of research in instructed second language acquisition is to establish how classroom language learning takes place, and how an understanding of second language acquisition contributes to language teaching. Despite the clear relationship between theory and research in SLA, and language practice, there are still very few cross-references between these areas. This series will publish research in instructed SLA that bridges this gap and provide academics with a set of theoretical principles for language teaching and acquisition. The calibre of research will inspire scholars and practitioners to learn more about acquisition and to reflect on their language teaching practices more generally. Other Titles in the Series: The Developmental Dimension in Instructed Second Language Learning, Paul A. Malovrh and James F. Lee The Grammar Dimension in Instructed Second Language Learning, edited by María J. Arche and Cécile Laval Task Sequencing and Instructed Second Language Learning, edited by Melissa Baralt, Roger Gilabert and Peter Robinson
The Metalinguistic Dimension in Instructed Second Language Learning
Edited by Karen Roehr and Gabriela Adela Gánem-Gutiérrez
L ON DON • N E W DE L H I • N E W Y OR K • SY DN EY
Bloomsbury Academic An imprint of Bloomsbury Publishing Plc
50 Bedford Square London WC1B 3DP UK
1385 Broadway New York NY 10018 USA
www.bloomsbury.com First published 2013 © Karen Roehr, Gabriela Adela Gánem-Gutiérrez and Contributors, 2013 All rights reserved. No part of this publication may be reproduced or transmitted in any form or by any means, electronic or mechanical, including photocopying, recording, or any information storage or retrieval system, without prior permission in writing from the publishers. No responsibility for loss caused to any individual or organization acting on or refraining from action as a result of the material in this publication can be accepted by Bloomsbury Academic or the author. British Library Cataloguing-in-Publication Data A catalogue record for this book is available from the British Library. eISBN: 978-1-4411-5833-8 Library of Congress Cataloging-in-Publication Data The metalinguistic dimension in instructed second language learning / Edited by Karen Roehr and Gabriela Adela Gánem-Gutiérrez. pages cm. – (Advances in Instructed Second Language Aquisition Research) Includes bibliographical references and index. ISBN 978-1-4411-6089-8 (hardcover) – ISBN 978-1-4411-5833-8 (pdf) – ISBN 978-1-4411-6856-6 (epub) 1. Second language acquisition–Study and teaching. I. Roehr, Karen. P118.2.M48 2013 418.0071–dc23 2013008911
Typeset by Newgen Imaging Systems Pvt Ltd, Chennai, India
Contents Contributors
vii
Foreword
ix
Introduction
1
1 The Metalinguistic Dimension in Instructed Second Language Learning Karen Roehr and Gabriela Adela Gánem-Gutiérrez
Part 1 Cognitive Approaches to the Metalinguistic Dimension
3 13
2 Developing a Measure of Metalinguistic Awareness for Children Aged 8–11 Angela Tellier
15
3 Examining the Contribution of Syntactic and Metasyntactic Abilities to Reading Comprehension among Native and Non-Native Speakers Daphnée Simard, Denis Foucambert and Marie Labelle
45
4 Effects of Instruction on Learners’ Acquisition of Metalinguistic Knowledge Rosemary Erlam
71
5 University-Level Learners’ Beliefs about Metalinguistic Knowledge Benjaporn Thepseenu and Karen Roehr
95
6 Metalinguistic Awareness in Third Language Phonological Acquisition Magdalena Wrembel
Part 2 Sociocognitive Approaches to the Metalinguistic Dimension
119 145
7 Metalinguistic Knowledge in Language-Related Episodes (LREs) with Covert Metalinguistic Activity Xavier Gutiérrez
147
8 The Role of Metalanguage in the Performance of a Sequence of Collaborative Output Tasks by Five L2 Learner Dyads Alan Fortune
171
9 Mediating the Development of L2 Oral Performance through Dynamic Assessment: Focusing on the Metalinguistic Dimension Gabriela Adela Gánem-Gutiérrez and Cristina Nogués Meléndez
195
Contents
vi
10 The Being and Becoming of Metalinguistic Knowledge: Rules and Categories of Grammatical Description as Functional Tools of the Mind Eduardo Negueruela Azarola Index
221 243
Contributors Rosemary Erlam is a Senior Lecturer in the Department of Applied Language Studies and Linguistics at the University of Auckland, New Zealand. One relevant research interest is the area of language assessment and the use of measures of metalinguistic knowledge to ascertain the level of language knowledge of teacher trainees. Alan Fortune is a Senior Lecturer in the Department of Education and Professional Studies at King’s College London. His research interests include task-based learning; the teaching and learning of L2 grammar; classroom SLA, particularly collaborative learning processes; L2 materials development. Denis Foucambert is a Professor of Psycholinguistics at the Université du Québec à Montréal. His research focuses on the syntactic processes in reading abilities as well as the observation of writing processes in textual genetics and psycholinguistic frameworks. Gabriela Adela Gánem-Gutiérrez is a Senior Lecturer in Applied Linguistics in the Department of Language and Linguistics at the University of Essex. Her main research interests include explicit L2 learning, sociocultural theory and CALL. More specifically, she is interested in collaborative activity; L2 pedagogical tasks; and feedback and scaffolding. Xavier Gutiérrez is an Assistant Professor in the Department of Languages, Literatures and Cultures at the University of Windsor, Canada, where he teaches courses in general and applied linguistics, as well as Spanish language courses. His main research interests include metalinguistic activity and the development of knowledge representations of language. Marie Labelle is a Professor of Linguistics at the Université du Québec à Montréal. She specializes in first language acquisition as well as in French syntax and semantics. She co-authored the Canadian French adaptation of the CELF (Clinical Evaluation of Language Fundamentals). She recently participated in research on the comparison between school-aged native and non-native speakers of French. Eduardo Negueruela Azarola is an Associate Professor of Spanish Second Language Acquisition and director of the Spanish Basic Language Program at the University of Miami. His research is on the areas of sociocultural psychology and second language teaching and learning, Spanish second language acquisition and technology-enhanced language learning. Cristina Nogués Meléndez is a Lecturer in English as a Foreign Language in the Department of Applied Linguistics at Universitat Politècnica de València, Spain. Her
viii
Contributors
research interests include task-based methodology and second language acquisition. More specifically, she is interested in collaborative learning activities and oral language instruction and assessment. Karen Roehr is a Senior Lecturer in the Department of Language and Linguistics at the University of Essex. Her research interests include explicit and implicit knowledge in L2 learning, the role of individual learner differences in L2 learning and usage-based approaches to language learning and teaching. Daphnée Simard is a Professor of Second Language Acquisition at Université du Québec à Montréal. Her research interests are twofold. First, she investigates the relationship between metalinguistic, second language acquisition and individual variables such as attentional capacity and phonological memory span. Second, she examines the role of attention in second language acquisition. Angela Tellier is a PhD candidate in the Department of Languages and Linguistics at the University of Essex. Her research interests include metalinguistic awareness and language aptitude in child L2 learning, the role of metalinguistic instruction in the acquisition of explicit/implicit knowledge and the development of language readiness in child L2 learners. Benjaporn Thepseenu is a Lecturer of English at the University of Phayao, Thailand. Currently she is a PhD candidate in ELT in the Department of Language and Linguistics at the University of Essex. Her research interests include metalinguistic knowledge in L2 learning and the role of metacognition in EFL reading. Magdalena Wrembel is an Assistant Professor at the Faculty of English, Adam Mickiewicz University, Poznań, Poland. Her main research areas involve SLA of speech, phonological acquisition of a third language, language awareness and innovative trends in pronunciation pedagogy, on which she has published several articles, both in international journals and edited collections.
Foreword In the present book, Karen Roehr and Gabriela Adela Gánem-Gutiérrez bring together a number of studies and original research investigating metalinguistic knowledge/ awareness. They provide an excellent introduction and an up-to-date position on the concept and role of metalinguistic knowledge/awareness in second language learning. The volume is divided into two main parts. Part 1 offers contributions focusing particularly on cognitive approaches to the metalinguistic dimension. Part 2, instead, focuses specifically on sociocognitive approaches. In both parts, the contributors have successfully addressed a number of issues concerning the characteristics and nature of metalinguistic knowledge/awareness, and have presented and discussed a series of important empirical findings within this research framework. The editors and the contributors have produced a solid volume that will be beneficial for both researchers and practitioners in the field of instructed second language learning and language teaching. We thank them for their excellent contribution to this research field, and we are very pleased to add this edited book to the Series in Advances in Instructed Second Language Acquisition Research. Alessandro G. Benati Series Editor Advances in Instructed Second Language Acquisition Research
Introduction
1
The Metalinguistic Dimension in Instructed Second Language Learning Karen Roehr
University of Essex
Gabriela Adela Gánem-Gutiérrez University of Essex
This volume brings together original research on the metalinguistic dimension in instructed second language (L2) learning. In the broadest sense, the metalinguistic dimension refers to how individuals bring to bear knowledge about language in their learning of the L2. In the field of applied linguistics more generally and L2 learning and teaching more specifically, it is widely accepted that L2 learners develop and use knowledge about language when engaging with the challenging task of acquiring a new language; this applies to both children and adults, as well as to the learning of L2, L3, L4 or Ln. Accordingly, the term L2 learning is used here to refer to the learning of any language beyond the L1, i.e. it includes the learning of L3, L4, Ln. Conceptualizations of the metalinguistic dimension in L2 learning vary, depending on the theoretical approaches employed by researchers. Moreover, findings about the role of the metalinguistic dimension in instructed L2 learning are not necessarily always homogeneous or compatible. Thus, there is still ample scope for further research into this important aspect of the discipline. The chapters in this volume explore the conceptualization of the metalinguistic dimension in instructed L2 learning by considering the nature of metalinguistic awareness and/or metalinguistic knowledge, how such awareness/knowledge can be measured, and how it may be used by learners of different ages, from different backgrounds and in different types of instructional contexts. Findings provide implications for L2 learning research as well as for L2 instruction in the classroom. The volume is organized into two parts: Part 1 focuses on cognitive approaches to the metalinguistic dimension, while Part 2 focuses on sociocognitive approaches to the metalinguistic dimension. Part 1 comprises contributions by Tellier (Chapter 2), Simard, Foucambert and Labelle (Chapter 3), Erlam (Chapter 4), Thepseenu and Roehr (Chapter 5) and Wrembel (Chapter 6). Chapters 2 and 3 focus on child learners, while Chapters 3, 4 and 5 are concerned with adult learners.
4
The Metalinguistic Dimension
In Chapter 2, Tellier conceptualizes the metalinguistic dimension in terms of metalinguistic awareness, defined as the ability to make use of explicit knowledge about language (Ellis, 2004). Following Bialystok and Ryan (1985, p. 230), Tellier argues that the term metalinguistic ‘applies not to a specific mental accomplishment, but rather to a set of problems which share certain features’. Specifically, metalinguistic problems or tasks make high demands on an individual’s ability to analyse, manipulate and explain their linguistic knowledge (analysis of knowledge), as well as on an individual’s ability to direct their attention to select whatever is immediately relevant to the task, while suppressing focus on anything not relevant (control of processing) (Bialystok, 1988; 1994). Typically, this is achieved by tasks which require the individual to attend to and manipulate language form (Jessner, 2006; Svalberg, 2007). Tellier describes the development of a written test of metalinguistic awareness for English-speaking children aged 8 to 11. The test requires children to solve a series of language-related tasks based on both natural and constructed languages. The researcher reports results from several studies which investigated the performance of children from a number of schools; her findings suggest that the newly designed measure constitutes a generally valid and reliable tool for assessing the metalinguistic awareness of instructed English-speaking children. The descriptive statistics show a good distribution of scores, with no obvious indication of a ceiling effect. Findings further point towards an overall age-based advantage, with older, more cognitively mature children outperforming younger children, as well as an apparent advantage for bilingual children that increases with age. These performance patters are in keeping with existing research and thus expected. Results also show a statistically significant difference in overall performance between boys and girls, with girls outperforming boys. In addition to quantitative results, Tellier reports qualitative findings from a series of verbal protocols which yield interesting insights into children’s thought processes while resolving metalinguistic tasks. These findings constitute further evidence in support of the validity of the newly developed measure. Also working with child learners, Simard, Foucambert and Labelle, in Chapter 3, conceptualize the metalinguistic dimension in a way that corresponds closely to Tellier’s approach. Following Gombert (1992), they define metalinguistic ability as a learner’s ability to consciously and intentionally manipulate language features. They likewise draw on Bialystok’s (2001) model of analysis of knowledge and control of processing, with metalinguistic abilities requiring high levels of both. Simard, Foucambert and Labelle’s study aims to identify the contributions of metasyntactic and syntactic abilities to reading comprehension in L1 and L2 speakers of French aged 8 to 12. The L2 children in this study are heritage language learners whose L1 is Portuguese, but who are educated in French-Canadian schools. Simard, Foucambert and Labelle regard metasyntactic ability as a sub-component of the broader construct of metalinguistic ability. Metasyntactic ability is defined as the ability to reflect on syntactic aspects of language, including words, grammatical class and rules governing the combination of linguistic units (Gaux and Gombert, 1999), as well as the ability to intentionally control and manipulate syntactic units (e.g. Bowey, 1986, 2005; Gombert, 1992). By contrast, syntactic ability does not have any explicit dimension; it refers to the ability to decode language implicitly by parsing phrases and sentences.
Introduction to the Metalinguistic Dimension
5
While syntactic ability is operationalized by means of a sentence-preference task and a sentence-repetition task, the metasyntactic measures used in the study require participants to either repeat or produce ungrammatical utterances. The authors argue that these tasks require high levels of attentional control; moreover, the production task additionally requires high levels of analysis, thus exhibiting the characteristics of tasks that tap metalinguistic ability. Using structural equation modelling as their main statistical technique, Simard, Foucambert and Labelle establish that metasyntactic ability as operationalized in their study contributes significantly to reading comprehension in both L1 and L2 speakers of French. Conversely, the contribution of syntactic ability to reading comprehension is mediated by metasyntactic ability and is thus only indirect. Overall, the statistical model of reading comprehension shows broadly similar patterns for the L1 and L2 groups. However, the direct contribution of metasyntactic ability and the indirect contribution of syntactic ability are found to carry more weight in the reading comprehension model for the L1 children than in the model for the L2 children. In Chapter 4, Erlam conceptualizes the metalinguistic dimension in terms of a learner’s explicit knowledge about the syntactic, morphological, lexical, phonological and pragmatic features of an L2 (Roehr and Gánem-Gutiérrez, 2009). Explicit knowledge is knowledge that is analysed and requires deliberate focus. Moreover, learners know when they are drawing on such knowledge, e.g. when making judgements about the grammaticality of sentences or when editing writing (Elder, 2009). In the context of her study, Erlam regards metalingual competence (Berry, 2004) as a subcomponent of metalinguistic knowledge. Metalingual competence is the ability to acquire and employ metalanguage, that is, any language that is used to talk or write about language (Berry, 2005). Erlam argues that metalanguage is not restricted to technical linguistic or grammatical terminology because it is possible to talk about language without using such terminology. In her investigation, Erlam operationalizes metalinguistic knowledge as the ability to accurately perform grammaticality judgements of ungrammatical sentences (though not of grammatical sentences; see Ellis, 2005) and to provide appropriate corrections, as well as the ability to correct highlighted errors and provide pedagogical grammar rules explaining why a correction was necessary. Metalingual competence is assessed through the number of types of relevant metalinguistic terms used by the learners when providing their rule explanations. Erlam’s study examines the amenability to explicit instruction of the English indefinite article to mark generic reference. Adult learners of L2 English from predominantly Asian L1 backgrounds are exposed to explicit instruction and language practice activities involving the targeted linguistic feature. In a pre-test/post-test/ delayed post-test design, their learning is assessed in terms of gains in metalinguistic knowledge. Erlam reports that both input-based and output-based instruction had a significant impact on learners’ metalinguistic knowledge about the targeted L2 structure, and in some cases gains are sustained over time. In addition, following the instructional treatment, performance on an oral elicited imitation test, a measure of implicit knowledge of the targeted structure, is found to be moderately correlated with the ability to correct sentences that were accurately judged to be ungrammatical.
6
The Metalinguistic Dimension
In Chapter 5, Thepseenu and Roehr take a similar approach to Erlam in their conceptualization of the metalinguistic dimension. Metalinguistic knowledge is defined as a learner’s explicit knowledge about the language that is being learned (Alderson et al., 1997; Elder and Manwaring, 2004; Elder et al., 1999); explicit knowledge is knowledge that is represented declaratively, can be brought into conscious awareness and is potentially available for verbal report (Hulstijn, 2005; Roehr, 2008). Thepseenu and Roehr present an investigation of L1 Thai university-level learners’ beliefs about metalinguistic knowledge as well as their actual metalinguistic performance with regard to L2 English. Through questionnaires and interviews, participants’ beliefs about the explicit teaching and learning of grammar in general and their beliefs about the difficulty and usefulness of instruction on selected aspects of English grammar are identified. In addition, participants’ metalinguistic knowledge about the selected grammar points is assessed. Metalinguistic knowledge is operationalized as participants’ ability to correct highlighted errors and provide metalinguistic explanations in the form of pedagogical grammar rules. Metalinguistic explanations are scored for accuracy and the use of appropriate metalanguage. Thepseenu and Roehr identify correspondences between learners’ beliefs and their actual metalinguistic knowledge, with greater perceived difficulty and less successful test performance going hand in hand. Moreover, findings suggest that participants’ belief systems are complex constructs. On the one hand, participants hold many positive beliefs about the explicit teaching and learning of grammar; studying grammar is deemed important for mastering the L2, for reading and writing in the L2, for accurate use of the L2 and for self-correction. On the other hand, negative beliefs are focused above all on the potential complexities of grammar rules, the potential for confusion and the challenges of memorizing a large number of rules. In addition, learners report both positive and negative beliefs about the usefulness of grammar for communicating in the L2. The researchers argue that these complex beliefs indicate differentiated rather than contradictory views. Finally, learner confidence emerges as an important qualitative theme in the data. Also working with university-level learners, Wrembel, in Chapter 6, conceptualizes the metalinguistic dimension in terms of metalinguistic awareness, that is, ‘the ability to focus attention on language as an object in itself or to think abstractly about language and, consequently, to play with or manipulate language’ (Jessner, 2006, p. 42). Thus, Wrembel’s conceptualization of the metalinguistic dimension is comparable to Tellier’s and Simard, Foucambert and Labelle’s approach. In her study, Wrembel focuses on adult learners’ metalinguistic awareness with regard to L3 phonology. Taking a primarily qualitative approach, participants’ metalinguistic awareness is investigated by means of verbal protocols. The participants are L1 Polish speakers learning L2 English and either L3 French or L3 German. The learners read aloud a short text in their respective L3 and subsequently listen to a recording of their own speech. They are then asked to improve and comment on their own L3 pronunciation. Finally, the participants take part in semi-structured interviews in which they reflect on the L3 acquisition process. Wrembel identifies generally high levels of metalinguistic awareness in her multilingual participants. At the same time, there is evidence of both quantitative and qualitative differences in metalinguistic
Introduction to the Metalinguistic Dimension
7
self-reflection on L3 performance, intentional focus on articulatory gestures and metalinguistic awareness of problems in L3 pronunciation. Moreover, findings suggest that L2 phonological transfer appears to override L1 phonological transfer, especially in the early stages of L3 acquisition, with the majority of participants reporting L2-to-L3 rather than L1-to-L3 cross-linguistic influence. This pattern is particularly pronounced in learners of L3 German, i.e. learners whose L2 and L3 are typologically closely related. Interestingly, Wrembel finds no conclusive evidence for a relationship between level of metalinguistic awareness and level of L3 proficiency. Part 2 comprises contributions by Gutiérrez (Chapter 7), Fortune (Chapter 8), Gánem-Gutiérrez and Nogués Meléndez (Chapter 9) and Negueruela Azarola (Chapter 10). All the chapters in Part 2 focus on the metalinguistic dimension in the L2 adult context. Both Chapters 7 and 8 use the construct of the language-related episode (LRE; Swain, 1998) to study covert and overt metalinguistic activity respectively, and both studies are concerned with metalinguistic knowledge brought about during collaborative activity. These two chapters usefully complement each other in their distinctive efforts to better understand the increasingly popular construct of LRE, particularly in studies of dyadic interaction. The last two chapters are grounded in sociocultural theory. Chapter 9 by Gánem-Gutiérrez and Nogués Meléndez provides a much broader view of the metalinguistic dimension, which is explored through, and embedded in, a pedagogical treatment based on dynamic assessment. Finally, Part 2 concludes with a theoretically oriented chapter by Negueruela Azarola (Chapter 10) which is firmly rooted in Vygotskian thought and, like Chapter 9, explicitly outlines pedagogical suggestions for the development of L2 metalinguistic knowledge. Based on Swain’s (1998) construct of the LRE, Gutiérrez, in Chapter 7, explores the nature of knowledge representations underlying LREs where there is no metatalk, i.e. where there is no overt discussion of the language foci at the centre of the episode. This chapter builds on the notion that LREs, episodes where L2 learners ‘talk about the language they are producing, question their language use, or other- or self-correct’ (Swain, 1998, p. 70), represent a window into L2 learners’ metalinguistic knowledge, that is, explicit knowledge about language (Ellis, 2004). In his study, Gutiérrez sets out to investigate the hypothesis that covert reflection on language, that is, covert metalinguistic activity, operationalized as text reformulations in a written task, might constitute metalinguistic knowledge. His study builds on existing work on LREs both theoretically, as explained above, and methodologically by resorting to immediate recall as a data collection method of choice. The study involved 27 adult mixed-L1 students of Spanish as a foreign language in Canada. Participants worked in triads to write a picture-based story with the researcher present. When text reformulations on which no comment was made took place, the researcher prompted participants to elaborate on the reasons for their textual change in order to tap into covert metalinguistic activity. The data analysis reports on three aspects of the LREs scrutinized: the type of knowledge representations behind the episodes, their linguistic foci, and whether or not the episodes included technical metalanguage. Based on a combination of quantitative and qualitative analysis, the results demonstrate that, as hypothesized by the author, the majority of LREs where no metatalk is originally present do appear to rely on metalinguistic knowledge.
8
The Metalinguistic Dimension
This chapter represents a valuable contribution to our understanding of the LRE as a construct and to our understanding of metalinguistic activity which might be at play when learners focus on language, for instance as a result of revisions to their written work. Gutiérrez’s chapter also offers interesting reflections on immediate recall as an instrument for data collection in studies of metalinguistic activity in the L2 context. In Chapter 8, Fortune also operationalizes metalinguistic activity through Swain’s (1998, p. 70) construct of the language-related episode (LRE). Fortune’s main goal is to contribute to our understanding of the construct by building on previous LRE taxonomies such as Kowal and Swain (1994), Swain (1998), Swain and Lapkin (2001), Fortune and Thorp (2001) and Leeser (2004) in order to work towards a more comprehensive and detailed framework for the classification of LREs emergent in collaborative activity. To this end, he presents an analysis of dyadic interaction of advanced L2 English learners at university level. Drawing on full transcriptions of the students’ interactions while carrying out a series of dictogloss and grammar dictation (Wajnryb, 1990) tasks as well as stimulated recall, he presents a taxonomy which distinguishes three main levels of metalanguage use and six categories differentiating the linguistic focus of LREs. Fortune’s analysis involves inter-group and intra-group comparisons. The former are based on a comparison of metalinguistic activity between intermediate students in Fortune and Thorp (2001) and the advanced students in the present study, while the latter involve comparisons regarding the metalinguistic activity of the five dyads in the advanced group. Fortune concludes that advanced learners both attend to form more often than intermediate students and make use of metalanguage to a greater extent. There are also interesting differences in the metalinguistic activity of individual dyads, where the use of metalinguistic terminology and allusion to grammar rules by individuals, regardless of accuracy, seem to be perceived as a status factor reflecting expertise. In other words, individuals who use metalinguistic terms and grammar rules to a greater extent understandably tend to dominate the outcome of LREs, an observation which reminds us of the importance of helping L2 learners acquire accurate metalinguistic knowledge. Gánem-Gutiérrez and Nogués Meléndez, in Chapter 9, focus on metalinguistic awareness as a key component of a pedagogical programme framed within dynamic assessment (DA). Their conceptualization of the metalinguistic dimension in L2 learning and teaching follows Jessner’s (2006, p. 42) view, that is, metalinguistic awareness is seen as the ability to reflect on language as an object of study. They operationalize this as the students’ ability to notice (Schmidt, 2001) and correct language errors as well as the ability to write and talk about their language to other students and the teacher. Dynamic assessment is a pedagogical approach based on sociocultural theory (Lantolf and Thorne, 2006) which aims at actively mediating L2 development while assessing learners and responding to either or both their individual and collective needs. Gánem-Gutiérrez and Nogués Meléndez’s longitudinal study took place in an intact EFL class where the teacher, who was also one of the researchers, enabled the programme as a means of helping the 30 participants improve their overall L2 oral ability. The chapter provides a detailed account of the pedagogical treatment (DA) in an attempt to fill a gap in the literature, given that little DA research has been
Introduction to the Metalinguistic Dimension
9
carried out in the L2 context. At the heart of the treatment was a suite of tasks and procedures to facilitate the co-construction of zones of proximal development where the metalinguistic dimension played a crucial role. The study relied on quantitative and qualitative analyses to ascertain the benefits observed in the participants at the end of the academic cycle where the programme was implemented. A pre-test/ post-test research design enabled the researchers to carry out statistical tests which showed improvement in L2 oral performance. The results also demonstrate that the treatment succeeded in addressing the students’ emerging language capacities on a wide range of morphosyntactic, lexical and discourse features. The metalinguistic awareness component in the study was facilitated through reflective tasks based on video recordings of students’ oral tasks. Chapter 10 also offers a view of the metalinguistic dimension rooted in sociocultural theory (Lantolf and Thorne, 2006). The main aim of this theoretically oriented chapter is to argue for an approach to pedagogy where explicit knowledge about the L2 is in the foreground, but which moves away from the teaching of grammar as a series of discrete linguistic features and rules. Negueruela Azarola takes the view that complex grammatical notions such as aspect, tense and modality should be taught through full conceptual definitions materialized by means of reflective tasks to enable L2 learners to articulate their thinking. His argument focuses on the concept of aspect in Spanish, specifically the use of preterite versus imperfect, and suggests that if learners become aware of their grammatical choices to express the specific meanings intended, metalinguistic knowledge can potentially become functional knowledge. Functional knowledge, Negueruela Azarola highlights, refers to knowledge which merges communication and conceptualization for verbal thinking. Ultimately, he argues that functional knowledge about language could be potentially relevant for L2 development. L2 development from a sociocultural perspective becomes evident through the increasing ability to exercise conceptual control over our communicative choices and can be historically studied and traced through microgenetic analysis. This analysis, which investigates language learning instances as observed in short periods of time and aims to understand certain aspects of mental functioning, is ‘analysis that returns to the source and reconstructs all the points in the development of a given structure’ (Vygotsky, 1978, p. 65). Based on this methodological framework, Negueruela Azarola puts forward the notion of semiogenesis as a methodological tool to document the emergence of conceptual meanings in concrete communicative activity. To illustrate his approach, both to pedagogy and analysis, Negueruela Azarola describes a task he employed to teach preterite and imperfect use in an intermediate-level L2 Spanish classroom and provides selected data examples to demonstrate the kind of reflective activity some learners engaged in.
References Alderson, J. C., Clapham, C. and Steel, D. (1997), ‘Metalinguistic knowledge, language aptitude and language proficiency’. Language Teaching Research, 1, 93–121. Berry, R. (2004), ‘Awareness of metalanguage’. Language Awareness, 13, 1–16.
10
The Metalinguistic Dimension
— (2005), ‘Making the most of metalanguage’. Language Awareness, 14, 3–20. Bialystok, E. (1988), ‘Levels of bilingualism and levels of linguistic awareness’. Developmental Psychology, 24, (4), 560–7. — (1994), ‘Analysis and control in the development of second language proficiency’. Studies in Second Language Acquisition, 16, 157–68. — (2001), Bilingualism in Development: Language, Literacy, and Cognition. Cambridge: Cambridge University Press. Bialystok, E. and Ryan, E. B. (1985), ‘Toward a definition of metalinguistic skill’. Merrill-Palmer Quarterly, 31, (3), 229–51. Bowey, J. A. (1986), ‘Syntactic awareness in relation to reading skill and ongoing comprehension monitoring’. Journal of Experimental Child Psychology, 41, 282–99. — (2005), ‘Grammatical sensitivity: Its origins and potential contribution to early word reading skill’. Journal Experimental Child Psychology, 90, 318–43. Elder, C. (2009), ‘Validating a test of metalinguistic knowledge’, in R. Ellis, S. Loewen, C. Elder, R. Erlam, J. Philip and H. Reinders, Implicit and Explicit Knowledge in Second Language Learning, Testing and Teaching. Bristol: Multilingual Matters, pp. 113–38. Elder, C. and Manwaring, D. (2004), ‘The relationship between metalinguistic knowledge and learning outcomes among undergraduate students of Chinese’. Language Awareness, 13, (3), 145–62. Elder, C., Warren, J., Hajek, J., Manwaring, D. and Davies, A. (1999), ‘Metalinguistic knowledge: How important is it in studying a language at university?’ Australian Review of Applied Linguistics, 22, (1), 81–95. Ellis, R. (2004), ‘The definition and measurement of L2 explicit knowledge’. Language Learning, 54, (2), 227–75. — (2005), ‘Measuring implicit and explicit knowledge of a second language: A psychometric study’. Studies in Second Language Acquisition, 27, (2), 141–72. Fortune, A. and Thorp, D. (2001), ‘Knotted and entangled: New light on the identification, classification and value of language related episodes in collaborative output tasks’. Language Awareness, 10, (2 and 3), 143–60. Gaux, C. and Gombert, J.-E. (1999), ‘Implicit and explicit syntactic knowledge and reading in pre-adolescents’. British Journal of Developmental Psychology, 17, 169–88. Gombert, J.-E. (1992), Metalinguistic Development. Chicago: Chicago University Press. Hulstijn, J. H. (2005), ‘Theoretical and empirical issues in the study of implicit and explicit second-language learning: Introduction’. Studies in Second Language Acquisition, 27, (2), 129–40. Jessner, U. (2006), Linguistic Awareness in Multilinguals: English as a Third Language. Edinburgh: Edinburgh University Press. Kowal, M. and Swain, M. (1994), ‘Using collaborative language production tasks to promote students’ language awareness’. Language Awareness, 3, (2), 73–93. Lantolf, J. P. and Thorne, S. L. (2006), Sociocultural Theory and the Genesis of Second Language Development. Oxford: Oxford University Press. Leeser, M. J. (2004), ‘Learner proficiency and focus on form during collaborative dialogue’. Language Teaching Research, 8, (1), 55–81. Roehr, K. (2008), ‘Metalinguistic knowledge and language ability in university-level L2 learners’. Applied Linguistics, 29, (2), 173–99. Roehr, K. and Gánem-Gutiérrez, G. A. (2009), ‘The status of metalinguistic knowledge in instructed adult L2 learning’. Language Awareness, 18, (2), 165–81. Schmidt, R. (2001), ‘Attention’, in P. Robinson (ed.), Cognition and Second Language Instruction. Cambridge: Cambridge University Press, pp. 3–32.
Introduction to the Metalinguistic Dimension
11
Svalberg, A. M.-L. (2007), ‘Language awareness and language learning’. Language Teaching, 40, 287–308. Swain, M. (1998), ‘Focus on form through conscious reflection’, in C. Doughty and J. Williams (eds), Focus on Form in Classroom Second Language Acquisition. Cambridge: Cambridge University Press, pp. 64–81. Swain, M. and Lapkin, S. (2001), ‘Focus on form through collaborative dialogue: Exploring task effects’, in M. Bygate, P. Skehan and M. Swain (eds), Researching Pedagogic Tasks: Second Language Learning, Teaching and Testing. Harlow: Pearson Education, pp. 99–118. Vygotsky, L. S. (1978), Mind in Society. Cambridge, MA: Harvard University Press. Wajnryb, R. (1990), Grammar Dictation. Oxford: Oxford University Press.
Part 1
Cognitive Approaches to the Metalinguistic Dimension
2
Developing a Measure of Metalinguistic Awareness for Children Aged 8–11 Angela Tellier
University of Essex
Abstract The development of metalinguistic awareness and its measurement in adults and adolescents learning in instructed L2 settings is a well-established research area in the field of second language acquisition (e.g. Alderson et al., 1997; Roehr, 2008), but there has been relatively little work to date on developing measures of metalinguistic awareness for young children learning in the classroom. This chapter details the development of a written test of metalinguistic awareness suitable for group administration to English-speaking children aged 8–11. The test requires children to solve a series of language-related tasks (Jessner, 2006; Svalberg, 2007) based on both natural and constructed languages. Pilot studies investigated the performance of children from five English schools. The descriptive statistics from classes of children aged 8–11 years (N = 154) showed a good distribution of scores, with no obvious indication of a ceiling effect. Findings indicated an overall age-based advantage, with older children outperforming younger children, an apparent bilingual advantage which increases with age, and a statistical difference in overall performance between boys and girls, with girls outperforming boys. A series of verbal protocols (N = 22) helped to validate the test, and revealed interesting insights into children’s thought processes.
Introduction Metalinguistic awareness and its measurement in adults and adolescents learning in instructed second language (L2) settings is a well-established research area (e.g. Alderson et al., 1997; Roehr, 2008). However, even though it is now common for children to be exposed to an L2 in the early years of primary school (Enever, 2009), research on young learners has focused rather on the development of metalinguistic awareness in bilingual and multilingual children learning in naturalistic or immersion settings than on monolingual children learning an L2 in the classroom (e.g. Bouffard
16
The Metalinguistic Dimension
and Sarkar, 2008; Pinto, 2009). It is widely accepted that young children learn primarily implicitly (DeKeyser, 2003; Muñoz, 2008), i.e. intuitively (Dörnyei, 2009), by drawing on non-conscious mechanisms (Cummins, 1987; Birdsong, 1989; Karmiloff and Karmiloff-Smith, 2002), and that their ability to also learn explicitly, i.e. relying on potentially conscious mechanisms (Ellis, 2004, 2005), gradually develops as they mature cognitively. Explicit learning results in a learner’s explicit knowledge. Metalinguistic awareness is defined as the ability to make use of explicit knowledge about language (Ellis, 2004). The relevance of metalinguistic awareness in L2 learning is not a controversial issue, although the skill or set of skills which constitute the construct has ‘never achieved consensual definition’ (Bialystok, 1986, p. 498). Neither is there a clear distinction between the concept of metalinguistic ability and the concept of metalinguistic awareness as they relate to children. The terms are used synonymously in this chapter. It is acknowledged that the constructs of metalinguistic awareness and language learning aptitude may overlap with regard to language-analytic ability (see, for example, Harley and Hart, 1997); however it is outside the scope of this chapter to discuss this in detail. Metalinguistic awareness is considered by some researchers to be part of general cognition (Hakes, 1980; Bialystok, 2001). Bialystok’s (1994; Bialystok and Ryan, 1985) theoretical model aims to explain general cognitive skills including the development of L1 and L2 and is therefore particularly applicable to children who are still developing cognitively. For Bialystok and Ryan (1985, p. 230), the term metalinguistic ‘applies not to a specific mental accomplishment, but rather to a set of problems which share certain features’. Metalinguistic problems, or tasks, make high demands on two skill components – analysis of knowledge and control of processing – which develop as children mature. Bilingualism is believed to accelerate the development of these two skill components. It follows, therefore, that tasks measuring metalinguistic awareness should make high demands on both analysis of knowledge and control of processing (Bialystok, 2001). In other words, tasks should make demands not only on children’s ability to analyse, manipulate and explain their linguistic knowledge (analysis of knowledge), but also on children’s ability to direct their attention to select whatever is immediately relevant to the task, while suppressing focus on anything not relevant (control of processing) (Bialystok, 1988, 1994). Tasks which require children to attend to and manipulate language form (Jessner, 2006; Svalberg, 2007) are able to achieve this. It is reasonable to assume that an individual’s level of metalinguistic awareness would impact positively on explicit learning, and thus facilitate L2 learning. Explicit learning is believed to influence the achievement of L2 proficiency in the classroom (DeKeyser, 2003; Larson-Hall, 2008). If a child’s level of metalinguistic awareness facilitates explicit learning and the acquisition of explicit knowledge, it is important, from a pedagogical perspective, to monitor and foster such awareness as early as possible. Thus, in the context of the primary school,1 a measure of metalinguistic awareness would be useful. From a theoretical point of view, its use could contribute to an understanding of the development of metalinguistic awareness in children learning in the classroom, as well as help identify the age at which children can make effective use of explicit knowledge
Developing a Measure of Metalinguistic Awareness
17
and learning. From a practical point of view, such a measure would allow teachers to monitor learning, and develop specialized teaching programmes to improve L2 proficiency levels. In what follows, I examine the operationalization of the construct of metalinguistic awareness by looking briefly at previous measures developed for young learners, and describe the development and piloting of a new measure of metalinguistic awareness for English-speaking primary-school children aged 8–11.
Measures of metalinguistic awareness A number of previous studies have focused on the development and use of measures for children learning in instructed L2 settings, and the purpose of this review is to examine and evaluate the possible implications that these have for the development of a new measure. Previous research suggests that metalinguistic awareness begins to develop in children from age 4 onwards (Birdsong, 1989; Gombert, 1990; Karmiloff and Karmiloff-Smith, 2002; Milton and Alexiou, 2006). Hakes (1980, p. 2) worked with young monolingual L1 English children aged 4 to 8 who were in what he termed ‘middle childhood’, testing each child individually using age-appropriate materials. Children were tested in short sessions over a week on judgements of synonymy and acceptability in sentences, and on their ability to segment words phonemically. Synonymy and acceptability were included among the tasks because they are considered to play an important role in adults’ linguistic competence, and phonemic segmentation was included because it ‘seemed to require abilities at least superficially quite different from those involved in judgments of synonymy or acceptability’ (1980, p. 41). Ambiguity was acknowledged as an important aspect, but was not included because of the difficulty in devising a suitable task. Tasks required children to agree or disagree with statements and correct errors, rather than deduce a rule and apply it to new material. Tasks of synonymy, for example, required children to judge pairs of sentences such as ‘There is more cake than ice cream. There is less ice cream than cake’ (size/amount) as synonymous or non-synonymous (1980, p. 43), and tasks of acceptability required children to judge a sentence, for example ‘The kitten chased the string/*The string chased the kitten’ (word order), for acceptability or unacceptability (1980, p. 51). Additionally, if children judged a sentence to be unacceptable, they were encouraged to explain why, and to correct it. Hakes found a significant effect for age on all tasks, i.e. older children were better able to consider both meaning and form than younger children. He concluded that the period of ‘middle childhood’ showed ‘marked changes in children’s ability to focus on and deal explicitly with properties of language’ (1980, p. 99), and his analyses led him to suggest that children’s performance on the tasks was likely the result of the development of a metalinguistic ability related to general cognition. Pinto et al. (1999) built on Hakes’ work and developed three measures of metalinguistic ability for L1 Italian speakers, later adapted for L1 speakers of English. The first measure assesses ‘broad cognitive-linguistic abilities’ (1999, p. 40) in children between the ages of 4 and 6. The second is a paper-and-pencil test aimed at children aged 9–13 and comprises 6 sections (96 items in total), namely comprehension,
18
The Metalinguistic Dimension
synonymy, acceptability, phonemic segmentation, ambiguity and grammatical function. Each task consists of two parts: a linguistic question which targets children’s knowledge of rules of language use; and a metalinguistic question which asks for an explanation and justification of the linguistic response. For example, tasks of ambiguity look at either semantic ambiguity or ambiguity at the structural level: ‘The tables were made of stone. What – and how many – meanings do you see in the word “tables”? What is the first meaning you found for “The tables were made of stone”? What is the second meaning of that sentence?’; ‘John is easy to please. Does John please others, or is he the one who is pleased? What makes you sure of that?’ (1999, pp. 68–9). Children’s justifications are scored at one of three levels of analysis according to a set of criteria. Thus a response at the ‘pre-analytical’ level indicates a low level of analysis, while a response giving ‘exhaustive and pertinent analysis’ demonstrates a high level of analysis (1999, pp. 81–2). The authors acknowledge, however, that responding to the metalinguistic questions requires ‘a certain familiarity with the metalanguage of grammatical terminology’ (1999, p. 55), which raises the question of whether a test of metalinguistic awareness should be so heavily dependent on such knowledge. Children work through training items prior to completing each section, and are encouraged to give complete answers to the metalinguistic questions ‘in which all elements of the sentence are taken into account’ (1999, p. 77, italics in original). The authors suggest a maximum total time of over 4 hours to complete the 6 sections, which makes it difficult to administer the whole test in one sitting. A classic study by Bialystok (1988) compared the performance of fully bilingual, partially bilingual and monolingual 6–7-year-old children on a variety of metalinguistic tasks. These included testing children’s understanding of the concept of word, their awareness of syntax and ambiguity and their ability to segment sentences (count words) and substitute symbols. Additionally, she administered a variation of Piaget’s sun/moon problem to explore whether children understood that words are essentially arbitrary labels for objects. Bialystok found that the partially and fully bilingual groups outperformed the monolingual group on tasks requiring high control of processing, that is, on tasks requiring children to control their attention sufficiently to ignore misleading clues and focus on the specifics required to solve the problem. On tasks requiring high analysis of knowledge of linguistic structures and rules, that is, on tasks requiring children to extract and manipulate properties of language, the fully bilingual group performed better than the partially bilingual group, which in turn performed better than the monolingual group. Furthermore, older children outperformed younger children. Researchers believe that the experience of selecting and attending to linguistic features in two or more languages helps to create more highly analysed representations of linguistic and conceptual structures in the minds of young bilinguals, allowing them to benefit from enhanced metalinguistic awareness (Bialystok, 1988; Jessner, 1999, 2006). However, the level of metalinguistic awareness in bilinguals may also be dependent on other factors, such as the degree of balance between the two languages (Hakuta and Diaz, 1985), the level of literacy that the bilingual child has achieved in each language (Bialystok, 1988) and the level of emotional and academic support provided in the environment (Malakoff and Hakuta, 1991). Cummins (1987) suggests
Developing a Measure of Metalinguistic Awareness
19
that a threshold level of proficiency needs to be attained in both languages before a child’s bilingualism can result in positive cognitive effects. Yelland et al. (1993) carried out a study in Australia to see if minimal exposure to an L2 in the first two years of schooling would confer similar metalinguistic benefits on monolingual children as bilingual children appear to enjoy. The treatment group was exposed to Italian for an hour a week through stories, songs, games and rote learning of vocabulary. There was no written component, and the control group had no L2 input. Children were tested individually, and were asked to say whether the name of an object in a picture was a ‘big word’ (multisyllabic) or a ‘little word’ (monosyllabic). For half of the stimuli the word length and the real-life size of the object were congruent, for example ant, hippopotamus and for the other half, incongruent, for example caterpillar, car (1993, p. 431). Both groups demonstrated an equivalent low level of word awareness at the beginning of the study, but after six months, Yelland et al. found that the treatment group, despite having no testable competence in Italian, showed a significantly higher level of word awareness than the non-treatment group, that is, they showed a greater ability to respond correctly on the basis of word size rather than object size. These children appear to have developed the ability to exert a level of control of processing similar to that found in the full and partial bilinguals by Bialystok (1988), that is, they were able to control their attention sufficiently to focus on aspects relevant to the task (word length) while suppressing attention on those that were not (object size). Yelland et al. concluded that limited contact with an L2 in the classroom was enough to accelerate the development of word awareness, which adds a different perspective to Cummins’ (1987) proposed threshold level of competence. Milton and Alexiou (2006) studied the development of general cognitive abilities such as memory, phonetic skills and analytic skills in L1 Greek children aged 5–7 years who had been exposed to minimal input in L2 English for a year. Their measure comprised a test of rote memory, a semantic integration task, a paired-associates test and an artificial-language game. All tasks were based on pictures and games rather than on literacy. There was an apparent improvement in analytic abilities, i.e. in attention to explicit properties of language, in the children from around age 6, which led the researchers to note that ‘growth of analytic skills, and their connection with foreign-language learning success, suggests that this learning is explicit; even at very young ages’ (2006, p. 190). Moreover, results indicated an age advantage similar to that found by Hakes (1980) and Bialystok (1988), with older children outperforming younger children. In sum, from being dependent on relatively high levels of L1 literacy and focusing primarily on specific aspects of metalinguistic awareness in the L1, there has been a shift towards measures which are less literacy-dependent and which treat metalinguistic awareness as a set of general cognitive skills applicable to both L1 and L2 learning. Research has demonstrated that bilingual children enjoy enhanced metalinguistic awareness, and that it may be possible to encourage the development of aspects of metalinguistic awareness in young monolingual children learning in instructed L2 settings. Even so, there has been little work to date on developing measures of metalinguistic awareness for children.
20
The Metalinguistic Dimension
Developing a new measure of metalinguistic awareness The above review suggests four main points to consider in developing a new measure for classroom use. First, the measure should be suitable for group administration, and should therefore sit well within general school timetabling and be easy to administer by class teachers. Ideally, children should be able to respond to tasks without prior training. Second, the measure should be suitable for a broad population rather than be a highly specialized post-test used after a particular experimental treatment has been administered to a particular group of learners. The required level of literacy should be low2 to allow easy access by mixed-ability classes. Third, in the present context, the measure should be suitable for young L1 English-speaking children, who typically learn an Indo-European L2 at school. Lastly, previous research suggests that attention should be given to children’s thought processes (Hakes, 1980; Pinto et al., 1999), although scoring criteria should not be heavily dependent on the use of metalinguistic terminology. The new measure presented here is suitable for group administration to mixed-ability classes of English-speaking children aged 8–11 years. It is a composite measure comprising eleven tasks graded in terms of cognitive complexity, with complexity here defined as the number of mental manipulations needed to perform a task (Stankov, 2003). All the tasks require children to demonstrate high levels of analysis and control in order to complete them successfully. Children do not receive any prior on-task training. The children’s test booklet is designed to be non-threatening and motivating, with colour pictures and minimal written instructions (Bachman and Palmer, 1996; Weir, 2005; McKay, 2006). There is a prepared script for the teacher to read, and the test takes approximately 1 hour to complete. The review further suggests that a measure for use in English-speaking primary schools should cover domains relevant to both L1 and L2 learning, such as lexical semantics, morphosyntax, ambiguity and basic metalinguistic terminology. This latter, according to Ellis (2004, p. 261), ‘is not an essential component of explicit knowledge’, but may be helpful. Additionally, the new measure addresses concepts specific to L2 learning for L1 English-speaking children, such as grammatical gender, case, verbal and adjectival agreement, cognates and similarities and differences between languages (Schmidt, 2001; Jessner, 2006). Translation, considered by Malakoff and Hakuta to be ‘a bilingual language skill’ (1991, p. 146), is also included. Table 2.1 displays a conceptual analysis of the measure in terms of the individual tasks and the linguistic domains they cover. The tasks were ordered in such a way that knowledge gained from one task would not facilitate a later task. The booklet was divided into two sections, with tasks based primarily on natural languages presented in the first section, and tasks requiring manipulation of a constructed language in the second. This division allows for a short break in administration, if required. The first section comprises five tasks. Tasks 1–4 assess children’s ability to make comparisons between different L2s; Task 5 asks children to compare two sentences in L1 English. The first task presents seven short sentences, six from European languages and one from Esperanto, a language with a root-base drawn primarily from Indo-European languages. Children are asked to pair these for meaning (e.g. La rano estas verda and
Developing a Measure of Metalinguistic Awareness
21
Table 2.1 A conceptual analysis of the new measure of metalinguistic awareness Metalinguistic tasks (operationalization) 1.
2.
3.
4. 5.
6.
7.
8.
9.
10.
11.
Section one – natural languages Recognizing cognates (simple nouns) Translating a short sentence into L1 English (5 words) Recognizing cognates (simple nouns and adjectives) Translating longer sentences into L1 English (11–14 words) Identifying and applying a morphological rule (singular and plural nouns) Recognizing cognates (simple and compound nouns) Recognizing syntactic ambiguity Comparing different interpretations of pictures
Construct investigated: awareness of
Linguistic domain
lexical relationships between languages, what constitutes a sentence
Lexical semantics and syntax
lexical relationships between languages, what constitutes a sentence
Lexical semantics and syntax
systematicity in inflectional morphology
Inflectional morphology
lexical relationships between languages syntactic ambiguity
Lexical semantics
Section two – constructed language (ogramana) Understanding and applying metalinguistic terminology, metalinguistic terminology what constitutes a sentence Translating a sentence from L1 English into ogramana Identifying and applying a systematicity in inflectional morphological rule (case morphology, what marking) constitutes a sentence, word order Understanding and applying metalinguistic terminology, metalinguistic terminology what constitutes a Identifying and applying a sentence, word order syntactic rule (word order) Identifying and applying a systematicity in derivational morphological rule (word morphology, vowels, class) consonants and hyphens Identifying and applying systematicity in inflectional a morphological rule morphology, what (subject-verb agreement) constitutes a sentence Identifying and applying systematicity in inflectional a morphological rule morphology (grammatical gender and gender agreement)
Syntax
Syntax and lexical semantics
Syntax and inflectional morphology
Syntax
Derivational morphology, graphemes Inflectional morphology and syntax Inflectional morphology
La rana è verde), and to translate the remaining sentence (Tigrul este negru și verde) into L1 English. The second task requires children to translate three longer sentences into L1 English. Marks are awarded for the correct translation of content words with additional marks for the translation of function words such as articles and conjunctions, and for answers displaying sensible meaning. Both Task 1 and Task 2 assess children’s
22
The Metalinguistic Dimension
ability to recognize cognates of English words in other languages. Task 3 presents the singular and plural forms of the English word elephant in four European languages which children are required to pair (e.g. éléphant and éléphants; eilifint and eilifintí). To complete Task 4, children are asked to match both simple and compound nouns from European languages with cognate forms in Esperanto (e.g. arboretum and arbo; pivert and verda). Task 5 is the only task in the first section to be based entirely on L1 English, and looks specifically at the syntactic ambiguity of an English sentence: The mouse is looking at the owl with the binoculars. Four pictures illustrate three possible correct responses and one incorrect response; children choose whichever of the four they consider a match. The tasks in the second section differ from those in the first section in that they require children to manipulate a simple constructed language put together by the author, and named ogramana. Created languages, such as Esperanto, have frequently been used in tests of language aptitude, and to measure on-task learning (Spolsky, 1995; Graaff, 1997; Dörnyei, 2005). Esperanto’s transparency, regular grammar and international root vocabulary make it particularly suitable for tasks of language manipulation. The ogramana lexicon is formed from anagrams of Esperanto lexemes. Task 6 investigates children’s understanding of metalinguistic terminology; in this instance, parts of speech (see Appendix 1). Task 8 looks at children’s understanding of the relationship words have to one another (see Appendix 2), and Task 9 requires children to spot the common features in lists of words and create two more words which could belong to the same word class (see Appendix 3). Tasks 7, 10 and 11 examine a child’s ability to cope with concepts unfamiliar in L1 English but relevant for L2 learning of other European languages. Task 7 focuses on the accusative case marker. Children are asked to attend to two clauses which accurately describe the same picture of an owl looking at a mouse, but which are presented differently (SVO and OVS). The subject and object of the sentences are made clear by the use of an accusative case marker (-ch), as shown in the top line of (1). The additional information is presented here for the reader only. 1. ogrtsi grarid omsu-ch ogrts-i grarid oms-u--ch owl-M look.at mouse-N-OBJ An owl is looking at a mouse
omsu-ch grarid ogrtsi oms-u--ch grarid ogrts-i mouse-N-OBJ look.at owl-M An owl is looking at a mouse
Children are asked to construct two similar clauses to describe a picture of a mouse looking at an owl, the inverse of the example picture. Task 10 presents illustrated examples of third person singular and plural nouns, with subject-verb agreement in the plural, for example ogrtsi mrod (singular) and n-ogrtsi sape (word-initial plural marker n- and word-final verbal agreement e). Children are required to attend to the examples, deduce the rule and then apply it by choosing the correct response from four multiple-choice options which describe other pictures of active owls (see Appendix 4). Task 11 presents examples of nouns morphologically marked for gender (e.g. sun, masculine gender: ogrtsi, oplki, obrli; moon, feminine gender: oblta, osta, onkba; and star, neuter gender: omsu, olkinku, ondhu). The adjective dragn is introduced, and a
Developing a Measure of Metalinguistic Awareness
23
paradigm illustrates how a sun and moon noun and adjective inflect, as shown in the top lines of (2) and (3). The additional information is presented here for the reader only. 2. ogrtsi dragn ogrts-i dragn owl-M big a big owl
n-ogrtsi dragn-t n--ogrts-i dragn--t PL-owl-M big-PL big owls
3. osta dragn-e ost-a dragn--e cup-F big-F a big cup
n-osta dragn-et n--ost-a dragn--e-t PL-cup-F big-F-PL big cups
To complete the task, children must identify the morphological rule and apply it to construct the paradigm for the star group, which follows the sun group for singular agreement and the moon group for plural agreement. The basic scoring procedure and further details for each task are presented in Appendix 5.
Participants The new measure was piloted over three months in five schools selected, as far as possible, to be broadly representative of schools in England. Schools were of different sizes, were located in different geographical and socio-economic areas, including areas of social deprivation, and had varied pupil composition. All pilot studies were carried out on school premises and in normal school time, with tests administered to intact classes by the usual class teachers. Children’s previous language input was not controlled for, as it is increasingly difficult to verify in a multicultural society where children are exposed to different languages in the media, home, school and community. Data collected from intact classes, however, has the merit of ecological validity. Children known to be bilingual in the primary schools were noted and treated as a separate group in some analyses (see below for details), with bilingual here defined as speaking, reading and writing two or more languages to a level sufficient to cope well with ordinary classroom teaching and learning in L2 English. The researcher was present throughout. In addition, verbal protocol data were gathered by the researcher from a subsample of children. The protocols took the form of interviews, similar to think-alouds, with the researcher intervening, when appropriate, to encourage children to stay on task (see Study D for details).
Data collection and analysis Piloting was carried out in four separate phases, each of which had specific aims: Study A checked that the ogramana lexemes used in the second section of the test were not immediately recognizable as anagrams of Esperanto; Study B checked for a ceiling effect with children aged 11–12 years; Study C checked the suitability of the test and the appropriateness of the prepared script for primary ages 8–11, and additionally checked timing, and gathered teacher feedback in the context of an
The Metalinguistic Dimension
24
Table 2.2 Details of pilot studies, participating schools and participants Pilot study and school ID
School context
Year group (Y) and children’s age in years
N
Study A
S1
Y4 (8–9)
45
Study B
S2
Medium-size primary, predominantly monolingual white British, increasing L2 English Medium-size secondary, predominantly monolingual white British Large-size secondary, high ethnic minority with L2 English, high social deprivation Medium-size primary, predominantly monolingual white British. Several bilinguals, high-level L2 English Small-size primary in area of social deprivation, predominantly monolingual white British, increasing L2 English
Y7 (11–12)
26
Y7 (11–12)
31
S3
Study C
S4
Study D
S5
Y4, Y5 & Y6 (8–11)
Y4 (8–9)
154
22
informal discussion; Study D used verbal protocols to examine children’s thought processes to check whether children understood the tasks in the way intended by the designer. Table 2.2 presents a summary of the pilot studies and participants. All test data were scored by the researcher using a prepared answer key. Data from Studies A, B and C were analysed using the statistical program PASW 18, and Study D used the CHAT transcription conventions and CLAN analysis tools of the CHILDES database (http://childes.psy.cmu.edu/). As one class was interrupted and completed only ten of the eleven tasks in the test booklet, quantitative analyses were worked on percentage scores where appropriate to facilitate comparisons between year groups. None of the data sets diverged significantly from a normal distribution, and reliability indices of the test (Cronbach’s alpha) proved to be excellent in all year groups, ranging between .90 and .92.
Results and discussion Study A checked whether the ogramana lexemes used in the measure were recognizable to young children as anagrams of Esperanto, and was carried out with two intact classes of monolingual children aged 8–9 years (Y4: N = 45) who had received approximately 30 hours of instruction in Esperanto. The children were presented with two 14-item lists of the ogramana vocabulary and were asked to write next to each item a word or words in other languages which they recognized or which the stimulus word brought to mind. Examples were given using Esperanto and English words: katti – kato, Esperanto; wurme – worm, English. Analyses showed that of the ten children who
Developing a Measure of Metalinguistic Awareness
25
claimed to recognize a word in Esperanto, only four correctly identified one word, and two correctly identified another, a two-letter word. Moreover, of those children who identified words in Esperanto, several also claimed to have recognized the same words in another language, most commonly French. The low proportion of correct responses suggests that none of the children, even after considerable input in Esperanto, recognized to any meaningful extent how ogramana was constructed. Study B was conducted with two classes of children aged 11–12 years (Y7: N = 57). The descriptive statistics for the test presented in Table 2.5 indicate that there was no obvious ceiling effect, and moreover suggest that the test is also suitable for children aged 11–12 years (see Study C). Study C was the main focus of piloting and examined the performance of children in the last three years of primary school (Y4, Y5, Y6). The test was administered to two parallel classes in each year group, that is, to a total of 157 children. Three of these (Y4: N = 1; Y6: N = 2) were excluded from the analyses, as their level of L2 English was not sufficient to allow them to function adequately in the classroom. Bilingual children (N = 11, Y4: N = 4; Y5: N = 2; Y6: N = 5) were treated as a separate group, as previous research suggests that they may have a metalinguistic advantage over monolingual children (Bialystok, 1988). Classes completed the test satisfactorily within the hour, which suggests that the measure is suitable for group administration and sits well within general classroom timetabling. Feedback was gathered from the six teachers who had administered the test during an informal discussion after testing. Teachers reacted positively, agreeing that the script is teacher-friendly, suitable for group administration and age-appropriate. A general introduction to the script was added in response to feedback, as well as examples of appropriate remarks to aid classroom control while encouraging children to stay on task, for example: ‘put your pencil down when you’ve finished the task so I know you’ve finished and we can move on’.
The test and its components Initial analyses of the data collected in Study C examined the performance of the measure as a whole as well as the individual tasks. The descriptive statistics for the individual tasks and for the two sections of the test are presented in Table 2.3. For nine of the eleven tasks, children’s scores ranged from zero to the maximum possible score, which suggests that tasks differentiated well. Tasks 1, 2, 6 and 7 display low means, however. This suggests that children did not find these tasks particularly easy. The descriptive statistics for the two sections indicate a greater heterogeneity in children’s performance in section two (ogramana) than in section one (natural languages). Correlations (Pearson’s r) were calculated for all tasks in order to examine the test components in more detail. The results are displayed in Table 2.4. There are a large number of significant correlations of weak to moderate strength. This suggests that there is relatively little overlap between individual tasks, and that none of the tasks included in the measure can be considered redundant. Task 5 (ambiguity) demonstrates two negative though non-significant correlations, which suggests that
The Metalinguistic Dimension
26
Table 2.3 Descriptive statistics for individual tasks, and sections one and two (% scores) for children aged 8–11, excluding bilinguals Task 1 2 3 4 5 6 7 8 9 10 11 Section 1 Section 2
N
Mean
SD
Min.
Max.
Maximum possible score
Range
143 143 143 143 143 143 143 143 143 143 117 143 143
2.99 3.62 5.06 4.57 2.46 1.44 .26 2.42 5.31 1.01 2.05 32.8 40.0
1.89 3.10 2.15 2.39 .64 1.36 .61 1.44 3.79 .97 1.76 11.39 21.23
0 0 0 0 1 0 0 0 0 0 0 4 0
9 13 8 9 4 5 2 4 12 3 5 60 97
9 27 8 9 4 5 2 4 12 3 5 100 (%) 100 (%)
9 13 8 9 3 5 2 4 12 3 5 56 97
Table 2.4 Correlations of individual tasks (Pearson’s r) Task 1
2
3
4
5
6
7
8
9
10
11
.27** 2
.19* .18* 3
.35** .22** .32** 4
.22** –.05 .21* .19* 5
.28** .23** .17** .18* .02 6
.34** .07 .09 .25** .23** .25** 7
.29** .32** .19* .34** .23** .26** .33** 8
.28** .33** .18* .38** .23** .23** .23** .43** 9
.27** .20* .04 .18* –.07 .15 .12 .25** .17* 10
.24** .29** .32** .34** .12** .26** .26** .34** .28** .14
**significant at the .01 level (two-tailed) *significant at the .05 level (two-tailed)
there may be a problem with the design or the scoring of this task. There is a significant correlation of medium strength between the two test sections (r = .604, p = < .01), indicating a certain level of homogeneity in the measure, but no excessive overlap between sections. It appears that each test section makes a meaningful contribution to the measurement of metalinguistic awareness. To further examine the measure, a principal components analysis was conducted after the suitability of the data set had been established (Bartlett’s test of sphericity: p < .01, Kaiser-Meyer-Olkin measure of sampling adequacy = .792). The analysis, run
Developing a Measure of Metalinguistic Awareness
27
in an exploratory mode, included the 11 tasks as variables. One prominent component with an eigenvalue of 3.259 emerged, and scrutiny of the scree plot likewise suggested a one-component solution. Taken together with the correlation coefficients reported in Table 2.4, this result suggests that the test may be regarded as a measure in which each individual task makes a unique contribution to assessing the overall construct of metalinguistic awareness.
Analyses by year group The descriptive statistics for all year groups (excluding bilingual children) are presented in Table 2.5. They indicate that the test not only discriminated adequately between the different year groups, but also differentiated well within each year group. The statistics show a clear improvement over the years, which is consistent with previous findings that children’s metalinguistic awareness improves with age and cognitive development (Hakes, 1980; Bialystok, 1988; Milton and Alexiou, 2006). There is an evident upward shift in the minimum and maximum scores. Likewise, there is a clear increase in means with a larger increase in evidence between the lower years (Y4 and Y5) and again between the upper years (Y6 and Y7). T-tests confirm that the difference in performance between years 4 and 5 is statistically significant (Y4–5: t(93)–3.474, p = .001, two-tailed) and likewise between years 6 and 7 (Y6–7: t(103)–4.791, p < .001, two-tailed), but not the difference between years 5 and 6 (Y5–6: t(95)–1.317, p = .191, two-tailed).3 The standard deviations across the year groups remain relatively stable, indicating that the year groups are similarly homogeneous. Nevertheless, the differences between the year groups suggest that something may be happening in children’s development between the ages of 8 and 9, and again between the ages of 11 and 12. Children of the latter age range are in the first year of secondary school where the emphasis shifts to more explicit teaching and learning, so the larger increase in means at this time may be attributable to the transition and change in style of teaching. There is, however, no obvious reason for the differences in evidence between children in years 4 and 5. Although the range of means across the year groups (Y4: M = 28 to Y7: M = 54) is encouraging, the means themselves are lower than expected, implying that children did not find the test easy. Figure 2.1 and Table 2.6 display data for the individual tasks across the primary years (excluding bilingual children).
Table 2.5 Descriptive statistics for primary years 4, 5 and 6 and secondary year 7 (% scores) Year group Y4 Y5 Y6 Y7
Age
N
Mean
SD
Minimum
Maximum
Range
8–9 9–10 10–11 11–12
46 49 48 57
28 37 40 54
12.88 13.22 14.37 14.81
4 13 17 28
62 63 65 85
58 50 48 57
The Metalinguistic Dimension
28 90 80
8
70
5 3 9 4
Facility value
60 50
11
40
10 1
30
6
20
2 7
10 0 Year 4 ages 8–9
Task 1 Task 2 Task 3 Task 4 Task 5 Task 6 Task 7 Task 8 Task 9 Task 10 Task 11
Year 5 ages 9–10
Year 6 ages 10–11
Primary year groups and age ranges
Figure 2.1 Facility values (mean per cent correct) of individual tasks for years 4, 5 and 6
Figure 2.1 demonstrates that all year groups clearly found some tasks more challenging than others, with facility values (mean per cent correct) ranging between 9.8% (Y4: Task 7) and 70.8% (Y6: Task 8). Generally speaking, children in years 5 and 6 found the tasks easier than children in year 4. This is to be expected, as children should be able to tackle complex tasks more easily and more successfully as they get older. It would likewise be expected that year 6 children would perform better than year 5 children on all tasks, but this was not the case. Figure 2.1 demonstrates that year 6 children performed marginally less well than year 5 children on five of the eleven tasks (Tasks 1, 2, 4, 6 and 7) but markedly better on the more complex tasks, the last four in the booklet (Tasks 8, 9, 10 and 11). Table 2.6 presents the tasks rank-ordered by facility. It is noteworthy that the order of facility for years 5 and 6 matches almost perfectly, but that the order shown for year 4 is quite different. Analogous to the statistical difference found between years 4 and 5, but not between years 5 and 6, this indicates that children in year 4 show a different performance pattern from children in years 5 and 6. The rankings suggest that year 4 children found Tasks 8 (word order) and 9 (word class) more challenging than years 5 and 6, but that they had less difficulty with Task 1, which required children to match pairs of short sentences in different languages. Possibly children of this age are able to focus on basic similarities and differences between languages at a lexical level more successfully than older children who may already be anticipating a more complicated task. The three year groups were unanimous in experiencing the most difficulty with the same two tasks, that is Tasks 2 and 7. Task 7 required children to attend to word
Developing a Measure of Metalinguistic Awareness
29
Table 2.6 Tasks rank-ordered by facility (from high to low) for primary years 4, 5 and 6 Tasks rank-ordered by facility Task number
Y4
Y5
Y6
3 5 4 8 1 11 6 10 9 2 7
8 3 5 4 9 11 10 1 6 2 7
8 3 5 9 4 11 10 1 6 2 7
order (SVO – OVS) and identify and apply a morphological rule (accusative case marking), and Task 2 asked for translations of sentences from other languages. It is understandable that children should find Task 7 challenging, as an accusative case marker is no longer obvious in L1 English, but it is surprising that children experienced such difficulty with the translation task. Possibly the length of the sentences was too daunting, or translating an unknown language is too specific a task for young children to cope with. Bilingual children are generally able to cope well with switching between two or more languages (Malakoff and Hakuta, 1991), so it would be interesting to see how a class of young bilinguals might succeed in accessing unknown languages in similar tasks. Analyses of children’s written responses to the translation task (Task 2) and explanations given during the interviews (see also Study D below, and Appendix 6) reveal that children’s translations and interpretations were heavily dependent on word recognition based on personal experience at home and in the school. Attention to punctuation was the only evidence of the use of contextual clues. For example, in the Italian sentence Ho una tazza di caffè con latte e zucchero, e una torta al cioccolato (I have a cup of coffee with milk and sugar, and a chocolate cake), the word tazza was often translated as pizza, caffè as a place of refreshment (café) more often than coffee and latte as the milky-coffee drink of the same name rather than milk. One child imaginatively suggested that the ‘Ho’ could refer to the well-known seasonal chuckle, ‘Ho!, Ho!, Ho!’. Cioccolato, on the other hand, generally posed few problems, although the phrase una torta al cioccolato was frequently translated as a hot chocolate. Children generally used a word-by-word approach focusing on both graphemes and phonemes as cues, rather than relying on context. Extracts (4) and (5) from the interviews in Study D illustrate how localized children’s translations of words were, and how personal experience acted as the main focus.
30
The Metalinguistic Dimension
4. CH14: Cioccolato. INT: But what does that mean? CH2: Chocolate! INT: Why are you saying it’s chocolate? CH2: It’s got the oco, then late at the end. . . . 5. CH3: . . . or there might be three people in this. So say there’s a mother and a dad and a child, and one wants coffee, one wants a latte, and one wants chocolate.
Children’s translations of the German sentence Der Hund, die Katze und die Maus sind sehr gute Freunde (The dog, the cat and the mouse are very good friends) offer further interesting insights into how children tackle translation. The capital letters were used as formal clues, and were assumed, as in L1 English, to be proper nouns, thus Freunde was often translated as France, and Katze became the girl’s name Katie. The lexical item Der Hund with the capital letter and the following comma was taken to be a greeting at the start of a letter, reinforced by the close similarity in spelling between German Der and English Dear. Predictably, the feminine definite article die was most often read as the English verb die, which sparked many imaginative suggestions as a result. As these examples show, Study D investigated children’s thought processes as they solved verbally the same metalinguistic tasks as those presented in the written measure. This was in order to check the match between children’s thinking and the thinking behind the design of the tasks. For the purpose of Study D, the eleven tasks were presented as language puzzles on cards for children to physically manipulate, which gave focus to the discussion and helped scaffold children’s thinking. The participants were drawn from an intact class (Y4: N = 29) of whom seven were excluded because of special educational needs or because parents or carers refused permission. The remaining children (N = 22) were paired by the class teacher for general ability and were interviewed in turn by the researcher. At the beginning of each interview, children were told to explain their thinking and agree a solution. Such an approach had the advantage of pushing children to articulate their thought processes, and the researcher asked specific questions when necessary to keep children on task and encourage discussion. Each pair completed three of the eleven tasks. These were selected so that pairs completed tasks varying in cognitive complexity (for example, the first pair completed Tasks 3, 7 and 10; the second pair, Tasks 4, 6 and 9). A research assistant made observational notes and recorded the conversations which were transcribed using the CHAT transcription conventions and subsequently coded using CLAN. A product code was assigned to indicate a correct or incorrect response. Utterances which indicated children’s thought processes were assessed against a specific set of criteria and, if appropriate, against additional criteria for each task. These criteria graded the development of children’s metalinguistic thinking into three levels (see Appendix 6 for examples), ranging from almost total unawareness (level 0) to a burgeoning understanding of how language functions (level 2). Coding was subsequently checked for inter-rater reliability (95.4%).
Developing a Measure of Metalinguistic Awareness
31
Table 2.7 Frequency totals and ratios for correctness of response (product) and metalinguistic level (process) Process
Product
Ratio
Correct response
Incorrect response
52 30 3
7 21 19
Level 2 Level 1 Level 0
7.5:1 1.4:1 1:6.3
Combined totals of pairs’ correct and incorrect responses (product) and similarly their levels of thinking (process) for the whole measure are presented in Table 2.7. The results show that the majority of the children’s responses resulting from higher-level metalinguistic thinking (level 2) were correct. Conversely, the majority of responses resulting from lower-level thinking (level 0) were incorrect. There is a reasonable balance between correct and incorrect responses at level 1, although the trend is in favour of more correct responses. These findings indicate that there is a correlation between thought process and product, and confirm that there is a reasonable match between designer intention and child response, which gives a good indication of the test’s validity.
The gender advantage It is a widely held view that girls generally achieve better than boys in instructed L2 settings (Harris, 1998), particularly at secondary-school level. The data collected in Study C were analysed by gender to see if findings from mixed-ability primary-school classes were consistent with this view. Results from an independent samples t-test for the combined primary classes (N = 143), not including the bilingual children, show a statistical difference in overall performance between boys and girls (t(141)–2.315, p = .022, two-tailed), with girls outperforming boys (boys: N = 70, M = 33; M girls: N = 73, M = 38). Table 2.8 shows, however, that there are no statistical differences in evidence for similar t-tests carried out on individual year groups, although the result for year 5 approaches significance. The means show a clear progression for both genders as children get older, which is consistent with findings from Study C, although the difference in means between the genders is greater in years 5 and 6 than it is in year 4. These findings might indicate the beginning of a trend of difference in achievement between the genders found by researchers in children of secondary-school age (Harris, 1998).
The bilingual advantage Previous research has demonstrated that bilingual children have a metalinguistic advantage over monolingual children (Bialystok, 1988). In order to examine whether this was similarly the case in Study C, children’s scores for intact classes (Y4, Y5, Y6) were ranked. Table 2.9 presents the rankings of the top three scores for each year group.
The Metalinguistic Dimension
32
Table 2.8 Differences in performance between boys and girls in primary years 4, 5 and 6 (% scores)
Y4 Y5 Y6
boy girl boy girl boy girl
N
Mean
SD
T
Df
sig. (2-tailed)
22 24 26 23 22 26
27 30 34 41 38 43
13 10 12 12 12 13
–.872
44
.388
–1.946
47
.058
–1.437
46
.157
Table 2.9 Ranking of the top three scores (%) for primary years 4, 5 and 6 Ranking 1st 2nd 3rd
Primary year group Y4
Y5
Y6
62 60 59
71 63 59
82 69 65
The five scores in bold were gained by bilingual children, of whom four were bilingual in Dutch and English (Y4 and Y6) and one in Korean and English (Y5). Considering the low ratio of bilingual children to monolingual children participating in the study (1:14), the rankings display a high proportion of bilingual high achievers. Interestingly, the highest score in year 4 was obtained by a monolingual child, and there is less of a difference in the highest percentage scores between the top bilingual and monolingual children in year 4 (2%) than there is in year 5 (8%), which in turn is less than in year 6 (17%). This suggests that the metalinguistic advantage developed through bilingualism may become more marked as a child matures. Even though the number of bilingual participants in the sample was small (N = 11), a comparison of the mean facility values of individual tasks reveals that the group of bilingual children performed better than the monolingual children (N = 143) on all tasks except Task 5 which looked at the syntactic ambiguity of an English sentence (bilingual: 52.3%, monolingual 61.5%). The bilingual group performed much better than the monolingual group on three tasks in particular, Task 6 which required an understanding of metalinguistic terminology (bilingual 65.5%, monolingual 28.8%), and on Tasks 1 and 2 which depended on cognate recognition (Task 1 – bilingual 59.6%, monolingual 33.3%; Task 2 – bilingual 37.6%, monolingual 14.5%). The fact that the bilingual children were better at recognizing cognates is consistent with previous research. Cummins’ iceberg theory argues that bilingual children have an underlying language proficiency common to both their languages which allows them to compare languages more easily (Cummins, 1991, cited in Jessner, 1999), and Jessner
Developing a Measure of Metalinguistic Awareness
33
(1999, p. 205) states that the recognition of similarities across languages is part of a ‘multilingual person’s natural linguistic repertoire’. Moreover, it is not surprising that other-language children have a better grasp of metalinguistic terminology than L1 English-speaking children, given that it is often taught earlier and in a more structured way than in English primary schools. Nor is it surprising that L1 English-speaking children have a better grasp of syntactic ambiguity than children for whom English is an L2. In conclusion, even though the number of bilinguals in the present study was low, the findings accord well with the findings of previous research that bilingual children display a metalinguistic advantage over monolingual children (Bialystok, 1988; Jessner, 1999; Bialystok, 2001) and thus perform better on metalinguistic tasks. Moreover, if bilingual children are more metalinguistically aware, their strong performance on this test suggests that the measure as a whole has succeeded in capturing (some of) the metalinguistic skills that underlie that advantage.
Conclusion The aim of the research presented here was to develop a measure of metalinguistic awareness for young L1 English-speaking children learning in an instructed L2 setting. The measure performed well empirically, with no obvious indication of a ceiling effect. Analyses showed that the test was highly reliable, differentiated well both within and across year groups and that tasks were generally well-ordered in terms of facility. Moreover, the test was found to sit well within general classroom timetabling. Five main findings have emerged as a result of the pilot studies. First, there was an overall age-based advantage, with older children outperforming younger children. Additionally, year 6 children displayed a marked ability to solve the more cognitively complex tasks, which suggests that children’s ability to analyse linguistic features and control their attention develops as they mature cognitively, allowing them to access and make use of explicit knowledge (Ellis, 2004) and attend more to form (Bialystok and Ryan, 1985). This is consistent with previous research (Hakes, 1980; Bialystok, 1988; Milton and Alexiou, 2006). Second, the overall statistical difference between years 4 and 5 points to a possible threshold of development occurring around age 9. Moreover, although years 5 and 6 showed an equivalent pattern in the ordering of facility of tasks, year 4 showed a different pattern. Third, findings revealed an overall statistical difference in performance between boys and girls in the sample, with girls outperforming boys. There were no statistical differences between the genders within year groups, although the result for year 5 approached significance, and there was a lower difference in means between the genders in year 4 than between the genders in subsequent years. These findings suggest the beginning of a possible trend of difference in achievement between the genders which has been found to manifest itself more obviously in children of secondary school age (Harris, 1998). Fourth, the high ratio of bilingual to monolingual top-scoring children, and the bilingual children’s better performance on specific tasks, notably cognate recognition,
34
The Metalinguistic Dimension
suggests that bilingual children may enjoy not only a general metalinguistic advantage over monolingual children, but also a specific metalinguistic advantage on certain tasks. An age advantage was also in evidence, with older bilingual children outperforming younger ones. Furthermore, results indicate that the difference between the top-scoring monolingual and bilingual children increased considerably over time, suggesting that metalinguistic awareness in young bilinguals becomes more marked as they mature cognitively. Lastly, the analysis of the verbal protocols indicated that there was a correlation between thought process and product, in other words, those children who showed evidence of higher-level metalinguistic thinking gave more correct responses than children who displayed lower-level thinking, and conversely. This finding, and the fact that the test showed excellent reliability as well as good empirical discrimination and differentiation, helps to validate the measure. In sum, findings from the empirical investigation are generally consistent with previous research, which suggests that both age and bilingualism confer cognitive benefits. From a pedagogical perspective, it might be useful to give specific input to monolingual children before the age of 9 to help maintain the impetus of metalinguistic development at an age when the difference between boys and girls and between monolingual and bilingual children does not appear to have opened up yet. Input might take the form of specific programmes which give attention to the similarities and differences across languages. Moreover, if bilingual children are more metalinguistically aware than monolingual children, their strong performance on this test suggests that the measure as a whole is capturing (some of) the metalinguistic skills that underlie that advantage. Overall, the findings from the pilot studies have thrown up some interesting issues that warrant further investigation, and confirm that the new measure is worth developing further.
Acknowledgements The author would like to thank Peter Oliver for designing the owl graphics which appear in the test; Simon Davies for his assistance with proof-reading; and Karen Roehr for her many constructive comments on earlier drafts of the chapter, and her continued support and advice.
Developing a Measure of Metalinguistic Awareness
Appendix 1 – Task 6
35
36
The Metalinguistic Dimension
Appendix 2 – Task 8
Developing a Measure of Metalinguistic Awareness
Appendix 3 – Task 9
37
38
The Metalinguistic Dimension
Appendix 4 – Task 10
Developing a Measure of Metalinguistic Awareness
39
Appendix 5 – Scoring procedure Task number 1.
2.
3. 4. 5. 6. 7. 8. 9. 10. 11.
Scoring criteria
Maximum score
mark for each pair of sentences correctly identified (3) mark for each word of the remaining sentence correctly translated (6) mark for each of six keywords correctly identified in each of the three translation sentences (3 x 6); additional marks if: (a) words other than keywords are correctly identified, (b) appropriate sentence-initial and sentence-final punctuation are used, and (c) the sentence displays sensible meaning (3 x 3) mark for each correctly identified singular or plural (8) mark for each cognate correctly matched to the appropriate headword (9) mark for each picture correctly identified as appropriate (3) mark for the inappropriate picture left unidentified (1) mark for each ogramana word correctly identified (5) mark for each accusative case ending correctly written (2) mark for each verb correctly indicated (2) mark for each adjective correctly positioned (2) mark for each correct commonality written (3 x 4) mark for each correct picture noun-verb phrase identified (3) mark for the selection of correct gender nouns (2) mark for each correctly inflected adjective (2) mark for a correctly inflected noun (1) Maximum score
9
27
8 9 4 5 2 4 12 3 5 88
Appendix 6 The following extracts from transcripts of interviews with children in Study D illustrate how examples of coding criteria for each of the three developmental levels were applied to paired conversations. Level 0 – the child has not yet acquired the ability to analyse, and thus: ll ll ll
makes general and sweeping statements, e.g. ‘I don’t know’, ‘I guessed’ employs tautology relies solely on word length as a means of comparison
Example: extract from Task 4, matching cognates, e.g. binoculars/okulo; ĝardeno/ jardin; dachshund/hundo; arboretum/arbo.
40
The Metalinguistic Dimension CH4: I think it [binoculars] goes there [ĝardeno], because they’re both long words. CH5: That’s what I was thinking! CH4: Umm dachshund. That’s a dog. I’ve got this strange feeling in my head that it goes there [arbo].
Level 1 – the child is developing awareness, and directing his/her attention more appropriately, but does not yet have the ability to move far from visual observation; the child has not yet acquired the ability to analyse, and thus: ll ll ll
relies on a one-to-one letter/word match may identify or cite a linguistic rule, albeit inappropriately imposes English orthography on other-language words, assigning English meaning as a result.
Example: extract from Task 1, pairing sentences from different languages, e.g. La rano estas verda and La rana è verde. CH6: Um, I think them two go together because they say almost exactly the same word [= verde and verda] (attention focused at lexical level). INT: Oh, I see. CH6: But that has an e and that has an a (attention focused at phoneme level). Level 2 – the child gives a pertinent explanation which, although it may not necessarily involve much use of metalanguage, shows a growing awareness of the relationship between words and displays linguistic knowledge in addition to that gained through visual comparison, and thus: ll ll ll ll
identifies and/or cites an appropriate linguistic rule shows awareness of exception to rule shows awareness of an underlying linguistic comparison goes beyond expecting English orthography to either apply to other languages, or to dictate meaning.
Example: extract from Task 3, identifying singular and plural pairs of the English word elephant in different languages by separating the endings from the stem and comparing not only between languages, but between same language forms: éléphant/ éléphants; eilifint/eilifintí; elefante/elefantes; elefant/elefanţi. CH7: So if anything that [el] means one (pointing to the start of all the elephant words), and that [..efantes] means elephant (covering the beginning two letters el of the word elefantes leaving only ..efantes visible) and then you’ve got one [el] and elephant [..efantes], but there you’ve got, so if you had apple and you added on that [s] (indicates plural marker s on the word
Developing a Measure of Metalinguistic Awareness
41
elefantes) it would be apples. So that [elefantes] really means elephants. So that’s more than one elephant. CH8: I think the i [pointing to the i on elefanţi] maybe, might be like an s. CH7: explained that he had been learning ‘holiday Spanish’ which seemed to have been uppermost in his mind, as he detached the prefix el suggesting that it indicated the singular (el is the singular masculine definite article in Spanish). He then focused on the plural marker s in English to suggest that other-language words with s in word-final position are similarly plural. This overrode his original thoughts on el as a singular marker. CH8 is able to extrapolate that if the English marker s in word-final position indicates a plural, then it is possible that a word-final letter in another language performs a similar function.
Notes 1 The terms primary and secondary are used here to refer to schools in England providing non-selective compulsory full-time education for children aged 5–11 (school years 1–6) and 11–16 (school years 7–11), respectively. 2 As pointed out by a reviewer, it is difficult to take into account different levels of literacy in a written test. 3 A one-way Anova and post-hoc analyses provided similar results. 4 The abbreviation CH refers to the child interviewees; INT refers to the interviewer.
References Alderson, C., Clapham, C. and Steel, D. (1997), ‘Metalinguistic knowledge, language aptitude and language proficiency’. Language Teaching Research, 1, (2), 93–121. Bachman, L. and Palmer, A. (1996), Language Testing in Practice. Oxford: Oxford University Press. Bialystok, E. (1986), ‘Factors in the growth of linguistic awareness’. Child Development, 57, 498–510. — (1988), ‘Levels of bilingualism and levels of linguistic awareness’. Developmental Psychology, 24, (4), 560–7. — (1994), ‘Analysis and control in the development of second language proficiency’. Studies in Second Language Acquisition, 16, 157–68. — (2001), Bilingualism in Development: Language, Literacy, and Cognition. Cambridge: Cambridge University Press. Bialystok, E. and Ryan, E. B. (1985), ‘Toward a definition of metalinguistic skill’. Merrill-Palmer Quarterly, 31, (3), 229–51. Birdsong, D. (1989), Metalinguistic Performance and Interlinguistic Competence. Berlin: Springer. Bouffard, L. A. and Sarkar, M. (2008), ‘Training 8-year-old French immersion students in metalinguistic analysis: An innovation in form-focused pedagogy’. Language Awareness, 17, (1), 3–24.
42
The Metalinguistic Dimension
Cummins, J. (1987), ‘Bilingualism, language proficiency, and metalinguistic development’, in P. Homel, M. Palij and D. Aaronson (eds), Childhood Bilingualism: Aspects of Linguistic, Cognitive and Social Development. London: Lawrence Erlbaum Associates, pp. 57–73. — (1991), ‘Language learning and bilingualism’. Sophia Linguistica, 29, 1–194. DeKeyser, R. (2003), ‘Implicit and explicit learning’, in C. Doughty and M. Long (eds), The Handbook of Second Language Acquisition. Oxford: Blackwell, pp. 313–48. Dörnyei, Z. (2005), The Psychology of the Language Learner: Individual Differences in Second Language Acquisition. London: Lawrence Erlbaum Associates. — (2009), The Psychology of Second Language Acquisition. Oxford: Oxford University Press. Ellis, R. (2004), ‘The definition and measurement of L2 explicit knowledge’. Language Learning, 54, (2), 227–75. — (2005), ‘Measuring implicit and explicit knowledge of a second language’. Studies in Second Language Acquisition, 27, 141–72. Enever, J. (2009), ‘Can today’s early language learners in England become tomorrow’s plurilingual European citizens?’, in M. Nikolov (ed.), Early Learning of Modern Foreign Languages: Processes and Outcomes. Bristol: Multilingual Matters, pp. 15–29. Gombert, J. E. (1992), Metalinguistic Development. New York: Harvester Wheatsheaf. Graaff, R. de. (1997), ‘The EXperanto experiment: Effects of explicit instruction on second language acquisition’. Studies in Second Language Acquisition, 19, 249–76. Hakes, D. (1980), The Development of Metalinguistic Abilities in Children. Berlin: Springer-Verlag. Hakuta, K. and Diaz, R. M. (1985), ‘The relationship between degree of bilingualism and cognitive ability: A critical discussion and some new longitudinal data’, in K. E. Nelson (ed.), Children’s Language. Hillsdale, NJ: Erlbaum, vol. 5, pp. 319–44. Harley, B. and Hart, D. (1997), ‘Language aptitude and second language proficiency in classroom learners of different starting ages’. Studies in Second Language Acquisition, 19, 379–400. Harris, V. (1998), ‘Making boys make progress’. Language Learning Journal, 18, 56–62. Jessner, U. (1999), ‘Metalinguistic awareness in multilinguals: Cognitive aspects of third language learning’. Language Awareness, 8, (3 and 4), 201–9. — (2006), Linguistic Awareness in Multilinguals: English as a Third Language. Edinburgh: Edinburgh University Press. Karmiloff, K. and Karmiloff-Smith, A. (2002), Pathways to Language: From Fetus to Adolescent. Cambridge, MA: Harvard University Press. Larson-Hall, J. (2008), ‘Weighing the benefits of studying a foreign language at a younger starting age in a minimal input situation’. Second Language Research, (24), 35–63. Malakoff, M. and Hakuta, K. (1991), ‘Translation skill and metalinguistic awareness in bilinguals’, in E. Bialystok (ed.), Language Processing in Bilingual Children. Cambridge: Cambridge University Press, pp. 141–66. McKay, P. (2006), Assessing Young Learners. Cambridge: Cambridge University Press. Milton, J. and Alexiou, T. (2006), ‘Language aptitude development in young learners’, in C. Abello-Contesse, R. Chacón-Beltrán, M. D. López-Jiménez and M. M. Torreblanca-López (eds), Age in L2 Acquisition and Teaching. Oxford: Peter Lang, pp. 177–92. Muñoz, C. (2008), ‘Age-related differences in foreign language learning. Revisiting the empirical evidence’. International Review of Applied Linguistics in Language Teaching, 46, (3), 197–220.
Developing a Measure of Metalinguistic Awareness
43
Pinto, M. A. (2009), ‘Early metalinguistic development in Italian-English bilingual preschoolers. A study in a British context’, Conference paper. ACLA Conference: Ottawa, 27–29 May 2009. Pinto, M. A., Titone, R. and Trusso, F. (1999), Metalinguistic Awareness: Theory, Development and Measurement Instruments. Pisa Roma: Istituti Editorialie Poligrafici Internazionali. Roehr, K. (2008), ‘Metalinguistic knowledge and language ability in university-level L2 learners’. Applied Linguistics, 29, (2), 173–99. Schmidt, R. W. (2001), ‘Attention’, in P. Robinson (ed.), Cognition and Second Language Instruction. Cambridge: Cambridge University Press, pp. 3–32. Spolsky, B. (1995), ‘Prognostication and language aptitude testing 1925–62’. Language Testing, 12, (3), 321–40. Stankov, L. (2003), ‘Complexity in human intelligence’, in R. J. Sternberg, J. Lautrey and T. I. Lubart (eds), Models of Intelligence: International Perspectives. Washington, DC: American Psychological Association, pp. 27–42. Svalberg, A. M.-L. (2007), ‘Language awareness and language learning’. Language Teaching, 40, 287–308. Weir, C. (2005), Language Testing and Validation. Basingstoke: Palgrave Macmillan. Yelland, G. W., Pollard, J. and Mercuri, A. (1993), ‘The metalinguistic benefits of limited contact with a second language’. Applied Psycholinguistics, 14, 423–44.
3
Examining the Contribution of Syntactic and Metasyntactic Abilities to Reading Comprehension among Native and Non-Native Speakers Daphnée Simard
Université du Québec à Montréal
Denis Foucambert
Université du Québec à Montréal
Marie Labelle
Université du Québec à Montréal
Abstract Syntactic and metasyntactic abilities are some of the components involved in reading skills. Previous studies indicate that they both correlate with reading skills in L1 and in L2. However, correlations do not provide information about the relative importance of each skill in the reading process. Using structural equation modelling, we examined whether the relative importance of syntactic and metasyntactic abilities in reading comprehension is the same among child native and non-native speakers of French. The results show for both groups that although metasyntactic ability contributes directly to reading comprehension, the contribution of syntactic ability is mediated through metasyntactic ability.
Introduction Metalinguistic ability is defined as a learner’s ability to consciously and intentionally manipulate language features (Gombert, 1992). According to authors such as Armand (2000), Carlisle (1995) and Snow et al. (1989), good metalinguistic ability allows the use of abstract and decontextualized language, which is an asset in successful reading skills learning. Since reading is identified as a strong predictor of school success (Bialystok, 2001b, p. 175), its successful development is crucial for school-age children.
46
The Metalinguistic Dimension
In this chapter, we aim to shed new light on the specific ways in which metalinguistic ability, and more precisely, metasyntactic ability, contributes to reading skills. Metalinguistic ability can be related to any domain of language (Gombert, 1992), and two different metalinguistic domains, metaphonological and metasyntactic abilities, have been identified as particularly important contributors to reading skills development (Siegel, 1993). By far, the one most studied is metaphonological ability (Smith-Lock, 1995; Bowey, 1986), i.e., the ability to identify and intentionally manipulate the phonological components in linguistic units (Gombert, 1992, p. 15). It is believed to predict reading skills given that good metaphonological ability is essential to understanding grapheme-phoneme correspondences in play in fluent word recognition (Goswami et al., 1990; Tunmer et al., 1992). Despite the importance attributed to metaphonological ability, some researchers have highlighted the limitations of phonological processing. In Gaux and Gombert’s (1999) words, ‘some children fail to understand sentences in a text, in particular one possessing a complex syntactic structure, even when they manage to decode all the words it contains’ (p. 169). Therefore, they hypothesized that the difficulties experienced by these children are related to their metasyntactic ability. Metasyntactic ability (MSA) refers to the ability to reflect on the syntactic aspects of language such as words, grammatical classes and rules governing their combinations (Gaux et al., 1997) as well as the ability to intentionally control and manipulate them (Bowey, 1986, 2005; Goff et al., 2005; Gombert, 1992; Tunmer et al., 1987). Certain behaviours such as rejection of ungrammatical sentences have been observed among 4-year-old children (e.g. Hakes, 1980). However, according to Gombert (1992), it is not until 6–7 years of age (with the beginning of schooling) that real metasyntactic judgements are made. Other metasyntactic behaviours such as correcting syntactic errors (e.g. word order) appear only at around age 6 or 7 and are still scarce at around age 9 (Comblain, 2005). MSA appears to continue developing throughout elementary school. The relationship between MSA and reading skills among native speakers has been demonstrated repeatedly (e.g. Bowey, 1986, 2005; Blackmore et al., 1997; Nation et al., 2000; Tunmer et al., 1988; Tunmer, 1990). Although fewer MSA studies have been conducted among non-native speakers, these studies provide evidence for a significant contribution of MSA to reading skills in this population as well (e.g. Abu-Rabia et al., 2002; Cain, 2007; Da fontura et al., 1995; Geva et al., 2000; Low et al., 2005; Lesaux et al., 2006; Lesaux et al., 2007). MSA has been claimed to support reading comprehension by allowing for the predictions of the sequence of words in sentences (e.g. Chiappe et al., 1999; Lesaux et al., 2003; Lipka et al., 2012; Low et al., 2005; Siegel, 1993) as well as facilitating the reassembly of decoded words into phrases, thus making recall easier (Tunmer et al., 1984). However, the nature of the relationship between MSA and reading comprehension is still not well understood and further research is needed. Another relationship that needs to be better understood is the interrelationship between MSA, reading skills and syntactic ability (e.g. Cain, 2007). Syntactic ability corresponds to the ability to decode language by parsing phrases and sentences. This processing of linguistic information is incremental in nature as the individual has to integrate incoming words syntactically and semantically in order to derive meaning
Contribution of Syntactic and Metasyntactic Abilities
47
(Koda, 2007). Therefore, syntactic ability allows readers to construct the syntactic frame of a sentence. According to Koda (2007), this involves two major operations: (1) phrase construction through lexical information integration and (2) assignment of roles to the sentence constituents. Only intuitive and unconscious knowledge is necessary to carry out these two operations. Syntactic ability (SA) has been shown to correlate with MSA (e.g. Cain, 2007) and with reading skills among native (e.g. Goff et al., 2005) and non-native speakers (e.g. Martohardjono et al., 2005; Verhoeven, 1990). However, the study of the interrelationship between MSA, SA and reading skills has been neglected (Cain, 2007; Koda, 2007). Correlational studies such as the ones mentioned above are interesting because of the information they provide about associations between reading skills and their various components, but they do not provide information about the relative importance of these components in reading comprehension. van Gelderen et al. (2007), drawing on the work of authors such as Carr et al. (1990), Just et al. (1992), Laberge and Samuels (1974), Perfetti (1999) and Stanovich (1991), argued that in order to better understand the way various components relate to reading skills, the use of structural equation modelling is necessary. This type of analysis allows for the measurement of how much of the performance in reading comprehension is explained by performance on other tasks assumed to contribute to the skill (p. 477). In that respect, a few studies specifically examined SA or MSA as reading skills components among non-native speakers. In a seminal study, Verhoeven (1990), for instance, investigated the differences in reading acquisition processes among grade 1 and 2 native (Dutch) and non-native (Turkish) speakers using a word-reading task and three reading comprehension tasks (coherence, anaphora and inference tasks). A sentence repetition task was used to measure SA. The sentences targeted morphosyntactic (e.g. plurals, tense markers) and syntactic features (e.g. determiners, pronominals). An oral proficiency task measuring receptive and productive vocabulary was also used. According to the author, the results revealed that SA in the second language (L2) as measured by the sentence repetition task had a strong direct impact on L2 reading comprehension. In their 2004 study, van Gelderen et al. investigated the role of first language (L1) and L2 components in reading comprehension among 397 Dutch students (13–14 years old) learning English. Metacognitive ability was measured by means of a self-report questionnaire in which the participants had to judge whether or not a series of statements such as ‘When you read it makes sense to put most effort into memorizing the details of the text’ (p. 480) were correct. The grammatical tasks in Dutch L1 and English L2 consisted of three parts in which the participants had to provide correct verb conjugations, noun morphology, adjectives, function words and word order. The results demonstrated that the L1 and L2 reading comprehension componential structures were very similar, but not identical. In particular, among L1 speakers, only metacognitive knowledge accounted for a significant proportion of the variance in reading. Among non-native speakers, results revealed, according to the authors, that vocabulary knowledge along with metacognitive knowledge explained the variance in reading.
48
The Metalinguistic Dimension
In a follow-up study, van Gelderen et al. (2007) examined the relationship between L1 and L2 reading development among 389 native and non-native speakers of Dutch learning English (10–14 years old). They used similar instruments as in their 2004 study, except that their grammatical task targeted only morphosyntactic knowledge. In the grammatical task, the participants had to provide the correct form of verbs (i.e. number, tense, aspect, agreement). The results show that metacognitive skills contributed to reading in both L1 and L2, unlike linguistic knowledge, which seems to contribute to reading only in the language in which it is measured.
Aim of the present study Although MSA and SA both appear to contribute to reading skills among non-native speakers, overall, no L2 study has looked specifically at the potential relationship between the two constructs in the modelling of L2 reading comprehension. Moreover, no previous study has investigated whether the weight of syntactic and metasyntactic abilities in explaining reading comprehension is the same among child native and non-native speakers. Therefore, the aim of our study is to investigate the following research questions: (1) What is the respective weight of MSA and SA in explaining reading comprehension skills in French-speaking native and non-native children? (2) Is there a relationship between MSA and SA in the modelling of reading comprehension in French-speaking native and non-native children? The investigation of syntactic and metasyntactic abilities within the same study raises the issue of their respective measurement (e.g. Birdsong, 1989; Demont, 1994; de Villiers et al., 1972; Cain, 2007; Simard et al., 2007). It is possible to address this issue by using Bialystok’s framework (2001b), according to which tasks may be classified along a continuum from linguistic to metalinguistic on the basis of two orthogonal dimensions: analysis of knowledge, and control of attention (e.g. Bialystok, 1991, 1993, 1999, 2001a, 2001b). Analysis of knowledge refers to the level of explicitness of knowledge required to accomplish the tasks (Bialystok, 2001b). The more implicit (unconscious and unavailable for verbal report) the language knowledge, the less analysed it is and the more explicit (conscious and available for verbal report) the language knowledge, the more analysed it is. Control of attention refers to learners’ ability to direct their attention to relevant information and to integrate it in real time (Bialystok, 1990). Therefore, tasks located at the lower end of both dimensions, i.e. tasks requiring no or minimal analysis of knowledge and control of attention, are considered linguistic. As the demand in both dimensions increases, tasks are considered to involve more and more metalinguistic ability. For instance, acceptability judgements and repetition-of-grammatical-sentences tasks both containing only semantic sentences (the processing of asemantic sentences is believed to increase the demands on control of attention) require very low levels of analysis of knowledge and control of attention (Bialystok, 2001b, p. 178). Therefore, they will be considered as linguistic in the present study. It should be noted that these two tasks are also considered non-metalinguistic by many researchers including Gaux et al. (1999). An example of a metalingusitic task, on the other hand, would be explaining why a sentence is ungrammatical. In such a task, participants have to locate the error (higher level of control) and provide an explanation (higher level of language analysis).
Contribution of Syntactic and Metasyntactic Abilities
49
Method An empirical study was conducted to investigate the research questions. The methodology used is detailed below.
Participants The participants in the study were 37 French native speakers (NS) (17 girls and 20 boys) and 36 non-native speakers (NNS) (18 girls and 18 boys). The NS used French to communicate with their mother and reported no other language being spoken at home. The NNS were of Portuguese origin, but were all born in Canada. They are therefore considered heritage language speakers, i.e. children raised in a home where a non-majority language is spoken, in our case Portuguese, and children who speak or at least understand the non-majority language and are ‘to some degree bilingual in that language and in the majority language’ (Valdés, 2001, p. 2). They were all enrolled in a heritage language programme, either as an extra-curricular activity at a regular school, or at a Saturday school. It is expected that the language competence of heritage language children ‘will differ from that of native monolinguals of comparable age’ (Rothman, 2009, p. 156), which is the case with our NNS participants who are statistically weaker in French than the NS, as measured by a standardized receptive vocabulary test, Échelle de vocabulaire en images Peabody (NS M = 12.8, SD = 1.96; NNS M = 11.2, SD=2.51; F(1,71) = 8.98; p=0.004). The 73 participants had the same mid-level socio-economic status. The mean age of the participants was 10.6 years at the time of the study (range: min. = 8.2; max.= 12.5); there was no significant difference between the mean ages of the two groups (Mage NS = 10.3; Mage NNS=10.8; F(1, 71)=1.61, p>0.2).
Measurement instruments The participants completed a series of measures as described below. For all the measurement instruments, care was taken to use vocabulary known by the children, except for the standardized receptive vocabulary task and the phonological memory test, which involves non-words. Additionally, two practice items were presented at the beginning of each task and all the instruments were piloted beforehand.
Metasyntactic measures The first MSA measure was a task traditionally used in MSA research, i.e. repetition of ungrammatical sentences (Gaux et al., 1999). This task requires participants to repeat deviant sentences exactly as they heard them without correcting the errors encountered (Bowey, 1986; Demont, 1994). It consisted of 40 sentences, 20 target items and 20 filler sentences. The average length of the test sentences was 15 syllables (min. = 13; max. = 17). The sentences were pre-recorded by a NS of French on a digital recorder and presented to the participants through headphones.
The Metalinguistic Dimension
50
This task is considered to be metalinguistic as it requires ‘[. . .] an effort of control on the part of the subject, who has to suppress the natural tendency to normalize such utterances’ (Gombert, 1992, p. 59). Consequently, it exerts higher demands of control of attention. Although it is considered to be metalinguistic, the level of analysis of language required to carry out this task, according to Bialystok’s framework, is rather low. By contrast, the second metalinguistic task used, i.e. the replication-of-error task, required higher levels on both dimensions. In the replication-of-error task, the participants were presented with two sentences, with the first one of each pair containing an error. The participants’ task was to identify the error in the first sentence and reproduce the same type of error in the second sentence (Simard et al., 2007; Gombert et al., 1994; Nocus et al., 1997). Here is an example: Samuel donne lui la bicyclette rouge. [ungrammatical] (Samuel gives him the red bicycle.) Sophie lui propose d’aller à la plage. [grammatical] (Sophie suggests to him to go to the beach.)
In order to provide the correct answer, the participants would have to say ‘Sophie propose lui d’aller à la plage’, replicating the misplacement of the clitic pronoun observed in the first sentence. The task consisted of 15 pairs of sentences similar to the example. The sentences were presented visually. In addition, to control for reading ability, the sentences were pre-recorded by a NS of French on a digital recorder and presented to the participants through headphones. The syntactic features targeted in the two MSA tasks were placement of clitic pronouns, formation of passive sentences and of causative sentences and comparison and choice of relative pronouns. According to Gombert and his colleagues, since participants have to use explicit knowledge to locate an error and to intentionally reproduce it, the replication task is considered to be metalinguistic (Gaux et al., 1999; Nocus et al., 1997). In Bialystok’s framework, it would be located in the higher right quadrant, indicating higher demands in both analysis of knowledge and attentional control, since it requires participants to analyse the erroneous sentence, identify the source of the error, find the same element in the second sentence and consciously reproduce the error (see Simard et al., 2007, 2012 for details).
Syntactic measures To measure the participants’ SA, a preference task and a repetition-of-grammatic al-sentences task were used. In the preference task (Ellis, 1991), two sentences like the following were presented visually in a single line of text and, in order to control for reading ability, the items were additionally pre-recorded and played back to the participants. The aural presentation also controlled the time the participants had to indicate which sentence they preferred. This was done in order to ensure that the participants would use their intuitive (unconscious) knowledge.
Contribution of Syntactic and Metasyntactic Abilities
51
Cette activité est aussi facile à faire que l’autre. Cette activité est autant facile à faire que l’autre. (This activity is as easy as the other one. This activity is as easy than the other one.) In the preceding example, the correct answer is ‘Cette activité est aussi facile à faire que l’autre’ since the comparative presented in the two sentences is constructed with an adjective, ‘facile’ (easy); ‘autant’ can only be followed, in that context, by a noun or a verb. The test targeted auxiliary use, subjunctive in connection with relatives, comparatives, indirect questions, verb agreement in opaque contexts, preposition selection and constructions with quantifiers or negation. The task requires little analysis and little attentional control since the participants only have to identify which of the two sentences sounds better. However, it requires the participants to parse the two sentences, and therefore involves syntactic decoding. In the repetition-of-grammatical-sentences task (e.g. Verhoeven, 1990), the participants were asked to repeat a series of 40 sentences comprising 20 target items and 20 filler sentences. The average length of the test sentences was 15 syllables (min. = 9; max. = 18); all the sentences were pre-recorded by a NS of French on a digital recorder and presented to the participants through headphones. This type of task has been shown to require syntactic parsing (e.g. Clay, 1971; Lust, 2006; Lust et al., 1996). It also requires considerably lower levels of control of attention and analysis of language than the other repetition task, since the participants do not have to refrain from correcting an error. Both the repetition-of-grammatical-sentences task and the repetition-of-ungramm atical-sentences task arguably require some level of phonological memory. However, since the sentences were of the same length in both tasks, it was assumed that better performance with the set of grammatical sentences would be due to the lower level of control of attention required.
Reading comprehension The participants completed a reading comprehension task (Foucambert, 2003, 2009). In order to demonstrate effective reading comprehension, the participants had to show that they had a coherent representation of the meaning of the text (Cain et al., 2011). This coherence is established by relationships the reader makes between different important text items (Graesser et al., 2007; Graesser et al., 2003) and is measured at three different levels of comprehension (Denhiere et al., 1992; Haberlandt et al., 1985; Zwaan et al., 1995): (1) text surface structure (i.e. the recognition of lexical and syntactic elements explicitly present in the text), (2) propositional text base (i.e. inferences made on the basis of elements that were not explicitly present in the text) and (3) the mental (or situational) model generated by the reader (i.e. the mental model created from the integration of sentences into text representation, abstraction from the main and sub-topics and interaction of textual information with the reader’s knowledge of the world) (see Johnson-Laird, 1983; Kintsch, 1988; van Dijk et al., 1983).
52
The Metalinguistic Dimension
The text comprised 589 words (taken from ‘Les contes du chat perché’, Marcel Aymé, 1930). After reading it, the participants were required to answer 12 questions in a multiple-choice format. One-third of the questions tapped the text surface structure (e.g. Q1: Sur les peintures, le cheval est dessiné plus gros que le coq. Response: – Vrai, – Faux, – l’histoire ne le dit pas. ‘On the pictures, the horse is drawn bigger than the rooster. Response: – True – False – the story does not tell’). Another third targeted the propositional text base (e.g. Q6: Dans cette histoire, le coq croit depuis longtemps être au moins de la taille du cheval. Response: – Vrai – Faux – L’histoire ne le dit pas. ‘In this story, the rooster has long believed to be at least of the size of the horse. Response: – True – False – The story does not tell’). The last third focused on the mental (or situational) model generated by the reader (e.g. Q4: Ce texte parle: a. des boîte de peinture; b. des rapports difficiles entre personnes; c. de la cuisson du poulet; d. du changement de taille. ‘This text is talking about: (a) boxes of paint, (b) uneasy relationships between people, (c) of cooking chicken, (d) the change of size’). The participants were asked to answer the questions without the support of the text. Leaving the text in front of them would have allowed the scanning of the text for relevant words in order to answer text surface structure questions. The answers would then hardly reflect any reading comprehension. Researchers such as Guthrie et al. (1987a, 1987b) see such a superficial search for information as a relatively independent cognitive process, which they conceive as a problem-solving process (see Cataldo et al., 2000 for a discussion). In addition, the common model of text comprehension assumes that readers use long-term memory structures that are activated by phrases that enter into working memory as they are read (Bower et al., 1990; Kintsch, 1988, 1998). Therefore the participants should be able to answer questions regarding the mental model or the propositional text base without going back to the text.
Receptive vocabulary A standardized French Canadian translation of the Peabody Picture Vocabulary Test Revised (Dunn et al., 1981) was administered to measure the participants’ receptive vocabulary in French (Échelle de vocabulaire en images Peabody – EVIP; Dunn et al., 1993).1 Researchers have also used this test as a measure of language proficiency (e.g. Luk et al., 2008). The participants had to identify within a set of four pictures the image corresponding to a given word. For practical reasons, the group administration procedure proposed by Bourque Richard (1998) was used, whereby the participants received a booklet and were told to cross out the image corresponding to the word they heard. According to this group procedure, a baseline (first item of the series) is first identified on the basis of the participants’ age group. Then, from this baseline a total of 41 items were presented to each age group. All the words were pre-recorded by a NS of French.
Phonological memory To assess our participants’ phonological working memory, we used the standardized Poncelet et al. (2003) test. The test required the participants to repeat two series of
Contribution of Syntactic and Metasyntactic Abilities
53
non-words (Gathercole et al., 1994). The first series is composed of 21 items following a CV syllabic structure. The non-words range from two to eight syllables (three words of each). The second series is composed of 15 words with CCV syllabic patterns, with the non-words ranging from two to six syllables. The non-words are chosen to follow French phonotactic patterns while being as much as possible dissimilar to existing French words, in order to avoid any potential access to lexical representations. All the items were pre-recorded by a French NS. The test was presented to the participants through headphones, and the participants’ answers were recorded.
Procedure For the NS, trained research assistants administered the syntactic, receptive vocabulary and reading comprehension tasks in class during regular school hours. Later the same day or the following day, a research assistant met the participants individually to complete a background information questionnaire, and to administer the phonological memory task and the two metasyntactic tasks (repetition-of-ungrammatical-sentences task and error replication task). The NNS who were studying at the same school as the French NS completed the tasks under the same conditions as the NS. The remaining NNS were met at their Portuguese Saturday school. For the MSA tasks, feedback was provided to the participants when needed to encourage them to continue. All individual tests were digitally recorded for further analysis. In order not to discourage the participants, it was decided to put an end to the administration after four consecutive missed items in one of the tasks (e.g. Roman et al., 2008).
Coding procedures Two independent raters coded the two MSA tasks. The coefficients of inter-rater reliability were 0.993 for the repetition task and 0.997 for the replication task. For the MSA repetition task, if the target (the syntactic error) was correctly repeated, a point was given. Errors in the repetition of the rest of the sentence (e.g. missing adjective (grosse/ big) in *La grosse tortue de mer nage plus lentement le dauphin *The big sea turtle swims slower the dolphin) were not considered. Since the participants were stopped after four missed items, the percentage of correctly repeated targets out of the total number of answers provided was calculated. For the replication task, one point was awarded for each correct answer, i.e. for correct production of the second sentence containing the same type of error found in the first sentence (maximum score = 15). For the repetition-of-grammatical-sentences task, the same procedure as for the repetition-of-ungrammatical-sentences task was used. Finally, for the preference task, one point was given for each correct answer (maximum score = 15). For the reading comprehension task, three different measures were calculated, based on the three different levels of comprehension, i.e., from the least to the most complex. One point was given per correct answer (maximum score = 12). For the receptive vocabulary task, since there was a ceiling effect in the score out of 41, and since the words are ranked according to level of difficulty, only the last 15 items
54
The Metalinguistic Dimension
were considered (maximum score = 15). Finally, the number of correct syllables was calculated using Poncelet et al.’s (2003) coding procedure in the phonological memory task. An item was considered correct if it did not contain more than one authorized transformation of a phoneme (CV, maximum = 105; CCV, maximum = 60, total = 165). Two independent judges coded the repetitions (Cronbach’s alpha = 0.979).
Data analysis The data were analysed using structural equation modelling. The analysis was conducted in two consecutive steps. First, the measurement model specifies the relationships between the measured and latent variables. Latent variables correspond to variables that are constructed using a combination of manifest variables – the actual collected data (Loehlin, 2009). In our specific case, the constructed – or latent – variables are Reading Comprehension, Metasyntactic Ability and Syntactic Ability, and the manifest variables are, respectively, the three different types of comprehension questions, the two metasyntactic tasks and the two syntactic tasks. Since receptive vocabulary and phonological memory might contribute to the construction of the three latent variables, and in particular to reading comprehension (e.g. Oakhill et al., 2003), their potential contribution was tested in one of the measurement models. Once latent variables are constructed, the structural model specifying the relationships among the latent variables is calculated (Kunnan, 1988). We applied a Generalized Structured Component Analysis (GSCA)2 (Hwang, 2009) to fit the specified model simultaneously to NS and NNS in order to examine the difference in parameter estimates between the two groups.
Results Descriptive statistics The means, standard deviations and reliability scores per group for each variable observed are shown in Table 3.1. The results show that both groups of participants obtained similar results on the three levels of the reading and receptive vocabulary tasks. However, the NS obtained higher scores than the NNS on the two syntactic tasks, the two MSA tasks and the phonological memory task.
Specification of the model and model fit The two models tested to evaluate the contribution of the variables identified in this study to reading comprehension are represented in Figure 3.1. The first model (M1) looked only at the two targeted abilities, the latent variables Syntactic and Metasyntactic Abilities, and postulated that they each had an effect on reading comprehension independent of the other. In Figure 3.1, all these relations are symbolized with solid lines. The second model (M2), in addition to postulating that both Syntactic and
Contribution of Syntactic and Metasyntactic Abilities
55
Table 3.1 Descriptive statistics of observed variables Tasks
Groups NS
Syntax: Preference Syntax: Repetition Metasyntactic: Replication Metasyntactic: Repetition Reading: Level 1 Reading: Level 2 Reading: Level 3 Phonological memory Receptive vocabulary
NNS
M
SD
M
SD
15.78 83.92 10.28 57.75 2.78 2.94 2.40 121.19 12.8
3.18 15.1 5.67 24.3 1.22 0.99 1.42 22.64 1.95
13.39 74.5 8.32 35.51 2.03 2.36 2.11 105.3 11.3
3.04 24.8 4.95 21.6 1.38 1.22 1.36 16.85 2.51
Note: Correlation between syntactic tasks: r = 0.6 (p < 0.05); correlation between MSA tasks: r = 0.5 (p < 0.05); Cronbach’s alpha between the three reading levels = 0.76.
Metasyntactic Abilities have an effect on reading comprehension, supposed that vocabulary knowledge and phonological memory might also play a role, either as a part of Syntactic and Metasyntactic Abilities or directly related to reading comprehension (dotted lines in Figure 3.1), and that Syntactic Ability might also affect Metasyntactic Ability (p3 in Figure 3.1). In M1, the three latent variables are constructed as follows. Syntactic Ability is constructed by loading the manifest variables measured by the preference (c1) and the repetition-of-grammatical-sentences tasks (c2), Metasyntactic Ability is constructed by loading the manifest variables measured by the repetition-of-ungram matical-sentences (c3) and the replication-of-error tasks (c4), and finally Reading Comprehension is constructed by loading the manifest variables measured by three levels of reading comprehension questions (c5–c7). The structural part of M1 consists of only two paths, one going from Syntactic Ability to Reading Comprehension (p1) and the other from Metasyntactic Ability to Reading Comprehension (p2). In M2, the possibility that Receptive Vocabulary and Phonological Memory might load on any of the other latent variables – Syntactic Ability (c8, c9), Metasyntactic Ability (c10, c11) and Reading Comprehension (c12, c13), was introduced into the measurement model. For the structural model, the link between Syntactic and Metasyntactic Abilities (p3) was added (represented by dashed lines in Figure 3.1). Thus, M1 is represented only by solid lines, while M2 is represented by solid and dashed lines.3 It should be noted that reflective arrows were used in the measurement model indicating that latent variables are illustrated by their respective measures.4 Table 3.2 provides the measures of overall model fit for the two models. The data presented support M2 over M1. This is shown by the GFI and SRMR values referring to the difference between the sample covariances and the covariances reproduced by the parameter estimates. A good model fit is indicated by values of GFI > 0.95 (the closer to 1 the GFI value is 1, the better is the model) and SRMR < 0.08 (Hu et al.,
Replications
Repetition of Ungrammatical sentences c4
c3
c9
c11
c8
c2
c1
c10
c13
c12
Metasyntactic Ability
p3
Syntactic Ability
p2
p1
Reading Comprehension
c7
c6
c5
Figure 3.1 Structural models (M1 and M2) for Reading Comprehension for NS and NNS
Phonological memory
Vocabulary
Repetition of grammatical sentences
Preference
Read 3
Read 2
Read 1
Contribution of Syntactic and Metasyntactic Abilities
57
Table 3.2 Four measures of overall model fit and number of free parameters Models
FIT AFIT GFI SRMR NPAR
M1
M2
0.557 0.528 0.982 0.202 32
0.605 0.567 0.987 0.071 58
1995). NPAR is the number of free parameters in the models. The FIT indicates the total variance of all variables explained by the model (the values of FIT can range from 0 to 1). AFIT (Adjusted FIT) takes model complexity into account, and this criterion may be used for model comparison: the model with the largest AFIT value may be selected among competing models (Hwang, 2010). Consider the first column, M1, in Table 3.2. This model explains about 55% of the global variance, but the value of the SRMR criterion is not high enough to accept M1 without any modifications (SRMR > 0.2 instead of < 0.08). Model M2 explains about 60% of the global variance and proves to be more robust than M1: GFI and SRMR reached their minimal expected values (respectively 0.98 and 0.071). The measurement and structural models of M2 are explained in more detail in the following sections.
Measurement model of M2 Table 3.3 displays the estimates of loadings of each construct (c1–c13) in the measurement model for the two groups of participants. The Squared Multiple Correlation (SMC) of each indicator indicates how much variance is explained by the corresponding latent variable. The bootstrap critical ratios (CR) evaluate the significance of each estimate parameter. The loading and quantity of variance (SMC) of each parameter were similar for both groups. The standardized factor loadings were generally large and statistically significant for Syntactic and Metasyntactic Abilities and Reading Comprehension. This suggests that the latent variables were well chosen to explain a large percentage of the variance of our observed variables. An examination of the results shows an important similarity between our two groups of participants in terms of loading and significance of the various constructs, but interesting differences can be observed: the replication-of-error (c4) task contributed to Metasyntactic Ability in a much more important way for the NS than for the NNS, and Receptive Vocabulary (c8, c10) and Phonological Memory (c9, c11) contributed in a marginally more important way to Syntactic and Metasyntactic Abilities for the NS than for the NNS. The three measures that comprised the reading comprehension test contributed significantly to reading comprehension (c5–c7) in a similar way for NS and NNS.
The Metalinguistic Dimension
58
Table 3.3 Estimates loadings and SMC for the NS and the NNS groups NS
NNS
Loading Est. Syntax C1 0.82 C2 0.92 C8 –1.37 C9 0.95 Metasyntax C3 0.88 C4 0.73 C10 1.49 C11 0.05 Reading C5 0.86 C6 0.7 C7 0.82 C12 0.31 C13 –0.09
SE
SMC
Loading
CR
Est.
SE
CR
0.06 12.8* 0.03 35.8* 0.36 3.9* 0.22 4.4*
0.68 0.85 1.87 0.89
0.10 0.05 1.13 0.46
6.7* 0.79 17.9* 0.87 1.6 0.73 1.93 –0.45
0.04 24.4* 0.09 8.4* 0.51 2.9* 0.26 0.2
0.78 0.53 2.22 0.002
0.06 0.12 1.54 0.09
12.3* 4.4* 1.4 0.02
0.07 12.9* 0.14 5.1* 0.06 14.3* 0.35 1.0 0.12 0.7
0.74 0.49 0.67 0.09 0.008
0.11 0.16 0.09 0.26 0.03
6.87* 3.11* 7.37* 0.36 0.31
Est.
SE
SMC CR
Est.
SE
CR
0.09 0.04 0.39 0.6
8.9* 19.4* 1.9 0.7
0.62 0.76 0.54 0.20
0.14 0.08 0.57 1.60
4.57* 9.72* 0.95 0.12
0.85 0.60 0.02 1.15
0.05 0.14 0.51 0.7
16.2* 4.3* 0.04 1.6
0.72 0.37 0.00 1.31
0.09 0.16 0.4 1.93
8.38* 2.34* 0.0 0.68
0.87 0.71 0.85 0.07 –0.15
0.05 0.09 0.07 0.31 0.32
17.2* 7.5* 12.4* 0.2 0.5
0.74 0.50 0.71 0.005 0.02
0.08 0.13 0.11 0.25 0.18
8.96* 3.95* 6.44* 0.02 0.12
Note: Asterisks correspond to significant effects at p < 0.05. Est. = Estimate; SE = Standard error; CR = Bootstrap critical ratio.
However, neither Receptive Vocabulary nor Phonological Memory was associated with Reading Comprehension. For Syntactic Ability, the factor loadings and SMC of preference (c1) and repetition-of- grammatical-sentences (c2) followed the same pattern for the NS and NNS. For Receptive Vocabulary (c8) and Phonological Memory (c9), the factor loadings were significant only for the NS, and the SMCs are not significant. For Metasyntactic Ability, the factor loading and SMC of Receptive Vocabulary (c10) was significant for the NS, but not for the NNS (SMC=0.00); Phonological Memory (c11) was not significant; repetition-of-ungrammatical-sentences (c3) is significant for both groups, with similar loadings; for the replication (c4) task, both factor loading (+0.13) and SMC (+0.16) were substantially greater for the NS than for the NNS. In summary, from the analysis of M2, it appears that our three latent variables (Syntactic Ability, Metasyntactic Ability and Reading Comprehension) are indeed illustrated by the data obtained from the measurement instruments (the two Syntactic Ability tests, the two Metasyntactic Ability tests and the three parts of the reading comprehension test), rather than receptive vocabulary or phonological memory. The model shows little difference in this respect between NS and NNS.
Structural model In Figure 3.1, paths p1, p2 and p3 (middle part of the figure) constitute the structural model for reading comprehension, SA and MSA. This model is based on the hypothesis
Contribution of Syntactic and Metasyntactic Abilities
59
Table 3.4 Estimates of path coefficients and bootstrap standard errors for NS and NNS groups NS Estimate P1 syntactic*RC P2 metasyntactic*RC P3 syntactic*metasyntactic
–0.260 1.032 0.889
NNS
SE
CR
0.428 0.361 0.021
0.61 2.86* 41.94*
Estimate
SE
CR
–0.162 0.843 0.753
0.243 0.200 0.083
0.67 4.21* 9.02*
Note: Asterisks correspond to significant effects (p < 0.05).
that the latent variables underlying Syntactic and Metasyntactic Abilities influence latent variables representing Reading Comprehension, and that Metasyntactic Ability is influenced by Syntactic Ability. The direction of the arrows represents these relations between the latent variables. Table 3.4 presents the parameter estimates for the path coefficients for the two groups of participants. The only latent variable that had a direct effect on Reading Comprehension was Metasyntactic Ability (p2), but with a larger effect for the NS. Crucially, Syntactic Ability had no direct effect on Reading Comprehension (p1), but it had a large significant effect on Metasyntactic Ability (p3). This effect was larger for the NS than for the NNS (0.89 vs 0.75).
Discussion As mentioned in the introduction, the way factors such as SA and MSA contribute to reading comprehension skills among NNS and how this contribution differs from NS, if it does, is still not well understood. The present study was conducted in order to provide some insight into this issue. Our first research question was: What is the respective weight of MSA and SA in explaining reading comprehension skills in French-speaking native and non-native children? In order to provide an answer to this question, we first verified whether MSA and SA were constructed by the measurements taken. Results show that SA is significantly constructed by both the preference (estimate for NS = 0.82; estimate for NNS = 0.79) and repetition-of-grammatical-sentences (estimate for NS = 0.92; estimate for NNS = 0.87) tasks, with very similar weights for the two groups, and that MSA is significantly constructed by both the repetition-of-ungrammatical-sentence s (estimate for NS = 0.88; estimate for NNS = 0.85) and the replication (estimate for NS = 0.73; estimate for NNS = 0.60) tasks for the two groups. However, differences can be observed in the groups’ respective weight for the replication task. The parameter estimate values indicate that less weight was loaded in the model for the NNS. A possible explanation is that the task was harder for the NNS. After making sure that MSA and SA were actually constructed by their respective measurements, we verified their particular contribution to the construction of reading comprehension. Our results reveal that MSA contributes significantly to the construction of reading comprehension for both groups. However, the weight of its
60
The Metalinguistic Dimension
contribution is more important for the NS (parameter estimate = 1.03) than for the NNS children (parameter estimate = 0.84), indicating that the NS relied more on their MSA than the NNS in order to comprehend the text they read in French. The existence of a strong link between MSA and reading comprehension raises the question of the exact nature of the contribution of MSA to reading comprehension. As mentioned in the introduction, MSA has been claimed to support reading comprehension by allowing for the prediction of upcoming words (e.g. Chiappe et al., 1999; Lesaux et al., 2003; Lipka et al., 2012; Low et al., 2005; Siegel, 1993). However our tests did not tap word prediction skills, while still providing a significant contribution to the construction of reading comprehension. MSA has also been claimed to facilitate the reassembly of decoded words into phrases (Tunmer et al., 1984), which is closer to the repetition-of-ungrammatical-sentences and replication-of-error tasks we used. In order to refine our understanding of the way in which MSA contributes to reading comprehension, further research should investigate the exact nature of the observed link by using tasks tapping different aspects of MSA (i.e. word prediction vs reassembly of decoded words into phrases). Syntactic ability, on the other hand, does not make a significant direct contribution to the componential model of reading comprehension for either group. However, it contributes indirectly through its contribution to MSA. This is contrary to the results of van Gelderen et al. (2007), who found that SA contributed to reading comprehension in L1 and in L2, and Verhoeven (1990), who found that SA in L2 explained reading in L2. In the case of van Gelderen et al. (2007), the difference between their results and ours probably lies in the type of SA measurement used in each study. In our case, the two syntactic tasks targeted parsing ability, whereas in van Gelderen et al. (2007) grammatical knowledge was assessed. In their study, the participants were asked to fill in correct forms of verbs while taking into account number, tense, aspect and agreement (p. 480). This task arguably requires higher levels of analysis of language than ours. As for the difference between our study results and those of Verhoeven (1990), we believe it is related to the age of the participants. Verhoeven’s participants were first and second elementary school graders, while our participants were at least 2 years older. The MSA of Verhoeven’s younger participants may not have been sufficiently developed to contribute to reading comprehension. As they get older, children’s mental representations become more explicit and more structured (Bialystok, 2001a, p. 14; but see also Karmiloff-Smith, 1986; Gombert, 1992), and control of attention improves (Bialystok, 1988). Therefore, the mental representations and cognitive processes necessary to carry out cognitive and linguistic tasks that make higher demands, such as reading comprehension tasks, become available as children get older. In our study, the contribution of SA to reading comprehension appears to have been mediated by MSA in both groups, but slightly more strongly in the case of the NS participants (parameter estimate for NNS = 0.75; for NS = 0.89). This observation provides an affirmative answer to our second research question, which was: Is there a relationship between MSA and SA in the modelling of reading comprehension in French-speaking native and non-native children? At first glance, the componential structures of reading comprehension seem to be very similar for both the NS and NNS, which is in line with what van Gelderen et al. (2004, 2007) concluded in their studies.
Contribution of Syntactic and Metasyntactic Abilities
61
The analyses conducted provided us with an additional piece of information regarding the particular weight carried by each factor in the explanation of reading comprehension among the NS and NNS; neither vocabulary nor phonological memory directly contributed to reading comprehension for either group. This contrasts with the results of Lesaux et al. (2010) and Schoonen et al.’s (1998) findings that vocabulary knowledge contributes significantly to reading in L1 and in L2. The discrepancy between our results and theirs might be due in part to differences in the vocabulary measures used in each study. Our results are based on a multiple-choice recognition task, whereas Schoonen et al. (1998) used a multiple-choice sentence completion task requiring both productive vocabulary knowledge and syntactic parsing. Additionally, they used regressions in order to determine the contribution of each variable studied. In Lesaux et al. (2010), vocabulary knowledge was part of a composite measure of language proficiency, which also included listening comprehension. Thus, it is impossible to know for certain the exact contribution of vocabulary knowledge to reading comprehension. Another possible explanation might be that in our test the level of vocabulary required to understand the text in the reading comprehension task did not represent a challenge for the participants. Our results regarding the absence of a direct contribution of either SA or vocabulary to reading comprehension also offer a different view on Cain’s (2007) findings. She concluded, from the multiple regressions she conducted on her data collected among NS, that the relationship between MSA and reading comprehension was indirect and arose from the variance shared with vocabulary, grammatical knowledge and memory. Instead, we would argue that the link between MSA and reading comprehension is direct, but that MSA is constructed by factors such as grammatical knowledge and vocabulary. This requires further investigation, of course. As for phonological memory, although it has traditionally been strongly correlated to reading comprehension, at least in L1 (e.g. Cain et al., 2004; Daneman et al., 1980, 1983), it did not contribute to reading comprehension in the present study. It is possible to think that the role of phonological memory is mediated by other factors. For instance, according to Nation et al. (1999), the relationship between phonological memory and reading comprehension is underpinned by linguistic skills. Therefore, one might hypothesize that the contribution of phonological memory to the construction of reading comprehension is mediated by vocabulary. The model we tested did not include such a potential link between phonological memory and vocabulary, but this is an issue further research might want to address. Finally, while neither vocabulary nor phonological memory contributed to either SA or MSA for the NNS, vocabulary emerged as significant for both SA and MSA for the NS, and phonological memory contributed only to SA in the NS group. According to Bates et al. (1997), SA and vocabulary are closely related in L1. Phonological memory has also been associated with SA in L1 (e.g. Adams et al., 1995; Sansavini et al., 2007), and more recently in L2 (e.g. Andrade et al., 2011; French et al., 2008). However, according to Andrade et al. (2011), the role of phonological memory in SA has not been clearly demonstrated yet, since in most previous studies the influence of vocabulary had not been teased apart from phonological memory in the results obtained. In the same vein, the instrument used to measure phonological memory
62
The Metalinguistic Dimension
in the present study was based on French phonotactic patterns, which might have biased the results obtained by the NNS, despite the fact that they were all educated in the French system (see Simard et al., 2013 for details). This might in part explain the absence of link between phonological memory and SA among the NNS. It should be noted that the contribution of phonological memory in the construction of MSA has never been directly investigated among NNS. These links will definitely have to be explicitly investigated in the future in order to get a better understanding of their contribution to reading comprehension.
Conclusion In the present study, we investigated the way MSA and SA contribute to reading comprehension among NS and NNS of French. It was shown that, for both groups of participants, MSA contributes significantly to the construction of reading comprehension, while the contribution of SA to reading comprehension is indirect as it is mediated by MSA. In addition to adding support to the assumption that MSA is an important contributor to reading comprehension for both NS and NNS, our results offer an important new piece of information regarding the interrelationship between MSA, SA and reading comprehension. As for differences between the two populations, the results revealed that NS and NNS are very similar. However, the weight of the contribution of MSA to reading comprehension as well as the contribution of SA to MSA is more important for NS. This was explained by their higher proficiency in French. In addition, there were differences between the two populations in the respective contributions of phonological memory and vocabulary to SA and to MSA.
Acknowledgements This research was supported by the Social Sciences and Humanities Research Council of Canada. We wish to thank Véronique Fortier and Karina DaRocha Da Silva who assisted us during various phases of the project. A special thanks goes to Stephen L. MacDonald for his help with the text.
Notes 1 Version B of the Échelle de vocabulaire en images Peabody was used, since version A overestimates language proficiency for children living in Quebec, according to Godard and Labelle (1995). 2 Until recently, two different approaches have been employed for structural equation modelling (Jöreskog et al., 1982). The first one is based on covariance structure analysis (Jöreskog, 1973), and the other on partial least squares (Wold, 1975). For the covariance structure analysis, a large sample size is necessary – Boomsma (1987) recommends 400 – and the data should have a normal (and multinormal) distribution. The partial least squares analysis does not rely on strict distributional assumptions,
Contribution of Syntactic and Metasyntactic Abilities
63
which are often violated in language data. However, it has an important flaw in that it does not provide a solution to the global optimization problem, indicating that there exists no particular measure to determine model parameter estimates (Jöreskog et al., 1982). In order to overcome the limitations associated with these two approaches, a third approach – the Generalized Structured Component Analysis (GSCA) – has recently been developed by Hwang and his collaborators (e.g. Hwang, 2009). GSCA is a component-based approach to structural equation modelling which enables measures of overall model fit and whose method of calculation does not assume multivariate normality. Because of its characteristics, the GSCA approach to structural equation modelling is best suited to our data and was chosen to perform componential analyses in an attempt to answer our two research questions. 3 Residual terms associated with the variables are not presented to make the figure more concise. 4 We considered our measurement model to be reflective, hence the direction of the arrows that are going out of the latent variables, as they show the effects of abstract latent variables. Latent variable measurement models such as the ones found in SEM can be either reflective or formative. ‘Reflective measures are caused by the latent construct, whereas, formative measures cause the latent construct’ (Freeze et al., 2007, p. 1482). More specifically, reflective measurement models are used to validate indicators (in our case the measures) and their latent variables (in our case the key variables). This was our intention with our measurement model. The direction of the arrows shows that changes in the latent variables are reflected by changes in the measures. In formative measurement models, the measures influence the construct.
References Abu-Rabia, S. and Siegel, L. S. (2002), ‘Reading, syntactic, orthographic, and working memory skills of bilingual Arab Canadian children’. Journal of Psycholinguistic Research, 31, 661–78. Adams, A.-M. and Gathercole, S. E. (1996), ‘Phonological working memory and spoken language development in young children’. Quarterly Journal of Experimental Psychology, 49, 216–33. Andrade, J. and Baddeley, A. (2011), ‘The contribution of phonological short-term memory to artificial grammar learning’. The Quarterly Journal of Experimental Psychology, 64, 960–74. Armand, F. (2000), ‘Le rôle des capacités métalinguistiques et de la compétence langagière orale dans l’apprentissage de la lecture en français langue première et seconde’. The Canadian Modern Language Review/La Revue canadienne des langues vivantes, 56, 469–95. Bates, E. and Goodman, J. C. (1997), ‘On the inseparability of grammar and the lexicon: Evidence from acquisition, aphasia and real-time processing’. Language and Cognitive Processes, 12, 507–84. Bialystok, E. (1988), ‘Levels of bilingualism and levels of linguistic awareness’. Developmental Psychology, 24, 560–7. — (1990), ‘Connaissances linguistiques et contrôle des activités de langage’, in D. Gaonac’h (ed.), Le français dans le Monde – Recherches et applications, ‘Acquisition et utilisation d’une langue étrangère, l’approche cognitive’. Paris: Hachette, pp. 50–8.
64
The Metalinguistic Dimension
— (1991), ‘Metalinguistic dimensions of bilingual language proficiency’, in E. Bialystok (ed.), Language Processing in Bilingual Children. Cambridge: Cambridge University Press, pp. 113–40. — (1993), ‘Metalinguistic awareness: The development of children’s representations of language’, in C. Pratt and A. F. Garton (eds), Systems of Representation in Children: Development and Use. New York: John Wiley and Sons, pp. 211–33. — (1999), ‘Cognitive complexity and attentional control in the bilingual mind’. Child Development, 70, 636–44. — (2001a), Bilingualism in Development: Language, Literacy, and Cognition. New York: Cambridge University Press. — (2001b), ‘Metalinguistic aspects of bilingual processing’. Annual Review of Applied Linguistics, 21, 169–81. Birdsong, D. (1989), Metalinguistic Performance and Interlinguistic Competence. Berlin: Springer. Blackmore, A. M. and Pratt, C. (1997), ‘Grammatical awareness and reading in Grade 1 children’. Merrill-Palmer, 43, 567–90. Boosma, A. (1987), ‘The robustness of maximum likelihood estimation in structural equation models’, in P. Cuttance and R. Ecob (eds), Structural Modeling by Example. Cambridge: Cambridge University Press, pp. 169–88. Bourque Richard, J. (1998), Les distorsions mnémoniques en fonction de l’âge, le vocabulaire et la suggestibilité. Unpublished PhD thesis, Université de Moncton, Canada. Bower, G. H. and Morrow, D. G. (1990), ‘Mental models in narrative comprehension’. Science, 247, (4938), 44–8. Bowey, J. A. (1986), ‘Syntactic awareness in relation to reading skill and ongoing comprehension monitoring’. Journal of Experimental Child Psychology, 41, 282–99. — (2005), ‘Grammatical sensitivity: Its origins and potential contribution to early word reading skill’. Journal of Experimental Child Psychology, 90, 318–43. Cain, K. (2007), ‘Syntactic awareness and reading ability: Is there any evidence for a special relationship?’ Applied Psycholinguistics, 28, 679–94. Cain, K. and Nash, H. M. (2011), ‘The influence of connectives on young readers’ processing and comprehension of text’. Journal of Educational Psychology, 103, 429–41. Cain, K., Oakhill, J. and Bryant, P. E. (2004), ‘Children’s reading comprehension ability: Concurrent prediction by working memory, verbal ability, and component skills’. Journal of Educational Psychology, 96, 31–42. Carlisle, J. F. (1995), ‘Morphological awareness and early reading achievement’, in L. B. Feldman (ed.), Morphological Aspects of Language Processing. Hillsdale, NJ: Erlbaum, pp. 189–209. Carr, T. H. and Levy, B. (eds) (1990), Reading and its Development: Component Skills Approaches. San Diego: Academic Press. Cataldo, M. G. and Oakhill, J. (2000), ‘Why are poor comprehenders inefficient searchers? An investigation into effects of text representation and spatial memory on the ability to locate information in text’. Journal of Educational Psychology, 92, (4), 791–9. Chiappe, P. and L. S. Siegel (1999), ‘Phonological awareness and reading acquisition in English- and Punjabi-speaking Canadian children’. Journal of Educational Psychology, 91, 20–8. Clay, M. M. (1971), ‘Sentence repetition: Elicited imitation of a controlled set of syntactic structures by four language groups’. Monograph of the Society for Research in Child Development, 36 (No. 143).
Contribution of Syntactic and Metasyntactic Abilities
65
Comblain, A. (2005), ‘Le développement de la syntaxe et de la métasyntaxe et leur évaluation’, in B. Piérart (ed.), Le langage de l’enfant. Bruxelles: DeBoeck, pp. 83–98. Da Fontoura, H. and Siegel, L. (1995), ‘Reading, syntactic, and working memory skills of bilingual Portuguese-English Canadian children’. Reading and Writing: An Interdisciplinary Journal, 7, 139–53. Daneman, M. and Carpenter, P. A. (1980), ‘Individual differences in working memory and reading’. Journal of Verbal Learning and Verbal Behaviour, 19, 450–66. — (1983), ‘Individual differences in integrating information between and within sentences’. Journal of Experimental Psychology: Learning, Memory, and Cognition, 9, 561–84. de Villiers, J. G. and de Villiers, P. A. (1972), ‘Early judgments of semantic and syntactic acceptability by children’. Journal of Psycholinguistic Research, 1, 299–310. — (1974), ‘Competence and performance in child language: Are children really competent to judge?’ Journal of Child Language, 1, 11–22. Demont, E. (1994), Développement métalinguistique et apprentissage de la lecture. Unpublished PhD thesis, Université de Bourgogne, France. Demont, E. and Gombert, J. E. (1996), ‘Phonological awareness as a predictor of recoding skills and syntactic awareness as a predictor of comprehension skills’. British Journal of Educational Psychology, 66, 315–23. Denhiere, G. and Baudet, S. (1992), Lecture, compréhension de texte et sciences cognitives. Paris: PUF. Dunn, L. M. and Dunn, L. M. (1981), Peabody Picture Vocabulary Test – Revised. Circle Pines, NM: American Guidance Services. Dunn, L. M., Theriault-Whalen, C. M. and Dunn, L. M. (1993), Échelle de vocabulaire en images Peabody. Toronto: Editions Psycan. Efron, B. (1982), The Jackknife, the Bootstrap and Other Resampling Plans. Philadelphia, PA: SIAM. Ellis, R. (1991), ‘Grammaticality judgments and second language acquisition’. Studies in Second Language Acquisition, 13, 161–86. Foucambert, D. (2003), Syntaxe, vision parafovéale et processus de lecture. Contribution du modèle structural à la pédagogie. Unpublished PhD thesis, Université Grenoble 2. — (2009), ‘L’amélioration de la compréhension en lecture de jeunes collégiens par un entraînement syntaxique: modalités, résultats et perspectives’. Revue des Sciences de l’Éducation, 35, 41–69. Freeze, R. D. and Raschke, R. S. (2007), ‘An assessment of formative and reflective constructs in IS research’. Proceedings of ECIS, 1481–2. French, L. M. and O’Brien, I. (2008), ‘Phonological memory and children’s second language grammar learning’. Applied Psycholinguistics, 29, 463–87. Fu, J.-R. (2007), VisualPLS 1.04b. Retrieved 19 October 2011 from http://fs.mis.kuas.edu. tw/~fred/vpls/index.htm. Gathercole, S. E., Willis, C. S., Baddeley, A. D. and Emslie, H. (1994), ‘The children’s test of nonword repetition: A test of phonological working memory’. Memory, 2, 103–27. Gaux, C. and Demont, E. (1997), ‘Conscience phonologique, morpho-syntaxique et lecture: étude chez les jeunes enfants et les pré-adolescents’, in C. Barre-De Miniac and B. Lete (eds), L’illettrisme: De la prévention chez l’enfant a la prise en chargé chez l’adulte. Approches pluridisciplinaires. Bruxelles: De Boeck, pp. 203–23. Gaux, C. and Gombert, J.-E. (1999), ‘Implicit and explicit syntactic knowledge and reading in pre-adolescents’. British Journal of Developmental Psychology, 17, 169–88.
66
The Metalinguistic Dimension
Geva, E. and Siegel, L. S. (2000), ‘Orthographic and cognitive factors in the concurrent development of basic reading skills in two languages’. Reading and Writing: An Interdisciplinary Journal, 12, 1–30. Godard, L. and Labelle, M. (1995), ‘Utilisation de l’EVIP avec une population québécoise’. Fréquences, 7, 18–20. Goff, D. A., Pratt, C. and Ong, B. (2005), ‘The relations between children’s reading comprehension, working memory, language skills and components of reading decoding in a normal sample’. Reading and Writing: An Interdisciplinary Journal, 18, 583–616. Gombert, J.-E. (1992), Metalinguistic Development. Chicago: Chicago University Press. Gombert, J.-E., Gaux, C. and Demont, E. (1994), ‘Capacité métalinguistique et lecture. Quels liens?’ Repères, 9, 61–73. Goswami, U. and Bryant, P. E. (1990), Phonological Skills and Learning to Read. Hove: Lawrence Erlbaum and Associates. Graesser, A. C., McNamara, D. S. and Louwerse, M. (2003), ‘What do readers need to learn in order to process coherence relations in narrative and expository text?’, in A. P. Sweet and C. E. Snow (eds), Rethinking Reading Comprehension. New York: Guilford, pp. 82–98. Graesser, A. C., Louwerse, M., McNamara, D. S., Olney, A. M., Cai, Z. and Mitchell, H. H. (2007), ‘Inference generation and cohesion in the construction of situation models: Some connections with computational linguistics’, in F. Schmalhofer and C. A. Perfetti (eds), Higher Level Language Processes in the Brain: Inference and Comprehension Processes. Mahwah, NJ: Lawrence Erlbaum, pp. 289–310. Guthrie, J. T. and Kirsch, I. S. (1987a), ‘Distinctions between reading comprehension and locating information in text’. Journal of Educational Psychology, 79, 220–7. Guthrie, J. T. and Peter, M. (1987b), ‘Literacy as multidimensional: Locating information and reading comprehension’. Educational Psychologist, 22, (3/4), 279–97. Haberlandt, K. and Graesser, A. (1985), ‘Component processes in text comprehension and some of their interactions’. Journal of Experimental Psychology: General, 114, (3), 357–74. Hill, T. and Lewicki, P. (2007), Statistics Methods and Applications. Tulsa, OK: StatSoft. Hu, L.-T. and Bentler, P. (1995), ‘Evaluating model fit’, in R. H. Hoyle (ed.), Structural Equation Modeling: Concepts, Issues, and Applications. London: Sage, pp. 76–99. Hwang, H. (2009), ‘Regularized generalized structured component analysis’. Psychometrika, 74, 517–30. — (2010), GeSCA User’s Manual. Retrieved from GESCA website: www.sem-gesca.org/ manual.php. Hwang, H., Malhotra, N. K., Kim, Y., Tomiuk, M. A. and Hong, S. (2010), ‘A comparative study on parameter recovery of three approaches to structural equation modeling’. Journal of Marketing Research, 47, 699–712. Johnson-Laird, P. N. (1983), Mental Models: Toward a Cognitive Science of Language, Inference and Consciousness. Cambridge, MA: Harvard University Press. Jöreskog, K. G. (1973), ‘A general method for estimating a linear structural equation system’, in A. S. Goldberger and O. D. Duncan (eds), Structural Equation Models in the Social Sciences. New York: Academic Press, pp. 85–112. Jöreskog, K. G. and Wold, H. (1982), ‘The ML and PLS techniques for modeling with latent variables: Historical and comparative aspects’, in H. Wold and K. G. Jöreskog (eds), Systems Under Indirect Observation: Causality, Structure, Prediction I. Amsterdam: North-Holland, pp. 263–70.
Contribution of Syntactic and Metasyntactic Abilities
67
Just, M. A. and Carpenter, P. A. (1992), ‘A capacity theory of comprehension: Individual differences in working memory’. Psychological Review, 99, 122–49. Karmiloff-Smith, A. (1986), ‘From meta-processes to conscious access: Evidence from metalinguistic and repair data’. Cognition, 23, 95–147. Kintsch, W. (1988), ‘The use of knowledge in discourse processing: A constructionintegration model’. Psychological Review, 95, 163–82. — (1998), Comprehension: A Paradigm for Cognition. Cambridge: Cambridge University Press. Koda, K. (2007), ‘Reading and language learning: Crosslinguistic constraints on second language reading development’. Language Learning, 57, 1–44. Kunnan, A. J. (1998), ‘An introduction to structural equation modelling for language assessment research’. Language Testing, 15, 295–332. LaBerge, D. and Samuels, S. J. (1974), ‘Toward a theory of automatic information processing in reading’. Cognitive Psychology, 6, 293–323. Lesaux, N. K. and Siegel, L. S. (2003), ‘The development of reading in children who speak English as a second language’. Developmental Psychology, 39, 1005–119. Lesaux, N. K., Lipka, O. and Siegel, L. S. (2006), ‘Investigating cognitive and linguistic abilities that influence the reading comprehension skills of children from diverse linguistic backgrounds’. Reading and Writing, 19, 99–131. Lesaux, N. K., Rupp, A. A. and Siegel, L. S. (2007), ‘Growth in reading skills of children from diverse linguistic backgrounds: Findings from a 5-year longitudinal study’. Journal of Educational Psychology, 99, 821–34. Lesaux, N. K., Crosson, A., Kieffer, M. J. and Pierce, M. (2010), ‘Uneven profiles: Language minority learners’ word reading, vocabulary, and reading comprehension skills’. Journal of Applied Developmental Psychology, 31, 475–83. Lipka, O. and Siegel, L. S. (2007), ‘The development of reading skills in children with English as a second language’. Scientific Studies of Reading, 11, 105–31. — (2012), ‘The development of reading comprehension skills in children learning English as a second language’. Reading and Writing: An Interdisciplinary Journal, 25, 1873–98. Loehlin, J. C. (2004), Latent Variable Models (4th edn). Mahwah, NJ: Lawrence Erlbaum Associates. Low, P. B. and Siegel, L. S. (2005), ‘A comparison of the cognitive processes underlying reading comprehension in native English and ESL speakers’. Written Language & Literacy, 8, 207–31. Luk, G. and Bialystok, E. (2008), ‘Common and distinct cognitive bases for reading in English–Cantonese bilinguals’. Applied Psycholinguistics, 29, 269–89. Lust, B. (2006), Child Language Acquisition and Growth. Cambridge: Cambridge University Press. Lust, B., Flynn, S. and Foley, C. (1996), ‘What children know about what they say – Elicited imitation as a research method’, in C. McDaniel, C. McKee and H. Cairnes (eds), Methods for Assessing Children’s Syntax. Cambridge, MA: MIT Press, pp. 55–76. Martohardjono, G., Otheguy, R., Gabriele, A., de Goeas-Malonne, M., Szupica-Pyrzanowski, M., Troseth, E., Rivero, S. and Schutzman, Z. (2005), ‘The role of syntax in reading comprehension: A study of bilingual readers’, in J. Cohen, K. McAlister, K. Rolstad and J. MacSwan (eds), ISB4: Proceedings of the 4th International Symposium on Bilingualism. Somerville, MA: Cascadilla Press, pp. 1522–44. Nation, K. and Snowling, M. J. (1997), ‘Assessing reading difficulties: The validity and utility of current measures of reading skill’. British Journal of Educational Psychology, 67, 359–70.
68
The Metalinguistic Dimension
— (2000), ‘Factors influencing syntactic awareness skills in normal readers and poor comprehenders’. Applied Psycholinguistics, 21, 229–41. Nation, K., Adams, J. W., Bowyer-Crane, C. A. and Snowling, M. J. (1999), ‘Working memory deficits in poor comprehenders reflect underlying language impairments’. Journal of Experimental Child Psychology, 73, 139–58. Nocus, I. and Gombert, J.-E. (1997), ‘Conscience morpho-syntaxique et apprentissage de la lecture’. Revue de psychologie de l’éducation, 3, 71–101. Oakhill, J. V., Cain, K. and Bryant, P. E. (2003), ‘The dissociation of word reading and text comprehension: Evidence from component skills’. Language and Cognitive Processes, 18, 443–68. Perfetti, C. A. (1999), ‘Comprehending written language: A blueprint of the reader’, in C. M. Brown and P. Hagoort (eds), The Neurocognition of Language. Oxford: Oxford University Press, pp. 167–208. Poncelet, M. and Van der Linden, M. (2003), ‘Assessing the phonological store of working memory: elaboration of a non-word repetition test for a French-speaking population’. Revue de neuropsychologie, 13, 377–407. Rego, L. L. B. and Bryant, P. E. (1993), ‘The connection between phonological, syntactic and semantic skills and children’s reading and spelling’. European Journal of Psychology of Education, 8, 235–46. Roman, A. A., Kirby, J. R., Parrila, R. K., Wade-Woolley, L. and Deacon, S. H. (2008), ‘Toward a comprehensive view of the skills involved in word reading in Grades 4, 6, and 8’. Journal of Experimental Child Psychology, 102, 96–113. Sansavini, A., Guarini, A., Alessandroni, R., Faldella, G., Giovanelli, G. and Salvioli, G. (2007), ‘Are early grammatical and phonological working memory abilities affected by preterm birth?’ Journal of Communication Disorders, 40, 239–56. Schoonen, R., Hulstijn, J. and Bossers, B. (1998), ‘Metacognitive and language-specific knowledge in native and foreign language reading comprehension: An empirical study among Dutch students in grade 6, 8, and 10’. Language Learning, 48, 71–106. Siegel, L. S. (1993), ‘The development of reading’, in H. W. Reese (ed.), Advances in Child Development and Behaviour (Vol. 24). San Diego, CA: Academic Press, pp. 63–97. Simard, D. and Fortier, V. (2007), ‘Metasyntactic ability development among submersion and French native elementary school children: Investigation of task demand’, in Z. Han (ed.), Understanding Second Language Processes. Clevedon: Multilingual Matters, pp. 160–75. Simard, D., Fortier, V. and Foucambert, D. (2013), ‘Measuring the metasyntactic ability of Portuguese speaking children living in a French speaking environment’. Bilingualism: Language and Cognition, 16, 19–31 Smith-Lock, K. M. (1995), ‘Morphological usage and awareness in children with and without specific language impairment’. Annals of Dyslexia, 45, 163–85. Snow, C. E., Cancino, H., Gonzalez, P. and Shriberg, E. (1989), ‘Giving formal definitions: An oral language correlate of school literacy’, in D. Bloome (ed.), Literacy and Classrooms. Norwood, NJ: Albex, pp. 233–49. Stanovich, K. E. (1991), ‘Word recognition: Changing perspectives’, in R. Barr, M. L. Kamil, P. B. Mosenthal and P. D. Pearson (eds), Handbook of Reading Research (Vol. 2). New York: Longman, pp. 418–52. Stothard, S. E. and Hulme, C. (1996), ‘A comparison of reading comprehension and decoding difficulties in children’, in C. Cornoldi and J. Oakhill (eds), Reading Comprehension Difficulties: Processes and Intervention. Mahwah, NJ: Lawrence Erlbaum, pp. 95–106.
Contribution of Syntactic and Metasyntactic Abilities
69
Tunmer, W. E. (1990), ‘The role of language prediction skills in beginning reading’. New Zealand Journal of Educational Studies, 25, 95–114. Tunmer, W. E. and Bowey, J. A. (1984), ‘Metalinguistic awareness and reading acquisition’, in W. E. Tunmer, C. Pratt and M. L. Herriman (eds), Metalinguistic Awareness in Children: Theory, Research, and Implications. New York: Springer-Verlag, pp. 144–68. Tunmer, W. E., Herriman, M. L. and Nesdale, A. R. (1988), ‘Metalinguistic abilities and beginning reading’. Reading Research Quarterly, 23, 134–58. Tunmer, W. E. and Hoover, W. A. (1992), ‘Cognitive and linguistic factors in learning to read’, in P. E. Gough, L. C. Ehri and R. Treiman (eds), Reading Acquisition. Hillsdale, NJ: Lawrence Erlbaum, pp. 175–214. Tunmer, W. E., Nesdale, A. R. and Wright, A. D. (1987), ‘Syntactic awareness and reading acquisition’. British Journal of Developmental Psychology, 5, 25–34. Valdés, G. (2001). ‘Heritage language students: Profiles and possibilities’, in J. K. Peyton, D. A. Ranard and S. McGinnis (eds), Heritage Languages in America: Preserving a National Resource. Washington, DC & McHenry, IL: Center for Applied Linguistics & Delta Systems, pp. 37–77. van Dijk, T. A. and Kintsch, W. (1983), Strategies of Discourse Comprehension. New York: Academic Press. van Gelderen, A., Schoonen, R., Stoel, R. D., De Glopper, K. and Hulstijn, J. (2007), ‘Development of adolescent reading comprehension in Language 1 and Language 2: A longitudinal analysis of constituent components’. Journal of Educational Psychology, 99, 477–91. van Gelderen, A., Schoonen, R., De Glopper, K., Hulstijn, J., Simis, A., Snellings, P. and Stevenson, M. (2004), ‘Linguistic knowledge, processing speed and metacognitive knowledge in first and second language reading comprehension: A componential analysis’. Journal of Educational Psychology, 96, 19–30. van Gelderen, A., Schoonen, R., De Glopper, K., Hulstijn, J., Snellings, P., Simis, A. and Stevenson, M. (2003), ‘Roles of linguistic knowledge, metacognitive knowledge and processing speed in L3, L2 and L1 reading comprehension: A structural equation modeling approach’. International Journal of Bilingualism, 7, 7–25. Verhoeven, L. (1990), ‘Acquisition of reading in a second language’. Reading Research Quarterly, 25, 90–114. Wold, H. (1975), ‘Path models with latent variables: The NIPALS approach’, in H. M. Blalock, A. Aganbegian, F. M. Borodkin, R. Boudon and V. Cappecchi (eds), Quantitative Sociology: International Perspectives on Mathematical Statistical Model Building. New York: Academic Press, pp. 307–57. Zwaan, R. A., Langston, M. C. and Graesser, A. C. (1995), ‘The construction of situation models in narrative comprehension: An event-indexing model’. Psychological Science, 6, 292–7.
4
Effects of Instruction on Learners’ Acquisition of Metalinguistic Knowledge Rosemary Erlam
University of Auckland
Abstract This chapter investigates how amenable a particular linguistic feature is to explicit instruction drawing on metalinguistic descriptions. It presents a study where adult ESL students (n = 51) were given explicit instruction and language practice activities targeting the use of the indefinite article to mark generic reference. Learning was assessed on a Grammaticality Judgement test (ungrammatical items only, Ellis, 2005) and a Metalinguistic Knowledge test. Results investigate the impact of instruction on metalinguistic knowledge and metalingual competence, allowing also for a comparison of the relative effectiveness of two types of instruction (input-based and outputbased). The relationship between results on the Grammaticality Judgement test and a hypothesized measure of implicit language knowledge is also presented.
Introduction Research studies investigating the effectiveness of different types of L2 instruction will often incorporate a measure of metalinguistic judgement. Norris and Ortega (2000) found that 14 out of the 49 experimental and quasi-experimental studies they incorporated in their landmark meta-analysis included metalinguistic judgement as one of their outcome measures. However, notwithstanding the not uncommon use of measures of metalinguistic knowledge, one can have the impression that results on these measures are not given as much weight or importance in reporting learning outcomes as other measures that give information about language ‘use’, which is seen to be a more valid indication of language acquisition. This may explain why there is a relative lack of focus on metalinguistic learning in instructed contexts and, as Roehr (2008) points out, a lack of research investigating how amenable particular linguistic features are to explicit language instruction drawing on metalinguistic descriptions.
72
The Metalinguistic Dimension
Definition Metalinguistic knowledge is a learner’s explicit knowledge about the syntactic, morphological, lexical, phonological and pragmatic features of a L2 (Roehr and Gánem-Gutiérrez, 2009). It is knowledge that is analysed, requires deliberate focus and, as Elder (2009) claims, learners know when they are drawing on it, in, for example, making judgements about the grammaticality of sentences or in editing writing. One can demonstrate metalinguistic knowledge without being able to use metalanguage. Metalanguage is any language used to talk or write about language (Berry, 2005). It is not restricted to the use of linguistic or grammatical terminology only, because one can ‘talk’ about language without using such terminology. Metalingual is the adjective derived from the word metalanguage. Berry (2004) refers to ‘metalingual competence’, given that learners seem to have differing levels of ability to write or talk about language. In the present chapter metalingual competence will be considered a subcomponent of metalinguistic knowledge. Metalinguistic knowledge is differentiated from implicit language knowledge, which is intuitive knowledge that cannot be directly reported (Ellis, 2008). Loewen and Reinders (2011) state that learners may draw on implicit knowledge in completing some measures of metalinguistic knowledge, such as Grammaticality Judgement tests, but they would not be able to draw on this knowledge to explain the basis for decisions they had made. Most researchers would see linguistic competence as consisting primarily of implicit L2 knowledge, in that it is primarily this knowledge that allows for effective language use in real-time contexts. There is some debate as to whether the distinction between implicit and explicit knowledge should be seen as a continuum or a dichotomy (Ellis, 2008).
Review of research Research that has focused on learners’ explicit or metalinguistic knowledge has done so mainly with the aim of investigating ‘the interface question’, that is, the impact of explicit instruction on implicit knowledge. This is an unresolved issue, however, there is an impressive body of research which suggests that explicit instruction does seem to enable learners to acquire explicit knowledge (Norris and Ortega, 2000).There are fewer studies that look at the impact of instruction on metalinguistic knowledge along with metalingual competence, and those that do, tend to look at knowledge that learners may have as a result of exposure to long periods of formal instruction (e.g. Green and Hecht, 1992; Hu, 2002) rather than at the impact of a short, focused treatment on metalinguistic learning. Of the former, Green and Hecht (1992) investigated German learners’ knowledge of 12 common target structures in L2 English. The learners were asked to correct sentences and explain the rule that had been violated. They were able to correct 78% of the sentences but could state the correct rule in less than half the cases (46%). Hu (2011) conducted a study which was designed to investigate the acquisition of metalinguistic knowledge and the metalingual competence of learners exposed to large amounts of explicit grammar instruction. All 76 participants were Chinese learners of English and had been exposed to large amounts of explicit and detailed
Effects of Instruction
73
English grammar instruction. Hu had participants formulate 49 rules to explain the use of underlined structures in sentences. He found that their rule formulation was mostly consistent with typical pedagogical rules and that they could use a large number of metalingual terms. On the other hand, Elder et al. (2007) found that a group of Malaysian teacher trainees, who had had considerable exposure to explicit grammarbased instruction, achieved less than 50% of acceptable responses on a task requiring them to give rule explanations and that their command of metalingual terminology was even weaker. These same participants performed at a high level when asked to judge the grammaticality of sentences, however. The researchers pointed out that performance with respect to rules differed widely and concluded that participants seemed more successful with pedagogical rules that are traditionally taught early and ‘recycled’. They also suggested other factors that could determine difficulty such as processing constraints (Pienemann, 1998) and the transparency of form-meaning links (DeKeyser, 2005). With respect to research that has investigated the impact of focused and shorter periods of instruction on metalinguistic learning, the results of Norris and Ortega’s meta-analysis (2000) are informative, in that 40 out of the 49 studies they investigated had instructional treatments of less than 6 hours. For the 14 studies that included metalinguistic judgements as an outcome measure, Norris and Ortega report a large average effect size (d = 0.82). Roehr (2010) hypothesizes that rules which are high in schematicity (i.e. can be applied across a class rather than to a specific linguistic construction only), low in conceptual complexity, low in technicality of metalanguage and high in truth value (i.e. apply without exception) will be more easily learned than rules which do not meet these criteria. She also suggests (Roehr, 2008) that constructions that demonstrate comparatively ‘stable, and context independent usage patterns should be more amenable to explicit teaching (p. 93)’ than constructions that do not demonstrate these characteristics. There has been little attention in the research literature as to whether different types of instruction are more effective in fostering the development of metalinguistic knowledge. This is perhaps not surprising given the previously mentioned preoccupation in the research literature with the development of implicit rather than explicit language knowledge. However, in her Output Hypothesis, Swain (1995) has argued that the opportunity for metalinguistic reflection and the chance to process language syntactically during the process of language production may foster L2 development. She thus stresses a key role for output in the language learning process, a position that has put her at variance with a number of researchers (VanPatten, 1996; Krashen, 1998) who argue that language acquisition is input-driven and deny a significant role for output. More recently, Izumi (2003) and Ellis (2003) have also argued that the need to encode meaning during language production can force learners to have recourse to their own cognitive resources, including any metalinguistic knowledge they might have, and that this may have a flow-back effect into the developing language system. Drawing on Levelt’s model, Muranoi (2007) suggests that where learners are given opportunities to convey messages relying on their lexical, grammatical and phonological knowledge, there are opportunities for hypothesis testing which is crucial for interlanguage development, a claim that is endorsed by Swain.
74
The Metalinguistic Dimension
The hypothesis that giving learners opportunities to engage in language output may allow for metalinguistic reflection suggests the possibility that instruction that favours language output may more successfully promote the development of metalinguistic knowledge than instruction that favours language input. There is a lack of research that can speak to this issue because, as Toth (2006) points out, most studies that compare the effectiveness of input-based and output-based instruction assess learning outcomes on Interpretation and Production tests rather than on measures of Metalinguistic knowledge. An exception is his own study (Toth, 2006), which does address the relative effectiveness of these two types of instruction and which includes a Grammaticality Judgement test. Toth compared two groups of students, one that completed structured input activities, following a Processing Instruction model (VanPatten, 2004), where they did not engage in production of the target structure (anticausative clitic ‘se’ in L2 Spanish) and another where students completed activities that required them to use the target structure in meaningful contexts. Students in this second, ‘Output’ group also received feedback as to the well-formedness of their utterances. Toth recorded the classroom interaction of both groups and presents in his paper an example in the ‘Output group’ of how a learner’s engaging in communicating meaning led to metalinguistic reflection which pushed the learner to correctly reformulate her previously incorrect utterance. On the Grammaticality Judgement test, however, there was no statistically significant difference between the results for both groups, although the Output groups’ scores were higher overall. Thus there was no conclusive evidence to suggest that greater opportunity for metalinguistic reflection led to greater gains in metalinguistic knowledge. On the other hand, the Output group did make statistically significant greater gains than the Input group on a guided Production test. Toth goes on to highlight a need for further research on the links between metalinguistic knowledge and output. In a similar vein Muranoi (2007) states that the role of output in language acquisition remains contentious and argues that L2 researchers should examine the roles of output in acquiring various linguistic rules, especially complex syntactic rules.
Role of metalinguistic knowledge in L2 acquisition The issue of the impact of explicit instruction on implicit knowledge, commonly known as ‘the interface question’, is still a largely unresolved issue. Berry (2005) refers to a lack of evidence that metalanguage supports language proficiency and Hu (2011) refers to the controversy that has raged over the usefulness of an explicit focus and metalinguistic knowledge in L2 learning. Researchers (e.g. Ellis, 2009) have tended to suggest that explicit knowledge makes only the very indirect contribution to acquisition of focusing attention to form in L2 input. In a review of empirical research on explicit and implicit knowledge, however, DeKeyser (2009) concludes that explicit knowledge, when fully proceduralized and automatized, may be directly involved in real-time language use. This means, DeKeyser claims (2003), that it might not be possible to distinguish the two types of knowledge empirically given that explicit knowledge may be proceduralized so as to make it indistinguishable from implicit knowledge. Evidence for this hypothesis would be the demonstration of some relationship between the two
Effects of Instruction
75
types of knowledge, that is, explicit or, as is more relevant in the case of this study, metalinguistic knowledge and implicit language knowledge as a result of opportunities that might be given in instruction to proceduralize language. DeKeyser notes that proceduralization may be facilitated through either comprehension or production practice, but specifies that procedural knowledge is unlikely to be transferred from one skill to another, that is, practice in comprehension, for example, is likely to lead to improvement in language comprehension only.
Measuring metalinguistic knowledge The grammaticality judgement test is a common measure of metalinguistic knowledge (Loewen, 2009). Ellis (2004) outlines three steps that learners may go through in making a grammaticality judgement: 1. Semantic processing, 2. Noticing and 3. Reflecting. Ellis claims that in judging a grammatical sentence, learners may stop after step 2. Reflection may only be necessary if the test taker has noticed something ungrammatical. For this reason ungrammatical sentences are a better measure of explicit, metalinguistic knowledge than grammatical sentences. The test taker may rely on implicit knowledge to make decisions about grammatical sentences but is more likely to draw on explicit knowledge in judging ungrammatical sentences. Loewen’s results (2009) from the trialling of a grammaticality judgement test support this conclusion. Grammaticality judgement tests may have a number of additional optional features such as requiring learners to correct sentences that they judge as incorrect (Ellis, 1991) or indicating the degree of confidence they have in judgements made (Gass, 1994). Another way of measuring metalinguistic knowledge is to require students to match items from a list of grammatical terms to corresponding exemplars in a sentence or text or to have learners choose the explanation which best describes an error or language feature (Elder, 2009). A common measure of metalingual competence is to have learners verbalize language rules (Elder, 2009). Roehr (2008) observes that the verbalization of metalinguistic knowledge often takes the form of a ‘proposition involving at least two categories and a relation between them (p. 85)’.
The present study The questions which the study asked are as follows: 1. What effect does instruction have on learners’ metalinguistic knowledge of the target structure? 2. Does the type of instruction (i.e. input-based versus output-based instruction) lead to differences in the acquisition of metalinguistic knowledge? 3. What relationship is there between a measure of metalinguistic knowledge and a measure of implicit language knowledge used to assess learning following instruction?
76
The Metalinguistic Dimension
Method Participants The study was conducted in two private language schools in New Zealand with students enrolled in ‘general English’ courses of between four and six weeks in length. In each school two intact classes of intermediate level students were randomly allocated to one of two treatment options: input based (InI) or output-based (OI). Students receiving OI totalled 19 while students receiving InI totalled 20. A third group of students at one of the language schools formed the Control group (n = 12), receiving no instructional treatments as part of this study but taking part in all testing episodes. A total of 51 students took part in the study. A background information questionnaire showed that the majority of students came from an Asian language-speaking background (Japanese, Chinese, Taiwanese, Korean). The remainder came from a variety of language backgrounds (Spanish, Arabic, German, Russian, Portuguese). See Table 4.1 for more information about the students.
Design All students sat a pre-test at the end of a teaching week. Commencing the first day back in class, students in the treatment groups received lessons over two consecutive days, each of 1-hour duration. The same researcher taught all lessons. The post-test was completed the day after the second lesson and the delayed post-test two weeks later. For all testing episodes participants completed the Oral Elicited Imitation test first, followed by the Grammaticality Judgement test and the Metalinguistic Knowledge test. The Control group completed all testing episodes over the same time scale.
Target structure The target structure was the use of the indefinite article to mark generic reference. e.g. A cheetah runs fast. Generic reference with count nouns can also be marked by use of the zero article with plural noun (Cheetahs run fast) and the definite article with singular noun (The cheetah runs fast). However, the indefinite article is the most concrete and colloquial way of expressing a generality (Celce-Murcia and Larsen-Freeman, 1999). When noncount nouns are used generically, the indefinite article is omitted. e.g. Water is essential for life. Table 4.1 Background information of participants
Asian language background Average time spent learning English Average time spent in an English-speaking country
OI group (n = 19)
InI group (n = 20)
Control (n = 12)
14 8 years 3 months
13 6.9 years 9 months
6 5.9 years 3 months
Effects of Instruction
77
In this study, the focus was the use of the indefinite article with count nouns and the omission of the article with noncount nouns to convey generic reference. Generic articles were chosen as the target structure, first, because they cause learners difficulty. One reason for this is that the indefinite article ‘a/an’ is often not acoustically salient because it is usually used in its unstressed, weak form (/ә/ or /әn/) and so may not be perceived by learners (Parrott, 2000). It is, furthermore, completely omitted when noncount nouns are used generically, so that learners have the added confusion of receiving input without the indefinite article. Second, generic ‘a/an’ is complex. It is notthe only way of conveying generic reference (as mentioned above) and furthermore, a/an can convey a variety of other meanings (it can be used to convey non-specific meaning and to represent a particular type of class, that is, to classify). The indefinite article (and indeed the definite article) violates what Andersen (1984) describes as the One-to-One principle which states that acquisition of a form is facilitated when there is a clear and unique correspondence between the form and its meaning. To sum up then, the rule describing the use of the indefinite article with count nouns and the omission of the article with noncount nouns to convey generic reference may not be as amenable to explicit instruction as some grammar rules. This is because it is high in conceptual complexity, low in schematicity, relatively context dependent and low in truth value (Roehr, 2010). It also describes a secondary or peripheral use of the article system, the more prototypical or central use of the indefinite article being to convey non-specific meaning and to classify (see also Hu, 2002 on metalinguistic knowledge about prototypical vs peripheral uses of linguistic structures). Evidence that students in this study had only limited knowledge of the target structure comes from their performance on the pre-tests of the measures used to assess learning. Students scored on average 7.31 (out of a maximum of 12) for judging ungrammatical sentences and 3.69 (out of a maximum of 12) for correcting sentences on the Grammaticality Judgement test. They also scored an average of 0.77 (out of a maximum of 3) for ability to give a correct metalinguistic explanation and 41% on the Oral Elicited Imitation Test.
Instructional treatments Both the InI and OI groups received explicit instruction at the beginning of both days of instruction. Students in the InI group then worked at a total of eight structured input activities and students in the OI group at eight meaning-based output activities. The activities were matched for subject matter and vocabulary. The explicit instruction for both groups covered the following points (see Appendix A): Day 1: Indefinite use of article ‘a’ with the meaning of ONE Use of indefinite article to make a generalization, not referring to just one person/ thing Use of plural noun as another possible way of making a generalization Day 2: Revision of explicit information given on Day 1 Expressing generalization with an uncountable noun.
78
The Metalinguistic Dimension
Instruction for the OI group followed a PPP (presentation/practice/production) format (Gower and Walters, 1983), however, there was a focus on meaning at all times. The explicit instruction, as described above, was given during the presentation phase. During the practice phase students were encouraged to use the target language structure in a linguistically controlled context and in meaningful interaction with each other. They were given feedback so that a response which omitted the generic article, for example, was recast so that they heard the correct form. During the production phase of the treatment students were given free practice in using the target structure, that is, they were not given linguistic resources to help them complete the activity. However, activities were designed so that the target structure was useful or essential, in the belief that this would provide a context for Swain’s ‘pushed output’. See Appendix B for examples of instructional activities. The instruction the InI group received was based on VanPatten’s model of Processing Instruction (2004), meaning that they received additional explicit instruction in which the strategy that did not help them learn language effectively was described to them, that is, the One-to-One Principle (Anderson, 1984). They were told that they needed to learn to ‘process’ differently ‘a/an’, a form with which they were already familiar as conveying indefinite meaning. Structured input activities then aimed to train them to distinguish generic from particular meaning. These activities required them to process both written and oral input, but did not require them at any time to engage in producing language output. Students were given feedback about the accuracy of the choices they made when completing these activities (see Erlam et al., 2009 for more information about the treatments). To summarize then, the key difference between the two treatments was: 1/ there were opportunities for students in the OI group to produce output containing the target form, whereas for the InI group, the target form occurred in utterances made by the teacher only; 2/ students in the OI group received feedback concerning accuracy of their output containing the target form, whereas in the InI group, students received feedback about the accuracy of their selection of answers in relation to language input. The Control group continued with their normal class programme during this time; their part in this study was confined to participation in the testing episodes.
Testing Tests The Grammaticality Judgement test (GJT) This was a pen and paper test in the form of a booklet. The 32 items were presented on a new page so that answers to one item were not influenced by preceding items. Students were not allowed to turn back and look at previous pages. The ungrammatical items (n = 12) formed the data set for this study, although of course participants completed all items. This is because, as previously discussed, ungrammatical items are considered better measures of explicit and metalinguistic language knowledge (Ellis, 2005). Eight items tested for ungrammatical marking of generic ‘a’ and four tested for generic use of noncount nouns in grammatically incorrect contexts.
Effects of Instruction
79
Test takers were asked to indicate by ticking a box whether each sentence was correct or incorrect. They were then asked to state how certain they were of their decision, by putting a percentage in a box. Lastly, they were told that if the sentence was incorrect, that they should rewrite it correctly at the bottom of the page (see Appendix C for an example of the test format). Before beginning the test, participants completed six training items. One version of the test was used over all three testing sessions, however the order of presentation of items was different for each test administration. Wherever possible, the target structure was not placed initially in the sentence. Examples of test items are: These days, computer saves a lot of time at work. Often an information helps people make decisions.
In scoring for the correction of sentences where a context was established for the generic use of count nouns, any correction that allowed for generic use of the noun was accepted (i.e. including correction with ‘the’ or plural noun if appropriate in the specific context). Reliability, using internal consistency, was estimated on all versions of the test for judgement of ungrammatical items, giving Cronbach’s alpha as follows: pre-test, α = .559; post-test, α = .805; delayed post-test, α = .788. Cronbach’s alpha values for the correction of the ungrammatical items were as follows: pre-test, α = .695; post-test, α = .911; delayed post-test, α = .893. It is suggested that the lower reliability estimates for the pre-test in each case may be due to the fact that students had less knowledge of the target structure and that there was therefore less variability in scores. Reliability was also estimated for the certainty rating participants indicated, giving Cronbach’s alpha for the three tests as follows: α = .931, α = .966, α = .976.
The Metalinguistic Knowledge test (MKT) The Metalinguistic Knowledge test consisted of a total of six items but only three were relevant to this study. Two sentences established a context for generic use of count nouns and one for generic use of noncount nouns. All sentences were ungrammatical and the part of the sentence containing the error was italicised. Test takers were asked in each case to correct the error and to explain what was wrong with the sentence. See Appendix C for a sample item. Students were shown two sample items before beginning the test, in each case the model answer that was given was an explanatory rule that used metalanguage. For example: Before a noun, you have to use the possessive adjective, not the pronoun.
Scoring Students were scored for the ability to correct the three sentences. They were then scored according to whether they had given a satisfactory explanation for the error. Scoring at this stage did not take account of metalinguistic terminology. Participants were given credit EITHER for an explanation that focused on form and that accounted for the use or nonuse of articles with count or noncount nouns (referred to subsequently as
80
The Metalinguistic Dimension
‘syntactic’) OR for an explanation that focused on form and meaning and demonstrated understanding of use of ‘a’ to convey generic reference (referred to subsequently as ‘semantic’). The rationale behind this was that many might initially know some of the syntactic rules governing the more prototypical use of articles but that they might have little understanding of the use of ‘a’ to convey generic reference, which is primarily a form-meaning distinction and a more peripheral use of indefinite article ‘a’. In particular it was expected that, as a result of the instruction, students might move from giving ‘syntactic’ to ‘semantic’ explanations, given that the criteria for a semantic explanation (see below) corresponded closely to the metalinguistic information that participants were given during instruction (see Appendix A). For a correct ‘syntactic’ explanation for an error with a count noun, participants needed to fulfil the following criteria (the words in parentheses are optional, not required): reference to the fact that you need ‘a’ before a (count) noun OR you need a determiner/article before a (count) noun (in the singular)
For the stimulus: In Japan, dog is a popular pet an example of an answer which was given credit as a syntactic explanation is as follows: It is countable and needs article
For a correct ‘semantic’ explanation for an error with a count noun participants needed to fulfil the following criteria: reference to the fact that ‘a’ + noun (or article) refers to a general concept (or to many) OR if you want to make a generalisation/refer to the general idea of something you need to use ‘a’ (with a singular noun)
An example of an answer which was given credit as a semantic explanation for the above stimulus is as follows: This sentence describe about general thing so we need article.
The following answers were not considered satisfactory in terms of either a ‘syntactic’ or ‘semantic’ explanation and were given no credit. Before noun, we must use the pronoun. It must use article ‘a’. There is a ‘is’ so I have to use ‘a’.
It is important to note that, in order to be given credit, an answer needed to present a proposition (Roehr, 2010) of the type ‘because X therefore Y’. To some extent then, for some participants, success on this measure may have depended on their willingness and
Effects of Instruction
81
ability to write a full explanation. However, with respect to the former it is important to stress that there were two examples of adequate definitions given to participants before they began the test and also that it was impossible to infer understanding on the basis of incomplete answers. Further limitations with respect to this test are the smaller than ideal number of items (this was due to the limited amount of time available for testing) and the requirement that students complete the test in their L2. It is important, therefore, that results on this particular measure are interpreted with caution. For these reasons, and in the interest of space, results for this test will address Research question 1 only, thus they are presented for the Treatment group as a whole and the Control group. Test takers were also scored for their use of metalingual terminology in completing this Metalinguistic Knowledge test. They were given a score for using any of the following terms: article, count, noncount, noun, singular, generalization. Thus it was possible for them to score up to a total of six for this category. However, this score was the total number of different terms used, in other words, they were given credit for type rather than token. The Metalinguistic Knowledge test was scored by a Masters student in applied linguistics who had been trained for this purpose. The researcher independently scored 25% of the scripts to test for rater reliability. Pearson’s correlations for each category are presented as follows: metalinguistic explanations (r = .944), metalinguistic terminology (r = .968).
The Oral Elicited Imitation Test This instrument, which incorporates a primary focus on meaning and time pressure as crucial components of the test design, is hypothesized to be a measure of implicit language knowledge (see Erlam, 2009). In the present study this test consisted of 32 statements, 17 of which contained generic ‘a’ in a grammatically correct context and 7 of which contained generic ‘a’ in a grammatically incorrect context. A further eight statements contained examples of noncount nouns used generically, in both grammatically correct and incorrect contexts. As far as possible, statements were designed so that the target structure was not placed initially in the sentence. Statements varied between 8 and 18 syllables in length, with the mean length being 13.53 syllables. Sentence length was not controlled for beyond this, because structures that are both simple and complex were targeted and length was a feature that is intrinsic to some of the grammatical structures targeted. The statements used represented a range of difficulty for the learners participating in the test and broadly sampled ‘stimuli of various lengths and complexities’ as recommended by Bley-Vroman and Chaudron (1994, p. 253). Examples of test items are: During games, soccer player runs a lot. [ungrammatical] Information travels around the world on the internet. [grammatical]
The statements were presented on audiotape to participants by an interviewer. Participants were told that they were to complete a Beliefs Questionnaire and were
The Metalinguistic Dimension
82
asked to indicate on paper by checking one of three boxes whether each statement was true, not true or whether they were not sure. Requiring participants to make decisions about the truth value of the statements they heard focused their attention on meaning rather than on form and also had the added benefit of delaying repetition (see Erlam, 2009). This time delay meant that information about the surface structure of the statement, that is, its form, was less available to participants when they did repeat the statements (Sachs, 1967; McDade et al., 1982), thus making it more likely that this test was measuring implicit language knowledge. Participants were then told to repeat the statement in correct English. Pre-test training gave participants practice in responding to both grammatical and ungrammatical test items. It should be noted that participants heard each statement only once and in real time. Grammatical items were scored as correct if the target structure was correctly repeated and ungrammatical items were scored as correct if the target structure (presented in a grammatically incorrect context) was spontaneously corrected. Incorrect suppliance or avoidance of the target structure was scored as 0. Participants’ total scores were averaged, that is, divided by the total number of items. The decision to report scores as percentage data was made because the sound quality of the recording at times meant that a decision about some responses could not be made. These were thus scored as missing data. One version of the test was used over all three testing sessions, however, the order of presentation of items was different for each test administration. Reliability, using internal consistency, was estimated on all versions of the test, giving Cronbach’s alpha as follows: pre-test, α = .809; post-test, α = .816; delayed post-test, α = .869.
Results The first research question investigated the impact of instruction on the learners’ metalinguistic knowledge of the target structure. For this question the two treatment groups were combined. Descriptive statistics for the judgement of ungrammatical sentences on the GJT and certainty (indicated according to a percentage) with respect to these judgements are presented in Table 4.2. Effect sizes for judgements are also reported using Cohen’s d-index which was interpreted following Cohen’s criteria
Table 4.2 Descriptive statistics for correct judgements and certainty of judgements for ungrammatical sentences on the GJT (total number of items is 12) Judgement Treatment (n = 39) Pre-test Post-test 1 Post-test 2
Certainty
Control (n = 12)
Treatment (n = 39)
Control (n = 12)
M
SD
M
SD
M
SD
M
SD
7.46 9.03 9.74
2.51 2.46 2.05
6.83 6.33 6.83
2.04 3.47 3.10
70.25 81.58 85.09
20.04 18.43 20.33
57.36 63.26 64.44
19.33 22.32 18.31
Effects of Instruction
83
(1988): 0.80 constitutes a large effect size, 0.50 a medium effect size and 0.20 a small effect size. As described by Norris and Ortega (2000), d was calculated by contrasting the Treatment group with the Control group on the post-tests. Because assumptions of normality were violated for the descriptive statistics on a number of occasions, non-parametric tests (Mann-Whitney U) were used to test for differences in performance between the Treatment groups and the Control groups at making correct judgements. Mann-Whitney U tests showed no statistically significant differences between either group on the pre-test (Z = –.931, p > .05, d = 0.23) but a statistically significant difference with large effect sizes between the two groups on the post-test (Z = –2.552, p .05, d = 0.42) but a statistically significant difference with large effect sizes on the post-test (Z = –3.269, p < .01, d = 1.40) and the delayed post-test (Z = –4.050, p < .01, d = 1.72). Table 4.4 presents descriptive statistics for syntactic and semantic corrections on the Metalinguistic Knowledge test and also for the total number of correct explanations on this test (scores for syntactic and semantic components were aggregated and so it was possible for participants to score up to a maximum of 6, that is, they could get credit for both a syntactic and semantic explanation for each item. Note, however, that the highest score for this section was 3). For syntactic explanations, Mann-Whitney U tests showed no statistically significant differences between either group on the pre-test (Z = –.878, p > .05, d = 0.40) or post-test (Z = –.953, p >.05, d = 0.22 ) but a significant difference with a large effect size on the delayed post-test (Z = –1.981, p < .05, d = 0.81). For semantic explanations, there were no statistically significant differences between either group
Table 4.3 Descriptive statistics for correct corrections of ungrammatical sentences on the GJT (total number of items is 12) Correction
Pre-test Post-test 1 Post-test 2
Treatment (n = 39)
Control (n = 12)
M
SD
M
SD
3.92 8.18 8.85
2.61 3.13 2.81
2.92 3.50 3.92
2.15 4.08 3.06
The Metalinguistic Dimension
84
Table 4.4 Descriptive statistics for correct explanations on the MKT Treatment (n = 39) Pre-test
Post-test 1
Syntactic (maximum score = 3) M 0.74 0.72 SD 0.99 1.00 Semantic (maximum score = 3) M 0.13 0.46 SD 0.41 0.76 Total (maximum score = 6) M 0.87 1.18 SD 1.08 1.23
Control (n = 12)
Post-test 2
Pre-test
Post-test 1
Post-test 2
0.82 1.07
0.42 0.67
0.50 1.00
0.17 0.39
0.62 0.91
0.00 0.00
0.50 0.80
0.25 0.62
1.44 1.31
0.42 0.67
1.00 1.04
0.42 0.67
on any of the tests (Z = –1.144, p > .05, d = 0.44; Z = –.163, p > .05, d = –0.05; Z = –1.212, p > .05, d = 0.48). For the total number of correct explanations Mann-Whitney U tests showed no statistically significant differences between either group on the pre-test (Z = –1.202, p > .05, d = 0.50) or post-test (Z = –.339, p > .05, d = 0.16) but a significant difference with a large effect size on the delayed post-test (Z = –2.291, p < .05, d = 1.06). Table 4.5 shows the descriptive statistics for the total number of metalingual terms used. There was no limit to this score, however the maximum score obtained over the 3 test items was 5. Mann-Whitney U tests showed no statistically significant differences between either group on the pre-test (Z = –.595, p > .05, d = 0.22) but a significant difference with a medium effect size on the post-test (Z = –.2.036, p < .05, d = 0.73). There was no statistically significant difference between the two groups on the delayed post-test although there was a medium effect size (Z = –1.687, p > .05, d = 0.59). The second research question investigated whether the type of instruction led to differences in the acquisition of metalinguistic knowledge. Table 4.6 presents descriptive statistics for all groups on the GJT. To establish differences between groups, non-parametric tests needed to be used. Therefore, Kruskal-Wallis test was used to test for between-group differences and Mann-Whitney U test was used for post-hoc testing where a significant difference had been established. For judging ungrammatical sentences, Kruskal-Wallis tests revealed no significant between-group differences on the pre-test (χ² (2) = 2.317, p = .314) but significant differences on both post-tests (χ² (2) = 7.285, p = .026; χ² (2) = 9.942, p = .007). MannWhitney U tests revealed a significant difference and large effect sizes between the OI group and the Control group for both post-tests (Z = –2.812, p < .01, d = 1.22; Z = –3.096, p < .01, d = 1.35). There was a significant difference and a large effect size between the InI group and the Control group on the delayed post-test (Z = –2.481, p < .05, d = 0.92) but not on the post-test although there was a moderate effect size
Effects of Instruction
85
Table 4.5 Descriptive statistics for the total number of metalingual terms used Group M SD M SD
Treatment group (n = 39) Control group (n = 12)
Pre-test
Post-test
Delayed post-test
1.59 1.21 1.33 1.16
2.26 1.33 1.33 1.23
2.00 1.19 1.33 1.07
Table 4.6 Descriptive statistics for all groups on the GJT test Group
InI group (n= 20) OI group (n = 19) Control (n = 12)
Judging statements (max. score = 12) M SD M SD M SD
Correcting statements (max. score = 12)
Pre-test
Post-test 1
Post-test 2
Pre-test
Post-test 1
Post-test 2
7.05 2.87 7.90 2.05 6.83 2.04
8.50 3.07 9.58 1.47 6.33 3.47
9.45 2.56 10.05 1.31 6.83 3.10
3.60 3.07 4.26 2.05 2.92 2.15
7.30 3.95 9.11 1.56 3.50 4.08
8.15 3.60 9.58 1.35 3.92 3.06
(Z = –1.771, p > .05, d = 0.66). There were not statistically significant differences between either of the treatment groups on either post-test. Friedman tests were also used to establish within-group differences. For the InI group there were statistically significant gains for judging ungrammatical sentences over time (χ² (2) = 9.594, p = .008). Post-hoc testing using Wilcoxon Signed Ranks test showed significant gains from the pre-test to the delayed post-test (Z = –3.145, p = .002). For the OI group there were significant gains for judging correctly on the GJT over time (χ² (2) = 11.742, p = .003). Post-hoc testing showed statistically significant gains between the pre-test and both the post-test (Z = –2.496, p = .013) and the delayed post-test (Z= –2.993, p = .003). For correcting incorrect sentences, Kruskal-Wallis tests revealed no significant between-group differences on the pre-test (χ² (2) = 2.280, p = .320) but statistically significant differences on both post-tests (χ² (2) = 12.552, p = .002; χ² (2) = 17.353, p = .000). Mann-Whitney U tests revealed a statistically significant difference between the InI group and the Control group for both post-tests (Z= –2.364, p < .05, d = 0.95; Z= –2.995, p < .01, d = 1.27) and also between the OI group and the Control group for these same tests (Z= –3.520, p < .01, d = 1.82; Z= –4.280, p < .01, d = 2.39). Effect sizes were large in all cases, although larger for the OI group than the InI group. There were no statistically significant differences between either of the treatment groups on either post-test. In terms of within-group differences, Friedman tests showed that the InI group made statistically significant gains for correcting incorrect sentences over time (χ² (2) = 17.268, p = .000). Mann-Whitney U tests revealed statistically significant
The Metalinguistic Dimension
86
Table 4.7 Summary of results Grammaticality Judgement test Judging sentences
Correcting sentences
Group
Time
Group
OI > Control: post-test 1 OI, In I > Control: post-test 2
OI: pre < post-test 1 OI, InI: pre < post-test 2
OI, InI > Control: post-test 1 OI, InI > Control: post-test 2
Time OI, InI: pre < post-test 1 OI, InI: pre < post-test 2
Table 4.8 Spearman’s correlations between judgements and corrections of ungrammatical sentences on GJT and the Elicited Imitation test Correlations
Groups Control (n = 12)
Treatment (n = 39)
pre-test post-test 1 post-test 2 pre-test post-test 1 post-test 2 Elicited imitation
GJT judgements
.130
.241
.281
–.377
.016
–.173
GJT corrections
.295
.470**
.389*
.102
.241
–.086
* p < .05; **p < .01
gains between the pre-test and both post-tests (Z =–3.319, p = .001; Z =–3.589, p = .000). The OI group also made statistically significant gains over time (χ² (2) = 31.200, p = .000). Post-hoc tests revealed statistically significant gains between the pre-test and both post-tests (Z=–3.841, p = .000; Z=–3.831, p = .000). Table 4.7 shows a summary of the results of between-group and within-group differences in performance on the GJT. The third research question investigated the relationship between the measures of metalinguistic knowledge used in this study and the Elicited Imitation test, hypothesized to be a measure of implicit language knowledge. In order to answer this question, performance on the two hypothesized measures of metalinguistic knowledge were correlated with the Elicited Imitation test. In each case correlations are presented for the Total group. See Table 4.8.
Discussion Research question 1 asked what effect instruction had on learners’ metalinguistic knowledge of the target structure. Results (see Tables 4.2 and 4.3) show that instruction had a significant impact on participants’ ability to both judge ungrammatical sentences correctly and correct ungrammatical sentences, both in the short term (i.e. on the
Effects of Instruction
87
post-test) and the longer term (i.e. the delayed post-test). Pre-test scores (M = 7.46; maximum score = 12) demonstrate that participants had some initial competence in judging ungrammatical sentences, but little ability to correct these sentences (M = 3.92, maximum score = 12). This pattern was also evident in the pre-test scores of the Control group. The instruction that the participants received resulted in gains for correcting ungrammatical sentences that meant that their competence in this skill approximated their ability to judge sentences correctly on the delayed post-test (M = 8.85; M = 9.74). Participants who received instruction also demonstrated greater certainty over time with respect to their judgements about the ungrammaticality of sentences (see Table 4.2). This group made gains with large effect sizes in comparison with the Control group although it needs to be noted that there was a between-group difference with a medium effect size between the two groups on the pre-test. (There is no obvious reason why the Control group were less certain in making judgements than the Intervention group on this test.) This result suggests perhaps that traditional measures of learning may underrepresent the effects of instruction by looking only at the impact on language knowledge rather than on learner certainty or confidence. With respect to the impact of instruction on the ability to give metalinguistic explanations, it is important to note the extremely low scores in Table 4.4, suggesting a ‘floor’ effect, and the Standard Deviations that are, in many cases, higher than mean scores. This highlights the need to interpret results with caution. Results tentatively suggest that instruction had a significant impact on the ability to give metalinguistic explanations, although here the gains were only evidenced over time (i.e. on the delayed post-test only). The Treatment group made statistically significant gains in comparison to the Control group for the ability to give ‘syntactic’ explanations on the delayed post-test and also on the delayed post-test for aggregated explanations. There were no significant gains in comparison to the Control group for ‘semantic’ explanations on either post-test although the descriptive statistics do show some gains over the three testing episodes. This is a potentially interesting and unexpected result given that the emphasis of the instruction was on the use of ‘a’ to mark generic reference and that rules explaining this were explicitly given in class (see Appendix A). It might, therefore, have been expected that the participants would make greater gains in being able to give a form-meaning explanation describing use of generic ‘a’ than one relating more to form and to the type of explanations they may have received previously. However, Hu (2011, p. 72) points out that there is no guarantee that L2 learners’ metalinguistic knowledge will ‘mirror the pedagogical rules presented in classrooms’ and reports that participants in his study performed better at verbalizing rules that described a central or prototypical use of the target structure than rules that described a periphery or secondary use. It seems that the instruction learners received in this study may have contributed more to strengthening expression of knowledge that they had already been exposed to rather than to fostering the ability to explain the new use of the target structure, although the less than rigorous nature of the measure used in this study means that any conclusion needs to be tentative. More research is needed to ascertain whether new knowledge helps learners perceive certain aspects of rules about which they have already received instruction, as Hu (2011) suggests.
88
The Metalinguistic Dimension
Results also demonstrate that instruction had an impact on participants’ ability to use metalingual terms in giving error explanations, although in this case the gains were only evidenced in the short term (i.e. on the post-test). In conclusion then, instruction did have a significant impact on participants’ metalinguistic knowledge of the target structure. Furthermore the gains made on the GJT were sustained in that they were evidenced on both post-tests and delayed posttests, and, while not statistically significant, descriptive statistics show an increase in delayed post-test scores over post-test scores (see Tables 4.2 and 4.3). There is some evidence to suggest that instruction also had an impact on participants’ metalingual knowledge, that is, the ability to give error explanations, although the limitation of the measure used and the need for more research has been highlighted. The second research question asked whether the type of instruction led to differences in the acquisition of metalinguistic knowledge. Results show that for judging ungrammatical sentences, the group that received output-based instruction (OI) made gains more immediately (i.e. on the immediate post-test) than the inputbased instruction group (InI). Both groups, however, made significant gains from pre to delayed post-test. The two types of instruction (i.e. input-based and output-based) did not result in any differences for correcting ungrammatical sentences. Overall then, there is little evidence of difference in terms of the effectiveness of each type of treatment, a result which is similar to that obtained by Toth (2006) who found a nonsignificant difference between his Input and Output groups on a GJT. In explaining this result, we need to perhaps focus on the similarity between the two Treatment groups, in that both types of instruction were designed to require learners to focus on language form. Perhaps, as Izumi (2003) claims, attention to form and metalinguistic reflection are inextricably linked and the benefits of output do not outweigh the opportunity for learners to focus on language form in terms of gains in metalinguistic understanding. The last research question asks what relationship there was between a measure of metalinguistic knowledge and a measure of implicit language knowledge, in this case the Elicited Imitation test. There were small correlations only between the measure of metalinguistic knowledge and the measure of implicit knowledge. On the one hand this is an unsurprising result given that the Elicited Imitation test required participants to use highly automatized knowledge under real-time performance conditions. Metalinguistic knowledge, on the other hand, requires high levels of controlled processing and analysis (Elder, 2009). However, the slightly stronger and statistically significant correlations for correcting ungrammatical sentences following instruction are interesting even if these are still small (rho = .470, rho = .389). To some extent the ability to correct errors was a skill assessed in the Elicited Imitation test as well as in the GJT, 11/25 items measured test takers’ ability to spontaneously correct the target item presented in an ungrammatical context. It is interesting to speculate to what extent, then, the explicit knowledge that students gained as a result of instruction may have been proceduralized and automatized, using DeKeyser’s model (2009), to the extent that it was available for use in real time. Results are tentative and more research is needed to further test this hypothesis. A complicating factor in this study is the fact that the same version of the Elicited Imitation test was used for each testing episode,
Effects of Instruction
89
albeit with varied presentation of items, so that students may have become familiar with some of the items, which could have allowed them to draw on explicit knowledge. However, this is not the first time that a relationship has been demonstrated between results on a metalinguistic measure and a measure of oral language production. White and Ranta (2002) report an even stronger correlation (r = .56) between performance on a metalinguistic measure and an oral task. It is important, however, to signal the different nature of the measures that White and Ranta (2002) used. Their picture description task does not seem to have included the constraint of any time pressure so that we cannot be sure that participants, particularly those in the Rule group who had received instruction focusing on the target structure, did not have time to access their explicit knowledge.
Limitations Limitations with respect to the MKT have already been mentioned, that is, the small number of items and the fact that participants did not complete it in their L1. To some extent then, successful performance on the MKT may have depended on L2 proficiency. It is also regrettable that there was no measure of passive understanding of metalinguistic concepts along with the measure of competence in rule verbalization. The fact that the explicit grammatical instruction that the participants in this study received was followed by language practice activities means that it is not possible to disambiguate the effects of the instruction from language practice. An anonymous reviewer pointed out that the differences in the background experiences of participants could be a factor impacting on results obtained. This is possible although there was, as has been discussed, a lack of pre-test differences among the different groups.
Conclusion This study investigated the impact of explicit teaching, which included metalinguistic description, on the learning of a linguistic structure judged to be less amenable to instruction than some because of its complexity, low schematicity, relative context dependency and low truth value. Results showed that explicit grammar instruction (with language practice activities) did have a positive impact on metalinguistic knowledge in that learners made significant gains in the ability to judge and correct ungrammatical sentences. Furthermore, these gains were sustained over time, suggesting that this type of knowledge is relatively durable. There was also some evidence to suggest that a greater consistency in response correctness was associated with a greater certainty on the part of participants as they made their grammatical judgements. Instruction also had a positive impact on participants’ metalingual competence, however, not as might have been expected, although results, it has been suggested,
90
The Metalinguistic Dimension
need to be interpreted with caution. Participants made significant and greater gains in giving rule explanations for the more central or prototypical use of articles than they did in describing the more peripheral or secondary use of the indefinite article ‘a’ to convey generic reference. This result is aligned with predictions (Roehr, 2008) and results from other research (Hu, 2011). An interesting result is the lack of significant difference between instruction that provided opportunity for learners to engage in pushed output and instruction that required students to work with language input, in terms of gains in metalinguistic knowledge. A probable explanation is that it was the opportunity that both types of instruction allowed for learners to focus on form that led to gains in metalinguistic understanding (Izumi, 2003). Finally, evidence that demonstrated that metalinguistic performance was not initially correlated with performance on a test of implicit language knowledge concurs with existing research (Loewen, 2009; Elder, 2009) which suggests that these two types of knowledge are elicited under very different performance conditions. However, it was interesting to note some significant, if small, correlations between both these measures (i.e. the Elicited Imitation test and the component of the MKT assessing the ability to correct incorrect sentences) following explicit instruction, providing tentative evidence for DeKeyser’s hypothesis that explicit knowledge that has been proceduralized may be able to be used in real time.
Acknowledgements This research was funded by a Marsden grant awarded by the Royal Society of Arts of New Zealand to Rod Ellis and Cathie Elder. Other researchers who contributed were Shawn Loewen, Jenefer Philp, Satomi Mizutani and Shuhei Hidaka. The author would also like to thank Juanita Watts and Faiza Hadji for their careful coding of the data which was specific to this chapter only.
Appendix A: Summary of rule explanations given to students We use a/an when the person or thing is indefinite (i.e. when we do not know for sure which person/thing is being talked about) We can use a/an when we want to talk about ONE person or thing that is not definite. We can use a/an with a COUNTABLE noun to make a GENERALIZATION about people/ things. When the noun is UNCOUNTABLE we cannot use a/an to make a generalization. We must use the noun by itself.
Effects of Instruction
91
Appendix B: Examples of teaching activities used in instructional treatments Structured input activity
In this activity the article ‘a’ is used with two meanings. You have to sort out which meaning it has – whether it is general or ONE particular thing. Tick the ‘General’ or ‘Particular’ box.
1. I bought a new bicycle. 2. I bought a bus ticket with my last dollar. 3. A helicopter can land just about anywhere. 4. A car costs a lot of money to run.
General
Particular
☐ ☐ ☐ ☐
☐ ☐ ☐ ☐
Etc. Production activity (students are given one of two cards and complete this activity with a classmate who has a different card)
Card 1. You will meet an alien from planet Zlog. The alien will ask you, ‘What’s a human?’ Explain clearly 3 characteristics of a human. Find out 3 things about a zlog.
Card 2 You are an alien from planet Zlog. Find out 3 things about a human. The human you meet will ask you, ‘What’s a zlog?’ Explain clearly 3 characteristics of a zlog.
What’s a zlog?
The Metalinguistic Dimension
92
Appendix C – Examples of test items Sample item from GJT In class, good student listens to the teacher.
Correct
Incorrect
a. This sentence is: ☐ ☐ b. How certain are you? ☐% c. If you think the sentence is incorrect please write down the correct sentence below. If you think the sentence is correct, please go to the next page.
Now turn to the next page and continue the test. You MUST NOT turn back to look at this page. Sample item from MKT For a friend, book is a good present. 1. Correct the error. 2. Explain what is wrong with the sentence.
References Andersen, R. (1984), ‘The one to one principle of interlanguage construction’. Language Learning, 34, 77–95. Berry, R. (2004), ‘Awareness of metalanguage’. Language Awareness, 13, 1–16. — (2005), ‘Making the most of metalanguage’. Language Awareness, 14, 3–20. Bley-Vroman, R. and Chaudron, C. (1994), ‘Elicited Imitation as a measure of secondlanguage competence’, in E. Tarone, S. Gass and A. Cohen (eds), Research Methodology in Second-Language Acquisition. Hillsdale, NJ: Lawrence Erlbaum, pp. 245–61. Celce-Murcia, M. and Larsen-Freeman, D. (1999), The Grammar Book (2nd edn). Boston, MA: Heinle & Heinle. Cohen, J. (1988), Statistical Power Analysis for the Behavioural Sciences (2nd edn). Hillsdale, NJ: Lawrence Erlbaum. DeKeyser, R. (2003), ‘Implicit and explicit learning’, in C. Doughty and M. Long (eds), Handbook of Second Language Acquisition. Malden, MA: Blackwell, pp. 313–48. — (2005), ‘What makes learning second-language grammar difficult? A review of issues’. Language Learning, 55(S1), 1–25. — (2009), ‘Cognitive-psychological processes in second language learning’, in M. Long and C. Doughty (eds), Handbook of Second Language Teaching. Oxford: WileyBlackwell, pp. 119–38. Elder, C. (2009), ‘Validating a test of metalinguistic knowledge’, in R. Ellis, S. Loewen, C. Elder, R. Erlam, J. Philip and H. Reinders, Implicit and Explicit Knowledge in Second Language Learning, Testing and Teaching. Bristol: Multilingual Matters, pp. 113–38.
Effects of Instruction
93
Elder, C., Erlam, R. and Philp, J. (2007), ‘Explicit language knowledge and focus on form: Options and obstacles for TESOL teacher trainees’, in S. Fotos and N. Hossein (eds), Form-focussed Instruction in Teacher Education: Studies in Honour of Rod Ellis. Oxford: Oxford University Press, pp. 225–41. Ellis, R. (1991), ‘Grammaticality judgements and second language acquisition’. Studies in Second Language Acquisition, 13, 161–86. — (2003), Task-based Learning and Teaching. Oxford: Oxford University Press. — (2004), ‘The definition and measurement of L2 explicit knowledge’. Language Learning, 54, (2), 227–75. — (2005), ‘Measuring implicit and explicit knowledge of a second language: A psychometric study’. Studies in Second Language Acquisition, 27 (2), 141–72. — (2008), The Study of Second Language Acquisition (2nd edn). Oxford: Oxford University Press. Erlam, R. (2009), ‘The elicited oral imitation test as a measure of implicit knowledge’, in R. Ellis, S. Loewen, C. Elder, R. Erlam, J. Philip and H. Reinders, Implicit and Explicit Knowledge in Second Language Learning, Testing and Teaching. Bristol: Multilingual Matters, pp. 31–64. Erlam, R., Loewen, S. and Philp, J. (2009), ‘The roles of output-based and input-based instruction in the acquisition of L2 implicit and explicit knowledge’, in R. Ellis, S. Loewen, C. Elder, R. Erlam, J. Philip and H. Reinders, Implicit and Explicit Knowledge in Second Language Learning, Testing and Teaching. Bristol: Multilingual Matters, pp. 241–61. Gass, S. (1994), ‘The reliability of second-language grammaticality judgements’, in E. Torone, S. Gass and A. Cohen (eds), Research Methodology in Second Language Acquisition. Hillsdale, NJ: Lawrence Erlbaum, pp. 303–22. Gower, R. and Walters, S. (1983), A Teaching Practice Handbook. London: Heinemann. Green, P. and Hecht, K. (1992), ‘Implicit and explicit grammar: An empirical study’. Applied Linguistics, 13, (2), 168–84. Hu, G. (2002), ‘Psychological constraints on the utility of metalinguistic knowledge in second language production’. Studies in Second Language Acquisition, 24, (3), 347–86. — (2011), ‘Metalinguistic knowledge, metalanguage, and their relationship in L2 learners’. System, 39, 63–77. Isumi, S. (2003), ‘Comprehension and production processes in second language learning: In search of the psycholinguistic rationale of the output hypothesis’. Applied Linguistics, 24, 168–96. Krashen, S. (1998), ‘Comprehensible output?’. System, 26, 175–82. Loewen, S. (2009), ‘Grammaticality judgment tests and the measurement of implicit and explicit L2 knowledge’, in R. Ellis, S. Loewen, C. Elder, R. Erlam, J. Philip and H. Reinders, Implicit and Explicit Knowledge in Second Language Learning, Testing and Teaching. Bristol: Multilingual Matters, pp. 94–112. Loewen, S. and Reinders, H. (2011), Key Concepts in Second Language Acquisition. Houndmills: Palgrave Macmillan. McDade, H., Simpson, M. and Lamb, D. (1982), ‘The use of elicited imitation as a measure of expressive grammar: A question of validity’. Journal of Speech and Hearing Disorders, 47, 19–24. Muranoi, H. (2007), ‘Output practice in the L2 classroom’, in R. DeKeyser (ed.), Practice in a Second Language. Cambridge: Cambridge University Press, pp. 51–84. Norris, J. M. and Ortega, L. (2000), ‘Effectiveness of L2 instruction: A research synthesis and quantitative meta-analysis’. Language Learning, 50, (3), 417–528.
94
The Metalinguistic Dimension
Parrott, M. (2000), Grammar for English Language Teachers. Cambridge: Cambridge University Press. Pienemann, M. (1998), Language Processing and Language Development: Processability Theory. Amsterdam: John Benjamins. Roehr, K. (2008), ‘Linguistic and metalinguistic categories in second language learning’. Cognitive Linguistics, 19, 67–106. — (2010), ‘Explicit knowledge and learning in SLA: A cognitive linguistics perspective’. AILA Review, 23, 7–29. Roehr, K. and Gánem-Gutiérrez, G. A. (2009), ‘The status of metalinguistic knowledge in instructed adult L2 learning’. Language Awareness, 18, (2), 165–81. Sachs, J. (1967), ‘Recognition memory for syntactic and semantic aspects of connected discourse’. Perception and Psychophysics, 2, 437–42. Swain, M. (1995), ‘The functions of output in second language learning’, in G. Cook and B. Seidlhoffer (eds), Principles & Practice in Applied Linguistics: Studies in Honor of H.G. Widdowson. Oxford: Oxford University Press, pp. 125–44. Toth, P. D. (2006), ‘Processing instruction and a role for output in second language acquisition’. Language Learning, 56, (2), 319–85. VanPatten, B. (1996), Input Processing and Grammar Instruction in Second Language Acquisition: Theory and Research. Norwood, NJ: Ablex. — (2004), Processing Instruction: Theory, Research, and Commentary. Mahwah, NJ: Lawrence Erlbaum. White, J. and Ranta L. (2002), ‘Examining the interface between metalinguistic task performance and oral production in a second language’. Language Awareness, 11, (4), 259–90.
5
University-Level Learners’ Beliefs about Metalinguistic Knowledge Benjaporn Thepseenu University of Essex
Karen Roehr
University of Essex
Abstract We report an investigation of L1 Thai university-level learners’ metalinguistic knowledge about L2 English and their beliefs about such knowledge. Using questionnaires and interviews, learners’ beliefs about the explicit teaching and learning of grammar in general as well as their beliefs about the difficulty and usefulness of instruction on selected grammar points were examined. Moreover, learners’ performance on the selected grammar points was assessed by means of a test of metalinguistic knowledge. We identified correspondences between learners’ beliefs and their actual metalinguistic knowledge. Furthermore, our findings suggest that the learners’ belief systems are complex constructs. The participants held both positive and negative beliefs about the explicit teaching and learning of grammar, with specific beliefs indicating differentiated rather than contradictory views. Learner confidence emerged as an important theme from the data.
Introduction Instructed second language (L2) learners’ metalinguistic knowledge, or their knowledge about language, and its relationship with L2 proficiency, i.e. the ability to use language fluently and accurately, have attracted the interest of language researchers and teachers for more than three decades. Existing research shows that in instructed adults, metalinguistic knowledge and written L2 proficiency are positively correlated (Alderson et al., 1997; Roehr, 2008b; Sorace, 1985). It has also been suggested that learners’ beliefs about language and language learning may influence their achievement – or otherwise – of L2 proficiency in the classroom (e.g. Cotterall, 1999; Horwitz, 1987,
96
The Metalinguistic Dimension
1988, 1999; White, 2008). Thus, researchers have considered both metalinguistic knowledge and learner beliefs in conjunction with L2 proficiency. However, no study to date has examined the relationship between learners’ metalinguistic knowledge and their beliefs about such knowledge. We present a study which was undertaken to address this issue. If beliefs can affect the attainment of L2 proficiency, then it is reasonable to assume that beliefs may likewise impact on other aspects of learning such as the development of metalinguistic knowledge and a learner’s ability to make use of such knowledge in their L2 learning. Thus, it is of interest to investigate whether there is any association between learners’ metalinguistic knowledge and their beliefs about such knowledge.
Metalinguistic knowledge in instructed L2 learning In the context of adult L2 learning, metalinguistic knowledge can be defined as explicit knowledge about the language that is being learned (Alderson et al., 1997; Elder and Manwaring, 2004; Elder et al., 1999). Explicit knowledge is represented declaratively; it is knowledge that can be brought into conscious awareness and that is potentially available for verbal report (Hulstijn, 2005; Roehr, 2008b). Explicit knowledge can be contrasted with implicit knowledge, which cannot be brought into awareness or articulated (N. C. Ellis, 1994; R. Ellis, 2004). In the context of the present study, the terms metalinguistic knowledge and explicit knowledge about language are used as synonyms. Most adults who learn an L2 in a classroom setting are exposed to metalinguistic information on a regular basis. Language teachers, textbooks, grammar books and dictionaries rely on explicit knowledge about language when describing linguistic categories or grammatical roles of parts of speech (e.g. intransitive verb, direct object, etc.) as well as when explaining the form and/or use of particular linguistic constructions (e.g. ‘In English, in the third person present tense, an –s needs to be added to the verb’). Thus, metalinguistic knowledge in the sense of explicit knowledge about language often takes the form of pedagogical grammar rules. In principle, a learner can develop explicit knowledge about language inductively and independently through analysis of the linguistic input following extensive exposure to the L2 (Bialystok, 1994; Roehr, 2008a). However, in a classroom setting, explicit knowledge is more likely to be assimilated from external sources such as teacher input and grammar books (Hu, 2011). Put differently, it is acquired through explicit learning, facilitated by explicit instruction (DeKeyser, 2003). Research spanning more than two decades has demonstrated that explicit instruction effectively promotes L2 learning in the classroom; overall, this finding can be regarded as trustworthy and robust (Norris and Ortega, 2001). Along similar lines, research examining the role of different levels of learner awareness indicates that higher levels of awareness and understanding on the part of the learner are generally associated with improved L2 performance (Camps, 2003; Leow, 2000; Rosa and O’Neill, 1999). Hence, it appears that explicit teaching and learning are generally effective, if compared with implicit teaching and learning or no instruction at all. As outlined above, explicit teaching and learning typically rely to
University-Level Learners’ Beliefs
97
at least some extent on metalinguistic knowledge, with a view to enabling learners to apply such knowledge in productive and receptive language use. Investigations into the specific role of metalinguistic knowledge in instructed L2 learning have led to two key findings. First, learners’ levels of metalinguistic knowledge generally correlate positively with levels of written L2 proficiency as measured by performance on tests of grammar and vocabulary, reading and writing (Alderson et al., 1997; Elder and Manwaring, 2004; Elder et al., 1999; Roehr, 2008b), although correlation coefficients vary between studies from moderate to strong. Second, learners’ levels of metalinguistic knowledge may be predicted by certain individual learner differences. Individual learner differences are characteristics which all learners possess, but on which individual learners differ (Dörnyei, 2005; Dörnyei and Skehan, 2003), either categorically, as in the case of age or gender, or on a scale, as in the case of language learning experience, language learning aptitude or metacognition. To date, there is relatively little research that has investigated the relationship between metalinguistic knowledge and individual difference variables in instructed L2 learners.1 Existing findings suggest that instructed adults’ levels of metalinguistic knowledge are significantly predicted by their language learning experience in the sense of number of languages learned and length of exposure to formal classroom instruction (Roehr and Gánem-Gutiérrez, 2009b). In addition, two components of language learning aptitude – grammatical sensitivity and associative memory – have been shown to be significant, though weak predictors, together explaining 15% of the variance in L2 learners’ metalinguistic knowledge (Roehr and Gánem-Gutiérrez, 2009b).
Learner beliefs Beliefs are part of an individual’s metacognition. In its broadest definition, metacognition refers to cognition about cognitive phenomena (Flavell, 1979), or, put less formally, to ‘thinking about one’s own thoughts’ (Hacker, 1998, p. 3). Metacognition includes knowledge of one’s knowledge, cognitive, and affective states as well as the ability to consciously and deliberately monitor and regulate one’s knowledge, cognitive, and affective states (Hacker, 1998). Learner beliefs can be regarded as a subset of the metacognitive knowledge a learner holds. Beliefs are relatively stable, and they can be articulated (Wenden, 1998, 1999). In the area of L2 learning, beliefs about language and language learning in general have been the focus of the so-called BALLI studies (Horwitz, 1987, 1988, 1999). These studies have identified and described the beliefs classroom learners of different L2s hold about issues such as the nature and difficulty of language learning, language learning aptitude, learning and communication strategies, and motivations and expectations. Findings suggest that certain majority beliefs may hold across cultures and across instructional contexts, but that there is likewise considerable variation in evidence, both within and between specific learner groups (Horwitz, 1999). Apart from seeking to uncover the nature of learner beliefs (Benson and Lor, 1999; Cotterall, 1999), existing research has identified a relationship between learners’ beliefs and their use of language learning strategies (White, 2008; Yang, 1999).
98
The Metalinguistic Dimension
By contrast, attempts to identify culturally based differences in learner beliefs have led to inconclusive results. In a review of seven BALLI studies, Horwitz (1999) compared the beliefs of US, Korean, Taiwanese, and Turkish-Cypriot learners and US language teachers. Differences within cultural groups (e.g. eastern Asian) and even within groups of the same nationality (e.g. Korean) were often more pronounced than differences between eastern and western cultural groupings. Horwitz (1999) concludes that professional status (student or teacher), age, learning context, L2 proficiency level and the L2 being learned are all factors likely to influence beliefs; what is more, each of these factors may be more powerful than cultural differences. In addition to the BALLI studies which focused on general beliefs about language and language learning, there is a small body of recent research which has investigated learner beliefs about the role of explicit instruction. Using questionnaires, Schulz (2001) explored perceptions of the role of explicit grammar instruction and error correction in the post-secondary classroom. The participants were 607 Colombian language students and 122 teachers. Questionnaire responses were compared with the findings from a previous study which employed a similar instrument administered to 824 US language students and 92 teachers. The questionnaires consisted of statements to which participants responded on a five-point Likert scale ranging from ‘strongly agree’ to ‘strongly disagree’. Results showed that US students’ responses differed significantly from Colombian students’ responses. The greatest discrepancy was found for the items ‘I like the study of grammar’, with 76% of Colombian students but only 46% of US students agreeing, and ‘My foreign language improves most quickly if I study and practise the grammar’, which yielded a 77% agreement rate from Colombian students, as opposed to 48% from US students. The author concludes that there is evidence for a strong belief on the part of students from both cultures in the positive roles played by explicit grammar instruction and corrective feedback. In general, teachers also valued explicit grammar instruction and error correction, but their beliefs as a group were far less homogeneous than those of the students. More recently, Loewen et al. (2009) conducted a further investigation aimed at identifying L2 learners’ beliefs about explicit grammar instruction and error correction. The study was carried out at a US university and involved 754 participants enrolled in 13 different language courses, with English, German, Arabic, Japanese, Chinese and Spanish the most frequently represented L2s. The participants mostly had English as their L1; other frequent L1s were Korean, Chinese and Arabic. Thus, although conducted in the United States, the study yields information about the beliefs of learners from a range of L1s and cultural backgrounds. The instrument used was a questionnaire with 37 belief-related statements about grammar instruction and error correction, some of which were adapted from Schulz (2001). After responding to these questionnaire items on a five-point Likert scale, participants were asked to complete four open-ended statements on why they liked or disliked studying grammar and how they liked or disliked being taught grammar. The researchers report that learners valued explicit grammar instruction, but also that there were individual differences in evidence. In general, learners of L2 English were less enthusiastic about grammar instruction and error correction than learners of other L2s; at the same time, learners of L2 English showed more interest in improving
University-Level Learners’ Beliefs
99
their oral communicative skills. The immersion context of this group of participants is put forward as a possible reason for the observed difference in beliefs. In addition, the language learning history of L2 English learners, who had reported significantly more previous grammar instruction than the other L2 learners, constitutes another possible causative factor. A content analysis of learner responses to the open-ended statements enabled the researchers to identify a number of general themes in the learners’ belief systems. Studying grammar was seen as central to the language learning endeavour, and it was regarded as generally beneficial. Grammar was also seen as helping with specific L2 skills such as writing. Many learners saw grammar as beneficial for extrinsic reasons such as academic success, and some learners reported that they enjoyed studying grammar. This latter sentiment was more noticeable in L2 learners who studied languages other than English. The most frequently cited reason for disliking the study of grammar was that it was boring, with characteristics such as difficult, confusing and complicated listed as well. Some learners felt that they would prefer practising speaking skills instead; they perceived a lack of relation between grammar and ‘real life’. Moreover, many participants reported a dislike for memorization. Complementing the large-scale research by Schulz (2001) and Loewen et al. (2009), Scheffler (2009) undertook a study which exclusively focused on L2 learners’ beliefs about the difficulty of certain grammatical structures and the usefulness of explicit instruction on these structures. The researcher administered a questionnaire to two groups of L1 Polish learners of L2 English at a college of modern languages in Poland. One group of learners (N = 50) was asked to rate the perceived difficulty of selected aspects of English grammar on a five-point scale ranging from ‘very easy’ to ‘very difficult’. Another group of learners (N = 50) was asked to rate the perceived usefulness of instruction on the same grammar aspects on a five-point scale ranging from ‘not useful at all’ to ‘very useful’. Learners had encountered all the targeted structures in their L2 classes. It was found that the participants generally perceived English grammar as fairly easy, while the usefulness of instruction was generally rated quite highly. Results further showed that, overall, if an area of grammar was considered difficult, instruction was considered useful.
Research issues and questions In summary, existing research indicates that metalinguistic knowledge in the sense of explicit knowledge about language is associated with L2 proficiency. Moreover, it has been demonstrated that explicit teaching and learning, including grammar instruction which makes use of metalinguistic knowledge, generally promote L2 learning. At the same time, studies suggest that learners’ levels of metalinguistic knowledge are influenced by individual difference variables; learner beliefs are part of an individual’s metacognition and thus likewise constitute an individual difference variable. It is reasonable to hypothesize that learners’ beliefs about explicit grammar instruction more generally and metalinguistic knowledge in particular are associated with their level of L2 metalinguistic knowledge. A small number of recent studies has sought
100
The Metalinguistic Dimension
to identify learner beliefs about explicit grammar instruction, but no research to date has investigated such beliefs in conjunction with learners’ actual metalinguistic knowledge. The present study addressed the following research questions: 1. What is learners’ metalinguistic knowledge about aspects of L2 grammar which are characterized by low explicit learning difficulty versus high explicit learning difficulty? 2. What are learners’ beliefs about metalinguistic knowledge? 3. What is the relationship between learners’ metalinguistic knowledge about the targeted aspects of L2 grammar, their beliefs about the difficulty of these grammar aspects and their beliefs about the usefulness of instruction on these grammar aspects? 4. What are learners’ reasons for liking and/or disliking explicit teaching and learning of grammar in the context of their L2 learning?
Research design, participants and methodology The research questions were addressed in a study which employed both quantitative and qualitative methods. The instruments used were an L2 English placement test, a test of L2 metalinguistic knowledge, a questionnaire aimed at gathering biodata and background information as well as eliciting learners’ beliefs and semi-structured interviews which were conducted with a sub-sample of participants in order to gain deeper insights into learners’ belief systems. These instruments are described below. Informed written consent was obtained from all participants before the study began. A total of 64 university-level learners of L2 English participated in the study (53 females, 11 males). The participants’ L1 is Thai, and they were fourth-year undergraduates majoring in English at a university in Thailand. At the time of data collection, the learners were in the first semester of the final year of their degree programme. They were aged between 18 and 23 years (mean = 21.4). On average, the participants had learned English for 12.9 years. In Thailand, English is a compulsory subject at all levels of education. Overall, the teaching and learning of English is characterized by explicit instruction with a focus on forms and considerable reliance on metalinguistic knowledge; textbooks which include grammatical descriptions and explanations are widely used (see also McDonough and Chaikitmongkol, 2007). At primary level, students are introduced to basic grammatical concepts and terminology (e.g. noun, verb, adjective, subject, object). From the secondary level onwards, pedagogical grammar rules are in use. Typically, such rules are presented by the teacher, and students engage in controlled practice via gap-fill or multiple-choice exercises, writing at sentence level, etc. At the university attended by the participants, a focus on forms, explicit teaching of grammar rules and controlled practice of grammar points by means of oral and written exercises are part of most English courses in the first and second year of study. English courses in the third and fourth year focus on topic areas such as literature or English for Specific
University-Level Learners’ Beliefs
101
Purposes. As one would expect, learners at university level are exposed to more advanced metalinguistic concepts and rules than students at secondary or primary level. As the university attended by the participants is in a rural region of Thailand where few speakers of English can be found, students typically have little opportunity to practise English outside the classroom. The university English teachers are typically L1 Thai speakers. In addition to being asked about basic demographic information such as their gender and age, the participants were asked about their perceived level of L2 proficiency compared with native speakers of English (excellent – good – fair – poor), their views on the importance of achieving proficiency in the L2 (very important – important – not so important – not at all important) and their level of enjoyment in learning the L2 (very much – quite a lot – not really – not at all). At 61%, the majority of participants regarded their overall L2 proficiency as ‘fair’; 23% rated their proficiency as ‘good’, while the remaining 16% considered their proficiency to be ‘poor’. With regard to the importance of achieving proficiency in English, 59% of the participants stated that it was ‘very important’ for them, while 39% considered it ‘important’. There was only one participant who thought it was ‘not so important’ to become proficient in the L2. Most participants reported relatively high levels of enjoyment in learning English, with 56% stating that they enjoyed it ‘very much’ and 42% stating that they enjoyed it ‘quite a lot’. One participant stated that s/he did ‘not really’ enjoy learning English. In summary, the participants were relatively homogeneous in the self-assessment of their L2 proficiency, their views on the importance of becoming proficient in the L2 and the reported level of enjoyment in learning the L2, with generally positive attitudes in evidence. In order to obtain a more objective estimate of the participants’ level of English language proficiency, the pen-and-paper version of the Quick Placement Test of English for speakers of other languages (UCLES, 2004) was administered. The 60-item test takes 30 minutes to complete and assesses knowledge of English grammar, vocabulary and reading skill. The participants’ scores ranged from 14 to 36 (mean = 24, SD = 3.9). According to the test manual, a mean score of 24 is to be interpreted as L2 proficiency at the elementary level (roughly equivalent to CEFR level A2) – a result which was unexpected, given the participants’ language learning experience.
Test of L2 metalinguistic knowledge The test of L2 metalinguistic knowledge (henceforth: MLK test) was aimed at assessing participants’ explicit knowledge about selected grammatical structures. The test had been used in a previous study (Ziętek and Roehr, 2011). It consisted of 24 sentences targeting 12 aspects of English grammar which are typically taught to L2 learners in the classroom. The targeted aspects are summarized in Table 5.1. They were drawn from the pedagogical grammar literature (Murphy, 1994; Swan, 1995) and were intended to represent both low and comparatively higher explicit learning difficulty (R. Ellis, 2006). Explicit learning difficulty was operationalized following Roehr and Gánem-Gutiérrez (2009a). If an L2 aspect can be described by means of a metalinguistic rule which
102
The Metalinguistic Dimension
Table 5.1 Aspects of English grammar targeted in the MLK test Low explicit learning difficulty: Use of . . .
Higher explicit learning difficulty: Use of . . .
Third person present tense –s Plural of nouns Adverbs to modify verbs
Some vs any There as a preparatory subject Active vs passive voice with ergative verbs Question tags in positive and negative sentences Personal vs reflexive pronouns The preposition in to refer to parts of the day
Definite article to express specific reference and prior hearer knowledge Ordinal numbers to express a ranking Present progressive to express an on-going event or action
is relatively high in schematicity, relatively low in conceptual complexity, relatively high in truth value, and makes use of relatively non-technical metalanguage, explicit learning difficulty is low. Conversely, if an L2 aspect is described by a rule that is low in schematicity, relatively high in conceptual complexity, relatively low in truth value and makes use of relatively technical metalanguage, explicit learning difficulty is high. Schematicity refers to the extent to which a linguistic construction is schematic or specific, and it likewise refers to whether a metalinguistic description covers a schematic or a specific linguistic construction. The metalinguistic description ‘Since denotes a period of time commencing at a specific point in the past and continuing into the present’ is low in schematicity because since is a single specific linguistic construction. By contrast, the metalinguistic description ‘Adverbs can be positioned sentence-initially and finally and also between the subject and the verb, but not between the verb and the direct object’ is high in schematicity because it covers all adverbs, i.e. a schematic linguistic construction. Conceptual complexity refers to the number of elements that need to be taken into account when applying a metalinguistic description, i.e. the number of categories and relations between categories included. The description ‘As an on-going event or action is being expressed, the present progressive is required’ is low in conceptual complexity because it is a simple proposition consisting of two categories (on-going event or action, present progressive) and one relation. Truth value refers to the extent to which a metalinguistic description applies without exception. Technicality of metalanguage refers to the relative familiarity and abstractness of the metalinguistic terminology used in a metalinguistic description. For further details on the development of the MLK test and the underlying taxonomy of explicit learning difficulty, the reader is referred to Roehr and Gánem-Gutiérrez (2009a) and Ziętek and Roehr (2011). The MLK test presented participants with 24 English sentences containing 1 highlighted error each. Participants were required to correct, describe and explain the highlighted errors. The description/explanation task tested learners’ ability to implement pedagogical grammar rules, since each targeted error could be described and explained by means of a statement of the type ‘As form X occurs/function X is being
University-Level Learners’ Beliefs
103
expressed, form Y needs to be used’. Essentially, the targeted description answered the question ‘What form?’, while the targeted explanation answered the question ‘Why this form?’. The participants were at liberty to use either Thai or English when completing the test. The test was untimed; all participants finished within 1 hour. The MLK test was scored dichotomously in accordance with a prepared answer key. For each appropriate error correction one point was awarded. For each adequate description and for each adequate explanation one point was awarded, respectively. Adequate descriptions and explanations were defined as any descriptions and explanations that were not incorrect and that showed at least some evidence of meaningful generalization beyond the instances provided in the test items themselves. The test was scored by the first author who is proficient in both Thai and English. As the scoring of descriptions and explanations required qualitative judgements, a second marker who is likewise proficient in Thai and English scored a random sample of 20% of the test papers. Inter-rater reliability was very high (r = 0.98, p < 0.01). Any remaining disagreements between the two markers were resolved through discussion. Sample items from the MLK test, sample participant answers and scores awarded can be found in the Appendix. The description/explanation section of the MLK test exhibited reasonable reliability (Cronbach’s α = 0.79). Following the removal of three poorly performing items, the correction section was likewise fairly reliable (α = 0.68). The test showed good overall reliability (α = 0.80). The maximum score that could be achieved on the MLK test was 69, with a maximum of 21 points for correction, 24 for description and 24 for explanation.
Beliefs questionnaire A tailor-made questionnaire containing closed-ended as well as open-ended items was used to identify participants’ beliefs about metalinguistic knowledge. The questionnaire consisted of four sections. Section 1 was aimed at eliciting biodata and background information, as reported above. Section 2 was modelled on the instrument used in Scheffler’s (2009) study. Participants were asked to indicate the perceived level of difficulty of the twelve aspects of L2 grammar included in the MLK test on a five-point scale (very easy – easy – moderate – difficult – very difficult). In addition, they were asked to indicate the perceived usefulness of instruction on these grammar aspects on a five-point scale (very useful – useful – moderately useful – of little use – not useful at all). Each of the 12 listed grammar aspects was illustrated with a concrete example. Participants’ responses were converted into numerical scores for statistical analysis (Dörnyei, 2003, 2007). Section 3 was based on the questionnaire used in Loewen et al.’s (2009) study. It consisted of 32 statements about the learning and teaching of grammar; the statements were either adopted directly from Loewen et al.’s (2009) instrument or adapted with slight modifications in wording. Both positive and negative statements were included. Participants were asked to respond to the statements on a five-point Likert scale (strongly agree – agree – neither agree nor disagree – disagree – strongly disagree). Responses were converted into numerical scores for statistical analysis (strongly agree = 5), with reverse scoring applied to negative items (strongly agree = 1) (Dörnyei, 2003, p. 37).
104
The Metalinguistic Dimension
Section 4 comprised two open-ended items which were likewise adopted from Loewen et al. (2009). Participants were asked to complete two statements in their own words, namely ‘I like studying grammar because . . .’ and ‘I don’t like studying grammar because . . .’. Participants’ responses to these items were subjected to a content analysis (Cohen et al., 2007; Silverman, 2000). The aim of the analysis was to uncover recurring patterns and themes in the data (Miles and Huberman, 1994); the identified themes were triangulated with learner responses to the closed-ended questionnaire items in section 3 and the interview data (see below). The participants completed the questionnaire immediately after the MLK test; they took about 30 minutes to respond to all items.
Semi-structured interviews about learner beliefs Semi-structured interviews were conducted with a sub-sample of nine participants in order to gain further insight into learners’ beliefs about metalinguistic knowledge in the context of their L2 learning. The interviewees were volunteers who represented different levels of metalinguistic knowledge as measured by the MLK test, with two participants scoring in the upper third, three in the middle third and four in the lower third of the range of MLK test scores. Interviews were conducted 2 weeks after the MLK test and beliefs questionnaire had been administered and took about 30 minutes each. Learners were asked whether they regarded knowledge of grammar as useful in their learning of English and why or why not, how they applied their knowledge of grammar, whether they thought it was worth teaching students grammar rules, whether they had found any limitations or disadvantages to grammar rules and what their overall opinion about grammar rules was. The interviews were conducted in Thai by the first author and audio-recorded with the participants’ permission. Subsequently, the recordings were transcribed and the transcripts subjected to a content analysis. The same analytic approach as outlined above for the open-ended questionnaire items was taken. All quotes from the interviews included here are in English translation.
Results Research question 1 The descriptive statistics reported in Table 5.2 serve to answer our first research question, which asked about the participants’ metalinguistic knowledge about the targeted 12 aspects of L2 grammar. The results in Table 5.2 indicate that all participants had at least some metalinguistic knowledge about the targeted grammar aspects. The range of scores suggests that there were differences between individual learners, as one would expect, but overall standard deviations are fairly small. It is clear from the descriptive statistics that grammar aspects which were judged by the test designers to be of low explicit learning difficulty were easier for the participants than grammar aspects which were judged to be of comparatively higher explicit learning difficulty; the mean facility value was 56% for
University-Level Learners’ Beliefs
105
Table 5.2 Descriptive statistics – MLK test No. of items
Mean % correct
Mean
SD
Min.
Max.
69 21 48
44 63 36
30.39 13.33 17.06
8.91 3.19 7.32
15 6 2
51 20 35
33 36 24 24
56 33 50 21
18.56 11.83 12.00 5.06
5.38 4.88 4.41 3.78
6 2 2 0
29 22 21 14
MLK test total Correction Description/ Explanation (D/E) Low difficulty total High difficulty total Low difficulty D/E High difficulty D/E
low-difficulty items as opposed to 33% for high-difficulty items. Moreover, the results demonstrate that the error correction task (mean 63% correct) was less challenging for the participants than the description/explanation task (mean 36% correct). This finding is in line with previous results based on the same test when administered to a different group of learners. Ziętek and Roehr (2011) found a mean facility value of 91% for the correction task and a mean facility value of 34% for the description/ explanation task in their sample of L1 Polish learners of English. It is interesting to note that the Polish learners were extremely successful at error correction, while the Thai learners in the present study were less adept at this task. By contrast, performance on the description/explanation task shows very similar achievement rates for both learner groups. This is noteworthy because there are several differences between the samples, including L1, mean age (21.4 in the present study, 18 in the case of the Polish group), learning context (university in the present study, secondary school in the case of the Polish group) and language learning experience (all Polish participants had learned at least one other L2 apart from English).
Research question 2 The second research question asked about learners’ beliefs about metalinguistic knowledge. The positive statements from section 3 of the beliefs questionnaire which attracted the strongest mean agreement from the participants are shown in Table 5.3; negative statements attracting the strongest mean agreement are listed in Table 5.4. As Table 5.3 demonstrates, participants agreed most strongly with statements about the usefulness of grammar rules for writing and reading. Moreover, they felt that studying grammar is necessary for mastering the English language. Learning grammar is considered a confidence-boosting activity and, less obviously perhaps, knowledge of grammar rules is seen as playing a role in fluency and communication. Interestingly, strong agreement with these positive statements co-occurs with strong agreement with negative statements about the role of grammar rules in communication and the issue of confidence (see Table 5.4). Some of the negative statements attracting the strongest agreement from participants refer to the difficulty of grammar rules, the potential confusion they can cause and feelings of nervousness in the absence of
The Metalinguistic Dimension
106
Table 5.3 Positive belief statements attracting the strongest agreement (strongly agree = 5) Item no.
Statement
7 3 13 16 1 19
Knowing a lot about grammar rules helps my writing. Knowing a lot about grammar helps my reading. Knowledge about grammar rules helps me to use English fluently. Learning grammar makes me feel confident. Studying grammar formally is essential for mastering English. Knowing grammar rules helps communication in English.
Mean 4.72 4.39 4.30 4.25 4.23 4.20
Table 5.4 Negative belief statements attracting the strongest agreement (strongly agree = 1) Item no.
Statement
8 21
Grammar rules are difficult to understand. It is more important to practise English in real-life situations than to learn grammar rules. I feel nervous when I don’t understand grammar rules. It is difficult to apply knowledge of grammar rules in real-life situations. Thinking too much about grammar rules makes me speak English less fluently. Learning grammar rules makes me confused.
24 11 4 29
Mean 1.95 2.05 2.14 2.27 2.33 2.45
understanding. These statements seem to represent the exact opposite of confidence. A similar pattern can be observed with regard to beliefs about the role of grammar rules in communication. Practising English in real-life situations is regarded as important, and thinking too much about grammar rules is seen as impeding fluent speech. The strong agreement with these negative belief statements contrasts with learners’ equally strong agreement with positive statements about the usefulness of grammar rules in communication. However, rather than being outright contradictions, these responses may suggest that the participants are aware of both advantages and disadvantages of studying and using grammar rules. Learners’ responses to items on section 2 of the beliefs questionnaire are shown in Table 5.5. Learners were asked to indicate the perceived level of difficulty of the 12 aspects of L2 grammar included in the MLK test as well as the perceived usefulness of instruction on these grammar aspects. With regard to perceived difficulty, it is worth noting that none of the targeted grammar points attracted a mean rating which was in the ‘difficult’ or ‘very difficult’ range. The participants’ usefulness-of-instruction ratings were quite unanimous, with all grammar points attracting a mean score above 4. In other words, instruction was generally considered ‘useful’ or ‘very useful’ in all instances.
Research question 3 The third research question asked about the relationship between learners’ metalinguistic knowledge about the 12 aspects of L2 grammar targeted in the MLK test,
University-Level Learners’ Beliefs
107
Table 5.5 Learner beliefs about targeted grammar aspects – Difficulty and usefulness of instruction Rank Difficulty 1 = very easy 5 = very difficult 1 2 3 4 5 6 7 8 9 10 11 12
Third person present tense –s Plural of nouns Present progressive Ordinal numbers (ranking) In with adverbials of time Use of the definite article There as a preparatory subject Adverbs modifying verbs Personal vs reflexive pronouns Use of some vs any Question tags Active vs passive with ergative verbs
Mean Usefulness of instruction 1 = not useful at all 5 = very useful 1.53 1.64 2.11 2.14 2.19 2.19 2.33 2.34 2.52 2.52 2.59 3.31
Third person present tense –s Ordinal numbers (ranking) Plural of nouns Present progressive In with adverbials of time Adverbs modifying verbs Use of some vs any Use of the definite article Personal vs reflexive pronouns There as a preparatory subject Question tags Active vs passive with ergative verbs
Mean
4.58 4.45 4.38 4.36 4.36 4.33 4.28 4.22 4.16 4.11 4.11 4.05
their beliefs about the difficulty of these grammar aspects and their beliefs about the usefulness of instruction on these grammar aspects. Results show that the participants regarded L2 grammar aspects of low explicit learning difficulty as defined by the test designers as significantly easier than aspects of higher explicit learning difficulty (t = –9.476; df = 63, p < 0.01, two-tailed). As the participants also performed better on items of low explicit learning difficulty than on items of comparatively higher explicit learning difficulty in the MLK test, this finding is indicative of a correspondence between learners’ beliefs and their actual knowledge. In addition, participants regarded instruction as significantly more useful for L2 aspects of low explicit learning difficulty than for L2 aspects of higher explicit learning difficulty (t = 3.804; df = 62, p < 0.01, two-tailed). This finding is reflected in the ranking presented in Table 5.5 and was confirmed statistically by a rank-order correlation calculated for perceived difficulty of a grammar aspect and perceived usefulness of instruction on that grammar aspect, according to the learners. A near-perfect correlation was obtained (rho = –0.99, p < 0.01, two-tailed). Thus, participants clearly regarded instruction as more useful when a grammar aspect was perceived to be easier.
Research question 4 The final research question asked about learners’ reasons for liking and/or disliking explicit teaching and learning of grammar in the context of their L2 learning. In order to address this question, participants’ responses to the open-ended items in section 4 of the questionnaire were analysed; in addition, the responses of the nine interviewees were taken into consideration.
108
The Metalinguistic Dimension
A total of 27 participants (42%) completed only the statement ‘I like studying grammar because . . .’, while 10 participants (16%) completed only the statement ‘I don’t like studying grammar because . . .’. The remaining 27 participants (42%) completed both statements. These figures suggest that the majority of participants (84%) expressed at least some positive views about the explicit teaching and learning of grammar, while just over half of the sample (58%) had at least some critical views. Analysis of learners’ continuations of the statement ‘I like studying grammar because . . .’ resulted in the identification of four main themes which were labelled cognitive/metacognitive reasons, affective reasons, practical/instrumental reasons and social/identity reasons. Cognitive/metacognitive reasons were by far the most frequently mentioned points, with participants stating that they liked studying grammar because ll
ll
ll ll
it is necessary or important for learning English; because English cannot be learned without it; because it is the basis of the language (10 occurrences); it is necessary or important or useful for writing English (27), speaking English (22), reading English (14) and listening to English (13); it is necessary for using English accurately and for correcting one’s own errors (15); it improves one’s understanding of how English works and it is useful for developing one’s language (6).
Affective reasons mentioned were statements that studying grammar gives a feeling of confidence (5 occurrences) and that it is interesting to explore the rules (1). Practical/ instrumental reasons concerned the usefulness of grammar for passing tests or exams and for successfully completing one’s degree programme (3). Social/identity reasons are perhaps the most interesting category, since they seemed to be primarily informed by considerations of social status, with the participants situating themselves as both potentially successful members of their university and potentially successful members of society at large. Learners stated that they liked studying grammar because it shows that one is educated (2), it is useful for one’s career (2), it will help one get a good job and become rich (1), it makes one look professional (1) and one has to be an expert on grammar if one majors in English (1). Analysis of learners’ continuations of the statement ‘I don’t like studying grammar because . . .’ yielded responses that fell within three of the four main themes discussed above, that is, cognitive/metacognitive reasons, affective reasons and practical/ instrumental reasons. Cognitive/metacognitive reasons were mentioned most frequently, with participants stating that they did not like studying grammar because ll ll ll ll
it is complicated or difficult to understand (25 occurrences); it is or can be confusing (16); there are many rules (10); the rules are difficult to memorize or remember (10).
Affective reasons mentioned were statements that studying grammar is boring (5 occurrences), that it makes one lose confidence or feel stressed or worried (3) or
University-Level Learners’ Beliefs
109
that ‘it’s just not me’ (1). The practical/instrumental reason which led to the dislike of studying grammar was the perception that grammar is not always useful when communicating in English or using English in ‘real life’ (4). Data from the semi-structured interviews with nine participants were triangulated with the questionnaire data. Some participants’ contributions more or less directly reflected the predominant views identified in the responses to sections 3 and 4 of the questionnaire. As learners saw the belief statements in section 3 prior to responding to the open-ended items in section 4, their answers to the open-ended items were probably influenced to at least some extent by what they had just read in the closed-ended items. By the same token, the interviews followed questionnaire administration, though less immediately. Some participants – Manee, Lee, Pen (all names are pseudonyms) – primarily repeated in their interviews what had already been stated in the questionnaire responses, while others – Wanna and Walai – discussed in more differentiated ways both the potential advantages and the potential disadvantages of grammar rules in L2 learning, thus elaborating in more detail on what had been stated in the questionnaire responses. Overall, however, the views of these learners corresponded very closely to the beliefs as represented in Tables 5.3 and 5.4 and as summarized in the analysis of cognitive/metacognitive reasons, affective reasons and practical/instrumental reasons for liking or disliking the study of grammar. The interview responses of the remaining four participants – Lila, Chanya, Suda, Pranee – included references to social/identity issues. This theme was not represented in the closed-ended questionnaire items, but emerged in learners’ answers to the open-ended item ‘I like studying grammar because . . .’ . It is revisited in the interviews by these four participants. Lila points out that ‘when we communicate with foreigners or other people, our knowledge of grammar can indicate that we are educated’. Chanya highlights that knowledge of grammar is ‘very important for English major students because our writing performance can indicate our level of knowledge of English’. Later on, she states: ‘Yes, it is really worth teaching grammar rules. As many of my teachers have told me, as English major students we should be accurate with grammar rules, not just be users of the language.’ This comment suggests that Chanya’s tutors encouraged the association of high levels of explicit knowledge and grammatically accurate use of English with the participants’ status as (future) experts in English and, by implication, as educated, valued and valuable members of society. Suda takes the same position when she states that the teaching of grammar rules ‘is very necessary for English major students’. Pranee’s response reflects a similar point of view, though less directly: ‘I should be accurate with the use of grammar’, and later: ‘Grammar needs to be taught since it is part of learning English (. . .) Since I have chosen to study English, I think grammar rules are still important.’
Discussion It is now possible to bring together the findings reported above in order to arrive at an overview of the participants’ belief systems in relation to metalinguistic knowledge.
110
The Metalinguistic Dimension
Positive beliefs about metalinguistic knowledge With regard to beliefs about the explicit teaching and learning of grammar in general, it was found that the Thai university students participating in the present study had more positive than negative views. This finding is consistent with the results obtained by Schulz (2001) for Colombian post-secondary students. Over 75% of the Colombian participants agreed with the statements ‘I like the study of grammar’ and ‘My foreign language improves most quickly if I study and practise the grammar’. In the present study, the Thai learners strongly agreed with belief statements about the importance of studying grammar for mastering English, the usefulness of grammar knowledge when writing or reading in English, the role of grammar rules in fluency and communication and the feeling of confidence that can be derived from studying grammar. In addition, 84% of participants expressed at least some positive views about the explicit teaching and learning of grammar when completing the open-ended statement ‘I like studying grammar because . . .’. In the cognitive/metacognitive domain, the most frequently cited reasons reflected the closed-ended belief statements learners most strongly agreed with; in addition, participants mentioned the importance of studying grammar for accurate use of the L2 and for correction of one’s own errors. In some respects, these findings mirror the results obtained by Loewen et al. (2009) from learners with different L1 backgrounds studying a range of languages in the United States. These students shared the Thai learners’ beliefs about the centrality of grammar in the language learning endeavour generally as well as the usefulness of grammar for reading and writing. There are also some contrasts in evidence, however. The learners in the United States tended to mention practical/instrumental reasons such as academic success and affective reasons such as enjoyment of the study of grammar as an end in itself; reasons such as these arose in the present study as well, but they were much less prominent. Interestingly, in the present study, affective reasons relating to the feeling of confidence which may be gained from studying grammar were mentioned only five times in the open-ended statements, although the mean agreement rate to the corresponding closed-ended statement was sufficiently high to put it in the top six of beliefs attracting the strongest agreement (see Table 5.3). It is worth noting that the range of scores for this item was small, with none of the participants actually disagreeing with the statement ‘Learning grammar makes me feel confident’. In other words, it is not the case that a few particularly enthusiastic participants pushed up the mean, while the majority was neutral about the statement. A possible explanation for the apparent discrepancy between the closed-ended and the open-ended statements about confidence may lie in the participants’ interpretation of the concept. The open-ended statement ‘I like studying grammar because . . .’ and the interviews revealed a theme in the learners’ belief systems which was not directly represented in the closed-ended items of the questionnaire, namely, the role of social/ identity reasons. This theme appeared exclusively in conjunction with positive attitudes towards the study of grammar. Participants stated, for instance, that studying grammar and being able to demonstrate a good grasp of grammar rules is necessary for students majoring in English since they are expected to be experts, it makes one look professional,
University-Level Learners’ Beliefs
111
and shows that one is educated. Such beliefs may be closely associated with confidence in the participants, since being an educated, professional and highly knowledgeable individual should instil just such a feeling. Thus, it is possible that associations with one’s social role and identity as a member of society at large were foremost in the learners’ minds when responding to the closed-ended statement ‘Learning grammar makes me feel confident’. In other words, in addition to confidence which arises out of gaining and maintaining cognitive control over the knowledge domain that is the English language, i.e. confidence in the sense of self-efficacy (Hsieh and Kang, 2010), learners were equally, or perhaps even more concerned with confidence which arises out of how the learner sees him/herself as being perceived by others, i.e. confidence in a wider, social sense.
Negative beliefs about metalinguistic knowledge Negative beliefs about the teaching and learning of grammar in general were in evidence as well, even though they were less prominent than positive beliefs, with 58% of participants expressing at least some critical views when completing the statement ‘I don’t like studying grammar because. . . .’ The negative closed-ended statements attracting the strongest agreement from participants refer to the difficulty of grammar rules, the potential confusion they can cause, feelings of nervousness in the absence of understanding, the importance of practising English in real-life situations, the difficulty of applying grammar rules in such situations and the view that thinking too much about grammar rules will impede fluent speech. All these reasons are directly reflected in the participants’ open-ended statements, but the distribution is slightly different. In the open-ended statements, the overwhelming emphasis was on the difficulty of grammar rules, the potential for confusion, the sheer quantity of rules to be learned and the challenges inherent in memorizing them all. Affective reasons such as boredom, nervousness or loss of confidence were listed much less frequently; likewise, the lack of usefulness of grammar rules in real-life communication was also much less prominent in the open-ended statements. The reasons for disliking the study of grammar given by the Thai learners comprise the same themes as the reasons provided by the learners in the United States who were studied by Loewen et al. (2009). Differences between the two samples are above all in evidence in the frequency distribution. The most frequently cited reason for disliking the study of grammar by the US learners was the view that it is boring. This reason only played a subordinate role for the Thai learners. A second difference lies in the fact that the US learners had exclusively negative associations between the study of grammar and its role in developing speaking skills and succeeding with real-life communication in the L2. While the same theme arose with the Thai learners, they also had strong positive associations between the study of grammar and its role in fluency and communication. Indeed, in the open-ended statement ‘I like studying grammar because . . .’, importance or usefulness in connection with speaking English was mentioned no fewer than 22 times, while it was mentioned 13 times in connection with listening. Thus, the Thai learners saw both potential advantages and potential disadvantages of studying
112
The Metalinguistic Dimension
grammar in connection with speaking and communicating in the L2; by contrast, the learners in the United States did not show such differentiation. A possible reason for this discrepancy in beliefs is mentioned by Loewen et al. (2009) when they discuss differences between L2 learners of English and learners of other L2s. The L2 learners of English found themselves in an immersion setting and therefore regularly used the L2 outside the classroom. This may help explain why they were overall less enthusiastic about grammar instruction while at the same time showing more interest in improving their communicative skills. As the Thai leaners in the present study were learning English at a Thai university and had had no immersion experience, beliefs about the potential usefulness of grammar study for speaking English may have remained intact.
Beliefs about metalinguistic knowledge and actual metalinguistic knowledge In the present study, we investigated not only learners’ beliefs about the explicit teaching and learning of grammar generally, but also their beliefs about the difficulty and usefulness of instruction on 12 specific grammar points. Moreover, we were able to relate these beliefs to actual performance on a test of metalinguistic knowledge targeting the same 12 grammar points. The analysis of learners’ beliefs about the difficulty and usefulness of instruction on the 12 grammar aspects targeted in the MLK test yielded a number of noteworthy results. With regard to metalinguistic performance, it was found that learners did better on items of low explicit learning difficulty than on items of comparatively higher explicit learning difficulty, as expected. Participants’ beliefs about the difficulty of specific grammar points were fully in line with this finding, i.e. they judged grammar points of low explicit learning difficulty to be significantly easier than grammar points of higher difficulty. Thus, beliefs about metalinguistic knowledge and actual metalinguistic knowledge were in close correspondence in this instance. More interestingly perhaps, participants considered instruction significantly more useful for grammar points of low explicit learning difficulty than for grammar points of higher difficulty. This view is reflected in the difficulty versus usefulness-of-instruction ranking presented in Table 5.5 and the near-perfect negative correlation between the two variables. In other words, the Thai learners considered instruction more useful when a grammar aspect was perceived to be easier. This finding is in direct contrast with previous research. In his study with Polish learners of L2 English, Scheffler (2009) obtained the opposite result. The Polish learners found instruction particularly useful when a grammar aspect was perceived to be more difficult. Although the present study used the same rating scale as Scheffler (2009), there are a number of differences in research design which might help explain the opposing findings to at least some extent. First and foremost, the L2 grammar points targeted were not the same; moreover, the participants in the present study completed the MLK test, which measured actual explicit knowledge of the targeted grammar points, prior to making their difficulty and usefulness-of-instruction judgements. The Polish learners were not tested for explicit knowledge and thus made decontextualized judgements. Finally, the difficulty and usefulness-of-instruction judgements were made by two
University-Level Learners’ Beliefs
113
different groups of participants in Scheffler’s (2009) study, while in the present study the same sample of learners made both judgements. It is possible that the inclusion of the MLK test in the research design of the present study led to a task effect. As the descriptive statistics reported in Table 5.2 indicate, the test was quite challenging, especially the description/explanation task. In conjunction with the learners’ self-assessment of their overall L2 proficiency as ‘fair’ and their actual performance on the English placement test in the elementary range, this may have led to a decrease in confidence in their own abilities and knowledge; accordingly, more difficult L2 grammar points may have seemed beyond their grasp, whereas less difficult points may have been perceived as better suited to instruction with a view to subsequent mastery. This line of argument is speculative, though.2 At the same time, it is worth noting that the generally high scores for usefulness of instruction and the generally lower scores for perceived difficulty (see Table 5.5) quite closely reflect the tendencies which were in evidence in Scheffler’s (2009) study. The Polish learners likewise generally perceived the English grammar points they were asked about as fairly easy, while usefulness of instruction was generally rated quite highly.
Conclusion The present study was aimed at investigating university-level learners’ beliefs about metalinguistic knowledge in conjunction with their actual metalinguistic performance. We examined participants’ beliefs about the explicit teaching and learning of grammar generally as well as their beliefs about the difficulty and usefulness of instruction on 12 aspects of English grammar. In addition, we assessed participants’ metalinguistic knowledge about these grammar points. The findings indicate that our learners’ belief systems are complex constructs. Three key points are worth noting. First, in accordance with previous research, the participants held many positive beliefs about the explicit teaching and learning of grammar. Studying grammar was deemed important for mastering the L2, for reading and writing in the L2, for accurate use of the L2 and for self-correction. Negative beliefs focused above all on the potential complexities of grammar rules, the potential for confusion and the challenges of memorizing a large number of rules. In contrast with existing research, the learners in the present study had both positive and negative beliefs about the usefulness of grammar for communicating in the L2. Second, our findings point towards a correspondence between learners’ beliefs and their actual metalinguistic knowledge. Greater perceived difficulty and less successful test performance went hand in hand. Third, our findings suggest that confidence may be a multi-dimensional concept which can be interpreted in different ways. Focusing on the individual, confidence in the sense of self-efficacy can arise out of perceived cognitive control and the perceived understanding of a knowledge domain. We used this interpretation to explain the finding that participants considered instruction more useful on grammar points that were deemed less difficult. We proposed that more difficult grammar points may have
114
The Metalinguistic Dimension
been perceived by the participants as being beyond their cognitive reach; they had little confidence in their own abilities and themselves as learners of English. Focusing on the collective, confidence may also arise out of one’s perceived identity as a member of society and of how one believes to be perceived by others. We posited that this interpretation might have been foremost in the participants’ minds when they responded to closed-ended belief statements which included the term ‘confidence’, when they used the term productively, and when they put forward social/identity reasons for liking the study of grammar. In this interpretation, confidence is based on how one is viewed in one’s role as an English major student (future expert on the language) and one’s role as a member of society (educated future professional with a promising career). In this interpretation, confidence was strongly associated with positive beliefs about the explicit teaching and learning of grammar and the importance and usefulness of metalinguistic knowledge. It is tempting to speculate about culturally based reasons for interpreting confidence in the collective sense in addition to the individualistic sense that focuses on self-efficacy. However, caution is in order here because existing research about the role of culture in learners’ belief systems has led to diverging findings. While Schulz (2001) identified a number of differences in beliefs about grammar instruction between US and Colombian students, the review by Horwitz (1999) failed to pinpoint clear cultural differences in general beliefs about language and language learning of US participants on the one hand and Taiwanese and Korean participants on the other hand. Addressing the concept of confidence in a targeted cross-cultural comparison would be an interesting avenue for future research in the area of L2 learner beliefs about metalinguistic knowledge.
Appendix Sample items from the MLK test, sample participant answers and scores awarded; all explanations are translations from Thai, unless indicated otherwise. Item 5: I’m afraid we will have to think quick. Correction: quickly Explanation: ‘quick’ is an adjective. Therefore it is wrong. Must change ‘quick’ to ‘quickly’ to become an adverb because an adverb modifies a verb, which is ‘think’. In sum, ‘quick’ is an adjective which cannot modify a verb. (Participant 1) Scores awarded: Correction = 1, Description = 1, Explanation = 1 Correction: quickly Explanation: quick is a verb which requires an object, in this case placed at the end of the sentence without an object. Thus, must change it to be an adjective by adding –ly at the end so it becomes quickly. (Participant 5) Scores awarded: Correction = 1, Description = 0, Explanation = 0
University-Level Learners’ Beliefs
115
Item 6: On the six day, the workers finally decided to stop the strike. Correction: sixth Explanation: Because it is a rank order. (Participant 32) Scores awarded: Correction = 1, Description = 0, Explanation = 1 Item 7: Can you please be quiet? I work. Correction: am working Explanation: use present continuas [sic]. (Participant 43; explanation was provided in English) Scores awarded: Correction = 1, Description = 1, Explanation = 0 Item 9: It was a lovely evening, was it? Correction: isn’t Explanation: This sentence is a question tag which wants to ask at the end of the sentence. The rule of question tag: if the previous [sentence] is affirmative, the latter must always be negative. (Participant 6) Scores awarded: Correction = 0, Description = 1, Explanation = 1
Notes 1 As pointed out by a reviewer, researchers often use essentially metalinguistic tasks such as grammaticality judgements to measure L2 development; studies aimed at assessing L2 learning or L2 proficiency by means of such tasks in connection with individual difference variables are not reviewed here. 2 A reviewer suggested two further possible reasons for the discrepancy in findings. First, the Polish learners in Scheffler’s (2009) study and the Thai learners in the present study have probably been exposed to different types of L2 instruction. Second, the Polish learners were of a more advanced level of proficiency than the Thai learners when the respective studies were undertaken.
References Alderson, J. C., Clapham, C. and Steel, D. (1997), ‘Metalinguistic knowledge, language aptitude and language proficiency’. Language Teaching Research, 1, 93–121. Benson, P. and Lor, W. (1999), ‘Conceptions of language and language learning’. System, 27, (4), 459–72. Bialystok, E. (1994), ‘Representation and ways of knowing: Three issues in second language acquisition’, in N. C. Ellis (ed.), Implicit and Explicit Learning of Languages. London: Academic Press, pp. 549–70. Camps, J. (2003), ‘Concurrent and retrospective verbal reports as tools to better understand the role of attention in second language tasks’. International Journal of Applied Linguistics, 13, (2), 201–21. Cohen, L., Manion, L. and Morrison, K. (2007), Research Methods in Education (6th edn). London: Routledge. Cotterall, S. (1999), ‘Key variables in language learning: What do learners believe about them?’ System, 27, (4), 493–513.
116
The Metalinguistic Dimension
DeKeyser, R. M. (2003), ‘Implicit and explicit learning’, in C. J. Doughty and M. H. Long (eds), The Handbook of Second Language Acquisition. Malden, MA: Blackwell, pp. 313–48. Dörnyei, Z. (2003), Questionnaires in Second Language Research: Construction, Administration, and Processing. Mahwah, NJ: Erlbaum. — (2005), The Psychology of the Language Learner: Individual Differences in Second Language Acquisition. Mahwah, NJ: Erlbaum. — (2007), Research Methods in Applied Linguistics: Quantitative, Qualitative, and Mixed Methodologies. Oxford: Oxford University Press. Dörnyei, Z. and Skehan, P. (2003), ‘Individual differences in second language learning’, in C. J. Doughty and M. H. Long (eds), The Handbook of Second Language Acquisition. Malden, MA: Blackwell, pp. 589–630. Elder, C. and Manwaring, D. (2004), ‘The relationship between metalinguistic knowledge and learning outcomes among undergraduate students of Chinese’. Language Awareness, 13, (3), 145–62. Elder, C., Warren, J., Hajek, J., Manwaring, D. and Davies, A. (1999), ‘Metalinguistic knowledge: How important is it in studying a language at university?’ Australian Review of Applied Linguistics, 22, (1), 81–95. Ellis, N. C. (1994), ‘Consciousness in second language learning: Psychological perspectives on the role of conscious processes in vocabulary acquisition’. AILA Review, 11, 37–56. Ellis, R. (2004), ‘The definition and measurement of L2 explicit knowledge’. Language Learning, 54, (2), 227–75. — (2006), ‘Modelling learning difficulty and second language proficiency: The differential contributions of implicit and explicit knowledge’. Applied Linguistics, 27, (3), 431–63. Flavell, J. H. (1979), ‘Metacognition and cognitive monitoring: A new area of cognitive-developmental inquiry’. American Psychologist, 34, 906–11. Hacker, D. J. (1998), ‘Definitions and empirical foundations’, in D. J. Hacker, J. Dunlosky and A. C. Graesser (eds), Metacognition in Educational Theory and Practice. Mahwah, NJ: Erlbaum, pp. 1–23. Horwitz, E. K. (1987), ‘Surveying student beliefs about language learning’, in A. L. Wenden and J. Rubin (eds), Learner Strategies in Language Learning. London: Prentice Hall, pp. 119–29. — (1988), ‘The beliefs about language learning of beginning university foreign language students’. Modern Language Journal, 72, (3), 283–94. — (1999), ‘Cultural and situational influences on foreign language learners’ beliefs about language learning: A review of BALLI studies’. System, 27, (4), 557–76. Hsieh, P. P.-H. and Kang, H.-S. (2010), ‘Attribution and self-efficacy and their relationship in the Korean EFL context’. Language Learning, 60, (3), 606–27. Hu, G. (2011), ‘Metalinguistic knowledge, metalanguage, and their relationship in L2 learners’. System, 39, (1), 63–77. Hulstijn, J. H. (2005), ‘Theoretical and empirical issues in the study of implicit and explicit second-language learning: Introduction’. Studies in Second Language Acquisition, 27, (2), 129–40. Leow, R. P. (2000), ‘A study of the role of awareness in foreign language behavior: Aware versus unaware learners’. Studies in Second Language Acquisition, 22, (4), 557–84. Loewen, S., Li, S., Fei, F., Thompson, A., Nakatsukasa, K., Ahn, S. and Chen, X. (2009), ‘Second language learners’ beliefs about grammar instruction and error correction’. Modern Language Journal, 93, (1), 91–104.
University-Level Learners’ Beliefs
117
McDonough, K. and Chaikitmongkol, W. (2007), ‘Teachers’ and learners’ reactions to a task-based EFL course in Thailand’. TESOL Quarterly, 41, (1), 107–32. Miles, M. B. and Huberman, A. M. (1994), Qualitative Data Analysis (2nd edn). Thousand Oaks: Sage. Murphy, R. (1994), English Grammar in Use (2nd edn). Cambridge: Cambridge University Press. Norris, J. M. and Ortega, L. (2001), ‘Does type of instruction make a difference? Substantive findings from a meta-analytic review’. Language Learning, 51, (1), 157–213. Roehr, K. (2008a), ‘Linguistic and metalinguistic categories in second language learning’. Cognitive Linguistics, 19, (1), 67–106. — (2008b), ‘Metalinguistic knowledge and language ability in university-level L2 learners’. Applied Linguistics, 29, (2), 173–99. Roehr, K. and Gánem-Gutiérrez, G. A. (2009a), ‘Metalinguistic knowledge: A stepping stone towards L2 proficiency?’, in A. Benati (ed.), Issues in Second Language Proficiency. London: Continuum, pp. 79–94. — (2009b), ‘The status of metalinguistic knowledge in instructed adult L2 learning’. Language Awareness, 18, (2), 165–81. Rosa, E. and O’Neill, M. D. (1999), ‘Explicitness, intake, and the issue of awareness’. Studies in Second Language Acquisition, 21, 511–56. Scheffler, P. (2009), ‘Rule difficulty and the usefulness of instruction’. ELT Journal, (1), 5–12. Schulz, R. A. (2001), ‘Cultural differences in student and teacher perceptions concerning the role of grammar instruction and corrective feedback: USA-Colombia’. Modern Language Journal, 85, (2), 244–58. Silverman, D. (2000), Doing Qualitative Research. London: Sage. Sorace, A. (1985), ‘Metalinguistic knowledge and language use in acquisition-poor environments’. Applied Linguistics, 6, (3), 239–54. Swan, M. (1995), Practical English Usage (2nd edn). Oxford: Oxford University Press. UCLES. (2004), Quick Placement Test. Oxford: Oxford University Press. Wenden, A. L. (1998), ‘Metacognitive knowledge and language learning’. Applied Linguistics, 19, (4), 515–37. — (1999), ‘An introduction to metacognitive knowledge and beliefs in language learning: Beyond the basics’. System, 27, (4), 435–41. White, C. (2008), ‘Beliefs and good language learners’, in C. Griffiths (ed.), Lessons from Good Language Learners. Cambridge: Cambridge University Press, pp. 121–30. Yang, N.-D. (1999), ‘The relationship between EFL learners’ beliefs and learning strategy use’. System, 27, (4), 515–35. Ziętek, A. A. and Roehr, K. (2011), ‘Metalinguistic knowledge and cognitive style in Polish classroom learners of English’. System, 39, (4), 417–26.
6
Metalinguistic Awareness in Third Language Phonological Acquisition Magdalena Wrembel
Adam Mickiewicz University, Poznań
Abstract This contribution explored metalinguistic awareness in the acquisition of third language phonology. It provides new insights into this issue by expanding the investigation beyond the second language acquisition perspective and embracing the recently recognized discipline of third language acquisition. The study investigates metaphonological awareness with the application of stimulated recall protocols, i.e. verbalized reports aimed at disclosing participants’ mental processes. The participants were Polish native speakers with a very good command of English as their L2, and a more limited competence in French or German as their L3. L3 speech samples were collected by means of introspective and retrospective protocols in which the participants were asked to improve and comment on their L3 pronunciation after listening to an excerpt of their previous performance. The study explored whether trilingual speakers of typologically related and unrelated sets of languages tend to resort to L1 or L2 transfer in L3 speech performance, and to what extent they were aware of this phenomenon. The findings provide evidence for qualitative and quantitative differences in reported metaphonological awareness manifested through the participants’ reflective metalinguistic analysis of their own L3 performance, their intentional focus on articulatory gestures, their self-awareness of problems in L3 pronunciation and a considerable degree of processing control.
Introduction Metalinguistic awareness Metalinguistic awareness is acknowledged as a significant component of language proficiency and is commonly ascribed a facilitative role in foreign language acquisition (Herdina and Jessner, 2002; Gombert, 1992). In an attempt to define metalinguistic
120
The Metalinguistic Dimension
or language awareness, James (1999, p. 97) proposed that it consists of ‘a mix of knowledge of language in general and specifically, a command of metalanguage (standard or ad hoc), and the conversion of intuitions to insight and then beyond to metacognition’. James (1999) further distinguished between language awareness as such (i.e. language awareness as metacognition) and consciousness-raising (i.e. language awareness as cognition). The former begins with one’s own intuitions and can relate to one’s knowledge about language through reflection, whereas the latter starts with one’s objective linguistic knowledge and personalizes it. Jessner (2006, p. 42) summarizes various attempts to define metalinguistic awareness by stating that it ‘refers to the ability to focus attention on language as an object in itself or to think abstractly about language and, consequently, to play with or manipulate language’. The question of the role of awareness in foreign language learning is a complex one and has been discussed extensively in the literature. A vocal proposal was put forward by Schmidt (1990, p. 131), who made the following distinction between three types of consciousness: (1) consciousness as awareness, (2) consciousness as intention and (3) consciousness as knowledge. Consciousness as awareness was further subdivided according to different levels of awareness ranging from ‘perception’ which may not always be conscious, via ‘noticing’ which involves focal awareness, and ‘understanding’, i.e. conscious analysis. Addressing the role of consciousness in input processing, Schmidt (1990) proposed the so-called noticing hypothesis, according to which conscious awareness at the level of noticing is essential for the development of L2 proficiency. This process refers to learners’ comparison between the observed input and their own production at a particular stage of development of the interlanguage system. One of the most persistent problems from the perspective of foreign language learning is the distinction between intuitive language awareness and metalinguistic knowledge. Reflecting this contrast, Gombert (1992) distinguishes between epilinguistic and metalinguistic awareness. Epilinguistic awareness is unconscious, spontaneous and contextualized and can thus be exemplified by instances of self-repair in speech performance. Metalinguistic awareness, on the other hand, is decontextualized, conscious and intentional and involves, for instance, language analysis in the classroom or conscious reflection on language properties.
Metalinguistic awareness in multilinguals Looking at metalinguistic awareness from a multilingual perspective, Jessner (2006) provides evidence that it is an essential component of multilingual proficiency. In her study, Jessner (2006) investigated the nature of linguistic awareness in German-Italian bilinguals acquiring L3 English through the application of think-aloud protocols. By focusing on lexical problem-solving behaviours, Jessner explored the compensatory strategies of L3 acquirers (e.g. language switches, foreignizing, coinage, approximation) that evidenced indirectly the participants’ language awareness as well as instances of explicit metalinguistic awareness (e.g. the use of metalanguage). She concluded that multilinguals make use of this ability much more frequently than monolinguals, as ‘linguistic objectivation is the multilingual’s most characteristic cognitive ability’
Third Language Phonological Acquisition
121
(Jessner, 2006, p. 42). Consequently, Jessner proposed an extension of traditional definitions of metalinguistic awareness to include a component of cross-linguistic awareness that interacts with metalinguistic consciousness in multilingual processing (Jessner, 2006, p. 116). The role of an increased level of metalinguistic awareness, which forms an integral part of multilingual proficiency, is emphasized in the Dynamic Model of Multilingualism developed by Herdina and Jessner (2002). The model assumes, among other things, the non-linear nature of language growth and the cumulative interdependence between language systems of a multilingual (i.e. L1, L2, L3, etc.), resulting in complex cross-linguistic interactions. The formula of multilingual proficiency, as proposed by Herdina and Jessner (2002), also accounts for the so-called multilingualism factor, which is defined as consisting of increased metalinguistic awareness, an enhanced monitor and the catalytic effect of the third language (Jessner, 2006, p. 34). Based on extensive research on third language acquisition, some researchers including Hufeisen (1997), Cenoz (2003) and Jessner (2006) claim that multilingual learners tend to develop a higher level of metalinguistic awareness as well as metacognitive strategies and, as a result, demonstrate linguistic and cognitive advantages over their monolingual counterparts. This advantage is attributed, to a large extent, to the metalinguistic awareness in multilinguals as evidenced, for instance, by the seminal work of Pearl and Lambert (1962) or Bialystok et al. (2004).
The third language acquisition perspective The focus of the present contribution falls on the phonological aspects of metalinguistic awareness, understood as intentional focus on phonetic forms and articulatory gestures during foreign language speech performance. This investigation is conducted within the framework of third language acquisition, which is a young area of study with a limited body of research (e.g. Hammarberg and Hammarberg, 2005; Gut, 2010, Llama et al., 2010; Rothman et al., 2013; Wrembel, 2010). The third language (L3), often referred to as ‘Third or Additional Language’ (De Angelis, 2007), is understood as any language learned after the second language (L2), i.e. the first foreign language acquired by an individual. Third language acquisition has recently been acknowledged as a separate sub-discipline from second language acquisition because it involves a more complex context of acquisition and a qualitative difference in language learning and processing (Cenoz et al., 2001; Herdina and Jessner, 2002). Moreover, some researchers claim that the study of L3 acquisition is a potential source of unique new insights into the process of language learning that cannot be gained by investigating the first or second language alone (Flynn et al., 2004). Phonological acquisition in a multiple-language setting is of a very complex nature since phonological transfer is not limited to L1, but involves other, earlier acquired foreign language systems (L2). Apart from negative interference, there is a potential for positive transfer, especially due to metalinguistic awareness gained in L2 acquisition and the application of specific learner strategies to learning a subsequent foreign language. Traditionally, non-native languages have not been regarded as significant sources of cross-linguistic influence, in contrast to a widely attested transfer from the L1 of
122
The Metalinguistic Dimension
multilingual learners (e.g. Pyun, 2005; Ringbom, 1987). Nonetheless, there have been several studies that testify to the complex interference between foreign languages acquired as second and third ones, and which point to the existence of the so-called foreign language effect or L2 status in L3 phonological acquisition (e.g. Cenoz, 2001; Hammarberg and Hammarberg, 1993, 2005; Williams and Hammarberg, 1998). The results of these studies on third language acquisition demonstrate that the previously acquired foreign language (L2) tends to be activated in L3 performance, thus suppressing L1 influence. Several factors have been identified in the literature as conditioning cross-linguistic transfer in the acquisition of a foreign language, particularly of a third language, including typological distance or psychotypology, foreign language effect, proficiency level, recency of use and the context of interaction (Williams and Hammarberg, 1998; Cenoz, 2001; Jessner, 2006). One of the strongest predictors of cross-linguistic influence is the typological distance between the languages being learned; semantically and categorically related linguistic structures are likely to lead to transfer. However, the actual linguistic typology and transferability are conditioned, to a large extent, by the learner’s perception of language distance, i.e. psychotypology (Kellerman, 1979). Studies on third language acquisition have demonstrated that when the multilingual learners’ L1 is typologically unrelated to their newly acquired L3, they tend to transfer linguistic knowledge from their L2s (e.g. Ringbom, 1987; Cenoz, 2001; De Angelis and Selinker, 2001). Consequently, the perceived similarity may lead to positive L2-L3 transfer in addition to the interference between the native and the additional foreign language (Cenoz, 2005).
Overview of research The majority of studies to date on the role of metalinguistic awareness in the development of language proficiency have focused on grammar as well as the knowledge of metalinguistic terminology. Early research conducted by Białystok (1982) and Gass (1988) pointed to higher levels of metalinguistic awareness as concomitant with improvements in L2 proficiency. In a more recent study, Alderson et al. (1997) demonstrated a weak to moderate correlation between metalinguistic knowledge, language aptitude and language proficiency, on the basis of a battery of administered tests. Renou (2001), in turn, used grammaticality judgements as a measure of metalinguistic awareness and found that it was correlated with language proficiency. The status of metalinguistic knowledge in relation to individual learner variables in instructed adult L2 learning was examined by Roehr and Gánem-Gutiérrez (2009). The findings showed that the primary variable responsible for the attained level of metalinguistic knowledge was the length of form-focused language instruction. Instances of correlational research on metalinguistic awareness outnumber the existing qualitative studies in this field. A notheworthy example of the latter is Roehr’s (2006) study exploring metalinguistic knowledge in L2 task performance by means of stimulated recall protocols. The author found a qualitative difference in the implementations of metalinguistic knowledge evidenced by various complexity
Third Language Phonological Acquisition
123
levels and the co-occurrence of metalinguistic knowledge with learners’ success and consistency in task-solving. Phonological awareness has not been explored extensively in the SLA literature. Only a limited number of studies to date have dealt with this issue (Kennedy and Trofimovich, 2010; Venkatagiri and Levis, 2007; Wrembel, 2005, 2011). In an extensive study Venkatagiri and Levis (2007) investigated whether phonological awareness is related to speech comprehensibility. To this end, they administered a battery of tests, including 14 tests of phonological awareness (e.g. phonological blending, segmentation, manipulation, spoonerism, rhyming, alliteration) and 3 tests of phonological short-term memory (i.e. non-word recall). The comprehensibility ratings performed by 12 native-speaker raters pointed to a strong positive correlation between composite phonological awareness scores and rated comprehensibility, as well as to a positive correlation between phonological awareness and phonological short-term memory. The analysis demonstrated that 19% of variance in rated comprehensibility was accounted for by phonological awareness scores. On the whole, learners with superior explicit phonological knowledge were perceived by the raters as more intelligible foreign language speakers, thus lending some further support to Schmidt’s (1990) noticing hypothesis. Consequently, Venkatagiri and Levis (2007, p. 275) suggest that phonological form may be amenable to awareness-building classroom strategies, similar to what is indicated in focus-on-form research on other linguistic aspects such as syntax or lexis (cf. Doughty and Williams, 1998). Kennedy and Trofimovich (2010) conducted a classroom study in which they examined the relationship between the quality of L2 pronunciation and language awareness. L2 learners’ pronunciation was assessed on the basis of ratings of accentedness, comprehensibility and fluency. Language awareness, in turn, was analysed on the basis of dialogue journal entries made by the learners. Benson and Lor’s (1999) analytical framework of dual conceptions of learning according to which language awareness can be quantitative or qualitative was applied. The former type concerns the awareness of how language is acquired, i.e. language learning as assimilating a set of discrete items, whereas the latter refers to the awareness of how language works to convey meaning, i.e. meaningful context in which learning occurs (Kennedy and Trofimovich, 2010, pp. 171–2). The findings point to a relationship between pronunciation ratings and the number of qualitative language awareness comments. The author’s previous long-term empirical study (Wrembel, 2005) was fairly innovative since it explored the impact of analysed phonological knowledge and metalinguistic awareness on L2 pronunciation performance. The findings substantiated the basic underlying assumption, according to which participants who were equipped with declarative knowledge targeted at the conscious monitoring of the acquired phonological system outperformed controls, who relied only on procedural knowledge developed through practical L2 pronunciation training. The results demonstrated that awareness-raising and the acquisition of explicit knowledge contribute to the development of L2 phonological competence and may be considered an important predictor of L2 pronunciation attainment. A follow-up study (Wrembel, 2011) was intended to complement the earlier investigation by tapping into learners’ self-perception of metaphonetic awareness
124
The Metalinguistic Dimension
in spontaneous L2 speech production. It aimed specifically to elicit the participants’ self-reflection on the ways of monitoring their English pronunciation through verbal protocols. The study revealed that the participants demonstrated a fairly high degree of metalinguistic awareness, evidenced by the elicited aspects of formalized phonetic knowledge, instances of self-repair and explicit statements of pronunciation strategies applied in the L2 learning process. The present study is intended to shed new light on the issue of metalinguistic awareness by expanding the investigation beyond the SLA perspective and embracing a more complex context of acquisition of third language phonology.
The present study The study aimed to explore cognitive process and variables associated with metaphonological awareness during L3 oral performance through the application of verbal protocols. It was intended as a process-oriented investigation to supplement quantitative findings with qualitative data in view of the recent plea by Roehr (2006), according to which qualitative analyses of metalinguistic awareness are relatively rare. The specific objectives of the study were to verify whether multilingual speakers of typologically related (L2 English and L3 German) and less related sets of languages (L2 English and L3 French) tend to resort to the L2 rather than L1 Polish transfer in their L3 speech performance, and to explore to what extent they are aware of this phenomenon.
Research design The study adopted and slightly modified the introspective method of think-aloud protocols (TAPs) which consists of verbalized reports aimed at disclosing the participants’ intuition and mental processes when performing a given task. TAPs are widely used in research on human information processing (Ericsson and Simon, 1984), meta-awareness in multilingualism (e.g. Jessner, 2006) and pronunciation monitoring strategies (Osborne, 2003). According to Cohen (1996), verbal reports can reflect the following aspects: – self-report, i.e. learners’ descriptions and general statements about the learning process, – self-observation, i.e. inspections of specific language behaviour both introspective and retrospective, – self-revelation, i.e. disclosure of thought processes, thinking aloud. Verbal reports differ from self-report interviews and questionnaires in that they attempt to elicit introspective data near the moment of the learning event (Jessner, 2006). Taking this advantage into account, think-aloud protocols were adapted as a method of data collection for the purpose of the present study, however, since the verbalization was not concurrent, but followed a slight delay (approximately 1 minute), the resulting verbal reports can be better classified as stimulated recall protocols (Gass and Mackey, 2000).
Third Language Phonological Acquisition
125
Participants and procedure The participants in the study were 60 Polish native speakers (17 male, 43 female) with a mean age of 20. They were all students of the School of English at Adam Mickiewicz University in Poznań, Poland and proficient in L2 English (C1 level according to the classification of the Common European Framework of Reference, CEFR, 2001). There were two groups of participants with respect to the language they acquired as an L3, i.e. either French or German. The L3 French/L3 German groups were further subdivided according to L3 proficiency level into elementary groups 1F and 1G (A2 level, CEFR) and intermediate groups 2F and 2G (B2 level, CEFR); see Table 6.1. The study consisted of three stages (see Table 6.2) performed as individual sessions in a quiet room. The participants were seated in front of a computer screen with an audio recording program on, wearing a headset with a microphone and headphones. The first stage involved reading a text in the participants’ L3 and was recorded by the researcher using CoolEdit96 as 16-bit mono files at 16,000 Hz sampling frequency (Recording 1). The texts were about 100 words long and included a French text, Les Français et l’argent, and a German text, Üben, üben, üben. The selection of the text corresponded to the participants’ language group (L3 French vs L3 German). In the second stage, which focused on analytic listening, Recording 1 was played back through the headphones and the participants were asked to listen attentively and assess their own pronunciation performance in L3. Stage three, involving
Table 6.1 Participants Group
Proficiency level
L3 French L3 German
1F 2F 1G 2G
Elementary (A2) Intermediate (B2) Elementary (A2) Intermediate (B2)
Number of participants 12 18 16 14
Table 6.2 Study design Stage 1
L3 text reading Recording 1
Stage 2
Analytic listening Recording 1 played back Stimulated recall protocols Recording 2
Stage 3
A) retrospective protocol − self-correction − comments on L3 phonetic performance B) introspective protocol − self-reflection on L3 acquisition process
126
The Metalinguistic Dimension
stimulated recall protocols, was further subdivided into (1) retrospective protocols and (2) introspective protocols. In the former part, the participants were asked to provide self-corrections and comments on their phonetic performance in L3 and to try and verbalize all the thoughts and reflections that came to their minds. The latter part was devoted to self-reflection on the process of acquisition of L3 phonology and took the form of a semi-structured interview. The researcher decided to use stimulated recall protocols rather than concurrent ones because the participants were unwilling to provide verbal protocols simultaneously when listening to their own recordings and preferred to perform them after the listening task was completed. Interaction with the researcher was limited to the occasional provision of support questions. The retrospective and introspective protocols at stage three were also audio-recorded (Recording 2) in the same manner as Recording 1. Later, Recording 2 was transcribed and the dataset was coded by the researcher. The data was analysed by means of a content-analytic approach focusing on pre-established central categories (see below for details).
Concept operationalization and coding Metaphonological awareness was operationalized in the study as instances of: 1. noticing, i.e. mentioning L3 phonetic features, commenting on one’s own specific pronunciation problems, 2 understanding, i.e. formulating phonetic rules, conscious analysis of L3 pronunciation performance, 3. metacognition, i.e. self-reflection on L3 pronunciation and the learning process in general. The analysis of the verbal protocols was expected to reflect the following types of metaphonological awareness, as distinguished by Gombert (1992): – epilinguistic awareness, manifested by the observed instances of self-repair of L3 pronunciation; − metalinguistic phonological awareness, reflective phonetic analysis and intentional focus on articulatory gestures during speech performance. A special coding system was devised by the researcher and aimed at the objectivization of the data analysis. The retrospective protocol focused on three issues, namely: 1. self-corrections, i.e. L3 repair: − number of instances (raw frequency), − instances of successful/unsuccessful repairs; 2. noticing one’s own problems with L3 pronunciation: − number of instances (raw frequency), − categorization of problems, e.g. phonetic features, mispronounced words, suprasegmental aspects;
Third Language Phonological Acquisition
127
3. levels of complexity of reported phonological awareness (the typology was adapted from Roehr, 2006, p. 188): − Level A (low) – noticing and attentional focus on relevant auditory forms, i.e. phonological surface patterns, − Level B (medium) – metalinguistic description or explanation, i.e. the participant performs an analysis of the targeted feature, − Level C (high) – metalinguistic description or explanation and metalanguage, i.e. the analysis is articulated employing metalinguistic categories. The introspective protocol involved self-reflection on the following three aspects: 1. Reported influence from other languages, including the source and strength of this influence, e.g. L1, L2, L1/L2 or Ln (other languages); 2. Acknowledged facilitative or intrusive role of metalinguistic awareness; 3. Explanations and metacognitive comments which were coded as instances of self-report (in the case of general statements about the learning process) or self-observation (in the case of inspections of specific language behaviour), using Cohen’s (1996) terminology.
Results and discussion Retrospective verbal protocols Self-correction The first aspect of the retrospective verbal protocols concerned the qualitative and quantitative nature of self-corrections registered when the participants were asked to monitor their recorded L3 reading performance. The number of self-repairs in L3 oral performance totalled 71, including 54 corrections in L3 German (M = 1.7 corrections per participant), and 17 instances of modifications in L3 French (M = 0.5). An independent samples t-test revealed a statistical difference in the instances of self-repair between the two groups (t(58) = 2.76, p < 0.01). However, an analysis of the success rate of self-corrections points to the opposite tendency, since self-repairs by participants in the L3 French group were correct 81% of the time, thus showing greater success than those of the L3 German group, whose success rate reached only 54% (see Table 6.3). Instances of self-corrections were also analysed with respect to the proficiency level in the two L3 groups and with respect to the number of self-corrections and their success rate (see Table 6.4). It was hypothesized that a higher proficiency in L3 would result in more instances of self-correction and a higher success rate, showing increased metalinguistic awareness. While this expectation was substantiated in the L3 French group, with the lower-proficiency subgroup 1F performing significantly fewer successful self-repairs than the higher-proficiency subgroup 2F (t(28) = 2.8, p < 0.01), the L3 German group demonstrated the reverse tendency, with the lower-proficiency subgroup 1G outscoring their higher-proficiency counterparts 2G in terms of the
The Metalinguistic Dimension
128
Table 6.3 Self-correction data
Total L3 German L3 French
Raw frequency
Percentage
Mean
Success rate (%)
71 54 17
100 76 24
1.1 1.7 0.5
60 58 81
Table 6.4 Self-correction according to L3 group and proficiency level Group L3 French L3 German
1F 2F 1G 2G
Raw frequency
Percentage
Mean
Success rate (%)
1 16 39 15
2 22 55 21
0.1 0.9 2.3 1.1
0 81 56 60
number of self-repairs. The success rate was higher for the higher-proficiency subgroup 2G, although this difference was not statistically significant (t(28) = 1.52, p = 0.1). Consequently, no clear relation between the number of self-corrections and L3 proficiency level was found, since opposite trends were observed in the L3 German and L3 French groups. It could be expected that higher proficiency would lead to increased metalinguistic awareness and thus affect the quantity of L3 self-repairs. However, this was not substantiated by the comparative analysis of 1G + 1F vs 2G + 2F, with no statistical significance found between these groups (t(58) = 0.88, p = 0.4). As far as the quality of self-repairs is concerned, the success rate of corrections was higher for the higher-proficiency groups in both L3 German and L3 French, yet in the former group the difference was not significant, as indicated earlier. Moreover, the number of self-repairs was not equally distributed in the two groups (see Table 6.3) since the percentage of total corrections was three times higher in the L3 German group than in the L3 French group. This finding could be interpreted in the light of the typological proximity of the language repertoires of the participants, with a facilitative effect of a typologically more related set of L2 English and L3 German leading to a higher percentage of total self-corrections (76%), compared to a less related set of L2 English and L3 French (24%). One cannot exclude other possible explanations, e.g. the impact of the quality of L3 instruction. However, this variable was beyond the control of the investigator. As far as the qualitative analysis of the instances of L3 self-repair is concerned, Table 6.5 presents the frequency of occurrence of particular corrections divided into categories. As can be seen, segmental corrections were most frequent by far (92%), while modifications of suprasegmental aspects of L3 pronunciation performance accounted for only 8% of self-repairs. The most commonly modified phonetic segments included vowels (34%) and consonants (32%), the quality of the /r/ sound (12%), unclear pronunciation of a word (8%) and word endings (6%), whereas suprasegmental modifications involved word stress (6%) and linking (2%).
Third Language Phonological Acquisition
129
Table 6.5 Categories of self-correction in L3 vowels consonants /r/ unclear pronunciation word stress word endings linking
34% 32% 12% 8% 6% 6% 2%
Noticing of problems The second aspect of metaphonological awareness investigated by means of the retrospective protocols concerned noticing one’s own problems with L3 pronunciation. The raw frequency of self-reported phonetic problems totalled 162, with a mean of 2.6 instances per participant. The reported problems were distributed relatively equally across the L3 language groups and proficiency levels (1G M = 2.5, 2G M = 2.7, 1F M = 2.1, 2F M = 3.1). The differences in means between the two L3 groups (i.e. L3 German vs L3 French) were not significant (t(61) = 0.62, p = 0.54). However, within L3 groups a statistical difference was found between proficiency levels for L3 French (t(28) = 8.9, p < 0.05), with intermediate learners reporting on average more phonetic problems; no statistical difference was found for L3 German (t(28) = 1, p = 0.3). The reported problems in L3 phonetic performance were further analysed with respect to the specific categories they fell into. As shown in Table 6.6, the most frequently noticed phonetic problems concerning segmental features included vowels (21%), the quality of the /r/ sound (16%), consonants (14%) and word endings (11%), whereas suprasegmental aspects involved intonation patterns (7%), word stress (7%), fluency (4%), linking (3%) and rhythm (1%). On the whole, the reported problematic areas in L3 were dominated by segmental features (78%), while suprasegmentals accounted for the remaining 22%. It is interesting to juxtapose the patterns of actual instances of L3 self-repair (see Table 6.5) with the phonetic problems in L3, as reported by the participants (see Table 6.6). The comparison points to a considerable overlap in the distribution of problematic phonetic categories as well as their relative frequency of occurrence.
Levels of awareness The third aspect of metaphonological awareness examined with the aid of the retrospective protocols focused on levels of awareness. To this end, the participants’ retrospections including comments on self-corrections and reported problems with L3 pronunciation were analysed using a modified version of the framework proposed by Roehr (2006, p. 188). In the course of the analysis, the participants’ retrospective comments were assigned to the specified complexity categories (see above). The findings demonstrated
130
The Metalinguistic Dimension Table 6.6 Reported phonetic problems in L3 vowels /r/ consonants word endings intonation word stress spelling/sound correspondence fluency