Saved in:
Bibliographic Details
Main Authors: Wu, Shaoqun, Franken, Margaret, Witten, Ian H.
Format: Recurso educativo Open Access
Language:en
Published: 2009
Subjects:
Online Access:https://eric.ed.gov/?id=EJ864951
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1867181726813388800
author Wu, Shaoqun
Franken, Margaret
Witten, Ian H.
author_facet Wu, Shaoqun
Franken, Margaret
Witten, Ian H.
Wu, Shaoqun
Franken, Margaret
Witten, Ian H.
collection Education Resources Information Center
contents Refining the Use of the Web (and Web Search) as a Language Teaching and Learning Resource Wu, Shaoqun Franken, Margaret Witten, Ian H. Computational Linguistics Word Lists Electronic Libraries Internet Educational Resources Second Language Learning Second Language Instruction Syntax The web is a potentially useful corpus for language study because it provides examples of language that are contextualized and authentic, and is large and easily searchable. However, web contents are heterogeneous in the extreme, uncontrolled and hence "dirty," and exhibit features different from the written and spoken texts in other linguistic corpora. This article explores the use of the web and web search as a resource for language teaching and learning. We describe how a particular derived corpus containing a trillion word tokens in the form of n-grams has been filtered by word lists and syntactic constraints and used to create three digital library collections, linked with other corpora and the live web, that exploit the affordances of web text and mitigate some of its constraints. (Contains 5 tables, 6 figures and 2 notes.)
format Recurso educativo Open Access
id eric_EJ864951
institution ERIC Institute of Education Sciences
language en
publishDate 2009
record_format eric
spellingShingle Refining the Use of the Web (and Web Search) as a Language Teaching and Learning Resource
Wu, Shaoqun
Franken, Margaret
Witten, Ian H.
Computational Linguistics
Word Lists
Electronic Libraries
Internet
Educational Resources
Second Language Learning
Second Language Instruction
Syntax
Refining the Use of the Web (and Web Search) as a Language Teaching and Learning Resource Wu, Shaoqun Franken, Margaret Witten, Ian H. Computational Linguistics Word Lists Electronic Libraries Internet Educational Resources Second Language Learning Second Language Instruction Syntax The web is a potentially useful corpus for language study because it provides examples of language that are contextualized and authentic, and is large and easily searchable. However, web contents are heterogeneous in the extreme, uncontrolled and hence "dirty," and exhibit features different from the written and spoken texts in other linguistic corpora. This article explores the use of the web and web search as a resource for language teaching and learning. We describe how a particular derived corpus containing a trillion word tokens in the form of n-grams has been filtered by word lists and syntactic constraints and used to create three digital library collections, linked with other corpora and the live web, that exploit the affordances of web text and mitigate some of its constraints. (Contains 5 tables, 6 figures and 2 notes.)
title Refining the Use of the Web (and Web Search) as a Language Teaching and Learning Resource
topic Computational Linguistics
Word Lists
Electronic Libraries
Internet
Educational Resources
Second Language Learning
Second Language Instruction
Syntax
url https://eric.ed.gov/?id=EJ864951