Software:SkELL

SkELL: Sketch Engine for Language Learning
Original author(s)	Vít Baisa, Vít Suchomel
Developer(s)	Lexical Computing Ltd.
Initial release	November 2014
Written in	jQuery, JavaScript
Available in	English, Russian, Czech
Type	Language learning
License	freeware
Website	skell.sketchengine.co.uk

SkELL (Sketch Engine for Language Learning)^[1] is a web interface for language learning, intended mainly to help students and teachers of languages. SkELL has its own corpus,^[2] which was compiled to contain texts covering everyday, standard, formal, and professional language.^[1] The corpus contains more than 60 million sentences and more than one billion words.^[3]

The SkELL interface provides three functions. The first is a simple search that shows words in context as concordance lines. At most 40 lines are displayed, but the overall frequency of the query is shown below the search box, expressed as the number of hits per million. The second function, word sketch, shows collocates for a given word or words. The third, similar words, visualises words similar to the searched word in a word cloud.
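The capped concordance and the relative frequency figure can be illustrated with a minimal sketch. This is not SkELL's own code: the function names, the sample sentences, and the counts are hypothetical, and the search here matches exact word forms only, whereas SkELL also covers derived forms.

```python
import re

MAX_LINES = 40  # display cap mentioned in the text


def concordance(sentences, word, limit=MAX_LINES):
    """Return up to `limit` sentences containing `word` as a whole token."""
    target = word.lower()
    hits = [s for s in sentences if target in re.findall(r"\w+", s.lower())]
    return hits[:limit]


def hits_per_million(hits, corpus_tokens):
    """Relative frequency: occurrences per one million corpus tokens."""
    if corpus_tokens <= 0:
        raise ValueError("corpus_tokens must be positive")
    return hits * 1_000_000 / corpus_tokens


# Hypothetical data: three sentences, two of which contain "corpus".
sample = ["A corpus is a text collection.", "Corpus size matters.", "No match here."]
print(concordance(sample, "corpus"))
# Hypothetical counts: 25,000 hits in a corpus of one billion tokens.
print(hits_per_million(25_000, 1_000_000_000))  # 25.0
```

Reporting hits per million rather than a raw count keeps the figure comparable across corpora of different sizes.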

The tool is also available for Russian (since 2015)^[4] and Czech (since 2017).^[5]

Features

SkELL offers three types of search.

Examples – searches for words and phrases, including all their derived forms
Word sketch – a simplified version of the original word sketch page
Similar words – based on the distributional thesaurus in Sketch Engine; the results are not necessarily synonyms

Data

The corpus for English SkELL consists of English Wikipedia (a specially selected set of 130,000 articles), the English collection of Project Gutenberg, a subset of the web corpus enTenTen14,^[6] the whole British National Corpus, and free news sources.^[1]

Processing the data

After gathering and pre-cleaning (all structures except sentences were removed), the data was run through a processing pipeline: normalization, tokenization, tagging with TreeTagger for English, and deduplication. The corpus was then compiled using the manatee indexing library. Finally, all sentences were scored with the GDEX^[7] tool.^[1]
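The order of the steps above can be sketched in a few lines. This is a simplified illustration under stated assumptions, not the authors' pipeline: tagging with TreeTagger and compilation with manatee are omitted, and `toy_score` is a hypothetical length-based stand-in for GDEX, whose real scoring is not reproduced here.

```python
import re
import unicodedata


def normalize(sentence):
    """Unify Unicode forms and collapse runs of whitespace."""
    text = unicodedata.normalize("NFKC", sentence)
    return re.sub(r"\s+", " ", text).strip()


def tokenize(sentence):
    """Split into word tokens and single punctuation marks."""
    return re.findall(r"\w+|[^\w\s]", sentence)


def deduplicate(sentences):
    """Keep the first occurrence of each sentence, ignoring case."""
    seen = set()
    unique = []
    for sentence in sentences:
        key = sentence.casefold()
        if key not in seen:
            seen.add(key)
            unique.append(sentence)
    return unique


def toy_score(tokens, ideal=(10, 25)):
    """Hypothetical stand-in for GDEX: prefer sentences of moderate length."""
    low, high = ideal
    n = len(tokens)
    if low <= n <= high:
        return 1.0
    distance = low - n if n < low else n - high
    return max(0.0, 1.0 - distance / high)


def process(raw_sentences):
    """Normalize, deduplicate, and rank sentences by the toy score."""
    cleaned = deduplicate([normalize(s) for s in raw_sentences if s.strip()])
    scored = [(toy_score(tokenize(s)), s) for s in cleaned]
    return sorted(scored, key=lambda pair: -pair[0])
```

Deduplicating after normalization means that sentences differing only in whitespace or letter case collapse into one entry before scoring.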

References

[1] Baisa, Vít; Suchomel, Vít (2014). "SkELL: Web Interface for English Language Learning". Eighth Workshop on Recent Advances in Slavonic Natural Language Processing (NLP Consulting): 63–70. https://nlp.fi.muni.cz/raslan/raslan14.pdf#page=71.
[2] Thomas, James (14 June 2015). "Discovering English with SketchEngine – James Thomas interview". https://eflnotes.wordpress.com/tag/james-thomas/.
[3] "SkELL". Lexical Computing Ltd. https://www.sketchengine.co.uk/skell/.
[4] Valentina, A., Vitalevna, B. O., Малолетняя, А. П., Olga, K., & Vit, B. (2016). RuSkELL: Online Language Learning Tool for Russian Language. In Proceedings of the XVII EURALEX International Congress. Lexicography and Linguistic Diversity (6–10 September, 2016) (pp. 292–300). Ivane Javakhishvili Tbilisi State University.
[5] Cukr, Michal (2017). Český korpus příkladových vět (Czech corpus of example sentences) (Master's thesis) (in Czech). Brno: Masaryk University, Faculty of Arts. Retrieved 2017-06-22.
[6] Jakubíček, Miloš; Kilgarriff, Adam; Kovář, Vojtěch; Rychlý, Pavel; Suchomel, Vít (July 2013). "The tenten corpus family". Seventh International Corpus Linguistics Conference CL (Lancaster University): 125–127.
[7] Kilgarriff, A.; Husák, M.; McAdam, K.; Rundell, M.; Rychlý, P. (July 2008). "GDEX: Automatically finding good dictionary examples in a corpus". Proceedings of the XIII EURALEX International Congress (EURALEX): 425–432. http://dialnet.unirioja.es/servlet/articulo?codigo=5040252.

Original source: https://en.wikipedia.org/wiki/SkELL.