Портал славістики


[root][biblio]

Bibliography of the Czech Linguistics (BibCzechLing)

The "Bibliography of the Czech Linguistics (BibCzechLing)" is provided by the Institute of the Czech Language of the Academy of Sciences of the Czech Republic (Ústav pro jazyk český AV). The database contains about 73.280 records and covers the period from 1992 till 2018. The list of subjects is located here.

ID2012CZ067486
Author(s)Maršík, Jiří; Bojar, Ondřej
Title

TrTok: a fast and trainable tokenizer for natural languages

PublishedThe Prague Bulletin of Mathematical Linguistics, č. 98, 2012, s. 75-85
Languageeng
Classification (CZ)Strojová a aplikovaná lingvistika
Classification (EN)Machine and applied linguistics
Subjectslingvistika komputační; překlady strojové; segmentace; tokenizace; jazyky přirozené
Subjects (DE)Computerlinguistik
NotePrezentace zařízení pro segmentaci a tokenizaci textu, materiál z angličtiny a čínštiny
Mediumarticle
URLufal.mff.cuni.cz (homepage)
Holdings (in Germany)see in ZDB-Katalog
Sourcehttps://bibliografie.ujc.cas.cz/documents/67486
PURLCitation link

More like this:

On the non-ideal character of natural languages / Daneš, František
Fast syntactic searching in very large corpora for many languages / Jakubíček, Miloš
Application-oriented approach to semantics of natural languages / Tseytin, G.
Remarks on automatic models of natural languages comprehension / Sgall, Petr
On some mechanisms of representation of meaning in natural languages / Tseytin, Gregory S.
A language to describe the morphology of natural and artificial languages / Kirsanov, Nikolai O.
Fast morphological analysis of Czech / Šmerk, Pavel