Портал славістики


[root][biblio]

Bibliography of the Czech Linguistics (BibCzechLing)

The "Bibliography of the Czech Linguistics (BibCzechLing)" is provided by the Institute of the Czech Language of the Academy of Sciences of the Czech Republic (Ústav pro jazyk český AV). The database contains about 73.280 records and covers the period from 1992 till 2018. The list of subjects is located here.

?
Your search for tokenizace provides 4 hits
1

UDPipe: Trainable Pipeline for Processing CoNLL-U Files Performing Tokenization, Morphological Analysis, POS Tagging and Parsing

Straka, Milan; Hajič, Jan; Straková, Jana, in: Proceedings of the 10th International Conference on Language Resources and Evaluation : LREC 2016, May 23-28, Portorož, Slovenia, Paris, European Language Resources Association 2016, s. 4290-4297
2

TrTok: a fast and trainable tokenizer for natural languages

Maršík, Jiří; Bojar, Ondřej, in: The Prague Bulletin of Mathematical Linguistics, č. 98, 2012, s. 75-85
3

Morfologické značkování a lemmatizace v korpusech ČNK

Jelínek, Tomáš, in: Grammar & Corpora, Praha, Academia ; 2008, s. 169-179.
4

Český národní korpus - počítačové demonstrace

Křen, Michal, in: Slovenčina a čeština v počítačovom spracovaní [SlČPoč] : Zborník referátov zo seminára Bratislava 26.-27. októbra 2001, Bratislava, Veda ; 2001, s. 136-141.