slavistik-portal
Портал славістики
The "Bibliography of the Czech Linguistics (BibCzechLing)" is provided by the Institute of the Czech Language of the Academy of Sciences of the Czech Republic (Ústav pro jazyk český AV). The database contains about 73.280 records and covers the period from 1992 till 2018. The list of subjects is located here.
Your search for tokenizace provides 4 hits | |
1 | UDPipe: Trainable Pipeline for Processing CoNLL-U Files Performing Tokenization, Morphological Analysis, POS Tagging and ParsingStraka, Milan; Hajič, Jan; Straková, Jana, in: Proceedings of the 10th International Conference on Language Resources and Evaluation : LREC 2016, May 23-28, Portorož, Slovenia, Paris, European Language Resources Association 2016, s. 4290-4297 |
2 | TrTok: a fast and trainable tokenizer for natural languagesMaršík, Jiří; Bojar, Ondřej, in: The Prague Bulletin of Mathematical Linguistics, č. 98, 2012, s. 75-85 |
3 | Morfologické značkování a lemmatizace v korpusech ČNKJelínek, Tomáš, in: Grammar & Corpora, Praha, Academia ; 2008, s. 169-179. |
4 | Český národní korpus - počítačové demonstraceKřen, Michal, in: Slovenčina a čeština v počítačovom spracovaní [SlČPoč] : Zborník referátov zo seminára Bratislava 26.-27. októbra 2001, Bratislava, Veda ; 2001, s. 136-141. |