BibCzechLing: 4 hits for tokenizace

The "Bibliography of the Czech Linguistics (BibCzechLing)" is provided by the Institute of the Czech Language of the Academy of Sciences of the Czech Republic (Ústav pro jazyk český AV). The database contains about 73.280 records and covers the period from 1992 till 2018. The list of subjects is located here.

Your search for tokenizace provides 4 hits
1	UDPipe: Trainable Pipeline for Processing CoNLL-U Files Performing Tokenization, Morphological Analysis, POS Tagging and Parsing Straka, Milan; Hajič, Jan; Straková, Jana, in: Proceedings of the 10th International Conference on Language Resources and Evaluation : LREC 2016, May 23-28, Portorož, Slovenia, Paris, European Language Resources Association 2016, s. 4290-4297
2	TrTok: a fast and trainable tokenizer for natural languages Maršík, Jiří; Bojar, Ondřej, in: The Prague Bulletin of Mathematical Linguistics, č. 98, 2012, s. 75-85
3	Morfologické značkování a lemmatizace v korpusech ČNK Jelínek, Tomáš, in: Grammar & Corpora, Praha, Academia ; 2008, s. 169-179.
4	Český národní korpus - počítačové demonstrace Křen, Michal, in: Slovenčina a čeština v počítačovom spracovaní [SlČPoč] : Zborník referátov zo seminára Bratislava 26.-27. októbra 2001, Bratislava, Veda ; 2001, s. 136-141.

Bibliography of the Czech Linguistics (BibCzechLing)

UDPipe: Trainable Pipeline for Processing CoNLL-U Files Performing Tokenization, Morphological Analysis, POS Tagging and Parsing

TrTok: a fast and trainable tokenizer for natural languages

Morfologické značkování a lemmatizace v korpusech ČNK

Český národní korpus - počítačové demonstrace