Results
Found 75 publication records; all 75 are shown.
Hits |
Authors |
Title |
Venue |
Year |
Link |
Author keywords |
113 | Stefan Klatt, Bernd Bohnet |
You Don't Have to Think Twice if You Carefully Tokenize. ![Search on Bibsonomy](Pics/bibsonomy.png) |
IJCNLP ![In: Natural Language Processing - IJCNLP 2004, First International Joint Conference, Hainan Island, China, March 22-24, 2004, Revised Selected Papers, pp. 299-309, 2004, Springer, 3-540-24475-1. The full citation details ...](Pics/full.jpeg) |
2004 |
DBLP DOI BibTeX RDF |
|
93 | Robert Bernecky |
An SPMD/SIMD parallel tokenizer for APL. ![Search on Bibsonomy](Pics/bibsonomy.png) |
APL ![In: Proceedings of the 2003 Conference on APL: Stretching the Mind, APL 2003, San Diego, California, USA, June 11-14, 2003, pp. 21-32, 2003, ACM, 1-58113-668-4. The full citation details ...](Pics/full.jpeg) |
2003 |
DBLP DOI BibTeX RDF |
|
55 | Bin Ma 0001, Haizhou Li 0001 |
A phonotactic-semantic paradigm for automatic spoken document classification. ![Search on Bibsonomy](Pics/bibsonomy.png) |
SIGIR ![In: SIGIR 2005: Proceedings of the 28th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, Salvador, Brazil, August 15-19, 2005, pp. 369-376, 2005, ACM, 1-59593-034-5. The full citation details ...](Pics/full.jpeg) |
2005 |
DBLP DOI BibTeX RDF |
acoustic words, phonotactic-semantic, semantic domain, spoken document classification, voice tokenizer, n-gram |
50 | Run Shao, Zhaoyang Zhang, Chao Tao, Yunsheng Zhang, Chengli Peng, Haifeng Li 0007 |
Homogeneous Tokenizer Matters: Homogeneous Visual Tokenizer for Remote Sensing Image Understanding. ![Search on Bibsonomy](Pics/bibsonomy.png) |
CoRR ![In: CoRR abs/2403.18593, 2024. The full citation details ...](Pics/full.jpeg) |
2024 |
DBLP DOI BibTeX RDF |
|
47 | Cody Boisclair |
Developing a tokenizer and morphological parser for English text in C#. ![Search on Bibsonomy](Pics/bibsonomy.png) |
ACM Southeast Regional Conference ![In: Proceedings of the 46th Annual Southeast Regional Conference, 2008, Auburn, Alabama, USA, March 28-29, 2008, pp. 288-293, 2008, ACM, 978-1-60558-105-7. The full citation details ...](Pics/full.jpeg) |
2008 |
DBLP DOI BibTeX RDF |
|
33 | Amir Shahab Shahabi, Mohammad Reza Kangavari |
A Fuzzy Approach for Persian Text Segmentation Based on Semantic Similarity of Sentences. ![Search on Bibsonomy](Pics/bibsonomy.png) |
Intelligent Information Processing ![In: Intelligent Information Processing III, IFIP TC12 International Conference on Intelligent Information Processing (IIP 2006), September 20-23, Adelaide, Australia, pp. 411-420, 2006, Springer, 978-0-387-44639-4. The full citation details ...](Pics/full.jpeg) |
2006 |
DBLP DOI BibTeX RDF |
Fuzzy Similarity Relation, Fuzzy Proximity Relation, Lemma, Fuzzy Relations Composition, Anti-Redundancy, Syntax Parser, Meta Variable, Meta Rule, Paradigmatic, Tokenizer, Multi-Document Summarizer, Lemmatizer |
25 | Nicolas Boizard, Kevin El Haddad, Céline Hudelot, Pierre Colombo |
Towards Cross-Tokenizer Distillation: the Universal Logit Distillation Loss for LLMs. ![Search on Bibsonomy](Pics/bibsonomy.png) |
CoRR ![In: CoRR abs/2402.12030, 2024. The full citation details ...](Pics/full.jpeg) |
2024 |
DBLP DOI BibTeX RDF |
|
25 | Omri Uzan, Craig W. Schmidt, Chris Tanner, Yuval Pinter |
Greed is All You Need: An Evaluation of Tokenizer Inference Methods. ![Search on Bibsonomy](Pics/bibsonomy.png) |
CoRR ![In: CoRR abs/2403.01289, 2024. The full citation details ...](Pics/full.jpeg) |
2024 |
DBLP DOI BibTeX RDF |
|
25 | Gautier Dagan, Gabriel Synnaeve, Baptiste Rozière |
Getting the most out of your tokenizer for pre-training and domain adaptation. ![Search on Bibsonomy](Pics/bibsonomy.png) |
CoRR ![In: CoRR abs/2402.01035, 2024. The full citation details ...](Pics/full.jpeg) |
2024 |
DBLP DOI BibTeX RDF |
|
25 | Jacob Zhiyuan Fang, Skyler Zheng, Vasu Sharma, Robinson Piramuthu |
ε-ViLM : Efficient Video-Language Model via Masked Video Modeling with Semantic Vector-Quantized Tokenizer. ![Search on Bibsonomy](Pics/bibsonomy.png) |
WACV (Workshops) ![In: IEEE/CVF Winter Conference on Applications of Computer Vision Workshops, WACVW 2024 - Workshops, Waikoloa, HI, USA, January 1-6, 2024, pp. 529-540, 2024, IEEE, 979-8-3503-7028-7. The full citation details ...](Pics/full.jpeg) |
2024 |
DBLP DOI BibTeX RDF |
|
25 | Goodwill Erasmo Ndomba, Young-Seob Jeong |
Effects of Swahili Monolingual Tokenizer on Downstream Tasks. ![Search on Bibsonomy](Pics/bibsonomy.png) |
BigComp ![In: IEEE International Conference on Big Data and Smart Computing, BigComp 2024, Bangkok, Thailand, February 18-21, 2024, pp. 357-358, 2024, IEEE, 979-8-3503-7002-7. The full citation details ...](Pics/full.jpeg) |
2024 |
DBLP DOI BibTeX RDF |
|
25 | Sanghyun Choo, Wonjoon Kim |
A study on the evaluation of tokenizer performance in natural language processing. ![Search on Bibsonomy](Pics/bibsonomy.png) |
Appl. Artif. Intell. ![In: Appl. Artif. Intell. 37(1), December 2023. The full citation details ...](Pics/full.jpeg) |
2023 |
DBLP DOI BibTeX RDF |
|
25 | Jungeun Kim, Ha Young Kim |
CSLT-AK: Convolutional-embedded transformer with an action tokenizer and keypoint emphasizer for sign language translation. ![Search on Bibsonomy](Pics/bibsonomy.png) |
Pattern Recognit. Lett. ![In: Pattern Recognit. Lett. 173, pp. 115-122, September 2023. The full citation details ...](Pics/full.jpeg) |
2023 |
DBLP DOI BibTeX RDF |
|
25 | Zhiwei Deng, Ting Chen, Yang Li |
Perceptual Group Tokenizer: Building Perception with Iterative Grouping. ![Search on Bibsonomy](Pics/bibsonomy.png) |
CoRR ![In: CoRR abs/2311.18296, 2023. The full citation details ...](Pics/full.jpeg) |
2023 |
DBLP DOI BibTeX RDF |
|
25 | Sandeep Mehta, Darpan Shah, Ravindra Kulkarni, Cornelia Caragea |
Semantic Tokenizer for Enhanced Natural Language Processing. ![Search on Bibsonomy](Pics/bibsonomy.png) |
CoRR ![In: CoRR abs/2304.12404, 2023. The full citation details ...](Pics/full.jpeg) |
2023 |
DBLP DOI BibTeX RDF |
|
25 | Zipeng Xu, Enver Sangineto, Nicu Sebe |
StylerDALLE: Language-Guided Style Transfer Using a Vector-Quantized Tokenizer of a Large-Scale Generative Model. ![Search on Bibsonomy](Pics/bibsonomy.png) |
CoRR ![In: CoRR abs/2303.09268, 2023. The full citation details ...](Pics/full.jpeg) |
2023 |
DBLP DOI BibTeX RDF |
|
25 | Lijun Yu, José Lezama, Nitesh Bharadwaj Gundavarapu, Luca Versari, Kihyuk Sohn, David Minnen, Yong Cheng, Agrim Gupta, Xiuye Gu, Alexander G. Hauptmann, Boqing Gong, Ming-Hsuan Yang 0001, Irfan Essa, David A. Ross, Lu Jiang 0004 |
Language Model Beats Diffusion - Tokenizer is Key to Visual Generation. ![Search on Bibsonomy](Pics/bibsonomy.png) |
CoRR ![In: CoRR abs/2310.05737, 2023. The full citation details ...](Pics/full.jpeg) |
2023 |
DBLP DOI BibTeX RDF |
|
25 | Jacob Zhiyuan Fang, Skyler Zheng, Vasu Sharma, Robinson Piramuthu |
E-ViLM: Efficient Video-Language Model via Masked Video Modeling with Semantic Vector-Quantized Tokenizer. ![Search on Bibsonomy](Pics/bibsonomy.png) |
CoRR ![In: CoRR abs/2311.17267, 2023. The full citation details ...](Pics/full.jpeg) |
2023 |
DBLP DOI BibTeX RDF |
|
25 | Yuying Ge, Sijie Zhao, Ziyun Zeng, Yixiao Ge, Chen Li, Xintao Wang, Ying Shan |
Making LLaMA SEE and Draw with SEED Tokenizer. ![Search on Bibsonomy](Pics/bibsonomy.png) |
CoRR ![In: CoRR abs/2310.01218, 2023. The full citation details ...](Pics/full.jpeg) |
2023 |
DBLP DOI BibTeX RDF |
|
25 | Zhiyuan Liu, Yaorui Shi, An Zhang 0003, Enzhi Zhang, Kenji Kawaguchi, Xiang Wang 0010, Tat-Seng Chua |
Rethinking Tokenizer and Decoder in Masked Graph Modeling for Molecules. ![Search on Bibsonomy](Pics/bibsonomy.png) |
CoRR ![In: CoRR abs/2310.14753, 2023. The full citation details ...](Pics/full.jpeg) |
2023 |
DBLP DOI BibTeX RDF |
|
25 | Felix Stollenwerk |
Training and Evaluation of a Multilingual Tokenizer for GPT-SW3. ![Search on Bibsonomy](Pics/bibsonomy.png) |
CoRR ![In: CoRR abs/2304.14780, 2023. The full citation details ...](Pics/full.jpeg) |
2023 |
DBLP DOI BibTeX RDF |
|
25 | Xin Zhang, Dong Zhang, Shimin Li, Yaqian Zhou, Xipeng Qiu |
SpeechTokenizer: Unified Speech Tokenizer for Speech Large Language Models. ![Search on Bibsonomy](Pics/bibsonomy.png) |
CoRR ![In: CoRR abs/2308.16692, 2023. The full citation details ...](Pics/full.jpeg) |
2023 |
DBLP DOI BibTeX RDF |
|
25 | Miao Fan, Chen Hu, Shuchang Zhou 0001 |
Proximal Policy Optimization Actual Combat: Manipulating Output Tokenizer Length. ![Search on Bibsonomy](Pics/bibsonomy.png) |
CoRR ![In: CoRR abs/2308.05585, 2023. The full citation details ...](Pics/full.jpeg) |
2023 |
DBLP DOI BibTeX RDF |
|
25 | Mohamed Afham, Satya Narayan Shukla, Omid Poursaeed, Pengchuan Zhang, Ashish Shah, Sernam Lim |
Revisiting Kernel Temporal Segmentation as an Adaptive Tokenizer for Long-form Video Understanding. ![Search on Bibsonomy](Pics/bibsonomy.png) |
CoRR ![In: CoRR abs/2309.11569, 2023. The full citation details ...](Pics/full.jpeg) |
2023 |
DBLP DOI BibTeX RDF |
|
25 | Christopher Meaney, Therese A. Stukel, Peter C. Austin, Michael D. Escobar |
Comparing Variation in Tokenizer Outputs Using a Series of Problematic and Challenging Biomedical Sentences. ![Search on Bibsonomy](Pics/bibsonomy.png) |
CoRR ![In: CoRR abs/2305.08787, 2023. The full citation details ...](Pics/full.jpeg) |
2023 |
DBLP DOI BibTeX RDF |
|
25 | Mehdi Ali, Michael Fromm 0001, Klaudia Thellmann, Richard Rutmann, Max Lübbering, Johannes Leveling, Katrin Klug, Jan Ebert, Niclas Doll, Jasper Schulze Buschhoff, Charvi Jain, Alexander Arno Weber, Lena Jurkschat, Hammam Abdelwahab, Chelsea John, Pedro Ortiz Suarez, Malte Ostendorff, Samuel Weinbach, Rafet Sifa, Stefan Kesselheim, Nicolas Flores-Herr |
Tokenizer Choice For LLM Training: Negligible or Crucial? ![Search on Bibsonomy](Pics/bibsonomy.png) |
CoRR ![In: CoRR abs/2310.08754, 2023. The full citation details ...](Pics/full.jpeg) |
2023 |
DBLP DOI BibTeX RDF |
|
25 | Tatsuya Hiraoka, Tomoya Iwakura |
Downstream Task-Oriented Neural Tokenizer Optimization with Vocabulary Restriction as Post Processing. ![Search on Bibsonomy](Pics/bibsonomy.png) |
CoRR ![In: CoRR abs/2304.10808, 2023. The full citation details ...](Pics/full.jpeg) |
2023 |
DBLP DOI BibTeX RDF |
|
25 | Wenhao Li, Mengyuan Liu, Hong Liu 0009, Pichao Wang, Jialun Cai, Nicu Sebe |
Hourglass Tokenizer for Efficient Transformer-Based 3D Human Pose Estimation. ![Search on Bibsonomy](Pics/bibsonomy.png) |
CoRR ![In: CoRR abs/2311.12028, 2023. The full citation details ...](Pics/full.jpeg) |
2023 |
DBLP DOI BibTeX RDF |
|
25 | Zipeng Xu, Enver Sangineto, Nicu Sebe |
StylerDALLE: Language-Guided Style Transfer Using a Vector-Quantized Tokenizer of a Large-Scale Generative Model. ![Search on Bibsonomy](Pics/bibsonomy.png) |
ICCV ![In: IEEE/CVF International Conference on Computer Vision, ICCV 2023, Paris, France, October 1-6, 2023, pp. 7567-7577, 2023, IEEE, 979-8-3503-0718-4. The full citation details ...](Pics/full.jpeg) |
2023 |
DBLP DOI BibTeX RDF |
|
25 | Jimin Sun, Patrick Fernandes, Xinyi Wang, Graham Neubig |
A Multi-dimensional Evaluation of Tokenizer-free Multilingual Pretrained Models. ![Search on Bibsonomy](Pics/bibsonomy.png) |
EACL (Findings) ![In: Findings of the Association for Computational Linguistics: EACL 2023, Dubrovnik, Croatia, May 2-6, 2023, pp. 1680-1690, 2023, Association for Computational Linguistics, 978-1-959429-47-0. The full citation details ...](Pics/full.jpeg) |
2023 |
DBLP DOI BibTeX RDF |
|
25 | Mohamed Afham, Satya Narayan Shukla, Omid Poursaeed, Pengchuan Zhang, Ashish Shah, Sernam Lim |
Revisiting Kernel Temporal Segmentation as an Adaptive Tokenizer for Long-form Video Understanding. ![Search on Bibsonomy](Pics/bibsonomy.png) |
ICCV (Workshops) ![In: IEEE/CVF International Conference on Computer Vision, ICCV 2023 - Workshops, Paris, France, October 2-6, 2023, pp. 1181-1186, 2023, IEEE, 979-8-3503-0744-3. The full citation details ...](Pics/full.jpeg) |
2023 |
DBLP DOI BibTeX RDF |
|
25 | Tuan Aqeel Bohoran, Polydoros N. Kampaktsis, Laura McLaughlin, Jay Leb, Serafeim P. Moustakidis, Gerry P. McCann, Archontis Giannakidis |
Right Ventricular Volume Prediction by Feature Tokenizer Transformer-Based Regression of 2D Echocardiography Small-Scale Tabular Data. ![Search on Bibsonomy](Pics/bibsonomy.png) |
FIMH ![In: Functional Imaging and Modeling of the Heart - 12th International Conference, FIMH 2023, Lyon, France, June 19-22, 2023, Proceedings, pp. 292-300, 2023, Springer, 978-3-031-35301-7. The full citation details ...](Pics/full.jpeg) |
2023 |
DBLP DOI BibTeX RDF |
|
25 | Zhiyuan Liu, Yaorui Shi, An Zhang 0003, Enzhi Zhang, Kenji Kawaguchi, Xiang Wang, Tat-Seng Chua |
Rethinking Tokenizer and Decoder in Masked Graph Modeling for Molecules. ![Search on Bibsonomy](Pics/bibsonomy.png) |
NeurIPS ![In: Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, NeurIPS 2023, New Orleans, LA, USA, December 10 - 16, 2023., 2023. The full citation details ...](Pics/full.jpeg) |
2023 |
DBLP BibTeX RDF |
|
25 | Adhiraj Banerjee, Vipul Arora 0001 |
wav2tok: Deep Sequence Tokenizer for Audio Retrieval. ![Search on Bibsonomy](Pics/bibsonomy.png) |
ICLR ![In: The Eleventh International Conference on Learning Representations, ICLR 2023, Kigali, Rwanda, May 1-5, 2023, 2023, OpenReview.net. The full citation details ...](Pics/full.jpeg) |
2023 |
DBLP BibTeX RDF |
|
25 | Rinka Kiriyama, Akio Sashima, Ikuko Shimizu |
Robust Tokenizer for Vision Transformer. ![Search on Bibsonomy](Pics/bibsonomy.png) |
GCCE ![In: 12th IEEE Global Conference on Consumer Electronics, GCCE 2023, Nara, Japan, October 10-13, 2023, pp. 34-38, 2023, IEEE, 979-8-3503-4018-1. The full citation details ...](Pics/full.jpeg) |
2023 |
DBLP DOI BibTeX RDF |
|
25 | Eugene Bagdasaryan, Congzheng Song, Rogier C. van Dalen, Matt Seigel, Áine Cahill |
Training a Tokenizer for Free with Private Federated Learning. ![Search on Bibsonomy](Pics/bibsonomy.png) |
CoRR ![In: CoRR abs/2203.09943, 2022. The full citation details ...](Pics/full.jpeg) |
2022 |
DBLP DOI BibTeX RDF |
|
25 | Md Mofijul Islam, Gustavo Aguilar, Pragaash Ponnusamy, Clint Solomon Mathialagan, Chengyuan Ma, Chenlei Guo |
A Vocabulary-Free Multilingual Neural Tokenizer for End-to-End Task Learning. ![Search on Bibsonomy](Pics/bibsonomy.png) |
CoRR ![In: CoRR abs/2204.10815, 2022. The full citation details ...](Pics/full.jpeg) |
2022 |
DBLP DOI BibTeX RDF |
|
25 | Jivnesh Sandhan, Rathin Singha, Narein Rao, Suvendu Samanta, Laxmidhar Behera, Pawan Goyal 0002 |
TransLIST: A Transformer-Based Linguistically Informed Sanskrit Tokenizer. ![Search on Bibsonomy](Pics/bibsonomy.png) |
CoRR ![In: CoRR abs/2210.11753, 2022. The full citation details ...](Pics/full.jpeg) |
2022 |
DBLP DOI BibTeX RDF |
|
25 | Jimin Sun, Patrick Fernandes, Xinyi Wang, Graham Neubig |
A Multi-dimensional Evaluation of Tokenizer-free Multilingual Pretrained Models. ![Search on Bibsonomy](Pics/bibsonomy.png) |
CoRR ![In: CoRR abs/2210.07111, 2022. The full citation details ...](Pics/full.jpeg) |
2022 |
DBLP DOI BibTeX RDF |
|
25 | Shiyue Zhang, Vishrav Chaudhary, Naman Goyal, James Cross, Guillaume Wenzek, Mohit Bansal, Francisco Guzmán |
How Robust is Neural Machine Translation to Language Imbalance in Multilingual Tokenizer Training? ![Search on Bibsonomy](Pics/bibsonomy.png) |
CoRR ![In: CoRR abs/2204.14268, 2022. The full citation details ...](Pics/full.jpeg) |
2022 |
DBLP DOI BibTeX RDF |
|
25 | Jivnesh Sandhan, Rathin Singha, Narein Rao, Suvendu Samanta, Laxmidhar Behera, Pawan Goyal 0002 |
TransLIST: A Transformer-Based Linguistically Informed Sanskrit Tokenizer. ![Search on Bibsonomy](Pics/bibsonomy.png) |
EMNLP (Findings) ![In: Findings of the Association for Computational Linguistics: EMNLP 2022, Abu Dhabi, United Arab Emirates, December 7-11, 2022., pp. 6902-6912, 2022, Association for Computational Linguistics. The full citation details ...](Pics/full.jpeg) |
2022 |
DBLP DOI BibTeX RDF |
|
25 | Md Mofijul Islam, Gustavo Aguilar, Pragaash Ponnusamy, Clint Solomon Mathialagan, Chengyuan Ma, Chenlei Guo |
A Vocabulary-Free Multilingual Neural Tokenizer for End-to-End Task Learning. ![Search on Bibsonomy](Pics/bibsonomy.png) |
RepL4NLP@ACL ![In: Proceedings of the 7th Workshop on Representation Learning for NLP, RepL4NLP@ACL 2022, Dublin, Ireland, May 26, 2022, pp. 91-99, 2022, Association for Computational Linguistics, 978-1-955917-48-3. The full citation details ...](Pics/full.jpeg) |
2022 |
DBLP DOI BibTeX RDF |
|
25 | Jinghao Zhou, Chen Wei 0005, Huiyu Wang, Wei Shen 0002, Cihang Xie, Alan L. Yuille, Tao Kong |
Image BERT Pre-training with Online Tokenizer. ![Search on Bibsonomy](Pics/bibsonomy.png) |
ICLR ![In: The Tenth International Conference on Learning Representations, ICLR 2022, Virtual Event, April 25-29, 2022, 2022, OpenReview.net. The full citation details ...](Pics/full.jpeg) |
2022 |
DBLP BibTeX RDF |
|
25 | Pavel Rychlý, Samuel Spalek |
Utok: The Fast Rule-based Tokenizer. ![Search on Bibsonomy](Pics/bibsonomy.png) |
RASLAN ![In: Proceedings of the 16th Workshop on Recent Advances in Slavonic Natural Languages Processing, RASLAN 2022, Karlova Studánka, Czech Republic, December 9-11, 2022, pp. 149-154, 2022, Tribun EU, 978-80-263-1752-4. The full citation details ...](Pics/full.jpeg) |
2022 |
DBLP BibTeX RDF |
|
25 | Shiyue Zhang, Vishrav Chaudhary, Naman Goyal, James Cross, Guillaume Wenzek, Mohit Bansal, Francisco Guzmán |
How Robust is Neural Machine Translation to Language Imbalance in Multilingual Tokenizer Training? ![Search on Bibsonomy](Pics/bibsonomy.png) |
AMTA ![In: Proceedings of the 15th biennial conference of the Association for Machine Translation in the Americas (Volume 1: Research Track), AMTA 2022, Orlando, USA, September 12-16, 2022, pp. 97-116, 2022, Association for Machine Translation in the Americas. The full citation details ...](Pics/full.jpeg) |
2022 |
DBLP BibTeX RDF |
|
25 | Jinghao Zhou, Chen Wei 0005, Huiyu Wang, Wei Shen 0002, Cihang Xie, Alan L. Yuille, Tao Kong |
iBOT: Image BERT Pre-Training with Online Tokenizer. ![Search on Bibsonomy](Pics/bibsonomy.png) |
CoRR ![In: CoRR abs/2111.07832, 2021. The full citation details ...](Pics/full.jpeg) |
2021 |
DBLP BibTeX RDF |
|
25 | Sangah Lee, Hyopil Shin |
The Korean Morphologically Tight-Fitting Tokenizer for Noisy User-Generated Texts. ![Search on Bibsonomy](Pics/bibsonomy.png) |
W-NUT ![In: Proceedings of the Seventh Workshop on Noisy User-generated Text, W-NUT 2021, Online, November 11, 2021, pp. 410-416, 2021, Association for Computational Linguistics, 978-1-954085-90-9. The full citation details ...](Pics/full.jpeg) |
2021 |
DBLP DOI BibTeX RDF |
|
25 | Phillip Rust, Jonas Pfeiffer, Ivan Vulic, Sebastian Ruder, Iryna Gurevych |
How Good is Your Tokenizer? On the Monolingual Performance of Multilingual Language Models. ![Search on Bibsonomy](Pics/bibsonomy.png) |
ACL/IJCNLP (1) ![In: Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, ACL/IJCNLP 2021, (Volume 1: Long Papers), Virtual Event, August 1-6, 2021., pp. 3118-3135, 2021, Association for Computational Linguistics, 978-1-954085-52-7. The full citation details ...](Pics/full.jpeg) |
2021 |
DBLP DOI BibTeX RDF |
|
25 | Phillip Rust, Jonas Pfeiffer, Ivan Vulic, Sebastian Ruder, Iryna Gurevych |
How Good is Your Tokenizer? On the Monolingual Performance of Multilingual Language Models. ![Search on Bibsonomy](Pics/bibsonomy.png) |
CoRR ![In: CoRR abs/2012.15613, 2020. The full citation details ...](Pics/full.jpeg) |
2020 |
DBLP BibTeX RDF |
|
25 | Daniele Mazzei, Giacomo Baldi, Gualtiero Fantoni, Gabriele Montelisciani, Antonio Pitasi, Laura Ricci, Lorenzo Rizzello |
A Blockchain Tokenizer for Industrial IOT trustless applications. ![Search on Bibsonomy](Pics/bibsonomy.png) |
Future Gener. Comput. Syst. ![In: Future Gener. Comput. Syst. 105, pp. 432-445, 2020. The full citation details ...](Pics/full.jpeg) |
2020 |
DBLP DOI BibTeX RDF |
|
25 | Dokook Choe, Rami Al-Rfou, Mandy Guo, Heeyoung Lee, Noah Constant |
Bridging the Gap for Tokenizer-Free Language Models. ![Search on Bibsonomy](Pics/bibsonomy.png) |
CoRR ![In: CoRR abs/1908.10322, 2019. The full citation details ...](Pics/full.jpeg) |
2019 |
DBLP BibTeX RDF |
|
25 | Kazuhisa Nakasho |
Development of a Flexible Mizar Tokenizer and Parser for Information Retrieval System. ![Search on Bibsonomy](Pics/bibsonomy.png) |
FedCSIS ![In: Proceedings of the 2019 Federated Conference on Computer Science and Information Systems, FedCSIS 2019, Leipzig, Germany, September 1-4, 2019., pp. 77-80, 2019, 978-83-952357-8-8. The full citation details ...](Pics/full.jpeg) |
2019 |
DBLP DOI BibTeX RDF |
|
25 | Taku Kudo, John Richardson |
SentencePiece: A simple and language independent subword tokenizer and detokenizer for Neural Text Processing. ![Search on Bibsonomy](Pics/bibsonomy.png) |
CoRR ![In: CoRR abs/1808.06226, 2018. The full citation details ...](Pics/full.jpeg) |
2018 |
DBLP BibTeX RDF |
|
25 | Taku Kudo, John Richardson |
SentencePiece: A simple and language independent subword tokenizer and detokenizer for Neural Text Processing. ![Search on Bibsonomy](Pics/bibsonomy.png) |
EMNLP (Demonstration) ![In: Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, EMNLP 2018: System Demonstrations, Brussels, Belgium, October 31 - November 4, 2018, pp. 66-71, 2018, Association for Computational Linguistics, 978-1-948087-85-8. The full citation details ...](Pics/full.jpeg) |
2018 |
DBLP DOI BibTeX RDF |
|
25 | Johannes Graën, Mara Bertamini, Martin Volk 0001 |
Cutter - a Universal Multilingual Tokenizer. ![Search on Bibsonomy](Pics/bibsonomy.png) |
SwissText ![In: Proceedings of the 3rd Swiss Text Analytics Conference, SwissText 2018, Winterthur, Switzerland, June 12-13, 2018., pp. 75-81, 2018, CEUR-WS.org. The full citation details ...](Pics/full.jpeg) |
2018 |
DBLP BibTeX RDF |
|
25 | Matthieu Jimenez, Maxime Cordy, Yves Le Traon, Mike Papadakis |
On the Impact of Tokenizer and Parameters on N-Gram Based Code Analysis. ![Search on Bibsonomy](Pics/bibsonomy.png) |
ICSME ![In: 2018 IEEE International Conference on Software Maintenance and Evolution, ICSME 2018, Madrid, Spain, September 23-29, 2018, pp. 437-448, 2018, IEEE Computer Society, 978-1-5386-7870-1. The full citation details ...](Pics/full.jpeg) |
2018 |
DBLP DOI BibTeX RDF |
|
25 | Kazuma Takaoka, Sorami Hisamoto, Noriko Kawahara, Miho Sakamoto, Yoshitaka Uchida, Yuji Matsumoto 0001 |
Sudachi: a Japanese Tokenizer for Business. ![Search on Bibsonomy](Pics/bibsonomy.png) |
LREC ![In: Proceedings of the Eleventh International Conference on Language Resources and Evaluation, LREC 2018, Miyazaki, Japan, May 7-12, 2018., 2018, European Language Resources Association (ELRA). The full citation details ...](Pics/full.jpeg) |
2018 |
DBLP BibTeX RDF |
|
25 | Luz Marina Sierra, Carlos Alberto Cobos Lozada, Juan Carlos Corrales |
Tokenizer Adapted for Nasa Yuwe Language. ![Search on Bibsonomy](Pics/bibsonomy.png) |
Computación y Sistemas ![In: Computación y Sistemas 20(3), pp. 355-364, 2016. The full citation details ...](Pics/full.jpeg) |
2016 |
DBLP DOI BibTeX RDF |
|
25 | K. Divyavarma, M. Remya, G. Deepa |
An Enhanced Bug Mining for Identifying Frequent Bug Pattern Using Word Tokenizer and FP-Growth. ![Search on Bibsonomy](Pics/bibsonomy.png) |
FICTA (1) ![In: Proceedings of the 5th International Conference on Frontiers in Intelligent Computing: Theory and Applications - FICTA 2016, Bhubaneswar, Odisa, India, Volume 1, pp. 525-532, 2016, Springer, 978-981-10-3152-6. The full citation details ...](Pics/full.jpeg) |
2016 |
DBLP DOI BibTeX RDF |
|
25 | György Szaszák, Máté Ákos Tündik, András Beke |
Summarization of Spontaneous Speech using Automatic Speech Recognition and a Speech Prosody based Tokenizer. ![Search on Bibsonomy](Pics/bibsonomy.png) |
KDIR ![In: Proceedings of the 8th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management (IC3K 2016) - Volume 1: KDIR, Porto - Portugal, November 9 - 11, 2016., pp. 221-227, 2016, SciTePress, 978-989-758-203-5. The full citation details ...](Pics/full.jpeg) |
2016 |
DBLP DOI BibTeX RDF |
|
25 | Juhaida Abu Bakar, Khairuddin Omar, Mohammad Faidzul Nasrudin, Mohd Zamri Murah |
Tokenizer for the Malay language using pattern matching. ![Search on Bibsonomy](Pics/bibsonomy.png) |
ISDA ![In: 14th International Conference on Intelligent Systems Design and Applications, ISDA 2014, Okinawa, Japan, November 28-30, 2014, pp. 140-144, 2014, IEEE, 978-1-4799-7938-7. The full citation details ...](Pics/full.jpeg) |
2014 |
DBLP DOI BibTeX RDF |
|
25 | Arianna Pipitone, Maria Carmela Campisi, Roberto Pirrone |
An A* Based Semantic Tokenizer for Increasing the Performance of Semantic Applications. ![Search on Bibsonomy](Pics/bibsonomy.png) |
ICSC ![In: 2013 IEEE Seventh International Conference on Semantic Computing, Irvine, CA, USA, September 16-18, 2013, pp. 393-394, 2013, IEEE Computer Society, 978-0-7695-5119-7. The full citation details ...](Pics/full.jpeg) |
2013 |
DBLP DOI BibTeX RDF |
|
25 | Jirí Marsík, Ondrej Bojar |
TrTok: A Fast and Trainable Tokenizer for Natural Languages. ![Search on Bibsonomy](Pics/bibsonomy.png) |
Prague Bull. Math. Linguistics ![In: Prague Bull. Math. Linguistics 98, pp. 75-86, 2012. The full citation details ...](Pics/full.jpeg) |
2012 |
DBLP BibTeX RDF |
|
25 | Neil Barrett, Jens H. Weber-Jahnke |
Building a biomedical tokenizer using the token lattice design pattern and the adapted Viterbi algorithm. ![Search on Bibsonomy](Pics/bibsonomy.png) |
BMC Bioinform. ![In: BMC Bioinform. 12(S-3), pp. S1, 2011. The full citation details ...](Pics/full.jpeg) |
2011 |
DBLP DOI BibTeX RDF |
|
25 | Neil Barrett, Jens H. Weber-Jahnke |
Building a Biomedical Tokenizer Using the Token Lattice Design Pattern and the Adapted Viterbi Algorithm. ![Search on Bibsonomy](Pics/bibsonomy.png) |
ICMLA ![In: The Ninth International Conference on Machine Learning and Applications, ICMLA 2010, Washington, DC, USA, 12-14 December 2010, pp. 473-478, 2010, IEEE Computer Society, 978-0-7695-4300-0. The full citation details ...](Pics/full.jpeg) |
2010 |
DBLP DOI BibTeX RDF |
|
25 | Aasish Pappu, Ratna Sanyal |
Vaakkriti: Sanskrit Tokenizer. ![Search on Bibsonomy](Pics/bibsonomy.png) |
IJCNLP ![In: Third International Joint Conference on Natural Language Processing, IJCNLP 2008, Hyderabad, India, January 7-12, 2008, pp. 577-582, 2008, The Association for Computer Linguistics. The full citation details ...](Pics/full.jpeg) |
2008 |
DBLP BibTeX RDF |
|
25 | Chengguo Jin, Seung-Hoon Na, Dong-Il Kim, Jong-Hyeok Lee |
Automatic Extraction of English-Chinese Transliteration Pairs using Dynamic Window and Tokenizer. ![Search on Bibsonomy](Pics/bibsonomy.png) |
IJCNLP ![In: Third International Joint Conference on Natural Language Processing, IJCNLP 2008, Hyderabad, India, January 7-12, 2008, pp. 9-15, 2008, The Association for Computer Linguistics. The full citation details ...](Pics/full.jpeg) |
2008 |
DBLP BibTeX RDF |
|
25 | Oana Frunza |
A Trainable Tokenizer, solution for multilingual texts and compound expression tokenization. ![Search on Bibsonomy](Pics/bibsonomy.png) |
LREC ![In: Proceedings of the International Conference on Language Resources and Evaluation, LREC 2008, 26 May - 1 June 2008, Marrakech, Morocco, 2008, European Language Resources Association. The full citation details ...](Pics/full.jpeg) |
2008 |
DBLP BibTeX RDF |
|
25 | Zhi-Jie Chang, Hsiao-Chuan Wang |
以高斯混合模型表徵器與語言模型為基礎之語言辨認 (Language Identification based on Gaussian Mixture Model Tokenizer and Language Model) [In Chinese]. ![Search on Bibsonomy](Pics/bibsonomy.png) |
ROCLING ![In: Proceedings of the 17th Conference on Computational Linguistics and Speech Processing, ROCLING 2005, Taiwan, ROC, 2005, 2005, Association for Computational Linguistics and Chinese Language Processing (ACLCLP), Taiwan. The full citation details ...](Pics/full.jpeg) |
2005 |
DBLP BibTeX RDF |
|
23 | Rong Tong, Bin Ma 0001, Haizhou Li 0001, Chng Eng Siong |
A Target-Oriented Phonotactic Front-End for Spoken Language Recognition. ![Search on Bibsonomy](Pics/bibsonomy.png) |
IEEE Trans. Speech Audio Process. ![In: IEEE Trans. Speech Audio Process. 17(7), pp. 1335-1347, 2009. The full citation details ...](Pics/full.jpeg) |
2009 |
DBLP DOI BibTeX RDF |
|
23 | Yu-Chieh Wu, Jie-Chi Yang |
A Robust Passage Retrieval Algorithm for Video Question Answering. ![Search on Bibsonomy](Pics/bibsonomy.png) |
IEEE Trans. Circuits Syst. Video Technol. ![In: IEEE Trans. Circuits Syst. Video Technol. 18(10), pp. 1411-1421, 2008. The full citation details ...](Pics/full.jpeg) |
2008 |
DBLP DOI BibTeX RDF |
|
23 | Rong Tong, Bin Ma 0001, Haizhou Li 0001, Engsiong Chng |
Target-oriented phone tokenizers for spoken language recognition. ![Search on Bibsonomy](Pics/bibsonomy.png) |
ICASSP ![In: Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2008, March 30 - April 4, 2008, Caesars Palace, Las Vegas, Nevada, USA, pp. 4221-4224, 2008, IEEE, 1-4244-1484-9. The full citation details ...](Pics/full.jpeg) |
2008 |
DBLP DOI BibTeX RDF |
|
23 | Hong Phuong Le, Nguyên Thi Minh Huyên, Azim Roussanaly, Hô Tuòng Vinh |
A Hybrid Approach to Word Segmentation of Vietnamese Texts. ![Search on Bibsonomy](Pics/bibsonomy.png) |
LATA ![In: Language and Automata Theory and Applications, Second International Conference, LATA 2008, Tarragona, Spain, March 13-19, 2008. Revised Papers, pp. 240-249, 2008, Springer, 978-3-540-88281-7. The full citation details ...](Pics/full.jpeg) |
2008 |
DBLP DOI BibTeX RDF |
|
23 | Francisco-Mario Barcala, Jesús Vilares Ferro, Miguel A. Alonso 0001, Jorge Graña Gil, Manuel Vilares Ferro |
Tokenization and Proper Noun Recognition for Information Retrieval. ![Search on Bibsonomy](Pics/bibsonomy.png) |
DEXA Workshops ![In: 13th International Workshop on Database and Expert Systems Applications (DEXA 2002), 2-6 September 2002, Aix-en-Provence, France, pp. 246-250, 2002, IEEE Computer Society, 0-7695-1668-8. The full citation details ...](Pics/full.jpeg) |
2002 |
DBLP DOI BibTeX RDF |
|
23 | Jesús Vilares Ferro, Francisco-Mario Barcala, Miguel A. Alonso 0001, Jorge Graña Gil, Manuel Vilares Ferro |
Practical NLP-Based Text Indexing. ![Search on Bibsonomy](Pics/bibsonomy.png) |
IBERAMIA ![In: Advances in Artificial Intelligence - IBERAMIA 2002, 8th Ibero-American Conference on AI, Seville, Spain, November 12-15, 2002, Proceedings, pp. 635-644, 2002, Springer, 3-540-00131-X. The full citation details ...](Pics/full.jpeg) |
2002 |
DBLP DOI BibTeX RDF |
|