๐ฐTokenization Functions
A set of functions to tokenize a given text into sentences or words, with optional conditions and morphological operations.
Table of Contents
tokenize
Example
const text = "test ุฐููุจ ู
ุญูู
ุฏ ุงูู ุงูุณูู";
const tokens = tokenize(text);
const tokenswithparams = tokenize(text, [isArabicRange], [stripHarakat]);
console.log(tokens); // Output: [ 'test','ุฐููุจ', 'ู
ูุญู
ุฏ', 'ุงูู', 'ุงูุณูู' ]
console.log(tokenswithparams); // Output: [ 'ุฐูุจ', 'ู
ุญู
ุฏ', 'ุงูู', 'ุงูุณูู' ]sentenceTokenize
tokenizeWithLocation
Last updated