ModErn Text Analysis
META Enumerates Textual Applications
|
#include <functional>
#include <string>
Go to the source code of this file.
Namespaces | |
meta | |
The ModErn Text Analysis toolkit is a suite of natural language processing, classification, information retreival, data mining, and other applications of text processing. | |
meta::utf | |
Functions for converting to and from various character sets. | |
Functions | |
std::string | meta::utf::to_utf8 (const std::string &str, const std::string &charset) |
Converts a string from the given charset to utf8. More... | |
std::u16string | meta::utf::to_utf16 (const std::string &str, const std::string &charset) |
Converts a string fro the given charset to utf16. More... | |
std::string | meta::utf::to_utf8 (const std::u16string &str) |
Converts a string from utf16 to utf8. More... | |
std::u16string | meta::utf::to_utf16 (const std::string &str) |
Converts a string from utf8 to utf16. More... | |
std::string | meta::utf::tolower (const std::string &str) |
Lowercases a utf8 string. More... | |
std::string | meta::utf::toupper (const std::string &str) |
Uppercases a utf8 string. More... | |
std::string | meta::utf::foldcase (const std::string &str) |
Folds the case of a utf8 string. More... | |
std::string | meta::utf::transform (const std::string &str, const std::string &id) |
Transliterates a utf8 string, using the rules defined in ICU. More... | |
std::string | meta::utf::remove_if (const std::string &str, std::function< bool(uint32_t)> pred) |
Removes UTF-32 codepoints that match the given function. More... | |
uint64_t | meta::utf::length (const std::string &str) |
bool | meta::utf::isalpha (uint32_t codepoint) |
bool | meta::utf::isblank (uint32_t codepoint) |
All files in META are dual-licensed under the MIT and NCSA licenses. For more details, consult the file LICENSE.mit and LICENSE.ncsa in the root of the project.