ModErn Text Analysis
META Enumerates Textual Applications
analyzer_test.h
Go to the documentation of this file.
1 
10 #ifndef META_ANALYZER_TEST_H_
11 #define META_ANALYZER_TEST_H_
12 
13 #include <string>
14 #include "test/unit_test.h"
15 #include "analyzers/all.h"
16 
17 namespace meta
18 {
19 namespace testing
20 {
27 template <class Analyzer>
28 void check_analyzer_expected(Analyzer& ana, corpus::document doc,
29  uint64_t num_unique, uint64_t length);
30 
35 int content_tokenize();
36 
41 int file_tokenize();
42 
47 int analyzer_tests();
48 }
49 }
50 
51 #endif
uint64_t length(const std::string &str)
Definition: utf.cpp:136
int file_tokenize()
Test tokenization on documents that read content from files.
Definition: analyzer_test.cpp:78
Represents an indexable document.
Definition: document.h:31
The ModErn Text Analysis toolkit is a suite of natural language processing, classification, information retreival, data mining, and other applications of text processing.
Definition: analyzer.h:24
int analyzer_tests()
Runs the analyzer tests.
Definition: analyzer_test.cpp:104
void check_analyzer_expected(Analyzer &ana, corpus::document doc, uint64_t num_unique, uint64_t length)
Definition: analyzer_test.cpp:29
int content_tokenize()
Test tokenization on documents with content.
Definition: analyzer_test.cpp:48