ModErn Text Analysis
META Enumerates Textual Applications
semi_skeleton_featurizer.h
1 
10 #ifndef META_SEMI_SKELETON_FEATURIZER_H_
11 #define META_SEMI_SKELETON_FEATURIZER_H_
12 
14 #include "util/clonable.h"
15 
16 namespace meta
17 {
18 namespace analyzers
19 {
20 
25 class semi_skeleton_featurizer
26  : public util::clonable<tree_featurizer, semi_skeleton_featurizer>
27 {
28  public:
34  void tree_tokenize(corpus::document& doc,
35  const parser::parse_tree& tree) const override;
36 
38  const static std::string id;
39 };
40 }
41 }
42 
43 #endif
void tree_tokenize(corpus::document &doc, const parser::parse_tree &tree) const override
Keeps track of one node's tag and the skeleton structure beneath it.
Definition: semi_skeleton_featurizer.cpp:50
parser::parse_tree
Represents the parse tree for a sentence.
Definition: parse_tree.h:32
util::clonable
Template class to facilitate polymorphic cloning.
Definition: clonable.h:28
corpus::document
Represents an indexable document.
Definition: document.h:31
The ModErn Text Analysis toolkit is a suite of natural language processing, classification, information retrieval, data mining, and other applications of text processing.
Definition: analyzer.h:24
static const std::string id
Identifier for this featurizer.
Definition: semi_skeleton_featurizer.h:38
semi_skeleton_featurizer
Tokenizes parse trees by keeping track of only a single node label and the underlying tree structure...
Definition: semi_skeleton_featurizer.h:25
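The brief descriptions above say the featurizer keeps one node's tag together with the unlabeled structure beneath it. The following is a minimal sketch of one way to read that description, not the toolkit's implementation: the `Node` struct, the `semi_skeleton` function, the parenthesized feature strings, and the `std::map` of counts are all hypothetical stand-ins, used here in place of `parser::parse_tree` and `corpus::document`.

```cpp
#include <cassert>
#include <map>
#include <string>
#include <vector>

// Hypothetical stand-in for a parse-tree node; this is NOT MeTA's node type.
struct Node
{
    std::string tag;            // syntactic category, e.g. "NP"
    std::vector<Node> children; // empty for a leaf
};

// Visits the subtree rooted at `node` and counts one feature per node:
// the node's own tag followed by the unlabeled skeletons of its children.
// Returns the fully unlabeled skeleton of the subtree so that the parent
// can embed it without any of the descendant tags.
std::string semi_skeleton(const Node& node, std::map<std::string, int>& counts)
{
    assert(!node.tag.empty()); // assumption: every node carries a tag

    std::string labeled = "(" + node.tag; // keeps this node's tag only
    std::string skeleton = "(";           // keeps no tags at all
    for (const auto& child : node.children)
    {
        const std::string sub = semi_skeleton(child, counts);
        labeled += sub;
        skeleton += sub;
    }
    labeled += ")";
    skeleton += ")";

    ++counts[labeled];
    return skeleton;
}
```

Under these assumptions, calling `semi_skeleton` on a tree `S -> NP(DT NN) VP(VBZ)` would record six features, one per node, such as `(NP()())` for the noun phrase and `(S(()())(()))` for the root. Returning the unlabeled skeleton from each call is what lets every feature carry exactly one tag.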