ModErn Text Analysis
META Enumerates Textual Applications
score_data.h
   10 #ifndef META_SCORE_DATA_H_
   11 #define META_SCORE_DATA_H_
   12
   13 #include "meta.h"
   14
   15 namespace meta
   16 {
   17
   18 namespace corpus
   19 {
   20 class document;
   21 }
   22
   23 namespace index
   24 {
   25 class inverted_index;
   26 }
   27 }
   28
   29 namespace meta
   30 {
   31 namespace index
   32 {
   33
   39 struct score_data
   40 {
   41     // general info
   42
   44     inverted_index& idx;
   46     double avg_dl;
   48     uint64_t num_docs;
   50     uint64_t total_terms;
   52     const corpus::document& query;
   53
   54     // term-based info
   55
   57     term_id t_id;
   59     uint64_t query_term_count;
   61     uint64_t doc_count;
   63     uint64_t corpus_term_count;
   64
   65     // document-based info
   66
   68     doc_id d_id;
   70     uint64_t doc_term_count;
   72     uint64_t doc_size;
   74     uint64_t doc_unique_terms;
   75
   84     score_data(inverted_index& p_idx, double p_avg_dl, uint64_t p_num_docs,
   85                uint64_t p_total_terms, const corpus::document& p_query)
   86         : idx(p_idx), // gcc no non-const ref init from brace init list
   87           avg_dl{p_avg_dl},
   88           num_docs{p_num_docs},
   89           total_terms{p_total_terms},
   90           query(p_query) // gcc no non-const ref init from brace init list
   91     {
   92         /* nothing */
   93     }
   94 };
   95 }
   96 }
   97
   98 #endif
meta::index::score_data::doc_unique_terms
    uint64_t doc_unique_terms
    number of unique terms in the doc
    Definition: score_data.h:74
meta.h
    Contains top-level namespace documentation for the META toolkit.
meta::index::inverted_index
    The inverted_index class stores information on a corpus indexed by term_ids.
    Definition: inverted_index.h:54
meta::index::score_data::num_docs
    uint64_t num_docs
    total number of documents
    Definition: score_data.h:48
meta::index::score_data::doc_count
    uint64_t doc_count
    number of docs that t_id appears in
    Definition: score_data.h:61
meta::index::score_data::corpus_term_count
    uint64_t corpus_term_count
    number of times t_id appears in corpus
    Definition: score_data.h:63
meta::index::score_data::total_terms
    uint64_t total_terms
    total number of terms in the index
    Definition: score_data.h:50
meta::index::score_data::score_data
    score_data(inverted_index &p_idx, double p_avg_dl, uint64_t p_num_docs, uint64_t p_total_terms, const corpus::document &p_query)
    Constructor to initialize most elements.
    Definition: score_data.h:84
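
The constructor sets only the query-level ("general info") members; the term-based and document-based members are left for the caller to assign. A minimal sketch of that fill pattern follows. It is self-contained and therefore relies on assumptions: `inverted_index_stub`, `document_stub`, `score_data_sketch`, `set_term_info`, and `set_doc_info` are hypothetical names, `term_id` and `doc_id` are modelled as plain integers, and the zero default initializers are an addition that the header does not have.

```cpp
#include <cstdint>

// Hypothetical stand-ins so the sketch compiles on its own; the real
// inverted_index, corpus::document, term_id and doc_id come from the toolkit.
struct inverted_index_stub {};
struct document_stub {};
using term_id = std::uint64_t;
using doc_id = std::uint64_t;

// Mirrors the member layout of score_data from the listing.
struct score_data_sketch
{
    // general info: fixed for the lifetime of one query
    inverted_index_stub& idx;
    double avg_dl;
    std::uint64_t num_docs;
    std::uint64_t total_terms;
    const document_stub& query;

    // term-based info: zero defaults are an assumption of this sketch
    term_id t_id = 0;
    std::uint64_t query_term_count = 0;
    std::uint64_t doc_count = 0;
    std::uint64_t corpus_term_count = 0;

    // document-based info: zero defaults are an assumption of this sketch
    doc_id d_id = 0;
    std::uint64_t doc_term_count = 0;
    std::uint64_t doc_size = 0;
    std::uint64_t doc_unique_terms = 0;

    score_data_sketch(inverted_index_stub& p_idx, double p_avg_dl,
                      std::uint64_t p_num_docs, std::uint64_t p_total_terms,
                      const document_stub& p_query)
        : idx(p_idx), // references use parentheses, as in the original
          avg_dl{p_avg_dl},
          num_docs{p_num_docs},
          total_terms{p_total_terms},
          query(p_query)
    {
    }
};

// Hypothetical helper: update the fields that change per query term.
void set_term_info(score_data_sketch& sd, term_id t, std::uint64_t query_tf,
                   std::uint64_t df, std::uint64_t corpus_tf)
{
    sd.t_id = t;
    sd.query_term_count = query_tf;
    sd.doc_count = df;
    sd.corpus_term_count = corpus_tf;
}

// Hypothetical helper: update the fields that change per candidate document.
void set_doc_info(score_data_sketch& sd, doc_id d, std::uint64_t tf,
                  std::uint64_t size, std::uint64_t unique_terms)
{
    sd.d_id = d;
    sd.doc_term_count = tf;
    sd.doc_size = size;
    sd.doc_unique_terms = unique_terms;
}
```

Under these assumptions, a caller would construct one object per query, call `set_term_info` once for each query term, and call `set_doc_info` for each document in which that term occurs before handing the object to a ranking function.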
meta::corpus::document
    Represents an indexable document.
    Definition: document.h:31
meta::index::score_data::avg_dl
    double avg_dl
    average document length
    Definition: score_data.h:46
meta
    The ModErn Text Analysis toolkit is a suite of natural language processing, classification, information retrieval, data mining, and other applications of text processing.
    Definition: analyzer.h:24
meta::index::score_data::d_id
    doc_id d_id
    document id
    Definition: score_data.h:68
meta::index::score_data::idx
    inverted_index & idx
    the index that queries are running on
    Definition: score_data.h:44
meta::index::score_data::query_term_count
    uint64_t query_term_count
    query term count
    Definition: score_data.h:59
meta::index::score_data
    A score_data object contains information needed to evaluate a ranking function.
    Definition: score_data.h:39
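
To illustrate how a ranking function might consume these fields, the sketch below computes the contribution of one term to an Okapi BM25 score. It is not taken from the toolkit: `score_fields` is a hypothetical stand-in that carries only the numeric members the formula needs, `bm25_term_score` is an illustrative name, and the constants `k1`, `b`, and `k3` are assumed example values. It also assumes that `query_term_count` is the number of times the term occurs in the query and that `avg_dl` is greater than zero.

```cpp
#include <cmath>
#include <cstdint>

// Hypothetical subset of score_data: just the fields BM25 reads.
struct score_fields
{
    double avg_dl;                  // average document length
    std::uint64_t num_docs;         // total number of documents
    std::uint64_t query_term_count; // assumed: occurrences of the term in the query
    std::uint64_t doc_count;        // number of docs the term appears in
    std::uint64_t doc_term_count;   // occurrences of the term in the current doc
    std::uint64_t doc_size;         // total number of terms in the current doc
};

// Per-term BM25 contribution; k1, b and k3 are assumed example constants.
double bm25_term_score(const score_fields& sd, double k1 = 1.2,
                       double b = 0.75, double k3 = 500.0)
{
    const double n = static_cast<double>(sd.num_docs);
    const double df = static_cast<double>(sd.doc_count);
    const double tf = static_cast<double>(sd.doc_term_count);
    const double qtf = static_cast<double>(sd.query_term_count);
    const double dl = static_cast<double>(sd.doc_size);

    // Inverse document frequency: rarer terms weigh more.
    const double idf = std::log(1.0 + (n - df + 0.5) / (df + 0.5));

    // Term frequency, saturated by k1 and normalized by document length.
    const double norm = k1 * ((1.0 - b) + b * dl / sd.avg_dl);
    const double tf_part = ((k1 + 1.0) * tf) / (norm + tf);

    // Query term frequency, saturated by k3.
    const double qtf_part = ((k3 + 1.0) * qtf) / (k3 + qtf);

    return idf * tf_part * qtf_part;
}
```

A document's score for a whole query would then be the sum of this value over the query's terms, which is why the term-based and document-based fields are updated repeatedly while the general fields stay fixed.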
meta::index::score_data::query
    const corpus::document & query
    the current query
    Definition: score_data.h:52
meta::index::score_data::doc_size
    uint64_t doc_size
    total number of terms in the doc
    Definition: score_data.h:72
meta::index::score_data::doc_term_count
    uint64_t doc_term_count
    number of times the term appears in the current doc
    Definition: score_data.h:70
meta::index::score_data::t_id
    term_id t_id
    doc term id
    Definition: score_data.h:57
Generated on Tue Mar 3 2015 23:20:16 for ModErn Text Analysis by Doxygen 1.8.9.1