#include <np.h>
Inheritance diagram for np:
Public Member Functions | |
np (const maco_options &) | |
Constructor. | |
Private Member Functions | |
int | ComputeToken (int, sentence::const_iterator, const sentence &) |
Compute the right token code for word j from given state. | |
void | ResetActions () |
Reset flag about capitalized noun at sentence start. | |
void | StateActions (int, int, int, sentence::const_iterator) |
Perform necessary actions in "state" reached from state "origin" via word j interpreted as code "token": Basically, set flag about capitalized noun at sentence start. | |
void | SetMultiwordAnalysis (sentence::iterator) const |
Set the appropriate lemma and parole for the new multiword. | |
bool | ValidMultiWord (const word &) const |
Perform last minute validation before effectively building multiword. | |
bool | find_tag_match (RegEx &, const sentence::const_iterator) |
Private method to look for a tag in a list of analyses. | |
Private Attributes | |
set< string > | func |
set of function words | |
set< string > | punct |
set of special punctuation tags | |
string | NE_tag |
Tag to assign to detected NEs. | |
bool | initialNoun |
Flag set when there is a noun at the beginning of the sentence. | |
unsigned int | Title_length |
Length beyond which a multiword made of all capitalized words ("WRECKAGE: TITANIC DISAPPEARS IN NORTHERN SEA") will be considered a title and not a proper noun. | |
RegEx | RE_NounAdj |
RegEx | RE_Closed |
RegEx | RE_DateNumPunct |
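The tag lookup described for find_tag_match can be pictured with the sketch below. It is only an illustration under stated assumptions: std::regex stands in for the RegEx class, a plain vector of tag strings stands in for the analyses of a word, the helper name any_tag_matches is hypothetical, and whether the real method searches within a tag or requires a full match is not stated in this reference.

```cpp
#include <regex>
#include <string>
#include <vector>

// Hypothetical stand-in for find_tag_match: returns true if any tag in the
// list of analyses matches the given regular expression.
// Assumption: a partial match (regex_search) is sufficient.
bool any_tag_matches(const std::regex &re, const std::vector<std::string> &tags) {
    for (const auto &tag : tags) {
        if (std::regex_search(tag, re)) {
            return true;  // stop at the first analysis whose tag matches
        }
    }
    return false;  // no analysis carries a matching tag
}
```

For instance, with an illustrative pattern such as "^(NC|AQ)", a word whose analyses carry the tags "VMIP3S0" and "NCMS000" would match, while a word carrying only "VMIP3S0" would not. The actual patterns held in RE_NounAdj, RE_Closed and RE_DateNumPunct are not given here.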
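np::np (const maco_options &) |
Constructor.
|
int np::ComputeToken (int, sentence::const_iterator, const sentence &) |
Compute the right token code for word j from given state.
Implements automat. |
bool np::find_tag_match (RegEx &, const sentence::const_iterator) |
Private method to look for a tag in a list of analyses.
|
void np::ResetActions () |
Reset flag about capitalized noun at sentence start.
Implements automat. |
void np::SetMultiwordAnalysis (sentence::iterator) const |
Set the appropriate lemma and parole for the new multiword.
Implements automat. |
void np::StateActions (int, int, int, sentence::const_iterator) |
Perform necessary actions in "state" reached from state "origin" via word j interpreted as code "token": Basically, set flag about capitalized noun at sentence start.
Implements automat. |
bool np::ValidMultiWord (const word &) const |
Perform last minute validation before effectively building multiword.
Reimplemented from automat. |
set< string > np::func |
set of function words
|
bool np::initialNoun |
Flag set when there is a noun at the beginning of the sentence.
|
string np::NE_tag |
Tag to assign to detected NEs.
|
set< string > np::punct |
set of special punctuation tags
|
RegEx np::RE_Closed |
|
RegEx np::RE_DateNumPunct |
|
RegEx np::RE_NounAdj |
|
unsigned int np::Title_length |
Length beyond which a multiword made of all capitalized words ("WRECKAGE: TITANIC DISAPPEARS IN NORTHERN SEA") will be considered a title and not a proper noun. A value of zero deactivates this behaviour. |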
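The Title_length rule can be sketched as follows. This is a minimal illustration, not the class's actual code. The helper names is_all_caps and looks_like_title are hypothetical. It also assumes that length is counted in words, that "beyond" means strictly greater than Title_length, and that "capitalized" means fully uppercase, as in the example headline.

```cpp
#include <cctype>
#include <string>
#include <vector>

// Hypothetical helper: true if the word contains at least one letter and no
// lowercase letters (punctuation such as ':' is ignored).
bool is_all_caps(const std::string &w) {
    bool has_alpha = false;
    for (unsigned char c : w) {
        if (std::islower(c)) return false;
        if (std::isalpha(c)) has_alpha = true;
    }
    return has_alpha;
}

// Hypothetical helper: decide whether a candidate multiword should be
// treated as a title rather than a proper noun.
bool looks_like_title(const std::vector<std::string> &words,
                      unsigned int title_length) {
    if (title_length == 0) return false;             // zero deactivates the check
    if (words.size() <= title_length) return false;  // assumption: strictly longer
    for (const auto &w : words) {
        if (!is_all_caps(w)) return false;           // every word must be uppercase
    }
    return true;
}
```

Under these assumptions, the six-word headline from the description is classified as a title when Title_length is 3, while a two-word sequence such as "NEW YORK" is not, and a Title_length of zero never yields a title.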