Main Page | Namespace List | Class Hierarchy | Alphabetical List | Class List | Directories | File List | Class Members | File Members

np Class Reference

The class np implements a dummy proper noun recognizer. More...

#include <np.h>

Inheritance diagram for np:

Inheritance graph
[legend]
Collaboration diagram for np:

Collaboration graph
[legend]
List of all members.

Public Member Functions

 np (const maco_options &)
 Constructor.

Private Member Functions

int ComputeToken (int, sentence::const_iterator, const sentence &)
 Compute the right token code for word j from given state.
void ResetActions ()
 Reset flag about capitalized noun at sentence start.
void StateActions (int, int, int, sentence::const_iterator)
 Perform necessary actions in "state" reached from state "origin" via word j interpreted as code "token": Basically, set flag about capitalized noun at sentence start.
void SetMultiwordAnalysis (sentence::iterator) const
 Set the appropriate lemma and parole for the new multiword.
bool ValidMultiWord (const word &) const
 Perform last minute validation before effectively building multiword.
bool find_tag_match (RegEx &, const sentence::const_iterator)
 Private method to look for a tag in a list of analysis.

Private Attributes

set< string > func
 set of function words
set< string > punct
 set of special punctuation tags
string NE_tag
 Tag to assign to detected NEs.
bool initialNoun
 it is a noun at the beggining of the sentence
unsigned int Title_length
 length beyond which a multiword made of all capitialized words ("WRECKAGE: TITANIC DISAPPEARS IN NORTHERN SEA") will be considered a title and not a proper noun.
RegEx RE_NounAdj
RegEx RE_Closed
RegEx RE_DateNumPunct

Detailed Description

The class np implements a dummy proper noun recognizer.


Constructor & Destructor Documentation

np::np const maco_options  ) 
 

Constructor.


Member Function Documentation

int np::ComputeToken int  ,
sentence::const_iterator  ,
const sentence
[private, virtual]
 

Compute the right token code for word j from given state.

Implements automat.

bool np::find_tag_match RegEx ,
const sentence::const_iterator 
[private]
 

Private method to look for a tag in a list of analysis.

void np::ResetActions  )  [private, virtual]
 

Reset flag about capitalized noun at sentence start.

Implements automat.

void np::SetMultiwordAnalysis sentence::iterator   )  const [private, virtual]
 

Set the appropriate lemma and parole for the new multiword.

Implements automat.

void np::StateActions int  ,
int  ,
int  ,
sentence::const_iterator 
[private, virtual]
 

Perform necessary actions in "state" reached from state "origin" via word j interpreted as code "token": Basically, set flag about capitalized noun at sentence start.

Implements automat.

bool np::ValidMultiWord const word  )  const [private, virtual]
 

Perform last minute validation before effectively building multiword.

Reimplemented from automat.


Member Data Documentation

set<string> np::func [private]
 

set of function words

bool np::initialNoun [private]
 

it is a noun at the beggining of the sentence

string np::NE_tag [private]
 

Tag to assign to detected NEs.

set<string> np::punct [private]
 

set of special punctuation tags

RegEx np::RE_Closed [private]
 

RegEx np::RE_DateNumPunct [private]
 

RegEx np::RE_NounAdj [private]
 

unsigned int np::Title_length [private]
 

length beyond which a multiword made of all capitialized words ("WRECKAGE: TITANIC DISAPPEARS IN NORTHERN SEA") will be considered a title and not a proper noun.

A value of zero deactivates this behaviour.


The documentation for this class was generated from the following files:
Generated on Wed Apr 26 12:58:26 2006 for FreeLing by  doxygen 1.4.4