CLemmatizer Class Reference

#include <Lemmatizers.h>


Constructor & Destructor Documentation

CLemmatizer::CLemmatizer ( MorphLanguageEnum Language )

CLemmatizer::~CLemmatizer ( ) [virtual]

References m_ic_ext2int and m_ic_int2ext.


Member Function Documentation

virtual void CLemmatizer::FilterSrc ( string &  src  )  const [protected, pure virtual]

Implemented in CLemmatizerRussian, CLemmatizerEnglish, and CLemmatizerGerman.

Referenced by CreateParadigmCollection(), GetAllAncodesAndLemmasQuick(), and GetAllAncodesQuick().

string CLemmatizer::GetRegistryString (  )  const [inline, protected]

References m_Registry.

Referenced by GetPath().

string CLemmatizer::GetPath (  )  const [protected]

References GetRegistryString().

Referenced by LoadDictionariesRegistry() and LoadStatisticRegistry().

void CLemmatizer::ReadOptions ( string  FileName  )  [protected]

References LoadFileToString(), m_bAllowRussianJo, and Trim().

Referenced by LoadDictionariesRegistry().

bool CLemmatizer::LemmatizeWord ( string & InputWordStr, const bool cap, const bool predict, vector< CAutomAnnotationInner > & results, bool bGetLemmaInfos ) const [protected]

void CLemmatizer::AssignWeightIfNeed ( vector< CAutomAnnotationInner > & FindResults ) const [protected]

References CStatistic::get_HomoWeight(), CAutomAnnotationInner::GetParadigmId(), m_bUseStatistic, CAutomAnnotationInner::m_ItemNo, CAutomAnnotationInner::m_nWeight, and m_Statistic.

Referenced by CreateParadigmCollection().
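
The references listed for AssignWeightIfNeed() (m_bUseStatistic, CStatistic::get_HomoWeight(), GetParadigmId(), m_ItemNo, m_nWeight) suggest a loop that fills in a weight for each result. A minimal sketch of that pattern follows. The types AnnotationSketch and StatisticSketch, the dummy weight formula, and the choice of 0 as the weight when statistics are disabled are all assumptions made for illustration; they are not taken from the real headers.

```cpp
#include <vector>

// Hypothetical stand-in for CAutomAnnotationInner (real class not shown here).
struct AnnotationSketch {
    int paradigm_id;  // plays the role of GetParadigmId()
    int item_no;      // plays the role of m_ItemNo
    int weight;       // plays the role of m_nWeight
};

// Hypothetical stand-in for CStatistic.
struct StatisticSketch {
    // Assumed lookup keyed by paradigm and form number; returns dummy data.
    int get_HomoWeight(int paradigm_id, int item_no) const {
        return paradigm_id * 10 + item_no;
    }
};

// Assign a weight to every result; fall back to 0 (an assumption)
// when statistics are switched off.
void AssignWeightIfNeedSketch(std::vector<AnnotationSketch>& results,
                              const StatisticSketch& stat,
                              bool use_statistic) {
    for (AnnotationSketch& a : results) {
        a.weight = use_statistic
                       ? stat.get_HomoWeight(a.paradigm_id, a.item_no)
                       : 0;
    }
}
```

Passing the statistic object and the flag as parameters keeps the sketch free of class state; in CLemmatizer these appear to be the members m_Statistic and m_bUseStatistic.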

bool CLemmatizer::CheckAbbreviation ( string InputWordStr, vector< CAutomAnnotationInner > & FindResults, bool is_cap ) const [protected]

References ConvertPredictTupleToAnnot(), CPredictBase::Find(), CABCEncoder::GetCriticalNounLetterPack(), GetLanguage(), is_upper_consonant(), CMorphDict::m_pFormAutomat, and m_Predict.

Referenced by PredictByDataBase().

CAutomAnnotationInner CLemmatizer::ConvertPredictTupleToAnnot ( const CPredictTuple input ) const [protected]

void CLemmatizer::PredictByDataBase ( string InputWordStr, vector< CAutomAnnotationInner > & results, bool is_cap ) const [protected]

bool CLemmatizer::IsPrefix ( const string & Prefix ) const [protected]

References m_PrefixesSet.

Referenced by LemmatizeWord().

MorphLanguageEnum CLemmatizer::GetLanguage ( ) const [inline]

const CStatistic & CLemmatizer::GetStatistic ( ) const

References m_Statistic.

Referenced by CFormInfo::GetHomonymWeightWithForm(), CFormInfo::GetWordWeight(), and CFormInfo::SetParadigmId().

bool CLemmatizer::CheckABC ( const string &  WordForm  )  const

References CABCEncoder::CheckABCWithoutAnnotator() and CMorphDict::m_pFormAutomat.

Referenced by PredictByDataBase().

bool CLemmatizer::IsHyphenPostfix ( const string &  Postfix  )  const

References m_HyphenPostfixes.

Referenced by CreateParadigmCollection().

bool CLemmatizer::IsHyphenPrefix ( const string &  Prefix  )  const

References m_HyphenPrefixes.

Referenced by CreateParadigmCollection().
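
IsPrefix(), IsHyphenPostfix() and IsHyphenPrefix() each take a string and reference a single set<string> member, which is consistent with a plain membership test. The sketch below shows that pattern as a free function. ContainsAffix is a hypothetical name, and the actual member functions may do more than this (for example, case normalisation).

```cpp
#include <set>
#include <string>

// Hypothetical helper: true if `candidate` is one of the stored affixes.
// Mirrors what a lookup in m_PrefixesSet, m_HyphenPrefixes or
// m_HyphenPostfixes could look like.
bool ContainsAffix(const std::set<std::string>& affixes,
                   const std::string& candidate) {
    // std::set::find is a logarithmic-time exact-match lookup.
    return affixes.find(candidate) != affixes.end();
}
```

Because std::set compares whole strings, a partial match such as "ant" against a stored "anti" is not reported as a hit.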

bool CLemmatizer::initIconv ( const string & enc_internal = "", const string & enc_external = "UTF8" )

References ddcIconv::good(), m_enc_ext, m_enc_int, m_ic_ext2int, and m_ic_int2ext.

Referenced by CLemmatizer() and CLemmatizerGerman::CLemmatizerGerman().

std::string CLemmatizer::recode_ext2int ( const std::string  s_ext  )  const [inline]

References ddcIconv::convert() and m_ic_ext2int.

Referenced by GetWordForms().

std::string CLemmatizer::recode_int2ext ( const std::string  s_int  )  const [inline]

References ddcIconv::convert() and m_ic_int2ext.

Referenced by GetWordForms().
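
recode_ext2int() and recode_int2ext() delegate to ddcIconv::convert(), whose interface is not documented on this page. As a rough illustration of what an iconv-based recoding step involves, the sketch below uses the POSIX iconv API directly. RecodeSketch is a hypothetical helper and not part of ddcIconv or CLemmatizer; the encoding names in the usage note are examples only.

```cpp
#include <iconv.h>

#include <cerrno>
#include <cstddef>
#include <stdexcept>
#include <string>

// Hypothetical helper: convert `input` from encoding `from` to encoding `to`.
// Flushing of shift state for stateful encodings is omitted for brevity.
std::string RecodeSketch(const std::string& from, const std::string& to,
                         const std::string& input) {
    // Note the argument order: target encoding first, source second.
    iconv_t cd = iconv_open(to.c_str(), from.c_str());
    if (cd == (iconv_t)-1) {
        throw std::runtime_error("iconv_open failed");
    }

    std::string output;
    char buffer[256];
    char* in = const_cast<char*>(input.data());
    std::size_t in_left = input.size();

    while (in_left > 0) {
        char* out = buffer;
        std::size_t out_left = sizeof(buffer);
        std::size_t rc = iconv(cd, &in, &in_left, &out, &out_left);
        // Keep whatever was converted in this round.
        output.append(buffer, sizeof(buffer) - out_left);
        // E2BIG only means the chunk buffer filled up; loop again.
        if (rc == (std::size_t)-1 && errno != E2BIG) {
            iconv_close(cd);
            throw std::runtime_error("iconv conversion failed");
        }
    }

    iconv_close(cd);
    return output;
}
```

For example, RecodeSketch("UTF-8", "ISO-8859-1", text) would play the role of an external-to-internal conversion if the internal encoding were ISO-8859-1. CLemmatizer instead keeps two long-lived converters (m_ic_ext2int and m_ic_int2ext) set up by initIconv(), rather than opening a conversion descriptor per call.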

bool CLemmatizer::LoadDictionariesRegistry ( string & strError )

bool CLemmatizer::LoadStatisticRegistry ( SubjectEnum subj )

References GetPath(), CStatistic::Load(), m_Statistic, subjComputer, subjFinance, and subjLiterature.

bool CLemmatizer::CreateParadigmCollection ( bool bNorm, string & WordStr, bool capital, bool bUsePrediction, vector< CFormInfo > & Result ) const

void CLemmatizer::GetAllAncodesQuick ( const BYTE WordForm, bool capital, BYTE OutBuffer, bool bUsePrediction ) const

bool CLemmatizer::GetAllAncodesAndLemmasQuick ( string & InputWordStr, bool capital, char * OutBuffer, size_t MaxBufferSize, bool bUsePrediction ) const

bool CLemmatizer::CreateParadigmFromID ( DWORD id, CFormInfo Result ) const

References CFormInfo::AttachLemmatizer() and CFormInfo::SetParadigmId().

Referenced by CMorphologyHolder::id_to_paradigm() and CMorphologyHolder::id_to_string().

bool CLemmatizer::ProcessHyphenWords ( CGraphmatFile piGraphmatFile  )  const

Member Data Documentation

string CLemmatizer::m_Registry [protected]

set<string> CLemmatizer::m_HyphenPostfixes [protected]

set<string> CLemmatizer::m_HyphenPrefixes [protected]

set<string> CLemmatizer::m_PrefixesSet [protected]

Referenced by CLemmatizer() and PredictByDataBase().

CLemmatizer::m_enc_int

Internal encoding (default "" = none).

Referenced by CLemmatizer() and initIconv().

CLemmatizer::m_enc_ext

External encoding (default "" = none).

Referenced by CLemmatizer() and initIconv().

CLemmatizer::m_ic_ext2int

iconv converter from the user (external) encoding to the morph-internal encoding.

Referenced by initIconv(), recode_ext2int(), and ~CLemmatizer().

CLemmatizer::m_ic_int2ext

iconv converter from the morph-internal encoding to the user (external) encoding.

Referenced by initIconv(), recode_int2ext(), and ~CLemmatizer().


The documentation for this class was generated from the following files: