| Tanl Linguistic Pipeline |
Read a plain text file, split into tokens and return one sentence at a time. More...
#include <SplitSentenceReader.h>
Classes | |
| struct | BoundarySet |
| Parameters defininig where to split a sentence. More... | |
Public Member Functions | |
| SplitSentenceReader (std::istream *is, Corpus *corpus=0, BoundarySet const *bs=&defaultBS) | |
Creates a new SplitSentenceReader. | |
| bool | MoveNext () |
| Return true if there is another sentence available. | |
| void | reset () |
| Restart. | |
Static Public Attributes | |
| static BoundarySet const | defaultBS |
Read a plain text file, split into tokens and return one sentence at a time.
| Tanl::SplitSentenceReader::SplitSentenceReader | ( | std::istream * | is, | |
| Corpus * | corpus = 0, |
|||
| BoundarySet const * | bs = &defaultBS | |||
| ) |
Creates a new SplitSentenceReader.
| is | the stream containing the sentence to read. | |
| lang | the language of the document. |
| bool Tanl::SplitSentenceReader::MoveNext | ( | ) | [virtual] |
Return true if there is another sentence available.
Reimplemented from Tanl::SentenceReader.
References Parser::Tokenizer::Current(), Parser::Tokenizer::MoveNext(), and Tanl::SentenceReader::sentence.