NewsgroupDocument-class {tm}    R Documentation

Text document

Description

A class representing a newsgroup document together with additional information: the newsgroups it was posted to, the location of its file on disk, and whether it has been loaded into memory. Documents must be formatted as in the Newsgroup dataset from the UCI KDD archive.

Objects from the Class

Objects can be created by calls of the form new("NewsgroupDocument", ...).
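
As a minimal sketch of such a call, the following defines a stand-in class with the slots listed on this page and instantiates it with new(). The class name DemoNewsgroupDocument, the file path, and the slot values are assumptions made for illustration; the real class is defined by the tm package and also extends TextDocument, which is omitted here.

```r
library(methods)

# Hypothetical stand-in for tm's NewsgroupDocument (assumption, not tm code):
# a character vector carrying the three slots documented on this page.
setClass("DemoNewsgroupDocument",
         representation(Newsgroup = "character",
                        URI = "character",
                        Cached = "numeric"),
         contains = "character")

# The unnamed argument supplies the character data; Cached = 0 marks the
# document as not yet loaded. The path is a made-up example.
doc <- new("DemoNewsgroupDocument",
           character(0),
           Newsgroup = "sci.space",
           URI = "20_newsgroups/sci.space/example",
           Cached = 0)
```

With the tm package attached, the call would name "NewsgroupDocument" instead of the stand-in class.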

Slots

Newsgroup:
Object of class character containing the newsgroups to which the document was posted
URI:
Object of class character containing the path and filename of the file that holds the data on disk
Cached:
Object of class numeric indicating whether the file has already been loaded into memory. Zero indicates not loaded; a value greater than zero indicates that the data is in memory.
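
The interplay of the URI and Cached slots can be sketched as follows. This is an illustration under stated assumptions, not tm's implementation: the class DemoNewsgroupDocument and the helper loadIfNeeded are hypothetical, and the file is a temporary one with placeholder text.

```r
library(methods)

# Hypothetical stand-in class with the slots documented above.
setClass("DemoNewsgroupDocument",
         representation(Newsgroup = "character",
                        URI = "character",
                        Cached = "numeric"),
         contains = "character")

# Hypothetical helper (not part of tm): read the file named by URI only
# when Cached is zero, then mark the document as loaded.
loadIfNeeded <- function(doc) {
  if (doc@Cached == 0) {
    doc@.Data <- readLines(doc@URI)
    doc@Cached <- 1
  }
  doc
}

# Placeholder file standing in for a newsgroup message on disk.
path <- tempfile()
writeLines(c("first line", "second line", "third line"), path)

doc <- new("DemoNewsgroupDocument", character(0),
           Newsgroup = "sci.space", URI = path, Cached = 0)
doc <- loadIfNeeded(doc)  # reads the file and sets Cached to 1
doc <- loadIfNeeded(doc)  # no further read, since Cached is now nonzero
```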

Extends

Classes character and TextDocument, directly.

Methods

Corpus
signature(object = "NewsgroupDocument"): returns the text corpus, i.e., the actual character data slot
Corpus<-
signature(object = "NewsgroupDocument"): sets the text corpus, i.e., the actual character data slot
URI
signature(object = "NewsgroupDocument"): returns the filename on disk
Cached
signature(object = "NewsgroupDocument"): returns status information for loading on demand
Cached<-
signature(object = "NewsgroupDocument"): sets status information for loading on demand
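
How accessors with these signatures might be wired up in S4 is sketched below. The generics and methods are defined here on a hypothetical stand-in class (DemoNewsgroupDocument) purely for illustration; tm ships its own definitions, which may differ, and the path is a made-up example.

```r
library(methods)

# Hypothetical stand-in class with the slots documented on this page.
setClass("DemoNewsgroupDocument",
         representation(Newsgroup = "character",
                        URI = "character",
                        Cached = "numeric"),
         contains = "character")

# Illustrative generics mirroring the method names listed above.
setGeneric("URI", function(object) standardGeneric("URI"))
setGeneric("Cached", function(object) standardGeneric("Cached"))
setGeneric("Cached<-", function(object, value) standardGeneric("Cached<-"))

setMethod("URI", "DemoNewsgroupDocument",
          function(object) object@URI)
setMethod("Cached", "DemoNewsgroupDocument",
          function(object) object@Cached)
setReplaceMethod("Cached", "DemoNewsgroupDocument",
                 function(object, value) {
                   object@Cached <- value
                   object
                 })

doc <- new("DemoNewsgroupDocument", character(0),
           Newsgroup = "sci.space", URI = "some/path", Cached = 0)
Cached(doc) <- 1  # mark the document as loaded
```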

Author(s)

Ingo Feinerer

References

http://kdd.ics.uci.edu/databases/20newsgroups/20newsgroups.html


[Package tm version 0.1-1 Index]