org.textmining.extraction.word.model
Class CHPFormattedDiskPage
java.lang.Object
org.textmining.extraction.word.model.FormattedDiskPage
org.textmining.extraction.word.model.CHPFormattedDiskPage
public class CHPFormattedDiskPage
- extends FormattedDiskPage
Represents a CHP FKP (formatted disk page). The style properties for paragraphs
and character runs are stored in FKPs: PAP FKPs hold paragraph properties and
CHP FKPs hold character run properties. The first part of both CHP and PAP
FKPs is an array of 4-byte int offsets, each giving the text offset of a
paragraph or character run in the main stream. The ending offset of a run is
the next value in the array. For example, if an FKP stores X paragraphs, the
beginning array holds (X + 1) 4-byte ints. The number X is stored in the last
byte of the 512-byte FKP.
CHP and PAP FKPs also store the compressed styles (grpprls) that correspond to
the offsets at the front of the FKP. The location of the grpprls is determined
differently for CHP FKPs and PAP FKPs.
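The layout described above can be illustrated with a minimal sketch. It is not
this class's implementation: the class name FkpOffsetsSketch and its methods
are hypothetical, and the little-endian byte order is an assumption based on
the Word binary format rather than something stated on this page.

```java
import java.nio.ByteBuffer;
import java.nio.ByteOrder;

// Hypothetical helper, not part of org.textmining: reads the offset array
// that sits at the front of a 512-byte FKP.
class FkpOffsetsSketch {
    static final int FKP_SIZE = 512;

    // The run count X is the unsigned value of the last byte of the page.
    static int runCount(byte[] stream, int pageOffset) {
        return stream[pageOffset + FKP_SIZE - 1] & 0xFF;
    }

    // Reads the (X + 1) 4-byte offsets at the start of the page.
    // Run i covers the text range [offsets[i], offsets[i + 1]).
    static int[] readOffsets(byte[] stream, int pageOffset) {
        int count = runCount(stream, pageOffset);
        // Assumption: integers are stored little-endian.
        ByteBuffer buf = ByteBuffer.wrap(stream, pageOffset, FKP_SIZE)
                                   .order(ByteOrder.LITTLE_ENDIAN);
        int[] offsets = new int[count + 1];
        for (int i = 0; i <= count; i++) {
            offsets[i] = buf.getInt();
        }
        return offsets;
    }
}
```

The sketch stops at the offset array because the position of the grpprls
differs between CHP and PAP FKPs and is not specified on this page.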
- Author:
- Ryan Ackley
Methods inherited from class java.lang.Object
equals, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
CHPFormattedDiskPage
public CHPFormattedDiskPage()
CHPFormattedDiskPage
public CHPFormattedDiskPage(byte[] documentStream,
int offset,
int fcMin,
NodeHelper fc2Cp)
- Constructs a CHPFormattedDiskPage from a raw FKP (a 512-byte array
read from a Word file).
fill
public void fill(java.util.ArrayList filler,
int start)
fill
public void fill(java.util.List filler)
getCHPX
public CHPX getCHPX(int index)
getEndIndex
public int getEndIndex()
getOverflow
public java.util.ArrayList getOverflow()
size
public int size()
- Description copied from class:
FormattedDiskPage
- Used to get the total number of grpprls stored in this FKP
- Specified by:
size
in class FormattedDiskPage
- Returns:
- The number of grpprls in this FKP
toByteArray
public byte[] toByteArray(int fcMin)