public class TaggedPdfReaderTool extends Object
Modifier and Type | Field | Description |
---|---|---|
protected PrintWriter | out | The writer object to which the XML will be written. |
protected PdfReader | reader | The reader object from which the content streams are read. |
Constructor | Description |
---|---|
TaggedPdfReaderTool() | |
Modifier and Type | Method | Description |
---|---|---|
void | convertToXml(PdfReader reader, OutputStream os) | Converts the structured content of a tagged PDF to XML and writes it to the output stream. |
void | convertToXml(PdfReader reader, OutputStream os, String charset) | Converts the structured content of a tagged PDF to XML and writes it to the output stream, encoded with the given charset. |
void | inspectChild(PdfObject k) | Inspects a child of a structured element. |
void | inspectChildArray(PdfArray k) | If the child of a structured element is an array, we need to loop over the elements. |
void | inspectChildDictionary(PdfDictionary k) | If the child of a structured element is a dictionary, we inspect the child; we may also draw a tag. |
void | inspectChildDictionary(PdfDictionary k, boolean inspectAttributes) | If the child of a structured element is a dictionary, we inspect the child; we may also draw a tag. |
void | parseTag(String tag, PdfObject object, PdfDictionary page) | Searches for a tag in a page. |
protected String | xmlName(PdfName name) | |
protected PdfReader reader
protected PrintWriter out
public void convertToXml(PdfReader reader, OutputStream os, String charset) throws IOException
reader - the PdfReader that has access to the PDF file
os - the OutputStream to which the resulting XML will be written
charset - the charset to encode the data
Throws: IOException
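The `os` and `charset` parameters, together with the `PrintWriter out` field, suggest that the output stream is wrapped in a character writer before any XML is written. The following is a minimal sketch of that plumbing using only the JDK. The class `CharsetWriterSketch` and its methods are hypothetical names introduced for illustration; the actual implementation inside `TaggedPdfReaderTool` may differ.

```java
import java.io.ByteArrayOutputStream;
import java.io.OutputStream;
import java.io.OutputStreamWriter;
import java.io.PrintWriter;
import java.nio.charset.Charset;

// Hypothetical stand-in, not part of the documented API:
// it shows only the stream/charset plumbing, no PDF handling.
class CharsetWriterSketch {

    // Wraps the stream in a PrintWriter that encodes text with the named charset.
    // Charset.forName throws an unchecked exception for an unknown charset name.
    static PrintWriter open(OutputStream os, String charset) {
        return new PrintWriter(new OutputStreamWriter(os, Charset.forName(charset)));
    }

    // Writes one XML element around the text and returns the encoded bytes.
    static byte[] encode(String text, String charset) {
        ByteArrayOutputStream os = new ByteArrayOutputStream();
        PrintWriter out = open(os, charset);
        out.print("<P>" + text + "</P>");
        out.flush(); // the writer buffers, so flush before reading the bytes
        return os.toByteArray();
    }

    public static void main(String[] args) {
        // "\u00e9" is a single accented character (e with acute accent)
        System.out.println(encode("\u00e9", "UTF-8").length);      // prints 9
        System.out.println(encode("\u00e9", "ISO-8859-1").length); // prints 8
    }
}
```

The same eight characters occupy a different number of bytes depending on the charset, which is why a caller that knows the target encoding would presumably prefer the three-argument overload over the two-argument one.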
public void convertToXml(PdfReader reader, OutputStream os) throws IOException
reader - the PdfReader that has access to the PDF file
os - the OutputStream to which the resulting XML will be written
Throws: IOException
public void inspectChild(PdfObject k) throws IOException
k - the child to inspect
Throws: IOException
public void inspectChildArray(PdfArray k) throws IOException
k - the child array to inspect
Throws: IOException
public void inspectChildDictionary(PdfDictionary k) throws IOException
k - the child dictionary to inspect
Throws: IOException
public void inspectChildDictionary(PdfDictionary k, boolean inspectAttributes) throws IOException
k - the child dictionary to inspect
Throws: IOException
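The three `inspectChild*` methods describe a recursive walk: a child is inspected, arrays are looped over, and dictionaries are inspected further and may produce a tag. The sketch below illustrates that pattern with plain JDK collections standing in for the PDF types. A `List` plays the role of `PdfArray` and a `Map` the role of `PdfDictionary`. The class `StructureWalkSketch`, the `element` helper, and the use of the keys `"S"` (tag name) and `"K"` (children) are assumptions made for this illustration, not the library's code.

```java
import java.util.Arrays;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

// Hypothetical stand-in for the traversal; it does not use or reproduce iText classes.
class StructureWalkSketch {
    private final StringBuilder out = new StringBuilder();

    // Dispatches on the runtime type of the child.
    void inspectChild(Object k) {
        if (k instanceof List) {
            inspectChildArray((List<?>) k);
        } else if (k instanceof Map) {
            inspectChildDictionary((Map<?, ?>) k);
        }
        // any other kind of child (including null) is ignored in this sketch
    }

    // An array child: loop over the elements and inspect each one.
    void inspectChildArray(List<?> k) {
        for (Object child : k) {
            inspectChild(child);
        }
    }

    // A dictionary child: emit a tag if one is present, then inspect its children.
    void inspectChildDictionary(Map<?, ?> k) {
        Object tag = k.get("S"); // assumed key for the tag name
        if (tag != null) out.append('<').append(tag).append('>');
        inspectChild(k.get("K")); // assumed key for the children
        if (tag != null) out.append("</").append(tag).append('>');
    }

    String result() {
        return out.toString();
    }

    // Builds a stand-in dictionary with a tag name and optional children.
    static Map<String, Object> element(String tag, Object kids) {
        Map<String, Object> d = new LinkedHashMap<>();
        d.put("S", tag);
        if (kids != null) d.put("K", kids);
        return d;
    }

    // Walks a small example tree and returns the generated markup.
    static String demo() {
        Object tree = element("Document", Arrays.asList(
                element("H1", null),
                element("P", element("Span", null))));
        StructureWalkSketch walker = new StructureWalkSketch();
        walker.inspectChild(tree);
        return walker.result();
    }

    public static void main(String[] args) {
        // prints <Document><H1></H1><P><Span></Span></P></Document>
        System.out.println(demo());
    }
}
```

Note that the `P` element holds a single dictionary as its child while `Document` holds an array, so both branches of the dispatch are exercised. The sketch leaves out the `inspectAttributes` flag, since the page does not document its effect.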
public void parseTag(String tag, PdfObject object, PdfDictionary page) throws IOException
tag - the name of the tag
object - an identifier to find the marked content
page - a page dictionary
Throws: IOException
Copyright © 2018. All rights reserved.