org.pdfbox.util
Class PDFHighlighter
java.lang.Object
org.pdfbox.util.PDFStreamEngine
org.pdfbox.util.PDFTextStripper
org.pdfbox.util.PDFHighlighter
public class PDFHighlighter
- extends PDFTextStripper
Highlighting of words in a PDF document with an XML file.
- Version:
- $Revision: 1.6 $
- Author:
- slagraulet (slagraulet@cardiweb.com), Ben Litchfield (ben@csh.rit.edu)
- See Also:
-
Adobe Highlight File Format
Methods inherited from class org.pdfbox.util.PDFTextStripper |
endDocument, endParagraph, flushText, getCharactersByArticle, getCurrentPageNo, getEndBookmark, getEndPage, getLineSeparator, getOutput, getPageSeparator, getStartBookmark, getStartPage, getText, getText, getWordSeparator, processPage, processPages, setEndBookmark, setEndPage, setLineSeparator, setPageSeparator, setShouldSeparateByBeads, setSortByPosition, setStartBookmark, setStartPage, setSuppressDuplicateOverlappingText, setWordSeparator, shouldSeparateByBeads, shouldSortByPosition, shouldSuppressDuplicateOverlappingText, showCharacter, startDocument, startPage, startParagraph, writeCharacters, writeText, writeText |
Methods inherited from class org.pdfbox.util.PDFStreamEngine |
getColorSpaces, getCurrentPage, getFonts, getGraphicsStack, getGraphicsState, getGraphicsStates, getResources, getTextLineMatrix, getTextMatrix, getXObjects, processOperator, processOperator, processStream, processSubStream, setColorSpaces, setFonts, setGraphicsStack, setGraphicsState, setGraphicsStates, setTextLineMatrix, setTextMatrix, showString |
Methods inherited from class java.lang.Object |
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
PDFHighlighter
public PDFHighlighter()
throws IOException
- Default constructor.
- Throws:
IOException
- If there is an error constructing this class.
generateXMLHighlight
public void generateXMLHighlight(PDDocument pdDocument,
String highlightWord,
Writer xmlOutput)
throws IOException
- Generate an XML highlight string based on the PDF.
- Parameters:
pdDocument
- The PDF to find words in.highlightWord
- The word to search for.xmlOutput
- The resulting output xml file.
- Throws:
IOException
- If there is an error reading from the PDF, or writing to the XML.
generateXMLHighlight
public void generateXMLHighlight(PDDocument pdDocument,
String[] sWords,
Writer xmlOutput)
throws IOException
- Generate an XML highlight string based on the PDF.
- Parameters:
pdDocument
- The PDF to find words in.sWords
- The words to search for.xmlOutput
- The resulting output xml file.
- Throws:
IOException
- If there is an error reading from the PDF, or writing to the XML.
endPage
protected void endPage(PDPage pdPage)
throws IOException
- Description copied from class:
PDFTextStripper
- End a page. Default implementation is to do nothing. Subclasses
may provide additional information.
- Overrides:
endPage
in class PDFTextStripper
- Parameters:
pdPage
- The page we are about to process.
- Throws:
IOException
- If there is any error writing to the stream.- See Also:
PDFTextStripper.endPage( PDPage )
main
public static void main(String[] args)
throws IOException
- Command line application.
- Parameters:
args
- The command line arguments to the application.
- Throws:
IOException
- If there is an error generating the highlight file.
Copyright © 2006-2007 EGIZ - E-Government Innovationszentrum. All Rights Reserved.