Class XSLFPowerPointExtractorDecorator

java.lang.Object
org.apache.tika.parser.microsoft.ooxml.AbstractOOXMLExtractor
org.apache.tika.parser.microsoft.ooxml.XSLFPowerPointExtractorDecorator
All Implemented Interfaces:
OOXMLExtractor

public class XSLFPowerPointExtractorDecorator extends AbstractOOXMLExtractor
  • Constructor Details

    • XSLFPowerPointExtractorDecorator

      public XSLFPowerPointExtractorDecorator(org.apache.tika.metadata.Metadata metadata, org.apache.tika.parser.ParseContext context, org.apache.poi.xslf.extractor.XSLFExtractor extractor)
  • Method Details

    • buildXHTML

      protected void buildXHTML(org.apache.tika.sax.XHTMLContentHandler xhtml) throws SAXException, IOException
      Description copied from class: AbstractOOXMLExtractor
      Populates the XHTMLContentHandler object received as parameter.
      Specified by:
      buildXHTML in class AbstractOOXMLExtractor
      Throws:
      SAXException
      IOException
      See Also:
      • SlideShowExtractor.getText()
    • getMainDocumentParts

      protected List<org.apache.poi.openxml4j.opc.PackagePart> getMainDocumentParts() throws org.apache.tika.exception.TikaException
      In PowerPoint files, slides can have other parts embedded in them, and slide drawings hold the images, so these related parts are treated as main document parts alongside the slides.
      Specified by:
      getMainDocumentParts in class AbstractOOXMLExtractor
      Throws:
      org.apache.tika.exception.TikaException
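
The package layout that the getMainDocumentParts description refers to can be sketched with the JDK alone. The following is a minimal illustration, not Tika's or POI's implementation: the class SlidePartLister and its methods are hypothetical names, the sample archive is made-up demo data, and matching slides by the conventional ppt/slides/slideN.xml path is an assumption (real packages resolve slides through relationships). The _rels/<name>.rels location follows the Open Packaging Conventions.

```java
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.io.InputStream;
import java.nio.charset.StandardCharsets;
import java.util.ArrayList;
import java.util.List;
import java.util.regex.Pattern;
import java.util.zip.ZipEntry;
import java.util.zip.ZipInputStream;
import java.util.zip.ZipOutputStream;

// Hypothetical helper, for illustration only; not part of Tika or POI.
public class SlidePartLister {

    // Assumption: slides sit at their conventional path inside the package.
    private static final Pattern SLIDE = Pattern.compile("ppt/slides/slide\\d+\\.xml");

    /** Returns the names of entries that look like slide parts, in archive order. */
    public static List<String> listSlideParts(InputStream pptx) throws IOException {
        List<String> slides = new ArrayList<>();
        try (ZipInputStream zip = new ZipInputStream(pptx)) {
            ZipEntry entry;
            while ((entry = zip.getNextEntry()) != null) {
                if (SLIDE.matcher(entry.getName()).matches()) {
                    slides.add(entry.getName());
                }
            }
        }
        return slides;
    }

    /**
     * Name of the relationships part listing what a slide links to
     * (embedded objects, drawings, images). Expects a name containing '/'.
     */
    public static String relationshipsPartFor(String slidePartName) {
        int slash = slidePartName.lastIndexOf('/');
        return slidePartName.substring(0, slash) + "/_rels/"
                + slidePartName.substring(slash + 1) + ".rels";
    }

    /** Builds a tiny in-memory archive that mimics a PPTX layout (demo data only). */
    public static byte[] samplePackage() throws IOException {
        ByteArrayOutputStream bytes = new ByteArrayOutputStream();
        try (ZipOutputStream zip = new ZipOutputStream(bytes)) {
            for (String name : new String[] {
                    "ppt/presentation.xml",
                    "ppt/slides/slide1.xml",
                    "ppt/slides/_rels/slide1.xml.rels",
                    "ppt/slides/slide2.xml",
                    "ppt/media/image1.png"}) {
                zip.putNextEntry(new ZipEntry(name));
                zip.write("<x/>".getBytes(StandardCharsets.UTF_8));
                zip.closeEntry();
            }
        }
        return bytes.toByteArray();
    }

    public static void main(String[] args) throws IOException {
        // Print each slide part next to the relationships part that describes it.
        for (String slide : listSlideParts(new ByteArrayInputStream(samplePackage()))) {
            System.out.println(slide + " -> " + relationshipsPartFor(slide));
        }
    }
}
```

The sketch only lists part names. An extractor that needs the embedded content would go on to read each relationships part and follow its targets, which is the kind of traversal the description above alludes to.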