Class OutlookExtractor


  • public class OutlookExtractor
    extends Object
    Outlook Message Parser.
    • Constructor Summary

      Constructors 
      Constructor Description
      OutlookExtractor(org.apache.poi.poifs.filesystem.DirectoryNode root, org.apache.tika.metadata.Metadata metadata, org.apache.tika.parser.ParseContext context)
    • Method Summary

      Modifier and Type Method Description
      static void addEvenIfNull(org.apache.tika.metadata.Property property, String value, org.apache.tika.metadata.Metadata metadata)
      protected org.apache.tika.detect.Detector getDetector()
      static String getNormalizedMessageClass(String messageClass)
      protected String getPassword()
      Returns the password to use for this file, or null if no password or the default password should be used.
      protected org.apache.tika.config.TikaConfig getTikaConfig()
      protected void handleEmbeddedOfficeDoc(org.apache.poi.poifs.filesystem.DirectoryEntry dir, String resourceName, org.apache.tika.sax.XHTMLContentHandler xhtml, boolean outputHtml)
      Handles an Office document that is embedded at the POIFS level.
      protected void handleEmbeddedOfficeDoc(org.apache.poi.poifs.filesystem.DirectoryEntry dir, org.apache.tika.metadata.Metadata metadata, String resourceName, org.apache.tika.sax.XHTMLContentHandler xhtml, boolean outputHtml)
      Handles an Office document that is embedded at the POIFS level.
      protected void handleEmbeddedOfficeDoc(org.apache.poi.poifs.filesystem.DirectoryEntry dir, org.apache.tika.sax.XHTMLContentHandler xhtml, boolean outputHtml)
      Handles an Office document that is embedded at the POIFS level.
      protected void handleEmbeddedResource(org.apache.tika.io.TikaInputStream resource, String filename, String relationshipID, String mediaType, org.apache.tika.sax.XHTMLContentHandler xhtml, boolean outputHtml)
      protected void handleEmbeddedResource(org.apache.tika.io.TikaInputStream resource, String filename, String relationshipID, org.apache.poi.hpsf.ClassID storageClassID, String mediaType, org.apache.tika.sax.XHTMLContentHandler xhtml, boolean outputHtml)
      protected void handleEmbeddedResource(org.apache.tika.io.TikaInputStream resource, org.apache.tika.metadata.Metadata embeddedMetadata, String filename, String relationshipID, org.apache.poi.hpsf.ClassID storageClassID, String mediaType, org.apache.tika.sax.XHTMLContentHandler xhtml, boolean outputHtml)
      void parse(org.apache.tika.sax.XHTMLContentHandler xhtml)
      static String tryToGetMsgTitle(org.apache.poi.poifs.filesystem.DirectoryEntry node, String defaultVal)
    • Field Detail

      • parentMetadata

        protected final org.apache.tika.metadata.Metadata parentMetadata
      • context

        protected final org.apache.tika.parser.ParseContext context
    • Constructor Detail

      • OutlookExtractor

        public OutlookExtractor(org.apache.poi.poifs.filesystem.DirectoryNode root,
                                org.apache.tika.metadata.Metadata metadata,
                                org.apache.tika.parser.ParseContext context)
                         throws org.apache.tika.exception.TikaException
        Throws:
        org.apache.tika.exception.TikaException
    • Method Detail

      • addEvenIfNull

        public static void addEvenIfNull(org.apache.tika.metadata.Property property,
                                         String value,
                                         org.apache.tika.metadata.Metadata metadata)
      • getNormalizedMessageClass

        public static String getNormalizedMessageClass(String messageClass)
      • getTikaConfig

        protected org.apache.tika.config.TikaConfig getTikaConfig()
      • getDetector

        protected org.apache.tika.detect.Detector getDetector()
      • getPassword

        protected String getPassword()
        Returns the password to use for this file, or null if no password or the default password should be used.
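        The null return value is a convention the caller has to handle. The following is a minimal, library-free sketch of one way to do that. It assumes a fallback value is wanted when null is returned. The class name PasswordFallback, the method effectivePassword, and the constant HYPOTHETICAL_DEFAULT are illustrative only and are not part of Tika or POI.

```java
import java.util.Optional;

public class PasswordFallback {
    // Hypothetical placeholder for a format-specific default password.
    // This is NOT a Tika or POI constant; it exists only for illustration.
    static final String HYPOTHETICAL_DEFAULT = "default-password";

    // Follows the documented convention: a null input means
    // "no password / use the default", so the fallback is substituted.
    static String effectivePassword(String fromGetPassword) {
        return Optional.ofNullable(fromGetPassword).orElse(HYPOTHETICAL_DEFAULT);
    }

    public static void main(String[] args) {
        // A null result falls back to the placeholder default.
        System.out.println(effectivePassword(null));
        // A non-null result is used unchanged.
        System.out.println(effectivePassword("s3cret"));
    }
}
```

        Whether a fallback is appropriate, or whether null should simply mean "do not attempt decryption", depends on the file format being handled.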
      • handleEmbeddedResource

        protected void handleEmbeddedResource(org.apache.tika.io.TikaInputStream resource,
                                              String filename,
                                              String relationshipID,
                                              String mediaType,
                                              org.apache.tika.sax.XHTMLContentHandler xhtml,
                                              boolean outputHtml)
                                       throws IOException,
                                              SAXException,
                                              org.apache.tika.exception.TikaException
        Throws:
        IOException
        SAXException
        org.apache.tika.exception.TikaException
      • handleEmbeddedResource

        protected void handleEmbeddedResource(org.apache.tika.io.TikaInputStream resource,
                                              String filename,
                                              String relationshipID,
                                              org.apache.poi.hpsf.ClassID storageClassID,
                                              String mediaType,
                                              org.apache.tika.sax.XHTMLContentHandler xhtml,
                                              boolean outputHtml)
                                       throws IOException,
                                              SAXException,
                                              org.apache.tika.exception.TikaException
        Throws:
        IOException
        SAXException
        org.apache.tika.exception.TikaException
      • handleEmbeddedResource

        protected void handleEmbeddedResource(org.apache.tika.io.TikaInputStream resource,
                                              org.apache.tika.metadata.Metadata embeddedMetadata,
                                              String filename,
                                              String relationshipID,
                                              org.apache.poi.hpsf.ClassID storageClassID,
                                              String mediaType,
                                              org.apache.tika.sax.XHTMLContentHandler xhtml,
                                              boolean outputHtml)
                                       throws IOException,
                                              SAXException,
                                              org.apache.tika.exception.TikaException
        Throws:
        IOException
        SAXException
        org.apache.tika.exception.TikaException
      • handleEmbeddedOfficeDoc

        protected void handleEmbeddedOfficeDoc(org.apache.poi.poifs.filesystem.DirectoryEntry dir,
                                               org.apache.tika.sax.XHTMLContentHandler xhtml,
                                               boolean outputHtml)
                                        throws IOException,
                                               SAXException,
                                               org.apache.tika.exception.TikaException
        Handles an Office document that is embedded at the POIFS level.
        Throws:
        IOException
        SAXException
        org.apache.tika.exception.TikaException
      • handleEmbeddedOfficeDoc

        protected void handleEmbeddedOfficeDoc(org.apache.poi.poifs.filesystem.DirectoryEntry dir,
                                               String resourceName,
                                               org.apache.tika.sax.XHTMLContentHandler xhtml,
                                               boolean outputHtml)
                                        throws IOException,
                                               SAXException,
                                               org.apache.tika.exception.TikaException
        Handles an Office document that is embedded at the POIFS level.
        Throws:
        IOException
        SAXException
        org.apache.tika.exception.TikaException
      • handleEmbeddedOfficeDoc

        protected void handleEmbeddedOfficeDoc(org.apache.poi.poifs.filesystem.DirectoryEntry dir,
                                               org.apache.tika.metadata.Metadata metadata,
                                               String resourceName,
                                               org.apache.tika.sax.XHTMLContentHandler xhtml,
                                               boolean outputHtml)
                                        throws IOException,
                                               SAXException,
                                               org.apache.tika.exception.TikaException
        Handles an Office document that is embedded at the POIFS level.
        Throws:
        IOException
        SAXException
        org.apache.tika.exception.TikaException
      • tryToGetMsgTitle

        public static String tryToGetMsgTitle(org.apache.poi.poifs.filesystem.DirectoryEntry node,
                                              String defaultVal)