Class Document


  • public class Document
    extends Object
    • Constructor Detail

      • Document

        public Document()
    • Method Detail

      • getTextFromDocument

        public static String getTextFromDocument​(byte[] document)
        Convert to text plain a lot of kind of document (like odt, ods, odp, doc(x), xls(x), ppt(x), pdf, mp3, mp4, etc.) with Apache Tika
        Parameters:
        document - binary representation of the document
        Returns:
        text from document without format