Download Source Code Apache Tika 1.10
Apache Software Foundation is announcing that Apache Tika with version number 1.10 is already available to download.
What is Apache Tika ?
Apache Tika is An open source toolkit for parsing, analyzing, and extracting metadata and content from files, with support for a broad range of file types .
Apache Tika was developed as a low-level toolkit for searching content inside other files.Tika doesn’t do much on its own being a simple library, but it can be integrated in more powerful tools like search engines, digital asset management systems or CMSs to provide a fully-functional in-file search system.The library can access just the file’s header for quick overall file information, or it can go really deep and search even in the file’s body for various types of data, in text or binary format.A wide range of file types are supported and Tika can also be used with other programming languages thanks to a series of third-party bindings and wrappers.
This is changelog for Apache Tika version 1.10 :
- This release includes bug fixes and new features including a new Tesseract OCR Parser; a new GDAL Parser; more supported formats, and overall improvements in Tika stability.
You can read the complete changelog and also download this latest version on their homepage: tika.apache.org