top of page

Document Filters 

 

Unlock the value inside your content.

 

Document Filters technology is the engine that powers the Search software, identifying all of the unstructured information that resides in an organization — files, emails, presentations and hundreds of other types. So staff can parse, analyze, load and display the exact information they need quickly in high definition.

 

The powerful technology for Document Filters is the catalyst that drives content mining and intelligence gathering across a range of key business areas, such as data loss prevention, research, text analytics, content management (CM) and email archival.

 

​As an embeddable set of components that also are available to partners, Document Filters is the core search technology inside solutions from many global ISVs and SaaS vendors. It also helps drive content gathering and mining for ISV applications and Apache Lucene open-source deployments at many Fortune 500 companies and government customers.

 

  • Identify, index and search nearly 500 different document, email, legacy, archive and container formats — Word, Excel, PowerPoint, PDF, WordPerfect, ZIPs, MSGs, Visio and more.

  • Analyze all text and metadata in a file with deep-inspection capability that even uncovers previously hidden information, such as tracked changes, comments, notes, annotations and embedded web links.

  • Seamlessly view, render and manipulate content to a quality that is faithful to the original without the need for additional components like ActiveX.

  • Determine the true nature of content, ensuring that source information is accurately identified for filtering without relying on file-name extensions.

  • Deploy across multiple platforms including Windows, Mac OSX, Linux, Solaris, HP-UX and AIX — plus full support of character sets and encodings, such as Unicode.

  • Easily export content for further usage elsewhere by converting files into HTML and rendering embedded graphics as a JPEG or PNG image.

  • Offer superior operating speed, as Document Filters technology runs 2-3 times faster than competitor solutions.

bottom of page