TET PDF IFilter

  • Rating:
  • Version: 4.0
  • Publisher:
    www.pdflib.com
  • File Size: 7.69 MB
  • Date: Jul 29, 2010
  • License: Free Trial Software
  • Category:
    PDF Software
    File & Disk
TET PDF IFilter Download
Free Download TET PDF IFilter 4.0

Enterprise PDF Search for Windows. TET PDF IFilter extracts text and metadata from PDF documents and makes it available to search and retrieval software on Windows. This allows PDF documents to be searched on the local desktop, a corporate server, or the Web. TET PDF IFilter is based on the patented PDFlib Text Extraction Toolkit (TET), which is a developer product for reliably extracting text from PDF documents.

TET PDF IFilter is a robust implementation of Microsoft's IFilter indexing interface. It works with all search and retrieval products which support the IFilter interface, e.g. SharePoint and SQL Server. Such products use format-specific filter programs - called IFilters - for particular file formats, e.g. HTML. TET PDF IFilter is such a program, aimed at PDF documents. The user interface for searching the documents may be the Windows Explorer, a Web or database frontend, a query script, or a custom application. As an alternative to interactive searches, queries can also be submitted programmatically without any user interface.

TET PDF IFilter offers the following advantages:
1. Supports Western text, Chinese, Japanese, and Korean (CJK) text and right-to-left languages such as Arabic and Hebrew
2. Indexes protected documents and extracts text even from PDFs where Acrobat fails
3. Supports Unicode folding, decomposition, and normalization
4. Deployment: thread-safe, fast and robust, 32- and 64-bit versions
5. Automatic script and language detection for improved search

TET PDF IFilter is available in fully thread-safe native 32- and 64-bit versions. You can implement enterprise PDF search solutions with TET PDF IFilter and the following products:
1. Microsoft Office SharePoint Server
2. Microsoft Search Server
3. Microsoft SQL Server
4. Microsoft Exchange Server
5. Mirosoft Site Server

TET PDF IFilter can be used with all other Microsoft and third-party products which support the IFilter interface.

Desktop PDF search
TET PDF IFilter can also be used to implement desktop PDF search, e.g. with the following products:
* Windows Search is integrated in Windows, Vista/7; also available as free add-on for Windows XP
* Windows Indexing Service

TET PDF IFilter is freely available for non-commercial desktop use, which provides a convenient basis for test and evaluation.

Accepted PDF input
TET PDF IFilter supports all relevant flavors of PDF input:
1. All PDF versions up to Acrobat 9, including ISO 32000-1
2. Protected PDFs which do not require a password for opening the document
3. Damaged PDF documents will be repaired

Unicode Postprocessing
TET PDF IFilter supports various Unicode postprocessing steps which can be used to improve the search results:
1. Foldings preserve, remove or replace characters, e.g. remove punctuation or characters from irrelevant scripts.
2. Decompositions replace a character with an equivalent sequence of one or more other characters, e.g. replace a Chinese character with its canonically equivalent Unicode character.
3. Text can be converted to all four Unicode normalization forms, e.g. emit NFC form to match the requirements of a database.

Internationalization
In addition to Western text TET PDF IFilter fully supports Chinese, Japanese, and Korean (CJK) text. All CJK encodings are recognized; horizontal and vertical writing modes are supported. Automatic detection of the locale ID (language and region identifier) of the text improves the results of Microsoft's word breaking and stemming algorithms, which is especially important for East Asian text.

Right-to-left languages such as Hebrew and Arabic are also supported. Contextual character forms are normalized and the text is delivered in logical order.

PDF contains more than just Pages
TET PDF IFilter treats PDF documents as containers which may contain much more information than only plain pages. TET PDF IFilter indexes all relevant items in PDF documents:
1. Page contents
2. Text in bookmarks
3. Metadata (see below)
4. Embedded PDFs and PDF packages/portfolios are processed recursively so that the text in all embedded PDF documents can be searched.

XMP Document Metadata and Document Info Entries
The advanced metadata implementation in TET PDF IFilter supports the Windows property system for metadata. It indexes XMP metadata as well as standard or custom document info entries. Metadata indexing can be configured on several levels:
1. Document info entries, Dublin Core fields and other common XMP properties are mapped to equivalent Windows properties, e.g. Title, Subject, Author.
2. TET PDF IFilter adds useful PDF-specific pseudo properties, e.g. page size, PDF/A conformance level, font names.
3. All relevant predefined XMP properties can be searched.
4. User-defined XMP properties can be searched, e.g. company-specific classification properties, PDF/A extension schemas.

TET PDF IFilter optionally integrates metadata in the full text index. As a result, even full text search engines without metadata support (e.g. SQL Server) can search for metadata.

Installing TET PDF IFilter
TET PDF IFilter is delivered as an MSI installer for Windows systems. All TET PDF IFilter packages contain a signed IFilter DLL plus support files, documentation, and samples. Running the MSI installer requires Administrator privileges. The installer will install
and register TET PDF IFilter. Additional steps for specific search environments (e.g. Windows Desktop Search, SharePoint) as well as custom configuration are discussed elsewhere in this manual.

32-bit and 64-bit versions. TET PDF IFilter is available for 32-bit and 64-bit platforms. Both versions are available in separate installers, and can be installed on the same system in parallel if required. The 64-bit version is a native 64-bit implementation which
works only with 64-bit executables. While the 64-bit installer will refuse to install on 32-bit systems, the 32-bit version works on both 32-bit and 64-bit systems.

The license of this software is Free Trial Software, you can free download and get a free trial.

More Details:
Related Software: