PHPKB knowledge base software is able to index text content of the files attached to knowledge articles in order to make them searchable. Attachment indexing is supported in all editions of PHPKB knowledge base software. Some document types can be searched without any additional tools, others need PHP modules enabled or third-party tools (plugins) installed. All these modules (plugins) and third-party tools are free.
File Type | Supported Format | Required Tool (Plug-in) |
MS Office 2003 Word Document | .DOC | AntiWord (free) is required |
MS Office 2003 Excel Workbook | .XLS | xlhtml (free) is required |
MS Office 2003 PowerPoint Presentation | .PPT | ppthtml (free) is required |
MS Office 2007 Word Document | .DOCX | PHP ZIP library is required |
MS Office 2007 Excel Workbook | .XLSX | PHP ZIP library is required |
MS Office 2007 PowerPoint Presentation | .PPTX | PHP ZIP library is required |
Adobe PDF Documents | pdftotext (free) is required | |
Plain-text Documents | .TXT, .HTM, .HTML, .CSV, .XML | No Plugin Required |
We strongly recommend you use the latest version of PHP. These plugins work under PHP 5.3+ correctly. Earlier versions of PHP have bugs and may freeze when launching external programs (e.g. attachment indexation plugins) using the Windows command line.
Download the latest PHP package for Windows (VC9 x86 Non Thread Safe is recommended).
PHP 5.3 doesn’t support ISAPI anymore. So you need to use FastCGI instead.
1. Enabling Required PHP Extensions (Modules) on Windows Server
You need to enable certain PHP modules in order to index the content of MS Office 2007 documents.
2. Attachment Indexing Plugins Installation on Windows Server
Please follow the instructions below to install/enable the attachment indexation plugins and required PHP extensions.
1. Enabling Required PHP Extensions (Modules) on Linux Server
2. Attachment Indexing Plugins Installation on Linux Server
If your system (RedHat / Fedora / CentOS) supports Yum Package Manager you can run this command instead to install necessary modules:
OR run the following command to install necessary modules on a system that has an APT library (e.g. Ubuntu, Debian):
How to Install the "xlhtml" and "ppthtml" Package on Ubuntu?
See below for quick step-by-step instructions of SSH commands, Copy/Paste to avoid miss-spelling or installing a different package by mistake.
1. Run the update command to update package repositories and get the latest package information.
2. Run the install command with -y flag to quickly install the xlhtml package and dependencies.
3. Run the install command with -y flag to quickly install the ppthtml package and dependencies.
4. Check the system logs to confirm that there are no related errors.
Note: -y flag means to assume yes and silently install, without asking you questions in most cases.
Note: In case ppthtml & xlhtml do not get installed using the above commands then you can search them here and install accordingly.
If you would like to auto-index the attached files as soon as they are uploaded, then there is another setting under the "File Upload Settings" section of the "Miscellaneous" tab. You can set the checkbox for "Index Attachments" as shown in the image below.
Now you can upload attachments and they will be automatically indexed for search.
You can also manually run indexation for existing file attachments whenever required from the "Tools" » "Index Attachments" section of the admin control panel as shown below.
PHPKB knowledge base software is able to index the text content of PDF documents and make them searchable. It converts a PDF file to text file format to search its contents. It uses "pdftotext utility" to convert Portable Document Format (PDF) files to plain text. It reads the PDF file and writes a text file thus making itself able to search within the contents of PDF documents.
It may be necessary to disable attachment indexing for specific file types if the conversion is unable to process the attachments due to size or other errors. When disabled, the conversion will be bypassed for only specified file types. Attachment indexing can also be disabled for all types if desired.
Article ID: 47
Created: February 12, 2013
Last Updated: March 17, 2021
Author: Ajay Chadha [support@phpkb.com]
Online URL: https://www.phpkb.com/kb/article/installation-of-the-attachment-indexing-plugins-47.html