Processing File Inclusions and Exclusions
Insight is a single processing, culling, review and image creation platform. We receive a wide range of file types for processing and loading into Insight. Many are not appropriate for indexing or review for a variety of reasons. Some are system and program files, others are utility files and yet others have no viewable content.
To protect our system and to keep review costs to a minimum, we regularly exclude NIST (program) files and a number of other file types from processing. The NIST list itself is extensive (over 50 million known files).
For processing purposes, we use Lexmark Document Filters.
Additionally, below is more information about the files we include or exclude during the inventory phase of processing.
System and Program Files
ADE - Microsoft Access Project Extension
ADP - Microsoft Access Project
BAS - Visual Basic Class Module
BAT - Batch File
CHM - Compiled
HTML Help File
CMD - Windows NT Command Script
COM - MS-DOS Application
CPL - Control Panel Extension
CRT - Security Certificate
DLL - Dynamic Link Library
EXE - Application
HLP - Windows Help File
HTA - HTML Applications
INF - Setup Information File
INS - Internet Communication Settings
ISP - Internet Communication Settings
JS - JScript File
JSE - JScript Encoded Script File
LNK - Shortcut
MSI - Windows Installer Package
MSP - Windows Installer Patch
MST - Visual Test Source File
OCX - ActiveX Objects
PCD - Photo CD Image
PIF - Shortcut to MS-DOS Program
REG - Registration Entries
SCR - Screen Saver
SCT - Windows Script Component
SHB - Document Shortcut File
SHS - Shell Scrap Object
SYS - System Config/Driver
URL - Internet Shortcut (Uniform Resource Locator)
VB - VBScript File
VBE - VBScript Encoded Script File
VBS - VBScript Script File
WSC - Windows Script Component
WSF - Windows Script File
WSH - Windows Scripting Host Settings File
Non Reviewable Files
CAB – MS Windows Cabinet Archive
JAR – Java Archives
CFS – Software Distribution Container Archive File Format
LDF – Database File
LIB – Library Files
MDF – Database File
TMP – Temporary Files
If you want to exclude additional file types, or do not want to exclude any of the listed file types, contact your Project Consultant.
Recommended Inclusions
There are situations where it may be appropriate to target only common user file types and disregard all other files types. In this case an inclusion list—rather than an exclusion list—may be appropriate for the project.
If you choose this option, only file types on the inclusion list will be processed and loaded to the site. All other file types will be excluded. The file inclusion list can be restrictive. We recommend talking to your Project Consultant before finalizing an inclusion list for processing. Here are our recommended file types for an inclusion list:
Word Processing Files: doc, docm, docx, dot, dotm, dotx, rtf, wpd
SpreadSheets:csv, dbf, dif, xla, xlam, xls, xlsb, xlsm, xlsx, xlt, xltm, xltx, xlw, wks, wk1, wk2, wk3, wk4, qpw
Presentations:pot, potm, potx, ppa, ppam, pps, ppsm, ppsx, ppt, pptm, pptx,
Emails: pst, eml, msg, ost, nsf, mbx
Others: pdf, txt, mdb, zip, rar, tiff, jpg, mp3, m4a, wav, mov, wmv, avi
These recommendations are based on standard practices. Your case requirements may vary; use this list only as a starting recommendation.