Default crawled file name extensions and parsed file types in SharePoint Server
**Topic last modified:** 2018-03-08
Summary: Learn which file name extensions SharePoint Server crawls by default and which file types it parses by default.
The crawl component can crawl a file only if the list on the Manage File Types page includes the file name extension. The content processing component can parse the contents of a crawled file only when both of the following are true:
- It has a format handler that can parse the file format.
- It is enabled to use that format handler to parse files that have the file format and file name extension.
By default, SharePoint Server satisfies these requirements for many file types.
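The crawl and parse requirements above can be sketched as a small model. This is an illustrative Python sketch, not SharePoint code: the names `MANAGE_FILE_TYPES`, `ENABLED_PARSERS`, `can_crawl`, and `can_parse` are hypothetical, the sample data is a small subset of the defaults listed in this article, and matching extensions case-insensitively is an assumption.

```python
import os

# Hypothetical sample: extensions listed on the Manage File Types page.
MANAGE_FILE_TYPES = {".docx", ".html", ".txt"}

# Hypothetical sample: extensions that the content processing component is
# enabled to parse, mapped to the format ID of the matching format handler.
ENABLED_PARSERS = {
    ".docx": "docx",
    ".html": "html",
    ".css": "html",
    ".txt": "txt",
    ".rtf": "rtf",
}


def extension_of(file_name: str) -> str:
    """Return the file name extension in lowercase (assumption: case-insensitive)."""
    return os.path.splitext(file_name)[1].lower()


def can_crawl(file_name: str) -> bool:
    """A file is crawled only if its extension is on the Manage File Types list."""
    return extension_of(file_name) in MANAGE_FILE_TYPES


def can_parse(file_name: str) -> bool:
    """Only a crawled file with an enabled format handler can be parsed."""
    return can_crawl(file_name) and extension_of(file_name) in ENABLED_PARSERS


if __name__ == "__main__":
    for name in ("report.docx", "style.css", "notes.txt"):
        print(name, "crawl:", can_crawl(name), "parse:", can_parse(name))
```

In this model, `style.css` has an enabled format handler but is never parsed, because its extension is not on the crawl list.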
Default crawled file name extensions and parsed file formats
The following table shows all the file formats for which SharePoint Server has built-in format handlers. For each file format, the table shows one or more format IDs and file name extensions. By default, SharePoint Server is enabled to parse files that have these file formats and file name extensions. For each file name extension, the table also indicates whether the Manage File Types page includes the file name extension by default.
Note
SharePoint Online supports the same file name extensions as this table shows. SharePoint Online also supports the following:
- .one
- .xlt
- .xlc
- .xlb
| File format | Format ID | File name extension | File name extension listed on the Manage File Types page by default |
|---|---|---|---|
| Email message | eml | .eml | Yes |
| Email message | nws | .nws | Yes |
| HTML | html | .ascx | Yes |
| HTML | html | .asp | Yes |
| HTML | html | .aspx | Yes |
| HTML | html | .css | No |
| HTML | html | .hta | No |
| HTML | html | .htm | Yes |
| HTML | html | .html | Yes |
| HTML | html | .htw | No |
| HTML | html | .htx | No |
| HTML | html | .jhtml | No |
| HTML | html | .stm | No |
| MHTML document | mhtml | .mht | Yes |
| MHTML document | mhtml | .mhtml | Yes |
| Microsoft Excel | xlb | .xlb | No |
| Microsoft Excel | xlc | .xlc | No |
| Microsoft Excel | xls | .xls | Yes |
| Microsoft Excel | xlsb | .xlsb | Yes |
| Microsoft Excel | xlsm | .xlsm | Yes |
| Microsoft Excel | xlsx | .xlsx | Yes |
| Microsoft Excel | xlt | .xlt | No |
| Microsoft OneNote | one | .one | No |
| Microsoft PowerPoint | pot | .pot | No |
| Microsoft PowerPoint | ppa | .ppa | No |
| Microsoft PowerPoint | pps | .pps | No |
| Microsoft PowerPoint | ppt | .ppt | Yes |
| Microsoft PowerPoint | pptm | .pptm | Yes |
| Microsoft PowerPoint | pptx | .pptx | Yes |
| Microsoft Publisher | pub | .pub | Yes |
| Microsoft Word | doc | .doc | Yes |
| Microsoft Word | docm | .docm | Yes |
| Microsoft Word | docx | .docx | Yes |
| Microsoft Word | dot | .dot | Yes |
| Microsoft Word | dotx | .dotx | Yes |
| Microsoft XPS | xps | .xps | No |
| Open Document Chart | odc | .odc | Yes |
| Open Document Presentation | odp | .odp | Yes |
| Open Document Spreadsheet | ods | .ods | Yes |
| Open Document Text | odt | .odt | Yes |
| Outlook item | msg | .msg | Yes |
| Portable Document Format | pdf | .pdf | Yes |
| Rich Text Format | rtf | .rtf | No |
| Text | txt | .asm | Yes |
| Text | txt | .bat | No |
| Text | txt | .c | No |
| Text | txt | .cmd | No |
| Text | txt | .cpp | No |
| Text | txt | .csv | Yes |
| Text | txt | .cxx | Yes |
| Text | txt | .def | Yes |
| Text | txt | .h | No |
| Text | txt | .hpp | No |
| Text | txt | .lnk | No |
| Text | txt | .mpx | No |
| Text | txt | .php | No |
| Text | txt | .trf | No |
| Text | txt | .txt | Yes |
| Text | txt | .url | No |
| TIFF | tiff | .tif | No |
| TIFF | tiff | .tiff | No |
| Visio | vdw | .vdw | Yes |
| Visio | vdx | .vdx | Yes |
| Visio | vsd | .vsd | Yes |
| Visio | vsdm | .vsdm | Yes |
| Visio | vsdx | .vsdx | Yes |
| Visio | vss | .vss | Yes |
| Visio | vssm | .vssm | Yes |
| Visio | vssx | .vssx | Yes |
| Visio | vst | .vst | Yes |
| Visio | vstm | .vstm | Yes |
| Visio | vstx | .vstx | Yes |
| Visio | vsx | .vsx | Yes |
| Visio | vtx | .vtx | Yes |
| XML | xml | .jsp | Yes |
| XML | xml | .mspx | No |
| XML | xml | .rss | No |
| XML | xml | .xml | Yes |
| ZIP | zip | .zip | Yes |
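One practical use of the table is to find extensions that have a built-in format handler but are not crawled by default, because the Manage File Types page doesn't list them. The following Python sketch shows the idea on a handful of rows copied from the table. The names `FormatRow`, `SAMPLE_ROWS`, and `not_listed_by_default` are hypothetical and exist only for this illustration. The sketch doesn't call any SharePoint API.

```python
from typing import List, NamedTuple


class FormatRow(NamedTuple):
    """One row of the table: format, format ID, extension, listed by default."""
    file_format: str
    format_id: str
    extension: str
    listed_by_default: bool


# A small sample of rows from the table, not the full list.
SAMPLE_ROWS = [
    FormatRow("HTML", "html", ".htm", True),
    FormatRow("HTML", "html", ".css", False),
    FormatRow("Microsoft Excel", "xlsx", ".xlsx", True),
    FormatRow("Microsoft OneNote", "one", ".one", False),
    FormatRow("Rich Text Format", "rtf", ".rtf", False),
    FormatRow("TIFF", "tiff", ".tif", False),
    FormatRow("Text", "txt", ".txt", True),
]


def not_listed_by_default(rows: List[FormatRow]) -> List[str]:
    """Return extensions that can be parsed but are not on the crawl list by default."""
    return sorted(row.extension for row in rows if not row.listed_by_default)


if __name__ == "__main__":
    print(not_listed_by_default(SAMPLE_ROWS))
```

For these sample rows, the function returns `.css`, `.one`, `.rtf`, and `.tif`, which are the extensions marked "No" in the last column.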
NOTE: The following folder names are reserved, and search doesn't crawl folders that have these names:
AUX
COM1
COM2
COM3
COM4
COM5
COM6
COM7
COM8
COM9
CON
LPT1
LPT2
LPT3
LPT4
LPT5
LPT6
LPT7
LPT8
LPT9
NUL
PRN
CLOCK$
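To check in advance whether a folder path would be skipped because of a reserved name, a pre-check along the following lines might help. This is a minimal Python sketch under stated assumptions: the function name `has_reserved_folder` is hypothetical, the path is split on both `/` and `\`, and names are compared case-insensitively, which this article doesn't state.

```python
import re

# Reserved folder names from the list above.
RESERVED_FOLDER_NAMES = (
    {"AUX", "CON", "NUL", "PRN", "CLOCK$"}
    | {f"COM{i}" for i in range(1, 10)}
    | {f"LPT{i}" for i in range(1, 10)}
)


def has_reserved_folder(folder_path: str) -> bool:
    """Return True if any folder in the path has a reserved name.

    Assumption: the comparison is case-insensitive and every path
    component is treated as a folder name.
    """
    components = re.split(r"[\\/]+", folder_path)
    return any(part.upper() in RESERVED_FOLDER_NAMES for part in components)


if __name__ == "__main__":
    for path in ("sites/team/AUX/docs", "sites/team/Auxiliary", "shares\\com1"):
        print(path, has_reserved_folder(path))
```

The check matches whole folder names only, so a folder such as `Auxiliary` is not flagged.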