Wednesday, March 14, 2018

Search Crawl and Index

image

SharePoint 2013 Search Topologies Explained - Agnes Molnar
https://blog.syskit.com/sharepoint-2013-search-topologies-explained

https://searchexplained.com/

References:

Office 365

How to Flag Items to be Crawled in Office 365 (2013 2016)
https://searchexplained.com/how-to-flag-items-to-be-crawled-in-office-365/

2016

Overview of search architecture in SharePoint Server
https://technet.microsoft.com/en-us/library/jj219738(v=office.16).aspx

2013

Search architectures for SharePoint Server 2013
http://www.microsoft.com/en-us/download/details.aspx?id=30374

Overview of search in SharePoint Server 2013
http://technet.microsoft.com/en-us/library/jj219738%28v=office.15%29.aspx

Configuring SharePoint 2013 Search Topology
http://blogs.technet.com/b/meamcs/archive/2013/04/09/configuring-sharepoint-2013-search-topology.aspx

SharePoint 2013: Scaling Out Enterprise Search Series
http://stevemannspath.blogspot.com/2013/06/sharepoint-2013-scaling-out-enterprise.html

More references … Thx Mike

SP2013: Understanding storage locations for files gathered by the Crawl Component
https://blogs.msdn.microsoft.com/sharepoint_strategery/2014/12/04/sp2013-understanding-storage-locations-for-files-gathered-by-the-crawl-component/

Search Crawler filling up drive, continuous crawls do nothing
https://sharepoint.stackexchange.com/questions/127943/search-crawler-filling-up-drive-continuous-crawls-do-nothing

Certain folders may have to be excluded from antivirus scanning when you use file-level antivirus software in SharePoint
https://support.microsoft.com/en-us/help/952167/certain-folders-may-have-to-be-excluded-from-antivirus-scanning-when-y

Managing the SP Gatherer’s temp files
https://blogs.technet.microsoft.com/searchguys/2012/09/18/managing-the-sp-gatherers-temp-files/

Problems with drive C running out of space after moving SharePoint Search Index to D drive
https://social.technet.microsoft.com/Forums/lync/en-US/6dfdbc41-48c3-48e6-a6d3-2a171fd49d27/problems-with-drive-c-running-out-of-space-after-moving-sharepoint-search-index-to-d-drive?forum=sharepointsearch

First is a quick high level flow of what happens at crawl time:

1. As the mssdmn.exe (gatherer) goes out and fetches content, it will store the data in the TempPath defined at this registry key:

HKEY_LOCAL_MACHINE\SOFTWARE\Microsoft\Office Server\15.0\Search\Global\Gathering Manager\TempPath

2. The mssearch.exe process will then perform a windows operation to Rename\Move the file to the path defined here at this registry key:

( Your "guid-crawl-x " will differ )

HKEY_LOCAL_MACHINE\SOFTWARE\Microsoft\Office Server\15.0\Search\Components\52ccea97-e96c-47ca-bcc4-737823da6e85-crawl-0\GathererDataPath

3. Once the Content Processing Component Processes this data, the mssearch should then delete the file from this path.

This is OK to use this method to move the locations where the "gatherer" writes it data to

$ssa.TempPath cmdlet has been removed
• To modify application path where mssdmn\mssearch writes to, the following registry keys should be modified:

HKEY_LOCAL_MACHINE\SOFTWARE\Microsoft\Office Server\15.0\Search\Global\Gathering Manager\DefaultApplicationPath

HKEY_LOCAL_MACHINE\SOFTWARE\Microsoft\Office Server\15.0\Search\Components\{Both admin and crawl components}\LocalStoragePath

( your "admin" component key will be a GUID folder without anything like, xxxxx-crawl-xx, following it )

• After that search service should be restarted

net stop osearch15

net start osearch15

• and original folders:

C:\Program Files\Microsoft Office Servers Debug\15.0\Data\Office Server\Applications\{both admin and crawl components}

• should be removed
• ONLY modify these keys and on the Crawl Servers and Admin components..


Anthony Casillas