SharePoint 2013 Search Topologies Explained - Agnes Molnar
https://blog.syskit.com/sharepoint-2013-search-topologies-explained
References:
Office 365
How to Flag Items to be Crawled in Office 365 (2013 2016)
https://searchexplained.com/how-to-flag-items-to-be-crawled-in-office-365/
2016
Overview of search architecture in SharePoint Server
https://technet.microsoft.com/en-us/library/jj219738(v=office.16).aspx
2013
Search architectures for SharePoint Server 2013
http://www.microsoft.com/en-us/download/details.aspx?id=30374
Overview of search in SharePoint Server 2013
http://technet.microsoft.com/en-us/library/jj219738%28v=office.15%29.aspx
Configuring SharePoint 2013 Search Topology
http://blogs.technet.com/b/meamcs/archive/2013/04/09/configuring-sharepoint-2013-search-topology.aspx
SharePoint 2013: Scaling Out Enterprise Search Series
http://stevemannspath.blogspot.com/2013/06/sharepoint-2013-scaling-out-enterprise.html
More references … Thx Mike
SP2013: Understanding storage locations for files gathered by the Crawl Component
https://blogs.msdn.microsoft.com/sharepoint_strategery/2014/12/04/sp2013-understanding-storage-locations-for-files-gathered-by-the-crawl-component/
Search Crawler filling up drive, continuous crawls do nothing
https://sharepoint.stackexchange.com/questions/127943/search-crawler-filling-up-drive-continuous-crawls-do-nothing
Certain folders may have to be excluded from antivirus scanning when you use file-level antivirus software in SharePoint
https://support.microsoft.com/en-us/help/952167/certain-folders-may-have-to-be-excluded-from-antivirus-scanning-when-y
Managing the SP Gatherer’s temp files
https://blogs.technet.microsoft.com/searchguys/2012/09/18/managing-the-sp-gatherers-temp-files/
Problems with drive C running out of space after moving SharePoint Search Index to D drive
https://social.technet.microsoft.com/Forums/lync/en-US/6dfdbc41-48c3-48e6-a6d3-2a171fd49d27/problems-with-drive-c-running-out-of-space-after-moving-sharepoint-search-index-to-d-drive?forum=sharepointsearch
First, a quick high-level flow of what happens at crawl time:
1. As mssdmn.exe (the gatherer) goes out and fetches content, it stores the data in the TempPath defined at this registry key:
HKEY_LOCAL_MACHINE\SOFTWARE\Microsoft\Office Server\15.0\Search\Global\Gathering Manager\TempPath
2. The mssearch.exe process then performs a Windows rename/move operation to move the file to the path defined at this registry key:
(Your "guid-crawl-x" will differ.)
HKEY_LOCAL_MACHINE\SOFTWARE\Microsoft\Office Server\15.0\Search\Components\52ccea97-e96c-47ca-bcc4-737823da6e85-crawl-0\GathererDataPath
3. Once the Content Processing Component processes this data, mssearch.exe should then delete the file from this path.
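If step 3 fails or falls behind, files can pile up in the two folders above. Here is a minimal sketch for keeping an eye on that. It assumes you have already looked up the TempPath and GathererDataPath values in the registry and pass the folder in by hand. `folder_usage` is a hypothetical helper name, not part of SharePoint.

```python
import os


def folder_usage(path):
    """Return (file_count, total_bytes) for all files under path."""
    count = 0
    total = 0
    for root, _dirs, files in os.walk(path):
        for name in files:
            try:
                total += os.path.getsize(os.path.join(root, name))
                count += 1
            except OSError:
                # A file may be moved or deleted between listing and stat;
                # skip it rather than fail the whole scan.
                continue
    return count, total
```

Running it against each folder before and after a crawl gives a rough idea of whether the files are being cleaned up. A missing folder simply reports zero files.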
It is OK to use the following method to move the locations where the "gatherer" writes its data:
• $ssa.TempPath cmdlet has been removed
• To modify the application path that mssdmn\mssearch writes to, modify the following registry keys:
HKEY_LOCAL_MACHINE\SOFTWARE\Microsoft\Office Server\15.0\Search\Global\Gathering Manager\DefaultApplicationPath
HKEY_LOCAL_MACHINE\SOFTWARE\Microsoft\Office Server\15.0\Search\Components\{Both admin and crawl components}\LocalStoragePath
(Your "admin" component key will be a GUID folder without a suffix such as xxxxx-crawl-xx following it.)
• After that, the search service should be restarted:
net stop osearch15
net start osearch15
• The original folders should then be removed:
C:\Program Files\Microsoft Office Servers Debug\15.0\Data\Office Server\Applications\{both admin and crawl components}
• ONLY modify these keys, and only on the servers hosting the Crawl and Admin components.
Anthony Casillas
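The registry changes described above could be prepared as a .reg file for review before anything is imported. This is a rough sketch, not a tested procedure. It assumes both values are plain string (REG_SZ) values, which is worth confirming in regedit first. `build_reg_text` and the example target paths are hypothetical. The notes do not say what the new values should be, so each path is passed in explicitly.

```python
import re

SEARCH_ROOT = r"HKEY_LOCAL_MACHINE\SOFTWARE\Microsoft\Office Server\15.0\Search"

# Admin component key: a bare GUID. Crawl component key: "<guid>-crawl-<n>".
_GUID = r"[0-9a-fA-F]{8}(?:-[0-9a-fA-F]{4}){3}-[0-9a-fA-F]{12}"
_COMPONENT_KEY = re.compile(_GUID + r"(?:-crawl-\d+)?")


def _reg_string(value):
    """Escape backslashes and double quotes for a .reg string value."""
    return value.replace("\\", "\\\\").replace('"', '\\"')


def build_reg_text(default_application_path, local_storage_paths):
    """Build .reg text for DefaultApplicationPath and each LocalStoragePath.

    local_storage_paths maps a component key name (admin or crawl) to the
    new LocalStoragePath value for that component.
    """
    lines = [
        "Windows Registry Editor Version 5.00",
        "",
        "[" + SEARCH_ROOT + "\\Global\\Gathering Manager]",
        '"DefaultApplicationPath"="' + _reg_string(default_application_path) + '"',
        "",
    ]
    for key, path in local_storage_paths.items():
        # Refuse anything that is not an admin or crawl component key.
        if not _COMPONENT_KEY.fullmatch(key):
            raise ValueError("not an admin or crawl component key: " + key)
        lines.append("[" + SEARCH_ROOT + "\\Components\\" + key + "]")
        lines.append('"LocalStoragePath"="' + _reg_string(path) + '"')
        lines.append("")
    return "\r\n".join(lines)
```

The function only returns text, so nothing is written to the registry. Saving the file, importing it, restarting osearch15 and removing the old folders remain manual steps as listed above.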