Installation and configuration guide
This guide walks you through the complete setup process for the Intranet crawler, from initial configuration to enabling the tool for your assistants. Setup requires admin permissions in Intric and typically involves collaboration with your IT or intranet administration team.
For a high-level overview of capabilities and security, see the article below.
Prerequisites
Section titled âPrerequisitesâBefore beginning setup, ensure the following:
- You have Admin permissions in your Intric tenant
- The MCP module is enabled for your tenant
- You have identified which intranet content should be indexed
- You have created or identified a service account with appropriate permissions on your intranet
- Your intranet is accessible over HTTPS
- You understand your intranetâs authentication requirements
Step 1: Create the MCP server record
Section titled âStep 1: Create the MCP server recordâNavigate to the MCP servers section in the Intric admin interface.
Click Create and provide a descriptive name for the server, such as âCompany Intranet Crawlerâ or âEmployee Handbook Searchâ.
The server record is created. You can now proceed to configure its connection details.
Step 2: Configure the server URL and authentication
Section titled âStep 2: Configure the server URL and authenticationâEdit the newly created server and provide the following information:
Server URL
Section titled âServer URLâEnter the HTTPS endpoint of the Intranet crawler service. This URL will be provided by Intric during setup coordination.
Example: https://crawler.yourcompany.intric.app/mcp
Authentication
Section titled âAuthenticationâChoose API key as the authentication type.
Provide the API key that was generated during the crawler service provisioning. This key is used to authenticate Intricâs requests to the crawler and is encrypted at rest.
Security note: Store the API key securely. It will not be visible in the interface after saving, and only indicates whether a key is configured.
Description (optional)
Section titled âDescription (optional)âAdd a description to help other administrators understand the purpose and scope of this crawler, for example:
âIndexes the employee handbook, HR policies, and IT documentation from the internal intranet.â
Save the configuration. Intric will automatically discover the crawlerâs available capabilities: Search Intranet and Correct Intranet.
Step 3: Configure crawl targets
Section titled âStep 3: Configure crawl targetsâCrawl targets define which pages on your intranet the crawler should visit and index.
In the MCP server configuration, navigate to the Crawl targets section.
Add URLs for each section of your intranet you want to index. For each target, specify:
- URL â The starting point for the crawl (e.g.,
https://intranet.yourcompany.com/handbook) - Crawl depth (optional) â How many levels of links the crawler should follow from the starting URL
- URL patterns (optional) â Include or exclude specific URL patterns to focus the crawl
Example configuration:
URL: https://intranet.yourcompany.com/handbookDepth: 2Include patterns: /handbook/*, /policies/*Exclude patterns: /archive/*, /draft/*This configuration indexes the handbook section and all linked policy pages, while skipping archived and draft content.
Step 4: Configure authentication for the crawler
Section titled âStep 4: Configure authentication for the crawlerâThe crawler needs credentials to access your intranet. The specific configuration depends on your intranetâs authentication method.
For basic authentication or form-based login
Section titled âFor basic authentication or form-based loginâProvide the username and password for the service account.
Important: Specify which service account should be used and ensure it has:
- Access to all content you want indexed
- No access to content that should remain restricted
- A strong, unique password
- Appropriate audit logging enabled
For multi-step authentication
Section titled âFor multi-step authenticationâIf your intranet requires multi-step login (e.g., username/password followed by a security question), youâll need to work with Intricâs technical team to configure the authentication flow.
Provide:
- Step-by-step instructions for the login process
- Any security questions and answers
- Session timeout information
Note: Authentication flows with CAPTCHA or device-based two-factor authentication are not currently supported.
For SSO or OAuth
Section titled âFor SSO or OAuthâIf your intranet uses single sign-on or OAuth authentication, contact Intricâs support team for guidance on the appropriate configuration method.
Step 5: Configure document extraction
Section titled âStep 5: Configure document extractionâThe crawler can extract and index various document types found on your intranet.
In the Document extraction settings, specify:
- Supported file types â PDF, Word documents, Excel spreadsheets, etc.
- Maximum file size â Documents larger than this limit will be skipped
- Text extraction method â How the crawler should process different document types
Recommended settings:
Supported types: PDF, DOCX, XLSX, TXTMaximum size: 10 MBExtract text from: Body content, document metadata, embedded textStep 6: Set crawl schedule
Section titled âStep 6: Set crawl scheduleâConfigure how often the crawler should re-index your intranet content.
The default schedule is once per day (daily at 02:00 UTC), which is appropriate for most organizations. You can adjust this based on:
- How frequently your intranet content changes
- System load considerations
- The size of your intranet
Example schedules:
- Static content (handbooks, policies): Once per week
- Regularly updated content (news, announcements): Once per day
- Frequently changing content: Consider whether the crawler is the right tool
Rate limiting: The crawler includes built-in rate limiting to avoid overwhelming your intranet server. Review the default settings and adjust if needed based on your infrastructure.
Step 7: Test the configuration
Section titled âStep 7: Test the configurationâBefore enabling the crawler for production use, validate the configuration:
- Click Validate connection to verify the crawler can reach your intranet and authenticate successfully
- Review the connection status:
- OK â Configuration is working correctly
- Unauthorized â Check service account credentials
- Failed â Verify URL and network accessibility
- Trigger a manual test crawl to verify content extraction
- Review the crawl logs to ensure expected pages are being indexed
- Check for any authentication errors or permission issues
Step 8: Enable the tool for your tenant
Section titled âStep 8: Enable the tool for your tenantâOnce configuration is complete and tested:
- Navigate to the Tools section in the admin interface
- Find the Intranet crawler in the list of available tools
- Enable it for your tenant
- Configure security classification settings if required
The tool is now available for use in assistants.
Step 9: Attach the tool to assistants
Section titled âStep 9: Attach the tool to assistantsâTo make the Intranet crawler available to users:
- Open or create an assistant in a Space
- Navigate to the assistantâs Tools configuration
- Add the Intranet crawler tool
- Save the assistant
Users can now ask the assistant to search intranet content during conversations.
Managing corrections
Section titled âManaging correctionsâWhen users suggest corrections to indexed content:
- Corrections appear in the Intranet corrections admin panel
- Review each suggestion for accuracy and appropriateness
- Options for each correction:
- Accept â Apply the correction as submitted
- Modify â Edit the correction before applying
- Reject â Decline the correction with optional feedback
Accepted corrections are applied to the search index during the next crawl cycle.
Security best practices
Section titled âSecurity best practicesâService account configuration
Section titled âService account configurationâ- Create a dedicated service account specifically for the crawler
- Grant only the minimum necessary permissions
- Use a strong, unique password
- Enable audit logging for the service account
- Review service account access regularly
Content scoping
Section titled âContent scopingâ- Carefully consider which intranet sections to index
- Exclude sensitive HR data, financial information, or confidential projects
- Use URL patterns to restrict crawling to approved areas
- Regularly review indexed content for appropriateness
Access control
Section titled âAccess controlâ- Configure security classification rules for the Intranet crawler tool
- Limit which assistants have access to the crawler
- Consider creating specialized assistants for sensitive content areas
- Use Space permissions to control who can use intranet-enabled assistants
Troubleshooting
Section titled âTroubleshootingâ| Issue | Solution |
|---|---|
| âServer URL is not setâ | Edit the MCP server and provide a valid HTTPS URL |
| âUnauthorizedâ during validation | Verify service account credentials and permissions |
| No content appearing in search results | Check crawl logs for errors; verify crawl targets are correct |
| Authentication keeps failing | Review multi-step authentication configuration; check for session timeouts |
| Crawl is too slow | Reduce crawl depth or restrict URL patterns; adjust rate limiting |
| Incorrect content being indexed | Refine URL include/exclude patterns; check service account permissions |
| Corrections not appearing | Ensure Correct Intranet capability is enabled; check user permissions |
Getting help
Section titled âGetting helpâIf you encounter issues during setup or need assistance with complex authentication configurations, contact Intric support with:
- Your tenant name
- The MCP server name
- Screenshots of any error messages
- Crawl log excerpts (if applicable)
- Description of your intranetâs authentication method