This guide provides an overview of the data connectivity options providers can enable in Microsoft Sentinel for customers, with a specific focus on the build, validation, and publishing process. The document also covers technical details on opportunities to enable richer Microsoft Sentinel experiences.
- Choose the data connector type – The provider decides on the connector type to build, depending on the planned customer experience and the existing ingestion mechanism(s) supported by the provider data source.
- Send data to Microsoft Sentinel – The provider follows the specific steps for the applicable data connector to establish the pipeline setup as a POC, then validates and sees the data flow in Microsoft Sentinel.
- Build the connector – The provider builds the connector using templates and guidance, then validates and submits the data connector with query samples and documentation.
- Ship the data connector in a solution – The provider follows the guidance to build and publish a solution that ships the data connector. This step enables customers to discover the data connector for the provider product in the Microsoft Sentinel content hub and Azure Marketplace.
Providers can build workbooks, analytic rule templates, hunting queries, investigation graph queries, Logic Apps connectors, playbooks, and more for an enhanced customer experience while using the provider's data connector in Microsoft Sentinel, as illustrated below. Refer to the Evolve the data experience section for details.
There are several types of data connectors providers can build to stream their data into Microsoft Sentinel.
The following table lists them and provides a high-level overview to help providers decide.
| Microsoft Sentinel Log Ingestion Format | Customer Experience | Why choose? |
|---|---|---|
| REST API (Push to Microsoft Sentinel) |  | When you have data that does not conform to CEF or raw Syslog formats, you can create custom tables. Choose this when you want strict control over the schema mapping and column names in Microsoft Sentinel tables and how you present your data. |
| Codeless Connector Platform (CCP) (Preview) / Native Microsoft Sentinel Polling |  | Cloud-native approach to integrate with Microsoft Sentinel with maximum customer benefits. Connectors created using CCP are fully SaaS, without any requirements for service installations, and also include health monitoring and full support from Microsoft Sentinel. CCP is currently in public preview, so reach out to [email protected] with any feedback you have while integrating with CCP. |
| REST API Polling (provider API using Azure Functions) |  | Approach to connect to provider APIs using Azure Functions to ingest data into Microsoft Sentinel. CCP is preferred over this approach; use this as an alternative if you run into issues integrating with CCP. |
| CEF |  | CEF data appears as the known CEF (CommonSecurityLog) schema columns in the Microsoft Sentinel log tables. |
| Syslog (Least preferred) |  | You can only emit raw Syslog at this point. |
Once you have decided on the type of data connector you plan to support, set up the pipeline to send this data to Microsoft Sentinel as a POC before building the connector. The process is described below for each data connector type. Once you have a POC, send an email to [email protected] for the POC demo.
- Use the Azure Monitor Data Collector API to send data to Azure Log Analytics. This blog covers step-by-step instructions with screenshots. If on-premises, open port 443 (HTTPS/TLS) on your environment to talk to Microsoft Sentinel. A minimal signing sketch appears after this list.
- Ensure the schema used for structuring the data in Log Analytics is locked. Any changes to the schema after the data connector is published will have a compatibility impact and hence require a new name for the connector data type.
- Design a configuration mechanism in your product experience, via product settings or via your product website, where your customers can enter the following information to send their logs into Log Analytics for Microsoft Sentinel:
- [Required] Microsoft Sentinel workspace ID
- [Required] Microsoft Sentinel primary key
- [Optional] Custom log name
- Any other specific dependency that may be needed to successfully establish connectivity
- These logs will appear in a custom Log Analytics table under Custom Logs -> <log name>, where the log name is what the customer provides in the above-mentioned step. Identify a default log name to handle the scenario where the customer does not enter a custom log name.
- Design and validate a few key queries that demonstrate the value of the data stream using Kusto Query Language. Share these as sample queries in the data connector.
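For a concrete picture of the push flow, the following is a minimal Python sketch of the Data Collector API signing and POST described above. The workspace ID, key, log type, and record fields are placeholders; the linked blog and the Data Collector API documentation remain the authoritative reference.

```python
import base64
import datetime
import hashlib
import hmac
import json

import requests

# Placeholders - customers supply these via your product's configuration UI
WORKSPACE_ID = "<Microsoft Sentinel workspace ID>"
SHARED_KEY = "<Microsoft Sentinel primary key>"
LOG_TYPE = "MyProduct"  # becomes the MyProduct_CL custom table

def build_signature(date: str, content_length: int) -> str:
    """Build the SharedKey authorization header for the Data Collector API."""
    string_to_hash = f"POST\n{content_length}\napplication/json\nx-ms-date:{date}\n/api/logs"
    decoded_key = base64.b64decode(SHARED_KEY)
    encoded_hash = base64.b64encode(
        hmac.new(decoded_key, string_to_hash.encode("utf-8"), hashlib.sha256).digest()
    ).decode()
    return f"SharedKey {WORKSPACE_ID}:{encoded_hash}"

def post_data(records: list) -> None:
    """POST a batch of records to the Log Analytics HTTP Data Collector API."""
    body = json.dumps(records)
    rfc1123_date = datetime.datetime.utcnow().strftime("%a, %d %b %Y %H:%M:%S GMT")
    headers = {
        "Content-Type": "application/json",
        "Authorization": build_signature(rfc1123_date, len(body)),
        "Log-Type": LOG_TYPE,         # custom log name entered by the customer
        "x-ms-date": rfc1123_date,
    }
    uri = f"https://{WORKSPACE_ID}.ods.opinsights.azure.com/api/logs?api-version=2016-04-01"
    response = requests.post(uri, data=body, headers=headers)
    response.raise_for_status()

# Hypothetical record shape for illustration only
post_data([{"SourceIp": "203.0.113.10", "EventType": "SignIn", "Result": "Success"}])
```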
Example connectors to refer to: Symantec, Barracuda WAF
Connector Validation Steps
- Test the actual customer experience and validate that data flows as expected and appears in the expected Microsoft Sentinel Log Analytics custom table.
- If on-premises, open port 443 (HTTPS/TLS) on your environment to talk to Microsoft Sentinel. Ensure this is documented in the connector documentation (steps in the following section) for your customers.
- From a data quality perspective,
- Ensure the data you send is complete and contains the same fields available in your product.
- Ensure the data is valid and easy to query using Log Analytics. A sample validation query follows these steps.
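For instance, a simple value-demonstrating query against a hypothetical custom table (note the _CL table suffix and _s string-column suffix that Log Analytics appends to custom logs) could be:

```
// Daily event volume by type for a hypothetical MyProduct_CL custom table
MyProduct_CL
| where TimeGenerated > ago(24h)
| summarize EventCount = count() by EventType_s
| order by EventCount desc
```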
- Follow the documentation to create a data connector using CCP. Use the template (ARM template) in step 2 of the Build the connector section below for a quick start. Download it and update it to integrate with your API endpoint (an abbreviated sketch follows this list).
- Ensure the schema used for structuring the data in Log Analytics is locked. Any changes to the schema after the data connector is published will have a compatibility impact and hence require a new name for the connector data type.
- Use the ARM deploy mechanism to upload the ARM template from the previous step for testing.
- Pass the parameters in the configuration setting and establish connectivity.
- These logs will appear in a custom Log Analytics table under Custom Logs -> <log name>, where the log name is the data type name in the template.
- Design and validate a few key queries that demonstrate the value of the data stream using Kusto Query Language. Share these as sample queries in the data connector.
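For orientation, here is an abbreviated, hypothetical sketch of the polling section of such an ARM template. The field names reflect the public preview (APIPolling) schema at the time of writing, and the endpoint and auth values are placeholders; always download the template referenced above for the authoritative structure.

```json
{
  "kind": "APIPolling",
  "properties": {
    "connectorUiConfig": { "title": "My Appliance (Preview)", "publisher": "My Provider" },
    "pollingConfig": {
      "auth": {
        "authType": "APIKey",
        "apiKeyName": "Authorization"
      },
      "request": {
        "apiEndpoint": "https://api.example.com/v1/events",
        "httpMethod": "Get",
        "queryTimeFormat": "yyyy-MM-ddTHH:mm:ssZ",
        "queryWindowInMin": 5,
        "rateLimitQps": 10
      },
      "paging": {
        "pagingType": "LinkHeader"
      },
      "response": {
        "eventsJsonPaths": ["$.value"]
      }
    }
  }
}
```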
Example connectors to refer to: GitHub, LastPass (check the Content hub for the solutions that include these data connectors)
Connector Validation Steps
- Test the actual customer experience and validate that data flows as expected and appears in the expected Microsoft Sentinel Log Analytics custom table.
- From a data quality perspective,
- Ensure the data you send is complete and contains the same fields available in your product.
- Ensure the data is valid and easy to query using Log Analytics.
- Ensure the schema aligns with the ASIM schema as much as possible. Fields that cannot be mapped can land as-is for complete product integration value to customers.
To enable the CEF connector, deploy a dedicated proxy Linux machine (VM or on-premises) to support the communication between your security solution (the product that sends the CEF messages) and Microsoft Sentinel.
Enable the CEF connector as follows:
- Go to Microsoft Sentinel
- Open the Data Connectors page, choose the relevant connector, and click Open connector page
- Follow the CEF instructions below (also in the CEF connector documentation).
1. Install and configure Linux Syslog agent
Install and configure the Linux agent to collect your Common Event Format (CEF) Syslog messages and forward them to Microsoft Sentinel.
1.1 Select a Linux machine
Select or create a Linux machine that Microsoft Sentinel will use as the proxy between your security solution and Microsoft Sentinel. This machine can be in your on-premises environment, Azure, or another cloud.
1.2 Install the CEF collector on the Linux machine
Install the Microsoft Monitoring Agent on your Linux machine and configure the machine to listen on the necessary port and forward messages to your Microsoft Sentinel workspace.
Note:
- Make sure that you have Python on your machine, which you can verify using the following command:
python --version
- You must have elevated permissions (sudo) on your machine.

Run the following command to install and apply the CEF collector:

sudo wget https://raw.githubusercontent.com/Azure/Azure-Sentinel/master/DataConnectors/CEF/cef_installer.py && sudo python cef_installer.py [WorkspaceID] [Workspace Primary Key]
2. Forward Common Event Format (CEF) logs to Syslog agent
2.1 Set your security solution to send Syslog messages in CEF format to the proxy machine. This varies from product to product; follow the process for your product. There are a couple of ways to push your logs:
- The agent can collect logs from multiple sources but must be installed on a dedicated machine, per the following diagram.
- Alternatively, you can deploy the agent manually on an existing Azure VM, on a VM in another cloud, or on an on-premises machine, as shown in the diagram below.
2.2 Make sure to send the logs to port 514 TCP on the machine's IP address.
2.3 Outline the specific steps for sending your product logs, along with a link to your (partner) product documentation on how customers should configure their agent to send CEF logs from the respective product into Microsoft Sentinel.
Example connectors to refer to: Zscaler
Connector Validation Steps
Follow the instructions to validate your connectivity:
- Open Log Analytics to check if the logs are received using the CommonSecurityLog schema. Note: It may take about 20 minutes until the connection streams data to your workspace.
- If the logs are not received, run the following connectivity validation script. Note:
  - Make sure that you have Python on your machine, which you can verify using the following command:
  python --version
  - You must have elevated permissions (sudo) on your machine.
  sudo wget https://raw.githubusercontent.com/Azure/Azure-Sentinel/master/DataConnectors/CEF/cef_troubleshoot.py && sudo python cef_troubleshoot.py [WorkspaceID]
- From a data quality perspective,
  - Ensure the data you send is complete and contains the same fields available in your product.
  - Ensure the data is valid and easy to query using Log Analytics.
- Design and validate a few key queries that demonstrate the value of the data stream using Kusto Query Language. Share these as sample queries in the data connector. A sample validation query follows.
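For example, a quick KQL check might look like the following; the DeviceVendor and DeviceProduct values are placeholders for your own product's values.

```
// Confirm CEF events from your product are arriving in CommonSecurityLog
CommonSecurityLog
| where TimeGenerated > ago(1h)
| where DeviceVendor == "MyVendor" and DeviceProduct == "MyProduct"
| take 10
```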
To use TLS communication between the security solution and the Syslog machine, you will need to configure the Syslog daemon (rsyslog or syslog-ng) to communicate in TLS: see Encrypting Syslog Traffic with TLS - rsyslog and Encrypting log messages with TLS - syslog-ng. A minimal rsyslog excerpt follows.
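As an illustration, a client-side rsyslog configuration for TLS forwarding typically looks like the excerpt below (legacy directive syntax; the CA path, host, and port are placeholders, and the rsyslog guide linked above remains the authoritative walkthrough):

```
# /etc/rsyslog.conf excerpt: forward all messages over TCP with TLS
$DefaultNetstreamDriverCAFile /etc/ssl/certs/ca.pem   # CA certificate that signed the server cert
$DefaultNetstreamDriver gtls                          # use the GnuTLS network stream driver
$ActionSendStreamDriverMode 1                         # require TLS for the forwarding action
$ActionSendStreamDriverAuthMode anon                  # or x509/name for certificate-based peer auth
*.* @@(o)syslog-server.example.com:6514               # @@ = TCP; (o) = octet-counted framing
```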
Note: If your product supports CEF, the integration is more complete, and you should choose CEF: follow the instructions in Connecting data from CEF and the data connector building steps detailed in the CEF connector section.
- Follow the steps outlined in Connecting data from Syslog to use the Microsoft Sentinel Syslog connector to connect your product.
- Set your security solution to send Syslog messages to the proxy machine. This varies from product to product; follow the process for your product.
- Outline the specific steps for sending your product logs, along with a link to your (partner) product documentation on how customers should configure their agent to send Syslog logs from the respective product into Microsoft Sentinel.
- Design and validate a few key queries that demonstrate the value of the data stream using Kusto Query Language. Share these as sample queries in the data connector.
- Build a parser based on a Kusto function to ensure an easy query-building experience for customers working with the data connector (a minimal sketch follows this list).
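As a minimal sketch, assuming a hypothetical product that writes key=value pairs into the Syslog message, such a parser could look like the following. Saving this query as a Kusto function (for example, MyProductEvent) lets customers query the friendly name instead of raw Syslog.

```
// Hypothetical parser: extract fields from key=value formatted Syslog messages
Syslog
| where ProcessName == "myproduct"
| extend SrcIp = extract(@"src=(\S+)", 1, SyslogMessage),
         Action = extract(@"action=(\S+)", 1, SyslogMessage)
| project TimeGenerated, Computer, SrcIp, Action, SyslogMessage
```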
Example connectors to refer to: Barracuda CWF
Connector Validation Steps
Follow the instructions to validate your connectivity:
- Open Log Analytics to check if the logs are received using the Syslog schema. Note: It may take about 20 minutes until the connection streams data to your workspace.
- From a data quality perspective,
- Ensure the data you send is complete and contains the same fields available in your product.
- Ensure the data is valid and easy to query using Log Analytics.
Once you have a working POC, you are ready to build and validate the data connector user experience, and then submit your connector and relevant documentation.
- Review the data connector template guidance – This helps you get familiar with the nomenclature used in the templates and makes filling out the json template easier.
- Use the template – Download the right template for your data connector type from the following, rename the json file to 'ProviderNameApplianceName.json' (no spaces in the name), and fill out the template per the guidance mentioned above.
- Validate the connector UX – Follow these steps to render and validate the connector UX you just built (an abbreviated json skeleton follows these steps):
  - The test utility can be accessed by this URL - https://aka.ms/sentineldataconnectorvalidateurl
  - Go to Microsoft Sentinel -> Data Connectors
  - Click the "import" button and select the json file you created. To validate a codeless connector with this tool, be sure to only upload a json file containing the connectorUiConfig collection.
  - The json file you just created is loaded.
  - Validate the connector UX by ensuring all links resolve appropriately with no errors (including query links) in both the main and 'next steps' pages; check for content accuracy, grammar, typos, and formatting. Update the json as needed and reload to revalidate.
  Note: This json is loaded only in your session and not shared out. The logo won't show up since it's not part of the json. The connector logo will be included when Microsoft builds and deploys the data connector.
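To make the template shape concrete, here is an abbreviated, hypothetical skeleton of a connector json. Names like MyProviderMyAppliance and MyProduct_CL are placeholders, and required sections such as permissions and availability are omitted for brevity; follow the template guidance above for the full field list.

```json
{
  "id": "MyProviderMyAppliance",
  "title": "My Appliance",
  "publisher": "My Provider",
  "descriptionMarkdown": "The My Appliance connector ingests events into the MyProduct_CL table.",
  "graphQueries": [
    {
      "metricName": "Total data received",
      "legend": "MyProduct_CL",
      "baseQuery": "MyProduct_CL"
    }
  ],
  "sampleQueries": [
    {
      "description": "All events, most recent first",
      "query": "MyProduct_CL\n| sort by TimeGenerated desc"
    }
  ],
  "dataTypes": [
    {
      "name": "MyProduct_CL",
      "lastDataReceivedQuery": "MyProduct_CL\n| summarize Time = max(TimeGenerated)\n| where isnotempty(Time)"
    }
  ],
  "connectivityCriterias": [
    {
      "type": "IsConnectedQuery",
      "value": ["MyProduct_CL\n| summarize LastLogReceived = max(TimeGenerated)\n| project IsConnected = LastLogReceived > ago(30d)"]
    }
  ],
  "instructionSteps": [
    {
      "title": "Configure log forwarding",
      "description": "In the product console, enter your Microsoft Sentinel workspace ID and primary key."
    }
  ]
}
```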
- Prepare sample data for validation and submission – Plan to submit some real-world, sanitized raw sample data/logs for your connectors covering all types of logs, events, alerts, etc., depending on the data type. Sample data is extremely useful when troubleshooting issues and when supporting and/or enhancing the data connectors with more security-focused content (such as analytics, hunting queries, workbooks, etc.). The following guidelines are designed to help you commit sample data in a usable format into GitHub:
  - The extension for the file can be .json (for API-based data connectors) / .txt (for Syslog/CEF-based data connectors), with the column names / property names adhering to the data type property names.
  - Submit the sample data via a GitHub PR. All sample data files must reside inside a folder called "Sample Data" within the Solution folder. Example folder structure - "Azure-Sentinel/Solutions/<Solution Name>/Sample Data/".
  - Important: Please ensure all sample data has been scrubbed to remove all sensitive PII that may exist in the logs. The intent is to understand the "what" and "how" from the logs, not the "who". A hypothetical scrubbed record is shown below.
  IMPORTANT!: Detailed guidance on sample data contribution, including expected file names, format, file extensions, and extraction method, is available here.
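For illustration, a hypothetical scrubbed .json sample record (all field names and values are invented, and the IP comes from a documentation-reserved range) could look like:

```json
[
  {
    "TimeGenerated": "2023-04-01T12:34:56Z",
    "EventType": "SignIn",
    "SourceIp": "203.0.113.10",
    "UserName": "redacted.user@example.com",
    "Result": "Success"
  }
]
```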
- Submit your data connector - Follow the general contribution guidelines for Microsoft Sentinel to open a Pull Request (PR) to submit the data connector:
- The json file in the 'Connectors' folder
- The sample data file in the right folder. Example folder structure - "Azure-Sentinel/Solutions/<Solution Name>/Sample Data/"
- The company logo adhering to the following requirements in the 'Logo' folder
- Logo needs to be in SVG format and under 5 KB
- Ensure raw file of logo does not have any of the following:
- cls and style formats
- embedded png formats
- xmlns:xlink
- data-name
- Do not use xlink:href - use inline instead
- Do not use title tag
- If some properties in the logo have IDs (e.g., <g id="layer0"...>), then set these IDs as GUIDs so that they are uniquely identifiable across all Azure logo assets
- Logo scales well to fit in a 75 px square
- SVG code file is formatted well for readability
- For a Syslog data connector, place the Kusto function parser in the right subfolder (PROVIDERNAME) of the 'Parsers' folder
- If you are bringing in detections or hunting queries, the requiredDataConnectors section of the YAML template must be populated (see the snippet below). Details of what to reference in the YAML template from the connector JSON are in the Query Style Guide under requiredDataConnectors
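For example, a populated requiredDataConnectors section typically looks like the following; connectorId and dataTypes are placeholders that must match the values in your connector json:

```yaml
requiredDataConnectors:
  - connectorId: MyProviderMyAppliance
    dataTypes:
      - MyProduct_CL
```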
- Follow these steps to build and publish your data connector in a solution. This is required to enable customer discoverability and telemetry insights for future enhancements to your integrations.
- Follow the guidance in the Evolve the data experience section below to enrich your solution with workbooks, analytics rules (detections), hunting queries, playbooks, and more OOTB content.
Follow the steps to build your workbook and submit, via a PR as mentioned in the instructions: your workbook json file, two screenshots of the workbook view (one each in the white and black background theme settings), the logo, and an entry in the 'workbooksMetadata.json' file (a hypothetical entry is sketched below).
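As a sketch, an entry in 'workbooksMetadata.json' generally follows this shape; the field names reflect the repository's file at the time of writing, and all values below are placeholders:

```json
{
  "workbookKey": "MyProductWorkbook",
  "logoFileName": "my_provider_logo.svg",
  "description": "Gain insights into events from My Product.",
  "dataTypesDependencies": ["MyProduct_CL"],
  "dataConnectorsDependencies": ["MyProviderMyAppliance"],
  "previewImagesFileNames": ["MyProductWorkbookWhite.png", "MyProductWorkbookBlack.png"],
  "version": "1.0.0",
  "title": "My Product Workbook",
  "templateRelativePath": "MyProductWorkbook.json",
  "subtitle": "",
  "provider": "My Provider"
}
```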
Follow the steps to build and submit your analytic rule template or detection pertaining to this data connector. Be sure to fill in the requiredDataConnectors parameter with the right data connector ID(s) to relate this analytic rule template to the data connector.
Build logic apps connectors to enable automation capabilities for customers in the following areas:
- Incident management – e.g., assign a ticket to an analyst, keep ticket status in sync, …
- Enrichment and investigation – e.g., geo lookup for an IP, sending investigation emails, …
- Remediation – e.g., block an IP address, block user access, isolate a machine, …
- Any other automation capabilities unique to your appliance.
Follow the steps in the Azure Logic Apps building custom connectors documentation to create, certify, and ship an Azure Logic Apps connector. This makes your connector not only discoverable for Microsoft Sentinel customers, but also visible in the Azure Logic Apps gallery for Azure Logic Apps and Microsoft Flow customers. Inform [email protected] if you are thinking of building a custom connector for your security appliance.
Check out the Microsoft Sentinel GitHub repo for more information on these.
Reach out to [email protected].