{ "cells": [ { "cell_type": "markdown", "metadata": {}, "source": [ "# 10 minutes to CLX\n", "\n", "This is a short introduction to CLX geared mainly towards new users of the code.\n", "\n", "## What are these libraries?\n", "\n", "CLX (Cyber Log Accelerators) provides a simple API for security analysts, data scientists, and engineers to quickly get started applying RAPIDS to real-world cyber use cases. CLX uses the GPU dataframe ([cuDF](https://github.com/rapidsai/cudf)) and other RAPIDS packages to execute cybersecurity and information security workflows. The following packages are available:\n", "\n", "* analytics - Machine learning and statistics functionality\n", "* ip - IPv4 data translation and parsing\n", "* parsers - Cyber log Event parsing\n", "* io - Input and output features for a workflow\n", "* workflow - Workflow which receives input data and produces analytical output data\n", "* osi - Open source integration (VirusTotal, FarsightDB and Whois)\n", "* dns - TLD extraction\n", "\n", "\n", "## When to use CLX\n", "\n", "Use CLX to build your cyber data analytics workflows for a GPU-accelerated environmetn using RAPIDS. CLX contains common cyber and cyber ML functionality, such as log parsing for specific data sources, cyber data type parsing (e.g., IPv4), and DGA detection. CLX also provides the ability to integrate this functionality into a CLX workflow, which simplifies execution of the series of parsing and ML functions needed for end-to-end use cases.\n" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "## Log Parsing \n", "\n", "CLX provides traditional parsers for some common log types.\n", "Here’s an example parsing a common [Windows Event Log](https://www.ultimatewindowssecurity.com/securitylog/encyclopedia/default.aspx) of event code type [4770](https://www.ultimatewindowssecurity.com/securitylog/encyclopedia/event.aspx?eventid=4770)." ] }, { "cell_type": "code", "execution_count": 1, "metadata": {}, "outputs": [ { "data": { "text/html": [ "
\n", " | service_information_service_id | \n", "target_account_old_account_name | \n", "service_service_name | \n", "group_group_name | \n", "changed_attributes_account_expires | \n", "detailed_authentication_information_key_length | \n", "additional_information_result_code | \n", "account_information_security_id | \n", "changed_attributes_user_account_control | \n", "process_information_caller_process_id | \n", "... | \n", "changed_attributes_old_uac_value | \n", "attributes_profile_path | \n", "attributes_user_account_control | \n", "account_for_which_logon_failed_account_domain | \n", "account_whose_credentials_were_used_account_domain | \n", "new_logon_logon_guid | \n", "service_server | \n", "attributes_home_directory | \n", "failure_information_status | \n", "failure_information_sub_status | \n", "
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
0 | \n", "\n", " | \n", " | \n", " | \n", " | \n", " | \n", " | \n", " | \n", " | \n", " | \n", " | ... | \n", "\n", " | \n", " | \n", " | \n", " | \n", " | \n", " | \n", " | \n", " | \n", " | \n", " |
1 rows × 131 columns
\n", "\n", " | hostname | \n", "domain | \n", "suffix | \n", "subdomain | \n", "
---|---|---|---|---|
0 | \n", "www.google.com | \n", "com | \n", "www | \n", "|
1 | \n", "gmail.com | \n", "gmail | \n", "com | \n", "\n", " |
2 | \n", "github.com | \n", "github | \n", "com | \n", "\n", " |
3 | \n", "pandas.pydata.org | \n", "pydata | \n", "org | \n", "pandas | \n", "
4 | \n", "www.worldbank.org.kg | \n", "worldbank | \n", "org.kg | \n", "www | \n", "
5 | \n", "waiterrant.blogspot.com | \n", "waiterrant | \n", "blogspot.com | \n", "\n", " |
6 | \n", "forums.news.cnn.com.ac | \n", "cnn | \n", "com.ac | \n", "forums.news | \n", "
7 | \n", "forums.news.cnn.ac | \n", "cnn | \n", "ac | \n", "forums.news | \n", "
8 | \n", "b.cnn.com | \n", "cnn | \n", "com | \n", "b | \n", "
9 | \n", "a.news.uk | \n", "news | \n", "uk | \n", "a | \n", "
10 | \n", "a.news.co.uk | \n", "news | \n", "co.uk | \n", "a | \n", "
11 | \n", "a.news.co.uk | \n", "news | \n", "co.uk | \n", "a | \n", "
12 | \n", "107-193-100-2.lightspeed.cicril.sbcglobal.net | \n", "sbcglobal | \n", "net | \n", "107-193-100-2.lightspeed.cicril | \n", "
13 | \n", "a23-44-13-2.deploy.static.akamaitechnologies.com | \n", "akamaitechnologies | \n", "com | \n", "a23-44-13-2.deploy.static | \n", "
\n", " | member_account_name | \n", "attributes_password_last_set | \n", "service_service_name | \n", "attributes_profile_path | \n", "account_information_security_id | \n", "additional_information_transited_services | \n", "additional_information_caller_computer_name | \n", "network_information_direction | \n", "new_logon_account_name | \n", "changed_attributes_home_drive | \n", "... | \n", "certificate_information_certificate_issuer_name | \n", "network_information_source_network_address | \n", "service_information_service_name | \n", "privileges | \n", "account_for_which_logon_failed_account_domain | \n", "network_information_network_address | \n", "service_server | \n", "new_account_account_name | \n", "user_account_name | \n", "attributes_user_account_control | \n", "
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
0 | \n", "\n", " | \n", " | \n", " | \n", " | \n", " | \n", " | \n", " | inbound | \n", "\n", " | \n", " | ... | \n", "\n", " | \n", " | \n", " | \n", " | \n", " | \n", " | \n", " | \n", " | \n", " | \n", " |
1 rows × 131 columns
\n", "