<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:dc="http://purl.org/dc/elements/1.1/">
  <channel>
    <title>DEV Community: CloudFabrix</title>
    <description>The latest articles on DEV Community by CloudFabrix (@cloudfabrix).</description>
    <link>https://dev.to/cloudfabrix</link>
    <image>
      <url>https://media2.dev.to/dynamic/image/width=90,height=90,fit=cover,gravity=auto,format=auto/https:%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Fuser%2Fprofile_image%2F614791%2F3cd5c70f-d2a8-4044-ba7a-91fb378f6702.png</url>
      <title>DEV Community: CloudFabrix</title>
      <link>https://dev.to/cloudfabrix</link>
    </image>
    <atom:link rel="self" type="application/rss+xml" href="https://dev.to/feed/cloudfabrix"/>
    <language>en</language>
    <item>
      <title>Robotic Data Automation (RDA): Reducing Costs and Improving Efficiencies of Your Log Management Investment</title>
      <dc:creator>CloudFabrix</dc:creator>
      <pubDate>Tue, 14 Sep 2021 13:31:29 +0000</pubDate>
      <link>https://dev.to/cloudfabrix/robotic-data-automation-rda-reducing-costs-and-improving-efficiencies-of-your-log-management-investment-1k7m</link>
      <guid>https://dev.to/cloudfabrix/robotic-data-automation-rda-reducing-costs-and-improving-efficiencies-of-your-log-management-investment-1k7m</guid>
      <description>&lt;p&gt;&lt;a href="https://res.cloudinary.com/practicaldev/image/fetch/s--zzMBpqDl--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_880/https://cloudfabrix.com/blog/wp-content/uploads/2021/08/rda-Reducing-Costs-and-Improving-Efficiencies.png" class="article-body-image-wrapper"&gt;&lt;img src="https://res.cloudinary.com/practicaldev/image/fetch/s--zzMBpqDl--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_880/https://cloudfabrix.com/blog/wp-content/uploads/2021/08/rda-Reducing-Costs-and-Improving-Efficiencies.png"&gt;&lt;/a&gt;&lt;br&gt;
People’s involvement has been inevitable with log management despite advancements in ITOps. Log management at a high level collects and indexes all your application and system log files so that you can search through them quickly. It also lets you define rules based on log patterns so that you can get alerts when an anomaly occurs.&lt;br&gt;
Log management analytics solution leveraging RDA has been able to detect anomalies and aid predictive models over a machine learning layer. This has demonstrated improved efficiencies and a direct reduction in costs.&lt;/p&gt;

&lt;h2&gt;How does a conventional log management system work?&lt;/h2&gt;

&lt;p&gt;To make things better, log analytics has been deployed. However, log analytics alone cannot replace human interactions. Log analytics systems produce a significant increase in alerts which trigger the need for more &lt;a href="https://blog.viibe.co/6-tips-improve-it-support/"&gt;support from IT personnel&lt;/a&gt;, thus limiting its progress anytime soon. There is still a need to blend it with traditional operations practices, namely on-call monitoring teams and consulting analyst teams who respond to any alert received by the system. &lt;/p&gt;

&lt;p&gt;However, none of these mechanisms could explain or ease up the burden of an engineer sitting at his workstation while he/she responds to every notification generated by the log monitoring tools without being able to pre-determine whether such notifications are critical or not. &lt;/p&gt;

&lt;h2&gt;Challenges of a conventional log management tool&lt;/h2&gt;

&lt;p&gt;A log management tool has to be fast. Without this, users will not be able to search for logs. It also has to have enough storage so that it can index logs from an entire enterprise. This can be very expensive depending on the number of servers or applications you need information from. &lt;br&gt;
A log management tool needs to support every platform (Windows, Linux, Unix) your business uses in order to pull relevant metrics and log data you need for troubleshooting and analysis. &lt;br&gt;
In a conventional way, automating aggregation and distributing to different systems is not possible.&lt;/p&gt;

&lt;h2&gt;RDA-enhanced log management system&lt;/h2&gt;

&lt;p&gt;&lt;a href="https://res.cloudinary.com/practicaldev/image/fetch/s--MwBm7BZ3--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_880/https://cloudfabrix.com/blog/wp-content/uploads/2021/08/rda-log-management-system-1.png%2522" class="article-body-image-wrapper"&gt;&lt;img src="https://res.cloudinary.com/practicaldev/image/fetch/s--MwBm7BZ3--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_880/https://cloudfabrix.com/blog/wp-content/uploads/2021/08/rda-log-management-system-1.png%2522"&gt;&lt;/a&gt;&lt;br&gt;
With &lt;a href="https://cloudfabrix.com/blog/aiops/robotic-data-automation-dataops-aiops/"&gt;Robotic Data Automation(RDA)&lt;/a&gt;, your log management system will be augmented with an intelligent layer to increase efficiency without disrupting any established processes. RDA also enables the creation of advanced ML-based dynamic baselining and predictive models that makes anomaly detection efficient and more reliable.&lt;/p&gt;

&lt;p&gt;The benefits of using an RDA-enabled log management solution for your business include reduced costs and improved efficiencies in managing logs.&lt;/p&gt;

&lt;ol&gt;

&lt;li&gt;Automates alerts: The first benefit is that RDA automates the alert generation process from manual rules-based. It analyzes the raw log data and dynamically learns alert conditions by minimizing the risk of missing alerts from new/modified log entries.&lt;/li&gt;
&lt;li&gt;Auto-correlation of alerts: It supports intelligent alerting capabilities with auto-correlation of alerts across different systems/services, resulting in a reduced number of alerts turning into trouble tickets.&lt;/li&gt;
&lt;li&gt;Contextual Insights: It automatically provides contextual log data when analyzing the trouble ticket thereby reducing the need for users to manually search logs&lt;/li&gt;
&lt;li&gt;Removes duplicates: It prevents unnecessary alert noise by removing duplicates across various systems while increasing the overall quality of alerts and insights derived from them.&lt;/li&gt;
&lt;li&gt;Alert log pattern summary: It automatically summarizes the alert log patterns as time-series data making it efficient and cost-effective to retain historical knowledge.&lt;/li&gt;
&lt;li&gt;Builds regression pipeline: It allows building a regression pipeline on any attribute for predicting the trend and anomalies. 
&lt;/li&gt;
&lt;li&gt;Brings new data sources: RDA also helps in bringing new data sources into existing log management systems where native methods are either complex or simply do not exist.&lt;/li&gt;
&lt;/ol&gt;

&lt;p&gt;RDA offers out-of-box integration with Splunk and Elastic search products. Combining RDA with Splunk/ELK not only significantly reduces the overall efforts required for managing IT Operations but also provides an excellent set of capabilities for detecting anomalies in near real-time. Also, using RDA reduces false positives, meaning alerts generated are not a result of false or random events.&lt;/p&gt;

&lt;p&gt;By building these capabilities on top of existing log management analytics investments, organizations and IT Ops can reduce costs while improving efficiency and business outcomes.&lt;/p&gt;

&lt;h2&gt;RDA vs AIOps&lt;/h2&gt;

&lt;p&gt;If you are looking at optimizing the operations of a single or small subset of IT tools like a log management analytics platform, RDA can be a better choice. Standalone RDA is faster to deploy/adopt and validate the benefits at a lower cost.&lt;/p&gt;

&lt;p&gt;AIOps, on the other hand, is designed to bring transformation across IT functions. It integrates with multiple data sources and performs cross-domain correlation and analysis to deliver actionable insights.&lt;/p&gt;

&lt;h2&gt;RDA with CloudFabrix&lt;/h2&gt;

&lt;p&gt;The CloudFabrix &lt;a href="https://www.cloudfabrix.com/platform/"&gt;AIOps platform&lt;/a&gt; and all of its products are pre-packaged with RDA. This includes &lt;a href="https://www.cloudfabrix.com/aiops-solution/"&gt;Operational Intelligence&lt;/a&gt;, &lt;a href="https://www.cloudfabrix.com/solutions/asset-intelligence-and-analytics/"&gt;Asset Intelligence&lt;/a&gt;, and &lt;a href="https://www.cloudfabrix.com/observability/"&gt;Observability-In-a-Box&lt;/a&gt;. &lt;/p&gt;

&lt;p&gt;RDA is also provided as a standalone tool and as a hosted offering to help iterate, author, and publish data workflows.&lt;/p&gt;

&lt;p&gt;So with CloudFabrix, organizations can start with RDA, validate the technology promise, and seamlessly upgrade to AIOps, delivering the value across functions.&lt;/p&gt;

</description>
      <category>aiops</category>
      <category>itops</category>
      <category>roboticdataautomation</category>
      <category>loganalytics</category>
    </item>
    <item>
      <title>Excessive Alert Noise: Cause, effect, and solution</title>
      <dc:creator>CloudFabrix</dc:creator>
      <pubDate>Thu, 24 Jun 2021 10:52:59 +0000</pubDate>
      <link>https://dev.to/cloudfabrix/excessive-alert-noise-cause-effect-and-solution-477</link>
      <guid>https://dev.to/cloudfabrix/excessive-alert-noise-cause-effect-and-solution-477</guid>
      <description>&lt;p&gt;With an exponential growth in the IT sector over the last few years, traditional operational tools and process  isn’t enough to stay ahead of the market. Problems/anomalies are treated as ‘events’. Each of these events triggers an alert in the system leading to separate incidents that require individual resolution. With an increase in data, hybridization, operational tools, countless metrics, there has been a corresponding increase in alert volume. This causes inundation of high volume and variety of log data, usually with multiple false and redundant alerts.&lt;/p&gt;


&lt;center&gt;&lt;em&gt;About 40% of IT organizations see over a million event alerts a day, with 11% receiving over 10 million alerts a day.&lt;/em&gt;&lt;/center&gt;
&lt;br&gt;

&lt;p&gt;Most &lt;a href="https://snacknation.com/blog/peo-companies/"&gt;IT teams&lt;/a&gt; today operate in disparate silos, often unaware of the assets they have, their utilization or inter-dependence thereby compounding the problem. &lt;/p&gt;

&lt;h2&gt;Why is there an excess of alert noise?&lt;/h2&gt;

&lt;p&gt;Some of the common reasons for an increasing volume of alert noise are:&lt;/p&gt;

&lt;ol&gt;
&lt;li&gt;Lack of stack awareness&lt;/li&gt; 
&lt;li&gt;Static thresholds&lt;/li&gt; 
&lt;li&gt;Alert Storms&lt;/li&gt;
&lt;/ol&gt;

&lt;p&gt;&lt;b&gt;&lt;h4&gt;Lack of stack awareness&lt;/h4&gt;&lt;/b&gt;&lt;br&gt;
Traditional legacy systems process this differently using approaches that solely rely on signature/footprint matching. This does not allow for Machine Learning capabilities to perform impact analysis and correlation of alerts/events from multiple stack elements.&lt;/p&gt;

&lt;p&gt;&lt;b&gt;&lt;h4&gt;Static thresholds&lt;/h4&gt;&lt;/b&gt;&lt;br&gt;
Static thresholds are unable to take into account the dynamic nature of IT workloads. This creates alerts at pre-established levels, that no longer works for a majority of the workloads leading to an excessive number of alerts. Being unable to identify and create contextual awareness of where to disabled alerts and where to increase alert capacity proves to be a barrier.&lt;/p&gt;

&lt;p&gt;&lt;b&gt;&lt;h4&gt;Alert Storms&lt;/h4&gt;&lt;/b&gt;&lt;br&gt;
Outages both planned and unplanned stir up alert storms. Network disruption causes employees, remote users, and devices to disconnect leading to a high volume of unwanted alerts. &lt;/p&gt;

&lt;p&gt;Alert Noise is estimated to cost an average of $1.27Million per year to companies.&lt;/p&gt;

&lt;h2&gt;How does CloudFabrix help with alert reduction?&lt;/h2&gt;

&lt;p&gt;&lt;a href="https://www.cloudfabrix.com/aiops-solution/"&gt;AIOps solution&lt;/a&gt; has been implemented by 60% of organizations to reduce noise alerts and identify real-time root cause analysis.&lt;/p&gt;

&lt;p&gt;The CloudFabrix &lt;a href="https://www.cloudfabrix.com/platform/"&gt; AIOps platform&lt;/a&gt; uses combination of user configurations and advanced AI/ML algorithms such as correlation , anomaly, forecasting etc to reduce alert volume through grouping, suppression and prevention. &lt;br&gt;
&lt;br&gt;&lt;/p&gt;

&lt;h2&gt;Rule Based → AI/ML and Analytics Based Approach&lt;/h2&gt;

&lt;p&gt;Instead of relying on manual tagging and rule based grouping, CloudFabrix uses time based and asset dependency based automated grouping of multiple alerts into actionable problems. It further uses predictive analytics thereby reducing &lt;a href="https://www.cloudfabrix.com/aiops-solution/alert-watch/"&gt;alert noise&lt;/a&gt; by a significant number. &lt;/p&gt;

&lt;p&gt;&lt;b&gt;&lt;h4&gt;Static Thresholds → Dynamic Thresholds&lt;/h4&gt;&lt;/b&gt;&lt;br&gt;
Static thresholds ignore dynamic nature of IT workloads and create alerts at per-established levels, which won’t work for the majority of the IT workloads that are dynamic in nature. This results in excessive number of alerts.&lt;/p&gt;

&lt;p&gt;&lt;b&gt;&lt;h4&gt;To address this problem&lt;/h4&gt;&lt;/b&gt;&lt;br&gt;
Granular Controls: Provide granular alert controls to tune telemetry collection interval. And to minimize the alerts caused to metric fluctuations we provide hi-watermark, lo-watermark and minimum occurrence controls.&lt;br&gt;
Dynamic Thresholds: Dynamic thresholds establish a baseline for every metric and raise an alert only if the metric is deviating from baseline.&lt;br&gt;
Identify heavily utilized assets where alerting should be disabled or more capacity should be added.&lt;/p&gt;

&lt;p&gt;&lt;b&gt;&lt;h4&gt;Alert Storms →  Actionable Incidents&lt;/h4&gt;&lt;/b&gt;&lt;br&gt;
Alert Storms can occur anytime, but more so during unplanned outages , planned outages and cascading alerts &lt;/p&gt;

&lt;ol&gt;
&lt;li&gt;Planned Outages: With our platform, alerts can be configured to be ignored during planned outages like patching, backup or maintenance. In addition to this, we are able to automatically exclude network device access ports from monitoring, as this can cause an excessive number of unwanted alerts, whenever employees, remote users, phones etc. connect/disconnect from the network.&lt;/li&gt;
&lt;li&gt;Unplanned Outages: and device fluctuations or flapping situations cause alert storms, which we detect automatically  and suppress the alerts during unplanned outages like network disruption or device unavailable events.&lt;/li&gt;
&lt;li&gt;Cascading Alerts: this happens when a device/component fails resulting in alerts from other parts due to interdependence or lost connectivity between the monitoring system and the dependent devices. These deluge of alerts are often pointing to the same underlying issue. These sort of alerts can be grouped together if the system has knowledge of the interdependencies and can identify the underlying root cause issue.&lt;/li&gt; 
&lt;/ol&gt;

&lt;p&gt;Please feel free to ask anything about &lt;a href="https://cloudfabrix.com/blog/aiops/what-is-aiops-top-10-common-use-cases/"&gt;aiops&lt;/a&gt; and we will be happy to answer any queries you have.&lt;/p&gt;

</description>
      <category>alertnoise</category>
      <category>aiops</category>
      <category>ai</category>
      <category>itops</category>
    </item>
    <item>
      <title>Taming the Data Problem and Accelerating AIOps implementations with Robotic Data Automation (RDA)</title>
      <dc:creator>CloudFabrix</dc:creator>
      <pubDate>Fri, 30 Apr 2021 13:08:48 +0000</pubDate>
      <link>https://dev.to/cloudfabrix/taming-the-data-problem-and-accelerating-aiops-implementations-with-robotic-data-automation-rda-57ei</link>
      <guid>https://dev.to/cloudfabrix/taming-the-data-problem-and-accelerating-aiops-implementations-with-robotic-data-automation-rda-57ei</guid>
      <description>&lt;p&gt;&lt;a href="https://res.cloudinary.com/practicaldev/image/fetch/s--y1Vj7qA0--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_880/https://dev-to-uploads.s3.amazonaws.com/uploads/articles/l1auh2s3owju08iqkken.png" class="article-body-image-wrapper"&gt;&lt;img src="https://res.cloudinary.com/practicaldev/image/fetch/s--y1Vj7qA0--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_880/https://dev-to-uploads.s3.amazonaws.com/uploads/articles/l1auh2s3owju08iqkken.png" alt="image" width="800" height="533"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;h2&gt;What is Robotic Data Automation (RDA)&lt;/h2&gt;

&lt;p&gt;Robotic Data Automation (RDA) is a new paradigm to help automate data integration and data preparation activities involved in dealing with machine data for Analytics and &lt;a href="https://www.eteam.io/blog/ai-and-ml-in-software-development"&gt;AI/Machine Learning applications&lt;/a&gt;. RDA is not just a framework, but also includes a set of technologies and product capabilities that help implement the data automation.&lt;/p&gt;

&lt;p&gt;RDA enables enterprises to operationalize machine data at scale to drive AI &amp;amp; analytics-driven decisions.&lt;/p&gt;

&lt;p&gt;RDA has broad applicability within the enterprise realm, and to begin with, CloudFabrix took the RDA framework and applied it to solve AIOps problems – to help simplify and accelerate AIOps implementations and make them more open and extensible.&lt;/p&gt;

&lt;p&gt;RDA automates repetitive data integration, cleaning, verification, shaping, enrichment, and transformation activities using data bots that are invoked to work in succession in “no-code” data workflows or pipelines. RDA helps to move data in and out of AIOps systems easily, thereby simplifying, and accelerating AIOps implementations that otherwise would depend on numerous manual data integrations and professional services activities.&lt;/p&gt;

&lt;p&gt;To work with RDA workflows, CloudFabrix provides a collaborative integrated development environment (IDE) called AIOps Studio, which is an integral part of all CloudFabrix solutions. Using AIOps Studio, &lt;a href="https://crustlab.com/blog/before-you-start-what-to-discuss-with-your-software-development-agency/"&gt;developers&lt;/a&gt;, services, and implementation personnel can design, author, iterate and publish data workflows and finalized workflows can then be published to production systems for the workflows to take effect. &lt;/p&gt;

&lt;h2&gt;Why RDA is Needed?&lt;/h2&gt;

&lt;p&gt;Artificial Intelligence for IT Operations (AIOps) requires processing vast amounts of data obtained from various hybrid IT data sources, that are spread across on-premises, cloud, and edge environments. This data comes in various formats and delivery modes. Additionally, results and outcomes of such data processing need to be also exchanged with other tools in the IT ecosystem (Ex: ITSM/Closed-loop automation/&lt;a href="https://nuovoteam.com/"&gt;Collaboration Tools&lt;/a&gt; and BI/Reporting tools).&lt;/p&gt;

&lt;p&gt;All of this requires integrating, ingesting, preparing, verifying, cleaning, transforming, shaping, analyzing, and moving data in and out of AIOps systems in an efficient, reusable, and scalable manner. These essential tasks are most often overlooked in AIOps implementations and cause significant delays and increase costs of AIOps projects.&lt;/p&gt;

&lt;h2&gt;Challenges&lt;/h2&gt;

&lt;p&gt;Let us understand what some of the key challenges in data preparation &amp;amp; data integration activities are, when implementing AIOps projects.&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Different data formats (text/binary/json/XML/CSV), data delivery modes (streaming, batch, bulk, notifications), programmatic interfaces (APIs/&lt;a href="https://www.sortlist.com/web-development/doha-qa"&gt;Web development&lt;/a&gt;/Webhooks/Queries/CLIs)&lt;/li&gt;
&lt;li&gt;Complex data preparation activities involving integrity checks, cleaning, transforming, and shaping the data (aggregating/filtering/sorting)&lt;/li&gt;
&lt;li&gt;Raw data often lacks application or service context, requiring real-time data enrichment bringing in context from external systems&lt;/li&gt;
&lt;li&gt;Implementing data workflows require specialized programming/data science skill set&lt;/li&gt;
&lt;li&gt;Changes in source or destination systems require rewriting/updating connectors&lt;/li&gt;
&lt;/ul&gt;

&lt;h2&gt;Traditional Approach of Data Handling in AIOps:&lt;/h2&gt;

&lt;p&gt;In traditional approach, AIOps vendors provide a set of out-of-the-box integrations and once you connect AIOps software to your data sources, you are now pretty much at the mercy of how your data gets utilized, processed for producing results &amp;amp; Outcomes.&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Black box approach of data acquisition, processing, and integration&lt;/li&gt;
&lt;li&gt;Use cases and scenarios limited to what the platform supports&lt;/li&gt;
&lt;li&gt;Integrations mostly predefined/hard coded limiting reuse&lt;/li&gt;
&lt;li&gt;Complex scripting modules or cookbooks requiring specialized/programming skills (Javscript, Python etc.)&lt;/li&gt;
&lt;li&gt;Difficult to bring in external integrations for intermittent data processing (ex: enrichment)&lt;/li&gt;
&lt;li&gt;Difficult to access data in a programmatic way for complementary functions (ex: data access for scripting, reporting, dashboarding, automation etc.)&lt;/li&gt;
&lt;/ul&gt;
&lt;br&gt;
These are all inhibitors to effective AIOps implementations by way of adding delays &amp;amp; costs (manual data prep/handling activities)&lt;br&gt;
&lt;h4&gt;Need of the Hour: Robotic Data Automation for AIOps&lt;/h4&gt;
&lt;h2&gt;Robotic Data Automation (RDA), a key enabler for AIOps 2.0&lt;/h2&gt;
RDA automates DataOps, similar to what RPA did to automate business processes. RDA is integral part of AIOps solution that provides augmented data preparation and integration capabilities. RDA is both a data automation framework and a toolkit to accelerate and simplify all data handling in AIOps implementations.

Highlights
&lt;ul&gt;
&lt;li&gt;Implement &lt;a href="https://databox.com/dashboard-examples/sales-pipeline"&gt;No-code Data Pipelines&lt;/a&gt; using Data bots&lt;/li&gt;
&lt;li&gt;Native AI/ML bots&lt;/li&gt;
&lt;li&gt;CFXQL – Uniform Query Language&lt;/li&gt;
&lt;li&gt;Inline Data Mapping&lt;/li&gt;
&lt;li&gt;Data Integrity Checks&lt;/li&gt;
&lt;li&gt;Data masking, redaction, and encryption&lt;/li&gt;
&lt;li&gt;Data Shaping: Aggregation/Filtering/Sorting&lt;/li&gt;
&lt;li&gt;Data Extraction/Metrics Harvesting&lt;/li&gt;
&lt;li&gt;Synthetic Data generation&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Benefits&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Simplify and Accelerate AIOps implementations&lt;/li&gt;
&lt;li&gt;Reduces time/effort/costs tied to data prep and integrations.&lt;/li&gt;
&lt;li&gt;Suitable for DevOps/ProdOps personnel (no need of data scientist skills)&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Example Use Cases and Scenarios&lt;/p&gt;
&lt;ul&gt;

&lt;li&gt;Log Clustering: Ingest app logs from cloud and on-prem, run ML models to cluster logs, push results to Kibana/CFX dashboards&lt;/li&gt;

&lt;li&gt;CMDB Synchronization: Take latest asset inventory from CFX and push it to CMDB.&lt;/li&gt;

&lt;li&gt;E-Bonding of tickets from partner/subsidiary ITSM to customer’s ITSM (Ex: BMC incidents to ServiceNow)&lt;/li&gt;

&lt;li&gt;Incident NLP Classification: Ingest tickets from ServiceNow, do NLP classification with OpenAI (GPT-3) and enrich tickets back in ServiceNow&lt;/li&gt;

&lt;li&gt;Anomaly Detection: From Prometheus (or any monitoring tool), get historical CPU usage data for a node (Hourly). Apply regression and send a message on Slack with a list of anomalies as an attachment.&lt;/li&gt;

&lt;li&gt;Ticket Clustering: Take last 24-hrs incidents from ServiceNow, apply clustering on tickets and push results to a new dataset for visualization in CFX dashboard.&lt;/li&gt;

&lt;li&gt;Change Detection: Take baseline inventory of AWS EC2 VMs and compare against current state to highlight unplanned changes.&lt;/li&gt;
&lt;/ul&gt;

&lt;h2&gt;How it Works&lt;/h2&gt;

&lt;p&gt;CloudFabrix AIOps platform and all its products (Operational Intelligence, Asset Intelligence, Observability-In-a-Box) are pre-packaged with RDA. Additionally, RDA is also provided as a standalone tool and as a hosted offering to help iterate, author, and publish data workflows.&lt;br&gt;
To begin with, RDA provides a lot of out-of-the-box data integrations (called extensions), which are written in Python. Customers can use extensions to connect with their data sources and access them through tags or data sets. All DataOps activities like performing integrity checks, data cleaning, transforming, and shaping the data (aggregating/filtering/sorting) can be performed with data bots invoked in a pipeline, which is essentially a series of data processing steps, that typically take data from a source, do some processing and transformation, and send data to a sink. A unique aspect here is these are “No-Code” pipelines, meaning you do not have to know Python or JavaScript to implement these pipelines, and the pipeline just uses configurational semantics using text that is similar to natural language.&lt;br&gt;
Following is an example of data workflow that takes AWS EC2 Inventory Baseline&lt;br&gt;
&lt;br&gt;&lt;br&gt;
@c:simple-loop loop_var = "aws-prod:us-east-1,aws-dev:ap-south-1,aws-dev:us-west-2"&lt;br&gt;
--&amp;gt; *${loop_var}:ec2:instances&lt;br&gt;
--&amp;gt; &lt;a class="mentioned-user" href="https://dev.to/dm"&gt;@dm&lt;/a&gt;
:save name = "temp-aws-instances-${loop_var}" &lt;/p&gt;

&lt;p&gt;--&amp;gt; @c:new-block&lt;br&gt;
--&amp;gt; &lt;a class="mentioned-user" href="https://dev.to/dm"&gt;@dm&lt;/a&gt;
:concat names = "temp-aws-instances-.*"&lt;br&gt;
--&amp;gt; &lt;a class="mentioned-user" href="https://dev.to/dm"&gt;@dm&lt;/a&gt;
:selectcolumns include = 'InstanceId|InstanceType|State.Name$|LaunchTime|^Placement.AvailabilityZone$'&lt;br&gt;
--&amp;gt; &lt;a class="mentioned-user" href="https://dev.to/dm"&gt;@dm&lt;/a&gt;
:fixcolumns&lt;br&gt;
--&amp;gt; &lt;a class="mentioned-user" href="https://dev.to/dm"&gt;@dm&lt;/a&gt;
:save name = 'all-aws-instances-Mar-25'&lt;br&gt;
&lt;br&gt;&lt;br&gt;
&lt;a href="https://res.cloudinary.com/practicaldev/image/fetch/s--vOZlWCCu--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_880/https://dev-to-uploads.s3.amazonaws.com/uploads/articles/1uun579qu89cxnrrasbx.png" class="article-body-image-wrapper"&gt;&lt;img src="https://res.cloudinary.com/practicaldev/image/fetch/s--vOZlWCCu--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_880/https://dev-to-uploads.s3.amazonaws.com/uploads/articles/1uun579qu89cxnrrasbx.png" alt="image" width="880" height="486"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;Fig: AIOps Studio: Visualization of Data Pipeline execution&lt;/p&gt;

&lt;p&gt;AIOps product has a lot of prebuilt pipelines that provide a turnkey experience for all core AIOps use cases and scenarios, so that customers perform integrations with their tools and get to work right away. Anytime customizations are needed, either to address new use case or scenario, or update the way data transformations or enrichments are done, customers can use RDA utility to iterate and experiment with pipelines, and once pipelines are functioning as expected, customers can then publish the pipelines to AIOps platform to be deployed into production. Resultant datasets can be easily visualized in CFX dashboards.&lt;br&gt;
This process is like how &lt;a href="https://scalac.io/blog/why-developers-should-pay-attention-to-zio-in-2021/"&gt;developers&lt;/a&gt; use Integrated Development Environment (IDEs) to write code, iterate, compile and &lt;a href="https://attrock.com/image-compression-tool/"&gt;build images&lt;/a&gt;, which are then pushed onto a runtime environment. In the same way, RDA provides a collaborative Jupyter style notebook authoring tool and workflow visualization tool, using which pipelines can be built or customized, and once finalized they can be pushed on to the production AIOps platform. For workflow authoring, RDA is also available as a standalone tool that can be deployed in a customer’s own environment or it can be used as a service within CloudFabrix’s cloud-hosted environment.&lt;br&gt;
&lt;a href="https://res.cloudinary.com/practicaldev/image/fetch/s--hAsNrPvO--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_880/https://dev-to-uploads.s3.amazonaws.com/uploads/articles/2vjx740egc2q8p5tpy73.png" class="article-body-image-wrapper"&gt;&lt;img src="https://res.cloudinary.com/practicaldev/image/fetch/s--hAsNrPvO--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_880/https://dev-to-uploads.s3.amazonaws.com/uploads/articles/2vjx740egc2q8p5tpy73.png" alt="image" width="880" height="509"&gt;&lt;/a&gt;&lt;br&gt;
&lt;a href="https://res.cloudinary.com/practicaldev/image/fetch/s--2kIwadxm--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_880/https://dev-to-uploads.s3.amazonaws.com/uploads/articles/rqvfvaidj0w1ch0u7gji.png" class="article-body-image-wrapper"&gt;&lt;img src="https://res.cloudinary.com/practicaldev/image/fetch/s--2kIwadxm--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_880/https://dev-to-uploads.s3.amazonaws.com/uploads/articles/rqvfvaidj0w1ch0u7gji.png" alt="image" width="768" height="347"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;Fig: AIOps Studio provides IDE-like capabilities for DataOps&lt;br&gt;
Getting Started – AIOps Studio&lt;br&gt;
It is very easy to get started with RDA using AIOps Studio, which is visual IDE for authoring and publishing data pipelines. We have few options available.&lt;/p&gt;

&lt;ol&gt;
&lt;li&gt; Install on top of Docker&lt;/li&gt;
&lt;li&gt; AWS image&lt;/li&gt;
&lt;li&gt; Hosted service&lt;/li&gt;
&lt;/ol&gt;

&lt;p&gt;Once you bring up RDA install using one of above options, you can invoke AIOps Studio and pretty much do all data operations using the studio, that provides intuitive user interface.&lt;br&gt;
 &lt;a href="https://res.cloudinary.com/practicaldev/image/fetch/s--XH9g6mW6--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_880/https://dev-to-uploads.s3.amazonaws.com/uploads/articles/vq6sxon28um4od4g684r.png" class="article-body-image-wrapper"&gt;&lt;img src="https://res.cloudinary.com/practicaldev/image/fetch/s--XH9g6mW6--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_880/https://dev-to-uploads.s3.amazonaws.com/uploads/articles/vq6sxon28um4od4g684r.png" alt="image" width="880" height="490"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;Fig: AIOps Studio Welcome Page&lt;/p&gt;

&lt;h2&gt;AIOps Studio Tasks&lt;/h2&gt;

&lt;p&gt;Following are some of the tasks that can be performed from AIOps Studio:&lt;/p&gt;

&lt;ol&gt;
&lt;li&gt; Add a New Pipeline&lt;/li&gt;
&lt;li&gt; Edit a Pipeline&lt;/li&gt;
&lt;li&gt; Execute a Pipeline&lt;/li&gt;
&lt;li&gt; Pipeline Syntax Verification&lt;/li&gt;
&lt;li&gt; Inspect Pipeline Data Results at Each Stage&lt;/li&gt;
&lt;li&gt; Visualize and Debug Pipeline Execution&lt;/li&gt;
&lt;li&gt; Publish Pipeline to Production App&lt;/li&gt;
&lt;li&gt; Explore Data Automation Bots&lt;/li&gt;
&lt;li&gt; Explore API models for Data Automation Bots&lt;/li&gt;
&lt;li&gt;Explore Datasets&lt;/li&gt;
&lt;li&gt;Administration: Manage Bot Sources&lt;/li&gt;
&lt;li&gt;Administration: Check Connectivity for Configured Sources&lt;/li&gt;
&lt;li&gt;Administration: View Plugins&lt;/li&gt;
&lt;li&gt;Administration: Add New Plugin (coming soon)&lt;/li&gt;
&lt;li&gt;Show Inline Help&lt;/li&gt;
&lt;li&gt;More …
Anyone interested in trying out RDA can sign up for interest list here: &lt;a href="https://www.roboticdata.ai/signup/"&gt;https://www.roboticdata.ai/signup/&lt;/a&gt;
&lt;/li&gt;
&lt;/ol&gt;

</description>
      <category>rda</category>
      <category>aiops</category>
      <category>mlops</category>
      <category>itops</category>
    </item>
    <item>
      <title>Observability is Transforming ITOM Landscape as Next Generation Monitoring</title>
      <dc:creator>CloudFabrix</dc:creator>
      <pubDate>Thu, 15 Apr 2021 05:45:20 +0000</pubDate>
      <link>https://dev.to/cloudfabrix/observability-is-transforming-itom-landscape-as-next-generation-monitoring-2ahd</link>
      <guid>https://dev.to/cloudfabrix/observability-is-transforming-itom-landscape-as-next-generation-monitoring-2ahd</guid>
      <description>&lt;p&gt;&lt;a href="https://res.cloudinary.com/practicaldev/image/fetch/s--aaifBEI9--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_880/https://cdn.steemitimages.com/DQmTM185eMb8J8hLhGoBoqr5yxsMCcEgoqYgvSGNyjt2ywT/image.png" class="article-body-image-wrapper"&gt;&lt;img src="https://res.cloudinary.com/practicaldev/image/fetch/s--aaifBEI9--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_880/https://cdn.steemitimages.com/DQmTM185eMb8J8hLhGoBoqr5yxsMCcEgoqYgvSGNyjt2ywT/image.png" alt="image.png"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;First things first. Observability is inherent as a principle to a system and not something that is instilled. Here, we are addressing observability as an open source-based solution in the context of insightful monitoring within the ITOM landscape.&lt;/p&gt;

&lt;p&gt;ITOM is now in the middle of addressing the needs of the expanding and dynamic nature of IT infrastructure as a function. It is no longer about being a monolithic computing stack. It is now beyond monitoring discrete infrastructure elements. &lt;/p&gt;

&lt;p&gt;Simple systems are smart enough to run a diagnosis and correct it on their own. But, native-cloud businesses with &lt;a href="https://middleware.io/blog/microservices-architecture/?swcfpc=1"&gt;micro service architecture&lt;/a&gt; and distributed systems are complex. &lt;/p&gt;

&lt;p&gt;System admins and ITops analysts were the eyes of systems, services and apps. It circled around ticketing, resolution, and people dependency. Complex systems depending on people increased the level of failure.&lt;/p&gt;

&lt;p&gt;Now, ITOM is about observability that creates a proactive and transparent among complex and modern ecosystems.&lt;/p&gt;

&lt;h1&gt;Why Observability for your ITOM?&lt;/h1&gt;

&lt;p&gt;Observability provides a deeper operational visibility that brings an organization-wide sense of customer experience.  For digital transformation and cloud adoption, observability is one of the essential and critical pillars. The monitoring workload is also shifted to development. Apart from these, a few trends make observability a necessity for complex structures.&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Businesses are living with the pressure to innovate fast. The development teams launch something every other hour, and they need to talk to internal systems for insights. &lt;/li&gt;
&lt;li&gt;Apps and software have to deliver a &lt;a href="https://www.channels.app/blog/customer-service-oriented"&gt;high customer experience.&lt;/a&gt; &lt;/li&gt;
&lt;li&gt;To meet these business needs, technology stacks is adding more and more tools. It is getting bigger and complex.&lt;/li&gt;
&lt;li&gt;To put all these in place, there is a lot of demand for devops and automation. Creating intelligent autonomous systems is the ITOM’s new skill to be added.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Observability has been able to respond to these trends. Teams love it: Observability is gaining attention in the software world. It is able to effectively enable engineers to &lt;a href="https://www.liveagent.com/academy/customer-experience/"&gt;deliver excellent customer experiences&lt;/a&gt; with software despite the complexity of the modern digital enterprise. It is helping &lt;a href="https://crustlab.com/blog/before-you-start-what-to-discuss-with-your-software-development-agency/"&gt;modern software teams&lt;/a&gt;. On the whole, it have proven to:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Deliver high-quality software at scale&lt;/li&gt;
&lt;li&gt;Build a sustainable culture of innovation&lt;/li&gt;
&lt;li&gt;Optimize investments in cloud and modern tools&lt;/li&gt;
&lt;li&gt;See the real-time performance of their digital business&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Observability, across different use cases is able to provide full visibility into rapidly scaling infrastructure and applications, anomaly detection and metrics. It is not yet another monitoring tool. &lt;/p&gt;

&lt;h1&gt;Observability, the future of systems and app monitoring&lt;/h1&gt;

&lt;p&gt;Monitoring as a tool has detected problems and anomalies in applications. Troubleshooters used to also gain insights into capacity requirements and performance trends over time. But, monitoring would be useless if these systems are not externalized enough. &lt;/p&gt;

&lt;p&gt;Observability as a next generation monitoring mechanism measures how a system’s internal state can infer external outputs. It develops organizational capabilities of monitoring and analyzing events, along with KPIs and other data. Observability yields actionable insights.&lt;/p&gt;

&lt;p&gt;CloudFabrix’s &lt;a href="https://www.cloudfabrix.com/observability/?utm_source=dev.to&amp;amp;utm_medium=referral&amp;amp;utm_campaign=aiops"&gt;Observability&lt;/a&gt; solution is built using open source and open telemetry components. It draws data from across different data streams coming from servers, databases, applications, tools and services through turn-key integrations across the full DevOps stack. It also includes a lightweight agent deployed that gets automatically provisioned on the host operating systems.&lt;/p&gt;

&lt;p&gt;&lt;a href="https://res.cloudinary.com/practicaldev/image/fetch/s--YTp6u8se--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_880/https://cdn.steemitimages.com/DQmTkTNXicSEfMpubnVyCsHqJF55puW4m7Ciccjgonvj5a4/image.png" class="article-body-image-wrapper"&gt;&lt;img src="https://res.cloudinary.com/practicaldev/image/fetch/s--YTp6u8se--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_880/https://cdn.steemitimages.com/DQmTkTNXicSEfMpubnVyCsHqJF55puW4m7Ciccjgonvj5a4/image.png" alt="image.png"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;h2&gt;Observability is applied in the following use cases: &lt;/h2&gt;

&lt;ul&gt;
&lt;li&gt;To reduce visibility gaps for modern applications architecture and dynamic workloads since the existing monitoring tools are not designed to support.&lt;/li&gt;
&lt;li&gt;Provide context to the monitoring data to explain why and where questions.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Get a &lt;a href="https://www.youtube.com/watch?v=L6-wlH0N8dk"&gt;full tutorial of observability-in-a-box.&lt;/a&gt;&lt;/p&gt;

&lt;h1&gt;Observability in a box in the AIOps context&lt;/h1&gt;

&lt;p&gt;With growing complexity, the above use cases hint the need for inclusion of observability with AIOps. &lt;/p&gt;

&lt;p&gt;Traditional AIOps could not be applied for all use cases that challenged ITOM landscape. It served the immediate purpose but with high investment of resources and longer time to value.&lt;/p&gt;

&lt;p&gt;Observability in a box in the AIOps context&lt;br&gt;
With growing complexity, the above use cases hint the need for inclusion of observability with AIOps. &lt;/p&gt;

&lt;p&gt;Traditional AIOps could not be applied for all use cases that challenged ITOM landscape. It served the immediate purpose but with high investment of resources and longer time to value.&lt;/p&gt;

&lt;p&gt;AIOps 2.0 with observability in a box provides quick turnaround with easy implementation. It brings an outcome-driven approach for metrics, events, logs, traces, and alerts.&lt;/p&gt;

&lt;p&gt;Traditional ITOM comes with data centers geographically distributed. This adds to the alert noise and ticket volume. The problem resolution time is extended with reactive operations and lack of ability to preempt outages. &lt;/p&gt;

&lt;h2&gt;AIOps 2.0 by CloudFabrix with observability enables: &lt;/h2&gt;

&lt;ul&gt;
&lt;li&gt;Close identification of specific problems in a switch.&lt;/li&gt;
&lt;li&gt;Dependency recognition, where the problem has actually triggered from.&lt;/li&gt;
&lt;li&gt;Identification of other observability gaps&lt;/li&gt;
&lt;li&gt;Asset intelligence&lt;/li&gt;
&lt;li&gt;Reduced MTTR (mean time to repair)&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;It just gets better with more trained data and resolution models. It uses asset and model intelligence to correlate alerts where connected data relies on feeding information. Recommendations about adding new servers reduces the workload while leveraging the capabilities of Edge AI.&lt;/p&gt;

&lt;h1&gt;Why Observability is key for AIOps journey? &lt;/h1&gt;

&lt;p&gt;Observability is about providing a unified view across IT operations and systems which is tied to your business KPIs. &lt;a href="https://www.cloudfabrix.com/"&gt;AIOps&lt;/a&gt; is a part of Observability. AIOps 2.0 is all about intelligence. This element of intelligence is impossible without the insights drawn by observability. &lt;/p&gt;

&lt;h2&gt;We are referring to: &lt;/h2&gt;

&lt;ul&gt;
&lt;li&gt;Cost reduction recommendations rather than matrices analysis&lt;/li&gt;
&lt;li&gt;Delay avoidance rather than processing delay intimation&lt;/li&gt;
&lt;li&gt;Real-time metric data analysis and trends with aligned actionable insights and anomaly detection.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;So, rather than telling you where the problem is, &lt;a href="https://www.cloudfabrix.com/aiops-solution/"&gt;AIOps solution&lt;/a&gt; with observability will tell you where the problem is, how it is solved, and what should be done to avoid it later. This continuous feed of intelligence is only possible when AIOps come together with observability.&lt;br&gt;
Click &lt;a href="https://www.cloudfabrix.com/?utm_source=dev.to&amp;amp;utm_medium=referral&amp;amp;utm_campaign=aiops"&gt;here&lt;/a&gt; to learn more about CloudFabrix with the help of this &lt;a href="http://hunchads.com/"&gt;dynamic creative&lt;/a&gt;.&lt;/p&gt;

</description>
      <category>ai</category>
      <category>aiops</category>
      <category>mlops</category>
      <category>observability</category>
    </item>
  </channel>
</rss>
