<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:dc="http://purl.org/dc/elements/1.1/">
  <channel>
    <title>DEV Community: Rashmiranjan Sahoo</title>
    <description>The latest articles on DEV Community by Rashmiranjan Sahoo (@rashmiranjan28).</description>
    <link>https://dev.to/rashmiranjan28</link>
    <image>
      <url>https://media2.dev.to/dynamic/image/width=90,height=90,fit=cover,gravity=auto,format=auto/https:%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Fuser%2Fprofile_image%2F1187453%2F426fbb07-0578-44ab-ac56-bc1fff42ffb8.png</url>
      <title>DEV Community: Rashmiranjan Sahoo</title>
      <link>https://dev.to/rashmiranjan28</link>
    </image>
    <atom:link rel="self" type="application/rss+xml" href="https://dev.to/feed/rashmiranjan28"/>
    <language>en</language>
    <item>
      <title>Azure storage service - learning day 4</title>
      <dc:creator>Rashmiranjan Sahoo</dc:creator>
      <pubDate>Sat, 21 Oct 2023 04:26:32 +0000</pubDate>
      <link>https://dev.to/rashmiranjan28/azure-storage-service-learning-day-4-26e7</link>
      <guid>https://dev.to/rashmiranjan28/azure-storage-service-learning-day-4-26e7</guid>
      <description>&lt;h2&gt;
  
  
  &lt;strong&gt;What is lifecycle management?&lt;/strong&gt;
&lt;/h2&gt;

&lt;p&gt;Lifecycle management offers a rule-based policy that &lt;strong&gt;we can use to transition our data to the most suitable access tier&lt;/strong&gt; and to delete data at the end of its lifecycle.&lt;/p&gt;
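
&lt;p&gt;To make the idea of a rule-based policy concrete, here is a minimal Python sketch that builds a policy document in the JSON shape that Azure lifecycle management rules use, to the best of my understanding. The rule name, the logs/ prefix and the day thresholds are assumptions made up for this example, not values from a real account.&lt;/p&gt;

```python
import json

def make_lifecycle_policy(prefix, cool_after_days, archive_after_days, delete_after_days):
    """Build a lifecycle policy document as a plain dict (illustrative values only)."""
    return {
        "rules": [
            {
                "enabled": True,
                "name": "tier-then-delete",  # hypothetical rule name
                "type": "Lifecycle",
                "definition": {
                    # Only block blobs whose name starts with the prefix are affected.
                    "filters": {
                        "blobTypes": ["blockBlob"],
                        "prefixMatch": [prefix],
                    },
                    # Each action fires once the blob has gone unmodified for N days.
                    "actions": {
                        "baseBlob": {
                            "tierToCool": {"daysAfterModificationGreaterThan": cool_after_days},
                            "tierToArchive": {"daysAfterModificationGreaterThan": archive_after_days},
                            "delete": {"daysAfterModificationGreaterThan": delete_after_days},
                        }
                    },
                },
            }
        ]
    }

# Hypothetical thresholds: cool after 30 days, archive after 90, delete after 365.
policy = make_lifecycle_policy("logs/", 30, 90, 365)
print(json.dumps(policy, indent=2))
```

&lt;p&gt;The printed JSON is the kind of document we would paste into the lifecycle management code view of a storage account, after adjusting the thresholds to our own data.&lt;/p&gt;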

&lt;h2&gt;
  
  
  Security in Azure
&lt;/h2&gt;

&lt;p&gt;Security is one of the most important aspects of the cloud.&lt;/p&gt;

&lt;p&gt;Security controls are available across all the storage services:&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Authentication&lt;/strong&gt;&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Storage account keys&lt;/li&gt;
&lt;li&gt;Shared access signature (SAS)&lt;/li&gt;
&lt;li&gt;Azure Active Directory (Azure AD)&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;&lt;strong&gt;Access Control&lt;/strong&gt;&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Role-based access control (RBAC)&lt;/li&gt;
&lt;li&gt;Access control list (ACL)&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;&lt;strong&gt;Network access&lt;/strong&gt;&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Firewall and virtual network&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;&lt;strong&gt;Data Protection&lt;/strong&gt;&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Data encryption in transit&lt;/li&gt;
&lt;li&gt;Data encryption at rest&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;&lt;strong&gt;Difference between IAM (RBAC) and ACL&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;With IAM, which is where RBAC roles are assigned, we give access at the storage account or container level, in both Blob Storage and Data Lake.&lt;/p&gt;

&lt;p&gt;With ACLs we give access at the directory (folder) and file level, and this is only available for Data Lake Gen2.  &lt;/p&gt;

&lt;p&gt;To set an ACL, click on the container to open its directory structure, &lt;br&gt;
which is a hierarchical file structure, &lt;br&gt;
 then right-click on any directory. &lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Select Manage ACL.&lt;/li&gt;
&lt;li&gt;Add Principal.
Here a principal can be a user, a group, or an application (service principal).&lt;/li&gt;
&lt;/ul&gt;
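
&lt;p&gt;Directory-level ACLs in Data Lake Gen2 follow a POSIX-like model of read, write and execute permissions. As a rough sketch, the snippet below parses the short text form of an ACL (comma-separated scope:principal:permissions entries) that command-line tools commonly accept. The principal id analyst-object-id is a made-up placeholder, and the helper names are mine.&lt;/p&gt;

```python
def parse_acl(acl_text):
    """Split a short-form ACL string into (scope, principal, permissions) tuples."""
    entries = []
    for raw in acl_text.split(","):
        scope, principal, perms = raw.strip().split(":")
        # An empty principal means the owning user, the owning group, or everyone else.
        entries.append((scope, principal, perms))
    return entries

def can_read(perms):
    """Permissions are three characters: r, w, x, or a dash when not granted."""
    return perms[0] == "r"

# Hypothetical ACL: owner has full access, one named user and the owning group
# can read and traverse, everyone else has no access.
acl = "user::rwx,user:analyst-object-id:r-x,group::r-x,other::---"

for scope, principal, perms in parse_acl(acl):
    label = principal or "(owning entry)"
    print(scope, label, perms, "read" if can_read(perms) else "no read")
```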

&lt;p&gt;&lt;strong&gt;Data redundancy for storage&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;Redundancy keeps multiple copies of our data, in a data lake or any other storage account, for high availability and disaster recovery.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Types&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;LRS&lt;/strong&gt; (Locally redundant storage)&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Three copies of our data are maintained within a single data center in the primary region.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;&lt;strong&gt;ZRS&lt;/strong&gt; (Zone-redundant storage)&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Three copies of our data are replicated synchronously across 3 Azure availability zones in the primary region.&lt;/li&gt;
&lt;li&gt;Zones are separate physical locations, each with its own data centers.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;&lt;strong&gt;GRS&lt;/strong&gt; (Geo-redundant storage)&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;This allows our data to be stored in different geographic areas of the country or world.&lt;/li&gt;
&lt;li&gt;We get 3 copies within the primary region, and it goes one step further by asynchronously placing 3 additional copies in a secondary region.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;&lt;strong&gt;RA-GRS&lt;/strong&gt; (Read-access geo-redundant storage)&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;This is GRS plus read-only access to the copies in the secondary region, which is useful for things like reporting.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;&lt;strong&gt;GZRS&lt;/strong&gt; (Geo-zone-redundant storage)&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Copies our data synchronously across 3 Azure availability zones in the primary region using ZRS.&lt;/li&gt;
&lt;li&gt;Then asynchronously copies our data to a single physical location within the secondary region.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;&lt;strong&gt;RA-GZRS&lt;/strong&gt; (Read-access geo-zone-redundant storage)&lt;/p&gt;

&lt;p&gt;This is GZRS plus read access to the copies in the secondary region.&lt;/p&gt;
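
&lt;p&gt;To compare the six options side by side, here is a small Python sketch that encodes the properties described above (number of zones used in the primary region, whether a secondary region exists, and whether that secondary is readable) and picks the first option that meets a set of needs. The pick function and its parameter names are my own, purely for illustration.&lt;/p&gt;

```python
from collections import namedtuple

# primary_zones: 1 means a single data center, 3 means three availability zones.
Redundancy = namedtuple("Redundancy", "primary_zones secondary_region secondary_readable")

OPTIONS = {
    "LRS":     Redundancy(1, False, False),
    "ZRS":     Redundancy(3, False, False),
    "GRS":     Redundancy(1, True, False),
    "RA-GRS":  Redundancy(1, True, True),
    "GZRS":    Redundancy(3, True, False),
    "RA-GZRS": Redundancy(3, True, True),
}

def pick(need_zone_resilience, need_region_resilience, need_secondary_reads):
    """Return the first listed option that satisfies every requested property."""
    for name, opt in OPTIONS.items():
        if need_zone_resilience and opt.primary_zones != 3:
            continue
        if need_region_resilience and not opt.secondary_region:
            continue
        if need_secondary_reads and not opt.secondary_readable:
            continue
        return name
    return None

print(pick(False, False, False))  # LRS
print(pick(True, True, False))    # GZRS
print(pick(False, True, True))    # RA-GRS
```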

&lt;p&gt;&lt;strong&gt;Monitoring service&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;Azure Monitor helps us to maximize the availability and performance of our applications and services.&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;The information it collects helps us to understand how our applications are performing, and it proactively identifies issues affecting them and the resources they depend on.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;The components of monitoring services are:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Alerts&lt;/li&gt;
&lt;li&gt;Metrics&lt;/li&gt;
&lt;li&gt;Diagnostics&lt;/li&gt;
&lt;li&gt;Log Analytics&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;&lt;strong&gt;Alerts&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;Alerts in Azure Monitor proactively notify us of critical conditions and can attempt to take corrective actions.&lt;/p&gt;

</description>
      <category>azure</category>
    </item>
    <item>
      <title>Creating Azure data lake gen2- learning day 3</title>
      <dc:creator>Rashmiranjan Sahoo</dc:creator>
      <pubDate>Thu, 19 Oct 2023 16:10:08 +0000</pubDate>
      <link>https://dev.to/rashmiranjan28/creating-azure-data-lake-gen2-learning-day-3-3711</link>
      <guid>https://dev.to/rashmiranjan28/creating-azure-data-lake-gen2-learning-day-3-3711</guid>
      <description>&lt;p&gt;Disclaimer: I am writing this to revisit all the concepts in Azure data engineering and strengthen my fundamentals. If you find this article helpful, I would love that. &lt;/p&gt;

&lt;h2&gt;
  
  
  &lt;strong&gt;What is Azure Data Lake Gen2?&lt;/strong&gt;
&lt;/h2&gt;

&lt;p&gt;It is a data lake solution provided by Azure.&lt;br&gt;
It combines the capabilities of &lt;strong&gt;Blob Storage and Data Lake Gen1&lt;/strong&gt; to provide a highly scalable and secure data lake for big data storage and analytics.&lt;/p&gt;

&lt;h2&gt;
  
  
  &lt;strong&gt;How to create a data lake?&lt;/strong&gt;
&lt;/h2&gt;

&lt;ul&gt;
&lt;li&gt;&lt;p&gt;To create a data lake in Azure:&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;Click on create a storage account.&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;Fill in all the details.&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;Click on next. &lt;/p&gt;&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;In the Advanced tab there is a section:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Data Lake Storage Gen2&lt;/li&gt;
&lt;li&gt;Enable hierarchical namespace ☑&lt;/li&gt;
&lt;li&gt;Click on this checkbox. &lt;/li&gt;
&lt;/ul&gt;


&lt;/li&gt;
&lt;li&gt;&lt;p&gt;Ticking this checkbox turns our storage account into Azure Data Lake Gen2 with a hierarchical namespace (directories), which is not available in plain Blob Storage.&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;In the background, Azure configures the data lake and manages the underlying disk space and clusters for us.&lt;/p&gt;&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;When we open our storage account: &lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Click on Containers to create a container. 

&lt;ul&gt;
&lt;li&gt;Give it a name. &lt;/li&gt;
&lt;li&gt;Click Create.&lt;/li&gt;
&lt;/ul&gt;


&lt;/li&gt;
&lt;li&gt;When we click on the container that we created, we can see different options on the top bar, including &lt;strong&gt;Add directory&lt;/strong&gt;, which gives us a disk-like folder structure.&lt;/li&gt;
&lt;/ul&gt;

&lt;h2&gt;
  
  
  What is hierarchical namespace?
&lt;/h2&gt;

&lt;p&gt;Hierarchical namespace is modeled on the Linux file system and the Hadoop Distributed File System (HDFS).&lt;/p&gt;

&lt;p&gt;It organizes files into a hierarchy of directories for efficient data access.  &lt;/p&gt;
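
&lt;p&gt;To illustrate what a hierarchy of directories buys us, here is a small Python sketch that turns slash-separated paths into a nested directory tree and then lists one directory, the way we would browse a folder. This is only an in-memory model of the idea, not how Azure implements it, and the file paths are invented for the example.&lt;/p&gt;

```python
def build_tree(paths):
    """Turn slash-separated paths into a nested dict of directories and files."""
    root = {}
    for path in paths:
        node = root
        parts = path.split("/")
        for directory in parts[:-1]:
            node = node.setdefault(directory, {})
        node[parts[-1]] = None  # None marks a file
    return root

def list_directory(tree, directory):
    """List the immediate children of a directory, like browsing a folder."""
    node = tree
    for part in directory.split("/"):
        node = node[part]
    return sorted(node)

# Hypothetical data lake layout.
paths = [
    "raw/2023/10/sales.csv",
    "raw/2023/10/clicks.json",
    "raw/2023/11/sales.csv",
    "curated/sales.parquet",
]
tree = build_tree(paths)
print(list_directory(tree, "raw/2023"))     # ['10', '11']
print(list_directory(tree, "raw/2023/10"))  # ['clicks.json', 'sales.csv']
```

&lt;p&gt;With real directories, listing or securing raw/2023/10 only touches that one node, instead of scanning every object name for a matching prefix.&lt;/p&gt;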

&lt;blockquote&gt;
&lt;p&gt;We can store everything in Data Lake Gen2, since it can hold all 3 data classifications: structured, semi-structured, and unstructured.&lt;br&gt;
The data can be web-server log data, relational data, streamed data, etc.&lt;/p&gt;
&lt;/blockquote&gt;

&lt;p&gt;We can process this data with Synapse Analytics, Databricks, or Stream Analytics.&lt;/p&gt;

&lt;h2&gt;
  
  
  Access Tiers
&lt;/h2&gt;

&lt;p&gt;An access tier refers to a data storage option in cloud-based storage services that helps us to manage the cost and performance of our data based on &lt;strong&gt;how frequently we access it.&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Types of Access tiers:&lt;/strong&gt;&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;&lt;p&gt;Hot&lt;br&gt;
 It is optimized for storing data that is frequently accessed.&lt;br&gt;
It offers low latency and is suitable where data is frequently read and written.&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;Cool&lt;br&gt;
  Optimized for storing data that is infrequently accessed and stored for at least 30 days.&lt;br&gt;
An &lt;strong&gt;early deletion fee&lt;/strong&gt; is charged if the data is deleted before then.&lt;/p&gt;&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;&lt;strong&gt;What is the early deletion fee?&lt;/strong&gt;&lt;br&gt;
Suppose we put data in the cool tier but decide on the 15th day, before the 30 days are up, that we no longer need it and delete it.&lt;/p&gt;

&lt;p&gt;In that case we pay a fee.&lt;/p&gt;

&lt;p&gt;This fee is called the early deletion fee.&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Archive Access Tier&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Optimized for storing data that is rarely accessed and stored for at least 180 days.&lt;br&gt;
It offers the lowest storage costs but can have the highest data retrieval times. &lt;/p&gt;

&lt;p&gt;This tier is suitable for long-term storage, such as compliance data, backup archives, and historical records.&lt;/p&gt;
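
&lt;p&gt;The early deletion fee for the cool and archive tiers can be worked out with a little arithmetic. The sketch below assumes the fee is prorated, meaning we are billed for the days remaining in the minimum period (30 days for cool, 180 for archive), and it assumes a 30-day billing month. The price per GB used here is a made-up number, not a real Azure price.&lt;/p&gt;

```python
# Minimum storage periods described in this article.
MINIMUM_DAYS = {"cool": 30, "archive": 180}

def early_deletion_days(tier, days_stored):
    """Days of storage still billed when data leaves a tier early."""
    return max(MINIMUM_DAYS[tier] - days_stored, 0)

def early_deletion_fee(tier, days_stored, size_gb, price_per_gb_month):
    """Prorated fee, assuming a 30-day billing month (an assumption)."""
    remaining = early_deletion_days(tier, days_stored)
    return size_gb * price_per_gb_month * remaining / 30

print(early_deletion_days("cool", 15))     # 15
print(early_deletion_days("archive", 45))  # 135
print(early_deletion_days("cool", 40))     # 0

# 100 GB deleted from cool on day 15 at a hypothetical price of 0.5 per GB per month.
print(early_deletion_fee("cool", 15, 100, 0.5))  # 25.0
```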

&lt;h2&gt;
  
  
  What is rehydration?
&lt;/h2&gt;

&lt;p&gt;To read data from archive storage we must first change the tier to hot or cool.&lt;/p&gt;

&lt;p&gt;This process is called rehydration and can take hours to complete.&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;&lt;p&gt;&lt;strong&gt;Standard priority&lt;/strong&gt;:&lt;br&gt;
The rehydration request will be processed in the order it was received and may take up to 15 hours.&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;&lt;strong&gt;High priority&lt;/strong&gt;:&lt;br&gt;
    The rehydration request will be prioritized over standard requests and may finish in under 1 hour for files under 10 GB in size.&lt;/p&gt;&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;The next article will cover the features of Azure Data Lake Gen2, such as lifecycle policies, security, etc. &lt;/p&gt;

</description>
      <category>azure</category>
      <category>dataengineering</category>
    </item>
    <item>
      <title>Azure Storage Account service -learning day 2</title>
      <dc:creator>Rashmiranjan Sahoo</dc:creator>
      <pubDate>Wed, 18 Oct 2023 15:59:01 +0000</pubDate>
      <link>https://dev.to/rashmiranjan28/azure-storage-account-service-learning-day-2-22a8</link>
      <guid>https://dev.to/rashmiranjan28/azure-storage-account-service-learning-day-2-22a8</guid>
      <description>&lt;p&gt;Disclaimer: I am publishing these notes as blog posts so that I can revisit the topics and strengthen my fundamentals. I would love it if you found this article helpful.&lt;/p&gt;

&lt;p&gt;Azure Storage account is a Microsoft managed cloud service that provides storage that is highly available, secure, durable, scalable and redundant.&lt;/p&gt;

&lt;p&gt;Within azure there are&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;2 types of storage accounts&lt;/li&gt;
&lt;li&gt;4 types of storage&lt;/li&gt;
&lt;li&gt;4 levels of data redundancy&lt;/li&gt;
&lt;li&gt;3 tiers for stored files&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;&lt;strong&gt;4 types of storage&lt;/strong&gt;&lt;/p&gt;

&lt;ol&gt;
&lt;li&gt;Azure blobs&lt;/li&gt;
&lt;li&gt;Azure files&lt;/li&gt;
&lt;li&gt;Azure Queues&lt;/li&gt;
&lt;li&gt;Azure Tables&lt;/li&gt;
&lt;/ol&gt;

&lt;p&gt;Azure Blob Storage is the most important of these, so we will explore it last.&lt;/p&gt;

&lt;h2&gt;
  
  
  &lt;strong&gt;Azure file Storage&lt;/strong&gt;
&lt;/h2&gt;

&lt;p&gt;Azure Files is a shared network file storage service that gives administrators a way to &lt;strong&gt;access native SMB file shares&lt;/strong&gt; in the cloud.&lt;/p&gt;

&lt;p&gt;Using the SMB protocol, we can connect our local machine to these shares in Azure.&lt;/p&gt;

&lt;h2&gt;
  
  
  &lt;strong&gt;Azure Queue Storage&lt;/strong&gt;
&lt;/h2&gt;

&lt;blockquote&gt;
&lt;p&gt;Azure Queue Storage is a service that allows users to store high volumes of messages, process them asynchronously and consume them when needed.&lt;/p&gt;
&lt;/blockquote&gt;

&lt;p&gt;Here we set a retention period for messages, such as 7 days or a month, as required.&lt;/p&gt;

&lt;p&gt;After that period, the message disappears.&lt;/p&gt;
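
&lt;p&gt;The retention behaviour can be sketched with a toy in-memory queue in Python. This is not the Azure SDK; the RetentionQueue class and its method names are hypothetical and only model the idea that a message is no longer returned once its retention period has passed.&lt;/p&gt;

```python
from datetime import datetime, timedelta

class RetentionQueue:
    """Toy in-memory queue whose messages expire after a retention period."""

    def __init__(self, retention_days=7):
        self.retention = timedelta(days=retention_days)
        self._messages = []  # list of (enqueued_at, body)

    def put(self, body, now):
        self._messages.append((now, body))

    def get_all(self, now):
        """Return bodies whose expiry time is still in the future."""
        return [body for enqueued_at, body in self._messages
                if enqueued_at + self.retention > now]

queue = RetentionQueue(retention_days=7)
start = datetime(2023, 10, 18)
queue.put("order-1", start)
queue.put("order-2", start + timedelta(days=5))

print(queue.get_all(start + timedelta(days=6)))  # ['order-1', 'order-2']
print(queue.get_all(start + timedelta(days=8)))  # ['order-2']
```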

&lt;h2&gt;
  
  
  &lt;strong&gt;Azure table storage&lt;/strong&gt;
&lt;/h2&gt;

&lt;p&gt;Don't be misled by the name: a table here does not mean structured, relational data.&lt;/p&gt;

&lt;blockquote&gt;
&lt;p&gt;Azure Table Storage is a scalable, NoSQL, key-value data storage system that can be used to store large amounts of data in the cloud.&lt;/p&gt;
&lt;/blockquote&gt;

&lt;p&gt;This storage offering has a schema-less design, and each table has rows that are composed of key-value pairs.&lt;/p&gt;

&lt;p&gt;In NoSQL there are also document databases, where data is stored as JSON documents.&lt;/p&gt;

&lt;p&gt;A similar example: MongoDB&lt;/p&gt;
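
&lt;p&gt;To show what schema-less rows of key-value pairs look like, here is a minimal Python sketch that models a table as a dictionary. In Azure Table Storage each row (entity) is identified by a PartitionKey and a RowKey. Everything else here, including the upsert and query_partition helpers and the sample data, is invented for illustration.&lt;/p&gt;

```python
table = {}  # maps (PartitionKey, RowKey) to the entity's properties

def upsert(entity):
    """Insert or replace an entity, keyed by its partition and row keys."""
    key = (entity["PartitionKey"], entity["RowKey"])
    table[key] = entity

def query_partition(partition_key):
    """Return all entities in one partition, ordered by RowKey."""
    return [e for (pk, _), e in sorted(table.items()) if pk == partition_key]

# Rows in the same table do not need the same set of properties.
upsert({"PartitionKey": "customers", "RowKey": "001", "name": "Asha", "city": "Pune"})
upsert({"PartitionKey": "customers", "RowKey": "002", "name": "Ravi", "loyalty_points": 120})
upsert({"PartitionKey": "orders", "RowKey": "A17", "total": 49.5})

for entity in query_partition("customers"):
    extra = sorted(k for k in entity if k not in ("PartitionKey", "RowKey"))
    print(entity["RowKey"], extra)
```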

&lt;h2&gt;
  
  
  &lt;strong&gt;Azure Blob Storage&lt;/strong&gt;
&lt;/h2&gt;

&lt;p&gt;&lt;em&gt;Extra importance for this storage service&lt;/em&gt;&lt;/p&gt;

&lt;blockquote&gt;
&lt;p&gt;Azure Blob Storage is Microsoft Azure's service for storing binary large objects, or blobs, which are typically composed of &lt;strong&gt;unstructured&lt;/strong&gt; data such as text, images, and videos, along with their metadata.&lt;/p&gt;
&lt;/blockquote&gt;

&lt;p&gt;Blobs are stored in directory-like structures called &lt;strong&gt;Containers&lt;/strong&gt;.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Note:&lt;/strong&gt;&lt;br&gt;
All 3 data classifications can be stored in this one place, Blob Storage.&lt;/p&gt;

&lt;p&gt;Those classifications are:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;structured &lt;/li&gt;
&lt;li&gt;semi-structured&lt;/li&gt;
&lt;li&gt;unstructured&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;&lt;strong&gt;Benefits of blobs&lt;/strong&gt;&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;optimized for storing massive amounts of unstructured data&lt;/li&gt;
&lt;li&gt;Text or binary data&lt;/li&gt;
&lt;li&gt;General purpose object storage&lt;/li&gt;
&lt;li&gt;cost efficient&lt;/li&gt;
&lt;li&gt;Provide multiple access tiers.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Note:&lt;br&gt;
     We can cover all of these storage needs with this single Blob Storage service.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Types of storage account&lt;/strong&gt;&lt;br&gt;
When creating a storage account, there are two performance options: &lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Standard&lt;/li&gt;
&lt;li&gt;Premium &lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;These two options are the types of storage account.&lt;/p&gt;

&lt;p&gt;Standard: as the name suggests, it suits use cases with lower I/O needs, with low latency. &lt;/p&gt;

&lt;p&gt;Premium: this has very low latency.&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Very fast data movement from point A to point B.&lt;/li&gt;
&lt;li&gt;High I/O use case.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;When creating a storage account, we have to be very careful about its name.&lt;/p&gt;

&lt;p&gt;This is because the name becomes part of the URL through which other services or accounts connect to this account to perform any operation.&lt;/p&gt;
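
&lt;p&gt;Here is a small Python sketch of how the account name turns into service URLs. It assumes the usual naming rule (3 to 24 characters, lowercase letters and digits only) and the standard public-cloud endpoint pattern. The account name mydatalake2023 is a made-up example.&lt;/p&gt;

```python
def is_valid_account_name(name):
    """Check the naming rule: 3 to 24 characters, lowercase letters and digits only."""
    allowed = set("abcdefghijklmnopqrstuvwxyz0123456789")
    return len(name) in range(3, 25) and set(name).issubset(allowed)

def endpoints(name):
    """Build the per-service URLs that are derived from the account name."""
    if not is_valid_account_name(name):
        raise ValueError("invalid storage account name: " + repr(name))
    services = ["blob", "dfs", "file", "queue", "table"]
    return {svc: f"https://{name}.{svc}.core.windows.net" for svc in services}

urls = endpoints("mydatalake2023")  # hypothetical account name
print(urls["blob"])  # https://mydatalake2023.blob.core.windows.net
print(urls["dfs"])   # https://mydatalake2023.dfs.core.windows.net
print(is_valid_account_name("My_Lake"))  # False
```

&lt;p&gt;Because the name is baked into every URL, and those URLs end up in the configuration of other services, it is worth choosing a clear and stable name up front.&lt;/p&gt;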

&lt;p&gt;The next article will be about Azure Data Lake Gen2.&lt;/p&gt;

</description>
      <category>azure</category>
    </item>
    <item>
      <title>Azure Storage services - Learning Day 1</title>
      <dc:creator>Rashmiranjan Sahoo</dc:creator>
      <pubDate>Tue, 17 Oct 2023 16:20:34 +0000</pubDate>
      <link>https://dev.to/rashmiranjan28/azure-storage-services-learning-day-1-4f12</link>
      <guid>https://dev.to/rashmiranjan28/azure-storage-services-learning-day-1-4f12</guid>
      <description>&lt;p&gt;Hi, here I am sharing what I learn, both to practice and to keep my notes where I can read them whenever I need. &lt;/p&gt;

&lt;p&gt;If it helps you to learn or recall the fundamentals, I would be happy.&lt;/p&gt;

&lt;h2&gt;
  
  
  Cloud
&lt;/h2&gt;

&lt;blockquote&gt;
&lt;p&gt;The practice of using a network of remote servers hosted on the internet to store, manage and process data, rather than a local server or a personal computer.&lt;/p&gt;

&lt;p&gt;In simple terms: hosting something on the internet rather than managing it on your local system.&lt;/p&gt;
&lt;/blockquote&gt;

&lt;p&gt;Cloud providers mainly offer 3 types of service.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;1. IaaS (Infrastructure as a service)&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;IaaS provides the foundational infrastructure elements needed for running applications, such as virtual machines, storage, and networking.&lt;/p&gt;

&lt;p&gt;Users can access and manage these resources remotely, eliminating the need for physical servers and data centers.&lt;/p&gt;

&lt;p&gt;ex: virtual machines on Azure, GCP, AWS.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;2. PaaS (Platform as a service)&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;PaaS offers a platform that includes not only infrastructure but also tools and services for developing, testing, and deploying applications. &lt;/p&gt;

&lt;p&gt;It abstracts much of the underlying infrastructure, allowing developers to focus on writing code rather than managing servers.&lt;/p&gt;

&lt;p&gt;ex: Microsoft Azure App Service, Azure Data factory, Data lake, etc.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;3. SaaS (Software as a service)&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;SaaS delivers software applications over the internet on a subscription basis. &lt;/p&gt;

&lt;p&gt;Users access the software through a web browser, and it is typically hosted and maintained by a third-party provider.&lt;/p&gt;

&lt;p&gt;This model is user-centric, and users do not need to manage the underlying infrastructure or worry about updates and maintenance.&lt;/p&gt;

&lt;p&gt;ex: Microsoft 365.&lt;/p&gt;

&lt;p&gt;There are a lot of services available for storing data in Azure. Anyone can see them in the documentation.&lt;/p&gt;

&lt;p&gt;But mostly we use:&lt;/p&gt;

&lt;blockquote&gt;
&lt;ol&gt;
&lt;li&gt;Azure storage account&lt;/li&gt;
&lt;li&gt;Azure SQL&lt;/li&gt;
&lt;li&gt;Azure Data Lake&lt;/li&gt;
&lt;li&gt;Azure cosmos DB&lt;/li&gt;
&lt;/ol&gt;
&lt;/blockquote&gt;

&lt;p&gt;&lt;strong&gt;Why do we need this?&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;To answer this question, let's go back to the 90's,&lt;br&gt;
when everything was stored in relational databases and the size of data was small.&lt;/p&gt;

&lt;p&gt;At that time, we used RDBMS systems for storing and processing data.&lt;/p&gt;

&lt;p&gt;But in the decades that followed, a lot of new technology came into the world.&lt;/p&gt;

&lt;p&gt;The world explored many different technologies and slowly started generating data in huge amounts.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;So, this is where big data arises.&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;Then the &lt;strong&gt;3 V's&lt;/strong&gt; come into play, which describe the key characteristics or challenges associated with big data.&lt;/p&gt;

&lt;ol&gt;
&lt;li&gt;&lt;p&gt;Velocity --&amp;gt; every 1 sec, every 1 hr&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;Variety --&amp;gt; structured, semi-structured, unstructured&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;Volume --&amp;gt; 5 GB, 10 GB, 30 GB, 1 TB&lt;/p&gt;&lt;/li&gt;
&lt;/ol&gt;

&lt;p&gt;Any one of these on its own can't be called big data.&lt;/p&gt;

&lt;p&gt;It is the combination of all 3:&lt;/p&gt;

&lt;blockquote&gt;
&lt;p&gt;If 5 GB, 10 GB, or any large volume of different varieties of data is generated every second, hour, or day, then we can call it big data. &lt;/p&gt;
&lt;/blockquote&gt;

&lt;p&gt;&lt;strong&gt;What is data classification (remember this vocabulary)?&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;1. Structured data&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;This data has a schema, such as rows and columns.&lt;/p&gt;

&lt;p&gt;Mostly in tabular format.&lt;br&gt;
 ex: SQL tables, CSV, spreadsheets&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;2. Semi-structured data&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;NoSQL, key-value pairs, JSON.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;3. Unstructured data&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;Media files, office files, text files, log files.&lt;/p&gt;
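
&lt;p&gt;The three classifications can be seen side by side in a short Python sketch. The sample records and the log line are invented for the example. The point is only how much schema each form carries.&lt;/p&gt;

```python
import csv
import io
import json

# Structured: fixed columns, every row has the same fields.
structured = "id,name,city\n1,Asha,Pune\n2,Ravi,Delhi\n"
rows = list(csv.DictReader(io.StringIO(structured)))

# Semi-structured: self-describing keys, and fields can vary per record.
semi_structured = '[{"id": 1, "name": "Asha", "tags": ["new"]}, {"id": 2, "name": "Ravi"}]'
records = json.loads(semi_structured)

# Unstructured: no schema at all, so we can only treat it as raw text or bytes.
unstructured = "2023-10-17 16:20:34 user Asha logged in from Pune"

print(rows[0]["city"])            # Pune
print(sorted(records[0]))         # ['id', 'name', 'tags']
print(sorted(records[1]))         # ['id', 'name']
print(len(unstructured.split()))  # 8
```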

&lt;p&gt;The next article will be about Azure storage services.&lt;/p&gt;

</description>
      <category>azure</category>
    </item>
  </channel>
</rss>
