<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:dc="http://purl.org/dc/elements/1.1/">
  <channel>
    <title>DEV Community: Andre Yai</title>
    <description>The latest articles on DEV Community by Andre Yai (@andreyai).</description>
    <link>https://dev.to/andreyai</link>
    <image>
      <url>https://media2.dev.to/dynamic/image/width=90,height=90,fit=cover,gravity=auto,format=auto/https:%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Fuser%2Fprofile_image%2F888155%2F09abd4c7-485e-4014-8f0e-2c9e899ab413.jpeg</url>
      <title>DEV Community: Andre Yai</title>
      <link>https://dev.to/andreyai</link>
    </image>
    <atom:link rel="self" type="application/rss+xml" href="https://dev.to/feed/andreyai"/>
    <language>en</language>
    <item>
      <title>Conectando Athena com Amazon S3</title>
      <dc:creator>Andre Yai</dc:creator>
      <pubDate>Sat, 30 Mar 2024 22:21:22 +0000</pubDate>
      <link>https://dev.to/andreyai/conectando-athena-com-amazon-s3-fae</link>
      <guid>https://dev.to/andreyai/conectando-athena-com-amazon-s3-fae</guid>
      <description>&lt;p&gt;Atualmente os cientistas de dados lidam com volumes cada vez maiores de dados, a integração entre o Amazon Athena e o Amazon S3 oferece uma solução poderosa e eficiente. Com a capacidade de executar consultas SQL em gigabytes ou até petabytes de dados armazenados no S3, os cientistas de dados podem extrair insights valiosos de maneira rápida e escalável.&lt;/p&gt;

&lt;p&gt;O Amazon S3 serve como um repositório seguro e durável para os dados brutos, enquanto o Amazon Athena fornece uma camada de consulta simplificada, permitindo que os cientistas de dados executem consultas ad-hoc e análises complexas sem a necessidade de configurar ou gerenciar infraestrutura. Isso libera tempo e recursos para se concentrar na análise e na descoberta de padrões nos dados.&lt;/p&gt;

&lt;p&gt;Com a capacidade de lidar com conjuntos de dados de qualquer tamanho, o Amazon Athena é uma ferramenta essencial no arsenal de qualquer cientista de dados em busca de agilidade e flexibilidade. A simplicidade de escrever consultas SQL padrão no console do Athena facilita a exploração e a análise de dados complexos, permitindo descobertas significativas.&lt;/p&gt;

&lt;p&gt;Para cientistas de dados que buscam uma solução escalável, ágil e econômica para suas necessidades de análise de dados, a integração entre o Amazon Athena e o Amazon S3 oferece um caminho claro para desbloquear o potencial dos dados armazenados na nuvem.&lt;/p&gt;

&lt;p&gt;Uma vez tendo acesso a conta da AWS e aos serviços mencionados acima. As etapas para este processo geralmente envolve:&lt;br&gt;
1 - Inserir os dados para o bucket do S3. &lt;br&gt;
2 - Verificar os tipos dos dados dos dados no S3.&lt;br&gt;
3 - Criar o comando SQL para criar a tabela no Athena que referenciará os dados do S3.&lt;/p&gt;

&lt;p&gt;Abaixo vamos supor que você possui a tarefa de analisar os dados de uma cafeteria para criar um modelo de previsão de demanda de um determinado item.&lt;/p&gt;

&lt;p&gt;1 - Para isto pegamos os dados da fonte. Neste caso vou utilizar o dado do kaggle. &lt;br&gt;
&lt;a href="https://www.kaggle.com/datasets/divu2001/coffee-shop-sales-analysis"&gt;Kaggle Coffee Shop Sales Analysis&lt;/a&gt;&lt;br&gt;
2 - Irei inserir estes dados no bucket do S3.&lt;br&gt;
3 - Analisamos os tipos dos dados. Uma forma de realizar isto é usando o script abaixo.&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight plaintext"&gt;&lt;code&gt;import pandas as pd
df = pd.read_csv("./Project.csv")
# obtendo o data shape
print(df.dtypes)
# salvando no S3 e removendo o header e o index do dado
df.to_csv("s3://ay-ds-projects/projects/coffee_shop/Project.csv",headers=None, index=None)
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;Atraves dele obtemos o seguinte resultado.&lt;br&gt;
&lt;a href="https://media.dev.to/cdn-cgi/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fgvsn35wl8y2rkddu3pqs.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media.dev.to/cdn-cgi/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fgvsn35wl8y2rkddu3pqs.png" alt="Tipo dos dados" width="800" height="354"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;4 - Criaremos uma tabela no Athena com referencia ao dado no S3. Para isto podemos utilizarmos os seguintes datatypes &lt;a href="https://docs.aws.amazon.com/athena/latest/ug/data-types.html"&gt;https://docs.aws.amazon.com/athena/latest/ug/data-types.html&lt;/a&gt;&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight plaintext"&gt;&lt;code&gt;CREATE EXTERNAL TABLE IF NOT EXISTS `ds_project`.`coffee_shop` (
  `transaction_id` int,
  `transaction_date` char(30),
  `transaction_time` char(30),
  `store_id` int,
  `store_location` char(100),
  `product_id` int,
  `transaction_qty` int,
  `unit_price` float,
  `Total_Bill` float,
  `product_category` char(100),
  `product_type` char(100),
  `product_detail` char(100),
  `Size` char(40),
  `Month_Name` char(30),
  `Day_Name` char(30),
  `Hour` int,
  `Month` int,
  `Day_of_Week` int
)
ROW FORMAT SERDE 'org.apache.hadoop.hive.serde2.lazy.LazySimpleSerDe'
WITH SERDEPROPERTIES ('field.delim' = ',')
STORED AS INPUTFORMAT 'org.apache.hadoop.mapred.TextInputFormat' OUTPUTFORMAT 'org.apache.hadoop.hive.ql.io.HiveIgnoreKeyTextOutputFormat'
LOCATION 's3://ay-ds-projects/projects/coffee_shop/'
TBLPROPERTIES ('classification' = 'csv');
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;Em seguida pode realizar as operações SQL no Athena para obter informações sobre os dados.&lt;/p&gt;

</description>
      <category>datascience</category>
      <category>athena</category>
      <category>s3</category>
    </item>
    <item>
      <title>Steps of Big Data Pipeline</title>
      <dc:creator>Andre Yai</dc:creator>
      <pubDate>Thu, 28 Dec 2023 19:15:08 +0000</pubDate>
      <link>https://dev.to/andreyai/steps-of-big-data-pipeline-3g1a</link>
      <guid>https://dev.to/andreyai/steps-of-big-data-pipeline-3g1a</guid>
      <description>&lt;p&gt;&lt;a href="https://res.cloudinary.com/practicaldev/image/fetch/s--6S-0ZsI3--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_800/https://dev-to-uploads.s3.amazonaws.com/uploads/articles/xflsrz4lw68dsfg1jaqr.png" class="article-body-image-wrapper"&gt;&lt;img src="https://res.cloudinary.com/practicaldev/image/fetch/s--6S-0ZsI3--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_800/https://dev-to-uploads.s3.amazonaws.com/uploads/articles/xflsrz4lw68dsfg1jaqr.png" alt="Image description" width="303" height="166"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;With the increase in computational and storage power, companies have been collecting more data than ever. This leading the need for new tasks and job opportunities. In order to extract value from data companies should rely on data pipelines. These pipelines consist of stages like collection, storage, process, and analyzing data.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Collection&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;This step is responsible for ingesting data from different sources to use them for later analysis. This data comes mainly from real-time and batch sources. &lt;/p&gt;

&lt;p&gt;In real-time platforms, we have those who produce data (Producers) and those who consume data (Consumers). Usually, an example of it would be what Netflix and Spotify use to send their data to millions of users. Examples of  streaming include services like &lt;strong&gt;Kafka, AWS Kinesis, AWS SQS&lt;/strong&gt;.&lt;/p&gt;

&lt;p&gt;Batch collection step may involves migrating data from an existing database. For example ingest data from a transactional database like &lt;strong&gt;RDS, PostgreSQL, MySQL, Oracle, Aurora&lt;/strong&gt; to a data lakes or data warehouses like &lt;strong&gt;AWS Redshift&lt;/strong&gt;. For that in AWS, you can use the &lt;strong&gt;AWS Data Migration Service&lt;/strong&gt;.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Storage&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;Once we collect our data it will need a place to be stored. In this service, by knowing their frequency and need we can control data lifecycle. This goes from getting more frequent data to archiving or deleting them. &lt;br&gt;
Some services that help with that would be &lt;strong&gt;AWS S3&lt;/strong&gt;.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Process&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;This step deals with ETL which involves the process of cleaning, enriching, and transforming raw data into a more sophisticated layer. &lt;br&gt;
Some services that help with that would be &lt;strong&gt;AWS Glue, AWS EMR, AWS Lambda&lt;/strong&gt;.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Governance&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;Data governance consists of data management, data quality, and data stewardship. This helps to manage policies to access data, data discovery, data accuracy, validation, and completeness. &lt;br&gt;
Some services that help with them are &lt;strong&gt;AWS Glue Catalog, AWS LakeFormation&lt;/strong&gt;.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Analyze&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;This part is responsible for extracting value from data by performing data analysis, machine learning, and data visualization. This consists in extracting meaning from data by showing how it is organized, grouping, and predicting it.&lt;br&gt;
Some services that help with that would be &lt;strong&gt;AWS Sagemaker, AWS QuickSight&lt;/strong&gt;.&lt;/p&gt;

</description>
      <category>bigdata</category>
      <category>aws</category>
      <category>datalake</category>
    </item>
    <item>
      <title>AWS Certifications Path for Data Professionals</title>
      <dc:creator>Andre Yai</dc:creator>
      <pubDate>Wed, 06 Jul 2022 21:16:07 +0000</pubDate>
      <link>https://dev.to/andreyai/aws-certifications-path-for-data-professionals-15o5</link>
      <guid>https://dev.to/andreyai/aws-certifications-path-for-data-professionals-15o5</guid>
      <description>&lt;p&gt;Guide and Tips for AWS Certifications&lt;/p&gt;

&lt;p&gt;&lt;a href="https://res.cloudinary.com/practicaldev/image/fetch/s--UkRsnjTL--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_880/https://home.pearsonvue.com/getattachment/Clients/Amazon-Web-Services/TnC_Certification-Framework-white_1300x960.png.aspx%3Flang%3Den-US" class="article-body-image-wrapper"&gt;&lt;img src="https://res.cloudinary.com/practicaldev/image/fetch/s--UkRsnjTL--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_880/https://home.pearsonvue.com/getattachment/Clients/Amazon-Web-Services/TnC_Certification-Framework-white_1300x960.png.aspx%3Flang%3Den-US" alt="AWS Certifications" width="880" height="650"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;h1&gt;
  
  
  AWS Certification Path for Data Professionals
&lt;/h1&gt;

&lt;h2&gt;
  
  
  Sections
&lt;/h2&gt;

&lt;ul&gt;
&lt;li&gt;Intro&lt;/li&gt;
&lt;li&gt;Certifications&lt;/li&gt;
&lt;li&gt;AWS Practitioner&lt;/li&gt;
&lt;li&gt;AWS Data Analytics Specialty&lt;/li&gt;
&lt;li&gt;AWS Machine Learning Specialty&lt;/li&gt;
&lt;li&gt;Others certifications&lt;/li&gt;
&lt;li&gt;Some Tips&lt;/li&gt;
&lt;/ul&gt;

&lt;h2&gt;
  
  
  Intro
&lt;/h2&gt;

&lt;p&gt;Nowadays, many companies are migrating from their on-premise stations to the cloud. As such, AWS has become one of the main and most adopted cloud providers followed by Azure and GCP.&lt;/p&gt;

&lt;p&gt;&lt;a href="https://res.cloudinary.com/practicaldev/image/fetch/s--JEslwznA--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_880/https://cdn.statcdn.com/Infographic/images/normal/18819.jpeg" class="article-body-image-wrapper"&gt;&lt;img src="https://res.cloudinary.com/practicaldev/image/fetch/s--JEslwznA--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_880/https://cdn.statcdn.com/Infographic/images/normal/18819.jpeg" alt="Statita ckoud service providers" width="880" height="880"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;Therefore getting an AWS Certification can lead you to new and sometimes better job opportunities.&lt;/p&gt;

&lt;p&gt;For the most part in AWS certifications, you don't need to follow a particular order. The exceptions are AWS Professional Architect and AWS Professional DevOps Engineer.&lt;/p&gt;

&lt;p&gt;I had some experience taking AWS Certifications. I had gotten certifications in AWS Developer Associate and AWS Machine Learning Specialty nd plan to get more in the near future.&lt;/p&gt;

&lt;p&gt;Here I will list some certifications that AWS offers related to Data Professionals are taking and later some tips that I think may help you during this journey.&lt;/p&gt;

&lt;h2&gt;
  
  
  Certifications
&lt;/h2&gt;

&lt;ul&gt;
&lt;li&gt;AWS Practitioner&lt;/li&gt;
&lt;li&gt;AWS Data Analytics Specialty&lt;/li&gt;
&lt;li&gt;AWS Machine Learning Specialty&lt;/li&gt;
&lt;/ul&gt;

&lt;h2&gt;
  
  
  AWS Practitioner
&lt;/h2&gt;

&lt;p&gt;&lt;a href="https://aws.amazon.com/certification/certified-cloud-practitioner/"&gt;&lt;br&gt;
&lt;img src="https://res.cloudinary.com/practicaldev/image/fetch/s--LuG8WqRe--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_880/https://d1.awsstatic.com/training-and-certification/certification-badges/AWS-Certified-Cloud-Practitioner_badge.634f8a21af2e0e956ed8905a72366146ba22b74c.png" alt="AWS Practitioner" width="300" height="300"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;Cloud Practitioner&lt;br&gt;
&lt;/p&gt;

&lt;p&gt;About: This is a starter point for all AWS certifications, a more general certification. Although this isn't a certification focus in Data, it is important for those who want to know more about cloud fundamentals. This credential helps organizations identify and develop talent with critical knowledge related to implementing cloud initiatives. Earning AWS Certified Cloud Practitioner validates cloud fluency and foundational AWS knowledge.&lt;/p&gt;

&lt;p&gt;Price: 100 USD&lt;/p&gt;

&lt;h3&gt;
  
  
  Sections:
&lt;/h3&gt;

&lt;p&gt;&lt;a href="https://d1.awsstatic.com/training-and-certification/docs-cloud-practitioner/AWS-Certified-Cloud-Practitioner_Exam-Guide.pdf"&gt;Exam Guide Line&lt;/a&gt;&lt;/p&gt;

&lt;div class="table-wrapper-paragraph"&gt;&lt;table&gt;
&lt;thead&gt;
&lt;tr&gt;
&lt;th&gt;Domain&lt;/th&gt;
&lt;th&gt;% of Exam&lt;/th&gt;
&lt;/tr&gt;
&lt;/thead&gt;
&lt;tbody&gt;
&lt;tr&gt;
&lt;td&gt;Domain 1: Cloud Concepts&lt;/td&gt;
&lt;td&gt;26%&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Domain 2: Security and Compliance&lt;/td&gt;
&lt;td&gt;25%&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Domain 3: Technology&lt;/td&gt;
&lt;td&gt;33%&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Domain 4: Billing and Pricing&lt;/td&gt;
&lt;td&gt;16%&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Total&lt;/td&gt;
&lt;td&gt;100%&lt;/td&gt;
&lt;/tr&gt;
&lt;/tbody&gt;
&lt;/table&gt;&lt;/div&gt;

&lt;p&gt;Main Services: EC2, IAM, RDS, Lambda, S3, Route 53&lt;/p&gt;

&lt;h3&gt;
  
  
  Courses:
&lt;/h3&gt;

&lt;p&gt;&lt;a href="https://www.udemy.com/course/aws-certified-cloud-practitioner-new/"&gt;[NEW] Ultimate AWS Certified Cloud Practitioner - 2022&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;&lt;a href="https://www.udemy.com/course/aws-certified-cloud-practitioner-practice-exams-amazon/"&gt;AWS Certified Cloud Practitioner Practice Exams&lt;/a&gt;&lt;/p&gt;




&lt;h2&gt;
  
  
  AWS Data Analytics Specialty
&lt;/h2&gt;

&lt;p&gt;&lt;a href="https://aws.amazon.com/certification/data-analytics-specialty/"&gt;&lt;br&gt;
&lt;img src="https://res.cloudinary.com/practicaldev/image/fetch/s--uCuvzPwb--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_880/https://d1.awsstatic.com/training-and-certification/certification-badges/AWS-Certified-Data-Analytics-Specialty_badge.c74203ecf6d7c4889d01d8b4ba5c9df6e69c6f43.png" alt="Aws Data Analytics" width="300" height="300"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;AWS Data Analytics Specialty&lt;br&gt;
&lt;/p&gt;

&lt;p&gt;About: This certification is more suitable for Data Engineers. This credential helps organizations identify and develop talent with critical skills for implementing cloud initiatives. Earning AWS Certified Data Analytics - Specialty validates expertise in using AWS data lakes and analytics services to get insights from data.&lt;/p&gt;

&lt;p&gt;Price: 300 USD&lt;/p&gt;

&lt;h3&gt;
  
  
  Sections
&lt;/h3&gt;

&lt;p&gt;&lt;a href="https://d1.awsstatic.com/training-and-certification/docs-data-analytics-specialty/AWS-Certified-Data-Analytics-Specialty_Exam-Guide.pdf"&gt;Data Analytics Exam Guide&lt;/a&gt;&lt;/p&gt;

&lt;div class="table-wrapper-paragraph"&gt;&lt;table&gt;
&lt;thead&gt;
&lt;tr&gt;
&lt;th&gt;Domain&lt;/th&gt;
&lt;th&gt;% of Exam&lt;/th&gt;
&lt;/tr&gt;
&lt;/thead&gt;
&lt;tbody&gt;
&lt;tr&gt;
&lt;td&gt;1. Collection&lt;/td&gt;
&lt;td&gt;18%&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;2. Storage&lt;/td&gt;
&lt;td&gt;22%&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;3. Processing&lt;/td&gt;
&lt;td&gt;24%&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;4. Analysis and Visualization&lt;/td&gt;
&lt;td&gt;18%&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;5. Security&lt;/td&gt;
&lt;td&gt;18%&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Total&lt;/td&gt;
&lt;td&gt;100%&lt;/td&gt;
&lt;/tr&gt;
&lt;/tbody&gt;
&lt;/table&gt;&lt;/div&gt;

&lt;p&gt;Main Services: Kinesis, Athena, Redshift, EMR, Glue, Lambda, Step Functions, S3, DynamoDB, QuickSight&lt;/p&gt;

&lt;h3&gt;
  
  
  Courses
&lt;/h3&gt;

&lt;p&gt;&lt;a href="https://www.udemy.com/course/aws-data-analytics/"&gt;AWS Certified Data Analytics Specialty 2022 - Hands On!&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;&lt;a href="https://www.udemy.com/course/aws-certified-data-analytics-specialty-practice-exams-amazon/"&gt;AWS Certified Data Analytics Specialty Practice Exams&lt;/a&gt;&lt;/p&gt;




&lt;h2&gt;
  
  
  AWS Machine Learning Specialty
&lt;/h2&gt;

&lt;p&gt;&lt;a href="https://aws.amazon.com/certification/certified-machine-learning-specialty/"&gt;&lt;br&gt;
&lt;img src="https://res.cloudinary.com/practicaldev/image/fetch/s--HPzZGZal--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_880/https://d1.awsstatic.com/training-and-certification/certification-badges/AWS-Certified-Machine-Learning-Specialty_badge.e5d66b56552bbf046f905bacaecef6dad0ae7180.png" alt="AWS Machine Learning Specialty" width="300" height="300"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;AWS Machine Learning Specialty&lt;br&gt;
&lt;/p&gt;

&lt;p&gt;About: This certification is more suitable for Data Scientists and Machine Learning Engineers. This credential helps organizations identify and develop talent with critical skills for implementing cloud initiatives. Earning AWS Certified Machine Learning - Specialty validates expertise in building, training, tuning, and deploying machine learning (ML) models on AWS.&lt;/p&gt;

&lt;p&gt;Price: 300 USD&lt;/p&gt;

&lt;h3&gt;
  
  
  Sections
&lt;/h3&gt;

&lt;p&gt;&lt;a href="https://d1.awsstatic.com/training-and-certification/docs-ml/AWS-Certified-Machine-Learning-Specialty_Exam-Guide.pdf"&gt;Exam Guide&lt;/a&gt;&lt;/p&gt;

&lt;div class="table-wrapper-paragraph"&gt;&lt;table&gt;
&lt;thead&gt;
&lt;tr&gt;
&lt;th&gt;Domain&lt;/th&gt;
&lt;th&gt;% of Exam&lt;/th&gt;
&lt;/tr&gt;
&lt;/thead&gt;
&lt;tbody&gt;
&lt;tr&gt;
&lt;td&gt;1. Data Engineering&lt;/td&gt;
&lt;td&gt;20%&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;2. Data Analysis&lt;/td&gt;
&lt;td&gt;24%&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;3. Modelling&lt;/td&gt;
&lt;td&gt;36%&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;4. Machine Learning Implementation and Operations&lt;/td&gt;
&lt;td&gt;20%&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Total&lt;/td&gt;
&lt;td&gt;100%&lt;/td&gt;
&lt;/tr&gt;
&lt;/tbody&gt;
&lt;/table&gt;&lt;/div&gt;

&lt;p&gt;Some AWS Services for this exam are: Sagemaker, S3, Glue and EMR. Be aware that you need some previous knowledge how machine learning works (building, training, tuning, validation, deployment).&lt;/p&gt;

&lt;h3&gt;
  
  
  Courses
&lt;/h3&gt;

&lt;p&gt;&lt;a href="https://www.udemy.com/course/aws-machine-learning/"&gt;AWS Certified Machine Learning Specialty 2022 - Hands On!&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;&lt;a href="https://www.udemy.com/course/aws-certified-machine-learning-specialty-practice-exams-amazon/"&gt;AWS Certified Machine Learning Specialty Practice Exams&lt;/a&gt;&lt;/p&gt;




&lt;h2&gt;
  
  
  Others certifications:
&lt;/h2&gt;

&lt;p&gt;Other certifications that I would recommend are AWS Developer and AWS Solutions Architecture.&lt;/p&gt;

&lt;p&gt;&lt;a href="https://aws.amazon.com/certification/certified-developer-associate/"&gt;AWS Developer&lt;/a&gt; is a certification that helps organizations identify and develop talent with critical skills for implementing cloud initiatives. Earning AWS Certified Developer - Associate validates the ability to write and deploy cloud-based applications&lt;/p&gt;

&lt;p&gt;&lt;a href="https://aws.amazon.com/certification/certified-solutions-architect-associate/"&gt;AWS Solutions Architeture&lt;/a&gt; is a certification that helps organizations identify and develop talent with critical skills for implementing cloud initiatives. Earning AWS Certified Solutions Architect - Associate validates the ability to design and implement distributed systems on AWS.&lt;/p&gt;

&lt;p&gt;You can look more certification in &lt;a href="https://aws.amazon.com/certification/exams/"&gt;AWS Certification Portal Page&lt;/a&gt;.&lt;/p&gt;

&lt;h2&gt;
  
  
  Some Tips
&lt;/h2&gt;

&lt;ul&gt;
&lt;li&gt;Read the Certification Exam Guide.&lt;/li&gt;
&lt;li&gt;Watch and take notes from the Udemy courses. Other great source is AWS Training and reading AWS Whitepapers.&lt;/li&gt;
&lt;li&gt;Create a AWS Account.

&lt;ul&gt;
&lt;li&gt;if it is first time using an AWS account look for free tiers and for services that you can use free in the first months.&lt;/li&gt;
&lt;/ul&gt;


&lt;/li&gt;
&lt;li&gt;Pratice with some of AWS Services. Be careful with the time spent and computation it can become very expensive if you leave it running.&lt;/li&gt;
&lt;li&gt;Do some mock tests. Like from Dojo Test and AWS Samples Tests.&lt;/li&gt;
&lt;li&gt;Register your certification.

&lt;ul&gt;
&lt;li&gt;You can do it &lt;a href="https://home.pearsonvue.com/Clients/Amazon-Web-Services.aspx"&gt;online or in a registered center&lt;/a&gt; . There you can choose which exam you would like to do and also the date and time slot. In most of AWS Certifications exam you have more or less 3 hours to complete. And most of them are composed by 65 to 70 multiple choice questions.&lt;/li&gt;
&lt;/ul&gt;


&lt;/li&gt;
&lt;li&gt;Create an AWS Account.&lt;/li&gt;
&lt;li&gt;If it is the first time using an AWS account , you should look for free tiers and services that you can use free in the first months.&lt;/li&gt;
&lt;li&gt;Practice with some of AWS Services. Be careful with the time spent and computation can become very expensive if you leave it running.&lt;/li&gt;
&lt;li&gt;Do and redo some mock tests. Like from Dojo Test and AWS Samples Tests. Some similar questions may be on your exam.&lt;/li&gt;
&lt;li&gt;Register your certification. You can do it online or in a registered center. There you can choose which exam you would like to take and the date and time slot.&lt;/li&gt;
&lt;li&gt;In most AWS Certifications exams you have more or less 3 hours to complete.&lt;/li&gt;
&lt;li&gt;And most of them are composed of 65 to 70 multiple-choice questions.&lt;/li&gt;
&lt;li&gt;To pass you should get a mark greater than 75%.&lt;/li&gt;
&lt;li&gt;If English isn't your first language, you can apply for an additional 30 min for your exam.&lt;/li&gt;
&lt;li&gt;If you already have an AWS certification, you can use a promotion code and pay 50% less for your next exam certification.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;3 days previous exam date:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Be sure that you know everything that will be in your exam test.&lt;/li&gt;
&lt;li&gt;It is a good idea to redo a Mock test. And check if the grade is good enough to pass the exam. If not, you should consider rescheduling it.&lt;/li&gt;
&lt;li&gt;Be careful that you cannot reschedule it if you have less than 48 hours before the exam.&lt;/li&gt;
&lt;li&gt;If you are doing it online, it's a good idea to test your connectivity and exam software. Make sure everything is properly fine.&lt;/li&gt;
&lt;li&gt;If you are doing it in a test center, you should check how to go there.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Before the exam:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Review everything you need to know to be successful in this exam.&lt;/li&gt;
&lt;li&gt;Connect to your exam time slot early. Clean your workstation. There will be a proctor watching you. Make sure your computer notifications are off and your internet speed is good.&lt;/li&gt;
&lt;li&gt;Be calm and everything will be fine. Good Luck!!&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;You can follow me on LinkedIn: &lt;a href="https://www.linkedin.com/in/andre-yai/"&gt;https://www.linkedin.com/in/andre-yai/&lt;/a&gt;&lt;/p&gt;

&lt;h2&gt;
  
  
  References
&lt;/h2&gt;

&lt;p&gt;&lt;a href="https://www.statista.com/chart/18819/worldwide-market-share-of-leading-cloud-infrastructure-service-providers/"&gt;Statita Cloud Share&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;&lt;a href="https://aws.amazon.com/certification/exams/"&gt;AWS Certification Portal Page&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;&lt;a href="https://home.pearsonvue.com/Clients/Amazon-Web-Services.aspx"&gt;Pearson Vue Test Center&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;&lt;a href="https://www.udemy.com/course/aws-certified-cloud-practitioner-new/"&gt;[NEW] Ultimate AWS Certified Cloud Practitioner - 2022&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;&lt;a href="https://www.udemy.com/course/aws-certified-cloud-practitioner-practice-exams-amazon/"&gt;AWS Certified Cloud Practitioner Practice Exams&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;&lt;a href="https://www.udemy.com/course/aws-data-analytics/"&gt;AWS Certified Data Analytics Specialty 2022 - Hands On!&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;&lt;a href="https://www.udemy.com/course/aws-certified-data-analytics-specialty-practice-exams-amazon/"&gt;AWS Certified Data Analytics Specialty Practice Exams&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;&lt;a href="https://www.udemy.com/course/aws-machine-learning/"&gt;AWS Certified Machine Learning Specialty 2022 - Hands On!&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;&lt;a href="https://www.udemy.com/course/aws-certified-machine-learning-specialty-practice-exams-amazon/"&gt;AWS Certified Machine Learning Specialty Practice Exams&lt;/a&gt;&lt;/p&gt;

</description>
      <category>aws</category>
      <category>datascience</category>
      <category>machinelearning</category>
      <category>analytics</category>
    </item>
  </channel>
</rss>
