<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:dc="http://purl.org/dc/elements/1.1/">
  <channel>
    <title>DEV Community: Rahmah Abubakar</title>
    <description>The latest articles on DEV Community by Rahmah Abubakar (@rahmah_abubakar_d1754f0b3).</description>
    <link>https://dev.to/rahmah_abubakar_d1754f0b3</link>
    <image>
      <url>https://media2.dev.to/dynamic/image/width=90,height=90,fit=cover,gravity=auto,format=auto/https:%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Fuser%2Fprofile_image%2F3425222%2F83acda48-0d4c-4d0c-86fc-dd52733cd860.png</url>
      <title>DEV Community: Rahmah Abubakar</title>
      <link>https://dev.to/rahmah_abubakar_d1754f0b3</link>
    </image>
    <atom:link rel="self" type="application/rss+xml" href="https://dev.to/feed/rahmah_abubakar_d1754f0b3"/>
    <language>en</language>
    <item>
      <title>No ArcGIS, No QGIS—Just Python and a Problem Worth Solving</title>
      <dc:creator>Rahmah Abubakar</dc:creator>
      <pubDate>Sun, 10 Aug 2025 15:33:27 +0000</pubDate>
      <link>https://dev.to/rahmah_abubakar_d1754f0b3/no-arcgis-no-qgis-just-python-and-a-problem-worth-solving-30n7</link>
      <guid>https://dev.to/rahmah_abubakar_d1754f0b3/no-arcgis-no-qgis-just-python-and-a-problem-worth-solving-30n7</guid>
      <description>&lt;p&gt;This was 2024 project I did during my internship with HNG TECH focused on data cleansing and anomaly detection in electoral datasets. I recently revisited it to document the methodology and insights more clearly, as part of building my professional portfolio and showcasing real-world analytical impact.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Section 1&lt;/strong&gt;: Why This Matters**  &lt;/p&gt;

&lt;blockquote&gt;
&lt;p&gt;Electoral credibility is foundational to democracy. In this project, I used Python to analyze polling unit-level data from Zamfara State, identifying statistical anomalies that could signal irregularities, data entry errors, or procedural lapses. This is a lightweight, reproducible approach to election auditing—no GIS tools required.&lt;/p&gt;
&lt;/blockquote&gt;

&lt;p&gt;&lt;strong&gt;Section 2&lt;/strong&gt;: Methodology Overview**&lt;br&gt;&lt;br&gt;
📥 Data Source  &lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;CSV file with polling unit-level results
&lt;/li&gt;
&lt;li&gt;Fields: Latitude, Longitude, Registered Voters, Accredited Voters, Votes per Party, PU Metadata.&lt;/li&gt;
&lt;li&gt;Excel was used to manually clean and organize the geospatial coordinates (longitude and latitude)&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;&lt;strong&gt;Outlier Detection&lt;/strong&gt;  &lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Calculated outlier scores using z-scores and domain heuristics
&lt;/li&gt;
&lt;li&gt;Ranked polling units by anomaly severity per party&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;&lt;strong&gt;Section 2: Key Findings&lt;/strong&gt;  &lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;PU 108 &amp;amp; PU 104 (Birnin Magaji)&lt;/strong&gt;: High outlier scores for APC and PDP
&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;PU 321 &amp;amp; PU 124 (Tsafe)&lt;/strong&gt;: Negative transcription counts flagged
&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Geospatial Clustering&lt;/strong&gt;: Anomalies concentrated in specific LGAs
&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Cross-Party Deviations&lt;/strong&gt;: Irregularities not isolated to one party&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;&lt;strong&gt;Section 4: Technical Highlights&lt;/strong&gt;  &lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;No GIS tools—just Python (Pandas, Matplotlib, NumPy)
&lt;/li&gt;
&lt;li&gt;Google Colab for cloud execution
&lt;/li&gt;
&lt;li&gt;Scalable framework for other states or elections&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;&lt;strong&gt;Section 5: Recommendations&lt;/strong&gt;  &lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Audit flagged polling units
&lt;/li&gt;
&lt;li&gt;Fix metadata gaps and transcription errors
&lt;/li&gt;
&lt;li&gt;Integrate anomaly detection into official result 
verification&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;&lt;strong&gt;Closing Thoughts&lt;/strong&gt;  &lt;/p&gt;

&lt;blockquote&gt;
&lt;p&gt;This project taught me how to apply statistical thinking to real-world problems, especially in politically sensitive domains. It’s a reminder that data science isn’t &lt;/p&gt;
&lt;/blockquote&gt;

&lt;p&gt;For full analysis—including code, tables, and transcription breakdowns—see the complete notebook on my LinkedIn page(&lt;a href="https://www.linkedin.com/posts/rahmah-abubakar-243058288_analysis-process-activity-7253521166152163328-zh2n?utm_source=share&amp;amp;utm_medium=member_android&amp;amp;rcm=ACoAAEXGTFMBU8WaWDL8Z4Wl7HzBBVCiQx_eYO4)._" rel="noopener noreferrer"&gt;https://www.linkedin.com/posts/rahmah-abubakar-243058288_analysis-process-activity-7253521166152163328-zh2n?utm_source=share&amp;amp;utm_medium=member_android&amp;amp;rcm=ACoAAEXGTFMBU8WaWDL8Z4Wl7HzBBVCiQx_eYO4)._&lt;/a&gt;&lt;/p&gt;

</description>
      <category>data</category>
      <category>outliers</category>
      <category>datascience</category>
      <category>arcgis</category>
    </item>
  </channel>
</rss>
