DEV Community

Ankit malik


BigQuery public datasets

Introduction

In the world of big data, public datasets are an invaluable resource for data scientists, researchers, and analysts looking to gain insights into a wide range of topics. One of the most popular platforms for accessing public datasets is Google BigQuery, a cloud-based data warehouse that allows users to store, query, and analyze large amounts of data.

BigQuery hosts a large collection of public datasets that can be queried directly from the platform. These datasets cover a wide range of topics, from weather patterns and transportation data to population demographics and financial information.

One of the key benefits of using public datasets in BigQuery is that they are already loaded into tables and ready to query, which means that users can skip much of the time-consuming and often tedious process of collecting, cleaning, and formatting data. Additionally, BigQuery queries these datasets quickly, even at large scale, which can help users identify patterns and insights that would be difficult or impossible to find through other means.
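To make the idea of querying a public dataset directly more concrete, here is a minimal sketch in Python. It assumes the `google-cloud-bigquery` package is installed, that default Google Cloud credentials are configured, and that the well-known sample table `bigquery-public-data.samples.shakespeare` (with `word` and `word_count` columns) is still available. The helper names `public_table`, `top_words_query`, and `run_query` are made up for illustration.

```python
PUBLIC_PROJECT = "bigquery-public-data"  # project that hosts the public datasets


def public_table(dataset: str, table: str) -> str:
    """Return a backtick-quoted, fully qualified table reference."""
    return f"`{PUBLIC_PROJECT}.{dataset}.{table}`"


def top_words_query(limit: int = 10) -> str:
    """Build a query for the most frequent words in the sample table."""
    return (
        "SELECT word, SUM(word_count) AS total\n"
        f"FROM {public_table('samples', 'shakespeare')}\n"
        "GROUP BY word\n"
        "ORDER BY total DESC\n"
        f"LIMIT {int(limit)}"
    )


def run_query(sql: str):
    """Run the SQL in your own project and return the result rows."""
    # Imported lazily so the query builders work without the client installed.
    from google.cloud import bigquery

    client = bigquery.Client()  # uses your default project and credentials
    return list(client.query(sql).result())
```

Calling `run_query(top_words_query(5))` should return five rows whose fields can be read as `row["word"]` and `row["total"]`. The query is billed to your own project, not to the project that hosts the data.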

Browse the public datasets here: https://console.cloud.google.com/marketplace/browse?filter=solution-type:dataset

Some of the most famous public datasets available in BigQuery include:

The New York City Taxi and Limousine Commission (TLC) dataset: This dataset contains over 1 billion taxi trips taken in New York City between 2009 and 2019. It includes information on pickup and dropoff locations, trip duration, and fare amounts, among other variables. This dataset has been used by researchers to study traffic patterns, transportation policy, and economic trends.

Check it here:
https://console.cloud.google.com/marketplace/product/city-of-new-york/nyc-tlc-trips?project=grand-eye-333818
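As an illustration of how the taxi data might be explored, the sketch below builds a query for the average fare per pickup hour and estimates how much data it would scan before running it. The table name `bigquery-public-data.new_york_taxi_trips.tlc_yellow_trips_2018` and the columns `pickup_datetime` and `fare_amount` are assumptions based on how the dataset has been published, so check the schema in the console first. The function names are hypothetical.

```python
def taxi_hourly_fare_query(year: int = 2018) -> str:
    """Average fare and trip count per pickup hour (table name is an assumption)."""
    table = f"`bigquery-public-data.new_york_taxi_trips.tlc_yellow_trips_{int(year)}`"
    return (
        "SELECT EXTRACT(HOUR FROM pickup_datetime) AS pickup_hour,\n"
        "       COUNT(*) AS trips,\n"
        "       ROUND(AVG(fare_amount), 2) AS avg_fare\n"
        f"FROM {table}\n"
        "WHERE fare_amount > 0\n"
        "GROUP BY pickup_hour\n"
        "ORDER BY pickup_hour"
    )


def estimate_bytes(sql: str) -> int:
    """Dry-run the query and return the bytes it would process."""
    # Imported lazily so the query builder works without the client installed.
    from google.cloud import bigquery

    client = bigquery.Client()
    config = bigquery.QueryJobConfig(dry_run=True, use_query_cache=False)
    job = client.query(sql, job_config=config)  # nothing is executed or billed
    return job.total_bytes_processed
```

Because the trip tables are large, a dry run such as `estimate_bytes(taxi_hourly_fare_query())` is a cheap way to see how much data the query would scan before you run it.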

The OpenAQ dataset: This dataset contains air quality data from over 11,000 monitoring stations in more than 100 countries. It includes information on a range of pollutants, such as ozone, particulate matter, and nitrogen dioxide. This dataset has been used to study the impact of air pollution on human health and the environment.

Check it here: https://console.cloud.google.com/marketplace/product/openaq/real-time-air-quality?project=grand-eye-333818
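A comparison of pollutant levels across countries could be sketched as follows. The table `bigquery-public-data.openaq.global_air_quality`, its columns (`country`, `pollutant`, `value`, `unit`), and the pollutant code `pm25` are assumptions to verify against the current schema. The pollutant is passed as a query parameter rather than pasted into the SQL string.

```python
OPENAQ_SQL = """
SELECT country, unit, COUNT(*) AS readings, ROUND(AVG(value), 2) AS avg_value
FROM `bigquery-public-data.openaq.global_air_quality`
WHERE pollutant = @pollutant AND value >= 0
GROUP BY country, unit
ORDER BY avg_value DESC
LIMIT 20
""".strip()


def run_openaq(pollutant: str = "pm25"):
    """Run the parameterized query and return the result rows."""
    # Imported lazily so the SQL text can be inspected without the client installed.
    from google.cloud import bigquery

    client = bigquery.Client()
    config = bigquery.QueryJobConfig(
        query_parameters=[
            bigquery.ScalarQueryParameter("pollutant", "STRING", pollutant)
        ]
    )
    return list(client.query(OPENAQ_SQL, job_config=config).result())
```

Grouping by `unit` as well as `country` keeps readings reported in different units from being averaged together.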

The World Bank Indicators dataset: This dataset contains a wide range of economic and social indicators for countries around the world, including GDP, population, and health outcomes. It has been used by researchers to study the impact of economic policies and to identify trends in global development.

Check it here:
https://console.cloud.google.com/marketplace/product/the-world-bank/wdi?project=grand-eye-333818
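For the World Bank data, a common first step is to pull one indicator for one country into a DataFrame. The sketch below assumes the table `bigquery-public-data.world_bank_wdi.indicators_data` with columns `country_code`, `indicator_code`, `year`, and `value`, and uses the World Bank indicator code `SP.POP.TOTL` (total population). `to_dataframe()` also needs `pandas` and `db-dtypes` installed. The function names are hypothetical.

```python
import re


def population_query(country_code: str) -> str:
    """Build a query for yearly total population of one country."""
    # Accept only ISO-style three-letter codes, since the value is
    # interpolated into the SQL text.
    if not re.fullmatch(r"[A-Z]{3}", country_code):
        raise ValueError("country_code must be three uppercase letters, e.g. 'IND'")
    return (
        "SELECT year, value AS population\n"
        "FROM `bigquery-public-data.world_bank_wdi.indicators_data`\n"
        f"WHERE country_code = '{country_code}'\n"
        "  AND indicator_code = 'SP.POP.TOTL'\n"
        "ORDER BY year"
    )


def population_frame(country_code: str):
    """Run the query and return the result as a pandas DataFrame."""
    # Imported lazily so the query builder works without the client installed.
    from google.cloud import bigquery

    client = bigquery.Client()
    return client.query(population_query(country_code)).to_dataframe()
```

For example, `population_frame("IND")` would return one row per year, ready for plotting or further analysis in pandas.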

Conclusion

Beyond the three datasets described above, BigQuery offers access to many other public datasets covering topics such as climate data, financial information, and social media trends. Overall, public datasets in BigQuery are a valuable resource for anyone looking to gain insights into complex topics or to develop new applications and tools.
