Google BigQuery

This is the implementation of the BigQuery data handler for Writer Knowledge Graph. BigQuery is a fully-managed, serverless data warehouse that enables scalable analysis and machine learning over petabytes of data. It is a Platform-as-a-Service that supports querying using ANSI SQL, and it provides many useful built-in tools and capabilities.

Implementation

This handler is implemented using the google-cloud-bigquery Python library.

The required arguments to establish a connection are as follows:

  • project_id is a globally unique identifier for your project.
  • dataset defines the default dataset. Tables from this dataset are shown in the object tree.
  • service_account_keys is a full path to the service account key file. This parameter is required but can be replaced by service_account_json.

For more info about creating and managing the service account key visit this page.

There is an optional parameter that can be used instead of the service_account_keys parameter:

  • service_account_json stores the content of a JSON file defined by the service_account_keys parameter.