Bootiful GCP: Globally Consistent Data Access With Spanner (Part 3)

Hi, Spring fans! In this brief series, we’re going to look at the Spring Cloud integration for the Google Cloud Platform, Spring Cloud GCP. Spring Cloud GCP represents a joint effort between Google and Pivotal to provide a first-class experience for Spring Cloud developers using the Google Cloud Platform. Pivotal Cloud Foundry users will enjoy an even easier integration with the GCP service broker. I wrote these installments with input from Google Cloud Developer Advocate, and my buddy, Ray Tsang. Thanks, buddy! You can also catch a walkthrough of Spring Cloud GCP in our Google Next 2018 session, Bootiful Google Cloud Platform. As always, I’d love to hear from you if you have feedback.

If you’re just joining us, be sure to read the previous installments:

MySQL and PostgreSQL are familiar friends in an unfamiliar land, but they’re not why we’re here. No, no. Were I you, I’d look at a platform like GCP and take from it the best bits, the parts that have no analog elsewhere. One feature that separates it from the other platforms is Google Spanner. Spanner is something else entirely. In this installment, we’re going to look at Google Cloud Spanner.

Google first revealed Spanner in 2012, when they introduced F1, a SQL database engine that the AdWords team moved to, away from MySQL (“But Josh!” I hear you exclaim, “Didn’t we just deploy to MySQL??”). Spanner provides low-latency reads and, to a lesser extent, writes globally. Google described it that same year in a research paper that called Spanner “the first system to distribute data at global scale and support externally-consistent distributed transactions.”

“Spanner is impressive work on one of the hardest distributed systems problems: a globally replicated database that supports externally consistent transactions within reasonable latency bounds,” said Andy Gross, principal architect at Basho.

Spanner can offer broad geographic redundancy thanks to a method Google has developed for giving applications precise timestamps, which lets them write, read, and replicate data without making mistakes. Spanner’s “TrueTime” API depends upon GPS receivers and atomic clocks that have been installed in Google’s data centers, so applications can get accurate time readings locally without having to sync globally.
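To make the clock idea a little more concrete, here is a minimal sketch in plain Java. It is not Spanner’s code and not a real API. It only assumes what the Spanner paper describes: a TrueTime reading is an interval that is guaranteed to contain the true time, and a writer waits until its commit timestamp is definitely in the past before letting anyone see the write. The class name, the method names, and the 4 ms uncertainty are all made up for illustration.

```java
public class CommitWaitSketch {

    /** A TrueTime-style reading: the true time lies somewhere in [earliest, latest]. */
    static final class TimeInterval {
        final long earliest; // microseconds
        final long latest;   // microseconds

        TimeInterval(long earliest, long latest) {
            this.earliest = earliest;
            this.latest = latest;
        }
    }

    /** Hypothetical stand-in for TrueTime: widen a local clock reading by an uncertainty bound. */
    static TimeInterval now(long localClockMicros, long uncertaintyMicros) {
        return new TimeInterval(localClockMicros - uncertaintyMicros,
                localClockMicros + uncertaintyMicros);
    }

    /**
     * Microseconds to wait (assuming the clock simply advances) until the earliest
     * possible true time is strictly after the commit timestamp.
     */
    static long commitWaitMicros(long commitTimestampMicros, TimeInterval current) {
        return Math.max(0, commitTimestampMicros - current.earliest + 1);
    }

    public static void main(String[] args) {
        long uncertainty = 4_000; // assumed 4 ms of clock uncertainty, purely illustrative
        TimeInterval atCommit = now(1_000_000, uncertainty);

        // Pick the latest possible time as the commit timestamp, then wait it out.
        long commitTimestamp = atCommit.latest;
        System.out.println(commitWaitMicros(commitTimestamp, atCommit)); // prints 8001
    }
}
```

The wait comes out to roughly twice the uncertainty, which is why tighter clocks (hence the GPS receivers and atomic clocks) translate directly into faster writes.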

There are a number of database technologies at Google, such as Bigtable (a columnar database that is great for high-throughput writes) and Megastore (a NoSQL database). Bigtable only supported eventually consistent replication across datacenters. According to the paper, “at least 300 applications within Google use Megastore, despite its relatively low performance, because its data model is simpler to manage than Bigtable’s and because of its support for synchronous replication across datacenters.” At the time, applications like Gmail, Picasa, Calendar, Android Market, and AppEngine relied on Megastore.

Spanner was designed to be a “scalable, multi-version, globally distributed, and synchronously-replicated database.” Transactions are a first-class concept in Spanner, a choice driven in part by their absence in Bigtable.

“The lack of cross-row transactions in Bigtable led to frequent complaints; Percolator was, in part, built to address this failing. Some authors have claimed that general two-phase commit is too expensive to support, because of the performance or availability problems that it brings. We believe it is better to have application programmers deal with performance problems due to overuse of transactions as bottlenecks arise, rather than always coding around the lack of transactions. Running two-phase commit over Paxos mitigates the availability problems.”

Each of these databases has its own use cases. Bigtable, on GCP as Cloud Bigtable, is great for workloads that need consistently low latency and high throughput. Megastore, on GCP as Cloud Datastore, can be used as a managed NoSQL data store with ACID transactions. Spanner, on GCP as Cloud Spanner, is meant for horizontally scalable, highly available, and strongly consistent RDBMS workloads.

Well, alright! I’m simultaneously interested and intimidated! I want Spanner, but I don’t want to have to rack and stack servers and synchronize GPS receivers and atomic clocks. But, something tells me Google would be happy to do that for me, so let’s try it out.

As shown before, you’ll need to enable the API for Google Cloud Spanner before you can use it:

gcloud services enable spanner.googleapis.com

Then, create a new Google Cloud Spanner instance:

gcloud spanner instances create reservations --config=regional-us-central1 \
  --nodes=1 --description="Reservations for everybody"

Then, create the database in that instance:

gcloud spanner databases create reservations --instance=reservations

Next, you will need to confirm that the database is available:

gcloud spanner databases list --instance=reservations

Once the database is READY, it’s time to create the table. Here’s the Spanner DDL. If this looks uncannily like SQL, that’s good! It should. Put this DDL into a separate file (I’ve called it schema.ddl), and then apply it with gcloud.

schema.ddl

CREATE TABLE reservations (
  id   STRING(36) NOT NULL,
  name STRING(255) NOT NULL
) PRIMARY KEY (id);

gcloud spanner databases ddl update reservations \
  --instance=reservations --ddl="$(cat ./gcp/src/main/resources/db/schema.ddl)"

Now, we can read the data from Spanner in our Spring application. The auto-configuration needs a little bit of configuration in order to talk to the right database.

application.properties

spring.cloud.gcp.spanner.instance-id=reservations
spring.cloud.gcp.spanner.database=reservations

We’ll use the brand new Spring Data Spanner module, which supports common Spring Data idioms when working with Spanner. Add org.springframework.cloud:spring-cloud-gcp-starter-data-spanner to your Maven build. Let’s use a Spring Data repository to make short work of reading from and writing to our database.

package com.example.gcp.spanner;

import lombok.AllArgsConstructor;
import lombok.Data;
import lombok.NoArgsConstructor;
import lombok.extern.slf4j.Slf4j;
import org.springframework.boot.SpringApplication;
import org.springframework.boot.autoconfigure.SpringBootApplication;
import org.springframework.boot.context.event.ApplicationReadyEvent;
import org.springframework.cloud.gcp.data.spanner.core.mapping.PrimaryKey;
import org.springframework.cloud.gcp.data.spanner.core.mapping.Table;
import org.springframework.context.event.EventListener;
import org.springframework.data.annotation.Id;
import org.springframework.data.repository.PagingAndSortingRepository;
import org.springframework.data.rest.core.annotation.RepositoryRestResource;

import java.util.UUID;
import java.util.stream.Stream;

@Slf4j
@SpringBootApplication
public class SpannerApplication {

        private final ReservationRepository reservationRepository;

        SpannerApplication(ReservationRepository reservationRepository) {
                this.reservationRepository = reservationRepository;
        }

        // Runs once the application is ready: reset the table, then write two rows and read them back.
        @EventListener(ApplicationReadyEvent.class)
        public void setup() {

                this.reservationRepository.deleteAll();

                Stream
                    .of("josh", "ray")
                    .map(name -> new Reservation(UUID.randomUUID().toString(), name))
                    .forEach(this.reservationRepository::save);
                this.reservationRepository.findAll().forEach(r -> log.info(r.toString()));
        }

        public static void main(String[] args) {
                SpringApplication.run(SpannerApplication.class, args);
        }
}


@Data
@AllArgsConstructor
@NoArgsConstructor
@Table(name = "reservations")
class Reservation {

        @Id
        @PrimaryKey
        private String id;
        private String name;
}


@RepositoryRestResource
interface ReservationRepository extends PagingAndSortingRepository<Reservation, String> {
}

In the setup() listener, we kick off the application, delete existing data, and then write some new data to the database using our Spring Data Spanner-powered repository.
In the Reservation class, we define the Spring Data Spanner entity using custom mapping annotations, @Table and @PrimaryKey.
In the ReservationRepository interface, we create a Spring Data repository that is also exposed as a REST API using Spring Data REST.

This example should look familiar if you’ve ever used Spring Data. Spring Data Spanner builds upon familiar concepts and patterns — templates, repositories, and entities — to support familiar data access patterns with a very different kind of database.
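Since the repository is annotated with @RepositoryRestResource, you should be able to read the reservations back over HTTP as well. Here’s a minimal sketch of a client that does so with the JDK’s own HttpClient (Java 11 or later). It rests on a few assumptions that aren’t spelled out above: that Spring Data REST and a web server are on the application’s classpath, that the application listens on localhost:8080, and that the repository is exported at Spring Data REST’s default path for a Reservation entity, /reservations. The ReservationClient class and its listReservations method are hypothetical names.

```java
import java.net.URI;
import java.net.http.HttpClient;
import java.net.http.HttpRequest;
import java.net.http.HttpResponse;

public class ReservationClient {

    /** Builds a GET request for the (assumed) default Spring Data REST collection path. */
    static HttpRequest listReservations(String baseUrl) {
        return HttpRequest.newBuilder()
                .uri(URI.create(baseUrl + "/reservations"))
                // Spring Data REST serves HAL-flavored JSON by default.
                .header("Accept", "application/hal+json")
                .GET()
                .build();
    }

    public static void main(String[] args) throws Exception {
        // Assumes the Spring Boot application from this post is running locally on port 8080.
        HttpRequest request = listReservations("http://localhost:8080");

        HttpResponse<String> response = HttpClient.newHttpClient()
                .send(request, HttpResponse.BodyHandlers.ofString());

        System.out.println(response.statusCode());
        System.out.println(response.body());
    }
}
```

If the application runs on a different port, or the repository is exported under a custom path, adjust the base URL and path accordingly.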
