Google Spanner Reading Notes
Detailed mind map
- Spanner is a scalable, globally distributed database designed, built, and deployed at Google.
Scalable, globally distributed
Introduction
- features
First, the replication configurations for data can be dynamically controlled at a fine grain by applications. Applications can specify constraints to control which datacenters contain which data, how far data is from its users (to control read latency), how far replicas are from each other (to control write latency), and how many replicas are maintained (to control durability, availability, and read performance). Data can also be dynamically and transparently moved between datacenters by the system to balance resource usage across datacenters.
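As a rough illustration of the placement knobs listed above (this is not Spanner's actual API; the names here are hypothetical), an application-level replication policy could be modeled like this:

```python
from dataclasses import dataclass, field

@dataclass
class ReplicationPolicy:
    """Hypothetical sketch of the per-application placement constraints the paper lists."""
    datacenters: list[str] = field(default_factory=list)  # which datacenters hold the data
    max_user_distance_ms: int = 50     # bound on data-to-user distance (controls read latency)
    max_replica_distance_ms: int = 20  # bound on replica-to-replica distance (controls write latency)
    num_replicas: int = 3              # controls durability, availability, and read performance

# Example: keep a user's data in US datacenters with five replicas.
policy = ReplicationPolicy(datacenters=["us-east", "us-west"], num_replicas=5)
```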
Second, Spanner has two features that are difficult to implement in a distributed database: it provides externally consistent [Gifford 1982] reads and writes, and globally consistent reads across the database at a timestamp.
These features enable Spanner to support consistent backups, consistent MapReduce executions [Dean and Ghemawat 2010], and atomic schema updates, all at global scale, and even in the presence of ongoing transactions.
Figure 1 illustrates the servers in a Spanner universe. A zone has one zonemaster and between one hundred and several thousand spanservers. The former assigns data to spanservers; the latter serve data to clients. The per-zone location proxies are used by clients to locate the spanservers assigned to serve their data. The universe master and the placement driver are currently singletons. The universe master is primarily a console that displays status information about all the zones for interactive debugging. The placement driver handles automated movement of data across zones on the timescale of minutes. The placement driver periodically communicates with the spanservers to find data that needs to be moved, either to meet updated replication constraints or to balance load. For space reasons, we will only describe the spanserver in any detail.
A tablet is similar to Bigtable’s tablet abstraction, in that it implements a bag of the following mappings.
(key:string, timestamp:int64) → string
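In Python terms, this versioned key-value bag can be sketched as a mapping from (key, timestamp) pairs to values (a toy simplification of the tablet abstraction, not Spanner's storage format):

```python
# A toy model of the tablet mapping (key:string, timestamp:int64) -> string.
tablet: dict[tuple[str, int], str] = {}

tablet[("users/42/name", 1_680_000_000_000)] = "Alice"
tablet[("users/42/name", 1_680_000_500_000)] = "Alicia"  # a later version of the same key

def read_at(tablet: dict[tuple[str, int], str], key: str, ts: int) -> str | None:
    """Return the latest value for `key` with timestamp <= ts (a multiversion read)."""
    versions = [(t, v) for (k, t), v in tablet.items() if k == key and t <= ts]
    return max(versions)[1] if versions else None
```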
A directory is the unit of data placement. All data in a directory has the same replication configuration. When data is moved between Paxos groups, it is moved directory by directory, as shown in Figure 3. Spanner might move a directory to shed load from a Paxos group; to put directories that are frequently accessed together into the same group; or to move a directory into a group that is closer to its accessors. Directories can be moved while client operations are ongoing. One would expect that a 50MB directory could be moved in a few seconds.
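A hypothetical sketch (names invented here, not Spanner code) of the kind of decision the placement driver might make when shedding load from a Paxos group, where the directory is the unit of movement:

```python
def pick_directories_to_move(dir_load: dict[str, float], target_load: float) -> list[str]:
    """Hypothetical helper: pick directories to shed from an overloaded Paxos group.

    dir_load maps directory name -> load it contributes to the group; directories are
    moved one at a time (the unit of movement) until the group is under target_load.
    """
    remaining = sum(dir_load.values())
    to_move = []
    for name, load in sorted(dir_load.items(), key=lambda kv: kv[1], reverse=True):
        if remaining <= target_load:
            break
        to_move.append(name)
        remaining -= load
    return to_move

# e.g. shed the hottest directories until the group is back under 70% of capacity
print(pick_directories_to_move({"users/1": 0.4, "users/2": 0.3, "users/3": 0.2}, 0.7))
```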
Figure 4 contains an example Spanner schema for storing photo metadata on a per-user, per-album basis. The schema language is similar to Megastore’s, with the additional requirement that every Spanner database must be partitioned by clients into one or more hierarchies of tables. Client applications declare the hierarchies in database schemas via the INTERLEAVE IN declarations. The table at the top of a hierarchy is a directory table. Each row in a directory table with key K, together with all of the rows in descendant tables that start with K in lexicographic order, forms a directory. ON DELETE CASCADE says that deleting a row in the directory table deletes any associated child rows. The figure also illustrates the interleaved layout for the example database: for example, Albums(2,1) represents the row from the Albums table for user id 2, album id 1. This interleaving of tables to form directories is significant because it allows clients to describe the locality relationships that exist between multiple tables, which is necessary for good performance in a sharded, distributed database.
Without it, Spanner would not know the most important locality relationships.
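A small sketch of how the interleaved rows of the Figure 4 example group into directories by shared key prefix (a Python stand-in for the layout; the row values are illustrative):

```python
from itertools import groupby

# Interleaved rows: Users(uid) is the directory table, and each Albums(uid, aid)
# row is stored under its parent user's key in lexicographic order.
rows = [
    ("Users", (1,)), ("Albums", (1, 1)), ("Albums", (1, 2)),
    ("Users", (2,)), ("Albums", (2, 1)), ("Albums", (2, 2)), ("Albums", (2, 3)),
]

# All rows whose key starts with the same uid K form one directory.
for uid, members in groupby(rows, key=lambda r: r[1][0]):
    print(f"directory for Users({uid}):", list(members))
```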
Paxos Leader Leases. Spanner’s Paxos implementation uses timed leases to make leadership long-lived (10 seconds by default). A potential leader sends requests for timed lease votes; upon receiving a quorum of lease votes the leader knows it has a lease. A replica extends its lease vote implicitly on a successful write, and the leader requests lease-vote extensions if they are near expiration. Define a leader’s lease interval to start when it discovers it has a quorum of lease votes, and to end when it no longer has a quorum of lease votes (because some have expired). Spanner depends on the following disjointness invariant: for each Paxos group, each Paxos leader’s lease interval is disjoint from every other leader’s. Section 4.2.5 describes how this invariant is enforced.
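A minimal sketch of the lease-interval definition above: the leader holds its lease exactly while a quorum of lease votes is unexpired (an invented helper, not Spanner's implementation):

```python
def has_lease(vote_expirations: list[float], now: float, quorum: int) -> bool:
    """True while a quorum of lease votes has not yet expired."""
    return sum(1 for exp in vote_expirations if exp > now) >= quorum

# With 5 replicas (quorum of 3), the lease interval ends once only 2 votes remain unexpired.
votes = [10.0, 11.0, 12.5, 13.0, 14.0]
print(has_lease(votes, now=11.5, quorum=3))  # True: 12.5, 13.0, 14.0 are still valid
print(has_lease(votes, now=12.8, quorum=3))  # False: only 13.0 and 14.0 remain valid
```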
The Spanner implementation permits a Paxos leader to abdicate by releasing its slaves from their lease votes. To preserve the disjointness invariant, Spanner constrains when abdication is permissible. Define s_max to be the maximum timestamp used by a leader. Subsequent sections will describe when s_max is advanced. Before abdicating, a leader must wait until TT.after(s_max) is true.
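The TT.after() check used here comes from the paper's TrueTime API (Table 1): TT.now() returns an interval [earliest, latest] that is guaranteed to contain absolute time, TT.after(t) is true once t has definitely passed, and TT.before(t) is true while t has definitely not yet arrived. A sketch of the abdication wait under those definitions (the epsilon value and helper names are assumptions for illustration):

```python
import time
from dataclasses import dataclass

@dataclass
class TTInterval:
    earliest: float
    latest: float

EPSILON = 0.004  # illustrative clock-uncertainty bound in seconds, not a real measurement

def tt_now() -> TTInterval:
    """Stand-in for TT.now(): an interval guaranteed to contain absolute time."""
    t = time.time()
    return TTInterval(t - EPSILON, t + EPSILON)

def tt_after(t: float) -> bool:
    """TT.after(t): true only once t has definitely passed."""
    return t < tt_now().earliest

def abdicate(s_max: float) -> None:
    """Before abdicating, the leader waits until TT.after(s_max) holds."""
    while not tt_after(s_max):
        time.sleep(0.001)
    # now it is safe to release the slaves from their lease votes
```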
Spanner also enforces the following external-consistency invariant: if the start of a transaction T_2 occurs after the commit of a transaction T_1, then the commit timestamp of T_2 must be greater than the commit timestamp of T_1. Define the start and commit events for a transaction T_i by e_i^start and e_i^commit, and the commit timestamp of a transaction T_i by s_i. The invariant becomes t_abs(e_1^commit) < t_abs(e_2^start) ⇒ s_1 < s_2. The protocol for executing transactions and assigning timestamps obeys two rules, which together guarantee this invariant, as shown in the following. Define the arrival event of the commit request at the coordinator leader for a write T_i to be e_i^server.
Start. The coordinator leader for a write T_i assigns a commit timestamp s_i no less than the value of TT.now().latest, computed after e_i^server. Note that the participant leaders do not matter here; Section 4.2.1 describes how they are involved in the implementation of the next rule.
Commit Wait. The coordinator leader ensures that clients cannot see any data committed by T_i until TT.after(s_i) is true. Commit wait ensures that s_i is less than the absolute commit time of T_i, i.e., s_i < t_abs(e_i^commit). The implementation of commit wait is described in Section 4.2.1. Proof:
  s_1 < t_abs(e_1^commit)               (commit wait)
  t_abs(e_1^commit) < t_abs(e_2^start)  (assumption)
  t_abs(e_2^start) ≤ t_abs(e_2^server)  (causality)
  t_abs(e_2^server) ≤ s_2               (start)
  s_1 < s_2                             (transitivity)
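Putting the two rules together (a toy illustration reusing tt_now() and tt_after() from the TrueTime sketch above, not Spanner's coordinator code):

```python
import time  # tt_now() and tt_after() are defined in the earlier TrueTime sketch

def commit(apply_writes) -> float:
    """Toy coordinator-leader commit path obeying the Start and Commit Wait rules."""
    s_i = tt_now().latest          # Start: s_i is no less than TT.now().latest at e_i^server
    apply_writes(s_i)              # stage the writes at timestamp s_i (not yet visible)
    while not tt_after(s_i):       # Commit Wait: block until s_i has definitely passed,
        time.sleep(0.001)          # which guarantees s_i < t_abs(e_i^commit)
    return s_i                     # only now may clients see data committed by T_i
```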
To summarize, Spanner combines and extends ideas from two research communities: from the database community, a familiar, easy-to-use, semirelational interface, transactions, and an SQL-based query language; from the systems community, scalability, automatic sharding, fault tolerance, consistent replication, external consistency, and wide-area distribution.
We have shown that reifying clock uncertainty in the time API makes it possible to build distributed systems with much stronger time semantics. In addition, as the underlying system enforces tighter bounds on clock uncertainty, the overhead of the stronger semantics decreases. As a community, we should no longer depend on loosely synchronized clocks and weak time APIs in designing distributed algorithms.
MindMap-Spanner- Google’s Globally Distributed Database.pdf
Spanner- Google’s Globally Distributed Database.pdf