
Retype a Column in YugabyteDB (And PostgreSQL)

source link: https://dzone.com/articles/retype-a-column-in-yugabytedb-and-postgresql


This article takes the reader through changing the datatype of a YugabyteDB column with minimal downtime.

Mar. 22, 2023 · Tutorial

Did you inherit a data model that doesn't use the best datatypes, like int instead of bigint for a value that can grow larger? Don't wait until the tables get big. Here is an example showing how to retype a column efficiently. Such operations are usually done in two kinds of steps:

  • The DDL, which may block the application, must be short, with metadata changes only
  • The DML that modifies data can take longer, as long as it runs online, without impact on application availability

During the intermediate phase, you have to maintain both versions of the column. Because YugabyteDB is PostgreSQL-compatible, including triggers, you can even do this without any change to the application code.

Here is an example.

Demo Table

I have the following table:

create table demo ( k bigserial primary key, a int);
insert into demo(a) select generate_series(1,10000);
\d demo

yugabyte=> \d demo
                            Table "public.demo"
 Column |  Type   | Collation | Nullable |             Default
--------+---------+-----------+----------+---------------------------------
 k      | bigint  |           | not null | nextval('demo_k_seq'::regclass)
 a      | integer |           |          |
Indexes:
    "demo_pkey" PRIMARY KEY, lsm (k HASH)

Unfortunately, column a was defined as int, and I want to change it to bigint to allow larger values.
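To see the limit concretely, an int column overflows beyond 2,147,483,647. A quick check (not part of the original demo):

```sql
-- a 4-byte integer cannot hold values above 2^31-1
insert into demo(a) values (3000000000);
-- ERROR:  integer out of range
```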

Add the New Column

First, I'll add a new column a_new with the target datatype, bigint, and a trigger to keep it in sync with a:


alter table demo add column a_new bigint;

create or replace function a_new() returns trigger as $$
 begin
  new.a_new := new.a;
  return new;
 end;
$$ language plpgsql;

create trigger a_new_trigger
 before insert or update of a on demo
 for each row execute function a_new();

This will synchronize the new column on new inserts, or when a is updated, but all existing rows still have to be updated:

yugabyte=> select * from demo limit 5;

  k   |  a   | a_new
------+------+-------
 4443 | 4443 |
  212 |  212 |
 8937 | 8937 |
 3710 | 3710 |
 8899 | 8899 |
(5 rows)

Backfill the New Column

I can do it in one transaction:

update demo set a_new=a where a_new is null;

Or, if the table is large, I may prefer to do it in small transactions. To reduce the reads, I can create a partial index on what remains to be updated:

create index demo_a_new on demo(k) where a_new is null;

Creating it can take longer, but it is online, with backfill, and it helps to update in small batches without re-scanning the whole table each time. You may choose to do the same without the index: with the index, you do fewer reads but more writes.

This will update in batches of 1,000:

/*+ Set(yb_bnl_batch_size 1000) */
with updated as (
 update demo set a_new=a
  where k in (select k from demo where a_new is null limit 1000)
  returning 1
) select count(*) as updated, 1/count(*) as fail_when_zero
  from updated;

It will stop with ERROR: division by zero when there is nothing left to update. I use that to stop the \watch loop without more code, but you can of course do better.
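As a sketch of how that loop runs in psql: after executing the batched update once, \watch re-runs the query buffer at a fixed interval (one second here, an arbitrary choice) and stops as soon as the query raises an error:

```sql
-- repeat the previous batched update every second;
-- \watch exits when the statement fails with "division by zero"
\watch 1
```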

The most important thing, to be sure that this is efficient, is to look at the execution plan with explain (costs off, analyze, dist):

                                                             QUERY PLAN
------------------------------------------------------------------------------------------------------------------------------------
 Aggregate (actual time=116.497..116.497 rows=1 loops=1)
   CTE updated
     ->  Update on demo (actual time=84.384..116.095 rows=1000 loops=1)
           ->  YB Batched Nested Loop Join (actual time=84.325..108.839 rows=1000 loops=1)
                 Join Filter: (demo.k = "ANY_subquery".k)
                 ->  HashAggregate (actual time=3.925..4.061 rows=1000 loops=1)
                       Group Key: "ANY_subquery".k
                       ->  Subquery Scan on "ANY_subquery" (actual time=3.098..3.661 rows=1000 loops=1)
                             ->  Limit (actual time=3.093..3.495 rows=1000 loops=1)
                                   ->  Index Only Scan using demo_a_new on demo demo_1 (actual time=3.092..3.407 rows=1000 loops=1)
                                         Heap Fetches: 0
                                         Storage Index Read Requests: 1
                                         Storage Index Execution Time: 4.000 ms
                 ->  Index Scan using demo_pkey on demo (actual time=79.913..103.305 rows=1000 loops=1)
                       Index Cond: (k = ANY (ARRAY["ANY_subquery".k, $2, $3, ..., $1000]))
                       Storage Index Read Requests: 1
                       Storage Index Execution Time: 16.000 ms
   ->  CTE Scan on updated (actual time=84.387..116.390 rows=1000 loops=1)
 Planning Time: 0.902 ms
 Execution Time: 153.539 ms
 Storage Read Requests: 2
 Storage Write Requests: 1
 Storage Execution Time: 56.000 ms
 Peak Memory Usage: 121241 kB
(24 rows)

This confirms that I have only two read requests and one write request for each batch.
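Before dropping the index, a quick sanity check (my addition, not in the original run) confirms that nothing remains to backfill:

```sql
-- should return 0 once the backfill is complete
select count(*) from demo where a_new is null;
```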

The index, which now has no entries, can be removed:

drop index demo_a_new;

Validate the State of the New Column

I validate that a_new is synchronized even after some DML:

yugabyte=> select * from demo where k<=3;
 k | a | a_new
---+---+-------
 3 | 3 |     3
 2 | 2 |     2
 1 | 1 |     1
(3 rows)

yugabyte=> update demo set a=a*2;
UPDATE 10000

yugabyte=> select * from demo where k<=3;
 k | a | a_new
---+---+-------
 3 | 6 |     6
 2 | 4 |     4
 1 | 2 |     2
(3 rows)

yugabyte=> insert into demo values(0,42);
INSERT 0 1

yugabyte=> select * from demo where k<=3;
 k | a  | a_new
---+----+-------
 3 |  6 |     6
 2 |  4 |     4
 0 | 42 |    42
 1 |  2 |     2
(4 rows)

Switch to the New Column

With no urgency and no stress, we are ready for the last step: renaming a to a_old, renaming a_new to a, and dropping the trigger function:

alter table demo rename column a to a_old;
alter table demo rename column a_new to a;

drop function a_new cascade;

This must be done while the application is not querying the table because, as of YugabyteDB 2.17, DDL is not transactional yet (#3109). However, this step is short, as it updates only the table metadata, and it is done in a way that may raise an error but cannot corrupt data.

Here is the table in the final state:

yugabyte=> select * from demo where k<=3;

 k | a
---+----
 3 |  6
 2 |  4
 0 | 42
 1 |  2
(4 rows)

Remarks

You don't have to use a trigger. This can also be done with two application releases:

  • the first one adds the new column and updates both columns from the application code
  • the second one, after all existing rows have been backfilled, removes the old column and renames the new one

Even if, in theory, you don't need this in PostgreSQL because DDL is transactional there, you still want to avoid long transactions, and this approach remains a good alternative.
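In PostgreSQL, for example, the final switch can be wrapped in a single transaction so that it is all-or-nothing (a sketch assuming the same demo table and trigger names as above):

```sql
begin;
alter table demo rename column a to a_old;
alter table demo rename column a_new to a;
drop function a_new cascade;  -- cascade also drops a_new_trigger
commit;
```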

