

AWS Tutorial 4 - RDS Data Loading into Aurora Run In Circles Scream and Shout -...
source link: https://fuzzyblog.io/blog/aws/2016/08/28/aws-tutorial-4-rds-data-loading-into-aurora-run-in-circles-scream-and-shout-the-oh-shite-moment.html
Go to the source link to view the article. You can view the picture content, updated content and better typesetting reading experience. If the link is broken, please click the button below to view the snapshot at that time.

AWS Tutorial 4 - RDS Data Loading into Aurora Run In Circles Scream and Shout - The Oh Shite Moment
Aug 28, 2016
If you've read tutorial 1, 2 or 3 then you absolutely need to read this one.
What I've been doing is generating mysqldump load files on a normal MySQL instance and then feeding them into an Amazon RDS hosted Aurora db. Normally I'm beyond conservative with the storage layer of anything I build but this time I went for the gusto and used something new - Amazon Aurora.
What I've just discovered is terribly, terribly troubling. I'm encountering silent load failures where data I've exported is in the dump files but NOT in Aurora after the load. I discovered this one time thru an error message and other than that I haven't seen an error message but I am missing data.
Now given the convoluted gyrations with split, sync and cat to recombine, it would be easy to assume that the issue is mine but I only used split / cat on two tables of data and there are issues with at least 3 tables so one of the tables did NOT use split / cat. Here's an example:
source:
select count(*) from large_table;
+----------+
| count(*) |
+----------+
| 72994268 |
+----------+
1 row in set (1 min 28.98 sec)
aurora:
select count(*) from large_table;
+----------+
| count(*) |
+----------+
| 43012074 |
+----------+
1 row in set (30.66 sec)
And this was one of the non split files. The files done with split are worse.
I have an ec2 instance configured as a high performance db server so I'm loading one of the offending files into to see if I can eliminate Aurora as the issue.
And I'm not going to definitely say that the issue is Aurora. I've prematurely pointed the finger in the past and that's not fair. What I do think is fair is to say the following:
- Do not assume that because an import succeeded that all your data is there
- Absolutely verify row counts from source to destination
- I don't know that row counts indicate success; I think you need a checksum of sql statements from source to destination but that's a much harder problem
More later if / when I understand this.
Recommend
-
14
Integrating with PVS-Studio The first and top diagnostic Camera position in games The legacy of CryEngine Errors live longer than you might think ...
-
17
A shout-out to LWN 📰 29th January 2017 One of the biggest advantages of
-
14
Scream at your game controller to mess up the other player Here's another one of my half-baked ideas from quite a few years back. This one could be the basis for a particularly crazy two-player game. Games are not my thing, so t...
-
8
AWS Tutorial 5 - Aurora RDS Issues Aug 29, 2016 So I took the time to reload the data I described
-
10
Mar 11, 2022 — 15:43 CUT MacStories Unwind: A.P. Bio and House by Shout Out Louds
-
13
欧雷说:「Shake, Wave, Shout, Fuck with music.Thanks for the...」 欧雷 发表于 2022-09-01 16:29 Shake, Wave, Shout,...
-
4
MoviesHalloween Is Cool and All, But John Carpenter Is Really Into Godzilla These DaysThe legendary horror director will...
-
5
VIM-SHOUT Run and Capture Shell Command Output Should work with Vim9 (compiled with HUGE features). I always wanted a simpler way to run an...
-
6
Don’t miss what’s happeningPeople on Twitter are the first to know....
-
6
About Joyk
Aggregate valuable and interesting links.
Joyk means Joy of geeK