Why We Did Not Name the cdata Transforms wide/tall/long/short

We recently saw a UX (user experience) question from the tidyr author, posed as he adapts tidyr to cdata techniques.

The terminology he is not adopting from cdata is the pair of names “unpivot_to_blocks()” and “pivot_to_rowrecs()”. One of the research ideas in the cdata package is that the important thing to call out is record structure.

The important question is: are we in a very de-normalized form where all facts about an instance are in a single row (which we called “row records”), or are we in a record-oriented form where all the facts about an instance are spread over several rows (which we called “block records”)? Crucially, row records don’t necessarily have more columns than block records. This makes shape-based naming of the transforms problematic, no matter what names you pick for the shapes. There is an advantage to naming the transforms by intent or semantics instead.

Below is a simple example.

Notice the width of the result relative to the input width varies as a function of the input data, even though we were always calling the same transform. This makes it incorrect to characterize these transforms as merely widening or narrowing.
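The same effect can be sketched in a few lines of plain Python. This is only an illustration of the idea, not the cdata API: the helper name pivot_to_rowrecs_sketch, the column names, and the data are all made up for this example. The block records always have three columns (id, measure, value), yet the row-record result is narrower in one case and wider in the other.

```python
def pivot_to_rowrecs_sketch(blocks, key, name_col, value_col):
    """Collect block records (several rows per instance) into row records
    (one row per instance). Hypothetical helper, not the cdata API."""
    # Result columns: the key, then each measurement name in first-seen order.
    names = []
    for row in blocks:
        if row[name_col] not in names:
            names.append(row[name_col])
    # Gather every fact about an instance into a single record.
    records = {}
    for row in blocks:
        rec = records.setdefault(row[key], {key: row[key]})
        rec[row[name_col]] = row[value_col]
    columns = [key] + names
    return columns, [[rec.get(c) for c in columns] for rec in records.values()]


# Illustrative data: both inputs are block records with three columns.
one_measure = [
    {"id": 1, "measure": "AUC", "value": 0.7},
    {"id": 2, "measure": "AUC", "value": 0.8},
]
four_measures = [
    {"id": 1, "measure": m, "value": v}
    for m, v in [("AUC", 0.7), ("R2", 0.4), ("RMSE", 1.2), ("MAE", 0.9)]
]

cols_a, rows_a = pivot_to_rowrecs_sketch(one_measure, "id", "measure", "value")
cols_b, rows_b = pivot_to_rowrecs_sketch(four_measures, "id", "measure", "value")

print(len(cols_a))  # 2 columns: narrower than the 3-column block form
print(len(cols_b))  # 5 columns: wider than the 3-column block form
```

The transform is the same in both calls; only the number of distinct measurements per instance changes, and that alone decides whether the row-record form comes out "wider" or "narrower" than the block form.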

There are still some subtle points (for instance, row records are in fact instances of block records), but overall the scheme we (Nina Zumel and myself, John Mount) worked out, tested, and promoted is pretty good. A lot of our work researching this topic can be found here.
