Division-less Euclid's algorithm

By orz, 2 days ago

Usually Euclid's algorithm, which computes $\gcd(a, b)$, is implemented as follows:

while b != 0:
  a, b = b, a % b
return a

Or, in recursive fashion,

if b == 0:
  return a
else:
  return gcd(b, a % b)

While it takes $O(n)$ arithmetic operations (where $n$ is the maximum of the binary lengths of $a$ and $b$, that is, big-Theta of the length of the input), it relies on integer division, which is quite an expensive operation. The fastest known integer division procedure runs in $O(n \log n)$ time, so, if we take into account the time spent on arithmetic operations, the time complexity is $O(n^2 \log n)$. But even if we don't, int64 division is still much slower than operations such as addition, subtraction and binary shifts.

In case you didn't know, there is an algorithm that doesn't need division at all!

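def count_trailing_zeros(a):
  # one possible implementation for a > 0: a & -a isolates the lowest set bit
  return (a & -a).bit_length() - 1

def remove_trailing_zeros(a):
  return a >> count_trailing_zeros(a)

def gcd_of_odd_numbers(a, b):
  # both a and b must be odd and positive
  if a == b:
    return a
  if a < b:
    a, b = b, a
  # a - b is even and b is odd, so dropping the factors of 2 keeps the gcd
  return gcd_of_odd_numbers(b, remove_trailing_zeros(a - b))

def gcd(a, b):
  if a == 0:
    return b
  if b == 0:
    return a
  # shift back the power of two shared by a and b
  return gcd_of_odd_numbers(remove_trailing_zeros(a), remove_trailing_zeros(b)) << min(count_trailing_zeros(a), count_trailing_zeros(b))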

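The function count_trailing_zeros(a) finds the maximum $k$ such that $a$ is divisible by $2^k$. The function remove_trailing_zeros(a) divides $a$ by the maximum power of two that divides $a$. Both these functions can be easily implemented in $O(n)$ time, if we take into account the complexity of arithmetic operations. gcd_of_odd_numbers(a, b) finds the gcd of the two numbers $a$ and $b$, given they are both odd. It relies on $\gcd(a, b) = \gcd(b, a - b)$ and on the fact that, since $b$ is odd, the factors of two in $a - b$ cannot belong to the gcd. Everything except the recursive call works in $O(n)$ time. Note that $a - b$ is even and smaller than $a$, so removing its trailing zeros makes it at least one bit shorter than $a$; hence the sum of the binary lengths of the numbers is decremented by at least one from call to call, and there will be only $O(n)$ recursive calls. Therefore, gcd_of_odd_numbers(a, b) works in $O(n^2)$ time. Finally, gcd(a, b) clearly takes $O(n^2)$ time as well.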
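To make the counting argument concrete, here is a minimal Python sketch. It is an iterative variant written for illustration, not the code from the post: the names `ctz` and `gcd_with_steps` are made up, and `math.gcd` is used only as a reference answer. It counts the loop iterations and checks that they never exceed the sum of the binary lengths of the inputs.

```python
import math
import random


def ctz(x):
    """Number of trailing zero bits of a positive integer x (hypothetical helper)."""
    # x & -x isolates the lowest set bit; its bit length minus one is its index.
    return (x & -x).bit_length() - 1


def gcd_with_steps(a, b):
    """Division-less gcd of non-negative integers; returns (gcd, iterations)."""
    if a == 0:
        return b, 0
    if b == 0:
        return a, 0
    # The power of two shared by a and b is put back at the end.
    shift = min(ctz(a), ctz(b))
    a >>= ctz(a)
    b >>= ctz(b)
    steps = 0
    while a != b:
        if a < b:
            a, b = b, a
        # a - b is even and positive, so stripping zeros removes at least one bit.
        a -= b
        a >>= ctz(a)
        steps += 1
    return a << shift, steps


random.seed(1)
for _ in range(1000):
    x = random.getrandbits(64)
    y = random.getrandbits(64)
    g, steps = gcd_with_steps(x, y)
    assert g == math.gcd(x, y)
    # The sum of binary lengths drops by at least one per iteration.
    assert steps <= x.bit_length() + y.bit_length()
```

For example, `gcd_with_steps(48, 18)` strips the inputs down to 3 and 9, needs a single subtraction step to reach 3 and 3, and shifts the shared factor of two back in to give 6.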

My question is: why does everyone use the implementation with divisions? Are there some hidden advantages? I haven't compared how long these two take in practice with fixed-length integer types and arbitrary-precision integer types. Has anyone in the community investigated this question? Did you know about the division-less gcd implementation at all? Please let me know in the comments.
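As a possible starting point for such a comparison, here is a rough timing sketch in Python. It rests on several assumptions: the names `gcd_division`, `gcd_shift` and `benchmark` are hypothetical, the inputs are uniformly random, and in CPython the interpreter overhead dominates, so the results say little about int64 arithmetic in a compiled language. No timings are quoted here because they depend on the machine.

```python
import random
import timeit


def gcd_division(a, b):
    """Classic Euclid's algorithm using the remainder operation."""
    while b != 0:
        a, b = b, a % b
    return a


def gcd_shift(a, b):
    """Division-less variant using only subtraction, comparison and shifts."""
    if a == 0:
        return b
    if b == 0:
        return a
    # (x & -x).bit_length() - 1 counts the trailing zero bits of x > 0.
    tz_a = (a & -a).bit_length() - 1
    tz_b = (b & -b).bit_length() - 1
    a >>= tz_a
    b >>= tz_b
    while a != b:
        if a < b:
            a, b = b, a
        a -= b
        a >>= (a & -a).bit_length() - 1
    return a << min(tz_a, tz_b)


def benchmark(bits, pairs=100, repeats=3):
    """Return (division_time, shift_time) in seconds for random inputs."""
    rng = random.Random(0)
    data = [(rng.getrandbits(bits), rng.getrandbits(bits)) for _ in range(pairs)]
    # Sanity check: both implementations must agree before timing them.
    assert all(gcd_division(a, b) == gcd_shift(a, b) for a, b in data)
    t_div = min(timeit.repeat(lambda: [gcd_division(a, b) for a, b in data],
                              number=1, repeat=repeats))
    t_shift = min(timeit.repeat(lambda: [gcd_shift(a, b) for a, b in data],
                                number=1, repeat=repeats))
    return t_div, t_shift


if __name__ == "__main__":
    for bits in (64, 512):
        t_div, t_shift = benchmark(bits)
        print(f"{bits} bits: division {t_div:.4f}s, shifts {t_shift:.4f}s")
```

Taking the minimum over several repeats reduces noise from other processes. A fair test of the fixed-width case would need the same experiment in a compiled language with a hardware count-trailing-zeros instruction.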
