Comparison between signed and unsigned integer expressions

2009 05 01

Many times I find myself comparing size_t with int; for example, when I need to check that a vector isn’t bigger than some runtime-determined boundary value. Using -Wall (as everyone should be), and assuming your size_t is defined rationally, compilers correctly flag this as problematic:

./test.cpp: In function main:
./test.cpp:8: warning: comparison between signed and unsigned integer expressions

I think the best solution is to static_cast one to the other. But which to which? If you cast the int to a size_t, you run the risk of a negative int wrapping around into an incorrectly huge size_t. And if you cast the size_t to an int, you run the risk of a huge size_t not fitting and turning into an incorrectly negative int. So, pragmatically, if your int has a high likelihood of being negative, you should cast your size_t to an int; if your size_t has a high likelihood of being larger than your implementation’s max int value, you should cast your int to a size_t.
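To make the two failure modes concrete, here is a minimal sketch. The helper names exceeds_as_unsigned and exceeds_as_signed are hypothetical, made up for illustration, and the behaviour described for the second one assumes a typical two's-complement implementation with a size_t at least as wide as int.

```cpp
#include <cstddef>
#include <limits>

// Hypothetical helper: converts the int to size_t before comparing.
// A negative limit wraps around to a huge unsigned value, so a small
// size is wrongly reported as NOT exceeding a negative limit.
constexpr bool exceeds_as_unsigned(std::size_t size, int limit)
{
    return size > static_cast<std::size_t>(limit);
}

// Hypothetical helper: converts the size_t to int before comparing.
// A size above INT_MAX does not fit in an int. Before C++20 the result
// is implementation-defined; since C++20 it wraps modulo 2^N. Either
// way, on common compilers it tends to come out negative.
constexpr bool exceeds_as_signed(std::size_t size, int limit)
{
    return static_cast<int>(size) > limit;
}
```

With a limit of -1 and a size of 10, the unsigned variant gives the wrong answer and the signed one gives the right answer; with a size just above the maximum int, it is the other way round.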

Yet, at a high level, size_t is a “concept”, both in the sense that it is not directly a primitive type (it is a typedef) and in the sense that it represents something not immediately mathematical: a non-negative count of items. That, handily, maps to some unsigned integer type in C++, as an implementation detail. So one could argue that the best bet is to cast the int to a size_t, as it’s casting a lower-order concept to a higher one, which (presumably) is capable of handling it.

But an int is also a sort of “concept”, albeit a mathematical one directly supported by most modern languages. It’s an integer value that may by definition be negative; any limitation on the size of that value is an implementation detail. In other words, by definition and ignoring implementation details, a size_t is-an int, whereas an int is-NOT-a size_t. And therefore, I think in the general case you should compare ints and size_ts by casting the size_t to an int, e.g.:

int max_size = ...;
if (static_cast<int>(my_vector.size()) > max_size)
{
    // handle error
}

Luckily this also tends to be the correct pragmatic choice; if your STL containers regularly hold more elements than an int can represent (about two billion with a typical 32-bit int), you are probably not very interested in bounds-checking them.
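If you do want to cover the huge-container case as well, one option is to guard the cast. The following is only a sketch under that assumption: size_exceeds is a hypothetical helper name, not something from the standard library, and it keeps the approach of casting the size_t to an int.

```cpp
#include <cstddef>
#include <limits>

// Hypothetical helper: returns true if size is greater than max_size,
// comparing by mathematical value rather than by converted bit pattern.
constexpr bool size_exceeds(std::size_t size, int max_size)
{
    // A size above INT_MAX cannot be represented as an int, and it
    // necessarily exceeds any limit that is stored in an int.
    if (size > static_cast<std::size_t>(std::numeric_limits<int>::max()))
    {
        return true;
    }
    // The size now fits in an int, so the cast cannot change its value.
    return static_cast<int>(size) > max_size;
}
```

The check above would then read if (size_exceeds(my_vector.size(), max_size)). Compilers that support C++20 also provide std::cmp_greater in <utility>, which performs this kind of value-correct comparison between signed and unsigned integers.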
