This is probably a consequence of ceil and floor being compiler intrinsics. The corresponding FPU (floating-point unit) instructions are likely to be faster and/or more precise than a library routine: faster because they avoid a function call, and more precise because the FPU often keeps intermediate results at a higher precision.