amd64: use compiler intrinsics for bsf* and bsr*
(cherry picked from commit aae89f6f09576351cc3a9a54959649e60fdd849b)