x86: optimize {,V}EXTRACTPS with immediate 0

They are equivalent to simple moves, which are up to 2 bytes shorter to
encode (and possibly also cheaper to execute).
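
For illustration only, the size difference can be checked with a few hand-assembled encodings. This is a sketch, not taken from the patch: the choice of MOVD/VMOVD (register destination) and MOVSS (memory destination) as the "simple moves" is an assumption, and the byte sequences below were written out by hand for 64-bit mode rather than produced by the assembler.

```python
# Hand-assembled 64-bit encodings, AT&T operand order in the strings.
# Assumption: the replacement moves are MOVD/VMOVD (register destination)
# and MOVSS (memory destination); the commit only says "simple moves".
PAIRS = [
    ("extractps $0, %xmm0, %eax",   "660f3a17c000", "movd %xmm0, %eax",    "660f7ec0"),
    ("extractps $0, %xmm0, (%rax)", "660f3a170000", "movss %xmm0, (%rax)", "f30f1100"),
    ("vextractps $0, %xmm0, %eax",  "c4e37917c000", "vmovd %xmm0, %eax",   "c5f97ec0"),
    # %r8d needs VEX.B, which forces the 3-byte VEX prefix on the move too,
    # so only the immediate byte is saved here.
    ("vextractps $0, %xmm0, %r8d",  "c4c37917c000", "vmovd %xmm0, %r8d",   "c4c1797ec0"),
]


def savings():
    """Return (original, replacement, bytes_saved) for each pair."""
    result = []
    for orig, orig_hex, repl, repl_hex in PAIRS:
        saved = len(bytes.fromhex(orig_hex)) - len(bytes.fromhex(repl_hex))
        result.append((orig, repl, saved))
    return result


if __name__ == "__main__":
    for orig, repl, saved in savings():
        print(f"{orig:30} -> {repl:22} saves {saved} byte(s)")
```

The last pair shows why the saving is "up to" 2 bytes: when the move cannot use the 2-byte VEX prefix, only the dropped immediate is gained.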