x86: optimize {,V}EXTRACTPS with immediate 0

With an immediate of 0, both insns are equivalent to simple moves, which
are up to 2 bytes shorter to encode (and possibly also cheaper to execute).
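
As an illustration of the size difference, the sketch below compares
hand-assembled encodings of the register forms. It assumes %xmm0 as
source, %eax as destination, and {,V}MOVD as the replacement move; the
commit message itself does not name the replacement insn, and the array
and helper names are hypothetical.

```c
#include <stddef.h>

/* Hand-assembled encodings, assuming %xmm0 as source and %eax as
   destination.  The choice of {,v}movd as the replacement is an
   assumption made for illustration.  */

/* extractps $0, %xmm0, %eax : 66 0F 3A 17 /r ib */
static const unsigned char extractps_imm0[] =
  { 0x66, 0x0f, 0x3a, 0x17, 0xc0, 0x00 };

/* movd %xmm0, %eax : 66 0F 7E /r */
static const unsigned char movd_xmm0_eax[] =
  { 0x66, 0x0f, 0x7e, 0xc0 };

/* vextractps $0, %xmm0, %eax : 3-byte VEX prefix, opcode 17 /r ib */
static const unsigned char vextractps_imm0[] =
  { 0xc4, 0xe3, 0x79, 0x17, 0xc0, 0x00 };

/* vmovd %xmm0, %eax : 2-byte VEX prefix, opcode 7E /r */
static const unsigned char vmovd_xmm0_eax[] =
  { 0xc5, 0xf9, 0x7e, 0xc0 };

/* Bytes saved by using the plain move in the legacy (SSE4.1) case.  */
static size_t
legacy_bytes_saved (void)
{
  return sizeof extractps_imm0 - sizeof movd_xmm0_eax;
}

/* Bytes saved in the VEX case: the immediate byte goes away and the
   VEX prefix shrinks from 3 bytes to 2.  */
static size_t
vex_bytes_saved (void)
{
  return sizeof vextractps_imm0 - sizeof vmovd_xmm0_eax;
}
```

In both cases one byte comes from dropping the immediate; the other
comes from the shorter opcode map (legacy) or the shorter VEX prefix.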
12 files changed