Is this supported in FPC? I cannot rely on compiler producing SSE which is rather bad.
In C++ I can simply do:
__m128 *p1,*p2,*p3;
__declspec(align(16)) float f1[N],f2[N],f3[N];
for(int i=0;i<N;i++){
f1[i]=i+0.12;
f2[i]=i+0.16;
};
p1=(__m128*)f1;
p2=(__m128*)f2;
p3=(__m128*)f3;