Change scaling of rectangular fwd transforms

Modifies the C fwd txfms to have correct scaling. Rectangular
transforms now are always implemented in a way that the samller
side is transformed first.

The SSE2 tests are temporarily disabled until the SSSE2 code
is modified to be consistent with the C code.

Also includes a fdct32 fix.

borgtest results show a slight improvement.

Change-Id: I9417fd0b833d79e0ab13c85d3210d9ea8f2029a4
7 files changed