    Redo the forward 4x4 dct · d0dd01b8
    Yaowu Xu authored
    The new fdct lowers the round trip sum squared error for a
    4x4 block to ~0.12, or ~0.008/pixel. For reference, the old
    matrix multiply version has an average round trip error of
    1.46 for a 4x4 block.
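
For readers unfamiliar with the metric, the following is a minimal sketch of how a round trip sum squared error can be measured. It does not use the transforms from this change: `forward` and `inverse` are hypothetical stand-ins built from a floating-point orthonormal DCT-II with integer rounding, so the numbers it produces are not expected to match the ~0.12 or 1.46 figures quoted above. All function names and the random input range are assumptions for illustration.

```python
import math
import random

N = 4  # block size

# Orthonormal DCT-II basis: C[k][n] = a(k) * cos((2n + 1) * k * pi / (2N)),
# with a(0) = sqrt(1/N) and a(k) = sqrt(2/N) otherwise.
C = [
    [
        (math.sqrt(1.0 / N) if k == 0 else math.sqrt(2.0 / N))
        * math.cos((2 * n + 1) * k * math.pi / (2 * N))
        for n in range(N)
    ]
    for k in range(N)
]


def matmul(a, b):
    """Multiply two N x N matrices given as lists of rows."""
    return [
        [sum(a[i][k] * b[k][j] for k in range(N)) for j in range(N)]
        for i in range(N)
    ]


def transpose(m):
    return [list(row) for row in zip(*m)]


def forward(block):
    """Hypothetical stand-in forward transform: round(C * X * C^T)."""
    y = matmul(matmul(C, block), transpose(C))
    # Python's round() rounds halves to even; good enough for a sketch.
    return [[round(v) for v in row] for row in y]


def inverse(coeffs):
    """Hypothetical stand-in inverse transform: round(C^T * Y * C)."""
    x = matmul(matmul(transpose(C), coeffs), C)
    return [[round(v) for v in row] for row in x]


def round_trip_sse(block):
    """Sum of squared differences between a block and its reconstruction."""
    rec = inverse(forward(block))
    return sum(
        (block[i][j] - rec[i][j]) ** 2 for i in range(N) for j in range(N)
    )


def average_round_trip_sse(num_blocks=1000, seed=0):
    """Average round trip SSE over random integer blocks (assumed range)."""
    rng = random.Random(seed)
    total = 0
    for _ in range(num_blocks):
        block = [[rng.randint(-255, 255) for _ in range(N)] for _ in range(N)]
        total += round_trip_sse(block)
    return total / num_blocks


if __name__ == "__main__":
    per_block = average_round_trip_sse()
    print("average SSE per 4x4 block:", per_block)
    print("average SSE per pixel:", per_block / (N * N))
```

The per-pixel figure is the per-block figure divided by 16, which is how ~0.12 per block corresponds to ~0.008 per pixel (0.12 / 16 = 0.0075).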
    Thanks to "derf" for his suggestions and references.
    Change-Id: I5559d1e81d333b319404ab16b336b739f87afc79