Float128 fma() subtly corrupts state on Windows #31

RalphAS · 2019-04-20T21:27:58Z

On Windows (but not Linux or OSX) calls to fma() somehow break the overflow handling.

julia> h, x = floatmax(Float128), Float128(1)
julia> h + h
inf
julia> fma(x,x,x);
julia> h + h
1.18973149535723176508575932662800701e+4932

I don't see anything wrong with our wrappers; it may be a libquadmath and/or libgcc issue.

The text was updated successfully, but these errors were encountered:

simonbyrne · 2019-04-21T05:07:31Z

Is it correct for other values? Not sure if this is related: https://gcc.gnu.org/bugzilla/show_bug.cgi?id=89459

RalphAS · 2019-04-21T18:30:36Z

After calling fma on Float128, overflow seems generally broken for +,-,*,/ with Float128 or Float64 arguments (tested with mixtures of huge and moderate values). I surmise that some floating point control structure in the mingw runtime is not being reset properly.

Since things work on Linux, I don't think this one is an upstream GCC issue. (I believe I'm using a fairly old GCC on Linux without trouble here.)

simonbyrne · 2019-04-22T02:53:54Z

ah, if it's breaking Float64 rounding, then I suspect that is related to JuliaLang/julia#9847 (comment) / https://sourceware.org/bugzilla/show_bug.cgi?id=17907

In that case, maybe we should just disable fma on Windows? Or call the MPFR one?

RalphAS · 2019-04-22T04:34:45Z

I'm really here for the trouble it gave me in #30 and the challenge of a level-1 puzzle to pin that down, so either of your suggestions LGTM.

simonbyrne · 2019-04-22T17:44:00Z

Maybe we just remove fma on Windows for the time being?

See #31.

simonbyrne · 2019-04-24T04:37:23Z

I have disabled fma on Windows for now. Once we have a better solution we can add it back.

JeffreySarnoff · 2019-05-03T04:13:18Z

Opinion?

@static if Sys.iswindows()
    const SPLITTER = 1 + Float128(2)^57
    const THRESHOLD = Float128(2)^16326

    function two_sum(x::Float128, y::Float128)
        hi = x + y
        a  = hi - y
        b  = hi - a
        lo = (x - a) + (y - b)
        return hi, lo
    end
    
    function sum_three(x::Float128, y::Float128, z::Float128)
        s, t  = two_sum(y, z)
        u, v  = two_sum(x, s)
        hi    = t + v + u
        return hi
    end
    
    function splitter(x::Float128)
        x < THRESHOLD || throw(DomainError("Threshold $THRESHOLD exceeded"))
        a = x * SPLITTER
        b = x - a
        x1 = a + b
        x2 = x - x1
        return x1, x2
    end
    
    function two_prod(x::Float128, y::Float128)
        hi = x * y
        x1, x2 = splitter(x)
        y1, y2 = splitter(y)
        lo = x2 * y2 - (((hi - x1*y1) - x1*y2) - x2*y1)
        return hi, lo
    end 
 
    function fma(x::Float128, y::Float128, z::Float128)
        xyhi, xylo = two_prod(x, y)
        result = sum_three(z, xyhi, xylo)
        return result
    end
    
 end

simonbyrne · 2019-05-03T17:48:02Z

fmas are subtly difficult to get correct (especially dealing with underflow and subnormals). I think the BigFloat option is probably the best for now until we can figure out what is going on.

JeffreySarnoff · 2019-05-08T01:49:59Z

It's fine by me -- the more important aspect of this is let the Windows version work for clients who use fma. Why not implement the BigFloat fma stopgap until the cause is dissolved or resolved?

JeffreySarnoff · 2020-06-14T11:17:58Z

Thought it worth the revisit

if Sys.iswindows()
    function Base.fma(x::Float128, y::Float128, z::Float128)
        oldprec = precision(BigFloat)
        setprecision(BigFloat, 512)   # at least 448 == Int(128 * 3.5) 
        result = Float128(x * y + z)
        setprecision(BigFloat, oldprec)
        return result
    end
else
   # ccall
end

simonbyrne · 2020-06-15T15:46:59Z

@JeffreySarnoff would you mind opening a pull request?

Isn't fma defined for BigFloat?

JeffreySarnoff · 2020-06-15T17:12:08Z

fma is defined for BigFloat and the logic will process fma(x,y,z) at the precision set for BigFloat. That precision may be well in excess of the precision demanded for Float128s and take longer than it might, or it may be much less than that and deliver an imprecise result. Either way, we need to manipulate setprecision(BigFloat) when fma is called (for Win). I'll PR it.

JeffreySarnoff · 2022-04-16T15:50:48Z

Has this "On Windows (but not Linux or OSX) calls to fma() somehow break the overflow handling." fixed itself with the current libquadmath that we use? If so, we should remove the code that blocks fma on Windows.

RalphAS · 2022-04-16T23:30:19Z

It appears to be unchanged for current and nightly Julia.

The example at the top is included as a broken test for Windows in the commit shown here:
https://github.com/RalphAS/Quadmath.jl/actions/runs/2178209027

simonbyrne added a commit that referenced this issue Apr 23, 2019

disable fma on Windows

c8b8b6c

See #31.

expikr mentioned this issue May 26, 2024

math.hypot: fix incorrect over/underflow behavior ziglang/zig#19472

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Float128 fma() subtly corrupts state on Windows #31

Float128 fma() subtly corrupts state on Windows #31

RalphAS commented Apr 20, 2019

simonbyrne commented Apr 21, 2019

RalphAS commented Apr 21, 2019

simonbyrne commented Apr 22, 2019

RalphAS commented Apr 22, 2019

simonbyrne commented Apr 22, 2019

simonbyrne commented Apr 24, 2019

JeffreySarnoff commented May 3, 2019 •

edited

Loading

simonbyrne commented May 3, 2019

JeffreySarnoff commented May 8, 2019

JeffreySarnoff commented Jun 14, 2020

simonbyrne commented Jun 15, 2020

JeffreySarnoff commented Jun 15, 2020

JeffreySarnoff commented Apr 16, 2022

RalphAS commented Apr 16, 2022

Float128 fma() subtly corrupts state on Windows #31

Float128 fma() subtly corrupts state on Windows #31

Comments

RalphAS commented Apr 20, 2019

simonbyrne commented Apr 21, 2019

RalphAS commented Apr 21, 2019

simonbyrne commented Apr 22, 2019

RalphAS commented Apr 22, 2019

simonbyrne commented Apr 22, 2019

simonbyrne commented Apr 24, 2019

JeffreySarnoff commented May 3, 2019 • edited Loading

simonbyrne commented May 3, 2019

JeffreySarnoff commented May 8, 2019

JeffreySarnoff commented Jun 14, 2020

simonbyrne commented Jun 15, 2020

JeffreySarnoff commented Jun 15, 2020

JeffreySarnoff commented Apr 16, 2022

RalphAS commented Apr 16, 2022

JeffreySarnoff commented May 3, 2019 •

edited

Loading