Difference between revisions of "Instruction Set/fmafz"
From Mill Computing Wiki
Line 1: | Line 1: | ||
{{DISPLAYTITLE:fmafz}} | {{DISPLAYTITLE:fmafz}} | ||
− | <div style="font-size:80%;line-height:90%;margin-bottom:2em">[[Speculation|realizing]] [[Encoding|exu stream]] [[Decode|exu block]] [[Phasing|compute phase]] operation [[Domains|in the binary floating point value domain]] <br /> | + | <div style="font-size:80%;line-height:90%;margin-bottom:2em">[[Speculation|realizing]] [[Encoding|exu stream]] [[Decode|exu block]] [[Phasing|compute phase]] operation [[Domains|in the binary floating point value domain]] [[Rounding|and rounds toward zero]]<br /> |
'''native on:''' [[Cores/Silver|Silver]] [[Cores/Gold|Gold]] <br /> | '''native on:''' [[Cores/Silver|Silver]] [[Cores/Gold|Gold]] <br /> | ||
</div> | </div> | ||
− | fused multiply-add | + | Binary floating point fused multiply-add. |
+ | As usual for those, it yields a higher precision than doing it separately, and is faster too. | ||
+ | Rounds towards zero. | ||
+ | |||
---- | ---- | ||
<code style="font-size:130%"><b style="color:#050">fmafz</b>(<span style="color:#009">[[Domains#f|f]]</span> <span title="belt operand from ops window">x</span>, <span style="color:#009">[[Domains#f|f]]</span> <span title="belt operand from ops window">y</span>, <span style="color:#009">[[Domains#f|f]]</span> <span title="belt operand from ops | <code style="font-size:130%"><b style="color:#050">fmafz</b>(<span style="color:#009">[[Domains#f|f]]</span> <span title="belt operand from ops window">x</span>, <span style="color:#009">[[Domains#f|f]]</span> <span title="belt operand from ops window">y</span>, <span style="color:#009">[[Domains#f|f]]</span> <span title="belt operand from ops | ||
Line 10: | Line 13: | ||
<div style="font-size:80%;line-height:90%;margin-bottom:2em">'''operands:''' [[Operands#likeAddf|like Addf [ff:f]]] | <div style="font-size:80%;line-height:90%;margin-bottom:2em">'''operands:''' [[Operands#likeAddf|like Addf [ff:f]]] | ||
</div> | </div> | ||
− | <br /> | + | Returns x*y+z on the belt.<br /> |
'''encoding:''' | '''encoding:''' | ||
Line 31: | Line 34: | ||
<div style="font-size:80%;line-height:90%;margin-bottom:2em">'''operands:''' [[Operands#likeFmasf|like Fmasf [ff:f]]] | <div style="font-size:80%;line-height:90%;margin-bottom:2em">'''operands:''' [[Operands#likeFmasf|like Fmasf [ff:f]]] | ||
</div> | </div> | ||
− | <br /> | + | This is a fused multiply-add-subtract. An excellent way to make full use of all [[Functional Unit]]s in the 2 [[Slot]]s.<br /> |
+ | r<sub>0</sub> is x*y+z*w<br /> | ||
+ | r<sub>1</sub> is x*y-z*w<br /><br /> | ||
'''encoding:''' | '''encoding:''' |
Revision as of 18:52, 20 December 2014
realizing exu stream exu block compute phase operation in the binary floating point value domain and rounds toward zero
Binary floating point fused multiply-add. As usual for those, it yields a higher precision than doing it separately, and is faster too. Rounds towards zero.
operands: like Addf [ff:f]
Returns x*y+z on the belt.
encoding:
fmafz(f x)
,
exuArgs(op arg0, op arg1)
Core | In Slots | Latencies |
---|---|---|
Silver | E0 E1 | w,w:w=6 wv,wv:wv=6 d,d:d=7 dv,dv:dv=7 q,q:q=8 qv,qv:qv=8 |
Gold | E0 E1 E2 E3 | w,w:w=6 wv,wv:wv=6 d,d:d=7 dv,dv:dv=7 q,q:q=8 qv,qv:qv=8 |
fmafz(f x, f y, f z, f w) → f r0, f r1
operands: like Fmasf [ff:f]
This is a fused multiply-add-subtract. An excellent way to make full use of all Functional Units in the 2 Slots.
r0 is x*y+z*w
r1 is x*y-z*w
encoding:
fmafz(f x, f y)
,
exuArgs(op arg0, op arg1)
Core | In Slots | Latencies |
---|---|---|
Silver | E0 | w,w:w,w=6,6 wv,wv:wv,wv=6,6 d,d:d,d=7,7 dv,dv:dv,dv=7,7 q,q:q,q=8,8 qv,qv:qv,qv=8,8 |
Gold | E0 E2 | w,w:w,w=6,6 wv,wv:wv,wv=6,6 d,d:d,d=7,7 dv,dv:dv,dv=7,7 q,q:q,q=8,8 qv,qv:qv,qv=8,8 |
Instruction Set, alphabetical, Instruction Set by Category, Instruction Set, sortable, filterable