Difference between revisions of "Instruction Set/fmadz"

From Mill Computing Wiki
Revision as of 18:52, 20 December 2014

realizing  exu stream  exu block  compute phase   operation   in the decimal floating point value domain   and rounds toward zero

native on: Decimal8 Decimal16

Decimal floating point fused multiply-add. Like other fused multiply-adds, it rounds only once, after the add, so it yields higher precision than a separate multiply followed by an add, and it is faster too. The result is rounded toward zero.


fmadz(d x, d y, d z) → d r0

operands: like Addd [dd:d]

Returns x*y+z on the belt.
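
The precision gain from fusing can be illustrated off-target with Python's standard decimal module. This is only a sketch, not Mill code: the 3-digit precision and the operand values are assumptions chosen to make the effect visible, and ROUND_DOWN is the module's name for rounding toward zero.

```python
from decimal import Context, Decimal, ROUND_DOWN

# Hypothetical 3-digit format, chosen only to make the effect visible.
# ROUND_DOWN is the decimal module's name for rounding toward zero.
ctx = Context(prec=3, rounding=ROUND_DOWN)

x, y, z = Decimal("1.23"), Decimal("4.56"), Decimal("-5.60")

# Separate operations: the product 5.6088 is truncated to 5.60 before the add.
separate = ctx.add(ctx.multiply(x, y), z)

# Fused: x*y + z is formed exactly and rounded once at the end.
fused = ctx.fma(x, y, z)

print(separate)  # 0.00
print(fused)     # 0.0088
```

With separate operations the low digits of the product are lost before the add, so the result cancels to zero; the fused form keeps them.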

encoding: fmadz(d x) , exuArgs(op arg0, op arg1)

Core        In Slots   Latencies
Decimal8    E0 E1      d,d:d=7   dv,dv:dv=7   q,q:q=8   qv,qv:qv=8
Decimal16   E0 E1      d,d:d=7   dv,dv:dv=7   q,q:q=8   qv,qv:qv=8

fmadz(d x, d y, d z, d w) → d r0, d r1

operands: like Fmasd [dd:d]

This form is a fused multiply-add-subtract: one operation yields both the sum and the difference of the two products. It is an excellent way to make full use of all Functional Units in the 2 Slots.
r0 is x*y+z*w
r1 is x*y-z*w
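
The two results can be modelled in Python's decimal module as follows. This is a hedged sketch: the helper name fmas_z, the 3-digit default precision, and the assumption that each result is computed exactly and then rounded once toward zero are illustrative and not taken from the Mill specification.

```python
from decimal import Context, Decimal, ROUND_DOWN

def fmas_z(x, y, z, w, prec=3):
    """Hypothetical model: return (x*y + z*w, x*y - z*w), each rounded once toward zero."""
    exact = Context(prec=100)  # wide enough to hold these small examples exactly
    target = Context(prec=prec, rounding=ROUND_DOWN)  # ROUND_DOWN = toward zero
    p = exact.multiply(x, y)
    q = exact.multiply(z, w)
    # plus() applies the target context's precision and rounding to the exact value.
    r0 = target.plus(exact.add(p, q))
    r1 = target.plus(exact.subtract(p, q))
    return r0, r1

r0, r1 = fmas_z(Decimal("1.23"), Decimal("4.56"), Decimal("1.11"), Decimal("2.22"))
print(r0, r1)  # 8.07 3.14
```

Here the exact values are 8.0730 and 3.1446, and both are truncated to three digits.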

encoding: fmadz(d x, d y) , exuArgs(op arg0, op arg1)

Core        In Slots   Latencies
Decimal8    E0         d,d:d,d=7,7   dv,dv:dv,dv=7,7   q,q:q,q=8,8   qv,qv:qv,qv=8,8
Decimal16   E0         d,d:d,d=7,7   dv,dv:dv,dv=7,7   q,q:q,q=8,8   qv,qv:qv,qv=8,8

