.. _umlinst:

UML Instruction Reference
=========================

.. contents::
    :local:
    :depth: 2


.. _umlinst-intro:

Introduction
------------

UML is the instruction set used by MAME’s recompiler framework.
Code running on emulated guest CPUs is translated to UML instructions,
and back-ends convert the UML instructions to a form that can be
executed or interpreted on the host system.

Many UML instruction have multiple instruction sizes.  Integer
instructions default to 32-bit size.  Adding a ``D`` or ``d`` prefix to
the mnemonic changes to 64-bit size (double word).  Floating point
instructions use the mnemonic prefix/suffix ``FS`` or ``fs`` for
IEEE 754 32-bit format (single precision) or or the prefix/suffix ``FD``
or ``fd`` for IEEE 754 64-bit format (double precision).


.. _umlinst-special:

Special value types
-------------------

.. _umlinst-conditions:

Conditions
~~~~~~~~~~

+-------------+--------------------------------+-------------+--------------------+
| Disassembly | Mnemonic                       | Usage       | Flags tested       |
+=============+================================+=============+====================+
| ``z``       | zero                           | ``COND_Z``  | ``Z``              |
|             +--------------------------------+-------------+                    |
|             | equal                          | ``COND_E``  |                    |
+-------------+--------------------------------+-------------+--------------------+
| ``nz``      | not zero                       | ``COND_NZ`` | ``!Z``             |
|             +--------------------------------+-------------+                    |
|             | not equal                      | ``COND_NE`` |                    |
+-------------+--------------------------------+-------------+--------------------+
| ``s``       | sign set                       | ``COND_S``  | ``S``              |
+-------------+--------------------------------+-------------+--------------------+
| ``ns``      | sign not set                   | ``COND_NS`` | ``!S``             |
+-------------+--------------------------------+-------------+--------------------+
| ``c``       | carry                          | ``COND_C``  | ``C``              |
|             +--------------------------------+-------------+                    |
|             | below (unsigned)               | ``COND_B``  |                    |
+-------------+--------------------------------+-------------+--------------------+
| ``nc``      | no carry                       | ``COND_NC`` | ``!C``             |
|             +--------------------------------+-------------+                    |
|             | above or equal (unsigned)      | ``COND_AE`` |                    |
+-------------+--------------------------------+-------------+--------------------+
| ``v``       | overflow (signed)              | ``COND_V``  | ``V``              |
+-------------+--------------------------------+-------------+--------------------+
| ``nv``      | no overflow (signed)           | ``COND_NV`` | ``!V``             |
+-------------+--------------------------------+-------------+--------------------+
| ``u``       | unordered                      | ``COND_U``  | ``U``              |
+-------------+--------------------------------+-------------+--------------------+
| ``nu``      | not unordered                  | ``COND_NU`` | ``!U``             |
+-------------+--------------------------------+-------------+--------------------+
| ``a``       | above (unsigned)               | ``COND_A``  | ``!Z && !C``       |
+-------------+--------------------------------+-------------+--------------------+
| ``be``      | below or equal (unsigned)      | ``COND_BE`` | ``Z || C``         |
+-------------+--------------------------------+-------------+--------------------+
| ``g``       | greater than (signed)          | ``COND_G``  | ``!Z && (S == V)`` |
+-------------+--------------------------------+-------------+--------------------+
| ``le``      | less than or equal (signed)    | ``COND_LE`` | ``Z || (S != V)``  |
+-------------+--------------------------------+-------------+--------------------+
| ``l``       | less than (signed)             | ``COND_L``  | ``S != V``         |
+-------------+--------------------------------+-------------+--------------------+
| ``ge``      | greater than or equal (signed) | ``COND_GE`` | ``S == V``         |
+-------------+--------------------------------+-------------+--------------------+

.. _umlinst-flagmask:

Flag masks
~~~~~~~~~~

+-------+-------------+-----------+----------------+
| Value | Disassembly | Mnemonic  | Usage          |
+=======+=============+===========+================+
| 0x01  | ``C``       | carry     | ``FLAG_C``     |
+-------+-------------+-----------+----------------+
| 0x02  | ``V``       | overflow  | ``FLAG_V``     |
+-------+-------------+-----------+----------------+
| 0x04  | ``Z``       | zero      | ``FLAG_Z``     |
+-------+-------------+-----------+----------------+
| 0x08  | ``S``       | sign      | ``FLAG_S``     |
+-------+-------------+-----------+----------------+
| 0x10  | ``U``       | unordered | ``FLAG_U``     |
+-------+-------------+-----------+----------------+
| 0x00  |             |           | ``FLAGS_NONE`` |
+-------+-------------+-----------+----------------+
| 0x1f  | ``USZVC``   |           | ``FLAGS_ALL``  |
+-------+-------------+-----------+----------------+

.. _umlinst-roundmode:

Floating point rounding modes
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

+-------+-------------+----------+-------------------+---------------------------------------+
| Value | Disassembly | Mnemonic | Usage             | Mode                                  |
+=======+=============+==========+===================+=======================================+
| 0     | ``trunc``   | truncate | ``ROUND_TRUNC``   | Round toward zero                     |
+-------+-------------+----------+-------------------+---------------------------------------+
| 1     | ``round``   | round    | ``ROUND_ROUND``   | Round to nearest, round half to even  |
+-------+-------------+----------+-------------------+---------------------------------------+
| 2     | ``ceil``    | ceiling  | ``ROUND_CEIL``    | Round toward positive infinity        |
+-------+-------------+----------+-------------------+---------------------------------------+
| 3     | ``floor``   | floor    | ``ROUND_FLOOR``   | Round toward negative infinity        |
+-------+-------------+----------+-------------------+---------------------------------------+
|       | ``default`` | default  | ``ROUND_DEFAULT`` | Use the current default rounding mode |
+-------+-------------+----------+-------------------+---------------------------------------+


.. _umlinst-flow:

Flow control
------------

.. _umlinst-comment:

COMMENT
~~~~~~~

Insert a comment into logged UML code.

+--------------------+---------------------------------+
| Disassembly        | Usage                           |
+====================+=================================+
| .. code-block::    | .. code-block:: C++             |
|                    |                                 |
|     comment string |     UML_COMMENT(block, string); |
+--------------------+---------------------------------+

Operands
^^^^^^^^

string
    The comment text as a pointer to a NUL-terminated string.  This must
    remain valid until code is generated for the block.

Flags
^^^^^

carry (C)
    Unchanged.
overflow (V)
    Unchanged.
zero (Z)
    Unchanged.
sign (S)
    Unchanged.
unordered (U)
    Unchanged.

.. _umlinst-nop:

NOP
~~~

No operation.

+-----------------+---------------------+
| Disassembly     | Usage               |
+=================+=====================+
| .. code-block:: | .. code-block:: C++ |
|                 |                     |
|     nop         |     UML_NOP(block); |
+-----------------+---------------------+

Flags
^^^^^

carry (C)
    Unchanged.
overflow (V)
    Unchanged.
zero (Z)
    Unchanged.
sign (S)
    Unchanged.
unordered (U)
    Unchanged.

.. _umlinst-label:

LABEL
~~~~~

Associate a location with a label number local to the current generated
code block.  Label numbers must not be reused within a generated code
block.  The :ref:`JMP <umlinst-jmp>` instruction may be used to transfer
control to the location associated with a label number.

+-------------------+------------------------------+
| Disassembly       | Usage                        |
+===================+==============================+
| .. code-block::   | .. code-block:: C++          |
|                   |                              |
|     label   label |     UML_LABEL(block, label); |
+-------------------+------------------------------+

Operands
^^^^^^^^

label (label number)
    The label number to associate with the current location.  A label
    number must not be used more than once within a generated code
    block.

Flags
^^^^^

carry (C)
    Unchanged.
overflow (V)
    Unchanged.
zero (Z)
    Unchanged.
sign (S)
    Unchanged.
unordered (U)
    Unchanged.

.. _umlinst-handle:

HANDLE
~~~~~~

Mark a location as an entry point of a subroutine.  Subroutines may be
called using the :ref:`CALLH <umlinst-callh>` and :ref:`EXH
<umlinst-exh>` instructions, and also by the `HASHJMP <umlinst-hashjmp>`
if no location is associated with the specified mode and emulated
program counter.

+--------------------+--------------------------------+
| Disassembly        | Usage                          |
+====================+================================+
| .. code-block::    | .. code-block:: C++            |
|                    |                                |
|     handle  handle |     UML_HANDLE(block, handle); |
+--------------------+--------------------------------+

Operands
^^^^^^^^

handle (code handle)
    The code handle to bind to the current location.  The handle must
    already be allocated, and must not have been bound since the last
    generated code reset (all handles are implicitly unbound when
    resetting the generated code cache).

Flags
^^^^^

carry (C)
    Unchanged.
overflow (V)
    Unchanged.
zero (Z)
    Unchanged.
sign (S)
    Unchanged.
unordered (U)
    Unchanged.

.. _umlinst-hash:

HASH
~~~~

Associate a location with the specified mode and emulated program
counter values.  The :ref:`HASHJMP <umlinst-hashjmp>` instruction may be
used to transfer control to the location associated with a mode and
emulated program counter value.  HASH instructions are not permitted in
invariant code blocks.

This is usually used to mark the location of the generated code for an
emulated instruction or sequence of instructions.

+---------------------+------------------------------+
| Disassembly         | Usage                        |
+=====================+==============================+
| .. code-block::     | .. code-block:: C++          |
|                     |                              |
|     hash    mode,pc |   UML_HASH(block, mode, pc); |
+---------------------+------------------------------+

Operands
^^^^^^^^

mode (32-bit – immediate, map variable)
    The mode to associate with the current location in the generated
    code.  Must be greater than or equal to zero and less than the
    number of modes specified when creating the recompiler context.
pc (32-bit – immediate, map variable)
    The emulated program counter value to associate with the current
    location in the generated code.

Flags
^^^^^

carry (C)
    Unchanged.
overflow (V)
    Unchanged.
zero (Z)
    Unchanged.
sign (S)
    Unchanged.
unordered (U)
    Unchanged.

.. _umlinst-jmp:

JMP
~~~

Jump to the location associated with a label number within the current
block.

+------------------------+-----------------------------------+
| Disassembly            | Usage                             |
+========================+===================================+
| .. code-block::        | .. code-block:: C++               |
|                        |                                   |
|     jmp     label      |     UML_JMP(block, label);        |
|     jmp     label,cond |     UML_JMPc(block, cond, label); |
+------------------------+-----------------------------------+

Operands
^^^^^^^^

label (label number)
    The label number associated with the location to jump to in the
    current generated code block.  The label number must be associated
    with a location in the generated code block before the block is
    finalised.
cond (condition)
    If supplied, a condition that must be met to jump to the specified
    label.  If the condition is not met, execution will continue with
    the following instruction.

Flags
^^^^^

carry (C)
    Unchanged.
overflow (V)
    Unchanged.
zero (Z)
    Unchanged.
sign (S)
    Unchanged.
unordered (U)
    Unchanged.

.. _umlinst-callh:

CALLH
~~~~~

Call the subroutine beginning at the specified code handle.

+-------------------------+--------------------------------------+
| Disassembly             | Usage                                |
+=========================+======================================+
| .. code-block::         | .. code-block:: C++                  |
|                         |                                      |
|     callh   handle      |     UML_CALLH(block, handle);        |
|     callh   handle,cond |     UML_CALLHc(block, cond, handle); |
+-------------------------+--------------------------------------+

Operands
^^^^^^^^

handle (code handle)
    Handle located at the entry point of the subroutine to call.  The
    handle must already be allocated but does not need to be bound until
    the instruction is executed.  Calling a handle that was unbound at
    code generation time may produce less efficient code than calling a
    handle that was already bound.
cond (condition)
    If supplied, a condition that must be met for the subroutine to be
    called.  If the condition is not met, the subroutine will not be
    called.

Flags
^^^^^

carry (C)
    Unchanged.
overflow (V)
    Unchanged.
zero (Z)
    Unchanged.
sign (S)
    Unchanged.
unordered (U)
    Unchanged.

.. _umlinst-exh:

EXH
~~~

Set the ``EXP`` register and call the subroutine beginning at the
specified code handle.  The ``EXP`` register is a 32-bit special
function register that may be retrieved with the :ref:`GETEXP
<umlinst-getexp>` instruction.

+-----------------------------+-----------------------------------------+
| Disassembly                 | Usage                                   |
+=============================+=========================================+
| .. code-block::             | .. code-block:: C++                     |
|                             |                                         |
|     exh     handle,arg      |     UML_EXH(block, handle, arg);        |
|     exh     handle,arg,cond |     UML_EXHc(block, handle, arg, cond); |
+-----------------------------+-----------------------------------------+

Operands
^^^^^^^^

handle (code handle)
    Handle located at the entry point of the subroutine to call.  The
    handle must already be allocated but does not need to be bound until
    the instruction is executed.  Calling a handle that was unbound at
    code generation time may produce less efficient code than calling a
    handle that was already bound.
arg (32-bit – memory, integer register, immediate, map variable)
    Value to store in the ``EXP`` register.
cond (condition)
    If supplied, a condition that must be met for the subroutine to be
    called.  If the condition is not met, the subroutine will not be
    called and the ``EXP`` register will not be modified.

Flags
^^^^^

carry (C)
    Unchanged.
overflow (V)
    Unchanged.
zero (Z)
    Unchanged.
sign (S)
    Unchanged.
unordered (U)
    Unchanged.

Simplification rules
^^^^^^^^^^^^^^^^^^^^

* Immediate values for the ``arg`` operand are truncated to 32 bits.

.. _umlinst-ret:

RET
~~~

Return from a subroutine, transferring control to the instruction
following the :ref:`CALLH <umlinst-callh>` or :ref:`EXH <umlinst-exh>`
instruction used to call the subroutine.  This instruction must only be
used within generated code subroutines.  The :ref:`EXIT <umlinst-exit>`
instruction must be used to exit from the generated code.

+------------------+----------------------------+
| Disassembly      | Usage                      |
+==================+============================+
| .. code-block::  | .. code-block:: C++        |
|                  |                            |
|     ret          |     UML_RET(block);        |
|     ret     cond |     UML_RETc(block, cond); |
+------------------+----------------------------+

Operands
^^^^^^^^

cond (condition)
    If supplied, a condition that must be met to return from the
    subroutine.  If the condition is not met, execution will continue
    with the following instruction.

Flags
^^^^^

carry (C)
    Unchanged.
overflow (V)
    Unchanged.
zero (Z)
    Unchanged.
sign (S)
    Unchanged.
unordered (U)
    Unchanged.

.. _umlinst-hashjmp:

HASHJMP
~~~~~~~

Unwind all nested generated code subroutine frames and transfer control
to the location associated with the specified mode and emulated program
counter values.  If no location is associated with the specified mode
and program counter values, call the subroutine beginning at the
specified code handle.  Note that all nested generated code subroutine
frames are unwound in either case.

This is usually used to jump to the generated code corresponding to the
emulated code at a particular address when it is not known to be in the
current generated code block or when the mode changes.

+----------------------------+-----------------------------------------+
| Disassembly                | Usage                                   |
+============================+=========================================+
| .. code-block::            | .. code-block:: C++                     |
|                            |                                         |
|     hashjmp mode,pc,handle |   UML_HASHJMP(block, mode, pc, handle); |
+----------------------------+-----------------------------------------+

Operands
^^^^^^^^

mode (32-bit – memory, integer register, immediate, map variable)
    The mode associated with the location in the generated code to
    transfer control to.  Must be greater than or equal to zero and less
    than the number of modes specified when creating the recompiler
    context.
pc (32-bit – memory, integer register, immediate, map variable)
    The emulated program counter value associated with the location in
    the generated code to transfer control to.
handle (code handle)
    Handle located at the entry point of the subroutine to call if no
    location in the generated code is associated with the specified mode
    and emulated program counter values.  The handle must already be
    allocated but does not need to be bound until the instruction is
    executed.  Calling a handle that was unbound at code generation time
    may produce less efficient code than calling a handle that was
    already bound.

Flags
^^^^^

carry (C)
    Unchanged.
overflow (V)
    Unchanged.
zero (Z)
    Unchanged.
sign (S)
    Unchanged.
unordered (U)
    Unchanged.

.. _umlinst-exit:

EXIT
~~~~

Exit from the generated code, returning control to the caller.  May be
used from within any level of nested subroutine calls in the generated
code.

+-----------------------+----------------------------------+
| Disassembly           | Usage                            |
+=======================+==================================+
| .. code-block::       | .. code-block:: C++              |
|                       |                                  |
|     exit    arg,      |     UML_EXIT(block, arg);        |
|     exit    arg,,cond |     UML_EXITc(block, arg, cond); |
+-----------------------+----------------------------------+

Operands
^^^^^^^^

arg (32-bit – memory, integer register, immediate, map variable)
    The value to return to the caller.
cond (condition)
    If supplied, a condition that must be met to exit from the generated
    code.  If the condition is not met, execution will continue with the
    following instruction.

Flags
^^^^^

carry (C)
    Unchanged.
overflow (V)
    Unchanged.
zero (Z)
    Unchanged.
sign (S)
    Unchanged.
unordered (U)
    Unchanged.

Simplification rules
^^^^^^^^^^^^^^^^^^^^

* Immediate values for the ``arg`` operand are truncated to 32 bits.

.. _umlinst-callc:

CALLC
~~~~~

Call a C function with the signature ``void (*)(T *)`` or
``void (*)(T &)`` (i.e. a function with a single pointer or reference
parameter that does not return a value).

+---------------------------+-----------------------------------------+
| Disassembly               | Usage                                   |
+===========================+=========================================+
| .. code-block::           | .. code-block:: C++                     |
|                           |                                         |
|     callc   func,arg      |     UML_CALLC(block, func, arg);        |
|     callc   func,arg,cond |     UML_CALLCc(block, func, arg, cond); |
+---------------------------+-----------------------------------------+

Operands
^^^^^^^^

func (C function)
    Function pointer to the function to call.
arg (memory)
    Argument to pass to the function.
cond (condition)
    If supplied, a condition that must be met for the function to be
    called.  If the condition is not met, the function will not be
    called.

Flags
^^^^^

carry (C)
    Undefined.
overflow (V)
    Undefined.
zero (Z)
    Undefined.
sign (S)
    Undefined.
unordered (U)
    Undefined.

.. _umlinst-debug:

DEBUG
~~~~~

Call the debugger instruction hook function if appropriate.

If the debugger is active, this should be executed before each emulated
instruction.  Any emulated CPU state kept in UML registers should be
flushed to memory before executing this instruction and reloaded
afterwards to ensure the debugger can display and modify values
correctly.

+-----------------+---------------------------+
| Disassembly     | Usage                     |
+=================+===========================+
| .. code-block:: | .. code-block:: C++       |
|                 |                           |
|     debug   pc  |     UML_DEBUG(block, pc); |
+-----------------+---------------------------+

Operands
^^^^^^^^

pc (32-bit – memory, integer register, immediate, map variable)
    The emulated program counter value to supply to the debugger
    instruction hook function.

Flags
^^^^^

carry (C)
    Undefined.
overflow (V)
    Undefined.
zero (Z)
    Undefined.
sign (S)
    Undefined.
unordered (U)
    Undefined.

Simplification rules
^^^^^^^^^^^^^^^^^^^^

* Immediate values for the ``pc`` operand are truncated to 32 bits.

.. _umlinst-break:

BREAK
~~~~~

Break into the host debugger if attached.  Has no effect or crashes if
no host debugger is attached depending on the host system and
configuration.  This is intended as a developer aid and should not be
left in final code.

+-----------------+-----------------------+
| Disassembly     | Usage                 |
+=================+=======================+
| .. code-block:: | .. code-block:: C++   |
|                 |                       |
|     break       |     UML_BREAK(block); |
+-----------------+-----------------------+

Flags
^^^^^

carry (C)
    Undefined.
overflow (V)
    Undefined.
zero (Z)
    Undefined.
sign (S)
    Undefined.
unordered (U)
    Undefined.

Simplification rules
^^^^^^^^^^^^^^^^^^^^

No simplifications are applied to this instruction.


.. _umlinst-control:

Status and control
------------------

.. _umlinst-set:

SET
~~~

Conditionally set integer to zero or one depending on flags.

+----------------------+----------------------------------+
| Disassembly          | Usage                            |
+======================+==================================+
| .. code-block::      | .. code-block:: C++              |
|                      |                                  |
|     set     dst,cond |     UML_SETc(block, dst);        |
|     dset    dst,cond |     UML_DSETc(block, cond, dst); |
+----------------------+----------------------------------+

Operands
^^^^^^^^

dst (32-bit or 64-bit – memory, integer register)
    The destination that will be set to zero (0) if the condition is not
    met or one (1) if the condition is met.
cond (condition)
    A condition to test.  The destination will be set to zero (0) if the
    condition is not met or one (1) if the condition is met.

Flags
^^^^^

carry (C)
    Unchanged.
overflow (V)
    Unchanged.
zero (Z)
    Unchanged.
sign (S)
    Unchanged.
unordered (U)
    Unchanged.

Simplification rules
^^^^^^^^^^^^^^^^^^^^

No simplifications are applied to this instruction.

.. _umlinst-carry:

CARRY
~~~~~

Set the carry flag.

+---------------------+----------------------------------+
| Disassembly         | Usage                            |
+=====================+==================================+
| .. code-block::     | .. code-block::                  |
|                     |                                  |
|     carry   src,bit |     UML_CARRY(block, src, bit);  |
|     dcarry  src,bit |     UML_DCARRY(block, src, bit); |
+---------------------+----------------------------------+

src (32-bit or 64-bit – memory, integer register, immediate, map variable)
    An integer value containing a bit to be copied to the carry flag.
bit (32-bit or 64-bit – memory, integer register, immediate, map variable)
    The index of the bit to copy to the carry flag.  Bits are numbered
    starting at zero for the least significant bit position, ascending
    toward the most significant bit position.  Only the least
    significant five bits or six bits of this operand are used,
    depending on the instruction size.

Flags
^^^^^

carry (C)
    Set to the value of the selected bit of the ``src`` operand.
overflow (V)
    Undefined.
zero (Z)
    Undefined.
sign (S)
    Undefined.
unordered (U)
    Undefined.

Simplification rules
^^^^^^^^^^^^^^^^^^^^

* Immediate values for the ``src`` operand are truncated to the
  instruction size.
* Immediate values for the ``bit`` operand are truncated to five or six
  bits for 32-bit or 64-bit operands, respectively.

.. _umlinst-setflgs:

SETFLGS
~~~~~~~

Set the flags arbitrarily.

+-----------------+-------------------------------+
| Disassembly     | Usage                         |
+=================+===============================+
| .. code-block:: | .. code-block::               |
|                 |                               |
|     setflgs src |     UML_SETFLGS(block, src);  |
+-----------------+-------------------------------+

The five least significant bits of the value of ``src`` are copied to
the flags.  Bits are copied to the **carry (C)**, **overflow (V)**,
**zero (Z)**, **sign (S)** and **unordered (U)** flags, starting from
the least significant bit position.

Operands
^^^^^^^^

src (32-bit – memory, integer register, immediate, map variable)
    The value to copy to the flags.  Only the least significant five
    bits of this operand are used.

Flags
^^^^^

carry (C)
    Set to the value of bit 0 of the ``src`` operand, counting from the
    least significant bit starting from zero.
overflow (V)
    Set to the value of bit 1 of the ``src`` operand, counting from the
    least significant bit starting from zero.
zero (Z)
    Set to the value of bit 2 of the ``src`` operand, counting from the
    least significant bit starting from zero.
sign (S)
    Set to the value of bit 3 of the ``src`` operand, counting from the
    least significant bit starting from zero.
unordered (U)
    Set to the value of bit 4 of the ``src`` operand, counting from the
    least significant bit starting from zero.

Simplification rules
^^^^^^^^^^^^^^^^^^^^

No simplifications are applied to this instruction.

.. _umlinst-getflgs:

GETFLGS
~~~~~~~

Copy flags.

+----------------------+------------------------------------+
| Disassembly          | Usage                              |
+======================+====================================+
| .. code-block::      | .. code-block:: C++                |
|                      |                                    |
|     getflgs dst,mask |     UML_GETFLGS(block, dst, mask); |
+----------------------+------------------------------------+

The flags corresponding to bit positions that are set in ``mask`` are
copied to the corresponding bit positions in ``dst``.  Bit positions
corresponding in ``dst`` that do not correspond to flags or that
correspond to bit positions that are clear in ``mask`` are cleared.

Back-ends may be able to generate more efficient code if fewer bit
positions are set in ``mask``.

Operands
^^^^^^^^

src (32-bit – memory, integer register)
    The destination where the flags corresponding to bit positions that
    are set in the ``mask`` operand will be copied.
mask (flag mask – immediate, map variable)
    The mask to specify which flags to copy.  Only the least significant
    five bits of this operand are used.

Flags
^^^^^

carry (C)
    Unchanged.
overflow (V)
    Unchanged.
zero (Z)
    Unchanged.
sign (S)
    Unchanged.
unordered (U)
    Unchanged.

Simplification rules
^^^^^^^^^^^^^^^^^^^^

No simplifications are applied to this instruction.

.. _umlinst-setfmod:

SETFMOD
~~~~~~~

Set the default floating point rounding mode.  The default rounding mode
is used for floating point arithmetic and for floating point to integer
conversion when ``ROUND_DEFAULT`` is specified.

+-------------------+--------------------------------+
| Disassembly       | Usage                          |
+===================+================================+
| .. code-block::   | .. code-block:: C++            |
|                   |                                |
|     setfmod round |     UML_SETFMOD(block, round); |
+-------------------+--------------------------------+

Operands
^^^^^^^^

round (32-bit – memory, integer register, immediate, map variable)
    The rounding mode to set as the default.  Only the two least
    significant bits of the value are used.  Must be 0 (``ROUND_TRUNC``)
    to round toward zero, 1 (``ROUND_ROUND``) to round to nearest, 2
    (``ROUND_CEIL``) to round toward positive infinity, or 3
    (``ROUND_FLOOR``) to round toward negative infinity.

Flags
^^^^^

carry (C)
    Undefined.
overflow (V)
    Undefined.
zero (Z)
    Undefined.
sign (S)
    Undefined.
unordered (U)
    Undefined.

Simplification rules
^^^^^^^^^^^^^^^^^^^^

No simplifications are applied to this instruction.

.. _umlinst-getfmod:

GETFMOD
~~~~~~~

Get the current default floating point rounding mode set by the most
recent :ref:`SETFMOD <umlinst-setfmod>` or :ref:`RESTORE
<umlinst-restore>` instruction.

+-----------------+------------------------------+
| Disassembly     | Usage                        |
+=================+==============================+
| .. code-block:: | .. code-block:: C++          |
|                 |                              |
|     getfmod dst |     UML_GETFMOD(block, dst); |
+-----------------+------------------------------+

Note that the result of this instruction may not correspond to the
actual effective default rounding mode between entering the generated
code and executing the first :ref:`SETFMOD <umlinst-setfmod>` or
:ref:`RESTORE <umlinst-restore>` instruction.

Operands
^^^^^^^^

dst (32-bit – memory, integer register)
    The destination where the current default rounding mode will be
    stored.  Will be set to 0 (``ROUND_TRUNC``) for round toward zero, 1
    (``ROUND_ROUND``) for round to nearest, 2 (``ROUND_CEIL``) for round
    toward positive infinity, or 3 (``ROUND_FLOOR``) for round toward
    negative infinity.

Flags
^^^^^

carry (C)
    Undefined.
overflow (V)
    Undefined.
zero (Z)
    Undefined.
sign (S)
    Undefined.
unordered (U)
    Undefined.

Simplification rules
^^^^^^^^^^^^^^^^^^^^

No simplifications are applied to this instruction.

.. _umlinst-restore:

RESTORE
~~~~~~~

Set the contents of the UML integer and floating point registers, the
contents of the ``EXP`` register, the flags and the default floating
point rounding mode from a ``drcuml_machine_state`` structure.

+-----------------+------------------------------+
| Disassembly     | Usage                        |
+=================+==============================+
| .. code-block:: | .. code-block:: C++          |
|                 |                              |
|     restore src |     UML_RESTORE(block, src); |
+-----------------+------------------------------+

Restores program-visible UML state from a structure in memory.  The
subroutine call stack and current instruction pointer are not changed.
Execution continues with the following UML instruction.

src (``drcuml_machine_state`` structure – memory)
    The source that will be used to set program-visible UML machine
    state.  This may be any host memory location accessible by the
    application.  It is not restricted to the recompiler cache.

Flags
^^^^^

carry (C)
    Set from the ``src`` operand.
overflow (V)
    Set from the ``src`` operand.
zero (Z)
    Set from the ``src`` operand.
sign (S)
    Set from the ``src`` operand.
unordered (U)
    Set from the ``src`` operand.

Simplification rules
^^^^^^^^^^^^^^^^^^^^

No simplifications are applied to this instruction.

.. _umlinst-save:

SAVE
~~~~

Copy the contents of the UML integer and floating point registers, the
contents of the ``EXP`` register, the flags and the default floating
point rounding mode to a ``drcuml_machine_state`` structure.

+-----------------+---------------------------+
| Disassembly     | Usage                     |
+=================+===========================+
| .. code-block:: | .. code-block:: C++       |
|                 |                           |
|     save    dst |     UML_SAVE(block, dst); |
+-----------------+---------------------------+

Saves program-visible UML state to a structure in memory that can
subsequently be restored using the :ref:`RESTORE <umlinst-restore>`
instruction.  The subroutine call stack and current instruction pointer
are not saved.

Note that the saved floating point rounding mode may not correspond to
the actual effective default rounding mode between entering the
generated code and executing the first :ref:`SETFMOD <umlinst-setfmod>`
or :ref:`RESTORE <umlinst-restore>` instruction.

Operands
^^^^^^^^

dst (``drcuml_machine_state`` structure – memory)
    The destination where program-visible UML machine state will be
    saved.  This may be any host memory location accessible by the
    application.  It is not restricted to the recompiler cache.

Flags
^^^^^

carry (C)
    Undefined.
overflow (V)
    Undefined.
zero (Z)
    Undefined.
sign (S)
    Undefined.
unordered (U)
    Undefined.

Simplification rules
^^^^^^^^^^^^^^^^^^^^

No simplifications are applied to this instruction.


.. _umlinst-datamove:

Data movement
-------------

.. _umlinst-mov:

MOV
~~~

Copy an integer value.

+--------------------------+---------------------------------------+
| Disassembly              | Usage                                 |
+==========================+=======================================+
| .. code-block::          | .. code-block:: C++                   |
|                          |                                       |
|     mov     dst,src      |     UML_MOV(block, dst, src);         |
|     mov     dst,src,cond |     UML_MOVc(block, cond, dst, src);  |
|     dmov    dst,src      |     UML_DMOV(block, dst, src);        |
|     dmov    dst,src,cond |     UML_DMOVc(block, cond, dst, src); |
+--------------------------+---------------------------------------+

Operands
^^^^^^^^

dst (32-bit or 64-bit – memory, integer register)
    The destination where the value will be copied to.
src (32-bit or 64-bit – memory, integer register, immediate, map variable)
    The source value to copy.
cond (condition)
    If supplied, a condition that must be met to copy the value.  If the
    condition is not met, the instruction will have no effect.

Flags
^^^^^

carry (C)
    Unchanged.
overflow (V)
    Unchanged.
zero (Z)
    Unchanged.
sign (S)
    Unchanged.
unordered (U)
    Unchanged.

Simplification rules
^^^^^^^^^^^^^^^^^^^^

* Immediate values for the ``src`` operand are truncated to the
  instruction size.
* Converted to :ref:`NOP <umlinst-nop>` if the ``src`` and ``dst``
  operands refer to the same memory location or register and the
  instruction size is no larger than the destination size.

.. _umlinst-fmov:

FMOV
~~~~

Copy a floating point value.  The binary value will be preserved even if
it is not a valid representation of a floating point number.

+--------------------------+----------------------------------------+
| Disassembly              | Usage                                  |
+==========================+========================================+
| .. code-block::          | .. code-block:: C++                    |
|                          |                                        |
|     fsmov   dst,src      |     UML_FSMOV(block, dst, src);        |
|     fsmov   dst,src,cond |     UML_FSMOVc(block, cond, dst, src); |
|     fdmov   dst,src      |     UML_FDMOV(block, dst, src);        |
|     fdmov   dst,src,cond |     UML_FDMOVc(block, cond, dst, src); |
+--------------------------+----------------------------------------+

Operands
^^^^^^^^

dst (32-bit or 64-bit – memory, floating point register)
    The destination where the value will be copied to.
src (32-bit or 64-bit – memory, floating point register)
    The source value to copy.
cond (condition)
    If supplied, a condition that must be met to copy the value.  If the
    condition is not met, the instruction will have no effect.

Flags
^^^^^

carry (C)
    Unchanged.
overflow (V)
    Unchanged.
zero (Z)
    Unchanged.
sign (S)
    Unchanged.
unordered (U)
    Unchanged.

Simplification rules
^^^^^^^^^^^^^^^^^^^^

* Converted to :ref:`NOP <umlinst-nop>` if the ``src`` and ``dst``
  operands refer to the same memory location or register.

.. _umlinst-fcopyi:

FCOPYI
~~~~~~

Reinterpret an integer value as a floating point value.  The binary
value will be preserved even if it is not a valid representation of a
floating point number.

+---------------------+-----------------------------------+
| Disassembly         | Usage                             |
+=====================+===================================+
| .. code-block::     | .. code-block:: C++               |
|                     |                                   |
|     fscopyi dst,src |     UML_FSCOPYI(block, dst, src); |
|     fdcopyi dst,src |     UML_FDCOPYI(block, dst, src); |
+---------------------+-----------------------------------+

Operands
^^^^^^^^

dst (32-bit or 64-bit – memory, floating point register)
    The destination where the value will be copied to.
src (32-bit or 64-bit – memory, integer register)
    The source value to copy.

Flags
^^^^^

carry (C)
    Unchanged.
overflow (V)
    Unchanged.
zero (Z)
    Unchanged.
sign (S)
    Unchanged.
unordered (U)
    Unchanged.

.. _umlinst-icopyf:

ICOPYF
~~~~~~

Reinterpret a floating point value as an integer value.  The binary
value will be preserved even if it is not a valid representation of a
floating point number.

+---------------------+-----------------------------------+
| Disassembly         | Usage                             |
+=====================+===================================+
| .. code-block::     | .. code-block:: C++               |
|                     |                                   |
|     icopyfs dst,src |     UML_ICOPYFS(block, dst, src); |
|     icopyfd dst,src |     UML_ICOPYFD(block, dst, src); |
+---------------------+-----------------------------------+

Operands
^^^^^^^^

dst (32-bit or 64-bit – memory, integer register)
    The destination where the value will be copied to.
src (32-bit or 64-bit – memory, floating point register)
    The source value to copy.

Flags
^^^^^

carry (C)
    Unchanged.
overflow (V)
    Unchanged.
zero (Z)
    Unchanged.
sign (S)
    Unchanged.
unordered (U)
    Unchanged.

.. _umlinst-load:

LOAD
~~~~

Load an unsigned integer value from a memory location with variable
displacement.  The value is zero-extended to the size of the
destination.  Host system rules for integer alignment must be followed.

+---------------------------------------+------------------------------------------------------+
| Disassembly                           | Usage                                                |
+=======================================+======================================================+
| .. code-block::                       | .. code-block:: C++                                  |
|                                       |                                                      |
|     load    dst,base,index,size_scale |     UML_LOAD(block, dst, base, index, size, scale);  |
|     dload   dst,base,index,size_scale |     UML_DLOAD(block, dst, base, index, size, scale); |
+---------------------------------------+------------------------------------------------------+

This instruction can be used to read a value from any host memory
location accessible by the application.  It is not restricted to the
recompiler cache.

Operands
^^^^^^^^

dst (32-bit or 64-bit – memory, integer register)
    The destination where the value read from memory will be stored.
base (memory)
    The base address of the area of memory to read from.
index (32-bit – memory, integer register, immediate, map variable)
    The displacement value added to the base address to calculate the
    address to read from.  This value may be scaled by a factor of 1, 2,
    4 or 8 depending on the ``scale`` operand.  Note that this is always
    a 32-bit operand interpreted as a signed integer, irrespective of
    the instruction size.
size (access size)
    The size of the value to read.  Must be ``SIZE_BYTE`` (8-bit),
    ``SIZE_WORD`` (16-bit), ``SIZE_DWORD`` (32-bit) or ``SIZE_QWORD``
    (64-bit).  Note that this operand controls the size of the value
    read from memory while the instruction size sets the size of the
    ``dst`` operand.
scale (index scale)
    The scale factor to apply to the ``index`` operand.  Must be
    ``SCALE_x1``, ``SCALE_x2``, ``SCALE_x4`` or ``SCALE_x8`` to multiply
    by 1, 2, 4 or 8, respectively (shift left by 0, 1, 2 or 3 bits).

Flags
^^^^^

carry (C)
    Unchanged.
overflow (V)
    Unchanged.
zero (Z)
    Unchanged.
sign (S)
    Unchanged.
unordered (U)
    Unchanged.

.. _umlinst-loads:

LOADS
~~~~~

Load a signed integer value from a memory location with variable
displacement.  The value is sign-extended to the size of the
destination.  Host system rules for integer alignment must be followed.

+---------------------------------------+-------------------------------------------------------+
| Disassembly                           | Usage                                                 |
+=======================================+=======================================================+
| .. code-block::                       | .. code-block:: C++                                   |
|                                       |                                                       |
|     loads   dst,base,index,size_scale |     UML_LOADS(block, dst, base, index, size, scale);  |
|     dloads  dst,base,index,size_scale |     UML_DLOADS(block, dst, base, index, size, scale); |
+---------------------------------------+-------------------------------------------------------+

This instruction can be used to read a value from any host memory
location accessible by the application.  It is not restricted to the
recompiler cache.

Operands
^^^^^^^^

dst (32-bit or 64-bit – memory, integer register)
    The destination where the value read from memory will be stored.
base (memory)
    The base address of the area of memory to read from.
index (32-bit – memory, integer register, immediate, map variable)
    The displacement value added to the base address to calculate the
    address to read from.  This value may be scaled by a factor of 1, 2,
    4 or 8 depending on the ``scale`` operand.  Note that this is always
    a 32-bit operand interpreted as a signed integer, irrespective of
    the instruction size.
size (access size)
    The size of the value to read.  Must be ``SIZE_BYTE`` (8-bit),
    ``SIZE_WORD`` (16-bit), ``SIZE_DWORD`` (32-bit) or ``SIZE_QWORD``
    (64-bit).  Note that this operand controls the size of the value
    read from memory while the instruction size sets the size of the
    ``dst`` operand.
scale (index scale)
    The scale factor to apply to the ``index`` operand.  Must be
    ``SCALE_x1``, ``SCALE_x2``, ``SCALE_x4`` or ``SCALE_x8`` to multiply
    by 1, 2, 4 or 8, respectively (shift left by 0, 1, 2 or 3 bits).

Flags
^^^^^

carry (C)
    Unchanged.
overflow (V)
    Unchanged.
zero (Z)
    Unchanged.
sign (S)
    Unchanged.
unordered (U)
    Unchanged.

.. _umlinst-store:

STORE
~~~~~

Store an integer value to a location in memory with variable
displacement.  Host system rules for integer alignment must be followed.

+---------------------------------------+-------------------------------------------------------+
| Disassembly                           | Usage                                                 |
+=======================================+=======================================================+
| .. code-block::                       | .. code-block:: C++                                   |
|                                       |                                                       |
|     store   base,index,src,size_scale |     UML_STORE(block, base, index, src, size, scale);  |
|     dstore  base,index,src,size_scale |     UML_DSTORE(block, base, index, src, size, scale); |
+---------------------------------------+-------------------------------------------------------+

This instruction can be used to write a value to any host memory
location accessible by the application.  It is not restricted to the
recompiler cache.

Operands
^^^^^^^^

base (memory)
    The base address of the area of memory to write to.
index (32-bit – memory, integer register, immediate, map variable)
    The displacement value added to the base address to calculate the
    address to write to.  This value may be scaled by a factor of 1, 2,
    4 or 8 depending on the ``scale`` operand.  Note that this is always
    a 32-bit operand interpreted as a signed integer, irrespective of
    the instruction size.
src (32-bit or 64-bit – memory, integer register, immediate, map variable)
    The value to write to memory.
size (access size)
    The size of the value to write.  Must be ``SIZE_BYTE`` (8-bit),
    ``SIZE_WORD`` (16-bit), ``SIZE_DWORD`` (32-bit) or ``SIZE_QWORD``
    (64-bit).  Note that this operand controls the size of the value
    written to memory while the instruction size sets the size of the
    ``src`` operand.
scale (index scale)
    The scale factor to apply to the ``index`` operand.  Must be
    ``SCALE_x1``, ``SCALE_x2``, ``SCALE_x4`` or ``SCALE_x8`` to multiply
    by 1, 2, 4 or 8, respectively (shift left by 0, 1, 2 or 3 bits).

Flags
^^^^^

carry (C)
    Unchanged.
overflow (V)
    Unchanged.
zero (Z)
    Unchanged.
sign (S)
    Unchanged.
unordered (U)
    Unchanged.

.. _umlinst-fload:

FLOAD
~~~~~

Load a floating point value from a memory location with variable
displacement.  The binary value will be preserved even if it is not a
valid representation of a floating point number.  Host system rules for
memory access alignment must be followed.

+----------------------------+------------------------------------------+
| Disassembly                | Usage                                    |
+============================+==========================================+
| .. code-block::            | .. code-block:: C++                      |
|                            |                                          |
|     fsload  dst,base,index |     UML_FSLOAD(block, dst, base, index); |
|     fdload  dst,base,index |     UML_FDLOAD(block, dst, base, index); |
+----------------------------+------------------------------------------+

Operands
^^^^^^^^

dst (32-bit or 64-bit – memory, floating point register)
    The destination where the value read from memory will be stored.
base (memory)
    The base address of the area of memory to read from.
index (32-bit – memory, integer register, immediate, map variable)
    The displacement value added to the base address to calculate the
    address to read from.  This value will be scaled by the instruction
    size (multiplied by 4 or 8).  Note that this is always a 32-bit
    operand interpreted as a signed integer, irrespective of the
    instruction size.

Flags
^^^^^

carry (C)
    Unchanged.
overflow (V)
    Unchanged.
zero (Z)
    Unchanged.
sign (S)
    Unchanged.
unordered (U)
    Unchanged.

.. _umlinst-fstore:

FSTORE
~~~~~~

Store a floating point value to a memory location with variable
displacement.  The binary value will be preserved even if it is not a
valid representation of a floating point number.  Host system rules for
memory access alignment must be followed.

+----------------------------+-------------------------------------------+
| Disassembly                | Usage                                     |
+============================+===========================================+
| .. code-block::            | .. code-block:: C++                       |
|                            |                                           |
|     fsstore base,index,src |     UML_FSSTORE(block, base, index, src); |
|     fdstore base,index,src |     UML_FDSTORE(block, base, index, src); |
+----------------------------+-------------------------------------------+

Operands
^^^^^^^^

base (memory)
    The base address of the area of memory to write to.
index (32-bit – memory, integer register, immediate, map variable)
    The displacement value added to the base address to calculate the
    address to write to.  This value will be scaled by the instruction
    size (multiplied by 4 or 8).  Note that this is always a 32-bit
    operand interpreted as a signed integer, irrespective of the
    instruction size.
src (32-bit or 64-bit – memory, floating point register)
    The value to write to memory.

Flags
^^^^^

carry (C)
    Unchanged.
overflow (V)
    Unchanged.
zero (Z)
    Unchanged.
sign (S)
    Unchanged.
unordered (U)
    Unchanged.

.. _umlinst-getexp:

GETEXP
~~~~~~

Copy the value of the ``EXP`` register.  The ``EXP`` register can be set
using the :ref:`EXH <umlinst-exh>` instruction.

+-----------------+-----------------------------+
| Disassembly     | Usage                       |
+=================+=============================+
| .. code-block:: | .. code-block:: C++         |
|                 |                             |
|     getexp  dst |     UML_GETEXP(block, dst); |
+-----------------+-----------------------------+

Operands
^^^^^^^^

dst (32-bit – memory, integer register)
    The destination to copy the value of the ``EXP`` register to.  Note
    that the ``EXP`` register can only hold a 32-bit value.

Flags
^^^^^

carry (C)
    Unchanged.
overflow (V)
    Unchanged.
zero (Z)
    Unchanged.
sign (S)
    Unchanged.
unordered (U)
    Unchanged.

.. _umlinst-mapvar:

MAPVAR
~~~~~~

Set the value of a map variable starting at the current location in the
current generated code block.

+--------------------------+---------------------------------------+
| Disassembly              | Usage                                 |
+==========================+=======================================+
| .. code-block::          | .. code-block:: C++                   |
|                          |                                       |
|     mapvar  mapvar,value |     UML_MAPVAR(block, mapvar, value); |
+--------------------------+---------------------------------------+

Operands
^^^^^^^^

mapvar (map variable)
    The map variable to set the value of.
value (32-bit – immediate, map variable)
    The value to set the map variable to.  Note that map variables can
    only hold 32-bit values.

Flags
^^^^^

carry (C)
    Unchanged.
overflow (V)
    Unchanged.
zero (Z)
    Unchanged.
sign (S)
    Unchanged.
unordered (U)
    Unchanged.

.. _umlinst-recover:

RECOVER
~~~~~~~

Retrieve the value of a map variable at the location of the call
instruction in the outermost generated code frame.  This instruction
should only be used from within a generated code subroutine.  Results
are undefined if this instruction is executed from outside any
generated code subroutines.

+------------------------+--------------------------------------+
| Disassembly            | Usage                                |
+========================+======================================+
| .. code-block::        | .. code-block:: C++                  |
|                        |                                      |
|     recover dst,mapvar |     UML_RECOVER(block, dst, mapvar); |
+------------------------+--------------------------------------+

Operands
^^^^^^^^

dst (32-bit – memory, integer register)
    The destination to copy the value of the map variable to.  Note that
    map variables can only hold 32-bit values.
mapvar (map variable)
    The map variable to retrieve the value of from the outermost
    generated code frame.

Flags
^^^^^

carry (C)
    Undefined.
overflow (V)
    Undefined.
zero (Z)
    Undefined.
sign (S)
    Undefined.
unordered (U)
    Undefined.


.. _umlinst-memaccess:

Emulated memory access
----------------------

.. _umlinst-read:

READ
~~~~

Read from an emulated address space.  The access mask is implied to have
all bits set.

+---------------------------------+-----------------------------------------------+
| Disassembly                     | Usage                                         |
+=================================+===============================================+
| .. code-block::                 | .. code-block:: C++                           |
|                                 |                                               |
|     read    dst,addr,space_size |     UML_READ(block, dst, addr, size, space);  |
|     dread   dst,addr,space_size |     UML_DREAD(block, dst, addr, size, space); |
+---------------------------------+-----------------------------------------------+

Operands
^^^^^^^^

dst (32-bit or 64-bit – memory, integer register)
    The destination where the value read from the emulated address space
    will be stored.
addr (32-bit – memory, integer register, immediate, map variable)
    The address to read from in the emulated address space.  Note that
    this is always a 32-bit operand, irrespective of the instruction
    size.
size (access size)
    The size of the emulated memory access.  Must be ``SIZE_BYTE``
    (8-bit), ``SIZE_WORD`` (16-bit), ``SIZE_DWORD`` (32-bit) or
    ``SIZE_QWORD`` (64-bit).  Note that this operand controls the size
    of the emulated memory access while the instruction size sets the
    size of the ``dst`` operand.
space (address space number)
    An integer identifying the address space to read from.  May be
    ``SPACE_PROGRAM``, ``SPACE_DATA``, ``SPACE_IO`` or ``SPACE_OPCODES``
    for one of the common CPU address spaces, or a non-negative integer
    cast to ``memory_space``.

Flags
^^^^^

carry (C)
    Undefined.
overflow (V)
    Undefined.
zero (Z)
    Undefined.
sign (S)
    Undefined.
unordered (U)
    Undefined.

Simplification rules
^^^^^^^^^^^^^^^^^^^^

* Immediate values for the ``addr`` operand are truncated to 32 bits.

.. _umlinst-readm:

READM
~~~~~

Read from an emulated address space with access mask specified.

+--------------------------------------+------------------------------------------------------+
| Disassembly                          | Usage                                                |
+======================================+======================================================+
| .. code-block::                      | .. code-block:: C++                                  |
|                                      |                                                      |
|     readm   dst,addr,mask,space_size |     UML_READM(block, dst, addr, mask, size, space);  |
|     dreadm  dst,addr,mask,space_size |     UML_DREADM(block, dst, addr, mask, size, space); |
+--------------------------------------+------------------------------------------------------+

Operands
^^^^^^^^

dst (32-bit or 64-bit – memory, integer register)
    The destination where the value read from the emulated address space
    will be stored.
addr (32-bit – memory, integer register, immediate, map variable)
    The address to read from in the emulated address space.  Note that
    this is always a 32-bit operand, irrespective of the instruction
    size.
mask (32-bit or 64-bit – memory, integer register, immediate, map variable)
    The access mask for the emulated memory access.
size (access size)
    The size of the emulated memory access.  Must be ``SIZE_BYTE``
    (8-bit), ``SIZE_WORD`` (16-bit), ``SIZE_DWORD`` (32-bit) or
    ``SIZE_QWORD`` (64-bit).  Note that this operand controls the size
    of the emulated memory access while the instruction size sets the
    size of the ``dst`` and ``mask`` operands.
space (address space number)
    An integer identifying the address space to read from.  May be
    ``SPACE_PROGRAM``, ``SPACE_DATA``, ``SPACE_IO`` or ``SPACE_OPCODES``
    for one of the common CPU address spaces, or a non-negative integer
    cast to ``memory_space``.

Flags
^^^^^

carry (C)
    Undefined.
overflow (V)
    Undefined.
zero (Z)
    Undefined.
sign (S)
    Undefined.
unordered (U)
    Undefined.

Simplification rules
^^^^^^^^^^^^^^^^^^^^

* Immediate values for the ``addr`` operand are truncated to 32 bits.
* Immediate values for the ``mask`` operand are truncated to the access
  size.
* Converted to :ref:`READ <umlinst-read>` if the ``mask`` operand is an
  immediate value with all bits set.

.. _umlinst-write:

WRITE
~~~~~

Write to an emulated address space.  The access mask is implied to have
all bits set.

+---------------------------------+------------------------------------------------+
| Disassembly                     | Usage                                          |
+=================================+================================================+
| .. code-block::                 | .. code-block:: C++                            |
|                                 |                                                |
|     write   addr,src,space_size |     UML_WRITE(block, addr, src, size, space);  |
|     dwrite  addr,src,space_size |     UML_DWRITE(block, addr, src, size, space); |
+---------------------------------+------------------------------------------------+

Operands
^^^^^^^^

addr (32-bit – memory, integer register, immediate, map variable)
    The address to write to in the emulated address space.  Note that
    this is always a 32-bit operand, irrespective of the instruction
    size.
src (32-bit or 64-bit – memory, integer register, immediate, map variable)
    The value to write to the emulated address space.
size (access size)
    The size of the emulated memory access.  Must be ``SIZE_BYTE``
    (8-bit), ``SIZE_WORD`` (16-bit), ``SIZE_DWORD`` (32-bit) or
    ``SIZE_QWORD`` (64-bit).  Note that this operand controls the size
    of the emulated memory access while the instruction size sets the
    size of the ``src`` operand.
space (address space number)
    An integer identifying the address space to read from.  May be
    ``SPACE_PROGRAM``, ``SPACE_DATA``, ``SPACE_IO`` or ``SPACE_OPCODES``
    for one of the common CPU address spaces, or a non-negative integer
    cast to ``memory_space``.

Flags
^^^^^

carry (C)
    Undefined.
overflow (V)
    Undefined.
zero (Z)
    Undefined.
sign (S)
    Undefined.
unordered (U)
    Undefined.

Simplification rules
^^^^^^^^^^^^^^^^^^^^

* Immediate values for the ``addr`` operand are truncated to 32 bits.
* Immediate values for the ``src`` operand are truncated to the access
  size.

.. _umlinst-writem:

WRITEM
~~~~~~

Write to an emulated address space with access mask specified.

+--------------------------------------+-------------------------------------------------------+
| Disassembly                          | Usage                                                 |
+======================================+=======================================================+
| .. code-block::                      | .. code-block:: C++                                   |
|                                      |                                                       |
|     writem  addr,src,mask,space_size |     UML_WRITEM(block, addr, src, mask, size, space);  |
|     dwritem addr,src,mask,space_size |     UML_DWRITEM(block, addr, src, mask, size, space); |
+--------------------------------------+-------------------------------------------------------+

Operands
^^^^^^^^

addr (32-bit – memory, integer register, immediate, map variable)
    The address to write to in the emulated address space.  Note that
    this is always a 32-bit operand, irrespective of the instruction
    size.
src (32-bit or 64-bit – memory, integer register, immediate, map variable)
    The value to write to the emulated address space.
mask (32-bit or 64-bit – memory, integer register, immediate, map variable)
    The access mask for the emulated memory access.
size (access size)
    The size of the emulated memory access.  Must be ``SIZE_BYTE``
    (8-bit), ``SIZE_WORD`` (16-bit), ``SIZE_DWORD`` (32-bit) or
    ``SIZE_QWORD`` (64-bit).  Note that this operand controls the size
    of the emulated memory access while the instruction size sets the
    size of the ``src`` and ``mask`` operands.
space (address space number)
    An integer identifying the address space to read from.  May be
    ``SPACE_PROGRAM``, ``SPACE_DATA``, ``SPACE_IO`` or ``SPACE_OPCODES``
    for one of the common CPU address spaces, or a non-negative integer
    cast to ``memory_space``.

Flags
^^^^^

carry (C)
    Undefined.
overflow (V)
    Undefined.
zero (Z)
    Undefined.
sign (S)
    Undefined.
unordered (U)
    Undefined.

Simplification rules
^^^^^^^^^^^^^^^^^^^^

* Immediate values for the ``addr`` operand are truncated to 32 bits.
* Immediate values for the ``src`` and ``mask`` operands are truncated
  to the access size.
* Converted to :ref:`WRITE <umlinst-read>` if the ``mask`` operand is an
  immediate value with all bits set.

.. _umlinst-fread:

FREAD
~~~~~

Read a floating point value from an emulated address space.  The binary
value will be preserved even if it is not a valid representation of a
floating point number.  The access mask is implied to have all bits set.

+---------------------------------+------------------------------------------+
| Disassembly                     | Usage                                    |
+=================================+==========================================+
| .. code-block::                 | .. code-block:: C++                      |
|                                 |                                          |
|     fsread  dst,addr,space_size |     UML_FSREAD(block, dst, addr, space); |
|     fdread  dst,addr,space_size |     UML_FDREAD(block, dst, addr, space); |
+---------------------------------+------------------------------------------+

Operands
^^^^^^^^

dst (32-bit or 64-bit – memory, floating point register)
    The destination where the value read from the emulated address space
    will be stored.
addr (32-bit – memory, integer register, immediate, map variable)
    The address to read from in the emulated address space.  Note that
    this is always a 32-bit operand, irrespective of the instruction
    size.
space (address space number)
    An integer identifying the address space to read from.  May be
    ``SPACE_PROGRAM``, ``SPACE_DATA``, ``SPACE_IO`` or ``SPACE_OPCODES``
    for one of the common CPU address spaces, or a non-negative integer
    cast to ``memory_space``.

Flags
^^^^^

carry (C)
    Undefined.
overflow (V)
    Undefined.
zero (Z)
    Undefined.
sign (S)
    Undefined.
unordered (U)
    Undefined.

Simplification rules
^^^^^^^^^^^^^^^^^^^^

* Immediate values for the ``addr`` operand are truncated to 32 bits.

.. _umlinst-fwrite:

FWRITE
~~~~~~

Write a floating point value to an emulated address space.  The binary
value will be preserved even if it is not a valid representation of a
floating point number.  The access mask is implied to have all bits set.

+---------------------------------+-------------------------------------------+
| Disassembly                     | Usage                                     |
+=================================+===========================================+
| .. code-block::                 | .. code-block:: C++                       |
|                                 |                                           |
|     fswrite addr,src,space_size |     UML_FSWRITE(block, addr, src, space); |
|     fdwrite addr,src,space_size |     UML_FDWRITE(block, addr, src, space); |
+---------------------------------+-------------------------------------------+

Operands
^^^^^^^^

addr (32-bit – memory, integer register, immediate, map variable)
    The address to write to in the emulated address space.  Note that
    this is always a 32-bit operand, irrespective of the instruction
    size.
src (32-bit or 64-bit – memory, floating point register)
    The value to write to the emulated address space.
    will be stored.
space (address space number)
    An integer identifying the address space to read from.  May be
    ``SPACE_PROGRAM``, ``SPACE_DATA``, ``SPACE_IO`` or ``SPACE_OPCODES``
    for one of the common CPU address spaces, or a non-negative integer
    cast to ``memory_space``.

Flags
^^^^^

carry (C)
    Undefined.
overflow (V)
    Undefined.
zero (Z)
    Undefined.
sign (S)
    Undefined.
unordered (U)
    Undefined.

Simplification rules
^^^^^^^^^^^^^^^^^^^^

* Immediate values for the ``addr`` operand are truncated to 32 bits.


.. _umlinst-intarith:

Integer arithmetic and logic
----------------------------

.. _umlinst-add:

ADD
~~~

Add two integers.

+---------------------------+---------------------------------------+
| Disassembly               | Usage                                 |
+===========================+=======================================+
| .. code-block::           | .. code-block:: C++                   |
|                           |                                       |
|     add     dst,src1,src2 |     UML_ADD(block, dst, src1, src2);  |
|     dadd    dst,src1,src2 |     UML_DADD(block, dst, src1, src2); |
+---------------------------+---------------------------------------+

Calculates ``dst = src1 + src2``.

Operands
^^^^^^^^

dst (32-bit or 64-bit – memory, integer register)
    The destination where the sum will be stored.
src1 (32-bit or 64-bit – memory, integer register, immediate, map variable)
    The first addend.
src2 (32-bit or 64-bit – memory, integer register, immediate, map variable)
    The second addend.

Flags
^^^^^

carry (C)
    Set in the case of arithmetic carry out of the most significant bit,
    or cleared otherwise (unsigned overflow).
overflow (V)
    Set in the case of signed two’s complement overflow, or cleared
    otherwise.
zero (Z)
    Set if the result is zero, or cleared otherwise.
sign (S)
    Set to the value of the most significant bit of the result (set if
    the result is a negative signed integer value, or cleared
    otherwise).
unordered (U)
    Undefined.

Simplification rules
^^^^^^^^^^^^^^^^^^^^

* Converted to :ref:`MOV <umlinst-mov>`, :ref:`AND <umlinst-and>` or
  :ref:`OR <umlinst-or>` if the ``src1`` and ``src2`` operands are both
  immediate values and the carry and overflow flags are not required.
* Converted to :ref:`MOV <umlinst-mov>` or :ref:`AND <umlinst-and>` if
  the ``src1`` operand or ``src2`` operand is the immediate value zero
  and the carry and overflow flags are not required.
* Immediate values for the ``src1`` and ``src2`` operands are truncated
  to the instruction size.
* If the ``src2`` and ``dst`` operands refer to the same register or
  memory location, the ``src1`` and ``src2`` operands are exchanged.
* If the ``src1`` operand is an immediate value and the ``src2`` operand
  is not an immediate value, the ``src1`` and ``src2`` operands are
  exchanged.

.. _umlinst-addc:

ADDC
~~~~

Add two integers and the carry flag.

+---------------------------+----------------------------------------+
| Disassembly               | Usage                                  |
+===========================+========================================+
| .. code-block::           | .. code-block:: C++                    |
|                           |                                        |
|     addc    dst,src1,src2 |     UML_ADDC(block, dst, src1, src2);  |
|     daddc   dst,src1,src2 |     UML_DADDC(block, dst, src1, src2); |
+---------------------------+----------------------------------------+

Calculates ``dst = src1 + src2 + C``.

Operands
^^^^^^^^

dst (32-bit or 64-bit – memory, integer register)
    The destination where the sum will be stored.
src1 (32-bit or 64-bit – memory, integer register, immediate, map variable)
    The first addend.
src2 (32-bit or 64-bit – memory, integer register, immediate, map variable)
    The second addend.

Flags
^^^^^

carry (C)
    Set in the case of arithmetic carry out of the most significant bit,
    or cleared otherwise (unsigned overflow).
overflow (V)
    Set in the case of signed two’s complement overflow, or cleared
    otherwise.
zero (Z)
    Set if the result is zero, or cleared otherwise.
sign (S)
    Set to the value of the most significant bit of the result (set if
    the result is a negative signed integer value, or cleared
    otherwise).
unordered (U)
    Undefined.

Simplification rules
^^^^^^^^^^^^^^^^^^^^

* Immediate values for the ``src1`` and ``src2`` operands are truncated
  to the instruction size.
* If the ``src2`` and ``dst`` operands refer to the same register or
  memory location, the ``src1`` and ``src2`` operands are exchanged.
* If the ``src1`` operand is an immediate value and the ``src2`` operand
  is not an immediate value, the ``src1`` and ``src2`` operands are
  exchanged.

.. _umlinst-sub:

SUB
~~~

Subtract an integer from another integer.

+---------------------------+---------------------------------------+
| Disassembly               | Usage                                 |
+===========================+=======================================+
| .. code-block::           | .. code-block:: C++                   |
|                           |                                       |
|     sub     dst,src1,src2 |     UML_SUB(block, dst, src1, src2);  |
|     dsub    dst,src1,src2 |     UML_DSUB(block, dst, src1, src2); |
+---------------------------+---------------------------------------+

Calculates ``dst = src1 - src2``.

Operands
^^^^^^^^

dst (32-bit or 64-bit – memory, integer register)
    The destination where the difference will be stored.
src1 (32-bit or 64-bit – memory, integer register, immediate, map variable)
    The minuend (the value to subtract from).
src2 (32-bit or 64-bit – memory, integer register, immediate, map variable)
    The subtrahend (the value to subtract).

Flags
^^^^^

carry (C)
    Set if the subtrahend is a larger unsigned value than the minuend,
    or cleared otherwise (unsigned overflow, or arithmetic borrow).
overflow (V)
    Set in the case of signed two’s complement overflow, or cleared
    otherwise.
zero (Z)
    Set if the result is zero, or cleared otherwise (set if the minuend
    and subtrahend are equal, or cleared otherwise).
sign (S)
    Set to the value of the most significant bit of the result (set if
    the result is a negative signed integer value, or cleared
    otherwise).
unordered (U)
    Undefined.

Simplification rules
^^^^^^^^^^^^^^^^^^^^

* Converted to :ref:`MOV <umlinst-mov>`, :ref:`AND <umlinst-and>` or
  :ref:`OR <umlinst-or>` if the ``src1`` and ``src2`` operands are both
  immediate values and the carry and overflow flags are not required.
* Converted to :ref:`MOV <umlinst-mov>` or :ref:`AND <umlinst-and>` if
  the ``src2`` operand is the immediate value zero and the carry and
  overflow flags are not required.
* Immediate values for the ``src1`` and ``src2`` operands are truncated
  to the instruction size.

.. _umlinst-subb:

SUBB
~~~~

Subtract an integer and the carry flag from another integer.

+---------------------------+----------------------------------------+
| Disassembly               | Usage                                  |
+===========================+========================================+
| .. code-block::           | .. code-block:: C++                    |
|                           |                                        |
|     subb    dst,src1,src2 |     UML_SUBB(block, dst, src1, src2);  |
|     dsubb   dst,src1,src2 |     UML_DSUBB(block, dst, src1, src2); |
+---------------------------+----------------------------------------+

Calculates ``dst = src1 - src2 - C``.

Operands
^^^^^^^^

dst (32-bit or 64-bit – memory, integer register)
    The destination where the difference will be stored.
src1 (32-bit or 64-bit – memory, integer register, immediate, map variable)
    The minuend (the value to subtract from).
src2 (32-bit or 64-bit – memory, integer register, immediate, map variable)
    The subtrahend (the value to subtract).

Flags
^^^^^

carry (C)
    Set if the subtrahend plus the carry flag is a larger unsigned value
    than the minuend, or cleared otherwise (unsigned overflow, or
    arithmetic borrow).
overflow (V)
    Set in the case of signed two’s complement overflow, or cleared
    otherwise.
zero (Z)
    Set if the result is zero, or cleared otherwise (set if the minuend
    is equal to the subtrahend plus the carry flag, or cleared
    otherwise).
sign (S)
    Set to the value of the most significant bit of the result (set if
    the result is a negative signed integer value, or cleared
    otherwise).
unordered (U)
    Undefined.

Simplification rules
^^^^^^^^^^^^^^^^^^^^

* Immediate values for the ``src1`` and ``src2`` operands are truncated
  to the instruction size.

.. _umlinst-cmp:

CMP
~~~

Compare two integers and set the flags as though they were subtracted.

+-----------------------+----------------------------------+
| Disassembly           | Usage                            |
+=======================+==================================+
| .. code-block::       | .. code-block:: C++              |
|                       |                                  |
|     cmp     src1,src2 |     UML_CMP(block, src1, src2);  |
|     dcmp    src1,src2 |     UML_DCMP(block, src1, src2); |
+-----------------------+----------------------------------+

Sets the flags based on calculating ``src1 - src2`` but discards the
result of the subtraction.

Operands
^^^^^^^^

src1 (32-bit or 64-bit – memory, integer register, immediate, map variable)
    The left-hand side value to compare, or the minuend (the value to
    subtract from).
src2 (32-bit or 64-bit – memory, integer register, immediate, map variable)
    The right-hand side value to compare, or the subtrahend (the value
    to subtract).

Flags
^^^^^

carry (C)
    Set if the unsigned value of the ``src1`` operand is smaller than
    the unsigned value of the ``src2`` operand, or cleared otherwise.
overflow (V)
    Set if subtracting the value of the ``src2`` operand from the value
    of the ``src1`` operand would result in two’s complement overflow,
    or cleared otherwise.
zero (Z)
    Set if the values of the ``src1`` and ``src2`` operands are equal,
    or cleared otherwise.
sign (S)
    Set to the value of the most significant bit of the result of
    subtracting the value of the ``src2`` operand from the value of the
    ``src1`` operand (set if the result would be a negative signed
    integer, or cleared otherwise).
unordered (U)
    Undefined.

Simplification rules
^^^^^^^^^^^^^^^^^^^^

* Converted to :ref:`NOP <umlinst-nop>` if no flags are required.
* Immediate values for the ``src1`` and ``src2`` operands are truncated
  to the instruction size.

.. _umlinst-and:

AND
~~~

Calculate the bitwise logical conjunction of two integers (result bits
will be set if the corresponding bits are set in both inputs).

+---------------------------+---------------------------------------+
| Disassembly               | Usage                                 |
+===========================+=======================================+
| .. code-block::           | .. code-block:: C++                   |
|                           |                                       |
|     and     dst,src1,src2 |     UML_AND(block, dst, src1, src2);  |
|     dand    dst,src1,src2 |     UML_DAND(block, dst, src1, src2); |
+---------------------------+---------------------------------------+

Calculates ``dst = src1 & src2``.

Operands
^^^^^^^^

dst (32-bit or 64-bit – memory, integer register)
    The destination where the logical conjunction will be stored.
src1 (32-bit or 64-bit – memory, integer register, immediate, map variable)
    The first input.
src2 (32-bit or 64-bit – memory, integer register, immediate, map variable)
    The second input.

Flags
^^^^^

carry (C)
    Undefined.
overflow (V)
    Undefined.
zero (Z)
    Set if the result is zero, or cleared otherwise.
sign (S)
    Set to the value of the most significant bit of the result (set if
    the result is a negative signed integer value, or cleared
    otherwise).
unordered (U)
    Undefined.

Simplification rules
^^^^^^^^^^^^^^^^^^^^

* Converted to :ref:`MOV <umlinst-mov>` if the ``src1`` and ``src2``
  operands refer to the same memory location or register, the ``src1``
  and ``src2`` operands are both immediate values or one of them is an
  immediate value with all bits set or no bits set and flags are not
  required.
* Converted to :ref:`OR <umlinst-or>` if the ``src1`` and ``src2``
  operands are both immediate values with all bits set and flags are
  required.
* Converted to :ref:`TEST <umlinst-test>` if the instruction size is
  64 bits or the ``dst`` operand refers to a memory location, one of the
  ``src1`` and ``src2`` operands refer to the same memory location or
  register as ``dst``, the other source operand refers to the same
  memory location or register or is an immediate value with all bits
  set, and flags are required.
* If the ``src1`` and ``src2`` operands are both immediate values, the
  conjunction is not zero and flags are required, ``src1`` is replaced
  with the conjunction and ``src2`` is set to an immediate value with
  all bits set.
* If the ``src1`` and ``src2`` operands are both immediate values and
  the conjunction is zero or either the ``src1`` or ``src2`` operand is
  the immediate value zero and flags are required, ``src1`` is set to
  refer to the same memory location or register as ``dst`` and ``src2``
  is set to the immediate value zero.
* Immediate values for the ``src1`` and ``src2`` operands are truncated
  to the instruction size.
* If the ``src2`` and ``dst`` operands refer to the same register or
  memory location, the ``src1`` and ``src2`` operands are exchanged.
* If the ``src1`` operand is an immediate value and the ``src2`` operand
  is not an immediate value, the ``src1`` and ``src2`` operands are
  exchanged.

.. _umlinst-test:

TEST
~~~~

Set the flags based on the bitwise logical conjunction of two integers.

+-----------------------+-----------------------------------+
| Disassembly           | Usage                             |
+=======================+===================================+
| .. code-block::       | .. code-block:: C++               |
|                       |                                   |
|     test    src1,src2 |     UML_TEST(block, src1, src2);  |
|     dtest   src1,src2 |     UML_DTEST(block, src1, src2); |
+-----------------------+-----------------------------------+

Sets the flags based on calculating ``src1 & src2`` but discards the
result of the conjunction.

Operands
^^^^^^^^

src1 (32-bit or 64-bit – memory, integer register, immediate, map variable)
    The first input.
src2 (32-bit or 64-bit – memory, integer register, immediate, map variable)
    The second input.

Flags
^^^^^

carry (C)
    Undefined.
overflow (V)
    Undefined.
zero (Z)
    Set if the result of the conjunction is zero, or cleared otherwise.
sign (S)
    Set if the most significant bit is set in both inputs, or cleared
    otherwise (set if the both inputs are negative signed integers, or
    cleared otherwise).
unordered (U)
    Undefined.

Simplification rules
^^^^^^^^^^^^^^^^^^^^

* Converted to :ref:`NOP <umlinst-nop>` if flags are not required.
* If the ``src1`` and ``src2`` operands are both immediate values and
  the bitwise logical conjunction is not zero, the ``src1`` operand is
  set to the conjunction and the ``src2`` operand is set to an immediate
  value with all bits set.
* If either of the ``src1`` and ``src2`` operands is the immediate value
  zero or the ``src1`` and ``src2`` operands are both immediate values
  and the bitwise logical conjunction is zero, the ``src1`` and ``src2``
  operands are both set to the immediate value zero.
* If the ``src1`` and ``src2`` operands refer to the same memory
  location or register, the ``src2`` operand is set to an immediate
  value with all bits set.  * Immediate values for the ``src1`` and
  ``src2`` operands are truncated to the instruction size.
* If the ``src1`` operand is an immediate value and the ``src2`` operand
  is not an immediate value, the ``src1`` and ``src2`` operands are
  exchanged.

.. _umlinst-or:

OR
~~

Calculate the bitwise logical inclusive disjunction of two integers (result bits
will be set if the corresponding bits are set in either input).

+---------------------------+--------------------------------------+
| Disassembly               | Usage                                |
+===========================+======================================+
| .. code-block::           | .. code-block:: C++                  |
|                           |                                      |
|     or      dst,src1,src2 |     UML_OR(block, dst, src1, src2);  |
|     dor     dst,src1,src2 |     UML_DOR(block, dst, src1, src2); |
+---------------------------+--------------------------------------+

Calculates ``dst = src1 | src2``.

Operands
^^^^^^^^

dst (32-bit or 64-bit – memory, integer register)
    The destination where the logical inclusive disjunction will be
    stored.
src1 (32-bit or 64-bit – memory, integer register, immediate, map variable)
    The first input.
src2 (32-bit or 64-bit – memory, integer register, immediate, map variable)
    The second input.

Flags
^^^^^

carry (C)
    Undefined.
overflow (V)
    Undefined.
zero (Z)
    Set if the result is zero, or cleared otherwise.
sign (S)
    Set to the value of the most significant bit of the result (set if
    the result is a negative signed integer value, or cleared
    otherwise).
unordered (U)
    Undefined.

Simplification rules
^^^^^^^^^^^^^^^^^^^^

* Converted to :ref:`MOV <umlinst-mov>` if the ``src1`` and ``src2``
  operands are both immediate values or one of the ``src1`` or ``src2``
  operands is an immediate value with all bits set and flags are not
  required.
* Converted to :ref:`AND <umlinst-and>` if the ``src1`` and ``src2``
  operands are both immediate values and the inclusive disjunction does
  not have all bits set and flags are required.
* Converted to :ref:`MOV <umlinst-mov>`, :ref:`AND <umlinst-and>` or
  :ref:`TEST <umlinst-test>` if the ``src1`` and ``src2`` operands refer
  to the same memory location or register or if one of the ``src1`` and
  ``src2`` operands is the immediate value zero.
* If one of the ``src1`` and ``src2`` operands is an immediate value
  with all bits set or the ``src1`` and ``src2`` operands are both
  immediate values and the inclusive disjunction has all bits set and
  flags are required, ``src1`` is set to refer to the same memory
  location or register as ``dst`` and ``src2`` is set to an immediate
  value with all bits set.
* Immediate values for the ``src1`` and ``src2`` operands are truncated
  to the instruction size.
* If the ``src2`` and ``dst`` operands refer to the same register or
  memory location, the ``src1`` and ``src2`` operands are exchanged.
* If the ``src1`` operand is an immediate value and the ``src2`` operand
  is not an immediate value, the ``src1`` and ``src2`` operands are
  exchanged.

.. _umlinst-xor:

XOR
~~~

Calculate the bitwise logical exclusive disjunction of two integers
(result bits will be set if the corresponding bit is set in one input
and unset in the other input).

+---------------------------+---------------------------------------+
| Disassembly               | Usage                                 |
+===========================+=======================================+
| .. code-block::           | .. code-block:: C++                   |
|                           |                                       |
|     xor     dst,src1,src2 |     UML_XOR(block, dst, src1, src2);  |
|     dxor    dst,src1,src2 |     UML_DXOR(block, dst, src1, src2); |
+---------------------------+---------------------------------------+

Calculates ``dst = src1 ^ src2``.

Operands
^^^^^^^^

dst (32-bit or 64-bit – memory, integer register)
    The destination where the logical exclusive disjunction will be
    stored.
src1 (32-bit or 64-bit – memory, integer register, immediate, map variable)
    The first input.
src2 (32-bit or 64-bit – memory, integer register, immediate, map variable)
    The second input.

Flags
^^^^^

carry (C)
    Undefined.
overflow (V)
    Undefined.
zero (Z)
    Set if the result is zero, or cleared otherwise.
sign (S)
    Set to the value of the most significant bit of the result (set if
    the result is a negative signed integer value, or cleared
    otherwise).
unordered (U)
    Undefined.

Simplification rules
^^^^^^^^^^^^^^^^^^^^

* Converted to :ref:`MOV <umlinst-mov>`, :ref:`AND <umlinst-and>`,
  :ref:`TEST <umlinst-test>` or :ref:`OR <umlinst-or>` if the ``src1``
  and ``src2`` operands are both immediate values, if one of the
  ``src1`` and ``src2`` operands is the immediate value zero or if the
  ``src1`` and ``src2`` operands refer to the same memory location or
  register.

.. _umlinst-sext:

SEXT
~~~~

Sign extend an integer value.

+--------------------------+---------------------------------------+
| Disassembly              | Usage                                 |
+==========================+=======================================+
| .. code-block::          | .. code-block::                       |
|                          |                                       |
|     sext    dst,src,size |     UML_SEXT(block, dst, src, size);  |
|     dsext   dst,src,size |     UML_DSEXT(block, dst, src, size); |
+--------------------------+---------------------------------------+

Sets ``dst`` to the value of ``src`` sign extended from the size
specified by the ``size`` operand to the instruction size.

Operands
^^^^^^^^

dst (32-bit or 64-bit – memory, integer register)
    The destination where the sign extended value will be stored.
src (32-bit or 64-bit – memory, integer register, immediate, map variable)
    The value to sign extend.
size (access size)
    The size of the value to sign extend.  Must be ``SIZE_BYTE``
    (8-bit), ``SIZE_WORD`` (16-bit) or ``SIZE_DWORD`` (32-bit).

Flags
^^^^^

carry (C)
    Undefined.
overflow (V)
    Undefined.
zero (Z)
    Set if the result is zero, or cleared otherwise.
sign (S)
    Set to the value of the most significant bit of the result (set if
    the result is a negative signed integer value, or cleared
    otherwise).
unordered (U)
    Undefined.

Simplification rules
^^^^^^^^^^^^^^^^^^^^

* Converted to :ref:`MOV <umlinst-mov>`, :ref:`AND <umlinst-and>` or
  :ref:`OR <umlinst-or>` if the ``src`` operand is an immediate value or
  if the ``size`` operand specifies a size no smaller than the
  instruction size.

.. _umlinst-lzcnt:

LZCNT
~~~~~

Count the number of contiguous left-aligned zero bits in an integer
(count leading zeroes).

+---------------------+----------------------------------+
| Disassembly         | Usage                            |
+=====================+==================================+
| .. code-block::     | .. code-block:: C++              |
|                     |                                  |
|     lzcnt   dst,src |     UML_LZCNT(block, dst, src);  |
|     dlzcnt  dst,src |     UML_DLZCNT(block, dst, src); |
+---------------------+----------------------------------+

Operands
^^^^^^^^

dst (32-bit or 64-bit – memory, integer register)
    The destination where the result will be stored.
src (32-bit or 64-bit – memory, integer register, immediate, map variable)
    The input value in which to count left-aligned zero bits.

Flags
^^^^^

carry (C)
    Undefined.
overflow (V)
    Undefined.
zero (Z)
    Set if the result is zero, or cleared otherwise (set to the most
    significant bit of the input).
sign (S)
    Undefined.
unordered (U)
    Undefined.

Simplification rules
^^^^^^^^^^^^^^^^^^^^

* Converted to :ref:`MOV <umlinst-mov>` or :ref:`AND <umlinst-and>` if
  the ``src`` operand is an immediate value.

.. _umlinst-tzcnt:

TZCNT
~~~~~

Count the number of contiguous right-aligned zero bits in an integer
(count trailing zeroes).

+---------------------+----------------------------------+
| Disassembly         | Usage                            |
+=====================+==================================+
| .. code-block::     | .. code-block:: C++              |
|                     |                                  |
|     tzcnt   dst,src |     UML_TZCNT(block, dst, src);  |
|     dtzcnt  dst,src |     UML_DTZCNT(block, dst, src); |
+---------------------+----------------------------------+

Operands
^^^^^^^^

dst (32-bit or 64-bit – memory, integer register)
    The destination where the result will be stored.
src (32-bit or 64-bit – memory, integer register, immediate, map variable)
    The input value in which to count right-aligned zero bits.

Flags
^^^^^

carry (C)
    Undefined.
overflow (V)
    Undefined.
zero (Z)
    Set if the result is zero, or cleared otherwise (set to the least
    significant bit of the input).
sign (S)
    Undefined.
unordered (U)
    Undefined.

Simplification rules
^^^^^^^^^^^^^^^^^^^^

* Converted to :ref:`MOV <umlinst-mov>` or :ref:`AND <umlinst-and>` if
  the ``src`` operand is an immediate value.


.. _umlinst-intmuldiv:

Integer multiply and divide
---------------------------

.. _umlinst-mullw:

MULLW
~~~~~

Multiply two integer values.

+---------------------------+------------------------------------------+
| Disassembly               | Usage                                    |
+===========================+==========================================+
| .. code-block::           | .. code-block:: C++                      |
|                           |                                          |
|     mululw  dst,src1,src2 |     UML_MULULW(block, dst, src1, src2);  |
|     mulslw  dst,src1,src2 |     UML_MULSLW(block, dst, src1, src2);  |
|     dmululw dst,src1,src2 |     UML_DMULULW(block, dst, src1, src2); |
|     dmulslw dst,src1,src2 |     UML_DMULSLW(block, dst, src1, src2); |
+---------------------------+------------------------------------------+

Calculates ``dst = src1 * src2`` producing a result the same size as the
inputs.  MULULW and DMULULW take unsigned integer values as inputs and
produce an unsigned integer value as a result, while MULSLW and DMULSLW
take signed integer values as inputs and produce a signed integer value
as a result.  Note that the distinction between signed and unsigned
values only affects the calculation of the overflow flag for this
instruction.  It does not affect the result of the multiplication.

Operands
^^^^^^^^

dst (32-bit or 64-bit – memory, integer register)
    The destination where the product will be stored.
src1 (32-bit or 64-bit – memory, integer register, immediate, map variable)
    The multiplicand (the value to multiply).
src2 (32-bit or 64-bit – memory, integer register, immediate, map variable)
    The multiplier (the value to multiply by).

Flags
^^^^^

carry (C)
    Undefined.
overflow (V)
    Set if the full result of the multiplication cannot be represented
    within the instruction size.
zero (Z)
    Set if the result is zero, or cleared otherwise.  Note that this is
    based on the possibly truncated result value, not the full result of
    the multiplication.
sign (S)
    Set to the value of the most significant bit of the result (set if
    the result is a negative signed integer value, or cleared
    otherwise).  Note that this is based on the possibly truncated
    result value, not the full result of the multiplication.
unordered (U)
    Undefined.

Simplification rules
^^^^^^^^^^^^^^^^^^^^

* Converted to :ref:`MOV <umlinst-mov>`, :ref:`AND <umlinst-and>` or
  :ref:`OR <umlinst-or>` if the ``src1`` and ``src2`` operands are both
  immediate values or either the ``src1`` or ``src2`` operand is the
  immediate value zero or one and the overflow flag is not required.
* Immediate values for the ``src1`` and ``src2`` operands are truncated
  to the instruction size.
* If the ``src2`` and ``dst`` operands refer to the same register or
  memory location, the ``src1`` and ``src2`` operands are exchanged.
* If the ``src1`` operand is an immediate value and the ``src2`` operand
  is not an immediate value, the ``src1`` and ``src2`` operands are
  exchanged.

.. _umlinst-mul:

MUL
~~~

Multiply two integer values, possibly producing an extended result.

+--------------------------------+----------------------------------------------+
| Disassembly                    | Usage                                        |
+================================+==============================================+
| .. code-block::                | .. code-block:: C++                          |
|                                |                                              |
|     mulu    dst,edst,src1,src2 |     UML_MULU(block, dst, edst, src1, src2);  |
|     muls    dst,edst,src1,src2 |     UML_MULS(block, dst, edst, src1, src2);  |
|     dmulu   dst,edst,src1,src2 |     UML_DMULU(block, dst, edst, src1, src2); |
|     dmuls   dst,edst,src1,src2 |     UML_DMULS(block, dst, edst, src1, src2); |
+--------------------------------+----------------------------------------------+

Calculates ``edst:dst = src1 * src2`` if the ``dst`` and ``edst``
operands do not refer to the same register or memory location, or
``dst = src1 * src2`` if the ``dst`` and ``edst`` operands refer to the
same register or memory location.  MULU and DMULU take unsigned integer
values as inputs and produce an unsigned integer value as a result,
while MULS and DMULS take signed integer values as inputs and produce a
signed integer value as a result.

Operands
^^^^^^^^

dst (32-bit or 64-bit – memory, integer register)
    The destination where the least significant half of the full product
    will be stored.
edst (32-bit or 64-bit – memory, integer register)
    The destination where the most significant half of the full product
    will be stored if this operand does not refer to the same memory
    location or register as the ``dst`` operand.  If this operand refers
    to the same memory location or register as the ``dst`` operand, the
    most significant half of the full product will be discarded,
    producing a result the same size as the inputs.
src1 (32-bit or 64-bit – memory, integer register, immediate, map variable)
    The multiplicand (the value to multiply).
src2 (32-bit or 64-bit – memory, integer register, immediate, map variable)
    The multiplier (the value to multiply by).

Flags
^^^^^

carry (C)
    Undefined.
overflow (V)
    Set if the full result of the multiplication cannot be represented
    within the instruction size.
zero (Z)
    Set if the full result of the multiplication is zero, or cleared
    otherwise.  Note that this is based on the full result of the
    multiplication even when the ``dst`` and ``edst`` operands refer to
    the same memory location or register, causing the result to be
    truncated.
sign (S)
    Set to the value of the most significant bit of the full result of
    the multiplication (set if the result is a negative signed integer
    value, or cleared otherwise).  Note that this is based on the full
    result of the multiplication even when the ``dst`` and ``edst``
    operands refer to the same memory location or register, causing the
    result to be truncated.
unordered (U)
    Undefined.

Simplification rules
^^^^^^^^^^^^^^^^^^^^

* Converted to :ref:`MULLW <umlinst-mullw>` if the ``dst`` and ``edst``
  operands refer to the same memory location or register and the zero
  and sign flags are not required.
* Converted to :ref:`MOV <umlinst-mov>`, :ref:`AND <umlinst-and>` or
  :ref:`OR <umlinst-or>` if the ``dst`` and ``edst`` operands refer to
  the same memory location or register, the ``src1`` and ``src2``
  operands are both immediate values or either the ``src1`` or ``src2``
  operand is the immediate value zero, the most significant half of the
  full result of the multiplication is the sign extension of the least
  significant half or the sign flag is not required, and the overflow
  flag is not required.
* Converted to :ref:`MOV <umlinst-mov>` or :ref:`AND <umlinst-and>` if
  the ``dst`` and ``edst`` operands refer to the same memory location or
  register, either the ``src1`` or ``src2`` operand is the immediate
  value one, signed multiplication is being performed or the sign flag
  is not required, and the overflow flag is not required.
* Immediate values for the ``src1`` and ``src2`` operands are truncated
  to the instruction size.
* If the ``src1`` operand is an immediate value and the ``src2`` operand
  is not an immediate value, the ``src1`` and ``src2`` operands are
  exchanged.

.. _umlinst-div:

DIV
~~~

Divide an integer value by another integer value.

+--------------------------------+----------------------------------------------+
| Disassembly                    | Usage                                        |
+================================+==============================================+
| .. code-block::                | .. code-block:: C++                          |
|                                |                                              |
|     divu    dst,edst,src1,src2 |     UML_DIVU(block, dst, edst, src1, src2);  |
|     divs    dst,edst,src1,src2 |     UML_DIVS(block, dst, edst, src1, src2);  |
|     ddivu   dst,edst,src1,src2 |     UML_DDIVU(block, dst, edst, src1, src2); |
|     ddivs   dst,edst,src1,src2 |     UML_DDIVS(block, dst, edst, src1, src2); |
+--------------------------------+----------------------------------------------+

If the value of ``src2`` is not zero, the value of ``src1`` is divided
by the value of ``src2``, the quotient is stored in the memory location
or register referred to by ``dst``, and the remainder is stored in the
memory location or register referred to by ``edst`` if the ``dst`` and
``edst`` operands do not refer to the same memory location or register.

If the value of ``src2`` is zero, the overflow flag is set and the
values of the memory locations or registers referred to by the ``dst``
and ``edst`` operands are undefined.

DIVU and DDIVU take unsigned integer values as inputs and produce
unsigned integer values as results, while DIVS and DDIVS take signed
integer values as inputs and produce signed integer values as results.

Operands
^^^^^^^^

dst (32-bit or 64-bit – memory, integer register)
    The destination where the quotient will be stored if the value of
    the ``src2`` operand is not zero.
edst (32-bit or 64-bit – memory, integer register)
    The destination where the value of the remainder will be stored if
    this operand does not refer to the same memory location or register
    as the ``dst`` operand and the value of the ``src2`` operand is not
    zero.
src1 (32-bit or 64-bit – memory, integer register, immediate, map variable)
    The dividend (the value to divide).
src2 (32-bit or 64-bit – memory, integer register, immediate, map variable)
    The divisor (the value to divide by).

Flags
^^^^^

carry (C)
    Undefined.
overflow (V)
    Set if the divisor (the value of the ``src2`` operand) is zero, or
    cleared otherwise.
zero (Z)
    Set if the divisor (the value of the ``src2`` operand) is not zero
    and the quotient is zero, or cleared otherwise.
sign (S)
    Set to the most significant bit of the quotient (set if the quotient
    is a negative signed integer value) if the divisor (the value of the
    ``src2`` operand) is not zero, or cleared otherwise.
unordered (U)
    Undefined.

Simplification rules
^^^^^^^^^^^^^^^^^^^^

* Converted to :ref:`MOV <umlinst-mov>`, :ref:`AND <umlinst-and>` or
  :ref:`OR <umlinst-or>` if the ``dst`` and ``edst`` operands refer to
  the same memory location or register, the ``src1`` and ``src2``
  operands are both immediate values or ``src2`` operand is the
  immediate value one, the ``src2`` operand is not the immediate value
  zero, and the overflow flag is not required.
* Immediate values for the ``src1`` and ``src2`` operands are truncated
  to the instruction size.


.. _umlinst-intshift:

Integer shift and rotate
------------------------

.. _umlinst-shl:

SHL
~~~

Shift an integer value to the left (toward the most significant bit
position), shifting zeroes into the least significant bit.

+---------------------------+---------------------------------------+
| Disassembly               | Usage                                 |
+===========================+=======================================+
| .. code-block::           | .. code-block::                       |
|                           |                                       |
|     shl     dst,src,count |     UML_SHL(block, dst, src, count);  |
|     dshl    dst,src,count |     UML_DSHL(block, dst, src, count); |
+---------------------------+---------------------------------------+

Sets ``dst`` to the value of ``src`` shifted left by ``count`` bit
positions modulo the operand size in bits.  Zeroes are shifted into the
least significant bit position.

Operands
^^^^^^^^

dst (32-bit or 64-bit – memory, integer register)
    The destination where the shifted value will be stored.
src (32-bit or 64-bit – memory, integer register, immediate, map variable)
    The value to shift.
count (32-bit or 64-bit – memory, integer register, immediate, map variable)
    The number of bit positions to shift by.  Only the least significant
    five bits or six bits of this operand are used, depending on the
    instruction size.

Flags
^^^^^

carry (C)
    Set to the value of the last bit shifted out of the most significant
    bit position if the shift count modulo the operand size in bits is
    non-zero, or cleared if the shift count modulo the operand size in
    bits is zero.
overflow (V)
    Undefined.
zero (Z)
    Set if the result is zero, or cleared otherwise.
sign (S)
    Set to the value of the most significant bit of the result (set if
    the result is a negative signed integer value, or cleared
    otherwise).
unordered (U)
    Undefined.

Simplification rules
^^^^^^^^^^^^^^^^^^^^

* Converted to :ref:`MOV <umlinst-mov>`, :ref:`AND <umlinst-and>` or
  :ref:`OR <umlinst-or>` if the ``src`` and ``count`` operands are both
  immediate values or the ``count`` operand is the immediate value zero
  and the carry flag is not required.
* Immediate values for the ``src`` operand are truncated to the
  instruction size.
* Immediate values for the ``count`` operand are truncated to five or
  six bits for 32-bit or 64-bit operands, respectively.

.. _umlinst-shr:

SHR
~~~

Shift an integer value to the right (toward the least significant bit
position), shifting zeroes into the most significant bit.

+---------------------------+---------------------------------------+
| Disassembly               | Usage                                 |
+===========================+=======================================+
| .. code-block::           | .. code-block::                       |
|                           |                                       |
|     shr     dst,src,count |     UML_SHR(block, dst, src, count);  |
|     dshr    dst,src,count |     UML_DSHR(block, dst, src, count); |
+---------------------------+---------------------------------------+

Sets ``dst`` to the value of ``src`` shifted right by ``count`` bit
positions modulo the operand size in bits.  Zeroes are shifted into the
most significant bit position.

Operands
^^^^^^^^

dst (32-bit or 64-bit – memory, integer register)
    The destination where the shifted value will be stored.
src (32-bit or 64-bit – memory, integer register, immediate, map variable)
    The value to shift.
count (32-bit or 64-bit – memory, integer register, immediate, map variable)
    The number of bit positions to shift by.  Only the least significant
    five bits or six bits of this operand are used, depending on the
    instruction size.

Flags
^^^^^

carry (C)
    Set to the value of the last bit shifted out of the least
    significant bit position if the shift count modulo the operand size
    in bits is non-zero, or cleared if the shift count modulo the
    operand size in bits is zero.
overflow (V)
    Undefined.
zero (Z)
    Set if the result is zero, or cleared otherwise.
sign (S)
    Set to the value of the most significant bit of the result (set if
    the result is a negative signed integer value, or cleared
    otherwise).
unordered (U)
    Undefined.

Simplification rules
^^^^^^^^^^^^^^^^^^^^

* Converted to :ref:`MOV <umlinst-mov>`, :ref:`AND <umlinst-and>` or
  :ref:`OR <umlinst-or>` if the ``src`` and ``count`` operands are both
  immediate values or the ``count`` operand is the immediate value zero
  and the carry flag is not required.
* Immediate values for the ``src`` operand are truncated to the
  instruction size.
* Immediate values for the ``count`` operand are truncated to five or
  six bits for 32-bit or 64-bit operands, respectively.

.. _umlinst-sar:

SAR
~~~

Shift an integer value to the right (toward the least significant bit
position), preserving the value of the most significant bit.

+---------------------------+---------------------------------------+
| Disassembly               | Usage                                 |
+===========================+=======================================+
| .. code-block::           | .. code-block::                       |
|                           |                                       |
|     sar     dst,src,count |     UML_SAR(block, dst, src, count);  |
|     dsar    dst,src,count |     UML_DSAR(block, dst, src, count); |
+---------------------------+---------------------------------------+

Sets ``dst`` to the value of ``src`` shifted right by ``count`` bit
positions modulo the operand size in bits.  The value of the most
significant bit is preserved  after each shift step.

Operands
^^^^^^^^

dst (32-bit or 64-bit – memory, integer register)
    The destination where the shifted value will be stored.
src (32-bit or 64-bit – memory, integer register, immediate, map variable)
    The value to shift.
count (32-bit or 64-bit – memory, integer register, immediate, map variable)
    The number of bit positions to shift by.  Only the least significant
    five bits or six bits of this operand are used, depending on the
    instruction size.

Flags
^^^^^

carry (C)
    Set to the value of the last bit shifted out of the least
    significant bit position if the shift count modulo the operand size
    in bits is non-zero, or cleared if the shift count modulo the
    operand size in bits is zero.
overflow (V)
    Undefined.
zero (Z)
    Set if the result is zero, or cleared otherwise.
sign (S)
    Set to the value of the most significant bit of the result (set if
    the result is a negative signed integer value, or cleared
    otherwise).
unordered (U)
    Undefined.

Simplification rules
^^^^^^^^^^^^^^^^^^^^

* Converted to :ref:`MOV <umlinst-mov>`, :ref:`AND <umlinst-and>` or
  :ref:`OR <umlinst-or>` if the ``src`` and ``count`` operands are both
  immediate values or the ``count`` operand is the immediate value zero
  and the carry flag is not required.
* Immediate values for the ``src`` operand are truncated to the
  instruction size.
* Immediate values for the ``count`` operand are truncated to five or
  six bits for 32-bit or 64-bit operands, respectively.

.. _umlinst-rol:

ROL
~~~

Rotate an integer value to the left (toward the most significant bit
position).  Bits shifted out of the most significant bit position are
shifted into the least significant bit position.

+---------------------------+---------------------------------------+
| Disassembly               | Usage                                 |
+===========================+=======================================+
| .. code-block::           | .. code-block::                       |
|                           |                                       |
|     rol     dst,src,count |     UML_ROL(block, dst, src, count);  |
|     drol    dst,src,count |     UML_DROL(block, dst, src, count); |
+---------------------------+---------------------------------------+

Sets ``dst`` to the value of ``src`` rotated left by ``count`` bit
positions.

Operands
^^^^^^^^

dst (32-bit or 64-bit – memory, integer register)
    The destination where the rotated value will be stored.
src (32-bit or 64-bit – memory, integer register, immediate, map variable)
    The value to rotated.
count (32-bit or 64-bit – memory, integer register, immediate, map variable)
    The number of bit positions to rotate by.  Only the least
    significant five bits or six bits of this operand are used,
    depending on the instruction size.

Flags
^^^^^

carry (C)
    Set to the value of the last bit rotated out of the most significant
    bit position (set to the least significant bit of the result) if the
    shift count modulo the operand size in bits is non-zero, or cleared
    if the shift count modulo the operand size in bits is zero.
overflow (V)
    Undefined.
zero (Z)
    Set if the result is zero, or cleared otherwise.
sign (S)
    Set to the value of the most significant bit of the result (set if
    the result is a negative signed integer value, or cleared
    otherwise).
unordered (U)
    Undefined.

Simplification rules
^^^^^^^^^^^^^^^^^^^^

* Converted to :ref:`MOV <umlinst-mov>`, :ref:`AND <umlinst-and>` or
  :ref:`OR <umlinst-or>` if the ``src`` and ``count`` operands are both
  immediate values or the ``count`` operand is the immediate value zero
  and the carry flag is not required.
* Immediate values for the ``src`` operand are truncated to the
  instruction size.
* Immediate values for the ``count`` operand are truncated to five or
  six bits for 32-bit or 64-bit operands, respectively.

.. _umlinst-ror:

ROR
~~~

Rotate an integer value to the right (toward the least significant bit
position).  Bits shifted out of the least significant bit position are
shifted into the most significant bit position.

+---------------------------+---------------------------------------+
| Disassembly               | Usage                                 |
+===========================+=======================================+
| .. code-block::           | .. code-block::                       |
|                           |                                       |
|     ror     dst,src,count |     UML_ROR(block, dst, src, count);  |
|     dror    dst,src,count |     UML_DROR(block, dst, src, count); |
+---------------------------+---------------------------------------+

Sets ``dst`` to the value of ``src`` rotated right by ``count`` bit
positions.

Operands
^^^^^^^^

dst (32-bit or 64-bit – memory, integer register)
    The destination where the rotated value will be stored.
src (32-bit or 64-bit – memory, integer register, immediate, map variable)
    The value to rotated.
count (32-bit or 64-bit – memory, integer register, immediate, map variable)
    The number of bit positions to rotate by.  Only the least
    significant five bits or six bits of this operand are used,
    depending on the instruction size.

Flags
^^^^^

carry (C)
    Set to the value of the last bit rotated out of the least
    significant bit position (set to the most significant bit of the
    result) if the shift count modulo the operand size in bits is
    non-zero, or cleared if the shift count modulo the operand size in
    bits is zero.
overflow (V)
    Undefined.
zero (Z)
    Set if the result is zero, or cleared otherwise.
sign (S)
    Set to the value of the most significant bit of the result (set if
    the result is a negative signed integer value, or cleared
    otherwise).
unordered (U)
    Undefined.

Simplification rules
^^^^^^^^^^^^^^^^^^^^

* Converted to :ref:`MOV <umlinst-mov>`, :ref:`AND <umlinst-and>` or
  :ref:`OR <umlinst-or>` if the ``src`` and ``count`` operands are both
  immediate values or the ``count`` operand is the immediate value zero
  and the carry flag is not required.
* Immediate values for the ``src`` operand are truncated to the
  instruction size.
* Immediate values for the ``count`` operand are truncated to five or
  six bits for 32-bit or 64-bit operands, respectively.

.. _umlinst-rolc:

ROLC
~~~~

Rotate an integer value concatenated with the carry flag to the left
(toward the most significant bit position).  For each step, the carry
flag is shifted into the least significant bit position, and the carry
flag is set to the bit shifted out of the most significant bit position.

+---------------------------+----------------------------------------+
| Disassembly               | Usage                                  |
+===========================+========================================+
| .. code-block::           | .. code-block::                        |
|                           |                                        |
|     rolc    dst,src,count |     UML_ROLC(block, dst, src, count);  |
|     drolc   dst,src,count |     UML_DROLC(block, dst, src, count); |
+---------------------------+----------------------------------------+

Sets ``dst`` to the value of ``src`` concatenated with the carry flag
rotated left by ``count`` bit positions modulo the operand size in bits.
For each shift step, the current value of the carry flag is shifted into
the least significant bit position, and the carry flag is set to the
value of the bit shifted out of the most significant bit position.

Note that although this instruction rotates a 33-bit or 65-bit value
(including the carry flag), the shift count is interpreted modulo 32 or
64.

Operands
^^^^^^^^

dst (32-bit or 64-bit – memory, integer register)
    The destination where the rotated value will be stored.
src (32-bit or 64-bit – memory, integer register, immediate, map variable)
    The value to rotated.
count (32-bit or 64-bit – memory, integer register, immediate, map variable)
    The number of bit positions to rotate by.  Only the least
    significant five bits or six bits of this operand are used,
    depending on the instruction size.

Flags
^^^^^

carry (C)
    Set to the value of the last bit shifted out of the least
    significant bit position if the shift count modulo the operand size
    in bits is non-zero, or unchanged if the shift count modulo the
    operand size in bits is zero.
overflow (V)
    Undefined.
zero (Z)
    Set if the result is zero, or cleared otherwise.
sign (S)
    Set to the value of the most significant bit of the result (set if
    the result is a negative signed integer value, or cleared
    otherwise).
unordered (U)
    Undefined.

Simplification rules
^^^^^^^^^^^^^^^^^^^^

* Converted to :ref:`MOV <umlinst-mov>` or :ref:`NOP <umlinst-nop>` if
  the ``count`` operand is the immediate value zero and the zero and
  sign flags are not required.
* Immediate values for the ``src`` operand are truncated to the
  instruction size.
* Immediate values for the ``count`` operand are truncated to five or
  six bits for 32-bit or 64-bit operands, respectively.

.. _umlinst-rorc:

RORC
~~~~

Rotate an integer value concatenated with the carry flag to the right
(toward the least significant bit position).  For each step, the carry
flag is shifted into the most significant bit position, and the carry
flag is set to the bit shifted out of the least significant bit
position.

+---------------------------+----------------------------------------+
| Disassembly               | Usage                                  |
+===========================+========================================+
| .. code-block::           | .. code-block::                        |
|                           |                                        |
|     rorc    dst,src,count |     UML_RORC(block, dst, src, count);  |
|     drorc   dst,src,count |     UML_DRORC(block, dst, src, count); |
+---------------------------+----------------------------------------+

Sets ``dst`` to the value of ``src`` concatenated with the carry flag
rotated right by ``count`` bit positions modulo the operand size in
bits.  For each shift step, the current value of the carry flag is
shifted into the most significant bit position, and the carry flag is
set to the value of the bit shifted out of the least significant bit
position.

Note that although this instruction rotates a 33-bit or 65-bit value
(including the carry flag), the shift count is interpreted modulo 32 or
64.

Operands
^^^^^^^^

dst (32-bit or 64-bit – memory, integer register)
    The destination where the rotated value will be stored.
src (32-bit or 64-bit – memory, integer register, immediate, map variable)
    The value to rotated.
count (32-bit or 64-bit – memory, integer register, immediate, map variable)
    The number of bit positions to rotate by.  Only the least
    significant five bits or six bits of this operand are used,
    depending on the instruction size.

Flags
^^^^^

carry (C)
    Set to the value of the last bit shifted out of the least
    significant bit position if the shift count modulo the operand size
    in bits is non-zero, or unchanged if the shift count modulo the
    operand size in bits is zero.
overflow (V)
    Undefined.
zero (Z)
    Set if the result is zero, or cleared otherwise.
sign (S)
    Set to the value of the most significant bit of the result (set if
    the result is a negative signed integer value, or cleared
    otherwise).
unordered (U)
    Undefined.

Simplification rules
^^^^^^^^^^^^^^^^^^^^

* Converted to :ref:`MOV <umlinst-mov>` or :ref:`NOP <umlinst-nop>` if
  the ``count`` operand is the immediate value zero and the zero and
  sign flags are not required.
* Immediate values for the ``src`` operand are truncated to the
  instruction size.
* Immediate values for the ``count`` operand are truncated to five or
  six bits for 32-bit or 64-bit operands, respectively.

.. _umlinst-bfx:

BFX
~~~

Extract a contiguous bit field from an integer value.

+---------------------------------+-----------------------------------------------+
| Disassembly                     | Usage                                         |
+=================================+===============================================+
| .. code-block::                 | .. code-block:: C++                           |
|                                 |                                               |
|     bfxu    dst,src,shift,width |     UML_BFXU(block, dst, src, shift, width);  |
|     bfxs    dst,src,shift,width |     UML_BFXS(block, dst, src, shift, width);  |
|     dbfxu   dst,src,shift,width |     UML_DBFXU(block, dst, src, shift, width); |
|     dbfxs   dst,src,shift,width |     UML_DBFXS(block, dst, src, shift, width); |
+---------------------------------+-----------------------------------------------+

Extracts and right-aligns a contiguous bit field from the value of
``src``, specified by its least significant bit position and width in
bits.  The field must be narrower than the ``src`` operand, but it may
wrap around from the most significant bit position to the least
significant bit position.  BFXU and DBFXU zero-extend an unsigned field,
while BFXS and DBFXS sign-extend a signed field.

Back-ends may be able to optimise some forms of this instruction for
example when the ``shift`` and ``width`` operands are both immediate
values.

Operands
^^^^^^^^

dst (32-bit or 64-bit – memory, integer register)
    The destination where the extracted field will be stored.
src (32-bit or 64-bit – memory, integer register, immediate, map variable)
    The value to extract a contiguous bit field from.
shift (32-bit or 64-bit – memory, integer register, immediate, map variable)
    The position of the least significant bit of the field to extract,
    where zero is the least significant bit position, and bit numbers
    increase toward the most significant bit position.  Only the least
    significant five bits or six bits of this operand are used,
    depending on the instruction size.
width (32-bit or 64-bit – memory, integer register, immediate, map variable)
    The width of the field to extract in bits.  Only the least
    significant five bits or six bits of this operand are used,
    depending on the instruction size.  The result is undefined if the
    width modulo the instruction size in bits is zero.

Flags
^^^^^

carry (C)
    Undefined.
overflow (V)
    Undefined.
zero (Z)
    Set if the result is zero, or cleared otherwise.
sign (S)
    Set to the value of the most significant bit of the result (set if
    the result is a negative signed integer value, or cleared
    otherwise).
unordered (U)
    Undefined.

Simplification rules
^^^^^^^^^^^^^^^^^^^^

* Converted to :ref:`MOV <umlinst-mov>`, :ref:`AND <umlinst-and>` or
  :ref:`OR <umlinst-or>` if the ``src``, ``shift`` and ``width``
  operands are all immediate values, or if the ``width`` operand is the
  immediate value zero.
* Converted to :ref:`SHR <umlinst-shr>` or :ref:`SAR <umlinst-sar>` if
  the ``src`` operand is not an immediate value, the ``shift`` and
  ``width`` operands are both immediate values, and the sum of the value
  of the ``shift`` operand and the value of the ``width`` operand is
  equal to the instruction size in bits.
* BFXU and DBFXU are converted to :ref:`AND <umlinst-and>` if the
  ``shift`` operand is the immediate value zero and ``width`` operand is
  an immediate value.
* BFXS and DBFXS are converted to :ref:`SEXT <umlinst-sext>` if the
  ``shift`` operand is the immediate value zero and ``width`` operand is
  the immediate value 8, 16 or 32.
* Immediate values for the ``src`` operand are truncated to the
  instruction size.
* Immediate values for the ``shift`` and ``width`` operands are
  truncated to five or six bits for 32-bit or 64-bit operands,
  respectively.

.. _umlinst-roland:

ROLAND
~~~~~~

Rotate and mask an integer value.  The value is rotated to the left
(toward the most significant bit position).  Bits shifted out of the
most significant bit position are shifted into the least significant bit
position.  The logical conjunction of the rotated value and the mask is
then calculated.

+--------------------------------+------------------------------------------------+
| Disassembly                    | Usage                                          |
+================================+================================================+
| .. code-block::                | .. code-block:: C++                            |
|                                |                                                |
|     roland  dst,src,count,mask |     UML_ROLAND(block, dst, src, count, mask);  |
|     droland dst,src,count,mask |     UML_DROLAND(block, dst, src, count, mask); |
+--------------------------------+------------------------------------------------+

Sets ``dst`` to the logical conjunction of the value of ``src`` rotated
left by ``count`` bit positions and the value of ``mask``.

Back-ends may be able to optimise some forms of this instruction, for
example when the ``count`` and ``mask`` operands are both immediate
values.

Operands
^^^^^^^^

dst (32-bit or 64-bit – memory, integer register)
    The destination where the rotated and masked value will be stored.
src (32-bit or 64-bit – memory, integer register, immediate, map variable)
    The value to be rotated and masked.
count (32-bit or 64-bit – memory, integer register, immediate, map variable)
    The number of bit positions to rotate by.  Only the least
    significant five bits or six bits of this operand are used,
    depending on the instruction size.
mask (32-bit or 64-bit – memory, integer register, immediate, map variable)
    The mask to calculate the logical conjunction with.

Flags
^^^^^

carry (C)
    Undefined.
overflow (V)
    Undefined.
zero (Z)
    Set if the result is zero, or cleared otherwise.
sign (S)
    Set to the value of the most significant bit of the result (set if
    the result is a negative signed integer value, or cleared
    otherwise).
unordered (U)
    Undefined.

Simplification rules
^^^^^^^^^^^^^^^^^^^^

* Converted to :ref:`MOV <umlinst-mov>`, :ref:`AND <umlinst-and>` or
  :ref:`OR <umlinst-or>` if the ``src`` and ``count`` operands are both
  immediate values, or if either the ``count`` or ``mask`` operand is
  the immediate value zero.
* Converted to :ref:`ROL <umlinst-rol>` if the ``mask`` operand is an
  immediate value with all bits set.
* Converted to :ref:`SHL <umlinst-shl>` if the ``count`` operand is an
  immediate value and the ``mask`` operand is an immediate value
  containing a single contiguous left-aligned sequence of set bits of
  the appropriate length for the value of the ``count`` operand.
* Converted to :ref:`SHR <umlinst-shr>` or :ref:`BFX <umlinst-bfx>` if
  the ``count`` operand is an immediate value and the ``mask`` operand
  is an immediate value containing a single contiguous right-aligned
  sequence of set bits.
* Immediate values for the ``src`` and ``mask`` operands are truncated
  to the instruction size.
* Immediate values for the ``count`` operand are truncated to five or
  six bits for 32-bit or 64-bit operands, respectively.

.. _umlinst-rolins:

ROLINS
~~~~~~

Rotate an integer value and combine with the destination value using a
mask.  The value is rotated to the left (toward the most significant bit
position).  Bits shifted out of the most significant bit position are
shifted into the least significant bit position.  The rotated value is
then combined with the destination value: for bit positions that are set
in the mask, the corresponding bit position values are copied from the
rotated value to the destination; for bit positions that are clear in
the mask, the corresponding bit position values in the destination are
preserved.

+--------------------------------+------------------------------------------------+
| Disassembly                    | Usage                                          |
+================================+================================================+
| .. code-block::                | .. code-block:: C++                            |
|                                |                                                |
|     rolins  dst,src,count,mask |     UML_ROLINS(block, dst, src, count, mask);  |
|     drolins dst,src,count,mask |     UML_DROLINS(block, dst, src, count, mask); |
+--------------------------------+------------------------------------------------+

Rotates the value of ``src`` left by ``count`` bit positions and then
copies bit values to ``dst`` where the corresponding bit positions are
set in ``mask``.

Back-ends may be able to optimise some forms of this instruction, for
example when the ``count`` and ``mask`` operands are both immediate
values.

Operands
^^^^^^^^

dst (32-bit or 64-bit – memory, integer register)
    The destination that will be combined with the rotated value.
src (32-bit or 64-bit – memory, integer register, immediate, map variable)
    The value to be rotated and masked.
count (32-bit or 64-bit – memory, integer register, immediate, map variable)
    The number of bit positions to rotate by.  Only the least
    significant five bits or six bits of this operand are used,
    depending on the instruction size.
mask (32-bit or 64-bit – memory, integer register, immediate, map variable)
    The mask to control which bit positions are copied from the rotated
    value and which bit positions are preserved in the destination.

Flags
^^^^^

carry (C)
    Undefined.
overflow (V)
    Undefined.
zero (Z)
    Set if the result is zero, or cleared otherwise.
sign (S)
    Set to the value of the most significant bit of the result (set if
    the result is a negative signed integer value, or cleared
    otherwise).
unordered (U)
    Undefined.

Simplification rules
^^^^^^^^^^^^^^^^^^^^

* Converted to :ref:`ROL <umlinst-rol>`, :ref:`AND <umlinst-and>`,
  :ref:`OR <umlinst-or>` or :ref:`MOV <umlinst-mov>` if the ``mask``
  operand is an immediate value with all bits set or the immediate value
  zero.
* Immediate values for the ``src`` and ``mask`` operands are truncated
  to the instruction size.
* Immediate values for the ``count`` operand are truncated to five or
  six bits for 32-bit or 64-bit operands, respectively.

.. _umlinst-bswap:

BSWAP
~~~~~

Reverse the order of bytes within an integer value.

+---------------------+----------------------------------+
| Disassembly         | Usage                            |
+=====================+==================================+
| .. code-block::     | .. code-block:: C++              |
|                     |                                  |
|     bswap   dst,src |     UML_BSWAP(block, dst, src);  |
|     dbswap  dst,src |     UML_DBSWAP(block, dst, src); |
+---------------------+----------------------------------+

This instruction can be used to convert between big Endian and little
Endian byte order.

Operands
^^^^^^^^

dst (32-bit or 64-bit – memory, integer register)
    The destination where the result will be stored.
src (32-bit or 64-bit – memory, integer register, immediate, map variable)
    The value to have its byte order reversed.

Flags
^^^^^

carry (C)
    Undefined.
overflow (V)
    Undefined.
zero (Z)
    Set if the result is zero, or cleared otherwise.
sign (S)
    Set to the value of the most significant bit of the result (set if
    the result is a negative signed integer value, or cleared
    otherwise).
unordered (U)
    Undefined.

Simplification rules
^^^^^^^^^^^^^^^^^^^^

* Converted to :ref:`MOV <umlinst-mov>`, :ref:`AND <umlinst-and>` or
  :ref:`OR <umlinst-or>` if the ``src`` operand is an immediate value.


.. _umlinst-fparith:

Floating point arithmetic
-------------------------

.. _umlinst-fadd:

FADD
~~~~

Add two floating point numbers.

+---------------------------+----------------------------------------+
| Disassembly               | Usage                                  |
+===========================+========================================+
| .. code-block::           | .. code-block:: C++                    |
|                           |                                        |
|     fsadd   dst,src1,src2 |     UML_FSADD(block, dst, src1, src2); |
|     fdadd   dst,src1,src2 |     UML_FDADD(block, dst, src1, src2); |
+---------------------------+----------------------------------------+

Calculates ``dst = src1 + src2``.

Operands
^^^^^^^^

dst (32-bit or 64-bit – memory, floating point register)
    The destination where the sum will be stored.
src1 (32-bit or 64-bit – memory, floating point register)
    The first addend.
src2 (32-bit or 64-bit – memory, floating point register)
    The second addend.

Simplification rules
^^^^^^^^^^^^^^^^^^^^

No simplifications are applied to this instruction.

.. _umlinst-fsub:

FSUB
~~~~

Subtract a floating point number from another floating point number.

+---------------------------+----------------------------------------+
| Disassembly               | Usage                                  |
+===========================+========================================+
| .. code-block::           | .. code-block:: C++                    |
|                           |                                        |
|     fssub   dst,src1,src2 |     UML_FSSUB(block, dst, src1, src2); |
|     fdsub   dst,src1,src2 |     UML_FDSUB(block, dst, src1, src2); |
+---------------------------+----------------------------------------+

Calculates ``dst = src1 - src2``.

Operands
^^^^^^^^

dst (32-bit or 64-bit – memory, floating point register)
    The destination where the difference will be stored.
src1 (32-bit or 64-bit – memory, floating point register)
    The minuend (the value to subtract from).
src2 (32-bit or 64-bit – memory, floating point register)
    The subtrahend (the value to subtract).

Flags
^^^^^

carry (C)
    Undefined.
overflow (V)
    Undefined.
zero (Z)
    Undefined.
sign (S)
    Undefined.
unordered (U)
    Undefined.

Simplification rules
^^^^^^^^^^^^^^^^^^^^

No simplifications are applied to this instruction.

.. _umlinst-fcmp:

FCMP
~~~~

Compare two floating-point numbers and set the carry, zero and unordered
flags.

+-----------------------+-----------------------------------+
| Disassembly           | Usage                             |
+=======================+===================================+
| .. code-block::       | .. code-block:: C++               |
|                       |                                   |
|     fscmp   src1,src2 |     UML_FSCMP(block, src1, src2); |
|     fdcmp   src1,src2 |     UML_FDCMP(block, src1, src2); |
+-----------------------+-----------------------------------+

Operands
^^^^^^^^

src1 (32-bit or 64-bit – memory, floating point register)
    The left-hand side value to compare.
src2 (32-bit or 64-bit – memory, floating point register)
    The right-hand side value to compare.

Flags
^^^^^

carry (C)
    Set if the value of ``src1`` is less than the value of ``src2``, or
    cleared otherwise.
overflow (V)
    Undefined.
zero (Z)
    Set if the values of ``src1`` and ``src2`` are equal, or cleared
    otherwise.
sign (S)
    Undefined.
unordered (U)
    Set if either ``src1`` or ``src2`` is not a number (NaN), or cleared
    otherwise.

Simplification rules
^^^^^^^^^^^^^^^^^^^^

No simplifications are applied to this instruction.

.. _umlinst-fmul:

FMUL
~~~~

Multiply two floating point numbers.

+---------------------------+----------------------------------------+
| Disassembly               | Usage                                  |
+===========================+========================================+
| .. code-block::           | .. code-block:: C++                    |
|                           |                                        |
|     fsmul   dst,src1,src2 |     UML_FSMUL(block, dst, src1, src2); |
|     fdmul   dst,src1,src2 |     UML_FDMUL(block, dst, src1, src2); |
+---------------------------+----------------------------------------+

Calculates ``dst = src1 * src2``.

Operands
^^^^^^^^

dst (32-bit or 64-bit – memory, floating point register)
    The destination where the product will be stored.
src1 (32-bit or 64-bit – memory, floating point register)
    The multiplicand (the value to multiply).
src2 (32-bit or 64-bit – memory, floating point register)
    The multiplier (the value to multiply by).

Flags
^^^^^

carry (C)
    Undefined.
overflow (V)
    Undefined.
zero (Z)
    Undefined.
sign (S)
    Undefined.
unordered (U)
    Undefined.

Simplification rules
^^^^^^^^^^^^^^^^^^^^

No simplifications are applied to this instruction.

.. _umlinst-fdiv:

FDIV
~~~~

Divide a floating point number by another floating point number.

+---------------------------+----------------------------------------+
| Disassembly               | Usage                                  |
+===========================+========================================+
| .. code-block::           | .. code-block:: C++                    |
|                           |                                        |
|     fsdiv   dst,src1,src2 |     UML_FSDIV(block, dst, src1, src2); |
|     fddiv   dst,src1,src2 |     UML_FDDIV(block, dst, src1, src2); |
+---------------------------+----------------------------------------+

Calculates ``dst = src1 / src2``.

Operands
^^^^^^^^

dst (32-bit or 64-bit – memory, floating point register)
    The destination where the quotient will be stored.
src1 (32-bit or 64-bit – memory, floating point register)
    The dividend (the value to divide).
src2 (32-bit or 64-bit – memory, floating point register)
    The divisor (the value to divide by).

Flags
^^^^^

carry (C)
    Undefined.
overflow (V)
    Undefined.
zero (Z)
    Undefined.
sign (S)
    Undefined.
unordered (U)
    Undefined.

Simplification rules
^^^^^^^^^^^^^^^^^^^^

No simplifications are applied to this instruction.

.. _umlinst-fneg:

FNEG
~~~~

Negate a floating point number.

+---------------------+---------------------------------+
| Disassembly         | Usage                           |
+=====================+=================================+
| .. code-block::     | .. code-block:: C++             |
|                     |                                 |
|     fsneg   dst,src |     UML_FSNEG(block, dst, src); |
|     fdneg   dst,src |     UML_FDNEG(block, dst, src); |
+---------------------+---------------------------------+

Calculates ``dst = -src``.

Operands
^^^^^^^^

dst (32-bit or 64-bit – memory, floating point register)
    The destination where the result will be stored.
src (32-bit or 64-bit – memory, floating point register)
    The value to be negated.

Flags
^^^^^

carry (C)
    Undefined.
overflow (V)
    Undefined.
zero (Z)
    Undefined.
sign (S)
    Undefined.
unordered (U)
    Undefined.

Simplification rules
^^^^^^^^^^^^^^^^^^^^

No simplifications are applied to this instruction.

.. _umlinst-fabs:

FABS
~~~~

Calculate the absolute value of a floating point number.

+---------------------+---------------------------------+
| Disassembly         | Usage                           |
+=====================+=================================+
| .. code-block::     | .. code-block:: C++             |
|                     |                                 |
|     fsabs   dst,src |     UML_FSABS(block, dst, src); |
|     fdabs   dst,src |     UML_FDABS(block, dst, src); |
+---------------------+---------------------------------+

Operands
^^^^^^^^

dst (32-bit or 64-bit – memory, floating point register)
    The destination where the result will be stored.
src (32-bit or 64-bit – memory, floating point register)
    The value to calculate the absolute value of.

Flags
^^^^^

carry (C)
    Undefined.
overflow (V)
    Undefined.
zero (Z)
    Undefined.
sign (S)
    Undefined.
unordered (U)
    Undefined.

Simplification rules
^^^^^^^^^^^^^^^^^^^^

No simplifications are applied to this instruction.

.. _umlinst-fsqrt:

FSQRT
~~~~~

Calculate the square root of a floating point number.

+---------------------+----------------------------------+
| Disassembly         | Usage                            |
+=====================+==================================+
| .. code-block::     | .. code-block:: C++              |
|                     |                                  |
|     fssqrt  dst,src |     UML_FSSQRT(block, dst, src); |
|     fdsqrt  dst,src |     UML_FDSQRT(block, dst, src); |
+---------------------+----------------------------------+

Operands
^^^^^^^^

dst (32-bit or 64-bit – memory, floating point register)
    The destination where the square root will be stored.
src (32-bit or 64-bit – memory, floating point register)
    The value to calculate the square root of.

Flags
^^^^^

carry (C)
    Undefined.
overflow (V)
    Undefined.
zero (Z)
    Undefined.
sign (S)
    Undefined.
unordered (U)
    Undefined.

Simplification rules
^^^^^^^^^^^^^^^^^^^^

No simplifications are applied to this instruction.

.. _umlinst-frecip:

FRECIP
~~~~~~

Calculate an approximate reciprocal value of a floating point number.
The algorithm used, precision and nature of inaccuracies in the
approximation are unspecified.

+---------------------+---------------------------------+
| Disassembly         | Usage                           |
+=====================+=================================+
| .. code-block::     | .. code-block:: C++             |
|                     |                                 |
|     fsabs   dst,src |     UML_FSABS(block, dst, src); |
|     fdabs   dst,src |     UML_FDABS(block, dst, src); |
+---------------------+---------------------------------+

Operands
^^^^^^^^

dst (32-bit or 64-bit – memory, floating point register)
    The destination where the result will be stored.
src (32-bit or 64-bit – memory, floating point register)
    The value to approximate the reciprocal of.

Flags
^^^^^

carry (C)
    Undefined.
overflow (V)
    Undefined.
zero (Z)
    Undefined.
sign (S)
    Undefined.
unordered (U)
    Undefined.

Simplification rules
^^^^^^^^^^^^^^^^^^^^

No simplifications are applied to this instruction.

.. _umlinst-frsqrt:

FRSQRT
~~~~~~

Calculate an approximate reciprocal value of the square root of a
floating point number.  The algorithm used, precision and nature of
inaccuracies in the approximation are unspecified.

+---------------------+-----------------------------------+
| Disassembly         | Usage                             |
+=====================+===================================+
| .. code-block::     | .. code-block:: C++               |
|                     |                                   |
|     fsrsqrt dst,src |     UML_FSRSQRT(block, dst, src); |
|     fdrsqrt dst,src |     UML_FDRSQRT(block, dst, src); |
+---------------------+-----------------------------------+

Operands
^^^^^^^^

dst (32-bit or 64-bit – memory, floating point register)
    The destination where the result will be stored.
src (32-bit or 64-bit – memory, floating point register)
    The value to approximate the reciprocal of the square root of.

Flags
^^^^^

carry (C)
    Undefined.
overflow (V)
    Undefined.
zero (Z)
    Undefined.
sign (S)
    Undefined.
unordered (U)
    Undefined.

Simplification rules
^^^^^^^^^^^^^^^^^^^^

No simplifications are applied to this instruction.

.. _umlinst-frnds:

FRNDS
~~~~~

Round a 64-bit floating point value to 32-bit precision.  The current
default rounding type set using the :ref:`SETFMOD <umlinst-setfmod>` is
used.  Note that the instruction size must always be 64 bits for this
instruction.

+---------------------+----------------------------------+
| Disassembly         | Usage                            |
+=====================+==================================+
| .. code-block::     | .. code-block::                  |
|                     |                                  |
|     fdrnds  dst,src |     UML_FDRNDS(block, dst, src); |
+---------------------+----------------------------------+

Operands
^^^^^^^^

dst (64-bit – memory, floating point register)
    The destination where the rounded value will be stored.
src (64-bit – memory, floating point register)
    The floating point value to round to 32-bit precision.

Flags
^^^^^

carry (C)
    Undefined.
overflow (V)
    Undefined.
zero (Z)
    Undefined.
sign (S)
    Undefined.
unordered (U)
    Undefined.

Simplification rules
^^^^^^^^^^^^^^^^^^^^

No simplifications are applied to this instruction.


.. _umlinst-fpconv:

Floating point conversion
-------------------------

.. _umlinst-ftoint:

FTOINT
~~~~~~

Convert a floating point number to a signed two’s complement integer.

+--------------------------------+------------------------------------------------+
| Disassembly                    | Usage                                          |
+================================+================================================+
| .. code-block::                | .. code-block::                                |
|                                |                                                |
|     fstoint dst,src,size,round |     UML_FSTOINT(block, dst, src, size, round); |
|     fdtoint dst,src,size,round |     UML_FDTOINT(block, dst, src, size, round); |
+--------------------------------+------------------------------------------------+

Operands
^^^^^^^^

dst (32-bit or 64-bit – memory, integer register)
    The destination where the integer value will be stored.  The
    size/format is controlled by the ``size`` operand.
src (32-bit or 64-bit – memory, floating point register)
    The floating point value to convert to an integer.  The instruction
    size sets the size/format of this operand.
size (access size)
    The size of the result.  Must be ``SIZE_DWORD`` (32-bit) or
    ``SIZE_QWORD`` (64-bit).  Note that this operand controls the size
    of the ``dst`` operand while the instruction size sets the size of
    the ``src`` operand.
round (rounding type)
    The rounding type to use.  Must be ``ROUND_ROUND`` (round to
    nearest), ``ROUND_CEIL`` (round toward positive infinity),
    ``ROUND_FLOOR`` (round toward negative infinity), ``ROUND_TRUNC``
    (round toward zero) or ``ROUND_DEFAULT`` (use the current default
    rounding type set using the :ref:`SETFMOD <umlinst-setfmod>`
    instruction).

Flags
^^^^^

carry (C)
    Undefined.
overflow (V)
    Undefined.
zero (Z)
    Undefined.
sign (S)
    Undefined.
unordered (U)
    Undefined.

Simplification rules
^^^^^^^^^^^^^^^^^^^^

No simplifications are applied to this instruction.

.. _umlinst-ffrint:

FFRINT
~~~~~~

Convert a signed two’s complement integer to a floating point number.

+--------------------------+-----------------------------------------+
| Disassembly              | Usage                                   |
+==========================+=========================================+
| .. code-block::          | .. code-block::                         |
|                          |                                         |
|     fsfrint dst,src,size |     UML_FSFRINT(block, dst, src, size); |
|     fdfrint dst,src,size |     UML_FDFRINT(block, dst, src, size); |
+--------------------------+-----------------------------------------+

Operands
^^^^^^^^

dst (32-bit or 64-bit – memory, floating point register)
    The destination where the floating point value will be stored.  The
    instruction size sets the size/format of this operand.
src (32-bit or 64-bit – memory, integer register, immediate, map variable)
    The integer value to convert to a floating point value.  The
    size/format is controlled by the ``size`` operand.
size (access size)
    The size of the input.  Must be ``SIZE_DWORD`` (32-bit) or
    ``SIZE_QWORD`` (64-bit).  Note that this operand controls the size
    of the ``src`` operand while the instruction size sets the size of
    the ``dst`` operand.

Flags
^^^^^

carry (C)
    Undefined.
overflow (V)
    Undefined.
zero (Z)
    Undefined.
sign (S)
    Undefined.
unordered (U)
    Undefined.

Simplification rules
^^^^^^^^^^^^^^^^^^^^

* Immediate values for the ``src`` operand are truncated to the size set
  using the ``size`` operand.

.. _umlinst-ffrflt:

FFRFLT
~~~~~~

Convert between floating point formats.  The current default rounding
type set using the :ref:`SETFMOD <umlinst-setfmod>` is used if
applicable.

+--------------------------+-----------------------------------------+
| Disassembly              | Usage                                   |
+==========================+=========================================+
| .. code-block::          | .. code-block::                         |
|                          |                                         |
|     fsfrflt dst,src,size |     UML_FSFRFLT(block, dst, src, size); |
|     fdfrflt dst,src,size |     UML_FDFRFLT(block, dst, src, size); |
+--------------------------+-----------------------------------------+

Operands
^^^^^^^^

dst (32-bit or 64-bit – memory, floating point register)
    The destination where the converted value will be stored.  The
    instruction size sets the size/format of this operand.
src (32-bit or 64-bit – memory, floating point register)
    The floating point value to convert.  The size/format is controlled
    by the ``size`` operand.
size (access size)
    The size of the input.  Must be ``SIZE_SHORT`` (32-bit) or
    ``SIZE_DOUBLE`` (64-bit).  Note that this operand controls the size
    of the ``src`` operand while the instruction size sets the size of
    the ``dst`` operand.

Flags
^^^^^

carry (C)
    Undefined.
overflow (V)
    Undefined.
zero (Z)
    Undefined.
sign (S)
    Undefined.
unordered (U)
    Undefined.

Simplification rules
^^^^^^^^^^^^^^^^^^^^

* Converted to :ref:`FMOV <umlinst-fmov>` or :ref:`NOP <umlinst-nop>` if
  the ``dst`` and ``src`` operands have the same size/format.
