Difference between revisions of "How best to map the core components of Forth onto the Mill?"

Revision as of 18:50, 5 January 2015

In his post [[1]] forums, Ivan wrote:

The interesting part of doing Forth on a Mill is not the ops, which are straightforward, but in figuring out how to map the Forth dual-stack program model onto the belt

{LarryP opinion}: I'd generalize Ivan's statement to, "How best to map the Forth's key components (including the two stacks) onto the Mill architecture."

For the moment, let's put input/output functions and structures aside (including the outer interpreter, which is basically an input line editing/buffering device.) What's left that Forth needs:

A data stack

A return stack (or functional equivalent)

The inner interpreter

The dictionary, a contiguous chunk of memory used for both compiled words and static variables.

Let's dispose of the easy ones first:

The inner interpreter and any helper functions it needs must be in memory that has execute permission. This probably means that the interpreter and other directly executable code needs to be in a separate chunk of memory from the dictionary and stacks (though they may be adjacent.)

The dictionary is big (way too big to fit on the belt and/or the spiller) and needs both read and write permissions. Although words in the dictionary will be "executed," that execution is really just interpretation by the inner interpreter and any helper functions it calls.

The data stack is also very likely too big to fit on the belt, and will also need to be in read/write memory. Note that this could conceivably be on the Mill's frame-oriented stack of the interpreter function itself, although we'd have to be careful that the Mill stack's virtual zero and automatic stack cut-back work very differently than stacks on conventional CPUs.

Now the less obvious Forth-to-Mill mappings

Unless we want to do painful, low-level management of call and return addresses in the interpreter itself (and use branches instead of calls... yuck!), I think the best option for the return stack is to use the Mill's native call and return operations and the associated spiller stack. Note that using Mill call/return ops will hide the actual CPU/spiller state from us. However, so long as call and return behave; I think that's probably OK.

Note that the Mill's call/return ops impose something -- an a priori known number of results, that Forth doesn't itself require. Relatively few Forth words produce variable numbers of results, because those make data-stack management even trickier than otherwise, but Forth permits one to do tricky/risky things. I think we can use either Nones (maybe) or argc/argv (more likely) for variable numbers of results. Or we forbid variable number of formal return values, at least in the first version. (Translation: if a word wants to do varargs, it has to pass argc and argv as explicit arguments that are included in the formal arg or return counts denoted in the dictionary.)

Side question: does the genAsm language itself generate argc/argv work-arounds for variable number of return values? It seems to me that genAsm must do something like argc/Argv for variable numbers of input arguments -- or for functions that have more input args than will fit on the target member's belt.

Now using the Mill's call/return machinery means that to execute a word, the interpreter will have to:

Find the dictionary entry for the Forth "word" to be executed.
Extract from that dictionary entry the word's execution address, number of input arguments and number of results.
Get the input args onto its own belt
Call the function (probably using an indirect call)
Once the function returns, copy the results from the interpreter's belt onto the data stack, including handling possible variable number of results.

Yes, this means that the interpreter will spend a good fraction of its time moving data between the data stack and its own belt, but it also means that the interpreter won't have to handle the return stack; the Mill hardware will do it for us -- and more reliably. I think this is a better initial design than trying to use conform or rescue ops to force FIFO behavior onto the LIFO-natured belt.

Side note: There is -- or can be -- yet a third stack, if we have use for it. Although the Mill keeps return addresses out of its program-accessible stack, it has a frame-oriented stack that we are free to use. Since each called function gets its own stack frame and allocation (if it uses the stack at all), functions could use such stack frames for local (automatic) storage -- even if the inner interpreter is using its stack to hold the entire dictionary. This would be a departure from the classic (and very stripped-down) usual Forth virtual machine. However, the Mill is such a big departure from conventional architectures, that this may be a fruitful area to consider for "Fifth" -- a notional follow-on interpreted language, designed specifically for the Mill.

/{LarryP opinion}

Difference between revisions of "How best to map the core components of Forth onto the Mill?"

Revision as of 18:50, 5 January 2015

Navigation menu

Personal tools

Namespaces

Variants

Views

Actions

Search

Navigation

Tools

@@ Line 21: / Line 21: @@
 The dictionary is big (way too big to fit on the belt and/or the spiller) and needs both read and write permissions.  Although words in the dictionary will be "executed," that execution is really just interpretation by the inner interpreter and any helper functions it calls.
-The data stack is also very likely too big to fit on the belt, and will also need to be in read/write memory.
+The data stack is also very likely too big to fit on the belt, and will also need to be in read/write memory.  Note that this could conceivably be on the Mill's frame-oriented stack of the interpreter function itself, although we'd have to be careful that the Mill stack's virtual zero and automatic stack cut-back work very differently than stacks on conventional CPUs.
 ----
@@ Line 27: / Line 27: @@
 Now the less obvious Forth-to-Mill mappings
-Unless we want to do painful, low-level management of call and return addresses (e.g. and use branches instead of calls), I think the best option for the return stack is to use the Mill's native call and return operations.  Note that using Mill call/return ops will hide the actual CPU/spiller state from us.  However, so long as call and return behave; I think that's probably OK.
+Unless we want to do painful, low-level management of call and return addresses in the interpreter itself (and use branches instead of calls... yuck!), I think the best option for the return stack is to use the Mill's native call and return operations and the associated spiller stack.  Note that using Mill call/return ops will hide the actual CPU/spiller state from us.  However, so long as call and return behave; I think that's probably OK.
-However, the Mill's call/return ops impose something -- an ''a priori'' known number of results, that Forth doesn't itself enforce.  Relatively few Forth words produce variable numbers of results, because those make stack management even trickier than otherwise. I think we can use either Nones or argc/argv for variable numbers of results.
+Note that the Mill's call/return ops impose something -- an ''a priori'' known number of results, that Forth doesn't itself require.  Relatively few Forth words produce variable numbers of results, because those make data-stack management even trickier than otherwise, but Forth permits one to do tricky/risky things.  I think we can use either Nones (maybe) or argc/argv (more likely) for variable numbers of results.  Or we forbid variable number of formal return values, at least in the first version.  (Translation: if a word wants to do varargs, it has to pass argc and argv as explicit arguments that are included in the formal arg or return counts denoted in the dictionary.)
+Side question: does the genAsm language itself generate argc/argv work-arounds for variable number of return values?  It seems to me that genAsm must do something like argc/Argv for variable numbers of input arguments -- or for functions that have more input args than will fit on the target member's belt.
 Now using the Mill's call/return machinery means that to execute a word, the interpreter will have to:
@@ Line 38: / Line 40: @@
 * Once the function returns, copy the results from the interpreter's belt onto the data stack, including handling possible variable number of results.
+Yes, this means that the interpreter will spend a good fraction of its time moving data between the data stack and its own belt, but it also means that the interpreter won't have to handle the return stack; the Mill hardware will do it for us -- and more reliably.  I think this is a better initial design than trying to use conform or rescue ops to force FIFO behavior onto the LIFO-natured belt.
+Side note:  There is -- or can be -- yet a third stack, if we have use for it.  Although the Mill keeps return addresses out of its program-accessible stack, it has a frame-oriented stack that we are free to use.  Since each called function gets its own stack frame and allocation (if it uses the stack at all), functions could use such stack frames for local (automatic) storage -- even if the inner interpreter is using its stack to hold the entire dictionary.  This would be a departure from the classic (and very stripped-down) usual Forth virtual machine.  However, the Mill is such a big departure from conventional architectures, that this may be a fruitful area to consider for "Fifth" -- a notional follow-on interpreted language, designed specifically for the Mill.
 /{LarryP opinion}