2019-04-24 20:05:58 +02:00
2019-04-11 10:02:41 +02:00
2019-04-05 16:24:26 +02:00
2019-04-05 21:34:49 +02:00
2019-04-05 16:51:50 +02:00
2019-04-03 16:04:23 +02:00
2019-04-03 16:04:23 +02:00
2019-04-06 23:13:28 +02:00
2019-04-03 16:16:30 +02:00
2019-04-03 16:08:32 +02:00
2019-04-10 00:03:17 +02:00
2019-04-05 16:24:26 +02:00
2019-04-24 13:31:52 +02:00
2019-04-06 23:13:55 +02:00
2019-04-05 23:21:42 +02:00
2019-04-07 14:10:50 +02:00

LMIA advanced development kit

This project was developed as a simple development kit for programming and microprogramming the LMIA system.

μASM instruction set

NOP

No-operation. This wastes one clock cycle

MOV [regA] [regB]

Move value in regA to regB

This operation uses the bus

If regB is LC, no other LC operation can be specified in the same cycle

MVN {[reg] | [const]}

Move the inverse of a value (from register or constant) into register AR.

This operation uses the bus

If a constant is passed, this operation cannot be parallellized

MVZ

Set AR to zero.

Sets flags: Z, N

ADD {[reg] | [const]}

Add value (from register or constant) to register AR.

This operation uses the bus

If a constant is passed, this operation cannot be parallellized

Sets flags: Z, N, O, C

SUB {[reg] | [const]}

Subtract value (from register or constant) from register AR.

This operation uses the bus

If a constant is passed, this operation cannot be parallellized

Sets flags: Z, N, O, C

AND {[reg] | [const]}

Perform bitwise AND with given value (from register or constant) and register AR. reg is ignored and only prevalent due to a microcompiler quirk; expect it to be removed in future releases.

This operation uses the bus

If a constant is passed, this operation cannot be parallellized

Sets flags: Z, N

ORR {[reg] | [const]}

Perform bitwise Or with given value (from register or constant) and register AR.

This operation uses the bus

If a constant is passed, this operation cannot be parallellized

Sets flags: Z, N

ADN {[reg] | [const]}

Add value (from register or constant) to register AR without updating flags.

This operation uses the bus

If a constant is passed, this operation cannot be parallellized

LSL

Performs a logical shift left of AR.

This operation uses the bus

If a constant is passed, this operation cannot be parallellized

Sets flags: Z, N, C

BSL

Performs a logical shift left of AR and HR as if they were one 32-bit register where AR corresponds to the most-significant bits.

This operation uses the bus

If a constant is passed, this operation cannot be parallellized

Sets flags: Z, N, C

ASR

Performs an arithmetic shift right on AR.

This operation uses the bus

If a constant is passed, this operation cannot be parallellized

Sets flags: Z, N, C

BSR

Performs an arithmetic shift right of AR and HR as if they were one 32-bit register where AR corresponds to the most-significant bits.

This operation uses the bus

If a constant is passed, this operation cannot be parallellized

Sets flags: Z, N, C

LSR

Performs a logical shift right of AR.

This operation uses the bus

If a constant is passed, this operation cannot be parallellized

Sets flags: Z, N, C

ROL

Performs an arithmetic rotate right on AR.

This operation uses the bus

If a constant is passed, this operation cannot be parallellized

Sets flags: Z, N, C

BRL

Performs an arithmetic shift left of AR and HR as if they were one 32-bit register where AR corresponds to the most-significant bits.

This operation uses the bus

If a constant is passed, this operation cannot be parallellized

Sets flags: Z, N, C

LCSET [const]

Set LC to value of constant.

CONST [const]

Set AR to value of cosntant.

This operation cannot be parallelized

INCPC

Increment value in PC by one.

DECLC

Decrement value in LC by one.

CALL [label]

Move value in uPC to uSP (MySPC) and set value in uPC to point to the address of the given label.

RET

Move value in uSP (MySPC) into uPC.

HALT

Stop execution and set value int uPC to 0.

BRA [label]

Perform an unconditional branch to the address of the given label.

BNZ [label]

Branch to address of label if Z-flag is 0.

BRZ [label]

Branch to address of label if Z-flag is 1.

BRN [label]

Branch to address of label if N-flag is 1.

BRC [label]

Branch to address of label if C-flag is 1.

BRO [label]

Branch to address of label if O-flag is 1.

BLS [label]

Branch to address of label if L-flag is 1.

BNC [label]

Branch to address of label if C-flag is 0.

BNO [label]

Branch to address of label if O-flag is 0.

RESET [reg]

Sets all bits in the specified register to 1.

This operation uses the bus

Compiler/weaver directives

#define [name] [const]

Define a compile-time constant. This will replace all (valid) constant declarations with the given name with the value supplied here.

NOTE: Constant names are case-insensitive; i.e. FOO and foo are functionally indistinguishable to the compiler.

#data [address] [const]

Define an initial value in the program memory at the given address.

$[label]

Define a compile-time label at the given position in the microprogram. Labels can be referenced using an '@' symbol.

For example: $BAR would declare a label BAR which can be referenced with @BAR.

NOTE: Label names are case-insensitive; i.e. @FOO and @foo are functionally indistinguishable to the compiler.

Flags

Flags - aside from L - are set based on ALU operations, so they depend on AR and the BUS. Henceforth, unless otherwise implied or stated, AR will refer to the state/value of AR after an ALU operation.

Z

Set if AR == 0

N

Set if sign bit in AR is set.

O

Set if sign of AR differs from signs of both AR and BUS before the arithmetic operation.

C

Set if AR is less than or equal to AR before the arithmetic operation.

L

Set if LC == 0.

Sorting algorithms

For the sorting competition, we have chosen to focus on implementing bucketsort with an inline insertionsort when inserting values into corresponding buckets.

bucksort.uc

This was the first attempt at a sorting implementation in uASM. It doesn't do more than a simple hash and possibly updating some bucket-specific values. Nonetheless, it formed a clear basis for future implementations.

Pros:

  • N/A

Cons:

  • N/A

Average cycle count: N/A

bsrt.uc

The first successful uASM bucketsort implementation. It uses the aforementioned inline insertionsort. Additionally, it employs a lookup table for pointing to buckets; this had the benefit of significantly decreasing the cycles required to hash values, as well as allowing for buckets of sizes other than even exponents of 2 (as opposed to other implementations). On top of this, it used a parallel-hashing system which allowed the rotation step of the hash algorithm to be applied to two elements of the list (to be sorted) at once, further decreasing the amount of cycles required to perform a hash (per element).

Pros:

  • Variable bucket size
  • Arbitrary bucket arrangement
  • Parallel-hash implementation
  • Uses the most recent merge algorithm (rated around 400 cycles)

Cons:

  • Heavy bookkeeping due to lookup table paired with parallel-hash
  • Inefficiency in merge due to lookup table (average loss of 80 cycles)

Average cycle count: 1250

sort2.uc

A second iteration of the bsrt implementation, this one refines the dual-hash by omitting the lookup table. This means that bookkeeping can be minimized, as it was only necessary due to a lack of available registers. This, of course, comes at the cost of not being able to have variable-sized buckets; i.e. their sizes must an exponent of 2. A minor optimization that was created for this algorithm was the hash-based bookkeeping, wherein a small optimization to bookkeeping could be done during the hashing of the list elements. Any bookkeeping which does not require the ALU can, in fact, be performed during the hashing, since the rotation steps only require AR and HR to be untouched. All other operations are permitted.

Pros:

  • Merge: (320-330 cycles)
    • Direct addressing fits optimally due to register limitations
    • Optimized regster allocations
  • Minimal bookkeeping since it is mostly performed during the hashing operation

Cons:

  • Bucket sizes must be exponents of 2
  • Fixed bucket indices based on hash
  • Two merge operations required (negative + positive)
  • 95 unused program-memory addresses

Average cycle count: 1100

sort3.uc

A third iteration of the bsrt implementation: this time entirely scrapping the bucket header, opting to delegate this behaviour to the lookup-table itself. I.e. the lookup table no long points to the start of a bucket (as this can be inferred later), but rather points to the last element of the bucket. Additionally, this implementation scraps the insertion-sort performed in the bucket sort, delegating this task to the merge stage of the sorting algorithm.

Pros:

  • Constant-time bucket sort (406 cycles)
  • More optimally used LUT
  • No bookkeeping during bucket-sort

Cons:

  • Non-constant-time merge
  • Optimized LUT requires marginally more arithmetic operations over other variants

Average cycle count: N/A (not fully implemented)

Description
University project
Readme 1.3 MiB
Languages
Python 100%