A Bitbanger's Blog: July 2018

Monday, July 30, 2018

'COSMIC ALIENS'

Stevie Strow, host of CoCoTalk (among other things), and I chatted on Discord yesterday. The conversation shifted to BASIC optimizations related to the 'Why is BASIC so Slow' segment on his show. This led to him asking me if I could look at the code for his COSMIC ALIENS game to see what optimizations could be made. A couple of hours later, he recorded this video showing the latest version of his game. There is still room for optimization, but it is already quite playable, and it would certainly have been published in a magazine back in the day. Not a bad little Demon Attack type of game.

If you were to do this on the Atari, you'd need to use an assembly subroutine to draw the sprites, or you'd have to overlay multiple player/missiles to get that many colors for the sprites.

You can visit Stevie's development page to compare it to previous versions. The improvement in the speed is subtle but definitely visible.

Stevie's COSMIC ALIENS page:
http://cosmicaliens.com

Sunday, July 29, 2018

Article on Floating vs Fixed Point Math

"Is COBOL holding you hostage with Math?
Face it: nobody likes fractions, not even computers.
..."
https://medium.com/@bellmar/is-cobol-holding-you-hostage-with-math-5498c0eb428b

Tuesday, July 24, 2018

BASIC accuracy, 6502 vs 6809/6803

In the discussion about Microsoft BASIC, the difference in accuracy between 6502 and 6809/6803 versions was mentioned.
These are what Ahl's benchmark reports for accuracy on a few machines.
Smaller numbers are better (more accurate).

OSI Challenger 1P .32959
Mattel Aquarius .187805
TRS-80 Model III .0338745
Atari 800 .012959
Atari (fastchip) .006875
Apple II+/IIe .0010414235
C64 .0010414235
VIC 20 .0010414235
Oric .0010414235
Sinclair ZX-81 .0006685257
Sinclair Spectrum .0006685257
MC-10 .000596284867
CoCo .000596284867
Model 100 .0000002058
TI-99/4A .00000011

The Atari, Sinclair, and TI machines do not use Microsoft BASIC.

OSI used a lower precision math library than Apple and Commodore versions, but it finished the benchmark over 30 seconds faster than the Apple & Commodore machines.

The Aquarius appears to use a reduced precision library vs the TRS-80 Model III.

The Model III benchmark seems to have been conducted with single precision variables, but its BASIC also supports double precision, which would make it the 2nd most accurate here. Double precision would be much slower.

The Atari fastchip is an aftermarket math ROM that speeds up the Atari. It cuts over 4 minutes off the Atari's benchmark time and is more accurate than the regular ROM.

The Oric is clocked at the same speed as the Apple II & C64, but its benchmark times are over twice as long. The chrget/chrgot code on page zero skips spaces quickly, but all other conditions are tested in ROM, making it slower than other 6502 versions, but still more optimized than the MC-10 and CoCo versions. At the very least it should have tested for tokens in RAM and everything else in ROM. You can delete extra spaces from code, but you are stuck with tokens.

The Model 100 appears to use double precision math.

The TI-99/4A may be slow, but it has really accurate math libraries.

Sunday, July 22, 2018

Why is Microsoft BASIC so slow? (text related to my guest appearance on the CoCoTalk video podcast)

The following text is a slightly revised version of what I presented on the CoCoTalk video podcast.
Some of the text was simplified for a re-recording of the show and I may have added one new section. The rest of the discussion was off script.

Why is Microsoft BASIC so slow?

A brief introduction to the inner workings of Microsoft BASIC on the CoCo and MC-10

Index

What is an interpreter?
The Tokenizer
8 bit vs 16 bit and the 6800 legacy
Floating Point Math
Runtime Math Evaluation
Variable Storage
Garbage Collection
Running a program step by step

What is an interpreter?

From Wikipedia:

“In computer science, an interpreter is a computer program that directly executes, i.e. performs, instructions written in a programming or scripting language, without requiring them previously to have been compiled into a machine language program. An interpreter generally uses one of the following strategies for program execution:

parse the source code and perform its behavior directly

...”

The Microsoft BASIC Interpreter...

Makes no attempt to convert the program to machine code
The code is still human readable or can be quickly translated back to human readable code
Makes minimal changes to the source code to speed up interpretation
All syntax checking is left for runtime execution

The Tokenizer

Changes the line number at the beginning of a line to a 16 bit integer
Converts BASIC keywords to tokens
Terminates the line with a zero
Inserts the line of code into the program linked list
Does not perform any syntax checking
Does not convert other numbers from ASCII to computer readable form, so they must be converted every time they are needed, and the conversion uses one of the slowest math operations, division
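The tokenizer steps above can be sketched roughly as follows. This is a minimal illustration in Python, not the ROM's code: the token values in KEYWORDS are made up for the example, the keyword list is a tiny subset, and the step that inserts the line into the program's linked list is left out.

```python
# Hypothetical token values for illustration only; the real ROM has its own table.
KEYWORDS = {"PRINT": 0x80, "GOTO": 0x81, "FOR": 0x82, "NEXT": 0x83, "IF": 0x84, "THEN": 0x85}

def tokenize(line):
    """Return (line_number, crunched_bytes) for one line of BASIC source."""
    text = line.lstrip()
    digits = ""
    while text and text[0].isdigit():
        digits += text[0]
        text = text[1:]
    line_number = int(digits)              # the only number converted from ASCII
    if not 0 <= line_number <= 0xFFFF:     # it must fit in a 16 bit integer
        raise ValueError("line number out of range")
    text = text.lstrip()

    out = bytearray()
    in_string = False
    i = 0
    while i < len(text):
        ch = text[i]
        if ch == '"':
            in_string = not in_string      # keywords inside quotes are left alone
        if not in_string:
            for word, token in KEYWORDS.items():
                if text.startswith(word, i):
                    out.append(token)      # keyword becomes a single byte
                    i += len(word)
                    break
            else:
                out.append(ord(ch))        # everything else stays as ASCII
                i += 1
            continue
        out.append(ord(ch))
        i += 1
    out.append(0)                          # terminate the line with a zero
    return line_number, bytes(out)

number, body = tokenize('10 PRINT "GOTO":GOTO 10')
```

Note that the 10 after GOTO is still the two ASCII characters "1" and "0" in the crunched line, and nothing has checked whether the line is valid BASIC.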

8 bit vs 16 bit and the 6800 legacy

The CoCo and MC-10 interpreters are mostly based on Microsoft's 6800 version and have minimal optimizations that take advantage of the 6809's or 6803's additional 16 bit features, new registers, or hardware multiply instruction.
The math library is largely unchanged from the 6800 version. It is still oriented around the use of 8 bit registers, making it slower and larger than it needs to be. It is so complex, it occupies ¼ of the CoCo Color BASIC and MC-10 ROMs
Functions such as memory moves, screen scrolls, clearing screens, etc... all use 8 bit code even though 16 bit code would be significantly faster

Floating Point Math

All numeric variables are stored as floating point
All math is performed using floating point
Floating point math is performed similarly to integer math, but it uses larger numbers and requires extra steps such as normalization, converting numbers to the same exponent, and dealing with the sign
Many fast floating point math functions, such as calculating a square root, did not exist yet
All math is performed using software floating point registers reserved in memory. This requires constantly copying numbers back and forth between the software registers and memory
Floating point numbers are stored in variables using a packed form, but must be unpacked to load key floating point registers, then repacked to move them back to variables
Some numbers do not have exact floating point equivalents making pre-conversion of constants impractical
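To give a feel for the packing and unpacking mentioned above, here is a small Python sketch. It assumes a 5 byte layout of one exponent byte biased by 128 followed by four mantissa bytes, with the sign stored in the top mantissa bit in place of the leading bit that is always set. That layout is my assumption for the example, and the function names are made up. The sketch also truncates where a real math library would round.

```python
import math

def pack(value):
    """Pack a Python float into an assumed 5 byte exponent + mantissa form."""
    if value == 0:
        return bytes(5)                          # exponent byte of 0 means zero
    fraction, exponent = math.frexp(abs(value))  # abs(value) = fraction * 2**exponent
    mantissa = int(fraction * 2**32)             # 32 bit mantissa, top bit always set
    biased = exponent + 128
    if not 1 <= biased <= 255:
        raise OverflowError("exponent out of range")
    sign_bit = 0x80 if value < 0 else 0x00
    first = ((mantissa >> 24) & 0x7F) | sign_bit  # sign replaces the implied top bit
    return bytes([biased, first, (mantissa >> 16) & 0xFF,
                  (mantissa >> 8) & 0xFF, mantissa & 0xFF])

def unpack(packed):
    """Reverse pack(): restore the implied bit and separate out the sign."""
    if packed[0] == 0:
        return 0.0
    negative = bool(packed[1] & 0x80)
    mantissa = ((packed[1] | 0x80) << 24) | (packed[2] << 16) | (packed[3] << 8) | packed[4]
    value = math.ldexp(mantissa, packed[0] - 128 - 32)
    return -value if negative else value

one = pack(1.0)                  # bytes 81 00 00 00 00 under the assumed layout
tenth = unpack(pack(0.1))        # close to 0.1, but not exactly 0.1
```

Values such as 1.0 and -2.5 survive the round trip exactly, while 0.1 comes back slightly off, which is the kind of inexact constant the last point above refers to.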

Runtime Math Evaluation

Math formulas must be parsed in order to determine proper order of operations.
A list of operations is built and then is processed by the interpreter.
This must be done every time you use a math function, even something as simple as A=A+1
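As a rough illustration of the work described above, the Python sketch below builds a list of operations from a formula and then processes it. It uses the well-known shunting-yard method, which is a stand-in and not necessarily how the ROM does it. It only handles +, -, *, /, parentheses, unsigned numeric constants and variable names, and the function names are made up.

```python
import re

PRECEDENCE = {"+": 1, "-": 1, "*": 2, "/": 2}

def to_operation_list(formula):
    """Parse a formula into a postfix list that respects order of operations."""
    tokens = re.findall(r"\d+\.?\d*|[A-Z]+|[-+*/()]", formula)
    output, stack = [], []
    for tok in tokens:
        if tok in PRECEDENCE:
            while stack and stack[-1] != "(" and PRECEDENCE[stack[-1]] >= PRECEDENCE[tok]:
                output.append(stack.pop())
            stack.append(tok)
        elif tok == "(":
            stack.append(tok)
        elif tok == ")":
            while stack[-1] != "(":
                output.append(stack.pop())
            stack.pop()                      # discard the "("
        else:
            output.append(tok)               # constant or variable name
    while stack:
        output.append(stack.pop())
    return output

def evaluate(formula, variables):
    """Parse and evaluate from scratch, as an interpreter must on every pass."""
    values = []
    for op in to_operation_list(formula):
        if op in PRECEDENCE:
            right, left = values.pop(), values.pop()
            if op == "+":
                values.append(left + right)
            elif op == "-":
                values.append(left - right)
            elif op == "*":
                values.append(left * right)
            else:
                values.append(left / right)
        elif op[0].isdigit():
            values.append(float(op))         # ASCII constant converted every time
        else:
            values.append(variables[op])
    return values[0]

variables = {"A": 0.0}
for _ in range(3):                           # like running A=A+1 three times
    variables["A"] = evaluate("A+1", variables)
```

Each pass through the loop parses "A+1" again and converts the "1" from text again, even though nothing about the formula has changed.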

Variable Storage

Variables are added to system memory in the order they are first used, and then by variable type. Every time a new variable is created, space must be allocated for it in the proper area of the variable list. This can include copying a large block of RAM to make room
The interpreter must search through the variable list every time a variable is used in order to use it
The interpreter allocates space for the stack and variables at startup
It must perform checks every time a variable is created, a string is modified, or the stack is used to ensure that the program and variables are not overwritten
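The cost of searching the variable list can be shown with a small Python model. The class and method names are made up, and it ignores variable types and memory layout. It only keeps variables in the order they were first used and counts how many entries are compared on each lookup.

```python
class VariableTable:
    """Variables kept in creation order and found by a linear search."""

    def __init__(self):
        self.entries = []        # [name, value] pairs, oldest first
        self.comparisons = 0     # running count of entries examined

    def find(self, name):
        for entry in self.entries:
            self.comparisons += 1
            if entry[0] == name:
                return entry
        entry = [name, 0.0]      # not found: create it at the end of the list
        self.entries.append(entry)
        return entry

    def get(self, name):
        return self.find(name)[1]

    def set(self, name, value):
        self.find(name)[1] = value

    def cost(self, name):
        """Number of comparisons needed to look up one variable."""
        before = self.comparisons
        self.find(name)
        return self.comparisons - before

table = VariableTable()
for letter in "ABCDEFGHIJKLMNOPQRSTUVWXYZ":
    table.set(letter, 0.0)       # A is created first, Z last

first_cost = table.cost("A")     # 1 comparison
last_cost = table.cost("Z")      # 26 comparisons
```

In this model a variable used inside a tight loop is cheaper to reach if it was one of the first variables the program touched.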

Garbage Collection

As string variables change in size, additional space must be allocated or freed to keep from running out of variable storage space. This can involve moving large blocks of memory
Every time the system does this, it requires copying the blocks of memory where a string is stored and any strings after it up or down in memory as needed
All memory copies in the CoCo and MC-10 ROMs move 8 bits at a time rather than 16
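The Python sketch below is a deliberately simplified model of the copying described above, not the ROM's actual collector. The class name, its fields, and the choice to close a gap immediately when a string is replaced are all assumptions made for illustration. It slides later strings down one byte at a time and counts the bytes moved.

```python
class StringSpace:
    """Toy string area: replacing a string closes its gap by moving later strings."""

    def __init__(self):
        self.heap = bytearray()
        self.descriptors = {}      # name -> [offset, length]
        self.bytes_moved = 0

    def assign(self, name, text):
        data = text.encode("ascii")
        if name in self.descriptors:
            self._remove(name)     # free the old copy first
        self.descriptors[name] = [len(self.heap), len(data)]
        self.heap += data

    def _remove(self, name):
        offset, length = self.descriptors.pop(name)
        tail = self.heap[offset + length:]       # every string stored after it
        for i, byte in enumerate(tail):          # one byte per step, like 8 bit code
            self.heap[offset + i] = byte
            self.bytes_moved += 1
        del self.heap[len(self.heap) - length:]  # shrink by the freed amount
        for descriptor in self.descriptors.values():
            if descriptor[0] > offset:
                descriptor[0] -= length          # fix up pointers to moved strings

    def value(self, name):
        offset, length = self.descriptors[name]
        return self.heap[offset:offset + length].decode("ascii")

space = StringSpace()
space.assign("A$", "HELLO")
space.assign("B$", "WORLD")
space.assign("C$", "!")
space.assign("A$", "HI")           # replacing A$ moves B$ and C$ down
```

Replacing the 5 byte A$ here moves the 6 bytes that belong to B$ and C$, so the cost depends on how much string data sits after the one being changed.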

Running a program step by step

Every time the program looks for a new command, it starts by checking for the BREAK key, which is slow
All characters from the program are read through a call to an inefficient subroutine
Command tokens must be range checked before commands are executed
The token must be used to look up the address of the command and then the command itself is called
Commands must perform additional parsing and syntax checking; on completion, they return to the main loop
Every GOTO and GOSUB requires searching through the linked list of lines
Whenever a line such as an IF THEN skips the rest of a line, the interpreter must search for the end of the line byte by byte
This sequence is repeated for every command in a line of code
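The sequence above can be modelled with a short Python loop. This is only a sketch: the token values, the three commands, and the program layout (a list of line numbers, each with a list of already tokenized statements) are assumptions for the example. The GOTO here always searches from the first line, which may not match every version of the ROM.

```python
TOK_PRINT, TOK_GOTO, TOK_END = 0x80, 0x81, 0x82   # hypothetical token values

def run(program, break_pressed=lambda: False, max_steps=100):
    """Execute a tiny tokenized program; return (output, lines_searched)."""
    output = []
    lines_searched = 0
    line_index, stmt_index = 0, 0

    def do_print(arg):
        output.append(arg)
        return None                      # no jump: continue with next statement

    def do_goto(arg):
        nonlocal lines_searched
        for index, (number, _) in enumerate(program):
            lines_searched += 1          # walk the line list from the top
            if number == arg:
                return index
        raise ValueError("undefined line")

    def do_end(arg):
        return len(program)              # jump past the last line

    dispatch = [do_print, do_goto, do_end]

    for _ in range(max_steps):
        if break_pressed():              # checked before every command
            break
        if line_index >= len(program):
            break
        _, statements = program[line_index]
        if stmt_index >= len(statements):
            line_index, stmt_index = line_index + 1, 0
            continue
        token, arg = statements[stmt_index]
        if not TOK_PRINT <= token < TOK_PRINT + len(dispatch):
            raise SyntaxError("bad token")           # range check
        jump = dispatch[token - TOK_PRINT](arg)      # table lookup, then call
        if jump is None:
            stmt_index += 1
        else:
            line_index, stmt_index = jump, 0
    return output, lines_searched

program = [
    (10, [(TOK_PRINT, "A")]),
    (20, [(TOK_GOTO, 40)]),
    (30, [(TOK_PRINT, "SKIPPED")]),
    (40, [(TOK_PRINT, "B"), (TOK_END, None)]),
]
output, lines_searched = run(program)
```

With this program the single GOTO 40 examines four lines before it finds its target, and the BREAK check, range check and table lookup are repeated for every statement that runs.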