Commodore 64

From SizeCoding
Revision as of 16:34, 29 April 2023 by Wil (talk | contribs) (Getting something on screen: changed comment for clarity)

Jump to: navigation, search

Commodore 64

The Commodore system consists of a 6502-compatible MOS 6510 processor with custom hardware for graphics and sound.

Setting up

Setting up your development platform for the Commodore systems is quite easy, first get the following tools:

  • Assembler: To be added
  • Emulator(s): VICE is the way to go

Memory map

    0-1          $0-$1      PROCESSOR PORT REGISTERS, not usable as RAM
   0-255         $0-$FF     ZEROPAGE, mostly used by BASIC interpreter and Kernal
  256-511      $100-$1FF    SYSTEM STACK
  512-600      $200-$258    INPUT BUFFER
  601-630      $259-$276    FILE SYSTEM DATA
  631-640      $277-$280    KEYBOARD BUFFER, filled by Kernal ISR upon key presses 
  641-678      $281-$2A6    VARIOUS KERNAL VARIABLES
  679-767      $2A7-$2FF    Unused (89 byte)
  768-779      $300-$30B    KERNAL VECTORS
  780-783      $30C-$30F    REGISTER VALUES A,X,Y,SR for SYS
  784-819      $310-$333    KERNAL VECTORS
  820-827      $334-$33B    Unused (8 byte)
  828-1019     $33C-$3FB    DATASETTE BUFFER
 1020-1023     $3FC-$3FF    Unused (4 byte)
 1024-2023     $400-$7E7    SCREEN MEMORY (25 lines times 40 columns)
 2024-2039     $7E8-$7F7    Unused (16 byte)
 2040-2047     $7F8-$7FF    SPRITE COSTUME POINTERS
 2048          $800         must contain a value of 0 so that the BASIC program can be RUN
 2049-40959    $801-$9FFF   BASIC AREA (code, variables, arrays, strings)
40960-49151   $A000-$BFFF   BASIC ROM (unused RAM underneath)
49152-53247   $C000-$CFFF   UPPER RAM AREA (unused)
53248-53294   $D000-$D02E   VIC-II, memory-mapped graphics chip registers)
54272-54300   $D400-$D41C   SID, memory-mapped sound chip registers)
55296-56295   $D800-$DBE7   Color RAM, only lower 4 bits are stored, higher nibble undefined
56296-56319   $DBE8-$DBFF   Unused (24 half-bytes)
56320-56335   $DC00-$DC0F   CIA#1, memory-mapped I/O chip registers for keyboard, joystick, mouse, datasette, IRQ control
56576-56591   $DD00-$DD0F   CIA#1, memory-mapped I/O chip registers for serial bus, RS232, NMI control
57344-65535   $E000-$FFFF   KERNAL ROM
65534-65535   $FFFE-$FFFF   Execution address of interrupt service routine, can only be set if KERNAL is turned off

For a detailed explanation of the C64 memory map check out Commodore 64 memory map on Joe Forster/STA homepage.

Program file overhead

A standard CBM file consists of a 2 byte loading address followed by the data that is placed into memory starting at this address. In addition, if your machine program should be able to be started with "RUN", you need a BASIC stub that is loaded into memory from $0801 to $080c, typically followed by your machine program:

*=$0801
        .word $080b   ;address of next BASIC line
        .word LINENO  ;line number, can be chosen between 0 and 63999
        .byte $9e     ;token for SYS command
        .byte $32,$30,$35,$39  ;address to jump to in ASCII code: "2059" 
        .byte $00     ;end of BASIC line
start   lax #0        ;actually address of next BASIC line, $xx00 means end of BASIC program, use $xx for an opcode
; your code here

Together with the 2 byte load address, this makes an overhead of 14 bytes for your program, coming with A and X initialised to 0. To reduce this, you can

  • omit the BASIC line and tell users to start your program by directly entering "SYS address"
  • save your program from $0802 on, the byte at $0801 is recovered automatically after loading by the relink function. This reduces your program file size by 1 byte
  • accept the loss and use the 2 byte establishing LINENO for something good, they are copied to zeropage addresses $39/$3a automatically, saving you the effort of initializing the addresses
  • use an autostart method (see below)

Some compos also specify that the BASIC loader does not count towards the overall filesize, check the rules before you worry.

Autostart

Unlike the Commodore 128, the Commodore 64 has no dedicated built-in auto-boot feature, but it is possible to autorun a program after loading by specifying the load address so that it overwrites a vector or address that is called by the operating system. There are multiple possibilities:

Overwriting CHROUT vector at $326-$327

The vector at address $326-$327 points at the CHROUT routine, normally at address $F1CA. If you put the entry address of your program into $326/27, your code will be called when the operating system wants to print "READY." after loading. Note that the vector at address $328-$329 points at the STOP routine that checks the status of Stop key indicator. Since this routine is called multiple times before your code is done loading, its default value $F6ED needs to be preserved.

Application example:

*=$0326
        .word start              
        .byte $ed,$f6
start
; rest of code

Together with the two byte load address, the overwritten CHROUT vector and the preserved STOP vector, your program file will have a deadweight of 6 byte: $26 $03 $0a $03 $ed $f6, less than half of the standard version with a BASIC stub with SYS

Caveat:

  • If your program after $32a is longer than 214 byte, part of your program will load into the screen at $400. Thus, issuing a clear screen will overwrite your program. It is possible to move the screen memory somewhere else, but that requires storing a new value into address $D018, which when done with an LDA # / STA pair will cost you 5 byte
  • Since you changed the CHROUT pointer, standard KERNAL functions for printing won't work. For output of single characters, you can still use the direct address $F1CA. Alternatively, you can restore the vectors by calling $FD15, which will overwrite the memory $314 to $333 with the default values.
  • The memory area $33C-$3FB is the datasette buffer, thus loading a program into this area won't work when loading from a cassette.

Overwriting BASIC idle loop vector at $302-$303

After loading the program, the BASIC idle loop will be called. By overwriting the vector at $302/$303 it is possible to autostart your program. Since the memory before $302 can be safely overwritten, the program would be arranged so that $303 is the last address of your program. If your program requires a loop back to the start, the vector can be used as part of a JMP where the following two bytes are already there, saving one byte in comparison to a loop done with a branch command.

Application example:

*=$304 - (end - start)
start:
        ;your code here
        ; ...
        jmp     start    ;the address for start lies in $302/$303, thus will be called by the Kernal after loading
end:

In cases where the vector has a practical use, the overhead in filesize is only the 2 byte loading address. For programs larger than 93 byte keep in mind that addresses before $2A7 are in use by the Kernal.

Overwriting return address on stack at $1F8

When the load routine is called, the stack pointer is always at $FA at a stock C64. Thus the return address from the loading process is stored in memory address $1F8 (low byte) and $1F9 (high byte). During the loading routine, stack usage will clobber the stack below $1F8, but a program loaded to $1F8 and above will be fine and the overwritten address stored at $1F8/$1F9 will be called after loading.

Application example:

*=$1F8
        .word start-1   ;adjust for the RTS command pulling the address from stack and increasing it by 1
start:
        ;your code here

For programs larger than 95 byte keep in mind that addresses after $259 are in use by the Kernal, you might overwrite some values here if you don't have disk operations, but better check the memory map beforehand.

Together with the two byte load address and the overwritten address at $1F8/$1F9, your program file will have a deadweight of 4 byte when using this method.

Putting your program into zeropage overwriting part of CHRGET

CHRGET is a machine code routine for the BASIC interpreter that is placed in the zero page at $73-$8A. By putting your program at $7C and following, the CHRGET routine runs directly into your code.

Application example:

*=$7C
start:
        ;your code here

With this method, there is no need for a start address vector. The only deadweight in your program is the loading address. However, the zero page is a busy place, if your program extends beyond address $8f (that is it has more than 22 bytes on disk) you have to deal with some parts of your program being overwritten, for example, the status byte in $90 is set after loading.

Loading data into address $A9 has an interesting side effect - the vector of $AE/$AF defines the address where the loaded bytes are written to. If $AE gets overwritten with, for example, $40, the next byte will be written to $41 (because the vector is incremented after the write). This can be used to split your program into different parts of memory.

Video display

Video display on the Commodore has the following native video modes:

  • Standard text mode, 40x25 characters, 1 individual foreground color per character, 1 common background color
  • Multicolor text mode, 40x25 characters, only 8 foreground colors, can be mixed with hires chars, foreground < 8 denotes a hires char, with foreground >= 8 a pair of two bits (brick pixel) define color: background, foreground & %111, common color 1+2
  • Extended background color text mode, 40x25 characters, 1 individual foreground color per character, 1 out of 4 background colors, only 64 chars
  • Hires bitmap, 320x200 pixels, an 8x8 field can have an individual foreground and an individual background color
  • Multicolor bitmap, 160x200 pixels, a pair of two bits (brick pixel) define color: background, screen color lower nibble, screen color higher nibble, color from color RAM

Sync with frame

Depending on the VIC type, there are 262/263 raster lines (NTSC) or 312 raster lines (PAL-B). Therefore, the VIC uses register at $D012 and the highest bit of register $D011 to represent the current raster line. Checking only the lower 8 bits and waiting for them to be 0 would therefore not work because this happens two times per frame. However, any number greater than 64 will only trigger once. Using a value of 255, we would sync to a rasterline at the end of the screen, well below the part where graphics are normally displayed:

waitsync:
        ldx $D012
        inx
        bne waitsync

As long as there have more than 63 cycles passed since the snyc, the next sync will happen at the next frame. This works fine as long as we don't miss the line 255 due to an interrupt. If your program has interrupts that are not synced to a rasterline, you can change the condition as follows

        lda #$E3
waitsync:
        cmp $D012
        bcs waitsync

Now we have a window between raster line $E4 and $FF to trigger our sync, which we will meet even if a standard Kernal interrupt comes in-between. Of course, our main program now has to burn around 1700 cycles to avoid a retriggering in the same frame.

Getting something on screen

Output a character:

        lda     #C       ;C must be the char code according to the PETSCII code table
        jsr     $FFD2    ;function preserves A,X,Y and enables interrupt; you can also use the direct address $FC1A

Since the PETSCII code table contains also control characters, this method can be also used to clear the screen, move the cursor or set the text color. However, some functions are shorter (and faster) by directly calling the respective KERNAL function:

Clear screen             JSR $E544
Cursor home position     JSR $E566
Output newline character JSR $AAD7
Set text color           LDA #COLOR, STA $286
Set border color         LDA #COLOR, STA $D020   ;not possible with a PETSCII control code
Set background color     LDA #COLOR, STA $D021   ;not possible with a PETSCII control code

Copy characters directly to screen memory:

.macpack cbm   ;scrcode macro
SCREEN_BASE=$400
LINE=5    ;for example
COLUMN=10 ;for example

        ldx     #endtext-text-1
@loop:  lda     text,x
        sta     SCREEN_BASE+LINE*40+COLUMN,x
        dex
        bpl     @loop
text:   scrcode "hello world!"
endtext:

Note that the text needs to be stored in screen code format, not in PETSCII (see Commodore 64 PETSCII code to screen code conversion for the differences). Furthermore, the example above only writes to the screen memory, not the color RAM, assuming that the color RAM already contains the intended text color. This is the case after issuing a clear screen on most C64 machines, except for some old ROM revisions. To be on the save side you would have to write to color RAM as well, which costs another 5 byte:

.macpack cbm     ;scrcode macro
SCREEN_BASE=$400
COLOR_RAM  =$D800
LINE       =5    ;for example
COLUMN     =10   ;for example
COLOR      =1    ;white

        ldx     #endtext-text-1
@loop:  lda     text,x
        sta     SCREEN_BASE+LINE*40+COLUMN,x
        lda     #COLOR
        sta     COLOR_RAM  +LINE*40+COLUMN,x
        dex
        bpl     @loop
text:   scrcode "hello world!"
endtext:

Sound

The Commodore 64 uses the famous SID chip to generate sound. To be added soon.

Make some noise

To be added soon.

Useful Kernal functions

Get a pseudo random number

        lda     #1   
        jsr     $E09A
        lda     $63

The sequence of random numbers is always the same after a cold start or reset, the first 20 numbers are:

 4,24,231,4,177,170,193,101,231,18,196,1,75,48,149,36,124,53,65,154

Delay 1 Millisecond

        jsr     $EEB3

Delay ~ 2 Seconds

This is actually Commodore's memory testing routine that is run every time the computer is turned on. Execution takes about 2 seconds to execute.

        jsr     $FD68

Increment a zero page counter, return in X register, clear Y register

This is part of the string handling function in the BASIC ROM, but the code could be useful for some loop stuff.

        jsr     $B5FF    ;does inc $23, ldx $23, ldy #$00, rts

Additional Resources