A Python-based asset extractor and very incomplete bytecode decompiler for Media Station, Inc. CD-ROM children's titles. I loved many of these when I was growing up - especially Dalmatians. The ultimate goal of this repo is to create an engine for ScummVM, but I've found investigation and reverse-engineering easier to do out-of-tree in Python. Please join me in preserving these top-quality children's titles for future generations!
The most important knowledge needed right now is documentation for the bytecode engine (scripts). If you have knowledge or code you can contribute, please let us know!
Get it on PyPI: pip3 install MediaStation
Installing the PyPI package also installs a MediaStation
script that can be invoked from the command line as follows:
# Input directory Export directory
# ................. ...................
MediaStation ~/DALMATIANS/DATA ~/DalmatiansExtract
See and contribute to the updated list on this projects's wiki.
If you like my reverse engineering, you can buy me a matcha latte to show your appreciation 🍵!
Please contribute if you have more details here!
All the data files for known titles are stored in the data/
subdirectory on the CD-ROM. These seem to be the same across the Windows and Mac versions. Some titles have additional files than these (like Tonka Garage, which has some Direct3D models for the car design activity), but these are the known files and formats unique to the Media Station engine.
Media Station titles have these types of files:
- Context files (
*.CXT
) - Title definition file (
BOOT.STM
) - Profile (
PROFILE._ST
) - ONLY non-OG titles.
Each context file generally contains all the assets (and, depending on the version, the scripts) necessary to render one screen of the game. Since the format seems to have been originally designed for Disney's Interactive Storybook, this makes sense.
- A context file contains one or more subfiles, which are each complete and (almost) standard
RIFF
s. - Each subfile inside a context file contains one or more (almost) standard
RIFF
_chunk_s.igod
: Indicates a chunk that contains metadata about asset(s) in metadata sections.a000
, where000
is a string that represents a 3-digit hexadecimal number: Indicates a chunk that contains actual asset data (mainly sounds and bitmaps) with lower-level metadata in metadata sections.
- Each chunk can contain the following:
- One or more metadata sections.
- Raw asset data (PCM audio or RLE-compressed bitmaps).
- Each metadata section contains one or more _datum_s.
- Each datum contains a "primitive" data type (integer, string, etc.)
Also known as the "system" file. Contains metadata sections with global title information like the following:
- Title compiler version.
- Declarations of each context file.
- File offsets of all subfiles in all context files.
- Declarations of cursors stored as resources in the executable.
When present, contains a human-readable enumeration of metadata like the following:
- All the assets in the title, along with the IDs and chunk FourCC(s) for that asset.
- Declarations of the variables, constants, cursors, and so forth used in the game.
This doesn't seem to be opened/read by the executables at all while the titles are running. But there is a ton of useful cross-checking info in here.
I re-discovered these titles when I was finding Director titles for the ScummVM Director
engine at GSoC 2020. Coincidentally, the main data file extension (*.CXT
) used in Media
Station titles is the same as that used for protected Director cast archives. I quickly
discovered these weren't Director titles but something completely different - and so this
project was born to preserve them.
Coming soon! For now, the Disney's Animated Storybook article has great background on the early titles, sourced largely from Newton Lee's books.
- The bytecode decompiler needs a ton of work.
- Some script data seems to be stored in the executables. That should be extracted.
- Write a wikipedia article about the defunct Michigan-based company Media Station, Inc.