MIDI

From ZDoom Wiki
Jump to navigation Jump to search

MIDI, acronym of MiniWikipediaLogoIcon.pngMusical Instrument Digital Interface, is a comprehensive standard for synthesizers and computers. Notably, it defines a file format for storing music tracks that can then be synthesized. In common speech, "MIDI" or "midi" is often used to refer to these files. A number of variants of the standard MIDI file format have been developed for various reasons, sometimes to add extended information to allow more effects, sometimes on the contrary to remove less important parts to make the files even more compact. These variants are usually not recognized by the MiniWikipediaLogoIcon.pngMIDI Manufacturers Association; but several of them can be considered valid standards in video game communities. ZDoom can supports four of these variants, the MUS, HMI, HMP and XMI formats.

MIDI playback in ZDoom is done through one of the software synthesizers available.

Technical details

The standard MIDI file format, developed in 1990, is, like the PNG image format, based on chunks, inspired by the MiniWikipediaLogoIcon.pngInterchange File Format, developed in 1985 jointly by MiniWikipediaLogoIcon.pngElectronic Arts and MiniWikipediaLogoIcon.pngCommodore. A MIDI chunk begins with an 8-byte header, consisting of a four-byte identifier (e.g. "MThd") followed by a 32-bit MiniWikipediaLogoIcon.pngbig-endian unsigned integer giving the size of the data part of the chunk (the header's size is not counted in that total). The standard defines two chunks: MThd, the header chunk, and MTrk, a music track. There can be only one header and it must be at the start of the file; but there may be any number of tracks.

The header chunk has a size of six and contains three 16-bit big-endian signed integers; respectively the MIDI format, the number of tracks, and the division value.

  • MIDI format: the standard defines three formats:
    • Format 0: Contains a single song defined by a single track.
    • Format 1: Contains a single song defined by one or more tracks.
    • Format 2: Contains one or more songs, each defined by a single track.
  • Number of tracks: this should correspond to the total number of MTrk chunks contained in the file.
  • Division value: this allows to compute the length of the time unit used in the rest of the file, notably for the duration of events. The value is expressed on fifteen bits: the first bit corresponds to a flag determining whether to use real-time FPS measurement, or metrical measurement. If it is zero, then the rest of the value corresponds to the number of ticks per quarter-note. If it is one, then the remaining fifteen bits are cut into two values: the first seven bits correspond to the number of frames per second, normally identifying a MiniWikipediaLogoIcon.pngstandard SMPTE frame rate (24, 25, 29 or 30). The last eight bits correspond to the number of ticks per frame.

A track chunk is a stream of "events"; there are MIDI events which concern the song itself, system exclusive events (a.k.a. sysex events) which concern the synthesizer, and meta-events, which can be anything else. Each event is preceded by its duration, also called "delta time", which is the number of divisions it should last, expressed as a variable length number.

A variable length number is big-endian and uses seven bits per byte for the actual value; the most significant bit being used to mark whether the data continues on the next bit (set) or not (unset). So for instance, the value 128 (1000 0000 or 0x80 normally) would be represented as (1)000 0001 (0)000 0000, or 0x8100.

MIDI events

Each MIDI event uses a header of four or more bytes that works like this: first, one byte or more for the variable length number delta time value. Then a byte for the event type. Then two bytes for event parameters, which are usually two single-byte values but may be combined as a single 16-bit value, depending on the event. After the header, again depending on the event, additional data may follow. The most important MIDI events are channel events, where the event type byte is cut in two four-bit values (from 0 to 15). The four most significant bits correspond to the event type, the four least significant bits to the channel number. There are seven MIDI channel events:

Value Event name Parameters
0x8 Note off Note number Velocity
0x9 Note on Note number Velocity
0xA Note aftertouch Note number Value
0xB Controller Control type Value
0xC Program change Program 0
0xD Channel aftertouch Value 0
0xE Pitch bend pitch value

Of these, the least straightforward is the controller event. ZDoom supports the standard MIDI controllers as well as the MiniWikipediaLogoIcon.pngApogee EMIDI extra events. The following table only details event types that have special handling in ZDoom; for a list of all standard MIDI controllers refer to the specs.

Type Name Notes
007 Main volume This is ignored in EMIDI songs, but accepted in other MIDI songs.
110 EMIDI track designation Designated track uses FM synth for instruments 4 to 7; only valid in InitBeat.
111 EMIDI track exclusion Designate all tracks except excluded one; only valid in InitBeat.
112 EMIDI program change Ignored unless it also appears in the InitBeat.
113 EMIDI volume change Ignored unless it also appears in the InitBeat.
116 EMIDI Track loop begin Value corresponds to number of time it loops, a value of zero means infinite loops.
117 EMIDI Track loop end Value must be 127.
118 EMIDI Global loop begin Value corresponds to number of time it loops, a value of zero means infinite loops.
119 EMIDI Global loop end Value must be 127.

Note that some MIDI files, notably those created by MiniWikipediaLogoIcon.pngRPG Maker, use controller 111 to mark a loop starting point; ZDoom does not support this (it conflicts with an EMIDI controller) and will not play these MIDI songs correctly.

External links