The ISO/IEC 11172 specification defines the audio, video and multiplexing standards collectively and colloquially referred to as the MPEG-1 (Moving Picture Experts Group) compression standard. The data structures for the various components of an encoded bitstream are given in a pseudo-C syntax and are discussed extensively. However, it is difficult to get the big picture from reading the spec alone.
More practically, parsing an MPEG-1 bitstream requires knowing the byte offsets within each structure. To make this information more readily accessible, we have condensed it into graphic form. Of course, this is no substitute for the original spec. Where more information is required than can be squeezed into the diagram, references to the spec are provided.
A multiplexed MPEG-1 stream is composed of distinct Packs.
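As a rough illustration of how Packs can be located, the sketch below scans a byte buffer for the pack start code 0x000001BA. The function name find_pack_offsets is hypothetical, and the plain byte search is an assumption made for brevity: the same four bytes can occur by chance inside packet data, so a robust parser would follow the length fields rather than search.

```python
PACK_START_CODE = b"\x00\x00\x01\xba"  # start code that opens every Pack


def find_pack_offsets(data: bytes) -> list:
    """Return the byte offset of each pack start code found in data.

    Naive scan (assumption): it does not verify that a match is a real
    Pack boundary rather than a chance occurrence inside packet data.
    """
    offsets = []
    pos = data.find(PACK_START_CODE)
    while pos != -1:
        offsets.append(pos)
        # Resume the search just past the start code that was matched.
        pos = data.find(PACK_START_CODE, pos + len(PACK_START_CODE))
    return offsets
```

Called on the contents of a multiplexed stream, the returned offsets give the candidate positions at which each Pack begins.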
Video or audio data is divided into Packets without regard to lower-level structures -- Groups, Pictures, etc. may break across Packet boundaries.
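The following sketch shows one way to step over a single Packet. It assumes the usual layout of a three-byte start code prefix (0x000001), a one-byte stream ID, and a two-byte big-endian length that counts the bytes following the length field. The function name read_packet and the labels "audio", "video" and "other" are hypothetical. Note that the packet body begins with optional stuffing, buffer and time stamp fields before the elementary stream data, and this sketch does not decode them.

```python
import struct

START_CODE_PREFIX = b"\x00\x00\x01"


def read_packet(data: bytes, offset: int):
    """Return (stream_id, kind, body_start, next_offset) for the Packet at offset.

    body_start is the first byte after the length field; next_offset is
    where the following start code is expected.
    """
    if offset + 6 > len(data):
        raise ValueError("truncated packet header")
    if data[offset:offset + 3] != START_CODE_PREFIX:
        raise ValueError("no start code prefix at this offset")
    stream_id = data[offset + 3]
    if stream_id < 0xBC:
        # Lower values are pack, system header, end, or video-layer codes.
        raise ValueError("start code does not introduce a Packet")
    (packet_length,) = struct.unpack_from(">H", data, offset + 4)
    body_start = offset + 6
    if 0xC0 <= stream_id <= 0xDF:
        kind = "audio"
    elif 0xE0 <= stream_id <= 0xEF:
        kind = "video"
    else:
        kind = "other"  # private, padding, and reserved streams
    return stream_id, kind, body_start, body_start + packet_length
```

Because Groups and Pictures may straddle Packets, a demultiplexer would typically append the elementary stream bytes of successive Packets with the same stream ID before looking for the lower-level structures.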
Audio information is composed of Frames. We will not discuss the substructure of Frames. There are no higher-level audio structures.