The tapes containing this audio were digitized in a multi-year effort funded by the National Science Foundation under the guidance of Prof. John Hansen at UT Dallas. The original tranche of transfers were supplied to the Apollo in Real time team in 2017 along with 7,000 hours of Apollo 13 audio.
The Soundscriber machine that played back each tape for digitization is the only remaining machine of its kind in the world. It was built around the time of the Apollo missions and even though it was refurbished prior to digitization, it still introduced severe distortions to the audio during the playback process and subsequent digital files.
These artifacts, known as "wow" (a long term, random drift that makes the ~16 hour long tapes play back up to an hour longer than originally recorded) and "flutter" (a short term drift that makes the sounds warble in pitch about three times a second) were corrected using a custom algorithm developed by Apollo in Real Time team member, Jeremy Cooper. The resulting digital files are a near-perfect representation of what was originally recorded on the tapes at the time of the mission. Since they contain no time variation and contain an original timing signal in a standard format (known as an IRIG-B), the corrected files as a whole are the primary source material that the team relies upon when publishing the exact timing of mission events, including the synchronization of video sources in the timeline. They are indisputably historically accurate.
Unlike the main space-to-ground transcript (left), which is based upon the original mission transcript that was manually typewritten in 1969, the transcripts provided here have been generated using the "large-v3" Whisper model.
The processing of the 11,000 hours of Mission Control audio required over 1200 hours of computer time. The resulting transcripts contain 4,803,618 utterances, consisting of 42,684,618 words, totaling 224MB of text. Download the transcripts for all mission control channels in a single file here (version 2024-10-25).
As valuable as these results are, they remain imperfect. The Apollo in Real Time team will not endeavor to manually correct these generated transcripts. Instead, as automated transcription technology improves over time, the audio will be re-processed, replacing these transcripts.
The Apollo in Real Time team is grateful to the OpenAI team for the development and open-source release of Whisper.