21 December 2017
Extract Spatial Audio from Ricoh Theta V
The most recent Ricoh Theta V camera supports spatial audio recording. To examine the spatial audio content, here are the steps. A nice byproduct of this process is a youtube-ready panorama video.
- Download all three software packages listed on Theta V official website – we will need all of them: https://theta360.com/en/support/download/
- Basic app.
- File transfer app.
- Movie converter app.
- Record a 360 video with your Theta V. By default, Theta V with the latest firmware records spatial audio, without any accessory. Although an external microphone can improve the audio quality, I only use the camera itself without the external mic.
- Get the video to your computer.
- Power on Theta V.
- Connect Theta V to your computer using microUSB cable.
- Transfer the raw video using File transfer app. Let’s call it raw.mp4.
- Drag and drop raw.mp4 onto the Basic Theta app. Uncheck the [top/bottom correction] box. Start export. Let’s call this corrected.mp4.
- Drag and drop corrected.mp4 onto the Movie converter app. A mov file will be generated with spatial audio content. Since this file is ready to be uploaded to youtube, we call this file youtube.mov.
- Extract spatial audio
ffmpeg -i youtube.mov -vn spatial_audio.wav
-vn option is to disable video output.
- spatial_audio.wav should have 4 channels, following youtube ambisonic standard, i.e. ACN, SN3D, WXYZ.
- To verify, you can drag spatial_audio.wav into Audacity to see that it indeed has 4 channels.
[caption id=”attachment_287” align=”aligncenter” width=”713”] The Oculus - World Trade Center PATH Station[/caption]
PS: Theta V’s video quality is way better than its Theta S little brother.