Studio 3

By Thom Johansen (thomj@notam.no) and Niklas Adam (na@notam.no)15 minutes read



Figure 1.

alt text

Overview of Notam's studio 3

This document contains an overview of the setup and use of Notam's studio 3 (studio for spatial audio). It is composed of two parts, with the first detailing the hardware and the second containing a quick first use setup guide.

Technical overview

The loudspeaker setup is a slightly irregular hemisphere consisting of 24 loudspeakers in three rings. The lower ring of 12 speakers is placed in the horizontal plane at head height. The two additional rings placed further up consists of 8 and 4 loudspeakers respectively. In addition there is currently one subwoofer. The measured loudspeaker positions can be found in various formats in the ###Coordinates section. It is highly recommended that you utilize these if your software supports it because of the irregular distances to the top four speakers.

The loudspeaker type is KEF LS-50. The subwoofer (Genelec 7070A) is intended to provide the majority of the low frequency content, so the bass has been rolled off somewhat in the loudspeakers.

The sweet spot in the middle of these loudspeakers is at head height above the markings on the floor.

Controls

The controls you'll find on the desk are quite simple:

Audio connectivity

On the table you will find an RME Madiface USB. This connects to your Windows or Mac computer via USB. When using this, your computer's output channels 1-24 will be sent to the loudspeakers (see diagram on last page for channel numbering). Output 25 is sent to the subwoofer.

The Madiface USB is supposed to be set up as clock master to allow you to select the sample rate yourself, but thanks to limitations in the hardware, the only easily supported sample rates are 44.1 kHz and 48 kHz. If you have a good reason for requiring either 88.2 kHz or 96 kHz, tell the staff. 192 kHz is not supported.

The audio connections in the studio are MADI based, which makes it non-trivial to connect other hardware to the system. We do however have solutions in place, so feel free to ask. As a quick summary, we can easily accomodate seven analogue line/mic inputs (via the patch panel in the front of the room) and MADI.

Gear list

Here follows the list of gear making up the signal chain in studio 3 for those who are curious.

First time use

Setting up

  1. Set the mute switch to one of the lower positions and the volume controller to a very low level (counter-clockwise).

  2. Install the RME Madiface USB drivers at the RME download page.

  • feel free to do this in advance if you are comfortable doing it. If in doubt ask the staff.
  • Of the two available drivers for Apple computers, the Driverkit one is recommended. Once the driver is properly installed and you've rebooted, the Totalmix FX window should pop up.
  1. Some caveats involved with the driver install and Totalmix:
    • If Totalmix shows up with a dialog window saying "detected", featuring "yes" and "no" buttons, press "yes" until it goes away.
  1. In the File menu of Totalmix, press "Load Workspace" and load the totalmix workspace file located here. Click "yes" if it asks you whether to discard all changes.

  2. After switching the mute switch to the top position you should now be good to go, and all the loudspeakers in the studio will be available from your software of choice.

If you have any questions about the Madiface USB setup, please check the troubleshooting section.

  1. Notam's decoder files for usage with the IEM SimpleDecoder can be found here

Reaper template

Here you will find a Reaper template for ambisonics work with the IEM Plug-in Suite.

Troubleshooting

When troubleshooting, please remember to set the studio volume knob at a low level and mute all speakers with the mute switch whenever you are doing something drastic like restarting equipment or connecting cables. There might be no sound when you are troubleshooting, but when you find the problem or manage to properly insert that last cable you forgot, you might be surprised at how loud you were really playing...

Speaker placement and channel order drawing

overview Figure 2.


Coordinates

Loudspeaker coordinates as of 29.11.2024

AED format:

  • Angle, Elevation, Distance
  • IRCAM Spat Compatible
  • Positive azimuths CLOCKWISE
  • 0 degree azimuth is forwards (front speaker)
0 0 2
30 0 2
60 0 2
90 0 2
120 0 2
150 0 2
180 0 2
-150 0 2
-120 0 2
-90 0 2
-60 0 2
-30 0 2
22.5 29.2 1.95
67.5 28.7 1.936
112.5 28.8 1.938
157.5 28 1.918
-157.5 28.8 1.931
-112.5 29 1.937
-67.5 28.8 1.939
-22.5 28.5 1.939
45 61.6 1.523
135 59.8 1.538
-135 60 1.545
-45 59.8 1.538

json AERC format

  • Azimuth, Elevation, Radius, Channel
  • Negative azimuths is CLOCKWISE
  • 0 degree azimuth is forwards (front speaker)

Created via jq on Notam's decoder file found here

jq '.LoudspeakerLayout.Loudspeakers \
| sort_by(.Channel) \
| map(del(.IsImaginary, .Gain))' \
s3_iem_decoder_config_2024.json

[
  {
    "Azimuth": 0.0,
    "Elevation": 0.0,
    "Radius": 2.0,
    "Channel": 1
  },
  {
    "Azimuth": -30.0,
    "Elevation": 0.0,
    "Radius": 2.0,
    "Channel": 2
  },
  {
    "Azimuth": -60.0,
    "Elevation": 0.0,
    "Radius": 2.0,
    "Channel": 3
  },
  {
    "Azimuth": -90.0,
    "Elevation": 0.0,
    "Radius": 2.0,
    "Channel": 4
  },
  {
    "Azimuth": -120.0,
    "Elevation": 0.0,
    "Radius": 2.0,
    "Channel": 5
  },
  {
    "Azimuth": -150.0,
    "Elevation": 0.0,
    "Radius": 2.0,
    "Channel": 6
  },
  {
    "Azimuth": -180.0,
    "Elevation": 0.0,
    "Radius": 2.0,
    "Channel": 7
  },
  {
    "Azimuth": 150.0,
    "Elevation": 0.0,
    "Radius": 2.0,
    "Channel": 8
  },
  {
    "Azimuth": 120.0,
    "Elevation": 0.0,
    "Radius": 2.0,
    "Channel": 9
  },
  {
    "Azimuth": 90.0,
    "Elevation": 0.0,
    "Radius": 2.0,
    "Channel": 10
  },
  {
    "Azimuth": 60.0,
    "Elevation": 0.0,
    "Radius": 2.0,
    "Channel": 11
  },
  {
    "Azimuth": 30.0,
    "Elevation": 0.0,
    "Radius": 2.0,
    "Channel": 12
  },
  {
    "Azimuth": -22.5,
    "Elevation": 29.20000076293945,
    "Radius": 1.950000047683716,
    "Channel": 13
  },
  {
    "Azimuth": -67.5,
    "Elevation": 28.70000076293945,
    "Radius": 1.935999989509583,
    "Channel": 14
  },
  {
    "Azimuth": -112.5,
    "Elevation": 28.79999923706055,
    "Radius": 1.937999963760376,
    "Channel": 15
  },
  {
    "Azimuth": -157.5,
    "Elevation": 28.0,
    "Radius": 1.917999982833862,
    "Channel": 16
  },
  {
    "Azimuth": 157.5,
    "Elevation": 28.79999923706055,
    "Radius": 1.930999994277954,
    "Channel": 17
  },
  {
    "Azimuth": 112.5,
    "Elevation": 29.0,
    "Radius": 1.937000036239624,
    "Channel": 18
  },
  {
    "Azimuth": 67.5,
    "Elevation": 28.79999923706055,
    "Radius": 1.939000010490417,
    "Channel": 19
  },
  {
    "Azimuth": 22.5,
    "Elevation": 28.5,
    "Radius": 1.939000010490417,
    "Channel": 20
  },
  {
    "Azimuth": -45.0,
    "Elevation": 61.20000076293945,
    "Radius": 1.523000001907349,
    "Channel": 21
  },
  {
    "Azimuth": -135.0,
    "Elevation": 59.79999923706055,
    "Radius": 1.537999987602234,
    "Channel": 22
  },
  {
    "Azimuth": 135.0,
    "Elevation": 60.0,
    "Radius": 1.544999957084656,
    "Channel": 23
  },
  {
    "Azimuth": 45.0,
    "Elevation": 59.79999923706055,
    "Radius": 1.537999987602234,
    "Channel": 24
  }
]

Decoder

IEM AllRAD

Notam's decoders made with the AllRAD approach for usage with the SimpleDecoder:

Use the same json file with the DistanceCompensator. Dont forget to enable the 25th channel for the sub. And same with the SimpleDecoder. Set the sub channel to discrete and choose channel 25.

Weights

These are two different decoder optimization strategies (also called "shelf filters" or "weighting schemes") used when decoding Ambisonics to loudspeaker arrays. They address a fundamental trade-off between spatial accuracy and perceptual quality.


A basic ("sampling") Ambisonics decoder reconstructs the sound field accurately in theory, but our ears don't perceive it that way in practice. Higher-order components contribute energy that can cause tonal coloration, diffuse imaging, or even unstable/phasey perception — especially with limited loudspeaker arrays. Weightings are applied to the Ambisonic orders to compensate.


MaxRE (Maximum Energy Vector)

MaxRE maximizes the energy vector (rE), which is a measure of the direction from which the listener perceives the majority of sound energy. It's derived from psychoacoustics — specifically, the idea that at high frequencies, the auditory system localizes based on energy rather than pressure.

Think of it as: optimized for where the sound seems to come from.


InPhase

InPhase ensures that all loudspeakers in the array reproduce signals that are in phase with each other for a source at any direction. This prevents cancellation artifacts and phase conflicts between speakers.

Think of it as: optimized for stability and cohesion of the sound image.


Side-by-Side Comparison

PropertyBasic (no weighting)MaxREInPhase
GoalAccurate wave fieldMax energy vectorAll speakers in phase
High-freq localizationPoorBestModerate
Image stabilityUnstableGoodBest
Source widthNarrow but phaseyModerateWidest
Order attenuationNoneModerateHeaviest
Negative speaker gains?YesSometimesNever
Best forAnalysisPlayback (HF)Large venues / LF

Usage

In practice, a well-designed Ambisonics decoder uses both, applied in different frequency bands via shelf filters:

This is the basis of the dual-band or "psychoacoustic" decoder described by Gerzon and later formalized in tools like the AllRAD and EPAD decoder designs.

Order Overview (0th – 7th)

In 3D (full-sphere) Ambisonics, each order N adds (2N+1) new spherical harmonic components. The total number of channels for a complete N th-order system is (N+1)².

Order Table

Order (N)New Components (2N+1)Total Channels (N+1)²ACN Channel IndicesCommon LabelSpatial Detail
0110WOmnidirectional — no directional information
1341–3Y, Z, XFront/back, up/down, left/right — basic directionality
2594–8V, T, R, S, UQuadrupole patterns — moderate spatial detail
37169–15Q, O, M, K, L, N, POctupole patterns — good localization, typical VR/spatial audio target
492516–24Fine directional resolution — high-quality immersive audio
5113625–35Very fine resolution — approaching perceptual limits for most listeners
6134936–48Extremely fine resolution — research / high-end reproduction systems
7156449–63Near-theoretical maximum perceptual resolution for human hearing

Channel Labels (FuMa Notation)

The letters come from FuMa (Furse-Malham) notation, the older naming convention for Ambisonic channels. Each letter maps to a specific spherical harmonic shape — essentially a 3D pattern describing which directions that component is sensitive to.


Order 0

LabelDescription
WThe omnidirectional component. Picks up sound equally from all directions — like a perfect omni microphone. It's the "base" of the entire sound field.

Order 1

LabelAxisDescription
XFront/BackFigure-of-eight pattern pointing forward and backward
YLeft/RightFigure-of-eight pattern pointing left and right
ZUp/DownFigure-of-eight pattern pointing up and down

These three together with W give you a complete first-order B-format signal — the classic output of a Soundfield microphone, and the foundation of all Ambisonics.


Order 2

LabelDescription
RVertical "axial" component — a zonal harmonic aligned to the Z axis
SDiagonal pattern in the XZ plane
TDiagonal pattern in the YZ plane
UHorizontal "saddle" pattern — differentiates front/back from left/right in the XY plane
VDiagonal pattern in the XY plane, rotated 45° from U

Order 3

LabelDescription
KZonal harmonic along Z axis (vertical, narrow lobe)
LTesseral pattern in the XZ plane
MTesseral pattern in the YZ plane
NSectoral pattern mixing X and Y axes
OSectoral pattern mixing X and Y axes, rotated
PDiagonal pattern in the XZ plane
QDiagonal pattern in the YZ plane

Useful intuition

The letters themselves are somewhat arbitrary — they're just shorthand tags assigned by Furse and Malham in the 1990s. What actually matters is the spherical harmonic function each one represents, which determines its polar pattern in 3D space.

A useful intuition is that as you go up in order:

Each new set of patterns fills in finer and finer angular detail, which is why higher orders give better spatial resolution.

Modern systems (AmbiX / ACN+SN3D) dropped the letter names entirely above order 1 and just use the numeric (N, M) degree/order index, which scales cleanly to any order without needing to invent new letters.

Notes

Recommendations

Many workflows are possible in our Spatial Audio Studio.
The staff is mostly using Reaper with the IEM Plug-in Suite and SuperCollider with ATK.

In general we work with Ambisonics and VBAP. But if needed Dolby Atmos is possible. ^^.