Vinyl info and calculators

This is just a collection of information about turntables and vinyl for anyone wanting to dig deeper into It (which might mean that it’s just for me…). I’ll keep adding to this (and completing it) as time goes by.

Glossary

Cantilever

  • The rod or arm that connects the stylus on one end to the “motor” on the other.

Effective Length

  • The straight-line distance between the pivot point of the tonearm and the top of the stylus

Equivalent Mass

  • definition to come

Flutter

  • Higher-frequency modulation of the audio frequency caused by changes in the groove speed. These may be the result of changes in problems such as unstable motor speed, variable compliance on a belt, issues with a spindle bearing, drive wheel eccentricity, and other issues.
  • Flutter describes a modulation in the groove speed ranging from 6 to 100 times a second (6 Hz to 100 Hz).

Frequency Drift

  • Very long-term (or low-frequency) changes in the audio frequency, typically caused by slow changes in the platter rotation speed.
  • Typically, changes with a modulation frequency of less than 0.5 Hz (a period of no less than 2 seconds) are considered to be frequency drift. Faster changes are labelled “Wow”

Groove

  • The v-shaped track pressed into the surface of the vinyl record, in which the stylus sits

Linear Tracking

  • A tonearm that moves linearly, following a path that is parallel to the radius line traced by the stylus. This (in theory) ensures that the tracking error is always 0º, however, in practice this error is merely small.

Modulation Width

  • The distance measured on a line through the spindle from the start of the modulated groove to the end of the modulated groove. This is approximately 3″ or 76 mm.

Mounting Distance

  • The distance between the spindle and the pivot point of the tonearm.

Needle

  • Also known as the stylus. The point that is placed in the groove of the vinyl record.
    Some persons distinguish between the “stylus” (to indicate the chisel on the mastering lathe that creates the groove in the master record), and the “needle” (to indicate the portion of the pickup on a turntable that plays the signal).

Null Radius

  • The radius (distance between the spindle and the stylus) where the tracking error is 0º. A typically-designed and correctly installed radial tracking tonearm has two null radii (see this posting).

Offset Angle

  • The angle between the axis of the stylus and a line drawn between the tonearm pivot and the stylus. See the line diagram below.

Overhang

  • The difference between the Effective Length of the tonearm and the Mounting Distance. This value is used in some equations for calculating the Tracking Error.

Pickup, Electromagnetic

  • Includes three general types: Moving Coil, Moving Magnet, and Variable Reluctance (aka Moving Iron). These produce an output proportional to the velocity of the stylus movement.

Pickup, Piezoelectic

  • Produces an output proportional to the displacement of the stylus.

Pitch

  • The density of the groove count per distance in lines per inch or lines per mm. The pitch can vary from disc to disc, or even within a single track, according to the requirements of the mastering.

Radial Tracking

  • A tonearm that rotates on a pivot point with the stylus tracing a circular path around that pivot.

Radius

  • The distance between the centre of the vinyl disc and the pickup stylus.

RIAA

  • A pre-emphasis / de-emphasis filter designed to fill two functions.
    • The first is a high-frequency attenuation de-emphasis that reduces the playback system’s sensitivity to surface noise. This requires a reciprocal high-frequency pre-emphasis boost.
    • The second is a low-frequency attenuation pre-emphasis that maintains a constant modulation amplitude at lower frequencies to avoid over-excursion of the playback stylus. This requires a reciprocal low-frequency de-emphasis boost.
  • The first of the two plots below, show the theoretical (black lines) and typical (red) response of the pre-emphasis filter. The second of the two plots shows the de-emphasis filter response.

Side Thrust

  • definition to come

Skating Force

  • definition to come

Spindle

  • The centre of the platter around which the record rotates

Stylus

  • Also known as the needle. The point that is placed in the groove of the vinyl record.
    Some persons distinguish between the “stylus” (to indicate the chisel on the mastering lathe that creates the groove in the master record), and the “needle” (to indicate the portion of the pickup on a turntable that plays the signal).

Stylus, Bonded vs. Nude

  • Although the tip of the stylus is typically made of diamond today, in lower-cost units, that diamond tip is mounted or bonded to a metal pin (typically steel, aluminium, or titanium) which is, in turn, connected to the cantilever (the long “arm” that connects back to the cartridge housing). This bonded design is cheaper to manufacture, but it results in a high mass at the stylus tip, which means that it will not move easily at high frequencies.
  • In order to reduce mass, the metal pin is eliminated, and the entire stylus is made of diamond instead. This makes things more costly, but reduces the mass dramatically, so it is preferred if the goal is higher sound performance. This design is known as a nude stylus.

Tracking Error

  • The angle between the tangent to the groove and the alignment of the stylus. In a perfect system, the stylus would align with the tangent to the groove at all radii (distances from the spindle), since this matches the angular rotation of the cutting head when the master was made on a lathe. A linear tracking arm minimises this error. A radial tracking arm can be designed to have two radii with no tracking error (each called a “Null Radius”) but will have some measurable tracking error at all other locations on the disk.
  • One side-effect of tracking error is distortion of the audio signal, typically calculated and expressed as a 2nd-harmonic distortion on a sinusoidal audio signal. However, higher order distortion and intermodulation artefacts also exist.

Warp Wow

  • A modulation of the frequency of the audio signal caused by vertical changes in the vinyl surface (a warped record). This typically happens at a lower frequency, which is why it is “warp wow” and not “warp flutter”.

Wow

  • Low-frequency modulation of the audio frequency caused by changes in the groove speed. These may be the result of changes in problems such as rotation speed of the platter, discs with an incorrectly-placed centre hole, or vertical changes in the surface of the vinyl, and other issues.
  • Wow is a modulation in the groove speed ranging from once every 2 seconds to 6 times a second (0.5 Hz to 6 Hz). Note that, for a turntable, the rotational speed of the disc is within this range. (At 33 1/3 RPM: 1 revolution every 1.8 seconds is equal to approximately 0.556 Hz.)

Disk size limits

Outside starting diameter

  • 7″ discs
    • 6.78″, +0.06″, -0.00″
    • 172.2 mm, +1.524 mm, – 0.0 mm
  • 10″ discs
    • 9.72″, +0.06″, -0.00″
    • 246.9 mm, +1.524 mm, – 0.0 mm
  • 12″ discs
    • 11.72″, +0.06″, -0.00″
    • 297.7 mm, +1.524 mm, – 0.0 mm

Start of modulated pitch diameter

  • 7″ discs
    • 6.63″, +0.00″, -0.03″
    • 168.4 mm, +0.0 mm, – 0.762 mm
  • 10″ discs
    • 9.50″, +0.00″, -0.03″
    • 241.3 mm, +0.0 mm, – 0.762 mm
  • 12″ discs
    • 11.50″, +0.00″, -0.03″
    • 292.1 mm, +0.0 mm, – 0.762 mm

Minimum inside diameter

  • 7″ discs
    • 4.25″
    • 107.95 mm
  • 10″ discs
    • 4.75″
    • 120.65 mm
  • 12″ discs
    • 4.75″
    • 120.65 mm

Lockout Groove diameter

  • 7″ discs
    • 3.88″, +0.00, -0.08
    • 98.552 mm, +0.0 mm, -2.032 mm
  • 10″ discs
    • 4.19″, +0.00, -0.08
    • 106.426 mm, +0.0 mm, -0.762 mm
  • 12″ discs
    • 4.19″, +0.00, -0.08
    • 106.426 mm, +0.0 mm, -0.762 mm

Unmodulated (silent) groove width

  • 2 mil minimum, 4 mil maximum
  • 0.0508 mm minimum, 0.1016 mm maximum

Modulated groove depth

  • 1 mil minimum, 5 mil maximum
  • 0.0254 mm minimum, 0.127 mm maximum
  • The figure below shows the typical, minimum, and maximum groove depths, drawn to scale (with a 13 µm spherical stylus)

Signal levels

  • A typical standard reference level is a velocity of 35.4 mm/sec on one channel.
  • This means that a monophonic signal (identical signal in both channels) with that modulation will have a lateral (side-to-side) velocity of 50 mm/sec.

Calculators and Measurements

Conversion

  • 1 mil = 1 “thou” = 1/1000 inch
  • Lengthmm = Lengthmil * 127/5000

Revolutions per Second

  • RevolutionsPerSecond = RevolutionsPerMinute / 60
  • e.g.
    • 0.556 Rev/Sec @ 33 1/3 RPM
    • 0.75 Rev/Sec @ 45 RPM
    • 1.3 Rev/Sec @ 78 RPM

Seconds per revolution

  • SecondsPerRevolution = 60 / RevolutionsPerMinute
  • e.g.
    • 1.8 Sec/Rev @ 33 1/3 RPM
    • 1.333 Sec/Rev @ 45 RPM
    • 0.769 Sec/Rev @ 78 RPM

Modulation Width

  • MaximumModulationWidth = (StartOfModulatedPitch – MinimumInsideDiameter) / 2
  • e.g. for a 12″ disc
    • (292.1 mm – 120.65 mm) / 2 = 85.725 mm
    • Typically approximately 3″ = 76 mm

Pitch (assuming constant pitch)

  • (RunningTime * RPM) / ModulationWidth
  • e.g.
    • (20 minutes * 33.333 RPM) / 76 mm = 8.77 lines per mm
    • (20 minutes * 33.333 RPM) / 3″ = 222.22 lines per inch

Groove Width

  • GrooveWidthInMil = (1000 / PitchInLinesPerInch + 1) / 2
  • e.g.
    • (1000 / 222 LPI + 1) / 2 = 2.75 mil = 2.75 x 10-3 inches = 0.07 mm

Angular Frequency (of the audio)

  • abbreviated ω (unit: radians per second)
  • ω = 2 * π * FrequencyHz

Angular Speed of Rotation (of of the disk)

  • commonly abbreviated ωr (unit: radians per second)
  • ωr = 2 * π * RevolutionsPerSecond

Displacement Amplitude

  • DisplacementAmplitudePeak = Velocitypeak / ω
  • e.g.
    • 50 mm/sec / (2 * pi * 1000 Hz) = 0.008 mm (peak)

Groove Speed

  • 2 * π * Radius * RevolutionsPerSecond
  • e.g.
    • at the Start of modulated pitch on a 12″ disk turning at 33 1/3 RPM
      • 2 * π * (292.1 mm / 2) * (33.333 / 60) = 509.8 mm/sec
    • at the Minimum inside diameter on a 12″ disk turning at 33 1/3 RPM
      • 2 * π * (120.65 mm / 2) * (33.333 / 60) = 210.6 mm/sec
  • The plot below shows the groove speeds of 12″ 33 1/3 RPM and 7″ 45 RPM for all possible radii.

Wavelength

  • GrooveSpeed / Frequency
  • e.g.
    • 20 Hz at the Start of modulated pitch on a 12″ disk turning at 33 1/3 RPM
      • 509.8 / 20 = 25.5 mm
    • 20 kHz at the Start of modulated pitch on a 12″ disk turning at 33 1/3 RPM
      • 509.8 / 20000 = 0.0255 mm
    • The plot below shows wavelengths of 4 different frequencies for 12″ 33 1/3 RPM records (the longer curves) and 7″ 45 RPM records (the shorter curves)

Tracking Error

  • Tracking Error =
    OffsetAngle – asin ((EffectiveLength2 + Radius2 – MountingDistance2) / (2 * EffectiveLength * Radius))
  • see this posting for an explanation and example

Distortion caused by tracking error

  • Equation is for calculating percentage of second-harmonic distortion of a laterally-modulated monophonic sinusoidal audio signal
  • DistortionPercent = 100 * (PeakVelocity * tan(TrackingError)) / (GrooveSpeed)
  • see this posting for an explanation and examples

Wow and Flutter

  • Typically measured with a 3150 Hz sinusoidal tone, played from the vinyl surface
  • This signal is then de-modulated to determine its change over time. That modulation is then filtered through the response shown below which approximates human sensitivity to frequency modulation of an audio signal. More detailed information is given below
  • The AES6-2008 standard, which is the currently accepted method of measuring and expressing the wow and flutter specification, uses a “2σ” or “2-Sigma” method, which is a way of looking at the peak deviation to give a kind of “worst-case” scenario. In this method, the tone is played from a disc and captured for as long a time as is possible (or feasible). Firstly, the average value of the actual frequency of the output is found (in theory, it’s fixed at 3,150 Hz, but this is never true). Next, the short-term variation of the actual frequency over time is compared to the average, and weighted using the filter shown above. The result shows the instantaneous frequency variations over the length of the captured signal, relative to the average frequency (however, the effect of very slow and very fast changes have been reduced by the filter). Finally, the standard deviation of the variation from the average is calculated, and multiplied by 2 (hence “2-Sigma”, or “two times the standard deviation”), resulting in the value that is shown as the specification. The reason two standard deviations is chosen is that (in the typical case where the deviation has a Gaussian distribution) the actual Wow & Flutter value should exceed this value no more than 5% of the time.

References

All of these are available online. Some of them require you to purchase them (or be a member of an organisation).

  • “Tracking Angle in Phonograph Pickups”
    B. B. Bauer. Electronics magazine, March 1945
  • “Minimising Pickup Tracking Error”
    Dr. John D. Seagrave, Audiocraft Magazine, December 1956, January 1957, and August 1957
  • “Understanding Phono Cartridges”
    S.K. Pramanik, Audio magazine, March 1979
  • “Tonearm Geometry and Setup Demystified”
    Martin D. Kessler and B.V.Pisha, Audio magazine, January 1980
  • “Understanding Tonearms”
    S.K. Pramanik, Audio magazine, June 1980
  • “Analytic Treatment of Tracking Error and Notes on Optimal Pick-up Design”
    H.G.Baerwald, Journal of the Society of Motion Picture Engineers, December 1941
  • “Pickup Arm Design”
    J.K. Stevenson, Wireless World magazine, May 1966, and June 1966
  • “The Optimum Pivot Position on Tonearm”
    S. Takahashi et. al., Audio Engineering Society Preprint no. 1390 (61st Convention, November 1978)
  • “Audible Effects of Mechanical Resonances in Turntables”
    Brüel and Kjær Application Note (1977)
  • “Basic Disc Mastering”; “
    Larry Boden (1981)
  • “Cartridge / Arm / Turntable Followup: Loose Ends and New Developments”
    The Audio Critic, 1:43 (Spring/Fall, 1978)
  • “Have Tone Arm Designers Forgotten Their High-School Geometry?”
    The Audio Critic, 1:31 (Jan./Feb. 1977).
  • “How the Stereo Disc Works”
    Radio-Electronics, (July 1958)
  • “Manual of Analogue Sound Restoration Techniques”
    Peter Copeland (2008)
  • “On the Mechanics of Tonearms”
    Dick Pierce (2005)
  • “Reproduction of Sound in High-Fidelity and Stereo Phonographs”
    Edgar Villchur (1966)
  • Journal of the Audio Engineering Society (www.aes.org)
    • “Centennial Issue: The Phonograph and Sound Recording After One-Hundred Years”
      Vol. 25, No. 10/11 (Oct./Nov. 1977)
    • “Factors Affecting the Stylus / Groove Relationship in Phonograph Playback Systems”
      C.R. Bastiaans; Vol. 15 Issue 4 (Oct. 1967)
    • “Further Thoughts on Geometric Conditions in the Cutting and Playing of Stereo Disk”
      C.R. Bastiaans; Vol. 11 Issue 1 (Jan. 1963)
    • “Record Changers, Turntables, and Tone Arms-A Brief Technical History”
      James H. Kogen; Vol. 25 (Oct./Nov. 1977)
    • “Some Thoughts on Geometric Conditions in the Cutting and Playing of Stereodiscs and Their Influence on the Final Sound Picture”
      Ooms, Johan L., Bastiaans, C. R.; Vol. 7 Issue 3 (Jul. 1959)
    • “The High-Fidelity Phonograph Transducer”
      B.B. Bauer; Vol. 25 Issue 10/11 (Nov. 1977)
  • DIN Standards
    • 45 500: Hi-Fi Technics: Requirements for Disk Recording Reproducing Equipment
    • 45 507: Measuring Apparatus for Frequency Variations in Sound Recording Equipment
    • 45 538: Definitions for Disk Record Reproducing Equipment
    • 45 539: Disk Record Reproducing Equipment: Directives for Measurements, Markings, and Audio Frequency, Connections, Dimensions of Interchangeable Pickups, Requirements of Playback Amplifiers
    • 45 541: Frequency Test Record St 33 and M 33 (33 1/3 rev/min; Stereo and Mono)
    • 45 542: Distortion Test Record St 33 and St 45 (33 1/3 or 45 rev/min; Stereo)
    • 45 543: Frequency Response and Crosstalk Test Record
    • 45 544: Rumble Measurement Test Record St 33 and M 33 (33 1/3 rev/min; Stereo and Mono)
    • 45 545: Wow and Flutter Test Records, 33 1/3 and 45 rev/min
    • 45 546: Stereophonic Disk Record St 45 (45 rpm)
    • 45 547: Stereophonic Disk Record St 33 (33 1/3 rpm)
    • 45 548 Aptitude for Performance of Disk Record Reproducing Equipment
    • 45 549: Tracking Ability Test Record
  • IEC Publications
    • 98: Recommendations for Lateral-Cut Commercial and Transcription Disk Recordings
    • 98: Processed Disk Records and Reproducing Equipment
    • 386: Method of Measurement of Speed Fluctuations in Sound Recording and Reproducing Equipment

The Phono Gram

Late-night free-floating anxiety is not a modern phenomenon. One of my favourite descriptions of it is Dorothy Parker’s 1933 short story called “The Little Hours”, published in The New Yorker.

However, more than 30 years before this, The Phono Gram magazine published the following, which a Dr. J. Leonard Corning proposed to cure the problem of late-night melancholy with what we would, today, called a “pair of headphones” (although the design that he describes probably wouldn’t sell well to anyone who is not interested in S&M…) playing Wagnerian arpeggios and minor chords. (Personally, I just put the timer on my iPhone to turn off after 30 minutes, and turn on an old episode of “QI” or “8 Out of 10 Cats Does Countdown” – Sean Lock’s voice drowns out my own internal ones.)

I’ve removed a rather significant portion here…

Loud speaker

I live in Denmark where people speak Danish. One interesting word that I use every day is “højtaler” which is the Danish word for “loudspeaker”. I say that this word is “interesting” because, just like “loudspeaker” it is actually two words glued together. “Høj” means “high” or “loud” and “taler” means “talker” or “speaker” (as in “the person who is doing the talking”).

Sometimes, when I have a couple of minutes to spare, instead of looking at cat videos on YouTube, I sift through old audio and electronics magazines for fun. One really good source for these is the collection at worldradiohistory.com. (archive.org is also good!). Today I stumbled across the December, 1921 edition of Practical Electrics Magazine (which changed its name to The Experimenter in November of 1924 and then to Amazing Stories in April 1926*) It had a short description of a stage trick called the “Haunted Violin”, an excerpt from which is shown below.

The trick was that a violin, held by a woman walking around the aisle of the theatre would appear to play itself. In fact, as you can see above, there was an incognito violinist with a “detectophone” that was transmitting through wires connected to metal plates under the carpet in the theatre. The woman was wearing shoes with heels pointy enough to pierce the carpet and make contact with the plates. The heels were then connected with wires running through her dress to a “loud talker” hidden inside the violin.

Seems that, in 1921, it would have been easier to learn at least one word in Danish…

Side note: This is why, when I’m writing about audio systems I try as hard as possible to always use the word “loudspeaker” instead of “speaker”. To me, a “speaker” is a person giving a speech. A “loudspeaker” is a thing I complain about every day at work.

* That April 1926 edition of Amazing Stories had short stories by Jules Verne, H.G. Wells, and Edgar Allan Poe!

Post Script:
My wife reminded me that it’s the same in French: “haut-parleur”. It’s a reminder that the original loudspeakers were never intended for music, I suppose…

Tonearm tracking error and distortion

In the last posting, I reviewed the math for calculating the tracking error for a radial tonearm. The question associated with this is “who cares?”

In the March, 1945 issue of Electronics Magazine, Benjamin Bauer supplied the answer. An error in the tracking angle results in a distortion of the audio signal. (This was also discussed in a 3-part article by Dr. John D. Seagrave in Audiocraft Magazine in December 1956, January 1957, and August 1957)

If the signal is a sine wave, then the distortion is almost entirely 2nd-order (meaning that you get the sine wave fundamental, plus one octave above it). If the signal is not a sine wave, then things are more complicated, so I will not discuss this.

Let’s take a quick look at how the signal is distorted. An example of this is shown below.

In that plot, you can see that the actual output from the stylus with a tracking error (the black curve) precedes the theoretical output that’s actually on the vinyl surface (the red curve) when the signal is positive, and lags when it’s negative. An intuitive way of thinking of this to consider the tracking error as an angular rotation, so the stylus “reads” the signal in the groove at the wrong place. This is shown below, which is merely zooming in on the figure above.

Here, you can see that the rotation (tracking error) of the stylus is getting its output from the wrong place in the groove and therefore has the wrong output at any given moment. However, the amount by which it’s wrong is dependent not only on the tracking error but the amplitude of the signal. When the signal is at 0, then the error is also 0. This is not only the reason why the distortion creates a harmonic of the sine wave, but it also explains why (as we’ll see below) the level of distortion is dependent on the level of the signal.

This intuitive explanation is helpful, but life is unfortunately, more complicated. This is because (as we saw in the previous posting), the tracking error is not constant; it changes according to where the stylus is on the surface of the vinyl.

If you dig into Bauer’s article, you’ll find a bunch of equations to help you calculate how bad things get. There are some minor hurdles to overcome, however. Since he was writing in the USA in 1945, his reference was 78 RPM records and his examples are all in inches. However, if you spend some time, you can convert this to something more useful. Or, you could just trust me and use the information below.

In the case of a sinusoidal signal the level of the 2nd harmonic distortion (in percent) can be calculated with the following equation:

PercentDistortion = 100 * (ω Αpeak α) / (ωr r)

where

  • ω is 2 * pi * the audio frequency in Hz
  • Apeak is the peak amplitude of the modulation (the “height” of the groove) in mm
  • α is the tracking error in radians
  • ωr is the rotational speed of the record in radians per second, calculated using 2 * pi * (RPM / 60)
  • r is the radius of the groove; the distance from the centre spindle to the stylus in mm

Let’s invent a case where you have a constant tracking error of 1º, with a rotational speed of 33 1/3 RPM, and a frequency of 1 kHz. Even though the tracking error remains constant, the signal’s distortion will change as the needle moves across the surface of the record because the wavelength of the signal on the vinyl surface changes (the rotational speed is the same, but the circumference is bigger at the outside edge of the record than the inside edge). The amount of error increases as the wavelength gets smaller, so the distortion is worse as you get closer to the centre of the record. This can be see in the shapes of the curves in the plot below. (Remember that, as you play the record, the needle is moving from right to left in those plots.)

You can also see in those plots that the percentage of distortion changes significantly with the amplitude of the signal. In this case, I’ve calculated using three different modulation velocities. The middle plot is 35.4 mm / sec, which is a typical accepted standard reference level, which we’ll call 0 dB. The other two plots have modulation velocities of -3 dB (25 mm / sec) and + 3 dB (50 mm / sec).

Sidebar: If you want to calculate the Amplitude of the modulation

Apeak = (ModulationVelocity * sqrt(2)) / (2 * pi * FrequencyInHz)

Note that this simplifies the equation for calculating the distortion somewhat.

Also, if you need to convert radians to degrees, then you can multiply by 180/pi. (about 57.3)

Of course, unless you have a very badly-constructed linear tracking turntable, you will never have a constant tracking error. The tracking error of a radial tonearm is a little more complicated. Using the recommended values for the “well known tonearm” that I used in the last posting:

  • Effective Length (l) : 233.20 mm
  • Mounting Distance (d) : 215.50 mm
  • Offset angle (y) : 23.63º

and assuming that this was done perfectly, we get the following result for a 33 1/3 RPM album.

You can see here that the distortion drops to 0% when the tracking error is 0º, which (in this case) happens at two radii (distances between the centre spindle and the stylus).

If we do exactly the same calculation at 45 RPM, you’ll see that the distortion level drops (because the value of ωr increases), as shown below. (But good luck finding a 12″ 45 RPM record… I only have two in my collection, and one of those is a test record.)

Important notes:

Everything I’ve shown above is not to be used as proof of anything. It’s merely to provide some intuitive understanding of the relationship between radial tracking tonearms, tracking error, and the resulting distortion. There is one additional important reason why all this should be taken with a grain of salt. Remember that the math that I’ve given above is for 78 RPM records in 1945. This means that they were for laterally-modulated monophonic grooves; not modern two-channel stereophonic grooves. This means that the math above isn’t accurate for a modern turntable, since the tracking error will be 45º off-axis to the axis of modulation of the groove wall. This rotation can be built into the math as a modification applied to the variable α, however, I’m not going to complicate things further today…

In addition, the RIAA equalisation curve didn’t get standardised until 1954 (although other pre-emphasis curves were being used in the 1940s). Strictly speaking, the inclusion of a pre-emphasis curve doesn’t really affect the math above, however, in real life, this equalisation makes it a little more complicated to find out what the modulation velocity (and therefore the amplitude) of the signal is, since it adds a frequency-dependent scaling factor on things. On the down-side, RIAA pre-emphasis will increase the modulation velocity of the signal on the vinyl, resulting in an increase in the distortion effects caused by tracking error. On the up-side, the RIAA de-emphasis filtering is applied not only to the fundamentals, but the distortion components as well, so the higher the order of the unwanted harmonics, the more they’ll be attenuated by the RIAA filtering. How much these two effects negate each other could be the subject of a future posting; if I can wrap my own head around the problem…

One extra comment for the truly geeky:

You may be looking at the last two plots above and being confused in the same way that I was when I made them the first time. If you look at the equation, you can see that the PercentDistortion is related to α: the tracking error. However, if you look at the plots, you’ll see that I’ve shown it as being related to | α |: the absolute value of the tracking error instead. This took me a while to deal with, since my first versions of the plots were showing a negative value for the distortion. “How can a negative tracking result in distortion being removed?” I asked myself. The answer is that it doesn’t. When the tracking error is negative, then the angle shown in the second figure above rotates counter-clockwise to the left of the vertical line. In this case, then the output of the stylus lags for positive values and precedes for negative values (opposite to the example I gave above), meaning that the 2nd-order harmonic flips in polarity. SINCE you cannot compare the phase of two sine tones that do not have the same frequency, and SINCE (for these small levels of distortion) it’ll sound the same regardless of the polarity of the 2nd-order harmonic, and SINCE (in real-life) we don’t listen to sine tones so we get higher-order THN and IMD artefacts, not just a frequency doubling, THEN I chose to simplify things and use the absolute value.
Post Script to the comment for geeks: This conclusion was confirmed by J.K. Stevenson’s article called “Pickup Arm Design” in the May, 1966 edition of Wireless World where he states “The sign of φ (positive or negative) is ignored as it has no effect on the distortion.” (He uses φ to denote the tracking error angle.)

Penultimate Post Script:

J.K. Stevenson’s article gives an alternative way of calculating the 2nd order harmonic distortion that gives the same results. However, if you are like me, then you think in modulation velocity instead of amplitude, so it’s easier to not convert on the way through. This version of the equation is

PercentDistortion = 100 * (Vpeak tan(α)) / (μ)

where

  • Vpeak is the peak modulation velocity in mm/sec
  • α is the tracking error in radians
  • μ is the groove speed of the record in mm/sec calculated using 2*pi*(rpm/60)*r
  • r is the radius of the groove; the distance from the centre spindle to the stylus in mm

Final Post-Script:

I’ve given this a lot of thought over the past couple of days and I’m pretty convinced that, since the tracking error is a rotation angle on an axis that is 45º away from the axis of modulation of the stylus (unlike the assumption that we’re dealing with a monophonic laterally-modulated groove in all of the above math), then, to find the distortion for a single channel of a stereophonic groove, you should multiply the results above by cos(45º) or 1/sqrt(2) or 0.707 – whichever you prefer. If you are convinced that this was the wrong thing to do, and you can convince me that you’re right, I’ll be happy to change it to something else.

Tonearm alignment and tracking error

The June 1980 issue of Audio Magazine contains an article written by Subir K. Pramanik called “Understanding Tonearms”. This is a must-read tutorial for anyone who is interested in the design and behaviour of radial tonearms.

One of the things Pram talked about in that article concerned the already well-known relationship between tonearm geometry, its mounting position on the turntable, and the tracking error (the angular difference between the tangent to the groove and the cantilever axis – or the rotation of the stylus with respect to the groove). Since the tracking error is partly responsible for distortion of the audio signal, the goal is to minimise it as much as possible. However, without a linear-tracking system (or an infinitely long tonearm), it’s impossible to have a tracking error of 0º across the entire surface of a vinyl record.

One thing that is mentioned in the article is that “Small errors in the mounting distance from the centre of the platter … can make comparatively large differences in angular error” So I thought that I’d do a little math to find out this relationship.

The article contains the diagram shown below, showing the information required to do the calculations we’re interested in. In a high-end turntable, the Mounting Distance (d) can be varied, since the location of the tonearm’s bearing (the location of the pivot point) is adjustable, as can be seen in the photo above of an SME tonearm on a Micro Seiki turntable.

The tonearm’s Effective Length (l) and Offset Angle (y) are decided by the manufacturer (assuming that the pickup cartridge is mounted correctly). The Minimum and Maximum groove radius are set by international standards (I’ve rounded these to 60 mm and 149 mm respectively). The Radius (r) is the distance from the centre of the LP (the spindle) to the stylus at any given moment when playing the record.

In a perfect world, the tracking error would be 0º at all locations on the record (for all values of r from the Maximum to the Minimum groove radii) which would make the cantilever align with the tangent to the groove. However, since the tonearm rotates around the bearing, the tracking error is actually the angle x (in the diagram above) subtracted from the offset angle. “X” can be calculated using the equation:

x = asin ((l2 + r2 – d2) / (2 l r))

So the tracking error is

Tracking Error = y – asin ((l2 + r2 – d2) / (2 l r))

Just as one example, I used the dimensions of a well-known tonearm as follows:

  • Effective Length (l) : 233.20 mm
  • Mounting Distance (d) : 215.50 mm
  • Offset angle (y) : 23.63º

Then the question is, if I make an error in the Mounting Distance, what is the effect on the Tracking Error? The result is below.

If we take the manufacturer’s recommendation of d = 215.4 mm as the reference, and then look at the change in that Tracking Error by mounting the bearing at the incorrect distance in increments of 0.2 mm, then we get the plot below.

So, as you can see there, a 0.2 mm error in the location of the tonearm bearing (which, in my opinion, is a very small error…) results in a tracking error difference of about 0.2º at the minimum groove radius.

If I increase the error to increments of 1 mm (± 5mm) then we get similar plots, but with correspondingly increased tracking error.

If you go back and take a look at the equation above, you can see that the change in the tracking error is constant with the Offset Angle (unlike its relationship with an error in the location of the tonearm bearing, which results in a tracking error that is NOT constant). This means that if you mount your pickup on the tonearm head shell with a slight error in its angle, then this angular error is added to the tracking error as a constant value, regardless of the location of the stylus on the surface of the vinyl, as shown below.

Phase vs Polarity

I know that language evolves. I know that a dictionary is a record of how we use words; not an arbiter of how words should be used. However, I also believe very firmly that if you don’t use words correctly, then you won’t be saying what you mean, and therefore you can be misconstrued.

One of the more common phrases that you’ll hear audio people use is “out of phase” when they mean “180º out of phase” or possibly even “opposite polarity”. I recently heard someone I work with say “out of phase” and I corrected them and said “you mean ‘opposite polarity'” and so a discussion began around the question of whether “180º out of phase” and “opposite polarity” can possibly result in two different things, or whether they’re interchangeable.

Let’s start by talking about what “phase” is. When you look at a sine wave, you’re essentially looking at a two-dimensional view of a three-dimensional shape. I’ve talked about this a lot in two other postings: this one and this one. However, the short form goes something like “Look at a coil spring from the side and it will look like a sine wave.” A coil is a two-dimensional circle that has been stretched in the third dimension so that when you rotate 360º, you wind up back where you started in the first two dimensions, but not the third. When you look at that coil from the side, the circular rotation (say, in degrees) looks like a change in height.

Figure 1
Figure 2

Notice in the two photos above how the rotation of the circle, when viewed from the side, looks only like a change in height related to the rotation in degrees.

Figure 3

The figure above is a classic representation of a sine wave with a peak amplitude of 1, and as you can see there, it’s essentially the same as the photo of the Slinky. In fact, you get used to seeing sine waves as springs-viewed-from-the-side if you force yourself to think of it that way.

Now let’s look at the same sine wave, but we’ll start at a different place in the rotation.

Figure 4

The figure above shows a sine wave whose rotation has been delayed by some number of degrees (22.5º, to be precisely accurate).

If I delay the start of the sine wave by 180 degrees instead, it looks like Figure 5..

Figure 5

However, if I take the sine wave and multiply each value by -1 (inverting the polarity) then it looks like this:

Figure 6

As you can probably see, the plots in Figure 5 and 6 are identical. Therefore, in the case of a sine wave, shifting the phase of the signal by 180 degrees has the same result at inverting the polarity.

What happens when you have a signal that is the sum of multiple sine waves? Let’s look at a simple example below.

Figure 7

The top plot above shows two sine waves, one with a frequency of three times the other, and with 1/3 the amplitude. If I add these two together, the result is the red curve in the lower plot. There are two ways to think of this addition: You can add each amplitude, degree by degree to get the red curve. You can also think of the slopes adding. At the 180º mark, the two downward-going slopes of the two sine waves cause the steeper slope in the red curve.

If we shift the phase of each of the two sine wave components, then the result looks like the plots below.

Figure 8

As you can see in the plots above, shifting the phases of the sine waves is the same as inverting their polarities, and so the resulting total sum (the red curve) is the same as if we had inverted the polarity of the previous total sum.

So, so far, we can conclude that shifting the phase by 180º gives the same result as inverting the polarity.

In the April, 1946 edition of Wireless World magazine, C.E. Cooper wrote an article called “Phase Relationships: ‘180 Degrees Out of Phase’ or ‘Reversed Polarity’?” (I’m not the first one to have this debate…) In this article, it’s states that there is a difference between “phase” and “polarity” with the example shown below.

Figure 9

There is a problem with the illustration in Figure 9, which is the fact that you cannot say that the middle plot has been shifted in phase by 180 degrees because that waveform doesn’t have a “phase”. If you decomposed it to its constituent sines/cosines and shifted each of those by 180º, then the result would look like (c) instead of (b). Instead, this signal has had a delay of 1/2 of a period applied to it – which is a different thing, since it’s delaying in time instead of shifting in phase.

However, there is a hint here of a correct answer… If we think of the black and blue sine waves in the 2-part plots above as sine waves with frequencies 1 Hz and 3 Hz, we can add another “sine wave” with a frequency of 0 Hz, or DC, as shown in Figure 10, below.

Figure 10

In the plot above, the top plot has a DC component (the blue line) that is added to the sine component (the black curve) resulting in a sine wave with a DC offset (the red curve).

If we invert the polarity of this signal, then the result is as shown in Figure 11.

Figure 11

However, if we delay the components by 180º, the result is different, as shown in Figure 12:

Figure 12

The hint from the 1946 article was the addition of a DC offset to the signal. If we think of that as a sine wave with a frequency of 0 Hz, then it can be “phase-shifted” by 180º which results in the same value instead of inverting polarity.

However, to be fair, most of the time, shifting the phase by 180º gives the same result as inverting the polarity. However, I still don’t like it when people say “flip the phase”…

Volume controls vs. Output levels

#92 in a series of articles about the technology behind Bang & Olufsen

One question people often ask about B&O loudspeakers is something like ”Why doesn’t the volume control work above 50%?”.

This is usually asked by someone using a small loudspeaker listening to pop music.

There are two reasons for this, related to the facts that there is such a wide range of capabilities in different Bang & Olufsen loudspeakers AND you can use them together in a surround or multiroom system. In other words for example, a Beolab 90 is capable of playing much, much more loudly than a Beolab 12; but they still have to play together.

Let’s use the example of a Beolab 90 and a Beolab 12, both playing in a surround configuration or a multiroom setup. In both cases, if the volume control is set to a low enough level, then these two types of loudspeakers should play at the same output level. This is true for quiet recordings (shown on the left in the figure below) and louder recordings (shown on the right).

However, if you turn up the volume control, you will reach an output level that exceeds the capability of the Beolab 12 for the loud song (but not for the quiet song), shown in the figure below. At this point, for the loud song, the Beolab 12 has already begun to protect itself.

Once a B&O loudspeaker starts protecting itself, no matter how much more you turn it up, it will turn itself down by the same amount; so it won’t get louder. If it did get louder, it would either distort the sound or stop working – or distort the sound and then stop working.

If you ONLY own Beolab 12s and you ONLY listen to loud songs (e.g. pop and rock) then you might ask “why should I be able to turn up the volume higher than this?”.

The first answer is “because you might also own Beolab 90s” which can go louder, as you can see in the right hand side of the figure above.

The second answer is that you might want to listen to quieter recording (like a violin solo or a podcast). In this case, you haven’t reached the maximum output of even the Beolab 12 yet, as you can see in the left hand side of the figure above. So, you should be able to increase the volume setting to make even the quiet recording reach the limits of the less-capable loudspeaker, as shown below.

Notice, however, that at this high volume setting, both the quiet recording and the loud recording have the same output level on the Beolab 12.

So, the volume allows you to push the output higher; either because you might also own more capable loudspeakers (maybe not today – but some day) OR because you’re playing a quiet recording and you want to hear it over the sound of the exhaust fan above your stove or the noise from your shower.

It’s also good to remember that the volume control isn’t an indicator of how loud the output should be. It’s an indicator of how much quieter or louder you’re making the input signal.

The volume control is more like how far down you’re pushing the accelerator in your car – not the indication of the speedometer. If you push down the accelerator 50% of the way, your actual speed is dependent on many things like what gear you’re in, whether you’re going uphill or downhill, and whether you’re towing a heavy trailer. Similarly Metallica at volume step 70 will be much louder than a solo violin recording at the same volume step, unless you are playing it through a loudspeaker that reached its maximum possible output at volume step 50, in which case the Metallica and the violin might be the same level.

Note 1: For all of the above, I’ve said “quiet song” and “loud song” or “quiet recording” and “loud recording” – but I could just have easily as said “quiet part of the song” and “loud part of the song”. The issue is not just related to mastering levels (the overall level of the recording) but the dynamic range (the “distance” between the quietest and the loudest moment of a recording).

Note 2: I’ve written a longer, more detailed explanation of this in Posting #81: Turn it down half-way.

Variations on the Goldberg Variations

As part of a listening session today, I put together a playlist to compare piano recordings. I decided that an interesting way to do this was to use the same piece of music, recorded by different artists on different instruments in different rooms by different engineers using different microphone and techniques. The only constant was the notes on the page in front of the performer.

A link to the playlist is here: LINK TO TIDAL

Playing through this, it’s interesting to pay attention to things like:

  • Overall level of the recording
    • Notice how much (typically) quieter the Dolby Atmos-encoded recording is than the 2.0 PCM encoded ones. However, there’s a large variation amongst the 2.0 recordings.
  • Monophonic vs. stereo recordings
  • Perceived width of the piano
  • Perceived width of the room
  • How enveloping the room is (this might be different from the perceived width, but these two attributes can be co-related, possibly even correlated)
  • Perceived distance to the piano.
    • On some of the recordings, the piano appears to be close. The attack of each note is quite fast, and there is not much reveberation.
    • On some of the recordings, the piano appears to be distant – more reveberant, with a soft, slow attack on each note.
    • On other recordings, it may appear that the piano is both near (because of the fast attack on each hammer-to-string strike) and far (because of the reverberation). (Probably achieved by using a combination of microphones at different distances – or using digital reverb…)
  • The length of the reverberation time
  • Whether the piano is presented as one instrument or a collection of strings (e.g. can you hear different directions to (or locations of) individual notes?)
  • If the piano is presented as a wide source with separation between bass and treble, is the presentation from the pianist’s perspective (bass on the left, treble on the right) or the audience’s perspective (bass on the left, treble on the right… sort of…)

32 is a lot of bits…

Once upon a time, I did a blog posting about why, when we test digital audio systems, we typically use a 997 Hz sine wave instead of a 1000 Hz tone.

The short version of this is the following:

Let’s say that I digitally create a (not-dithered) 1000 Hz sine wave at 0 dB FS in a 16-bit system running at 48 kHz. This means that every second, there are exactly 1000 cycles of the wave, and since there are 48,000 samples per second, this, in turn means that there is one cycle every 48 samples, so sample #49 is identical to sample #1.

So, we are only testing 48 of the possible 2^16 ( = 65,536) quantisation values, right?

Wrong. It’s worse than you think.

If we zoom in a little more, we can see that Sample #1 = 0 (because it’s a sine wave). Sample #25 is also equal to 0 (because 48,000 / 1,000 is a nice number that is divisible by 2).

Unfortunately, 48,000 / 1,000 is a nice number that is also divisible by 4. So what? This means that when the sine wave goes up from 0 to maximum, it hits exactly the same quantisation values as it does on the way from maximum back down to 0. For example, in the figure below, the values of the two samples shown in red are identical. This is true for all symmetrical points in the positive side and the negative side of the wave.

Jumping ahead, this means that, if we make a “perfect” 1 kHz sine wave at 48 kHz (regardless of how many bits in the system) we only test a total of 25 quantisation steps. 0, 12 positive steps, and 12 negative ones.

Not much of a test – we only hit 25 out of a possible 65,546 values in a 16-bit system (or 25 out of 16,777,216 possible values in a 24-bit system).

What if I wanted to make a signal that tested ALL possible quantisation values in an LPCM system? One way to do this is to simply make a linear ramp that goes from the lowest possible value up to the highest possible value, step by step, sample by sample. (of course, there are other ways, but it doesn’t matter… we’re just trying to hit every possible quantisation value…)

How long would it take to play that test signal?

First we convert the number of bits to the number of quantisation steps. This is done using the equation 2^bits. So, you get the following results

Number of BitsNumber of Quantisation Steps
1665,536
2416,777,216
324,294,967,296

If the value of each sample has a different quantisation value, and we play the file at the sampling rate then we can calculate the time it will take by dividing the number of quantisation steps by the sampling rate. This results in the following:

Sampling Rate (kHz)16 Bits24 Bits32 Bits
44.11.5 seconds6.4 minutes27.1 hours
481.4 seconds5.8 minutes24.9 hours
88.20.7 seconds3.2 minutes13.5 hours
960.7 seconds2.9 minutes12.4 hours
176.40.4 seconds1.6 minutes6.8 hours
1920.3 seconds1.5 minutes6.2 hours
352.80.2 seconds47.6 seconds3.4 hours
3840.2 seconds43.7 seconds3.1 hours
705.60.1 seconds23.8 seconds1.7 hours
7680.1 seconds21.8 seconds1.6 hours

So, the moral of the story is, if you’re testing the validity of a quantiser in a 32-bit fixed-point system, and you’re not able to do it off-line (meaning that you’re locked to a clock running at the correct sampling rate) you’d either (1) hope that it’s also a crazy-high sampling rate or (2) that you’re getting paid by the hour.

Why I am thinking about this?

I often get asked for my opinion about audio players; these days, network streamers especially, since they’re in style.

Let’s say, for example, that someone asked me to recommend a network streamer for use with their system. In order to recommend this, I need to measure it to make sure it behaves.

One of the tests I’m going to run is to ensure that every sample value on a file is accurately output from the device. Let’s also make it simple and say that the device has a digital output, and I only need to test 3 LPCM audio file formats (WAV, AIFF and FLAC – since those can be relied to give a bit-for-bit match from file to output). (We’ll also pretend that the digital output can support a 32-bit audio word…)

So, to run this test, I’m going to

  • create test files that I described above (checking every quantisation value at all three bit depths and all 10 sampling rates)
  • play them
  • record them
  • and then compare whether I have a bit-for-bit match from input (the original file) to the output

If you add up all the values in the table above for the 10 sampling rates and the three bit depths, then you get to a total of 4.2 DAYS of play time (playing audio constantly 24 hours a day) per file format.

So, say I wanted to test three file formats for all of the sampling rates and bit depths, then I’m looking at playing & recording 12.6 days of audio – and then I can start the analysis.

REALLY‽

Of course this is silly… I’m not going to test a 32-bit, 44.1 kHz file… In fact, if I don’t bother with the 32-bit values at all, then my time per file format drops from 4.2 days down to 23.7 minutes of play time, which is a lot more feasible, but less interesting if I’m getting paid by the hour.

However, it was fun to calculate – and it just goes to show how big a number 2^32 is…