Free field as a baseline for closed-back headphones

Before I begin this post, I would like to address the elephant in the room:

”Why not diffuse field?”

This is something I’ve been asking myself every time I EQ’d my headphones (Audio Technica ATH-M50x) to a particular target (Harman 2018 AE/OE, SoundGuys, etc.), because without failure there was always excess energy in the sibilance region, and as someone with somewhat mild hyperacusis, this drove me up the wall. Songs with bright mixes such as some Drake songs, for example, were practically unlistenable for me regardless of the volume I listened at. Quiet enough to tame the sibilance/harshness, not enough bass impact. Loud enough to get some oomph from the bass in the song, and the treble would be attacking my ears with daggers (perhaps a slight exaggeration).

Now, of course, as with everyone who develops somewhat of an obsessive dive into the headphone rabbit hole, I was undoubtedly questioning the validity of everything. Why was the KEMAR used so widely yet sounded terrible with every target I EQ’d to? How does the 5128 cost so much yet targets developed with it sound astoundingly subpar? Aren’t the Harman filters an astounding oversimplification of the complexity of hearing? These are all thoughts I’ve had, but as I’ve come to learn with my time endlessly tinkering with target curves, there is always plenty of room at the table for subjective experience and opinion.

One thing I tended to notice was that target curves rarely, if at all, specified if they were intended for or developed on open-back or closed-back headphones. This may be an irrelevant point, but it is what lead me to exploring the idea of potentially using the 5128’s free field as a baseline to forming a target as opposed to the tried and true diffuse field. Queue the head shakes of disapproval. I am aware that the free field has been tested in research, and produced a very convincing last place position. Regardless, I wanted to give it a try. Perhaps it could solve the gripes I had with treble, and maybe solve the issues I had with that pesky 5-10kHz region, after all, it looks like this:

And out of sheer coincidence, the free field (on paper) appeared to exactly solve this gripe! However, EQing my headphones to this response curve went about as expected. No bueno. Not enough bass, very mid forward, and the sound was very cramped overall. Not a great experience. But nevertheless, I could apply the 2018 Harman filters and get a good sound, right? Err, no. Now I get bloated bass, an airy yet dull treble, and just an awkward tuning overall.

Alright… well now I was stuck between a rock and a hard place. The DF with Harman filters sounded bad, and the FF with Harman filters also sounded bad. So, now what? Well, like anyone who slowly drives themselves insane by staring at a bajillion headphone measurement graphs, targets, EQs, threads, forums, listening to all kinds of wonky tunings and self-made targets with questionable methodologies, I turned to the ever-elusive ~ Personal HRTF ~… with the only caveat being I don’t have the money nor the knowledge on the subject to pursue this avenue (sigh… one day). Luckily, I found this post by @listener which helped provide some insight into speaker characteristics within a room, and how it may relate to the perception of a “flat” headphone (or rather, how the in-room behaviour of speakers can be utilised to make a headphone tuning sound more “correct”). More importantly, I found this image in the post quite intriguing:

This graph, which shows the difference between the measurement of headphones on the 5128 (green) vs. a group of people (blue), appears to outline the very issues I have with both the DF and FF curves on the 5128, regardless of filters applied. That being:

  • Not enough bass extension
  • Too much 5kHz - 10kHz / sibilance & harshness
  • Weird relationship between mids and treble

I also spent far too long racking my brain over how to use this data to modify a 5128 DF/FF. Do I add the extra treble and subtract the bass from the baseline target so that a person would hear the baseline? Or do I do the opposite, so that the target represents what a person hears? Honestly, I still can’t wrap my head around the logic, so I just tested both. Well, it goes without saying, I absolutely did not enjoy either the DF or FF with the extra treble and less bass, regardless of any extra preference shelf adjustments. With less treble and more bass however, things started to look promising. Both the DF and FF targets sound much more listenable with just this simple transform adjustment, albeit both still sounding a tad too bright for my liking. My preference also very much started to lean towards the free field in this regard, as the mids seemed to integrate much better with the treble.

I also tried to slap on the Harman 2018 filters onto both of these modified targets, and quickly discovered that for once in my life (I am a basshead), I did not enjoy the amount of bass. The overall signature of both became much too boomy and dark for even my tastes, as someone who tends to prefer some extra bottom end and a smooth, potentially warm signature. I then remembered that the post that I had got the population transform from was primarily about relating headphones to the behaviour of speakers in-room. Now, I will depart from what listener used for the high shelf, and I opted for a HS that is 4500Hz, -4dB, 0.4Q. This very closely resembles an almost linear downward slope from 1kHz to 20kHz, a behaviour I have measured from my unequalised speakers in my listening space, and one I have seen mentioned in several forums across the web as being the general in-room response that is preferred by mixing engineers and whatnot, though this may be a baseless claim. I also decided to tack on a -1.8dB peak filter at 3000Hz, just to tame some of the ear gain that I don’t necessarily need.

Overall, this leads me to my target for my (closed-back) headphones: A B&K 5128 free field, modified to represent a population’s in-ear measurement, and applying the natural timbre roll-off of speakers in a room:

“What happened to 8/9kHz!?!?”

Well, to me, at least, this curve sounds damn near perfect, or at least the very best tuning I have tried on my headphones. I have checked by going to several songs and videos which I had previously perceived as having harsh sound, whether through the mids or in the treble, and that harshness is completely eliminated. Songs that I already loved the mix on, got an immediate upgrade. I have tried applying the same adjustments to the diffuse field curve, however there is just still too much 7-10kHz, many youtube videos become difficult to watch with how harsh sibilance becomes, and I find this to be true for every target I’ve tried that uses the diffuse field as a baseline.

This target curve also very closely mimics how a good set of stereo speakers sounds to my ears, despite the free field baseline being measured from a 0 degree incidence. I also find it particularly interesting that everything I perceive as harsh and unnatural in my headphones seems to correlate to anything that does not sound to me like listening to stereo speakers.

Now, to clear up some concerns you may have:

  • “This is just confirmation bias of a very particular preference response”
    • Potentially, it would be interesting to see if anyone else likes how this target sounds on their closed-backs. I find it to be very natural sounding, and although the “soundstage” is narrowed, most sounds seem placed in front as if listening to speakers.
  • “You don’t prefer free field, you prefer diffuse field with a heavy cut from 7kHz to 10kHz”
    • Maybe, however I find the mids and bass to sound much more natural and accurate to me with the free field target.
  • “Why just for closed-backs?”
    • Alright, that part is just pure speculation. I don’t own and have never owned a pair of open-backs, but I have noticed that almost all headphone-related discussion seems to revolve around open-back headphones, and from my shallow digging into Harman research, it seems to me that there is a heavy emphasis on open-backs there too, though I may be wrong about that. Another reason why I state this to be for closed-backs, whether for myself or for others, is that I predict that the diffuse field baseline may actually be the ideal baseline to use for open-backs, as they interact directly with the external environment. Since closed-backs are a sealed environment, it makes sense to me to treat them as an anechoic chamber, logically reinforcing my perceptual preference for free field. Perhaps some confirmation bias at play here.
  • “No headphones have ever been tuned like that”
    • I am well aware, as I’ve compared it to every headphone available on listener’s 5128 squig.link database, and despite some potentially coincidental similarities in some frequencies to certain models, there is absolutely nothing that resembles this curve remotely, especially in that 9kHz zone. You may see this as a sign that my target is just incredibly personalized. I’m not refuting the idea, my hopes are that the methodology I used could be helpful to some, or perhaps spark some productive discourse. Of course, it would be amazing if somehow I’ve created the magical closed-back target that everyone loves, but of course I am working with a sample size of one and expecting such a result is baseless.

Well, there you have it. Is this a massive nonsensical dribble post? Did I create the world’s highest effort sh-tpost? Who knows. I’m interested to hear anyone’s thoughts on this, if anyone decides to spend their afternoon reading my very professional thesis (/s). Please let me know if I missed anything or didn’t quite explain something properly.

I’m also new to this forum, so forgive me for any improper formatting/etiquette. I look forward to chatting with you all.

3 Likes

Welcome.

Someone has to say it. Why not the class clown?:clown_face:

Axel Grell also likes free field, but with a substantial slope added, if the OAE1 is any indication. So you’re actually in pretty good company on some of this. And might want to try to elicit some feedback from him on it. Maybe some of the lads can coax him onto this forum to take a look at what you’ve done, and to offer a few comments on it.

What I personally keep coming back to on this though are the broad conclusions of the Harman research. That most listeners prefer something close to the in-ear response of neutral speakers in a semi-reflective room. Without some actual in-ear measurements of this though, there is really no way to know who’s targets are closest to something like that. So your target might be just as close as Harman, SoundGuys, the DF+SP model I use, or any others, for all we know.

The last time I listened to a new pair of M50X’s, I also noticed some fairly pronounced sibilance in some spots in the treble. So that may be one of the reasons you’re preferring to dial the mid-treble back with that particular headphone.

I think that’s likely to be one of the main causes of your troubles. But you may also have some special sensitivies in the same frequency range that are making the sibilance in the M50X treble seem even more pronounced.

The ear canal normally produces a series of resonances at about 3, 9, 15, and 21 kHz. The sibilant region is generally the area between the first two resonances, or about 3 to 9 kHz*. The concha normally also produces a resonance between these two ear canal resonances though. And it also produces a notch that can help to tame some of the ear canal resonance at around 9-10 kHz. Maybe these features are not so well aligned though on your concha and canal. And your headphones are producing a more pronounced ear canal resonance in the 9 kHz range.

Some people also have ringing in their ears (tinnitus) near these same frequencies, which might exacerbate the sibilance in that range.

Another possibility is that your gear or media is passing through too much noise to your headphones. That might happen if you’re using a fairly sensitive pair of headphones with an amp that has too much gain and/or noise. Or using some older media (such as tapes or LPs) that might need some noise reduction. Some computer connections can also generate some noise. These are just a few possible examples though.

(*The ITU-R 468 noise curve shows where the ear is most sensitive to noise in this range.)

1 Like

Here are two plots of the Grell OAE1 btw. I’ve pre-cued Resolve’s review to the raw and compensated 5128 graphs.

Earphones Archive also has a 5128 plot of this headphone. Since you’ve looked at many graphs already, I assume you know your way around the Squiglink interface. You can turn off the compensation though with the little squiggle near the eye in the OAE1’s display options underneath the graph, on the righthand side.

1 Like

Welcome.

Regarding your issue, let me offer another option. Sonarworks Sound Reference ID software for headphones offers a measured correction for your headphone. I use it occasionally for some headphones (Sony MDR Z7, Sennheiser Momentum 4), and I find that Sonarworks improves the headphone playback quite a bit.
Sonarworks does not use any of the various Harman curves, instead they use a propriety curve meant to make headphones sound like nearfield monitors.
You can also send in your headphones to them, and they will generate a specific curve for your exact headphone. One of the dirty little secrets not discussed enough is the variation in measured responses with any given specific headphone model. I’ve experienced this firsthand, and have had it happen more than once.

At any rate, let us know how you go with this.

1 Like

It is also possible that the sibilance on the M50X could lessen a bit as the pads wear or break-in. The fit, placement, and coupling on the headphone on your head and ears might also be a factor. The pads on the M50X are pretty small, and clamp force is fairly high, so I always had a difficult time getting them to fit comfortably and consistently on my large head and ears.

I can also post some M50X EQ curves to try, based on the methods I use, which are Harman-like. But not based on their specific curves. No guarantees it will sound better though.

1 Like

My pair are a fair few years old now - safe to say I think they’re done breaking in, lol

I have a small head and ears, they fit pretty well for me and I’ve never had a problem getting a good seal.

I’d also be happy to try any curves you have for these, I’m always open to trying different tunings!

1 Like

I have used Sonarworks ID before, and while I did enjoy the speaker emulation feature, I did not like the tuning. The bass was simply too anaemic for me to thoroughly enjoy any music, and it felt to be overly focused on the high mids / presence range, which made everything feel very forward and clinical. The mid + treble relationship also felt awkward to me. However, I appreciate the suggestion. You also make a good point about unit variation - obviously that’s something I can’t account for, but it’s also another reason why I’m hesitant to label anything with objectivity, other than acoustic / psychoacoustic phenomena, which I mainly try to adhere to where possible.

If I ever have the money to spare, I may pick up a pair of their calibrated headphones. Would be interesting to see how much it differs from my experience with SoundID.

Cheers

1 Like

I hadn’t heard of the OAE1 until now, so thanks for bringing that to my attention. I don’t necessarily agree with the philosophy behind it (mainly tilt), but it’s obviously an interesting concept. Interesting to see the drivers angled - I don’t seem to struggle with getting a speaker-like soundstage with my headphones, but I get it if it’s supposed to narrow the stereo field to less than 180 degrees.

I also think I placed too much emphasis on perceived harshness (lol), I wrote most of this way too late at night. The main reason I brought it up so much is because that to me, what I perceive as harshness on both headphones and speakers seems to correlate 1:1 with anything that doesn’t sound like anechoically flat speakers in a room. The result is that anything that sounds artificial or overhyped in literally any region of the frequency band I perceive as harsh and grating.

I’ve also revisited the diffuse field and used it for a day to see how I would settle in, and I noticed two more main gripes I have with it:

  1. Overly present treble. It’s not harsh per se, but it’s far more pronounced than I think is necessary. I listened to a bunch of my favourite songs with summing to mono turned on, and the diffuse field tuning simply made every cymbal crowd the centre and smother the vocals and the rest of the drums. When I switch to free field, the summed mono image sounds pleasant, gives the vocal room to breathe, and the drums all sound balanced. It sounds like speakers.
  2. Awkward stereo image. This one is a controversial take I’m sure - but I can’t help but notice that whenever I switch to diffuse field, anything that is “front and centre” (i.e. the centre of a mix or just podcast audio) becomes “the top of my head”. I’m not sure if this is a common experience, but with diffuse field I get this sense that the frequency spectrum gets spread vertically, and that mono sounds start coming from above my head. Free field on the other hand, places every mono element out in front at eye level, much like the phantom centre you get from a properly configured stereo speaker setup. It also removes the vertical spreading of frequencies, and places the stereo field around me laterally, like you’d get from speakers. Diffuse field just sounds to me like I have a fishbowl on my head.

So, my main reasoning for free-field has shifted towards it being (to me) a more accurate representation of listening to speakers than diffuse field. It feels weird to have sounds feel like they’re coming from inside of / above your head, but that is my subjective experience.

I also had a look around at some measurements for the M50x, and couldn’t spot any nasty treble peaks that were too narrow for EQ, only a few dips. I double-checked by listening to a 20Hz - 20kHz sine sweep through these with my EQ applied, and couldn’t hear any sneaky peaks that aren’t evident on measurement graphs. There are a few points of distortion on these - and I have noticed those before, but it seems that a nice EQ profile seems to almost completely alleviate the perception of distortion. I can run these fairly hot through my preamp, and not run into any troubles with distortion or driver excursion.

Cheers

1 Like

There are a number of things here but I just wanted to tackle this one first. This isn’t the takeaway from this image. Also, people should probably take that study with a grain of salt, and I’d actually recommend they redo this one with a slightly different methodology/tools. But even still, this one probably needs some clarification.

  1. This does not include the canal effects, so it really just shows pinna differences, which we’d already expect to measure brighter on both the 5128 and the KEMAR-based rigs (this is the only thing to take seriously from this IMO).
  2. The bass extension difference is likely just due to the measurement technician not getting a seal when taking the measurement. So this is unlikely to be a difference in actual performance when seal is achieved, and it will vary quite a bit depending on the headphones anyway.
  3. Both of those graphs could actually sound the same to the individual ‘head’ the headphone is sitting on.

So this doesn’t actually mean that measurement heads need a different baseline to be predictive, it just means the HpTFs trend brighter for those pinna effects, which we already knew. And, I actually have examples on of this same type of data that shows it’s a lot closer than the above image indicates.

Red is on-head, teal is on-rig. You can ignore the low frequency differences, I expect that’s just due to the mic adapter.

3 Likes

Apologies for asking this, qrayzie. But just to clarify, you have the M50X, and not the orriginal M50, with no X. Is that correct? Because the original M50 has a somewhat brighter tilt than the X version does.

I will see what I can do though on some EQ curves for the M50X. And try to get back to you very soon on that.

One of the reasons I prefer to use averages of other headphones for my EQ targets is that they have the potential to sift out some of the finer and narrower band details in the treble, which may be missing or smoothed over in a more generalized target, like the Harman curve for example. There are aspects of the Harman curve which do not seem exactly neutral though to my ears regardless of this.

Also, what are you currently using to drive your headphones?

Thanks for clarifying this, that makes a lot more sense when it’s put that way. I mainly used it as a shot in the dark, as it seemed to align with some of the troubles I had getting a good sounding response. I’m gathering that this was just pure coincidence, as I’ve gone back and tried my current target without this modification and it does bring back a lot of detail and accuracy in the treble. That’s an oversight on my part.

Cheers

1 Like

Yes, I’m using the M50x.

I have used the average headphone target that’s on listener’s squiglink before, didn’t really like it. However, that probably takes from some headphones that have unfavourable tunings but I am unsure. I’d be interested to see what headphones you use to average out a response.

The preamp I’m using is a 4th gen Scarlett Solo, seems to do the trick

1 Like

I have a few thoughts on the Scarlett, mainly related to impedance. But will save those for later.

I’ve also done a few graphs comparing the M50X to various other headphones measured on Oratory1990’s setup, and have identified what appear to be a few bright spots in the treble, at around 9.5 and 14.5 kHz. I will try to post a few images in the next day or say.

I want to do some graphs based on HBK 5128 measurements as well for comparison.

This is still preliminary, but will give you an idea of the kind of thing I’m seeing in the treble on the M50X versus some other headphones.

This graph is based on 5128 measurements. And it shows the difference in response between the M50X and about 18 other neutralish headphones. The M50X is showing some signs of brightness in the treble relative to the average of the other headphones. And I suspect that some of the peaks in the treble may be more pronounced than is indicated here. A larger and more careful sampling might possibly shed a little more light on that. Hopefully I’ll have that done sometime tomorrow or the next day.

Overall, the response of the M50X is fairly neutral. But it is V-shaped particularly in the lower mids/upper bass where there’s a fairly pronounced dip at 300 Hz. And it leans a little forward in spots in the treble, upper mids, and mid-bass.

SInce you are using this with an interface, I assume you’re using the wired rather than a Bluetooth version of the M50X, which would have a different response.

1 Like

This seems to align well with the problems I personally have with the default tuning of the headphone, FWIW this is what it looks like when compensated to the target curve I’m currently using:

This is also based on a B&K 5128 measurement. I’m currently using the 5128’s free-field response with the filters from the 2018 Harman study (6.6dB bass, -3dB treble, -1.8dB 3khz) as my target.

I am using the wired version of this headphone, and in regards to impedance, the Scarlett can absolutely run this headphone too hot if you let it, since it’s only 38 Ohm.

Beyond middle position on the output knob, the distortion ramps up significantly. But that’s a really high volume so rarely do I ever feel the need to crank it up that high. I keep it below halfway and distortion seems to be mostly (if not completely) unaffected in this range.