Thanx, Starguy. I am very interested in this.
So, if you take the standard deviation of (HIPPARCOS MV - your MV) for every star in ASCC that has a good parallax (10%?), that should give a measure of the systematic errors, or at least some idea of the spread of the data around your fit. Convert that magnitude standard deviation to a percentage difference and you have your systematic error... Another (easier?) way to calculate the systematic errors would be to take the standard deviation of abs(hipparcos distance-your distance)/hipparcos distance. Anyway, I get the impression you've already done this.
I'm not sure. What I ultimately did - out of sheer frustration - was to directly match the distance modulus from the Hip parallaxes in the callibration tables with all four of the distance moduli from my averaging scheme. I took the Hip based Distance Modulus (which I assumed to be fully accurate), subtracted the spectrocopic DM's, and then because I didn't want to deal with negative numbers, added one to the result. Hence 1.0 would be a dead match, and each 0.1 of difference thereafter would be a 5% difference in distance. I counted 1.0 to 1.1 and 1.0 to 0.9 as being accurate to within 5%, 0.8 -0.9 and 1.1-1.2 as being accurate to within 5-10%; ect. To keep things visually simple (and avoid debilitating eyestrain) I actually inserted fields with little `t's' or 'h''s or what not in them: one such letter if the match was within 15-20%, 2 letters if the match was within 10-15%, ect. I did this for each of the V, J, and H moduli, along with the combined value.
Does that constitute calculating 'standard deviation''? My observation, then and now, was that as long as the star in question was not a close double, variable, underluminous, or superluminal then 90% of the time my distance would be accurate to within 20%, and 75% of the time (give or take a point or two) would be accurate to within 15%. (And within 5% well over half the time.)
The superluminal stars in particular perplex me. How can they be a over a full magnitude brighter than the 'average' and still be class V stars? Isn't that verging on subgiant territory? And are there really that many of them (something like 10 - 15% of the total).
One weird thing I did note which baffles me: both the really underluminous stars (the ones my scheme places at about double their actual distances) and the superluminous stars (which my scheme puts at half or less of their actual distance) tended to have almost but not quite identical J, H, V-J, and J-H values. So if I didn't already know the distance, I could look at the J, H, V-J, or J-H info for such a star and possibly tell it wouldn't work, but I wouldn't be able to tell if it was superluminous or underluminous. That in turn meant that I couldn't attempt a photometric distance - though with the addition of a proper motion evaluation...hmmm...
In a few of the tables I experimented with adding in the K's (which I dropped because the K magnitudes are so close to the H Magnitudes it actually skewed the whole table). I have wondered now and again if the K magnitude might have been a better choice instead of H for these calculations, but by the time that issue arose I already had a great deal of work done involving the H magnitudes, so I stuck with them. I also tried - and gave up on - a number of other schemes, some involving averaging, some not. The only one I actually retained applies to stars of F5 - G3; with those stars I count the J and H Distance Moduli twice instead of once and divide by five instead of three (because they tended to be more accurate for at least the bottom end of the superluminals, and still reasonably close for the more normal stars. This resulted in maybe a 5% improvement in the quality of the distances to some of those pesky superluminal stars).
As for individual errors, Henry et al. 2004 take the standard deviation of the different photometric distances (in their case, 12 color relations; in yours, 3?) as the error; I think Weis 1984 quotes a similar number for his two relations.
The total resulting error on a distance should be sqrt((distance*systematic%)^2 + individual^2).
I believe, with my limited knowledge of statistics, that the last equation is correct as long as the systematic errors are independent of the individual errors. In theory, they should be; the systematic error describes how well your fit works given perfect data, and the individual error describes the errors in the measurement of the particular star. In practice, they probably aren't, but you should be able to test this too; your distances should be within 2-sigma of the Hipparcos distances 95% of the time: abs(hipparcos distance - your distance) < 2 * (hipparcos error + total resulting error).
I will have to give this some thought. I would really like to be able to attach some sort of error bar to my distances, and had thought I would have to settle for just flagging the ones most likely to be in error. I will have to give some thought as to just exactly what keys to push on the computer to get a decent error bar scheme to work.
Secondary note here: You are familiar with Weis's work then? I was under the impression that nearly all memory of his work had vanished, even though he probably provided distances to at least as many stars as Gliese, and possibly more accurate distances much of the time as well.
As for those spectral types, I don't think Cannon defined any more than F5, F8, G0, G2, G5, G8, K0, K2, K3 and K5... any other types were interpolated and developed by Morgan, Keenan, and Kellman in 1943 (that was the one that added I-V for luminosity classes) to mostly match Cannon's types, but they are not quitethe same. So, there will be some inherent uncertainty in the actual spectral type because your 'dead average' absolute V magnitudes from VizieR are probably all using the MKK sequence, while ASCC is apparently using a mix of MK(K) and HD types. There are still no K6 or K8 or K9 types though, K7 was defined by Keenan & McNeil (1976) as being halfway between K5 and M0.
This explains a great deal. I knew the I-V classes were a later addition, but was not aware that Canon's magnitudes were so limited. However, some of my later ASCC downloads do include K8 type stars.
There are also some limits to how far you can push spectral types, anyway. I have personally found them to only really be accurate to one defined type, but I work with M stars that are a lot more confusing to get spectral types for.
Something else I am only to well aware of. Burnham, in his Celestial Handbook also complained about the 'authorities' assigning sometimes radically different spectral types to the same star.
Oh, and obviously, comparing YPC parallaxes work too, if you've got them. There's a wide range of accuracy in the YPC, but they were compiling 160 years of parallaxes.
This gets into another project of mine from a few years ago. I started to wonder just how accurate or not some of the parallaxes with *really* high error bars were, so I went and rounded up several hundred of the old Yale parallaxes with high error bars and split them into groups based on that. (One group with errors of around 15%, another group with errors of around 20%, all the way up to and including groups where the errors were greater than the parallaxes themselves). I then went and dug up the best Hip parallaxes for those same stars - ones with errors of less than 5% where possible. It turned out as I recollect (I'll have to see if I can't find my notes on this somewhere) that once the error bar for a given parallax gets past 20%, then the odds of the parallax being correct drop pretty dramatically. Or...if a parallax has an error bar of 25%, there is a 50% chance the parallax is wrong - and sometimes wrong past the limits of the error bar - meaning instead of being off by 25%, it could be off by something like 30 - 40%. Once the error bar hits 50% of the parallax, the odds of the parallax being right drop to about one in three...but weirdly enough, that is about as low as it goes. Even if the error bar tops 100%, the parallax still has about a 30% chance (more or less) of being right (or at least to within 15% or so) - but that also means it has about a 70% chance of being wrong. At least that was the conclusion I reached back then - as I recollect, I was trying to decide how far to trust the Tycho parallaxes. (Answer - not very far).
I also think your assumption that G0 stars won't turn out to be G5V stars is reasonable, and your tests that change the spectral types are reassuring. There will always be some oddballs (maybe the telescope was pointed at the wrong star, who knows?) but by and large I'd agree. The giants were probably easier to weed out of the K stars because there's more magnitudes of separation between K dwarfs and K giants (and thus more proper motion difference)... the giant branch turnoff is indeed near F/G stars, so what you're seeing there is also real.
Thanx. I was wondering about that. It seemed that some of the...limits... I had to impose on V-J and J-H for F8-G2 stars were...not quite where they should be, compared to the rest. I spent a lot of time puzzling over that.
My biggest problem with your method is the scope- by limiting yourself to only stars with spectral types, you're limiting yourself to only a few thousand stars, where Pascal Hartman already did over a million with purely photometric criteria. It'll be worth it if you can prove your distances are more accurate than his.
Actually, this is just part one. I have played around with these B-V, V-J, J-H numbers and the rest for so long, and have defined ranges for them for so many spectral types, I think I could almost do this *without* knowing the spectral type. Basically, pick a spectral type, and then pull the stars that fall within my B-V, V-J, and J-H criteria for stars of that spectral type, and run the numbers from there. I actually did a few preliminary callibration tables on this; it seems that about one third to one half of the stars summoned in such a manner will be of the desired spectral type, and another to one third or so will be of the immediately adjoining spectral types to either side, which my prior tests show my system can handle relatively well. The big concern would be overlaps; for example many G5V and G3V stars fall within the same overlapping B-V, V-J, and J-H criteria, meaning the same star could appear in more than one set, with differing distance values. I'd probably get around this by making sure the B-V criteria at least did not overlap. Anyhow, I have a suspicion I could probably add another 40,000 - 50,000 stars to the total, with the added dubious bonus of very rough spectral types. But that will probably be a project for next winter at the earliest.
All that said, aside from Pascal Hartmans efforts (which I did not know about until coming across this site a week or so ago), I am familiar with two previous attempts to identify dwarf stars in the Tycho/ASCC. The first was by Turnbull and Tarter, as part of their second 'HabCat' Catalog. They used proper motion and B-V cuts to define and pull a couple hundred thousand stars they beleived to be dwarfs of F5 - K5ish, which they refined further using something called a Gausian Random number generator. Their initial proper motion cut reduced the number of dwarf stars by about half. However, they didn't try to determine actual distances.
The other effort (was it by Aemons or Timmons? can't remember right off) did try to determine distances, using proper motion as a proxy for distance. My personal view is it didn't work very well; the positive and negative error bars for the distances tend to run at about 90% of the distance itself. Because they did the whole ASCC - including the Hip catalogue - I was able to compare many of their distances with near perfect Hip ones; most of the time they were not even close. However, I agree with their results in the broad statistical sense; they claimed something on the order of 600,000 dwarfs in the Tycho/ASCC; when you compare that with Turnbull and Tarters work and allow for their more strident criteria the conclusion is about the same -
- there is something on the order of 3-4 giant stars to every dwarf in the Tycho/ASCC. (And my work so far tends to support this).
Pascal Hartman, though...provided distances to over a million Tycho/ASCC stars, on the assumption that the vast majority were dwarfs, which to me doesn't seem to hold up.