VI. Analysis of the 2MASS Second Incremental Release Catalogs


4. Completeness and Reliability

c. Extended Source Catalog (XSC) Reliability

The 2MASS Second Incremental Data Release covers 47% of sky. This large area presents a challenge for assessing the quality of the XSC, in particular the reliability. The method we employ is to cross-identify the XSC sources with "catalogs" in which the source is identified as either extended (i.e., resolved) or otherwise. See below.

The "reliability" refers to the ratio of real "extended" sources to the total number of sources in the XSC. Objects which are not truly extended, but which mimic "extendedness" (e.g., double stars), are inevitably part of the Catalog. The purpose of this page is to present the level of contamination.

Key Conclusion: the XSC satisfies (or closely approaches) the Level 1 science requirement for reliability.

i. "Truth" Table

As part of the quality verification process, 2MASS has undertaken a program in which "extended" sources are visually examined, cross-compared with catalogs in SIMBAD, NED, & ZCAT, compared with DSS images, compared with deep optical and infrared imaging, and followed up with imaging and spectroscopy observations (in which systematic programs are in place). The visual examination entails an "expert" interaction in which the 2MASS J, H and Ks images (plus a composite three-color image) are displayed alongside the comparable DSS image. An example is shown in Figure 1:

Figure 1: Galaxy near the Large Magellanic Cloud (J=14.77, J-Ks=0.99; no entries in SIMBAD or NED)

The upshot is that we have a truth-table "catalog" of ~150,000 sources in which we have "classified" the source as either (1) extended (almost always a galaxy, except deep in the Zone of Avoidance [ZoA]), (2) stellar or pointlike, (3) double star (usually barely resolved), (4) triple stars (usually confined to the ZoA), (5) unknown (usually at the sensitivity and resolution limits). These sources are spread throughout the sky, with pockets concentrated about regions that were targeted for study (e.g., the Coma & Hercules cluster, LMC, ZoA, etc). Figure 2 is an equatorial skymap of the current "truth" table.

There are several cautionary notes. See the Caveats below.

Figure 2

ii. Method

We will use the truth-table to assess the reliability of the XSC. The Level 1 science requirements are relevant to the integrated flux intervals:

Our goal is >98% reliability (and >98% completeness) for |glat| > 30°, and >98% reliability (and >80% completeness) for |glat| > 20°. There are no requirements for |glat| < 20°, although we do present results below. These requirements apply to sources free of confusion or contamination. In the XSC there are confusion flags for each photometry aperture and an overall confusion flag. The reliability is computed for sources in which sources have "clean" confusion flags. This reliablity measurement should be considered an "upper limit" to the true reliability (see Caveats section).

The flux measurement used in this analysis corresponds to the radius = 7´´ circular aperture (j_m_7, h_m_7, k_m_7). This aperture is particularly suited to galaxies at the faintness levels relevant to the Level 1 requirements (see above). Hence, the radius = 7´´ circular aperture confusion flag (j_flg_7, h_flg_7, k_flg_7). is used to separate "confused" from "clean" sources. We also use the cc_flg confusion flag.

Finally, the Level 1 requirements are relevant to the "galaxy" catalog, which is a subset of the XSC. The "galaxy" catalog is constructed by selecting only sources with a g_score <= 1.4.

Summary

Final Note: for sources in the Galactic plane (|glat| < 10°), the magnitude thresholds were lowered to account for the loss in sensitivity, due to confusion noise and to bolster the number of XSC-"truth" matches.

See also the Caveats section.

iii. Results

The total number of XSC sources that match the "truth" table is ~15,000. Figure 3 is a sky plot (shown with a flat projection) of the XSC sources that satisfy the g_score and magnitude thresholds. Overlaid are the "truth" matches, color-coded according to galactic coordinates.

Figure 3

(The legend to Figure 3 is
Black points:

Yellow points: Magenta points: Cyan points: Green points: Note the magenta-yellow patch of sources near glon ~280°, glat = -30°, which is the LMC. The large cyan/green patch near glon~60°, glat~0° is the ZoA area targeted by Jarrett et al. (2000, AJ, 120, 298).

Tables 1-5 summarize our results.

Table 1: Reliability of the XSC for 30° < |glat| < 90°
ib mag- mag+ Den Den- Den+ nT nTf ng ns nd nt nu R
1 14.50 15.00 2.72 1.88 3.20 5230 3711 3467 41 25 0 178 0.981
2 13.80 14.30 2.72 1.88 3.20 4499 3452 3262 26 22 2 139 0.985
3 13.00 13.50 2.73 2.10 3.20 3806 2966 2906 8 5 1 46 0.995
ib = band (1=J, 2=H , 3=Ks)
mag- = bright threshold limit (mag units)
mag+ = fiant threshold limit (mag units)
Den = average stellar number density (log[# stars deg-2 brighter than 14th mag at Ks])
Den- = minimum stellar number density
Den+ = maximum stellar number density
nT = total number of XSC sources that pass mag and g_score thresholds
nTf = total number of XSC sources that pass thresholds AND have clean confusion flags
ng = total number of verified extended sources (usually galaxies)
ns = total number of verified stars (usually isolated)
nd = total number of verified double stars
nt = total number of verified triple stars
nu = total number of unknown sources
R = upper limit reliability: ng / (ng + ns + nd + nt)

Table 2: Reliability of the XSC for 20° < |glat| < 30°
(and with "sh" >= 0.5, see second set of numbers)
ib mag- mag+ Den Den- Den+ nT nTf ng ns nd nt nu R
1 14.50 15.00 3.05 2.64 3.40 2894 2091 1840 39 24 1 187 0.966
2 13.80 14.30 3.05 2.64 3.40 2467 1897 1701 28 10 1 157 0.978
3 13.00 13.50 3.06 2.52 3.40 1915 1518 1453 6 6 1 52 0.991
- - - - - - - - - - - - - -
ib mag- mag+ Den Den- Den+ nT nTf ng ns nd nt nu R
1 14.50 15.00 3.05 2.64 3.40 2894 1871 1707 8 12 1 143 0.988
2 13.80 14.30 3.05 2.64 3.40 2467 1579 1469 4 6 1 99 0.993
3 13.00 13.50 3.06 2.52 3.40 1915 1156 1128 2 1 1 24 0.996

Table 3: Reliability of the XSC for 10° < |glat| < 20°
(and with confusion flags ignored; see second set of numbers)
ib mag- mag+ Den Den- Den+ nT nTf ng ns nd nt nu R
1 14.50 15.00 3.36 2.87 3.81 2934 1700 1555 29 24 4 88 0.965
2 13.80 14.30 3.35 2.87 3.81 2561 1611 1510 12 16 2 71 0.981
3 13.00 13.50 3.39 2.83 3.83 2638 1579 1513 11 11 3 41 0.984
- - - - - - - - - - - - - -
ib mag- mag+ Den Den- Den+ nT nTf ng ns nd nt nu R
1 14.50 15.00 3.39 2.87 3.85 2934 2934 2506 58 166 12 174 0.908
2 13.80 14.30 3.38 2.87 3.83 2561 2561 2315 28 72 8 135 0.954
3 13.00 13.50 3.42 2.83 3.85 2638 2638 2500 18 30 12 76 0.976

Table 4: Reliability of the XSC for 5° < |glat| < 10°
(and with confusion flags ignores; see second set of numbers)
ib mag- mag+ Den Den- Den+ nT nTf ng ns nd nt nu R
1 12.00 15.00 3.67 3.26 4.48 2321 984 888 5 19 9 63 0.964
2 11.30 14.30 3.67 3.26 4.48 2335 1038 942 3 17 10 66 0.969
3 10.50 13.50 3.69 3.26 4.48 2088 910 830 3 18 8 51 0.966
- - - - - - - - - - - - - -
ib mag- mag+ Den Den- Den+ nT nTf ng ns nd nt nu R
1 12.00 15.00 3.73 3.26 4.50 2321 2293 1918 32 98 52 184 0.909
2 11.30 14.30 3.73 3.26 4.50 2335 2307 1968 25 71 48 190 0.930
3 10.50 13.50 3.75 3.26 4.50 2088 2057 1787 25 56 39 145 0.935

Table 5: Reliability of the XSC for 0° < |glat| < 5°
(and with confusion flags ignores; see second set of numbers)
ib mag- mag+ Den Den- Den+ nT nTf ng ns nd nt nu R
1 12.00 15.00 3.77 3.44 4.50 1445 432 358 10 15 12 37 0.906
2 11.30 14.30 3.74 3.43 4.50 1639 562 485 10 16 9 42 0.933
3 10.50 13.50 3.75 3.43 4.50 1601 538 467 9 17 12 33 0.925
- - - - - - - - - - - - - -
ib mag- mag+ Den Den- Den+ nT nTf ng ns nd nt nu R
1 12.00 15.00 3.93 3.44 4.50 1445 1318 866 60 73 164 147 0.740
2 11.30 14.30 3.89 3.43 4.50 1639 1514 1061 61 72 163 148 0.777
3 10.50 13.50 3.90 3.43 4.50 1601 1479 1029 58 78 167 138 0.767

iv. Reliability for XSC Sources with (g_score > 1.4) and (e_score <= 1.4)

About 10% of the XSC is populated by sources that do not pass the g_score threshold, but do satisfy the e_score criteria. What are these objects? The e_score is designed to identify extended sources using star-galaxy discriminants independent of any color bias (unlike the g_score, which is biased against blue objects). As designed, the e_score detects blue (J-Ks < 0.8) extended sources, including blue compact and low surface brightness galaxies. Figure 4 shows an example of a nice blue (J=14.65, J-Ks = 0.42) low surface brightness galaxy.

Figure 4

Unfortunately, the e_score is also vulnerable to false "extended" sources, including artifacts from bright stars (because most stars are brightest at "J", hence the generate more artifacts in the "blue" band). It is not as robust at the g_score at separating (faint) single and double stars from real extended sources. Figure 5 shows an example of a false e_score source (it is a compact blue double star). The overall reliability is poor, hovering around 50% for most of the sky and less than 10% for the Galactic plane(see below). On the other hand, the e_score does identify blue galaxies. Hence, the e_score, in combination with the reliable g_score, comprises a more complete set of galaxies, spanning a larger dynamic range in color. For the Galactic plane, the e_score is not reliable indicator of extendedness.

Figure 5

Table 6: Reliability of the XSC for e_score <= 1.4, g_score > 1.4 and 30° < |glat| < 90°
ib mag- mag+ Den Den- Den+ nT nTf ng ns nd nt nu na R
1 14.50 15.00 2.70 2.10 3.20 847 401 127 139 73 3 53 6 0.365
2 13.80 14.30 2.71 2.27 3.20 583 258 79 85 50 4 33 7 0.351
3 13.00 13.50 2.69 2.24 3.09 137 32 15 5 4 1 7 0 0.600
ib = band (1=J, 2=H , 3=Ks)
mag- = bright threshold limit (mag units)
mag+ = fiant threshold limit (mag units)
Den = average stellar number density (log[# stars deg-2 brighter than 14th mag at Ks])
Den- = minimum stellar number density
Den+ = maximum stellar number density
nT = total number of XSC sources that pass mag and g_score thresholds
nTf = total number of XSC sources that pass thresholds AND have clean confusion flags
ng = total number of verified extended sources (usually galaxies)
ns = total number of verified stars (usually isolated)
nd = total number of verified double stars
nt = total number of verified triple stars
nu = total number of unknown sources
na = total number of artifacts
R = upper limit reliability: ng / (ng + ns + nd + nt + na)

Reliability of the XSC for {E_score <= 1.4, G_score > 1.4} and 20o < |glat| < 30o
ib mag- mag+ Den Den- Den+ nT nTf ng ns nd nt nu na R
1 14.50 15.00 3.13 2.78 3.40 455 213 60 44 64 2 43 0 0.353
2 13.80 14.30 3.16 2.82 3.40 279 138 52 12 38 0 35 1 0.505
3 13.00 13.50 3.16 2.95 3.37 30 15 7 1 2 1 4 0 0.636

Reliability of the XSC for {E_score <= 1.4, G_score > 1.4} and 10o < |glat| < 20o
ib mag- mag+ Den Den- Den+ nT nTf ng ns nd nt nu na R
1 14.50 15.00 3.43 2.87 3.73 1319 433 122 69 164 17 61 0 0.328
2 13.80 14.30 3.47 3.00 3.81 1098 400 120 29 173 29 48 1 0.341
3 13.00 13.50 3.50 3.00 3.81 161 72 34 2 17 6 12 1 0.567

Reliability of the XSC for {E_score <= 1.4, G_score > 1.4} and 5o < |glat| < 10o
ib mag- mag+ Den Den- Den+ nT nTf ng ns nd nt nu na R
1 12.00 15.00 3.69 3.33 4.12 1104 276 45 18 134 38 40 1 0.191
2 11.30 14.30 3.72 3.31 4.12 903 249 45 12 120 36 35 1 0.210
3 10.50 13.50 3.82 3.38 4.12 305 85 21 1 33 16 13 1 0.292

Reliability of the XSC for {E_score <= 1.4, G_score > 1.4} and 0o < |glat| < 5o
(and with confusion flags ignored; see second set of numbers)
ib mag- mag+ Den Den- Den+ nT nTf ng ns nd nt nu na R
1 12.00 15.00 3.70 3.44 4.45 1850 436 25 55 244 71 37 3 0.063
2 11.30 14.30 3.72 3.44 4.45 1560 407 42 36 212 73 39 4 0.114
3 10.50 13.50 3.84 3.54 4.45 643 145 34 10 47 31 21 2 0.274
- - - - - - - - - - - - - -
1 12.00 15.00 3.74 3.44 4.50 1850 1825 63 132 1016 492 103 18 0.037
2 11.30 14.30 3.77 3.44 4.50 1560 1537 106 82 776 451 106 14 0.074
3 10.50 13.50 3.93 3.49 4.50 643 619 88 18 199 243 64 6 0.159

v. Caveats

The "truth" table is compiled using diverse methods, some more reliable and objective than others. It is inevitable that errors and biases are introduced into the table. For example, galaxy clusters are often targeted for study, due to their favorable properties (e.g., lots of galaxies in a small area). The reliability is always higher near a cluster (due to how the reliability is computed). Hence, we are biased toward a higher reliability in the "truth" table compared to a table compiled for only field sources: the reliability measurement is an upper limit. In addition, errors creep in, due to mis-classification (often it is in the sense that a real, but faint, galaxy is classified as "non-extended" or stellar, when in fact it is extragalactic): there is some uncertainty (if small) in the reliability measurement.

Although we have a large number of sources with "truth" matches (nearly 15,000), it is a tiny fraction of the total number of XSC sources (over 500,000 total for all flux levels). We are thus making sweeping conclusions (reliability) of the XSC based on a small number of sources (which are in itself biased; see above paragraph). To reiterate: the true reliability of the entire XSC is probably slightly lower than what is measured here.

Classification is particularly difficult in the Galactic plane, where confusion creates many kinds of multiple stars that can look identical to a galaxy.

We make no distinction between a resolved galaxy and a resolved Galactic source (e.g., nebulosity). Hence, "extended" source means galaxy and Galactic source. However, most extended sources are galaxies (by far this is true for most of the sky). In the Galactic plane the frequency of extended Galactic objects rises steeply (e.g., HII regions dominate for |glat| < 5°).

The Level 1 requirements apply to only the g_score (threshold = 1.4). The XSC has many sources (about 9.8% of the total) that do not satisfy this threhold. They are in the XSC due to the e_score criteria (also threshold at 1.4). The e_score is a simpler measure of "extendedness" that is designed to be less biased to blue objects (but is also less reliable overall). Figure 6 is a skymap of the XSC sources that do not meet the g_score threshold (but do meet the e_score threshold); there are a total of ~59,000 sources (at all flux levels).

Figure 6

When the confusion noise is high (i.e., in the Galactic plane) the frequency of contamination from a nearby source is significant. Hence, "clean" or unconfused sources in the ZoA are somewhat rare (see tables above, comparing "clean" and total counts). This is clearly a biased "reliability" measurement for these areas.

Note the relatively large number of "unknown" sources (denoted as "nu" in the tables). If these turn out to non-extended, then the reliability drops significantly. Conversely, if they are truly extended, then the reliability is pristine. Only higher sensitivity and deeper resolution imaging (or spectroscopy) can definitely decode the nature of these sources. Past experience with follow-up observations suggests that most of the sources are extragalactic in nature. Finally, statistical analysis suggests that these "unknown" sources are indeed resolved galaxies (see The Nature of Galaxy Truth Table "Unknowns").

[Last Updated: 2000 Jan 27 by T. Jarrett. Modified 2000 Sep 12 by S. Van Dyk.]


Previous page. Next page.
Return to Explanatory Supplement TOC Page.