randovaro seemed to utter in news:1172802467.437879.251580@
31g2000cwt.googlegroups.com:
On Mar 2, 8:48 am,
tim_wit...@hotmail.com (Tim Witort) wrote:
I have an application that compares the mean pay of men and
women using a T-test. The goal being to determine if the mean
pay of men is greater than the mean pay of women at a two standard
deviation confidence level. This has worked well for years, but
I came across an odd sample of pays where all of the men
were paid one salary and all of the women were paid another.
As a result, the variance in both samples was zero. This,
in turn gives me a divide by zero when trying to compute
the t-stat.
I've looked through a couple of statistics books for guidance
on how two samples can be compared when there is no variance,
but none of the books address this case.
In this case, the t-stat is effectively infinite. Can we
then state that the mean pay of men is higher than that
of women at our two standard deviation confidence level?
If so, it seems we cannot report a numeric t-stat - we can
only say "Yes, it's higher."
As always, my apologies if this is unclear. I wish I could
go back to school for a couple of years and study statistics,
so I could know how to handle these special cases.
Any ideas?
-- TRW
_______________________________________
t i m
a t
w i t o r t d o t c o m
_______________________________________
Can you give us a bit more info about the procedure? It sounds like
your method has inadvertently taken a sample from a single class of
employee (e.g. executive) hence all the same pay. You may have to
consider a bigger sample, or a re-sample, or think about ways to make
your sample better represent the population.
You are correct, the samples are all of the current employees
with a given job title in a company's current workforce - but it
is in no way inadvertent.
So there is really no way to get a better/different sample. The
test requires that the workforce be examined at the end of the
company's fiscal year, so I have no leeway in how the sample
is taken. The sample is *all* of the employees with
a given title on the specified date.
-- TRW
_______________________________________
t i m
a t
w i t o r t d o t c o m
_______________________________________