I wanted to pick a fun word that was somewhat unique. I knew "zyzzyva" was
one of the most improbable words to be played in a SCRABBLE® game, and I
liked the sound of it. Later on, after I had picked the name, I realized that
since "zyzzyva" was also the last word in alphabetical order (it has since
been replaced by "zzz"), I could call Zyzzyva "the last word" in word study. I
wish I could say I was smart enough to come up with that slogan from the
start.
I created Zyzzyva because I wanted a program for studying SCRABBLE®
words. There are several programs available, but all of them either lack
features I wanted or do not run on Linux. I therefore took it upon myself to
create the program I wanted, and Zyzzyva is the result. I have made it
available under the GNU GPL because I feel that others should be able to
benefit freely from my work.
Zyzzyva is not available for any handheld devices yet. However, I recently
acquired an iPod touch, and I hope to create a version of Zyzzyva that will
run on the iPod touch or iPhone. Because the existing Zyzzyva source code is
not able to be used directly on the iPhone, I don't know how long it will take
to produce such a program. When it's available, I will announce it on the Zyzzyva Users mailing
list.
How can I transfer my Zyzzyva data from one computer to another?
You can transfer all your data to a new computer by copying your data
directory. You can find out where your Zyzzyva data directory is by opening
the Preferences window in Zyzzyva and looking at the Data Directory setting.
Make a copy of this directory, and move it to wherever you like, on the new
computer. Then, after you install Zyzzyva on the new computer, open the
Preferences window and change the Data Directory setting to point to wherever
you put your data directory from the original computer.
The transfer should work regardless of the operating systems used on the two
computers. For example, you should be able to transfer your data directory
from a Windows computer to a Mac, or vice versa.
When viewing search results, what are the '-' marks at the beginning or
end of certain words?
Those marks denote "inner hooks" - they mean that either the first or last
letter of the word can be dropped to form another valid word, depending on
where the hyphen appears. If the hyphen appears at the beginning of the word,
then the first letter can be dropped. If the hyphen appears at the end of the
word, then the last letter can be dropped. A more detailed explanation is
given in the Help
documentation.
How does Zyzzyva calculate the probability of a word?
I don't have a nice formula handy, but it's basically a straight
calculation of the number of possible combinations to get a particular set of
letters from a full bag, including blanks. Separate combination values are
calculated for drawing zero, one, or two blanks, and then the numbers of
combinations are added together to get the total number of ways of drawing a
word. If you're interested, you can see the source code for yourself here
(see LetterBag::getNumCombinations particularly).
Are you sure the probability calculation is accurate? It seems
unintuitive to me.
Yep, it really is correct. The calculation of probability is notorious for
violating human intuition. One useful guideline to remember is that doubled or
tripled letters make a particular letter combination much more improbable than
you might think. That's because the number of ways to draw two identical
letters out of the bag is generally much lower than the number of ways to draw
two different letters out of the bag. If you'd like to verify the math, you
can see the source code for yourself here
(see LetterBag::getNumCombinations particularly).
Are you really sure about the probabilities? I need to see more
examples.
Consider BAILERS versus BELIERS. BAILERS is a fair amount more probable
than BELIERS because of the doubled Es. However, it seems intuitive to say,
"If BEILRS has been drawn from a full bag, then there are 9 As and 11 Es in
the remaining pool. The E is the more likely draw, which means BELIERS should
be more probable than BAILERS."
While it is true that drawing a single E is more likely than drawing a
single A, the math for calculating the probability of a word is more
complicated than that. You can't just start calculating with the assumption
that BEILRS has been drawn; those letters themselves factor into the
calculation, and they are not independent events. In this case, we can reduce
the difference between BAILERS and BELIERS to the difference between drawing
AE or EE when drawing two tiles from the bag.
If you draw two tiles out of a full bag, the probability of drawing two Es
is much lower than the probability of drawing an A and an E. That is because
there are 9 x 12 = 108 ways to draw an A and an E if you draw two tiles. But
there are only (12 x 11) / 2 = 66 ways to draw two Es if you draw two tiles.
This number of combinations is called "12 choose 2", and is explained in more
detail in this Wikipedia
article about combinations.
To reduce this to a simple example where it's easy to enumerate all
the possibilities, consider a 4-tile bag containing only AAEE. I'll
label the tiles A1, A2, E1, E2 for convenience. There are 6 possible
ways to draw two unordered tiles:
A1 A2 = AA
A1 E1 = AE
A1 E2 = AE
A2 E1 = AE
A2 E2 = AE
E1 E2 = EE
As you can see, AE is drawn 4/6 times, while AA is drawn only 1/6, and EE
is drawn only 1/6 as well. Even though A and E are equally likely to be drawn
if you are only drawing one tile, if you're drawing two tiles it is much more
likely to draw one of each than to draw duplicates.
As another example, consider a bag where E is more likely than A, as is the
case in the full Scrabble bag. Consider a 6-tile bag containing AAEEEE, with
tiles labeled A1, A2, E1, E2, E3, E4. There are 15 possible ways to draw two
unordered tiles:
A1 A2 = AA
A1 E1 = AE
A1 E2 = AE
A1 E3 = AE
A1 E4 = AE
A2 E1 = AE
A2 E2 = AE
A2 E3 = AE
A2 E4 = AE
E1 E2 = EE
E1 E3 = EE
E1 E4 = EE
E2 E3 = EE
E2 E4 = EE
E3 E4 = EE
AA occurs 1/15 times, AE occurs 8/15 times, EE occurs 6/15 times. Even
though E is twice as likely as A, AE is still more likely than EE! This is the
same basic principle that also causes words with duplicate letters to be drawn
with lower probability from a full Scrabble bag.
Complicating the math even further is the fact that Zyzzyva calculates
probabilities using a full 100-tile bag including two blanks. That may account
for differences (usually minor) with other probabilities you may have seen
calculated elsewhere, many of which use a 98-tile bag with no blanks.
Why did Zyzzyva mark all my answers as missed when I actually got them
all correct?
Zyzzyva only marks all responses as missed if you actually entered all the
correct responses, but also provided an incorrect response. The reason for
marking all responses as missed is because (currently) Zyzzyva has no other
way of marking the question as missed without marking at least one response as
missed. So instead of picking a random response to mark as missed, it marks
them all.
This only happens if you have the 'Mark question as missed after an
incorrect response' setting activated, but not 'End question after an
incorrect response'. Technically, it could also happen if you have the 'End
question after an incorrect response' setting activated, but not 'End question
after all correct responses', and the incorrect response is the last one you
give.
You can find the Zyzzyva issue tracker item for this issue here.
I love Zyzzyva so much that I would like to send you giant piles of
money. How can I do that?
That is so nice of you! I'm always very happy to receive donations of any
amount. You can do so by visiting the Donate
page.
If you find Zyzzyva useful, I would be very happy to receive a donation via
PayPal, which you can do by clicking the button below. Thank you for your
support!
This web site was designed and written by Michael
W. Thelen. All markup code, text, and graphics on this web site are Copyright
© 2004-2010 Michael W. Thelen. All rights reserved.