[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

entropy

To: cypherpunks@toad.com
Subject: entropy
From: Eric Hughes <hughes@soda.berkeley.edu>
Date: Mon, 26 Oct 92 07:54:33 -0800
In-Reply-To: Eric Hollander's message of Sun, 25 Oct 92 20:29:20 -0800 <9210260429.AA29646@soda.berkeley.edu>

Re: entropy

Eric Hollander writes:
>I seem to remember that English text is about 1.5 bits per character.  I can
>find a reference if you're interested.

There are lots of entropies available to measure.  There is "true"
entropy, the lower bound for all other entropy measures.  This is the
compressibility limit.

The entropy I was referring to was simply the single character
entropy.  That is, the probabilities p_i in the entropy expression are
the probabilities that a given single character appear in the text.
This will be higher than the true entropy.  Shannon's estimate for H_1
was 4.03 bits/character.  This assumes a 27 character alphabet.  The
entropy for ASCII-represented English will be higher because of
punctuation and capitals.

The true entropy of English is much lower than this, of course.  But
for an simple measure to automatically distinguish between plaintext
and ciphertext, it should suffice.

Re: uuencoding.  In my analysis before I assume that the uuencoding
would be of random data.  If it is not from random data, then the
entropy will be lower.  Thanks for the clarification.

Eric

References:
- Re: BBS E-mail policy
  - From: Eric Hollander <hh@soda.berkeley.edu>

Prev by Date: Re: Registering Keys with Big Brother
Next by Date: Registering Keys...
Prev by thread: Re: BBS E-mail policy
Next by thread: Re: BBS E-mail policy
Index(es):
- Date
- Thread