The Sounds of Life

L. Van Warren
Warren Design Vision
July 11, 1999

The brain consists of approximately 100 billion nerve cells. If you include cells that perform other duties, structural support, myelin insulation, vascularization, etc. there are about a trillion cells in the human brain. The average brain weighs between 1300 and 1400 grams (about 3 pounds). There are about 100 trillion cells in the human body. Each body cell contains 23 pairs of chromosomes, one from each parent. Each chromosome is a long section of folded DNA. Subsections of the DNA along each chromosome consist of genes of which there are about 100,000. Each gene carries the information required to synthesize a single protein [Lewin]. DNA is made by joining sequences of four nucleic acids, called C, T, A and G, or CTAG for short. RNA, which is used to transcribe genetic information, consists of thsame nucleic acids, excepting U which is substituted for T. UCAG is mnemonic for the RNA alphabet.

Triplets of the symbols U, C, A and G are used to direct protein synthesis. Proteins are typically chains of several hundred amino acids joined together by the removal of a water molecule shared between them in a process called condensation. There are 64 possible UCAG triplets, called codons. 61 UCAG combinations code for 20 amino acids, 3 of these are STOP signs. The amino acids are labeled A C D E F G H I K L M N P Q R S T V W Y. You can click on these letters to see their chemical structures. We will use lower case letters to denote RNA "ucag" bases and upper case letters to denote the 20 amino acids. The 64 possible codons are shown below. ALL are used in protein synthesis.


Triplet Codon Table

u
c
a
g
u
uuu
uuc
uua
uug
F
F
L
L
ucu
ucc
uca
ucg
S
S
S
S
uau
uac
uaa
uag
Y
Y
STOP
STOP
ugu
ugc
uga
ugg
C
C
STOP
W
c
cuu
cuc
cua
cug
L
L
L
L
ccu
ccc
cca
ccg
P
P
P
P
cau
cac
caa
cag
H
H
Q
Q
cgu
cgc
cga
cgg
R
R
R
R
a
auu
auc
aua
aug
I
I
I
M
acu
acc
aca
acg
T
T
T
T
aau
aac
aaa
aag
N
N
K
K
agu
agc
aga
agg
S
S
R
R
g
guu
guc
gua
gug
V
V
V
V
gcu
gcc
gca
gcg
A
A
A
A
gau
gac
gaa
gag
D
D
E
E
ggu
ggc
gga
ggg
G
G
G
G

 

Sounding out shape

So if we wanted to "hear" DNA, we could associate a tone with each of c, t, a, or g, and likewise with RNA.

If we wanted to "hear" the sound of protein syntheis we would have a piano keyset with four keys. We would press three keys, hearing their sounds, and then the sound of the amino acid that was transcribed would appear in a fourth tone. This fourth tone could be any of twenty tones, but the first three tones would be any of three with the following exceptions.

The first letter of each codon is the most significant, so it might be represented by a low note.
The second letter of each codon is the next most significant, so it might be represented by a higher note, of slightly less volume. The third note would be the least, and is sometimes not significant at all, so it would be represented by a still quieter third note.

Consider the following, if we typed the string, "uuu" into the protein player, we might hear
a chord of the three notes, each exactly one octave apart. The lowest note would be the loudest, and the highest note would be the quietest. We will map first triplet symbol to the lower octave by uppercase bold, e.g. C, the second codon symbol will get the next octave up, denoted by lowercase bold (c), the third and least significant codon symbol will get the third octave and is denoted by lowercase plain (c). The symbol set of ucag will be mapped to the musical notes, c, d, e and g, so that chorded protein patterns will "come together" in a recognizable way. "Hold him in your armchair you can hear his disease".


Triplet Chord Table

u ->c
c->d
a->e
g->g
u->C
uuu
uuc
uua
uug
Ccc
Ccd
Cce
Cc
g
ucu
ucc
uca
ucg
Cdc
Cdd
Cde
Cd
g
uau
uac
uaa
uag
Cec
Ced
STOP
STOP
ugu
ugc
uga
ugg
Cgc
Cgd
STOP
Cg
g
c->D
cuu
cuc
cua
cug
Dcc
Dcd
Dce
Dc
g
ccu
ccc
cca
ccg
Ddc
Ddd
Dde
Dd
g
cau
cac
caa
cag
Dec
Ded
Dee
De
g
cgu
cgc
cga
cgg
Dgc
Dgd
Dge
Dg
g
a->E
auu
auc
aua
aug
Ecc
Ecd
Ece
Ec
g
acu
acc
aca
acg
Edc
Edd
Ede
Ed
g
aau
aac
aaa
aag
Eec
Eed
Eee
Ee
g
agu
agc
aga
agg
Egc
Egd
Ege
Eg
g
g->G
guu
guc
gua
gug
Gcc
Gcd
Gce
Gc
g
gcu
gcc
gca
gcg
Gdc
Gdd
Gde
Gd
g
gau
gac
gaa
gag
Gec
Ged
Gee
Ge
g
ggu
ggc
gga
ggg
Ggc
Ggd
Gge
Gg
g

 

The fourth note of each triplet is one of the remaining twenty notes assigned to the amino acids.

This application initially needs a window into which to dump the DNA/RNA string.
It needs a table where the ctag, ucag and a-y mapping assignments are located.
The stop codes cause the music to stop.
It needs a play and reverse button. That is all for version one.

Sounding out relationships

When a protein is in a solute field, or is near another protein, each amino acid can be treated as a sound generator. The amino acid is given a unique chord based on its structure. As the protein is moved around, those amino acids closer to the viewer dominate the sound. This enables multiple proteins to be in the scene.