EBV Genomic Sequence (B95-8)
This document contains references, Orfs, and entire genomic sequence
LOCUS EBV 172281 bp DNA circular VRL 29-OCT-1996 DEFINITION Epstein-Barr virus (EBV) genome. The complete sequence [1-10] was determined from DNA from B95-8 cells cloned by Arrand et al [11]. B95-8 is a productive marmoset lymphoblastoid cell line immortalized with human EBV from a mononucleosis patient. ACCESSION V01555 J02070 K01729 K01730 V01554 X00498 X00499 X00784 NID g59074 KEYWORDS DNA polymerase; EBNA; genome; ribonucleotide reductase; tandem repeat; terminal repeat. SOURCE Epstein-Barr virus. ORGANISM Human herpesvirus 4 Viruses; dsDNA viruses, no RNA stage; Herpesviridae; Gammaherpesvirinae; Lymphocryptovirus. REFERENCE 1 (bases 1 to 172281) AUTHORS Baer,R.J., Bankier,A.T., Biggin,M.D., Deininger,P.L., Farrell,P.J., Gibson,T.J., Hatfull,G.F., Hudson,G.S., Satchwell,S.C., Seguin,C., Tuffnell,P.S. and Barrell,B.G. TITLE DNA sequence and expression of the B95-8 Epstein-Barr virus genome JOURNAL Nature 310 (5974), 207-211 (1984) MEDLINE 84270667 REFERENCE 2 (bases 1 to 172281) AUTHORS Deininger,P.L., Bankier,A., Farrell,P., Baer,R. and Barrell,B. TITLE Sequence analysis and in vitro transcription of portions of the Epstein-Barr virus genome JOURNAL J. Cell. Biochem. 19 (3), 267-274 (1982) MEDLINE 83109311 REFERENCE 3 (bases 1 to 172281) AUTHORS Farrell,P.J., Deininger,P.L., Bankier,A. and Barrell,B. TITLE Homologous upstream sequences near Epstein-Barr virus promoters JOURNAL Proc. Natl. Acad. Sci. U.S.A. 80 (6), 1565-1569 (1983) MEDLINE 83169725 REFERENCE 4 (bases 1 to 172281) AUTHORS Farrell,P.J., Bankier,A.T., Seguin,C., Deininger,P.L. and Barrell,B.G. TITLE Latent and lytic cycle promoters of Epstein-Barr virus JOURNAL EMBO J. 2, 1331-1338 (1983) REFERENCE 5 (bases 142687 to 159853) AUTHORS Bankier,A.T., Deininger,P.L., Farrell,P.J. and Barrell,B.G. TITLE Sequence analysis of the 17,166 base-pair EcoRI fragment C of B95-8 Epstein-Barr virus JOURNAL Mol. Biol. Med. 1 (1), 21-45 (1983) MEDLINE 85035713 REFERENCE 6 (bases 112620 to 125316) AUTHORS Seguin,C., Farrell,P.J. and Barrell,B.G. TITLE DNA sequence and transcription of the BamHI fragment B region of B95-8 Epstein-Barr virus JOURNAL Mol. Biol. Med. 1 (3), 369-392 (1983) MEDLINE 85060424 REFERENCE 7 (bases 159853 to 172281) AUTHORS Bankier,A.T., Deininger,P.L., Satchwell,S.C., Baer,R., Farrell,P.J. and Barrell,B.G. TITLE DNA sequence analysis of the EcoRI Dhet fragment of B95-8 Epstein-Barr virus containing the terminal repeat sequences JOURNAL Mol. Biol. Med. 1 (4), 425-445 (1983) MEDLINE 85060428 REFERENCE 8 (bases 87650 to 92703) AUTHORS Biggin,M., Farrell,P.J. and Barrell,B.G. TITLE Transcription and DNA sequence of the BamHI L fragment of B95-8 Epstein-Barr virus JOURNAL EMBO J. 3 (5), 1083-1090 (1984) MEDLINE 84236104 REFERENCE 9 (bases 76089 to 79808) AUTHORS Gibson,T., Stockwell,P., Ginsburg,M. and Barrell,B. TITLE Homology between two EBV early genes and HSV ribonucleotide reductase and 38K genes JOURNAL Nucleic Acids Res. 12 (12), 5087-5099 (1984) MEDLINE 84247360 REFERENCE 10 (bases 1 to 172281) AUTHORS Hatfull,G.F., Barrell,B.G., Quinn,J. and McGeoch,D. JOURNAL Unpublished REFERENCE 11 (bases 1 to 172281) AUTHORS Arrand,J.R., Rymo,L., Walsh,J.E., Bjorck,E., Lindahl,T. and Griffin,B.E. TITLE Molecular cloning of the complete Epstein-Barr virus genome as a set of overlapping restriction endonuclease fragments JOURNAL Nucleic Acids Res. 9 (13), 2999-3014 (1981) MEDLINE 82014887 REFERENCE 12 (bases 1 to 172281) AUTHORS Kozak,M. TITLE Possible role of flanking nucleotides in recognition of the AUG initiator codon by eukaryotic ribosomes JOURNAL Nucleic Acids Res. 9 (20), 5233-5262 (1981) MEDLINE 82059504 REFERENCE 13 (bases 7315 to 9312) AUTHORS Yates,J., Warren,N., Reisman,D. and Sugden,B. TITLE A cis-acting element from the Epstein-Barr viral genome that permits stable replication of recombinant plasmids in latently infected cells JOURNAL Proc. Natl. Acad. Sci. U.S.A. 81 (12), 3806-3810 (1984) MEDLINE 84222045 REFERENCE 14 (bases 45415 to 52824) AUTHORS Jones,M.D., Foster,L., Sheedy,T. and Griffin,B.E. TITLE The EB virus genome in Daudi Burkitt's lymphoma cells has a deletion similar to that observed in a non-transforming strain (P3HR-1) of the virus JOURNAL EMBO J. 3 (4), 813-821 (1984) MEDLINE 84207939 REFERENCE 15 (bases 45644 to 52450) AUTHORS Jeang,K.T. and Hayward,S.D. TITLE Organization of the Epstein-Barr virus DNA molecule. III. Location of the P3HR-1 deletion junction and characterization of the NotI repeat units that form part of the template for an abundant 12-O-tetradecanoylphorbol-13-acetate-induced mRNA transcript JOURNAL J. Virol. 48 (1), 135-148 (1983) MEDLINE 83294686 REFERENCE 16 (bases 1 to 172281) AUTHORS Farrell,P.J. and Barrell,B.G. TITLE Direct Submission JOURNAL Submitted (05-JUN-1984) to the EMBL/GenBank/DDBJ databases REFERENCE 17 (bases 1 to 172281) AUTHORS Bodescot,M. and Perricaudet,M. TITLE Clustered alternative splice sites in Epstein-Barr virus RNAs JOURNAL Nucleic Acids Res. 15 (14), 5887 (1987) MEDLINE 87289053 REFERENCE 18 (bases 1 to 172281) AUTHORS Laux,G., Perricaudet,M. and Farrell,P.J. TITLE A spliced Epstein-Barr virus gene expressed in immortalized lymphocytes is created by circularization of the linear viral genome JOURNAL EMBO J. 7 (3), 769-774 (1988) MEDLINE 88283646 REFERENCE 19 (bases 1 to 172281) AUTHORS Farrell,P.J. TITLE Direct Submission JOURNAL Submitted (18-MAR-1988) Farrell P., Ludwig Institute for Cancer Research, St. Mary's Hospital Medical School, Norfolk Place London W2 1PG COMMENT CDS Listed under this feature are all known protein coding regions as well as all the major open reading frames in the sequence. In general the term major is taken as the longest frame in a particular region taking into account the adjacent longest frames and likely transcription signals. Note that on this basis some long overlapping frames have been excluded and on the other hand some small frames have been included which might represent exons or genes because they occur in a logical combination with other features or because of some other experimental data. The reading frames are named according to the Bam H1 fragment in which they start. eg BALF3 is the third leftward frame starting in Bam H1 fragment A. BORF1 is the first rightward frame in Bam H1 fragment O. If there is an obvious TATA sequence followed by an in frame Met codon that satisfies the rules of Kozak [12] in that there is a purine at -3 and/or a G at +4 then the reading frame is numbered from the A of the ATG to the base preceding the termination codon. If there is no obvious initiation codon or there is a substantial reading frame in phase before the ATG then the reading frame is numbered from the first base of the first codon. SITEs of POLYA signals This feature lists all occurences of the sequence AATAAA which is found normally approximately 20 bases upstream of the mRNA processing/polyA addition site. The rarely used homolog ATTAAA is only listed when it is found in a position close to the end of a major reading frame. SITEs of DONOR and ACCEPT sequences This is not a comprehensive listing of all such sequences and only the positions of a few have been noted because they occur in potentially interesting positions. The number quoted in the table is the position of the terminal base in the intron in each case. Restriction enzyme SITEs. Only the positions of the sites Bam HI (BAM) are listed. RPT This feature is used to define repetitive sequences. SITE DEL This feature defines deletions in B95-8 with respect to other strains such as RAJI and also to deletions in other strains such as P3HR1 and DAUDI with respect to B95-8. SITE HPN Denotes sequences with twofold symmetry ie could form hairpin loops. This is not a comprehensive list - only a few occurences noted. ORGRPL Denotes the region that encompasses an origin of replication (ori P).[13]. NUMBERING The DNA sequence of B95-8 EBV has been revised [19]. The original (Baer et al, 1984) base 359 has been deleted so the new sequence around that position reads TCAGTCTTT. To avoid renumbering the entire sequence, position 1 has benn moved 1 base to the left of the EcoRI site separating EcoRI Dhet from EcoRI I (ie the first A of AGAATTC). FEATURES Location/Qualifiers source 1..172281 /organism="Human herpesvirus 4" /strain="B95-8" mRNA 58..272 /note="exon 2 terminal protein RNA" mRNA 360..458 /note="exon 3 terminal protein RNA" misc_feature complement(535) /note="polyA signal: AATAAA" mRNA 540..788 /note="exon 4 terminal protein RNA" mRNA 871..951 /note="exon 5 terminal protein RNA" mRNA 1026..1196 /note="exon 6 terminal protein RNA" promoter complement(1192) /note="TATA: TATAAAT" mRNA 1280..1495 /note="exon 7 terminal protein RNA" promoter complement(1383) /note="TATA: CATAAAA" mRNA 1574..1682 /note="exon 8 terminal protein RNA" promoter 1676 /note="TATA: TATAAAG" promoter 1691 /note="TATA: TATTAAA BN-R1 late promoter before BNRF1, gives 4.1kb late RNA. Probably encodes non glycosylated 140kd protein in membrane antigen. Also two latent RNAs spliced underneath this RNA, lengths 1.8 and 2.0kb (Hudson et al, 1985). The longer one encodes terminal protein." CDS 1736..5692 /note="BNRF1 reading frame, 5 NXT/S" /codon_start=1 /db_xref="PID:g59075" /db_xref="SWISS-PROT:P03179" /translation="MEERGRETQMPVARYGGPFIMVRLFGQDGEANIQEERLYELLSD PRSALGLDPGPLIAENLLLVALRGTNNDPRPQRQERARELALVGILLGNGEQGEHLGT ESALEASGNNYVYAYGPDWMARPSTWSAEIQQFLRLLGATYVLRVEMGRQFGFEVHRS RPSFRQFQAINHLVLFDNALRKYDSGQVAAGFQRALLVAGPETADTRPDLRKLNEWVF GGRAAGGRQLADELKIVSALRDTYSGHLVLQPTETLDTWKVLSRDTRTAHSLEHGFIH AAGTIQANCPQLFMRRQHPGLFPFVNAIASSLGWYYQTATGPGADARAAARRQQAFQT RAAAECHAKSGVPVVAGFYRTINATLKGGEGLQPTMFNGELGAIKHQALDTVRYDYGH YLIMLGPFQPWSGLTAPPCPYAESSWAQAAVQTALELFSALYPAPCISGYARPPGPSA VIEHLGSLVPKGGLLLFLSHLPDDVKDGLGEMGPARATGPGMQQFVSSYFLNPACSNV FITVRQRGEKINGRTVLQALGRACDMAGCQHYVLGSTVPLGGLNFVNDLASPVSTAEM MDDFSPFFTVEFPPIQEEGASSPVPLDVDESMDISPSYELPWLSLESCLTSILSHPTV GSKEHLVRHTDRVSGGRVAQQPGVGPLDLPLADYAFVAHSQVWTRPGGAPPLPYRTWD RMTEKLLVSAKPGGENVKVSGTVITLGEQGYKVSLDLREGTRLAMAEALLNAACAPIL DPEDVLLTLHLHLDPRRADNSAVMEAMTAASDYARGLGVKLTFGSASCPETGSSASNF MTVVASVSAPGEFSGPLITPVLQKTGSLLIAVRCGDGKIQGGSLFEQLFSDVATTPRA PEALSLKNLFRAVQQLVKSGIVLSGHDISDGGLVTCLVEMALAGQRGVTITMPVASDY LPEMFAEHPGLVFEVEERSVGEVLQTLRSMNMYPAVLGRVGEQGPDQMFEVQHGPETV LRQSLRLLLGTWSSFASEQYECLRPDRINRSMHVSDYGYNEALAVSPLTGKNLSPRRL VTEPDPRCQVAVLCAPGTRGHESLLAAFTNAGCLCRRVFFREVRDNTFLDKYVGLAIG GVHGARDSALAGRATVALINRFPALRDAILKFLNRPDTFSVALGELGVQVLAGLGAVG STDNPPAPGVEVNVQRSPLILAPNASGMFESRWLNISIPATTSSVMLRGLRGCVLPCW VQGSCLGLQFTNLGMPYVLQNAHQIACHFHSNGTDAWRFAMNYPRNPTEQGNIAGLCS RDGRHLALLCDPSLCTDFWQWEHIPPAFGHPTGCSPWTLMFQAAHLWSLRHGRPSE" misc_feature complement(1795) /note="polyA signal: AATAAA" misc_feature 3955 /note="BAM: Bam H1 Nhet/h" misc_feature 3994 /note="BAM: Bam H1 h/C" mRNA 5408..5856 /note="exon 9 terminal protein RNA" misc_feature 5841 /note="polyA signal: AATAAA, end of 4.1kb late RNA and TP latent RNA." misc_feature 5863 /note="alternative end to TP cDNAs" promoter 6097 /note="TATA: TATAAGA" misc_feature 6629..6795 /note="Pol III RNA EBER 1" promoter complement(6823) /note="TATA: CATAAAT" misc_feature 6956..7128 /note="Pol III RNA EBER 2" rep_origin 7315..9312 /note="origin of replication, ori P (Yates et al, 1984, 1985)" repeat_region 7421..8042 /note="21x30bp repeats, binding sites for EBNA-1 (site I, Rawlins et al, 1985). Tandem repeat part of oriP (Reisman et al, 1985). Also functions as a cell type specific enhancer (Reisman et al, 1985; Lupton and Levine, 1985)" promoter 7738 /note="TATA: TATAAAT" promoter 7888 /note="TATA: TATAAAT" promoter 8573 /note="TATA: CATAAAT" misc_feature complement(8680) /note="polyA signal: AATAAA" misc_feature complement(8755) /note="polyA signal: AATAAA" misc_feature 8962 /note="polyA signal: AATAAA" misc_feature 9021..9133 /note="HPN: dyad symmetry, site II for EBNA-1 binding (Rawlins et al, 1985). Dyad symmetry part of oriP (Reisman et al, 1985)" promoter complement(9398) /note="TATA: TATAAAT" promoter 9631 /note="TATA: TATAAAT BC-R1 late promoter before BCRF1" CDS 9675..10187 /note="BCRF1 reading frame" /codon_start=1 /db_xref="PID:g59076" /db_xref="SWISS-PROT:P03180" /translation="MERRLVVTLQCLVLLYLAPECGGTDQCDNFPQMLRDLRDAFSRV KTFFQTKDEVDNLLLKESLLEDFKGYLGCQALSEMIQFYLEEVMPQAENQDPEAKDHV NSLGENLKTLRLRLRRCHRFLPCENKSKAVEQIKNAFNKLQEKGIYKAMSEFDIFINY IEAYMTIKAR" promoter 10076 /note="TATA: GATAAAA" misc_feature complement(10148) /note="polyA signal: AATAAA" misc_feature 10173 /note="polyA signal: ATTAAA" misc_feature 10257 /note="polyA signal: AATAAA, end of 0.8kb late RNA from BCR1 and end of 1.6 kb late RNA, start unknown" misc_feature complement(10277) /note="polyA signal: AATAAA" promoter complement(10975) /note="TATA: CATAAAT" promoter 11305 /note="TATA: TACAAAA; BCR2 promoter for highly spliced EBNA latent RNAs." mRNA 11336..11480 /note="exon C1 of Bodescot et al (1986) RNAs" promoter 11524 /note="TATA: TATAATT" misc_feature complement(11587) /note="polyA signal: AATAAA" promoter complement(11606) /note="TATA: CATAAAT" mRNA 11626..11657 /note="exon C2 of Bodescot et al (1986) RNAs" promoter 11796 /note="TATA: TATAAGT" promoter complement(11799) /note="TATA: TATAAAA" repeat_region 12001..15072 /note="3072 repeat 1" CDS 12541..13689 /note="BCRF2 3072 repeat, reading frame 1; Author-given protein sequence is in conflict with the conceptual translation" /codon_start=1 /db_xref="PID:e25077" /db_xref="PID:g1334836" /translation="VWEAEGRPRPGEVEGDRPGLCWQSPGDPLRPSGPGRSPSAPQTD PRVSRQGPASSGAAGSPPQAPQTRVSASRADRPRAWRLLGASRRGWFCPSLCPSEEPG TSGTPEPLGPASRRPPGLRSPLSPVKPKECLRGATLGAQAPESRGQGHLRVPPRVPGQ PEGPRQPGRPQRPVPRPFPGLQSPGCPPEGTLGVPSPPLQARASPSRRGASLGPQVQP HRDPSGPDPPTGPSLCPPAPLQPSLHPRPQLLASPGPPGQPEGPRQPGRVAFPLPWPL LPASHPSPLSLPPHRVHQAGRRDPGGPVSVPPAAAQSLPPGKGASFSPPSLRPSLLCT VCKVQPPTPVHGSRAQPRPPLPTVDRPSVHPGHPRPPVSTPVPSRGDFM" misc_feature 13215 /note="BAM: BamH1 C/W" promoter 14352 /note="*TATA: TATAAAG BWR1 one of the promoters for highly spliced EBNA and LP RNAs (Sample et al, 1986; Speck et al, 1986)" mRNA 14384..14410 /note="*exon W0 of EBNA/LP RNAs" mRNA 14554..14619 /note="*exon W1 (also W66) part of leader protein (LP) gene. LP is also called EBNA-5 (Dillner et al, 1986) and EBNA-4 (Rowe et al, 1987)." mRNA 14559..14619 /note="*exon W1' (also W61) of EBNA/LP RNAs forms initiator met when fused to exon W0 or exon C2." mRNA 14701..14832 /note="*exon W2 (also W132) part of LP gene" repeat_region 15073..18144 /note="3072 repeat 2" CDS 15613..16761 /note="BWRF1 reading frame 2; Author-given protein sequence is in conflict with the conceptual translation" /codon_start=1 /db_xref="PID:e25078" /db_xref="PID:g1334837" /translation="VWEAEGRPRPGEVEGDRPGLCWQSPGDPLRPSGPGRSPSAPQTD PRVSRQGPASSGAAGSPPQAPQTRVSASRADRPRAWRLLGASRRGWFCPSLCPSEEPG TSGTPEPLGPASRRPPGLRSPLSPVKPKECLRGATLGAQAPESRGQGHLRVPPRVPGQ PEGPRQPGRPQRPVPRPFPGLQSPGCPPEGTLGVPSPPLQARASPSRRGASLGPQVQP HRDPSGPDPPTGPSLCPPAPLQPSLHPRPQLLASPGPPGQPEGPRQPGRVAFPLPWPL LPASHPSPLSLPPHRVHQAGRRDPGGPVSVPPAAAQSLPPGKGASFSPPSLRPSLLCT VCKVQPPTPVHGSRAQPRPPLPTVDRPSVHPGHPRPPVSTPVPSRGDFM" misc_feature 16287 /note="BAM: BamH1 W/W" promoter 17424 /note="TATA: TATAAAG" mRNA 17626..17691 /note="Exon W1" mRNA 17773..17904 /note="Exon W2" repeat_region 18145..21216 /note="3072 repeat 3" CDS 18685..19833 /note="BWRF1 reading frame 3; Author-given protein sequence is in conflict with the conceptual translation" /codon_start=1 /db_xref="PID:e25079" /db_xref="PID:g1334838" /translation="VWEAEGRPRPGEVEGDRPGLCWQSPGDPLRPSGPGRSPSAPQTD PRVSRQGPASSGAAGSPPQAPQTRVSASRADRPRAWRLLGASRRGWFCPSLCPSEEPG TSGTPEPLGPASRRPPGLRSPLSPVKPKECLRGATLGAQAPESRGQGHLRVPPRVPGQ PEGPRQPGRPQRPVPRPFPGLQSPGCPPEGTLGVPSPPLQARASPSRRGASLGPQVQP HRDPSGPDPPTGPSLCPPAPLQPSLHPRPQLLASPGPPGQPEGPRQPGRVAFPLPWPL LPASHPSPLSLPPHRVHQAGRRDPGGPVSVPPAAAQSLPPGKGASFSPPSLRPSLLCT VCKVQPPTPVHGSRAQPRPPLPTVDRPSVHPGHPRPPVSTPVPSRGDFM" misc_feature 19359 /note="BAM: BamH1 W/W" promoter 20496 /note="TATA: TATAAAG" mRNA 20698..20763 /note="Exon W1" mRNA 20845..20976 /note="Exon W2" repeat_region 21217..24288 /note="3072 repeat 4" CDS 21757..22905 /note="BWRF1 reading frame 4; Author-given protein sequence is in conflict with the conceptual translation" /codon_start=1 /db_xref="PID:e25080" /db_xref="PID:g1334839" /translation="VWEAEGRPRPGEVEGDRPGLCWQSPGDPLRPSGPGRSPSAPQTD PRVSRQGPASSGAAGSPPQAPQTRVSASRADRPRAWRLLGASRRGWFCPSLCPSEEPG TSGTPEPLGPASRRPPGLRSPLSPVKPKECLRGATLGAQAPESRGQGHLRVPPRVPGQ PEGPRQPGRPQRPVPRPFPGLQSPGCPPEGTLGVPSPPLQARASPSRRGASLGPQVQP HRDPSGPDPPTGPSLCPPAPLQPSLHPRPQLLASPGPPGQPEGPRQPGRVAFPLPWPL LPASHPSPLSLPPHRVHQAGRRDPGGPVSVPPAAAQSLPPGKGASFSPPSLRPSLLCT VCKVQPPTPVHGSRAQPRPPLPTVDRPSVHPGHPRPPVSTPVPSRGDFM" misc_feature 22431 /note="BAM: BamH1 W/W" promoter 23568 /note="TATA: TATAAAG" mRNA 23771..23835 /note="Exon W1" mRNA 23917..24048 /note="Exon W2" repeat_region 24289..27360 /note="3072 repeat 5" CDS 24829..25977 /note="BWRF1 reading frame 5; Author-given protein sequence is in conflict with the conceptual translation" /codon_start=1 /db_xref="PID:e25081" /db_xref="PID:g1334840" /translation="VWEAEGRPRPGEVEGDRPGLCWQSPGDPLRPSGPGRSPSAPQTD PRVSRQGPASSGAAGSPPQAPQTRVSASRADRPRAWRLLGASRRGWFCPSLCPSEEPG TSGTPEPLGPASRRPPGLRSPLSPVKPKECLRGATLGAQAPESRGQGHLRVPPRVPGQ PEGPRQPGRPQRPVPRPFPGLQSPGCPPEGTLGVPSPPLQARASPSRRGASLGPQVQP HRDPSGPDPPTGPSLCPPAPLQPSLHPRPQLLASPGPPGQPEGPRQPGRVAFPLPWPL LPASHPSPLSLPPHRVHQAGRRDPGGPVSVPPAAAQSLPPGKGASFSPPSLRPSLLCT VCKVQPPTPVHGSRAQPRPPLPTVDRPSVHPGHPRPPVSTPVPSRGDFM" misc_feature 25503 /note="BAM: BamH1 W/W" promoter 26640 /note="TATA: TATAAAG" mRNA 26842..26907 /note="Exon W1" mRNA 26989..27120 /note="Exon W2" repeat_region 27361..30432 /note="3072 repeat 6" CDS 27901..29049 /note="BWRF1 reading frame 6; Author-given protein sequence is in conflict with the conceptual translation" /codon_start=1 /db_xref="PID:e25066" /db_xref="PID:g1334841" /translation="VWEAEGRPRPGEVEGDRPGLCWQSPGDPLRPSGPGRSPSAPQTD PRVSRQGPASSGAAGSPPQAPQTRVSASRADRPRAWRLLGASRRGWFCPSLCPSEEPG TSGTPEPLGPASRRPPGLRSPLSPVKPKECLRGATLGAQAPESRGQGHLRVPPRVPGQ PEGPRQPGRPQRPVPRPFPGLQSPGCPPEGTLGVPSPPLQARASPSRRGASLGPQVQP HRDPSGPDPPTGPSLCPPAPLQPSLHPRPQLLASPGPPGQPEGPRQPGRVAFPLPWPL LPASHPSPLSLPPHRVHQAGRRDPGGPVSVPPAAAQSLPPGKGASFSPPSLRPSLLCT VCKVQPPTPVHGSRAQPRPPLPTVDRPSVHPGHPRPPVSTPVPSRGDFM" misc_feature 28575 /note="BAM: BamH1 W/W" promoter 29712 /note="TATA: TATAAAG" mRNA 29914..29979 /note="Exon W1" mRNA 30061..30192 /note="Exon W2" repeat_region 30433..33504 /note="3072 repeat 7" CDS 30973..32121 /note="BWRF1 reading frame 7; Author-given protein sequence is in conflict with the conceptual translation" /codon_start=1 /db_xref="PID:e25067" /db_xref="PID:g1334842" /translation="VWEAEGRPRPGEVEGDRPGLCWQSPGDPLRPSGPGRSPSAPQTD PRVSRQGPASSGAAGSPPQAPQTRVSASRADRPRAWRLLGASRRGWFCPSLCPSEEPG TSGTPEPLGPASRRPPGLRSPLSPVKPKECLRGATLGAQAPESRGQGHLRVPPRVPGQ PEGPRQPGRPQRPVPRPFPGLQSPGCPPEGTLGVPSPPLQARASPSRRGASLGPQVQP HRDPSGPDPPTGPSLCPPAPLQPSLHPRPQLLASPGPPGQPEGPRQPGRVAFPLPWPL LPASHPSPLSLPPHRVHQAGRRDPGGPVSVPPAAAQSLPPGKGASFSPPSLRPSLLCT VCKVQPPTPVHGSRAQPRPPLPTVDRPSVHPGHPRPPVSTPVPSRGDFM" misc_feature 31647 /note="BAM: BamH1 W/W" promoter 32784 /note="TATA: TATAAAG" mRNA 32986..33051 /note="Exon W1" mRNA 33133..33264 /note="Exon W2" repeat_region 33505..36576 /note="3072 repeat 8" CDS 34045..35193 /note="BWRF1 reading frame 8; Author-given protein sequence is in conflict with the conceptual translation" /codon_start=1 /db_xref="PID:e25068" /db_xref="PID:g1334843" /translation="VWEAEGRPRPGEVEGDRPGLCWQSPGDPLRPSGPGRSPSAPQTD PRVSRQGPASSGAAGSPPQAPQTRVSASRADRPRAWRLLGASRRGWFCPSLCPSEEPG TSGTPEPLGPASRRPPGLRSPLSPVKPKECLRGATLGAQAPESRGQGHLRVPPRVPGQ PEGPRQPGRPQRPVPRPFPGLQSPGCPPEGTLGVPSPPLQARASPSRRGASLGPQVQP HRDPSGPDPPTGPSLCPPAPLQPSLHPRPQLLASPGPPGQPEGPRQPGRVAFPLPWPL LPASHPSPLSLPPHRVHQAGRRDPGGPVSVPPAAAQSLPPGKGASFSPPSLRPSLLCT VCKVQPPTPVHGSRAQPRPPLPTVDRPSVHPGHPRPPVSTPVPSRGDFM" misc_feature 34719 /note="BAM: BamH1 W/W" promoter 35856 /note="TATA: TATAAAG" mRNA 36058..36123 /note="Exon W1" mRNA 36205..36336 /note="Exon W2" repeat_region 36577..39648 /note="3072 repeat 9" CDS 37117..38265 /note="BWRF1 reading frame 9; Author-given protein sequence is in conflict with the conceptual translation" /codon_start=1 /db_xref="PID:e25069" /db_xref="PID:g1334844" /translation="VWEAEGRPRPGEVEGDRPGLCWQSPGDPLRPSGPGRSPSAPQTD PRVSRQGPASSGAAGSPPQAPQTRVSASRADRPRAWRLLGASRRGWFCPSLCPSEEPG TSGTPEPLGPASRRPPGLRSPLSPVKPKECLRGATLGAQAPESRGQGHLRVPPRVPGQ PEGPRQPGRPQRPVPRPFPGLQSPGCPPEGTLGVPSPPLQARASPSRRGASLGPQVQP HRDPSGPDPPTGPSLCPPAPLQPSLHPRPQLLASPGPPGQPEGPRQPGRVAFPLPWPL LPASHPSPLSLPPHRVHQAGRRDPGGPVSVPPAAAQSLPPGKGASFSPPSLRPSLLCT VCKVQPPTPVHGSRAQPRPPLPTVDRPSVHPGHPRPPVSTPVPSRGDFM" misc_feature 37791 /note="BAM: BamH1 W/W" promoter 38928 /note="TATA: TATAAAG" mRNA 39130..39195 /note="Exon W1" mRNA 39277..39408 /note="Exon W2" repeat_region 39649..42720 /note="3072 repeat 10" CDS 40189..41337 /note="BWRF1 reading frame 10; Author-given protein sequence is in conflict with the conceptual translation" /codon_start=1 /db_xref="PID:e25070" /db_xref="PID:g1334845" /translation="VWEAEGRPRPGEVEGDRPGLCWQSPGDPLRPSGPGRSPSAPQTD PRVSRQGPASSGAAGSPPQAPQTRVSASRADRPRAWRLLGASRRGWFCPSLCPSEEPG TSGTPEPLGPASRRPPGLRSPLSPVKPKECLRGATLGAQAPESRGQGHLRVPPRVPGQ PEGPRQPGRPQRPVPRPFPGLQSPGCPPEGTLGVPSPPLQARASPSRRGASLGPQVQP HRDPSGPDPPTGPSLCPPAPLQPSLHPRPQLLASPGPPGQPEGPRQPGRVAFPLPWPL LPASHPSPLSLPPHRVHQAGRRDPGGPVSVPPAAAQSLPPGKGASFSPPSLRPSLLCT VCKVQPPTPVHGSRAQPRPPLPTVDRPSVHPGHPRPPVSTPVPSRGDFM" misc_feature 40863 /note="BAM: BamH1 W/W" promoter 42000 /note="TATA: TATAAAG" mRNA 42202..42267 /note="Exon W1" mRNA 42349..42480 /note="Exon W2" repeat_region 42721..45792 /note="3072 repeat 11" CDS 43261..44409 /note="BWRF1 reading frame 11; Author-given protein sequence is in conflict with the conceptual translation" /codon_start=1 /db_xref="PID:e25071" /db_xref="PID:g1334846" /translation="VWEAEGRPRPGEVEGDRPGLCWQSPGDPLRPSGPGRSPSAPQTD PRVSRQGPASSGAAGSPPQAPQTRVSASRADRPRAWRLLGASRRGWFCPSLCPSEEPG TSGTPEPLGPASRRPPGLRSPLSPVKPKECLRGATLGAQAPESRGQGHLRVPPRVPGQ PEGPRQPGRPQRPVPRPFPGLQSPGCPPEGTLGVPSPPLQARASPSRRGASLGPQVQP HRDPSGPDPPTGPSLCPPAPLQPSLHPRPQLLASPGPPGQPEGPRQPGRVAFPLPWPL LPASHPSPLSLPPHRVHQAGRRDPGGPVSVPPAAAQSLPPGKGASFSPPSLRPSLLCT VCKVQPPTPVHGSRAQPRPPLPTVDRPSVHPGHPRPPVSTPVPSRGDFM" misc_feature 43935 /note="BAM: BamH1 W/W" promoter 45072 /note="TATA: TATAAAG" mRNA 45274..45339 /note="Exon W1" misc_feature 45415..52824 /note="DEL: DAUDI deletion (Jones et al, 1984)" mRNA 45421..45552 /note="Exon W2" misc_feature 45644..52450 /note="DEL: P3HR1 deletion (Jeang and Hayward, 1983)" repeat_region 45793..47643 /note="3072 repeat 12" CDS 46333..47481 /note="BWRF1 reading frame 12; Author-given protein sequence is in conflict with the conceptual translation" /codon_start=1 /db_xref="PID:e25072" /db_xref="PID:g1334847" /translation="VWEAEGRPRPGEVEGDRPGLCWQSPGDPLRPSGPGRSPSAPQTD PRVSRQGPASSGAAGSPPQAPQTRVSASRADRPRAWRLLGASRRGWFCPSLCPSEEPG TSGTPEPLGPASRRPPGLRSPLSPVKPKECLRGATLGAQAPESRGQGHLRVPPRVPGQ PEGPRQPGRPQRPVPRPFPGLQSPGCPPEGTLGVPSPPLQARASPSRRGASLGPQVQP HRDPSGPDPPTGPSLCPPAPLQPSLHPRPQLLASPGPPGQPEGPRQPGRVAFPLPWPL LPASHPSPLSLPPHRVHQAGRRDPGGPVSVPPAAAQSLPPGKGASFSPPSLRPSLLCT VCKVQPPTPVHGSRAQPRPPLPTVDRPSVHPGHPRPPVSTPVPSRGDFM" misc_feature 47007 /note="BAM: BamH1 W/Y" mRNA 47761..47793 /note="Exon Y1 Bodescot et al, 1984" promoter 47831 /note="TATA: TATAAGT" mRNA 47878..47999 /note="Exon Y2 Bodescot et al, 1984 and EBNA-1 (Speck and Strominger,1985), last common exon" misc_feature complement(48023) /note="polyA signal: AATAAA" mRNA 48386..48444 /note="exon Bodescot et al, 1984" CDS 48386..50032 /note="Coding exon for EBNA-2 (Sample et al,1986)" /codon_start=1 /db_xref="PID:e25073" CDS 48504..49967 /note="BYRF1, encodes EBNA-2 (Dambaugh et al, 1984; Dillner et al, 1984)" /codon_start=1 /db_xref="PID:e25074" /db_xref="PID:g1632787" /db_xref="SWISS-PROT:P12978" /translation="MPTFYLALHGGQTYHLIVDTDSLGNPSLSVIPSNPYQEQLSDTP LIPLTIFVGENTGVPPPLPPPPPPPPPPPPPPPPPPPPPPPPPPSPPPPPPPPPPPQR RDAWTQEPSPLDRDPLGYDVGHGPLASAMRMLWMANYIVRQSRGDRGLILPQGPQTAP QARLVQPHVPPLRPTAPTILSPLSQPRLTPPQPLMMPPRPTPPTPLPPATLTVPPRPT RPTTLPPTPLLTVLQRPTELQPTPSPPRMHLPVLHVPDQSMHPLTHQSTPNDPDSPEP RSPTVFYNIPPMPLPPSQLPPPAAPAQPPPGVINDQQLHHLPSGPPWWPPICDPPQPS KTQGQSRGQSRGRGRGRGRGRGKGKSRDKQRKPGGPWRPEPNTSSPSMPELSPVLGLH QGQGAGDSPTPGPSNAAPVCRNSHTATPNVSPIHEPESHNSPEAPILFPDDWYPPSID PADLDESWDYIFETTESPSSDEDYVEGPSKRPRPSIQ" repeat_region 48678..48800 /note="14 x CCCCCACCA repeats" misc_feature 48848 /note="BAM: BamH1 Y/H" promoter 49350 /note="TATA: TATAACA" promoter complement(49353) /note="TATA: TATAAAA" repeat_region 49525..49578 /note="9 x GGGGCA repeats" mRNA 49852..50032 /note="exon (Bodescot et al 1984)" misc_feature 50003 /note="polyA signal: AATAAA, end of Bodescot T1 RNA and EBNA-2 RNA (3.0kb latent RNA in IB4 cells)" promoter complement(50156) /note="TATA: TATAAGT" misc_feature complement(50317) /note="polyA signal: AATAAA, end of 2.5kb early RNA from 52817" misc_feature complement(50578..52557) /note="BHLF1 early reading frame" repeat_region 50578..52115 /note="12 x 125bp repeats" misc_feature 52654..53697 /note="region homologous to Eco R1 C of Raji" promoter complement(52817) /note="TATA: GATAAAA promoter for 2.5kb early RNA containing BHLF1 (Jeang and Hayward, 1983; Freese et al,1983)" promoter 53759 /note="TATA: TATTAAC likely promoter for class III and IV early RNAs encoding BHRF1 (Pearson et al, 1987)" misc_feature 53895 /note="DONOR: CGGGTAACT donor for splice to 54335 in class IV early RNAs encoding BHRF1 (Pearson et al, 1987)" misc_feature 54335 /note="ACCEPT: TTTTCTAG acceptor from 48444 in class I, 47999 in class II, and 53895 in class IV early RNAs encoding BHRF1 (Pearson et al, 1987)" misc_feature 54376..54948 /note="BHRF1 reading frame, limited homolgy to bcl-2 gene. Early gene in B95-8 cells and part of restricted EA complex." promoter 54591 /note="TATA: TATAACA" promoter complement(54594) /note="TATA: TATAAAT" misc_feature 54853 /note="BAM: BamH1 H/F" misc_feature complement(54929) /note="polyA signal: AATAAA" promoter complement(54977) /note="TATA: TATAAAG" misc_feature 55518 /note="polyA signal: AATAAA, 3' end of 2.5kb, 1.9kb, 1.7kb and 0.6kb early RNAs" misc_feature complement(55982..56935) /note="BFLF2 reading frame, 4 NXT/S, homologous to RF 27 in VZV and HFRF2 in CMV" misc_feature complement(55990) /note="polyA signal: AATAAA, 3' end of 2.3kb and 1.1kb early RNAs from 58568 and 57081" promoter complement(56132) /note="TATA: TATAAAG" CDS complement(56948..58525) /note="BFLF1 reading frame, 2 NXT/S homologous to RF 26 in VZV and HFRF1 in CMV" /codon_start=1 /db_xref="PID:g1334849" /db_xref="SWISS-PROT:P03184" /translation="MAHKVTSANEPNPLTGKRLSSCPLTRSGVTEVAQIAGRTPKMED FVPWTVDNLKSQFEAVGLLMAHSYLPANAEEGIAYPPLVHTYESLSPASTCRVCDLLD TLVNHSDAPVAFFEDYALLCYYCLNAPRAWISSLITGMDFLHILIKYFPMAGGLDSLF MPSRILAIDIQLHFYICRCFLPVSSSDMIRNANLGYYKLEFLKSILTGQSPANFCFKS MWPRTTPTFLTLPGPRTCKDSQDVPGDVGRGLYTALCCHLPTRNRVQHPFLRAEKGGL SPEITTKADYCGLLLGTWQGTDLLGGPGHHAIGLNAEYSGDELAELALAITRPEAGDH SQGPCLLAPMFGLRHKNASRTICPLCESLGAHPDAKDTLDRFKSLILDSFGNNIKILD RIVFLIKTQNTLLDVPCPRLRAWLQMCTPQDFHKHLFCDPLCAINHSITNPSVLFGQI YPPSFQAFKAALAAGQNLEQGVCDSLITLVYIFKSTQVARVGKTILVDVTKELDVVLR IHGLDLVQSYQTSQVYV" promoter complement(57081) /note="TATA: TATTTAA before BFLF2; BFL2 promoter gives 1.1kb early RNA" promoter complement(58088) /note="TATA: GATAAAA" promoter complement(58568) /note="TATA: TATTAAA before BFLF1, BFL1 promoter gives 2.3kb early RNA" promoter 58832 /note="TATA: TATAAAA before BFRF1" CDS 58891..59901 /note="BFRF1 early reading frame, 1 NXT/S, homologous to HFLF4 in CMV" /codon_start=1 /db_xref="PID:g1334850" /db_xref="SWISS-PROT:P03185" /translation="MASPEERLLDELNNVIVSFLCDSGSLEVERCSGAHVFSRGSSQP LCTVKLRHGQIYHLEFVYKFLAFKLKNCNYPSSPVFVISNNGLATTLRCFLHEPSGLR SGQSGPCLGLSTDVDLPKNSIIMLGQDDFIKFKSPLVFPAELDLLKSMVVCRAYITEH RTTMQFLVFQAANAQKASRVMDMISDMSQQLSRSGQVEDTGARVTGGGGPRPGVTHSG CLGDSHVRGRGGWDLDNFSEAETEDEASYAPWRDKDSWSESEAAPWKKELVRHPIRRH RTRETRRMRGSHSRVEHVPPETRETVVGGAWRYSWRATPYLARVLAVTAVALLLMFLR WT" CDS 59808..61583 /note="BFRF2 early reading frame, homologous to HFLF5 in CMV" /codon_start=1 /db_xref="PID:e25058" /db_xref="PID:g1632788" /db_xref="SWISS-PROT:P14347" /translation="MALFLARHTLSGTGAGCHGRGPAPDVSEVDLTLQALGERGFSRL LDLGLACLDLSYVEMREFVVWGRPPASEAAVASTPGSLFRSHSSAYWLSEVERPGGLV RWARSQTSPSSLTLAPHLGPSLLSLSVVTGGGCGAVAFCNAFFLAYFLVVRSVFPAFS DRIAAWICDRSPFCENTRAVARGYRGLVKRFLAFVFERSSYDPPLLRQNSRPVERCFA IKNYVPGLDSQSCVTVPSFSRWAQSHASELDPREIRDRVTPATAPSFVADHASALLAS LQKKASDTPCGNPIQWMWYRLLVNSCLRSAHCLLPIPAVSEGGRKTGGGVGEELVGAG GPCLSRDVFVAIVSRNVLSCLLNVPAAGPRAYKCFRSHASRPVSGPDYPPLAVFCMDC GYCLNFGKQTGVGGRLNSFRPTLQFYPRDQKEKHVLTCHASGRVYCSNCGSAAVGCQR LAEPPSARSGWRPRIRAVLPHNAAYELDRGSRLLDAIIPCLGPDRTCMRPVVLRGVTV RQLLYLTLRTEARAVCSICQQRQAPEDARDEPHLFSSCLEVELPPGERCAGCRLYQTR YGTPAAQAHPPGEAGGGFSRQSPAS" promoter complement(61062) /note="TATA: GATAAAA" promoter 61344 /note="TATA: TATTTAA before BFRF3" CDS 61507..62037 /note="BFRF3 early reading frame" /codon_start=1 /db_xref="PID:e25059" /db_xref="PID:g1632789" /db_xref="SWISS-PROT:P14348" /translation="MARRLPKPTLQGRLEADFPDSPLLPKFQELNQNNLPNDVFREAQ RSYLVFLTSQFCYEEYVQRTFGVPRRQRAIDKRQRASVAGAGAHAHLGGSSATPVQQA QAAASAGTGALASSAPSTAVAQSATPSVSSSISSLRAATSGATAAASAAAAVDTGSGG GGQPHDTAPRGARKKQ" misc_feature complement(62068) /note="polyA signal: AATAAA" misc_feature 62069 /note="polyA signal: AATAAA, 3' end of 10, 6.5, 3.7, 3.4, 3.1, 2.5 and 0.8kb early RNAs" CDS complement(62078..71527) /note="BPLF1 reading frame, 1 NXT/S, analogous to VZV RF22" /codon_start=1 /db_xref="PID:g1334853" /db_xref="SWISS-PROT:P03186" /translation="MSNGDWGQSQRTRGTGPVRGIRTMDVNAPGGGSGGSALRILGTA SCNQAHCKFGRFAGIQCVSNCVLYLVKSFLAGRPLTSRPELDEVLDEGARLDALMRQS GILKGHEMAQLTDVPSSVVLRGGGRVHIYRSAEIFGLVLFPAQIANSAVVQSLAEVLH GSYNGVAQFILYICDIYAGAIIIETDGSFYLFDPHCQKDAAPGTPAHVRVSTYAHDIL QYVGAPGAQYTCVHLYFLPEAFETEDPRIFMLEHYGVYDFYEANGSGFDLVGPELVSS DGEAAGTPGADSSPPVMLPFERRIIPYNLRPLPSRSFTSDSFPAARYSPAKTNSPPSS PASAAPASAAPASAAPASAAPASAAPASAAPASAAPASAAPASSPPLFIPIPGLGHTP GVPAPSTPPRASSGAAPQTPKRKKGLGKDSPHKKPTSGRRLPLSSTTDTEDDQLPRTH VPPHRPPSAARLPPPVIPIPHQSPPASPTPHPAPVSTIAPSVTPSPRLPLQIPIPLPQ AAPSNPKIPLTTPSPSPTAAAAPTTTTLSPPPTQQQPPQSAAPAPSPLLPQQQPTPSA APAPSPLLPQQQPPPSAARAPSPLPPQQQPLPSATPAPPPAQQLPPSATTLEPEKNHP PAADRAGTEISPSPPFGQQPSFGDDASGGSGLVRYLSDLEEPFLSMSDSEEAESDLAS DIPTTEDEDMFEDEVFSNSLESGSSAPTSPITLDTARSQYYQTTFDIETPEMDFVPLE SNIARIAGHTYQEQAIVYDPASNREVPEADALSMIDYLLVTVVLEQGLIRSRDRSSVL NLLEFLKDWSGHLQVPTLDLEQLLTSELNIQNLANMLSENKGRAGEFHKHLAAKLEAC LPSLATKDAVRVDAGAKMLAEIPQLAESDDGKFDLEAARRRLTDLLSGGDQEAGEGGG EPEDNSIYRGPHVDVPLVLDDESWKRLLSLAEAARTAVARQQAGVDEEDVRFLALLTA IEYGAPPAASVPPFVHNVAVRSKNAALHVRRCTADIRDKVASAASDYLSYLEDPSLPT VMDFDDLLTHLRHTCQIIASLPLLNIRYTSIEWDYRELLYLGTALSDMSGIPWPLERV EEDDPSIAPLPEFETVAKKQKELETTRENEKRLRTILDDIEAMLGLAGVASAPGAPIS PASPSATPANHDNPEATPPLADTAALTIPVIEKYIANAGSIVGAAKNPTYIRLRDTIQ QIVRSKKYLMNILKSITFYTIDNYIASFEESIDHLYRDLPVLDPEVQDGIDRILDPMV SEALHTFEMGNRLTLEPARLVALQNFATHSTLKETAAAVNLLPGLLAVYDATITGQAP EDALRLLSGLQNQLSQTLIPGKLKKRFLSYLQKLKNNNNDQLRQKEVQAWRLEAEGFK PATEEQLEAFLDTAPNKELKRQYEKKLRQLMETGRKEKEKLREQEDKERQERRAREAN EAWARIRKALGARPEPAPTSPDDWNTLLASLLPDNTDSAAAAAAAVARNTDILDSLTQ ILAAMLLGITRVRRERLRSLLVDDGGAAERMEAAEPGWFTDIETGPLARLDAWPATPA ATAKEGGGGRGAEEAAGALFRARTAADAIRSALAQTRQALQSPDMKSAVVNTDLEAPY AEYERGLAGLLEKRRAAEAALTAIVSEYVDRTLPEATNDPGQANLPPPPTIPQATAPP RLASDSALWPKKPQLLTRRERDDLLQATGDFFSELLTEAEAAEVRALEEQVRESQTLM AKAHEMAASTRRGFHTALEAVLSRSRDEAPDDELRSLLPSPPKAPVQAPLEAALARAA AGNGSWPYRKSLAAAKWIRGICEAVRGLSEGALALAGGAGAWLNLAAAADGEIHELTR LLEVEGMAQNSMDGMEELRLALATLDPKRVAGGKETVADWKRRLSRLEAIIQEAQEES QLQGTLQDLVTQARGHTDPRQLKIVVEAARGLALGASAGSQYALLKDKLLRYASAKQS FLAFYETAQPTVFVKHPLTNNLPLLITISAPPTGWGNGAPTRRAQFLAAAGPAKYAGT LWLETESPCDPLNPAYVSADTQEPLNYIPVYHNFLEYVMPTVLENPEAFSLTPAGRPQ AIGPPQDDQERRRRTLASVASARLSAAAADSYWDTWPDVESNAGELLREYVSAPKALM EDLADNPIVAMTLLAHASLIASRNHPPYPAPATDREVILLEQREMMALLVGTHPAYAA AFLGAPSFYAGLGLVSALARDGGLGDLLSDSVLTYRLVRSPASGRGGMPSTTRGSNDG EDARRLTRHRIAGPPTGFIFFQDAWEEMDTRAALWPHPEFLGLVHNQSTARARACMLL LARRCFAPEALQQLWHSLRPLEGPVAFQDYLRDFVKQAYTRGEELPRAEGLEVPRETP SSYGTVTGRALRNLMPYGTPITGPKRGSGDTIPVSVFEAAVAAAFLGRPLTLFVSSQY LFNLKTLGQVRVVAPLLYCDGHSEPFRSLVETISLNFLQDLDGYSESFEPEMSIFARQ AVWLRELLTEARAAKPKEARPPTVAILANRKNIIWKCFTYRHNLPDVQFYFNAAGASR WPTDVLNPSFYEHEDPPLPVGYQLPPNPRNVQELFSGFPPRVGHGLVSGDGFQSADNT PASSDRLQQLGGGETDQGEKGSTTAESEASGPPSPQSPLLEKVAPGRPRDWLSPTSSP RDVTVTPGLAAPITLPGPRLMARPYFGAETRASESPDRSPGSSPRPWPKDSLELLPQP APQQPPSSPWASEQGPIVYTLSPHSTPSTASGSQKKHTIQIPGLVPSQKPSYPPSAPY KPGQSTGGIAPTPSAASLTTFGLQPQDTQASSQDPPYGHSIMQREKKQQGGREEAAEI RPSATRLPTAVGLRPRAPVVAAGAAASATPAFDPGEAPSGFPIPQAPALGSGLAAPAH TPVGALAPRPQKTQAQRPQDAAALPTPTIKAVGARPVPKATGALAAGARPRGQPTAAP PSAASPPRVSLPVRSRQQQSPAIPLPPMHSGSEPGARPEVRLSQYRHAGPQTYTVRKE APPSAASQLPKMPKCKDSMYYPPSGSARYPAPFQALSFSQSVASPAPSSDQTTLLWNT PSVVTQFLSIEDIIREVVTGGSTSGDLVVPSGSPSSLSTAAPEQDLRYSLTLSQASRV LSRFVSQLRRKLERSTHRLIADLERLKFLYL" misc_feature 62249 /note="BAM: BamH1 F/Q" misc_feature 62430..62477 /note="Site III for EBNA-1 binding (Rawlins et al, 1985)" misc_feature 66121 /note="BAM: BamH1 Q/U" mRNA 67477..67649 /note="Exon in EBNA-1 RNA (Speck and Strominger,1985) and cDNA clone T4 (Bodescot et al, 1986)" misc_feature 69410 /note="BAM: BamH1 U/P" repeat_region 69684..69930 /note="5 x 51bp repeats" repeat_region 70387..70521 /note="9 x 15bp repeat" promoter 70750 /note="TATA: CATAAAA" CDS complement(71520..75239) /note="BOLF1 reading frame, 1 NXT/S analogous to VZV RF 21" /codon_start=1 /db_xref="PID:g1334855" /db_xref="SWISS-PROT:P03189" /translation="MASAMESDSSGGSGGADAQPPLAEVDGGLARVTRQLLLSGDDPA ARLRALMPLELGIFGLGDLAQPVLVRDFLNTLTLMSGHAYPAAVLRHHAYYLLRAASF SRRSFGLGHLEAALDVLASSLPPTTASPATDDPLDGSRLIAETRALAAAYRRIIEEGS GEVLAVSGPTATFAFVEELVADTYLARWDAFPREGLSFYAFNAAKTTLGRWLVTVYAE TNRYPWAAAGQGQPTAADIKAMAVELVEHSGGGAGGGEGEESGGGLFHRPESLSSVVA SLPLARRRAVEILGVYAEASGGQTPPVAAVPVLAFDAARLRLLEPSGALFYDYVYEAL LWDQTYGVPDSVIEAFLAGMAAEMEALAARVQEAAGSRASFSPAAIEQVATVLLSAGL NETVAGDYAMMLASVPRVSRSRWRWLEATAALLESLSGFALHFFRLLPTASPTSRFAR VARAAYLRAEAEAVDRRARRTSGPSTPAAAPAATAVGVGAAADPWDAVTPLRIFIVPP PAAEYEQVAGDLSSELLRSLLWVRYSRLWQAPAPAPALPCKPPLLPGEQGRRQWTAAV AAAPRTDVEAYCRSLRAGQTARADPAYVHSPFFPAAFIEFQIWPALRRVLSNELPKTR SLAALRWLVSFGSDLALPSPELTRARRPLELIYATVWEIYDGAPPMPGESPQAVGLRP LNLEGEGKAGDAGAEGAEDEEGGGPWGLSSHDAVLRIMDAVREVSGIISETISASERA AEAPPLAWPTSLFSLLFTLRYSTTAESLGLATRRFLVSGETLSEDISRLTGAAWRLCS RPLLYDAETGRVQIPLATEEEEEAVVAVKEKSVSSSPRHYSTDLQTLKSVVEGIQDVC RDAAARWALATADTATLRRRLLVPALRESRGIADHPLWAHTSEPLRPDLEELNERVEH ALELGYSLTGALRRSVAYRFRDYTFARLFQPPAIDAERAEAIVRRDARPPPVFIPAPR RLPQGGADTPPPLSMDDILYLGKSICKALVDVLDHHPAAPETTPIKTYTPAMDLNPEQ ITVTPRSPSVLAAFARTARVQTHHLVPALTDDSPSPVGQTPPPFRILPAKKLAAILLG NGRNASKRRASRDLSPPPHGRWRAVLDSSPFSFSSSDFSDQDEGEGGEADLRGVPGGG GEGAYEEDRERPSDIDTAARAQKVETSCPRRRSPRTTPSPSRRASGGGGPDRGEAEAH TYPPYLSAAAAASRVRPRTRRGATRRPPRPTAEDE" promoter complement(72192) /note="TATA: TATTAAA before BPLF1" misc_feature 73468 /note="BAM: BamH1 P/O" promoter 75017 /note="TATA: TATTTAA BO-R1 late promoter before BORF1, gives 3.9kb late RNA" CDS 75238..76332 /note="BORF1 late reading frame, 2 NXT/S homologous to VZV RF20" /codon_start=1 /db_xref="PID:g1334854" /db_xref="SWISS-PROT:P03187" /translation="MKVQGSVDRRRLQRRIAGLLPPPARRLNISRGSEFTRDVRGLVE EHAQASSLSAAAVWRAGLLAPGEVAVAGGGSGGGSFSWSGWRPPVFGDFLIHASSFNN AEATGTPLFQFKQSDPFSGVDAVFTPLSLFILMNHGRGVAARVEAGGGLTRMANLLYD SPATLADLVPDFGRLVADRRFHNFITPVGPLVENIKSTYLNKITTVVHGPVVSKAIPR STVKVTVPQEAFVDLDAWLSGGAGGGGGVCFVGGLGLQPCPADARLYVALTYEEAGPR FTFFQSSRGHCQIMNILRIYYSPSIMHRYAVVQPLHIEELTFGAVACLGTFSATDGWR RSAFNYRGSSLPVVEIDSFYSNVSDWEVIL" promoter complement(75322) /note="TATA: TATTTAG before BOLF1" promoter 75819 /note="TATA: TATAAAG" misc_feature 75838 /note="polyA signal: AATAAA" misc_feature complement(76126) /note="polyA signal: AATAAA" promoter 76169 /note="TATA: TACATAT BO-R2 early promoter before BORF2, gives 2.8kb RNA" misc_feature complement(76300) /note="polyA signal: AATAAA" CDS 76407..78887 /note="BORF2 early reading frame, 2 NXT/S. Homology HSV 140K ribonucleotide reductase (Gibson et al, 1984) and RF 19 VZV" /codon_start=1 /db_xref="PID:g1334856" /db_xref="SWISS-PROT:P03190" /translation="MATTSHVEHELLSKLIDELKVKANSDPEADVLAGRLLHRLKAES VTHTVAEYLEVFSDKFYDEEFFQMHRDELETRVSAFAQSPAYERIVSSGYLSALRYYD TYLYVGRSGKQESVQHFYMRLAGFCASTTCLYAGLRAALQRARPEIESDMEVFDYYFE HLTSQTVCCSTPFMRFAGVENSTLASCILTTPDLSSEWDVTQALYRHLGRYLFQRAGV GVGVTGAGQDGKHISLLMRMINSHVEYHNYGCKRPVSVAAYMEPWHSQIFKFLETKLP ENHERCPGIFTGLFVPELFFKLFRDTPWSDWYLFDPKDAGDLERLYGEEFEREYYRLV TAGKFCGRVSIKSLMFSIVNCAVKAGSPFILLKEACNAHFWRDLQGEAMNAANLCAEV LQPSRKSVATCNLANICLPRCLVNAPLAVRAQRADTQGDELLLALPRLSVTLPGEGAV GDGFSLARLRDATQCATFVVACSILQGSPTYDSRDMASMGLGVQGLADVFADLGWQYT DPPSRSLNKEIFEHMYFTALCTSSLIGLHTRKIFPGFKQSKYAGGWFHWHDWAGTDLS IPREIWSRLSERIVRDGLFNSQFIALMPTSGCAQVTGCSDAFYPFYANASTKVTNKEE ALRPNRSFWRHVRLDDREALNLVGGRVSCLPEALRQRYLRFQTAFDYNQEDLIQMSRD RAPFVDQSQSHSLFLREEDAARASTLANLLVRSYELGLKTIMYYCRIEKAADLGVMEC KASAALSVPREEQNERSPAEQMPPRPMEPAQVAGPVDIMSKGPGEGPGGWCVPGGLEV CYKYRQLFSEDDLLETDGFTERACESCQ" misc_feature 77835 /note="BAM: Bam H1 O/a" promoter 78804 /note="TATA: TATAAGT Ba-R1 early promoter before BaRF1, gives 3.5kb RNA" misc_feature 78883 /note="polyA signal: AATAAA, end of 3.9kb late RNA from 75017 and 2.8kb early RNA from 76169" misc_feature complement(78896) /note="polyA signal: AATAAA" CDS 78900..79808 /note="BaRF1 early reading frame, homologous to HSV 38K ribonucleotide reductase (Gibson et al, 1984) and RF 18 VZV" /codon_start=1 /db_xref="PID:g1334857" /db_xref="SWISS-PROT:P03175" /translation="MSKLLYVRDHEGFACLTVETHRNRWFAAHIVLTKDCGCLKLLNE RDLEFYKFLFTFLAMAEKLVNFNIDELVTSFESHDIDHYYTEQKAMENVHGETYANIL NMLFDGDRAAMNAYAEAIMADEALQAKISWLRDKVAAAVTLPEKILVFLLIEGIFFIS SFYSIALLRVRGLMPGICLANNYISRDELLHTRAASLLYNSMTAKADRPRATWIQELF RTAVEVETAFIEARGEGVTLVDVRAIKQFLEATADRILGDIGQAPLYGTPPPKDCPLT YMTSIKQTNFFEQESSDYTMLVVDDL" promoter complement(79495) /note="TATA: TATAACA" misc_feature 79537 /note="BAM: Bam H1 a/M" promoter 79840 /note="TATA: CATAAAT BM-R1 early promoter before BMRF1, gives 2.5kb RNA" CDS 79899..81113 /note="BMRF1 early reading frame. Early antigen protein recognised by R3 monoclonal (Pearson et al 1983; Cho et al, 1985a)" /codon_start=1 /db_xref="PID:g1334858" /db_xref="SWISS-PROT:P03191" /translation="METTQTLRFKTKALAVLSKCYDHAQTHLKGGVLQVNLLSVNYGG PRLAAVANAGTAGLISFEVSPDAVAEWQNHQSPEEAPAAVSFRNLAYGRTCVLGKELF GSAVEQASLQFYKRPQGGSRPEFVKLTMEYDDKVSKSHHTCALMPYMPPASDRLRNEQ MIGQVLLMPKTASSLQKWARQQGSGGVKVTLNPDLYVTTYTSGEACLTLDYKPLSVGP YEAFTGPVAKAQDVGAVEAHVVCSVAADSLAAALSLCRIPAVSVPILRFYRSGIIAVV AGLLTSAGDLPLDLSVILFNHASEEAAASTASEPEDKSPRVQPLGTGLQQRPRHTVSP SPSPPPPPRTPTWESPARPETPSPAIPSHSSNTALERPLAVQLARKRTSSEARQKQKH PKKVKQAFNPLI" promoter 80779 /note="TATA: TATTTAA BM-R2 late promoter before BMRF2" misc_feature complement(80782) /note="polyA signal: AATAAA" promoter 80832 /note="TATA: GATAAAA, possible promoter for 1.4kb late RNA encoding BMRF2" CDS 81118..82191 /note="BMRF2 early reading frame" /codon_start=1 /db_xref="PID:g1334859" /db_xref="SWISS-PROT:P03192" /translation="MFSCKQHLSLGACVFCLGLLASTPFIWCFVFANLLSLEIFSPWQ THVYRLGFPTACLMAVLWTLVPAKHAVRAVTPAIMLNIASALIFFSLRVYSTSTWVSA PCLFLANLPLLCLWPRLAIEIVYICPAIHQRFFELGLLLACTIFALSVVSRALEVSAV FMSPFFIFLALGSGSLAGARRNQIYTSGLERRRSIFCARGDHSVASLKETLHKCPWDL LAISALTVLVVCVMIVLHVHAEVFFGLSRYLPLFLCGAMASGGLYLGHSSIIACVMAT LCTLTSVVVYFLHETLGPLGKTVLFISIFVYYFSGVAALSAAMRYKLKKFVNGPLVHL RVVYMCCFVFTFCEYLLVTFIKS" promoter 81751 /note="TATA: CATAAAT" misc_feature 82180 /note="polyA signal: ATTAAA, end of 3.5kb early RNA from 78804, 2.5kb early RNA from 79840 and 1.4kb late RNA" promoter complement(82311) /note="TATA: CATAAAT" repeat_region 82319..82461 /note="2x71bp repeats" CDS complement(82743..84059) /note="BMLF1 early reading frame. Diffuse early antigen (Cho et al, 1985b). Also homologous to RF 4 VZV and IE63 of HSV.(BSLF2 + BMLF1) is also called EB2 (Chevallier-Greco et al, 1986). General transactivator of transcription (Lieberman et al, 1986)." /codon_start=1 /db_xref="PID:e25042" /db_xref="PID:g1632790" /db_xref="SWISS-PROT:Q04360" /translation="MEGSEEHSTDGEISSSEEEDEDPTPAHAIPARPSSVVITPTSAS FVIPRKKWDLQDKTVTLHRSPLCRDEDEKEETGNSSYTRGHKRRRGEVHGCTDESYGK RRHLPPGARAPRAPRAPRVPRAPRSPRAPRSNRATRGPRSESRGAGRSTRKQARQERS QRPLPNKPWFDMSLVKPVSKITFVTLPSPLASLTLEPIQDPFLQSMLAVAAHPEIGAW QKVQPRHELRRSYKTLREFFTKSTNKDTWLDARMQAIQNAGLCTLVAMLEETIFWLQE ITYHGDLPLAPAEDILLACAMSLSKVILTKLKELAPCFLPNTRDYNFVKQLFYITCAT ARQNKVVETLSSSYVKQPLCLLAAYAAVAPAYINANCRRRHDEVEFLGHYIKNYNPGT LSSLLTEAVETHTRDCRSASCSRLVRAILSPGTGSLGLFFVPGLNQ" misc_feature complement(82747) /note="polyA signal: AATAAA" repeat_region 83640..83729 /note="10x9bp repeats" misc_feature complement(84122) /note="ACCEPT: CTCCCCTCTGCAG acceptor in spliced form of BMLF1 RNA" misc_feature complement(84227) /note="DONOR: CAGGTAAGA donor in spliced form of BMLF1 RNA" CDS complement(84229..84288) /note="BSLF2 early reading frame in 5' exon of spliced RNA encoding BMLF1" /codon_start=1 /db_xref="PID:g1334861" /translation="MVPSQRLSRTSSISSNEDPA" misc_feature 84233 /note="BAM: Bam H1 M/S" CDS complement(84257..86881) /note="BSLF1 reading frame, homologous to RF 6 VZV" /codon_start=1 /db_xref="PID:g1334862" /db_xref="SWISS-PROT:P03193" /translation="MSAPVVIKALVASNTDIAEAILDAILSRPDEGFRLFCLCHNASP LHHVAGSLVELQLHLPKKRLTSQSRCGLVLTLHLPAEEAFPFLRGLTPLTADRLSTYL DRAGALRSLTPLVELLTLSAKKQPQGDARGRVAWLRPKIVGCLRRIYRVNISARWFIS TFGSHEAQFVLVTAAYYFWGIPCTIETLAHLTELFTSESGQSLAAVTSLAELGEVFGS SAWAEQTEAFAHFAHEKLRRDSREIRAVARTIDAYRGRLPLASADLVRYVYLAHAQCF NEGTFKRYSQLTSMGEIGCLPSGGVVLPSLLDRGFAEHMRTYFTRETYLAEHVRVQQL KIRMEPPAPYTWDPDPDDGLMRAWAGLSVDVARELVELARWHADEGPTYPPTLQGFLC LAGQATCRGQWNPKEQFLPPTVLRRVQRLPVFLCHFADRHYFVMTAADPFSSHLAEVV STPTNCRLPDTCLTRALSYTPVYYSQNSLSEQLFVSRHEYFNPRLPVCNLVLDLDLKI KGAPWSLEEIYDLCRTVRREVLRLMRRLGPVSRAHPVYFFKSACPPADPDNMEDVLPF CICTGKLGFRVITPLPRGHAIVGTSAVQGFVSVLQKLMGLTACLRRMRHKIKEIGAPL FDSGVYHAGRCIRLPHTYKVDRGGGLSRQLRLFVCHPEEEDKHSYVKNALNIQNLLHH SLHVGWPAPKTFCYHIADDGRDYLIQRTRETLPPTVENVCAMIEGHLGLDLVAWVSSC IWPSLMSTLATAVPEDKFPQFLHVTFEQTGPNLVQVCHARGRNFACLRHTHRASSKNV RVFLVLYYTSQAITVTFMSQCFAGRCGANQPTAHFSISVPASRIINRAEASQDSTTSQ LARRRDRQDGSFSETLPN" promoter complement(84356) /note="TATA: CATAAAT before BSLF2 and BMLF1. Two RNAs start here; one is spliced and the other is unspliced, both traverse BMLF1." promoter 86882 /note="TATA: TATTTAA BS-R1 late promoter before BSRF1" CDS 86924..87580 /note="BSRF1 reading frame" /codon_start=1 /db_xref="PID:g1334863" /db_xref="SWISS-PROT:P03194" /translation="MAFYLPDWSCCGLWLFGRPRNRYSQLPEEPETFECPDRWRAEID LGLPPGVQVGDLLRNEQTMGSLRQVYLLAVQANSITDHLKRFDAVRVPESCRGVVEAQ VAKLEAVRSVIWNTMISLAVSGIEMDENGLKALLDKQAGDSLALMEMEKVATALKMDE TGAWAQEISAVVSSVTAPSASAPFINSAFEPEVPTPVLAPPPVVRQPEHSGPTELALT " misc_feature complement(87134) /note="polyA signal: AATAAA" misc_feature 87599 /note="polyA signal: AATAAA" misc_feature complement(87613) /note="polyA signal: AATAAA, end 1.0kb early RNA from BLL3" CDS complement(87638..88474) /note="BLLF3 early reading frame (BLLF2 in Baer et al, 1984). Homologous to RF 8 VZV and dUTPase HSV." /codon_start=1 /db_xref="PID:g1334864" /db_xref="SWISS-PROT:P03195" /translation="MEACPHIRYAFQNDKLLLQQASVGRLTLVNKTTILLRPMKTTTV DLGLYARPPEGHGLMLWGSTSRPVTSHVGIIDPGYTGELRLILQNQRRYNSTLRPSEL KIHLAAFRYATPQMEEDKGPINHPQYPGDVGLDVSLPKDLALFPHQTVSVTLTVPPPS IPHHRPTIFGRSGLAMQGILVKPCRWRRGGVDVSLTNFSDQTVFLNKYRRFCQLVYLH KHHLTSFYSPHSDAGVLGPRSLFRWASCTFEEVPSLAMGDSGLSEALEGRQGRGFGSS GQ" misc_feature 87650 /note="BAM: Bam H1 S/L" promoter 88507 /note="TATA: TATATAT BL-R1 late promoter before BLRF1, gives 1.0kb late RNA" promoter 88511 /note="TATA: TATAAGA" promoter complement(88514) /note="TATA: TATATAT BL-L3 early promoter before BLLF3, gives 1.0kb early RNA" CDS 88547..88855 /note="BLRF1 late reading frame" /codon_start=1 /db_xref="PID:g1334865" /db_xref="SWISS-PROT:P03196" /translation="MGKVLRKPFAKAVPLLFLAATWLLTGVLPAGASSPTNAAAASLT EAQDQFYSYTCNADTFSPSLTSFASIWALLTLVLVIIASAIYLMYVCFNKFVNTLLTD " promoter 88863 /note="TATA: TATTTAA BL-R2 late promoter before BLRF2, gives 0.6kb late RNA" CDS 88925..89413 /note="BLRF2 late reading frame, 2 NXS/T" /codon_start=1 /db_xref="PID:g1334866" /db_xref="SWISS-PROT:P03197" /translation="MSAPRKVRLPSVKAVDMSMEDMAARLARLESENKALKQQVLRGG ACASSTSVPSAPVPPPEPLTARQREVMITQATGRLASQAMKKIEDKVRKSVDGVTTRN EMENILQNLTLRIQVSMLGAKGQPSPGEGTRPRESNDPNATRRARSRSRGREAKKVQI SD" misc_feature 89412 /note="polyA signal: AATAAA, end of 1.0kb and 0.6kb late RNAs" misc_feature complement(89425) /note="polyA signal: AATAAA, end of 0.7kb early, 2.2kb late and 2.8kb late RNA" CDS complement(89430..92153) /note="BLLF1a, late reading frame, gp350 membrane antigen, 36 NXT /S (Hummel et al, 1984; Biggin et al, 1984; Beisel et al, 1985)" /codon_start=1 /db_xref="PID:g1334869" /db_xref="SWISS-PROT:P03200" /translation="MEAALLVCQYTIQSLIHLTGEDPGFFNVEIPEFPFYPTCNVCTA DVNVTINFDVGGKKHQLDLDFGQLTPHTKAVYQPRGAFGGSENATNLFLLELLGAGEL ALTMRSKKLPINVTTGEEQQVSLESVDVYFQDVFGTMWCHHAEMQNPVYLIPETVPYI KWDNCNSTNITAVVRAQGLDVTLPLSLPTSAQDSNFSVKTEMLGNEIDIECIMEDGEI SQVLPGDNKFNITCSGYESHVPSGGILTSTSPVATPIPGTGYAYSLRLTPRPVSRFLG NNSILYVFYSGNGPKASGGDYCIQSNIVFSDEIPASQDMPTNTTDITYVGDNATYSVP MVTSEDANSPNVTVTAFWAWPNNTETDFKCKWTLTSGTPSGCENISGAFASNRTFDIT VSGLGTAPKTLIITRTATNATTTTHKVIFSKAPESTTTSPTLNTTGFADPNTTTGLPS STHVPTNLTAPASTGPTVSTADVTSPTPAGTTSGASPVTPSPSPWDNGTESKAPDMTS STSPVTTPTPNATSPTPAVTTPTPNATSPTPAVTTPTPNATSPTLGKTSPTSAVTTPT PNATSPTLGKTSPTSAVTTPTPNATSPTLGKTSPTSAVTTPTPNATGPTVGETSPQAN ATNHTLGGTSPTPVVTSQPKNATSAVTTGQHNITSSSTSSMSLRPSSNPETLSPSTSD NSTSHMPLLTSAHPTGGENITQVTPASISTHHVSTSSPAPRPGTTSQASGPGNSSTST KPGEVNVTKGTPPQNATSPQAPSGQKTAVPTVTSTGGKANSTTGGKHTTGHGARTSTE PTTDYGGDSTTPRPRYNATTYLPPSTSSKLRPRWTFTSPPVTTAQATVPVPPTSQPRF SNLSMLVLQWASLAVLTLLLLLVMADCAFRRNLSTSHTYTTPPYDDAETYV" CDS complement(89430..92153) /note="BLLF1b, late reading frame gp220 membrane antigen, spliced form of BLLF1a (Hummel et al, 1984; Biggin et al, 1984; Beisel et al, 1985)" /codon_start=1 /db_xref="PID:g1334868" /db_xref="SWISS-PROT:P03200" /translation="MEAALLVCQYTIQSLIHLTGEDPGFFNVEIPEFPFYPTCNVCTA DVNVTINFDVGGKKHQLDLDFGQLTPHTKAVYQPRGAFGGSENATNLFLLELLGAGEL ALTMRSKKLPINVTTGEEQQVSLESVDVYFQDVFGTMWCHHAEMQNPVYLIPETVPYI KWDNCNSTNITAVVRAQGLDVTLPLSLPTSAQDSNFSVKTEMLGNEIDIECIMEDGEI SQVLPGDNKFNITCSGYESHVPSGGILTSTSPVATPIPGTGYAYSLRLTPRPVSRFLG NNSILYVFYSGNGPKASGGDYCIQSNIVFSDEIPASQDMPTNTTDITYVGDNATYSVP MVTSEDANSPNVTVTAFWAWPNNTETDFKCKWTLTSGTPSGCENISGAFASNRTFDIT VSGLGTAPKTLIITRTATNATTTTHKVIFSKAPESTTTSPTLNTTGFADPNTTTGLPS STHVPTNLTAPASTGPTVSTADVTSPTPAGTTSGASPVTPSPSPWDNGTESKAPDMTS STSPVTTPTPNATSPTPAVTTPTPNATSPTPAVTTPTPNATSPTLGKTSPTSAVTTPT PNATSPTLGKTSPTSAVTTPTPNATSPTLGKTSPTSAVTTPTPNATGPTVGETSPQAN ATNHTLGGTSPTPVVTSQPKNATSAVTTGQHNITSSSTSSMSLRPSSNPETLSPSTSD NSTSHMPLLTSAHPTGGENITQVTPASISTHHVSTSSPAPRPGTTSQASGPGNSSTST KPGEVNVTKGTPPQNATSPQAPSGQKTAVPTVTSTGGKANSTTGGKHTTGHGARTSTE PTTDYGGDSTTPRPRYNATTYLPPSTSSKLRPRWTFTSPPVTTAQATVPVPPTSQPRF SNLSMLVLQWASLAVLTLLLLLVMADCAFRRNLSTSHTYTTPPYDDAETYV" promoter complement(89434) /note="TATA: TATAAAG" CDS complement(89567..90013) /note="BLLF2 early reading frame (BLLF3 in Baer et al, 1984)" /codon_start=1 /db_xref="PID:g1334867" /db_xref="SWISS-PROT:P03199" /translation="MCPPVRQHPAQAPPAKRQALETVPHPQNRGRLMSPKARPPKMQR RPRPPVAKRRRFPRSPQQVERPILPPVESTPQDMEPGQVQSPPQITAVIQLRQDRDTM RPPIYLPALLANCGPAGLLRAHRLPQPKPPCQSRQRPSPDSQTSPC" promoter complement(90051) /note="TATA: TATAACA BL-L2 early promoter before BLLF2, gives 0.7kb early RNA" intron complement(90062..90652) /note="intervening sequence in gp220 gene" repeat_region 90177..90639 /note="21 copies of 21bp approximate repeat" promoter complement(92192) /note="TATA: TATTAAA BL-L1 late promoter before BLLF1a,b. Gives 2.8 and 2.2kb late RNAs" mRNA 92238..92581 /note="Exon in Bodescot et al (1986) RNA (spliced from 20763 to 92670)" CDS 92243..92602 /note="BLRF3 reading frame" /codon_start=1 /db_xref="PID:g1632791" /db_xref="SWISS-PROT:P03202" /translation="MDKDRPGPPALDDNMEEEVPSTSVVQEQVSAGDWENVLIELSDS SSEKEAEDAHLEPAQKGTKRKRVDHDAGGSAPARPMLPPQPDLPGREAILRRFPLDLR TLLQAIGAAATVSIPMA" CDS 92670..95162 /note="BERF1 frame, homology with BERF2b and BERF4. A fusion of BLRF3 with BERF1 encodes EBNA-3A, latent cycle gene. (Hennessy et al, 1986, Joab et al, 1987); Author-given protein sequence is in conflict with the conceptual translation" /codon_start=1 /db_xref="PID:e25053" /db_xref="PID:g1334871" /translation="RIDTRAIDQFFGSQISNTEMYIMYAMAIRQAIRDRRRNPASRRD QAKWRLQTLAAGWPMGYQAYSSWMYSYTDHQTTPTFVHLQATLGCTGGRRCHVTFSAG TFKLPRCTPGDRQWLYVQSSVGNIVQSCNPRYSIFFDYMAIHRSLTKIWEEVLTPDQR VSFMEFLGFLQRTDLSYIKSFVSDALGTTSIQTPWIDDNPSTETAQAWNAGFLRGRAY GIDLLRTEGEHVEGATGETREESEDTESDGDDEDLPCIVSRGGPKVKRPPIFIRRLHR LLLMRAGKRTEQGKEVLEKARGSTYGTPRPPVPKPRPEVPQSDETATSHGSAQVPEPP TIHLAAQGMAYPLHEQHGMAPCPVAQAPPTPLPPVSPGDQLPGVFSDGRVACAPVPAP AGPIVRPWEPSLTQAAGQAFAPVRPQHMPVEPVPVPTVALERPVYPKPVRPAPPKIAM QGPGETSGIRRARERWRPAPWTPNPPRSPSQMSVRDRLARLRAEAQVKQASVEVQPPQ LTQVSPQQPMEGPLVPEQQMFPGAPFSQVADVVRAPGVPAMQPQYFDLPLIQPISQGA PVAPLRASMGPVPPVPATQPQYFDIPLTEPINQGASAAHFLPQQPMEGPLVPEQWMFP GAALSQSVRPGVAQSQYFDLPLTQPINHGAPAAHFLHQPPMEGPWVPEQWMFQGAPPS QGTDVVQHQLDALGYTLHGLNHPGVPVSPAVNQYHLSQAAFGLPIDEDESGEGSDTSE PCEALDLSIHGRPCPQAPEWPVQEEGGQDATEVLDLSIHGRPRPRTPEWPVQGEGGQN VTGPETRRVVVSAVVHMCQDDEFPDLQDPPDEA" mRNA 92670..95248 /note="Exon in (Bodescot et al, 1986) RNA from 92581, to 3' end" misc_feature 92703 /note="BAM: Bam H1 L/E" promoter complement(93161) /note="TATA: CATAAAT" promoter 93479 /note="TATA: TATAAGA" promoter complement(93482) /note="TATA: TATAAAT" repeat_region 94208..94277 /note="repeat type A" repeat_region 94281..94306 /note="repeat type B" repeat_region 94307..94381 /note="repeat type C" repeat_region 94386..94411 /note="repeat type B" repeat_region 94412..94489 /note="repeat type C" repeat_region 94490..94560 /note="repeat type A" repeat_region 94571..94648 /note="repeat type C" repeat_region 94649..94719 /note="repeat type A" repeat_region 94896..94982 /note="repeat type D" repeat_region 94983..95069 /note="repeat type D" misc_feature 95221 /note="polyA signal: AATAAA" misc_feature complement(95272) /note="polyA signal: AATAAA" CDS join(95353..95709,95788..98247) /codon_start=1 /product="EBNA3B (EBNA4A) latent protein" /db_xref="PID:e276431" /db_xref="PID:g1632792" /translation="MKKAWLSRAQQADAGGASGSEDPPDYGDQGNVTQVGSEPISPEI GPFELSAASEDDPQSGPVEENLDAAAREEEEPHEQEHNGGDDPLDVHTRQPRFVDVNP TQAPVIQLVHAVYDSMLQSDLRPLGSLFLEQNLNIEEFIWMCMTVRHRCQAIRKKPLP IVKQRRWKLLSSCRSWRMGYRTHNLKVNSFESGGDNVHPVLVTATLGCDEGTRHATTY SAGIVQIPRISDQNQKIETAFLMARRARSLSAERYTLFFDLVSSGNTLYAIWIGLGTK NRVSFIEFVGWLCKKDHTHIREWFRQCTGRPKAAKPWLRAHPVAIPYDDPLTNEEIDL AYARGQAMNIEAPRLPDDPIIVEDDDESEEIEAESDEEEDKSGMESLKNIPQTLPYNP TVYGRPAVFDRKSDAKSTKKCRAIVTDFSVIKAIEEEHRKKKAARTEQPRATPESQAP TVVLQRPPTQQEPGPVGPLSVQARLEPWQPLPGPQVTAVLLHEESMQGVQVHGSMLDL LEKDDEVMEQRVMATLLPPVPQQPRAGRRGPCVFTGDLGIESDEPASTEPVHDQLLPA PGPDPLEIQPLTSPTTSQLSSSAPSCAQTPWPVVQPSQTPDDPTKQSRPPETAAPRQW PMPLRPIPMRPLRMQPIPFNHPVGPTPHQTPQVEITPYKPTWAQIGHIPYQPTPTGPA TMLLRQWAPATMQTPPRAPTPMSPPEVPPVPRQRPRGAPTPTPPPQVPPVPRQRPRGA PTPTPPPQVLPTPMQLALRAPAGQQGPTKQILRQLLTGGVKKGRPSLKLQAALERQAA AGWQPSPGSGTSDKIVQAPIFYPPVLQPIQVMGQGGSPTAMAASAVTQAPTEYTRERR GVGPMPPTDIPPSKRAKIEAYTEPEMPHGGASHSPVVILENVGQGQQQTLECGGTAKQ ERDMLGLGDIAVSSPSSSETSNDE" mat_peptide 95353..95724 /note="BERF2a reading frame" mat_peptide 95725..98244 /note="BERF2b frame, homology with BERF1 and BERF4. BERF2a and BERF2b are spliced together to make EBNA3B (EBNA4A) latent protein." misc_feature complement(95819) /note="polyA signal: AATAAA" promoter complement(95853) /note="TATA: TATAAAT" misc_feature complement(96276) /note="polyA signal: AATAAA" repeat_region 97522..97698 /note="3x60bp repeat" mat_peptide 98323..98769 /note="BERF3 reading frame" mRNA 98364..98730 /note="Exon in EBNA-1 RNA (Speck and Strominger, 1985)" CDS join(98371..98730,98805..101423) /codon_start=1 /product="EBNA3C (EBNA 4B) latent protein" /db_xref="PID:e276455" /db_xref="PID:g1632793" /translation="MESFEGQGDSRQSPDNERGDNVQTTGEHDQDPGPGPPSSGASER LVPEESYSRDQQPWGQSRGDENRGWMQRIRRRRRRRAALSGHLLDTEDNVPPWLPPHD ITPYTARNIRDAACRAVKQSHLQALSNLILDSGLDTQHILCFVMAARQRLQDIRRGPL VAEGGVGWRHWLLTSPSQSWPMGYRTATLRTLTPVPNRVGADSIMLTATFGCQNAART LNTFSATVWTPPHAGPREQERYAREAEVRFLRGKWQRRYRRIYDLIELCGSLHHIWQN LLQTEENLLDFVRFMGVMSSCNNPAVNYWFHKTIGNFKPYYPWNAPPNENPYHARRGI KEHVIQNAFRKAQIQGLSMLATGGEPRGDATSETSSDEDTGRQGSDVELESSDDELPY IDPNMEPVQQRPVMFVSRVPAKKPRKLPWPTPKTHPVKRTNVKTSDRSDKAEAQSTPE RPGPSEQSSVTVEPAHPTPVEMPMVILHQPPPVPKPVPVKPTPPPSRRRRGACVVYDD DVIEVIDVETTEDSSSVSQPNKPHRKHQDGFQRSGRRQKRAAPPTVSPSDTGPPAVGP PAAGPPAAGPPAAGPPAAGPPAAGPPAAGPRILAPLSAGPPAAGPHIVTPPSARPRIM APPVVRMFMRERQLPQSTGRKPQCFWEMRAGREITQMQQEPSSHLQSATQPTTPRPSW APSVCALSVMDAGKAQPIESSHLSSMSPTQPISHEEQPRYEDPDAPLDLSLHPDVAAQ PAPQAPYQGYQEPPAPQAPYQGYQEPPPPQAPYQGYQEPPAHGLQSSSYPGYAGPWTP RSQHPCYRHPWAPWSQDPVHGHTQGPWDPRAPHLPPQWDGSAGHGQDQVSQFPHLQSE TGPPRLQLSLVPLVSSSAPSWSSPQPRAPIRPIPTRFPPPPMPLQDSMAVGCDSSGTA CPSMPFASDYSQGAFTPLDINATTPKRPRVEESSHGPARCSQATAEAQEILSDNSEIS VFPKDAKQTDYDASTESELD" misc_feature 98731 /note="DONOR: AAGGTGAGT donor" mat_peptide 98805..101420 /note="BERF4 frame, homology with BERF1 and BERF2b. BERF3 and BERF4 are spliced together to make the EBNA3C (EBNA 4B) latent protein." mRNA 98805..99050 /note="Exon in T4 cDNA (Bodescot et al 1986). 99050 is not the end of the RNA." misc_feature 99126..102118 /note="DEL: Deletion in Raji" promoter 99443 /note="TATA: CATAAAA" misc_feature 100104 /note="DONOR: ACCGTGAGT possible donor before repeat." repeat_region 100122..100304 /note="10 x 15bp repeat" misc_feature 100528 /note="DONOR: CTGGTAAGG possible donor" misc_feature 100613 /note="BAM: Bam H1 E/e1" repeat_region 100665..100781 /note="3x39bp repeat" promoter complement(100860) /note="TATA: TATAACA" misc_feature 100919 /note="BAM: Bam H1 e1/e2" misc_feature 101426 /note="BAM: Bam H1 e2/e3" CDS complement(101445..102116) /note="BZLF2 reading frame 3x NXT/S. 2.5kb late RNA traverses BZLF2, ends unknown." /codon_start=1 /db_xref="PID:g1334876" /db_xref="SWISS-PROT:P03205" /translation="MVSFKQVRVPLFTAIALVIVLLLAYFLPPRVRGGGRVAAAAITW VPKPNVEVWPVDPPPPVNFNKTAEQEYGDKEVKLPHWTPTLHTFQVPQNYTKANCTYC NTREYTFSYKGCCFYFTKKKHTWNGCFQACAELYPCTYFYGPTPDILPVVTRNLNAIE SLWVGVYRVGEGNWTSLDGGTFKVYQIFGSHCTYVSKFSTVPVSHHECSFLKPCLCVS QRSNS" promoter 101690 /note="TATA: CATAAAA" misc_feature 101765 /note="polyA signal: AATAAA" promoter complement(101786) /note="TATA: TATAAAG" misc_feature 101947 /note="BAM: Bam H1 e3/Z" misc_feature complement(102098) /note="DONOR: CAGGTGAGG possible donor" mRNA complement(102126..102341) /note="3' terminal exon of 0.9kb and 2.8kb early RNAs" promoter 102153 /note="TATA: TATTAAT" misc_feature complement(102156) /note="polyA signal: AATAAA 3' end of 0.9kb and 2.8kb RNAs encoding BZLF1 and BRLF1" promoter complement(102160) /note="TATA: TATTAAT" CDS complement(join(<102210..102338,102423..102530, 102655..103155)) /note="BZLF1 reading frame, modified from Baer et al, 1984. Has two splices within frame. 2xNXT/S. Immediate early gene which disrupts latency (Countryman and Miller, 1985), called EB1 by Chevallier-Greco et al, 1986 and ZEBRA by Miller." /codon_start=1 /db_xref="PID:e276432" /db_xref="PID:g1632794" /translation="MMDPNSTSEDVKFTPDPYQVPFVQAFDQATRVYQDLGGPSQAPL PCVLWPVLPEPLPQGQLTAYHVSTAPTGSWFSAPQPAPENAYQAYAAPQLFPVSDITQ NQQTNQAGGEAPQPGDNSTVQTAAAVVFACPGANQGQQLADIGVPQPAPVAAPARRTR KPQQPESLEECDSELEIKRYKNRVASRKCRAKFKQLLQHYREVAAAKSSENDRLRLLL KQMCPSLDVDSIIPRTPDVLHEDLLNF" promoter complement(102380) /note="TATA: CATAAAT" promoter 102415 /note="TATA:TATATAC" promoter complement(102420) /note="TATA: TATATAC" mRNA complement(102426..102530) /note="Exon of 0.9kb and 2.8kb early RNAs" misc_feature complement(102504) /note="polyA signal: AATAAA, apparently not functional" repeat_region 102581..102652 /note="semi-repetitive sequence, homologous to human c-fos 3' sequence" mRNA complement(102655..103194) /note="First exon of 0.9kb early RNA encoding BZLF1" misc_feature complement(102918) /note="splice acceptor used in RZ fusion gene (Sargeant)" promoter complement(103231) /note="TATA: TTTAAA of BZL1 immediate early promoter gives 0.9kb RNA" misc_feature complement(103256..103311) /note="Upstream of BZL1, homology to 106243 to 106188" CDS complement(103366..105183) /note="BRLF1 reading frame, (immediate?) early gene, acts as transcription activator." /codon_start=1 /db_xref="PID:g1334878" /db_xref="SWISS-PROT:P03209" /translation="MRPKKDGLEDFLRLTPEIKKQLGSLVSDYCNVLNKEFTAGSVEI TLRSYKICKAFINEAKAHGREWGGLMATLNICNFWAILRNNRVRRRAENAGNDACSIA CPIVMRYVLDHLIVVTDRFFIQAPSNRVMIPATIGTAMYKLLKHSRVRAYTYSKVLGV DRAAIMASGKQVVEHLNRMEKEGLLSSKFKAFCKWVFTYPVLEEMFQTMVSSKTGHLT DDVKDVRALIKTLPRASYSSHAGQRSYVSGVLPACLLSTKSKAVETPILVSGADRMDE ELMGNDGGASHTEARYSESGQFHAFTDELESLPSPTMPLKPGAQSADCGDSSSSSSDS GNSDTEQSEREEARAEAPRLRAPKSRRTSRPNRGQTPCPSNAAEPEQPWIAAVHQESD ERPIFPHPSKPTFLPPVKRKKGLRDSREGMFLPKPEAGSAISDVFEGREVCQPKRIRP FHPPGSPWANRPLPASLAPTPTGPVHEPVGSLTPAPVPQPLDPAPAVTPEASHLLEDP DEETSQAVKALREMADTVIPQKEEAAICGQMDLSHPPPRGHLDELTTTLESMTEDLNL DSPLTPELNEILDTFLNDECLLHAMHISTGLSIFDTSLF" misc_feature complement(103453..103462) /note="TAATGAAATC sequence" misc_feature 103741 /note="BAM: Bam H1 Z/g" misc_feature 103816 /note="BAM: Bam H1 g/R" mRNA complement(104926..105185) /note="exon in RZ fusion gene (Sargeant)" mRNA complement(104927..104989) /note="BRLF2 poss. small 5' exon" promoter 105016 /note="TATA: TATAAAT before BRRF1, possible promoter for 1.1 kb early RNA encoding BRRF1" promoter complement(105019) /note="TATA: TATAAAT before BRLF2" CDS 105182..106114 /note="BRRF1 early reading frame" /codon_start=1 /db_xref="PID:g1334877" /db_xref="SWISS-PROT:P03207" /translation="MASSNRGNARPLKSFLHELYLKHYPEVGDVVHLLNTIGVDCDLP PSHPLLTAQRGLFLARVLQAVQQHKLLEDTIVPKILKKLAYFLELLSYYSPKDEQRDI AEVLDHLKTNRDLGLDDRLWALIRKLRQDRHHASVNVLMPGSDYTAVSLQYYDGISIG MRKVIADVCRSGYASMPSMTATHNLSHQLLMASGPSEEPCAWRGFFNQVLLWTVALCK FRRCIYYNYIQGSIATISQLLHLEIKALCSWIISQDGMRLFQHSRPLLTLWESVAANQ EVTDAITLPDCAEYIDLLKHTKHVLENCSAMQYK" misc_feature complement(105185) /note="ACCEPT: splice acceptor in 2.8kb early RNA encoding BRLF1 and RZ fusion gene (Sargeant)" promoter 105213 /note="TATA: CATTAAA" misc_feature 106110 /note="polyA signal: AATAAA, 3' end of early 1.1kb RNA encoding BRRF1" misc_feature complement(106125) /note="DONOR: CAGGTAAGA possible donor" misc_feature complement(106188..106243) /note="Homology to upstream region of BZL1" promoter complement(106213) /note="TATA: CATAAAA" promoter 106243 /note="TATA: TATAAAA before BRRF2, possible promoter for 1.8 kb RNA encoding BRRF2" CDS 106302..107915 /note="BRRF2 reading frame" /codon_start=1 /db_xref="PID:g1334879" /db_xref="SWISS-PROT:P03210" /translation="MSGQQRGSVILVPEHLAGALTKLMSDFITGQDVTLSGGNIAVKI RDAINQTPGGGDVAILSSLFALWNALPTSGRQSSRDDLIPAAVQALTTAHNLCLGVIP GETSHKDTPESLLRAIVTGLQKLWVDSCGCPECLQCLKGLKAIKPGLYEIPRIIPHTK QCSPVNLLNMLVHKLVALRGHVQLAYDARVLTPDFHEIPDLDDSDAVFARTLLAALFH LNMFFILKDYITQDSMSLKQALSGHWMSATGNPLPAAPETLRDYLEAFRNSDNHFYLP TTGPLNTFQFPEELLGRVVVIDSSLCAASHVQDVITHGVGAGVPRPRFSALPPAPSRE PQQTCSQLTSRGNESSRRNLGQPGGTSPAVPPVCPIVSLTASGAKQNRGGMGSLHLAK PEETSPAVSPVCPIASPAASRSKQHCGVTGSSQAAPSFSSVAPVASLSGDLEEEEEGS RESPSLPSSKKGDEEFEAWLEAQDANLEDVQREFSGLRVIGDEDEDGSEDGEFSDLDL SDSDHEGDEGGGAVGGGRSLHSLYSLSVV" promoter complement(106385) /note="TATA: GATAAAA" misc_feature complement(106973) /note="polyA signal: AATAAA" promoter complement(107124) /note="TATA: GATAAAA" misc_feature 107457 /note="BAM: Bam H1 R/f" misc_feature 107565 /note="BAM: Bam H1 f/K" misc_feature 107914 /note="polyA signal: AATAAA, 3' end of 1.8kb RNA encoding BBRF2" misc_feature 107942 /note="ACCEPT: splice acceptor for EBNA-1 RNA (from 98730)" CDS 107950..109875 /note="BKRF1 encodes EBNA-1 protein, latent cycle gene." /codon_start=1 /db_xref="PID:g1334880" /db_xref="SWISS-PROT:P03211" /translation="MSDEGPGTGPGNGLGEKGDTSGPEGSGGSGPQRRGGDNHGRGRG RGRGRGGGRPGAPGGSGSGPRHRDGVRRPQKRPSCIGCKGTHGGTGAGAGAGGAGAGG AGAGGGAGAGGGAGGAGGAGGAGAGGGAGAGGGAGGAGGAGAGGGAGAGGGAGGAGAG GGAGGAGGAGAGGGAGAGGGAGGAGAGGGAGGAGGAGAGGGAGAGGAGGAGGAGAGGA GAGGGAGGAGGAGAGGAGAGGAGAGGAGAGGAGGAGAGGAGGAGAGGAGGAGAGGGAG GAGAGGGAGGAGAGGAGGAGAGGAGGAGAGGAGGAGAGGGAGAGGAGAGGGGRGRGGS GGRGRGGSGGRGRGGSGGRRGRGRERARGGSRERARGRGRGRGEKRPRSPSSQSSSSG SPPRRPPPGRRPFFHPVGEADYFEYHQEGGPDGEPDVPPGAIEQGPADDPGEGPSTGP RGQGDGGRRKKGGWFGKHRGQGGSNPKFENIAEGLRALLARSHVERTTDEGTWVAGVF VYGGSKTSLYNLRRGTALAIPQCRLTPLSRLPFGMAPGPGPQPGPLRESIVCYFMVFL QTHIFAEVLKDAIKDLVMTKPAPTCNIRVTVCSFDDGVDLPPWFPPMVEGAAAEGDDG DDGDEGGDGDEGEEGQE" repeat_region 108217..108924 /note="EBNA triplet repeat GGA,GCA,GGG." misc_feature 109856 /note="DONOR: AGGGTGAGG possible donor at end BKRF1" promoter 109905 /note="TATA: TATTAAA before BKRF2, possible start for 2.3kb late RNA" misc_feature 109906 /note="polyA signal: ATTAAA" misc_feature 109937 /note="polyA signal: AATAAA 3' end of EBNA-1 RNA" CDS 109958..110371 /note="BKRF2 reading frame" /codon_start=1 /db_xref="PID:g1334881" /db_xref="SWISS-PROT:P03212" /translation="MRAVGVFLAICLVTIFVLPTWGNWAYPCCHVTQLRAQHLLALEN ISDIYLVSNQTCDGFSLASLNSPKNGSNQLVISRCANGLNVVSFFISILKRSSSALTG HLRELLTTLETLYGSFSVEDLFGANLNRYAWHRGG" misc_feature 110271 /note="DONOR: TCCGTGAGT possible donor at end of BKRF2" CDS 110353..111120 /note="BKRF3 reading frame, homologous to RF 59 VZV" /codon_start=1 /db_xref="PID:e25020" /db_xref="PID:g1632795" /db_xref="SWISS-PROT:P12888" /translation="MASRGLDLWLDEHVWKRKQEIGVKGENLLLPDLWLDFLQLSPIF QRKLAAVIACVRRLRTQATVYPEEDMCMAWARFCDPSDIKVVILGQDPYHGGQANGLA FSVAYGFPVPPSLRNIYAELHRSLPEFSPPDHGCLDAWASQGVLLLNTILTVQKGKPG SHADIGWAWFTDHVISLLSERLKACVFMLWGAKAGDKASLINSKKHLVLTSQHPSPLA QNSTRKSAQQKFLGNNHFVLANNFLREKGLGEIDWRL" misc_feature 111098 /note="DONOR: TCGGTGAGA possible donor at end BKRF3" CDS 111134..111787 /note="BKRF4 reading frame, contains complex repetitive sequence" /codon_start=1 /db_xref="PID:e25021" /db_xref="PID:g1632796" /db_xref="SWISS-PROT:P30117" /translation="MAMFLKSRGVRSCRDRRLLSDEEEETSQSSSYTLGSQASQSIQE EDVSDTDESDYSDEDEEIDLEEEYPSDEDPSEGSDSDPSWHPSDSDESDYSESDEDEA TPGSQASRSSRVSPSTQQSSGLTPTPSFSRPRTRAPPRPPAPAPVRGRASAPPRPPAP VQQSTKDKGPHRPTRPVLRGPAPRRPPPPSSPNTYNKHMMETTPPIKGNNNYNWPWL" misc_feature 111272 /note="DONOR: GACGTGAGT poss.donor before rpt.seq. in BKRF4" misc_feature 111719 /note="polyA signal: AATAAA" misc_feature 111787 /note="polyA signal: AATAAA : currently unknown which is 3' end of the 2.3kb late and 1.1kb early RNAs" CDS complement(111830..114259) /note="BBLF4 early reading frame, very good homology to RF55 VZV" /codon_start=1 /db_xref="PID:g1334885" /db_xref="SWISS-PROT:P03214" /translation="MAEEPRAPEALSSTFMLNMTSDASVRRIVRRIGTLARRRVQQLP DMETFSPEFDPELSEPPFLPFSAYVITGTAGAGKSTSVSCLHHTMDCLVTGATTVAAQ NLSQTLRAYCPTVYSAFGFKSRHINMTQRVSSHGRSTDAALEELQRRDLAKYWPVLSD IAAEFRRTKPRGLYSGVSGPAFEVLRDMHQGQLWTTNVIVVDEAGTLSVHILTAVVFC YWFFNAWLRTPLYRRGRIPCIVCVGSPTQTDAFQSSFSHETQVNKIRECDNILTFLVG NPRAATYVDVARNWALFINNKRCTDVQFGHLMKTLEYGLELSPDILAYVDRFVVPRAA IMDPAQYVGWTRLFLSHAEVKTFLTTLHATLKTAGQGRAARGTGGDGGGVTMFTCPVE CEVFLDPLAQYKTLVGLPGLTAHTWLQKNYARLGNYSQFADQDMVPVGTEQDEERVKV TYNVTYVKHSSVSVNCKTKKSICGYTGTFGDFMDTLEADSFVEAHGHEQPEYVYSFLA RLIYGGIYAFSHGGHSLCENGEYVAELGAVPLPGRTWDPEVTAGMELGELPLEVAWDG ERSPAAVFYARVLAPPAANSAPLCSLLNIYNDLRAYFRQCLDVAVRYGGREFRDLPFC TFTNNMLIRDNIEFTSDEPLLHGLLDYASTTENYTLLGYTHLNVFFGIRGKQQPQDAG SSRMPRLMVKDEAGFVCCLEHNTNKLYETIEDKSLNLCSIRDYGISSKLAMTIAKAQG LSLNKVAICFGSHRNIKPGHVYVALSRARHSNCVVMDRNPLSEMITGEGNPASGYIVD ALKNSRALLVY" misc_feature complement(111830) /note="polyA signal: AATAAA" promoter 112471 /note="TATA: TATATAT" promoter complement(112476) /note="TATA: TATATAA" misc_feature 112620 /note="BAM: Bam H1 K/B" promoter 113876 /note="TATA: TATTTAT before BBRF1" promoter complement(113885) /note="TATA: CATAAAT" CDS 114204..116045 /note="BBRF1 late reading frame, homologous to RF 54 VZV" /codon_start=1 /db_xref="PID:g1334884" /db_xref="SWISS-PROT:P03213" /translation="MFNMNVDESASGALGSSAIPVHPTPASVRLFEILQGKYAYVQGQ TIYANLRNPGVFSRQVFTHLFKRAISHCTYDDVLHDWNKFEACIQKRWPSDDSCASRF RESTFESWSTTMKLTVRDLLTTNIYRVLHSRSVLSYERYVDWICATGMVPAVKKPITQ ELHSKIKSLRDRCVCRELGHERTIRSIGTELYEATKEIIESLNSTFIPQFTEVTIEYL PRSDEYVAYYCGRRIRLHVLFPPAIFAGTVTFDSPVQRLYQNIFMCYRTLEHAKICQL LNTAPLKAIVGHGGRDMYKDILAHLEQNSQRKDPKKELLNLLVKLSENKTISGVTDVV EEFITDASNNLVDRNRLFGQPGETAAQGLKKKVSNTVVKCLTDQINEQFDQINGLEKE RELYLKKIRSMESQLQASLGPGGNNPAASAPAAVAAEAASVDILTGSTASAIEKLFNS PSASLGARVSGHNESILNSFVSQYIPPSREMTKDLTELWESELFNTFKLTPVVDNQGQ RLYVRYSSDTISILLGPFTYLVAELSPVELVTDVYATLGIVEIIDELYRSSRLAIYIE DLGRKYCPASATGGDHGIRQAPSARGDTEPDHAKSKPARDPPPGAGS" CDS 115948..116784 /note="BBRF2 late reading frame, homologous to RF 53 VZV" /codon_start=1 /db_xref="PID:e25024" /db_xref="PID:g1632797" /db_xref="SWISS-PROT:P29882" /translation="MASGKHHQPGGTRSLTMQKVSLRVTPRLVLEVNRHNAICVATNV PEFYNARGDLNIRDLRAHVKARMISSQFCGYVLVSLLDSEDQVDHLNIFPHVFSERMI LYKPNNVNLMEMCALLSMIENAKSPSIGLCREVLGRLTLLHSKCNNLDSLFLYNGART LLSTLVKYHDLEEGAATPGPWNEGLSLFKLHKELKRAPSEARDLMQSLFLTSGKMGCL ARSPKDYCADLNKEEDANSGFTFNLFYQDSLLTKHFQCQTVLQTLRRKCLGSDTVSKI IP" promoter complement(116683) /note="TATA: GATAAAA" misc_feature complement(116696) /note="polyA signal: AATAAA" CDS complement(116784..117386) /note="BBLF3 early reading frame, spliced to BBLF2. BBLF3 contains a consensus nucleotide binding site; Author-given protein sequence is in conflict with the conceptual translation" /codon_start=1 /db_xref="PID:e25025" /db_xref="PID:g1334887" /translation="AFLQGVKDSEDASRLDRDVMGGEATVARRHIRVKARRGPGCLLM AIFQGDLYVGGCREHSGPFLVWHEAFSWTLDQLAARPEADKAPPSHDHLLTLVRDLTR RLAPGRRRNRFWALPRAWLQRLRRAGLRLSGSHVCLLDKDGARPAPCQTATEHGLSPT AYFREIMAFLLDVISALHPGYTIPMEITRETDLLMTVLSLF" misc_feature 116785 /note="polyA signal: AATAAA" intron complement(117386..117515) /note="intron spliced out in RNA linking BBLF2 and BBLF3" CDS complement(117515..119080) /note="BBLF2 early reading frame, spliced to BBLF3; Author-given protein sequence is in conflict with the conceptual translation" /codon_start=1 /db_xref="PID:e25026" /db_xref="PID:g1334888" /db_xref="SWISS-PROT:P30118" /translation="PDVLKGPVLLRSQTMMETPAESVRARVSSVTFYNVTQTAGRWWA IWVVGIVPIKREDVETLIVVQACQPPLGGSLEPPVVNAPSTTELNFLRWERELRRSGG LIAMLADAAEKDLFDLSFRTRDRRLLSAARVEDEQGLIFQPLFPAQVVCQSCSGDDGR DQQPPPVDGFGSEMEGEQTCPHAQRHSESPGQLDVYIRTPRGDVFTYSTETPDDPSPV PFRDILRPVTYEVDLVSSDGATGRGGDARRHRVSLKILEPAGGFESWLVNSWSMAGGG LYAFLRSIYASCYANHRGTKPIFYLLDPELCPGGSDFQPYVPGFPFLPIHYVGRARPA FWHRAPHSEGLLLLDLNLGVSGTPLADALLGLDARSGQRRGSLLLQQIWPPTRKEINP RHVCTREGGEGGGEDETTVVGRAEATAILEADATWWLYELARCHLSARGAPVGTPDGG GQARDAQTWLRALHRYGTSDTRRALGGLYTAVTRVLLHAAADLGLTWAYADEFILGFV APTSAHPSEEPLAQ" promoter 118981 /note="TATA: TATAAAA BBR1 late promoter before BBRF3" promoter 119067 /note="TATA: TTTAAAA BBR2 late promoter ?" promoter 119098 /note="TATA: TATTTAA BBR3 late promoter before BBRF3" misc_feature 119108 /note="DONOR: AAGGTGAAT possible donor" CDS 119137..120354 /note="BBRF3 late reading frame" /codon_start=1 /db_xref="PID:g1334889" /db_xref="SWISS-PROT:P03215" /translation="MKSSKNDTFVYRTWVKTLVVYFVMFVMSAVVPITAMFPNLGYPC YFNALVDYGALNLTNYNLAHHLTPTLYLEPPEMFVYITLVFIADCVAFIYYACGEVAL IKARKKVSGLTDLSAWVSAVGSPTVLFLAILKLWSIQVFIQVLSYKHVFLSAFVYFLH FLASVLHACACVTRFSPVWVVKAQDNSIPQDTFLWWVVFYLKPVVTNLYLGCLALETL VFSLSVFLALGNSFYFMVGDMVLGAVNLFLILPIFWYILTEVWLASFLRHNFGFYCGM FIASIILILPLVRYEAVFVSAKLHTTVAINVAIIPILCSVAMLIRICRIFKSMRQGTD YVPVSETVELELESEPRPRPSRTPSPGRNRRRSSTSSSSSRSTRRQRPVSTQALVSSV LPMTTDSEEEIFP" misc_feature 120260 /note="ACCEPT: ATCTTCCTCCAGGT possible acceptor" misc_feature 120358 /note="polyA signal: AATAAA" CDS complement(120747..120974) /note="BBLF1 late reading frame, possibly homologous to RF 49 VZV" /codon_start=1 /db_xref="PID:g1334890" /db_xref="SWISS-PROT:P03216" /translation="MGALWSLCRRRVNSIGDVDGGIINLYNDYEEFNLETTKLIAAEE GRACGETNEGLEYDEDSENDELLFLPNKKPN" misc_feature complement(120764) /note="polyA signal: AATAAA, 3' end of 0.6kb late, 1.6kb early, 3.0kb early RNAs" CDS complement(120929..122341) /note="BGLF5 early reading frame, homologous to RF 48 VZV and alkaline exonuclease of HSV" /codon_start=1 /db_xref="PID:g1334891" /db_xref="SWISS-PROT:P03217" /translation="MADVDELEDPMEEMTSYTFARFLRSPETEAFVRNLDRPPQMPAM RFVYLYCLCKQIQEFSGETGFCDFVSSLVQENDSKDGPSLKSIYWGLQEATDEQRTVL CSYVESMTRGQSENLMWDILRNGIISSSKLLSTIKNGPTKVFEPAPISTNHYFGGPVA FGLRCEDTVKDIVCKLICGDASANRQFGFMISPTDGIFGVSLDLCVNVESQGDFILFT DRSCIYEIKCRFKYLFSKSEFDPIYPSYTALYKRPCKRSFIRFINSIARPTVEYVPDG RLPSEGDYLLTQDEAWNLKDVRKRKLGPGHDLVADSLAANRGVESMLYVMTDPSENAG RIGIKDRVPVNIFINPRHNYFYQVLLQYKIVGDYVRHSGGGKPGRDCSPRVNIVTAFF RKRSPLDPATCTLGSDLLLDASVEIPVAVLVTPVVLPDSVIRKTLSTAAGSWKAYADN TFDTAPWVPSGLFADDESTP" promoter complement(121331) /note="TATA: TATTAAA BBL1 late promoter before BBLF1" promoter 121669 /note="TATA: CATAAAT" promoter 121697 /note="TATA: TATAAAG" promoter 121772 /note="TATA: CATAAAG" misc_feature 122313 /note="BAM: Bam H1 B/G" CDS complement(122328..123692) /note="BGLF4 early reading frame, homologous to RF 47 VZV; Author-given protein sequence is in conflict with the conceptual translation" /codon_start=1 /db_xref="PID:e25030" /db_xref="PID:g1334892" /db_xref="SWISS-PROT:P13288" /translation="SGWRSSVSRSLRPETSCDRPSSHLRNMDVNMAAELSPTNSSSSG ELSVSPEPPRETQAFLGKVTVIDYFTFQHKHLKVTNIDDMTETLYVKLPENMTRCDHL PITCEYLLGRGSYGAVYAHADNATVKLYDSVTELYHELMVCDMIQIGKATAEDGQDKA LVDYLSACTSCHALFMPQFRCSLQDYGHWHDGSIEPLVRGFQGLKDAVYFLNRHCGLF HSDISPSNILVDFTDTMWGMGRLVLTDYGTASLHDRNKMLDVRLKSSKGRQLYRLYCQ REPFSIAKDTYKPLCLLSKCYILRGAGHIPDPSACGPVGAQTALRLDLQSLGYSLLYG IMHLADSTHKIPYPNPDMGFDRSDPLYFLQFAAPKVVLLEVLSQMWNLNLDMGLTSCG ESPCVDVTAEHMSQFLQWCRSLKKRFKESYFFNCRPRFEHPHLPGLVAELLADDFFGP DGRRG" misc_feature complement(123506) /note="DONOR: AAGGTGACT possible donor" CDS complement(123941..124939) /note="BGLF3 reading frame" /codon_start=1 /db_xref="PID:g1334894" /db_xref="SWISS-PROT:P03220" /translation="MFNAVKADMPDDPMLARRYGQCLELALEACQDTPEQFKLVETPL KSFLLVSNILPQDNRPWHEARSSGRVAEDDYDFSSLALELLPLNPRLPEEWQFGGQGW SSRMEPSQPEMGMGLCFEVFDGDLMRIALAWNKDEVIGQALQILAHSQTWTSLVPEDP LPWMWALFYGPRSHCEERHCVYAAARGKRGPILLPTAVYTPCANIEAFLAHLTRCVYA LYLDVRDWKGEDIAPPFDVSRLNKMAKQLCLLPQEPFCITRVCLLCLLHKQNLNAQYK RPVDTYDPCLILTGEAERYMVDAVGNYREASTGTTVLYPTYDLGSIVADMVTYEDE" promoter complement(124117) /note="TATA: TATAAAA" misc_feature complement(124219) /note="polyA signal: AATAAA" CDS 124938..125915 /note="BGRF1 reading frame, homologous to RF 45 VZV and spliced HSV gene (Costa et al, 1985). Spliced to BDRF1. Northern blots in BGRF1 detect 2.7, 2.6, 2.1kb late and 1.9kb early RNAs. 2.6, 2.1kb RNAs very weak." /codon_start=1 /db_xref="PID:g1334893" /db_xref="SWISS-PROT:P03219" /translation="MLYASQRGRLTENLRNALQQDSTTQGCLGAETPSIMYTGAKSDR WAHPLVGTIHASNLYCPMLRAYCRHYGPRPVFVASDESLPMFGASPALHTPVQVQMCL LPELRDTLQRLLPPPNLEDSEALTEFKTSVSSARAILEDPNFLEMREFVTSLASFLSG QYKHKPARLEAFQKQVVLHSFYFLISIKSLEITDTMFDIFQSAFGLEEMTLEKLHIFK QKASVFLIPRRHGKTWIVVAIISLILSNLSNVQIGYVAHQKHVASAVFTEIIDTLTKS FDSKRVEVNKETSTITFRHSGKISSTVMCATCFNKNVRPDVSVLGNCRA" CDS join(124938..125873,129215..130351) /note="BDRF1 reading frame" /codon_start=1 /product="probable DNA packaging protein" /db_xref="PID:e276433" /db_xref="PID:g1632798" /translation="MLYASQRGRLTENLRNALQQDSTTQGCLGAETPSIMYTGAKSDR WAHPLVGTIHASNLYCPMLRAYCRHYGPRPVFVASDESLPMFGASPALHTPVQVQMCL LPELRDTLQRLLPPPNLEDSEALTEFKTSVSSARAILEDPNFLEMREFVTSLASFLSG QYKHKPARLEAFQKQVVLHSFYFLISIKSLEITDTMFDIFQSAFGLEEMTLEKLHIFK QKASVFLIPRRHGKTWIVVAIISLILSNLSNVQIGYVAHQKHVASAVFTEIIDTLTKS FDSKRVEVNKETSTITFRHSGKISSTVMCATCFNKNSIRGQTFHLLFVDEANFIKKEA LPAILGFMLQKDAKIIFISSVNSADQATSFLYKLKDAQERLLNVVSYVCQEHRQDFDM QDSMVSCPCFRLHIPSYITMDSNIRATTNLFLDGAFSTELMGDTSSLSQGSLSRTVRD DAINQLELCRVDTLNPRVAGRLASSLYVYVDPAYTNNTSASGTGIAAVTHDRADPNRV IVLGLEHFFLKDLTGDAALQIATCVVALVSSIVTLHPHLEEVKVAVEGNSSQDSAVAI ASIIGESCPLPCAFVHTKDKTSSLQWPMYLLTNEKSKAFERLIYAVNTASLSASQVTV SNTIQLSFDPVLYLISQIRAIKPIPLRDGTYTYTGKQRNLSDDVLVALVMAHFLATTQ KHTFKKVH" promoter complement(125113) /note="TATA: TATAAAT before BGLF3" misc_feature complement(125484) /note="polyA signal: AATAAA, 3' end of 1.6kb late, 1.8kb late, 3.0kb late and 3.7kb early RNAs" CDS complement(125863..126873) /note="BGLF2 late reading frame, poor homology to RF 44 VZV" /codon_start=1 /db_xref="PID:g1334895" /db_xref="SWISS-PROT:P03221" /translation="MASAANSSREQLRKFLNKECLWVLSDASTPQMKVYTATTAVSAV YVPQIAGPPKTYMNVTLIVLKPKKKPTYVTVYINGTLATVARPEVLFTKAVQGPHSLT LMYFGVFSDAVGEAVPVEIRGNPVVTCTDLTTAHVFTTSTAVKTVEELQDITPSEIIP LGRGGAWYAEGALYMFFVNMDMLMCCPNMPTFPSLTHFINLLTRCDNGECVTCYGAGA HVNILRGWTEDDSPGTSGTCPCLLPCTALNNDYVPITGHRALLGLMFKPEDAPFVVGL RFNPPKMHPDMSRVLQGVLANGKEVPCTAQPWTLLRFSDLYSRAMLYNCQVLKRQVLH SY" promoter 126277 /note="TATA: GATAAAA" CDS complement(126851..128374) /note="BGLF1 late reading frame" /codon_start=1 /db_xref="PID:g1334896" /db_xref="SWISS-PROT:P03222" /translation="MDVHIDNQVLSGLGTPLLVHLFVPDTVMAELCPNRVPNCEGAWC QTLFSDRTGLTRVCRVFAARGMLPGRPSHRGTFTSVPVYCDEGLPELYNPFHVAALRF YDEGGLVGELQIYYLSLFEGAKRALTDGHLIREASGVQESAAAMQPIPIDPGPPGGAG IEHMPVAAAQVEHPKTYDLKQILLEITQEENRGEQRLGHAGSPALCLGLRLRAGAETK AAAETSVSKHHPALENPSNIRGSAGGEGGGGRAGTGGTVGVGSGALSRVPVSFSKTRR AIRESRALVRGIAHIFSPHALYVVTYPELSAQGRLHRMTAVTHASPATDLAEVSILGA PEREFRFLISVALRISASFREKLAMQAWTAQQEIPVVIPTSYSRIYKNSDLIREAFFT VQTRVSWESCWVKATISNAPKTPDACLWIDSHPLYEEGASAWGKVIDSRPPGGLVGAA SQLVALGTDGHCVHLATTSDGQAFLVLPGGFVIKGQLALTPEERGYILARHGIRREQ" promoter complement(126929) /note="TATA: TATTAAA EEL8 late promoter before BGLF2, gives 1.6kb late RNA" promoter complement(127237) /note="TATA: TATAAAA, potential promoter for 1.8kb late RNA" misc_feature 128029 /note="polyA signal: AATAAA" CDS complement(128344..129021) /note="BDLF4 early reading frame" /codon_start=1 /db_xref="PID:g1334897" /db_xref="SWISS-PROT:P03223" /translation="MSDQGRLSLPRGEGGTDEPNPRHLCSYSKLEFHLPLPESMASVF ACWGCGEYHVCDGSSECTLIETHEGVVCALTGNYMGPHFQPALRPWTEIRQDTQDQRD KWEPEQVQGLVKTVVNHLYHYFLNENVISGVSEALFDQEGALRPHIPALVSFVFPCCL MLFRGASSEKVVDVVLSLYIHVIISIYSQKTVYGALLFKSTRNKRYDAVAKRMRELWM STLTTKC" promoter complement(128432) /note="TATA: TATTTAA before BGLF1, potential promoter for 3.0kb late RNA" misc_feature 128848 /note="BAM: Bam H1 G/D" promoter complement(129054) /note="TATA: TATTTGC before BDLF4, potential promoter for 3.7kb early RNA" mat_peptide 129188..130348 /note="BDRF1 reading frame, homologous to RF 42 VZV and spliced gene in HSV (Costa et al, 1985). Spliced from BGRF1. Northern blots in BDRF1 detect 2.7, 2.6 kb late and 1.9kb early RNAs. Possibly also 1.8kb early RNA." promoter 129374 /note="TATA: TATAAGC" promoter complement(129377) /note="TATA: TATAAAG" misc_feature 129413 /note="DONOR: GTGGTAAGT possible donor" misc_feature 130347 /note="polyA signal: ATTAAA" misc_feature complement(130359) /note="polyA signal: AATAAA, 3' end of 0.9kb late RNA, 2.3kb late RNA and 3.2kb late RNA" CDS complement(130362..131066) /note="BDLF3 late reading frame 9xNXT/S" /codon_start=1 /db_xref="PID:g1334899" /db_xref="SWISS-PROT:P03224" /translation="MAHARDKAGAVMAMILICETSLIWTSSGSSTASAGNVTGTTAVT TPSPSASGPSTNQSTTLTTTSAPITTTAILSTNTTTVTFTGTTVTPVPTTSNASTINV TTKVTAQNITATEAGTGTSTGVTSNVTTRSSSTTSATTRITNATTLAPTLSSKGTSNA TKTTAELPTVPDERQPSLSYGLPLWTLVFVGLTFLMLILIFAAGLMMSAKNKPLDEAL LTNAVTRDPSLYKGLV" promoter complement(131104) /note="TATA: TATAAAA EEL4 late promoter before BDLF3, gives 0.9kb late RNA" CDS complement(131127..132389) /note="BDLF2 late reading frame" /codon_start=1 /db_xref="PID:g1334900" /db_xref="SWISS-PROT:P03225" /translation="MVDEQVAVEHGTVSHTISREEDGVVHERRVLASGERVEVFYKAP APRPREGRASTFHDFTVPAAAAVPGPEPEPEPHPPMPIHANGGGETKTNTQDQNQNQT TRTRTNAKAEERTAEMDDTMASSGGQRGAPISADLLSLSSLTGRMAAMAPSWMKSEVC GERMRFKEDVYDGEAETLAEPPRCFMLSFVFIYYCCYLAFLALLAFGFNPLFLPSFMP VGAKVLRGKGRDFGVPLSYGCPTNPFCKVYTLIPAVVINNVTYYPNNTDSHGGHGGFE AAALHVAALFESGCPNLQAVTNRNRTFNVTRASGRVERRLVQDMQRVLASAVVVMHHH CHYETYYVFDGVGPEFGTIPTPCFKDVLAFRPSLVTNCTAPLKTSVKGPNWSGAAGGM KRKQCRVDRLTDRSFPAYLEEVMYVMVQ" promoter 132266 /note="TATA: TATAAAA" CDS complement(132400..133305) /note="BDLF1 late reading frame, poor homology to RF 41 VZV" /codon_start=1 /db_xref="PID:g1334901" /db_xref="SWISS-PROT:P25214" /translation="MDLKVVVSLSSRLYTDEIAKMQQRIGCILPLASTHGTQNVQGLG LGQVYSLETVPDYVSMYNYLSDCTLAVLDEVSVDSLILTKIVPGQTYAIKNKYQPFFQ WHGTGSLSVMPPVFGREHATVKLESNDVDIVFPMVLPTPIAEEVLQKILLFNVYSRVV MQAPGNADMLDVHMHLGSVSYLGHHYELALPEVPGPLGLALLDNLSLYFCIMVTLLPR ASMRLVRGLIRHEHHDLLNLFQEMVPDEIARIDLDDLSVADDLSRMRVMMTYLQSLAS LFNLGPRLATAAYSQETLTATCWLR" promoter complement(132476) /note="TATA: TATTTAA before BDLF2, likely promoter for 2.3kb late RNA" misc_feature complement(133312) /note="polyA signal: AATAAA, 3' end of 4.5kb late RNA" CDS complement(133321..137466) /note="BcLF1 late reading frame, homologous to RF 40 VZV and major capsid protein of HSV" /codon_start=1 /db_xref="PID:g1334902" /db_xref="SWISS-PROT:P03226" /translation="MASNEGVENRPFPYLTVDADLLSNLRQSAAEGLFHSFDLLVGKD AREAGIKFEVLLGVYTNAIQYVRFLETALAVSCVNTEFKDLSRMTDGKIQFRISVPTI AHGDGRRPSKQRTFIVVKNCHKHHISTEMELSMLDLEILHSIPETPVEYAEYVGAVKT VASALQFGVDALERGLINTVLSVKLRHAPPMFILQTLADPTFTERGFSKTVKSDLIAM FKRHLLEHSFFLDRAENMGSGFSQYVRSRLSEMVAAVSGESVLKGVSTYTTAKGGEPV GGVFIVTDNVLRQLLTFLGEEADNQIMGPSSYASFVVRGENLVTAVSYGRVMRTFEHF MARIVDSPEKAGSTKSDLPAVAAGVEDQPRVPISAAVIKLGNHAVAVESLQKMYNDTQ SPYPLNRRMQYSYYFPVGLFMPNPKYTTSAAIKMLDNPTQQLPVEAWIVNKNNLLLAF NLQNALKVLCHPRLHTPAHTLNSLNAAPAPRDRRETYSLQHRRPNHMNVLVIVDEFYD NKYAAPVTDIALKCGLPTEDFLHPSNYDLLRLELHPLYDIYIGRDAGERARHRAVHRL MVGNLPTPLAPAAFQEARGQQFETATSLAHVVDQAVIETVQDTAYDTAYPAFFYVVEA MIHGFEEKFVMNVPLVSLCINTYWERSGRLAFVNSFSMIKFICRHLGNNAISKEAYSM YRKIYGELIALEQALMRLAGSDVVGDESVGQYVCALLDPNLLPPVAYTDIFTHLLTVS DRAPQIIIGNEVYADTLAAPQFIERVGNMDEMAAQFVALYGYRVNGDHDHDFRLHLGP YVDEGHADVLEKIFYYVFLPTCTNAHMCGLGVDFQHVAQTLAYNGPAFSHHFTRDEDI LDNLENGTLRDLLEISDLRPTVGMIRDLSASFMTCPTFTRAVRVSVDNDVTQQLAPNP ADKRTEQTVLVNGLVAFAFSERTRAVTQCLFHAIPFHMFYGDPRVAATMHQDVATFVM RNPQQRAVEAFNRPEQLFAEYREWHRSPMGKYAAECLPSLVSISGMTAMHIKMSPMAY IAQAKLKIHPGVAMTVVRTDEILSENILFSSRASTSMFIGTPNVSRREARVDAVTFEV HHEMASIDTGLSYSSTMTPARVAAITTDMGIHTQDFFSVFPAEAFGNQQVNDYIKAKV GAQRNGTLLRDPRTYLAGMTNVNGAPGLCHGQQATCEIIVTPVTADVAYFQKSNSPRG RAACVVSCENYNQEVAEGLIYDHSRPDAAYEYRSTVNPWASQLGSLGDIMYNSSYRQT AVPGLYSPCRAFFNKEELLRNNRGLYNMVNEYSQRLGGHPATSNTEVQFVVIAGTDVF LEQPCSFLQEAFPALSASSRALIDEFMSVKQTHAPIHYGHYIIEEVAPVRRILKFGNK VVF" misc_feature complement(133332) /note="DONOR: AAGGTGGTT possible donor" promoter complement(133352) /note="TATA: TATTAAA before BDLF1" promoter complement(133386) /note="TATA: TATATAA" misc_feature 135178 /note="polyA signal: AATAAA" promoter 135394 /note="TATA: TATAAGT" misc_feature 136624 /note="polyA signal: AATAAA" misc_feature 136868 /note="BAM: Bam H1 D/c" promoter complement(137710) /note="TATA: TATTAAA EHL1 promoter before BcLF1, gives 4.5kb late RNA" promoter 137857 /note="TATA: CATAAAC" CDS 137991..139718 /note="BcRF1 reading frame" /codon_start=1 /db_xref="PID:e25010" /db_xref="PID:g1632799" /db_xref="SWISS-PROT:P25215" /translation="MLAHLNQVTRIPPCPPFSGREARLKFHFFSWSTFMLSWPNNATL REIRTRAATNLTHHPHLVDTLYHASPQTPFLTRSGALYRFVTCCNCTLPNISIQQCKA GDRPGDLEIILQSNGGGRPASFQFPSSPTGSLLRCIVAASLLPEVSVGHQELSPLRSR SQGGQTDVRSGPDPARRLVALLRREDGAPKDPPLGPFGHPRGPGPAKSEDEESERRDA PPPPLDSSFQASRLVPVGPGFRLLVFNTNRVINTKLVCSEPLVKMRVCNVPRLINNFV ARKYVVKETAFTVSLFFTDGVGANLAINVNISGTYLSFLLAMTSLRCFLPVEAIYPAA VSNWNSTLDLHGLENQSLVRENRSGVFWTTNFPSVVSCRDGLNVSWFKAATATISRVH GQTLEQHLIREITPIVTHREAKISRIKNRLFTLLELRNRSQIQVLHKRFLEGLLDCAS LLRLDPSCINRIASEGLFDFSKRSIAHSKNRHECALLGHRHSANVTKLVVNERKTRLD ILGRNANFLTRCKHQVNLRQSPIFLTLLRHIRRRLGLGRASVKREITLLLAHLRKKTA PIHCRDAQV" misc_feature 138019 /note="BAM: Bam H1 c/b" misc_feature 139352 /note="BAM: Bam H1 b/T" CDS 139642..140916 /note="BTRF1 reading frame. Northern blots detect 0.95 late and 3.8kb early RNA; Author-given protein sequence is in conflict with the conceptual translation" /codon_start=1 /db_xref="PID:e25011" /db_xref="PID:g1334904" /db_xref="SWISS-PROT:P30119" /translation="NERLPFSWPTCAKRQPPSTAVMLKCKQPGARFIHGAVHLPSGQI VFHTIHSPTLASALGLPGENVPIPALFRASGLNVRESLPMTNMRAPIISLARLILAPN PYILEGQLTVGMTQDNGIPVLFARPVIEVKSGPESNIKASSQLMIAEDSCLNQIAPFS ASEHPAFSMVESVKRVRVDEGANTRRTIRDILEIPVTVLSSLQLSPTKSILKKAPEPP PPEPQATFDATPYARIFYDIGRQVPKLGNAPAAQVSNVLIANRSHNSLRLVPNPDLLP LQHLYLKHVVLKSLNLENIVQDFEAIFTSPSDTISEAETKAFEKLVEQAKNTVENIVF CLNSICSTSTLPDVVPDVNNPNISLALEKYFLMFPPSGTIMRNVRFATPIVRLLCQGA ELGTMAQFLGKYIKVKKETGMYTLVKLYYLLRI" misc_feature complement(140902) /note="polyA signal: AATAAA, 3' end of 2.5kb late RNA" CDS complement(140916..143036) /note="BXLF2 late reading frame, encodes gp85; homologous to RF 37 VZV and glycoprotein H of HSV (gpIII of VZV)" /codon_start=1 /db_xref="PID:g1334905" /db_xref="SWISS-PROT:P03231" /translation="MQLLCVFCLVLLWEVGAASLSEVKLHLDIEGHASHYTIPWTELM AKVPGLSPEALWREANVTEDLASMLNRYKLIYKTSGTLGIALAEPVDIPAVSEGSMQV DASKVHPGVISGLNSPACMLSAPLEKQLFYYIGTMLPNTRPHSYVFYQLRCHLSYVAL SINGDKFQYTGAMTSKFLMGTYKRVTEKGDEHVLSLVFGKTKDLPDLRGPFSYPSLTS AQSGDYSLVIVTTFVHYANFHNYFVPNLKDMFSRAVTMTAASYARYVLQKLVLLEMKG GCREPELDTETLTTMFEVSVAFFKVGHAVGETGNGCVDLRWLAKSFFELTVLKDIIGI CYGATVKGMQSYGLERLAAMLMATVKMEELGHLTTEKQEYALRLATVGYPKAGVYSGL IGGATSVLLSAYNRHPLFQPLHTVMRETLFIGSHVVLRELRLNVTTQGPNLALYQLLS TALCSALEIGEVLRGLALGTESGLFSPCYLSLRFDLTRDKLLSMAPQEATLDQAAVSN AVDGFLGRLSLEREDRDAWHLPAYKCVDRLDKVLMIIPLINVTFIISSDREVRGSALY EASTTYLSSSLFLSPVIMNKCSQGAVAGEPRQIPKIQNFTRTQKSCIFCGFALLSYDE KEGLETTTYITSQEVQNSILSSNYFDFDNLHVHYLLLTTNGTVMEIAGLYEERAHVVL AIILYFIAFALGIFLVHKIVMFFL" misc_feature 140970 /note="polyA signal: AATAAA" misc_feature complement(141286) /note="polyA signal: AATAAA" promoter 142589 /note="TATA: GATAAAA" misc_feature 142740 /note="BAM: Bam H1 T/X" CDS complement(143038..144861) /note="BXLF1 early reading frame, thymidine kinase (Littler et al, 1986). Weak homology to RF 36 VZV and HSV thymidine kinase. 4.0kb early RNA presumably encodes the TK. Also a 2.2kb late RNA here." /codon_start=1 /db_xref="PID:g1334907" /db_xref="SWISS-PROT:P03177" /translation="MAGFPGKEAGPPGGWRKCQEDESPENERHENFYAEIDDFAPSVL TPTGSDSGAGEEDDDGLYQVPTHWPPLMAPTGLSGERVPCRTQAAVTSNTGNSPGSRH TSCPFTLPRGAQPPAPAHQKPTAPTPKPRSRECGPSKTPDPFSWFRKTSCTEGGADST SRSFMYQKGFEEGLAGLGLDDKSDCESEDESNFRRPSSHSALKQKNGGKGKPSGLFEH LAAHGREFSKLSKHAAQLKRLSGSVMNVLNLDDAQDTRQAKAQRKESMRVPIVTHLTN HVPVIKPACSLFLEGAPGVGKTTMLNHLKAVFGDLTIVVPEPMRYWTHVYENAIKAMH KNVTRARHGREDTSAEVLACQMKFTTPFRVLASRKRSLLVTESGARSVAPLDCWILHD RHLLSASVVFPLMLLRSQLLSYSDFIQVLATFTADPGDTIVWMKLNVEENMRRLKKRG RKHESGLDAGYLKSVNDAYHAVYCAWLLTQYFAPEDIVKVCAGLTTITTVCHQSHTPI IRSGVAEKLYKNSIFSVLKEVIQPFRADAVLLEVCLAFTRTLAYLQFVLVDLSEFQDD LPGCWTEIYMQALKNPAIRSQFFDWAGLSKVISDFERGNRD" promoter complement(143310) /note="TATA: TATAAGA ECL2 late promoter before BXLF2, gives 2.5kb late RNA" misc_feature 143608 /note="polyA signal: AATAAA" misc_feature 144791 /note="ACCEPT: TCTTTCGTTTTCAGG poss. acceptor before BXRF1" CDS 144860..145606 /note="BXRF1 late reading frame, homologous to RF 35 VZV. Basic (core?) protein." /codon_start=1 /db_xref="PID:g1334906" /db_xref="SWISS-PROT:P03232" /translation="MDPTRGLCALSTHDLAKFHSLPPARKAAGKRAHLRCYSKLLSLK SWEQLASFLSLPPGPTFTDFRLFFEVTLGRRIADCVVVALQPYPRCYIVEFKTAMSNT ANPQSVTRKAQRLEGTAQLCDCANFLRTSCPPVLGSQGLEVLAALVFKNQRSLRTLQV EFPALGQKTLPTSTTGLLNLLSRWQDGALRARLDRPRPTAQGHRPRTHVGPKPSQLTA RVPRSARAGRAGGRKGQVGAVGQVCPGAQK" misc_feature 144862 /note="BAM: Bam H1 X/V" misc_feature 144945 /note="DONOR: CAGGTAAGC possible donor at 3' BXRF1" promoter complement(145135) /note="TATA: TATAACA before BXLF1" promoter 145302 /note="TATA: TATTTAA before BVRF1, potential promoter for 1.9kb early RNA" CDS 145416..147128 /note="BVRF1 early reading frame, homologous to RF 34 VZV" /codon_start=1 /db_xref="PID:g1334908" /db_xref="SWISS-PROT:P03233" /translation="MALSGHVLIDPARLPRDTGPELMWAPSLRNSLRVSPEALELAER EAERARSERWDRCAQVLKNRLLRVELDGIMRDHLARAEEIRQDLDAVVAFSDGLESMQ VRSPSTGGRSAPAPPSPSPAQPFTRLTGNAQYAVSISPTDPPLMVAGSLAQTLLGNLY GNINQWVPSFGPWYRTMSANAMQRRVFPKQLRGNLNFTNSVSLKLMTEVVAVLEGTTQ DFFSDVRHLPDLQAALILSVAYLLLQGGSSHQQRPLPASREELLELGPESLEKIIADL KAKSPGGNFMILTSGNKEARQSIAPLNRQAAYPPGTFADNKIYNLFVGAGLLPTTAAL NVPGAAGRDRDLVYRIANQIFGEDVPPFSSHQWNLRVGLAALEALMLVYTLCETANLA EAATRRLHLSSLLPQAMQRRKPAMASAGMPGAYPVQTLFRHGELFRFIWAHYVRPTVA ADPQASISSLFPGLVLLALELKLMDGQAPSHYAINLTGQKFDTLFEIINQKLLFHDPA AMLAARTQLRLAFEDGVGVALGRPSPMLAAREILERQFSASDDYDRLYFLTLGYLASP VAPS" misc_feature complement(146926) /note="polyA signal: AATAAA" misc_feature 147167 /note="DONOR: AAGGTAAAT possible donor" misc_feature 147170 /note="polyA signal: AATAAA, 3' end of 2.4kb late and 1.9kb early RNAs" promoter 147721 /note="TATA: TATTTAT before BVRF2, potential promoter for 2.1kb early RNA" CDS 147927..149744 /note="BVRF2 early reading frame, N-terminus homologous to RF 33 VZV" /codon_start=1 /db_xref="PID:g1334909" /db_xref="SWISS-PROT:P03234" /translation="MVQAPSVYVCGFVERPDAPPKDACLHLDPLTVKSQLPLKKPLPL TVEHLPDAPVGSVFGLYQSRAGLFSAASITSGDFLSLLDSIYHDCDIAQSQRLPLPRE PKVEALHAWLPSLSLASLHPDIPQTTADGGKLSFFDHVSICALGRRRGTTAVYGTDLA WVLKHFSDLEPSIAAQIENDANAAKRESGCPEDHPLPLTKLIAKAIDAGFLRNRVETL RQDRGVANIPAESYLKASDAPDLQKPDKALQSPPPASTDPATMLSGNAGEGATACGGS AAAGQDLISVPRNTFMTLLQTNLDNKPPRQTPLPYAAPLPPFSHQAIATAPSYGPGAG AVAPAGGYFTSPGGYYAGPAGGDPGAFLAMDAHTYHPHPHPPPAYFGLPGLFGPPPPV PPYYGSHLRADYVPAPSRSNKRKRDPEEDEEGGGLFPGEDATLYRKDIAGLSKSVNEL QHTLQALRRETLSYGHTGVGYCPQQGPCYTHSGPYGFQPHQSYEVPRYVPHPPPPPTS HQAAQAQPPPPGTQAPEAHCVAESTIPEAGAAGNSGPREDTNPQQPTTEGHHRGKKLV QASASGVAQSKEPTTPKAKSVSAHLKSIFCEELLNKRVA" misc_feature 148007 /note="BAM: Bam H1 V/d" promoter 148620 /note="TATA: TATTTAA ECR1 late promoter before BdRF1, gives 1.2kb late RNA" CDS 148707..149744 /note="BdRF1 reading frame; this is the C terminus of BVRF2" /codon_start=1 /db_xref="PID:g1334910" /translation="MLSGNAGEGATACGGSAAAGQDLISVPRNTFMTLLQTNLDNKPP RQTPLPYAAPLPPFSHQAIATAPSYGPGAGAVAPAGGYFTSPGGYYAGPAGGDPGAFL AMDAHTYHPHPHPPPAYFGLPGLFGPPPPVPPYYGSHLRADYVPAPSRSNKRKRDPEE DEEGGGLFPGEDATLYRKDIAGLSKSVNELQHTLQALRRETLSYGHTGVGYCPQQGPC YTHSGPYGFQPHQSYEVPRYVPHPPPPPTSHQAAQAQPPPPGTQAPEAHCVAESTIPE AGAAGNSGPREDTNPQQPTTEGHHRGKKLVQASASGVAQSKEPTTPKAKSVSAHLKSI FCEELLNKRVA" misc_feature 149115 /note="BAM: Bam H1 d/I" misc_feature 149727 /note="polyA signal: AATAAA, 3' end of 2.1kb early and 1.2kb late RNAs" misc_feature complement(149758) /note="polyA signal: AATAAA, 3' end of 1.0kb late, 1.5kb late and 1.8kb late RNAs" CDS complement(149779..150525) /note="BILF2 late reading frame 11xNXT/S" /codon_start=1 /db_xref="PID:g1334911" /db_xref="SWISS-PROT:P03218" /translation="MTHLVLLLCCCVGSVCAFFSDLVKFENVTAHAGARVNLTCSVPS NESVSRIELGRGYTPGDGQLPLAVATSNNGTHITNGGYNYSLTLEWVNDSNTSVSLII PNVTLAHAGYYTCNVTLRNCSVASGVHCNYSAGEEDDQYHANRTLTQRMHLTVIPATT IAPTTLVSHTTSTSHRPHRRPVSKRPTHKPVTLGPFPIDPWRPKTTWVHWALLLITCA VVAPVLLIIIISCLGWLAGWGRRRKGWIPL" promoter complement(150571) /note="TATA: TATTTAG before BILF2. Potential promoter for 1.0kb late RNA." repeat_region 151236..151618 /note="repetitive sequence 3X" misc_feature 151767 /note="polyA signal: AATAAA" promoter complement(151780) /note="TATA: CATAAAA" misc_feature 152012..152013 /note="DEL: B95-8 deletion with respect to Raji" CDS complement(152161..153099) /note="BILF1 reading frame, membrane protein, 3xNXS /T" /codon_start=1 /db_xref="PID:g1334912" /db_xref="SWISS-PROT:P03208" /translation="MLSTMAPGSTVGTLVANMTSVNATEDACTKSYSAFLSGMTSLLL VLLILLTLAGILFIIFVRKLVHRMDVWLIALLIELLLWVLGKMIQEFSSTGLCLLTQN MMFLGLMCSVWTHLGMALEKTLALFSRTPKRTSHRNVCLYLMGVFCLVLLLIIILLIT MGPDANLNRGPNMCREGPTKGMHTAVQGLKAGCYLLAAVLIVLLTVIIIWKLLRTKFG RKPRLICNVTFTGLICAFSWFMLSLPLLFLGEAGSLGFDCTESLVARYYPGPAACLAL LLIILYAWSFSHFMDSLKNQVTVTARYFRRVPSQST" promoter 152230 /note="TATA: CATAAAA" misc_feature 153259 /note="polyA signal: AATAAA" misc_feature 153637 /note="HPN: 22bp 2-fold symmetric" misc_feature complement(153690) /note="DONOR: AAAGTGAGG possible donor" CDS complement(153699..156746) /note="BALF5 DNA polymerase (early), homologous to many DNA polymerases, CMV HFLF2 and RF 28 VZV. 4.5kb early RNA apparently encodes BALF5, RNA ends unknown." /codon_start=1 /db_xref="PID:g1334913" /db_xref="SWISS-PROT:P03198" /translation="MSGGLFYNPFLRPNKGLLKKPDKEYLRLIPKCFQTPGAAGVVDV RGPQPPLCFYQDSLTVVGGDEDGKGMWWRQRAQEGTARPEADTHGSPLDFHVYDILET VYTHEKCAVIPSDKQGYVVPCGIVIKLLGRRKADGASVCVNVFGQQAYFYASAPQGLD VEFAVLSALKASTFDRRTPCRVSVEKVTRRSIMGYGNHAGDYHKITLSHPNSVCHVAT WLQDKHGCRIFEANVDATRRFVLDNDFVTFGWYSCRRAIPRLQHRDSYAELEYDCEVG DLSVRREDSSWPSYQALAFDIECLGEEGFPTATNEADLILQISCVLWSTGEEAGRYRR ILLTLGTCEDIEGVEVYEFPSELDMLYAFFQLIRDLSVEIVTGYNVANFDWPYILDRA RHIYSINPASLGKIRAGGVCEVRRPHDAGKGFLRANTKVRITGLIPIDMYAVCRDKLS LSDYKLDTVARHLLGAKKEDVHYKEIPRLFAAGPEGRRRLGMYCVQDSALVMDLLNHF VIHVEVAEIAKIAHIPCRRVLDDGQQIRVFSCLLAAAQKENFILPMPSASDRDGYQGA TVIQPLSGFYNSPVLVVDFASLYPSIIQAHNLCYSTMITPGEEHRLAGLRPGEDYESF RLTGGVYHFVKKHVHESFLASLLTSWLAKRKAIKKLLAACEDPRQRTILDKQQLAIKC TCNAVYGFTGVANGLFPCLSIAETVTLQGRTMLERAKAFVEALSPANLQALAPSPDAW APLNPEGQLRVIYGDTDSLFIECRGFSESETLRFADALAAHTTRSLFVAPISLEAEKT FSCLMLITKKRYVGVLTDGKTLMKGVELVRKTACKFVQTRCRRVLDLVLADARVKEAA SLLSHRPFQESFTQGLPVGFLPVIDILNQAYTDLREGRVPMGELCFSTELSRKLSAYK STQMPHLAVYQKFVERNEELPQIHDRIQYVFVEPKGGVKGARKTEMAEDPAYAERHGV PVAVDHYFDKLLQGAANILQCLFDNNSGAALSVLQNFTARPPF" misc_feature 154747 /note="BAM: Bam H1 I/A" misc_feature complement(156707) /note="polyA signal: AATAAA; 3' end of 2.5kb late (gB) RNA and 1.8kb late RNA" CDS complement(156749..159322) /note="BALF4 late reading frame 9xNXT/S homologous to HSV1 glycoprotein B (Pellet et al, 1985), CMV HFLF1 and RF 31 VZV (gpII)" /codon_start=1 /db_xref="PID:g1334914" /db_xref="SWISS-PROT:P03188" /translation="MTRRRVLSVVVLLAALACRLGAQTPEQPAPPATTVQPTATRQQT SFPFRVCELSSHGDLFRFSSDIQCPSFGTRENHTEGLLMVFKDNIIPYSFKVRSYTKI VTNILIYNGWYADSVTNRHEEKFSVDSYETDQMDTIYQCYNAVKMTKDGLTRVYVDRD GVNITVNLKPTGGLANGVRRYASQTELYDAPGWLIWTYRTRTTVNCLITDMMAKSNSP FDFFVTTTGQTVEMSPFYDGKNKETFHERADSFHVRTNYKIVDYDNRGTNPQGERRAF LDKGTYTLSWKLENRTAYCPLQHWQTFDSTIATETGKSIHFVTDEGTSSFVTNTTVGI ELPDAFKCIEEQVNKTMHEKYEAVQDRYTKGQEAITYFITSGGLLLAWLPLTPRSLAT VKNLTELTTPTSSPPSSPSPPAPSAARGSTPAAVLRRRRRDAGNATTPVPPTAPGKSL GTLNNPATVQIQFAYDSLRRQINRMLGDLARAWCLEQKRQNMVLRELTKINPTTVMSS IYGKAVAAKRLGDVISVSQCVPVNQATVTLRKSMRVPGSETMCYSRPLVSFSFINDTK TYEGQLGTDNEIFLTKKMTEVCQATSQYYFQSGNEIHVYNDYHHFKTIELDGIATLQT FISLNTSLIENIDFASLELYSRDEQRASNVFDLEGIFREYNFQAQNIAGLRKDLDNAV SNGRNQFVDGLGELMDSLGSVGQSITNLVSTVGGLFSSLVSGFISFFKNPFGGMLILV LVAGVVILVISLTRRTRQMSQQPVQMLYPGIDELAQQHASGEGPGINPISKTELQAIM LALHEQNQEQKRAAQRAAGPSVASRALQAARDRFPGLRRRRYHDPETAAALLGEAETE