Patent application title:

NUCLEIC ACID-GUIDED NUCLEASES

Publication number:

US20250066818A1

Publication date:
Application number:

18/945,973

Filed date:

2024-11-13

Smart Summary: Nucleic acid-guided nucleases are special tools that can cut DNA at specific locations. They use guide nucleic acids to find the right spot to make changes in the genetic material. These systems can help scientists edit genes more accurately and efficiently. They can be used for advanced genetic engineering techniques that allow for repeated modifications or tracking of changes. Overall, these tools improve our ability to work with and understand genetic information. 🚀 TL;DR

Abstract:

Disclosed herein are nucleic acid-guided nucleases, guide nucleic acids, and targetable nuclease systems, and methods of use. Disclosed herein are engineered non-naturally occurring nucleic acid-guided nucleases, guide nucleic acids, and targetable nuclease systems, and methods of use. Targetable nuclease systems can be used to edit genetic targets, including recursive genetic engineering and trackable genetic engineering methods.

Inventors:

Applicant:

Interested in similar patents?

Get notified when new applications in this technology area are published.

Classification:

C12N15/902 »  CPC main

Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor; Recombinant DNA-technology; Introduction of foreign genetic material using processes not otherwise provided for, e.g. co-transformation; Stable introduction of foreign DNA into chromosome using homologous recombination

C12N15/111 »  CPC further

Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor; Recombinant DNA-technology; DNA or RNA fragments; Modified forms thereof General methods applicable to biologically active non-coding nucleic acids

C12N15/905 »  CPC further

Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor; Recombinant DNA-technology; Introduction of foreign genetic material using processes not otherwise provided for, e.g. co-transformation; Stable introduction of foreign DNA into chromosome using homologous recombination in yeast

C12N15/907 »  CPC further

Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor; Recombinant DNA-technology; Introduction of foreign genetic material using processes not otherwise provided for, e.g. co-transformation; Stable introduction of foreign DNA into chromosome using homologous recombination in mammalian cells

C12N15/8509 »  CPC further

Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor; Recombinant DNA-technology; Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression; Vectors or expression systems specially adapted for eukaryotic hosts for animal cells for producing genetically modified animals, e.g. transgenic

C12N2310/20 »  CPC further

Structure or type of the nucleic acid; Type of nucleic acid involving clustered regularly interspaced short palindromic repeats [CRISPRs]

C12N2800/22 »  CPC further

Nucleic acids vectors Vectors comprising a coding region that has been codon optimised for expression in a respective host

C12N15/90 IPC

Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor; Recombinant DNA-technology; Introduction of foreign genetic material using processes not otherwise provided for, e.g. co-transformation Stable introduction of foreign DNA into chromosome

C12N9/22 »  CPC further

Enzymes; Proenzymes; Compositions thereof ; Processes for preparing, activating, inhibiting, separating or purifying enzymes; Hydrolases (3) acting on ester bonds (3.1) Ribonucleases RNAses, DNAses

C12N15/11 »  CPC further

Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor; Recombinant DNA-technology DNA or RNA fragments; Modified forms thereof

C12N15/70 »  CPC further

Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor; Recombinant DNA-technology; Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression Vectors or expression systems specially adapted for E. coli

C12N15/81 »  CPC further

Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor; Recombinant DNA-technology; Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression; Vectors or expression systems specially adapted for eukaryotic hosts for fungi for yeasts

C12N15/85 »  CPC further

Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor; Recombinant DNA-technology; Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression; Vectors or expression systems specially adapted for eukaryotic hosts for animal cells

Description

RELATED APPLICATIONS

This application is a Continuation of patent application U.S. Ser. No. 17/554,736, entitled “Nucleic Acid-Guided Nucleases” filed Dec. 17, 2021, now allowed; which is a Continuation of patent application U.S. Ser. No. 17/387,860, entitled “Nucleic Acid-guided Nucleases” filed Jul. 28, 2021, now U.S. Pat. No. 11,220,697; which is a Continuation of patent application U.S. Ser. No. 17/179,193, entitled “Nucleic Acid-Guided Nucleases” filed Feb. 18, 2021, now U.S. Pat. No. 11,130,970; which is a Continuation of patent application U.S. Ser. No. 16/819,896, entitled “Nucleic Acid-Guided Nucleases” filed Mar. 16, 2020; which is a Continuation of patent application U.S. Ser. No. 16/548,631, entitled “Nucleic Acid-Guided Nucleases” filed Aug. 22, 2019, now U.S. Pat. No. 10,626,416; which is a Continuation of patent application U.S. Ser. No. 15/896,433, entitled “Nucleic Acid-Guided Nucleases” filed Feb. 14, 2018, now U.S. Pat. No. 10,435,714; which is a Continuation of patent application U.S. Ser. No. 15/631,989, entitled “Nucleic Acid-Guided Nucleases” filed Jun. 23, 2017, now U.S. Pat. No. 10,011,849.

INCORPORATION BY REFERENCE

Submitted with the present application is an electronically filed sequence listing via EFS-Web as an ASCII formatted sequence listing, entitled “INSC104US8_seqlist_20220309”, created Mar. 9, 2022, and 791,000 bytes in size. The sequence listing is part of the specification filed herewith and is incorporated by reference in its entirety.

BACKGROUND OF THE DISCLOSURE

Nucleic acid-guided nucleases have become important tools for research and genome engineering. The applicability of these tools can be limited by the sequence specificity requirements, expression, or delivery issues.

SEQUENCE LISTING

This application contains a sequence list in Table 6.

SUMMARY OF THE DISCLOSURE

Disclosed herein are methods of modifying a target region in the genome of a cell, the method comprising: (a) contacting a cell with: a non-naturally occurring nucleic-acid-guided nuclease encoded by a nucleic acid having at least 80% identity to SEQ ID NO: 22; an engineered guide nucleic acid capable of complexing with the nucleic acid-guided nuclease; and an editing sequence encoding a nucleic acid complementary to said target region having a change in sequence relative to the target region; and (b) allowing the nuclease, guide nucleic acid, and editing sequence to create a genome edit in a target region of the genome of the cell. In some aspects, the engineered guide nucleic acid and the editing sequence are provided as a single nucleic acid. In some aspects, the single nucleic acid further comprises a mutation in a protospacer adjacent motif (PAM) site. In some aspects, the nucleic acid-guided nuclease is encoded by a nucleic acid with at least 85% identity to SEQ ID NO: 42. In some aspects, the nucleic acid-guided nuclease is encoded by a nucleic acid with at least 85% identity to SEQ ID NO: 128.

Disclosed herein are nucleic acid-guided nuclease systems comprising: (a) a non-naturally occurring nuclease encoded by a nucleic acid having at least 80% identity to SEQ ID NO: 22; (b) an engineered guide nucleic acid capable of complexing with the nucleic acid-guided nuclease, and (c) an editing sequence having a change in sequence relative to the sequence of a target region in a genome of a cell; wherein the system results in a genome edit in the target region in the genome of the cell facilitated by the nuclease, the engineered guide nucleic acid, and the editing sequence. In some aspects, nucleic acid-guided nuclease is encoded by a nucleic acid with at least 85% identity to SEQ ID NO: 42. In some aspects, the nucleic acid-guided nuclease is encoded by a nucleic acid with at least 85% identity to SEQ ID NO: 128. In some aspects, the nucleic acid-guided nuclease is codon optimized for the cell to be edited. In some aspects, the engineered guide nucleic acid and the editing sequence are provided as a single nucleic acid. In some aspects, the single nucleic acid further comprises a mutation in a protospacer adjacent motif (PAM) site.

Disclosed herein are compositions for use in genome editing comprising a non-naturally occurring nuclease encoded by a nucleic acid having at least 75% identity to SEQ ID NO: 22. In some aspects, the nucleic acid has at least 80% identity to SEQ ID NO: 22. In some aspects, the nucleic acid has at least 90% identity to SEQ ID NO: 22. In some aspects, the nuclease is further codon optimized for use in cells from a particular organism. In some aspects, the nuclease is codon optimized for E. Coli In some aspects, the nuclease is codon optimized for S. Cerevisiae. In some aspects, the nuclease is codon optimized for mammalian cells. In some aspects, the nucleic acid-guided nuclease has less than 40% protein identity to SEQ ID NO: 12. In some aspects, the nucleic acid-guided nuclease has less than 40% protein identity to SEQ ID NO: 108.

INCORPORATION BY REFERENCE

All publications and patent applications mentioned in this specification are herein incorporated by reference to the same extent as if each individual publication or patent application was specifically and individually indicated to be incorporated by reference.

BRIEF DESCRIPTION OF THE DRAWINGS

This patent or application file contains at least one drawing executed in color. Copies of this patent or patent application publication with color drawing(s) will be provided by the Office upon request and payment of the necessary fee.

FIG. 1A depicts a partial sequence alignment MAD1-8 (SEQ ID NO: 1-8) and MAD10-12 (SEQ ID NO: 10-12).

FIG. 1B depicts a phylogenetic tree of nucleases including MAD1-8.

FIG. 2 depicts an example protein expression construct.

FIG. 3 depicts an example editing cassette.

FIG. 4 depicts an example screening or selection experiment workflow.

FIG. 5A depicts an example protein expression construct.

FIG. 5B depicts an example editing cassette.

FIG. 5C depicts an example screening or selection experiment workflow.

FIG. 6A depicts an example protein expression construct.

FIG. 6B depicts an example editing cassette.

FIG. 6C depicts an example screening or selection experiment workflow.

FIG. 7A-7B depicts example data from a functional nuclease complex screening or selection experiment.

FIG. 8 depicts example data from a targetable nuclease complex-based editing experiment.

FIG. 9 depicts example data from a targetable nuclease complex-based editing experiment.

FIGS. 10A-10C depict example data from a targetable nuclease complex-based editing experiment.

FIG. 11 depicts a example sequence alignment of select sequences from an editing experiment.

FIG. 12 depicts example data from a targetable nuclease complex-based editing experiment.

FIG. 13A depicts an example alignment of scaffold sequences.

FIG. 13B depicts an example model of a nucleic acid-guided nuclease complexed with a guide nucleic acid and a target sequence.

FIG. 14A-14B depict example data from a primer validation experiment.

FIG. 15 depicts example data from a targetable nuclease complex-based editing experiment.

FIG. 16 depicts example validation data comparing results from two different assays.

FIG. 17A-17C depict an example trackable genetic engineering workflow, including a plasmid comprising an editing cassette and a recording cassette, and downstream sequencing of barcodes in order to identify the incorporated edit or mutation.

FIG. 18 depicts an example trackable genetic engineering workflow, including iterative rounds of engineering with a different editing cassette and recorder cassette with unique barcode (BC) at each round, which can be followed by selection and tracking to confirm the successful engineering step at each round.

FIG. 19 depicts an example recursive engineering workflow.

DETAILED DESCRIPTION OF THE DISCLOSURE

The present disclosure provides nucleic acid-guided nucleases and methods of use. Often, the subject nucleic-acid guided nucleases are part of a targetable nuclease system comprising a nucleic acid-guided nuclease and a guide nucleic acid. A subject targetable nuclease system can be used to cleave, modify, and/or edit a target polynucleotide sequence, often referred to as a target sequence. A subject targetable nuclease system refers collectively to transcripts and other elements involved in the expression of or directing the activity of genes, which may include sequences encoding a subject nucleic acid-guided nuclease protein and a guide nucleic acid as disclosed herein.

Methods, systems, vectors, polynucleotides, and compositions described herein may be used in various applications including altering or modifying synthesis of a gene product, such as a protein, polynucleotide cleavage, polynucleotide editing, polynucleotide splicing; trafficking of target polynucleotide, tracing of target polynucleotide, isolation of target polynucleotide, visualization of target polynucleotide, etc. Aspects of the invention also encompass methods and uses of the compositions and systems described herein in genome engineering, e.g. for altering or manipulating the expression of one or more genes or the one or more gene products, in prokaryotic, archaeal, or eukaryotic cells, in vitro, in vivo or ex vivo.

Nucleic Acid-Guided Nucleases

Bacterial and archaeal targetable nuclease systems have emerged as powerful tools for precision genome editing. However, naturally occurring nucleases have some limitations including expression and delivery challenges due to the nucleic acid sequence and protein size. Targetable nucleases that require PAM recognition are also limited in the sequences they can target throughout a genetic sequence. Other challenges include processivity, target recognition specificity and efficiency, and nuclease acidity efficiency, which often effect genetic editing efficiency.

Non-naturally occurring targetable nucleases and non-naturally occurring targetable nuclease systems can address many of these challenges and limitations.

Disclosed herein are non-naturally targetable nuclease systems. Such targetable nuclease systems are engineered to address one or more of the challenges described above and can be referred to as engineered nuclease systems. Engineered nuclease systems can comprise one or more of an engineered nuclease, such as an engineered nucleic acid-guided nuclease, an engineered guide nucleic acid, an engineered polynucleotides encoding said nuclease, or an engineered polynucleotides encoding said guide nucleic acid. Engineered nucleases, engineered guide nucleic acids, and engineered polynucleotides encoding the engineered nuclease or engineered guide nucleic acid are not naturally occurring and are not found in nature. It follows that engineered nuclease systems including one or more of these elements are non-naturally occurring.

Non-limiting examples of types of engineering that can be done to obtain a non-naturally occurring nuclease system are as follows. Engineering can include codon optimization to facilitate expression or improve expression in a host cell, such as a heterologous host cell. Engineering can reduce the size or molecular weight of the nuclease in order to facilitate expression or delivery. Engineering can alter PAM selection in order to change PAM specificity or to broaden the range of recognized PAMs. Engineering can alter, increase, or decrease stability, processivity, specificity, or efficiency of a targetable nuclease system. Engineering can alter, increase, or decrease protein stability. Engineering can alter, increase, or decrease processivity of nucleic acid scanning. Engineering can alter, increase, or decrease target sequence specificity. Engineering can alter, increase, or decrease nuclease activity. Engineering can alter, increase, or decrease editing efficiency. Engineering can alter, increase, or decrease transformation efficiency. Engineering can alter, increase, or decrease nuclease or guide nucleic acid expression.

Examples of non-naturally occurring nucleic acid sequences which are disclosed herein include sequences codon optimized for expression in bacteria, such as E. coli (e.g., SEQ ID NO: 41-60), sequences codon optimized for expression in single cell eukaryotes, such as yeast (e.g., SEQ ID NO: 127-146), sequences codon optimized for expression in multi cell eukaryotes, such as human cells (e.g., SEQ ID NO: 147-166), polynucleotides used for cloning or expression of any sequences disclosed herein (e.g., SEQ ID NO: 61-80), plasmids comprising nucleic acid sequences (e.g., SEQ ID NO: 21-40) operably linked to a heterologous promoter or nuclear localization signal or other heterologous element, proteins generated from engineered or codon optimized nucleic acid sequences (e.g., SEQ ID NO: 1-20), or engineered guide nucleic acids comprising any one of SEQ ID NO: 84-107. Such non-naturally occurring nucleic acid sequences can be amplified, cloned, assembled, synthesized, generated from synthesized oligonucleotides or dNTPs, or otherwise obtained using methods known by those skilled in the art.

Disclosed herein are nucleic acid-guided nucleases. Subject nucleases are functional in vitro, or in prokaryotic, archaeal, or eukaryotic cells for in vitro, in vivo, or ex vivo applications. Suitable nucleic acid-guided nucleases can be from an organism from a genus which includes but is not limited to Thiomicrospira, Succinivibrio, Candidatus, Porphyromonas, Acidaminococcus, Acidomonococcus, Prevotella, Smithella, Moraxella, Synergistes, Francisella, Leptospira, Catenibacterium, Kandleria, Clostridium, Dorea, Coprococcus, Enterococcus, Fructobacillus, Weissella, Pediococcus, Corynebacter, Sutterella, Legionella, Treponema, Roseburia, Filifactor, Eubacterium, Streptococcus, Lactobacillus, Mycoplasma, Bacteroides, Flaviivola, Flavobacterium, Sphaerochaeta, Azospirillum, Gluconacetobacter, Neisseria, Roseburia, Parvibaculum, Staphylococcus, Nitratifractor, Mycoplasma, Alicyclobacillus, Brevibacilus, Bacillus, Bacteroidetes, Brevibacilus, Carnobacterium, Clostridiaridium, Clostridium, Desulfonatronum, Desulfovibrio, Helcococcus, Leptotrichia, Listeria, Methanomethyophilus, Methylobacterium, Opitutaceae, Paludibacter, Rhodobacter, Sphaerochaeta, Tuberibacillus, Oleiphilus, Omnitrophica, Parcubacteria, and Campylobacter. Species of organism of such a genus can be as otherwise herein discussed. Suitable nucleic acid-guided nucleases can be from an organism from a genus or unclassified genus within a kingdom which includes but is not limited to Firmicute, Actinobacteria, Bacteroidetes, Proteobacteria, Spirochates, and Tenericutes. Suitable nucleic acid-guided nucleases can be from an organism from a genus or unclassified genus within a phylum which includes but is not limited to Erysipelotrichia, Clostridia, Bacilli, Actinobacteria, Bacteroidetes, Flavobacteria, Alphaproteobacteria, Betaproteobacteria, Gammaproteobacteria, Deltaproteobacteria, Epsilonproteobacteria, Spirochaetes, and Mollicutes. Suitable nucleic acid-guided nucleases can be from an organism from a genus or unclassified genus within an order which includes but is not limited to Clostridiales, Lactobacillales, Actinomycetales, Bacteroidales, Flavobacteriales, Rhizobiales, Rhodospirillales, Burkholderiales, Neisseriales, Legionellales, Nautiliales, Campylobacterales, Spirochaetales, Mycoplasmatales, and Thiotrichales. Suitable nucleic acid-guided nucleases can be from an organism from a genus or unclassified genus within a family which includes but is not limited to Lachnospiraceae, Enterococcaceae, Leuconostocaceae, Lactobacillaceae, Streptococcaceae, Peptostreptococcaceae, Staphylococcaceae, Eubacteriaceae, Corynebacterineae, Bacteroidaceae, Flavobacterium, Cryomoorphaceae, Rhodobiaceae, Rhodospirillaceae, Acetobacteraceae, Sutterellaceae, Neisseriaceae, Legionellaceae, Nautiliaceae, Campylobacteraceae, Spirochaetaceae, Mycoplasmataceae, Pisciririckettsiaceae, and Francisellaceae. Other nucleic acid-guided nucleases have been describe in US Patent Application Publication No. US20160208243 filed Dec. 18, 2015, US Application Publication No. US20140068797 filed Mar. 15, 2013, U.S. Pat. No. 8,697,359 filed Oct. 15, 2013, and Zetsche et al., Cell 2015 Oct. 22; 163(3):759-71, each of which are incorporated herein by reference in their entirety.

Some nucleic acid-guided nucleases suitable for use in the methods, systems, and compositions of the present disclosure include those derived from an organism such as, but not limited to, Thiomicrospira sp. XS5, Eubacterium rectale, Succinivibrio dextrinosolvens, Candidatus Methanoplasma termitum, Candidatus Methanomethylophilus alvus, Porphyromonas crevioricanis, Flavobacterium branchiophilum, Acidaminococcus Sp., Acidomonococcus sp., Lachnospiraceae bacterium COE1, Prevotella brevis ATCC 19188, Smithella sp. SCADC, Moraxella bovoculi, Synergistes jonesii, Bacteroidetes oral taxon 274, Francisella tularensis, Leptospira inadai serovar Lyme str. 10, Acidomonococcus sp. crystal structure (5B43) S. mutans, S. agalactiae, S. equisimilis, S. sanguinis, S. pneumonia; C. jejuni, C. coli; N. salsuginis, N. tergarcus; S. auricularis, S. carnosus; N. meningitides, N. gonorrhoeae; L. monocytogenes, L. ivanovii; C. botulinum, C. difficile, C. tetani, C. sordellii; Francisella tularensis 1, Prevotella albensis, Lachnospiraceae bacterium MC2017 1, Butyrivibrio proteoclasticus, Butyrivibrio proteoclasticus B316, Peregrinibacteria bacterium GW2011_GWA2_33_10, Parcubacteria bacterium GW2011_GWC2_44_17, Smithella sp. SCADC, Acidaminococcus sp. BV3L6, Lachnospiraceae bacterium MA2020, Candidatus Methanoplasma termitum, Eubacterium eligens, Moraxella bovoculi 237, Leptospira inadai, Lachnospiraceae bacterium ND2006, Porphyromonas crevioricanis 3, Prevotella disiens, Porphyromonas macacae, Catenibacterium sp. CAG:290, Kandleria vitulina, Clostridiales bacterium KA00274, Lachnospiraceae bacterium 3-2, Dorea longicatena, Coprococcus catus GD/7, Enterococcus columbae DSM 7374, Fructobacillus sp. EFB-N1, Weissella halotolerans, Pediococcus acidilactici, Lactobacillus curvatus, Streptococcus pyogenes, Lactobacillus versmoldensis, Filifactor alocis ATCC 35896, Alicyclobacillus acidoterrestris, Alicyclobacillus acidoterrestris ATCC 49025, Desulfovibrio inopinatus, Desulfovibrio inopinatus DSM 10711, Oleiphilus sp. Oleiphilus sp. HI0009, Candidtus kefeldibacteria, Parcubacteria CasY.4, Omnitrophica WOR 2 bacterium GWF2, Bacillus sp. NSP2.1, and Bacillus thermoamylovorans.

In some instances, a nucleic acid-guided nuclease disclosed herein comprises an amino acid sequence comprising at least 50% amino acid identity to any one of SEQ ID NO: 1-20. In some instances, a nuclease comprises an amino acid sequence comprising at least about 10%, 20%, 30%, 40%, 50%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, greater than 95%, or 100% amino acid identity to any one of SEQ ID NO: 1-20. In some cases, the nucleic acid-guided nuclease comprises at least about 50%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, greater than 95%, amino acid identity to any one of SEQ ID NO: 1-20. In some cases, the nucleic acid-guided nuclease comprises at least about 50%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, greater than 95%, amino acid identity to any one of SEQ ID NO: 1-8 or 10-12. In some cases, the nucleic acid-guided nuclease comprises at least about 50%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, greater than 95%, amino acid identity to any one of SEQ ID NO: 1-8 or 10-11. In some cases, the nucleic acid-guided nuclease comprises at least about 50%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, greater than 95%, amino acid identity to SEQ ID NO: 2. In some cases, the nucleic acid-guided nuclease comprises at least about 50%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, greater than 95%, amino acid identity to SEQ ID NO: 7.

In some cases, the nucleic acid-guided nuclease comprises any one of SEQ ID NO: 1-20. In some cases, the nucleic acid-guided nuclease comprises any one of SEQ ID NO: 1-8 or 10-12. In some cases, the nucleic acid-guided nuclease comprises any one of SEQ ID NO: 1-8 or 10-11. In some cases, the nucleic acid-guided nuclease comprises SEQ ID NO: 2. In some cases, the nucleic acid-guided nuclease comprises SEQ ID NO: 7.

In some instances, a nucleic acid-guided nuclease comprises an amino acid sequence comprising at most 10%, 15%, 20%, 25%, 30%, 35%, 40%, 45%, 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, or 95% amino acid identity to any one of SEQ ID NO: 12 or SEQ ID NO: 108-110. In some instances, a nucleic acid-guided nuclease comprises an amino acid sequence comprising at most 50% amino acid identity to any one of SEQ ID NO: 12 or SEQ ID NO: 108-110. In some instances, a nucleic acid-guided nuclease comprises an amino acid sequence comprising at most 45% amino acid identity to any one of SEQ ID NO: 12 or SEQ ID NO: 108-110. In some instances, a nucleic acid-guided nuclease comprises an amino acid sequence comprising at most 40% amino acid identity to any one of SEQ ID NO: 12 or SEQ ID NO: 108-110. In some instances, a nucleic acid-guided nuclease comprises an amino acid sequence comprising at most 35% amino acid identity to any one of SEQ ID NO: 12 or SEQ ID NO: 108-110. In some instances, a nucleic acid-guided nuclease comprises an amino acid sequence comprising at most 30% amino acid identity to any one of SEQ ID NO: 12 or SEQ ID NO: 108-110.

In some instances, a nucleic acid-guided nuclease disclosed herein is encoded by a nucleic acid sequence comprising at least 50% sequence identity to any one of SEQ ID NO: 21-40. In some instances, a nuclease is encoded by a nucleic acid sequence comprising at least about 10%, 20%, 30%, 40%, 50%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, greater than 95%, or 100% sequence identity to any one of SEQ ID NO: 21-40. In some instances, a nuclease is encoded by a nucleic acid sequence comprising at least about 50%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, greater than 95%, sequence identity to any one of SEQ ID NO: 21-40. In some cases, the nucleic acid-guided nuclease is encoded by the nucleic acid sequence comprising at least about 50%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, greater than 95%, sequence identity to any one of SEQ ID NO: 21-40. In some cases, the nucleic acid-guided nuclease is encoded by the nucleic acid sequence comprising at least about 50%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, greater than 95%, sequence identity to any one of SEQ ID NO: 21-28 or 30-32. In some cases, the nucleic acid-guided nuclease is encoded by the nucleic acid sequence comprising at least about 50%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, greater than 95%, sequence identity to any one of SEQ ID NO: 21-28 or 30-31. In some cases, the nucleic acid-guided nuclease is encoded by the nucleic acid sequence comprising at least about 50%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, greater than 95%, sequence identity to SEQ ID NO: 22. In some cases, the nucleic acid-guided nuclease is encoded by the nucleic acid sequence comprising at least about 50%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, greater than 95%, sequence identity to SEQ ID NO: 27.

In some cases, the nucleic acid-guided nuclease is encoded by the nucleic acid sequence of any one of SEQ ID NO: 21-40. In some cases, the nucleic acid-guided nuclease is encoded by the nucleic acid sequence of any one of SEQ ID NO: 21-28 or 30-32. In some cases, the nucleic acid-guided nuclease is encoded by the nucleic acid sequence of any one of SEQ ID NO: 21-28 or 30-31. In some cases, the nucleic acid-guided nuclease is encoded by the nucleic acid sequence of SEQ ID NO: 22. In some cases, the nucleic acid-guided nuclease is encoded by the nucleic acid sequence of SEQ ID NO: 27.

In some instances, a nucleic acid-guided nuclease disclosed herein is encoded on a nucleic acid sequence. Such a nucleic acid can be codon optimized for expression in a desired host cell. Suitable host cells can include, as non-limiting examples, prokaryotic cells such as E. coli, P. aeruginosa, B. subtilus, and V. natriegens, and eukaryotic cells such as S. cerevisiae, plant cells, insect cells, nematode cells, amphibian cells, fish cells, or mammalian cells, including human cells.

A nucleic acid sequence encoding a nucleic acid-guided nuclease can be codon optimized for expression in gram positive bacteria, e.g., Bacillus subtilis, or gram negative bacteria, e.g., E. coli. In some instances, a nucleic acid-guided nuclease disclosed herein is encoded by a nucleic acid sequence comprising at least 50% sequence identity to any one of SEQ ID NO: 41-60. In some instances, a nuclease is encoded by a nucleic acid sequence comprising at least about 10%, 20%, 30%, 40%, 50%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, greater than 95%, or 100% sequence identity to any one of SEQ ID NO: 41-60. In some cases, the nucleic acid-guided nuclease is encoded by the nucleic acid sequence comprising at least about 50%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, greater than 95% sequence identity to any one of SEQ ID NO: 41-60. In some cases, the nucleic acid-guided nuclease is encoded by the nucleic acid sequence comprising at least about 50%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, greater than 95% sequence identity to any one of SEQ ID NO: 41-48 or 50-52. In some cases, the nucleic acid-guided nuclease is encoded by the nucleic acid sequence comprising at least about 50%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, greater than 95% sequence identity to any one of SEQ ID NO: 41-48 or 50-51. In some cases, the nucleic acid-guided nuclease is encoded by the nucleic acid sequence comprising at least about 50%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, greater than 95% sequence identity to SEQ ID NO: 42. In some cases, the nucleic acid-guided nuclease is encoded by the nucleic acid sequence comprising at least about 50%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, greater than 95% sequence identity to SEQ ID NO: 47.

In some cases, the nucleic acid-guided nuclease is encoded by the nucleic acid sequence of any one of SEQ ID NO: 41-60. In some cases, the nucleic acid-guided nuclease is encoded by the nucleic acid sequence of any one of SEQ ID NO: 41-48 or 50-52. In some cases, the nucleic acid-guided nuclease is encoded by the nucleic acid sequence of any one of SEQ ID NO: 41-48 or 50-51. In some cases, the nucleic acid-guided nuclease is encoded by the nucleic acid sequence of SEQ ID NO: 42. In some cases, the nucleic acid-guided nuclease is encoded by the nucleic acid sequence of SEQ ID NO: 47.

A nucleic acid sequence encoding a nucleic acid-guided nuclease can be codon optimized for expression in a species of yeast, e.g., S. cerevisiae. In some instances, a nucleic acid-guided nuclease disclosed herein is encoded by a nucleic acid sequence comprising at least 50% sequence identity to any one of SEQ ID NO: 127-146. In some instances, a nuclease is encoded by a nucleic acid sequence comprising at least about 10%, 20%, 30%, 40%, 50%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, greater than 95%, or 100% sequence identity to any one of SEQ ID NO: 127-146. In some instances, a nuclease is encoded by a nucleic acid sequence comprising at least about 50%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, greater than 95% sequence identity to any one of SEQ ID NO: 127-146. In some cases, the nucleic acid-guided nuclease is encoded by the nucleic acid sequence comprising at least about 50%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, greater than 95% sequence identity to any one of SEQ ID NO: 127-146. In some cases, the nucleic acid-guided nuclease is encoded by the nucleic acid sequence comprising at least about 50%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, greater than 95% sequence identity to any one of SEQ ID NO: 127-134 or 136-138. In some cases, the nucleic acid-guided nuclease is encoded by the nucleic acid sequence comprising at least about 50%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, greater than 95% sequence identity to any one of SEQ ID NO: 127-134 or 136-137. In some cases, the nucleic acid-guided nuclease is encoded by the nucleic acid sequence comprising at least about 50%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, greater than 95% sequence identity to SEQ ID NO: 128. In some cases, the nucleic acid-guided nuclease is encoded by the nucleic acid sequence comprising at least about 50%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, greater than 95% sequence identity to SEQ ID NO: 133.

In some cases, the nucleic acid-guided nuclease is encoded by the nucleic acid sequence of any one of SEQ ID NO: 127-146. In some cases, the nucleic acid-guided nuclease is encoded by the nucleic acid sequence of any one of SEQ ID NO: 127-134 or 136-138. In some cases, the nucleic acid-guided nuclease is encoded by the nucleic acid sequence of any one of SEQ ID NO: 127-134 or 136-137. In some cases, the nucleic acid-guided nuclease is encoded by the nucleic acid sequence of SEQ ID NO: 128. In some cases, the nucleic acid-guided nuclease is encoded by the nucleic acid sequence of SEQ ID NO: 133.

A nucleic acid sequence encoding a nucleic acid-guided nuclease can be codon optimized for expression in mammalian cells. In some instances, a nucleic acid-guided nuclease disclosed herein is encoded by a nucleic acid sequence comprising at least 50% sequence identity to any one of SEQ ID NO: 147-166. In some instances, a nuclease is encoded by a nucleic acid sequence comprising at least about 10%, 20%, 30%, 40%, 50%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, greater than 95%, or 100% sequence identity to any one of SEQ ID NO: 147-166. In some cases, the nucleic acid-guided nuclease is encoded by the nucleic acid sequence comprising at least about 50%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, greater than 95% sequence identity to any one of SEQ ID NO: 147-166. In some cases, the nucleic acid-guided nuclease is encoded by the nucleic acid sequence comprising at least about 50%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, greater than 95% sequence identity to any one of SEQ ID NO: 147-154 or 156-158. In some cases, the nucleic acid-guided nuclease is encoded by the nucleic acid sequence comprising at least about 50%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, greater than 95% sequence identity to any one of SEQ ID NO: 147-154 or 156-157. In some cases, the nucleic acid-guided nuclease is encoded by the nucleic acid sequence comprising at least about 50%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, greater than 95% sequence identity to SEQ ID NO: 148. In some cases, the nucleic acid-guided nuclease is encoded by the nucleic acid sequence comprising at least about 50%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, greater than 95% sequence identity to SEQ ID NO: 153.

In some cases, the nucleic acid-guided nuclease is encoded by the nucleic acid sequence of any one of SEQ ID NO: 147-166. In some cases, the nucleic acid-guided nuclease is encoded by the nucleic acid sequence of any one of SEQ ID NO: 147-154 or 156-158. In some cases, the nucleic acid-guided nuclease is encoded by the nucleic acid sequence of any one of SEQ ID NO: 147-154 or 156-157. In some cases, the nucleic acid-guided nuclease is encoded by the nucleic acid sequence of SEQ ID NO: 148. In some cases, the nucleic acid-guided nuclease is encoded by the nucleic acid sequence of SEQ ID NO: 153.

A nucleic acid sequence encoding a nucleic acid-guided nuclease can be operably linked to a promoter. Such nucleic acid sequences can be linear or circular. The nucleic acid sequences can be comprised on a larger linear or circular nucleic acid sequences that comprises additional elements such as an origin of replication, selectable or screenable marker, terminator, other components of a targetable nuclease system, such as a guide nucleic acid, or an editing or recorder cassette as disclosed herein. These larger nucleic acid sequences can be recombinant expression vectors, as are described in more detail later.

Guide Nucleic Acid

In general, a guide nucleic acid can complex with a compatible nucleic acid-guided nuclease and can hybridize with a target sequence, thereby directing the nuclease to the target sequence. A subject nucleic acid-guided nuclease capable of complexing with a guide nucleic acid can be referred to as a nucleic acid-guided nuclease that is compatible with the guide nucleic acid. Likewise, a guide nucleic acid capable of complexing with a nucleic acid-guided nuclease can be referred to as a guide nucleic acid that is compatible with the nucleic acid-guided nucleases.

A guide nucleic acid can be DNA. A guide nucleic acid can be RNA. A guide nucleic acid can comprise both DNA and RNA. A guide nucleic acid can comprise modified of non-naturally occurring nucleotides. In cases where the guide nucleic acid comprises RNA, the RNA guide nucleic acid can be encoded by a DNA sequence on a polynucleotide molecule such as a plasmid, linear construct, or editing cassette as disclosed herein.

A guide nucleic acid can comprise a guide sequence. A guide sequence is a polynucleotide sequence having sufficient complementarity with a target polynucleotide sequence to hybridize with the target sequence and direct sequence-specific binding of a complexed nucleic acid-guided nuclease to the target sequence. The degree of complementarity between a guide sequence and its corresponding target sequence, when optimally aligned using a suitable alignment algorithm, is about or more than about 50%, 60%, 75%, 80%, 85%, 90%, 95%, 97.5%, 99%, or more. Optimal alignment may be determined with the use of any suitable algorithm for aligning sequences. In some embodiments, a guide sequence is about or more than about 5, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 35, 40, 45, 50, 75, or more nucleotides in length. In some embodiments, a guide sequence is less than about 75, 50, 45, 40, 35, 30, 25, 20 nucleotides in length. Preferably the guide sequence is 10-30 nucleotides long. The guide sequence can be 15-20 nucleotides in length. The guide sequence can be 15 nucleotides in length. The guide sequence can be 16 nucleotides in length. The guide sequence can be 17 nucleotides in length. The guide sequence can be 18 nucleotides in length. The guide sequence can be 19 nucleotides in length. The guide sequence can be 20 nucleotides in length.

A guide nucleic acid can comprise a scaffold sequence. In general, a “scaffold sequence” includes any sequence that has sufficient sequence to promote formation of a targetable nuclease complex, wherein the targetable nuclease complex comprises a nucleic acid-guided nuclease and a guide nucleic acid comprising a scaffold sequence and a guide sequence. Sufficient sequence within the scaffold sequence to promote formation of a targetable nuclease complex may include a degree of complementarity along the length of two sequence regions within the scaffold sequence, such as one or two sequence regions involved in forming a secondary structure. In some cases, the one or two sequence regions are comprised or encoded on the same polynucleotide. In some cases, the one or two sequence regions are comprised or encoded on separate polynucleotides. Optimal alignment may be determined by any suitable alignment algorithm, and may further account for secondary structures, such as self-complementarity within either the one or two sequence regions. In some embodiments, the degree of complementarity between the one or two sequence regions along the length of the shorter of the two when optimally aligned is about or more than about 25%, 30%, 40%, 50%, 60%, 70%, 80%, 90%, 95%, 97.5%, 99%, or higher. In some embodiments, at least one of the two sequence regions is about or more than about 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 25, 30, 40, 50, or more nucleotides in length.

A scaffold sequence of a subject guide nucleic acid can comprise a secondary structure. A secondary structure can comprise a pseudoknot region. In some cases, binding kinetics of a guide nucleic acid to a nucleic acid-guided nuclease is determined in part by secondary structures within the scaffold sequence. In some cases, binding kinetics of a guide nucleic acid to a nucleic acid-guided nuclease is determined in part by nucleic acid sequence with the scaffold sequence.

A scaffold sequence can comprise the sequence of any one of SEQ ID NO: 84-107. A scaffold sequence can comprise the sequence of any one of SEQ ID NO: 84-103. A scaffold sequence can comprise the sequence of any one of SEQ ID NO: 84-91 or 93-95. A scaffold sequence can comprise the sequence of any one of SEQ ID NO: 88, 93, 94, or 95. A scaffold sequence can comprise the sequence of SEQ ID NO: 88. A scaffold sequence can comprise the sequence of SEQ ID NO: 93. A scaffold sequence can comprise the sequence of SEQ ID NO: 94. A scaffold sequence can comprise the sequence of SEQ ID NO: 95.

In some aspects, the invention provides a nuclease that binds to a guide nucleic acid comprising a conserved scaffold sequence. For example, the nucleic acid-guided nucleases for use in the present disclosure can bind to a conserved pseudoknot region as shown in FIG. 13A. Specifically, the nucleic acid-guided nucleases for use in the present disclosure can bind to a guide nucleic acid comprising a conserved pseudoknot region as shown in FIG. 13A. Certain nucleic acid-guided nucleases for use in the present disclosure can bind to a pseudoknot region having at least 75%, 80%, 85%, 90%, 95%, or 100% sequence identity to the pseudoknot region of Scaffold-1 (SEQ ID NO: 172). Other nucleic acid-guided nucleases for use in the present disclosure can bind to a pseudoknot region having at least 75%, 80%, 85%, 90%, 95%, or 100% sequence identity to the pseudoknot region of Scaffold-3 (SEQ ID NO: 173). Still other nucleic acid-guided nucleases for use in the present disclosure can bind to a pseudoknot region having at least 75%, 80%, 85%, 90%, 95%, or 100% sequence identity to the pseudoknot region of Scaffold-4 (SEQ ID NO: 174). Other nucleic acid-guided nucleases for use in the present disclosure can bind to a pseudoknot region having at least 75%, 80%, 85%, 90%, 95%, or 100% sequence identity to the pseudoknot region of Scaffold-5 (SEQ ID NO: 175). Other nucleic acid-guided nucleases for use in the present disclosure can bind to a pseudoknot region having at least 75%, 80%, 85%, 90%, 95%, or 100% sequence identity to the pseudoknot region of Scaffold-6 (SEQ ID NO: 176). Still other nucleic acid-guided nucleases for use in the present disclosure can bind to a pseudoknot region having at least 75%, 80%, 85%, 90%, 95%, or 100% sequence identity to the pseudoknot region of Scaffold-7 (SEQ ID NO: 177). Other nucleic acid-guided nucleases for use in the present disclosure can bind to a pseudoknot region having at least 75%, 80%, 85%, 90%, 95%, or 100% sequence identity to the pseudoknot region of Scaffold-8 (SEQ ID NO: 178). Other nucleic acid-guided nucleases for use in the present disclosure can bind to a pseudoknot region having at least 75%, 80%, 85%, 90%, 95%, or 100% sequence identity to the pseudoknot region of Scaffold-10 (SEQ ID NO: 179). Still other nucleic acid-guided nucleases for use in the present disclosure can bind to a pseudoknot region having at least 75%, 80%, 85%, 90%, 95%, or 100% sequence identity to the pseudoknot region of Scaffold-11 (SEQ ID NO: 180). Certain nucleic acid-guided nucleases for use in the present disclosure can bind to a pseudoknot region having at least 75%, 80%, 85%, 90%, 95%, or 100% sequence identity to the pseudoknot region of Scaffold-12 (SEQ ID NO: 181). Additional sequences in FIG. 13A include those for the consensus sequence (SEQ ID No: 190); frame 1 (SEQ ID No: 191); scaffold-1 (SEQ ID No: 192); scaffold-2 (SEQ ID No: 193); scaffold-3 (SEQ ID No: 194); scaffold-4 (SEQ ID No: 195); scaffold-5 (SEQ ID No: 196); scaffold-6 (SEQ ID No: 197); scaffold-7 (SEQ ID No: 198); scaffold-8 (SEQ ID No: 199); scaffold-10 (SEQ ID No: 200); scaffold-11 (SEQ ID No: 201); and scaffold-12 (SEQ ID No: 202).

A guide nucleic acid can comprise the sequence of any one of SEQ ID NO: 84-107. A guide nucleic acid can comprise the sequence of any one of SEQ ID NO: 84-103. A guide nucleic acid can comprise the sequence of any one of SEQ ID NO: 84-91 or 93-95. A guide nucleic acid can comprise the sequence of any one of SEQ ID NO: 88, 93, 94, or 95. A guide nucleic acid can comprise the sequence of SEQ ID NO: 88. A guide nucleic acid can comprise the sequence of SEQ ID NO: 93. A guide nucleic acid can comprise the sequence of SEQ ID NO: 94. A guide nucleic acid can comprise the sequence of SEQ ID NO: 95.

In aspects of the invention the terms “guide nucleic acid” refers to one or more polynucleotides comprising 1) a guide sequence capable of hybridizing to a target sequence and 2) a scaffold sequence capable of interacting with or complexing with an nucleic acid-guided nuclease as described herein. A guide nucleic acid may be provided as one or more nucleic acids. In specific embodiments, the guide sequence and the scaffold sequence are provided as a single polynucleotide.

A guide nucleic acid can be compatible with a nucleic acid-guided nuclease when the two elements can form a functional targetable nuclease complex capable of cleaving a target sequence. Often, a compatible scaffold sequence for a compatible guide nucleic acid can be found by scanning sequences adjacent to a native nucleic acid-guided nuclease loci. In other words, native nucleic acid-guided nucleases can be encoded on a genome within proximity to a corresponding compatible guide nucleic acid or scaffold sequence.

Nucleic acid-guided nucleases can be compatible with guide nucleic acids that are not found within the nucleases endogenous host. Such orthogonal guide nucleic acids can be determined by empirical testing. Orthogonal guide nucleic acids can come from different bacterial species or be synthetic or otherwise engineered to be non-naturally occurring.

Orthogonal guide nucleic acids that are compatible with a common nucleic acid-guided nuclease can comprise one or more common features. Common features can include sequence outside a pseudoknot region. Common features can include a pseudoknot region. Common features can include a primary sequence or secondary structure.

A guide nucleic acid can be engineered to target a desired target sequence by altering the guide sequence such that the guide sequence is complementary to the target sequence, thereby allowing hybridization between the guide sequence and the target sequence. A guide nucleic acid with an engineered guide sequence can be referred to as an engineered guide nucleic acid. Engineered guide nucleic acids are often non-naturally occurring and are not found in nature.

Targetable Nuclease System

Disclosed herein are targetable nuclease systems. A targetable nuclease system can comprise a nucleic acid-guided nuclease and a compatible guide nucleic acid. A targetable nuclease system can comprise a nucleic acid-guided nuclease or a polynucleotide sequence encoding the nucleic acid-guided nuclease. A targetable nuclease system can comprise a guide nucleic acid or a polynucleotide sequence encoding the guide nucleic acid.

In general, a targetable nuclease system as disclosed herein is characterized by elements that promote the formation of a targetable nuclease complex at the site of a target sequence, wherein the targetable nuclease complex comprises a nucleic acid-guided nuclease and a guide nucleic acid.

A guide nucleic acid together with a nucleic acid-guided nuclease forms a targetable nuclease complex which is capable of binding to a target sequence within a target polynucleotide, as determined by the guide sequence of the guide nucleic acid.

In general, to generate a double stranded break, in most cases a targetable nuclease complex binds to a target sequence as determined by the guide nucleic acid, and the nuclease has to recognize a protospacer adjacent motif (PAM) sequence adjacent to the target sequence.

A targetable nuclease complex can comprise a nucleic acid-guided nuclease of any one of SEQ ID NO: 1-20 and a compatible guide nucleic acid. A targetable nuclease complex can comprise a nucleic acid-guided nuclease of any one of SEQ ID NO: 1-8 or 10-12 and a compatible guide nucleic acid. A targetable nuclease complex can comprise a nucleic acid-guided nuclease of any one of SEQ ID NO: 1-8 or 10-11 and a compatible guide nucleic acid. A targetable nuclease complex can comprise a nucleic acid-guided nuclease of SEQ ID NO: 2 and a compatible guide nucleic acid. A targetable nuclease complex can comprise a nucleic acid-guided nuclease of SEQ ID NO: 7 and a compatible guide nucleic acid. In any of these cases, the guide nucleic acid can comprise a scaffold sequence compatible with the nucleic acid-guided nuclease. In any of these cases, the guide nucleic acid can further comprise a guide sequence. The guide sequence can be engineered to target any desired target sequence. The guide sequence can be engineered to be complementary to any desired target sequence. The guide sequence can be engineered to hybridize to any desired target sequence.

A targetable nuclease complex can comprise a nucleic acid-guided nuclease of any one of SEQ ID NO: 1-20 and a compatible guide nucleic acid comprising any one of SEQ ID NO: 84-107. A targetable nuclease complex can comprise a nucleic acid-guided nuclease of any one of SEQ ID NO: 1-8 or 10-12 and a compatible guide nucleic acid comprising any one of SEQ ID NO: 84-95. A targetable nuclease complex can comprise a nucleic acid-guided nuclease of any one of SEQ ID NO: 1-8 or 10-11 and a compatible guide nucleic acid comprising any one of SEQ ID NO: 84-91 or 93-95. In any of these cases, the guide nucleic acid can further comprise a guide sequence. The guide sequence can be engineered to target any desired target sequence. The guide sequence can be engineered to be complementary to any desired target sequence. The guide sequence can be engineered to hybridize to any desired target sequence.

A targetable nuclease complex can comprise a nucleic acid-guided nuclease of SEQ ID NO: 2 and a compatible guide nucleic acid. A targetable nuclease complex can comprise a nucleic acid-guided nuclease of SEQ ID NO: 2 and a compatible guide nucleic acid comprising any one of SEQ ID NO: 88, 93, 94, or 95. A targetable nuclease complex can comprise a nucleic acid-guided nuclease of SEQ ID NO: 2 and a compatible guide nucleic acid comprising SEQ ID NO: 88. A targetable nuclease complex can comprise a nucleic acid-guided nuclease of SEQ ID NO: 2 and a compatible guide nucleic acid comprising SEQ ID NO: 93. A targetable nuclease complex can comprise a nucleic acid-guided nuclease of SEQ ID NO: 2 and a compatible guide nucleic acid comprising SEQ ID NO: 94. A targetable nuclease complex can comprise a nucleic acid-guided nuclease of SEQ ID NO: 2 and a compatible guide nucleic acid comprising SEQ ID NO: 95. In any of these cases, the guide nucleic acid can further comprise a guide sequence. The guide sequence can be engineered to target any desired target sequence. The guide sequence can be engineered to be complementary to any desired target sequence. The guide sequence can be engineered to hybridize to any desired target sequence.

A targetable nuclease complex can comprise a nucleic acid-guided nuclease of SEQ ID NO: 7 and a compatible guide nucleic acid. A targetable nuclease complex can comprise a nucleic acid-guided nuclease of SEQ ID NO: 7 and a compatible guide nucleic acid comprising any one of SEQ ID NO: 88, 93, 94, or 95. A targetable nuclease complex can comprise a nucleic acid-guided nuclease of SEQ ID NO: 7 and a compatible guide nucleic acid comprising SEQ ID NO: 88. A targetable nuclease complex can comprise a nucleic acid-guided nuclease of SEQ ID NO: 7 and a compatible guide nucleic acid comprising SEQ ID NO: 93. A targetable nuclease complex can comprise a nucleic acid-guided nuclease of SEQ ID NO: 7 and a compatible guide nucleic acid comprising SEQ ID NO: 94. A targetable nuclease complex can comprise a nucleic acid-guided nuclease of SEQ ID NO: 7 and a compatible guide nucleic acid comprising SEQ ID NO: 95. In any of these cases, the guide nucleic acid can further comprise a guide sequence. The guide sequence can be engineered to target any desired target sequence. The guide sequence can be engineered to be complementary to any desired target sequence. The guide sequence can be engineered to hybridize to any desired target sequence.

A target sequence of a targetable nuclease complex can be any polynucleotide endogenous or exogenous to a prokaryotic or eukaryotic cell, or in vitro. For example, the target sequence can be a polynucleotide residing in the nucleus of the eukaryotic cell. A target sequence can be a sequence coding a gene product (e.g., a protein) or a non-coding sequence (e.g., a regulatory polynucleotide or a junk DNA). Without wishing to be bound by theory, it is believed that the target sequence should be associated with a PAM; that is, a short sequence recognized by a targetable nuclease complex. The precise sequence and length requirements for a PAM differ depending on the nucleic acid-guided nuclease used, but PAMs are typically 2-5 base pair sequences adjacent the target sequence. Examples of PAM sequences are given in the examples section below, and the skilled person will be able to identify further PAM sequences for use with a given nucleic acid-guided nuclease. Further, engineering of the PAM Interacting (PI) domain may allow programming of PAM specificity, improve target site recognition fidelity, and increase the versatility of a nucleic acid-guided nuclease genome engineering platform. Nucleic acid-guided nucleases may be engineered to alter their PAM specificity, for example as described in Kleinstiver B P et al. Engineered CRISPR-Cas9 nucleases with altered PAM specificities. Nature. 2015 Jul. 23; 523 (7561): 481-5. doi: 10.1038/nature14592.

A PAM site is a nucleotide sequence in proximity to a target sequence. In most cases, a nucleic acid-guided nuclease can only cleave a target sequence if an appropriate PAM is present. PAMs are nucleic acid-guided nuclease-specific and can be different between two different nucleic acid-guided nucleases. A PAM can be 5′ or 3′ of a target sequence. A PAM can be upstream or downstream of a target sequence. A PAM can be 1, 2, 3, 4, 5, 6, 7, 8, 9, 10 or more nucleotides in length. Often, a PAM is between 2-6 nucleotides in length.

In some examples, a PAM can be provided on a separate oligonucleotide. In such cases, providing PAM on a oligonucleotide allows cleavage of a target sequence that otherwise would not be able to be cleave because no adjacent PAM is present on the same polynucleotide as the target sequence.

Polynucleotide sequences encoding a component of a targetable nuclease system can comprise one or more vectors. In general, the term “vector” refers to a nucleic acid molecule capable of transporting another nucleic acid to which it has been linked. Vectors include, but are not limited to, nucleic acid molecules that are single-stranded, double-stranded, or partially double-stranded; nucleic acid molecules that comprise one or more free ends, no free ends (e.g. circular); nucleic acid molecules that comprise DNA, RNA, or both; and other varieties of polynucleotides known in the art. One type of vector is a “plasmid,” which refers to a circular double stranded DNA loop into which additional DNA segments can be inserted, such as by standard molecular cloning techniques. Another type of vector is a viral vector, wherein virally-derived DNA or RNA sequences are present in the vector for packaging into a virus (e.g. retroviruses, replication defective retroviruses, adenoviruses, replication defective adenoviruses, and adeno-associated viruses). Viral vectors also include polynucleotides carried by a virus for transfection into a host cell. Certain vectors are capable of autonomous replication in a host cell into which they are introduced (e.g. bacterial vectors having a bacterial origin of replication and episomal mammalian vectors). Other vectors (e.g., non-episomal mammalian vectors) are integrated into the genome of a host cell upon introduction into the host cell, and thereby are replicated along with the host genome. Moreover, certain vectors are capable of directing the expression of genes to which they are operatively-linked. Such vectors are referred to herein as “expression vectors.” Common expression vectors of utility in recombinant DNA techniques are often in the form of plasmids. Further discussion of vectors is provided herein.

Recombinant expression vectors can comprise a nucleic acid of the invention in a form suitable for expression of the nucleic acid in a host cell, which means that the recombinant expression vectors include one or more regulatory elements, which may be selected on the basis of the host cells to be used for expression, that is operatively-linked to the nucleic acid sequence to be expressed. Within a recombinant expression vector, “operably linked” is intended to mean that the nucleotide sequence of interest is linked to the regulatory element(s) in a manner that allows for expression of the nucleotide sequence (e.g. in an in vitro transcription/translation system or in a host cell when the vector is introduced into the host cell). With regards to recombination and cloning methods, mention is made of U.S. patent application Ser. No. 10/815,730, published Sep. 2, 2004 as US 2004-0171156 A1, the contents of which are herein incorporated by reference in their entirety.

In some embodiments, a regulatory element is operably linked to one or more elements of a targetable nuclease system so as to drive expression of the one or more components of the targetable nuclease system.

In some embodiments, a vector comprises a regulatory element operably linked to a polynucleotide sequence encoding a nucleic acid-guided nuclease. The polynucleotide sequence encoding the nucleic acid-guided nuclease can be codon optimized for expression in particular cells, such as prokaryotic or eukaryotic cells. Eukaryotic cells can be yeast, fungi, algae, plant, animal, or human cells. Eukaryotic cells may be those of or derived from a particular organism, such as a mammal, including but not limited to human, mouse, rat, rabbit, dog, or non-human mammal including non-human primate.

In general, codon optimization refers to a process of modifying a nucleic acid sequence for enhanced expression in the host cells of interest by replacing at least one codon (e.g. about or more than about 1, 2, 3, 4, 5, 10, 15, 20, 25, 50, or more codons) of the native sequence with codons that are more frequently or most frequently used in the genes of that host cell while maintaining the native amino acid sequence. Various species exhibit particular bias for certain codons of a particular amino acid. Codon bias (differences in codon usage between organisms) often correlates with the efficiency of translation of messenger RNA (mRNA), which is in turn believed to be dependent on, among other things, the properties of the codons being translated and the availability of particular transfer RNA (tRNA) molecules. The predominance of selected tRNAs in a cell is generally a reflection of the codons used most frequently in peptide synthesis. Accordingly, genes can be tailored for optimal gene expression in a given organism based on codon optimization. Codon usage tables are readily available, for example, at the “Codon Usage Database” available at www.kazusa.orjp/codon/(visited Jul. 9, 2002), and these tables can be adapted in a number of ways. See Nakamura, Y., et al. “Codon usage tabulated from the international DNA sequence databases: status for the year 2000” Nucl. Acids Res. 28:292 (2000). Computer algorithms for codon optimizing a particular sequence for expression in a particular host cell are also available, such as Gene Forge (Aptagen; Jacobus, Pa.), are also available. In some embodiments, one or more codons (e.g. 1, 2, 3, 4, 5, 10, 15, 20, 25, 50, or more, or all codons) in a sequence encoding an engineered nuclease correspond to the most frequently used codon for a particular amino acid.

In some embodiments, a vector encodes a nucleic acid-guided nuclease comprising one or more nuclear localization sequences (NLSs), such as about or more than about 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, or more NLSs. In some embodiments, the engineered nuclease comprises about or more than about 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, or more NLSs at or near the amino-terminus, about or more than about 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, or more NLSs at or near the carboxy-terminus, or a combination of these (e.g. one or more NLS at the amino-terminus and one or more NLS at the carboxy terminus). When more than one NLS is present, each may be selected independently of the others, such that a single NLS may be present in more than one copy and/or in combination with one or more other NLSs present in one or more copies. In a preferred embodiment of the invention, the engineered nuclease comprises at most 6 NLSs. In some embodiments, an NLS is considered near the N- or C-terminus when the nearest amino acid of the NLS is within about 1, 2, 3, 4, 5, 10, 15, 20, 25, 30, 40, 50, or more amino acids along the polypeptide chain from the N- or C-terminus. Non-limiting examples of NLSs include an NLS sequence derived from: the NLS of the SV40 virus large T-antigen, having the amino acid sequence PKKKRKV (SEQ ID NO: 111); the NLS from nucleoplasmin (e.g. the nucleoplasmin bipartite NLS with the sequence KRPAATKKAGQAKKKK (SEQ ID NO:112)); the c-myc NLS having the amino acid sequence PAAKRVKLD (SEQ ID NO:113) or RQRRNELKRSP (SEQ ID NO:114); the hRNPA1 M9 NLS having the sequence NQSSNFGPMKGGNFGGRSSGPYGGGGQYFAKPRNQGGY (SEQ ID NO: 115); the sequence RMRIZFKNKGKDTAELRRRRVEVSVELRKAKKDEQILKRRNV (SEQ ID NO:1 116) of the IBB domain from importin-alpha; the sequences VSRKRPRP (SEQ ID NO:117) and PPKKARED (SEQ ID NO:115) of the myoma T protein; the sequence PQPKKKPL (SEQ ID NO:119) of human p53; the sequence SALIKKKKKMAP (SEQ ID NO:120) of mouse c-abl IV; the sequences DRLRR (SEQ ID NO:121) and PKQKKRK (SEQ ID NO:122) of the influenza virus NS1; the sequence RKLKKKIKKL (SEQ ID NO:123) of the Hepatitis virus delta antigen; the sequence REKKKFLKRR (SEQ ID NO: 124) of the mouse Mx1 protein; the sequence KRKGDEVDGVDEVAKKKSKK (SEQ ID NO: 125) of the human poly(ADP-ribose) polymerase; and the sequence RKCLQAGMNLEARKTKK (SEQ ID NO: 126) of the steroid hormone receptors (human) glucocorticoid.

In general, the one or more NLSs are of sufficient strength to drive accumulation of the nucleic acid-guided nuclease in a detectable amount in the nucleus of a eukaryotic cell. In general, strength of nuclear localization activity may derive from the number of NLSs, the particular NLS(s) used, or a combination of these factors. Detection of accumulation in the nucleus may be performed by any suitable technique. For example, a detectable marker may be fused to the nucleic acid-guided nuclease, such that location within a cell may be visualized, such as in combination with a means for detecting the location of the nucleus (e.g. a stain specific for the nucleus such as DAPI). Cell nuclei may also be isolated from cells, the contents of which may then be analyzed by any suitable process for detecting protein, such as immunohistochemistry, Western blot, or enzyme activity assay. Accumulation in the nucleus may also be determined indirectly, such as by an assay for the effect of the nucleic acid-guided nuclease complex formation (e.g. assay for DNA cleavage or mutation at the target sequence, or assay for altered gene expression activity affected by targetable nuclease complex formation and/or nucleic acid-guided nuclease activity), as compared to a control not exposed to the nucleic acid-guided nuclease or targetable nuclease complex, or exposed to a nucleic acid-guided nuclease lacking the one or more NLSs.

A nucleic acid-guided nuclease and one or more guide nucleic acids can be delivered either as DNA or RNA. Delivery of an nucleic acid-guided nuclease and guide nucleic acid both as RNA (unmodified or containing base or backbone modifications) molecules can be used to reduce the amount of time that the nucleic acid-guided nuclease persist in the cell. This may reduce the level of off-target cleavage activity in the target cell. Since delivery of a nucleic acid-guided nuclease as mRNA takes time to be translated into protein, it might be advantageous to deliver the guide nucleic acid several hours following the delivery of the nucleic acid-guided nuclease mRNA, to maximize the level of guide nucleic acid available for interaction with the nucleic acid-guided nuclease protein. In other cases, the nucleic acid-guided nuclease mRNA and guide nucleic acid are delivered concomitantly. In other examples, the guide nucleic acid is delivered sequentially, such as 0.5, 1, 2, 3, 4, or more hours after the nucleic acid-guided nuclease mRNA.

In situations where guide nucleic acid amount is limiting, it may be desirable to introduce a nucleic acid-guided nuclease as mRNA and guide nucleic acid in the form of a DNA expression cassette with a promoter driving the expression of the guide nucleic acid. This way the amount of guide nucleic acid available will be amplified via transcription.

Guide nucleic acid in the form of RNA or encoded on a DNA expression cassette can be introduced into a host cell comprising an nucleic acid-guided nuclease encoded on a vector or chromosome. The guide nucleic acid may be provided in the cassette one or more polynucleotides, which may be contiguous or non-contiguous in the cassette. In specific embodiments, the guide nucleic acid is provided in the cassette as a single contiguous polynucleotide.

A variety of delivery systems can be used to introduce a nucleic acid-guided nuclease (DNA or RNA) and guide nucleic acid (DNA or RNA) into a host cell. These include the use of yeast systems, lipofection systems, microinjection systems, biolistic systems, virosomes, liposomes, immunoliposomes, polycations, lipid:nucleic acid conjugates, virions, artificial virions, viral vectors, electroporation, cell permeable peptides, nanoparticles, nanowires (Shalek et al., Nano Letters, 2012), exosomes. Molecular trojan horses liposomes (Pardridge et al., Cold Spring Harb Protoc; 2010; doi:10.1101/pdb.prot5407) may be used to deliver an engineered nuclease and guide nuclease across the blood brain barrier.

In some embodiments, a editing template is also provided. A editing template may be a component of a vector as described herein, contained in a separate vector, or provided as a separate polynucleotide, such as an oligonucleotide, linear polynucleotide, or synthetic polynucleotide. In some cases, a editing template is on the same polynucleotide as a guide nucleic acid. In some embodiments, a editing template is designed to serve as a template in homologous recombination, such as within or near a target sequence nicked or cleaved by a nucleic acid-guided nuclease as a part of a complex as disclosed herein. A editing template polynucleotide may be of any suitable length, such as about or more than about 10, 15, 20, 25, 50, 75, 100, 150, 200, 500, 1000, or more nucleotides in length. In some embodiments, the editing template polynucleotide is complementary to a portion of a polynucleotide comprising the target sequence. When optimally aligned, a editing template polynucleotide might overlap with one or more nucleotides of a target sequences (e.g. about or more than about 1, 5, 10, 15, 20, 25, 30, 35, 40, or more nucleotides). In some embodiments, when a editing template sequence and a polynucleotide comprising a target sequence are optimally aligned, the nearest nucleotide of the template polynucleotide is within about 1, 5, 10, 15, 20, 25, 50, 75, 100, 200, 300, 400, 500, 1000, 5000, 10000, or more nucleotides from the target sequence.

In many examples, an editing template comprises at least one mutation compared to the target sequence. An editing template can comprise an insertion, deletion, modification, or any combination thereof compared to the target sequence. Examples of some editing templates are described in more detail in a later section.

In some aspects, the invention provides methods comprising delivering one or more polynucleotides, such as or one or more vectors or linear polynucleotides as described herein, one or more transcripts thereof, and/or one or proteins transcribed therefrom, to a host cell. In some aspects, the invention further provides cells produced by such methods, and organisms comprising or produced from such cells. In some embodiments, an engineered nuclease in combination with (and optionally complexed with) a guide nucleic acid is delivered to a cell.

Conventional viral and non-viral based gene transfer methods can be used to introduce nucleic acids in cells, such as prokaryotic cells, eukaryotic cells, mammalian cells, or target tissues. Such methods can be used to administer nucleic acids encoding components of an engineered nucleic acid-guided nuclease system to cells in culture, or in a host organism. Non-viral vector delivery systems include DNA plasmids, RNA (e.g. a transcript of a vector described herein), naked nucleic acid, and nucleic acid complexed with a delivery vehicle, such as a liposome. Viral vector delivery systems include DNA and RNA viruses, which have either episomal or integrated genomes after delivery to the cell. For a review of gene therapy procedures, see Anderson, Science 256:808-813 (1992); Nabel & Feigner, TIBTECH 11:211-217 (1993); Mitani & Caskey, TIBTECH 11:162-166 (1993); Dillon. TIBTECH 11:167-175 (1993); Miller, Nature 357:455-460 (1992); Van Brunt, Biotechnology 6(10):1149-1154 (1988); Vigne, Restorative Neurology and Neuroscience 8:35-36 (1995); Kremer & Perricaudet, British Medical Bulletin 51(1):31-44 (1995); Haddada et al., in Current Topics in Microbiology and Immunology Doerfler and Bohm (eds) (1995); and Yu et al., Gene Therapy 1:13-26 (1994).

Methods of non-viral delivery of nucleic acids include lipofection, microinjection, biolistics, virosomes, liposomes, immunoliposomes, polycation or lipid:nucleic acid conjugates, naked DNA, artificial virions, and agent-enhanced uptake of DNA. Lipofection is described in e.g., U.S. Pat. Nos. 5,049,386, 4,946,787; and 4,897,355) and lipofection reagents are sold commercially (e.g., Transfectam™ and Lipofectin™). Cationic and neutral lipids that are suitable for efficient receptor-recognition lipofection of polynucleotides include those of Felgner, WO 91/17424; WO 91/16024. Delivery can be to cells (e.g. in vitro or ex vivo administration) or target tissues (e.g. in vivo administration).

The preparation of lipid:nucleic acid complexes, including targeted liposomes such as immunolipid complexes, is well known to one of skill in the art (see, e.g., Crystal, Science 270:404-410 (1995); Blaese et al., Cancer Gene Ther. 2:291-297 (1995); Behr et al., Bioconjugate Chem. 5:382-389 (1994); Remy et al., Bioconjugate Chem. 5:647-654 (1994); Gao et al., Gene Therapy 2:710-722 (1995); Ahmad et al., Cancer Res. 52:4817-4820 (1992); U.S. Pat. Nos. 4,186,183, 4,217,344, 4,235,871, 4,261,975, 4,485,054, 4,501,728, 4,774,085, 4,837,028, and 4,946,787).

The use of RNA or DNA viral based systems for the delivery of nucleic acids take advantage of highly evolved processes for targeting a virus to specific cells in culture or in the host and trafficking the viral payload to the nucleus or host cell genome. Viral vectors can be administered directly to cells in culture, patients (in vivo), or they can be used to treat cells in vitro, and the modified cells may optionally be administered to patients (ex vivo). Conventional viral based systems could include retroviral, lentivirus, adenoviral, adeno-associated and herpes simplex virus vectors for gene transfer. Integration in the host genome is possible with the retrovirus, lentivirus, and adeno-associated virus gene transfer methods, often resulting in long term expression of the inserted transgene. Additionally, high transduction efficiencies have been observed in many different cell types and target tissues.

The tropism of a retrovirus can be altered by incorporating foreign envelope proteins, expanding the potential target population of target cells. Lentiviral vectors are retroviral vectors that are able to transduce or infect non-dividing cells and typically produce high viral titers. Selection of a retroviral gene transfer system would therefore depend on the target tissue. Retroviral vectors are comprised of cis-acting long terminal repeats with packaging capacity for up to 6-10 kb of foreign sequence. The minimum cis-acting LTRs are sufficient for replication and packaging of the vectors, which are then used to integrate the therapeutic gene into the target cell to provide permanent transgene expression. Widely used retroviral vectors include those based upon murine leukemia virus (MuLV), gibbon ape leukemia virus (GaLV), Simian Immuno deficiency virus (SIV), human immuno deficiency virus (HIV), and combinations thereof (see, e.g., Buchscher et al., J. Virol. 66:2731-2739 (1992); Johann et al., J. Virol. 66:1635-1640 (1992); Sommnerfelt et al., Virol. 176:58-59 (1990); Wilson et al., J. Virol. 63:2374-2378 (1989); Miller et al., J. Virol. 65:2220-2224 (1991); PCT/US94/05700).

In applications where transient expression is preferred, adenoviral based systems may be used. Adenoviral based vectors are capable of very high transduction efficiency in many cell types and do not require cell division. With such vectors, high titer and levels of expression have been obtained. This vector can be produced in large quantities in a relatively simple system.

Adeno-associated virus (“AAV”) vectors may also be used to transduce cells with target nucleic acids, e.g., in the in vitro production of nucleic acids and peptides, and for in vivo and ex vivo gene therapy procedures (see, e.g., West et al., Virology 160:38-47 (1987); U.S. Pat. No. 4,797,368; WO 93/24641; Kotin, Human Gene Therapy 5:793-801 (1994); Muzyczka, J. Clin. Invest. 94:1351 (1994). Construction of recombinant AAV vectors are described in a number of publications, including U.S. Pat. No. 5,173,414; Tratschin et al., Mol. Cell. Biol. 5:3251-3260 (1985); Tratschin, et al., Mol. Cell. Biol. 4:2072-2081 (1984); Hermonat & Muzyczka, PNAS 81:6466-6470 (1984); and Samulski et al., J. Virol. 63:03822-3828 (1989).

In some embodiments, a host cell is transiently or non-transiently transfected with one or more vectors, linear polynucleotides, polypeptides, nucleic acid-protein complexes, or any combination thereof as described herein. In some embodiments, a cell in transfected in vitro, in culture, or ex vivo. In some embodiments, a cell is transfected as it naturally occurs in a subject. In some embodiments, a cell that is transfected is taken from a subject. In some embodiments, the cell is derived from cells taken from a subject, such as a cell line.

In some embodiments, a cell transfected with one or more vectors, linear polynucleotides, polypeptides, nucleic acid-protein complexes, or any combination thereof as described herein is used to establish a new cell line comprising one or more transfection-derived sequences. In some embodiments, a cell transiently transfected with the components of an engineered nucleic acid-guided nuclease system as described herein (such as by transient transfection of one or more vectors, or transfection with RNA), and modified through the activity of an engineered nuclease complex, is used to establish a new cell line comprising cells containing the modification but lacking any other exogenous sequence.

In some embodiments, one or more vectors described herein are used to produce a non-human transgenic cell, organism, animal, or plant. In some embodiments, the transgenic animal is a mammal, such as a mouse, rat, or rabbit. Methods for producing transgenic cells, organisms, plants, and animals are known in the art, and generally begin with a method of cell transformation or transfection, such as described herein.

Methods of Use

In the context of formation of an engineered nuclease complex, “target sequence” refers to a sequence to which a guide sequence is designed to have complementarity, where hybridization between a target sequence and a guide sequence promotes the formation of a engineered nuclease complex. A target sequence may comprise any polynucleotide, such as DNA, RNA, or a DNA-RNA hybrid. A target sequence can be located in the nucleus or cytoplasm of a cell. A target sequence can be located in vitro or in a cell-free environment.

Typically, formation of an engineered nuclease complex comprising a guide nucleic acid hybridized to a target sequence and complexed with one or more engineered nucleases as disclosed herein results in cleavage of one or both strands in or near (e.g. within 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 20, 50, or more base pairs from) the target sequence. Cleavage can occur within a target sequence, 5′ of the target sequence, upstream of a target sequence, 3′ of the target sequence, or downstream of a target sequence.

In some embodiments, one or more vectors driving expression of one or more components of a targetable nuclease system are introduced into a host cell or in vitro such formation of a targetable nuclease complex at one or more target sites. For example, a nucleic acid-guided nuclease and a guide nucleic acid could each be operably linked to separate regulatory elements on separate vectors. Alternatively, two or more of the elements expressed from the same or different regulatory elements, may be combined in a single vector, with one or more additional vectors providing any components of the targetable nuclease system not included in the first vector. Targetable nuclease system elements that are combined in a single vector may be arranged in any suitable orientation, such as one element located 5′ with respect to (“upstream” of) or 3′ with respect to (“downstream” of) a second element. The coding sequence of one element may be located on the same or opposite strand of the coding sequence of a second element, and oriented in the same or opposite direction. In some embodiments, a single promoter drives expression of a transcript encoding a nucleic acid-guided nuclease and one or more guide nucleic acids. In some embodiments, a nucleic acid-guided nuclease and one or more guide nucleic acids are operably linked to and expressed from the same promoter. In other embodiments, one or more guide nucleic acids or polynucleotides encoding the one or more guide nucleic acids are introduced into a cell or in vitro environment already comprising a nucleic acid-guided nuclease or polynucleotide sequence encoding the nucleic acid-guided nuclease.

When multiple different guide sequences are used, a single expression construct may be used to target nuclease activity to multiple different, corresponding target sequences within a cell or in vitro. For example, a single vector may comprise about or more than about 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 15, 20, or more guide sequences. In some embodiments, about or more than about 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, or more such guide-sequence-containing vectors may be provided, and optionally delivered to a cell or in vitro.

Methods and compositions disclosed herein may comprise more than one guide nucleic acid, wherein each guide nucleic acid has a different guide sequence, thereby targeting a different target sequence. In such cases, multiple guide nucleic acids can be using in multiplexing, wherein multiple targets are targeted simultaneously. Additionally or alternatively, the multiple guide nucleic acids are introduced into a population of cells, such that each cell in a population received a different or random guide nucleic acid, thereby targeting multiple different target sequences across a population of cells. In such cases, the collection of subsequently altered cells can be referred to as a library.

Methods and compositions disclosed herein may comprise multiple different nucleic acid-guided nucleases, each with one or more different corresponding guide nucleic acids, thereby allowing targeting of different target sequences by different nucleic acid-guided nucleases. In some such cases, each nucleic acid-guided nuclease can correspond to a distinct plurality of guide nucleic acids, allowing two or more non overlapping, partially overlapping, or completely overlapping multiplexing events.

In some embodiments, the nucleic acid-guided nuclease has DNA cleavage activity or RNA cleavage activity. In some embodiments, the nucleic acid-guided nuclease directs cleavage of one or both strands at the location of a target sequence, such as within the target sequence and/or within the complement of the target sequence. In some embodiments, the nucleic acid-guided nuclease directs cleavage of one or both strands within about 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 15, 20, 25, 50, 100, 200, 500, or more base pairs from the first or last nucleotide of a target sequence.

In some embodiments, a nucleic acid-guided nuclease may form a component of an inducible system. The inducible nature of the system would allow for spatiotemporal control of gene editing or gene expression using a form of energy. The form of energy may include but is not limited to electromagnetic radiation, sound energy, chemical energy, light energy, temperature, and thermal energy. Examples of inducible system include tetracycline inducible promoters (Tet-On or Tet-Off), small molecule two-hybrid transcription activations systems (FKBP, ABA, etc), or light inducible systems (Phytochrome, LOV domains, or cryptochorome). In one embodiment, the nucleic acid-guided nuclease may be a part of a Light Inducible Transcriptional Effector (LITE) to direct changes in transcriptional activity in a sequence-specific manner. The components of a light inducible system may include a nucleic acid-guided nuclease, a light-responsive cytochrome heterodimer (e.g. from Arabidopsis thaliana), and a transcriptional activation/repression domain. Further examples of inducible DNA binding proteins and methods for their use are provided in U.S. 61/736,465 and U.S. 61/721,283, which is hereby incorporated by reference in its entirety. An inducible system can be temperature inducible such that the system is turned on or off by increasing or decreasing the temperature. In some temperature inducible systems, increasing the temperature turns the system on. In some temperature inducible systems, increasing the temperature turns the system off.

In some aspects, the invention provides for methods of modifying a target sequence in vitro, or in a prokaryotic or eukaryotic cell, which may be in vivo, ex vivo, or in vitro. In some embodiments, the method comprises sampling a cell or population of cells such as prokaryotic cells, or those from a human or non-human animal or plant (including micro-algae), and modifying the cell or cells. Culturing may occur at any stage in vitro or ex vivo. The cell or cells may even be re-introduced into the host, such as a non-human animal or plant (including micro-algae). For re-introduced cells it is particularly preferred that the cells are stem cells.

In some embodiments, the method comprises allowing a targetable nuclease complex to bind to the target sequence to effect cleavage of said target sequence, thereby modifying the target sequence, wherein the targetable nuclease complex comprises a nucleic acid-guided nuclease complexed with a guide nucleic acid wherein the guide sequence of the guide nucleic acid is hybridized to a target sequence within a target polynucleotide.

In some aspects, the invention provides a method of modifying expression of a target polynucleotide in in vitro or in a prokaryotic or eukaryotic cell. In some embodiments, the method comprises allowing a targetable nuclease complex to bind to a target sequence with the target polynucleotide such that said binding results in increased or decreased expression of said target polynucleotide; wherein the targetable nuclease complex comprises an nucleic acid-guided nuclease complexed with a guide nucleic acid, and wherein the guide sequence of the guide nucleic acid is hybridized to a target sequence within said target polynucleotide. Similar considerations apply as above for methods of modifying a target polynucleotide. In fact, these sampling, culturing and re-introduction options apply across the aspects of the present invention.

In some aspects, the invention provides kits containing any one or more of the elements disclosed in the above methods and compositions. Elements may provide individually or in combinations, and may be provided in any suitable container, such as a vial, a bottle, or a tube. In some embodiments, the kit includes instructions in one or more languages, for example in more than one language.

In some embodiments, a kit comprises one or more reagents for use in a process utilizing one or more of the elements described herein. Reagents may be provided in any suitable container. For example, a kit may provide one or more reaction or storage buffers. Reagents may be provided in a form that is usable in a particular assay, or in a form that requires addition of one or more other components before use (e.g. in concentrate or lyophilized form). A buffer can be any buffer, including but not limited to a sodium carbonate buffer, a sodium bicarbonate buffer, a borate buffer, a Tris buffer, a MOPS buffer, a HEPES buffer, and combinations thereof. In some embodiments, the buffer is alkaline. In some embodiments, the buffer has a pH from about 7 to about 10. In some embodiments, the kit comprises one or more oligonucleotides corresponding to a guide sequence for insertion into a vector so as to operably link the guide sequence and a regulatory element. In some embodiments, the kit comprises a editing template.

In some aspects, the invention provides methods for using one or more elements of a engineered targetable nuclease system. A targetable nuclease complex of the disclosure provides an effective means for modifying a target sequence within a target polynucleotide. A targetable nuclease complex of the disclosure has a wide variety of utility including modifying (e.g., deleting, inserting, translocating, inactivating, activating) a target sequence in a multiplicity of cell types. As such a targetable nuclease complex of the invention has a broad spectrum of applications in, e.g., biochemical pathway optimization, genome-wide studies, genome engineering, gene therapy, drug screening, disease diagnosis, and prognosis. An exemplary targetable nuclease complex comprises a nucleic acid-guided nuclease as disclosed herein complexed with a guide nucleic acid, wherein the guide sequence of the guide nucleic acid can hybridize to a target sequence within the target polynucleotide. A guide nucleic acid can comprise a guide sequence linked to a scaffold sequence. A scaffold sequence can comprise one or more sequence regions with a degree of complementarity such that together they form a secondary structure. In some cases, the one or more sequence regions are comprised or encoded on the same polynucleotide. In some cases, the one or more sequence regions are comprised or encoded on separate polynucleotides.

Provided herein are methods of cleaving a target polynucleotide. The method comprises cleaving a target polynucleotide using a targetable nuclease complex that binds to a target sequence within a target polynucleotide and effect cleavage of said target polynucleotide. Typically, the targetable nuclease complex of the invention, when introduced into a cell, creates a break (e.g., a single or a double strand break) in the target sequence. For example, the method can be used to cleave a target gene in a cell, or to replace a wildtype sequence with a modified sequence.

The break created by the targetable nuclease complex can be repaired by a repair process such as the error prone non-homologous end joining (NHEJ) pathway, the high fidelity homology-directed repair (HDR), or by recombination pathways. During these repair processes, an editing template can be introduced into the genome sequence. In some methods, the HDR or recombination process is used to modify a target sequence. For example, an editing template comprising a sequence to be integrated flanked by an upstream sequence and a downstream sequence is introduced into a cell. The upstream and downstream sequences share sequence similarity with either side of the site of integration in the chromosome, target vector, or target polynucleotide.

An editing template can be DNA or RNA, e.g., a DNA plasmid, a bacterial artificial chromosome (BAC), a yeast artificial chromosome (YAC), a viral vector, a linear piece of DNA, a PCR fragment, oligonucleotide, synthetic polynucleotide, a naked nucleic acid, or a nucleic acid complexed with a delivery vehicle such as a liposome or poloxamer.

An editing template polynucleotide can comprise a sequence to be integrated (e.g, a mutated gene). A sequence for integration may be a sequence endogenous or exogenous to the cell. Examples of a sequence to be integrated include polynucleotides encoding a protein or a non-coding RNA (e.g., a microRNA). Thus, the sequence for integration may be operably linked to an appropriate control sequence or sequences. Alternatively, the sequence to be integrated may provide a regulatory function. Sequence to be integrated may be a mutated or variant of an endogenous wildtype sequence. Alternatively, sequence to be integrated may be a wildtype version of an endogenous mutated sequence. Additionally or alternatively, sequenced to be integrated may be a variant or mutated form of an endogenous mutated or variant sequence.

Upstream and downstream sequences in an editing template polynucleotide can be selected to promote recombination between the target polynucleotide of interest and the editing template polynucleotide. The upstream sequence can be a nucleic acid sequence having sequence similarity with the sequence upstream of the targeted site for integration. Similarly, the downstream sequence can be a nucleic acid sequence having similarity with the sequence downstream of the targeted site of integration. The upstream and downstream sequences in an editing template can have 75%, 80%, 85%, 90%, 95%, or 100% sequence identity with the targeted polynucleotide. Preferably, the upstream and downstream sequences in the editing template polynucleotide have about 95%, 96%, 97%, 98%, 99%, or 100% sequence identity with the targeted polynucleotide. In some methods, the upstream and downstream sequences in the editing template polynucleotide have about 99% or 100% sequence identity with the targeted polynucleotide.

An upstream or downstream sequence may comprise from about 20 bp to about 2500 bp, for example, about 50, 100, 200, 300, 400, 500, 600, 700, 800, 900, 1000, 1100, 1200, 1300, 1400, 1500, 1600, 1700, 1800, 1900, 2000, 2100, 2200, 2300, 2400, or 2500 bp. In some methods, the exemplary upstream or downstream sequence has about 15 bp to about 50 bp, about 30 bp to about 100 bp, about 200 bp to about 2000 bp, about 600 bp to about 1000 bp, or more particularly about 700 bp to about 1000 bp.

In some methods, the editing template polynucleotide may further comprise a marker. Such a marker may make it easy to screen for targeted integrations. Examples of suitable markers include restriction sites, fluorescent proteins, or selectable markers. The exogenous polynucleotide template of the invention can be constructed using recombinant techniques (see, for example, Green and Sambrook et al., 2014 and Ausubel et al., 2017).

In an exemplary method for modifying a target polynucleotide by integrating an editing template polynucleotide, a double stranded break is introduced into the genome sequence by an engineered nuclease complex, the break can be repaired via homologous recombination using an editing template such that the template is integrated into the target polynucleotide. The presence of a double-stranded break can increase the efficiency of integration of the editing template.

Disclosed herein are methods for modifying expression of a polynucleotide in a cell. Some methods comprise increasing or decreasing expression of a target polynucleotide by using a targetable nuclease complex that binds to the target polynucleotide.

In some methods, a target polynucleotide can be inactivated to effect the modification of the expression in a cell. For example, upon the binding of a targetable nuclease complex to a target sequence in a cell, the target polynucleotide is inactivated such that the sequence is not transcribed, the coded protein is not produced, or the sequence does not function as the wild-type sequence does. For example, a protein or microRNA coding sequence may be inactivated such that the protein is not produced.

In some methods, a control sequence can be inactivated such that it no longer functions as a regulatory sequence. As used herein, “regulatory sequence” can refer to any nucleic acid sequence that effects the transcription, translation, or accessibility of a nucleic acid sequence. Examples of regulatory sequences include, a promoter, a transcription terminator, and an enhancer.

An inactivated target sequence may include a deletion mutation (i.e., deletion of one or more nucleotides), an insertion mutation (i.e., insertion of one or more nucleotides), or a nonsense mutation (i.e., substitution of a single nucleotide for another nucleotide such that a stop codon is introduced). In some methods, the inactivation of a target sequence results in “knockout” of the target sequence.

An altered expression of one or more target polynucleotides associated with a signaling biochemical pathway can be determined by assaying for a difference in the mRNA levels of the corresponding genes between the test model cell and a control cell, when they are contacted with a candidate agent. Alternatively, the differential expression of the sequences associated with a signaling biochemical pathway is determined by detecting a difference in the level of the encoded polypeptide or gene product.

To assay for an agent-induced alteration in the level of mRNA transcripts or corresponding polynucleotides, nucleic acid contained in a sample is first extracted according to standard methods in the art. For instance, mRNA can be isolated using various lytic enzymes or chemical solutions according to the procedures set forth in Green and Sambrook (2014), or extracted by nucleic-acid-binding resins following the accompanying instructions provided by the manufacturers. The mRNA contained in the extracted nucleic acid sample is then detected by amplification procedures or conventional hybridization assays (e.g. Northern blot analysis) according to methods widely known in the art or based on the methods exemplified herein.

For purpose of this invention, amplification means any method employing a primer and a polymerase capable of replicating a target sequence with reasonable fidelity. Amplification may be carried out by natural or recombinant DNA polymerases such as TaqGold™, T7 DNA polymerase, Klenow fragment of E. coli DNA polymerase, and reverse transcriptase. A preferred amplification method is PCR. In particular, the isolated RNA can be subjected to a reverse transcription assay that is coupled with a quantitative polymerase chain reaction (RT-PCR) in order to quantify the expression level of a sequence associated with a signaling biochemical pathway.

Detection of the gene expression level can be conducted in real time in an amplification assay. In one aspect, the amplified products can be directly visualized with fluorescent DNA-binding agents including but not limited to DNA intercalators and DNA groove binders. Because the amount of the intercalators incorporated into the double-stranded DNA molecules is typically proportional to the amount of the amplified DNA products, one can conveniently determine the amount of the amplified products by quantifying the fluorescence of the intercalated dye using conventional optical systems in the art. DNA-binding dye suitable for this application include SYBR green, SYBR blue, DAPI, propidium iodine, Hoeste, SYBR gold, ethidium bromide, acridines, proflavine, acridine orange, acriflavine, fluorcoumanin, ellipticine, daunomycin, chloroquine, distamycin D, chromomycin, homidium, mithramycin, ruthenium polypyridyls, anthramycin, and the like.

In another aspect, other fluorescent labels such as sequence specific probes can be employed in the amplification reaction to facilitate the detection and quantification of the amplified products. Probe-based quantitative amplification relies on the sequence-specific detection of a desired amplified product. It utilizes fluorescent, target-specific probes (e.g., TaqMan™ probes) resulting in increased specificity and sensitivity. Methods for performing probe-based quantitative amplification are well established in the art and are taught in U.S. Pat. No. 5,210,015.

In yet another aspect, conventional hybridization assays using hybridization probes that share sequence homology with sequences associated with a signaling biochemical pathway can be performed. Typically, probes are allowed to form stable complexes with the sequences associated with a signaling biochemical pathway contained within the biological sample derived from the test subject in a hybridization reaction. It will be appreciated by one of skill in the art that where antisense is used as the probe nucleic acid, the target polynucleotides provided in the sample are chosen to be complementary to sequences of the antisense nucleic acids. Conversely, where the nucleotide probe is a sense nucleic acid, the target polynucleotide is selected to be complementary to sequences of the sense nucleic acid.

Hybridization can be performed under conditions of various stringency, for instance as described herein. Suitable hybridization conditions for the practice of the present invention are such that the recognition interaction between the probe and sequences associated with a signaling biochemical pathway is both sufficiently specific and sufficiently stable. Conditions that increase the stringency of a hybridization reaction are widely known and published in the art. See, for example, (Green and Sambrook, et al., (2014); Nonradioactive in Situ Hybridization Application Manual, Boehringer Mannheim, second edition). The hybridization assay can be formed using probes immobilized on any solid support, including but are not limited to nitrocellulose, glass, silicon, and a variety of gene arrays. A preferred hybridization assay is conducted on high-density gene chips as described in U.S. Pat. No. 5,445,934.

For a convenient detection of the probe-target complexes formed during the hybridization assay, the nucleotide probes are conjugated to a detectable label. Detectable labels suitable for use in the present invention include any composition detectable by photochemical, biochemical, spectroscopic, immunochemical, electrical, optical or chemical means. A wide variety of appropriate detectable labels are known in the art, which include fluorescent or chemiluminescent labels, radioactive isotope labels, enzymatic or other ligands. In preferred embodiments, one will likely desire to employ a fluorescent label or an enzyme tag, such as digoxigenin, .beta.-galactosidase, urease, alkaline phosphatase or peroxidase, avidin/biotin complex.

Detection methods used to detect or quantify the hybridization intensity will typically depend upon the label selected above. For example, radiolabels may be detected using photographic film or a phosphoimager. Fluorescent markers may be detected and quantified using a photodetector to detect emitted light. Enzymatic labels are typically detected by providing the enzyme with a substrate and measuring the reaction product produced by the action of the enzyme on the substrate; and finally colorimetric labels are detected by simply visualizing the colored label.

An agent-induced change in expression of sequences associated with a signaling biochemical pathway can also be determined by examining the corresponding gene products. Determining the protein level typically involves a) contacting the protein contained in a biological sample with an agent that specifically bind to a protein associated with a signaling biochemical pathway; and (b) identifying any agent:protein complex so formed. In one aspect of this embodiment, the agent that specifically binds a protein associated with a signaling biochemical pathway is an antibody, preferably a monoclonal antibody.

The reaction can be performed by contacting the agent with a sample of the proteins associated with a signaling biochemical pathway derived from the test samples under conditions that will allow a complex to form between the agent and the proteins associated with a signaling biochemical pathway. The formation of the complex can be detected directly or indirectly according to standard procedures in the art. In the direct detection method, the agents are supplied with a detectable label and unreacted agents may be removed from the complex; the amount of remaining label thereby indicating the amount of complex formed. For such method, it is preferable to select labels that remain attached to the agents even during stringent washing conditions. It is preferable that the label does not interfere with the binding reaction. In the alternative, an indirect detection procedure may use an agent that contains a label introduced either chemically or enzymatically. A desirable label generally does not interfere with binding or the stability of the resulting agent:polypeptide complex. However, the label is typically designed to be accessible to an antibody for an effective binding and hence generating a detectable signal.

A wide variety of labels suitable for detecting protein levels are known in the art. Non-limiting examples include radioisotopes, enzymes, colloidal metals, fluorescent compounds, bioluminescent compounds, and chemiluminescent compounds.

The amount of agent:polypeptide complexes formed during the binding reaction can be quantified by standard quantitative assays. As illustrated above, the formation of agent:polypeptide complex can be measured directly by the amount of label remained at the site of binding. In an alternative, the protein associated with a signaling biochemical pathway is tested for its ability to compete with a labeled analog for binding sites on the specific agent. In this competitive assay, the amount of label captured is inversely proportional to the amount of protein sequences associated with a signaling biochemical pathway present in a test sample.

A number of techniques for protein analysis based on the general principles outlined above are available in the art. They include but are not limited to radioimmunoassays, ELISA (enzyme linked immunoradiometric assays), “sandwich” immunoassays, immunoradiometric assays, in situ immunoassays (using e.g., colloidal gold, enzyme or radioisotope labels), western blot analysis, immunoprecipitation assays, immunofluorescent assays, and SDS-PAGE.

Antibodies that specifically recognize or bind to proteins associated with a signaling biochemical pathway are preferable for conducting the aforementioned protein analyses. Where desired, antibodies that recognize a specific type of post-translational modifications (e.g., signaling biochemical pathway inducible modifications) can be used. Post-translational modifications include but are not limited to glycosylation, lipidation, acetylation, and phosphorylation. These antibodies may be purchased from commercial vendors. For example, anti-phosphotyrosine antibodies that specifically recognize tyrosine-phosphorylated proteins are available from a number of vendors including Invitrogen and Perkin Elmer. Anti-phosphotyrosine antibodies are particularly useful in detecting proteins that are differentially phosphorylated on their tyrosine residues in response to an ER stress. Such proteins include but are not limited to eukaryotic translation initiation factor 2 alpha (eIF-2.alpha.). Alternatively, these antibodies can be generated using conventional polyclonal or monoclonal antibody technologies by immunizing a host animal or an antibody-producing cell with a target protein that exhibits the desired post-translational modification.

In practicing a subject method, it may be desirable to discern the expression pattern of an protein associated with a signaling biochemical pathway in different bodily tissue, in different cell types, and/or in different subcellular structures. These studies can be performed with the use of tissue-specific, cell-specific or subcellular structure specific antibodies capable of binding to protein markers that are preferentially expressed in certain tissues, cell types, or subcellular structures.

An altered expression of a gene associated with a signaling biochemical pathway can also be determined by examining a change in activity of the gene product relative to a control cell. The assay for an agent-induced change in the activity of a protein associated with a signaling biochemical pathway will dependent on the biological activity and/or the signal transduction pathway that is under investigation. For example, where the protein is a kinase, a change in its ability to phosphorylate the downstream substrate(s) can be determined by a variety of assays known in the art. Representative assays include but are not limited to immunoblotting and immunoprecipitation with antibodies such as anti-phosphotyrosine antibodies that recognize phosphorylated proteins. In addition, kinase activity can be detected by high throughput chemiluminescent assays such as AlphaScreen™ (available from Perkin Elmer) and eTag™ assay (Chan-Hui, et al. (2003) Clinical Immunology 111: 162-174).

Where the protein associated with a signaling biochemical pathway is part of a signaling cascade leading to a fluctuation of intracellular pH condition, pH sensitive molecules such as fluorescent pH dyes can be used as the reporter molecules. In another example where the protein associated with a signaling biochemical pathway is an ion channel, fluctuations in membrane potential and/or intracellular ion concentration can be monitored. A number of commercial kits and high-throughput devices are particularly suited for a rapid and robust screening for modulators of ion channels. Representative instruments include FLIPR™ (Molecular Devices, Inc.) and VIPR (Aurora Biosciences). These instruments are capable of detecting reactions in over 1000 sample wells of a microplate simultaneously, and providing real-time measurement and functional data within a second or even a minisecond.

In practicing any of the methods disclosed herein, a suitable vector can be introduced to a cell, tissue, organism, or an embryo via one or more methods known in the art, including without limitation, microinjection, electroporation, sonoporation, biolistics, calcium phosphate-mediated transfection, cationic transfection, liposome transfection, dendrimer transfection, heat shock transfection, nucleofection transfection, magnetofection, lipofection, impalefection, optical transfection, proprietary agent-enhanced uptake of nucleic acids, and delivery via liposomes, immunoliposomes, virosomes, or artificial virions. In some methods, the vector is introduced into an embryo by microinjection. The vector or vectors may be microinjected into the nucleus or the cytoplasm of the embryo. In some methods, the vector or vectors may be introduced into a cell by nucleofection.

A target polynucleotide of a targetable nuclease complex can be any polynucleotide endogenous or exogenous to the host cell. For example, the target polynucleotide can be a polynucleotide residing in the nucleus of the eukaryotic cell, the genome of a prokaryotic cell, or an extrachromosomal vector of a host cell. The target polynucleotide can be a sequence coding a gene product (e.g., a protein) or a non-coding sequence (e.g., a regulatory polynucleotide or a junk DNA).

Examples of target polynucleotides include a sequence associated with a signaling biochemical pathway, e.g., a signaling biochemical pathway-associated gene or polynucleotide. Examples of target polynucleotides include a disease associated gene or polynucleotide. A “disease-associated” gene or polynucleotide refers to any gene or polynucleotide which is yielding transcription or translation products at an abnormal level or in an abnormal form in cells derived from a disease-affected tissues compared with tissues or cells of a non-disease control. It may be a gene that becomes expressed at an abnormally high level; it may be a gene that becomes expressed at an abnormally low level, where the altered expression correlates with the occurrence and/or progression of the disease. A disease-associated gene also refers to a gene possessing mutation(s) or genetic variation that is directly responsible or is in linkage disequilibrium with a gene(s) that is responsible for the etiology of a disease. The transcribed or translated products may be known or unknown, and may be at a normal or abnormal level.

Embodiments of the invention also relate to methods and compositions related to knocking out genes, editing genes, altering genes, amplifying genes, and repairing particular mutations. Altering genes may also mean the epigenetic manipulation of a target sequence. This may be the chromatin state of a target sequence, such as by modification of the methylation state of the target sequence (i.e. addition or removal of methylation or methylation patterns or CpG islands), histone modification, increasing or reducing accessibility to the target sequence, or by promoting 3D folding. It will be appreciated that where reference is made to a method of modifying a cell, organism, or mammal including human or a non-human mammal or organism by manipulation of a target sequence in a genomic locus of interest, this may apply to the organism (or mammal) as a whole or just a single cell or population of cells from that organism (if the organism is multicellular). In the case of humans, for instance, Applicants envisage, inter alia, a single cell or a population of cells and these may preferably be modified ex vivo and then re-introduced. In this case, a biopsy or other tissue or biological fluid sample may be necessary. Stem cells are also particularly preferred in this regard. But, of course, in vivo embodiments are also envisaged. And the invention is especially advantageous as to HSCs.

The functionality of a targetable nuclease complex can be assessed by any suitable assay. For example, the components of a targetable nuclease system sufficient to form a targetable nuclease complex, including a guide nucleic acid and nucleic acid-guided nuclease, can be provided to a host cell having the corresponding target sequence, such as by transfection with vectors encoding the components of the engineered nuclease system, followed by an assessment of preferential cleavage within the target sequence. Similarly, cleavage of a target sequence may be evaluated in a test tube by providing the target sequence and components of a targetable nuclease complex. Other assays are possible, and will occur to those skilled in the art. A guide sequence can be selected to target any target sequence. In some embodiments, the target sequence is a sequence within a genome of a cell. Exemplary target sequences include those that are unique in the target genome.

Editing Cassette

Disclosed herein are compositions and methods for editing a target polynucleotide sequence. Such compositions include polynucleotides containing one or more components of targetable nuclease system. Polynucleotide sequences for use in these methods can be referred to as editing cassettes.

An editing cassette can comprise one or more primer sites. Primer sites can be used to amplify an editing cassette by using oligonucleotide primers comprising reverse complementary sequences that can hybridize to the one or more primer sites. An editing cassette can comprise two or more primer times. Sometimes, an editing cassette comprises a primer site on each end of the editing cassette, said primer sites flanking one or more of the other components of the editing cassette. Primer sites can be approximately 10, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26 or more nucleotides in length.

An editing cassette can comprise an editing template as disclosed herein. An editing cassette can comprise an editing sequence. An editing sequence can be homologous to a target sequence. An editing sequence can comprise at least one mutation relative to a target sequence. An editing sequence often comprises homology region (or homology arms) flanking at least one mutation relative to a target sequence, such that the flanking homology regions facilitate homologous recombination of the editing sequence into a target sequence. An editing sequence can comprise an editing template as disclosed herein. For example, the editing sequence can comprise at least one mutation relative to a target sequence including one or more PAM mutations that mutate or delete a PAM site. An editing sequence can comprise one or more mutations in a codon or non-coding sequence relative to a non-editing target site.

A PAM mutation can be a silent mutation. A silent mutation can be a change to at least one nucleotide of a codon relative to the original codon that does not change the amino acid encoded by the original codon. A silent mutation can be a change to a nucleotide within a non-coding region, such as an intron, 5′ untranslated region, 3′ untranslated region, or other non-coding region.

A PAM mutation can be a non-silent mutation. Non-silent mutations can include a missense mutation. A missense mutation can be when a change to at least one nucleotide of a codon relative to the original codon that changes the amino acid encoded by the original codon. Missense mutations can occur within an exon, open reading frame, or other coding region.

An editing sequence can comprise at least one mutation relative to a target sequence. A mutation can be a silent mutation or non-silent mutation, such as a missense mutation. A mutation can include an insertion of one or more nucleotides or base pairs. A mutation can include a deletion of one or more nucleotides or base pairs. A mutation can include a substitution of one or more nucleotides or base pairs for a different one or more nucleotides or base pairs. Inserted or substituted sequences can include exogenous or heterologous sequences.

An editing cassette can comprise a polynucleotide encoding a guide nucleic acid sequence. In some cases, the guide nucleic acid sequence is optionally operably linked to a promoter. A guide nucleic acid sequence can comprise a scaffold sequence and a guide sequence as described herein.

An editing cassette can comprise a barcode. A barcode can be a unique DNA sequence that corresponds to the editing sequence such that the barcode can identify the one or more mutations of the corresponding editing sequence. In some examples, the barcode is 15 nucleotides. The barcode can comprise less than 10, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 25, 30, 35, 40, 45, 50, 55, 60, 65, 70, 75, 80, 88, 90, 100, 110, 120, 130, 140, 150, 160, 170, 180, 190, 200, or more than 200 nucleotides. A barcode can be a non-naturally occurring sequence. An editing cassette comprising a barcode can be a non-naturally occurring sequence.

An editing cassette can comprise one or more of an editing sequence and a polynucleotide encoding a guide nucleic acid optionally operably linked to a promoter, wherein the editing cassette and guide nucleic acid sequence are flanked by primer sites. An editing cassette can further comprise a barcode.

An example of an editing cassette is depicted in FIG. 3. Each editing cassette can be designed to edit a site in a target sequence Sites to be targeted can be coding regions, non-coding regions, functionally neutral sites, or they can be a screenable or selectable marker gene. Homology regions within the editing sequence flank the one or more mutations of the editing cassette and can be inserted into the target sequence by recombination. Recombination can comprise DNA cleavage, such as by an nucleic acid-guided nuclease, and repair via homologous recombination.

Editing cassettes can be generated by chemical synthesis, Gibson assembly, SLIC, CPEC, PCA, ligation-free cloning, overlapping oligo extension, in vitro assembly, in vitro oligo assembly, PCR, traditional ligation-based cloning, other known methods in the art, or any combination thereof.

Trackable sequences, such as barcodes or recorder sequences, can be designed in silico via standard code with a degenerate mutation at the target codon. The degenerate mutation can comprise 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, or more than 30 nucleic acid residues. In some examples, the degenerate mutations can comprise 15 nucleic acid residues (N15).

Homology arms can be added to an editing sequence to allow incorporation of the editing sequence into the desired location via homologous recombination or homology-driven repair. Homology arms can be added by synthesis, in vitro assembly, PCR, or other known methods in the art. For example, chemical synthesis, Gibson assembly, SLIC, CPEC, PCA, ligation-free cloning, overlapping oligo extension, in vitro assembly, in vitro oligo assembly, PCR, traditional ligation-based cloning, other known methods in the art, or any combination thereof. A homology arm can be added to both ends of a barcode, recorder sequence, and/or editing sequence, thereby flanking the sequence with two distinct homology arms, for example, a 5′ homology arm and a 3′ homology arm.

A homology arm can comprise sequence homologous to a target sequence. A homology arm can comprise sequence homologous to sequence adjacent to a target sequence. A homology arm can comprise sequence homologous to sequence upstream or downstream of a target sequence. A homology arm can comprise sequence homologous to sequence within the same gene or open reading frame as a target sequence. A homology arm can comprise sequence homologous to sequence upstream or downstream of a gene or open reading frame the target sequence is within. A homology arm can comprise sequence homologous to a 5′ UTR or 3′ UTR of a gene or open reading frame within which is a target sequence. A homology arm can comprise sequence homologous to a different gene, open reading frame, promoter, terminator, or nucleic acid sequence than that which the target sequence is within.

The same 5′ and 3′ homology arms can be added to a plurality of distinct editing sequences, thereby generating a library of unique editing sequences that each have the same targeted insertion site. The same 5′ and 3′ homology arms can be added to a plurality of distinct editing templates, thereby generating a library of unique editing templates that each have the same targeted insertion site. In alternative examples, different or a variety of 5′ or 3′ homology arms can be added to a plurality of editing sequences or editing templates.

A barcode library or recorder sequence library comprising flanking homology arms can be cloned into a vector backbone. In some examples, the barcode comprising flanking homology arms are cloned into an editing cassette. Cloning can occur by chemical synthesis, Gibson assembly, SLIC, CPEC, PCA, ligation-free cloning, overlapping oligo extension, in vitro assembly, in vitro oligo assembly, PCR, traditional ligation-based cloning, other known methods in the art, or any combination thereof.

An editing sequence library comprising flanking homology arms can be cloned into a vector backbone. In some examples, the editing sequence and homology arms are cloned into an editing cassette. Editing cassettes can, in some cases, further comprise a nucleic acid sequence encoding a guide nucleic acid or gRNA engineered to target the desired site of editing sequence insertion, e.g. the target sequence. Editing cassettes can, in some cases, further comprise a barcode or recorder sequence. Cloning can occur by chemical synthesis, Gibson assembly, SLIC, CPEC, PCA, ligation-free cloning, overlapping oligo extension, in vitro assembly, in vitro oligo assembly, PCR, traditional ligation-based cloning, other known methods in the art, or any combination thereof.

Gene-wide or genome-wide editing libraries can be cloned into a vector backbone. A barcode or recorder sequence library can be inserted or assembled into a second site to generate competent trackable plasmids that can embed the recording barcode at a fixed locus while integrating the editing libraries at a wide variety of user defined sites. Cloning can occur by chemical synthesis, Gibson assembly, SLIC, CPEC, PCA, ligation-free cloning, overlapping oligo extension, in vitro assembly, in vitro oligo assembly, PCR, traditional ligation-based cloning, other known methods in the art, or any combination thereof.

A guide nucleic acid or sequence encoding the same can be assembled or inserted into a vector backbone first, followed by insertion of an editing sequence and/or cassette. In other cases, an editing sequence and/or cassette can be inserted or assembled into a vector backbone first, followed by insertion of a guide nucleic acid or sequence encoding the same. In other cases, guide nucleic acid or sequence encoding the same and an editing sequence and/or cassette are simultaneous inserted or assembled into a vector. A recorder sequence or barcode can be inserted before or after any of these steps. In other words, it should be understood that there are many possible permutations to the order in which elements of the disclosure are assembled. The vector can be linear or circular and can be generated by chemical synthesis, Gibson assembly, SLIC, CPEC, PCA, ligation-free cloning, overlapping oligo extension, in vitro assembly, in vitro oligo assembly, PCR, traditional ligation-based cloning, other known methods in the art, or any combination thereof.

A nucleic acid molecule can be synthesized which comprises one or more elements disclosed herein. A nucleic acid molecule can be synthesized that comprises an editing cassette. A nucleic acid molecule can be synthesized that comprises a guide nucleic acid. A nucleic acid molecule can be synthesized that comprises a recorder cassette. A nucleic acid molecule can be synthesized that comprises a barcode. A nucleic acid molecule can be synthesized that comprises a homology arm. A nucleic acid molecule can be synthesized that comprises an editing cassette and a guide nucleic acid. A nucleic acid molecule can be synthesized that comprises an editing cassette and a barcode. A nucleic acid molecule can be synthesized that comprises an editing cassette, a guide nucleic acid, and a recorder cassette. A nucleic acid molecule can be synthesized that comprises an editing cassette, a recorder cassette, and two guide nucleic acids. A nucleic acid molecule can be synthesized that comprises a recorder cassette and a guide nucleic acid. In any of these cases, the guide nucleic acid can optionally be operably linked to a promoter. In any of these cases, the nucleic acid molecule can further include one or more barcodes.

Synthesis can occur by any nucleic acid synthesis method known in the art. Synthesis can occur by enzymatic nucleic acid synthesis. Synthesis can occur by chemical synthesis. Synthesis can occur by array-based synthesis. Synthesis can occur by solid-phase synthesis or phosphoramidite methods. Synthesis can occur by column or multi-well methods. Synthesized nucleic acid molecules can be non-naturally occurring nucleic acid molecules.

Software and automation methods can be used for multiplex synthesis and generation. For example, software and automation can be used to create 10, 102, 103, 104, 105, 106, or more synthesized polynucleotides, cassettes, or plasmids. An automation method can generate desired sequences and libraries in rapid fashion that can be processed through a workflow with minimal steps to produce precisely defined libraries, such as gene-wide or genome-wide editing libraries.

Polynucleotides or libraries can be generated which comprise two or more nucleic acid molecules or plasmids comprising any combination disclosed herein of recorder sequence, editing sequence, guide nucleic acid, and optional barcode, including combinations of one or more of any of the previously mentioned elements. For example, such a library can comprise at least 2, 3, 4, 5, 10, 20, 30, 40, 50, 60, 70, 80, 90, 100, 200, 300, 400, 500, 600, 700, 800, 1000, 1500, 2000, 2500, 3000, 3500, 4000, 4500, 5000, 5500, 6000, 6500, 7000, 7500, 8000, 8500, 9000, 9500, 104, 105, 106, 107, 108, 109, 1010, or more nucleic acid molecules or plasmids of the present disclosure. It should be understood that such a library can include any number of nucleic acid molecules or plasmids, even if the specific number is not explicit listed above.

Trackable plasmid libraries or nucleic acid molecule libraries can be sequenced in order to determine the recorder sequence and editing sequence pair that is comprised on each trackable plasmid. In other cases, a known recorder sequence is paired with a known editing sequence during the library generation process. Other methods of determining the association between a recorder sequence and editing sequence comprised on a common nucleic acid molecule or plasmid are envisioned such that the editing sequence can be identified by identification or sequencing of the recorder sequence.

Methods and compositions for tracking edited episomal libraries that are shuttled between E. coli and other organisms/cell lines are provided herein. The libraries can be comprised on plasmids, Bacterial artificial chromosomes (BACs), Yeast artificial chromosomes (YACs), synthetic chromosomes, or viral or phage genomes. These methods and compositions can be used to generate portable barcoded libraries in host organisms, such as E. coli. Library generation in such organisms can offer the advantage of established techniques for performing homologous recombination. Barcoded plasmid libraries can be deep-sequenced at one site to track mutational diversity targeted across the remaining portions of the plasmid allowing dramatic improvements in the depth of library coverage.

Any nucleic acid molecule disclosed herein can be an isolated nucleic acid. Isolated nucleic acids may be made by any method known in the art, for example using standard recombinant methods, assembly methods, synthesis techniques, or combinations thereof. In some embodiments, the nucleic acids may be cloned, amplified, assembled, or otherwise constructed.

Isolated nucleic acids may be obtained from cellular, bacterial, or other sources using any number of cloning methodologies known in the art. In some embodiments, oligonucleotide probes which selectively hybridize, under stringent conditions, to other oligonucleotides or to the nucleic acids of an organism or cell can be used to isolate or identify an isolated nucleic acid.

Cellular genomic DNA, RNA, or cDNA may be screened for the presence of an identified genetic element of interest using a probe based upon one or more sequences. Various degrees of stringency of hybridization may be employed in the assay.

High stringency conditions for nucleic acid hybridization are well known in the art. For example, conditions may comprise low salt and/or high temperature conditions, such as provided by about 0.02 M to about 0.15 M NaCl at temperatures of about 500° C. to about 70° C. It is understood that the temperature and ionic strength of a desired stringency are determined in part by the length of the particular nucleic acid(s), the length and nucleotide content of the target sequence(s), the charge composition of the nucleic acid(s), and by the presence or concentration of formamide, tetramethylammonium chloride or other solvent(s) in a hybridization mixture. Nucleic acids may be completely complementary to a target sequence or may exhibit one or more mismatches.

Nucleic acids of interest may also be amplified using a variety of known amplification techniques. For instance, polymerase chain reaction (PCR) technology may be used to amplify target sequences directly from DNA, RNA, or cDNA. PCR and other in vitro amplification methods may also be useful, for example, to clone nucleic acid sequences, to make nucleic acids to use as probes for detecting the presence of a target nucleic acid in samples, for nucleic acid sequencing, or for other purposes.

Isolated nucleic acids may be prepared by direct chemical synthesis by methods such as the phosphotriester method, or using an automated synthesizer. Chemical synthesis generally produces a single stranded oligonucleotide. This may be converted into double stranded DNA by hybridization with a complementary sequence or by polymerization with a DNA polymerase using the single strand as a template.

Recorder

In some example, two editing cassettes can be used together to track a genetic engineering step. For example, one editing cassette can comprise an editing template and an encoded guide nucleic acid, and a second editing cassette, referred to as a recorder cassette, can comprise an editing template comprising a recorder sequence and an encoded nucleic acid which has a distinct guide sequence compared to that of the first editing cassette. In such cases, the editing sequence and the recorder sequence can be inserted into separate target sequences and determined by their corresponding guide nucleic acids. A recorder sequence can comprise a barcode, trackable or traceable sequence, and/or a regulatory element operable with a screenable or selectable marker.

Through a multiplexed cloning approach, the recorder cassette can be covalently coupled to at least one editing cassette in a plasmid (e.g., FIG. 17A, green cassette) to generate plasmid libraries that have a unique recorder and editing cassette combination. This library can be sequenced to generate the recorder/edit mapping and used to track editing libraries across large segments of the target DNA (e.g., FIG. 17C). Recorder and editing sequences can be comprised on the same cassette, in which case they are both incorporated into the target nucleic acid sequence, such as a genome or plasmid, by the same recombination event. In other examples, the recorder and editing sequences can be comprised on separate cassettes within the same plasmid, in which case the recorder and editing sequences are incorporated into the target nucleic acid sequence by separate recombination events, either simultaneously or sequentially.

Methods are provided herein for combining multiplex oligonucleotide synthesis with recombineering, to create libraries of specifically designed and trackable mutations. Screens and/or selections followed by high-throughput sequencing and/or barcode microarray methods can allow for rapid mapping of mutations leading to a phenotype of interest.

Methods and compositions disclosed herein can be used to simultaneously engineer and track engineering events in a target nucleic acid sequence.

Such plasmids can be generated using in vitro assembly or cloning techniques. For example, the plasmids can be generated using chemical synthesis, Gibson assembly, SLIC, CPEC, PCA, ligation-free cloning, other in vitro oligo assembly techniques, traditional ligation-based cloning, or any combination thereof.

Such plasmids can comprise at least one recording sequence, such as a barcode, and at least one editing sequence. In most cases, the recording sequence is used to record and track engineering events. Each editing sequence can be used to incorporate a desired edit into a target nucleic acid sequence. The desired edit can include insertion, deletion, substitution, or alteration of the target nucleic acid sequence. In some examples, the one or more recording sequence and editing sequences are comprised on a single cassette comprised within the plasmid such that they are incorporated into the target nucleic acid sequence by the same engineering event. In other examples, the recording and editing sequences are comprised on separate cassettes within the plasmid such that they are each incorporated into the target nucleic acid by distinct engineering events. In some examples, the plasmid comprises two or more editing sequences. For example, one editing sequence can be used to alter or silence a PAM sequence while a second editing sequence can be used to incorporate a mutation into a distinct sequence.

Recorder sequences can be inserted into a site separated from the editing sequence insertion site. The inserted recorder sequence can be separated from the editing sequence by 1 bp to 1 Mbp. For example, the separation distance can be about 1 bp, 10 bp, 50 bp, 100 bp, 500 bp, 1 kp, 2 kb, 5 kb, 10 kb, or greater. The separation distance can be any discrete integer between 1 bp and 10 Mbp. In some examples, the maximum distance of separation depends on the size of the target nucleic acid or genome.

Recorder sequences can be inserted adjacent to editing sequences, or within proximity to the editing sequence. For example, the recorder sequence can be inserted outside of the open reading frame within which the editing sequence is inserted. Recorder sequence can be inserted into an untranslated region adjacent to an open reading frame within which an editing sequence has been inserted. The recorder sequence can be inserted into a functionally neutral or non-functional site. The recorder sequence can be inserted into a screenable or selectable marker gene.

In some examples, the target nucleic acid sequence is comprised within a genome, artificial chromosome, synthetic chromosome, or episomal plasmid. In various examples, the target nucleic acid sequence can be in vitro or in vivo. When the target nucleic acid sequence is in vivo, the plasmid can be introduced into the host organisms by transformation, transfection, conjugation, biolistics, nanoparticles, cell-permeable technologies, or other known methods for DNA delivery, or any combination thereof. In such examples, the host organism can be a eukaryote, prokaryote, bacterium, archaea, yeast, or other fungi.

The engineering event can comprise recombineering, non-homologous end joining, homologous recombination, or homology-driven repair. In some examples, the engineering event is performed in vitro or in vivo.

The methods described herein can be carried out in any type of cell in which a targetable nuclease system can function (e.g., target and cleave DNA), including prokaryotic and eukaryotic cells. In some embodiments the cell is a bacterial cell, such as Escherichia spp. (e.g., E. coli). In other embodiments, the cell is a fungal cell, such as a yeast cell, e.g., Saccharomyces spp. In other embodiments, the cell is an algal cell, a plant cell, an insect cell, or a mammalian cell, including a human cell.

In some examples, the cell is a recombinant organism. For example, the cell can comprise a non-native targetable nuclease system. Additionally or alternatively, the cell can comprise recombination system machinery. Such recombination systems can include lambda red recombination system, Cre/Lox, attB/attP, or other integrase systems. Where appropriate, the plasmid can have the complementary components or machinery required for the selected recombination system to work correctly and efficiently.

Method for genome editing can comprise: (a) introducing a vector that encodes at least one editing cassette and at least one guide nucleic acid into a first population of cells, thereby producing a second population of cells comprising the vector; (b) maintaining the second population of cells under conditions in which a nucleic acid-guided nuclease is expressed or maintained, wherein the nucleic acid-guided nuclease is encoded on the vector, a second vector, on the genome of cells of the second population of cells, or otherwise introduced into the cell, resulting in DNA cleavage and incorporation of the editing cassette; (c) obtaining viable cells; and (d) sequencing the target DNA molecule in at least one cell of the second population of cells to identify the mutation of at least one codon.

A method for genome editing can comprise: (a) introducing a vector that encodes at least one editing cassette comprising a PAM mutation as disclosed herein and at least one guide nucleic acid into a first population of cells, thereby producing a second population of cells comprising the vector; (b) maintaining the second population of cells under conditions in which a nucleic acid-guided nuclease is expressed or maintained, wherein the nucleic acid-guided nuclease is encoded on the vector, a second vector, on the genome of cells of the second population of cells, or otherwise introduced into the cell, resulting in DNA cleavage, incorporation of the editing cassette, and death of cells of the second population of cells that do not comprise the PAM mutation, whereas cells of the second population of cells that comprise the PAM mutation are viable; (c) obtaining viable cells; and (d) sequencing the target DNA in at least one cell of the second population of cells to identify the mutation of at least one codon.

Method for trackable genome editing can comprise: (a) introducing a vector that encodes at least one editing cassette, at least one recorder cassette, and at least two guide nucleic acids into a first population of cells, thereby producing a second population of cells comprising the vector; (b) maintaining the second population of cells under conditions in which a nucleic acid-guided nuclease is expressed or maintained, wherein the nucleic acid-guided nuclease is encoded on the vector, a second vector, on the genome of cells of the second population of cells, or otherwise introduced into the cell, resulting in DNA cleavage and incorporation of the editing and recorder cassettes; (c) obtaining viable cells; and (d) sequencing the recorder sequence of the target DNA molecule in at least one cell of the second population of cells to identify the mutation of at least one codon.

In some examples where the plasmid comprises a second editing sequence designed to silence a PAM, a method for trackable genome editing can comprise: (a) introducing a vector that encodes at least one editing cassette, a recorder cassette, and at least two guide nucleic acids into a first population of cells, thereby producing a second population of cells comprising the vector; (b) maintaining the second population of cells under conditions in which a nucleic acid-guided nuclease is expressed or maintained, wherein the nucleic acid-guided nuclease is encoded on the vector, a second vector, on the genome of cells of the second population of cells, or otherwise introduced into the cell, resulting in DNA cleavage, incorporation of the editing and recorder cassettes, and death of cells of the second population of cells that do not comprise the PAM mutation, whereas cells of the second population of cells that comprise the PAM mutation are viable; (c) obtaining viable cells; and (d) sequencing the recorder sequence of the target DNA in at least one cell of the second population of cells to identify the mutation of at least one codon.

In some examples transformation efficiency is determined by using a non-targeting control guide nucleic acid, which allows for validation of the recombineering procedure and CFU/ng calculations. In some cases, absolute efficient is obtained by counting the total number of colonies on each transformation plate, for example, by counting both red and white colonies from a galK control. In some examples, relative efficiency is calculated by the total number of successful transformants (for example, white colonies) out of all colonies from a control (for example, galK control).

The methods of the disclosure can provide, for example, greater than 1000× improvements in the efficiency, scale, cost of generating a combinatorial library, and/or precision of such library generation.

The methods of the disclosure can provide, for example, greater than: 10×, 50×, 100×, 200×, 300×, 400×, 500×, 600×, 700×, 800×, 900×, 1000×, 1100×, 1200×, 1300×, 1400×, 1500×, 1600×, 1700×, 1800×, 1900×, 2000×, or greater improvements in the efficiency of generating genomic or combinatorial libraries.

The methods of the disclosure can provide, for example, greater than: 10×, 50×, 100×, 200×, 300×, 400×, 500×, 600×, 700×, 800×, 900×, 1000×, 1100×, 1200×, 1300×, 1400×, 1500×, 1600×, 1700×, 1800×, 1900×, 2000×, or greater improvements in the scale of generating genomic or combinatorial libraries.

The methods of the disclosure can provide, for example, greater than: 10×, 50×, 100×, 200×, 300×, 400×, 500×, 600×, 700×, 800×, 900×, 1000×, 1100×, 1200×, 1300×, 1400×, 1500×, 1600×, 1700×, 1800×, 1900×, 2000×, or greater decrease in the cost of generating genomic or combinatorial libraries.

The methods of the disclosure can provide, for example, greater than: 10×, 50×, 100×, 200×, 300×, 400×, 500×, 600×, 700×, 800×, 900×, 1000×, 1100×, 1200×, 1300×, 1400×, 1500×, 1600×, 1700×, 1800×, 1900×, 2000×, or greater improvements in the precision of genomic or combinatorial library generation.

Recursive Tracking for Combinatorial Engineering

Disclosed herein are methods and compositions for iterative rounds of engineering. Disclosed herein are recursive engineering strategies that allow implementation of CREATE recording at the single cell level through several serial engineering cycles (e.g., FIG. 18 and FIG. 19). These disclosed methods and compositions can enable search-based technologies that can effectively construct and explore complex genotypic space. The terms recursive and iterative can be used interchangeably.

Combinatorial engineering methods can comprise multiple rounds of engineering. Methods disclosed herein can comprise 2 or more rounds of engineering. For example, a method can comprise 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 20, 25, 30, or more than 30 rounds of engineering.

In some examples, during each round of engineering a new recorder sequence, such as a barcode, is incorporated at the same locus in nearby sites (e.g., FIG. 18, green bars or FIG. 19, black bars) such that following multiple engineering cycles to construct combinatorial diversity throughout the genome (e.g., FIG. 18, green bars or FIG. 19, grey bars) a simple PCR of the recording locus can be used to reconstruct each combinatorial genotype or to confirm that the engineered edit from each round has been incorporated into the target site.

Disclosed herein are methods for selecting for successive rounds of engineering. Selection can occur by a PAM mutation incorporated by an editing cassette. Selection can occur by a PAM mutation incorporated by a recorder cassette. Selection can occur using a screenable, selectable, or counter-selectable marker. Selection can occur by targeting a site for editing or recording that was incorporated by a prior round of engineering, thereby selecting for variants that successfully incorporated edits and recorder sequences from both rounds or all prior rounds of engineering.

Quantitation of these genotypes can be used for understanding combinatorial mutational effects on large populations and investigation of important biological phenomena such as epistasis.

Serial editing and combinatorial tracking can be implemented using recursive vector systems as disclosed herein. These recursive vector systems can be used to move rapidly through the transformation procedure. In some examples, these systems consist of two or more plasmids containing orthogonal replication origins, antibiotic markers, and an encoded guide nucleic acids. The encoded guide nucleic acid in each vector can be designed to target one of the other resistance markers for destruction by nucleic acid-guided nuclease-mediated cleavage. These systems can be used, in some examples, to perform transformations in which the antibiotic selection pressure is switched to remove the previous plasmid and drive enrichment of the next round of engineered genomes. Two or more passages through the transformation loop can be performed, or in other words, multiple rounds of engineering can be performed. Introducing the requisite recording cassettes and editing cassettes into recursive vectors as disclosed herein can be used for simultaneous genome editing and plasmid curing in each transformation step with high efficiencies.

In some examples, the recursive vector system disclosed herein comprises 2, 3, 4, 5, 6, 7, 8, 9, 10, or more than 10 unique plasmids. In some examples, the recursive vector system can use a particular plasmid more than once as long as a distinct plasmid is used in the previous round and in the subsequent round.

Recursive methods and compositions disclosed herein can be used to restore function to a selectable or screenable element in a targeted genome or plasmid. The selectable or screenable element can include an antibiotic resistance gene, a fluorescent gene, a unique DNA sequence or watermark, or other known reporter, screenable, or selectable gene. In some examples, each successive round of engineering can incorporate a fragment of the selectable or screenable element, such that at the end of the engineering rounds, the entire selectable or screenable element has been incorporated into the target genome or plasmid. In such examples, only those genome or plasmids which have successfully incorporated all of the fragments, and therefore all of the desired corresponding mutations, can be selected or screened for. In this way, the selected or screened cells will be enriched for those that have incorporated the edits from each and every iterative round of engineering.

Recursive methods can be used to switch a selectable or screenable marker between an on and an off position, or between an off and an on position, with each successive round of engineering. Using such a method allows conservation of available selectable or screenable markers by requiring, for example, the use of only one screenable or selectable marker. Furthermore, short regulatory sequence or start codon or non-start codons can be used to turn the screenable or selectable marker on and off. Such short sequences can easily fit within a synthesized cassette or polynucleotide.

One or more rounds of engineering can be performed using the methods and compositions disclosed herein. In some examples, each round of engineering is used to incorporate an edit unique from that of previous rounds. Each round of engineering can incorporate a unique recording sequence. Each round of engineering can result in removal or curing of the plasmid used in the previous round of engineering. In some examples, successful incorporation of the recording sequence of each round of engineering results in a complete and functional screenable or selectable marker or unique sequence combination.

Unique recorder cassettes comprising recording sequences such as barcodes or screenable or selectable markers can be inserted with each round of engineering, thereby generating a recorder sequence that is indicative of the combination of edits or engineering steps performed. Successive recording sequences can be inserted adjacent to one another. Successive recording sequences can be inserted within proximity to one another. Successive sequences can be inserted at a distance from one another.

Successive sequences can be inserted at a distance from one another. For example, successive recorder sequences can be inserted and separated by 0, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, or greater than 100 bp. In some examples, successive recorder sequences are separated by about 10, 50, 100, 150, 200, 250, 300, 350, 400, 450, 500, 550, 600, 650, 700, 750, 800, 850, 900, 950, 1000, 1100, 1200, 1300, 1400, 1500, or greater than 1500 bp.

Successive recorder sequences can be separated by any desired number of base pairs and can be dependent and limited on the number of successive recorder sequences to be inserted, the size of the target nucleic acid or target genomes, and/or the design of the desired final recorder sequence. For example, if the compiled recorder sequence is a functional screenable or selectable marker, than the successive recording sequences can be inserted within proximity and within the same reading frame from one another. If the compiled recorder sequence is a unique set of barcodes to be identified by sequencing and have no coding sequence element, then the successive recorder sequences can be inserted with any desired number of base pairs separating them. In these cases, the separation distance can be dependent on the sequencing technology to be used and the read length limit.

While preferred embodiments of the present invention have been shown and described herein, it will be obvious to those skilled in the art that such embodiments are provided by way of example only. Numerous variations, changes, and substitutions will now occur to those skilled in the art without departing from the invention. It should be understood that various alternatives to the embodiments of the invention described herein may be employed in practicing the invention. It is intended that the following claims define the scope of the invention and that methods and structures within the scope of these claims and their equivalents be covered thereby.

Some Definitions

As used herein the term “wild type” is a term of the art understood by skilled persons and means the typical form of an organism, strain, gene or characteristic as it occurs in nature as distinguished from mutant or variant forms.

As used herein the term “variant” should be taken to mean the exhibition of qualities that have a pattern that deviates from what occurs in nature.

The terms “orthologue” (also referred to as “ortholog” herein) and “homologue” (also referred to as “homolog” herein) are well known in the art. By means of further guidance, a “homologue” of a protein as used herein is a protein of the same species which performs the same or a similar function as the protein it is a homologue of. Homologous proteins may but need not be structurally related, or are only partially structurally related. An “orthologue” of a protein as used herein is a protein of a different species which performs the same or a similar function as the protein it is an orthologue of Orthologous proteins may but need not be structurally related, or are only partially structurally related. Homologs and orthologs may be identified by homology modelling (see, e.g., Greer, Science vol. 228 (1985) 1055, and Blundell et al. Eur J Biochem vol 172 (1988), 513) or “structural BLAST” (Dey F, Cliff Zhang Q, Petrey D, Honig B. Toward a “structural BLAST”: using structural relationships to infer function. Protein Sci. 2013 April; 22(4):359-66. doi: 10.1002/pro.2225.).

The terms “polynucleotide”, “nucleotide”, “nucleotide sequence”, “nucleic acid” and “oligonucleotide” are used interchangeably. They refer to a polymeric form of nucleotides of any length, either deoxyribonucleotides or ribonucleotides, or analogs thereof. Polynucleotides may have any three dimensional structure, and may perform any function, known or unknown. The following are non-limiting examples of polynucleotides: coding or non-coding regions of a gene or gene fragment, loci (locus) defined from linkage analysis, exons, introns, messenger RNA (mRNA), transfer RNA, ribosomal RNA, short interfering RNA (siRNA), short-hairpin RNA (shRNA), micro-RNA (miRNA), ribozymes, cDNA, recombinant polynucleotides, branched polynucleotides, plasmids, vectors, isolated DNA of any sequence, isolated RNA of any sequence, nucleic acid probes, and primers. The term also encompasses nucleic-acid-like structures with synthetic backbones, see, e.g., Eckstein, 1991; Baserga et al., 1992; Milligan, 1993; WO 97/03211; WO 96/39154; Mata, 1997; Strauss-Soukup, 1997; and Samstag, 1996. A polynucleotide may comprise one or more modified nucleotides, such as methylated nucleotides and nucleotide analogs. If present, modifications to the nucleotide structure may be imparted before or after assembly of the polymer. The sequence of nucleotides may be interrupted by non-nucleotide components. A polynucleotide may be further modified after polymerization, such as by conjugation with a labeling component.

“Complementarity” refers to the ability of a nucleic acid to form hydrogen bond(s) with another nucleic acid sequence by either traditional Watson-Crick base pairing or other non-traditional types. A percent complementarity indicates the percentage of residues in a nucleic acid molecule which can form hydrogen bonds (e.g., Watson-Crick base pairing) with a second nucleic acid sequence (e.g., 5, 6, 7, 8, 9, 10 out of 10 being 50%, 60%, 70%, 80%, 90%, and 100% complementary). “Perfectly complementary” means that all the contiguous residues of a nucleic acid sequence will hydrogen bond with the same number of contiguous residues in a second nucleic acid sequence. “Substantially complementary” as used herein refers to a degree of complementarity that is at least 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 97%, 98%, 99%, or 100% over a region of 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 30, 35, 40, 45, 50, or more nucleotides, or refers to two nucleic acids that hybridize under stringent conditions.

As used herein, “stringent conditions” for hybridization refer to conditions under which a nucleic acid having complementarity to a target sequence predominantly hybridizes with the target sequence, and substantially does not hybridize to non-target sequences. Stringent conditions are generally sequence-dependent, and vary depending on a number of factors. In general, the longer the sequence, the higher the temperature at which the sequence specifically hybridizes to its target sequence. Non-limiting examples of stringent conditions are described in detail in Tijssen (1993). Laboratory Techniques In Biochemistry And Molecular Biology-Hybridization With Nucleic Acid Probes Part I, Second Chapter “Overview of principles of hybridization and the strategy of nucleic acid probe assay”, Elsevier, N.Y. Where reference is made to a polynucleotide sequence, then complementary or partially complementary sequences are also envisaged. These are preferably capable of hybridising to the reference sequence under highly stringent conditions. Generally, in order to maximize the hybridization rate, relatively low-stringency hybridization conditions are selected: about 20 to 25 degrees Celsius. lower than the thermal melting point (Tm). The Tm is the temperature at which 50% of specific target sequence hybridizes to a perfectly complementary probe in solution at a defined ionic strength and pH. Generally, in order to require at least about 85% nucleotide complementarity of hybridized sequences, highly stringent washing conditions are selected to be about 5 to 15 degrees Celsius lower than the Tm. In order to require at least about 70% nucleotide complementarity of hybridized sequences, moderately-stringent washing conditions are selected to be about 15 to 30 degrees Celsius lower than the Tm. Highly permissive (very low stringency) washing conditions may be as low as 50 degrees Celsius below the Tm, allowing a high level of mis-matching between hybridized sequences. Those skilled in the art will recognize that other physical and chemical parameters in the hybridization and wash stages can also be altered to affect the outcome of a detectable hybridization signal from a specific level of homology between target and probe sequences.

“Hybridization” refers to a reaction in which one or more polynucleotides react to form a complex that is stabilized via hydrogen bonding between the bases of the nucleotide residues. The hydrogen bonding may occur by Watson Crick base pairing, Hoogstein binding, or in any other sequence specific manner. The complex may comprise two strands forming a duplex structure, three or more strands forming a multi stranded complex, a single self-hybridizing strand, or any combination of these. A hybridization reaction may constitute a step in a more extensive process, such as the initiation of PCR, or the cleavage of a polynucleotide by an enzyme. A sequence capable of hybridizing with a given sequence is referred to as the “complement” of the given sequence.

As used herein, the term “genomic locus” or “locus” (plural loci) is the specific location of a gene or DNA sequence on a chromosome. A “gene” refers to stretches of DNA or RNA that encode a polypeptide or an RNA chain that has functional role to play in an organism and hence is the molecular unit of heredity in living organisms. For the purpose of this invention it may be considered that genes include regions which regulate the production of the gene product, whether or not such regulatory sequences are adjacent to coding and/or transcribed sequences. Accordingly, a gene includes, but is not necessarily limited to, promoter sequences, terminators, translational regulatory sequences such as ribosome binding sites and internal ribosome entry sites, enhancers, silencers, insulators, boundary elements, replication origins, matrix attachment sites and locus control regions.

As used herein, “expression of a genomic locus” or “gene expression” is the process by which information from a gene is used in the synthesis of a functional gene product. The products of gene expression are often proteins, but in non-protein coding genes such as rRNA genes or tRNA genes, the product is functional RNA. The process of gene expression is used by all known life—eukaryotes (including multicellular organisms), prokaryotes (bacteria and archaea) and viruses to generate functional products to survive. As used herein “expression” of a gene or nucleic acid encompasses not only cellular gene expression, but also the transcription and translation of nucleic acid(s) in cloning systems and in any other context. As used herein, “expression” also refers to the process by which a polynucleotide is transcribed from a DNA template (such as into and mRNA or other RNA transcript) and/or the process by which a transcribed mRNA is subsequently translated into peptides, polypeptides, or proteins. Transcripts and encoded polypeptides may be collectively referred to as “gene product.” If the polynucleotide is derived from genomic DNA, expression may include splicing of the mRNA in a eukaryotic cell.

The terms “polypeptide”, “peptide” and “protein” are used interchangeably herein to refer to polymers of amino acids of any length. The polymer may be linear or branched, it may comprise modified amino acids, and it may be interrupted by non amino acids. The terms also encompass an amino acid polymer that has been modified; for example, disulfide bond formation, glycosylation, lipidation, acetylation, phosphorylation, or any other manipulation, such as conjugation with a labeling component. As used herein the term “amino acid” includes natural and/or unnatural or synthetic amino acids, including glycine and both the D or L optical isomers, and amino acid analogs and peptidomimetics.

As used herein, the term “domain” or “protein domain” refers to a part of a protein sequence that may exist and function independently of the rest of the protein chain.

As described in aspects of the invention, sequence identity is related to sequence homology. Homology comparisons may be conducted by eye, or more usually, with the aid of readily available sequence comparison programs. These commercially available computer programs may calculate percent (%) homology between two or more sequences and may also calculate the sequence identity shared by two or more amino acid or nucleic acid sequences. Sequence homologies may be generated by any of a number of computer programs known in the art, for example BLAST or FASTA, etc. A suitable computer program for carrying out such an alignment is the GCG Wisconsin Bestfit package (University of Wisconsin. U.S.A; Devereux et al., 1984, Nucleic Acids Research 12:387). Examples of other software than may perform sequence comparisons include, but are not limited to, the BLAST package (see Ausubel et al., 1999 ibid—Chapter 18), FASTA (Atschul et al., 1990, J. Mol. Biol., 403-410) and the GENEWORKS suite of comparison tools. Both BLAST and FASTA are available for offline and online searching (see Ausubel et al., 1999 ibid, pages 7-58 to 7-60). However it is preferred to use the GCG Bestfit program.

Percent homology may be calculated over contiguous sequences, i.e., one sequence is aligned with the other sequence and each amino acid or nucleotide in one sequence is directly compared with the corresponding amino acid or nucleotide in the other sequence, one residue at a time. This is called an “ungapped” alignment. Typically, such ungapped alignments are performed only over a relatively short number of residues.

Although this is a very simple and consistent method, it fails to take into consideration that, for example, in an otherwise identical pair of sequences, one insertion or deletion may cause the following amino acid residues to be put out of alignment, thus potentially resulting in a large reduction in % homology when a global alignment is performed. Consequently, most sequence comparison methods are designed to produce optimal alignments that take into consideration possible insertions and deletions without unduly penalizing the overall homology or identity score. This is achieved by inserting “gaps” in the sequence alignment to try to maximize local homology or identity.

However, these more complex methods assign “gap penalties” to each gap that occurs in the alignment so that, for the same number of identical amino acids, a sequence alignment with as few gaps as possible—reflecting higher relatedness between the two compared sequences—may achieve a higher score than one with many gaps. “Affinity gap costs” are typically used that charge a relatively high cost for the existence of a gap and a smaller penalty for each subsequent residue in the gap. This is the most commonly used gap scoring system. High gap penalties may, of course, produce optimized alignments with fewer gaps. Most alignment programs allow the gap penalties to be modified. However, it is preferred to use the default values when using such software for sequence comparisons. For example, when using the GCG Wisconsin Bestfit package the default gap penalty for amino acid sequences is −12 for a gap and −4 for each extension.

Calculation of maximum % homology therefore first requires the production of an optimal alignment, taking into consideration gap penalties. A suitable computer program for carrying out such an alignment is the GCG Wisconsin Bestfit package (Devereux et al., 1984 Nuc. Acids Research 12 p387). Examples of other software that may perform sequence comparisons include, but are not limited to, the BLAST package (see Ausubel et al., 1999 Short Protocols in Molecular Biology, 4th Ed.—Chapter 18), FASTA (Altschul et al., 1990 J. Mol. Biol. 403-410) and the GENEWORKS suite of comparison tools. Both BLAST and FASTA are available for offline and online searching (see Ausubel et al., 1999, Short Protocols in Molecular Biology, pages 7-58 to 7-60). However, for some applications, it is preferred to use the GCG Bestfit program. A new tool, called BLAST 2 Sequences is also available for comparing protein and nucleotide sequences (see FEMS Microbiol Lett. 1999 174(2): 247-50; FEMS Microbiol Lett. 1999 177(1): 187-8 and the website of the National Center for Biotechnology information at the website of the National Institutes for Health).

Although the final % homology may be measured in terms of identity, the alignment process itself is typically not based on an all-or-nothing pair comparison. Instead, a scaled similarity score matrix is generally used that assigns scores to each pair-wise comparison based on chemical similarity or evolutionary distance. An example of such a matrix commonly used is the BLOSUM62 matrix—the default matrix for the BLAST suite of programs. GCG Wisconsin programs generally use either the public default values or a custom symbol comparison table, if supplied (see user manual for further details). For some applications, it is preferred to use the public default values for the GCG package, or in the case of other software, the default matrix, such as BLOSUM62.

Alternatively, percentage homologies may be calculated using the multiple alignment feature in DNASIS™ (Hitachi Software), based on an algorithm, analogous to CLUSTAL (Higgins D G & Sharp P M (1988), Gene 73(1), 237-244). Once the software has produced an optimal alignment, it is possible to calculate % homology, preferably % sequence identity. The software typically does this as part of the sequence comparison and generates a numerical result.

Sequences may also have deletions, insertions or substitutions of amino acid residues which produce a silent change and result in a functionally equivalent substance. Deliberate amino acid substitutions may be made on the basis of similarity in amino acid properties (such as polarity, charge, solubility, hydrophobicity, hydrophilicity, and/or the amphipathic nature of the residues) and it is therefore useful to group amino acids together in functional groups. Amino acids may be grouped together based on the properties of their side chains alone. However, it is more useful to include mutation data as well. The sets of amino acids thus derived are likely to be conserved for structural reasons. These sets may be described in the form of a Venn diagram (Livingstone C. D. and Barton G. J. (1993) “Protein sequence alignments: a strategy for the hierarchical analysis of residue conservation” Comput. Appl. Biosci. 9: 745-756) (Taylor W. R. (1986) “The classification of amino acid conservation” J. Theor. Biol. 119; 205-218). Conservative substitutions may be made, for example according to the table below which describes a generally accepted Venn diagram grouping of amino acids.

Embodiments of the invention include sequences (both polynucleotide or polypeptide) which may comprise homologous substitution (substitution and replacement are both used herein to mean the interchange of an existing amino acid residue or nucleotide, with an alternative residue or nucleotide) that may occur i.e., like-for-like substitution in the case of amino acids such as basic for basic, acidic for acidic, polar for polar, etc. Non-homologous substitution may also occur i.e., from one class of residue to another or alternatively involving the inclusion of unnatural amino acids such as ornithine (hereinafter referred to as Z), diaminobutyric acid ornithine (hereinafter referred to as B), norleucine ornithine (hereinafter referred to as O), pyridylalanine, thienylalanine, naphthylalanine and phenylglycine.

Variant amino acid sequences may include suitable spacer groups that may be inserted between any two amino acid residues of the sequence including alkyl groups such as methyl, ethyl or propyl groups in addition to amino acid spacers such as glycine or .beta.-alanine residues. A further form of variation, which involves the presence of one or more amino acid residues in peptoid form, may be well understood by those skilled in the art. For the avoidance of doubt, “the peptoid form” is used to refer to variant amino acid residues wherein the .alpha.-carbon substituent group is on the residue's nitrogen atom rather than the .alpha.-carbon. Processes for preparing peptides in the peptoid form are known in the art, for example Simon R J et al., PNAS (1992) 89(20), 9367-9371 and Horwell D C, Trends Biotechnol. (1995) 13(4), 132-134.

The practice of the present invention employs, unless otherwise indicated, conventional techniques of immunology, biochemistry, chemistry, molecular biology, microbiology, cell biology, genomics and recombinant DNA, which are within the skill of the art. See Green and Sambrook, (Molecular Cloning: A Laboratory Manual. 4th, ed., Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y., 2014); CURRENT PROTOCOLS IN MOLECULAR BIOLOGY (F. M. Ausubel, et al. eds., (2017)); Short Protocols in Molecular Biology, (Ausubel et al., 1999)); the series METHODS IN ENZYMOLOGY (Academic Press, Inc.): PCR 2: A PRACTICAL APPROACH (M. J. MacPherson, B. D. Hames and G. R. Taylor eds. (1995)), ANTIBODIES, A LABORATORY MANUAL, SECOND EDITION (Harlow and Lane, eds. (2014) and CULTURE OF ANIMAL CELLS: A MANUAL BASIC TECHNIQUE, 7TH EDITION (R. I. Freshney, ed. (2016)).

EXAMPLES

The following examples are given for the purpose of illustrating various embodiments of the invention and are not meant to limit the present invention in any fashion. The present examples, along with the methods described herein are presently representative of preferred embodiments, are exemplary, and are not intended as limitations on the scope of the invention. Changes therein and other uses which are encompassed within the spirit of the invention as defined by the scope of the claims will occur to those skilled in the art.

Example 1. Nucleic Acid-Guided Nucleases

Sequences for twenty nucleic acid guided nucleases, termed MAD1-MAD20 (SEQ ID NOs 1-20), were aligned and compared to other nucleic acid guided nucleases. A partial alignment and phylogenetic tree are depicted in FIG. 1A and FIG. 1B respectively. Key residues in that may be involved in the recognition of a PAM site are shown in FIG. 1A. These include amino acids at positions 167, 539, 548, 599, 603, 604, 605, 606, and 607.

Sequence alignments were built using PSI-BLAST to search for MAD nuclease homologs in the NCBI non-redundant databases. Multiple sequence alignments were further refined using the MUSCLE alignment algorithm with default settings as implemented in Geneious 10. The percent identity of each homolog to SpCas9 and AsCpf 1 reference sequences were computed based on the pairwise alignment matching from these global alignments.

Genomic source sequences were identified using Uniprot linkage information or TBLASTN searches of NCBI using the default parameters and searching all possible frames for translational matches.

Percent identities of MAD1-8 and 10-12 to other various nuclease are summarized in Table 1. These percent identities represent the shared amino acid sequence identity between the indicated proteins.

TABLE 1
Protein identifier or accession number MAD1 MAD2 MAD3 MAD4 MAD5 MAD6 MAD7 MAD8 MAD10 MAD11 MAD12
gi|1025734861|pdb|5B43|A 6.4 32.8 33.2 29.7 29.4 31.1 30.3 31.7 26.7 27.9 98.8
gi|1052245173|pdb|5KK5|A 6.4 32.7 33.1 29.7 29.3 31 30.3 31.7 26.7 27.8 98.7
gi|1086216683|emb|SDC16215.1| 6.1 33 34.4 29.6 30.1 33.5 32.3 32.1 26.2 27.2 46.8
gi|1120175333|ref|WP_073043853.1| 5.9 30.9 37.2 32.8 33.6 34.4 35.7 35.1 26.3 28.3 34.9
Cpf1.Sj|WP_081839471 6.6 33.6 41.7 37.2 33.4 37.6 40.1 37.7 29.1 30.3 34.1
Cpf1.Ss|KFO67989 6.9 32.3 35.7 43 33.7 45.9 34.8 48 33.2 33.4 33.8
MAD3 5.8 31 100 32.9 35.9 35 35.6 34.3 28 27.6 33.1
gi|1082474576|gb|OFY19591.1| 7 31.4 35.9 43.2 31.4 45 33.6 48.6 30.8 33.5 33
MAD2 6.1 100 31 30.7 30.2 31 31.2 31.2 25.8 27.7 32.6
Cpf1.Lb5|WP_016301126 7.8 32.8 36.5 38.2 34.2 45.5 35.8 43.6 30.7 35.7 32.5
gi|1088286736|gb|OHB41002.1| 6.7 30.6 35.3 42.4 33.2 44.7 32.1 46.8 30.7 32.6 32.4
gi|1094423310|emb|SER03894.1| 6.8 30.8 36.1 40.4 31.8 50.4 35.2 46.6 30.4 36.8 32.3
gi|493326531|ref|WP_006283774.1| 6.8 30.8 36.1 40.3 31.8 50.3 35.1 46.6 30.4 36.8 32.3
MAD8 7.6 31.2 34.3 40.4 32 41.6 32.8 100 30.1 32.1 31.7
Cpf1.Bot|WP_009217842 6.9 30.1 36.6 41.5 32.5 50.2 35.4 45.5 29.8 34.1 31.6
Cpf1.Li|WP_020988726 7.3 30.2 34.6 39.3 30.3 40.7 31.8 39.4 32.1 31.3 31.5
Cpf1.Pb|WP_044110123 6.3 31.4 31.8 36.1 30.8 45.7 30.4 39.4 27.7 33.5 31.5
gi|817911372|gb|AKG08867.1| 7.3 29.8 35 40.7 32.1 40.3 32.6 41.7 29.1 31 31.4
gi|1052838533|emb|SCH45297.1| 6.6 30.8 35.5 32 31.5 34.4 51.9 33.4 26.1 29 31.3
gi|1053713332|ref|WP_066040075.1| 7.2 29.6 33.2 39.6 29.8 49.1 32.2 41.4 30.1 32.4 31.3
gi|817909002|gb|AKG06878.1| 7.3 29.8 35 40.7 32 40.3 32.5 41.6 29.1 30.9 31.3
gi|1042201477|ref|WP_065256572.1| 7.2 29.5 35.2 40.6 31.9 40.1 32.7 41.6 29 30.8 31.2
MAD6 7.5 31 35 38.9 33.1 100 34.3 41.6 30.5 33.6 31
gi|490468773|ref|WP_004339290.1| 6.8 31.8 31.7 36.2 28.6 36.5 31.4 38.4 28.5 31.4 31
gi|565853704|ref|WP_023936172.1| 7.5 30.8 34.9 38.9 33.1 99.7 34.1 41.6 30.4 33.6 31
gi|739005707|ref|WP_036887416.1| 7.5 30.9 35 38.9 33 99.9 34.2 41.5 30.4 33.5 31
gi|739008549|ref|WP_036890108.1| 7.5 31 35 38.8 33 99.8 34.2 41.5 30.4 33.5 31
Cpf1.Ft|WP_014550095 7.1 31.9 33.8 40.3 29.7 39.4 34.1 41 29.8 32.5 30.8
gi|0504362993|ref|WP_014550095.1| 7.2 32.4 33.8 40.3 29.6 39.4 33.8 40.9 30.1 32.5 30.8
gi|0640557447|ref|WP_024988992.1| 6.6 31.4 34.8 40.7 31.2 48 34.1 45.1 28.8 35.2 30.8
gi|1098944113|ref|WP_071304624.1| 7.1 32.3 33.5 40.3 29.6 39.2 33.8 40.9 30.1 32.5 30.6
gi|0489124848|ref|WP_003034647.1| 7.1 32.3 33.9 40.9 29.9 39.2 33.9 40.9 29.9 32.2 30.6
gi|738967776|ref|WP_036851563.1| 6.8 29.4 33.1 35.5 28.9 40.3 30.7 35.9 28.7 31.3 30.5
MAD7 5.9 31.2 35.6 30.8 33.9 34.3 100 32.8 24.2 28.9 30.5
Cpf1.Lb6|WP_044910713 6.7 29.8 33.7 36.6 30.9 43 34 39.8 29.1 32.1 30.4
gi|1052961977|emb|SCH47915.1| 5.5 30.5 35.8 32.3 34 35 53.8 33.4 26.2 27.4 30.4
gi|817918353|gb|AKG14689.1| 7 29.1 34.4 39.8 31.7 40 32.4 41.1 28.4 30.1 30.3
gi|917059416|ref|WP_051666128.1| 6.9 29.9 31.5 35.7 31.6 41.8 32.9 39.1 30.1 34 30.2
gi|1011649201|ref|WP_062499108.1| 6.8 29 34.7 40.3 31.4 40.1 33.1 41.6 28.5 30.4 30.1
Cpf1.Pm|WP_018359861 6.3 29.2 32.3 34.2 27.4 38.7 29.4 35 27.2 30.1 30
gi|817922537|gb|AKG18099.1| 6.8 29.1 34.5 39.6 31.5 39.9 32.7 40.7 28.3 29.8 30
gi|769142322|ref|WP_044919442.1| 6.7 31 34.6 37.8 31.5 41.4 33.3 39.2 28 31.9 29.9
gi|1023176441|pdb|5ID6|A 6.7 29.7 31.3 35.5 31.3 41 32.6 38.5 29.7 33.3 29.8
gi|0491540987|ref|WP_005398606.1| 5.9 28.3 30.4 29.7 28.5 29 30.7 29.8 25.8 27.8 29.8
gi|652820612|ref|WP_027109509.1| 6.4 31.1 34 35.3 31.7 40.3 33.4 37.5 28.5 33.3 29.8
gi|502240446|ref|WP_012739647.1| 5.9 31.6 36.1 31.2 33 35.4 49.4 34 26.6 29.4 29.7
gi|524278046|emb|CDA41776.1| 5.8 31.6 36 31 33 35.4 50 34 26.6 29.5 29.7
gi|737831580|ref|WP_035798880.1| 6.2 31.3 34.8 38.1 31.5 42.1 33 39.6 28.4 32.4 29.7
gi|909652572|ref|WP_049895985.1| 6.9 30.7 34.2 37.2 30.8 41.5 34.2 38.7 28 32 29.7
MAD4 6.7 30.7 32.9 100 30.7 38.9 30.8 40.4 28.8 29.4 29.7
gi|942073049|ref|WP_055286279.1| 5.9 31.6 36.1 31.1 32.7 35 49.7 33.9 27.1 29.5 29.6
gi|654794505|ref|WP_028248456.1| 7.4 30.5 35.9 37.4 31.3 42.8 34.2 40.2 27.9 33.5 29.5
gi|933014786|emb|CUO47728.1| 5.6 31.3 34.9 31.2 31.5 32.4 46.7 30.6 25.4 27.7 29.4
gi|941887450|ref|WP_055224182.1| 5.6 31.4 35 31.3 31.6 32.5 46.6 30.7 25.3 27.8 29.4
gi|920071674|ref|WP_052943011.1| 6.3 31 31.8 38.8 31.8 41.3 33.8 42.6 29.8 34.7 29
MAD5 5.1 30.2 35.9 30.7 100 33.1 33.9 32 24.3 28.7 29
gi|1081462674|emb|SCZ76797.1| 6.9 30.4 33.5 34.7 29.7 40.1 30.5 37.4 27.3 32.5 28.9
gi|918722523|ref|WP_052585281.1| 7.4 27.5 30.5 35.7 28.3 35.2 28.5 36 26 27.1 28.8
gi|524816323|emb|CDF09621.1| 6.2 30 34.1 29.3 31.2 32.7 47.6 32.2 25.5 25.9 28.4
gi|941782328|ref|WP_055176369.1| 6.2 30.2 33.1 28.9 30.9 32 46.9 32.1 26 27.1 28.4
gi|942113296|ref|WP_055306762.1| 6.4 29.8 33.8 29.7 31.3 33.1 48 32.5 25.8 26.2 28.4
MAD11 6.4 27.7 27.6 29.4 28.7 33.6 28.9 32.1 26.2 100 27.8
gi|653158548|ref|WP_027407524.1| 5.9 26.4 28.1 33.5 27.4 32.5 27.8 32 27 26.8 27.6
gi|652963004|ref|WP_027216152.1| 6.6 30.3 32.5 33.2 30.4 38.2 29.6 34.6 25.9 30.5 27.2
gi|1083069650|gb|OGD68774.1| 6.2 25 24.3 26.6 23.1 28.1 23.2 26.4 45 24.9 27.1
gi|302483275|gb|EFL46285.1| 5.6 24.7 26.8 30.3 24.9 34.8 26 30.4 24.4 27.5 27.1
gi|915400855|ref|WP_050786240.1| 5.6 24.7 26.8 30.3 24.9 34.8 26 30.4 24.4 27.5 27.1
MAD10 5.6 25.8 28 28.8 24.3 30.5 24.2 30.1 100 26.2 26.6
gi|1101117967|gb|OIO75780.1| 6.1 26.8 26 27.3 24.3 28.1 24.4 28.2 44.1 25.4 26.1
g|11088204458|gb|OHA63117.1| 6.5 25.2 23.5 25.8 22.9 27 22 26.1 36.5 24.2 24.7
gi|809198071|ref|WP_046328599.1| 4.9 25.6 26.5 22.2 23.9 23.8 25.8 23.9 20.3 25.1 24
gi|1088079929|gb|OGZ45678.1| 5.6 21.9 23.8 26.9 23.4 27.8 23.3 26.7 28.8 24.7 23.5
gi|1101053499|gb|OIO15737.1| 5.9 23.1 26.2 25.2 23 26.4 25.1 26.5 29.2 23.2 23.4
gi|1101058058|gb|OIO19978.1| 5.4 21.2 22.8 23.6 20.6 25 20.7 25 25.9 22.2 23
gi|1088000848|gb|OGY73485.1| 5.7 23.5 25.2 25.5 23.9 27 25.1 25.6 31.6 23.6 22.9
gi|407014433|gb|EKE28449.1| 5.2 23.5 25.9 26.7 24.3 25.8 23 27.8 29.9 25.3 22.9
gi|818249855|gb|KKP36646.1| 6 21 20.7 23.5 20 24.2 21 24 24.6 21.8 22.6
gi|818703647|gb|KKT48220.1| 5.8 23.3 25 25.1 23.5 26.5 24.7 25.3 31.2 23.3 22.6
gi|818705786|gb|KKT50231.1| 5.8 23.1 24.6 24.7 22.9 26.2 24.2 24.8 30.8 22.9 22.2
gi|1083950632|gb|OGJ66851.1| 4.5 20 22.1 23.5 20.6 24.6 20 24 23.5 20.7 22.1
gi|1083932199|gb|OGJ49885.1| 6 20.4 20.2 22.6 19.3 23.3 20.6 23.2 23.9 21 21.8
gi|1083410735|gb|OGF20863.1| 5 21.7 23.3 25.5 23 25 22.7 25.9 27.2 22.4 21.5
gi|1011480927|ref|WP_062376669.1| 4.7 20.1 20.1 21.4 19.3 23.3 21.4 22 20.2 19.7 20.9
gi|818539593|gb|KKR91555.1| 5.1 19.8 21.6 22.1 20.5 22.9 21.2 22.8 24 20.5 19.9
gi|503048015|ref|WP_013282991.1| 5.1 18.8 20.7 15.3 19.7 18.9 19.3 17.7 15.9 19 19.2
gi|1096232746|ref|WP_071177645.1| 5 19.1 20.5 17.4 20.1 19.7 20.4 20.4 17.5 18.5 18.9
gi|769130404|ref|WP_044910712.1| 4.6 19.4 18.2 16.1 18.1 17.1 18.7 17.9 14.5 16.8 17.5
gi|1085569500|gb|OGX23684.1| 2.6 11.6 12.1 12.7 10.2 12.1 12.7 11.6 10.9 11.1 10.5
gi|818357062|gb|KKQ38176.1| 3.3 10 11.1 10.6 11.1 11.8 12.1 11.5 12.2 10.8 9.8
gi|745626763|gb|KIE18642.1| 3.7 9.4 11.7 11.1 11.1 12.5 11.9 11.9 10.2 10.6 8.8
MAD1 100 6.1 5.8 6.7 5.1 7.5 5.9 7.6 5.6 6.4 6.4
SpCas9 4 6.3 6.5 8.3 5.6 8.1 6.9 7.7 6.9 6.3 6.3
MAD12 6.4 32.6 33.1 29.7 29 31 30.5 31.7 26.6 27.8 100

Example 2: Expression of MAD Nucleases

Wild-type nucleic acid sequences for MAD1-MAD20 include SEQ ID NOs 21-40, respectively. These MAD nucleases were codon optimized for expression in E. coli and the codon optimized sequences are listed as SEQ ID NO: 41-60, respectively (summarized in Table 2).

Codon optimized MAD1-MAD20 were cloned into an expression construct comprising a constitutive or inducible promoter (eg., proB promoter SEQ ID NO: 83, or pBAD promoter SEQ ID NO: 81 or SEQ ID NO: 82) and an optional 6×-His tag (eg., FIG. 2). The generated MAD1-MAD2 expression constructs are provided as SEQ ID NOs: 61-80, respectively. The expression constructs as depicted in FIG. 2 were generated either by restriction/ligation-based cloning or homology-based cloning.

Example 3. Testing Guide Nucleic Acid Sequences Compatible with MAD Nucleases

In order to have a functioning targetable nuclease complex, a nucleic acid-guided nuclease and a compatible guide nucleic acid is needed. To determine the compatible guide nucleic acid sequence, specifically the scaffold sequence portion of the guide nucleic acid, multiple approaches were taken. First, scaffold sequences were looked for near the endogenous loci of each MAD nuclease. In some cases, such as with MAD2, no endogenous scaffold sequence was found. Therefore, we tested the compatibility of MAD2 with scaffold sequences found near the endogenous loci of the other MAD nucleases. A list of the MAD nucleases and corresponding endogenous scaffold sequences that were tested is listed in Table 2.

TABLE 2
Endogenous
Codon optimized scaffold sequence
WT nucleic acid nucleic acid Amino acid for guide nucleic
MAD nuclease sequence sequence sequence acid
MAD1 SEQ ID NO: 21 SEQ ID NO: 41 SEQ ID NO: 1 SEQ ID NO: 84
MAD2 SEQ ID NO: 22 SEQ ID NO: 42 SEQ ID NO: 2 None identified
MAD3 SEQ ID NO: 23 SEQ ID NO: 43 SEQ ID NO: 3 SEQ ID NO: 86
MAD4 SEQ ID NO: 24 SEQ ID NO: 44 SEQ ID NO: 4 SEQ ID NO: 87
MAD5 SEQ ID NO: 25 SEQ ID NO: 45 SEQ ID NO: 5 SEQ ID NO: 88
MAD6 SEQ ID NO: 26 SEQ ID NO: 46 SEQ ID NO: 6 SEQ ID NO: 89
MAD7 SEQ ID NO: 27 SEQ ID NO: 47 SEQ ID NO: 7 SEQ ID NO: 90
MAD8 SEQ ID NO: 28 SEQ ID NO: 48 SEQ ID NO: 8 SEQ ID NO: 91
MAD9 SEQ ID NO: 29 SEQ ID NO: 49 SEQ ID NO: 9 SEQ ID NO: 92;
SEQ ID NO: 103;
SEQ ID NO: 106
MAD10 SEQ ID NO: 30 SEQ ID NO: 50 SEQ ID NO: 10 SEQ ID NO: 93
MAD11 SEQ ID NO: 31 SEQ ID NO: 51 SEQ ID NO: 11 SEQ ID NO: 94
MAD12 SEQ ID NO: 32 SEQ ID NO: 52 SEQ ID NO: 12 SEQ ID NO: 95
MAD13 SEQ ID NO: 33 SEQ ID NO: 53 SEQ ID NO: 13 SEQ ID NO: 96;
SEQ ID NO: 105;
SEQ ID NO: 107
MAD14 SEQ ID NO: 34 SEQ ID NO: 54 SEQ ID NO: 14 SEQ ID NO: 97
MAD15 SEQ ID NO: 35 SEQ ID NO: 55 SEQ ID NO: 15 SEQ ID NO: 98
MAD16 SEQ ID NO: 36 SEQ ID NO: 56 SEQ ID NO: 16 SEQ ID NO: 99
MAD17 SEQ ID NO: 37 SEQ ID NO: 57 SEQ ID NO: 17 SEQ ID NO: 100
MAD18 SEQ ID NO: 38 SEQ ID NO: 58 SEQ ID NO: 18 SEQ ID NO: 101
MAD19 SEQ ID NO: 39 SEQ ID NO: 59 SEQ ID NO: 19 SEQ ID NO: 102
MAD20 SEQ ID NO: 40 SEQ ID NO: 60 SEQ ID NO: 20 SEQ ID NO: 103

Editing cassettes as depicted in FIG. 3 were generated to assess the functionality of the MAD nucleases and corresponding guide nucleic acids. Each editing cassette comprises an editing sequence and a promoter operably linked to an encoded guide nucleic acid. The editing cassettes further comprises primer sites (P1 and P2) on flanking ends. The guide nucleic acids comprised various scaffold sequences to be tested, as well as a guide sequence to guide the MAD nuclease to the target sequence for editing. The editing sequences comprised a PAM mutation and/or codon mutation relative to the target sequence. The mutations were flanked by regions of homology (homology arms or HA) which would allow recombination into the cleaved target sequence. (agcagctttatcatctgccg (SEQ ID No: 183); QQLYHLP (SEQ ID No: 184); agcagttataataactgccg (SEQ ID No: 186; and QQLLP (SEQ ID No: 206)

FIG. 4 depicts an experimental designed to test different MAD nuclease and guide nucleic acid combinations. An expression cassette encoding the MAD nuclease or the MAD nuclease protein were added to host cells along with various editing cassettes as described above. In this example, the guide nucleic acids were engineered to target the galK gene in the host cell, and the editing sequence was designed to mutate the targeted galK gene in order to turn the gene off, thereby allowing for screening of successfully edited cells. This design was used for identification of functional or compatible MAD nuclease and guide nucleic acid combinations. Editing efficiency was determined by qPCR to measure the editing plasmid in the recovered cells in a high-throughput manner. Validation of MAD11 and Cas9 primers is shown in FIGS. 14A and 14B. These results show that the selected primer pairs are orthogonal and allow quantitative measurement of input plasmid DNA

FIGS. 5A-5B is a depiction of a similar experimental design. In this case, the editing cassette (FIG. 5B) further comprises a selectable marker, in this case kanamycin resistance (kan) and the MAD nuclease expression vector (FIG. 5A) further comprises a selectable marker, in this case chloramphenicol resistance (Cm), and the lambda RED recombination system to aid homologous recombination (HR) of the editing sequence into the target sequence. A compatible MAD nuclease and guide nucleic acid combination will cause a double strand break in the target sequence if a PAM sequence is present. Since the editing sequence (eg. FIG. 3) contains a PAM mutation that is not recognized by the MAD nuclease, edited cells that contain the PAM mutation survive cleavage by the MAD nuclease, while wild-type non-edited cells die (FIG. 5C). The editing sequence further comprises a mutation in the galK gene that allows for screening of edited cells, while the MAD nuclease expression vector and editing cassette contain drug selection markers, allowing for selection of edited cells.

Using these methods, compatible guide nucleic acids for MAD1-MAD20 were tested. Twenty scaffold sequences were tested. The guide nucleic acids used in the experiments contained one of the twenty scaffold sequences, referred to as scaffold-1, scaffold-2, etc., and a guide sequence that targets the galK gene. Sequences for Scaffold-1 through Scaffold-20 are listed as SEQ ID NO: 84-103, respectively. It should be understood that the guide sequence of the guide nucleic acid is variable and can be engineered or designed to target any desired target sequence. Since MAD2 does not have an endogenous scaffold sequence to test, a scaffold sequence from a close homology (scaffold-2, SEQ ID NO: 85) was tested and found to be a non-functional pair, meaning MAD2 and scaffold-2 were not compatible. Therefore, MAD2 was tested with the other nineteen scaffold sequences, despite the low sequence homology between MAD2 and the other MAD nucleases.

This workflow could also be used to identify or test PAM sequences compatible with a given MAD nuclease. Another method for identifying a PAM site is described in the next example.

In general, for the assays described, transformations were carried out as follows. E. coli strains expressing the codon optimized MAD nucleases were grown overnight. Saturated cultures were diluted 1/100 and grown to an OD600 of 0.6 and induced by adding arabinose at a filing concentration of 0.4% and (if a temperature sensitive plasmid is used) shifting the culture to 42 degrees Celsius in a shaking water bath. Following induction, cells were chilled on ice for 15 min prior to washing thrice with ¼ the initial culture volume with 10% glycerol (for example, 50 mL washed for a 200 mL culture). Cells were resuspended in 1/100 the initial volume (for example, 2 mL for a 200 mL culture) and stores at −90 degrees Celsius until ready to use. To perform the compatibility and editing efficiency screens described here, 50 ng of editing cassette was transformed into cell aliquots by electroporation. Following electroporation, the cells were recovered in LB for 3 hours and 100 μL of cells were plated on Macconkey plates containing 1% galactose.

Editing efficiencies were determined by dividing the number of white colonies (edited cells) by the total number of white and red colonies (edited and non-edited cells).

Example 4. PAM Selection Assay

In order to generate a double strand break in a target sequence, a guide nucleic acid must hybridize to a target sequence, and the MAD nuclease must recognize a PAM sequence adjacent to the target sequence. If the guide nucleic acid hybridizes to the target sequence, but the MAD nuclease does not recognize a PAM site, then cleavage does not occur.

A PAM is MAD nuclease-specific and not all MAD nucleases necessarily recognize the same PAM. In order to assess the PAM site requirements for the MAD nucleases, an assay as depicted in FIGS. 6A-6C was performed.

FIG. 6A depicts a MAD nuclease expression vector as described elsewhere, which also contains a chloramphenicol resistance gene and the lambda RED recombination system.

FIG. 6B depicts a self-targeting editing cassette. The guided nucleic acid is designed to target the target sequence which is contained on the same nucleic acid molecule. The target sequence is flanked by random nucleotides, depicted by N4, meaning four random nucleotides on either end of the target sequence. It should be understood that any number of random nucleotides could also be used (for example, 3, 5, 6, 7, 8, etc). The random nucleotides serve as a library of potential PAMs.

FIG. 6C depicts the experimental design. Basically, the MAD nuclease expression vector and editing cassette comprising the random PAM sites were transformed into a host cell. If a functional targetable nuclease complex was formed and the MAD nuclease recognized a PAM site, then the editing cassette vector was cleaved and which leads to cell death. If a functional targetable complex was not formed or if the MAD nuclease did not recognize the PAM, then the target sequence was not cleaved and the cell survived. Next generation sequence (NGS) was then used to sequence the starting and final cell populations in order to determine what PAM sites were recognized by a given MAD nuclease. These recognized PAM sites were then used to determine a consensus or non-consensus PAM for a given MAD nuclease.

The consensus PAM for MAD1-MAD8, and MAD10-MAD12 was determined to be TTTN. The consensus PAM for MAD9 was determined to be NNG. The consensus PAM for MAD13-MAD15 was determined to be TTN. The consensus PAM for MAD16-MAD18 was determined to be TA. The consensus PAM for MAD19-MAD20 was determined to be TTCN.

Example 5: Testing Heterologous Guide Nucleic Acids

Editing efficiencies were tested for MAD1, MAD2, MAD4, and MAD7 and are depicted in FIG. 7A and FIG. 7B. Experiment details and editing efficiencies are summarized in Table 3. Editing efficiency was determined by dividing the number of edited cells by the total number of recovered cells. Various editing cassettes targeting the galK gene were used to allow screening of editing cells. The guide nucleic acids encoded on the editing cassette contained a guide sequence targeting the galK gene and one of various scaffold sequences in order to test the compatibility of the indicated MAD nuclease with the indicated scaffold sequence, as summarized in Table 3.

Editing efficiencies for compatible MAD nuclease and guide nucleic acids (comprising the indicated scaffold sequences) were observed to have between 75-100% editing efficiency. MAD2 had between a 75-100% editing efficiency and MAD7 had between a 97-100% editing efficiency.

MAD2 combined with scaffold-1, scaffold-2, scaffold-4, or scaffold-13 in these experiments results in 0% editing efficiency. These data imply that MAD2 did not form a functional complex with these tested guide nucleic acids and that MAD2 is not compatible with these scaffold sequences.

MAD7 combined with scaffold-1, scaffold-2, scaffold-4, or scaffold-13 in these experiments results in 0% editing efficiency. These data imply that MAD7 did not form a functional complex with these tested guide nucleic acids and that MAD7 is not compatible with these scaffold sequences.

For MAD1 and MAD4, all tested guide nucleic acid combinations resulted in 0% editing efficiency, implying that MAD1 and MAD4 did not form a functional complex with any of the tested guide nucleic acids. These data also imply that MAD1 and MAD4 are not compatible with the tested scaffold sequences.

Combined, these data highlight the unpredictability of finding a compatible MAD nuclease and scaffold sequence pair in order to form a functional targetable nuclease complex. Some tested MAD nucleases did not function with any tested scaffold sequence. Some tested MAD nucleases only functioned with some tested scaffold sequences and not with others.

TABLE 3
Editing
Nucleic acid- Guide nucleic acid scaffold sequence Editing
# guided nuclease sequence mutation Target gene efficiency
1 MAD1 Scaffold-1; SEQ ID NO: 84 L80** galK  0%
2 MAD1 Scaffold-2; SEQ ID NO: 85 Y145** galK  0%
3 MAD1 Scaffold-4; SEQ ID NO: 87 Y145** galK  0%
4 MAD1 Scaffold-10; SEQ ID NO: 93 Y145** galK  0%
5 MAD1 Scaffold-11; SEQ ID NO: 94 L80** galK  0%
6 MAD1 Scaffold-12; SEQ ID NO: 95 L10KpnI galK  0%
7 MAD1 Scaffold-13; SEQ ID NO: 96 Y145** galK  0%
8 MAD1 Scaffold-12; SEQ ID NO: 95 L10KpnI galK  0%
9 MAD2 Scaffold-10; SEQ ID NO: 93 L80** galK  0%
10 MAD2 Scaffold-10; SEQ ID NO: 93 Y145** galK 100%
11 MAD2 Scaffold-11; SEQ ID NO: 94 L80** galK  98%
12 MAD2 Scaffold-11; SEQ ID NO: 94 Y145** galK  99%
13 MAD2 Scaffold-12; SEQ ID NO: 95 Y145** galK  98%
14 MAD2 Scaffold-12; SEQ ID NO: 95 Y145** galK  0%
15 MAD2 Scaffold-13; SEQ ID NO: 96 Y145** galK  0%
16 MAD2 Scaffold-1; SEQ ID NO: 84 L80** galK  0%
17 MAD2 Scaffold-2; SEQ ID NO: 85 Y145** galK  0%
18 MAD2 Scaffold-2; SEQ ID NO: 85 Y145** galK  0%
19 MAD2 Scaffold-4; SEQ ID NO: 87 Y145** galK  0%
20 MAD2 Scaffold-5; SEQ ID NO: 88 L80** galK  99%
21 MAD2 Scaffold-12; SEQ ID NO: 95 89** galK  0%
22 MAD2 Scaffold-12; SEQ ID NO: 95 70** galK  75%
23 MAD2 Scaffold-12; SEQ ID NO: 95 L10KpnI galK  79%
24 MAD4 Scaffold-1; SEQ ID NO: 84 L80** galK  0%
25 MAD4 Scaffold-2; SEQ ID NO: 85 Y145** galK  0%
26 MAD4 Scaffold-4; SEQ ID NO: 87 Y145** galK  0%
27 MAD4 Scaffold-10; SEQ ID NO: 93 Y145** galK  0%
28 MAD4 Scaffold-11; SEQ ID NO: 94 L80** galK  0%
29 MAD4 Scaffold-12; SEQ ID NO: 95 L10KpnI galK  0%
30 MAD4 Scaffold-13; SEQ ID NO: 96 Y145** galK  0%
31 MAD4 Scaffold-12; SEQ ID NO: 95 L10KpnI galK  0%
32 MAD7 Scaffold-1; SEQ ID NO: 84 L80** galK  0%
33 MAD7 Scaffold-2; SEQ ID NO: 85 Y145** galK  0%
34 MAD7 Scaffold-4; SEQ ID NO: 87 Y145** galK  0%
35 MAD7 Scaffold-10; SEQ ID NO: 93 Y145** galK 100%
36 MAD7 Scaffold-11; SEQ ID NO: 94 L80** galK  97%
37 MAD7 Scaffold-12; SEQ ID NO: 95 L10KpnI galK  0%
38 MAD7 Scaffold-13; SEQ ID NO: 96 Y145** galK  0%
39 MAD7 Scaffold-12; SEQ ID NO: 95 L10KpnI galK  0%

Example 6. Assessment of MAD2 and MAD7

The ability of MAD2 and MAD7 to function with heterologous guide nucleic acids were tested using a similar experimental design as described above.

The compatibility of MAD2 with other scaffold sequences was tested and the results of an experiment are depicted in FIG. 8. The MAD nucleases, guide nucleic acid scaffold sequences, and editing sequences used in this experiment are summarized in Table 4.

The compatibility of MAD7 with other scaffold sequences was tested and the results of an experiment are depicted in FIG. 9. The MAD nucleases, guide nucleic acid scaffold sequences, and editing sequences used in this experiment are summarized in Table 5.

TABLE 4
Editing
Nucleic acid- Guide nucleic acid scaffold sequence Target
# guided nuclease sequence mutation gene
1 MAD2 Scaffold-12; SEQ ID NO: 95 N89KpnI galK
2 MAD2 Scaffold-10; SEQ ID NO: 93 L80** galK
3 MAD2 Scaffold-5; SEQ ID NO: 88 L80** galK
4 MAD2 Scaffold-12; SEQ ID NO: 95 D70KpnI galK
5 MAD2 Scaffold-12; SEQ ID NO: 95 Y145** galK
6 MAD2 Scaffold-11; SEQ ID NO: 94 Y145** galK
7 MAD2 Scaffold-10; SEQ ID NO: 93 Y145** galK
8 MAD2 Scaffold-12; SEQ ID NO: 95 L10KpnI galK
9 MAD2 Scaffold-11; SEQ ID NO: 94 L80** galK
10 SpCas9 S. pyogenese gRNA Y145** galK
11 MAD2 Scaffold-2; SEQ ID NO: 85 Y145** galK
12 MAD2 Scaffold-4; SEQ ID NO: 87 Y145** galK
13 MAD2 Scaffold-1; SEQ ID NO: 84 L80** galK
14 MAD2 Scaffold-13; SEQ ID NO: 96 Y145** galK

TABLE 5
Editing
Nucleic acid- Guide nucleic acid scaffold sequence Target
# guided nuclease sequence mutation gene
1 MAD7 Scaffold-1; SEQ ID NO: 84 L80** galK
2 MAD7 Scaffold-2; SEQ ID NO: 85 Y145** galK
3 MAD7 Scaffold-4; SEQ ID NO: 87 Y145** galK
4 MAD7 Scaffold-10; SEQ ID NO: 93 Y145** galK
5 MAD7 Scaffold-11; SEQ ID NO: 95 L80** galK

In another experiment, transformation efficiencies (FIG. 10B) were determined by calculating the total number of recovered cells compared to the start number of cells. An example plate image is depicted in FIG. 10C. Editing efficiencies (FIG. 10A) were determined by calculating the ratio of editing colonies (white colonies, edited galK gene) versus total colonies.

In this example (FIG. 10A-10C), cells expressing galK were transformed with expression constructs expressing either MAD2 or MAD7 and a corresponding editing cassette comprising a guide nucleic acid targeting the galK gene. The guide nucleic acid was comprised of a guide sequence targeting the galK gene and the scaffold-12 sequence (SEQ ID NO: 95).

In the depicted example, MAD2 and MAD7 has a lower transformation efficiency compared to S. pyogenes Cas9, though the editing efficiency of MAD2 and MAD7 was slightly higher than S. pyogenes Cas9.

FIG. 11 depicts the sequencing results from select colonies recovered from the assay described above. The target sequence was in the galK coding sequence (CDS). The TTTN PAM is shown as the reverse complement (wild-type NAAA, mutated NGAA). The mutations targeted by the editing sequence are labeled as target codons. Changes compared to the wild-type sequence are highlighted. In these experiments, the scaffold-12 sequence (SEQ ID NO: 95) was used. The guide sequence of the guide nucleic acid targeted the galK gene.

Six of the seven depicted sequences from the MAD2 experiment contained the designed PAM mutation and designed mutations in the target codons of galK, which one sequences colony maintained the wild-type PAM and wild-type target codons while also containing an unintended mutation upstream of the target site.

Two of the four depicted sequences from the MAD7 experiment contained the designed PAM mutation and mutated target codons. One colony comprises a wildtype sequence, while another contained a deletion of eight nucleotides upstream of the target sequence.

FIG. 12 depicts results from another experiment testing the ability to recover edited cells. In Experiment 0, the MAD2 nuclease was used with a guide nucleic acid comprising scaffold-11 sequence and a guide sequence targeting galK. The editing cassette comprised an editing sequence designed to incorporate an L80** mutation into galK, thereby allowing screening of the edited cells. In experiment 1, the MAD2 nuclease was used with a guide nucleic acid comprising scaffold-12 sequence and a guide sequence targeting galK. The editing cassette comprised an editing sequence designed to incorporate an L10KpnI mutation into galK. In both experiments, a negative control plasmid a guide nucleic acid that is not compatible with MAD2 was included in the transformations. Following transformation, the ratio of the compatible editing cassette (those containing scaffold-11 or scaffold-12 guide nucleic acids) to the non-compatible editing cassette (negative control) was measure. The experiments were done in the presence or absence of selection. The results show that more compatible editing cassette containing cells were recovered compared to the non-compatible editing cassette, and this result is magnified when selection is used.

Example 7. Guide Nucleic Acid Characterization

The sequences of scaffolds 1-8, and 10-12 (SEQ ID NO: 84-91, and 93-95) were aligned and are depicted in FIG. 13A. Nucleotides that match the consensus sequence are faded, while those diverging from the consensus sequence are visible. The predicted pseudoknot region is indicated. Without being bound by theory, the region 5′ of the pseudoknot may be influence binding and/or kinetics of the nucleic acid-guided nuclease. As is shown in FIG. 13A, in general, there appears to be less variability in the pseudoknot region (e.g., SEQ ID NO: 172-181) as compared to the sequence outside of the pseudoknot region.

FIG. 13B shows a preliminary model of MAD2 and MAD12 complexed with a guide nucleic acid (in this example, a guide RNA) and target sequence (DNA).

Example 8. Editing Efficiency of the MAD Nucleases

A plate-based editing efficiency assay and a molecular editing efficiency assay were used to test editing efficiency of various MAD nuclease and guide nucleic acid combinations.

FIG. 15 depicts quantification of the data obtained using the molecular editing efficiency assay using MAD2 nuclease with a guide nucleic acid comprising scaffold-12 and a guide sequencing targeting galK. The indicated mutations were incorporated into the galK using corresponding editing cassettes containing the mutation. FIG. 16 shows the comparison of the editing efficiencies determined by the plate-based assay using white and red colonies as described previously, and the molecular editing efficiency assay. As shown in FIG. 16, the editing efficiencies as determined by the two separate assays are consistent.

Example 9. Trackable Editing

Genetic edits can be tracked by the use of a barcode. A barcode can be incorporated into or near the edit site as described in the present specification. When multiple rounds of engineering are being performed, with a different edit being made in each round, it may be beneficial to insert a barcode in a common region during each round of engineering, this way one could sequence a single site and get the sequences of all of the barcodes from each round without the need to sequence each edited site individually. FIGS. 17A-17C, 18, and 19 depict examples of such trackable engineering workflows.

As depicted in FIG. 17A, a cell expressing a MAD nuclease is transformed with a plasmid containing an editing cassette and a recording cassette. The editing cassette contains a PAM mutation and a gene edit. The recorder cassette comprises a barcode, in this case 15N. Both the editing cassette and recording cassette each comprise a guide nucleic acid to a distinct target sequence. Within a library of such plasmids, the recorder cassette for each round can contain the same guide nucleic acid, such that the first round barcode is inserted into the same location across all variants, regardless of what editing cassette and corresponding gene edit is used. The correlation between the barcode and editing cassette is determined beforehand though such that the edit can be identified by sequencing the barcode. FIG. 17B shows an example of a recording cassette designed to delete a PAM site while incorporating a 15N barcode (actatcaatg ggctaactnnnnnnnnnnnnnnntgaaacatctgcaactgcg (SEQ ID No: 203); actatcaatgggctaactac gttcgtggcgtggtgaaacatctgcaactgcg (SEQ ID No: 204). The deleted PAM is used to enrich for edited cells since mutated PAM cells escape cell death while cells containing a wild-type PAM sequence are killed. Fire 21 C depicts how sequencing the barcode region can be used to identify which edit is comprised within each cell.

A similar approach is depicted in FIG. 18. In this case, the recorder cassette from each round is designed to target a sequence adjacent to the previous round, and each time, a new PAM site is deleted by the recorder cassette. The result is a barcode array with the barcodes from each round that can be sequenced to confirm each round of engineering took place and to determine which combination of mutations are contained in the cell, and in which order the mutations were made. Each successive recorder cassette can be designed to be homologous on one end to the region comprising the mutated PAM from the previous round, which could increase the efficiency of getting fully edited cells at the end of the experiment. In other examples, the recorder cassette is designed to target a unique landing site that was incorporated by the previous recorder cassette. This increases the efficiency of recovering cells containing all of the desired mutations since the subsequent recorder cassette and barcode can only target a cell that has successfully completed the previous round of engineering.

FIG. 19 depicts another approach that allows the recycling of selectable markers or to otherwise cure the cell of the plasmid form the previous round of engineering. In this case, the transformed plasmid containing a guide nucleic acid designed to target a selectable marker or other unique sequence in the plasmid form the previous round of engineering.

SEQUENCE LISTING

Table 6

TABLE 6
SEQUENCE LISTING
SEQ
ID
NO: Sequence
SEQ MGKMYYLGLDIGTNSVGYAVTDPSYHLLKFKGEPMWGAHVFAAGNQSAERRSFRT
ID SRRRLDRRQQRVKLVQEIFAPVISPIDPRFFIRLHESALWRDDVAETDKHIFFND
NO: PTYTDKEYYSDYPTIHHLIVDLMESSEKHDPRLVYLAVAWLVAHRGHFLNEVDKD
1 NIGDVLSFDAFYPEFLAFLSDNGVSPWVCESKALQATLLSRNSVNDKYKALKSLI
FGSQKPEDNFDANISEDGLIQLLAGKKVKVNKLFPQESNDASFTLNDKEDAIEEI
LGTLTPDECEWIAHIRRLFDWAIMKHALKDGRTISESKVKLYEQHHHDLTQLKYF
VKTYLAKEYDDIFRNVDSETTKNYVAYSYHVKEVKGTLPKNKATQEEFCKYVLGK
VKNIECSEADKVDFDEMIQRLTDNSFMPKQVSGENRVIPYQLYYYELKTILNKAA
SYLPFLTQCGKDAISNQDKLLSIMTFRIPYFVGPLRKDNSEHAWLERKAGKIYPW
NFNDKVDLDKSEEAFIRRMTNTCTYYPGEDVLPLDSLIYEKFMILNEINNIRIDG
YPISVDVKQQVFGLFEKKRRVTVKDIQNLLLSLGALDKHGKLTGIDTTIHSNYNT
YHHFKSLMERGVLTRDDVERIVERMTYSDDTKRVRLWLNNNYGTLTADDVKHISR
LRKHDFGRLSKMFLTGLKGVHKETGERASILDFMWNTNDNLMQLLSECYTFSDEI
TKLQEAYYAKAQLSLNDFLDSMYISNAVKRPIYRTLAVVNDIRKACGTAPKRIFI
EMARDGESKKKRSVTRREQIKNLYRSIRKDFQQEVDFLEKILENKSDGQLQSDAL
YLYFAQLGRDMYTGDPIKLEHIKDQSFYNIDHIYPQSMVKDDSLDNKVLVQSEIN
GEKSSRYPLDAAIRNKMKPLWDAYYNHGLISLKKYQRLTRSTPFTDDEKWDFINR
QLVETRQSTKALAILLKRKFPDTEIVYSKAGLSSDFRHEFGLVKSRNINDLHHAK
DAFLAIVTGNVYHERFNRRWFMVNQPYSVKTKTLFTHSIKNGNFVAWNGEEDLGR
IVKMLKQNKNTIHFTRFSFDRKEGLFDIQPLKASTGLVPRKAGLDVVKYGGYDKS
TAAYYLLVRFTLEDKKTQHKLMMIPVEGLYKARIDHDKEFLTDYAQTTISEILQK
DKQKVINIMFPMGTRHIKLNSMISIDGFYLSIGGKSSKGKSVLCHAMVPLIVPHK
IECYIKAMESFARKFKENNKLRIVEKFDKITVEDNLNLYELFLQKLQHNPYNKFF
STQFDVLTNGRSTFTKLSPEEQVQTLLNILSIFKTCRSSGCDLKSINGSAQAARI
MISADLTGLSKKYSDIRLVEQSASGLFVSKSQNLLEYL*
SEQ MSSLTKFTNKYSKQLTIKNELIPVGKTLENIKENGLIDGDEQLNENYQKAKIIVD
ID DFLRDFINKALNNTQIGNWRELADALNKEDEDNIEKLQDKIRGIIVSKFETFDLF
NO: SSYSIKKDEKIIDDDNDVEEEELDLGKKTSSFKYIFKKNLFKLVLPSYLKTTNQD
2 KLKIISSFDNFSTYFRGFFENRKNIFTKKPISTSIAYRIVHDNFPKFLDNIRCFN
VWQTECPQLIVKADNYLKSKNVIAKDKSLANYFTVGAYDYFLSQNGIDFYNNIIG
GLPAFAGHEKIQGLNEFINQECQKDSELKSKLKNRHAFKMAVLFKQILSDREKSF
VIDEFESDAQVIDAVKNFYAEQCKDNNVIFNLLNLIKNIAFLSDDELDGIFIEGK
YLSSVSQKLYSDWSKLRNDIEDSANSKQGNKELAKKIKTNKGDVEKAISKYEFSL
SELNSIVHDNTKFSDLLSCTLHKVASEKLVKVNEGDWPKHLKNNEEKQKIKEPLD
ALLEIYNTLLIFNCKSFNKNGNFYVDYDRCINELSSVVYLYNKTRNYCTKKPYNT
DKFKLNFNSPQLGEGFSKSKENDCLTLLFKKDDNYYVGIIRKGAKINFDDTQAIA
DNTDNCIFKMNYFLLKDAKKFIPKCSIQLKEVKAHFKKSEDDYILSDKEKFASPL
VIKKSTFLLATAHVKGKKGNIKKFQKEYSKENPTEYRNSLNEWIAFCKEFLKTYK
AATIFDITTLKKAEEYADIVEFYKDVDNLCYKLEFCPIKTSFIENLIDNGDLYLF
RINNKDFSSKSTGTKNLHTLYLQAIFDERNLNNPTIMLNGGAELFYRKESIEQKN
RITHKAGSILVNKVCKDGTSLDDKIRNEIYQYENKFIDTLSDEAKKVLPNVIKKE
ATHDITKDKRFTSDKFFFHCPLTINYKEGDTKQFNNEVLSFLRGNPDINIIGIDR
GERNLIYVTVINQKGEILDSVSFNTVTNKSSKIEQTVDYEEKLAVREKERIEAKR
SWDSISKIATLKEGYLSAIVHEICLLMIKHNAIVVLENLNAGFKRIRGGLSEKSV
YQKFEKMLINKLNYFVSKKESDWNKPSGLLNGLQLSDQFESFEKLGIQSGFIFYV
PAAYTSKIDPTTGFANVLNLSKVRNVDAIKSFFSNFNEISYSKKEALFKFSFDLD
SLSKKGFSSFVKFSKSKWNVYTFGERIIKPKNKQGYREDKRINLTFEMKKLLNEY
KVSFDLENNLIPNLTSANLKDTFWKELFFIFKTTLQLRNSVTNGKEDVLISPVKN
AKGEFFVSGTHNKTLPQDCDANGAYHIALKGLMILERNNLVREEKDTKKIMAISN
VDWFEYVQKRRGVL*
SEQ MNNYDEFTKLYPIQKTIRFELKPQGRTMEHLETFNFFEEDRDRAEKYKILKEAID
ID EYHKKFIDEHLTNMSLDWNSLKQISEKYYKSREEKDKKVFLSEQKRMRQEIVSEF
NO: KKDDRFKDLFSKKLFSELLKEEIYKKGNHQEIDALKSFDKFSGYFIGLHENRKNM
3 YSDGDEITAISNRIVNENFPKFLDNLQKYQEARKKYPEWIIKAESALVAHNIKMD
EVFSLEYFNKVLNQEGIQRYNLALGGYVTKSGEKMMGLNDALNLAHQSEKSSKGR
IHMTPLFKQILSEKESFSYIPDVFTEDSQLLPSIGGFFAQIENDKDGNIFDRALE
LISSYAEYDTERIYIRQADINRVSNVIFGEWGTLGGLMREYKADSINDINLERTC
KKVDKWLDSKEFALSDVLEAIKRTGNNDAFNEYISKMRTAREKIDAARKEMKFIS
EKISGDEESIHIIKTLLDSVQQFLHFFNLFKARQDIPLDGAFYAEFDEVHSKLFA
IVPLYNKVRNYLTKNNLNTKKIKLNFKNPTLANGWDQNKVYDYASLIFLRDGNYY
LGIINPKRKKNIKFEQGSGNGPFYRKMVYKQIPGPNKNLPRVFLTSTKGKKEYKP
SKEIIEGYEADKHIRGDKFDLDFCHKLIDFFKESIEKHKDWSKFNFYFSPTESYG
DISEFYLDVEKQGYRMHFENISAETIDEYVEKGDLFLFQIYNKDFVKAATGKKDM
HTIYWNAAFSPENLQDVVVKLNGEAELFYRDKSDIKEIVHREGEILVNRTYNGRT
PVPDKIHKKLTDYHNGRTKDLGEAKEYLDKVRYFKAHYDITKDRRYLNDKIYFHV
PLTLNFKANGKKNLNKMVIEKFLSDEKAHIIGIDRGERNLLYYSIIDRSGKIIDQ
QSLNVIDGFDYREKLNQREIEMKDARQSWNAIGKIKDLKEGYLSKAVHEITKMAI
QYNAIVVMEELNYGFKRGRFKVEKQIYQKFENMLIDKMNYLVFKDAPDESPGGVL
NAYQLTNPLESFAKLGKQTGILFYVPAAYTSKIDPTTGFVNLFNTSSKTNAQERK
EFLQKFESISYSAKDGGIFAFAFDYRKFGTSKTDHKNVWTAYTNGERMRYIKEKK
RNELFDPSKEIKEALTSSGIKYDGGQNILPDILRSNNNGLIYTMYSSFIAAIQMR
VYDGKEDYIISPIKNSKGEFFRTDPKRRELPIDADANGAYNIALRGELTMRAIAE
KFDPDSEKMAKLELKHKDWFEFMQTRGD*
SEQ MTKTFDSEFFNLYSLQKTVRFELKPVGETASFVEDFKNEGLKRVVSEDERRAVDY
ID QKVKEIIDDYHRDFIEESLNYFPEQVSKDALEQAFHLYQKLKAAKVEEREKALKE
NO: WEALQKKLREKVVKCFSDSNKARFSRIDKKELIKEDLINWLVAQNREDDIPTVET
4 FNNFTTYFTGFHENRKNIYSKDDHATAISFRLIHENLPKFFDNVISENKLKEGFP
ELKFDKVKEDLEVDYDLKHAFEIEYFVNFVTQAGIDQYNYLLGGKTLEDGTKKQG
MNEQINLFKQQQTRDKARQIPKLIPLFKQILSERTESQSFIPKQFESDQELFDSL
QKLHNNCQDKFTVLQQAILGLAEADLKKVFIKTSDLNALSNTIFGNYSVFSDALN
LYKESLKTKKAQEAFEKLPAHSIHDLIQYLEQFNSSLDAEKQQSTDTVLNYFIKT
DELYSRFIKSTSEAFTQVQPLFELEALSSKRRPPESEDEGAKGQEGFEQIKRIKA
YLDTLMEAVHFAKPLYLVKGRKMIEGLDKDQSFYEAFEMAYQELESLIIPIYNKA
RSYLSRKPFKADKFKINFDNNTLLSGWDANKETANASILFKKDGLYYLGIMPKGK
TFLFDYFVSSEDSEKLKQRRQKTAEEALAQDGESYFEKIRYKLLPGASKMLPKVF
FSNKNIGFYNPSDDILRIRNTASHTKNGTPQKGHSKVEFNLNDCHKMIDFFKSSI
QKHPEWGSFGFTFSDTSDFEDMSAFYREVENQGYVISFDKIKETYIQSQVEQGNL
YLFQIYNKDFSPYSKGKPNLHTLYWKALFEEANLNNVVAKLNGEAEIFFRRHSIK
ASDKVVHPANQAIDNKNPHTEKTQSTFEYDLVKDKRYTQDKFFFHVPISLNFKAQ
GVSKFNDKVNGFLKGNPDVNIIGIDRGERHLLYFTVVNQKGEILVQESLNTLMSD
KGHVNDYQQKLDKKEQERDAARKSWTTVENIKELKEGYLSHVVHKLAHLIIKYNA
IVCLEDLNFGFKRGRFKVEKQVYQKFEKALIDKLNYLVFKEKELGEVGHYLTAYQ
LTAPFESFKKLGKQSGILFYVPADYTSKIDPTTGFVNFLDLRYQSVEKAKQLLSD
FNAIRFNSVQNYFEFEIDYKKLTPKRKVGTQSKWVICTYGDVRYQNRRNQKGHWE
TEEVNVTEKLKALFASDSKTTTVIDYANDDNLIDVILEQDKASFFKELLWLLKLT
MTLRHSKIKSEDDFILSPVKNEQGEFYDSRKAGEVWPKDADANGAYHIALKGLWN
LQQINQWEKGKTLNLAIKNQDWFSFIQEKPYQE*
SEQ MHTGGLLSMDAKEFTGQYPLSKTLRFELRPIGRTWDNLEASGYLAEDRHRAECYP
ID RAKELLDDNHRAFLNRVLPQIDMDWHPIAEAFCKVHKNPGNKELAQDYNLQLSKR
NO: RKEISAYLQDADGYKGLFAKPALDEAMKIAKENGNESDIEVLEAFNGFSVYFTGY
5 HESRENIYSDEDMVSVAYRITEDNFPRFVSNALIFDKLNESHPDIISEVSGNLGV
DDIGKYFDVSNYNNFLSQAGIDDYNHIIGGHTTEDGLIQAFNVVLNLRHQKDPGF
EKIQFKQLYKQILSVRTSKSYIPKQFDNSKEMVDCICDYVSKIEKSETVERALKL
VRNISSFDLRGIFVNKKNLRILSNKLIGDWDAIETALMHSSSSENDKKSVYDSAE
AFTLDDIFSSVKKFSDASAEDIGNRAEDICRVISETAPFINDLRAVDLDSLNDDG
YEAAVSKIRESLEPYMDLFHELEIFSVGDEFPKCAAFYSELEEVSEQLIEIIPLE
NKARSFCTRKRYSTDKIKVNLKFPTLADGWDLNKERDNKAAILRKDGKYYLAILD
MKKDLSSIRTSDEDESSFEKMEYKLLPSPVKMLPKIFVKSKAAKEKYGLTDRMLE
CYDKGMHKSGSAFDLGFCHELIDYYKRCIAEYPGWDVFDFKFRETSDYGSMKEFN
EDVAGAGYYMSLRKIPCSEVYRLLDEKSIYLFQIYNKDYSENAHGNKNMHTMYWE
GLFSPQNLESPVFKLSGGAELFFRKSSIPNDAKTVHPKGSVLVPRNDVNGRRIPD
SIYRELTRYFNRGDCRISDEAKSYLDKVKTKKADHDIVKDRRFTVDKMMFHVPIA
MNFKAISKPNLNKKVIDGIIDDQDLKIIGIDRGERNLIYVTMVDRKGNILYQDSL
NILNGYDYRKALDVREYDNKEARRNWTKVEGIRKMKEGYLSLAVSKLADMIIENN
AIIVMEDLNHGFKAGRSKIEKQVYQKFESMLINKLGYMVLKDKSIDQSGGALHGY
QLANHVTTLASVGKQCGVIFYIPAAFTSKIDPTTGFADLFALSNVKNVASMREFF
SKMKSVIYDKAEGKFAFTFDYLDYNVKSECGRTLWTVYTVGERFTYSRVNREYVR
KVPTDIIYDALQKAGISVEGDLRDRIAESDGDTLKSIFYAFKYALDMRVENREED
YIQSPVKNASGEFFCSKNAGKSLPQDSDANGAYNIALKGILQLRMLSEQYDPNAE
SIRLPLITNKAWLTFMQSGMKTWKN*
SEQ MDSLKDFTNLYPVSKTLRFELKPVGKTLENIEKAGILKEDEHRAESYRRVKKIID
ID TYHKVFIDSSLENMAKMGIENEIKAMLQSFCELYKKDHRTEGEDKALDKIRAVLR
NO: GLIVGAFTGVCGRRENTVQNEKYESLFKEKLIKEILPDFVLSTEAESLPFSVEEA
6 TRSLKEFDSFTSYFAGFYENRKNIYSTKPQSTAIAYRLIHENLPKFIDNILVFQK
IKEPIAKELEHIRADFSAGGYIKKDERLEDIFSLNYYIHVLSQAGIEKYNALIGK
IVTEGDGEMKGLNEHINLYNQQRGREDRLPLFRPLYKQILSDREQLSYLPESFEK
DEELLRALKEFYDHIAEDILGRTQQLMTSISEYDLSRIYVRNDSQLTDISKKMLG
DWNAIYMARERAYDHEQAPKRITAKYERDRIKALKGEESISLANLNSCIAFLDNV
RDCRVDTYLSTLGQKEGPHGLSNLVENVFASYHEAEQLLSFPYPEENNLIQDKDN
VVLIKNLLDNISDLQRFLKPLWGMGDEPDKDERFYGEYNYIRGALDQVIPLYNKV
RNYLTRKPYSTRKVKLNFGNSQLLSGWDRNKEKDNSCVILRKGQNFYLAIMNNRH
KRSFENKVLPEYKEGEPYFEKMDYKFLPDPNKMLPKVFLSKKGIEIYKPSPKLLE
QYGHGTHKKGDTFSMDDLHELIDFFKHSIEAHEDWKQFGFKFSDTATYENVSSFY
REVEDQGYKLSFRKVSESYVYSLIDQGKLYLFQIYNKDFSPCSKGTPNLHTLYWR
MLFDERNLADVIYKLDGKAEIFFREKSLKNDHPTHPAGKPIKKKSRQKKGEESLF
EYDLVKDRHYTMDKFQFHVPITMNFKCSAGSKVNDMVNAHIREAKDMHVIGIDRG
ERNLLYICVIDSRGTILDQISLNTINDIDYHDLLESRDKDRQQERRNWQTIEGIK
ELKQGYLSQAVHRIAELMVAYKAVVALEDLNMGFKRGRQKVESSVYQQFEKQLID
KLNYLVDKKKRPEDIGGLLRAYQFTAPFKSFKEMGKQNGFLFYIPAWNTSNIDPT
TGFVNLFHAQYENVDKAKSFFQKFDSISYNPKKDWFEFAFDYKNFTKKAEGSRSM
WILCTHGSRIKNFRNSQKNGQWDSEEFALTEAFKSLFVRYEIDYTADLKTAIVDE
KQKDFFVDLLKLFKLTVQMRNSWKEKDLDYLISPVAGADGRFFDTREGNKSLPKD
ADANGAYNIALKGLWALRQIRQTSEGGKLKLAISNKEWLQFVQERSYEKD*
SEQ MNNGTNNFQNFIGISSLQKTLRNALIPTETTQQFIVKNGIIKEDELRGENRQILK
ID DIMDDYYRGFISETLSSIDDIDWTSLFEKMEIQLKNGDNKDTLIKEQTEYRKAIH
NO: KKFANDDRFKNMFSAKLISDILPEFVIHNNNYSASEKEEKTQVIKLFSRFATSFK
7 DYFKNRANCFSADDISSSSCHRIVNDNAEIFFSNALVYRRIVKSLSNDDINKISG
DMKDSLKEMSLEEIYSYEKYGEFITQEGISFYNDICGKVNSFMNLYCQKNKENKN
LYKLQKLHKQILCIADTSYEVPYKFESDEEVYQSVNGFLDNISSKHIVERLRKIG
DNYNGYNLDKIYIVSKFYESVSQKTYRDWETINTALEIHYNNILPGNGKSKADKV
KKAVKNDLQKSITEINELVSNYKLCSDDNIKAETYIHEISHILNNFEAQELKYNP
EIHLVESELKASELKNVLDVIMNAFHWCSVFMTEELVDKDNNFYAELEEIYDEIY
PVISLYNLVRNYVTQKPYSTKKIKLNFGIPTLADGWSKSKEYSNNAIILMRDNLY
YLGIFNAKNKPDKKIIEGNTSENKGDYKKMIYNLLPGPNKMIPKVFLSSKTGVET
YKPSAYILEGYKQNKHIKSSKDFDITFCHDLIDYFKNCIAIHPEWKNFGFDFSDT
STYEDISGFYREVELQGYKIDWTYISEKDIDLLQEKGQLYLFQIYNKDFSKKSTG
NDNLHTMYLKNLFSEENLKDIVLKLNGEAEIFFRKSSIKNPIIHKKGSILVNRTY
EAEEKDQFGNIQIVRKNIPENIYQELYKYFNDKSDKELSDEAAKLKNVVGHHEAA
TNIVKDYRYTYDKYFLHMPITINFKANKTGFINDRILQYIAKEKDLHVIGIDRGE
RNLIYVSVIDTCGNIVEQKSFNIVNGYDYQIKLKQQEGARQIARKEWKEIGKIKE
IKEGYLSLVIHEISKMVIKYNAIIAMEDLSYGFKKGRFKVERQVYQKFETMLINK
LNYLVFKDISITENGGLLKGYQLTYIPDKLKNVGHQCGCIFYVPAAYTSKIDPTT
GFVNIFKFKDLTVDAKREFIKKFDSIRYDSEKNLFCFTFDYNNFITQNTVMSKSS
WSVYTYGVRIKRRFVNGRFSNESDTIDITKDMEKTLEMTDINWRDGHDLRQDIID
YEIVQHIFEIFRLTVQMRNSLSELEDRDYDRLISPVLNENNIFYDSAKAGDALPK
DADANGAYCIALKGLYEIKQITENWKEDGKFSRDKLKISNKDWFDFIQNKRYL*
SEQ MTNKFTNQYSLSKTLRFELIPQGKTLEFIQEKGLLSQDKQRAESYQEMKKTIDKF
ID HKYFIDLALSNAKLTHLETYLELYNKSAETKKEQKFKDDLKKVQDNLRKEIVKSF
NO: SDGDAKSIFAILDKKELITVELEKWFENNEQKDIYFDEKFKTFTTYFTGFHQNRK
8 NMYSVEPNSTAIAYRLIHENLPKFLENAKAFEKIKQVESLQVNFRELMGEFGDEG
LIFVNELEEMFQINYYNDVLSQNGITIYNSIISGFTKNDIKYKGLNEYINNYNQT
KDKKDRLPKLKQLYKQILSDRISLSFLPDAFTDGKQVLKAIFDFYKINLLSYTIE
GQEESQNLLLLIRQTIENLSSFDTQKIYLKNDTHLTTISQQVFGDFSVFSTALNY
WYETKVNPKFETEYSKANEKKREILDKAKAVFTKQDYFSIAFLQEVLSEYILTLD
HTSDIVKKHSSNCIADYFKNHFVAKKENETDKTFDFIANITAKYQCIQGILENAD
QYEDELKQDQKLIDNLKFFLDAILELLHFIKPLHLKSESITEKDTAFYDVFENYY
EALSLLTPLYNMVRNYVTQKPYSTEKIKLNFENAQLLNGWDANKEGDYLTTILKK
DGNYFLAIMDKKHNKAFQKFPEGKENYEKMVYKLLPGVNKMLPKVFFSNKNIAYF
NPSKELLENYKKETHKKGDTFNLEHCHTLIDFFKDSLNKHEDWKYFDFQFSETKS
YQDLSGFYREVEHQGYKINFKNIDSEYIDGLVNEGKLFLFQIYSKDFSPFSKGKP
NMHTLYWKALFEEQNLQNVIYKLNGQAEIFFRKASIKPKNIILHKKKIKIAKKHF
IDKKTKTSEIVPVQTIKNLNMYYQGKISEKELTQDDLRYIDNFSIFNEKNKTIDI
IKDKRFTVDKFQFHVPITMNFKATGGSYINQTVLEYLQNNPEVKIIGLDRGERHL
VYLTLIDQQGNILKQESLNTITDSKISTPYHKLLDNKENERDLARKNWGTVENIK
ELKEGYISQVVHKIATLMLEENAIVVMEDLNFGFKRGRFKVEKQIYQKLEKMLID
KLNYLVLKDKQPQELGGLYNALQLTNKFESFQKMGKQSGFLFYVPAWNTSKIDPT
TGFVNYFYTKYENVDKAKAFFEKFEAIRFNAEKKYFEFEVKKYSDENPKAEGTQQ
AWTICTYGERIETKRQKDQNNKFVSTPINLTEKIEDFLGKNQIVYGDGNCIKSQI
ASKDDKAFFETLLYWFKMTLQMRNSETRTDIDYLISPVMNDNGTFYNSRDYEKLE
NPTLPKDADANGAYHIAKKGLMLLNKIDQADLTKKVDLSISNRDWLQFVQKNK*
SEQ MEQEYYLGLDMGTGSVGWAVTDSEYHVLRKHGKALWGVRLFESASTAEERRMFRT
ID SRRRLDRRNWRIEILQEIFAEEISKKDPGFFLRMKESKYYPEDKRDINGNCPELP
NO: YALFVDDDFTDKDYHKKFPTIYHLRKMLMNTEETPDIRLVYLAIHHMMKHRGHFL
9 LSGDINEIKEFGTTFSKLLENIKNEELDWNLELGKEEYAVVESILKDNMLNRSTK
KTRLIKALKAKSICEKAVLNLLAGGTVKLSDIFGLEELNETERPKISFADNGYDD
YIGEVENELGEQFYIIETAKAVYDWAVLVEILGKYTSISEAKVATYEKHKSDLQF
LKKIVRKYLTKEEYKDIFVSTSDKLKNYSAYIGMTKINGKKVDLQSKRCSKEEFY
DFIKKNVLKKLEGQPEYEYLKEELERETFLPKQVNRDNGVIPYQIHLYELKKILG
NLRDKIDLIKENEDKLVQLFEFRIPYYVGPLNKIDDGKEGKFTWAVRKSNEKIYP
WNFENVVDIEASAEKFIRRMTNKCTYLMGEDVLPKDSLLYSKYMVLNELNNVKLD
GEKLSVELKQRLYTDVFCKYRKVTVKKIKNYLKCEGIISGNVEITGIDGDFKASL
TAYHDFKEILTGTELAKKDKENIITNIVLFGDDKKLLKKRLNRLYPQITPNQLKK
ICALSYTGWGRFSKKFLEEITAPDPETGEVWNIITALWESNNNLMQLLSNEYRFM
EEVETYNMGKQTKTLSYETVENMYVSPSVKRQIWQTLKIVKELEKVMKESPKRVF
IEMAREKQESKRTESRKKQLIDLYKACKNEEKDWVKELGDQEEQKLRSDKLYLYY
TQKGRCMYSGEVIELKDLWDNTKYDIDHIYPQSKTMDDSLNNRVLVKKKYNATKS
DKYPLNENIRHERKGFWKSLLDGGFISKEKYERLIRNTELSPEELAGFIERQIVE
TRQSTKAVAEILKQVFPESEIVYVKAGTVSRFRKDFELLKVREVNDLHHAKDAYL
NIVVGNSYYVKFTKNASWFIKENPGRTYNLKKMFTSGWNIERNGEVAWEVGKKGT
IVTVKQIMNKNNILVTRQVHEAKGGLFDQQIMKKGKGQIAIKETDERLASIEKYG
GYNKAAGAYFMLVESKDKKGKTIRTIEFIPLYLKNKIESDESIALNFLEKGRGLK
EPKILLKKIKIDTLFDVDGFKMWLSGRTGDRLLFKCANQLILDEKIIVTMKKIVK
FIQRRQENRELKLSDKDGIDNEVLMEIYNTFVDKLENTVYRIRLSEQAKTLIDKQ
KEFERLSLEDKSSTLFEILHIFQCQSSAANLKMIGGPGKAGILVMNNNISKCNKI
SIINQSPTGIFENEIDLLK
SEQ MNKFENFTGLYPISKTLRFELIPQGKTLEYIEKSEILENDNYRAEKYEEVKDIID
ID GYHKWFINETLHDLHINWSELKVALENNRIEKSDASKKELQRVQKIKREEIYNAF
NO: IEHEAFQYLFKENLLSDLLPIQIEQSEDLDAEKKKQAVETFNRFSTYFTGFHENR
10 KNIYSKEGISTSVTYRIVHDNFPKFLENMKVFEILRNECPEVISDTANELAPFID
GVRIEDIFLIDFFNSTFSQNGIDYYNRILGGVTTETGEKYRGINEFTNLYRQQHP
EFGKSKKATKMVVLFKQILSDRDTLSFIPEMFGNDKQVQNSIQLFYNREISQFEN
EGVKTDVCTALATLTSKIAEFDTEKIYIQQPELPNVSQRLFGSWNELNACLFKYA
ELKFGTAEKVANRKKIDKWLKSDLFSFTELNKALEFSGKDERIENYFSETGIFAQ
LVKTGFDEAQSILETEYTSEVHLKDQQTDIEKIKTFLDALQNLMHLLKSLCVSEE
ADRDAAFYNEFDMLYNQLKLVVPLYNKVRNYITQKLFRSDKIKIYFENKGQFLGG
WVDSQTENSDNGTQAGGYIFRKENVINEYDYYLGICSDPKLFRRTTIVSENDRSS
FERLDYYQLKTASVYGNSYCGKHPYTEDKNELVNSIDRFVHLSGNNILIEKIAKD
KVKSNPTTNTPSGYLNFIHREAPNTYECLLQDENFVSLNQRVVSALKATLATLVR
VPKALVYAKKDYHLFSEIINDIDELSYEKAFSYFPVSQTEFENSSNRTIKPLLLF
KISNKDLSFAENFEKGNRQKIGKKNLHTLYFEALMKGNQDTIDIGTGMVFHRVKS
LNYNEKTLKYGHHSTQLNEKFSYPIIKDKRFASDKFLFHLSTEINYKEKRKPLNN
SIIEFLTNNPDINIIGLDRGERHLIYLTLINQKGEILRQKTFNIVGNTNYHEKLN
QREKERDNARKSWATIGKIKELKEGFLSLVIHEIAKIMVENNAIVVLEDLNFGFK
RGRFKVEKQIYQKFEKMLIDKLNYLVFKDKKANEAGGVLKGYQLAEKFESFQKMG
KQSGFLFYVPAAYTSKIDPTTGFVNMLNLNYTNMKDAQTLLSGMDKISFNADANY
FEFELDYEKFKTNQTDHTNKWTICTVGEKRFTYNSATKETTTVNVTEDLKKLLDK
FEVKYSNGDNIKDEICRQTDAKFFEIILWLLKLTMQMRNSNTKTEEDFILSPVKN
SNGEFFRSNDDANGIWPADADANGAYHIALKGLYLVKECFNKNEKSLKIEHKNWF
KFAQTRENGSLTKNG*
SEQ MENFKNLYPINKTLRFELRPYGKTLENFKKSGLLEKDAFKANSRRSMQAIIDEKF
ID KETIEERLKYTEFSECDLGNMTSKDKKITDKAATNLKKQVILSFDDEIFNNYLKP
NO: DKNIDALFKNDPSNPVISTFKGFTTYFVNFFEIRKHIFKGESSGSMAYRIIDENL
11 TTYLNNIEKIKKLPEELKSQLEGIDQIDKLNNYNEFITQSGITHYNEIIGGISKS
ENVKIQGINEGINLYCQKNKVKLPRLTPLYKMILSDRVSNSFVLDTIENDTELIE
MISDLINKTEISQDVIMSDIQNIFIKYKQLGNLPGISYSSIVNAICSDYDNNFGD
GKRKKSYENDRKKHLETNVYSINYISELLTDTDVSSNIKMRYKELEQNYQVCKEN
FNATNWMNIKNIKQSEKTNLIKDLLDILKSIQRFYDLFDIVDEDKNPSAEFYTWL
SKNAEKLDFEFNSVYNKSRNYLTRKQYSDKKIKLNFDSPTLAKGWDANKEIDNST
IIMRKENNDRGDYDYFLGIWNKSTPANEKIIPLEDNGLFEKMQYKLYPDPSKMLP
KQFLSKIWKAKHPTTPEFDKKYKEGRHKKGPDFEKEFLHELIDCFKHGLVNHDEK
YQDVFGFNLRNTEDYNSYTEFLEDVERCNYNLSENKIADTSNLINDGKLYVFQIW
SKDFSIDSKGTKNLNTIYFESLFSEENMIEKMFKLSGEAEIFYRPASLNYCEDII
KKGHHHAELKDKFDYPIIKDKRYSQDKFFFHVPMVINYKSEKLNSKSLNNRTNEN
LGQFTHIIGIDRGERHLIYLTVVDVSTGEIVEQKHLDEIINTDTKGVEHKTHYLN
KLEEKSKTRDNERKSWEAIETIKELKEGYISHVINEIQKLQEKYNALIVMENLNY
GFKNSRIKVEKQVYQKFETALIKKFNYIIDKKDPETYIHGYQLTNPITTLDKIGN
QSGIVLYIPAWNTSKIDPVTGFVNLLYADDLKYKNQEQAKSFIQKIDNIYFENGE
FKFDIDFSKWNNRYSISKTKWTLTSYGTRIQTFRNPQKNNKWDSAEYDLTEEFKL
ILNIDGTLKSQDVETYKKFMSLFKLMLQLRNSVTGTDIDYMISPVTDKTGTHFDS
RENIKNLPADADANGAYNIARKGIMAIENIMNGISDPLKISNEDYLKYIQNQQE
SEQ MTQFEGFTNLYQVSKTLRFELIPQGKTLKHIQEQGFIEEDKARNDHYKELKPIID
ID RIYKTYADQCLQLVQLDWENLSAAIDSYRKEKTEETRNALIEEQATYRNAIHDYF
NO: IGRTDNLTDAINKRHAEIYKGLFKAELFNGKVLKQLGTVTTTEHENALLRSFDKF
12 TTYFSGFYENRKNVFSAEDISTAIPHRIVQDNFPKFKENCHIFTRLITAVPSLRE
HFENVKKAIGIFVSTSIEEVFSFPFYNQLLTQTQIDLYNQLLGGISREAGTEKIK
GLNEVLNLAIQKNDETAHIIASLPHRFIPLFKQILSDRNTLSFILEEFKSDEEVI
QSFCKYKTLLRNENVLETAEALFNELNSIDLTHIFISHKKLETISSALCDHWDTL
RNALYERRISELTGKITKSAKEKVQRSLKHEDINLQEIISAAGKELSEAFKQKTS
EILSHAHAALDQPLPTTLKKQEEKEILKSQLDSLLGLYHLLDWFAVDESNEVDPE
FSARLTGIKLEMEPSLSFYNKARNYATKKPYSVEKFKLNFQMPTLASGWDVNKEK
NNGAILFVKNGLYYLGIMPKQKGRYKALSFEPTEKTSEGFDKMYYDYFPDAAKMI
PKCSTQLKAVTAHFQTHTTPILLSNNFIEPLEITKEIYDLNNPEKEPKKFQTAYA
KKTGDQKGYREALCKWIDFTRDFLSKYTKTTSIDLSSLRPSSQYKDLGEYYAELN
PLLYHISFQRIAEKEIMDAVETGKLYLFQIYNKDFAKGHHGKPNLHTLYWTGLFS
PENLAKTSIKLNGQAELFYRPKSRMKRMAHRLGEKMLNKKLKDQKTPIPDTLYQE
LYDYVNHRLSHDLSDEARALLPNVITKEVSHEIIKDRRFTSDKFFFHVPITLNYQ
AANSPSKFNQRVNAYLKEHPETPIIGIDRGERNLIYITVIDSTGKILEQRSLNTI
QQFDYQKKLDNREKERVAARQAWSVVGTIKDLKQGYLSQVIHEIVDLMIHYQAVV
VLENLNFGFKSKRTGIAEKAVYQQFEKMLIDKLNCLVLKDYPAEKVGGVLNPYQL
TDQFTSFAKMGTQSGFLFYVPAPYTSKIDPLTGFVDPFVWKTIKNHESRKHFLEG
FDFLHYDVKTGDFILHFKMNRNLSFQRGLPGFMPAWDIVFEKNETQFDAKGTPFI
AGKRIVPVIENHRFTGRYRDLYPANELIALLEEKGIVFRDGSNILPKLLENDDSH
AIDTMVALIRSVLQMRNSNAATGEDYINSPVRDLNGVCFDSRFQNPEWPMDADAN
GAYHIALKGQLLLNHLKESKDLKLQNGISNQDWLAYIQELRN*
SEQ MAVKSIKVKLRLDDMPEIRAGLWKLHKEVNAGVRYYTEWLSLLRQENLYRRSPNG
ID DGEQECDKTAEECKAELLERLRARQVENGHRGPAGSDDELLQLARQLYELLVPQA
NO: IGAKGDAQQIARKFLSPLADKDAVGGLGIAKAGNKPRWVRMREAGEPGWEEEKEK
13 AETRKSADRTADVLRALADFGLKPLMRVYTDSEMSSVEWKPLRKGQAVRTWDRDM
FQQAIERMMSWESWNQRVGQEYAKLVEQKNRFEQKNFVGQEHLVHLVNQLQQDMK
EASPGLESKEQTAHYVTGRALRGSDKVFEKWGKLAPDAPFDLYDAEIKNVQRRNT
RRFGSHDLFAKLAEPEYQALWREDASFLTRYAVYNSILRKLNHAKMFATFTLPDA
TAHPIWTRFDKLGGNLHQYTFLFNEFGERRHAIRFHKLLKVENGVAREVDDVTVP
ISMSEQLDNLLPRDPNEPIALYFRDYGAEQHFTGEFGGAKIQCRRDQLAHMHRRR
GARDVYLNVSVRVQSQSEARGERRPPYAAVFRLVGDNHRAFVHFDKLSDYLAEHP
DDGKLGSEGLLSGLRVMSVDLGLRTSASISVFRVARKDELKPNSKGRVPFFFPIK
GNDNLVAVHERSQLLKLPGETESKDLRAIREERQRTLRQLRTQLAYLRLLVRCGS
EDVGRRERSWAKLIEQPVDAANHMTPDWREAFENELQKLKSLHGICSDKEWMDAV
YESVRRVWRHMGKQVRDWRKDVRSGERPKIRGYAKDVVGGNSIEQIEYLERQYKF
LKSWSFFGKVSGQVIRAEKGSRFAITLREHIDHAKEDRLKKLADRIIMEALGYVY
ALDERGKGKWVAKYPPCQLILLEELSEYQFNNDRPPSENNQLMQWSHRGVFQELI
NQAQVHDLLVGTMYAAFSSRFDARTGAPGIRCRRVPARCTQEHNPEPFPWWLNKF
VVEHTLDACPLRADDLIPTGEGEIFVSPFSAEEGDFHQIHADLNAAQNLQQRLWS
DFDISQIRLRCDWGEVDGELVLIPRLTGKRTADSYSNKVFYTNTGVTYYERERGK
KRRKVFAQEKLSEEEAELLVEADEAREKSVVLMRDPSGIINRGNWTRQKEFWSMV
NQRIEGYLVKQIRSRVPLQDSACENTGDI*
SEQ MATRSFILKIEPNEEVKKGLWKTHEVLNHGIAYYMNILKLIRQEAIYEHHEQDPK
ID NPKKVSKAEIQAELWDFVLKMQKCNSFTHEVDKDVVFNILRELYEELVPSSVEKK
NO: GEANQLSNKFLYPLVDPNSQSGKGTASSGRKPRWYNLKIAGDPSWEEEKKKWEED
14 KKKDPLAKILGKLAEYGLIPLFIPFTDSNEPIVKEIKWMEKSRNQSVRRLDKDMF
IQALERFLSWESWNLKVKEEYEKVEKEHKTLEERIKEDIQAFKSLEQYEKERQEQ
LLRDTLNTNEYRLSKRGLRGWREIIQKWLKMDENEPSEKYLEVFKDYQRKHPREA
GDYSVYEFLSKKENHFIWRNHPEYPYLYATFCEIDKKKKDAKQQATFTLADPINH
PLWVRFEERSGSNLNKYRILTEQLHTEKLKKKLTVQLDRLIYPTESGGWEEKGKV
DIVLLPSRQFYNQIFLDIEEKGKHAFTYKDESIKFPLKGTLGGARVQFDRDHLRR
YPHKVESGNVGRIYFNMTVNIEPTESPVSKSLKIHRDDFPKFVNFKPKELTEWIK
DSKGKKLKSGIESLEIGLRVMSIDLGQRQAAAASIFEVVDQKPDIEGKLFFPIKG
TELYAVHRASFNIKLPGETLVKSREVLRKAREDNLKLMNQKLNFLRNVLHFQQFE
DITEREKRVTKWISRQENSDVPLVYQDELIQIRELMYKPYKDWVAFLKQLHKRLE
VEIGKEVKHWRKSLSDGRKGLYGISLKNIDEIDRTRKFLLRWSLRPTEPGEVRRL
EPGQRFAIDQLNHLNALKEDRLKKMANTIIMHALGYCYDVRKKKWQAKNPACQII
LFEDLSNYNPYEERSRFENSKLMKWSRREIPRQVALQGEIYGLQVGEVGAQFSSR
FHAKTGSPGIRCSVVTKEKLQDNRFFKNLQREGRLTLDKIAVLKEGDLYPDKGGE
KFISLSKDRKLVTTHADINAAQNLQKRFWTRTHGFYKVYCKAYQVDGQTVYIPES
KDQKQKIIEEFGEGYFILKDGVYEWGNAGKLKIKKGSSKQSSSELVDSDILKDSF
DLASELKGEKLMLYRDPSGNVFPSDKWMAAGVFFGKLERILISKLTNQYSISTIE
DDSSKQSM*
SEQ MPTRTINLKLVLGKNPENATLRRALFSTHRLVNQATKRIEEFLLLCRGEAYRTVD
ID NEGKEAEIPRHAVQEEALAFAKAAQRHNGCISTYEDQEILDVLRQLYERLVPSVN
NO: ENNEAGDAQAANAWVSPLMSAESEGGLSVYDKVLDPPPVWMKLKEEKAPGWEAAS
15 QIWIQSDEGQSLLNKPGSPPRWIRKLRSGQPWQDDFVSDQKKKQDELTKGNAPLI
KQLKEMGLLPLVNPFFRHLLDPEGKGVSPWDRLAVRAAVAHFISWESWNHRTRAE
YNSLKLRRDEFEAASDEFKDDFTLLRQYEAKRHSTLKSIALADDSNPYRIGVRSL
RAWNRVREEWIDKGATEEQRVTILSKLQTQLRGKFGDPDLFNWLAQDRHVHLWSP
RDSVTPLVRINAVDKVLRRRKPYALMTFAHPRFHPRWILYEAPGGSNLRQYALDC
TENALHITLPLLVDDAHGTWIEKKIRVPLAPSGQIQDLTLEKLEKKKNRLYYRSG
FQQFAGLAGGAEVLFHRPYMEHDERSEESLLERPGAVWFKLTLDVATQAPPNWLD
GKGRVRTPPEVHHFKTALSNKSKHTRTLQPGLRVLSVDLGMRTFASCSVFELIEG
KPETGRAFPVADERSMDSPNKLWAKHERSFKLTLPGETPSRKEEEERSIARAEIY
ALKRDIQRLKSLLRLGEEDNDNRRDALLEQFFKGWGEEDVVPGQAFPRSLFQGLG
AAPFRSTPELWRQHCQTYYDKAEACLAKHISDWRKRTRPRPTSREMWYKTRSYHG
GKSIWMLEYLDAVRKLLLSWSLRGRTYGAINRQDTARFGSLASRLLHHINSLKED
RIKTGADSIVQAARGYIPLPHGKGWEQRYEPCQLILFEDLARYRFRVDRPRRENS
QLMQWNHRAIVAETTMQAELYGQIVENTAAGFSSRFHAATGAPGVRCRFLLERDF
DNDLPKPYLLRELSWMLGNTKVESEEEKLRLLSEKIRPGSLVPWDGGEQFATLHP
KRQTLCVIHADMNAAQNLQRRFFGRCGEAFRLVCQPHGDDVLRLASTPGARLLGA
LQQLENGQGAFELVRDMGSTSQMNRFVMKSLGKKKIKPLQDNNGDDELEDVLSVL
PEEDDTGRITVFRDSSGIFFPCNVWIPAKQFWPAVRAMIWKVMASHSLG*
SEQ MTKLRHRQKKLTHDWAGSKKREVLGSNGKLQNPLLMPVKKGQVTEFRKAFSAYAR
ID ATKGEMTDGRKNMFTHSFEPFKTKPSLHQCELADKAYQSLHSYLPGSLAHFLLSA
NO: HALGFRIFSKSGEATAFQASSKIEAYESKLASELACVDLSIQNLTISTLFNALTT
16 SVRGKGEETSADPLIARFYTLLTGKPLSRDTQGPERDLAEVISRKIASSFGTWKE
MTANPLQSLQFFEEELHALDANVSLSPAFDVLIKMNDLQGDLKNRTIVFDPDAPV
FEYNAEDPADIIIKLTARYAKEAVIKNQNVGNYVKNAITTTNANGLGWLLNKGLS
LLPVSTDDELLEFIGVERSHPSCHALIELIAQLEAPELFEKNVFSDTRSEVQGMI
DSAVSNHIARLSSSRNSLSMDSEELERLIKSFQIHTPHCSLFIGAQSLSQQLESL
PEALQSGVNSADILLGSTQYMLTNSLVEESIATYQRTLNRINYLSGVAGQINGAI
KRKAIDGEKIHLPAAWSELISLPFIGQPVIDVESDLAHLKNQYQTLSNEFDTLIS
ALQKNFDLNFNKALLNRTQHFEAMCRSTKKNALSKPEIVSYRDLLARLTSCLYRG
SLVLRRAGIEVLKKHKIFESNSELREHVHERKHFVFVSPLDRKAKKLLRLTDSRP
DLLHVIDEILQHDNLENKDRESLWLVRSGYLLAGLPDQLSSSFINLPIITQKGDR
RLIDLIQYDQINRDAFVMLVTSAFKSNLSGLQYRANKQSFVVTRTLSPYLGSKLV
YVPKDKDWLVPSQMFEGRFADILQSDYMVWKDAGRLCVIDTAKHLSNIKKSVFSS
EEVLAFLRELPHRTFIQTEVRGLGVNVDGIAFNNGDIPSLKTFSNCVQVKVSRTN
TSLVQTLNRWFEGGKVSPPSIQFERAYYKKDDQIHEDAAKRKIRFQMPATELVHA
SDDAGWTPSYLLGIDPGEYGMGLSLVSINNGEVLDSGFIHINSLINFASKKSNHQ
TKVVPRQQYKSPYANYLEQSKDSAAGDIAHILDRLIYKLNALPVFEALSGNSQSA
ADQVWTKVLSFYTWGDNDAQNSIRKQHWFGASHWDIKGMLRQPPTEKKPKPYIAF
PGSQVSSYGNSQRCSCCGRNPIEQLREMAKDTSIKELKIRNSEIQLFDGTIKLFN
PDPSTVIERRRHNLGPSRIPVADRTFKNISPSSLEFKELITIVSRSIRHSPEFIA
KKRGIGSEYFCAYSDCNSSLNSEANAAANVAQKFQKQLFFEL*
SEQ MKRILNSLKVAALRLLFRGKGSELVKTVKYPLVSPVQGAVEELAEAIRHDNLHLF
ID GQKEIVDLMEKDEGTQVYSVVDFWLDTLRLGMFFSPSANALKITLGKFNSDQVSP
NO: FRKVLEQSPFFLAGRLKVEPAERILSVEIRKIGKRENRVENYAADVETCFIGQLS
17 SDEKQSIQKLANDIWDSKDHEEQRMLKADFFAIPLIKDPKAVTEEDPENETAGKQ
KPLELCVCLVPELYTRGFGSIADFLVQRLTLLRDKMSTDTAEDCLEYVGIEEEKG
NGMNSLLGTFLKNLQGDGFEQIFQFMLGSYVGWQGKEDVLRERLDLLAEKVKRLP
KPKFAGEWSGHRMFLHGQLKSWSSNFFRLFNETRELLESIKSDIQHATMLISYVE
EKGGYHPQLLSQYRKLMEQLPALRTKVLDPEIEMTHMSEAVRSYIMIHKSVAGFL
PDLLESLDRDKDREFLLSIFPRIPKIDKKTKEIVAWELPGEPEEGYLFTANNLFR
NFLENPKHVPRFMAERIPEDWTRLRSAPVWFDGMVKQWQKVVNQLVESPGALYQF
NESFLRQRLQAMLTVYKRDLQTEKFLKLLADVCRPLVDFFGLGGNDIIFKSCQDP
RKQWQTVIPLSVPADVYTACEGLAIRLRETLGFEWKNLKGHEREDFLRLHQLLGN
LLFWIRDAKLVVKLEDWMNNPCVQEYVEARKAIDLPLEIFGFEVPIFLNGYLFSE
LRQLELLLRRKSVMTSYSVKTTGSPNRLFQLVYLPLNPSDPEKKNSNNFQERLDT
PTGLSRRFLDLTLDAFAGKLLTDPVTQELKTMAGFYDHLFGFKLPCKLAAMSNHP
GSSSKMVVLAKPKKGVASNIGFEPIPDPAHPVFRVRSSWPELKYLEGLLYLPEDT
PLTIELAETSVSCQSVSSVAFDLKNLTTILGRVGEFRVTADQPFKLTPIIPEKEE
SFIGKTYLGLDAGERSGVGFAIVTVDGDGYEVQRLGVHEDTQLMALQQVASKSLK
EPVFQPLRKGTFRQQERIRKSLRGCYWNFYHALMIKYRAKVVHEESVGSSGLVGQ
WLRAFQKDLKKADVLPKKGGKNGVDKKKRESSAQDTLWGGAFSKKEEQQIAFEVQ
AAGSSQFCLKCGWWFQLGMREVNRVQESGVVLDWNRSIVTFLIESSGEKVYGFSP
QQLEKGFRPDIETFKKMVRDFMRPPMFDRKGRPAAAYERFVLGRRHRRYRFDKVF
EERFGRSALFICPRVGCGNFDHSSEQSAVVLALIGYIADKEGMSGKKLVYVRLAE
LMAEWKLKKLERSRVEEQSSAQ*
SEQ MAESKQMQCRKCGASMKYEVIGLGKKSCRYMCPDCGNHTSARKIQNKKKRDKKYG
ID SASKAQSQRIAVAGALYPDKKVQTIKTYKYPADLNGEVHDSGVAEKIAQAIQEDE
NO: IGLLGPSSEYACWIASQKQSEPYSVVDFWFDAVCAGGVFAYSGARLLSTVLQLSG
18 EESVLRAALASSPFVDDINLAQAEKFLAVSRRTGQDKLGKRIGECFAEGRLEALG
IKDRMREFVQAIDVAQTAGQRFAAKLKIFGISQMPEAKQWNNDSGLTVCILPDYY
VPEENRADQLVVLLRRLREIAYCMGIEDEAGFEHLGIDPGALSNFSNGNPKRGFL
GRLLNNDIIALANNMSAMTPYWEGRKGELIERLAWLKHRAEGLYLKEPHFGNSWA
DHRSRIFSRIAGWLSGCAGKLKIAKDQISGVRTDLFLLKRLLDAVPQSAPSPDFI
ASISALDRFLEAAESSQDPAEQVRALYAFHLNAPAVRSIANKAVQRSDSQEWLIK
ELDAVDHLEFNKAFPFFSDTGKKKKKGANSNGAPSEEEYTETESIQQPEDAEQEV
NGQEGNGASKNQKKFQRIPRFFGEGSRSEYRILTEAPQYFDMFCNNMRAIFMQLE
SQPRKAPRDFKCFLQNRLQKLYKQTFLNARSNKCRALLESVLISWGEFYTYGANE
KKFRLRHEASERSSDPDYVVQQALEIARRLFLFGFEWRDCSAGERVDLVEIHKKA
ISFLLAITQAEVSVGSYNWLGNSTVSRYLSVAGTDTLYGTQLEEFLNATVLSQMR
GLAIRLSSQELKDGFDVQLESSCQDNLQHLLVYRASRDLAACKRATCPAELDPKI
LVLPVGAFIASVMKMIERGDEPLAGAYLRHRPHSFGWQIRVRGVAEVGMDQGTAL
AFQKPTESEPFKIKPFSAQYGPVLWLNSSSYSQSQYLDGFLSQPKNWSMRVLPQA
GSVRVEQRVALIWNLQAGKMRLERSGARAFFMPVPFSFRPSGSGDEAVLAPNRYL
GLFPHSGGIEYAVVDVLDSAGFKILERGTIAVNGFSQKRGERQEEAHREKQRRGI
SDIGRKKPVQAEVDAANELHRKYTDVATRLGCRIVVQWAPQPKPGTAPTAQTVYA
RAVRTEAPRSGNQEDHARMKSSWGYTWGTYWEKRKPEDILGISTQVYWTGGIGES
CPAVAVALLGHIRATSTQTEWEKEEVVFGRLKKFFPS*
SEQ MEKRINKIRKKLSADNATKPVSRSGPMKTLLVRVMTDDLKKRLEKRRKKPEVMPQ
ID VISNNAANNLRMLLDDYTKMKEAILQVYWQEFKDDHVGLMCKFAQPASKKIDQNK
NO: LKPEMDEKGNLTTAGFACSQCGQPLFVYKLEQVSEKGKAYTNYFGRCNVAEHEKL
19 ILLAQLKPEKDSDEAVTYSLGKFGQRALDFYSIHVTKESTHPVKPLAQIAGNRYA
SGPVGKALSDACMGTIASFLSKYQDIIIEHQKVVKGNQKRLESLRELAGKENLEY
PSVTLPPQPHTKEGVDAYNEVIARVRMWVNLNLWQKLKLSRDDAKPLLRLKGFPS
FPVVERRENEVDWWNTINEVKKLIDAKRDMGRVFWSGVTAEKRNTILEGYNYLPN
ENDHKKREGSLENPKKPAKRQFGDLLLYLEKKYAGDWGKVFDEAWERIDKKIAGL
TSHIEREEARNAEDAQSKAVLTDWLRAKASFVLERLKEMDEKEFYACEIQLQKWY
GDLRGNPFAVEAENRVVDISGFSIGSDGHSIQYRNLLAWKYLENGKREFYLLMNY
GKKGRIRFTDGTDIKKSGKWQGLLYGGGKAKVIDLTFDPDDEQLIILPLAFGTRQ
GREFIWNDLLSLETGLIKLANGRVIEKTIYNKKIGRDEPALFVALTFERREVVDP
SNIKPVNLIGVDRGENIPAVIALTDPEGCPLPEFKDSSGGPTDILRIGEGYKEKQ
RAIQAAKEVEQRRAGGYSRKFASKSRNLADDMVRNSARDLFYHAVTHDAVLVFEN
LSRGFGRQGKRTFMTERQYTKMEDWLTAKLAYEGLTSKTYLSKTLAQYTSKTCSN
CGFTITTADYDGMLVRLKKTSDGWATTLNNKELKAEGQITYYNRYKRQTVEKELS
AELDRLSEESGNNDISKWTKGRRDEALFLLKKRFSHRPVQEQFVCLDCGHEVHAD
EQAALNIARSWLFLNSNSTEFKSYKSGKQPFVGAWQAFYKRRLKEVWKPNA
SEQ MKRINKIRRRLVKDSNTKKAGKTGPMKTLLVRVMTPDLRERLENLRKKPENIPQP
ID ISNTSRANLNKLLTDYTEMKKAILHVYWEEFQKDPVGLMSRVAQPAPKNIDQRKL
NO: IPVKDGNERLTSSGFACSQCCQPLYVYKLEQVNDKGKPHTNYFGRCNVSEHERLI
20 LLSPHKPEANDELVTYSLGKFGQRALDFYSIHVTRESNHPVKPLEQIGGNSCASG
PVGKALSDACMGAVASFLTKYQDIILEHQKVIKKNEKRLANLKDIASANGLAFPK
ITLPPQPHTKEGIEAYNNVVAQIVIWVNLNLWQKLKIGRDEAKPLQRLKGFPSFP
LVERQANEVDWWDMVCNVKKLINEKKEDGKVFWQNLAGYKRQEALLPYLSSEEDR
KKGKKFARYQFGDLLLHLEKKHGEDWGKVYDEAWERIDKKVEGLSKHIKLEEERR
SEDAQSKAALTDWLRAKASFVIEGLKEADKDEFCRCELKLQKWYGDLRGKPFAIE
AENSILDISGFSKQYNCAFIWQKDGVKKLNLYLIINYFKGGKLRFKKIKPEAFEA
NRFYTVINKKSGEIVPMEVNFNFDDPNLIILPLAFGKRQGREFIWNDLLSLETGS
LKLANGRVIEKTLYNRRTRQDEPALFVALTFERREVLDSSNIKPMNLIGIDRGEN
IPAVIALTDPEGCPLSRFKDSLGNPTHILRIGESYKEKQRTIQAAKEVEQRRAGG
YSRKYASKAKNLADDMVRNTARDLLYYAVTQDAMLIFENLSRGFGRQGKRTFMAE
RQYTRMEDWLTAKLAYEGLPSKTYLSKTLAQYTSKTCSNCGFTITSADYDRVLEK
LKKTATGWMTTINGKELKVEGQITYYNRYKRQNVVKDLSVELDRLSEESVNNDIS
SWTKGRSGEALSLLKKRFSHRPVQEKFVCLNCGFETHADEQAALNIARSWLFLRS
QEYKKYQTNKTTGNTDKRAFVETWQSFYRKKLKEVWKP
SEQ atgGGAAAAATGTATTATCTTGGTCTGGATATAGGAACAAATTCTGTTGGATATG
ID CCGTAACCGACCCATCGTACCATTTGCTCAAATTTAAAGGCGAACCGATGTGGGG
NO: TGCCCACGTGTTTGCTGCGGGGAATCAATCAGCTGAACGGAGAAGCTTTCGTACG
21 AGCCGCAGACGCCTTGACCGCAGGCAACAGCGTGTCAAACTGGTTCAAGAAATCT
TTGCTCCCGTGATTAGTCCCATTGATCCACGTTTTTTTATCAGACTTCATGAGAG
CGCTTTATGGCGGGATGATGTGGCTGAAACGGATAAACATATTTTCTTTAATGAC
CCGACCTATACGGATAAGGAATATTATTCTGACTATCCAACCATCCATCATCTCA
TTGTGGACCTTATGGAAAGCAGTGAAAAGCATGACCCGCGGCTTGTTTATTTGGC
TGTTGCCTGGCTGGTTGCTCATCGTGGTCATTTCCTCAATGAAGTGGATAAGGAT
AATATTGGGGATGTCCTGAGTTTTGACGCCTTTTATCCTGAGTTTCTGGCATTTC
TTTCCGATAATGGGGTGTCACCTTGGGTATGTGAGTCAAAAGCACTCCAAGCGAC
CCTGCTTTCACGAAACTCCGTCAACGATAAGTATAAAGCCTTGAAGTCTCTGATC
TTTGGCAGCCAAAAGCCGGAGGATAATTTTGATGCCAATATCAGTGAAGATGGAC
TTATCCAACTTTTAGCAGGAAAAAAGGTCAAGGTCAATAAACTTTTTCCTCAAGA
AAGTAATGATGCTTCCTTTACACTCAATGATAAGGAAGATGCAATTGAGGAAATC
TTAGGAACGCTTACACCGGATGAGTGTGAATGGATTGCGCATATTAGGAGGCTGT
TTGATTGGGCCATCATGAAACATGCTCTCAAAGATGGCAGAACAATCTCCGAATC
GAAAGTAAAGCTCTATGAACAGCATCACCATGACTTGACACAGCTCAAGTATTTT
GTGAAGACCTATCTAGCAAAGGAATATGATGACATTTTTCGAAACGTAGATAGTG
AAACAACCAAAAACTATGTCGCATATTCCTATCATGTAAAAGAAGTCAAGGGTAC
ATTGCCCAAAAATAAGGCAACCCAAGAAGAATTTTGCAAGTATGTCCTTGGAAAG
GTAAAGAACATCGAATGCAGTGAAGCTGATAAGGTTGATTTTGATGAAATGATTC
AGCGTCTTACAGACAATTCCTTTATGCCGAAACAAGTATCAGGTGAAAACAGGGT
TATCCCTTACCAGCTTTACTATTATGAACTAAAGACTATTTTGAATAAAGCCGCT
TCTTATCTGCCTTTTTTGACCCAATGCGGAAAAGATGCCATCTCCAATCAAGATA
AGCTCCTTTCCATCATGACCTTTCGGATTCCGTATTTCGTTGGGCCCTTGCGCAA
GGACAATTCAGAGCATGCCTGGCTGGAACGAAAAGCAGGGAAAATCTATCCGTGG
AATTTTAACGACAAAGTTGACCTTGATAAAAGTGAAGAAGCGTTCATTCGGAGAA
TGACGAATACCTGCACTTATTATCCCGGTGAAGATGTTTTGCCACTTGACTCCCT
TATTTATGAAAAATTCATGATCCTCAATGAAATCAATAATATCCGAATTGATGGT
TATCCTATTTCTGTAGATGTAAAACAGCAGGTTTTTGGCCTCTTTGAAAAGAAGA
GAAGAGTGACCGTAAAGGATATCCAGAATCTCCTGCTTTCCTTGGGTGCCTTGGA
TAAGCATGGTAAATTGACGGGAATCGATACTACCATCCATAGCAATTACAATACA
TACCATCATTTTAAATCGCTCATGGAGCGTGGCGTTCTTACTCGTGATGATGTGG
AACGCATTGTGGAGCGTATGACCTATAGTGATGATACAAAACGCGTCCGTCTTTG
GCTGAACAATAATTATGGAACGCTCACTGCTGACGACGTAAAGCATATTTCAAGG
CTCCGAAAGCATGATTTTGGCCGGCTTTCCAAAATGTTCCTCACAGGCCTAAAGG
GAGTTCATAAGGAAACGGGGGAACGAGCTTCCATTTTGGATTTTATGTGGAATAC
CAATGATAACTTGATGCAGCTTTTATCTGAATGTTATACTTTTTCGGATGAAATT
ACCAAGCTGCAGGAAGCATACTATGCCAAGGCGCAGCTTTCCCTGAATGATTTTC
TGGACTCCATGTATATTTCAAATGCTGTCAAACGTCCTATCTATCGAACTCTTGC
CGTTGTAAATGACATACGCAAAGCCTGTGGGACGGCGCCAAAACGCATTTTTATC
GAAATGGCAAGAGATGGGGAAAGCAAAAAGAAAAGGAGCGTAACAAGAAGAGAAC
AAATCAAGAATCTTTATAGGTCCATCCGCAAGGATTTTCAGCAGGAGGTAGATTT
CCTTGAAAAAATCCTTGAAAACAAAAGCGATGGACAGCTGCAAAGCGATGCGCTC
TATCTATACTTTGCGCAGCTTGGAAGGGATATGTATACCGGGGACCCTATCAAGT
TGGAGCATATCAAGGACCAGTCCTTCTATAATATTGATCATATCTATCCCCAAAG
CATGGTCAAGGACGATAGTCTTGATAACAAGGTGTTGGTTCAATCGGAAATTAAT
GGAGAGAAGAGCAGTCGATATCCTCTTGATGCTGCTATCCGTAATAAAATGAAGC
CTCTTTGGGATGCTTATTATAACCATGGCCTGATTTCCCTCAAGAAGTATCAGCG
TTTGACGCGGAGCACTCCCTTTACAGATGATGAAAAGTGGGATTTCATCAATCGG
CAGCTTGTTGAGACAAGACAATCCACGAAGGCCTTGGCAATCTTACTAAAAAGGA
AGTTCCCTGATACGGAGATTGTCTACTCCAAGGCAGGGCTTTCTTCTGATTTTCG
GCATGAGTTTGGTCTCGTAAAATCGAGGAATATCAATGACCTGCACCATGCAAAG
GACGCATTTCTTGCGATTGTAACAGGAAATGTCTATCATGAACGCTTTAATCGCC
GGTGGTTTATGGTGAACCAGCCCTATTCCGTCAAGACCAAGACGTTGTTTACGCA
TTCTATTAAAAATGGTAATTTTGTAGCTTGGAATGGAGAAGAGGATCTTGGCCGC
ATTGTTAAAATGTTAAAGCAAAATAAGAACACTATTCATTTCACGCGGTTCTCTT
TTGATCGAAAGGAAGGCCTGTTTGATATTCAGCCACTAAAAGCGTCAACCGGTCT
TGTACCAAGAAAAGCCGGACTAGACGTGGTAAAATATGGTGGCTATGACAAATCG
ACAGCAGCTTATTATCTCCTTGTTCGATTTACACTAGAAGATAAAAAGACTCAAC
ATAAATTGATGATGATTCCTGTAGAAGGCTTGTATAAAGCTCGAATTGACCATGA
TAAGGAATTCTTAACGGACTATGCACAAACTACAATCAGTGAAATCCTACAAAAA
GATAAACAAAAGGTGATAAATATAATGTTTCCAATGGGAACAAGGCACATTAAAC
TGAATTCCATGATTTCAATCGATGGTTTTTATCTTTCCATTGGAGGAAAGTCTAG
TAAGGGAAAATCGGTGTTGTGTCATGCTATGGTACCTCTTATTGTACCTCATAAG
ATAGAATGTTATATTAAGGCGATGGAGTCTTTTGCACGTAAATTTAAAGAAAATA
ATAAATTAAGGATTGTGGAAAAGTTTGATAAGATTACGGTGGAAGATAACTTGAA
CCTATACGAACTATTTTTACAAAAACTTCAACATAACCCATATAATAAGTTCTTC
TCCACACAATTTGATGTGCTGACTAATGGAAGAAGTACATTTACTAAATTATCTC
CAGAGGAACAAGTTCAAACGTTATTGAATATCTTATCAATTTTTAAAACTTGTCG
GAGCTCTGGCTGCGATTTAAAATCCATTAACGGTTCTGCTCAAGCTGCCAGAATT
ATGATCAGCGCAGATTTAACTGGACTCTCAAAAAAATATTCCGATATTCGGCTTG
TTGAGCAATCAGCATCTGGACTTTTTGTTAGTAAATCACAAAATCTTTTGGAGTA
TTTAtga
SEQ atgtcttcattaacaaaatttacaaataaatacagtaagcagctaaccataaaaa
ID atgaactcatcccagtaggaaagactctcgagaacattaaggaaaacggtctcat
NO: agatggagatgaacagctaaacgagaattatcaaaaagcaaagataatcgttgat
22 gattttctacgagatttcataaataaagctttaaataatacccaaataggaaatt
ggagagaattagcagatgctttaaataaagaagatgaagataacatagaaaagct
ccaagacaaaatcagaggaataattgtaagtaaattcgagacatttgatttgttt
tcttcttactcgataaagaaagacgaaaagataatagatgatgataatgatgttg
aagaagaggagctagatctaggaaaaaaaacttcctcatttaaatatatttttaa
gaaaaacctttttaaattagtacttccttcttatttaaagacaacaaatcaggat
aaactgaaaataatctcttcttttgataatttttctacctatttcagaggattct
ttgagaacagaaaaaatattttcactaagaagcctatatctacgtcaattgccta
cagaattgtccatgataactttccaaagtttctagataacatcagatgttttaat
gtgtggcaaacagaatgcccacagttaattgtaaaggctgataattatttaaaat
caaagaacgtcatagctaaagataaatctttagcaaactattttactgtaggagc
atatgattacttcttatcccagaatggcattgatttctacaacaacattatcggc
ggtctaccagcatttgctggtcatgagaaaatccaaggacttaatgaatttataa
atcaagaatgccaaaaggacagcgaactaaaatctaaactgaaaaacagacatgc
tttcaaaatggctgttctatttaagcaaattctttcagatagagaaaaaagtttt
gttatagacgagttcgaatctgatgctcaggtcatagatgcggttaagaacttct
atgcagaacaatgtaaggataataatgttatttttaaccttctaaatcttatcaa
gaatatagcgttcttatctgatgatgaattagatggaatttttatagaaggcaag
tatttaagctctgtttcccaaaagctatattcagattggtcgaagcttcgaaatg
atattgaagatagtgcaaacagtaaacaaggaaataaagagttagcaaagaaaat
taaaacaaataaaggcgatgttgaaaaggccataagtaaatatgagttttcttta
tcagaacttaactcaattgtacatgataatacaaaattcagtgaccttctttctt
gtacgttacataaagtggctagcgaaaaactagtgaaagttaatgaaggggactg
gccaaaacacctgaaaaataatgaagaaaaacaaaagataaaagagcctttagat
gcattgttagaaatttataatacattgctgatattcaactgcaagtcatttaata
agaacggtaatttctatgttgattatgacagatgcataaatgagctttctagtgt
tgtttatttatataacaaaacaagaaattactgtacaaagaaaccttataacaca
gacaaattcaaattaaactttaacagtcctcaattaggagagggctttagtaagt
cgaaagaaaatgactgtctgacattattatttaaaaaagacgacaattactatgt
tggaattatcagaaaaggggcaaaaattaactttgatgatacacaagccattgca
gacaatacagataactgtatatttaagatgaattatttcctattaaaagatgcta
aaaagtttattcctaaatgttcaattcagttaaaagaagtaaaagcacattttaa
aaaatcagaggatgattatatcctgagtgacaaagaaaaatttgcctctcccctt
gttattaagaaatcaacatttttattagcaacagcacatgtaaaaggaaagaaag
gaaacataaaaaaattccaaaaggaatattctaaggaaaatccaacagaatatag
aaattctctgaatgaatggattgcattttgtaaagaatttctaaaaacatataag
gcggcaacaatctttgacattacaacgttaaaaaaagctgaagaatatgctgata
ttgttgagttttataaggatgtagataatctttgttataaactagagttttgccc
tattaaaacatctttcattgagaatcttattgataatggggacttatatttattc
agaatcaataataaagatttcagttcaaaatctactggtacaaagaatcttcata
cgctctatcttcaggcaatctttgatgaaagaaacctcaataatcctactattat
gttaaatggcggagcagagttattttatcgaaaagaaagcattgaacagaaaaat
aggataactcataaggcaggatcaattcttgtaaacaaggtttgtaaggatggaa
caagtctagatgacaaaatcagaaacgaaatatatcaatatgaaaacaagtttat
tgatacattgtctgatgaagctaaaaaagttttacctaatgtaataaaaaaagaa
gcaactcacgacataacaaaagataagcgatttacatcagataagttctttttcc
attgcccattaacaattaactataaggaaggagatacaaaacaatttaacaatga
ggttttatctttccttagaggtaatccagacattaatatcatcggaattgacaga
ggagaaagaaaccttatatacgtaactgttattaatcagaaaggcgaaatacttg
acagcgtttcgtttaacacagtaacaaacaagtcgagcaaaattgaacaaactgt
tgattatgaggaaaagcttgctgttagggaaaaagaaagaatagaagcaaaaaga
tcctgggattcaatatcaaagatagcaaccttaaaagaaggttatctatcagcta
ttgttcatgagatatgcctactgatgatcaaacacaacgcaatcgttgtacttga
gaatctaaatgcaggatttaagagaattagaggaggattatcagaaaagtctgtt
tatcagaaattcgagaagatgcttattaacaaactaaattactttgtatctaaaa
aagaatcagactggaataaacctagtggacttttaaatggtttacaactttcaga
ccagttcgagtcatttgagaaattaggaattcaatctgggttcatcttctatgtt
cctgcagcatatacatctaagattgatcctacaacaggatttgcaaatgttctta
acttatccaaggtaagaaatgttgatgcaataaagagttttttcagtaatttcaa
tgaaatttcatatagcaaaaaagaagctctctttaaattctcttttgatttagat
tccttatcaaagaagggcttcagctcatttgtaaaattcagtaaatctaaatgga
atgtatatacatttggagagagaataataaaaccaaagaataagcaagggtatcg
tgaagataagagaattaatttaacatttgaaatgaaaaaacttctgaatgaatat
aaagtaagttttgatcttgaaaacaacttaattccaaatctaacctctgcaaatc
tgaaagataccttctggaaagaactattctttatttttaaaacaactctgcagct
tagaaacagtgtaacaaatggcaaagaagatgtactgatttctccagtaaagaac
gctaaaggagagttctttgtatcaggaactcataacaagacattacctcaagact
gtgatgcaaatggagcatatcatatcgccctaaaaggtctgatgattcttgaacg
taacaatcttgttagagaagaaaaagacacaaagaagataatggcaatttctaat
gttgactggtttgagtatgttcaaaaaaggagaggtgtcctgtaa
SEQ ATGAACAACTATGATGAGTTTACCAAACTGTACCCAATACAGAAAACGATAAGGT
ID TCGAATTGAAGCCGCAGGGAAGAACGATGGAACACCTCGAAACATTCAACTTTTT
NO: CGAAGAGGACAGGGATAGAGCGGAGAAATATAAGATTTTAAAGGAAGCAATCGAC
23 GAGTATCATAAGAAGTTTATAGACGAACATCTAACAAATATGTCTCTTGACTGGA
ATTCTTTAAAACAGATTTCAGAGAAATACTATAAGAGTAGAGAGGAAAAAGACAA
GAAAGTTTTTCTGTCAGAACAGAAACGCATGAGGCAAGAGATAGTTTCTGAGTTC
AAAAAAGACGATCGGTTTAAAGATCTTTTTTCAAAAAAATTGTTTTCTGAACTTC
TCAAGGAAGAGATTTACAAAAAAGGAAACCATCAGGAAATTGACGCATTGAAAAG
TTTTGATAAATTCTCAGGCTATTTTATTGGGTTGCATGAGAACCGAAAAAATATG
TATTCTGACGGAGACGAGATCACGGCTATCTCTAACCGTATTGTAAATGAGAATT
TCCCGAAGTTCCTCGACAACCTTCAGAAATATCAGGAAGCTCGTAAAAAATATCC
AGAGTGGATCATTAAGGCAGAATCTGCTTTAGTTGCACATAATATCAAGATGGAT
GAAGTCTTTTCCTTAGAGTATTTCAACAAAGTCCTGAATCAAGAAGGAATACAGA
GATACAATCTCGCCCTAGGTGGCTATGTGACCAAAAGTGGTGAGAAAATGATGGG
GCTTAATGATGCACTTAATCTTGCCCATCAAAGTGAAAAAAGCAGCAAGGGAAGG
ATACACATGACTCCACTCTTCAAACAGATTCTGAGTGAAAAAGAGTCCTTTTCTT
ATATACCAGATGTTTTTACAGAAGACTCTCAACTTTTACCATCCATTGGTGGGTT
CTTTGCACAAATAGAAAATGATAAGGACGGGAATATTTTTGACAGAGCATTAGAA
TTGATATCTTCTTATGCAGAATACGATACAGAAAGGATATATATCAGGCAAGCGG
ACATAAACAGAGTTTCTAATGTTATTTTCGGGGAGTGGGGAACACTGGGGGGGTT
AATGAGGGAATACAAAGCAGACTCTATCAACGACATCAATTTGGAGAGAACATGC
AAGAAGGTAGACAAGTGGCTCGACTCAAAGGAGTTTGCGTTATCAGATGTATTAG
AGGCAATAAAAAGAACCGGCAATAATGATGCTTTTAATGAATATATCTCAAAGAT
GCGCACTGCCAGGGAAAAGATTGACGCTGCAAGAAAGGAAATGAAATTCATTTCG
GAAAAAATATCTGGAGACGAAGAATCGATCCATATTATCAAAACCTTATTGGACT
CGGTGCAACAGTTTTTACATTTTTTCAATTTATTCAAAGCGCGTCAGGACATTCC
TCTTGATGGAGCATTCTATGCGGAGTTCGATGAAGTCCATAGCAAACTGTTTGCT
ATTGTTCCGTTGTATAATAAGGTTAGGAACTATCTTACGAAAAATAACCTTAACA
CGAAAAAGATAAAGCTAAACTTCAAGAATCCAACTCTGGCAAACGGATGGGATCA
AAACAAGGTATATGACTACGCCTCCTTAATCTTTCTCCGCGATGGTAATTATTAT
CTCGGAATAATAAATCCAAAAAGGAAAAAGAATATTAAATTCGAACAAGGGTCTG
GAAATGGCCCATTCTACCGGAAGATGGTGTACAAACAAATTCCAGGGCCGAACAA
GAACTTACCAAGAGTCTTCCTCACATCTACGAAAGGCAAAAAAGAGTACAAGCCG
TCAAAGGAGATAATAGAAGGATATGAAGCGGACAAACACATAAGAGGAGATAAAT
TCGATCTGGATTTCTGTCATAAGCTGATAGACTTCTTCAAGGAATCCATCGAGAA
GCACAAGGACTGGAGTAAGTTCAACTTCTATTTCTCTCCAACTGAATCATATGGA
GACATCAGCGAATTCTATCTGGATGTAGAAAAACAGGGATACCGGATGCATTTTG
AGAATATTTCTGCCGAGACGATTGATGAGTATGTCGAAAAGGGGGACTTATTCCT
CTTCCAGATATACAACAAAGACTTTGTGAAAGCGGCAACCGGAAAAAAAGATATG
CACACCATTTATTGGAACGCGGCATTCTCGCCCGAGAACCTTCAGGATGTGGTAG
TGAAACTGAACGGTGAAGCAGAACTTTTCTACAGAGACAAGAGCGACATCAAGGA
GATAGTTCACAGGGAGGGAGAGATACTGGTCAATCGTACCTACAACGGCAGGACA
CCTGTGCCTGACAAGATCCACAAAAAATTAACAGATTATCATAATGGCCGTACCA
AAGATCTCGGAGAAGCAAAAGAATACCTCGATAAGGTCAGATATTTCAAAGCGCA
CTACGACATCACAAAGGATCGCAGATACCTGAATGATAAAATATACTTCCATGTG
CCTCTGACATTGAATTTCAAAGCAAACGGGAAGAAGAATCTCAATAAGATGGTAA
TTGAAAAGTTCCTCTCGGACGAAAAAGCGCATATTATTGGGATTGATCGCGGGGA
AAGGAATCTTCTTTACTATTCTATCATTGACAGGTCAGGTAAAATAATCGATCAA
CAGAGCCTCAACGTCATCGATGGATTCGATTACCGAGAGAAACTGAATCAGAGGG
AGATCGAGATGAAGGATGCCAGACAAAGCTGGAATGCTATCGGGAAGATAAAGGA
CCTCAAGGAAGGGTATCTTTCAAAAGCGGTCCACGAAATTACCAAGATGGCGATA
CAATACAATGCCATTGTTGTCATGGAGGAACTCAATTATGGGTTCAAACGCGGAC
GTTTCAAAGTTGAGAAGCAGATATATCAGAAATTCGAGAATATGCTGATTGACAA
GATGAATTATCTGGTATTCAAGGATGCTCCGGATGAAAGTCCGGGAGGAGTCCTC
AATGCATATCAGCTTACTAATCCGCTTGAAAGTTTCGCTAAACTTGGGAAACAGA
CAGGAATTCTTTTCTATGTTCCGGCAGCCTATACTTCGAAGATAGATCCGACGAC
CGGGTTTGTCAATCTTTTCAATACTTCAAGTAAAACGAACGCACAGGAAAGAAAA
GAATTCTTGCAAAAATTCGAGTCGATCTCCTATTCCGCTAAAGACGGAGGAATAT
TCGCATTCGCGTTCGATTATCGGAAGTTCGGAACGTCAAAAACAGACCACAAAAA
TGTATGGACCGCATACACGAACGGGGAAAGGATGAGGTACATAAAAGAGAAAAAA
CGCAACGAACTGTTCGACCCCTCGAAGGAGATCAAAGAGGCTCTCACTTCATCAG
GAATCAAATATGACGGCGGACAGAACATATTGCCAGATATCCTGAGGAGCAACAA
TAACGGTCTGATCTACACAATGTATTCCTCTTTCATAGCGGCCATTCAAATGAGG
GTCTATGACGGGAAAGAAGACTATATCATCTCGCCGATAAAGAACAGCAAGGGAG
AGTTCTTCAGGACCGATCCGAAAAGAAGGGAACTTCCGATAGACGCGGATGCGAA
CGGCGCGTATAACATTGCTCTCAGGGGCGAATTGACGATGCGTGCGATAGCGGAG
AAGTTCGATCCGGACTCGGAAAAGATGGCGAAGCTAGAACTGAAACATAAGGACT
GGTTCGAATTCATGCAGACAAGGGGGGATTGA
SEQ ATGACAAAAACATTTGATTCAGAATTTTTTAATTTATATTCTCTTCAAAAAACAG
ID TTCGTTTTGAACTCAAGCCGGTTGGTGAAACAGCCTCGTTTGTTGAAGATTTTAA
NO: AAACGAAGGTTTGAAACGAGTTGTTTCAGAGGATGAACGGCGTGCGGTTGATTAC
24 CAAAAAGTGAAAGAAATTATTGATGACTACCACCGAGATTTTATTGAAGAATCGC
TGAACTATTTTCCTGAGCAGGTCTCAAAAGACGCTTTGGAACAAGCTTTTCACCT
TTATCAAAAACTAAAAGCCGCTAAGGTTGAAGAGCGTGAAAAAGCATTGAAAGAA
TGGGAAGCCCTTCAGAAAAAACTGCGCGAAAAAGTTGTTAAATGTTTTTCAGATT
CAAACAAAGCACGCTTTTCCCGCATTGATAAAAAAGAACTGATTAAAGAAGATTT
AATTAACTGGTTGGTTGCACAAAATCGCGAAGATGACATTCCAACCGTTGAAACC
TTTAACAACTTTACGACTTATTTTACGGGGTTTCATGAAAACCGAAAAAACATTT
ATTCAAAAGACGATCATGCCACAGCCATTTCATTTCGACTCATTCATGAAAACCT
GCCTAAGTTTTTTGATAATGTGATCAGCTTTAATAAATTGAAGGAAGGATTTCCA
GAGCTGAAATTTGATAAGGTTAAGGAAGATTTAGAAGTTGATTATGACTTGAAAC
ATGCCTTTGAAATCGAATACTTTGTCAATTTTGTTACCCAAGCCGGAATTGACCA
ATATAACTATCTTTTGGGGGGTAAAACCTTAGAAGACGGCACCAAAAAGCAAGGC
ATGAATGAACAAATCAATCTGTTCAAGCAACAGCAAACCCGAGACAAAGCCCGAC
AAATTCCCAAACTCATACCATTGTTTAAACAAATTCTAAGCGAACGAACGGAAAG
CCAATCGTTTATTCCAAAACAATTTGAATCAGACCAAGAGCTATTTGACTCACTG
CAAAAACTGCATAACAACTGCCAAGATAAATTTACCGTACTGCAACAAGCCATTT
TAGGCTTAGCCGAAGCAGATCTGAAAAAAGTATTCATTAAAACATCTGATCTTAA
TGCGCTATCAAATACCATTTTTGGAAATTACAGTGTGTTTTCGGATGCGTTGAAT
TTATACAAAGAATCGCTCAAAACAAAAAAGGCGCAAGAAGCGTTTGAAAAACTAC
CCGCTCACAGCATTCATGACTTGATTCAATATTTGGAGCAATTTAATAGCTCTTT
GGATGCAGAAAAACAGCAATCAACTGACACCGTACTGAATTACTTTATTAAAACA
GACGAGCTGTATTCTCGGTTCATAAAATCAACGAGCGAAGCCTTCACACAAGTAC
AACCACTCTTTGAATTGGAAGCATTAAGCTCAAAACGTCGTCCACCGGAAAGTGA
AGACGAAGGCGCAAAAGGTCAGGAAGGGTTTGAGCAAATTAAACGCATAAAAGCC
TATTTGGATACCTTGATGGAGGCGGTGCATTTTGCAAAACCACTTTATCTGGTGA
AGGGGCGCAAAATGATTGAAGGTCTGGACAAAGACCAAAGTTTCTATGAAGCCTT
TGAAATGGCTTACCAAGAACTAGAAAGTCTGATTATTCCAATCTACAACAAAGCT
CGTAGTTATTTAAGTCGTAAACCGTTTAAAGCGGACAAATTCAAAATTAATTTTG
ATAATAATACATTGCTTTCCGGTTGGGATGCTAATAAAGAAACGGCTAACGCTTC
AATTTTGTTTAAGAAGGATGGTTTGTATTATTTAGGAATCATGCCTAAAGGAAAA
ACGTTTTTGTTCGATTACTTCGTTTCATCGGAAGATTCTGAAAAGTTAAAACAAA
GAAGACAAAAAACCGCCGAAGAAGCGCTTGCGCAAGATGGCGAAAGCTACTTTGA
AAAAATTCGTTACAAGCTGTTACCTGGCGCCAGCAAAATGTTGCCGAAAGTATTT
TTTTCCAACAAAAACATAGGGTTTTACAACCCAAGTGATGACATACTTCGTATCA
GGAATACAGCCTCTCACACTAAAAACGGAACACCGCAAAAAGGGCACTCTAAAGT
AGAGTTTAATTTGAATGATTGTCATAAGATGATTGATTTCTTTAAATCAAGCATT
CAAAAGCATCCAGAGTGGGGAAGTTTTGGATTCACCTTTTCAGATACATCAGATT
TTGAAGATATGAGCGCCTTTTATCGAGAAGTCGAAAACCAAGGTTATGTCATTAG
TTTCGATAAAATAAAAGAAACTTACATTCAGAGTCAAGTTGAACAGGGGAACCTA
TATTTATTCCAAATCTACAATAAAGACTTCTCGCCCTACAGCAAAGGCAAACCAA
ATTTACACACGCTTTACTGGAAAGCGTTGTTTGAGGAAGCCAACCTAAATAATGT
GGTGGCAAAACTCAATGGTGAAGCTGAAATTTTCTTTAGGCGACACTCAATCAAA
GCATCTGATAAAGTGGTGCACCCAGCGAATCAAGCCATTGACAATAAAAACCCGC
ATACCGAAAAAACGCAAAGCACCTTTGAATATGATCTTGTAAAAGACAAGCGCTA
TACCCAAGACAAATTCTTCTTCCATGTACCGATTTCATTGAACTTTAAGGCACAA
GGTGTTTCAAAATTTAACGATAAAGTGAATGGATTTTTAAAGGGTAACCCAGATG
TCAATATTATTGGCATTGACCGAGGCGAACGACACCTTCTGTATTTCACTGTGGT
GAATCAGAAAGGTGAAATTTTGGTTCAAGAGTCGCTTAATACCCTAATGAGTGAT
AAAGGGCATGTGAATGACTACCAGCAAAAACTCGACAAAAAAGAACAAGAACGCG
ATGCCGCTCGCAAAAGCTGGACGACGGTTGAAAATATCAAAGAATTAAAAGAAGG
CTATTTATCTCATGTTGTTCATAAGTTGGCACACCTGATTATTAAATACAATGCC
ATTGTTTGCTTGGAAGACCTGAATTTTGGTTTCAAACGCGGGCGTTTTAAAGTGG
AAAAACAAGTTTATCAGAAATTTGAAAAAGCGCTTATTGATAAGCTTAACTACTT
GGTATTTAAAGAAAAAGAGTTAGGCGAGGTGGGCCATTATCTAACCGCCTATCAG
TTGACCGCACCGTTTGAAAGTTTCAAGAAGTTAGGCAAGCAAAGTGGCATATTGT
TTTATGTTCCGGCGGATTACACCTCCAAAATTGACCCAACCACCGGGTTTGTCAA
CTTTCTTGATCTGCGTTATCAGAGTGTCGAAAAAGCCAAACAGCTCTTAAGCGAC
TTTAATGCCATTCGTTTTAATTCAGTACAAAACTATTTTGAGTTCGAAATAGATT
ACAAAAAACTCACACCCAAACGTAAAGTTGGTACTCAGAGTAAATGGGTGATTTG
TACCTATGGAGATGTCCGCTATCAAAATCGGCGTAATCAAAAAGGTCACTGGGAA
ACGGAAGAAGTCAATGTGACTGAAAAACTAAAAGCCCTTTTCGCCAGTGATTCCA
AAACTACAACCGTAATCGATTACGCCAATGACGACAACCTAATTGACGTCATTCT
GGAACAGGACAAAGCCAGCTTCTTCAAAGAACTGTTATGGTTATTAAAACTCACC
ATGACGCTCCGCCACAGCAAAATCAAAAGTGAAGACGACTTTATTCTTTCACCCG
TTAAAAACGAACAAGGCGAGTTTTACGATAGTCGAAAAGCGGGCGAGGTGTGGCC
TAAAGATGCAGACGCCAATGGCGCTTATCACATAGCGTTGAAAGGCTTGTGGAAT
CTGCAACAGATCAATCAGTGGGAAAAGGGTAAAACACTTAATCTGGCGATTAAAA
ACCAGGATTGGTTCAGTTTTATTCAAGAAAAGCCCTATCAAGAATAA
SEQ ATGCACACAGGCGGATTACTTAGCATGGATGCCAAGGAGTTTACCGGACAGTACC
ID CCCTTTCGAAGACTCTGCGTTTTGAACTGAGACCGATAGGCAGAACGTGGGACAA
NO: TCTCGAAGCATCGGGGTATCTTGCGGAGGACAGACACCGTGCAGAATGCTATCCC
25 AGGGCAAAAGAGCTCTTGGACGACAACCATCGTGCATTCCTCAACCGTGTCCTGC
CTCAGATCGATATGGATTGGCACCCGATCGCAGAGGCATTCTGCAAAGTCCACAA
GAATCCGGGAAACAAGGAATTGGCTCAGGATTACAATCTTCAGCTGTCCAAACGC
AGAAAGGAGATTTCGGCCTATCTGCAGGATGCGGACGGCTATAAAGGTCTGTTTG
CCAAACCTGCATTGGATGAAGCAATGAAGATCGCGAAAGAAAACGGAAATGAATC
GGACATAGAGGTTCTTGAGGCATTCAACGGTTTCTCCGTATACTTCACCGGATAT
CATGAGAGCAGGGAGAACATCTATTCGGACGAGGATATGGTGTCGGTAGCTTATC
GCATCACCGAAGACAATTTCCCGAGATTCGTTTCCAATGCGCTTATATTCGATAA
GCTGAATGAGTCGCACCCCGATATAATCTCGGAAGTATCCGGAAATCTGGGCGTA
GACGACATCGGAAAATATTTTGATGTGTCTAACTACAATAATTTCCTGTCGCAGG
CCGGTATAGATGACTACAATCACATCATCGGCGGCCATACGACGGAGGACGGTCT
GATCCAGGCATTCAATGTTGTTCTGAATCTCAGGCATCAGAAAGACCCCGGATTC
GAAAAAATCCAATTCAAACAGCTGTACAAACAGATACTCAGCGTCCGTACATCCA
AATCCTATATCCCGAAACAGTTCGATAATTCGAAGGAGATGGTGGACTGCATCTG
CGACTATGTGTCCAAGATCGAAAAATCCGAAACGGTCGAGAGAGCATTGAAGCTG
GTAAGGAACATATCTTCTTTTGATTTGCGCGGAATATTCGTAAACAAGAAGAATC
TCCGCATTCTTTCCAACAAACTGATTGGTGATTGGGACGCGATCGAAACCGCGCT
GATGCACTCCTCCTCTTCGGAAAATGATAAGAAATCCGTCTACGACAGCGCCGAG
GCATTTACGCTGGATGATATCTTTTCGTCCGTTAAAAAATTCTCAGATGCATCTG
CAGAGGATATCGGAAACCGGGCGGAGGACATATGCAGAGTCATATCTGAGACCGC
TCCGTTCATAAACGATCTGAGGGCTGTCGATTTGGACAGTTTGAATGACGACGGT
TACGAGGCGGCGGTTTCCAAGATAAGGGAATCTCTGGAACCATATATGGATCTGT
TTCATGAACTGGAGATATTCTCCGTAGGCGATGAATTCCCGAAATGTGCAGCTTT
CTACAGTGAACTTGAAGAAGTCTCCGAACAGCTAATCGAGATTATACCGTTATTC
AACAAGGCCCGTTCGTTCTGTACGCGCAAGAGATACAGTACGGACAAGATAAAGG
TCAATTTGAAATTCCCGACACTCGCCGACGGATGGGATCTCAACAAAGAACGCGA
CAACAAAGCCGCAATACTCAGGAAAGACGGAAAGTACTACCTGGCCATACTGGAT
ATGAAGAAAGATCTTTCTTCGATCAGAACTTCGGATGAAGACGAATCCAGTTTTG
AGAAAATGGAGTACAAGCTTCTTCCGAGTCCGGTAAAGATGCTGCCAAAGATCTT
CGTAAAATCGAAGGCGGCCAAGGAGAAGTACGGTCTGACCGACCGTATGCTGGAG
TGCTACGATAAAGGGATGCACAAGAGCGGCAGTGCATTCGATCTCGGATTTTGTC
ACGAATTGATCGATTACTACAAGAGGTGCATCGCAGAATATCCCGGCTGGGACGT
CTTCGATTTCAAGTTCAGGGAAACATCGGATTATGGCAGCATGAAGGAGTTCAAT
GAGGATGTTGCAGGGGCCGGATACTATATGTCCCTCAGAAAGATCCCTTGTTCGG
AGGTCTACAGGCTTCTTGATGAGAAATCGATATATCTTTTCCAGATCTACAACAA
AGATTATTCGGAAAACGCTCATGGGAATAAGAACATGCATACCATGTATTGGGAA
GGGCTCTTTTCCCCCCAGAATCTGGAATCCCCTGTGTTTAAACTCAGCGGCGGTG
CGGAGCTTTTCTTCCGTAAATCCTCCATACCCAATGACGCCAAAACGGTCCATCC
GAAGGGAAGCGTCCTGGTTCCGCGCAATGATGTAAACGGCCGCAGGATACCTGAC
AGCATATATCGGGAGCTCACCAGATATTTCAACCGCGGAGATTGCCGCATAAGCG
ACGAGGCAAAGAGTTATCTGGACAAGGTGAAAACCAAGAAAGCTGACCACGATAT
CGTGAAAGACAGGAGGTTCACGGTGGACAAGATGATGTTCCACGTCCCTATCGCC
ATGAATTTCAAAGCGATTTCGAAGCCGAATCTCAATAAAAAGGTGATTGACGGCA
TAATCGACGACCAAGATCTGAAGATCATCGGCATAGACCGCGGAGAGCGCAACCT
CATCTACGTAACCATGGTGGATCGCAAAGGGAACATCCTCTATCAGGATAGCCTC
AATATTCTGAACGGATACGATTACCGTAAGGCCCTCGACGTCCGCGAATATGACA
ATAAAGAGGCTCGGAGGAACTGGACGAAGGTCGAAGGCATCCGTAAGATGAAAGA
GGGGTATCTGTCGCTTGCAGTCAGCAAATTGGCAGATATGATCATAGAGAACAAT
GCGATTATCGTCATGGAGGATCTCAATCACGGATTCAAGGCAGGGCGTTCGAAGA
TAGAGAAACAGGTCTATCAGAAGTTCGAATCCATGCTCATAAACAAACTCGGTTA
CATGGTCCTCAAGGATAAGTCTATCGATCAGAGCGGCGGAGCTCTCCACGGATAC
CAGCTTGCCAACCATGTGACAACATTGGCATCTGTAGGTAAACAATGTGGAGTGA
TATTCTACATCCCTGCTGCATTTACATCCAAGATAGATCCGACAACAGGATTTGC
AGATCTGTTCGCCCTCAGCAATGTTAAAAACGTGGCATCTATGAGAGAATTTTTC
TCCAAGATGAAGTCTGTAATCTATGATAAGGCGGAGGGAAAATTCGCATTTACCT
TCGACTATCTTGATTATAATGTGAAATCCGAGTGCGGAAGGACCCTTTGGACCGT
GTATACGGTCGGAGAGAGATTCACATACAGCAGGGTCAATAGAGAATATGTCAGA
AAAGTTCCGACAGACATAATCTACGACGCATTGCAAAAGGCAGGAATATCTGTTG
AAGGGGATCTCAGGGACAGGATTGCTGAATCGGATGGCGACACTCTGAAGAGCAT
ATTCTATGCATTCAAGTATGCATTGGATATGAGAGTAGAGAACCGCGAAGAGGAT
TACATACAGTCTCCTGTCAAAAATGCCTCCGGAGAATTCTTCTGTTCCAAGAACG
CAGGCAAATCGCTCCCTCAGGATTCCGATGCGAACGGTGCATACAATATCGCACT
CAAGGGGATCCTGCAGCTACGTATGCTTTCCGAGCAGTATGATCCGAATGCAGAG
AGCATACGGTTGCCACTGATAACCAACAAGGCCTGGCTGACCTTTATGCAGTCCG
GTATGAAGACATGGAAGAACTGA
SEQ atgGATAGTTTGAAAGATTTCACCAATCTGTACCCTGTCAGTAAGACATTGAGAT
ID TTGAATTAAAGCCCGTTGGAAAGACTTTAGAAAATATCGAGAAAGCAGGTATTTT
NO: GAAAGAGGATGAGCATCGTGCAGAAAGTTATCGGAGGGTGAAGAAAATAATTGAT
26 ACTTATCATAAGGTATTTATCGATTCTTCTCTTGAAAATATGGCTAAAATGGGTA
TTGAGAATGAAATAAAAGCAATGCTCCAAAGTTTCTGCGAATTGTATAAAAAAGA
TCATCGCACTGAGGGTGAAGACAAGGCATTAGATAAAATTCGAGCAGTACTTCGT
GGCCTGATTGTTGGGGCTTTCACTGGTGTTTGCGGAAGACGGGAAAATACAGTCC
AAAACGAGAAGTACGAGAGTTTGTTCAAAGAAAAGTTGATAAAAGAAATTTTACC
TGATTTTGTGCTCTCTACTGAGGCTGAAAGCTTGCCTTTCTCTGTTGAAGAAGCT
ACGAGGTCACTGAAGGAGTTTGATAGCTTTACATCCTACTTTGCTGGTTTTTACG
AGAATAGAAAGAATATATACTCGACGAAACCTCAATCCACTGCCATTGCTTATCG
TCTTATTCATGAGAACTTGCCGAAGTTCATTGATAATATTCTTGTTTTTCAGAAG
ATCAAAGAGCCTATAGCCAAAGAGCTGGAACATATTCGTGCGGACTTTTCTGCCG
GGGGGTACATAAAAAAGGATGAGAGATTGGAGGATATTTTTTCGTTGAACTATTA
TATCCACGTGTTATCTCAGGCTGGGATCGAAAAATATAACGCATTGATTGGGAAG
ATTGTGACAGAAGGAGATGGAGAGATGAAAGGGCTCAATGAACACATCAACCTTT
ACAACCAACAAAGAGGCAGAGAGGATCGGCTCCCTCTTTTTAGGCCTCTTTATAA
ACAGATATTGAGTGACAGAGAGCAATTATCATACTTGCCTGAGAGTTTTGAAAAA
GATGAGGAGCTCCTCAGGGCTCTAAAAGAGTTCTATGATCATATCGCAGAAGACA
TTCTCGGACGTACTCAACAGTTGATGACTTCTATTTCAGAATATGATTTATCTCG
GATATACGTAAGGAACGATAGCCAATTGACTGATATATCAAAAAAAATGTTGGGA
GATTGGAATGCTATCTACATGGCTAGAGAACGAGCATATGACCACGAGCAGGCTC
CCAAAAGAATCACGGCGAAATACGAGAGGGACAGGATTAAAGCTCTTAAAGGAGA
AGAGAGTATAAGTCTGGCAAATCTTAATAGTTGTATTGCCTTTCTGGACAATGTT
AGAGATTGCCGTGTAGATACTTATCTTTCCACACTGGGCCAGAAGGAAGGACCAC
ATGGTCTATCTAATCTCGTTGAGAACGTTTTTGCCTCATACCATGAAGCAGAGCA
ATTGTTGAGCTTTCCATACCCCGAAGAGAATAATCTGATTCAGGACAAGGACAAT
GTGGTGTTAATTAAGAATCTTCTCGACAATATCAGTGATCTGCAGAGGTTCTTGA
AACCTCTTTGGGGTATGGGAGACGAACCCGATAAAGATGAAAGATTTTATGGAGA
GTATAATTATATCCGAGGAGCTCTAGATCAGGTGATCCCTCTGTACAATAAGGTA
AGGAACTACCTCACTCGGAAGCCTTATTCGACCAGAAAAGTAAAACTCAATTTTG
GGAATTCTCAATTGCTTAGTGGTTGGGATAGAAATAAGGAAAAGGATAATAGCTG
TGTGATTTTGCGTAAGGGGCAGAACTTCTATTTGGCTATTATGAACAATAGGCAC
AAAAGAAGTTTCGAAAACAAGGTGTTGCCCGAGTATAAGGAGGGAGAACCTTACT
TCGAAAAGATGGATTATAAATTTTTGCCTGATCCTAATAAAATGCTTCCTAAGGT
TTTTCTTTCGAAAAAAGGAATAGAGATATACAAACCAAGTCCGAAGCTTTTAGAA
CAATATGGACATGGAACTCACAAAAAGGGAGATACCTTTAGTATGGATGATTTGC
ACGAACTGATCGATTTCTTCAAACACTCAATCGAGGCTCATGAAGATTGGAAGCA
ATTCGGATTCAAATTTTCTGATACGGCTACTTATGAGAATGTATCTAGTTTCTAT
AGAGAAGTTGAGGATCAGGGGTATAAGCTCTCTTTCCGAAAAGTTTCGGAATCTT
ATGTCTATTCATTAATAGATCAAGGCAAGTTGTATTTATTTCAGATATACAACAA
GGACTTTTCTCCCTGCAGCAAAGGGACACCTAATCTGCATACCTTGTATTGGAGA
ATGCTTTTTGACGAGCGCAATTTGGCAGATGTCATATACAAACTGGATGGGAAGG
CTGAAATCTTTTTCCGAGAGAAGAGTTTGAAAAATGATCATCCCACGCATCCGGC
TGGTAAGCCTATCAAAAAGAAAAGTCGACAAAAAAAAGGAGAGGAGAGTCTGTTT
GAGTATGATTTAGTCAAGGATAGGCACTATACGATGGATAAGTTCCAGTTTCATG
TGCCTATTACTATGAATTTTAAATGTTCTGCAGGAAGCAAAGTCAATGATATGGT
TAATGCTCATATTCGAGAGGCAAAGGATATGCATGTCATTGGAATTGATCGTGGA
GAACGCAATCTGCTGTATATATGCGTGATAGATAGTCGAGGGACGATTTTGGATC
AAATTTCTCTGAATACGATTAACGATATAGACTATCATGATTTATTGGAGAGTCG
AGACAAAGACCGTCAGCAGGAGCGCCGAAACTGGCAAACTATCGAAGGGATCAAG
GAGCTAAAACAAGGCTACCTTAGTCAGGCGGTTCATCGGATAGCCGAACTGATGG
TGGCTTATAAGGCTGTAGTTGCTTTGGAGGATTTGAATATGGGGTTCAAACGTGG
GCGGCAGAAAGTAGAAAGTTCTGTTTATCAGCAGTTTGAGAAACAGCTGATAGAT
AAGCTCAACTATCTTGTGGACAAGAAGAAAAGGCCTGAAGATATTGGAGGATTGT
TGAGAGCCTATCAATTTACGGCCCCATTTAAGAGTTTTAAGGAAATGGGAAAGCA
AAACGGCTTCTTGTTTTATATCCCGGCTTGGAACACGAGCAACATAGATCCGACT
ACTGGATTTGTTAATTTATTTCATGCCCAGTATGAAAATGTAGATAAAGCGAAGA
GCTTCTTTCAAAAGTTTGATTCAATTAGTTACAACCCGAAGAAAGACTGGTTTGA
GTTTGCATTCGATTATAAAAACTTTACTAAAAAGGCTGAAGGAAGTCGTTCTATG
TGGATATTATGCACACATGGTTCCCGAATAAAGAATTTTAGAAATTCCCAGAAGA
ATGGTCAATGGGATTCCGAAGAATTCGCCTTGACGGAGGCTTTTAAGTCTCTTTT
TGTGCGATATGAGATAGATTATACCGCTGATTTGAAAACAGCTATTGTGGACGAA
AAGCAAAAAGACTTCTTCGTGGATCTTCTGAAGCTATTCAAATTGACAGTACAGA
TGCGCAACAGCTGGAAAGAGAAGGATTTGGATTATCTAATCTCTCCTGTAGCAGG
GGCTGATGGCCGTTTCTTCGATACAAGAGAGGGAAATAAAAGTCTGCCTAAGGAT
GCAGATGCCAATGGAGCTTATAATATTGCCCTAAAAGGACTTTGGGCTCTACGCC
AGATTCGGCAAACTTCAGAAGGCGGTAAACTCAAATTGGCGATTTCCAATAAGGA
ATGGCTACAGTTTGTGCAAGAGAGATCTTACGAGAAAGACtga
SEQ atgaataatggaacaaataactttcagaattttatcggaatttcttctttgcaga
ID agactcttaggaatgctctcattccaaccgaaacaacacagcaatttattgttaa
NO: aaacggaataattaaagaagatgagctaagaggagaaaatcgtcagatacttaaa
27 gatatcatggatgattattacagaggtttcatttcagaaactttatcgtcaattg
atgatattgactggacttctttatttgagaaaatggaaattcagttaaaaaatgg
agataacaaagacactcttataaaagaacagactgaataccgtaaggcaattcat
aaaaaatttgcaaatgatgatagatttaaaaatatgttcagtgcaaaattaatct
cagatattcttcctgaatttgtcattcataacaataattattctgcatcagaaaa
ggaagaaaaaacacaggtaattaaattattttccagatttgcaacgtcattcaag
gactattttaaaaacagggctaattgtttttcggctgatgatatatcttcatctt
cttgtcatagaatagttaatgataatgcagagatattttttagtaatgcattggt
gtataggagaattgtaaaaagtctttcaaatgatgatataaataaaatatccgga
gatatgaaggattcattaaaggaaatgtctctggaagaaatttattcttatgaaa
aatatggggaatttattacacaggaaggtatatctttttataatgatatatgtgg
taaagtaaattcatttatgaatttatattgccagaaaaataaagaaaacaaaaat
ctctataagctgcaaaagcttcataaacagatactgtgcatagcagatacttctt
atgaggtgccgtataaatttgaatcagatgaagaggtttatcaatcagtgaatgg
atttttggacaatattagttcgaaacatatcgttgaaagattgcgtaagattgga
gacaactataacggctacaatcttgataagatttatattgttagtaaattctatg
aatcagtttcacaaaagacatatagagattgggaaacaataaatactgcattaga
aattcattacaacaatatattacccggaaatggtaaatctaaagctgacaaggta
aaaaaagcggtaaagaatgatctgcaaaaaagcattactgaaatcaatgagcttg
ttagcaattataaattatgttcggatgataatattaaagctgagacatatataca
tgaaatatcacatattttgaataattttgaagcacaggagcttaagtataatcct
gaaattcatctggtggaaagtgaattgaaagcatctgaattaaaaaatgttctcg
atgtaataatgaatgcttttcattggtgttcggttttcatgacagaggagctggt
agataaagataataatttttatgccgagttagaagagatatatgacgaaatatat
ccggtaatttcattgtataatcttgtgcgtaattatgtaacgcagaagccatata
gtacaaaaaaaattaaattgaattttggtattcctacactagcggatggatggag
taaaagtaaagaatatagtaataatgcaattattctcatgcgtgataatttgtac
tatttaggaatatttaatgcaaaaaataagcctgacaaaaagataattgaaggta
atacatcagaaaataaaggggattataagaagatgatttataatcttctgccagg
accaaataaaatgatccccaaggtattcctctcttcaaaaaccggagtggaaaca
tataagccgtctgcctatatattggagggctataaacaaaacaagcatattaaat
cctctaaggattttgatataacattttgtcacgatttgattgattattttaagaa
ctgtatagcaatacatcctgaatggaagaattttggctttgatttttctgacacc
tccacatatgaagatatcagcggattttacagagaagtcgaattacaaggttata
aaatcgactggacatatatcagcgaaaaggatattgatttgttgcaggaaaaagg
acagttatatttattccaaatatataacaaagatttttccaagaaaagtaccgga
aatgataatcttcatactatgtatttgaagaatttgtttagtgaagagaatttaa
aggatattgtactgaaattaaacggtgaggcggaaatcttctttagaaaatcaag
cataaagaatccaataattcataaaaaaggctctattcttgttaatagaacatat
gaagcagaggaaaaagatcaatttggaaatatccagatagtcagaaaaaacatac
cggaaaatatatatcaggagctttataaatatttcaatgataaaagtgataaaga
actttcggatgaagcagctaagcttaagaatgtagtaggtcatcatgaggctgct
acaaacatagtaaaagattatagatatacatatgataaatattttcttcatatgc
ctattacaatcaattttaaagccaataagacaggctttattaatgacagaatatt
acaatatattgctaaagaaaaggatttgcatgtaataggcattgatcgtggtgaa
agaaacctgatatatgtttcagtaattgatacttgtggaaatattgttgaacaaa
aatcgtttaacattgttaatggatatgattatcagattaagctcaagcagcagga
gggggcgcgacaaatcgcacgaaaagaatggaaagaaatcggcaaaataaaagaa
attaaagaaggctatttatctcttgtaattcatgaaatttcaaagatggttatta
aatataatgccataattgcaatggaggatttaagctacggatttaaaaaaggtcg
tttcaaggttgagcgacaggtttaccagaagtttgagacaatgcttatcaacaaa
ctcaactatctggtatttaaagatatatccataacggaaaacggtggtcttctaa
agggataccagcttacatatattccagataaactgaaaaatgtgggtcatcaatg
tggctgtatattttatgtacctgctgcctatacatcaaaaatagatcctacaacc
ggatttgtaaatatattcaaatttaaagatttaacagttgatgcgaagagagaat
ttataaaaaaatttgacagtatcagatatgattcagaaaaaaatctgttttgttt
tacattcgattataataactttattacgcaaaatactgttatgtcaaagtcaagc
tggagtgtatatacgtacggagttaggataaaaagaagatttgtcaatggcaggt
tctcaaatgaatcggatacaattgatataacaaaagatatggaaaaaacactcga
aatgacagatataaattggagagatggtcatgatctgaggcaggatattattgat
tatgaaatcgtacaacacatatttgagatttttagattgactgtacaaatgagaa
acagtttaagtgaattagaagacagggattatgaccgtttgatttctccggtgct
caatgaaaataatatattttatgattcagctaaagcaggagatgcgttacctaaa
gacgcagatgctaatggtgcatattgtatagctctaaaaggcttgtatgaaatca
aacaaattacagagaattggaaagaagacggtaagttttcaagagataaacttaa
aatttccaataaggactggtttgactttattcaaaataaaaggtatttataa
SEQ atgacaaacaaatttacaaaccagtactcgctttccaaaacacttcgatttgagt
ID tgattccacaaggaaaaacattggaatttattcaagaaaaaggattgctctctca
NO: agataaacaacgagcggagagttatcaagaaatgaaaaaaactattgataaattt
28 cataaatactttatcgatttagctttaagcaatgctaaactaactcatttagaaa
cttacttggaattatacaataaaagtgctgaaacaaaaaaagaacaaaaatttaa
agacgatttaaagaaagtacaagacaatttacgaaaagaaatcgttaaatctttt
tcagatggtgatgcaaaatcaatttttgcaattttggataaaaaagaactgatta
ccgtagaacttgaaaaatggtttgaaaacaacgaacaaaaagacatttattttga
cgaaaaattcaaaacgtttactacttattttactggttttcatcaaaacagaaaa
aacatgtattcggttgaacccaattctacagcaattgcttatcgattgattcatg
aaaatttacctaaatttttagaaaatgctaaagcatttgaaaaaataaaacaagt
agaaagtttgcaagttaattttagagaattaatgggggaatttggagatgaaggg
ctaattttcgtaaatgaattagaagaaatgtttcaaatcaattattataatgatg
tgctttcacaaaatggaattacaatttataatagtataatttcaggatttaccaa
aaatgatataaaatataaaggtctaaatgaatacataaataattacaatcaaacc
aaagacaaaaaagaccgtttgccaaaattaaaacaattgtataaacagattttga
gtgataggatttcactttcgtttttgcccgatgcttttacggatgggaaacaagt
tttgaaagccatatttgacttttataaaatcaacttactttcttataccattgaa
ggacaggaagaaagccaaaatcttttactattaattcgtcagacaattgaaaacc
tttctagttttgatacccaaaaaatttatctaaaaaatgatacccatttaaccac
tatttcacaacaagtatttggcgatttttcggtgttttcaactgctttaaattat
tggtatgaaactaaagtaaatccaaaatttgaaacggaatatagcaaagccaacg
aaaaaaaacgagaaattttagataaagccaaagcggtatttacaaaacaagatta
tttttcaattgcttttttacaagaagtactttcggaatacattcttaccttagat
cacacttctgatattgtaaaaaagcattcctccaactgtattgcggattatttta
aaaatcattttgtagccaaaaaagaaaatgaaaccgacaaaacctttgattttat
tgctaatattactgcaaaataccaatgtattcaaggtattttagaaaatgcagac
caatacgaagacgaactcaaacaagaccaaaaattaattgataatttgaaattct
ttttagatgctattttagaattgttgcattttattaaacctttgcatttaaaatc
agaaagcattaccgaaaaagacactgctttttatgatgtgtttgaaaattattac
gaagcattgagtttgttgaccccattatataatatggtgcgaaactatgtaacgc
aaaagccgtacagcaccgaaaaaataaaattaaattttgaaaatgcacaattatt
gaatggttgggatgccaataaagaaggtgattacctaactaccattttgaaaaaa
gacggtaattattttttagccataatggataaaaagcataacaaagcgtttcaaa
agtttccagaaggaaaagaaaattatgaaaaaatggtgtataaactattgcctgg
agtaaataagatgttgccaaaagtatttttttccaataaaaatattgcttacttc
aacccatcaaaagagttattagaaaactataaaaaagagacgcacaaaaaaggag
acacattcaatttagaacattgtcatacgttgatcgattttttcaaggactcttt
aaacaaacatgaagactggaaatactttgattttcaattttctgaaacaaaatcg
tatcaagatttgagtggtttttatagagaagtagaacatcaaggctacaaaatca
attttaaaaatatcgattcagaatatattgatggtttggtgaacgaaggtaaatt
gtttctatttcaaatttacagcaaagatttttcgcctttttccaaagggaaaccg
aacatgcacactttgtattggaaagccttatttgaagaacaaaatttgcaaaatg
taatctataaattgaatggacaagccgaaatattttttagaaaagcctctataaa
acctaaaaatataatattgcacaaaaagaaaattaaaattgccaaaaagcatttt
attgataaaaaaacaaaaacatctgaaattgttcctgttcaaacaataaaaaacc
tcaatatgtactaccaaggaaaaataagtgaaaaagaattaacacaagatgattt
aaggtatattgataattttagcattttcaatgaaaaaaataaaacaattgatatt
ataaaagacaaacgatttacggttgataaatttcagtttcatgtgccgattacca
tgaactttaaagcaacgggcggaagttatatcaatcaaaccgtattagaatattt
gcaaaacaatcccgaagttaagattattggattggatagaggcgaacgccatttg
gtatatctgacactgatagaccagcaaggaaacatcttgaaacaagaaagtttga
atacaatcaccgattctaaaatctcgacaccttatcataagttgttggataacaa
ggaaaacgagcgtgacttggctcgaaaaaattggggaacggtggaaaacatcaaa
gaactcaaagaaggctacatcagtcaagtggtgcataaaattgctacgttgatgc
tggaagaaaatgccattgtggtaatggaagatttgaattttggatttaaacgtgg
acgttttaaagtggaaaaacaaatttatcaaaagctggaaaaaatgttgattgac
aaattgaattatttggttttaaaagacaaacaacctcaggaattaggcggattgt
acaacgcattacaactcaccaataaatttgaaagtttccaaaaaatgggtaaaca
atcgggctttttttttatgtacccgcttggaacacctccaaaatagacccaacca
cagggtttgtcaattatttttataccaaatatgaaaatgttgacaaagccaaagc
cttttttgaaaaatttgaggcgattcgtttcaatgcagaaaagaagtattttgaa
tttgaagtaaaaaaatatagcgattttaacccaaaagccgaaggcactcaacaag
cctggaccatttgcacgtatggcgaacgaatagaaaccaaacgacaaaaagacca
aaacaacaaatttgtaagcactccaattaatctaaccgaaaagatagaagacttt
ttgggtaaaaaccaaattgtttatggtgatggtaattgcatcaaatctcaaattg
ctagcaaagacgacaaggctttttttgaaaccttattgtattggttcaaaatgac
tttacaaatgcgaaacagcgaaacaagaacagatatagattatctaatttcgccc
gtgatgaatgacaacggaacattttacaacagccgagattatgaaaaattagaaa
atccaactttgcccaaagatgccgatgccaacggagcgtatcatattgccaaaaa
aggattgatgcttttgaataaaatagaccaagccgacttgacaaaaaaagtggat
ttatctattagtaacagagattggttgcaatttgtacaaaaaaataaataa
SEQ atggaacaggagtactatttaggactggatatgggaaccggatctgtaggatggg
ID ctgttacagattcggaatatcatgtcttgcgtaaacatggaaaagcactatgggg
NO: agtccgattatttgaaagtgcatcgacagcagaagaacgaagaatgttccgaaca
29 tcaagaagaagactagatcgaagaaactggagaattgaaattttacaggaaattt
ttgcagaggaaataagtaagaaagatccaggatttttcttgcgaatgaaagaaag
caaatattatccagaagataagcgagatatcaatggaaattgtccggaactgcca
tatgcattatttgttgatgacgattttacagataaagattatcataaaaaatttc
cgacaatttatcatctcaggaaaatgttgatgaatacagaggagacaccggatat
ccggttggtgtatctggcaattcatcatatgatgaagcataggggccatttcttg
ttatctggtgacattaatgagattaaggagttcggaacgacattttcaaaattgt
tggagaatatcaaaaatgaggaattggattggaatcttgaactgggaaaagaaga
atatgctgttgtagaaagtattttaaaagataacatgttaaaccgatccacaaag
aaaaccagattaataaaagcattaaaagcaaaatcaatatgtgaaaaggctgtac
tgaatttattggctggtggaacggtgaaattgagtgatatatttggtcttgaaga
attaaatgagacagaaagaccgaagatttcctttgctgataatggatacgatgat
tatatcggagaagttgaaaatgagctgggagaacaattctatattatagagacgg
caaaagcagtgtatgactgggcggtattagttgaaatattgggaaaatatacgtc
aatttcagaagcgaaagtagcaacgtatgaaaaacataaatcggatttacaattt
ttgaaaaagatagttcggaaatatctgacaaaggaggaatataaagatatttttg
taagtacgagtgacaaattgaaaaattactctgcttatataggaatgacgaaaat
aaatggaaaaaaggttgatttgcagagcaaacggtgcagtaaagaagaattctat
gattttattaagaaaaacgtacttaaaaagctagaaggacaacctgaatatgaat
atttgaaagaagagctagaaagagaaacatttctaccaaaacaggtgaacaggga
taatggtgtaataccgtatcagattcatttgtacgagttgaaaaagatattagga
aatttacgggataaaatagacctcattaaagagaacgaagataaactggttcaat
tatttgaattcagaattccgtattatgttggtccgctgaataagatagatgacgg
aaaagagggaaaatttacatgggctgtacggaaaagtaatgaaaagatatatcca
tggaattttgaaaatgtagttgatatagaagcaagtgcagaaaaatttatccgga
gaatgacaaataagtgtacatatctgatgggcgaagatgtattgccgaaggattc
attgctttacagtaaatatatggttttaaatgaattaaataatgtaaagttggat
ggcgaaaaattatctgtagaattgaaacaacggttgtatacagatgtattttgta
agtatcggaaagtaactgtaaagaagataaaaaattacttgaaatgtgaaggtat
catatccggcaatgtcgaaataactggaattgatggtgattttaaggcatcgtta
acggcatatcatgattttaaagaaatcttgacaggaacagaattggctaaaaagg
acaaagaaaatattattaccaatatagtattgtttggagatgataaaaagctgct
gaaaaagagactgaatcgattatatcctcagattacgccgaatcagttgaagaaa
atatgtgcgctatcctatacaggctggggaagattttctaaaaagttcttagaag
aaataacagctccagatccggaaacgggagaggtatggaatatcattacggcatt
gtgggaatcgaataataatctgatgcaattattaagtaatgaatatcggtttatg
gaagaagtcgaaacatacaatatgggaaaacagactaaaacattgtcgtacgaaa
cagtagagaatatgtatgtttctccatctgtgaaaagacagatatggcagacgct
gaaaatcgtgaaagaattagaaaaagtaatgaaagaatctccgaaacgtgtattt
attgagatggcgagagaaaagcaagaaagtaagagaaccgaatcgcgtaaaaaac
aactaatagatttgtataaggcttgtaaaaatgaagaaaaagattgggtaaaaga
actgggagatcaggaagaacagaaattacgaagcgataagttgtacctatattat
acgcaaaagggtcgttgtatgtattctggcgaggtaatagaactgaaagacttat
gggataatacaaaatatgatattgatcatatatatccacaatctaaaacgatgga
tgacagtcttaataatcgcgtattggtaaaaaagaaatataatgcaacaaaatca
gataagtatccattaaatgaaaatatacgacatgagagaaaaggcttttggaagt
cactgttagatggagggtttataagtaaagaaaaatatgaacgcttaataagaaa
tacagaattgagtccggaagaattagcaggatttattgaaaggcagattgttgaa
acgaggcagagtacaaaagctgtagcggaaatattaaagcaagtgtttccggaaa
gtgaaattgtatatgtcaaagcaggtacggtttcaagattcagaaaagattttga
attactgaaagttcgagaagtgaatgatttgcatcacgcaaaggatgcgtattta
aatattgtagttggtaatagttattatgtgaaatttactaagaatgcatcatggt
ttataaaagaaaatccgggacgtacttacaacttaaaaaagatgtttacatcagg
ttggaatattgaacgaaatggagaagttgcatgggaagtcgggaaaaaaggaaca
attgtaacggtaaaacaaataatgaataaaaataatatattggtgacaagacagg
ttcatgaagcgaaaggtgggctgtttgatcagcagattatgaaaaaaggaaaagg
tcagattgctataaaggaaactgatgaacgtcttgcatcaatagaaaagtatgga
ggctataataaagctgccggggcatattttatgctggtagaatctaaagataaaa
aaggaaaaacaattcgaacgatagaatttataccattatatttaaagaataaaat
cgagtcggatgaatcaatagcattgaactttttagaaaaaggcagaggtttgaaa
gaaccaaagatactattgaaaaaaattaagattgatacattatttgatgtggacg
gattcaaaatgtggttgtctggaagaacaggggacagactactatttaaatgtgc
aaatcaattgattttggatgagaaaataattgtaacaatgaaaaaaattgtaaag
tttattcaaaggagacaagaaaatagagaattaaaattatctgataaagatggaa
ttgataatgaagtacttatggaaatatataacacttttgtggataagttagaaaa
cacagtgtatagaatacgattatccgaacaggcaaaaacgcttatagataaacaa
aaagaatttgaaaggttatcactagaggataaaagtagtactttgtttgaaattt
tacatatttttcagtgtcaaagtagtgcggccaatttaaaaatgataggcggacc
tggaaaagcaggaatattagttatgaataataatataagtaagtgtaacaaaatt
tctattataaatcagtctccaacaggaattttcgaaaatgagattgatttgttaa
agat
SEQ ATGAAATCTTTCGATTCATTCACAAATCTTTATTCTCTTTCAAAAACCTTGAAAT
ID TTGAGATGAGACCTGTCGGAAATACCCAAAAAATGCTCGACAATGCAGGAGTATT
NO: TGAAAAAGACAAACTAATTCAAAAAAAGTACGGAAAAACAAAGCCGTATTTCGAC
30 AGACTCCACAGAGAATTTATAGAAGAAGCGCTCACGGGGGTAGAGCTAATAGGAC
TAGATGAGAACTTTAGGACACTTGTTGACTGGCAAAAAGATAAGAAAAATAATGT
CGCAATGAAAGCGTATGAAAATAGTTTGCAGCGGCTGAGAACGGAAATAGGTAAA
ATATTTAACCTAAAGGCTGAGGATTGGGTAAAGAACAAATATCCAATATTAGGGC
TGAAAAATAAAAATACCGATATTTTATTCGAAGAGGCTGTATTCGGGATATTGAA
AGCCCGATATGGAGAAGAAAAAGATACTTTTATAGAAGTAGAGGAAATAGATAAA
ACCGGCAAATCAAAGATCAATCAAATATCAATTTTCGATAGTTGGAAAGGATTTA
CAGGATATTTCAAAAAATTTTTTGAAACCAGAAAGAATTTTTACAAAAACGACGG
AACTTCTACAGCAATTGCTACAAGGATCATTGATCAAAATCTGAAAAGATTCATA
GATAATCTGTCAATAGTTGAAAGTGTGAGACAAAAGGTTGATCTCGCCGAGACAG
AAAAATCTTTCAGCATATCTCTATCGCAATTCTTCTCAATAGACTTTTATAACAA
GTGTCTCCTTCAAGATGGTATTGATTACTACAACAAGATAATCGGTGGAGAAACT
CTCAAAAATGGCGAAAAACTAATAGGTCTCAATGAACTAATAAATCAATATAGGC
AGAATAATAAGGATCAGAAAATCCCATTTTTCAAACTTCTTGATAAACAAATTTT
GAGTGAAAAGATATTATTTTTGGATGAAATAAAAAATGACACAGAACTGATCGAG
GCGCTGAGTCAGTTCGCAAAAACAGCCGAAGAAAAAACAAAAATTGTCAAAAAGC
TTTTTGCCGATTTTGTAGAAAATAATTCCAAATACGATCTTGCACAGATTTATAT
TTCCCAAGAAGCATTCAATACTATATCAAACAAGTGGACAAGCGAAACTGAGACG
TTCGCTAAATATCTATTCGAAGCAATGAAGAGTGGAAAACTTGCAAAGTATGAGA
AAAAAGATAATAGCTATAAATTTCCTGATTTTATTGCCCTTTCACAGATGAAGAG
TGCTTTATTAAGTATCAGCCTTGAGGGACATTTTTGGAAAGAGAAATACTACAAA
ATTTCAAAATTCCAAGAGAAGACCAATTGGGAGCAGTTTCTTGCAATTTTTCTAT
ACGAGTTTAACTCTCTTTTCAGCGACAAAATAAATACAAAAGATGGAGAAACAAA
GCAAGTTGGATACTATCTATTTGCCAAAGACCTGCATAATCTTATCTTAAGTGAG
CAGATTGATATTCCAAAAGATTCAAAAGTCACAATAAAAGATTTTGCCGATTCTG
TACTCACAATCTACCAAATGGCAAAATATTTTGCGGTAGAAAAAAAACGAGCGTG
GCTTGCCGAGTATGAACTAGATTCATTTTATACCCAGCCAGACACAGGCTATTTA
CAGTTTTATGATAACGCCTACGAGGATATTGTGCAGGTATACAACAAGCTTCGAA
ACTATCTGACCAAAAAGCCATATAGCGAGGAGAAATGGAAGTTGAATTTTGAAAA
TTCTACGCTGGCAAATGGATGGGATAAGAACAAAGAATCTGATAATTCAGCAGTT
ATTCTACAAAAAGGTGGAAAATATTATTTGGGACTGATTACTAAAGGACACAACA
AAATTTTTGATGACCGTTTTCAAGAAAAATTTATTGTGGGAATTGAAGGTGGAAA
ATATGAAAAAATAGTCTATAAATTTTTCCCCGACCAGGCAAAAATGTTTCCCAAA
GTGTGCTTTTCTGCAAAAGGACTCGAATTTTTTAGACCGTCTGAAGAAATTTTAA
GAATTTATAACAATGCAGAGTTTAAAAAAGGAGAAACTTATTCAATAGATAGTAT
GCAGAAGTTGATTGATTTTTATAAAGATTGCTTGACTAAATATGAAGGCTGGGCA
TGTTATACCTTTCGGCATCTAAAACCCACAGAAGAATACCAAAACAATATTGGAG
AGTTTTTTCGAGATGTTGCAGAGGACGGATACAGGATTGATTTTCAAGGCATTTC
AGATCAATATATTCATGAAAAAAACGAGAAAGGCGAACTTCACCTTTTTGAAATC
CACAATAAAGATTGGAATTTGGATAAGGCACGAGACGGAAAGTCAAAAACAACAC
AAAAAAACCTTCATACACTCTATTTCGAATCGCTCTTTTCAAACGATAATGTTGT
TCAAAACTTTCCAATAAAACTCAATGGTCAAGCTGAAATTTTTTATAGACCGAAA
ACGGAAAAAGACAAATTAGAATCAAAAAAAGATAAGAAAGGGAATAAAGTGATTG
ACCATAAACGCTATAGTGAGAATAAGATTTTTTTTCATGTTCCTCTCACACTAAA
CCGCACTAAAAATGACTCATATCGCTTTAATGCTCAAATCAACAACTTTCTCGCA
AATAATAAAGATATCAACATCATCGGTGTAGATAGGGGAGAAAAGCATTTAGTCT
ATTATTCGGTGATTACACAAGCTAGTGACATCTTAGAAAGTGGCTCACTAAATGA
GCTAAATGGCGTGAATTATGCTGAAAAACTGGGAAAAAAGGCAGAAAATCGAGAA
CAAGCACGCAGAGACTGGCAAGACGTACAAGGGATCAAAGACCTCAAGAAAGGAT
ATATTTCACAGGTGGTGCGAAAGCTTGCTGATTTAGCAATTAAACACAATGCCAT
TATCATTCTTGAAGATTTGAATATGAGATTTAAACAAGTTCGGGGCGGTATCGAA
AAATCCATTTATCAACAGTTAGAAAAAGCACTGATAGATAAATTAAGCTTTCTTG
TAGACAAAGGTGAAAAAAATCCCGAGCAAGCAGGACATCTTCTGAAAGCATATCA
GCTTTCGGCCCCATTTGAGACATTTCAAAAAATGGGCAAACAGACGGGTATAATC
TTTTATACACAAGCTTCGTATACCTCAAAAAGTGACCCTGTAACAGGTTGGCGAC
CACACCTGTATCTCAAATATTTCAGTGCCAAAAAAGCAAAAGACGATATTGCAAA
GTTTACAAAAATAGAATTTGTAAACGATAGGTTTGAGCTTACCTATGATATAAAG
GACTTTCAGCAAGCAAAAGAATATCCAAATAAAACTGTTTGGAAAGTTTGCTCAA
ATGTAGAGAGATTCAGGTGGGACAAAAACCTCAATCAAAACAAAGGCGGATATAC
TCACTACACAAATATAACTGAGAATATCCAAGAGCTTTTTACAAAATATGGAATT
GATATCACAAAAGATTTGCTCACACAGATTTCTACAATTGATGAAAAACAAAATA
CCTCATTTTTTAGAGATTTTATTTTTTATTTCAACCTTATTTGCCAAATCAGAAA
TACCGATGATTCTGAGATTGCTAAAAAGAATGGGAAAGATGATTTTATACTGTCA
CCTGTTGAGCCGTTTTTCGATAGCCGAAAAGACAATGGAAATAAACTTCCTGAGA
ATGGAGATGATAACGGCGCGTATAACATAGCAAGAAAAGGGATTGTCATACTCAA
CAAAATCTCACAATATTCAGAGAAAAACGAAAATTGCGAGAAAATGAAATGGGGG
GATTTGTATGTATCAAACATTGACTGGGACAATTTTGTAACCCAAGCTAATGCAC
GGCATTAA
SEQ ATGATTATCTTATATATTAGTACCTCGAATATGAACATGGAAGGAGTATTTATGG
ID AAAATTTTAAAAACTTGTATCCAATAAACAAAACACTTCGATTTGAATTAAGACC
NO: CTATGGAAAAACATTGGAAAATTTTAAAAAATCCGGACTTTTAGAAAAAGATGCC
31 TTTAAGGCAAATAGTAGACGAAGTATGCAAGCTATAATCGATGAAAAATTCAAAG
AGACTATCGAAGAACGCTTAAAGTACACTGAATTCAGTGAATGTGATCTTGGAAA
CATGACATCAAAAGATAAAAAAATAACTGATAAAGCAGCTACAAATTTAAAAAAG
CAAGTTATCTTATCTTTTGACGATGAAATATTTAATAATTACCTAAAACCTGATA
AAAATATTGACGCATTATTTAAAAATGATCCTTCAAATCCTGTAATCTCTACATT
TAAAGGTTTTACGACATATTTTGTGAATTTTTTTGAAATTCGAAAACATATTTTC
AAGGGAGAATCATCAGGCTCAATGGCATACCGAATTATAGATGAAAACCTGACAA
CATACTTGAATAATATTGAAAAAATAAAAAAACTGCCAGAAGAATTAAAATCACA
GCTAGAAGGCATTGATCAGATTGATAAACTTAATAATTATAATGAGTTCATTACA
CAGTCAGGTATAACACACTATAATGAAATCATCGGCGGTATATCAAAATCAGAGA
ATGTCAAAATACAGGGAATTAATGAAGGAATTAATCTATACTGTCAGAAGAACAA
AGTTAAACTTCCTCGACTGACTCCGCTATACAAAATGATATTATCAGACAGAGTT
TCCAACTCTTTTGTATTAGACACTATTGAAAATGACACAGAATTAATTGAAATGA
TAAGTGATTTGATTAATAAGACTGAGATTTCGCAAGATGTTATAATGTCAGATAT
TCAAAATATTTTCATAAAATACAAACAACTTGGTAATTTGCCGGGTATCTCATAT
TCTTCAATAGTTAATGCTATTTGCTCGGATTATGACAACAATTTCGGAGATGGGA
AGCGAAAAAAATCTTACGAAAATGATCGCAAAAAGCATTTGGAGACTAATGTATA
CTCCATAAATTATATTTCTGAATTGCTTACAGATACCGATGTTTCATCAAATATC
AAGATGAGATATAAAGAGCTTGAGCAAAATTATCAGGTTTGCAAAGAAAATTTTA
ATGCCACAAACTGGATGAATATTAAAAATATAAAACAATCTGAAAAAACAAACCT
TATTAAAGATTTGTTAGATATACTTAAATCGATTCAACGTTTCTATGATTTGTTT
GATATTGTTGACGAAGATAAAAATCCAAGTGCTGAATTTTATACCTGGTTATCAA
AAAATGCTGAAAAGCTTGACTTTGAATTCAATTCTGTATATAACAAGTCACGAAA
CTATCTCACCAGGAAACAATACTCTGATAAAAAAATCAAGCTGAATTTTGATTCT
CCAACATTGGCCAAAGGGTGGGATGCTAACAAAGAAATAGATAACTCCACGATTA
TAATGCGTAAATTTAATAATGACAGAGGCGATTATGATTACTTCCTTGGCATATG
GAATAAATCCACACCTGCAAATGAAAAAATAATCCCACTGGAGGATAATGGATTA
TTCGAAAAAATGCAATATAAGCTGTATCCAGATCCTAGTAAGATGTTACCGAAAC
AATTTCTATCAAAAATATGGAAGGCAAAGCATCCTACGACACCTGAATTTGATAA
AAAATATAAAGAGGGAAGACATAAAAAAGGTCCTGATTTCGAAAAAGAATTCCTG
CATGAATTGATTGATTGCTTCAAACATGGTCTTGTTAATCACGATGAAAAATATC
AGGATGTTTTTGGCTTCAATCTCCGTAACACTGAAGATTATAATTCATATACAGA
GTTTCTCGAAGATGTGGAAAGATGCAATTACAATCTTTCATTTAACAAAATTGCT
GATACTTCAAACCTTATTAATGATGGGAAATTGTATGTATTTCAGATATGGTCAA
AAGACTTTTCTATTGATTCAAAAGGTACTAAAAACTTGAATACAATCTATTTTGA
ATCACTATTTTCAGAAGAAAACATGATAGAAAAAATGTTCAAGCTTTCTGGAGAG
GCTGAGATATTCTATCGACCAGCATCGTTGAATTATTGTGAAGATATCATAAAAA
AAGGTCATCACCATGCAGAATTAAAAGATAAGTTTGACTATCCTATAATAAAAGA
TAAGCGATATTCACAAGATAAGTTTTTCTTTCATGTGCCAATGGTTATAAATTAT
AAATCTGAGAAACTGAATTCCAAAAGCCTTAACAACCGAACAAATGAAAACCTGG
GACAGTTTACACATATTATAGGTATAGACAGGGGCGAGCGGCACTTGATTTATTT
AACTGTTGTTGATGTTTCCACTGGTGAAATCGTTGAACAGAAACATCTGGACGAA
ATTATCAATACTGATACCAAGGGAGTTGAACACAAAACCCATTATTTGAATAAAT
TGGAAGAAAAATCTAAAACAAGAGATAACGAGCGTAAATCATGGGAAGCTATTGA
AACTATCAAAGAATTAAAAGAAGGCTATATTTCTCATGTAATTAATGAAATACAA
AAGCTGCAAGAAAAATATAATGCCTTAATCGTAATGGAAAATCTTAACTATGGGT
TCAAAAACTCACGAATCAAAGTTGAAAAACAGGTTTATCAAAAATTCGAGACAGC
ATTGATTAAAAAGTTCAATTATATTATTGATAAAAAAGATCCAGAAACCTATATA
CATGGTTACCAGCTTACAAATCCTATTACCACTCTGGATAAGATTGGAAATCAAT
CTGGAATAGTGCTGTATATTCCTGCGTGGAATACTTCTAAGATAGATCCCGTCAC
AGGATTTGTAAACCTTCTGTACGCAGATGATTTGAAGTATAAAAATCAGGAGCAG
GCCAAATCATTCATTCAGAAAATAGACAACATATATTTTGAAAATGGAGAGTTTA
AATTTGATATTGATTTTTCCAAATGGAATAATCGCTACTCAATAAGTAAAACTAA
ATGGACGTTAACAAGTTATGGGACTCGCATCCAGACATTTAGAAATCCCCAGAAA
AACAATAAGTGGGATTCTGCTGAATATGATTTGACAGAAGAGTTTAAATTAATTT
TAAATATAGACGGAACGTTAAAGTCACAGGACGTAGAAACATACAAAAAATTCAT
GTCTTTATTTAAACTAATGCTACAGCTTCGAAACTCTGTTACAGGAACCGACATT
GATTATATGATCTCTCCTGTCACTGATAAAACAGGAACACATTTCGATTCAAGAG
AAAATATTAAAAATCTTCCTGCCGATGCAGATGCCAATGGTGCCTACAACATTGC
GCGCAAAGGAATAATGGCTATTGAAAATATAATGAACGGTATAAGCGATCCACTA
AAAATAAGCAACGAAGACTATTTAAAGTATATTCAGAATCAACAGGAATAA
SEQ ATGACCCAATTTGAAGGTTTTACCAATTTATACCAAGTTTCGAAGACCCTTCGTT
ID TTGAACTGATTCCCCAAGGAAAAACACTCAAACATATCCAGGAGCAAGGGTTCAT
NO: TGAGGAGGATAAAGCTCGCAATGACCATTACAAAGAGTTAAAACCAATCATTGAC
32 CGCATCTATAAGACTTATGCTGATCAATGTCTCCAACTGGTACAGCTTGACTGGG
AGAATCTATCTGCAGCCATAGACTCCTATCGTAAGGAAAAAACCGAAGAAACACG
AAATGCGCTGATTGAGGAGCAAGCAACATATAGAAATGCGATTCATGACTACTTT
ATAGGTCGGACGGATAATCTGACAGATGCCATAAATAAGCGCCATGCTGAAATCT
ATAAAGGACTTTTTAAAGCTGAACTTTTCAATGGAAAAGTTTTAAAGCAATTAGG
GACCGTAACCACGACAGAACATGAAAATGCTCTACTCCGTTCGTTTGACAAATTT
ACGACCTATTTTTCCGGCTTTTATGAAAACCGAAAAAATGTCTTTAGCGCTGAAG
ATATCAGCACGGCAATTCCCCATCGAATCGTCCAGGACAATTTCCCTAAATTTAA
GGAAAACTGCCATATTTTTACAAGATTGATAACCGCAGTTCCTTCTTTGCGGGAG
CATTTTGAAAATGTCAAAAAGGCCATTGGAATCTTTGTTAGTACGTCTATTGAAG
AAGTCTTTTCCTTTCCCTTTTATAATCAACTTCTAACCCAAACGCAAATTGATCT
TTATAATCAACTTCTCGGCGGCATATCTAGGGAAGCAGGCACAGAAAAAATCAAG
GGACTTAATGAAGTTCTCAATCTGGCTATCCAAAAAAATGATGAAACAGCCCATA
TAATCGCGTCCCTGCCGCATCGTTTTATTCCTCTTTTTAAACAAATTCTTTCCGA
TCGAAATACGTTATCCTTTATTTTGGAAGAATTCAAAAGCGATGAGGAAGTCATC
CAATCCTTCTGCAAATATAAAACCCTCTTGAGAAACGAAAATGTACTGGAGACTG
CAGAAGCCCTTTTCAATGAATTAAATTCCATTGATTTGACTCATATCTTTATTTC
CCATAAAAAGTTAGAAACCATCTCTTCAGCGCTTTGTGACCATTGGGATACCTTG
CGCAATGCACTTTACGAAAGACGGATTTCTGAACTCACTGGCAAAATAACAAAAA
GTGCCAAAGAAAAAGTTCAAAGGTCATTAAAACATGAGGATATAAATCTCCAAGA
AATTATTTCTGCTGCAGGAAAAGAACTATCAGAAGCATTCAAACAAAAAACAAGT
GAAATTCTTTCCCATGCCCATGCTGCACTTGACCAGCCTCTTCCCACAACATTAA
AAAAACAGGAAGAAAAAGAAATCCTCAAATCACAGCTCGATTCGCTTTTAGGCCT
TTATCATCTTCTTGATTGGTTTGCTGTCGATGAAAGCAATGAAGTCGACCCAGAA
TTCTCAGCACGGCTGACAGGCATTAAACTAGAAATGGAACCAAGCCTTTCGTTTT
ATAATAAAGCAAGAAATTATGCGACAAAAAAGCCCTATTCGGTGGAAAAATTTAA
ATTGAATTTTCAAATGCCAACCCTTGCCTCTGGTTGGGATGTCAATAAAGAAAAA
AATAATGGAGCTATTTTATTCGTAAAAAATGGTCTCTATTACCTTGGTATCATGC
CTAAACAGAAGGGGCGCTATAAAGCCCTGTCTTTTGAGCCGACAGAAAAAACATC
AGAAGGATTCGATAAGATGTACTATGACTACTTCCCAGATGCCGCAAAAATGATT
CCTAAGTGTTCCACTCAGCTAAAGGCTGTAACCGCTCATTTTCAAACTCATACCA
CCCCCATTCTTCTCTCAAATAATTTCATTGAACCTCTTGAAATCACAAAAGAAAT
TTATGACCTGAACAATCCTGAAAAGGAGCCTAAAAAGTTTCAAACGGCTTATGCA
AAGAAGACAGGCGATCAAAAAGGCTATAGAGAAGCGCTTTGCAAATGGATTGACT
TTACGCGGGATTTTCTCTCTAAATATACGAAAACAACTTCAATCGATTTATCTTC
ACTCCGCCCTTCTTCGCAATATAAAGATTTAGGGGAATATTACGCCGAACTGAAT
CCGCTTCTCTATCATATCTCCTTCCAACGAATTGCTGAAAAGGAAATCATGGATG
CTGTAGAAACGGGAAAATTGTATCTGTTCCAAATCTACAATAAGGATTTTGCGAA
GGGCCATCACGGGAAACCAAATCTCCACACCCTGTATTGGACAGGTCTCTTCAGT
CCTGAAAACCTTGCGAAAACCAGCATCAAACTTAATGGTCAAGCAGAATTGTTCT
ATCGACCTAAAAGCCGCATGAAGCGGATGGCCCATCGTCTTGGGGAAAAAATGCT
GAACAAAAAACTAAAGGACCAGAAGACACCGATTCCAGATACCCTCTACCAAGAA
CTGTACGATTATGTCAACCACCGGCTAAGCCATGATCTTTCCGATGAAGCAAGGG
CCCTGCTTCCAAATGTTATCACCAAAGAAGTCTCCCATGAAATTATAAAGGATCG
GCGGTTTACTTCCGATAAATTTTTCTTCCATGTTCCCATTACACTGAATTATCAA
GCAGCCAATAGTCCCAGTAAATTCAACCAGCGTGTCAATGCCTACCTTAAGGAGC
ATCCGGAAACGCCCATCATTGGTATCGATCGTGGAGAACGCAATCTAATCTATAT
TACCGTCATTGACAGTACTGGGAAAATTTTGGAGCAGCGTTCCCTGAATACCATC
CAGCAATTTGACTACCAAAAAAAATTGGACAACAGGGAAAAAGAGCGTGTTGCCG
CCCGTCAAGCCTGGTCCGTCGTCGGAACGATCAAAGACCTTAAACAAGGCTACTT
GTCACAGGTCATCCATGAAATTGTAGACCTGATGATTCATTACCAAGCTGTTGTC
GTCCTTGAAAACCTCAACTTCGGATTTAAATCAAAACGGACAGGCATTGCCGAAA
AAGCAGTCTACCAACAATTTGAAAAGATGCTAATAGATAAACTCAACTGTTTGGT
TCTCAAAGATTATCCTGCTGAGAAAGTGGGAGGCGTCTTAAACCCGTATCAACTT
ACAGATCAGTTCACGAGCTTTGCAAAAATGGGCACGCAAAGCGGCTTCCTTTTCT
ATGTACCGGCCCCTTATACCTCAAAGATTGATCCCCTGACTGGTTTTGTCGATCC
CTTTGTATGGAAGACCATTAAAAATCATGAAAGTCGGAAGCATTTCCTAGAAGGA
TTTGATTTCCTGCATTATGATGTCAAAACAGGTGATTTTATCCTCCATTTTAAAA
TGAATCGGAATCTCTCTTTCCAGAGAGGGCTTCCTGGCTTCATGCCAGCTTGGGA
TATTGTTTTCGAAAAGAATGAAACCCAATTTGATGCAAAAGGGACGCCCTTCATT
GCAGGAAAACGAATTGTTCCTGTAATCGAAAATCATCGTTTTACGGGTCGTTACA
GAGACCTCTATCCCGCTAATGAACTCATTGCCCTTCTGGAAGAAAAAGGCATTGT
CTTTAGAGACGGAAGTAATATATTACCCAAACTTTTAGAAAATGATGATTCTCAT
GCAATTGATACGATGGTCGCCTTGATTCGCAGTGTACTCCAAATGAGAAACAGCA
ATGCCGCAACGGGGGAAGACTACATCAACTCTCCCGTTAGGGATCTGAACGGGGT
GTGTTTCGACAGTCGATTCCAAAATCCAGAATGGCCAATGGATGCGGATGCCAAC
GGAGCTTATCATATTGCCTTAAAAGGGCAGCTTCTTCTGAACCACCTCAAAGAAA
GCAAAGATCTGAAATTACAAAACGGCATCAGCAACCAAGATTGGCTGGCCTACAT
TCAGGAACTGAGAAACTGA
SEQ ATGGCCGTCAAATCCATCAAAGTGAAACTTCGTCTCGACGATATGCCGGAGATTC
ID GGGCCGGTCTATGGAAACTTCATAAGGAAGTCAATGCGGGGGTTCGATATTACAC
NO: GGAATGGCTCAGTCTTCTCCGTCAAGAGAACTTGTATCGAAGAAGTCCGAATGGG
33 GACGGAGAGCAAGAATGTGATAAGACTGCAGAAGAATGCAAAGCCGAATTGTTGG
AGCGGCTGCGCGCGCGTCAAGTGGAGAATGGACACCGTGGTCCGGCGGGATCGGA
CGATGAATTGCTGCAGTTGGCGCGTCAACTCTATGAGTTGTTGGTTCCGCAGGCG
ATAGGTGCGAAAGGCGACGCGCAGCAAATTGCCCGCAAATTTTTGAGCCCCTTGG
CCGACAAGGACGCAGTTGGTGGGCTTGGAATCGCGAAGGCGGGGAACAAACCGCG
GTGGGTTCGCATGCGCGAAGCGGGGGAACCAGGCTGGGAAGAGGAGAAGGAGAAG
GCTGAGACGAGGAAATCTGCGGATCGGACTGCGGATGTTTTGCGCGCGCTCGCGG
ATTTTGGGTTAAAGCCACTGATGCGCGTATACACCGATTCTGAGATGTCATCGGT
GGAGTGGAAACCGCTTCGGAAGGGACAAGCCGTTCGGACGTGGGATAGGGACATG
TTCCAACAAGCTATCGAACGGATGATGTCGTGGGAGTCGTGGAATCAGCGCGTTG
GGCAAGAGTACGCGAAACTCGTAGAACAAAAAAATCGATTTGAGCAGAAGAATTT
CGTCGGCCAGGAACATCTGGTCCATCTCGTCAATCAGTTGCAACAAGATATGAAA
GAAGCATCGCCCGGACTCGAATCGAAAGAGCAAACCGCGCACTATGTGACGGGAC
GGGCATTGCGCGGATCGGACAAGGTATTTGAGAAGTGGGGGAAACTCGCCCCCGA
TGCACCTTTCGATTTGTACGACGCCGAAATCAAGAATGTGCAGAGACGTAACACG
AGACGATTCGGATCACATGACTTGTTCGCAAAATTGGCAGAGCCAGAGTATCAGG
CCCTGTGGCGCGAAGATGCTTCGTTTCTCACGCGTTACGCGGTGTACAACAGCAT
CCTTCGCAAACTGAATCACGCCAAAATGTTCGCGACGTTTACTTTGCCGGATGCA
ACGGCGCACCCGATTTGGACTCGCTTCGATAAATTGGGTGGGAATTTGCACCAGT
ACACCTTTTTGTTCAACGAATTTGGAGAACGCAGGCACGCGATTCGTTTTCACAA
GCTATTGAAAGTCGAGAATGGTGTCGCAAGAGAAGTTGATGATGTCACCGTGCCC
ATTTCAATGTCAGAGCAATTGGATAATCTGCTTCCCAGAGATCCCAATGAACCGA
TTGCGCTATATTTTCGAGATTACGGAGCCGAACAGCATTTCACAGGTGAATTTGG
TGGCGCGAAGATCCAGTGCCGCCGGGATCAGCTGGCTCATATGCACCGACGCAGA
GGGGCGAGGGATGTTTATCTCAATGTCAGCGTACGTGTGCAGAGTCAGTCTGAGG
CGCGGGGAGAACGTCGCCCGCCGTATGCGGCAGTATTTCGTCTGGTCGGGGACAA
CCATCGCGCGTTTGTCCATTTCGATAAACTATCGGATTATCTTGCGGAACATCCG
GATGATGGGAAGCTCGGGTCGGAGGGGTTGCTTTCCGGGCTGCGGGTGATGAGTG
TCGATCTCGGCCTTCGCACATCTGCATCGATTTCCGTTTTTCGCGTTGCCCGGAA
GGACGAGTTGAAGCCGAACTCAAAAGGTCGTGTACCGTTTTTCTTTCCGATAAAA
GGGAATGACAATCTCGTCGCGGTTCATGAGCGATCACAACTCTTGAAGCTGCCTG
GCGAAACGGAGTCGAAGGACCTGCGTGCTATCCGAGAAGAACGCCAACGGACATT
GCGGCAGTTGCGGACGCAACTGGCGTATTTGCGGCTGCTCGTGCGGTGTGGGTCG
GAAGATGTGGGGCGGCGTGAACGGAGTTGGGCAAAGCTTATCGAGCAGCCGGTGG
ATGCGGCCAATCACATGACACCGGATTGGCGCGAGGCTTTTGAAAACGAACTTCA
GAAGCTTAAGTCACTCCATGGTATCTGTAGCGACAAGGAATGGATGGATGCTGTC
TACGAGAGCGTTCGCCGCGTGTGGCGTCACATGGGCAAACAGGTTCGCGATTGGC
GAAAGGACGTACGAAGCGGAGAGCGGCCCAAGATTCGCGGCTATGCGAAAGACGT
GGTCGGTGGAAACTCGATTGAGCAAATCGAGTATCTGGAACGTCAGTACAAGTTC
CTCAAGAGTTGGAGCTTCTTTGGTAAGGTGTCGGGACAAGTGATTCGTGCGGAGA
AGGGATCTCGTTTTGCGATCACGCTGCGCGAACACATTGATCACGCGAAGGAAGA
TCGGCTGAAGAAATTGGCGGATCGCATCATTATGGAGGCTCTCGGCTATGTGTAC
GCGTTGGATGAGCGCGGCAAAGGAAAGTGGGTTGCGAAGTATCCGCCGTGCCAGC
TCATCCTGCTGGAGGAATTGAGCGAGTACCAGTTCAATAACGACAGGCCTCCGAG
CGAAAACAACCAGTTGATGCAATGGAGTCATCGCGGCGTGTTCCAGGAGTTGATA
AATCAGGCCCAAGTCCATGATTTACTCGTTGGGACGATGTATGCAGCGTTCTCGT
CGCGATTCGACGCGCGAACTGGGGCACCGGGTATCCGCTGTCGCCGGGTTCCGGC
GCGTTGCACCCAGGAGCACAATCCAGAACCATTTCCTTGGTGGCTGAACAAGTTT
GTGGTGGAACATACGTTGGATGCTTGTCCCCTACGCGCAGACGACCTCATCCCAA
CGGGTGAAGGAGAGATTTTTGTCTCGCCGTTCAGCGCGGAGGAGGGGGACTTTCA
TCAGATTCACGCCGACCTGAATGCGGCGCAAAATCTGCAGCAGCGACTCTGGTCT
GATTTTGATATCAGTCAAATTCGGTTGCGGTGTGATTGGGGTGAAGTGGACGGTG
AACTCGTTCTGATCCCAAGGCTTACAGGAAAACGAACGGCGGATTCATATAGCAA
CAAGGTGTTTTATACCAATACAGGTGTCACCTATTATGAGCGAGAGCGGGGGAAG
AAGCGGAGAAAGGTTTTCGCGCAAGAGAAATTGTCGGAGGAAGAGGCGGAGTTGC
TCGTGGAAGCAGACGAGGCGAGGGAGAAATCGGTCGTTTTGATGCGTGATCCGTC
TGGCATCATCAATCGGGGAAATTGGACCAGGCAAAAGGAATTTTGGTCGATGGTG
AACCAGCGGATCGAAGGATACTTGGTCAAGCAGATTCGCTCGCGCGTTCCATTAC
AAGATAGTGCGTGTGAAAACACGGGGGATATTTAA
SEQ ATGGCGACACGCAGTTTTATTTTAAAAATTGAACCAAATGAAGAAGTTAAAAAGG
ID GATTATGGAAGACGCATGAGGTATTGAATCATGGAATTGCCTACTACATGAATAT
NO: TCTGAAACTAATTAGACAGGAAGCTATTTATGAACATCATGAACAAGATCCTAAA
34 AATCCGAAAAAAGTTTCAAAAGCAGAAATACAAGCCGAGTTATGGGATTTTGTTT
TAAAAATGCAAAAATGTAATAGTTTTACACATGAAGTTGACAAAGATGTTGTTTT
TAACATCCTGCGTGAACTATATGAAGAGTTGGTCCCTAGTTCAGTCGAGAAAAAG
GGTGAAGCCAATCAATTATCGAATAAGTTTCTGTACCCGCTAGTTGATCCGAACA
GTCAAAGTGGGAAAGGGACGGCATCATCCGGACGTAAACCTCGGTGGTATAATTT
AAAAATAGCAGGCGACCCATCGTGGGAGGAAGAAAAGAAAAAATGGGAAGAGGAT
AAAAAGAAAGATCCCCTTGCTAAAATCTTAGGTAAGTTAGCAGAATATGGGCTTA
TTCCGCTATTTATTCCATTTACTGACAGCAACGAACCAATTGTAAAAGAAATTAA
ATGGATGGAAAAAAGTCGTAATCAAAGTGTCCGGCGACTTGATAAGGATATGTTT
ATCCAAGCATTAGAGCGTTTTCTTTCATGGGAAAGCTGGAACCTTAAAGTAAAGG
AAGAGTATGAAAAAGTTGAAAAGGAACACAAAACACTAGAGGAAAGGATAAAAGA
GGACATTCAAGCATTTAAATCCCTTGAACAATATGAAAAAGAACGGCAGGAGCAA
CTTCTTAGAGATACATTGAATACAAATGAATACCGATTAAGCAAAAGAGGATTAC
GTGGTTGGCGTGAAATTATCCAAAAATGGCTAAAGATGGATGAAAATGAACCATC
AGAAAAATATTTAGAAGTATTTAAAGATTATCAACGGAAACATCCACGAGAAGCC
GGGGACTATTCTGTCTATGAATTTTTAAGCAAGAAAGAAAATCATTTTATTTGGC
GAAATCATCCTGAATATCCTTATTTGTATGCTACATTTTGTGAAATTGACAAAAA
AAAGAAAGACGCTAAGCAACAGGCAACTTTTACTTTGGCTGACCCGATTAACCAT
CCGTTATGGGTACGATTTGAAGAAAGAAGCGGTTCGAACTTAAACAAATATCGAA
TTTTAACAGAGCAATTACACACTGAAAAGTTAAAAAAGAAATTAACAGTTCAACT
TGATCGTTTAATTTATCCAACTGAATCCGGCGGTTGGGAGGAAAAAGGTAAAGTA
GATATCGTTTTGTTGCCGTCAAGACAATTTTATAATCAAATCTTCCTTGATATAG
AAGAAAAGGGGAAACATGCTTTTACTTATAAGGATGAAAGTATTAAATTCCCCCT
TAAAGGTACACTTGGTGGTGCAAGAGTGCAGTTTGACCGTGACCATTTGCGGAGA
TATCCGCATAAAGTAGAATCAGGAAATGTTGGACGGATTTATTTTAACATGACAG
TAAATATTGAACCAACTGAGAGCCCTGTTAGTAAGTCTTTGAAAATACATAGGGA
CGATTTCCCCAAGTTCGTTAATTTTAAACCGAAAGAGCTCACCGAATGGATAAAA
GATAGTAAAGGGAAAAAATTAAAAAGTGGTATAGAATCCCTTGAAATTGGTCTAC
GGGTGATGAGTATCGACTTAGGTCAACGTCAAGCGGCTGCTGCATCGATTTTTGA
AGTAGTTGATCAGAAACCGGATATTGAAGGGAAGTTATTTTTTCCAATCAAAGGA
ACTGAGCTTTATGCTGTTCACCGGGCAAGTTTTAACATTAAATTACCGGGTGAAA
CATTAGTAAAATCACGGGAAGTATTGCGGAAAGCTCGGGAGGACAACTTAAAATT
AATGAATCAAAAGTTAAACTTTCTAAGAAATGTTCTACATTTCCAACAGTTTGAA
GATATCACAGAAAGAGAGAAGCGTGTAACTAAATGGATTTCTAGACAAGAAAATA
GTGATGTTCCTCTTGTATATCAAGATGAGCTAATTCAAATTCGTGAATTAATGTA
TAAACCCTATAAAGATTGGGTTGCCTTTTTAAAACAACTCCATAAACGGCTAGAA
GTCGAGATTGGCAAAGAGGTTAAGCATTGGCGAAAATCATTAAGTGACGGGAGAA
AAGGTCTTTACGGAATCTCCCTAAAAAATATTGATGAAATTGATCGAACAAGGAA
ATTCCTTTTAAGATGGAGCTTACGTCCAACAGAACCTGGGGAAGTAAGACGCTTG
GAACCAGGACAGCGTTTTGCGATTGATCAATTAAACCACCTAAATGCATTAAAAG
AAGATCGATTAAAAAAGATGGCAAATACGATTATCATGCATGCCTTAGGTTACTG
TTATGATGTAAGAAAGAAAAAGTGGCAGGCAAAAAATCCAGCATGTCAAATTATT
TTATTTGAAGATTTATCTAACTACAATCCTTACGAGGAAAGGTCCCGTTTTGAAA
ACTCAAAACTGATGAAGTGGTCACGGAGAGAAATTCCACGACAAGTCGCCTTACA
AGGTGAAATTTACGGATTACAAGTTGGGGAAGTAGGTGCCCAATTCAGTTCAAGA
TTCCATGCGAAAACCGGGTCGCCGGGAATTCGTTGCAGTGTTGTAACGAAAGAAA
AATTGCAGGATAATCGCTTTTTTAAAAATTTACAAAGAGAAGGACGACTTACTCT
TGATAAAATCGCAGTTTTAAAAGAAGGAGACTTATATCCAGATAAAGGTGGAGAA
AAGTTTATTTCTTTATCAAAGGATCGAAAGTTGGTAACTACGCATGCTGATATTA
ACGCGGCCCAAAATTTACAGAAGCGTTTTTGGACAAGAACACATGGATTTTATAA
AGTTTACTGCAAAGCCTATCAGGTTGATGGACAAACTGTTTATATTCCGGAGAGC
AAGGACCAAAAACAAAAAATAATTGAAGAATTTGGGGAAGGCTATTTTATTTTAA
AAGATGGTGTATATGAATGGGGTAATGCGGGGAAACTAAAAATTAAAAAAGGTTC
CTCTAAACAATCATCGAGTGAATTAGTAGATTCGGACATACTGAAAGATTCATTT
GATTTAGCAAGTGAACTTAAGGGAGAGAAACTCATGTTATATCGAGATCCGAGTG
GAAACGTATTTCCTTCCGACAAGTGGATGGCAGCAGGAGTATTTTTTGGCAAATT
AGAAAGAATATTGATTTCTAAGTTAACAAATCAATACTCAATATCAACAATAGAA
GATGATTCTTCAAAACAATCAATGTAA
SEQ ATGCCCACCCGCACCATCAATCTGAAACTTGTTCTTGGGAAAAATCCTGAAAACG
ID CAACATTGCGACGCGCCCTATTTTCGACACACCGTTTGGTTAACCAAGCGACGAA
NO: ACGTATTGAGGAATTCTTGTTGCTGTGTCGTGGAGAAGCCTACAGAACAGTGGAT
35 AATGAGGGGAAGGAAGCCGAGATTCCACGTCATGCAGTCCAAGAAGAAGCTCTTG
CCTTTGCCAAAGCTGCTCAACGCCACAACGGCTGTATATCCACCTATGAAGACCA
AGAGATTCTTGATGTACTGCGGCAACTGTACGAACGTCTTGTTCCTTCGGTCAAC
GAAAACAACGAGGCAGGCGATGCTCAAGCTGCTAACGCCTGGGTCAGTCCGCTCA
TGTCGGCAGAAAGCGAAGGAGGCTTGTCGGTCTACGACAAGGTGCTTGATCCACC
GCCGGTTTGGATGAAGCTTAAAGAAGAAAAGGCTCCAGGATGGGAAGCCGCTTCT
CAAATTTGGATTCAGAGTGATGAGGGACAGTCGTTACTTAATAAGCCAGGTAGCC
CTCCCCGCTGGATTCGAAAACTGCGATCTGGGCAACCGTGGCAAGATGATTTCGT
CAGTGACCAAAAGAAAAAGCAAGATGAGCTGACCAAAGGGAACGCACCACTTATA
AAACAACTCAAAGAAATGGGGTTGTTGCCTCTTGTTAACCCATTTTTTAGACATC
TTCTTGACCCTGAAGGTAAAGGCGTGAGTCCATGGGACCGTCTTGCTGTACGCGC
TGCAGTGGCTCACTTTATCTCCTGGGAAAGTTGGAATCATAGAACACGTGCAGAA
TACAATTCCTTGAAACTACGGCGAGACGAGTTTGAGGCAGCATCCGACGAATTCA
AAGACGATTTTACTTTGCTCCGACAATATGAAGCCAAACGCCATAGTACATTGAA
AAGCATCGCGCTGGCCGACGATTCGAACCCTTACCGGATTGGAGTACGTTCTCTG
CGTGCCTGGAACCGCGTTCGTGAAGAATGGATAGACAAGGGTGCAACAGAAGAAC
AACGCGTGACCATATTGTCAAAGCTTCAAACACAACTTCGGGGAAAATTCGGCGA
TCCCGATCTGTTCAACTGGCTAGCTCAGGATAGGCATGTCCATTTGTGGTCTCCT
CGGGACAGCGTGACACCATTGGTTCGCATCAATGCGGTAGATAAAGTTCTGCGTC
GACGAAAACCGTATGCATTGATGACCTTTGCCCATCCCCGCTTCCACCCTCGATG
GATACTGTACGAGGCTCCAGGAGGAAGCAATCTCCGTCAATATGCATTGGATTGT
ACAGAAAACGCTCTACACATCACGTTGCCTTTGCTTGTCGACGATGCGCACGGAA
CCTGGATTGAAAAAAAGATCAGGGTGCCGCTGGCACCATCCGGACAAATTCAAGA
TTTAACTCTGGAAAAACTTGAGAAGAAAAAAAATCGTTTATACTACCGTTCCGGT
TTTCAGCAGTTTGCCGGCTTGGCTGGCGGAGCTGAGGTTCTTTTCCACAGACCCT
ATATGGAACACGACGAACGCAGCGAGGAGTCTCTTTTGGAACGTCCGGGAGCCGT
TTGGTTCAAATTGACCCTGGATGTGGCAACACAGGCTCCCCCGAACTGGCTTGAT
GGTAAGGGCCGTGTCCGTACACCGCCGGAGGTACATCATTTTAAAACCGCATTGT
CGAATAAAAGCAAACATACACGTACGCTGCAGCCGGGTCTCCGTGTCTTGTCAGT
AGACTTGGGCATGCGAACATTCGCCTCCTGCTCAGTATTTGAACTCATCGAGGGA
AAGCCTGAGACAGGCCGTGCCTTCCCTGTTGCCGATGAGAGATCAATGGACAGCC
CGAATAAACTGTGGGCCAAGCATGAACGTAGTTTTAAACTGACGCTCCCCGGCGA
AACCCCTTCTCGAAAGGAAGAGGAAGAGCGTAGCATAGCAAGAGCGGAAATTTAT
GCACTGAAACGCGACATACAACGCCTCAAAAGCCTACTCCGCTTAGGTGAAGAAG
ATAACGATAACCGTCGTGATGCATTGCTTGAACAGTTCTTTAAAGGATGGGGAGA
AGAAGACGTTGTGCCTGGACAAGCGTTTCCACGCTCTCTTTTCCAAGGGTTGGGA
GCTGCCCCGTTTCGCTCAACTCCAGAGTTATGGCGTCAGCATTGCCAAACATATT
ATGACAAAGCGGAAGCCTGTCTGGCTAAACATATCAGTGATTGGCGCAAGCGAAC
TCGTCCCCGTCCGACATCGCGGGAGATGTGGTACAAAACACGTTCCTATCATGGC
GGCAAGTCCATTTGGATGTTGGAATATCTTGATGCCGTTCGAAAACTGCTTCTCA
GTTGGAGCTTACGTGGTCGTACTTACGGTGCCATTAATCGCCAGGATACAGCCCG
GTTTGGTTCTTTGGCATCACGGCTGCTCCACCATATCAATTCCCTAAAGGAAGAC
CGCATCAAAACAGGAGCCGACTCTATCGTTCAGGCTGCTCGCGGGTATATTCCTC
TCCCTCATGGCAAGGGTTGGGAACAAAGATATGAGCCTTGTCAGCTCATATTATT
TGAAGACCTCGCACGATATCGCTTTCGCGTGGATCGACCTCGTCGAGAGAACAGC
CAACTCATGCAGTGGAACCATCGAGCCATCGTGGCAGAAACAACGATGCAAGCCG
AACTCTACGGACAAATTGTCGAAAATACTGCAGCGGGGTTCAGCAGTCGTTTTCA
CGCGGCGACAGGTGCCCCCGGTGTACGTTGTCGTTTTCTTCTAGAAAGAGACTTT
GATAACGATTTGCCCAAACCGTACCTTCTCAGGGAACTTTCTTGGATGCTCGGCA
ATACAAAAGTCGAGTCTGAAGAAGAAAAGCTTCGATTGCTGTCTGAAAAAATCAG
GCCAGGCAGTCTTGTTCCTTGGGATGGAGGCGAACAGTTCGCTACCCTGCATCCC
AAAAGACAAACACTTTGCGTCATTCATGCCGATATGAATGCTGCCCAAAATTTAC
AACGCCGGTTTTTCGGTCGATGCGGCGAGGCCTTTCGGCTTGTTTGTCAACCCCA
CGGTGACGACGTGTTACGACTCGCATCCACCCCAGGAGCTCGTCTTCTTGGAGCC
CTGCAGCAGCTTGAAAATGGACAAGGAGCTTTCGAGTTGGTTCGAGACATGGGGT
CAACAAGTCAAATGAACCGGTTCGTCATGAAGTCTTTGGGAAAAAAGAAAATAAA
ACCCCTTCAGGACAACAATGGAGACGACGAGCTTGAAGACGTGTTGTCCGTACTC
CCGGAGGAAGACGACACAGGACGTATCACAGTCTTCCGCGATTCATCAGGAATCT
TTTTTCCTTGCAACGTCTGGATACCGGCCAAACAGTTTTGGCCAGCAGTACGCGC
CATGATTTGGAAGGTCATGGCTTCCCATTCTTTGGGGTGA
SEQ ATGACAAAGTTAAGACACCGACAGAAAAAATTAACACACGACTGGGCTGGCTCCA
ID AAAAGAGGGAAGTATTAGGCTCAAATGGCAAGCTTCAGAATCCGTTGTTAATGCC
NO: GGTTAAAAAAGGTCAGGTTACTGAGTTCCGGAAAGCGTTTTCTGCGTATGCTCGC
36 GCAACGAAAGGAGAAATGACTGACGGCCGAAAGAATATGTTTACGCATAGTTTCG
AGCCATTTAAGACAAAGCCCTCGCTTCATCAGTGTGAATTGGCAGATAAAGCATA
TCAATCTTTACATTCGTATCTGCCTGGTTCTCTTGCTCATTTTCTATTATCTGCT
CACGCATTAGGTTTTCGTATTTTTTCAAAATCTGGTGAAGCAACTGCATTCCAGG
CATCCTCTAAAATTGAAGCTTACGAATCAAAATTGGCAAGCGAATTAGCTTGTGT
AGATTTATCTATTCAAAACTTGACTATTTCAACGCTTTTTAATGCGCTTACAACG
TCTGTAAGAGGGAAGGGCGAAGAAACTAGCGCTGACCCCTTAATTGCACGATTTT
ACACCTTACTTACTGGCAAGCCTCTGTCTCGAGACACTCAAGGGCCTGAACGTGA
TTTAGCAGAAGTTATCTCGCGTAAGATAGCTAGTTCTTTTGGCACATGGAAAGAA
ATGACGGCAAACCCTCTTCAGTCATTACAATTTTTTGAAGAGGAACTCCATGCGC
TGGATGCCAATGTCTCGCTCTCACCCGCCTTCGACGTTTTAATTAAAATGAATGA
TTTGCAGGGCGATTTAAAAAATCGAACCATTGTTTTTGATCCTGACGCCCCTGTT
TTTGAATATAACGCAGAAGACCCTGCCGACATAATTATTAAACTTACAGCTCGTT
ACGCTAAAGAAGCTGTCATCAAAAATCAAAACGTAGGAAATTACGTTAAAAACGC
TATTACTACCACAAATGCCAATGGTCTTGGTTGGCTTTTGAACAAAGGTTTGTCG
TTACTCCCTGTCTCGACCGATGACGAATTGCTAGAGTTTATTGGCGTTGAACGAT
CTCATCCCTCATGCCATGCCTTAATTGAATTGATTGCACAATTAGAAGCCCCCGA
GCTCTTTGAGAAGAACGTATTTTCAGATACTCGTTCTGAAGTTCAAGGTATGATT
GATTCAGCTGTTTCTAATCATATTGCTCGTCTTTCCAGCTCTAGAAATAGCTTGT
CAATGGATAGTGAAGAATTAGAACGTTTAATCAAAAGCTTTCAGATACACACACC
TCATTGCTCACTTTTTATTGGCGCCCAATCACTTTCACAGCAGTTAGAATCTTTG
CCTGAAGCCCTTCAATCGGGCGTTAATTCAGCCGATATTTTACTAGGCTCTACTC
AATATATGCTCACCAATTCTTTGGTTGAAGAGTCAATTGCAACTTATCAAAGAAC
ACTTAATCGCATCAATTACTTGTCAGGTGTTGCAGGTCAGATTAACGGCGCAATA
AAGCGAAAAGCGATAGATGGAGAAAAAATTCACTTGCCTGCAGCTTGGTCAGAGT
TGATATCTTTACCATTTATAGGCCAGCCTGTTATAGATGTTGAAAGCGATTTAGC
TCATCTAAAAAATCAATACCAAACACTTTCAAATGAGTTTGATACTCTTATATCT
GCTTTGCAAAAGAATTTTGATTTGAACTTTAATAAAGCGCTCCTTAATCGTACTC
AGCATTTTGAAGCCATGTGTAGAAGCACTAAGAAAAACGCTTTATCCAAACCAGA
GATCGTTTCCTATCGCGACCTGCTTGCTCGATTAACTTCTTGTTTGTATCGAGGC
TCTTTAGTTTTGCGTCGTGCCGGCATTGAAGTGTTAAAAAAACATAAAATATTTG
AGTCAAACAGCGAACTTCGTGAACATGTTCATGAAAGAAAGCATTTCGTGTTTGT
TAGTCCTCTAGATCGCAAAGCCAAGAAACTCCTTCGATTAACTGATTCGCGTCCA
GACTTGTTACATGTTATTGATGAAATATTGCAGCACGATAATCTTGAAAACAAAG
ACCGCGAGTCACTTTGGCTAGTTCGCTCTGGTTATTTGCTTGCAGGACTTCCAGA
TCAACTTTCTTCATCTTTTATTAACTTGCCTATCATTACTCAAAAAGGAGATAGA
CGCCTTATAGACCTGATTCAGTATGATCAAATTAATCGTGATGCTTTTGTTATGT
TAGTGACCTCTGCATTCAAGTCTAATTTGTCTGGTCTGCAGTATCGTGCCAATAA
GCAATCGTTCGTTGTTACTCGCACGCTAAGCCCTTATCTCGGCTCAAAACTTGTC
TACGTACCCAAGGATAAAGATTGGTTAGTTCCTTCTCAAATGTTTGAAGGACGAT
TTGCTGACATTCTTCAATCAGATTATATGGTCTGGAAAGATGCCGGTCGTCTTTG
TGTTATTGATACTGCAAAACACCTTTCTAATATAAAGAAGTCTGTATTTTCATCC
GAAGAAGTTCTCGCTTTTTTAAGAGAACTCCCTCACCGCACATTTATCCAGACCG
AAGTTCGCGGCCTTGGCGTTAATGTCGATGGAATTGCATTTAATAATGGTGATAT
TCCGTCATTAAAAACCTTTTCAAATTGCGTTCAGGTAAAAGTTTCTCGGACTAAT
ACATCCCTAGTTCAAACACTTAATCGTTGGTTTGAAGGAGGAAAAGTTTCTCCTC
CGAGCATTCAATTTGAACGGGCGTATTATAAAAAAGACGATCAAATTCATGAAGA
CGCAGCGAAAAGAAAGATACGATTCCAGATGCCCGCAACTGAGTTGGTTCATGCT
TCTGACGATGCGGGGTGGACACCAAGTTATTTGCTCGGCATTGATCCTGGCGAGT
ATGGAATGGGTCTTTCATTGGTTTCGATTAATAACGGAGAAGTCTTAGATTCAGG
CTTTATTCATATTAATTCTCTGATCAATTTTGCCTCTAAAAAGAGCAACCATCAA
ACTAAGGTTGTTCCGCGTCAGCAGTACAAATCTCCTTATGCAAATTATTTAGAAC
AATCTAAAGATTCTGCTGCTGGTGATATTGCGCATATACTCGATCGACTTATATA
CAAATTAAATGCGTTGCCTGTTTTTGAGGCTCTTTCAGGTAATTCTCAGAGTGCT
GCTGATCAAGTTTGGACGAAAGTCTTATCGTTTTACACTTGGGGTGATAATGACG
CTCAGAATTCTATTAGAAAGCAGCATTGGTTTGGAGCCAGTCATTGGGATATCAA
AGGTATGTTAAGGCAACCCCCTACGGAGAAGAAGCCTAAACCGTATATTGCTTTT
CCTGGCTCTCAGGTTTCTTCGTATGGTAATTCCCAACGTTGCTCTTGCTGCGGTC
GCAATCCTATTGAACAACTTCGAGAAATGGCAAAGGATACCTCTATTAAAGAGCT
AAAAATTCGCAATTCTGAGATACAGCTTTTTGACGGAACCATTAAATTATTTAAT
CCAGACCCATCCACTGTGATAGAGAGAAGGCGACATAATCTTGGTCCATCAAGAA
TTCCTGTTGCTGACCGTACTTTCAAAAACATCAGTCCATCAAGTCTAGAATTTAA
AGAATTGATTACTATCGTGTCTCGATCTATCCGTCATTCACCTGAGTTTATCGCT
AAAAAACGCGGCATAGGGTCTGAGTATTTTTGCGCTTATTCCGATTGCAACTCAT
CCTTAAATTCTGAAGCTAACGCAGCTGCTAACGTAGCGCAAAAATTTCAAAAACA
GTTATTTTTTGAGTTATAA
SEQ ATGAAGAGAATTCTGAACAGTCTGAAAGTTGCTGCCTTGAGACTTCTGTTTCGAG
ID GCAAAGGTTCTGAATTAGTGAAGACAGTCAAATATCCATTGGTTTCCCCGGTTCA
NO: AGGCGCGGTTGAAGAACTTGCTGAAGCAATTCGGCACGACAACCTGCACCTTTTT
37 GGGCAGAAGGAAATAGTGGATCTTATGGAGAAAGACGAAGGAACCCAGGTGTATT
CGGTTGTGGATTTTTGGTTGGATACCCTGCGTTTAGGGATGTTTTTCTCACCATC
AGCGAATGCGTTGAAAATCACGCTGGGAAAATTCAATTCTGATCAGGTTTCACCT
TTTCGTAAGGTTTTGGAGCAGTCACCTTTTTTTCTTGCGGGTCGCTTGAAGGTTG
AACCTGCGGAAAGGATACTTTCTGTTGAAATCAGAAAGATTGGTAAAAGAGAAAA
CAGAGTTGAGAACTATGCCGCCGATGTGGAGACATGCTTCATTGGTCAGCTTTCT
TCAGATGAGAAACAGAGTATCCAGAAGCTGGCAAATGATATCTGGGATAGCAAGG
ATCATGAGGAACAGAGAATGTTGAAGGCGGATTTTTTTGCTATACCTCTTATAAA
AGACCCCAAAGCTGTCACAGAAGAAGATCCTGAAAATGAAACGGCGGGAAAACAG
AAACCGCTTGAATTATGTGTTTGTCTTGTTCCTGAGTTGTATACCCGAGGTTTCG
GCTCCATTGCTGATTTTCTGGTTCAGCGACTTACCTTGCTGCGTGACAAAATGAG
TACCGACACGGCGGAAGATTGCCTCGAGTATGTTGGCATTGAGGAAGAAAAAGGC
AATGGAATGAATTCCTTGCTCGGCACTTTTTTGAAGAACCTGCAGGGTGATGGTT
TTGAACAGATTTTTCAGTTTATGCTTGGGTCTTATGTTGGCTGGCAGGGGAAGGA
AGATGTACTGCGCGAACGATTGGATTTGCTGGCCGAAAAAGTCAAAAGATTACCA
AAGCCAAAATTTGCCGGAGAATGGAGTGGTCATCGTATGTTTCTCCATGGTCAGC
TGAAAAGCTGGTCGTCGAATTTCTTCCGTCTTTTTAATGAGACGCGGGAACTTCT
GGAAAGTATCAAGAGTGATATTCAACATGCCACCATGCTCATTAGCTATGTGGAA
GAGAAAGGAGGCTATCATCCACAGCTGTTGAGTCAGTATCGGAAGTTAATGGAAC
AATTACCGGCGTTGCGGACTAAGGTTTTGGATCCTGAGATTGAGATGACGCATAT
GTCCGAGGCTGTTCGAAGTTACATTATGATACACAAGTCTGTAGCGGGATTTCTG
CCGGATTTACTCGAGTCTTTGGATCGAGATAAGGATAGGGAATTTTTGCTTTCCA
TCTTTCCTCGTATTCCAAAGATAGATAAGAAGACGAAAGAGATCGTTGCATGGGA
GCTACCGGGCGAGCCAGAGGAAGGCTATTTGTTCACAGCAAACAACCTTTTCCGG
AATTTTCTTGAGAATCCGAAACATGTGCCACGATTTATGGCAGAGAGGATTCCCG
AGGATTGGACGCGTTTGCGCTCGGCCCCTGTGTGGTTTGATGGGATGGTGAAGCA
ATGGCAGAAGGTGGTGAATCAGTTGGTTGAATCTCCAGGCGCCCTTTATCAGTTC
AATGAAAGTTTTTTGCGTCAAAGACTGCAAGCAATGCTTACGGTCTATAAGCGGG
ATCTCCAGACTGAGAAGTTTCTGAAGCTGCTGGCTGATGTCTGTCGTCCACTCGT
TGATTTTTTCGGACTTGGAGGAAATGATATTATCTTCAAGTCATGTCAGGATCCA
AGAAAGCAATGGCAGACTGTTATTCCACTCAGTGTCCCAGCGGATGTTTATACAG
CATGTGAAGGCTTGGCTATTCGTCTCCGCGAAACTCTTGGATTCGAATGGAAAAA
TCTGAAAGGACACGAGCGGGAAGATTTTTTACGGCTGCATCAGTTGCTGGGAAAT
CTGCTGTTCTGGATCAGGGATGCGAAACTTGTCGTGAAGCTGGAAGACTGGATGA
ACAATCCTTGTGTTCAGGAGTATGTGGAAGCACGAAAAGCCATTGATCTTCCCTT
GGAGATTTTCGGATTTGAGGTGCCGATTTTTCTCAATGGCTATCTCTTTTCGGAA
CTGCGCCAGCTGGAATTGTTGCTGAGGCGTAAGTCGGTGATGACGTCTTACAGCG
TCAAAACGACAGGCTCGCCAAATAGGCTCTTCCAGTTGGTTTACCTACCTCTAAA
CCCTTCAGATCCGGAAAAGAAAAATTCCAACAACTTTCAGGAGCGCCTCGATACA
CCTACCGGTTTGTCGCGTCGTTTTCTGGATCTTACGCTGGATGCATTTGCTGGCA
AACTCTTGACGGATCCGGTAACTCAGGAACTGAAGACGATGGCCGGTTTTTACGA
TCATCTCTTTGGCTTCAAGTTGCCGTGTAAACTGGCGGCGATGAGTAACCATCCA
GGATCCTCTTCCAAAATGGTGGTTCTGGCAAAACCAAAGAAGGGTGTTGCTAGTA
ACATCGGCTTTGAACCTATTCCCGATCCTGCTCATCCTGTGTTCCGGGTGAGAAG
TTCCTGGCCGGAGTTGAAGTACCTGGAGGGGTTGTTGTATCTTCCCGAAGATACA
CCACTGACCATTGAACTGGCGGAAACGTCGGTCAGTTGTCAGTCTGTGAGTTCAG
TCGCTTTCGATTTGAAGAATCTGACGACTATCTTGGGTCGTGTTGGTGAATTCAG
GGTGACGGCAGATCAACCTTTCAAGCTGACGCCCATTATTCCTGAGAAAGAGGAA
TCCTTCATCGGGAAGACCTACCTCGGTCTTGATGCTGGAGAGCGATCTGGCGTTG
GTTTCGCGATTGTGACGGTTGACGGCGATGGGTATGAGGTGCAGAGGTTGGGTGT
GCATGAAGATACTCAGCTTATGGCGCTTCAGCAAGTCGCCAGCAAGTCTCTTAAG
GAGCCGGTTTTCCAGCCACTCCGTAAGGGCACATTTCGTCAGCAGGAGCGCATTC
GCAAAAGCCTCCGCGGTTGCTACTGGAATTTCTATCATGCATTGATGATCAAGTA
CCGAGCTAAAGTTGTGCATGAGGAATCGGTGGGTTCATCCGGTCTGGTGGGGCAG
TGGCTGCGTGCATTTCAGAAGGATCTCAAAAAGGCTGATGTTCTGCCCAAGAAGG
GTGGAAAAAATGGTGTAGACAAAAAAAAGAGAGAAAGCAGCGCTCAGGATACCTT
ATGGGGAGGAGCTTTCTCGAAGAAGGAAGAGCAGCAGATAGCCTTTGAGGTTCAG
GCAGCTGGATCAAGCCAGTTTTGTCTGAAGTGTGGTTGGTGGTTTCAGTTGGGGA
TGCGGGAAGTAAATCGTGTGCAGGAGAGTGGCGTGGTGCTGGACTGGAACCGGTC
CATTGTAACCTTCCTCATCGAATCCTCAGGAGAAAAGGTATATGGTTTCAGTCCT
CAGCAACTGGAAAAAGGCTTTCGTCCTGACATCGAAACGTTCAAAAAAATGGTAA
GGGATTTTATGAGACCCCCCATGTTTGATCGCAAAGGTCGGCCGGCCGCGGCGTA
TGAAAGATTCGTACTGGGACGTCGTCACCGTCGTTATCGCTTTGATAAAGTTTTT
GAAGAGAGATTTGGTCGCAGTGCTCTTTTCATCTGCCCGCGGGTCGGGTGTGGGA
ATTTCGATCACTCCAGTGAGCAGTCAGCCGTTGTCCTTGCCCTTATTGGTTACAT
TGCTGATAAGGAAGGGATGAGTGGTAAGAAGCTTGTTTATGTGAGGCTGGCTGAA
CTTATGGCTGAGTGGAAGCTGAAGAAACTGGAGAGATCAAGGGTGGAAGAACAGA
GCTCGGCACAATAA
SEQ ATGGCAGAAAGCAAGCAGATGCAATGCCGCAAGTGCGGCGCAAGCATGAAGTATG
ID AAGTAATTGGATTGGGCAAGAAGTCATGCAGATATATGTGCCCAGATTGCGGCAA
NO: TCACACCAGCGCGCGCAAGATTCAGAACAAGAAAAAGCGCGACAAAAAGTATGGA
38 TCCGCAAGCAAAGCGCAGAGCCAGAGGATAGCTGTGGCTGGCGCGCTTTATCCAG
ACAAAAAAGTGCAGACCATAAAGACCTACAAATACCCAGCGGATCTTAATGGCGA
AGTTCATGACAGCGGCGTCGCAGAGAAGATTGCGCAGGCGATTCAGGAAGATGAG
ATCGGCCTGCTTGGCCCGTCCAGCGAATACGCTTGCTGGATTGCTTCACAAAAAC
AGAGCGAGCCGTATTCAGTTGTAGATTTTTGGTTTGACGCGGTGTGCGCAGGCGG
AGTATTCGCGTATTCTGGCGCGCGCCTGCTTTCCACAGTCCTCCAGTTGAGTGGC
GAGGAAAGCGTTTTGCGCGCTGCTTTAGCATCTAGCCCGTTTGTAGATGACATTA
ATTTGGCGCAAGCGGAAAAGTTCCTAGCCGTTAGCCGGCGCACAGGCCAAGATAA
GCTAGGCAAGCGCATTGGAGAATGTTTTGCGGAAGGCCGGCTTGAAGCGCTTGGC
ATCAAAGATCGCATGCGCGAATTCGTGCAAGCGATTGATGTGGCCCAAACCGCGG
GCCAGCGGTTCGCGGCCAAGCTAAAGATATTCGGCATCAGTCAGATGCCTGAAGC
CAAGCAATGGAACAATGATTCCGGGCTCACTGTATGTATTTTGCCGGATTATTAT
GTCCCGGAAGAAAACCGCGCGGACCAGCTGGTTGTTTTGCTTCGGCGCTTACGCG
AGATCGCGTATTGCATGGGAATTGAGGATGAAGCAGGATTTGAGCATCTAGGCAT
TGACCCTGGTGCTCTTTCCAATTTTTCCAATGGCAATCCAAAGCGAGGATTTCTC
GGCCGCCTGCTCAATAATGACATTATAGCGCTGGCAAACAACATGTCAGCCATGA
CGCCGTATTGGGAAGGCAGAAAAGGCGAGTTGATTGAGCGCCTTGCATGGCTTAA
ACATCGCGCTGAAGGATTGTATTTGAAAGAGCCACATTTCGGCAACTCCTGGGCA
GACCACCGCAGCAGGATTTTCAGTCGCATTGCGGGCTGGCTTTCCGGATGCGCGG
GCAAGCTCAAGATTGCCAAGGATCAGATTTCAGGCGTGCGTACGGATTTGTTTCT
GCTCAAGCGCCTTCTGGATGCGGTACCGCAAAGCGCGCCGTCGCCGGACTTTATT
GCTTCCATCAGCGCGCTGGATCGGTTTTTGGAAGCGGCAGAAAGCAGCCAGGATC
CGGCAGAACAGGTACGCGCTTTGTACGCGTTTCATCTGAACGCGCCTGCGGTCCG
ATCCATCGCCAACAAGGCGGTACAGAGGTCTGATTCCCAGGAGTGGCTTATCAAG
GAACTGGATGCTGTAGATCACCTTGAATTCAACAAAGCATTTCCGTTTTTTTCGG
ATACAGGAAAGAAAAAGAAGAAAGGAGCGAATAGCAACGGAGCGCCTTCTGAAGA
AGAATACACGGAAACAGAATCCATTCAACAACCAGAAGATGCAGAGCAGGAAGTG
AATGGTCAAGAAGGAAATGGCGCTTCAAAGAACCAGAAAAAGTTTCAGCGCATTC
CTCGATTTTTCGGGGAAGGGTCAAGGAGTGAGTATCGAATTTTAACAGAAGCGCC
GCAATATTTTGACATGTTCTGCAATAATATGCGCGCGATCTTTATGCAGCTAGAG
AGTCAGCCGCGCAAGGCGCCTCGTGATTTCAAATGCTTTCTGCAGAATCGTTTGC
AGAAGCTTTACAAGCAAACCTTTCTCAATGCTCGCAGTAATAAATGCCGCGCGCT
TCTGGAATCCGTCCTTATTTCATGGGGAGAATTTTATACTTATGGCGCGAATGAA
AAGAAGTTTCGTCTGCGCCATGAAGCGAGCGAGCGCAGCTCGGATCCGGACTATG
TGGTTCAGCAGGCATTGGAAATCGCGCGCCGGCTTTTCTTGTTCGGATTTGAGTG
GCGCGATTGCTCTGCTGGAGAGCGCGTGGATTTGGTTGAAATCCACAAAAAAGCA
ATCTCATTTTTGCTTGCAATCACTCAGGCCGAGGTTTCAGTTGGTTCCTATAACT
GGCTTGGGAATAGCACCGTGAGCCGGTATCTTTCGGTTGCTGGCACAGACACATT
GTACGGCACTCAACTGGAGGAGTTTTTGAACGCCACAGTGCTTTCACAGATGCGT
GGGCTGGCGATTCGGCTTTCATCTCAGGAGTTAAAAGACGGATTTGATGTTCAGT
TGGAGAGTTCGTGCCAGGACAATCTCCAGCATCTGCTGGTGTATCGCGCTTCGCG
CGACTTGGCTGCGTGCAAACGCGCTACATGCCCGGCTGAATTGGATCCGAAAATT
CTTGTTCTGCCGGTTGGTGCGTTTATCGCGAGCGTAATGAAAATGATTGAGCGTG
GCGATGAACCATTAGCAGGCGCGTATTTGCGTCATCGGCCGCATTCATTCGGCTG
GCAGATACGGGTTCGTGGAGTGGCGGAAGTAGGCATGGATCAGGGCACAGCGCTA
GCATTCCAGAAGCCGACTGAATCAGAGCCGTTTAAAATAAAGCCGTTTTCCGCTC
AATACGGCCCAGTACTTTGGCTTAATTCTTCATCCTATAGCCAGAGCCAGTATCT
GGATGGATTTTTAAGCCAGCCAAAGAATTGGTCTATGCGGGTGCTACCTCAAGCC
GGATCAGTGCGCGTGGAACAGCGCGTTGCTCTGATATGGAATTTGCAGGCAGGCA
AGATGCGGCTGGAGCGCTCTGGAGCGCGCGCGTTTTTCATGCCAGTGCCATTCAG
CTTCAGGCCGTCTGGTTCAGGAGATGAAGCAGTATTGGCGCCGAATCGGTACTTG
GGACTTTTTCCGCATTCCGGAGGAATAGAATACGCGGTGGTGGATGTATTAGATT
CCGCGGGTTTCAAAATTCTTGAGCGCGGTACGATTGCGGTAAATGGCTTTTCCCA
GAAGCGCGGCGAACGCCAAGAGGAGGCACACAGAGAAAAACAGAGACGCGGAATT
TCTGATATAGGCCGCAAGAAGCCGGTGCAAGCTGAAGTTGACGCAGCCAATGAAT
TGCACCGCAAATACACCGATGTTGCCACTCGTTTAGGGTGCAGAATTGTGGTTCA
GTGGGCGCCCCAGCCAAAGCCGGGCACAGCGCCGACCGCGCAAACAGTATACGCG
CGCGCAGTGCGGACCGAAGCGCCGCGATCTGGAAATCAAGAGGATCATGCTCGTA
TGAAATCCTCTTGGGGATATACCTGGGGCACCTATTGGGAGAAGCGCAAACCAGA
GGATATTTTGGGCATCTCAACCCAAGTATACTGGACCGGCGGTATAGGCGAGTCA
TGTCCCGCAGTCGCGGTTGCGCTTTTGGGGCACATTAGGGCAACATCCACTCAAA
CTGAATGGGAAAAAGAGGAGGTTGTATTCGGTCGACTGAAGAAGTTCTTTCCAAG
CTAG
SEQ ATGGAAAAGAGAATAAACAAGATACGAAAGAAACTATCGGCCGATAATGCCACAA
ID AGCCTGTGAGCAGGAGCGGCCCCATGAAAACACTCCTTGTCCGGGTCATGACGGA
NO: CGACTTGAAAAAAAGACTGGAGAAGCGTCGGAAAAAGCCGGAAGTTATGCCGCAG
39 GTTATTTCAAATAACGCAGCAAACAATCTTAGAATGCTCCTTGATGACTATACAA
AGATGAAGGAGGCGATACTACAAGTTTACTGGCAGGAATTTAAGGACGACCATGT
GGGCTTGATGTGCAAATTTGCCCAGCCTGCTTCCAAAAAAATTGACCAGAACAAA
CTAAAACCGGAAATGGATGAAAAAGGAAATCTAACAACTGCCGGTTTTGCATGTT
CTCAATGCGGTCAGCCGCTATTTGTTTATAAGCTTGAACAGGTGAGTGAAAAAGG
CAAGGCTTATACAAATTACTTCGGCCGGTGTAATGTGGCCGAGCATGAGAAATTG
ATTCTTCTTGCTCAATTAAAACCTGAAAAAGACAGTGACGAAGCAGTGACATACT
CCCTTGGCAAATTCGGCCAGAGGGCATTGGACTTTTATTCAATCCACGTAACAAA
AGAATCCACCCATCCAGTAAAGCCCCTGGCACAGATTGCGGGCAACCGCTATGCA
AGCGGACCTGTTGGCAAGGCCCTTTCCGATGCCTGTATGGGCACTATAGCCAGTT
TTCTTTCGAAATATCAAGACATCATCATAGAACATCAAAAGGTTGTGAAGGGTAA
TCAAAAGAGGTTAGAGAGTCTCAGGGAATTGGCAGGGAAAGAAAATCTTGAGTAC
CCATCGGTTACACTGCCGCCGCAGCCGCATACGAAAGAAGGGGTTGACGCTTATA
ACGAAGTTATTGCAAGGGTACGTATGTGGGTTAATCTTAATCTGTGGCAAAAGCT
GAAGCTCAGCCGTGATGACGCAAAACCGCTACTGCGGCTAAAAGGATTCCCATCT
TTCCCTGTTGTGGAGCGGCGTGAAAACGAAGTTGACTGGTGGAATACGATTAATG
AAGTAAAAAAACTGATTGACGCTAAACGAGATATGGGACGGGTATTCTGGAGCGG
CGTTACCGCAGAAAAGAGAAATACCATCCTTGAAGGATACAACTATCTGCCAAAT
GAGAATGACCATAAAAAGAGAGAGGGCAGTTTGGAAAACCCTAAGAAGCCTGCCA
AACGCCAGTTTGGAGACCTCTTGCTGTATCTTGAAAAGAAATATGCCGGAGACTG
GGGAAAGGTCTTCGATGAGGCATGGGAGAGGATAGATAAGAAAATAGCCGGACTC
ACAAGCCATATAGAGCGCGAAGAAGCAAGAAACGCGGAAGACGCTCAATCCAAAG
CCGTACTTACAGACTGGCTAAGGGCAAAGGCATCATTTGTTCTTGAAAGACTGAA
GGAAATGGATGAAAAGGAATTCTATGCGTGTGAAATCCAACTTCAAAAATGGTAT
GGCGATCTTCGAGGCAACCCGTTTGCCGTTGAAGCTGAGAATAGAGTTGTTGATA
TAAGCGGGTTTTCTATCGGAAGCGATGGCCATTCAATCCAATACAGAAATCTCCT
TGCCTGGAAATATCTGGAGAACGGCAAGCGTGAATTCTATCTGTTAATGAATTAT
GGCAAGAAAGGGCGCATCAGATTTACAGATGGAACAGATATTAAAAAGAGCGGCA
AATGGCAGGGACTATTATATGGCGGTGGCAAGGCAAAGGTTATTGATCTGACTTT
CGACCCCGATGATGAACAGTTGATAATCCTGCCGCTGGCCTTTGGCACAAGGCAA
GGCCGCGAGTTTATCTGGAACGATTTGCTGAGTCTTGAAACAGGCCTGATAAAGC
TCGCAAACGGAAGAGTTATCGAAAAAACAATCTATAACAAAAAAATAGGGCGGGA
TGAACCGGCTCTATTCGTTGCCTTAACATTTGAGCGCCGGGAAGTTGTTGATCCA
TCAAATATAAAGCCTGTAAACCTTATAGGCGTTGACCGCGGCGAAAACATCCCGG
CGGTTATTGCATTGACAGACCCTGAAGGTTGTCCTTTACCGGAATTCAAGGATTC
ATCAGGGGGCCCAACAGACATCCTGCGAATAGGAGAAGGATATAAGGAAAAGCAG
AGGGCTATTCAGGCAGCAAAGGAGGTAGAGCAAAGGCGGGCTGGCGGTTATTCAC
GGAAGTTTGCATCCAAGTCGAGGAACCTGGCGGACGACATGGTGAGAAATTCAGC
GCGAGACCTTTTTTACCATGCCGTTACCCACGATGCCGTCCTTGTCTTTGAAAAC
CTGAGCAGGGGTTTTGGAAGGCAGGGCAAAAGGACCTTCATGACGGAAAGACAAT
ATACAAAGATGGAAGACTGGCTGACAGCGAAGCTCGCATACGAAGGTCTTACGTC
AAAAACCTACCTTTCAAAGACGCTGGCGCAATATACGTCAAAAACATGCTCCAAC
TGCGGGTTTACTATAACGACTGCCGATTATGACGGGATGTTGGTAAGGCTTAAAA
AGACTTCTGATGGATGGGCAACTACCCTCAACAACAAAGAATTAAAAGCCGAAGG
CCAGATAACGTATTATAACCGGTATAAAAGGCAAACCGTGGAAAAAGAACTCTCC
GCAGAGCTTGACAGGCTTTCAGAAGAGTCGGGCAATAATGATATTTCTAAGTGGA
CCAAGGGTCGCCGGGACGAGGCATTATTTTTGTTAAAGAAAAGATTCAGCCATCG
GCCTGTTCAGGAACAGTTTGTTTGCCTCGATTGCGGCCATGAAGTCCACGCCGAT
GAACAGGCAGCCTTGAATATTGCAAGGTCATGGCTTTTTCTAAACTCAAATTCAA
CAGAATTCAAAAGTTATAAATCGGGTAAACAGCCCTTCGTTGGTGCTTGGCAGGC
CTTTTACAAAAGGAGGCTTAAAGAGGTATGGAAGCCCAACGCC
SEQ ATGAAAAGGATAAATAAAATACGAAGGAGATTGGTAAAGGATAGCAACACGAAAA
ID AAGCCGGCAAAACCGGCCCTATGAAAACCTTGCTCGTTCGGGTTATGACACCTGA
NO: CCTGAGAGAAAGGTTAGAGAATCTTCGCAAAAAGCCGGAAAACATTCCTCAGCCC
40 ATTTCAAATACTTCACGTGCAAATTTAAATAAACTCCTCACTGACTATACGGAAA
TGAAGAAAGCAATCCTGCATGTTTATTGGGAAGAGTTCCAAAAAGACCCTGTCGG
ATTGATGAGCAGGGTTGCACAACCAGCGCCCAAGAATATTGATCAGAGAAAATTG
ATTCCGGTGAAGGACGGAAATGAGAGACTAACAAGTTCTGGATTTGCCTGTTCTC
AGTGCTGTCAACCCCTCTATGTTTATAAGCTTGAACAAGTGAATGACAAGGGTAA
GCCCCATACAAATTACTTTGGCCGTTGTAATGTCTCCGAGCATGAACGTTTGATA
TTGCTCTCGCCGCATAAACCGGAGGCAAATGACGAGCTAGTAACGTATTCGTTGG
GGAAGTTCGGTCAAAGGGCATTGGACTTTTATTCAATCCACGTAACAAGAGAATC
GAACCATCCTGTAAAGCCGCTAGAACAGATCGGTGGCAATAGCTGCGCAAGTGGT
CCCGTTGGTAAGGCTTTATCTGATGCCTGTATGGGAGCAGTAGCCAGTTTCCTTA
CAAAGTACCAGGACATCATCCTCGAACACCAAAAGGTTATAAAAAAAAACGAAAA
GAGATTGGCAAATCTAAAGGATATAGCAAGTGCAAACGGGCTTGCATTTCCTAAA
ATCACTCTTCCACCGCAACCGCATACAAAAGAAGGGATTGAAGCTTATAACAATG
TTGTTGCTCAGATAGTGATCTGGGTAAACCTGAATCTTTGGCAGAAACTCAAAAT
TGGCAGGGATGAGGCAAAGCCCTTACAGCGGCTTAAGGGTTTTCCGTCCTTCCCT
CTTGTTGAACGCCAGGCGAATGAGGTTGATTGGTGGGATATGGTCTGTAATGTCA
AAAAGTTGATTAACGAAAAGAAAGAGGACGGGAAGGTCTTCTGGCAAAATCTTGC
TGGATATAAAAGGCAGGAAGCCTTGCTTCCATATCTTTCGTCTGAAGAAGACCGT
AAAAAAGGAAAAAAGTTTGCGCGTTATCAGTTTGGTGACCTTTTGCTTCACCTTG
AAAAGAAACACGGTGAAGATTGGGGCAAAGTTTATGATGAGGCATGGGAAAGAAT
AGATAAAAAAGTTGAAGGTCTGAGTAAGCACATAAAGTTGGAGGAAGAAAGAAGG
TCTGAAGATGCTCAATCAAAGGCTGCCCTCACTGATTGGCTCAGGGCAAAGGCCT
CTTTTGTTATTGAAGGGCTCAAAGAAGCTGATAAGGATGAGTTTTGCAGGTGTGA
GTTAAAGCTTCAAAAGTGGTATGGAGATTTGAGAGGAAAACCATTTGCTATAGAA
GCAGAGAACAGCATTTTAGATATAAGCGGATTTTCTAAACAGTATAATTGTGCAT
TTATATGGCAGAAAGACGGCGTAAAGAAGTTAAATCTTTATTTAATAATAAATTA
CTTCAAAGGTGGTAAGCTACGCTTCAAAAAAATCAAGCCAGAAGCTTTTGAAGCA
AATAGGTTTTATACAGTAATTAATAAAAAAAGCGGTGAGATTGTGCCTATGGAGG
TCAACTTCAATTTTGATGACCCGAATTTGATAATTCTGCCTTTGGCCTTTGGAAA
AAGGCAGGGGAGGGAGTTTATCTGGAACGACCTATTGAGCCTTGAGACGGGTTCA
TTGAAACTCGCCAATGGCAGGGTTATTGAAAAAACGCTCTATAACAGAAGGACGA
GACAGGATGAACCAGCACTTTTTGTTGCCCTGACATTTGAAAGAAGAGAGGTGCT
TGACTCATCGAATATAAAACCGATGAATCTGATAGGAATAGACCGGGGAGAAAAT
ATCCCGGCAGTCATAGCATTAACAGACCCGGAAGGATGCCCCTTGTCAAGATTCA
AAGATTCATTGGGCAATCCAACGCATATTTTGCGAATAGGAGAAAGTTATAAGGA
AAAACAACGGACTATTCAGGCTGCTAAAGAAGTTGAACAAAGGCGGGCAGGCGGA
TATTCGAGAAAATATGCATCAAAGGCGAAGAATCTGGCGGACGATATGGTAAGAA
ATACAGCTCGTGACCTCTTATATTATGCTGTTACTCAAGATGCAATGCTCATTTT
TGAAAATCTTTCCCGCGGTTTTGGTAGACAAGGCAAGAGGACTTTTATGGCGGAA
AGGCAGTACACGAGGATGGAAGACTGGCTGACTGCAAAGCTTGCCTATGAAGGTC
TGCCATCAAAAACCTATCTTTCAAAGACTCTGGCACAGTATACCTCAAAGACATG
TTCTAATTGTGGTTTTACAATCACAAGTGCAGATTATGACAGGGTGCTCGAAAAG
CTCAAGAAGACGGCTACTGGATGGATGACTACAATCAATGGAAAAGAGTTAAAAG
TTGAAGGACAGATAACATACTATAACCGGTATAAAAGGCAGAATGTGGTAAAAGA
CCTCTCTGTAGAGCTGGATAGACTTTCGGAAGAGTCGGTAAATAATGATATTTCT
AGTTGGACAAAAGGCCGCAGTGGTGAAGCTTTATCTCTGCTAAAAAAGAGATTTA
GTCACAGGCCGGTGCAGGAAAAGTTTGTTTGCCTGAACTGTGGTTTTGAAACCCA
TGCAGACGAACAAGCAGCACTGAATATTGCAAGGTCGTGGCTCTTTCTCCGTTCT
CAAGAATATAAGAAGTATCAAACCAATAAAACGACCGGAAATACTGACAAAAGGG
CATTTGTTGAAACATGGCAATCCTTTTACAGAAAGAAGCTCAAAGAAGTATGGAA
ACCA
SEQ ATGGGTAAAATGTATTACCTTGGTTTAGACATTGGCACGAATTCCGTGGGCTACG
ID CGGTGACCGACCCCTCATACCACCTGCTGAAGTTTAAGGGGGAACCAATGTGGGG
NO: TGCGCACGTATTTGCCGCCGGTAATCAGAGCGCGGAACGACGCTCGTTCCGCACA
41 TCGCGTCGTCGTTTGGACCGACGCCAACAGCGCGTTAAACTGGTACAGGAGATTT
TTGCCCCGGTGATTAGTCCGATCGACCCACGCTTCTTCATTCGTCTGCATGAATC
CGCCCTGTGGCGCGATGACGTCGCGGAGACGGATAAACATATCTTTTTCAATGAT
CCTACCTATACCGATAAGGAATATTATAGCGATTACCCGACTATCCATCACCTGA
TCGTTGATCTGATGGAAAGCTCTGAGAAACACGATCCGCGGCTGGTGTACCTTGC
AGTGGCGTGGTTAGTGGCACACCGTGGTCATTTTCTGAACGAGGTGGACAAGGAT
AATATTGGAGATGTGTTGTCGTTCGACGCATTTTATCCGGAGTTTCTCGCGTTCC
TGTCGGACAACGGTGTATCACCGTGGGTGTGCGAAAGCAAAGCGCTGCAGGCGAC
CTTGCTGAGCCGTAACTCAGTGAACGACAAATATAAAGCCCTTAAGTCTCTGATC
TTCGGATCCCAGAAACCTGAAGATAACTTCGATGCCAATATTTCGGAAGATGGAC
TCATTCAACTGCTGGCCGGCAAAAAGGTAAAAGTTAACAAACTGTTCCCTCAGGA
ATCGAACGATGCATCCTTCACATTGAATGATAAAGAAGACGCGATAGAAGAAATC
CTGGGTACGCTTACACCAGATGAATGTGAATGGATTGCGCATATACGCCGCCTTT
TTGACTGGGCTATCATGAAACATGCTCTGAAAGATGGCAGGACTATTAGCGAGTC
AAAAGTCAAACTGTATGAGCAGCACCATCACGATCTGACCCAACTTAAATACTTC
GTGAAAACCTACCTTGCAAAAGAATACGACGATATTTTCCGCAACGTGGATAGCG
AAACAACGAAAAACTATGTAGCGTATTCCTATCATGTGAAAGAGGTGAAAGGCAC
TCTGCCTAAAAATAAGGCAACGCAAGAAGAGTTTTGTAAGTATGTCCTGGGCAAG
GTTAAAAACATTGAATGCTCTGAAGCAGACAAGGTTGACTTTGATGAGATGATTC
AGCGTCTTACCGACAACTCTTTTATGCCTAAGCAGGTTTCGGGCGAAAACCGCGT
TATTCCTTATCAGTTATATTATTATGAACTGAAGACAATTCTGAATAAAGCAGCC
TCGTACCTGCCTTTCCTGACGCAGTGTGGAAAAGATGCAATTTCGAACCAGGACA
AACTACTGTCGATCATGACGTTCCGTATTCCTTACTTCGTCGGACCCTTGCGAAA
AGATAATTCGGAACATGCATGGCTCGAACGAAAGGCCGGTAAGATTTATCCGTGG
AACTTTAACGACAAAGTGGACTTGGATAAATCAGAAGAAGCGTTCATTCGCCGAA
TGACCAATACCTGTACCTATTATCCCGGCGAAGATGTTTTACCGTTGGATTCGCT
GATCTATGAGAAATTTATGATTTTAAATGAAATCAATAATATTCGTATTGACGGC
TACCCGATTAGTGTTGACGTTAAACAGCAGGTTTTTGGCTTGTTCGAAAAAAAAC
GACGCGTAACCGTGAAAGATATTCAGAACCTGCTGCTGTCTCTCGGAGCTCTGGA
CAAACACGGGAAGCTGACAGGCATCGATACCACTATCCACTCAAACTATAATACG
TATCACCATTTTAAATCTCTCATGGAACGCGGCGTCCTGACCCGGGATGACGTGG
AACGCATCGTTGAAAGGATGACCTACAGCGACGATACTAAGCGTGTGCGTCTGTG
GCTGAATAACAACTATGGTACTTTAACCGCCGACGATGTGAAACACATTTCGCGT
CTGCGCAAACACGATTTTGGCCGTTTATCCAAAATGTTCTTAACAGGTCTGAAGG
GTGTCCATAAGGAGACCGGTGAACGTGCCTCCATACTGGATTTCATGTGGAACAC
GAACGATAACCTGATGCAGCTCCTTTCCGAATGCTACACGTTCAGTGATGAAATC
ACAAAGCTGCAAGAGGCGTATTATGCAAAAGCCCAGTTGTCTTTAAACGATTTTT
TAGACTCGATGTACATCTCTAACGCGGTGAAACGTCCGATTTACAGAACTCTGGC
AGTGGTGAACGATATTCGAAAAGCATGTGGGACGGCCCCTAAACGCATTTTCATC
GAAATGGCTCGTGATGGTGAATCAAAAAAAAAGAGAAGTGTTACACGTCGCGAGC
AGATCAAAAACCTGTACCGCTCGATTCGTAAAGATTTCCAGCAGGAAGTTGATTT
TCTGGAAAAGATCCTGGAAAATAAATCTGATGGTCAACTTCAGTCAGATGCTTTG
TATCTTTACTTTGCACAATTAGGGCGCGATATGTACACGGGCGATCCAATAAAGC
TGGAGCACATCAAAGATCAGAGTTTCTATAACATAGACCATATTTACCCGCAGTC
TATGGTGAAAGACGATTCCCTAGATAACAAAGTGCTGGTGCAAAGCGAAATTAAC
GGCGAGAAAAGCTCGCGATACCCTTTGGACGCCGCGATCCGCAATAAAATGAAGC
CCCTTTGGGACGCTTACTATAATCATGGCCTGATCTCCTTAAAGAAATACCAGCG
TCTAACGCGCTCGACCCCGTTTACCGATGATGAAAAATGGGACTTTATTAATCGC
CAGTTAGTGGAAACCCGTCAATCTACCAAAGCGCTGGCCATTTTGTTGAAGCGTA
AGTTTCCAGACACCGAAATTGTGTATTCGAAGGCGGGGTTATCGTCCGACTTCAG
ACATGAATTCGGCCTTGTAAAAAGTCGCAATATTAATGATTTGCACCACGCTAAA
GACGCATTCTTGGCTATCGTTACCGGCAATGTGTACCATGAAAGATTCAATCGCA
GATGGTTTATGGTGAACCAGCCGTACTCAGTTAAAACTAAAACTCTTTTTACCCA
CAGCATAAAGAATGGCAACTTCGTTGCCTGGAACGGCGAAGAAGATCTCGGTCGT
ATTGTAAAAATGCTGAAGCAAAACAAAAATACCATTCACTTCACGCGCTTCTCCT
TCGATCGCAAAGAAGGATTATTTGATATCCAACCTCTGAAAGCCAGCACCGGCTT
AGTCCCACGAAAAGCCGGTCTGGATGTCGTTAAATACGGCGGATATGACAAATCT
ACCGCGGCCTATTACCTGCTGGTGAGGTTCACGCTCGAGGACAAGAAAACCCAGC
ACAAGCTGATGATGATTCCTGTAGAAGGCCTGTACAAGGCTCGCATTGATCATGA
CAAGGAATTTCTTACCGATTATGCGCAAACGACTATAAGCGAAATCCTACAGAAA
GATAAACAGAAAGTGATCAATATTATGTTTCCAATGGGTACGAGGCATATAAAAC
TCAATTCAATGATTAGTATCGATGGCTTCTATCTTAGTATCGGCGGAAAGTCCTC
TAAAGGTAAGTCAGTTCTATGTCACGCAATGGTTCCACTGATCGTCCCTCACAAA
ATCGAATGTTACATTAAAGCAATGGAAAGCTTCGCCCGGAAGTTTAAAGAAAACA
ACAAGCTGCGCATCGTAGAAAAATTCGATAAAATCACCGTTGAAGACAACCTGAA
TCTCTACGAGCTCTTTCTCCAAAAACTGCAGCATAATCCCTATAATAAGTTTTTT
TCGACACAGTTTGACGTACTGACGAACGGCCGTTCTACTTTCACAAAACTGTCGC
CGGAGGAACAGGTACAGACGCTCTTGAACATTTTAAGTATCTTTAAAACATGCCG
CAGTTCGGGTTGCGACCTGAAATCCATCAACGGCAGTGCCCAGGCAGCGCGCATC
ATGATTAGCGCTGACTTAACTGGACTGTCGAAAAAATATTCAGATATTAGGTTGG
TTGAACAGTCAGCTTCTGGTTTGTTCGTATCCAAAAGTCAGAACTTACTGGAGTA
TCTCTAA
SEQ ATGTCATCGCTCACGAAATTCACTAACAAATACTCTAAACAGCTCACCATTAAGA
ID ATGAACTCATCCCAGTTGGCAAAACACTGGAGAACATCAAAGAGAATGGTCTGAT
NO: AGATGGCGACGAACAGCTGAATGAGAATTATCAGAAGGCGAAAATTATTGTGGAT
42 GATTTTCTGCGGGACTTCATTAATAAAGCACTGAATAATACGCAGATCGGGAACT
GGCGCGAACTGGCGGATGCCCTTAATAAAGAGGATGAAGATAACATCGAGAAATT
GCAGGATAAAATTCGGGGAATCATTGTATCCAAATTTGAAACGTTTGATCTGTTT
AGCAGCTATTCTATTAAGAAAGATGAAAAGATTATTGACGACGACAATGATGTTG
AAGAAGAGGAACTGGATCTGGGCAAGAAGACCAGCTCATTTAAATACATATTTAA
AAAAAACCTGTTTAAGTTAGTGTTGCCATCCTACCTGAAAACCACAAACCAGGAC
AAGCTGAAGATTATTAGCTCGTTTGATAATTTTTCAACGTACTTCCGCGGGTTCT
TTGAAAACCGGAAAAACATTTTTACCAAGAAACCGATCTCCACAAGTATTGCGTA
TCGCATTGTTCATGATAACTTCCCGAAATTCCTTGATAACATTCGTTGTTTTAAT
GTGTGGCAGACGGAATGCCCGCAACTAATCGTGAAAGCAGATAACTATCTGAAAA
GCAAAAATGTTATAGCGAAAGATAAAAGTTTGGCAAACTATTTTACCGTGGGCGC
GTATGACTATTTCCTGTCTCAGAATGGTATAGATTTTTACAACAATATTATAGGT
GGACTGCCAGCGTTCGCCGGCCATGAGAAAATCCAAGGTCTCAATGAATTCATCA
ATCAAGAGTGCCAAAAAGACAGCGAGCTGAAAAGTAAGCTGAAAAACCGTCACGC
GTTCAAAATGGCGGTACTGTTCAAACAGATACTCAGCGATCGTGAAAAAAGTTTT
GTAATTGATGAGTTCGAGTCGGATGCTCAAGTTATTGACGCCGTTAAAAACTTTT
ACGCCGAACAGTGCAAAGATAACAATGTTATTTTTAACTTATTAAATCTTATCAA
GAATATCGCTTTCTTAAGTGATGACGAACTGGACGGCATATTCATTGAAGGGAAA
TACCTGTCGAGCGTTAGTCAAAAACTCTATAGCGATTGGTCAAAATTACGTAACG
ACATTGAGGATTCGGCTAACTCTAAACAAGGCAATAAAGAGCTGGCCAAGAAGAT
CAAAACCAACAAAGGGGATGTAGAAAAAGCGATCTCGAAATATGAGTTCTCGCTG
TCGGAACTGAACTCGATTGTACATGATAACACCAAGTTTTCTGACCTCCTTAGTT
GTACACTGCATAAGGTGGCTTCTGAGAAACTGGTGAAGGTCAATGAAGGCGACTG
GCCGAAACATCTCAAGAATAATGAAGAGAAACAAAAAATCAAAGAGCCGCTTGAT
GCTCTGCTGGAGATCTATAATACACTTCTGATTTTTAACTGCAAAAGCTTCAATA
AAAACGGCAACTTCTATGTCGACTATGATCGTTGCATCAATGAACTGAGTTCGGT
CGTGTATCTGTATAATAAAACACGTAACTATTGCACTAAAAAACCCTATAACACG
GACAAGTTCAAACTCAATTTTAACAGTCCGCAGCTCGGTGAAGGCTTTTCCAAGT
CGAAAGAAAATGACTGTCTGACTCTTTTGTTTAAAAAAGACGACAACTATTATGT
AGGCATTATCCGCAAAGGTGCAAAAATCAATTTTGATGATACACAAGCAATCGCC
GATAACACCGACAATTGCATCTTTAAAATGAATTATTTCCTACTTAAAGACGCAA
AAAAATTTATCCCGAAATGTAGCATTCAGCTGAAAGAAGTCAAGGCCCATTTTAA
GAAATCTGAAGATGATTACATTTTGTCTGATAAAGAGAAATTTGCTAGCCCGCTG
GTCATTAAAAAGAGCACATTTTTGCTGGCAACTGCACATGTGAAAGGGAAAAAAG
GCAATATCAAGAAATTTCAGAAAGAATATTCGAAAGAAAACCCCACTGAGTATCG
CAATTCTTTAAACGAATGGATTGCTTTTTGTAAAGAGTTCTTAAAAACTTATAAA
GCGGCTACCATTTTTGATATAACCACATTGAAAAAGGCAGAGGAATATGCTGATA
TTGTAGAATTCTACAAGGATGTCGATAATCTGTGCTACAAACTGGAGTTCTGCCC
GATTAAAACCTCGTTTATAGAAAACCTGATAGATAACGGCGACCTGTATCTGTTT
CGCATCAATAACAAAGACTTCAGCAGTAAATCGACCGGCACCAAGAACCTTCATA
CGTTATATTTACAAGCTATATTCGATGAACGTAATCTGAACAATCCGACAATTAT
GCTGAATGGGGGAGCAGAACTGTTCTATCGTAAAGAAAGTATTGAGCAGAAAAAC
CGTATCACACACAAAGCCGGTTCAATTCTCGTGAATAAGGTGTGTAAAGACGGTA
CAAGCCTGGATGATAAGATACGTAATGAAATTTATCAATATGAGAATAAATTTAT
TGATACCCTGTCTGATGAAGCTAAAAAGGTGTTACCGAATGTCATTAAAAAGGAA
GCTACCCATGACATTACAAAAGATAAACGTTTCACTAGTGACAAATTCTTCTTTC
ACTGCCCCCTGACAATTAATTATAAGGAAGGCGATACCAAGCAGTTCAATAACGA
AGTGCTGAGTTTTCTGCGTGGAAATCCTGACATCAACATTATCGGCATTGACCGC
GGAGAGCGTAATTTAATCTATGTAACGGTTATAAACCAGAAAGGCGAGATTCTGG
ATTCGGTTTCATTCAATACCGTGACCAACAAGAGTTCAAAAATCGAGCAGACAGT
CGATTATGAAGAGAAATTGGCAGTCCGCGAGAAAGAGAGGATTGAAGCAAAACGT
TCCTGGGACTCTATCTCAAAAATTGCGACACTAAAGGAAGGTTATCTGAGCGCAA
TAGTTCACGAGATCTGTCTGTTAATGATTAAACACAACGCGATCGTTGTCTTAGA
GAATCTTAATGCAGGCTTTAAGCGTATTCGTGGCGGTTTATCAGAAAAAAGTGTT
TATCAAAAATTCGAAAAAATGTTGATTAACAAACTGAACTATTTTGTCAGCAAGA
AGGAATCCGACTGGAATAAACCGTCTGGTCTGCTGAATGGACTGCAGCTTTCGGA
TCAGTTTGAAAGCTTCGAAAAACTGGGTATTCAGTCTGGTTTTATTTTTTACGTG
CCGGCTGCATATACCTCAAAGATTGATCCGACCACGGGCTTCGCCAATGTTCTGA
ATCTGTCGAAGGTACGCAATGTTGATGCGATCAAAAGCTTTTTTTCTAACTTCAA
CGAAATTAGTTATAGCAAGAAAGAAGCCCTTTTCAAATTCTCATTCGATCTGGAT
TCACTGAGTAAGAAAGGCTTTAGTAGCTTTGTGAAATTTAGTAAGAGTAAATGGA
ACGTCTACACCTTTGGAGAACGTATCATAAAGCCAAAGAATAAGCAAGGTTATCG
GGAGGACAAAAGAATCAACTTGACCTTCGAGATGAAGAAGTTACTTAACGAGTAT
AAGGTTTCTTTTGATCTTGAAAATAACTTGATTCCGAATCTCACGAGTGCCAACC
TGAAGGATACTTTTTGGAAAGAGCTATTCTTTATCTTCAAGACTACGCTGCAGCT
CCGTAACAGCGTTACTAACGGTAAAGAAGATGTGCTCATCTCTCCGGTCAAAAAT
GCGAAGGGTGAATTCTTCGTTTCGGGAACGCATAACAAGACTCTTCCGCAAGATT
GCGATGCGAACGGTGCATACCATATTGCGTTGAAAGGTCTGATGATACTCGAACG
TAACAACCTTGTACGTGAGGAGAAAGATACGAAAAAGATTATGGCGATTTCAAAC
GTGGATTGGTTCGAGTACGTGCAGAAACGTAGAGGCGTTCTGTAA
SEQ ATGAACAACTACGACGAATTCACCAAACTGTACCCGATCCAGAAAACCATCCGTT
ID TCGAACTGAAACCGCAGGGTCGTACCATGGAACACCTGGAAACCTTCAACTTCTT
NO: CGAAGAAGACCGTGACCGTGCGGAAAAATACAAAATCCTGAAAGAAGCGATCGAC
43 GAATACCACAAAAAATTCATCGACGAACACCTGACCAACATGTCTCTGGACTGGA
ACTCTCTGAAACAGATCTCTGAAAAATACTACAAATCTCGTGAAGAAAAAGACAA
AAAAGTTTTCCTGTCTGAACAGAAACGTATGCGTCAGGAAATCGTTTCTGAATTC
AAAAAAGACGACCGTTTCAAAGACCTGTTCTCTAAAAAACTGTTCTCTGAACTGC
TGAAAGAAGAAATCTACAAAAAAGGTAACCACCAGGAAATCGACGCGCTGAAATC
TTTCGACAAATTCTCTGGTTACTTCATCGGTCTGCACGAAAACCGTAAAAACATG
TACTCTGACGGTGACGAAATCACCGCGATCTCTAACCGTATCGTTAACGAAAACT
TCCCGAAATTCCTGGACAACCTGCAGAAATACCAGGAAGCGCGTAAAAAATACCC
GGAATGGATCATCAAAGCGGAATCTGCGCTGGTTGCGCACAACATCAAAATGGAC
GAAGTTTTCTCTCTGGAATACTTCAACAAAGTTCTGAACCAGGAAGGTATCCAGC
GTTACAACCTGGCGCTGGGTGGTTACGTTACCAAATCTGGTGAAAAAATGATGGG
TCTGAACGACGCGCTGAACCTGGCGCACCAGTCTGAAAAATCTTCTAAAGGTCGT
ATCCACATGACCCCGCTGTTCAAACAGATCCTGTCTGAAAAAGAATCTTTCTCTT
ACATCCCGGACGTTTTCACCGAAGACTCTCAGCTGCTGCCGTCTATCGGTGGTTT
CTTCGCGCAGATCGAAAACGACAAAGACGGTAACATCTTCGACCGTGCGCTGGAA
CTGATCTCTTCTTACGCGGAATACGACACCGAACGTATCTACATCCGTCAGGCGG
ACATCAACCGTGTTTCTAACGTTATCTTCGGTGAATGGGGTACCCTGGGTGGTCT
GATGCGTGAATACAAAGCGGACTCTATCAACGACATCAACCTGGAACGTACCTGC
AAAAAAGTTGACAAATGGCTGGACTCTAAAGAATTCGCGCTGTCTGACGTTCTGG
AAGCGATCAAACGTACCGGTAACAACGACGCGTTCAACGAATACATCTCTAAAAT
GCGTACCGCGCGTGAAAAAATCGACGCGGCGCGTAAAGAAATGAAATTCATCTCT
GAAAAAATCTCTGGTGACGAAGAATCTATCCACATCATCAAAACCCTGCTGGACT
CTGTTCAGCAGTTCCTGCACTTCTTCAACCTGTTCAAAGCGCGTCAGGACATCCC
GCTGGACGGTGCGTTCTACGCGGAATTCGACGAAGTTCACTCTAAACTGTTCGCG
ATCGTTCCGCTGTACAACAAAGTTCGTAACTACCTGACCAAAAACAACCTGAACA
CCAAAAAAATCAAACTGAACTTCAAAAACCCGACCCTGGCGAACGGTTGGGACCA
GAACAAAGTTTACGACTACGCGTCTCTGATCTTCCTGCGTGACGGTAACTACTAC
CTGGGTATCATCAACCCGAAACGTAAAAAAAACATCAAATTCGAACAGGGTTCTG
GTAACGGTCCGTTCTACCGTAAAATGGTTTACAAACAGATCCCGGGTCCGAACAA
AAACCTGCCGCGTGTTTTCCTGACCTCTACCAAAGGTAAAAAAGAATACAAACCG
TCTAAAGAAATCATCGAAGGTTACGAAGCGGACAAACACATCCGTGGTGACAAAT
TCGACCTGGACTTCTGCCACAAACTGATCGACTTCTTCAAAGAATCTATCGAAAA
ACACAAAGACTGGTCTAAATTCAACTTCTACTTCTCTCCGACCGAATCTTACGGT
GACATCTCTGAATTCTACCTGGACGTTGAAAAACAGGGTTACCGTATGCACTTCG
AAAACATCTCTGCGGAAACCATCGACGAATACGTTGAAAAAGGTGACCTGTTCCT
GTTCCAGATCTACAACAAAGACTTCGTTAAAGCGGCGACCGGTAAAAAAGACATG
CACACCATCTACTGGAACGCGGCGTTCTCTCCGGAAAACCTGCAGGACGTTGTTG
TTAAACTGAACGGTGAAGCGGAACTGTTCTACCGTGACAAATCTGACATCAAAGA
AATCGTTCACCGTGAAGGTGAAATCCTGGTTAACCGTACCTACAACGGTCGTACC
CCGGTTCCGGACAAAATCCACAAAAAACTGACCGACTACCACAACGGTCGTACCA
AAGACCTGGGTGAAGCGAAAGAATACCTGGACAAAGTTCGTTACTTCAAAGCGCA
CTACGACATCACCAAAGACCGTCGTTACCTGAACGACAAAATCTACTTCCACGTT
CCGCTGACCCTGAACTTCAAAGCGAACGGTAAAAAAAACCTGAACAAAATGGTTA
TCGAAAAATTCCTGTCTGACGAAAAAGCGCACATCATCGGTATCGACCGTGGTGA
ACGTAACCTGCTGTACTACTCTATCATCGACCGTTCTGGTAAAATCATCGACCAG
CAGTCTCTGAACGTTATCGACGGTTTCGACTACCGTGAAAAACTGAACCAGCGTG
AAATCGAAATGAAAGACGCGCGTCAGTCTTGGAACGCGATCGGTAAAATCAAAGA
CCTGAAAGAAGGTTACCTGTCTAAAGCGGTTCACGAAATCACCAAAATGGCGATC
CAGTACAACGCGATCGTTGTTATGGAAGAACTGAACTACGGTTTCAAACGTGGTC
GTTTCAAAGTTGAAAAACAGATCTACCAGAAATTCGAAAACATGCTGATCGACAA
AATGAACTACCTGGTTTTCAAAGACGCGCCGGACGAATCTCCGGGTGGTGTTCTG
AACGCGTACCAGCTGACCAACCCGCTGGAATCTTTCGCGAAACTGGGTAAACAGA
CCGGTATCCTGTTCTACGTTCCGGCGGCGTACACCTCTAAAATCGACCCGACCAC
CGGTTTCGTTAACCTGTTCAACACCTCTTCTAAAACCAACGCGCAGGAACGTAAA
GAATTCCTGCAGAAATTCGAATCTATCTCTTACTCTGCGAAAGACGGTGGTATCT
TCGCGTTCGCGTTCGACTACCGTAAATTCGGTACCTCTAAAACCGACCACAAAAA
CGTTTGGACCGCGTACACCAACGGTGAACGTATGCGTTACATCAAAGAAAAAAAA
CGTAACGAACTGTTCGACCCGTCTAAAGAAATCAAAGAAGCGCTGACCTCTTCTG
GTATCAAATACGACGGTGGTCAGAACATCCTGCCGGACATCCTGCGTTCTAACAA
CAACGGTCTGATCTACACCATGTACTCTTCTTTCATCGCGGCGATCCAGATGCGT
GTTTACGACGGTAAAGAAGACTACATCATCTCTCCGATCAAAAACTCTAAAGGTG
AATTCTTCCGTACCGACCCGAAACGTCGTGAACTGCCGATCGACGCGGACGCGAA
CGGTGCGTACAACATCGCGCTGCGTGGTGAACTGACCATGCGTGCGATCGCGGAA
AAATTCGACCCGGACTCTGAAAAAATGGCGAAACTGGAACTGAAACACAAAGACT
GGTTCGAATTCATGCAGACCCGTGGTGACTAA
SEQ ATGACTAAAACATTTGATTCAGAGTTTTTTAATTTGTACTCGCTGCAAAAAACGG
ID TACGCTTTGAGTTAAAACCCGTGGGAGAAACCGCGTCATTTGTGGAAGACTTTAA
NO: AAACGAGGGCTTGAAACGTGTTGTGAGCGAAGATGAAAGGCGAGCCGTCGATTAC
44 CAGAAAGTTAAGGAAATAATTGACGATTACCATCGGGATTTCATTGAAGAAAGTT
TAAATTATTTTCCGGAACAGGTGAGTAAAGATGCTCTTGAGCAGGCGTTTCATCT
TTATCAGAAACTGAAGGCAGCAAAAGTTGAGGAAAGGGAAAAAGCGCTGAAAGAA
TGGGAAGCGCTGCAGAAAAAGCTACGTGAAAAAGTGGTGAAATGCTTCTCGGACT
CGAATAAAGCCCGCTTCTCAAGGATTGATAAAAAGGAACTGATTAAGGAAGACCT
GATAAATTGGTTGGTCGCCCAGAATCGCGAGGATGATATCCCTACGGTCGAAACG
TTTAACAACTTCACCACATATTTTACCGGCTTCCATGAGAATCGTAAAAATATTT
ACTCCAAAGATGATCACGCCACCGCTATTAGCTTTCGCCTTATTCATGAAAATCT
TCCAAAGTTTTTTGACAACGTGATTAGCTTCAATAAGTTGAAAGAGGGTTTCCCT
GAATTAAAATTTGATAAAGTGAAAGAGGATTTAGAAGTAGATTATGATCTGAAGC
ATGCGTTTGAAATAGAATATTTCGTTAACTTCGTGACCCAAGCGGGCATAGATCA
GTATAATTATCTGTTAGGAGGGAAAACCCTGGAGGACGGGACGAAAAAACAAGGG
ATGAATGAGCAAATTAATCTGTTCAAACAACAGCAAACGCGAGATAAAGCGCGTC
AGATTCCCAAACTGATCCCCCTGTTCAAACAGATTCTTAGCGAAAGGACTGAAAG
CCAGTCCTTTATTCCTAAACAATTTGAAAGTGATCAGGAGTTGTTCGATTCACTG
CAGAAGTTACATAATAACTGCCAGGATAAATTCACCGTGCTGCAACAAGCCATTC
TCGGTCTGGCAGAGGCGGATCTTAAGAAGGTCTTCATCAAAACCTCTGATTTAAA
TGCCTTATCTAACACCATTTTCGGGAATTACAGCGTCTTTTCCGATGCACTGAAC
CTGTATAAAGAAAGCCTGAAAACGAAAAAAGCGCAGGAGGCTTTTGAGAAACTAC
CGGCCCATTCTATTCACGACCTCATTCAATACTTGGAACAGTTCAATTCCAGCCT
GGACGCGGAAAAACAACAGAGCACCGACACCGTCCTGAACTACTTCATCAAGACC
GATGAATTATATTCTCGCTTCATTAAATCCACTAGCGAGGCTTTCACTCAGGTGC
AGCCTTTGTTCGAACTGGAAGCCCTGTCATCTAAGCGCCGCCCACCGGAATCGGA
AGATGAAGGGGCAAAAGGGCAGGAAGGCTTCGAGCAGATCAAGCGTATTAAAGCT
TACCTGGATACGCTTATGGAAGCGGTACACTTTGCAAAGCCGTTGTATCTTGTTA
AGGGTCGTAAAATGATCGAAGGGCTCGATAAAGACCAGTCCTTTTATGAAGCGTT
TGAAATGGCGTACCAAGAACTTGAATCGTTAATCATTCCTATCTATAACAAAGCG
CGGAGCTATCTGTCGCGGAAACCTTTCAAGGCCGATAAATTCAAGATTAATTTTG
ACAACAACACGCTACTGAGCGGATGGGATGCGAACAAGGAAACTGCTAACGCGTC
CATTCTGTTTAAGAAAGACGGGTTATATTACCTTGGAATTATGCCGAAAGGTAAG
ACCTTTCTCTTTGACTACTTTGTATCGAGCGAGGATTCAGAGAAACTGAAACAGC
GTCGCCAGAAGACCGCCGAAGAAGCTCTGGCGCAGGATGGTGAAAGTTACTTCGA
AAAAATTCGTTATAAACTGTTACCAGGGGCTTCAAAGATGTTACCGAAAGTCTTT
TTTAGCAACAAAAATATTGGCTTTTACAACCCGTCGGATGACATTTTACGCATTC
GCAACACAGCCTCTCACACCAAAAACGGGACCCCTCAGAAAGGCCACTCAAAAGT
TGAGTTTAACCTGAATGATTGTCATAAGATGATTGATTTCTTCAAATCATCAATT
CAGAAACACCCGGAATGGGGGTCTTTTGGCTTTACGTTTTCTGATACCAGTGATT
TTGAAGACATGAGTGCCTTCTACCGGGAAGTAGAAAACCAGGGTTACGTAATTAG
CTTTGACAAAATCAAAGAGACCTATATACAGAGCCAGGTGGAACAGGGTAATCTC
TACTTATTCCAGATTTATAACAAGGATTTCTCGCCCTACAGCAAAGGCAAACCAA
ACCTGCATACTCTGTACTGGAAAGCCCTGTTTGAAGAAGCGAACCTGAATAACGT
AGTGGCGAAGTTGAACGGTGAAGCGGAAATCTTCTTCCGTCGTCACTCCATTAAG
GCCTCTGATAAAGTTGTCCATCCGGCAAATCAGGCCATTGATAATAAGAATCCAC
ACACGGAAAAAACGCAGTCAACCTTTGAATATGACCTCGTTAAAGACAAACGCTA
CACGCAAGATAAGTTCTTTTTCCACGTCCCAATCAGCCTCAACTTTAAAGCACAA
GGGGTTTCAAAGTTTAATGATAAAGTCAATGGGTTCCTCAAGGGCAACCCGGATG
TCAACATTATAGGTATAGACAGGGGCGAACGCCATCTGCTTTACTTTACCGTAGT
GAATCAGAAAGGTGAAATACTGGTTCAGGAATCATTAAATACCTTGATGTCGGAC
AAAGGGCACGTTAATGATTACCAGCAGAAACTGGATAAAAAAGAACAGGAACGTG
ATGCTGCGCGTAAATCGTGGACCACGGTTGAGAACATTAAAGAGCTGAAAGAGGG
GTATCTAAGCCATGTGGTACACAAACTGGCGCACCTCATCATTAAATATAACGCA
ATAGTCTGCCTAGAAGACTTGAATTTTGGCTTTAAACGCGGCCGCTTCAAAGTGG
AAAAACAAGTTTATCAAAAATTTGAAAAGGCGCTTATAGATAAACTGAATTATCT
GGTTTTTAAAGAAAAGGAACTTGGTGAGGTAGGGCACTACTTGACAGCTTATCAA
CTGACGGCCCCGTTCGAATCATTCAAAAAACTGGGCAAACAGTCTGGCATTCTGT
TTTACGTGCCGGCAGATTATACTTCAAAAATCGATCCAACAACTGGCTTTGTGAA
CTTCCTGGACCTGAGATATCAGTCTGTAGAAAAAGCTAAACAACTTCTTAGCGAT
TTTAATGCCATTCGTTTTAACAGCGTTCAGAATTACTTTGAATTCGAAATTGACT
ATAAAAAACTTACTCCGAAACGTAAAGTCGGAACCCAAAGTAAATGGGTAATTTG
TACGTATGGCGATGTCAGGTATCAGAACCGTCGGAATCAAAAAGGTCATTGGGAG
ACCGAAGAAGTGAACGTGACCGAAAAGCTGAAGGCTCTGTTCGCCAGCGATTCAA
AAACTACAACTGTGATCGATTACGCAAATGATGATAACCTGATAGATGTGATTTT
AGAGCAGGATAAAGCCAGCTTTTTTAAAGAACTGTTGTGGCTCCTGAAACTTACG
ATGACCTTACGACATTCCAAGATCAAATCGGAAGATGATTTTATTCTGTCACCGG
TCAAGAATGAGCAGGGTGAATTCTATGATAGTAGGAAAGCCGGCGAAGTGTGGCC
GAAAGACGCCGACGCCAATGGCGCCTATCATATCGCGCTCAAAGGGCTTTGGAAT
TTGCAGCAGATTAACCAGTGGGAAAAAGGTAAAACCCTGAATCTGGCTATCAAAA
ACCAGGATTGGTTTAGCTTTATCCAAGAGAAACCGTATCAGGAATGA
SEQ ATGCATACAGGCGGTCTTCTTAGTATGGACGCGAAAGAGTTCACAGGTCAGTATC
ID CGTTGTCGAAAACATTACGATTCGAACTTCGGCCCATCGGCCGCACGTGGGATAA
NO: CCTGGAGGCCTCAGGCTACTTAGCGGAAGACCGCCATCGTGCCGAATGTTATCCT
45 CGTGCGAAAGAGTTATTGGATGACAACCATCGTGCCTTCCTGAATCGTGTGTTGC
CACAAATCGATATGGATTGGCACCCGATTGCGGAGGCCTTTTGTAAGGTACATAA
AAACCCTGGTAATAAAGAACTTGCCCAGGATTACAACCTTCAGTTGTCAAAGCGC
CGTAAGGAGATCAGCGCATATCTTCAGGATGCAGATGGCTATAAAGGCCTGTTCG
CGAAGCCCGCCTTAGACGAAGCTATGAAAATTGCGAAAGAAAACGGGAACGAAAG
TGATATTGAGGTTCTCGAAGCGTTTAACGGTTTTAGCGTATACTTCACCGGTTAT
CATGAGTCACGCGAGAACATTTATAGCGATGAGGATATGGTGAGCGTAGCCTACC
GAATTACTGAGGATAATTTCCCGCGCTTTGTCTCAAACGCTTTGATCTTTGATAA
ATTAAACGAAAGCCATCCGGATATTATCTCTGAAGTATCGGGCAATCTTGGAGTT
GATGACATTGGTAAGTACTTTGACGTGTCGAACTATAACAATTTTCTTTCCCAGG
CCGGTATAGATGACTACAATCACATTATTGGCGGCCATACAACCGAAGACGGACT
GATACAAGCGTTTAATGTCGTATTGAACTTACGTCACCAAAAAGACCCTGGCTTT
GAAAAAATTCAGTTCAAACAGCTCTACAAACAAATCCTGAGCGTGCGTACCAGCA
AAAGCTACATCCCGAAACAGTTTGACAACTCTAAGGAGATGGTTGACTGCATTTG
CGATTATGTCAGCAAAATAGAGAAATCCGAAACAGTAGAACGGGCCCTGAAACTA
GTCCGTAATATCAGTTCTTTCGACTTGCGCGGGATCTTTGTCAATAAAAAGAACT
TGCGCATACTGAGCAACAAACTGATAGGAGATTGGGACGCGATCGAAACCGCATT
GATGCATAGTTCTTCATCAGAAAACGATAAGAAAAGCGTATATGATAGCGCGGAG
GCTTTTACGTTGGATGACATCTTTTCAAGCGTGAAAAAATTTTCTGATGCCTCTG
CCGAAGATATTGGCAACAGGGCGGAAGACATCTGTAGAGTGATAAGTGAGACGGC
CCCTTTTATCAACGATCTGCGAGCGGTGGACCTGGATAGCCTGAACGACGATGGT
TATGAAGCGGCCGTCTCAAAAATTCGGGAGTCGCTGGAGCCTTATATGGATCTTT
TCCATGAACTGGAAATTTTCTCGGTTGGCGATGAGTTCCCAAAATGCGCAGCATT
TTACAGCGAACTGGAGGAAGTCAGCGAACAGCTGATCGAAATTATTCCGTTATTC
AACAAGGCGCGTTCGTTCTGCACCCGGAAACGCTATAGCACCGATAAGATTAAAG
TGAACTTAAAATTCCCGACCTTGGCGGACGGGTGGGACCTGAACAAAGAGAGAGA
CAACAAAGCCGCGATTCTGCGGAAAGACGGTAAGTATTATCTGGCAATTCTGGAT
ATGAAGAAAGATCTGTCAAGCATTAGGACCAGCGACGAAGATGAATCCAGCTTCG
AAAAGATGGAGTATAAACTGTTACCGAGTCCAGTAAAAATGCTGCCAAAGATATT
CGTAAAATCGAAAGCCGCTAAGGAAAAATATGGCCTGACAGATCGTATGCTTGAA
TGCTACGATAAAGGTATGCATAAGTCGGGTAGTGCGTTTGATCTTGGCTTTTGCC
ATGAACTCATTGATTATTACAAGCGTTGTATCGCGGAGTACCCAGGCTGGGATGT
GTTCGATTTCAAGTTTCGCGAAACTTCCGATTATGGGTCCATGAAAGAGTTCAAT
GAAGATGTGGCCGGAGCCGGTTACTATATGAGTCTGAGAAAAATTCCGTGCAGCG
AAGTGTACCGTCTGTTAGACGAGAAATCGATTTATCTATTTCAAATTTATAACAA
AGATTACTCTGAAAATGCACATGGTAATAAGAACATGCATACCATGTACTGGGAG
GGTCTCTTTTCCCCGCAAAACCTGGAGTCGCCCGTTTTCAAGTTGTCGGGTGGGG
CAGAACTTTTCTTTCGAAAATCCTCAATCCCTAACGATGCCAAAACAGTACACCC
GAAAGGCTCAGTGCTGGTTCCACGTAATGATGTTAACGGTCGGCGTATTCCAGAT
TCAATCTACCGCGAACTGACACGCTATTTTAACCGTGGCGATTGCCGAATCAGTG
ACGAAGCCAAAAGTTATCTTGACAAGGTTAAGACTAAAAAAGCGGACCATGACAT
TGTGAAAGATCGCCGCTTTACCGTGGATAAAATGATGTTCCACGTCCCGATTGCG
ATGAACTTTAAGGCGATCAGTAAACCGAACTTAAACAAAAAAGTCATTGATGGCA
TCATTGATGATCAGGATCTGAAAATCATTGGTATTGATCGTGGCGAGCGGAACTT
AATTTACGTCACGATGGTTGACAGAAAAGGGAATATCTTATATCAGGATTCTCTT
AACATCCTCAATGGCTACGACTATCGTAAAGCTCTGGATGTGCGCGAATATGACA
ACAAGGAAGCGCGTCGTAACTGGACTAAAGTGGAGGGCATTCGCAAAATGAAGGA
AGGCTATCTGTCATTAGCGGTCTCGAAATTAGCGGATATGATTATCGAAAATAAC
GCCATCATCGTTATGGAGGACCTGAACCACGGATTCAAAGCGGGCCGCTCAAAGA
TTGAAAAACAAGTTTATCAGAAATTTGAGAGTATGCTGATTAACAAACTGGGCTA
TATGGTGTTAAAAGACAAGTCAATTGACCAATCAGGTGGCGCGCTGCATGGATAC
CAGCTGGCGAACCATGTTACCACCTTAGCATCAGTTGGAAAGCAGTGTGGGGTTA
TCTTTTATATACCGGCAGCGTTCACTAGTAAAATAGATCCGACCACTGGTTTCGC
CGATCTCTTTGCCCTGAGTAACGTTAAAAACGTAGCGAGCATGCGTGAATTCTTT
TCCAAAATGAAATCTGTCATTTATGATAAAGCTGAAGGCAAATTCGCATTCACCT
TTGATTACTTGGATTACAACGTGAAGAGCGAATGTGGTCGTACGCTGTGGACCGT
TTACACCGTTGGTGAGCGCTTCACCTATTCCCGTGTGAACCGCGAATATGTACGT
AAAGTCCCCACCGATATTATCTATGATGCCCTCCAGAAAGCAGGCATTAGCGTCG
AAGGAGACTTAAGGGACAGAATTGCCGAAAGCGATGGCGATACGCTGAAGTCTAT
TTTTTACGCATTCAAATACGCGCTAGATATGCGCGTTGAGAATCGCGAGGAAGAC
TACATTCAATCACCTGTGAAAAATGCCTCTGGGGAATTTTTTTGTTCAAAAAATG
CTGGTAAAAGCCTCCCACAAGATAGCGATGCAAACGGTGCATATAACATTGCCCT
GAAAGGTATTCTTCAATTACGCATGCTGTCTGAGCAGTACGACCCCAACGCGGAA
TCTATTAGACTTCCGCTGATAACCAATAAAGCCTGGCTGACATTCATGCAGTCTG
GCATGAAGACCTGGAAAAATTAG
SEQ ATGGATAGTTTAAAAGATTTTACGAATCTATATCCCGTAAGCAAAACTCTTCGTT
ID TTGAACTGAAACCTGTTGGAAAAACGTTGGAGAATATCGAGAAAGCGGGCATCCT
NO: GAAAGAAGACGAGCACCGTGCCGAAAGCTACAGGCGTGTCAAAAAGATTATCGAT
46 ACTTATCACAAAGTGTTCATTGATAGCAGTCTGGAGAACATGGCAAAAATGGGCA
TAGAAAATGAAATCAAAGCAATGCTGCAGAGCTTTTGCGAGCTCTACAAGAAAGA
TCACCGAACGGAAGGTGAAGATAAAGCACTGGACAAAATTCGCGCCGTTCTTCGC
GGTCTGATTGTTGGCGCGTTCACCGGCGTGTGCGGCCGCCGTGAAAACACCGTGC
AGAACGAAAAGTACGAGTCGCTGTTCAAAGAAAAACTGATAAAAGAAATTTTGCC
TGACTTTGTGCTTTCGACCGAAGCGGAATCCCTGCCATTTTCTGTCGAAGAAGCG
ACCCGCAGCCTGAAAGAATTTGACTCATTCACAAGTTACTTTGCAGGCTTCTACG
AAAACCGTAAAAACATCTACAGCACGAAGCCACAGAGCACGGCTATTGCTTATCG
CCTGATTCATGAGAACCTGCCGAAGTTCATCGATAACATCCTTGTTTTTCAAAAA
ATTAAAGAGCCGATTGCGAAAGAGTTAGAACATATTCGAGCTGACTTTTCTGCGG
GTGGGTACATTAAAAAAGATGAGCGGCTGGAAGACATCTTCAGTCTAAACTATTA
TATCCACGTTCTGTCGCAGGCAGGCATTGAGAAATATAATGCGCTGATTGGTAAG
ATTGTCACAGAAGGCGATGGTGAGATGAAAGGTCTTAATGAACATATCAATCTGT
ATAACCAGCAGCGTGGTCGCGAAGACCGTCTTCCACTGTTCCGCCCACTGTATAA
ACAGATCCTGTCTGACCGGGAACAGCTGTCCTACCTGCCGGAAAGCTTTGAAAAG
GATGAAGAGCTACTTCGCGCATTAAAGGAGTTTTACGACCATATTGCGGAAGACA
TTTTGGGTAGAACGCAGCAACTGATGACGTCAATTTCTGAATACGATCTGAGTAG
AATCTACGTTAGGAATGATAGCCAGCTGACCGATATTAGCAAAAAAATGCTGGGC
GACTGGAACGCTATCTATATGGCACGTGAACGTGCATATGATCATGAACAAGCAC
CGAAACGTATAACCGCGAAATATGAGCGTGATCGCATTAAGGCGCTAAAGGGAGA
AGAAAGCATCTCACTCGCAAACCTGAACTCCTGTATCGCTTTCTTAGATAACGTG
CGCGATTGTCGCGTCGACACGTATCTGTCAACCCTTGGGCAGAAAGAGGGTCCAC
ATGGTCTGTCTAACCTGGTGGAAAATGTCTTTGCGAGTTACCATGAAGCGGAACA
ACTGCTGTCTTTTCCATACCCCGAAGAAAACAATCTAATACAGGATAAAGATAAC
GTGGTGTTAATCAAAAACCTGCTGGACAACATCAGCGATCTGCAACGTTTCCTGA
AACCTTTGTGGGGTATGGGTGACGAGCCAGACAAAGACGAACGTTTTTATGGTGA
GTATAATTATATACGTGGCGCCCTTGACCAAGTTATTCCGCTGTATAACAAAGTA
CGGAACTATCTGACCCGTAAGCCATATTCTACCCGTAAAGTGAAACTGAACTTTG
GCAACTCGCAACTGCTGTCGGGTTGGGATCGTAACAAAGAAAAAGATAATAGTTG
TGTTATCCTGCGTAAGGGACAAAATTTTTACCTCGCGATTATGAACAACAGACAC
AAGCGTTCATTTGAAAATAAGGTTCTGCCGGAGTATAAAGAGGGCGAACCGTACT
TCGAGAAAATGGATTATAAGTTCTTACCAGACCCTAATAAGATGTTACCGAAAGT
CTTTCTTTCGAAAAAAGGCATAGAAATCTATAAGCCGTCCCCGAAATTACTCGAA
CAGTATGGGCACGGGACCCACAAGAAAGGGGATACTTTTAGCATGGACGATCTGC
ACGAACTGATCGATTTTTTTAAACACTCCATCGAAGCCCATGAAGACTGGAAACA
GTTTGGGTTCAAGTTCTCTGATACAGCCACATACGAGAATGTGTCTAGTTTTTAT
CGGGAAGTGGAGGATCAGGGCTACAAACTTAGTTTTCGTAAAGTTTCAGAGAGTT
ATGTTTATAGTTTAATTGATCAGGGAAAACTTTACCTGTTCCAGATCTACAACAA
AGATTTCTCGCCATGTAGTAAGGGTACCCCGAATCTGCATACACTCTATTGGAGA
ATGTTATTCGATGAGCGTAACTTAGCGGATGTCATTTATAAATTGGACGGGAAAG
CAGAGATCTTTTTTCGTGAAAAATCACTGAAGAATGACCACCCGACTCATCCGGC
CGGGAAACCGATCAAAAAAAAATCCCGCCAGAAAAAAGGAGAAGAGTCTCTGTTT
GAATATGATCTGGTGAAAGACCGTCATTACACTATGGATAAATTTCAATTTCATG
TTCCAATTACAATGAACTTCAAATGTTCGGCGGGTTCCAAAGTAAATGATATGGT
AAACGCCCATATTCGCGAAGCGAAAGATATGCATGTTATTGGCATCGATAGAGGC
GAAAGAAACCTGCTTTATATTTGCGTAATTGACAGCCGTGGTACCATTCTGGACC
AGATCTCTTTAAACACCATCAATGACATCGATTATCACGACCTGTTGGAGTCTCG
GGACAAGGACCGCCAGCAGGAGCGCCGTAATTGGCAGACAATTGAAGGCATAAAA
GAATTAAAACAGGGTTACCTTTCCCAGGCCGTACACCGCATAGCGGAACTGATGG
TGGCCTACAAAGCCGTAGTTGCCCTGGAAGACTTGAATATGGGGTTTAAACGTGG
CCGTCAAAAAGTCGAGAGCAGCGTGTATCAGCAATTTGAAAAACAGTTGATTGAC
AAGTTGAATTATTTGGTTGATAAAAAGAAACGTCCAGAAGATATTGGTGGCTTAC
TGCGTGCATACCAGTTTACGGCACCTTTTAAGTCCTTCAAAGAAATGGGTAAACA
GAACGGGTTTCTGTTTTACATCCCGGCCTGGAATACATCCAACATCGATCCTACC
ACCGGGTTTGTCAACCTGTTTCATGCACAATATGAAAACGTGGATAAAGCGAAGA
GTTTTTTCCAAAAATTCGATAGTATTTCGTATAACCCAAAAAAAGATTGGTTTGA
GTTTGCGTTCGATTATAAAAATTTTACTAAAAAGGCTGAGGGATCCCGCAGTATG
TGGATCCTCTGCACCCATGGCAGTCGTATTAAAAATTTTCGTAATTCGCAAAAGA
ATGGCCAGTGGGACTCGGAAGAGTTTGCCCTGACCGAAGCGTTCAAATCGCTGTT
TGTACGCTACGAAATTGACTACACAGCAGATCTGAAAACAGCCATCGTCGATGAA
AAACAGAAAGATTTTTTTGTAGATCTCCTAAAACTGTTCAAACTGACTGTTCAGA
TGCGCAATTCCTGGAAAGAGAAAGACCTGGATTATCTGATTAGCCCGGTAGCCGG
TGCTGATGGACGATTTTTCGATACTCGTGAAGGTAACAAAAGTCTCCCGAAAGAT
GCTGATGCCAATGGTGCATACAATATTGCATTAAAGGGGCTATGGGCCTTGCGAC
AGATCCGCCAGACCAGCGAAGGCGGCAAGCTGAAATTGGCCATATCGAATAAGGA
ATGGTTACAATTTGTTCAGGAACGTAGCTATGAAAAAGATTGA
SEQ ATGAACAACGGCACAAATAATTTTCAGAACTTCATCGGGATCTCAAGTTTGCAGA
ID AAACGCTGCGCAATGCTCTGATCCCCACGGAAACCACGCAACAGTTCATCGTCAA
NO: GAACGGAATAATTAAAGAAGATGAGTTACGTGGCGAGAACCGCCAGATTCTGAAA
47 GATATCATGGATGACTACTACCGCGGATTCATCTCTGAGACTCTGAGTTCTATTG
ATGACATAGATTGGACTAGCCTGTTCGAAAAAATGGAAATTCAGCTGAAAAATGG
TGATAATAAAGATACCTTAATTAAGGAACAGACAGAGTATCGGAAAGCAATCCAT
AAAAAATTTGCGAACGACGATCGGTTTAAGAACATGTTTAGCGCCAAACTGATTA
GTGACATATTACCTGAATTTGTCATCCACAACAATAATTATTCGGCATCAGAGAA
AGAGGAAAAAACCCAGGTGATAAAATTGTTTTCGCGCTTTGCGACTAGCTTTAAA
GATTACTTCAAGAACCGTGCAAATTGCTTTTCAGCGGACGATATTTCATCAAGCA
GCTGCCATCGCATCGTCAACGACAATGCAGAGATATTCTTTTCAAATGCGCTGGT
CTACCGCCGGATCGTAAAATCGCTGAGCAATGACGATATCAACAAAATTTCGGGC
GATATGAAAGATTCATTAAAAGAAATGAGTCTGGAAGAAATATATTCTTACGAGA
AGTATGGGGAATTTATTACCCAGGAAGGCATTAGCTTCTATAATGATATCTGTGG
GAAAGTGAATTCTTTTATGAACCTGTATTGTCAGAAAAATAAAGAAAACAAAAAT
TTATACAAACTTCAGAAACTTCACAAACAGATTCTATGCATTGCGGACACTAGCT
ATGAGGTCCCGTATAAATTTGAAAGTGACGAGGAAGTGTACCAATCAGTTAACGG
CTTCCTTGATAACATTAGCAGCAAACATATAGTCGAAAGATTACGCAAAATCGGC
GATAACTATAACGGCTACAACCTGGATAAAATTTATATCGTGTCCAAATTTTACG
AGAGCGTTAGCCAAAAAACCTACCGCGACTGGGAAACAATTAATACCGCCCTCGA
AATTCATTACAATAATATCTTGCCGGGTAACGGTAAAAGTAAAGCCGACAAAGTA
AAAAAAGCGGTTAAGAATGATTTACAGAAATCCATCACCGAAATAAATGAACTAG
TGTCAAACTATAAGCTGTGCAGTGACGACAACATCAAAGCGGAGACTTATATACA
TGAGATTAGCCATATCTTGAATAACTTTGAAGCACAGGAATTGAAATACAATCCG
GAAATTCACCTAGTTGAATCCGAGCTCAAAGCGAGTGAGCTTAAAAACGTGCTGG
ACGTGATCATGAATGCGTTTCATTGGTGTTCGGTTTTTATGACTGAGGAACTTGT
TGATAAAGACAACAATTTTTATGCGGAACTGGAGGAGATTTACGATGAAATTTAT
CCAGTAATTAGTCTGTACAACCTGGTTCGTAACTACGTTACCCAGAAACCGTACA
GCACGAAAAAGATTAAATTGAACTTTGGAATACCGACGTTAGCAGACGGTTGGTC
AAAGTCCAAAGAGTATTCTAATAACGCTATCATACTGATGCGCGACAATCTGTAT
TATCTGGGCATCTTTAATGCGAAGAATAAACCGGACAAGAAGATTATCGAGGGTA
ATACGTCAGAAAATAAGGGTGACTACAAAAAGATGATTTATAATTTGCTCCCGGG
TCCCAACAAAATGATCCCGAAAGTTTTCTTGAGCAGCAAGACGGGGGTGGAAACG
TATAAACCGAGCGCCTATATCCTAGAGGGGTATAAACAGAATAAACATATCAAGT
CTTCAAAAGACTTTGATATCACTTTCTGTCATGATCTGATCGACTACTTCAAAAA
CTGTATTGCAATTCATCCCGAGTGGAAAAACTTCGGTTTTGATTTTAGCGACACC
AGTACTTATGAAGACATTTCCGGGTTTTATCGTGAGGTAGAGTTACAAGGTTACA
AGATTGATTGGACATACATTAGCGAAAAAGACATTGATCTGCTGCAGGAAAAAGG
TCAACTGTATCTGTTCCAGATATATAACAAAGATTTTTCGAAAAAATCAACCGGG
AATGACAACCTTCACACCATGTACCTGAAAAATCTTTTCTCAGAAGAAAATCTTA
AGGATATCGTCCTGAAACTTAACGGCGAAGCGGAAATCTTCTTCAGGAAGAGCAG
CATAAAGAACCCAATCATTCATAAAAAAGGCTCGATTTTAGTCAACCGTACCTAC
GAAGCAGAAGAAAAAGACCAGTTTGGCAACATTCAAATTGTGCGTAAAAATATTC
CGGAAAACATTTATCAGGAGCTGTACAAATACTTCAACGATAAAAGCGACAAAGA
GCTGTCTGATGAAGCAGCCAAACTGAAGAATGTAGTGGGACACCACGAGGCAGCG
ACGAATATAGTCAAGGACTATCGCTACACGTATGATAAATACTTCCTTCATATGC
CTATTACGATCAATTTCAAAGCCAATAAAACGGGTTTTATTAATGATAGGATCTT
ACAGTATATCGCTAAAGAAAAAGACTTACATGTGATCGGCATTGATCGGGGCGAG
CGTAACCTGATCTACGTGTCCGTGATTGATACTTGTGGTAATATAGTTGAACAGA
AAAGCTTTAACATTGTAAACGGCTACGACTATCAGATAAAACTGAAACAACAGGA
GGGCGCTAGACAGATTGCGCGGAAAGAATGGAAAGAAATTGGTAAAATTAAAGAG
ATCAAAGAGGGCTACCTGAGCTTAGTAATCCACGAGATCTCTAAAATGGTAATCA
AATACAATGCAATTATAGCGATGGAGGATTTGTCTTATGGTTTTAAAAAAGGGCG
CTTTAAGGTCGAACGGCAAGTTTACCAGAAATTTGAAACCATGCTCATCAATAAA
CTCAACTATCTGGTATTTAAAGATATTTCGATTACCGAGAATGGCGGTCTCCTGA
AAGGTTATCAGCTGACATACATTCCTGATAAACTTAAAAACGTGGGTCATCAGTG
CGGCTGCATTTTTTATGTGCCTGCTGCATACACGAGCAAAATTGATCCGACCACC
GGCTTTGTGAATATCTTTAAATTTAAAGACCTGACAGTGGACGCAAAACGTGAAT
TCATTAAAAAATTTGACTCAATTCGTTATGACAGTGAAAAAAATCTGTTCTGCTT
TACATTTGACTACAATAACTTTATTACGCAAAACACGGTCATGAGCAAATCATCG
TGGAGTGTGTATACATACGGCGTGCGCATCAAACGTCGCTTTGTGAACGGCCGCT
TCTCAAACGAAAGTGATACCATTGACATAACCAAAGATATGGAGAAAACGTTGGA
AATGACGGACATTAACTGGCGCGATGGCCACGATCTTCGTCAAGACATTATAGAT
TATGAAATTGTTCAGCACATATTCGAAATTTTCCGTTTAACAGTGCAAATGCGTA
ACTCCTTGTCTGAACTGGAGGACCGTGATTACGATCGTCTCATTTCACCTGTACT
GAACGAAAATAACATTTTTTATGACAGCGCGAAAGCGGGGGATGCACTTCCTAAG
GATGCCGATGCAAATGGTGCGTATTGTATTGCATTAAAAGGGTTATATGAAATTA
AACAAATTACCGAAAATTGGAAAGAAGATGGTAAATTTTCGCGCGATAAACTCAA
AATCAGCAATAAAGATTGGTTCGACTTTATCCAGAATAAGCGCTATCTCTAA
SEQ ATGACCAATAAATTCACTAACCAGTATTCTCTCTCTAAGACCCTGCGCTTTGAAC
ID TGATTCCGCAGGGGAAAACCTTGGAGTTCATTCAAGAAAAAGGCCTCTTGTCTCA
NO: GGATAAACAGAGGGCTGAATCTTACCAAGAAATGAAGAAAACTATTGATAAGTTT
48 CATAAATATTTCATTGATTTAGCCTTGTCTAACGCCAAATTAACTCACTTGGAAA
CGTATCTGGAGTTATACAACAAATCTGCCGAAACTAAGAAAGAACAGAAATTTAA
AGACGATTTGAAAAAAGTACAGGACAATCTGCGTAAAGAAATTGTCAAATCCTTC
AGTGACGGCGATGCTAAAAGCATTTTTGCCATTCTGGACAAAAAAGAGTTGATTA
CTGTGGAATTAGAAAAGTGGTTTGAAAACAATGAGCAGAAAGACATCTACTTCGA
TGAGAAATTCAAAACTTTCACCACCTATTTTACAGGATTTCATCAAAACCGGAAG
AACATGTACTCAGTAGAACCGAACTCCACGGCCATTGCGTATCGTTTGATCCATG
AGAATCTGCCTAAATTTCTGGAGAATGCGAAAGCCTTTGAAAAGATTAAGCAGGT
CGAATCGCTGCAAGTGAATTTTCGTGAACTCATGGGCGAATTTGGTGACGAAGGT
CTAATCTTCGTTAACGAACTGGAAGAAATGTTTCAGATTAATTACTACAATGACG
TGCTATCGCAGAACGGTATCACAATCTACAATAGTATTATCTCAGGGTTCACAAA
AAACGATATAAAATACAAAGGCCTGAACGAGTATATCAATAACTACAACCAAACA
AAGGACAAAAAGGATAGGCTTCCGAAACTGAAGCAGTTATACAAACAGATTTTAT
CTGACAGAATCTCCCTGAGCTTTCTGCCGGATGCTTTCACTGATGGGAAGCAGGT
TCTGAAAGCGATTTTCGATTTTTATAAGATTAACTTACTGAGCTACACGATTGAA
GGTCAAGAAGAATCTCAAAACTTACTGCTCTTGATCCGTCAAACCATTGAAAATC
TATCATCGTTCGATACGCAGAAAATCTACCTCAAAAACGATACTCACCTGACTAC
GATCTCTCAGCAGGTTTTCGGGGATTTTAGTGTATTTTCAACAGCTCTGAACTAC
TGGTATGAAACCAAAGTCAATCCGAAATTCGAGACGGAATATTCTAAGGCCAACG
AAAAAAAACGTGAGATTCTTGATAAAGCTAAAGCCGTATTTACTAAACAGGATTA
CTTTTCTATTGCTTTCCTGCAGGAAGTTTTATCGGAGTATATCCTGACCCTGGAT
CATACATCTGATATCGTTAAAAAACACAGCAGCAATTGCATCGCTGACTATTTCA
AAAACCACTTTGTCGCCAAAAAAGAAAACGAAACAGACAAGACTTTCGATTTCAT
TGCTAACATCACCGCAAAATACCAGTGTATTCAGGGTATCTTGGAAAACGCCGAC
CAATACGAAGACGAACTGAAACAAGATCAGAAGCTGATCGATAATTTAAAATTCT
TCTTAGATGCAATCCTGGAGCTGCTGCACTTCATCAAACCGCTTCATTTAAAGAG
CGAGTCCATTACCGAAAAGGACACCGCCTTCTATGACGTTTTTGAAAATTATTAT
GAAGCCCTCTCCTTGCTGACTCCGCTGTATAATATGGTACGCAATTACGTAACCC
AGAAACCATATTCTACCGAAAAAATTAAACTGAACTTTGAAAACGCACAGCTGCT
CAACGGTTGGGACGCGAATAAAGAAGGTGACTACCTCACCACCATCCTGAAAAAA
GATGGTAACTATTTTCTGGCAATTATGGATAAGAAACATAATAAAGCATTCCAGA
AATTTCCTGAAGGGAAAGAAAATTACGAAAAGATGGTGTACAAACTCTTACCTGG
AGTTAACAAAATGTTGCCGAAAGTATTTTTTAGTAATAAGAACATCGCGTACTTT
AACCCGTCCAAAGAACTGCTGGAAAATTATAAAAAGGAGACGCATAAGAAAGGGG
ATACCTTTAACCTGGAACATTGCCATACCTTAATAGACTTCTTCAAGGATTCCCT
GAATAAACACGAGGATTGGAAATATTTCGATTTTCAGTTTAGTGAGACCAAGTCA
TACCAGGATCTTAGCGGCTTTTATCGCGAAGTAGAACACCAAGGCTATAAAATTA
ACTTCAAAAACATCGACAGCGAATACATCGACGGTTTAGTTAACGAGGGCAAACT
GTTTCTGTTCCAGATCTATTCAAAGGATTTTAGCCCGTTCTCTAAAGGCAAACCA
AATATGCATACGTTGTACTGGAAAGCACTGTTTGAAGAGCAAAACCTGCAGAATG
TGATTTATAAACTGAACGGCCAAGCTGAGATTTTTTTCCGTAAAGCCTCGATTAA
ACCGAAAAATATCATCCTTCATAAGAAGAAAATAAAGATCGCTAAAAAACACTTC
ATAGATAAAAAAACCAAAACCTCCGAAATAGTGCCTGTTCAAACAATTAAGAACT
TGAATATGTACTACCAGGGCAAGATATCGGAAAAGGAGTTGACTCAAGACGATCT
TCGCTATATCGATAACTTTTCGATTTTTAACGAAAAAAACAAGACGATCGACATC
ATCAAAGATAAACGCTTCACTGTAGATAAGTTCCAGTTTCATGTGCCGATTACTA
TGAACTTCAAAGCTACCGGGGGTAGCTATATCAACCAAACGGTGTTGGAATACCT
GCAGAATAACCCGGAAGTCAAAATCATTGGGCTGGACCGCGGAGAACGTCACCTT
GTGTACTTGACCTTAATCGATCAGCAAGGCAACATCTTAAAACAAGAATCGCTGA
ATACCATTACGGATTCAAAGATTAGCACCCCGTATCATAAGCTGCTCGATAACAA
GGAGAATGAGCGCGACCTGGCCCGTAAAAACTGGGGCACGGTGGAAAACATTAAG
GAGTTAAAGGAGGGTTATATTTCCCAGGTAGTGCATAAGATCGCCACTCTCATGC
TCGAGGAAAATGCGATCGTTGTCATGGAAGACTTAAACTTCGGATTTAAACGTGG
GCGATTTAAAGTAGAGAAACAAATCTACCAGAAGTTAGAAAAAATGCTGATTGAC
AAATTAAATTACTTGGTCCTAAAAGACAAACAGCCGCAAGAATTGGGTGGATTAT
ACAACGCCCTCCAACTTACCAATAAATTCGAAAGTTTTCAGAAAATGGGTAAACA
GTCAGGCTTTCTTTTTTATGTTCCTGCGTGGAACACATCCAAAATCGACCCTACA
ACCGGCTTCGTCAATTACTTCTATACTAAATATGAAAACGTCGACAAAGCAAAAG
CATTCTTTGAAAAGTTCGAAGCAATACGTTTTAACGCTGAGAAAAAATATTTCGA
GTTCGAAGTCAAGAAATACTCAGACTTTAACCCCAAAGCTGAGGGCACACAGCAA
GCGTGGACAATCTGCACCTACGGCGAGCGCATCGAAACGAAGCGTCAAAAAGATC
AGAATAACAAATTTGTTTCAACACCTATCAACCTGACCGAGAAGATTGAAGACTT
CTTAGGTAAAAATCAGATTGTTTATGGCGACGGTAACTGTATAAAATCTCAAATA
GCCTCAAAGGATGATAAAGCATTTTTCGAAACATTATTATATTGGTTCAAAATGA
CACTGCAGATGCGCAATAGTGAGACGCGTACAGATATTGATTATCTTATCAGCCC
GGTCATGAACGACAACGGTACTTTTTACAACTCCAGAGACTATGAAAAACTTGAG
AATCCAACTCTCCCCAAAGATGCTGATGCGAACGGTGCTTATCACATCGCGAAAA
AAGGTCTGATGCTGCTGAACAAAATCGACCAAGCCGATCTGACTAAGAAAGTTGA
CCTAAGCATTTCAAATCGGGACTGGTTACAGTTTGTTCAAAAGAACAAATGA
SEQ ATGGAACAGGAATATTATCTGGGCTTGGACATGGGCACCGGTTCCGTCGGCTGGG
ID CTGTTACTGACAGTGAATATCACGTTCTAAGAAAGCATGGTAAGGCATTGTGGGG
NO: TGTAAGACTTTTCGAATCTGCTTCCACTGCTGAAGAGCGTAGAATGTTTAGAACG
49 AGTCGACGTAGGCTAGACAGGCGCAATTGGAGAATCGAAATTTTACAAGAAATTT
TTGCGGAAGAGATATCTAAGAAAGACCCAGGCTTTTTCCTGAGAATGAAGGAATC
TAAGTATTACCCTGAGGATAAAAGAGATATAAATGGTAACTGTCCCGAATTGCCT
TACGCATTATTTGTGGACGATGATTTTACCGATAAGGATTACCATAAAAAGTTCC
CAACTATCTACCATTTACGCAAAATGTTAATGAATACAGAGGAAACCCCAGACAT
AAGACTAGTTTATCTGGCAATACACCATATGATGAAACATAGAGGCCATTTCTTA
CTTTCCGGGGATATCAACGAAATCAAAGAGTTTGGTACCACATTTAGTAAGTTAC
TGGAAAACATAAAGAATGAAGAATTGGATTGGAACTTAGAACTCGGAAAAGAAGA
ATACGCGGTTGTCGAATCTATCCTGAAGGATAATATGCTGAATAGGTCGACCAAA
AAAACTAGGCTGATCAAAGCACTGAAAGCCAAATCTATCTGCGAAAAAGCTGTTT
TAAATTTACTTGCTGGTGGCACTGTTAAGTTATCAGACATTTTTGGTTTGGAAGA
ATTGAACGAAACCGAGCGTCCAAAAATTAGTTTCGCTGATAATGGCTACGATGAT
TACATTGGTGAGGTGGAAAACGAGTTGGGCGAACAATTTTATATTATAGAGACAG
CTAAGGCAGTCTATGACTGGGCTGTTTTAGTAGAAATCCTTGGTAAATACACATC
TATCTCCGAAGCGAAAGTTGCTACTTACGAAAAGCACAAGTCCGATCTCCAGTTT
TTGAAGAAAATTGTCAGGAAATATCTGACTAAGGAAGAATATAAAGATATTTTCG
TTAGTACCTCTGACAAACTGAAAAATTACTCCGCTTACATCGGGATGACCAAGAT
TAATGGCAAAAAAGTTGATCTGCAAAGCAAAAGGTGTTCGAAGGAAGAATTTTAT
GATTTCATTAAAAAGAATGTCTTAAAAAAATTAGAAGGTCAGCCAGAATACGAAT
ATTTGAAAGAAGAACTGGAAAGAGAGACATTCTTACCAAAACAAGTCAACAGAGA
TAATGGGGTAATTCCATATCAAATTCACCTCTACGAATTAAAAAAAATTTTAGGC
AATTTACGCGATAAAATTGACCTTATCAAAGAAAATGAGGATAAGCTGGTTCAAC
TCTTTGAATTCAGAATACCCTATTATGTGGGCCCACTGAACAAGATTGATGACGG
CAAAGAAGGTAAATTCACATGGGCCGTCCGCAAATCCAATGAAAAAATTTACCCA
TGGAACTTTGAAAATGTAGTAGATATTGAAGCGTCTGCGGAGAAATTTATTCGAA
GAATGACTAATAAATGCACTTACTTGATGGGAGAGGATGTTCTGCCTAAAGACAG
CTTATTATACAGCAAGTACATGGTTCTAAACGAACTTAACAACGTTAAGTTGGAC
GGTGAGAAATTAAGTGTAGAATTGAAACAAAGATTGTATACTGACGTCTTCTGCA
AGTACAGAAAAGTGACAGTTAAAAAAATTAAGAATTACTTGAAGTGCGAAGGTAT
AATTTCTGGAAACGTAGAGATTACTGGTATTGATGGTGATTTCAAAGCATCCCTA
ACAGCTTACCACGATTTCAAGGAAATCCTGACAGGAACTGAACTCGCAAAAAAAG
ATAAAGAAAACATTATTACTAATATTGTTCTTTTCGGTGATGACAAGAAATTGTT
GAAGAAAAGACTGAATAGACTTTACCCCCAGATTACTCCCAATCAACTTAAGAAA
ATTTGTGCTTTGTCTTACACAGGATGGGGTCGTTTTTCAAAAAAGTTCTTAGAAG
AGATTACCGCACCTGATCCAGAAACAGGCGAAGTATGGAATATAATTACCGCCTT
ATGGGAATCGAACAATAATCTTATGCAACTTCTGAGCAATGAATATCGTTTCATG
GAAGAAGTTGAGACTTACAACATGGGCAAACAGACGAAGACTTTATCCTATGAAA
CTGTGGAAAATATGTATGTATCACCTTCTGTCAAGAGACAAATTTGGCAAACCTT
AAAAATTGTCAAAGAATTAGAAAAGGTAATGAAGGAGTCTCCTAAACGTGTGTTT
ATTGAAATGGCTAGAGAAAAACAAGAGTCAAAAAGAACCGAGTCAAGAAAGAAGC
AGTTAATCGATTTATATAAGGCTTGTAAAAACGAAGAGAAAGATTGGGTTAAAGA
ATTGGGGGACCAAGAGGAACAAAAACTACGGTCGGATAAGTTGTATTTATACTAT
ACGCAAAAGGGACGATGTATGTATTCCGGCGAGGTAATAGAATTGAAGGATTTAT
GGGACAATACAAAATATGACATAGACCATATATATCCCCAATCAAAAACGATGGA
CGATAGCTTGAACAATAGAGTACTCGTGAAAAAAAAATATAATGCGACCAAATCT
GATAAGTATCCTCTGAATGAAAATATCAGACATGAAAGAAAGGGGTTCTGGAAGT
CCTTGTTAGATGGTGGGTTTATAAGCAAAGAAAAGTACGAGCGTCTAATAAGAAA
CACGGAGTTATCGCCAGAAGAACTCGCTGGTTTTATTGAGAGGCAAATCGTGGAA
ACGAGACAATCTACCAAAGCCGTTGCTGAGATCCTAAAGCAAGTTTTCCCAGAGT
CGGAGATTGTCTATGTCAAAGCTGGCACAGTGAGCAGGTTTAGGAAAGACTTCGA
ACTATTAAAGGTAAGAGAAGTGAACGATTTACATCACGCAAAGGACGCTTACCTA
AATATCGTTGTAGGTAACTCATATTATGTTAAATTTACCAAGAACGCCTCTTGGT
TTATAAAGGAGAACCCAGGTAGAACATATAACCTGAAAAAGATGTTCACCTCTGG
TTGGAATATTGAGAGAAACGGAGAAGTCGCATGGGAAGTTGGTAAGAAAGGGACT
ATAGTGACAGTAAAGCAAATTATGAACAAAAATAATATCCTCGTTACAAGGCAGG
TTCATGAAGCAAAGGGCGGCCTTTTTGACCAACAAATTATGAAGAAAGGGAAAGG
TCAAATTGCAATAAAAGAAACCGATGAGAGACTAGCGTCAATAGAAAAGTATGGT
GGCTATAATAAAGCTGCGGGTGCATACTTTATGCTTGTTGAATCAAAAGACAAGA
AAGGTAAGACTATTAGAACTATAGAATTTATACCCCTGTACCTTAAAAACAAAAT
TGAATCGGATGAGTCAATCGCGTTAAATTTTCTAGAGAAAGGAAGGGGTTTAAAA
GAACCAAAGATCCTGTTAAAAAAGATTAAGATTGACACCTTGTTCGATGTAGATG
GATTTAAAATGTGGTTATCTGGCAGAACAGGCGATAGACTTTTGTTTAAGTGCGC
TAATCAATTAATTTTGGATGAGAAAATCATTGTCACAATGAAAAAAATAGTTAAG
TTTATTCAGAGAAGACAAGAAAACAGGGAGTTGAAATTATCTGATAAAGATGGTA
TCGACAATGAAGTTTTAATGGAAATCTACAATACATTCGTTGATAAACTTGAAAA
TACCGTATATCGAATCAGGTTAAGTGAACAAGCCAAAACATTAATTGATAAACAA
AAAGAATTTGAAAGGCTATCACTGGAAGACAAATCCTCCACCCTATTTGAAATTT
TGCATATATTCCAGTGCCAATCTTCAGCAGCTAATTTAAAAATGATTGGCGGACC
TGGGAAAGCCGGCATCCTAGTGATGAACAATAATATCTCCAAGTGTAACAAAATA
TCAATTATTAACCAATCTCCGACAGGTATTTTTGAAAATGAAATAGACTTGCTTA
AGATATAA
SEQ ATGTCTTTCGACTCTTTCACCAACCTGTACTCTCTGTCTAAAACCCTGAAATTCG
ID AAATGCGTCCGGTTGGTAACACCCAGAAAATGCTGGACAACGCGGGTGTTTTCGA
NO: AAAAGACAAACTGATCCAGAAAAAATACGGTAAAACCAAACCGTACTTCGACCGT
50 CTGCACCGTGAATTCATCGAAGAAGCGCTGACCGGTGTTGAACTGATCGGTCTGG
ACGAAAACTTCCGTACCCTGGTTGACTGGCAGAAAGACAAAAAAAACAACGTTGC
GATGAAAGCGTACGAAAACTCTCTGCAGCGTCTGCGTACCGAAATCGGTAAAATC
TTCAACCTGAAAGCGGAAGACTGGGTTAAAAACAAATACCCGATCCTGGGTCTGA
AAAACAAAAACACCGACATCCTGTTCGAAGAAGCGGTTTTCGGTATCCTGAAAGC
GCGTTACGGTGAAGAAAAAGACACCTTCATCGAAGTTGAAGAAATCGACAAAACC
GGTAAATCTAAAATCAACCAGATCTCTATCTTCGACTCTTGGAAAGGTTTCACCG
GTTACTTCAAAAAATTCTTCGAAACCCGTAAAAACTTCTACAAAAACGACGGTAC
CTCTACCGCGATCGCGACCCGTATCATCGACCAGAACCTGAAACGTTTCATCGAC
AACCTGTCTATCGTTGAATCTGTTCGTCAGAAAGTTGACCTGGCGGAAACCGAAA
AATCTTTCTCTATCTCTCTGTCTCAGTTCTTCTCTATCGACTTCTACAACAAATG
CCTGCTGCAGGACGGTATCGACTACTACAACAAAATCATCGGTGGTGAAACCCTG
AAAAACGGTGAAAAACTGATCGGTCTGAACGAACTGATCAACCAGTACCGTCAGA
ACAACAAAGACCAGAAAATCCCGTTCTTCAAACTGCTGGACAAACAGATCCTGTC
TGAAAAAATCCTGTTCCTGGACGAAATCAAAAACGACACCGAACTGATCGAAGCG
CTGTCTCAGTTCGCGAAAACCGCGGAAGAAAAAACCAAAATCGTTAAAAAACTGT
TCGCGGACTTCGTTGAAAACAACTCTAAATACGACCTGGCGCAGATCTACATCTC
TCAGGAAGCGTTCAACACCATCTCTAACAAATGGACCTCTGAAACCGAAACCTTC
GCGAAATACCTGTTCGAAGCGATGAAATCTGGTAAACTGGCGAAATACGAAAAAA
AAGACAACTCTTACAAATTCCCGGACTTCATCGCGCTGTCTCAGATGAAATCTGC
GCTGCTGTCTATCTCTCTGGAAGGTCACTTCTGGAAAGAAAAATACTACAAAATC
TCTAAATTCCAGGAAAAAACCAACTGGGAACAGTTCCTGGCGATCTTCCTGTACG
AATTCAACTCTCTGTTCTCTGACAAAATCAACACCAAAGACGGTGAAACCAAACA
GGTTGGTTACTACCTGTTCGCGAAAGACCTGCACAACCTGATCCTGTCTGAACAG
ATCGACATCCCGAAAGACTCTAAAGTTACCATCAAAGACTTCGCGGACTCTGTTC
TGACCATCTACCAGATGGCGAAATACTTCGCGGTTGAAAAAAAACGTGCGTGGCT
GGCGGAATACGAACTGGACTCTTTCTACACCCAGCCGGACACCGGTTACCTGCAG
TTCTACGACAACGCGTACGAAGACATCGTTCAGGTTTACAACAAACTGCGTAACT
ACCTGACCAAAAAACCGTACTCTGAAGAAAAATGGAAACTGAACTTCGAAAACTC
TACCCTGGCGAACGGTTGGGACAAAAACAAAGAATCTGACAACTCTGCGGTTATC
CTGCAGAAAGGTGGTAAATACTACCTGGGTCTGATCACCAAAGGTCACAACAAAA
TCTTCGACGACCGTTTCCAGGAAAAATTCATCGTTGGTATCGAAGGTGGTAAATA
CGAAAAAATCGTTTACAAATTCTTCCCGGACCAGGCGAAAATGTTCCCGAAAGTT
TGCTTCTCTGCGAAAGGTCTGGAATTCTTCCGTCCGTCTGAAGAAATCCTGCGTA
TCTACAACAACGCGGAATTCAAAAAAGGTGAAACCTACTCTATCGACTCTATGCA
GAAACTGATCGACTTCTACAAAGACTGCCTGACCAAATACGAAGGTTGGGCGTGC
TACACCTTCCGTCACCTGAAACCGACCGAAGAATACCAGAACAACATCGGTGAAT
TCTTCCGTGACGTTGCGGAAGACGGTTACCGTATCGACTTCCAGGGTATCTCTGA
CCAGTACATCCACGAAAAAAACGAAAAAGGTGAACTGCACCTGTTCGAAATCCAC
AACAAAGACTGGAACCTGGACAAAGCGCGTGACGGTAAATCTAAAACCACCCAGA
AAAACCTGCACACCCTGTACTTCGAATCTCTGTTCTCTAACGACAACGTTGTTCA
GAACTTCCCGATCAAACTGAACGGTCAGGCGGAAATCTTCTACCGTCCGAAAACC
GAAAAAGACAAACTGGAATCTAAAAAAGACAAAAAAGGTAACAAAGTTATCGACC
ACAAACGTTACTCTGAAAACAAAATCTTCTTCCACGTTCCGCTGACCCTGAACCG
TACCAAAAACGACTCTTACCGTTTCAACGCGCAGATCAACAACTTCCTGGCGAAC
AACAAAGACATCAACATCATCGGTGTTGACCGTGGTGAAAAACACCTGGTTTACT
ACTCTGTTATCACCCAGGCGTCTGACATCCTGGAATCTGGTTCTCTGAACGAACT
GAACGGTGTTAACTACGCGGAAAAACTGGGTAAAAAAGCGGAAAACCGTGAACAG
GCGCGTCGTGACTGGCAGGACGTTCAGGGTATCAAAGACCTGAAAAAAGGTTACA
TCTCTCAGGTTGTTCGTAAACTGGCGGACCTGGCGATCAAACACAACGCGATCAT
CATCCTGGAAGACCTGAACATGCGTTTCAAACAGGTTCGTGGTGGTATCGAAAAA
TCTATCTACCAGCAGCTGGAAAAAGCGCTGATCGACAAACTGTCTTTCCTGGTTG
ACAAAGGTGAAAAAAACCCGGAACAGGCGGGTCACCTGCTGAAAGCGTACCAGCT
GTCTGCGCCGTTCGAAACCTTCCAGAAAATGGGTAAACAGACCGGTATCATCTTC
TACACCCAGGCGTCTTACACCTCTAAATCTGACCCGGTTACCGGTTGGCGTCCGC
ACCTGTACCTGAAATACTTCTCTGCGAAAAAAGCGAAAGACGACATCGCGAAATT
CACCAAAATCGAATTCGTTAACGACCGTTTCGAACTGACCTACGACATCAAAGAC
TTCCAGCAGGCGAAAGAATACCCGAACAAAACCGTTTGGAAAGTTTGCTCTAACG
TTGAACGTTTCCGTTGGGACAAAAACCTGAACCAGAACAAAGGTGGTTACACCCA
CTACACCAACATCACCGAAAACATCCAGGAACTGTTCACCAAATACGGTATCGAC
ATCACCAAAGACCTGCTGACCCAGATCTCTACCATCGACGAAAAACAGAACACCT
CTTTCTTCCGTGACTTCATCTTCTACTTCAACCTGATCTGCCAGATCCGTAACAC
CGACGACTCTGAAATCGCGAAAAAAAACGGTAAAGACGACTTCATCCTGTCTCCG
GTTGAACCGTTCTTCGACTCTCGTAAAGACAACGGTAACAAACTGCCGGAAAACG
GTGACGACAACGGTGCGTACAACATCGCGCGTAAAGGTATCGTTATCCTGAACAA
AATCTCTCAGTACTCTGAAAAAAACGAAAACTGCGAAAAAATGAAATGGGGTGAC
CTGTACGTTTCTAACATCGACTGGGACAACTTCGTT
SEQ ATGGAAAACTTTAAAAACTTATACCCAATAAACAAAACGTTACGTTTTGAACTGC
ID GTCCATATGGTAAAACACTGGAAAACTTTAAAAAAAGCGGTTTGTTGGAGAAGGA
NO: TGCATTTAAAGCGAACTCTCGCAGATCCATGCAGGCCATCATTGATGAAAAATTT
51 AAAGAGACGATCGAAGAACGTCTGAAATACACGGAATTTAGTGAGTGTGACTTAG
GTAATATGACTTCTAAAGATAAGAAAATCACCGATAAGGCGGCGACCAACCTGAA
GAAGCAAGTCATTTTATCTTTTGATGATGAAATCTTTAACAACTATTTGAAACCG
GACAAAAACATCGATGCCTTATTTAAAAATGACCCTTCGAACCCGGTGATTAGCA
CATTTAAGGGCTTCACAACGTATTTTGTCAATTTTTTTGAAATTCGTAAACATAT
CTTCAAAGGAGAATCAAGCGGCTCTATGGCTTATCGCATTATTGATGAAAACCTG
ACGACCTATTTGAATAACATTGAAAAAATCAAAAAACTGCCAGAGGAATTAAAGT
CTCAGTTAGAAGGCATCGACCAGATCGACAAACTCAACAACTATAACGAATTTAT
TACGCAGTCTGGTATCACCCACTATAATGAAATTATTGGAGGTATCAGTAAATCA
GAAAATGTGAAAATCCAAGGGATTAATGAAGGCATTAACCTCTATTGCCAGAAAA
ATAAAGTGAAACTGCCGAGGCTGACTCCACTCTACAAAATGATCCTGTCTGACCG
CGTCTCGAATAGCTTTGTCCTGGACACAATTGAAAACGATACGGAATTGATTGAG
ATGATAAGCGATCTGATTAACAAAACCGAAATTTCACAGGATGTAATCATGAGTG
ATATACAAAACATCTTTATTAAATATAAACAGCTTGGTAATCTGCCTGGAATTAG
CTATTCGTCAATAGTGAACGCAATCTGTTCTGATTATGATAACAATTTTGGCGAC
GGTAAGCGTAAAAAGAGTTATGAAAACGATAGGAAAAAACACCTGGAAACTAACG
TGTATTCTATCAACTATATCAGCGAACTGCTTACGGACACCGATGTGAGTTCAAA
CATTAAGATGCGGTATAAGGAGCTTGAACAGAACTACCAGGTCTGTAAGGAAAAC
TTCAACGCAACCAACTGGATGAACATTAAAAATATCAAACAATCCGAGAAGACCA
ACTTAATCAAAGATCTGCTGGATATTTTGAAGAGCATTCAACGTTTTTATGATCT
GTTCGATATCGTTGATGAAGACAAGAATCCTAGTGCGGAATTTTATACATGGCTG
TCTAAAAATGCGGAGAAATTGGATTTCGAATTCAATTCTGTTTATAATAAATCAC
GCAACTATTTGACCCGCAAACAATACAGCGACAAAAAGATAAAACTAAACTTCGA
CAGTCCGACATTGGCAAAGGGCTGGGACGCAAATAAGGAAATCGATAACTCTACG
ATAATTATGCGTAAGTTCAATAATGATCGAGGTGATTATGATTATTTCTTAGGCA
TTTGGAACAAAAGCACCCCGGCCAACGAAAAGATAATTCCACTGGAGGATAACGG
TCTGTTCGAAAAAATGCAGTACAAATTATATCCGGATCCAAGCAAGATGCTTCCA
AAGCAGTTTCTGTCTAAAATTTGGAAAGCTAAGCATCCGACCACCCCAGAATTTG
ACAAGAAATATAAGGAAGGCCGCCATAAGAAAGGTCCCGATTTTGAAAAAGAATT
CTTGCACGAACTGATTGATTGCTTTAAACATGGCTTAGTCAATCACGATGAAAAG
TATCAAGATGTTTTTGGATTCAATTTGAGAAACACAGAAGACTACAATTCCTACA
CTGAGTTTCTCGAAGATGTGGAACGATGTAATTATAATCTGAGCTTTAACAAAAT
CGCGGACACCTCGAATCTGATTAACGATGGTAAACTTTATGTTTTCCAGATCTGG
AGCAAGGATTTCTCTATTGACAGCAAAGGCACCAAAAACCTGAACACCATTTACT
TTGAAAGTCTCTTCAGCGAAGAAAATATGATTGAGAAAATGTTTAAACTTAGCGG
TGAAGCTGAAATATTCTATCGCCCGGCAAGCCTGAACTATTGCGAAGACATTATC
AAAAAGGGTCATCACCACGCTGAACTGAAAGATAAATTTGATTATCCTATCATAA
AAGATAAACGCTATAGCCAGGATAAATTTTTTTTTCATGTTCCTATGGTCATTAA
CTACAAATCAGAAAAACTGAACTCTAAAAGCCTCAATAATCGAACCAATGAAAAC
CTTGGGCAGTTTACCCATATAATTGGAATTGATCGCGGAGAGCGTCATTTAATCT
ACCTGACCGTAGTCGATGTATCGACCGGCGAGATCGTCGAGCAGAAGCACTTAGA
CGAGATTATCAACACTGATACCAAAGGTGTTGAGCATAAGACGCACTATCTAAAC
AAGCTGGAGGAAAAATCGAAAACCCGTGATAATGAACGTAAGAGTTGGGAGGCAA
TTGAAACGATTAAAGAACTGAAGGAGGGTTATATCAGCCACGTAATCAATGAAAT
TCAAAAACTGCAGGAAAAATACAACGCCCTGATCGTTATGGAAAATCTGAATTAC
GGTTTCAAAAATTCTCGCATCAAAGTGGAAAAACAGGTATATCAGAAGTTCGAGA
CGGCATTAATTAAAAAGTTTAATTACATCATTGACAAAAAAGATCCGGAAACTTA
TATTCATGGCTATCAGCTGACGAACCCGATCACCACACTGGATAAAATTGGTAAC
CAGTCTGGTATCGTGCTTTACATCCCTGCCTGGAATACCAGTAAAATCGATCCGG
TAACGGGATTCGTCAACCTTCTATATGCAGATGACCTCAAATATAAGAATCAGGA
ACAGGCCAAGTCTTTTATTCAGAAAATCGATAACATTTACTTTGAGAATGGGGAA
TTCAAATTTGATATTGATTTTTCTAAATGGAACAATCGTTATAGTATATCTAAGA
CGAAATGGACGCTCACCTCGTACGGAACCCGAATCCAGACATTCCGCAATCCGCA
GAAGAACAATAAATGGGACAGCGCCGAGTATGATCTCACTGAAGAATTCAAATTG
ATTCTGAACATTGACGGTACCCTGAAAAGCCAGGATGTCGAAACCTATAAAAAAT
TTATGTCTCTGTTCAAGCTGATGCTGCAACTTAGGAACTCTGTTACCGGCACTGA
TATCGATTATATGATCTCCCCTGTCACTGATAAAACAGGTACGCATTTCGATTCG
CGCGAAAATATCAAAAATCTGCCCGCAGATGCCGACGCCAATGGGGCGTACAATA
TTGCACGCAAGGGTATCATGGCGATCGAAAACATTATGAATGGTATCAGCGACCC
GCTGAAAATCTCAAACGAAGATTATTTGAAATATATCCAAAACCAGCAGGAATAA
SEQ ATGACCCAGTTCGAAGGTTTCACCAACCTGTACCAGGTTTCTAAAACCCTGCGTT
ID TCGAACTGATCCCGCAGGGTAAAACCCTGAAACACATCCAGGAACAGGGTTTCAT
NO: CGAAGAAGACAAAGCGCGTAACGACCACTACAAAGAACTGAAACCGATCATCGAC
52 CGTATCTACAAAACCTACGCGGACCAGTGCCTGCAGCTGGTTCAGCTGGACTGGG
AAAACCTGTCTGCGGCGATCGACTCTTACCGTAAAGAAAAAACCGAAGAAACCCG
TAACGCGCTGATCGAAGAACAGGCGACCTACCGTAACGCGATCCACGACTACTTC
ATCGGTCGTACCGACAACCTGACCGACGCGATCAACAAACGTCACGCGGAAATCT
ACAAAGGTCTGTTCAAAGCGGAACTGTTCAACGGTAAAGTTCTGAAACAGCTGGG
TACCGTTACCACCACCGAACACGAAAACGCGCTGCTGCGTTCTTTCGACAAATTC
ACCACCTACTTCTCTGGTTTCTACGAAAACCGTAAAAACGTTTTCTCTGCGGAAG
ACATCTCTACCGCGATCCCGCACCGTATCGTTCAGGACAACTTCCCGAAATTCAA
AGAAAACTGCCACATCTTCACCCGTCTGATCACCGCGGTTCCGTCTCTGCGTGAA
CACTTCGAAAACGTTAAAAAAGCGATCGGTATCTTCGTTTCTACCTCTATCGAAG
AAGTTTTCTCTTTCCCGTTCTACAACCAGCTGCTGACCCAGACCCAGATCGACCT
GTACAACCAGCTGCTGGGTGGTATCTCTCGTGAAGCGGGTACCGAAAAAATCAAA
GGTCTGAACGAAGTTCTGAACCTGGCGATCCAGAAAAACGACGAAACCGCGCACA
TCATCGCGTCTCTGCCGCACCGTTTCATCCCGCTGTTCAAACAGATCCTGTCTGA
CCGTAACACCCTGTCTTTCATCCTGGAAGAATTCAAATCTGACGAAGAAGTTATC
CAGTCTTTCTGCAAATACAAAACCCTGCTGCGTAACGAAAACGTTCTGGAAACCG
CGGAAGCGCTGTTCAACGAACTGAACTCTATCGACCTGACCCACATCTTCATCTC
TCACAAAAAACTGGAAACCATCTCTTCTGCGCTGTGCGACCACTGGGACACCCTG
CGTAACGCGCTGTACGAACGTCGTATCTCTGAACTGACCGGTAAAATCACCAAAT
CTGCGAAAGAAAAAGTTCAGCGTTCTCTGAAACACGAAGACATCAACCTGCAGGA
AATCATCTCTGCGGCGGGTAAAGAACTGTCTGAAGCGTTCAAACAGAAAACCTCT
GAAATCCTGTCTCACGCGCACGCGGCGCTGGACCAGCCGCTGCCGACCACCCTGA
AAAAACAGGAAGAAAAAGAAATCCTGAAATCTCAGCTGGACTCTCTGCTGGGTCT
GTACCACCTGCTGGACTGGTTCGCGGTTGACGAATCTAACGAAGTTGACCCGGAA
TTCTCTGCGCGTCTGACCGGTATCAAACTGGAAATGGAACCGTCTCTGTCTTTCT
ACAACAAAGCGCGTAACTACGCGACCAAAAAACCGTACTCTGTTGAAAAATTCAA
ACTGAACTTCCAGATGCCGACCCTGGCGTCTGGTTGGGACGTTAACAAAGAAAAA
AACAACGGTGCGATCCTGTTCGTTAAAAACGGTCTGTACTACCTGGGTATCATGC
CGAAACAGAAAGGTCGTTACAAAGCGCTGTCTTTCGAACCGACCGAAAAAACCTC
TGAAGGTTTCGACAAAATGTACTACGACTACTTCCCGGACGCGGCGAAAATGATC
CCGAAATGCTCTACCCAGCTGAAAGCGGTTACCGCGCACTTCCAGACCCACACCA
CCCCGATCCTGCTGTCTAACAACTTCATCGAACCGCTGGAAATCACCAAAGAAAT
CTACGACCTGAACAACCCGGAAAAAGAACCGAAAAAATTCCAGACCGCGTACGCG
AAAAAAACCGGTGACCAGAAAGGTTACCGTGAAGCGCTGTGCAAATGGATCGACT
TCACCCGTGACTTCCTGTCTAAATACACCAAAACCACCTCTATCGACCTGTCTTC
TCTGCGTCCGTCTTCTCAGTACAAAGACCTGGGTGAATACTACGCGGAACTGAAC
CCGCTGCTGTACCACATCTCTTTCCAGCGTATCGCGGAAAAAGAAATCATGGACG
CGGTTGAAACCGGTAAACTGTACCTGTTCCAGATCTACAACAAAGACTTCGCGAA
AGGTCACCACGGTAAACCGAACCTGCACACCCTGTACTGGACCGGTCTGTTCTCT
CCGGAAAACCTGGCGAAAACCTCTATCAAACTGAACGGTCAGGCGGAACTGTTCT
ACCGTCCGAAATCTCGTATGAAACGTATGGCGCACCGTCTGGGTGAAAAAATGCT
GAACAAAAAACTGAAAGACCAGAAAACCCCGATCCCGGACACCCTGTACCAGGAA
CTGTACGACTACGTTAACCACCGTCTGTCTCACGACCTGTCTGACGAAGCGCGTG
CGCTGCTGCCGAACGTTATCACCAAAGAAGTTTCTCACGAAATCATCAAAGACCG
TCGTTTCACCTCTGACAAATTCTTCTTCCACGTTCCGATCACCCTGAACTACCAG
GCGGCGAACTCTCCGTCTAAATTCAACCAGCGTGTTAACGCGTACCTGAAAGAAC
ACCCGGAAACCCCGATCATCGGTATCGACCGTGGTGAACGTAACCTGATCTACAT
CACCGTTATCGACTCTACCGGTAAAATCCTGGAACAGCGTTCTCTGAACACCATC
CAGCAGTTCGACTACCAGAAAAAACTGGACAACCGTGAAAAAGAACGTGTTGCGG
CGCGTCAGGCGTGGTCTGTTGTTGGTACCATCAAAGACCTGAAACAGGGTTACCT
GTCTCAGGTTATCCACGAAATCGTTGACCTGATGATCCACTACCAGGCGGTTGTT
GTTCTGGAAAACCTGAACTTCGGTTTCAAATCTAAACGTACCGGTATCGCGGAAA
AAGCGGTTTACCAGCAGTTCGAAAAAATGCTGATCGACAAACTGAACTGCCTGGT
TCTGAAAGACTACCCGGCGGAAAAAGTTGGTGGTGTTCTGAACCCGTACCAGCTG
ACCGACCAGTTCACCTCTTTCGCGAAAATGGGTACCCAGTCTGGTTTCCTGTTCT
ACGTTCCGGCGCCGTACACCTCTAAAATCGACCCGCTGACCGGTTTCGTTGACCC
GTTCGTTTGGAAAACCATCAAAAACCACGAATCTCGTAAACACTTCCTGGAAGGT
TTCGACTTCCTGCACTACGACGTTAAAACCGGTGACTTCATCCTGCACTTCAAAA
TGAACCGTAACCTGTCTTTCCAGCGTGGTCTGCCGGGTTTCATGCCGGCGTGGGA
CATCGTTTTCGAAAAAAACGAAACCCAGTTCGACGCGAAAGGTACCCCGTTCATC
GCGGGTAAACGTATCGTTCCGGTTATCGAAAACCACCGTTTCACCGGTCGTTACC
GTGACCTGTACCCGGCGAACGAACTGATCGCGCTGCTGGAAGAAAAAGGTATCGT
TTTCCGTGACGGTTCTAACATCCTGCCGAAACTGCTGGAAAACGACGACTCTCAC
GCGATCGACACCATGGTTGCGCTGATCCGTTCTGTTCTGCAGATGCGTAACTCTA
ACGCGGCGACCGGTGAAGACTACATCAACTCTCCGGTTCGTGACCTGAACGGTGT
TTGCTTCGACTCTCGTTTCCAGAACCCGGAATGGCCGATGGACGCGGACGCGAAC
GGTGCGTACCACATCGCGCTGAAAGGTCAGCTGCTGCTGAACCACCTGAAAGAAT
CTAAAGACCTGAAACTGCAGAACGGTATCTCTAACCAGGACTGGCTGGCGTACAT
CCAGGAACTGCGTAACTA
SEQ ATGGCGGTTAAATCTATCAAAGTTAAACTGCGTCTGGACGACATGCCGGAAATCC
ID GTGCGGGTCTGTGGAAACTGCACAAAGAAGTTAACGCGGGTGTTCGTTACTACAC
NO: CGAATGGCTGTCTCTGCTGCGTCAGGAAAACCTGTACCGTCGTTCTCCGAACGGT
53 GACGGTGAACAGGAATGCGACAAAACCGCGGAAGAATGCAAAGCGGAACTGCTGG
AACGTCTGCGTGCGCGTCAGGTTGAAAACGGTCACCGTGGTCCGGCGGGTTCTGA
CGACGAACTGCTGCAGCTGGCGCGTCAGCTGTACGAACTGCTGGTTCCGCAGGCG
ATCGGTGCGAAAGGTGACGCGCAGCAGATCGCGCGTAAATTCCTGTCTCCGCTGG
CGGACAAAGACGCGGTTGGTGGTCTGGGTATCGCGAAAGCGGGTAACAAACCGCG
TTGGGTTCGTATGCGTGAAGCGGGTGAACCGGGTTGGGAAGAAGAAAAAGAAAAA
GCGGAAACCCGTAAATCTGCGGACCGTACCGCGGACGTTCTGCGTGCGCTGGCGG
ACTTCGGTCTGAAACCGCTGATGCGTGTTTACACCGACTCTGAAATGTCTTCTGT
TGAATGGAAACCGCTGCGTAAAGGTCAGGCGGTTCGTACCTGGGACCGTGACATG
TTCCAGCAGGCGATCGAACGTATGATGTCTTGGGAATCTTGGAACCAGCGTGTTG
GTCAGGAATACGCGAAACTGGTTGAACAGAAAAACCGTTTCGAACAGAAAAACTT
CGTTGGTCAGGAACACCTGGTTCACCTGGTTAACCAGCTGCAGCAGGACATGAAA
GAAGCGTCTCCGGGTCTGGAATCTAAAGAACAGACCGCGCACTACGTTACCGGTC
GTGCGCTGCGTGGTTCTGACAAAGTTTTCGAAAAATGGGGTAAACTGGCGCCGGA
CGCGCCGTTCGACCTGTACGACGCGGAAATCAAAAACGTTCAGCGTCGTAACACC
CGTCGTTTCGGTTCTCACGACCTGTTCGCGAAACTGGCGGAACCGGAATACCAGG
CGCTGTGGCGTGAAGACGCGTCTTTCCTGACCCGTTACGCGGTTTACAACTCTAT
CCTGCGTAAACTGAACCACGCGAAAATGTTCGCGACCTTCACCCTGCCGGACGCG
ACCGCGCACCCGATCTGGACCCGTTTCGACAAACTGGGTGGTAACCTGCACCAGT
ACACCTTCCTGTTCAACGAATTCGGTGAACGTCGTCACGCGATCCGTTTCCACAA
ACTGCTGAAAGTTGAAAACGGTGTTGCGCGTGAAGTTGACGACGTTACCGTTCCG
ATCTCTATGTCTGAACAGCTGGACAACCTGCTGCCGCGTGACCCGAACGAACCGA
TCGCGCTGTACTTCCGTGACTACGGTGCGGAACAGCACTTCACCGGTGAATTCGG
TGGTGCGAAAATCCAGTGCCGTCGTGACCAGCTGGCGCACATGCACCGTCGTCGT
GGTGCGCGTGACGTTTACCTGAACGTTTCTGTTCGTGTTCAGTCTCAGTCTGAAG
CGCGTGGTGAACGTCGTCCGCCGTACGCGGCGGTTTTCCGTCTGGTTGGTGACAA
CCACCGTGCGTTCGTTCACTTCGACAAACTGTCTGACTACCTGGCGGAACACCCG
GACGACGGTAAACTGGGTTCTGAAGGTCTGCTGTCTGGTCTGCGTGTTATGTCTG
TTGACCTGGGTCTGCGTACCTCTGCGTCTATCTCTGTTTTCCGTGTTGCGCGTAA
AGACGAACTGAAACCGAACTCTAAAGGTCGTGTTCCGTTCTTCTTCCCGATCAAA
GGTAACGACAACCTGGTTGCGGTTCACGAACGTTCTCAGCTGCTGAAACTGCCGG
GTGAAACCGAATCTAAAGACCTGCGTGCGATCCGTGAAGAACGTCAGCGTACCCT
GCGTCAGCTGCGTACCCAGCTGGCGTACCTGCGTCTGCTGGTTCGTTGCGGTTCT
GAAGACGTTGGTCGTCGTGAACGTTCTTGGGCGAAACTGATCGAACAGCCGGTTG
ACGCGGCGAACCACATGACCCCGGACTGGCGTGAAGCGTTCGAAAACGAACTGCA
GAAACTGAAATCTCTGCACGGTATCTGCTCTGACAAAGAATGGATGGACGCGGTT
TACGAATCTGTTCGTCGTGTTTGGCGTCACATGGGTAAACAGGTTCGTGACTGGC
GTAAAGACGTTCGTTCTGGTGAACGTCCGAAAATCCGTGGTTACGCGAAAGACGT
TGTTGGTGGTAACTCTATCGAACAGATCGAATACCTGGAACGTCAGTACAAATTC
CTGAAATCTTGGTCTTTCTTCGGTAAAGTTTCTGGTCAGGTTATCCGTGCGGAAA
AAGGTTCTCGTTTCGCGATCACCCTGCGTGAACACATCGACCACGCGAAAGAAGA
CCGTCTGAAAAAACTGGCGGACCGTATCATCATGGAAGCGCTGGGTTACGTTTAC
GCGCTGGACGAACGTGGTAAAGGTAAATGGGTTGCGAAATACCCGCCGTGCCAGC
TGATCCTGCTGGAAGAACTGTCTGAATACCAGTTCAACAACGACCGTCCGCCGTC
TGAAAACAACCAGCTGATGCAGTGGTCTCACCGTGGTGTTTTCCAGGAACTGATC
AACCAGGCGCAGGTTCACGACCTGCTGGTTGGTACCATGTACGCGGCGTTCTCTT
CTCGTTTCGACGCGCGTACCGGTGCGCCGGGTATCCGTTGCCGTCGTGTTCCGGC
GCGTTGCACCCAGGAACACAACCCGGAACCGTTCCCGTGGTGGCTGAACAAATTC
GTTGTTGAACACACCCTGGACGCGTGCCCGCTGCGTGCGGACGACCTGATCCCGA
CCGGTGAAGGTGAAATCTTCGTTTCTCCGTTCTCTGCGGAAGAAGGTGACTTCCA
CCAGATCCACGCGGACCTGAACGCGGCGCAGAACCTGCAGCAGCGTCTGTGGTCT
GACTTCGACATCTCTCAGATCCGTCTGCGTTGCGACTGGGGTGAAGTTGACGGTG
AACTGGTTCTGATCCCGCGTCTGACCGGTAAACGTACCGCGGACTCTTACTCTAA
CAAAGTTTTCTACACCAACACCGGTGTTACCTACTACGAACGTGAACGTGGTAAA
AAACGTCGTAAAGTTTTCGCGCAGGAAAAACTGTCTGAAGAAGAAGCGGAACTGC
TGGTTGAAGCGGACGAAGCGCGTGAAAAATCTGTTGTTCTGATGCGTGACCCGTC
TGGTATCATCAACCGTGGTAACTGGACCCGTCAGAAAGAATTCTGGTCTATGGTT
AACCAGCGTATCGAAGGTTACCTGGTTAAACAGATCCGTTCTCGTGTTCCGCTGC
AGGACTCTGCGTGCGAAAACACCGGTGACATCTAA
SEQ ATGGCGACCCGTTCTTTCATCCTGAAAATCGAACCGAACGAAGAAGTTAAAAAAG
ID GTCTGTGGAAAACCCACGAAGTTCTGAACCACGGTATCGCGTACTACATGAACAT
NO: CCTGAAACTGATCCGTCAGGAAGCGATCTACGAACACCACGAACAGGACCCGAAA
54 AACCCGAAAAAAGTTTCTAAAGCGGAAATCCAGGCGGAACTGTGGGACTTCGTTC
TGAAAATGCAGAAATGCAACTCTTTCACCCACGAAGTTGACAAAGACGTTGTTTT
CAACATCCTGCGTGAACTGTACGAAGAACTGGTTCCGTCTTCTGTTGAAAAAAAA
GGTGAAGCGAACCAGCTGTCTAACAAATTCCTGTACCCGCTGGTTGACCCGAACT
CTCAGTCTGGTAAAGGTACCGCGTCTTCTGGTCGTAAACCGCGTTGGTACAACCT
GAAAATCGCGGGTGACCCGTCTTGGGAAGAAGAAAAAAAAAAATGGGAAGAAGAC
AAAAAAAAAGACCCGCTGGCGAAAATCCTGGGTAAACTGGCGGAATACGGTCTGA
TCCCGCTGTTCATCCCGTTCACCGACTCTAACGAACCGATCGTTAAAGAAATCAA
ATGGATGGAAAAATCTCGTAACCAGTCTGTTCGTCGTCTGGACAAAGACATGTTC
ATCCAGGCGCTGGAACGTTTCCTGTCTTGGGAATCTTGGAACCTGAAAGTTAAAG
AAGAATACGAAAAAGTTGAAAAAGAACACAAAACCCTGGAAGAACGTATCAAAGA
AGACATCCAGGCGTTCAAATCTCTGGAACAGTACGAAAAAGAACGTCAGGAACAG
CTGCTGCGTGACACCCTGAACACCAACGAATACCGTCTGTCTAAACGTGGTCTGC
GTGGTTGGCGTGAAATCATCCAGAAATGGCTGAAAATGGACGAAAACGAACCGTC
TGAAAAATACCTGGAAGTTTTCAAAGACTACCAGCGTAAACACCCGCGTGAAGCG
GGTGACTACTCTGTTTACGAATTCCTGTCTAAAAAAGAAAACCACTTCATCTGGC
GTAACCACCCGGAATACCCGTACCTGTACGCGACCTTCTGCGAAATCGACAAAAA
AAAAAAAGACGCGAAACAGCAGGCGACCTTCACCCTGGCGGACCCGATCAACCAC
CCGCTGTGGGTTCGTTTCGAAGAACGTTCTGGTTCTAACCTGAACAAATACCGTA
TCCTGACCGAACAGCTGCACACCGAAAAACTGAAAAAAAAACTGACCGTTCAGCT
GGACCGTCTGATCTACCCGACCGAATCTGGTGGTTGGGAAGAAAAAGGTAAAGTT
GACATCGTTCTGCTGCCGTCTCGTCAGTTCTACAACCAGATCTTCCTGGACATCG
AAGAAAAAGGTAAACACGCGTTCACCTACAAAGACGAATCTATCAAATTCCCGCT
GAAAGGTACCCTGGGTGGTGCGCGTGTTCAGTTCGACCGTGACCACCTGCGTCGT
TACCCGCACAAAGTTGAATCTGGTAACGTTGGTCGTATCTACTTCAACATGACCG
TTAACATCGAACCGACCGAATCTCCGGTTTCTAAATCTCTGAAAATCCACCGTGA
CGACTTCCCGAAATTCGTTAACTTCAAACCGAAAGAACTGACCGAATGGATCAAA
GACTCTAAAGGTAAAAAACTGAAATCTGGTATCGAATCTCTGGAAATCGGTCTGC
GTGTTATGTCTATCGACCTGGGTCAGCGTCAGGCGGCGGCGGCGTCTATCTTCGA
AGTTGTTGACCAGAAACCGGACATCGAAGGTAAACTGTTCTTCCCGATCAAAGGT
ACCGAACTGTACGCGGTTCACCGTGCGTCTTTCAACATCAAACTGCCGGGTGAAA
CCCTGGTTAAATCTCGTGAAGTTCTGCGTAAAGCGCGTGAAGACAACCTGAAACT
GATGAACCAGAAACTGAACTTCCTGCGTAACGTTCTGCACTTCCAGCAGTTCGAA
GACATCACCGAACGTGAAAAACGTGTTACCAAATGGATCTCTCGTCAGGAAAACT
CTGACGTTCCGCTGGTTTACCAGGACGAACTGATCCAGATCCGTGAACTGATGTA
CAAACCGTACAAAGACTGGGTTGCGTTCCTGAAACAGCTGCACAAACGTCTGGAA
GTTGAAATCGGTAAAGAAGTTAAACACTGGCGTAAATCTCTGTCTGACGGTCGTA
AAGGTCTGTACGGTATCTCTCTGAAAAACATCGACGAAATCGACCGTACCCGTAA
ATTCCTGCTGCGTTGGTCTCTGCGTCCGACCGAACCGGGTGAAGTTCGTCGTCTG
GAACCGGGTCAGCGTTTCGCGATCGACCAGCTGAACCACCTGAACGCGCTGAAAG
AAGACCGTCTGAAAAAAATGGCGAACACCATCATCATGCACGCGCTGGGTTACTG
CTACGACGTTCGTAAAAAAAAATGGCAGGCGAAAAACCCGGCGTGCCAGATCATC
CTGTTCGAAGACCTGTCTAACTACAACCCGTACGAAGAACGTTCTCGTTTCGAAA
ACTCTAAACTGATGAAATGGTCTCGTCGTGAAATCCCGCGTCAGGTTGCGCTGCA
GGGTGAAATCTACGGTCTGCAGGTTGGTGAAGTTGGTGCGCAGTTCTCTTCTCGT
TTCCACGCGAAAACCGGTTCTCCGGGTATCCGTTGCTCTGTTGTTACCAAAGAAA
AACTGCAGGACAACCGTTTCTTCAAAAACCTGCAGCGTGAAGGTCGTCTGACCCT
GGACAAAATCGCGGTTCTGAAAGAAGGTGACCTGTACCCGGACAAAGGTGGTGAA
AAATTCATCTCTCTGTCTAAAGACCGTAAACTGGTTACCACCCACGCGGACATCA
ACGCGGCGCAGAACCTGCAGAAACGTTTCTGGACCCGTACCCACGGTTTCTACAA
AGTTTACTGCAAAGCGTACCAGGTTGACGGTCAGACCGTTTACATCCCGGAATCT
AAAGACCAGAAACAGAAAATCATCGAAGAATTCGGTGAAGGTTACTTCATCCTGA
AAGACGGTGTTTACGAATGGGGTAACGCGGGTAAACTGAAAATCAAAAAAGGTTC
TTCTAAACAGTCTTCTTCTGAACTGGTTGACTCTGACATCCTGAAAGACTCTTTC
GACCTGGCGTCTGAACTGAAAGGTGAAAAACTGATGCTGTACCGTGACCCGTCTG
GTAACGTTTTCCCGTCTGACAAATGGATGGCGGCGGGTGTTTTCTTCGGTAAACT
GGAACGTATCCTGATCTCTAAACTGACCAACCAGTACTCTATCTCTACCATCGAA
GACGACTCTTCTAAACAGTCTATGTAA
SEQ ATGCCGACCCGTACCATCAACCTGAAACTGGTTCTGGGTAAAAACCCGGAAAACG
ID CGACCCTGCGTCGTGCGCTGTTCTCTACCCACCGTCTGGTTAACCAGGCGACCAA
NO: ACGTATCGAAGAATTCCTGCTGCTGTGCCGTGGTGAAGCGTACCGTACCGTTGAC
55 AACGAAGGTAAAGAAGCGGAAATCCCGCGTCACGCGGTTCAGGAAGAAGCGCTGG
CGTTCGCGAAAGCGGCGCAGCGTCACAACGGTTGCATCTCTACCTACGAAGACCA
GGAAATCCTGGACGTTCTGCGTCAGCTGTACGAACGTCTGGTTCCGTCTGTTAAC
GAAAACAACGAAGCGGGTGACGCGCAGGCGGCGAACGCGTGGGTTTCTCCGCTGA
TGTCTGCGGAATCTGAAGGTGGTCTGTCTGTTTACGACAAAGTTCTGGACCCGCC
GCCGGTTTGGATGAAACTGAAAGAAGAAAAAGCGCCGGGTTGGGAAGCGGCGTCT
CAGATCTGGATCCAGTCTGACGAAGGTCAGTCTCTGCTGAACAAACCGGGTTCTC
CGCCGCGTTGGATCCGTAAACTGCGTTCTGGTCAGCCGTGGCAGGACGACTTCGT
TTCTGACCAGAAAAAAAAACAGGACGAACTGACCAAAGGTAACGCGCCGCTGATC
AAACAGCTGAAAGAAATGGGTCTGCTGCCGCTGGTTAACCCGTTCTTCCGTCACC
TGCTGGACCCGGAAGGTAAAGGTGTTTCTCCGTGGGACCGTCTGGCGGTTCGTGC
GGCGGTTGCGCACTTCATCTCTTGGGAATCTTGGAACCACCGTACCCGTGCGGAA
TACAACTCTCTGAAACTGCGTCGTGACGAATTCGAAGCGGCGTCTGACGAATTCA
AAGACGACTTCACCCTGCTGCGTCAGTACGAAGCGAAACGTCACTCTACCCTGAA
ATCTATCGCGCTGGCGGACGACTCTAACCCGTACCGTATCGGTGTTCGTTCTCTG
CGTGCGTGGAACCGTGTTCGTGAAGAATGGATCGACAAAGGTGCGACCGAAGAAC
AGCGTGTTACCATCCTGTCTAAACTGCAGACCCAGCTGCGTGGTAAATTCGGTGA
CCCGGACCTGTTCAACTGGCTGGCGCAGGACCGTCACGTTCACCTGTGGTCTCCG
CGTGACTCTGTTACCCCGCTGGTTCGTATCAACGCGGTTGACAAAGTTCTGCGTC
GTCGTAAACCGTACGCGCTGATGACCTTCGCGCACCCGCGTTTCCACCCGCGTTG
GATCCTGTACGAAGCGCCGGGTGGTTCTAACCTGCGTCAGTACGCGCTGGACTGC
ACCGAAAACGCGCTGCACATCACCCTGCCGCTGCTGGTTGACGACGCGCACGGTA
CCTGGATCGAAAAAAAAATCCGTGTTCCGCTGGCGCCGTCTGGTCAGATCCAGGA
CCTGACCCTGGAAAAACTGGAAAAAAAAAAAAACCGTCTGTACTACCGTTCTGGT
TTCCAGCAGTTCGCGGGTCTGGCGGGTGGTGCGGAAGTTCTGTTCCACCGTCCGT
ACATGGAACACGACGAACGTTCTGAAGAATCTCTGCTGGAACGTCCGGGTGCGGT
TTGGTTCAAACTGACCCTGGACGTTGCGACCCAGGCGCCGCCGAACTGGCTGGAC
GGTAAAGGTCGTGTTCGTACCCCGCCGGAAGTTCACCACTTCAAAACCGCGCTGT
CTAACAAATCTAAACACACCCGTACCCTGCAGCCGGGTCTGCGTGTTCTGTCTGT
TGACCTGGGTATGCGTACCTTCGCGTCTTGCTCTGTTTTCGAACTGATCGAAGGT
AAACCGGAAACCGGTCGTGCGTTCCCGGTTGCGGACGAACGTTCTATGGACTCTC
CGAACAAACTGTGGGCGAAACACGAACGTTCTTTCAAACTGACCCTGCCGGGTGA
AACCCCGTCTCGTAAAGAAGAAGAAGAACGTTCTATCGCGCGTGCGGAAATCTAC
GCGCTGAAACGTGACATCCAGCGTCTGAAATCTCTGCTGCGTCTGGGTGAAGAAG
ACAACGACAACCGTCGTGACGCGCTGCTGGAACAGTTCTTCAAAGGTTGGGGTGA
AGAAGACGTTGTTCCGGGTCAGGCGTTCCCGCGTTCTCTGTTCCAGGGTCTGGGT
GCGGCGCCGTTCCGTTCTACCCCGGAACTGTGGCGTCAGCACTGCCAGACCTACT
ACGACAAAGCGGAAGCGTGCCTGGCGAAACACATCTCTGACTGGCGTAAACGTAC
CCGTCCGCGTCCGACCTCTCGTGAAATGTGGTACAAAACCCGTTCTTACCACGGT
GGTAAATCTATCTGGATGCTGGAATACCTGGACGCGGTTCGTAAACTGCTGCTGT
CTTGGTCTCTGCGTGGTCGTACCTACGGTGCGATCAACCGTCAGGACACCGCGCG
TTTCGGTTCTCTGGCGTCTCGTCTGCTGCACCACATCAACTCTCTGAAAGAAGAC
CGTATCAAAACCGGTGCGGACTCTATCGTTCAGGCGGCGCGTGGTTACATCCCGC
TGCCGCACGGTAAAGGTTGGGAACAGCGTTACGAACCGTGCCAGCTGATCCTGTT
CGAAGACCTGGCGCGTTACCGTTTCCGTGTTGACCGTCCGCGTCGTGAAAACTCT
CAGCTGATGCAGTGGAACCACCGTGCGATCGTTGCGGAAACCACCATGCAGGCGG
AACTGTACGGTCAGATCGTTGAAAACACCGCGGCGGGTTTCTCTTCTCGTTTCCA
CGCGGCGACCGGTGCGCCGGGTGTTCGTTGCCGTTTCCTGCTGGAACGTGACTTC
GACAACGACCTGCCGAAACCGTACCTGCTGCGTGAACTGTCTTGGATGCTGGGTA
ACACCAAAGTTGAATCTGAAGAAGAAAAACTGCGTCTGCTGTCTGAAAAAATCCG
TCCGGGTTCTCTGGTTCCGTGGGACGGTGGTGAACAGTTCGCGACCCTGCACCCG
AAACGTCAGACCCTGTGCGTTATCCACGCGGACATGAACGCGGCGCAGAACCTGC
AGCGTCGTTTCTTCGGTCGTTGCGGTGAAGCGTTCCGTCTGGTTTGCCAGCCGCA
CGGTGACGACGTTCTGCGTCTGGCGTCTACCCCGGGTGCGCGTCTGCTGGGTGCG
CTGCAGCAGCTGGAAAACGGTCAGGGTGCGTTCGAACTGGTTCGTGACATGGGTT
CTACCTCTCAGATGAACCGTTTCGTTATGAAATCTCTGGGTAAAAAAAAAATCAA
ACCGCTGCAGGACAACAACGGTGACGACGAACTGGAAGACGTTCTGTCTGTTCTG
CCGGAAGAAGACGACACCGGTCGTATCACCGTTTTCCGTGACTCTTCTGGTATCT
TCTTCCCGTGCAACGTTTGGATCCCGGCGAAACAGTTCTGGCCGGCGGTTCGTGC
GATGATCTGGAAAGTTATGGCGTCTCACTCTCTGGGTTAA
SEQ ATGACCAAACTGCGTCACCGTCAGAAAAAACTGACCCACGACTGGGCGGGTTCTA
ID AAAAACGTGAAGTTCTGGGTTCTAACGGTAAACTGCAGAACCCGCTGCTGATGCC
NO: GGTTAAAAAAGGTCAGGTTACCGAATTCCGTAAAGCGTTCTCTGCGTACGCGCGT
56 GCGACCAAAGGTGAAATGACCGACGGTCGTAAAAACATGTTCACCCACTCTTTCG
AACCGTTCAAAACCAAACCGTCTCTGCACCAGTGCGAACTGGCGGACAAAGCGTA
CCAGTCTCTGCACTCTTACCTGCCGGGTTCTCTGGCGCACTTCCTGCTGTCTGCG
CACGCGCTGGGTTTCCGTATCTTCTCTAAATCTGGTGAAGCGACCGCGTTCCAGG
CGTCTTCTAAAATCGAAGCGTACGAATCTAAACTGGCGTCTGAACTGGCGTGCGT
TGACCTGTCTATCCAGAACCTGACCATCTCTACCCTGTTCAACGCGCTGACCACC
TCTGTTCGTGGTAAAGGTGAAGAAACCTCTGCGGACCCGCTGATCGCGCGTTTCT
ACACCCTGCTGACCGGTAAACCGCTGTCTCGTGACACCCAGGGTCCGGAACGTGA
CCTGGCGGAAGTTATCTCTCGTAAAATCGCGTCTTCTTTCGGTACCTGGAAAGAA
ATGACCGCGAACCCGCTGCAGTCTCTGCAGTTCTTCGAAGAAGAACTGCACGCGC
TGGACGCGAACGTTTCTCTGTCTCCGGCGTTCGACGTTCTGATCAAAATGAACGA
CCTGCAGGGTGACCTGAAAAACCGTACCATCGTTTTCGACCCGGACGCGCCGGTT
TTCGAATACAACGCGGAAGACCCGGCGGACATCATCATCAAACTGACCGCGCGTT
ACGCGAAAGAAGCGGTTATCAAAAACCAGAACGTTGGTAACTACGTTAAAAACGC
GATCACCACCACCAACGCGAACGGTCTGGGTTGGCTGCTGAACAAAGGTCTGTCT
CTGCTGCCGGTTTCTACCGACGACGAACTGCTGGAATTCATCGGTGTTGAACGTT
CTCACCCGTCTTGCCACGCGCTGATCGAACTGATCGCGCAGCTGGAAGCGCCGGA
ACTGTTCGAAAAAAACGTTTTCTCTGACACCCGTTCTGAAGTTCAGGGTATGATC
GACTCTGCGGTTTCTAACCACATCGCGCGTCTGTCTTCTTCTCGTAACTCTCTGT
CTATGGACTCTGAAGAACTGGAACGTCTGATCAAATCTTTCCAGATCCACACCCC
GCACTGCTCTCTGTTCATCGGTGCGCAGTCTCTGTCTCAGCAGCTGGAATCTCTG
CCGGAAGCGCTGCAGTCTGGTGTTAACTCTGCGGACATCCTGCTGGGTTCTACCC
AGTACATGCTGACCAACTCTCTGGTTGAAGAATCTATCGCGACCTACCAGCGTAC
CCTGAACCGTATCAACTACCTGTCTGGTGTTGCGGGTCAGATCAACGGTGCGATC
AAACGTAAAGCGATCGACGGTGAAAAAATCCACCTGCCGGCGGCGTGGTCTGAAC
TGATCTCTCTGCCGTTCATCGGTCAGCCGGTTATCGACGTTGAATCTGACCTGGC
GCACCTGAAAAACCAGTACCAGACCCTGTCTAACGAATTCGACACCCTGATCTCT
GCGCTGCAGAAAAACTTCGACCTGAACTTCAACAAAGCGCTGCTGAACCGTACCC
AGCACTTCGAAGCGATGTGCCGTTCTACCAAAAAAAACGCGCTGTCTAAACCGGA
AATCGTTTCTTACCGTGACCTGCTGGCGCGTCTGACCTCTTGCCTGTACCGTGGT
TCTCTGGTTCTGCGTCGTGCGGGTATCGAAGTTCTGAAAAAACACAAAATCTTCG
AATCTAACTCTGAACTGCGTGAACACGTTCACGAACGTAAACACTTCGTTTTCGT
TTCTCCGCTGGACCGTAAAGCGAAAAAACTGCTGCGTCTGACCGACTCTCGTCCG
GACCTGCTGCACGTTATCGACGAAATCCTGCAGCACGACAACCTGGAAAACAAAG
ACCGTGAATCTCTGTGGCTGGTTCGTTCTGGTTACCTGCTGGCGGGTCTGCCGGA
CCAGCTGTCTTCTTCTTTCATCAACCTGCCGATCATCACCCAGAAAGGTGACCGT
CGTCTGATCGACCTGATCCAGTACGACCAGATCAACCGTGACGCGTTCGTTATGC
TGGTTACCTCTGCGTTCAAATCTAACCTGTCTGGTCTGCAGTACCGTGCGAACAA
ACAGTCTTTCGTTGTTACCCGTACCCTGTCTCCGTACCTGGGTTCTAAACTGGTT
TACGTTCCGAAAGACAAAGACTGGCTGGTTCCGTCTCAGATGTTCGAAGGTCGTT
TCGCGGACATCCTGCAGTCTGACTACATGGTTTGGAAAGACGCGGGTCGTCTGTG
CGTTATCGACACCGCGAAACACCTGTCTAACATCAAAAAATCTGTTTTCTCTTCT
GAAGAAGTTCTGGCGTTCCTGCGTGAACTGCCGCACCGTACCTTCATCCAGACCG
AAGTTCGTGGTCTGGGTGTTAACGTTGACGGTATCGCGTTCAACAACGGTGACAT
CCCGTCTCTGAAAACCTTCTCTAACTGCGTTCAGGTTAAAGTTTCTCGTACCAAC
ACCTCTCTGGTTCAGACCCTGAACCGTTGGTTCGAAGGTGGTAAAGTTTCTCCGC
CGTCTATCCAGTTCGAACGTGCGTACTACAAAAAAGACGACCAGATCCACGAAGA
CGCGGCGAAACGTAAAATCCGTTTCCAGATGCCGGCGACCGAACTGGTTCACGCG
TCTGACGACGCGGGTTGGACCCCGTCTTACCTGCTGGGTATCGACCCGGGTGAAT
ACGGTATGGGTCTGTCTCTGGTTTCTATCAACAACGGTGAAGTTCTGGACTCTGG
TTTCATCCACATCAACTCTCTGATCAACTTCGCGTCTAAAAAATCTAACCACCAG
ACCAAAGTTGTTCCGCGTCAGCAGTACAAATCTCCGTACGCGAACTACCTGGAAC
AGTCTAAAGACTCTGCGGCGGGTGACATCGCGCACATCCTGGACCGTCTGATCTA
CAAACTGAACGCGCTGCCGGTTTTCGAAGCGCTGTCTGGTAACTCTCAGTCTGCG
GCGGACCAGGTTTGGACCAAAGTTCTGTCTTTCTACACCTGGGGTGACAACGACG
CGCAGAACTCTATCCGTAAACAGCACTGGTTCGGTGCGTCTCACTGGGACATCAA
AGGTATGCTGCGTCAGCCGCCGACCGAAAAAAAACCGAAACCGTACATCGCGTTC
CCGGGTTCTCAGGTTTCTTCTTACGGTAACTCTCAGCGTTGCTCTTGCTGCGGTC
GTAACCCGATCGAACAGCTGCGTGAAATGGCGAAAGACACCTCTATCAAAGAACT
GAAAATCCGTAACTCTGAAATCCAGCTGTTCGACGGTACCATCAAACTGTTCAAC
CCGGACCCGTCTACCGTTATCGAACGTCGTCGTCACAACCTGGGTCCGTCTCGTA
TCCCGGTTGCGGACCGTACCTTCAAAAACATCTCTCCGTCTTCTCTGGAATTCAA
AGAACTGATCACCATCGTTTCTCGTTCTATCCGTCACTCTCCGGAATTCATCGCG
AAAAAACGTGGTATCGGTTCTGAATACTTCTGCGCGTACTCTGACTGCAACTCTT
CTCTGAACTCTGAAGCGAACGCGGCGGCGAACGTTGCGCAGAAATTCCAGAAACA
GCTGTTCTTCGAACTGTAA
SEQ ATGAAACGTATCCTGAACTCTCTGAAAGTTGCGGCGCTGCGTCTGCTGTTCCGTG
ID GTAAAGGTTCTGAACTGGTTAAAACCGTTAAATACCCGCTGGTTTCTCCGGTTCA
NO: GGGTGCGGTTGAAGAACTGGCGGAAGCGATCCGTCACGACAACCTGCACCTGTTC
57 GGTCAGAAAGAAATCGTTGACCTGATGGAAAAAGACGAAGGTACCCAGGTTTACT
CTGTTGTTGACTTCTGGCTGGACACCCTGCGTCTGGGTATGTTCTTCTCTCCGTC
TGCGAACGCGCTGAAAATCACCCTGGGTAAATTCAACTCTGACCAGGTTTCTCCG
TTCCGTAAAGTTCTGGAACAGTCTCCGTTCTTCCTGGCGGGTCGTCTGAAAGTTG
AACCGGCGGAACGTATCCTGTCTGTTGAAATCCGTAAAATCGGTAAACGTGAAAA
CCGTGTTGAAAACTACGCGGCGGACGTTGAAACCTGCTTCATCGGTCAGCTGTCT
TCTGACGAAAAACAGTCTATCCAGAAACTGGCGAACGACATCTGGGACTCTAAAG
ACCACGAAGAACAGCGTATGCTGAAAGCGGACTTCTTCGCGATCCCGCTGATCAA
AGACCCGAAAGCGGTTACCGAAGAAGACCCGGAAAACGAAACCGCGGGTAAACAG
AAACCGCTGGAACTGTGCGTTTGCCTGGTTCCGGAACTGTACACCCGTGGTTTCG
GTTCTATCGCGGACTTCCTGGTTCAGCGTCTGACCCTGCTGCGTGACAAAATGTC
TACCGACACCGCGGAAGACTGCCTGGAATACGTTGGTATCGAAGAAGAAAAAGGT
AACGGTATGAACTCTCTGCTGGGTACCTTCCTGAAAAACCTGCAGGGTGACGGTT
TCGAACAGATCTTCCAGTTCATGCTGGGTTCTTACGTTGGTTGGCAGGGTAAAGA
AGACGTTCTGCGTGAACGTCTGGACCTGCTGGCGGAAAAAGTTAAACGTCTGCCG
AAACCGAAATTCGCGGGTGAATGGTCTGGTCACCGTATGTTCCTGCACGGTCAGC
TGAAATCTTGGTCTTCTAACTTCTTCCGTCTGTTCAACGAAACCCGTGAACTGCT
GGAATCTATCAAATCTGACATCCAGCACGCGACCATGCTGATCTCTTACGTTGAA
GAAAAAGGTGGTTACCACCCGCAGCTGCTGTCTCAGTACCGTAAACTGATGGAAC
AGCTGCCGGCGCTGCGTACCAAAGTTCTGGACCCGGAAATCGAAATGACCCACAT
GTCTGAAGCGGTTCGTTCTTACATCATGATCCACAAATCTGTTGCGGGTTTCCTG
CCGGACCTGCTGGAATCTCTGGACCGTGACAAAGACCGTGAATTCCTGCTGTCTA
TCTTCCCGCGTATCCCGAAAATCGACAAAAAAACCAAAGAAATCGTTGCGTGGGA
ACTGCCGGGTGAACCGGAAGAAGGTTACCTGTTCACCGCGAACAACCTGTTCCGT
AACTTCCTGGAAAACCCGAAACACGTTCCGCGTTTCATGGCGGAACGTATCCCGG
AAGACTGGACCCGTCTGCGTTCTGCGCCGGTTTGGTTCGACGGTATGGTTAAACA
GTGGCAGAAAGTTGTTAACCAGCTGGTTGAATCTCCGGGTGCGCTGTACCAGTTC
AACGAATCTTTCCTGCGTCAGCGTCTGCAGGCGATGCTGACCGTTTACAAACGTG
ACCTGCAGACCGAAAAATTCCTGAAACTGCTGGCGGACGTTTGCCGTCCGCTGGT
TGACTTCTTCGGTCTGGGTGGTAACGACATCATCTTCAAATCTTGCCAGGACCCG
CGTAAACAGTGGCAGACCGTTATCCCGCTGTCTGTTCCGGCGGACGTTTACACCG
CGTGCGAAGGTCTGGCGATCCGTCTGCGTGAAACCCTGGGTTTCGAATGGAAAAA
CCTGAAAGGTCACGAACGTGAAGACTTCCTGCGTCTGCACCAGCTGCTGGGTAAC
CTGCTGTTCTGGATCCGTGACGCGAAACTGGTTGTTAAACTGGAAGACTGGATGA
ACAACCCGTGCGTTCAGGAATACGTTGAAGCGCGTAAAGCGATCGACCTGCCGCT
GGAAATCTTCGGTTTCGAAGTTCCGATCTTCCTGAACGGTTACCTGTTCTCTGAA
CTGCGTCAGCTGGAACTGCTGCTGCGTCGTAAATCTGTTATGACCTCTTACTCTG
TTAAAACCACCGGTTCTCCGAACCGTCTGTTCCAGCTGGTTTACCTGCCGCTGAA
CCCGTCTGACCCGGAAAAAAAAAACTCTAACAACTTCCAGGAACGTCTGGACACC
CCGACCGGTCTGTCTCGTCGTTTCCTGGACCTGACCCTGGACGCGTTCGCGGGTA
AACTGCTGACCGACCCGGTTACCCAGGAACTGAAAACCATGGCGGGTTTCTACGA
CCACCTGTTCGGTTTCAAACTGCCGTGCAAACTGGCGGCGATGTCTAACCACCCG
GGTTCTTCTTCTAAAATGGTTGTTCTGGCGAAACCGAAAAAAGGTGTTGCGTCTA
ACATCGGTTTCGAACCGATCCCGGACCCGGCGCACCCGGTTTTCCGTGTTCGTTC
TTCTTGGCCGGAACTGAAATACCTGGAAGGTCTGCTGTACCTGCCGGAAGACACC
CCGCTGACCATCGAACTGGCGGAAACCTCTGTTTCTTGCCAGTCTGTTTCTTCTG
TTGCGTTCGACCTGAAAAACCTGACCACCATCCTGGGTCGTGTTGGTGAATTCCG
TGTTACCGCGGACCAGCCGTTCAAACTGACCCCGATCATCCCGGAAAAAGAAGAA
TCTTTCATCGGTAAAACCTACCTGGGTCTGGACGCGGGTGAACGTTCTGGTGTTG
GTTTCGCGATCGTTACCGTTGACGGTGACGGTTACGAAGTTCAGCGTCTGGGTGT
TCACGAAGACACCCAGCTGATGGCGCTGCAGCAGGTTGCGTCTAAATCTCTGAAA
GAACCGGTTTTCCAGCCGCTGCGTAAAGGTACCTTCCGTCAGCAGGAACGTATCC
GTAAATCTCTGCGTGGTTGCTACTGGAACTTCTACCACGCGCTGATGATCAAATA
CCGTGCGAAAGTTGTTCACGAAGAATCTGTTGGTTCTTCTGGTCTGGTTGGTCAG
TGGCTGCGTGCGTTCCAGAAAGACCTGAAAAAAGCGGACGTTCTGCCGAAAAAAG
GTGGTAAAAACGGTGTTGACAAAAAAAAACGTGAATCTTCTGCGCAGGACACCCT
GTGGGGTGGTGCGTTCTCTAAAAAAGAAGAACAGCAGATCGCGTTCGAAGTTCAG
GCGGCGGGTTCTTCTCAGTTCTGCCTGAAATGCGGTTGGTGGTTCCAGCTGGGTA
TGCGTGAAGTTAACCGTGTTCAGGAATCTGGTGTTGTTCTGGACTGGAACCGTTC
TATCGTTACCTTCCTGATCGAATCTTCTGGTGAAAAAGTTTACGGTTTCTCTCCG
CAGCAGCTGGAAAAAGGTTTCCGTCCGGACATCGAAACCTTCAAAAAAATGGTTC
GTGACTTCATGCGTCCGCCGATGTTCGACCGTAAAGGTCGTCCGGCGGCGGCGTA
CGAACGTTTCGTTCTGGGTCGTCGTCACCGTCGTTACCGTTTCGACAAAGTTTTC
GAAGAACGTTTCGGTCGTTCTGCGCTGTTCATCTGCCCGCGTGTTGGTTGCGGTA
ACTTCGACCACTCTTCTGAACAGTCTGCGGTTGTTCTGGCGCTGATCGGTTACAT
CGCGGACAAAGAAGGTATGTCTGGTAAAAAACTGGTTTACGTTCGTCTGGCGGAA
CTGATGGCGGAATGGAAACTGAAAAAACTGGAACGTTCTCGTGTTGAAGAACAGT
CTTCTGCGCAGTAA
SEQ ATGGCGGAATCTAAACAGATGCAGTGCCGTAAATGCGGTGCGTCTATGAAATACG
ID AAGTTATCGGTCTGGGTAAAAAATCTTGCCGTTACATGTGCCCGGACTGCGGTAA
NO: CCACACCTCTGCGCGTAAAATCCAGAACAAAAAAAAACGTGACAAAAAATACGGT
58 TCTGCGTCTAAAGCGCAGTCTCAGCGTATCGCGGTTGCGGGTGCGCTGTACCCGG
ACAAAAAAGTTCAGACCATCAAAACCTACAAATACCCGGCGGACCTGAACGGTGA
AGTTCACGACTCTGGTGTTGCGGAAAAAATCGCGCAGGCGATCCAGGAAGACGAA
ATCGGTCTGCTGGGTCCGTCTTCTGAATACGCGTGCTGGATCGCGTCTCAGAAAC
AGTCTGAACCGTACTCTGTTGTTGACTTCTGGTTCGACGCGGTTTGCGCGGGTGG
TGTTTTCGCGTACTCTGGTGCGCGTCTGCTGTCTACCGTTCTGCAGCTGTCTGGT
GAAGAATCTGTTCTGCGTGCGGCGCTGGCGTCTTCTCCGTTCGTTGACGACATCA
ACCTGGCGCAGGCGGAAAAATTCCTGGCGGTTTCTCGTCGTACCGGTCAGGACAA
ACTGGGTAAACGTATCGGTGAATGCTTCGCGGAAGGTCGTCTGGAAGCGCTGGGT
ATCAAAGACCGTATGCGTGAATTCGTTCAGGCGATCGACGTTGCGCAGACCGCGG
GTCAGCGTTTCGCGGCGAAACTGAAAATCTTCGGTATCTCTCAGATGCCGGAAGC
GAAACAGTGGAACAACGACTCTGGTCTGACCGTTTGCATCCTGCCGGACTACTAC
GTTCCGGAAGAAAACCGTGCGGACCAGCTGGTTGTTCTGCTGCGTCGTCTGCGTG
AAATCGCGTACTGCATGGGTATCGAAGACGAAGCGGGTTTCGAACACCTGGGTAT
CGACCCGGGTGCGCTGTCTAACTTCTCTAACGGTAACCCGAAACGTGGTTTCCTG
GGTCGTCTGCTGAACAACGACATCATCGCGCTGGCGAACAACATGTCTGCGATGA
CCCCGTACTGGGAAGGTCGTAAAGGTGAACTGATCGAACGTCTGGCGTGGCTGAA
ACACCGTGCGGAAGGTCTGTACCTGAAAGAACCGCACTTCGGTAACTCTTGGGCG
GACCACCGTTCTCGTATCTTCTCTCGTATCGCGGGTTGGCTGTCTGGTTGCGCGG
GTAAACTGAAAATCGCGAAAGACCAGATCTCTGGTGTTCGTACCGACCTGTTCCT
GCTGAAACGTCTGCTGGACGCGGTTCCGCAGTCTGCGCCGTCTCCGGACTTCATC
GCGTCTATCTCTGCGCTGGACCGTTTCCTGGAAGCGGCGGAATCTTCTCAGGACC
CGGCGGAACAGGTTCGTGCGCTGTACGCGTTCCACCTGAACGCGCCGGCGGTTCG
TTCTATCGCGAACAAAGCGGTTCAGCGTTCTGACTCTCAGGAATGGCTGATCAAA
GAACTGGACGCGGTTGACCACCTGGAATTCAACAAAGCGTTCCCGTTCTTCTCTG
ACACCGGTAAAAAAAAAAAAAAAGGTGCGAACTCTAACGGTGCGCCGTCTGAAGA
AGAATACACCGAAACCGAATCTATCCAGCAGCCGGAAGACGCGGAACAGGAAGTT
AACGGTCAGGAAGGTAACGGTGCGTCTAAAAACCAGAAAAAATTCCAGCGTATCC
CGCGTTTCTTCGGTGAAGGTTCTCGTTCTGAATACCGTATCCTGACCGAAGCGCC
GCAGTACTTCGACATGTTCTGCAACAACATGCGTGCGATCTTCATGCAGCTGGAA
TCTCAGCCGCGTAAAGCGCCGCGTGACTTCAAATGCTTCCTGCAGAACCGTCTGC
AGAAACTGTACAAACAGACCTTCCTGAACGCGCGTTCTAACAAATGCCGTGCGCT
GCTGGAATCTGTTCTGATCTCTTGGGGTGAATTCTACACCTACGGTGCGAACGAA
AAAAAATTCCGTCTGCGTCACGAAGCGTCTGAACGTTCTTCTGACCCGGACTACG
TTGTTCAGCAGGCGCTGGAAATCGCGCGTCGTCTGTTCCTGTTCGGTTTCGAATG
GCGTGACTGCTCTGCGGGTGAACGTGTTGACCTGGTTGAAATCCACAAAAAAGCG
ATCTCTTTCCTGCTGGCGATCACCCAGGCGGAAGTTTCTGTTGGTTCTTACAACT
GGCTGGGTAACTCTACCGTTTCTCGTTACCTGTCTGTTGCGGGTACCGACACCCT
GTACGGTACCCAGCTGGAAGAATTCCTGAACGCGACCGTTCTGTCTCAGATGCGT
GGTCTGGCGATCCGTCTGTCTTCTCAGGAACTGAAAGACGGTTTCGACGTTCAGC
TGGAATCTTCTTGCCAGGACAACCTGCAGCACCTGCTGGTTTACCGTGCGTCTCG
TGACCTGGCGGCGTGCAAACGTGCGACCTGCCCGGCGGAACTGGACCCGAAAATC
CTGGTTCTGCCGGTTGGTGCGTTCATCGCGTCTGTTATGAAAATGATCGAACGTG
GTGACGAACCGCTGGCGGGTGCGTACCTGCGTCACCGTCCGCACTCTTTCGGTTG
GCAGATCCGTGTTCGTGGTGTTGCGGAAGTTGGTATGGACCAGGGTACCGCGCTG
GCGTTCCAGAAACCGACCGAATCTGAACCGTTCAAAATCAAACCGTTCTCTGCGC
AGTACGGTCCGGTTCTGTGGCTGAACTCTTCTTCTTACTCTCAGTCTCAGTACCT
GGACGGTTTCCTGTCTCAGCCGAAAAACTGGTCTATGCGTGTTCTGCCGCAGGCG
GGTTCTGTTCGTGTTGAACAGCGTGTTGCGCTGATCTGGAACCTGCAGGCGGGTA
AAATGCGTCTGGAACGTTCTGGTGCGCGTGCGTTCTTCATGCCGGTTCCGTTCTC
TTTCCGTCCGTCTGGTTCTGGTGACGAAGCGGTTCTGGCGCCGAACCGTTACCTG
GGTCTGTTCCCGCACTCTGGTGGTATCGAATACGCGGTTGTTGACGTTCTGGACT
CTGCGGGTTTCAAAATCCTGGAACGTGGTACCATCGCGGTTAACGGTTTCTCTCA
GAAACGTGGTGAACGTCAGGAAGAAGCGCACCGTGAAAAACAGCGTCGTGGTATC
TCTGACATCGGTCGTAAAAAACCGGTTCAGGCGGAAGTTGACGCGGCGAACGAAC
TGCACCGTAAATACACCGACGTTGCGACCCGTCTGGGTTGCCGTATCGTTGTTCA
GTGGGCGCCGCAGCCGAAACCGGGTACCGCGCCGACCGCGCAGACCGTTTACGCG
CGTGCGGTTCGTACCGAAGCGCCGCGTTCTGGTAACCAGGAAGACCACGCGCGTA
TGAAATCTTCTTGGGGTTACACCTGGGGTACCTACTGGGAAAAACGTAAACCGGA
AGACATCCTGGGTATCTCTACCCAGGTTTACTGGACCGGTGGTATCGGTGAATCT
TGCCCGGCGGTTGCGGTTGCGCTGCTGGGTCACATCCGTGCGACCTCTACCCAGA
CCGAATGGGAAAAAGAAGAAGTTGTTTTCGGTCGTCTGAAAAAATTCTTCCCGTC
TTAA
SEQ ATGGAAAAACGTATCAACAAAATCCGTAAAAAACTGTCTGCGGACAACGCGACCA
ID AACCGGTTTCTCGTTCTGGTCCGATGAAAACCCTGCTGGTTCGTGTTATGACCGA
NO: CGACCTGAAAAAACGTCTGGAAAAACGTCGTAAAAAACCGGAAGTTATGCCGCAG
59 GTTATCTCTAACAACGCGGCGAACAACCTGCGTATGCTGCTGGACGACTACACCA
AAATGAAAGAAGCGATCCTGCAGGTTTACTGGCAGGAATTCAAAGACGACCACGT
TGGTCTGATGTGCAAATTCGCGCAGCCGGCGTCTAAAAAAATCGACCAGAACAAA
CTGAAACCGGAAATGGACGAAAAAGGTAACCTGACCACCGCGGGTTTCGCGTGCT
CTCAGTGCGGTCAGCCGCTGTTCGTTTACAAACTGGAACAGGTTTCTGAAAAAGG
TAAAGCGTACACCAACTACTTCGGTCGTTGCAACGTTGCGGAACACGAAAAACTG
ATCCTGCTGGCGCAGCTGAAACCGGAAAAAGACTCTGACGAAGCGGTTACCTACT
CTCTGGGTAAATTCGGTCAGCGTGCGCTGGACTTCTACTCTATCCACGTTACCAA
AGAATCTACCCACCCGGTTAAACCGCTGGCGCAGATCGCGGGTAACCGTTACGCG
TCTGGTCCGGTTGGTAAAGCGCTGTCTGACGCGTGCATGGGTACCATCGCGTCTT
TCCTGTCTAAATACCAGGACATCATCATCGAACACCAGAAAGTTGTTAAAGGTAA
CCAGAAACGTCTGGAATCTCTGCGTGAACTGGCGGGTAAAGAAAACCTGGAATAC
CCGTCTGTTACCCTGCCGCCGCAGCCGCACACCAAAGAAGGTGTTGACGCGTACA
ACGAAGTTATCGCGCGTGTTCGTATGTGGGTTAACCTGAACCTGTGGCAGAAACT
GAAACTGTCTCGTGACGACGCGAAACCGCTGCTGCGTCTGAAAGGTTTCCCGTCT
TTCCCGGTTGTTGAACGTCGTGAAAACGAAGTTGACTGGTGGAACACCATCAACG
AAGTTAAAAAACTGATCGACGCGAAACGTGACATGGGTCGTGTTTTCTGGTCTGG
TGTTACCGCGGAAAAACGTAACACCATCCTGGAAGGTTACAACTACCTGCCGAAC
GAAAACGACCACAAAAAACGTGAAGGTTCTCTGGAAAACCCGAAAAAACCGGCGA
AACGTCAGTTCGGTGACCTGCTGCTGTACCTGGAAAAAAAATACGCGGGTGACTG
GGGTAAAGTTTTCGACGAAGCGTGGGAACGTATCGACAAAAAAATCGCGGGTCTG
ACCTCTCACATCGAACGTGAAGAAGCGCGTAACGCGGAAGACGCGCAGTCTAAAG
CGGTTCTGACCGACTGGCTGCGTGCGAAAGCGTCTTTCGTTCTGGAACGTCTGAA
AGAAATGGACGAAAAAGAATTCTACGCGTGCGAAATCCAGCTGCAGAAATGGTAC
GGTGACCTGCGTGGTAACCCGTTCGCGGTTGAAGCGGAAAACCGTGTTGTTGACA
TCTCTGGTTTCTCTATCGGTTCTGACGGTCACTCTATCCAGTACCGTAACCTGCT
GGCGTGGAAATACCTGGAAAACGGTAAACGTGAATTCTACCTGCTGATGAACTAC
GGTAAAAAAGGTCGTATCCGTTTCACCGACGGTACCGACATCAAAAAATCTGGTA
AATGGCAGGGTCTGCTGTACGGTGGTGGTAAAGCGAAAGTTATCGACCTGACCTT
CGACCCGGACGACGAACAGCTGATCATCCTGCCGCTGGCGTTCGGTACCCGTCAG
GGTCGTGAATTCATCTGGAACGACCTGCTGTCTCTGGAAACCGGTCTGATCAAAC
TGGCGAACGGTCGTGTTATCGAAAAAACCATCTACAACAAAAAAATCGGTCGTGA
CGAACCGGCGCTGTTCGTTGCGCTGACCTTCGAACGTCGTGAAGTTGTTGACCCG
TCTAACATCAAACCGGTTAACCTGATCGGTGTTGACCGTGGTGAAAACATCCCGG
CGGTTATCGCGCTGACCGACCCGGAAGGTTGCCCGCTGCCGGAATTCAAAGACTC
TTCTGGTGGTCCGACCGACATCCTGCGTATCGGTGAAGGTTACAAAGAAAAACAG
CGTGCGATCCAGGCGGCGAAAGAAGTTGAACAGCGTCGTGCGGGTGGTTACTCTC
GTAAATTCGCGTCTAAATCTCGTAACCTGGCGGACGACATGGTTCGTAACTCTGC
GCGTGACCTGTTCTACCACGCGGTTACCCACGACGCGGTTCTGGTTTTCGAAAAC
CTGTCTCGTGGTTTCGGTCGTCAGGGTAAACGTACCTTCATGACCGAACGTCAGT
ACACCAAAATGGAAGACTGGCTGACCGCGAAACTGGCGTACGAAGGTCTGACCTC
TAAAACCTACCTGTCTAAAACCCTGGCGCAGTACACCTCTAAAACCTGCTCTAAC
TGCGGTTTCACCATCACCACCGCGGACTACGACGGTATGCTGGTTCGTCTGAAAA
AAACCTCTGACGGTTGGGCGACCACCCTGAACAACAAAGAACTGAAAGCGGAAGG
TCAGATCACCTACTACAACCGTTACAAACGTCAGACCGTTGAAAAAGAACTGTCT
GCGGAACTGGACCGTCTGTCTGAAGAATCTGGTAACAACGACATCTCTAAATGGA
CCAAAGGTCGTCGTGACGAAGCGCTGTTCCTGCTGAAAAAACGTTTCTCTCACCG
TCCGGTTCAGGAACAGTTCGTTTGCCTGGACTGCGGTCACGAAGTTCACGCGGAC
GAACAGGCGGCGCTGAACATCGCGCGTTCTTGGCTGTTCCTGAACTCTAACTCTA
CCGAATTCAAATCTTACAAATCTGGTAAACAGCCGTTCGTTGGTGCGTGGCAGGC
GTTCTACAAACGTCGTCTGAAAGAAGTTTGGAAACCGAACGCG
SEQ ATGAAACGTATCAACAAAATCCGTCGTCGTCTGGTTAAAGACTCTAACACCAAAA
ID AAGCGGGTAAAACCGGTCCGATGAAAACCCTGCTGGTTCGTGTTATGACCCCGGA
NO: CCTGCGTGAACGTCTGGAAAACCTGCGTAAAAAACCGGAAAACATCCCGCAGCCG
60 ATCTCTAACACCTCTCGTGCGAACCTGAACAAACTGCTGACCGACTACACCGAAA
TGAAAAAAGCGATCCTGCACGTTTACTGGGAAGAATTCCAGAAAGACCCGGTTGG
TCTGATGTCTCGTGTTGCGCAGCCGGCGCCGAAAAACATCGACCAGCGTAAACTG
ATCCCGGTTAAAGACGGTAACGAACGTCTGACCTCTTCTGGTTTCGCGTGCTCTC
AGTGCTGCCAGCCGCTGTACGTTTACAAACTGGAACAGGTTAACGACAAAGGTAA
ACCGCACACCAACTACTTCGGTCGTTGCAACGTTTCTGAACACGAACGTCTGATC
CTGCTGTCTCCGCACAAACCGGAAGCGAACGACGAACTGGTTACCTACTCTCTGG
GTAAATTCGGTCAGCGTGCGCTGGACTTCTACTCTATCCACGTTACCCGTGAATC
TAACCACCCGGTTAAACCGCTGGAACAGATCGGTGGTAACTCTTGCGCGTCTGGT
CCGGTTGGTAAAGCGCTGTCTGACGCGTGCATGGGTGCGGTTGCGTCTTTCCTGA
CCAAATACCAGGACATCATCCTGGAACACCAGAAAGTTATCAAAAAAAACGAAAA
ACGTCTGGCGAACCTGAAAGACATCGCGTCTGCGAACGGTCTGGCGTTCCCGAAA
ATCACCCTGCCGCCGCAGCCGCACACCAAAGAAGGTATCGAAGCGTACAACAACG
TTGTTGCGCAGATCGTTATCTGGGTTAACCTGAACCTGTGGCAGAAACTGAAAAT
CGGTCGTGACGAAGCGAAACCGCTGCAGCGTCTGAAAGGTTTCCCGTCTTTCCCG
CTGGTTGAACGTCAGGCGAACGAAGTTGACTGGTGGGACATGGTTTGCAACGTTA
AAAAACTGATCAACGAAAAAAAAGAAGACGGTAAAGTTTTCTGGCAGAACCTGGC
GGGTTACAAACGTCAGGAAGCGCTGCTGCCGTACCTGTCTTCTGAAGAAGACCGT
AAAAAAGGTAAAAAATTCGCGCGTTACCAGTTCGGTGACCTGCTGCTGCACCTGG
AAAAAAAACACGGTGAAGACTGGGGTAAAGTTTACGACGAAGCGTGGGAACGTAT
CGACAAAAAAGTTGAAGGTCTGTCTAAACACATCAAACTGGAAGAAGAACGTCGT
TCTGAAGACGCGCAGTCTAAAGCGGCGCTGACCGACTGGCTGCGTGCGAAAGCGT
CTTTCGTTATCGAAGGTCTGAAAGAAGCGGACAAAGACGAATTCTGCCGTTGCGA
ACTGAAACTGCAGAAATGGTACGGTGACCTGCGTGGTAAACCGTTCGCGATCGAA
GCGGAAAACTCTATCCTGGACATCTCTGGTTTCTCTAAACAGTACAACTGCGCGT
TCATCTGGCAGAAAGACGGTGTTAAAAAACTGAACCTGTACCTGATCATCAACTA
CTTCAAAGGTGGTAAACTGCGTTTCAAAAAAATCAAACCGGAAGCGTTCGAAGCG
AACCGTTTCTACACCGTTATCAACAAAAAATCTGGTGAAATCGTTCCGATGGAAG
TTAACTTCAACTTCGACGACCCGAACCTGATCATCCTGCCGCTGGCGTTCGGTAA
ACGTCAGGGTCGTGAATTCATCTGGAACGACCTGCTGTCTCTGGAAACCGGTTCT
CTGAAACTGGCGAACGGTCGTGTTATCGAAAAAACCCTGTACAACCGTCGTACCC
GTCAGGACGAACCGGCGCTGTTCGTTGCGCTGACCTTCGAACGTCGTGAAGTTCT
GGACTCTTCTAACATCAAACCGATGAACCTGATCGGTATCGACCGTGGTGAAAAC
ATCCCGGCGGTTATCGCGCTGACCGACCCGGAAGGTTGCCCGCTGTCTCGTTTCA
AAGACTCTCTGGGTAACCCGACCCACATCCTGCGTATCGGTGAATCTTACAAAGA
AAAACAGCGTACCATCCAGGCGGCGAAAGAAGTTGAACAGCGTCGTGCGGGTGGT
TACTCTCGTAAATACGCGTCTAAAGCGAAAAACCTGGCGGACGACATGGTTCGTA
ACACCGCGCGTGACCTGCTGTACTACGCGGTTACCCAGGACGCGATGCTGATCTT
CGAAAACCTGTCTCGTGGTTTCGGTCGTCAGGGTAAACGTACCTTCATGGCGGAA
CGTCAGTACACCCGTATGGAAGACTGGCTGACCGCGAAACTGGCGTACGAAGGTC
TGCCGTCTAAAACCTACCTGTCTAAAACCCTGGCGCAGTACACCTCTAAAACCTG
CTCTAACTGCGGTTTCACCATCACCTCTGCGGACTACGACCGTGTTCTGGAAAAA
CTGAAAAAAACCGCGACCGGTTGGATGACCACCATCAACGGTAAAGAACTGAAAG
TTGAAGGTCAGATCACCTACTACAACCGTTACAAACGTCAGAACGTTGTTAAAGA
CCTGTCTGTTGAACTGGACCGTCTGTCTGAAGAATCTGTTAACAACGACATCTCT
TCTTGGACCAAAGGTCGTTCTGGTGAAGCGCTGTCTCTGCTGAAAAAACGTTTCT
CTCACCGTCCGGTTCAGGAAAAATTCGTTTGCCTGAACTGCGGTTTCGAAACCCA
CGCGGACGAACAGGCGGCGCTGAACATCGCGCGTTCTTGGCTGTTCCTGCGTTCT
CAGGAATACAAAAAATACCAGACCAACAAAACCACCGGTAACACCGACAAACGTG
CGTTCGTTGAAACCTGGCAGTCTTTCTACCGTAAAAAACTGAAAGAAGTTTGGAA
ACCG
SEQ AAAATTCcatGCAAAATGCTCCGGTTTCATGTCATCAAAATGATGACGTAATTAA
ID GCATTGATAATTGAGATCCCTCTCCCTGACAGGATGATTACATAAATAATAGTGA
NO: CAAAAATAAATTATTTATTTATCCAGAAAATGAATTGGAAAATCAGGAGAGCGTT
61 TTCAATCCTACCTCTGGCGCAGTTGATATGTcaaaCAGGTtgccgtcactgcgtc
ttttactggctcttctcgctaaccaaaccggtaaccccgcttattaaaagcattc
tgtaacaaagcgggaccaaagccatgacaaaaacgcgtaacaaaagtgtctataa
tcacggcagaaaagtccacattgattatttgcacggcgtcacactttgctatgcc
atagcatttttatccataagattagcggatcctacctgacgctttttatcgcaac
tctctactgtttctccatacccgtttttttgggctagcaccgcctatctcgtgtg
agataggcggagatacgaactttaagAAGGAGatataccATGGGTAAAATGTATT
ACCTTGGTTTAGACATTGGCACGAATTCCGTGGGCTACGCGGTGACCGACCCCTC
ATACCACCTGCTGAAGTTTAAGGGGGAACCAATGTGGGGTGCGCACGTATTTGCC
GCCGGTAATCAGAGCGCGGAACGACGCTCGTTCCGCACATCGCGTCGTCGTTTGG
ACCGACGCCAACAGCGCGTTAAACTGGTACAGGAGATTTTTGCCCCGGTGATTAG
TCCGATCGACCCACGCTTCTTCATTCGTCTGCATGAATCCGCCCTGTGGCGCGAT
GACGTCGCGGAGACGGATAAACATATCTTTTTCAATGATCCTACCTATACCGATA
AGGAATATTATAGCGATTACCCGACTATCCATCACCTGATCGTTGATCTGATGGA
AAGCTCTGAGAAACACGATCCGCGGCTGGTGTACCTTGCAGTGGCGTGGTTAGTG
GCACACCGTGGTCATTTTCTGAACGAGGTGGACAAGGATAATATTGGAGATGTGT
TGTCGTTCGACGCATTTTATCCGGAGTTTCTCGCGTTCCTGTCGGACAACGGTGT
ATCACCGTGGGTGTGCGAAAGCAAAGCGCTGCAGGCGACCTTGCTGAGCCGTAAC
TCAGTGAACGACAAATATAAAGCCCTTAAGTCTCTGATCTTCGGATCCCAGAAAC
CTGAAGATAACTTCGATGCCAATATTTCGGAAGATGGACTCATTCAACTGCTGGC
CGGCAAAAAGGTAAAAGTTAACAAACTGTTCCCTCAGGAATCGAACGATGCATCC
TTCACATTGAATGATAAAGAAGACGCGATAGAAGAAATCCTGGGTACGCTTACAC
CAGATGAATGTGAATGGATTGCGCATATACGCCGCCTTTTTGACTGGGCTATCAT
GAAACATGCTCTGAAAGATGGCAGGACTATTAGCGAGTCAAAAGTCAAACTGTAT
GAGCAGCACCATCACGATCTGACCCAACTTAAATACTTCGTGAAAACCTACCTTG
CAAAAGAATACGACGATATTTTCCGCAACGTGGATAGCGAAACAACGAAAAACTA
TGTAGCGTATTCCTATCATGTGAAAGAGGTGAAAGGCACTCTGCCTAAAAATAAG
GCAACGCAAGAAGAGTTTTGTAAGTATGTCCTGGGCAAGGTTAAAAACATTGAAT
GCTCTGAAGCAGACAAGGTTGACTTTGATGAGATGATTCAGCGTCTTACCGACAA
CTCTTTTATGCCTAAGCAGGTTTCGGGCGAAAACCGCGTTATTCCTTATCAGTTA
TATTATTATGAACTGAAGACAATTCTGAATAAAGCAGCCTCGTACCTGCCTTTCC
TGACGCAGTGTGGAAAAGATGCAATTTCGAACCAGGACAAACTACTGTCGATCAT
GACGTTCCGTATTCCTTACTTCGTCGGACCCTTGCGAAAAGATAATTCGGAACAT
GCATGGCTCGAACGAAAGGCCGGTAAGATTTATCCGTGGAACTTTAACGACAAAG
TGGACTTGGATAAATCAGAAGAAGCGTTCATTCGCCGAATGACCAATACCTGTAC
CTATTATCCCGGCGAAGATGTTTTACCGTTGGATTCGCTGATCTATGAGAAATTT
ATGATTTTAAATGAAATCAATAATATTCGTATTGACGGCTACCCGATTAGTGTTG
ACGTTAAACAGCAGGTTTTTGGCTTGTTCGAAAAAAAACGACGCGTAACCGTGAA
AGATATTCAGAACCTGCTGCTGTCTCTCGGAGCTCTGGACAAACACGGGAAGCTG
ACAGGCATCGATACCACTATCCACTCAAACTATAATACGTATCACCATTTTAAAT
CTCTCATGGAACGCGGCGTCCTGACCCGGGATGACGTGGAACGCATCGTTGAAAG
GATGACCTACAGCGACGATACTAAGCGTGTGCGTCTGTGGCTGAATAACAACTAT
GGTACTTTAACCGCCGACGATGTGAAACACATTTCGCGTCTGCGCAAACACGATT
TTGGCCGTTTATCCAAAATGTTCTTAACAGGTCTGAAGGGTGTCCATAAGGAGAC
CGGTGAACGTGCCTCCATACTGGATTTCATGTGGAACACGAACGATAACCTGATG
CAGCTCCTTTCCGAATGCTACACGTTCAGTGATGAAATCACAAAGCTGCAAGAGG
CGTATTATGCAAAAGCCCAGTTGTCTTTAAACGATTTTTTAGACTCGATGTACAT
CTCTAACGCGGTGAAACGTCCGATTTACAGAACTCTGGCAGTGGTGAACGATATT
CGAAAAGCATGTGGGACGGCCCCTAAACGCATTTTCATCGAAATGGCTCGTGATG
GTGAATCAAAAAAAAAGAGAAGTGTTACACGTCGCGAGCAGATCAAAAACCTGTA
CCGCTCGATTCGTAAAGATTTCCAGCAGGAAGTTGATTTTCTGGAAAAGATCCTG
GAAAATAAATCTGATGGTCAACTTCAGTCAGATGCTTTGTATCTTTACTTTGCAC
AATTAGGGCGCGATATGTACACGGGCGATCCAATAAAGCTGGAGCACATCAAAGA
TCAGAGTTTCTATAACATAGACCATATTTACCCGCAGTCTATGGTGAAAGACGAT
TCCCTAGATAACAAAGTGCTGGTGCAAAGCGAAATTAACGGCGAGAAAAGCTCGC
GATACCCTTTGGACGCCGCGATCCGCAATAAAATGAAGCCCCTTTGGGACGCTTA
CTATAATCATGGCCTGATCTCCTTAAAGAAATACCAGCGTCTAACGCGCTCGACC
CCGTTTACCGATGATGAAAAATGGGACTTTATTAATCGCCAGTTAGTGGAAACCC
GTCAATCTACCAAAGCGCTGGCCATTTTGTTGAAGCGTAAGTTTCCAGACACCGA
AATTGTGTATTCGAAGGCGGGGTTATCGTCCGACTTCAGACATGAATTCGGCCTT
GTAAAAAGTCGCAATATTAATGATTTGCACCACGCTAAAGACGCATTCTTGGCTA
TCGTTACCGGCAATGTGTACCATGAAAGATTCAATCGCAGATGGTTTATGGTGAA
CCAGCCGTACTCAGTTAAAACTAAAACTCTTTTTACCCACAGCATAAAGAATGGC
AACTTCGTTGCCTGGAACGGCGAAGAAGATCTCGGTCGTATTGTAAAAATGCTGA
AGCAAAACAAAAATACCATTCACTTCACGCGCTTCTCCTTCGATCGCAAAGAAGG
ATTATTTGATATCCAACCTCTGAAAGCCAGCACCGGCTTAGTCCCACGAAAAGCC
GGTCTGGATGTCGTTAAATACGGCGGATATGACAAATCTACCGCGGCCTATTACC
TGCTGGTGAGGTTCACGCTCGAGGACAAGAAAACCCAGCACAAGCTGATGATGAT
TCCTGTAGAAGGCCTGTACAAGGCTCGCATTGATCATGACAAGGAATTTCTTACC
GATTATGCGCAAACGACTATAAGCGAAATCCTACAGAAAGATAAACAGAAAGTGA
TCAATATTATGTTTCCAATGGGTACGAGGCATATAAAACTCAATTCAATGATTAG
TATCGATGGCTTCTATCTTAGTATCGGCGGAAAGTCCTCTAAAGGTAAGTCAGTT
CTATGTCACGCAATGGTTCCACTGATCGTCCCTCACAAAATCGAATGTTACATTA
AAGCAATGGAAAGCTTCGCCCGGAAGTTTAAAGAAAACAACAAGCTGCGCATCGT
AGAAAAATTCGATAAAATCACCGTTGAAGACAACCTGAATCTCTACGAGCTCTTT
CTCCAAAAACTGCAGCATAATCCCTATAATAAGTTTTTTTCGACACAGTTTGACG
TACTGACGAACGGCCGTTCTACTTTCACAAAACTGTCGCCGGAGGAACAGGTACA
GACGCTCTTGAACATTTTAAGTATCTTTAAAACATGCCGCAGTTCGGGTTGCGAC
CTGAAATCCATCAACGGCAGTGCCCAGGCAGCGCGCATCATGATTAGCGCTGACT
TAACTGGACTGTCGAAAAAATATTCAGATATTAGGTTGGTTGAACAGTCAGCTTC
TGGTTTGTTCGTATCCAAAAGTCAGAACTTACTGGAGTATCTCTAAGAAATCATC
CTTAGCGAAAGCTAAGGATTTTTTTTATCTGAAATTTATTATATCGCGTTGATTA
TTGATGCTGTTTTTAGTTTTAACGGCAATTAATATATGTGTTATTAATTGAATGA
ATTTTATCATTCATAATAAGTATGTGTAGGATCAAGCTCAGGTTAAATATTCACT
CAGGAAGTTATTACTCAGGAAGCAAAGAGGATTACAGAATTATCTCATAACAAGT
GTTAAGGGATGTTATTTCC
SEQ AAAATTCcatGCAAAATGCTCCGGTTTCATGTCATCAAAATGATGACGTAATTAA
ID GCATTGATAATTGAGATCCCTCTCCCTGACAGGATGATTACATAAATAATAGTGA
NO: CAAAAATAAATTATTTATTTATCCAGAAAATGAATTGGAAAATCAGGAGAGCGTT
62 TTCAATCCTACCTCTGGCGCAGTTGATATGTcaaaCAGGTtgccgtcactgcgtc
ttttactggctcttctcgctaaccaaaccggtaaccccgcttattaaaagcattc
tgtaacaaagcgggaccaaagccatgacaaaaacgcgtaacaaaagtgtctataa
tcacggcagaaaagtccacattgattatttgcacggcgtcacactttgctatgcc
atagcatttttatccataagattagcggatcctacctgacgctttttatcgcaac
tctctactgtttctccatacccgtttttttgggctagcaccgcctatctcgtgtg
agataggcggagatacgaactttaagAAGGAGatataccATGTCATCGCTCACGA
AATTCACTAACAAATACTCTAAACAGCTCACCATTAAGAATGAACTCATCCCAGT
TGGCAAAACACTGGAGAACATCAAAGAGAATGGTCTGATAGATGGCGACGAACAG
CTGAATGAGAATTATCAGAAGGCGAAAATTATTGTGGATGATTTTCTGCGGGACT
TCATTAATAAAGCACTGAATAATACGCAGATCGGGAACTGGCGCGAACTGGCGGA
TGCCCTTAATAAAGAGGATGAAGATAACATCGAGAAATTGCAGGATAAAATTCGG
GGAATCATTGTATCCAAATTTGAAACGTTTGATCTGTTTAGCAGCTATTCTATTA
AGAAAGATGAAAAGATTATTGACGACGACAATGATGTTGAAGAAGAGGAACTGGA
TCTGGGCAAGAAGACCAGCTCATTTAAATACATATTTAAAAAAAACCTGTTTAAG
TTAGTGTTGCCATCCTACCTGAAAACCACAAACCAGGACAAGCTGAAGATTATTA
GCTCGTTTGATAATTTTTCAACGTACTTCCGCGGGTTCTTTGAAAACCGGAAAAA
CATTTTTACCAAGAAACCGATCTCCACAAGTATTGCGTATCGCATTGTTCATGAT
AACTTCCCGAAATTCCTTGATAACATTCGTTGTTTTAATGTGTGGCAGACGGAAT
GCCCGCAACTAATCGTGAAAGCAGATAACTATCTGAAAAGCAAAAATGTTATAGC
GAAAGATAAAAGTTTGGCAAACTATTTTACCGTGGGCGCGTATGACTATTTCCTG
TCTCAGAATGGTATAGATTTTTACAACAATATTATAGGTGGACTGCCAGCGTTCG
CCGGCCATGAGAAAATCCAAGGTCTCAATGAATTCATCAATCAAGAGTGCCAAAA
AGACAGCGAGCTGAAAAGTAAGCTGAAAAACCGTCACGCGTTCAAAATGGCGGTA
CTGTTCAAACAGATACTCAGCGATCGTGAAAAAAGTTTTGTAATTGATGAGTTCG
AGTCGGATGCTCAAGTTATTGACGCCGTTAAAAACTTTTACGCCGAACAGTGCAA
AGATAACAATGTTATTTTTAACTTATTAAATCTTATCAAGAATATCGCTTTCTTA
AGTGATGACGAACTGGACGGCATATTCATTGAAGGGAAATACCTGTCGAGCGTTA
GTCAAAAACTCTATAGCGATTGGTCAAAATTACGTAACGACATTGAGGATTCGGC
TAACTCTAAACAAGGCAATAAAGAGCTGGCCAAGAAGATCAAAACCAACAAAGGG
GATGTAGAAAAAGCGATCTCGAAATATGAGTTCTCGCTGTCGGAACTGAACTCGA
TTGTACATGATAACACCAAGTTTTCTGACCTCCTTAGTTGTACACTGCATAAGGT
GGCTTCTGAGAAACTGGTGAAGGTCAATGAAGGCGACTGGCCGAAACATCTCAAG
AATAATGAAGAGAAACAAAAAATCAAAGAGCCGCTTGATGCTCTGCTGGAGATCT
ATAATACACTTCTGATTTTTAACTGCAAAAGCTTCAATAAAAACGGCAACTTCTA
TGTCGACTATGATCGTTGCATCAATGAACTGAGTTCGGTCGTGTATCTGTATAAT
AAAACACGTAACTATTGCACTAAAAAACCCTATAACACGGACAAGTTCAAACTCA
ATTTTAACAGTCCGCAGCTCGGTGAAGGCTTTTCCAAGTCGAAAGAAAATGACTG
TCTGACTCTTTTGTTTAAAAAAGACGACAACTATTATGTAGGCATTATCCGCAAA
GGTGCAAAAATCAATTTTGATGATACACAAGCAATCGCCGATAACACCGACAATT
GCATCTTTAAAATGAATTATTTCCTACTTAAAGACGCAAAAAAATTTATCCCGAA
ATGTAGCATTCAGCTGAAAGAAGTCAAGGCCCATTTTAAGAAATCTGAAGATGAT
TACATTTTGTCTGATAAAGAGAAATTTGCTAGCCCGCTGGTCATTAAAAAGAGCA
CATTTTTGCTGGCAACTGCACATGTGAAAGGGAAAAAAGGCAATATCAAGAAATT
TCAGAAAGAATATTCGAAAGAAAACCCCACTGAGTATCGCAATTCTTTAAACGAA
TGGATTGCTTTTTGTAAAGAGTTCTTAAAAACTTATAAAGCGGCTACCATTTTTG
ATATAACCACATTGAAAAAGGCAGAGGAATATGCTGATATTGTAGAATTCTACAA
GGATGTCGATAATCTGTGCTACAAACTGGAGTTCTGCCCGATTAAAACCTCGTTT
ATAGAAAACCTGATAGATAACGGCGACCTGTATCTGTTTCGCATCAATAACAAAG
ACTTCAGCAGTAAATCGACCGGCACCAAGAACCTTCATACGTTATATTTACAAGC
TATATTCGATGAACGTAATCTGAACAATCCGACAATTATGCTGAATGGGGGAGCA
GAACTGTTCTATCGTAAAGAAAGTATTGAGCAGAAAAACCGTATCACACACAAAG
CCGGTTCAATTCTCGTGAATAAGGTGTGTAAAGACGGTACAAGCCTGGATGATAA
GATACGTAATGAAATTTATCAATATGAGAATAAATTTATTGATACCCTGTCTGAT
GAAGCTAAAAAGGTGTTACCGAATGTCATTAAAAAGGAAGCTACCCATGACATTA
CAAAAGATAAACGTTTCACTAGTGACAAATTCTTCTTTCACTGCCCCCTGACAAT
TAATTATAAGGAAGGCGATACCAAGCAGTTCAATAACGAAGTGCTGAGTTTTCTG
CGTGGAAATCCTGACATCAACATTATCGGCATTGACCGCGGAGAGCGTAATTTAA
TCTATGTAACGGTTATAAACCAGAAAGGCGAGATTCTGGATTCGGTTTCATTCAA
TACCGTGACCAACAAGAGTTCAAAAATCGAGCAGACAGTCGATTATGAAGAGAAA
TTGGCAGTCCGCGAGAAAGAGAGGATTGAAGCAAAACGTTCCTGGGACTCTATCT
CAAAAATTGCGACACTAAAGGAAGGTTATCTGAGCGCAATAGTTCACGAGATCTG
TCTGTTAATGATTAAACACAACGCGATCGTTGTCTTAGAGAATCTTAATGCAGGC
TTTAAGCGTATTCGTGGCGGTTTATCAGAAAAAAGTGTTTATCAAAAATTCGAAA
AAATGTTGATTAACAAACTGAACTATTTTGTCAGCAAGAAGGAATCCGACTGGAA
TAAACCGTCTGGTCTGCTGAATGGACTGCAGCTTTCGGATCAGTTTGAAAGCTTC
GAAAAACTGGGTATTCAGTCTGGTTTTATTTTTTACGTGCCGGCTGCATATACCT
CAAAGATTGATCCGACCACGGGCTTCGCCAATGTTCTGAATCTGTCGAAGGTACG
CAATGTTGATGCGATCAAAAGCTTTTTTTCTAACTTCAACGAAATTAGTTATAGC
AAGAAAGAAGCCCTTTTCAAATTCTCATTCGATCTGGATTCACTGAGTAAGAAAG
GCTTTAGTAGCTTTGTGAAATTTAGTAAGAGTAAATGGAACGTCTACACCTTTGG
AGAACGTATCATAAAGCCAAAGAATAAGCAAGGTTATCGGGAGGACAAAAGAATC
AACTTGACCTTCGAGATGAAGAAGTTACTTAACGAGTATAAGGTTTCTTTTGATC
TTGAAAATAACTTGATTCCGAATCTCACGAGTGCCAACCTGAAGGATACTTTTTG
GAAAGAGCTATTCTTTATCTTCAAGACTACGCTGCAGCTCCGTAACAGCGTTACT
AACGGTAAAGAAGATGTGCTCATCTCTCCGGTCAAAAATGCGAAGGGTGAATTCT
TCGTTTCGGGAACGCATAACAAGACTCTTCCGCAAGATTGCGATGCGAACGGTGC
ATACCATATTGCGTTGAAAGGTCTGATGATACTCGAACGTAACAACCTTGTACGT
GAGGAGAAAGATACGAAAAAGATTATGGCGATTTCAAACGTGGATTGGTTCGAGT
ACGTGCAGAAACGTAGAGGCGTTCTGTAAGAAATCATCCTTAGCGAAAGCTAAGG
ATTTTTTTTATCTGAAATTTATTATATCGCGTTGATTATTGATGCTGTTTTTAGT
TTTAACGGCAATTAATATATGTGTTATTAATTGAATGAATTTTATCATTCATAAT
AAGTATGTGTAGGATCAAGCTCAGGTTAAATATTCACTCAGGAAGTTATTACTCA
GGAAGCAAAGAGGATTACAGAATTATCTCATAACAAGTGTTAAGGGATGTTATTT
CC
SEQ AATCAGGAGAGCGTTTTCAATCCTACCTCTGGCGCAGTTGATATGTCAAACAGGT
ID TGCCGTCACTGCGTCTTTTACTGGCTCTTCTCGCTAACCAAACCGGTAACCCCGC
NO: TTATTAAAAGCATTCTGTAACAAAGCGGGACCAAAGCCATGACAAAAACGCGTAA
63 CAAAAGTGTCTATAATCACGGCAGAAAAGTCCACATTGATTATTTGCACGGCGTC
ACACTTTGCTATGCCATAGCATTTTTATCCATAAGATTAGCGGATCCTACCTGAC
GCTTTTTATCGCAACTCTCTACTGTTTCTCCATACCCGTTTTTTTGGGCTAGCAG
TAATACGACTCACTATAGGGGTCTCATCTCGTGTGAGATAGGCGGAGATACGAAC
TTTAAGAGGAGGATATACCATGCACCATCATCATCACCATAACAACTACGACGAA
TTCACCAAACTGTACCCGATCCAGAAAACCATCCGTTTCGAACTGAAACCGCAGG
GTCGTACCATGGAACACCTGGAAACCTTCAACTTCTTCGAAGAAGACCGTGACCG
TGCGGAAAAATACAAAATCCTGAAAGAAGCGATCGACGAATACCACAAAAAATTC
ATCGACGAACACCTGACCAACATGTCTCTGGACTGGAACTCTCTGAAACAGATCT
CTGAAAAATACTACAAATCTCGTGAAGAAAAAGACAAAAAAGTTTTCCTGTCTGA
ACAGAAACGTATGCGTCAGGAAATCGTTTCTGAATTCAAAAAAGACGACCGTTTC
AAAGACCTGTTCTCTAAAAAACTGTTCTCTGAACTGCTGAAAGAAGAAATCTACA
AAAAAGGTAACCACCAGGAAATCGACGCGCTGAAATCTTTCGACAAATTCTCTGG
TTACTTCATCGGTCTGCACGAAAACCGTAAAAACATGTACTCTGACGGTGACGAA
ATCACCGCGATCTCTAACCGTATCGTTAACGAAAACTTCCCGAAATTCCTGGACA
ACCTGCAGAAATACCAGGAAGCGCGTAAAAAATACCCGGAATGGATCATCAAAGC
GGAATCTGCGCTGGTTGCGCACAACATCAAAATGGACGAAGTTTTCTCTCTGGAA
TACTTCAACAAAGTTCTGAACCAGGAAGGTATCCAGCGTTACAACCTGGCGCTGG
GTGGTTACGTTACCAAATCTGGTGAAAAAATGATGGGTCTGAACGACGCGCTGAA
CCTGGCGCACCAGTCTGAAAAATCTTCTAAAGGTCGTATCCACATGACCCCGCTG
TTCAAACAGATCCTGTCTGAAAAAGAATCTTTCTCTTACATCCCGGACGTTTTCA
CCGAAGACTCTCAGCTGCTGCCGTCTATCGGTGGTTTCTTCGCGCAGATCGAAAA
CGACAAAGACGGTAACATCTTCGACCGTGCGCTGGAACTGATCTCTTCTTACGCG
GAATACGACACCGAACGTATCTACATCCGTCAGGCGGACATCAACCGTGTTTCTA
ACGTTATCTTCGGTGAATGGGGTACCCTGGGTGGTCTGATGCGTGAATACAAAGC
GGACTCTATCAACGACATCAACCTGGAACGTACCTGCAAAAAAGTTGACAAATGG
CTGGACTCTAAAGAATTCGCGCTGTCTGACGTTCTGGAAGCGATCAAACGTACCG
GTAACAACGACGCGTTCAACGAATACATCTCTAAAATGCGTACCGCGCGTGAAAA
AATCGACGCGGCGCGTAAAGAAATGAAATTCATCTCTGAAAAAATCTCTGGTGAC
GAAGAATCTATCCACATCATCAAAACCCTGCTGGACTCTGTTCAGCAGTTCCTGC
ACTTCTTCAACCTGTTCAAAGCGCGTCAGGACATCCCGCTGGACGGTGCGTTCTA
CGCGGAATTCGACGAAGTTCACTCTAAACTGTTCGCGATCGTTCCGCTGTACAAC
AAAGTTCGTAACTACCTGACCAAAAACAACCTGAACACCAAAAAAATCAAACTGA
ACTTCAAAAACCCGACCCTGGCGAACGGTTGGGACCAGAACAAAGTTTACGACTA
CGCGTCTCTGATCTTCCTGCGTGACGGTAACTACTACCTGGGTATCATCAACCCG
AAACGTAAAAAAAACATCAAATTCGAACAGGGTTCTGGTAACGGTCCGTTCTACC
GTAAAATGGTTTACAAACAGATCCCGGGTCCGAACAAAAACCTGCCGCGTGTTTT
CCTGACCTCTACCAAAGGTAAAAAAGAATACAAACCGTCTAAAGAAATCATCGAA
GGTTACGAAGCGGACAAACACATCCGTGGTGACAAATTCGACCTGGACTTCTGCC
ACAAACTGATCGACTTCTTCAAAGAATCTATCGAAAAACACAAAGACTGGTCTAA
ATTCAACTTCTACTTCTCTCCGACCGAATCTTACGGTGACATCTCTGAATTCTAC
CTGGACGTTGAAAAACAGGGTTACCGTATGCACTTCGAAAACATCTCTGCGGAAA
CCATCGACGAATACGTTGAAAAAGGTGACCTGTTCCTGTTCCAGATCTACAACAA
AGACTTCGTTAAAGCGGCGACCGGTAAAAAAGACATGCACACCATCTACTGGAAC
GCGGCGTTCTCTCCGGAAAACCTGCAGGACGTTGTTGTTAAACTGAACGGTGAAG
CGGAACTGTTCTACCGTGACAAATCTGACATCAAAGAAATCGTTCACCGTGAAGG
TGAAATCCTGGTTAACCGTACCTACAACGGTCGTACCCCGGTTCCGGACAAAATC
CACAAAAAACTGACCGACTACCACAACGGTCGTACCAAAGACCTGGGTGAAGCGA
AAGAATACCTGGACAAAGTTCGTTACTTCAAAGCGCACTACGACATCACCAAAGA
CCGTCGTTACCTGAACGACAAAATCTACTTCCACGTTCCGCTGACCCTGAACTTC
AAAGCGAACGGTAAAAAAAACCTGAACAAAATGGTTATCGAAAAATTCCTGTCTG
ACGAAAAAGCGCACATCATCGGTATCGACCGTGGTGAACGTAACCTGCTGTACTA
CTCTATCATCGACCGTTCTGGTAAAATCATCGACCAGCAGTCTCTGAACGTTATC
GACGGTTTCGACTACCGTGAAAAACTGAACCAGCGTGAAATCGAAATGAAAGACG
CGCGTCAGTCTTGGAACGCGATCGGTAAAATCAAAGACCTGAAAGAAGGTTACCT
GTCTAAAGCGGTTCACGAAATCACCAAAATGGCGATCCAGTACAACGCGATCGTT
GTTATGGAAGAACTGAACTACGGTTTCAAACGTGGTCGTTTCAAAGTTGAAAAAC
AGATCTACCAGAAATTCGAAAACATGCTGATCGACAAAATGAACTACCTGGTTTT
CAAAGACGCGCCGGACGAATCTCCGGGTGGTGTTCTGAACGCGTACCAGCTGACC
AACCCGCTGGAATCTTTCGCGAAACTGGGTAAACAGACCGGTATCCTGTTCTACG
TTCCGGCGGCGTACACCTCTAAAATCGACCCGACCACCGGTTTCGTTAACCTGTT
CAACACCTCTTCTAAAACCAACGCGCAGGAACGTAAAGAATTCCTGCAGAAATTC
GAATCTATCTCTTACTCTGCGAAAGACGGTGGTATCTTCGCGTTCGCGTTCGACT
ACCGTAAATTCGGTACCTCTAAAACCGACCACAAAAACGTTTGGACCGCGTACAC
CAACGGTGAACGTATGCGTTACATCAAAGAAAAAAAACGTAACGAACTGTTCGAC
CCGTCTAAAGAAATCAAAGAAGCGCTGACCTCTTCTGGTATCAAATACGACGGTG
GTCAGAACATCCTGCCGGACATCCTGCGTTCTAACAACAACGGTCTGATCTACAC
CATGTACTCTTCTTTCATCGCGGCGATCCAGATGCGTGTTTACGACGGTAAAGAA
GACTACATCATCTCTCCGATCAAAAACTCTAAAGGTGAATTCTTCCGTACCGACC
CGAAACGTCGTGAACTGCCGATCGACGCGGACGCGAACGGTGCGTACAACATCGC
GCTGCGTGGTGAACTGACCATGCGTGCGATCGCGGAAAAATTCGACCCGGACTCT
GAAAAAATGGCGAAACTGGAACTGAAACACAAAGACTGGTTCGAATTCATGCAGA
CCCGTGGTGACTAAGAAATCATCCTTAGCGAAAGCTAAGGATTTTTTTTATCTGA
AATGTAGGGAGACCCTCAGGTTAAATATTCACTCAGGAAGTTATTACTCAGGAAG
CAAAGAGGATTACA
SEQ AAAATTCcatGCAAAATGCTCCGGTTTCATGTCATCAAAATGATGACGTAATTAA
ID GCATTGATAATTGAGATCCCTCTCCCTGACAGGATGATTACATAAATAATAGTGA
NO: CAAAAATAAATTATTTATTTATCCAGAAAATGAATTGGAAAATCAGGAGAGCGTT
64 TTCAATCCTACCTCTGGCGCAGTTGATATGTcaaaCAGGTtgccgtcactgcgtc
ttttactggctcttctcgctaaccaaaccggtaaccccgcttattaaaagcattc
tgtaacaaagcgggaccaaagccatgacaaaaacgcgtaacaaaagtgtctataa
tcacggcagaaaagtccacattgattatttgcacggcgtcacactttgctatgcc
atagcatttttatccataagattagcggatcctacctgacgctttttatcgcaac
tctctactgtttctccatacccgtttttttgggctagcaccgcctatctcgtgtg
agataggcggagatacgaactttaagAAGGAGatataccATGACTAAAACATTTG
ATTCAGAGTTTTTTAATTTGTACTCGCTGCAAAAAACGGTACGCTTTGAGTTAAA
ACCCGTGGGAGAAACCGCGTCATTTGTGGAAGACTTTAAAAACGAGGGCTTGAAA
CGTGTTGTGAGCGAAGATGAAAGGCGAGCCGTCGATTACCAGAAAGTTAAGGAAA
TAATTGACGATTACCATCGGGATTTCATTGAAGAAAGTTTAAATTATTTTCCGGA
ACAGGTGAGTAAAGATGCTCTTGAGCAGGCGTTTCATCTTTATCAGAAACTGAAG
GCAGCAAAAGTTGAGGAAAGGGAAAAAGCGCTGAAAGAATGGGAAGCGCTGCAGA
AAAAGCTACGTGAAAAAGTGGTGAAATGCTTCTCGGACTCGAATAAAGCCCGCTT
CTCAAGGATTGATAAAAAGGAACTGATTAAGGAAGACCTGATAAATTGGTTGGTC
GCCCAGAATCGCGAGGATGATATCCCTACGGTCGAAACGTTTAACAACTTCACCA
CATATTTTACCGGCTTCCATGAGAATCGTAAAAATATTTACTCCAAAGATGATCA
CGCCACCGCTATTAGCTTTCGCCTTATTCATGAAAATCTTCCAAAGTTTTTTGAC
AACGTGATTAGCTTCAATAAGTTGAAAGAGGGTTTCCCTGAATTAAAATTTGATA
AAGTGAAAGAGGATTTAGAAGTAGATTATGATCTGAAGCATGCGTTTGAAATAGA
ATATTTCGTTAACTTCGTGACCCAAGCGGGCATAGATCAGTATAATTATCTGTTA
GGAGGGAAAACCCTGGAGGACGGGACGAAAAAACAAGGGATGAATGAGCAAATTA
ATCTGTTCAAACAACAGCAAACGCGAGATAAAGCGCGTCAGATTCCCAAACTGAT
CCCCCTGTTCAAACAGATTCTTAGCGAAAGGACTGAAAGCCAGTCCTTTATTCCT
AAACAATTTGAAAGTGATCAGGAGTTGTTCGATTCACTGCAGAAGTTACATAATA
ACTGCCAGGATAAATTCACCGTGCTGCAACAAGCCATTCTCGGTCTGGCAGAGGC
GGATCTTAAGAAGGTCTTCATCAAAACCTCTGATTTAAATGCCTTATCTAACACC
ATTTTCGGGAATTACAGCGTCTTTTCCGATGCACTGAACCTGTATAAAGAAAGCC
TGAAAACGAAAAAAGCGCAGGAGGCTTTTGAGAAACTACCGGCCCATTCTATTCA
CGACCTCATTCAATACTTGGAACAGTTCAATTCCAGCCTGGACGCGGAAAAACAA
CAGAGCACCGACACCGTCCTGAACTACTTCATCAAGACCGATGAATTATATTCTC
GCTTCATTAAATCCACTAGCGAGGCTTTCACTCAGGTGCAGCCTTTGTTCGAACT
GGAAGCCCTGTCATCTAAGCGCCGCCCACCGGAATCGGAAGATGAAGGGGCAAAA
GGGCAGGAAGGCTTCGAGCAGATCAAGCGTATTAAAGCTTACCTGGATACGCTTA
TGGAAGCGGTACACTTTGCAAAGCCGTTGTATCTTGTTAAGGGTCGTAAAATGAT
CGAAGGGCTCGATAAAGACCAGTCCTTTTATGAAGCGTTTGAAATGGCGTACCAA
GAACTTGAATCGTTAATCATTCCTATCTATAACAAAGCGCGGAGCTATCTGTCGC
GGAAACCTTTCAAGGCCGATAAATTCAAGATTAATTTTGACAACAACACGCTACT
GAGCGGATGGGATGCGAACAAGGAAACTGCTAACGCGTCCATTCTGTTTAAGAAA
GACGGGTTATATTACCTTGGAATTATGCCGAAAGGTAAGACCTTTCTCTTTGACT
ACTTTGTATCGAGCGAGGATTCAGAGAAACTGAAACAGCGTCGCCAGAAGACCGC
CGAAGAAGCTCTGGCGCAGGATGGTGAAAGTTACTTCGAAAAAATTCGTTATAAA
CTGTTACCAGGGGCTTCAAAGATGTTACCGAAAGTCTTTTTTAGCAACAAAAATA
TTGGCTTTTACAACCCGTCGGATGACATTTTACGCATTCGCAACACAGCCTCTCA
CACCAAAAACGGGACCCCTCAGAAAGGCCACTCAAAAGTTGAGTTTAACCTGAAT
GATTGTCATAAGATGATTGATTTCTTCAAATCATCAATTCAGAAACACCCGGAAT
GGGGGTCTTTTGGCTTTACGTTTTCTGATACCAGTGATTTTGAAGACATGAGTGC
CTTCTACCGGGAAGTAGAAAACCAGGGTTACGTAATTAGCTTTGACAAAATCAAA
GAGACCTATATACAGAGCCAGGTGGAACAGGGTAATCTCTACTTATTCCAGATTT
ATAACAAGGATTTCTCGCCCTACAGCAAAGGCAAACCAAACCTGCATACTCTGTA
CTGGAAAGCCCTGTTTGAAGAAGCGAACCTGAATAACGTAGTGGCGAAGTTGAAC
GGTGAAGCGGAAATCTTCTTCCGTCGTCACTCCATTAAGGCCTCTGATAAAGTTG
TCCATCCGGCAAATCAGGCCATTGATAATAAGAATCCACACACGGAAAAAACGCA
GTCAACCTTTGAATATGACCTCGTTAAAGACAAACGCTACACGCAAGATAAGTTC
TTTTTCCACGTCCCAATCAGCCTCAACTTTAAAGCACAAGGGGTTTCAAAGTTTA
ATGATAAAGTCAATGGGTTCCTCAAGGGCAACCCGGATGTCAACATTATAGGTAT
AGACAGGGGCGAACGCCATCTGCTTTACTTTACCGTAGTGAATCAGAAAGGTGAA
ATACTGGTTCAGGAATCATTAAATACCTTGATGTCGGACAAAGGGCACGTTAATG
ATTACCAGCAGAAACTGGATAAAAAAGAACAGGAACGTGATGCTGCGCGTAAATC
GTGGACCACGGTTGAGAACATTAAAGAGCTGAAAGAGGGGTATCTAAGCCATGTG
GTACACAAACTGGCGCACCTCATCATTAAATATAACGCAATAGTCTGCCTAGAAG
ACTTGAATTTTGGCTTTAAACGCGGCCGCTTCAAAGTGGAAAAACAAGTTTATCA
AAAATTTGAAAAGGCGCTTATAGATAAACTGAATTATCTGGTTTTTAAAGAAAAG
GAACTTGGTGAGGTAGGGCACTACTTGACAGCTTATCAACTGACGGCCCCGTTCG
AATCATTCAAAAAACTGGGCAAACAGTCTGGCATTCTGTTTTACGTGCCGGCAGA
TTATACTTCAAAAATCGATCCAACAACTGGCTTTGTGAACTTCCTGGACCTGAGA
TATCAGTCTGTAGAAAAAGCTAAACAACTTCTTAGCGATTTTAATGCCATTCGTT
TTAACAGCGTTCAGAATTACTTTGAATTCGAAATTGACTATAAAAAACTTACTCC
GAAACGTAAAGTCGGAACCCAAAGTAAATGGGTAATTTGTACGTATGGCGATGTC
AGGTATCAGAACCGTCGGAATCAAAAAGGTCATTGGGAGACCGAAGAAGTGAACG
TGACCGAAAAGCTGAAGGCTCTGTTCGCCAGCGATTCAAAAACTACAACTGTGAT
CGATTACGCAAATGATGATAACCTGATAGATGTGATTTTAGAGCAGGATAAAGCC
AGCTTTTTTAAAGAACTGTTGTGGCTCCTGAAACTTACGATGACCTTACGACATT
CCAAGATCAAATCGGAAGATGATTTTATTCTGTCACCGGTCAAGAATGAGCAGGG
TGAATTCTATGATAGTAGGAAAGCCGGCGAAGTGTGGCCGAAAGACGCCGACGCC
AATGGCGCCTATCATATCGCGCTCAAAGGGCTTTGGAATTTGCAGCAGATTAACC
AGTGGGAAAAAGGTAAAACCCTGAATCTGGCTATCAAAAACCAGGATTGGTTTAG
CTTTATCCAAGAGAAACCGTATCAGGAATGAGAAATCATCCTTAGCGAAAGCTAA
GGATTTTTTTTATCTGAAATTTATTATATCGCGTTGATTATTGATGCTGTTTTTA
GTTTTAACGGCAATTAATATATGTGTTATTAATTGAATGAATTTTATCATTCATA
ATAAGTATGTGTAGGATCAAGCTCAGGTTAAATATTCACTCAGGAAGTTATTACT
CAGGAAGCAAAGAGGATTACAGAATTATCTCATAACAAGTGTTAAGGGATGTTAT
TTCC
SEQ AATCAGGAGAGCGTTTTCAATCCTACCTCTGGCGCAGTTGATATGTCAAACAGGT
ID TGCCGTCACTGCGTCTTTTACTGGCTCTTCTCGCTAACCAAACCGGTAACCCCGC
NO: TTATTAAAAGCATTCTGTAACAAAGCGGGACCAAAGCCATGACAAAAACGCGTAA
65 CAAAAGTGTCTATAATCACGGCAGAAAAGTCCACATTGATTATTTGCACGGCGTC
ACACTTTGCTATGCCATAGCATTTTTATCCATAAGATTAGCGGATCCTACCTGAC
GCTTTTTATCGCAACTCTCTACTGTTTCTCCATACCCGTTTTTTTGGGCTAGCAG
TAATACGACTCACTATAGGGGTCTCATCTCGTGTGAGATAGGCGGAGATACGAAC
TTTAAGAGGAGGATATACCATGCACCATCATCATCACCATCATACAGGCGGTCTT
CTTAGTATGGACGCGAAAGAGTTCACAGGTCAGTATCCGTTGTCGAAAACATTAC
GATTCGAACTTCGGCCCATCGGCCGCACGTGGGATAACCTGGAGGCCTCAGGCTA
CTTAGCGGAAGACCGCCATCGTGCCGAATGTTATCCTCGTGCGAAAGAGTTATTG
GATGACAACCATCGTGCCTTCCTGAATCGTGTGTTGCCACAAATCGATATGGATT
GGCACCCGATTGCGGAGGCCTTTTGTAAGGTACATAAAAACCCTGGTAATAAAGA
ACTTGCCCAGGATTACAACCTTCAGTTGTCAAAGCGCCGTAAGGAGATCAGCGCA
TATCTTCAGGATGCAGATGGCTATAAAGGCCTGTTCGCGAAGCCCGCCTTAGACG
AAGCTATGAAAATTGCGAAAGAAAACGGGAACGAAAGTGATATTGAGGTTCTCGA
AGCGTTTAACGGTTTTAGCGTATACTTCACCGGTTATCATGAGTCACGCGAGAAC
ATTTATAGCGATGAGGATATGGTGAGCGTAGCCTACCGAATTACTGAGGATAATT
TCCCGCGCTTTGTCTCAAACGCTTTGATCTTTGATAAATTAAACGAAAGCCATCC
GGATATTATCTCTGAAGTATCGGGCAATCTTGGAGTTGATGACATTGGTAAGTAC
TTTGACGTGTCGAACTATAACAATTTTCTTTCCCAGGCCGGTATAGATGACTACA
ATCACATTATTGGCGGCCATACAACCGAAGACGGACTGATACAAGCGTTTAATGT
CGTATTGAACTTACGTCACCAAAAAGACCCTGGCTTTGAAAAAATTCAGTTCAAA
CAGCTCTACAAACAAATCCTGAGCGTGCGTACCAGCAAAAGCTACATCCCGAAAC
AGTTTGACAACTCTAAGGAGATGGTTGACTGCATTTGCGATTATGTCAGCAAAAT
AGAGAAATCCGAAACAGTAGAACGGGCCCTGAAACTAGTCCGTAATATCAGTTCT
TTCGACTTGCGCGGGATCTTTGTCAATAAAAAGAACTTGCGCATACTGAGCAACA
AACTGATAGGAGATTGGGACGCGATCGAAACCGCATTGATGCATAGTTCTTCATC
AGAAAACGATAAGAAAAGCGTATATGATAGCGCGGAGGCTTTTACGTTGGATGAC
ATCTTTTCAAGCGTGAAAAAATTTTCTGATGCCTCTGCCGAAGATATTGGCAACA
GGGCGGAAGACATCTGTAGAGTGATAAGTGAGACGGCCCCTTTTATCAACGATCT
GCGAGCGGTGGACCTGGATAGCCTGAACGACGATGGTTATGAAGCGGCCGTCTCA
AAAATTCGGGAGTCGCTGGAGCCTTATATGGATCTTTTCCATGAACTGGAAATTT
TCTCGGTTGGCGATGAGTTCCCAAAATGCGCAGCATTTTACAGCGAACTGGAGGA
AGTCAGCGAACAGCTGATCGAAATTATTCCGTTATTCAACAAGGCGCGTTCGTTC
TGCACCCGGAAACGCTATAGCACCGATAAGATTAAAGTGAACTTAAAATTCCCGA
CCTTGGCGGACGGGTGGGACCTGAACAAAGAGAGAGACAACAAAGCCGCGATTCT
GCGGAAAGACGGTAAGTATTATCTGGCAATTCTGGATATGAAGAAAGATCTGTCA
AGCATTAGGACCAGCGACGAAGATGAATCCAGCTTCGAAAAGATGGAGTATAAAC
TGTTACCGAGTCCAGTAAAAATGCTGCCAAAGATATTCGTAAAATCGAAAGCCGC
TAAGGAAAAATATGGCCTGACAGATCGTATGCTTGAATGCTACGATAAAGGTATG
CATAAGTCGGGTAGTGCGTTTGATCTTGGCTTTTGCCATGAACTCATTGATTATT
ACAAGCGTTGTATCGCGGAGTACCCAGGCTGGGATGTGTTCGATTTCAAGTTTCG
CGAAACTTCCGATTATGGGTCCATGAAAGAGTTCAATGAAGATGTGGCCGGAGCC
GGTTACTATATGAGTCTGAGAAAAATTCCGTGCAGCGAAGTGTACCGTCTGTTAG
ACGAGAAATCGATTTATCTATTTCAAATTTATAACAAAGATTACTCTGAAAATGC
ACATGGTAATAAGAACATGCATACCATGTACTGGGAGGGTCTCTTTTCCCCGCAA
AACCTGGAGTCGCCCGTTTTCAAGTTGTCGGGTGGGGCAGAACTTTTCTTTCGAA
AATCCTCAATCCCTAACGATGCCAAAACAGTACACCCGAAAGGCTCAGTGCTGGT
TCCACGTAATGATGTTAACGGTCGGCGTATTCCAGATTCAATCTACCGCGAACTG
ACACGCTATTTTAACCGTGGCGATTGCCGAATCAGTGACGAAGCCAAAAGTTATC
TTGACAAGGTTAAGACTAAAAAAGCGGACCATGACATTGTGAAAGATCGCCGCTT
TACCGTGGATAAAATGATGTTCCACGTCCCGATTGCGATGAACTTTAAGGCGATC
AGTAAACCGAACTTAAACAAAAAAGTCATTGATGGCATCATTGATGATCAGGATC
TGAAAATCATTGGTATTGATCGTGGCGAGCGGAACTTAATTTACGTCACGATGGT
TGACAGAAAAGGGAATATCTTATATCAGGATTCTCTTAACATCCTCAATGGCTAC
GACTATCGTAAAGCTCTGGATGTGCGCGAATATGACAACAAGGAAGCGCGTCGTA
ACTGGACTAAAGTGGAGGGCATTCGCAAAATGAAGGAAGGCTATCTGTCATTAGC
GGTCTCGAAATTAGCGGATATGATTATCGAAAATAACGCCATCATCGTTATGGAG
GACCTGAACCACGGATTCAAAGCGGGCCGCTCAAAGATTGAAAAACAAGTTTATC
AGAAATTTGAGAGTATGCTGATTAACAAACTGGGCTATATGGTGTTAAAAGACAA
GTCAATTGACCAATCAGGTGGCGCGCTGCATGGATACCAGCTGGCGAACCATGTT
ACCACCTTAGCATCAGTTGGAAAGCAGTGTGGGGTTATCTTTTATATACCGGCAG
CGTTCACTAGTAAAATAGATCCGACCACTGGTTTCGCCGATCTCTTTGCCCTGAG
TAACGTTAAAAACGTAGCGAGCATGCGTGAATTCTTTTCCAAAATGAAATCTGTC
ATTTATGATAAAGCTGAAGGCAAATTCGCATTCACCTTTGATTACTTGGATTACA
ACGTGAAGAGCGAATGTGGTCGTACGCTGTGGACCGTTTACACCGTTGGTGAGCG
CTTCACCTATTCCCGTGTGAACCGCGAATATGTACGTAAAGTCCCCACCGATATT
ATCTATGATGCCCTCCAGAAAGCAGGCATTAGCGTCGAAGGAGACTTAAGGGACA
GAATTGCCGAAAGCGATGGCGATACGCTGAAGTCTATTTTTTACGCATTCAAATA
CGCGCTAGATATGCGCGTTGAGAATCGCGAGGAAGACTACATTCAATCACCTGTG
AAAAATGCCTCTGGGGAATTTTTTTGTTCAAAAAATGCTGGTAAAAGCCTCCCAC
AAGATAGCGATGCAAACGGTGCATATAACATTGCCCTGAAAGGTATTCTTCAATT
ACGCATGCTGTCTGAGCAGTACGACCCCAACGCGGAATCTATTAGACTTCCGCTG
ATAACCAATAAAGCCTGGCTGACATTCATGCAGTCTGGCATGAAGACCTGGAAAA
ATTAGGAAATCATCCTTAGCGAAAGCTAAGGATTTTTTTTATCTGAAATGTAGGG
AGACCCTCAGGTTAAATATTCACTCAGGAAGTTATTACTCAGGAAGCAAAGAGGA
TTACA
SEQ AAAATTCcatGCAAAATGCTCCGGTTTCATGTCATCAAAATGATGACGTAATTAA
ID GCATTGATAATTGAGATCCCTCTCCCTGACAGGATGATTACATAAATAATAGTGA
NO: CAAAAATAAATTATTTATTTATCCAGAAAATGAATTGGAAAATCAGGAGAGCGTT
66 TTCAATCCTACCTCTGGCGCAGTTGATATGTcaaaCAGGTtgccgtcactgcgtc
ttttactggctcttctcgctaaccaaaccggtaaccccgcttattaaaagcattc
tgtaacaaagcgggaccaaagccatgacaaaaacgcgtaacaaaagtgtctataa
tcacggcagaaaagtccacattgattatttgcacggcgtcacactttgctatgcc
atagcatttttatccataagattagcggatcctacctgacgctttttatcgcaac
tctctactgtttctccatacccgtttttttgggctagcaccgcctatctcgtgtg
agataggcggagatacgaactttaagAAGGAGatataccatgGATAGTTTGAAAG
ATTTCACCAATCTGTACCCTGTCAGTAAGACATTGAGATTTGAATTAAAGCCCGT
TGGAAAGACTTTAGAAAATATCGAGAAAGCAGGTATTTTGAAAGAGGATGAGCAT
CGTGCAGAAAGTTATCGGAGGGTGAAGAAAATAATTGATACTTATCATAAGGTAT
TTATCGATTCTTCTCTTGAAAATATGGCTAAAATGGGTATTGAGAATGAAATAAA
AGCAATGCTCCAAAGTTTCTGCGAATTGTATAAAAAAGATCATCGCACTGAGGGT
GAAGACAAGGCATTAGATAAAATTCGAGCAGTACTTCGTGGCCTGATTGTTGGGG
CTTTCACTGGTGTTTGCGGAAGACGGGAAAATACAGTCCAAAACGAGAAGTACGA
GAGTTTGTTCAAAGAAAAGTTGATAAAAGAAATTTTACCTGATTTTGTGCTCTCT
ACTGAGGCTGAAAGCTTGCCTTTCTCTGTTGAAGAAGCTACGAGGTCACTGAAGG
AGTTTGATAGCTTTACATCCTACTTTGCTGGTTTTTACGAGAATAGAAAGAATAT
ATACTCGACGAAACCTCAATCCACTGCCATTGCTTATCGTCTTATTCATGAGAAC
TTGCCGAAGTTCATTGATAATATTCTTGTTTTTCAGAAGATCAAAGAGCCTATAG
CCAAAGAGCTGGAACATATTCGTGCGGACTTTTCTGCCGGGGGGTACATAAAAAA
GGATGAGAGATTGGAGGATATTTTTTCGTTGAACTATTATATCCACGTGTTATCT
CAGGCTGGGATCGAAAAATATAACGCATTGATTGGGAAGATTGTGACAGAAGGAG
ATGGAGAGATGAAAGGGCTCAATGAACACATCAACCTTTACAACCAACAAAGAGG
CAGAGAGGATCGGCTCCCTCTTTTTAGGCCTCTTTATAAACAGATATTGAGTGAC
AGAGAGCAATTATCATACTTGCCTGAGAGTTTTGAAAAAGATGAGGAGCTCCTCA
GGGCTCTAAAAGAGTTCTATGATCATATCGCAGAAGACATTCTCGGACGTACTCA
ACAGTTGATGACTTCTATTTCAGAATATGATTTATCTCGGATATACGTAAGGAAC
GATAGCCAATTGACTGATATATCAAAAAAAATGTTGGGAGATTGGAATGCTATCT
ACATGGCTAGAGAACGAGCATATGACCACGAGCAGGCTCCCAAAAGAATCACGGC
GAAATACGAGAGGGACAGGATTAAAGCTCTTAAAGGAGAAGAGAGTATAAGTCTG
GCAAATCTTAATAGTTGTATTGCCTTTCTGGACAATGTTAGAGATTGCCGTGTAG
ATACTTATCTTTCCACACTGGGCCAGAAGGAAGGACCACATGGTCTATCTAATCT
CGTTGAGAACGTTTTTGCCTCATACCATGAAGCAGAGCAATTGTTGAGCTTTCCA
TACCCCGAAGAGAATAATCTGATTCAGGACAAGGACAATGTGGTGTTAATTAAGA
ATCTTCTCGACAATATCAGTGATCTGCAGAGGTTCTTGAAACCTCTTTGGGGTAT
GGGAGACGAACCCGATAAAGATGAAAGATTTTATGGAGAGTATAATTATATCCGA
GGAGCTCTAGATCAGGTGATCCCTCTGTACAATAAGGTAAGGAACTACCTCACTC
GGAAGCCTTATTCGACCAGAAAAGTAAAACTCAATTTTGGGAATTCTCAATTGCT
TAGTGGTTGGGATAGAAATAAGGAAAAGGATAATAGCTGTGTGATTTTGCGTAAG
GGGCAGAACTTCTATTTGGCTATTATGAACAATAGGCACAAAAGAAGTTTCGAAA
ACAAGGTGTTGCCCGAGTATAAGGAGGGAGAACCTTACTTCGAAAAGATGGATTA
TAAATTTTTGCCTGATCCTAATAAAATGCTTCCTAAGGTTTTTCTTTCGAAAAAA
GGAATAGAGATATACAAACCAAGTCCGAAGCTTTTAGAACAATATGGACATGGAA
CTCACAAAAAGGGAGATACCTTTAGTATGGATGATTTGCACGAACTGATCGATTT
CTTCAAACACTCAATCGAGGCTCATGAAGATTGGAAGCAATTCGGATTCAAATTT
TCTGATACGGCTACTTATGAGAATGTATCTAGTTTCTATAGAGAAGTTGAGGATC
AGGGGTATAAGCTCTCTTTCCGAAAAGTTTCGGAATCTTATGTCTATTCATTAAT
AGATCAAGGCAAGTTGTATTTATTTCAGATATACAACAAGGACTTTTCTCCCTGC
AGCAAAGGGACACCTAATCTGCATACCTTGTATTGGAGAATGCTTTTTGACGAGC
GCAATTTGGCAGATGTCATATACAAACTGGATGGGAAGGCTGAAATCTTTTTCCG
AGAGAAGAGTTTGAAAAATGATCATCCCACGCATCCGGCTGGTAAGCCTATCAAA
AAGAAAAGTCGACAAAAAAAAGGAGAGGAGAGTCTGTTTGAGTATGATTTAGTCA
AGGATAGGCACTATACGATGGATAAGTTCCAGTTTCATGTGCCTATTACTATGAA
TTTTAAATGTTCTGCAGGAAGCAAAGTCAATGATATGGTTAATGCTCATATTCGA
GAGGCAAAGGATATGCATGTCATTGGAATTGATCGTGGAGAACGCAATCTGCTGT
ATATATGCGTGATAGATAGTCGAGGGACGATTTTGGATCAAATTTCTCTGAATAC
GATTAACGATATAGACTATCATGATTTATTGGAGAGTCGAGACAAAGACCGTCAG
CAGGAGCGCCGAAACTGGCAAACTATCGAAGGGATCAAGGAGCTAAAACAAGGCT
ACCTTAGTCAGGCGGTTCATCGGATAGCCGAACTGATGGTGGCTTATAAGGCTGT
AGTTGCTTTGGAGGATTTGAATATGGGGTTCAAACGTGGGCGGCAGAAAGTAGAA
AGTTCTGTTTATCAGCAGTTTGAGAAACAGCTGATAGATAAGCTCAACTATCTTG
TGGACAAGAAGAAAAGGCCTGAAGATATTGGAGGATTGTTGAGAGCCTATCAATT
TACGGCCCCATTTAAGAGTTTTAAGGAAATGGGAAAGCAAAACGGCTTCTTGTTT
TATATCCCGGCTTGGAACACGAGCAACATAGATCCGACTACTGGATTTGTTAATT
TATTTCATGCCCAGTATGAAAATGTAGATAAAGCGAAGAGCTTCTTTCAAAAGTT
TGATTCAATTAGTTACAACCCGAAGAAAGACTGGTTTGAGTTTGCATTCGATTAT
AAAAACTTTACTAAAAAGGCTGAAGGAAGTCGTTCTATGTGGATATTATGCACAC
ATGGTTCCCGAATAAAGAATTTTAGAAATTCCCAGAAGAATGGTCAATGGGATTC
CGAAGAATTCGCCTTGACGGAGGCTTTTAAGTCTCTTTTTGTGCGATATGAGATA
GATTATACCGCTGATTTGAAAACAGCTATTGTGGACGAAAAGCAAAAAGACTTCT
TCGTGGATCTTCTGAAGCTATTCAAATTGACAGTACAGATGCGCAACAGCTGGAA
AGAGAAGGATTTGGATTATCTAATCTCTCCTGTAGCAGGGGCTGATGGCCGTTTC
TTCGATACAAGAGAGGGAAATAAAAGTCTGCCTAAGGATGCAGATGCCAATGGAG
CTTATAATATTGCCCTAAAAGGACTTTGGGCTCTACGCCAGATTCGGCAAACTTC
AGAAGGCGGTAAACTCAAATTGGCGATTTCCAATAAGGAATGGCTACAGTTTGTG
CAAGAGAGATCTTACGAGAAAGACtgaGAAATCATCCTTAGCGAAAGCTAAGGAT
TTTTTTTATCTGAAATTTATTATATCGCGTTGATTATTGATGCTGTTTTTAGTTT
TAACGGCAATTAATATATGTGTTATTAATTGAATGAATTTTATCATTCATAATAA
GTATGTGTAGGATCAAGCTCAGGTTAAATATTCACTCAGGAAGTTATTACTCAGG
AAGCAAAGAGGATTACAGAATTATCTCATAACAAGTGTTAAGGGATGTTATTTCC
SEQ AAAATTCcatGCAAAATGCTCCGGTTTCATGTCATCAAAATGATGACGTAATTAA
ID GCATTGATAATTGAGATCCCTCTCCCTGACAGGATGATTACATAAATAATAGTGA
NO: CAAAAATAAATTATTTATTTATCCAGAAAATGAATTGGAAAATCAGGAGAGCGTT
67 TTCAATCCTACCTCTGGCGCAGTTGATATGTcaaaCAGGTtgccgtcactgcgtc
ttttactggctcttctcgctaaccaaaccggtaaccccgcttattaaaagcattc
tgtaacaaagcgggaccaaagccatgacaaaaacgcgtaacaaaagtgtctataa
tcacggcagaaaagtccacattgattatttgcacggcgtcacactttgctatgcc
atagcatttttatccataagattagcggatcctacctgacgctttttatcgcaac
tctctactgtttctccatacccgtttttttgggctagcaccgcctatctcgtgtg
agataggcggagatacgaactttaagAAGGAGatataccATGAACAACGGCACAA
ATAATTTTCAGAACTTCATCGGGATCTCAAGTTTGCAGAAAACGCTGCGCAATGC
TCTGATCCCCACGGAAACCACGCAACAGTTCATCGTCAAGAACGGAATAATTAAA
GAAGATGAGTTACGTGGCGAGAACCGCCAGATTCTGAAAGATATCATGGATGACT
ACTACCGCGGATTCATCTCTGAGACTCTGAGTTCTATTGATGACATAGATTGGAC
TAGCCTGTTCGAAAAAATGGAAATTCAGCTGAAAAATGGTGATAATAAAGATACC
TTAATTAAGGAACAGACAGAGTATCGGAAAGCAATCCATAAAAAATTTGCGAACG
ACGATCGGTTTAAGAACATGTTTAGCGCCAAACTGATTAGTGACATATTACCTGA
ATTTGTCATCCACAACAATAATTATTCGGCATCAGAGAAAGAGGAAAAAACCCAG
GTGATAAAATTGTTTTCGCGCTTTGCGACTAGCTTTAAAGATTACTTCAAGAACC
GTGCAAATTGCTTTTCAGCGGACGATATTTCATCAAGCAGCTGCCATCGCATCGT
CAACGACAATGCAGAGATATTCTTTTCAAATGCGCTGGTCTACCGCCGGATCGTA
AAATCGCTGAGCAATGACGATATCAACAAAATTTCGGGCGATATGAAAGATTCAT
TAAAAGAAATGAGTCTGGAAGAAATATATTCTTACGAGAAGTATGGGGAATTTAT
TACCCAGGAAGGCATTAGCTTCTATAATGATATCTGTGGGAAAGTGAATTCTTTT
ATGAACCTGTATTGTCAGAAAAATAAAGAAAACAAAAATTTATACAAACTTCAGA
AACTTCACAAACAGATTCTATGCATTGCGGACACTAGCTATGAGGTCCCGTATAA
ATTTGAAAGTGACGAGGAAGTGTACCAATCAGTTAACGGCTTCCTTGATAACATT
AGCAGCAAACATATAGTCGAAAGATTACGCAAAATCGGCGATAACTATAACGGCT
ACAACCTGGATAAAATTTATATCGTGTCCAAATTTTACGAGAGCGTTAGCCAAAA
AACCTACCGCGACTGGGAAACAATTAATACCGCCCTCGAAATTCATTACAATAAT
ATCTTGCCGGGTAACGGTAAAAGTAAAGCCGACAAAGTAAAAAAAGCGGTTAAGA
ATGATTTACAGAAATCCATCACCGAAATAAATGAACTAGTGTCAAACTATAAGCT
GTGCAGTGACGACAACATCAAAGCGGAGACTTATATACATGAGATTAGCCATATC
TTGAATAACTTTGAAGCACAGGAATTGAAATACAATCCGGAAATTCACCTAGTTG
AATCCGAGCTCAAAGCGAGTGAGCTTAAAAACGTGCTGGACGTGATCATGAATGC
GTTTCATTGGTGTTCGGTTTTTATGACTGAGGAACTTGTTGATAAAGACAACAAT
TTTTATGCGGAACTGGAGGAGATTTACGATGAAATTTATCCAGTAATTAGTCTGT
ACAACCTGGTTCGTAACTACGTTACCCAGAAACCGTACAGCACGAAAAAGATTAA
ATTGAACTTTGGAATACCGACGTTAGCAGACGGTTGGTCAAAGTCCAAAGAGTAT
TCTAATAACGCTATCATACTGATGCGCGACAATCTGTATTATCTGGGCATCTTTA
ATGCGAAGAATAAACCGGACAAGAAGATTATCGAGGGTAATACGTCAGAAAATAA
GGGTGACTACAAAAAGATGATTTATAATTTGCTCCCGGGTCCCAACAAAATGATC
CCGAAAGTTTTCTTGAGCAGCAAGACGGGGGTGGAAACGTATAAACCGAGCGCCT
ATATCCTAGAGGGGTATAAACAGAATAAACATATCAAGTCTTCAAAAGACTTTGA
TATCACTTTCTGTCATGATCTGATCGACTACTTCAAAAACTGTATTGCAATTCAT
CCCGAGTGGAAAAACTTCGGTTTTGATTTTAGCGACACCAGTACTTATGAAGACA
TTTCCGGGTTTTATCGTGAGGTAGAGTTACAAGGTTACAAGATTGATTGGACATA
CATTAGCGAAAAAGACATTGATCTGCTGCAGGAAAAAGGTCAACTGTATCTGTTC
CAGATATATAACAAAGATTTTTCGAAAAAATCAACCGGGAATGACAACCTTCACA
CCATGTACCTGAAAAATCTTTTCTCAGAAGAAAATCTTAAGGATATCGTCCTGAA
ACTTAACGGCGAAGCGGAAATCTTCTTCAGGAAGAGCAGCATAAAGAACCCAATC
ATTCATAAAAAAGGCTCGATTTTAGTCAACCGTACCTACGAAGCAGAAGAAAAAG
ACCAGTTTGGCAACATTCAAATTGTGCGTAAAAATATTCCGGAAAACATTTATCA
GGAGCTGTACAAATACTTCAACGATAAAAGCGACAAAGAGCTGTCTGATGAAGCA
GCCAAACTGAAGAATGTAGTGGGACACCACGAGGCAGCGACGAATATAGTCAAGG
ACTATCGCTACACGTATGATAAATACTTCCTTCATATGCCTATTACGATCAATTT
CAAAGCCAATAAAACGGGTTTTATTAATGATAGGATCTTACAGTATATCGCTAAA
GAAAAAGACTTACATGTGATCGGCATTGATCGGGGCGAGCGTAACCTGATCTACG
TGTCCGTGATTGATACTTGTGGTAATATAGTTGAACAGAAAAGCTTTAACATTGT
AAACGGCTACGACTATCAGATAAAACTGAAACAACAGGAGGGCGCTAGACAGATT
GCGCGGAAAGAATGGAAAGAAATTGGTAAAATTAAAGAGATCAAAGAGGGCTACC
TGAGCTTAGTAATCCACGAGATCTCTAAAATGGTAATCAAATACAATGCAATTAT
AGCGATGGAGGATTTGTCTTATGGTTTTAAAAAAGGGCGCTTTAAGGTCGAACGG
CAAGTTTACCAGAAATTTGAAACCATGCTCATCAATAAACTCAACTATCTGGTAT
TTAAAGATATTTCGATTACCGAGAATGGCGGTCTCCTGAAAGGTTATCAGCTGAC
ATACATTCCTGATAAACTTAAAAACGTGGGTCATCAGTGCGGCTGCATTTTTTAT
GTGCCTGCTGCATACACGAGCAAAATTGATCCGACCACCGGCTTTGTGAATATCT
TTAAATTTAAAGACCTGACAGTGGACGCAAAACGTGAATTCATTAAAAAATTTGA
CTCAATTCGTTATGACAGTGAAAAAAATCTGTTCTGCTTTACATTTGACTACAAT
AACTTTATTACGCAAAACACGGTCATGAGCAAATCATCGTGGAGTGTGTATACAT
ACGGCGTGCGCATCAAACGTCGCTTTGTGAACGGCCGCTTCTCAAACGAAAGTGA
TACCATTGACATAACCAAAGATATGGAGAAAACGTTGGAAATGACGGACATTAAC
TGGCGCGATGGCCACGATCTTCGTCAAGACATTATAGATTATGAAATTGTTCAGC
ACATATTCGAAATTTTCCGTTTAACAGTGCAAATGCGTAACTCCTTGTCTGAACT
GGAGGACCGTGATTACGATCGTCTCATTTCACCTGTACTGAACGAAAATAACATT
TTTTATGACAGCGCGAAAGCGGGGGATGCACTTCCTAAGGATGCCGATGCAAATG
GTGCGTATTGTATTGCATTAAAAGGGTTATATGAAATTAAACAAATTACCGAAAA
TTGGAAAGAAGATGGTAAATTTTCGCGCGATAAACTCAAAATCAGCAATAAAGAT
TGGTTCGACTTTATCCAGAATAAGCGCTATCTCTAAGAAATCATCCTTAGCGAAA
GCTAAGGATTTTTTTTATCTGAAATTTATTATATCGCGTTGATTATTGATGCTGT
TTTTAGTTTTAACGGCAATTAATATATGTGTTATTAATTGAATGAATTTTATCAT
TCATAATAAGTATGTGTAGGATCAAGCTCAGGTTAAATATTCACTCAGGAAGTTA
TTACTCAGGAAGCAAAGAGGATTACAGAATTATCTCATAACAAGTGTTAAGGGAT
GTTATTTCC
SEQ AATCAGGAGAGCGTTTTCAATCCTACCTCTGGCGCAGTTGATATGTCAAACAGGT
ID TGCCGTCACTGCGTCTTTTACTGGCTCTTCTCGCTAACCAAACCGGTAACCCCGC
NO: TTATTAAAAGCATTCTGTAACAAAGCGGGACCAAAGCCATGACAAAAACGCGTAA
68 CAAAAGTGTCTATAATCACGGCAGAAAAGTCCACATTGATTATTTGCACGGCGTC
ACACTTTGCTATGCCATAGCATTTTTATCCATAAGATTAGCGGATCCTACCTGAC
GCTTTTTATCGCAACTCTCTACTGTTTCTCCATACCCGTTTTTTTGGGCTAGCAG
TAATACGACTCACTATAGGGGTCTCATCTCGTGTGAGATAGGCGGAGATACGAAC
TTTAAGAGGAGGATATACCATGCACCATCATCATCACCATACCAATAAATTCACT
AACCAGTATTCTCTCTCTAAGACCCTGCGCTTTGAACTGATTCCGCAGGGGAAAA
CCTTGGAGTTCATTCAAGAAAAAGGCCTCTTGTCTCAGGATAAACAGAGGGCTGA
ATCTTACCAAGAAATGAAGAAAACTATTGATAAGTTTCATAAATATTTCATTGAT
TTAGCCTTGTCTAACGCCAAATTAACTCACTTGGAAACGTATCTGGAGTTATACA
ACAAATCTGCCGAAACTAAGAAAGAACAGAAATTTAAAGACGATTTGAAAAAAGT
ACAGGACAATCTGCGTAAAGAAATTGTCAAATCCTTCAGTGACGGCGATGCTAAA
AGCATTTTTGCCATTCTGGACAAAAAAGAGTTGATTACTGTGGAATTAGAAAAGT
GGTTTGAAAACAATGAGCAGAAAGACATCTACTTCGATGAGAAATTCAAAACTTT
CACCACCTATTTTACAGGATTTCATCAAAACCGGAAGAACATGTACTCAGTAGAA
CCGAACTCCACGGCCATTGCGTATCGTTTGATCCATGAGAATCTGCCTAAATTTC
TGGAGAATGCGAAAGCCTTTGAAAAGATTAAGCAGGTCGAATCGCTGCAAGTGAA
TTTTCGTGAACTCATGGGCGAATTTGGTGACGAAGGTCTAATCTTCGTTAACGAA
CTGGAAGAAATGTTTCAGATTAATTACTACAATGACGTGCTATCGCAGAACGGTA
TCACAATCTACAATAGTATTATCTCAGGGTTCACAAAAAACGATATAAAATACAA
AGGCCTGAACGAGTATATCAATAACTACAACCAAACAAAGGACAAAAAGGATAGG
CTTCCGAAACTGAAGCAGTTATACAAACAGATTTTATCTGACAGAATCTCCCTGA
GCTTTCTGCCGGATGCTTTCACTGATGGGAAGCAGGTTCTGAAAGCGATTTTCGA
TTTTTATAAGATTAACTTACTGAGCTACACGATTGAAGGTCAAGAAGAATCTCAA
AACTTACTGCTCTTGATCCGTCAAACCATTGAAAATCTATCATCGTTCGATACGC
AGAAAATCTACCTCAAAAACGATACTCACCTGACTACGATCTCTCAGCAGGTTTT
CGGGGATTTTAGTGTATTTTCAACAGCTCTGAACTACTGGTATGAAACCAAAGTC
AATCCGAAATTCGAGACGGAATATTCTAAGGCCAACGAAAAAAAACGTGAGATTC
TTGATAAAGCTAAAGCCGTATTTACTAAACAGGATTACTTTTCTATTGCTTTCCT
GCAGGAAGTTTTATCGGAGTATATCCTGACCCTGGATCATACATCTGATATCGTT
AAAAAACACAGCAGCAATTGCATCGCTGACTATTTCAAAAACCACTTTGTCGCCA
AAAAAGAAAACGAAACAGACAAGACTTTCGATTTCATTGCTAACATCACCGCAAA
ATACCAGTGTATTCAGGGTATCTTGGAAAACGCCGACCAATACGAAGACGAACTG
AAACAAGATCAGAAGCTGATCGATAATTTAAAATTCTTCTTAGATGCAATCCTGG
AGCTGCTGCACTTCATCAAACCGCTTCATTTAAAGAGCGAGTCCATTACCGAAAA
GGACACCGCCTTCTATGACGTTTTTGAAAATTATTATGAAGCCCTCTCCTTGCTG
ACTCCGCTGTATAATATGGTACGCAATTACGTAACCCAGAAACCATATTCTACCG
AAAAAATTAAACTGAACTTTGAAAACGCACAGCTGCTCAACGGTTGGGACGCGAA
TAAAGAAGGTGACTACCTCACCACCATCCTGAAAAAAGATGGTAACTATTTTCTG
GCAATTATGGATAAGAAACATAATAAAGCATTCCAGAAATTTCCTGAAGGGAAAG
AAAATTACGAAAAGATGGTGTACAAACTCTTACCTGGAGTTAACAAAATGTTGCC
GAAAGTATTTTTTAGTAATAAGAACATCGCGTACTTTAACCCGTCCAAAGAACTG
CTGGAAAATTATAAAAAGGAGACGCATAAGAAAGGGGATACCTTTAACCTGGAAC
ATTGCCATACCTTAATAGACTTCTTCAAGGATTCCCTGAATAAACACGAGGATTG
GAAATATTTCGATTTTCAGTTTAGTGAGACCAAGTCATACCAGGATCTTAGCGGC
TTTTATCGCGAAGTAGAACACCAAGGCTATAAAATTAACTTCAAAAACATCGACA
GCGAATACATCGACGGTTTAGTTAACGAGGGCAAACTGTTTCTGTTCCAGATCTA
TTCAAAGGATTTTAGCCCGTTCTCTAAAGGCAAACCAAATATGCATACGTTGTAC
TGGAAAGCACTGTTTGAAGAGCAAAACCTGCAGAATGTGATTTATAAACTGAACG
GCCAAGCTGAGATTTTTTTCCGTAAAGCCTCGATTAAACCGAAAAATATCATCCT
TCATAAGAAGAAAATAAAGATCGCTAAAAAACACTTCATAGATAAAAAAACCAAA
ACCTCCGAAATAGTGCCTGTTCAAACAATTAAGAACTTGAATATGTACTACCAGG
GCAAGATATCGGAAAAGGAGTTGACTCAAGACGATCTTCGCTATATCGATAACTT
TTCGATTTTTAACGAAAAAAACAAGACGATCGACATCATCAAAGATAAACGCTTC
ACTGTAGATAAGTTCCAGTTTCATGTGCCGATTACTATGAACTTCAAAGCTACCG
GGGGTAGCTATATCAACCAAACGGTGTTGGAATACCTGCAGAATAACCCGGAAGT
CAAAATCATTGGGCTGGACCGCGGAGAACGTCACCTTGTGTACTTGACCTTAATC
GATCAGCAAGGCAACATCTTAAAACAAGAATCGCTGAATACCATTACGGATTCAA
AGATTAGCACCCCGTATCATAAGCTGCTCGATAACAAGGAGAATGAGCGCGACCT
GGCCCGTAAAAACTGGGGCACGGTGGAAAACATTAAGGAGTTAAAGGAGGGTTAT
ATTTCCCAGGTAGTGCATAAGATCGCCACTCTCATGCTCGAGGAAAATGCGATCG
TTGTCATGGAAGACTTAAACTTCGGATTTAAACGTGGGCGATTTAAAGTAGAGAA
ACAAATCTACCAGAAGTTAGAAAAAATGCTGATTGACAAATTAAATTACTTGGTC
CTAAAAGACAAACAGCCGCAAGAATTGGGTGGATTATACAACGCCCTCCAACTTA
CCAATAAATTCGAAAGTTTTCAGAAAATGGGTAAACAGTCAGGCTTTCTTTTTTA
TGTTCCTGCGTGGAACACATCCAAAATCGACCCTACAACCGGCTTCGTCAATTAC
TTCTATACTAAATATGAAAACGTCGACAAAGCAAAAGCATTCTTTGAAAAGTTCG
AAGCAATACGTTTTAACGCTGAGAAAAAATATTTCGAGTTCGAAGTCAAGAAATA
CTCAGACTTTAACCCCAAAGCTGAGGGCACACAGCAAGCGTGGACAATCTGCACC
TACGGCGAGCGCATCGAAACGAAGCGTCAAAAAGATCAGAATAACAAATTTGTTT
CAACACCTATCAACCTGACCGAGAAGATTGAAGACTTCTTAGGTAAAAATCAGAT
TGTTTATGGCGACGGTAACTGTATAAAATCTCAAATAGCCTCAAAGGATGATAAA
GCATTTTTCGAAACATTATTATATTGGTTCAAAATGACACTGCAGATGCGCAATA
GTGAGACGCGTACAGATATTGATTATCTTATCAGCCCGGTCATGAACGACAACGG
TACTTTTTACAACTCCAGAGACTATGAAAAACTTGAGAATCCAACTCTCCCCAAA
GATGCTGATGCGAACGGTGCTTATCACATCGCGAAAAAAGGTCTGATGCTGCTGA
ACAAAATCGACCAAGCCGATCTGACTAAGAAAGTTGACCTAAGCATTTCAAATCG
GGACTGGTTACAGTTTGTTCAAAAGAACAAATGAGAAATCATCCTTAGCGAAAGC
TAAGGATTTTTTTTATCTGAAATGTAGGGAGACCCTCAGGTTAAATATTCACTCA
GGAAGTTATTACTCAGGAAGCAAAGAGGATTACA
SEQ AAAATTCcatGCAAAATGCTCCGGTTTCATGTCATCAAAATGATGACGTAATTAA
ID GCATTGATAATTGAGATCCCTCTCCCTGACAGGATGATTACATAAATAATAGTGA
NO: CAAAAATAAATTATTTATTTATCCAGAAAATGAATTGGAAAATCAGGAGAGCGTT
69 TTCAATCCTACCTCTGGCGCAGTTGATATGTcaaaCAGGTtgccgtcactgcgtc
ttttactggctcttctcgctaaccaaaccggtaaccccgcttattaaaagcattc
tgtaacaaagcgggaccaaagccatgacaaaaacgcgtaacaaaagtgtctataa
tcacggcagaaaagtccacattgattatttgcacggcgtcacactttgctatgcc
atagcatttttatccataagattagcggatcctacctgacgctttttatcgcaac
tctctactgtttctccatacccgtttttttgggctagcaccgcctatctcgtgtg
agataggcggagatacgaactttaagAAGGAGatataccATGGAACAGGAATATT
ATCTGGGCTTGGACATGGGCACCGGTTCCGTCGGCTGGGCTGTTACTGACAGTGA
ATATCACGTTCTAAGAAAGCATGGTAAGGCATTGTGGGGTGTAAGACTTTTCGAA
TCTGCTTCCACTGCTGAAGAGCGTAGAATGTTTAGAACGAGTCGACGTAGGCTAG
ACAGGCGCAATTGGAGAATCGAAATTTTACAAGAAATTTTTGCGGAAGAGATATC
TAAGAAAGACCCAGGCTTTTTCCTGAGAATGAAGGAATCTAAGTATTACCCTGAG
GATAAAAGAGATATAAATGGTAACTGTCCCGAATTGCCTTACGCATTATTTGTGG
ACGATGATTTTACCGATAAGGATTACCATAAAAAGTTCCCAACTATCTACCATTT
ACGCAAAATGTTAATGAATACAGAGGAAACCCCAGACATAAGACTAGTTTATCTG
GCAATACACCATATGATGAAACATAGAGGCCATTTCTTACTTTCCGGGGATATCA
ACGAAATCAAAGAGTTTGGTACCACATTTAGTAAGTTACTGGAAAACATAAAGAA
TGAAGAATTGGATTGGAACTTAGAACTCGGAAAAGAAGAATACGCGGTTGTCGAA
TCTATCCTGAAGGATAATATGCTGAATAGGTCGACCAAAAAAACTAGGCTGATCA
AAGCACTGAAAGCCAAATCTATCTGCGAAAAAGCTGTTTTAAATTTACTTGCTGG
TGGCACTGTTAAGTTATCAGACATTTTTGGTTTGGAAGAATTGAACGAAACCGAG
CGTCCAAAAATTAGTTTCGCTGATAATGGCTACGATGATTACATTGGTGAGGTGG
AAAACGAGTTGGGCGAACAATTTTATATTATAGAGACAGCTAAGGCAGTCTATGA
CTGGGCTGTTTTAGTAGAAATCCTTGGTAAATACACATCTATCTCCGAAGCGAAA
GTTGCTACTTACGAAAAGCACAAGTCCGATCTCCAGTTTTTGAAGAAAATTGTCA
GGAAATATCTGACTAAGGAAGAATATAAAGATATTTTCGTTAGTACCTCTGACAA
ACTGAAAAATTACTCCGCTTACATCGGGATGACCAAGATTAATGGCAAAAAAGTT
GATCTGCAAAGCAAAAGGTGTTCGAAGGAAGAATTTTATGATTTCATTAAAAAGA
ATGTCTTAAAAAAATTAGAAGGTCAGCCAGAATACGAATATTTGAAAGAAGAACT
GGAAAGAGAGACATTCTTACCAAAACAAGTCAACAGAGATAATGGGGTAATTCCA
TATCAAATTCACCTCTACGAATTAAAAAAAATTTTAGGCAATTTACGCGATAAAA
TTGACCTTATCAAAGAAAATGAGGATAAGCTGGTTCAACTCTTTGAATTCAGAAT
ACCCTATTATGTGGGCCCACTGAACAAGATTGATGACGGCAAAGAAGGTAAATTC
ACATGGGCCGTCCGCAAATCCAATGAAAAAATTTACCCATGGAACTTTGAAAATG
TAGTAGATATTGAAGCGTCTGCGGAGAAATTTATTCGAAGAATGACTAATAAATG
CACTTACTTGATGGGAGAGGATGTTCTGCCTAAAGACAGCTTATTATACAGCAAG
TACATGGTTCTAAACGAACTTAACAACGTTAAGTTGGACGGTGAGAAATTAAGTG
TAGAATTGAAACAAAGATTGTATACTGACGTCTTCTGCAAGTACAGAAAAGTGAC
AGTTAAAAAAATTAAGAATTACTTGAAGTGCGAAGGTATAATTTCTGGAAACGTA
GAGATTACTGGTATTGATGGTGATTTCAAAGCATCCCTAACAGCTTACCACGATT
TCAAGGAAATCCTGACAGGAACTGAACTCGCAAAAAAAGATAAAGAAAACATTAT
TACTAATATTGTTCTTTTCGGTGATGACAAGAAATTGTTGAAGAAAAGACTGAAT
AGACTTTACCCCCAGATTACTCCCAATCAACTTAAGAAAATTTGTGCTTTGTCTT
ACACAGGATGGGGTCGTTTTTCAAAAAAGTTCTTAGAAGAGATTACCGCACCTGA
TCCAGAAACAGGCGAAGTATGGAATATAATTACCGCCTTATGGGAATCGAACAAT
AATCTTATGCAACTTCTGAGCAATGAATATCGTTTCATGGAAGAAGTTGAGACTT
ACAACATGGGCAAACAGACGAAGACTTTATCCTATGAAACTGTGGAAAATATGTA
TGTATCACCTTCTGTCAAGAGACAAATTTGGCAAACCTTAAAAATTGTCAAAGAA
TTAGAAAAGGTAATGAAGGAGTCTCCTAAACGTGTGTTTATTGAAATGGCTAGAG
AAAAACAAGAGTCAAAAAGAACCGAGTCAAGAAAGAAGCAGTTAATCGATTTATA
TAAGGCTTGTAAAAACGAAGAGAAAGATTGGGTTAAAGAATTGGGGGACCAAGAG
GAACAAAAACTACGGTCGGATAAGTTGTATTTATACTATACGCAAAAGGGACGAT
GTATGTATTCCGGCGAGGTAATAGAATTGAAGGATTTATGGGACAATACAAAATA
TGACATAGACCATATATATCCCCAATCAAAAACGATGGACGATAGCTTGAACAAT
AGAGTACTCGTGAAAAAAAAATATAATGCGACCAAATCTGATAAGTATCCTCTGA
ATGAAAATATCAGACATGAAAGAAAGGGGTTCTGGAAGTCCTTGTTAGATGGTGG
GTTTATAAGCAAAGAAAAGTACGAGCGTCTAATAAGAAACACGGAGTTATCGCCA
GAAGAACTCGCTGGTTTTATTGAGAGGCAAATCGTGGAAACGAGACAATCTACCA
AAGCCGTTGCTGAGATCCTAAAGCAAGTTTTCCCAGAGTCGGAGATTGTCTATGT
CAAAGCTGGCACAGTGAGCAGGTTTAGGAAAGACTTCGAACTATTAAAGGTAAGA
GAAGTGAACGATTTACATCACGCAAAGGACGCTTACCTAAATATCGTTGTAGGTA
ACTCATATTATGTTAAATTTACCAAGAACGCCTCTTGGTTTATAAAGGAGAACCC
AGGTAGAACATATAACCTGAAAAAGATGTTCACCTCTGGTTGGAATATTGAGAGA
AACGGAGAAGTCGCATGGGAAGTTGGTAAGAAAGGGACTATAGTGACAGTAAAGC
AAATTATGAACAAAAATAATATCCTCGTTACAAGGCAGGTTCATGAAGCAAAGGG
CGGCCTTTTTGACCAACAAATTATGAAGAAAGGGAAAGGTCAAATTGCAATAAAA
GAAACCGATGAGAGACTAGCGTCAATAGAAAAGTATGGTGGCTATAATAAAGCTG
CGGGTGCATACTTTATGCTTGTTGAATCAAAAGACAAGAAAGGTAAGACTATTAG
AACTATAGAATTTATACCCCTGTACCTTAAAAACAAAATTGAATCGGATGAGTCA
ATCGCGTTAAATTTTCTAGAGAAAGGAAGGGGTTTAAAAGAACCAAAGATCCTGT
TAAAAAAGATTAAGATTGACACCTTGTTCGATGTAGATGGATTTAAAATGTGGTT
ATCTGGCAGAACAGGCGATAGACTTTTGTTTAAGTGCGCTAATCAATTAATTTTG
GATGAGAAAATCATTGTCACAATGAAAAAAATAGTTAAGTTTATTCAGAGAAGAC
AAGAAAACAGGGAGTTGAAATTATCTGATAAAGATGGTATCGACAATGAAGTTTT
AATGGAAATCTACAATACATTCGTTGATAAACTTGAAAATACCGTATATCGAATC
AGGTTAAGTGAACAAGCCAAAACATTAATTGATAAACAAAAAGAATTTGAAAGGC
TATCACTGGAAGACAAATCCTCCACCCTATTTGAAATTTTGCATATATTCCAGTG
CCAATCTTCAGCAGCTAATTTAAAAATGATTGGCGGACCTGGGAAAGCCGGCATC
CTAGTGATGAACAATAATATCTCCAAGTGTAACAAAATATCAATTATTAACCAAT
CTCCGACAGGTATTTTTGAAAATGAAATAGACTTGCTTAAGATATAAGAAATCAT
CCTTAGCGAAAGCTAAGGATTTTTTTTATCTGAAATTTATTATATCGCGTTGATT
ATTGATGCTGTTTTTAGTTTTAACGGCAATTAATATATGTGTTATTAATTGAATG
AATTTTATCATTCATAATAAGTATGTGTAGGATCAAGCTCAGGTTAAATATTCAC
TCAGGAAGTTATTACTCAGGAAGCAAAGAGGATTACAGAATTATCTCATAACAAG
TGTTAAGGGATGTTATTTCC
SEQ AATCAGGAGAGCGTTTTCAATCCTACCTCTGGCGCAGTTGATATGTCAAACAGGT
ID TGCCGTCACTGCGTCTTTTACTGGCTCTTCTCGCTAACCAAACCGGTAACCCCGC
NO: TTATTAAAAGCATTCTGTAACAAAGCGGGACCAAAGCCATGACAAAAACGCGTAA
70 CAAAAGTGTCTATAATCACGGCAGAAAAGTCCACATTGATTATTTGCACGGCGTC
ACACTTTGCTATGCCATAGCATTTTTATCCATAAGATTAGCGGATCCTACCTGAC
GCTTTTTATCGCAACTCTCTACTGTTTCTCCATACCCGTTTTTTTGGGCTAGCAG
TAATACGACTCACTATAGGGGTCTCATCTCGTGTGAGATAGGCGGAGATACGAAC
TTTAAGAGGAGGATATACCATGCACCATCATCATCACCATTCTTTCGACTCTTTC
ACCAACCTGTACTCTCTGTCTAAAACCCTGAAATTCGAAATGCGTCCGGTTGGTA
ACACCCAGAAAATGCTGGACAACGCGGGTGTTTTCGAAAAAGACAAACTGATCCA
GAAAAAATACGGTAAAACCAAACCGTACTTCGACCGTCTGCACCGTGAATTCATC
GAAGAAGCGCTGACCGGTGTTGAACTGATCGGTCTGGACGAAAACTTCCGTACCC
TGGTTGACTGGCAGAAAGACAAAAAAAACAACGTTGCGATGAAAGCGTACGAAAA
CTCTCTGCAGCGTCTGCGTACCGAAATCGGTAAAATCTTCAACCTGAAAGCGGAA
GACTGGGTTAAAAACAAATACCCGATCCTGGGTCTGAAAAACAAAAACACCGACA
TCCTGTTCGAAGAAGCGGTTTTCGGTATCCTGAAAGCGCGTTACGGTGAAGAAAA
AGACACCTTCATCGAAGTTGAAGAAATCGACAAAACCGGTAAATCTAAAATCAAC
CAGATCTCTATCTTCGACTCTTGGAAAGGTTTCACCGGTTACTTCAAAAAATTCT
TCGAAACCCGTAAAAACTTCTACAAAAACGACGGTACCTCTACCGCGATCGCGAC
CCGTATCATCGACCAGAACCTGAAACGTTTCATCGACAACCTGTCTATCGTTGAA
TCTGTTCGTCAGAAAGTTGACCTGGCGGAAACCGAAAAATCTTTCTCTATCTCTC
TGTCTCAGTTCTTCTCTATCGACTTCTACAACAAATGCCTGCTGCAGGACGGTAT
CGACTACTACAACAAAATCATCGGTGGTGAAACCCTGAAAAACGGTGAAAAACTG
ATCGGTCTGAACGAACTGATCAACCAGTACCGTCAGAACAACAAAGACCAGAAAA
TCCCGTTCTTCAAACTGCTGGACAAACAGATCCTGTCTGAAAAAATCCTGTTCCT
GGACGAAATCAAAAACGACACCGAACTGATCGAAGCGCTGTCTCAGTTCGCGAAA
ACCGCGGAAGAAAAAACCAAAATCGTTAAAAAACTGTTCGCGGACTTCGTTGAAA
ACAACTCTAAATACGACCTGGCGCAGATCTACATCTCTCAGGAAGCGTTCAACAC
CATCTCTAACAAATGGACCTCTGAAACCGAAACCTTCGCGAAATACCTGTTCGAA
GCGATGAAATCTGGTAAACTGGCGAAATACGAAAAAAAAGACAACTCTTACAAAT
TCCCGGACTTCATCGCGCTGTCTCAGATGAAATCTGCGCTGCTGTCTATCTCTCT
GGAAGGTCACTTCTGGAAAGAAAAATACTACAAAATCTCTAAATTCCAGGAAAAA
ACCAACTGGGAACAGTTCCTGGCGATCTTCCTGTACGAATTCAACTCTCTGTTCT
CTGACAAAATCAACACCAAAGACGGTGAAACCAAACAGGTTGGTTACTACCTGTT
CGCGAAAGACCTGCACAACCTGATCCTGTCTGAACAGATCGACATCCCGAAAGAC
TCTAAAGTTACCATCAAAGACTTCGCGGACTCTGTTCTGACCATCTACCAGATGG
CGAAATACTTCGCGGTTGAAAAAAAACGTGCGTGGCTGGCGGAATACGAACTGGA
CTCTTTCTACACCCAGCCGGACACCGGTTACCTGCAGTTCTACGACAACGCGTAC
GAAGACATCGTTCAGGTTTACAACAAACTGCGTAACTACCTGACCAAAAAACCGT
ACTCTGAAGAAAAATGGAAACTGAACTTCGAAAACTCTACCCTGGCGAACGGTTG
GGACAAAAACAAAGAATCTGACAACTCTGCGGTTATCCTGCAGAAAGGTGGTAAA
TACTACCTGGGTCTGATCACCAAAGGTCACAACAAAATCTTCGACGACCGTTTCC
AGGAAAAATTCATCGTTGGTATCGAAGGTGGTAAATACGAAAAAATCGTTTACAA
ATTCTTCCCGGACCAGGCGAAAATGTTCCCGAAAGTTTGCTTCTCTGCGAAAGGT
CTGGAATTCTTCCGTCCGTCTGAAGAAATCCTGCGTATCTACAACAACGCGGAAT
TCAAAAAAGGTGAAACCTACTCTATCGACTCTATGCAGAAACTGATCGACTTCTA
CAAAGACTGCCTGACCAAATACGAAGGTTGGGCGTGCTACACCTTCCGTCACCTG
AAACCGACCGAAGAATACCAGAACAACATCGGTGAATTCTTCCGTGACGTTGCGG
AAGACGGTTACCGTATCGACTTCCAGGGTATCTCTGACCAGTACATCCACGAAAA
AAACGAAAAAGGTGAACTGCACCTGTTCGAAATCCACAACAAAGACTGGAACCTG
GACAAAGCGCGTGACGGTAAATCTAAAACCACCCAGAAAAACCTGCACACCCTGT
ACTTCGAATCTCTGTTCTCTAACGACAACGTTGTTCAGAACTTCCCGATCAAACT
GAACGGTCAGGCGGAAATCTTCTACCGTCCGAAAACCGAAAAAGACAAACTGGAA
TCTAAAAAAGACAAAAAAGGTAACAAAGTTATCGACCACAAACGTTACTCTGAAA
ACAAAATCTTCTTCCACGTTCCGCTGACCCTGAACCGTACCAAAAACGACTCTTA
CCGTTTCAACGCGCAGATCAACAACTTCCTGGCGAACAACAAAGACATCAACATC
ATCGGTGTTGACCGTGGTGAAAAACACCTGGTTTACTACTCTGTTATCACCCAGG
CGTCTGACATCCTGGAATCTGGTTCTCTGAACGAACTGAACGGTGTTAACTACGC
GGAAAAACTGGGTAAAAAAGCGGAAAACCGTGAACAGGCGCGTCGTGACTGGCAG
GACGTTCAGGGTATCAAAGACCTGAAAAAAGGTTACATCTCTCAGGTTGTTCGTA
AACTGGCGGACCTGGCGATCAAACACAACGCGATCATCATCCTGGAAGACCTGAA
CATGCGTTTCAAACAGGTTCGTGGTGGTATCGAAAAATCTATCTACCAGCAGCTG
GAAAAAGCGCTGATCGACAAACTGTCTTTCCTGGTTGACAAAGGTGAAAAAAACC
CGGAACAGGCGGGTCACCTGCTGAAAGCGTACCAGCTGTCTGCGCCGTTCGAAAC
CTTCCAGAAAATGGGTAAACAGACCGGTATCATCTTCTACACCCAGGCGTCTTAC
ACCTCTAAATCTGACCCGGTTACCGGTTGGCGTCCGCACCTGTACCTGAAATACT
TCTCTGCGAAAAAAGCGAAAGACGACATCGCGAAATTCACCAAAATCGAATTCGT
TAACGACCGTTTCGAACTGACCTACGACATCAAAGACTTCCAGCAGGCGAAAGAA
TACCCGAACAAAACCGTTTGGAAAGTTTGCTCTAACGTTGAACGTTTCCGTTGGG
ACAAAAACCTGAACCAGAACAAAGGTGGTTACACCCACTACACCAACATCACCGA
AAACATCCAGGAACTGTTCACCAAATACGGTATCGACATCACCAAAGACCTGCTG
ACCCAGATCTCTACCATCGACGAAAAACAGAACACCTCTTTCTTCCGTGACTTCA
TCTTCTACTTCAACCTGATCTGCCAGATCCGTAACACCGACGACTCTGAAATCGC
GAAAAAAAACGGTAAAGACGACTTCATCCTGTCTCCGGTTGAACCGTTCTTCGAC
TCTCGTAAAGACAACGGTAACAAACTGCCGGAAAACGGTGACGACAACGGTGCGT
ACAACATCGCGCGTAAAGGTATCGTTATCCTGAACAAAATCTCTCAGTACTCTGA
AAAAAACGAAAACTGCGAAAAAATGAAATGGGGTGACCTGTACGTTTCTAACATC
GACTGGGACAACTTCGTTGAAATCATCCTTAGCGAAAGCTAAGGATTTTTTTTAT
CTGAAATGTAGGGAGACCCTCAGGTTAAATATTCACTCAGGAAGTTATTACTCAG
GAAGCAAAGAGGATTACA
SEQ AATCAGGAGAGCGTTTTCAATCCTACCTCTGGCGCAGTTGATATGTCAAACAGGT
ID TGCCGTCACTGCGTCTTTTACTGGCTCTTCTCGCTAACCAAACCGGTAACCCCGC
NO: TTATTAAAAGCATTCTGTAACAAAGCGGGACCAAAGCCATGACAAAAACGCGTAA
71 CAAAAGTGTCTATAATCACGGCAGAAAAGTCCACATTGATTATTTGCACGGCGTC
ACACTTTGCTATGCCATAGCATTTTTATCCATAAGATTAGCGGATCCTACCTGAC
GCTTTTTATCGCAACTCTCTACTGTTTCTCCATACCCGTTTTTTTGGGCTAGCAG
TAATACGACTCACTATAGGGGTCTCATCTCGTGTGAGATAGGCGGAGATACGAAC
TTTAAGAGGAGGATATACCATGCACCATCATCATCACCATAACAAATTCGAAAAC
TTCACCGGTCTGTACCCGATCTCTAAAACCCTGCGTTTCGAACTGATCCCGCAGG
GTAAAACCCTGGAATACATCGAAAAATCTGAAATCCTGGAAAACGACAACTACCG
TGCGGAAAAATACGAAGAAGTTAAAGACATCATCGACGGTTACCACAAATGGTTC
ATCAACGAAACCCTGCACGACCTGCACATCAACTGGTCTGAACTGAAAGTTGCGC
TGGAAAACAACCGTATCGAAAAATCTGACGCGTCTAAAAAAGAACTGCAGCGTGT
TCAGAAAATCAAACGTGAAGAAATCTACAACGCGTTCATCGAACACGAAGCGTTC
CAGTACCTGTTCAAAGAAAACCTGCTGTCTGACCTGCTGCCGATCCAGATCGAAC
AGTCTGAAGACCTGGACGCGGAAAAAAAAAAACAGGCGGTTGAAACCTTCAACCG
TTTCTCTACCTACTTCACCGGTTTCCACGAAAACCGTAAAAACATCTACTCTAAA
GAAGGTATCTCTACCTCTGTTACCTACCGTATCGTTCACGACAACTTCCCGAAAT
TCCTGGAAAACATGAAAGTTTTCGAAATCCTGCGTAACGAATGCCCGGAAGTTAT
CTCTGACACCGCGAACGAACTGGCGCCGTTCATCGACGGTGTTCGTATCGAAGAC
ATCTTCCTGATCGACTTCTTCAACTCTACCTTCTCTCAGAACGGTATCGACTACT
ACAACCGTATCCTGGGTGGTGTTACCACCGAAACCGGTGAAAAATACCGTGGTAT
CAACGAATTCACCAACCTGTACCGTCAGCAGCACCCGGAATTCGGTAAATCTAAA
AAAGCGACCAAAATGGTTGTTCTGTTCAAACAGATCCTGTCTGACCGTGACACCC
TGTCTTTCATCCCGGAAATGTTCGGTAACGACAAACAGGTTCAGAACTCTATCCA
GCTGTTCTACAACCGTGAAATCTCTCAGTTCGAAAACGAAGGTGTTAAAACCGAC
GTTTGCACCGCGCTGGCGACCCTGACCTCTAAAATCGCGGAATTCGACACCGAAA
AAATCTACATCCAGCAGCCGGAACTGCCGAACGTTTCTCAGCGTCTGTTCGGTTC
TTGGAACGAACTGAACGCGTGCCTGTTCAAATACGCGGAACTGAAATTCGGTACC
GCGGAAAAAGTTGCGAACCGTAAAAAAATCGACAAATGGCTGAAATCTGACCTGT
TCTCTTTCACCGAACTGAACAAAGCGCTGGAATTCTCTGGTAAAGACGAACGTAT
CGAAAACTACTTCTCTGAAACCGGTATCTTCGCGCAGCTGGTTAAAACCGGTTTC
GACGAAGCGCAGTCTATCCTGGAAACCGAATACACCTCTGAAGTTCACCTGAAAG
ACCAGCAGACCGACATCGAAAAAATCAAAACCTTCCTGGACGCGCTGCAGAACCT
GATGCACCTGCTGAAATCTCTGTGCGTTTCTGAAGAAGCGGACCGTGACGCGGCG
TTCTACAACGAATTCGACATGCTGTACAACCAGCTGAAACTGGTTGTTCCGCTGT
ACAACAAAGTTCGTAACTACATCACCCAGAAACTGTTCCGTTCTGACAAAATCAA
AATCTACTTCGAAAACAAAGGTCAGTTCCTGGGTGGTTGGGTTGACTCTCAGACC
GAAAACTCTGACAACGGTACCCAGGCGGGTGGTTACATCTTCCGTAAAGAAAACG
TTATCAACGAATACGACTACTACCTGGGTATCTGCTCTGACCCGAAACTGTTCCG
TCGTACCACCATCGTTTCTGAAAACGACCGTTCTTCTTTCGAACGTCTGGACTAC
TACCAGCTGAAAACCGCGTCTGTTTACGGTAACTCTTACTGCGGTAAACACCCGT
ACACCGAAGACAAAAACGAACTGGTTAACTCTATCGACCGTTTCGTTCACCTGTC
TGGTAACAACATCCTGATCGAAAAAATCGCGAAAGACAAAGTTAAATCTAACCCG
ACCACCAACACCCCGTCTGGTTACCTGAACTTCATCCACCGTGAAGCGCCGAACA
CCTACGAATGCCTGCTGCAGGACGAAAACTTCGTTTCTCTGAACCAGCGTGTTGT
TTCTGCGCTGAAAGCGACCCTGGCGACCCTGGTTCGTGTTCCGAAAGCGCTGGTT
TACGCGAAAAAAGACTACCACCTGTTCTCTGAAATCATCAACGACATCGACGAAC
TGTCTTACGAAAAAGCGTTCTCTTACTTCCCGGTTTCTCAGACCGAATTCGAAAA
CTCTTCTAACCGTACCATCAAACCGCTGCTGCTGTTCAAAATCTCTAACAAAGAC
CTGTCTTTCGCGGAAAACTTCGAAAAAGGTAACCGTCAGAAAATCGGTAAAAAAA
ACCTGCACACCCTGTACTTCGAAGCGCTGATGAAAGGTAACCAGGACACCATCGA
CATCGGTACCGGTATGGTTTTCCACCGTGTTAAATCTCTGAACTACAACGAAAAA
ACCCTGAAATACGGTCACCACTCTACCCAGCTGAACGAAAAATTCTCTTACCCGA
TCATCAAAGACAAACGTTTCGCGTCTGACAAATTCCTGTTCCACCTGTCTACCGA
AATCAACTACAAAGAAAAACGTAAACCGCTGAACAACTCTATCATCGAATTCCTG
ACCAACAACCCGGACATCAACATCATCGGTCTGGACCGTGGTGAACGTCACCTGA
TCTACCTGACCCTGATCAACCAGAAAGGTGAAATCCTGCGTCAGAAAACCTTCAA
CATCGTTGGTAACACCAACTACCACGAAAAACTGAACCAGCGTGAAAAAGAACGT
GACAACGCGCGTAAATCTTGGGCGACCATCGGTAAAATCAAAGAACTGAAAGAAG
GTTTCCTGTCTCTGGTTATCCACGAAATCGCGAAAATCATGGTTGAAAACAACGC
GATCGTTGTTCTGGAAGACCTGAACTTCGGTTTCAAACGTGGTCGTTTCAAAGTT
GAAAAACAGATCTACCAGAAATTCGAAAAAATGCTGATCGACAAACTGAACTACC
TGGTTTTCAAAGACAAAAAAGCGAACGAAGCGGGTGGTGTTCTGAAAGGTTACCA
GCTGGCGGAAAAATTCGAATCTTTCCAGAAAATGGGTAAACAGTCTGGTTTCCTG
TTCTACGTTCCGGCGGCGTACACCTCTAAAATCGACCCGACCACCGGTTTCGTTA
ACATGCTGAACCTGAACTACACCAACATGAAAGACGCGCAGACCCTGCTGTCTGG
TATGGACAAAATCTCTTTCAACGCGGACGCGAACTACTTCGAATTCGAACTGGAC
TACGAAAAATTCAAAACCAACCAGACCGACCACACCAACAAATGGACCATCTGCA
CCGTTGGTGAAAAACGTTTCACCTACAACTCTGCGACCAAAGAAACCACCACCGT
TAACGTTACCGAAGACCTGAAAAAACTGCTGGACAAATTCGAAGTTAAATACTCT
AACGGTGACAACATCAAAGACGAAATCTGCCGTCAGACCGACGCGAAATTCTTCG
AAATCATCCTGTGGCTGCTGAAACTGACCATGCAGATGCGTAACTCTAACACCAA
AACCGAAGAAGACTTCATCCTGTCTCCGGTTAAAAACTCTAACGGTGAATTCTTC
CGTTCTAACGACGACGCGAACGGTATCTGGCCGGCGGACGCGGACGCGAACGGTG
CGTACCACATCGCGCTGAAAGGTCTGTACCTGGTTAAAGAATGCTTCAACAAAAA
CGAAAAATCTCTGAAAATCGAACACAAAAACTGGTTCAAATTCGCGCAGACCCGT
TTCAACGGTTCTCTGACCAAAAACGGTTAAGAAATCATCCTTAGCGAAAGCTAAG
GATTTTTTTTATCTGAAATGTAGGGAGACCCTCAGGTTAAATATTCACTCAGGAA
GTTATTACTCAGGAAGCAAAGAGGATTACA
SEQ AATCAGGAGAGCGTTTTCAATCCTACCTCTGGCGCAGTTGATATGTCAAACAGGT
ID TGCCGTCACTGCGTCTTTTACTGGCTCTTCTCGCTAACCAAACCGGTAACCCCGC
NO: TTATTAAAAGCATTCTGTAACAAAGCGGGACCAAAGCCATGACAAAAACGCGTAA
72 CAAAAGTGTCTATAATCACGGCAGAAAAGTCCACATTGATTATTTGCACGGCGTC
ACACTTTGCTATGCCATAGCATTTTTATCCATAAGATTAGCGGATCCTACCTGAC
GCTTTTTATCGCAACTCTCTACTGTTTCTCCATACCCGTTTTTTTGGGCTAGCAG
TAATACGACTCACTATAGGGGTCTCATCTCGTGTGAGATAGGCGGAGATACGAAC
TTTAAGAGGAGGATATACCATGCACCATCATCATCACCATACCCAGTTCGAAGGT
TTCACCAACCTGTACCAGGTTTCTAAAACCCTGCGTTTCGAACTGATCCCGCAGG
GTAAAACCCTGAAACACATCCAGGAACAGGGTTTCATCGAAGAAGACAAAGCGCG
TAACGACCACTACAAAGAACTGAAACCGATCATCGACCGTATCTACAAAACCTAC
GCGGACCAGTGCCTGCAGCTGGTTCAGCTGGACTGGGAAAACCTGTCTGCGGCGA
TCGACTCTTACCGTAAAGAAAAAACCGAAGAAACCCGTAACGCGCTGATCGAAGA
ACAGGCGACCTACCGTAACGCGATCCACGACTACTTCATCGGTCGTACCGACAAC
CTGACCGACGCGATCAACAAACGTCACGCGGAAATCTACAAAGGTCTGTTCAAAG
CGGAACTGTTCAACGGTAAAGTTCTGAAACAGCTGGGTACCGTTACCACCACCGA
ACACGAAAACGCGCTGCTGCGTTCTTTCGACAAATTCACCACCTACTTCTCTGGT
TTCTACGAAAACCGTAAAAACGTTTTCTCTGCGGAAGACATCTCTACCGCGATCC
CGCACCGTATCGTTCAGGACAACTTCCCGAAATTCAAAGAAAACTGCCACATCTT
CACCCGTCTGATCACCGCGGTTCCGTCTCTGCGTGAACACTTCGAAAACGTTAAA
AAAGCGATCGGTATCTTCGTTTCTACCTCTATCGAAGAAGTTTTCTCTTTCCCGT
TCTACAACCAGCTGCTGACCCAGACCCAGATCGACCTGTACAACCAGCTGCTGGG
TGGTATCTCTCGTGAAGCGGGTACCGAAAAAATCAAAGGTCTGAACGAAGTTCTG
AACCTGGCGATCCAGAAAAACGACGAAACCGCGCACATCATCGCGTCTCTGCCGC
ACCGTTTCATCCCGCTGTTCAAACAGATCCTGTCTGACCGTAACACCCTGTCTTT
CATCCTGGAAGAATTCAAATCTGACGAAGAAGTTATCCAGTCTTTCTGCAAATAC
AAAACCCTGCTGCGTAACGAAAACGTTCTGGAAACCGCGGAAGCGCTGTTCAACG
AACTGAACTCTATCGACCTGACCCACATCTTCATCTCTCACAAAAAACTGGAAAC
CATCTCTTCTGCGCTGTGCGACCACTGGGACACCCTGCGTAACGCGCTGTACGAA
CGTCGTATCTCTGAACTGACCGGTAAAATCACCAAATCTGCGAAAGAAAAAGTTC
AGCGTTCTCTGAAACACGAAGACATCAACCTGCAGGAAATCATCTCTGCGGCGGG
TAAAGAACTGTCTGAAGCGTTCAAACAGAAAACCTCTGAAATCCTGTCTCACGCG
CACGCGGCGCTGGACCAGCCGCTGCCGACCACCCTGAAAAAACAGGAAGAAAAAG
AAATCCTGAAATCTCAGCTGGACTCTCTGCTGGGTCTGTACCACCTGCTGGACTG
GTTCGCGGTTGACGAATCTAACGAAGTTGACCCGGAATTCTCTGCGCGTCTGACC
GGTATCAAACTGGAAATGGAACCGTCTCTGTCTTTCTACAACAAAGCGCGTAACT
ACGCGACCAAAAAACCGTACTCTGTTGAAAAATTCAAACTGAACTTCCAGATGCC
GACCCTGGCGTCTGGTTGGGACGTTAACAAAGAAAAAAACAACGGTGCGATCCTG
TTCGTTAAAAACGGTCTGTACTACCTGGGTATCATGCCGAAACAGAAAGGTCGTT
ACAAAGCGCTGTCTTTCGAACCGACCGAAAAAACCTCTGAAGGTTTCGACAAAAT
GTACTACGACTACTTCCCGGACGCGGCGAAAATGATCCCGAAATGCTCTACCCAG
CTGAAAGCGGTTACCGCGCACTTCCAGACCCACACCACCCCGATCCTGCTGTCTA
ACAACTTCATCGAACCGCTGGAAATCACCAAAGAAATCTACGACCTGAACAACCC
GGAAAAAGAACCGAAAAAATTCCAGACCGCGTACGCGAAAAAAACCGGTGACCAG
AAAGGTTACCGTGAAGCGCTGTGCAAATGGATCGACTTCACCCGTGACTTCCTGT
CTAAATACACCAAAACCACCTCTATCGACCTGTCTTCTCTGCGTCCGTCTTCTCA
GTACAAAGACCTGGGTGAATACTACGCGGAACTGAACCCGCTGCTGTACCACATC
TCTTTCCAGCGTATCGCGGAAAAAGAAATCATGGACGCGGTTGAAACCGGTAAAC
TGTACCTGTTCCAGATCTACAACAAAGACTTCGCGAAAGGTCACCACGGTAAACC
GAACCTGCACACCCTGTACTGGACCGGTCTGTTCTCTCCGGAAAACCTGGCGAAA
ACCTCTATCAAACTGAACGGTCAGGCGGAACTGTTCTACCGTCCGAAATCTCGTA
TGAAACGTATGGCGCACCGTCTGGGTGAAAAAATGCTGAACAAAAAACTGAAAGA
CCAGAAAACCCCGATCCCGGACACCCTGTACCAGGAACTGTACGACTACGTTAAC
CACCGTCTGTCTCACGACCTGTCTGACGAAGCGCGTGCGCTGCTGCCGAACGTTA
TCACCAAAGAAGTTTCTCACGAAATCATCAAAGACCGTCGTTTCACCTCTGACAA
ATTCTTCTTCCACGTTCCGATCACCCTGAACTACCAGGCGGCGAACTCTCCGTCT
AAATTCAACCAGCGTGTTAACGCGTACCTGAAAGAACACCCGGAAACCCCGATCA
TCGGTATCGACCGTGGTGAACGTAACCTGATCTACATCACCGTTATCGACTCTAC
CGGTAAAATCCTGGAACAGCGTTCTCTGAACACCATCCAGCAGTTCGACTACCAG
AAAAAACTGGACAACCGTGAAAAAGAACGTGTTGCGGCGCGTCAGGCGTGGTCTG
TTGTTGGTACCATCAAAGACCTGAAACAGGGTTACCTGTCTCAGGTTATCCACGA
AATCGTTGACCTGATGATCCACTACCAGGCGGTTGTTGTTCTGGAAAACCTGAAC
TTCGGTTTCAAATCTAAACGTACCGGTATCGCGGAAAAAGCGGTTTACCAGCAGT
TCGAAAAAATGCTGATCGACAAACTGAACTGCCTGGTTCTGAAAGACTACCCGGC
GGAAAAAGTTGGTGGTGTTCTGAACCCGTACCAGCTGACCGACCAGTTCACCTCT
TTCGCGAAAATGGGTACCCAGTCTGGTTTCCTGTTCTACGTTCCGGCGCCGTACA
CCTCTAAAATCGACCCGCTGACCGGTTTCGTTGACCCGTTCGTTTGGAAAACCAT
CAAAAACCACGAATCTCGTAAACACTTCCTGGAAGGTTTCGACTTCCTGCACTAC
GACGTTAAAACCGGTGACTTCATCCTGCACTTCAAAATGAACCGTAACCTGTCTT
TCCAGCGTGGTCTGCCGGGTTTCATGCCGGCGTGGGACATCGTTTTCGAAAAAAA
CGAAACCCAGTTCGACGCGAAAGGTACCCCGTTCATCGCGGGTAAACGTATCGTT
CCGGTTATCGAAAACCACCGTTTCACCGGTCGTTACCGTGACCTGTACCCGGCGA
ACGAACTGATCGCGCTGCTGGAAGAAAAAGGTATCGTTTTCCGTGACGGTTCTAA
CATCCTGCCGAAACTGCTGGAAAACGACGACTCTCACGCGATCGACACCATGGTT
GCGCTGATCCGTTCTGTTCTGCAGATGCGTAACTCTAACGCGGCGACCGGTGAAG
ACTACATCAACTCTCCGGTTCGTGACCTGAACGGTGTTTGCTTCGACTCTCGTTT
CCAGAACCCGGAATGGCCGATGGACGCGGACGCGAACGGTGCGTACCACATCGCG
CTGAAAGGTCAGCTGCTGCTGAACCACCTGAAAGAATCTAAAGACCTGAAACTGC
AGAACGGTATCTCTAACCAGGACTGGCTGGCGTACATCCAGGAACTGCGTAACTA
GAAATCATCCTTAGCGAAAGCTAAGGATTTTTTTTATCTGAAATGTAGGGAGACC
CTCAGGTTAAATATTCACTCAGGAAGTTATTACTCAGGAAGCAAAGAGGATTACA
SEQ AATCAGGAGAGCGTTTTCAATCCTACCTCTGGCGCAGTTGATATGTCAAACAGGT
ID TGCCGTCACTGCGTCTTTTACTGGCTCTTCTCGCTAACCAAACCGGTAACCCCGC
NO: TTATTAAAAGCATTCTGTAACAAAGCGGGACCAAAGCCATGACAAAAACGCGTAA
73 CAAAAGTGTCTATAATCACGGCAGAAAAGTCCACATTGATTATTTGCACGGCGTC
ACACTTTGCTATGCCATAGCATTTTTATCCATAAGATTAGCGGATCCTACCTGAC
GCTTTTTATCGCAACTCTCTACTGTTTCTCCATACCCGTTTTTTTGGGCTAGCAG
TAATACGACTCACTATAGGGGTCTCATCTCGTGTGAGATAGGCGGAGATACGAAC
TTTAAGAGGAGGATATACCATGCACCATCATCATCACCATGCGGTTAAATCTATC
AAAGTTAAACTGCGTCTGGACGACATGCCGGAAATCCGTGCGGGTCTGTGGAAAC
TGCACAAAGAAGTTAACGCGGGTGTTCGTTACTACACCGAATGGCTGTCTCTGCT
GCGTCAGGAAAACCTGTACCGTCGTTCTCCGAACGGTGACGGTGAACAGGAATGC
GACAAAACCGCGGAAGAATGCAAAGCGGAACTGCTGGAACGTCTGCGTGCGCGTC
AGGTTGAAAACGGTCACCGTGGTCCGGCGGGTTCTGACGACGAACTGCTGCAGCT
GGCGCGTCAGCTGTACGAACTGCTGGTTCCGCAGGCGATCGGTGCGAAAGGTGAC
GCGCAGCAGATCGCGCGTAAATTCCTGTCTCCGCTGGCGGACAAAGACGCGGTTG
GTGGTCTGGGTATCGCGAAAGCGGGTAACAAACCGCGTTGGGTTCGTATGCGTGA
AGCGGGTGAACCGGGTTGGGAAGAAGAAAAAGAAAAAGCGGAAACCCGTAAATCT
GCGGACCGTACCGCGGACGTTCTGCGTGCGCTGGCGGACTTCGGTCTGAAACCGC
TGATGCGTGTTTACACCGACTCTGAAATGTCTTCTGTTGAATGGAAACCGCTGCG
TAAAGGTCAGGCGGTTCGTACCTGGGACCGTGACATGTTCCAGCAGGCGATCGAA
CGTATGATGTCTTGGGAATCTTGGAACCAGCGTGTTGGTCAGGAATACGCGAAAC
TGGTTGAACAGAAAAACCGTTTCGAACAGAAAAACTTCGTTGGTCAGGAACACCT
GGTTCACCTGGTTAACCAGCTGCAGCAGGACATGAAAGAAGCGTCTCCGGGTCTG
GAATCTAAAGAACAGACCGCGCACTACGTTACCGGTCGTGCGCTGCGTGGTTCTG
ACAAAGTTTTCGAAAAATGGGGTAAACTGGCGCCGGACGCGCCGTTCGACCTGTA
CGACGCGGAAATCAAAAACGTTCAGCGTCGTAACACCCGTCGTTTCGGTTCTCAC
GACCTGTTCGCGAAACTGGCGGAACCGGAATACCAGGCGCTGTGGCGTGAAGACG
CGTCTTTCCTGACCCGTTACGCGGTTTACAACTCTATCCTGCGTAAACTGAACCA
CGCGAAAATGTTCGCGACCTTCACCCTGCCGGACGCGACCGCGCACCCGATCTGG
ACCCGTTTCGACAAACTGGGTGGTAACCTGCACCAGTACACCTTCCTGTTCAACG
AATTCGGTGAACGTCGTCACGCGATCCGTTTCCACAAACTGCTGAAAGTTGAAAA
CGGTGTTGCGCGTGAAGTTGACGACGTTACCGTTCCGATCTCTATGTCTGAACAG
CTGGACAACCTGCTGCCGCGTGACCCGAACGAACCGATCGCGCTGTACTTCCGTG
ACTACGGTGCGGAACAGCACTTCACCGGTGAATTCGGTGGTGCGAAAATCCAGTG
CCGTCGTGACCAGCTGGCGCACATGCACCGTCGTCGTGGTGCGCGTGACGTTTAC
CTGAACGTTTCTGTTCGTGTTCAGTCTCAGTCTGAAGCGCGTGGTGAACGTCGTC
CGCCGTACGCGGCGGTTTTCCGTCTGGTTGGTGACAACCACCGTGCGTTCGTTCA
CTTCGACAAACTGTCTGACTACCTGGCGGAACACCCGGACGACGGTAAACTGGGT
TCTGAAGGTCTGCTGTCTGGTCTGCGTGTTATGTCTGTTGACCTGGGTCTGCGTA
CCTCTGCGTCTATCTCTGTTTTCCGTGTTGCGCGTAAAGACGAACTGAAACCGAA
CTCTAAAGGTCGTGTTCCGTTCTTCTTCCCGATCAAAGGTAACGACAACCTGGTT
GCGGTTCACGAACGTTCTCAGCTGCTGAAACTGCCGGGTGAAACCGAATCTAAAG
ACCTGCGTGCGATCCGTGAAGAACGTCAGCGTACCCTGCGTCAGCTGCGTACCCA
GCTGGCGTACCTGCGTCTGCTGGTTCGTTGCGGTTCTGAAGACGTTGGTCGTCGT
GAACGTTCTTGGGCGAAACTGATCGAACAGCCGGTTGACGCGGCGAACCACATGA
CCCCGGACTGGCGTGAAGCGTTCGAAAACGAACTGCAGAAACTGAAATCTCTGCA
CGGTATCTGCTCTGACAAAGAATGGATGGACGCGGTTTACGAATCTGTTCGTCGT
GTTTGGCGTCACATGGGTAAACAGGTTCGTGACTGGCGTAAAGACGTTCGTTCTG
GTGAACGTCCGAAAATCCGTGGTTACGCGAAAGACGTTGTTGGTGGTAACTCTAT
CGAACAGATCGAATACCTGGAACGTCAGTACAAATTCCTGAAATCTTGGTCTTTC
TTCGGTAAAGTTTCTGGTCAGGTTATCCGTGCGGAAAAAGGTTCTCGTTTCGCGA
TCACCCTGCGTGAACACATCGACCACGCGAAAGAAGACCGTCTGAAAAAACTGGC
GGACCGTATCATCATGGAAGCGCTGGGTTACGTTTACGCGCTGGACGAACGTGGT
AAAGGTAAATGGGTTGCGAAATACCCGCCGTGCCAGCTGATCCTGCTGGAAGAAC
TGTCTGAATACCAGTTCAACAACGACCGTCCGCCGTCTGAAAACAACCAGCTGAT
GCAGTGGTCTCACCGTGGTGTTTTCCAGGAACTGATCAACCAGGCGCAGGTTCAC
GACCTGCTGGTTGGTACCATGTACGCGGCGTTCTCTTCTCGTTTCGACGCGCGTA
CCGGTGCGCCGGGTATCCGTTGCCGTCGTGTTCCGGCGCGTTGCACCCAGGAACA
CAACCCGGAACCGTTCCCGTGGTGGCTGAACAAATTCGTTGTTGAACACACCCTG
GACGCGTGCCCGCTGCGTGCGGACGACCTGATCCCGACCGGTGAAGGTGAAATCT
TCGTTTCTCCGTTCTCTGCGGAAGAAGGTGACTTCCACCAGATCCACGCGGACCT
GAACGCGGCGCAGAACCTGCAGCAGCGTCTGTGGTCTGACTTCGACATCTCTCAG
ATCCGTCTGCGTTGCGACTGGGGTGAAGTTGACGGTGAACTGGTTCTGATCCCGC
GTCTGACCGGTAAACGTACCGCGGACTCTTACTCTAACAAAGTTTTCTACACCAA
CACCGGTGTTACCTACTACGAACGTGAACGTGGTAAAAAACGTCGTAAAGTTTTC
GCGCAGGAAAAACTGTCTGAAGAAGAAGCGGAACTGCTGGTTGAAGCGGACGAAG
CGCGTGAAAAATCTGTTGTTCTGATGCGTGACCCGTCTGGTATCATCAACCGTGG
TAACTGGACCCGTCAGAAAGAATTCTGGTCTATGGTTAACCAGCGTATCGAAGGT
TACCTGGTTAAACAGATCCGTTCTCGTGTTCCGCTGCAGGACTCTGCGTGCGAAA
ACACCGGTGACATCTAAGAAATCATCCTTAGCGAAAGCTAAGGATTTTTTTTATC
TGAAATGTAGGGAGACCCTCAGGTTAAATATTCACTCAGGAAGTTATTACTCAGG
AAGCAAAGAGGATTACA
SEQ AATCAGGAGAGCGTTTTCAATCCTACCTCTGGCGCAGTTGATATGTCAAACAGGT
ID TGCCGTCACTGCGTCTTTTACTGGCTCTTCTCGCTAACCAAACCGGTAACCCCGC
NO: TTATTAAAAGCATTCTGTAACAAAGCGGGACCAAAGCCATGACAAAAACGCGTAA
74 CAAAAGTGTCTATAATCACGGCAGAAAAGTCCACATTGATTATTTGCACGGCGTC
ACACTTTGCTATGCCATAGCATTTTTATCCATAAGATTAGCGGATCCTACCTGAC
GCTTTTTATCGCAACTCTCTACTGTTTCTCCATACCCGTTTTTTTGGGCTAGCAG
TAATACGACTCACTATAGGGGTCTCATCTCGTGTGAGATAGGCGGAGATACGAAC
TTTAAGAGGAGGATATACCATGCACCATCATCATCACCATGCGACCCGTTCTTTC
ATCCTGAAAATCGAACCGAACGAAGAAGTTAAAAAAGGTCTGTGGAAAACCCACG
AAGTTCTGAACCACGGTATCGCGTACTACATGAACATCCTGAAACTGATCCGTCA
GGAAGCGATCTACGAACACCACGAACAGGACCCGAAAAACCCGAAAAAAGTTTCT
AAAGCGGAAATCCAGGCGGAACTGTGGGACTTCGTTCTGAAAATGCAGAAATGCA
ACTCTTTCACCCACGAAGTTGACAAAGACGTTGTTTTCAACATCCTGCGTGAACT
GTACGAAGAACTGGTTCCGTCTTCTGTTGAAAAAAAAGGTGAAGCGAACCAGCTG
TCTAACAAATTCCTGTACCCGCTGGTTGACCCGAACTCTCAGTCTGGTAAAGGTA
CCGCGTCTTCTGGTCGTAAACCGCGTTGGTACAACCTGAAAATCGCGGGTGACCC
GTCTTGGGAAGAAGAAAAAAAAAAATGGGAAGAAGACAAAAAAAAAGACCCGCTG
GCGAAAATCCTGGGTAAACTGGCGGAATACGGTCTGATCCCGCTGTTCATCCCGT
TCACCGACTCTAACGAACCGATCGTTAAAGAAATCAAATGGATGGAAAAATCTCG
TAACCAGTCTGTTCGTCGTCTGGACAAAGACATGTTCATCCAGGCGCTGGAACGT
TTCCTGTCTTGGGAATCTTGGAACCTGAAAGTTAAAGAAGAATACGAAAAAGTTG
AAAAAGAACACAAAACCCTGGAAGAACGTATCAAAGAAGACATCCAGGCGTTCAA
ATCTCTGGAACAGTACGAAAAAGAACGTCAGGAACAGCTGCTGCGTGACACCCTG
AACACCAACGAATACCGTCTGTCTAAACGTGGTCTGCGTGGTTGGCGTGAAATCA
TCCAGAAATGGCTGAAAATGGACGAAAACGAACCGTCTGAAAAATACCTGGAAGT
TTTCAAAGACTACCAGCGTAAACACCCGCGTGAAGCGGGTGACTACTCTGTTTAC
GAATTCCTGTCTAAAAAAGAAAACCACTTCATCTGGCGTAACCACCCGGAATACC
CGTACCTGTACGCGACCTTCTGCGAAATCGACAAAAAAAAAAAAGACGCGAAACA
GCAGGCGACCTTCACCCTGGCGGACCCGATCAACCACCCGCTGTGGGTTCGTTTC
GAAGAACGTTCTGGTTCTAACCTGAACAAATACCGTATCCTGACCGAACAGCTGC
ACACCGAAAAACTGAAAAAAAAACTGACCGTTCAGCTGGACCGTCTGATCTACCC
GACCGAATCTGGTGGTTGGGAAGAAAAAGGTAAAGTTGACATCGTTCTGCTGCCG
TCTCGTCAGTTCTACAACCAGATCTTCCTGGACATCGAAGAAAAAGGTAAACACG
CGTTCACCTACAAAGACGAATCTATCAAATTCCCGCTGAAAGGTACCCTGGGTGG
TGCGCGTGTTCAGTTCGACCGTGACCACCTGCGTCGTTACCCGCACAAAGTTGAA
TCTGGTAACGTTGGTCGTATCTACTTCAACATGACCGTTAACATCGAACCGACCG
AATCTCCGGTTTCTAAATCTCTGAAAATCCACCGTGACGACTTCCCGAAATTCGT
TAACTTCAAACCGAAAGAACTGACCGAATGGATCAAAGACTCTAAAGGTAAAAAA
CTGAAATCTGGTATCGAATCTCTGGAAATCGGTCTGCGTGTTATGTCTATCGACC
TGGGTCAGCGTCAGGCGGCGGCGGCGTCTATCTTCGAAGTTGTTGACCAGAAACC
GGACATCGAAGGTAAACTGTTCTTCCCGATCAAAGGTACCGAACTGTACGCGGTT
CACCGTGCGTCTTTCAACATCAAACTGCCGGGTGAAACCCTGGTTAAATCTCGTG
AAGTTCTGCGTAAAGCGCGTGAAGACAACCTGAAACTGATGAACCAGAAACTGAA
CTTCCTGCGTAACGTTCTGCACTTCCAGCAGTTCGAAGACATCACCGAACGTGAA
AAACGTGTTACCAAATGGATCTCTCGTCAGGAAAACTCTGACGTTCCGCTGGTTT
ACCAGGACGAACTGATCCAGATCCGTGAACTGATGTACAAACCGTACAAAGACTG
GGTTGCGTTCCTGAAACAGCTGCACAAACGTCTGGAAGTTGAAATCGGTAAAGAA
GTTAAACACTGGCGTAAATCTCTGTCTGACGGTCGTAAAGGTCTGTACGGTATCT
CTCTGAAAAACATCGACGAAATCGACCGTACCCGTAAATTCCTGCTGCGTTGGTC
TCTGCGTCCGACCGAACCGGGTGAAGTTCGTCGTCTGGAACCGGGTCAGCGTTTC
GCGATCGACCAGCTGAACCACCTGAACGCGCTGAAAGAAGACCGTCTGAAAAAAA
TGGCGAACACCATCATCATGCACGCGCTGGGTTACTGCTACGACGTTCGTAAAAA
AAAATGGCAGGCGAAAAACCCGGCGTGCCAGATCATCCTGTTCGAAGACCTGTCT
AACTACAACCCGTACGAAGAACGTTCTCGTTTCGAAAACTCTAAACTGATGAAAT
GGTCTCGTCGTGAAATCCCGCGTCAGGTTGCGCTGCAGGGTGAAATCTACGGTCT
GCAGGTTGGTGAAGTTGGTGCGCAGTTCTCTTCTCGTTTCCACGCGAAAACCGGT
TCTCCGGGTATCCGTTGCTCTGTTGTTACCAAAGAAAAACTGCAGGACAACCGTT
TCTTCAAAAACCTGCAGCGTGAAGGTCGTCTGACCCTGGACAAAATCGCGGTTCT
GAAAGAAGGTGACCTGTACCCGGACAAAGGTGGTGAAAAATTCATCTCTCTGTCT
AAAGACCGTAAACTGGTTACCACCCACGCGGACATCAACGCGGCGCAGAACCTGC
AGAAACGTTTCTGGACCCGTACCCACGGTTTCTACAAAGTTTACTGCAAAGCGTA
CCAGGTTGACGGTCAGACCGTTTACATCCCGGAATCTAAAGACCAGAAACAGAAA
ATCATCGAAGAATTCGGTGAAGGTTACTTCATCCTGAAAGACGGTGTTTACGAAT
GGGGTAACGCGGGTAAACTGAAAATCAAAAAAGGTTCTTCTAAACAGTCTTCTTC
TGAACTGGTTGACTCTGACATCCTGAAAGACTCTTTCGACCTGGCGTCTGAACTG
AAAGGTGAAAAACTGATGCTGTACCGTGACCCGTCTGGTAACGTTTTCCCGTCTG
ACAAATGGATGGCGGCGGGTGTTTTCTTCGGTAAACTGGAACGTATCCTGATCTC
TAAACTGACCAACCAGTACTCTATCTCTACCATCGAAGACGACTCTTCTAAACAG
TCTATGTAAGAAATCATCCTTAGCGAAAGCTAAGGATTTTTTTTATCTGAAATGT
AGGGAGACCCTCAGGTTAAATATTCACTCAGGAAGTTATTACTCAGGAAGCAAAG
AGGATTACA
SEQ AATCAGGAGAGCGTTTTCAATCCTACCTCTGGCGCAGTTGATATGTCAAACAGGT
ID TGCCGTCACTGCGTCTTTTACTGGCTCTTCTCGCTAACCAAACCGGTAACCCCGC
NO: TTATTAAAAGCATTCTGTAACAAAGCGGGACCAAAGCCATGACAAAAACGCGTAA
75 CAAAAGTGTCTATAATCACGGCAGAAAAGTCCACATTGATTATTTGCACGGCGTC
ACACTTTGCTATGCCATAGCATTTTTATCCATAAGATTAGCGGATCCTACCTGAC
GCTTTTTATCGCAACTCTCTACTGTTTCTCCATACCCGTTTTTTTGGGCTAGCAG
TAATACGACTCACTATAGGGGTCTCATCTCGTGTGAGATAGGCGGAGATACGAAC
TTTAAGAGGAGGATATACCATGCACCATCATCATCACCATCCGACCCGTACCATC
AACCTGAAACTGGTTCTGGGTAAAAACCCGGAAAACGCGACCCTGCGTCGTGCGC
TGTTCTCTACCCACCGTCTGGTTAACCAGGCGACCAAACGTATCGAAGAATTCCT
GCTGCTGTGCCGTGGTGAAGCGTACCGTACCGTTGACAACGAAGGTAAAGAAGCG
GAAATCCCGCGTCACGCGGTTCAGGAAGAAGCGCTGGCGTTCGCGAAAGCGGCGC
AGCGTCACAACGGTTGCATCTCTACCTACGAAGACCAGGAAATCCTGGACGTTCT
GCGTCAGCTGTACGAACGTCTGGTTCCGTCTGTTAACGAAAACAACGAAGCGGGT
GACGCGCAGGCGGCGAACGCGTGGGTTTCTCCGCTGATGTCTGCGGAATCTGAAG
GTGGTCTGTCTGTTTACGACAAAGTTCTGGACCCGCCGCCGGTTTGGATGAAACT
GAAAGAAGAAAAAGCGCCGGGTTGGGAAGCGGCGTCTCAGATCTGGATCCAGTCT
GACGAAGGTCAGTCTCTGCTGAACAAACCGGGTTCTCCGCCGCGTTGGATCCGTA
AACTGCGTTCTGGTCAGCCGTGGCAGGACGACTTCGTTTCTGACCAGAAAAAAAA
ACAGGACGAACTGACCAAAGGTAACGCGCCGCTGATCAAACAGCTGAAAGAAATG
GGTCTGCTGCCGCTGGTTAACCCGTTCTTCCGTCACCTGCTGGACCCGGAAGGTA
AAGGTGTTTCTCCGTGGGACCGTCTGGCGGTTCGTGCGGCGGTTGCGCACTTCAT
CTCTTGGGAATCTTGGAACCACCGTACCCGTGCGGAATACAACTCTCTGAAACTG
CGTCGTGACGAATTCGAAGCGGCGTCTGACGAATTCAAAGACGACTTCACCCTGC
TGCGTCAGTACGAAGCGAAACGTCACTCTACCCTGAAATCTATCGCGCTGGCGGA
CGACTCTAACCCGTACCGTATCGGTGTTCGTTCTCTGCGTGCGTGGAACCGTGTT
CGTGAAGAATGGATCGACAAAGGTGCGACCGAAGAACAGCGTGTTACCATCCTGT
CTAAACTGCAGACCCAGCTGCGTGGTAAATTCGGTGACCCGGACCTGTTCAACTG
GCTGGCGCAGGACCGTCACGTTCACCTGTGGTCTCCGCGTGACTCTGTTACCCCG
CTGGTTCGTATCAACGCGGTTGACAAAGTTCTGCGTCGTCGTAAACCGTACGCGC
TGATGACCTTCGCGCACCCGCGTTTCCACCCGCGTTGGATCCTGTACGAAGCGCC
GGGTGGTTCTAACCTGCGTCAGTACGCGCTGGACTGCACCGAAAACGCGCTGCAC
ATCACCCTGCCGCTGCTGGTTGACGACGCGCACGGTACCTGGATCGAAAAAAAAA
TCCGTGTTCCGCTGGCGCCGTCTGGTCAGATCCAGGACCTGACCCTGGAAAAACT
GGAAAAAAAAAAAAACCGTCTGTACTACCGTTCTGGTTTCCAGCAGTTCGCGGGT
CTGGCGGGTGGTGCGGAAGTTCTGTTCCACCGTCCGTACATGGAACACGACGAAC
GTTCTGAAGAATCTCTGCTGGAACGTCCGGGTGCGGTTTGGTTCAAACTGACCCT
GGACGTTGCGACCCAGGCGCCGCCGAACTGGCTGGACGGTAAAGGTCGTGTTCGT
ACCCCGCCGGAAGTTCACCACTTCAAAACCGCGCTGTCTAACAAATCTAAACACA
CCCGTACCCTGCAGCCGGGTCTGCGTGTTCTGTCTGTTGACCTGGGTATGCGTAC
CTTCGCGTCTTGCTCTGTTTTCGAACTGATCGAAGGTAAACCGGAAACCGGTCGT
GCGTTCCCGGTTGCGGACGAACGTTCTATGGACTCTCCGAACAAACTGTGGGCGA
AACACGAACGTTCTTTCAAACTGACCCTGCCGGGTGAAACCCCGTCTCGTAAAGA
AGAAGAAGAACGTTCTATCGCGCGTGCGGAAATCTACGCGCTGAAACGTGACATC
CAGCGTCTGAAATCTCTGCTGCGTCTGGGTGAAGAAGACAACGACAACCGTCGTG
ACGCGCTGCTGGAACAGTTCTTCAAAGGTTGGGGTGAAGAAGACGTTGTTCCGGG
TCAGGCGTTCCCGCGTTCTCTGTTCCAGGGTCTGGGTGCGGCGCCGTTCCGTTCT
ACCCCGGAACTGTGGCGTCAGCACTGCCAGACCTACTACGACAAAGCGGAAGCGT
GCCTGGCGAAACACATCTCTGACTGGCGTAAACGTACCCGTCCGCGTCCGACCTC
TCGTGAAATGTGGTACAAAACCCGTTCTTACCACGGTGGTAAATCTATCTGGATG
CTGGAATACCTGGACGCGGTTCGTAAACTGCTGCTGTCTTGGTCTCTGCGTGGTC
GTACCTACGGTGCGATCAACCGTCAGGACACCGCGCGTTTCGGTTCTCTGGCGTC
TCGTCTGCTGCACCACATCAACTCTCTGAAAGAAGACCGTATCAAAACCGGTGCG
GACTCTATCGTTCAGGCGGCGCGTGGTTACATCCCGCTGCCGCACGGTAAAGGTT
GGGAACAGCGTTACGAACCGTGCCAGCTGATCCTGTTCGAAGACCTGGCGCGTTA
CCGTTTCCGTGTTGACCGTCCGCGTCGTGAAAACTCTCAGCTGATGCAGTGGAAC
CACCGTGCGATCGTTGCGGAAACCACCATGCAGGCGGAACTGTACGGTCAGATCG
TTGAAAACACCGCGGCGGGTTTCTCTTCTCGTTTCCACGCGGCGACCGGTGCGCC
GGGTGTTCGTTGCCGTTTCCTGCTGGAACGTGACTTCGACAACGACCTGCCGAAA
CCGTACCTGCTGCGTGAACTGTCTTGGATGCTGGGTAACACCAAAGTTGAATCTG
AAGAAGAAAAACTGCGTCTGCTGTCTGAAAAAATCCGTCCGGGTTCTCTGGTTCC
GTGGGACGGTGGTGAACAGTTCGCGACCCTGCACCCGAAACGTCAGACCCTGTGC
GTTATCCACGCGGACATGAACGCGGCGCAGAACCTGCAGCGTCGTTTCTTCGGTC
GTTGCGGTGAAGCGTTCCGTCTGGTTTGCCAGCCGCACGGTGACGACGTTCTGCG
TCTGGCGTCTACCCCGGGTGCGCGTCTGCTGGGTGCGCTGCAGCAGCTGGAAAAC
GGTCAGGGTGCGTTCGAACTGGTTCGTGACATGGGTTCTACCTCTCAGATGAACC
GTTTCGTTATGAAATCTCTGGGTAAAAAAAAAATCAAACCGCTGCAGGACAACAA
CGGTGACGACGAACTGGAAGACGTTCTGTCTGTTCTGCCGGAAGAAGACGACACC
GGTCGTATCACCGTTTTCCGTGACTCTTCTGGTATCTTCTTCCCGTGCAACGTTT
GGATCCCGGCGAAACAGTTCTGGCCGGCGGTTCGTGCGATGATCTGGAAAGTTAT
GGCGTCTCACTCTCTGGGTTAAGAAATCATCCTTAGCGAAAGCTAAGGATTTTTT
TTATCTGAAATGTAGGGAGACCCTCAGGTTAAATATTCACTCAGGAAGTTATTAC
TCAGGAAGCAAAGAGGATTACA
SEQ AATCAGGAGAGCGTTTTCAATCCTACCTCTGGCGCAGTTGATATGTCAAACAGGT
ID TGCCGTCACTGCGTCTTTTACTGGCTCTTCTCGCTAACCAAACCGGTAACCCCGC
NO: TTATTAAAAGCATTCTGTAACAAAGCGGGACCAAAGCCATGACAAAAACGCGTAA
76 CAAAAGTGTCTATAATCACGGCAGAAAAGTCCACATTGATTATTTGCACGGCGTC
ACACTTTGCTATGCCATAGCATTTTTATCCATAAGATTAGCGGATCCTACCTGAC
GCTTTTTATCGCAACTCTCTACTGTTTCTCCATACCCGTTTTTTTGGGCTAGCAG
TAATACGACTCACTATAGGGGTCTCATCTCGTGTGAGATAGGCGGAGATACGAAC
TTTAAGAGGAGGATATACCATGCACCATCATCATCACCATACCAAACTGCGTCAC
CGTCAGAAAAAACTGACCCACGACTGGGCGGGTTCTAAAAAACGTGAAGTTCTGG
GTTCTAACGGTAAACTGCAGAACCCGCTGCTGATGCCGGTTAAAAAAGGTCAGGT
TACCGAATTCCGTAAAGCGTTCTCTGCGTACGCGCGTGCGACCAAAGGTGAAATG
ACCGACGGTCGTAAAAACATGTTCACCCACTCTTTCGAACCGTTCAAAACCAAAC
CGTCTCTGCACCAGTGCGAACTGGCGGACAAAGCGTACCAGTCTCTGCACTCTTA
CCTGCCGGGTTCTCTGGCGCACTTCCTGCTGTCTGCGCACGCGCTGGGTTTCCGT
ATCTTCTCTAAATCTGGTGAAGCGACCGCGTTCCAGGCGTCTTCTAAAATCGAAG
CGTACGAATCTAAACTGGCGTCTGAACTGGCGTGCGTTGACCTGTCTATCCAGAA
CCTGACCATCTCTACCCTGTTCAACGCGCTGACCACCTCTGTTCGTGGTAAAGGT
GAAGAAACCTCTGCGGACCCGCTGATCGCGCGTTTCTACACCCTGCTGACCGGTA
AACCGCTGTCTCGTGACACCCAGGGTCCGGAACGTGACCTGGCGGAAGTTATCTC
TCGTAAAATCGCGTCTTCTTTCGGTACCTGGAAAGAAATGACCGCGAACCCGCTG
CAGTCTCTGCAGTTCTTCGAAGAAGAACTGCACGCGCTGGACGCGAACGTTTCTC
TGTCTCCGGCGTTCGACGTTCTGATCAAAATGAACGACCTGCAGGGTGACCTGAA
AAACCGTACCATCGTTTTCGACCCGGACGCGCCGGTTTTCGAATACAACGCGGAA
GACCCGGCGGACATCATCATCAAACTGACCGCGCGTTACGCGAAAGAAGCGGTTA
TCAAAAACCAGAACGTTGGTAACTACGTTAAAAACGCGATCACCACCACCAACGC
GAACGGTCTGGGTTGGCTGCTGAACAAAGGTCTGTCTCTGCTGCCGGTTTCTACC
GACGACGAACTGCTGGAATTCATCGGTGTTGAACGTTCTCACCCGTCTTGCCACG
CGCTGATCGAACTGATCGCGCAGCTGGAAGCGCCGGAACTGTTCGAAAAAAACGT
TTTCTCTGACACCCGTTCTGAAGTTCAGGGTATGATCGACTCTGCGGTTTCTAAC
CACATCGCGCGTCTGTCTTCTTCTCGTAACTCTCTGTCTATGGACTCTGAAGAAC
TGGAACGTCTGATCAAATCTTTCCAGATCCACACCCCGCACTGCTCTCTGTTCAT
CGGTGCGCAGTCTCTGTCTCAGCAGCTGGAATCTCTGCCGGAAGCGCTGCAGTCT
GGTGTTAACTCTGCGGACATCCTGCTGGGTTCTACCCAGTACATGCTGACCAACT
CTCTGGTTGAAGAATCTATCGCGACCTACCAGCGTACCCTGAACCGTATCAACTA
CCTGTCTGGTGTTGCGGGTCAGATCAACGGTGCGATCAAACGTAAAGCGATCGAC
GGTGAAAAAATCCACCTGCCGGCGGCGTGGTCTGAACTGATCTCTCTGCCGTTCA
TCGGTCAGCCGGTTATCGACGTTGAATCTGACCTGGCGCACCTGAAAAACCAGTA
CCAGACCCTGTCTAACGAATTCGACACCCTGATCTCTGCGCTGCAGAAAAACTTC
GACCTGAACTTCAACAAAGCGCTGCTGAACCGTACCCAGCACTTCGAAGCGATGT
GCCGTTCTACCAAAAAAAACGCGCTGTCTAAACCGGAAATCGTTTCTTACCGTGA
CCTGCTGGCGCGTCTGACCTCTTGCCTGTACCGTGGTTCTCTGGTTCTGCGTCGT
GCGGGTATCGAAGTTCTGAAAAAACACAAAATCTTCGAATCTAACTCTGAACTGC
GTGAACACGTTCACGAACGTAAACACTTCGTTTTCGTTTCTCCGCTGGACCGTAA
AGCGAAAAAACTGCTGCGTCTGACCGACTCTCGTCCGGACCTGCTGCACGTTATC
GACGAAATCCTGCAGCACGACAACCTGGAAAACAAAGACCGTGAATCTCTGTGGC
TGGTTCGTTCTGGTTACCTGCTGGCGGGTCTGCCGGACCAGCTGTCTTCTTCTTT
CATCAACCTGCCGATCATCACCCAGAAAGGTGACCGTCGTCTGATCGACCTGATC
CAGTACGACCAGATCAACCGTGACGCGTTCGTTATGCTGGTTACCTCTGCGTTCA
AATCTAACCTGTCTGGTCTGCAGTACCGTGCGAACAAACAGTCTTTCGTTGTTAC
CCGTACCCTGTCTCCGTACCTGGGTTCTAAACTGGTTTACGTTCCGAAAGACAAA
GACTGGCTGGTTCCGTCTCAGATGTTCGAAGGTCGTTTCGCGGACATCCTGCAGT
CTGACTACATGGTTTGGAAAGACGCGGGTCGTCTGTGCGTTATCGACACCGCGAA
ACACCTGTCTAACATCAAAAAATCTGTTTTCTCTTCTGAAGAAGTTCTGGCGTTC
CTGCGTGAACTGCCGCACCGTACCTTCATCCAGACCGAAGTTCGTGGTCTGGGTG
TTAACGTTGACGGTATCGCGTTCAACAACGGTGACATCCCGTCTCTGAAAACCTT
CTCTAACTGCGTTCAGGTTAAAGTTTCTCGTACCAACACCTCTCTGGTTCAGACC
CTGAACCGTTGGTTCGAAGGTGGTAAAGTTTCTCCGCCGTCTATCCAGTTCGAAC
GTGCGTACTACAAAAAAGACGACCAGATCCACGAAGACGCGGCGAAACGTAAAAT
CCGTTTCCAGATGCCGGCGACCGAACTGGTTCACGCGTCTGACGACGCGGGTTGG
ACCCCGTCTTACCTGCTGGGTATCGACCCGGGTGAATACGGTATGGGTCTGTCTC
TGGTTTCTATCAACAACGGTGAAGTTCTGGACTCTGGTTTCATCCACATCAACTC
TCTGATCAACTTCGCGTCTAAAAAATCTAACCACCAGACCAAAGTTGTTCCGCGT
CAGCAGTACAAATCTCCGTACGCGAACTACCTGGAACAGTCTAAAGACTCTGCGG
CGGGTGACATCGCGCACATCCTGGACCGTCTGATCTACAAACTGAACGCGCTGCC
GGTTTTCGAAGCGCTGTCTGGTAACTCTCAGTCTGCGGCGGACCAGGTTTGGACC
AAAGTTCTGTCTTTCTACACCTGGGGTGACAACGACGCGCAGAACTCTATCCGTA
AACAGCACTGGTTCGGTGCGTCTCACTGGGACATCAAAGGTATGCTGCGTCAGCC
GCCGACCGAAAAAAAACCGAAACCGTACATCGCGTTCCCGGGTTCTCAGGTTTCT
TCTTACGGTAACTCTCAGCGTTGCTCTTGCTGCGGTCGTAACCCGATCGAACAGC
TGCGTGAAATGGCGAAAGACACCTCTATCAAAGAACTGAAAATCCGTAACTCTGA
AATCCAGCTGTTCGACGGTACCATCAAACTGTTCAACCCGGACCCGTCTACCGTT
ATCGAACGTCGTCGTCACAACCTGGGTCCGTCTCGTATCCCGGTTGCGGACCGTA
CCTTCAAAAACATCTCTCCGTCTTCTCTGGAATTCAAAGAACTGATCACCATCGT
TTCTCGTTCTATCCGTCACTCTCCGGAATTCATCGCGAAAAAACGTGGTATCGGT
TCTGAATACTTCTGCGCGTACTCTGACTGCAACTCTTCTCTGAACTCTGAAGCGA
ACGCGGCGGCGAACGTTGCGCAGAAATTCCAGAAACAGCTGTTCTTCGAACTGTA
AGAAATCATCCTTAGCGAAAGCTAAGGATTTTTTTTATCTGAAATGTAGGGAGAC
CCTCAGGTTAAATATTCACTCAGGAAGTTATTACTCAGGAAGCAAAGAGGATTAC
A
SEQ AATCAGGAGAGCGTTTTCAATCCTACCTCTGGCGCAGTTGATATGTCAAACAGGT
ID TGCCGTCACTGCGTCTTTTACTGGCTCTTCTCGCTAACCAAACCGGTAACCCCGC
NO: TTATTAAAAGCATTCTGTAACAAAGCGGGACCAAAGCCATGACAAAAACGCGTAA
77 CAAAAGTGTCTATAATCACGGCAGAAAAGTCCACATTGATTATTTGCACGGCGTC
ACACTTTGCTATGCCATAGCATTTTTATCCATAAGATTAGCGGATCCTACCTGAC
GCTTTTTATCGCAACTCTCTACTGTTTCTCCATACCCGTTTTTTTGGGCTAGCAG
TAATACGACTCACTATAGGGGTCTCATCTCGTGTGAGATAGGCGGAGATACGAAC
TTTAAGAGGAGGATATACCATGCACCATCATCATCACCATAAACGTATCCTGAAC
TCTCTGAAAGTTGCGGCGCTGCGTCTGCTGTTCCGTGGTAAAGGTTCTGAACTGG
TTAAAACCGTTAAATACCCGCTGGTTTCTCCGGTTCAGGGTGCGGTTGAAGAACT
GGCGGAAGCGATCCGTCACGACAACCTGCACCTGTTCGGTCAGAAAGAAATCGTT
GACCTGATGGAAAAAGACGAAGGTACCCAGGTTTACTCTGTTGTTGACTTCTGGC
TGGACACCCTGCGTCTGGGTATGTTCTTCTCTCCGTCTGCGAACGCGCTGAAAAT
CACCCTGGGTAAATTCAACTCTGACCAGGTTTCTCCGTTCCGTAAAGTTCTGGAA
CAGTCTCCGTTCTTCCTGGCGGGTCGTCTGAAAGTTGAACCGGCGGAACGTATCC
TGTCTGTTGAAATCCGTAAAATCGGTAAACGTGAAAACCGTGTTGAAAACTACGC
GGCGGACGTTGAAACCTGCTTCATCGGTCAGCTGTCTTCTGACGAAAAACAGTCT
ATCCAGAAACTGGCGAACGACATCTGGGACTCTAAAGACCACGAAGAACAGCGTA
TGCTGAAAGCGGACTTCTTCGCGATCCCGCTGATCAAAGACCCGAAAGCGGTTAC
CGAAGAAGACCCGGAAAACGAAACCGCGGGTAAACAGAAACCGCTGGAACTGTGC
GTTTGCCTGGTTCCGGAACTGTACACCCGTGGTTTCGGTTCTATCGCGGACTTCC
TGGTTCAGCGTCTGACCCTGCTGCGTGACAAAATGTCTACCGACACCGCGGAAGA
CTGCCTGGAATACGTTGGTATCGAAGAAGAAAAAGGTAACGGTATGAACTCTCTG
CTGGGTACCTTCCTGAAAAACCTGCAGGGTGACGGTTTCGAACAGATCTTCCAGT
TCATGCTGGGTTCTTACGTTGGTTGGCAGGGTAAAGAAGACGTTCTGCGTGAACG
TCTGGACCTGCTGGCGGAAAAAGTTAAACGTCTGCCGAAACCGAAATTCGCGGGT
GAATGGTCTGGTCACCGTATGTTCCTGCACGGTCAGCTGAAATCTTGGTCTTCTA
ACTTCTTCCGTCTGTTCAACGAAACCCGTGAACTGCTGGAATCTATCAAATCTGA
CATCCAGCACGCGACCATGCTGATCTCTTACGTTGAAGAAAAAGGTGGTTACCAC
CCGCAGCTGCTGTCTCAGTACCGTAAACTGATGGAACAGCTGCCGGCGCTGCGTA
CCAAAGTTCTGGACCCGGAAATCGAAATGACCCACATGTCTGAAGCGGTTCGTTC
TTACATCATGATCCACAAATCTGTTGCGGGTTTCCTGCCGGACCTGCTGGAATCT
CTGGACCGTGACAAAGACCGTGAATTCCTGCTGTCTATCTTCCCGCGTATCCCGA
AAATCGACAAAAAAACCAAAGAAATCGTTGCGTGGGAACTGCCGGGTGAACCGGA
AGAAGGTTACCTGTTCACCGCGAACAACCTGTTCCGTAACTTCCTGGAAAACCCG
AAACACGTTCCGCGTTTCATGGCGGAACGTATCCCGGAAGACTGGACCCGTCTGC
GTTCTGCGCCGGTTTGGTTCGACGGTATGGTTAAACAGTGGCAGAAAGTTGTTAA
CCAGCTGGTTGAATCTCCGGGTGCGCTGTACCAGTTCAACGAATCTTTCCTGCGT
CAGCGTCTGCAGGCGATGCTGACCGTTTACAAACGTGACCTGCAGACCGAAAAAT
TCCTGAAACTGCTGGCGGACGTTTGCCGTCCGCTGGTTGACTTCTTCGGTCTGGG
TGGTAACGACATCATCTTCAAATCTTGCCAGGACCCGCGTAAACAGTGGCAGACC
GTTATCCCGCTGTCTGTTCCGGCGGACGTTTACACCGCGTGCGAAGGTCTGGCGA
TCCGTCTGCGTGAAACCCTGGGTTTCGAATGGAAAAACCTGAAAGGTCACGAACG
TGAAGACTTCCTGCGTCTGCACCAGCTGCTGGGTAACCTGCTGTTCTGGATCCGT
GACGCGAAACTGGTTGTTAAACTGGAAGACTGGATGAACAACCCGTGCGTTCAGG
AATACGTTGAAGCGCGTAAAGCGATCGACCTGCCGCTGGAAATCTTCGGTTTCGA
AGTTCCGATCTTCCTGAACGGTTACCTGTTCTCTGAACTGCGTCAGCTGGAACTG
CTGCTGCGTCGTAAATCTGTTATGACCTCTTACTCTGTTAAAACCACCGGTTCTC
CGAACCGTCTGTTCCAGCTGGTTTACCTGCCGCTGAACCCGTCTGACCCGGAAAA
AAAAAACTCTAACAACTTCCAGGAACGTCTGGACACCCCGACCGGTCTGTCTCGT
CGTTTCCTGGACCTGACCCTGGACGCGTTCGCGGGTAAACTGCTGACCGACCCGG
TTACCCAGGAACTGAAAACCATGGCGGGTTTCTACGACCACCTGTTCGGTTTCAA
ACTGCCGTGCAAACTGGCGGCGATGTCTAACCACCCGGGTTCTTCTTCTAAAATG
GTTGTTCTGGCGAAACCGAAAAAAGGTGTTGCGTCTAACATCGGTTTCGAACCGA
TCCCGGACCCGGCGCACCCGGTTTTCCGTGTTCGTTCTTCTTGGCCGGAACTGAA
ATACCTGGAAGGTCTGCTGTACCTGCCGGAAGACACCCCGCTGACCATCGAACTG
GCGGAAACCTCTGTTTCTTGCCAGTCTGTTTCTTCTGTTGCGTTCGACCTGAAAA
ACCTGACCACCATCCTGGGTCGTGTTGGTGAATTCCGTGTTACCGCGGACCAGCC
GTTCAAACTGACCCCGATCATCCCGGAAAAAGAAGAATCTTTCATCGGTAAAACC
TACCTGGGTCTGGACGCGGGTGAACGTTCTGGTGTTGGTTTCGCGATCGTTACCG
TTGACGGTGACGGTTACGAAGTTCAGCGTCTGGGTGTTCACGAAGACACCCAGCT
GATGGCGCTGCAGCAGGTTGCGTCTAAATCTCTGAAAGAACCGGTTTTCCAGCCG
CTGCGTAAAGGTACCTTCCGTCAGCAGGAACGTATCCGTAAATCTCTGCGTGGTT
GCTACTGGAACTTCTACCACGCGCTGATGATCAAATACCGTGCGAAAGTTGTTCA
CGAAGAATCTGTTGGTTCTTCTGGTCTGGTTGGTCAGTGGCTGCGTGCGTTCCAG
AAAGACCTGAAAAAAGCGGACGTTCTGCCGAAAAAAGGTGGTAAAAACGGTGTTG
ACAAAAAAAAACGTGAATCTTCTGCGCAGGACACCCTGTGGGGTGGTGCGTTCTC
TAAAAAAGAAGAACAGCAGATCGCGTTCGAAGTTCAGGCGGCGGGTTCTTCTCAG
TTCTGCCTGAAATGCGGTTGGTGGTTCCAGCTGGGTATGCGTGAAGTTAACCGTG
TTCAGGAATCTGGTGTTGTTCTGGACTGGAACCGTTCTATCGTTACCTTCCTGAT
CGAATCTTCTGGTGAAAAAGTTTACGGTTTCTCTCCGCAGCAGCTGGAAAAAGGT
TTCCGTCCGGACATCGAAACCTTCAAAAAAATGGTTCGTGACTTCATGCGTCCGC
CGATGTTCGACCGTAAAGGTCGTCCGGCGGCGGCGTACGAACGTTTCGTTCTGGG
TCGTCGTCACCGTCGTTACCGTTTCGACAAAGTTTTCGAAGAACGTTTCGGTCGT
TCTGCGCTGTTCATCTGCCCGCGTGTTGGTTGCGGTAACTTCGACCACTCTTCTG
AACAGTCTGCGGTTGTTCTGGCGCTGATCGGTTACATCGCGGACAAAGAAGGTAT
GTCTGGTAAAAAACTGGTTTACGTTCGTCTGGCGGAACTGATGGCGGAATGGAAA
CTGAAAAAACTGGAACGTTCTCGTGTTGAAGAACAGTCTTCTGCGCAGTAAGAAA
TCATCCTTAGCGAAAGCTAAGGATTTTTTTTATCTGAAATGTAGGGAGACCCTCA
GGTTAAATATTCACTCAGGAAGTTATTACTCAGGAAGCAAAGAGGATTACA
SEQ AATCAGGAGAGCGTTTTCAATCCTACCTCTGGCGCAGTTGATATGTCAAACAGGT
ID TGCCGTCACTGCGTCTTTTACTGGCTCTTCTCGCTAACCAAACCGGTAACCCCGC
NO: TTATTAAAAGCATTCTGTAACAAAGCGGGACCAAAGCCATGACAAAAACGCGTAA
78 CAAAAGTGTCTATAATCACGGCAGAAAAGTCCACATTGATTATTTGCACGGCGTC
ACACTTTGCTATGCCATAGCATTTTTATCCATAAGATTAGCGGATCCTACCTGAC
GCTTTTTATCGCAACTCTCTACTGTTTCTCCATACCCGTTTTTTTGGGCTAGCAG
TAATACGACTCACTATAGGGGTCTCATCTCGTGTGAGATAGGCGGAGATACGAAC
TTTAAGAGGAGGATATACCATGCACCATCATCATCACCATGCGGAATCTAAACAG
ATGCAGTGCCGTAAATGCGGTGCGTCTATGAAATACGAAGTTATCGGTCTGGGTA
AAAAATCTTGCCGTTACATGTGCCCGGACTGCGGTAACCACACCTCTGCGCGTAA
AATCCAGAACAAAAAAAAACGTGACAAAAAATACGGTTCTGCGTCTAAAGCGCAG
TCTCAGCGTATCGCGGTTGCGGGTGCGCTGTACCCGGACAAAAAAGTTCAGACCA
TCAAAACCTACAAATACCCGGCGGACCTGAACGGTGAAGTTCACGACTCTGGTGT
TGCGGAAAAAATCGCGCAGGCGATCCAGGAAGACGAAATCGGTCTGCTGGGTCCG
TCTTCTGAATACGCGTGCTGGATCGCGTCTCAGAAACAGTCTGAACCGTACTCTG
TTGTTGACTTCTGGTTCGACGCGGTTTGCGCGGGTGGTGTTTTCGCGTACTCTGG
TGCGCGTCTGCTGTCTACCGTTCTGCAGCTGTCTGGTGAAGAATCTGTTCTGCGT
GCGGCGCTGGCGTCTTCTCCGTTCGTTGACGACATCAACCTGGCGCAGGCGGAAA
AATTCCTGGCGGTTTCTCGTCGTACCGGTCAGGACAAACTGGGTAAACGTATCGG
TGAATGCTTCGCGGAAGGTCGTCTGGAAGCGCTGGGTATCAAAGACCGTATGCGT
GAATTCGTTCAGGCGATCGACGTTGCGCAGACCGCGGGTCAGCGTTTCGCGGCGA
AACTGAAAATCTTCGGTATCTCTCAGATGCCGGAAGCGAAACAGTGGAACAACGA
CTCTGGTCTGACCGTTTGCATCCTGCCGGACTACTACGTTCCGGAAGAAAACCGT
GCGGACCAGCTGGTTGTTCTGCTGCGTCGTCTGCGTGAAATCGCGTACTGCATGG
GTATCGAAGACGAAGCGGGTTTCGAACACCTGGGTATCGACCCGGGTGCGCTGTC
TAACTTCTCTAACGGTAACCCGAAACGTGGTTTCCTGGGTCGTCTGCTGAACAAC
GACATCATCGCGCTGGCGAACAACATGTCTGCGATGACCCCGTACTGGGAAGGTC
GTAAAGGTGAACTGATCGAACGTCTGGCGTGGCTGAAACACCGTGCGGAAGGTCT
GTACCTGAAAGAACCGCACTTCGGTAACTCTTGGGCGGACCACCGTTCTCGTATC
TTCTCTCGTATCGCGGGTTGGCTGTCTGGTTGCGCGGGTAAACTGAAAATCGCGA
AAGACCAGATCTCTGGTGTTCGTACCGACCTGTTCCTGCTGAAACGTCTGCTGGA
CGCGGTTCCGCAGTCTGCGCCGTCTCCGGACTTCATCGCGTCTATCTCTGCGCTG
GACCGTTTCCTGGAAGCGGCGGAATCTTCTCAGGACCCGGCGGAACAGGTTCGTG
CGCTGTACGCGTTCCACCTGAACGCGCCGGCGGTTCGTTCTATCGCGAACAAAGC
GGTTCAGCGTTCTGACTCTCAGGAATGGCTGATCAAAGAACTGGACGCGGTTGAC
CACCTGGAATTCAACAAAGCGTTCCCGTTCTTCTCTGACACCGGTAAAAAAAAAA
AAAAAGGTGCGAACTCTAACGGTGCGCCGTCTGAAGAAGAATACACCGAAACCGA
ATCTATCCAGCAGCCGGAAGACGCGGAACAGGAAGTTAACGGTCAGGAAGGTAAC
GGTGCGTCTAAAAACCAGAAAAAATTCCAGCGTATCCCGCGTTTCTTCGGTGAAG
GTTCTCGTTCTGAATACCGTATCCTGACCGAAGCGCCGCAGTACTTCGACATGTT
CTGCAACAACATGCGTGCGATCTTCATGCAGCTGGAATCTCAGCCGCGTAAAGCG
CCGCGTGACTTCAAATGCTTCCTGCAGAACCGTCTGCAGAAACTGTACAAACAGA
CCTTCCTGAACGCGCGTTCTAACAAATGCCGTGCGCTGCTGGAATCTGTTCTGAT
CTCTTGGGGTGAATTCTACACCTACGGTGCGAACGAAAAAAAATTCCGTCTGCGT
CACGAAGCGTCTGAACGTTCTTCTGACCCGGACTACGTTGTTCAGCAGGCGCTGG
AAATCGCGCGTCGTCTGTTCCTGTTCGGTTTCGAATGGCGTGACTGCTCTGCGGG
TGAACGTGTTGACCTGGTTGAAATCCACAAAAAAGCGATCTCTTTCCTGCTGGCG
ATCACCCAGGCGGAAGTTTCTGTTGGTTCTTACAACTGGCTGGGTAACTCTACCG
TTTCTCGTTACCTGTCTGTTGCGGGTACCGACACCCTGTACGGTACCCAGCTGGA
AGAATTCCTGAACGCGACCGTTCTGTCTCAGATGCGTGGTCTGGCGATCCGTCTG
TCTTCTCAGGAACTGAAAGACGGTTTCGACGTTCAGCTGGAATCTTCTTGCCAGG
ACAACCTGCAGCACCTGCTGGTTTACCGTGCGTCTCGTGACCTGGCGGCGTGCAA
ACGTGCGACCTGCCCGGCGGAACTGGACCCGAAAATCCTGGTTCTGCCGGTTGGT
GCGTTCATCGCGTCTGTTATGAAAATGATCGAACGTGGTGACGAACCGCTGGCGG
GTGCGTACCTGCGTCACCGTCCGCACTCTTTCGGTTGGCAGATCCGTGTTCGTGG
TGTTGCGGAAGTTGGTATGGACCAGGGTACCGCGCTGGCGTTCCAGAAACCGACC
GAATCTGAACCGTTCAAAATCAAACCGTTCTCTGCGCAGTACGGTCCGGTTCTGT
GGCTGAACTCTTCTTCTTACTCTCAGTCTCAGTACCTGGACGGTTTCCTGTCTCA
GCCGAAAAACTGGTCTATGCGTGTTCTGCCGCAGGCGGGTTCTGTTCGTGTTGAA
CAGCGTGTTGCGCTGATCTGGAACCTGCAGGCGGGTAAAATGCGTCTGGAACGTT
CTGGTGCGCGTGCGTTCTTCATGCCGGTTCCGTTCTCTTTCCGTCCGTCTGGTTC
TGGTGACGAAGCGGTTCTGGCGCCGAACCGTTACCTGGGTCTGTTCCCGCACTCT
GGTGGTATCGAATACGCGGTTGTTGACGTTCTGGACTCTGCGGGTTTCAAAATCC
TGGAACGTGGTACCATCGCGGTTAACGGTTTCTCTCAGAAACGTGGTGAACGTCA
GGAAGAAGCGCACCGTGAAAAACAGCGTCGTGGTATCTCTGACATCGGTCGTAAA
AAACCGGTTCAGGCGGAAGTTGACGCGGCGAACGAACTGCACCGTAAATACACCG
ACGTTGCGACCCGTCTGGGTTGCCGTATCGTTGTTCAGTGGGCGCCGCAGCCGAA
ACCGGGTACCGCGCCGACCGCGCAGACCGTTTACGCGCGTGCGGTTCGTACCGAA
GCGCCGCGTTCTGGTAACCAGGAAGACCACGCGCGTATGAAATCTTCTTGGGGTT
ACACCTGGGGTACCTACTGGGAAAAACGTAAACCGGAAGACATCCTGGGTATCTC
TACCCAGGTTTACTGGACCGGTGGTATCGGTGAATCTTGCCCGGCGGTTGCGGTT
GCGCTGCTGGGTCACATCCGTGCGACCTCTACCCAGACCGAATGGGAAAAAGAAG
AAGTTGTTTTCGGTCGTCTGAAAAAATTCTTCCCGTCTTAAGAAATCATCCTTAG
CGAAAGCTAAGGATTTTTTTTATCTGAAATGTAGGGAGACCCTCAGGTTAAATAT
TCACTCAGGAAGTTATTACTCAGGAAGCAAAGAGGATTACA
SEQ AATCAGGAGAGCGTTTTCAATCCTACCTCTGGCGCAGTTGATATGTCAAACAGGT
ID TGCCGTCACTGCGTCTTTTACTGGCTCTTCTCGCTAACCAAACCGGTAACCCCGC
NO: TTATTAAAAGCATTCTGTAACAAAGCGGGACCAAAGCCATGACAAAAACGCGTAA
79 CAAAAGTGTCTATAATCACGGCAGAAAAGTCCACATTGATTATTTGCACGGCGTC
ACACTTTGCTATGCCATAGCATTTTTATCCATAAGATTAGCGGATCCTACCTGAC
GCTTTTTATCGCAACTCTCTACTGTTTCTCCATACCCGTTTTTTTGGGCTAGCAG
TAATACGACTCACTATAGGGGTCTCATCTCGTGTGAGATAGGCGGAGATACGAAC
TTTAAGAGGAGGATATACCATGCACCATCATCATCACCATGAAAAACGTATCAAC
AAAATCCGTAAAAAACTGTCTGCGGACAACGCGACCAAACCGGTTTCTCGTTCTG
GTCCGATGAAAACCCTGCTGGTTCGTGTTATGACCGACGACCTGAAAAAACGTCT
GGAAAAACGTCGTAAAAAACCGGAAGTTATGCCGCAGGTTATCTCTAACAACGCG
GCGAACAACCTGCGTATGCTGCTGGACGACTACACCAAAATGAAAGAAGCGATCC
TGCAGGTTTACTGGCAGGAATTCAAAGACGACCACGTTGGTCTGATGTGCAAATT
CGCGCAGCCGGCGTCTAAAAAAATCGACCAGAACAAACTGAAACCGGAAATGGAC
GAAAAAGGTAACCTGACCACCGCGGGTTTCGCGTGCTCTCAGTGCGGTCAGCCGC
TGTTCGTTTACAAACTGGAACAGGTTTCTGAAAAAGGTAAAGCGTACACCAACTA
CTTCGGTCGTTGCAACGTTGCGGAACACGAAAAACTGATCCTGCTGGCGCAGCTG
AAACCGGAAAAAGACTCTGACGAAGCGGTTACCTACTCTCTGGGTAAATTCGGTC
AGCGTGCGCTGGACTTCTACTCTATCCACGTTACCAAAGAATCTACCCACCCGGT
TAAACCGCTGGCGCAGATCGCGGGTAACCGTTACGCGTCTGGTCCGGTTGGTAAA
GCGCTGTCTGACGCGTGCATGGGTACCATCGCGTCTTTCCTGTCTAAATACCAGG
ACATCATCATCGAACACCAGAAAGTTGTTAAAGGTAACCAGAAACGTCTGGAATC
TCTGCGTGAACTGGCGGGTAAAGAAAACCTGGAATACCCGTCTGTTACCCTGCCG
CCGCAGCCGCACACCAAAGAAGGTGTTGACGCGTACAACGAAGTTATCGCGCGTG
TTCGTATGTGGGTTAACCTGAACCTGTGGCAGAAACTGAAACTGTCTCGTGACGA
CGCGAAACCGCTGCTGCGTCTGAAAGGTTTCCCGTCTTTCCCGGTTGTTGAACGT
CGTGAAAACGAAGTTGACTGGTGGAACACCATCAACGAAGTTAAAAAACTGATCG
ACGCGAAACGTGACATGGGTCGTGTTTTCTGGTCTGGTGTTACCGCGGAAAAACG
TAACACCATCCTGGAAGGTTACAACTACCTGCCGAACGAAAACGACCACAAAAAA
CGTGAAGGTTCTCTGGAAAACCCGAAAAAACCGGCGAAACGTCAGTTCGGTGACC
TGCTGCTGTACCTGGAAAAAAAATACGCGGGTGACTGGGGTAAAGTTTTCGACGA
AGCGTGGGAACGTATCGACAAAAAAATCGCGGGTCTGACCTCTCACATCGAACGT
GAAGAAGCGCGTAACGCGGAAGACGCGCAGTCTAAAGCGGTTCTGACCGACTGGC
TGCGTGCGAAAGCGTCTTTCGTTCTGGAACGTCTGAAAGAAATGGACGAAAAAGA
ATTCTACGCGTGCGAAATCCAGCTGCAGAAATGGTACGGTGACCTGCGTGGTAAC
CCGTTCGCGGTTGAAGCGGAAAACCGTGTTGTTGACATCTCTGGTTTCTCTATCG
GTTCTGACGGTCACTCTATCCAGTACCGTAACCTGCTGGCGTGGAAATACCTGGA
AAACGGTAAACGTGAATTCTACCTGCTGATGAACTACGGTAAAAAAGGTCGTATC
CGTTTCACCGACGGTACCGACATCAAAAAATCTGGTAAATGGCAGGGTCTGCTGT
ACGGTGGTGGTAAAGCGAAAGTTATCGACCTGACCTTCGACCCGGACGACGAACA
GCTGATCATCCTGCCGCTGGCGTTCGGTACCCGTCAGGGTCGTGAATTCATCTGG
AACGACCTGCTGTCTCTGGAAACCGGTCTGATCAAACTGGCGAACGGTCGTGTTA
TCGAAAAAACCATCTACAACAAAAAAATCGGTCGTGACGAACCGGCGCTGTTCGT
TGCGCTGACCTTCGAACGTCGTGAAGTTGTTGACCCGTCTAACATCAAACCGGTT
AACCTGATCGGTGTTGACCGTGGTGAAAACATCCCGGCGGTTATCGCGCTGACCG
ACCCGGAAGGTTGCCCGCTGCCGGAATTCAAAGACTCTTCTGGTGGTCCGACCGA
CATCCTGCGTATCGGTGAAGGTTACAAAGAAAAACAGCGTGCGATCCAGGCGGCG
AAAGAAGTTGAACAGCGTCGTGCGGGTGGTTACTCTCGTAAATTCGCGTCTAAAT
CTCGTAACCTGGCGGACGACATGGTTCGTAACTCTGCGCGTGACCTGTTCTACCA
CGCGGTTACCCACGACGCGGTTCTGGTTTTCGAAAACCTGTCTCGTGGTTTCGGT
CGTCAGGGTAAACGTACCTTCATGACCGAACGTCAGTACACCAAAATGGAAGACT
GGCTGACCGCGAAACTGGCGTACGAAGGTCTGACCTCTAAAACCTACCTGTCTAA
AACCCTGGCGCAGTACACCTCTAAAACCTGCTCTAACTGCGGTTTCACCATCACC
ACCGCGGACTACGACGGTATGCTGGTTCGTCTGAAAAAAACCTCTGACGGTTGGG
CGACCACCCTGAACAACAAAGAACTGAAAGCGGAAGGTCAGATCACCTACTACAA
CCGTTACAAACGTCAGACCGTTGAAAAAGAACTGTCTGCGGAACTGGACCGTCTG
TCTGAAGAATCTGGTAACAACGACATCTCTAAATGGACCAAAGGTCGTCGTGACG
AAGCGCTGTTCCTGCTGAAAAAACGTTTCTCTCACCGTCCGGTTCAGGAACAGTT
CGTTTGCCTGGACTGCGGTCACGAAGTTCACGCGGACGAACAGGCGGCGCTGAAC
ATCGCGCGTTCTTGGCTGTTCCTGAACTCTAACTCTACCGAATTCAAATCTTACA
AATCTGGTAAACAGCCGTTCGTTGGTGCGTGGCAGGCGTTCTACAAACGTCGTCT
GAAAGAAGTTTGGAAACCGAACGCGTAAGAAATCATCCTTAGCGAAAGCTAAGGA
TTTTTTTTATCTGAAATGTAGGGAGACCCTCAGGTTAAATATTCACTCAGGAAGT
TATTACTCAGGAAGCAAAGAGGATTACA
SEQ AATCAGGAGAGCGTTTTCAATCCTACCTCTGGCGCAGTTGATATGTCAAACAGGT
ID TGCCGTCACTGCGTCTTTTACTGGCTCTTCTCGCTAACCAAACCGGTAACCCCGC
NO: TTATTAAAAGCATTCTGTAACAAAGCGGGACCAAAGCCATGACAAAAACGCGTAA
80 CAAAAGTGTCTATAATCACGGCAGAAAAGTCCACATTGATTATTTGCACGGCGTC
ACACTTTGCTATGCCATAGCATTTTTATCCATAAGATTAGCGGATCCTACCTGAC
GCTTTTTATCGCAACTCTCTACTGTTTCTCCATACCCGTTTTTTTGGGCTAGCAG
TAATACGACTCACTATAGGGGTCTCATCTCGTGTGAGATAGGCGGAGATACGAAC
TTTAAGAGGAGGATATACCATGCACCATCATCATCACCATAAACGTATCAACAAA
ATCCGTCGTCGTCTGGTTAAAGACTCTAACACCAAAAAAGCGGGTAAAACCGGTC
CGATGAAAACCCTGCTGGTTCGTGTTATGACCCCGGACCTGCGTGAACGTCTGGA
AAACCTGCGTAAAAAACCGGAAAACATCCCGCAGCCGATCTCTAACACCTCTCGT
GCGAACCTGAACAAACTGCTGACCGACTACACCGAAATGAAAAAAGCGATCCTGC
ACGTTTACTGGGAAGAATTCCAGAAAGACCCGGTTGGTCTGATGTCTCGTGTTGC
GCAGCCGGCGCCGAAAAACATCGACCAGCGTAAACTGATCCCGGTTAAAGACGGT
AACGAACGTCTGACCTCTTCTGGTTTCGCGTGCTCTCAGTGCTGCCAGCCGCTGT
ACGTTTACAAACTGGAACAGGTTAACGACAAAGGTAAACCGCACACCAACTACTT
CGGTCGTTGCAACGTTTCTGAACACGAACGTCTGATCCTGCTGTCTCCGCACAAA
CCGGAAGCGAACGACGAACTGGTTACCTACTCTCTGGGTAAATTCGGTCAGCGTG
CGCTGGACTTCTACTCTATCCACGTTACCCGTGAATCTAACCACCCGGTTAAACC
GCTGGAACAGATCGGTGGTAACTCTTGCGCGTCTGGTCCGGTTGGTAAAGCGCTG
TCTGACGCGTGCATGGGTGCGGTTGCGTCTTTCCTGACCAAATACCAGGACATCA
TCCTGGAACACCAGAAAGTTATCAAAAAAAACGAAAAACGTCTGGCGAACCTGAA
AGACATCGCGTCTGCGAACGGTCTGGCGTTCCCGAAAATCACCCTGCCGCCGCAG
CCGCACACCAAAGAAGGTATCGAAGCGTACAACAACGTTGTTGCGCAGATCGTTA
TCTGGGTTAACCTGAACCTGTGGCAGAAACTGAAAATCGGTCGTGACGAAGCGAA
ACCGCTGCAGCGTCTGAAAGGTTTCCCGTCTTTCCCGCTGGTTGAACGTCAGGCG
AACGAAGTTGACTGGTGGGACATGGTTTGCAACGTTAAAAAACTGATCAACGAAA
AAAAAGAAGACGGTAAAGTTTTCTGGCAGAACCTGGCGGGTTACAAACGTCAGGA
AGCGCTGCTGCCGTACCTGTCTTCTGAAGAAGACCGTAAAAAAGGTAAAAAATTC
GCGCGTTACCAGTTCGGTGACCTGCTGCTGCACCTGGAAAAAAAACACGGTGAAG
ACTGGGGTAAAGTTTACGACGAAGCGTGGGAACGTATCGACAAAAAAGTTGAAGG
TCTGTCTAAACACATCAAACTGGAAGAAGAACGTCGTTCTGAAGACGCGCAGTCT
AAAGCGGCGCTGACCGACTGGCTGCGTGCGAAAGCGTCTTTCGTTATCGAAGGTC
TGAAAGAAGCGGACAAAGACGAATTCTGCCGTTGCGAACTGAAACTGCAGAAATG
GTACGGTGACCTGCGTGGTAAACCGTTCGCGATCGAAGCGGAAAACTCTATCCTG
GACATCTCTGGTTTCTCTAAACAGTACAACTGCGCGTTCATCTGGCAGAAAGACG
GTGTTAAAAAACTGAACCTGTACCTGATCATCAACTACTTCAAAGGTGGTAAACT
GCGTTTCAAAAAAATCAAACCGGAAGCGTTCGAAGCGAACCGTTTCTACACCGTT
ATCAACAAAAAATCTGGTGAAATCGTTCCGATGGAAGTTAACTTCAACTTCGACG
ACCCGAACCTGATCATCCTGCCGCTGGCGTTCGGTAAACGTCAGGGTCGTGAATT
CATCTGGAACGACCTGCTGTCTCTGGAAACCGGTTCTCTGAAACTGGCGAACGGT
CGTGTTATCGAAAAAACCCTGTACAACCGTCGTACCCGTCAGGACGAACCGGCGC
TGTTCGTTGCGCTGACCTTCGAACGTCGTGAAGTTCTGGACTCTTCTAACATCAA
ACCGATGAACCTGATCGGTATCGACCGTGGTGAAAACATCCCGGCGGTTATCGCG
CTGACCGACCCGGAAGGTTGCCCGCTGTCTCGTTTCAAAGACTCTCTGGGTAACC
CGACCCACATCCTGCGTATCGGTGAATCTTACAAAGAAAAACAGCGTACCATCCA
GGCGGCGAAAGAAGTTGAACAGCGTCGTGCGGGTGGTTACTCTCGTAAATACGCG
TCTAAAGCGAAAAACCTGGCGGACGACATGGTTCGTAACACCGCGCGTGACCTGC
TGTACTACGCGGTTACCCAGGACGCGATGCTGATCTTCGAAAACCTGTCTCGTGG
TTTCGGTCGTCAGGGTAAACGTACCTTCATGGCGGAACGTCAGTACACCCGTATG
GAAGACTGGCTGACCGCGAAACTGGCGTACGAAGGTCTGCCGTCTAAAACCTACC
TGTCTAAAACCCTGGCGCAGTACACCTCTAAAACCTGCTCTAACTGCGGTTTCAC
CATCACCTCTGCGGACTACGACCGTGTTCTGGAAAAACTGAAAAAAACCGCGACC
GGTTGGATGACCACCATCAACGGTAAAGAACTGAAAGTTGAAGGTCAGATCACCT
ACTACAACCGTTACAAACGTCAGAACGTTGTTAAAGACCTGTCTGTTGAACTGGA
CCGTCTGTCTGAAGAATCTGTTAACAACGACATCTCTTCTTGGACCAAAGGTCGT
TCTGGTGAAGCGCTGTCTCTGCTGAAAAAACGTTTCTCTCACCGTCCGGTTCAGG
AAAAATTCGTTTGCCTGAACTGCGGTTTCGAAACCCACGCGGACGAACAGGCGGC
GCTGAACATCGCGCGTTCTTGGCTGTTCCTGCGTTCTCAGGAATACAAAAAATAC
CAGACCAACAAAACCACCGGTAACACCGACAAACGTGCGTTCGTTGAAACCTGGC
AGTCTTTCTACCGTAAAAAACTGAAAGAAGTTTGGAAACCGGAAATCATCCTTAG
CGAAAGCTAAGGATTTTTTTTATCTGAAATGTAGGGAGACCCTCAGGTTAAATAT
TCACTCAGGAAGTTATTACTCAGGAAGCAAAGAGGATTACA
SEQ tgccgtcactgcgtcttttactggctcttctcgctaaccaaaccggtaaccccgc
ID ttattaaaagcattctgtaacaaagcgggaccaaagccatgacaaaaacgcgtaa
NO: caaaagtgtctataatcacggcagaaaagtccacattgattatttgcacggcgtc
81 acactttgctatgccatagcatttttatccataagattagcggatcctacctgac
gctttttatcgcaactctctactgtttctccatacccgtttttttgggctagcac
cgcctatctcgtgtgagataggcggagatacgaactttaagAAGGAGatatacc
SEQ TGCCGTCACTGCGTCTTTTACTGGCTCTTCTCGCTAACCAAACCGGTAACCCCGC
ID TTATTAAAAGCATTCTGTAACAAAGCGGGACCAAAGCCATGACAAAAACGCGTAA
NO: CAAAAGTGTCTATAATCACGGCAGAAAAGTCCACATTGATTATTTGCACGGCGTC
82 ACACTTTGCTATGCCATAGCATTTTTATCCATAAGATTAGCGGATCCTACCTGAC
GCTTTTTATCGCAACTCTCTACTGTTTCTCCATACCCGTTTTTTTGGGTAGCGGA
TCCTACCTGAC
SEQ AATCAGGAGAGCGTTTTCAATCCTACCTCTGGCGCAGTTGATATGTCAAACAGGT
ID TTCTAGAGCACAGCTAACACCACGTCGTCCCTATCTGCTGCCCTAGGTCTATGAG
NO: TGGTTGCTGGATAACTTTACGGGCATGCATAAGGCTCGTAATATATATTCAGGGA
83 GACCACAACGGTTTCCCTCTACAAATAATTTTGTTTAACTTTTACTAGAGCTAGC
AGTAATACGACTCACTATAGGGGTCTCATCTCGTGTGAGATAGGCGGAGATACGA
ACTTTAAGAGGAGGATATACCA
SEQ GTTTGAGAGATATGTAAATTCAAAGGATAATCAAAC
ID
NO:
84
SEQ actacattttttaagacctaattttgagt
ID
NO:
85
SEQ ctcaaaactcattcgaatctctactctttgtagat
ID
NO:
86
SEQ CTCTAGCAGGCCTGGCAAATTTCTACTGTTGTAGAT
ID
NO:
87
SEQ CCGTCTAAAACTCATTCAGAATTTCTACTAGTGTAGAT
ID
NO:
88
SEQ GTCTAGGTACTCTCTTTAATTTCTACTATTGT
ID
NO:
89
SEQ gttaagttatatagaataatttctactgttgtaga
ID
NO:
90
SEQ gtttaaaaccactttaaaatttctactattgta
ID
NO:
91
SEQ GTTTGAGAATGATGTAAAAATGTATGGTACACAGAAATGTTTTAATACCATATTT
ID TTACATCACTCTCAAACATACATCTCTTGTTACTGTTTATCGTATCCAGATTAAA
NO: TTTCACGTTTTT
92
SEQ CTCTACAACTGATAAAGAATTTCTACTTTTGTAGAT
ID
NO:
93
SEQ GTCTGGCCCCAAATTTTAATTTCTACTGTTGTAGAT
ID
NO:
94
SEQ GTCAAAAGACCTTTTTAATTTCTACTCTTGTAGAT
ID
NO:
95
SEQ GTCTAGAGGACAGAATTTTTCAACGGGTGTGCCAATGGCCACTTTCCAGGTGGCA
ID AAGCCCGTTGAGCTTCTACGGAAGTGGCAC
NO:
96
SEQ CGAGGTTCTGTCTTTTGGTCAGGACAACCGTCTAGCTATAAGTGCTGCAGGGGTG
ID TGAGAAACTCCTATTGCTGGACGATGTCTCTTTTAACGAGGCATTAGCAC
NO:
97
SEQ GAACGAGGGACGTTTTGTCTCCAATGATTTTGCTATGACGACCTCGAACTGTGCC
ID TTCAAGTCTGAGGCGAAAAAGAAATGGAAAAAAGTGTCTCATCGCTCTACCTCGT
NO: AGTTAGAGG
98
SEQ AATTACTGATGTTGTGATGAAGG
ID
NO:
99
SEQ TATACCATAAGGATTTAAAGACT
ID
NO:
100
SEQ GTCTTTACTCTCACCTTTCCACCTG
ID
NO:
101
SEQ ATTTGAAGGTATCTCCGATAAGTAAAACGCATCAAAG
ID
NO:
102
SEQ GTTTGAAGATATCTCCGATAAATAAGAAGCATCAAAG
ID
NO:
103
SEQ TTGTTTTAATACCATATTTTTACATCACTCTCAAAC
ID
NO:
104
SEQ AAAGAACGCTCGCTCAGTGTTCTGACCTTTCGAGCGCCTGTTCAGGGCGAAAACC
ID CTGGGAGGCGCTCGAATCATAGGTGGGACAAGGGATTCGCGGCGAAAA
NO:
105
SEQ GTTTGAGAATGATGTAAAAATGTATGGTACACAGAAATGTTTTAATACCATATTT
ID TTACATCACTCTCAAACATACATCTCTTGTTACTGTTTATCGTATCCAGATTAAA
NO: TTTCACGTTTTT
106
SEQ GTCTAGAGGACAGAATTTTTCAACGGGTGTGCCAATGGCCACTTTCCAGGTGGCA
ID AAGCCCGTTGAGCTTCTACGGAAGTGGCAC
NO:
107
SEQ MSIYQEFVNKYSLSKTLRFELIPQGKTLENIKARGLILDDEKRAKDYKKAKQIID
ID KYHQFFIEEILSSVCISEDLLQNYSDVYFKLKKSDDDNLQKDFKSAKDTIKKQIS
NO: EYIKDSEKFKNLFNQNLIDAKKGQESDLILWLKQSKDNGIELFKANSDITDIDEA
108 LEIIKSFKGWTTYFKGFHENRKNVYSSNDIPTSIIYRIVDDNLPKFLENKAKYES
LKDKAPEAINYEQIKKDLAEELTFDIDYKTSEVNQRVFSLDEVFEIANFNNYLNQ
SGITKFNTIIGGKFVNGENTKRKGINEYINLYSQQINDKTLKKYKMSVLFKQILS
DTESKSFVIDKLEDDSDVVTTMQSFYEQIAAFKTVEEKSIKETLSLLFDDLKAQK
LDLSKIYFKNDKSLTDLSQQVEDDYSVIGTAVLEYITQQIAPKNLDNPSKKEQEL
IAKKTEKAKYLSLETIKLALEEFNKHRDIDKQCRFEEILANFAAIPMIFDEIAQN
KDNLAQISIKYQNQGKKDLLQASAEDDVKAIKDLLDQTNNLLHKLKIFHISQSED
KANILDKDEHFYLVFEECYFELANIVPLYNKIRNYITQKPYSDEKFKLNFENSTL
ANGWDKNKEPDNTAILFIKDDKYYLGVMNKKNNKIFDDKAIKENKGEGYKKIVYK
LLPGANKMLPKVFFSAKSIKFYNPSEDILRIRNHSTHTKNGSPQKGYEKFEFNIE
DCRKFIDFYKQSISKHPEWKDFGFRFSDTQRYNSIDEFYREVENQGYKLTFENIS
ESYIDSVVNQGKLYLFQIYNKDFSAYSKGRPNLHTLYWKALFDERNLQDVVYKLN
GEAELFYRKQSIPKKITHPAKEAIANKNKDNPKKESVFEYDLIKDKRFTEDKFFF
HCPITINFKSSGANKENDEINLLLKEKANDVHILSIDRGERHLAYYTLVDGKGNI
IKQDTFNIIGNDRMKTNYHDKLAAIEKDRDSARKDWKKINNIKEMKEGYLSQVVH
EIAKLVIEYNAIVVFEDLNFGFKRGRFKVEKQVYQKLEKMLIEKLNYLVFKDNEF
DKTGGVLRAYQLTAPFETFKKMGKQTGIIYYVPAGFTSKICPVTGFVNQLYPKYE
SVSKSQEFFSKFDKICYNLDKGYFEFSFDYKNFGDKAAKGKWTIASFGSRLINFR
NSDKNHNWDTREVYPTKELEKLLKDYSIEYGHGECIKAAICGESDKKFFAKLTSV
LNTILQMRNSKTGTELDYLISPVADVNGNFFDSRQAPKNMPQDADANGAYHIGLK
GLMLLGRIKNNQEGKKLNLVIKNEEYFEFVQNRNN
SEQ MSIYQEFVNKYSLSKTLRFELIPQGKTLENIKARGLILDDEKRAKDYKKAKQIID
ID KYHQFFIEEILSSVCISEDLLQNYSDVYFKLKKSDDDNLQKDFKSAKDTIKKQIS
NO: EYIKDSEKFKNLFNQNLIDAKKGQESDLILWLKQSKDNGIELFKANSDITDIDEA
109 LEIIKSFKGWTTYFKGFHENRKNVYSSDDIPTSIIYRIVDDNLPKFLENKAKYES
LKDKAPEAINYEQIKKDLAEELTFDIDYKTSEVNQRVFSLDEVFEIANFNNYLNQ
SGITKFNTIIGGKFVNGENTKRKGINEYINLYSQQINDKTLKKYKMSVLFKQILS
DTESKSFVIDKLEDDSDVVTTMQSFYEQIAAFKTVEEKSIKETLSLLFDDLKAQK
LDLSKIYFKNDKSLTDLSQQVEDDYSVIGTAVLEYITQQVAPKNLDNPSKKEQDL
IAKKTEKAKYLSLETIKLALEEFNKHRDIDKQCRFEEILANFAAIPMIFDEIAQN
KDNLAQISLKYQNQGKKDLLQASAEEDVKAIKDLLDQTNNLLHRLKIFHISQSED
KANILDKDEHFYLVFEECYFELANIVPLYNKIRNYITQKPYSDEKFKLNFENSTL
ANGWDKNKEPDNTAILFIKDDKYYLGVMNKKNNKIFDDKAIKENKGEGYKKIVYK
LLPGANKMLPKVFFSAKSIKFYNPSEDILRIRNHSTHTKNGNPQKGYEKFEFNIE
DCRKFIDFYKESISKHPEWKDFGFRFSDTQRYNSIDEFYREVENQGYKLTFENIS
ESYIDSVVNQGKLYLFQIYNKDFSAYSKGRPNLHTLYWKALFDERNLQDVVYKLN
GEAELFYRKQSIPKKITHPAKEAIANKNKDNPKKESVFEYDLIKDKRFTEDKFFF
HCPITINFKSSGANKENDEINLLLKEKANDVHILSIDRGERHLAYYTLVDGKGNI
IKQDTFNIIGNDRMKTNYHDKLAAIEKDRDSARKDWKKINNIKEMKEGYLSQVVH
EIAKLVIEHNAIVVFEDLNFGFKRGRFKVEKQVYQKLEKMLIEKLNYLVFKDNEF
DKTGGVLRAYQLTAPFETFKKMGKQTGIIYYVPAGFTSKICPVTGFVNQLYPKYE
SVSKSQEFFSKFDKICYNLDKGYFEFSFDYKNFGDKAAKGKWTIASFGSRLINFR
NSDKNHNWDTREVYPTKELEKLLKDYSIEYGHGECIKAAICGESDKKFFAKLTSV
LNTILQMRNSKTGTELDYLISPVADVNGNFFDSRQAPKNMPQDADANGAYHIGLK
GLMLLDRIKNNQEGKKLNLVIKNEEYFEFVQNRNN
SEQ MDKKYSIGLDIGTNSVGWAVITDEYKVPSKKFKVLGNTDRHSIKKNLIGALLFDS
ID GETAEATRLKRTARRRYTRRKNRICYLQEIFSNEMAKVDDSFFHRLEESFLVEED
NO: KKHERHPIFGNIVDEVAYHEKYPTIYHLRKKLVDSTDKADLRLIYLALAHMIKFR
110 GHFLIEGDLNPDNSDVDKLFIQLVQTYNQLFEENPINASGVDAKAILSARLSKSR
RLENLIAQLPGEKKNGLFGNLIALSLGLTPNFKSNFDLAEDAKLQLSKDTYDDDL
DNLLAQIGDQYADLFLAAKNLSDAILLSDILRVNTEITKAPLSASMIKRYDEHHQ
DLTLLKALVRQQLPEKYKEIFFDQSKNGYAGYIDGGASQEEFYKFIKPILEKMDG
TEELLVKLNREDLLRKQRTFDNGSIPHQIHLGELHAILRRQEDFYPFLKDNREKI
EKILTFRIPYYVGPLARGNSRFAWMTRKSEETITPWNFEEVVDKGASAQSFIERM
TNFDKNLPNEKVLPKHSLLYEYFTVYNELTKVKYVTEGMRKPAFLSGEQKKAIVD
LLFKTNRKVTVKQLKEDYFKKIECFDSVEISGVEDRFNASLGTYHDLLKIIKDKD
FLDNEENEDILEDIVLTLTLFEDREMIEERLKTYAHLFDDKVMKQLKRRRYTGWG
RLSRKLINGIRDKQSGKTILDFLKSDGFANRNFMQLIHDDSLTFKEDIQKAQVSG
QGDSLHEHIANLAGSPAIKKGILQTVKVVDELVKVMGRHKPENIVIEMARENQTT
QKGQKNSRERMKRIEEGIKELGSQILKEHPVENTQLQNEKLYLYYLQNGRDMYVD
QELDINRLSDYDVDHIVPQSFLKDDSIDNKVLTRSDKNRGKSDNVPSEEVVKKMK
NYWRQLLNAKLITQRKFDNLTKAERGGLSELDKAGFIKRQLVETRQITKHVAQIL
DSRMNTKYDENDKLIREVKVITLKSKLVSDFRKDFQFYKVREINNYHHAHDAYLN
AVVGTALIKKYPKLESEFVYGDYKVYDVRKMIAKSEQEIGKATAKYFFYSNIMNF
FKTEITLANGEIRKRPLIETNGETGEIVWDKGRDFATVRKVLSMPQVNIVKKTEV
QTGGFSKESILPKRNSDKLIARKKDWDPKKYGGFDSPTVAYSVLVVAKVEKGKSK
KLKSVKELLGITIMERSSFEKNPIDFLEAKGYKEVKKDLIIKLPKYSLFELENGR
KRMLASAGELQKGNELALPSKYVNFLYLASHYEKLKGSPEDNEQKQLFVEQHKHY
LDEIIEQISEFSKRVILADANLDKVLSAYNKHRDKPIREQAENIIHLFTLTNLGA
PAAFKYFDTTIDRKRYTSTKEVLDATLIHQSITGLYETRIDLSQLGGD
SEQ PKKKRKV
ID
NO:
111
SEQ KRPAATKKAGQAKKKK
ID
NO:
112
SEQ PAAKRVKLD
ID
NO:
113
SEQ RQRRNELKRSP
ID
NO:
114
SEQ NQSSNFGPMKGGNFGGRSSGPYGGGGQYFAKPRNQGGY
ID
NO:
115
SEQ RMRIZFKNKGKDTAELRRRRVEVSVELRKAKKDEQILKRRNV
ID
NO:
116
SEQ VSRKRPRP
ID
NO:
117
SEQ PPKKARED
ID
NO:
118
SEQ PQPKKKPL
ID
NO:
119
SEQ SALIKKKKKMAP
ID
NO:
120
SEQ DRLRR
ID
NO:
121
SEQ PKQKKRK
ID
NO:
122
SEQ RKLKKKIKKL
ID
NO:
123
SEQ REKKKFLKRR
ID
NO:
124
SEQ KRKGDEVDGVDEVAKKKSKK
ID
NO:
125
SEQ RKCLQAGMNLEARKTKK
ID
NO:
126
SEQ ATGGGTAAGATGTATTATCTGGGTTTGGATATAGGCACTAACTCTGTGGGATATG
ID CAGTAACTGATCCCTCGTATCACTTGTTAAAGTTCAAAGGCGAACCCATGTGGGG
NO: AGCACATGTATTTGCTGCGGGTAATCAGAGTGCCGAAAGGCGATCTTTCAGAACA
127 TCCAGGAGGCGATTAGATAGGAGACAGCAAAGAGTAAAGCTTGTGCAAGAGATCT
TTGCTCCTGTCATTTCACCTATAGACCCTCGTTTTTTTATAAGATTGCACGAATC
GGCTCTATGGAGAGACGATGTTGCCGAAACAGATAAACATATCTTTTTCAATGAT
CCCACTTATACAGACAAGGAATACTACTCCGACTACCCGACAATTCATCATTTGA
TCGTCGATCTTATGGAGAGCTCTGAAAAGCATGACCCCCGACTTGTCTATTTGGC
TGTAGCTTGGTTAGTTGCTCATAGAGGTCATTTCTTGAATGAAGTAGATAAAGAC
AATATAGGTGATGTACTTTCTTTTGATGCTTTCTACCCGGAATTTTTGGCCTTTT
TGTCAGACAATGGCGTCAGTCCCTGGGTCTGTGAGTCGAAGGCCCTTCAAGCTAC
TCTGCTGTCTAGGAATAGCGTCAACGACAAATATAAAGCATTAAAATCGCTGATA
TTCGGATCGCAAAAACCGGAAGATAACTTTGACGCTAACATCTCTGAAGATGGTT
TAATCCAATTGCTGGCGGGTAAGAAAGTTAAAGTAAACAAACTATTCCCACAAGA
GTCCAACGATGCTAGCTTTACGTTGAATGATAAAGAAGACGCTATTGAAGAAATT
CTAGGTACTTTAACGCCTGACGAGTGCGAATGGATCGCTCATATTCGCAGATTGT
TCGATTGGGCCATCATGAAACACGCGCTAAAGGATGGCAGGACGATATCTGAATC
AAAAGTGAAGCTATACGAGCAGCATCATCATGACTTGACTCAGTTAAAGTACTTT
GTGAAGACCTACCTAGCTAAAGAGTATGATGATATCTTCAGAAACGTAGACTCCG
AGACAACTAAAAATTATGTAGCTTATTCTTACCATGTGAAGGAAGTGAAAGGCAC
ATTACCAAAAAATAAAGCAACGCAAGAAGAATTTTGTAAATACGTCCTTGGCAAA
GTCAAAAACATTGAATGTTCCGAAGCAGACAAGGTTGATTTTGATGAAATGATAC
AACGACTTACGGACAATTCTTTTATGCCAAAGCAAGTCTCAGGTGAAAATAGAGT
AATACCATACCAGTTGTACTACTATGAATTAAAGACAATTTTAAACAAAGCCGCC
TCATATCTACCTTTTTTGACACAATGCGGTAAAGATGCTATTTCTAACCAAGACA
AATTACTGTCTATAATGACATTTCGCATACCATATTTCGTCGGCCCTTTAAGGAA
AGATAATTCAGAACATGCCTGGTTGGAACGTAAAGCGGGTAAAATTTACCCGTGG
AACTTTAATGATAAAGTAGATCTTGATAAATCGGAGGAAGCCTTTATCCGTAGGA
TGACCAATACTTGCACGTATTACCCAGGAGAAGACGTGTTACCATTAGATTCACT
TATCTATGAAAAGTTTATGATCTTGAATGAGATAAACAATATTAGGATTGACGGA
TACCCCATTTCTGTTGATGTGAAACAACAAGTATTTGGTTTATTTGAGAAGAAAA
GGCGAGTAACAGTTAAGGATATTCAAAATCTACTATTATCTCTTGGAGCGTTGGA
TAAACACGGTAAGCTGACTGGTATTGACACGACAATACACTCTAATTATAACACT
TATCATCATTTTAAATCTCTTATGGAGCGGGGAGTATTGACCAGAGATGATGTGG
AAAGAATAGTGGAAAGAATGACATATTCTGACGATACTAAGAGGGTCAGACTGTG
GTTAAATAATAATTATGGAACTCTAACAGCTGACGATGTTAAGCATATCTCAAGA
CTCAGAAAACACGATTTCGGCCGTTTGTCTAAAATGTTTTTGACAGGATTGAAAG
GTGTTCATAAGGAGACAGGCGAGAGAGCAAGTATACTGGATTTTATGTGGAATAC
TAACGACAATTTAATGCAACTACTGTCCGAATGTTACACATTCTCGGATGAGATC
ACCAAATTACAAGAGGCCTACTACGCAAAAGCTCAATTATCGCTAAATGACTTCT
TGGACTCTATGTATATATCAAACGCCGTTAAGAGACCTATTTATCGGACCTTAGC
GGTAGTAAATGATATTAGAAAGGCATGCGGGACGGCACCTAAAAGAATTTTCATC
GAGATGGCGCGAGATGGAGAGTCTAAGAAGAAAAGATCTGTGACTCGTAGAGAGC
AAATTAAAAATCTCTATAGATCAATTCGTAAAGACTTTCAACAAGAAGTTGATTT
TCTGGAAAAGATATTGGAAAATAAGAGTGACGGGCAGCTTCAGTCTGACGCTTTA
TATTTGTATTTTGCTCAATTAGGCAGAGACATGTACACAGGTGATCCAATCAAAT
TAGAACATATTAAAGACCAATCTTTTTACAACATTGATCATATTTATCCTCAATC
GATGGTGAAAGATGACAGTTTGGATAACAAGGTACTAGTCCAAAGCGAAATCAAT
GGCGAAAAGAGTTCGCGCTATCCATTAGACGCAGCCATTAGAAACAAAATGAAGC
CGTTGTGGGATGCCTACTATAATCATGGATTAATTTCTCTTAAGAAATACCAGCG
TTTGACGAGATCTACTCCATTTACGGACGACGAGAAGTGGGATTTTATCAATCGT
CAGCTAGTTGAAACTAGGCAATCTACTAAAGCTTTAGCAATATTGTTAAAGCGTA
AGTTTCCAGATACTGAAATAGTTTACTCAAAGGCTGGACTATCCAGCGATTTTAG
ACATGAATTCGGCCTGGTTAAGAGTAGGAATATTAATGATCTACACCATGCTAAA
GATGCCTTTCTCGCAATAGTTACTGGGAACGTTTATCATGAAAGATTTAATAGAA
GATGGTTTATGGTTAACCAGCCATACTCTGTGAAAACTAAGACATTGTTTACCCA
TTCAATTAAGAATGGCAACTTTGTCGCTTGGAATGGAGAAGAAGATCTTGGACGT
ATCGTAAAGATGTTGAAACAAAACAAGAACACAATCCACTTCACCAGGTTTTCCT
TTGATAGGAAGGAGGGATTGTTCGATATTCAACCTCTCAAAGCTTCTACCGGATT
GGTTCCACGAAAAGCAGGGTTGGATGTTGTTAAATATGGAGGATACGATAAAAGC
ACTGCCGCGTATTATTTATTAGTACGTTTTACACTCGAGGATAAGAAGACTCAAC
ACAAATTGATGATGATTCCTGTTGAAGGTCTCTACAAAGCACGTATTGACCATGA
TAAAGAGTTTTTAACAGATTATGCTCAGACCACGATCAGCGAAATTCTTCAAAAG
GACAAGCAGAAAGTGATCAACATCATGTTCCCTATGGGCACGAGACATATCAAAC
TGAATTCGATGATTTCTATTGATGGATTCTATCTTTCTATTGGTGGGAAGAGTAG
CAAAGGTAAGTCAGTACTATGTCATGCTATGGTGCCATTAATCGTCCCACACAAG
ATAGAATGTTATATCAAGGCTATGGAATCGTTTGCAAGAAAATTCAAAGAAAATA
ATAAATTGAGGATCGTTGAAAAGTTTGATAAAATAACTGTTGAAGATAACTTGAA
CTTATACGAGCTTTTTCTACAAAAGTTGCAACATAACCCATATAATAAATTTTTC
TCTACACAATTTGATGTGTTGACGAACGGTAGAAGTACATTCACCAAATTGTCTC
CAGAGGAGCAAGTCCAGACTTTACTTAATATACTGAGTATATTTAAAACTTGTCG
TTCTTCTGGGTGTGATTTAAAATCAATAAATGGTTCCGCTCAAGCGGCTAGAATT
ATGATATCCGCTGATTTAACTGGCTTATCAAAAAAGTATTCAGATATTAGATTAG
TTGAGCAAAGCGCATCAGGTCTATTTGTTTCAAAATCTCAAAATCTCTTGGAATA
CTTGCCAAAAAAGAAAAGGAAAGTTTAG
SEQ ATGAGTAGTTTAACAAAGTTTACCAATAAATATAGTAAGCAACTAACTATAAAGA
ID ACGAATTGATACCGGTCGGTAAGACTTTGGAAAACATAAAAGAAAATGGGTTGAT
NO: TGATGGAGACGAGCAATTGAATGAGAATTATCAAAAAGCAAAGATAATAGTAGAT
128 GATTTTTTGAGAGACTTTATTAATAAAGCTCTAAATAACACTCAAATTGGTAACT
GGAGAGAGCTAGCCGACGCCTTGAACAAGGAAGATGAGGATAATATTGAGAAATT
ACAAGATAAGATTAGAGGGATTATCGTGTCTAAGTTTGAGACTTTTGATCTGTTC
AGTTCGTATTCGATTAAAAAGGACGAGAAAATCATCGATGATGATAACGATGTGG
AAGAAGAGGAGCTAGACCTTGGGAAGAAGACATCTAGCTTCAAATACATATTCAA
GAAAAATTTGTTCAAACTTGTCCTTCCTTCATATTTAAAAACAACAAATCAAGAT
AAGTTAAAAATCATTTCTTCCTTCGATAATTTTAGTACTTATTTTCGTGGTTTTT
TCGAAAACAGGAAAAATATATTCACTAAAAAGCCTATATCTACCTCTATAGCTTA
TAGAATTGTTCACGATAATTTCCCAAAATTTCTAGATAATATCAGGTGTTTTAAT
GTTTGGCAAACCGAGTGTCCTCAGTTAATAGTCAAGGCCGACAACTACCTTAAAA
GCAAGAATGTGATTGCAAAAGATAAGTCTTTGGCTAACTATTTTACAGTCGGTGC
CTATGATTATTTTCTGAGTCAAAATGGTATCGATTTCTATAACAACATTATTGGC
GGCTTACCAGCTTTTGCCGGGCATGAGAAGATTCAGGGTTTGAACGAATTTATCA
ATCAAGAATGTCAAAAGGATTCTGAATTAAAGTCTAAGCTCAAGAATAGGCACGC
TTTCAAAATGGCAGTCTTATTCAAACAAATCCTTTCAGACAGAGAAAAGTCATTT
GTGATTGACGAGTTCGAATCAGACGCTCAGGTAATTGATGCTGTTAAAAATTTTT
ACGCGGAACAATGCAAAGATAATAACGTCATATTTAATTTATTGAATCTGATCAA
GAATATTGCTTTTTTGTCGGATGATGAGTTAGACGGCATTTTCATAGAGGGTAAA
TACCTGTCCTCTGTGTCTCAAAAATTGTATAGTGATTGGTCAAAGTTGAGAAATG
ATATTGAAGATTCGGCTAATTCTAAACAGGGTAACAAAGAATTAGCGAAGAAAAT
CAAAACTAACAAGGGTGATGTTGAAAAGGCTATAAGTAAGTACGAGTTCAGTTTA
TCTGAACTAAATTCAATTGTTCATGATAACACAAAATTTTCCGATCTTTTATCAT
GCACATTACATAAAGTTGCAAGTGAAAAATTAGTCAAAGTAAACGAAGGTGATTG
GCCAAAACATCTAAAAAACAACGAGGAAAAACAGAAGATAAAAGAACCTCTTGAC
GCTTTATTGGAAATATACAATACTCTATTAATATTTAACTGTAAAAGTTTTAACA
AAAATGGTAATTTCTATGTCGACTACGATCGCTGCATTAATGAGTTGTCCAGTGT
TGTGTACTTGTATAATAAAACTCGTAATTATTGTACGAAAAAGCCGTACAACACT
GACAAATTTAAGTTGAATTTCAACTCCCCACAACTGGGTGAGGGCTTCTCTAAAA
GTAAAGAGAATGATTGCCTTACATTATTATTTAAAAAAGATGATAATTATTATGT
CGGAATCATAAGAAAGGGGGCAAAGATCAACTTCGATGACACTCAGGCCATAGCA
GACAACACAGATAACTGTATATTCAAAATGAATTATTTTTTGCTGAAGGATGCTA
AAAAATTTATCCCCAAATGTTCAATACAATTAAAAGAGGTTAAGGCCCATTTCAA
AAAGTCGGAAGATGACTATATTTTGTCCGATAAGGAAAAATTCGCTAGTCCGCTT
GTTATTAAAAAATCCACATTTCTTCTCGCTACGGCTCATGTGAAAGGAAAGAAGG
GCAATATTAAGAAATTTCAGAAAGAATACTCCAAAGAAAATCCTACGGAGTATAG
AAATAGTCTGAACGAATGGATAGCATTCTGCAAAGAGTTCTTGAAGACCTATAAA
GCTGCCACCATCTTTGATATTACAACTTTGAAAAAGGCCGAGGAATACGCTGACA
TTGTGGAATTCTATAAGGATGTAGATAATCTTTGTTACAAGTTAGAATTTTGCCC
TATCAAAACTTCTTTTATCGAAAATCTTATAGATAATGGCGATTTATACCTGTTT
AGAATTAATAACAAGGACTTTTCTTCAAAAAGTACAGGCACGAAAAACTTACACA
CATTATACTTGCAGGCTATATTTGACGAGCGAAACTTAAACAACCCCACGATAAT
GTTGAATGGAGGTGCAGAGTTATTCTACAGAAAAGAATCTATAGAACAGAAAAAT
CGGATCACGCACAAAGCCGGTAGTATCTTAGTGAATAAAGTGTGCAAAGATGGTA
CAAGTCTAGATGACAAAATCCGTAACGAAATTTACCAGTATGAAAACAAATTCAT
TGATACTCTTTCGGACGAAGCTAAAAAGGTTCTGCCAAACGTTATTAAGAAAGAG
GCTACGCATGATATAACAAAAGATAAACGTTTCACTAGCGACAAATTCTTCTTTC
ATTGTCCTTTAACAATCAACTACAAGGAAGGTGACACCAAACAATTTAATAATGA
AGTGCTCTCATTCCTTAGAGGTAACCCCGATATCAATATTATCGGCATTGATAGA
GGAGAAAGAAACCTAATCTATGTAACAGTCATTAACCAAAAAGGCGAAATATTGG
ATAGCGTCTCCTTCAATACTGTCACCAATAAGTCATCGAAGATAGAACAAACTGT
TGATTACGAAGAAAAATTGGCCGTTAGAGAAAAGGAACGTATCGAAGCGAAGAGA
TCTTGGGATAGCATATCCAAGATTGCCACCTTGAAGGAGGGTTATCTAAGCGCGA
TCGTACATGAAATCTGCTTATTAATGATTAAGCATAATGCTATTGTCGTGTTAGA
AAACCTGAATGCCGGTTTTAAAAGGATTAGAGGTGGTTTGTCAGAAAAGTCAGTA
TATCAAAAGTTTGAAAAGATGCTTATTAATAAACTCAACTACTTCGTTAGCAAGA
AAGAAAGTGATTGGAATAAACCGTCAGGTTTGCTCAATGGTCTTCAGTTAAGTGA
TCAATTTGAGTCTTTCGAAAAATTAGGAATTCAAAGTGGATTCATTTTTTATGTA
CCAGCCGCGTACACTTCAAAAATTGACCCTACGACCGGATTTGCCAACGTCTTGA
ATTTGTCCAAGGTCAGAAATGTTGACGCCATCAAAAGTTTTTTTAGCAACTTCAA
TGAAATCTCTTATTCCAAAAAGGAAGCCCTTTTCAAGTTTTCTTTTGACCTAGAC
TCGTTATCGAAGAAAGGATTTTCATCTTTCGTAAAGTTTAGCAAGTCCAAGTGGA
ATGTATACACATTCGGCGAGAGAATTATCAAGCCCAAGAACAAACAGGGCTATAG
AGAAGACAAGAGAATCAACTTGACTTTTGAGATGAAAAAATTACTCAACGAATAC
AAGGTTTCATTTGATTTGGAGAACAACTTGATTCCCAATTTGACATCAGCTAACT
TGAAGGATACGTTCTGGAAGGAGTTATTCTTTATATTCAAAACGACATTACAACT
GCGTAATAGTGTTACAAACGGTAAAGAAGATGTATTAATCTCACCTGTAAAGAAT
GCCAAAGGAGAATTTTTCGTATCCGGTACTCACAATAAGACACTACCACAGGATT
GCGACGCTAACGGTGCGTATCATATTGCGTTGAAAGGATTAATGATACTTGAAAG
AAATAACCTTGTTCGCGAAGAAAAAGACACCAAGAAGATCATGGCTATTAGCAAT
GTTGATTGGTTTGAATACGTGCAAAAGAGGAGAGGTGTTTTGTAA
SEQ ATGAACAATTATGACGAGTTCACAAAGCTATACCCTATCCAAAAAACTATCAGGT
ID TCGAATTGAAACCACAAGGGAGAACAATGGAACATCTGGAGACATTCAACTTTTT
NO: TGAAGAGGACAGAGACAGAGCGGAGAAATACAAAATTTTAAAAGAGGCCATCGAT
129 GAATATCACAAAAAGTTTATCGACGAGCATTTAACAAACATGTCTTTGGACTGGA
ATTCACTTAAACAAATTTCTGAGAAATATTATAAGTCTCGGGAGGAAAAAGACAA
AAAGGTCTTTTTGTCCGAGCAAAAGAGAATGAGACAAGAAATTGTCTCGGAGTTT
AAAAAAGATGATCGGTTCAAAGATTTGTTTAGCAAGAAATTGTTTTCTGAATTGT
TGAAGGAGGAGATATACAAGAAAGGCAACCATCAAGAAATAGATGCTTTGAAATC
GTTTGACAAGTTCAGCGGTTACTTCATTGGTTTACATGAAAATAGGAAGAACATG
TATAGCGACGGCGATGAGATCACCGCTATATCGAATAGAATCGTTAACGAAAATT
TTCCGAAATTTTTGGATAATTTGCAAAAATACCAGGAAGCTAGGAAAAAGTACCC
TGAATGGATAATAAAGGCGGAATCAGCTTTGGTGGCTCACAACATAAAGATGGAT
GAAGTCTTCTCGCTGGAATATTTTAACAAAGTATTAAATCAGGAAGGAATCCAAA
GATACAACTTAGCCTTGGGTGGATACGTAACCAAATCAGGTGAGAAAATGATGGG
CTTAAATGATGCACTTAATCTAGCTCACCAATCCGAAAAGTCCTCTAAAGGGAGG
ATACACATGACACCATTGTTTAAGCAAATCCTTTCGGAGAAAGAATCTTTTTCAT
ATATCCCCGATGTTTTCACTGAGGATAGTCAATTGTTGCCCAGCATTGGTGGATT
TTTTGCACAAATAGAAAATGATAAAGATGGTAACATCTTCGATAGAGCCTTGGAA
TTGATAAGCTCCTATGCAGAATACGATACGGAACGAATATACATTAGACAAGCTG
ACATCAACAGAGTAAGCAATGTTATTTTTGGTGAGTGGGGAACTTTAGGTGGATT
AATGCGGGAGTACAAAGCTGACTCAATCAATGATATTAATTTGGAACGTACGTGC
AAAAAAGTCGATAAGTGGCTTGATAGTAAGGAGTTTGCTCTGTCGGATGTACTAG
AAGCAATTAAGAGAACAGGAAACAATGATGCATTTAATGAATATATTAGTAAAAT
GAGGACGGCTAGAGAAAAGATAGACGCCGCACGTAAGGAAATGAAGTTTATTTCC
GAGAAAATATCTGGCGATGAAGAGTCGATTCACATCATCAAGACCCTACTCGATT
CTGTTCAGCAATTTCTCCATTTTTTTAACCTCTTCAAAGCAAGACAAGACATTCC
CTTAGATGGGGCTTTTTATGCCGAATTTGATGAAGTTCATTCAAAGTTGTTTGCT
ATTGTTCCTCTTTACAATAAGGTCCGTAATTACCTTACTAAAAATAACTTGAACA
CCAAGAAAATAAAGTTAAACTTCAAGAATCCGACTCTTGCCAACGGGTGGGATCA
GAATAAAGTTTATGATTATGCTAGCTTAATATTTCTAAGAGATGGGAATTATTAC
TTAGGAATCATCAATCCAAAGCGTAAGAAAAACATTAAATTTGAACAAGGGTCAG
GCAATGGCCCATTCTATAGAAAAATGGTGTATAAGCAAATACCAGGACCTAACAA
GAACTTGCCTCGCGTATTTTTAACTTCAACAAAGGGTAAAAAAGAATATAAACCA
AGCAAAGAAATTATTGAAGGTTACGAAGCAGATAAACACATCAGAGGTGATAAGT
TCGATCTGGATTTCTGCCATAAATTGATTGACTTTTTTAAGGAATCTATAGAAAA
ACATAAGGACTGGTCCAAATTTAATTTCTACTTCTCACCTACAGAAAGTTATGGT
GACATTTCAGAATTTTATTTAGACGTTGAGAAACAAGGATATAGGATGCATTTTG
AAAATATTTCAGCGGAAACCATCGACGAATACGTTGAGAAGGGTGATTTATTCTT
GTTCCAAATTTACAATAAAGACTTCGTTAAAGCTGCAACCGGAAAGAAGGATATG
CATACCATATATTGGAACGCTGCATTCTCGCCAGAAAACTTACAAGATGTCGTTG
TAAAGCTTAATGGAGAAGCTGAGCTGTTCTATAGAGACAAGAGTGATATAAAAGA
GATTGTGCATCGGGAAGGTGAAATTCTGGTGAACAGAACTTACAATGGTCGTACA
CCCGTTCCAGACAAAATACATAAAAAACTGACCGATTATCATAATGGTAGGACAA
AGGACTTGGGCGAGGCCAAGGAGTACCTCGATAAAGTTAGATATTTCAAGGCACA
CTATGATATTACGAAAGACAGGAGATATTTAAACGATAAAATTTACTTTCATGTC
CCTTTGACCCTTAACTTTAAAGCTAATGGTAAAAAGAATTTGAACAAAATGGTAA
TTGAGAAGTTTTTATCGGACGAAAAAGCTCACATAATCGGAATCGACCGCGGAGA
GAGAAATTTACTGTATTATAGTATCATCGACAGAAGTGGAAAGATTATTGATCAG
CAATCTTTGAACGTCATTGATGGGTTTGACTATCGGGAAAAGTTAAATCAAAGGG
AAATTGAAATGAAGGATGCGAGACAATCATGGAATGCCATTGGTAAAATTAAAGA
TCTCAAGGAGGGGTACTTATCAAAAGCTGTACACGAGATAACTAAAATGGCTATC
CAATATAATGCAATTGTTGTAATGGAAGAATTGAATTATGGTTTTAAACGCGGCA
GGTTTAAAGTCGAAAAACAAATATACCAAAAGTTTGAAAACATGTTAATTGATAA
GATGAACTATCTTGTTTTCAAAGATGCACCTGATGAGAGTCCTGGCGGTGTGCTG
AACGCCTATCAATTAACAAACCCATTAGAGTCCTTTGCTAAACTGGGTAAACAAA
CTGGCATTCTATTTTATGTTCCAGCCGCTTACACCTCAAAGATCGATCCAACGAC
CGGTTTTGTAAACTTATTTAATACTTCTTCCAAAACAAACGCGCAAGAACGCAAA
GAATTCCTACAAAAATTTGAATCAATATCCTATAGCGCAAAAGATGGAGGTATAT
TCGCTTTCGCTTTTGACTACAGAAAGTTTGGCACTTCCAAGACAGATCATAAAAA
TGTGTGGACCGCTTATACCAACGGAGAAAGGATGCGTTATATTAAAGAAAAAAAG
AGGAACGAACTATTTGATCCATCGAAAGAAATTAAAGAAGCTTTGACAAGCAGCG
GAATCAAATATGATGGAGGTCAAAACATACTTCCAGATATTCTCAGATCTAATAA
TAACGGTCTTATTTACACGATGTATTCATCTTTTATCGCTGCCATCCAAATGCGT
GTGTATGATGGCAAGGAAGATTATATTATATCTCCTATTAAAAATTCAAAGGGTG
AATTTTTTCGCACGGATCCAAAAAGAAGAGAGCTTCCAATTGACGCCGATGCTAA
CGGTGCTTACAATATTGCATTGCGTGGTGAACTTACTATGAGAGCCATCGCCGAA
AAGTTTGATCCGGACAGTGAAAAAATGGCGAAATTGGAGCTAAAGCACAAGGATT
GGTTTGAATTCATGCAGACCCGTGGCGATTGA
SEQ ATGACTAAAACGTTCGACTCCGAGTTTTTTAATCTCTATTCCTTGCAAAAGACCG
ID TTAGGTTTGAATTGAAACCAGTTGGTGAAACTGCCTCATTTGTCGAAGACTTTAA
NO: AAACGAGGGATTGAAAAGAGTGGTTAGTGAAGATGAAAGAAGGGCAGTAGACTAT
130 CAAAAGGTTAAAGAAATCATTGACGATTACCACAGAGATTTTATAGAAGAATCTC
TGAACTATTTTCCAGAGCAGGTTTCAAAAGATGCTCTAGAGCAAGCGTTTCATTT
GTATCAAAAGTTGAAAGCAGCGAAGGTGGAAGAAAGGGAAAAAGCTTTAAAAGAA
TGGGAAGCATTACAGAAAAAATTGCGAGAAAAAGTCGTCAAATGTTTCAGCGACT
CTAATAAAGCTCGCTTTTCTAGAATCGATAAAAAAGAATTGATTAAGGAAGATTT
AATAAATTGGCTGGTAGCACAAAACAGAGAGGATGATATTCCTACTGTTGAAACG
TTCAATAATTTTACTACTTACTTCACTGGTTTCCATGAGAACAGGAAGAATATTT
ACTCTAAAGATGATCACGCTACTGCTATAAGTTTTAGGTTGATTCACGAAAACTT
GCCTAAATTTTTTGACAATGTCATCAGTTTTAACAAGTTGAAAGAAGGTTTCCCG
GAATTAAAATTCGACAAAGTTAAAGAAGATTTAGAAGTAGATTACGACTTGAAGC
ATGCGTTTGAAATTGAATATTTCGTTAATTTCGTCACACAAGCTGGTATCGACCA
ATATAATTACCTGCTTGGAGGCAAAACTCTAGAAGACGGTACGAAGAAACAAGGA
ATGAATGAACAGATTAATTTATTTAAGCAACAACAAACTCGCGATAAAGCTAGAC
AGATTCCAAAACTGATTCCACTTTTCAAACAGATTCTATCTGAGAGAACTGAATC
TCAGAGTTTTATCCCTAAGCAGTTCGAGTCTGATCAGGAACTATTCGATTCCCTG
CAGAAATTGCATAACAACTGTCAAGATAAGTTTACCGTTTTGCAACAGGCGATCT
TGGGATTGGCTGAGGCAGATCTTAAAAAGGTCTTTATTAAAACTAGTGATCTAAA
CGCATTGTCTAACACTATTTTTGGAAATTATTCTGTGTTCTCAGACGCGCTCAAT
TTATATAAAGAGTCGCTAAAAACTAAAAAGGCTCAAGAAGCTTTTGAAAAGTTGC
CTGCACATAGTATTCATGATTTAATCCAATACTTAGAACAATTTAATTCGTCTCT
CGATGCTGAAAAGCAACAGTCTACCGATACTGTATTAAACTACTTTATTAAAACC
GACGAATTATATAGTCGTTTCATTAAATCCACCTCTGAGGCATTCACCCAAGTAC
AACCTCTCTTTGAACTGGAAGCTTTGAGCTCCAAAAGAAGACCCCCAGAAAGTGA
AGATGAGGGGGCTAAAGGCCAAGAAGGTTTCGAACAAATTAAGAGAATCAAAGCT
TATCTAGACACTCTAATGGAGGCTGTCCACTTTGCTAAGCCTTTGTATCTTGTCA
AGGGTAGAAAGATGATAGAGGGTCTAGACAAGGATCAAAGCTTCTACGAAGCGTT
TGAAATGGCCTACCAGGAGTTGGAGTCTTTAATCATCCCCATTTACAATAAGGCC
AGATCTTACCTGTCTAGGAAGCCATTTAAAGCGGATAAATTCAAAATTAATTTTG
ACAATAATACACTTCTATCTGGGTGGGATGCTAACAAGGAGACGGCTAACGCCAG
CATATTGTTTAAGAAGGATGGTTTATACTACCTGGGAATCATGCCAAAAGGCAAA
ACTTTCTTGTTCGATTATTTCGTTAGTTCAGAAGATTCTGAAAAGTTGAAACAAC
GGAGACAGAAAACCGCAGAGGAAGCGCTCGCACAGGATGGAGAATCCTATTTTGA
AAAAATACGGTATAAACTCCTACCAGGTGCTAGTAAGATGTTGCCAAAGGTATTT
TTTAGCAATAAAAATATTGGGTTTTACAATCCCTCAGATGATATTCTACGAATTC
GGAATACGGCCTCTCATACTAAGAATGGTACTCCCCAGAAGGGTCATTCCAAGGT
AGAATTTAACTTGAATGACTGTCACAAAATGATTGATTTTTTTAAATCTTCCATA
CAGAAACATCCCGAGTGGGGATCCTTTGGTTTCACTTTTTCTGATACGTCGGACT
TTGAAGATATGAGTGCTTTCTACCGAGAAGTTGAAAATCAAGGTTACGTTATAAG
TTTTGATAAAATAAAAGAAACTTACATTCAGTCTCAAGTTGAGCAAGGTAACTTA
TATTTATTTCAAATTTACAACAAAGATTTTAGTCCGTATTCAAAGGGAAAGCCAA
ACCTGCACACTTTATACTGGAAAGCTCTGTTTGAAGAGGCTAATTTGAATAACGT
AGTGGCTAAGCTAAACGGCGAAGCAGAAATCTTTTTCAGAAGACACAGTATCAAA
GCATCTGATAAAGTGGTACATCCTGCTAATCAAGCTATAGATAATAAGAATCCCC
ATACTGAGAAGACGCAGTCCACATTTGAATATGACTTGGTCAAAGACAAAAGATA
TACCCAAGACAAATTTTTTTTTCATGTACCGATATCTTTAAACTTTAAGGCTCAG
GGCGTTTCAAAGTTTAATGATAAGGTAAATGGATTCTTAAAGGGCAATCCCGACG
TTAATATAATCGGTATAGATCGAGGTGAGAGACATCTTTTATACTTTACCGTGGT
GAATCAAAAAGGAGAAATATTAGTGCAAGAGTCCTTGAATACATTAATGTCTGAC
AAGGGTCATGTCAACGATTATCAACAGAAATTGGACAAGAAGGAACAGGAAAGGG
ACGCTGCCAGGAAGTCCTGGACGACAGTAGAAAATATTAAAGAATTAAAAGAAGG
TTATTTATCACATGTGGTTCATAAACTTGCACATTTAATCATCAAATATAACGCA
ATAGTGTGCTTGGAAGATCTTAATTTTGGCTTCAAGAGGGGTAGGTTCAAGGTCG
AAAAACAGGTCTACCAGAAGTTCGAGAAAGCTCTGATCGATAAATTGAATTATCT
TGTTTTCAAAGAAAAAGAATTAGGAGAAGTTGGTCATTATCTTACAGCATACCAA
CTCACTGCACCATTTGAAAGCTTCAAAAAGCTAGGCAAGCAATCTGGGATTTTGT
TCTATGTTCCGGCTGATTATACATCAAAGATAGATCCTACCACAGGCTTTGTAAA
TTTTTTAGATCTTAGGTACCAATCCGTTGAAAAAGCTAAACAGTTGCTGTCCGAT
TTTAATGCGATAAGATTTAATAGTGTTCAGAATTATTTTGAGTTCGAAATTGATT
ATAAAAAATTGACACCAAAACGTAAAGTAGGAACACAATCTAAATGGGTTATTTG
TACCTATGGAGATGTTAGATACCAAAACAGAAGAAATCAGAAAGGTCACTGGGAA
ACTGAAGAAGTTAACGTTACTGAAAAACTTAAAGCTCTATTTGCGAGCGATTCAA
AAACGACGACGGTGATCGATTATGCAAATGATGATAACCTTATTGATGTAATTCT
GGAACAAGATAAGGCATCATTTTTTAAAGAACTACTATGGTTGTTAAAGCTAACC
ATGACCCTAAGGCACTCCAAGATAAAGTCAGAGGATGATTTTATCCTCTCTCCAG
TGAAAAACGAACAAGGTGAGTTTTACGACTCAAGAAAGGCGGGTGAAGTCTGGCC
TAAGGATGCTGATGCCAATGGAGCTTATCACATCGCTCTGAAGGGGCTATGGAAC
TTACAGCAAATTAACCAATGGGAAAAAGGTAAAACTTTAAACCTCGCCATAAAGA
ACCAGGATTGGTTCAGCTTTATCCAAGAAAAACCATATCAAGAATAA
SEQ ATGCACACAGGAGGTCTACTCTCGATGGATGCTAAGGAATTTACCGGTCAATATC
ID CGCTGTCCAAAACTTTGCGTTTTGAGCTTAGACCTATTGGCCGAACGTGGGATAA
NO: CCTAGAGGCTTCTGGTTATTTGGCGGAAGATAGACATAGAGCTGAGTGTTATCCC
131 CGAGCTAAAGAATTGCTGGATGATAACCACAGGGCGTTCCTGAATAGAGTTCTAC
CGCAAATCGATATGGATTGGCATCCAATTGCTGAAGCTTTCTGCAAGGTGCACAA
AAATCCAGGTAATAAAGAATTGGCTCAGGATTATAATTTGCAGCTTAGTAAGAGA
AGAAAAGAAATTTCCGCTTATTTGCAGGATGCTGATGGATACAAGGGGTTGTTCG
CGAAACCTGCCCTGGACGAAGCTATGAAAATAGCTAAGGAAAACGGCAATGAATC
TGATATTGAAGTTTTGGAAGCCTTCAATGGATTTTCCGTTTATTTCACTGGTTAT
CATGAGAGTAGGGAGAATATATACTCAGACGAAGATATGGTATCCGTCGCCTATC
GCATAACTGAAGATAATTTTCCAAGGTTCGTGTCGAACGCGTTAATTTTTGATAA
ACTAAATGAATCGCACCCGGATATTATTTCGGAAGTGTCCGGTAATCTGGGGGTA
GACGATATTGGTAAATATTTTGATGTGTCCAACTACAATAATTTCCTTAGTCAAG
CAGGAATTGATGACTACAACCATATTATAGGAGGGCATACAACTGAAGACGGTCT
CATTCAAGCTTTTAACGTAGTGTTAAACCTAAGGCACCAAAAAGACCCAGGTTTT
GAGAAAATTCAATTTAAGCAACTCTACAAGCAGATACTGAGCGTTAGGACTAGTA
AGTCATATATCCCAAAGCAATTCGATAACTCAAAGGAAATGGTCGACTGTATATG
CGACTACGTCTCAAAAATAGAAAAATCTGAAACAGTAGAAAGAGCTCTGAAATTG
GTAAGAAATATATCTTCTTTTGATTTAAGAGGTATTTTCGTAAATAAAAAAAACC
TTCGAATTTTGTCTAATAAGTTAATTGGAGACTGGGACGCAATAGAGACAGCTTT
GATGCACAGTTCCAGCAGTGAAAACGATAAGAAATCAGTGTATGACTCTGCAGAG
GCATTCACCCTTGATGATATCTTCAGTTCTGTGAAAAAGTTCAGCGACGCCTCCG
CTGAGGATATAGGAAACCGCGCTGAAGACATATGTCGTGTTATCTCAGAAACAGC
TCCTTTCATTAACGACTTAAGGGCTGTAGATTTGGATTCTTTAAATGATGACGGC
TATGAAGCGGCCGTGTCTAAAATACGGGAATCTCTTGAACCCTACATGGATCTAT
TTCACGAATTGGAGATCTTTAGCGTGGGTGATGAGTTTCCTAAATGTGCTGCCTT
TTATAGCGAGTTGGAAGAGGTCTCAGAACAACTGATTGAAATCATTCCTTTATTT
AACAAAGCAAGAAGTTTTTGCACAAGGAAAAGGTATTCAACCGACAAAATCAAAG
TCAATTTAAAATTCCCTACTCTGGCAGATGGATGGGATCTAAATAAAGAAAGGGA
TAACAAAGCCGCAATTCTAAGAAAAGACGGTAAATACTACCTGGCAATTTTAGAC
ATGAAGAAAGATCTCAGTAGTATTCGTACGAGCGATGAGGACGAGTCTTCTTTTG
AAAAGATGGAATATAAATTGCTCCCTTCTCCTGTGAAAATGCTTCCAAAAATTTT
TGTTAAATCGAAAGCCGCCAAAGAAAAGTACGGGTTGACCGATAGAATGTTAGAA
TGCTACGATAAAGGTATGCATAAGTCGGGTAGTGCTTTTGATTTGGGTTTTTGTC
ATGAATTGATCGATTACTATAAGCGCTGCATTGCCGAGTACCCAGGCTGGGATGT
TTTCGACTTTAAATTTCGTGAGACAAGCGATTACGGATCCATGAAAGAATTTAAT
GAAGACGTCGCTGGCGCAGGTTACTATATGTCACTTAGAAAGATTCCATGTTCCG
AAGTTTATCGTTTACTGGACGAGAAGTCAATTTACTTGTTTCAAATATATAATAA
GGATTATAGCGAAAACGCACATGGGAATAAGAATATGCATACGATGTATTGGGAG
GGCTTGTTCTCACCACAAAATTTGGAATCACCAGTCTTCAAATTGTCCGGAGGCG
CAGAACTTTTTTTCAGAAAGTCATCTATTCCTAATGACGCTAAAACGGTACATCC
GAAAGGTTCAGTTCTTGTTCCCAGAAACGACGTCAATGGTAGAAGAATACCAGAC
TCGATCTACAGAGAGTTGACAAGGTATTTTAACCGTGGGGATTGCAGGATCAGTG
ATGAAGCTAAGTCTTACCTGGACAAGGTCAAGACAAAAAAAGCGGACCATGACAT
TGTTAAGGATAGAAGATTTACTGTAGATAAGATGATGTTCCATGTTCCGATTGCC
ATGAATTTTAAAGCTATAAGTAAACCAAATCTTAATAAGAAAGTTATTGATGGCA
TAATAGATGATCAAGATTTGAAAATCATCGGTATCGATCGTGGTGAGAGAAATCT
TATTTATGTGACCATGGTCGATAGGAAGGGGAATATATTGTATCAAGACAGTCTT
AATATTTTAAATGGATACGATTACCGCAAAGCTTTAGACGTGAGGGAATATGATA
ACAAAGAAGCTAGAAGGAATTGGACTAAAGTAGAAGGTATTAGAAAAATGAAAGA
AGGTTATTTATCTTTAGCTGTTAGTAAATTGGCCGATATGATCATCGAAAATAAT
GCTATAATCGTAATGGAAGATTTGAATCACGGGTTTAAGGCAGGTCGTTCCAAAA
TTGAAAAGCAGGTGTATCAAAAATTCGAATCAATGTTAATCAACAAGTTAGGATA
CATGGTGCTAAAAGACAAGTCCATTGACCAGTCTGGTGGAGCCCTTCATGGTTAC
CAATTAGCCAATCATGTTACGACCTTAGCTAGCGTGGGTAAACAATGTGGAGTAA
TTTTTTACATACCTGCAGCTTTTACTTCGAAGATTGATCCCACCACGGGCTTTGC
TGATTTATTCGCTCTCTCTAATGTGAAGAATGTCGCTTCTATGAGAGAGTTCTTC
TCCAAAATGAAGTCAGTAATATATGACAAGGCGGAAGGCAAATTCGCCTTTACAT
TTGATTATTTGGATTATAACGTTAAAAGCGAATGTGGACGTACCTTATGGACTGT
GTATACAGTTGGTGAACGCTTCACCTACTCTAGAGTAAACCGAGAGTATGTTCGG
AAAGTCCCAACAGATATCATCTATGATGCATTACAAAAAGCTGGTATTAGCGTCG
AAGGTGACCTTAGAGATAGAATCGCGGAAAGCGACGGTGACACATTAAAGTCTAT
ATTCTACGCTTTTAAATACGCGTTGGATATGAGAGTCGAAAACAGAGAGGAAGAC
TATATACAGTCACCTGTGAAGAATGCTTCTGGTGAGTTCTTTTGTTCAAAAAACG
CCGGAAAGTCTTTGCCGCAGGATTCAGATGCAAATGGTGCCTATAATATAGCTCT
GAAAGGGATCCTACAACTCAGAATGTTGAGCGAACAATACGATCCAAATGCAGAA
TCGATTAGATTGCCACTTATAACTAACAAGGCATGGTTAACTTTTATGCAATCCG
GTATGAAAACTTGGAAGAATTAA
SEQ ATGGATTCTCTTAAGGATTTCACTAATTTATATCCAGTCTCGAAAACATTGCGGT
ID TCGAATTGAAACCAGTTGGGAAAACTCTAGAAAACATTGAAAAAGCCGGTATATT
NO: GAAAGAAGATGAACACAGAGCGGAATCCTACCGCCGGGTAAAAAAGATAATTGAC
132 ACATACCATAAAGTGTTTATTGACAGCTCCTTAGAGAACATGGCTAAAATGGGGA
TAGAAAATGAAATCAAGGCTATGCTGCAGTCTTTTTGTGAACTCTATAAGAAAGA
CCACAGGACAGAAGGAGAAGATAAAGCTCTTGATAAAATTAGAGCTGTTCTTAGA
GGTTTAATCGTTGGGGCTTTCACTGGTGTATGTGGAAGACGAGAAAACACAGTAC
AAAATGAAAAGTACGAGAGTTTGTTCAAAGAAAAATTGATAAAGGAAATTTTGCC
AGATTTCGTGTTGTCCACCGAGGCTGAGTCTCTTCCATTCAGCGTTGAAGAAGCA
ACAAGGAGCTTAAAAGAGTTTGACTCATTCACTTCTTATTTTGCTGGTTTTTACG
AAAATAGAAAGAATATTTATTCCACGAAACCGCAAAGTACTGCGATAGCCTACAG
ATTAATTCATGAAAACTTGCCTAAATTTATAGATAATATTTTGGTCTTCCAGAAG
ATTAAAGAACCAATCGCTAAAGAACTTGAGCACATAAGAGCAGATTTTAGCGCAG
GCGGATATATCAAAAAAGATGAACGGCTAGAAGACATATTCTCATTAAATTACTA
CATTCATGTCCTTTCTCAAGCTGGTATAGAAAAATATAATGCTTTAATCGGGAAG
ATAGTGACGGAAGGTGATGGTGAAATGAAAGGTCTTAATGAACATATTAACTTAT
ATAACCAACAGAGGGGTCGAGAGGATAGGTTGCCCTTGTTTAGGCCTCTATACAA
GCAAATCCTGTCCGATAGAGAGCAATTGTCTTATTTACCTGAATCATTTGAAAAA
GATGAAGAGCTGCTTAGAGCACTTAAGGAATTTTACGATCACATCGCCGAAGACA
TCTTGGGTAGAACACAGCAATTGATGACTTCAATTTCTGAATACGACTTGTCCCG
TATTTATGTCAGAAATGATTCTCAACTTACAGACATCTCGAAGAAAATGCTAGGA
GATTGGAACGCCATTTATATGGCTAGAGAACGAGCCTACGACCACGAACAGGCTC
CTAAACGTATTACTGCTAAATACGAACGTGATAGAATCAAGGCCTTAAAAGGTGA
AGAGTCAATTTCATTGGCGAATCTGAACAGCTGTATAGCTTTCTTGGACAATGTA
AGGGATTGTCGAGTTGACACATACCTATCAACTTTGGGGCAGAAAGAGGGTCCTC
ATGGCTTAAGTAACTTGGTGGAAAACGTCTTCGCCTCATATCATGAAGCAGAACA
GTTATTGTCGTTTCCTTACCCCGAAGAGAACAACCTTATTCAGGACAAAGACAAT
GTAGTTTTGATCAAAAACCTATTGGATAATATAAGTGATTTACAACGTTTCCTTA
AACCTTTGTGGGGAATGGGCGATGAACCTGACAAAGACGAAAGGTTTTACGGTGA
ATACAACTATATTAGAGGAGCGCTTGACCAGGTAATACCTTTGTACAATAAAGTA
AGGAACTACTTGACTCGTAAACCATATTCTACTAGAAAAGTTAAATTGAACTTTG
GTAATTCACAGCTGCTGAGTGGTTGGGATCGTAATAAAGAAAAAGATAACTCCTG
TGTTATCTTGCGAAAAGGACAAAACTTTTACTTGGCAATTATGAACAACCGTCAC
AAAAGGTCCTTCGAGAACAAAGTTCTGCCTGAATACAAAGAAGGTGAACCATATT
TTGAAAAAATGGACTATAAATTCCTGCCAGATCCTAATAAAATGTTGCCTAAGGT
CTTCTTGTCTAAAAAAGGTATAGAAATATATAAACCATCCCCGAAGTTGCTGGAG
CAATATGGTCATGGAACGCACAAAAAAGGTGACACTTTTAGTATGGATGACTTGC
ACGAGTTGATTGATTTTTTTAAACATTCCATTGAAGCGCACGAAGATTGGAAACA
ATTTGGTTTCAAGTTCTCTGACACAGCCACTTACGAAAATGTATCGTCCTTTTAT
AGAGAAGTGGAAGATCAGGGTTATAAACTGTCATTCCGTAAGGTTAGTGAAAGCT
ATGTGTACTCGTTGATCGATCAAGGGAAGCTTTATCTTTTTCAAATCTATAATAA
AGATTTCTCTCCTTGTTCAAAGGGCACACCTAATCTTCATACACTATACTGGAGA
ATGCTTTTCGATGAAAGAAATTTGGCTGATGTGATCTATAAATTAGACGGTAAAG
CTGAGATTTTTTTCAGAGAGAAATCCCTGAAAAACGACCATCCAACTCATCCGGC
AGGTAAACCGATTAAAAAGAAATCCCGGCAAAAAAAGGGCGAAGAGAGTTTATTC
GAGTATGATTTAGTTAAGGACAGACATTATACAATGGACAAATTTCAATTTCATG
TGCCCATTACTATGAACTTTAAGTGTAGTGCAGGGTCTAAGGTTAATGATATGGT
AAACGCACATATTAGAGAAGCTAAAGATATGCACGTCATCGGTATTGATCGCGGA
GAAAGAAATTTACTTTACATTTGCGTTATCGATTCTAGGGGCACCATCTTGGATC
AAATCTCTTTGAACACTATAAATGATATTGACTATCATGATCTACTAGAGAGTCG
GGATAAAGACAGGCAACAAGAAAGAAGAAATTGGCAAACAATTGAAGGTATTAAA
GAATTAAAGCAAGGCTATCTAAGCCAGGCTGTACACAGAATTGCCGAATTAATGG
TAGCATATAAAGCTGTCGTAGCTCTAGAAGACTTGAACATGGGTTTCAAAAGAGG
GCGCCAGAAGGTCGAAAGTAGTGTTTATCAACAATTTGAAAAACAGTTAATAGAT
AAGTTGAATTATCTAGTGGATAAAAAAAAGCGTCCTGAGGACATTGGCGGTTTAT
TAAGAGCCTACCAATTCACTGCGCCATTTAAATCGTTCAAAGAAATGGGTAAACA
AAACGGTTTTCTATTCTACATCCCCGCATGGAATACCTCAAATATAGATCCAACT
ACCGGTTTCGTCAACTTATTTCATGCTCAATATGAGAATGTGGACAAAGCAAAAT
CATTCTTTCAAAAATTTGATAGCATTAGCTACAATCCTAAAAAAGATTGGTTTGA
ATTTGCGTTCGATTATAAAAATTTCACCAAGAAGGCTGAAGGTTCCAGATCTATG
TGGATATTGTGCACCCACGGAAGTAGAATTAAGAACTTCCGTAATTCACAGAAAA
ACGGCCAGTGGGACAGCGAAGAATTCGCCCTAACCGAAGCTTTCAAAAGTCTTTT
CGTAAGATACGAGATAGACTATACAGCTGATCTAAAGACAGCTATTGTGGATGAG
AAGCAAAAAGACTTCTTTGTCGACCTTCTTAAGTTGTTCAAGTTAACTGTGCAGA
TGAGAAATAGTTGGAAGGAAAAAGACCTAGATTACTTGATTAGCCCAGTCGCTGG
TGCAGATGGCAGATTTTTTGATACACGTGAAGGCAATAAATCACTACCAAAAGAC
GCGGACGCTAATGGCGCATACAACATCGCATTGAAGGGTTTGTGGGCTCTCAGGC
AGATTAGGCAGACAAGTGAGGGTGGTAAGCTTAAGCTGGCGATTTCTAATAAGGA
ATGGTTACAGTTTGTTCAAGAAAGATCCTACGAAAAAGATTAA
SEQ ATGAACAATGGTACTAATAATTTTCAAAACTTCATAGGGATTTCTAGCCTTCAAA
ID AGACATTGAGAAATGCTTTAATTCCAACAGAAACGACTCAACAATTCATAGTGAA
NO: AAATGGTATTATAAAAGAAGACGAGTTGCGTGGCGAGAATAGACAAATTTTGAAA
133 GATATCATGGATGACTACTACAGAGGGTTCATCTCCGAAACATTGTCTTCTATTG
ACGACATTGACTGGACCAGCTTATTCGAAAAAATGGAAATACAGCTGAAGAACGG
AGATAACAAGGACACTCTTATAAAGGAGCAAACGGAATATAGAAAGGCTATACAC
AAAAAGTTTGCTAATGACGATAGATTTAAAAACATGTTTAGTGCGAAGTTAATTT
CTGATATTCTACCCGAGTTTGTCATTCATAATAATAACTACTCTGCATCTGAAAA
AGAGGAGAAGACCCAGGTTATAAAGTTGTTTTCAAGATTTGCCACATCATTTAAA
GACTACTTCAAGAACAGGGCGAATTGCTTCTCTGCTGATGATATTAGCTCTTCCA
GCTGTCATAGAATTGTTAACGATAATGCCGAAATTTTTTTTAGTAATGCCTTGGT
ATATAGACGCATAGTCAAGTCACTAAGCAATGATGATATAAACAAGATTAGTGGT
GATATGAAAGATAGCCTTAAAGAAATGAGCCTTGAAGAGATATATTCATATGAGA
AGTACGGTGAATTTATAACTCAAGAAGGAATTTCTTTTTATAACGATATTTGTGG
TAAGGTTAATTCTTTTATGAATTTGTATTGCCAGAAGAACAAGGAAAATAAGAAT
CTATATAAACTACAAAAGTTGCATAAACAGATTTTGTGTATAGCTGATACATCCT
ACGAAGTTCCGTATAAATTTGAATCTGATGAGGAAGTTTATCAATCGGTAAACGG
TTTTCTTGACAACATTTCCAGCAAACATATCGTTGAGAGACTACGTAAAATTGGA
GACAACTATAATGGTTACAATCTAGATAAAATATACATAGTGTCCAAGTTTTATG
AGTCTGTCTCTCAAAAGACATATCGTGATTGGGAGACCATTAATACTGCACTTGA
AATTCATTATAACAACATATTGCCTGGTAACGGGAAGAGTAAAGCTGATAAGGTT
AAAAAGGCCGTCAAAAACGACTTGCAAAAGTCTATTACCGAGATAAATGAATTAG
TGTCAAACTACAAACTATGCTCAGATGATAATATTAAAGCGGAAACATACATCCA
CGAAATTTCCCACATACTGAATAACTTTGAAGCTCAGGAGCTTAAATATAACCCG
GAAATACACTTGGTTGAGAGCGAGTTAAAAGCATCTGAGTTGAAAAATGTATTAG
ACGTCATCATGAATGCGTTTCATTGGTGTTCAGTTTTCATGACTGAAGAATTAGT
CGACAAAGATAACAATTTTTATGCCGAATTAGAGGAAATATATGATGAAATTTAT
CCCGTAATTAGTTTATACAATCTAGTTAGAAATTATGTTACACAAAAGCCGTATA
GTACCAAGAAAATAAAGCTTAATTTCGGAATACCTACGCTTGCTGATGGTTGGTC
AAAAAGTAAAGAATATAGCAATAATGCAATAATTTTAATGAGAGATAACCTATAT
TATTTGGGTATTTTTAACGCTAAGAACAAACCAGACAAGAAAATAATTGAAGGTA
ATACATCTGAAAACAAGGGCGACTATAAAAAGATGATATACAATTTGCTCCCAGG
TCCTAATAAAATGATTCCTAAGGTTTTCCTGAGTAGCAAGACTGGCGTTGAAACT
TACAAGCCTAGTGCGTATATCCTGGAGGGTTATAAACAGAACAAGCATATCAAAT
CCTCTAAGGACTTCGATATCACCTTTTGCCATGACTTAATCGATTATTTTAAAAA
TTGTATCGCAATTCATCCAGAATGGAAAAATTTCGGATTTGATTTTAGTGATACC
AGCACTTACGAGGATATCTCTGGGTTCTACAGAGAAGTGGAGTTGCAGGGCTACA
AAATCGATTGGACTTACATATCTGAAAAGGACATAGATTTGCTGCAGGAGAAAGG
TCAGCTATATTTGTTTCAAATCTACAACAAAGACTTTTCTAAAAAGTCTACCGGT
AATGACAATCTGCACACAATGTACTTGAAGAACTTATTCTCCGAGGAGAACTTAA
AGGACATTGTACTCAAGTTGAATGGAGAAGCCGAGATTTTTTTTAGAAAGAGCAG
TATAAAGAATCCTATAATCCACAAGAAGGGCTCAATTCTCGTGAATAGGACGTAT
GAGGCAGAAGAAAAGGACCAATTTGGGAATATACAAATTGTAAGAAAAAACATCC
CAGAAAATATCTACCAGGAATTATATAAGTATTTTAATGACAAATCTGATAAGGA
ACTGTCTGACGAAGCCGCTAAGCTCAAGAATGTTGTGGGCCACCATGAAGCTGCT
ACTAATATAGTGAAGGACTACAGATATACCTACGATAAATATTTCCTGCATATGC
CAATTACTATAAACTTCAAAGCAAATAAAACAGGTTTTATAAATGATAGAATCCT
GCAGTATATTGCTAAAGAAAAGGATTTACATGTAATTGGGATTGATAGAGGTGAA
CGCAATCTGATCTATGTCAGCGTAATAGATACTTGTGGTAATATTGTGGAACAAA
AGTCCTTTAATATTGTGAACGGATATGATTACCAAATCAAGTTGAAACAACAAGA
GGGAGCACGCCAAATTGCCCGTAAGGAATGGAAAGAGATAGGTAAGATCAAGGAA
ATTAAGGAAGGTTATCTTTCATTAGTTATTCACGAAATTTCGAAGATGGTAATCA
AATACAACGCAATAATTGCTATGGAGGACCTGTCATATGGATTTAAGAAAGGTAG
ATTCAAGGTTGAGAGACAGGTATACCAGAAATTTGAAACTATGTTGATCAACAAA
TTAAATTACTTAGTCTTTAAGGACATATCAATAACGGAAAACGGCGGGCTTTTAA
AAGGGTATCAACTTACATACATACCTGATAAGTTGAAAAATGTGGGTCATCAGTG
TGGGTGCATCTTTTATGTTCCAGCCGCTTACACATCAAAAATCGATCCTACTACT
GGGTTCGTAAACATATTTAAATTTAAAGATCTAACCGTTGATGCAAAAAGAGAGT
TTATCAAGAAATTTGATAGCATTAGGTACGATTCAGAAAAAAATCTATTCTGTTT
TACTTTTGACTACAACAACTTTATAACGCAGAATACAGTGATGTCAAAATCGTCC
TGGTCAGTGTATACTTATGGTGTTAGAATTAAGAGACGTTTCGTAAACGGTCGTT
TTTCTAACGAGTCCGATACAATCGACATCACTAAAGATATGGAAAAAACTTTGGA
AATGACAGATATAAACTGGAGAGATGGTCACGACCTTAGACAAGATATAATCGAT
TATGAAATCGTACAGCATATTTTTGAAATTTTTCGCTTAACAGTTCAGATGCGTA
ACTCTCTTAGTGAGCTAGAAGATAGAGATTATGATAGACTTATCTCGCCTGTTCT
TAACGAAAATAATATCTTCTATGACTCGGCAAAAGCCGGTGATGCACTTCCAAAA
GATGCTGATGCAAATGGCGCGTACTGCATCGCATTGAAGGGGCTCTACGAGATTA
AACAAATCACCGAAAACTGGAAAGAAGATGGTAAATTTTCTAGGGATAAGTTGAA
AATCAGTAATAAAGATTGGTTCGATTTTATACAAAATAAGCGATACTTATAG
SEQ ATGACCAATAAGTTTACTAATCAATACTCATTGTCTAAAACGTTAAGATTCGAGT
ID TAATTCCCCAGGGAAAGACACTAGAATTTATTCAAGAAAAAGGTCTTCTCTCTCA
NO: GGATAAACAAAGAGCAGAATCATACCAGGAGATGAAAAAAACCATAGATAAATTT
134 CATAAGTACTTCATCGACTTGGCACTATCGAACGCCAAGCTAACACATTTGGAAA
CCTACCTGGAGTTGTATAATAAATCGGCAGAGACGAAAAAGGAACAAAAATTCAA
GGATGACCTGAAGAAGGTTCAAGATAATCTGCGAAAGGAAATAGTGAAGTCGTTT
AGTGATGGTGATGCAAAGTCAATCTTTGCTATTTTAGACAAGAAGGAATTAATAA
CCGTGGAACTTGAAAAGTGGTTTGAAAATAACGAACAGAAAGATATTTACTTCGA
CGAAAAATTTAAAACGTTTACTACGTACTTTACAGGGTTCCATCAGAACCGCAAA
AACATGTACTCCGTTGAACCAAACTCTACTGCAATCGCCTACAGATTAATACACG
AAAATTTGCCTAAGTTTTTAGAAAATGCAAAGGCTTTTGAAAAGATAAAGCAAGT
CGAATCGTTACAGGTAAACTTTCGCGAATTAATGGGCGAATTTGGAGATGAAGGT
CTTATTTTTGTCAATGAATTAGAGGAAATGTTTCAAATTAATTATTATAACGATG
TCTTGAGTCAGAACGGCATTACTATCTACAACTCAATTATCAGTGGTTTCACTAA
GAATGATATAAAATATAAAGGTTTGAATGAATACATTAATAATTATAATCAAACT
AAAGATAAGAAGGACAGGCTTCCGAAATTGAAGCAATTGTACAAGCAGATTCTAA
GTGATAGGATTAGTTTGTCTTTCTTGCCAGACGCATTTACTGATGGCAAGCAAGT
CTTAAAGGCTATATTCGATTTCTACAAGATTAACCTACTTTCGTACACAATTGAA
GGTCAAGAAGAATCTCAAAATCTGCTGCTTTTGATTAGGCAAACTATAGAAAATT
TGTCGTCCTTTGACACTCAAAAAATTTACCTGAAGAATGATACACACCTGACTAC
AATATCACAGCAGGTCTTTGGGGATTTTTCTGTCTTCTCCACGGCCCTAAACTAT
TGGTATGAGACAAAAGTTAATCCAAAATTTGAAACAGAATATAGTAAGGCGAATG
AAAAAAAGAGAGAAATTTTGGATAAAGCGAAGGCAGTATTCACAAAACAAGACTA
TTTTTCTATCGCATTTCTCCAAGAAGTCTTATCCGAATATATTTTGACACTCGAT
CACACCTCTGATATAGTTAAGAAACATTCGTCCAACTGCATCGCAGATTACTTCA
AGAATCACTTCGTGGCTAAGAAAGAAAACGAAACGGATAAAACTTTTGACTTCAT
TGCTAACATAACCGCTAAATACCAATGTATTCAGGGCATATTAGAAAATGCAGAC
CAGTACGAAGACGAGTTAAAACAGGACCAAAAGTTAATAGATAATCTAAAGTTTT
TCTTAGATGCTATACTTGAGTTATTACATTTTATAAAGCCATTGCATCTAAAATC
GGAAAGTATTACTGAAAAAGACACTGCGTTCTATGATGTGTTCGAAAATTATTAT
GAGGCTTTATCTTTATTGACCCCCCTTTACAACATGGTCCGCAATTATGTTACTC
AGAAGCCTTACTCTACTGAAAAGATCAAATTAAACTTTGAAAATGCTCAGTTGCT
GAATGGTTGGGATGCCAATAAGGAAGGTGACTACCTGACGACTATTCTAAAAAAA
GACGGTAATTATTTCTTAGCAATCATGGATAAAAAACATAACAAGGCATTTCAAA
AATTTCCAGAAGGAAAAGAAAACTATGAAAAGATGGTTTATAAATTGTTGCCTGG
AGTTAATAAAATGTTGCCAAAAGTTTTTTTTAGCAATAAGAACATAGCTTACTTT
AATCCATCTAAGGAACTGCTCGAGAACTACAAGAAGGAAACACATAAAAAAGGTG
ATACATTTAATTTGGAACATTGCCATACTCTGATTGATTTTTTTAAGGACTCTCT
TAATAAACATGAAGACTGGAAATATTTTGATTTTCAATTTTCGGAAACTAAATCA
TACCAAGATCTAAGTGGATTTTACAGAGAAGTTGAACACCAAGGTTATAAGATTA
ACTTCAAGAATATAGATTCTGAATACATTGATGGTCTTGTAAACGAGGGTAAACT
ATTCCTGTTCCAAATCTACTCTAAGGACTTCTCACCTTTTTCCAAAGGAAAACCT
AATATGCATACGTTGTACTGGAAGGCTCTATTTGAAGAACAAAATTTGCAAAATG
TAATCTACAAACTGAACGGCCAAGCTGAAATATTCTTCAGAAAAGCCTCAATTAA
GCCAAAAAACATTATTCTTCATAAAAAGAAGATCAAGATTGCGAAGAAACATTTT
ATTGATAAGAAGACCAAGACTTCCGAAATTGTACCAGTACAAACAATCAAGAATC
TCAATATGTATTATCAAGGCAAGATAAGTGAGAAAGAGTTAACCCAGGATGATTT
ACGTTATATAGACAATTTCTCTATATTCAACGAGAAGAACAAAACAATAGACATT
ATCAAAGATAAAAGGTTTACTGTTGACAAATTTCAATTTCATGTGCCTATCACAA
TGAACTTTAAGGCCACAGGTGGTTCGTACATTAATCAAACTGTTTTAGAATATCT
GCAAAATAACCCAGAGGTCAAGATCATCGGTCTTGATAGGGGTGAGAGACATCTG
GTGTATCTAACACTCATTGATCAACAAGGCAACATCTTGAAGCAAGAATCATTGA
ACACTATCACAGACTCCAAGATCTCGACTCCATATCACAAACTCCTTGACAATAA
AGAAAACGAAAGGGATCTTGCCAGAAAAAATTGGGGTACAGTTGAAAATATTAAG
GAACTAAAAGAAGGTTACATTTCGCAAGTAGTTCACAAGATTGCAACACTCATGT
TGGAAGAAAACGCAATCGTTGTCATGGAAGATTTAAATTTCGGATTTAAGAGAGG
AAGATTTAAAGTAGAAAAGCAAATCTACCAGAAGTTGGAGAAGATGTTAATTGAC
AAATTGAACTACTTAGTGCTGAAAGACAAACAGCCTCAAGAATTGGGCGGTCTAT
ACAACGCTTTACAACTGACAAATAAATTTGAGTCATTCCAAAAGATGGGTAAGCA
GAGTGGTTTTTTGTTTTATGTTCCGGCATGGAACACATCCAAAATCGATCCAACT
ACAGGCTTCGTGAATTATTTCTACACTAAATATGAAAATGTGGATAAAGCAAAAG
CTTTCTTTGAGAAGTTCGAGGCGATCCGTTTTAACGCTGAAAAGAAGTACTTCGA
GTTCGAGGTCAAAAAGTATTCAGATTTTAACCCCAAGGCTGAAGGCACCCAGCAA
GCATGGACTATTTGCACGTACGGTGAGCGAATCGAAACTAAAAGGCAAAAGGATC
AAAATAATAAGTTTGTAAGCACACCCATTAACTTGACAGAAAAGATAGAAGATTT
TCTTGGAAAAAACCAAATTGTATATGGTGACGGTAACTGTATCAAGTCACAAATT
GCTTCTAAAGACGATAAGGCCTTCTTCGAAACTCTGCTATACTGGTTTAAAATGA
CGTTGCAAATGAGAAACAGTGAAACTAGAACTGATATCGACTATTTAATATCACC
CGTGATGAACGATAATGGTACCTTTTACAATTCAAGAGATTACGAGAAATTGGAG
AACCCCACACTACCAAAAGACGCAGACGCTAATGGTGCCTACCATATTGCTAAAA
AGGGACTGATGTTGTTGAACAAGATAGATCAAGCCGACTTAACTAAAAAAGTTGA
TTTGTCAATTTCGAATAGAGATTGGTTGCAATTCGTCCAGAAAAATAAGTAA
SEQ ATGGAACAGGAATACTACTTGGGTTTGGATATGGGAACTGGTTCAGTCGGTTGGG
ID CTGTTACGGACTCCGAGTACCACGTGTTGAGAAAACACGGAAAGGCTTTATGGGG
NO: TGTCAGACTATTCGAATCAGCATCGACCGCGGAAGAGAGAAGAATGTTTAGAACT
135 TCAAGAAGAAGGCTGGATCGTAGGAATTGGCGGATAGAAATTTTACAAGAAATAT
TCGCCGAAGAAATCTCTAAAAAAGATCCAGGATTTTTTCTACGTATGAAGGAATC
CAAATACTATCCGGAAGATAAACGTGATATTAATGGCAATTGTCCAGAGTTACCC
TATGCTTTATTTGTGGACGACGATTTCACCGATAAAGATTACCATAAGAAGTTCC
CAACAATTTACCATCTGAGAAAGATGTTAATGAACACTGAAGAAACCCCGGATAT
AAGACTGGTCTATCTAGCCATTCATCATATGATGAAACACAGGGGACACTTCTTG
CTATCAGGGGATATAAATGAAATTAAAGAATTTGGTACAACATTTTCTAAATTAT
TGGAAAATATTAAAAACGAAGAATTAGATTGGAATTTAGAATTAGGCAAGGAGGA
ATACGCAGTTGTCGAATCGATTCTGAAAGATAACATGTTGAACAGATCAACGAAA
AAAACAAGGCTGATCAAGGCTTTAAAAGCGAAATCAATATGCGAAAAAGCAGTAT
TGAATTTGTTAGCTGGGGGGACTGTCAAGTTGTCTGATATTTTCGGATTGGAAGA
ATTGAATGAAACAGAGAGACCGAAGATATCCTTCGCCGATAATGGCTACGATGAT
TATATAGGCGAAGTCGAAAATGAGCTGGGCGAACAATTCTACATTATCGAGACTG
CCAAGGCTGTTTATGATTGGGCGGTGTTAGTCGAAATCCTTGGCAAATACACTTC
CATCTCCGAAGCTAAGGTGGCAACCTACGAAAAGCATAAAAGTGATTTGCAATTC
CTTAAGAAAATTGTCCGAAAGTACTTGACCAAAGAAGAGTACAAGGATATTTTCG
TATCAACATCGGACAAACTGAAGAATTATTCAGCTTATATTGGCATGACGAAAAT
TAATGGTAAGAAAGTTGATTTGCAATCCAAGAGATGTTCTAAAGAAGAATTTTAC
GATTTCATTAAAAAAAATGTCCTAAAAAAGTTGGAGGGACAACCTGAATATGAGT
ATTTAAAGGAAGAACTGGAAAGAGAAACTTTCCTACCAAAGCAAGTTAATCGTGA
TAATGGCGTTATTCCATACCAAATACACTTGTACGAATTAAAGAAGATCTTGGGT
AACTTGAGGGACAAAATTGATTTAATCAAGGAAAATGAAGACAAACTGGTACAAT
TATTTGAATTTAGAATACCTTACTACGTGGGCCCTTTAAACAAAATAGACGATGG
TAAGGAAGGGAAGTTCACATGGGCAGTCAGAAAGTCCAATGAAAAAATTTACCCA
TGGAATTTCGAAAACGTTGTAGATATTGAAGCTTCTGCTGAGAAATTTATTAGGA
GAATGACAAATAAATGCACTTATCTTATGGGGGAAGACGTGTTGCCTAAAGATAG
TTTATTATATTCAAAGTATATGGTCTTAAATGAATTAAACAATGTTAAATTAGAT
GGTGAAAAACTTTCCGTCGAATTGAAACAAAGATTGTATACAGATGTATTCTGCA
AATATAGAAAAGTAACTGTAAAGAAGATTAAAAACTACCTTAAATGTGAAGGCAT
TATCAGCGGAAATGTTGAGATCACTGGTATCGATGGTGATTTTAAGGCATCTTTA
ACCGCATATCACGACTTTAAGGAAATATTGACGGGTACTGAGCTTGCTAAAAAAG
ACAAAGAGAACATTATCACCAATATCGTGCTCTTCGGAGACGACAAGAAATTATT
GAAAAAGAGATTGAACCGCCTATACCCTCAGATTACCCCTAACCAATTGAAGAAA
ATCTGCGCTCTGTCTTATACTGGATGGGGTCGTTTTAGCAAGAAGTTTCTAGAAG
AAATTACTGCTCCGGATCCTGAAACTGGGGAAGTCTGGAATATAATTACCGCGCT
ATGGGAATCGAATAATAATTTAATGCAATTACTATCTAATGAATACAGATTTATG
GAAGAAGTCGAAACTTACAATATGGGAAAACAAACAAAAACTTTGAGCTACGAAA
CAGTAGAGAATATGTATGTCTCACCATCTGTAAAGCGGCAGATCTGGCAAACCTT
GAAGATAGTTAAAGAATTAGAAAAAGTGATGAAGGAAAGTCCAAAAAGGGTTTTT
ATTGAAATGGCCCGAGAAAAACAAGAATCTAAAAGGACGGAAAGTAGGAAAAAGC
AACTTATAGATCTATATAAAGCCTGCAAAAATGAAGAAAAAGATTGGGTAAAGGA
ATTAGGTGACCAGGAAGAGCAAAAATTGAGATCTGACAAGCTGTACTTGTATTAT
ACGCAAAAGGGCCGGTGTATGTATTCGGGTGAGGTAATAGAATTGAAAGATTTAT
GGGATAACACTAAGTATGACATTGACCATATTTACCCCCAGTCTAAGACAATGGA
CGATTCATTAAATAACCGAGTTCTTGTCAAAAAGAAGTACAATGCCACAAAGAGC
GATAAGTACCCATTGAACGAAAATATAAGACATGAACGAAAAGGTTTCTGGAAAT
CATTGTTGGACGGTGGATTTATTTCCAAAGAAAAATACGAGAGATTGATTAGAAA
CACTGAACTATCTCCAGAGGAGTTAGCTGGCTTTATCGAAAGACAAATTGTTGAA
ACTAGACAGTCTACAAAAGCAGTTGCAGAAATCTTAAAACAAGTATTTCCAGAAT
CCGAAATTGTGTACGTCAAAGCCGGAACAGTAAGTAGATTTAGAAAAGACTTTGA
ATTATTGAAAGTACGAGAGGTTAACGACCTACATCATGCTAAGGATGCTTATTTA
AATATAGTCGTTGGTAATTCGTATTACGTGAAATTCACAAAAAACGCATCTTGGT
TCATCAAGGAGAATCCTGGTAGGACATACAACTTGAAAAAGATGTTTACATCAGG
ATGGAATATCGAAAGAAATGGTGAGGTTGCGTGGGAGGTAGGCAAGAAGGGAACC
ATTGTTACTGTAAAGCAAATTATGAATAAAAACAATATACTTGTTACGAGACAGG
TGCACGAAGCCAAAGGAGGGTTGTTTGACCAGCAAATCATGAAGAAAGGTAAAGG
TCAGATAGCAATAAAAGAGACTGATGAGCGTTTAGCTAGTATAGAAAAATATGGG
GGCTACAATAAGGCAGCTGGTGCTTACTTCATGTTGGTCGAATCAAAGGATAAAA
AAGGGAAGACGATCCGGACCATAGAGTTTATCCCTCTGTACTTGAAGAATAAGAT
TGAGTCTGACGAAAGCATCGCATTGAATTTCTTGGAAAAGGGGCGCGGTCTAAAG
GAGCCAAAAATATTGTTAAAGAAAATTAAAATAGACACCCTATTCGACGTCGATG
GGTTTAAGATGTGGCTTAGTGGTCGTACTGGGGACAGATTATTATTCAAGTGTGC
CAATCAGTTAATCCTTGACGAGAAAATCATTGTTACAATGAAAAAAATTGTTAAG
TTTATTCAAAGGCGACAAGAAAATAGAGAACTAAAGTTGAGTGATAAGGATGGAA
TCGATAATGAAGTGTTAATGGAGATTTATAACACTTTTGTCGACAAATTGGAGAA
TACGGTGTACAGAATTAGGCTATCTGAACAGGCTAAAACCCTAATTGATAAACAG
AAGGAGTTTGAGCGACTTTCTCTTGAAGACAAATCTTCAACTCTTTTCGAGATCC
TACATATCTTTCAGTGTCAATCTTCTGCAGCTAATTTGAAAATGATTGGAGGTCC
TGGTAAGGCTGGTATATTAGTCATGAACAACAACATATCTAAGTGTAATAAGATT
AGTATAATTAACCAATCACCGACAGGTATCTTTGAAAATGAAATTGATTTACTTA
AA
SEQ ATGAAATCATTCGACTCGTTCACCAACTTGTACTCCCTGTCTAAAACATTGAAAT
ID TTGAAATGCGACCTGTTGGTAACACCCAAAAGATGTTAGATAATGCAGGAGTTTT
NO: CGAAAAGGATAAACTGATCCAGAAAAAATACGGTAAAACGAAACCATATTTCGAT
136 AGGTTGCATCGGGAATTTATAGAAGAAGCTTTGACTGGTGTAGAATTAATTGGCT
TAGATGAGAATTTCCGTACTCTAGTCGATTGGCAAAAAGATAAAAAGAACAATGT
TGCCATGAAGGCATACGAAAATAGTCTACAAAGACTAAGAACAGAGATCGGGAAA
ATTTTCAATTTGAAGGCAGAAGACTGGGTGAAGAACAAATATCCAATATTGGGTC
TTAAGAATAAGAATACTGATATATTGTTCGAGGAGGCCGTTTTCGGTATTCTTAA
GGCAAGATATGGTGAAGAGAAAGACACGTTTATTGAAGTTGAGGAGATTGATAAA
ACCGGTAAGTCCAAAATCAACCAGATCTCTATCTTCGACAGTTGGAAGGGCTTCA
CTGGTTATTTTAAGAAGTTCTTCGAAACTAGGAAGAACTTCTATAAAAACGATGG
TACTTCCACGGCTATTGCTACAAGAATTATCGACCAAAACCTTAAGCGTTTTATT
GATAACCTATCAATTGTTGAAAGTGTTCGACAGAAAGTAGATTTGGCTGAAACTG
AAAAATCTTTTAGTATCTCCTTATCCCAGTTTTTCTCTATAGATTTTTATAATAA
ATGTTTGCTGCAAGATGGCATTGACTACTATAATAAAATAATTGGTGGAGAGACA
TTGAAAAACGGAGAGAAGCTGATTGGCCTTAATGAGTTGATAAATCAATATAGAC
AAAATAATAAGGACCAGAAAATCCCTTTCTTTAAATTGCTAGACAAACAGATTTT
GTCTGAAAAGATCCTATTCTTGGATGAAATAAAGAACGATACTGAATTGATTGAA
GCTTTGTCCCAGTTTGCTAAAACAGCTGAAGAAAAGACAAAGATTGTGAAAAAAT
TGTTTGCTGATTTCGTAGAAAACAATTCTAAATATGATCTAGCCCAGATTTATAT
AAGTCAAGAAGCTTTCAATACAATAAGTAATAAGTGGACAAGTGAAACAGAAACT
TTTGCTAAGTATTTATTCGAAGCCATGAAGTCTGGTAAACTTGCCAAATACGAAA
AAAAAGATAACAGTTATAAATTTCCAGACTTTATAGCCCTTTCACAGATGAAGTC
TGCCTTATTGTCGATATCCTTAGAAGGTCATTTTTGGAAGGAAAAATATTATAAG
ATAAGCAAGTTCCAAGAAAAGACTAATTGGGAACAATTTTTGGCTATATTTCTAT
ATGAGTTCAATTCATTATTTTCCGATAAAATCAACACTAAGGATGGAGAGACTAA
GCAAGTTGGCTACTATTTGTTCGCAAAAGATCTGCACAATTTGATTCTATCAGAA
CAAATAGATATACCAAAAGATTCAAAGGTAACTATAAAGGATTTCGCAGATTCCG
TCCTCACCATTTATCAAATGGCTAAATATTTTGCCGTTGAAAAAAAGAGAGCGTG
GTTAGCAGAATACGAGTTGGACTCGTTTTATACTCAGCCAGATACTGGATACTTG
CAATTCTACGATAATGCATACGAAGACATTGTACAGGTATACAATAAACTTAGAA
ATTACTTAACCAAGAAGCCCTACAGTGAAGAAAAATGGAAGCTGAACTTTGAAAA
TTCGACTTTGGCAAATGGTTGGGATAAAAATAAAGAAAGTGACAACTCCGCAGTG
ATTTTGCAAAAGGGTGGGAAATATTACTTGGGTTTAATCACAAAAGGCCACAATA
AGATTTTTGATGATAGATTTCAAGAAAAATTCATAGTTGGTATAGAAGGTGGCAA
ATACGAGAAAATTGTCTATAAATTCTTCCCTGATCAAGCCAAAATGTTCCCAAAA
GTTTGCTTTTCTGCTAAAGGATTGGAGTTTTTCCGGCCTAGCGAGGAGATCCTTC
GTATCTACAACAATGCTGAATTCAAAAAAGGAGAAACCTATAGCATAGATTCTAT
GCAAAAACTGATAGATTTTTATAAGGATTGTTTAACAAAGTACGAAGGCTGGGCC
TGCTATACATTTAGACATTTAAAGCCCACAGAAGAATACCAAAATAACATTGGTG
AATTCTTTCGGGACGTTGCCGAAGACGGCTATAGGATCGATTTTCAAGGTATCTC
AGATCAATATATCCACGAAAAGAACGAGAAGGGTGAGCTGCACCTTTTCGAAATT
CATAATAAGGACTGGAATTTGGATAAGGCGAGAGATGGTAAATCGAAGACCACTC
AAAAGAACTTGCATACTTTATATTTTGAGTCCTTGTTTTCTAATGATAACGTCGT
CCAAAATTTTCCAATAAAGTTGAATGGACAAGCGGAAATTTTCTATCGGCCTAAG
ACAGAGAAAGACAAATTAGAATCAAAGAAAGATAAAAAGGGAAATAAAGTCATTG
ATCACAAACGATACTCTGAGAATAAAATATTTTTCCACGTACCATTGACACTCAA
CAGGACTAAGAATGACTCTTATAGATTTAATGCTCAGATTAATAATTTTTTGGCA
AATAACAAGGATATTAACATAATTGGGGTGGATAGAGGTGAAAAGCACTTGGTAT
ATTACTCTGTCATCACTCAGGCTTCTGATATATTGGAAAGCGGGTCTCTAAATGA
ATTGAACGGTGTTAACTACGCCGAAAAGCTAGGTAAAAAAGCTGAAAACAGAGAG
CAGGCTCGGCGCGATTGGCAAGATGTTCAAGGAATTAAAGACCTTAAAAAAGGCT
ACATTAGTCAAGTAGTTAGAAAGTTAGCCGATCTTGCTATTAAACATAACGCAAT
CATTATTCTGGAGGACCTAAATATGCGTTTTAAGCAAGTTAGGGGTGGCATAGAA
AAAAGTATTTATCAGCAGCTTGAGAAGGCTTTGATAGATAAGTTATCGTTCCTAG
TTGACAAAGGTGAAAAAAATCCTGAACAAGCTGGTCATCTGTTGAAAGCTTATCA
GCTGAGCGCACCTTTTGAAACATTTCAAAAAATGGGAAAACAAACAGGTATTATT
TTCTATACTCAAGCGAGTTATACAAGTAAATCTGACCCAGTGACAGGATGGAGAC
CACACCTTTATCTAAAATATTTTTCTGCTAAAAAGGCCAAAGATGACATCGCTAA
GTTTACAAAAATAGAATTTGTCAACGATAGATTTGAATTGACTTACGATATTAAA
GATTTTCAGCAAGCAAAAGAATACCCAAATAAGACAGTGTGGAAAGTATGCTCCA
ATGTGGAGAGATTTAGATGGGATAAAAATCTCAATCAAAACAAGGGTGGTTACAC
ACATTATACTAATATAACTGAAAATATTCAAGAATTGTTTACTAAGTACGGAATT
GACATAACCAAAGACTTACTAACTCAGATTTCAACTATTGACGAAAAACAAAATA
CCTCATTTTTCCGCGACTTTATTTTTTATTTCAACTTGATCTGTCAAATTCGTAA
CACGGATGATTCCGAAATTGCCAAGAAGAACGGAAAAGATGATTTCATCCTATCT
CCAGTGGAACCATTTTTTGACTCAAGAAAAGATAATGGTAATAAGTTGCCTGAGA
ACGGAGATGATAACGGCGCTTATAATATCGCTCGGAAGGGTATTGTAATTCTTAA
TAAAATATCTCAGTACTCTGAAAAGAACGAAAACTGCGAGAAAATGAAGTGGGGC
GACTTGTATGTATCTAATATAGATTGGGATAATTTCGTTACTCAAGCCAACGCGA
GACATTGA
SEQ ATGGAAAATTTTAAAAACCTATATCCAATTAATAAGACACTTAGATTCGAGCTTA
ID GGCCATACGGCAAAACACTAGAAAATTTTAAGAAGTCAGGCCTATTAGAAAAAGA
NO: CGCCTTTAAGGCAAATTCCAGAAGATCAATGCAGGCAATTATTGATGAGAAATTT
137 AAAGAGACTATCGAGGAAAGGTTGAAATACACTGAATTCTCTGAGTGCGATCTGG
GAAACATGACTTCCAAGGATAAAAAGATTACCGATAAGGCTGCTACCAACCTCAA
AAAGCAAGTCATCTTATCGTTTGATGATGAAATTTTTAATAACTACTTAAAGCCG
GACAAAAACATTGACGCCCTATTCAAAAATGATCCGTCCAACCCCGTAATTTCAA
CTTTTAAGGGTTTTACCACGTACTTTGTAAATTTTTTTGAGATTCGTAAACATAT
CTTCAAAGGAGAATCGTCGGGTTCCATGGCCTATAGGATAATTGATGAAAATCTT
ACGACTTACTTAAACAATATCGAAAAGATAAAAAAGTTACCAGAAGAATTAAAGT
CTCAATTGGAAGGTATTGACCAAATAGACAAATTAAATAACTATAATGAGTTCAT
AACTCAAAGCGGTATCACACATTACAATGAAATTATCGGTGGTATATCTAAAAGT
GAGAACGTAAAAATACAGGGAATAAACGAGGGGATCAATCTATACTGTCAGAAGA
ATAAAGTAAAATTACCAAGACTAACGCCATTATACAAAATGATTCTGTCTGATAG
AGTTTCCAACTCGTTCGTGCTTGATACTATAGAAAATGATACTGAATTAATTGAG
ATGATTAGCGACTTGATTAATAAAACAGAAATATCTCAAGACGTAATAATGTCAG
ACATTCAGAACATTTTCATAAAATATAAACAGCTTGGTAATTTACCGGGGATAAG
TTACTCTAGCATCGTGAATGCTATTTGCTCCGATTATGACAATAATTTTGGTGAC
GGAAAAAGAAAAAAATCATATGAGAACGATAGGAAGAAACACCTTGAAACAAACG
TATACTCAATTAACTATATATCGGAACTGTTAACAGACACCGATGTATCATCTAA
TATAAAAATGAGATATAAGGAACTTGAACAAAATTACCAGGTGTGTAAGGAGAAT
TTCAATGCTACCAACTGGATGAACATTAAGAATATTAAACAGAGTGAAAAGACAA
ACTTGATTAAAGATCTACTAGATATACTGAAATCAATACAGAGATTCTACGATCT
GTTTGATATAGTTGATGAAGACAAAAATCCTAGTGCTGAGTTTTACACGTGGCTA
AGTAAAAATGCGGAAAAGTTAGATTTCGAGTTCAACTCTGTTTATAATAAATCTA
GGAATTATTTAACTAGAAAGCAGTATTCTGATAAAAAGATAAAATTGAACTTCGA
CTCCCCTACGTTGGCAAAGGGTTGGGATGCAAACAAAGAAATCGATAACTCCACC
ATAATAATGCGTAAGTTTAACAATGATAGGGGGGATTACGATTATTTTTTGGGAA
TTTGGAACAAATCTACCCCAGCGAATGAAAAAATTATTCCCCTTGAAGACAATGG
TCTTTTTGAAAAAATGCAGTATAAATTATATCCAGACCCATCCAAGATGCTTCCA
AAGCAATTTCTGTCAAAAATTTGGAAGGCTAAACACCCTACTACTCCTGAATTTG
ATAAGAAGTATAAGGAGGGCCGACACAAAAAGGGTCCAGATTTTGAAAAAGAATT
CCTGCATGAATTGATAGATTGTTTTAAGCATGGTTTGGTAAATCATGATGAAAAA
TATCAGGATGTCTTTGGATTCAATTTGAGAAATACAGAGGATTACAACTCATATA
CAGAATTTCTCGAGGACGTCGAACGTTGCAATTATAATCTCAGTTTCAACAAGAT
CGCAGACACTTCAAACTTAATTAACGACGGAAAATTGTACGTTTTTCAAATCTGG
TCGAAAGACTTTAGTATTGATTCAAAGGGTACAAAAAACCTAAATACAATATATT
TCGAAAGTCTATTCTCGGAAGAAAACATGATCGAAAAAATGTTCAAACTGTCAGG
CGAAGCTGAAATATTCTACCGTCCCGCAAGCCTTAATTATTGTGAGGATATCATT
AAAAAAGGACATCACCATGCAGAGTTAAAAGATAAATTCGATTACCCAATAATTA
AAGATAAAAGATACTCCCAGGATAAGTTCTTTTTCCATGTACCTATGGTTATTAA
CTACAAGTCGGAAAAACTAAACTCGAAGTCATTAAATAATAGAACTAACGAGAAC
TTGGGACAATTCACACATATAATTGGTATTGATCGTGGCGAAAGACATTTAATAT
ATCTGACTGTTGTTGATGTTTCAACAGGAGAAATTGTTGAACAGAAACATCTTGA
TGAAATTATAAACACAGATACAAAAGGCGTTGAGCATAAAACTCATTATCTAAAT
AAATTGGAGGAAAAGTCGAAGACTCGCGATAACGAGAGAAAGAGTTGGGAAGCAA
TTGAAACCATAAAAGAGCTTAAAGAAGGTTACATTAGTCACGTCATCAATGAAAT
ACAAAAGTTACAAGAAAAGTATAACGCTTTGATTGTAATGGAAAATCTAAATTAT
GGTTTTAAGAATTCAAGAATCAAAGTCGAAAAGCAGGTCTATCAGAAATTTGAAA
CGGCACTTATTAAAAAGTTTAACTACATTATTGATAAAAAGGACCCAGAAACTTA
TATTCATGGTTACCAACTGACGAACCCAATCACAACATTGGACAAAATTGGAAAC
CAAAGTGGAATTGTTTTATACATTCCAGCTTGGAATACATCCAAAATAGACCCTG
TCACGGGGTTTGTCAACTTGTTATATGCCGACGATTTAAAGTATAAAAACCAAGA
ACAAGCAAAGTCTTTTATTCAAAAGATTGATAATATTTATTTCGAAAACGGTGAA
TTTAAATTCGACATAGATTTTTCTAAATGGAACAACCGTTATTCAATAAGTAAAA
CTAAATGGACACTCACCTCATACGGCACTCGTATCCAAACCTTTCGGAATCCCCA
AAAAAATAACAAATGGGATTCTGCAGAATACGACTTGACCGAGGAATTTAAATTA
ATTCTTAATATAGACGGTACACTCAAAAGTCAAGACGTGGAGACATACAAGAAGT
TTATGTCGTTATTCAAGCTTATGCTTCAGTTGAGGAACTCCGTTACAGGCACTGA
TATTGATTACATGATTTCACCAGTAACGGATAAGACTGGGACTCATTTCGATTCT
AGGGAAAATATTAAAAATTTACCTGCTGACGCAGACGCAAACGGCGCATACAATA
TAGCAAGAAAAGGGATTATGGCCATTGAGAATATTATGAATGGCATATCAGATCC
ATTAAAGATAAGCAATGAAGACTACTTAAAATACATTCAGAATCAGCAAGAATAA
SEQ ATGACCCAGTTTGAAGGTTTCACCAATTTGTACCAAGTAAGTAAAACCTTGAGGT
ID TCGAATTGATCCCACAGGGCAAGACATTGAAGCATATTCAAGAGCAAGGATTTAT
NO: AGAAGAAGATAAAGCGAGAAACGATCACTATAAAGAGTTAAAACCCATTATTGAC
138 AGGATCTATAAAACATACGCCGATCAATGCCTTCAATTAGTGCAATTAGATTGGG
AAAACTTGAGCGCTGCCATCGATTCCTACAGGAAGGAAAAAACAGAAGAAACAAG
AAATGCCTTAATCGAGGAACAAGCAACCTATAGAAACGCTATACACGATTACTTC
ATCGGTAGAACTGATAATCTAACAGATGCAATAAATAAGAGACATGCTGAGATAT
ATAAAGGACTATTTAAAGCAGAATTATTCAACGGAAAGGTGTTGAAACAGTTAGG
TACCGTTACAACTACTGAGCATGAAAATGCCTTGCTGAGAAGCTTTGACAAGTTT
ACTACCTACTTTTCGGGTTTCTACGAAAATCGCAAAAATGTATTTTCTGCGGAAG
ATATTTCAACTGCAATCCCTCATAGGATTGTTCAAGATAATTTCCCTAAGTTTAA
AGAGAACTGTCACATTTTTACAAGGTTAATTACTGCGGTTCCAAGTCTAAGAGAA
CATTTTGAGAATGTAAAAAAAGCGATTGGTATATTTGTATCCACTAGCATTGAAG
AGGTTTTCAGCTTCCCTTTTTATAACCAATTACTTACCCAAACACAGATCGACCT
GTACAACCAATTGTTAGGTGGTATATCGAGGGAGGCTGGTACGGAAAAGATTAAA
GGATTAAATGAAGTTCTTAATTTGGCCATACAAAAAAATGATGAAACCGCGCACA
TTATCGCATCTTTACCACATAGGTTTATACCGTTATTCAAGCAAATATTATCTGA
TCGTAATACCTTATCGTTCATATTAGAGGAGTTTAAATCTGACGAAGAAGTTATA
CAATCTTTTTGCAAGTATAAGACGCTATTGAGAAACGAAAACGTTCTGGAAACAG
CCGAAGCACTGTTCAATGAATTAAACAGTATCGACTTGACTCATATTTTTATATC
GCATAAAAAGTTGGAGACAATTTCTTCAGCATTGTGCGATCACTGGGACACTTTA
AGGAACGCACTATATGAACGTAGGATCTCAGAATTGACAGGTAAGATAACGAAGT
CTGCTAAAGAGAAAGTGCAGAGATCCCTAAAACACGAGGATATAAATTTGCAGGA
GATAATTTCAGCTGCAGGTAAAGAGTTGTCTGAAGCGTTCAAGCAAAAGACTTCC
GAAATCTTGTCACACGCACACGCCGCATTAGATCAACCTTTACCCACTACTTTGA
AAAAACAAGAAGAGAAGGAGATATTAAAATCACAACTTGATTCTTTACTTGGCCT
TTATCATCTTTTAGATTGGTTCGCTGTTGACGAGAGCAATGAAGTGGATCCAGAG
TTTTCCGCAAGATTGACCGGTATAAAGTTGGAAATGGAACCTTCGTTATCATTTT
ACAACAAAGCTAGGAACTATGCTACAAAAAAACCTTATTCTGTCGAAAAATTTAA
ACTGAACTTCCAAATGCCTACTCTAGCAAGTGGCTGGGATGTTAATAAAGAAAAG
AACAATGGCGCTATTTTGTTTGTAAAAAATGGCCTATACTATCTTGGAATTATGC
CTAAACAAAAAGGTCGCTACAAGGCTTTGTCATTTGAACCTACTGAAAAGACTAG
CGAAGGTTTCGATAAGATGTATTACGATTATTTCCCGGATGCCGCTAAAATGATC
CCCAAGTGCTCTACTCAATTGAAGGCAGTAACTGCTCATTTCCAAACGCATACCA
CGCCAATACTGCTTTCTAACAACTTTATAGAACCACTAGAAATAACGAAAGAAAT
TTACGACCTAAATAACCCAGAGAAAGAACCAAAAAAGTTCCAGACGGCCTACGCC
AAAAAGACAGGGGACCAAAAAGGTTACCGCGAGGCGTTATGTAAATGGATTGATT
TTACTAGGGACTTTTTATCAAAATACACTAAAACGACGTCTATTGATCTTAGCTC
CTTACGCCCGTCCTCCCAATACAAGGATCTAGGTGAGTATTACGCAGAGTTGAAC
CCGCTATTATACCATATTTCCTTCCAAAGGATTGCTGAAAAGGAAATTATGGACG
CTGTTGAAACTGGGAAATTGTACCTGTTTCAGATTTATAATAAGGACTTCGCAAA
GGGTCACCATGGTAAGCCTAACCTTCACACTTTGTACTGGACCGGACTATTCTCG
CCTGAAAATTTGGCTAAAACAAGTATCAAGTTAAACGGTCAGGCCGAGTTATTTT
ATAGACCCAAATCTAGAATGAAAAGAATGGCCCATAGATTAGGCGAAAAGATGTT
AAACAAGAAATTAAAGGACCAAAAAACCCCGATACCAGACACTCTATACCAAGAA
CTGTACGACTATGTGAATCACAGGCTTAGTCACGATTTATCAGATGAAGCGAGGG
CTTTATTGCCAAATGTCATCACCAAGGAAGTATCACATGAAATAATTAAGGATAG
AAGGTTCACATCTGATAAATTCTTTTTTCATGTCCCAATTACATTGAATTATCAA
GCAGCGAACTCACCATCTAAATTTAATCAGCGCGTCAACGCCTATTTGAAAGAAC
ATCCCGAAACACCAATCATCGGCATAGATCGAGGTGAGAGAAACTTAATATATAT
AACTGTGATTGATTCTACAGGAAAAATCCTGGAGCAACGATCTTTAAATACCATA
CAACAGTTTGATTATCAAAAAAAGTTGGATAACAGAGAAAAAGAACGTGTTGCCG
CTAGGCAGGCTTGGTCTGTGGTAGGAACAATTAAGGACTTAAAGCAGGGCTATCT
GTCCCAAGTTATTCATGAAATAGTCGATCTGATGATACATTATCAGGCAGTTGTC
GTGTTGGAAAATTTGAATTTTGGCTTTAAATCAAAAAGAACTGGCATAGCAGAAA
AAGCTGTGTACCAGCAGTTTGAAAAGATGTTAATCGATAAGCTAAACTGCCTTGT
TCTTAAAGATTACCCCGCAGAAAAAGTAGGTGGTGTTCTTAATCCATATCAGTTG
ACAGACCAATTTACATCCTTTGCGAAAATGGGTACGCAAAGCGGGTTCTTATTCT
ACGTACCGGCCCCCTATACTTCTAAGATCGACCCACTAACAGGTTTTGTGGACCC
TTTTGTTTGGAAGACGATAAAGAACCACGAGTCACGCAAACATTTCTTAGAGGGC
TTTGATTTCTTGCACTACGACGTGAAAACTGGTGATTTTATCTTACACTTTAAAA
TGAACAGAAATCTCTCTTTCCAACGTGGACTGCCCGGATTCATGCCGGCTTGGGA
CATCGTTTTTGAAAAGAATGAAACGCAGTTTGACGCCAAAGGTACACCATTTATA
GCGGGTAAGAGAATTGTGCCGGTCATAGAAAACCATAGATTTACAGGTAGATATA
GGGATCTGTACCCTGCTAATGAATTGATTGCATTACTCGAAGAGAAAGGAATTGT
GTTTCGAGATGGATCGAATATTTTACCTAAGTTGTTGGAAAATGATGATTCACAC
GCAATTGATACTATGGTTGCCCTCATAAGATCGGTATTGCAAATGAGAAACTCAA
ATGCTGCTACGGGAGAGGATTATATAAACAGCCCCGTTCGCGATCTTAATGGTGT
TTGTTTTGATTCACGTTTTCAGAACCCCGAATGGCCAATGGATGCCGACGCAAAC
GGAGCATATCATATTGCTCTTAAAGGCCAACTACTATTAAATCACTTAAAGGAAT
CCAAAGACCTAAAATTGCAAAACGGGATATCTAATCAGGATTGGCTGGCTTACAT
ACAAGAACTACGTAACTAG
SEQ ATGGCCGTTAAGTCAATCAAAGTGAAACTTAGACTGGATGACATGCCAGAGATTC
ID GTGCGGGGTTATGGAAACTTCATAAGGAAGTTAACGCAGGGGTAAGATATTATAC
NO: CGAATGGTTATCATTACTTCGACAAGAGAATTTGTACAGAAGGTCCCCGAACGGC
139 GACGGTGAGCAAGAATGCGATAAGACGGCTGAAGAATGTAAGGCAGAACTTTTGG
AGCGCCTGAGAGCCCGTCAGGTTGAAAATGGCCATAGAGGTCCTGCGGGATCTGA
TGATGAGCTTTTACAGCTAGCTAGACAATTGTATGAATTGTTGGTCCCTCAGGCT
ATTGGGGCTAAAGGAGACGCTCAACAAATCGCCAGAAAGTTCTTGTCACCTCTGG
CTGACAAAGATGCCGTGGGAGGATTAGGTATCGCTAAAGCAGGTAATAAACCAAG
ATGGGTTAGAATGAGAGAAGCAGGCGAACCTGGTTGGGAAGAAGAGAAAGAAAAG
GCCGAAACTAGAAAAAGCGCTGACAGAACCGCAGATGTTTTACGGGCCTTGGCTG
ATTTTGGACTGAAGCCTTTGATGAGAGTGTATACTGATTCAGAAATGTCTTCCGT
TGAATGGAAGCCCCTAAGGAAGGGACAAGCGGTCAGAACCTGGGATAGGGATATG
TTTCAACAGGCTATTGAAAGGATGATGTCATGGGAATCCTGGAATCAAAGAGTAG
GTCAAGAATACGCTAAACTGGTCGAACAAAAGAATAGATTTGAACAAAAAAATTT
TGTAGGTCAAGAACATTTAGTACATTTGGTTAATCAACTTCAACAAGATATGAAA
GAGGCATCTCCTGGTTTGGAATCAAAAGAACAAACAGCACACTATGTTACCGGCC
GAGCTTTGCGAGGTTCTGACAAAGTATTTGAAAAGTGGGGGAAATTAGCTCCCGA
TGCCCCCTTTGATCTATATGATGCTGAAATTAAAAACGTTCAAAGAAGGAACACT
AGACGTTTTGGATCCCATGATCTTTTTGCAAAGCTAGCTGAGCCAGAATACCAGG
CTCTATGGCGTGAAGACGCCTCGTTTTTGACTAGATACGCAGTATACAATTCAAT
ACTCAGAAAACTAAACCATGCCAAGATGTTTGCTACATTCACCCTGCCCGATGCT
ACCGCTCATCCTATTTGGACTAGATTTGACAAGTTGGGGGGGAATCTACATCAGT
ACACATTTTTATTTAATGAATTCGGTGAAAGAAGACACGCTATTAGATTCCACAA
GCTCCTAAAGGTTGAAAACGGCGTTGCGAGAGAAGTTGATGATGTAACAGTTCCC
ATTTCTATGTCGGAGCAATTGGATAATCTATTGCCTAGAGACCCTAATGAACCAA
TTGCTTTGTACTTTCGTGACTACGGTGCAGAACAACACTTTACAGGTGAATTCGG
CGGAGCCAAGATTCAATGTAGACGTGATCAACTCGCACACATGCATAGAAGAAGA
GGCGCTCGTGATGTTTATTTAAATGTGTCTGTTAGAGTTCAATCCCAATCGGAGG
CTAGAGGTGAAAGAAGGCCACCATACGCAGCAGTTTTTAGGTTAGTAGGTGATAA
TCATAGGGCATTTGTCCACTTCGACAAATTAAGTGATTATTTAGCAGAGCACCCT
GATGATGGAAAGTTGGGCAGTGAGGGATTATTAAGTGGGTTGAGGGTAATGTCTG
TAGATCTTGGTCTTCGTACTTCTGCGAGTATCTCTGTCTTTAGAGTAGCACGTAA
GGATGAGTTGAAACCTAATAGCAAAGGAAGAGTCCCGTTTTTTTTTCCTATTAAG
GGTAACGATAACCTGGTGGCCGTGCATGAAAGATCACAACTTTTGAAATTGCCAG
GAGAAACGGAGTCCAAGGACTTGAGGGCAATTAGAGAGGAACGTCAGCGTACATT
GCGACAGCTGAGAACTCAATTGGCTTATTTGAGGTTGTTGGTTAGGTGTGGTTCC
GAGGATGTTGGCAGAAGAGAAAGGTCTTGGGCCAAATTGATAGAACAACCAGTGG
ACGCCGCAAATCACATGACACCAGATTGGAGAGAAGCTTTCGAAAATGAACTCCA
GAAATTAAAGAGCCTACATGGCATATGCTCTGATAAAGAGTGGATGGATGCCGTA
TACGAATCCGTTCGTAGAGTCTGGCGCCACATGGGTAAGCAAGTACGGGACTGGA
GAAAGGATGTTCGTTCCGGCGAAAGACCGAAGATAAGGGGGTATGCAAAGGACGT
TGTAGGCGGTAATTCTATTGAACAGATTGAGTATTTGGAAAGGCAGTACAAATTT
CTTAAATCCTGGAGCTTCTTCGGCAAAGTGTCAGGACAAGTCATCAGGGCTGAAA
AAGGTTCCAGATTTGCTATTACGCTAAGGGAACATATTGATCATGCGAAAGAAGA
TAGACTGAAAAAACTAGCAGATAGAATAATTATGGAAGCACTTGGTTACGTCTAT
GCACTTGATGAAAGAGGCAAGGGGAAATGGGTAGCTAAATACCCGCCTTGTCAAC
TTATTTTATTAGAAGAATTAAGCGAGTACCAATTTAACAACGATAGACCTCCATC
CGAAAATAATCAGCTGATGCAATGGTCCCATAGGGGTGTTTTTCAAGAATTGATA
AATCAAGCTCAAGTACACGATTTGCTGGTAGGTACTATGTACGCAGCGTTTTCGA
GCCGTTTTGATGCAAGAACTGGTGCCCCAGGTATCAGATGTCGACGTGTTCCGGC
CAGATGTACACAGGAACATAACCCTGAGCCATTTCCGTGGTGGCTTAATAAGTTT
GTTGTCGAGCACACATTAGACGCATGCCCTCTGAGAGCAGATGACCTTATACCCA
CTGGAGAAGGCGAAATATTTGTTAGTCCATTCTCTGCAGAAGAAGGTGACTTTCA
CCAGATACATGCAGACTTAAATGCAGCACAGAATCTCCAACAAAGGTTGTGGTCG
GATTTTGATATTTCGCAAATAAGACTAAGATGCGATTGGGGAGAGGTTGATGGAG
AATTGGTGCTGATTCCAAGATTAACCGGAAAGCGAACTGCCGATTCCTATTCTAA
CAAGGTGTTTTACACAAATACTGGTGTTACCTATTACGAAAGAGAAAGGGGTAAG
AAGAGACGTAAAGTATTTGCTCAAGAAAAATTGTCAGAAGAGGAGGCAGAACTGT
TAGTAGAAGCAGACGAAGCCAGAGAAAAATCAGTTGTGCTTATGCGTGACCCTTC
CGGCATTATAAATCGTGGTAATTGGACACGACAAAAAGAATTTTGGTCTATGGTC
AATCAACGTATCGAAGGCTACCTAGTTAAGCAAATCAGGTCTAGGGTTCCACTAC
AAGATAGCGCATGTGAAAATACGGGTGATATATAA
SEQ ATGGCTACTAGATCTTTCATTTTAAAAATTGAACCTAATGAAGAAGTGAAGAAGG
ID GTCTCTGGAAAACTCACGAAGTACTTAATCATGGCATTGCCTATTATATGAATAT
NO: CCTGAAGCTTATTCGTCAAGAAGCTATATACGAGCATCATGAGCAAGATCCTAAG
140 AACCCTAAGAAAGTAAGCAAAGCGGAAATTCAGGCTGAATTGTGGGACTTCGTCT
TGAAGATGCAGAAGTGTAACAGTTTTACGCACGAAGTTGATAAAGATGTGGTGTT
TAATATTTTGAGGGAGCTATATGAGGAGTTGGTGCCCTCGAGTGTCGAAAAAAAA
GGAGAAGCTAATCAGCTGTCAAATAAATTTTTATATCCTCTGGTGGATCCAAACT
CTCAATCAGGTAAAGGCACTGCCAGTAGTGGTCGAAAACCGAGATGGTATAATTT
GAAAATCGCAGGTGATCCATCGTGGGAAGAAGAAAAAAAAAAATGGGAAGAAGAT
AAAAAAAAAGATCCCCTTGCCAAAATACTAGGTAAGCTAGCCGAGTATGGACTTA
TACCATTATTCATTCCTTTCACGGACTCTAATGAACCAATTGTGAAGGAAATCAA
ATGGATGGAAAAATCACGTAATCAGTCTGTTAGGAGGTTGGACAAAGATATGTTT
ATACAGGCTCTTGAGAGGTTTTTGTCGTGGGAGTCCTGGAATTTGAAAGTGAAAG
AAGAATATGAAAAAGTGGAAAAGGAGCATAAGACGTTGGAAGAAAGGATTAAGGA
AGATATTCAGGCCTTTAAGAGTCTGGAACAGTACGAAAAAGAAAGACAGGAACAG
TTATTGAGAGATACTCTAAACACTAATGAATATAGGCTTTCCAAGAGGGGCTTGC
GAGGATGGAGAGAGATAATTCAGAAATGGTTGAAAATGGATGAGAACGAGCCATC
GGAGAAATATCTAGAGGTGTTTAAAGATTACCAAAGAAAGCACCCTCGCGAAGCT
GGTGATTACTCTGTTTATGAATTCCTTTCGAAGAAGGAAAATCACTTCATCTGGC
GAAATCATCCAGAGTACCCATATTTATATGCTACATTTTGCGAAATTGACAAGAA
AAAAAAAGATGCTAAACAGCAAGCGACATTCACCCTCGCTGATCCCATCAACCAC
CCATTATGGGTCAGGTTCGAAGAGAGATCAGGCTCGAACCTGAATAAGTACAGGA
TCTTGACTGAGCAATTGCATACTGAGAAGTTAAAAAAGAAATTGACGGTCCAACT
TGACAGATTGATTTATCCCACTGAATCTGGTGGATGGGAGGAGAAAGGTAAGGTT
GATATTGTCCTATTGCCTTCTCGTCAATTTTACAACCAAATATTTCTGGACATCG
AAGAGAAGGGTAAACATGCTTTTACCTATAAGGATGAGAGTATTAAATTTCCATT
GAAGGGAACGCTTGGCGGCGCTAGAGTTCAGTTCGATAGAGATCATTTGAGAAGA
TACCCGCATAAAGTGGAATCTGGTAATGTAGGTCGGATCTACTTTAACATGACGG
TAAATATTGAACCTACCGAGTCACCAGTCAGTAAGTCTTTAAAGATTCATAGGGA
TGATTTCCCTAAATTTGTCAACTTCAAGCCTAAGGAACTAACCGAGTGGATCAAA
GACAGTAAAGGCAAAAAGTTAAAGAGCGGTATTGAGTCCCTGGAGATAGGTCTTA
GAGTCATGTCTATCGATTTGGGTCAAAGACAAGCAGCCGCAGCATCTATTTTCGA
AGTTGTTGACCAAAAACCGGATATCGAGGGGAAATTATTTTTTCCAATAAAAGGA
ACTGAGCTATACGCTGTGCATCGCGCATCCTTCAATATAAAACTGCCAGGAGAAA
CACTAGTAAAATCTAGAGAGGTCTTGCGTAAAGCACGTGAGGACAATCTCAAATT
AATGAATCAGAAGTTAAATTTCCTTAGGAACGTGTTGCATTTCCAACAGTTCGAG
GACATAACTGAACGCGAGAAAAGAGTCACTAAGTGGATCTCAAGACAAGAAAATA
GTGATGTGCCATTAGTGTATCAAGACGAACTTATTCAAATAAGAGAGCTAATGTA
TAAACCATATAAAGACTGGGTGGCATTCTTAAAACAATTACACAAGCGGCTTGAA
GTAGAAATAGGAAAAGAAGTAAAGCATTGGAGGAAGAGTCTGTCCGATGGTCGCA
AAGGCCTGTACGGGATATCACTTAAAAATATTGATGAAATTGACAGAACACGAAA
ATTTTTGTTAAGATGGTCATTGAGACCAACCGAACCAGGTGAGGTTAGAAGGTTG
GAACCAGGCCAAAGGTTTGCCATCGATCAATTAAACCATCTTAACGCACTGAAAG
AAGATAGATTGAAGAAGATGGCGAACACTATTATTATGCACGCTCTAGGTTATTG
CTATGATGTGAGAAAGAAAAAATGGCAAGCCAAGAACCCTGCATGCCAAATTATT
TTGTTTGAAGATCTTTCTAATTACAATCCATACGAAGAGCGTTCACGTTTTGAAA
ACTCTAAATTGATGAAATGGTCTAGAAGAGAGATTCCGAGACAGGTCGCTCTACA
AGGGGAGATTTACGGTCTTCAAGTCGGTGAGGTTGGTGCTCAATTTTCTTCCAGA
TTTCATGCAAAAACTGGGTCTCCAGGCATTAGGTGTTCGGTCGTTACTAAGGAAA
AGTTACAGGACAACCGTTTCTTCAAAAATTTGCAACGTGAAGGCCGTTTAACACT
TGATAAGATAGCTGTCCTTAAGGAAGGCGATCTGTACCCAGATAAAGGTGGTGAG
AAATTCATATCTTTGAGTAAAGACAGGAAACTGGTTACAACACACGCCGACATTA
ACGCAGCTCAGAACTTGCAAAAGAGATTCTGGACAAGGACCCACGGCTTCTATAA
GGTGTACTGTAAAGCTTATCAAGTAGATGGACAAACGGTTTATATTCCTGAATCA
AAGGACCAGAAACAAAAAATTATAGAAGAATTTGGTGAAGGATACTTTATCTTGA
AGGATGGAGTTTATGAGTGGGGCAATGCAGGTAAGTTAAAGATAAAGAAAGGTTC
ATCAAAGCAATCAAGTAGCGAACTGGTCGATTCGGATATTTTAAAGGATAGCTTT
GATCTAGCTAGTGAATTGAAGGGAGAAAAGTTAATGTTATACAGAGATCCCAGTG
GGAATGTATTTCCATCTGATAAGTGGATGGCCGCCGGAGTGTTTTTTGGCAAATT
AGAGAGAATCTTGATTTCTAAACTGACCAATCAATACTCAATTTCGACCATCGAA
GACGACTCTTCAAAACAATCCATGTGA
SEQ ATGCCTACTCGCACCATCAATCTGAAGTTAGTTTTGGGGAAGAACCCAGAAAATG
ID CGACTCTAAGACGGGCACTATTCTCTACACATAGACTTGTCAACCAAGCGACTAA
NO: GAGAATTGAAGAATTTTTACTGTTGTGTAGAGGAGAAGCTTATCGTACCGTAGAT
141 AATGAAGGTAAAGAAGCTGAGATCCCACGCCATGCTGTTCAAGAAGAGGCGCTTG
CTTTTGCAAAAGCTGCACAACGACATAACGGCTGTATCTCCACATATGAGGACCA
GGAAATCTTGGATGTGCTTAGACAATTGTATGAAAGATTAGTACCTAGCGTCAAT
GAAAACAACGAGGCTGGGGATGCCCAAGCCGCTAACGCTTGGGTGAGTCCATTAA
TGAGTGCAGAGTCCGAAGGTGGACTATCGGTCTATGATAAAGTGTTAGACCCGCC
GCCAGTATGGATGAAACTCAAAGAAGAGAAAGCGCCTGGTTGGGAAGCTGCTTCT
CAGATTTGGATACAGTCCGACGAAGGTCAATCGCTGCTAAATAAACCGGGTAGCC
CACCACGTTGGATTAGAAAACTTAGATCTGGTCAACCGTGGCAAGATGACTTCGT
TTCAGACCAAAAAAAAAAGCAAGATGAACTAACGAAAGGTAACGCACCACTCATA
AAACAATTGAAAGAGATGGGCCTCTTGCCTTTAGTTAATCCCTTTTTTAGACATT
TGTTGGATCCCGAGGGTAAGGGTGTATCCCCATGGGACAGATTGGCCGTAAGGGC
CGCGGTGGCGCACTTCATCTCTTGGGAAAGTTGGAACCACAGAACAAGAGCTGAG
TATAACAGTTTGAAACTGCGAAGAGATGAATTTGAGGCCGCATCTGATGAATTCA
AGGACGATTTTACATTGCTACGACAATATGAGGCTAAGCGACATAGTACGCTTAA
GTCAATTGCCTTAGCTGATGACTCTAACCCGTACCGAATTGGTGTAAGGTCCTTG
AGAGCCTGGAATAGGGTTAGAGAAGAATGGATTGACAAAGGCGCAACCGAGGAAC
AAAGGGTTACCATCCTTAGTAAGCTTCAAACACAATTACGGGGTAAATTCGGTGA
TCCAGACCTATTTAATTGGCTAGCCCAAGATAGACACGTACACCTGTGGTCCCCG
AGAGATTCCGTCACGCCCCTCGTAAGGATTAATGCCGTCGACAAAGTGCTTAGAA
GACGTAAGCCTTATGCACTGATGACTTTTGCACATCCGAGATTCCATCCAAGATG
GATTCTATACGAAGCGCCTGGTGGTTCTAACTTGCGACAATACGCTTTAGATTGT
ACTGAAAATGCTCTGCATATTACACTTCCATTACTCGTCGACGACGCCCATGGTA
CATGGATTGAGAAAAAAATCCGCGTACCACTCGCTCCTAGTGGACAAATACAAGA
TTTAACTTTAGAAAAACTTGAAAAGAAAAAAAACAGATTATACTATAGATCAGGA
TTCCAACAATTTGCTGGATTAGCCGGTGGTGCTGAGGTGTTGTTTCATAGGCCGT
ATATGGAACATGATGAGAGATCAGAAGAATCTCTGTTGGAAAGGCCAGGCGCTGT
GTGGTTCAAATTAACCTTAGATGTTGCTACCCAAGCACCACCTAACTGGTTAGAT
GGTAAAGGCAGAGTTAGGACACCTCCAGAAGTTCATCATTTCAAAACCGCTCTGT
CAAATAAATCTAAACATACGAGAACCTTGCAACCAGGATTGAGAGTCCTTTCTGT
TGATTTGGGTATGAGAACATTTGCTTCTTGTTCTGTTTTCGAATTGATCGAAGGT
AAACCTGAAACAGGTAGAGCATTCCCTGTTGCTGACGAAAGATCAATGGATAGTC
CAAATAAGTTATGGGCCAAGCACGAGAGAAGCTTTAAACTAACTCTGCCTGGAGA
AACACCGAGCAGAAAGGAGGAAGAAGAGAGAAGCATTGCTAGGGCAGAGATTTAC
GCGCTGAAAAGAGATATTCAAAGACTGAAATCACTCCTAAGATTAGGTGAGGAAG
ATAATGATAATAGAAGAGATGCTTTGTTAGAGCAATTCTTTAAAGGATGGGGTGA
AGAGGACGTAGTTCCTGGTCAAGCTTTCCCTAGAAGCCTCTTTCAGGGATTAGGC
GCTGCACCCTTTAGGTCAACACCCGAATTGTGGAGACAGCACTGTCAGACGTATT
ACGACAAAGCGGAAGCTTGCCTGGCAAAGCATATTTCCGACTGGAGGAAGAGAAC
TAGACCTCGTCCGACTTCGAGAGAGATGTGGTATAAGACAAGATCTTACCATGGT
GGCAAAAGTATTTGGATGCTAGAATACTTAGATGCTGTCCGCAAATTACTACTTT
CATGGTCGTTAAGAGGTCGTACTTACGGAGCTATTAATAGACAAGACACCGCTCG
TTTTGGTTCCTTAGCTTCTAGATTGTTGCATCATATCAACTCTTTAAAGGAAGAC
CGCATCAAAACCGGTGCAGATAGTATTGTGCAGGCCGCAAGGGGCTATATTCCTC
TCCCACATGGCAAGGGTTGGGAACAGCGTTATGAACCCTGTCAGTTGATATTATT
TGAAGATCTAGCTAGGTACAGATTTCGTGTAGACAGACCTCGGAGAGAGAATTCG
CAATTGATGCAGTGGAATCATCGAGCTATAGTAGCAGAAACGACGATGCAAGCTG
AACTATACGGTCAAATAGTCGAAAATACCGCTGCTGGTTTCTCCTCAAGATTTCA
TGCTGCAACTGGTGCTCCTGGTGTCAGATGTCGCTTTTTGTTAGAACGAGATTTC
GATAATGACCTACCAAAGCCGTACTTACTGAGAGAACTAAGTTGGATGTTAGGTA
ACACAAAGGTTGAATCAGAGGAAGAAAAATTGCGTCTTCTAAGCGAGAAAATTAG
ACCAGGTTCATTAGTCCCTTGGGATGGGGGTGAACAATTCGCGACATTACACCCG
AAAAGACAAACTCTTTGTGTCATTCACGCAGATATGAACGCTGCTCAAAACCTGC
AACGCAGATTTTTCGGAAGGTGTGGGGAAGCCTTTCGCCTTGTGTGTCAGCCACA
TGGTGATGATGTTTTGAGGCTAGCGTCTACACCAGGTGCAAGACTTTTGGGTGCA
TTACAACAACTGGAAAATGGTCAGGGAGCTTTCGAATTAGTTCGTGATATGGGTA
GCACATCACAAATGAATCGTTTCGTCATGAAGTCGTTGGGCAAAAAAAAGATCAA
GCCATTACAAGACAATAACGGGGATGATGAACTAGAAGACGTGCTATCTGTTTTA
CCTGAAGAAGATGATACCGGACGAATTACTGTATTTCGGGACTCTTCGGGTATAT
TCTTCCCTTGTAACGTTTGGATCCCGGCAAAACAGTTCTGGCCTGCGGTCCGTGC
TATGATTTGGAAGGTTATGGCATCACATTCATTGGGTTAG
SEQ ATGACAAAGTTAAGGCATAGACAGAAGAAGTTAACTCACGATTGGGCGGGGTCTA
ID AAAAGAGAGAAGTTCTAGGGAGCAATGGTAAATTACAGAATCCATTGCTAATGCC
NO: CGTCAAAAAAGGTCAGGTGACAGAATTTCGAAAAGCATTTTCCGCATACGCCCGA
142 GCAACCAAAGGGGAAATGACGGATGGCAGAAAAAATATGTTTACTCACTCATTTG
AACCATTCAAGACCAAGCCTTCGTTACATCAGTGCGAACTGGCTGACAAAGCCTA
CCAGAGCTTGCATTCATATTTACCGGGTTCTTTGGCGCATTTTCTTTTATCTGCC
CATGCACTTGGTTTTAGGATTTTTAGCAAATCAGGGGAAGCCACTGCATTCCAAG
CGTCCTCAAAGATTGAAGCTTACGAAAGCAAGTTAGCTAGCGAGCTTGCTTGTGT
TGATTTGTCTATTCAGAACTTGACTATTTCAACTTTGTTCAACGCATTAACGACT
TCCGTAAGAGGTAAAGGTGAGGAGACATCGGCAGATCCACTGATAGCTAGATTTT
ACACCTTACTTACCGGTAAACCACTAAGCAGAGACACTCAGGGCCCAGAACGAGA
TTTAGCCGAGGTGATAAGCAGAAAAATTGCAAGTTCTTTTGGAACTTGGAAGGAG
ATGACTGCCAATCCACTTCAATCTCTTCAATTTTTTGAAGAGGAGTTGCATGCGC
TAGATGCAAATGTTAGTTTGTCACCTGCCTTCGATGTTCTGATTAAGATGAACGA
CCTGCAGGGTGACTTGAAGAACAGAACGATAGTTTTTGATCCAGATGCTCCTGTG
TTTGAATATAATGCTGAGGATCCTGCTGACATCATCATTAAACTGACAGCTAGAT
ATGCGAAAGAAGCAGTGATTAAAAATCAAAATGTCGGGAATTATGTTAAGAACGC
TATTACGACAACTAACGCAAACGGACTAGGTTGGTTGCTGAACAAAGGCCTTTCC
TTATTGCCTGTCTCCACTGATGACGAACTATTGGAGTTTATTGGGGTCGAGAGAT
CCCATCCTAGCTGTCATGCGTTGATAGAACTTATCGCTCAGTTAGAAGCACCTGA
ACTGTTCGAAAAAAATGTTTTTTCTGATACTCGTTCCGAGGTTCAAGGTATGATA
GATTCAGCTGTAAGCAATCATATCGCCAGGCTGTCAAGCTCTCGTAATTCATTGA
GCATGGACTCAGAGGAACTTGAGAGATTGATAAAATCTTTTCAAATTCATACACC
ACATTGTTCATTATTTATAGGGGCTCAATCCTTATCTCAACAATTGGAAAGCCTA
CCCGAAGCATTGCAGTCAGGAGTGAACAGTGCTGATATTCTGCTCGGCTCAACCC
AATACATGTTGACAAATTCTTTGGTCGAGGAGTCAATCGCTACGTATCAGAGAAC
CTTAAATAGAATTAACTACCTGTCCGGCGTTGCAGGACAGATTAACGGTGCTATT
AAGAGGAAAGCTATTGATGGTGAGAAGATACATTTACCCGCTGCTTGGTCAGAGT
TAATTTCTTTACCCTTTATTGGGCAACCAGTGATTGATGTTGAATCAGATTTAGC
CCACTTAAAGAACCAATACCAGACATTGTCTAACGAATTTGATACGCTGATTTCC
GCACTGCAAAAGAATTTCGACTTAAATTTTAATAAAGCCTTGCTTAATCGAACAC
AACATTTCGAGGCTATGTGTAGATCAACAAAAAAGAATGCCCTTTCTAAGCCTGA
GATCGTTAGTTATAGAGATTTGCTAGCCAGGTTGACTTCTTGTCTTTATAGGGGC
TCTCTAGTCTTGAGGAGGGCGGGTATAGAAGTACTGAAAAAGCACAAGATATTTG
AGTCCAACTCTGAATTAAGAGAGCACGTTCATGAAAGAAAACACTTCGTATTTGT
TTCTCCGCTCGATAGAAAAGCCAAGAAGCTCCTACGTTTGACTGACTCTAGGCCT
GATTTATTGCACGTAATTGATGAAATACTACAACATGATAATTTAGAGAACAAGG
ATAGAGAATCTTTGTGGTTAGTTCGATCTGGTTATTTACTGGCCGGCCTACCAGA
CCAACTCTCCTCTTCCTTTATAAATCTTCCAATCATTACTCAAAAAGGCGATCGT
CGCTTGATAGATCTCATTCAATACGACCAAATTAATAGAGATGCTTTTGTGATGT
TGGTAACTTCCGCTTTTAAGTCGAACTTAAGTGGGCTGCAGTACAGAGCAAACAA
ACAATCTTTTGTGGTTACGCGCACTTTGTCACCATATTTGGGATCTAAATTGGTT
TATGTGCCCAAAGATAAAGATTGGCTGGTCCCTTCCCAAATGTTCGAGGGGAGAT
TTGCGGACATTTTGCAATCCGATTATATGGTGTGGAAGGACGCTGGAAGATTGTG
TGTTATTGACACAGCTAAGCATTTGTCTAACATTAAAAAATCTGTATTCTCAAGT
GAAGAAGTCCTCGCGTTTTTAAGAGAATTGCCACACCGTACGTTTATCCAAACTG
AGGTCAGGGGTTTAGGGGTGAATGTGGACGGTATTGCATTTAATAACGGGGATAT
ACCCTCTCTGAAGACGTTTAGCAATTGCGTGCAAGTCAAAGTGAGTCGGACAAAC
ACTAGTCTGGTCCAAACATTAAATAGATGGTTTGAAGGCGGTAAGGTCTCGCCGC
CTAGCATCCAATTTGAGAGAGCATATTACAAAAAAGATGATCAAATCCACGAGGA
CGCTGCAAAAAGGAAGATAAGGTTTCAAATGCCAGCTACAGAGTTGGTACACGCG
TCAGACGACGCAGGATGGACCCCCTCCTATTTACTTGGTATCGATCCCGGTGAAT
ATGGTATGGGTTTGTCATTGGTCTCAATAAATAATGGCGAAGTTTTAGATAGCGG
ATTTATACACATAAATTCATTGATAAATTTCGCTTCTAAGAAATCAAATCATCAA
ACCAAAGTTGTTCCGAGGCAGCAATACAAGTCACCATACGCCAACTATCTAGAAC
AATCTAAAGATTCTGCAGCAGGAGACATAGCTCATATTTTGGATAGACTTATCTA
CAAGTTGAACGCCCTACCCGTTTTCGAAGCTCTATCTGGCAATAGTCAAAGCGCA
GCGGATCAGGTTTGGACAAAAGTCCTCAGCTTCTACACCTGGGGAGATAATGATG
CACAAAATTCAATTCGTAAGCAACATTGGTTCGGTGCTTCACACTGGGACATTAA
AGGCATGTTGAGGCAACCGCCAACAGAAAAAAAGCCCAAACCATACATTGCCTTT
CCCGGTTCACAAGTTTCTTCTTATGGTAATTCTCAAAGGTGTTCATGTTGTGGAC
GTAACCCAATTGAACAATTGCGCGAAATGGCGAAGGACACATCCATTAAGGAGTT
GAAGATTAGAAATTCAGAAATTCAATTGTTCGACGGTACTATAAAGTTATTTAAT
CCAGACCCGTCAACGGTCATAGAAAGAAGAAGACATAATTTAGGGCCATCAAGAA
TTCCTGTAGCTGATAGAACTTTCAAAAATATAAGTCCAAGCTCACTAGAATTCAA
AGAACTAATAACGATTGTGTCACGGTCTATACGTCATTCCCCAGAATTTATTGCT
AAAAAAAGAGGTATAGGTAGTGAGTACTTTTGTGCTTATAGTGATTGTAATTCCT
CCTTAAATTCAGAAGCAAATGCGGCTGCGAACGTTGCCCAAAAGTTCCAAAAGCA
ATTGTTTTTCGAATTATAG
SEQ ATGAAAAGAATCTTGAACTCTTTAAAGGTTGCCGCCCTGCGTTTGTTATTTAGAG
ID GTAAAGGATCTGAACTTGTCAAGACTGTTAAATACCCTTTGGTCTCGCCGGTTCA
NO: GGGTGCAGTTGAGGAGTTAGCTGAGGCGATCCGCCATGATAACCTACATCTGTTT
143 GGTCAAAAAGAAATTGTTGACCTTATGGAAAAGGATGAAGGTACGCAAGTTTACT
CAGTGGTTGATTTCTGGTTAGATACCCTTCGTTTGGGGATGTTTTTCAGTCCATC
AGCAAACGCATTAAAAATCACGCTGGGTAAGTTTAATTCTGATCAGGTTAGCCCT
TTTAGGAAAGTGTTAGAGCAGTCTCCATTCTTCTTGGCTGGTAGGCTGAAGGTTG
AACCGGCAGAACGTATATTATCTGTCGAGATCCGTAAGATTGGGAAGAGGGAAAA
CAGAGTTGAGAACTATGCTGCTGACGTAGAAACGTGTTTTATAGGCCAATTAAGT
TCAGATGAGAAACAGTCAATACAAAAATTAGCTAATGATATCTGGGATAGTAAAG
ATCATGAAGAGCAAAGAATGTTAAAGGCAGATTTCTTCGCTATCCCTTTGATTAA
GGATCCAAAGGCTGTGACCGAAGAGGATCCTGAAAATGAAACTGCTGGTAAACAA
AAACCCTTGGAGTTGTGTGTCTGCCTTGTCCCAGAACTTTACACAAGAGGATTCG
GGTCAATAGCCGATTTTTTGGTTCAACGCTTAACTCTTTTAAGGGATAAAATGTC
TACAGATACTGCAGAAGATTGTTTAGAATATGTCGGGATTGAGGAGGAAAAAGGT
AACGGCATGAACTCATTGTTGGGAACGTTCTTAAAGAATTTGCAAGGCGATGGAT
TTGAGCAGATTTTCCAATTTATGTTAGGGAGCTATGTCGGTTGGCAAGGGAAGGA
AGATGTTTTAAGAGAGAGATTAGACTTATTGGCTGAAAAAGTGAAGAGGTTACCG
AAACCAAAATTTGCTGGCGAATGGTCTGGTCATAGGATGTTCTTGCATGGCCAAT
TGAAGTCTTGGTCTTCAAATTTTTTTAGACTATTTAACGAGACAAGGGAACTTCT
AGAGTCTATTAAGTCAGATATACAGCATGCCACAATGCTAATATCATATGTAGAA
GAAAAAGGTGGTTATCATCCTCAATTACTTAGTCAATATAGAAAACTTATGGAAC
AACTACCAGCTTTGCGTACCAAGGTATTGGACCCTGAGATTGAAATGACACATAT
GTCCGAAGCAGTTCGCTCTTATATAATGATACATAAATCTGTTGCGGGTTTTTTA
CCGGATTTATTAGAATCATTAGATAGAGACAAGGATCGTGAGTTTCTGCTTAGTA
TTTTTCCAAGAATCCCAAAAATTGATAAAAAAACCAAGGAAATTGTAGCTTGGGA
ACTGCCGGGAGAACCAGAAGAAGGTTATTTATTTACTGCTAATAACTTGTTCAGA
AACTTCTTAGAGAATCCGAAACATGTCCCGAGATTTATGGCCGAAAGGATCCCAG
AAGATTGGACTCGATTACGCTCTGCTCCTGTCTGGTTCGATGGAATGGTAAAACA
ATGGCAAAAAGTCGTTAACCAGTTAGTAGAATCACCAGGTGCTTTATATCAATTT
AACGAATCCTTCTTGAGACAAAGGTTACAGGCCATGTTAACTGTGTATAAGAGGG
ACTTACAAACTGAAAAATTTCTTAAACTTTTGGCGGATGTTTGTAGGCCTCTTGT
AGATTTTTTTGGTTTGGGTGGAAATGATATTATTTTTAAGAGCTGTCAAGACCCA
AGAAAACAATGGCAAACCGTTATTCCTCTCTCTGTTCCGGCAGATGTCTATACTG
CTTGCGAAGGTTTGGCGATTAGACTAAGGGAGACATTAGGATTCGAATGGAAGAA
TTTGAAAGGTCACGAGAGAGAAGATTTCTTAAGATTGCACCAGTTATTGGGCAAT
TTACTTTTCTGGATTCGTGATGCTAAATTGGTAGTAAAATTAGAGGATTGGATGA
ACAACCCATGTGTTCAGGAATATGTAGAAGCCCGGAAAGCTATCGATCTTCCACT
AGAAATATTCGGTTTTGAAGTGCCTATCTTCCTGAATGGCTATCTATTTTCGGAG
TTGAGACAATTAGAACTTTTGCTTAGGAGAAAAAGTGTGATGACTAGCTACAGTG
TAAAGACTACTGGATCTCCTAATAGGCTATTTCAGCTAGTTTATTTACCTCTAAA
CCCTAGTGACCCCGAAAAGAAGAACTCAAATAACTTTCAAGAACGTTTGGATACC
CCAACTGGTTTGTCCCGTCGTTTCCTAGACCTAACCCTTGATGCATTCGCAGGTA
AGTTACTTACCGATCCAGTTACACAAGAATTGAAGACAATGGCAGGTTTTTACGA
TCATCTTTTTGGATTCAAATTGCCATGTAAACTCGCCGCCATGTCGAATCATCCA
GGTTCTTCTTCAAAGATGGTTGTGTTAGCGAAACCCAAAAAAGGTGTTGCTTCTA
ATATAGGGTTTGAACCGATCCCAGATCCCGCTCATCCCGTATTTAGGGTTAGATC
CAGTTGGCCAGAGTTGAAGTACCTCGAGGGGCTATTGTATTTGCCAGAAGACACA
CCTTTGACCATCGAATTAGCAGAGACCTCCGTATCGTGCCAAAGTGTCTCGTCAG
TTGCATTCGATTTGAAAAACTTGACAACGATCTTAGGTCGTGTGGGAGAATTTAG
GGTCACAGCTGATCAACCCTTTAAACTAACGCCTATAATCCCGGAGAAAGAAGAA
TCTTTTATTGGTAAAACTTATTTGGGTCTCGACGCGGGTGAAAGGAGCGGCGTCG
GTTTCGCTATTGTTACAGTGGACGGAGATGGGTACGAAGTGCAAAGATTGGGGGT
CCACGAGGATACACAGCTTATGGCCTTGCAGCAAGTTGCTAGTAAATCCTTAAAA
GAGCCAGTATTTCAGCCTCTAAGAAAAGGCACCTTTAGACAACAAGAAAGAATAC
GGAAATCCTTACGTGGTTGCTACTGGAATTTTTATCATGCCTTGATGATAAAATA
TAGGGCCAAAGTAGTACATGAGGAATCTGTCGGAAGTAGTGGTCTTGTGGGTCAA
TGGTTGAGGGCTTTTCAGAAGGATTTGAAGAAAGCCGATGTTCTCCCCAAGAAGG
GCGGTAAAAACGGTGTAGATAAGAAGAAGAGAGAGTCCTCAGCTCAAGACACTCT
TTGGGGTGGTGCTTTCTCTAAAAAGGAGGAGCAACAGATTGCGTTTGAGGTGCAA
GCTGCAGGTTCTTCGCAATTTTGTTTGAAGTGCGGATGGTGGTTCCAACTAGGCA
TGCGTGAAGTAAACAGGGTACAAGAATCGGGCGTCGTGTTAGATTGGAATAGAAG
CATAGTTACCTTTTTAATAGAATCATCCGGCGAAAAAGTTTATGGTTTCTCCCCA
CAGCAATTAGAGAAGGGTTTCAGACCAGACATCGAAACTTTTAAAAAGATGGTAA
GAGACTTTATGAGACCTCCTATGTTTGATAGAAAAGGCAGACCGGCCGCAGCTTA
CGAGAGATTTGTTTTAGGAAGGAGACATCGAAGGTACAGGTTTGATAAAGTATTT
GAGGAAAGATTTGGGAGGTCTGCTCTTTTCATTTGTCCTAGAGTAGGTTGTGGAA
ATTTTGACCACAGCTCCGAACAGTCCGCGGTTGTTTTGGCCTTGATCGGATATAT
TGCCGATAAGGAGGGAATGTCAGGTAAGAAGTTGGTTTATGTACGGCTGGCCGAA
CTTATGGCCGAATGGAAACTAAAAAAATTAGAAAGATCCAGAGTTGAAGAACAAT
CATCCGCTCAATAA
SEQ ATGGCAGAAAGCAAACAAATGCAGTGTAGGAAATGTGGAGCTAGTATGAAGTACG
ID AAGTCATCGGTTTGGGTAAAAAGTCATGTAGATACATGTGTCCCGATTGTGGCAA
NO: CCATACCTCGGCAAGAAAGATACAAAACAAAAAAAAAAGAGATAAAAAATATGGG
144 TCAGCCAGTAAAGCCCAATCTCAAAGAATTGCTGTAGCAGGTGCTCTTTACCCTG
ACAAAAAAGTACAAACTATCAAAACCTATAAATATCCAGCAGACTTGAATGGTGA
GGTGCATGATAGCGGTGTTGCCGAGAAAATCGCACAAGCAATACAAGAGGACGAG
ATTGGACTTTTGGGACCAAGCTCAGAATATGCATGCTGGATTGCATCTCAAAAAC
AGTCTGAGCCTTACAGTGTAGTCGATTTCTGGTTTGATGCAGTGTGCGCAGGGGG
AGTCTTCGCCTACTCTGGCGCTAGATTATTGAGTACAGTTTTACAGTTATCCGGT
GAGGAATCGGTGCTTAGAGCTGCCTTAGCCTCGTCTCCATTCGTTGACGATATAA
ACTTAGCGCAAGCCGAAAAGTTTTTGGCGGTTAGCAGGCGTACAGGTCAAGATAA
GTTAGGTAAGAGAATTGGGGAGTGCTTTGCAGAAGGAAGATTGGAAGCTTTAGGG
ATAAAAGATAGAATGAGGGAATTTGTTCAAGCTATCGATGTTGCACAGACCGCCG
GACAACGTTTCGCTGCCAAATTGAAGATATTCGGTATAAGTCAGATGCCAGAAGC
TAAGCAATGGAATAACGATTCCGGACTGACTGTCTGTATACTACCTGATTATTAT
GTTCCCGAAGAGAATCGCGCGGACCAACTTGTAGTGTTGTTAAGAAGACTTCGCG
AGATTGCATATTGCATGGGTATTGAAGATGAAGCGGGTTTCGAACATCTTGGAAT
AGATCCTGGTGCTCTTTCGAATTTTTCAAACGGTAACCCTAAGAGAGGATTTCTA
GGGAGGCTGTTAAATAACGATATTATTGCGTTGGCAAACAATATGAGTGCGATGA
CTCCATATTGGGAAGGGCGTAAGGGTGAACTCATAGAAAGGCTTGCGTGGTTAAA
GCACAGGGCAGAAGGGCTGTATCTTAAAGAACCTCATTTCGGTAACTCCTGGGCC
GATCATAGGTCACGAATTTTCTCAAGGATCGCAGGCTGGTTATCTGGTTGCGCTG
GCAAGTTGAAAATTGCGAAAGACCAAATTTCTGGAGTACGTACAGATCTATTTCT
GCTAAAAAGACTGCTGGACGCAGTTCCGCAATCGGCGCCATCCCCCGATTTTATT
GCGTCAATTTCGGCACTTGACAGGTTTTTAGAAGCTGCAGAATCGAGCCAGGACC
CTGCTGAACAAGTGAGGGCTCTCTACGCTTTTCACTTGAACGCACCTGCAGTCCG
AAGTATAGCCAATAAAGCAGTGCAAAGGTCCGACAGCCAAGAATGGCTGATAAAA
GAACTAGACGCTGTTGACCATTTAGAATTTAACAAAGCGTTCCCATTTTTCTCTG
ACACAGGAAAAAAAAAAAAAAAAGGTGCTAATAGCAACGGTGCTCCATCGGAAGA
AGAGTACACTGAAACGGAATCAATACAACAACCTGAGGACGCGGAACAGGAAGTA
AACGGACAAGAAGGGAACGGAGCGTCTAAAAATCAAAAGAAATTTCAAAGAATAC
CTAGATTCTTCGGTGAAGGCTCCAGATCTGAATACAGAATTTTAACGGAAGCTCC
ACAGTATTTCGATATGTTTTGTAATAACATGAGGGCTATATTTATGCAGTTAGAA
AGTCAACCCCGTAAAGCTCCCAGAGATTTTAAATGTTTCCTACAAAATCGATTAC
AAAAATTATACAAACAGACTTTCTTGAATGCACGAAGCAACAAGTGTCGCGCTCT
GCTTGAGTCAGTTTTAATCTCTTGGGGAGAATTTTATACATACGGTGCCAACGAA
AAGAAATTTAGATTAAGACATGAAGCTTCAGAACGCAGCAGTGACCCAGATTACG
TAGTTCAGCAAGCCTTGGAAATCGCGCGTCGTCTATTCCTTTTTGGCTTCGAATG
GAGAGATTGCTCCGCTGGTGAAAGAGTGGATTTGGTTGAAATTCACAAAAAGGCT
ATCAGTTTTTTGTTGGCTATTACTCAAGCTGAGGTCTCTGTTGGTTCATACAATT
GGCTTGGCAACTCAACAGTATCGAGATATTTATCCGTTGCGGGAACTGATACCTT
ATACGGTACCCAATTGGAAGAATTCCTGAACGCTACAGTGTTGAGTCAAATGCGT
GGTCTGGCCATTAGATTGAGTTCTCAAGAACTTAAGGACGGTTTTGATGTGCAGC
TCGAGTCTTCCTGCCAGGACAATCTGCAACACCTATTGGTGTATAGGGCTTCGAG
AGATTTGGCGGCTTGCAAGCGCGCTACTTGTCCAGCCGAACTCGATCCTAAGATT
TTAGTTTTACCGGTAGGTGCATTCATCGCTTCCGTAATGAAAATGATAGAAAGAG
GTGACGAACCTTTAGCTGGTGCTTATTTACGGCATAGGCCACACTCTTTCGGATG
GCAAATTAGGGTCCGCGGTGTTGCTGAGGTAGGGATGGATCAGGGTACAGCATTG
GCCTTTCAAAAGCCAACAGAGTCAGAACCTTTTAAAATTAAGCCCTTCTCTGCAC
AGTATGGACCAGTTCTGTGGTTGAACAGTAGTAGTTATTCTCAATCACAATATTT
GGACGGTTTTCTATCTCAACCAAAAAATTGGAGTATGAGGGTGTTGCCTCAGGCG
GGTTCAGTTCGCGTCGAACAACGAGTTGCTTTGATATGGAACTTACAAGCAGGCA
AGATGAGACTAGAACGCTCCGGTGCGAGGGCCTTTTTCATGCCTGTACCGTTTTC
ATTTAGGCCATCCGGCAGTGGGGACGAAGCAGTTTTGGCGCCCAACCGGTACTTG
GGTCTGTTCCCTCATTCCGGAGGTATAGAATACGCTGTAGTGGATGTCCTGGATT
CTGCTGGATTTAAAATTCTTGAAAGAGGCACTATTGCTGTCAATGGTTTCTCTCA
GAAAAGGGGAGAGCGCCAAGAAGAAGCCCATCGTGAAAAACAAAGAAGGGGGATA
AGTGATATAGGGCGAAAGAAGCCTGTGCAGGCAGAAGTCGATGCGGCGAACGAAT
TGCATAGAAAGTACACTGATGTTGCCACAAGATTAGGTTGTAGAATCGTCGTTCA
ATGGGCACCACAACCTAAACCAGGGACAGCACCGACAGCGCAAACTGTTTACGCG
AGGGCTGTTAGGACAGAAGCTCCGAGGAGCGGCAACCAAGAAGATCATGCAAGAA
TGAAAAGTTCTTGGGGTTACACCTGGGGTACGTATTGGGAGAAACGAAAACCAGA
AGATATTTTAGGGATTTCTACACAGGTGTATTGGACAGGAGGTATAGGCGAATCC
TGTCCTGCTGTAGCAGTCGCTTTATTAGGTCATATTAGAGCAACTTCAACACAAA
CGGAGTGGGAAAAGGAAGAAGTTGTCTTTGGAAGACTGAAGAAGTTCTTTCCGAG
TTAA
SEQ ATGGAGAAGAGAATTAATAAGATACGGAAAAAATTATCTGCGGATAATGCAACAA
ID AGCCAGTCTCTCGTTCAGGCCCCATGAAAACCCTGCTTGTAAGAGTAATGACGGA
NO: TGATTTAAAAAAGAGGTTGGAAAAGCGTAGAAAAAAACCAGAAGTGATGCCGCAA
145 GTGATCTCAAATAACGCAGCTAATAATCTAAGGATGCTACTTGATGATTATACAA
AAATGAAAGAAGCAATCCTGCAAGTTTACTGGCAGGAATTCAAGGATGACCATGT
TGGACTAATGTGCAAATTCGCACAACCAGCGTCTAAGAAAATTGACCAAAATAAA
TTGAAACCCGAAATGGACGAAAAAGGGAATTTAACAACTGCCGGGTTTGCCTGCT
CGCAATGTGGGCAACCATTATTTGTTTATAAATTAGAGCAGGTTTCGGAAAAAGG
AAAGGCTTACACAAATTACTTCGGCAGATGTAATGTTGCCGAACACGAAAAACTC
ATATTGTTAGCTCAGTTGAAGCCTGAGAAAGACTCTGATGAGGCCGTTACTTACT
CGTTGGGGAAGTTTGGTCAAAGAGCTCTCGATTTTTATTCTATTCATGTGACAAA
GGAGTCCACACATCCCGTCAAGCCCTTGGCACAAATTGCGGGTAATAGATACGCT
TCGGGTCCAGTTGGGAAGGCCCTTTCTGATGCATGTATGGGCACAATTGCTAGCT
TTCTTAGTAAATACCAGGATATCATAATAGAGCATCAAAAAGTTGTAAAGGGTAA
CCAAAAGAGATTAGAATCGCTGCGTGAGTTGGCGGGTAAAGAAAACTTGGAATAT
CCATCTGTCACTCTGCCTCCTCAACCTCATACTAAGGAAGGTGTAGATGCGTACA
ATGAAGTTATCGCTAGAGTCCGTATGTGGGTGAATTTAAATTTGTGGCAAAAATT
GAAGTTATCGCGTGATGATGCAAAACCTCTTCTTAGACTAAAGGGCTTTCCTAGC
TTCCCTGTAGTGGAAAGACGCGAAAATGAAGTCGATTGGTGGAATACAATTAACG
AAGTCAAAAAACTGATCGATGCAAAGCGAGATATGGGTCGAGTTTTTTGGTCTGG
TGTTACAGCTGAAAAAAGGAATACGATCTTAGAAGGTTACAACTACTTGCCAAAT
GAGAACGATCATAAAAAAAGAGAAGGCAGTTTAGAAAATCCAAAAAAGCCAGCTA
AGAGACAATTTGGTGATTTGCTACTTTACCTAGAAAAAAAGTACGCCGGAGATTG
GGGGAAAGTCTTTGACGAAGCTTGGGAGAGAATAGATAAAAAAATAGCAGGATTG
ACGTCACACATTGAAAGAGAAGAGGCGAGAAATGCAGAAGATGCTCAGTCCAAAG
CTGTCCTCACCGACTGGTTGAGAGCCAAAGCGTCCTTTGTTCTCGAACGCCTAAA
AGAAATGGATGAGAAGGAATTTTATGCCTGCGAAATCCAGCTACAAAAATGGTAC
GGAGACTTGAGAGGTAACCCCTTTGCCGTGGAAGCAGAGAACCGTGTTGTAGATA
TCTCCGGTTTCTCAATCGGTAGCGATGGACACTCCATTCAGTATCGCAACTTGTT
GGCCTGGAAATATTTGGAAAACGGTAAGAGGGAATTCTATTTACTTATGAATTAT
GGCAAGAAAGGTAGAATCAGGTTTACTGACGGAACAGACATTAAAAAGAGTGGTA
AGTGGCAAGGCCTTTTGTACGGTGGTGGCAAGGCCAAAGTAATAGACTTAACATT
TGACCCCGACGACGAACAACTGATAATACTGCCTTTAGCTTTTGGTACTCGACAG
GGGCGAGAGTTCATTTGGAATGATCTTTTGTCACTCGAGACTGGTTTGATAAAAC
TTGCAAATGGAAGAGTCATCGAGAAGACAATTTACAACAAAAAGATAGGTCGCGA
TGAGCCTGCACTATTTGTGGCCTTGACCTTTGAGAGAAGGGAAGTTGTCGACCCA
TCCAATATTAAACCAGTCAACCTAATCGGTGTAGATAGAGGTGAAAACATCCCAG
CTGTTATCGCTCTGACAGACCCTGAAGGTTGCCCTTTGCCAGAATTTAAAGATTC
GTCTGGTGGACCAACAGATATATTACGTATTGGGGAAGGCTATAAAGAGAAACAA
CGTGCTATTCAGGCTGCAAAAGAAGTTGAACAGAGGAGAGCTGGAGGTTACAGTA
GAAAATTCGCCAGTAAAAGTAGAAACTTAGCAGATGACATGGTTAGAAACTCTGC
CCGGGATTTGTTCTATCATGCGGTTACTCACGATGCAGTCTTAGTCTTTGAAAAT
CTATCGCGCGGTTTTGGTAGGCAAGGCAAGAGGACTTTTATGACAGAGAGACAAT
ATACAAAAATGGAAGATTGGTTAACCGCGAAGCTCGCATATGAAGGTCTTACTTC
GAAAACGTACCTCAGCAAAACGCTGGCTCAATATACTTCTAAAACTTGTTCAAAT
TGTGGTTTTACTATTACCACGGCAGACTACGACGGGATGTTGGTGAGATTGAAGA
AGACGAGCGATGGTTGGGCAACAACATTGAATAATAAGGAATTAAAAGCAGAAGG
ACAGATTACGTATTACAATCGTTATAAACGCCAAACGGTTGAGAAAGAGTTGTCA
GCCGAGTTGGATAGACTAAGTGAAGAGAGCGGTAACAATGATATCTCAAAGTGGA
CTAAAGGGAGGCGGGATGAAGCCCTCTTTTTACTAAAGAAGAGATTCTCACATAG
ACCTGTGCAAGAACAATTCGTTTGTTTAGATTGTGGCCATGAGGTTCATGCAGAC
GAACAGGCTGCGTTAAATATTGCGAGAAGCTGGCTATTTCTAAATTCTAATTCAA
CAGAGTTCAAGAGCTATAAATCCGGAAAACAACCTTTCGTAGGCGCGTGGCAAGC
CTTCTATAAAAGGAGATTAAAAGAGGTTTGGAAACCAAATGCA
SEQ ATGAAAAGAATTAACAAAATTAGAAGGAGGCTGGTCAAAGATTCTAATACCAAGA
ID AAGCTGGTAAGACTGGTCCGATGAAAACCCTATTAGTCAGAGTTATGACCCCAGA
NO: TTTGAGAGAAAGATTGGAGAACCTCAGGAAAAAGCCCGAAAACATCCCACAACCC
146 ATTAGTAACACATCAAGAGCTAATTTAAACAAGTTATTAACTGACTACACTGAAA
TGAAAAAAGCAATATTGCATGTTTACTGGGAAGAGTTCCAGAAAGATCCTGTTGG
GTTGATGTCTAGAGTTGCTCAACCGGCCCCAAAGAATATAGATCAAAGGAAACTT
ATTCCTGTGAAGGACGGCAATGAAAGATTAACCAGCTCCGGTTTCGCTTGCTCCC
AGTGCTGCCAACCCCTGTATGTATACAAACTGGAACAAGTAAATGATAAAGGTAA
GCCACATACTAACTACTTTGGTAGGTGTAATGTATCCGAGCATGAAAGATTGATC
TTGTTAAGTCCCCATAAACCAGAAGCTAATGATGAGTTAGTAACTTATAGTTTAG
GTAAGTTCGGACAACGAGCTTTAGATTTCTATAGCATCCATGTTACAAGAGAAAG
CAATCACCCCGTCAAACCACTGGAACAAATCGGTGGTAATAGTTGTGCGTCAGGT
CCAGTAGGCAAAGCTTTATCAGACGCTTGCATGGGTGCCGTGGCTAGTTTTTTGA
CGAAATACCAAGATATTATACTGGAACATCAAAAGGTAATTAAAAAGAATGAAAA
GAGACTCGCTAACTTAAAAGATATTGCAAGTGCCAATGGTTTAGCTTTTCCTAAA
ATTACCTTGCCACCTCAGCCACATACAAAGGAGGGAATTGAAGCTTACAATAATG
TAGTAGCCCAAATAGTTATTTGGGTGAACCTTAACCTATGGCAAAAGTTAAAAAT
TGGTAGAGACGAAGCCAAACCCCTGCAGAGGCTGAAGGGTTTTCCCTCCTTCCCC
TTAGTAGAGAGACAAGCTAATGAAGTGGACTGGTGGGATATGGTGTGCAATGTTA
AAAAATTGATTAATGAGAAGAAAGAGGATGGTAAAGTGTTTTGGCAGAATCTTGC
TGGCTACAAGAGACAGGAAGCTTTACTGCCTTATTTATCTTCTGAGGAAGATAGG
AAAAAAGGTAAAAAATTTGCTAGATATCAATTCGGAGACCTACTTCTGCATTTAG
AAAAAAAACATGGCGAAGATTGGGGTAAAGTTTATGATGAAGCCTGGGAAAGAAT
TGATAAGAAGGTAGAAGGTCTCTCCAAACATATTAAATTAGAGGAAGAACGTAGG
TCCGAAGACGCTCAATCAAAGGCAGCATTAACTGATTGGTTGAGAGCAAAAGCCT
CTTTCGTTATTGAAGGATTAAAAGAAGCCGACAAAGATGAATTTTGTAGATGTGA
GTTAAAGTTGCAAAAGTGGTATGGAGACCTCCGTGGTAAACCTTTTGCTATTGAG
GCTGAAAATTCTATACTCGATATCTCTGGATTTTCAAAACAATATAACTGCGCAT
TTATATGGCAGAAAGATGGTGTTAAAAAGCTAAATCTATACTTAATTATCAATTA
CTTTAAAGGTGGTAAATTGCGTTTTAAGAAGATAAAGCCTGAAGCCTTTGAGGCA
AACCGTTTTTACACTGTTATCAATAAAAAATCTGGGGAAATCGTACCAATGGAAG
TTAATTTCAATTTCGATGATCCTAATCTTATTATTTTACCTCTTGCTTTCGGCAA
AAGGCAAGGTAGGGAGTTTATTTGGAATGATTTATTGTCGCTGGAAACGGGGTCT
CTCAAACTCGCAAACGGTAGGGTGATAGAAAAAACATTATACAACAGGAGAACTC
GGCAGGATGAGCCAGCTCTTTTTGTGGCTCTGACATTCGAGAGAAGGGAAGTTTT
AGATTCATCTAACATCAAACCAATGAATTTAATAGGTATTGACCGGGGTGAAAAT
ATACCTGCAGTTATTGCTTTAACTGATCCTGAGGGATGTCCTCTTAGCAGATTCA
AGGACTCGTTGGGTAACCCTACTCACATCTTAAGGATTGGAGAAAGTTACAAGGA
GAAACAAAGGACAATACAAGCTGCTAAAGAAGTAGAACAAAGGAGGGGGGTGGAT
ATAGTCGGAAATATGCCAGCAAGGCCAAGAATTTAGCTGACGACATGGTTAGGAA
TACAGCTAGAGACCTTTTATACTATGCCGTCACCCAGGATGCCATGTTGATATTT
GAAAATTTAAGTAGAGGCTTCGGTAGACAAGGTAAGCGCACCTTCATGGCAGAGA
GACAATATACTAGAATGGAAGATTGGTTGACTGCCAAATTGGCATACGAAGGTCT
ACCTAGTAAGACGTACTTATCTAAAACACTAGCGCAGTATACTTCCAAGACATGC
AGTAATTGTGGTTTCACAATCACTTCTGCCGATTACGATCGCGTCTTGGAAAAAC
TAAAAAAAACAGCGACAGGTTGGATGACTACTATTAATGGGAAAGAATTGAAGGT
CGAAGGACAAATAACTTACTATAATAGATATAAACGGCAAAACGTTGTAAAAGAC
CTGTCAGTCGAACTCGATCGACTTAGTGAAGAATCTGTTAATAATGATATTAGTT
CGTGGACAAAAGGTAGATCCGGTGAAGCTTTGAGCCTCCTGAAAAAACGTTTTAG
CCATAGGCCTGTCCAAGAAAAGTTTGTATGTTTAAACTGTGGTTTTGAGACCCAT
GCAGACGAGCAGGCCGCTCTTAATATTGCTAGATCATGGTTATTTTTAAGATCTC
AGGAATACAAGAAGTACCAGACTAACAAGACAACAGGCAACACAGATAAGCGAGC
ATTCGTTGAGACTTGGCAATCTTTTTATAGAAAGAAATTGAAGGAAGTCTGGAAA
CCA
SEQ ATGGGAAAAATGTATTATCTAGGCCTGGACATAGGGACCAATTCAGTAGGCTACG
ID CTGTCACTGACCCCTCCTACCATTTGCTGAAGTTCAAGGGGGAACCCATGTGGGG
NO: AGCACACGTGTTTGCGGCCGGCAACCAGAGCGCAGAGCGGAGAAGCTTCCGCACC
147 TCCAGGAGAAGGCTGGATCGCAGGCAGCAGCGTGTGAAGCTGGTCCAAGAGATAT
TTGCCCCAGTGATTTCCCCCATCGATCCGCGCTTCTTTATTAGGCTCCACGAGTC
CGCTCTCTGGCGCGACGACGTGGCCGAAACTGATAAACATATTTTCTTTAATGAC
CCAACATACACTGACAAGGAGTACTATTCAGATTACCCAACAATTCACCATTTGA
TCGTGGACCTTATGGAAAGTTCGGAGAAGCATGATCCTCGACTTGTCTATTTGGC
CGTGGCGTGGCTCGTGGCACATAGGGGCCACTTCTTGAACGAGGTGGACAAGGAT
AACATCGGGGATGTGTTATCTTTCGACGCTTTCTATCCTGAATTCCTTGCTTTTC
TGTCTGACAATGGCGTCAGCCCGTGGGTCTGCGAATCCAAGGCCCTCCAGGCTAC
GCTATTGTCAAGAAATAGCGTGAACGACAAGTACAAGGCTCTTAAGTCTTTGATT
TTTGGAAGCCAGAAGCCCGAGGACAACTTTGATGCAAATATCTCGGAGGACGGGC
TGATTCAGCTCCTCGCTGGGAAAAAGGTCAAGGTCAATAAGCTGTTTCCACAGGA
GTCAAATGACGCGAGCTTCACCCTTAACGACAAAGAGGATGCCATTGAAGAGATC
CTGGGGACACTCACCCCAGACGAGTGCGAGTGGATAGCCCATATTAGGCGCCTCT
TTGATTGGGCCATAATGAAACATGCGCTTAAGGACGGGCGCACGATATCCGAAAG
CAAGGTCAAATTGTACGAGCAGCACCACCATGATCTGACCCAGCTAAAATATTTT
GTAAAAACATATCTGGCCAAGGAGTACGATGATATCTTCCGCAACGTGGATAGTG
AGACCACCAAAAACTACGTCGCGTACTCATACCACGTGAAAGAAGTTAAGGGCAC
GCTGCCTAAGAACAAGGCAACACAAGAGGAGTTCTGCAAGTACGTTCTCGGGAAA
GTTAAAAATATAGAGTGCAGCGAGGCCGACAAAGTGGATTTTGACGAGATGATTC
AACGCCTGACCGACAATTCGTTTATGCCTAAACAGGTGAGTGGAGAGAATCGCGT
GATTCCATATCAGCTCTATTACTATGAACTCAAGACTATTCTGAATAAGGCCGCT
AGCTATTTACCCTTCCTTACGCAGTGCGGGAAGGATGCCATTTCTAACCAGGATA
AACTCTTGAGTATAATGACATTTCGAATTCCCTATTTCGTGGGTCCGCTTCGTAA
GGATAACAGTGAGCACGCTTGGCTGGAGCGGAAGGCTGGCAAAATTTATCCATGG
AATTTCAACGACAAGGTGGATCTGGACAAATCCGAAGAAGCCTTTATCCGCAGGA
TGACCAATACTTGCACATACTATCCTGGGGAGGATGTCCTTCCACTGGACTCTCT
GATCTACGAAAAGTTCATGATTTTGAATGAAATTAACAACATAAGGATCGATGGG
TATCCTATTTCCGTCGACGTGAAGCAGCAGGTGTTCGGGCTCTTTGAGAAGAAGC
GACGGGTGACCGTGAAGGATATTCAGAATCTTCTCTTATCGCTGGGAGCCCTGGA
TAAACACGGAAAACTGACCGGGATAGATACTACGATTCATTCTAATTACAACACG
TATCACCATTTTAAGTCACTGATGGAGAGGGGCGTCCTAACAAGAGATGACGTGG
AGAGAATAGTGGAACGAATGACATATTCTGATGACACCAAGAGAGTGCGGCTTTG
GCTGAATAACAACTACGGCACTCTGACGGCGGATGATGTAAAGCATATTTCCCGA
CTCCGTAAGCATGACTTCGGGCGGCTGTCTAAGATGTTTCTAACAGGCCTCAAGG
GTGTGCATAAGGAAACTGGGGAGCGCGCTAGCATCCTGGATTTTATGTGGAACAC
CAATGATAACCTGATGCAGCTCCTGTCAGAATGCTACACATTTTCGGACGAAATC
ACCAAGCTGCAGGAGGCTTACTATGCCAAGGCCCAACTAAGCTTGAATGATTTCC
TGGATTCTATGTACATCAGCAACGCCGTAAAACGACCAATTTATAGGACACTGGC
AGTGGTTAACGACATTAGGAAAGCATGCGGAACAGCTCCCAAGCGAATCTTTATC
GAGATGGCCCGCGACGGCGAGAGTAAGAAGAAAAGGTCAGTGACTAGGCGGGAGC
AGATCAAGAACCTTTACCGCTCTATCCGAAAAGACTTCCAGCAAGAGGTTGATTT
CCTTGAGAAGATCTTAGAGAACAAGTCAGATGGACAGCTCCAATCCGATGCTCTG
TATCTGTACTTCGCTCAGCTGGGACGAGATATGTACACTGGCGACCCCATTAAAC
TAGAACATATCAAGGACCAATCGTTTTATAATATCGACCACATCTACCCTCAGTC
CATGGTGAAAGACGATAGTCTGGACAATAAGGTGCTCGTCCAAAGTGAGATTAAC
GGAGAAAAGTCGAGCAGATATCCTTTGGACGCTGCGATCCGCAACAAGATGAAGC
CCCTGTGGGATGCTTACTACAATCATGGACTGATCAGCCTGAAGAAGTATCAGAG
ACTGACCCGGAGTACCCCTTTCACAGACGATGAGAAGTGGGATTTTATCAATAGA
CAACTGGTGGAAACCAGGCAGTCCACGAAAGCTCTGGCCATTCTTCTGAAGAGAA
AGTTTCCAGACACAGAGATCGTCTATTCAAAGGCCGGCCTCAGTTCCGACTTTAG
ACATGAGTTCGGACTCGTTAAATCACGAAATATAAACGATCTCCACCATGCAAAG
GACGCATTCCTCGCGATTGTGACTGGAAATGTCTATCACGAAAGATTTAATAGGC
GGTGGTTCATGGTTAACCAGCCATACTCAGTGAAGACCAAGACCCTTTTCACTCA
CTCTATTAAAAATGGCAACTTCGTGGCTTGGAATGGTGAGGAGGATCTTGGAAGA
ATTGTGAAGATGTTAAAACAGAATAAGAATACCATCCACTTTACTAGATTCAGCT
TTGACCGAAAAGAGGGGCTATTCGATATTCAACCGTTAAAGGCTTCAACAGGTCT
CGTTCCACGAAAGGCCGGACTGGACGTAGTGAAATACGGCGGCTATGATAAGAGC
ACCGCAGCTTACTACCTCCTTGTGCGATTTACGCTCGAGGATAAGAAGACCCAAC
ACAAGCTGATGATGATTCCCGTGGAGGGACTGTACAAAGCTCGAATTGACCATGA
TAAAGAGTTTCTCACAGATTACGCACAAACCACCATCTCTGAGATTCTCCAGAAA
GACAAACAAAAAGTTATAAACATAATGTTTCCAATGGGTACAAGGCATATTAAAC
TGAACAGCATGATCTCCATTGATGGCTTTTATTTGTCCATTGGAGGAAAGTCTAG
TAAAGGCAAGTCTGTCCTCTGCCATGCCATGGTACCCCTAATCGTCCCACACAAG
ATTGAATGCTACATCAAGGCTATGGAGAGTTTTGCTCGGAAATTTAAAGAGAATA
ATAAGCTGCGTATTGTGGAAAAATTCGACAAGATAACCGTTGAAGACAATCTGAA
TCTGTACGAGCTCTTTCTGCAGAAGCTGCAGCATAACCCCTATAATAAGTTCTTC
TCCACACAGTTCGATGTACTGACCAACGGGCGATCAACTTTCACAAAGCTAAGTC
CTGAGGAACAGGTGCAAACACTCCTAAACATTCTTTCCATTTTTAAGACCTGCAG
ATCTTCAGGATGCGACTTGAAGAGCATTAACGGGAGCGCACAGGCAGCTAGGATC
ATGATCTCAGCTGACCTGACAGGGCTGAGTAAAAAATACTCCGACATTCGGCTTG
TAGAGCAAAGCGCCAGTGGGTTGTTCGTTAGTAAGTCGCAGAACCTGCTGGAATA
CCTGTAA
SEQ ATGTCTTCTTTGACGAAGTTTACAAACAAATACTCTAAGCAGCTTACAATTAAGA
ID ACGAACTGATTCCCGTAGGAAAGACTCTGGAAAACATCAAAGAGAATGGGCTGAT
NO: AGACGGCGACGAACAACTGAATGAGAACTATCAGAAGGCCAAAATTATCGTGGAT
148 GACTTCCTGAGGGATTTTATTAACAAGGCCCTGAATAATACCCAGATCGGCAATT
GGCGGGAACTGGCCGACGCTCTGAACAAAGAAGATGAGGACAATATCGAAAAATT
ACAAGACAAAATCAGGGGCATTATTGTCAGTAAGTTCGAGACATTCGATCTGTTC
TCTTCGTACTCCATTAAGAAGGACGAGAAAATCATCGATGATGACAATGACGTTG
AGGAAGAAGAACTGGACTTGGGTAAAAAGACCTCATCCTTCAAGTATATTTTTAA
AAAAAATCTGTTTAAATTAGTGCTCCCCAGTTATTTAAAGACAACTAACCAGGAC
AAGCTTAAGATTATCTCCTCTTTTGACAACTTTAGCACCTATTTTAGAGGCTTCT
TTGAAAATCGCAAGAATATTTTCACTAAGAAGCCCATAAGCACCTCTATTGCCTA
CAGAATCGTACATGATAACTTCCCAAAATTTTTGGATAACATTAGATGTTTTAAT
GTATGGCAGACCGAATGTCCTCAGTTAATTGTGAAGGCGGATAACTACCTCAAAT
CCAAGAATGTGATCGCCAAAGATAAGTCTCTTGCTAACTACTTTACGGTCGGAGC
CTACGATTACTTCTTATCTCAAAACGGTATTGACTTTTACAATAACATTATCGGG
GGATTGCCTGCCTTCGCCGGCCATGAGAAAATTCAGGGCTTAAACGAGTTCATAA
ATCAGGAATGTCAAAAGGACTCAGAGCTGAAATCAAAGCTTAAGAATCGACACGC
ATTTAAAATGGCGGTCTTGTTCAAACAGATCCTCAGCGATAGAGAGAAAAGCTTC
GTTATTGATGAATTCGAGAGCGACGCACAGGTGATTGATGCCGTGAAGAACTTCT
ATGCGGAACAGTGTAAAGACAATAATGTTATTTTCAACCTATTAAACTTGATTAA
GAATATCGCGTTTTTAAGTGACGATGAACTCGACGGTATCTTTATAGAAGGCAAG
TACCTGTCCTCTGTCAGCCAAAAACTCTACTCAGATTGGTCCAAGCTAAGAAATG
ACATCGAGGACAGTGCTAACAGCAAACAGGGCAATAAAGAGCTGGCAAAGAAAAT
CAAGACTAATAAAGGGGATGTGGAGAAGGCGATATCTAAATATGAGTTCTCCCTC
TCCGAACTGAACTCCATCGTCCACGATAATACCAAGTTTAGTGATCTGTTGTCGT
GTACACTGCACAAAGTGGCCAGTGAAAAACTCGTCAAGGTGAACGAAGGCGATTG
GCCCAAACACCTGAAAAATAATGAGGAGAAACAGAAGATCAAAGAACCTTTGGAT
GCGTTGCTCGAAATATATAACACACTGTTGATCTTCAACTGTAAAAGCTTCAACA
AGAACGGGAACTTTTATGTAGACTACGATCGATGTATAAATGAACTGAGCAGCGT
CGTTTACCTGTACAACAAGACTCGCAATTATTGTACGAAAAAACCATATAACACC
GATAAGTTCAAGCTTAATTTCAACAGTCCCCAGCTGGGAGAAGGGTTCAGCAAAT
CAAAAGAAAACGATTGCCTGACATTACTCTTTAAAAAGGATGATAATTATTATGT
TGGGATTATTAGGAAAGGCGCTAAGATCAACTTTGACGACACACAGGCCATAGCT
GACAACACTGATAACTGCATCTTTAAAATGAATTACTTTCTGTTGAAGGACGCCA
AAAAATTCATTCCAAAATGCTCTATTCAGCTCAAGGAGGTTAAGGCCCATTTCAA
GAAGTCTGAAGATGACTACATCCTCTCTGACAAGGAAAAATTCGCTAGTCCTCTG
GTTATCAAAAAAAGTACCTTCTTGCTGGCTACAGCTCACGTGAAAGGCAAGAAAG
GGAACATTAAGAAGTTCCAAAAGGAATACAGCAAAGAGAATCCAACCGAGTACAG
AAATTCTCTGAACGAATGGATCGCATTCTGTAAAGAATTTCTAAAGACGTACAAG
GCCGCTACCATTTTCGATATTACCACCTTGAAAAAAGCCGAGGAGTACGCCGACA
TCGTCGAATTCTATAAAGACGTGGATAACCTGTGTTACAAATTGGAATTCTGCCC
AATTAAGACCTCTTTCATTGAAAACCTCATCGACAATGGGGACCTCTACTTATTT
AGAATTAACAATAAGGATTTTTCTTCGAAATCTACCGGAACTAAAAATCTGCACA
CACTGTATCTGCAAGCAATCTTCGATGAACGTAATCTCAACAACCCTACAATAAT
GCTGAACGGCGGTGCTGAACTGTTCTACCGTAAAGAGAGTATTGAACAGAAGAAT
CGAATCACACACAAAGCGGGCAGTATTCTCGTCAATAAGGTGTGCAAAGACGGGA
CCAGCCTGGACGATAAGATCAGGAATGAAATATATCAGTATGAGAACAAGTTTAT
CGACACCTTGTCGGATGAGGCAAAGAAGGTGCTACCTAACGTTATCAAGAAGGAA
GCTACCCATGACATAACCAAGGATAAGCGGTTCACTTCTGACAAGTTCTTCTTCC
ACTGTCCTCTGACCATTAACTACAAGGAAGGAGACACTAAACAATTCAATAATGA
AGTACTTAGCTTTTTGCGGGGTAATCCCGATATTAACATAATTGGTATCGACCGG
GGAGAACGGAACCTGATATACGTGACAGTAATTAATCAGAAAGGAGAAATCCTGG
ATTCCGTATCCTTCAATACCGTGACTAATAAATCTAGTAAAATCGAGCAGACGGT
CGACTACGAGGAAAAGTTAGCAGTCAGAGAGAAGGAGAGAATCGAGGCCAAACGT
TCCTGGGATAGTATCAGCAAGATTGCTACTCTGAAAGAAGGATATCTGTCCGCTA
TCGTCCATGAGATCTGTTTGTTGATGATCAAGCACAATGCTATAGTGGTTCTGGA
GAACCTGAACGCAGGCTTCAAGCGAATTAGAGGGGGCCTGTCGGAAAAAAGCGTT
TACCAGAAGTTTGAAAAGATGCTAATCAATAAGTTAAATTACTTTGTAAGTAAAA
AAGAAAGCGATTGGAATAAGCCATCAGGACTTTTAAACGGGCTGCAACTGAGCGA
CCAGTTTGAGTCATTCGAAAAACTGGGTATTCAGAGTGGTTTCATATTCTACGTA
CCTGCCGCTTACACTTCAAAGATCGATCCTACAACTGGTTTTGCGAATGTCCTGA
ATCTGTCTAAGGTGAGGAATGTGGACGCAATCAAGTCTTTCTTCAGCAACTTCAA
CGAGATATCTTACAGCAAGAAAGAGGCTCTGTTTAAATTCAGTTTTGATCTGGAT
AGCCTGAGCAAGAAAGGATTCTCTTCTTTCGTAAAGTTTTCTAAGTCCAAATGGA
ACGTCTACACGTTCGGAGAGAGAATCATTAAACCAAAGAACAAGCAGGGGTATCG
GGAAGACAAAAGGATCAATCTGACTTTCGAAATGAAGAAACTATTGAATGAGTAC
AAAGTCTCATTCGATTTGGAGAACAATCTGATCCCCAATCTGACCAGCGCTAACC
TCAAAGACACATTCTGGAAGGAGCTGTTTTTCATCTTTAAGACCACCCTGCAGCT
ACGGAATAGTGTCACAAATGGGAAAGAGGATGTACTGATCTCACCTGTGAAAAAC
GCCAAGGGGGAGTTCTTTGTGTCCGGCACCCATAACAAAACCCTGCCTCAGGACT
GTGACGCGAACGGGGCCTACCACATCGCGCTAAAGGGGTTAATGATTCTCGAACG
TAATAATCTGGTGCGCGAAGAAAAAGACACAAAGAAAATTATGGCCATCAGCAAC
GTTGACTGGTTTGAGTACGTGCAGAAGCGTCGAGGAGTTTTGTAA
SEQ ATGAACAACTATGACGAGTTCACTAAACTTTACCCCATTCAGAAAACCATCAGAT
ID TTGAACTGAAGCCTCAGGGTCGTACCATGGAACACTTGGAAACTTTCAACTTTTT
NO: CGAGGAGGACAGGGATAGAGCTGAGAAATACAAGATCTTGAAAGAGGCCATCGAC
149 GAGTATCACAAAAAATTCATCGATGAGCATCTCACCAACATGTCGCTGGATTGGA
ACAGTCTCAAGCAGATTTCCGAGAAGTACTATAAATCTCGGGAGGAGAAAGATAA
AAAGGTGTTTTTGAGCGAGCAAAAGCGAATGCGACAGGAGATAGTCTCTGAATTT
AAGAAAGATGATCGGTTTAAAGACCTATTTTCCAAAAAGCTTTTTTCAGAGCTGC
TGAAGGAAGAGATCTATAAAAAAGGCAATCACCAAGAAATTGATGCCCTGAAATC
ATTCGACAAATTCAGTGGGTATTTCATAGGACTGCATGAGAACCGGAAGAATATG
TATAGTGATGGAGACGAGATCACAGCCATAAGCAATCGAATCGTTAACGAGAATT
TCCCGAAGTTCCTGGATAACCTGCAGAAGTATCAAGAGGCTAGGAAAAAGTACCC
TGAGTGGATCATCAAGGCTGAATCAGCTCTGGTGGCTCACAATATCAAGATGGAT
GAAGTCTTTAGTCTTGAGTACTTTAATAAAGTCCTTAACCAGGAGGGCATCCAGC
GCTATAACCTGGCTCTCGGTGGCTACGTCACAAAAAGCGGAGAAAAGATGATGGG
TCTCAACGATGCACTGAATTTGGCTCATCAGTCGGAGAAGTCATCTAAGGGACGC
ATACACATGACACCACTGTTTAAACAAATCCTGAGCGAAAAGGAATCATTTTCCT
ACATTCCCGACGTATTCACCGAGGACTCACAACTGCTGCCTAGTATAGGGGGGTT
TTTCGCTCAGATAGAGAACGACAAAGATGGCAACATTTTTGACAGAGCCTTGGAG
TTGATTTCATCTTACGCCGAGTACGATACGGAGCGCATTTATATTCGCCAGGCGG
ATATCAACAGGGTTTCCAATGTGATCTTTGGCGAGTGGGGAACGCTGGGCGGGCT
GATGCGGGAATACAAAGCCGACTCGATCAATGACATCAACCTGGAGAGAACATGC
AAGAAGGTCGATAAATGGTTGGATAGCAAAGAGTTCGCCCTGAGTGACGTCTTGG
AAGCTATCAAAAGAACCGGAAATAATGACGCGTTCAACGAGTATATCTCTAAAAT
GAGGACCGCGAGAGAAAAAATTGATGCAGCAAGGAAGGAGATGAAGTTTATATCT
GAGAAGATCTCAGGCGATGAAGAGTCCATCCATATTATTAAAACTCTTCTGGACT
CAGTGCAGCAATTCCTGCACTTTTTTAACCTCTTCAAGGCCAGGCAGGATATACC
GTTAGACGGGGCTTTTTATGCCGAGTTTGATGAAGTTCATTCGAAACTTTTTGCT
ATAGTGCCTCTCTATAATAAAGTTCGCAATTACCTGACAAAGAATAACTTAAACA
CAAAGAAAATCAAGCTCAACTTCAAAAACCCAACACTGGCAAACGGATGGGATCA
GAACAAGGTATATGATTACGCCTCATTGATTTTCCTCCGGGACGGGAATTACTAT
CTGGGGATCATCAACCCTAAGCGCAAAAAGAACATTAAGTTCGAACAGGGATCTG
GCAATGGTCCCTTCTATAGGAAAATGGTATACAAACAGATTCCTGGCCCCAACAA
GAATCTCCCACGCGTCTTTCTGACGTCCACTAAGGGAAAGAAGGAGTACAAGCCG
TCTAAAGAAATTATCGAGGGCTATGAGGCAGACAAGCATATTAGGGGTGACAAGT
TTGACCTAGACTTTTGTCATAAGCTTATCGACTTTTTCAAGGAGTCCATAGAGAA
GCACAAAGATTGGTCAAAGTTTAATTTCTATTTTTCTCCAACAGAGTCCTACGGG
GATATCTCTGAGTTCTATCTGGATGTTGAAAAGCAGGGGTACAGAATGCACTTCG
AAAATATCTCAGCAGAAACTATCGATGAGTACGTAGAGAAAGGAGATCTGTTTCT
TTTCCAAATCTACAATAAGGATTTTGTGAAGGCCGCCACTGGGAAGAAGGACATG
CACACTATTTACTGGAACGCTGCATTTTCCCCTGAAAATCTGCAGGACGTAGTAG
TGAAATTAAATGGTGAGGCAGAACTGTTTTACCGCGATAAATCAGACATCAAGGA
AATAGTGCACCGGGAAGGCGAGATTCTTGTTAACCGAACATATAATGGCAGGACA
CCTGTCCCTGATAAAATTCATAAGAAACTGACCGATTACCACAACGGTCGAACCA
AGGATCTGGGCGAGGCCAAGGAATACCTCGATAAGGTGAGGTACTTCAAAGCCCA
TTATGACATCACCAAGGACCGAAGATACCTTAACGACAAAATCTACTTCCATGTC
CCACTCACCTTGAACTTCAAAGCTAACGGTAAGAAGAACCTCAATAAAATGGTGA
TTGAAAAATTTCTGTCCGATGAGAAGGCCCATATCATCGGCATTGATCGCGGCGA
GAGAAATCTCCTTTACTATTCTATCATTGATCGGTCGGGAAAGATTATCGACCAA
CAATCACTGAATGTCATCGACGGATTCGACTATAGAGAGAAGCTGAACCAACGGG
AAATCGAGATGAAGGACGCGCGCCAGTCCTGGAACGCTATCGGCAAAATTAAAGA
TTTGAAAGAAGGTTACCTCTCCAAAGCAGTGCACGAAATTACCAAAATGGCAATC
CAGTACAATGCTATTGTGGTAATGGAGGAGTTAAATTACGGATTTAAGCGCGGGA
GGTTCAAGGTTGAAAAGCAAATTTACCAAAAATTTGAGAACATGTTGATTGATAA
GATGAACTACCTGGTGTTCAAGGACGCACCTGACGAGTCGCCAGGCGGCGTGTTA
AATGCATATCAGCTGACAAATCCACTGGAGAGCTTTGCCAAGCTAGGAAAGCAGA
CTGGCATTCTCTTTTACGTCCCTGCAGCGTATACATCCAAAATTGACCCCACCAC
TGGCTTCGTCAATCTGTTTAACACCTCCTCCAAAACCAACGCACAAGAACGGAAA
GAATTTTTGCAAAAGTTTGAGTCCATTAGCTACTCTGCCAAAGACGGCGGGATCT
TTGCTTTCGCATTCGACTACAGGAAATTCGGGACGAGTAAGACAGACCACAAGAA
CGTCTGGACCGCGTACACTAATGGGGAACGCATGCGCTACATCAAAGAGAAAAAG
AGGAATGAACTTTTTGACCCTTCAAAGGAAATCAAGGAAGCTCTCACCTCAAGCG
GTATCAAATACGATGGCGGGCAGAATATTTTGCCAGATATCCTCAGATCGAACAA
TAATGGACTTATCTATACTATGTACTCCTCCTTCATTGCAGCAATTCAAATGAGA
GTGTACGATGGAAAGGAGGATTACATTATATCGCCAATTAAGAACTCCAAAGGCG
AATTCTTCCGCACGGATCCTAAGCGAAGAGAACTCCCAATCGACGCTGATGCGAA
CGGCGCCTATAATATAGCCCTGCGGGGTGAATTAACAATGCGCGCTATTGCCGAG
AAGTTCGACCCCGATTCAGAAAAAATGGCTAAGCTTGAGCTGAAACACAAAGATT
GGTTCGAATTCATGCAGACAAGAGGCGACTAA
SEQ ATGACTAAGACCTTCGATTCCGAGTTCTTCAACCTTTATTCCCTGCAGAAAACT
ID GTAAGGTTTGAGCTGAAGCCGGTGGGCGAGACAGCCAGCTTCGTAGAGGATTTCA
NO: AGAATGAGGGTCTCAAACGGGTAGTTAGTGAGGATGAGAGGAGAGCAGTGGACTA
150 TCAGAAGGTGAAAGAGATCATCGATGACTATCACCGGGATTTCATAGAGGAGTCG
TTGAATTACTTCCCTGAGCAAGTATCCAAAGACGCGCTGGAACAGGCCTTTCATC
TTTACCAGAAACTGAAGGCAGCGAAGGTTGAGGAGCGGGAAAAGGCCTTGAAAGA
GTGGGAAGCCCTGCAGAAAAAGCTCAGAGAAAAGGTTGTCAAATGCTTCAGCGAC
AGCAACAAAGCCAGGTTCAGTAGGATCGATAAGAAAGAACTGATCAAAGAAGACT
TGATCAATTGGCTGGTTGCACAGAACCGGGAAGATGATATTCCCACCGTAGAGAC
CTTCAACAACTTCACAACTTACTTCACCGGCTTCCATGAGAATCGTAAAAACATC
TACAGTAAAGATGATCATGCAACCGCCATCTCCTTCCGGTTGATCCACGAGAATC
TCCCCAAGTTCTTTGACAACGTGATAAGTTTCAATAAGTTGAAAGAGGGATTTCC
CGAACTCAAGTTCGATAAAGTGAAGGAGGATCTGGAAGTGGATTATGACCTTAAG
CACGCTTTCGAGATAGAGTACTTCGTGAACTTTGTGACTCAGGCCGGCATCGATC
AGTATAACTACCTCCTCGGGGGTAAGACGCTCGAGGACGGTACTAAGAAGCAAGG
AATGAATGAGCAAATTAATCTATTTAAACAGCAGCAGACCAGGGATAAGGCTAGA
CAGATCCCCAAGCTTATTCCTCTTTTTAAACAGATCCTAAGTGAAAGGACAGAAA
GTCAAAGCTTCATACCTAAGCAATTTGAAAGTGATCAGGAGCTGTTTGACTCCCT
GCAAAAGCTGCACAACAATTGCCAGGACAAGTTTACCGTGCTGCAGCAGGCTATC
CTCGGACTGGCTGAGGCGGATCTTAAGAAGGTATTCATTAAGACTAGCGACCTCA
ATGCCCTTAGTAACACCATCTTTGGAAATTACTCCGTTTTCAGCGATGCCCTCAA
TCTATACAAAGAGAGCTTGAAGACTAAAAAAGCTCAGGAAGCTTTTGAAAAATTA
CCGGCACATTCTATACACGACCTTATACAATACTTAGAGCAGTTCAACAGCAGCC
TCGACGCTGAGAAACAGCAATCCACAGACACCGTCCTGAATTACTTCATCAAAAC
CGATGAACTGTACTCCCGATTTATCAAGAGCACTTCAGAAGCCTTCACGCAAGTT
CAGCCTCTGTTCGAGCTGGAGGCACTGTCCAGCAAGAGACGACCGCCAGAGTCTG
AAGACGAGGGAGCCAAGGGTCAAGAGGGGTTTGAACAGATAAAGCGAATTAAGGC
TTACTTGGATACTCTCATGGAGGCGGTGCATTTCGCTAAGCCTTTGTACCTGGTT
AAAGGCCGAAAAATGATTGAGGGGCTAGATAAGGATCAGTCTTTTTACGAGGCTT
TTGAAATGGCCTACCAGGAATTGGAATCCTTGATCATTCCAATCTATAATAAAGC
CCGGAGTTATCTGAGCAGGAAGCCCTTCAAAGCCGACAAGTTCAAAATAAATTTT
GACAATAATACGCTACTGTCTGGTTGGGACGCTAACAAGGAAACAGCCAATGCTT
CCATCCTGTTTAAGAAAGACGGCCTGTACTACCTGGGAATTATGCCAAAAGGCAA
AACTTTTTTGTTCGATTACTTTGTGTCATCAGAGGATAGCGAGAAGTTAAAGCAA
AGACGGCAGAAGACCGCCGAAGAAGCCCTCGCACAAGACGGAGAATCATATTTCG
AGAAAATTCGATATAAGCTCCTGCCTGGCGCATCAAAGATGTTGCCAAAAGTCTT
CTTTTCCAACAAAAACATCGGCTTTTATAACCCCAGCGATGATATCCTTCGCATC
CGGAACACCGCCTCACATACCAAAAATGGAACTCCACAGAAGGGCCACTCGAAGG
TTGAATTCAACCTTAACGATTGTCACAAAATGATTGATTTTTTTAAGAGCTCCAT
TCAGAAACACCCCGAATGGGGGTCCTTTGGCTTCACCTTTTCTGATACTTCAGAC
TTCGAGGACATGTCCGCCTTCTACAGGGAGGTGGAGAACCAGGGCTATGTCATCT
CCTTCGACAAAATAAAAGAGACATACATTCAGAGCCAGGTCGAGCAGGGAAATCT
GTACCTGTTTCAGATCTATAACAAGGATTTCAGTCCCTATAGCAAGGGCAAGCCC
AATTTACATACCCTGTACTGGAAGGCCCTGTTCGAAGAGGCAAACCTTAACAATG
TAGTTGCTAAGCTGAATGGGGAAGCAGAGATCTTCTTCCGAAGGCACAGCATCAA
GGCAAGCGACAAAGTTGTACATCCTGCTAACCAGGCCATCGATAACAAGAACCCG
CATACAGAAAAGACACAGTCAACCTTTGAATACGACCTCGTGAAGGACAAGAGGT
ACACACAAGATAAATTCTTCTTCCACGTGCCCATCAGCTTGAATTTTAAAGCGCA
GGGAGTGAGCAAATTTAACGACAAGGTCAACGGCTTCCTGAAGGGAAACCCCGAC
GTGAATATCATCGGAATTGATCGCGGTGAAAGACATCTCCTCTACTTTACTGTGG
TGAACCAGAAGGGTGAGATCCTAGTACAGGAGAGCCTGAACACCCTTATGAGTGA
TAAGGGCCATGTGAATGATTACCAGCAGAAGCTGGACAAGAAGGAACAGGAAAGG
GACGCAGCGCGGAAGTCCTGGACCACTGTTGAGAATATCAAAGAACTGAAGGAGG
GATATCTTAGCCATGTGGTACACAAACTTGCACATCTGATTATCAAGTATAATGC
CATAGTCTGCCTGGAAGACTTGAACTTCGGTTTCAAGCGAGGAAGGTTTAAAGTG
GAGAAGCAGGTGTACCAGAAGTTTGAGAAAGCCCTTATTGATAAGCTAAACTACC
TTGTCTTTAAGGAAAAAGAACTCGGCGAAGTTGGCCACTATTTAACCGCCTACCA
ACTAACCGCCCCTTTCGAGTCTTTTAAGAAACTGGGAAAGCAGAGCGGAATACTC
TTCTATGTGCCTGCAGACTACACCTCTAAGATCGACCCCACTACCGGCTTTGTAA
ACTTTCTAGATCTCCGCTATCAGTCAGTAGAAAAAGCCAAACAGCTCTTGTCAGA
TTTTAACGCCATCCGATTTAATTCCGTCCAAAATTACTTCGAGTTCGAAATCGAC
TATAAAAAACTTACCCCCAAGAGAAAGGTTGGGACGCAGTCTAAGTGGGTAATCT
GCACTTACGGTGACGTGAGATACCAGAACCGCCGAAACCAGAAAGGTCATTGGGA
AACCGAGGAAGTGAATGTGACTGAGAAGCTCAAGGCCCTCTTCGCTAGCGACAGT
AAAACAACAACAGTTATCGATTACGCCAATGACGATAATCTTATAGACGTGATCT
TGGAACAAGACAAAGCCTCTTTTTTTAAGGAATTGTTGTGGTTGCTGAAACTTAC
AATGACCCTTAGGCACAGCAAGATCAAATCAGAGGATGACTTCATCCTCAGCCCG
GTGAAGAATGAACAGGGAGAGTTCTACGATTCACGGAAGGCTGGAGAGGTGTGGC
CCAAGGATGCCGACGCGAACGGGGCCTACCACATAGCTCTAAAAGGTCTGTGGAA
CCTGCAACAAATCAATCAATGGGAGAAAGGTAAGACACTGAACCTGGCCATCAAA
AATCAAGATTGGTTCTCATTCATCCAGGAAAAGCCTTATCAAGAGTGA
SEQ ATGCATACGGGAGGCCTTTTATCAATGGACGCAAAAGAGTTCACCGGGCAGTATC
ID CATTATCTAAGACACTCCGCTTCGAGCTGAGGCCCATTGGCAGGACCTGGGACAA
NO: CCTGGAGGCGTCGGGCTACCTGGCTGAGGACAGACATCGCGCAGAATGCTATCCG
151 AGAGCTAAGGAGCTTTTGGACGACAATCATCGCGCGTTCCTTAACCGGGTGCTCC
CACAGATCGATATGGACTGGCACCCGATCGCTGAGGCTTTTTGCAAGGTCCATAA
GAACCCTGGGAACAAAGAGCTCGCCCAGGACTACAACTTGCAGCTGAGCAAGCGA
CGGAAAGAGATTTCTGCCTACCTTCAAGACGCCGATGGCTACAAAGGGCTCTTCG
CAAAGCCCGCATTGGATGAGGCCATGAAAATCGCCAAGGAGAACGGGAATGAAAG
TGACATCGAAGTTCTCGAAGCGTTTAACGGATTTAGCGTGTACTTTACCGGCTAT
CATGAGTCAAGGGAGAATATTTATAGCGATGAGGACATGGTCTCTGTGGCCTACC
GGATTACCGAGGATAATTTCCCGAGGTTTGTTTCAAATGCACTAATATTCGACAA
GTTAAATGAGAGCCACCCAGACATCATCTCGGAGGTCAGCGGCAACCTCGGAGTT
GACGATATTGGCAAATACTTCGACGTGAGCAACTATAACAACTTCCTCTCACAGG
CTGGCATCGACGACTATAATCATATTATAGGCGGCCACACTACTGAGGATGGTCT
CATTCAGGCATTCAATGTAGTCTTGAATCTTAGGCACCAGAAGGACCCTGGGTTT
GAAAAGATACAGTTCAAGCAGCTGTATAAGCAGATATTATCCGTGCGAACATCTA
AAAGTTACATCCCCAAACAGTTTGATAACTCAAAGGAGATGGTGGATTGCATATG
CGATTATGTGTCAAAAATTGAAAAGAGCGAGACTGTGGAGCGGGCTCTGAAGCTC
GTCAGGAACATTAGCTCCTTTGACCTTAGAGGAATTTTCGTCAATAAAAAGAATC
TGAGGATCCTGAGCAATAAGCTAATAGGAGATTGGGACGCCATAGAGACAGCATT
GATGCATTCCAGCTCAAGCGAGAATGATAAGAAGTCTGTCTACGATAGCGCTGAA
GCCTTCACGCTGGACGATATCTTCTCTTCCGTGAAAAAATTTAGTGATGCGTCCG
CAGAAGATATCGGGAATCGAGCCGAAGATATCTGCAGGGTAATTTCAGAGACCGC
CCCTTTCATCAATGACCTGCGCGCCGTGGACCTGGATAGCCTGAATGACGATGGT
TACGAAGCTGCAGTTTCTAAGATCAGGGAGTCTCTGGAGCCATATATGGACTTGT
TTCACGAACTTGAGATCTTTAGCGTGGGCGACGAGTTCCCGAAATGCGCAGCTTT
CTATAGCGAGTTAGAGGAGGTCAGCGAGCAATTAATCGAGATCATACCCCTGTTT
AATAAGGCACGGAGCTTTTGTACTCGCAAGCGCTACAGCACCGACAAGATTAAAG
TTAATCTGAAATTTCCAACTCTCGCAGACGGGTGGGACCTAAACAAGGAACGCGA
TAATAAAGCCGCCATCCTTAGAAAGGACGGAAAGTACTATCTTGCCATCCTAGAT
ATGAAAAAAGATCTGAGTTCCATTCGTACTAGCGATGAAGACGAATCTTCTTTCG
AAAAAATGGAGTATAAGCTGCTCCCCTCGCCAGTCAAGATGCTACCCAAGATCTT
TGTGAAGAGCAAAGCAGCCAAGGAAAAGTACGGGCTGACGGACAGGATGCTGGAG
TGCTACGATAAGGGAATGCATAAATCAGGGTCAGCTTTTGACTTGGGCTTTTGCC
ATGAGCTAATCGATTACTACAAGCGCTGTATCGCCGAGTATCCAGGATGGGACGT
TTTCGACTTTAAATTTCGGGAGACTTCTGATTATGGTTCAATGAAGGAGTTCAAC
GAAGATGTCGCTGGTGCCGGTTACTACATGAGCCTTCGCAAGATTCCTTGTTCCG
AAGTCTACCGGCTACTGGACGAGAAATCTATATATTTGTTCCAGATATATAACAA
GGACTACAGTGAGAATGCACATGGGAATAAGAATATGCATACTATGTATTGGGAA
GGTCTCTTTTCACCCCAAAATTTGGAGTCACCCGTGTTCAAACTTAGCGGTGGCG
CAGAGCTGTTCTTTAGGAAATCCAGTATACCCAATGACGCCAAGACAGTCCACCC
AAAGGGTAGCGTCCTGGTGCCCAGAAACGATGTGAACGGCAGGAGAATCCCTGAC
AGCATTTACCGAGAACTTACCAGGTACTTCAACCGCGGCGACTGTAGAATCTCTG
ATGAGGCAAAGTCTTATCTGGATAAGGTGAAGACTAAGAAGGCAGATCATGACAT
TGTGAAAGACCGCCGCTTTACTGTCGACAAAATGATGTTTCACGTGCCTATCGCA
ATGAATTTTAAGGCAATCTCAAAACCGAATCTGAACAAGAAGGTGATAGATGGCA
TTATCGATGACCAGGACCTCAAGATCATCGGAATCGACAGAGGTGAGCGAAACCT
GATATACGTCACAATGGTAGATCGGAAGGGTAATATTCTGTACCAGGATTCACTA
AACATCCTCAATGGATATGACTATCGAAAAGCTCTCGATGTCAGGGAATACGACA
ACAAGGAGGCGCGACGGAATTGGACAAAGGTGGAAGGCATACGGAAGATGAAGGA
AGGCTATCTGTCACTAGCTGTCTCCAAATTGGCTGATATGATTATAGAGAACAAC
GCCATTATCGTGATGGAAGATCTCAACCATGGATTCAAGGCAGGAAGAAGTAAAA
TTGAGAAGCAGGTGTATCAGAAGTTCGAAAGCATGCTTATTAATAAGTTGGGTTA
TATGGTCTTAAAGGACAAGTCTATCGATCAGAGCGGCGGCGCACTCCATGGGTAT
CAGCTGGCTAACCATGTCACCACACTAGCATCCGTAGGCAAACAGTGTGGCGTGA
TTTTCTACATTCCTGCTGCGTTCACTTCTAAGATCGATCCTACCACGGGATTCGC
AGACCTGTTCGCACTGAGCAATGTTAAAAACGTGGCCTCCATGAGGGAGTTCTTT
AGCAAAATGAAAAGCGTGATTTATGACAAGGCCGAGGGCAAGTTCGCTTTCACAT
TTGACTACCTGGACTACAATGTGAAATCAGAGTGCGGGAGAACCCTGTGGACCGT
ATACACGGTAGGGGAAAGATTCACTTACAGTCGAGTTAATCGGGAGTATGTCCGT
AAAGTGCCAACTGACATCATCTACGATGCCCTTCAGAAGGCTGGCATAAGTGTTG
AGGGGGATCTAAGGGACAGGATCGCTGAATCGGATGGCGATACTCTCAAATCAAT
CTTCTACGCCTTCAAGTATGCCCTCGACATGAGGGTAGAGAACCGGGAGGAGGAC
TATATACAGTCTCCCGTGAAGAATGCGTCGGGAGAGTTCTTCTGCTCAAAAAACG
CCGGGAAATCTTTGCCGCAGGATTCTGATGCAAATGGGGCTTATAACATTGCTCT
CAAAGGCATCCTGCAGCTGCGCATGCTATCTGAACAATATGACCCAAACGCTGAA
AGCATTAGATTGCCATTGATCACCAATAAGGCTTGGCTGACTTTCATGCAGAGCG
GTATGAAGACATGGAAAAACTAA
SEQ ATGGATTCCCTTAAGGACTTCACAAATCTTTACCCCGTGAGTAAAACCCTGAGAT
ID TTGAACTCAAGCCCGTGGGAAAGACTCTCGAGAATATCGAGAAGGCCGGGATTTT
NO: GAAGGAAGACGAGCATCGGGCGGAAAGTTACAGACGGGTGAAGAAGATTATAGAT
152 ACTTATCACAAGGTCTTTATAGACAGCTCTTTAGAGAACATGGCAAAGATGGGCA
TCGAGAACGAAATCAAGGCCATGCTGCAGTCCTTCTGCGAGCTGTATAAAAAGGA
TCATCGGACCGAAGGCGAAGACAAGGCGCTGGATAAGATCAGGGCAGTGCTGCGC
GGCCTCATTGTGGGTGCCTTCACTGGGGTGTGCGGGCGGAGAGAGAACACTGTGC
AGAATGAGAAATACGAGAGTTTGTTCAAAGAGAAACTCATCAAGGAAATCCTGCC
CGACTTCGTCTTAAGCACAGAAGCCGAATCTCTCCCATTTTCTGTCGAGGAGGCC
ACGCGTTCCCTTAAAGAGTTCGACAGTTTCACTTCATACTTTGCCGGATTTTATG
AAAACCGTAAAAATATATACTCCACTAAACCACAGTCAACTGCAATAGCTTACAG
GTTAATCCACGAAAACCTGCCAAAATTCATCGACAATATACTCGTCTTTCAAAAA
ATCAAGGAACCAATCGCGAAGGAACTTGAACACATCCGGGCTGACTTTAGTGCGG
GAGGATACATCAAAAAAGACGAGCGCCTGGAGGATATATTTTCACTAAATTATTA
TATTCATGTACTGAGCCAGGCTGGCATAGAAAAGTACAACGCTCTAATTGGGAAA
ATCGTGACAGAAGGTGACGGGGAAATGAAAGGGCTAAACGAACATATTAACTTAT
ATAACCAACAGCGGGGTCGAGAAGATCGTCTGCCCCTGTTCAGACCTCTGTATAA
GCAAATACTCTCCGACAGAGAGCAGCTATCATATCTGCCCGAGTCCTTTGAGAAA
GATGAAGAGCTGCTCCGGGCGCTCAAGGAGTTCTATGATCATATAGCCGAGGACA
TTTTGGGCAGAACTCAGCAACTCATGACGTCTATTTCTGAATATGATCTGTCTCG
TATCTATGTCAGGAATGATAGCCAGCTGACCGATATATCCAAGAAGATGCTGGGG
GACTGGAACGCCATTTATATGGCGAGGGAGCGAGCATACGATCACGAGCAGGCAC
CCAAGAGAATCACAGCCAAATATGAGAGAGACCGCATTAAGGCGCTGAAGGGCGA
AGAAAGTATCAGTCTGGCCAATCTGAACTCCTGCATAGCTTTCCTTGATAACGTG
AGGGATTGCAGAGTTGATACTTACCTGAGTACCCTGGGCCAGAAGGAAGGGCCTC
ACGGCCTCTCTAATCTAGTGGAGAATGTATTTGCCTCCTACCACGAAGCTGAGCA
GCTGCTGTCATTTCCGTACCCAGAGGAAAATAATTTAATACAGGATAAGGACAAC
GTAGTGCTTATCAAAAATCTACTGGATAACATTTCCGACCTCCAGCGCTTTCTCA
AACCACTTTGGGGGATGGGCGACGAGCCTGATAAGGATGAGCGCTTTTACGGCGA
GTACAACTACATCAGGGGCGCCTTGGACCAGGTGATTCCCCTCTATAATAAAGTC
AGGAATTACCTGACCCGAAAGCCATACAGTACAAGAAAGGTGAAATTAAATTTCG
GCAATAGTCAGCTGCTGTCTGGTTGGGACCGAAATAAGGAGAAAGACAACAGCTG
CGTAATTCTCAGAAAAGGACAGAACTTTTATTTGGCCATCATGAATAACAGACAC
AAGAGATCTTTCGAGAACAAAGTGCTCCCTGAGTATAAGGAGGGGGAACCCTACT
TCGAGAAGATGGACTATAAATTCCTTCCTGATCCAAATAAAATGCTGCCTAAAGT
ATTTCTGTCAAAAAAAGGTATAGAAATCTACAAACCTTCACCTAAGCTACTTGAA
CAGTATGGCCACGGCACCCATAAAAAAGGGGACACGTTCAGCATGGACGACCTAC
ACGAACTGATTGACTTCTTTAAGCACAGCATAGAAGCTCATGAGGACTGGAAACA
GTTCGGATTCAAATTCTCAGATACCGCGACCTACGAAAACGTGTCTAGTTTTTAC
CGGGAAGTCGAGGACCAGGGCTACAAGCTCAGCTTCAGAAAAGTTAGCGAATCTT
ACGTCTACTCCCTTATAGATCAAGGTAAGCTGTATCTCTTTCAAATCTACAACAA
GGACTTTTCCCCATGTAGCAAGGGCACCCCCAATCTGCACACTCTCTACTGGCGG
ATGCTGTTCGACGAGCGTAACCTGGCAGACGTGATCTACAAATTAGATGGTAAAG
CTGAGATCTTCTTTCGTGAAAAGAGCCTAAAGAACGATCACCCCACTCACCCCGC
CGGAAAGCCCATTAAGAAGAAAAGTAGGCAGAAGAAAGGAGAAGAATCGCTATTT
GAGTACGACCTCGTCAAGGATCGGCATTATACAATGGATAAGTTCCAGTTCCATG
TGCCAATAACTATGAATTTCAAGTGCAGTGCTGGCAGTAAGGTGAATGACATGGT
AAACGCTCATATCCGGGAGGCAAAGGACATGCATGTTATTGGAATTGATAGGGGT
GAGCGTAATCTCCTCTACATCTGTGTTATTGACTCCCGCGGCACAATCCTCGATC
AGATTTCCTTGAATACAATTAATGATATAGACTACCATGACTTGCTTGAGTCTCG
CGACAAAGATAGACAGCAGGAGAGAAGAAATTGGCAGACCATCGAAGGCATCAAG
GAACTCAAGCAAGGCTACCTTTCTCAGGCAGTGCATCGAATAGCCGAGCTGATGG
TGGCTTATAAAGCCGTCGTGGCACTAGAAGACCTAAATATGGGATTTAAACGAGG
CAGGCAGAAGGTGGAATCATCCGTATACCAGCAGTTCGAAAAACAGTTGATAGAC
AAACTCAATTACCTTGTAGACAAGAAGAAGCGGCCTGAGGACATAGGGGGCCTGC
TTAGAGCGTATCAATTTACAGCCCCATTCAAGTCTTTCAAAGAAATGGGTAAACA
GAACGGTTTTCTGTTTTACATCCCAGCGTGGAACACCAGCAATATAGATCCAACC
ACTGGCTTCGTCAATCTGTTTCATGCTCAGTATGAAAATGTGGACAAGGCCAAAT
CCTTCTTTCAGAAATTTGACAGCATCTCCTATAACCCAAAGAAAGACTGGTTTGA
ATTCGCCTTTGACTATAAGAATTTCACTAAGAAGGCCGAGGGATCAAGAAGCATG
TGGATATTGTGCACGCATGGCTCACGTATAAAGAACTTTAGAAACTCGCAAAAAA
ACGGGCAGTGGGACTCAGAAGAATTCGCACTCACCGAGGCTTTCAAATCCCTCTT
CGTCCGGTATGAGATCGATTACACCGCCGATCTGAAGACGGCAATCGTCGACGAG
AAACAGAAAGACTTCTTTGTAGATCTACTTAAGCTCTTTAAGCTAACCGTTCAGA
TGCGAAACAGTTGGAAAGAAAAGGATCTCGACTATCTCATTAGTCCAGTGGCTGG
CGCGGATGGTAGATTTTTCGATACCCGGGAAGGTAACAAGTCCCTTCCCAAAGAC
GCCGACGCGAATGGTGCCTACAATATTGCACTAAAGGGGCTCTGGGCGCTGCGGC
AAATTAGACAGACATCTGAAGGGGGCAAGCTTAAGCTGGCTATTTCTAATAAAGA
GTGGTTGCAGTTTGTGCAGGAAAGGAGTTATGAGAAGGACTAG
SEQ ATGAACAACGGCACCAACAACTTCCAGAACTTCATCGGCATATCGTCTCTGCAGA
ID AAACACTTAGGAATGCCCTGATTCCAACTGAGACAACACAGCAGTTTATTGTGAA
NO: GAATGGGATCATCAAAGAGGACGAATTGCGCGGGGAGAATAGGCAGATCCTGAAG
153 GACATCATGGACGATTACTACAGGGGTTTTATCTCCGAAACGCTGAGCTCGATTG
ACGATATTGACTGGACGTCCCTCTTTGAGAAGATGGAAATCCAACTTAAAAATGG
CGATAATAAAGATACCCTGATAAAGGAACAAACCGAATATAGAAAGGCTATACAC
AAAAAATTCGCAAATGACGACCGCTTTAAGAACATGTTTTCTGCAAAACTGATTA
GCGATATTCTGCCCGAGTTTGTGATTCACAATAATAACTATTCCGCTTCGGAGAA
GGAGGAAAAGACTCAGGTGATTAAACTGTTTTCTCGGTTCGCCACTTCTTTCAAA
GATTATTTCAAAAATCGCGCCAACTGTTTTTCCGCTGACGACATCTCCTCCTCTT
CCTGCCACCGGATCGTAAACGACAATGCCGAGATCTTTTTTAGTAACGCCCTTGT
GTATCGGAGGATAGTGAAGAGCCTGTCCAATGATGACATAAACAAAATTTCTGGC
GATATGAAGGATAGCCTCAAAGAGATGAGCCTTGAAGAAATTTACTCCTACGAGA
AGTATGGGGAGTTCATCACCCAGGAGGGGATTTCCTTCTATAATGACATCTGTGG
CAAGGTGAACAGCTTCATGAACCTGTACTGCCAGAAGAATAAGGAAAACAAAAAT
CTGTACAAGCTTCAGAAGTTACATAAGCAGATCCTGTGTATCGCGGATACCTCAT
ATGAGGTTCCTTATAAGTTCGAGAGTGATGAAGAAGTGTACCAGTCTGTAAATGG
ATTCTTAGACAATATTTCGTCCAAACATATAGTGGAGAGACTGAGAAAGATCGGG
GACAATTACAATGGGTACAATCTCGACAAGATTTATATCGTGTCGAAGTTTTACG
AATCTGTGAGCCAGAAAACATACAGGGATTGGGAAACCATTAATACCGCGCTTGA
AATTCACTACAATAATATTCTGCCTGGCAACGGAAAAAGCAAGGCCGATAAGGTA
AAAAAGGCAGTCAAAAATGACCTTCAGAAAAGTATCACCGAAATCAATGAGTTGG
TGAGCAACTACAAATTGTGTTCAGACGATAATATTAAAGCGGAAACGTACATACA
TGAAATTAGCCATATTCTGAATAACTTTGAGGCGCAGGAACTTAAGTACAACCCT
GAAATTCATCTCGTCGAAAGCGAATTGAAGGCCTCTGAATTGAAAAACGTTCTTG
ACGTGATAATGAACGCTTTCCATTGGTGCTCTGTGTTTATGACTGAAGAGCTGGT
TGATAAGGACAACAACTTTTATGCTGAACTTGAGGAAATCTACGACGAGATCTAC
CCTGTGATTAGCTTGTATAACCTCGTCAGAAACTACGTTACCCAGAAGCCGTACA
GCACGAAAAAAATAAAGCTGAACTTTGGTATTCCGACTCTCGCCGATGGATGGAG
CAAGTCGAAGGAATATTCCAACAATGCCATCATTCTTATGCGAGACAATCTGTAT
TACCTCGGCATCTTTAACGCCAAAAACAAGCCGGATAAGAAAATCATTGAAGGGA
ATACGAGCGAGAATAAGGGCGACTATAAGAAAATGATCTACAACTTACTGCCAGG
TCCCAATAAAATGATTCCTAAGGTGTTTCTGTCATCGAAAACAGGTGTAGAAACA
TATAAGCCCAGCGCATACATCCTGGAAGGCTACAAGCAAAACAAACACATCAAAA
GCAGCAAGGACTTTGATATCACATTCTGCCACGATCTAATCGACTACTTCAAAAA
TTGCATCGCCATTCACCCTGAGTGGAAGAACTTCGGCTTTGACTTCTCCGACACC
AGTACCTACGAAGACATTTCTGGATTCTACCGTGAGGTTGAGCTGCAGGGTTATA
AAATTGACTGGACATACATCAGTGAAAAAGACATCGATCTACTGCAGGAGAAGGG
GCAGCTCTATCTCTTCCAGATTTATAATAAGGATTTCAGCAAGAAGTCCACTGGA
AACGACAATCTGCATACAATGTATCTTAAGAACTTGTTTAGCGAAGAGAATTTGA
AAGATATCGTTCTAAAGTTAAACGGGGAAGCCGAGATTTTCTTTCGAAAGTCTTC
CATTAAGAATCCAATTATTCACAAGAAGGGCAGTATCCTGGTCAACAGAACCTAT
GAGGCCGAGGAAAAGGACCAGTTCGGTAATATACAAATTGTGCGCAAGAACATCC
CCGAGAACATTTACCAGGAGCTCTATAAATACTTCAACGACAAAAGCGATAAGGA
GCTTTCCGACGAGGCTGCCAAGCTGAAAAACGTGGTGGGACACCATGAAGCAGCC
ACCAACATCGTCAAAGATTATCGTTATACATATGACAAATATTTTCTGCACATGC
CTATTACAATAAACTTTAAGGCAAACAAGACCGGGTTCATCAATGACCGGATACT
CCAGTACATCGCAAAAGAGAAGGACCTGCATGTGATCGGCATCGACCGCGGTGAA
AGAAATCTCATTTACGTCAGCGTTATCGACACTTGTGGAAACATTGTGGAGCAGA
AGTCCTTCAACATTGTTAACGGCTATGACTATCAGATCAAGCTCAAACAGCAGGA
AGGTGCTCGTCAGATTGCGAGGAAAGAATGGAAAGAGATCGGCAAGATCAAGGAG
ATCAAAGAAGGGTATCTGAGCTTGGTCATTCACGAGATCTCCAAAATGGTCATCA
AGTACAACGCTATTATCGCGATGGAAGACCTCTCTTACGGCTTTAAGAAGGGGCG
CTTTAAAGTGGAGCGCCAGGTCTATCAGAAGTTCGAGACTATGCTTATCAATAAG
CTGAATTACTTGGTCTTTAAGGATATCAGTATCACCGAGAACGGAGGACTGCTGA
AAGGTTACCAGCTCACATATATTCCCGATAAGCTCAAGAATGTGGGCCACCAATG
CGGTTGTATTTTTTACGTTCCAGCTGCCTACACATCTAAGATCGATCCTACCACC
GGATTCGTCAATATATTTAAATTTAAAGATCTAACCGTTGATGCCAAGCGTGAGT
TTATTAAGAAATTTGATTCAATCAGGTACGACAGCGAAAAGAACCTCTTCTGTTT
CACTTTCGACTACAACAACTTCATCACACAAAATACTGTGATGAGCAAGTCATCA
TGGAGCGTTTATACTTATGGTGTAAGGATAAAAAGGCGCTTTGTTAATGGAAGGT
TTTCCAATGAAAGCGATACAATAGACATCACAAAAGACATGGAGAAGACACTGGA
GATGACAGATATTAATTGGAGGGACGGGCATGACCTTAGACAGGACATCATCGAC
TACGAAATCGTCCAACACATTTTTGAGATATTCAGACTCACTGTCCAGATGCGAA
ACAGCCTGTCGGAACTCGAAGACCGGGACTACGATAGACTGATCTCCCCGGTGTT
AAACGAAAATAATATTTTCTACGATTCTGCTAAGGCAGGAGACGCTCTTCCTAAA
GATGCGGACGCCAATGGCGCTTACTGTATAGCGTTGAAGGGATTGTATGAGATTA
AACAGATCACTGAGAATTGGAAAGAAGACGGTAAATTCTCCAGAGACAAGCTGAA
AATCTCCAACAAAGACTGGTTTGATTTTATTCAAAATAAGCGCTACCTGTAA
SEQ ATGACAAACAAATTTACTAATCAGTACAGCCTGTCAAAGACCCTCCGCTTCGAAC
ID TGATTCCACAAGGGAAGACCCTTGAATTCATCCAGGAAAAGGGTTTATTATCCCA
NO: GGATAAACAACGCGCAGAAAGCTATCAAGAGATGAAGAAGACGATCGATAAATTT
154 CATAAGTATTTCATAGATTTAGCCCTGAGCAACGCTAAATTGACCCACCTGGAAA
CCTATTTGGAGCTGTACAACAAGTCAGCCGAGACAAAGAAAGAGCAGAAGTTTAA
GGACGACCTGAAAAAAGTACAGGACAATTTGCGAAAAGAGATCGTCAAGTCTTTT
TCCGACGGAGACGCCAAGTCAATATTTGCCATCCTGGACAAAAAGGAACTCATCA
CTGTGGAGTTGGAGAAGTGGTTTGAGAATAATGAGCAGAAGGACATCTATTTTGA
CGAAAAGTTCAAGACATTTACTACTTACTTCACCGGATTTCACCAAAACCGGAAG
AACATGTACTCTGTTGAGCCGAACTCAACCGCCATCGCCTACCGCCTTATTCACG
AAAATCTGCCAAAGTTTCTCGAGAATGCTAAAGCCTTTGAGAAAATTAAGCAGGT
CGAGTCGCTCCAGGTGAACTTTCGAGAGCTGATGGGTGAATTCGGGGACGAGGGC
CTGATTTTCGTGAATGAACTCGAAGAGATGTTTCAGATCAACTACTATAATGATG
TACTCTCACAGAACGGGATCACTATCTACAACAGCATTATCTCTGGATTCACTAA
GAACGATATCAAGTATAAAGGGCTGAATGAATACATCAACAATTATAATCAGACT
AAGGACAAAAAGGACAGGCTGCCTAAATTGAAACAGCTGTATAAGCAGATCCTCA
GTGATAGAATTAGCTTGTCATTTCTCCCAGATGCCTTCACTGACGGAAAGCAGGT
GCTTAAGGCGATATTCGATTTCTATAAGATCAACCTCCTCTCTTATACAATCGAG
GGCCAGGAGGAGTCACAGAACCTCCTGCTCCTGATTCGACAAACTATTGAAAATC
TGTCCTCTTTCGATACGCAGAAGATATACCTGAAAAATGACACCCATCTCACTAC
AATATCCCAACAGGTATTCGGAGATTTCTCCGTCTTCAGTACAGCCCTGAATTAC
TGGTACGAGACAAAGGTGAACCCTAAGTTCGAAACAGAGTACAGCAAGGCGAACG
AAAAGAAGAGGGAGATCCTGGACAAAGCCAAAGCCGTTTTCACCAAGCAAGATTA
CTTTAGCATCGCATTTCTGCAGGAAGTCCTGTCTGAGTACATACTGACACTCGAT
CACACAAGCGACATAGTTAAGAAGCACTCTTCCAATTGTATCGCGGACTACTTCA
AAAATCATTTTGTCGCGAAAAAGGAGAACGAGACAGATAAGACCTTCGATTTTAT
CGCGAATATTACCGCAAAGTATCAATGCATTCAGGGTATCTTGGAGAACGCCGAC
CAGTACGAAGACGAGCTTAAACAGGATCAGAAGCTCATCGACAACCTAAAGTTCT
TTTTGGACGCTATACTGGAACTCCTTCATTTTATTAAGCCACTACATCTGAAGAG
TGAGTCTATCACTGAGAAGGACACTGCTTTTTACGACGTTTTCGAGAATTACTAC
GAAGCACTGTCTCTGCTAACCCCTCTGTATAACATGGTGAGAAACTATGTGACAC
AGAAACCTTATAGTACCGAGAAGATTAAGTTGAACTTCGAGAACGCACAATTGCT
GAATGGGTGGGATGCAAACAAAGAGGGTGATTACCTCACAACAATCCTCAAGAAA
GATGGCAATTACTTCCTGGCCATTATGGATAAAAAACATAACAAGGCATTTCAGA
AATTTCCCGAGGGGAAGGAAAATTATGAAAAGATGGTATACAAGTTGCTGCCCGG
GGTGAACAAAATGCTCCCGAAGGTGTTTTTCTCGAATAAGAATATCGCGTACTTT
AACCCGTCCAAGGAACTGTTGGAAAATTATAAAAAGGAAACACACAAGAAGGGGG
ACACTTTTAATTTGGAGCACTGCCACACACTCATTGACTTCTTTAAAGATAGTCT
CAACAAACATGAGGATTGGAAATATTTTGACTTTCAGTTTAGCGAGACCAAGTCT
TATCAGGATCTGTCGGGATTTTATAGGGAAGTTGAGCACCAGGGTTACAAGATAA
ATTTCAAGAACATCGATAGCGAGTACATTGACGGACTGGTGAACGAAGGGAAGCT
GTTCCTGTTTCAGATTTACAGCAAAGATTTCTCTCCTTTCTCAAAAGGCAAGCCG
AACATGCATACCCTGTATTGGAAGGCCCTGTTCGAGGAGCAAAACCTTCAGAATG
TGATTTACAAGCTGAACGGTCAGGCCGAGATTTTTTTTAGGAAGGCCTCTATCAA
GCCCAAAAACATCATTCTGCACAAGAAAAAGATAAAGATCGCCAAAAAACACTTC
ATTGATAAAAAGACAAAGACTTCTGAGATCGTACCTGTTCAGACAATCAAGAATC
TCAACATGTATTATCAGGGGAAGATTAGCGAGAAAGAGCTGACACAGGACGATTT
GAGGTACATCGACAACTTCTCTATCTTTAACGAGAAGAACAAGACAATCGATATC
ATCAAGGACAAGCGGTTTACCGTCGATAAATTCCAGTTCCATGTGCCTATCACGA
TGAATTTCAAGGCCACCGGTGGGAGTTATATCAACCAGACTGTGCTGGAGTATCT
GCAGAACAACCCCGAAGTAAAAATTATTGGCCTGGACAGAGGAGAGCGGCATCTG
GTGTACTTGACCCTCATCGATCAGCAGGGAAATATCCTGAAACAAGAATCTCTGA
ATACTATTACGGACTCCAAAATCAGCACACCTTACCACAAGCTGCTTGATAATAA
AGAGAATGAGAGGGACTTGGCCCGCAAAAATTGGGGCACCGTCGAGAATATTAAG
GAATTGAAAGAAGGATACATCTCACAGGTGGTTCACAAAATCGCAACCCTGATGT
TAGAAGAGAACGCTATTGTGGTGATGGAGGACTTAAACTTCGGATTTAAAAGAGG
AAGATTTAAAGTCGAGAAACAGATTTATCAGAAACTGGAAAAAATGCTCATTGAC
AAATTAAATTACCTGGTGCTGAAAGATAAACAGCCACAGGAGCTGGGTGGCCTGT
ATAATGCTCTGCAGCTGACCAACAAGTTCGAGTCGTTTCAGAAAATGGGCAAGCA
GTCAGGCTTCCTTTTTTACGTGCCCGCTTGGAACACCTCAAAAATCGACCCTACA
ACAGGCTTTGTGAATTATTTCTATACCAAGTATGAAAACGTGGACAAGGCAAAGG
CCTTTTTCGAGAAGTTTGAAGCAATCAGGTTCAATGCCGAGAAAAAATACTTTGA
GTTCGAGGTCAAAAAATATAGCGACTTCAACCCTAAGGCCGAAGGCACGCAACAA
GCCTGGACAATATGCACGTATGGGGAGAGAATTGAGACTAAGCGGCAGAAGGATC
AGAATAACAAATTCGTGAGCACACCGATTAACCTGACAGAGAAGATAGAGGACTT
CCTCGGCAAGAATCAGATCGTGTACGGCGACGGCAATTGCATCAAGTCACAAATT
GCATCTAAAGATGACAAAGCATTCTTCGAAACACTGCTGTATTGGTTCAAGATGA
CACTCCAGATGCGAAATAGCGAAACAAGAACAGATATTGACTACCTCATCAGCCC
TGTGATGAATGATAACGGCACGTTTTACAATTCCCGGGACTATGAAAAATTAGAG
AACCCGACACTGCCAAAAGACGCCGACGCAAATGGTGCATATCACATCGCAAAGA
AAGGTTTGATGCTGTTGAACAAAATTGATCAGGCTGATCTGACAAAAAAGGTCGA
TCTGAGTATCAGTAACCGCGACTGGTTGCAGTTTGTCCAGAAGAACAAATAA
SEQ ATGGAACAAGAGTACTATCTGGGCCTGGACATGGGCACCGGGAGTGTCGGATGGG
ID CAGTCACCGACTCAGAGTACCACGTCCTCAGAAAGCACGGTAAGGCACTTTGGGG
NO: AGTGCGACTCTTCGAGTCCGCTAGTACTGCTGAAGAGAGGAGGATGTTTCGAACT
155 TCCAGGCGCAGGCTGGATCGGCGAAACTGGAGAATAGAGATTCTCCAGGAGATAT
TTGCTGAAGAGATTTCAAAGAAGGATCCTGGTTTTTTCCTGCGCATGAAAGAATC
TAAGTATTACCCCGAAGATAAACGCGACATCAACGGCAATTGTCCTGAACTGCCC
TATGCTCTGTTTGTCGACGACGATTTCACCGACAAAGATTACCACAAGAAATTCC
CCACCATATACCACCTGAGAAAGATGTTGATGAACACCGAGGAGACACCCGACAT
ACGTCTGGTTTACCTGGCTATCCATCATATGATGAAGCACCGCGGGCATTTCCTG
CTGTCTGGAGACATCAATGAGATAAAGGAATTTGGTACTACGTTCTCCAAGTTGT
TAGAAAACATTAAGAATGAAGAGTTGGACTGGAATCTTGAACTGGGAAAGGAAGA
GTATGCAGTTGTAGAGTCGATTTTGAAAGATAACATGTTAAACCGGTCAACTAAG
AAAACCAGGTTAATTAAGGCACTAAAGGCCAAATCGATATGCGAGAAGGCTGTGC
TAAATCTGCTGGCTGGAGGCACCGTGAAACTGTCTGATATTTTCGGCCTGGAAGA
GCTCAATGAAACCGAGCGGCCTAAAATTTCTTTCGCCGATAACGGATACGATGAC
TATATTGGGGAGGTGGAAAACGAGCTCGGAGAACAATTCTACATTATTGAAACCG
CTAAGGCAGTCTATGACTGGGCCGTGCTCGTCGAGATTTTAGGCAAGTACACCAG
CATTAGCGAAGCAAAGGTGGCTACCTATGAAAAGCACAAATCTGACCTCCAGTTT
CTGAAAAAGATTGTGCGCAAATACTTAACAAAAGAAGAGTACAAGGACATCTTTG
TGAGCACATCAGATAAGCTCAAGAATTACTCAGCATACATTGGAATGACAAAGAT
TAACGGGAAGAAGGTGGATCTCCAAAGCAAACGTTGTTCAAAGGAGGAGTTTTAC
GATTTCATAAAGAAGAACGTGCTGAAGAAACTGGAGGGACAACCGGAGTACGAGT
ATTTAAAGGAGGAGCTCGAGCGAGAAACTTTCCTGCCCAAGCAAGTGAACAGAGA
CAATGGTGTCATTCCTTACCAGATTCACTTATATGAGCTGAAGAAAATCCTGGGG
AACTTGAGAGACAAGATAGACCTCATCAAGGAAAATGAAGATAAGTTGGTCCAGT
TGTTCGAATTCAGAATCCCATATTACGTCGGCCCGCTCAATAAGATCGACGACGG
CAAGGAAGGCAAATTCACTTGGGCGGTGCGAAAAAGCAACGAAAAAATATACCCA
TGGAACTTTGAGAACGTCGTTGACATCGAGGCCAGCGCCGAGAAATTTATAAGAC
GCATGACTAATAAGTGTACTTACCTCATGGGCGAGGATGTTCTGCCCAAGGACAG
CCTGCTGTATTCCAAGTACATGGTGCTTAACGAGCTGAATAATGTAAAGTTAGAT
GGTGAGAAGCTCAGCGTGGAGCTTAAACAGAGGCTGTACACTGATGTGTTTTGCA
AGTATCGGAAAGTTACCGTTAAGAAGATAAAGAATTACCTGAAATGCGAAGGGAT
CATTTCCGGCAACGTGGAAATTACCGGAATCGACGGCGATTTTAAGGCGTCGTTG
ACCGCTTATCATGATTTCAAGGAGATTTTAACCGGCACGGAGCTCGCGAAGAAAG
ACAAGGAGAACATAATCACGAATATAGTTCTGTTTGGGGACGATAAAAAACTTCT
TAAAAAACGACTCAATCGACTGTATCCGCAGATTACCCCCAACCAGCTGAAGAAG
ATTTGCGCTCTGAGCTATACCGGGTGGGGCCGGTTCTCTAAGAAATTCCTCGAGG
AGATCACAGCACCAGACCCAGAGACTGGTGAGGTGTGGAATATTATTACAGCTCT
GTGGGAATCCAATAATAACCTTATGCAATTGTTGAGCAATGAATATAGGTTCATG
GAGGAAGTGGAAACCTACAATATGGGCAAGCAGACAAAGACCCTATCTTACGAGA
CCGTTGAGAATATGTATGTCTCCCCTTCAGTGAAACGGCAAATCTGGCAAACTTT
GAAGATCGTGAAGGAGCTCGAAAAGGTGATGAAAGAGAGCCCGAAGAGGGTTTTT
ATTGAAATGGCCAGAGAGAAACAGGAGAGCAAGAGAACAGAGTCTAGGAAGAAGC
AGCTAATCGATTTGTATAAAGCCTGCAAGAACGAGGAAAAAGACTGGGTCAAGGA
GCTAGGCGATCAGGAAGAACAGAAGTTGCGCTCTGATAAGCTGTACTTATATTAT
ACCCAGAAAGGACGGTGCATGTACTCAGGTGAGGTCATTGAGCTGAAAGATCTGT
GGGACAATACTAAGTATGATATTGATCACATCTACCCTCAGTCAAAAACTATGGA
CGACTCCCTCAACAACAGGGTGTTGGTTAAGAAGAAATACAATGCTACAAAGTCC
GATAAATACCCTCTTAACGAAAACATCCGGCACGAAAGAAAGGGCTTCTGGAAGT
CCCTGCTGGATGGGGGTTTTATCAGTAAAGAAAAGTATGAGAGGCTGATCCGAAA
TACCGAGCTCTCCCCCGAGGAACTGGCTGGCTTTATCGAAAGGCAGATCGTAGAG
ACTAGGCAATCTACAAAGGCAGTCGCTGAGATCCTGAAGCAAGTGTTTCCTGAGT
CAGAAATCGTGTACGTCAAAGCTGGCACAGTGTCACGGTTCCGAAAGGACTTTGA
GTTGTTAAAAGTTCGGGAGGTGAATGACCTGCACCACGCTAAAGACGCCTATCTG
AATATCGTTGTGGGGAACTCCTATTATGTTAAGTTTACTAAGAATGCGTCCTGGT
TTATTAAGGAGAACCCGGGGCGCACCTATAACCTGAAGAAGATGTTCACCTCCGG
CTGGAACATAGAACGGAACGGAGAAGTCGCGTGGGAGGTGGGTAAGAAAGGGACC
ATTGTGACCGTCAAACAGATTATGAACAAAAACAACATATTGGTAACTCGCCAGG
TGCATGAGGCCAAAGGGGGCCTCTTTGATCAGCAGATTATGAAAAAGGGCAAAGG
ACAGATCGCAATCAAGGAAACCGACGAGCGCCTGGCATCCATTGAGAAGTACGGA
GGCTACAACAAGGCGGCAGGTGCGTACTTCATGCTCGTCGAGTCCAAAGATAAGA
AAGGCAAAACTATTAGAACAATCGAGTTCATCCCTCTATATTTGAAAAATAAGAT
CGAAAGTGACGAAAGCATCGCCCTTAACTTCTTGGAGAAGGGCCGGGGCTTAAAG
GAACCAAAGATTCTGCTCAAGAAGATCAAGATCGACACACTCTTCGATGTGGATG
GTTTTAAGATGTGGCTGTCAGGCAGGACAGGGGATCGCTTGCTGTTCAAATGCGC
AAATCAGTTGATTCTGGACGAAAAGATCATTGTGACGATGAAGAAGATCGTTAAA
TTCATTCAGCGGAGACAGGAAAACAGAGAACTGAAACTCTCCGATAAGGATGGAA
TTGACAATGAAGTCCTCATGGAGATTTACAATACCTTTGTGGACAAGCTTGAGAA
CACAGTCTATCGGATCCGACTGTCCGAACAGGCAAAGACTCTGATCGACAAACAG
AAAGAATTCGAAAGACTAAGCTTAGAGGACAAAAGTTCAACTCTCTTTGAAATTC
TCCACATCTTCCAATGTCAAAGTAGTGCAGCCAACTTGAAGATGATCGGGGGTCC
CGGCAAGGCTGGAATCTTAGTCATGAACAACAACATCTCCAAATGTAACAAAATC
TCCATCATAAACCAGTCTCCCACCGGCATTTTCGAGAACGAAATTGATTTACTCA
AG
SEQ ATGAAATCTTTCGATTCTTTCACCAACCTCTACTCCCTTAGCAAAACCCTTAAGT
ID TTGAAATGAGGCCGGTGGGGAATACACAGAAGATGCTTGACAATGCTGGCGTCTT
NO: TGAAAAGGACAAATTAATCCAGAAGAAGTATGGTAAAACAAAGCCATATTTTGAC
156 CGATTGCATCGGGAATTCATTGAAGAGGCTCTTACAGGAGTAGAATTGATCGGAC
TGGACGAGAACTTCCGTACCTTAGTAGACTGGCAGAAGGACAAGAAGAACAACGT
GGCAATGAAGGCCTATGAGAACTCACTCCAGCGCCTTAGAACCGAGATCGGAAAG
ATCTTTAATCTTAAGGCGGAAGATTGGGTAAAAAATAAGTACCCGATCCTGGGAC
TGAAAAACAAAAACACAGACATCCTGTTTGAAGAAGCCGTCTTTGGTATCTTGAA
GGCCAGGTATGGAGAGGAGAAAGACACGTTTATAGAGGTAGAGGAGATTGATAAA
ACAGGCAAGAGTAAGATTAATCAGATCAGTATCTTTGATTCTTGGAAGGGGTTCA
CAGGCTACTTTAAGAAGTTTTTCGAAACCAGGAAAAATTTCTATAAGAACGATGG
CACCTCCACAGCTATCGCGACACGCATCATAGATCAGAATCTGAAACGGTTCATT
GATAATCTGAGCATTGTTGAATCCGTGCGCCAGAAGGTCGACCTAGCTGAGACTG
AGAAGTCTTTCTCTATATCACTCTCCCAGTTCTTCTCAATAGATTTTTATAATAA
GTGCCTTCTGCAAGATGGCATAGACTACTATAACAAGATCATCGGCGGCGAAACT
CTCAAAAACGGTGAAAAGCTCATTGGCCTGAATGAGCTCATCAACCAATATAGAC
AAAATAACAAGGATCAGAAAATCCCATTCTTTAAGCTGCTAGATAAACAGATCCT
ATCAGAAAAAATCCTGTTCCTCGACGAAATCAAAAACGACACCGAACTCATCGAG
GCTCTCTCGCAGTTTGCCAAGACGGCTGAGGAGAAGACGAAGATTGTGAAAAAGC
TGTTTGCAGACTTTGTGGAGAACAACTCTAAATACGATTTGGCTCAGATTTATAT
CTCCCAGGAAGCATTTAACACAATCTCCAATAAGTGGACTAGCGAGACTGAAACC
TTCGCCAAATACCTGTTCGAGGCCATGAAAAGCGGCAAGCTCGCCAAATACGAGA
AGAAGGACAATTCCTATAAGTTTCCCGATTTCATCGCATTATCTCAGATGAAGTC
CGCGCTACTTAGCATTAGCCTGGAAGGCCATTTTTGGAAGGAGAAATACTATAAG
ATTTCCAAATTCCAAGAAAAGACCAATTGGGAGCAGTTCTTGGCTATTTTTCTAT
ACGAGTTCAACTCTTTGTTCAGTGACAAGATCAACACTAAGGACGGTGAGACCAA
ACAAGTGGGGTACTACCTCTTCGCCAAAGATCTTCATAACCTGATACTGTCCGAA
CAGATCGACATACCCAAGGATTCAAAGGTGACCATCAAGGATTTTGCGGATTCGG
TATTGACGATCTATCAGATGGCGAAGTATTTCGCTGTCGAGAAAAAGCGGGCATG
GCTGGCCGAATACGAGTTGGACTCCTTCTATACTCAACCCGATACAGGGTACCTG
CAGTTTTACGATAATGCATACGAGGATATAGTCCAGGTGTACAATAAACTCAGGA
ACTACCTCACTAAGAAACCATACTCCGAAGAAAAATGGAAACTTAATTTTGAGAA
TAGTACACTGGCCAATGGATGGGACAAGAACAAGGAATCAGACAACTCCGCTGTA
ATTCTCCAGAAGGGTGGCAAGTATTATCTGGGACTGATAACAAAGGGCCATAACA
AGATTTTCGATGACCGTTTTCAGGAGAAGTTTATAGTGGGCATAGAGGGTGGCAA
GTATGAAAAAATAGTCTACAAGTTCTTTCCCGATCAGGCGAAGATGTTCCCCAAA
GTATGCTTCAGTGCTAAAGGCCTCGAGTTTTTCCGGCCATCTGAAGAGATACTCC
GCATCTATAATAACGCAGAGTTTAAAAAGGGAGAGACGTACTCAATCGACTCGAT
GCAGAAACTCATTGACTTCTACAAAGATTGTCTCACAAAATACGAGGGCTGGGCT
TGCTACACGTTTCGGCACTTGAAGCCAACCGAGGAATATCAAAACAACATCGGGG
AGTTCTTCCGTGACGTCGCCGAAGACGGCTATAGAATTGACTTTCAGGGCATAAG
TGATCAGTATATTCACGAGAAGAATGAGAAAGGTGAGTTGCATCTTTTCGAAATC
CACAATAAAGACTGGAATCTTGACAAGGCTCGCGATGGAAAATCAAAGACTACCC
AGAAGAATCTTCATACACTTTACTTCGAGTCCCTCTTTTCCAACGACAACGTCGT
ACAGAATTTCCCAATAAAACTGAACGGCCAGGCCGAAATTTTTTACAGGCCCAAA
ACCGAAAAAGATAAACTGGAATCCAAGAAAGACAAGAAGGGAAATAAGGTGATAG
ATCACAAAAGGTATTCCGAGAACAAGATTTTTTTCCACGTACCTCTTACCCTGAA
CAGAACGAAGAACGACTCTTATAGATTCAATGCCCAGATAAACAACTTTCTCGCA
AACAACAAAGATATCAATATTATCGGCGTCGATAGAGGTGAGAAGCACTTGGTAT
ATTATTCTGTGATCACGCAAGCATCCGATATCTTGGAGTCCGGTTCTTTGAACGA
ACTGAATGGTGTCAACTACGCCGAGAAACTCGGTAAGAAAGCTGAGAATCGGGAG
CAGGCTAGAAGGGACTGGCAGGACGTTCAGGGTATCAAGGACCTGAAGAAGGGCT
ACATTTCTCAGGTGGTTCGAAAACTGGCTGATTTGGCCATTAAGCACAATGCAAT
CATCATTTTAGAAGATTTGAACATGCGGTTTAAACAAGTCAGGGGGGGGATAGAG
AAATCAATTTACCAACAGCTGGAAAAAGCTCTGATTGATAAACTCTCTTTTTTGG
TTGATAAGGGCGAAAAGAACCCCGAGCAAGCAGGACATCTCCTTAAAGCCTATCA
ACTGAGCGCACCTTTCGAGACATTCCAGAAGATGGGAAAGCAAACCGGCATCATT
TTCTATACCCAGGCTTCCTATACATCCAAGTCTGATCCAGTGACTGGGTGGAGAC
CCCATCTCTACCTCAAGTACTTTTCTGCCAAAAAAGCTAAGGACGACATTGCTAA
GTTCACAAAAATCGAGTTCGTGAACGACAGGTTCGAGCTGACTTATGACATAAAA
GATTTCCAGCAGGCCAAGGAGTACCCAAACAAGACAGTTTGGAAAGTGTGTTCCA
ATGTGGAGAGGTTTCGGTGGGACAAGAATCTGAATCAGAATAAAGGGGGATATAC
TCACTACACCAACATTACCGAGAACATCCAAGAGTTGTTCACCAAATACGGCATC
GACATTACTAAAGATCTGCTGACACAGATCTCCACCATCGATGAGAAGCAGAACA
CATCTTTCTTCCGGGATTTCATCTTTTATTTTAACTTGATCTGTCAGATTAGAAA
TACCGACGACAGTGAGATAGCTAAAAAAAACGGGAAAGACGATTTCATTCTCTCT
CCCGTGGAGCCGTTTTTTGACTCCCGCAAAGACAATGGCAATAAGCTTCCGGAAA
ACGGGGACGATAACGGCGCCTACAACATCGCTCGTAAGGGAATCGTTATCCTCAA
TAAAATAAGCCAGTATTCCGAGAAGAACGAGAATTGTGAAAAAATGAAGTGGGGG
GACCTTTACGTCAGCAACATCGATTGGGATAACTTTGTGACACAAGCCAATGCGA
GACACTAG
SEQ ATGGAAAACTTCAAAAACCTCTACCCCATCAACAAGACCTTGAGGTTTGAGCTCC
ID GGCCATATGGGAAGACACTGGAGAACTTCAAAAAGTCCGGTCTGCTGGAAAAGGA
NO: TGCTTTTAAGGCTAACTCTAGGAGGTCTATGCAGGCCATTATCGATGAGAAATTC
157 AAGGAGACCATAGAGGAGCGTCTGAAATATACTGAGTTTTCCGAGTGTGACCTAG
GAAATATGACCAGTAAGGACAAAAAGATCACCGACAAGGCAGCGACAAACCTGAA
GAAACAGGTGATTTTAAGCTTTGATGATGAGATTTTCAATAACTACTTGAAGCCG
GACAAAAACATCGACGCTCTGTTCAAGAATGATCCAAGCAACCCGGTCATCTCTA
CTTTCAAGGGCTTCACCACATACTTTGTAAATTTCTTCGAAATACGGAAACACAT
CTTCAAGGGAGAGTCTTCCGGTAGCATGGCTTACAGAATAATCGATGAGAACCTA
ACTACATATCTAAACAATATCGAGAAGATCAAGAAATTGCCTGAAGAACTGAAAT
CTCAGCTTGAGGGAATCGATCAAATTGACAAACTGAACAACTATAACGAGTTCAT
CACCCAGTCCGGCATTACTCATTATAACGAAATTATTGGAGGGATTTCGAAGTCT
GAAAATGTCAAAATTCAAGGCATTAACGAAGGGATTAATCTTTACTGTCAAAAGA
ATAAAGTGAAGCTACCACGCTTAACTCCTCTGTATAAGATGATTCTCTCTGATCG
GGTCTCTAATTCCTTTGTGCTGGATACCATTGAAAATGATACCGAGTTAATTGAA
ATGATCTCTGATCTGATAAATAAGACAGAGATAAGTCAGGATGTTATTATGTCCG
ACATCCAAAATATTTTCATCAAATATAAACAACTCGGCAACTTGCCGGGGATTAG
CTACTCATCTATAGTGAATGCTATCTGTTCGGATTACGACAATAACTTTGGTGAC
GGCAAACGTAAAAAAAGCTATGAGAATGATCGCAAAAAACACCTCGAGACTAACG
TGTATAGCATTAACTATATCTCAGAGTTACTGACAGACACCGACGTCTCCAGCAA
CATAAAGATGCGGTACAAAGAGCTGGAGCAGAATTATCAGGTATGCAAGGAAAAT
TTCAACGCCACTAACTGGATGAACATCAAAAACATTAAGCAGTCTGAGAAAACCA
ATCTGATCAAGGACCTTCTTGACATCCTCAAGAGCATCCAGCGGTTTTATGATTT
GTTTGACATCGTGGATGAAGACAAAAATCCTAGTGCTGAGTTCTATACCTGGCTG
TCTAAAAACGCGGAGAAACTGGACTTCGAGTTTAATTCAGTGTACAACAAGAGCA
GGAACTACCTCACGAGAAAGCAGTACTCCGATAAAAAGATTAAGTTGAACTTCGA
TAGTCCTACTCTCGCCAAGGGGTGGGATGCGAACAAAGAAATTGATAATAGCACA
ATTATCATGAGGAAGTTCAACAACGACCGGGGCGATTACGATTACTTCTTGGGGA
TCTGGAATAAGAGCACACCTGCCAACGAAAAGATCATCCCATTAGAGGATAATGG
ACTGTTTGAAAAAATGCAATATAAGCTGTATCCCGATCCTAGTAAAATGCTGCCA
AAGCAATTCCTTTCTAAGATCTGGAAAGCTAAACATCCAACTACACCCGAGTTTG
ATAAGAAGTACAAAGAAGGTCGGCACAAGAAGGGGCCTGATTTTGAGAAAGAGTT
TCTGCACGAGTTGATCGATTGCTTTAAGCATGGATTGGTAAACCACGACGAAAAA
TATCAGGATGTGTTCGGGTTCAATCTGCGCAACACGGAAGACTACAACTCTTATA
CAGAGTTTCTGGAGGACGTCGAAAGGTGCAACTATAATCTTAGTTTCAATAAAAT
CGCTGACACGTCTAACTTGATAAATGATGGGAAACTCTATGTTTTTCAGATCTGG
AGCAAGGATTTCAGCATAGATAGCAAGGGAACAAAAAACTTGAACACAATATACT
TTGAATCCCTCTTCTCGGAGGAAAATATGATCGAGAAGATGTTCAAGCTCTCAGG
GGAAGCCGAAATATTCTATCGTCCAGCAAGTTTGAATTATTGTGAAGATATTATC
AAGAAGGGACACCACCACGCCGAACTGAAGGACAAATTCGACTATCCCATCATCA
AGGACAAGCGATATAGCCAGGACAAATTTTTTTTTCATGTCCCCATGGTTATCAA
CTACAAAAGCGAGAAGTTAAACTCCAAATCACTTAACAATAGGACGAACGAAAAT
TTAGGCCAATTCACGCACATCATCGGTATCGACCGCGGAGAGCGACATCTCATCT
ACCTGACCGTGGTGGATGTGTCCACCGGTGAGATCGTTGAGCAAAAGCACCTGGA
TGAAATTATAAATACAGATACAAAAGGCGTCGAGCATAAAACTCATTATCTCAAT
AAATTAGAAGAGAAGTCCAAGACGCGGGATAATGAAAGAAAGTCCTGGGAAGCAA
TCGAGACGATTAAGGAGCTGAAAGAAGGCTATATTAGCCACGTGATCAATGAAAT
CCAGAAATTGCAGGAAAAGTATAACGCACTGATAGTGATGGAGAACCTCAATTAT
GGGTTTAAGAACTCGCGTATCAAAGTGGAAAAGCAGGTCTACCAGAAATTCGAGA
CCGCCCTGATTAAAAAGTTTAATTACATCATTGACAAGAAAGATCCTGAAACCTA
CATTCATGGATACCAACTGACGAATCCAATCACTACACTCGATAAAATTGGTAAC
CAGAGCGGTATTGTGTTGTACATTCCGGCTTGGAATACAAGCAAGATTGATCCAG
TCACTGGTTTCGTTAACCTCCTGTATGCAGACGATTTGAAATACAAGAACCAGGA
GCAGGCTAAAAGCTTTATCCAGAAAATCGATAATATCTACTTCGAAAATGGTGAG
TTTAAATTTGATATAGATTTCAGCAAATGGAACAACCGCTACTCAATTAGCAAGA
CGAAATGGACACTGACAAGCTACGGAACCCGGATACAGACGTTCCGAAACCCCCA
GAAAAATAACAAGTGGGACAGCGCCGAGTATGACCTGACCGAAGAGTTTAAATTA
ATCCTGAACATCGATGGTACTCTGAAATCTCAGGATGTGGAAACCTATAAGAAAT
TCATGTCTTTATTCAAGCTGATGTTGCAGCTGCGAAACTCCGTTACTGGAACAGA
CATTGACTACATGATTAGCCCTGTGACAGATAAAACTGGAACCCACTTTGATTCA
CGGGAGAATATCAAGAACCTGCCCGCCGATGCTGATGCGAACGGAGCTTACAACA
TTGCTAGGAAGGGCATCATGGCAATCGAGAATATTATGAACGGCATTAGCGACCC
TCTGAAGATCAGTAATGAGGACTACCTGAAGTACATTCAGAACCAACAAGAGTAA
SEQ ATGACCCAGTTTGAGGGTTTCACCAATCTTTATCAGGTGTCAAAAACACTCAGAT
ID TTGAGCTCATCCCACAGGGTAAAACTTTAAAGCATATTCAAGAGCAGGGCTTTAT
NO: AGAGGAAGACAAAGCCAGAAACGACCATTATAAGGAACTAAAACCGATCATTGAC
158 CGCATCTACAAAACCTATGCCGACCAATGCCTTCAGCTCGTCCAACTCGATTGGG
AGAATCTGAGCGCCGCTATTGACAGCTACAGGAAGGAGAAGACCGAGGAGACTAG
AAACGCCCTGATCGAGGAGCAGGCGACCTATAGAAACGCTATTCACGATTATTTT
ATCGGCCGCACCGACAATTTGACAGATGCCATCAACAAGCGGCACGCCGAAATTT
ATAAGGGGTTATTTAAGGCCGAGCTGTTCAATGGAAAAGTACTGAAACAGCTGGG
CACCGTAACAACCACCGAACACGAGAATGCTCTGTTGAGGTCCTTCGACAAGTTT
ACTACCTACTTTAGCGGCTTCTACGAAAACCGTAAAAACGTGTTTTCCGCGGAGG
ATATTTCAACAGCCATTCCTCATAGGATCGTGCAGGATAATTTCCCCAAGTTTAA
GGAGAACTGCCATATCTTTACCAGACTTATCACTGCTGTGCCAAGTTTACGAGAA
CACTTCGAGAATGTTAAGAAGGCTATAGGCATATTCGTTTCCACCTCCATCGAAG
AAGTATTCAGTTTTCCATTCTACAATCAGTTACTCACGCAGACCCAGATAGATCT
CTACAATCAGCTGCTCGGAGGCATTTCTAGAGAAGCAGGCACGGAAAAGATCAAG
GGCTTAAATGAAGTACTCAATCTTGCAATTCAGAAGAACGATGAGACAGCACACA
TTATTGCATCTCTCCCTCACAGATTCATTCCCCTGTTCAAACAGATCCTGTCCGA
TCGCAACACACTAAGCTTTATACTTGAGGAGTTTAAGTCAGATGAGGAAGTGATC
CAGAGCTTCTGTAAGTATAAGACTTTGCTCCGTAATGAAAACGTGCTTGAGACAG
CAGAGGCTCTCTTTAACGAGTTGAATTCCATCGACCTGACACACATTTTTATCAG
CCATAAAAAGCTGGAAACGATTAGCTCTGCCTTGTGCGACCACTGGGACACCCTG
CGTAACGCCCTCTATGAAAGGCGCATTTCCGAGCTCACCGGGAAGATCACAAAAA
GTGCCAAGGAAAAAGTCCAGAGGTCCCTTAAACATGAAGACATCAACCTACAAGA
GATCATCTCTGCGGCTGGGAAAGAGCTGTCAGAAGCATTTAAACAGAAGACTTCC
GAGATCCTGAGCCACGCACACGCCGCATTAGACCAGCCCCTGCCTACAACTCTTA
AAAAACAGGAGGAGAAGGAGATTTTAAAGAGCCAGCTGGACTCATTACTCGGCCT
GTATCATCTCCTGGACTGGTTCGCCGTGGACGAATCCAACGAGGTGGACCCAGAA
TTTAGCGCCAGGCTGACAGGAATTAAACTGGAAATGGAGCCAAGTTTGAGCTTTT
ACAACAAGGCTCGGAACTATGCCACTAAAAAGCCCTACAGCGTGGAAAAGTTCAA
GCTGAATTTTCAGATGCCGACCCTGGCTTCCGGGTGGGATGTTAATAAGGAAAAG
AATAATGGGGCTATACTGTTCGTCAAAAATGGTCTCTACTACCTGGGAATCATGC
CCAAACAGAAGGGCAGGTACAAAGCCCTTTCGTTTGAGCCGACCGAAAAAACCAG
CGAAGGCTTTGATAAGATGTATTACGACTATTTCCCAGATGCAGCCAAGATGATC
CCAAAATGTAGCACTCAGTTGAAGGCGGTAACCGCTCACTTTCAGACACACACCA
CTCCTATCTTGCTCTCCAACAACTTTATTGAGCCGCTGGAGATCACGAAGGAAAT
CTACGACCTTAACAACCCAGAGAAGGAACCCAAGAAATTCCAAACAGCTTATGCT
AAGAAGACTGGGGATCAAAAGGGCTATCGAGAGGCTTTGTGTAAGTGGATTGACT
TTACACGGGATTTCCTGAGTAAGTATACCAAGACCACATCTATTGACCTGTCCTC
ACTGAGACCTTCCTCACAATATAAGGATCTCGGAGAGTATTATGCCGAACTCAAC
CCTCTACTCTATCACATCTCTTTCCAGAGGATCGCCGAAAAGGAAATTATGGACG
CCGTCGAGACAGGCAAGCTGTACCTCTTCCAGATTTACAACAAGGATTTCGCAAA
GGGCCACCACGGAAAACCCAATTTGCACACTTTGTACTGGACAGGGCTCTTCTCT
CCCGAAAATTTGGCCAAAACTTCAATAAAACTGAACGGGCAAGCCGAGCTGTTCT
ATCGGCCCAAGTCACGTATGAAGCGGATGGCCCACCGGCTGGGCGAGAAGATGCT
CAACAAGAAACTGAAGGATCAGAAGACGCCCATACCAGACACTCTTTACCAAGAG
CTGTATGACTACGTGAATCACAGACTGAGTCACGACCTGTCTGATGAAGCCCGGG
CTCTTCTTCCAAATGTGATTACCAAAGAAGTTTCCCACGAAATTATCAAGGACCG
GCGCTTCACCTCTGACAAATTCTTTTTCCACGTCCCAATCACCCTCAACTACCAG
GCAGCCAATTCCCCTTCAAAGTTTAACCAGCGTGTGAATGCCTACCTGAAAGAGC
ATCCGGAGACCCCCATCATAGGGATAGACAGAGGAGAGCGGAATCTTATCTACAT
TACTGTGATTGACAGCACAGGTAAGATCTTGGAGCAGAGATCTTTAAATACAATC
CAGCAGTTTGACTACCAGAAGAAACTGGATAACCGAGAGAAGGAAAGGGTTGCTG
CAAGACAGGCCTGGTCAGTGGTCGGCACCATCAAAGACCTGAAGCAGGGCTACTT
ATCCCAAGTAATTCACGAAATTGTCGATCTTATGATTCATTATCAAGCCGTTGTT
GTGCTGGAGAACCTGAATTTTGGCTTCAAAAGCAAACGAACAGGTATCGCCGAGA
AAGCCGTGTATCAGCAGTTCGAAAAGATGCTCATAGACAAGCTGAACTGCTTAGT
GCTGAAGGATTATCCTGCTGAGAAGGTCGGCGGCGTACTTAACCCATACCAGCTG
ACCGATCAGTTCACTAGTTTCGCCAAGATGGGAACGCAAAGTGGCTTCCTTTTCT
ACGTGCCCGCTCCCTACACGAGTAAGATCGACCCTCTGACCGGCTTCGTCGACCC
ATTCGTCTGGAAGACCATCAAGAATCACGAATCACGGAAACACTTCTTAGAGGGG
TTTGACTTCCTGCACTACGACGTGAAGACAGGGGACTTCATCTTACACTTTAAGA
TGAATCGAAACCTCTCCTTCCAGCGGGGCCTGCCTGGTTTCATGCCCGCATGGGA
CATCGTGTTTGAGAAAAACGAGACACAGTTTGACGCTAAGGGAACCCCCTTTATT
GCGGGGAAGCGGATTGTCCCAGTCATCGAAAACCATCGGTTCACCGGGCGATACC
GGGATCTGTACCCGGCCAACGAGCTCATCGCGCTGCTGGAGGAGAAGGGTATTGT
GTTTAGGGATGGATCCAACATTCTGCCTAAGTTGCTGGAAAATGATGATTCGCAC
GCCATTGATACCATGGTTGCACTGATTAGATCCGTACTGCAGATGAGGAATAGCA
ATGCTGCAACCGGGGAGGATTATATTAATTCCCCAGTGCGAGATCTGAATGGTGT
CTGTTTTGACTCGCGCTTTCAGAATCCAGAATGGCCAATGGATGCAGACGCTAAC
GGGGCGTACCACATTGCTCTGAAAGGCCAGCTACTCCTGAACCACCTCAAGGAGA
GCAAAGATCTGAAGCTGCAGAACGGCATTTCCAACCAAGACTGGCTCGCCTACAT
ACAAGAACTGCGCAATTAA
SEQ ATGGCTGTCAAATCCATCAAGGTTAAATTACGGCTTGATGACATGCCCGAGATCC
ID GCGCCGGGCTCTGGAAACTCCATAAAGAAGTGAATGCTGGCGTTAGATACTACAC
NO: AGAATGGCTCTCCCTGCTGCGCCAGGAAAATTTGTACCGCCGGTCACCTAATGGA
159 GATGGAGAGCAGGAATGCGATAAAACAGCAGAAGAGTGCAAAGCCGAATTGCTGG
AGCGACTGCGGGCACGGCAGGTTGAGAATGGACACCGAGGTCCGGCGGGATCGGA
CGACGAGCTGCTCCAGCTCGCCAGACAATTATATGAACTGCTGGTGCCTCAGGCT
ATTGGGGCAAAGGGTGACGCACAGCAGATTGCTAGAAAATTTCTGTCTCCCCTCG
CCGACAAAGACGCTGTCGGCGGCCTTGGGATAGCCAAAGCCGGCAACAAACCCCG
ATGGGTGCGCATGAGGGAGGCTGGTGAGCCTGGCTGGGAGGAAGAAAAGGAAAAG
GCCGAAACCAGAAAGTCCGCCGACAGGACCGCGGACGTACTCCGAGCATTGGCCG
ATTTTGGGCTGAAGCCCTTAATGCGAGTCTACACCGATAGTGAAATGTCTAGCGT
GGAGTGGAAGCCATTACGCAAAGGGCAGGCAGTGCGGACGTGGGACCGTGACATG
TTCCAGCAAGCCATCGAGCGAATGATGAGCTGGGAGAGCTGGAACCAGAGAGTGG
GGCAGGAGTATGCCAAGCTGGTCGAGCAGAAAAACCGGTTTGAGCAAAAAAATTT
TGTAGGTCAGGAACACCTGGTGCATCTCGTTAACCAGCTCCAGCAAGATATGAAG
GAAGCTTCGCCTGGATTAGAGAGCAAAGAGCAGACTGCACACTATGTAACCGGAA
GAGCACTGAGGGGCAGTGACAAAGTGTTCGAAAAATGGGGAAAACTGGCTCCCGA
TGCCCCCTTTGACCTGTACGACGCAGAAATAAAAAACGTGCAGCGGCGAAACACC
AGGCGATTTGGTAGCCATGATCTGTTCGCCAAATTGGCAGAGCCGGAATATCAGG
CTCTTTGGCGAGAAGACGCATCATTTCTCACTAGGTACGCGGTCTATAACTCCAT
TTTGAGGAAATTGAACCACGCAAAAATGTTTGCCACCTTCACGTTGCCTGACGCC
ACCGCTCATCCCATTTGGACACGGTTTGATAAGCTGGGCGGCAATCTGCATCAGT
ATACATTCCTGTTTAACGAGTTTGGAGAGCGAAGACATGCGATACGATTCCACAA
GCTACTGAAGGTCGAAAATGGCGTGGCACGTGAGGTGGACGATGTCACCGTGCCC
ATCAGCATGAGCGAACAGCTGGATAATTTGTTGCCGCGGGACCCAAATGAACCTA
TAGCCCTTTATTTTAGGGACTACGGGGCGGAGCAACATTTCACTGGGGAGTTTGG
CGGCGCAAAAATTCAGTGCCGACGCGACCAGCTCGCCCACATGCATAGAAGACGC
GGGGCCCGGGACGTATACCTTAACGTCTCTGTGAGGGTGCAGTCCCAGTCAGAGG
CAAGAGGGGAACGCAGACCACCTTACGCAGCAGTATTCAGGCTGGTAGGCGATAA
CCACCGGGCGTTTGTACACTTTGATAAACTTTCTGACTACCTGGCCGAACACCCG
GATGACGGCAAATTAGGATCGGAGGGGCTGCTTAGCGGCCTGCGTGTGATGAGCG
TCGATCTGGGGCTACGGACCTCTGCTTCCATCTCTGTGTTCCGTGTGGCCCGAAA
GGACGAGTTGAAACCTAATTCGAAGGGCCGTGTACCATTCTTTTTCCCTATTAAG
GGAAATGATAATCTCGTCGCGGTGCACGAGCGTTCCCAACTGCTGAAACTGCCTG
GCGAGACCGAGTCCAAAGATCTCAGAGCAATCCGGGAGGAGCGACAACGTACACT
TAGGCAACTCCGCACCCAGCTGGCCTATCTGCGCTTGCTGGTGCGGTGCGGCTCC
GAGGATGTAGGGAGAAGAGAGCGAAGCTGGGCAAAGCTGATAGAGCAACCAGTTG
ACGCCGCGAATCACATGACCCCCGACTGGCGCGAAGCGTTTGAAAATGAGCTGCA
GAAGTTGAAATCTCTGCATGGGATTTGCTCAGATAAGGAGTGGATGGACGCCGTA
TACGAGTCTGTTCGCCGGGTATGGCGGCACATGGGGAAGCAGGTGAGAGATTGGA
GAAAGGACGTTCGCTCTGGGGAACGGCCGAAAATTCGGGGATACGCAAAGGATGT
CGTGGGCGGCAATAGCATTGAGCAGATCGAGTACCTGGAAAGGCAATACAAATTT
CTGAAATCTTGGTCTTTCTTTGGGAAGGTAAGCGGACAAGTTATCAGAGCCGAAA
AGGGATCTCGCTTTGCTATCACATTGAGGGAACACATTGATCACGCCAAAGAAGA
CAGGTTGAAAAAGTTGGCTGATCGCATTATCATGGAAGCACTCGGTTACGTCTAC
GCCCTTGATGAGCGCGGTAAAGGGAAGTGGGTAGCCAAGTATCCCCCATGTCAGC
TGATCCTGCTCGAGGAACTTTCTGAGTATCAGTTCAATAACGACCGTCCTCCCTC
CGAAAATAATCAGCTCATGCAATGGTCCCACCGGGGTGTGTTCCAAGAACTGATC
AATCAGGCTCAGGTGCACGACCTCCTCGTAGGCACTATGTATGCAGCCTTTAGCT
CCCGTTTTGACGCGCGCACAGGCGCCCCTGGAATACGATGTAGGCGAGTTCCCGC
ACGGTGCACTCAAGAACATAACCCGGAGCCTTTCCCATGGTGGCTCAATAAGTTT
GTTGTGGAGCATACCCTCGACGCTTGCCCATTGAGGGCGGATGACTTGATTCCCA
CAGGCGAGGGGGAGATCTTCGTGAGCCCATTTTCTGCCGAAGAAGGGGATTTCCA
CCAAATACATGCCGACTTGAATGCTGCCCAAAATCTGCAGCAAAGGCTGTGGTCA
GACTTCGACATCTCGCAAATCAGACTGCGGTGTGACTGGGGCGAAGTAGACGGCG
AGCTGGTGCTGATACCTAGACTGACGGGTAAGCGTACCGCCGATAGCTATAGTAA
TAAGGTTTTTTATACGAATACGGGGGTGACATATTACGAGCGTGAGAGAGGCAAG
AAGCGTCGGAAGGTGTTCGCGCAGGAGAAGCTGAGCGAAGAGGAGGCGGAGCTAC
TGGTAGAGGCAGATGAGGCAAGAGAAAAGTCCGTCGTCCTGATGCGGGATCCTAG
CGGGATTATTAACAGAGGTAATTGGACACGGCAGAAAGAATTCTGGAGCATGGTG
AATCAAAGAATCGAGGGTTACCTGGTGAAGCAAATTCGAAGCCGGGTGCCCCTTC
AAGACAGCGCATGTGAAAACACTGGGGACATCTAG
SEQ ATGGCTACTCGGTCCTTCATCCTGAAAATCGAGCCAAATGAAGAGGTGAAAAAGG
ID GCCTGTGGAAGACCCATGAGGTACTTAACCACGGCATAGCATACTATATGAATAT
NO: CCTAAAACTTATACGGCAGGAGGCTATCTACGAGCATCACGAGCAAGATCCTAAA
160 AATCCAAAGAAGGTTAGTAAGGCTGAAATCCAGGCTGAATTGTGGGACTTCGTGC
TGAAGATGCAGAAATGCAACAGTTTCACGCATGAAGTTGATAAGGACGTCGTGTT
TAATATACTCCGGGAGCTGTACGAAGAACTGGTACCAAGCTCTGTGGAAAAGAAA
GGAGAGGCCAACCAGCTAAGTAATAAGTTCCTCTATCCTCTCGTGGACCCCAATT
CACAGAGCGGCAAAGGTACCGCATCTTCTGGGAGGAAACCACGCTGGTACAACTT
GAAGATCGCTGGCGATCCCAGCTGGGAGGAGGAAAAGAAGAAATGGGAAGAGGAT
AAAAAGAAAGACCCCCTGGCCAAAATCTTAGGCAAGCTCGCCGAGTACGGTCTGA
TTCCACTTTTCATCCCGTTCACAGATAGCAATGAGCCGATCGTCAAGGAGATTAA
GTGGATGGAAAAGAGCCGCAATCAGAGTGTGCGGAGGCTGGACAAAGACATGTTT
ATTCAGGCCCTGGAACGCTTCCTTAGCTGGGAAAGCTGGAACCTGAAGGTTAAGG
AAGAGTACGAAAAAGTCGAGAAGGAGCATAAGACTTTGGAGGAGCGCATCAAAGA
AGACATCCAGGCCTTTAAGTCTCTAGAACAGTATGAGAAAGAACGGCAGGAACAG
CTGCTGCGTGATACACTGAACACAAACGAATATCGCCTGAGCAAGAGGGGACTCA
GAGGCTGGAGAGAAATCATTCAAAAGTGGCTCAAAATGGATGAAAATGAGCCGTC
TGAAAAATACCTTGAAGTTTTCAAGGACTACCAGCGGAAGCACCCTAGAGAAGCC
GGCGACTATAGTGTTTACGAATTCTTGAGCAAGAAGGAGAATCATTTTATATGGA
GGAATCACCCGGAGTACCCATATCTGTACGCAACCTTCTGCGAAATCGACAAGAA
AAAAAAAGACGCCAAGCAACAGGCTACATTTACTCTGGCCGACCCTATCAATCAC
CCTCTATGGGTCCGGTTTGAGGAGCGCTCCGGAAGCAATCTGAATAAATATCGTA
TTCTGACTGAACAGTTACACACAGAGAAGCTCAAGAAGAAACTTACGGTGCAGCT
GGACCGCCTGATATACCCAACAGAGTCCGGAGGATGGGAAGAGAAAGGAAAGGTT
GACATCGTACTGCTTCCATCTCGTCAGTTTTACAACCAGATATTCCTGGACATCG
AGGAGAAGGGGAAACACGCCTTCACATACAAGGACGAGTCCATAAAGTTCCCACT
GAAGGGTACTTTAGGCGGTGCTAGGGTGCAGTTCGACCGCGATCACCTGAGACGG
TACCCCCACAAGGTGGAGAGCGGGAACGTGGGACGAATCTACTTTAATATGACAG
TGAACATTGAACCCACAGAGAGTCCAGTTAGTAAATCCCTGAAAATTCACCGTGA
CGACTTTCCGAAATTTGTGAATTTCAAGCCAAAGGAGCTTACGGAGTGGATCAAG
GATTCAAAGGGAAAGAAGCTGAAATCTGGTATCGAATCTCTCGAGATCGGTCTCC
GTGTCATGAGCATCGATCTGGGACAGCGCCAGGCAGCTGCCGCCAGTATATTCGA
GGTGGTAGACCAAAAGCCTGACATCGAGGGAAAGCTCTTCTTCCCAATCAAAGGC
ACAGAGCTGTATGCGGTGCACCGGGCGTCCTTTAATATAAAGCTGCCCGGTGAAA
CCCTGGTGAAGTCACGGGAGGTGCTTAGAAAAGCGCGAGAGGATAACCTCAAACT
GATGAACCAAAAACTGAACTTTCTGAGGAACGTCCTGCACTTTCAGCAGTTCGAA
GATATTACCGAACGCGAAAAGAGAGTAACCAAGTGGATATCTCGTCAAGAGAACA
GCGACGTCCCGTTAGTCTATCAGGACGAACTCATCCAAATACGGGAGTTGATGTA
TAAGCCCTACAAGGATTGGGTCGCCTTTCTTAAGCAGCTTCACAAACGCCTAGAG
GTCGAAATAGGTAAAGAGGTGAAACATTGGCGGAAGTCGCTCAGCGACGGGAGGA
AGGGACTTTATGGCATCTCTTTGAAGAACATTGACGAAATCGATAGAACCAGAAA
ATTTTTGTTGAGATGGTCCCTCCGACCCACCGAGCCTGGAGAGGTGAGGCGGTTA
GAACCAGGACAGAGGTTCGCTATCGATCAGCTGAATCACCTCAATGCTCTGAAGG
AGGACCGCCTCAAGAAAATGGCCAATACAATCATAATGCACGCCCTTGGCTACTG
CTACGACGTCCGAAAGAAGAAGTGGCAGGCCAAGAATCCCGCCTGTCAAATTATC
CTTTTTGAGGATCTTAGCAATTACAACCCCTATGAAGAGCGGTCCAGATTCGAAA
ATAGTAAGCTCATGAAGTGGAGCCGCAGGGAGATCCCGCGCCAAGTGGCCCTTCA
GGGGGAAATTTATGGGCTGCAGGTAGGCGAGGTCGGGGCCCAATTCTCCTCGCGC
TTTCATGCGAAAACTGGAAGTCCTGGAATCCGGTGCTCAGTGGTGACAAAGGAGA
AGTTGCAAGACAATCGGTTTTTTAAAAACTTACAGCGGGAGGGAAGGCTGACCCT
GGATAAGATAGCCGTACTTAAGGAAGGAGATCTGTACCCTGACAAAGGCGGTGAA
AAGTTCATTAGCTTGAGCAAGGACCGAAAACTTGTGACCACCCACGCTGACATCA
ATGCGGCACAGAACCTGCAGAAGAGATTTTGGACTCGCACCCACGGATTCTACAA
AGTTTACTGCAAAGCATATCAAGTAGACGGACAGACCGTATACATCCCCGAGTCC
AAAGATCAGAAGCAGAAAATTATTGAAGAGTTTGGGGAAGGGTACTTTATCCTGA
AGGATGGTGTCTACGAATGGGGCAACGCTGGTAAACTTAAAATTAAGAAGGGCAG
CTCTAAACAGTCCTCCAGCGAGTTAGTTGATTCTGATATTCTGAAAGACAGTTTC
GACCTGGCCAGCGAACTTAAAGGGGAAAAATTAATGCTGTACCGGGACCCCAGCG
GAAACGTCTTTCCATCCGATAAGTGGATGGCCGCTGGAGTGTTCTTTGGCAAGTT
AGAGAGGATTCTCATAAGTAAGCTGACCAACCAATACTCAATCTCCACAATCGAG
GATGACTCATCCAAGCAGTCTATGTGA
SEQ ATGCCTACACGCACTATCAACCTGAAACTGGTTCTTGGCAAGAATCCAGAGAATG
ID CTACCCTTCGTCGGGCACTATTTTCAACGCATAGACTGGTGAATCAGGCTACCAA
NO: ACGGATTGAAGAGTTCCTCTTGCTTTGTCGGGGGGAAGCATATAGGACGGTGGAT
161 AATGAGGGGAAAGAGGCTGAAATTCCGAGACACGCCGTGCAGGAGGAAGCTCTTG
CGTTTGCAAAGGCCGCTCAACGGCACAATGGTTGCATCTCTACTTATGAAGACCA
GGAAATCCTGGATGTGCTCCGGCAACTGTATGAAAGGCTGGTGCCTTCTGTGAAT
GAAAATAATGAAGCAGGGGACGCTCAAGCCGCAAACGCGTGGGTGTCGCCACTGA
TGTCCGCCGAGTCCGAGGGAGGGCTCAGCGTTTACGACAAGGTGCTGGACCCACC
CCCAGTGTGGATGAAACTCAAAGAGGAAAAAGCTCCGGGCTGGGAGGCTGCTTCC
CAGATCTGGATCCAGTCCGACGAAGGGCAGTCCCTTCTTAACAAGCCTGGTTCGC
CCCCGCGGTGGATTAGGAAACTGAGGTCAGGCCAGCCTTGGCAGGACGATTTTGT
TAGCGACCAGAAAAAGAAGCAGGACGAGCTGACAAAGGGGAATGCGCCACTGATC
AAACAATTAAAGGAAATGGGCTTATTGCCTCTTGTGAATCCCTTTTTTAGACATC
TGCTTGACCCGGAGGGGAAGGGGGTGTCACCTTGGGACAGACTCGCTGTTAGGGC
CGCTGTCGCTCATTTCATATCATGGGAATCATGGAACCACCGGACACGCGCCGAA
TACAATAGTTTGAAGCTGCGGAGGGATGAGTTCGAAGCAGCTTCCGACGAATTCA
AGGACGACTTCACGCTGCTTCGGCAGTACGAGGCTAAGAGGCACTCCACACTGAA
GAGTATAGCTTTAGCCGATGATTCAAACCCTTATAGGATCGGCGTACGCTCCCTC
CGCGCTTGGAACCGCGTCCGCGAGGAGTGGATCGACAAGGGAGCGACCGAGGAGC
AGCGGGTCACCATTCTCAGCAAGTTGCAGACCCAACTAAGGGGCAAATTTGGAGA
TCCTGACTTGTTCAACTGGCTGGCGCAGGACCGGCACGTGCACCTCTGGAGCCCT
AGAGATAGTGTTACCCCACTGGTTAGGATCAACGCTGTTGACAAAGTATTGCGAC
GGAGAAAACCGTACGCCTTGATGACTTTTGCCCACCCAAGATTCCACCCTCGGTG
GATACTTTACGAAGCCCCAGGGGGCAGCAATCTCCGCCAGTATGCACTGGATTGT
ACCGAAAATGCTCTGCACATTACACTGCCTCTGCTGGTTGACGATGCACATGGCA
CATGGATTGAGAAAAAAATTAGGGTTCCTCTTGCCCCCAGCGGCCAGATTCAGGA
CCTGACACTAGAAAAGCTCGAGAAGAAGAAAAATCGTCTCTACTACCGTTCTGGG
TTCCAGCAGTTTGCCGGCCTGGCCGGAGGTGCCGAGGTGCTTTTCCATCGACCAT
ACATGGAGCACGATGAGAGGAGCGAGGAGAGCTTATTAGAACGCCCTGGTGCTGT
TTGGTTCAAACTCACCTTGGACGTGGCAACCCAGGCCCCTCCAAACTGGTTGGAC
GGAAAGGGCCGCGTCCGAACGCCCCCCGAGGTTCACCACTTCAAGACAGCCCTCA
GTAACAAGTCTAAGCACACACGGACCCTCCAGCCCGGACTCAGAGTGTTATCCGT
GGATCTGGGAATGCGCACCTTCGCCTCTTGCTCCGTATTTGAGCTGATCGAGGGC
AAACCAGAGACTGGCAGAGCGTTCCCTGTGGCCGACGAACGTTCCATGGATTCAC
CAAACAAGCTGTGGGCCAAGCACGAAAGATCCTTTAAACTCACGCTCCCCGGCGA
AACCCCCAGTCGGAAAGAAGAGGAGGAACGGAGCATTGCAAGAGCCGAAATCTAT
GCGTTGAAAAGAGATATTCAGAGATTAAAAAGTCTTCTGCGCCTGGGGGAAGAGG
ATAACGATAATAGACGCGATGCACTTCTTGAGCAATTTTTCAAGGGCTGGGGCGA
GGAAGACGTGGTTCCAGGTCAGGCCTTTCCCCGGAGTCTGTTCCAGGGGCTGGGG
GCCGCCCCATTCAGATCCACCCCTGAGTTGTGGAGACAACACTGTCAAACCTATT
ATGATAAAGCAGAGGCGTGCCTGGCTAAACACATCAGCGATTGGCGCAAGAGAAC
CAGGCCTAGGCCTACCTCACGTGAGATGTGGTACAAGACACGCTCTTATCACGGC
GGAAAGTCAATCTGGATGCTGGAATACCTCGACGCTGTGAGGAAACTGCTCTTAT
CCTGGAGCCTCAGAGGCCGGACCTACGGGGCTATCAACAGACAGGACACAGCAAG
GTTCGGGAGCTTAGCCAGCCGGCTCCTTCACCACATTAACTCACTCAAAGAGGAT
CGAATAAAGACCGGAGCCGACTCGATCGTGCAGGCAGCCCGAGGGTACATCCCCC
TGCCTCATGGGAAGGGCTGGGAGCAGCGATATGAACCCTGCCAGCTGATCTTGTT
TGAGGACCTTGCCCGTTATAGATTTCGCGTTGATAGACCTCGCCGTGAGAATTCT
CAGCTGATGCAGTGGAACCACAGAGCGATCGTGGCTGAGACCACTATGCAGGCCG
AGCTGTATGGACAGATCGTGGAGAACACCGCCGCAGGGTTCAGTTCTCGGTTTCA
TGCTGCCACCGGAGCTCCCGGCGTCCGGTGCCGCTTCCTCTTAGAGCGTGATTTT
GACAATGACCTCCCAAAGCCCTATCTGCTGAGGGAACTGAGCTGGATGCTGGGGA
ACACAAAAGTAGAATCGGAGGAGGAGAAGCTACGGCTCCTCTCCGAAAAGATACG
TCCAGGCTCTCTGGTACCATGGGACGGAGGAGAGCAGTTCGCGACACTGCATCCT
AAGAGACAGACGTTATGTGTGATTCACGCCGATATGAACGCCGCTCAGAATCTGC
AGCGAAGATTCTTTGGCCGCTGCGGCGAAGCCTTCAGGCTGGTATGTCAGCCCCA
CGGGGATGATGTGCTGCGGCTGGCCTCAACCCCTGGGGCTAGACTCTTGGGGGCA
CTCCAGCAGCTGGAAAATGGCCAAGGGGCTTTCGAACTCGTTCGGGACATGGGCA
GCACAAGCCAGATGAACAGATTCGTCATGAAGAGCCTGGGAAAGAAAAAGATCAA
ACCCTTACAGGACAATAATGGCGACGACGAACTGGAGGACGTGTTGTCCGTGCTG
CCAGAGGAAGACGACACAGGCCGCATCACTGTCTTCCGCGACTCAAGTGGGATAT
TCTTTCCTTGCAACGTGTGGATTCCGGCCAAACAGTTCTGGCCTGCCGTCAGAGC
CATGATTTGGAAAGTGATGGCTAGTCATTCATTGGGATGA
SEQ ATGACAAAGCTGAGGCACAGACAAAAGAAGCTTACACACGACTGGGCAGGGAGCA
ID AGAAACGTGAGGTCCTTGGGTCAAATGGAAAACTGCAGAACCCCTTGCTCATGCC
NO: TGTAAAGAAGGGGCAGGTAACAGAATTTAGAAAAGCATTCTCCGCGTACGCTCGG
162 GCAACTAAGGGGGAAATGACCGATGGACGGAAGAACATGTTCACCCATTCTTTCG
AGCCATTCAAAACAAAGCCGTCATTGCACCAATGCGAGCTGGCCGATAAGGCTTA
CCAGTCTTTGCATAGTTACCTCCCCGGTTCCCTGGCCCATTTCTTGCTTTCCGCA
CACGCACTGGGCTTTCGTATTTTCTCTAAATCTGGGGAGGCAACTGCCTTCCAGG
CCAGCTCAAAAATCGAGGCCTATGAGTCCAAGCTCGCTTCGGAGCTAGCCTGTGT
CGATTTGAGTATCCAGAATTTGACGATTAGTACTCTTTTCAACGCTCTCACAACT
TCAGTTCGGGGCAAGGGGGAGGAAACTTCAGCAGATCCCCTTATCGCACGGTTCT
ACACTCTCCTGACGGGCAAGCCCCTGAGCCGAGACACACAGGGCCCAGAACGGGA
CTTGGCAGAGGTCATCTCCAGAAAGATCGCCTCGTCCTTCGGCACATGGAAGGAA
ATGACTGCCAACCCTCTGCAGAGCCTCCAGTTCTTCGAAGAAGAGCTTCATGCAC
TAGATGCCAACGTGTCTTTATCTCCAGCTTTTGATGTGTTAATCAAGATGAATGA
TCTCCAAGGTGATCTGAAGAACCGTACTATAGTGTTCGACCCAGATGCACCCGTG
TTCGAGTACAACGCTGAGGATCCAGCCGATATCATCATAAAGCTGACAGCTCGGT
ATGCGAAGGAGGCCGTCATCAAGAATCAGAACGTGGGCAATTATGTGAAAAACGC
CATTACCACCACTAATGCCAATGGGCTGGGGTGGCTCCTCAATAAAGGGCTTTCA
CTACTGCCAGTTTCTACTGACGATGAGCTGCTCGAATTCATTGGGGTGGAGAGAA
GCCATCCCAGCTGTCACGCGCTGATAGAGCTGATTGCCCAGCTAGAGGCGCCGGA
ACTGTTTGAGAAGAATGTGTTTAGTGACACCCGTTCCGAGGTTCAGGGTATGATC
GACAGTGCAGTGTCGAACCACATTGCTCGGCTGTCCAGCAGCCGAAACTCCCTGA
GCATGGACAGCGAGGAATTGGAACGCTTGATTAAATCTTTCCAGATTCATACTCC
CCATTGTTCTCTGTTCATAGGCGCTCAGTCCTTATCTCAGCAGCTGGAGAGCTTA
CCTGAGGCGCTGCAGTCCGGAGTGAACAGCGCTGATATCTTATTAGGCAGCACAC
AGTATATGCTGACCAACTCTCTCGTTGAAGAGTCAATTGCAACATATCAAAGGAC
ATTAAATAGGATCAATTACCTGAGTGGGGTGGCTGGGCAGATTAACGGTGCTATC
AAAAGAAAGGCAATCGACGGCGAAAAAATACACCTGCCTGCCGCCTGGAGTGAGC
TCATCTCCTTACCTTTCATTGGACAGCCGGTGATTGATGTGGAGAGCGACCTGGC
ACACTTAAAAAACCAGTACCAGACCCTGTCCAATGAATTTGACACCCTCATTTCG
GCCCTGCAGAAGAACTTCGATTTGAATTTCAACAAAGCACTCCTTAACCGCACGC
AGCATTTCGAGGCAATGTGCCGGAGCACAAAAAAAAATGCTTTATCTAAGCCCGA
GATCGTGTCCTACAGAGATCTGCTGGCGCGGCTGACCAGTTGCCTTTATCGAGGC
TCGCTGGTTCTCAGAAGGGCGGGAATCGAAGTTCTGAAAAAGCACAAAATCTTTG
AGTCGAATAGTGAGCTGAGAGAACACGTCCACGAGCGAAAGCACTTCGTGTTCGT
TAGTCCATTGGACAGAAAGGCAAAAAAACTGTTGCGCCTGACCGATTCCCGCCCT
GACTTGCTCCATGTGATCGATGAGATCCTGCAACATGACAATCTGGAGAATAAGG
ACAGAGAGTCCCTTTGGCTGGTCCGGTCTGGGTACCTCCTTGCTGGTCTGCCGGA
CCAGCTGAGTTCTTCGTTTATCAATCTCCCCATAATCACGCAAAAGGGCGATCGC
CGGCTGATTGACCTGATTCAGTATGACCAGATCAATCGCGATGCTTTCGTAATGT
TGGTGACAAGTGCTTTCAAAAGCAATCTCTCTGGGTTGCAGTACCGCGCTAACAA
GCAGTCTTTCGTGGTCACCCGCACCCTGTCTCCTTACCTGGGTAGTAAGCTCGTA
TACGTCCCTAAAGACAAAGATTGGCTGGTCCCATCCCAGATGTTTGAGGGAAGAT
TCGCCGATATTCTGCAGAGTGACTACATGGTCTGGAAGGATGCCGGACGCCTGTG
CGTGATCGACACTGCCAAACATCTCTCTAACATTAAAAAAAGCGTGTTTAGTAGC
GAAGAAGTCCTTGCTTTTCTTCGAGAGCTGCCTCACCGGACCTTCATCCAGACCG
AGGTACGGGGGTTAGGAGTGAACGTCGATGGAATCGCATTTAATAACGGGGATAT
CCCGAGCTTGAAGACATTCTCGAATTGTGTGCAGGTGAAGGTGAGTAGGACTAAT
ACTAGTCTCGTGCAGACTCTAAACAGGTGGTTCGAGGGTGGCAAAGTGTCACCTC
CCTCTATTCAGTTCGAAAGAGCTTACTACAAAAAAGACGATCAGATTCACGAGGA
CGCAGCCAAGAGAAAGATACGCTTCCAGATGCCAGCAACGGAATTAGTGCACGCC
AGCGATGACGCTGGTTGGACCCCCAGCTACCTGCTGGGCATCGACCCCGGTGAGT
ACGGAATGGGTCTCAGTTTGGTGTCCATCAACAATGGAGAGGTCCTGGATTCTGG
ATTCATCCACATTAATTCCCTGATCAATTTCGCGTCCAAAAAAAGCAATCACCAG
ACCAAAGTAGTCCCCCGCCAGCAGTACAAGTCCCCCTACGCGAATTATCTCGAGC
AGTCAAAGGATTCAGCAGCAGGGGATATAGCTCACATTCTGGATCGGCTAATCTA
CAAATTGAACGCCTTGCCTGTGTTCGAGGCGCTGTCTGGCAACAGTCAGAGTGCT
GCTGATCAGGTATGGACCAAAGTTCTATCCTTCTATACATGGGGAGACAACGACG
CACAGAACAGTATACGGAAGCAGCACTGGTTCGGTGCCTCACACTGGGATATTAA
GGGGATGCTGCGCCAACCCCCAACCGAAAAAAAACCCAAACCATATATAGCCTTT
CCCGGGAGTCAAGTGTCATCCTATGGAAATAGTCAAAGGTGTAGTTGTTGCGGCC
GCAATCCCATTGAGCAGTTGCGTGAGATGGCAAAGGACACGAGTATCAAGGAGCT
GAAAATCCGAAATAGTGAGATCCAACTATTCGATGGTACAATCAAGCTGTTTAAC
CCCGACCCTTCCACCGTCATCGAGAGGCGGCGGCATAACCTAGGACCCTCACGCA
TTCCTGTGGCAGACCGAACTTTCAAGAATATTAGCCCTTCTTCGTTAGAGTTCAA
GGAGCTCATTACTATCGTTTCTCGAAGCATCCGCCATAGCCCCGAATTTATTGCT
AAGAAACGGGGTATCGGGTCTGAGTACTTTTGTGCTTATTCTGACTGCAACTCCT
CACTGAACTCAGAGGCCAATGCCGCGGCCAATGTGGCACAGAAGTTTCAGAAGCA
ACTCTTTTTCGAACTCTGA
SEQ ATGAAACGTATTCTGAACTCTCTGAAAGTCGCCGCACTGAGGCTGCTGTTTCGAG
ID GAAAGGGCTCAGAGCTGGTGAAGACCGTCAAGTACCCTCTGGTTTCGCCCGTCCA
NO: GGGTGCTGTGGAAGAACTCGCCGAAGCAATACGCCACGACAACCTACATTTATTT
163 GGGCAGAAGGAAATCGTAGATCTGATGGAGAAGGACGAGGGCACCCAGGTCTACT
CGGTGGTGGACTTTTGGCTCGACACACTCCGTCTAGGGATGTTCTTCAGTCCAAG
TGCTAATGCCCTTAAGATCACTCTGGGGAAGTTTAACAGCGACCAAGTTTCCCCT
TTCAGGAAGGTTCTGGAGCAGTCCCCTTTCTTTCTCGCGGGTAGACTCAAAGTGG
AGCCCGCTGAACGTATCCTCAGCGTGGAGATCCGCAAGATCGGTAAGAGGGAGAA
TAGAGTGGAGAACTACGCCGCAGATGTAGAGACTTGTTTTATCGGTCAGCTGTCT
AGTGATGAAAAGCAGTCTATCCAGAAGCTCGCTAACGATATCTGGGACTCTAAGG
ATCACGAAGAGCAAAGGATGCTTAAGGCGGATTTCTTTGCCATTCCCCTCATCAA
AGACCCAAAGGCAGTGACCGAGGAAGATCCCGAGAATGAAACCGCAGGCAAACAG
AAGCCTCTCGAATTATGTGTGTGCTTAGTGCCCGAGTTGTACACCCGCGGGTTCG
GTTCAATAGCGGACTTCCTGGTCCAGCGTCTGACACTATTAAGAGACAAAATGAG
CACAGACACAGCAGAAGACTGCCTTGAGTATGTCGGCATAGAGGAGGAGAAGGGT
AATGGGATGAACTCGCTGCTGGGGACGTTCCTCAAGAACCTGCAGGGAGACGGGT
TCGAACAGATCTTCCAATTTATGCTCGGCAGTTACGTGGGATGGCAAGGTAAGGA
AGACGTCCTACGCGAACGGCTTGATTTGCTAGCGGAGAAGGTTAAAAGACTGCCG
AAACCTAAGTTTGCCGGCGAGTGGTCCGGCCATCGGATGTTCCTGCATGGTCAAT
TGAAGAGCTGGTCCTCTAACTTTTTCCGCCTGTTTAACGAGACTAGGGAGCTCCT
CGAAAGCATAAAATCCGACATCCAACACGCGACCATGTTAATCAGCTACGTCGAA
GAGAAAGGGGGATACCACCCACAACTCTTGTCACAGTACAGGAAACTAATGGAGC
AGCTGCCAGCTCTCAGAACAAAGGTGTTAGATCCAGAGATAGAAATGACTCACAT
GAGCGAGGCGGTAAGGTCGTACATTATGATCCACAAGTCGGTAGCAGGATTTCTG
CCTGACTTACTCGAGTCCCTCGATAGGGACAAGGACAGGGAATTCCTGCTGAGTA
TATTTCCAAGGATCCCCAAAATTGACAAAAAAACTAAGGAAATCGTGGCCTGGGA
GCTCCCAGGCGAGCCCGAAGAAGGATACCTGTTCACTGCCAATAATCTTTTTCGC
AACTTTCTGGAGAATCCTAAACATGTTCCACGTTTCATGGCAGAAAGGATCCCGG
AAGATTGGACGCGCCTGCGGTCCGCTCCCGTATGGTTTGACGGCATGGTGAAACA
ATGGCAGAAAGTGGTAAACCAGCTGGTGGAGTCACCTGGAGCATTGTATCAGTTC
AATGAAAGCTTTCTCCGACAACGTTTACAGGCAATGCTGACAGTGTATAAGAGAG
ACCTGCAGACAGAGAAATTCCTTAAGTTGTTGGCTGATGTCTGCAGGCCTCTGGT
GGACTTCTTTGGGCTGGGGGGAAACGATATCATCTTCAAAAGCTGCCAGGACCCG
AGGAAACAATGGCAAACTGTCATTCCCTTGAGTGTCCCCGCTGATGTGTACACCG
CGTGTGAGGGGCTGGCAATCCGGCTTCGTGAGACATTGGGATTTGAGTGGAAGAA
CCTTAAGGGCCATGAAAGGGAGGACTTTCTAAGACTGCACCAGCTTTTAGGGAAT
CTGCTTTTCTGGATTCGAGATGCCAAACTGGTGGTGAAATTGGAAGATTGGATGA
ATAATCCCTGTGTTCAGGAGTACGTTGAGGCTCGTAAGGCCATTGATCTCCCACT
GGAGATCTTCGGCTTTGAGGTCCCCATCTTCCTGAACGGATATCTGTTTAGTGAA
CTGAGGCAGTTAGAACTGCTGCTCCGCCGTAAGTCGGTTATGACCAGCTATTCGG
TTAAGACAACTGGCAGTCCAAACAGGCTTTTCCAGTTAGTCTACCTGCCATTAAA
TCCTTCCGACCCTGAGAAAAAAAATTCTAATAACTTTCAGGAACGCCTGGACACC
CCCACTGGCTTATCACGTCGCTTCCTGGACCTTACTCTGGACGCCTTCGCCGGCA
AGTTGCTGACAGACCCCGTGACTCAAGAGCTTAAAACTATGGCTGGGTTCTACGA
TCACCTGTTTGGTTTCAAGCTCCCATGTAAGCTGGCAGCCATGTCTAACCACCCT
GGCTCTAGCAGCAAGATGGTCGTGTTGGCCAAACCTAAAAAAGGGGTTGCATCTA
ATATAGGATTCGAACCAATCCCTGATCCCGCGCACCCCGTATTCCGGGTGAGATC
ATCATGGCCAGAGCTGAAGTATCTGGAGGGGTTACTGTATCTTCCAGAAGACACT
CCACTGACAATAGAGCTCGCAGAGACAAGTGTTAGTTGTCAGAGCGTCAGTAGCG
TGGCATTCGATCTGAAAAATCTGACTACTATCCTTGGACGCGTGGGTGAGTTCCG
TGTGACCGCAGACCAGCCTTTTAAGTTGACCCCCATCATCCCTGAGAAGGAGGAG
TCCTTCATAGGAAAAACATATCTAGGCCTTGATGCCGGGGAACGCTCAGGCGTAG
GGTTCGCTATCGTCACAGTCGACGGGGATGGGTACGAGGTACAGCGCCTGGGGGT
GCATGAAGATACACAGCTGATGGCCCTACAGCAGGTGGCCTCTAAAAGCTTGAAG
GAGCCGGTGTTCCAGCCGCTCAGAAAGGGTACTTTTCGGCAGCAGGAACGTATTA
GAAAATCTCTCAGAGGATGTTATTGGAACTTCTATCACGCTCTGATGATTAAGTA
CCGCGCCAAGGTAGTGCACGAAGAGAGCGTGGGCAGTTCCGGCCTGGTTGGGCAG
TGGTTACGAGCATTCCAGAAGGACCTCAAGAAAGCCGATGTGTTGCCAAAAAAGG
GAGGCAAAAACGGAGTCGATAAGAAAAAGAGAGAGTCTTCTGCACAAGACACATT
GTGGGGAGGGGCTTTTAGCAAGAAGGAAGAACAGCAGATAGCTTTCGAAGTCCAA
GCTGCTGGTTCTAGCCAGTTCTGCCTGAAGTGCGGATGGTGGTTCCAACTCGGAA
TGCGTGAGGTTAATCGCGTGCAGGAATCCGGCGTCGTGCTGGATTGGAATCGGAG
TATTGTCACATTCCTGATTGAGAGCTCTGGCGAGAAAGTGTATGGGTTCTCCCCT
CAGCAACTCGAAAAGGGGTTCAGACCAGACATTGAAACCTTCAAGAAGATGGTTC
GGGATTTCATGCGCCCGCCTATGTTTGACCGGAAGGGTCGCCCAGCAGCTGCCTA
CGAAAGGTTTGTCTTGGGACGCCGGCATCGGCGGTATAGATTCGACAAGGTTTTT
GAAGAACGATTCGGACGATCCGCGCTATTCATTTGCCCGAGGGTTGGCTGTGGCA
ACTTTGACCACAGCAGCGAGCAGTCAGCCGTAGTGCTGGCTCTAATCGGATATAT
TGCCGACAAAGAGGGGATGAGCGGAAAAAAGCTAGTCTACGTGCGTCTGGCAGAA
CTAATGGCGGAATGGAAATTGAAGAAACTGGAGAGGAGTAGAGTTGAGGAGCAAA
GCTCCGCTCAGTGA
SEQ ATGGCGGAGTCGAAGCAAATGCAGTGCAGGAAGTGTGGAGCCTCTATGAAGTACG
ID AAGTGATCGGCCTCGGGAAGAAAAGCTGCAGATATATGTGTCCCGACTGCGGGAA
NO: TCACACATCTGCAAGAAAGATTCAGAATAAGAAGAAAAGGGACAAGAAGTATGGA
164 TCTGCCAGTAAAGCACAAAGCCAACGAATCGCAGTTGCAGGGGCCTTATACCCGG
ATAAAAAGGTTCAGACCATCAAGACTTATAAGTATCCAGCCGACCTGAATGGTGA
GGTCCATGACTCAGGGGTGGCCGAAAAAATAGCCCAAGCAATCCAGGAGGATGAA
ATAGGGCTCCTCGGCCCCTCTTCCGAGTACGCCTGTTGGATCGCTAGCCAGAAAC
AGAGCGAGCCCTACAGTGTTGTAGACTTTTGGTTTGACGCTGTGTGCGCCGGAGG
CGTGTTCGCCTATTCTGGGGCTAGATTGCTGTCTACCGTCCTGCAGCTATCTGGG
GAGGAGAGCGTCCTACGCGCAGCCCTGGCATCCTCCCCTTTTGTCGACGATATCA
ATCTGGCACAGGCCGAAAAATTTCTGGCGGTGTCCAGGCGAACCGGCCAAGATAA
GCTGGGGAAGCGCATTGGAGAGTGCTTCGCAGAGGGCCGACTTGAGGCCCTAGGC
ATCAAGGACCGGATGCGTGAATTTGTCCAGGCTATCGATGTCGCTCAGACCGCTG
GGCAGCGTTTTGCCGCGAAACTGAAAATCTTTGGGATTTCTCAGATGCCCGAGGC
AAAGCAGTGGAACAATGACAGCGGACTCACCGTGTGCATCCTGCCCGACTATTAC
GTCCCAGAAGAAAATCGCGCAGATCAGTTGGTCGTCCTGCTAAGACGACTGAGAG
AGATAGCATACTGTATGGGGATCGAAGATGAGGCCGGTTTTGAACATCTTGGAAT
TGATCCTGGCGCACTATCAAATTTTTCCAATGGCAATCCTAAACGCGGATTTTTG
GGCCGCCTGCTGAACAATGATATTATTGCCTTAGCGAACAACATGTCCGCCATGA
CGCCTTACTGGGAGGGCAGGAAGGGAGAACTGATTGAAAGATTGGCTTGGCTGAA
GCACCGTGCAGAGGGGCTTTATCTGAAGGAACCGCATTTTGGAAATAGTTGGGCC
GACCATAGGTCTAGAATTTTTTCCAGAATAGCCGGGTGGCTTTCTGGGTGCGCTG
GGAAGCTAAAGATCGCCAAAGACCAGATCAGCGGAGTGCGTACTGATCTGTTCCT
TCTGAAGAGACTGCTGGATGCGGTCCCGCAGTCCGCCCCTTCTCCCGACTTCATA
GCCTCTATCTCTGCCTTGGATCGCTTCCTGGAGGCCGCAGAATCTAGTCAGGATC
CTGCCGAACAGGTGAGGGCCCTATACGCCTTTCATCTGAACGCACCCGCGGTGCG
AAGCATCGCCAACAAGGCAGTCCAGCGATCCGACAGCCAAGAATGGCTTATAAAG
GAACTGGACGCTGTGGACCACCTGGAGTTTAACAAGGCCTTTCCCTTCTTCTCTG
ATACGGGAAAGAAGAAAAAGAAAGGGGCTAACTCGAATGGCGCTCCGTCCGAGGA
GGAGTACACCGAGACTGAGAGCATCCAGCAGCCCGAGGACGCTGAGCAAGAGGTT
AATGGTCAGGAAGGCAACGGGGCCTCGAAGAACCAGAAGAAGTTTCAGAGAATCC
CCCGATTCTTCGGCGAGGGGAGTCGCAGCGAGTATCGCATCCTCACTGAAGCCCC
GCAGTACTTCGACATGTTCTGTAACAACATGCGGGCCATCTTTATGCAATTAGAA
TCCCAACCGCGTAAAGCTCCCAGGGATTTTAAGTGTTTCCTGCAGAATCGGCTGC
AGAAATTGTATAAGCAGACATTCCTGAACGCTCGATCCAACAAGTGCCGGGCATT
ACTAGAGTCCGTATTGATTAGTTGGGGAGAGTTTTACACCTACGGGGCTAACGAG
AAAAAATTTCGACTGCGTCATGAAGCTTCTGAGCGCTCCTCGGACCCAGATTACG
TGGTGCAACAGGCGCTGGAGATCGCTCGGAGGCTGTTTCTCTTCGGCTTTGAGTG
GAGGGACTGTAGCGCAGGTGAAAGAGTGGATCTGGTCGAAATACATAAGAAAGCC
ATATCTTTCCTGTTGGCCATCACTCAGGCTGAGGTGTCTGTGGGCAGCTATAACT
GGCTGGGCAATTCTACCGTGAGTCGGTACCTGTCCGTGGCAGGGACTGATACCCT
TTACGGCACCCAGCTGGAAGAATTCTTAAATGCAACCGTGTTATCTCAGATGCGG
GGGCTGGCTATCAGGTTATCATCTCAGGAACTGAAGGATGGATTTGACGTACAGC
TGGAGTCTAGTTGCCAGGATAATCTGCAACACTTGCTCGTGTACAGGGCTTCACG
AGACCTTGCCGCCTGCAAGCGCGCTACTTGTCCAGCTGAGTTGGATCCTAAGATT
CTGGTACTGCCCGTGGGGGCCTTTATCGCTAGCGTGATGAAAATGATTGAAAGAG
GGGATGAGCCTTTAGCTGGAGCTTATCTGAGACACAGACCCCATAGTTTCGGGTG
GCAGATCCGCGTTCGAGGTGTGGCAGAGGTGGGAATGGACCAAGGGACCGCCCTG
GCGTTCCAGAAACCGACCGAGAGCGAACCCTTCAAGATAAAGCCGTTTTCCGCTC
AATACGGCCCCGTTCTATGGCTGAACAGCTCCAGTTATAGCCAGAGCCAGTACCT
GGACGGGTTCCTATCACAGCCCAAGAACTGGAGTATGCGGGTGCTGCCACAGGCC
GGCTCAGTGCGGGTAGAACAGCGCGTCGCCTTGATTTGGAATCTCCAGGCCGGAA
AGATGAGGCTGGAACGGAGCGGAGCGCGGGCTTTCTTCATGCCCGTCCCATTCAG
TTTCCGCCCCAGTGGCAGCGGCGACGAGGCAGTCCTGGCTCCAAATAGGTACCTG
GGACTCTTTCCACACAGCGGCGGCATAGAGTACGCTGTGGTCGATGTTCTTGACT
CTGCCGGCTTCAAAATACTCGAGAGAGGAACAATAGCCGTCAATGGCTTCTCCCA
GAAACGAGGAGAAAGACAAGAGGAAGCCCATCGCGAAAAACAAAGACGCGGTATC
TCCGATATTGGGCGCAAGAAGCCAGTCCAGGCCGAAGTCGATGCGGCCAACGAGC
TCCATCGAAAATACACCGATGTTGCTACTCGGCTGGGGTGTCGAATTGTCGTTCA
ATGGGCACCCCAACCCAAACCAGGCACTGCGCCGACCGCTCAGACTGTGTACGCT
AGGGCCGTGAGGACTGAAGCACCAAGATCCGGCAATCAGGAAGATCACGCCAGGA
TGAAATCTTCCTGGGGATACACATGGGGTACGTATTGGGAAAAAAGGAAGCCCGA
GGACATCCTCGGCATTAGTACCCAGGTGTATTGGACAGGCGGGATCGGCGAGTCC
TGCCCGGCTGTCGCCGTCGCGCTATTGGGACACATCAGGGCCACCTCAACCCAGA
CTGAATGGGAGAAAGAGGAAGTCGTGTTTGGGCGATTGAAAAAGTTCTTCCCATC
CTGA
SEQ ATGGAGAAGCGCATCAATAAAATTCGCAAGAAGCTGTCTGCCGATAACGCCACAA
ID AACCAGTTAGTCGAAGCGGCCCAATGAAGACCCTGCTAGTTCGAGTGATGACTGA
NO: TGATCTGAAGAAAAGGCTCGAAAAGCGACGCAAGAAGCCTGAGGTAATGCCTCAG
165 GTTATAAGTAACAATGCAGCAAACAATCTGCGGATGCTGCTTGACGATTACACAA
AGATGAAGGAAGCCATTCTCCAGGTGTATTGGCAGGAGTTCAAGGATGATCACGT
AGGCCTGATGTGTAAATTCGCGCAACCTGCAAGCAAGAAGATCGACCAAAACAAG
CTGAAACCCGAGATGGATGAAAAAGGCAATTTAACAACCGCCGGATTCGCTTGTT
CCCAGTGTGGGCAGCCACTGTTCGTGTACAAGTTAGAACAGGTGTCGGAAAAAGG
AAAGGCATACACTAACTACTTTGGACGGTGCAATGTTGCAGAACACGAAAAGCTG
ATACTGCTTGCCCAGCTTAAGCCCGAAAAAGACAGCGACGAAGCGGTGACCTACA
GCCTGGGAAAATTCGGGCAGCGGGCACTGGACTTCTATTCTATCCACGTTACCAA
GGAGAGCACCCACCCAGTGAAGCCGTTGGCCCAAATCGCTGGAAACCGGTACGCC
AGCGGACCAGTCGGCAAGGCCCTGTCCGATGCCTGTATGGGCACAATTGCTTCTT
TCCTGTCCAAGTACCAGGACATCATAATCGAGCACCAAAAAGTTGTGAAAGGGAA
TCAGAAACGCCTGGAATCCCTTCGAGAACTGGCCGGCAAGGAGAACCTTGAGTAC
CCGTCCGTGACCCTGCCTCCACAGCCACATACCAAAGAGGGCGTAGACGCGTATA
ATGAGGTCATTGCCCGCGTTCGCATGTGGGTTAATTTAAACCTGTGGCAGAAATT
AAAACTAAGCCGAGATGATGCTAAACCGTTACTGAGATTGAAGGGATTCCCTAGC
TTTCCTGTGGTGGAGAGAAGGGAAAACGAGGTTGATTGGTGGAATACTATTAATG
AGGTGAAAAAGCTTATTGACGCCAAGAGGGATATGGGCAGGGTGTTCTGGAGCGG
GGTGACTGCCGAAAAGAGAAATACCATCCTCGAGGGATACAATTACCTCCCCAAC
GAGAATGATCATAAGAAAAGAGAGGGGAGCTTAGAGAATCCAAAGAAACCTGCAA
AGAGGCAATTCGGTGATCTCCTGCTCTACCTCGAGAAGAAATACGCGGGGGACTG
GGGAAAAGTTTTTGACGAAGCCTGGGAGCGCATTGACAAGAAGATCGCCGGGCTG
ACGTCTCACATTGAACGGGAAGAGGCACGGAATGCAGAGGACGCCCAGTCTAAGG
CCGTGCTGACTGACTGGCTGCGCGCAAAGGCCTCCTTCGTGCTCGAACGTCTGAA
GGAAATGGATGAGAAAGAGTTTTACGCGTGTGAAATACAGCTGCAGAAGTGGTAC
GGCGATCTAAGGGGAAATCCCTTCGCAGTGGAAGCCGAGAATAGGGTAGTTGACA
TCAGTGGGTTCTCCATCGGCAGTGATGGACATTCTATCCAGTATAGAAACCTGCT
CGCCTGGAAGTACTTAGAGAACGGCAAGAGAGAGTTCTATCTGCTGATGAACTAC
GGGAAAAAAGGTAGAATTCGCTTTACAGATGGCACCGACATAAAGAAGTCCGGAA
AGTGGCAAGGCCTCTTATACGGAGGCGGCAAAGCAAAGGTGATAGACTTGACTTT
TGACCCTGACGACGAACAGCTGATAATCTTGCCGCTGGCCTTTGGCACAAGACAA
GGTAGGGAATTTATCTGGAATGATCTTCTTTCTCTCGAGACCGGACTCATCAAGC
TCGCAAACGGAAGGGTCATCGAGAAGACAATCTACAATAAAAAGATAGGCCGAGA
CGAGCCAGCCCTGTTTGTGGCTTTGACATTTGAGCGGAGAGAGGTCGTAGATCCC
AGCAACATCAAACCCGTGAACCTGATCGGTGTTGACAGGGGCGAGAACATCCCGG
CGGTTATCGCACTGACGGATCCAGAAGGATGTCCTCTGCCCGAGTTCAAAGATTC
ATCGGGAGGGCCAACCGACATTTTGAGGATAGGGGAGGGGTACAAGGAGAAGCAG
CGAGCTATCCAGGCGGCCAAAGAAGTGGAGCAACGAAGAGCTGGTGGTTATTCTC
GCAAGTTCGCTTCCAAAAGTCGTAACCTGGCTGACGATATGGTGCGCAATTCTGC
CCGTGACCTTTTCTACCACGCCGTTACACACGACGCCGTGTTAGTGTTTGAAAAT
CTTAGTCGAGGCTTCGGGCGACAGGGGAAGCGGACCTTTATGACCGAGAGACAGT
ATACAAAAATGGAGGATTGGCTGACCGCCAAACTGGCGTATGAAGGACTCACATC
CAAGACCTATCTCTCAAAAACTTTGGCCCAGTATACATCTAAGACGTGCAGTAAC
TGTGGCTTCACCATTACCACAGCTGACTACGATGGCATGCTGGTCCGCTTAAAAA
AGACATCTGACGGCTGGGCTACTACCCTCAACAATAAAGAGCTCAAAGCCGAAGG
ACAAATTACCTATTATAACAGGTATAAAAGACAGACTGTCGAGAAGGAGTTGAGC
GCGGAGCTGGACCGCCTATCAGAGGAGTCAGGGAACAACGATATCTCTAAGTGGA
CTAAGGGACGCCGAGACGAGGCGTTGTTCTTGCTGAAAAAGCGGTTCTCTCATCG
ACCCGTGCAGGAGCAGTTCGTGTGTCTGGACTGCGGCCACGAGGTTCATGCTGAT
GAGCAAGCTGCTCTAAATATTGCCCGTAGTTGGTTGTTCCTGAACAGCAATTCAA
CAGAGTTCAAGTCATACAAGAGCGGAAAGCAGCCGTTTGTGGGCGCATGGCAGGC
ATTTTACAAAAGACGCCTGAAGGAAGTGTGGAAGCCAAACGCC
SEQ ATGAAAAGGATTAACAAAATCCGAAGGCGGCTTGTAAAGGATTCTAACACCAAAA
ID AGGCTGGCAAGACGGGGCCCATGAAAACATTACTCGTTAGAGTTATGACCCCCGA
NO: CCTCAGAGAGCGACTGGAAAATTTACGCAAGAAGCCAGAGAACATACCTCAGCCA
166 ATTAGTAATACCTCTCGGGCAAACCTAAACAAGTTGCTTACTGATTACACGGAGA
TGAAAAAGGCCATACTGCATGTGTACTGGGAGGAGTTTCAAAAGGACCCTGTCGG
GCTAATGAGCAGGGTGGCTCAGCCTGCACCTAAAAACATCGACCAGCGGAAACTC
ATCCCAGTTAAGGACGGAAATGAGAGATTGACAAGTTCAGGTTTCGCCTGCTCAC
AGTGCTGTCAACCGCTGTACGTTTATAAGTTAGAACAAGTGAATGACAAAGGAAA
GCCTCACACAAATTATTTTGGCCGGTGTAATGTCTCTGAGCATGAGCGTCTGATT
CTGTTGTCCCCGCATAAACCGGAAGCTAATGACGAGCTCGTAACCTACAGCTTGG
GGAAGTTTGGCCAAAGAGCATTGGACTTCTATTCAATCCATGTGACCCGCGAATC
CAATCATCCCGTCAAGCCCTTGGAGCAGATAGGGGGCAATAGTTGCGCTTCTGGC
CCTGTGGGCAAAGCCCTGTCCGACGCCTGTATGGGAGCCGTGGCTTCATTCCTGA
CCAAATATCAGGATATCATCTTGGAGCACCAGAAAGTGATCAAGAAAAATGAAAA
AAGGTTAGCAAACCTCAAGGATATTGCAAGCGCTAACGGCTTGGCTTTTCCTAAA
ATCACACTTCCACCTCAGCCTCACACAAAGGAAGGCATCGAGGCATACAACAATG
TGGTGGCCCAGATCGTCATCTGGGTTAACTTAAACCTGTGGCAGAAACTTAAAAT
TGGCAGGGATGAGGCAAAACCCTTACAGCGCCTGAAAGGATTCCCCAGCTTTCCA
CTGGTGGAGCGCCAGGCTAACGAAGTGGACTGGTGGGATATGGTGTGTAACGTCA
AGAAGCTCATCAATGAAAAGAAAGAGGACGGTAAAGTCTTCTGGCAGAACCTCGC
CGGTTACAAACGGCAGGAGGCGCTGTTACCTTATCTGTCGAGTGAAGAGGACCGG
AAAAAAGGCAAGAAATTTGCTCGTTATCAGTTTGGTGATTTGCTCCTACATTTGG
AGAAGAAGCACGGCGAGGACTGGGGAAAAGTATACGATGAGGCCTGGGAGAGGAT
TGACAAAAAGGTGGAGGGACTGTCAAAGCACATCAAGCTCGAAGAAGAGCGCAGA
AGCGAGGACGCCCAATCCAAAGCAGCGCTGACTGACTGGCTGCGGGCGAAGGCCA
GTTTTGTAATCGAAGGCCTTAAAGAAGCCGACAAGGATGAATTCTGCAGATGCGA
ATTAAAACTCCAGAAGTGGTACGGCGATCTCCGAGGTAAGCCTTTCGCAATCGAG
GCCGAGAATTCCATACTGGACATTAGTGGATTCAGTAAACAGTATAATTGTGCCT
TTATATGGCAGAAGGATGGTGTCAAGAAACTCAACCTGTACCTTATTATTAATTA
TTTCAAAGGCGGGAAACTGAGATTTAAGAAGATAAAGCCTGAAGCCTTTGAGGCG
AACCGATTCTACACAGTTATTAACAAGAAATCTGGTGAAATTGTACCCATGGAGG
TAAACTTCAACTTCGATGATCCCAATCTGATTATATTGCCACTAGCTTTTGGCAA
GCGGCAGGGTAGGGAATTCATTTGGAACGATTTGCTTTCACTGGAAACAGGGTCC
CTTAAGCTGGCAAACGGGAGAGTGATTGAAAAGACATTGTACAATCGGAGGACAC
GTCAGGATGAACCTGCCCTTTTCGTGGCTCTGACATTCGAGCGCAGGGAGGTTCT
GGACTCTAGCAATATCAAGCCAATGAACCTGATCGGCATAGACCGAGGAGAGAAT
ATTCCGGCTGTGATCGCACTCACCGATCCCGAAGGATGTCCCCTTTCTCGGTTCA
AGGACTCCTTAGGCAATCCAACTCATATCCTGAGAATCGGCGAGTCATACAAGGA
GAAGCAGCGAACAATTCAGGCCGCCAAGGAAGTCGAGCAGAGGCGAGCTGGCGGC
TACAGCCGTAAATACGCTAGTAAAGCTAAGAACCTGGCCGACGATATGGTGCGCA
ATACTGCTAGAGACCTGCTGTACTATGCAGTGACGCAGGACGCAATGCTGATATT
CGAGAATCTGTCCAGAGGATTCGGAAGGCAGGGCAAGCGGACGTTCATGGCCGAG
CGCCAGTATACAAGGATGGAGGATTGGTTAACGGCCAAGCTTGCCTATGAGGGGC
TACCTAGTAAGACCTATCTGTCTAAGACGCTGGCTCAATACACCAGTAAGACCTG
CTCAAACTGTGGCTTTACAATCACTTCTGCTGATTATGATAGAGTGCTCGAGAAG
CTAAAAAAAACTGCCACCGGCTGGATGACTACTATTAATGGGAAGGAACTGAAAG
TGGAAGGACAGATTACCTATTATAATCGCTACAAGCGTCAAAACGTCGTCAAGGA
CCTGTCGGTGGAATTGGACAGACTCAGTGAAGAGTCCGTGAACAATGATATCAGC
TCCTGGACAAAAGGGCGCAGTGGGGAGGCACTCAGCTTGCTTAAAAAGAGGTTTT
CACATCGGCCGGTCCAGGAGAAATTTGTCTGCCTGAACTGCGGATTCGAGACACA
CGCCGACGAGCAGGCAGCACTGAACATTGCCAGATCCTGGCTGTTCCTTAGGTCC
CAGGAATATAAGAAGTACCAGACTAACAAAACCACGGGAAACACAGATAAAAGGG
CCTTTGTCGAAACTTGGCAATCCTTTTACCGGAAGAAGTTAAAGGAAGTGTGGAA
GCCC
SEQ ATGGATAAGAAATACTCAATAGGCTTAGCAATCGGCACAAATAGCGTCGGATGGG
ID CGGTGATCACTGATGAATATAAGGTTCCGTCTAAAAAGTTCAAGGTTCTGGGAAA
NO: TACAGACCGCCACAGTATCAAAAAAAATCTTATAGGGGCTCTTTTATTTGACAGT
167 GGAGAGACAGCGGAAGCGACTCGTCTCAAACGGACAGCTCGTAGAAGGTATACAC
GTCGGAAGAATCGTATTTGTTATCTACAGGAGATTTTTTCAAATGAGATGGCGAA
AGTAGATGATAGTTTCTTTCATCGACTTGAAGAGTCTTTTTTGGTGGAAGAAGAC
AAGAAGCATGAACGTCATCCTATTTTTGGAAATATAGTAGATGAAGTTGCTTATC
ATGAGAAATATCCAACTATCTATCATCTGCGAAAAAAATTGGTAGATTCTACTGA
TAAAGCGGATTTGCGCTTAATCTATTTGGCCTTAGCGCATATGATTAAGTTTCGT
GGTCATTTTTTGATTGAGGGAGATTTAAATCCTGATAATAGTGATGTGGACAAAC
TATTTATCCAGTTGGTACAAACCTACAATCAATTATTTGAAGAAAACCCTATTAA
CGCAAGTGGAGTAGATGCTAAAGCGATTCTTTCTGCACGATTGAGTAAATCAAGA
CGATTAGAAAATCTCATTGCTCAGCTCCCCGGTGAGAAGAAAAATGGCTTATTTG
GGAATCTCATTGCTTTGTCATTGGGTTTGACCCCTAATTTTAAATCAAATTTTGA
TTTGGCAGAAGATGCTAAATTACAGCTTTCAAAAGATACTTACGATGATGATTTA
GATAATTTATTGGCGCAAATTGGAGATCAATATGCTGATTTGTTTTTGGCAGCTA
AGAATTTATCAGATGCTATTTTACTTTCAGATATCCTAAGAGTAAATACTGAAAT
AACTAAGGCTCCCCTATCAGCTTCAATGATTAAACGCTACGATGAACATCATCAA
GACTTGACTCTTTTAAAAGCTTTAGTTCGACAACAACTTCCAGAAAAGTATAAAG
AAATCTTTTTTGATCAATCAAAAAACGGATATGCAGGTTATATTGATGGGGGAGC
TAGCCAAGAAGAATTTTATAAATTTATCAAACCAATTTTAGAAAAAATGGATGGT
ACTGAGGAATTATTGGTGAAACTAAATCGTGAAGATTTGCTGCGCAAGCAACGGA
CCTTTGACAACGGCTCTATTCCCCATCAAATTCACTTGGGTGAGCTGCATGCTAT
TTTGAGAAGACAAGAAGACTTTTATCCATTTTTAAAAGACAATCGTGAGAAGATT
GAAAAAATCTTGACTTTTCGAATTCCTTATTATGTTGGTCCATTGGCGCGTGGCA
ATAGTCGTTTTGCATGGATGACTCGGAAGTCTGAAGAAACAATTACCCCATGGAA
TTTTGAAGAAGTTGTCGATAAAGGTGCTTCAGCTCAATCATTTATTGAACGCATG
ACAAACTTTGATAAAAATCTTCCAAATGAAAAAGTACTACCAAAACATAGTTTGC
TTTATGAGTATTTTACGGTTTATAACGAATTGACAAAGGTCAAATATGTTACTGA
AGGAATGCGAAAACCAGCATTTCTTTCAGGTGAACAGAAGAAAGCCATTGTTGAT
TTACTCTTCAAAACAAATCGAAAAGTAACCGTTAAGCAATTAAAAGAAGATTATT
TCAAAAAAATAGAATGTTTTGATAGTGTTGAAATTTCAGGAGTTGAAGATAGATT
TAATGCTTCATTAGGTACCTACCATGATTTGCTAAAAATTATTAAAGATAAAGAT
TTTTTGGATAATGAAGAAAATGAAGATATCTTAGAGGATATTGTTTTAACATTGA
CCTTATTTGAAGATAGGGAGATGATTGAGGAAAGACTTAAAACATATGCTCACCT
CTTTGATGATAAGGTGATGAAACAGCTTAAACGTCGCCGTTATACTGGTTGGGGA
CGTTTGTCTCGAAAATTGATTAATGGTATTAGGGATAAGCAATCTGGCAAAACAA
TATTAGATTTTTTGAAATCAGATGGTTTTGCCAATCGCAATTTTATGCAGCTGAT
CCATGATGATAGTTTGACATTTAAAGAAGACATTCAAAAAGCACAAGTGTCTGGA
CAAGGCGATAGTTTACATGAACATATTGCAAATTTAGCTGGTAGCCCTGCTATTA
AAAAAGGTATTTTACAGACTGTAAAAGTTGTTGATGAATTGGTCAAAGTAATGGG
GCGGCATAAGCCAGAAAATATCGTTATTGAAATGGCACGTGAAAATCAGACAACT
CAAAAGGGCCAGAAAAATTCGCGAGAGCGTATGAAACGAATCGAAGAAGGTATCA
AAGAATTAGGAAGTCAGATTCTTAAAGAGCATCCTGTTGAAAATACTCAATTGCA
AAATGAAAAGCTCTATCTCTATTATCTCCAAAATGGAAGAGACATGTATGTGGAC
CAAGAATTAGATATTAATCGTTTAAGTGATTATGATGTCGATCACATTGTTCCAC
AAAGTTTCCTTAAAGACGATTCAATAGACAATAAGGTCTTAACGCGTTCTGATAA
AAATCGTGGTAAATCGGATAACGTTCCAAGTGAAGAAGTAGTCAAAAAGATGAAA
AACTATTGGAGACAACTTCTAAACGCCAAGTTAATCACTCAACGTAAGTTTGATA
ATTTAACGAAAGCTGAACGTGGAGGTTTGAGTGAACTTGATAAAGCTGGTTTTAT
CAAACGCCAATTGGTTGAAACTCGCCAAATCACTAAGCATGTGGCACAAATTTTG
GATAGTCGCATGAATACTAAATACGATGAAAATGATAAACTTATTCGAGAGGTTA
AAGTGATTACCTTAAAATCTAAATTAGTTTCTGACTTCCGAAAAGATTTCCAATT
CTATAAAGTACGTGAGATTAACAATTACCATCATGCCCATGATGCGTATCTAAAT
GCCGTCGTTGGAACTGCTTTGATTAAGAAATATCCAAAACTTGAATCGGAGTTTG
TCTATGGTGATTATAAAGTTTATGATGTTCGTAAAATGATTGCTAAGTCTGAGCA
AGAAATAGGCAAAGCAACCGCAAAATATTTCTTTTACTCTAATATCATGAACTTC
TTCAAAACAGAAATTACACTTGCAAATGGAGAGATTCGCAAACGCCCTCTAATCG
AAACTAATGGGGAAACTGGAGAAATTGTCTGGGATAAAGGGCGAGATTTTGCCAC
AGTGCGCAAAGTATTGTCCATGCCCCAAGTCAATATTGTCAAGAAAACAGAAGTA
CAGACAGGCGGATTCTCCAAGGAGTCAATTTTACCAAAAAGAAATTCGGACAAGC
TTATTGCTCGTAAAAAAGACTGGGATCCAAAAAAATATGGTGGTTTTGATAGTCC
AACGGTAGCTTATTCAGTCCTAGTGGTTGCTAAGGTGGAAAAAGGGAAATCGAAG
AAGTTAAAATCCGTTAAAGAGTTACTAGGGATCACAATTATGGAAAGAAGTTCCT
TTGAAAAAAATCCGATTGACTTTTTAGAAGCTAAAGGATATAAGGAAGTTAAAAA
AGACTTAATCATTAAACTACCTAAATATAGTCTTTTTGAGTTAGAAAACGGTCGT
AAACGGATGCTGGCTAGTGCCGGAGAATTACAAAAAGGAAATGAGCTGGCTCTGC
CAAGCAAATATGTGAATTTTTTATATTTAGCTAGTCATTATGAAAAGTTGAAGGG
TAGTCCAGAAGATAACGAACAAAAACAATTGTTTGTGGAGCAGCATAAGCATTAT
TTAGATGAGATTATTGAGCAAATCAGTGAATTTTCTAAGCGTGTTATTTTAGCAG
ATGCCAATTTAGATAAAGTTCTTAGTGCATATAACAAACATAGAGACAAACCAAT
ACGTGAACAAGCAGAAAATATTATTCATTTATTTACGTTGACGAATCTTGGAGCT
CCCGCTGCTTTTAAATATTTTGATACAACAATTGATCGTAAACGATATACGTCTA
CAAAAGAAGTTTTAGATGCCACTCTTATCCATCAATCCATCACTGGTCTTTATGA
AACACGCATTGATTTGAGTCAGCTAGGAGGTGACTGA
SEQ ATGGATAAGAAGTATTCAATTGGACTTGCGATTGGCACTAACAGTGTGGGCTGGG
ID CGGTGATTACAGACGAGTATAAGGTGCCGTCAAAAAAGTTTAAAGTTCTGGGCAA
NO: CACTGATCGCCATTCCATCAAGAAAAACCTAATCGGGGCCCTTCTTTTTGATAGT
168 GGCGAAACGGCCGAGGCGACGCGTCTAAAACGTACCGCGCGGCGTCGCTACACCC
GACGAAAAAACCGTATTTGTTACCTTCAGGAGATCTTCAGTAACGAAATGGCTAA
GGTGGACGATTCATTCTTCCACCGTCTGGAGGAGTCCTTTTTAGTTGAAGAAGAC
AAGAAGCATGAGCGACACCCAATTTTTGGTAACATTGTCGACGAAGTCGCCTATC
ACGAAAAATATCCGACCATTTATCACCTGCGCAAAAAACTGGTCGATAGCACGGA
TAAAGCGGATCTGCGGCTTATTTACCTGGCGCTTGCCCACATGATCAAGTTCCGC
GGCCACTTCCTGATAGAAGGAGACCTGAACCCGGATAATAGCGATGTAGACAAAC
TGTTTATTCAGCTGGTCCAGACCTACAACCAGCTGTTTGAAGAAAATCCGATTAA
TGCGTCAGGCGTGGATGCGAAAGCGATACTGAGTGCCCGCCTGTCGAAATCTCGC
CGTCTCGAAAATCTGATTGCACAGCTGCCCGGCGAAAAAAAAAACGGTCTTTTTG
GCAATCTGATCGCGCTGTCACTGGGCCTGACACCAAATTTTAAGAGCAACTTCGA
CCTGGCAGAGGATGCGAAGCTTCAACTGTCGAAGGACACCTATGACGATGATCTG
GATAATCTTCTGGCACAAATCGGTGATCAGTATGCGGATTTATTCCTTGCAGCGA
AAAACCTATCTGACGCAATTCTGTTGAGCGATATCCTCCGCGTCAACACCGAAAT
CACTAAAGCCCCCCTGTCAGCGTCGATGATTAAACGTTATGATGAGCACCATCAG
GATCTGACCTTGCTAAAGGCGCTGGTGCGACAGCAGCTTCCCGAAAAATATAAAG
AGATCTTTTTTGATCAATCGAAGAATGGTTATGCCGGATACATTGATGGCGGAGC
CAGTCAGGAAGAATTTTACAAATTCATCAAACCGATCCTGGAAAAAATGGATGGC
ACAGAAGAACTGCTTGTGAAATTGAACCGGGAAGATTTACTGCGCAAACAGCGTA
CGTTCGACAACGGCTCCATACCCCATCAGATTCACTTAGGTGAGCTGCATGCAAT
ACTCCGTCGCCAGGAAGATTTTTATCCATTTTTAAAAGACAACCGTGAGAAGATT
GAAAAAATTTTAACTTTTCGTATTCCATATTACGTCGGGCCTTTGGCCCGAGGTA
ACTCTCGATTCGCCTGGATGACGAGAAAAAGCGAGGAGACCATCACTCCGTGGAA
TTTTGAAGAGGTTGTTGATAAAGGCGCGAGCGCCCAGTCGTTTATCGAACGTATG
ACCAACTTTGATAAAAATCTGCCGAATGAAAAAGTGCTTCCGAAGCATTCTCTGT
TGTATGAATATTTCACTGTGTACAATGAGTTAACGAAAGTGAAATATGTGACCGA
AGGCATGCGGAAACCTGCTTTTCTGTCCGGAGAACAGAAAAAAGCAATTGTGGAC
CTGCTGTTCAAAACGAACCGGAAAGTAACTGTGAAGCAGCTGAAAGAGGACTACT
TCAAAAAAATCGAATGCTTCGACTCAGTAGAGATCTCTGGTGTTGAAGATCGCTT
CAACGCGAGTCTGGGAACGTACCATGATTTGTTGAAAATCATCAAAGATAAAGAC
TTTCTGGATAACGAAGAGAATGAGGACATTCTTGAAGATATTGTTTTGACACTGA
CTCTGTTTGAGGATCGCGAAATGATTGAAGAGCGCCTGAAAACGTATGCCCATTT
ATTCGATGACAAAGTCATGAAGCAGCTGAAACGTCGCCGCTATACTGGGTGGGGC
AGACTTTCACGTAAATTGATCAATGGTATAAGAGACAAACAGAGCGGCAAAACTA
TCTTAGATTTCCTGAAGAGTGATGGATTTGCCAACCGGAATTTTATGCAGCTTAT
ACATGATGACTCGCTAACGTTTAAAGAAGACATTCAGAAGGCGCAGGTCAGCGGC
CAGGGTGATTCGCTGCATGAACACATTGCAAATCTTGCCGGATCGCCAGCGATCA
AAAAAGGCATCCTTCAGACAGTAAAAGTTGTGGATGAACTGGTGAAAGTAATGGG
TCGTCACAAGCCAGAAAATATTGTGATCGAAATGGCCCGGGAAAATCAGACTACT
CAAAAAGGTCAGAAAAATTCTCGCGAGCGTATGAAACGTATTGAAGAAGGCATCA
AAGAGCTAGGCAGCCAGATATTAAAGGAACATCCGGTTGAGAACACTCAGCTGCA
GAATGAAAAACTGTATCTGTATTATCTTCAGAACGGCCGTGACATGTATGTTGAT
CAAGAACTGGATATCAATCGCTTGTCCGATTATGACGTGGATCATATTGTTCCGC
AAAGCTTTCTGAAAGACGATTCTATTGACAATAAAGTACTGACACGTTCGGACAA
AAACCGTGGTAAAAGCGATAACGTACCGTCGGAAGAAGTTGTTAAGAAAATGAAA
AATTATTGGCGCCAACTCCTGAATGCTAAATTGATTACCCAGCGGAAATTTGATA
ACTTAACCAAAGCCGAGCGGGGTGGCTTAAGTGAACTGGATAAAGCGGGTTTTAT
TAAACGCCAACTGGTAGAAACCCGCCAGATAACGAAACATGTAGCTCAAATCCTC
GATAGTCGCATGAATACGAAATATGACGAAAATGATAAATTGATCCGTGAAGTAA
AAGTGATTACTCTTAAAAGCAAATTGGTATCTGATTTTCGGAAAGATTTCCAATT
CTATAAGGTGAGAGAAATTAACAATTACCATCATGCACATGATGCGTATTTAAAT
GCAGTTGTTGGCACCGCCTTAATCAAAAAATATCCGAAATTAGAATCTGAGTTCG
TGTATGGTGATTATAAAGTTTATGATGTTCGAAAAATGATTGCTAAGTCTGAACA
GGAAATCGGCAAAGCGACCGCAAAGTATTTTTTTTATAGCAATATTATGAATTTT
TTTAAAACTGAGATTACCCTGGCGAATGGCGAAATTCGCAAACGTCCTCTGATTG
AAACCAATGGCGAAACCGGCGAGATAGTATGGGACAAGGGCCGTGATTTTGCGAC
CGTCCGGAAAGTCCTGTCAATGCCGCAGGTGAATATTGTCAAGAAAACAGAAGTT
CAGACAGGCGGTTTTAGTAAAGAGTCTATTCTGCCCAAACGTAATTCGGATAAAT
TGATTGCCCGCAAGAAAGATTGGGATCCGAAGAAATATGGTGGATTCGATTCTCC
GACGGTCGCCTATAGCGTTCTAGTCGTCGCCAAGGTCGAAAAAGGTAAATCCAAA
AAACTGAAATCTGTGAAAGAACTGTTAGGCATTACAATCATGGAACGTAGTAGTT
TTGAAAAGAACCCGATCGACTTCCTCGAGGCGAAAGGCTACAAAGAAGTCAAGAA
GGATTTGATTATTAAACTCCCAAAATATTCATTATTTGAGTTAGAAAACGGTAGG
AAGCGTATGCTGGCGAGTGCTGGGGAATTACAGAAAGGGAATGAGTTAGCACTGC
CGTCAAAATATGTGAACTTTCTGTATCTGGCCTCCCATTACGAGAAACTGAAAGG
TAGCCCGGAAGATAATGAACAGAAACAACTATTTGTCGAGCAACACAAACATTAT
CTGGATGAAATTATTGAACAGATTAGTGAATTCTCTAAACGTGTTATTTTAGCGG
ATGCCAACCTTGACAAGGTGCTGAGCGCATATAATAAACACCGTGATAAACCCAT
TCGTGAACAGGCTGAAAATATCATACATCTGTTCACGTTAACCAACTTGGGAGCT
CCTGCCGCTTTTAAATATTTCGATACCACAATTGACCGCAAACGTTATACGTCTA
CAAAAGAGGTGCTCGATGCGACCCTGATCCACCAGTCTATTACAGGCCTGTATGA
AACTCGTATCGACCTGTCACAACTGGGCGGCGACTGA
SEQ ATGGACAAGAAATATTCAATCGGTTTAGCAATAGGAACTAACTCAGTAGGTTGGG
ID CTGTAATTACAGACGAATACAAGGTACCGTCCAAAAAGTTTAAGGTGTTGGGGAA
NO: CACAGATAGACACTCTATAAAAAAAAATTTAATAGGCGCTTTACTTTTCGATTCA
169 GGCGAAACTGCAGAAGCGACACGTCTGAAGAGAACCGCTAGACGTAGATACACGA
GGAGAAAGAACAGAATATGTTACCTACAAGAAATTTTTTCTAATGAGATGGCTAA
GGTGGATGATTCGTTTTTTCATAGACTCGAAGAATCTTTCTTAGTTGAAGAAGAT
AAAAAACACGAAAGGCATCCTATCTTTGGAAACATAGTTGATGAGGTGGCTTACC
ATGAAAAATATCCCACTATATATCACCTTAGAAAAAAGTTGGTTGATTCAACCGA
CAAAGCGGATCTAAGGTTAATTTACCTCGCGTTGGCTCACATGATAAAATTTAGA
GGACATTTCTTGATCGAAGGTGATTTAAATCCCGATAACTCTGATGTAGATAAAC
TGTTCATCCAGTTGGTTCAAACATATAATCAGTTGTTCGAAGAGAACCCCATTAA
CGCATCAGGTGTTGATGCTAAAGCAATCTTATCAGCAAGGTTGAGCAAGAGCAGA
CGTCTGGAAAACTTGATTGCCCAATTGCCAGGTGAAAAGAAGAACGGTCTTTTTG
GAAATTTAATTGCACTTTCACTTGGGTTGACACCGAATTTTAAAAGCAATTTCGA
CCTCGCTGAGGATGCTAAACTCCAGTTATCTAAGGATACATATGACGATGATTTG
GATAATCTATTGGCCCAGATAGGTGATCAGTATGCAGATTTGTTTTTGGCAGCTA
AGAATTTATCAGATGCAATTCTACTGAGCGATATTTTAAGGGTGAATACAGAAAT
AACTAAAGCACCTTTGTCTGCATCTATGATAAAAAGATACGATGAACACCATCAA
GATCTCACACTATTAAAAGCTTTAGTTAGACAACAATTACCAGAAAAATATAAAG
AAATCTTTTTCGATCAGTCCAAGAACGGATACGCCGGCTATATAGATGGCGGTGC
CTCCCAAGAAGAATTTTACAAATTTATCAAACCCATTTTGGAAAAGATGGATGGT
ACTGAAGAATTATTGGTCAAATTAAACAGGGAAGATTTATTAAGAAAACAAAGGA
CCTTTGATAATGGTTCTATTCCACACCAAATCCATCTAGGGGAATTACATGCGAT
TCTTAGAAGACAAGAAGATTTTTATCCATTCTTGAAAGATAACAGGGAAAAGATA
GAGAAAATCTTAACTTTTAGAATTCCCTACTACGTCGGGCCCTTAGCTAGGGGGA
ATTCTAGATTCGCCTGGATGACACGCAAATCAGAAGAAACAATTACGCCTTGGAA
TTTTGAAGAAGTTGTTGATAAAGGAGCCTCTGCTCAATCTTTTATTGAACGAATG
ACCAATTTTGATAAGAATTTACCCAATGAAAAGGTCTTACCCAAACATTCACTCC
TATACGAGTACTTTACTGTTTACAATGAGTTGACAAAAGTGAAGTATGTTACCGA
GGGTATGCGAAAACCTGCTTTCTTGAGTGGTGAACAAAAGAAGGCCATTGTTGAC
TTGTTATTCAAAACTAACAGAAAGGTCACTGTGAAGCAGCTTAAAGAAGATTATT
TCAAAAAGATCGAATGTTTCGACTCGGTAGAAATTAGTGGTGTGGAAGATAGATT
TAATGCTTCTCTTGGAACATATCATGATCTACTAAAGATCATCAAAGATAAAGAT
TTCTTGGACAATGAAGAAAATGAAGATATTCTTGAAGACATCGTGTTGACACTTA
CATTGTTTGAGGACAGAGAAATGATTGAAGAAAGGCTGAAGACCTACGCCCATTT
GTTTGATGATAAAGTCATGAAACAGTTAAAGAGGAGAAGGTATACCGGATGGGGT
AGGCTGTCTCGCAAATTGATTAATGGTATTCGTGATAAACAATCGGGTAAAACAA
TCCTAGATTTCCTGAAGTCCGATGGTTTCGCCAACAGGAATTTTATGCAATTGAT
TCATGACGATTCTTTGACTTTTAAAGAGGATATTCAGAAAGCACAGGTCTCAGGA
CAGGGCGATTCACTCCATGAACATATAGCTAACCTGGCTGGCTCCCCTGCTATTA
AGAAAGGTATCTTGCAAACCGTCAAAGTAGTAGACGAACTTGTTAAAGTTATGGG
AAGACACAAACCTGAAAATATCGTTATTGAAATGGCTCGCGAAAACCAGACAACA
CAAAAGGGTCAAAAGAATTCGAGAGAGAGAATGAAGCGTATCGAAGAAGGTATTA
AAGAACTTGGGTCCCAAATACTTAAAGAACATCCAGTAGAAAACACTCAGCTTCA
AAATGAAAAATTATACTTATATTATCTTCAGAATGGCCGCGATATGTATGTTGAC
CAAGAGTTAGATATAAATAGGTTGTCTGATTACGACGTGGATCATATTGTACCTC
AATCTTTTCTAAAAGATGATTCAATTGATAATAAGGTATTAACGAGAAGTGATAA
AAATAGAGGTAAATCTGACAACGTGCCAAGCGAAGAGGTGGTGAAGAAAATGAAA
AATTATTGGCGTCAACTGTTGAACGCCAAGTTAATTACGCAGAGAAAGTTTGATA
ATCTAACAAAAGCTGAAAGAGGAGGCCTATCTGAGTTAGATAAGGCCGGTTTTAT
CAAACGTCAGTTAGTTGAAACCAGGCAAATCACGAAGCACGTTGCCCAAATTCTA
GATTCAAGGATGAATACCAAATACGATGAAAACGATAAACTGATTCGGGAAGTCA
AGGTTATAACTCTAAAAAGCAAACTAGTTTCAGATTTTCGCAAAGATTTTCAATT
TTACAAAGTTCGAGAAATCAATAATTATCATCATGCTCACGACGCGTACTTGAAC
GCGGTCGTTGGTACAGCTTTAATAAAGAAATATCCTAAACTGGAATCGGAATTTG
TATATGGGGATTACAAAGTATACGACGTGAGAAAGATGATCGCTAAATCTGAACA
AGAAATTGGGAAAGCAACTGCCAAATATTTTTTTTACAGCAACATAATGAATTTT
TTTAAAACGGAAATTACATTGGCAAATGGCGAAATTAGAAAGCGCCCATTGATAG
AGACCAATGGAGAGACTGGGGAAATCGTGTGGGATAAAGGACGTGATTTTGCCAC
AGTGAGGAAAGTGTTAAGTATGCCACAAGTTAATATTGTAAAAAAGACCGAGGTC
CAAACGGGTGGATTTAGCAAAGAATCAATTTTACCTAAGAGAAATTCAGATAAAT
TAATTGCCCGCAAAAAGGATTGGGATCCTAAAAAATATGGTGGTTTTGATTCCCC
AACAGTTGCTTACTCCGTCCTAGTTGTTGCTAAGGTTGAAAAAGGAAAGTCTAAG
AAACTTAAATCCGTAAAAGAGTTACTGGGAATTACAATAATGGAAAGATCCTCTT
TCGAAAAGAACCCTATTGACTTCTTGGAGGCGAAAGGTTATAAAGAAGTCAAAAA
AGATTTGATCATAAAACTACCAAAGTATTCTCTATTTGAATTGGAAAACGGCAGA
AAAAGGATGTTGGCAAGCGCTGGTGAACTACAAAAGGGTAACGAATTGGCATTGC
CGAGTAAATACGTGAATTTTCTATATTTGGCATCACATTACGAAAAGTTAAAGGG
ATCACCCGAGGATAACGAGCAGAAACAACTGTTTGTTGAACAACACAAACATTAT
CTTGATGAAATTATAGAACAAATTAGTGAGTTCAGTAAGAGAGTTATTTTAGCCG
ATGCAAATTTAGACAAAGTTTTATCTGCTTATAACAAACATAGAGATAAGCCTAT
AAGGGAACAAGCCGAAAATATTATTCATTTGTTTACGTTAACAAATTTAGGGGCA
CCAGCAGCATTCAAGTACTTCGATACGACTATCGATCGTAAGCGTTACACATCTA
CCAAAGAAGTTCTTGATGCAACTTTGATTCATCAATCTATAACAGGCTTATATGA
AACTAGAATCGATCTGTCACAACTTGGTGGTGACTAA
SEQ ATGGACAAGAAGTACTCAATTGGGCTTGCTATCGGCACTAACAGCGTTGGCTGGG
ID CGGTCATCACAGACGAATATAAGGTCCCATCAAAGAAATTCAAAGTCCTTGGCAA
NO: TACGGACCGACATTCAATCAAGAAGAACCTGATTGGAGCTCTGCTGTTTGATTCC
170 GGTGAAACCGCCGAGGCAACACGATTGAAACGTACCGCTCGTAGGAGGTATACGC
GGCGGAAAAATAGGATCTGCTATCTGCAGGAAATATTTAGCAACGAAATGGCCAA
GGTAGACGACAGCTTCTTCCACCGGCTCGAGGAATCTTTCCTCGTGGAAGAAGAC
AAAAAGCACGAGCGCCACCCCATTTTCGGCAATATCGTGGACGAGGTAGCTTACC
ATGAAAAGTATCCAACTATTTACCACTTACGTAAGAAGTTAGTGGACAGCACCGA
TAAAGCCGACCTTCGCCTGATTTACCTAGCACTTGCACACATGATTAAGTTCCGA
GGCCACTTCTTGATAGAGGGAGACCTGAATCCTGACAATTCCGATGTGGATAAAT
TGTTCATCCAGCTGGTACAGACATACAATCAGTTGTTTGAGGAAAATCCGATTAA
TGCCAGTGGCGTGGACGCCAAGGCTATCCTGTCTGCTCGGCTTAGTAAGAGTAGA
CGCCTGGAAAATCTAATCGCACAGCTGCCCGGCGAAAAGAAAAATGGACTGTTCG
GTAATTTGATCGCCCTGAGCCTGGGCCTCACCCCTAACTTTAAGTCTAACTTCGA
CCTGGCCGAAGATGCTAAGCTCCAGCTGTCCAAAGATACT
TACGATGACGATCTCGATAATCTACTGGCTCAGATCGGGGACCAGTACGCTGACC
TGTTTCTAGCTGCCAAGAACCTCAGTGACGCCATTCTCCTGTCCGATATTCTGAG
GGTTAACACTGAAATTACAAAGGCCCCGCTGAGCGCGAGCATGATCAAAAGGTAC
GACGAGCATCACCAGGACCTCACGCTGCTGAAGGCCTTAGTCAGACAGCAACTGC
CCGAAAAGTACAAAGAAATCTTTTTCGACCAATCCAAGAACGGGTACGCCGGCTA
CATTGATGGCGGGGCTTCACAAGAGGAGTTTTACAAGTTTATCAAGCCCATCCTG
GAGAAAATGGACGGCACTGAAGAACTGCTTGTGAAACTCAATAGGGAAGACTTAC
TGAGGAAACAGCGCACATTCGATAATGGCTCCATACCCCACCAAATCCATCTGGG
AGAGTTGCATGCCATCTTGCGAAGGCAGGAGGACTTCTACCCCTTTCTTAAGGAC
AACAGGGAGAAAATCGAGAAAATTCTGACTTTCCGTATCCCCTACTACGTGGGCC
CACTTGCTCGCGGAAACTCACGATTCGCATGGATGACCAGAAAGTCCGAGGAAAC
AATTACACCCTGGAATTTTGAGGAGGTAGTAGACAAGGGAGCCAGCGCTCAATCT
TTCATTGAGAGGATGACGAATTTCGACAAGAACCTTCCAAACGAGAAAGTGCTTC
CTAAGCACAGCCTGCTGTATGAGTATTTCACGGTGTACAACGAACTTACGAAGGT
CAAGTATGTGACAGAGGGTATGCGGAAACCTGCTTTTCTGTCTGGTGAACAGAAG
AAAGCTATCGTCGATCTCCTGTTTAAAACCAACCGAAAGGTGACGGTGAAACAGT
TGAAGGAGGATTACTTCAAGAAGATCGAGTGTTTTGATTCTGTTGAAATTTCTGG
GGTCGAGGATAGATTCAACGCCAGCCTGGGCACCTACCATGATTTGCTGAAGATT
ATCAAGGATAAGGATTTTCTGGATAATGAGGAGAATGAAGACATTTTGGAGGATA
TAGTGCTGACCCTCACCCTGTTCGAGGACCGGGAGATGATCGAGGAGAGACTGAA
AACATACGCTCACCTGTTTGACGACAAGGTCATGAAGCAGCTTAAGAGACGCCGT
TACACAGGCTGGGGAAGATTATCCCGCAAATTAATCAACGGGATACGCGATAAAC
AAAGTGGCAAGACCATACTCGACTTCCTAAAGAGCGATGGATTCGCAAATCGCAA
TTTCATGCAGTTGATCCACGACGATAGCCTGACCTTCAAAGAGGACATTCAGAAA
GCGCAGGTGAGTGGTCAAGGGGATTCCCTGCACGAACACATTGCTAACTTGGCTG
GATCACCAGCCATTAAGAAAGGCATACTGCAGACCGTTAAAGTGGTAGATGAGCT
TGTGAAAGTCATGGGAAGACATAAGCCAGAGAACATAGTGATCGAAATGGCCAGG
GAAAATCAGACCACGCAAAAGGGGCAGAAGAACTCAAGAGAGCGTATGAAGAGGA
TCGAGGAGGGCATCAAGGAGCTGGGTAGCCAGATCCTTAAAGAGCACCCAGTTGA
GAATACCCAGCTGCAGAATGAGAAACTTTATCTCTATTATCTCCAGAACGGAAGG
GATATGTATGTCGACCAGGAACTGGACATCAATCGGCTGAGTGATTATGACGTCG
ACCACATTGTGCCTCAAAGCTTTCTGAAGGATGATTCCATCGACAATAAAGTTCT
GACCCGGTCTGATAAAAATAGAGGCAAATCCGACAACGTACCTAGCGAAGAAGTC
GTCAAAAAAATGAAGAACTATTGGAGGCAGTTGCTGAATGCCAAGCTGATTACAC
AACGCAAGTTTGACAATCTCACCAAGGCAGAAAGGGGGGGCCTGTCAGAACTCGA
CAAAGCAGGTTTCATTAAAAGGCAGCTAGTTGAAACTAGGCAGATTACTAAGCAC
GTGGCCCAGATCCTCGACTCACGGATGAATACAAAGTATGATGAGAATGATAAGC
TAATCCGGGAGGTGAAGGTGATTACTCTGAAATCTAAGCTGGTGTCAGATTTCAG
AAAAGACTTCCAGTTCTACAAAGTCAGAGAGATCAACAATTATCACCATGCCCAC
GATGCATATCTTAATGCAGTAGTGGGGACAGCTCTGATCAAAAAATATCCTAAAC
TGGAGTCTGAATTCGTTTATGGTGACTATAAAGTCTATGACGTCAGAAAAATGAT
CGCAAAGAGCGAGCAGGAGATAGGGAAGGCCACAGCAAAGTACTTCTTTTACAGT
AATATCATGAACTTTTTCAAAACTGAGATTACATTGGCTAACGGCGAGATCCGCA
AGCGGCCACTGATAGAGACTAACGGAGAGACAGGGGAGATTGTTTGGGATAAGGG
CCGTGACTTCGCCACCGTTAGGAAAGTGCTGTCCATGCCCCAGGTGAACATTGTG
AAGAAGACAGAAGTGCAGACGGGTGGGTTCTCAAAAGAGTCTATTCTGCCTAAGC
GGAATAGTGACAAACTGATCGCACGTAAAAAGGACTGGGATCCAAAAAAGTACGG
CGGATTCGACAGTCCTACCGTTGCATATTCCGTGCTTGTGGTCGCTAAGGTGGAG
AAGGGAAAAAGCAAGAAACTGAAGTCAGTCAAAGAACTACTGGGCATAACGATCA
TGGAGCGCTCCAGTTTCGAAAAAAACCCAATCGATTTTCTTGAAGCCAAGGGATA
CAAGGAGGTAAAGAAAGACCTTATCATTAAGCTGCCTAAGTACAGTCTGTTCGAA
CTGGAGAATGGGAGGAAGCGCATGCTGGCATCAGCTGGAGAACTCCAAAAAGGGA
ACGAGTTGGCCCTCCCCTCAAAGTATGTCAATTTTCTCTACCTGGCTTCTCACTA
CGAGAAGTTAAAGGGGTCTCCAGAGGATAATGAGCAGAAACAGCTGTTTGTGGAA
CAGCACAAGCACTATTTGGACGAAATCATCGAACAAATTTCCGAGTTCAGTAAGA
GGGTGATTCTGGCCGACGCAAACCTTGACAAAGTTCTGTCCGCATACAATAAGCA
CAGAGACAAACCAATCCGCGAGCAAGCCGAGAATATAATTCACCTTTTCACTCTG
ACTAATCTGGGGGCCCCCGCAGCATTTAAATATTTCGATACAACAATCGACCGGA
AGCGGTATACATCTACTAAGGAAGTCCTCGATGCGACACTGATCCACCAGTCAAT
TACAGGTTTATATGAAACAAGAATCGACCTGTCCCAGCTGGGCGGCGACTAG
SEQ AAAATTCcatGCAAAATGCTCCGGTTTCATGTCATCAAAATGATGACGTAATTAA
ID GCATTGATAATTGAGATCCCTCTCCCTGACAGGATGATTACATAAATAATAGTGA
NO: CAAAAATAAATTATTTATTTATCCAGAAAATGAATTGGAAAATCAGGAGAGCGTT
171 TTCAATCCTACCTCTGGCGCAGTTGATATGTcaaaCAGGTtgccgtcactgcgtc
ttttactggctcttctcgctaaccaaaccggtaaccccgcttattaaaagcattc
tgtaacaaaggggaccaaagccatgacaaaaacgcgtaacaaaagtgtctataat
cacggcagaaaagtccacattgattatttgcacggcgtcacactttgctatgcca
tagcatttttatccataagattagcggatcctacctgacgctttttatcgcaact
ctctactgtttctccatacccgtttttttgggctagcaccgcctatctcgtgtga
gataggcggagatacgaactttaagAAGGAGatataccATGGAACAGGAATATTA
TCTGGGCTTGGACATGGGCACCGGTTCCGTCGGCTGGGCTGTTACTGACAGTGAA
TATCACGTTCTAAGAAAGCATGGTAAGGCATTGTGGGGTGTAAGACTTTTCGAAT
CTGCTTCCACTGCTGAAGAGCGTAGAATGTTTAGAACGAGTCGACGTAGGCTAGA
CAGGCGCAATTGGAGAATCGAAATTTTACAAGAAATTTTTGCGGAAGAGATATCT
AAGAAAGACCCAGGCTTTTTCCTGAGAATGAAGGAATCTAAGTATTACCCTGAGG
ATAAAAGAGATATAAATGGTAACTGTCCCGAATTGCCTTACGCATTATTTGTGGA
CGATGATTTTACCGATAAGGATTACCATAAAAAGTTCCCAACTATCTACCATTTA
CGCAAAATGTTAATGAATACAGAGGAAACCCCAGACATAAGACTAGTTTATCTGG
CAATACACCATATGATGAAACATAGAGGCCATTTCTTACTTTCCGGGGATATCAA
CGAAATCAAAGAGTTTGGTACCACATTTAGTAAGTTACTGGAAAACATAAAGAAT
GAAGAATTGGATTGGAACTTAGAACTCGGAAAAGAAGAATACGCGGTTGTCGAAT
CTATCCTGAAGGATAATATGCTGAATAGGTCGACCAAAAAAACTAGGCTGATCAA
AGCACTGAAAGCCAAATCTATCTGCGAAAAAGCTGTTTTAAATTTACTTGCTGGT
GGCACTGTTAAGTTATCAGACATTTTTGGTTTGGAAGAATTGAACGAAACCGAGC
GTCCAAAAATTAGTTTCGCTGATAATGGCTACGATGATTACATTGGTGAGGTGGA
AAACGAGTTGGGCGAACAATTTTATATTATAGAGACAGCTAAGGCAGTCTATGAC
TGGGCTGTTTTAGTAGAAATCCTTGGTAAATACACATCTATCTCCGAAGCGAAAG
TTGCTACTTACGAAAAGCACAAGTCCGATCTCCAGTTTTTGAAGAAAATTGTCAG
GAAATATCTGACTAAGGAAGAATATAAAGATATTTTCGTTAGTACCTCTGACAAA
CTGAAAAATTACTCCGCTTACATCGGGATGACCAAGATTAATGGCAAAAAAGTTG
ATCTGCAAAGCAAAAGGTGTTCGAAGGAAGAATTTTATGATTTCATTAAAAAGAA
TGTCTTAAAAAAATTAGAAGGTCAGCCAGAATACGAATATTTGAAAGAAGAACTG
GAAAGAGAGACATTCTTACCAAAACAAGTCAACAGAGATAATGGGGTAATTCCAT
ATCAAATTCACCTCTACGAATTAAAAAAAATTTTAGGCAATTTACGCGATAAAAT
TGACCTTATCAAAGAAAATGAGGATAAGCTGGTTCAACTCTTTGAATTCAGAATA
CCCTATTATGTGGGCCCACTGAACAAGATTGATGACGGCAAAGAAGGTAAATTCA
CATGGGCCGTCCGCAAATCCAATGAAAAAATTTACCCATGGAACTTTGAAAATGT
AGTAGATATTGAAGCGTCTGCGGAGAAATTTATTCGAAGAATGACTAATAAATGC
ACTTACTTGATGGGAGAGGATGTTCTGCCTAAAGACAGCTTATTATACAGCAAGT
ACATGGTTCTAAACGAACTTAACAACGTTAAGTTGGACGGTGAGAAATTAAGTGT
AGAATTGAAACAAAGATTGTATACTGACGTCTTCTGCAAGTACAGAAAAGTGACA
GTTAAAAAAATTAAGAATTACTTGAAGTGCGAAGGTATAATTTCTGGAAACGTAG
AGATTACTGGTATTGATGGTGATTTCAAAGCATCCCTAACAGCTTACCACGATTT
CAAGGAAATCCTGACAGGAACTGAACTCGCAAAAAAAGATAAAGAAAACATTATT
ACTAATATTGTTCTTTTCGGTGATGACAAGAAATTGTTGAAGAAAAGACTGAATA
GACTTTACCCCCAGATTACTCCCAATCAACTTAAGAAAATTTGTGCTTTGTCTTA
CACAGGATGGGGTCGTTTTTCAAAAAAGTTCTTAGAAGAGATTACCGCACCTGAT
CCAGAAACAGGCGAAGTATGGAATATAATTACCGCCTTATGGGAATCGAACAATA
ATCTTATGCAACTTCTGAGCAATGAATATCGTTTCATGGAAGAAGTTGAGACTTA
CAACATGGGCAAACAGACGAAGACTTTATCCTATGAAACTGTGGAAAATATGTAT
GTATCACCTTCTGTCAAGAGACAAATTTGGCAAACCTTAAAAATTGTCAAAGAAT
TAGAAAAGGTAATGAAGGAGTCTCCTAAACGTGTGTTTATTGAAATGGCTAGAGA
AAAACAAGAGTCAAAAAGAACCGAGTCAAGAAAGAAGCAGTTAATCGATTTATAT
AAGGCTTGTAAAAACGAAGAGAAAGATTGGGTTAAAGAATTGGGGGACCAAGAGG
AACAAAAACTACGGTCGGATAAGTTGTATTTATACTATACGCAAAAGGGACGATG
TATGTATTCCGGCGAGGTAATAGAATTGAAGGATTTATGGGACAATACAAAATAT
GACATAGACCATATATATCCCCAATCAAAAACGATGGACGATAGCTTGAACAATA
GAGTACTCGTGAAAAAAAAATATAATGCGACCAAATCTGATAAGTATCCTCTGAA
TGAAAATATCAGACATGAAAGAAAGGGGTTCTGGAAGTCCTTGTTAGATGGTGGG
TTTATAAGCAAAGAAAAGTACGAGCGTCTAATAAGAAACACGGAGTTATCGCCAG
AAGAACTCGCTGGTTTTATTGAGAGGCAAATCGTGGAAACGAGACAATCTACCAA
AGCCGTTGCTGAGATCCTAAAGCAAGTTTTCCCAGAGTCGGAGATTGTCTATGTC
AAAGCTGGCACAGTGAGCAGGTTTAGGAAAGACTTCGAACTATTAAAGGTAAGAG
AAGTGAACGATTTACATCACGCAAAGGACGCTTACCTAAATATCGTTGTAGGTAA
CTCATATTATGTTAAATTTACCAAGAACGCCTCTTGGTTTATAAAGGAGAACCCA
GGTAGAACATATAACCTGAAAAAGATGTTCACCTCTGGTTGGAATATTGAGAGAA
ACGGAGAAGTCGCATGGGAAGTTGGTAAGAAAGGGACTATAGTGACAGTAAAGCA
AATTATGAACAAAAATAATATCCTCGTTACAAGGCAGGTTCATGAAGCAAAGGGC
GGCCTTTTTGACCAACAAATTATGAAGAAAGGGAAAGGTCAAATTGCAATAAAAG
AAACCGATGAGAGACTAGCGTCAATAGAAAAGTATGGTGGCTATAATAAAGCTGC
GGGTGCATACTTTATGCTTGTTGAATCAAAAGACAAGAAAGGTAAGACTATTAGA
ACTATAGAATTTATACCCCTGTACCTTAAAAACAAAATTGAATCGGATGAGTCAA
TCGCGTTAAATTTTCTAGAGAAAGGAAGGGGTTTAAAAGAACCAAAGATCCTGTT
AAAAAAGATTAAGATTGACACCTTGTTCGATGTAGATGGATTTAAAATGTGGTTA
TCTGGCAGAACAGGCGATAGACTTTTGTTTAAGTGCGCTAATCAATTAATTTTGG
ATGAGAAAATCATTGTCACAATGAAAAAAATAGTTAAGTTTATTCAGAGAAGACA
AGAAAACAGGGAGTTGAAATTATCTGATAAAGATGGTATCGACAATGAAGTTTTA
ATGGAAATCTACAATACATTCGTTGATAAACTTGAAAATACCGTATATCGAATCA
GGTTAAGTGAACAAGCCAAAACATTAATTGATAAACAAAAAGAATTTGAAAGGCT
ATCACTGGAAGACAAATCCTCCACCCTATTTGAAATTTTGCATATATTCCAGTGC
CAATCTTCAGCAGCTAATTTAAAAATGATTGGCGGACCTGGGAAAGCCGGCATCC
TAGTGATGAACAATAATATCTCCAAGTGTAACAAAATATCAATTATTAACCAATC
TCCGACAGGTATTTTTGAAAATGAAATAGACTTGCTTAAGATATAAGAAATCATC
CTTAGCGAAAGCTAAGGATTTTTTTTATCTGAAATTTATTATATCGCGTTGATTA
TTGATGCTGTTTTTAGTTTTAACGGCAATTAATATATGTGTTATTAATTGAATGA
ATTTTATCATTCATAATAAGTATGTGTAGGATCAAGCTCAGGTTAAATATTCACT
CAGGAAGTTATTACTCAGGAAGCAAAGAGGATTACAGAATTATCTCATAACAAGT
GTTAAGGGATGTTATTTCC
SEQ AATTCAAAGGATAATCAAAC
ID
NO:
172
SEQ AATCTCTACTCTTTGTAGAT
ID
NO:
173
SEQ AATTTCTACTGTTGTAGAT
ID
NO:
174
SEQ AATTTCTACTAGTGTAGAT
ID
NO:
175
SEQ AATTTCTACTATTGT
ID
NO:
176
SEQ AATTTCTACTGTTGTAGA
ID
NO:
177
SEQ AATTTCTACTATTGTA
ID
NO:
178
SEQ AATTTCTACTTTTGTAGAT
ID
NO:
179
SEQ AATTTCTACTGTTGTAGAT
ID
NO:
180
SEQ AATTTCTACTCTTGTAGAT
ID
NO:
181

Claims

1.-20. (canceled)

21. A nucleic acid-guided nuclease system comprising:

(a) (i) a nucleic acid-guided nuclease comprises an amino acid sequence having at least 90% sequence identity to the amino acid sequence of SEQ ID NO: 4, or (ii) a nucleic acid molecule encoding the nucleic acid-guided nuclease of (i); and

(b) (i) an engineered guide nucleic acid capable of complexing with the nucleic acid-guided nuclease, or (ii) a nucleic acid molecule encoding the engineered guide nucleic acid of (i), wherein the engineered guide nucleic acid recognizes a target region in a genome of a cell and a protospacer adjacent motif (PAM) sequence of TTTN.

22. The nucleic acid-guided nuclease system of claim 21, wherein the nucleic acid-guided nuclease is encoded by a nucleic acid molecule having at least 80% sequence identity to the nucleotide sequences of SEQ ID NO: 44 or SEQ ID NO: 24.

23. The nucleic acid-guided nuclease system of claim 21, wherein the nucleic acid molecule encoding the nucleic acid-guided nuclease is codon optimized for Escherichia Coli.

24. The nucleic acid-guided nuclease system of claim 21, wherein the nucleic acid molecule encoding the nucleic acid-guided nuclease is codon optimized for Saccharomyces cerevisiae.

25. The nucleic acid-guided nuclease system of claim 21, wherein the nucleic acid molecule encoding the nucleic acid-guided nuclease is codon optimized for mammalian cells.

26. The nucleic acid-guided nuclease system of claim 21, wherein the target region is within a coding region of a protein.

27. The nucleic acid-guided nuclease system of claim 21, wherein the target region is within a non-coding region of a protein.

28. The nucleic acid-guided nuclease system of claim 21, wherein the nucleic acid-guided nuclease comprises an amino acid sequence having at least 95% sequence identity to the amino acid sequence of SEQ ID NO: 4.

29. The nucleic acid-guided nuclease system of claim 21, wherein the nucleic acid-guided nuclease comprises the amino acid sequence of SEQ ID NO: 4.

30. A method of modifying a target region in the genome of a cell, the method comprising:

(a) contacting a cell with the nucleic acid-guided nuclease system of claim 21; and

(b) allowing the nucleic-acid guided nuclease system to create a genome edit in a target region of the genome of the cell.

31. The method of claim 30, wherein the method results in cell death.

32. The method of claim 30, wherein the nucleic acid-guided nuclease is encoded by a nucleic acid molecule having at least 80% sequence identity to the nucleotide sequences of SEQ ID NO: 44 or SEQ ID NO: 24.

33. The method of claim 30, wherein the nucleic acid-guided nuclease is encoded by a nucleic acid molecule having at least 85% sequence identity to the nucleotide sequences of SEQ ID NO: 44 or SEQ ID NO: 24.

34. The method of claim 30, wherein the nucleic acid molecule encoding the nucleic acid-guided nuclease is codon optimized for bacteria.

35. The method of claim 30, wherein the nucleic acid molecule encoding the nucleic acid-guided nuclease is codon optimized for mammalian cells.

36. The method of claim 30, wherein the nucleic acid-guided nuclease comprises an amino acid sequence having at least 95% sequence identity to the amino acid sequence of SEQ ID NO: 4.

37. The method of claim 30, wherein the nucleic acid-guided nuclease comprises the amino acid sequence of SEQ ID NO: 4.

38. The method of claim 30, wherein the target region is within a bacterial cell.

39. The method of claim 30, wherein the target region is within a plant cell.

40. The method of claim 30, wherein the target region is within a mammalian cell.

Resources

Images & Drawings included:

Sources:

Similar patent applications:

Recent applications in this class: