; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI01G04980 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI01G04980
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionTy3/gypsy retrotransposon protein
Genome locationChr1:3219842..3222495
RNA-Seq ExpressionCSPI01G04980
SyntenyCSPI01G04980
Gene Ontology termsGO:0006397 - mRNA processing (biological process)
GO:0005634 - nucleus (cellular component)
GO:0004721 - phosphoprotein phosphatase activity (molecular function)
GO:0005488 - binding (molecular function)
InterPro domainsIPR006811 - RNA polymerase II subunit A
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0033137.1 polyprotein [Cucumis melo var. makuwa]1.9e-3749.19Show/hide
Query:  LNNVTVADKFPLPIVDELFNELNGA--------------IRMHPADMENPAFHTYEGHYEFLVMPFALINAPFTFQALMNQIFKSYLR-----KSEYLGH
        LNNVT+  KFP+P+++ELF+EL GA              I+M P D+E  AF T+EGHYEFLVMPF L NAP TFQALMNQ+FK YLR     +  YLGH
Subjt:  LNNVTVADKFPLPIVDELFNELNGA--------------IRMHPADMENPAFHTYEGHYEFLVMPFALINAPFTFQALMNQIFKSYLR-----KSEYLGH

Query:  IISSEGVEADPNKIRAVLDWPATTCIREVWSFL--------------------------GAYQWLAEANEAFEGLKNAMVTLPVL
         IS +G+E DP KI+A+ +WP  T +REV  FL                          GAYQW  EA   FE LK AM+TLPVL
Subjt:  IISSEGVEADPNKIRAVLDWPATTCIREVWSFL--------------------------GAYQWLAEANEAFEGLKNAMVTLPVL

KAA0037052.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]8.2e-3648.37Show/hide
Query:  LNNVTVADKFPLPIVDELFNELNGA--------------IRMHPADMENPAFHTYEGHYEFLVMPFALINAPFTFQALMNQIFKSYLR-----KSEYLGH
        LNNVT+ DKFP+ +++ELF+ELNG               IRM P D+E  AF T+EGHYEFLVMPF L NAP TFQALMNQ+FK Y+R     +  YLGH
Subjt:  LNNVTVADKFPLPIVDELFNELNGA--------------IRMHPADMENPAFHTYEGHYEFLVMPFALINAPFTFQALMNQIFKSYLR-----KSEYLGH

Query:  IISSEGVEADPNKIRAVLDWPATTCIREVWSFL--------------------------GAYQWLAEANEAFEGLKNAMVTLPV
         IS + +E DP KIRA+ +WP  T +REV  FL                          GAY+W  EA+ AFE LK AM+TLP+
Subjt:  IISSEGVEADPNKIRAVLDWPATTCIREVWSFL--------------------------GAYQWLAEANEAFEGLKNAMVTLPV

KAA0066118.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]1.8e-3541.67Show/hide
Query:  LNNVTVADKFPLPIVDELFNELNGA--------------IRMHPADMENPAFHTYEGHYEFLVMPFALINAPFTFQALMNQIFKSYLR------------
        LNNVTV DKFP+P+V+ELF+ELNGA              IRMHP D+E  AF T+EGHYEF+VMPF L NAP TFQALMNQ+FK +LR            
Subjt:  LNNVTVADKFPLPIVDELFNELNGA--------------IRMHPADMENPAFHTYEGHYEFLVMPFALINAPFTFQALMNQIFKSYLR------------

Query:  ------------------------------------KSEYLGHIISSEGVEADPNKIRAVLDWPATTCIREVWSFL------------------------
                                            +  YLGH IS +G+E DP KIRAV +WPA + +RE+  FL                        
Subjt:  ------------------------------------KSEYLGHIISSEGVEADPNKIRAVLDWPATTCIREVWSFL------------------------

Query:  --GAYQWLAEANEAFEGLKNAMVTLPVL
          GAY+W  E   AFE LK AM+TLPVL
Subjt:  --GAYQWLAEANEAFEGLKNAMVTLPVL

TYK06599.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]8.2e-3648.37Show/hide
Query:  LNNVTVADKFPLPIVDELFNELNGA--------------IRMHPADMENPAFHTYEGHYEFLVMPFALINAPFTFQALMNQIFKSYLR-----KSEYLGH
        LNNVT+ DKFP+ +++ELF+ELNG               IRM P D+E  AF T+EGHYEFLVMPF L NAP TFQALMNQ+FK Y+R     +  YLGH
Subjt:  LNNVTVADKFPLPIVDELFNELNGA--------------IRMHPADMENPAFHTYEGHYEFLVMPFALINAPFTFQALMNQIFKSYLR-----KSEYLGH

Query:  IISSEGVEADPNKIRAVLDWPATTCIREVWSFL--------------------------GAYQWLAEANEAFEGLKNAMVTLPV
         IS + +E DP KIRA+ +WP  T +REV  FL                          GAY+W  EA+ AFE LK AM+TLP+
Subjt:  IISSEGVEADPNKIRAVLDWPATTCIREVWSFL--------------------------GAYQWLAEANEAFEGLKNAMVTLPV

TYK15157.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]1.8e-3541.67Show/hide
Query:  LNNVTVADKFPLPIVDELFNELNGA--------------IRMHPADMENPAFHTYEGHYEFLVMPFALINAPFTFQALMNQIFKSYLR------------
        LNNVTV DKFP+P+V+ELF+ELNGA              IRMHP D+E  AF T+EGHYEF+VMPF L NAP TFQALMNQ+FK +LR            
Subjt:  LNNVTVADKFPLPIVDELFNELNGA--------------IRMHPADMENPAFHTYEGHYEFLVMPFALINAPFTFQALMNQIFKSYLR------------

Query:  ------------------------------------KSEYLGHIISSEGVEADPNKIRAVLDWPATTCIREVWSFL------------------------
                                            +  YLGH IS +G+E DP KIRAV +WPA + +RE+  FL                        
Subjt:  ------------------------------------KSEYLGHIISSEGVEADPNKIRAVLDWPATTCIREVWSFL------------------------

Query:  --GAYQWLAEANEAFEGLKNAMVTLPVL
          GAY+W  E   AFE LK AM+TLPVL
Subjt:  --GAYQWLAEANEAFEGLKNAMVTLPVL

TrEMBL top hitse value%identityAlignment
A0A5A7SUY2 Polyprotein9.4e-3849.19Show/hide
Query:  LNNVTVADKFPLPIVDELFNELNGA--------------IRMHPADMENPAFHTYEGHYEFLVMPFALINAPFTFQALMNQIFKSYLR-----KSEYLGH
        LNNVT+  KFP+P+++ELF+EL GA              I+M P D+E  AF T+EGHYEFLVMPF L NAP TFQALMNQ+FK YLR     +  YLGH
Subjt:  LNNVTVADKFPLPIVDELFNELNGA--------------IRMHPADMENPAFHTYEGHYEFLVMPFALINAPFTFQALMNQIFKSYLR-----KSEYLGH

Query:  IISSEGVEADPNKIRAVLDWPATTCIREVWSFL--------------------------GAYQWLAEANEAFEGLKNAMVTLPVL
         IS +G+E DP KI+A+ +WP  T +REV  FL                          GAYQW  EA   FE LK AM+TLPVL
Subjt:  IISSEGVEADPNKIRAVLDWPATTCIREVWSFL--------------------------GAYQWLAEANEAFEGLKNAMVTLPVL

A0A5A7U2S1 Ty3/gypsy retrotransposon protein1.2e-3541.67Show/hide
Query:  LNNVTVADKFPLPIVDELFNELNGA--------------IRMHPADMENPAFHTYEGHYEFLVMPFALINAPFTFQALMNQIFKSYLR------------
        LNNVTV DKFP+P+V+ELF+ELNGA              IRMHP D+E  AF T+EGHYEF+VMPF L NAP TFQALMNQ+FK +LR            
Subjt:  LNNVTVADKFPLPIVDELFNELNGA--------------IRMHPADMENPAFHTYEGHYEFLVMPFALINAPFTFQALMNQIFKSYLR------------

Query:  ------------------------------------KSEYLGHIISSEGVEADPNKIRAVLDWPATTCIREVWSFL------------------------
                                            +  YLGH IS +G+E DP KIRAV +WPA   +RE+  FL                        
Subjt:  ------------------------------------KSEYLGHIISSEGVEADPNKIRAVLDWPATTCIREVWSFL------------------------

Query:  --GAYQWLAEANEAFEGLKNAMVTLPVL
          GAY+W  E   AFE LK AM+TLPVL
Subjt:  --GAYQWLAEANEAFEGLKNAMVTLPVL

A0A5A7VEI2 Ty3/gypsy retrotransposon protein8.8e-3641.67Show/hide
Query:  LNNVTVADKFPLPIVDELFNELNGA--------------IRMHPADMENPAFHTYEGHYEFLVMPFALINAPFTFQALMNQIFKSYLR------------
        LNNVTV DKFP+P+V+ELF+ELNGA              IRMHP D+E  AF T+EGHYEF+VMPF L NAP TFQALMNQ+FK +LR            
Subjt:  LNNVTVADKFPLPIVDELFNELNGA--------------IRMHPADMENPAFHTYEGHYEFLVMPFALINAPFTFQALMNQIFKSYLR------------

Query:  ------------------------------------KSEYLGHIISSEGVEADPNKIRAVLDWPATTCIREVWSFL------------------------
                                            +  YLGH IS +G+E DP KIRAV +WPA + +RE+  FL                        
Subjt:  ------------------------------------KSEYLGHIISSEGVEADPNKIRAVLDWPATTCIREVWSFL------------------------

Query:  --GAYQWLAEANEAFEGLKNAMVTLPVL
          GAY+W  E   AFE LK AM+TLPVL
Subjt:  --GAYQWLAEANEAFEGLKNAMVTLPVL

A0A5D3CTA1 Ty3/gypsy retrotransposon protein8.8e-3641.67Show/hide
Query:  LNNVTVADKFPLPIVDELFNELNGA--------------IRMHPADMENPAFHTYEGHYEFLVMPFALINAPFTFQALMNQIFKSYLR------------
        LNNVTV DKFP+P+V+ELF+ELNGA              IRMHP D+E  AF T+EGHYEF+VMPF L NAP TFQALMNQ+FK +LR            
Subjt:  LNNVTVADKFPLPIVDELFNELNGA--------------IRMHPADMENPAFHTYEGHYEFLVMPFALINAPFTFQALMNQIFKSYLR------------

Query:  ------------------------------------KSEYLGHIISSEGVEADPNKIRAVLDWPATTCIREVWSFL------------------------
                                            +  YLGH IS +G+E DP KIRAV +WPA + +RE+  FL                        
Subjt:  ------------------------------------KSEYLGHIISSEGVEADPNKIRAVLDWPATTCIREVWSFL------------------------

Query:  --GAYQWLAEANEAFEGLKNAMVTLPVL
          GAY+W  E   AFE LK AM+TLPVL
Subjt:  --GAYQWLAEANEAFEGLKNAMVTLPVL

A0A5D3CXB1 Ty3/gypsy retrotransposon protein1.2e-3541.67Show/hide
Query:  LNNVTVADKFPLPIVDELFNELNGA--------------IRMHPADMENPAFHTYEGHYEFLVMPFALINAPFTFQALMNQIFKSYLR------------
        LNNVTV DKFP+P+V+ELF+ELNGA              IRMHP D+E  AF T+EGHYEF+VMPF L NAP TFQALMNQ+FK +LR            
Subjt:  LNNVTVADKFPLPIVDELFNELNGA--------------IRMHPADMENPAFHTYEGHYEFLVMPFALINAPFTFQALMNQIFKSYLR------------

Query:  ------------------------------------KSEYLGHIISSEGVEADPNKIRAVLDWPATTCIREVWSFL------------------------
                                            +  YLGH IS +G+E DP KIRAV +WPA   +RE+  FL                        
Subjt:  ------------------------------------KSEYLGHIISSEGVEADPNKIRAVLDWPATTCIREVWSFL------------------------

Query:  --GAYQWLAEANEAFEGLKNAMVTLPVL
          GAY+W  E   AFE LK AM+TLPVL
Subjt:  --GAYQWLAEANEAFEGLKNAMVTLPVL

SwissProt top hitse value%identityAlignment
P04323 Retrovirus-related Pol polyprotein from transposon 17.65.4e-1428.25Show/hide
Query:  LNNVTVADKFPLPIVDELFNELNGA--------------IRMHPADMENPAFHTYEGHYEFLVMPFALINAPFTFQALMNQIFKSYLRK-----------
        LN +TV D+ P+P +DE+  +L                 I M P  +   AF T  GHYE+L MPF L NAP TFQ  MN I +  L K           
Subjt:  LNNVTVADKFPLPIVDELFNELNGA--------------IRMHPADMENPAFHTYEGHYEFLVMPFALINAPFTFQALMNQIFKSYLRK-----------

Query:  -------------------------------------SEYLGHIISSEGVEADPNKIRAVLDWPATTCIREVWSFLG
                                             + +LGH+++ +G++ +P KI A+  +P  T  +E+ +FLG
Subjt:  -------------------------------------SEYLGHIISSEGVEADPNKIRAVLDWPATTCIREVWSFLG

P20825 Retrovirus-related Pol polyprotein from transposon 2973.1e-1427.18Show/hide
Query:  KYDDVFDLQKNCLRAGEVLNNVTVADKFPLPIVDELFNELNGA--------------IRMHPADMENPAFHTYEGHYEFLVMPFALINAPFTFQALMNQI
        KY  V D +K        LN +T+ D++P+P +DE+  +L                 I M    +   AF T  GHYE+L MPF L NAP TFQ  MN I
Subjt:  KYDDVFDLQKNCLRAGEVLNNVTVADKFPLPIVDELFNELNGA--------------IRMHPADMENPAFHTYEGHYEFLVMPFALINAPFTFQALMNQI

Query:  FKSYLRK------------------------------------------------SEYLGHIISSEGVEADPNKIRAVLDWPATTCIREVWSFLG
         +  L K                                                + +LGHI++ +G++ +P K++A++ +P  T  +E+ +FLG
Subjt:  FKSYLRK------------------------------------------------SEYLGHIISSEGVEADPNKIRAVLDWPATTCIREVWSFLG

Q8I7P9 Retrovirus-related Pol polyprotein from transposon opus1.0e-1226.55Show/hide
Query:  LNNVTVADKFPLPIVDELFNELNGA--------------IRMHPADMENPAFHTYEGHYEFLVMPFALINAPFTFQALMNQIFKSYLRK-----------
        LN VT+ D +P+P ++     L  A              I M  +D+   AF T  G YEFL +PF L NAP  FQ +++ I + ++ K           
Subjt:  LNNVTVADKFPLPIVDELFNELNGA--------------IRMHPADMENPAFHTYEGHYEFLVMPFALINAPFTFQALMNQIFKSYLRK-----------

Query:  -------------------------------------SEYLGHIISSEGVEADPNKIRAVLDWPATTCIREVWSFLG
                                              E+LG+I++++G++ADP K+RA+ + P  T ++E+  FLG
Subjt:  -------------------------------------SEYLGHIISSEGVEADPNKIRAVLDWPATTCIREVWSFLG

Q9CY97 RNA polymerase II subunit A C-terminal domain phosphatase SSU721.9e-1132.26Show/hide
Query:  KNGVAPSLFLETPLQKKP-----CMDLSKLFITFQDARKFGDNRLGCHREPHLNTRDHAMMKTVLIVNLEVKDNHEEAAIGARLTFDLCQEIERTESWED
        +NG+   L     ++ +P     C DL  L +T ++       R+       LN+R+    + V +VN++++DNHEEA +GA L  +LCQ I+ TE  E+
Subjt:  KNGVAPSLFLETPLQKKP-----CMDLSKLFITFQDARKFGDNRLGCHREPHLNTRDHAMMKTVLIVNLEVKDNHEEAAIGARLTFDLCQEIERTESWED

Query:  SIDDIVINFEKQLRRKLLYSISFY
         ID+++  FE++  R  L+++ FY
Subjt:  SIDDIVINFEKQLRRKLLYSISFY

Q9NP77 RNA polymerase II subunit A C-terminal domain phosphatase SSU721.9e-1132.26Show/hide
Query:  KNGVAPSLFLETPLQKKP-----CMDLSKLFITFQDARKFGDNRLGCHREPHLNTRDHAMMKTVLIVNLEVKDNHEEAAIGARLTFDLCQEIERTESWED
        +NG+   L     ++ +P     C DL  L +T ++       R+       LN+R+    + V +VN++++DNHEEA +GA L  +LCQ I+ TE  E+
Subjt:  KNGVAPSLFLETPLQKKP-----CMDLSKLFITFQDARKFGDNRLGCHREPHLNTRDHAMMKTVLIVNLEVKDNHEEAAIGARLTFDLCQEIERTESWED

Query:  SIDDIVINFEKQLRRKLLYSISFY
         ID+++  FE++  R  L+++ FY
Subjt:  SIDDIVINFEKQLRRKLLYSISFY

Arabidopsis top hitse value%identityAlignment
AT1G73820.1 Ssu72-like family protein2.6e-2470.83Show/hide
Query:  LNTRDHAMMKTVLIVNLEVKDNHEEAAIGARLTFDLCQEIERTESWEDSIDDIVINFEKQLRRKLLYSISFY
        LN R+ ++ KT+L++NLEVKDNHEEAAIG RL  +LCQEIE  E+WED+IDDIV  FEKQ RRKL+YSISFY
Subjt:  LNTRDHAMMKTVLIVNLEVKDNHEEAAIGARLTFDLCQEIERTESWEDSIDDIVINFEKQLRRKLLYSISFY

ATMG00860.1 DNA/RNA polymerases superfamily protein3.2e-0636.26Show/hide
Query:  YLG--HIISSEGVEADPNKIRAVLDWPATTCIREVWSFLG--------------------------AYQWLAEANEAFEGLKNAMVTLPVL
        YLG  HIIS EGV ADP K+ A++ WP      E+  FLG                          + +W   A  AF+ LK A+ TLPVL
Subjt:  YLG--HIISSEGVEADPNKIRAVLDWPATTCIREVWSFLG--------------------------AYQWLAEANEAFEGLKNAMVTLPVL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCAATTCGTGGAGATTCAGAAACTACTGAAGATGGAGGAATCCATGAAGTCTTTGTCGAAAAGTGTGTAATGGTTGAATGTGCAAGGGATAGTCAACAACAAAGTTC
GGAAGAGATGAAGAAGTTGTTGACGGATCTATTGATTGAGATTGCGCAAGTGCGATCTAATACAATTTCAAATACAGACAGAGGAGAGTCAAATGGAATTTTTCGACCAA
ACCGACACCGAACCGCCTATGGTCGGTTTAGTAAATGTTCAAATAGACCTCGATCATCAATGAGTAAATGTCGATCTTTGCAAGGAATGGAGTCTTTGCAGGGATTCATG
AAGTCTTGGAAGCAATATTTCAATCCAACTGAAGCTTTGACAAAAGTCTTACTGAAGTACGATGATGTATTTGATTTGCAGAAGAATTGCCTTCGAGCTGGGGAGGTGTT
GAACAACGTTACTGTTGCAGATAAATTCCCATTACCTATAGTAGATGAGTTGTTCAATGAATTAAATGGGGCAATTAGGATGCATCCTGCTGATATGGAGAACCCTGCTT
TTCATACATATGAGGGTCATTACGAATTCTTGGTAATGCCTTTCGCATTGATCAATGCACCATTTACATTTCAAGCTTTGATGAATCAAATTTTTAAATCATACTTGAGG
AAATCAGAGTATTTGGGGCATATTATTTCTAGTGAAGGAGTAGAAGCTGACCCCAACAAGATACGAGCCGTGTTGGATTGGCCGGCTACTACATGTATTCGAGAGGTGTG
GAGTTTTCTAGGAGCTTATCAATGGTTGGCTGAAGCAAATGAAGCGTTTGAAGGATTGAAGAACGCCATGGTGACTTTACCTGTGTTGTTGGGATACGTTCTTAACACGA
CACAACATGGTGCTTGGTGTGGCATTTTCCTTAAAAGATACTCATATCGAGACATTCTACACATGGGAGGAGTGATGTGGCGTCATGGCCTTCCAACTTACAAAAATGGT
GTCGCTCCATCACTTTTTCTAGAGACTCCTCTCCAAAAGAAACCTTGTATGGATCTCTCCAAACTCTTTATAACCTTTCAAGATGCTCGAAAATTTGGAGATAATAGATT
GGGATGCCATAGAGAACCACATTTGAACACCCGTGATCATGCAATGATGAAAACTGTACTGATTGTTAATTTGGAGGTAAAAGATAACCACGAGGAGGCAGCCATAGGAG
CACGACTTACTTTTGATCTTTGCCAGGAGATTGAACGGACTGAATCATGGGAAGATTCTATAGATGACATCGTGATTAACTTCGAAAAACAGCTGAGAAGAAAACTGTTG
TATAGCATTTCCTTTTACTGA
mRNA sequenceShow/hide mRNA sequence
ATGGCAATTCGTGGAGATTCAGAAACTACTGAAGATGGAGGAATCCATGAAGTCTTTGTCGAAAAGTGTGTAATGGTTGAATGTGCAAGGGATAGTCAACAACAAAGTTC
GGAAGAGATGAAGAAGTTGTTGACGGATCTATTGATTGAGATTGCGCAAGTGCGATCTAATACAATTTCAAATACAGACAGAGGAGAGTCAAATGGAATTTTTCGACCAA
ACCGACACCGAACCGCCTATGGTCGGTTTAGTAAATGTTCAAATAGACCTCGATCATCAATGAGTAAATGTCGATCTTTGCAAGGAATGGAGTCTTTGCAGGGATTCATG
AAGTCTTGGAAGCAATATTTCAATCCAACTGAAGCTTTGACAAAAGTCTTACTGAAGTACGATGATGTATTTGATTTGCAGAAGAATTGCCTTCGAGCTGGGGAGGTGTT
GAACAACGTTACTGTTGCAGATAAATTCCCATTACCTATAGTAGATGAGTTGTTCAATGAATTAAATGGGGCAATTAGGATGCATCCTGCTGATATGGAGAACCCTGCTT
TTCATACATATGAGGGTCATTACGAATTCTTGGTAATGCCTTTCGCATTGATCAATGCACCATTTACATTTCAAGCTTTGATGAATCAAATTTTTAAATCATACTTGAGG
AAATCAGAGTATTTGGGGCATATTATTTCTAGTGAAGGAGTAGAAGCTGACCCCAACAAGATACGAGCCGTGTTGGATTGGCCGGCTACTACATGTATTCGAGAGGTGTG
GAGTTTTCTAGGAGCTTATCAATGGTTGGCTGAAGCAAATGAAGCGTTTGAAGGATTGAAGAACGCCATGGTGACTTTACCTGTGTTGTTGGGATACGTTCTTAACACGA
CACAACATGGTGCTTGGTGTGGCATTTTCCTTAAAAGATACTCATATCGAGACATTCTACACATGGGAGGAGTGATGTGGCGTCATGGCCTTCCAACTTACAAAAATGGT
GTCGCTCCATCACTTTTTCTAGAGACTCCTCTCCAAAAGAAACCTTGTATGGATCTCTCCAAACTCTTTATAACCTTTCAAGATGCTCGAAAATTTGGAGATAATAGATT
GGGATGCCATAGAGAACCACATTTGAACACCCGTGATCATGCAATGATGAAAACTGTACTGATTGTTAATTTGGAGGTAAAAGATAACCACGAGGAGGCAGCCATAGGAG
CACGACTTACTTTTGATCTTTGCCAGGAGATTGAACGGACTGAATCATGGGAAGATTCTATAGATGACATCGTGATTAACTTCGAAAAACAGCTGAGAAGAAAACTGTTG
TATAGCATTTCCTTTTACTGA
Protein sequenceShow/hide protein sequence
MAIRGDSETTEDGGIHEVFVEKCVMVECARDSQQQSSEEMKKLLTDLLIEIAQVRSNTISNTDRGESNGIFRPNRHRTAYGRFSKCSNRPRSSMSKCRSLQGMESLQGFM
KSWKQYFNPTEALTKVLLKYDDVFDLQKNCLRAGEVLNNVTVADKFPLPIVDELFNELNGAIRMHPADMENPAFHTYEGHYEFLVMPFALINAPFTFQALMNQIFKSYLR
KSEYLGHIISSEGVEADPNKIRAVLDWPATTCIREVWSFLGAYQWLAEANEAFEGLKNAMVTLPVLLGYVLNTTQHGAWCGIFLKRYSYRDILHMGGVMWRHGLPTYKNG
VAPSLFLETPLQKKPCMDLSKLFITFQDARKFGDNRLGCHREPHLNTRDHAMMKTVLIVNLEVKDNHEEAAIGARLTFDLCQEIERTESWEDSIDDIVINFEKQLRRKLL
YSISFY