; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lcy06g017430 (gene) of Sponge gourd (P93075) v1 genome

Gene IDLcy06g017430
OrganismLuffa cylindrica cv. P93075 (Sponge gourd (P93075) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationChr06:40075123..40077330
RNA-Seq ExpressionLcy06g017430
SyntenyLcy06g017430
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_010424607.1 PREDICTED: uncharacterized protein LOC104709741 [Camelina sativa]1.0e-2430.33Show/hide
Query:  RFVVNPDFDQKFQSLKVLDLSFHTFDHMPIVAIIESNPQKRKRNPNRPLKFEEPWTTFPESKEIIKERWNRDQGRDVEAFARKISSCLEGLALWNKERLG
        R + N D+   F       L     DH PIV  I +   + KR+      F+  W       ++I   W+RD  +D + F +K+ +C + ++ W K ++ 
Subjt:  RFVVNPDFDQKFQSLKVLDLSFHTFDHMPIVAIIESNPQKRKRNPNRPLKFEEPWTTFPESKEIIKERWNRDQGRDVEAFARKISSCLEGLALWNKERLG

Query:  GSISKVVDQKLKQIKDIELQDSGFPSQSLMKAERELEMLFEEEEKFWKIRSREDWLKWGDKSTKWFHNKASQRRKRNEIVGLVSRERIWEEDEEKIGYIA
             + D  LK+  ++ L D   P  ++ + +  L   + +EE +W  +SR  W+K GDK++K+FH    QRR RN I GL +R+ IW  ++  I + A
Subjt:  GSISKVVDQKLKQIKDIELQDSGFPSQSLMKAERELEMLFEEEEKFWKIRSREDWLKWGDKSTKWFHNKASQRRKRNEIVGLVSRERIWEEDEEKIGYIA

Query:  FSYFKDLFASSNPSNDRLEFTFKGIDKALSDRDNKNLTSAFTKA
         SYF+DLF  +NP N   E   + ++  ++  DN+ LT   T+A
Subjt:  FSYFKDLFASSNPSNDRLEFTFKGIDKALSDRDNKNLTSAFTKA

XP_015382226.1 uncharacterized protein LOC107175329, partial [Citrus sinensis]5.9e-2533.65Show/hide
Query:  QKRKRNPNRPLKFEEPWTTFPESKEIIKERWNR----DQGRDVEAFARKISSCLEGLALWNKERLGGSISKVVDQKLKQIKDIELQDSGFP-SQSLMKAE
        Q+ KR   R + +E+ W+ + + +EI+K+ W +    ++   VE F +K    L  L LW+K+  GG   K ++Q   ++K I    S +     L K E
Subjt:  QKRKRNPNRPLKFEEPWTTFPESKEIIKERWNR----DQGRDVEAFARKISSCLEGLALWNKERLGGSISKVVDQKLKQIKDIELQDSGFP-SQSLMKAE

Query:  RELEMLFEEEEKFWKIRSREDWLKWGDKSTKWFHNKASQRRKRNEIVGLVSRERIWEEDEEKIGYIAFSYFKDLFASSNPSNDRLEFTFKGIDKALSDRD
         +++ + ++EE FWK RSR DWLK GDK+TK+FH KAS RRK+N I G++  +  W ED +++  I   +F  LF+++ P+ ++++  FK     +++  
Subjt:  RELEMLFEEEEKFWKIRSREDWLKWGDKSTKWFHNKASQRRKRNEIVGLVSRERIWEEDEEKIGYIAFSYFKDLFASSNPSNDRLEFTFKGIDKALSDRD

Query:  NKNLTSAF
        N  L + F
Subjt:  NKNLTSAF

XP_015382608.1 uncharacterized protein LOC107175577 [Citrus sinensis]1.7e-2429.64Show/hide
Query:  QKTKRFVVNPDFDQKFQSLKVLDLSFHTFDHMPIVAIIESNPQK--RKRNPNRPLKFEEPWTTFPESKEIIKERW----NRDQGRDVEAFARKISSCLEG
        ++  RFV N  +   F +   ++L+  T DH P++ +++   ++    R  +  + +E+ W+ +   KEI+ E W    N   G  V  F +     +  
Subjt:  QKTKRFVVNPDFDQKFQSLKVLDLSFHTFDHMPIVAIIESNPQK--RKRNPNRPLKFEEPWTTFPESKEIIKERW----NRDQGRDVEAFARKISSCLEG

Query:  LALWNKERLGGSISKVVDQKLKQIKDIELQDSGFPSQSLMK-AERELEMLFEEEEKFWKIRSREDWLKWGDKSTKWFHNKASQRRKRNEIVGLVSRERIW
        L  W+K    G   K ++Q  KQ++  + +   + S+  +K  E +++ +  +EE +WK RSR DWLK GDK+TK+FH+KAS R+K+N I G+ +    W
Subjt:  LALWNKERLGGSISKVVDQKLKQIKDIELQDSGFPSQSLMK-AERELEMLFEEEEKFWKIRSREDWLKWGDKSTKWFHNKASQRRKRNEIVGLVSRERIW

Query:  EEDEEKIGYIAFSYFKDLFASSNPSNDRLEFTFKGIDKALSDRDNKNLTSAFT
         +  + +     +YF +LF ++ PS +++E   KGI   +S+  N++L   FT
Subjt:  EEDEEKIGYIAFSYFKDLFASSNPSNDRLEFTFKGIDKALSDRDNKNLTSAFT

XP_024037590.1 uncharacterized protein LOC112097210 [Citrus clementina]9.1e-2630.42Show/hide
Query:  QIIELKRNQKTKRFVVNPDFDQKFQSLKVLDLSFHTFDHMPIVAIIESNPQKR--KRNPNRPLKFEEPWTTFPESKEIIKERWNR----DQGRDVEAFAR
        Q+IE    ++  RF+ + D++Q   +L V +L     DH P++  ++   +    K+N +  + +E+ W+ +   K I+KE W +     QG  V  F +
Subjt:  QIIELKRNQKTKRFVVNPDFDQKFQSLKVLDLSFHTFDHMPIVAIIESNPQKR--KRNPNRPLKFEEPWTTFPESKEIIKERWNR----DQGRDVEAFAR

Query:  KISSCLEGLALWNKERLGGSISKV--VDQKLKQIKDIELQDSGFPSQSLMKAERELEMLFEEEEKFWKIRSREDWLKWGDKSTKWFHNKASQRRKRNEIV
          ++CL  L +W++    G   K+  + +KL ++K    Q        +   E ++E +  +EE +WK RSR DWLK GDK+TK+FH+KAS R+++N I 
Subjt:  KISSCLEGLALWNKERLGGSISKV--VDQKLKQIKDIELQDSGFPSQSLMKAERELEMLFEEEEKFWKIRSREDWLKWGDKSTKWFHNKASQRRKRNEIV

Query:  GLVSRERIWEEDEEKIGYIAFSYFKDLFASSNPSNDRLEFTFKGIDKALSDRDNKNLTSAFTK
        G++ +  +W +D E +      YF  LF +S+PS  ++E    G+   ++   N  L S FT+
Subjt:  GLVSRERIWEEDEEKIGYIAFSYFKDLFASSNPSNDRLEFTFKGIDKALSDRDNKNLTSAFTK

XP_024038343.1 uncharacterized protein LOC112097373 [Citrus clementina]1.0e-2429.92Show/hide
Query:  QKTKRFVVNPDFDQKFQSLKVLDLSFHTFDHMPIVAIIESNPQKRKRNPNRP--LKFEEPWTTFPESKEIIKERWNRDQG-----RDVEAFARKISSCLE
        ++  RFV N  +   F      ++   T DH P+V  ++        N  R   + +E+ W+ +   KEII++ W+  QG       V  F +   + + 
Subjt:  QKTKRFVVNPDFDQKFQSLKVLDLSFHTFDHMPIVAIIESNPQKRKRNPNRP--LKFEEPWTTFPESKEIIKERWNRDQG-----RDVEAFARKISSCLE

Query:  GLALWNKERLGGSISKVVDQKLKQIKDIELQDSGF-PSQSLMKAERELEMLFEEEEKFWKIRSREDWLKWGDKSTKWFHNKASQRRKRNEIVGLVSRERI
         L LW+KE   G   K +++ + Q++ ++L    +     + + ER+++ +  ++E +WK RSR DWLK GDK+TK+FH+KAS R+K+N I G+ +    
Subjt:  GLALWNKERLGGSISKVVDQKLKQIKDIELQDSGF-PSQSLMKAERELEMLFEEEEKFWKIRSREDWLKWGDKSTKWFHNKASQRRKRNEIVGLVSRERI

Query:  WEEDEEKIGYIAFSYFKDLFASSNPSNDRLEFTFKGIDKALSDRDNKNLTSAFT
        W E+ E + +    YF +LF +S P+ D++     GI + +S   N++L   FT
Subjt:  WEEDEEKIGYIAFSYFKDLFASSNPSNDRLEFTFKGIDKALSDRDNKNLTSAFT

TrEMBL top hitse value%identityAlignment
A0A2N9GII4 Uncharacterized protein3.4e-2631.58Show/hide
Query:  QKTKRFVVNPDFDQKFQSLKVLDLSFHTFDHMPIVAIIESNPQKRKRNPNRPLKFEEPWTTFPESKEIIKERWNRDQGRDVEAF--ARKISSCLEGLALW
        ++  R V    +   F  + V  L     DH+PI+ +  +N + + RN  R  +FEE W T P+ + +I+  W  + G     F    KI  C  GLA W
Subjt:  QKTKRFVVNPDFDQKFQSLKVLDLSFHTFDHMPIVAIIESNPQKRKRNPNRPLKFEEPWTTFPESKEIIKERWNRDQGRDVEAF--ARKISSCLEGLALW

Query:  NKERLGGSISKVVDQKLKQIKDIELQDSGFPSQSLMKAERELEMLFEEEEKFWKIRSREDWLKWGDKSTKWFHNKASQRRKRNEIVGLVSRERIWEEDEE
        +K+  GGS    +  + + ++ +   D G     +   + E+  L   +E  WK RSR  WLK GD +TK+FHN A+QR++ N+I GL++ +  W  +  
Subjt:  NKERLGGSISKVVDQKLKQIKDIELQDSGFPSQSLMKAERELEMLFEEEEKFWKIRSREDWLKWGDKSTKWFHNKASQRRKRNEIVGLVSRERIWEEDEE

Query:  KIGYIAFSYFKDLFASSNPSNDRLEFTFKGIDKALSDRDNKNLTSAF
        ++  I+  YFKD+F SS P   R+E   + +D+ ++   N+ LT  F
Subjt:  KIGYIAFSYFKDLFASSNPSNDRLEFTFKGIDKALSDRDNKNLTSAF

A0A2N9HE04 Reverse transcriptase domain-containing protein3.4e-2631.58Show/hide
Query:  QKTKRFVVNPDFDQKFQSLKVLDLSFHTFDHMPIVAIIESNPQKRKRNPNRPLKFEEPWTTFPESKEIIKERWNRDQGRDVEAF--ARKISSCLEGLALW
        ++  R V    +   F  + V  L     DH+PI+ +  +N + + RN  R  +FEE W T P+ + +I+  W  + G     F    KI  C  GLA W
Subjt:  QKTKRFVVNPDFDQKFQSLKVLDLSFHTFDHMPIVAIIESNPQKRKRNPNRPLKFEEPWTTFPESKEIIKERWNRDQGRDVEAF--ARKISSCLEGLALW

Query:  NKERLGGSISKVVDQKLKQIKDIELQDSGFPSQSLMKAERELEMLFEEEEKFWKIRSREDWLKWGDKSTKWFHNKASQRRKRNEIVGLVSRERIWEEDEE
        +K+  GGS    +  + + ++ +   D G     +   + E+  L   +E  WK RSR  WLK GD +TK+FHN A+QR++ N+I GL++ +  W  +  
Subjt:  NKERLGGSISKVVDQKLKQIKDIELQDSGFPSQSLMKAERELEMLFEEEEKFWKIRSREDWLKWGDKSTKWFHNKASQRRKRNEIVGLVSRERIWEEDEE

Query:  KIGYIAFSYFKDLFASSNPSNDRLEFTFKGIDKALSDRDNKNLTSAF
        ++  I+  YFKD+F SS P   R+E   + +D+ ++   N+ LT  F
Subjt:  KIGYIAFSYFKDLFASSNPSNDRLEFTFKGIDKALSDRDNKNLTSAF

A0A2N9I611 Uncharacterized protein2.1e-2833.33Show/hide
Query:  QKFQSLKVLDLSFHTFDHMPIVAIIESNPQKRKRNPNRPLKFEEPWTTFPESKEIIKERWNRDQGRDVEAF--ARKISSCLEGLALWNKERLGGSISKVV
        + F++  ++++     DHM I  ++  N Q RKR P R  +FE+ WT   E +++I + W   +  +   F    K+  C + L  W+K    G+I   +
Subjt:  QKFQSLKVLDLSFHTFDHMPIVAIIESNPQKRKRNPNRPLKFEEPWTTFPESKEIIKERWNRDQGRDVEAF--ARKISSCLEGLALWNKERLGGSISKVV

Query:  DQKLKQIKDIELQDSGFPSQSLMKAERELEMLFEEEEKFWKIRSREDWLKWGDKSTKWFHNKASQRRKRNEIVGLVSRERIWEEDEEKIGYIAFSYFKDL
        ++  + +K +E  + G  S  +   + E+  L E+EE +WK R+R  WLK GD++TK+FH+KA+QR+K+N + GL+ +E  W +D  K+  IA  YF+D+
Subjt:  DQKLKQIKDIELQDSGFPSQSLMKAERELEMLFEEEEKFWKIRSREDWLKWGDKSTKWFHNKASQRRKRNEIVGLVSRERIWEEDEEKIGYIAFSYFKDL

Query:  FASSNPSNDRLEFTFKGIDKALSDRDNKNLTSAF
        F S+N  +  L+ + +GI+K ++D  N++LT  F
Subjt:  FASSNPSNDRLEFTFKGIDKALSDRDNKNLTSAF

A0A2N9IJF6 Uncharacterized protein3.4e-2631.58Show/hide
Query:  QKTKRFVVNPDFDQKFQSLKVLDLSFHTFDHMPIVAIIESNPQKRKRNPNRPLKFEEPWTTFPESKEIIKERWNRDQGRDVEAF--ARKISSCLEGLALW
        ++  R V    +   F  + V  L     DH+PI+ +  +N + + RN  R  +FEE W T P+ + +I+  W  + G     F    KI  C  GLA W
Subjt:  QKTKRFVVNPDFDQKFQSLKVLDLSFHTFDHMPIVAIIESNPQKRKRNPNRPLKFEEPWTTFPESKEIIKERWNRDQGRDVEAF--ARKISSCLEGLALW

Query:  NKERLGGSISKVVDQKLKQIKDIELQDSGFPSQSLMKAERELEMLFEEEEKFWKIRSREDWLKWGDKSTKWFHNKASQRRKRNEIVGLVSRERIWEEDEE
        +K+  GGS    +  + + ++ +   D G     +   + E+  L   +E  WK RSR  WLK GD +TK+FHN A+QR++ N+I GL++ +  W  +  
Subjt:  NKERLGGSISKVVDQKLKQIKDIELQDSGFPSQSLMKAERELEMLFEEEEKFWKIRSREDWLKWGDKSTKWFHNKASQRRKRNEIVGLVSRERIWEEDEE

Query:  KIGYIAFSYFKDLFASSNPSNDRLEFTFKGIDKALSDRDNKNLTSAF
        ++  I+  YFKD+F SS P   R+E   + +D+ ++   N+ LT  F
Subjt:  KIGYIAFSYFKDLFASSNPSNDRLEFTFKGIDKALSDRDNKNLTSAF

A0A803QGA2 Uncharacterized protein7.6e-2629.05Show/hide
Query:  VVNPDFDQKFQSLKVLDLSFHTFDHMPIVAIIESNPQKR-KRNPNRPLKFEEPWTTFPESKEIIKERWNRDQGRDVEAFARKISSCLEGLALWNKERLGG
        +VN  ++  F    +  L ++  DH  ++  ++ +   +   N  +  +F   W    E K +IK  W  ++   + +  + I  C   L  W++ + G 
Subjt:  VVNPDFDQKFQSLKVLDLSFHTFDHMPIVAIIESNPQKR-KRNPNRPLKFEEPWTTFPESKEIIKERWNRDQGRDVEAFARKISSCLEGLALWNKERLGG

Query:  SISKVVDQKLKQIKDIELQDSGFPSQSLMKAERELEMLFEEEEKFWKIRSREDWLKWGDKSTKWFHNKASQRRKRNEIVGLVSRERIWEEDEEKIGYIAF
           ++ D + K ++   LQ  G  ++   + E +L  L  +EE FW+ R+R  WL+ GD++TK+FH KA+ RRK N I GL + + IW    E IG I  
Subjt:  SISKVVDQKLKQIKDIELQDSGFPSQSLMKAERELEMLFEEEEKFWKIRSREDWLKWGDKSTKWFHNKASQRRKRNEIVGLVSRERIWEEDEEKIGYIAF

Query:  SYFKDLFASSNPSNDRLEFTFKGIDKALSDRDNKNLTSAFT
        SY+  LF S  P+N+ +E   + +++ ++D  N+ LT  FT
Subjt:  SYFKDLFASSNPSNDRLEFTFKGIDKALSDRDNKNLTSAFT

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G43760.1 DNAse I-like superfamily protein2.5e-0524.43Show/hide
Query:  QKTKRFVVNPDFDQKFQS-LKVLDLSFHTFDHMPIVAIIESNPQKRKRNPNRPLKFEEPWTTFPESKEIIKERWNRDQGRDVEAFARKISSCLEGLALWN
        +K  R + N D+   F S + V +LS    DH P + I+E+ P KR +   R   F     TF  S  +  E      G  + +    + +  +   L N
Subjt:  QKTKRFVVNPDFDQKFQS-LKVLDLSFHTFDHMPIVAIIESNPQKRKRNPNRPLKFEEPWTTFPESKEIIKERWNRDQGRDVEAFARKISSCLEGLALWN

Query:  KERLGGSISKVVDQKLKQIKDIELQDSGFPSQSLMKAE----RELEMLFEEEEKFWKIRSREDWLKWGDKSTKWFHNKASQRRKRNEIVGLVSRERIWEE
        ++   G+I     + L  ++ I+ Q    PS SL + E    ++        E F++ +SR  WL+ GD +T++FH      + +N I  L   + +  E
Subjt:  KERLGGSISKVVDQKLKQIKDIELQDSGFPSQSLMKAE----RELEMLFEEEEKFWKIRSREDWLKWGDKSTKWFHNKASQRRKRNEIVGLVSRERIWEE

Query:  DEEKIGYIAFSYFKDLFASSN
        +  ++  +  +Y+  L  S +
Subjt:  DEEKIGYIAFSYFKDLFASSN


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCAGGCAGGAATGCAAAGAAGAACGGGCGAACACCAATGAGAAGAGAGATTATGGAGTCGACCTTAGAGAGACACAAGGAAGCAAAGGATTTTACAGAGTTAGGAAA
AACGACCTATTGGAAAGAGGTCAAAATTAGTGGGGAAGAAGCAGGGGAAGAGGCGAGAGAGGAAGAGGAACAAGAACAGAATGATTTAGACCAGGAGTGGAAAAAAGGGA
AGAAAACAGAAAGAAGGTTTCGATTAAAGGGAAAGGGGAGGCTGCCACCGGAGAGACAATCGTCGCTCAAGTCGTCGAAAGTCAAATGTCGGGAAGCAGCGAGGACAAGG
AGAGAGACGTTGAGATCATGCACGAACTCCAAGCCCAAAAGCCCCCAATGGATACTTTGGTGTCATAAGGGAAAAGAAAAGCGATGGAAACAGTCAACATATCGAAAGGG
CAAGGACCAGACAAGACAAATAATAGAACTAAAGAGAAATCAAAAGACAAAAAGGTTCGTGGTCAACCCGGATTTTGATCAAAAGTTCCAAAGCCTAAAAGTCCTCGATC
TTAGTTTTCATACCTTTGACCACATGCCTATTGTGGCCATCATCGAGAGCAACCCCCAGAAGAGGAAGAGAAACCCGAATAGGCCTCTCAAATTTGAAGAACCGTGGACT
ACTTTCCCTGAAAGCAAGGAGATCATAAAAGAAAGGTGGAACAGAGATCAAGGAAGAGATGTGGAAGCTTTTGCCAGAAAAATTAGTTCTTGTTTAGAAGGTCTTGCCCT
CTGGAACAAAGAAAGGCTGGGAGGTTCTATTAGTAAGGTTGTGGACCAAAAGCTTAAGCAAATCAAAGATATTGAGCTCCAAGATTCTGGGTTCCCATCTCAATCTTTGA
TGAAAGCGGAACGAGAGCTCGAGATGCTCTTTGAAGAAGAAGAAAAGTTTTGGAAGATTAGATCCAGAGAAGACTGGCTTAAGTGGGGGGATAAGAGCACCAAATGGTTC
CACAATAAAGCTAGTCAAAGGAGAAAAAGGAATGAGATAGTGGGCTTGGTCAGCAGAGAAAGAATTTGGGAAGAGGACGAGGAAAAAATAGGCTATATAGCCTTCAGCTA
CTTCAAAGATCTATTTGCATCTTCAAATCCCTCTAATGACAGGCTTGAATTCACCTTCAAAGGCATTGACAAAGCTTTATCGGATAGGGATAACAAGAACCTTACAAGCG
CCTTTACTAAAGCATAA
mRNA sequenceShow/hide mRNA sequence
ATGTCAGGCAGGAATGCAAAGAAGAACGGGCGAACACCAATGAGAAGAGAGATTATGGAGTCGACCTTAGAGAGACACAAGGAAGCAAAGGATTTTACAGAGTTAGGAAA
AACGACCTATTGGAAAGAGGTCAAAATTAGTGGGGAAGAAGCAGGGGAAGAGGCGAGAGAGGAAGAGGAACAAGAACAGAATGATTTAGACCAGGAGTGGAAAAAAGGGA
AGAAAACAGAAAGAAGGTTTCGATTAAAGGGAAAGGGGAGGCTGCCACCGGAGAGACAATCGTCGCTCAAGTCGTCGAAAGTCAAATGTCGGGAAGCAGCGAGGACAAGG
AGAGAGACGTTGAGATCATGCACGAACTCCAAGCCCAAAAGCCCCCAATGGATACTTTGGTGTCATAAGGGAAAAGAAAAGCGATGGAAACAGTCAACATATCGAAAGGG
CAAGGACCAGACAAGACAAATAATAGAACTAAAGAGAAATCAAAAGACAAAAAGGTTCGTGGTCAACCCGGATTTTGATCAAAAGTTCCAAAGCCTAAAAGTCCTCGATC
TTAGTTTTCATACCTTTGACCACATGCCTATTGTGGCCATCATCGAGAGCAACCCCCAGAAGAGGAAGAGAAACCCGAATAGGCCTCTCAAATTTGAAGAACCGTGGACT
ACTTTCCCTGAAAGCAAGGAGATCATAAAAGAAAGGTGGAACAGAGATCAAGGAAGAGATGTGGAAGCTTTTGCCAGAAAAATTAGTTCTTGTTTAGAAGGTCTTGCCCT
CTGGAACAAAGAAAGGCTGGGAGGTTCTATTAGTAAGGTTGTGGACCAAAAGCTTAAGCAAATCAAAGATATTGAGCTCCAAGATTCTGGGTTCCCATCTCAATCTTTGA
TGAAAGCGGAACGAGAGCTCGAGATGCTCTTTGAAGAAGAAGAAAAGTTTTGGAAGATTAGATCCAGAGAAGACTGGCTTAAGTGGGGGGATAAGAGCACCAAATGGTTC
CACAATAAAGCTAGTCAAAGGAGAAAAAGGAATGAGATAGTGGGCTTGGTCAGCAGAGAAAGAATTTGGGAAGAGGACGAGGAAAAAATAGGCTATATAGCCTTCAGCTA
CTTCAAAGATCTATTTGCATCTTCAAATCCCTCTAATGACAGGCTTGAATTCACCTTCAAAGGCATTGACAAAGCTTTATCGGATAGGGATAACAAGAACCTTACAAGCG
CCTTTACTAAAGCATAA
Protein sequenceShow/hide protein sequence
MSGRNAKKNGRTPMRREIMESTLERHKEAKDFTELGKTTYWKEVKISGEEAGEEAREEEEQEQNDLDQEWKKGKKTERRFRLKGKGRLPPERQSSLKSSKVKCREAARTR
RETLRSCTNSKPKSPQWILWCHKGKEKRWKQSTYRKGKDQTRQIIELKRNQKTKRFVVNPDFDQKFQSLKVLDLSFHTFDHMPIVAIIESNPQKRKRNPNRPLKFEEPWT
TFPESKEIIKERWNRDQGRDVEAFARKISSCLEGLALWNKERLGGSISKVVDQKLKQIKDIELQDSGFPSQSLMKAERELEMLFEEEEKFWKIRSREDWLKWGDKSTKWF
HNKASQRRKRNEIVGLVSRERIWEEDEEKIGYIAFSYFKDLFASSNPSNDRLEFTFKGIDKALSDRDNKNLTSAFTKA