; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg008833 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg008833
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionRT_RNaseH_2 domain-containing protein
Genome locationscaffold10:31675579..31678915
RNA-Seq ExpressionSpg008833
SyntenySpg008833
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAE8707640.1 hypothetical protein F3Y22_tig00110378pilonHSYRG00039 [Hibiscus syriacus]2.0e-2029.08Show/hide
Query:  FVQNPESVNAQLVREFYANIDKEEGFLAIVRGIEVDWSPSAINALYNLQNF--PHAAYNEMA---------------------------------VAPSN
        F ++P SVNA LV+EFYANI K       VRG ++ ++ +AIN  ++LQ     HA   E A                                 + P  
Subjt:  FVQNPESVNAQLVREFYANIDKEEGFLAIVRGIEVDWSPSAINALYNLQNF--PHAAYNEMA---------------------------------VAPSN

Query:  EQLSDAVRERMLPTTHDSTVSRERVLLVFAILRSLSIDVGKIIADEIFGCWKKKVGKLFFPNTITMLCKRAGVPENEGDVILFDKGIINTPNLARLQRTQ
        +  +  ++ +++PT+H++ VS  R+LL+ +++ S  IDVG+II  ++  C  KK   L FPN IT LC +  V EN  D IL     I    L  L   +
Subjt:  EQLSDAVRERMLPTTHDSTVSRERVLLVFAILRSLSIDVGKIIADEIFGCWKKKVGKLFFPNTITMLCKRAGVPENEGDVILFDKGIINTPNLARLQRTQ

Query:  EARQGGLVY----GINSILEQLALSASWQEFAERQA---------LTFWNYVKNRDASLKKALQENFSKPFPALPAFPEDLL
          +    V+    G      +  L A  ++  + QA           F+ YVK+RDA ++   QE         P FP+++L
Subjt:  EARQGGLVY----GINSILEQLALSASWQEFAERQA---------LTFWNYVKNRDASLKKALQENFSKPFPALPAFPEDLL

KAE8718449.1 hypothetical protein F3Y22_tig00110013pilonHSYRG00240 [Hibiscus syriacus]6.9e-2130.5Show/hide
Query:  FVQNPESVNAQLVREFYANIDKEEGFLAIVRGIEVDWSPSAINALYNLQNF--PHAAYNEMA---------------------------------VAPSN
        F  +P SVNA LV+EFYANI K       VRG ++ ++  AIN  ++LQ     HA + E A                                 + P  
Subjt:  FVQNPESVNAQLVREFYANIDKEEGFLAIVRGIEVDWSPSAINALYNLQNF--PHAAYNEMA---------------------------------VAPSN

Query:  EQLSDAVRERMLPTTHDSTVSRERVLLVFAILRSLSIDVGKIIADEIFGCWKKKVGKLFFPNTITMLCKRAGVPENEGDVILFDKGIINTPNLARLQRTQ
        +  +  ++ +++PT+H++TVS  R+LL+ +++ S  IDVG+II  ++  C  KK   L FPN IT LC++  V EN  D IL     I    L  L   +
Subjt:  EQLSDAVRERMLPTTHDSTVSRERVLLVFAILRSLSIDVGKIIADEIFGCWKKKVGKLFFPNTITMLCKRAGVPENEGDVILFDKGIINTPNLARLQRTQ

Query:  EARQGGLVY-------GINSILEQLAL-SASWQEFAERQAL-----TFWNYVKNRDASLKKALQENFSKPFPALPAFPEDLL
          +    V+         N+ +  LAL  A  Q  A+  AL      F+ YVK+RD  ++   QE         P FP+++L
Subjt:  EARQGGLVY-------GINSILEQLAL-SASWQEFAERQAL-----TFWNYVKNRDASLKKALQENFSKPFPALPAFPEDLL

PON46472.1 hypothetical protein PanWU01x14_251180, partial [Parasponia andersonii]8.2e-3032.26Show/hide
Query:  QLPY-DRFITVGN--GFVQNPESVNAQLVREFYANIDKEEGFLAIVRGIEVDWSPSAINALYNL-----------QNF----------------------
        QLP+  + IT  N   F  +PE     LVREFYAN+   E     VRG++V WS  AINA++ L           QN                       
Subjt:  QLPY-DRFITVGN--GFVQNPESVNAQLVREFYANIDKEEGFLAIVRGIEVDWSPSAINALYNL-----------QNF----------------------

Query:  PHAAYN--EMAVAPSNEQLSDAVRERMLPTTHDSTVSRERVLLVFAILRSLSIDVGKIIADEIFGCWKKKVGKLFFPNTITMLCKRAGVPENEGDVILFD
           AY     A+ P+ +     ++ R+LPTTH  TVS++R+LL+ ++L   SI+VG++I  EI  C  +K G LFFP+ IT LC+ A  P    +  L +
Subjt:  PHAAYN--EMAVAPSNEQLSDAVRERMLPTTHDSTVSRERVLLVFAILRSLSIDVGKIIADEIFGCWKKKVGKLFFPNTITMLCKRAGVPENEGDVILFD

Query:  KGIINTPNLARLQR---TQEARQ---------------GGLVYGINSILEQLALSASWQ-------EFAERQALTFWNYVKNRDASLKKALQENFSKPFP
         G I+   +AR+ +   T+  +Q               G ++  + ++ ++L+     Q       +   +Q   FW Y K RD +LKKALQ NF++P P
Subjt:  KGIINTPNLARLQR---TQEARQ---------------GGLVYGINSILEQLALSASWQ-------EFAERQALTFWNYVKNRDASLKKALQENFSKPFP

Query:  ALPAFPEDLL
          PAFP+++L
Subjt:  ALPAFPEDLL

PON59596.1 hypothetical protein PanWU01x14_158080 [Parasponia andersonii]2.8e-2235.21Show/hide
Query:  NLQNFPHAAYNEMAVAPSNEQLSDAVRERMLPTTHDSTVSRERVLLVFAILRSLSIDVGKIIADEIFGCWKKKVGKLFFPNTITMLCKRAGVPENEGDVI
        N+QN P  A  + A+ P+ +     ++ R+LPTTH  TVS++R+LL++++L   SI+VG++I  EI  C  +K G LFFP+ IT LC+ A  P    +  
Subjt:  NLQNFPHAAYNEMAVAPSNEQLSDAVRERMLPTTHDSTVSRERVLLVFAILRSLSIDVGKIIADEIFGCWKKKVGKLFFPNTITMLCKRAGVPENEGDVI

Query:  LFDKGIINTPNLARL------QRTQE-----------ARQGGLVYGINSILEQLALSASWQEF--------AERQALTFWNYVKNRDASLKKALQENFSK
        L   G I+   +AR+      + TQ+           +R  G +      LEQ       Q++          +Q   FW Y K RD +LKKALQ NF++
Subjt:  LFDKGIINTPNLARL------QRTQE-----------ARQGGLVYGINSILEQLALSASWQEF--------AERQALTFWNYVKNRDASLKKALQENFSK

Query:  PFPALPAFPEDLL
        P P  P FP++LL
Subjt:  PFPALPAFPEDLL

PON78020.1 hypothetical protein PanWU01x14_023740 [Parasponia andersonii]2.6e-2834.19Show/hide
Query:  LVREFYANIDKEEGFLAIVRGIEVDWSPSAINALYNLQN--FPHAAYNE---------------------------------MAVAPSNEQLSDAVRERM
        LVREFYAN+   E     VRG++V WS  AINA++ L +    H+ + E                                  A+ P+ +     ++ R+
Subjt:  LVREFYANIDKEEGFLAIVRGIEVDWSPSAINALYNLQN--FPHAAYNE---------------------------------MAVAPSNEQLSDAVRERM

Query:  LPTTHDSTVSRERVLLVFAILRSLSIDVGKIIADEIFGCWKKKVGKLFFPNTITMLCKRAGVPENEGDVILFDKGIINTPNLARL------QRTQEARQG
        LPTTH   VS++R+LL+ ++L   SI+VG++I  EI  C  +K G LFFP+ IT LC+ A    NE    L + G I+   +AR+      + TQ+    
Subjt:  LPTTHDSTVSRERVLLVFAILRSLSIDVGKIIADEIFGCWKKKVGKLFFPNTITMLCKRAGVPENEGDVILFDKGIINTPNLARL------QRTQEARQG

Query:  GLVYGINS-----ILEQLAL---SASWQEFAERQALTFWNYVKNRDASLKKALQENFSKPFPALPAFPEDLL
              +S     +L+QL       S QE   +Q   FW Y K RD +LKKALQ NF++P P  PAFP+++L
Subjt:  GLVYGINS-----ILEQLAL---SASWQEFAERQALTFWNYVKNRDASLKKALQENFSKPFPALPAFPEDLL

TrEMBL top hitse value%identityAlignment
A0A2P5BCG4 Uncharacterized protein (Fragment)3.9e-3032.26Show/hide
Query:  QLPY-DRFITVGN--GFVQNPESVNAQLVREFYANIDKEEGFLAIVRGIEVDWSPSAINALYNL-----------QNF----------------------
        QLP+  + IT  N   F  +PE     LVREFYAN+   E     VRG++V WS  AINA++ L           QN                       
Subjt:  QLPY-DRFITVGN--GFVQNPESVNAQLVREFYANIDKEEGFLAIVRGIEVDWSPSAINALYNL-----------QNF----------------------

Query:  PHAAYN--EMAVAPSNEQLSDAVRERMLPTTHDSTVSRERVLLVFAILRSLSIDVGKIIADEIFGCWKKKVGKLFFPNTITMLCKRAGVPENEGDVILFD
           AY     A+ P+ +     ++ R+LPTTH  TVS++R+LL+ ++L   SI+VG++I  EI  C  +K G LFFP+ IT LC+ A  P    +  L +
Subjt:  PHAAYN--EMAVAPSNEQLSDAVRERMLPTTHDSTVSRERVLLVFAILRSLSIDVGKIIADEIFGCWKKKVGKLFFPNTITMLCKRAGVPENEGDVILFD

Query:  KGIINTPNLARLQR---TQEARQ---------------GGLVYGINSILEQLALSASWQ-------EFAERQALTFWNYVKNRDASLKKALQENFSKPFP
         G I+   +AR+ +   T+  +Q               G ++  + ++ ++L+     Q       +   +Q   FW Y K RD +LKKALQ NF++P P
Subjt:  KGIINTPNLARLQR---TQEARQ---------------GGLVYGINSILEQLALSASWQ-------EFAERQALTFWNYVKNRDASLKKALQENFSKPFP

Query:  ALPAFPEDLL
          PAFP+++L
Subjt:  ALPAFPEDLL

A0A2P5CEY2 Uncharacterized protein1.4e-2235.21Show/hide
Query:  NLQNFPHAAYNEMAVAPSNEQLSDAVRERMLPTTHDSTVSRERVLLVFAILRSLSIDVGKIIADEIFGCWKKKVGKLFFPNTITMLCKRAGVPENEGDVI
        N+QN P  A  + A+ P+ +     ++ R+LPTTH  TVS++R+LL++++L   SI+VG++I  EI  C  +K G LFFP+ IT LC+ A  P    +  
Subjt:  NLQNFPHAAYNEMAVAPSNEQLSDAVRERMLPTTHDSTVSRERVLLVFAILRSLSIDVGKIIADEIFGCWKKKVGKLFFPNTITMLCKRAGVPENEGDVI

Query:  LFDKGIINTPNLARL------QRTQE-----------ARQGGLVYGINSILEQLALSASWQEF--------AERQALTFWNYVKNRDASLKKALQENFSK
        L   G I+   +AR+      + TQ+           +R  G +      LEQ       Q++          +Q   FW Y K RD +LKKALQ NF++
Subjt:  LFDKGIINTPNLARL------QRTQE-----------ARQGGLVYGINSILEQLALSASWQEF--------AERQALTFWNYVKNRDASLKKALQENFSK

Query:  PFPALPAFPEDLL
        P P  P FP++LL
Subjt:  PFPALPAFPEDLL

A0A2P5DXM3 Uncharacterized protein1.3e-2834.19Show/hide
Query:  LVREFYANIDKEEGFLAIVRGIEVDWSPSAINALYNLQN--FPHAAYNE---------------------------------MAVAPSNEQLSDAVRERM
        LVREFYAN+   E     VRG++V WS  AINA++ L +    H+ + E                                  A+ P+ +     ++ R+
Subjt:  LVREFYANIDKEEGFLAIVRGIEVDWSPSAINALYNLQN--FPHAAYNE---------------------------------MAVAPSNEQLSDAVRERM

Query:  LPTTHDSTVSRERVLLVFAILRSLSIDVGKIIADEIFGCWKKKVGKLFFPNTITMLCKRAGVPENEGDVILFDKGIINTPNLARL------QRTQEARQG
        LPTTH   VS++R+LL+ ++L   SI+VG++I  EI  C  +K G LFFP+ IT LC+ A    NE    L + G I+   +AR+      + TQ+    
Subjt:  LPTTHDSTVSRERVLLVFAILRSLSIDVGKIIADEIFGCWKKKVGKLFFPNTITMLCKRAGVPENEGDVILFDKGIINTPNLARL------QRTQEARQG

Query:  GLVYGINS-----ILEQLAL---SASWQEFAERQALTFWNYVKNRDASLKKALQENFSKPFPALPAFPEDLL
              +S     +L+QL       S QE   +Q   FW Y K RD +LKKALQ NF++P P  PAFP+++L
Subjt:  GLVYGINS-----ILEQLAL---SASWQEFAERQALTFWNYVKNRDASLKKALQENFSKPFPALPAFPEDLL

A0A6A3ASF6 Uncharacterized protein9.8e-2129.08Show/hide
Query:  FVQNPESVNAQLVREFYANIDKEEGFLAIVRGIEVDWSPSAINALYNLQNF--PHAAYNEMA---------------------------------VAPSN
        F ++P SVNA LV+EFYANI K       VRG ++ ++ +AIN  ++LQ     HA   E A                                 + P  
Subjt:  FVQNPESVNAQLVREFYANIDKEEGFLAIVRGIEVDWSPSAINALYNLQNF--PHAAYNEMA---------------------------------VAPSN

Query:  EQLSDAVRERMLPTTHDSTVSRERVLLVFAILRSLSIDVGKIIADEIFGCWKKKVGKLFFPNTITMLCKRAGVPENEGDVILFDKGIINTPNLARLQRTQ
        +  +  ++ +++PT+H++ VS  R+LL+ +++ S  IDVG+II  ++  C  KK   L FPN IT LC +  V EN  D IL     I    L  L   +
Subjt:  EQLSDAVRERMLPTTHDSTVSRERVLLVFAILRSLSIDVGKIIADEIFGCWKKKVGKLFFPNTITMLCKRAGVPENEGDVILFDKGIINTPNLARLQRTQ

Query:  EARQGGLVY----GINSILEQLALSASWQEFAERQA---------LTFWNYVKNRDASLKKALQENFSKPFPALPAFPEDLL
          +    V+    G      +  L A  ++  + QA           F+ YVK+RDA ++   QE         P FP+++L
Subjt:  EARQGGLVY----GINSILEQLALSASWQEFAERQA---------LTFWNYVKNRDASLKKALQENFSKPFPALPAFPEDLL

A0A6A3BU96 Uncharacterized protein3.4e-2130.5Show/hide
Query:  FVQNPESVNAQLVREFYANIDKEEGFLAIVRGIEVDWSPSAINALYNLQNF--PHAAYNEMA---------------------------------VAPSN
        F  +P SVNA LV+EFYANI K       VRG ++ ++  AIN  ++LQ     HA + E A                                 + P  
Subjt:  FVQNPESVNAQLVREFYANIDKEEGFLAIVRGIEVDWSPSAINALYNLQNF--PHAAYNEMA---------------------------------VAPSN

Query:  EQLSDAVRERMLPTTHDSTVSRERVLLVFAILRSLSIDVGKIIADEIFGCWKKKVGKLFFPNTITMLCKRAGVPENEGDVILFDKGIINTPNLARLQRTQ
        +  +  ++ +++PT+H++TVS  R+LL+ +++ S  IDVG+II  ++  C  KK   L FPN IT LC++  V EN  D IL     I    L  L   +
Subjt:  EQLSDAVRERMLPTTHDSTVSRERVLLVFAILRSLSIDVGKIIADEIFGCWKKKVGKLFFPNTITMLCKRAGVPENEGDVILFDKGIINTPNLARLQRTQ

Query:  EARQGGLVY-------GINSILEQLAL-SASWQEFAERQAL-----TFWNYVKNRDASLKKALQENFSKPFPALPAFPEDLL
          +    V+         N+ +  LAL  A  Q  A+  AL      F+ YVK+RD  ++   QE         P FP+++L
Subjt:  EARQGGLVY-------GINSILEQLAL-SASWQEFAERQAL-----TFWNYVKNRDASLKKALQENFSKPFPALPAFPEDLL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTGAAGCAAATCTTCCGAATAGGAAGGGATTTTGTAAATTAATTTTTACCGTTTGGATTTCCTGTGGGCAGACCCTGGTAAGTCTTTTCCACTTCAACTTCTTGTT
TCAATCTTGCTATTTGCATTCCTCTGTTTTTGTTGCTATTATCTGCAAACACCCCTTTGAGTTTCTAATGGCAAAAACAAGAGCGAGGAAAGAAAGAGAAAGTGAGGAGG
AGGAGGTACCGGTTACTCCGGAAGTGCAAAAAGGGAAAACCAAGAAGAAGAGAACGCCAGAAGAGAAAGAAGCTAAACGAAGAAGAAGGCAGCAGAGGGCTGCAGAGCAA
GAAGCTATCCAAGAAGAAGCAGTGAATGACCCAGTTACGGAAGAAGTTCAAGACGAACAGGCCGCGGTTGTGCCTGAAGAAGAGAATGAACAAGAACCAGAGGCTCGTGT
TGAGGTCATCATGCCGGAGCCACCGAAACGTCGCCGCATTAAGCGGAAGGTGGGGCGCGTCAAGGTGATTCGAAATACCCCATCGCCTCCGTCGTCGGATTCTGAGGAAG
AGAAAATGGAACAAGAAAAGGAGCCTGAAGACAAGGCAAGAGAAGAAGCGGAGAAAAAGGTTGAAGAAGAGCGGTTGCTCAAGCGAATGGTGGAAAAGGGCAAAGGTGTT
GCTGCGACGACGGAGAAACCTGACGAAATAGAAGAGTCACAATTACCGTATGATCGCTTTATCACGGTTGGGAACGGTTTTGTTCAAAACCCTGAATCTGTGAACGCGCA
GTTGGTGCGTGAATTCTATGCAAATATCGACAAAGAAGAAGGTTTCCTAGCGATTGTTCGAGGTATTGAGGTCGACTGGAGTCCTAGTGCTATCAACGCACTGTATAACC
TTCAGAATTTCCCCCATGCGGCATATAATGAGATGGCTGTAGCGCCATCTAATGAGCAGTTAAGTGATGCTGTGCGGGAGAGGATGCTTCCAACGACTCATGATTCGACG
GTTTCTAGGGAACGGGTGCTTCTGGTTTTCGCTATTTTGAGGTCTCTCAGTATTGATGTGGGAAAAATTATCGCTGACGAAATATTCGGTTGTTGGAAGAAGAAAGTGGG
GAAGCTGTTTTTCCCGAATACCATTACCATGCTCTGCAAGCGAGCAGGGGTTCCAGAGAATGAAGGAGATGTGATATTATTTGACAAGGGAATCATCAACACGCCTAACT
TGGCACGGCTTCAGCGTACGCAAGAAGCACGTCAGGGTGGGCTGGTCTACGGCATCAACTCGATTCTAGAACAACTGGCACTTTCGGCCAGTTGGCAGGAGTTTGCCGAG
AGGCAAGCTTTAACCTTCTGGAACTATGTTAAGAATCGTGATGCCAGCTTAAAGAAGGCGCTGCAAGAGAATTTTTCCAAGCCATTTCCGGCCCTTCCAGCATTCCCTGA
GGATCTCTTGGACTGGTTAAGCTTAATTCGATCAAGTCTAGTTGGTGATGAGCTTGAGGCAAGGGTATACTGCACCATAAAGTGGGTCATCCCATGCTTAAGAGCTTATG
ACTATCTCAGTATTAAGTTGAAAAGTGGTGATTATTTGTCCATGCCGGAGGAATCATTTTGCTGCAACAGAGCTCGGTTTTGCAGAGTGCTCTGA
mRNA sequenceShow/hide mRNA sequence
ATGGCTGAAGCAAATCTTCCGAATAGGAAGGGATTTTGTAAATTAATTTTTACCGTTTGGATTTCCTGTGGGCAGACCCTGGTAAGTCTTTTCCACTTCAACTTCTTGTT
TCAATCTTGCTATTTGCATTCCTCTGTTTTTGTTGCTATTATCTGCAAACACCCCTTTGAGTTTCTAATGGCAAAAACAAGAGCGAGGAAAGAAAGAGAAAGTGAGGAGG
AGGAGGTACCGGTTACTCCGGAAGTGCAAAAAGGGAAAACCAAGAAGAAGAGAACGCCAGAAGAGAAAGAAGCTAAACGAAGAAGAAGGCAGCAGAGGGCTGCAGAGCAA
GAAGCTATCCAAGAAGAAGCAGTGAATGACCCAGTTACGGAAGAAGTTCAAGACGAACAGGCCGCGGTTGTGCCTGAAGAAGAGAATGAACAAGAACCAGAGGCTCGTGT
TGAGGTCATCATGCCGGAGCCACCGAAACGTCGCCGCATTAAGCGGAAGGTGGGGCGCGTCAAGGTGATTCGAAATACCCCATCGCCTCCGTCGTCGGATTCTGAGGAAG
AGAAAATGGAACAAGAAAAGGAGCCTGAAGACAAGGCAAGAGAAGAAGCGGAGAAAAAGGTTGAAGAAGAGCGGTTGCTCAAGCGAATGGTGGAAAAGGGCAAAGGTGTT
GCTGCGACGACGGAGAAACCTGACGAAATAGAAGAGTCACAATTACCGTATGATCGCTTTATCACGGTTGGGAACGGTTTTGTTCAAAACCCTGAATCTGTGAACGCGCA
GTTGGTGCGTGAATTCTATGCAAATATCGACAAAGAAGAAGGTTTCCTAGCGATTGTTCGAGGTATTGAGGTCGACTGGAGTCCTAGTGCTATCAACGCACTGTATAACC
TTCAGAATTTCCCCCATGCGGCATATAATGAGATGGCTGTAGCGCCATCTAATGAGCAGTTAAGTGATGCTGTGCGGGAGAGGATGCTTCCAACGACTCATGATTCGACG
GTTTCTAGGGAACGGGTGCTTCTGGTTTTCGCTATTTTGAGGTCTCTCAGTATTGATGTGGGAAAAATTATCGCTGACGAAATATTCGGTTGTTGGAAGAAGAAAGTGGG
GAAGCTGTTTTTCCCGAATACCATTACCATGCTCTGCAAGCGAGCAGGGGTTCCAGAGAATGAAGGAGATGTGATATTATTTGACAAGGGAATCATCAACACGCCTAACT
TGGCACGGCTTCAGCGTACGCAAGAAGCACGTCAGGGTGGGCTGGTCTACGGCATCAACTCGATTCTAGAACAACTGGCACTTTCGGCCAGTTGGCAGGAGTTTGCCGAG
AGGCAAGCTTTAACCTTCTGGAACTATGTTAAGAATCGTGATGCCAGCTTAAAGAAGGCGCTGCAAGAGAATTTTTCCAAGCCATTTCCGGCCCTTCCAGCATTCCCTGA
GGATCTCTTGGACTGGTTAAGCTTAATTCGATCAAGTCTAGTTGGTGATGAGCTTGAGGCAAGGGTATACTGCACCATAAAGTGGGTCATCCCATGCTTAAGAGCTTATG
ACTATCTCAGTATTAAGTTGAAAAGTGGTGATTATTTGTCCATGCCGGAGGAATCATTTTGCTGCAACAGAGCTCGGTTTTGCAGAGTGCTCTGA
Protein sequenceShow/hide protein sequence
MAEANLPNRKGFCKLIFTVWISCGQTLVSLFHFNFLFQSCYLHSSVFVAIICKHPFEFLMAKTRARKERESEEEEVPVTPEVQKGKTKKKRTPEEKEAKRRRRQQRAAEQ
EAIQEEAVNDPVTEEVQDEQAAVVPEEENEQEPEARVEVIMPEPPKRRRIKRKVGRVKVIRNTPSPPSSDSEEEKMEQEKEPEDKAREEAEKKVEEERLLKRMVEKGKGV
AATTEKPDEIEESQLPYDRFITVGNGFVQNPESVNAQLVREFYANIDKEEGFLAIVRGIEVDWSPSAINALYNLQNFPHAAYNEMAVAPSNEQLSDAVRERMLPTTHDST
VSRERVLLVFAILRSLSIDVGKIIADEIFGCWKKKVGKLFFPNTITMLCKRAGVPENEGDVILFDKGIINTPNLARLQRTQEARQGGLVYGINSILEQLALSASWQEFAE
RQALTFWNYVKNRDASLKKALQENFSKPFPALPAFPEDLLDWLSLIRSSLVGDELEARVYCTIKWVIPCLRAYDYLSIKLKSGDYLSMPEESFCCNRARFCRVL