; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg004872 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg004872
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionRT_RNaseH_2 domain-containing protein
Genome locationscaffold9:20981773..20983489
RNA-Seq ExpressionSpg004872
SyntenySpg004872
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAE8718449.1 hypothetical protein F3Y22_tig00110013pilonHSYRG00240 [Hibiscus syriacus]2.1e-1928.41Show/hide
Query:  RFVNNFARAKYAELLKRDFLFERGF-------SGELPHFL--------------------RTVREFYANIDQEEGFLVVVRGIEVDWSPRAINALYNLQN
        +F N+ A+A++     R+  FE GF        G  P  +                      V+EFYANI +     + VRG ++ ++  AIN  ++LQ 
Subjt:  RFVNNFARAKYAELLKRDFLFERGF-------SGELPHFL--------------------RTVREFYANIDQEEGFLVVVRGIEVDWSPRAINALYNLQN

Query:  F--PHAAYNEMAVAPSNE-LLSDAVRE--------------------VEANTWMGFIRQRMLPTTHDSTVSRERVLLAFAILRSLSIDVGKIIADEICGC
            HA + E A +   + +L D   E                      A  W  F++ +++PT+H++TVS  R+LL  +++ S  IDVG+II  ++  C
Subjt:  F--PHAAYNEMAVAPSNE-LLSDAVRE--------------------VEANTWMGFIRQRMLPTTHDSTVSRERVLLAFAILRSLSIDVGKIIADEICGC

Query:  WKKKVGKLFFPNTITMLCRGAGVPEDEGDVILFDKGIIDTPNLARLQRMQEVRQGGLVH-------DINTILEQLAL-SASRQEFAERQAL-----TFWS
          KK   L FPN IT LCR   V E+  D IL     I    L  L  ++  +    VH       + N  +  LAL  A  Q  A+  AL      F+ 
Subjt:  WKKKVGKLFFPNTITMLCRGAGVPEDEGDVILFDKGIIDTPNLARLQRMQEVRQGGLVH-------DINTILEQLAL-SASRQEFAERQAL-----TFWS

Query:  YVKNRDANLKKALQENFSKPYPALPAFPEDL---FNPWIPPPPMEGGEEEDGNEPGQED
        YVK+RD  ++   QE         P FP+++   FN    P P     E D  +P   D
Subjt:  YVKNRDANLKKALQENFSKPYPALPAFPEDL---FNPWIPPPPMEGGEEEDGNEPGQED

PON46472.1 hypothetical protein PanWU01x14_251180, partial [Parasponia andersonii]9.3e-3132.98Show/hide
Query:  VREFYANIDQEEGFLVVVRGIEVDWSPRAINALYNLQNFPHAAYNEMAVAPSNELLSDAVREV------------------------EANTWMGFIRQRM
        VREFYAN+   E   V VRG++V WS  AINA++ L + P   ++E     + + L   +  V                         A  W  F++ R+
Subjt:  VREFYANIDQEEGFLVVVRGIEVDWSPRAINALYNLQNFPHAAYNEMAVAPSNELLSDAVREV------------------------EANTWMGFIRQRM

Query:  LPTTHDSTVSRERVLLAFAILRSLSIDVGKIIADEICGCWKKKVGKLFFPNTITMLCRGAGVPEDEGDVILFDKGIIDTPNLARL---------QRMQEV
        LPTTH  TVS++R+LL  ++L   SI+VG++I  EI  C  +K G LFFP+ IT LCR A  P    +  L + G ID   +AR+         Q+    
Subjt:  LPTTHDSTVSRERVLLAFAILRSLSIDVGKIIADEICGCWKKKVGKLFFPNTITMLCRGAGVPEDEGDVILFDKGIIDTPNLARL---------QRMQEV

Query:  R---------QGGLVHDINTILEQLALSASRQ-------EFAERQALTFWSYVKNRDANLKKALQENFSKPYPALPAFPEDL
        R          G ++  +  + ++L+    +Q       +   +Q   FW+Y K RD  LKKALQ NF++P P  PAFP+++
Subjt:  R---------QGGLVHDINTILEQLALSASRQ-------EFAERQALTFWSYVKNRDANLKKALQENFSKPYPALPAFPEDL

PON50458.1 hypothetical protein PanWU01x14_223230, partial [Parasponia andersonii]1.3e-1934.24Show/hide
Query:  VREFYANIDQEEGFLVVVRGIEVDWSPRAINALYNLQNF--PHAAYNEMAVAPSNELLSDAVR---------------------EVEANTWMGFIRQRML
        VREFY N+   +   V VRG++V  S  AIN +Y L +    H+ + E    P   ++ + V                         A  W  F++ R+L
Subjt:  VREFYANIDQEEGFLVVVRGIEVDWSPRAINALYNLQNF--PHAAYNEMAVAPSNELLSDAVR---------------------EVEANTWMGFIRQRML

Query:  PTTHDSTVSRERVLLAFAILRSLSIDVGKIIADEICGCWKKKVGKLFFPNTITMLCRGAGVPEDEGDVILFDKGIIDTPNLARL
        PTTH   VS+ERVLL +++L   SI++G++I  EIC C  +K G LFFP+ I  +CR A  P    +  L + G ID   +AR+
Subjt:  PTTHDSTVSRERVLLAFAILRSLSIDVGKIIADEICGCWKKKVGKLFFPNTITMLCRGAGVPEDEGDVILFDKGIIDTPNLARL

PON59596.1 hypothetical protein PanWU01x14_158080 [Parasponia andersonii]1.4e-2334.54Show/hide
Query:  ANTWMGFIRQRMLPTTHDSTVSRERVLLAFAILRSLSIDVGKIIADEICGCWKKKVGKLFFPNTITMLCRGAGVPEDEGDVILFDKGIIDTPNLARLQRM
        A  W  F++ R+LPTTH  TVS++R+LL +++L   SI+VG++I  EI  C  +K G LFFP+ IT LCR A  P    +  L   G ID   +AR+ + 
Subjt:  ANTWMGFIRQRMLPTTHDSTVSRERVLLAFAILRSLSIDVGKIIADEICGCWKKKVGKLFFPNTITMLCRGAGVPEDEGDVILFDKGIIDTPNLARLQRM

Query:  QEVR------------------QGGLVHDINTILEQLALSASRQ-------EFAERQALTFWSYVKNRDANLKKALQENFSKPYPALPAFPEDL
         +                     G ++  +  + ++L+    +Q       +   +Q   FW+Y K RD  LKKALQ NF++P P  P FP++L
Subjt:  QEVR------------------QGGLVHDINTILEQLALSASRQ-------EFAERQALTFWSYVKNRDANLKKALQENFSKPYPALPAFPEDL

PON78020.1 hypothetical protein PanWU01x14_023740 [Parasponia andersonii]9.9e-3334.39Show/hide
Query:  GELP------HFLRTVREFYANIDQEEGFLVVVRGIEVDWSPRAINALYNLQN--FPHAAYNEMAVAPSNELLSDAVREV--------------------
        G+LP       FL  VREFYAN+   E   + VRG++V WS  AINA++ L +    H+ + E    P    + + V                       
Subjt:  GELP------HFLRTVREFYANIDQEEGFLVVVRGIEVDWSPRAINALYNLQN--FPHAAYNEMAVAPSNELLSDAVREV--------------------

Query:  -EANTWMGFIRQRMLPTTHDSTVSRERVLLAFAILRSLSIDVGKIIADEICGCWKKKVGKLFFPNTITMLCRGAGVPEDEGDVILFDKGIIDTPNLARL-
          A  W  F++ R+LPTTH   VS++R+LL  ++L   SI+VG++I  EI  C  +K G LFFP+ IT LCR A  P    +  L + G ID   +AR+ 
Subjt:  -EANTWMGFIRQRMLPTTHDSTVSRERVLLAFAILRSLSIDVGKIIADEICGCWKKKVGKLFFPNTITMLCRGAGVPEDEGDVILFDKGIIDTPNLARL-

Query:  -------------QRMQEVRQGGLVHDINTILEQLALSASRQEFAERQALTFWSYVKNRDANLKKALQENFSKPYPALPAFPEDL
                      R           D+   L+ L    S+QE   +Q   FW+Y K RD  LKKALQ NF++P P  PAFP+++
Subjt:  -------------QRMQEVRQGGLVHDINTILEQLALSASRQEFAERQALTFWSYVKNRDANLKKALQENFSKPYPALPAFPEDL

TrEMBL top hitse value%identityAlignment
A0A1S2Z475 uncharacterized protein LOC101493401 isoform X31.1e-1624.54Show/hide
Query:  RFVNNFARAKYAELLK-RDFLFERGFSGE-------LP---------HFLRT------------VREFYANIDQEEGFLVVVRGIEVDWSPRAINALYNL
        +F+N   + K+  L+K R+F  E GFS         LP         H  +T            VREFY+ I + +   V+VRG+ V ++P+ +N  +NL
Subjt:  RFVNNFARAKYAELLK-RDFLFERGFSGE-------LP---------HFLRT------------VREFYANIDQEEGFLVVVRGIEVDWSPRAINALYNL

Query:  QNFPHAAYNEMAVAPSNELLSDAVREVEANT----------------------------------WMGFIRQRMLPTTHDSTVSRERVLLAFAILRSLSI
                N+  V    + L + V + + N+                                  W   I  ++LP ++  +V ++R+LL + ++   SI
Subjt:  QNFPHAAYNEMAVAPSNELLSDAVREVEANT----------------------------------WMGFIRQRMLPTTHDSTVSRERVLLAFAILRSLSI

Query:  DVGKIIADEICGC--WKKKVGKLFFPNTITMLCRGAGVPEDEGDVILFDK---GIIDTPNLARLQRMQEVRQGGL----VHDINTILEQLA-------LS
        +VGKII DEI  C   KKK  +L FP+ I+ LC   GV  ++ D ++ ++   G+ D       + M   R+G +     H + T  E+         + 
Subjt:  DVGKIIADEICGC--WKKKVGKLFFPNTITMLCRGAGVPEDEGDVILFDK---GIIDTPNLARLQRMQEVRQGGL----VHDINTILEQLA-------LS

Query:  ASRQEFA--------------ERQALTFWSYVKNRDANLKKALQENFSKPYPALPAFPEDLFNPWIPPPPMEGGEEEDGNEPG
          +++F                +Q   FW + K      +K  + NF K     P FP+++  P++  P  E G+ +D  EPG
Subjt:  ASRQEFA--------------ERQALTFWSYVKNRDANLKKALQENFSKPYPALPAFPEDLFNPWIPPPPMEGGEEEDGNEPG

A0A1S3EI57 uncharacterized protein LOC101493401 isoform X11.1e-1624.54Show/hide
Query:  RFVNNFARAKYAELLK-RDFLFERGFSGE-------LP---------HFLRT------------VREFYANIDQEEGFLVVVRGIEVDWSPRAINALYNL
        +F+N   + K+  L+K R+F  E GFS         LP         H  +T            VREFY+ I + +   V+VRG+ V ++P+ +N  +NL
Subjt:  RFVNNFARAKYAELLK-RDFLFERGFSGE-------LP---------HFLRT------------VREFYANIDQEEGFLVVVRGIEVDWSPRAINALYNL

Query:  QNFPHAAYNEMAVAPSNELLSDAVREVEANT----------------------------------WMGFIRQRMLPTTHDSTVSRERVLLAFAILRSLSI
                N+  V    + L + V + + N+                                  W   I  ++LP ++  +V ++R+LL + ++   SI
Subjt:  QNFPHAAYNEMAVAPSNELLSDAVREVEANT----------------------------------WMGFIRQRMLPTTHDSTVSRERVLLAFAILRSLSI

Query:  DVGKIIADEICGC--WKKKVGKLFFPNTITMLCRGAGVPEDEGDVILFDK---GIIDTPNLARLQRMQEVRQGGL----VHDINTILEQLA-------LS
        +VGKII DEI  C   KKK  +L FP+ I+ LC   GV  ++ D ++ ++   G+ D       + M   R+G +     H + T  E+         + 
Subjt:  DVGKIIADEICGC--WKKKVGKLFFPNTITMLCRGAGVPEDEGDVILFDK---GIIDTPNLARLQRMQEVRQGGL----VHDINTILEQLA-------LS

Query:  ASRQEFA--------------ERQALTFWSYVKNRDANLKKALQENFSKPYPALPAFPEDLFNPWIPPPPMEGGEEEDGNEPG
          +++F                +Q   FW + K      +K  + NF K     P FP+++  P++  P  E G+ +D  EPG
Subjt:  ASRQEFA--------------ERQALTFWSYVKNRDANLKKALQENFSKPYPALPAFPEDLFNPWIPPPPMEGGEEEDGNEPG

A0A2P5BCG4 Uncharacterized protein (Fragment)4.5e-3132.98Show/hide
Query:  VREFYANIDQEEGFLVVVRGIEVDWSPRAINALYNLQNFPHAAYNEMAVAPSNELLSDAVREV------------------------EANTWMGFIRQRM
        VREFYAN+   E   V VRG++V WS  AINA++ L + P   ++E     + + L   +  V                         A  W  F++ R+
Subjt:  VREFYANIDQEEGFLVVVRGIEVDWSPRAINALYNLQNFPHAAYNEMAVAPSNELLSDAVREV------------------------EANTWMGFIRQRM

Query:  LPTTHDSTVSRERVLLAFAILRSLSIDVGKIIADEICGCWKKKVGKLFFPNTITMLCRGAGVPEDEGDVILFDKGIIDTPNLARL---------QRMQEV
        LPTTH  TVS++R+LL  ++L   SI+VG++I  EI  C  +K G LFFP+ IT LCR A  P    +  L + G ID   +AR+         Q+    
Subjt:  LPTTHDSTVSRERVLLAFAILRSLSIDVGKIIADEICGCWKKKVGKLFFPNTITMLCRGAGVPEDEGDVILFDKGIIDTPNLARL---------QRMQEV

Query:  R---------QGGLVHDINTILEQLALSASRQ-------EFAERQALTFWSYVKNRDANLKKALQENFSKPYPALPAFPEDL
        R          G ++  +  + ++L+    +Q       +   +Q   FW+Y K RD  LKKALQ NF++P P  PAFP+++
Subjt:  R---------QGGLVHDINTILEQLALSASRQ-------EFAERQALTFWSYVKNRDANLKKALQENFSKPYPALPAFPEDL

A0A2P5CEY2 Uncharacterized protein6.9e-2434.54Show/hide
Query:  ANTWMGFIRQRMLPTTHDSTVSRERVLLAFAILRSLSIDVGKIIADEICGCWKKKVGKLFFPNTITMLCRGAGVPEDEGDVILFDKGIIDTPNLARLQRM
        A  W  F++ R+LPTTH  TVS++R+LL +++L   SI+VG++I  EI  C  +K G LFFP+ IT LCR A  P    +  L   G ID   +AR+ + 
Subjt:  ANTWMGFIRQRMLPTTHDSTVSRERVLLAFAILRSLSIDVGKIIADEICGCWKKKVGKLFFPNTITMLCRGAGVPEDEGDVILFDKGIIDTPNLARLQRM

Query:  QEVR------------------QGGLVHDINTILEQLALSASRQ-------EFAERQALTFWSYVKNRDANLKKALQENFSKPYPALPAFPEDL
         +                     G ++  +  + ++L+    +Q       +   +Q   FW+Y K RD  LKKALQ NF++P P  P FP++L
Subjt:  QEVR------------------QGGLVHDINTILEQLALSASRQ-------EFAERQALTFWSYVKNRDANLKKALQENFSKPYPALPAFPEDL

A0A2P5DXM3 Uncharacterized protein4.8e-3334.39Show/hide
Query:  GELP------HFLRTVREFYANIDQEEGFLVVVRGIEVDWSPRAINALYNLQN--FPHAAYNEMAVAPSNELLSDAVREV--------------------
        G+LP       FL  VREFYAN+   E   + VRG++V WS  AINA++ L +    H+ + E    P    + + V                       
Subjt:  GELP------HFLRTVREFYANIDQEEGFLVVVRGIEVDWSPRAINALYNLQN--FPHAAYNEMAVAPSNELLSDAVREV--------------------

Query:  -EANTWMGFIRQRMLPTTHDSTVSRERVLLAFAILRSLSIDVGKIIADEICGCWKKKVGKLFFPNTITMLCRGAGVPEDEGDVILFDKGIIDTPNLARL-
          A  W  F++ R+LPTTH   VS++R+LL  ++L   SI+VG++I  EI  C  +K G LFFP+ IT LCR A  P    +  L + G ID   +AR+ 
Subjt:  -EANTWMGFIRQRMLPTTHDSTVSRERVLLAFAILRSLSIDVGKIIADEICGCWKKKVGKLFFPNTITMLCRGAGVPEDEGDVILFDKGIIDTPNLARL-

Query:  -------------QRMQEVRQGGLVHDINTILEQLALSASRQEFAERQALTFWSYVKNRDANLKKALQENFSKPYPALPAFPEDL
                      R           D+   L+ L    S+QE   +Q   FW+Y K RD  LKKALQ NF++P P  PAFP+++
Subjt:  -------------QRMQEVRQGGLVHDINTILEQLALSASRQEFAERQALTFWSYVKNRDANLKKALQENFSKPYPALPAFPEDL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGAAAACAAGAGCGCGAAAAGAAAGGGAGAATGAGGAAGAAGAGGTCCCTGTTACCCCTGAAGTGCAGAAAGTTAAGACGAAGAAGAAAAGGTCCCCGGAGGAGAA
GGAGGCCAAGAGACGAAGACGACAACAGAGGGCTGAGGAACAAGAAAAGGCAACAGAGATTGTTGCTGCGGCAGTTGAAGAAGGAGACCCGCAAGAACCTGATGTACAGA
ACCCAGAGGAGGGTGAACAGAGAGTTGTGGATACGGAAGAAGAGGAGCGAACAGAAGAAGTTCAAGAGGAGCGGCCGGAAGTGCAAGAAGAAGTTCAAGAACAGCAGGTC
GAGGATGTTCAAATGCAACAGGAAGAAGAGGTTCAGGTACCGGATAATGAGCCAGTACAGGAGGCTCAAGTAGAAGTGATCATGCCGGAGGTGCCAAGGCGTCGCCGTGT
TAAGAGAAAAGCAGGTCGCGCTAGGGTTGTCCGGAATGATACTCCATCGCCTCTGACCACGGATTCTGAAAGAGAGAATGCAGAGAGAGTAGAGCGTGAGAAGAAGGAAG
CCGAGGAAAGAGCAAGAGAAGAGGCAGAGGAAAAGGCTGAGGAAGAGCGGTTGCTCAAGCGAAGGGCGGAAAAGGGCAAAAATGTTGCTGAAGCATCAGAAGAGCACGAT
GAAATAGAAGAGCAACAATTACTAGATGATCGCTTCGTCAACAATTTTGCCAGAGCAAAATACGCTGAGCTTCTGAAAAGGGACTTCCTGTTTGAGAGGGGATTTAGTGG
TGAGCTTCCGCATTTTCTGAGGACTGTGCGCGAATTCTATGCAAATATCGACCAAGAAGAAGGTTTCCTAGTAGTTGTTCGAGGTATTGAGGTCGACTGGAGTCCTAGGG
CTATCAACGCATTGTATAACCTTCAGAACTTCCCCCATGCGGCATATAATGAGATGGCTGTCGCGCCATCTAATGAGCTGTTAAGTGATGCTGTGCGGGAGGTAGAAGCG
AATACGTGGATGGGATTTATCAGACAAAGGATGCTTCCAACGACTCATGACTCGACAGTTTCTAGGGAACGAGTGCTTTTGGCTTTCGCTATTTTGAGGTCTCTCAGTAT
TGATGTGGGAAAAATTATTGCTGATGAAATATGTGGTTGTTGGAAGAAGAAAGTGGGGAAACTATTCTTTCCAAACACAATCACAATGCTTTGTAGAGGAGCAGGGGTTC
CGGAAGATGAAGGGGATGTGATTCTGTTTGACAAGGGAATCATTGACACGCCTAACTTGGCACGGCTCCAGCGTATGCAGGAGGTACGTCAGGGTGGGCTGGTCCACGAC
ATCAACACGATTTTAGAACAACTCGCACTTTCGGCCAGCAGGCAGGAGTTTGCTGAACGGCAAGCTTTGACTTTCTGGAGCTATGTTAAGAATCGTGATGCCAATCTGAA
GAAGGCGCTGCAGGAGAATTTTTCGAAACCATATCCAGCCCTTCCAGCATTCCCTGAAGATTTATTCAACCCATGGATTCCACCCCCACCGATGGAAGGAGGAGAAGAGG
AAGATGGAAATGAACCGGGCCAAGAGGACTAA
mRNA sequenceShow/hide mRNA sequence
ATGGCGAAAACAAGAGCGCGAAAAGAAAGGGAGAATGAGGAAGAAGAGGTCCCTGTTACCCCTGAAGTGCAGAAAGTTAAGACGAAGAAGAAAAGGTCCCCGGAGGAGAA
GGAGGCCAAGAGACGAAGACGACAACAGAGGGCTGAGGAACAAGAAAAGGCAACAGAGATTGTTGCTGCGGCAGTTGAAGAAGGAGACCCGCAAGAACCTGATGTACAGA
ACCCAGAGGAGGGTGAACAGAGAGTTGTGGATACGGAAGAAGAGGAGCGAACAGAAGAAGTTCAAGAGGAGCGGCCGGAAGTGCAAGAAGAAGTTCAAGAACAGCAGGTC
GAGGATGTTCAAATGCAACAGGAAGAAGAGGTTCAGGTACCGGATAATGAGCCAGTACAGGAGGCTCAAGTAGAAGTGATCATGCCGGAGGTGCCAAGGCGTCGCCGTGT
TAAGAGAAAAGCAGGTCGCGCTAGGGTTGTCCGGAATGATACTCCATCGCCTCTGACCACGGATTCTGAAAGAGAGAATGCAGAGAGAGTAGAGCGTGAGAAGAAGGAAG
CCGAGGAAAGAGCAAGAGAAGAGGCAGAGGAAAAGGCTGAGGAAGAGCGGTTGCTCAAGCGAAGGGCGGAAAAGGGCAAAAATGTTGCTGAAGCATCAGAAGAGCACGAT
GAAATAGAAGAGCAACAATTACTAGATGATCGCTTCGTCAACAATTTTGCCAGAGCAAAATACGCTGAGCTTCTGAAAAGGGACTTCCTGTTTGAGAGGGGATTTAGTGG
TGAGCTTCCGCATTTTCTGAGGACTGTGCGCGAATTCTATGCAAATATCGACCAAGAAGAAGGTTTCCTAGTAGTTGTTCGAGGTATTGAGGTCGACTGGAGTCCTAGGG
CTATCAACGCATTGTATAACCTTCAGAACTTCCCCCATGCGGCATATAATGAGATGGCTGTCGCGCCATCTAATGAGCTGTTAAGTGATGCTGTGCGGGAGGTAGAAGCG
AATACGTGGATGGGATTTATCAGACAAAGGATGCTTCCAACGACTCATGACTCGACAGTTTCTAGGGAACGAGTGCTTTTGGCTTTCGCTATTTTGAGGTCTCTCAGTAT
TGATGTGGGAAAAATTATTGCTGATGAAATATGTGGTTGTTGGAAGAAGAAAGTGGGGAAACTATTCTTTCCAAACACAATCACAATGCTTTGTAGAGGAGCAGGGGTTC
CGGAAGATGAAGGGGATGTGATTCTGTTTGACAAGGGAATCATTGACACGCCTAACTTGGCACGGCTCCAGCGTATGCAGGAGGTACGTCAGGGTGGGCTGGTCCACGAC
ATCAACACGATTTTAGAACAACTCGCACTTTCGGCCAGCAGGCAGGAGTTTGCTGAACGGCAAGCTTTGACTTTCTGGAGCTATGTTAAGAATCGTGATGCCAATCTGAA
GAAGGCGCTGCAGGAGAATTTTTCGAAACCATATCCAGCCCTTCCAGCATTCCCTGAAGATTTATTCAACCCATGGATTCCACCCCCACCGATGGAAGGAGGAGAAGAGG
AAGATGGAAATGAACCGGGCCAAGAGGACTAA
Protein sequenceShow/hide protein sequence
MAKTRARKERENEEEEVPVTPEVQKVKTKKKRSPEEKEAKRRRRQQRAEEQEKATEIVAAAVEEGDPQEPDVQNPEEGEQRVVDTEEEERTEEVQEERPEVQEEVQEQQV
EDVQMQQEEEVQVPDNEPVQEAQVEVIMPEVPRRRRVKRKAGRARVVRNDTPSPLTTDSERENAERVEREKKEAEERAREEAEEKAEEERLLKRRAEKGKNVAEASEEHD
EIEEQQLLDDRFVNNFARAKYAELLKRDFLFERGFSGELPHFLRTVREFYANIDQEEGFLVVVRGIEVDWSPRAINALYNLQNFPHAAYNEMAVAPSNELLSDAVREVEA
NTWMGFIRQRMLPTTHDSTVSRERVLLAFAILRSLSIDVGKIIADEICGCWKKKVGKLFFPNTITMLCRGAGVPEDEGDVILFDKGIIDTPNLARLQRMQEVRQGGLVHD
INTILEQLALSASRQEFAERQALTFWSYVKNRDANLKKALQENFSKPYPALPAFPEDLFNPWIPPPPMEGGEEEDGNEPGQED