; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0002821 (gene) of Chayote v1 genome

Gene IDSed0002821
OrganismSechium edule (Chayote v1)
DescriptionRpr2 domain-containing protein
Genome locationLG07:11301620..11304244
RNA-Seq ExpressionSed0002821
SyntenySed0002821
Gene Ontology termsGO:0006396 - RNA processing (biological process)
InterPro domainsIPR007175 - Ribonuclease P subunit, Rpr2/Snm1/Rpp21


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0044832.1 Rpr2 domain-containing protein [Cucumis melo var. makuwa]1.9e-5544.83Show/hide
Query:  MAKKK---KKGSN-------------KKITGKIQPKVSNNVKHYLNHLENLPTWATAQPPIPSLAALFGRRLAASAESSLAAPEPSLFLCHRCETVLQPG
        MA+KK   K+GS+             ++ TGKI+PKVSNN K YLNHLENL TWA+ QP +PSLAA FG+RLAA+AES   AP+PSLFLC RCETVLQPG
Subjt:  MAKKK---KKGSN-------------KKITGKIQPKVSNNVKHYLNHLENLPTWATAQPPIPSLAALFGRRLAASAESSLAAPEPSLFLCHRCETVLQPG

Query:  SNCSIRIEKNNAKRRRRRNKCSNLTQNNVVY--------------------------------------------------------------------L
        SNC IRIEKNNAK+R R  K SN+TQN V Y                                                                     
Subjt:  SNCSIRIEKNNAKRRRRRNKCSNLTQNNVVY--------------------------------------------------------------------L

Query:  KPSLEKRKS--------------------------HEKPFGPETSYASTDVKEKVGDIIPIVDAPATPLTMLGMTLLNSSKRKRKKPSLKNRMEPESSSA
         PSL   +                              P  P TS  ++  + +V D IP +DAPATPLT+  MTLL+S +RKRKKPS KNR EPES SA
Subjt:  KPSLEKRKS--------------------------HEKPFGPETSYASTDVKEKVGDIIPIVDAPATPLTMLGMTLLNSSKRKRKKPSLKNRMEPESSSA

Query:  PTVDGDKTEGTSKTKRKKNSWTSLKEIVQTNEHGGKQNGAGLTIPFSL
        PT  G+K+E TSK KR + SWTSLKEI Q  E  GKQN AGL IPFSL
Subjt:  PTVDGDKTEGTSKTKRKKNSWTSLKEIVQTNEHGGKQNGAGLTIPFSL

XP_004146565.1 uncharacterized protein LOC101220608 [Cucumis sativus]8.6e-5646.3Show/hide
Query:  KKITGKIQPKVSNNVKHYLNHLENLPTWATAQPPIPSLAALFGRRLAASAESSLAAPEPSLFLCHRCETVLQPGSNCSIRIEKNNAKRRRRRNKCSNLTQ
        ++ TGKI+PKVSNN K YLNHLENL TWA+ QP +PSLAA FG+RLAA+AES   +P+PSLFLC RCET+LQPGSNC+IRIEKN AK+RRR  K SNLTQ
Subjt:  KKITGKIQPKVSNNVKHYLNHLENLPTWATAQPPIPSLAALFGRRLAASAESSLAAPEPSLFLCHRCETVLQPGSNCSIRIEKNNAKRRRRRNKCSNLTQ

Query:  NNVVYLKPSLEKR-------KSHEK-----------------------------------------------PFGPETSYASTDVK------EKVGDI--
        N V Y       R       K H K                                               P  P  S    D+          GDI  
Subjt:  NNVVYLKPSLEKR-------KSHEK-----------------------------------------------PFGPETSYASTDVK------EKVGDI--

Query:  -------------------------------IPIVDAPATPLTMLGMTLLNSSKRKRKKPSLKNRMEPESSSAPTVDGDKTEGTSKTKRKKNSWTSLKEI
                                       IP +DAPATPLT+ GMTLL+S +RKRKKPS KN+ EPES S PT  G+ +EGTSK KR + SWTSLKEI
Subjt:  -------------------------------IPIVDAPATPLTMLGMTLLNSSKRKRKKPSLKNRMEPESSSAPTVDGDKTEGTSKTKRKKNSWTSLKEI

Query:  VQTNEHGGKQNGAGLTIPFSLQGT
         Q  E  GKQN AGL IPFSL  T
Subjt:  VQTNEHGGKQNGAGLTIPFSLQGT

XP_008452026.1 PREDICTED: uncharacterized protein LOC103493157 [Cucumis melo]2.9e-5645.11Show/hide
Query:  MAKKK---KKGSN-------------KKITGKIQPKVSNNVKHYLNHLENLPTWATAQPPIPSLAALFGRRLAASAESSLAAPEPSLFLCHRCETVLQPG
        MA+KK   K+GS+             ++ TGKI+PKVSNN K YLNHLENL TWA+ QP +PSLAA FG+RLAA+AES   AP+PSLFLC RCETVLQPG
Subjt:  MAKKK---KKGSN-------------KKITGKIQPKVSNNVKHYLNHLENLPTWATAQPPIPSLAALFGRRLAASAESSLAAPEPSLFLCHRCETVLQPG

Query:  SNCSIRIEKNNAKRRRRRNKCSNLTQNNVVY--------------------------------------------------------------------L
        SNC IRIEKNNAK+RRR  K SN+TQN V Y                                                                     
Subjt:  SNCSIRIEKNNAKRRRRRNKCSNLTQNNVVY--------------------------------------------------------------------L

Query:  KPSLEKRKS--------------------------HEKPFGPETSYASTDVKEKVGDIIPIVDAPATPLTMLGMTLLNSSKRKRKKPSLKNRMEPESSSA
         PSL   +                              P  P TS  ++  + +V D IP +DAPATPLT+  MTLL+S +RKRKKPS KNR EPES SA
Subjt:  KPSLEKRKS--------------------------HEKPFGPETSYASTDVKEKVGDIIPIVDAPATPLTMLGMTLLNSSKRKRKKPSLKNRMEPESSSA

Query:  PTVDGDKTEGTSKTKRKKNSWTSLKEIVQTNEHGGKQNGAGLTIPFSL
        PT  G+K+E TSK KR + SWTSLKEI Q  E  GKQN AGL IPFSL
Subjt:  PTVDGDKTEGTSKTKRKKNSWTSLKEIVQTNEHGGKQNGAGLTIPFSL

XP_022136585.1 uncharacterized protein LOC111008256 isoform X1 [Momordica charantia]5.5e-6345.96Show/hide
Query:  TGKIQPKVSNNVKHYLNHLENLPTWATAQPPIPSLAALFGRRLAASAESSLAAPEPSLFLCHRCETVLQPGSNCSIRIEKNNAKRRRRRNKCSNLTQNNV
        TGKIQPK  NNVK YL+HLENL TWA+ Q  IPSLAA FGRR AA+A+SS   P+ SLFLC RCET+LQPGSNCSIRIEKN AKRRR+ NKCSNLTQNN+
Subjt:  TGKIQPKVSNNVKHYLNHLENLPTWATAQPPIPSLAALFGRRLAASAESSLAAPEPSLFLCHRCETVLQPGSNCSIRIEKNNAKRRRRRNKCSNLTQNNV

Query:  VY--------------------------------------------------------------------------------------------------
        VY                                                                                                  
Subjt:  VY--------------------------------------------------------------------------------------------------

Query:  -----LKPSLE-----------------------KRKSH----EKPFGPETSYASTDVKEKVGDIIPIVDAPATPLTMLGMTLLNSSKRKRKKPSLKNRM
             L P +E                       KRK      +   GPE S A TD ++K GD IP VDAPATP  M+GMTLL S KRKRKKPS KN+ 
Subjt:  -----LKPSLE-----------------------KRKSH----EKPFGPETSYASTDVKEKVGDIIPIVDAPATPLTMLGMTLLNSSKRKRKKPSLKNRM

Query:  EPESSSAPTVDGDKTEGTSKTKRKKNSWTSLKEIVQTNEHGGKQNGAGLTIPFSLQGTF
        EPESS APT +GDKTEGTSK KRK+ SWTSLKEI QTNE  GKQN     IPFSLQGTF
Subjt:  EPESSSAPTVDGDKTEGTSKTKRKKNSWTSLKEIVQTNEHGGKQNGAGLTIPFSLQGTF

XP_038906436.1 uncharacterized protein LOC120092350 isoform X1 [Benincasa hispida]1.7e-5645.88Show/hide
Query:  MAKKK---KKGSN-------------KKITGKIQPKVSNNVKHYLNHLENLPTWATAQPPIPSLAALFGRRLAASAESSLAAPEPSLFLCHRCETVLQPG
        MAKKK   KKGS+             ++ITGKI+PKVSNNVK YLNHLENL TWA  QP IPSLA  FG+RLAA+AES   AP+ SLFLC RCET+LQPG
Subjt:  MAKKK---KKGSN-------------KKITGKIQPKVSNNVKHYLNHLENLPTWATAQPPIPSLAALFGRRLAASAESSLAAPEPSLFLCHRCETVLQPG

Query:  SNCSIRIEKNNAKRRRRRNKCSNLTQNNVVYLKPSLEKR-------KSHEK-------------------------------------------------
        SNCSIRIEKNNAKRRR+ NKCSNLTQN V Y       R       K H K                                                 
Subjt:  SNCSIRIEKNNAKRRRRRNKCSNLTQNNVVYLKPSLEKR-------KSHEK-------------------------------------------------

Query:  ------------------------------------PFGPETSYAST------------------DVKEKVGDIIPIVDAPATPLTMLGMTLLNSSKRKR
                                            P  P T    T                   V+E+VGD IP VDAPATP TM G+TLL+S +RKR
Subjt:  ------------------------------------PFGPETSYAST------------------DVKEKVGDIIPIVDAPATPLTMLGMTLLNSSKRKR

Query:  KKPSLKNRMEPESSSAPTVDGDKTEGTSKTKRKKNSWTSLKEIVQTNEHGGKQNGAGLTIPFSL
        KKPS KN+ EPE SSAPT  GDKT G SK KR + SWTSLKEI Q +E  GKQN A L IPFSL
Subjt:  KKPSLKNRMEPESSSAPTVDGDKTEGTSKTKRKKNSWTSLKEIVQTNEHGGKQNGAGLTIPFSL

TrEMBL top hitse value%identityAlignment
A0A0A0KUP1 Uncharacterized protein4.2e-5646.3Show/hide
Query:  KKITGKIQPKVSNNVKHYLNHLENLPTWATAQPPIPSLAALFGRRLAASAESSLAAPEPSLFLCHRCETVLQPGSNCSIRIEKNNAKRRRRRNKCSNLTQ
        ++ TGKI+PKVSNN K YLNHLENL TWA+ QP +PSLAA FG+RLAA+AES   +P+PSLFLC RCET+LQPGSNC+IRIEKN AK+RRR  K SNLTQ
Subjt:  KKITGKIQPKVSNNVKHYLNHLENLPTWATAQPPIPSLAALFGRRLAASAESSLAAPEPSLFLCHRCETVLQPGSNCSIRIEKNNAKRRRRRNKCSNLTQ

Query:  NNVVYLKPSLEKR-------KSHEK-----------------------------------------------PFGPETSYASTDVK------EKVGDI--
        N V Y       R       K H K                                               P  P  S    D+          GDI  
Subjt:  NNVVYLKPSLEKR-------KSHEK-----------------------------------------------PFGPETSYASTDVK------EKVGDI--

Query:  -------------------------------IPIVDAPATPLTMLGMTLLNSSKRKRKKPSLKNRMEPESSSAPTVDGDKTEGTSKTKRKKNSWTSLKEI
                                       IP +DAPATPLT+ GMTLL+S +RKRKKPS KN+ EPES S PT  G+ +EGTSK KR + SWTSLKEI
Subjt:  -------------------------------IPIVDAPATPLTMLGMTLLNSSKRKRKKPSLKNRMEPESSSAPTVDGDKTEGTSKTKRKKNSWTSLKEI

Query:  VQTNEHGGKQNGAGLTIPFSLQGT
         Q  E  GKQN AGL IPFSL  T
Subjt:  VQTNEHGGKQNGAGLTIPFSLQGT

A0A1S3BU13 uncharacterized protein LOC1034931571.4e-5645.11Show/hide
Query:  MAKKK---KKGSN-------------KKITGKIQPKVSNNVKHYLNHLENLPTWATAQPPIPSLAALFGRRLAASAESSLAAPEPSLFLCHRCETVLQPG
        MA+KK   K+GS+             ++ TGKI+PKVSNN K YLNHLENL TWA+ QP +PSLAA FG+RLAA+AES   AP+PSLFLC RCETVLQPG
Subjt:  MAKKK---KKGSN-------------KKITGKIQPKVSNNVKHYLNHLENLPTWATAQPPIPSLAALFGRRLAASAESSLAAPEPSLFLCHRCETVLQPG

Query:  SNCSIRIEKNNAKRRRRRNKCSNLTQNNVVY--------------------------------------------------------------------L
        SNC IRIEKNNAK+RRR  K SN+TQN V Y                                                                     
Subjt:  SNCSIRIEKNNAKRRRRRNKCSNLTQNNVVY--------------------------------------------------------------------L

Query:  KPSLEKRKS--------------------------HEKPFGPETSYASTDVKEKVGDIIPIVDAPATPLTMLGMTLLNSSKRKRKKPSLKNRMEPESSSA
         PSL   +                              P  P TS  ++  + +V D IP +DAPATPLT+  MTLL+S +RKRKKPS KNR EPES SA
Subjt:  KPSLEKRKS--------------------------HEKPFGPETSYASTDVKEKVGDIIPIVDAPATPLTMLGMTLLNSSKRKRKKPSLKNRMEPESSSA

Query:  PTVDGDKTEGTSKTKRKKNSWTSLKEIVQTNEHGGKQNGAGLTIPFSL
        PT  G+K+E TSK KR + SWTSLKEI Q  E  GKQN AGL IPFSL
Subjt:  PTVDGDKTEGTSKTKRKKNSWTSLKEIVQTNEHGGKQNGAGLTIPFSL

A0A5A7TNC0 Rpr2 domain-containing protein9.3e-5644.83Show/hide
Query:  MAKKK---KKGSN-------------KKITGKIQPKVSNNVKHYLNHLENLPTWATAQPPIPSLAALFGRRLAASAESSLAAPEPSLFLCHRCETVLQPG
        MA+KK   K+GS+             ++ TGKI+PKVSNN K YLNHLENL TWA+ QP +PSLAA FG+RLAA+AES   AP+PSLFLC RCETVLQPG
Subjt:  MAKKK---KKGSN-------------KKITGKIQPKVSNNVKHYLNHLENLPTWATAQPPIPSLAALFGRRLAASAESSLAAPEPSLFLCHRCETVLQPG

Query:  SNCSIRIEKNNAKRRRRRNKCSNLTQNNVVY--------------------------------------------------------------------L
        SNC IRIEKNNAK+R R  K SN+TQN V Y                                                                     
Subjt:  SNCSIRIEKNNAKRRRRRNKCSNLTQNNVVY--------------------------------------------------------------------L

Query:  KPSLEKRKS--------------------------HEKPFGPETSYASTDVKEKVGDIIPIVDAPATPLTMLGMTLLNSSKRKRKKPSLKNRMEPESSSA
         PSL   +                              P  P TS  ++  + +V D IP +DAPATPLT+  MTLL+S +RKRKKPS KNR EPES SA
Subjt:  KPSLEKRKS--------------------------HEKPFGPETSYASTDVKEKVGDIIPIVDAPATPLTMLGMTLLNSSKRKRKKPSLKNRMEPESSSA

Query:  PTVDGDKTEGTSKTKRKKNSWTSLKEIVQTNEHGGKQNGAGLTIPFSL
        PT  G+K+E TSK KR + SWTSLKEI Q  E  GKQN AGL IPFSL
Subjt:  PTVDGDKTEGTSKTKRKKNSWTSLKEIVQTNEHGGKQNGAGLTIPFSL

A0A5D3CYJ3 Rpr2 domain-containing protein1.4e-5645.11Show/hide
Query:  MAKKK---KKGSN-------------KKITGKIQPKVSNNVKHYLNHLENLPTWATAQPPIPSLAALFGRRLAASAESSLAAPEPSLFLCHRCETVLQPG
        MA+KK   K+GS+             ++ TGKI+PKVSNN K YLNHLENL TWA+ QP +PSLAA FG+RLAA+AES   AP+PSLFLC RCETVLQPG
Subjt:  MAKKK---KKGSN-------------KKITGKIQPKVSNNVKHYLNHLENLPTWATAQPPIPSLAALFGRRLAASAESSLAAPEPSLFLCHRCETVLQPG

Query:  SNCSIRIEKNNAKRRRRRNKCSNLTQNNVVY--------------------------------------------------------------------L
        SNC IRIEKNNAK+RRR  K SN+TQN V Y                                                                     
Subjt:  SNCSIRIEKNNAKRRRRRNKCSNLTQNNVVY--------------------------------------------------------------------L

Query:  KPSLEKRKS--------------------------HEKPFGPETSYASTDVKEKVGDIIPIVDAPATPLTMLGMTLLNSSKRKRKKPSLKNRMEPESSSA
         PSL   +                              P  P TS  ++  + +V D IP +DAPATPLT+  MTLL+S +RKRKKPS KNR EPES SA
Subjt:  KPSLEKRKS--------------------------HEKPFGPETSYASTDVKEKVGDIIPIVDAPATPLTMLGMTLLNSSKRKRKKPSLKNRMEPESSSA

Query:  PTVDGDKTEGTSKTKRKKNSWTSLKEIVQTNEHGGKQNGAGLTIPFSL
        PT  G+K+E TSK KR + SWTSLKEI Q  E  GKQN AGL IPFSL
Subjt:  PTVDGDKTEGTSKTKRKKNSWTSLKEIVQTNEHGGKQNGAGLTIPFSL

A0A6J1C4C1 uncharacterized protein LOC111008256 isoform X12.7e-6345.96Show/hide
Query:  TGKIQPKVSNNVKHYLNHLENLPTWATAQPPIPSLAALFGRRLAASAESSLAAPEPSLFLCHRCETVLQPGSNCSIRIEKNNAKRRRRRNKCSNLTQNNV
        TGKIQPK  NNVK YL+HLENL TWA+ Q  IPSLAA FGRR AA+A+SS   P+ SLFLC RCET+LQPGSNCSIRIEKN AKRRR+ NKCSNLTQNN+
Subjt:  TGKIQPKVSNNVKHYLNHLENLPTWATAQPPIPSLAALFGRRLAASAESSLAAPEPSLFLCHRCETVLQPGSNCSIRIEKNNAKRRRRRNKCSNLTQNNV

Query:  VY--------------------------------------------------------------------------------------------------
        VY                                                                                                  
Subjt:  VY--------------------------------------------------------------------------------------------------

Query:  -----LKPSLE-----------------------KRKSH----EKPFGPETSYASTDVKEKVGDIIPIVDAPATPLTMLGMTLLNSSKRKRKKPSLKNRM
             L P +E                       KRK      +   GPE S A TD ++K GD IP VDAPATP  M+GMTLL S KRKRKKPS KN+ 
Subjt:  -----LKPSLE-----------------------KRKSH----EKPFGPETSYASTDVKEKVGDIIPIVDAPATPLTMLGMTLLNSSKRKRKKPSLKNRM

Query:  EPESSSAPTVDGDKTEGTSKTKRKKNSWTSLKEIVQTNEHGGKQNGAGLTIPFSLQGTF
        EPESS APT +GDKTEGTSK KRK+ SWTSLKEI QTNE  GKQN     IPFSLQGTF
Subjt:  EPESSSAPTVDGDKTEGTSKTKRKKNSWTSLKEIVQTNEHGGKQNGAGLTIPFSLQGTF

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT5G41270.1 CONTAINS InterPro DOMAIN/s: RNAse P, Rpr2/Rpp21 subunit (InterPro:IPR007175); Has 35333 Blast hits to 34131 proteins in 2444 species: Archae - 798; Bacteria - 22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses - 0; Other Eukaryotes - 9610 (source: NCBI BLink).4.1e-2434.44Show/hide
Query:  KGSNKKIT---GKIQPKVSNNVKHYLNHLENLPTW-ATAQPPIPSLAALFGRRLAASAESSLAAPEPSLFLCHRCETVLQPGSNCSIRIEKNNAKRRRRR
        K   KK+T   G   P + + ++H   HL+NL  W +T   PIPSLA+L GRRLAA  ES+    +P L  C RCET+L+PG NC++RIEK +A  +++R
Subjt:  KGSNKKIT---GKIQPKVSNNVKHYLNHLENLPTW-ATAQPPIPSLAALFGRRLAASAESSLAAPEPSLFLCHRCETVLQPGSNCSIRIEKNNAKRRRRR

Query:  NKCSN-----LTQNNVVY---------LKPSLEKRKSHE-KPFGPETSYAS-------------------TDVKEKVGDIIPIVDAPATPLTMLGMTLLN
        N+C         QNNVVY         LK    K +  E  PF P+T+ +S                   +  +  V D +       TP  M+   L  
Subjt:  NKCSN-----LTQNNVVY---------LKPSLEKRKSHE-KPFGPETSYAS-------------------TDVKEKVGDIIPIVDAPATPLTMLGMTLLN

Query:  SSKRKRKKPSLKNRMEPESSSAPTVDGDKTEGTSKTKRKKNSWTSLKEIVQTNEHGGKQNGAGLTIPFSL
           R+ +KP  K   EP+S        +KT G S  +++K+ WTS+KEI +TN+           IPF L
Subjt:  SSKRKRKKPSLKNRMEPESSSAPTVDGDKTEGTSKTKRKKNSWTSLKEIVQTNEHGGKQNGAGLTIPFSL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGAAGAAGAAGAAGAAGGGAAGCAACAAGAAAATCACTGGAAAAATCCAACCCAAAGTCTCTAACAATGTCAAACATTATTTGAACCACTTGGAAAACTTACCAAC
TTGGGCCACTGCCCAACCCCCTATTCCTTCCTTGGCTGCTCTCTTTGGCCGGCGCCTTGCTGCTTCTGCGGAATCTTCGCTGGCGGCCCCGGAGCCTTCTCTATTTCTCT
GCCACAGGTGTGAAACAGTTCTTCAACCTGGCTCTAACTGCTCTATACGAATAGAGAAGAATAATGCCAAGAGACGTCGTAGGCGTAACAAATGTAGTAATTTGACACAG
AACAATGTGGTGTATCTCAAACCGAGTTTGGAAAAGAGGAAATCACATGAGAAACCATTTGGACCCGAAACTAGTTATGCTTCGACAGACGTTAAGGAGAAAGTTGGAGA
CATTATTCCTATTGTTGATGCTCCTGCAACTCCTCTAACTATGCTTGGAATGACTCTATTGAATTCGAGCAAGAGAAAGAGGAAGAAGCCTTCATTGAAAAATAGAATGG
AACCTGAAAGTAGCTCTGCTCCTACAGTAGATGGGGATAAAACGGAAGGCACATCCAAAACTAAGCGGAAGAAAAATTCATGGACAAGTTTGAAGGAAATCGTTCAAACA
AATGAACATGGTGGTAAACAGAATGGGGCTGGATTGACAATTCCATTTTCCTTACAAGGCACTTTCTAA
mRNA sequenceShow/hide mRNA sequence
ATGGCGAAGAAGAAGAAGAAGGGAAGCAACAAGAAAATCACTGGAAAAATCCAACCCAAAGTCTCTAACAATGTCAAACATTATTTGAACCACTTGGAAAACTTACCAAC
TTGGGCCACTGCCCAACCCCCTATTCCTTCCTTGGCTGCTCTCTTTGGCCGGCGCCTTGCTGCTTCTGCGGAATCTTCGCTGGCGGCCCCGGAGCCTTCTCTATTTCTCT
GCCACAGGTGTGAAACAGTTCTTCAACCTGGCTCTAACTGCTCTATACGAATAGAGAAGAATAATGCCAAGAGACGTCGTAGGCGTAACAAATGTAGTAATTTGACACAG
AACAATGTGGTGTATCTCAAACCGAGTTTGGAAAAGAGGAAATCACATGAGAAACCATTTGGACCCGAAACTAGTTATGCTTCGACAGACGTTAAGGAGAAAGTTGGAGA
CATTATTCCTATTGTTGATGCTCCTGCAACTCCTCTAACTATGCTTGGAATGACTCTATTGAATTCGAGCAAGAGAAAGAGGAAGAAGCCTTCATTGAAAAATAGAATGG
AACCTGAAAGTAGCTCTGCTCCTACAGTAGATGGGGATAAAACGGAAGGCACATCCAAAACTAAGCGGAAGAAAAATTCATGGACAAGTTTGAAGGAAATCGTTCAAACA
AATGAACATGGTGGTAAACAGAATGGGGCTGGATTGACAATTCCATTTTCCTTACAAGGCACTTTCTAA
Protein sequenceShow/hide protein sequence
MAKKKKKGSNKKITGKIQPKVSNNVKHYLNHLENLPTWATAQPPIPSLAALFGRRLAASAESSLAAPEPSLFLCHRCETVLQPGSNCSIRIEKNNAKRRRRRNKCSNLTQ
NNVVYLKPSLEKRKSHEKPFGPETSYASTDVKEKVGDIIPIVDAPATPLTMLGMTLLNSSKRKRKKPSLKNRMEPESSSAPTVDGDKTEGTSKTKRKKNSWTSLKEIVQT
NEHGGKQNGAGLTIPFSLQGTF