; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0001802 (gene) of Chayote v1 genome

Gene IDSed0001802
OrganismSechium edule (Chayote v1)
DescriptionPentatricopeptide repeat (PPR) superfamily protein isoform 2
Genome locationLG13:5703172..5707431
RNA-Seq ExpressionSed0001802
SyntenySed0001802
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6600514.1 Nucleoside diphosphate kinase IV, chloroplastic/mitochondrial, partial [Cucurbita argyrosperma subsp. sororia]2.8e-6371.16Show/hide
Query:  FSAGE-SYPFLEMAKPLSAEAIAINEKKMGMTLDDIIKMSKKTENKAKKQRRFPNKMQKFPINATQDRPRKLQ---DSRSSLRQGALAIRRSNFQGNLFA
        FS+ E SY     AKPL+ EAIAI EKKM M LDDIIKMSK T NK +KQRRFPNKMQKFP NATQDRPRKLQ   D+R+SLRQGALA RRSNFQGN FA
Subjt:  FSAGE-SYPFLEMAKPLSAEAIAINEKKMGMTLDDIIKMSKKTENKAKKQRRFPNKMQKFPINATQDRPRKLQ---DSRSSLRQGALAIRRSNFQGNLFA

Query:  LATELARKAAITPIRPRVFNRRAPNLNTTRVEAPPPPVQRRP-----SIAKVTAPAEPQMNASARQKPQTLDSLFANMKKQRSRALSQRQNGVAQQRNGG
        LATE+AR AA+ PIRPR FNRR PN   TRVEA  PPVQR+P      I K+TAP + Q NA+ RQ+PQTLDSLFANMK+QR R LSQRQNGVAQQRNG 
Subjt:  LATELARKAAITPIRPRVFNRRAPNLNTTRVEAPPPPVQRRP-----SIAKVTAPAEPQMNASARQKPQTLDSLFANMKKQRSRALSQRQNGVAQQRNGG

Query:  RQQEIPPWRRYRSGN
        RQQ  PPW R R  N
Subjt:  RQQEIPPWRRYRSGN

KAG7031152.1 hypothetical protein SDJN02_05192, partial [Cucurbita argyrosperma subsp. argyrosperma]3.7e-6373.27Show/hide
Query:  AKPLSAEAIAINEKKMGMTLDDIIKMSKKTENKAKKQRRFPNKMQKFPINATQDRPRKLQ---DSRSSLRQGALAIRRSNFQGNLFALATELARKAAITP
        AKPL+ EAIAI EKKM M LDDIIKMSK T NK +KQRRFPNKMQKFP NATQDRPRKLQ   D+R+SLRQGALA RRSNFQGN FALATE+AR AA+ P
Subjt:  AKPLSAEAIAINEKKMGMTLDDIIKMSKKTENKAKKQRRFPNKMQKFPINATQDRPRKLQ---DSRSSLRQGALAIRRSNFQGNLFALATELARKAAITP

Query:  IRPRVFNRRAPNLNTTRVEAPPPPVQRRP-----SIAKVTAPAEPQMNASARQKPQTLDSLFANMKKQRSRALSQRQNGVAQQRNGGRQQEIPPWRRYRS
        IRPR FNRR PN   TRVEA  PPVQR+P      I K+TAP + Q NA+ RQ+PQTLDSLFANMK+QR R LSQRQNGVAQQRNG RQQ  PPW R R 
Subjt:  IRPRVFNRRAPNLNTTRVEAPPPPVQRRP-----SIAKVTAPAEPQMNASARQKPQTLDSLFANMKKQRSRALSQRQNGVAQQRNGGRQQEIPPWRRYRS

Query:  GN
         N
Subjt:  GN

XP_004148996.1 uncharacterized protein LOC101210049 [Cucumis sativus]1.4e-6274.49Show/hide
Query:  AKPLSAEAIAINEKKMGMTLDDIIKMSKKTENKAKKQRRFPNKMQKFPINATQDRPRKLQ---DSRSSLRQGALAIRRSNFQGNLFALATELARKAAITP
        AKPL+ EAIAI EKKM M LDDIIKMSK T NK +KQRR PNKMQKFP NATQDRPRKLQ   DSRSSLRQGALA RRSNFQGN F LATE+ARKAA+ P
Subjt:  AKPLSAEAIAINEKKMGMTLDDIIKMSKKTENKAKKQRRFPNKMQKFPINATQDRPRKLQ---DSRSSLRQGALAIRRSNFQGNLFALATELARKAAITP

Query:  IRPRVFNRRAPNLNTTRVEAPPPPVQRRP-----SIAKVTAPAEPQMNASARQKPQTLDSLFANMKKQRSRALSQRQN-GVAQQRNGGRQQEIPPW
        IRPR F RRAPN N TRVEA  PPV R+P      + KV+APA+PQ N + RQ+PQTLDSLFANMK+QR R LSQRQN G AQQRNGGRQQ+ PPW
Subjt:  IRPRVFNRRAPNLNTTRVEAPPPPVQRRP-----SIAKVTAPAEPQMNASARQKPQTLDSLFANMKKQRSRALSQRQN-GVAQQRNGGRQQEIPPW

XP_022941743.1 uncharacterized protein LOC111447019 isoform X1 [Cucurbita moschata]3.7e-6373.27Show/hide
Query:  AKPLSAEAIAINEKKMGMTLDDIIKMSKKTENKAKKQRRFPNKMQKFPINATQDRPRKLQ---DSRSSLRQGALAIRRSNFQGNLFALATELARKAAITP
        AKPL+ EAIAI EKKM M LDDIIKMSK T NK +KQRRFPNKMQKFP NATQDRPRKLQ   D+R+SLRQGALA RRSNFQGN FALATE+AR AA+ P
Subjt:  AKPLSAEAIAINEKKMGMTLDDIIKMSKKTENKAKKQRRFPNKMQKFPINATQDRPRKLQ---DSRSSLRQGALAIRRSNFQGNLFALATELARKAAITP

Query:  IRPRVFNRRAPNLNTTRVEAPPPPVQRRP-----SIAKVTAPAEPQMNASARQKPQTLDSLFANMKKQRSRALSQRQNGVAQQRNGGRQQEIPPWRRYRS
        IRPR FNRR PN   TRVEA  PPVQR+P      I K+TAP + Q NA+ RQ+PQTLDSLFANMK+QR R LSQRQNG AQQRNG RQQ  PPW R R 
Subjt:  IRPRVFNRRAPNLNTTRVEAPPPPVQRRP-----SIAKVTAPAEPQMNASARQKPQTLDSLFANMKKQRSRALSQRQNGVAQQRNGGRQQEIPPWRRYRS

Query:  GN
        GN
Subjt:  GN

XP_023528268.1 uncharacterized protein LOC111791234 isoform X1 [Cucurbita pepo subsp. pepo]2.8e-6373.27Show/hide
Query:  AKPLSAEAIAINEKKMGMTLDDIIKMSKKTENKAKKQRRFPNKMQKFPINATQDRPRKLQ---DSRSSLRQGALAIRRSNFQGNLFALATELARKAAITP
        AKPL+ EAIAI EKKM M LDDIIKMSK T +K +KQRRFPNKMQKFP NATQDRPRKLQ   D+R+SLRQGALA RRSNFQGN FALATE+AR AA+ P
Subjt:  AKPLSAEAIAINEKKMGMTLDDIIKMSKKTENKAKKQRRFPNKMQKFPINATQDRPRKLQ---DSRSSLRQGALAIRRSNFQGNLFALATELARKAAITP

Query:  IRPRVFNRRAPNLNTTRVEAPPPPVQRRPS-----IAKVTAPAEPQMNASARQKPQTLDSLFANMKKQRSRALSQRQNGVAQQRNGGRQQEIPPWRRYRS
        IRPR FNRR PN   TRVEA  PPVQR+PS     I K+TAP + Q NA+ RQ+PQTLDSLFANMK+QR R LSQRQNG AQQRNG RQQ  PPW R R 
Subjt:  IRPRVFNRRAPNLNTTRVEAPPPPVQRRPS-----IAKVTAPAEPQMNASARQKPQTLDSLFANMKKQRSRALSQRQNGVAQQRNGGRQQEIPPWRRYRS

Query:  GN
        GN
Subjt:  GN

TrEMBL top hitse value%identityAlignment
A0A0A0KUX2 Uncharacterized protein6.7e-6374.49Show/hide
Query:  AKPLSAEAIAINEKKMGMTLDDIIKMSKKTENKAKKQRRFPNKMQKFPINATQDRPRKLQ---DSRSSLRQGALAIRRSNFQGNLFALATELARKAAITP
        AKPL+ EAIAI EKKM M LDDIIKMSK T NK +KQRR PNKMQKFP NATQDRPRKLQ   DSRSSLRQGALA RRSNFQGN F LATE+ARKAA+ P
Subjt:  AKPLSAEAIAINEKKMGMTLDDIIKMSKKTENKAKKQRRFPNKMQKFPINATQDRPRKLQ---DSRSSLRQGALAIRRSNFQGNLFALATELARKAAITP

Query:  IRPRVFNRRAPNLNTTRVEAPPPPVQRRP-----SIAKVTAPAEPQMNASARQKPQTLDSLFANMKKQRSRALSQRQN-GVAQQRNGGRQQEIPPW
        IRPR F RRAPN N TRVEA  PPV R+P      + KV+APA+PQ N + RQ+PQTLDSLFANMK+QR R LSQRQN G AQQRNGGRQQ+ PPW
Subjt:  IRPRVFNRRAPNLNTTRVEAPPPPVQRRP-----SIAKVTAPAEPQMNASARQKPQTLDSLFANMKKQRSRALSQRQN-GVAQQRNGGRQQEIPPW

A0A6J1ETG7 uncharacterized protein LOC1114375688.8e-6369.52Show/hide
Query:  AKPLSAEAIAINEKKMGMTLDDIIKMSKKTENKAKKQRRFPNKMQKFPINATQDRPRKLQ---DSRSSLRQGALAIRRSNFQGNLFALATELARKAAITP
        AKPL+ EAIA+ EKKM M LDDIIKMSK T NKA+K+RRFPNKMQKFP NATQDRPRKLQ   DSRSS+RQGALA RRSNFQGN FAL TE+AR+A + P
Subjt:  AKPLSAEAIAINEKKMGMTLDDIIKMSKKTENKAKKQRRFPNKMQKFPINATQDRPRKLQ---DSRSSLRQGALAIRRSNFQGNLFALATELARKAAITP

Query:  IRPRVFNRRAPNLNTTRVEAPP---PPVQRRPSIAKVTAPAE----------PQMNASARQKPQTLDSLFANMKKQRSRALSQRQNGVAQQRNGGRQQEI
         RPR F RRAPN N TRV+APP    P   R  + KV APA+          PQ NA+ RQ+PQTLDSLFANMK+QR R LSQRQNG AQQRNGGRQQ I
Subjt:  IRPRVFNRRAPNLNTTRVEAPP---PPVQRRPSIAKVTAPAE----------PQMNASARQKPQTLDSLFANMKKQRSRALSQRQNGVAQQRNGGRQQEI

Query:  PPWRRYRSGN
        PPW+R R GN
Subjt:  PPWRRYRSGN

A0A6J1FPB6 uncharacterized protein LOC111447019 isoform X11.8e-6373.27Show/hide
Query:  AKPLSAEAIAINEKKMGMTLDDIIKMSKKTENKAKKQRRFPNKMQKFPINATQDRPRKLQ---DSRSSLRQGALAIRRSNFQGNLFALATELARKAAITP
        AKPL+ EAIAI EKKM M LDDIIKMSK T NK +KQRRFPNKMQKFP NATQDRPRKLQ   D+R+SLRQGALA RRSNFQGN FALATE+AR AA+ P
Subjt:  AKPLSAEAIAINEKKMGMTLDDIIKMSKKTENKAKKQRRFPNKMQKFPINATQDRPRKLQ---DSRSSLRQGALAIRRSNFQGNLFALATELARKAAITP

Query:  IRPRVFNRRAPNLNTTRVEAPPPPVQRRP-----SIAKVTAPAEPQMNASARQKPQTLDSLFANMKKQRSRALSQRQNGVAQQRNGGRQQEIPPWRRYRS
        IRPR FNRR PN   TRVEA  PPVQR+P      I K+TAP + Q NA+ RQ+PQTLDSLFANMK+QR R LSQRQNG AQQRNG RQQ  PPW R R 
Subjt:  IRPRVFNRRAPNLNTTRVEAPPPPVQRRP-----SIAKVTAPAEPQMNASARQKPQTLDSLFANMKKQRSRALSQRQNGVAQQRNGGRQQEIPPWRRYRS

Query:  GN
        GN
Subjt:  GN

A0A6J1J5N5 uncharacterized protein LOC111481570 isoform X14.4e-6272.64Show/hide
Query:  KPLSAEAIAINEKKMGMTLDDIIKMSKKTENKAKKQRRFPNKMQKFPINATQDRPRKLQ---DSRSSLRQGALAIRRSNFQGNLFALATELARKAAITPI
        KPL+ EAIAI EKKM M LDDIIKMSK T NK +KQRRFPNKMQKFP NATQDRPRKLQ   D+R+SLRQGA A RRSNFQGN FALATE+ARKAA+ PI
Subjt:  KPLSAEAIAINEKKMGMTLDDIIKMSKKTENKAKKQRRFPNKMQKFPINATQDRPRKLQ---DSRSSLRQGALAIRRSNFQGNLFALATELARKAAITPI

Query:  RPRVFNRRAPNLNTTRVEAPPPPVQRRP-----SIAKVTAPAEPQMNASARQKPQTLDSLFANMKKQRSRALSQRQNGVAQQRNGGRQQEIPPWRRYRSG
        RPR FNR  PN   TRVEA  PPVQR+P      I K+ AP + Q NA+ RQKPQTLDSLFANMK+QR R LSQRQNG AQQRNG RQQ  PPW R R G
Subjt:  RPRVFNRRAPNLNTTRVEAPPPPVQRRP-----SIAKVTAPAEPQMNASARQKPQTLDSLFANMKKQRSRALSQRQNGVAQQRNGGRQQEIPPWRRYRSG

Query:  N
        N
Subjt:  N

A0A6J1JB99 uncharacterized protein LOC111483428 isoform X21.7e-6168.57Show/hide
Query:  AKPLSAEAIAINEKKMGMTLDDIIKMSKKTENKAKKQRRFPNKMQKFPINATQDRPRKLQ---DSRSSLRQGALAIRRSNFQGNLFALATELARKAAITP
        AKPL+ EAIA+ EKKM M LDDIIKMSK T NKA+K+RRFPNKMQKFP N TQDRPRKLQ   DSRSS+RQGALA RRSNFQGN FAL TE++R+A + P
Subjt:  AKPLSAEAIAINEKKMGMTLDDIIKMSKKTENKAKKQRRFPNKMQKFPINATQDRPRKLQ---DSRSSLRQGALAIRRSNFQGNLFALATELARKAAITP

Query:  IRPRVFNRRAPNLNTTRVEAPP---PPVQRRPSIAKVTAPAE----------PQMNASARQKPQTLDSLFANMKKQRSRALSQRQNGVAQQRNGGRQQEI
         RPR F RRAPN N TRV+APP    P   R  + KV APA+          PQ NA+ RQ+PQTLDSLFANMK+QR R LSQRQNG AQQRNGGRQQ I
Subjt:  IRPRVFNRRAPNLNTTRVEAPP---PPVQRRPSIAKVTAPAE----------PQMNASARQKPQTLDSLFANMKKQRSRALSQRQNGVAQQRNGGRQQEI

Query:  PPWRRYRSGN
        PPW+R R GN
Subjt:  PPWRRYRSGN

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT4G10970.1 unknown protein3.7e-2139.81Show/hide
Query:  KPLSAEAIAINEKKMGMTLDDIIKMSKKTENKAK-KQRRFPNKMQKFPINATQD--RPRKLQDSRSSLRQGALAIRRSNFQGNLFALATELARKAAITPI
        KP++ E +A+ EKKM M+LD+IIKM K   N  K K++R  NK +KF   A     + ++  DSRS +RQGA A +RSNFQGN F + T +ARKAA    
Subjt:  KPLSAEAIAINEKKMGMTLDDIIKMSKKTENKAK-KQRRFPNKMQKFPINATQD--RPRKLQDSRSSLRQGALAIRRSNFQGNLFALATELARKAAITPI

Query:  RPRVFN-RRAPNLNTTRVEAPPPP----------VQRRPSIAKVTAPAEPQMNASARQKPQTLDSLFANMKKQ--RSRALSQRQNGVAQQRNGGRQQE--
        R R +N  R  N N +R  APP             Q++    K+            RQ PQTLDS FANMK++  R R  +  ++ V     G  QQ+  
Subjt:  RPRVFN-RRAPNLNTTRVEAPPPP----------VQRRPSIAKVTAPAEPQMNASARQKPQTLDSLFANMKKQ--RSRALSQRQNGVAQQRNGGRQQE--

Query:  IPPWRR
        + PW R
Subjt:  IPPWRR

AT4G10970.2 unknown protein3.7e-2139.81Show/hide
Query:  KPLSAEAIAINEKKMGMTLDDIIKMSKKTENKAK-KQRRFPNKMQKFPINATQD--RPRKLQDSRSSLRQGALAIRRSNFQGNLFALATELARKAAITPI
        KP++ E +A+ EKKM M+LD+IIKM K   N  K K++R  NK +KF   A     + ++  DSRS +RQGA A +RSNFQGN F + T +ARKAA    
Subjt:  KPLSAEAIAINEKKMGMTLDDIIKMSKKTENKAK-KQRRFPNKMQKFPINATQD--RPRKLQDSRSSLRQGALAIRRSNFQGNLFALATELARKAAITPI

Query:  RPRVFN-RRAPNLNTTRVEAPPPP----------VQRRPSIAKVTAPAEPQMNASARQKPQTLDSLFANMKKQ--RSRALSQRQNGVAQQRNGGRQQE--
        R R +N  R  N N +R  APP             Q++    K+            RQ PQTLDS FANMK++  R R  +  ++ V     G  QQ+  
Subjt:  RPRVFN-RRAPNLNTTRVEAPPPP----------VQRRPSIAKVTAPAEPQMNASARQKPQTLDSLFANMKKQ--RSRALSQRQNGVAQQRNGGRQQE--

Query:  IPPWRR
        + PW R
Subjt:  IPPWRR

AT4G10970.3 unknown protein3.7e-2139.81Show/hide
Query:  KPLSAEAIAINEKKMGMTLDDIIKMSKKTENKAK-KQRRFPNKMQKFPINATQD--RPRKLQDSRSSLRQGALAIRRSNFQGNLFALATELARKAAITPI
        KP++ E +A+ EKKM M+LD+IIKM K   N  K K++R  NK +KF   A     + ++  DSRS +RQGA A +RSNFQGN F + T +ARKAA    
Subjt:  KPLSAEAIAINEKKMGMTLDDIIKMSKKTENKAK-KQRRFPNKMQKFPINATQD--RPRKLQDSRSSLRQGALAIRRSNFQGNLFALATELARKAAITPI

Query:  RPRVFN-RRAPNLNTTRVEAPPPP----------VQRRPSIAKVTAPAEPQMNASARQKPQTLDSLFANMKKQ--RSRALSQRQNGVAQQRNGGRQQE--
        R R +N  R  N N +R  APP             Q++    K+            RQ PQTLDS FANMK++  R R  +  ++ V     G  QQ+  
Subjt:  RPRVFN-RRAPNLNTTRVEAPPPP----------VQRRPSIAKVTAPAEPQMNASARQKPQTLDSLFANMKKQ--RSRALSQRQNGVAQQRNGGRQQE--

Query:  IPPWRR
        + PW R
Subjt:  IPPWRR

AT4G10970.4 unknown protein3.7e-2139.81Show/hide
Query:  KPLSAEAIAINEKKMGMTLDDIIKMSKKTENKAK-KQRRFPNKMQKFPINATQD--RPRKLQDSRSSLRQGALAIRRSNFQGNLFALATELARKAAITPI
        KP++ E +A+ EKKM M+LD+IIKM K   N  K K++R  NK +KF   A     + ++  DSRS +RQGA A +RSNFQGN F + T +ARKAA    
Subjt:  KPLSAEAIAINEKKMGMTLDDIIKMSKKTENKAK-KQRRFPNKMQKFPINATQD--RPRKLQDSRSSLRQGALAIRRSNFQGNLFALATELARKAAITPI

Query:  RPRVFN-RRAPNLNTTRVEAPPPP----------VQRRPSIAKVTAPAEPQMNASARQKPQTLDSLFANMKKQ--RSRALSQRQNGVAQQRNGGRQQE--
        R R +N  R  N N +R  APP             Q++    K+            RQ PQTLDS FANMK++  R R  +  ++ V     G  QQ+  
Subjt:  RPRVFN-RRAPNLNTTRVEAPPPP----------VQRRPSIAKVTAPAEPQMNASARQKPQTLDSLFANMKKQ--RSRALSQRQNGVAQQRNGGRQQE--

Query:  IPPWRR
        + PW R
Subjt:  IPPWRR

AT4G23910.1 unknown protein1.2e-1641.18Show/hide
Query:  LEMAKPLSAEAIAINEKKMGMTLDDIIKMSKK--TENKAKKQRRFPNKMQKFPINATQD--RPRKLQDSRSSLRQGALAIRRSNFQGNLFALATELARK-
        +E   PL AE IA+ EKK+ M LDDIIK++K+    NK KK RR  NK Q F   A  +    R   +S S++RQGA+  RRS FQG  F + T +ARK 
Subjt:  LEMAKPLSAEAIAINEKKMGMTLDDIIKMSKK--TENKAKKQRRFPNKMQKFPINATQD--RPRKLQDSRSSLRQGALAIRRSNFQGNLFALATELARK-

Query:  --AAITPIRPRVFN-RRAPNLNTTRVEAPPPPVQRRPSIAKVTAPAE--PQMNASARQKPQTLDSLFANMKKQRSRALSQRQNGVAQQRNGGRQQEIPPW
          AA    R R FN  R  + N +R+ A  PPVQ     A+V A      Q       K +TLDS FA+MK+QR   ++    GV  Q        +PPW
Subjt:  --AAITPIRPRVFN-RRAPNLNTTRVEAPPPPVQRRPSIAKVTAPAE--PQMNASARQKPQTLDSLFANMKKQRSRALSQRQNGVAQQRNGGRQQEIPPW

Query:  RRYR
         R R
Subjt:  RRYR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGTTTCTCCGCCGGAGAATCATATCCTTTTCTAGAGATGGCCAAACCACTTTCTGCTGAAGCAATTGCTATAAATGAGAAGAAGATGGGCATGACTTTAGATGACAT
TATCAAGATGTCCAAAAAAACTGAAAATAAAGCTAAGAAGCAGAGAAGGTTTCCGAACAAAATGCAGAAATTTCCAATCAATGCTACACAAGATAGACCTAGGAAGTTGC
AAGACTCAAGATCTTCTCTAAGACAGGGGGCTTTGGCCATAAGAAGGTCAAACTTTCAAGGGAATCTGTTTGCCTTGGCAACCGAGCTTGCAAGAAAGGCGGCAATTACT
CCAATTCGGCCTAGAGTATTTAACCGCAGGGCACCCAATTTGAATACAACAAGGGTTGAAGCTCCTCCACCACCTGTTCAGAGGAGACCATCCATTGCCAAGGTTACTGC
ACCTGCCGAGCCACAAATGAATGCTTCGGCAAGACAGAAGCCACAGACACTCGACTCGCTGTTTGCCAACATGAAGAAGCAGAGGTCGAGGGCGTTGTCGCAGCGACAAA
ATGGTGTTGCACAACAACGGAACGGTGGTCGCCAGCAGGAAATACCTCCATGGAGAAGATACCGTTCTGGTAACTGA
mRNA sequenceShow/hide mRNA sequence
CTTTCCTCGCGGGTTAAAAACAGGCTCTCCCGCGAATCTCAGGAATTCTTCATCTCTCTCGTAATTTCGCTGCGCTAATCTTTCACCGGTGTTTCTCCGATGAGTTTCTC
CGCCGGAGAATCATATCCTTTTCTAGAGATGGCCAAACCACTTTCTGCTGAAGCAATTGCTATAAATGAGAAGAAGATGGGCATGACTTTAGATGACATTATCAAGATGT
CCAAAAAAACTGAAAATAAAGCTAAGAAGCAGAGAAGGTTTCCGAACAAAATGCAGAAATTTCCAATCAATGCTACACAAGATAGACCTAGGAAGTTGCAAGACTCAAGA
TCTTCTCTAAGACAGGGGGCTTTGGCCATAAGAAGGTCAAACTTTCAAGGGAATCTGTTTGCCTTGGCAACCGAGCTTGCAAGAAAGGCGGCAATTACTCCAATTCGGCC
TAGAGTATTTAACCGCAGGGCACCCAATTTGAATACAACAAGGGTTGAAGCTCCTCCACCACCTGTTCAGAGGAGACCATCCATTGCCAAGGTTACTGCACCTGCCGAGC
CACAAATGAATGCTTCGGCAAGACAGAAGCCACAGACACTCGACTCGCTGTTTGCCAACATGAAGAAGCAGAGGTCGAGGGCGTTGTCGCAGCGACAAAATGGTGTTGCA
CAACAACGGAACGGTGGTCGCCAGCAGGAAATACCTCCATGGAGAAGATACCGTTCTGGTAACTGAAGAATTACACAGAAAGAACTGTCGTGCAAATGTAGATGATGACT
GCTCGTGTTCGTGTCGCTTGATAGTGAGTCGTATAGCTAATAATGTAGTGTTTCTTGACATTTTCTAAACCTCAAACATTTTGGTGCCTGTTTTTAGGATTTTTGTGGCT
TTCAATATGGACCCTGAAATACTCTATAGATTTGTTGCATCTGCAGTTCATGTAAAGTTTTGTTTTGTTAAATCCCATCTTTTCTTTGTTTCACAAAAAGAAATCCTCAT
CTTCTCTGTTCACGTTCATGCCGTAGGAAATTAATTCATTGTATGTGAACGTTCAATTACTTGGGGAATTCTTTAGAGAATTATTTAGTTGAAACGGTG
Protein sequenceShow/hide protein sequence
MSFSAGESYPFLEMAKPLSAEAIAINEKKMGMTLDDIIKMSKKTENKAKKQRRFPNKMQKFPINATQDRPRKLQDSRSSLRQGALAIRRSNFQGNLFALATELARKAAIT
PIRPRVFNRRAPNLNTTRVEAPPPPVQRRPSIAKVTAPAEPQMNASARQKPQTLDSLFANMKKQRSRALSQRQNGVAQQRNGGRQQEIPPWRRYRSGN