CuGenDBv2

Sed0016463 (gene) of Chayote v1 genome

Gene ID: Sed0016463
Organism: Sechium edule (Chayote v1)
Description: Serine/arginine repetitive matrix-like protein
Genome location: LG14:22100357..22102255
RNA-Seq Expression: Sed0016463
Synteny: Sed0016463
Gene Ontology terms: NA
InterPro domains: NA
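
The genome location field above follows a `seqid:start..end` pattern. A minimal Python sketch for splitting it into parts is shown here; the `Location` type and `parse_location` function are hypothetical helpers, not part of CuGenDB, and the length calculation assumes the coordinates are 1-based and inclusive, which the page does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A parsed genome location (hypothetical helper type)."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumes 1-based, inclusive coordinates (not confirmed by the page).
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a string such as 'LG14:22100357..22102255'."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end = match.groups()
    return Location(seqid, int(start), int(end))


loc = parse_location("LG14:22100357..22102255")
print(loc.seqid, loc.start, loc.end, loc.length)
```

The reported length is the genomic span of the gene under the stated coordinate assumption.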


Homology
GenBank top hits (e-value, % identity, alignment)
KAG6603801.1 hypothetical protein SDJN03_04410, partial [Cucurbita argyrosperma subsp. sororia] (e-value: 5.7e-97; identity: 70.34%)
Query:  MEIPSPNPQSHSQTLSPAAAGSSSGSGSFPTSPEFEFWMVRNPSFPQPNLLSADELFVDGVLLPLHLL----------PNPKPDPDPTD------DGPKL
        MEIP  N  S SQ LSP A G    SGSFPTSPEFEFWMVRNPSFPQPNLLSADELF DGVLLPLHL+          PN K DP+P+       DGPKL
Subjt:  MEIPSPNPQSHSQTLSPAAAGSSSGSGSFPTSPEFEFWMVRNPSFPQPNLLSADELFVDGVLLPLHLL----------PNPKPDPDPTD------DGPKL

Query:  TASSAEQGCSLTPSKRWSIFKKSEKK---SNQE----EKKKEKKTGNGNGSSSAELNINLWPFSRSRSAGNACSRPKMFAGGQPGSRKVNSAPCSRSNSA
        T +S E G SLT SKRWSIFKKSEKK   +NQE    EKKKEKK  +GNGS+SAELNIN+WPFSRSRSAGNA +RPKMFAG QPGSRKVNSAPCSRSNS 
Subjt:  TASSAEQGCSLTPSKRWSIFKKSEKK---SNQE----EKKKEKKTGNGNGSSSAELNINLWPFSRSRSAGNACSRPKMFAGGQPGSRKVNSAPCSRSNSA

Query:  GESKSRRWPSSPSRAGVHLGRSSPVWQVRRGGSTVKTSENH------------TDAHRSKTATASTAASAASRGRVLNLNVPMCIGYRNHLSCRSDEASA
        GESKSR+WPSSPSRAGVHLGRSSPVWQVRRGGS  KTSEN             TD +RSK   A+TA  AASR RVLNLNVPMCIGYRNHLSCRSDEASA
Subjt:  GESKSRRWPSSPSRAGVHLGRSSPVWQVRRGGSTVKTSENH------------TDAHRSKTATASTAASAASRGRVLNLNVPMCIGYRNHLSCRSDEASA

Query:  IGVVGGGGGGGSNGGGVDGGSV-GNSG
        +GVV   GGG S+G    G  V GN+G
Subjt:  IGVVGGGGGGGSNGGGVDGGSV-GNSG

KAG7033972.1 hypothetical protein SDJN02_03698, partial [Cucurbita argyrosperma subsp. argyrosperma] (e-value: 5.1e-98; identity: 69.94%)
Query:  MEIPSPNPQSHSQTLSPAAAGSSSGSGSFPTSPEFEFWMVRNPSFPQPNLLSADELFVDGVLLPLHLL----------PNPKPDPDPTD------DGPKL
        MEIP  N  S SQTLSP A G    SGSFPTSPEFEFWMVRNPSFPQPNLLSADELF DGVLLPLHL+          PN K DP+P+       DGPKL
Subjt:  MEIPSPNPQSHSQTLSPAAAGSSSGSGSFPTSPEFEFWMVRNPSFPQPNLLSADELFVDGVLLPLHLL----------PNPKPDPDPTD------DGPKL

Query:  TASSAEQGCSLTPSKRWSIFKKSEKK---SNQE----EKKKEKKTGNGNGSSSAELNINLWPFSRSRSAGNACSRPKMFAGGQPGSRKVNSAPCSRSNSA
        T +S E G SLT SKRWSIFKKSEKK   +NQE    EKKKEKK  +GNGS+SAELNIN+WPFSRSRSAGNA +RPKMFAG QPGSRKVNSAPCSRSNS 
Subjt:  TASSAEQGCSLTPSKRWSIFKKSEKK---SNQE----EKKKEKKTGNGNGSSSAELNINLWPFSRSRSAGNACSRPKMFAGGQPGSRKVNSAPCSRSNSA

Query:  GESKSRRWPSSPSRAGVHLGRSSPVWQVRRGGSTVKTSENH------------TDAHRSK-TATA-STAASAASRGRVLNLNVPMCIGYRNHLSCRSDEA
        GESKSR+WPSSPSRAGVHLGRSSPVWQVRRGGS  KTSEN             TD +RSK TATA +TA +AASR RVLNLNVPMCIGYRNHLSCRSDEA
Subjt:  GESKSRRWPSSPSRAGVHLGRSSPVWQVRRGGSTVKTSENH------------TDAHRSK-TATA-STAASAASRGRVLNLNVPMCIGYRNHLSCRSDEA

Query:  SAIGVVGGGGGGGSN--------GGGVDGGSVGNSG
        SA+GVVG  GGG S+        G   D  SV N G
Subjt:  SAIGVVGGGGGGGSN--------GGGVDGGSVGNSG

XP_023517334.1 uncharacterized protein LOC111781122 [Cucurbita pepo subsp. pepo] (e-value: 1.8e-95; identity: 70.57%)
Query:  SGSFPTSPEFEFWMVRNPSFPQPNLLSADELFVDGVLLPLHLL----------PNPKPDPDPTD------DGPKLTASSAEQGCSLTPSKRWSIFKKSEK
        +GSFPTSPEFEFWMVRNPS PQPNLLSADELFVDGVLLPLHL+          PNPK DP+P        DGPKL   SAE G SLT S+RW+IFKKSEK
Subjt:  SGSFPTSPEFEFWMVRNPSFPQPNLLSADELFVDGVLLPLHLL----------PNPKPDPDPTD------DGPKLTASSAEQGCSLTPSKRWSIFKKSEK

Query:  KS-------NQEEKKKEKKTGNGNGSSSAELNINLWPFSRSRSAGNACSRPKMFAGGQPGSRKVNSAPCSRSNSAGESKSRRWPSSPSRAGVHLGRSSPV
        K+        ++EKKKEKKT  GNGSSSAELNIN+WPFSRSRSAGN  +RPK+F GGQPGSRK NSAPCSRSNSAGESKSR+WPSSPS AGVHLGRSSPV
Subjt:  KS-------NQEEKKKEKKTGNGNGSSSAELNINLWPFSRSRSAGNACSRPKMFAGGQPGSRKVNSAPCSRSNSAGESKSRRWPSSPSRAGVHLGRSSPV

Query:  WQVRRGGSTVKTSE------------NHTDAHRSKTATASTAASAASRGRVLNLNVPMCIGYRNHLSCRSDEASAIGVVGGGGGGGSNGGGVDGGSVGN
        WQVRRGGS VKTSE              TDAHRSK       A+AASR RVLNLNVPMCIGYRNHLSCRSDE S +G VG GGG  S+ GGVDGGS G+
Subjt:  WQVRRGGSTVKTSE------------NHTDAHRSKTATASTAASAASRGRVLNLNVPMCIGYRNHLSCRSDEASAIGVVGGGGGGGSNGGGVDGGSVGN

XP_023544356.1 protein lingerer [Cucurbita pepo subsp. pepo] (e-value: 1.1e-95; identity: 68.9%)
Query:  MEIPSPNPQSHSQTLSPAAAGSSSGSGSFPTSPEFEFWMVRNPSFPQPNLLSADELFVDGVLLPLHLL------------PNPKPDPDPTD------DGP
        MEIP  N  S SQ LSP A G    SGSFPTSPEFEFWMVRNPSFPQPNLLSADELF DGVLLPLHL+            PN K DP+P+       DGP
Subjt:  MEIPSPNPQSHSQTLSPAAAGSSSGSGSFPTSPEFEFWMVRNPSFPQPNLLSADELFVDGVLLPLHLL------------PNPKPDPDPTD------DGP

Query:  KLTASSAEQGCSLTPSKRWSIFKKSEKK---SNQE----EKKKEKKTGNGNGSSSAELNINLWPFSRSRSAGNACSRPKMFAGGQPGSRKVNSAPCSRSN
        KLT +S E G SLT SKRWSIFKKSEKK   +NQE    EKKKEKK  +GNGS+SAELNIN+WPFSRSRSAGNA +RPKMFAG QPGSRKVNSAPCSRSN
Subjt:  KLTASSAEQGCSLTPSKRWSIFKKSEKK---SNQE----EKKKEKKTGNGNGSSSAELNINLWPFSRSRSAGNACSRPKMFAGGQPGSRKVNSAPCSRSN

Query:  SAGESKSRRWPSSPSRAGVHLGRSSPVWQVRRGGSTVKTSENH------------TDAHRSKTATASTAASAASRGRVLNLNVPMCIGYRNHLSCRSDEA
        S GESKSR+WPSSPSRAGVHLGRSSPVWQVRRGGS  KTSE              TD +RSK    +TA +AASR RVLNLNVPMCIGYRNHLSCRSDEA
Subjt:  SAGESKSRRWPSSPSRAGVHLGRSSPVWQVRRGGSTVKTSENH------------TDAHRSKTATASTAASAASRGRVLNLNVPMCIGYRNHLSCRSDEA

Query:  SAIGVVGGGGGGGSNGGGVDGGSVGNSG
        SA+GV+G  GGG S+  G      GN+G
Subjt:  SAIGVVGGGGGGGSNGGGVDGGSVGNSG

XP_038883611.1 uncharacterized protein LOC120074528 [Benincasa hispida] (e-value: 1.3e-98; identity: 68.96%)
Query:  MEIPSPNPQSHSQTLSPAAAGSSSGSGSFPTSPEFEFWMVRNPSFPQPNLLSADELFVDGVLLPLHLL----------PNPKPD-------PDPTDDGPK
        MEIP       SQ LSP  A  +  SGSFPTSPEFEFWMVRNPSFPQPNLLSADELFVDGVLLPLHLL          PN KPD       PDP+ DGPK
Subjt:  MEIPSPNPQSHSQTLSPAAAGSSSGSGSFPTSPEFEFWMVRNPSFPQPNLLSADELFVDGVLLPLHLL----------PNPKPD-------PDPTDDGPK

Query:  LTASSAEQGCSLTPSKRWSIFKKSEKK---SNQE----EKKKEKKTGNGNGSSSAELNINLWPFSRSRSAGNACSRPKMFAGGQPGSRKVNSAPCSRSNS
        LT +SA+ G SL+ SKRWSIFKKSEKK   +NQE    EKKKEKKT  GNGS+SAELNIN+WPFSRSRSAGNA +RPKMF G QPGSRKVNSAPCSRSNS
Subjt:  LTASSAEQGCSLTPSKRWSIFKKSEKK---SNQE----EKKKEKKTGNGNGSSSAELNINLWPFSRSRSAGNACSRPKMFAGGQPGSRKVNSAPCSRSNS

Query:  AGESKSRRWPSSPSRAGVHLGRSSPVWQVRRGGSTVKTSE------------NHTDAHRSKTATASTAASAASRGRVLNLNVPMCIGYRNHLSCRSDEAS
        AGESKSR+WPSSPSR GVHLGRSSPVWQVRRGGST K+SE              TDAHRSK A AS   S+ASR RVLNLNVPMCIGYRNHLSCRSDE S
Subjt:  AGESKSRRWPSSPSRAGVHLGRSSPVWQVRRGGSTVKTSE------------NHTDAHRSKTATASTAASAASRGRVLNLNVPMCIGYRNHLSCRSDEAS

Query:  AIGVVGGGGGGGSNGGG--------VDGGSVGNSG
        A+GV+G GGGG S+  G         DGG++ N G
Subjt:  AIGVVGGGGGGGSNGGG--------VDGGSVGNSG

TrEMBL top hits (e-value, % identity, alignment)
A0A0A0KMZ0 Uncharacterized protein (e-value: 6.1e-89; identity: 66.89%)
Query:  MEIPSPNPQSHSQTLSPAAAGSSSGSGSFPTSPEFEFWMVRNPSFPQPNLLSADELFVDGVLLPLHLLPN--PKPDPDPT--------------DDGPKL
        MEIPSP+P                 + +FPTSPEFEFWMVRNPSFPQ NLLSADELFVDGVLLPLHLLPN  P P  DP                DGPKL
Subjt:  MEIPSPNPQSHSQTLSPAAAGSSSGSGSFPTSPEFEFWMVRNPSFPQPNLLSADELFVDGVLLPLHLLPN--PKPDPDPT--------------DDGPKL

Query:  TASSAEQGCSLTPSKRWSIFKKSEKKS---NQE----EKKKEKKTGNGNGSSSAELNINLWPFSRSRSAGNACSRPKMFAGGQPGSRKVNSAPCSRSNSA
        T +S + G S   SKRWSIFKKSEKK+   NQE    EKKKEKKT   NGS+SAELNIN+WPFSRSRSAGNA +RPK+F G QPGSRKVNSAPCSRSNSA
Subjt:  TASSAEQGCSLTPSKRWSIFKKSEKKS---NQE----EKKKEKKTGNGNGSSSAELNINLWPFSRSRSAGNACSRPKMFAGGQPGSRKVNSAPCSRSNSA

Query:  GESKSRRWPSSPSRAGVHLGRSSPVWQVRRGGSTVKTSENH------------TDAHRSKTATASTAASAASRGRVLNLNVPMCIGYRNHLSCRSDEASA
        GESKSR+WPSSPSR GVHLGRSSPVWQVRRGGS  KT E              ++ HRSK ATA+ A+S+ASR RVLNLNVPMCIGYRNHLSCRSDE SA
Subjt:  GESKSRRWPSSPSRAGVHLGRSSPVWQVRRGGSTVKTSENH------------TDAHRSKTATASTAASAASRGRVLNLNVPMCIGYRNHLSCRSDEASA

Query:  IG
        +G
Subjt:  IG

A0A6J1EE91 uncharacterized protein LOC111433517 (e-value: 2.2e-94; identity: 69.9%)
Query:  SGSFPTSPEFEFWMVRNPSFPQPNLLSADELFVDGVLLPLHLL----------PNPKPDPDPTD------DGPKLTASSAEQGCSLTPSKRWSIFKKSEK
        +GSFPTSPEFEFWMVRNPS PQPNLLSADELFVDGVLLPLHL+          PNPK DP+P        DGPKL   SAE G SLT S+RW+IFKKSEK
Subjt:  SGSFPTSPEFEFWMVRNPSFPQPNLLSADELFVDGVLLPLHLL----------PNPKPDPDPTD------DGPKLTASSAEQGCSLTPSKRWSIFKKSEK

Query:  KS-------NQEEKKKEKKTGNGNGSSSAELNINLWPFSRSRSAGNACSRPKMFAGGQPGSRKVNSAPCSRSNSAGESKSRRWPSSPSRAGVHLGRSSPV
        K+        ++EKKKEKKT  GNGSSSAELNIN+WPFSRSRSAGN  +RPK+F GGQPGSRK NSAPCSRSNSAGESKSR+WPSSPS  GVHLGRSSPV
Subjt:  KS-------NQEEKKKEKKTGNGNGSSSAELNINLWPFSRSRSAGNACSRPKMFAGGQPGSRKVNSAPCSRSNSAGESKSRRWPSSPSRAGVHLGRSSPV

Query:  WQVRRGGSTVKTSE------------NHTDAHRSKTATASTAASAASRGRVLNLNVPMCIGYRNHLSCRSDEASAIGVVGGGGGGGSNGGGVDGGSVGN
        WQVRRGGSTVKTSE              TDAHRSK       A+AAS  RVLNLNVPMCIGYRNHLSCRSDE S +G VG GGG  S+  GVDGGS G+
Subjt:  WQVRRGGSTVKTSE------------NHTDAHRSKTATASTAASAASRGRVLNLNVPMCIGYRNHLSCRSDEASAIGVVGGGGGGGSNGGGVDGGSVGN

A0A6J1GDJ4 uncharacterized protein LOC111453190 (e-value: 1.3e-94; identity: 68.9%)
Query:  MEIPSPNPQSHSQTLSPAAAGSSSGSGSFPTSPEFEFWMVRNPSFPQPNLLSADELFVDGVLLPLHLL----------PNPKPDPDPTD------DGPKL
        MEIP  N  S SQ LSP A G    SGSFPTSPEFEFWMVRNPSFPQPNLLSADELF DGVLLPLHL+          PN K DP+P+       DGPKL
Subjt:  MEIPSPNPQSHSQTLSPAAAGSSSGSGSFPTSPEFEFWMVRNPSFPQPNLLSADELFVDGVLLPLHLL----------PNPKPDPDPTD------DGPKL

Query:  TASSAEQGCSLTPSKRWSIFKKSEKK---SNQE----EKKKEKKTGNGNGSSSAELNINLWPFSRSRSAGNACSRPKMFAGGQPGSRKVNSAPCSRSNSA
        T +S E G SLT SKRWSIFKKSEKK   +NQE    EKKKEKK  +GNGS+SAELNIN+WPFSRSRSAGNA +RPKMF+G QPGSRKVNSAPCSRSNS 
Subjt:  TASSAEQGCSLTPSKRWSIFKKSEKK---SNQE----EKKKEKKTGNGNGSSSAELNINLWPFSRSRSAGNACSRPKMFAGGQPGSRKVNSAPCSRSNSA

Query:  GESKSRRWPSSPSRAGVHLGRSSPVWQVRRGGSTVKTSE----NHTDAHRSKTA--TASTAASAASRGRVLNLNVPMCIGYRNHLSCRSDEASAIGVVGG
        GESKSR+WPSSPSRAGVHLGRSSPVWQVRRGGS  KTSE    N   A R +      S A +AASR RVLNLNVPMCIGYRNHLSCRSDEASA+GVVG 
Subjt:  GESKSRRWPSSPSRAGVHLGRSSPVWQVRRGGSTVKTSE----NHTDAHRSKTA--TASTAASAASRGRVLNLNVPMCIGYRNHLSCRSDEASAIGVVGG

Query:  GGGGGSN--------GGGVDGGSVGNSG
         GGG S+        G   D  SV N G
Subjt:  GGGGGSN--------GGGVDGGSVGNSG

A0A6J1IMB6 uncharacterized protein LOC111478712 (e-value: 2.0e-95; identity: 68.88%)
Query:  MEIPSPNPQSHSQTLSPAAAGSSSGSGSFPTSPEFEFWMVRNPSFPQPNLLSADELFVDGVLLPLHLL----------PNPKPDPDPTD------DGPKL
        MEIP  N  S +Q LSP   G    SGSFPTSPEFEFWMVRNPSFPQPNLLSADELF DGVLLPLHL+          PN K DP+P        DGPKL
Subjt:  MEIPSPNPQSHSQTLSPAAAGSSSGSGSFPTSPEFEFWMVRNPSFPQPNLLSADELFVDGVLLPLHLL----------PNPKPDPDPTD------DGPKL

Query:  TASSAEQGCSLTPSKRWSIFKKSEKK---SNQE----EKKKEKKTGNGNGSSSAELNINLWPFSRSRSAGNACSRPKMFAGGQPGSRKVNSAPCSRSNSA
        T +S E G SLT SKRWSIFKK EKK   +NQE    EKKKEKK  +GNGS+SAELNIN+WPFSRSRSAGNA +RPKMF G QPGSRKVNSAPCSRSNS 
Subjt:  TASSAEQGCSLTPSKRWSIFKKSEKK---SNQE----EKKKEKKTGNGNGSSSAELNINLWPFSRSRSAGNACSRPKMFAGGQPGSRKVNSAPCSRSNSA

Query:  GESKSRRWPSSPSRAGVHLGRSSPVWQVRRGGSTVKTSENH------------TDAHRSKTATASTAASAASRGRVLNLNVPMCIGYRNHLSCRSDEASA
        GESKSR+WPSSPSRAGVHLGRSSPVWQVRRGGS  KTSE              TD +RSK    +TAA++ASR RVLNLNVPMCIGYRNHLSCRSDEASA
Subjt:  GESKSRRWPSSPSRAGVHLGRSSPVWQVRRGGSTVKTSENH------------TDAHRSKTATASTAASAASRGRVLNLNVPMCIGYRNHLSCRSDEASA

Query:  IGVVGGGGGGGSNGGGV-----DGGSVGNSG
        +G VGG  GGGS  G V     DGGSV N G
Subjt:  IGVVGGGGGGGSNGGGV-----DGGSVGNSG

A0A6J1KQC7 uncharacterized protein LOC111497298 (e-value: 4.8e-94; identity: 69.93%)
Query:  SGSFPTSPEFEFWMVRNPSFPQPNLLSADELFVDGVLLPLHLL----------PNPKPDPDPTD------DGPKLTASSAEQGCSLTPSKRWSIFKKSEK
        +GSFPTSPEFEFWMVRNPS PQPNL+SADELFVDGVLLPLHL+          PNPK DP+P        DGPKLT  SAE G SLT S+RW+IFKKSEK
Subjt:  SGSFPTSPEFEFWMVRNPSFPQPNLLSADELFVDGVLLPLHLL----------PNPKPDPDPTD------DGPKLTASSAEQGCSLTPSKRWSIFKKSEK

Query:  KS-------NQEEKKKEKKTGNGNGSSSAELNINLWPFSRSRSAGNACSRPKMFAGGQPGSRKVNSAPCSRSNSAGESKSRRWPSSPSRAGVHLGRSSPV
        K+        ++EKKKEKKT  GNGSSSAELNIN+WPFSRSRSAGN  +RPK+F GGQPGSRK NSAPCSRSNSAGESKSR+WPSSPS AGVHLGR SPV
Subjt:  KS-------NQEEKKKEKKTGNGNGSSSAELNINLWPFSRSRSAGNACSRPKMFAGGQPGSRKVNSAPCSRSNSAGESKSRRWPSSPSRAGVHLGRSSPV

Query:  WQVRRGGSTVKTSENHT------------DAHRSKTATASTAASAASRGRVLNLNVPMCIGYRNHLSCRSDEASAIGVVGGGGGGGSNGGGVDGGS
        WQVRR GS VKTSE  +            DAHRSK       A+AASR RVLNLNVPMCIGYRNHLSCRSDE S +G VG GGG  S GGGVD GS
Subjt:  WQVRRGGSTVKTSENHT------------DAHRSKTATASTAASAASRGRVLNLNVPMCIGYRNHLSCRSDEASAIGVVGGGGGGGSNGGGVDGGS

SwissProt top hits (e-value, % identity, alignment)
No hits found

Arabidopsis top hits (e-value, % identity, alignment)
AT4G22190.1 unknown protein (e-value: 7.7e-52; identity: 49.22%)
Query:  QTLSPAAAGSSSGSGSFPTS-PEFEFWMVRNPSFPQ--PNLLSADELFVDGVLLPLHLL---------PN-PKPDPDPTDDGPKLTA---SSAEQGC---
        +TLSP   GS     S  ++ PEFEFW + N SFPQ   +LLSADELF DGVLLPL LL         PN  + DPDP+     L     S  E G    
Subjt:  QTLSPAAAGSSSGSGSFPTS-PEFEFWMVRNPSFPQ--PNLLSADELFVDGVLLPLHLL---------PN-PKPDPDPTDDGPKLTA---SSAEQGC---

Query:  ---SLTPSKRW-SIFKKSE------KKSNQEEKKKEKKTGNGNGS---SSAELNINLWPFSRSRSAGNACSRPKMFAGGQPGSRKVNSAPCSRSNSAGES
             T SKRW  IF+KSE      K+  +E KK++KKTG+G  S   S AELNIN+WPFSRSRSAGN  +RP+M + G P +RKV+SAPCSRSNS GES
Subjt:  ---SLTPSKRW-SIFKKSE------KKSNQEEKKKEKKTGNGNGS---SSAELNINLWPFSRSRSAGNACSRPKMFAGGQPGSRKVNSAPCSRSNSAGES

Query:  KSRRWPSSPSRAGVHLGRSSPVWQVRRG-----GSTVKTSENHTDAHRSKTAT-ASTAASAASRGRVLNLNVPMCIGYRNHLSCRSDEASAIGVVGGGGG
        KSR+WPSSPSR GVHLGR+SPVWQVRRG     G T+          R    T        +++ +VLNLNVPMCIGYR+ LSCR++E+S       GGG
Subjt:  KSRRWPSSPSRAGVHLGRSSPVWQVRRG-----GSTVKTSENHTDAHRSKTAT-ASTAASAASRGRVLNLNVPMCIGYRNHLSCRSDEASAIGVVGGGGG

Query:  GGSNGGGVDGGSVGNSGGP
          + G   +  +  N+  P
Subjt:  GGSNGGGVDGGSVGNSGGP


Sequences
CDS sequence
ATGGAGATTCCATCCCCAAATCCCCAATCGCATTCCCAAACCCTCTCGCCGGCCGCCGCCGGCAGTAGTAGCGGCAGCGGCAGCTTCCCCACCTCGCCGGAGTTCGAGTT
CTGGATGGTCCGAAACCCCTCTTTCCCTCAGCCCAATCTCCTCTCCGCCGACGAGCTCTTCGTCGACGGTGTTCTCCTTCCCCTTCATCTTCTCCCCAACCCGAAACCCG
ACCCGGATCCCACCGACGACGGCCCGAAATTGACGGCGAGTTCGGCCGAACAGGGCTGCTCGTTAACGCCGTCGAAGCGGTGGAGCATTTTCAAGAAGAGTGAGAAAAAG
AGTAATCAGGAGGAGAAGAAGAAGGAAAAGAAGACTGGAAATGGAAATGGGTCTTCATCGGCTGAGTTGAATATTAATCTCTGGCCGTTTTCCCGTAGCAGATCCGCCGG
TAATGCTTGTTCTAGACCCAAAATGTTCGCCGGCGGTCAACCGGGTTCCCGGAAGGTCAACAGTGCGCCGTGCTCTCGGAGCAACTCCGCCGGTGAATCCAAATCCAGGA
GGTGGCCGAGCAGCCCGAGCCGAGCCGGGGTCCATCTGGGCCGGAGTAGCCCGGTTTGGCAAGTCCGGCGAGGCGGATCTACCGTTAAAACCTCAGAAAATCACACAGAC
GCCCACCGGAGCAAGACCGCCACCGCTTCCACCGCCGCCTCCGCCGCATCAAGAGGTAGAGTCTTGAATTTGAATGTTCCTATGTGTATTGGGTATAGAAACCATTTGAG
CTGTAGAAGCGATGAAGCCAGTGCAATTGGGGTTGTTGGCGGCGGCGGCGGCGGTGGAAGCAACGGCGGCGGCGTAGACGGCGGCAGTGTTGGTAATTCTGGAGGACCAC
CAATTGTCACCCTGTTGGTGACTTTCTCTAGAAGATGA
mRNA sequence
ATGGAGATTCCATCCCCAAATCCCCAATCGCATTCCCAAACCCTCTCGCCGGCCGCCGCCGGCAGTAGTAGCGGCAGCGGCAGCTTCCCCACCTCGCCGGAGTTCGAGTT
CTGGATGGTCCGAAACCCCTCTTTCCCTCAGCCCAATCTCCTCTCCGCCGACGAGCTCTTCGTCGACGGTGTTCTCCTTCCCCTTCATCTTCTCCCCAACCCGAAACCCG
ACCCGGATCCCACCGACGACGGCCCGAAATTGACGGCGAGTTCGGCCGAACAGGGCTGCTCGTTAACGCCGTCGAAGCGGTGGAGCATTTTCAAGAAGAGTGAGAAAAAG
AGTAATCAGGAGGAGAAGAAGAAGGAAAAGAAGACTGGAAATGGAAATGGGTCTTCATCGGCTGAGTTGAATATTAATCTCTGGCCGTTTTCCCGTAGCAGATCCGCCGG
TAATGCTTGTTCTAGACCCAAAATGTTCGCCGGCGGTCAACCGGGTTCCCGGAAGGTCAACAGTGCGCCGTGCTCTCGGAGCAACTCCGCCGGTGAATCCAAATCCAGGA
GGTGGCCGAGCAGCCCGAGCCGAGCCGGGGTCCATCTGGGCCGGAGTAGCCCGGTTTGGCAAGTCCGGCGAGGCGGATCTACCGTTAAAACCTCAGAAAATCACACAGAC
GCCCACCGGAGCAAGACCGCCACCGCTTCCACCGCCGCCTCCGCCGCATCAAGAGGTAGAGTCTTGAATTTGAATGTTCCTATGTGTATTGGGTATAGAAACCATTTGAG
CTGTAGAAGCGATGAAGCCAGTGCAATTGGGGTTGTTGGCGGCGGCGGCGGCGGTGGAAGCAACGGCGGCGGCGTAGACGGCGGCAGTGTTGGTAATTCTGGAGGACCAC
CAATTGTCACCCTGTTGGTGACTTTCTCTAGAAGATGA
Protein sequence
MEIPSPNPQSHSQTLSPAAAGSSSGSGSFPTSPEFEFWMVRNPSFPQPNLLSADELFVDGVLLPLHLLPNPKPDPDPTDDGPKLTASSAEQGCSLTPSKRWSIFKKSEKK
SNQEEKKKEKKTGNGNGSSSAELNINLWPFSRSRSAGNACSRPKMFAGGQPGSRKVNSAPCSRSNSAGESKSRRWPSSPSRAGVHLGRSSPVWQVRRGGSTVKTSENHTD
AHRSKTATASTAASAASRGRVLNLNVPMCIGYRNHLSCRSDEASAIGVVGGGGGGGSNGGGVDGGSVGNSGGPPIVTLLVTFSRR
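
The protein sequence can be cross-checked against the CDS by translating it. The following Python sketch does that under the assumption that the standard genetic code (NCBI translation table 1) applies; the `translate` function and the table constants are illustrative names, not something the database provides.

```python
from itertools import product

# Standard genetic code (NCBI table 1), with codons ordered by T, C, A, G
# at each position. Assumed here; the page does not name a translation table.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X marks an unknown codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First six codons of the CDS listed above.
print(translate("ATGGAGATTCCATCCCCA"))
```

Passing the full CDS (with or without line breaks) and comparing the result with the protein sequence above gives a quick consistency check between the two records.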