; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmoCh14G021780 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene IDCmoCh14G021780
OrganismCucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
DescriptionProtein of unknown function (DUF688)
Genome locationCmo_Chr14:15610496..15611913
RNA-Seq ExpressionCmoCh14G021780
SyntenyCmoCh14G021780
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7019077.1 Exportin-2, partial [Cucurbita argyrosperma subsp. argyrosperma]1.7e-10998.51Show/hide
Query:  PSLKIAAMESPDRSGMLTPPIHTSVSVPFRWEEEPGKPRFCFRTPNSSSATQTKCLELPPRLLLLDPKISKLPPPIPAHKGIFQFPKQSRGSFRYDKTQL
        P L +AAMESPDRSGMLTPPIHTSVSVPFRWEEEPGKPRFCFRTPNSSSATQTKCLELPPRLLLLDPKISKLPPPIPAHKGIFQFPKQSRGSFRYDKTQL
Subjt:  PSLKIAAMESPDRSGMLTPPIHTSVSVPFRWEEEPGKPRFCFRTPNSSSATQTKCLELPPRLLLLDPKISKLPPPIPAHKGIFQFPKQSRGSFRYDKTQL

Query:  GAMVLRKRGVLIEEWFCWLGKLSFGRKGEVGSAYGSVFPSSLEKENEDGVVAERSSSGMKVAETQKEGSFSSCLVKAKTEFWGSIGEGLKQINVPWKSKK
        GAMVLRKRGVLIEEWFCWLGKLSFGRKGEVGSAYGSVFPSSLEKENEDGVVAERSSSGMKVAETQKEGSFSSCLVKAKTEFWGSIGEGLKQINVPWKSKK
Subjt:  GAMVLRKRGVLIEEWFCWLGKLSFGRKGEVGSAYGSVFPSSLEKENEDGVVAERSSSGMKVAETQKEGSFSSCLVKAKTEFWGSIGEGLKQINVPWKSKK

Query:  A
        A
Subjt:  A

XP_022924712.1 uncharacterized protein At4g00950-like [Cucurbita moschata]1.7e-10998.51Show/hide
Query:  PSLKIAAMESPDRSGMLTPPIHTSVSVPFRWEEEPGKPRFCFRTPNSSSATQTKCLELPPRLLLLDPKISKLPPPIPAHKGIFQFPKQSRGSFRYDKTQL
        P L +AAMESPDRSGMLTPPIHTSVSVPFRWEEEPGKPRFCFRTPNSSSATQTKCLELPPRLLLLDPKISKLPPPIPAHKGIFQFPKQSRGSFRYDKTQL
Subjt:  PSLKIAAMESPDRSGMLTPPIHTSVSVPFRWEEEPGKPRFCFRTPNSSSATQTKCLELPPRLLLLDPKISKLPPPIPAHKGIFQFPKQSRGSFRYDKTQL

Query:  GAMVLRKRGVLIEEWFCWLGKLSFGRKGEVGSAYGSVFPSSLEKENEDGVVAERSSSGMKVAETQKEGSFSSCLVKAKTEFWGSIGEGLKQINVPWKSKK
        GAMVLRKRGVLIEEWFCWLGKLSFGRKGEVGSAYGSVFPSSLEKENEDGVVAERSSSGMKVAETQKEGSFSSCLVKAKTEFWGSIGEGLKQINVPWKSKK
Subjt:  GAMVLRKRGVLIEEWFCWLGKLSFGRKGEVGSAYGSVFPSSLEKENEDGVVAERSSSGMKVAETQKEGSFSSCLVKAKTEFWGSIGEGLKQINVPWKSKK

Query:  A
        A
Subjt:  A

XP_022980847.1 uncharacterized protein At4g00950 [Cucurbita maxima]5.7e-10596.04Show/hide
Query:  PSLKIAAMESPDRSGMLTPPIHTSVSVPFRWEEEPGKPRFCFRTPNSSSATQTKCLELPPRLLLLDPKISKLPPPIPAHKGIFQFPKQSRGSFRYDKTQL
        P L +AAMESPDRSGMLTPPIHTSVSVPFRWEEEPGKPRFCFRTPNSSSATQTKCLELPPRLLLLDPKISKLPPPIPAHKGIFQ PKQS GSFRYDK+QL
Subjt:  PSLKIAAMESPDRSGMLTPPIHTSVSVPFRWEEEPGKPRFCFRTPNSSSATQTKCLELPPRLLLLDPKISKLPPPIPAHKGIFQFPKQSRGSFRYDKTQL

Query:  GAMVLRKRGVLIE-EWFCWLGKLSFGRKGEVGSAYGSVFPSSLEKENEDGVVAERSSSGMKVAETQKEGSFSSCLVKAKTEFWGSIGEGLKQINVPWKSK
        GAMVLRKRGVLIE EWFCWLGKLSFGRKGEVGSA GSVFPSSLEKENEDGVVAERSSSGMKVAETQKEGSFSSCLVKAKTEFWGSIGEGLKQINVPWKSK
Subjt:  GAMVLRKRGVLIE-EWFCWLGKLSFGRKGEVGSAYGSVFPSSLEKENEDGVVAERSSSGMKVAETQKEGSFSSCLVKAKTEFWGSIGEGLKQINVPWKSK

Query:  KA
        KA
Subjt:  KA

XP_023527385.1 uncharacterized protein At4g00950-like [Cucurbita pepo subsp. pepo]4.4e-10596.04Show/hide
Query:  PSLKIAAMESPDRSGMLTPPIHTSVSVPFRWEEEPGKPRFCFRTPNSSSATQTKCLELPPRLLLLDPKISKLPPPIPAHKGIFQFPKQSRGSFRYDKTQL
        P L +AAMESPDRSGMLTPPIHTSVSVPFRWEEEPGKPRFCFRTP+SSSATQTKCLELPPRLLLLDPKISKLPPPIPAHKGIFQFPKQS GSFRYDKTQL
Subjt:  PSLKIAAMESPDRSGMLTPPIHTSVSVPFRWEEEPGKPRFCFRTPNSSSATQTKCLELPPRLLLLDPKISKLPPPIPAHKGIFQFPKQSRGSFRYDKTQL

Query:  GAMVLRKRGVLIE-EWFCWLGKLSFGRKGEVGSAYGSVFPSSLEKENEDGVVAERSSSGMKVAETQKEGSFSSCLVKAKTEFWGSIGEGLKQINVPWKSK
        GAMVLRKRGVLIE EWFCWLGKLSFGRKGEVGSAYGSVFPSSLEKENED VVAERSSSGMKVAETQKEGSFSSCLVKAKTEFWGSIGEG KQINVPWKSK
Subjt:  GAMVLRKRGVLIE-EWFCWLGKLSFGRKGEVGSAYGSVFPSSLEKENEDGVVAERSSSGMKVAETQKEGSFSSCLVKAKTEFWGSIGEGLKQINVPWKSK

Query:  KA
        KA
Subjt:  KA

XP_038896089.1 uncharacterized protein At4g00950-like [Benincasa hispida]9.4e-8479.21Show/hide
Query:  PSLKIAAMESPDRSGMLTPPIHTSVSVPFRWEEEPGKPRFCFRTPNSSSATQTKCLELPPRLLLLDPKISKLPPPIPAHKGIFQFPKQSRGSFRYDKTQL
        P L IAAMESPDRSGMLTPP+HTSVSVPFRWEEEPGKPR CF +PN    TQT  LELPPRL L+DPKI KLPPPIP+ KG FQFPKQ  GSFR+ KTQL
Subjt:  PSLKIAAMESPDRSGMLTPPIHTSVSVPFRWEEEPGKPRFCFRTPNSSSATQTKCLELPPRLLLLDPKISKLPPPIPAHKGIFQFPKQSRGSFRYDKTQL

Query:  GAMVLRKRGVLIE-EWFCWLGKLSFGRKGEVGSAYGSVFPSSLEKENEDGVVAERSSSGMKVAETQKEGSFSSCLVKAKTEFWGSIGEGLKQINVPWKSK
        GAMVLRKRGVLIE EWFCWLGKL+FGRKGEVGSA+GSVFPSSL    EDG+VAERSSS MKVA T+K G+FSSCL++ K +FWGSIGEG KQIN+PWKSK
Subjt:  GAMVLRKRGVLIE-EWFCWLGKLSFGRKGEVGSAYGSVFPSSLEKENEDGVVAERSSSGMKVAETQKEGSFSSCLVKAKTEFWGSIGEGLKQINVPWKSK

Query:  KA
        KA
Subjt:  KA

TrEMBL top hitse value%identityAlignment
A0A0A0L338 Uncharacterized protein5.1e-7572.95Show/hide
Query:  PSLKIAAMESPDRSGMLTPPIHTSVSVPFRWEEEPGKPRFCFRTPN--SSSATQTKCLELPPRLLLLDPKISKLPPPIPAHKGIFQFPKQSRGSFRYDKT
        P L I AMESPDRSGMLTPP+++SVSVPFRWEEEPGKPRFCF + N  + + TQT  LELPPRLLL+DPKISKL PPIP+ KG FQF K    SFR+DKT
Subjt:  PSLKIAAMESPDRSGMLTPPIHTSVSVPFRWEEEPGKPRFCFRTPN--SSSATQTKCLELPPRLLLLDPKISKLPPPIPAHKGIFQFPKQSRGSFRYDKT

Query:  QLGAMVLRKRGVLIE-EWFCWLGKLSFGRKGEVGSAYGSVFPSSLEKENEDGVVAERSSSGMKVAETQKEGSFSSCLVKAKTEFW---GSIGEGLKQINV
        QLGAMVLRKRGVLIE EWFCWLGKLSF  KGEVGS YG+VFPSSL+K        E+SSS MKVA  QK GSFSSC V+AKTEFW   G+IGEG KQIN+
Subjt:  QLGAMVLRKRGVLIE-EWFCWLGKLSFGRKGEVGSAYGSVFPSSLEKENEDGVVAERSSSGMKVAETQKEGSFSSCLVKAKTEFW---GSIGEGLKQINV

Query:  PWKSKKA
        PWKSK+A
Subjt:  PWKSKKA

A0A5D3DBI9 Uncharacterized protein2.1e-7372.55Show/hide
Query:  PSLKIAAMESPDRSGMLTPPIHTSVSVPFRWEEEPGKPRFCFRTPN--SSSATQTKCLELPPRLLLLDPKISKLPPPIPAHKGIFQFPKQSRGSFRYDKT
        P L I AMESPDRSGMLTPPI +SVSVPFRWEEEPGKPRFCF + N  + + TQT  LELPPRLLL+DPKI KL P IP+ KG FQF K    SFR+ KT
Subjt:  PSLKIAAMESPDRSGMLTPPIHTSVSVPFRWEEEPGKPRFCFRTPN--SSSATQTKCLELPPRLLLLDPKISKLPPPIPAHKGIFQFPKQSRGSFRYDKT

Query:  QLGAMVLRKRGVLIE-EWFCWLGKLSFGRKGEVGSAYGSVFPSSLEKENEDGVVAERSSSGMKVAETQKEGSFSSCLVKAKTEFWGSIGEGLKQINVPWK
        QLGAMVLRKRG+LIE EWFCW  KL+F RKGEVGS Y +VFPSSL+K        ERSSS MKVA TQK GSFSSC V+AKTEFWG+IGEG KQIN+PWK
Subjt:  QLGAMVLRKRGVLIE-EWFCWLGKLSFGRKGEVGSAYGSVFPSSLEKENEDGVVAERSSSGMKVAETQKEGSFSSCLVKAKTEFWGSIGEGLKQINVPWK

Query:  SKKA
        SKKA
Subjt:  SKKA

A0A6J1D0Z4 uncharacterized protein At4g009501.0e-7573.4Show/hide
Query:  PSLKIAAMESPDRSGMLTPPIHTSVSVPFRWEEEPGKPRFCFRTPNSSSAT-QTKCLELPPRLLLLDPKISKLPPPIPAHKGIFQFPKQSRGSFRYDKTQ
        P L I AMESPDRSGM TPPI +SVSVPFRWEEEPGKPRFCF TPNS  +T Q KCLELPPRLLL++PKI KLPPPI AH+    F  Q  GSFR+DK Q
Subjt:  PSLKIAAMESPDRSGMLTPPIHTSVSVPFRWEEEPGKPRFCFRTPNSSSAT-QTKCLELPPRLLLLDPKISKLPPPIPAHKGIFQFPKQSRGSFRYDKTQ

Query:  LGAMVLRKRGVLIE-EWFCWLGKLSFGRKGEVGSAYGSVFPSSLEK-ENEDGVVAERSSSGMKVAETQKEGSFSSCLVKAKTEFWGSIGEGLKQINVPWK
        LGAMVLRKRGVL+E EWFCWLG  SFGR+G++GSA GSVFPSSLEK    +      SSS MK+A+T  EGSFSSCLV+AKTEFWGSIGEGLKQIN+PWK
Subjt:  LGAMVLRKRGVLIE-EWFCWLGKLSFGRKGEVGSAYGSVFPSSLEK-ENEDGVVAERSSSGMKVAETQKEGSFSSCLVKAKTEFWGSIGEGLKQINVPWK

Query:  SKK
         KK
Subjt:  SKK

A0A6J1E9R6 uncharacterized protein At4g00950-like8.3e-11098.51Show/hide
Query:  PSLKIAAMESPDRSGMLTPPIHTSVSVPFRWEEEPGKPRFCFRTPNSSSATQTKCLELPPRLLLLDPKISKLPPPIPAHKGIFQFPKQSRGSFRYDKTQL
        P L +AAMESPDRSGMLTPPIHTSVSVPFRWEEEPGKPRFCFRTPNSSSATQTKCLELPPRLLLLDPKISKLPPPIPAHKGIFQFPKQSRGSFRYDKTQL
Subjt:  PSLKIAAMESPDRSGMLTPPIHTSVSVPFRWEEEPGKPRFCFRTPNSSSATQTKCLELPPRLLLLDPKISKLPPPIPAHKGIFQFPKQSRGSFRYDKTQL

Query:  GAMVLRKRGVLIEEWFCWLGKLSFGRKGEVGSAYGSVFPSSLEKENEDGVVAERSSSGMKVAETQKEGSFSSCLVKAKTEFWGSIGEGLKQINVPWKSKK
        GAMVLRKRGVLIEEWFCWLGKLSFGRKGEVGSAYGSVFPSSLEKENEDGVVAERSSSGMKVAETQKEGSFSSCLVKAKTEFWGSIGEGLKQINVPWKSKK
Subjt:  GAMVLRKRGVLIEEWFCWLGKLSFGRKGEVGSAYGSVFPSSLEKENEDGVVAERSSSGMKVAETQKEGSFSSCLVKAKTEFWGSIGEGLKQINVPWKSKK

Query:  A
        A
Subjt:  A

A0A6J1ISE2 uncharacterized protein At4g009502.8e-10596.04Show/hide
Query:  PSLKIAAMESPDRSGMLTPPIHTSVSVPFRWEEEPGKPRFCFRTPNSSSATQTKCLELPPRLLLLDPKISKLPPPIPAHKGIFQFPKQSRGSFRYDKTQL
        P L +AAMESPDRSGMLTPPIHTSVSVPFRWEEEPGKPRFCFRTPNSSSATQTKCLELPPRLLLLDPKISKLPPPIPAHKGIFQ PKQS GSFRYDK+QL
Subjt:  PSLKIAAMESPDRSGMLTPPIHTSVSVPFRWEEEPGKPRFCFRTPNSSSATQTKCLELPPRLLLLDPKISKLPPPIPAHKGIFQFPKQSRGSFRYDKTQL

Query:  GAMVLRKRGVLIE-EWFCWLGKLSFGRKGEVGSAYGSVFPSSLEKENEDGVVAERSSSGMKVAETQKEGSFSSCLVKAKTEFWGSIGEGLKQINVPWKSK
        GAMVLRKRGVLIE EWFCWLGKLSFGRKGEVGSA GSVFPSSLEKENEDGVVAERSSSGMKVAETQKEGSFSSCLVKAKTEFWGSIGEGLKQINVPWKSK
Subjt:  GAMVLRKRGVLIE-EWFCWLGKLSFGRKGEVGSAYGSVFPSSLEKENEDGVVAERSSSGMKVAETQKEGSFSSCLVKAKTEFWGSIGEGLKQINVPWKSK

Query:  KA
        KA
Subjt:  KA

SwissProt top hitse value%identityAlignment
Q9M160 Uncharacterized protein At4g009504.3e-1530.24Show/hide
Query:  TMDNGQWTSPSLKIAAMESPDRSGMLTPPIHTSV--SVPFRWEEEPGKPRFCFRTPNSSSATQT------------KCLELPPRLLLLDP---KISKLPP
        T   G  T   L +   +    S  ++ PIH+S+  SVPF WEEEPGKP+    + +SSS++              K LELPPRL LL+     ++KL  
Subjt:  TMDNGQWTSPSLKIAAMESPDRSGMLTPPIHTSV--SVPFRWEEEPGKPRFCFRTPNSSSATQT------------KCLELPPRLLLLDP---KISKLPP

Query:  PIPAHKGIFQFPKQSR-----------------GSFRYD------KTQLGA-----------MVLRKRGVLIEEWFCWLGKLSFGR----KGEVGSAYGS
        PI    G +      R                 GSFR D        ++G+            V++KRG         LG   F R    KG+     GS
Subjt:  PIPAHKGIFQFPKQSR-----------------GSFRYD------KTQLGA-----------MVLRKRGVLIEEWFCWLGKLSFGR----KGEVGSAYGS

Query:  -VFPSSLEKENE--------------------DGVVAERSS--SGMKVAETQKEGSFSSCLV----KAKTEFWGSIGEGLKQINVPWKSKK
         VFPSS+++E+E                    DG+   +SS    +K++   + GSFS+        +K+ FW ++  GLKQ+ VPWKSKK
Subjt:  -VFPSSLEKENE--------------------DGVVAERSS--SGMKVAETQKEGSFSSCLV----KAKTEFWGSIGEGLKQINVPWKSKK

Arabidopsis top hitse value%identityAlignment
AT2G46535.1 unknown protein7.9e-1228.8Show/hide
Query:  SPDRSGMLTPPIHTSVSVPFRWEEEPGKPRFCFRTPNSSSATQTKCLELPPRLLLLDPKISKLPPPIPAHKGIFQFPKQSRGSFRYDKTQLGAMVLRKRG
        SP    +   PIHT  SVPF WE++PGKP+   R       +  KCL+LPPRLLL            P        P++  G  R+         LR++G
Subjt:  SPDRSGMLTPPIHTSVSVPFRWEEEPGKPRFCFRTPNSSSATQTKCLELPPRLLLLDPKISKLPPPIPAHKGIFQFPKQSRGSFRYDKTQLGAMVLRKRG

Query:  VLIEEWFCWLGKLSFGRKGEVGSAYGSVFPSSLEKENEDGVVAERSSSGMKVAETQKEGSFSSCLVKAKTEFWGSIGEGLKQINVPWKSKK
                         +G+V      VF S  ++  ++      + + MK+ +  + GS+        + FWGS+ +GLK + +PWK+KK
Subjt:  VLIEEWFCWLGKLSFGRKGEVGSAYGSVFPSSLEKENEDGVVAERSSSGMKVAETQKEGSFSSCLVKAKTEFWGSIGEGLKQINVPWKSKK

AT3G61840.1 Protein of unknown function (DUF688)9.0e-0841.33Show/hide
Query:  TSPSLKIAAMESPDRSGMLTPPIHTSVSVPFRWEEEPGKPRFCFRTPNSSSATQTKCLELPPRLLLLDPKISKLP
        T+  + ++ M SPD S        +  S+PF WEEEPGKP+   R P+ S     KCL+LPPR+L  D + +K+P
Subjt:  TSPSLKIAAMESPDRSGMLTPPIHTSVSVPFRWEEEPGKPRFCFRTPNSSSATQTKCLELPPRLLLLDPKISKLP

AT4G00950.1 Protein of unknown function (DUF688)3.1e-1630.24Show/hide
Query:  TMDNGQWTSPSLKIAAMESPDRSGMLTPPIHTSV--SVPFRWEEEPGKPRFCFRTPNSSSATQT------------KCLELPPRLLLLDP---KISKLPP
        T   G  T   L +   +    S  ++ PIH+S+  SVPF WEEEPGKP+    + +SSS++              K LELPPRL LL+     ++KL  
Subjt:  TMDNGQWTSPSLKIAAMESPDRSGMLTPPIHTSV--SVPFRWEEEPGKPRFCFRTPNSSSATQT------------KCLELPPRLLLLDP---KISKLPP

Query:  PIPAHKGIFQFPKQSR-----------------GSFRYD------KTQLGA-----------MVLRKRGVLIEEWFCWLGKLSFGR----KGEVGSAYGS
        PI    G +      R                 GSFR D        ++G+            V++KRG         LG   F R    KG+     GS
Subjt:  PIPAHKGIFQFPKQSR-----------------GSFRYD------KTQLGA-----------MVLRKRGVLIEEWFCWLGKLSFGR----KGEVGSAYGS

Query:  -VFPSSLEKENE--------------------DGVVAERSS--SGMKVAETQKEGSFSSCLV----KAKTEFWGSIGEGLKQINVPWKSKK
         VFPSS+++E+E                    DG+   +SS    +K++   + GSFS+        +K+ FW ++  GLKQ+ VPWKSKK
Subjt:  -VFPSSLEKENE--------------------DGVVAERSS--SGMKVAETQKEGSFSSCLV----KAKTEFWGSIGEGLKQINVPWKSKK

AT4G27810.1 unknown protein3.1e-0825.84Show/hide
Query:  PSLKIAAMESPDRSGMLTPPIHTSVSVPFRWEEEPGKPRFCFRT-PNSSSATQ--------TKCLELPPRLLLLDPKISKLPPPIPAHKGIFQFPKQSRG
        P   I    + D  G+ TPP++ + SVPF WEE PGKPR      P +S   +         +CLELPPRL          P P     G +  P++S  
Subjt:  PSLKIAAMESPDRSGMLTPPIHTSVSVPFRWEEEPGKPRFCFRT-PNSSSATQ--------TKCLELPPRLLLLDPKISKLPPPIPAHKGIFQFPKQSRG

Query:  SFRYDKTQLGAMVLRKRGVLIEEWFCWLGKLSFGRKGEVGSAYGSVFPSSLEKENEDGVVAERSSSGMKVAETQKEGSFSSCLVKAKTEFWGSIGEGLKQ
                                      LS  R+ E  S     F  S      DG       + +K++  +++GS  + L  +K++F   + +G KQ
Subjt:  SFRYDKTQLGAMVLRKRGVLIEEWFCWLGKLSFGRKGEVGSAYGSVFPSSLEKENEDGVVAERSSSGMKVAETQKEGSFSSCLVKAKTEFWGSIGEGLKQ

Query:  INVPWKSKK
        + +PW+ ++
Subjt:  INVPWKSKK

AT5G53030.1 unknown protein2.5e-1029.81Show/hide
Query:  GMLTPPIHTSVSVPFRWEEEPGKPRFCFRTPNSSSATQTKCLELPPRLLLLDPKIS-KLPPPIPAHKGIFQFPKQSRGSFRYDKTQLGAMVLRK-RGVLI
        G+ TPP++ + SVPF WEE PGKPR   +    +     + LELPPRL+L     +   P P     G +   ++S    R       A V+RK RGV  
Subjt:  GMLTPPIHTSVSVPFRWEEEPGKPRFCFRTPNSSSATQTKCLELPPRLLLLDPKIS-KLPPPIPAHKGIFQFPKQSRGSFRYDKTQLGAMVLRK-RGVLI

Query:  ----EEWFCWLGKLSFGRKGEVGSAYGSVFPSSLEKENEDGVVAERSSSG------------MKVAETQKEGSFSSCLVKAKTEFW----GSIGEGLKQI
            +E     G   +G  G        +F  S  +  +DG    R  +G            +K+    K+GSF +     K++FW      + EG KQ+
Subjt:  ----EEWFCWLGKLSFGRKGEVGSAYGSVFPSSLEKENEDGVVAERSSSG------------MKVAETQKEGSFSSCLVKAKTEFW----GSIGEGLKQI

Query:  NVPWKSKK
         +PWK K+
Subjt:  NVPWKSKK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTGCATGCTGCAATGGCCGTGTGTGATGGGGGGAAGAGGAAGGACTTGTTTGGAACCATCTTCCAAGCAATGGACAATGGACAATGGACAATGGACAAGCCCCTCTCT
AAAAATTGCGGCAATGGAGTCTCCAGATCGCTCTGGAATGCTAACCCCACCAATCCACACCTCAGTGTCAGTCCCATTTCGATGGGAAGAGGAGCCTGGCAAACCCAGGT
TCTGCTTCAGAACCCCCAACAGTTCTTCAGCCACTCAAACAAAGTGCCTTGAACTTCCACCAAGATTGTTGCTATTGGATCCCAAAATCTCAAAACTCCCTCCTCCCATC
CCTGCACACAAGGGTATTTTTCAGTTTCCTAAGCAGAGTCGTGGGTCATTCAGATATGATAAAACTCAGCTTGGAGCCATGGTTCTGAGAAAGAGGGGAGTGCTCATAGA
GGAGTGGTTTTGTTGGTTGGGAAAATTGAGTTTCGGGCGCAAAGGGGAAGTTGGGTCTGCCTATGGAAGTGTATTTCCTTCTTCTTTAGAGAAGGAGAATGAAGATGGAG
TTGTTGCAGAGAGAAGCAGTTCAGGGATGAAGGTCGCAGAGACCCAGAAAGAAGGGAGCTTTTCTTCTTGTCTTGTCAAAGCCAAGACTGAGTTTTGGGGAAGCATAGGC
GAGGGGCTAAAACAGATCAACGTTCCATGGAAGAGCAAAAAAGCTTAA
mRNA sequenceShow/hide mRNA sequence
ATGTGCATGCTGCAATGGCCGTGTGTGATGGGGGGAAGAGGAAGGACTTGTTTGGAACCATCTTCCAAGCAATGGACAATGGACAATGGACAATGGACAAGCCCCTCTCT
AAAAATTGCGGCAATGGAGTCTCCAGATCGCTCTGGAATGCTAACCCCACCAATCCACACCTCAGTGTCAGTCCCATTTCGATGGGAAGAGGAGCCTGGCAAACCCAGGT
TCTGCTTCAGAACCCCCAACAGTTCTTCAGCCACTCAAACAAAGTGCCTTGAACTTCCACCAAGATTGTTGCTATTGGATCCCAAAATCTCAAAACTCCCTCCTCCCATC
CCTGCACACAAGGGTATTTTTCAGTTTCCTAAGCAGAGTCGTGGGTCATTCAGATATGATAAAACTCAGCTTGGAGCCATGGTTCTGAGAAAGAGGGGAGTGCTCATAGA
GGAGTGGTTTTGTTGGTTGGGAAAATTGAGTTTCGGGCGCAAAGGGGAAGTTGGGTCTGCCTATGGAAGTGTATTTCCTTCTTCTTTAGAGAAGGAGAATGAAGATGGAG
TTGTTGCAGAGAGAAGCAGTTCAGGGATGAAGGTCGCAGAGACCCAGAAAGAAGGGAGCTTTTCTTCTTGTCTTGTCAAAGCCAAGACTGAGTTTTGGGGAAGCATAGGC
GAGGGGCTAAAACAGATCAACGTTCCATGGAAGAGCAAAAAAGCTTAAGAAAGAGCATCGTAGCTCGTTGGTAATATGAAGGACTGATGGTTTAGTTAATGGCTGAGATC
TTACATACTTGACATTCTTGTACAGAAATAATGTTAAGACTTCTATGCGTTGAATGATCATTTGTTTTCGACAATTTTCAAGAAATGGAAAACATCAGAGTAACA
Protein sequenceShow/hide protein sequence
MCMLQWPCVMGGRGRTCLEPSSKQWTMDNGQWTSPSLKIAAMESPDRSGMLTPPIHTSVSVPFRWEEEPGKPRFCFRTPNSSSATQTKCLELPPRLLLLDPKISKLPPPI
PAHKGIFQFPKQSRGSFRYDKTQLGAMVLRKRGVLIEEWFCWLGKLSFGRKGEVGSAYGSVFPSSLEKENEDGVVAERSSSGMKVAETQKEGSFSSCLVKAKTEFWGSIG
EGLKQINVPWKSKKA