CuGenDBv2

Clc05G01120 (gene) of Watermelon (cordophanus) v2 genome

Gene ID: Clc05G01120
Organism: Citrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
Description: Protein of unknown function (DUF688)
Genome location: ClcChr05:769750..770886
RNA-Seq Expression: Clc05G01120
Synteny: Clc05G01120
Gene Ontology terms: NA
InterPro domains: NA
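
As a minimal sketch, the genome location string in the record above can be split into a chromosome name and coordinates. The function name `parse_location` is hypothetical, and the span calculation assumes 1-based inclusive coordinates, which this page does not state.

```python
import re

def parse_location(location):
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)

chrom, start, end = parse_location("ClcChr05:769750..770886")
# Assumption: coordinates are 1-based and inclusive.
span = end - start + 1
print(chrom, start, end, span)
```

Under that assumption the gene spans 1137 bp on ClcChr05.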


Homology
GenBank top hits (e value, % identity, alignment)
KAG7019077.1 Exportin-2, partial [Cucurbita argyrosperma subsp. argyrosperma] (e value: 8.1e-91; identity: 76.86%)
Query:  ERETE----GEMGSEAKLWQNSEVPKLPLLAIAAMESPDRSGMLTPPIHTSVSVPFRWEEEPGKPRFCFTTPN--TVTQTNCLELPPRLLLIDPK----T
        +R+T+    GEMGSEAKLW+NS+VPKLPLLA+AAMESPDRSGMLTPPIHTSVSVPFRWEEEPGKPRFCF TPN  + TQT CLELPPRLLL+DPK     
Subjt:  ERETE----GEMGSEAKLWQNSEVPKLPLLAIAAMESPDRSGMLTPPIHTSVSVPFRWEEEPGKPRFCFTTPN--TVTQTNCLELPPRLLLIDPK----T

Query:  PKLPSQKPPFQFHNHCCGSLRFHKSQLGAMVLRKRGLLIEKEWFCWLGKLSFGRKGEVGSAFGSVFPSSL----EDAIVAERSSSRMKVAVTQKDGSFSS
        P +P+ K  FQF     GS R+ K+QLGAMVLRKRG+LIE EWFCWLGKLSFGRKGEVGSA+GSVFPSSL    ED +VAERSSS MKVA TQK+GSFSS
Subjt:  PKLPSQKPPFQFHNHCCGSLRFHKSQLGAMVLRKRGLLIEKEWFCWLGKLSFGRKGEVGSAFGSVFPSSL----EDAIVAERSSSRMKVAVTQKDGSFSS

Query:  CLLQPKTEFWGSIGEGLKQINIPWKSKKA
        CL++ KTEFWGSIGEGLKQIN+PWKSKKA
Subjt:  CLLQPKTEFWGSIGEGLKQINIPWKSKKA

XP_022924712.1 uncharacterized protein At4g00950-like [Cucurbita moschata] (e value: 1.2e-89; identity: 78.9%)
Query:  MGSEAKLWQNSEVPKLPLLAIAAMESPDRSGMLTPPIHTSVSVPFRWEEEPGKPRFCFTTPN--TVTQTNCLELPPRLLLIDPK----TPKLPSQKPPFQ
        MGSEAKLW+NS+VPKLPLLA+AAMESPDRSGMLTPPIHTSVSVPFRWEEEPGKPRFCF TPN  + TQT CLELPPRLLL+DPK     P +P+ K  FQ
Subjt:  MGSEAKLWQNSEVPKLPLLAIAAMESPDRSGMLTPPIHTSVSVPFRWEEEPGKPRFCFTTPN--TVTQTNCLELPPRLLLIDPK----TPKLPSQKPPFQ

Query:  FHNHCCGSLRFHKSQLGAMVLRKRGLLIEKEWFCWLGKLSFGRKGEVGSAFGSVFPSSL----EDAIVAERSSSRMKVAVTQKDGSFSSCLLQPKTEFWG
        F     GS R+ K+QLGAMVLRKRG+LIE EWFCWLGKLSFGRKGEVGSA+GSVFPSSL    ED +VAERSSS MKVA TQK+GSFSSCL++ KTEFWG
Subjt:  FHNHCCGSLRFHKSQLGAMVLRKRGLLIEKEWFCWLGKLSFGRKGEVGSAFGSVFPSSL----EDAIVAERSSSRMKVAVTQKDGSFSSCLLQPKTEFWG

Query:  SIGEGLKQINIPWKSKKA
        SIGEGLKQIN+PWKSKKA
Subjt:  SIGEGLKQINIPWKSKKA

XP_022980847.1 uncharacterized protein At4g00950 [Cucurbita maxima] (e value: 1.5e-92; identity: 80.28%)
Query:  MGSEAKLWQNSEVPKLPLLAIAAMESPDRSGMLTPPIHTSVSVPFRWEEEPGKPRFCFTTPN--TVTQTNCLELPPRLLLIDPK----TPKLPSQKPPFQ
        MGSEAKLW+NSEVPKLPLLA+AAMESPDRSGMLTPPIHTSVSVPFRWEEEPGKPRFCF TPN  + TQT CLELPPRLLL+DPK     P +P+ K  FQ
Subjt:  MGSEAKLWQNSEVPKLPLLAIAAMESPDRSGMLTPPIHTSVSVPFRWEEEPGKPRFCFTTPN--TVTQTNCLELPPRLLLIDPK----TPKLPSQKPPFQ

Query:  FHNHCCGSLRFHKSQLGAMVLRKRGLLIEKEWFCWLGKLSFGRKGEVGSAFGSVFPSSL----EDAIVAERSSSRMKVAVTQKDGSFSSCLLQPKTEFWG
             CGS R+ KSQLGAMVLRKRG+LIEKEWFCWLGKLSFGRKGEVGSA GSVFPSSL    ED +VAERSSS MKVA TQK+GSFSSCL++ KTEFWG
Subjt:  FHNHCCGSLRFHKSQLGAMVLRKRGLLIEKEWFCWLGKLSFGRKGEVGSAFGSVFPSSL----EDAIVAERSSSRMKVAVTQKDGSFSSCLLQPKTEFWG

Query:  SIGEGLKQINIPWKSKKA
        SIGEGLKQIN+PWKSKKA
Subjt:  SIGEGLKQINIPWKSKKA

XP_023527385.1 uncharacterized protein At4g00950-like [Cucurbita pepo subsp. pepo] (e value: 1.5e-92; identity: 79.82%)
Query:  MGSEAKLWQNSEVPKLPLLAIAAMESPDRSGMLTPPIHTSVSVPFRWEEEPGKPRFCFTTP--NTVTQTNCLELPPRLLLIDPK----TPKLPSQKPPFQ
        MGSEAKLW+NSEVPKLPLLA+AAMESPDRSGMLTPPIHTSVSVPFRWEEEPGKPRFCF TP  ++ TQT CLELPPRLLL+DPK     P +P+ K  FQ
Subjt:  MGSEAKLWQNSEVPKLPLLAIAAMESPDRSGMLTPPIHTSVSVPFRWEEEPGKPRFCFTTP--NTVTQTNCLELPPRLLLIDPK----TPKLPSQKPPFQ

Query:  FHNHCCGSLRFHKSQLGAMVLRKRGLLIEKEWFCWLGKLSFGRKGEVGSAFGSVFPSSL----EDAIVAERSSSRMKVAVTQKDGSFSSCLLQPKTEFWG
        F    CGS R+ K+QLGAMVLRKRG+LIEKEWFCWLGKLSFGRKGEVGSA+GSVFPSSL    EDA+VAERSSS MKVA TQK+GSFSSCL++ KTEFWG
Subjt:  FHNHCCGSLRFHKSQLGAMVLRKRGLLIEKEWFCWLGKLSFGRKGEVGSAFGSVFPSSL----EDAIVAERSSSRMKVAVTQKDGSFSSCLLQPKTEFWG

Query:  SIGEGLKQINIPWKSKKA
        SIGEG KQIN+PWKSKKA
Subjt:  SIGEGLKQINIPWKSKKA

XP_038896089.1 uncharacterized protein At4g00950-like [Benincasa hispida] (e value: 1.5e-100; identity: 84.47%)
Query:  ERETEGEMGSEAKLWQNSEVPKLPLLAIAAMESPDRSGMLTPPIHTSVSVPFRWEEEPGKPRFCFTTPNTVTQTNCLELPPRLLLIDPKTPKL----PSQ
        +RETEGEMGSE K W NSEVPKLPLLAIAAMESPDRSGMLTPP+HTSVSVPFRWEEEPGKPR CFT+PN  TQTN LELPPRL L+DPK PKL    PSQ
Subjt:  ERETEGEMGSEAKLWQNSEVPKLPLLAIAAMESPDRSGMLTPPIHTSVSVPFRWEEEPGKPRFCFTTPNTVTQTNCLELPPRLLLIDPKTPKL----PSQ

Query:  KPPFQFHNHCCGSLRFHKSQLGAMVLRKRGLLIEKEWFCWLGKLSFGRKGEVGSAFGSVFPSSLEDAIVAERSSSRMKVAVTQKDGSFSSCLLQPKTEFW
        K  FQF   C GS RFHK+QLGAMVLRKRG+LIEKEWFCWLGKL+FGRKGEVGSAFGSVFPSSLED IVAERSSSRMKVAVT+K G+FSSCLLQPK +FW
Subjt:  KPPFQFHNHCCGSLRFHKSQLGAMVLRKRGLLIEKEWFCWLGKLSFGRKGEVGSAFGSVFPSSLEDAIVAERSSSRMKVAVTQKDGSFSSCLLQPKTEFW

Query:  GSIGEGLKQINIPWKSKKA
        GSIGEG KQINIPWKSKKA
Subjt:  GSIGEGLKQINIPWKSKKA

TrEMBL top hits (e value, % identity, alignment)
A0A5A7U0F3 Uncharacterized protein (e value: 8.5e-86; identity: 77.31%)
Query:  MGSEAKLWQNSEVPKLPLLAIAAMESPDRSGMLTPPIHTSVSVPFRWEEEPGKPRFCFTTPN----TVTQTNCLELPPRLLLIDPKTPKL----PSQKPP
        M SEAKL +N EVPKLPLLAI AMESPDRSGMLTPPI +SVSVPFRWEEEPGKPRFCF + N    T TQTN LELPPRLLL+DPK PKL    PSQK  
Subjt:  MGSEAKLWQNSEVPKLPLLAIAAMESPDRSGMLTPPIHTSVSVPFRWEEEPGKPRFCFTTPN----TVTQTNCLELPPRLLLIDPKTPKL----PSQKPP

Query:  FQFHNHCCGSLRFHKSQLGAMVLRKRGLLIEKEWFCWLGKLSFGRKGEVGSAFGSVFPSSLEDAIVAERSSSRMKVAVTQKDGSFSSCLLQPKTEFWGSI
        FQFH HCC S RFHK+QLGAMVLRKRG+LIEKEWFCW  KL+F RKGEVGS + +VFPSSL+     ERSSSRMKVAVTQK GSFSSC +Q KTEFWG+I
Subjt:  FQFHNHCCGSLRFHKSQLGAMVLRKRGLLIEKEWFCWLGKLSFGRKGEVGSAFGSVFPSSLEDAIVAERSSSRMKVAVTQKDGSFSSCLLQPKTEFWGSI

Query:  GEGLKQINIPWKSKKA
        GEG KQINIPWKSKKA
Subjt:  GEGLKQINIPWKSKKA

A0A5D3DBI9 Uncharacterized protein (e value: 1.0e-86; identity: 77.78%)
Query:  MGSEAKLWQNSEVPKLPLLAIAAMESPDRSGMLTPPIHTSVSVPFRWEEEPGKPRFCFTTPN----TVTQTNCLELPPRLLLIDPKTPKL----PSQKPP
        MGSEAKL +N EVPKLPLLAI AMESPDRSGMLTPPI +SVSVPFRWEEEPGKPRFCF + N    T TQTN LELPPRLLL+DPK PKL    PSQK  
Subjt:  MGSEAKLWQNSEVPKLPLLAIAAMESPDRSGMLTPPIHTSVSVPFRWEEEPGKPRFCFTTPN----TVTQTNCLELPPRLLLIDPKTPKL----PSQKPP

Query:  FQFHNHCCGSLRFHKSQLGAMVLRKRGLLIEKEWFCWLGKLSFGRKGEVGSAFGSVFPSSLEDAIVAERSSSRMKVAVTQKDGSFSSCLLQPKTEFWGSI
        FQFH HCC S RFHK+QLGAMVLRKRG+LIEKEWFCW  KL+F RKGEVGS + +VFPSSL+     ERSSSRMKVAVTQK GSFSSC +Q KTEFWG+I
Subjt:  FQFHNHCCGSLRFHKSQLGAMVLRKRGLLIEKEWFCWLGKLSFGRKGEVGSAFGSVFPSSLEDAIVAERSSSRMKVAVTQKDGSFSSCLLQPKTEFWGSI

Query:  GEGLKQINIPWKSKKA
        GEG KQINIPWKSKKA
Subjt:  GEGLKQINIPWKSKKA

A0A6J1D0Z4 uncharacterized protein At4g00950 (e value: 3.6e-84; identity: 73.06%)
Query:  MGSEAKLWQNSEVPKLPLLAIAAMESPDRSGMLTPPIHTSVSVPFRWEEEPGKPRFCFTTPN---TVTQTNCLELPPRLLLIDPKTPKLP----SQKPPF
        MGSEAKLW+NSE+PKLPLLAI AMESPDRSGM TPPI +SVSVPFRWEEEPGKPRFCFTTPN   +  Q  CLELPPRLLL++PK PKLP    + +P  
Subjt:  MGSEAKLWQNSEVPKLPLLAIAAMESPDRSGMLTPPIHTSVSVPFRWEEEPGKPRFCFTTPN---TVTQTNCLELPPRLLLIDPKTPKLP----SQKPPF

Query:  QFHNHCCGSLRFHKSQLGAMVLRKRGLLIEKEWFCWLGKLSFGRKGEVGSAFGSVFPSSLEDAIVAER-----SSSRMKVAVTQKDGSFSSCLLQPKTEF
         F N CCGS RF K QLGAMVLRKRG+L+EKEWFCWLG  SFGR+G++GSA GSVFPSSLE    +E      SSSRMK+A T  +GSFSSCL+Q KTEF
Subjt:  QFHNHCCGSLRFHKSQLGAMVLRKRGLLIEKEWFCWLGKLSFGRKGEVGSAFGSVFPSSLEDAIVAER-----SSSRMKVAVTQKDGSFSSCLLQPKTEF

Query:  WGSIGEGLKQINIPWKSKK
        WGSIGEGLKQINIPWK KK
Subjt:  WGSIGEGLKQINIPWKSKK

A0A6J1E9R6 uncharacterized protein At4g00950-like (e value: 5.7e-90; identity: 78.9%)
Query:  MGSEAKLWQNSEVPKLPLLAIAAMESPDRSGMLTPPIHTSVSVPFRWEEEPGKPRFCFTTPN--TVTQTNCLELPPRLLLIDPK----TPKLPSQKPPFQ
        MGSEAKLW+NS+VPKLPLLA+AAMESPDRSGMLTPPIHTSVSVPFRWEEEPGKPRFCF TPN  + TQT CLELPPRLLL+DPK     P +P+ K  FQ
Subjt:  MGSEAKLWQNSEVPKLPLLAIAAMESPDRSGMLTPPIHTSVSVPFRWEEEPGKPRFCFTTPN--TVTQTNCLELPPRLLLIDPK----TPKLPSQKPPFQ

Query:  FHNHCCGSLRFHKSQLGAMVLRKRGLLIEKEWFCWLGKLSFGRKGEVGSAFGSVFPSSL----EDAIVAERSSSRMKVAVTQKDGSFSSCLLQPKTEFWG
        F     GS R+ K+QLGAMVLRKRG+LIE EWFCWLGKLSFGRKGEVGSA+GSVFPSSL    ED +VAERSSS MKVA TQK+GSFSSCL++ KTEFWG
Subjt:  FHNHCCGSLRFHKSQLGAMVLRKRGLLIEKEWFCWLGKLSFGRKGEVGSAFGSVFPSSL----EDAIVAERSSSRMKVAVTQKDGSFSSCLLQPKTEFWG

Query:  SIGEGLKQINIPWKSKKA
        SIGEGLKQIN+PWKSKKA
Subjt:  SIGEGLKQINIPWKSKKA

A0A6J1ISE2 uncharacterized protein At4g00950 (e value: 7.2e-93; identity: 80.28%)
Query:  MGSEAKLWQNSEVPKLPLLAIAAMESPDRSGMLTPPIHTSVSVPFRWEEEPGKPRFCFTTPN--TVTQTNCLELPPRLLLIDPK----TPKLPSQKPPFQ
        MGSEAKLW+NSEVPKLPLLA+AAMESPDRSGMLTPPIHTSVSVPFRWEEEPGKPRFCF TPN  + TQT CLELPPRLLL+DPK     P +P+ K  FQ
Subjt:  MGSEAKLWQNSEVPKLPLLAIAAMESPDRSGMLTPPIHTSVSVPFRWEEEPGKPRFCFTTPN--TVTQTNCLELPPRLLLIDPK----TPKLPSQKPPFQ

Query:  FHNHCCGSLRFHKSQLGAMVLRKRGLLIEKEWFCWLGKLSFGRKGEVGSAFGSVFPSSL----EDAIVAERSSSRMKVAVTQKDGSFSSCLLQPKTEFWG
             CGS R+ KSQLGAMVLRKRG+LIEKEWFCWLGKLSFGRKGEVGSA GSVFPSSL    ED +VAERSSS MKVA TQK+GSFSSCL++ KTEFWG
Subjt:  FHNHCCGSLRFHKSQLGAMVLRKRGLLIEKEWFCWLGKLSFGRKGEVGSAFGSVFPSSL----EDAIVAERSSSRMKVAVTQKDGSFSSCLLQPKTEFWG

Query:  SIGEGLKQINIPWKSKKA
        SIGEGLKQIN+PWKSKKA
Subjt:  SIGEGLKQINIPWKSKKA

SwissProt top hits (e value, % identity, alignment)
Q9M160 Uncharacterized protein At4g00950 (e value: 1.2e-04; identity: 35.92%)
Query:  ERETEGEMGSEAKLWQNSEVPKLPLLAIAAMESPDRSGMLTPPIHTSV--SVPFRWEEEPGKPRFCFTTPNTVTQTN--------------CLELPPRLL
        E+ETE E         N  V KLP+L     +    S  ++ PIH+S+  SVPF WEEEPGKP+   T+ ++ + ++               LELPPRL 
Subjt:  ERETEGEMGSEAKLWQNSEVPKLPLLAIAAMESPDRSGMLTPPIHTSV--SVPFRWEEEPGKPRFCFTTPNTVTQTN--------------CLELPPRLL

Query:  LID
        L++
Subjt:  LID

Arabidopsis top hits (e value, % identity, alignment)
AT2G46535.1 unknown protein (e value: 4.0e-11; identity: 31.69%)
Query:  SPDRSGMLTPPIHTSVSVPFRWEEEPGKPRFCFTTPNTVTQTNCLELPPRLLLIDPKTP-KLPSQKPPFQFHNHCCGSLRFHKSQLGAMVLRKRGLLIEK
        SP    +   PIHT  SVPF WE++PGKP+        ++   CL+LPPRLLL    T   LP +K          G LRF         LR++G     
Subjt:  SPDRSGMLTPPIHTSVSVPFRWEEEPGKPRFCFTTPNTVTQTNCLELPPRLLLIDPKTP-KLPSQKPPFQFHNHCCGSLRFHKSQLGAMVLRKRGLLIEK

Query:  EWFCWLGKLSFGRKGEVGSAFGSVFPSSLEDAIVAERSSSRMKVAVTQKDGSFSSCLLQPKTEFWGSIGEGLKQINIPWKSKK
                     +G+V      VF S  + A     + + MK+    + GS+        + FWGS+ +GLK + +PWK+KK
Subjt:  EWFCWLGKLSFGRKGEVGSAFGSVFPSSLEDAIVAERSSSRMKVAVTQKDGSFSSCLLQPKTEFWGSIGEGLKQINIPWKSKK

AT3G61840.1 Protein of unknown function (DUF688) (e value: 6.6e-06; identity: 40.58%)
Query:  LAIAAMESPDRSGMLTPPIHTSVSVPFRWEEEPGKPRFCFTTPNTVTQTNCLELPPRLLLIDPKTPKLP
        ++++ M SPD S        +  S+PF WEEEPGKP+     P   + + CL+LPPR+L  D  T K+P
Subjt:  LAIAAMESPDRSGMLTPPIHTSVSVPFRWEEEPGKPRFCFTTPNTVTQTNCLELPPRLLLIDPKTPKLP

AT4G00950.1 Protein of unknown function (DUF688) (e value: 8.6e-06; identity: 35.92%)
Query:  ERETEGEMGSEAKLWQNSEVPKLPLLAIAAMESPDRSGMLTPPIHTSV--SVPFRWEEEPGKPRFCFTTPNTVTQTN--------------CLELPPRLL
        E+ETE E         N  V KLP+L     +    S  ++ PIH+S+  SVPF WEEEPGKP+   T+ ++ + ++               LELPPRL 
Subjt:  ERETEGEMGSEAKLWQNSEVPKLPLLAIAAMESPDRSGMLTPPIHTSV--SVPFRWEEEPGKPRFCFTTPNTVTQTN--------------CLELPPRLL

Query:  LID
        L++
Subjt:  LID

AT4G27810.1 unknown protein (e value: 2.8e-09; identity: 25.85%)
Query:  PKLPLLAIAAMESPDRSGMLTPPIHTSVSVPFRWEEEPGKPRFCFTTPNTVTQTN-----------CLELPPRLLLIDPKTPKLPSQKPPFQFHNHCCGS
        PKLPL +I    + D  G+ TPP++ + SVPF WEE PGKPR         ++ N           CLELPPRL          P+   P          
Subjt:  PKLPLLAIAAMESPDRSGMLTPPIHTSVSVPFRWEEEPGKPRFCFTTPNTVTQTN-----------CLELPPRLLLIDPKTPKLPSQKPPFQFHNHCCGS

Query:  LRFHKSQLGAMVLRKRGLLIEKEWFCWLGKLSFGRKGEVGSAFGSVFPSSLEDAIVAERSSSRMKVAVTQKDGSFSSCLLQPKTEFWGSIGEGLKQINIP
                          +++  +      LS  R+ E  S     F  S           + +K++  ++ GS  + L   K++F   + +G KQ+ IP
Subjt:  LRFHKSQLGAMVLRKRGLLIEKEWFCWLGKLSFGRKGEVGSAFGSVFPSSLEDAIVAERSSSRMKVAVTQKDGSFSSCLLQPKTEFWGSIGEGLKQINIP

Query:  WKSKK
        W+ ++
Subjt:  WKSKK

AT5G53030.1 unknown protein (e value: 2.4e-08; identity: 30.45%)
Query:  PLLAIAAMESPDRSGMLTPPIHTSVSVPFRWEEEPGKPRFCFTTPNTVTQ---TNCLELPPRLLLI-DPKTPKLPSQKP----PFQFHNHCCGSLRFHKS
        P+  IA   +P   G+ TPP++ + SVPF WEE PGKPR     P  + Q      LELPPRL+L  +  T   PS       P+        SL   +S
Subjt:  PLLAIAAMESPDRSGMLTPPIHTSVSVPFRWEEEPGKPRFCFTTPNTVTQ---TNCLELPPRLLLI-DPKTPKLPSQKP----PFQFHNHCCGSLRFHKS

Query:  QLGAMVLRKRGL---LIEKEWFCWLGKLSFGRKGEVGSAFGSVFPSS--LEDAIVAER------------SSSRMKVAVTQKDGSFSSCLLQPKTEFW--
           A++ + RG+     EKE     G   +G  G        +F  S   +D     R              +++K+    K GSF +     K++FW  
Subjt:  QLGAMVLRKRGL---LIEKEWFCWLGKLSFGRKGEVGSAFGSVFPSS--LEDAIVAER------------SSSRMKVAVTQKDGSFSSCLLQPKTEFW--

Query:  --GSIGEGLKQINIPWKSKK
            + EG KQ+ IPWK K+
Subjt:  --GSIGEGLKQINIPWKSKK
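
The % identity values listed with the hits can be related to the alignments themselves. The sketch below assumes the middle line of each alignment follows the usual BLAST convention, where a letter marks an identical residue and `+` marks a conservative substitution, and that identity is the number of identical positions divided by the alignment length (gaps included). The function name `percent_identity` is hypothetical. Only letters are counted, so the result does not depend on the exact spacing of the middle line.

```python
def percent_identity(query, midline):
    """Identical positions (letters in the midline) over alignment length."""
    identical = sum(1 for ch in midline if ch.isalpha())
    return round(100.0 * identical / len(query), 2)

# Query and middle line copied from the AT3G61840.1 alignment above.
query = "LAIAAMESPDRSGMLTPPIHTSVSVPFRWEEEPGKPRFCFTTPNTVTQTNCLELPPRLLLIDPKTPKLP"
midline = "++++ M SPD S        +  S+PF WEEEPGKP+     P   + + CL+LPPR+L  D  T K+P"

print(percent_identity(query, midline))
```

For this hit the middle line has 28 letters over 69 aligned positions, which agrees with the 40.58 listed for AT3G61840.1.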


Sequences
CDS sequence
ATGGATATTTCAGTGAAAAATCTGTTCTCCTCTGCTGGAGTCATGGAAGTCAGTACACAATACACACACATAGAAAGAGAAACCGAGGGAGAAATGGGAAGTGAAGCAAA
GCTATGGCAAAATTCTGAAGTACCAAAGCTTCCATTACTTGCAATTGCAGCAATGGAGTCTCCAGATCGGTCTGGAATGCTAACCCCACCAATCCACACCTCAGTCTCAG
TCCCATTTCGATGGGAAGAAGAGCCTGGCAAGCCCAGGTTCTGTTTCACAACCCCCAACACAGTCACGCAAACAAACTGCCTTGAACTTCCACCAAGATTGTTACTAATT
GATCCCAAAACCCCAAAACTCCCTTCTCAAAAGCCCCCTTTTCAGTTTCATAACCACTGTTGTGGCTCACTCAGATTTCATAAATCTCAGCTTGGAGCCATGGTTCTCAG
AAAGAGAGGACTGCTGATTGAGAAGGAATGGTTTTGTTGGTTGGGAAAATTGAGTTTTGGGCGCAAAGGGGAAGTTGGGTCTGCCTTTGGAAGTGTATTTCCTTCTTCTT
TGGAAGATGCGATTGTTGCAGAGAGAAGCAGTTCAAGGATGAAAGTTGCAGTGACCCAGAAAGATGGGAGCTTTTCTTCTTGTCTTCTCCAACCCAAGACTGAGTTTTGG
GGAAGCATAGGTGAGGGGTTGAAACAGATCAACATTCCATGGAAGAGCAAAAAAGCATAA
mRNA sequence
ATGGATATTTCAGTGAAAAATCTGTTCTCCTCTGCTGGAGTCATGGAAGTCAGTACACAATACACACACATAGAAAGAGAAACCGAGGGAGAAATGGGAAGTGAAGCAAA
GCTATGGCAAAATTCTGAAGTACCAAAGCTTCCATTACTTGCAATTGCAGCAATGGAGTCTCCAGATCGGTCTGGAATGCTAACCCCACCAATCCACACCTCAGTCTCAG
TCCCATTTCGATGGGAAGAAGAGCCTGGCAAGCCCAGGTTCTGTTTCACAACCCCCAACACAGTCACGCAAACAAACTGCCTTGAACTTCCACCAAGATTGTTACTAATT
GATCCCAAAACCCCAAAACTCCCTTCTCAAAAGCCCCCTTTTCAGTTTCATAACCACTGTTGTGGCTCACTCAGATTTCATAAATCTCAGCTTGGAGCCATGGTTCTCAG
AAAGAGAGGACTGCTGATTGAGAAGGAATGGTTTTGTTGGTTGGGAAAATTGAGTTTTGGGCGCAAAGGGGAAGTTGGGTCTGCCTTTGGAAGTGTATTTCCTTCTTCTT
TGGAAGATGCGATTGTTGCAGAGAGAAGCAGTTCAAGGATGAAAGTTGCAGTGACCCAGAAAGATGGGAGCTTTTCTTCTTGTCTTCTCCAACCCAAGACTGAGTTTTGG
GGAAGCATAGGTGAGGGGTTGAAACAGATCAACATTCCATGGAAGAGCAAAAAAGCATAAGAAAGAGA
Protein sequence
MDISVKNLFSSAGVMEVSTQYTHIERETEGEMGSEAKLWQNSEVPKLPLLAIAAMESPDRSGMLTPPIHTSVSVPFRWEEEPGKPRFCFTTPNTVTQTNCLELPPRLLLI
DPKTPKLPSQKPPFQFHNHCCGSLRFHKSQLGAMVLRKRGLLIEKEWFCWLGKLSFGRKGEVGSAFGSVFPSSLEDAIVAERSSSRMKVAVTQKDGSFSSCLLQPKTEFW
GSIGEGLKQINIPWKSKKA
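
The CDS and protein sequences above can be checked against each other by translation. This is a minimal sketch that assumes the standard genetic code; the helper name `translate` is hypothetical, and only the first 24 nucleotides and the last line of the CDS are used to keep the example short.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = ["".join(c) for c in product(BASES, repeat=3)]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))

def translate(cds):
    """Translate in frame 0 and stop at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE[cds[i:i + 3].upper()]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)

cds_head = "ATGGATATTTCAGTGAAAAATCTG"  # first 24 nt of the CDS
cds_tail = "GGAAGCATAGGTGAGGGGTTGAAACAGATCAACATTCCATGGAAGAGCAAAAAAGCATAA"  # last CDS line

print(translate(cds_head))
print(translate(cds_tail))
```

The two fragments translate to MDISVKNL and GSIGEGLKQINIPWKSKKA, matching the start and the last line of the protein sequence. The last CDS line begins at nucleotide 661, so it is in frame when translated on its own.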