CuGenDBv2

CmoCh02G001860 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene ID: CmoCh02G001860
Organism: Cucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
Description: Protein of unknown function (DUF1068)
Genome location: Cmo_Chr02:886793..889987
RNA-Seq Expression: CmoCh02G001860
Synteny: CmoCh02G001860
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
InterPro domains: IPR010471 - Protein of unknown function DUF1068
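
The genome location field can be split into a sequence name and coordinates with a few lines of Python. This is a minimal sketch, assuming the `seqid:start..end` notation is 1-based and inclusive (the page does not state this); `parse_location` is a hypothetical helper, not part of any CuGenDB tool.

```python
import re

def parse_location(location):
    """Split 'seqid:start..end' into (seqid, start, end, span_length).

    Assumption: coordinates are 1-based and inclusive, so the span
    length is end - start + 1.
    """
    match = re.fullmatch(r"(?P<seqid>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)",
                         location.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    start, end = int(match["start"]), int(match["end"])
    return match["seqid"], start, end, end - start + 1

seqid, start, end, length = parse_location("Cmo_Chr02:886793..889987")
print(seqid, start, end, length)  # Cmo_Chr02 886793 889987 3195
```

Under that assumption the gene spans 3195 bp on Cmo_Chr02.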


Homology
GenBank top hits (e value, % identity, alignment)
XP_022947173.1 uncharacterized protein LOC111451120 isoform X1 [Cucurbita moschata] (e value: 8.8e-92; % identity: 84.55)
Query:  MAVKPAGSCSPGLTKVGLGFIALCIAAYILGPPLYWHFVEGLAVLSSSSSSTCPPCFCDCSSQTDFAFTDESFIQHSLKTRLLEKRAKELQATTSMIVLL
        MAVKPAGSCSPGLTKVGLGFIALCIAAYILGPPLYWHFVEGLAVLSSSSSSTCPPCFCDCSSQTDFAFTD                  E + TT      
Subjt:  MAVKPAGSCSPGLTKVGLGFIALCIAAYILGPPLYWHFVEGLAVLSSSSSSTCPPCFCDCSSQTDFAFTDESFIQHSLKTRLLEKRAKELQATTSMIVLL

Query:  QLNRAQNCVKHDSGMNEETEETFAELLSEELKLREAEAMERHRRADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQKKLTALWENRARQ
             ++CVKHDSGMNEETEETFAELLSEELKLREAEAMERHRRADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQKKLTALWENRARQ
Subjt:  QLNRAQNCVKHDSGMNEETEETFAELLSEELKLREAEAMERHRRADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQKKLTALWENRARQ

Query:  RGWRDDIVMSHAARDIVQTS
        RGWRDDIVMSHAARDIVQTS
Subjt:  RGWRDDIVMSHAARDIVQTS

XP_022947174.1 uncharacterized protein LOC111451120 isoform X2 [Cucurbita moschata] (e value: 9.8e-91; % identity: 83.18)
Query:  MAVKPAGSCSPGLTKVGLGFIALCIAAYILGPPLYWHFVEGLAVLSSSSSSTCPPCFCDCSSQTDFAFTDESFIQHSLKTRLLEKRAKELQATTSMIVLL
        MAVKPAGSCSPGLTKVGLGFIALCIAAYILGPPLYWHFVEGLAVLSSSSSSTCPPCFCDCSSQTDFAFTD+                             
Subjt:  MAVKPAGSCSPGLTKVGLGFIALCIAAYILGPPLYWHFVEGLAVLSSSSSSTCPPCFCDCSSQTDFAFTDESFIQHSLKTRLLEKRAKELQATTSMIVLL

Query:  QLNRAQNCVKHDSGMNEETEETFAELLSEELKLREAEAMERHRRADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQKKLTALWENRARQ
               CVKHDSGMNEETEETFAELLSEELKLREAEAMERHRRADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQKKLTALWENRARQ
Subjt:  QLNRAQNCVKHDSGMNEETEETFAELLSEELKLREAEAMERHRRADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQKKLTALWENRARQ

Query:  RGWRDDIVMSHAARDIVQTS
        RGWRDDIVMSHAARDIVQTS
Subjt:  RGWRDDIVMSHAARDIVQTS

XP_022971112.1 uncharacterized protein LOC111469880 isoform X1 [Cucurbita maxima] (e value: 1.8e-89; % identity: 82.27)
Query:  MAVKPAGSCSPGLTKVGLGFIALCIAAYILGPPLYWHFVEGLAVLSSSSSSTCPPCFCDCSSQTDFAFTDESFIQHSLKTRLLEKRAKELQATTSMIVLL
        MAVKPAGSCSPGLTKVGLGFIALCIAAYILGPPLYWHF+EGLAVLSSS SSTCPPCFCDCSSQTDFAFTD                  E + TT      
Subjt:  MAVKPAGSCSPGLTKVGLGFIALCIAAYILGPPLYWHFVEGLAVLSSSSSSTCPPCFCDCSSQTDFAFTDESFIQHSLKTRLLEKRAKELQATTSMIVLL

Query:  QLNRAQNCVKHDSGMNEETEETFAELLSEELKLREAEAMERHRRADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQKKLTALWENRARQ
             ++CVKHDSGMNEETEE+FAELLSE+LKLREA+AMERHRRADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQKKLTALWENRARQ
Subjt:  QLNRAQNCVKHDSGMNEETEETFAELLSEELKLREAEAMERHRRADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQKKLTALWENRARQ

Query:  RGWRDDIVMSHAARDIVQTS
        RGWRDDIVMSHAARDIVQTS
Subjt:  RGWRDDIVMSHAARDIVQTS

XP_023534557.1 uncharacterized protein LOC111796098 isoform X1 [Cucurbita pepo subsp. pepo] (e value: 8.3e-90; % identity: 82.73)
Query:  MAVKPAGSCSPGLTKVGLGFIALCIAAYILGPPLYWHFVEGLAVLSSSSSSTCPPCFCDCSSQTDFAFTDESFIQHSLKTRLLEKRAKELQATTSMIVLL
        MAVKPAGSCSPGLTKVGLGFIALCIAAYILGPPLYWHF+EGLAVLSSSSSSTCPPCFCDCSSQTDFAFTD                  E + TT      
Subjt:  MAVKPAGSCSPGLTKVGLGFIALCIAAYILGPPLYWHFVEGLAVLSSSSSSTCPPCFCDCSSQTDFAFTDESFIQHSLKTRLLEKRAKELQATTSMIVLL

Query:  QLNRAQNCVKHDSGMNEETEETFAELLSEELKLREAEAMERHRRADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQKKLTALWENRARQ
             ++CVKHDSGMNEETEE+FAELLSEELKLREAEA+ERHRRADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQKKLTALWENRARQ
Subjt:  QLNRAQNCVKHDSGMNEETEETFAELLSEELKLREAEAMERHRRADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQKKLTALWENRARQ

Query:  RGWRDDIVMSHAARDIVQTS
        RGWRDDIV SHAARDIVQTS
Subjt:  RGWRDDIVMSHAARDIVQTS

XP_023534558.1 uncharacterized protein LOC111796098 isoform X2 [Cucurbita pepo subsp. pepo] (e value: 9.1e-89; % identity: 81.36)
Query:  MAVKPAGSCSPGLTKVGLGFIALCIAAYILGPPLYWHFVEGLAVLSSSSSSTCPPCFCDCSSQTDFAFTDESFIQHSLKTRLLEKRAKELQATTSMIVLL
        MAVKPAGSCSPGLTKVGLGFIALCIAAYILGPPLYWHF+EGLAVLSSSSSSTCPPCFCDCSSQTDFAFTD+                             
Subjt:  MAVKPAGSCSPGLTKVGLGFIALCIAAYILGPPLYWHFVEGLAVLSSSSSSTCPPCFCDCSSQTDFAFTDESFIQHSLKTRLLEKRAKELQATTSMIVLL

Query:  QLNRAQNCVKHDSGMNEETEETFAELLSEELKLREAEAMERHRRADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQKKLTALWENRARQ
               CVKHDSGMNEETEE+FAELLSEELKLREAEA+ERHRRADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQKKLTALWENRARQ
Subjt:  QLNRAQNCVKHDSGMNEETEETFAELLSEELKLREAEAMERHRRADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQKKLTALWENRARQ

Query:  RGWRDDIVMSHAARDIVQTS
        RGWRDDIV SHAARDIVQTS
Subjt:  RGWRDDIVMSHAARDIVQTS

TrEMBL top hits (e value, % identity, alignment)
A0A0A0LQC2 Uncharacterized protein (e value: 4.3e-76; % identity: 75.24)
Query:  MAVKPAGSCSPGLTKVGLGFIALCIAAYILGPPLYWHFVEGLAVLSSSSSSTCPPCFCDCSSQTDFAFTDESFIQHSLKTRLLEKRAKELQATTSMIVLL
        MAVKP GSCSPGLTKVGL  +ALCIAAYILGPPLYWHF+EGL   SSSS STCPPCFCDCSS TDFAFT+                  EL+ TT      
Subjt:  MAVKPAGSCSPGLTKVGLGFIALCIAAYILGPPLYWHFVEGLAVLSSSSSSTCPPCFCDCSSQTDFAFTDESFIQHSLKTRLLEKRAKELQATTSMIVLL

Query:  QLNRAQNCVKHDSGMNEETEETFAELLSEELKLREAEAMERHRRADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQKKLTALWENRARQ
             ++CVKHDSGMNEETE+ FAELLSEELKLREAEA+E HRRADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQK+LTALWE RARQ
Subjt:  QLNRAQNCVKHDSGMNEETEETFAELLSEELKLREAEAMERHRRADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQKKLTALWENRARQ

Query:  RGWRDDIVMS
        RGWRD+IV S
Subjt:  RGWRDDIVMS

A0A6J1G5W1 uncharacterized protein LOC111451120 isoform X1 (e value: 4.3e-92; % identity: 84.55)
Query:  MAVKPAGSCSPGLTKVGLGFIALCIAAYILGPPLYWHFVEGLAVLSSSSSSTCPPCFCDCSSQTDFAFTDESFIQHSLKTRLLEKRAKELQATTSMIVLL
        MAVKPAGSCSPGLTKVGLGFIALCIAAYILGPPLYWHFVEGLAVLSSSSSSTCPPCFCDCSSQTDFAFTD                  E + TT      
Subjt:  MAVKPAGSCSPGLTKVGLGFIALCIAAYILGPPLYWHFVEGLAVLSSSSSSTCPPCFCDCSSQTDFAFTDESFIQHSLKTRLLEKRAKELQATTSMIVLL

Query:  QLNRAQNCVKHDSGMNEETEETFAELLSEELKLREAEAMERHRRADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQKKLTALWENRARQ
             ++CVKHDSGMNEETEETFAELLSEELKLREAEAMERHRRADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQKKLTALWENRARQ
Subjt:  QLNRAQNCVKHDSGMNEETEETFAELLSEELKLREAEAMERHRRADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQKKLTALWENRARQ

Query:  RGWRDDIVMSHAARDIVQTS
        RGWRDDIVMSHAARDIVQTS
Subjt:  RGWRDDIVMSHAARDIVQTS

A0A6J1G609 uncharacterized protein LOC111451120 isoform X2 (e value: 4.7e-91; % identity: 83.18)
Query:  MAVKPAGSCSPGLTKVGLGFIALCIAAYILGPPLYWHFVEGLAVLSSSSSSTCPPCFCDCSSQTDFAFTDESFIQHSLKTRLLEKRAKELQATTSMIVLL
        MAVKPAGSCSPGLTKVGLGFIALCIAAYILGPPLYWHFVEGLAVLSSSSSSTCPPCFCDCSSQTDFAFTD+                             
Subjt:  MAVKPAGSCSPGLTKVGLGFIALCIAAYILGPPLYWHFVEGLAVLSSSSSSTCPPCFCDCSSQTDFAFTDESFIQHSLKTRLLEKRAKELQATTSMIVLL

Query:  QLNRAQNCVKHDSGMNEETEETFAELLSEELKLREAEAMERHRRADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQKKLTALWENRARQ
               CVKHDSGMNEETEETFAELLSEELKLREAEAMERHRRADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQKKLTALWENRARQ
Subjt:  QLNRAQNCVKHDSGMNEETEETFAELLSEELKLREAEAMERHRRADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQKKLTALWENRARQ

Query:  RGWRDDIVMSHAARDIVQTS
        RGWRDDIVMSHAARDIVQTS
Subjt:  RGWRDDIVMSHAARDIVQTS

A0A6J1I131 uncharacterized protein LOC111469880 isoform X1 (e value: 8.9e-90; % identity: 82.27)
Query:  MAVKPAGSCSPGLTKVGLGFIALCIAAYILGPPLYWHFVEGLAVLSSSSSSTCPPCFCDCSSQTDFAFTDESFIQHSLKTRLLEKRAKELQATTSMIVLL
        MAVKPAGSCSPGLTKVGLGFIALCIAAYILGPPLYWHF+EGLAVLSSS SSTCPPCFCDCSSQTDFAFTD                  E + TT      
Subjt:  MAVKPAGSCSPGLTKVGLGFIALCIAAYILGPPLYWHFVEGLAVLSSSSSSTCPPCFCDCSSQTDFAFTDESFIQHSLKTRLLEKRAKELQATTSMIVLL

Query:  QLNRAQNCVKHDSGMNEETEETFAELLSEELKLREAEAMERHRRADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQKKLTALWENRARQ
             ++CVKHDSGMNEETEE+FAELLSE+LKLREA+AMERHRRADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQKKLTALWENRARQ
Subjt:  QLNRAQNCVKHDSGMNEETEETFAELLSEELKLREAEAMERHRRADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQKKLTALWENRARQ

Query:  RGWRDDIVMSHAARDIVQTS
        RGWRDDIVMSHAARDIVQTS
Subjt:  RGWRDDIVMSHAARDIVQTS

A0A6J1I5W5 uncharacterized protein LOC111469880 isoform X2 (e value: 9.8e-89; % identity: 80.91)
Query:  MAVKPAGSCSPGLTKVGLGFIALCIAAYILGPPLYWHFVEGLAVLSSSSSSTCPPCFCDCSSQTDFAFTDESFIQHSLKTRLLEKRAKELQATTSMIVLL
        MAVKPAGSCSPGLTKVGLGFIALCIAAYILGPPLYWHF+EGLAVLSSS SSTCPPCFCDCSSQTDFAFTD+                             
Subjt:  MAVKPAGSCSPGLTKVGLGFIALCIAAYILGPPLYWHFVEGLAVLSSSSSSTCPPCFCDCSSQTDFAFTDESFIQHSLKTRLLEKRAKELQATTSMIVLL

Query:  QLNRAQNCVKHDSGMNEETEETFAELLSEELKLREAEAMERHRRADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQKKLTALWENRARQ
               CVKHDSGMNEETEE+FAELLSE+LKLREA+AMERHRRADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQKKLTALWENRARQ
Subjt:  QLNRAQNCVKHDSGMNEETEETFAELLSEELKLREAEAMERHRRADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQKKLTALWENRARQ

Query:  RGWRDDIVMSHAARDIVQTS
        RGWRDDIVMSHAARDIVQTS
Subjt:  RGWRDDIVMSHAARDIVQTS

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT1G05070.1 Protein of unknown function (DUF1068) (e value: 9.2e-47; % identity: 52.36)
Query:  KVGLGFIALCIAAYILGPPLYWHFVEGLAVLSSSSSSTCPPCFCDCSSQTDFAFTDESFIQHSLKTRLLEKRAKELQATTSMIVLLQLNRAQNCVKHDSG
        K+GL  + L +A YILGPPLYWH  E LA +S+SS   CP C C+CS+ +      E                            L      +C KHD  
Subjt:  KVGLGFIALCIAAYILGPPLYWHFVEGLAVLSSSSSSTCPPCFCDCSSQTDFAFTDESFIQHSLKTRLLEKRAKELQATTSMIVLLQLNRAQNCVKHDSG

Query:  MNEETEETFAELLSEELKLREAEAMERHRRADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQKKLTALWENRARQRGWRD
        +NE+TE+ +AELL+EELKLREAE++E+H+RAD+ LLEAKK+TS YQKEADKCNSGMETCE ARE+AE  LA QKKLT+ WE RARQ+GWR+
Subjt:  MNEETEETFAELLSEELKLREAEAMERHRRADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQKKLTALWENRARQRGWRD

AT2G32580.1 Protein of unknown function (DUF1068) (e value: 4.0e-42; % identity: 48.98)
Query:  KVGLGFIALCIAAYILGPPLYWHFVEGLAVLSSSSSSTCPPCFCDCSSQTDFAFTDESFIQHSLKTRLLEKRAKELQATTSMIVLLQLNRAQNCVKHDSG
        KVGL  +AL +  YILGPPLYWH  E LAV    S+++C  C CDCSS                                ++   L      +C K D  
Subjt:  KVGLGFIALCIAAYILGPPLYWHFVEGLAVLSSSSSSTCPPCFCDCSSQTDFAFTDESFIQHSLKTRLLEKRAKELQATTSMIVLLQLNRAQNCVKHDSG

Query:  MNEETEETFAELLSEELKLREAEAMERHRRADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQKKLTALWENRARQRGWRDDIVMS
        +NE+TE+ +AELL+EELK REA +ME+H+R D  LLEAKK+TS YQKEADKCNSGMETCE ARE+AE  L  QKKLT++WE RARQ+G++D    S
Subjt:  MNEETEETFAELLSEELKLREAEAMERHRRADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQKKLTALWENRARQRGWRDDIVMS

AT2G32580.2 Protein of unknown function (DUF1068) (e value: 1.7e-32; % identity: 65.38)
Query:  NCVKHDSGMNEETEETFAELLSEELKLREAEAMERHRRADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQKKLTALWENRARQRGWRDD
        NC K D  +NE+TE+ +AELL+EELK REA +ME+H+R D  LLEAKK+TS YQKEADKCNSGMETCE ARE+AE  L  QKKLT++WE RARQ+G++D 
Subjt:  NCVKHDSGMNEETEETFAELLSEELKLREAEAMERHRRADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQKKLTALWENRARQRGWRDD

Query:  IVMS
           S
Subjt:  IVMS

AT4G04360.1 Protein of unknown function (DUF1068) (e value: 3.0e-37; % identity: 46.94)
Query:  KVGLGFIALCIAAYILGPPLYWHFVEGLAVLSSSSSSTCPPCFCDCSSQTDFAFTDESFIQHSLKTRLLEKRAKELQATTSMIVLLQLNRAQNCVKHDSG
        KV    + LCI AYI GP LYWH  E +A    S  S+CPPC CDCSSQ   +  D     HS                             +C++H+ G
Subjt:  KVGLGFIALCIAAYILGPPLYWHFVEGLAVLSSSSSSTCPPCFCDCSSQTDFAFTDESFIQHSLKTRLLEKRAKELQATTSMIVLLQLNRAQNCVKHDSG

Query:  MNEETEETFAELLSEELKLREAEAMERHRRADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQKKLTALWENRARQRGWRDDIVMS
         +EE+E +F E+++EELKLREA+A E   RAD  LL+AKK  SQYQKEADKC+ GMETCE ARE+AEA L  Q++L+ +WE RARQ GW++  V S
Subjt:  MNEETEETFAELLSEELKLREAEAMERHRRADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQKKLTALWENRARQRGWRDDIVMS

AT4G30996.1 Protein of unknown function (DUF1068) (e value: 2.5e-28; % identity: 40.21)
Query:  LGFIALCIAAYILGPPLYWHFVEGLAVLSSSSSSTCPPCFCDCSSQTDFAFTDESFIQHSLKTRLLEKRAKELQATTSMIVLLQLNRAQNCVKHDSGMNE
        L   A+  A  + GP LYW F +G  V S+ ++S CPPC CDC                             LQ    +  L       +C   D  + +
Subjt:  LGFIALCIAAYILGPPLYWHFVEGLAVLSSSSSSTCPPCFCDCSSQTDFAFTDESFIQHSLKTRLLEKRAKELQATTSMIVLLQLNRAQNCVKHDSGMNE

Query:  ETEETFAELLSEELKLREAEAMERHRRADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQKKLTALWENRARQRGWRDD
        E E+ F +LL+EELKL+EA A E  R  +++L EAK++ SQYQKEA+KCN+  E CE+ARERAEA L  ++K+T+LWE RARQ GW  +
Subjt:  ETEETFAELLSEELKLREAEAMERHRRADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQKKLTALWENRARQRGWRDD
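
In the alignments above, the middle line repeats a residue where query and subject agree, shows `+` for a conservative substitution, and leaves a blank otherwise. The following is a minimal sketch of how a percent identity could be derived from such a midline, assuming the value is identical positions divided by the number of aligned positions (the usual BLAST-style definition; the page does not state which formula it uses). `percent_identity` is a hypothetical helper, and the input is the last segment of the XP_023534557.1 alignment.

```python
def percent_identity(query, midline):
    """Percent of aligned positions where the midline repeats the query residue.

    Assumption: identity = identical positions / aligned positions * 100.
    '+' (similar residue) and ' ' (mismatch or gap) do not count as identical.
    """
    if len(query) != len(midline):
        raise ValueError("query and midline must have the same length")
    identical = sum(1 for q, m in zip(query, midline) if m.isalpha() and m == q)
    return 100.0 * identical / len(query)

# Last segment of the XP_023534557.1 alignment: one mismatch in 20 positions.
query   = "RGWRDDIVMSHAARDIVQTS"
midline = "RGWRDDIV SHAARDIVQTS"
print(f"{percent_identity(query, midline):.2f}")  # 95.00
```

For a whole hit, the segments would be concatenated before calling the function, so that the result covers the full alignment rather than one 100-column block.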


Sequences
CDS sequence
ATGGCAGTGAAGCCGGCGGGTTCGTGCTCTCCAGGACTGACGAAGGTGGGATTGGGTTTTATAGCTCTCTGTATAGCTGCTTACATTTTGGGTCCGCCTCTCTACTGGCA
CTTCGTGGAGGGCTTGGCTGTTCTTTCTTCTTCCTCTTCCTCAACTTGTCCACCTTGCTTTTGTGACTGTTCTTCTCAGACTGACTTCGCCTTCACTGATGAGAGTTTCA
TTCAACATAGTTTGAAAACACGACTTTTAGAGAAAAGAGCAAAAGAACTACAAGCTACTACTAGCATGATCGTCTTGCTCCAGTTGAATCGAGCTCAAAATTGTGTGAAA
CATGACTCGGGCATGAATGAGGAAACAGAAGAGACTTTTGCAGAGTTGTTGTCCGAGGAACTGAAGCTGAGGGAAGCCGAAGCCATGGAACGTCATCGGCGCGCTGACAT
ATCTCTTCTGGAAGCGAAGAAGATGACATCTCAATATCAGAAAGAAGCAGACAAGTGCAATTCAGGCATGGAAACATGTGAAGCAGCAAGGGAAAGAGCTGAAGCTACAT
TAGCTTCACAGAAGAAGCTAACAGCATTATGGGAGAATAGGGCTCGCCAAAGAGGATGGCGAGACGACATTGTCATGTCACATGCTGCTCGTGATATCGTTCAAACTTCA
TGA
mRNA sequence
CACTACGTTTGATTTCCATTCCGCTTTTCCTTCTGTAATTCATCGTCTCCAACGAGCTCGTTTCCATTTTGCATTTCTCGTTTCTGGACACACACATAGAGGTAGAGGAG
ATACAGAAAGGTCTAGGAATGGCAGTGAAGCCGGCGGGTTCGTGCTCTCCAGGACTGACGAAGGTGGGATTGGGTTTTATAGCTCTCTGTATAGCTGCTTACATTTTGGG
TCCGCCTCTCTACTGGCACTTCGTGGAGGGCTTGGCTGTTCTTTCTTCTTCCTCTTCCTCAACTTGTCCACCTTGCTTTTGTGACTGTTCTTCTCAGACTGACTTCGCCT
TCACTGATGAGAGTTTCATTCAACATAGTTTGAAAACACGACTTTTAGAGAAAAGAGCAAAAGAACTACAAGCTACTACTAGCATGATCGTCTTGCTCCAGTTGAATCGA
GCTCAAAATTGTGTGAAACATGACTCGGGCATGAATGAGGAAACAGAAGAGACTTTTGCAGAGTTGTTGTCCGAGGAACTGAAGCTGAGGGAAGCCGAAGCCATGGAACG
TCATCGGCGCGCTGACATATCTCTTCTGGAAGCGAAGAAGATGACATCTCAATATCAGAAAGAAGCAGACAAGTGCAATTCAGGCATGGAAACATGTGAAGCAGCAAGGG
AAAGAGCTGAAGCTACATTAGCTTCACAGAAGAAGCTAACAGCATTATGGGAGAATAGGGCTCGCCAAAGAGGATGGCGAGACGACATTGTCATGTCACATGCTGCTCGT
GATATCGTTCAAACTTCATGAATGACCCGACACTAAACAGGTGCTTTGACAGTAGCTTGGAATTTGATTAATCTATACTGGGAGCCCAGGTTTTTCTTTTTACATATCCC
ACTTAGGTTTCTTCTTCAAACTTCCACTTGTATGAACATGATAGATAGCGATATATTATGGAAAAGATACCAACCAGATGCCTCTCTCCTTTCCCTTGAAGAGATTAGGA
CCTAAAACCTTTGGCAGGGCAATGTTATCTGGCAAATCTTCCTTTGATGCTTAATGGGCTGTTTCAATATTTGATAGAACAGTAGGTTAGACTACTCATCTAACTTCTTG
TCCCTAGAACGATCAAATATCATCTTCATTCATAAAGAACGTACAAGAGATCAAAGAG
Protein sequence
MAVKPAGSCSPGLTKVGLGFIALCIAAYILGPPLYWHFVEGLAVLSSSSSSTCPPCFCDCSSQTDFAFTDESFIQHSLKTRLLEKRAKELQATTSMIVLLQLNRAQNCVK
HDSGMNEETEETFAELLSEELKLREAEAMERHRRADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQKKLTALWENRARQRGWRDDIVMSHAARDIVQTS
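
The CDS and protein sequences above can be checked against each other by translating the CDS. The sketch below assumes the standard genetic code (NCBI translation table 1) and uses only the Python standard library; `translate` is a hypothetical helper, and the two inputs are the first 30 and last 18 nucleotides of the CDS listed above.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)

cds_start = "ATGGCAGTGAAGCCGGCGGGTTCGTGCTCT"  # first 30 nt of the CDS
cds_end = "ATCGTTCAAACTTCATGA"                # last 18 nt, including the stop codon
print(translate(cds_start))  # MAVKPAGSCS
print(translate(cds_end))    # IVQTS
```

Both fragments agree with the start (MAVKPAGSCS) and end (IVQTS) of the listed protein sequence. Passing the full CDS with its line breaks to `translate` works the same way, since whitespace is removed before translation.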