; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MC02g0995 (gene) of Bitter gourd (Dali-11) v1 genome

Gene IDMC02g0995
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
DescriptionPhytocyanin domain-containing protein
Genome locationMC02:7980280..7981964
RNA-Seq ExpressionMC02g0995
SyntenyMC02g0995
Gene Ontology termsGO:0022900 - electron transport chain (biological process)
GO:0009055 - electron transfer activity (molecular function)
InterPro domainsIPR003245 - Phytocyanin domain
IPR008972 - Cupredoxin


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008459215.1 PREDICTED: extensin, partial [Cucumis melo]5.35e-6274.8Show/hide
Query:  PPSSRPPPQSPRKIIVGGSENWHLGFDYSNWALKNGPFFLNDILVFKYDPPTGATVPHSVYLLSNMQSFSNCDLRRATKLANWTQGNGDGFQFVLKQQKT
        PPS  P PQSPRKIIVGGS  W LGFDY++WALKNGPF++NDILVFKYDPP  +T PHSVYLL NM+SF+NCDL +A  LAN  QG+GDGF+FVLK Q  
Subjt:  PPSSRPPPQSPRKIIVGGSENWHLGFDYSNWALKNGPFFLNDILVFKYDPPTGATVPHSVYLLSNMQSFSNCDLRRATKLANWTQGNGDGFQFVLKQQKT

Query:  YYFACGEGNGFHCKNGSMKFSVTPILQ
        YYFACGEGNGFHCK GSMKF++TPIL+
Subjt:  YYFACGEGNGFHCKNGSMKFSVTPILQ

XP_011649199.1 leucine-rich repeat extensin-like protein 3 [Cucumis sativus]8.63e-6459.32Show/hide
Query:  PKTRRFPP---PPQPKSRRFPPPPQPKTRRFPPPPPQPKTRLFQPPPPQPKTRRSPPPPPSSLLPPSPLPSLLPPPPPPPPPLPPSPLPSPPPPQPKTRR
        P  RR PP   P  P SRRFPPPP  + R     PP  +  L  PPPP P+  R+PPP      PP P P L  PPPP       SP+ SPPPPQ + R 
Subjt:  PKTRRFPP---PPQPKSRRFPPPPQPKTRRFPPPPPQPKTRLFQPPPPQPKTRRSPPPPPSSLLPPSPLPSLLPPPPPPPPPLPPSPLPSPPPPQPKTRR

Query:  SPPPPPQSKTRPSPPPPPPSPLPPPPLPSPPSLPPPPSSRPPPQSPRKIIVGGSENWHLGFDYSNWALKNGPFFLNDILVFKYDPPTGATVPHSVYLLSN
         P PPP     PSPP     P P PP PSP SLPP  S +P PQSPRKIIVGGS  W LGFDY++WALKNGPF++NDILVFKYDPP  +T PH+VYLL N
Subjt:  SPPPPPQSKTRPSPPPPPPSPLPPPPLPSPPSLPPPPSSRPPPQSPRKIIVGGSENWHLGFDYSNWALKNGPFFLNDILVFKYDPPTGATVPHSVYLLSN

Query:  MQSFSNCDLRRATKLANWTQGNGDGFQFVLKQQKTYYFACGEGNGFHCKNGSMKFSVTPILQS
        M+S +NCD  +A  LAN TQG+GDGF+FVLK Q  YYFACGEGNGFHCK GSMKF++TPIL++
Subjt:  MQSFSNCDLRRATKLANWTQGNGDGFQFVLKQQKTYYFACGEGNGFHCKNGSMKFSVTPILQS

XP_022148560.1 uncharacterized protein LOC111017194 [Momordica charantia]4.38e-5973.81Show/hide
Query:  SRPPPQ-SPR--KIIVGGSENWHLGFDYSNWALKNGPFFLNDILVFKYDPPTGATVPHSVYLLSNMQSFSNCDLRRATKLANWTQGNGDGFQFVLKQQKT
        S PPPQ SP+  KI+VGGSENW  GFDY+NWALKNGPF++ND LVFKYDPP   T PHSVYLL N+ SFS CDLR A K+ANWTQG GDGF+FVL+Q K 
Subjt:  SRPPPQ-SPR--KIIVGGSENWHLGFDYSNWALKNGPFFLNDILVFKYDPPTGATVPHSVYLLSNMQSFSNCDLRRATKLANWTQGNGDGFQFVLKQQKT

Query:  YYFACGEGNGFHCKNGSMKFSVTPIL
        YYFACGE NGFHCK G+MKFS+TP L
Subjt:  YYFACGEGNGFHCKNGSMKFSVTPIL

XP_022148634.1 leucine-rich repeat extensin-like protein 3 [Momordica charantia]3.28e-220100Show/hide
Query:  MASSIYLLAILLFTIASMSNVTSAMGDWFQDYNCPWHSKQRHRSPPPPHPRTRRFPPPPQSKTRRFPPPPHPSLPLPPPPQPKTRRFPPPPQPKSRRFPP
        MASSIYLLAILLFTIASMSNVTSAMGDWFQDYNCPWHSKQRHRSPPPPHPRTRRFPPPPQSKTRRFPPPPHPSLPLPPPPQPKTRRFPPPPQPKSRRFPP
Subjt:  MASSIYLLAILLFTIASMSNVTSAMGDWFQDYNCPWHSKQRHRSPPPPHPRTRRFPPPPQSKTRRFPPPPHPSLPLPPPPQPKTRRFPPPPQPKSRRFPP

Query:  PPQPKTRRFPPPPPQPKTRLFQPPPPQPKTRRSPPPPPSSLLPPSPLPSLLPPPPPPPPPLPPSPLPSPPPPQPKTRRSPPPPPQSKTRPSPPPPPPSPL
        PPQPKTRRFPPPPPQPKTRLFQPPPPQPKTRRSPPPPPSSLLPPSPLPSLLPPPPPPPPPLPPSPLPSPPPPQPKTRRSPPPPPQSKTRPSPPPPPPSPL
Subjt:  PPQPKTRRFPPPPPQPKTRLFQPPPPQPKTRRSPPPPPSSLLPPSPLPSLLPPPPPPPPPLPPSPLPSPPPPQPKTRRSPPPPPQSKTRPSPPPPPPSPL

Query:  PPPPLPSPPSLPPPPSSRPPPQSPRKIIVGGSENWHLGFDYSNWALKNGPFFLNDILVFKYDPPTGATVPHSVYLLSNMQSFSNCDLRRATKLANWTQGN
        PPPPLPSPPSLPPPPSSRPPPQSPRKIIVGGSENWHLGFDYSNWALKNGPFFLNDILVFKYDPPTGATVPHSVYLLSNMQSFSNCDLRRATKLANWTQGN
Subjt:  PPPPLPSPPSLPPPPSSRPPPQSPRKIIVGGSENWHLGFDYSNWALKNGPFFLNDILVFKYDPPTGATVPHSVYLLSNMQSFSNCDLRRATKLANWTQGN

Query:  GDGFQFVLKQQKTYYFACGEGNGFHCKNGSMKFSVTPILQS
        GDGFQFVLKQQKTYYFACGEGNGFHCKNGSMKFSVTPILQS
Subjt:  GDGFQFVLKQQKTYYFACGEGNGFHCKNGSMKFSVTPILQS

XP_038902031.1 extensin-3-like [Benincasa hispida]9.16e-6752.26Show/hide
Query:  IYLLAILLFTIASMSNVTSAMGDWFQDYNCPWHSKQRHRSPPPPHPRTRRFPP-------PPQSKTRRFPPPP--HPSLPLPPPPQPKTRRFPPPPQPKS
        IY   I+LF  A  S ++SA    F+ YN        +RSPPPP    RRFPP       PP   +RRFPPPP  HP  P        +R+FPPPP  + 
Subjt:  IYLLAILLFTIASMSNVTSAMGDWFQDYNCPWHSKQRHRSPPPPHPRTRRFPP-------PPQSKTRRFPPPP--HPSLPLPPPPQPKTRRFPPPPQPKS

Query:  RRFPP----PPQPKTRRFPPPPPQPKTR---LFQPPPPQPKTRRSPPPPPSSLLPPSPLPSLLPPPPPPPPPLPPSPLPSPPPPQPKTRRSPPPPPQSKT
         R PP    PP P+  R PP P  P+T    L  PPPP  + R     PP    PPSP PS         P  PPSP+ SPPP Q +  R+PP    S+ 
Subjt:  RRFPP----PPQPKTRRFPPPPPQPKTR---LFQPPPPQPKTRRSPPPPPSSLLPPSPLPSLLPPPPPPPPPLPPSPLPSPPPPQPKTRRSPPPPPQSKT

Query:  RPSPPP--PPPSPLPPPPLPSPPSLPPPPSSRPPPQSPRKIIVGGSENWHLGFDYSNWALKNGPFFLNDILVFKYDPPTGATVPHSVYLLSNMQSFSNCD
         P PPP  P PSPL        PSLPP PS  P PQSPRKIIVGGS+ W LGFDY++WALKNGPF++NDILVFKYDPP  +T PHSVY L NM+SF+NCD
Subjt:  RPSPPP--PPPSPLPPPPLPSPPSLPPPPSSRPPPQSPRKIIVGGSENWHLGFDYSNWALKNGPFFLNDILVFKYDPPTGATVPHSVYLLSNMQSFSNCD

Query:  LRRATKLANWTQGNGDGFQFVLKQQKTYYFACGEGNGFHCKNGSMKFSVTPILQ
        L +   +AN TQG+ DGF+FVLK Q  YYFACGEGNGFHCK GSMKF++TPI++
Subjt:  LRRATKLANWTQGNGDGFQFVLKQQKTYYFACGEGNGFHCKNGSMKFSVTPILQ

TrEMBL top hitse value%identityAlignment
A0A0A0LIT9 Phytocyanin domain-containing protein4.18e-6459.32Show/hide
Query:  PKTRRFPP---PPQPKSRRFPPPPQPKTRRFPPPPPQPKTRLFQPPPPQPKTRRSPPPPPSSLLPPSPLPSLLPPPPPPPPPLPPSPLPSPPPPQPKTRR
        P  RR PP   P  P SRRFPPPP  + R     PP  +  L  PPPP P+  R+PPP      PP P P L  PPPP       SP+ SPPPPQ + R 
Subjt:  PKTRRFPP---PPQPKSRRFPPPPQPKTRRFPPPPPQPKTRLFQPPPPQPKTRRSPPPPPSSLLPPSPLPSLLPPPPPPPPPLPPSPLPSPPPPQPKTRR

Query:  SPPPPPQSKTRPSPPPPPPSPLPPPPLPSPPSLPPPPSSRPPPQSPRKIIVGGSENWHLGFDYSNWALKNGPFFLNDILVFKYDPPTGATVPHSVYLLSN
         P PPP     PSPP     P P PP PSP SLPP  S +P PQSPRKIIVGGS  W LGFDY++WALKNGPF++NDILVFKYDPP  +T PH+VYLL N
Subjt:  SPPPPPQSKTRPSPPPPPPSPLPPPPLPSPPSLPPPPSSRPPPQSPRKIIVGGSENWHLGFDYSNWALKNGPFFLNDILVFKYDPPTGATVPHSVYLLSN

Query:  MQSFSNCDLRRATKLANWTQGNGDGFQFVLKQQKTYYFACGEGNGFHCKNGSMKFSVTPILQS
        M+S +NCD  +A  LAN TQG+GDGF+FVLK Q  YYFACGEGNGFHCK GSMKF++TPIL++
Subjt:  MQSFSNCDLRRATKLANWTQGNGDGFQFVLKQQKTYYFACGEGNGFHCKNGSMKFSVTPILQS

A0A1S3CAV5 extensin2.59e-6274.8Show/hide
Query:  PPSSRPPPQSPRKIIVGGSENWHLGFDYSNWALKNGPFFLNDILVFKYDPPTGATVPHSVYLLSNMQSFSNCDLRRATKLANWTQGNGDGFQFVLKQQKT
        PPS  P PQSPRKIIVGGS  W LGFDY++WALKNGPF++NDILVFKYDPP  +T PHSVYLL NM+SF+NCDL +A  LAN  QG+GDGF+FVLK Q  
Subjt:  PPSSRPPPQSPRKIIVGGSENWHLGFDYSNWALKNGPFFLNDILVFKYDPPTGATVPHSVYLLSNMQSFSNCDLRRATKLANWTQGNGDGFQFVLKQQKT

Query:  YYFACGEGNGFHCKNGSMKFSVTPILQ
        YYFACGEGNGFHCK GSMKF++TPIL+
Subjt:  YYFACGEGNGFHCKNGSMKFSVTPILQ

A0A6J1D388 uncharacterized protein LOC1110171942.12e-5973.81Show/hide
Query:  SRPPPQ-SPR--KIIVGGSENWHLGFDYSNWALKNGPFFLNDILVFKYDPPTGATVPHSVYLLSNMQSFSNCDLRRATKLANWTQGNGDGFQFVLKQQKT
        S PPPQ SP+  KI+VGGSENW  GFDY+NWALKNGPF++ND LVFKYDPP   T PHSVYLL N+ SFS CDLR A K+ANWTQG GDGF+FVL+Q K 
Subjt:  SRPPPQ-SPR--KIIVGGSENWHLGFDYSNWALKNGPFFLNDILVFKYDPPTGATVPHSVYLLSNMQSFSNCDLRRATKLANWTQGNGDGFQFVLKQQKT

Query:  YYFACGEGNGFHCKNGSMKFSVTPIL
        YYFACGE NGFHCK G+MKFS+TP L
Subjt:  YYFACGEGNGFHCKNGSMKFSVTPIL

A0A6J1D4K0 leucine-rich repeat extensin-like protein 31.59e-220100Show/hide
Query:  MASSIYLLAILLFTIASMSNVTSAMGDWFQDYNCPWHSKQRHRSPPPPHPRTRRFPPPPQSKTRRFPPPPHPSLPLPPPPQPKTRRFPPPPQPKSRRFPP
        MASSIYLLAILLFTIASMSNVTSAMGDWFQDYNCPWHSKQRHRSPPPPHPRTRRFPPPPQSKTRRFPPPPHPSLPLPPPPQPKTRRFPPPPQPKSRRFPP
Subjt:  MASSIYLLAILLFTIASMSNVTSAMGDWFQDYNCPWHSKQRHRSPPPPHPRTRRFPPPPQSKTRRFPPPPHPSLPLPPPPQPKTRRFPPPPQPKSRRFPP

Query:  PPQPKTRRFPPPPPQPKTRLFQPPPPQPKTRRSPPPPPSSLLPPSPLPSLLPPPPPPPPPLPPSPLPSPPPPQPKTRRSPPPPPQSKTRPSPPPPPPSPL
        PPQPKTRRFPPPPPQPKTRLFQPPPPQPKTRRSPPPPPSSLLPPSPLPSLLPPPPPPPPPLPPSPLPSPPPPQPKTRRSPPPPPQSKTRPSPPPPPPSPL
Subjt:  PPQPKTRRFPPPPPQPKTRLFQPPPPQPKTRRSPPPPPSSLLPPSPLPSLLPPPPPPPPPLPPSPLPSPPPPQPKTRRSPPPPPQSKTRPSPPPPPPSPL

Query:  PPPPLPSPPSLPPPPSSRPPPQSPRKIIVGGSENWHLGFDYSNWALKNGPFFLNDILVFKYDPPTGATVPHSVYLLSNMQSFSNCDLRRATKLANWTQGN
        PPPPLPSPPSLPPPPSSRPPPQSPRKIIVGGSENWHLGFDYSNWALKNGPFFLNDILVFKYDPPTGATVPHSVYLLSNMQSFSNCDLRRATKLANWTQGN
Subjt:  PPPPLPSPPSLPPPPSSRPPPQSPRKIIVGGSENWHLGFDYSNWALKNGPFFLNDILVFKYDPPTGATVPHSVYLLSNMQSFSNCDLRRATKLANWTQGN

Query:  GDGFQFVLKQQKTYYFACGEGNGFHCKNGSMKFSVTPILQS
        GDGFQFVLKQQKTYYFACGEGNGFHCKNGSMKFSVTPILQS
Subjt:  GDGFQFVLKQQKTYYFACGEGNGFHCKNGSMKFSVTPILQS

A0A6J1G5T9 extensin-like isoform X44.24e-5843.7Show/hide
Query:  SSIYLLAILLFTIASMSNVTSAMGDWFQDYNCPWHSKQRHRSPPPPHPRTRRFPPPPQSKTRRFPPPPHPSLP-LPPPPQPKTRRFPPPPQPKSRRFPP-
        +SI+   I+LF +A MS ++SA   WF+  N  +    RH+ P        RFPPPP   +R  P PP   LP  PPP +P+  R PPP +P+  R PP 
Subjt:  SSIYLLAILLFTIASMSNVTSAMGDWFQDYNCPWHSKQRHRSPPPPHPRTRRFPPPPQSKTRRFPPPPHPSLP-LPPPPQPKTRRFPPPPQPKSRRFPP-

Query:  -PPQPKTRRFPPPPPQPKT---RLFQPPPPQPKTRRSPPPPPSSLLPPSPLPSLLPPPPPPPPPLPPSPLPSPPPPQPKTRRSPPPPPQSKTRPSPPPPP
         P QP+T    PPP +P+T   R F P PP+P+T      PPSS L                                                      
Subjt:  -PPQPKTRRFPPPPPQPKT---RLFQPPPPQPKTRRSPPPPPSSLLPPSPLPSLLPPPPPPPPPLPPSPLPSPPPPQPKTRRSPPPPPQSKTRPSPPPPP

Query:  PSPLPPPPLPSPPSLPPPPSSRPPPQSPRKIIVGGSENWHLGFDYSNWALKNGPFFLNDILVFKYDPPTGATVPHSVYLLSNMQSFSNCDLRRATKLANW
                              P PQ+PRKIIVGGS+ W LGFDY++W LKNGPF++NDILVFKYDPP  +T PH+VYLL NMQS + CD RRA  +AN 
Subjt:  PSPLPPPPLPSPPSLPPPPSSRPPPQSPRKIIVGGSENWHLGFDYSNWALKNGPFFLNDILVFKYDPPTGATVPHSVYLLSNMQSFSNCDLRRATKLANW

Query:  TQGNGDGFQFVLKQQKTYYFACGEGNGFHCKNGSMKFSVTP
        TQG+G+GF FVLKQQK YYFACGEGNGFHC  GSMKFS+TP
Subjt:  TQGNGDGFQFVLKQQKTYYFACGEGNGFHCKNGSMKFSVTP

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G15780.1 Cupredoxin superfamily protein7.1e-3758.62Show/hide
Query:  PRKIIVGGSENWHLGFDYSNWALKNGPFFLNDILVFKYDPPTGATVPHSVYLLSNMQSFSNCDLRRATKLANWTQGNGDGFQFVLKQQKTYYFACGEGNG
        PRKIIVGG + W  GF+Y++WA K  PFFLNDILVFKY+PP  A   HSVYLL N  S+  CD+++   +A+  QG G GF+FVLKQ K YY +CGE +G
Subjt:  PRKIIVGGSENWHLGFDYSNWALKNGPFFLNDILVFKYDPPTGATVPHSVYLLSNMQSFSNCDLRRATKLANWTQGNGDGFQFVLKQQKTYYFACGEGNG

Query:  FHCKNGSMKFSVTPIL
         HC NG+MKF+V P+L
Subjt:  FHCKNGSMKFSVTPIL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
TGGAAGTGGAAGACTCAACTTGGGGAGAAAATTGGAGAAAAAAGTTCAAAATTTGATTTCTTCAATCACTATATAAGAACCCCATCAACTCAAACATTTGGCAACATCAA
CATTTCATTCCCACAATACAAAATTATAGCTTTGTTCTTCCTCTATCAATTTCAAAGCTTCATAAATCCAACCATGGCTTCTTCCATATATTTACTAGCAATTCTCCTCT
TCACAATAGCCTCTATGTCAAATGTAACCTCAGCCATGGGGGATTGGTTTCAAGACTACAATTGCCCTTGGCATTCCAAACAAAGGCATAGGAGTCCCCCGCCCCCACAC
CCGAGAACTCGACGGTTTCCACCACCTCCACAGTCGAAAACTCGACGATTTCCACCACCACCACATCCATCACTGCCTTTGCCGCCACCCCCACAACCGAAAACTCGACG
GTTCCCACCGCCCCCGCAGCCTAAAAGTCGACGGTTCCCACCTCCCCCGCAGCCAAAAACTCGACGGTTTCCTCCGCCACCTCCACAGCCGAAAACTCGACTATTTCAGC
CGCCACCCCCACAACCGAAAACTCGACGATCGCCACCGCCACCACCATCATCACTACTGCCACCATCACCATTGCCATCATTATTGCCTCCGCCGCCGCCGCCACCGCCA
CCACTGCCACCATCACCACTACCGTCACCACCACCCCCACAACCAAAAACTCGACGGTCACCACCGCCACCCCCACAGTCAAAAACTCGACCGTCGCCACCGCCACCTCC
ACCGTCACCACTGCCCCCACCACCATTGCCATCACCGCCATCACTACCGCCCCCACCTTCATCGCGACCGCCCCCACAAAGCCCTAGAAAGATCATAGTGGGTGGCTCGG
AGAATTGGCATCTTGGCTTCGACTACAGCAACTGGGCCCTTAAAAACGGTCCATTTTTTCTCAACGACATTCTAGTGTTCAAGTATGATCCTCCAACGGGCGCAACAGTT
CCCCACAGTGTGTACTTGCTATCAAACATGCAAAGCTTCTCAAACTGTGATCTGAGAAGGGCCACAAAGCTTGCAAATTGGACACAAGGAAATGGCGATGGCTTCCAATT
TGTGCTCAAACAACAAAAAACTTACTACTTTGCCTGTGGAGAAGGCAATGGCTTCCATTGCAAAAATGGATCCATGAAGTTCTCTGTCACACCAATACTTCAGAGTTAG
mRNA sequenceShow/hide mRNA sequence
TGGAAGTGGAAGACTCAACTTGGGGAGAAAATTGGAGAAAAAAGTTCAAAATTTGATTTCTTCAATCACTATATAAGAACCCCATCAACTCAAACATTTGGCAACATCAA
CATTTCATTCCCACAATACAAAATTATAGCTTTGTTCTTCCTCTATCAATTTCAAAGCTTCATAAATCCAACCATGGCTTCTTCCATATATTTACTAGCAATTCTCCTCT
TCACAATAGCCTCTATGTCAAATGTAACCTCAGCCATGGGGGATTGGTTTCAAGACTACAATTGCCCTTGGCATTCCAAACAAAGGCATAGGAGTCCCCCGCCCCCACAC
CCGAGAACTCGACGGTTTCCACCACCTCCACAGTCGAAAACTCGACGATTTCCACCACCACCACATCCATCACTGCCTTTGCCGCCACCCCCACAACCGAAAACTCGACG
GTTCCCACCGCCCCCGCAGCCTAAAAGTCGACGGTTCCCACCTCCCCCGCAGCCAAAAACTCGACGGTTTCCTCCGCCACCTCCACAGCCGAAAACTCGACTATTTCAGC
CGCCACCCCCACAACCGAAAACTCGACGATCGCCACCGCCACCACCATCATCACTACTGCCACCATCACCATTGCCATCATTATTGCCTCCGCCGCCGCCGCCACCGCCA
CCACTGCCACCATCACCACTACCGTCACCACCACCCCCACAACCAAAAACTCGACGGTCACCACCGCCACCCCCACAGTCAAAAACTCGACCGTCGCCACCGCCACCTCC
ACCGTCACCACTGCCCCCACCACCATTGCCATCACCGCCATCACTACCGCCCCCACCTTCATCGCGACCGCCCCCACAAAGCCCTAGAAAGATCATAGTGGGTGGCTCGG
AGAATTGGCATCTTGGCTTCGACTACAGCAACTGGGCCCTTAAAAACGGTCCATTTTTTCTCAACGACATTCTAGTGTTCAAGTATGATCCTCCAACGGGCGCAACAGTT
CCCCACAGTGTGTACTTGCTATCAAACATGCAAAGCTTCTCAAACTGTGATCTGAGAAGGGCCACAAAGCTTGCAAATTGGACACAAGGAAATGGCGATGGCTTCCAATT
TGTGCTCAAACAACAAAAAACTTACTACTTTGCCTGTGGAGAAGGCAATGGCTTCCATTGCAAAAATGGATCCATGAAGTTCTCTGTCACACCAATACTTCAGAGTTAGA
ACAACCCTTCTTCAAATGGATTTATCTTAATTAAAATACTTTTCTTTTTAAACCCTTCACAATAATTTACTGTTTTACGAGCTTTAGTTTAATTTCAATTACTAATTAGT
ACTTAAATACTGTCGAATAAAGAATATCCCATAATGGGAGATCATACTATGCTATCCTACTTTTGTTTTTATGTATTCCCCATTAATCATTGTTTTCATTTTTTTTTTAT
CGAGCATAGTTCAACGGTAATTTAAATACGTGTTTTCCTGTATTTTAAATTTGGTAACAATTTAATCCATATAGTTTAAAATATTTTGATCGTGGAGTAATTCTTAGGGT
GGCATAAGCAAAAAACTTGTCTTATGTCTAGGAAGGAAATAGTGTAAGAAGAAAC
Protein sequenceShow/hide protein sequence
WKWKTQLGEKIGEKSSKFDFFNHYIRTPSTQTFGNINISFPQYKIIALFFLYQFQSFINPTMASSIYLLAILLFTIASMSNVTSAMGDWFQDYNCPWHSKQRHRSPPPPH
PRTRRFPPPPQSKTRRFPPPPHPSLPLPPPPQPKTRRFPPPPQPKSRRFPPPPQPKTRRFPPPPPQPKTRLFQPPPPQPKTRRSPPPPPSSLLPPSPLPSLLPPPPPPPP
PLPPSPLPSPPPPQPKTRRSPPPPPQSKTRPSPPPPPPSPLPPPPLPSPPSLPPPPSSRPPPQSPRKIIVGGSENWHLGFDYSNWALKNGPFFLNDILVFKYDPPTGATV
PHSVYLLSNMQSFSNCDLRRATKLANWTQGNGDGFQFVLKQQKTYYFACGEGNGFHCKNGSMKFSVTPILQS