; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc08g04440 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc08g04440
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionEG45-like domain containing protein
Genome locationchr8:3226111..3230693
RNA-Seq ExpressionMoc08g04440
SyntenyMoc08g04440
Gene Ontology termsGO:0010073 - meristem maintenance (biological process)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR004332 - Transposase, MuDR, plant
IPR007112 - Expansin/pollen allergen, DPBB domain
IPR009009 - RlpA-like protein, double-psi beta-barrel domain
IPR018289 - MULE transposase domain
IPR019557 - Aminotransferase-like, plant mobile domain
IPR036908 - RlpA-like domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0040476.1 serine/threonine-protein phosphatase 7 long form-like protein [Cucumis melo var. makuwa]4.9e-5647.58Show/hide
Query:  RTATDDEFDELELEG-EHEILSQTF---DMDSVNSIAEHESMLRPVTVETDMLYKGFMCNDKRTMQHIVKHFAVKSHHPYEVVESTPSIWTVRCKKWQDG
        RT TDDEF+E++ +G E+E+ S  F   DM++V+ I EH+ ++ P+  +   LY G +C +K  +QH++K FA+KSH PYEVVESTP+ W +RCKK  +G
Subjt:  RTATDDEFDELELEG-EHEILSQTF---DMDSVNSIAEHESMLRPVTVETDMLYKGFMCNDKRTMQHIVKHFAVKSHHPYEVVESTPSIWTVRCKKWQDG

Query:  CNWRLRAILKKSTNLWE--GLGRKNKSFGSYFCLSGVSPRASNSGTHLYGKYKGKLLIATSVDSNENLLPL-------------AFVIVDEESRQTWGWF
        C WRLRAI+KKS  L+E   L  ++  F S    S V   +       +   + K  I  SV S ++++               AF +V+EES  +WGWF
Subjt:  CNWRLRAILKKSTNLWE--GLGRKNKSFGSYFCLSGVSPRASNSGTHLYGKYKGKLLIATSVDSNENLLPL-------------AFVIVDEESRQTWGWF

Query:  LKNLRKVVTHEEICLISDRHGGIISAVNNPDNGWTGPKSHHRFCLRHV
        L++L+++VTH+EICL+SDRH GIISAVNNPDNGWTG K HHRFCLRHV
Subjt:  LKNLRKVVTHEEICLISDRHGGIISAVNNPDNGWTGPKSHHRFCLRHV

TYK27427.1 mediator of RNA polymerase II transcription subunit 12-like isoform X1 [Cucumis melo var. makuwa]4.0e-5838.1Show/hide
Query:  RTATDDEFDELELEG-EHEILSQTF---DMDSVNSIAEHESMLRPVTVETDMLYKGFMCNDKRTMQHIVKHFAVKSHHPYEVVESTPSIWTVRCKKWQDG
        RT TDDEF+E++ +G E+E+ S TF   DM++V+ I EH+ ++ P+  +   LY G +C +K  +QH+VK FA+KSH PYEVVESTP+ W +RCKK  +G
Subjt:  RTATDDEFDELELEG-EHEILSQTF---DMDSVNSIAEHESMLRPVTVETDMLYKGFMCNDKRTMQHIVKHFAVKSHHPYEVVESTPSIWTVRCKKWQDG

Query:  CNWRLRAILKKSTNL-----------------------------------------------------------------WEGLGRK-NKSFGSYF----
        C WRLRAI+KKS  L                                                                 W+G  +   K FG +     
Subjt:  CNWRLRAILKKSTNL-----------------------------------------------------------------WEGLGRK-NKSFGSYF----

Query:  ------------------------------------------CLSGVS---PRASNSGTHLYGKYKGKLLIATSVDSNENLLPLAFVIVDEESRQTWGWF
                                                  C+   S   P      THLYGKY+GKLLIATS+DSN +LLPLAF +V+EES  +WGWF
Subjt:  ------------------------------------------CLSGVS---PRASNSGTHLYGKYKGKLLIATSVDSNENLLPLAFVIVDEESRQTWGWF

Query:  LKNLRKVVTHEEICLISDRHGGIISAVNNPDNGWTGPKSHHRFCLRHVSSNFNKKYN
        L++L+++VTH+EICL+S RH GIISAVNNPDN WTG   HHRFCLRHV   + + ++
Subjt:  LKNLRKVVTHEEICLISDRHGGIISAVNNPDNGWTGPKSHHRFCLRHVSSNFNKKYN

XP_008456454.1 PREDICTED: uncharacterized protein LOC103496397 [Cucumis melo]1.2e-5439.1Show/hide
Query:  RTATDDEFDELELEG-EHEILSQTF---DMDSVNSIAEHESMLRPVTVETDMLYKGFMCNDKRTMQHIVKHFAVKSHHPYEVVESTPSIWTVRCKKWQDG
        RT TDDEF+E++ +G E+E+ S TF   DMD+V+ I EH+ ++ P+ ++   LY G +C +K  +QH+VK FA+KSH PYEVVESTP+ W +RCKK   G
Subjt:  RTATDDEFDELELEG-EHEILSQTF---DMDSVNSIAEHESMLRPVTVETDMLYKGFMCNDKRTMQHIVKHFAVKSHHPYEVVESTPSIWTVRCKKWQDG

Query:  CNWRLRAILKKSTNL-----------------------------------------------------------------WEG--------LGRKNKS--
        C WRLRAI+KKS  L                                                                 W+G         G  ++S  
Subjt:  CNWRLRAILKKSTNL-----------------------------------------------------------------WEG--------LGRKNKS--

Query:  ---------------------------------------FGSY-FCLSGVSPRASNSGTHLYGKYKGKLLIATSVDSNENLLPLAFVIVDEESRQTWGWF
                                               FG Y    S   P     GTHLYGKY+GKLLIATS+DSN +LLPLAF IV+EES  +WGWF
Subjt:  ---------------------------------------FGSY-FCLSGVSPRASNSGTHLYGKYKGKLLIATSVDSNENLLPLAFVIVDEESRQTWGWF

Query:  LKNLRKVVTHEEICLISDRHGGIISAVNNPDNGWT
        L++L+++VTH+E+CL+SDRH GIISAVNNPDNGWT
Subjt:  LKNLRKVVTHEEICLISDRHGGIISAVNNPDNGWT

XP_022131704.1 EG45-like domain containing protein [Momordica charantia]8.8e-58100Show/hide
Query:  MALADVGTATAYGPPYLPTLCNGNSVAQFPPGNLFVAVNEGLWDNGAACGRRYRLRCLSGRNRPCKADIIEVQVVNLCPKSPCPSSFLMSKEAFTAISRL
        MALADVGTATAYGPPYLPTLCNGNSVAQFPPGNLFVAVNEGLWDNGAACGRRYRLRCLSGRNRPCKADIIEVQVVNLCPKSPCPSSFLMSKEAFTAISRL
Subjt:  MALADVGTATAYGPPYLPTLCNGNSVAQFPPGNLFVAVNEGLWDNGAACGRRYRLRCLSGRNRPCKADIIEVQVVNLCPKSPCPSSFLMSKEAFTAISRL

Query:  PNARLNVEYIEI
        PNARLNVEYIEI
Subjt:  PNARLNVEYIEI

XP_022143642.1 uncharacterized protein LOC111013502 [Momordica charantia]4.1e-8772.81Show/hide
Query:  MLRPVTVETDMLYKGFMCNDKRTMQHIVKHFAVKSHHPYEVVESTPSIWTVRCKKWQDGCNWRLRAILKKSTNLWEGLGRKNKSFGSYFCLSGVSPRASN
        MLRPVTVE DMLYKGFMCNDKRTMQHIVK FAVKSHHPYEVVESTPSIW VRCKKWQDGCN RLRAILKK+   +                    P    
Subjt:  MLRPVTVETDMLYKGFMCNDKRTMQHIVKHFAVKSHHPYEVVESTPSIWTVRCKKWQDGCNWRLRAILKKSTNLWEGLGRKNKSFGSYFCLSGVSPRASN

Query:  SGTHLYGKYKGKLLIATSVDSNENLLPLAFVIVDEESRQTWGWFLKNLRKVVTHEEICLISDRHGGIISAVNNPDNGWTGPKSHHRFCLRHVSSNFNKKY
         GTHLY KYKGKLLIATSVDSN +LLPLAF IVDEESRQTWGWF KNLRKVVTHEEICLISDRHGGII AVNN DNGWTGPKSHHRFCLRHVSSN N KY
Subjt:  SGTHLYGKYKGKLLIATSVDSNENLLPLAFVIVDEESRQTWGWFLKNLRKVVTHEEICLISDRHGGIISAVNNPDNGWTGPKSHHRFCLRHVSSNFNKKY

Query:  NCKELKDSMI-------CRPYDNEFANL
         C ELKD +         R Y+ E  N+
Subjt:  NCKELKDSMI-------CRPYDNEFANL

TrEMBL top hitse value%identityAlignment
A0A1S3C4J7 uncharacterized protein LOC1034963975.8e-5539.1Show/hide
Query:  RTATDDEFDELELEG-EHEILSQTF---DMDSVNSIAEHESMLRPVTVETDMLYKGFMCNDKRTMQHIVKHFAVKSHHPYEVVESTPSIWTVRCKKWQDG
        RT TDDEF+E++ +G E+E+ S TF   DMD+V+ I EH+ ++ P+ ++   LY G +C +K  +QH+VK FA+KSH PYEVVESTP+ W +RCKK   G
Subjt:  RTATDDEFDELELEG-EHEILSQTF---DMDSVNSIAEHESMLRPVTVETDMLYKGFMCNDKRTMQHIVKHFAVKSHHPYEVVESTPSIWTVRCKKWQDG

Query:  CNWRLRAILKKSTNL-----------------------------------------------------------------WEG--------LGRKNKS--
        C WRLRAI+KKS  L                                                                 W+G         G  ++S  
Subjt:  CNWRLRAILKKSTNL-----------------------------------------------------------------WEG--------LGRKNKS--

Query:  ---------------------------------------FGSY-FCLSGVSPRASNSGTHLYGKYKGKLLIATSVDSNENLLPLAFVIVDEESRQTWGWF
                                               FG Y    S   P     GTHLYGKY+GKLLIATS+DSN +LLPLAF IV+EES  +WGWF
Subjt:  ---------------------------------------FGSY-FCLSGVSPRASNSGTHLYGKYKGKLLIATSVDSNENLLPLAFVIVDEESRQTWGWF

Query:  LKNLRKVVTHEEICLISDRHGGIISAVNNPDNGWT
        L++L+++VTH+E+CL+SDRH GIISAVNNPDNGWT
Subjt:  LKNLRKVVTHEEICLISDRHGGIISAVNNPDNGWT

A0A5D3DIM2 Serine/threonine-protein phosphatase 7 long form-like protein2.3e-5647.58Show/hide
Query:  RTATDDEFDELELEG-EHEILSQTF---DMDSVNSIAEHESMLRPVTVETDMLYKGFMCNDKRTMQHIVKHFAVKSHHPYEVVESTPSIWTVRCKKWQDG
        RT TDDEF+E++ +G E+E+ S  F   DM++V+ I EH+ ++ P+  +   LY G +C +K  +QH++K FA+KSH PYEVVESTP+ W +RCKK  +G
Subjt:  RTATDDEFDELELEG-EHEILSQTF---DMDSVNSIAEHESMLRPVTVETDMLYKGFMCNDKRTMQHIVKHFAVKSHHPYEVVESTPSIWTVRCKKWQDG

Query:  CNWRLRAILKKSTNLWE--GLGRKNKSFGSYFCLSGVSPRASNSGTHLYGKYKGKLLIATSVDSNENLLPL-------------AFVIVDEESRQTWGWF
        C WRLRAI+KKS  L+E   L  ++  F S    S V   +       +   + K  I  SV S ++++               AF +V+EES  +WGWF
Subjt:  CNWRLRAILKKSTNLWE--GLGRKNKSFGSYFCLSGVSPRASNSGTHLYGKYKGKLLIATSVDSNENLLPL-------------AFVIVDEESRQTWGWF

Query:  LKNLRKVVTHEEICLISDRHGGIISAVNNPDNGWTGPKSHHRFCLRHV
        L++L+++VTH+EICL+SDRH GIISAVNNPDNGWTG K HHRFCLRHV
Subjt:  LKNLRKVVTHEEICLISDRHGGIISAVNNPDNGWTGPKSHHRFCLRHV

A0A5D3DUI1 Mediator of RNA polymerase II transcription subunit 12-like isoform X11.9e-5838.1Show/hide
Query:  RTATDDEFDELELEG-EHEILSQTF---DMDSVNSIAEHESMLRPVTVETDMLYKGFMCNDKRTMQHIVKHFAVKSHHPYEVVESTPSIWTVRCKKWQDG
        RT TDDEF+E++ +G E+E+ S TF   DM++V+ I EH+ ++ P+  +   LY G +C +K  +QH+VK FA+KSH PYEVVESTP+ W +RCKK  +G
Subjt:  RTATDDEFDELELEG-EHEILSQTF---DMDSVNSIAEHESMLRPVTVETDMLYKGFMCNDKRTMQHIVKHFAVKSHHPYEVVESTPSIWTVRCKKWQDG

Query:  CNWRLRAILKKSTNL-----------------------------------------------------------------WEGLGRK-NKSFGSYF----
        C WRLRAI+KKS  L                                                                 W+G  +   K FG +     
Subjt:  CNWRLRAILKKSTNL-----------------------------------------------------------------WEGLGRK-NKSFGSYF----

Query:  ------------------------------------------CLSGVS---PRASNSGTHLYGKYKGKLLIATSVDSNENLLPLAFVIVDEESRQTWGWF
                                                  C+   S   P      THLYGKY+GKLLIATS+DSN +LLPLAF +V+EES  +WGWF
Subjt:  ------------------------------------------CLSGVS---PRASNSGTHLYGKYKGKLLIATSVDSNENLLPLAFVIVDEESRQTWGWF

Query:  LKNLRKVVTHEEICLISDRHGGIISAVNNPDNGWTGPKSHHRFCLRHVSSNFNKKYN
        L++L+++VTH+EICL+S RH GIISAVNNPDN WTG   HHRFCLRHV   + + ++
Subjt:  LKNLRKVVTHEEICLISDRHGGIISAVNNPDNGWTGPKSHHRFCLRHVSSNFNKKYN

A0A6J1BR12 EG45-like domain containing protein4.3e-58100Show/hide
Query:  MALADVGTATAYGPPYLPTLCNGNSVAQFPPGNLFVAVNEGLWDNGAACGRRYRLRCLSGRNRPCKADIIEVQVVNLCPKSPCPSSFLMSKEAFTAISRL
        MALADVGTATAYGPPYLPTLCNGNSVAQFPPGNLFVAVNEGLWDNGAACGRRYRLRCLSGRNRPCKADIIEVQVVNLCPKSPCPSSFLMSKEAFTAISRL
Subjt:  MALADVGTATAYGPPYLPTLCNGNSVAQFPPGNLFVAVNEGLWDNGAACGRRYRLRCLSGRNRPCKADIIEVQVVNLCPKSPCPSSFLMSKEAFTAISRL

Query:  PNARLNVEYIEI
        PNARLNVEYIEI
Subjt:  PNARLNVEYIEI

A0A6J1CRF1 uncharacterized protein LOC1110135022.0e-8772.81Show/hide
Query:  MLRPVTVETDMLYKGFMCNDKRTMQHIVKHFAVKSHHPYEVVESTPSIWTVRCKKWQDGCNWRLRAILKKSTNLWEGLGRKNKSFGSYFCLSGVSPRASN
        MLRPVTVE DMLYKGFMCNDKRTMQHIVK FAVKSHHPYEVVESTPSIW VRCKKWQDGCN RLRAILKK+   +                    P    
Subjt:  MLRPVTVETDMLYKGFMCNDKRTMQHIVKHFAVKSHHPYEVVESTPSIWTVRCKKWQDGCNWRLRAILKKSTNLWEGLGRKNKSFGSYFCLSGVSPRASN

Query:  SGTHLYGKYKGKLLIATSVDSNENLLPLAFVIVDEESRQTWGWFLKNLRKVVTHEEICLISDRHGGIISAVNNPDNGWTGPKSHHRFCLRHVSSNFNKKY
         GTHLY KYKGKLLIATSVDSN +LLPLAF IVDEESRQTWGWF KNLRKVVTHEEICLISDRHGGII AVNN DNGWTGPKSHHRFCLRHVSSN N KY
Subjt:  SGTHLYGKYKGKLLIATSVDSNENLLPLAFVIVDEESRQTWGWFLKNLRKVVTHEEICLISDRHGGIISAVNNPDNGWTGPKSHHRFCLRHVSSNFNKKY

Query:  NCKELKDSMI-------CRPYDNEFANL
         C ELKD +         R Y+ E  N+
Subjt:  NCKELKDSMI-------CRPYDNEFANL

SwissProt top hitse value%identityAlignment
Q9LNG5 Serine/threonine-protein phosphatase 7 long form homolog4.3e-1543.81Show/hide
Query:  DSMICRPYDNE-FANLPDFCVNGHNIWCTVSPLICFHIVEWHHHDRVTRQFGMKQTILEVPC-WDKRIHDIDIRDSTLQD----HIAHFVARWNIRSQFL
        + +I +PY  +  A +P  CV+G NIW TV+PLICF +VEWH  DRV RQFG+ QTI   PC  +K +H ID R  +  D    H  H +  W  R   +
Subjt:  DSMICRPYDNE-FANLPDFCVNGHNIWCTVSPLICFHIVEWHHHDRVTRQFGMKQTILEVPC-WDKRIHDIDIRDSTLQD----HIAHFVARWNIRSQFL

Query:  VINPP
        V   P
Subjt:  VINPP

Q9M0C2 Putative EG45-like domain containing protein 16.7e-0834.57Show/hide
Query:  GNLFVAVNEGLWDNGAACGRRYRLRCLSGRN---RPCKADIIEVQVVNLCPKSPCPSSFLMSKEAFTAISRLPNARLNVEY
        G +  A ++ LWDNG  CG+ + ++C   RN    PC    ++V++V+ CP S C S+  +S+EAF  I+      +N++Y
Subjt:  GNLFVAVNEGLWDNGAACGRRYRLRCLSGRN---RPCKADIIEVQVVNLCPKSPCPSSFLMSKEAFTAISRLPNARLNVEY

Q9ZP41 EG45-like domain containing protein1.8e-1333.93Show/hide
Query:  ALADVGTATAYGPPYLPTLCNGNSVAQFPPGNLFVAVNEGLWDNGAACGRRYRLRCLSGRNR----PCKADIIEVQVVNLCPKSPCPSSFLMSKEAFTAI
        A A  GTAT Y PPY+P+ CNG        G +  A +  +W+NGA C + +R++C    N+    PC+   + V++V+LCP + C ++  +S+EAF+ I
Subjt:  ALADVGTATAYGPPYLPTLCNGNSVAQFPPGNLFVAVNEGLWDNGAACGRRYRLRCLSGRNR----PCKADIIEVQVVNLCPKSPCPSSFLMSKEAFTAI

Query:  SRLPNARLNVEY
        +     ++ +E+
Subjt:  SRLPNARLNVEY

Q9ZV52 EG45-like domain containing protein 29.9e-1235.65Show/hide
Query:  MALADVGTATAYGPPYLPTLCNGNSVAQFPPGNLFVAVNEGLWDNGAACGRRYRLRCLSGR---NRPCKADIIEVQVVNLCPKSPCPSSFLMSKEAFTAI
        +A A  G A  Y PPY  + C G          L V V   LW NG ACGRRYR+RC+      +R C    ++V+VV+ C + PC     +S++AF  I
Subjt:  MALADVGTATAYGPPYLPTLCNGNSVAQFPPGNLFVAVNEGLWDNGAACGRRYRLRCLSGR---NRPCKADIIEVQVVNLCPKSPCPSSFLMSKEAFTAI

Query:  SRLPNARLNVEYIEI
        +      + V Y  I
Subjt:  SRLPNARLNVEYIEI

Arabidopsis top hitse value%identityAlignment
AT1G48120.1 hydrolases;protein serine/threonine phosphatases3.1e-1643.81Show/hide
Query:  DSMICRPYDNE-FANLPDFCVNGHNIWCTVSPLICFHIVEWHHHDRVTRQFGMKQTILEVPC-WDKRIHDIDIRDSTLQD----HIAHFVARWNIRSQFL
        + +I +PY  +  A +P  CV+G NIW TV+PLICF +VEWH  DRV RQFG+ QTI   PC  +K +H ID R  +  D    H  H +  W  R   +
Subjt:  DSMICRPYDNE-FANLPDFCVNGHNIWCTVSPLICFHIVEWHHHDRVTRQFGMKQTILEVPC-WDKRIHDIDIRDSTLQD----HIAHFVARWNIRSQFL

Query:  VINPP
        V   P
Subjt:  VINPP

AT1G49920.1 MuDR family transposase4.7e-1743.48Show/hide
Query:  HLYGKYKGKLLIATSVDSNENLLPLAFVIVDEESRQTWGWFLKNLRKVVTHEE-ICLISDRHGGIISAVNNPDNGWTGPKSHHRFCLRHVSS
        +L GKYK KL+IA++ D+     PLAF +  E S  +W WFL  +R+ VT  + ICLIS     I++ +N P + W  P ++HRFCL H+ S
Subjt:  HLYGKYKGKLLIATSVDSNENLLPLAFVIVDEESRQTWGWFLKNLRKVVTHEE-ICLISDRHGGIISAVNNPDNGWTGPKSHHRFCLRHVSS

AT1G64255.1 MuDR family transposase3.4e-1540.43Show/hide
Query:  KYKGKLLIATSVDSNENLLPLAFVIVDEESRQTWGWFLKNLRKVVTHEE-ICLISDRHGGIISAVNNPDNGWTGPKSHHRFCLRHVSSNFNKKY
        +Y+ KL+IA+ VD+     PLAF +  E S   W WFL  +R+ VT  + +CLIS  H  II+ VN   + W  P ++HRF L H  S F++ +
Subjt:  KYKGKLLIATSVDSNENLLPLAFVIVDEESRQTWGWFLKNLRKVVTHEE-ICLISDRHGGIISAVNNPDNGWTGPKSHHRFCLRHVSSNFNKKY

AT1G64260.1 MuDR family transposase1.6e-1740.95Show/hide
Query:  LYGKYKGKLLIATSVDSNENLLPLAFVIVDEESRQTWGWFLKNLR-KVVTHEEICLISDRHGGIISAVNNPDNGWTGPKSHHRFCLRHVSSNF---NKKY
        L GKY+ KL+IA+ VD+     PLAF +  E S  +W WF   +R KV   +++CLIS     I++ VN P + W  P +HH+FCL H+ S F    + Y
Subjt:  LYGKYKGKLLIATSVDSNENLLPLAFVIVDEESRQTWGWFLKNLR-KVVTHEEICLISDRHGGIISAVNNPDNGWTGPKSHHRFCLRHVSSNF---NKKY

Query:  NCKEL
        N + L
Subjt:  NCKEL

AT2G18660.1 plant natriuretic peptide A7.1e-1335.65Show/hide
Query:  MALADVGTATAYGPPYLPTLCNGNSVAQFPPGNLFVAVNEGLWDNGAACGRRYRLRCLSGR---NRPCKADIIEVQVVNLCPKSPCPSSFLMSKEAFTAI
        +A A  G A  Y PPY  + C G          L V V   LW NG ACGRRYR+RC+      +R C    ++V+VV+ C + PC     +S++AF  I
Subjt:  MALADVGTATAYGPPYLPTLCNGNSVAQFPPGNLFVAVNEGLWDNGAACGRRYRLRCLSGR---NRPCKADIIEVQVVNLCPKSPCPSSFLMSKEAFTAI

Query:  SRLPNARLNVEYIEI
        +      + V Y  I
Subjt:  SRLPNARLNVEYIEI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTCTTGCAGATGTTGGCACTGCCACAGCCTACGGTCCCCCATATCTTCCCACCCTTTGTAATGGGAACAGCGTTGCCCAGTTCCCGCCCGGCAACCTCTTCGTGGC
GGTGAACGAAGGGCTGTGGGACAATGGCGCTGCCTGCGGTCGACGGTATAGATTACGATGCTTGAGCGGGCGAAACCGCCCGTGCAAGGCCGACATCATCGAAGTTCAGG
TGGTCAACCTCTGCCCCAAATCGCCATGCCCCTCTTCCTTTCTCATGTCCAAAGAGGCCTTTACCGCCATCTCCCGCCTCCCCAATGCCAGACTCAACGTCGAATATATT
GAGATTGAAATGGCAGCTAAATGTTTGACGATTGTACTGTATATGAATGGTGGTACTGTTGATGGTGTAAATGAAATTGATTACGATGGACCATCAAGTAGAGGTTTTAC
TGTCTACAGTGGTATTGAGTTTGAACATTTTGTTCAAATAGTTGGAGTGTCACTAAGGACAGCGACAGATGATGAGTTTGATGAATTGGAACTCGAGGGTGAACATGAGA
TCCTTTCTCAAACATTTGACATGGACAGTGTTAACAGTATTGCAGAACACGAGTCCATGTTGCGACCAGTGACAGTTGAAACTGATATGTTGTACAAGGGGTTCATGTGT
AATGATAAAAGAACTATGCAACATATCGTCAAACATTTTGCTGTAAAGAGTCACCATCCTTACGAAGTTGTAGAGTCGACACCATCTATATGGACAGTTAGATGTAAGAA
GTGGCAAGATGGATGCAATTGGCGACTTCGTGCGATTCTTAAGAAAAGTACTAACTTATGGGAGGGTCTGGGAAGGAAAAACAAAAGCTTTGGCTCGTATTTTTGCCTTT
CAGGAGTTTCGCCCCGTGCTTCAAATAGTGGTACTCACCTTTACGGAAAGTATAAGGGGAAGCTATTGATTGCAACATCAGTCGATTCAAATGAAAATTTGTTACCTCTT
GCGTTTGTCATTGTAGACGAAGAGAGTCGTCAGACCTGGGGATGGTTTTTAAAAAACCTAAGAAAAGTTGTTACGCATGAAGAGATATGTTTAATTTCAGATCGACATGG
TGGTATTATCTCTGCGGTTAATAATCCAGACAATGGTTGGACCGGACCTAAATCGCATCACAGATTCTGTCTACGACATGTTTCTAGCAACTTCAACAAAAAATACAATT
GTAAAGAGTTGAAAGATTCGATGATCTGTAGGCCATATGATAACGAATTTGCCAACCTACCCGACTTCTGTGTCAATGGACATAACATCTGGTGCACTGTGAGTCCACTC
ATATGCTTTCATATTGTGGAGTGGCATCATCATGATAGAGTTACAAGACAATTTGGTATGAAACAAACAATTCTAGAGGTGCCATGCTGGGATAAGAGGATACATGACAT
CGACATTCGGGATAGCACCTTGCAGGATCATATTGCACATTTTGTGGCGAGGTGGAACATACGTAGCCAATTTCTTGTTATCAATCCTCCAGTCGACGACGACGGTACTT
GCGATCCAGGATACATGAGCTGA
mRNA sequenceShow/hide mRNA sequence
ATGGCTCTTGCAGATGTTGGCACTGCCACAGCCTACGGTCCCCCATATCTTCCCACCCTTTGTAATGGGAACAGCGTTGCCCAGTTCCCGCCCGGCAACCTCTTCGTGGC
GGTGAACGAAGGGCTGTGGGACAATGGCGCTGCCTGCGGTCGACGGTATAGATTACGATGCTTGAGCGGGCGAAACCGCCCGTGCAAGGCCGACATCATCGAAGTTCAGG
TGGTCAACCTCTGCCCCAAATCGCCATGCCCCTCTTCCTTTCTCATGTCCAAAGAGGCCTTTACCGCCATCTCCCGCCTCCCCAATGCCAGACTCAACGTCGAATATATT
GAGATTGAAATGGCAGCTAAATGTTTGACGATTGTACTGTATATGAATGGTGGTACTGTTGATGGTGTAAATGAAATTGATTACGATGGACCATCAAGTAGAGGTTTTAC
TGTCTACAGTGGTATTGAGTTTGAACATTTTGTTCAAATAGTTGGAGTGTCACTAAGGACAGCGACAGATGATGAGTTTGATGAATTGGAACTCGAGGGTGAACATGAGA
TCCTTTCTCAAACATTTGACATGGACAGTGTTAACAGTATTGCAGAACACGAGTCCATGTTGCGACCAGTGACAGTTGAAACTGATATGTTGTACAAGGGGTTCATGTGT
AATGATAAAAGAACTATGCAACATATCGTCAAACATTTTGCTGTAAAGAGTCACCATCCTTACGAAGTTGTAGAGTCGACACCATCTATATGGACAGTTAGATGTAAGAA
GTGGCAAGATGGATGCAATTGGCGACTTCGTGCGATTCTTAAGAAAAGTACTAACTTATGGGAGGGTCTGGGAAGGAAAAACAAAAGCTTTGGCTCGTATTTTTGCCTTT
CAGGAGTTTCGCCCCGTGCTTCAAATAGTGGTACTCACCTTTACGGAAAGTATAAGGGGAAGCTATTGATTGCAACATCAGTCGATTCAAATGAAAATTTGTTACCTCTT
GCGTTTGTCATTGTAGACGAAGAGAGTCGTCAGACCTGGGGATGGTTTTTAAAAAACCTAAGAAAAGTTGTTACGCATGAAGAGATATGTTTAATTTCAGATCGACATGG
TGGTATTATCTCTGCGGTTAATAATCCAGACAATGGTTGGACCGGACCTAAATCGCATCACAGATTCTGTCTACGACATGTTTCTAGCAACTTCAACAAAAAATACAATT
GTAAAGAGTTGAAAGATTCGATGATCTGTAGGCCATATGATAACGAATTTGCCAACCTACCCGACTTCTGTGTCAATGGACATAACATCTGGTGCACTGTGAGTCCACTC
ATATGCTTTCATATTGTGGAGTGGCATCATCATGATAGAGTTACAAGACAATTTGGTATGAAACAAACAATTCTAGAGGTGCCATGCTGGGATAAGAGGATACATGACAT
CGACATTCGGGATAGCACCTTGCAGGATCATATTGCACATTTTGTGGCGAGGTGGAACATACGTAGCCAATTTCTTGTTATCAATCCTCCAGTCGACGACGACGGTACTT
GCGATCCAGGATACATGAGCTGA
Protein sequenceShow/hide protein sequence
MALADVGTATAYGPPYLPTLCNGNSVAQFPPGNLFVAVNEGLWDNGAACGRRYRLRCLSGRNRPCKADIIEVQVVNLCPKSPCPSSFLMSKEAFTAISRLPNARLNVEYI
EIEMAAKCLTIVLYMNGGTVDGVNEIDYDGPSSRGFTVYSGIEFEHFVQIVGVSLRTATDDEFDELELEGEHEILSQTFDMDSVNSIAEHESMLRPVTVETDMLYKGFMC
NDKRTMQHIVKHFAVKSHHPYEVVESTPSIWTVRCKKWQDGCNWRLRAILKKSTNLWEGLGRKNKSFGSYFCLSGVSPRASNSGTHLYGKYKGKLLIATSVDSNENLLPL
AFVIVDEESRQTWGWFLKNLRKVVTHEEICLISDRHGGIISAVNNPDNGWTGPKSHHRFCLRHVSSNFNKKYNCKELKDSMICRPYDNEFANLPDFCVNGHNIWCTVSPL
ICFHIVEWHHHDRVTRQFGMKQTILEVPCWDKRIHDIDIRDSTLQDHIAHFVARWNIRSQFLVINPPVDDDGTCDPGYMS