CuGenDBv2

Moc09g40690 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc09g40690
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Protein of Unknown Function (DUF239)
Genome location: chr9:31042283..31044784
RNA-Seq Expression: Moc09g40690
Synteny: Moc09g40690
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
InterPro domains: IPR004314 - Neprosin
                  IPR025521 - Neprosin activation peptide
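
The genome location above follows a `chromosome:start..end` pattern. As a minimal sketch, it could be split into its parts as shown below. The helper name `parse_location` is hypothetical, and the span calculation assumes 1-based inclusive coordinates, which this page does not state.

```python
import re

def parse_location(loc):
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    m = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", loc.strip())
    if m is None:
        raise ValueError(f"unrecognised location: {loc!r}")
    seqid, start, end = m.group(1), int(m.group(2)), int(m.group(3))
    if start > end:
        raise ValueError(f"start is greater than end in {loc!r}")
    return seqid, start, end

seqid, start, end = parse_location("chr9:31042283..31044784")
# Assumption: coordinates are 1-based and inclusive, so both ends count.
span = end - start + 1
print(seqid, start, end, span)
```

Under that assumption the gene region spans 2502 bp.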


Homology
GenBank top hits (hit | e value | % identity | alignment)
KAG6601521.1 hypothetical protein SDJN03_06754, partial [Cucurbita argyrosperma subsp. sororia] | e value: 6.6e-222 | % identity: 89.13
Query:  MDSSN----LVPVLVFSLLVVSSFCPV---HSDDKNILPNNHTATLQPEDELNKLKLIRAHLKKINKPPVKTIQSPDGDLIDCVISHHQPAFDHPLLKGQ
        MDSSN    L+P+LVFSLLVVS   PV   H DDK I PNN T T +P+D+L KLKLIRAHLKKINKP VKTIQSPDGDLIDCVISH QPAFDHPLLKGQ
Subjt:  MDSSN----LVPVLVFSLLVVSSFCPV---HSDDKNILPNNHTATLQPEDELNKLKLIRAHLKKINKPPVKTIQSPDGDLIDCVISHHQPAFDHPLLKGQ

Query:  KPLDLPERPYDRSPSGGGASETFQLWSMSGESCPEGTVPIRRTREKDMLRASSVQRFGRKVRRRRIRRDSTSNGHEHAVGFVSGEQYYGAKASINVWAPR
        KPLD+ +RPYD+SPSG   SE+FQLWSMSGESCPEGTVPIRRT EKDMLRASSV+RFGRK  RRRIRRDS++ GHEHAVGFVSG++YYGAKASINVWAPR
Subjt:  KPLDLPERPYDRSPSGGGASETFQLWSMSGESCPEGTVPIRRTREKDMLRASSVQRFGRKVRRRRIRRDSTSNGHEHAVGFVSGEQYYGAKASINVWAPR

Query:  VTNQYEFSLSQMWVISGSFGDDLNTIEAGWQVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNSRIAIGAAISPTSSYNGGQFDISLLVWKDP
        VTNQYEFSLSQMWVISGSFGDDLNTIEAGWQVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNSRIAIGAAISPTSSYNGGQFDISLLVWKDP
Subjt:  VTNQYEFSLSQMWVISGSFGDDLNTIEAGWQVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNSRIAIGAAISPTSSYNGGQFDISLLVWKDP

Query:  KHGNWWLEFGSGVLVGYWPAFLFTHLQSHATMIQFGGEVVNSRASGFHTATQMGSGHFAGEGFGKASYFRNLQVVDWDNSLIPLSNLKVLADHPNCYDIQ
        KHGNWWLEFGSGVLVGYWPAFLFTHL+SHATMIQFGGEVVNS+ SGFHTATQMGSGHFAGEGFGKASYFRNLQVVDWDNSLIPLSNLKVLADHPNCY+IQ
Subjt:  KHGNWWLEFGSGVLVGYWPAFLFTHLQSHATMIQFGGEVVNSRASGFHTATQMGSGHFAGEGFGKASYFRNLQVVDWDNSLIPLSNLKVLADHPNCYDIQ

Query:  GGINRVWGNYFYYGGPGRNVRCP
        GGINR+WGNYFYYGGPGRNVRCP
Subjt:  GGINRVWGNYFYYGGPGRNVRCP

XP_008446177.1 PREDICTED: uncharacterized protein LOC103488982 [Cucumis melo] | e value: 7.7e-223 | % identity: 90.58
Query:  SSNLVPVLVFSLLVVSSFCPVHSDDKNILPNNHTATLQPEDELNKLKLIRAHLKKINKPPVKTIQSPDGDLIDCVISHHQPAFDHPLLKGQKPLDLPERP
        SS L+PVLVFSLLVVSSFCPVHS      P N T T +P+D L KLKLIRAHLKKINKPP+KTIQSPDGDLIDCVI+H QPAFDHPLLKGQKPLDLPERP
Subjt:  SSNLVPVLVFSLLVVSSFCPVHSDDKNILPNNHTATLQPEDELNKLKLIRAHLKKINKPPVKTIQSPDGDLIDCVISHHQPAFDHPLLKGQKPLDLPERP

Query:  YDRSPSGGGASETFQLWSMSGESCPEGTVPIRRTREKDMLRASSVQRFGRKVRRRRIRRDSTSNGHEHAVGFVSGEQYYGAKASINVWAPRVTNQYEFSL
        Y+RS SG  +SETFQLWSMSGE CPEG+VPIRRT E DM+RASSVQRFGRKV RRRIRRDS+S+GHEHAVGFVSGE+YYGAK SINVWAPRVTNQYEFSL
Subjt:  YDRSPSGGGASETFQLWSMSGESCPEGTVPIRRTREKDMLRASSVQRFGRKVRRRRIRRDSTSNGHEHAVGFVSGEQYYGAKASINVWAPRVTNQYEFSL

Query:  SQMWVISGSFGDDLNTIEAGWQVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNSRIAIGAAISPTSSYNGGQFDISLLVWKDPKHGNWWLEF
        SQMWVISGSFGDDLNTIEAGWQVSPELYGDNYPRFFTYWTSDAYQ TGCYNLLCSGFVQTN+RIAIGAAISPTSSYNGGQFDISLLVWKDPKHGNWWLEF
Subjt:  SQMWVISGSFGDDLNTIEAGWQVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNSRIAIGAAISPTSSYNGGQFDISLLVWKDPKHGNWWLEF

Query:  GSGVLVGYWPAFLFTHLQSHATMIQFGGEVVNSRASGFHTATQMGSGHFAGEGFGKASYFRNLQVVDWDNSLIPLSNLKVLADHPNCYDIQGGINRVWGN
        GSGVLVGYWPAFLFTHL+SHA+MIQFGGEVVNSRASGFHTATQMGSGHFAGEGFGKASYFRNLQVVDWDNSLIPLSNLKVLADHPNCY+IQGGINRVWGN
Subjt:  GSGVLVGYWPAFLFTHLQSHATMIQFGGEVVNSRASGFHTATQMGSGHFAGEGFGKASYFRNLQVVDWDNSLIPLSNLKVLADHPNCYDIQGGINRVWGN

Query:  YFYYGGPGRNVRCP
        YFYYGGPGRNVRCP
Subjt:  YFYYGGPGRNVRCP

XP_022151755.1 uncharacterized protein LOC111019661 [Momordica charantia] | e value: 1.8e-280 | % identity: 100
Query:  MLSTFRWAKAKNLWRCCGDCRRRQGQCTRPNYFAKQTRQPNFPLMIMDSSNLVPVLVFSLLVVSSFCPVHSDDKNILPNNHTATLQPEDELNKLKLIRAH
        MLSTFRWAKAKNLWRCCGDCRRRQGQCTRPNYFAKQTRQPNFPLMIMDSSNLVPVLVFSLLVVSSFCPVHSDDKNILPNNHTATLQPEDELNKLKLIRAH
Subjt:  MLSTFRWAKAKNLWRCCGDCRRRQGQCTRPNYFAKQTRQPNFPLMIMDSSNLVPVLVFSLLVVSSFCPVHSDDKNILPNNHTATLQPEDELNKLKLIRAH

Query:  LKKINKPPVKTIQSPDGDLIDCVISHHQPAFDHPLLKGQKPLDLPERPYDRSPSGGGASETFQLWSMSGESCPEGTVPIRRTREKDMLRASSVQRFGRKV
        LKKINKPPVKTIQSPDGDLIDCVISHHQPAFDHPLLKGQKPLDLPERPYDRSPSGGGASETFQLWSMSGESCPEGTVPIRRTREKDMLRASSVQRFGRKV
Subjt:  LKKINKPPVKTIQSPDGDLIDCVISHHQPAFDHPLLKGQKPLDLPERPYDRSPSGGGASETFQLWSMSGESCPEGTVPIRRTREKDMLRASSVQRFGRKV

Query:  RRRRIRRDSTSNGHEHAVGFVSGEQYYGAKASINVWAPRVTNQYEFSLSQMWVISGSFGDDLNTIEAGWQVSPELYGDNYPRFFTYWTSDAYQATGCYNL
        RRRRIRRDSTSNGHEHAVGFVSGEQYYGAKASINVWAPRVTNQYEFSLSQMWVISGSFGDDLNTIEAGWQVSPELYGDNYPRFFTYWTSDAYQATGCYNL
Subjt:  RRRRIRRDSTSNGHEHAVGFVSGEQYYGAKASINVWAPRVTNQYEFSLSQMWVISGSFGDDLNTIEAGWQVSPELYGDNYPRFFTYWTSDAYQATGCYNL

Query:  LCSGFVQTNSRIAIGAAISPTSSYNGGQFDISLLVWKDPKHGNWWLEFGSGVLVGYWPAFLFTHLQSHATMIQFGGEVVNSRASGFHTATQMGSGHFAGE
        LCSGFVQTNSRIAIGAAISPTSSYNGGQFDISLLVWKDPKHGNWWLEFGSGVLVGYWPAFLFTHLQSHATMIQFGGEVVNSRASGFHTATQMGSGHFAGE
Subjt:  LCSGFVQTNSRIAIGAAISPTSSYNGGQFDISLLVWKDPKHGNWWLEFGSGVLVGYWPAFLFTHLQSHATMIQFGGEVVNSRASGFHTATQMGSGHFAGE

Query:  GFGKASYFRNLQVVDWDNSLIPLSNLKVLADHPNCYDIQGGINRVWGNYFYYGGPGRNVRCP
        GFGKASYFRNLQVVDWDNSLIPLSNLKVLADHPNCYDIQGGINRVWGNYFYYGGPGRNVRCP
Subjt:  GFGKASYFRNLQVVDWDNSLIPLSNLKVLADHPNCYDIQGGINRVWGNYFYYGGPGRNVRCP

XP_022956450.1 uncharacterized protein LOC111458179 [Cucurbita moschata] | e value: 2.5e-221 | % identity: 89.13
Query:  MDSSN----LVPVLVFSLLVVSSFCPV---HSDDKNILPNNHTATLQPEDELNKLKLIRAHLKKINKPPVKTIQSPDGDLIDCVISHHQPAFDHPLLKGQ
        MDSSN    L+P+LVFSLLVVS   PV   H DDK I PNN T T +P+D+L KLKLIRAHLKKINKP VKTIQSPDGDLIDCVISH QPAFDHPLLKGQ
Subjt:  MDSSN----LVPVLVFSLLVVSSFCPV---HSDDKNILPNNHTATLQPEDELNKLKLIRAHLKKINKPPVKTIQSPDGDLIDCVISHHQPAFDHPLLKGQ

Query:  KPLDLPERPYDRSPSGGGASETFQLWSMSGESCPEGTVPIRRTREKDMLRASSVQRFGRKVRRRRIRRDSTSNGHEHAVGFVSGEQYYGAKASINVWAPR
        KPLD+ ERPYD+S SG   SE+FQLWSMSGESCPEGTVPIRRT EKDMLRASS++RFGRK  RRRIRRDS++ GHEHAVGFVSG++YYGAKASINVWAPR
Subjt:  KPLDLPERPYDRSPSGGGASETFQLWSMSGESCPEGTVPIRRTREKDMLRASSVQRFGRKVRRRRIRRDSTSNGHEHAVGFVSGEQYYGAKASINVWAPR

Query:  VTNQYEFSLSQMWVISGSFGDDLNTIEAGWQVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNSRIAIGAAISPTSSYNGGQFDISLLVWKDP
        VTNQYEFSLSQMWVISGSFGDDLNTIEAGWQVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNSRIAIGAAISPTSSYNGGQFDISLLVWKDP
Subjt:  VTNQYEFSLSQMWVISGSFGDDLNTIEAGWQVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNSRIAIGAAISPTSSYNGGQFDISLLVWKDP

Query:  KHGNWWLEFGSGVLVGYWPAFLFTHLQSHATMIQFGGEVVNSRASGFHTATQMGSGHFAGEGFGKASYFRNLQVVDWDNSLIPLSNLKVLADHPNCYDIQ
        KHGNWWLEFGSGVLVGYWPAFLFTHL+SHATMIQFGGEVVNS+ SGFHTATQMGSGHFAGEGFGKASYFRNLQVVDWDNSLIPLSNLKVLADHPNCY+IQ
Subjt:  KHGNWWLEFGSGVLVGYWPAFLFTHLQSHATMIQFGGEVVNSRASGFHTATQMGSGHFAGEGFGKASYFRNLQVVDWDNSLIPLSNLKVLADHPNCYDIQ

Query:  GGINRVWGNYFYYGGPGRNVRCP
        GGINRVWGNYFYYGGPGRNVRCP
Subjt:  GGINRVWGNYFYYGGPGRNVRCP

XP_038891414.1 uncharacterized protein LOC120080834 isoform X1 [Benincasa hispida] | e value: 4.0e-227 | % identity: 92.27
Query:  SSNLVPVLVFSLLVVSSFCPVHSDDKNILPNNHTATLQPEDELNKLKLIRAHLKKINKPPVKTIQSPDGDLIDCVISHHQPAFDHPLLKGQKPLDLPERP
        SS+LV VLVFSLLVVSSFCPVHS DK I   N T T +P+D   KLKLIRAHLKKINKPP+KTIQSPDGDLIDCVISH QPAFDHPLLKG+KPLDLPERP
Subjt:  SSNLVPVLVFSLLVVSSFCPVHSDDKNILPNNHTATLQPEDELNKLKLIRAHLKKINKPPVKTIQSPDGDLIDCVISHHQPAFDHPLLKGQKPLDLPERP

Query:  YDRSPSGGGASETFQLWSMSGESCPEGTVPIRRTREKDMLRASSVQRFGRKVRRRRIRRDSTSNGHEHAVGFVSGEQYYGAKASINVWAPRVTNQYEFSL
        ++RS SGGG SETFQLWSMSGE CPEGTVPIRRT EKDMLRASSVQRFGRKV RRRIRRDS+SNGHEHAVGFVSGE+YYGAKASINVWAPRVTNQYEFSL
Subjt:  YDRSPSGGGASETFQLWSMSGESCPEGTVPIRRTREKDMLRASSVQRFGRKVRRRRIRRDSTSNGHEHAVGFVSGEQYYGAKASINVWAPRVTNQYEFSL

Query:  SQMWVISGSFGDDLNTIEAGWQVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNSRIAIGAAISPTSSYNGGQFDISLLVWKDPKHGNWWLEF
        SQMWVISGSFGDDLNTIEAGWQVSPELYGDNYPRFFTYWTSDAYQ TGCYNLLCSGFVQTN+RIAIGAAISPTSSYNGGQFDISLLVWKDPKHGNWWLEF
Subjt:  SQMWVISGSFGDDLNTIEAGWQVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNSRIAIGAAISPTSSYNGGQFDISLLVWKDPKHGNWWLEF

Query:  GSGVLVGYWPAFLFTHLQSHATMIQFGGEVVNSRASGFHTATQMGSGHFAGEGFGKASYFRNLQVVDWDNSLIPLSNLKVLADHPNCYDIQGGINRVWGN
        GSGVLVGYWPAFLFTHL+SHATMIQFGGEVVNSR SGFHTATQMGSGHFAGEGFGKASYFRNLQVVDWDNSLIPLSNLKVLADHPNCY+IQGGINRVWGN
Subjt:  GSGVLVGYWPAFLFTHLQSHATMIQFGGEVVNSRASGFHTATQMGSGHFAGEGFGKASYFRNLQVVDWDNSLIPLSNLKVLADHPNCYDIQGGINRVWGN

Query:  YFYYGGPGRNVRCP
        YFYYGGPGRNVRCP
Subjt:  YFYYGGPGRNVRCP

TrEMBL top hits (hit | e value | % identity | alignment)
A0A1S3BDY3 uncharacterized protein LOC103488982 | e value: 3.7e-223 | % identity: 90.58
Query:  SSNLVPVLVFSLLVVSSFCPVHSDDKNILPNNHTATLQPEDELNKLKLIRAHLKKINKPPVKTIQSPDGDLIDCVISHHQPAFDHPLLKGQKPLDLPERP
        SS L+PVLVFSLLVVSSFCPVHS      P N T T +P+D L KLKLIRAHLKKINKPP+KTIQSPDGDLIDCVI+H QPAFDHPLLKGQKPLDLPERP
Subjt:  SSNLVPVLVFSLLVVSSFCPVHSDDKNILPNNHTATLQPEDELNKLKLIRAHLKKINKPPVKTIQSPDGDLIDCVISHHQPAFDHPLLKGQKPLDLPERP

Query:  YDRSPSGGGASETFQLWSMSGESCPEGTVPIRRTREKDMLRASSVQRFGRKVRRRRIRRDSTSNGHEHAVGFVSGEQYYGAKASINVWAPRVTNQYEFSL
        Y+RS SG  +SETFQLWSMSGE CPEG+VPIRRT E DM+RASSVQRFGRKV RRRIRRDS+S+GHEHAVGFVSGE+YYGAK SINVWAPRVTNQYEFSL
Subjt:  YDRSPSGGGASETFQLWSMSGESCPEGTVPIRRTREKDMLRASSVQRFGRKVRRRRIRRDSTSNGHEHAVGFVSGEQYYGAKASINVWAPRVTNQYEFSL

Query:  SQMWVISGSFGDDLNTIEAGWQVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNSRIAIGAAISPTSSYNGGQFDISLLVWKDPKHGNWWLEF
        SQMWVISGSFGDDLNTIEAGWQVSPELYGDNYPRFFTYWTSDAYQ TGCYNLLCSGFVQTN+RIAIGAAISPTSSYNGGQFDISLLVWKDPKHGNWWLEF
Subjt:  SQMWVISGSFGDDLNTIEAGWQVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNSRIAIGAAISPTSSYNGGQFDISLLVWKDPKHGNWWLEF

Query:  GSGVLVGYWPAFLFTHLQSHATMIQFGGEVVNSRASGFHTATQMGSGHFAGEGFGKASYFRNLQVVDWDNSLIPLSNLKVLADHPNCYDIQGGINRVWGN
        GSGVLVGYWPAFLFTHL+SHA+MIQFGGEVVNSRASGFHTATQMGSGHFAGEGFGKASYFRNLQVVDWDNSLIPLSNLKVLADHPNCY+IQGGINRVWGN
Subjt:  GSGVLVGYWPAFLFTHLQSHATMIQFGGEVVNSRASGFHTATQMGSGHFAGEGFGKASYFRNLQVVDWDNSLIPLSNLKVLADHPNCYDIQGGINRVWGN

Query:  YFYYGGPGRNVRCP
        YFYYGGPGRNVRCP
Subjt:  YFYYGGPGRNVRCP

A0A6J1B0N7 uncharacterized protein LOC110423059 | e value: 7.8e-213 | % identity: 77.85
Query:  STFRWAKAKNLWRCCGDCRRRQGQCTRPNY-FAKQTRQPNFPLM---IMDSSNLVPVLVFSLLVVSSFCPVH-SDDKNILPNNHTATLQPEDELNKLKLI
        ++F   K ++ W CC   RRRQ QCT+     A++T + NF  M       S ++P+ VF LLV SS CPV+ S+  N+  +N   T +PE+EL KLK+I
Subjt:  STFRWAKAKNLWRCCGDCRRRQGQCTRPNY-FAKQTRQPNFPLM---IMDSSNLVPVLVFSLLVVSSFCPVH-SDDKNILPNNHTATLQPEDELNKLKLI

Query:  RAHLKKINKPPVKTIQSPDGDLIDCVISHHQPAFDHPLLKGQKPLDLPERPYDRSPSGGGASETFQLWSMSGESCPEGTVPIRRTREKDMLRASSVQRFG
        R  LKKINKP  KTIQSPDGD+IDCV+ HHQPAFDHP LKGQKPLD PERP   +P+ G A+E FQLWSMSGESCPEGT+PIRRT E+DMLRASSV+RFG
Subjt:  RAHLKKINKPPVKTIQSPDGDLIDCVISHHQPAFDHPLLKGQKPLDLPERPYDRSPSGGGASETFQLWSMSGESCPEGTVPIRRTREKDMLRASSVQRFG

Query:  RKVRRRRIRRDSTSNGHEHAVGFVSGEQYYGAKASINVWAPRVTNQYEFSLSQMWVISGSFGDDLNTIEAGWQVSPELYGDNYPRFFTYWTSDAYQATGC
        RK  RRR+RRDSTSNGHEHAVG+VSG+QYYGAKASINVWAPRV+NQYEFSLSQMWVISGSFGDDLNTIEAGWQVSPELYGDNYPRFFTYWT+DAYQATGC
Subjt:  RKVRRRRIRRDSTSNGHEHAVGFVSGEQYYGAKASINVWAPRVTNQYEFSLSQMWVISGSFGDDLNTIEAGWQVSPELYGDNYPRFFTYWTSDAYQATGC

Query:  YNLLCSGFVQTNSRIAIGAAISPTSSYNGGQFDISLLVWKDPKHGNWWLEFGSGVLVGYWPAFLFTHLQSHATMIQFGGEVVNSRASGFHTATQMGSGHF
        YNLLCSGFVQTN+RIAIGAAISPTSSYNGGQFDISLLVWKDPKHGNWWLEFGSG+LVGYWP+FLFTHL+ HA+M+QFGGE+VNSR+ GFHT+T+MGSGHF
Subjt:  YNLLCSGFVQTNSRIAIGAAISPTSSYNGGQFDISLLVWKDPKHGNWWLEFGSGVLVGYWPAFLFTHLQSHATMIQFGGEVVNSRASGFHTATQMGSGHF

Query:  AGEGFGKASYFRNLQVVDWDNSLIPLSNLKVLADHPNCYDIQGGINRVWGNYFYYGGPGRNVRCP
        AGEGFGKASYFRNLQVVDWDN+LIPL+NL+VLADHPNCYDIQGGINRVWGNYFYYGGPGRNVRCP
Subjt:  AGEGFGKASYFRNLQVVDWDNSLIPLSNLKVLADHPNCYDIQGGINRVWGNYFYYGGPGRNVRCP

A0A6J1DC30 uncharacterized protein LOC111019661 | e value: 8.8e-281 | % identity: 100
Query:  MLSTFRWAKAKNLWRCCGDCRRRQGQCTRPNYFAKQTRQPNFPLMIMDSSNLVPVLVFSLLVVSSFCPVHSDDKNILPNNHTATLQPEDELNKLKLIRAH
        MLSTFRWAKAKNLWRCCGDCRRRQGQCTRPNYFAKQTRQPNFPLMIMDSSNLVPVLVFSLLVVSSFCPVHSDDKNILPNNHTATLQPEDELNKLKLIRAH
Subjt:  MLSTFRWAKAKNLWRCCGDCRRRQGQCTRPNYFAKQTRQPNFPLMIMDSSNLVPVLVFSLLVVSSFCPVHSDDKNILPNNHTATLQPEDELNKLKLIRAH

Query:  LKKINKPPVKTIQSPDGDLIDCVISHHQPAFDHPLLKGQKPLDLPERPYDRSPSGGGASETFQLWSMSGESCPEGTVPIRRTREKDMLRASSVQRFGRKV
        LKKINKPPVKTIQSPDGDLIDCVISHHQPAFDHPLLKGQKPLDLPERPYDRSPSGGGASETFQLWSMSGESCPEGTVPIRRTREKDMLRASSVQRFGRKV
Subjt:  LKKINKPPVKTIQSPDGDLIDCVISHHQPAFDHPLLKGQKPLDLPERPYDRSPSGGGASETFQLWSMSGESCPEGTVPIRRTREKDMLRASSVQRFGRKV

Query:  RRRRIRRDSTSNGHEHAVGFVSGEQYYGAKASINVWAPRVTNQYEFSLSQMWVISGSFGDDLNTIEAGWQVSPELYGDNYPRFFTYWTSDAYQATGCYNL
        RRRRIRRDSTSNGHEHAVGFVSGEQYYGAKASINVWAPRVTNQYEFSLSQMWVISGSFGDDLNTIEAGWQVSPELYGDNYPRFFTYWTSDAYQATGCYNL
Subjt:  RRRRIRRDSTSNGHEHAVGFVSGEQYYGAKASINVWAPRVTNQYEFSLSQMWVISGSFGDDLNTIEAGWQVSPELYGDNYPRFFTYWTSDAYQATGCYNL

Query:  LCSGFVQTNSRIAIGAAISPTSSYNGGQFDISLLVWKDPKHGNWWLEFGSGVLVGYWPAFLFTHLQSHATMIQFGGEVVNSRASGFHTATQMGSGHFAGE
        LCSGFVQTNSRIAIGAAISPTSSYNGGQFDISLLVWKDPKHGNWWLEFGSGVLVGYWPAFLFTHLQSHATMIQFGGEVVNSRASGFHTATQMGSGHFAGE
Subjt:  LCSGFVQTNSRIAIGAAISPTSSYNGGQFDISLLVWKDPKHGNWWLEFGSGVLVGYWPAFLFTHLQSHATMIQFGGEVVNSRASGFHTATQMGSGHFAGE

Query:  GFGKASYFRNLQVVDWDNSLIPLSNLKVLADHPNCYDIQGGINRVWGNYFYYGGPGRNVRCP
        GFGKASYFRNLQVVDWDNSLIPLSNLKVLADHPNCYDIQGGINRVWGNYFYYGGPGRNVRCP
Subjt:  GFGKASYFRNLQVVDWDNSLIPLSNLKVLADHPNCYDIQGGINRVWGNYFYYGGPGRNVRCP

A0A6J1GXT6 uncharacterized protein LOC111458179 | e value: 1.2e-221 | % identity: 89.13
Query:  MDSSN----LVPVLVFSLLVVSSFCPV---HSDDKNILPNNHTATLQPEDELNKLKLIRAHLKKINKPPVKTIQSPDGDLIDCVISHHQPAFDHPLLKGQ
        MDSSN    L+P+LVFSLLVVS   PV   H DDK I PNN T T +P+D+L KLKLIRAHLKKINKP VKTIQSPDGDLIDCVISH QPAFDHPLLKGQ
Subjt:  MDSSN----LVPVLVFSLLVVSSFCPV---HSDDKNILPNNHTATLQPEDELNKLKLIRAHLKKINKPPVKTIQSPDGDLIDCVISHHQPAFDHPLLKGQ

Query:  KPLDLPERPYDRSPSGGGASETFQLWSMSGESCPEGTVPIRRTREKDMLRASSVQRFGRKVRRRRIRRDSTSNGHEHAVGFVSGEQYYGAKASINVWAPR
        KPLD+ ERPYD+S SG   SE+FQLWSMSGESCPEGTVPIRRT EKDMLRASS++RFGRK  RRRIRRDS++ GHEHAVGFVSG++YYGAKASINVWAPR
Subjt:  KPLDLPERPYDRSPSGGGASETFQLWSMSGESCPEGTVPIRRTREKDMLRASSVQRFGRKVRRRRIRRDSTSNGHEHAVGFVSGEQYYGAKASINVWAPR

Query:  VTNQYEFSLSQMWVISGSFGDDLNTIEAGWQVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNSRIAIGAAISPTSSYNGGQFDISLLVWKDP
        VTNQYEFSLSQMWVISGSFGDDLNTIEAGWQVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNSRIAIGAAISPTSSYNGGQFDISLLVWKDP
Subjt:  VTNQYEFSLSQMWVISGSFGDDLNTIEAGWQVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNSRIAIGAAISPTSSYNGGQFDISLLVWKDP

Query:  KHGNWWLEFGSGVLVGYWPAFLFTHLQSHATMIQFGGEVVNSRASGFHTATQMGSGHFAGEGFGKASYFRNLQVVDWDNSLIPLSNLKVLADHPNCYDIQ
        KHGNWWLEFGSGVLVGYWPAFLFTHL+SHATMIQFGGEVVNS+ SGFHTATQMGSGHFAGEGFGKASYFRNLQVVDWDNSLIPLSNLKVLADHPNCY+IQ
Subjt:  KHGNWWLEFGSGVLVGYWPAFLFTHLQSHATMIQFGGEVVNSRASGFHTATQMGSGHFAGEGFGKASYFRNLQVVDWDNSLIPLSNLKVLADHPNCYDIQ

Query:  GGINRVWGNYFYYGGPGRNVRCP
        GGINRVWGNYFYYGGPGRNVRCP
Subjt:  GGINRVWGNYFYYGGPGRNVRCP

A0A6J1IRB1 uncharacterized protein LOC111477750 | e value: 1.9e-219 | % identity: 88.42
Query:  MDSSN----LVPVLVFSLLVVSSFCPV---HSDDKNILPNNHTATLQPEDELNKLKLIRAHLKKINKPPVKTIQSPDGDLIDCVISHHQPAFDHPLLKGQ
        MDSSN    L+P+LVF LL +S   PV   H DDK I PNN T T  P+D+L KLKLIRAHLKKINKP VKTIQSPDGDLIDCVISH QPAFDHPLLKGQ
Subjt:  MDSSN----LVPVLVFSLLVVSSFCPV---HSDDKNILPNNHTATLQPEDELNKLKLIRAHLKKINKPPVKTIQSPDGDLIDCVISHHQPAFDHPLLKGQ

Query:  KPLDLPERPYDRSPSGGGASETFQLWSMSGESCPEGTVPIRRTREKDMLRASSVQRFGRKVRRRRIRRDSTSNGHEHAVGFVSGEQYYGAKASINVWAPR
        KPLD+ ERPYD+S S    SE+FQLWSMSGESCPEGTVPIRRT EKDMLRASSV+RFGRK  RRRIRRDS++ GHEHAVGFVSG++YYGAKASINVWAPR
Subjt:  KPLDLPERPYDRSPSGGGASETFQLWSMSGESCPEGTVPIRRTREKDMLRASSVQRFGRKVRRRRIRRDSTSNGHEHAVGFVSGEQYYGAKASINVWAPR

Query:  VTNQYEFSLSQMWVISGSFGDDLNTIEAGWQVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNSRIAIGAAISPTSSYNGGQFDISLLVWKDP
        VTNQYEFSLSQMWVISGSFGDDLNTIEAGWQVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNSRIAIGAAISPTSSYNGGQFDISLLVWKDP
Subjt:  VTNQYEFSLSQMWVISGSFGDDLNTIEAGWQVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNSRIAIGAAISPTSSYNGGQFDISLLVWKDP

Query:  KHGNWWLEFGSGVLVGYWPAFLFTHLQSHATMIQFGGEVVNSRASGFHTATQMGSGHFAGEGFGKASYFRNLQVVDWDNSLIPLSNLKVLADHPNCYDIQ
        KHGNWWLEFGSGVLVGYWPAFLFTHL+SHATMIQFGGEVVNS+ SGFHTATQMGSGHFAGEGFGKASYFRNLQVVDWDNSLIPLSNLKVLADHPNCY+IQ
Subjt:  KHGNWWLEFGSGVLVGYWPAFLFTHLQSHATMIQFGGEVVNSRASGFHTATQMGSGHFAGEGFGKASYFRNLQVVDWDNSLIPLSNLKVLADHPNCYDIQ

Query:  GGINRVWGNYFYYGGPGRNVRCP
        GGINRVWGNYFYYGGPGRNVRCP
Subjt:  GGINRVWGNYFYYGGPGRNVRCP

SwissProt top hits (hit | e value | % identity | alignment)
No hits found

Arabidopsis top hits (hit | e value | % identity | alignment)
AT1G10750.1 Protein of Unknown Function (DUF239) | e value: 6.6e-188 | % identity: 74.15
Query:  SSNLVPVLVFSLLVVSSFCPVHSDDKNILPNNHTATLQPEDELNKLKLIRAHLKKINKPPVKTIQSPDGDLIDCVISHHQPAFDHPLLKGQKPLDLPERP
        SSN+  + +  LL+ SSF  V S+  N+ P N   TL+P DELNKLK I  HL+KINKP +KTI SPDGD+IDCV+ HHQPAFDHP L+GQKPLD PERP
Subjt:  SSNLVPVLVFSLLVVSSFCPVHSDDKNILPNNHTATLQPEDELNKLKLIRAHLKKINKPPVKTIQSPDGDLIDCVISHHQPAFDHPLLKGQKPLDLPERP

Query:  YDRSPSGGGASETFQLWSMSGESCPEGTVPIRRTREKDMLRASSVQRFGRKVRRRRIRRDSTSNGHEHAVGFVSGEQYYGAKASINVWAPRVTNQYEFSL
           +   G   ++FQLW M GE+CPEGTVPIRRT+E+D+LRA+SV  FG+K+  R  RRD++SNGHEHAVG+VSGE+YYGAKASINVWAP+V NQYEFSL
Subjt:  YDRSPSGGGASETFQLWSMSGESCPEGTVPIRRTREKDMLRASSVQRFGRKVRRRRIRRDSTSNGHEHAVGFVSGEQYYGAKASINVWAPRVTNQYEFSL

Query:  SQMWVISGSFGDDLNTIEAGWQVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNSRIAIGAAISPTSSYNGGQFDISLLVWKDPKHGNWWLEF
        SQ+W+ISGSFG+DLNTIEAGWQVSPELYGDNYPRFFTYWT+DAYQATGCYNLLCSGFVQTNS IAIGAAISP+SSY GGQFDI+LL+WKDPKHGNWWLEF
Subjt:  SQMWVISGSFGDDLNTIEAGWQVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNSRIAIGAAISPTSSYNGGQFDISLLVWKDPKHGNWWLEF

Query:  GSGVLVGYWPAFLFTHLQSHATMIQFGGEVVNSRASGFHTATQMGSGHFAGEGFGKASYFRNLQVVDWDNSLIPLSNLKVLADHPNCYDIQGGINRVWGN
        GSG+LVGYWP+FLFTHL+ HA+M+Q+GGE+VNS   G HT+TQMGSGHFA EGF K+SYFRN+QVVDWDN+L+P  NL+VLADHPNCYDIQGG NR WG+
Subjt:  GSGVLVGYWPAFLFTHLQSHATMIQFGGEVVNSRASGFHTATQMGSGHFAGEGFGKASYFRNLQVVDWDNSLIPLSNLKVLADHPNCYDIQGGINRVWGN

Query:  YFYYGGPGRNVRCP
        YFYYGGPG+N +CP
Subjt:  YFYYGGPGRNVRCP

AT1G23340.1 Protein of Unknown Function (DUF239) | e value: 6.2e-186 | % identity: 73.22
Query:  LVFSLLVVSSFCPVHSDDKNILPNNHTATLQPEDELNKLKLIRAHLKKINKPPVKTIQSPDGDLIDCVISHHQPAFDHPLLKGQKPLDLPERPYDRSPSG
        L F+ +++ S    ++   N    + T  L+P+ E+ K+KLIR  L+KINKP +KTI S DGD IDCV SHHQPAFDHPLL+GQ+P+D PE P   S   
Subjt:  LVFSLLVVSSFCPVHSDDKNILPNNHTATLQPEDELNKLKLIRAHLKKINKPPVKTIQSPDGDLIDCVISHHQPAFDHPLLKGQKPLDLPERPYDRSPSG

Query:  GGASETFQLWSMSGESCPEGTVPIRRTREKDMLRASSVQRFGRKVRRRRIRRDSTSNGHEHAVGFVSGEQYYGAKASINVWAPRVTNQYEFSLSQMWVIS
          + E FQLWS+ GESCPEGT+PIRRT E+DMLRA+SV+RFGRK+  RR+RRDS+SNGHEHAVG+VSG QYYGAKASINVW PRV +QYEFSLSQ+W+I+
Subjt:  GGASETFQLWSMSGESCPEGTVPIRRTREKDMLRASSVQRFGRKVRRRRIRRDSTSNGHEHAVGFVSGEQYYGAKASINVWAPRVTNQYEFSLSQMWVIS

Query:  GSFGDDLNTIEAGWQVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNSRIAIGAAISPTSSYNGGQFDISLLVWKDPKHGNWWLEFGSGVLVG
        GSF  DLNTIEAGWQ+SPELYGD  PRFFTYWTSDAYQATGCYNLLCSGFVQTN+RIAIGAAISP SSY GGQFDISLL+WKDPKHG+WWL+FGSG LVG
Subjt:  GSFGDDLNTIEAGWQVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNSRIAIGAAISPTSSYNGGQFDISLLVWKDPKHGNWWLEFGSGVLVG

Query:  YWPAFLFTHLQSHATMIQFGGEVVNSRASGFHTATQMGSGHFAGEGFGKASYFRNLQVVDWDNSLIPLSNLKVLADHPNCYDIQGGINRVWGNYFYYGGP
        YWP  LFTHL+ H  M+QFGGE+VN+R  G HT+TQMGSGHFAGEGFGKASYFRNLQ+VDWDN+LIP+SNLKVLADHPNCYDI+GG+NRVWGN+FYYGGP
Subjt:  YWPAFLFTHLQSHATMIQFGGEVVNSRASGFHTATQMGSGHFAGEGFGKASYFRNLQVVDWDNSLIPLSNLKVLADHPNCYDIQGGINRVWGNYFYYGGP

Query:  GRNVRCP
        G+N +CP
Subjt:  GRNVRCP

AT1G23340.2 Protein of Unknown Function (DUF239) | e value: 6.2e-186 | % identity: 73.22
Query:  LVFSLLVVSSFCPVHSDDKNILPNNHTATLQPEDELNKLKLIRAHLKKINKPPVKTIQSPDGDLIDCVISHHQPAFDHPLLKGQKPLDLPERPYDRSPSG
        L F+ +++ S    ++   N    + T  L+P+ E+ K+KLIR  L+KINKP +KTI S DGD IDCV SHHQPAFDHPLL+GQ+P+D PE P   S   
Subjt:  LVFSLLVVSSFCPVHSDDKNILPNNHTATLQPEDELNKLKLIRAHLKKINKPPVKTIQSPDGDLIDCVISHHQPAFDHPLLKGQKPLDLPERPYDRSPSG

Query:  GGASETFQLWSMSGESCPEGTVPIRRTREKDMLRASSVQRFGRKVRRRRIRRDSTSNGHEHAVGFVSGEQYYGAKASINVWAPRVTNQYEFSLSQMWVIS
          + E FQLWS+ GESCPEGT+PIRRT E+DMLRA+SV+RFGRK+  RR+RRDS+SNGHEHAVG+VSG QYYGAKASINVW PRV +QYEFSLSQ+W+I+
Subjt:  GGASETFQLWSMSGESCPEGTVPIRRTREKDMLRASSVQRFGRKVRRRRIRRDSTSNGHEHAVGFVSGEQYYGAKASINVWAPRVTNQYEFSLSQMWVIS

Query:  GSFGDDLNTIEAGWQVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNSRIAIGAAISPTSSYNGGQFDISLLVWKDPKHGNWWLEFGSGVLVG
        GSF  DLNTIEAGWQ+SPELYGD  PRFFTYWTSDAYQATGCYNLLCSGFVQTN+RIAIGAAISP SSY GGQFDISLL+WKDPKHG+WWL+FGSG LVG
Subjt:  GSFGDDLNTIEAGWQVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNSRIAIGAAISPTSSYNGGQFDISLLVWKDPKHGNWWLEFGSGVLVG

Query:  YWPAFLFTHLQSHATMIQFGGEVVNSRASGFHTATQMGSGHFAGEGFGKASYFRNLQVVDWDNSLIPLSNLKVLADHPNCYDIQGGINRVWGNYFYYGGP
        YWP  LFTHL+ H  M+QFGGE+VN+R  G HT+TQMGSGHFAGEGFGKASYFRNLQ+VDWDN+LIP+SNLKVLADHPNCYDI+GG+NRVWGN+FYYGGP
Subjt:  YWPAFLFTHLQSHATMIQFGGEVVNSRASGFHTATQMGSGHFAGEGFGKASYFRNLQVVDWDNSLIPLSNLKVLADHPNCYDIQGGINRVWGNYFYYGGP

Query:  GRNVRCP
        G+N +CP
Subjt:  GRNVRCP

AT1G70550.1 Protein of Unknown Function (DUF239) | e value: 6.4e-191 | % identity: 71.08
Query:  KNLWRCC-GDCRRRQGQCTRPNYFAKQTRQPNFPLMIMDSSNLVPVLVFSLLVVSSFCPVHSDDKNILPNNHTATLQPEDELNKLKLIRAHLKKINKPPV
        +N  +CC  D  RR+ Q +R   F   ++Q +F    M SS+ + +++   LV SSF    S   +   +    TL+P++EL KL LIR  L KINKP V
Subjt:  KNLWRCC-GDCRRRQGQCTRPNYFAKQTRQPNFPLMIMDSSNLVPVLVFSLLVVSSFCPVHSDDKNILPNNHTATLQPEDELNKLKLIRAHLKKINKPPV

Query:  KTIQSPDGDLIDCVISHHQPAFDHPLLKGQKPLDLPERPYDRSPSGGGASETFQLWSMSGESCPEGTVPIRRTREKDMLRASSVQRFGRKVRRRRIRRDS
        KTIQS DGD IDCV +H QPAFDHPLL+GQKPLD PE P   S    G+ E  QLWS+SGESCPEGT+PIRRT E+DMLRASSVQRFGRK+  RR++RDS
Subjt:  KTIQSPDGDLIDCVISHHQPAFDHPLLKGQKPLDLPERPYDRSPSGGGASETFQLWSMSGESCPEGTVPIRRTREKDMLRASSVQRFGRKVRRRRIRRDS

Query:  TSNGHEHAVGFVSGEQYYGAKASINVWAPRVTNQYEFSLSQMWVISGSFGDDLNTIEAGWQVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTN
        T+NGHEHAVG+V+G QYYGAKASINVW+PRVT+QYEFSLSQ+WVI+GSF  DLNTIEAGWQ+SPELYGD YPRFFTYWTSDAY+ TGCYNLLCSGFVQTN
Subjt:  TSNGHEHAVGFVSGEQYYGAKASINVWAPRVTNQYEFSLSQMWVISGSFGDDLNTIEAGWQVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTN

Query:  SRIAIGAAISPTSSYNGGQFDISLLVWKDPKHGNWWLEFGSGVLVGYWPAFLFTHLQSHATMIQFGGEVVNSRASGFHTATQMGSGHFAGEGFGKASYFR
         RIAIGAAISP SSY GGQFDISLL+WKDPKHG+WWL+FGSG LVGYWPAFLFTHL+ H +M+QFGGE+VN+R  G HT TQMGSGHFAGEGFGKASYFR
Subjt:  SRIAIGAAISPTSSYNGGQFDISLLVWKDPKHGNWWLEFGSGVLVGYWPAFLFTHLQSHATMIQFGGEVVNSRASGFHTATQMGSGHFAGEGFGKASYFR

Query:  NLQVVDWDNSLIPLSNLKVLADHPNCYDIQGGINRVWGNYFYYGGPGRNVRCP
        NLQ+VDWDN+LIP SNLK+LADHPNCYDI+GG NRVWGNYFYYGGPG+N RCP
Subjt:  NLQVVDWDNSLIPLSNLKVLADHPNCYDIQGGINRVWGNYFYYGGPGRNVRCP

AT1G70550.2 Protein of Unknown Function (DUF239) | e value: 2.1e-189 | % identity: 74.76
Query:  MDSSNLVPVLVFSLLVVSSFCPVHSDDKNILPNNHTATLQPEDELNKLKLIRAHLKKINKPPVKTIQSPDGDLIDCVISHHQPAFDHPLLKGQKPLDLPE
        M SS+ + +++   LV SSF    S   +   +    TL+P++EL KL LIR  L KINKP VKTIQS DGD IDCV +H QPAFDHPLL+GQKPLD PE
Subjt:  MDSSNLVPVLVFSLLVVSSFCPVHSDDKNILPNNHTATLQPEDELNKLKLIRAHLKKINKPPVKTIQSPDGDLIDCVISHHQPAFDHPLLKGQKPLDLPE

Query:  RPYDRSPSGGGASETFQLWSMSGESCPEGTVPIRRTREKDMLRASSVQRFGRKVRRRRIRRDSTSNGHEHAVGFVSGEQYYGAKASINVWAPRVTNQYEF
         P   S    G+ E  QLWS+SGESCPEGT+PIRRT E+DMLRASSVQRFGRK+  RR++RDST+NGHEHAVG+V+G QYYGAKASINVW+PRVT+QYEF
Subjt:  RPYDRSPSGGGASETFQLWSMSGESCPEGTVPIRRTREKDMLRASSVQRFGRKVRRRRIRRDSTSNGHEHAVGFVSGEQYYGAKASINVWAPRVTNQYEF

Query:  SLSQMWVISGSFGDDLNTIEAGWQVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNSRIAIGAAISPTSSYNGGQFDISLLVWKDPKHGNWWL
        SLSQ+WVI+GSF  DLNTIEAGWQ+SPELYGD YPRFFTYWTSDAY+ TGCYNLLCSGFVQTN RIAIGAAISP SSY GGQFDISLL+WKDPKHG+WWL
Subjt:  SLSQMWVISGSFGDDLNTIEAGWQVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNSRIAIGAAISPTSSYNGGQFDISLLVWKDPKHGNWWL

Query:  EFGSGVLVGYWPAFLFTHLQSHATMIQFGGEVVNSRASGFHTATQMGSGHFAGEGFGKASYFRNLQVVDWDNSLIPLSNLKVLADHPNCYDIQGGINRVW
        +FGSG LVGYWPAFLFTHL+ H +M+QFGGE+VN+R  G HT TQMGSGHFAGEGFGKASYFRNLQ+VDWDN+LIP SNLK+LADHPNCYDI+GG NRVW
Subjt:  EFGSGVLVGYWPAFLFTHLQSHATMIQFGGEVVNSRASGFHTATQMGSGHFAGEGFGKASYFRNLQVVDWDNSLIPLSNLKVLADHPNCYDIQGGINRVW

Query:  GNYFYYGGPGRNVRCP
        GNYFYYGGPG+N RCP
Subjt:  GNYFYYGGPGRNVRCP
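
In the alignments above, the unlabeled middle line of each block summarizes the match between query and hit. As a rough sketch, the identity of a single block can be tallied from that middle line. This assumes the usual BLAST convention that a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. The function name `midline_stats` is hypothetical, and the example uses only the final block of the first GenBank hit, so its result is not the whole-hit % identity reported in the table.

```python
def midline_stats(query, midline):
    """Tally one alignment block: (identical, positive, aligned columns)."""
    # A letter on the middle line is taken to mean an identical residue.
    identical = sum(ch.isalpha() for ch in midline)
    # '+' is taken to mean a conservative substitution; positives include identities.
    positive = identical + midline.count("+")
    # The query line (including any '-' gap characters) defines the block width,
    # because trailing spaces on the middle line may have been stripped.
    return identical, positive, len(query)

query = "GGINRVWGNYFYYGGPGRNVRCP"
midline = "GGINR+WGNYFYYGGPGRNVRCP"

identical, positive, columns = midline_stats(query, midline)
percent_identity = 100.0 * identical / columns
print(identical, positive, columns, round(percent_identity, 2))
```

A whole-hit figure would need the same tally summed over every block of that hit before dividing.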


Sequences
CDS sequence
ATGCTCAGTACGTTTCGTTGGGCAAAAGCTAAGAACCTATGGCGCTGCTGCGGTGATTGCAGAAGAAGACAAGGCCAATGCACACGACCAAATTACTTTGCCAAACAGAC
AAGACAACCAAATTTCCCACTCATGATCATGGATTCTTCTAACCTCGTCCCTGTTCTTGTTTTTTCCCTTCTTGTCGTTTCGTCGTTTTGTCCCGTTCATTCAGACGACA
AAAATATCCTTCCCAATAACCACACGGCCACGCTCCAGCCGGAGGATGAATTAAACAAGTTGAAGCTCATTAGAGCCCATCTCAAGAAAATCAACAAGCCCCCCGTCAAA
ACCATTCAGAGTCCGGACGGGGATCTTATAGATTGTGTAATATCTCATCACCAGCCGGCTTTTGATCATCCATTGCTGAAAGGACAGAAGCCATTGGATCTGCCGGAGAG
ACCTTATGACCGGAGCCCTTCCGGCGGAGGGGCATCGGAAACGTTTCAATTATGGAGCATGTCCGGCGAGTCTTGCCCGGAAGGAACTGTTCCGATCAGAAGAACGAGGG
AAAAGGATATGCTGAGAGCCAGCTCCGTTCAAAGATTTGGAAGAAAAGTTAGAAGAAGACGCATTAGAAGAGACTCCACCAGCAACGGCCACGAGCATGCAGTGGGATTT
GTAAGCGGAGAACAGTACTACGGAGCAAAGGCGAGCATAAACGTATGGGCGCCGCGGGTGACGAATCAATACGAGTTCAGTCTGTCACAGATGTGGGTCATTTCTGGTTC
ATTTGGGGATGATCTAAACACCATTGAAGCTGGCTGGCAGGTTAGCCCAGAGCTATATGGAGACAATTACCCTAGATTTTTCACTTATTGGACATCAGACGCATATCAAG
CGACTGGATGCTACAATTTACTCTGCTCGGGCTTCGTTCAGACCAACAGCAGAATCGCCATTGGAGCTGCAATTTCTCCAACTTCTTCCTACAATGGCGGCCAATTCGAC
ATCAGTCTACTGGTTTGGAAGGATCCAAAGCATGGGAATTGGTGGTTGGAATTCGGGTCGGGCGTGCTGGTCGGGTACTGGCCCGCATTCTTGTTCACTCACCTTCAAAG
CCACGCGACGATGATTCAGTTCGGTGGGGAGGTGGTGAATTCGAGGGCCTCTGGATTTCACACGGCGACGCAAATGGGCAGCGGCCATTTCGCCGGCGAGGGCTTTGGAA
AAGCTTCGTATTTTCGTAACCTTCAAGTAGTGGATTGGGACAACAGCTTAATTCCACTGTCAAACTTGAAGGTTTTGGCAGATCATCCAAATTGCTACGACATTCAAGGA
GGGATTAATAGAGTTTGGGGAAATTACTTTTACTATGGTGGGCCTGGAAGAAATGTGAGATGCCCCTAA
mRNA sequence
ATGCTCAGTACGTTTCGTTGGGCAAAAGCTAAGAACCTATGGCGCTGCTGCGGTGATTGCAGAAGAAGACAAGGCCAATGCACACGACCAAATTACTTTGCCAAACAGAC
AAGACAACCAAATTTCCCACTCATGATCATGGATTCTTCTAACCTCGTCCCTGTTCTTGTTTTTTCCCTTCTTGTCGTTTCGTCGTTTTGTCCCGTTCATTCAGACGACA
AAAATATCCTTCCCAATAACCACACGGCCACGCTCCAGCCGGAGGATGAATTAAACAAGTTGAAGCTCATTAGAGCCCATCTCAAGAAAATCAACAAGCCCCCCGTCAAA
ACCATTCAGAGTCCGGACGGGGATCTTATAGATTGTGTAATATCTCATCACCAGCCGGCTTTTGATCATCCATTGCTGAAAGGACAGAAGCCATTGGATCTGCCGGAGAG
ACCTTATGACCGGAGCCCTTCCGGCGGAGGGGCATCGGAAACGTTTCAATTATGGAGCATGTCCGGCGAGTCTTGCCCGGAAGGAACTGTTCCGATCAGAAGAACGAGGG
AAAAGGATATGCTGAGAGCCAGCTCCGTTCAAAGATTTGGAAGAAAAGTTAGAAGAAGACGCATTAGAAGAGACTCCACCAGCAACGGCCACGAGCATGCAGTGGGATTT
GTAAGCGGAGAACAGTACTACGGAGCAAAGGCGAGCATAAACGTATGGGCGCCGCGGGTGACGAATCAATACGAGTTCAGTCTGTCACAGATGTGGGTCATTTCTGGTTC
ATTTGGGGATGATCTAAACACCATTGAAGCTGGCTGGCAGGTTAGCCCAGAGCTATATGGAGACAATTACCCTAGATTTTTCACTTATTGGACATCAGACGCATATCAAG
CGACTGGATGCTACAATTTACTCTGCTCGGGCTTCGTTCAGACCAACAGCAGAATCGCCATTGGAGCTGCAATTTCTCCAACTTCTTCCTACAATGGCGGCCAATTCGAC
ATCAGTCTACTGGTTTGGAAGGATCCAAAGCATGGGAATTGGTGGTTGGAATTCGGGTCGGGCGTGCTGGTCGGGTACTGGCCCGCATTCTTGTTCACTCACCTTCAAAG
CCACGCGACGATGATTCAGTTCGGTGGGGAGGTGGTGAATTCGAGGGCCTCTGGATTTCACACGGCGACGCAAATGGGCAGCGGCCATTTCGCCGGCGAGGGCTTTGGAA
AAGCTTCGTATTTTCGTAACCTTCAAGTAGTGGATTGGGACAACAGCTTAATTCCACTGTCAAACTTGAAGGTTTTGGCAGATCATCCAAATTGCTACGACATTCAAGGA
GGGATTAATAGAGTTTGGGGAAATTACTTTTACTATGGTGGGCCTGGAAGAAATGTGAGATGCCCCTAA
Protein sequence
MLSTFRWAKAKNLWRCCGDCRRRQGQCTRPNYFAKQTRQPNFPLMIMDSSNLVPVLVFSLLVVSSFCPVHSDDKNILPNNHTATLQPEDELNKLKLIRAHLKKINKPPVK
TIQSPDGDLIDCVISHHQPAFDHPLLKGQKPLDLPERPYDRSPSGGGASETFQLWSMSGESCPEGTVPIRRTREKDMLRASSVQRFGRKVRRRRIRRDSTSNGHEHAVGF
VSGEQYYGAKASINVWAPRVTNQYEFSLSQMWVISGSFGDDLNTIEAGWQVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNSRIAIGAAISPTSSYNGGQFD
ISLLVWKDPKHGNWWLEFGSGVLVGYWPAFLFTHLQSHATMIQFGGEVVNSRASGFHTATQMGSGHFAGEGFGKASYFRNLQVVDWDNSLIPLSNLKVLADHPNCYDIQG
GINRVWGNYFYYGGPGRNVRCP
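
The CDS and protein sequences above can be cross-checked by translating the CDS codon by codon. The following is a minimal sketch that assumes the standard nuclear genetic code (the page does not name a translation table). The names `CODON_TABLE` and `translate` are illustrative, and only the first 33 and last 9 nucleotides of the CDS are used to keep the example short.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (assumed table).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and whitespace
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]  # raises KeyError on ambiguous bases such as N
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 11 codons and last 3 codons of the CDS listed above.
head = translate("ATGCTCAGTACGTTTCGTTGGGCAAAAGCTAAG")
tail = translate("TGCCCCTAA")
print(head, tail)
```

The two fragments translate to the start (`MLSTFRWAKAK`) and the end (`CP`) of the protein sequence listed above. Passing the full CDS to `translate` would allow the same comparison over the whole length.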