; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MC09g1791 (gene) of Bitter gourd (Dali-11) v1 genome

Gene IDMC09g1791
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
DescriptionProtein of Unknown Function (DUF239)
Genome locationMC09:22997530..23000875
RNA-Seq ExpressionMC09g1791
SyntenyMC09g1791
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR004314 - Neprosin
IPR025521 - Neprosin activation peptide


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6601521.1 hypothetical protein SDJN03_06754, partial [Cucurbita argyrosperma subsp. sororia]1.05e-28088.29Show/hide
Query:  MDSSN----LVPVLVFSLLVVSSFCPV---HSDDKNILPNNHTATLQPEDELNKLKLIRAHLKKINKPPVKTIQSPDGDLIDCVISHHQPAFDHPLLKGQ
        MDSSN    L+P+LVFSLLVVS   PV   H DDK I PNN T T +P+D+L KLKLIRAHLKKINKP VKTIQSPDGDLIDCVISH QPAFDHPLLKGQ
Subjt:  MDSSN----LVPVLVFSLLVVSSFCPV---HSDDKNILPNNHTATLQPEDELNKLKLIRAHLKKINKPPVKTIQSPDGDLIDCVISHHQPAFDHPLLKGQ

Query:  KPLDLPERPYDRSPSGGGASETFQLWSMSGESCPEGTVPIRRTREKDMLRASSVQRFGRKVRRRRIRRDSTSNGHEHAVGFVSGEQYYGAKASINVWAPR
        KPLD+ +RPYD+SPSG   SE+FQLWSMSGESCPEGTVPIRRT EKDMLRASSV+RFGRK RRR IRRDS++ GHEHAVGFVSG++YYGAKASINVWAPR
Subjt:  KPLDLPERPYDRSPSGGGASETFQLWSMSGESCPEGTVPIRRTREKDMLRASSVQRFGRKVRRRRIRRDSTSNGHEHAVGFVSGEQYYGAKASINVWAPR

Query:  VTNQYEFSLSQMWVISGSFGDDLNTIEAGWQASFSVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNSRIAIGAAISPTSSYNGGQFDISLLV
        VTNQYEFSLSQMWVISGSFGDDLNTIEAGWQ    VSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNSRIAIGAAISPTSSYNGGQFDISLLV
Subjt:  VTNQYEFSLSQMWVISGSFGDDLNTIEAGWQASFSVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNSRIAIGAAISPTSSYNGGQFDISLLV

Query:  WKDPKHGNWWLEFGSGVLVGYWPAFLFTHLQSHATMIQFGGEVVNSRASGFHTATQMGSGHFAGEGFGKASYFRNLQVVDWDNSLIPLSNLKVLADHPNC
        WKDPKHGNWWLEFGSGVLVGYWPAFLFTHL+SHATMIQFGGEVVNS+ SGFHTATQMGSGHFAGEGFGKASYFRNLQVVDWDNSLIPLSNLKVLADHPNC
Subjt:  WKDPKHGNWWLEFGSGVLVGYWPAFLFTHLQSHATMIQFGGEVVNSRASGFHTATQMGSGHFAGEGFGKASYFRNLQVVDWDNSLIPLSNLKVLADHPNC

Query:  YDIQGGINRVWGNYFYYGGPGRNVRCP
        Y+IQGGINR+WGNYFYYGGPGRNVRCP
Subjt:  YDIQGGINRVWGNYFYYGGPGRNVRCP

XP_008446177.1 PREDICTED: uncharacterized protein LOC103488982 [Cucumis melo]1.94e-28189.5Show/hide
Query:  DSSNLVPVLVFSLLVVSSFCPVHSDDKNILPNNHTATLQPEDELNKLKLIRAHLKKINKPPVKTIQSPDGDLIDCVISHHQPAFDHPLLKGQKPLDLPER
         SS L+PVLVFSLLVVSSFCPVHS      P N T T +P+D L KLKLIRAHLKKINKPP+KTIQSPDGDLIDCVI+H QPAFDHPLLKGQKPLDLPER
Subjt:  DSSNLVPVLVFSLLVVSSFCPVHSDDKNILPNNHTATLQPEDELNKLKLIRAHLKKINKPPVKTIQSPDGDLIDCVISHHQPAFDHPLLKGQKPLDLPER

Query:  PYDRSPSGGGASETFQLWSMSGESCPEGTVPIRRTREKDMLRASSVQRFGRKVRRRRIRRDSTSNGHEHAVGFVSGEQYYGAKASINVWAPRVTNQYEFS
        PY+RS SG  +SETFQLWSMSGE CPEG+VPIRRT E DM+RASSVQRFGRKVRRR IRRDS+S+GHEHAVGFVSGE+YYGAK SINVWAPRVTNQYEFS
Subjt:  PYDRSPSGGGASETFQLWSMSGESCPEGTVPIRRTREKDMLRASSVQRFGRKVRRRRIRRDSTSNGHEHAVGFVSGEQYYGAKASINVWAPRVTNQYEFS

Query:  LSQMWVISGSFGDDLNTIEAGWQASFSVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNSRIAIGAAISPTSSYNGGQFDISLLVWKDPKHGN
        LSQMWVISGSFGDDLNTIEAGWQ    VSPELYGDNYPRFFTYWTSDAYQ TGCYNLLCSGFVQTN+RIAIGAAISPTSSYNGGQFDISLLVWKDPKHGN
Subjt:  LSQMWVISGSFGDDLNTIEAGWQASFSVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNSRIAIGAAISPTSSYNGGQFDISLLVWKDPKHGN

Query:  WWLEFGSGVLVGYWPAFLFTHLQSHATMIQFGGEVVNSRASGFHTATQMGSGHFAGEGFGKASYFRNLQVVDWDNSLIPLSNLKVLADHPNCYDIQGGIN
        WWLEFGSGVLVGYWPAFLFTHL+SHA+MIQFGGEVVNSRASGFHTATQMGSGHFAGEGFGKASYFRNLQVVDWDNSLIPLSNLKVLADHPNCY+IQGGIN
Subjt:  WWLEFGSGVLVGYWPAFLFTHLQSHATMIQFGGEVVNSRASGFHTATQMGSGHFAGEGFGKASYFRNLQVVDWDNSLIPLSNLKVLADHPNCYDIQGGIN

Query:  RVWGNYFYYGGPGRNVRCP
        RVWGNYFYYGGPGRNVRCP
Subjt:  RVWGNYFYYGGPGRNVRCP

XP_022151755.1 uncharacterized protein LOC111019661 [Momordica charantia]0.099.14Show/hide
Query:  MLSTFRWAKAKNLWRCCGDCRRRQGQCTRPNYFAKQTRQPNFPLMIMDSSNLVPVLVFSLLVVSSFCPVHSDDKNILPNNHTATLQPEDELNKLKLIRAH
        MLSTFRWAKAKNLWRCCGDCRRRQGQCTRPNYFAKQTRQPNFPLMIMDSSNLVPVLVFSLLVVSSFCPVHSDDKNILPNNHTATLQPEDELNKLKLIRAH
Subjt:  MLSTFRWAKAKNLWRCCGDCRRRQGQCTRPNYFAKQTRQPNFPLMIMDSSNLVPVLVFSLLVVSSFCPVHSDDKNILPNNHTATLQPEDELNKLKLIRAH

Query:  LKKINKPPVKTIQSPDGDLIDCVISHHQPAFDHPLLKGQKPLDLPERPYDRSPSGGGASETFQLWSMSGESCPEGTVPIRRTREKDMLRASSVQRFGRKV
        LKKINKPPVKTIQSPDGDLIDCVISHHQPAFDHPLLKGQKPLDLPERPYDRSPSGGGASETFQLWSMSGESCPEGTVPIRRTREKDMLRASSVQRFGRKV
Subjt:  LKKINKPPVKTIQSPDGDLIDCVISHHQPAFDHPLLKGQKPLDLPERPYDRSPSGGGASETFQLWSMSGESCPEGTVPIRRTREKDMLRASSVQRFGRKV

Query:  RRRRIRRDSTSNGHEHAVGFVSGEQYYGAKASINVWAPRVTNQYEFSLSQMWVISGSFGDDLNTIEAGWQASFSVSPELYGDNYPRFFTYWTSDAYQATG
        RRRRIRRDSTSNGHEHAVGFVSGEQYYGAKASINVWAPRVTNQYEFSLSQMWVISGSFGDDLNTIEAGWQ    VSPELYGDNYPRFFTYWTSDAYQATG
Subjt:  RRRRIRRDSTSNGHEHAVGFVSGEQYYGAKASINVWAPRVTNQYEFSLSQMWVISGSFGDDLNTIEAGWQASFSVSPELYGDNYPRFFTYWTSDAYQATG

Query:  CYNLLCSGFVQTNSRIAIGAAISPTSSYNGGQFDISLLVWKDPKHGNWWLEFGSGVLVGYWPAFLFTHLQSHATMIQFGGEVVNSRASGFHTATQMGSGH
        CYNLLCSGFVQTNSRIAIGAAISPTSSYNGGQFDISLLVWKDPKHGNWWLEFGSGVLVGYWPAFLFTHLQSHATMIQFGGEVVNSRASGFHTATQMGSGH
Subjt:  CYNLLCSGFVQTNSRIAIGAAISPTSSYNGGQFDISLLVWKDPKHGNWWLEFGSGVLVGYWPAFLFTHLQSHATMIQFGGEVVNSRASGFHTATQMGSGH

Query:  FAGEGFGKASYFRNLQVVDWDNSLIPLSNLKVLADHPNCYDIQGGINRVWGNYFYYGGPGRNVRCP
        FAGEGFGKASYFRNLQVVDWDNSLIPLSNLKVLADHPNCYDIQGGINRVWGNYFYYGGPGRNVRCP
Subjt:  FAGEGFGKASYFRNLQVVDWDNSLIPLSNLKVLADHPNCYDIQGGINRVWGNYFYYGGPGRNVRCP

XP_022956450.1 uncharacterized protein LOC111458179 [Cucurbita moschata]6.07e-28088.29Show/hide
Query:  MDSSN----LVPVLVFSLLVVSSFCPV---HSDDKNILPNNHTATLQPEDELNKLKLIRAHLKKINKPPVKTIQSPDGDLIDCVISHHQPAFDHPLLKGQ
        MDSSN    L+P+LVFSLLVVS   PV   H DDK I PNN T T +P+D+L KLKLIRAHLKKINKP VKTIQSPDGDLIDCVISH QPAFDHPLLKGQ
Subjt:  MDSSN----LVPVLVFSLLVVSSFCPV---HSDDKNILPNNHTATLQPEDELNKLKLIRAHLKKINKPPVKTIQSPDGDLIDCVISHHQPAFDHPLLKGQ

Query:  KPLDLPERPYDRSPSGGGASETFQLWSMSGESCPEGTVPIRRTREKDMLRASSVQRFGRKVRRRRIRRDSTSNGHEHAVGFVSGEQYYGAKASINVWAPR
        KPLD+ ERPYD+S SG   SE+FQLWSMSGESCPEGTVPIRRT EKDMLRASS++RFGRK RRR IRRDS++ GHEHAVGFVSG++YYGAKASINVWAPR
Subjt:  KPLDLPERPYDRSPSGGGASETFQLWSMSGESCPEGTVPIRRTREKDMLRASSVQRFGRKVRRRRIRRDSTSNGHEHAVGFVSGEQYYGAKASINVWAPR

Query:  VTNQYEFSLSQMWVISGSFGDDLNTIEAGWQASFSVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNSRIAIGAAISPTSSYNGGQFDISLLV
        VTNQYEFSLSQMWVISGSFGDDLNTIEAGWQ    VSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNSRIAIGAAISPTSSYNGGQFDISLLV
Subjt:  VTNQYEFSLSQMWVISGSFGDDLNTIEAGWQASFSVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNSRIAIGAAISPTSSYNGGQFDISLLV

Query:  WKDPKHGNWWLEFGSGVLVGYWPAFLFTHLQSHATMIQFGGEVVNSRASGFHTATQMGSGHFAGEGFGKASYFRNLQVVDWDNSLIPLSNLKVLADHPNC
        WKDPKHGNWWLEFGSGVLVGYWPAFLFTHL+SHATMIQFGGEVVNS+ SGFHTATQMGSGHFAGEGFGKASYFRNLQVVDWDNSLIPLSNLKVLADHPNC
Subjt:  WKDPKHGNWWLEFGSGVLVGYWPAFLFTHLQSHATMIQFGGEVVNSRASGFHTATQMGSGHFAGEGFGKASYFRNLQVVDWDNSLIPLSNLKVLADHPNC

Query:  YDIQGGINRVWGNYFYYGGPGRNVRCP
        Y+IQGGINRVWGNYFYYGGPGRNVRCP
Subjt:  YDIQGGINRVWGNYFYYGGPGRNVRCP

XP_038891414.1 uncharacterized protein LOC120080834 isoform X1 [Benincasa hispida]3.35e-28791.39Show/hide
Query:  SSNLVPVLVFSLLVVSSFCPVHSDDKNILPNNHTATLQPEDELNKLKLIRAHLKKINKPPVKTIQSPDGDLIDCVISHHQPAFDHPLLKGQKPLDLPERP
        SS+LV VLVFSLLVVSSFCPVHS DK I   N T T +P+D   KLKLIRAHLKKINKPP+KTIQSPDGDLIDCVISH QPAFDHPLLKG+KPLDLPERP
Subjt:  SSNLVPVLVFSLLVVSSFCPVHSDDKNILPNNHTATLQPEDELNKLKLIRAHLKKINKPPVKTIQSPDGDLIDCVISHHQPAFDHPLLKGQKPLDLPERP

Query:  YDRSPSGGGASETFQLWSMSGESCPEGTVPIRRTREKDMLRASSVQRFGRKVRRRRIRRDSTSNGHEHAVGFVSGEQYYGAKASINVWAPRVTNQYEFSL
        ++RS SGGG SETFQLWSMSGE CPEGTVPIRRT EKDMLRASSVQRFGRKVRRR IRRDS+SNGHEHAVGFVSGE+YYGAKASINVWAPRVTNQYEFSL
Subjt:  YDRSPSGGGASETFQLWSMSGESCPEGTVPIRRTREKDMLRASSVQRFGRKVRRRRIRRDSTSNGHEHAVGFVSGEQYYGAKASINVWAPRVTNQYEFSL

Query:  SQMWVISGSFGDDLNTIEAGWQASFSVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNSRIAIGAAISPTSSYNGGQFDISLLVWKDPKHGNW
        SQMWVISGSFGDDLNTIEAGWQ    VSPELYGDNYPRFFTYWTSDAYQ TGCYNLLCSGFVQTN+RIAIGAAISPTSSYNGGQFDISLLVWKDPKHGNW
Subjt:  SQMWVISGSFGDDLNTIEAGWQASFSVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNSRIAIGAAISPTSSYNGGQFDISLLVWKDPKHGNW

Query:  WLEFGSGVLVGYWPAFLFTHLQSHATMIQFGGEVVNSRASGFHTATQMGSGHFAGEGFGKASYFRNLQVVDWDNSLIPLSNLKVLADHPNCYDIQGGINR
        WLEFGSGVLVGYWPAFLFTHL+SHATMIQFGGEVVNSR SGFHTATQMGSGHFAGEGFGKASYFRNLQVVDWDNSLIPLSNLKVLADHPNCY+IQGGINR
Subjt:  WLEFGSGVLVGYWPAFLFTHLQSHATMIQFGGEVVNSRASGFHTATQMGSGHFAGEGFGKASYFRNLQVVDWDNSLIPLSNLKVLADHPNCYDIQGGINR

Query:  VWGNYFYYGGPGRNVRCP
        VWGNYFYYGGPGRNVRCP
Subjt:  VWGNYFYYGGPGRNVRCP

TrEMBL top hitse value%identityAlignment
A0A1S3BDY3 uncharacterized protein LOC1034889829.39e-28289.5Show/hide
Query:  DSSNLVPVLVFSLLVVSSFCPVHSDDKNILPNNHTATLQPEDELNKLKLIRAHLKKINKPPVKTIQSPDGDLIDCVISHHQPAFDHPLLKGQKPLDLPER
         SS L+PVLVFSLLVVSSFCPVHS      P N T T +P+D L KLKLIRAHLKKINKPP+KTIQSPDGDLIDCVI+H QPAFDHPLLKGQKPLDLPER
Subjt:  DSSNLVPVLVFSLLVVSSFCPVHSDDKNILPNNHTATLQPEDELNKLKLIRAHLKKINKPPVKTIQSPDGDLIDCVISHHQPAFDHPLLKGQKPLDLPER

Query:  PYDRSPSGGGASETFQLWSMSGESCPEGTVPIRRTREKDMLRASSVQRFGRKVRRRRIRRDSTSNGHEHAVGFVSGEQYYGAKASINVWAPRVTNQYEFS
        PY+RS SG  +SETFQLWSMSGE CPEG+VPIRRT E DM+RASSVQRFGRKVRRR IRRDS+S+GHEHAVGFVSGE+YYGAK SINVWAPRVTNQYEFS
Subjt:  PYDRSPSGGGASETFQLWSMSGESCPEGTVPIRRTREKDMLRASSVQRFGRKVRRRRIRRDSTSNGHEHAVGFVSGEQYYGAKASINVWAPRVTNQYEFS

Query:  LSQMWVISGSFGDDLNTIEAGWQASFSVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNSRIAIGAAISPTSSYNGGQFDISLLVWKDPKHGN
        LSQMWVISGSFGDDLNTIEAGWQ    VSPELYGDNYPRFFTYWTSDAYQ TGCYNLLCSGFVQTN+RIAIGAAISPTSSYNGGQFDISLLVWKDPKHGN
Subjt:  LSQMWVISGSFGDDLNTIEAGWQASFSVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNSRIAIGAAISPTSSYNGGQFDISLLVWKDPKHGN

Query:  WWLEFGSGVLVGYWPAFLFTHLQSHATMIQFGGEVVNSRASGFHTATQMGSGHFAGEGFGKASYFRNLQVVDWDNSLIPLSNLKVLADHPNCYDIQGGIN
        WWLEFGSGVLVGYWPAFLFTHL+SHA+MIQFGGEVVNSRASGFHTATQMGSGHFAGEGFGKASYFRNLQVVDWDNSLIPLSNLKVLADHPNCY+IQGGIN
Subjt:  WWLEFGSGVLVGYWPAFLFTHLQSHATMIQFGGEVVNSRASGFHTATQMGSGHFAGEGFGKASYFRNLQVVDWDNSLIPLSNLKVLADHPNCYDIQGGIN

Query:  RVWGNYFYYGGPGRNVRCP
        RVWGNYFYYGGPGRNVRCP
Subjt:  RVWGNYFYYGGPGRNVRCP

A0A6J1B0N7 uncharacterized protein LOC1104230591.11e-26777.19Show/hide
Query:  STFRWAKAKNLWRCCGDCRRRQGQCTRPNY-FAKQTRQPNFPLMIM---DSSNLVPVLVFSLLVVSSFCPVH-SDDKNILPNNHTATLQPEDELNKLKLI
        ++F   K ++ W CC   RRRQ QCT+     A++T + NF  M       S ++P+ VF LLV SS CPV+ S+  N+  +N T   +PE+EL KLK+I
Subjt:  STFRWAKAKNLWRCCGDCRRRQGQCTRPNY-FAKQTRQPNFPLMIM---DSSNLVPVLVFSLLVVSSFCPVH-SDDKNILPNNHTATLQPEDELNKLKLI

Query:  RAHLKKINKPPVKTIQSPDGDLIDCVISHHQPAFDHPLLKGQKPLDLPERPYDRSPSGGGASETFQLWSMSGESCPEGTVPIRRTREKDMLRASSVQRFG
        R  LKKINKP  KTIQSPDGD+IDCV+ HHQPAFDHP LKGQKPLD PERP   +P+G  A+E FQLWSMSGESCPEGT+PIRRT E+DMLRASSV+RFG
Subjt:  RAHLKKINKPPVKTIQSPDGDLIDCVISHHQPAFDHPLLKGQKPLDLPERPYDRSPSGGGASETFQLWSMSGESCPEGTVPIRRTREKDMLRASSVQRFG

Query:  RKVRRRRIRRDSTSNGHEHAVGFVSGEQYYGAKASINVWAPRVTNQYEFSLSQMWVISGSFGDDLNTIEAGWQASFSVSPELYGDNYPRFFTYWTSDAYQ
        RK RRR +RRDSTSNGHEHAVG+VSG+QYYGAKASINVWAPRV+NQYEFSLSQMWVISGSFGDDLNTIEAGWQ    VSPELYGDNYPRFFTYWT+DAYQ
Subjt:  RKVRRRRIRRDSTSNGHEHAVGFVSGEQYYGAKASINVWAPRVTNQYEFSLSQMWVISGSFGDDLNTIEAGWQASFSVSPELYGDNYPRFFTYWTSDAYQ

Query:  ATGCYNLLCSGFVQTNSRIAIGAAISPTSSYNGGQFDISLLVWKDPKHGNWWLEFGSGVLVGYWPAFLFTHLQSHATMIQFGGEVVNSRASGFHTATQMG
        ATGCYNLLCSGFVQTN+RIAIGAAISPTSSYNGGQFDISLLVWKDPKHGNWWLEFGSG+LVGYWP+FLFTHL+ HA+M+QFGGE+VNSR+ GFHT+T+MG
Subjt:  ATGCYNLLCSGFVQTNSRIAIGAAISPTSSYNGGQFDISLLVWKDPKHGNWWLEFGSGVLVGYWPAFLFTHLQSHATMIQFGGEVVNSRASGFHTATQMG

Query:  SGHFAGEGFGKASYFRNLQVVDWDNSLIPLSNLKVLADHPNCYDIQGGINRVWGNYFYYGGPGRNVRCP
        SGHFAGEGFGKASYFRNLQVVDWDN+LIPL+NL+VLADHPNCYDIQGGINRVWGNYFYYGGPGRNVRCP
Subjt:  SGHFAGEGFGKASYFRNLQVVDWDNSLIPLSNLKVLADHPNCYDIQGGINRVWGNYFYYGGPGRNVRCP

A0A6J1DC30 uncharacterized protein LOC1110196610.099.14Show/hide
Query:  MLSTFRWAKAKNLWRCCGDCRRRQGQCTRPNYFAKQTRQPNFPLMIMDSSNLVPVLVFSLLVVSSFCPVHSDDKNILPNNHTATLQPEDELNKLKLIRAH
        MLSTFRWAKAKNLWRCCGDCRRRQGQCTRPNYFAKQTRQPNFPLMIMDSSNLVPVLVFSLLVVSSFCPVHSDDKNILPNNHTATLQPEDELNKLKLIRAH
Subjt:  MLSTFRWAKAKNLWRCCGDCRRRQGQCTRPNYFAKQTRQPNFPLMIMDSSNLVPVLVFSLLVVSSFCPVHSDDKNILPNNHTATLQPEDELNKLKLIRAH

Query:  LKKINKPPVKTIQSPDGDLIDCVISHHQPAFDHPLLKGQKPLDLPERPYDRSPSGGGASETFQLWSMSGESCPEGTVPIRRTREKDMLRASSVQRFGRKV
        LKKINKPPVKTIQSPDGDLIDCVISHHQPAFDHPLLKGQKPLDLPERPYDRSPSGGGASETFQLWSMSGESCPEGTVPIRRTREKDMLRASSVQRFGRKV
Subjt:  LKKINKPPVKTIQSPDGDLIDCVISHHQPAFDHPLLKGQKPLDLPERPYDRSPSGGGASETFQLWSMSGESCPEGTVPIRRTREKDMLRASSVQRFGRKV

Query:  RRRRIRRDSTSNGHEHAVGFVSGEQYYGAKASINVWAPRVTNQYEFSLSQMWVISGSFGDDLNTIEAGWQASFSVSPELYGDNYPRFFTYWTSDAYQATG
        RRRRIRRDSTSNGHEHAVGFVSGEQYYGAKASINVWAPRVTNQYEFSLSQMWVISGSFGDDLNTIEAGWQ    VSPELYGDNYPRFFTYWTSDAYQATG
Subjt:  RRRRIRRDSTSNGHEHAVGFVSGEQYYGAKASINVWAPRVTNQYEFSLSQMWVISGSFGDDLNTIEAGWQASFSVSPELYGDNYPRFFTYWTSDAYQATG

Query:  CYNLLCSGFVQTNSRIAIGAAISPTSSYNGGQFDISLLVWKDPKHGNWWLEFGSGVLVGYWPAFLFTHLQSHATMIQFGGEVVNSRASGFHTATQMGSGH
        CYNLLCSGFVQTNSRIAIGAAISPTSSYNGGQFDISLLVWKDPKHGNWWLEFGSGVLVGYWPAFLFTHLQSHATMIQFGGEVVNSRASGFHTATQMGSGH
Subjt:  CYNLLCSGFVQTNSRIAIGAAISPTSSYNGGQFDISLLVWKDPKHGNWWLEFGSGVLVGYWPAFLFTHLQSHATMIQFGGEVVNSRASGFHTATQMGSGH

Query:  FAGEGFGKASYFRNLQVVDWDNSLIPLSNLKVLADHPNCYDIQGGINRVWGNYFYYGGPGRNVRCP
        FAGEGFGKASYFRNLQVVDWDNSLIPLSNLKVLADHPNCYDIQGGINRVWGNYFYYGGPGRNVRCP
Subjt:  FAGEGFGKASYFRNLQVVDWDNSLIPLSNLKVLADHPNCYDIQGGINRVWGNYFYYGGPGRNVRCP

A0A6J1GXT6 uncharacterized protein LOC1114581792.94e-28088.29Show/hide
Query:  MDSSN----LVPVLVFSLLVVSSFCPV---HSDDKNILPNNHTATLQPEDELNKLKLIRAHLKKINKPPVKTIQSPDGDLIDCVISHHQPAFDHPLLKGQ
        MDSSN    L+P+LVFSLLVVS   PV   H DDK I PNN T T +P+D+L KLKLIRAHLKKINKP VKTIQSPDGDLIDCVISH QPAFDHPLLKGQ
Subjt:  MDSSN----LVPVLVFSLLVVSSFCPV---HSDDKNILPNNHTATLQPEDELNKLKLIRAHLKKINKPPVKTIQSPDGDLIDCVISHHQPAFDHPLLKGQ

Query:  KPLDLPERPYDRSPSGGGASETFQLWSMSGESCPEGTVPIRRTREKDMLRASSVQRFGRKVRRRRIRRDSTSNGHEHAVGFVSGEQYYGAKASINVWAPR
        KPLD+ ERPYD+S SG   SE+FQLWSMSGESCPEGTVPIRRT EKDMLRASS++RFGRK RRR IRRDS++ GHEHAVGFVSG++YYGAKASINVWAPR
Subjt:  KPLDLPERPYDRSPSGGGASETFQLWSMSGESCPEGTVPIRRTREKDMLRASSVQRFGRKVRRRRIRRDSTSNGHEHAVGFVSGEQYYGAKASINVWAPR

Query:  VTNQYEFSLSQMWVISGSFGDDLNTIEAGWQASFSVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNSRIAIGAAISPTSSYNGGQFDISLLV
        VTNQYEFSLSQMWVISGSFGDDLNTIEAGWQ    VSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNSRIAIGAAISPTSSYNGGQFDISLLV
Subjt:  VTNQYEFSLSQMWVISGSFGDDLNTIEAGWQASFSVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNSRIAIGAAISPTSSYNGGQFDISLLV

Query:  WKDPKHGNWWLEFGSGVLVGYWPAFLFTHLQSHATMIQFGGEVVNSRASGFHTATQMGSGHFAGEGFGKASYFRNLQVVDWDNSLIPLSNLKVLADHPNC
        WKDPKHGNWWLEFGSGVLVGYWPAFLFTHL+SHATMIQFGGEVVNS+ SGFHTATQMGSGHFAGEGFGKASYFRNLQVVDWDNSLIPLSNLKVLADHPNC
Subjt:  WKDPKHGNWWLEFGSGVLVGYWPAFLFTHLQSHATMIQFGGEVVNSRASGFHTATQMGSGHFAGEGFGKASYFRNLQVVDWDNSLIPLSNLKVLADHPNC

Query:  YDIQGGINRVWGNYFYYGGPGRNVRCP
        Y+IQGGINRVWGNYFYYGGPGRNVRCP
Subjt:  YDIQGGINRVWGNYFYYGGPGRNVRCP

A0A6J1IRB1 uncharacterized protein LOC1114777502.30e-27787.59Show/hide
Query:  MDSSN----LVPVLVFSLLVVSSFCPV---HSDDKNILPNNHTATLQPEDELNKLKLIRAHLKKINKPPVKTIQSPDGDLIDCVISHHQPAFDHPLLKGQ
        MDSSN    L+P+LVF LL +S   PV   H DDK I PNN T T  P+D+L KLKLIRAHLKKINKP VKTIQSPDGDLIDCVISH QPAFDHPLLKGQ
Subjt:  MDSSN----LVPVLVFSLLVVSSFCPV---HSDDKNILPNNHTATLQPEDELNKLKLIRAHLKKINKPPVKTIQSPDGDLIDCVISHHQPAFDHPLLKGQ

Query:  KPLDLPERPYDRSPSGGGASETFQLWSMSGESCPEGTVPIRRTREKDMLRASSVQRFGRKVRRRRIRRDSTSNGHEHAVGFVSGEQYYGAKASINVWAPR
        KPLD+ ERPYD+S S    SE+FQLWSMSGESCPEGTVPIRRT EKDMLRASSV+RFGRK RRR IRRDS++ GHEHAVGFVSG++YYGAKASINVWAPR
Subjt:  KPLDLPERPYDRSPSGGGASETFQLWSMSGESCPEGTVPIRRTREKDMLRASSVQRFGRKVRRRRIRRDSTSNGHEHAVGFVSGEQYYGAKASINVWAPR

Query:  VTNQYEFSLSQMWVISGSFGDDLNTIEAGWQASFSVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNSRIAIGAAISPTSSYNGGQFDISLLV
        VTNQYEFSLSQMWVISGSFGDDLNTIEAGWQ    VSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNSRIAIGAAISPTSSYNGGQFDISLLV
Subjt:  VTNQYEFSLSQMWVISGSFGDDLNTIEAGWQASFSVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNSRIAIGAAISPTSSYNGGQFDISLLV

Query:  WKDPKHGNWWLEFGSGVLVGYWPAFLFTHLQSHATMIQFGGEVVNSRASGFHTATQMGSGHFAGEGFGKASYFRNLQVVDWDNSLIPLSNLKVLADHPNC
        WKDPKHGNWWLEFGSGVLVGYWPAFLFTHL+SHATMIQFGGEVVNS+ SGFHTATQMGSGHFAGEGFGKASYFRNLQVVDWDNSLIPLSNLKVLADHPNC
Subjt:  WKDPKHGNWWLEFGSGVLVGYWPAFLFTHLQSHATMIQFGGEVVNSRASGFHTATQMGSGHFAGEGFGKASYFRNLQVVDWDNSLIPLSNLKVLADHPNC

Query:  YDIQGGINRVWGNYFYYGGPGRNVRCP
        Y+IQGGINRVWGNYFYYGGPGRNVRCP
Subjt:  YDIQGGINRVWGNYFYYGGPGRNVRCP

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G10750.1 Protein of Unknown Function (DUF239)4.8e-18673.44Show/hide
Query:  SSNLVPVLVFSLLVVSSFCPVHSDDKNILPNNHTATLQPEDELNKLKLIRAHLKKINKPPVKTIQSPDGDLIDCVISHHQPAFDHPLLKGQKPLDLPERP
        SSN+  + +  LL+ SSF  V S+  N+ P N   TL+P DELNKLK I  HL+KINKP +KTI SPDGD+IDCV+ HHQPAFDHP L+GQKPLD PERP
Subjt:  SSNLVPVLVFSLLVVSSFCPVHSDDKNILPNNHTATLQPEDELNKLKLIRAHLKKINKPPVKTIQSPDGDLIDCVISHHQPAFDHPLLKGQKPLDLPERP

Query:  YDRSPSGGGASETFQLWSMSGESCPEGTVPIRRTREKDMLRASSVQRFGRKVRRRRIRRDSTSNGHEHAVGFVSGEQYYGAKASINVWAPRVTNQYEFSL
           +   G   ++FQLW M GE+CPEGTVPIRRT+E+D+LRA+SV  FG+K+  R  RRD++SNGHEHAVG+VSGE+YYGAKASINVWAP+V NQYEFSL
Subjt:  YDRSPSGGGASETFQLWSMSGESCPEGTVPIRRTREKDMLRASSVQRFGRKVRRRRIRRDSTSNGHEHAVGFVSGEQYYGAKASINVWAPRVTNQYEFSL

Query:  SQMWVISGSFGDDLNTIEAGWQASFSVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNSRIAIGAAISPTSSYNGGQFDISLLVWKDPKHGNW
        SQ+W+ISGSFG+DLNTIEAGWQ    VSPELYGDNYPRFFTYWT+DAYQATGCYNLLCSGFVQTNS IAIGAAISP+SSY GGQFDI+LL+WKDPKHGNW
Subjt:  SQMWVISGSFGDDLNTIEAGWQASFSVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNSRIAIGAAISPTSSYNGGQFDISLLVWKDPKHGNW

Query:  WLEFGSGVLVGYWPAFLFTHLQSHATMIQFGGEVVNSRASGFHTATQMGSGHFAGEGFGKASYFRNLQVVDWDNSLIPLSNLKVLADHPNCYDIQGGINR
        WLEFGSG+LVGYWP+FLFTHL+ HA+M+Q+GGE+VNS   G HT+TQMGSGHFA EGF K+SYFRN+QVVDWDN+L+P  NL+VLADHPNCYDIQGG NR
Subjt:  WLEFGSGVLVGYWPAFLFTHLQSHATMIQFGGEVVNSRASGFHTATQMGSGHFAGEGFGKASYFRNLQVVDWDNSLIPLSNLKVLADHPNCYDIQGGINR

Query:  VWGNYFYYGGPGRNVRCP
         WG+YFYYGGPG+N +CP
Subjt:  VWGNYFYYGGPGRNVRCP

AT1G23340.1 Protein of Unknown Function (DUF239)4.5e-18472.51Show/hide
Query:  LVFSLLVVSSFCPVHSDDKNILPNNHTATLQPEDELNKLKLIRAHLKKINKPPVKTIQSPDGDLIDCVISHHQPAFDHPLLKGQKPLDLPERPYDRSPSG
        L F+ +++ S    ++   N    + T  L+P+ E+ K+KLIR  L+KINKP +KTI S DGD IDCV SHHQPAFDHPLL+GQ+P+D PE P   S   
Subjt:  LVFSLLVVSSFCPVHSDDKNILPNNHTATLQPEDELNKLKLIRAHLKKINKPPVKTIQSPDGDLIDCVISHHQPAFDHPLLKGQKPLDLPERPYDRSPSG

Query:  GGASETFQLWSMSGESCPEGTVPIRRTREKDMLRASSVQRFGRKVRRRRIRRDSTSNGHEHAVGFVSGEQYYGAKASINVWAPRVTNQYEFSLSQMWVIS
          + E FQLWS+ GESCPEGT+PIRRT E+DMLRA+SV+RFGRK+  RR+RRDS+SNGHEHAVG+VSG QYYGAKASINVW PRV +QYEFSLSQ+W+I+
Subjt:  GGASETFQLWSMSGESCPEGTVPIRRTREKDMLRASSVQRFGRKVRRRRIRRDSTSNGHEHAVGFVSGEQYYGAKASINVWAPRVTNQYEFSLSQMWVIS

Query:  GSFGDDLNTIEAGWQASFSVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNSRIAIGAAISPTSSYNGGQFDISLLVWKDPKHGNWWLEFGSG
        GSF  DLNTIEAGWQ    +SPELYGD  PRFFTYWTSDAYQATGCYNLLCSGFVQTN+RIAIGAAISP SSY GGQFDISLL+WKDPKHG+WWL+FGSG
Subjt:  GSFGDDLNTIEAGWQASFSVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNSRIAIGAAISPTSSYNGGQFDISLLVWKDPKHGNWWLEFGSG

Query:  VLVGYWPAFLFTHLQSHATMIQFGGEVVNSRASGFHTATQMGSGHFAGEGFGKASYFRNLQVVDWDNSLIPLSNLKVLADHPNCYDIQGGINRVWGNYFY
         LVGYWP  LFTHL+ H  M+QFGGE+VN+R  G HT+TQMGSGHFAGEGFGKASYFRNLQ+VDWDN+LIP+SNLKVLADHPNCYDI+GG+NRVWGN+FY
Subjt:  VLVGYWPAFLFTHLQSHATMIQFGGEVVNSRASGFHTATQMGSGHFAGEGFGKASYFRNLQVVDWDNSLIPLSNLKVLADHPNCYDIQGGINRVWGNYFY

Query:  YGGPGRNVRCP
        YGGPG+N +CP
Subjt:  YGGPGRNVRCP

AT1G23340.2 Protein of Unknown Function (DUF239)4.5e-18472.51Show/hide
Query:  LVFSLLVVSSFCPVHSDDKNILPNNHTATLQPEDELNKLKLIRAHLKKINKPPVKTIQSPDGDLIDCVISHHQPAFDHPLLKGQKPLDLPERPYDRSPSG
        L F+ +++ S    ++   N    + T  L+P+ E+ K+KLIR  L+KINKP +KTI S DGD IDCV SHHQPAFDHPLL+GQ+P+D PE P   S   
Subjt:  LVFSLLVVSSFCPVHSDDKNILPNNHTATLQPEDELNKLKLIRAHLKKINKPPVKTIQSPDGDLIDCVISHHQPAFDHPLLKGQKPLDLPERPYDRSPSG

Query:  GGASETFQLWSMSGESCPEGTVPIRRTREKDMLRASSVQRFGRKVRRRRIRRDSTSNGHEHAVGFVSGEQYYGAKASINVWAPRVTNQYEFSLSQMWVIS
          + E FQLWS+ GESCPEGT+PIRRT E+DMLRA+SV+RFGRK+  RR+RRDS+SNGHEHAVG+VSG QYYGAKASINVW PRV +QYEFSLSQ+W+I+
Subjt:  GGASETFQLWSMSGESCPEGTVPIRRTREKDMLRASSVQRFGRKVRRRRIRRDSTSNGHEHAVGFVSGEQYYGAKASINVWAPRVTNQYEFSLSQMWVIS

Query:  GSFGDDLNTIEAGWQASFSVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNSRIAIGAAISPTSSYNGGQFDISLLVWKDPKHGNWWLEFGSG
        GSF  DLNTIEAGWQ    +SPELYGD  PRFFTYWTSDAYQATGCYNLLCSGFVQTN+RIAIGAAISP SSY GGQFDISLL+WKDPKHG+WWL+FGSG
Subjt:  GSFGDDLNTIEAGWQASFSVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNSRIAIGAAISPTSSYNGGQFDISLLVWKDPKHGNWWLEFGSG

Query:  VLVGYWPAFLFTHLQSHATMIQFGGEVVNSRASGFHTATQMGSGHFAGEGFGKASYFRNLQVVDWDNSLIPLSNLKVLADHPNCYDIQGGINRVWGNYFY
         LVGYWP  LFTHL+ H  M+QFGGE+VN+R  G HT+TQMGSGHFAGEGFGKASYFRNLQ+VDWDN+LIP+SNLKVLADHPNCYDI+GG+NRVWGN+FY
Subjt:  VLVGYWPAFLFTHLQSHATMIQFGGEVVNSRASGFHTATQMGSGHFAGEGFGKASYFRNLQVVDWDNSLIPLSNLKVLADHPNCYDIQGGINRVWGNYFY

Query:  YGGPGRNVRCP
        YGGPG+N +CP
Subjt:  YGGPGRNVRCP

AT1G70550.1 Protein of Unknown Function (DUF239)3.5e-18970.46Show/hide
Query:  KNLWRCC-GDCRRRQGQCTRPNYFAKQTRQPNFPLMIMDSSNLVPVLVFSLLVVSSFCPVHSDDKNILPNNHTATLQPEDELNKLKLIRAHLKKINKPPV
        +N  +CC  D  RR+ Q +R   F   ++Q +F    M SS+ + +++   LV SSF    S   +   +    TL+P++EL KL LIR  L KINKP V
Subjt:  KNLWRCC-GDCRRRQGQCTRPNYFAKQTRQPNFPLMIMDSSNLVPVLVFSLLVVSSFCPVHSDDKNILPNNHTATLQPEDELNKLKLIRAHLKKINKPPV

Query:  KTIQSPDGDLIDCVISHHQPAFDHPLLKGQKPLDLPERPYDRSPSGGGASETFQLWSMSGESCPEGTVPIRRTREKDMLRASSVQRFGRKVRRRRIRRDS
        KTIQS DGD IDCV +H QPAFDHPLL+GQKPLD PE P   S    G+ E  QLWS+SGESCPEGT+PIRRT E+DMLRASSVQRFGRK+  RR++RDS
Subjt:  KTIQSPDGDLIDCVISHHQPAFDHPLLKGQKPLDLPERPYDRSPSGGGASETFQLWSMSGESCPEGTVPIRRTREKDMLRASSVQRFGRKVRRRRIRRDS

Query:  TSNGHEHAVGFVSGEQYYGAKASINVWAPRVTNQYEFSLSQMWVISGSFGDDLNTIEAGWQASFSVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGF
        T+NGHEHAVG+V+G QYYGAKASINVW+PRVT+QYEFSLSQ+WVI+GSF  DLNTIEAGWQ    +SPELYGD YPRFFTYWTSDAY+ TGCYNLLCSGF
Subjt:  TSNGHEHAVGFVSGEQYYGAKASINVWAPRVTNQYEFSLSQMWVISGSFGDDLNTIEAGWQASFSVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGF

Query:  VQTNSRIAIGAAISPTSSYNGGQFDISLLVWKDPKHGNWWLEFGSGVLVGYWPAFLFTHLQSHATMIQFGGEVVNSRASGFHTATQMGSGHFAGEGFGKA
        VQTN RIAIGAAISP SSY GGQFDISLL+WKDPKHG+WWL+FGSG LVGYWPAFLFTHL+ H +M+QFGGE+VN+R  G HT TQMGSGHFAGEGFGKA
Subjt:  VQTNSRIAIGAAISPTSSYNGGQFDISLLVWKDPKHGNWWLEFGSGVLVGYWPAFLFTHLQSHATMIQFGGEVVNSRASGFHTATQMGSGHFAGEGFGKA

Query:  SYFRNLQVVDWDNSLIPLSNLKVLADHPNCYDIQGGINRVWGNYFYYGGPGRNVRCP
        SYFRNLQ+VDWDN+LIP SNLK+LADHPNCYDI+GG NRVWGNYFYYGGPG+N RCP
Subjt:  SYFRNLQVVDWDNSLIPLSNLKVLADHPNCYDIQGGINRVWGNYFYYGGPGRNVRCP

AT1G70550.2 Protein of Unknown Function (DUF239)1.1e-18774.05Show/hide
Query:  MDSSNLVPVLVFSLLVVSSFCPVHSDDKNILPNNHTATLQPEDELNKLKLIRAHLKKINKPPVKTIQSPDGDLIDCVISHHQPAFDHPLLKGQKPLDLPE
        M SS+ + +++   LV SSF    S   +   +    TL+P++EL KL LIR  L KINKP VKTIQS DGD IDCV +H QPAFDHPLL+GQKPLD PE
Subjt:  MDSSNLVPVLVFSLLVVSSFCPVHSDDKNILPNNHTATLQPEDELNKLKLIRAHLKKINKPPVKTIQSPDGDLIDCVISHHQPAFDHPLLKGQKPLDLPE

Query:  RPYDRSPSGGGASETFQLWSMSGESCPEGTVPIRRTREKDMLRASSVQRFGRKVRRRRIRRDSTSNGHEHAVGFVSGEQYYGAKASINVWAPRVTNQYEF
         P   S    G+ E  QLWS+SGESCPEGT+PIRRT E+DMLRASSVQRFGRK+  RR++RDST+NGHEHAVG+V+G QYYGAKASINVW+PRVT+QYEF
Subjt:  RPYDRSPSGGGASETFQLWSMSGESCPEGTVPIRRTREKDMLRASSVQRFGRKVRRRRIRRDSTSNGHEHAVGFVSGEQYYGAKASINVWAPRVTNQYEF

Query:  SLSQMWVISGSFGDDLNTIEAGWQASFSVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNSRIAIGAAISPTSSYNGGQFDISLLVWKDPKHG
        SLSQ+WVI+GSF  DLNTIEAGWQ    +SPELYGD YPRFFTYWTSDAY+ TGCYNLLCSGFVQTN RIAIGAAISP SSY GGQFDISLL+WKDPKHG
Subjt:  SLSQMWVISGSFGDDLNTIEAGWQASFSVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNSRIAIGAAISPTSSYNGGQFDISLLVWKDPKHG

Query:  NWWLEFGSGVLVGYWPAFLFTHLQSHATMIQFGGEVVNSRASGFHTATQMGSGHFAGEGFGKASYFRNLQVVDWDNSLIPLSNLKVLADHPNCYDIQGGI
        +WWL+FGSG LVGYWPAFLFTHL+ H +M+QFGGE+VN+R  G HT TQMGSGHFAGEGFGKASYFRNLQ+VDWDN+LIP SNLK+LADHPNCYDI+GG 
Subjt:  NWWLEFGSGVLVGYWPAFLFTHLQSHATMIQFGGEVVNSRASGFHTATQMGSGHFAGEGFGKASYFRNLQVVDWDNSLIPLSNLKVLADHPNCYDIQGGI

Query:  NRVWGNYFYYGGPGRNVRCP
        NRVWGNYFYYGGPG+N RCP
Subjt:  NRVWGNYFYYGGPGRNVRCP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCTCAGTACGTTTCGTTGGGCAAAAGCTAAGAACCTATGGCGCTGCTGCGGTGATTGCAGAAGAAGACAAGGCCAATGCACACGACCAAATTACTTTGCCAAACAGAC
AAGACAACCAAATTTCCCACTCATGATCATGGATTCTTCTAACCTCGTCCCTGTTCTTGTTTTTTCCCTTCTTGTCGTTTCGTCGTTTTGTCCCGTTCATTCAGACGACA
AAAATATCCTTCCCAATAACCACACGGCCACGCTCCAGCCGGAGGATGAATTAAACAAGTTGAAGCTCATTAGAGCCCATCTCAAGAAAATCAACAAGCCCCCCGTCAAA
ACCATTCAGAGTCCGGACGGGGATCTTATAGATTGTGTAATATCTCATCACCAGCCGGCTTTTGATCATCCATTGCTGAAAGGACAGAAGCCATTGGATCTGCCGGAGAG
ACCTTATGACCGGAGCCCTTCCGGCGGAGGGGCATCGGAAACGTTTCAATTATGGAGCATGTCCGGCGAGTCTTGCCCGGAAGGAACTGTTCCGATCAGAAGAACGAGGG
AAAAGGATATGCTGAGAGCCAGCTCCGTTCAAAGATTTGGAAGAAAAGTTAGAAGAAGACGCATTAGAAGAGACTCCACCAGCAACGGCCACGAGCATGCAGTGGGATTT
GTAAGCGGAGAACAGTACTACGGAGCAAAGGCGAGCATAAACGTATGGGCGCCGCGGGTGACGAATCAATACGAGTTCAGTCTGTCACAGATGTGGGTCATTTCTGGTTC
ATTTGGGGATGATCTAAACACCATTGAAGCTGGCTGGCAGGCAAGTTTCTCAGTTAGCCCAGAGCTATATGGAGACAATTACCCTAGATTTTTCACTTATTGGACATCAG
ACGCATATCAAGCGACTGGATGCTACAATTTACTCTGCTCGGGCTTCGTTCAGACCAACAGCAGAATCGCCATTGGAGCTGCAATTTCTCCAACTTCTTCCTACAATGGC
GGCCAATTCGACATCAGTCTACTGGTTTGGAAGGATCCAAAGCATGGGAATTGGTGGTTGGAATTCGGGTCGGGCGTGCTGGTCGGGTACTGGCCCGCATTCTTGTTCAC
TCACCTTCAAAGCCACGCGACGATGATTCAGTTCGGTGGGGAGGTGGTGAATTCGAGGGCCTCTGGATTTCACACGGCGACGCAAATGGGCAGCGGCCATTTCGCCGGCG
AGGGCTTTGGAAAAGCTTCGTATTTTCGTAACCTTCAAGTAGTGGATTGGGACAACAGCTTAATTCCACTGTCAAACTTGAAGGTTTTGGCAGATCATCCAAATTGCTAC
GACATTCAAGGAGGGATTAATAGAGTTTGGGGAAATTACTTTTACTATGGTGGGCCTGGAAGAAATGTGAGATGCCCCTAA
mRNA sequenceShow/hide mRNA sequence
AGAACATCTAGGGAAGGGAGAAAGAGAAACCCAACTTTAGAGAACCAGTTTTAATTCCCATCTATTCTATAAATCTCACAGGTCACAGCCATATATGTGAAGAAGAGAAT
AAACAGATTTTCGGACCCAGATTTTAGCCTTAACGAAAATTGACACAAAAAGCTAAAGTATTCAGTCTTTGTATCGGAGACTGGAAACACAACCCCCTCTCCTTCTCTCT
CTCTCTCTCTATAATTTCTTTACGAATTTTGAGGAATTGGGCATGATTTCATCACTACCCAAGTCCAAGTGTATATATAATAGCACACTCACATGGGCAAAACTTTAAAG
AAAATCCAACCTCATAACCTGTCAGAAAGATAAAACCAAGGCCACGGAATTAGAAGAAGAAGAAGGAGTAGTAGTAGAGAGAAATTTCAGAGGGGAGCTTTCAAAGACAT
TGGAAATGGCGAGACAGTGAAATTTTCAATTTAATGGTCAATTCGAGTTTGGCAATTTGAGAAGATGCTCAGTACGTTTCGTTGGGCAAAAGCTAAGAACCTATGGCGCT
GCTGCGGTGATTGCAGAAGAAGACAAGGCCAATGCACACGACCAAATTACTTTGCCAAACAGACAAGACAACCAAATTTCCCACTCATGATCATGGATTCTTCTAACCTC
GTCCCTGTTCTTGTTTTTTCCCTTCTTGTCGTTTCGTCGTTTTGTCCCGTTCATTCAGACGACAAAAATATCCTTCCCAATAACCACACGGCCACGCTCCAGCCGGAGGA
TGAATTAAACAAGTTGAAGCTCATTAGAGCCCATCTCAAGAAAATCAACAAGCCCCCCGTCAAAACCATTCAGAGTCCGGACGGGGATCTTATAGATTGTGTAATATCTC
ATCACCAGCCGGCTTTTGATCATCCATTGCTGAAAGGACAGAAGCCATTGGATCTGCCGGAGAGACCTTATGACCGGAGCCCTTCCGGCGGAGGGGCATCGGAAACGTTT
CAATTATGGAGCATGTCCGGCGAGTCTTGCCCGGAAGGAACTGTTCCGATCAGAAGAACGAGGGAAAAGGATATGCTGAGAGCCAGCTCCGTTCAAAGATTTGGAAGAAA
AGTTAGAAGAAGACGCATTAGAAGAGACTCCACCAGCAACGGCCACGAGCATGCAGTGGGATTTGTAAGCGGAGAACAGTACTACGGAGCAAAGGCGAGCATAAACGTAT
GGGCGCCGCGGGTGACGAATCAATACGAGTTCAGTCTGTCACAGATGTGGGTCATTTCTGGTTCATTTGGGGATGATCTAAACACCATTGAAGCTGGCTGGCAGGCAAGT
TTCTCAGTTAGCCCAGAGCTATATGGAGACAATTACCCTAGATTTTTCACTTATTGGACATCAGACGCATATCAAGCGACTGGATGCTACAATTTACTCTGCTCGGGCTT
CGTTCAGACCAACAGCAGAATCGCCATTGGAGCTGCAATTTCTCCAACTTCTTCCTACAATGGCGGCCAATTCGACATCAGTCTACTGGTTTGGAAGGATCCAAAGCATG
GGAATTGGTGGTTGGAATTCGGGTCGGGCGTGCTGGTCGGGTACTGGCCCGCATTCTTGTTCACTCACCTTCAAAGCCACGCGACGATGATTCAGTTCGGTGGGGAGGTG
GTGAATTCGAGGGCCTCTGGATTTCACACGGCGACGCAAATGGGCAGCGGCCATTTCGCCGGCGAGGGCTTTGGAAAAGCTTCGTATTTTCGTAACCTTCAAGTAGTGGA
TTGGGACAACAGCTTAATTCCACTGTCAAACTTGAAGGTTTTGGCAGATCATCCAAATTGCTACGACATTCAAGGAGGGATTAATAGAGTTTGGGGAAATTACTTTTACT
ATGGTGGGCCTGGAAGAAATGTGAGATGCCCCTAATTGGGTCTGTTTCTCTCCAATGTAATTTTCTTGTGGGGGTGATTTGGGTTTTTTTCGTTTTTTCTTGAGAGAAAA
GTGTGAGATATTCTTGTATTTGTTTCCTATTTTAGTTCATTTTTTGGTCCCTTCTAATTCAAATCACTAAAGTGTGAGAATCGGGAGGACCATTGTAAATTTTGACAAAG
AAATAGTGAATTATACTAAGATTGTGGAATTTATTCTCATTTTTTTAAGTAGTGGCAGATGGGGCTGGC
Protein sequenceShow/hide protein sequence
MLSTFRWAKAKNLWRCCGDCRRRQGQCTRPNYFAKQTRQPNFPLMIMDSSNLVPVLVFSLLVVSSFCPVHSDDKNILPNNHTATLQPEDELNKLKLIRAHLKKINKPPVK
TIQSPDGDLIDCVISHHQPAFDHPLLKGQKPLDLPERPYDRSPSGGGASETFQLWSMSGESCPEGTVPIRRTREKDMLRASSVQRFGRKVRRRRIRRDSTSNGHEHAVGF
VSGEQYYGAKASINVWAPRVTNQYEFSLSQMWVISGSFGDDLNTIEAGWQASFSVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNSRIAIGAAISPTSSYNG
GQFDISLLVWKDPKHGNWWLEFGSGVLVGYWPAFLFTHLQSHATMIQFGGEVVNSRASGFHTATQMGSGHFAGEGFGKASYFRNLQVVDWDNSLIPLSNLKVLADHPNCY
DIQGGINRVWGNYFYYGGPGRNVRCP