; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS015039 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS015039
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionProtein of Unknown Function (DUF239)
Genome locationscaffold2:765260..767798
RNA-Seq ExpressionMS015039
SyntenyMS015039
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR004314 - Neprosin
IPR025521 - Neprosin activation peptide


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6601521.1 hypothetical protein SDJN03_06754, partial [Cucurbita argyrosperma subsp. sororia]2.6e-22088.29Show/hide
Query:  MDSSN----LVPVLVFSLLVVSSFCPV---HSDDKNILPNNHTATLRPEDELNKLKLIRAHLKKINKPPVKTIQSPDGDLIDCVISHHQPAFDHPLLKGQ
        MDSSN    L+P+LVFSLLVVS   PV   H DDK I PNN T T RP+D+L KLKLIRAHLKKINKP VKTIQSPDGDLIDCVISH QPAFDHPLLKGQ
Subjt:  MDSSN----LVPVLVFSLLVVSSFCPV---HSDDKNILPNNHTATLRPEDELNKLKLIRAHLKKINKPPVKTIQSPDGDLIDCVISHHQPAFDHPLLKGQ

Query:  KPLDLPERPYDRSPSGGGASETFQLWSMSGESCPEGTVPIRRTREKDMLRASSVQRFGRKVRRRRIRRDSTSNGHEHAVGFVSGEQYYGAKASINVWAPR
        KPLD+ +RPYD+SPSG   SE+FQLWSMSGESCPEGTVPIRRT EKDMLRASSV+RFGRK  RRRIRRDS++ GHEHAVGFVSG++YYGAKASINVWAPR
Subjt:  KPLDLPERPYDRSPSGGGASETFQLWSMSGESCPEGTVPIRRTREKDMLRASSVQRFGRKVRRRRIRRDSTSNGHEHAVGFVSGEQYYGAKASINVWAPR

Query:  VTNQYEFSLSQMWVISGSFGDDLNTIEAGWQASFSVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNSRIAIGAAISPTSSYNGGQFDISLLV
        VTNQYEFSLSQMWVISGSFGDDLNTIEAGWQ    VSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNSRIAIGAAISPTSSYNGGQFDISLLV
Subjt:  VTNQYEFSLSQMWVISGSFGDDLNTIEAGWQASFSVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNSRIAIGAAISPTSSYNGGQFDISLLV

Query:  WKDPKHGNWWLEFGSGVLVGYWPAFLFTHLQNHATMIQFGGEVVNSRASGFHTATQMGSGHFAGEGFGKASYFRNLQVVDWDNSLIPLSNLKVLADHPNC
        WKDPKHGNWWLEFGSGVLVGYWPAFLFTHL++HATMIQFGGEVVNS+ SGFHTATQMGSGHFAGEGFGKASYFRNLQVVDWDNSLIPLSNLKVLADHPNC
Subjt:  WKDPKHGNWWLEFGSGVLVGYWPAFLFTHLQNHATMIQFGGEVVNSRASGFHTATQMGSGHFAGEGFGKASYFRNLQVVDWDNSLIPLSNLKVLADHPNC

Query:  YDIQGGINRVWGNYFYYGGPGRNVRCP
        Y+IQGGINR+WGNYFYYGGPGRNVRCP
Subjt:  YDIQGGINRVWGNYFYYGGPGRNVRCP

XP_008446177.1 PREDICTED: uncharacterized protein LOC103488982 [Cucumis melo]3.1e-22189.71Show/hide
Query:  SSNLVPVLVFSLLVVSSFCPVHSDDKNILPNNHTATLRPEDELNKLKLIRAHLKKINKPPVKTIQSPDGDLIDCVISHHQPAFDHPLLKGQKPLDLPERP
        SS L+PVLVFSLLVVSSFCPVHS      P N T T RP+D L KLKLIRAHLKKINKPP+KTIQSPDGDLIDCVI+H QPAFDHPLLKGQKPLDLPERP
Subjt:  SSNLVPVLVFSLLVVSSFCPVHSDDKNILPNNHTATLRPEDELNKLKLIRAHLKKINKPPVKTIQSPDGDLIDCVISHHQPAFDHPLLKGQKPLDLPERP

Query:  YDRSPSGGGASETFQLWSMSGESCPEGTVPIRRTREKDMLRASSVQRFGRKVRRRRIRRDSTSNGHEHAVGFVSGEQYYGAKASINVWAPRVTNQYEFSL
        Y+RS SG  +SETFQLWSMSGE CPEG+VPIRRT E DM+RASSVQRFGRKV RRRIRRDS+S+GHEHAVGFVSGE+YYGAK SINVWAPRVTNQYEFSL
Subjt:  YDRSPSGGGASETFQLWSMSGESCPEGTVPIRRTREKDMLRASSVQRFGRKVRRRRIRRDSTSNGHEHAVGFVSGEQYYGAKASINVWAPRVTNQYEFSL

Query:  SQMWVISGSFGDDLNTIEAGWQASFSVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNSRIAIGAAISPTSSYNGGQFDISLLVWKDPKHGNW
        SQMWVISGSFGDDLNTIEAGWQ    VSPELYGDNYPRFFTYWTSDAYQ TGCYNLLCSGFVQTN+RIAIGAAISPTSSYNGGQFDISLLVWKDPKHGNW
Subjt:  SQMWVISGSFGDDLNTIEAGWQASFSVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNSRIAIGAAISPTSSYNGGQFDISLLVWKDPKHGNW

Query:  WLEFGSGVLVGYWPAFLFTHLQNHATMIQFGGEVVNSRASGFHTATQMGSGHFAGEGFGKASYFRNLQVVDWDNSLIPLSNLKVLADHPNCYDIQGGINR
        WLEFGSGVLVGYWPAFLFTHL++HA+MIQFGGEVVNSRASGFHTATQMGSGHFAGEGFGKASYFRNLQVVDWDNSLIPLSNLKVLADHPNCY+IQGGINR
Subjt:  WLEFGSGVLVGYWPAFLFTHLQNHATMIQFGGEVVNSRASGFHTATQMGSGHFAGEGFGKASYFRNLQVVDWDNSLIPLSNLKVLADHPNCYDIQGGINR

Query:  VWGNYFYYGGPGRNVRCP
        VWGNYFYYGGPGRNVRCP
Subjt:  VWGNYFYYGGPGRNVRCP

XP_022151755.1 uncharacterized protein LOC111019661 [Momordica charantia]1.1e-25898.63Show/hide
Query:  RPNYFAKQTRQPNFPLMIMDSSNLVPVLVFSLLVVSSFCPVHSDDKNILPNNHTATLRPEDELNKLKLIRAHLKKINKPPVKTIQSPDGDLIDCVISHHQ
        RPNYFAKQTRQPNFPLMIMDSSNLVPVLVFSLLVVSSFCPVHSDDKNILPNNHTATL+PEDELNKLKLIRAHLKKINKPPVKTIQSPDGDLIDCVISHHQ
Subjt:  RPNYFAKQTRQPNFPLMIMDSSNLVPVLVFSLLVVSSFCPVHSDDKNILPNNHTATLRPEDELNKLKLIRAHLKKINKPPVKTIQSPDGDLIDCVISHHQ

Query:  PAFDHPLLKGQKPLDLPERPYDRSPSGGGASETFQLWSMSGESCPEGTVPIRRTREKDMLRASSVQRFGRKVRRRRIRRDSTSNGHEHAVGFVSGEQYYG
        PAFDHPLLKGQKPLDLPERPYDRSPSGGGASETFQLWSMSGESCPEGTVPIRRTREKDMLRASSVQRFGRKVRRRRIRRDSTSNGHEHAVGFVSGEQYYG
Subjt:  PAFDHPLLKGQKPLDLPERPYDRSPSGGGASETFQLWSMSGESCPEGTVPIRRTREKDMLRASSVQRFGRKVRRRRIRRDSTSNGHEHAVGFVSGEQYYG

Query:  AKASINVWAPRVTNQYEFSLSQMWVISGSFGDDLNTIEAGWQASFSVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNSRIAIGAAISPTSSY
        AKASINVWAPRVTNQYEFSLSQMWVISGSFGDDLNTIEAGWQ    VSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNSRIAIGAAISPTSSY
Subjt:  AKASINVWAPRVTNQYEFSLSQMWVISGSFGDDLNTIEAGWQASFSVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNSRIAIGAAISPTSSY

Query:  NGGQFDISLLVWKDPKHGNWWLEFGSGVLVGYWPAFLFTHLQNHATMIQFGGEVVNSRASGFHTATQMGSGHFAGEGFGKASYFRNLQVVDWDNSLIPLS
        NGGQFDISLLVWKDPKHGNWWLEFGSGVLVGYWPAFLFTHLQ+HATMIQFGGEVVNSRASGFHTATQMGSGHFAGEGFGKASYFRNLQVVDWDNSLIPLS
Subjt:  NGGQFDISLLVWKDPKHGNWWLEFGSGVLVGYWPAFLFTHLQNHATMIQFGGEVVNSRASGFHTATQMGSGHFAGEGFGKASYFRNLQVVDWDNSLIPLS

Query:  NLKVLADHPNCYDIQGGINRVWGNYFYYGGPGRNVRCP
        NLKVLADHPNCYDIQGGINRVWGNYFYYGGPGRNVRCP
Subjt:  NLKVLADHPNCYDIQGGINRVWGNYFYYGGPGRNVRCP

XP_022956450.1 uncharacterized protein LOC111458179 [Cucurbita moschata]9.9e-22088.29Show/hide
Query:  MDSSN----LVPVLVFSLLVVSSFCPV---HSDDKNILPNNHTATLRPEDELNKLKLIRAHLKKINKPPVKTIQSPDGDLIDCVISHHQPAFDHPLLKGQ
        MDSSN    L+P+LVFSLLVVS   PV   H DDK I PNN T T RP+D+L KLKLIRAHLKKINKP VKTIQSPDGDLIDCVISH QPAFDHPLLKGQ
Subjt:  MDSSN----LVPVLVFSLLVVSSFCPV---HSDDKNILPNNHTATLRPEDELNKLKLIRAHLKKINKPPVKTIQSPDGDLIDCVISHHQPAFDHPLLKGQ

Query:  KPLDLPERPYDRSPSGGGASETFQLWSMSGESCPEGTVPIRRTREKDMLRASSVQRFGRKVRRRRIRRDSTSNGHEHAVGFVSGEQYYGAKASINVWAPR
        KPLD+ ERPYD+S SG   SE+FQLWSMSGESCPEGTVPIRRT EKDMLRASS++RFGRK  RRRIRRDS++ GHEHAVGFVSG++YYGAKASINVWAPR
Subjt:  KPLDLPERPYDRSPSGGGASETFQLWSMSGESCPEGTVPIRRTREKDMLRASSVQRFGRKVRRRRIRRDSTSNGHEHAVGFVSGEQYYGAKASINVWAPR

Query:  VTNQYEFSLSQMWVISGSFGDDLNTIEAGWQASFSVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNSRIAIGAAISPTSSYNGGQFDISLLV
        VTNQYEFSLSQMWVISGSFGDDLNTIEAGWQ    VSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNSRIAIGAAISPTSSYNGGQFDISLLV
Subjt:  VTNQYEFSLSQMWVISGSFGDDLNTIEAGWQASFSVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNSRIAIGAAISPTSSYNGGQFDISLLV

Query:  WKDPKHGNWWLEFGSGVLVGYWPAFLFTHLQNHATMIQFGGEVVNSRASGFHTATQMGSGHFAGEGFGKASYFRNLQVVDWDNSLIPLSNLKVLADHPNC
        WKDPKHGNWWLEFGSGVLVGYWPAFLFTHL++HATMIQFGGEVVNS+ SGFHTATQMGSGHFAGEGFGKASYFRNLQVVDWDNSLIPLSNLKVLADHPNC
Subjt:  WKDPKHGNWWLEFGSGVLVGYWPAFLFTHLQNHATMIQFGGEVVNSRASGFHTATQMGSGHFAGEGFGKASYFRNLQVVDWDNSLIPLSNLKVLADHPNC

Query:  YDIQGGINRVWGNYFYYGGPGRNVRCP
        Y+IQGGINRVWGNYFYYGGPGRNVRCP
Subjt:  YDIQGGINRVWGNYFYYGGPGRNVRCP

XP_038891414.1 uncharacterized protein LOC120080834 isoform X1 [Benincasa hispida]1.6e-22591.39Show/hide
Query:  SSNLVPVLVFSLLVVSSFCPVHSDDKNILPNNHTATLRPEDELNKLKLIRAHLKKINKPPVKTIQSPDGDLIDCVISHHQPAFDHPLLKGQKPLDLPERP
        SS+LV VLVFSLLVVSSFCPVHS DK I   N T T RP+D   KLKLIRAHLKKINKPP+KTIQSPDGDLIDCVISH QPAFDHPLLKG+KPLDLPERP
Subjt:  SSNLVPVLVFSLLVVSSFCPVHSDDKNILPNNHTATLRPEDELNKLKLIRAHLKKINKPPVKTIQSPDGDLIDCVISHHQPAFDHPLLKGQKPLDLPERP

Query:  YDRSPSGGGASETFQLWSMSGESCPEGTVPIRRTREKDMLRASSVQRFGRKVRRRRIRRDSTSNGHEHAVGFVSGEQYYGAKASINVWAPRVTNQYEFSL
        ++RS SGGG SETFQLWSMSGE CPEGTVPIRRT EKDMLRASSVQRFGRKV RRRIRRDS+SNGHEHAVGFVSGE+YYGAKASINVWAPRVTNQYEFSL
Subjt:  YDRSPSGGGASETFQLWSMSGESCPEGTVPIRRTREKDMLRASSVQRFGRKVRRRRIRRDSTSNGHEHAVGFVSGEQYYGAKASINVWAPRVTNQYEFSL

Query:  SQMWVISGSFGDDLNTIEAGWQASFSVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNSRIAIGAAISPTSSYNGGQFDISLLVWKDPKHGNW
        SQMWVISGSFGDDLNTIEAGWQ    VSPELYGDNYPRFFTYWTSDAYQ TGCYNLLCSGFVQTN+RIAIGAAISPTSSYNGGQFDISLLVWKDPKHGNW
Subjt:  SQMWVISGSFGDDLNTIEAGWQASFSVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNSRIAIGAAISPTSSYNGGQFDISLLVWKDPKHGNW

Query:  WLEFGSGVLVGYWPAFLFTHLQNHATMIQFGGEVVNSRASGFHTATQMGSGHFAGEGFGKASYFRNLQVVDWDNSLIPLSNLKVLADHPNCYDIQGGINR
        WLEFGSGVLVGYWPAFLFTHL++HATMIQFGGEVVNSR SGFHTATQMGSGHFAGEGFGKASYFRNLQVVDWDNSLIPLSNLKVLADHPNCY+IQGGINR
Subjt:  WLEFGSGVLVGYWPAFLFTHLQNHATMIQFGGEVVNSRASGFHTATQMGSGHFAGEGFGKASYFRNLQVVDWDNSLIPLSNLKVLADHPNCYDIQGGINR

Query:  VWGNYFYYGGPGRNVRCP
        VWGNYFYYGGPGRNVRCP
Subjt:  VWGNYFYYGGPGRNVRCP

TrEMBL top hitse value%identityAlignment
A0A1S3BDY3 uncharacterized protein LOC1034889821.5e-22189.71Show/hide
Query:  SSNLVPVLVFSLLVVSSFCPVHSDDKNILPNNHTATLRPEDELNKLKLIRAHLKKINKPPVKTIQSPDGDLIDCVISHHQPAFDHPLLKGQKPLDLPERP
        SS L+PVLVFSLLVVSSFCPVHS      P N T T RP+D L KLKLIRAHLKKINKPP+KTIQSPDGDLIDCVI+H QPAFDHPLLKGQKPLDLPERP
Subjt:  SSNLVPVLVFSLLVVSSFCPVHSDDKNILPNNHTATLRPEDELNKLKLIRAHLKKINKPPVKTIQSPDGDLIDCVISHHQPAFDHPLLKGQKPLDLPERP

Query:  YDRSPSGGGASETFQLWSMSGESCPEGTVPIRRTREKDMLRASSVQRFGRKVRRRRIRRDSTSNGHEHAVGFVSGEQYYGAKASINVWAPRVTNQYEFSL
        Y+RS SG  +SETFQLWSMSGE CPEG+VPIRRT E DM+RASSVQRFGRKV RRRIRRDS+S+GHEHAVGFVSGE+YYGAK SINVWAPRVTNQYEFSL
Subjt:  YDRSPSGGGASETFQLWSMSGESCPEGTVPIRRTREKDMLRASSVQRFGRKVRRRRIRRDSTSNGHEHAVGFVSGEQYYGAKASINVWAPRVTNQYEFSL

Query:  SQMWVISGSFGDDLNTIEAGWQASFSVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNSRIAIGAAISPTSSYNGGQFDISLLVWKDPKHGNW
        SQMWVISGSFGDDLNTIEAGWQ    VSPELYGDNYPRFFTYWTSDAYQ TGCYNLLCSGFVQTN+RIAIGAAISPTSSYNGGQFDISLLVWKDPKHGNW
Subjt:  SQMWVISGSFGDDLNTIEAGWQASFSVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNSRIAIGAAISPTSSYNGGQFDISLLVWKDPKHGNW

Query:  WLEFGSGVLVGYWPAFLFTHLQNHATMIQFGGEVVNSRASGFHTATQMGSGHFAGEGFGKASYFRNLQVVDWDNSLIPLSNLKVLADHPNCYDIQGGINR
        WLEFGSGVLVGYWPAFLFTHL++HA+MIQFGGEVVNSRASGFHTATQMGSGHFAGEGFGKASYFRNLQVVDWDNSLIPLSNLKVLADHPNCY+IQGGINR
Subjt:  WLEFGSGVLVGYWPAFLFTHLQNHATMIQFGGEVVNSRASGFHTATQMGSGHFAGEGFGKASYFRNLQVVDWDNSLIPLSNLKVLADHPNCYDIQGGINR

Query:  VWGNYFYYGGPGRNVRCP
        VWGNYFYYGGPGRNVRCP
Subjt:  VWGNYFYYGGPGRNVRCP

A0A6J1B0N7 uncharacterized protein LOC1104230593.6e-20780.32Show/hide
Query:  AKQTRQPNFPLM---IMDSSNLVPVLVFSLLVVSSFCPVH-SDDKNILPNNHTATLRPEDELNKLKLIRAHLKKINKPPVKTIQSPDGDLIDCVISHHQP
        A++T + NF  M       S ++P+ VF LLV SS CPV+ S+  N+  +N   T RPE+EL KLK+IR  LKKINKP  KTIQSPDGD+IDCV+ HHQP
Subjt:  AKQTRQPNFPLM---IMDSSNLVPVLVFSLLVVSSFCPVH-SDDKNILPNNHTATLRPEDELNKLKLIRAHLKKINKPPVKTIQSPDGDLIDCVISHHQP

Query:  AFDHPLLKGQKPLDLPERPYDRSPSGGGASETFQLWSMSGESCPEGTVPIRRTREKDMLRASSVQRFGRKVRRRRIRRDSTSNGHEHAVGFVSGEQYYGA
        AFDHP LKGQKPLD PERP   +P+ G A+E FQLWSMSGESCPEGT+PIRRT E+DMLRASSV+RFGRK  RRR+RRDSTSNGHEHAVG+VSG+QYYGA
Subjt:  AFDHPLLKGQKPLDLPERPYDRSPSGGGASETFQLWSMSGESCPEGTVPIRRTREKDMLRASSVQRFGRKVRRRRIRRDSTSNGHEHAVGFVSGEQYYGA

Query:  KASINVWAPRVTNQYEFSLSQMWVISGSFGDDLNTIEAGWQASFSVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNSRIAIGAAISPTSSYN
        KASINVWAPRV+NQYEFSLSQMWVISGSFGDDLNTIEAGWQ    VSPELYGDNYPRFFTYWT+DAYQATGCYNLLCSGFVQTN+RIAIGAAISPTSSYN
Subjt:  KASINVWAPRVTNQYEFSLSQMWVISGSFGDDLNTIEAGWQASFSVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNSRIAIGAAISPTSSYN

Query:  GGQFDISLLVWKDPKHGNWWLEFGSGVLVGYWPAFLFTHLQNHATMIQFGGEVVNSRASGFHTATQMGSGHFAGEGFGKASYFRNLQVVDWDNSLIPLSN
        GGQFDISLLVWKDPKHGNWWLEFGSG+LVGYWP+FLFTHL++HA+M+QFGGE+VNSR+ GFHT+T+MGSGHFAGEGFGKASYFRNLQVVDWDN+LIPL+N
Subjt:  GGQFDISLLVWKDPKHGNWWLEFGSGVLVGYWPAFLFTHLQNHATMIQFGGEVVNSRASGFHTATQMGSGHFAGEGFGKASYFRNLQVVDWDNSLIPLSN

Query:  LKVLADHPNCYDIQGGINRVWGNYFYYGGPGRNVRCP
        L+VLADHPNCYDIQGGINRVWGNYFYYGGPGRNVRCP
Subjt:  LKVLADHPNCYDIQGGINRVWGNYFYYGGPGRNVRCP

A0A6J1DC30 uncharacterized protein LOC1110196615.2e-25998.63Show/hide
Query:  RPNYFAKQTRQPNFPLMIMDSSNLVPVLVFSLLVVSSFCPVHSDDKNILPNNHTATLRPEDELNKLKLIRAHLKKINKPPVKTIQSPDGDLIDCVISHHQ
        RPNYFAKQTRQPNFPLMIMDSSNLVPVLVFSLLVVSSFCPVHSDDKNILPNNHTATL+PEDELNKLKLIRAHLKKINKPPVKTIQSPDGDLIDCVISHHQ
Subjt:  RPNYFAKQTRQPNFPLMIMDSSNLVPVLVFSLLVVSSFCPVHSDDKNILPNNHTATLRPEDELNKLKLIRAHLKKINKPPVKTIQSPDGDLIDCVISHHQ

Query:  PAFDHPLLKGQKPLDLPERPYDRSPSGGGASETFQLWSMSGESCPEGTVPIRRTREKDMLRASSVQRFGRKVRRRRIRRDSTSNGHEHAVGFVSGEQYYG
        PAFDHPLLKGQKPLDLPERPYDRSPSGGGASETFQLWSMSGESCPEGTVPIRRTREKDMLRASSVQRFGRKVRRRRIRRDSTSNGHEHAVGFVSGEQYYG
Subjt:  PAFDHPLLKGQKPLDLPERPYDRSPSGGGASETFQLWSMSGESCPEGTVPIRRTREKDMLRASSVQRFGRKVRRRRIRRDSTSNGHEHAVGFVSGEQYYG

Query:  AKASINVWAPRVTNQYEFSLSQMWVISGSFGDDLNTIEAGWQASFSVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNSRIAIGAAISPTSSY
        AKASINVWAPRVTNQYEFSLSQMWVISGSFGDDLNTIEAGWQ    VSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNSRIAIGAAISPTSSY
Subjt:  AKASINVWAPRVTNQYEFSLSQMWVISGSFGDDLNTIEAGWQASFSVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNSRIAIGAAISPTSSY

Query:  NGGQFDISLLVWKDPKHGNWWLEFGSGVLVGYWPAFLFTHLQNHATMIQFGGEVVNSRASGFHTATQMGSGHFAGEGFGKASYFRNLQVVDWDNSLIPLS
        NGGQFDISLLVWKDPKHGNWWLEFGSGVLVGYWPAFLFTHLQ+HATMIQFGGEVVNSRASGFHTATQMGSGHFAGEGFGKASYFRNLQVVDWDNSLIPLS
Subjt:  NGGQFDISLLVWKDPKHGNWWLEFGSGVLVGYWPAFLFTHLQNHATMIQFGGEVVNSRASGFHTATQMGSGHFAGEGFGKASYFRNLQVVDWDNSLIPLS

Query:  NLKVLADHPNCYDIQGGINRVWGNYFYYGGPGRNVRCP
        NLKVLADHPNCYDIQGGINRVWGNYFYYGGPGRNVRCP
Subjt:  NLKVLADHPNCYDIQGGINRVWGNYFYYGGPGRNVRCP

A0A6J1GXT6 uncharacterized protein LOC1114581794.8e-22088.29Show/hide
Query:  MDSSN----LVPVLVFSLLVVSSFCPV---HSDDKNILPNNHTATLRPEDELNKLKLIRAHLKKINKPPVKTIQSPDGDLIDCVISHHQPAFDHPLLKGQ
        MDSSN    L+P+LVFSLLVVS   PV   H DDK I PNN T T RP+D+L KLKLIRAHLKKINKP VKTIQSPDGDLIDCVISH QPAFDHPLLKGQ
Subjt:  MDSSN----LVPVLVFSLLVVSSFCPV---HSDDKNILPNNHTATLRPEDELNKLKLIRAHLKKINKPPVKTIQSPDGDLIDCVISHHQPAFDHPLLKGQ

Query:  KPLDLPERPYDRSPSGGGASETFQLWSMSGESCPEGTVPIRRTREKDMLRASSVQRFGRKVRRRRIRRDSTSNGHEHAVGFVSGEQYYGAKASINVWAPR
        KPLD+ ERPYD+S SG   SE+FQLWSMSGESCPEGTVPIRRT EKDMLRASS++RFGRK  RRRIRRDS++ GHEHAVGFVSG++YYGAKASINVWAPR
Subjt:  KPLDLPERPYDRSPSGGGASETFQLWSMSGESCPEGTVPIRRTREKDMLRASSVQRFGRKVRRRRIRRDSTSNGHEHAVGFVSGEQYYGAKASINVWAPR

Query:  VTNQYEFSLSQMWVISGSFGDDLNTIEAGWQASFSVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNSRIAIGAAISPTSSYNGGQFDISLLV
        VTNQYEFSLSQMWVISGSFGDDLNTIEAGWQ    VSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNSRIAIGAAISPTSSYNGGQFDISLLV
Subjt:  VTNQYEFSLSQMWVISGSFGDDLNTIEAGWQASFSVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNSRIAIGAAISPTSSYNGGQFDISLLV

Query:  WKDPKHGNWWLEFGSGVLVGYWPAFLFTHLQNHATMIQFGGEVVNSRASGFHTATQMGSGHFAGEGFGKASYFRNLQVVDWDNSLIPLSNLKVLADHPNC
        WKDPKHGNWWLEFGSGVLVGYWPAFLFTHL++HATMIQFGGEVVNS+ SGFHTATQMGSGHFAGEGFGKASYFRNLQVVDWDNSLIPLSNLKVLADHPNC
Subjt:  WKDPKHGNWWLEFGSGVLVGYWPAFLFTHLQNHATMIQFGGEVVNSRASGFHTATQMGSGHFAGEGFGKASYFRNLQVVDWDNSLIPLSNLKVLADHPNC

Query:  YDIQGGINRVWGNYFYYGGPGRNVRCP
        Y+IQGGINRVWGNYFYYGGPGRNVRCP
Subjt:  YDIQGGINRVWGNYFYYGGPGRNVRCP

A0A6J1IRB1 uncharacterized protein LOC1114777502.2e-21787.35Show/hide
Query:  MDSSN----LVPVLVFSLLVVSSFCPV---HSDDKNILPNNHTATLRPEDELNKLKLIRAHLKKINKPPVKTIQSPDGDLIDCVISHHQPAFDHPLLKGQ
        MDSSN    L+P+LVF LL +S   PV   H DDK I PNN T T  P+D+L KLKLIRAHLKKINKP VKTIQSPDGDLIDCVISH QPAFDHPLLKGQ
Subjt:  MDSSN----LVPVLVFSLLVVSSFCPV---HSDDKNILPNNHTATLRPEDELNKLKLIRAHLKKINKPPVKTIQSPDGDLIDCVISHHQPAFDHPLLKGQ

Query:  KPLDLPERPYDRSPSGGGASETFQLWSMSGESCPEGTVPIRRTREKDMLRASSVQRFGRKVRRRRIRRDSTSNGHEHAVGFVSGEQYYGAKASINVWAPR
        KPLD+ ERPYD+S S    SE+FQLWSMSGESCPEGTVPIRRT EKDMLRASSV+RFGRK  RRRIRRDS++ GHEHAVGFVSG++YYGAKASINVWAPR
Subjt:  KPLDLPERPYDRSPSGGGASETFQLWSMSGESCPEGTVPIRRTREKDMLRASSVQRFGRKVRRRRIRRDSTSNGHEHAVGFVSGEQYYGAKASINVWAPR

Query:  VTNQYEFSLSQMWVISGSFGDDLNTIEAGWQASFSVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNSRIAIGAAISPTSSYNGGQFDISLLV
        VTNQYEFSLSQMWVISGSFGDDLNTIEAGWQ    VSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNSRIAIGAAISPTSSYNGGQFDISLLV
Subjt:  VTNQYEFSLSQMWVISGSFGDDLNTIEAGWQASFSVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNSRIAIGAAISPTSSYNGGQFDISLLV

Query:  WKDPKHGNWWLEFGSGVLVGYWPAFLFTHLQNHATMIQFGGEVVNSRASGFHTATQMGSGHFAGEGFGKASYFRNLQVVDWDNSLIPLSNLKVLADHPNC
        WKDPKHGNWWLEFGSGVLVGYWPAFLFTHL++HATMIQFGGEVVNS+ SGFHTATQMGSGHFAGEGFGKASYFRNLQVVDWDNSLIPLSNLKVLADHPNC
Subjt:  WKDPKHGNWWLEFGSGVLVGYWPAFLFTHLQNHATMIQFGGEVVNSRASGFHTATQMGSGHFAGEGFGKASYFRNLQVVDWDNSLIPLSNLKVLADHPNC

Query:  YDIQGGINRVWGNYFYYGGPGRNVRCP
        Y+IQGGINRVWGNYFYYGGPGRNVRCP
Subjt:  YDIQGGINRVWGNYFYYGGPGRNVRCP

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G10750.1 Protein of Unknown Function (DUF239)1.2e-18673.68Show/hide
Query:  SSNLVPVLVFSLLVVSSFCPVHSDDKNILPNNHTATLRPEDELNKLKLIRAHLKKINKPPVKTIQSPDGDLIDCVISHHQPAFDHPLLKGQKPLDLPERP
        SSN+  + +  LL+ SSF  V S+  N+ P N   TLRP DELNKLK I  HL+KINKP +KTI SPDGD+IDCV+ HHQPAFDHP L+GQKPLD PERP
Subjt:  SSNLVPVLVFSLLVVSSFCPVHSDDKNILPNNHTATLRPEDELNKLKLIRAHLKKINKPPVKTIQSPDGDLIDCVISHHQPAFDHPLLKGQKPLDLPERP

Query:  YDRSPSGGGASETFQLWSMSGESCPEGTVPIRRTREKDMLRASSVQRFGRKVRRRRIRRDSTSNGHEHAVGFVSGEQYYGAKASINVWAPRVTNQYEFSL
           +   G   ++FQLW M GE+CPEGTVPIRRT+E+D+LRA+SV  FG+K+  R  RRD++SNGHEHAVG+VSGE+YYGAKASINVWAP+V NQYEFSL
Subjt:  YDRSPSGGGASETFQLWSMSGESCPEGTVPIRRTREKDMLRASSVQRFGRKVRRRRIRRDSTSNGHEHAVGFVSGEQYYGAKASINVWAPRVTNQYEFSL

Query:  SQMWVISGSFGDDLNTIEAGWQASFSVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNSRIAIGAAISPTSSYNGGQFDISLLVWKDPKHGNW
        SQ+W+ISGSFG+DLNTIEAGWQ    VSPELYGDNYPRFFTYWT+DAYQATGCYNLLCSGFVQTNS IAIGAAISP+SSY GGQFDI+LL+WKDPKHGNW
Subjt:  SQMWVISGSFGDDLNTIEAGWQASFSVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNSRIAIGAAISPTSSYNGGQFDISLLVWKDPKHGNW

Query:  WLEFGSGVLVGYWPAFLFTHLQNHATMIQFGGEVVNSRASGFHTATQMGSGHFAGEGFGKASYFRNLQVVDWDNSLIPLSNLKVLADHPNCYDIQGGINR
        WLEFGSG+LVGYWP+FLFTHL+ HA+M+Q+GGE+VNS   G HT+TQMGSGHFA EGF K+SYFRN+QVVDWDN+L+P  NL+VLADHPNCYDIQGG NR
Subjt:  WLEFGSGVLVGYWPAFLFTHLQNHATMIQFGGEVVNSRASGFHTATQMGSGHFAGEGFGKASYFRNLQVVDWDNSLIPLSNLKVLADHPNCYDIQGGINR

Query:  VWGNYFYYGGPGRNVRCP
         WG+YFYYGGPG+N +CP
Subjt:  VWGNYFYYGGPGRNVRCP

AT1G23340.1 Protein of Unknown Function (DUF239)1.1e-18472.75Show/hide
Query:  LVFSLLVVSSFCPVHSDDKNILPNNHTATLRPEDELNKLKLIRAHLKKINKPPVKTIQSPDGDLIDCVISHHQPAFDHPLLKGQKPLDLPERPYDRSPSG
        L F+ +++ S    ++   N    + T  LRP+ E+ K+KLIR  L+KINKP +KTI S DGD IDCV SHHQPAFDHPLL+GQ+P+D PE P   S   
Subjt:  LVFSLLVVSSFCPVHSDDKNILPNNHTATLRPEDELNKLKLIRAHLKKINKPPVKTIQSPDGDLIDCVISHHQPAFDHPLLKGQKPLDLPERPYDRSPSG

Query:  GGASETFQLWSMSGESCPEGTVPIRRTREKDMLRASSVQRFGRKVRRRRIRRDSTSNGHEHAVGFVSGEQYYGAKASINVWAPRVTNQYEFSLSQMWVIS
          + E FQLWS+ GESCPEGT+PIRRT E+DMLRA+SV+RFGRK+  RR+RRDS+SNGHEHAVG+VSG QYYGAKASINVW PRV +QYEFSLSQ+W+I+
Subjt:  GGASETFQLWSMSGESCPEGTVPIRRTREKDMLRASSVQRFGRKVRRRRIRRDSTSNGHEHAVGFVSGEQYYGAKASINVWAPRVTNQYEFSLSQMWVIS

Query:  GSFGDDLNTIEAGWQASFSVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNSRIAIGAAISPTSSYNGGQFDISLLVWKDPKHGNWWLEFGSG
        GSF  DLNTIEAGWQ    +SPELYGD  PRFFTYWTSDAYQATGCYNLLCSGFVQTN+RIAIGAAISP SSY GGQFDISLL+WKDPKHG+WWL+FGSG
Subjt:  GSFGDDLNTIEAGWQASFSVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNSRIAIGAAISPTSSYNGGQFDISLLVWKDPKHGNWWLEFGSG

Query:  VLVGYWPAFLFTHLQNHATMIQFGGEVVNSRASGFHTATQMGSGHFAGEGFGKASYFRNLQVVDWDNSLIPLSNLKVLADHPNCYDIQGGINRVWGNYFY
         LVGYWP  LFTHL+ H  M+QFGGE+VN+R  G HT+TQMGSGHFAGEGFGKASYFRNLQ+VDWDN+LIP+SNLKVLADHPNCYDI+GG+NRVWGN+FY
Subjt:  VLVGYWPAFLFTHLQNHATMIQFGGEVVNSRASGFHTATQMGSGHFAGEGFGKASYFRNLQVVDWDNSLIPLSNLKVLADHPNCYDIQGGINRVWGNYFY

Query:  YGGPGRNVRCP
        YGGPG+N +CP
Subjt:  YGGPGRNVRCP

AT1G23340.2 Protein of Unknown Function (DUF239)1.1e-18472.75Show/hide
Query:  LVFSLLVVSSFCPVHSDDKNILPNNHTATLRPEDELNKLKLIRAHLKKINKPPVKTIQSPDGDLIDCVISHHQPAFDHPLLKGQKPLDLPERPYDRSPSG
        L F+ +++ S    ++   N    + T  LRP+ E+ K+KLIR  L+KINKP +KTI S DGD IDCV SHHQPAFDHPLL+GQ+P+D PE P   S   
Subjt:  LVFSLLVVSSFCPVHSDDKNILPNNHTATLRPEDELNKLKLIRAHLKKINKPPVKTIQSPDGDLIDCVISHHQPAFDHPLLKGQKPLDLPERPYDRSPSG

Query:  GGASETFQLWSMSGESCPEGTVPIRRTREKDMLRASSVQRFGRKVRRRRIRRDSTSNGHEHAVGFVSGEQYYGAKASINVWAPRVTNQYEFSLSQMWVIS
          + E FQLWS+ GESCPEGT+PIRRT E+DMLRA+SV+RFGRK+  RR+RRDS+SNGHEHAVG+VSG QYYGAKASINVW PRV +QYEFSLSQ+W+I+
Subjt:  GGASETFQLWSMSGESCPEGTVPIRRTREKDMLRASSVQRFGRKVRRRRIRRDSTSNGHEHAVGFVSGEQYYGAKASINVWAPRVTNQYEFSLSQMWVIS

Query:  GSFGDDLNTIEAGWQASFSVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNSRIAIGAAISPTSSYNGGQFDISLLVWKDPKHGNWWLEFGSG
        GSF  DLNTIEAGWQ    +SPELYGD  PRFFTYWTSDAYQATGCYNLLCSGFVQTN+RIAIGAAISP SSY GGQFDISLL+WKDPKHG+WWL+FGSG
Subjt:  GSFGDDLNTIEAGWQASFSVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNSRIAIGAAISPTSSYNGGQFDISLLVWKDPKHGNWWLEFGSG

Query:  VLVGYWPAFLFTHLQNHATMIQFGGEVVNSRASGFHTATQMGSGHFAGEGFGKASYFRNLQVVDWDNSLIPLSNLKVLADHPNCYDIQGGINRVWGNYFY
         LVGYWP  LFTHL+ H  M+QFGGE+VN+R  G HT+TQMGSGHFAGEGFGKASYFRNLQ+VDWDN+LIP+SNLKVLADHPNCYDI+GG+NRVWGN+FY
Subjt:  VLVGYWPAFLFTHLQNHATMIQFGGEVVNSRASGFHTATQMGSGHFAGEGFGKASYFRNLQVVDWDNSLIPLSNLKVLADHPNCYDIQGGINRVWGNYFY

Query:  YGGPGRNVRCP
        YGGPG+N +CP
Subjt:  YGGPGRNVRCP

AT1G70550.1 Protein of Unknown Function (DUF239)7.4e-18972.58Show/hide
Query:  FAKQTRQPNFPLMIMDSSNLVPVLVFSLLVVSSFCPVHSDDKNILPNNHTATLRPEDELNKLKLIRAHLKKINKPPVKTIQSPDGDLIDCVISHHQPAFD
        F   ++Q +F    M SS+ + +++   LV SSF    S   +   +    TLRP++EL KL LIR  L KINKP VKTIQS DGD IDCV +H QPAFD
Subjt:  FAKQTRQPNFPLMIMDSSNLVPVLVFSLLVVSSFCPVHSDDKNILPNNHTATLRPEDELNKLKLIRAHLKKINKPPVKTIQSPDGDLIDCVISHHQPAFD

Query:  HPLLKGQKPLDLPERPYDRSPSGGGASETFQLWSMSGESCPEGTVPIRRTREKDMLRASSVQRFGRKVRRRRIRRDSTSNGHEHAVGFVSGEQYYGAKAS
        HPLL+GQKPLD PE P   S    G+ E  QLWS+SGESCPEGT+PIRRT E+DMLRASSVQRFGRK+  RR++RDST+NGHEHAVG+V+G QYYGAKAS
Subjt:  HPLLKGQKPLDLPERPYDRSPSGGGASETFQLWSMSGESCPEGTVPIRRTREKDMLRASSVQRFGRKVRRRRIRRDSTSNGHEHAVGFVSGEQYYGAKAS

Query:  INVWAPRVTNQYEFSLSQMWVISGSFGDDLNTIEAGWQASFSVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNSRIAIGAAISPTSSYNGGQ
        INVW+PRVT+QYEFSLSQ+WVI+GSF  DLNTIEAGWQ    +SPELYGD YPRFFTYWTSDAY+ TGCYNLLCSGFVQTN RIAIGAAISP SSY GGQ
Subjt:  INVWAPRVTNQYEFSLSQMWVISGSFGDDLNTIEAGWQASFSVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNSRIAIGAAISPTSSYNGGQ

Query:  FDISLLVWKDPKHGNWWLEFGSGVLVGYWPAFLFTHLQNHATMIQFGGEVVNSRASGFHTATQMGSGHFAGEGFGKASYFRNLQVVDWDNSLIPLSNLKV
        FDISLL+WKDPKHG+WWL+FGSG LVGYWPAFLFTHL+ H +M+QFGGE+VN+R  G HT TQMGSGHFAGEGFGKASYFRNLQ+VDWDN+LIP SNLK+
Subjt:  FDISLLVWKDPKHGNWWLEFGSGVLVGYWPAFLFTHLQNHATMIQFGGEVVNSRASGFHTATQMGSGHFAGEGFGKASYFRNLQVVDWDNSLIPLSNLKV

Query:  LADHPNCYDIQGGINRVWGNYFYYGGPGRNVRCP
        LADHPNCYDI+GG NRVWGNYFYYGGPG+N RCP
Subjt:  LADHPNCYDIQGGINRVWGNYFYYGGPGRNVRCP

AT1G70550.2 Protein of Unknown Function (DUF239)3.7e-18874.29Show/hide
Query:  MDSSNLVPVLVFSLLVVSSFCPVHSDDKNILPNNHTATLRPEDELNKLKLIRAHLKKINKPPVKTIQSPDGDLIDCVISHHQPAFDHPLLKGQKPLDLPE
        M SS+ + +++   LV SSF    S   +   +    TLRP++EL KL LIR  L KINKP VKTIQS DGD IDCV +H QPAFDHPLL+GQKPLD PE
Subjt:  MDSSNLVPVLVFSLLVVSSFCPVHSDDKNILPNNHTATLRPEDELNKLKLIRAHLKKINKPPVKTIQSPDGDLIDCVISHHQPAFDHPLLKGQKPLDLPE

Query:  RPYDRSPSGGGASETFQLWSMSGESCPEGTVPIRRTREKDMLRASSVQRFGRKVRRRRIRRDSTSNGHEHAVGFVSGEQYYGAKASINVWAPRVTNQYEF
         P   S    G+ E  QLWS+SGESCPEGT+PIRRT E+DMLRASSVQRFGRK+  RR++RDST+NGHEHAVG+V+G QYYGAKASINVW+PRVT+QYEF
Subjt:  RPYDRSPSGGGASETFQLWSMSGESCPEGTVPIRRTREKDMLRASSVQRFGRKVRRRRIRRDSTSNGHEHAVGFVSGEQYYGAKASINVWAPRVTNQYEF

Query:  SLSQMWVISGSFGDDLNTIEAGWQASFSVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNSRIAIGAAISPTSSYNGGQFDISLLVWKDPKHG
        SLSQ+WVI+GSF  DLNTIEAGWQ    +SPELYGD YPRFFTYWTSDAY+ TGCYNLLCSGFVQTN RIAIGAAISP SSY GGQFDISLL+WKDPKHG
Subjt:  SLSQMWVISGSFGDDLNTIEAGWQASFSVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNSRIAIGAAISPTSSYNGGQFDISLLVWKDPKHG

Query:  NWWLEFGSGVLVGYWPAFLFTHLQNHATMIQFGGEVVNSRASGFHTATQMGSGHFAGEGFGKASYFRNLQVVDWDNSLIPLSNLKVLADHPNCYDIQGGI
        +WWL+FGSG LVGYWPAFLFTHL+ H +M+QFGGE+VN+R  G HT TQMGSGHFAGEGFGKASYFRNLQ+VDWDN+LIP SNLK+LADHPNCYDI+GG 
Subjt:  NWWLEFGSGVLVGYWPAFLFTHLQNHATMIQFGGEVVNSRASGFHTATQMGSGHFAGEGFGKASYFRNLQVVDWDNSLIPLSNLKVLADHPNCYDIQGGI

Query:  NRVWGNYFYYGGPGRNVRCP
        NRVWGNYFYYGGPG+N RCP
Subjt:  NRVWGNYFYYGGPGRNVRCP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
CGACCAAATTACTTTGCCAAACAGACAAGACAACCAAATTTCCCACTCATGATCATGGATTCTTCTAACCTCGTCCCTGTTCTTGTTTTTTCCCTTCTTGTCGTTTCGTC
GTTTTGTCCCGTTCATTCAGACGACAAAAATATCCTTCCCAATAACCACACGGCCACGCTCCGGCCGGAGGATGAATTAAACAAGTTGAAGCTCATTAGAGCCCATCTCA
AGAAAATCAACAAGCCCCCCGTCAAAACCATTCAGAGTCCGGACGGGGATCTTATAGATTGTGTAATATCTCATCACCAGCCGGCTTTTGATCATCCATTGCTGAAAGGA
CAGAAGCCATTGGATCTGCCGGAGAGACCTTATGACCGGAGCCCTTCCGGTGGAGGGGCATCGGAAACGTTTCAATTATGGAGCATGTCCGGCGAGTCTTGCCCGGAAGG
AACTGTTCCGATCAGAAGAACGAGGGAAAAGGATATGCTGAGAGCCAGCTCCGTTCAAAGATTTGGAAGAAAAGTTAGAAGAAGACGCATTAGAAGAGACTCCACCAGCA
ACGGCCACGAGCATGCAGTGGGATTTGTAAGCGGAGAACAGTACTACGGAGCAAAGGCGAGCATAAACGTATGGGCGCCGCGGGTGACGAATCAATACGAGTTCAGTCTG
TCACAGATGTGGGTCATTTCTGGTTCATTTGGGGATGATCTAAACACCATTGAAGCTGGCTGGCAGGCAAGTTTCTCAGTTAGCCCAGAGCTATATGGAGACAATTACCC
TAGATTTTTCACTTATTGGACATCAGACGCATATCAAGCGACTGGATGCTACAATTTACTCTGCTCGGGCTTCGTTCAGACCAACAGCAGAATCGCCATTGGAGCTGCAA
TTTCTCCAACTTCTTCCTACAATGGCGGCCAATTCGACATCAGTCTACTGGTTTGGAAGGATCCAAAGCATGGGAATTGGTGGTTGGAATTCGGGTCGGGCGTGCTGGTC
GGGTACTGGCCCGCATTCTTGTTCACTCACCTTCAAAACCACGCGACGATGATTCAGTTCGGTGGGGAGGTGGTGAATTCGAGGGCCTCTGGATTTCACACGGCGACGCA
AATGGGCAGCGGCCATTTCGCCGGCGAGGGCTTTGGAAAAGCTTCGTATTTTCGTAACCTTCAAGTAGTGGATTGGGACAACAGCTTAATTCCACTGTCAAACTTGAAGG
TTTTGGCAGATCATCCAAATTGCTACGACATTCAAGGAGGGATTAATAGAGTTTGGGGAAATTACTTTTACTATGGTGGGCCTGGAAGAAATGTGAGATGCCCC
mRNA sequenceShow/hide mRNA sequence
CGACCAAATTACTTTGCCAAACAGACAAGACAACCAAATTTCCCACTCATGATCATGGATTCTTCTAACCTCGTCCCTGTTCTTGTTTTTTCCCTTCTTGTCGTTTCGTC
GTTTTGTCCCGTTCATTCAGACGACAAAAATATCCTTCCCAATAACCACACGGCCACGCTCCGGCCGGAGGATGAATTAAACAAGTTGAAGCTCATTAGAGCCCATCTCA
AGAAAATCAACAAGCCCCCCGTCAAAACCATTCAGAGTCCGGACGGGGATCTTATAGATTGTGTAATATCTCATCACCAGCCGGCTTTTGATCATCCATTGCTGAAAGGA
CAGAAGCCATTGGATCTGCCGGAGAGACCTTATGACCGGAGCCCTTCCGGTGGAGGGGCATCGGAAACGTTTCAATTATGGAGCATGTCCGGCGAGTCTTGCCCGGAAGG
AACTGTTCCGATCAGAAGAACGAGGGAAAAGGATATGCTGAGAGCCAGCTCCGTTCAAAGATTTGGAAGAAAAGTTAGAAGAAGACGCATTAGAAGAGACTCCACCAGCA
ACGGCCACGAGCATGCAGTGGGATTTGTAAGCGGAGAACAGTACTACGGAGCAAAGGCGAGCATAAACGTATGGGCGCCGCGGGTGACGAATCAATACGAGTTCAGTCTG
TCACAGATGTGGGTCATTTCTGGTTCATTTGGGGATGATCTAAACACCATTGAAGCTGGCTGGCAGGCAAGTTTCTCAGTTAGCCCAGAGCTATATGGAGACAATTACCC
TAGATTTTTCACTTATTGGACATCAGACGCATATCAAGCGACTGGATGCTACAATTTACTCTGCTCGGGCTTCGTTCAGACCAACAGCAGAATCGCCATTGGAGCTGCAA
TTTCTCCAACTTCTTCCTACAATGGCGGCCAATTCGACATCAGTCTACTGGTTTGGAAGGATCCAAAGCATGGGAATTGGTGGTTGGAATTCGGGTCGGGCGTGCTGGTC
GGGTACTGGCCCGCATTCTTGTTCACTCACCTTCAAAACCACGCGACGATGATTCAGTTCGGTGGGGAGGTGGTGAATTCGAGGGCCTCTGGATTTCACACGGCGACGCA
AATGGGCAGCGGCCATTTCGCCGGCGAGGGCTTTGGAAAAGCTTCGTATTTTCGTAACCTTCAAGTAGTGGATTGGGACAACAGCTTAATTCCACTGTCAAACTTGAAGG
TTTTGGCAGATCATCCAAATTGCTACGACATTCAAGGAGGGATTAATAGAGTTTGGGGAAATTACTTTTACTATGGTGGGCCTGGAAGAAATGTGAGATGCCCC
Protein sequenceShow/hide protein sequence
RPNYFAKQTRQPNFPLMIMDSSNLVPVLVFSLLVVSSFCPVHSDDKNILPNNHTATLRPEDELNKLKLIRAHLKKINKPPVKTIQSPDGDLIDCVISHHQPAFDHPLLKG
QKPLDLPERPYDRSPSGGGASETFQLWSMSGESCPEGTVPIRRTREKDMLRASSVQRFGRKVRRRRIRRDSTSNGHEHAVGFVSGEQYYGAKASINVWAPRVTNQYEFSL
SQMWVISGSFGDDLNTIEAGWQASFSVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNSRIAIGAAISPTSSYNGGQFDISLLVWKDPKHGNWWLEFGSGVLV
GYWPAFLFTHLQNHATMIQFGGEVVNSRASGFHTATQMGSGHFAGEGFGKASYFRNLQVVDWDNSLIPLSNLKVLADHPNCYDIQGGINRVWGNYFYYGGPGRNVRCP