CuGenDBv2

IVF0019046 (gene) of Melon (IVF77) v1 genome

Gene ID: IVF0019046
Organism: Cucumis melo ssp. agrestis cv. IVF77 (Melon (IVF77) v1)
Description: Protein of Unknown Function (DUF239)
Genome location: chr10:3258534..3262534
RNA-Seq Expression: IVF0019046
Synteny: IVF0019046
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
InterPro domains: IPR004314 - Neprosin; IPR025521 - Neprosin activation peptide
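The genome location field above follows a `chromosome:start..end` pattern. The sketch below shows one way to split such a string into its parts. The names `Locus` and `parse_locus` are hypothetical, and the length calculation assumes 1-based, inclusive coordinates, which this page does not state.

```python
import re
from typing import NamedTuple


class Locus(NamedTuple):
    """A genomic interval parsed from a 'chrom:start..end' string."""
    chrom: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive on both ends.
        return self.end - self.start + 1


_LOCUS_RE = re.compile(r"^(?P<chrom>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)$")


def parse_locus(text: str) -> Locus:
    """Parse a location such as 'chr10:3258534..3262534'."""
    match = _LOCUS_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location: {text!r}")
    start, end = int(match["start"]), int(match["end"])
    if start > end:
        raise ValueError(f"start is greater than end in {text!r}")
    return Locus(match["chrom"], start, end)


locus = parse_locus("chr10:3258534..3262534")
print(locus.chrom, locus.start, locus.end, locus.length)
```

Under the inclusive-coordinate assumption, the gene spans 4001 bp on chr10.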


Homology
GenBank top hits (e-value, % identity, alignment)
XP_004135262.2 uncharacterized protein LOC101219221 [Cucumis sativus], e-value 6.12e-312, identity 97.62%
Query:  MDSSSSS--SSSSSILLPVLVFSLLVVSSFCPVHSHPINKTTTFRPQDHLKKLKLIRAHLKKINKPPIKTIQSPDGDLIDCVITHQQPAFDHPLLKGQKP
        MDSSSSS  SSSSS+LLP+LVFSLLVVSSFCPVHSHPINKTT FRPQDHLKKLKL+RAHLK+INKPPIKTIQSPDGDLIDCVITHQQPAFDHPLLKGQKP
Subjt:  MDSSSSS--SSSSSILLPVLVFSLLVVSSFCPVHSHPINKTTTFRPQDHLKKLKLIRAHLKKINKPPIKTIQSPDGDLIDCVITHQQPAFDHPLLKGQKP

Query:  LDLPERPYERSSSGEESSETFQLWSMSGEFCPEGSVPIRRTTENDMMRASSVQRFGRKVRRRIRRDSSSSGHEHAVGFVSGEEYYGAKGSINVWAPRVTN
        LDLP+RPYERSSSGEESSETFQLWSMSGEFCPEGSVPIRRTTENDMMRASSVQRFGRKVRRRIRRDSSSSGHEHAVGFVSGEEYYGAKGSINVWAPRVTN
Subjt:  LDLPERPYERSSSGEESSETFQLWSMSGEFCPEGSVPIRRTTENDMMRASSVQRFGRKVRRRIRRDSSSSGHEHAVGFVSGEEYYGAKGSINVWAPRVTN

Query:  QYEFSLSQMWVISGSFGDDLNTIEAGWQVSPELYGDNYPRFFTYWTSDAYQTTGCYNLLCSGFVQTNNRIAIGAAISPTSSYNGGQFDISLLVWKDPKHG
        QYEFSLSQMWVISGSFGDDLNTIEAGWQVSPELYGDNYPRFFTYWTSDAYQTTGCYNLLCSGFVQTNNRIAIGAAISPTSSYNGGQFDISLLVWKDPKHG
Subjt:  QYEFSLSQMWVISGSFGDDLNTIEAGWQVSPELYGDNYPRFFTYWTSDAYQTTGCYNLLCSGFVQTNNRIAIGAAISPTSSYNGGQFDISLLVWKDPKHG

Query:  NWWLEFGSGVLVGYWPAFLFTHLRSHASMIQFGGEVVNSRASGFHTATQMGSGHFAGEGFGKASYFRNLQVVDWDNSLIPLSNLKVLADHPNCYNIQGGI
        NWWLEFGSGVLVGYWPAFLFTHLRSHA+MIQFGGEVVNSRASGFHT TQMGSGHFAGEGFGKASYFRNLQVVDWDNSLIPLSNLKVLADHPNCYNIQGGI
Subjt:  NWWLEFGSGVLVGYWPAFLFTHLRSHASMIQFGGEVVNSRASGFHTATQMGSGHFAGEGFGKASYFRNLQVVDWDNSLIPLSNLKVLADHPNCYNIQGGI

Query:  NRVWGNYFYYGGPGRNVRCP
        NRVWGNYFYYGGPGRNVRCP
Subjt:  NRVWGNYFYYGGPGRNVRCP

XP_008446177.1 PREDICTED: uncharacterized protein LOC103488982 [Cucumis melo], e-value 0.0, identity 100%
Query:  MPKFYSYKLQQKQKHMDSSSSSSSSSSILLPVLVFSLLVVSSFCPVHSHPINKTTTFRPQDHLKKLKLIRAHLKKINKPPIKTIQSPDGDLIDCVITHQQ
        MPKFYSYKLQQKQKHMDSSSSSSSSSSILLPVLVFSLLVVSSFCPVHSHPINKTTTFRPQDHLKKLKLIRAHLKKINKPPIKTIQSPDGDLIDCVITHQQ
Subjt:  MPKFYSYKLQQKQKHMDSSSSSSSSSSILLPVLVFSLLVVSSFCPVHSHPINKTTTFRPQDHLKKLKLIRAHLKKINKPPIKTIQSPDGDLIDCVITHQQ

Query:  PAFDHPLLKGQKPLDLPERPYERSSSGEESSETFQLWSMSGEFCPEGSVPIRRTTENDMMRASSVQRFGRKVRRRIRRDSSSSGHEHAVGFVSGEEYYGA
        PAFDHPLLKGQKPLDLPERPYERSSSGEESSETFQLWSMSGEFCPEGSVPIRRTTENDMMRASSVQRFGRKVRRRIRRDSSSSGHEHAVGFVSGEEYYGA
Subjt:  PAFDHPLLKGQKPLDLPERPYERSSSGEESSETFQLWSMSGEFCPEGSVPIRRTTENDMMRASSVQRFGRKVRRRIRRDSSSSGHEHAVGFVSGEEYYGA

Query:  KGSINVWAPRVTNQYEFSLSQMWVISGSFGDDLNTIEAGWQVSPELYGDNYPRFFTYWTSDAYQTTGCYNLLCSGFVQTNNRIAIGAAISPTSSYNGGQF
        KGSINVWAPRVTNQYEFSLSQMWVISGSFGDDLNTIEAGWQVSPELYGDNYPRFFTYWTSDAYQTTGCYNLLCSGFVQTNNRIAIGAAISPTSSYNGGQF
Subjt:  KGSINVWAPRVTNQYEFSLSQMWVISGSFGDDLNTIEAGWQVSPELYGDNYPRFFTYWTSDAYQTTGCYNLLCSGFVQTNNRIAIGAAISPTSSYNGGQF

Query:  DISLLVWKDPKHGNWWLEFGSGVLVGYWPAFLFTHLRSHASMIQFGGEVVNSRASGFHTATQMGSGHFAGEGFGKASYFRNLQVVDWDNSLIPLSNLKVL
        DISLLVWKDPKHGNWWLEFGSGVLVGYWPAFLFTHLRSHASMIQFGGEVVNSRASGFHTATQMGSGHFAGEGFGKASYFRNLQVVDWDNSLIPLSNLKVL
Subjt:  DISLLVWKDPKHGNWWLEFGSGVLVGYWPAFLFTHLRSHASMIQFGGEVVNSRASGFHTATQMGSGHFAGEGFGKASYFRNLQVVDWDNSLIPLSNLKVL

Query:  ADHPNCYNIQGGINRVWGNYFYYGGPGRNVRCP
        ADHPNCYNIQGGINRVWGNYFYYGGPGRNVRCP
Subjt:  ADHPNCYNIQGGINRVWGNYFYYGGPGRNVRCP

XP_022151755.1 uncharacterized protein LOC111019661 [Momordica charantia], e-value 9.37e-283, identity 90.58%
Query:  SSILLPVLVFSLLVVSSFCPVHSH-----PINKTTTFRPQDHLKKLKLIRAHLKKINKPPIKTIQSPDGDLIDCVITHQQPAFDHPLLKGQKPLDLPERP
        SS L+PVLVFSLLVVSSFCPVHS      P N T T +P+D L KLKLIRAHLKKINKPP+KTIQSPDGDLIDCVI+H QPAFDHPLLKGQKPLDLPERP
Subjt:  SSILLPVLVFSLLVVSSFCPVHSH-----PINKTTTFRPQDHLKKLKLIRAHLKKINKPPIKTIQSPDGDLIDCVITHQQPAFDHPLLKGQKPLDLPERP

Query:  YERSSSGEESSETFQLWSMSGEFCPEGSVPIRRTTENDMMRASSVQRFGRKVRRR-IRRDSSSSGHEHAVGFVSGEEYYGAKGSINVWAPRVTNQYEFSL
        Y+RS SG  +SETFQLWSMSGE CPEG+VPIRRT E DM+RASSVQRFGRKVRRR IRRDS+S+GHEHAVGFVSGE+YYGAK SINVWAPRVTNQYEFSL
Subjt:  YERSSSGEESSETFQLWSMSGEFCPEGSVPIRRTTENDMMRASSVQRFGRKVRRR-IRRDSSSSGHEHAVGFVSGEEYYGAKGSINVWAPRVTNQYEFSL

Query:  SQMWVISGSFGDDLNTIEAGWQVSPELYGDNYPRFFTYWTSDAYQTTGCYNLLCSGFVQTNNRIAIGAAISPTSSYNGGQFDISLLVWKDPKHGNWWLEF
        SQMWVISGSFGDDLNTIEAGWQVSPELYGDNYPRFFTYWTSDAYQ TGCYNLLCSGFVQTN+RIAIGAAISPTSSYNGGQFDISLLVWKDPKHGNWWLEF
Subjt:  SQMWVISGSFGDDLNTIEAGWQVSPELYGDNYPRFFTYWTSDAYQTTGCYNLLCSGFVQTNNRIAIGAAISPTSSYNGGQFDISLLVWKDPKHGNWWLEF

Query:  GSGVLVGYWPAFLFTHLRSHASMIQFGGEVVNSRASGFHTATQMGSGHFAGEGFGKASYFRNLQVVDWDNSLIPLSNLKVLADHPNCYNIQGGINRVWGN
        GSGVLVGYWPAFLFTHL+SHA+MIQFGGEVVNSRASGFHTATQMGSGHFAGEGFGKASYFRNLQVVDWDNSLIPLSNLKVLADHPNCY+IQGGINRVWGN
Subjt:  GSGVLVGYWPAFLFTHLRSHASMIQFGGEVVNSRASGFHTATQMGSGHFAGEGFGKASYFRNLQVVDWDNSLIPLSNLKVLADHPNCYNIQGGINRVWGN

Query:  YFYYGGPGRNVRCP
        YFYYGGPGRNVRCP
Subjt:  YFYYGGPGRNVRCP

XP_022956450.1 uncharacterized protein LOC111458179 [Cucurbita moschata], e-value 3.45e-282, identity 89.83%
Query:  LLPVLVFSLLVVSSFCPVHS--------HPINKTTTFRPQDHLKKLKLIRAHLKKINKPPIKTIQSPDGDLIDCVITHQQPAFDHPLLKGQKPLDLPERP
        LLP+LVFSLLVVS   PV+S        HP N+TTTFRPQD LKKLKLIRAHLKKINKP +KTIQSPDGDLIDCVI+HQQPAFDHPLLKGQKPLD+ ERP
Subjt:  LLPVLVFSLLVVSSFCPVHS--------HPINKTTTFRPQDHLKKLKLIRAHLKKINKPPIKTIQSPDGDLIDCVITHQQPAFDHPLLKGQKPLDLPERP

Query:  YERSSSGEESSETFQLWSMSGEFCPEGSVPIRRTTENDMMRASSVQRFGRKVRRRIRRDSSSSGHEHAVGFVSGEEYYGAKGSINVWAPRVTNQYEFSLS
        Y++SSSGE  SE+FQLWSMSGE CPEG+VPIRRTTE DM+RASS++RFGRK RRRIRRDSS+ GHEHAVGFVSG+EYYGAK SINVWAPRVTNQYEFSLS
Subjt:  YERSSSGEESSETFQLWSMSGEFCPEGSVPIRRTTENDMMRASSVQRFGRKVRRRIRRDSSSSGHEHAVGFVSGEEYYGAKGSINVWAPRVTNQYEFSLS

Query:  QMWVISGSFGDDLNTIEAGWQVSPELYGDNYPRFFTYWTSDAYQTTGCYNLLCSGFVQTNNRIAIGAAISPTSSYNGGQFDISLLVWKDPKHGNWWLEFG
        QMWVISGSFGDDLNTIEAGWQVSPELYGDNYPRFFTYWTSDAYQ TGCYNLLCSGFVQTN+RIAIGAAISPTSSYNGGQFDISLLVWKDPKHGNWWLEFG
Subjt:  QMWVISGSFGDDLNTIEAGWQVSPELYGDNYPRFFTYWTSDAYQTTGCYNLLCSGFVQTNNRIAIGAAISPTSSYNGGQFDISLLVWKDPKHGNWWLEFG

Query:  SGVLVGYWPAFLFTHLRSHASMIQFGGEVVNSRASGFHTATQMGSGHFAGEGFGKASYFRNLQVVDWDNSLIPLSNLKVLADHPNCYNIQGGINRVWGNY
        SGVLVGYWPAFLFTHLRSHA+MIQFGGEVVNS+ SGFHTATQMGSGHFAGEGFGKASYFRNLQVVDWDNSLIPLSNLKVLADHPNCYNIQGGINRVWGNY
Subjt:  SGVLVGYWPAFLFTHLRSHASMIQFGGEVVNSRASGFHTATQMGSGHFAGEGFGKASYFRNLQVVDWDNSLIPLSNLKVLADHPNCYNIQGGINRVWGNY

Query:  FYYGGPGRNVRCP
        FYYGGPGRNVRCP
Subjt:  FYYGGPGRNVRCP

XP_038891414.1 uncharacterized protein LOC120080834 isoform X1 [Benincasa hispida], e-value 7.54e-296, identity 94.43%
Query:  SSILLPVLVFSLLVVSSFCPVHS-----HPINKTTTFRPQDHLKKLKLIRAHLKKINKPPIKTIQSPDGDLIDCVITHQQPAFDHPLLKGQKPLDLPERP
        SS L+ VLVFSLLVVSSFCPVHS     H INKTTTFRPQDH KKLKLIRAHLKKINKPPIKTIQSPDGDLIDCVI+HQQPAFDHPLLKG+KPLDLPERP
Subjt:  SSILLPVLVFSLLVVSSFCPVHS-----HPINKTTTFRPQDHLKKLKLIRAHLKKINKPPIKTIQSPDGDLIDCVITHQQPAFDHPLLKGQKPLDLPERP

Query:  YERSSSGEESSETFQLWSMSGEFCPEGSVPIRRTTENDMMRASSVQRFGRKVRRRIRRDSSSSGHEHAVGFVSGEEYYGAKGSINVWAPRVTNQYEFSLS
        +ERSSSG   SETFQLWSMSGEFCPEG+VPIRRTTE DM+RASSVQRFGRKVRRRIRRDSSS+GHEHAVGFVSGEEYYGAK SINVWAPRVTNQYEFSLS
Subjt:  YERSSSGEESSETFQLWSMSGEFCPEGSVPIRRTTENDMMRASSVQRFGRKVRRRIRRDSSSSGHEHAVGFVSGEEYYGAKGSINVWAPRVTNQYEFSLS

Query:  QMWVISGSFGDDLNTIEAGWQVSPELYGDNYPRFFTYWTSDAYQTTGCYNLLCSGFVQTNNRIAIGAAISPTSSYNGGQFDISLLVWKDPKHGNWWLEFG
        QMWVISGSFGDDLNTIEAGWQVSPELYGDNYPRFFTYWTSDAYQTTGCYNLLCSGFVQTNNRIAIGAAISPTSSYNGGQFDISLLVWKDPKHGNWWLEFG
Subjt:  QMWVISGSFGDDLNTIEAGWQVSPELYGDNYPRFFTYWTSDAYQTTGCYNLLCSGFVQTNNRIAIGAAISPTSSYNGGQFDISLLVWKDPKHGNWWLEFG

Query:  SGVLVGYWPAFLFTHLRSHASMIQFGGEVVNSRASGFHTATQMGSGHFAGEGFGKASYFRNLQVVDWDNSLIPLSNLKVLADHPNCYNIQGGINRVWGNY
        SGVLVGYWPAFLFTHLRSHA+MIQFGGEVVNSR SGFHTATQMGSGHFAGEGFGKASYFRNLQVVDWDNSLIPLSNLKVLADHPNCYNIQGGINRVWGNY
Subjt:  SGVLVGYWPAFLFTHLRSHASMIQFGGEVVNSRASGFHTATQMGSGHFAGEGFGKASYFRNLQVVDWDNSLIPLSNLKVLADHPNCYNIQGGINRVWGNY

Query:  FYYGGPGRNVRCP
        FYYGGPGRNVRCP
Subjt:  FYYGGPGRNVRCP

TrEMBL top hits (e-value, % identity, alignment)
A0A0A0KQ49 Uncharacterized protein, e-value 9.2e-232, identity 94.05%
Query:  MDSSSSS--SSSSSILLPVLVFSLLVVSSFCPVHSHPINKTTTFRPQDHLKKLKLIRAHLKKINKPPIKTIQSPDGDLIDCVITHQQPAFDHPLLKGQKP
        MDSSSSS  SSSSS+LLP+LVFSLLVVSSFCPVHSHPINKTT FRPQDHLKKLKL                 SPDGDLIDCVITHQQPAFDHPLLKGQKP
Subjt:  MDSSSSS--SSSSSILLPVLVFSLLVVSSFCPVHSHPINKTTTFRPQDHLKKLKLIRAHLKKINKPPIKTIQSPDGDLIDCVITHQQPAFDHPLLKGQKP

Query:  LDLPERPYERSSSGEESSETFQLWSMSGEFCPEGSVPIRRTTENDMMRASSVQRFGRKVRRRIRRDSSSSGHEHAVGFVSGEEYYGAKGSINVWAPRVTN
        LDLP+RPYERSSSGEESSETFQLWSMSGEFCPEGSVPIRRTTENDMMRASSVQRFGRKVRRRIRRDSSSSGHEHAVGFVSGEEYYGAKGSINVWAPRVTN
Subjt:  LDLPERPYERSSSGEESSETFQLWSMSGEFCPEGSVPIRRTTENDMMRASSVQRFGRKVRRRIRRDSSSSGHEHAVGFVSGEEYYGAKGSINVWAPRVTN

Query:  QYEFSLSQMWVISGSFGDDLNTIEAGWQVSPELYGDNYPRFFTYWTSDAYQTTGCYNLLCSGFVQTNNRIAIGAAISPTSSYNGGQFDISLLVWKDPKHG
        QYEFSLSQMWVISGSFGDDLNTIEAGWQVSPELYGDNYPRFFTYWTSDAYQTTGCYNLLCSGFVQTNNRIAIGAAISPTSSYNGGQFDISLLVWKDPKHG
Subjt:  QYEFSLSQMWVISGSFGDDLNTIEAGWQVSPELYGDNYPRFFTYWTSDAYQTTGCYNLLCSGFVQTNNRIAIGAAISPTSSYNGGQFDISLLVWKDPKHG

Query:  NWWLEFGSGVLVGYWPAFLFTHLRSHASMIQFGGEVVNSRASGFHTATQMGSGHFAGEGFGKASYFRNLQVVDWDNSLIPLSNLKVLADHPNCYNIQGGI
        NWWLEFGSGVLVGYWPAFLFTHLRSHA+MIQFGGEVVNSRASGFHT TQMGSGHFAGEGFGKASYFRNLQVVDWDNSLIPLSNLKVLADHPNCYNIQGGI
Subjt:  NWWLEFGSGVLVGYWPAFLFTHLRSHASMIQFGGEVVNSRASGFHTATQMGSGHFAGEGFGKASYFRNLQVVDWDNSLIPLSNLKVLADHPNCYNIQGGI

Query:  NRVWGNYFYYGGPGRNVRCP
        NRVWGNYFYYGGPGRNVRCP
Subjt:  NRVWGNYFYYGGPGRNVRCP

A0A1S3BDY3 uncharacterized protein LOC103488982, e-value 5.7e-258, identity 100%
Query:  MPKFYSYKLQQKQKHMDSSSSSSSSSSILLPVLVFSLLVVSSFCPVHSHPINKTTTFRPQDHLKKLKLIRAHLKKINKPPIKTIQSPDGDLIDCVITHQQ
        MPKFYSYKLQQKQKHMDSSSSSSSSSSILLPVLVFSLLVVSSFCPVHSHPINKTTTFRPQDHLKKLKLIRAHLKKINKPPIKTIQSPDGDLIDCVITHQQ
Subjt:  MPKFYSYKLQQKQKHMDSSSSSSSSSSILLPVLVFSLLVVSSFCPVHSHPINKTTTFRPQDHLKKLKLIRAHLKKINKPPIKTIQSPDGDLIDCVITHQQ

Query:  PAFDHPLLKGQKPLDLPERPYERSSSGEESSETFQLWSMSGEFCPEGSVPIRRTTENDMMRASSVQRFGRKVRRRIRRDSSSSGHEHAVGFVSGEEYYGA
        PAFDHPLLKGQKPLDLPERPYERSSSGEESSETFQLWSMSGEFCPEGSVPIRRTTENDMMRASSVQRFGRKVRRRIRRDSSSSGHEHAVGFVSGEEYYGA
Subjt:  PAFDHPLLKGQKPLDLPERPYERSSSGEESSETFQLWSMSGEFCPEGSVPIRRTTENDMMRASSVQRFGRKVRRRIRRDSSSSGHEHAVGFVSGEEYYGA

Query:  KGSINVWAPRVTNQYEFSLSQMWVISGSFGDDLNTIEAGWQVSPELYGDNYPRFFTYWTSDAYQTTGCYNLLCSGFVQTNNRIAIGAAISPTSSYNGGQF
        KGSINVWAPRVTNQYEFSLSQMWVISGSFGDDLNTIEAGWQVSPELYGDNYPRFFTYWTSDAYQTTGCYNLLCSGFVQTNNRIAIGAAISPTSSYNGGQF
Subjt:  KGSINVWAPRVTNQYEFSLSQMWVISGSFGDDLNTIEAGWQVSPELYGDNYPRFFTYWTSDAYQTTGCYNLLCSGFVQTNNRIAIGAAISPTSSYNGGQF

Query:  DISLLVWKDPKHGNWWLEFGSGVLVGYWPAFLFTHLRSHASMIQFGGEVVNSRASGFHTATQMGSGHFAGEGFGKASYFRNLQVVDWDNSLIPLSNLKVL
        DISLLVWKDPKHGNWWLEFGSGVLVGYWPAFLFTHLRSHASMIQFGGEVVNSRASGFHTATQMGSGHFAGEGFGKASYFRNLQVVDWDNSLIPLSNLKVL
Subjt:  DISLLVWKDPKHGNWWLEFGSGVLVGYWPAFLFTHLRSHASMIQFGGEVVNSRASGFHTATQMGSGHFAGEGFGKASYFRNLQVVDWDNSLIPLSNLKVL

Query:  ADHPNCYNIQGGINRVWGNYFYYGGPGRNVRCP
        ADHPNCYNIQGGINRVWGNYFYYGGPGRNVRCP
Subjt:  ADHPNCYNIQGGINRVWGNYFYYGGPGRNVRCP

A0A6J1DC30 uncharacterized protein LOC111019661, e-value 1.3e-222, identity 90.58%
Query:  SSILLPVLVFSLLVVSSFCPVHSH-----PINKTTTFRPQDHLKKLKLIRAHLKKINKPPIKTIQSPDGDLIDCVITHQQPAFDHPLLKGQKPLDLPERP
        SS L+PVLVFSLLVVSSFCPVHS      P N T T +P+D L KLKLIRAHLKKINKPP+KTIQSPDGDLIDCVI+H QPAFDHPLLKGQKPLDLPERP
Subjt:  SSILLPVLVFSLLVVSSFCPVHSH-----PINKTTTFRPQDHLKKLKLIRAHLKKINKPPIKTIQSPDGDLIDCVITHQQPAFDHPLLKGQKPLDLPERP

Query:  YERSSSGEESSETFQLWSMSGEFCPEGSVPIRRTTENDMMRASSVQRFGRKV-RRRIRRDSSSSGHEHAVGFVSGEEYYGAKGSINVWAPRVTNQYEFSL
        Y+RS SG  +SETFQLWSMSGE CPEG+VPIRRT E DM+RASSVQRFGRKV RRRIRRDS+S+GHEHAVGFVSGE+YYGAK SINVWAPRVTNQYEFSL
Subjt:  YERSSSGEESSETFQLWSMSGEFCPEGSVPIRRTTENDMMRASSVQRFGRKV-RRRIRRDSSSSGHEHAVGFVSGEEYYGAKGSINVWAPRVTNQYEFSL

Query:  SQMWVISGSFGDDLNTIEAGWQVSPELYGDNYPRFFTYWTSDAYQTTGCYNLLCSGFVQTNNRIAIGAAISPTSSYNGGQFDISLLVWKDPKHGNWWLEF
        SQMWVISGSFGDDLNTIEAGWQVSPELYGDNYPRFFTYWTSDAYQ TGCYNLLCSGFVQTN+RIAIGAAISPTSSYNGGQFDISLLVWKDPKHGNWWLEF
Subjt:  SQMWVISGSFGDDLNTIEAGWQVSPELYGDNYPRFFTYWTSDAYQTTGCYNLLCSGFVQTNNRIAIGAAISPTSSYNGGQFDISLLVWKDPKHGNWWLEF

Query:  GSGVLVGYWPAFLFTHLRSHASMIQFGGEVVNSRASGFHTATQMGSGHFAGEGFGKASYFRNLQVVDWDNSLIPLSNLKVLADHPNCYNIQGGINRVWGN
        GSGVLVGYWPAFLFTHL+SHA+MIQFGGEVVNSRASGFHTATQMGSGHFAGEGFGKASYFRNLQVVDWDNSLIPLSNLKVLADHPNCY+IQGGINRVWGN
Subjt:  GSGVLVGYWPAFLFTHLRSHASMIQFGGEVVNSRASGFHTATQMGSGHFAGEGFGKASYFRNLQVVDWDNSLIPLSNLKVLADHPNCYNIQGGINRVWGN

Query:  YFYYGGPGRNVRCP
        YFYYGGPGRNVRCP
Subjt:  YFYYGGPGRNVRCP

A0A6J1GXT6 uncharacterized protein LOC111458179, e-value 8.7e-222, identity 88.81%
Query:  SSSSSSILLPVLVFSLLVVSSFCPVHS--------HPINKTTTFRPQDHLKKLKLIRAHLKKINKPPIKTIQSPDGDLIDCVITHQQPAFDHPLLKGQKP
        SS++   LLP+LVFSLLVVS   PV+S        HP N+TTTFRPQD LKKLKLIRAHLKKINKP +KTIQSPDGDLIDCVI+HQQPAFDHPLLKGQKP
Subjt:  SSSSSSILLPVLVFSLLVVSSFCPVHS--------HPINKTTTFRPQDHLKKLKLIRAHLKKINKPPIKTIQSPDGDLIDCVITHQQPAFDHPLLKGQKP

Query:  LDLPERPYERSSSGEESSETFQLWSMSGEFCPEGSVPIRRTTENDMMRASSVQRFGRKVRRRIRRDSSSSGHEHAVGFVSGEEYYGAKGSINVWAPRVTN
        LD+ ERPY++SSSGE  SE+FQLWSMSGE CPEG+VPIRRTTE DM+RASS++RFGRK RRRIRRDSS+ GHEHAVGFVSG+EYYGAK SINVWAPRVTN
Subjt:  LDLPERPYERSSSGEESSETFQLWSMSGEFCPEGSVPIRRTTENDMMRASSVQRFGRKVRRRIRRDSSSSGHEHAVGFVSGEEYYGAKGSINVWAPRVTN

Query:  QYEFSLSQMWVISGSFGDDLNTIEAGWQVSPELYGDNYPRFFTYWTSDAYQTTGCYNLLCSGFVQTNNRIAIGAAISPTSSYNGGQFDISLLVWKDPKHG
        QYEFSLSQMWVISGSFGDDLNTIEAGWQVSPELYGDNYPRFFTYWTSDAYQ TGCYNLLCSGFVQTN+RIAIGAAISPTSSYNGGQFDISLLVWKDPKHG
Subjt:  QYEFSLSQMWVISGSFGDDLNTIEAGWQVSPELYGDNYPRFFTYWTSDAYQTTGCYNLLCSGFVQTNNRIAIGAAISPTSSYNGGQFDISLLVWKDPKHG

Query:  NWWLEFGSGVLVGYWPAFLFTHLRSHASMIQFGGEVVNSRASGFHTATQMGSGHFAGEGFGKASYFRNLQVVDWDNSLIPLSNLKVLADHPNCYNIQGGI
        NWWLEFGSGVLVGYWPAFLFTHLRSHA+MIQFGGEVVNS+ SGFHTATQMGSGHFAGEGFGKASYFRNLQVVDWDNSLIPLSNLKVLADHPNCYNIQGGI
Subjt:  NWWLEFGSGVLVGYWPAFLFTHLRSHASMIQFGGEVVNSRASGFHTATQMGSGHFAGEGFGKASYFRNLQVVDWDNSLIPLSNLKVLADHPNCYNIQGGI

Query:  NRVWGNYFYYGGPGRNVRCP
        NRVWGNYFYYGGPGRNVRCP
Subjt:  NRVWGNYFYYGGPGRNVRCP

A0A6J1IRB1 uncharacterized protein LOC111477750, e-value 6.9e-219, identity 87.86%
Query:  SSSSSSILLPVLVFSLLVVSSFCPVHS--------HPINKTTTFRPQDHLKKLKLIRAHLKKINKPPIKTIQSPDGDLIDCVITHQQPAFDHPLLKGQKP
        SS++   LLP+LVF LL +S   PV+S        HP N+TTTF PQD LKKLKLIRAHLKKINKP +KTIQSPDGDLIDCVI+HQQPAFDHPLLKGQKP
Subjt:  SSSSSSILLPVLVFSLLVVSSFCPVHS--------HPINKTTTFRPQDHLKKLKLIRAHLKKINKPPIKTIQSPDGDLIDCVITHQQPAFDHPLLKGQKP

Query:  LDLPERPYERSSSGEESSETFQLWSMSGEFCPEGSVPIRRTTENDMMRASSVQRFGRKVRRRIRRDSSSSGHEHAVGFVSGEEYYGAKGSINVWAPRVTN
        LD+ ERPY++SSS E  SE+FQLWSMSGE CPEG+VPIRRTTE DM+RASSV+RFGRK RRRIRRDSS+ GHEHAVGFVSG+EYYGAK SINVWAPRVTN
Subjt:  LDLPERPYERSSSGEESSETFQLWSMSGEFCPEGSVPIRRTTENDMMRASSVQRFGRKVRRRIRRDSSSSGHEHAVGFVSGEEYYGAKGSINVWAPRVTN

Query:  QYEFSLSQMWVISGSFGDDLNTIEAGWQVSPELYGDNYPRFFTYWTSDAYQTTGCYNLLCSGFVQTNNRIAIGAAISPTSSYNGGQFDISLLVWKDPKHG
        QYEFSLSQMWVISGSFGDDLNTIEAGWQVSPELYGDNYPRFFTYWTSDAYQ TGCYNLLCSGFVQTN+RIAIGAAISPTSSYNGGQFDISLLVWKDPKHG
Subjt:  QYEFSLSQMWVISGSFGDDLNTIEAGWQVSPELYGDNYPRFFTYWTSDAYQTTGCYNLLCSGFVQTNNRIAIGAAISPTSSYNGGQFDISLLVWKDPKHG

Query:  NWWLEFGSGVLVGYWPAFLFTHLRSHASMIQFGGEVVNSRASGFHTATQMGSGHFAGEGFGKASYFRNLQVVDWDNSLIPLSNLKVLADHPNCYNIQGGI
        NWWLEFGSGVLVGYWPAFLFTHLRSHA+MIQFGGEVVNS+ SGFHTATQMGSGHFAGEGFGKASYFRNLQVVDWDNSLIPLSNLKVLADHPNCYNIQGGI
Subjt:  NWWLEFGSGVLVGYWPAFLFTHLRSHASMIQFGGEVVNSRASGFHTATQMGSGHFAGEGFGKASYFRNLQVVDWDNSLIPLSNLKVLADHPNCYNIQGGI

Query:  NRVWGNYFYYGGPGRNVRCP
        NRVWGNYFYYGGPGRNVRCP
Subjt:  NRVWGNYFYYGGPGRNVRCP

SwissProt top hits (e-value, % identity, alignment)
No hits found
Arabidopsis top hits (e-value, % identity, alignment)
AT1G10750.1 Protein of Unknown Function (DUF239), e-value 2.3e-182, identity 72.93%
Query:  SSSILLPVLVFSLLVVSSFCPVHSHPIN-KTTTFRPQDHLKKLKLIRAHLKKINKPPIKTIQSPDGDLIDCVITHQQPAFDHPLLKGQKPLDLPERPYER
        SS+IL   L   LL+ SSF  V S  ++ +  T RP D L KLK I  HL+KINKP IKTI SPDGD+IDCV+ H QPAFDHP L+GQKPLD PERP   
Subjt:  SSSILLPVLVFSLLVVSSFCPVHSHPIN-KTTTFRPQDHLKKLKLIRAHLKKINKPPIKTIQSPDGDLIDCVITHQQPAFDHPLLKGQKPLDLPERPYER

Query:  SSSGEESSETFQLWSMSGEFCPEGSVPIRRTTENDMMRASSVQRFGRKVRRRIRRDSSSSGHEHAVGFVSGEEYYGAKGSINVWAPRVTNQYEFSLSQMW
        +  G    ++FQLW M GE CPEG+VPIRRT E D++RA+SV  FG+K+ R  RRD+SS+GHEHAVG+VSGE+YYGAK SINVWAP+V NQYEFSLSQ+W
Subjt:  SSSGEESSETFQLWSMSGEFCPEGSVPIRRTTENDMMRASSVQRFGRKVRRRIRRDSSSSGHEHAVGFVSGEEYYGAKGSINVWAPRVTNQYEFSLSQMW

Query:  VISGSFGDDLNTIEAGWQVSPELYGDNYPRFFTYWTSDAYQTTGCYNLLCSGFVQTNNRIAIGAAISPTSSYNGGQFDISLLVWKDPKHGNWWLEFGSGV
        +ISGSFG+DLNTIEAGWQVSPELYGDNYPRFFTYWT+DAYQ TGCYNLLCSGFVQTN+ IAIGAAISP+SSY GGQFDI+LL+WKDPKHGNWWLEFGSG+
Subjt:  VISGSFGDDLNTIEAGWQVSPELYGDNYPRFFTYWTSDAYQTTGCYNLLCSGFVQTNNRIAIGAAISPTSSYNGGQFDISLLVWKDPKHGNWWLEFGSGV

Query:  LVGYWPAFLFTHLRSHASMIQFGGEVVNSRASGFHTATQMGSGHFAGEGFGKASYFRNLQVVDWDNSLIPLSNLKVLADHPNCYNIQGGINRVWGNYFYY
        LVGYWP+FLFTHL+ HASM+Q+GGE+VNS   G HT+TQMGSGHFA EGF K+SYFRN+QVVDWDN+L+P  NL+VLADHPNCY+IQGG NR WG+YFYY
Subjt:  LVGYWPAFLFTHLRSHASMIQFGGEVVNSRASGFHTATQMGSGHFAGEGFGKASYFRNLQVVDWDNSLIPLSNLKVLADHPNCYNIQGGINRVWGNYFYY

Query:  GGPGRNVRCP
        GGPG+N +CP
Subjt:  GGPGRNVRCP

AT1G23340.1 Protein of Unknown Function (DUF239), e-value 2.9e-185, identity 73.24%
Query:  SSSSSILLPVLVFSLLVVSSFCPVHSHPINKTTTFRPQDHLKKLKLIRAHLKKINKPPIKTIQSPDGDLIDCVITHQQPAFDHPLLKGQKPLDLPERPYE
        SSSSS L    +  L + SS+    S+  ++T   RPQ  ++K+KLIR  L+KINKP IKTI S DGD IDCV +H QPAFDHPLL+GQ+P+D PE P  
Subjt:  SSSSSILLPVLVFSLLVVSSFCPVHSHPINKTTTFRPQDHLKKLKLIRAHLKKINKPPIKTIQSPDGDLIDCVITHQQPAFDHPLLKGQKPLDLPERPYE

Query:  RSSSGEESSETFQLWSMSGEFCPEGSVPIRRTTENDMMRASSVQRFGRKVRRRIRRDSSSSGHEHAVGFVSGEEYYGAKGSINVWAPRVTNQYEFSLSQM
         S    ES E FQLWS+ GE CPEG++PIRRTTE DM+RA+SV+RFGRK+ RR+RRDSSS+GHEHAVG+VSG +YYGAK SINVW PRV +QYEFSLSQ+
Subjt:  RSSSGEESSETFQLWSMSGEFCPEGSVPIRRTTENDMMRASSVQRFGRKVRRRIRRDSSSSGHEHAVGFVSGEEYYGAKGSINVWAPRVTNQYEFSLSQM

Query:  WVISGSFGDDLNTIEAGWQVSPELYGDNYPRFFTYWTSDAYQTTGCYNLLCSGFVQTNNRIAIGAAISPTSSYNGGQFDISLLVWKDPKHGNWWLEFGSG
        W+I+GSF  DLNTIEAGWQ+SPELYGD  PRFFTYWTSDAYQ TGCYNLLCSGFVQTNNRIAIGAAISP SSY GGQFDISLL+WKDPKHG+WWL+FGSG
Subjt:  WVISGSFGDDLNTIEAGWQVSPELYGDNYPRFFTYWTSDAYQTTGCYNLLCSGFVQTNNRIAIGAAISPTSSYNGGQFDISLLVWKDPKHGNWWLEFGSG

Query:  VLVGYWPAFLFTHLRSHASMIQFGGEVVNSRASGFHTATQMGSGHFAGEGFGKASYFRNLQVVDWDNSLIPLSNLKVLADHPNCYNIQGGINRVWGNYFY
         LVGYWP  LFTHLR H +M+QFGGE+VN+R  G HT+TQMGSGHFAGEGFGKASYFRNLQ+VDWDN+LIP+SNLKVLADHPNCY+I+GG+NRVWGN+FY
Subjt:  VLVGYWPAFLFTHLRSHASMIQFGGEVVNSRASGFHTATQMGSGHFAGEGFGKASYFRNLQVVDWDNSLIPLSNLKVLADHPNCYNIQGGINRVWGNYFY

Query:  YGGPGRNVRCP
        YGGPG+N +CP
Subjt:  YGGPGRNVRCP

AT1G23340.2 Protein of Unknown Function (DUF239), e-value 2.9e-185, identity 73.24%
Query:  SSSSSILLPVLVFSLLVVSSFCPVHSHPINKTTTFRPQDHLKKLKLIRAHLKKINKPPIKTIQSPDGDLIDCVITHQQPAFDHPLLKGQKPLDLPERPYE
        SSSSS L    +  L + SS+    S+  ++T   RPQ  ++K+KLIR  L+KINKP IKTI S DGD IDCV +H QPAFDHPLL+GQ+P+D PE P  
Subjt:  SSSSSILLPVLVFSLLVVSSFCPVHSHPINKTTTFRPQDHLKKLKLIRAHLKKINKPPIKTIQSPDGDLIDCVITHQQPAFDHPLLKGQKPLDLPERPYE

Query:  RSSSGEESSETFQLWSMSGEFCPEGSVPIRRTTENDMMRASSVQRFGRKVRRRIRRDSSSSGHEHAVGFVSGEEYYGAKGSINVWAPRVTNQYEFSLSQM
         S    ES E FQLWS+ GE CPEG++PIRRTTE DM+RA+SV+RFGRK+ RR+RRDSSS+GHEHAVG+VSG +YYGAK SINVW PRV +QYEFSLSQ+
Subjt:  RSSSGEESSETFQLWSMSGEFCPEGSVPIRRTTENDMMRASSVQRFGRKVRRRIRRDSSSSGHEHAVGFVSGEEYYGAKGSINVWAPRVTNQYEFSLSQM

Query:  WVISGSFGDDLNTIEAGWQVSPELYGDNYPRFFTYWTSDAYQTTGCYNLLCSGFVQTNNRIAIGAAISPTSSYNGGQFDISLLVWKDPKHGNWWLEFGSG
        W+I+GSF  DLNTIEAGWQ+SPELYGD  PRFFTYWTSDAYQ TGCYNLLCSGFVQTNNRIAIGAAISP SSY GGQFDISLL+WKDPKHG+WWL+FGSG
Subjt:  WVISGSFGDDLNTIEAGWQVSPELYGDNYPRFFTYWTSDAYQTTGCYNLLCSGFVQTNNRIAIGAAISPTSSYNGGQFDISLLVWKDPKHGNWWLEFGSG

Query:  VLVGYWPAFLFTHLRSHASMIQFGGEVVNSRASGFHTATQMGSGHFAGEGFGKASYFRNLQVVDWDNSLIPLSNLKVLADHPNCYNIQGGINRVWGNYFY
         LVGYWP  LFTHLR H +M+QFGGE+VN+R  G HT+TQMGSGHFAGEGFGKASYFRNLQ+VDWDN+LIP+SNLKVLADHPNCY+I+GG+NRVWGN+FY
Subjt:  VLVGYWPAFLFTHLRSHASMIQFGGEVVNSRASGFHTATQMGSGHFAGEGFGKASYFRNLQVVDWDNSLIPLSNLKVLADHPNCYNIQGGINRVWGNYFY

Query:  YGGPGRNVRCP
        YGGPG+N +CP
Subjt:  YGGPGRNVRCP

AT1G70550.1 Protein of Unknown Function (DUF239), e-value 6.8e-187, identity 74.22%
Query:  SSSSSSSSSSILLPVLVFSLLVVSSFCPVHSHPINKTT---TFRPQDHLKKLKLIRAHLKKINKPPIKTIQSPDGDLIDCVITHQQPAFDHPLLKGQKPL
        S  S    SS  L +++   LV SSF    S   N T    T RPQ+ L+KL LIR  L KINKP +KTIQS DGD IDCV THQQPAFDHPLL+GQKPL
Subjt:  SSSSSSSSSSILLPVLVFSLLVVSSFCPVHSHPINKTT---TFRPQDHLKKLKLIRAHLKKINKPPIKTIQSPDGDLIDCVITHQQPAFDHPLLKGQKPL

Query:  DLPERPYERSSSGEESSETFQLWSMSGEFCPEGSVPIRRTTENDMMRASSVQRFGRKVRRRIRRDSSSSGHEHAVGFVSGEEYYGAKGSINVWAPRVTNQ
        D PE P +  S  + S E  QLWS+SGE CPEG++PIRRTTE DM+RASSVQRFGRK+ RR++RDS+++GHEHAVG+V+G +YYGAK SINVW+PRVT+Q
Subjt:  DLPERPYERSSSGEESSETFQLWSMSGEFCPEGSVPIRRTTENDMMRASSVQRFGRKVRRRIRRDSSSSGHEHAVGFVSGEEYYGAKGSINVWAPRVTNQ

Query:  YEFSLSQMWVISGSFGDDLNTIEAGWQVSPELYGDNYPRFFTYWTSDAYQTTGCYNLLCSGFVQTNNRIAIGAAISPTSSYNGGQFDISLLVWKDPKHGN
        YEFSLSQ+WVI+GSF  DLNTIEAGWQ+SPELYGD YPRFFTYWTSDAY+TTGCYNLLCSGFVQTN RIAIGAAISP SSY GGQFDISLL+WKDPKHG+
Subjt:  YEFSLSQMWVISGSFGDDLNTIEAGWQVSPELYGDNYPRFFTYWTSDAYQTTGCYNLLCSGFVQTNNRIAIGAAISPTSSYNGGQFDISLLVWKDPKHGN

Query:  WWLEFGSGVLVGYWPAFLFTHLRSHASMIQFGGEVVNSRASGFHTATQMGSGHFAGEGFGKASYFRNLQVVDWDNSLIPLSNLKVLADHPNCYNIQGGIN
        WWL+FGSG LVGYWPAFLFTHL+ H SM+QFGGE+VN+R  G HT TQMGSGHFAGEGFGKASYFRNLQ+VDWDN+LIP SNLK+LADHPNCY+I+GG N
Subjt:  WWLEFGSGVLVGYWPAFLFTHLRSHASMIQFGGEVVNSRASGFHTATQMGSGHFAGEGFGKASYFRNLQVVDWDNSLIPLSNLKVLADHPNCYNIQGGIN

Query:  RVWGNYFYYGGPGRNVRCP
        RVWGNYFYYGGPG+N RCP
Subjt:  RVWGNYFYYGGPGRNVRCP

AT1G70550.2 Protein of Unknown Function (DUF239), e-value 3.1e-187, identity 75.18%
Query:  SSILLPVLVFSLLVVSSFCPVHSHPINKTT---TFRPQDHLKKLKLIRAHLKKINKPPIKTIQSPDGDLIDCVITHQQPAFDHPLLKGQKPLDLPERPYE
        SS  L +++   LV SSF    S   N T    T RPQ+ L+KL LIR  L KINKP +KTIQS DGD IDCV THQQPAFDHPLL+GQKPLD PE P +
Subjt:  SSILLPVLVFSLLVVSSFCPVHSHPINKTT---TFRPQDHLKKLKLIRAHLKKINKPPIKTIQSPDGDLIDCVITHQQPAFDHPLLKGQKPLDLPERPYE

Query:  RSSSGEESSETFQLWSMSGEFCPEGSVPIRRTTENDMMRASSVQRFGRKVRRRIRRDSSSSGHEHAVGFVSGEEYYGAKGSINVWAPRVTNQYEFSLSQM
          S  + S E  QLWS+SGE CPEG++PIRRTTE DM+RASSVQRFGRK+ RR++RDS+++GHEHAVG+V+G +YYGAK SINVW+PRVT+QYEFSLSQ+
Subjt:  RSSSGEESSETFQLWSMSGEFCPEGSVPIRRTTENDMMRASSVQRFGRKVRRRIRRDSSSSGHEHAVGFVSGEEYYGAKGSINVWAPRVTNQYEFSLSQM

Query:  WVISGSFGDDLNTIEAGWQVSPELYGDNYPRFFTYWTSDAYQTTGCYNLLCSGFVQTNNRIAIGAAISPTSSYNGGQFDISLLVWKDPKHGNWWLEFGSG
        WVI+GSF  DLNTIEAGWQ+SPELYGD YPRFFTYWTSDAY+TTGCYNLLCSGFVQTN RIAIGAAISP SSY GGQFDISLL+WKDPKHG+WWL+FGSG
Subjt:  WVISGSFGDDLNTIEAGWQVSPELYGDNYPRFFTYWTSDAYQTTGCYNLLCSGFVQTNNRIAIGAAISPTSSYNGGQFDISLLVWKDPKHGNWWLEFGSG

Query:  VLVGYWPAFLFTHLRSHASMIQFGGEVVNSRASGFHTATQMGSGHFAGEGFGKASYFRNLQVVDWDNSLIPLSNLKVLADHPNCYNIQGGINRVWGNYFY
         LVGYWPAFLFTHL+ H SM+QFGGE+VN+R  G HT TQMGSGHFAGEGFGKASYFRNLQ+VDWDN+LIP SNLK+LADHPNCY+I+GG NRVWGNYFY
Subjt:  VLVGYWPAFLFTHLRSHASMIQFGGEVVNSRASGFHTATQMGSGHFAGEGFGKASYFRNLQVVDWDNSLIPLSNLKVLADHPNCYNIQGGINRVWGNYFY

Query:  YGGPGRNVRCP
        YGGPG+N RCP
Subjt:  YGGPGRNVRCP
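In the alignments above, the middle line of each row marks an identical residue with its letter, a similar residue with `+`, and a mismatch or gap with a space. As a minimal sketch, the percent identity of a hit can be estimated by counting letters on those middle lines. The function names are hypothetical, and the sketch assumes that every column of the query row, gap columns included, counts toward the alignment length; it is not claimed to reproduce the percentages listed on this page.

```python
from typing import Iterable, Tuple


def count_identities(query: str, midline: str) -> Tuple[int, int]:
    """Return (identical columns, aligned columns) for one alignment row.

    The midline is padded to the query length because trailing spaces
    are often lost when alignments are copied as plain text.
    """
    padded = midline.ljust(len(query))[: len(query)]
    identical = sum(1 for ch in padded if ch.isalpha())
    return identical, len(query)


def percent_identity(rows: Iterable[Tuple[str, str]]) -> float:
    """Percent identity over several (query, midline) rows of one hit."""
    identical = total = 0
    for query, midline in rows:
        row_identical, row_total = count_identities(query, midline)
        identical += row_identical
        total += row_total
    if total == 0:
        raise ValueError("no aligned columns given")
    return 100.0 * identical / total


# Final row of the AT1G10750.1 alignment: 7 identical columns out of 10.
print(percent_identity([("GGPGRNVRCP", "GGPG+N +CP")]))
```

Passing every row of a hit, rather than a single row, gives the figure for the whole alignment.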


Sequences
CDS sequence
ATGCCCAAATTTTACAGTTACAAATTACAACAAAAACAAAAACACATGGATTCTTCTTCCTCCTCTTCTTCTTCTTCTTCTATTCTTCTTCCTGTTCTTGTTTTTTCCCT
TCTTGTCGTTTCATCCTTTTGTCCTGTCCATTCCCATCCAATTAACAAAACGACGACGTTTCGCCCACAAGACCATCTCAAGAAATTGAAACTAATTAGAGCTCATCTCA
AGAAAATCAACAAGCCACCCATCAAAACAATTCAGAGCCCTGACGGTGATCTTATAGATTGTGTCATAACTCATCAACAGCCAGCTTTCGATCATCCATTACTGAAAGGA
CAGAAGCCATTGGATCTGCCGGAGAGACCGTACGAACGGAGTAGTTCGGGGGAGGAATCATCGGAAACATTCCAATTATGGAGTATGTCCGGGGAATTTTGCCCGGAGGG
CAGTGTTCCAATCAGAAGAACAACAGAAAATGATATGATGAGAGCGAGCTCTGTTCAAAGATTTGGAAGGAAAGTAAGAAGAAGGATTAGAAGAGATTCATCCAGCAGTG
GGCATGAGCATGCAGTGGGATTTGTAAGTGGAGAAGAGTACTACGGAGCAAAAGGAAGCATAAACGTATGGGCACCGCGGGTGACGAATCAATATGAGTTTAGTCTGTCA
CAGATGTGGGTCATTTCGGGTTCTTTTGGAGATGATCTCAATACCATTGAAGCTGGCTGGCAGGTTAGCCCAGAGCTGTATGGAGACAATTACCCAAGATTTTTCACTTA
TTGGACTTCAGACGCATATCAAACAACGGGCTGCTACAATCTACTCTGCTCCGGCTTCGTTCAAACCAACAACAGAATCGCCATTGGAGCTGCTATTTCTCCAACTTCTT
CCTACAATGGCGGCCAGTTCGACATCAGTCTTCTTGTTTGGAAGGATCCGAAACATGGAAATTGGTGGTTAGAATTCGGGTCGGGCGTATTAGTGGGATACTGGCCGGCA
TTTCTATTCACTCACCTCCGAAGCCACGCGAGCATGATACAATTCGGTGGTGAGGTGGTGAATTCTCGGGCGTCGGGTTTCCACACGGCGACACAAATGGGAAGCGGCCA
CTTCGCCGGAGAAGGATTCGGAAAAGCTTCATATTTTAGGAACTTGCAAGTGGTGGATTGGGATAACAGCTTAATTCCTCTATCGAATCTCAAGGTTTTGGCTGATCATC
CAAATTGTTATAACATTCAAGGTGGAATTAATAGAGTTTGGGGAAATTATTTTTACTATGGTGGCCCTGGAAGAAATGTGAGATGCCCATAA
mRNA sequence
CCCCCCATAAGCATAAACTCTCTCCCTTTCACCTCTTCTCTTTTGCTTTATTTTATTCACCCCCGTGCCTTTGAAGCTCAACATAGAAGAGAATTTCAAAGGGAAGGACT
TGGGCTTTGGAAATGGTGAGAAACACTGTTCATTTTTTCCGTTTAGTCAATTTCTTGTTGGGTTATTTGTCAAGATGCTACTCCTCAGGCAAAAGCTAAGAATTTAGTGG
CGGTTTTGATGAGTAGTTCAAGATAAATGCCCAAATTTTACAGTTACAAATTACAACAAAAACAAAAACACATGGATTCTTCTTCCTCCTCTTCTTCTTCTTCTTCTATT
CTTCTTCCTGTTCTTGTTTTTTCCCTTCTTGTCGTTTCATCCTTTTGTCCTGTCCATTCCCATCCAATTAACAAAACGACGACGTTTCGCCCACAAGACCATCTCAAGAA
ATTGAAACTAATTAGAGCTCATCTCAAGAAAATCAACAAGCCACCCATCAAAACAATTCAGAGCCCTGACGGTGATCTTATAGATTGTGTCATAACTCATCAACAGCCAG
CTTTCGATCATCCATTACTGAAAGGACAGAAGCCATTGGATCTGCCGGAGAGACCGTACGAACGGAGTAGTTCGGGGGAGGAATCATCGGAAACATTCCAATTATGGAGT
ATGTCCGGGGAATTTTGCCCGGAGGGCAGTGTTCCAATCAGAAGAACAACAGAAAATGATATGATGAGAGCGAGCTCTGTTCAAAGATTTGGAAGGAAAGTAAGAAGAAG
GATTAGAAGAGATTCATCCAGCAGTGGGCATGAGCATGCAGTGGGATTTGTAAGTGGAGAAGAGTACTACGGAGCAAAAGGAAGCATAAACGTATGGGCACCGCGGGTGA
CGAATCAATATGAGTTTAGTCTGTCACAGATGTGGGTCATTTCGGGTTCTTTTGGAGATGATCTCAATACCATTGAAGCTGGCTGGCAGGTTAGCCCAGAGCTGTATGGA
GACAATTACCCAAGATTTTTCACTTATTGGACTTCAGACGCATATCAAACAACGGGCTGCTACAATCTACTCTGCTCCGGCTTCGTTCAAACCAACAACAGAATCGCCAT
TGGAGCTGCTATTTCTCCAACTTCTTCCTACAATGGCGGCCAGTTCGACATCAGTCTTCTTGTTTGGAAGGATCCGAAACATGGAAATTGGTGGTTAGAATTCGGGTCGG
GCGTATTAGTGGGATACTGGCCGGCATTTCTATTCACTCACCTCCGAAGCCACGCGAGCATGATACAATTCGGTGGTGAGGTGGTGAATTCTCGGGCGTCGGGTTTCCAC
ACGGCGACACAAATGGGAAGCGGCCACTTCGCCGGAGAAGGATTCGGAAAAGCTTCATATTTTAGGAACTTGCAAGTGGTGGATTGGGATAACAGCTTAATTCCTCTATC
GAATCTCAAGGTTTTGGCTGATCATCCAAATTGTTATAACATTCAAGGTGGAATTAATAGAGTTTGGGGAAATTATTTTTACTATGGTGGCCCTGGAAGAAATGTGAGAT
GCCCATAAATTTGTGTTTTTTTCTTCTTCTTTTTCTTTTCATGAATTTTCTTTGTGGGGTGTTTTTTTTAGAGGAAAATGTGAAATATTCATGTATTTGT
Protein sequence
MPKFYSYKLQQKQKHMDSSSSSSSSSSILLPVLVFSLLVVSSFCPVHSHPINKTTTFRPQDHLKKLKLIRAHLKKINKPPIKTIQSPDGDLIDCVITHQQPAFDHPLLKG
QKPLDLPERPYERSSSGEESSETFQLWSMSGEFCPEGSVPIRRTTENDMMRASSVQRFGRKVRRRIRRDSSSSGHEHAVGFVSGEEYYGAKGSINVWAPRVTNQYEFSLS
QMWVISGSFGDDLNTIEAGWQVSPELYGDNYPRFFTYWTSDAYQTTGCYNLLCSGFVQTNNRIAIGAAISPTSSYNGGQFDISLLVWKDPKHGNWWLEFGSGVLVGYWPA
FLFTHLRSHASMIQFGGEVVNSRASGFHTATQMGSGHFAGEGFGKASYFRNLQVVDWDNSLIPLSNLKVLADHPNCYNIQGGINRVWGNYFYYGGPGRNVRCP
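The protein sequence above should correspond to a translation of the CDS sequence. The sketch below shows how that relationship could be checked. It assumes the standard nuclear genetic code, and `translate` is a hypothetical helper, not a tool provided by the database.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
_BASES = "TCAG"
_AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(_BASES, repeat=3), _AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS into protein, stopping at the first stop codon.

    Whitespace is removed first, so a sequence wrapped over several
    lines can be pasted in directly. Codons containing characters other
    than A, C, G, T raise KeyError.
    """
    seq = "".join(cds.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein = []
    for i in range(0, len(seq), 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First seven codons and last five codons of the CDS listed above.
print(translate("ATGCCCAAATTTTACAGTTAC"))
print(translate("GTGAGATGCCCATAA"))
```

The two fragments translate to MPKFYSY and VRCP, matching the start and end of the listed protein. Joining all wrapped CDS lines and passing them to `translate` would extend the same check to the full sequence.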