; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lsi09G009720 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene IDLsi09G009720
OrganismLagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
DescriptionPhd finger protein
Genome locationchr09:12942527..12955223
RNA-Seq ExpressionLsi09G009720
SyntenyLsi09G009720
Gene Ontology termsGO:0046872 - metal ion binding (molecular function)
InterPro domainsIPR001965 - Zinc finger, PHD-type
IPR011011 - Zinc finger, FYVE/PHD-type
IPR013083 - Zinc finger, RING/FYVE/PHD-type
IPR019786 - Zinc finger, PHD-type, conserved site
IPR019787 - Zinc finger, PHD-finger
IPR034732 - Extended PHD (ePHD) domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAE8648409.1 hypothetical protein Csa_008617 [Cucumis sativus]0.0e+0080.53Show/hide
Query:  GTCRTSFHPICAREARHRMEVWAKYGCDNVELRAFCSKHSEGRDRSSVQDPLEAVNSSSYVVNHLPVTLSINRPQKLVGRRNIDSLLLCQEASDTNSGKL
        GTCRTSFHPICAREA HRMEVWAKYGCDNVELRAFCSKHSE RDRSS QDP EA+NSSSYVVNHLPVTLSINRP KLVGRRNIDSLLLC+EASDTNSGKL
Subjt:  GTCRTSFHPICAREARHRMEVWAKYGCDNVELRAFCSKHSEGRDRSSVQDPLEAVNSSSYVVNHLPVTLSINRPQKLVGRRNIDSLLLCQEASDTNSGKL

Query:  DDGELEDIGSADPSLNAACVDTQKSTVQGAGD---------------LIDQGKVNVKDVASEIGIPPDLLCAKLTAGNIVPDLKSKIVRWLRNHAYIGSL
        DDGELED GSADP+LNAACVD QKSTVQG  D               LIDQGKVNVKDVA EIGIPPDLLCAKLTA NIVPDLKSKIVRWLRNHAYIGSL
Subjt:  DDGELEDIGSADPSLNAACVDTQKSTVQGAGD---------------LIDQGKVNVKDVASEIGIPPDLLCAKLTAGNIVPDLKSKIVRWLRNHAYIGSL

Query:  QKNLRVKLKSAVLAKAVVGAADRSNSLSVLDSDNSDLIADKMVTPRRKTKSNISHLKN-EIKSSSEEALGGHGLAMQSDALNQQACEEQGDSKKECIQDA
        QKNLRVKLKSAVLAKAVVGAADRS SLSVLDSDNSDLIADKMVTPRRKTK++ISHLKN EIKSSSEE LG +GL  QS++L+QQ      DSKKECIQDA
Subjt:  QKNLRVKLKSAVLAKAVVGAADRSNSLSVLDSDNSDLIADKMVTPRRKTKSNISHLKN-EIKSSSEEALGGHGLAMQSDALNQQACEEQGDSKKECIQDA

Query:  GEKFSLLSDVTILSLTLTNSPEKPYGPEQTTKVALGSVIKCKEEASGKLMIGDENSSFSLLVEKHRTLGMPKQNLATFKQRWVHLNKCGSSQDSPSRNFP
        GEK                                                                                H+N+C SSQ SPSRNFP
Subjt:  GEKFSLLSDVTILSLTLTNSPEKPYGPEQTTKVALGSVIKCKEEASGKLMIGDENSSFSLLVEKHRTLGMPKQNLATFKQRWVHLNKCGSSQDSPSRNFP

Query:  NDVEGDQLEVSVSGHNSSISAVHGKAGESPGSYFHPFVQEKMAYMLHGKLLNVSEGEMSCWQASSNVGGCCDHQHQHVDCNDASCNSGEFSPKQQVNKKI
        N VEG+QLE SVSGH+SSISAVHGKAGESPGSY HPFV+ KM YMLHGKLLNV EGE+SC Q SSN G C D QHQH+DCN+ SCNSG FSPKQQVNKKI
Subjt:  NDVEGDQLEVSVSGHNSSISAVHGKAGESPGSYFHPFVQEKMAYMLHGKLLNVSEGEMSCWQASSNVGGCCDHQHQHVDCNDASCNSGEFSPKQQVNKKI

Query:  GGVIKLSPEDEIEGEIIFYQHRLLANAVSRKRFTDHLICNVVKSLPKEIDEARSTRWDAVLINQYYSELREAKKQGKKERRHKEAQAVLAAATAAAAASS
         G+IK+SPEDEIEGEIIFYQHRLLANAVSRK FTDHLICNVVKSLPKEIDEARSTRWDA+LINQYYS LREAKKQGKKERRHKEAQAVLAAATAAAAASS
Subjt:  GGVIKLSPEDEIEGEIIFYQHRLLANAVSRKRFTDHLICNVVKSLPKEIDEARSTRWDAVLINQYYSELREAKKQGKKERRHKEAQAVLAAATAAAAASS

Query:  RMSSFRKDVYEESTHRELMPRAKETLTKVALPKTSSESDFCKEHARSCDICRRPETILKPILVCSSCKVSVHLDCYRTVKESSGPWCCELCEELSLSRGS
        RMSSFRKDVYEESTHRELMPRAKET TKVALPKTS ESDFCKEHARSCDICRRPETILKPILVCSSCKVSVHLDCYRTVKESSGPWCCELCEELSLSRGS
Subjt:  RMSSFRKDVYEESTHRELMPRAKETLTKVALPKTSSESDFCKEHARSCDICRRPETILKPILVCSSCKVSVHLDCYRTVKESSGPWCCELCEELSLSRGS

Query:  GAPVVNIWEKSYFVAECGLCGGTTGAFRKSSDGQWVHAFCAEWVFESTFKRGQANPVGGMETVSKGVESCYICHRKHGVCLKCNYGHCQSTFHPSCGRSA
        GAPVVN WEKSYFVAECGLCGGTTGAFRKSSDGQWVHAFCAEWVFESTFKRGQANPVGGMETVSKG +SCYICHRKHGVCLKCNYGHCQSTFHPSCGRSA
Subjt:  GAPVVNIWEKSYFVAECGLCGGTTGAFRKSSDGQWVHAFCAEWVFESTFKRGQANPVGGMETVSKGVESCYICHRKHGVCLKCNYGHCQSTFHPSCGRSA

Query:  GCYMTVKTSGGKLQHRAYCEKHSSEQRA--------------------------------------KRDLVLCSHDVLAFKRDHVARSVLVRSPFFLPEV
        GCYMTVK+SGGKLQHRAYCEKHSSEQRA                                      KRDLVLCSHDVLAFKRDHVARSVLVRSPFFLPEV
Subjt:  GCYMTVKTSGGKLQHRAYCEKHSSEQRA--------------------------------------KRDLVLCSHDVLAFKRDHVARSVLVRSPFFLPEV

Query:  SSESATTSLKGHVEDLKSCSEAVQRSDDVTVDSTVSIKHWNKVPLSLDTEQKTDDDSTTSQNPFPRKFADRGQYAGKQIPQRSSTTTSRNLVDVGGLRFK
        SSESATTSLKGHVEDLKSCSEAVQRSDDVTVDSTVSIKHWNKVPLSLDTEQKTDDDSTTSQNPFPRKF DRGQYAGKQIPQRSSTTTSRNL+D GGLRFK
Subjt:  SSESATTSLKGHVEDLKSCSEAVQRSDDVTVDSTVSIKHWNKVPLSLDTEQKTDDDSTTSQNPFPRKFADRGQYAGKQIPQRSSTTTSRNLVDVGGLRFK

Query:  SRKHAETFQKELVMTSDQASMKNSLLPKQYLYVPADVLAKEKQVNQETGSAEPPKCDR
        S+KHAETFQKELVMTS+QASMKNSLLPKQYLYVPADVLAKEKQVNQETGSAEPPKCDR
Subjt:  SRKHAETFQKELVMTSDQASMKNSLLPKQYLYVPADVLAKEKQVNQETGSAEPPKCDR

XP_008460473.1 PREDICTED: uncharacterized protein LOC103499277 isoform X1 [Cucumis melo]0.0e+0080.17Show/hide
Query:  GTCRTSFHPICAREARHRMEVWAKYGCDNVELRAFCSKHSEGRDRSSVQDPLEAVNSSSYVVNHLPVTLSINRPQKLVGRRNIDSLLLCQEASDTNSGKL
        GTCRTSFHPICAREARHRMEVWAKYGCDNVELRAFCSKHS  RD+SSVQDP EA+NSSSYVVNHLPVTLSINRPQKLVGRRNIDSLLLC+EASDTNSGKL
Subjt:  GTCRTSFHPICAREARHRMEVWAKYGCDNVELRAFCSKHSEGRDRSSVQDPLEAVNSSSYVVNHLPVTLSINRPQKLVGRRNIDSLLLCQEASDTNSGKL

Query:  DDGELEDIGSADPSLNAACVDTQKSTVQGAGD---------------LIDQGKVNVKDVASEIGIPPDLLCAKLTAGNIVPDLKSKIVRWLRNHAYIGSL
        DDGELED GSADP+ NAACVD+QKSTVQG  D               LIDQGKVNVKDVA EIGIPPDLLCAKLTA NIVPDLKSKIVRWLRNHAYIGSL
Subjt:  DDGELEDIGSADPSLNAACVDTQKSTVQGAGD---------------LIDQGKVNVKDVASEIGIPPDLLCAKLTAGNIVPDLKSKIVRWLRNHAYIGSL

Query:  QKNLRVKLKSAVLAKAVVGAADRSNSLSVLDSDNSDLIADKMVTPRRKTKSNISHLKN-EIKSSSEEALGGHGLAMQSDALNQQACEEQGDSKKECIQDA
        QKNLRVKLKSAVLAKAVVGAADRS SL+V DSDNSDLIADKMV PRRKTKSN+SHLKN EIKSSSEE LGGHGL  QS+ L+QQACEEQ DS KECIQDA
Subjt:  QKNLRVKLKSAVLAKAVVGAADRSNSLSVLDSDNSDLIADKMVTPRRKTKSNISHLKN-EIKSSSEEALGGHGLAMQSDALNQQACEEQGDSKKECIQDA

Query:  GEKFSLLSDVTILSLTLTNSPEKPYGPEQTTKVALGSVIKCKEEASGKLMIGDENSSFSLLVEKHRTLGMPKQNLATFKQRWVHLNKCGSSQDSPSRNFP
        GEK                                                                                 +N+C SSQ SPSRNFP
Subjt:  GEKFSLLSDVTILSLTLTNSPEKPYGPEQTTKVALGSVIKCKEEASGKLMIGDENSSFSLLVEKHRTLGMPKQNLATFKQRWVHLNKCGSSQDSPSRNFP

Query:  ND-VEGDQLEVSVSGHNSSISAVHGKAGESPGSYFHPFVQEKMAYMLHGKLLNVSEGEMSCWQASSNVGGCCDHQHQHVDCNDASCNSGEFSPKQQVNKK
        N  VEG+QLE SVSGH+SSISAVHGKAGES GSY HPFV+EKM YMLHGKLLNV  GE+S  Q S + G C DHQHQH+DC D SCNSGEFSPKQQ NKK
Subjt:  ND-VEGDQLEVSVSGHNSSISAVHGKAGESPGSYFHPFVQEKMAYMLHGKLLNVSEGEMSCWQASSNVGGCCDHQHQHVDCNDASCNSGEFSPKQQVNKK

Query:  IGGVIKLSPEDEIEGEIIFYQHRLLANAVSRKRFTDHLICNVVKSLPKEIDEARSTRWDAVLINQYYSELREAKKQGKKERRHKEAQAVLAAATAAAAAS
        IGG+IK+SPEDEIEGEIIFYQHRLLANAVSRKRFTDHLICNVVKSLPKEIDEARSTRWDA+LINQYYS LREAKKQGKKERRHKEAQAVLAAATAAAAAS
Subjt:  IGGVIKLSPEDEIEGEIIFYQHRLLANAVSRKRFTDHLICNVVKSLPKEIDEARSTRWDAVLINQYYSELREAKKQGKKERRHKEAQAVLAAATAAAAAS

Query:  SRMSSFRKDVYEESTHRELMPRAKETLTKVALPKTSSESDFCKEHARSCDICRRPETILKPILVCSSCKVSVHLDCYRTVKESSGPWCCELCEELSLSRG
        SRMSSFRKDVYEESTHRELMPRAKET TKVALPKTS ESDFCKEHARSCDICRRPET+LKPILVCSSCKVSVHLDCYRTVKESSGPWCCELCEELSLSRG
Subjt:  SRMSSFRKDVYEESTHRELMPRAKETLTKVALPKTSSESDFCKEHARSCDICRRPETILKPILVCSSCKVSVHLDCYRTVKESSGPWCCELCEELSLSRG

Query:  SGAPVVNIWEKSYFVAECGLCGGTTGAFRKSSDGQWVHAFCAEWVFESTFKRGQANPVGGMETVSKGVESCYICHRKHGVCLKCNYGHCQSTFHPSCGRS
        SGAPVVN WEKSYFVAECGLCGGTTGAFRKSSDGQWVHAFCAEWVFESTFKRGQ NPVGGMETVSKG +SCYICHRKHGVCLKCNYGHCQSTFHPSC RS
Subjt:  SGAPVVNIWEKSYFVAECGLCGGTTGAFRKSSDGQWVHAFCAEWVFESTFKRGQANPVGGMETVSKGVESCYICHRKHGVCLKCNYGHCQSTFHPSCGRS

Query:  AGCYMTVKTSGGKLQHRAYCEKHSSEQRA--------------------------------------KRDLVLCSHDVLAFKRDHVARSVLVRSPFFLPE
        AGCYMTVK+SGGKLQHRAYCEKHSSEQRA                                      KRDLVLCSHDVLAFKRDHVARSVLVRSPFFLPE
Subjt:  AGCYMTVKTSGGKLQHRAYCEKHSSEQRA--------------------------------------KRDLVLCSHDVLAFKRDHVARSVLVRSPFFLPE

Query:  VSSESATTSLKGHVEDLKSCSEAVQRSDDVTVDSTVSIKHWNKVPLSLDTEQKTDDDSTTSQNPFPRKFADRGQYAGKQIPQRSSTTTSRNLVDVGGLRF
        VSSESATTSLKGHVEDLKSCSEAVQRSDDVTVDSTVSIKHWNKVPLSLDTEQKTD+DSTTSQNPFPRKFADR  YAGKQIPQRSSTTTSRNL+D GGLRF
Subjt:  VSSESATTSLKGHVEDLKSCSEAVQRSDDVTVDSTVSIKHWNKVPLSLDTEQKTDDDSTTSQNPFPRKFADRGQYAGKQIPQRSSTTTSRNLVDVGGLRF

Query:  KSRKHAETFQKELVMTSDQASMKNSLLPKQYLYVPADVLAKEKQVNQETGSAEPPKCDR
        KSRKHAETFQKELVMTSDQASMKNSLLPKQYLYVPADVLAKEKQ NQE GSAEPPKCDR
Subjt:  KSRKHAETFQKELVMTSDQASMKNSLLPKQYLYVPADVLAKEKQVNQETGSAEPPKCDR

XP_008460474.1 PREDICTED: uncharacterized protein LOC103499277 isoform X2 [Cucumis melo]0.0e+0080.25Show/hide
Query:  GTCRTSFHPICAREARHRMEVWAKYGCDNVELRAFCSKHSEGRDRSSVQDPLEAVNSSSYVVNHLPVTLSINRPQKLVGRRNIDSLLLCQEASDTNSGKL
        GTCRTSFHPICAREARHRMEVWAKYGCDNVELRAFCSKHS  RD+SSVQDP EA+NSSSYVVNHLPVTLSINRPQKLVGRRNIDSLLLC+EASDTNSGKL
Subjt:  GTCRTSFHPICAREARHRMEVWAKYGCDNVELRAFCSKHSEGRDRSSVQDPLEAVNSSSYVVNHLPVTLSINRPQKLVGRRNIDSLLLCQEASDTNSGKL

Query:  DDGELEDIGSADPSLNAACVDTQKSTVQGAGD---------------LIDQGKVNVKDVASEIGIPPDLLCAKLTAGNIVPDLKSKIVRWLRNHAYIGSL
        DDGELED GSADP+ NAACVD+QKSTVQG  D               LIDQGKVNVKDVA EIGIPPDLLCAKLTA NIVPDLKSKIVRWLRNHAYIGSL
Subjt:  DDGELEDIGSADPSLNAACVDTQKSTVQGAGD---------------LIDQGKVNVKDVASEIGIPPDLLCAKLTAGNIVPDLKSKIVRWLRNHAYIGSL

Query:  QKNLRVKLKSAVLAKAVVGAADRSNSLSVLDSDNSDLIADKMVTPRRKTKSNISHLKN-EIKSSSEEALGGHGLAMQSDALNQQACEEQGDSKKECIQDA
        QKNLRVKLKSAVLAKAVVGAADRS SL+V DSDNSDLIADKMV PRRKTKSN+SHLKN EIKSSSEE LGGHGL  QS+ L+QQACEEQ DS KECIQDA
Subjt:  QKNLRVKLKSAVLAKAVVGAADRSNSLSVLDSDNSDLIADKMVTPRRKTKSNISHLKN-EIKSSSEEALGGHGLAMQSDALNQQACEEQGDSKKECIQDA

Query:  GEKFSLLSDVTILSLTLTNSPEKPYGPEQTTKVALGSVIKCKEEASGKLMIGDENSSFSLLVEKHRTLGMPKQNLATFKQRWVHLNKCGSSQDSPSRNFP
        GEK                                                                                 +N+C SSQ SPSRNFP
Subjt:  GEKFSLLSDVTILSLTLTNSPEKPYGPEQTTKVALGSVIKCKEEASGKLMIGDENSSFSLLVEKHRTLGMPKQNLATFKQRWVHLNKCGSSQDSPSRNFP

Query:  NDVEGDQLEVSVSGHNSSISAVHGKAGESPGSYFHPFVQEKMAYMLHGKLLNVSEGEMSCWQASSNVGGCCDHQHQHVDCNDASCNSGEFSPKQQVNKKI
        N VEG+QLE SVSGH+SSISAVHGKAGES GSY HPFV+EKM YMLHGKLLNV  GE+S  Q S + G C DHQHQH+DC D SCNSGEFSPKQQ NKKI
Subjt:  NDVEGDQLEVSVSGHNSSISAVHGKAGESPGSYFHPFVQEKMAYMLHGKLLNVSEGEMSCWQASSNVGGCCDHQHQHVDCNDASCNSGEFSPKQQVNKKI

Query:  GGVIKLSPEDEIEGEIIFYQHRLLANAVSRKRFTDHLICNVVKSLPKEIDEARSTRWDAVLINQYYSELREAKKQGKKERRHKEAQAVLAAATAAAAASS
        GG+IK+SPEDEIEGEIIFYQHRLLANAVSRKRFTDHLICNVVKSLPKEIDEARSTRWDA+LINQYYS LREAKKQGKKERRHKEAQAVLAAATAAAAASS
Subjt:  GGVIKLSPEDEIEGEIIFYQHRLLANAVSRKRFTDHLICNVVKSLPKEIDEARSTRWDAVLINQYYSELREAKKQGKKERRHKEAQAVLAAATAAAAASS

Query:  RMSSFRKDVYEESTHRELMPRAKETLTKVALPKTSSESDFCKEHARSCDICRRPETILKPILVCSSCKVSVHLDCYRTVKESSGPWCCELCEELSLSRGS
        RMSSFRKDVYEESTHRELMPRAKET TKVALPKTS ESDFCKEHARSCDICRRPET+LKPILVCSSCKVSVHLDCYRTVKESSGPWCCELCEELSLSRGS
Subjt:  RMSSFRKDVYEESTHRELMPRAKETLTKVALPKTSSESDFCKEHARSCDICRRPETILKPILVCSSCKVSVHLDCYRTVKESSGPWCCELCEELSLSRGS

Query:  GAPVVNIWEKSYFVAECGLCGGTTGAFRKSSDGQWVHAFCAEWVFESTFKRGQANPVGGMETVSKGVESCYICHRKHGVCLKCNYGHCQSTFHPSCGRSA
        GAPVVN WEKSYFVAECGLCGGTTGAFRKSSDGQWVHAFCAEWVFESTFKRGQ NPVGGMETVSKG +SCYICHRKHGVCLKCNYGHCQSTFHPSC RSA
Subjt:  GAPVVNIWEKSYFVAECGLCGGTTGAFRKSSDGQWVHAFCAEWVFESTFKRGQANPVGGMETVSKGVESCYICHRKHGVCLKCNYGHCQSTFHPSCGRSA

Query:  GCYMTVKTSGGKLQHRAYCEKHSSEQRA--------------------------------------KRDLVLCSHDVLAFKRDHVARSVLVRSPFFLPEV
        GCYMTVK+SGGKLQHRAYCEKHSSEQRA                                      KRDLVLCSHDVLAFKRDHVARSVLVRSPFFLPEV
Subjt:  GCYMTVKTSGGKLQHRAYCEKHSSEQRA--------------------------------------KRDLVLCSHDVLAFKRDHVARSVLVRSPFFLPEV

Query:  SSESATTSLKGHVEDLKSCSEAVQRSDDVTVDSTVSIKHWNKVPLSLDTEQKTDDDSTTSQNPFPRKFADRGQYAGKQIPQRSSTTTSRNLVDVGGLRFK
        SSESATTSLKGHVEDLKSCSEAVQRSDDVTVDSTVSIKHWNKVPLSLDTEQKTD+DSTTSQNPFPRKFADR  YAGKQIPQRSSTTTSRNL+D GGLRFK
Subjt:  SSESATTSLKGHVEDLKSCSEAVQRSDDVTVDSTVSIKHWNKVPLSLDTEQKTDDDSTTSQNPFPRKFADRGQYAGKQIPQRSSTTTSRNLVDVGGLRFK

Query:  SRKHAETFQKELVMTSDQASMKNSLLPKQYLYVPADVLAKEKQVNQETGSAEPPKCDR
        SRKHAETFQKELVMTSDQASMKNSLLPKQYLYVPADVLAKEKQ NQE GSAEPPKCDR
Subjt:  SRKHAETFQKELVMTSDQASMKNSLLPKQYLYVPADVLAKEKQVNQETGSAEPPKCDR

XP_011655203.1 uncharacterized protein LOC101212864 isoform X2 [Cucumis sativus]0.0e+0080.45Show/hide
Query:  GTCRTSFHPICAREARHRMEVWAKYGCDNVELRAFCSKHSEGRDRSSVQDPLEAVNSSSYVVNHLPVTLSINRPQKLVGRRNIDSLLLCQEASDTNSGKL
        GTCRTSFHPICAREA HRMEVWAKYGCDNVELRAFCSKHSE RDRSS QDP EA+NSSSYVVNHLPVTLSINRP KLVGRRNIDSLLLC+EASDTNSGKL
Subjt:  GTCRTSFHPICAREARHRMEVWAKYGCDNVELRAFCSKHSEGRDRSSVQDPLEAVNSSSYVVNHLPVTLSINRPQKLVGRRNIDSLLLCQEASDTNSGKL

Query:  DDGELEDIGSADPSLNAACVDTQKSTVQGAGD---------------LIDQGKVNVKDVASEIGIPPDLLCAKLTAGNIVPDLKSKIVRWLRNHAYIGSL
        DDGELED GSADP+LNAACVD QKSTVQG  D               LIDQGKVNVKDVA EIGIPPDLLCAKLTA NIVPDLKSKIVRWLRNHAYIGSL
Subjt:  DDGELEDIGSADPSLNAACVDTQKSTVQGAGD---------------LIDQGKVNVKDVASEIGIPPDLLCAKLTAGNIVPDLKSKIVRWLRNHAYIGSL

Query:  QKNLRVKLKSAVLAKAVVGAADRSNSLSVLDSDNSDLIADKMVTPRRKTKSNISHLKN-EIKSSSEEALGGHGLAMQSDALNQQACEEQGDSKKECIQDA
        QKNLRVKLKSAVLAKAVVGAADRS SLSVLDSDNSDLIADKMVTPRRKTK++ISHLKN EIKSSSEE LG +GL  QS++L+QQ      DSKKECIQDA
Subjt:  QKNLRVKLKSAVLAKAVVGAADRSNSLSVLDSDNSDLIADKMVTPRRKTKSNISHLKN-EIKSSSEEALGGHGLAMQSDALNQQACEEQGDSKKECIQDA

Query:  GEKFSLLSDVTILSLTLTNSPEKPYGPEQTTKVALGSVIKCKEEASGKLMIGDENSSFSLLVEKHRTLGMPKQNLATFKQRWVHLNKCGSSQDSPSRNFP
        GEK                                                                                H+N+C SSQ SPSRNFP
Subjt:  GEKFSLLSDVTILSLTLTNSPEKPYGPEQTTKVALGSVIKCKEEASGKLMIGDENSSFSLLVEKHRTLGMPKQNLATFKQRWVHLNKCGSSQDSPSRNFP

Query:  ND-VEGDQLEVSVSGHNSSISAVHGKAGESPGSYFHPFVQEKMAYMLHGKLLNVSEGEMSCWQASSNVGGCCDHQHQHVDCNDASCNSGEFSPKQQVNKK
        N  VEG+QLE SVSGH+SSISAVHGKAGESPGSY HPFV+ KM YMLHGKLLNV EGE+SC Q SSN G C D QHQH+DCN+ SCNSG FSPKQQVNKK
Subjt:  ND-VEGDQLEVSVSGHNSSISAVHGKAGESPGSYFHPFVQEKMAYMLHGKLLNVSEGEMSCWQASSNVGGCCDHQHQHVDCNDASCNSGEFSPKQQVNKK

Query:  IGGVIKLSPEDEIEGEIIFYQHRLLANAVSRKRFTDHLICNVVKSLPKEIDEARSTRWDAVLINQYYSELREAKKQGKKERRHKEAQAVLAAATAAAAAS
        I G+IK+SPEDEIEGEIIFYQHRLLANAVSRK FTDHLICNVVKSLPKEIDEARSTRWDA+LINQYYS LREAKKQGKKERRHKEAQAVLAAATAAAAAS
Subjt:  IGGVIKLSPEDEIEGEIIFYQHRLLANAVSRKRFTDHLICNVVKSLPKEIDEARSTRWDAVLINQYYSELREAKKQGKKERRHKEAQAVLAAATAAAAAS

Query:  SRMSSFRKDVYEESTHRELMPRAKETLTKVALPKTSSESDFCKEHARSCDICRRPETILKPILVCSSCKVSVHLDCYRTVKESSGPWCCELCEELSLSRG
        SRMSSFRKDVYEESTHRELMPRAKET TKVALPKTS ESDFCKEHARSCDICRRPETILKPILVCSSCKVSVHLDCYRTVKESSGPWCCELCEELSLSRG
Subjt:  SRMSSFRKDVYEESTHRELMPRAKETLTKVALPKTSSESDFCKEHARSCDICRRPETILKPILVCSSCKVSVHLDCYRTVKESSGPWCCELCEELSLSRG

Query:  SGAPVVNIWEKSYFVAECGLCGGTTGAFRKSSDGQWVHAFCAEWVFESTFKRGQANPVGGMETVSKGVESCYICHRKHGVCLKCNYGHCQSTFHPSCGRS
        SGAPVVN WEKSYFVAECGLCGGTTGAFRKSSDGQWVHAFCAEWVFESTFKRGQANPVGGMETVSKG +SCYICHRKHGVCLKCNYGHCQSTFHPSCGRS
Subjt:  SGAPVVNIWEKSYFVAECGLCGGTTGAFRKSSDGQWVHAFCAEWVFESTFKRGQANPVGGMETVSKGVESCYICHRKHGVCLKCNYGHCQSTFHPSCGRS

Query:  AGCYMTVKTSGGKLQHRAYCEKHSSEQRA--------------------------------------KRDLVLCSHDVLAFKRDHVARSVLVRSPFFLPE
        AGCYMTVK+SGGKLQHRAYCEKHSSEQRA                                      KRDLVLCSHDVLAFKRDHVARSVLVRSPFFLPE
Subjt:  AGCYMTVKTSGGKLQHRAYCEKHSSEQRA--------------------------------------KRDLVLCSHDVLAFKRDHVARSVLVRSPFFLPE

Query:  VSSESATTSLKGHVEDLKSCSEAVQRSDDVTVDSTVSIKHWNKVPLSLDTEQKTDDDSTTSQNPFPRKFADRGQYAGKQIPQRSSTTTSRNLVDVGGLRF
        VSSESATTSLKGHVEDLKSCSEAVQRSDDVTVDSTVSIKHWNKVPLSLDTEQKTDDDSTTSQNPFPRKF DRGQYAGKQIPQRSSTTTSRNL+D GGLRF
Subjt:  VSSESATTSLKGHVEDLKSCSEAVQRSDDVTVDSTVSIKHWNKVPLSLDTEQKTDDDSTTSQNPFPRKFADRGQYAGKQIPQRSSTTTSRNLVDVGGLRF

Query:  KSRKHAETFQKELVMTSDQASMKNSLLPKQYLYVPADVLAKEKQVNQETGSAEPPKCDR
        KS+KHAETFQKELVMTS+QASMKNSLLPKQYLYVPADVLAKEKQVNQETGSAEPPKCDR
Subjt:  KSRKHAETFQKELVMTSDQASMKNSLLPKQYLYVPADVLAKEKQVNQETGSAEPPKCDR

XP_038875394.1 uncharacterized protein LOC120067860 isoform X2 [Benincasa hispida]0.0e+0080.55Show/hide
Query:  GTCRTSFHPICAREARHRMEVWAKYGCDNVELRAFCSKHSEGRDRSSVQDPLEAVNSSSYVVNHLPVTLSINRPQKLVGRRNIDSLLLCQEASDTNSGKL
        GTCRTSFHPICAREARHRMEVWAKYGCDNVELRAFCSKHSE RD+SS QD   AVNSSSYVVNHLPVTLSINRPQKLVGRRNIDSLLL +EASDTNSGKL
Subjt:  GTCRTSFHPICAREARHRMEVWAKYGCDNVELRAFCSKHSEGRDRSSVQDPLEAVNSSSYVVNHLPVTLSINRPQKLVGRRNIDSLLLCQEASDTNSGKL

Query:  DDGELEDIGSADPSLNAACVDTQKSTVQGAGD---------------LIDQGKVNVKDVASEIGIPPDLLCAKLTAGNIVPDLKSKIVRWLRNHAYIGSL
        DDGELEDIGSADPSLNAACV TQKSTVQG  D               LIDQGKVNVKDVASEIGIPPDLLCAKLTA N+VPDLKSKIVRWLRNHAYIGSL
Subjt:  DDGELEDIGSADPSLNAACVDTQKSTVQGAGD---------------LIDQGKVNVKDVASEIGIPPDLLCAKLTAGNIVPDLKSKIVRWLRNHAYIGSL

Query:  QKNLRVKLKSAVLAKAVVGAADRSNSLSVLDSDNSDLIADKMVTPRRKTKSNISHLKN-EIKSSSEEALGGHGLAMQSDALNQQACEEQGDSKKECIQDA
        QKNLRVKLKSAVLA+AVVGAADRS+SLSVLDSDNSDLIADKMVTP+RK KSNISHLKN EIK SSE   GGHGLA+QSD L+ Q CEEQGDSKKEC QD 
Subjt:  QKNLRVKLKSAVLAKAVVGAADRSNSLSVLDSDNSDLIADKMVTPRRKTKSNISHLKN-EIKSSSEEALGGHGLAMQSDALNQQACEEQGDSKKECIQDA

Query:  GEKFSLLSDVTILSLTLTNSPEKPYGPEQTTKVALGSVIKCKEEASGKLMIGDENSSFSLLVEKHRTLGMPKQNLATFKQRWVHLNKCGSSQDSPSRNFP
        GEK                                                                                HLN+C     SPSRNFP
Subjt:  GEKFSLLSDVTILSLTLTNSPEKPYGPEQTTKVALGSVIKCKEEASGKLMIGDENSSFSLLVEKHRTLGMPKQNLATFKQRWVHLNKCGSSQDSPSRNFP

Query:  NDVEGDQLEVSVSGHNSSISAVHGKAGESPGSYFHPFVQEKMAYMLHGKLLNVSEGEMSCWQASSNVGGCCDHQHQHVDCNDASCNSGEFSPKQQVNKKI
        ND+ GDQL+VSVSGHNSSI AVHGKAGESPG YFHPFVQEKMAYMLHGKLLNVSEGE SC QASSN  GCCDHQHQH+DCND SCNSG FSPKQQVNKKI
Subjt:  NDVEGDQLEVSVSGHNSSISAVHGKAGESPGSYFHPFVQEKMAYMLHGKLLNVSEGEMSCWQASSNVGGCCDHQHQHVDCNDASCNSGEFSPKQQVNKKI

Query:  GGVIKLSPEDEIEGEIIFYQHRLLANAVSRKRFTDHLICNVVKSLPKEIDEARSTRWDAVLINQYYSELREAKKQGKKERRHKEAQAVLAAATAAAAASS
         G+IKLSPEDEIEGEIIFYQHRLLANAVSRK+F DHLICNVVKSLPKE+D+ARSTRWDAVLINQYYSELREAKKQGKKERRHKEAQAVLAAATAAAAASS
Subjt:  GGVIKLSPEDEIEGEIIFYQHRLLANAVSRKRFTDHLICNVVKSLPKEIDEARSTRWDAVLINQYYSELREAKKQGKKERRHKEAQAVLAAATAAAAASS

Query:  RMSSFRKDVYEESTHRELMPRAKETLTKVALPKTSSESDFCKEHARSCDICRRPETILKPILVCSSCKVSVHLDCYRTVKESSGPWCCELCEELSLSRGS
        RMSSFRKD+YEES HRELMPRAK+TLTKVAL KTS ESD CKEH RSCDICRRPETILKPILVCSSCKVSVHLDCYRTVKESSGPWCCELCEELSLSRGS
Subjt:  RMSSFRKDVYEESTHRELMPRAKETLTKVALPKTSSESDFCKEHARSCDICRRPETILKPILVCSSCKVSVHLDCYRTVKESSGPWCCELCEELSLSRGS

Query:  GAPVVNIWEKSYFVAECGLCGGTTGAFRKSSDGQWVHAFCAEWVFESTFKRGQANPVGGMETVSKGVESCYICHRKHGVCLKCNYGHCQSTFHPSCGRSA
        GA VVN W+KSYFVAECGLCGGTTGAFRKSSDGQWVHAFCAEWVFESTFKRGQANPVGGMETVSKGV+SCYICHRKHGVCLKCNYGHCQSTFHPSCGR+A
Subjt:  GAPVVNIWEKSYFVAECGLCGGTTGAFRKSSDGQWVHAFCAEWVFESTFKRGQANPVGGMETVSKGVESCYICHRKHGVCLKCNYGHCQSTFHPSCGRSA

Query:  GCYMTVKTSGGKLQHRAYCEKHSSEQRA--------------------------------------KRDLVLCSHDVLAFKRDHVARSVLVRSPFFLPEV
        GCYMTVK+SGGKLQHRAYCEKHSSEQRA                                      KRDLVLCSHDVLAFKRDHVARSVLVRSPFFLPEV
Subjt:  GCYMTVKTSGGKLQHRAYCEKHSSEQRA--------------------------------------KRDLVLCSHDVLAFKRDHVARSVLVRSPFFLPEV

Query:  SSESATTSLKGHVEDLKSCSEAVQRSDDVTVDSTVSIKHWNKVPLSLDTEQKTDDDSTTSQNPFPRKFADRGQYAGKQIPQRSSTTTSRNLVDVGGL-RF
        SSESATTSLKGHVEDLKSCSEAVQRSDDVTVDSTVSIK WNKVPLSLDTEQKTDDDSTTSQNPF RKFADRGQYAGKQIPQRSSTT SRNLVDVGGL R 
Subjt:  SSESATTSLKGHVEDLKSCSEAVQRSDDVTVDSTVSIKHWNKVPLSLDTEQKTDDDSTTSQNPFPRKFADRGQYAGKQIPQRSSTTTSRNLVDVGGL-RF

Query:  KSRKHAETFQKELVMTSDQASMKNSLLPKQYLYVPADVLAKEKQVNQETGSAEPPKCDR
        KSRKHAETFQKELVMTSDQASMKNSLLPKQYLYVPADVLAKEKQVNQET SAE PKCDR
Subjt:  KSRKHAETFQKELVMTSDQASMKNSLLPKQYLYVPADVLAKEKQVNQETGSAEPPKCDR

TrEMBL top hitse value%identityAlignment
A0A0A0KTJ7 Uncharacterized protein0.0e+0080.21Show/hide
Query:  MEVWAKYGCDNVELRAFCSKHSEGRDRSSVQDPLEAVNSSSYVVNHLPVTLSINRPQKLVGRRNIDSLLLCQEASDTNSGKLDDGELEDIGSADPSLNAA
        MEVWAKYGCDNVELRAFCSKHSE RDRSS QDP EA+NSSSYVVNHLPVTLSINRP KLVGRRNIDSLLLC+EASDTNSGKLDDGELED GSADP+LNAA
Subjt:  MEVWAKYGCDNVELRAFCSKHSEGRDRSSVQDPLEAVNSSSYVVNHLPVTLSINRPQKLVGRRNIDSLLLCQEASDTNSGKLDDGELEDIGSADPSLNAA

Query:  CVDTQKSTVQGAGD---------------LIDQGKVNVKDVASEIGIPPDLLCAKLTAGNIVPDLKSKIVRWLRNHAYIGSLQKNLRVKLKSAVLAKAVV
        CVD QKSTVQG  D               LIDQGKVNVKDVA EIGIPPDLLCAKLTA NIVPDLKSKIVRWLRNHAYIGSLQKNLRVKLKSAVLAKAVV
Subjt:  CVDTQKSTVQGAGD---------------LIDQGKVNVKDVASEIGIPPDLLCAKLTAGNIVPDLKSKIVRWLRNHAYIGSLQKNLRVKLKSAVLAKAVV

Query:  GAADRSNSLSVLDSDNSDLIADKMVTPRRKTKSNISHLKN-EIKSSSEEALGGHGLAMQSDALNQQACEEQGDSKKECIQDAGEKFSLLSDVTILSLTLT
        GAADRS SLSVLDSDNSDLIADKMVTPRRKTK++ISHLKN EIKSSSEE LG +GL  QS++L+QQ      DSKKECIQDAGEK               
Subjt:  GAADRSNSLSVLDSDNSDLIADKMVTPRRKTKSNISHLKN-EIKSSSEEALGGHGLAMQSDALNQQACEEQGDSKKECIQDAGEKFSLLSDVTILSLTLT

Query:  NSPEKPYGPEQTTKVALGSVIKCKEEASGKLMIGDENSSFSLLVEKHRTLGMPKQNLATFKQRWVHLNKCGSSQDSPSRNFPND-VEGDQLEVSVSGHNS
                                                                         H+N+C SSQ SPSRNFPN  VEG+QLE SVSGH+S
Subjt:  NSPEKPYGPEQTTKVALGSVIKCKEEASGKLMIGDENSSFSLLVEKHRTLGMPKQNLATFKQRWVHLNKCGSSQDSPSRNFPND-VEGDQLEVSVSGHNS

Query:  SISAVHGKAGESPGSYFHPFVQEKMAYMLHGKLLNVSEGEMSCWQASSNVGGCCDHQHQHVDCNDASCNSGEFSPKQQVNKKIGGVIKLSPEDEIEGEII
        SISAVHGKAGESPGSY HPFV+ KM YMLHGKLLNV EGE+SC Q SSN G C D QHQH+DCN+ SCNSG FSPKQQVNKKI G+IK+SPEDEIEGEII
Subjt:  SISAVHGKAGESPGSYFHPFVQEKMAYMLHGKLLNVSEGEMSCWQASSNVGGCCDHQHQHVDCNDASCNSGEFSPKQQVNKKIGGVIKLSPEDEIEGEII

Query:  FYQHRLLANAVSRKRFTDHLICNVVKSLPKEIDEARSTRWDAVLINQYYSELREAKKQGKKERRHKEAQAVLAAATAAAAASSRMSSFRKDVYEESTHRE
        FYQHRLLANAVSRK FTDHLICNVVKSLPKEIDEARSTRWDA+LINQYYS LREAKKQGKKERRHKEAQAVLAAATAAAAASSRMSSFRKDVYEESTHRE
Subjt:  FYQHRLLANAVSRKRFTDHLICNVVKSLPKEIDEARSTRWDAVLINQYYSELREAKKQGKKERRHKEAQAVLAAATAAAAASSRMSSFRKDVYEESTHRE

Query:  LMPRAKETLTKVALPKTSSESDFCKEHARSCDICRRPETILKPILVCSSCKVSVHLDCYRTVKESSGPWCCELCEELSLSRGSGAPVVNIWEKSYFVAEC
        LMPRAKET TKVALPKTS ESDFCKEHARSCDICRRPETILKPILVCSSCKVSVHLDCYRTVKESSGPWCCELCEELSLSRGSGAPVVN WEKSYFVAEC
Subjt:  LMPRAKETLTKVALPKTSSESDFCKEHARSCDICRRPETILKPILVCSSCKVSVHLDCYRTVKESSGPWCCELCEELSLSRGSGAPVVNIWEKSYFVAEC

Query:  GLCGGTTGAFRKSSDGQWVHAFCAEWVFESTFKRGQANPVGGMETVSKGVESCYICHRKHGVCLKCNYGHCQSTFHPSCGRSAGCYMTVKTSGGKLQHRA
        GLCGGTTGAFRKSSDGQWVHAFCAEWVFESTFKRGQANPVGGMETVSKG +SCYICHRKHGVCLKCNYGHCQSTFHPSCGRSAGCYMTVK+SGGKLQHRA
Subjt:  GLCGGTTGAFRKSSDGQWVHAFCAEWVFESTFKRGQANPVGGMETVSKGVESCYICHRKHGVCLKCNYGHCQSTFHPSCGRSAGCYMTVKTSGGKLQHRA

Query:  YCEKHSSEQRA--------------------------------------KRDLVLCSHDVLAFKRDHVARSVLVRSPFFLPEVSSESATTSLKGHVEDLK
        YCEKHSSEQRA                                      KRDLVLCSHDVLAFKRDHVARSVLVRSPFFLPEVSSESATTSLKGHVEDLK
Subjt:  YCEKHSSEQRA--------------------------------------KRDLVLCSHDVLAFKRDHVARSVLVRSPFFLPEVSSESATTSLKGHVEDLK

Query:  SCSEAVQRSDDVTVDSTVSIKHWNKVPLSLDTEQKTDDDSTTSQNPFPRKFADRGQYAGKQIPQRSSTTTSRNLVDVGGLRFKSRKHAETFQKELVMTSD
        SCSEAVQRSDDVTVDSTVSIKHWNKVPLSLDTEQKTDDDSTTSQNPFPRKF DRGQYAGKQIPQRSSTTTSRNL+D GGLRFKS+KHAETFQKELVMTS+
Subjt:  SCSEAVQRSDDVTVDSTVSIKHWNKVPLSLDTEQKTDDDSTTSQNPFPRKFADRGQYAGKQIPQRSSTTTSRNLVDVGGLRFKSRKHAETFQKELVMTSD

Query:  QASMKNSLLPKQYLYVPADVLAKEKQVNQETGSAEPPKCDR
        QASMKNSLLPKQYLYVPADVLAKEKQVNQETGSAEPPKCDR
Subjt:  QASMKNSLLPKQYLYVPADVLAKEKQVNQETGSAEPPKCDR

A0A1S3CC47 uncharacterized protein LOC103499277 isoform X30.0e+0079.83Show/hide
Query:  MEVWAKYGCDNVELRAFCSKHSEGRDRSSVQDPLEAVNSSSYVVNHLPVTLSINRPQKLVGRRNIDSLLLCQEASDTNSGKLDDGELEDIGSADPSLNAA
        MEVWAKYGCDNVELRAFCSKHS  RD+SSVQDP EA+NSSSYVVNHLPVTLSINRPQKLVGRRNIDSLLLC+EASDTNSGKLDDGELED GSADP+ NAA
Subjt:  MEVWAKYGCDNVELRAFCSKHSEGRDRSSVQDPLEAVNSSSYVVNHLPVTLSINRPQKLVGRRNIDSLLLCQEASDTNSGKLDDGELEDIGSADPSLNAA

Query:  CVDTQKSTVQGAGD---------------LIDQGKVNVKDVASEIGIPPDLLCAKLTAGNIVPDLKSKIVRWLRNHAYIGSLQKNLRVKLKSAVLAKAVV
        CVD+QKSTVQG  D               LIDQGKVNVKDVA EIGIPPDLLCAKLTA NIVPDLKSKIVRWLRNHAYIGSLQKNLRVKLKSAVLAKAVV
Subjt:  CVDTQKSTVQGAGD---------------LIDQGKVNVKDVASEIGIPPDLLCAKLTAGNIVPDLKSKIVRWLRNHAYIGSLQKNLRVKLKSAVLAKAVV

Query:  GAADRSNSLSVLDSDNSDLIADKMVTPRRKTKSNISHLKN-EIKSSSEEALGGHGLAMQSDALNQQACEEQGDSKKECIQDAGEKFSLLSDVTILSLTLT
        GAADRS SL+V DSDNSDLIADKMV PRRKTKSN+SHLKN EIKSSSEE LGGHGL  QS+ L+QQACEEQ DS KECIQDAGEK               
Subjt:  GAADRSNSLSVLDSDNSDLIADKMVTPRRKTKSNISHLKN-EIKSSSEEALGGHGLAMQSDALNQQACEEQGDSKKECIQDAGEKFSLLSDVTILSLTLT

Query:  NSPEKPYGPEQTTKVALGSVIKCKEEASGKLMIGDENSSFSLLVEKHRTLGMPKQNLATFKQRWVHLNKCGSSQDSPSRNFPND-VEGDQLEVSVSGHNS
                                                                          +N+C SSQ SPSRNFPN  VEG+QLE SVSGH+S
Subjt:  NSPEKPYGPEQTTKVALGSVIKCKEEASGKLMIGDENSSFSLLVEKHRTLGMPKQNLATFKQRWVHLNKCGSSQDSPSRNFPND-VEGDQLEVSVSGHNS

Query:  SISAVHGKAGESPGSYFHPFVQEKMAYMLHGKLLNVSEGEMSCWQASSNVGGCCDHQHQHVDCNDASCNSGEFSPKQQVNKKIGGVIKLSPEDEIEGEII
        SISAVHGKAGES GSY HPFV+EKM YMLHGKLLNV  GE+S  Q S + G C DHQHQH+DC D SCNSGEFSPKQQ NKKIGG+IK+SPEDEIEGEII
Subjt:  SISAVHGKAGESPGSYFHPFVQEKMAYMLHGKLLNVSEGEMSCWQASSNVGGCCDHQHQHVDCNDASCNSGEFSPKQQVNKKIGGVIKLSPEDEIEGEII

Query:  FYQHRLLANAVSRKRFTDHLICNVVKSLPKEIDEARSTRWDAVLINQYYSELREAKKQGKKERRHKEAQAVLAAATAAAAASSRMSSFRKDVYEESTHRE
        FYQHRLLANAVSRKRFTDHLICNVVKSLPKEIDEARSTRWDA+LINQYYS LREAKKQGKKERRHKEAQAVLAAATAAAAASSRMSSFRKDVYEESTHRE
Subjt:  FYQHRLLANAVSRKRFTDHLICNVVKSLPKEIDEARSTRWDAVLINQYYSELREAKKQGKKERRHKEAQAVLAAATAAAAASSRMSSFRKDVYEESTHRE

Query:  LMPRAKETLTKVALPKTSSESDFCKEHARSCDICRRPETILKPILVCSSCKVSVHLDCYRTVKESSGPWCCELCEELSLSRGSGAPVVNIWEKSYFVAEC
        LMPRAKET TKVALPKTS ESDFCKEHARSCDICRRPET+LKPILVCSSCKVSVHLDCYRTVKESSGPWCCELCEELSLSRGSGAPVVN WEKSYFVAEC
Subjt:  LMPRAKETLTKVALPKTSSESDFCKEHARSCDICRRPETILKPILVCSSCKVSVHLDCYRTVKESSGPWCCELCEELSLSRGSGAPVVNIWEKSYFVAEC

Query:  GLCGGTTGAFRKSSDGQWVHAFCAEWVFESTFKRGQANPVGGMETVSKGVESCYICHRKHGVCLKCNYGHCQSTFHPSCGRSAGCYMTVKTSGGKLQHRA
        GLCGGTTGAFRKSSDGQWVHAFCAEWVFESTFKRGQ NPVGGMETVSKG +SCYICHRKHGVCLKCNYGHCQSTFHPSC RSAGCYMTVK+SGGKLQHRA
Subjt:  GLCGGTTGAFRKSSDGQWVHAFCAEWVFESTFKRGQANPVGGMETVSKGVESCYICHRKHGVCLKCNYGHCQSTFHPSCGRSAGCYMTVKTSGGKLQHRA

Query:  YCEKHSSEQRA--------------------------------------KRDLVLCSHDVLAFKRDHVARSVLVRSPFFLPEVSSESATTSLKGHVEDLK
        YCEKHSSEQRA                                      KRDLVLCSHDVLAFKRDHVARSVLVRSPFFLPEVSSESATTSLKGHVEDLK
Subjt:  YCEKHSSEQRA--------------------------------------KRDLVLCSHDVLAFKRDHVARSVLVRSPFFLPEVSSESATTSLKGHVEDLK

Query:  SCSEAVQRSDDVTVDSTVSIKHWNKVPLSLDTEQKTDDDSTTSQNPFPRKFADRGQYAGKQIPQRSSTTTSRNLVDVGGLRFKSRKHAETFQKELVMTSD
        SCSEAVQRSDDVTVDSTVSIKHWNKVPLSLDTEQKTD+DSTTSQNPFPRKFADR  YAGKQIPQRSSTTTSRNL+D GGLRFKSRKHAETFQKELVMTSD
Subjt:  SCSEAVQRSDDVTVDSTVSIKHWNKVPLSLDTEQKTDDDSTTSQNPFPRKFADRGQYAGKQIPQRSSTTTSRNLVDVGGLRFKSRKHAETFQKELVMTSD

Query:  QASMKNSLLPKQYLYVPADVLAKEKQVNQETGSAEPPKCDR
        QASMKNSLLPKQYLYVPADVLAKEKQ NQE GSAEPPKCDR
Subjt:  QASMKNSLLPKQYLYVPADVLAKEKQVNQETGSAEPPKCDR

A0A1S3CD10 uncharacterized protein LOC103499277 isoform X10.0e+0080.17Show/hide
Query:  GTCRTSFHPICAREARHRMEVWAKYGCDNVELRAFCSKHSEGRDRSSVQDPLEAVNSSSYVVNHLPVTLSINRPQKLVGRRNIDSLLLCQEASDTNSGKL
        GTCRTSFHPICAREARHRMEVWAKYGCDNVELRAFCSKHS  RD+SSVQDP EA+NSSSYVVNHLPVTLSINRPQKLVGRRNIDSLLLC+EASDTNSGKL
Subjt:  GTCRTSFHPICAREARHRMEVWAKYGCDNVELRAFCSKHSEGRDRSSVQDPLEAVNSSSYVVNHLPVTLSINRPQKLVGRRNIDSLLLCQEASDTNSGKL

Query:  DDGELEDIGSADPSLNAACVDTQKSTVQGAGD---------------LIDQGKVNVKDVASEIGIPPDLLCAKLTAGNIVPDLKSKIVRWLRNHAYIGSL
        DDGELED GSADP+ NAACVD+QKSTVQG  D               LIDQGKVNVKDVA EIGIPPDLLCAKLTA NIVPDLKSKIVRWLRNHAYIGSL
Subjt:  DDGELEDIGSADPSLNAACVDTQKSTVQGAGD---------------LIDQGKVNVKDVASEIGIPPDLLCAKLTAGNIVPDLKSKIVRWLRNHAYIGSL

Query:  QKNLRVKLKSAVLAKAVVGAADRSNSLSVLDSDNSDLIADKMVTPRRKTKSNISHLKN-EIKSSSEEALGGHGLAMQSDALNQQACEEQGDSKKECIQDA
        QKNLRVKLKSAVLAKAVVGAADRS SL+V DSDNSDLIADKMV PRRKTKSN+SHLKN EIKSSSEE LGGHGL  QS+ L+QQACEEQ DS KECIQDA
Subjt:  QKNLRVKLKSAVLAKAVVGAADRSNSLSVLDSDNSDLIADKMVTPRRKTKSNISHLKN-EIKSSSEEALGGHGLAMQSDALNQQACEEQGDSKKECIQDA

Query:  GEKFSLLSDVTILSLTLTNSPEKPYGPEQTTKVALGSVIKCKEEASGKLMIGDENSSFSLLVEKHRTLGMPKQNLATFKQRWVHLNKCGSSQDSPSRNFP
        GEK                                                                                 +N+C SSQ SPSRNFP
Subjt:  GEKFSLLSDVTILSLTLTNSPEKPYGPEQTTKVALGSVIKCKEEASGKLMIGDENSSFSLLVEKHRTLGMPKQNLATFKQRWVHLNKCGSSQDSPSRNFP

Query:  ND-VEGDQLEVSVSGHNSSISAVHGKAGESPGSYFHPFVQEKMAYMLHGKLLNVSEGEMSCWQASSNVGGCCDHQHQHVDCNDASCNSGEFSPKQQVNKK
        N  VEG+QLE SVSGH+SSISAVHGKAGES GSY HPFV+EKM YMLHGKLLNV  GE+S  Q S + G C DHQHQH+DC D SCNSGEFSPKQQ NKK
Subjt:  ND-VEGDQLEVSVSGHNSSISAVHGKAGESPGSYFHPFVQEKMAYMLHGKLLNVSEGEMSCWQASSNVGGCCDHQHQHVDCNDASCNSGEFSPKQQVNKK

Query:  IGGVIKLSPEDEIEGEIIFYQHRLLANAVSRKRFTDHLICNVVKSLPKEIDEARSTRWDAVLINQYYSELREAKKQGKKERRHKEAQAVLAAATAAAAAS
        IGG+IK+SPEDEIEGEIIFYQHRLLANAVSRKRFTDHLICNVVKSLPKEIDEARSTRWDA+LINQYYS LREAKKQGKKERRHKEAQAVLAAATAAAAAS
Subjt:  IGGVIKLSPEDEIEGEIIFYQHRLLANAVSRKRFTDHLICNVVKSLPKEIDEARSTRWDAVLINQYYSELREAKKQGKKERRHKEAQAVLAAATAAAAAS

Query:  SRMSSFRKDVYEESTHRELMPRAKETLTKVALPKTSSESDFCKEHARSCDICRRPETILKPILVCSSCKVSVHLDCYRTVKESSGPWCCELCEELSLSRG
        SRMSSFRKDVYEESTHRELMPRAKET TKVALPKTS ESDFCKEHARSCDICRRPET+LKPILVCSSCKVSVHLDCYRTVKESSGPWCCELCEELSLSRG
Subjt:  SRMSSFRKDVYEESTHRELMPRAKETLTKVALPKTSSESDFCKEHARSCDICRRPETILKPILVCSSCKVSVHLDCYRTVKESSGPWCCELCEELSLSRG

Query:  SGAPVVNIWEKSYFVAECGLCGGTTGAFRKSSDGQWVHAFCAEWVFESTFKRGQANPVGGMETVSKGVESCYICHRKHGVCLKCNYGHCQSTFHPSCGRS
        SGAPVVN WEKSYFVAECGLCGGTTGAFRKSSDGQWVHAFCAEWVFESTFKRGQ NPVGGMETVSKG +SCYICHRKHGVCLKCNYGHCQSTFHPSC RS
Subjt:  SGAPVVNIWEKSYFVAECGLCGGTTGAFRKSSDGQWVHAFCAEWVFESTFKRGQANPVGGMETVSKGVESCYICHRKHGVCLKCNYGHCQSTFHPSCGRS

Query:  AGCYMTVKTSGGKLQHRAYCEKHSSEQRA--------------------------------------KRDLVLCSHDVLAFKRDHVARSVLVRSPFFLPE
        AGCYMTVK+SGGKLQHRAYCEKHSSEQRA                                      KRDLVLCSHDVLAFKRDHVARSVLVRSPFFLPE
Subjt:  AGCYMTVKTSGGKLQHRAYCEKHSSEQRA--------------------------------------KRDLVLCSHDVLAFKRDHVARSVLVRSPFFLPE

Query:  VSSESATTSLKGHVEDLKSCSEAVQRSDDVTVDSTVSIKHWNKVPLSLDTEQKTDDDSTTSQNPFPRKFADRGQYAGKQIPQRSSTTTSRNLVDVGGLRF
        VSSESATTSLKGHVEDLKSCSEAVQRSDDVTVDSTVSIKHWNKVPLSLDTEQKTD+DSTTSQNPFPRKFADR  YAGKQIPQRSSTTTSRNL+D GGLRF
Subjt:  VSSESATTSLKGHVEDLKSCSEAVQRSDDVTVDSTVSIKHWNKVPLSLDTEQKTDDDSTTSQNPFPRKFADRGQYAGKQIPQRSSTTTSRNLVDVGGLRF

Query:  KSRKHAETFQKELVMTSDQASMKNSLLPKQYLYVPADVLAKEKQVNQETGSAEPPKCDR
        KSRKHAETFQKELVMTSDQASMKNSLLPKQYLYVPADVLAKEKQ NQE GSAEPPKCDR
Subjt:  KSRKHAETFQKELVMTSDQASMKNSLLPKQYLYVPADVLAKEKQVNQETGSAEPPKCDR

A0A1S3CDU2 uncharacterized protein LOC103499277 isoform X20.0e+0080.25Show/hide
Query:  GTCRTSFHPICAREARHRMEVWAKYGCDNVELRAFCSKHSEGRDRSSVQDPLEAVNSSSYVVNHLPVTLSINRPQKLVGRRNIDSLLLCQEASDTNSGKL
        GTCRTSFHPICAREARHRMEVWAKYGCDNVELRAFCSKHS  RD+SSVQDP EA+NSSSYVVNHLPVTLSINRPQKLVGRRNIDSLLLC+EASDTNSGKL
Subjt:  GTCRTSFHPICAREARHRMEVWAKYGCDNVELRAFCSKHSEGRDRSSVQDPLEAVNSSSYVVNHLPVTLSINRPQKLVGRRNIDSLLLCQEASDTNSGKL

Query:  DDGELEDIGSADPSLNAACVDTQKSTVQGAGD---------------LIDQGKVNVKDVASEIGIPPDLLCAKLTAGNIVPDLKSKIVRWLRNHAYIGSL
        DDGELED GSADP+ NAACVD+QKSTVQG  D               LIDQGKVNVKDVA EIGIPPDLLCAKLTA NIVPDLKSKIVRWLRNHAYIGSL
Subjt:  DDGELEDIGSADPSLNAACVDTQKSTVQGAGD---------------LIDQGKVNVKDVASEIGIPPDLLCAKLTAGNIVPDLKSKIVRWLRNHAYIGSL

Query:  QKNLRVKLKSAVLAKAVVGAADRSNSLSVLDSDNSDLIADKMVTPRRKTKSNISHLKN-EIKSSSEEALGGHGLAMQSDALNQQACEEQGDSKKECIQDA
        QKNLRVKLKSAVLAKAVVGAADRS SL+V DSDNSDLIADKMV PRRKTKSN+SHLKN EIKSSSEE LGGHGL  QS+ L+QQACEEQ DS KECIQDA
Subjt:  QKNLRVKLKSAVLAKAVVGAADRSNSLSVLDSDNSDLIADKMVTPRRKTKSNISHLKN-EIKSSSEEALGGHGLAMQSDALNQQACEEQGDSKKECIQDA

Query:  GEKFSLLSDVTILSLTLTNSPEKPYGPEQTTKVALGSVIKCKEEASGKLMIGDENSSFSLLVEKHRTLGMPKQNLATFKQRWVHLNKCGSSQDSPSRNFP
        GEK                                                                                 +N+C SSQ SPSRNFP
Subjt:  GEKFSLLSDVTILSLTLTNSPEKPYGPEQTTKVALGSVIKCKEEASGKLMIGDENSSFSLLVEKHRTLGMPKQNLATFKQRWVHLNKCGSSQDSPSRNFP

Query:  NDVEGDQLEVSVSGHNSSISAVHGKAGESPGSYFHPFVQEKMAYMLHGKLLNVSEGEMSCWQASSNVGGCCDHQHQHVDCNDASCNSGEFSPKQQVNKKI
        N VEG+QLE SVSGH+SSISAVHGKAGES GSY HPFV+EKM YMLHGKLLNV  GE+S  Q S + G C DHQHQH+DC D SCNSGEFSPKQQ NKKI
Subjt:  NDVEGDQLEVSVSGHNSSISAVHGKAGESPGSYFHPFVQEKMAYMLHGKLLNVSEGEMSCWQASSNVGGCCDHQHQHVDCNDASCNSGEFSPKQQVNKKI

Query:  GGVIKLSPEDEIEGEIIFYQHRLLANAVSRKRFTDHLICNVVKSLPKEIDEARSTRWDAVLINQYYSELREAKKQGKKERRHKEAQAVLAAATAAAAASS
        GG+IK+SPEDEIEGEIIFYQHRLLANAVSRKRFTDHLICNVVKSLPKEIDEARSTRWDA+LINQYYS LREAKKQGKKERRHKEAQAVLAAATAAAAASS
Subjt:  GGVIKLSPEDEIEGEIIFYQHRLLANAVSRKRFTDHLICNVVKSLPKEIDEARSTRWDAVLINQYYSELREAKKQGKKERRHKEAQAVLAAATAAAAASS

Query:  RMSSFRKDVYEESTHRELMPRAKETLTKVALPKTSSESDFCKEHARSCDICRRPETILKPILVCSSCKVSVHLDCYRTVKESSGPWCCELCEELSLSRGS
        RMSSFRKDVYEESTHRELMPRAKET TKVALPKTS ESDFCKEHARSCDICRRPET+LKPILVCSSCKVSVHLDCYRTVKESSGPWCCELCEELSLSRGS
Subjt:  RMSSFRKDVYEESTHRELMPRAKETLTKVALPKTSSESDFCKEHARSCDICRRPETILKPILVCSSCKVSVHLDCYRTVKESSGPWCCELCEELSLSRGS

Query:  GAPVVNIWEKSYFVAECGLCGGTTGAFRKSSDGQWVHAFCAEWVFESTFKRGQANPVGGMETVSKGVESCYICHRKHGVCLKCNYGHCQSTFHPSCGRSA
        GAPVVN WEKSYFVAECGLCGGTTGAFRKSSDGQWVHAFCAEWVFESTFKRGQ NPVGGMETVSKG +SCYICHRKHGVCLKCNYGHCQSTFHPSC RSA
Subjt:  GAPVVNIWEKSYFVAECGLCGGTTGAFRKSSDGQWVHAFCAEWVFESTFKRGQANPVGGMETVSKGVESCYICHRKHGVCLKCNYGHCQSTFHPSCGRSA

Query:  GCYMTVKTSGGKLQHRAYCEKHSSEQRA--------------------------------------KRDLVLCSHDVLAFKRDHVARSVLVRSPFFLPEV
        GCYMTVK+SGGKLQHRAYCEKHSSEQRA                                      KRDLVLCSHDVLAFKRDHVARSVLVRSPFFLPEV
Subjt:  GCYMTVKTSGGKLQHRAYCEKHSSEQRA--------------------------------------KRDLVLCSHDVLAFKRDHVARSVLVRSPFFLPEV

Query:  SSESATTSLKGHVEDLKSCSEAVQRSDDVTVDSTVSIKHWNKVPLSLDTEQKTDDDSTTSQNPFPRKFADRGQYAGKQIPQRSSTTTSRNLVDVGGLRFK
        SSESATTSLKGHVEDLKSCSEAVQRSDDVTVDSTVSIKHWNKVPLSLDTEQKTD+DSTTSQNPFPRKFADR  YAGKQIPQRSSTTTSRNL+D GGLRFK
Subjt:  SSESATTSLKGHVEDLKSCSEAVQRSDDVTVDSTVSIKHWNKVPLSLDTEQKTDDDSTTSQNPFPRKFADRGQYAGKQIPQRSSTTTSRNLVDVGGLRFK

Query:  SRKHAETFQKELVMTSDQASMKNSLLPKQYLYVPADVLAKEKQVNQETGSAEPPKCDR
        SRKHAETFQKELVMTSDQASMKNSLLPKQYLYVPADVLAKEKQ NQE GSAEPPKCDR
Subjt:  SRKHAETFQKELVMTSDQASMKNSLLPKQYLYVPADVLAKEKQVNQETGSAEPPKCDR

A0A6J1FEE0 uncharacterized protein LOC111444957 isoform X10.0e+0077.98Show/hide
Query:  GTCRTSFHPICAREARHRMEVWAKYGCDNVELRAFCSKHSEGRDRSSVQDPLEAVNSSSYVVNHLPVTLSINRPQKLVGRRNIDSLLLCQEASDTNSGKL
        GTCRTSFHPICAREAR+RMEVWAKYGCDNVELRAFCSKHS+  DRSS Q P EAVNSSSYVV+HLPVTLSINRPQKLVGRRNID+LLLC+EASDTNSGKL
Subjt:  GTCRTSFHPICAREARHRMEVWAKYGCDNVELRAFCSKHSEGRDRSSVQDPLEAVNSSSYVVNHLPVTLSINRPQKLVGRRNIDSLLLCQEASDTNSGKL

Query:  DDGELEDIGSADPSLNAACVDTQKSTVQGAGD---------------LIDQGKVNVKDVASEIGIPPDLLCAKLTAGNIVPDLKSKIVRWLRNHAYIGSL
        D GE E+IGS DPSLNA CVDT KSTVQG  D               LIDQGKVN KDVASEIGIPPDLLCAKL AGN+VPDLKSKIVRWLR+HAYIG+L
Subjt:  DDGELEDIGSADPSLNAACVDTQKSTVQGAGD---------------LIDQGKVNVKDVASEIGIPPDLLCAKLTAGNIVPDLKSKIVRWLRNHAYIGSL

Query:  QKNLRVKLKSAVLAKAVVGAADRSNSLSVLDSDNSDLIADKMVTPRRKTKSNISHLK-NEIKSSSEEALGGHGLAMQSDALNQQACEEQGDSKKECIQDA
        Q+NLRVKLKSAV AKAV G+ADRS+SLSV DSDNSDLIA+KMVT RRKTKSNISHLK +EIKSSSEE LGGH L MQSD L+QQACEEQGDS KECIQDA
Subjt:  QKNLRVKLKSAVLAKAVVGAADRSNSLSVLDSDNSDLIADKMVTPRRKTKSNISHLK-NEIKSSSEEALGGHGLAMQSDALNQQACEEQGDSKKECIQDA

Query:  GEKFSLLSDVTILSLTLTNSPEKPYGPEQTTKVALGSVIKCKEEASGKLMIGDENSSFSLLVEKHRTLGMPKQNLATFKQRWVHLNKCGSSQDSPSRNFP
        GEK                                                                                HLN+  SSQD+P RN P
Subjt:  GEKFSLLSDVTILSLTLTNSPEKPYGPEQTTKVALGSVIKCKEEASGKLMIGDENSSFSLLVEKHRTLGMPKQNLATFKQRWVHLNKCGSSQDSPSRNFP

Query:  NDVEGDQLEVSVSGHNSSISAVHGKAGESPGSYFHPFVQEKMAYMLHGKLLNVSEGEMSCWQASSNVGGCCDHQHQHVDCNDASCNSGEFSPKQQVNKKI
        NDVEGD LE SVSGHNSS+SAVHGKAGESP SYFHP+VQEKMA+ML  KLLN+SEGEMS WQASS+ G CC H  QH DCN  S  SG F+PKQ VNKKI
Subjt:  NDVEGDQLEVSVSGHNSSISAVHGKAGESPGSYFHPFVQEKMAYMLHGKLLNVSEGEMSCWQASSNVGGCCDHQHQHVDCNDASCNSGEFSPKQQVNKKI

Query:  GGVIKLSPEDEIEGEIIFYQHRLLANAVSRKRFTDHLICNVVKSLPKEIDEARSTRWDAVLINQYYSELREAKKQGKKERRHKEAQAVLAAATAAAAASS
        GG+IKLSPEDEIEGEIIFYQ RLLANAVSRKRFTD+LICNVVKSLPKEI+EARSTRWDAVLINQY+ ELREAKK+GKKERRHKEAQAVLAAATAAAAASS
Subjt:  GGVIKLSPEDEIEGEIIFYQHRLLANAVSRKRFTDHLICNVVKSLPKEIDEARSTRWDAVLINQYYSELREAKKQGKKERRHKEAQAVLAAATAAAAASS

Query:  RMSSFRKDVYEESTHRELMPRAKETLTKVALPKTSSESDFCKEHARSCDICRRPETILKPILVCSSCKVSVHLDCYRTVKESSGPWCCELCEELSLSRGS
        RMSSFRKDVYEES HRELMPRAKETLTKVALPKTS ESDFCKEHARSCDICRRPET+LKPILVC+SCKVSVHLDCYRTVKESSGPW CELCEEL++SRGS
Subjt:  RMSSFRKDVYEESTHRELMPRAKETLTKVALPKTSSESDFCKEHARSCDICRRPETILKPILVCSSCKVSVHLDCYRTVKESSGPWCCELCEELSLSRGS

Query:  GAPVVNIWEKSYFVAECGLCGGTTGAFRKSSDGQWVHAFCAEWVFESTFKRGQANPVGGMETVSKGVESCYICHRKHGVCLKCNYGHCQSTFHPSCGRSA
        G PVVN WEKSYFVAECGLCGGTTGAFRKSSDGQWVHA CAEWVFEST+KRGQANPVGGMETVSKGV+SCYICHRKHGV LKCNYGHCQ+TFHP C RSA
Subjt:  GAPVVNIWEKSYFVAECGLCGGTTGAFRKSSDGQWVHAFCAEWVFESTFKRGQANPVGGMETVSKGVESCYICHRKHGVCLKCNYGHCQSTFHPSCGRSA

Query:  GCYMTVKTSGGKLQHRAYCEKHSSEQRA--------------------------------------KRDLVLCSHDVLAFKRDHVARSVLVRSPFFLPEV
        GCYMTVKTSGGKLQHRAYCEKHSSEQRA                                      KRDLVLCSHDVLAFKRDHVARSVLV SPFFLPEV
Subjt:  GCYMTVKTSGGKLQHRAYCEKHSSEQRA--------------------------------------KRDLVLCSHDVLAFKRDHVARSVLVRSPFFLPEV

Query:  SSESATTSLKGHVEDLKSCSEAVQRSDDVTVDSTVSIKHWNKVPLSLDTEQKTDDDSTTSQNPFPRKFADRGQYAGKQIPQRSSTTTSRNLVDVGGLRFK
        SSESATTSLKGHVEDLKSCSEAVQRSDD+TVDSTVSIKHWNKVPLSLDTEQKTDDDS+TSQNPFP+KF DRGQ+AGKQIPQRSST+TSRNLVDV GLRFK
Subjt:  SSESATTSLKGHVEDLKSCSEAVQRSDDVTVDSTVSIKHWNKVPLSLDTEQKTDDDSTTSQNPFPRKFADRGQYAGKQIPQRSSTTTSRNLVDVGGLRFK

Query:  SRKHAETFQKELVMTSDQASMKNSLLPKQYLYVPADVLAKEKQVNQETGSAEPPKCDR
        SRKHAETFQKELVMTSDQASMKNSLLPKQYLYVPADVLAKEKQVNQETGSAEPPKCDR
Subjt:  SRKHAETFQKELVMTSDQASMKNSLLPKQYLYVPADVLAKEKQVNQETGSAEPPKCDR

SwissProt top hitse value%identityAlignment
B2KF05 Bromodomain and PHD finger-containing protein 33.1e-1731.43Show/hide
Query:  ILVCSSCKVSVHLDCYRTVKESSGPWCCELCEELSLSRGSGAPVVNIWEKSYFVAECGLCGGTTGAFRKSSDGQWVHAFCAEWVFESTFKRGQ-ANPVGG
        IL C  C ++VH +CY       G W C  C + S SR    PV           +C LC    GAF+++SDG W H  CA W+ E  F       P+ G
Subjt:  ILVCSSCKVSVHLDCYRTVKESSGPWCCELCEELSLSRGSGAPVVNIWEKSYFVAECGLCGGTTGAFRKSSDGQWVHAFCAEWVFESTFKRGQ-ANPVGG

Query:  METV--SKGVESCYICHRKH-GVCLKCNYGHCQSTFHPSCGRSAGCYMTVKT--------SGGKLQHRAYCEKHS
        ++ +  ++   +CYIC +K  G  ++C+  +C + FH +C + AG +M ++         +   ++  AYCE HS
Subjt:  METV--SKGVESCYICHRKH-GVCLKCNYGHCQSTFHPSCGRSAGCYMTVKT--------SGGKLQHRAYCEKHS

B2RRD7 Peregrin1.7e-1830.77Show/hide
Query:  ILVCSSCKVSVHLDCYRTVKESSGPWCCELCEELSLSRGSGAPVVNIWEKSYFVAECGLCGGTTGAFRKSSDGQWVHAFCAEWVFESTFKRGQ-ANPVGG
        IL C  C ++VH +CY       G W C  C + S SR                 +C LC    GAF+++ DG+W H  CA W+ E  F       P+  
Subjt:  ILVCSSCKVSVHLDCYRTVKESSGPWCCELCEELSLSRGSGAPVVNIWEKSYFVAECGLCGGTTGAFRKSSDGQWVHAFCAEWVFESTFKRGQ-ANPVGG

Query:  METV--SKGVESCYIC-HRKHGVCLKCNYGHCQSTFHPSCGRSAGCYM---TVKTSGG-----KLQHRAYCEKHSSEQRAKR
        +E +  ++   +CYIC  R  G C++C+  +C + FH +C + AG YM    V+ +G       ++  AYC+ H+    A+R
Subjt:  METV--SKGVESCYIC-HRKHGVCLKCNYGHCQSTFHPSCGRSAGCYM---TVKTSGG-----KLQHRAYCEKHSSEQRAKR

P55201 Peregrin1.7e-1830.77Show/hide
Query:  ILVCSSCKVSVHLDCYRTVKESSGPWCCELCEELSLSRGSGAPVVNIWEKSYFVAECGLCGGTTGAFRKSSDGQWVHAFCAEWVFESTFKRGQ-ANPVGG
        IL C  C ++VH +CY       G W C  C + S SR                 +C LC    GAF+++ DG+W H  CA W+ E  F       P+  
Subjt:  ILVCSSCKVSVHLDCYRTVKESSGPWCCELCEELSLSRGSGAPVVNIWEKSYFVAECGLCGGTTGAFRKSSDGQWVHAFCAEWVFESTFKRGQ-ANPVGG

Query:  METV--SKGVESCYIC-HRKHGVCLKCNYGHCQSTFHPSCGRSAGCYM---TVKTSGG-----KLQHRAYCEKHSSEQRAKR
        +E +  ++   +CYIC  R  G C++C+  +C + FH +C + AG YM    V+ +G       ++  AYC+ H+    A+R
Subjt:  METV--SKGVESCYIC-HRKHGVCLKCNYGHCQSTFHPSCGRSAGCYM---TVKTSGG-----KLQHRAYCEKHSSEQRAKR

Q12311 NuA3 HAT complex component NTO18.8e-2030.81Show/hide
Query:  RSCDICRRPET-ILKPILVCSSCKVSVHLDCYRTVKESSGPWCCELCEELSLSRGSGAPVVNIWEKSYFVAECGLCGGTTGAFRKSSDGQWVHAFCAEWV
        ++C +C   ++  L  I+ C  C ++VH +CY  +    G W C  C               +  K+ F A C +C   TGAF+++  G WVH  CA W+
Subjt:  RSCDICRRPET-ILKPILVCSSCKVSVHLDCYRTVKESSGPWCCELCEELSLSRGSGAPVVNIWEKSYFVAECGLCGGTTGAFRKSSDGQWVHAFCAEWV

Query:  FESTFKR-GQANPVGGME--TVSKGVESCYICHRKHGVCLKCNYGHCQSTFHPSCGRSAGCYMTV-KTSGGKLQHRAYCEKHSSE
         E  F       P+ G++  +VS+   +CYIC +K G C++C   +C + +H +C R AG YM+  K +  +L    + +K+S E
Subjt:  FESTFKR-GQANPVGGME--TVSKGVESCYICHRKHGVCLKCNYGHCQSTFHPSCGRSAGCYMTV-KTSGGKLQHRAYCEKHSSE

Q9ULD4 Bromodomain and PHD finger-containing protein 34.1e-1732Show/hide
Query:  ILVCSSCKVSVHLDCYRTVKESSGPWCCELCEELSLSRGSGAPVVNIWEKSYFVAECGLCGGTTGAFRKSSDGQWVHAFCAEWVFESTFKRGQ-ANPVGG
        IL C  C ++VH +CY       G W C  C + S SR    PV           +C LC    GAF+++SDG W H  CA W+ E  F       P+ G
Subjt:  ILVCSSCKVSVHLDCYRTVKESSGPWCCELCEELSLSRGSGAPVVNIWEKSYFVAECGLCGGTTGAFRKSSDGQWVHAFCAEWVFESTFKRGQ-ANPVGG

Query:  METV--SKGVESCYICHRKH-GVCLKCNYGHCQSTFHPSCGRSAGCYMTVK-----TSGGKL---QHRAYCEKHS
        ++ +  ++   +CYIC +K  G  ++C+  +C + FH +C + AG +M ++     +  G +   +  AYCE HS
Subjt:  METV--SKGVESCYICHRKH-GVCLKCNYGHCQSTFHPSCGRSAGCYMTVK-----TSGGKL---QHRAYCEKHS

Arabidopsis top hitse value%identityAlignment
AT1G05830.1 trithorax-like protein 29.4e-1729.56Show/hide
Query:  CDICRRPETILKPI-LVCSSCKVSVHLDCYRTVKESSG-PWCCELCEELSLSRGSGAPVVNIWEKSYFVAECGLCGGTTGAFRKSSDGQWVHAFCAEWVF
        C++C   E     + L C  C++ VH  CY  ++  +G  W C LC  ++L                    C LC    GA + ++DG+W H  CA W+ 
Subjt:  CDICRRPETILKPI-LVCSSCKVSVHLDCYRTVKESSG-PWCCELCEELSLSRGSGAPVVNIWEKSYFVAECGLCGGTTGAFRKSSDGQWVHAFCAEWVF

Query:  ES-TFKRGQANPVGGMETVSKGVES--CYICHRKHGVCLKCNYGHCQSTFHPSCGRSAG
        E+      +  P+ G++ VSK      C IC   +G C++C+   C+  +HP C R+AG
Subjt:  ES-TFKRGQANPVGGMETVSKGVES--CYICHRKHGVCLKCNYGHCQSTFHPSCGRSAG

AT1G05830.2 trithorax-like protein 29.4e-1729.56Show/hide
Query:  CDICRRPETILKPI-LVCSSCKVSVHLDCYRTVKESSG-PWCCELCEELSLSRGSGAPVVNIWEKSYFVAECGLCGGTTGAFRKSSDGQWVHAFCAEWVF
        C++C   E     + L C  C++ VH  CY  ++  +G  W C LC  ++L                    C LC    GA + ++DG+W H  CA W+ 
Subjt:  CDICRRPETILKPI-LVCSSCKVSVHLDCYRTVKESSG-PWCCELCEELSLSRGSGAPVVNIWEKSYFVAECGLCGGTTGAFRKSSDGQWVHAFCAEWVF

Query:  ES-TFKRGQANPVGGMETVSKGVES--CYICHRKHGVCLKCNYGHCQSTFHPSCGRSAG
        E+      +  P+ G++ VSK      C IC   +G C++C+   C+  +HP C R+AG
Subjt:  ES-TFKRGQANPVGGMETVSKGVES--CYICHRKHGVCLKCNYGHCQSTFHPSCGRSAG

AT1G77800.1 PHD finger family protein2.1e-17337.78Show/hide
Query:  LIWFFF-GTCRTSFHPICAREARHRMEVWAKYGCDNVELRAFCSKHSEGRDR-SSVQDPLEAVNSSSYVVNHLPVTLSINRPQKLVGRRNIDSLLLCQEA
        ++W  F G C    +   A EA +R+EVW K+GCD VELRAFCSKHS+ ++   SV+        S   + HLP + S+       G  + D + +    
Subjt:  LIWFFF-GTCRTSFHPICAREARHRMEVWAKYGCDNVELRAFCSKHSEGRDR-SSVQDPLEAVNSSSYVVNHLPVTLSINRPQKLVGRRNIDSLLLCQEA

Query:  SDTNSGKLDDGELEDIGSADPSLNAACVDTQKSTVQGAGD------------------LIDQGKVNVKDVASEIGIPPDLLCAKLTAGNIVPDLKSKIVR
          T S    + +L+++ S     N +  D  +S + G  +                  LID GKV+VKDVA+EIGI PD L AKLT G+++PDL  K+V+
Subjt:  SDTNSGKLDDGELEDIGSADPSLNAACVDTQKSTVQGAGD------------------LIDQGKVNVKDVASEIGIPPDLLCAKLTAGNIVPDLKSKIVR

Query:  WLRNHAYIGSLQ--KNLRVKLKSAVLAK--AVVGAADRSNSLSVLDSDNSDLIADKMVTPRRKTKSNISHLKNEIKSSSEEALGGHGLAMQSDALNQQAC
        WL  HA++GS    KNL+ K   +VL K  +V  A D S        + + ++ D+           + H K+ +  S +     HG             
Subjt:  WLRNHAYIGSLQ--KNLRVKLKSAVLAK--AVVGAADRSNSLSVLDSDNSDLIADKMVTPRRKTKSNISHLKNEIKSSSEEALGGHGLAMQSDALNQQAC

Query:  EEQGDSKKECIQDAGEKFSLLSDVTILSLTLTNSPEKPYGPEQTTKVALGSVIKCKEEASGKLMIGDENSSFSLLVEKHRTLGMPKQNLATFKQRWVHLN
         EQ +S                                                    +SG +M     ++FSL                          
Subjt:  EEQGDSKKECIQDAGEKFSLLSDVTILSLTLTNSPEKPYGPEQTTKVALGSVIKCKEEASGKLMIGDENSSFSLLVEKHRTLGMPKQNLATFKQRWVHLN

Query:  KCGSSQDSPSRNFPNDVEGDQLEVSVSGHNSSISAVHGKAGESPGSYFHPFVQEKMAYMLHGKLLNVSEGEMSCWQASSNVGGCCDHQHQHVDCNDASCN
        +  SSQ+  + N PN +  D                       PG   H ++ ++++ +   + L  S  +    + ++N  G  +  ++H+        
Subjt:  KCGSSQDSPSRNFPNDVEGDQLEVSVSGHNSSISAVHGKAGESPGSYFHPFVQEKMAYMLHGKLLNVSEGEMSCWQASSNVGGCCDHQHQHVDCNDASCN

Query:  SGEFSPKQQVNKKIGGVIKLSPEDEIEGEIIFYQHRLLANAVSRKRFTDHLICNVVKSLPKEIDEARSTRWDAVLINQYYSELREAKKQGKKERRHKEAQ
         G  +  Q    +  G++ LSPEDE+EGE+++YQ +LL  AVSRK+ +D+L+  V K LP EIDE    RWD VL+N+Y+ ++REA+KQG+KE+R+K+AQ
Subjt:  SGEFSPKQQVNKKIGGVIKLSPEDEIEGEIIFYQHRLLANAVSRKRFTDHLICNVVKSLPKEIDEARSTRWDAVLINQYYSELREAKKQGKKERRHKEAQ

Query:  AVLAAATAAAAASSRMSSFRKDVYEESTHRE-------------LMPRAKETLTKVALPKTSSES-------DFCKEHARSCDICRRPETILKPILVCSS
        AVLAAATAAAA SSR +S RKD+ EE   +E             L+P+ KE+L K+A+    SE        DF  E+ R+CDICRR ETI   I+VCSS
Subjt:  AVLAAATAAAAASSRMSSFRKDVYEESTHRE-------------LMPRAKETLTKVALPKTSSES-------DFCKEHARSCDICRRPETILKPILVCSS

Query:  CKVSVHLDCYRTVKESSGPWCCELCEELSLSRGSGAPVVNIWEKSYFVAECGLCGGTTGAFRKSSDGQWVHAFCAEWVFESTFKRGQANPVGGMETVSKG
        CKV+VH+DCY+  KES+GPW CELC E      S  P  N  EK     EC LCGGTTGAFRK+++GQWVHAFCAEW  ESTF+RGQ NPV GME+++K 
Subjt:  CKVSVHLDCYRTVKESSGPWCCELCEELSLSRGSGAPVVNIWEKSYFVAECGLCGGTTGAFRKSSDGQWVHAFCAEWVFESTFKRGQANPVGGMETVSKG

Query:  VESCYICHRKHGVCLKCNYGHCQSTFHPSCGRSAGCYMTVKTSGGKLQHRAYCEKHSSEQRA--------------------------------------
         ++C +C R +G C KC+YG+CQ+TFHPSC RSAG +M   T GGK  H+AYCEKHS EQ+A                                      
Subjt:  VESCYICHRKHGVCLKCNYGHCQSTFHPSCGRSAGCYMTVKTSGGKLQHRAYCEKHSSEQRA--------------------------------------

Query:  KRDLVLCSHDVLAFKRDHVARSVLVRSPFFLPEVSSESATTSLKGHVEDLKSCSEAVQRSDDVTVDSTVSIKHWNKVPLSLDTEQKTDDDSTTSQNPFPR
        KR+L + SH++LA KRDH AR + VR+PF  PEVSS+SATTS+KGH +   S SEA+QRSDD+T+DSTV+ K   K PL +DT+QKT DDS TS++ F R
Subjt:  KRDLVLCSHDVLAFKRDHVARSVLVRSPFFLPEVSSESATTSLKGHVEDLKSCSEAVQRSDDVTVDSTVSIKHWNKVPLSLDTEQKTDDDSTTSQNPFPR

Query:  KFADRGQYAGKQIPQRSSTTTSRNLVDVGGLRFKSRKHAETFQKELVMTSDQASMKNSLLPKQYLYVPADVLAKEKQVNQETGSAEPP
        K  +R   +GK +P++    +     D        ++H ETF KELVMTSD+AS KN  LPK Y YVP D L ++K  NQ+  S++ P
Subjt:  KFADRGQYAGKQIPQRSSTTTSRNLVDVGGLRFKSRKHAETFQKELVMTSDQASMKNSLLPKQYLYVPADVLAKEKQVNQETGSAEPP

AT1G77800.2 PHD finger family protein8.4e-17538.14Show/hide
Query:  LIWFFF-GTCRTSFHPICAREARHRMEVWAKYGCDNVELRAFCSKHSEGRDR-SSVQDPLEAVNSSSYVVNHLPVTLSINRPQKLVGRRNIDSLLLCQEA
        ++W  F G C    +   A EA +R+EVW K+GCD VELRAFCSKHS+ ++   SV+        S   + HLP + S+       G  + D + +    
Subjt:  LIWFFF-GTCRTSFHPICAREARHRMEVWAKYGCDNVELRAFCSKHSEGRDR-SSVQDPLEAVNSSSYVVNHLPVTLSINRPQKLVGRRNIDSLLLCQEA

Query:  SDTNSGKLDDGELEDIGSADPSLNAACVDTQKSTVQGAGD------------------LIDQGKVNVKDVASEIGIPPDLLCAKLTAGNIVPDLKSKIVR
          T S    + +L+++ S     N +  D  +S + G  +                  LID GKV+VKDVA+EIGI PD L AKLT G+++PDL  K+V+
Subjt:  SDTNSGKLDDGELEDIGSADPSLNAACVDTQKSTVQGAGD------------------LIDQGKVNVKDVASEIGIPPDLLCAKLTAGNIVPDLKSKIVR

Query:  WLRNHAYIGSLQ--KNLRVKLKSAVLAK--AVVGAADRSNSLSVLDSDNSDLIADKMVTPRRKTKSNISHLKNEIKSSSEEALGGHGLAMQSDALNQQAC
        WL  HA++GS    KNL+ K   +VL K  +V  A D S        + + ++ D+           + H K+ +  S +     HG             
Subjt:  WLRNHAYIGSLQ--KNLRVKLKSAVLAK--AVVGAADRSNSLSVLDSDNSDLIADKMVTPRRKTKSNISHLKNEIKSSSEEALGGHGLAMQSDALNQQAC

Query:  EEQGDSKKECIQDAGEKFSLLSDVTILSLTLTNSPEKPYGPEQTTKVALGSVIKCKEEASGKLMIGDENSSFSLLVEKHRTLGMPKQNLATFKQRWVHLN
         EQ +S                                                    +SG +M     ++FSL                          
Subjt:  EEQGDSKKECIQDAGEKFSLLSDVTILSLTLTNSPEKPYGPEQTTKVALGSVIKCKEEASGKLMIGDENSSFSLLVEKHRTLGMPKQNLATFKQRWVHLN

Query:  KCGSSQDSPSRNFPNDVEGDQLEVSVSGHNSSISAVHGKAGESPGSYFHPFVQEKMAYMLHGKLLNVSEGEMSCWQASSNVGGCCDHQHQHVDCNDASCN
        +  SSQ+  + N PN +  D                       PG   H ++ ++++ +   + L  S  +    + ++N  G  +  ++H+        
Subjt:  KCGSSQDSPSRNFPNDVEGDQLEVSVSGHNSSISAVHGKAGESPGSYFHPFVQEKMAYMLHGKLLNVSEGEMSCWQASSNVGGCCDHQHQHVDCNDASCN

Query:  SGEFSPKQQVNKKIGGVIKLSPEDEIEGEIIFYQHRLLANAVSRKRFTDHLICNVVKSLPKEIDEARSTRWDAVLINQYYSELREAKKQGKKERRHKEAQ
         G  +  Q    +  G++ LSPEDE+EGE+++YQ +LL  AVSRK+ +D+L+  V K LP EIDE    RWD VL+N+Y+ ++REA+KQG+KE+R+K+AQ
Subjt:  SGEFSPKQQVNKKIGGVIKLSPEDEIEGEIIFYQHRLLANAVSRKRFTDHLICNVVKSLPKEIDEARSTRWDAVLINQYYSELREAKKQGKKERRHKEAQ

Query:  AVLAAATAAAAASSRMSSFRKDVYEESTHRE-------------LMPRAKETLTKVALPKTSSES-------DFCKEHARSCDICRRPETILKPILVCSS
        AVLAAATAAAA SSR +S RKD+ EE   +E             L+P+ KE+L K+A+    SE        DF  E+ R+CDICRR ETI   I+VCSS
Subjt:  AVLAAATAAAAASSRMSSFRKDVYEESTHRE-------------LMPRAKETLTKVALPKTSSES-------DFCKEHARSCDICRRPETILKPILVCSS

Query:  CKVSVHLDCYRTVKESSGPWCCELCEELSLSRGSGAPVVNIWEKSYFVAECGLCGGTTGAFRKSSDGQWVHAFCAEWVFESTFKRGQANPVGGMETVSKG
        CKV+VH+DCY+  KES+GPW CELC E      S  P  N  EK     EC LCGGTTGAFRK+++GQWVHAFCAEW  ESTF+RGQ NPV GME+++K 
Subjt:  CKVSVHLDCYRTVKESSGPWCCELCEELSLSRGSGAPVVNIWEKSYFVAECGLCGGTTGAFRKSSDGQWVHAFCAEWVFESTFKRGQANPVGGMETVSKG

Query:  VESCYICHRKHGVCLKCNYGHCQSTFHPSCGRSAGCYMTVKTSGGKLQHRAYCEKHSSEQRA--------------------------------------
         ++C +C R +G C KC+YG+CQ+TFHPSC RSAG +M   T GGK  H+AYCEKHS EQ+A                                      
Subjt:  VESCYICHRKHGVCLKCNYGHCQSTFHPSCGRSAGCYMTVKTSGGKLQHRAYCEKHSSEQRA--------------------------------------

Query:  KRDLVLCSHDVLAFKRDHVARSVLVRSPFFLPEVSSESATTSLKGHVEDLKSCSEAVQRSDDVTVDSTVSIKHWNKVPLSLDTEQKTDDDSTTSQNPFPR
        KR+L + SH++LA KRDH AR + VR+PF  PEVSS+SATTS+KGH +   S SEA+QRSDD+T+DSTV+ K   K PL +DT+QKT DDS TS++ F R
Subjt:  KRDLVLCSHDVLAFKRDHVARSVLVRSPFFLPEVSSESATTSLKGHVEDLKSCSEAVQRSDDVTVDSTVSIKHWNKVPLSLDTEQKTDDDSTTSQNPFPR

Query:  KFADRGQYAGKQIPQRSSTTTSRNLVDVGGLRFKSRKHAETFQKELVMTSDQASMKNSLLPKQYLYVPADVLAKEKQVNQETGSAEPP
        K  +R   +GK +P R     S ++ + G    K +KH ETF KELVMTSD+AS KN  LPK Y YVP D L ++K  NQ+  S++ P
Subjt:  KFADRGQYAGKQIPQRSSTTTSRNLVDVGGLRFKSRKHAETFQKELVMTSDQASMKNSLLPKQYLYVPADVLAKEKQVNQETGSAEPP

AT2G31650.1 homologue of trithorax7.2e-1730.82Show/hide
Query:  CDICRRPETILKPI-LVCSSCKVSVHLDCYRTVKESSGP-WCCELCEELSLSRGSGAPVVNIWEKSYFVAECGLCGGTTGAFRKSSDGQWVHAFCAEWVF
        C++C   E     + L C  C++ VH  CY  ++   G  W C LC         GAP +           C LC    GA + ++DG+W H  CA W+ 
Subjt:  CDICRRPETILKPI-LVCSSCKVSVHLDCYRTVKESSGP-WCCELCEELSLSRGSGAPVVNIWEKSYFVAECGLCGGTTGAFRKSSDGQWVHAFCAEWVF

Query:  ESTFKR-GQANPVGGMETVSKG--VESCYICHRKHGVCLKCNYGHCQSTFHPSCGRSAG
        E+      +  P+ G+  VSK      C IC   +G C++C+   C+  +HP C R+AG
Subjt:  ESTFKR-GQANPVGGMETVSKG--VESCYICHRKHGVCLKCNYGHCQSTFHPSCGRSAG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTGGTGAGAATAGGAATCTTTCCTGTCTTGGATGGATGTTTTAATTCGCACGATATTGTTGGCTTGATATGGTTCTTCTTTGGAACCTGCAGAACCTCCTTTCATCC
AATTTGTGCAAGGGAGGCTAGACATAGAATGGAGGTGTGGGCCAAATATGGGTGTGATAATGTTGAGCTTCGTGCTTTCTGCTCTAAGCACTCAGAAGGTCGAGATAGGA
GCAGTGTTCAAGACCCTTTGGAAGCTGTTAACAGTAGCTCATATGTGGTCAACCACCTTCCAGTGACATTGTCAATTAACCGACCACAAAAGTTAGTTGGGAGAAGGAAT
ATTGATAGCCTTTTACTCTGTCAGGAGGCTTCTGATACCAACTCTGGAAAATTGGATGATGGCGAATTAGAAGATATAGGATCCGCTGATCCCAGTTTAAATGCTGCTTG
TGTTGATACACAGAAGTCTACTGTCCAGGGGGCTGGGGATTTGATTGATCAAGGAAAAGTGAATGTTAAAGATGTAGCCTCAGAAATTGGCATACCGCCTGATTTATTAT
GTGCAAAACTTACTGCTGGCAACATTGTCCCCGACTTGAAATCCAAAATTGTCCGATGGCTAAGAAATCATGCTTATATTGGTTCCTTGCAGAAGAATTTGAGAGTCAAA
CTTAAATCGGCTGTTTTAGCTAAGGCTGTTGTTGGAGCTGCTGATCGTTCTAACTCTTTGTCAGTACTTGATTCTGACAATTCAGATCTTATTGCTGATAAGATGGTTAC
ACCTCGAAGAAAAACTAAAAGCAATATTAGCCATTTGAAGAATGAAATCAAATCTTCATCTGAAGAAGCCTTGGGTGGCCATGGTCTAGCAATGCAAAGTGATGCCTTGA
ACCAGCAAGCTTGTGAAGAGCAGGGTGATTCAAAGAAGGAATGCATCCAAGACGCTGGTGAGAAGTTTAGTCTGTTGTCAGATGTCACTATCTTGTCCCTCACTCTAACC
AATTCACCCGAGAAGCCATATGGTCCTGAGCAAACCACTAAAGTGGCCTTAGGTTCAGTGATCAAGTGTAAGGAGGAAGCAAGTGGAAAGTTAATGATAGGGGACGAGAA
CTCTTCTTTCAGTCTTTTAGTTGAGAAACATAGAACATTAGGCATGCCAAAGCAAAATTTGGCCACCTTTAAGCAGAGATGGGTGCATTTAAACAAGTGTGGTAGTTCTC
AAGATTCCCCTTCAAGAAATTTTCCAAATGATGTAGAAGGCGATCAATTAGAGGTTTCTGTTTCTGGCCATAATTCTTCAATAAGCGCAGTTCATGGAAAGGCTGGAGAG
TCCCCTGGTTCTTATTTTCATCCATTTGTCCAAGAGAAGATGGCATATATGTTGCATGGAAAGCTTCTAAATGTGTCTGAAGGAGAGATGTCATGTTGGCAAGCATCTTC
CAATGTTGGCGGTTGCTGTGATCACCAACACCAGCATGTAGACTGCAATGACGCGAGTTGCAATTCTGGTGAATTTAGTCCGAAGCAGCAAGTGAATAAAAAAATAGGTG
GAGTCATAAAGTTGTCTCCAGAAGATGAAATTGAGGGAGAAATTATATTTTATCAGCACAGATTACTTGCAAATGCAGTTTCAAGAAAGCGGTTCACTGATCATTTAATT
TGTAACGTTGTTAAGAGTCTTCCAAAGGAGATTGATGAAGCAAGAAGTACTAGATGGGATGCTGTTCTTATTAATCAGTATTATAGTGAACTCAGAGAAGCAAAGAAACA
GGGTAAGAAAGAGAGAAGACATAAGGAAGCACAGGCTGTATTAGCTGCTGCAACGGCTGCTGCTGCTGCCTCTTCTAGGATGTCATCATTCAGAAAAGATGTATATGAAG
AATCTACTCACAGAGAGTTGATGCCTCGTGCAAAAGAAACACTTACTAAGGTTGCTCTCCCAAAGACTTCATCAGAGTCAGATTTCTGTAAAGAACACGCTAGATCCTGT
GATATCTGTAGGCGGCCAGAGACAATATTAAAGCCAATTTTAGTCTGCTCGAGCTGCAAGGTTTCGGTACATCTGGATTGCTATCGGACTGTGAAAGAATCCTCAGGTCC
ATGGTGTTGTGAATTGTGTGAAGAATTGTCACTATCAAGGGGGTCTGGAGCACCAGTAGTCAATATTTGGGAGAAATCATATTTTGTCGCAGAATGTGGTCTATGTGGTG
GCACCACAGGTGCATTCAGGAAATCATCTGATGGCCAGTGGGTTCATGCCTTCTGTGCAGAGTGGGTCTTTGAATCAACTTTCAAAAGAGGACAAGCGAATCCCGTGGGA
GGCATGGAGACAGTTTCTAAAGGGGTGGAATCTTGCTATATTTGTCACCGCAAGCATGGTGTCTGTTTGAAGTGCAATTATGGTCATTGCCAGTCTACTTTTCATCCCTC
ATGTGGTAGAAGTGCTGGATGTTACATGACTGTAAAGACTTCTGGTGGTAAGTTGCAGCACAGAGCATACTGTGAAAAACATAGCTCAGAGCAGAGAGCGAAGAGGGACC
TGGTTCTGTGTTCACATGATGTTCTTGCTTTTAAAAGAGACCATGTAGCACGATCAGTACTTGTTCGCTCGCCTTTCTTCCTACCTGAAGTTAGTTCTGAATCAGCTACG
ACATCACTGAAGGGACACGTGGAAGATTTAAAATCGTGCAGTGAAGCTGTGCAAAGATCAGATGATGTAACTGTGGACAGCACAGTTTCCATCAAGCACTGGAACAAAGT
ACCGTTGTCTTTGGACACTGAGCAAAAGACGGATGACGATAGTACCACATCTCAGAATCCATTCCCGCGAAAATTTGCAGACAGGGGTCAGTATGCTGGGAAGCAAATAC
CTCAGAGATCTTCAACTACTACATCACGCAATCTTGTAGATGTTGGGGGATTGAGGTTCAAGTCTAGAAAGCATGCCGAGACATTTCAAAAAGAGCTGGTAATGACGTCT
GACCAAGCATCAATGAAGAACTCTCTTCTACCTAAGCAGTATCTGTATGTTCCAGCTGATGTCCTTGCCAAGGAGAAGCAGGTCAACCAGGAAACAGGTTCTGCCGAGCC
ACCGAAATGTGACAGGTAG
mRNA sequenceShow/hide mRNA sequence
ATGGTGGTGAGAATAGGAATCTTTCCTGTCTTGGATGGATGTTTTAATTCGCACGATATTGTTGGCTTGATATGGTTCTTCTTTGGAACCTGCAGAACCTCCTTTCATCC
AATTTGTGCAAGGGAGGCTAGACATAGAATGGAGGTGTGGGCCAAATATGGGTGTGATAATGTTGAGCTTCGTGCTTTCTGCTCTAAGCACTCAGAAGGTCGAGATAGGA
GCAGTGTTCAAGACCCTTTGGAAGCTGTTAACAGTAGCTCATATGTGGTCAACCACCTTCCAGTGACATTGTCAATTAACCGACCACAAAAGTTAGTTGGGAGAAGGAAT
ATTGATAGCCTTTTACTCTGTCAGGAGGCTTCTGATACCAACTCTGGAAAATTGGATGATGGCGAATTAGAAGATATAGGATCCGCTGATCCCAGTTTAAATGCTGCTTG
TGTTGATACACAGAAGTCTACTGTCCAGGGGGCTGGGGATTTGATTGATCAAGGAAAAGTGAATGTTAAAGATGTAGCCTCAGAAATTGGCATACCGCCTGATTTATTAT
GTGCAAAACTTACTGCTGGCAACATTGTCCCCGACTTGAAATCCAAAATTGTCCGATGGCTAAGAAATCATGCTTATATTGGTTCCTTGCAGAAGAATTTGAGAGTCAAA
CTTAAATCGGCTGTTTTAGCTAAGGCTGTTGTTGGAGCTGCTGATCGTTCTAACTCTTTGTCAGTACTTGATTCTGACAATTCAGATCTTATTGCTGATAAGATGGTTAC
ACCTCGAAGAAAAACTAAAAGCAATATTAGCCATTTGAAGAATGAAATCAAATCTTCATCTGAAGAAGCCTTGGGTGGCCATGGTCTAGCAATGCAAAGTGATGCCTTGA
ACCAGCAAGCTTGTGAAGAGCAGGGTGATTCAAAGAAGGAATGCATCCAAGACGCTGGTGAGAAGTTTAGTCTGTTGTCAGATGTCACTATCTTGTCCCTCACTCTAACC
AATTCACCCGAGAAGCCATATGGTCCTGAGCAAACCACTAAAGTGGCCTTAGGTTCAGTGATCAAGTGTAAGGAGGAAGCAAGTGGAAAGTTAATGATAGGGGACGAGAA
CTCTTCTTTCAGTCTTTTAGTTGAGAAACATAGAACATTAGGCATGCCAAAGCAAAATTTGGCCACCTTTAAGCAGAGATGGGTGCATTTAAACAAGTGTGGTAGTTCTC
AAGATTCCCCTTCAAGAAATTTTCCAAATGATGTAGAAGGCGATCAATTAGAGGTTTCTGTTTCTGGCCATAATTCTTCAATAAGCGCAGTTCATGGAAAGGCTGGAGAG
TCCCCTGGTTCTTATTTTCATCCATTTGTCCAAGAGAAGATGGCATATATGTTGCATGGAAAGCTTCTAAATGTGTCTGAAGGAGAGATGTCATGTTGGCAAGCATCTTC
CAATGTTGGCGGTTGCTGTGATCACCAACACCAGCATGTAGACTGCAATGACGCGAGTTGCAATTCTGGTGAATTTAGTCCGAAGCAGCAAGTGAATAAAAAAATAGGTG
GAGTCATAAAGTTGTCTCCAGAAGATGAAATTGAGGGAGAAATTATATTTTATCAGCACAGATTACTTGCAAATGCAGTTTCAAGAAAGCGGTTCACTGATCATTTAATT
TGTAACGTTGTTAAGAGTCTTCCAAAGGAGATTGATGAAGCAAGAAGTACTAGATGGGATGCTGTTCTTATTAATCAGTATTATAGTGAACTCAGAGAAGCAAAGAAACA
GGGTAAGAAAGAGAGAAGACATAAGGAAGCACAGGCTGTATTAGCTGCTGCAACGGCTGCTGCTGCTGCCTCTTCTAGGATGTCATCATTCAGAAAAGATGTATATGAAG
AATCTACTCACAGAGAGTTGATGCCTCGTGCAAAAGAAACACTTACTAAGGTTGCTCTCCCAAAGACTTCATCAGAGTCAGATTTCTGTAAAGAACACGCTAGATCCTGT
GATATCTGTAGGCGGCCAGAGACAATATTAAAGCCAATTTTAGTCTGCTCGAGCTGCAAGGTTTCGGTACATCTGGATTGCTATCGGACTGTGAAAGAATCCTCAGGTCC
ATGGTGTTGTGAATTGTGTGAAGAATTGTCACTATCAAGGGGGTCTGGAGCACCAGTAGTCAATATTTGGGAGAAATCATATTTTGTCGCAGAATGTGGTCTATGTGGTG
GCACCACAGGTGCATTCAGGAAATCATCTGATGGCCAGTGGGTTCATGCCTTCTGTGCAGAGTGGGTCTTTGAATCAACTTTCAAAAGAGGACAAGCGAATCCCGTGGGA
GGCATGGAGACAGTTTCTAAAGGGGTGGAATCTTGCTATATTTGTCACCGCAAGCATGGTGTCTGTTTGAAGTGCAATTATGGTCATTGCCAGTCTACTTTTCATCCCTC
ATGTGGTAGAAGTGCTGGATGTTACATGACTGTAAAGACTTCTGGTGGTAAGTTGCAGCACAGAGCATACTGTGAAAAACATAGCTCAGAGCAGAGAGCGAAGAGGGACC
TGGTTCTGTGTTCACATGATGTTCTTGCTTTTAAAAGAGACCATGTAGCACGATCAGTACTTGTTCGCTCGCCTTTCTTCCTACCTGAAGTTAGTTCTGAATCAGCTACG
ACATCACTGAAGGGACACGTGGAAGATTTAAAATCGTGCAGTGAAGCTGTGCAAAGATCAGATGATGTAACTGTGGACAGCACAGTTTCCATCAAGCACTGGAACAAAGT
ACCGTTGTCTTTGGACACTGAGCAAAAGACGGATGACGATAGTACCACATCTCAGAATCCATTCCCGCGAAAATTTGCAGACAGGGGTCAGTATGCTGGGAAGCAAATAC
CTCAGAGATCTTCAACTACTACATCACGCAATCTTGTAGATGTTGGGGGATTGAGGTTCAAGTCTAGAAAGCATGCCGAGACATTTCAAAAAGAGCTGGTAATGACGTCT
GACCAAGCATCAATGAAGAACTCTCTTCTACCTAAGCAGTATCTGTATGTTCCAGCTGATGTCCTTGCCAAGGAGAAGCAGGTCAACCAGGAAACAGGTTCTGCCGAGCC
ACCGAAATGTGACAGGTAGTTGGAGAAAGCATTTCTTGTGTTTCAGATTTCCACGTTGGAGGCCAAACATCCGTATGCCTCGGGACGCATGTCCTGCTCGGAAACTGAGT
CCGAGAACAAGTGATATGGCCAATGGAGAAGAACAGCAGGGTGAATGCTGGGATGGCTCCACTGGGGCGTCGGGTTTTGACTTGGGCAGCAAAAGTTTTGGCATTGCAGA
TGTACGTTCGACGTCAATTGTAATTGTACAGGTGGGTGGGTGGAGTCGTACGGTTAAATGTCGAAATGGGGGCACAGATTGAGTTTTCCTTGTCTTTGAGATTGTCTTCC
TGGTGGGTAGAGTCAAAGTTTTAGAAGTATTGGCTGATTGTGTGTACGAGAAGTTGGGGGGAGGGGGAAAGGAAAGAGTTGCTGCTGTGAAGCTACTATTGTAAAGTTAA
TGAGTGTATAACTGTTAACGCAATCAATCACTCTGTCCATCCTCTTCAAACCAACTCACTGCCCACTTCTAAATAGTACAATAATGCTCAACCTTAAAATTTGTTCTACT
CTTTTATCCCTTCCATTAATGAGATTAGAAAATTTTTAGTTA
Protein sequenceShow/hide protein sequence
MVVRIGIFPVLDGCFNSHDIVGLIWFFFGTCRTSFHPICAREARHRMEVWAKYGCDNVELRAFCSKHSEGRDRSSVQDPLEAVNSSSYVVNHLPVTLSINRPQKLVGRRN
IDSLLLCQEASDTNSGKLDDGELEDIGSADPSLNAACVDTQKSTVQGAGDLIDQGKVNVKDVASEIGIPPDLLCAKLTAGNIVPDLKSKIVRWLRNHAYIGSLQKNLRVK
LKSAVLAKAVVGAADRSNSLSVLDSDNSDLIADKMVTPRRKTKSNISHLKNEIKSSSEEALGGHGLAMQSDALNQQACEEQGDSKKECIQDAGEKFSLLSDVTILSLTLT
NSPEKPYGPEQTTKVALGSVIKCKEEASGKLMIGDENSSFSLLVEKHRTLGMPKQNLATFKQRWVHLNKCGSSQDSPSRNFPNDVEGDQLEVSVSGHNSSISAVHGKAGE
SPGSYFHPFVQEKMAYMLHGKLLNVSEGEMSCWQASSNVGGCCDHQHQHVDCNDASCNSGEFSPKQQVNKKIGGVIKLSPEDEIEGEIIFYQHRLLANAVSRKRFTDHLI
CNVVKSLPKEIDEARSTRWDAVLINQYYSELREAKKQGKKERRHKEAQAVLAAATAAAAASSRMSSFRKDVYEESTHRELMPRAKETLTKVALPKTSSESDFCKEHARSC
DICRRPETILKPILVCSSCKVSVHLDCYRTVKESSGPWCCELCEELSLSRGSGAPVVNIWEKSYFVAECGLCGGTTGAFRKSSDGQWVHAFCAEWVFESTFKRGQANPVG
GMETVSKGVESCYICHRKHGVCLKCNYGHCQSTFHPSCGRSAGCYMTVKTSGGKLQHRAYCEKHSSEQRAKRDLVLCSHDVLAFKRDHVARSVLVRSPFFLPEVSSESAT
TSLKGHVEDLKSCSEAVQRSDDVTVDSTVSIKHWNKVPLSLDTEQKTDDDSTTSQNPFPRKFADRGQYAGKQIPQRSSTTTSRNLVDVGGLRFKSRKHAETFQKELVMTS
DQASMKNSLLPKQYLYVPADVLAKEKQVNQETGSAEPPKCDR