; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10009869 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10009869
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionPhd finger protein
Genome locationChr06:12415875..12422564
RNA-Seq ExpressionHG10009869
SyntenyHG10009869
Gene Ontology termsGO:0046872 - metal ion binding (molecular function)
InterPro domainsIPR001965 - Zinc finger, PHD-type
IPR011011 - Zinc finger, FYVE/PHD-type
IPR013083 - Zinc finger, RING/FYVE/PHD-type
IPR019786 - Zinc finger, PHD-type, conserved site
IPR019787 - Zinc finger, PHD-finger
IPR034732 - Extended PHD (ePHD) domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAE8648409.1 hypothetical protein Csa_008617 [Cucumis sativus]0.0e+0093.79Show/hide
Query:  QHLNKCGSSQDSPSRNFPNDVEGDQLEVSVSGHNSSISAVHGKAGESPGSYFHPFVQEKMAYMLHGKLLNVSEGEMSCWQASSNVGGCCDHQHQHVDCND
        +H+N+C SSQ SPSRNFPN VEG+QLE SVSGH+SSISAVHGKAGESPGSY HPFV+ KM YMLHGKLLNV EGE+SC Q SSN G C D QHQH+DCN+
Subjt:  QHLNKCGSSQDSPSRNFPNDVEGDQLEVSVSGHNSSISAVHGKAGESPGSYFHPFVQEKMAYMLHGKLLNVSEGEMSCWQASSNVGGCCDHQHQHVDCND

Query:  ASCNSGEFSPKQQVNKKIGGVIKLSPEDEIEGEIIFYQHRLLANAVSRKRFTDHLICNVVKSLPKEIDEARSTRWDAVLINQYYSELREAKKQGKKERRH
         SCNSG FSPKQQVNKKI G+IK+SPEDEIEGEIIFYQHRLLANAVSRK FTDHLICNVVKSLPKEIDEARSTRWDA+LINQYYS LREAKKQGKKERRH
Subjt:  ASCNSGEFSPKQQVNKKIGGVIKLSPEDEIEGEIIFYQHRLLANAVSRKRFTDHLICNVVKSLPKEIDEARSTRWDAVLINQYYSELREAKKQGKKERRH

Query:  KEAQAVLAAATAAAAASSRMSSFRKDVYEESTHRELMPRAKETLTKVALPKTSSESDFCKEHARSCDICRRPETILKPILVCSSCKVSVHLDCYRTVKES
        KEAQAVLAAATAAAAASSRMSSFRKDVYEESTHRELMPRAKET TKVALPKTS ESDFCKEHARSCDICRRPETILKPILVCSSCKVSVHLDCYRTVKES
Subjt:  KEAQAVLAAATAAAAASSRMSSFRKDVYEESTHRELMPRAKETLTKVALPKTSSESDFCKEHARSCDICRRPETILKPILVCSSCKVSVHLDCYRTVKES

Query:  SGPWCCELCEELSLSRGSGAPVVNIWEKSYFVAECGLCGGTTGAFRKSSDGQWVHAFCAEWVFESTFKRGQANPVGGMETVSKGVESCYICHRKHGVCLK
        SGPWCCELCEELSLSRGSGAPVVN WEKSYFVAECGLCGGTTGAFRKSSDGQWVHAFCAEWVFESTFKRGQANPVGGMETVSKG +SCYICHRKHGVCLK
Subjt:  SGPWCCELCEELSLSRGSGAPVVNIWEKSYFVAECGLCGGTTGAFRKSSDGQWVHAFCAEWVFESTFKRGQANPVGGMETVSKGVESCYICHRKHGVCLK

Query:  CNYGHCQSTFHPSCGRSAGCYMTVKTSGGKLQHRAYCEKHSSEQRAKAENQTHGIEELNRVKQIRVELERLRLLCERIIKREKIKRDLVLCSHDVLAFKR
        CNYGHCQSTFHPSCGRSAGCYMTVK+SGGKLQHRAYCEKHSSEQRAKAENQTHGIEELNRVKQIRVELERLRLLCERIIKREKIKRDLVLCSHDVLAFKR
Subjt:  CNYGHCQSTFHPSCGRSAGCYMTVKTSGGKLQHRAYCEKHSSEQRAKAENQTHGIEELNRVKQIRVELERLRLLCERIIKREKIKRDLVLCSHDVLAFKR

Query:  DHVARSVLVRSPFFLPEVSSESATTSLKGHVEDLKSCSEAVQRSDDVTVDSTVSIKHWNKVPLSLDTEQKTDDDSTTSQNPFPRKFADRGQYAGKQIPQR
        DHVARSVLVRSPFFLPEVSSESATTSLKGHVEDLKSCSEAVQRSDDVTVDSTVSIKHWNKVPLSLDTEQKTDDDSTTSQNPFPRKF DRGQYAGKQIPQR
Subjt:  DHVARSVLVRSPFFLPEVSSESATTSLKGHVEDLKSCSEAVQRSDDVTVDSTVSIKHWNKVPLSLDTEQKTDDDSTTSQNPFPRKFADRGQYAGKQIPQR

Query:  SSTTTSRNLVDVGGLRFKSRKHAETFQKELVMTSDQASMKNSLLPKQYLYVPADVLAKEKQVNQETGSAEPPKCDR
        SSTTTSRNL+D GGLRFKS+KHAETFQKELVMTS+QASMKNSLLPKQYLYVPADVLAKEKQVNQETGSAEPPKCDR
Subjt:  SSTTTSRNLVDVGGLRFKSRKHAETFQKELVMTSDQASMKNSLLPKQYLYVPADVLAKEKQVNQETGSAEPPKCDR

XP_011655201.1 uncharacterized protein LOC101212864 isoform X1 [Cucumis sativus]0.0e+0092.96Show/hide
Query:  QHLNKCGSSQDSPSRNFPND-VEGDQLEVSVSGHNSSISAVHG-----KAGESPGSYFHPFVQEKMAYMLHGKLLNVSEGEMSCWQASSNVGGCCDHQHQ
        +H+N+C SSQ SPSRNFPN  VEG+QLE SVSGH+SSISAVHG     KAGESPGSY HPFV+ KM YMLHGKLLNV EGE+SC Q SSN G C D QHQ
Subjt:  QHLNKCGSSQDSPSRNFPND-VEGDQLEVSVSGHNSSISAVHG-----KAGESPGSYFHPFVQEKMAYMLHGKLLNVSEGEMSCWQASSNVGGCCDHQHQ

Query:  HVDCNDASCNSGEFSPKQQVNKKIGGVIKLSPEDEIEGEIIFYQHRLLANAVSRKRFTDHLICNVVKSLPKEIDEARSTRWDAVLINQYYSELREAKKQG
        H+DCN+ SCNSG FSPKQQVNKKI G+IK+SPEDEIEGEIIFYQHRLLANAVSRK FTDHLICNVVKSLPKEIDEARSTRWDA+LINQYYS LREAKKQG
Subjt:  HVDCNDASCNSGEFSPKQQVNKKIGGVIKLSPEDEIEGEIIFYQHRLLANAVSRKRFTDHLICNVVKSLPKEIDEARSTRWDAVLINQYYSELREAKKQG

Query:  KKERRHKEAQAVLAAATAAAAASSRMSSFRKDVYEESTHRELMPRAKETLTKVALPKTSSESDFCKEHARSCDICRRPETILKPILVCSSCKVSVHLDCY
        KKERRHKEAQAVLAAATAAAAASSRMSSFRKDVYEESTHRELMPRAKET TKVALPKTS ESDFCKEHARSCDICRRPETILKPILVCSSCKVSVHLDCY
Subjt:  KKERRHKEAQAVLAAATAAAAASSRMSSFRKDVYEESTHRELMPRAKETLTKVALPKTSSESDFCKEHARSCDICRRPETILKPILVCSSCKVSVHLDCY

Query:  RTVKESSGPWCCELCEELSLSRGSGAPVVNIWEKSYFVAECGLCGGTTGAFRKSSDGQWVHAFCAEWVFESTFKRGQANPVGGMETVSKGVESCYICHRK
        RTVKESSGPWCCELCEELSLSRGSGAPVVN WEKSYFVAECGLCGGTTGAFRKSSDGQWVHAFCAEWVFESTFKRGQANPVGGMETVSKG +SCYICHRK
Subjt:  RTVKESSGPWCCELCEELSLSRGSGAPVVNIWEKSYFVAECGLCGGTTGAFRKSSDGQWVHAFCAEWVFESTFKRGQANPVGGMETVSKGVESCYICHRK

Query:  HGVCLKCNYGHCQSTFHPSCGRSAGCYMTVKTSGGKLQHRAYCEKHSSEQRAKAENQTHGIEELNRVKQIRVELERLRLLCERIIKREKIKRDLVLCSHD
        HGVCLKCNYGHCQSTFHPSCGRSAGCYMTVK+SGGKLQHRAYCEKHSSEQRAKAENQTHGIEELNRVKQIRVELERLRLLCERIIKREKIKRDLVLCSHD
Subjt:  HGVCLKCNYGHCQSTFHPSCGRSAGCYMTVKTSGGKLQHRAYCEKHSSEQRAKAENQTHGIEELNRVKQIRVELERLRLLCERIIKREKIKRDLVLCSHD

Query:  VLAFKRDHVARSVLVRSPFFLPEVSSESATTSLKGHVEDLKSCSEAVQRSDDVTVDSTVSIKHWNKVPLSLDTEQKTDDDSTTSQNPFPRKFADRGQYAG
        VLAFKRDHVARSVLVRSPFFLPEVSSESATTSLKGHVEDLKSCSEAVQRSDDVTVDSTVSIKHWNKVPLSLDTEQKTDDDSTTSQNPFPRKF DRGQYAG
Subjt:  VLAFKRDHVARSVLVRSPFFLPEVSSESATTSLKGHVEDLKSCSEAVQRSDDVTVDSTVSIKHWNKVPLSLDTEQKTDDDSTTSQNPFPRKFADRGQYAG

Query:  KQIPQRSSTTTSRNLVDVGGLRFKSRKHAETFQKELVMTSDQASMKNSLLPKQYLYVPADVLAKEKQVNQETGSAEPPKCDR
        KQIPQRSSTTTSRNL+D GGLRFKS+KHAETFQKELVMTS+QASMKNSLLPKQYLYVPADVLAKEKQVNQETGSAEPPKCDR
Subjt:  KQIPQRSSTTTSRNLVDVGGLRFKSRKHAETFQKELVMTSDQASMKNSLLPKQYLYVPADVLAKEKQVNQETGSAEPPKCDR

XP_011655203.1 uncharacterized protein LOC101212864 isoform X2 [Cucumis sativus]0.0e+0093.65Show/hide
Query:  QHLNKCGSSQDSPSRNFPND-VEGDQLEVSVSGHNSSISAVHGKAGESPGSYFHPFVQEKMAYMLHGKLLNVSEGEMSCWQASSNVGGCCDHQHQHVDCN
        +H+N+C SSQ SPSRNFPN  VEG+QLE SVSGH+SSISAVHGKAGESPGSY HPFV+ KM YMLHGKLLNV EGE+SC Q SSN G C D QHQH+DCN
Subjt:  QHLNKCGSSQDSPSRNFPND-VEGDQLEVSVSGHNSSISAVHGKAGESPGSYFHPFVQEKMAYMLHGKLLNVSEGEMSCWQASSNVGGCCDHQHQHVDCN

Query:  DASCNSGEFSPKQQVNKKIGGVIKLSPEDEIEGEIIFYQHRLLANAVSRKRFTDHLICNVVKSLPKEIDEARSTRWDAVLINQYYSELREAKKQGKKERR
        + SCNSG FSPKQQVNKKI G+IK+SPEDEIEGEIIFYQHRLLANAVSRK FTDHLICNVVKSLPKEIDEARSTRWDA+LINQYYS LREAKKQGKKERR
Subjt:  DASCNSGEFSPKQQVNKKIGGVIKLSPEDEIEGEIIFYQHRLLANAVSRKRFTDHLICNVVKSLPKEIDEARSTRWDAVLINQYYSELREAKKQGKKERR

Query:  HKEAQAVLAAATAAAAASSRMSSFRKDVYEESTHRELMPRAKETLTKVALPKTSSESDFCKEHARSCDICRRPETILKPILVCSSCKVSVHLDCYRTVKE
        HKEAQAVLAAATAAAAASSRMSSFRKDVYEESTHRELMPRAKET TKVALPKTS ESDFCKEHARSCDICRRPETILKPILVCSSCKVSVHLDCYRTVKE
Subjt:  HKEAQAVLAAATAAAAASSRMSSFRKDVYEESTHRELMPRAKETLTKVALPKTSSESDFCKEHARSCDICRRPETILKPILVCSSCKVSVHLDCYRTVKE

Query:  SSGPWCCELCEELSLSRGSGAPVVNIWEKSYFVAECGLCGGTTGAFRKSSDGQWVHAFCAEWVFESTFKRGQANPVGGMETVSKGVESCYICHRKHGVCL
        SSGPWCCELCEELSLSRGSGAPVVN WEKSYFVAECGLCGGTTGAFRKSSDGQWVHAFCAEWVFESTFKRGQANPVGGMETVSKG +SCYICHRKHGVCL
Subjt:  SSGPWCCELCEELSLSRGSGAPVVNIWEKSYFVAECGLCGGTTGAFRKSSDGQWVHAFCAEWVFESTFKRGQANPVGGMETVSKGVESCYICHRKHGVCL

Query:  KCNYGHCQSTFHPSCGRSAGCYMTVKTSGGKLQHRAYCEKHSSEQRAKAENQTHGIEELNRVKQIRVELERLRLLCERIIKREKIKRDLVLCSHDVLAFK
        KCNYGHCQSTFHPSCGRSAGCYMTVK+SGGKLQHRAYCEKHSSEQRAKAENQTHGIEELNRVKQIRVELERLRLLCERIIKREKIKRDLVLCSHDVLAFK
Subjt:  KCNYGHCQSTFHPSCGRSAGCYMTVKTSGGKLQHRAYCEKHSSEQRAKAENQTHGIEELNRVKQIRVELERLRLLCERIIKREKIKRDLVLCSHDVLAFK

Query:  RDHVARSVLVRSPFFLPEVSSESATTSLKGHVEDLKSCSEAVQRSDDVTVDSTVSIKHWNKVPLSLDTEQKTDDDSTTSQNPFPRKFADRGQYAGKQIPQ
        RDHVARSVLVRSPFFLPEVSSESATTSLKGHVEDLKSCSEAVQRSDDVTVDSTVSIKHWNKVPLSLDTEQKTDDDSTTSQNPFPRKF DRGQYAGKQIPQ
Subjt:  RDHVARSVLVRSPFFLPEVSSESATTSLKGHVEDLKSCSEAVQRSDDVTVDSTVSIKHWNKVPLSLDTEQKTDDDSTTSQNPFPRKFADRGQYAGKQIPQ

Query:  RSSTTTSRNLVDVGGLRFKSRKHAETFQKELVMTSDQASMKNSLLPKQYLYVPADVLAKEKQVNQETGSAEPPKCDR
        RSSTTTSRNL+D GGLRFKS+KHAETFQKELVMTS+QASMKNSLLPKQYLYVPADVLAKEKQVNQETGSAEPPKCDR
Subjt:  RSSTTTSRNLVDVGGLRFKSRKHAETFQKELVMTSDQASMKNSLLPKQYLYVPADVLAKEKQVNQETGSAEPPKCDR

XP_038875392.1 uncharacterized protein LOC120067860 isoform X1 [Benincasa hispida]0.0e+0093.35Show/hide
Query:  QHLNKCGSSQDSPSRNFPNDVEGDQLEVSVSGHNSSISAVHGKAGESPGSYFHPFVQEKMAYMLHGKLLNVSEGEMSCWQASSNVGGCCDHQHQHVDCND
        +HLN+C     SPSRNFPND+ GDQL+VSVSGHNSSI AVHGKAGESPG YFHPFVQEKMAYMLHGKLLNVSEGE SC QASSN  GCCDHQHQH+DCND
Subjt:  QHLNKCGSSQDSPSRNFPNDVEGDQLEVSVSGHNSSISAVHGKAGESPGSYFHPFVQEKMAYMLHGKLLNVSEGEMSCWQASSNVGGCCDHQHQHVDCND

Query:  ASCNSGEFSPKQQVNKKIGGVIKLSPEDEIEGEIIFYQHRLLANAVSRKRFTDHLICNVVKSLPKEIDEARSTRWDAVLINQYYSELREAKKQGKKERRH
         SCNSG FSPKQQVNKKI G+IKLSPEDEIEGEIIFYQHRLLANAVSRK+F DHLICNVVKSLPKE+D+ARSTRWDAVLINQYYSELREAKKQGKKERRH
Subjt:  ASCNSGEFSPKQQVNKKIGGVIKLSPEDEIEGEIIFYQHRLLANAVSRKRFTDHLICNVVKSLPKEIDEARSTRWDAVLINQYYSELREAKKQGKKERRH

Query:  KEAQAVLAAATAAAAASSRMSSFRKDVYEESTHRELMPRAKETLTKVALPKTSSESDFCKEHARSCDICRRPETILKPILVCSSCKVSVHLDCYRTVKES
        KEAQAVLAAATAAAAASSRMSSFRKD+YEES HRELMPRAK+TLTKVAL KTS ESD CKEH RSCDICRRPETILKPILVCSSCKVSVHLDCYRTVKES
Subjt:  KEAQAVLAAATAAAAASSRMSSFRKDVYEESTHRELMPRAKETLTKVALPKTSSESDFCKEHARSCDICRRPETILKPILVCSSCKVSVHLDCYRTVKES

Query:  SGPWCCELCEELSLSRGSGAPVVNIWEKSYFVAECGLCGGTTGAFRKSSDGQWVHAFCAEWVFESTFKRGQANPVGGMETVSKGVESCYICHRKHGVCLK
        SGPWCCELCEELSLSRGSGA VVN W+KSYFVAECGLCGGTTGAFRKSSDGQWVHAFCAEWVFESTFKRGQANPVGGMETVSKGV+SCYICHRKHGVCLK
Subjt:  SGPWCCELCEELSLSRGSGAPVVNIWEKSYFVAECGLCGGTTGAFRKSSDGQWVHAFCAEWVFESTFKRGQANPVGGMETVSKGVESCYICHRKHGVCLK

Query:  CNYGHCQSTFHPSCGRSAGCYMTVKTSGGKLQHRAYCEKHSSEQRAKAENQTHGIEELNRVKQIRVELERLRLLCERIIKREKIKRDLVLCSHDVLAFKR
        CNYGHCQSTFHPSCGR+AGCYMTVK+SGGKLQHRAYCEKHSSEQRAKAENQTHGIEELNRVKQIRVELERLRLLCERIIKREKIKRDLVLCSHDVLAFKR
Subjt:  CNYGHCQSTFHPSCGRSAGCYMTVKTSGGKLQHRAYCEKHSSEQRAKAENQTHGIEELNRVKQIRVELERLRLLCERIIKREKIKRDLVLCSHDVLAFKR

Query:  DHVARSVLVRSPFFLPEVSSESATTSLKGHVEDLKSCSEAVQRSDDVTVDSTVSIKHWNKVPLSLDTEQKTDDDSTTSQNPFPRKFADRGQYAGKQIPQR
        DHVARSVLVRSPFFLPEVSSESATTSLKGHVEDLKSCSEAVQRSDDVTVDSTVSIK WNKVPLSLDTEQKTDDDSTTSQNPF RKFADRGQYAGKQIPQR
Subjt:  DHVARSVLVRSPFFLPEVSSESATTSLKGHVEDLKSCSEAVQRSDDVTVDSTVSIKHWNKVPLSLDTEQKTDDDSTTSQNPFPRKFADRGQYAGKQIPQR

Query:  SSTTTSRNLVDVGGL-RFKSRKHAETFQKELVMTSDQASMKNSLLPKQYLYVPADVLAKEKQVNQETGSAEPPKCDR
        SSTT SRNLVDVGGL R KSRKHAETFQKELVMTSDQASMKNSLLPKQYLYVPADVLAKEKQVNQET SAE PKCDR
Subjt:  SSTTTSRNLVDVGGL-RFKSRKHAETFQKELVMTSDQASMKNSLLPKQYLYVPADVLAKEKQVNQETGSAEPPKCDR

XP_038875394.1 uncharacterized protein LOC120067860 isoform X2 [Benincasa hispida]0.0e+0093.35Show/hide
Query:  QHLNKCGSSQDSPSRNFPNDVEGDQLEVSVSGHNSSISAVHGKAGESPGSYFHPFVQEKMAYMLHGKLLNVSEGEMSCWQASSNVGGCCDHQHQHVDCND
        +HLN+C     SPSRNFPND+ GDQL+VSVSGHNSSI AVHGKAGESPG YFHPFVQEKMAYMLHGKLLNVSEGE SC QASSN  GCCDHQHQH+DCND
Subjt:  QHLNKCGSSQDSPSRNFPNDVEGDQLEVSVSGHNSSISAVHGKAGESPGSYFHPFVQEKMAYMLHGKLLNVSEGEMSCWQASSNVGGCCDHQHQHVDCND

Query:  ASCNSGEFSPKQQVNKKIGGVIKLSPEDEIEGEIIFYQHRLLANAVSRKRFTDHLICNVVKSLPKEIDEARSTRWDAVLINQYYSELREAKKQGKKERRH
         SCNSG FSPKQQVNKKI G+IKLSPEDEIEGEIIFYQHRLLANAVSRK+F DHLICNVVKSLPKE+D+ARSTRWDAVLINQYYSELREAKKQGKKERRH
Subjt:  ASCNSGEFSPKQQVNKKIGGVIKLSPEDEIEGEIIFYQHRLLANAVSRKRFTDHLICNVVKSLPKEIDEARSTRWDAVLINQYYSELREAKKQGKKERRH

Query:  KEAQAVLAAATAAAAASSRMSSFRKDVYEESTHRELMPRAKETLTKVALPKTSSESDFCKEHARSCDICRRPETILKPILVCSSCKVSVHLDCYRTVKES
        KEAQAVLAAATAAAAASSRMSSFRKD+YEES HRELMPRAK+TLTKVAL KTS ESD CKEH RSCDICRRPETILKPILVCSSCKVSVHLDCYRTVKES
Subjt:  KEAQAVLAAATAAAAASSRMSSFRKDVYEESTHRELMPRAKETLTKVALPKTSSESDFCKEHARSCDICRRPETILKPILVCSSCKVSVHLDCYRTVKES

Query:  SGPWCCELCEELSLSRGSGAPVVNIWEKSYFVAECGLCGGTTGAFRKSSDGQWVHAFCAEWVFESTFKRGQANPVGGMETVSKGVESCYICHRKHGVCLK
        SGPWCCELCEELSLSRGSGA VVN W+KSYFVAECGLCGGTTGAFRKSSDGQWVHAFCAEWVFESTFKRGQANPVGGMETVSKGV+SCYICHRKHGVCLK
Subjt:  SGPWCCELCEELSLSRGSGAPVVNIWEKSYFVAECGLCGGTTGAFRKSSDGQWVHAFCAEWVFESTFKRGQANPVGGMETVSKGVESCYICHRKHGVCLK

Query:  CNYGHCQSTFHPSCGRSAGCYMTVKTSGGKLQHRAYCEKHSSEQRAKAENQTHGIEELNRVKQIRVELERLRLLCERIIKREKIKRDLVLCSHDVLAFKR
        CNYGHCQSTFHPSCGR+AGCYMTVK+SGGKLQHRAYCEKHSSEQRAKAENQTHGIEELNRVKQIRVELERLRLLCERIIKREKIKRDLVLCSHDVLAFKR
Subjt:  CNYGHCQSTFHPSCGRSAGCYMTVKTSGGKLQHRAYCEKHSSEQRAKAENQTHGIEELNRVKQIRVELERLRLLCERIIKREKIKRDLVLCSHDVLAFKR

Query:  DHVARSVLVRSPFFLPEVSSESATTSLKGHVEDLKSCSEAVQRSDDVTVDSTVSIKHWNKVPLSLDTEQKTDDDSTTSQNPFPRKFADRGQYAGKQIPQR
        DHVARSVLVRSPFFLPEVSSESATTSLKGHVEDLKSCSEAVQRSDDVTVDSTVSIK WNKVPLSLDTEQKTDDDSTTSQNPF RKFADRGQYAGKQIPQR
Subjt:  DHVARSVLVRSPFFLPEVSSESATTSLKGHVEDLKSCSEAVQRSDDVTVDSTVSIKHWNKVPLSLDTEQKTDDDSTTSQNPFPRKFADRGQYAGKQIPQR

Query:  SSTTTSRNLVDVGGL-RFKSRKHAETFQKELVMTSDQASMKNSLLPKQYLYVPADVLAKEKQVNQETGSAEPPKCDR
        SSTT SRNLVDVGGL R KSRKHAETFQKELVMTSDQASMKNSLLPKQYLYVPADVLAKEKQVNQET SAE PKCDR
Subjt:  SSTTTSRNLVDVGGL-RFKSRKHAETFQKELVMTSDQASMKNSLLPKQYLYVPADVLAKEKQVNQETGSAEPPKCDR

TrEMBL top hitse value%identityAlignment
A0A0A0KTJ7 Uncharacterized protein0.0e+0093.65Show/hide
Query:  QHLNKCGSSQDSPSRNFPND-VEGDQLEVSVSGHNSSISAVHGKAGESPGSYFHPFVQEKMAYMLHGKLLNVSEGEMSCWQASSNVGGCCDHQHQHVDCN
        +H+N+C SSQ SPSRNFPN  VEG+QLE SVSGH+SSISAVHGKAGESPGSY HPFV+ KM YMLHGKLLNV EGE+SC Q SSN G C D QHQH+DCN
Subjt:  QHLNKCGSSQDSPSRNFPND-VEGDQLEVSVSGHNSSISAVHGKAGESPGSYFHPFVQEKMAYMLHGKLLNVSEGEMSCWQASSNVGGCCDHQHQHVDCN

Query:  DASCNSGEFSPKQQVNKKIGGVIKLSPEDEIEGEIIFYQHRLLANAVSRKRFTDHLICNVVKSLPKEIDEARSTRWDAVLINQYYSELREAKKQGKKERR
        + SCNSG FSPKQQVNKKI G+IK+SPEDEIEGEIIFYQHRLLANAVSRK FTDHLICNVVKSLPKEIDEARSTRWDA+LINQYYS LREAKKQGKKERR
Subjt:  DASCNSGEFSPKQQVNKKIGGVIKLSPEDEIEGEIIFYQHRLLANAVSRKRFTDHLICNVVKSLPKEIDEARSTRWDAVLINQYYSELREAKKQGKKERR

Query:  HKEAQAVLAAATAAAAASSRMSSFRKDVYEESTHRELMPRAKETLTKVALPKTSSESDFCKEHARSCDICRRPETILKPILVCSSCKVSVHLDCYRTVKE
        HKEAQAVLAAATAAAAASSRMSSFRKDVYEESTHRELMPRAKET TKVALPKTS ESDFCKEHARSCDICRRPETILKPILVCSSCKVSVHLDCYRTVKE
Subjt:  HKEAQAVLAAATAAAAASSRMSSFRKDVYEESTHRELMPRAKETLTKVALPKTSSESDFCKEHARSCDICRRPETILKPILVCSSCKVSVHLDCYRTVKE

Query:  SSGPWCCELCEELSLSRGSGAPVVNIWEKSYFVAECGLCGGTTGAFRKSSDGQWVHAFCAEWVFESTFKRGQANPVGGMETVSKGVESCYICHRKHGVCL
        SSGPWCCELCEELSLSRGSGAPVVN WEKSYFVAECGLCGGTTGAFRKSSDGQWVHAFCAEWVFESTFKRGQANPVGGMETVSKG +SCYICHRKHGVCL
Subjt:  SSGPWCCELCEELSLSRGSGAPVVNIWEKSYFVAECGLCGGTTGAFRKSSDGQWVHAFCAEWVFESTFKRGQANPVGGMETVSKGVESCYICHRKHGVCL

Query:  KCNYGHCQSTFHPSCGRSAGCYMTVKTSGGKLQHRAYCEKHSSEQRAKAENQTHGIEELNRVKQIRVELERLRLLCERIIKREKIKRDLVLCSHDVLAFK
        KCNYGHCQSTFHPSCGRSAGCYMTVK+SGGKLQHRAYCEKHSSEQRAKAENQTHGIEELNRVKQIRVELERLRLLCERIIKREKIKRDLVLCSHDVLAFK
Subjt:  KCNYGHCQSTFHPSCGRSAGCYMTVKTSGGKLQHRAYCEKHSSEQRAKAENQTHGIEELNRVKQIRVELERLRLLCERIIKREKIKRDLVLCSHDVLAFK

Query:  RDHVARSVLVRSPFFLPEVSSESATTSLKGHVEDLKSCSEAVQRSDDVTVDSTVSIKHWNKVPLSLDTEQKTDDDSTTSQNPFPRKFADRGQYAGKQIPQ
        RDHVARSVLVRSPFFLPEVSSESATTSLKGHVEDLKSCSEAVQRSDDVTVDSTVSIKHWNKVPLSLDTEQKTDDDSTTSQNPFPRKF DRGQYAGKQIPQ
Subjt:  RDHVARSVLVRSPFFLPEVSSESATTSLKGHVEDLKSCSEAVQRSDDVTVDSTVSIKHWNKVPLSLDTEQKTDDDSTTSQNPFPRKFADRGQYAGKQIPQ

Query:  RSSTTTSRNLVDVGGLRFKSRKHAETFQKELVMTSDQASMKNSLLPKQYLYVPADVLAKEKQVNQETGSAEPPKCDR
        RSSTTTSRNL+D GGLRFKS+KHAETFQKELVMTS+QASMKNSLLPKQYLYVPADVLAKEKQVNQETGSAEPPKCDR
Subjt:  RSSTTTSRNLVDVGGLRFKSRKHAETFQKELVMTSDQASMKNSLLPKQYLYVPADVLAKEKQVNQETGSAEPPKCDR

A0A1S3CC47 uncharacterized protein LOC103499277 isoform X30.0e+0092.89Show/hide
Query:  LNKCGSSQDSPSRNFPND-VEGDQLEVSVSGHNSSISAVHGKAGESPGSYFHPFVQEKMAYMLHGKLLNVSEGEMSCWQASSNVGGCCDHQHQHVDCNDA
        +N+C SSQ SPSRNFPN  VEG+QLE SVSGH+SSISAVHGKAGES GSY HPFV+EKM YMLHGKLLNV  GE+S  Q S + G C DHQHQH+DC D 
Subjt:  LNKCGSSQDSPSRNFPND-VEGDQLEVSVSGHNSSISAVHGKAGESPGSYFHPFVQEKMAYMLHGKLLNVSEGEMSCWQASSNVGGCCDHQHQHVDCNDA

Query:  SCNSGEFSPKQQVNKKIGGVIKLSPEDEIEGEIIFYQHRLLANAVSRKRFTDHLICNVVKSLPKEIDEARSTRWDAVLINQYYSELREAKKQGKKERRHK
        SCNSGEFSPKQQ NKKIGG+IK+SPEDEIEGEIIFYQHRLLANAVSRKRFTDHLICNVVKSLPKEIDEARSTRWDA+LINQYYS LREAKKQGKKERRHK
Subjt:  SCNSGEFSPKQQVNKKIGGVIKLSPEDEIEGEIIFYQHRLLANAVSRKRFTDHLICNVVKSLPKEIDEARSTRWDAVLINQYYSELREAKKQGKKERRHK

Query:  EAQAVLAAATAAAAASSRMSSFRKDVYEESTHRELMPRAKETLTKVALPKTSSESDFCKEHARSCDICRRPETILKPILVCSSCKVSVHLDCYRTVKESS
        EAQAVLAAATAAAAASSRMSSFRKDVYEESTHRELMPRAKET TKVALPKTS ESDFCKEHARSCDICRRPET+LKPILVCSSCKVSVHLDCYRTVKESS
Subjt:  EAQAVLAAATAAAAASSRMSSFRKDVYEESTHRELMPRAKETLTKVALPKTSSESDFCKEHARSCDICRRPETILKPILVCSSCKVSVHLDCYRTVKESS

Query:  GPWCCELCEELSLSRGSGAPVVNIWEKSYFVAECGLCGGTTGAFRKSSDGQWVHAFCAEWVFESTFKRGQANPVGGMETVSKGVESCYICHRKHGVCLKC
        GPWCCELCEELSLSRGSGAPVVN WEKSYFVAECGLCGGTTGAFRKSSDGQWVHAFCAEWVFESTFKRGQ NPVGGMETVSKG +SCYICHRKHGVCLKC
Subjt:  GPWCCELCEELSLSRGSGAPVVNIWEKSYFVAECGLCGGTTGAFRKSSDGQWVHAFCAEWVFESTFKRGQANPVGGMETVSKGVESCYICHRKHGVCLKC

Query:  NYGHCQSTFHPSCGRSAGCYMTVKTSGGKLQHRAYCEKHSSEQRAKAENQTHGIEELNRVKQIRVELERLRLLCERIIKREKIKRDLVLCSHDVLAFKRD
        NYGHCQSTFHPSC RSAGCYMTVK+SGGKLQHRAYCEKHSSEQRAKAENQTHGIEELNRVKQIRVELERLRLLCERIIKREKIKRDLVLCSHDVLAFKRD
Subjt:  NYGHCQSTFHPSCGRSAGCYMTVKTSGGKLQHRAYCEKHSSEQRAKAENQTHGIEELNRVKQIRVELERLRLLCERIIKREKIKRDLVLCSHDVLAFKRD

Query:  HVARSVLVRSPFFLPEVSSESATTSLKGHVEDLKSCSEAVQRSDDVTVDSTVSIKHWNKVPLSLDTEQKTDDDSTTSQNPFPRKFADRGQYAGKQIPQRS
        HVARSVLVRSPFFLPEVSSESATTSLKGHVEDLKSCSEAVQRSDDVTVDSTVSIKHWNKVPLSLDTEQKTD+DSTTSQNPFPRKFADR  YAGKQIPQRS
Subjt:  HVARSVLVRSPFFLPEVSSESATTSLKGHVEDLKSCSEAVQRSDDVTVDSTVSIKHWNKVPLSLDTEQKTDDDSTTSQNPFPRKFADRGQYAGKQIPQRS

Query:  STTTSRNLVDVGGLRFKSRKHAETFQKELVMTSDQASMKNSLLPKQYLYVPADVLAKEKQVNQETGSAEPPKCDR
        STTTSRNL+D GGLRFKSRKHAETFQKELVMTSDQASMKNSLLPKQYLYVPADVLAKEKQ NQE GSAEPPKCDR
Subjt:  STTTSRNLVDVGGLRFKSRKHAETFQKELVMTSDQASMKNSLLPKQYLYVPADVLAKEKQVNQETGSAEPPKCDR

A0A1S3CD10 uncharacterized protein LOC103499277 isoform X10.0e+0092.89Show/hide
Query:  LNKCGSSQDSPSRNFPND-VEGDQLEVSVSGHNSSISAVHGKAGESPGSYFHPFVQEKMAYMLHGKLLNVSEGEMSCWQASSNVGGCCDHQHQHVDCNDA
        +N+C SSQ SPSRNFPN  VEG+QLE SVSGH+SSISAVHGKAGES GSY HPFV+EKM YMLHGKLLNV  GE+S  Q S + G C DHQHQH+DC D 
Subjt:  LNKCGSSQDSPSRNFPND-VEGDQLEVSVSGHNSSISAVHGKAGESPGSYFHPFVQEKMAYMLHGKLLNVSEGEMSCWQASSNVGGCCDHQHQHVDCNDA

Query:  SCNSGEFSPKQQVNKKIGGVIKLSPEDEIEGEIIFYQHRLLANAVSRKRFTDHLICNVVKSLPKEIDEARSTRWDAVLINQYYSELREAKKQGKKERRHK
        SCNSGEFSPKQQ NKKIGG+IK+SPEDEIEGEIIFYQHRLLANAVSRKRFTDHLICNVVKSLPKEIDEARSTRWDA+LINQYYS LREAKKQGKKERRHK
Subjt:  SCNSGEFSPKQQVNKKIGGVIKLSPEDEIEGEIIFYQHRLLANAVSRKRFTDHLICNVVKSLPKEIDEARSTRWDAVLINQYYSELREAKKQGKKERRHK

Query:  EAQAVLAAATAAAAASSRMSSFRKDVYEESTHRELMPRAKETLTKVALPKTSSESDFCKEHARSCDICRRPETILKPILVCSSCKVSVHLDCYRTVKESS
        EAQAVLAAATAAAAASSRMSSFRKDVYEESTHRELMPRAKET TKVALPKTS ESDFCKEHARSCDICRRPET+LKPILVCSSCKVSVHLDCYRTVKESS
Subjt:  EAQAVLAAATAAAAASSRMSSFRKDVYEESTHRELMPRAKETLTKVALPKTSSESDFCKEHARSCDICRRPETILKPILVCSSCKVSVHLDCYRTVKESS

Query:  GPWCCELCEELSLSRGSGAPVVNIWEKSYFVAECGLCGGTTGAFRKSSDGQWVHAFCAEWVFESTFKRGQANPVGGMETVSKGVESCYICHRKHGVCLKC
        GPWCCELCEELSLSRGSGAPVVN WEKSYFVAECGLCGGTTGAFRKSSDGQWVHAFCAEWVFESTFKRGQ NPVGGMETVSKG +SCYICHRKHGVCLKC
Subjt:  GPWCCELCEELSLSRGSGAPVVNIWEKSYFVAECGLCGGTTGAFRKSSDGQWVHAFCAEWVFESTFKRGQANPVGGMETVSKGVESCYICHRKHGVCLKC

Query:  NYGHCQSTFHPSCGRSAGCYMTVKTSGGKLQHRAYCEKHSSEQRAKAENQTHGIEELNRVKQIRVELERLRLLCERIIKREKIKRDLVLCSHDVLAFKRD
        NYGHCQSTFHPSC RSAGCYMTVK+SGGKLQHRAYCEKHSSEQRAKAENQTHGIEELNRVKQIRVELERLRLLCERIIKREKIKRDLVLCSHDVLAFKRD
Subjt:  NYGHCQSTFHPSCGRSAGCYMTVKTSGGKLQHRAYCEKHSSEQRAKAENQTHGIEELNRVKQIRVELERLRLLCERIIKREKIKRDLVLCSHDVLAFKRD

Query:  HVARSVLVRSPFFLPEVSSESATTSLKGHVEDLKSCSEAVQRSDDVTVDSTVSIKHWNKVPLSLDTEQKTDDDSTTSQNPFPRKFADRGQYAGKQIPQRS
        HVARSVLVRSPFFLPEVSSESATTSLKGHVEDLKSCSEAVQRSDDVTVDSTVSIKHWNKVPLSLDTEQKTD+DSTTSQNPFPRKFADR  YAGKQIPQRS
Subjt:  HVARSVLVRSPFFLPEVSSESATTSLKGHVEDLKSCSEAVQRSDDVTVDSTVSIKHWNKVPLSLDTEQKTDDDSTTSQNPFPRKFADRGQYAGKQIPQRS

Query:  STTTSRNLVDVGGLRFKSRKHAETFQKELVMTSDQASMKNSLLPKQYLYVPADVLAKEKQVNQETGSAEPPKCDR
        STTTSRNL+D GGLRFKSRKHAETFQKELVMTSDQASMKNSLLPKQYLYVPADVLAKEKQ NQE GSAEPPKCDR
Subjt:  STTTSRNLVDVGGLRFKSRKHAETFQKELVMTSDQASMKNSLLPKQYLYVPADVLAKEKQVNQETGSAEPPKCDR

A0A1S3CDU2 uncharacterized protein LOC103499277 isoform X20.0e+0093.03Show/hide
Query:  LNKCGSSQDSPSRNFPNDVEGDQLEVSVSGHNSSISAVHGKAGESPGSYFHPFVQEKMAYMLHGKLLNVSEGEMSCWQASSNVGGCCDHQHQHVDCNDAS
        +N+C SSQ SPSRNFPN VEG+QLE SVSGH+SSISAVHGKAGES GSY HPFV+EKM YMLHGKLLNV  GE+S  Q S + G C DHQHQH+DC D S
Subjt:  LNKCGSSQDSPSRNFPNDVEGDQLEVSVSGHNSSISAVHGKAGESPGSYFHPFVQEKMAYMLHGKLLNVSEGEMSCWQASSNVGGCCDHQHQHVDCNDAS

Query:  CNSGEFSPKQQVNKKIGGVIKLSPEDEIEGEIIFYQHRLLANAVSRKRFTDHLICNVVKSLPKEIDEARSTRWDAVLINQYYSELREAKKQGKKERRHKE
        CNSGEFSPKQQ NKKIGG+IK+SPEDEIEGEIIFYQHRLLANAVSRKRFTDHLICNVVKSLPKEIDEARSTRWDA+LINQYYS LREAKKQGKKERRHKE
Subjt:  CNSGEFSPKQQVNKKIGGVIKLSPEDEIEGEIIFYQHRLLANAVSRKRFTDHLICNVVKSLPKEIDEARSTRWDAVLINQYYSELREAKKQGKKERRHKE

Query:  AQAVLAAATAAAAASSRMSSFRKDVYEESTHRELMPRAKETLTKVALPKTSSESDFCKEHARSCDICRRPETILKPILVCSSCKVSVHLDCYRTVKESSG
        AQAVLAAATAAAAASSRMSSFRKDVYEESTHRELMPRAKET TKVALPKTS ESDFCKEHARSCDICRRPET+LKPILVCSSCKVSVHLDCYRTVKESSG
Subjt:  AQAVLAAATAAAAASSRMSSFRKDVYEESTHRELMPRAKETLTKVALPKTSSESDFCKEHARSCDICRRPETILKPILVCSSCKVSVHLDCYRTVKESSG

Query:  PWCCELCEELSLSRGSGAPVVNIWEKSYFVAECGLCGGTTGAFRKSSDGQWVHAFCAEWVFESTFKRGQANPVGGMETVSKGVESCYICHRKHGVCLKCN
        PWCCELCEELSLSRGSGAPVVN WEKSYFVAECGLCGGTTGAFRKSSDGQWVHAFCAEWVFESTFKRGQ NPVGGMETVSKG +SCYICHRKHGVCLKCN
Subjt:  PWCCELCEELSLSRGSGAPVVNIWEKSYFVAECGLCGGTTGAFRKSSDGQWVHAFCAEWVFESTFKRGQANPVGGMETVSKGVESCYICHRKHGVCLKCN

Query:  YGHCQSTFHPSCGRSAGCYMTVKTSGGKLQHRAYCEKHSSEQRAKAENQTHGIEELNRVKQIRVELERLRLLCERIIKREKIKRDLVLCSHDVLAFKRDH
        YGHCQSTFHPSC RSAGCYMTVK+SGGKLQHRAYCEKHSSEQRAKAENQTHGIEELNRVKQIRVELERLRLLCERIIKREKIKRDLVLCSHDVLAFKRDH
Subjt:  YGHCQSTFHPSCGRSAGCYMTVKTSGGKLQHRAYCEKHSSEQRAKAENQTHGIEELNRVKQIRVELERLRLLCERIIKREKIKRDLVLCSHDVLAFKRDH

Query:  VARSVLVRSPFFLPEVSSESATTSLKGHVEDLKSCSEAVQRSDDVTVDSTVSIKHWNKVPLSLDTEQKTDDDSTTSQNPFPRKFADRGQYAGKQIPQRSS
        VARSVLVRSPFFLPEVSSESATTSLKGHVEDLKSCSEAVQRSDDVTVDSTVSIKHWNKVPLSLDTEQKTD+DSTTSQNPFPRKFADR  YAGKQIPQRSS
Subjt:  VARSVLVRSPFFLPEVSSESATTSLKGHVEDLKSCSEAVQRSDDVTVDSTVSIKHWNKVPLSLDTEQKTDDDSTTSQNPFPRKFADRGQYAGKQIPQRSS

Query:  TTTSRNLVDVGGLRFKSRKHAETFQKELVMTSDQASMKNSLLPKQYLYVPADVLAKEKQVNQETGSAEPPKCDR
        TTTSRNL+D GGLRFKSRKHAETFQKELVMTSDQASMKNSLLPKQYLYVPADVLAKEKQ NQE GSAEPPKCDR
Subjt:  TTTSRNLVDVGGLRFKSRKHAETFQKELVMTSDQASMKNSLLPKQYLYVPADVLAKEKQVNQETGSAEPPKCDR

A0A6J1FEE0 uncharacterized protein LOC111444957 isoform X10.0e+0090.83Show/hide
Query:  QHLNKCGSSQDSPSRNFPNDVEGDQLEVSVSGHNSSISAVHGKAGESPGSYFHPFVQEKMAYMLHGKLLNVSEGEMSCWQASSNVGGCCDHQHQHVDCND
        +HLN+  SSQD+P RN PNDVEGD LE SVSGHNSS+SAVHGKAGESP SYFHP+VQEKMA+ML  KLLN+SEGEMS WQASS+ G CC H  QH DCN 
Subjt:  QHLNKCGSSQDSPSRNFPNDVEGDQLEVSVSGHNSSISAVHGKAGESPGSYFHPFVQEKMAYMLHGKLLNVSEGEMSCWQASSNVGGCCDHQHQHVDCND

Query:  ASCNSGEFSPKQQVNKKIGGVIKLSPEDEIEGEIIFYQHRLLANAVSRKRFTDHLICNVVKSLPKEIDEARSTRWDAVLINQYYSELREAKKQGKKERRH
         S  SG F+PKQ VNKKIGG+IKLSPEDEIEGEIIFYQ RLLANAVSRKRFTD+LICNVVKSLPKEI+EARSTRWDAVLINQY+ ELREAKK+GKKERRH
Subjt:  ASCNSGEFSPKQQVNKKIGGVIKLSPEDEIEGEIIFYQHRLLANAVSRKRFTDHLICNVVKSLPKEIDEARSTRWDAVLINQYYSELREAKKQGKKERRH

Query:  KEAQAVLAAATAAAAASSRMSSFRKDVYEESTHRELMPRAKETLTKVALPKTSSESDFCKEHARSCDICRRPETILKPILVCSSCKVSVHLDCYRTVKES
        KEAQAVLAAATAAAAASSRMSSFRKDVYEES HRELMPRAKETLTKVALPKTS ESDFCKEHARSCDICRRPET+LKPILVC+SCKVSVHLDCYRTVKES
Subjt:  KEAQAVLAAATAAAAASSRMSSFRKDVYEESTHRELMPRAKETLTKVALPKTSSESDFCKEHARSCDICRRPETILKPILVCSSCKVSVHLDCYRTVKES

Query:  SGPWCCELCEELSLSRGSGAPVVNIWEKSYFVAECGLCGGTTGAFRKSSDGQWVHAFCAEWVFESTFKRGQANPVGGMETVSKGVESCYICHRKHGVCLK
        SGPW CELCEEL++SRGSG PVVN WEKSYFVAECGLCGGTTGAFRKSSDGQWVHA CAEWVFEST+KRGQANPVGGMETVSKGV+SCYICHRKHGV LK
Subjt:  SGPWCCELCEELSLSRGSGAPVVNIWEKSYFVAECGLCGGTTGAFRKSSDGQWVHAFCAEWVFESTFKRGQANPVGGMETVSKGVESCYICHRKHGVCLK

Query:  CNYGHCQSTFHPSCGRSAGCYMTVKTSGGKLQHRAYCEKHSSEQRAKAENQTHGIEELNRVKQIRVELERLRLLCERIIKREKIKRDLVLCSHDVLAFKR
        CNYGHCQ+TFHP C RSAGCYMTVKTSGGKLQHRAYCEKHSSEQRAKAENQTHGIEELNRVKQIRVELERLRLLCERIIKREKIKRDLVLCSHDVLAFKR
Subjt:  CNYGHCQSTFHPSCGRSAGCYMTVKTSGGKLQHRAYCEKHSSEQRAKAENQTHGIEELNRVKQIRVELERLRLLCERIIKREKIKRDLVLCSHDVLAFKR

Query:  DHVARSVLVRSPFFLPEVSSESATTSLKGHVEDLKSCSEAVQRSDDVTVDSTVSIKHWNKVPLSLDTEQKTDDDSTTSQNPFPRKFADRGQYAGKQIPQR
        DHVARSVLV SPFFLPEVSSESATTSLKGHVEDLKSCSEAVQRSDD+TVDSTVSIKHWNKVPLSLDTEQKTDDDS+TSQNPFP+KF DRGQ+AGKQIPQR
Subjt:  DHVARSVLVRSPFFLPEVSSESATTSLKGHVEDLKSCSEAVQRSDDVTVDSTVSIKHWNKVPLSLDTEQKTDDDSTTSQNPFPRKFADRGQYAGKQIPQR

Query:  SSTTTSRNLVDVGGLRFKSRKHAETFQKELVMTSDQASMKNSLLPKQYLYVPADVLAKEKQVNQETGSAEPPKCDR
        SST+TSRNLVDV GLRFKSRKHAETFQKELVMTSDQASMKNSLLPKQYLYVPADVLAKEKQVNQETGSAEPPKCDR
Subjt:  SSTTTSRNLVDVGGLRFKSRKHAETFQKELVMTSDQASMKNSLLPKQYLYVPADVLAKEKQVNQETGSAEPPKCDR

SwissProt top hitse value%identityAlignment
B2KF05 Bromodomain and PHD finger-containing protein 33.0e-1731.43Show/hide
Query:  ILVCSSCKVSVHLDCYRTVKESSGPWCCELCEELSLSRGSGAPVVNIWEKSYFVAECGLCGGTTGAFRKSSDGQWVHAFCAEWVFESTFKRGQ-ANPVGG
        IL C  C ++VH +CY       G W C  C + S SR    PV           +C LC    GAF+++SDG W H  CA W+ E  F       P+ G
Subjt:  ILVCSSCKVSVHLDCYRTVKESSGPWCCELCEELSLSRGSGAPVVNIWEKSYFVAECGLCGGTTGAFRKSSDGQWVHAFCAEWVFESTFKRGQ-ANPVGG

Query:  METV--SKGVESCYICHRKH-GVCLKCNYGHCQSTFHPSCGRSAGCYMTVKT--------SGGKLQHRAYCEKHS
        ++ +  ++   +CYIC +K  G  ++C+  +C + FH +C + AG +M ++         +   ++  AYCE HS
Subjt:  METV--SKGVESCYICHRKH-GVCLKCNYGHCQSTFHPSCGRSAGCYMTVKT--------SGGKLQHRAYCEKHS

B2RRD7 Peregrin2.1e-1830.93Show/hide
Query:  ILVCSSCKVSVHLDCYRTVKESSGPWCCELCEELSLSRGSGAPVVNIWEKSYFVAECGLCGGTTGAFRKSSDGQWVHAFCAEWVFESTFKRGQ-ANPVGG
        IL C  C ++VH +CY       G W C  C + S SR                 +C LC    GAF+++ DG+W H  CA W+ E  F       P+  
Subjt:  ILVCSSCKVSVHLDCYRTVKESSGPWCCELCEELSLSRGSGAPVVNIWEKSYFVAECGLCGGTTGAFRKSSDGQWVHAFCAEWVFESTFKRGQ-ANPVGG

Query:  METV--SKGVESCYIC-HRKHGVCLKCNYGHCQSTFHPSCGRSAGCYM---TVKTSGG-----KLQHRAYCEKHS---SEQRAKAENQTHGIEE
        +E +  ++   +CYIC  R  G C++C+  +C + FH +C + AG YM    V+ +G       ++  AYC+ H+   S +R  A + + G EE
Subjt:  METV--SKGVESCYIC-HRKHGVCLKCNYGHCQSTFHPSCGRSAGCYM---TVKTSGG-----KLQHRAYCEKHS---SEQRAKAENQTHGIEE

P55201 Peregrin4.7e-1830.41Show/hide
Query:  ILVCSSCKVSVHLDCYRTVKESSGPWCCELCEELSLSRGSGAPVVNIWEKSYFVAECGLCGGTTGAFRKSSDGQWVHAFCAEWVFESTFKRGQ-ANPVGG
        IL C  C ++VH +CY       G W C  C + S SR                 +C LC    GAF+++ DG+W H  CA W+ E  F       P+  
Subjt:  ILVCSSCKVSVHLDCYRTVKESSGPWCCELCEELSLSRGSGAPVVNIWEKSYFVAECGLCGGTTGAFRKSSDGQWVHAFCAEWVFESTFKRGQ-ANPVGG

Query:  METV--SKGVESCYIC-HRKHGVCLKCNYGHCQSTFHPSCGRSAGCYM---TVKTSGG-----KLQHRAYCEKHS---SEQRAKAENQTHGIEE
        +E +  ++   +CYIC  R  G C++C+  +C + FH +C + AG YM    V+ +G       ++  AYC+ H+   S +R  A + + G E+
Subjt:  METV--SKGVESCYIC-HRKHGVCLKCNYGHCQSTFHPSCGRSAGCYM---TVKTSGG-----KLQHRAYCEKHS---SEQRAKAENQTHGIEE

Q12311 NuA3 HAT complex component NTO11.1e-1930.81Show/hide
Query:  RSCDICRRPET-ILKPILVCSSCKVSVHLDCYRTVKESSGPWCCELCEELSLSRGSGAPVVNIWEKSYFVAECGLCGGTTGAFRKSSDGQWVHAFCAEWV
        ++C +C   ++  L  I+ C  C ++VH +CY  +    G W C  C               +  K+ F A C +C   TGAF+++  G WVH  CA W+
Subjt:  RSCDICRRPET-ILKPILVCSSCKVSVHLDCYRTVKESSGPWCCELCEELSLSRGSGAPVVNIWEKSYFVAECGLCGGTTGAFRKSSDGQWVHAFCAEWV

Query:  FESTFKR-GQANPVGGME--TVSKGVESCYICHRKHGVCLKCNYGHCQSTFHPSCGRSAGCYMTV-KTSGGKLQHRAYCEKHSSE
         E  F       P+ G++  +VS+   +CYIC +K G C++C   +C + +H +C R AG YM+  K +  +L    + +K+S E
Subjt:  FESTFKR-GQANPVGGME--TVSKGVESCYICHRKHGVCLKCNYGHCQSTFHPSCGRSAGCYMTV-KTSGGKLQHRAYCEKHSSE

Q9ULD4 Bromodomain and PHD finger-containing protein 36.7e-1731.35Show/hide
Query:  ILVCSSCKVSVHLDCYRTVKESSGPWCCELCEELSLSRGSGAPVVNIWEKSYFVAECGLCGGTTGAFRKSSDGQWVHAFCAEWVFESTFKRGQ-ANPVGG
        IL C  C ++VH +CY       G W C  C + S SR    PV           +C LC    GAF+++SDG W H  CA W+ E  F       P+ G
Subjt:  ILVCSSCKVSVHLDCYRTVKESSGPWCCELCEELSLSRGSGAPVVNIWEKSYFVAECGLCGGTTGAFRKSSDGQWVHAFCAEWVFESTFKRGQ-ANPVGG

Query:  METV--SKGVESCYICHRKH-GVCLKCNYGHCQSTFHPSCGRSAGCYMTVK-----TSGGKL---QHRAYCEKHSSEQRAKAENQ
        ++ +  ++   +CYIC +K  G  ++C+  +C + FH +C + AG +M ++     +  G +   +  AYCE HS    A A  +
Subjt:  METV--SKGVESCYICHRKH-GVCLKCNYGHCQSTFHPSCGRSAGCYMTVK-----TSGGKL---QHRAYCEKHSSEQRAKAENQ

Arabidopsis top hitse value%identityAlignment
AT1G05830.1 trithorax-like protein 26.9e-1729.56Show/hide
Query:  CDICRRPETILKPI-LVCSSCKVSVHLDCYRTVKESSG-PWCCELCEELSLSRGSGAPVVNIWEKSYFVAECGLCGGTTGAFRKSSDGQWVHAFCAEWVF
        C++C   E     + L C  C++ VH  CY  ++  +G  W C LC  ++L                    C LC    GA + ++DG+W H  CA W+ 
Subjt:  CDICRRPETILKPI-LVCSSCKVSVHLDCYRTVKESSG-PWCCELCEELSLSRGSGAPVVNIWEKSYFVAECGLCGGTTGAFRKSSDGQWVHAFCAEWVF

Query:  ES-TFKRGQANPVGGMETVSKGVES--CYICHRKHGVCLKCNYGHCQSTFHPSCGRSAG
        E+      +  P+ G++ VSK      C IC   +G C++C+   C+  +HP C R+AG
Subjt:  ES-TFKRGQANPVGGMETVSKGVES--CYICHRKHGVCLKCNYGHCQSTFHPSCGRSAG

AT1G05830.2 trithorax-like protein 26.9e-1729.56Show/hide
Query:  CDICRRPETILKPI-LVCSSCKVSVHLDCYRTVKESSG-PWCCELCEELSLSRGSGAPVVNIWEKSYFVAECGLCGGTTGAFRKSSDGQWVHAFCAEWVF
        C++C   E     + L C  C++ VH  CY  ++  +G  W C LC  ++L                    C LC    GA + ++DG+W H  CA W+ 
Subjt:  CDICRRPETILKPI-LVCSSCKVSVHLDCYRTVKESSG-PWCCELCEELSLSRGSGAPVVNIWEKSYFVAECGLCGGTTGAFRKSSDGQWVHAFCAEWVF

Query:  ES-TFKRGQANPVGGMETVSKGVES--CYICHRKHGVCLKCNYGHCQSTFHPSCGRSAG
        E+      +  P+ G++ VSK      C IC   +G C++C+   C+  +HP C R+AG
Subjt:  ES-TFKRGQANPVGGMETVSKGVES--CYICHRKHGVCLKCNYGHCQSTFHPSCGRSAG

AT1G77800.1 PHD finger family protein6.8e-17455.67Show/hide
Query:  GVIKLSPEDEIEGEIIFYQHRLLANAVSRKRFTDHLICNVVKSLPKEIDEARSTRWDAVLINQYYSELREAKKQGKKERRHKEAQAVLAAATAAAAASSR
        G++ LSPEDE+EGE+++YQ +LL  AVSRK+ +D+L+  V K LP EIDE    RWD VL+N+Y+ ++REA+KQG+KE+R+K+AQAVLAAATAAAA SSR
Subjt:  GVIKLSPEDEIEGEIIFYQHRLLANAVSRKRFTDHLICNVVKSLPKEIDEARSTRWDAVLINQYYSELREAKKQGKKERRHKEAQAVLAAATAAAAASSR

Query:  MSSFRKDVYEESTHRE-------------LMPRAKETLTKVALPKTSSES-------DFCKEHARSCDICRRPETILKPILVCSSCKVSVHLDCYRTVKE
         +S RKD+ EE   +E             L+P+ KE+L K+A+    SE        DF  E+ R+CDICRR ETI   I+VCSSCKV+VH+DCY+  KE
Subjt:  MSSFRKDVYEESTHRE-------------LMPRAKETLTKVALPKTSSES-------DFCKEHARSCDICRRPETILKPILVCSSCKVSVHLDCYRTVKE

Query:  SSGPWCCELCEELSLSRGSGAPVVNIWEKSYFVAECGLCGGTTGAFRKSSDGQWVHAFCAEWVFESTFKRGQANPVGGMETVSKGVESCYICHRKHGVCL
        S+GPW CELC E      S  P  N  EK     EC LCGGTTGAFRK+++GQWVHAFCAEW  ESTF+RGQ NPV GME+++K  ++C +C R +G C 
Subjt:  SSGPWCCELCEELSLSRGSGAPVVNIWEKSYFVAECGLCGGTTGAFRKSSDGQWVHAFCAEWVFESTFKRGQANPVGGMETVSKGVESCYICHRKHGVCL

Query:  KCNYGHCQSTFHPSCGRSAGCYMTVKTSGGKLQHRAYCEKHSSEQRAKAENQTHGIEELNRVKQIRVELERLRLLCERIIKREKIKRDLVLCSHDVLAFK
        KC+YG+CQ+TFHPSC RSAG +M   T GGK  H+AYCEKHS EQ+AKAE+Q HG EEL  +K  RVELERLRLLCERI+KREK+KR+L + SH++LA K
Subjt:  KCNYGHCQSTFHPSCGRSAGCYMTVKTSGGKLQHRAYCEKHSSEQRAKAENQTHGIEELNRVKQIRVELERLRLLCERIIKREKIKRDLVLCSHDVLAFK

Query:  RDHVARSVLVRSPFFLPEVSSESATTSLKGHVEDLKSCSEAVQRSDDVTVDSTVSIKHWNKVPLSLDTEQKTDDDSTTSQNPFPRKFADRGQYAGKQIPQ
        RDH AR + VR+PF  PEVSS+SATTS+KGH +   S SEA+QRSDD+T+DSTV+ K   K PL +DT+QKT DDS TS++ F RK  +R   +GK +P+
Subjt:  RDHVARSVLVRSPFFLPEVSSESATTSLKGHVEDLKSCSEAVQRSDDVTVDSTVSIKHWNKVPLSLDTEQKTDDDSTTSQNPFPRKFADRGQYAGKQIPQ

Query:  RSSTTTSRNLVDVGGLRFKSRKHAETFQKELVMTSDQASMKNSLLPKQYLYVPADVLAKEKQVNQETGSAEPP
        +    +     D        ++H ETF KELVMTSD+AS KN  LPK Y YVP D L ++K  NQ+  S++ P
Subjt:  RSSTTTSRNLVDVGGLRFKSRKHAETFQKELVMTSDQASMKNSLLPKQYLYVPADVLAKEKQVNQETGSAEPP

AT1G77800.2 PHD finger family protein2.8e-17556.37Show/hide
Query:  GVIKLSPEDEIEGEIIFYQHRLLANAVSRKRFTDHLICNVVKSLPKEIDEARSTRWDAVLINQYYSELREAKKQGKKERRHKEAQAVLAAATAAAAASSR
        G++ LSPEDE+EGE+++YQ +LL  AVSRK+ +D+L+  V K LP EIDE    RWD VL+N+Y+ ++REA+KQG+KE+R+K+AQAVLAAATAAAA SSR
Subjt:  GVIKLSPEDEIEGEIIFYQHRLLANAVSRKRFTDHLICNVVKSLPKEIDEARSTRWDAVLINQYYSELREAKKQGKKERRHKEAQAVLAAATAAAAASSR

Query:  MSSFRKDVYEESTHRE-------------LMPRAKETLTKVALPKTSSES-------DFCKEHARSCDICRRPETILKPILVCSSCKVSVHLDCYRTVKE
         +S RKD+ EE   +E             L+P+ KE+L K+A+    SE        DF  E+ R+CDICRR ETI   I+VCSSCKV+VH+DCY+  KE
Subjt:  MSSFRKDVYEESTHRE-------------LMPRAKETLTKVALPKTSSES-------DFCKEHARSCDICRRPETILKPILVCSSCKVSVHLDCYRTVKE

Query:  SSGPWCCELCEELSLSRGSGAPVVNIWEKSYFVAECGLCGGTTGAFRKSSDGQWVHAFCAEWVFESTFKRGQANPVGGMETVSKGVESCYICHRKHGVCL
        S+GPW CELC E      S  P  N  EK     EC LCGGTTGAFRK+++GQWVHAFCAEW  ESTF+RGQ NPV GME+++K  ++C +C R +G C 
Subjt:  SSGPWCCELCEELSLSRGSGAPVVNIWEKSYFVAECGLCGGTTGAFRKSSDGQWVHAFCAEWVFESTFKRGQANPVGGMETVSKGVESCYICHRKHGVCL

Query:  KCNYGHCQSTFHPSCGRSAGCYMTVKTSGGKLQHRAYCEKHSSEQRAKAENQTHGIEELNRVKQIRVELERLRLLCERIIKREKIKRDLVLCSHDVLAFK
        KC+YG+CQ+TFHPSC RSAG +M   T GGK  H+AYCEKHS EQ+AKAE+Q HG EEL  +K  RVELERLRLLCERI+KREK+KR+L + SH++LA K
Subjt:  KCNYGHCQSTFHPSCGRSAGCYMTVKTSGGKLQHRAYCEKHSSEQRAKAENQTHGIEELNRVKQIRVELERLRLLCERIIKREKIKRDLVLCSHDVLAFK

Query:  RDHVARSVLVRSPFFLPEVSSESATTSLKGHVEDLKSCSEAVQRSDDVTVDSTVSIKHWNKVPLSLDTEQKTDDDSTTSQNPFPRKFADRGQYAGKQIPQ
        RDH AR + VR+PF  PEVSS+SATTS+KGH +   S SEA+QRSDD+T+DSTV+ K   K PL +DT+QKT DDS TS++ F RK  +R   +GK +P 
Subjt:  RDHVARSVLVRSPFFLPEVSSESATTSLKGHVEDLKSCSEAVQRSDDVTVDSTVSIKHWNKVPLSLDTEQKTDDDSTTSQNPFPRKFADRGQYAGKQIPQ

Query:  RSSTTTSRNLVDVGGLRFKSRKHAETFQKELVMTSDQASMKNSLLPKQYLYVPADVLAKEKQVNQETGSAEPP
        R     S ++ + G    K +KH ETF KELVMTSD+AS KN  LPK Y YVP D L ++K  NQ+  S++ P
Subjt:  RSSTTTSRNLVDVGGLRFKSRKHAETFQKELVMTSDQASMKNSLLPKQYLYVPADVLAKEKQVNQETGSAEPP

AT2G31650.1 homologue of trithorax5.3e-1730.82Show/hide
Query:  CDICRRPETILKPI-LVCSSCKVSVHLDCYRTVKESSGP-WCCELCEELSLSRGSGAPVVNIWEKSYFVAECGLCGGTTGAFRKSSDGQWVHAFCAEWVF
        C++C   E     + L C  C++ VH  CY  ++   G  W C LC         GAP +           C LC    GA + ++DG+W H  CA W+ 
Subjt:  CDICRRPETILKPI-LVCSSCKVSVHLDCYRTVKESSGP-WCCELCEELSLSRGSGAPVVNIWEKSYFVAECGLCGGTTGAFRKSSDGQWVHAFCAEWVF

Query:  ESTFKR-GQANPVGGMETVSKG--VESCYICHRKHGVCLKCNYGHCQSTFHPSCGRSAG
        E+      +  P+ G+  VSK      C IC   +G C++C+   C+  +HP C R+AG
Subjt:  ESTFKR-GQANPVGGMETVSKG--VESCYICHRKHGVCLKCNYGHCQSTFHPSCGRSAG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCAGCATTTAAACAAGTGTGGTAGTTCTCAAGATTCCCCTTCAAGAAATTTTCCAAATGATGTAGAAGGCGATCAATTAGAGGTTTCTGTTTCTGGCCATAATTCTTC
AATAAGCGCAGTTCATGGAAAGGCTGGAGAGTCCCCTGGTTCTTATTTTCATCCATTTGTCCAAGAGAAGATGGCATATATGTTGCATGGAAAGCTTCTAAATGTGTCTG
AAGGAGAGATGTCATGTTGGCAAGCATCTTCCAATGTTGGCGGTTGCTGTGATCACCAACACCAGCATGTAGACTGCAATGACGCGAGTTGCAATTCTGGTGAATTTAGT
CCGAAGCAGCAAGTGAATAAAAAAATAGGTGGAGTCATAAAGTTGTCTCCAGAAGATGAAATTGAGGGAGAAATTATATTTTATCAGCACAGATTACTTGCAAATGCAGT
TTCAAGAAAGCGGTTCACTGATCATTTAATTTGTAACGTTGTTAAGAGTCTTCCAAAGGAGATTGATGAAGCAAGAAGTACTAGATGGGATGCTGTTCTTATTAATCAGT
ATTATAGTGAACTCAGAGAAGCAAAGAAACAGGGTAAGAAAGAGAGAAGACATAAGGAAGCACAGGCTGTATTAGCTGCTGCAACGGCTGCTGCTGCTGCCTCTTCTAGG
ATGTCATCATTCAGAAAAGATGTATATGAAGAATCTACTCACAGAGAGTTGATGCCTCGTGCAAAAGAAACACTTACTAAGGTTGCTCTCCCAAAGACTTCATCAGAGTC
AGATTTCTGTAAAGAACACGCTAGATCCTGTGATATCTGTAGGCGGCCAGAGACAATATTAAAGCCAATTTTAGTCTGCTCGAGCTGCAAGGTTTCGGTACATCTGGATT
GCTATCGGACTGTGAAAGAATCCTCAGGTCCATGGTGTTGTGAATTGTGTGAAGAATTGTCACTATCAAGGGGGTCTGGAGCACCAGTAGTCAATATTTGGGAGAAATCA
TATTTTGTCGCAGAATGTGGTCTATGTGGTGGCACCACAGGTGCATTCAGGAAATCATCTGATGGCCAGTGGGTTCATGCCTTCTGTGCAGAGTGGGTCTTTGAATCAAC
TTTCAAAAGAGGACAAGCGAATCCCGTGGGAGGCATGGAGACAGTTTCTAAAGGGGTGGAATCTTGCTATATTTGTCACCGCAAGCATGGTGTCTGTTTGAAGTGCAATT
ATGGTCATTGCCAGTCTACTTTTCATCCCTCATGTGGTAGAAGTGCTGGATGTTACATGACTGTAAAGACTTCTGGTGGTAAGTTGCAGCACAGAGCATACTGTGAAAAA
CATAGCTCAGAGCAGAGAGCGAAGGCTGAGAACCAAACACATGGAATCGAGGAATTGAATAGAGTCAAGCAAATTAGGGTTGAACTTGAGAGGCTACGCCTACTTTGCGA
GAGAATCATTAAGCGTGAAAAGATCAAGAGGGACCTGGTTCTGTGTTCACATGATGTTCTTGCTTTTAAAAGAGACCATGTAGCACGATCAGTACTTGTTCGCTCGCCTT
TCTTCCTACCTGAAGTTAGTTCTGAATCAGCTACGACATCACTGAAGGGACACGTGGAAGATTTAAAATCGTGCAGTGAAGCTGTGCAAAGATCAGATGATGTAACTGTG
GACAGCACAGTTTCCATCAAGCACTGGAACAAAGTACCGTTGTCTTTGGACACTGAGCAAAAGACGGATGACGATAGTACCACATCTCAGAATCCATTCCCGCGAAAATT
TGCAGACAGGGGTCAGTATGCTGGGAAGCAAATACCTCAGAGATCTTCAACTACTACATCACGCAATCTTGTAGATGTTGGGGGATTGAGGTTCAAGTCTAGAAAGCATG
CCGAGACATTTCAAAAAGAGCTGGTAATGACGTCTGACCAAGCATCAATGAAGAACTCTCTTCTACCTAAGCAGTATCTGTATGTTCCAGCTGATGTCCTTGCCAAGGAG
AAGCAGGTCAACCAGGAAACAGGTTCTGCCGAGCCACCGAAATGTGACAGATTTCCACGTTGGAGGCCAAACATCCGTATGCCTCGGGACGCATGTCCTGCTCGGAAACT
GAGTCCGAGAACAAGTGATATGGCCAATGGAGAAGAACAGCAGGGTGAATGCTGGGATGGCTCCACTGGGGCGTCGGGTTTTGACTTGGGCAGCAAAAGTTTTGGCATTG
CAGATGTACGTTCGACGTCAATTGTAATTGTACAGGTGGGTGGGTGGAGTCGTACGGTTAAATGTCGAAATGGGGGCACAGATTGA
mRNA sequenceShow/hide mRNA sequence
ATGCAGCATTTAAACAAGTGTGGTAGTTCTCAAGATTCCCCTTCAAGAAATTTTCCAAATGATGTAGAAGGCGATCAATTAGAGGTTTCTGTTTCTGGCCATAATTCTTC
AATAAGCGCAGTTCATGGAAAGGCTGGAGAGTCCCCTGGTTCTTATTTTCATCCATTTGTCCAAGAGAAGATGGCATATATGTTGCATGGAAAGCTTCTAAATGTGTCTG
AAGGAGAGATGTCATGTTGGCAAGCATCTTCCAATGTTGGCGGTTGCTGTGATCACCAACACCAGCATGTAGACTGCAATGACGCGAGTTGCAATTCTGGTGAATTTAGT
CCGAAGCAGCAAGTGAATAAAAAAATAGGTGGAGTCATAAAGTTGTCTCCAGAAGATGAAATTGAGGGAGAAATTATATTTTATCAGCACAGATTACTTGCAAATGCAGT
TTCAAGAAAGCGGTTCACTGATCATTTAATTTGTAACGTTGTTAAGAGTCTTCCAAAGGAGATTGATGAAGCAAGAAGTACTAGATGGGATGCTGTTCTTATTAATCAGT
ATTATAGTGAACTCAGAGAAGCAAAGAAACAGGGTAAGAAAGAGAGAAGACATAAGGAAGCACAGGCTGTATTAGCTGCTGCAACGGCTGCTGCTGCTGCCTCTTCTAGG
ATGTCATCATTCAGAAAAGATGTATATGAAGAATCTACTCACAGAGAGTTGATGCCTCGTGCAAAAGAAACACTTACTAAGGTTGCTCTCCCAAAGACTTCATCAGAGTC
AGATTTCTGTAAAGAACACGCTAGATCCTGTGATATCTGTAGGCGGCCAGAGACAATATTAAAGCCAATTTTAGTCTGCTCGAGCTGCAAGGTTTCGGTACATCTGGATT
GCTATCGGACTGTGAAAGAATCCTCAGGTCCATGGTGTTGTGAATTGTGTGAAGAATTGTCACTATCAAGGGGGTCTGGAGCACCAGTAGTCAATATTTGGGAGAAATCA
TATTTTGTCGCAGAATGTGGTCTATGTGGTGGCACCACAGGTGCATTCAGGAAATCATCTGATGGCCAGTGGGTTCATGCCTTCTGTGCAGAGTGGGTCTTTGAATCAAC
TTTCAAAAGAGGACAAGCGAATCCCGTGGGAGGCATGGAGACAGTTTCTAAAGGGGTGGAATCTTGCTATATTTGTCACCGCAAGCATGGTGTCTGTTTGAAGTGCAATT
ATGGTCATTGCCAGTCTACTTTTCATCCCTCATGTGGTAGAAGTGCTGGATGTTACATGACTGTAAAGACTTCTGGTGGTAAGTTGCAGCACAGAGCATACTGTGAAAAA
CATAGCTCAGAGCAGAGAGCGAAGGCTGAGAACCAAACACATGGAATCGAGGAATTGAATAGAGTCAAGCAAATTAGGGTTGAACTTGAGAGGCTACGCCTACTTTGCGA
GAGAATCATTAAGCGTGAAAAGATCAAGAGGGACCTGGTTCTGTGTTCACATGATGTTCTTGCTTTTAAAAGAGACCATGTAGCACGATCAGTACTTGTTCGCTCGCCTT
TCTTCCTACCTGAAGTTAGTTCTGAATCAGCTACGACATCACTGAAGGGACACGTGGAAGATTTAAAATCGTGCAGTGAAGCTGTGCAAAGATCAGATGATGTAACTGTG
GACAGCACAGTTTCCATCAAGCACTGGAACAAAGTACCGTTGTCTTTGGACACTGAGCAAAAGACGGATGACGATAGTACCACATCTCAGAATCCATTCCCGCGAAAATT
TGCAGACAGGGGTCAGTATGCTGGGAAGCAAATACCTCAGAGATCTTCAACTACTACATCACGCAATCTTGTAGATGTTGGGGGATTGAGGTTCAAGTCTAGAAAGCATG
CCGAGACATTTCAAAAAGAGCTGGTAATGACGTCTGACCAAGCATCAATGAAGAACTCTCTTCTACCTAAGCAGTATCTGTATGTTCCAGCTGATGTCCTTGCCAAGGAG
AAGCAGGTCAACCAGGAAACAGGTTCTGCCGAGCCACCGAAATGTGACAGATTTCCACGTTGGAGGCCAAACATCCGTATGCCTCGGGACGCATGTCCTGCTCGGAAACT
GAGTCCGAGAACAAGTGATATGGCCAATGGAGAAGAACAGCAGGGTGAATGCTGGGATGGCTCCACTGGGGCGTCGGGTTTTGACTTGGGCAGCAAAAGTTTTGGCATTG
CAGATGTACGTTCGACGTCAATTGTAATTGTACAGGTGGGTGGGTGGAGTCGTACGGTTAAATGTCGAAATGGGGGCACAGATTGA
Protein sequenceShow/hide protein sequence
MQHLNKCGSSQDSPSRNFPNDVEGDQLEVSVSGHNSSISAVHGKAGESPGSYFHPFVQEKMAYMLHGKLLNVSEGEMSCWQASSNVGGCCDHQHQHVDCNDASCNSGEFS
PKQQVNKKIGGVIKLSPEDEIEGEIIFYQHRLLANAVSRKRFTDHLICNVVKSLPKEIDEARSTRWDAVLINQYYSELREAKKQGKKERRHKEAQAVLAAATAAAAASSR
MSSFRKDVYEESTHRELMPRAKETLTKVALPKTSSESDFCKEHARSCDICRRPETILKPILVCSSCKVSVHLDCYRTVKESSGPWCCELCEELSLSRGSGAPVVNIWEKS
YFVAECGLCGGTTGAFRKSSDGQWVHAFCAEWVFESTFKRGQANPVGGMETVSKGVESCYICHRKHGVCLKCNYGHCQSTFHPSCGRSAGCYMTVKTSGGKLQHRAYCEK
HSSEQRAKAENQTHGIEELNRVKQIRVELERLRLLCERIIKREKIKRDLVLCSHDVLAFKRDHVARSVLVRSPFFLPEVSSESATTSLKGHVEDLKSCSEAVQRSDDVTV
DSTVSIKHWNKVPLSLDTEQKTDDDSTTSQNPFPRKFADRGQYAGKQIPQRSSTTTSRNLVDVGGLRFKSRKHAETFQKELVMTSDQASMKNSLLPKQYLYVPADVLAKE
KQVNQETGSAEPPKCDRFPRWRPNIRMPRDACPARKLSPRTSDMANGEEQQGECWDGSTGASGFDLGSKSFGIADVRSTSIVIVQVGGWSRTVKCRNGGTD