; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr003498 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr003498
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionHistone H3.2
Genome locationtig00002207:37174..46203
RNA-Seq ExpressionSgr003498
SyntenySgr003498
Gene Ontology termsGO:0009873 - ethylene-activated signaling pathway (biological process)
GO:0045892 - negative regulation of transcription, DNA-templated (biological process)
GO:0000786 - nucleosome (cellular component)
GO:0005634 - nucleus (cellular component)
GO:0003677 - DNA binding (molecular function)
GO:0003700 - DNA-binding transcription factor activity (molecular function)
GO:0003714 - transcription corepressor activity (molecular function)
GO:0046982 - protein heterodimerization activity (molecular function)
InterPro domainsIPR000164 - Histone H3/CENP-A
IPR007125 - Histone H2A/H2B/H3
IPR009072 - Histone-fold
IPR022709 - Protein SCAI


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6577288.1 Protein SCAI, partial [Cucurbita argyrosperma subsp. sororia]0.0e+0085.46Show/hide
Query:  MTDNDSVAKTFRALVESANRKFARVQDVPAYGHVDNHHYFHKVFKAYMRLWKFQQEFRTKLVESGLNRWEIGEIASRIGQLYFGHYMRTSEARFLIEAYV
        MTDN+S AKTFRALVESANRKFARVQDVPAYG +D++HYFHKVFKAYMRLWK QQE+R KLVESGLNRWEIGEIASRIGQLYFGHYMRTSEARFLIEAY+
Subjt:  MTDNDSVAKTFRALVESANRKFARVQDVPAYGHVDNHHYFHKVFKAYMRLWKFQQEFRTKLVESGLNRWEIGEIASRIGQLYFGHYMRTSEARFLIEAYV

Query:  FYEAILNRSYFEGSKNSRKDLGARFKELRFYARFLLVSLFLNRTDTVQVLAERLKALVDDSKAAFRGTDFKEWRLVVQEIFCFMKVASVSMNIRPLRYSA
        FYEAILNRSYFEGSKNSRKDLGARFKELRFYARFL+VSLFLNRTDTVQVLAERLKALVDDSKA FR TDFKEWRLVVQEIFCFM +A+ SMN+RPLRYS 
Subjt:  FYEAILNRSYFEGSKNSRKDLGARFKELRFYARFLLVSLFLNRTDTVQVLAERLKALVDDSKAAFRGTDFKEWRLVVQEIFCFMKVASVSMNIRPLRYSA

Query:  PFDSHPSSLPCVARFHAKRVLKFRDAVLTSYHRNEVKFAEITLDTYRMLQCLEWEPGFFYQRHPVEPNENGASIDHSGASGIIDINLATDITDPSLPPNP
         FDSHPSSLP VARFHAKRVLKFRDAVLTSYHRNEVKFAEITLDTYRMLQCLEWEPGFFYQ+HPVEPNENGA+ID+SGASGIIDINL+TDITDPSLPPNP
Subjt:  PFDSHPSSLPCVARFHAKRVLKFRDAVLTSYHRNEVKFAEITLDTYRMLQCLEWEPGFFYQRHPVEPNENGASIDHSGASGIIDINLATDITDPSLPPNP

Query:  RKAILYRPSVTHLIAVMATICEELLPDSIMLIYLSAAGKCCQNSVNQMASHGESRKSLKHKVISQNSRENCNALPESCKSEKPGSSDLYDEYLWFGHRGN
        +KAILYRPSVTHLIAVMATICEELLPDSIMLIYLSAAGKCCQNSV+Q AS+GESRKSLK KVI QNSRENCNALPESCKS+KPGSSDLYDEYLWFGHRGN
Subjt:  RKAILYRPSVTHLIAVMATICEELLPDSIMLIYLSAAGKCCQNSVNQMASHGESRKSLKHKVISQNSRENCNALPESCKSEKPGSSDLYDEYLWFGHRGN

Query:  GGPNVLYPGDIIPFTRRPVFLIVDSNNSHAFKVLHGAEREETAAILLSHLRPVFKNPLDVDTIQSGSQFTFFLTAPLPAFCEMVGLSSANLDLDVYNDAE
        GGPNVLYPGDIIPFTRRPVFLIVDSNNSHAFK LHGAER ETAAILLS LRP FKNPL+VDTIQSGSQFTFFLTAPLPAFCEMVGLSSANLD+DVYNDAE
Subjt:  GGPNVLYPGDIIPFTRRPVFLIVDSNNSHAFKVLHGAEREETAAILLSHLRPVFKNPLDVDTIQSGSQFTFFLTAPLPAFCEMVGLSSANLDLDVYNDAE

Query:  TIISSAFSEWEIILCTSTSLNLVWAQVLCDHFLRRLILRFMFCRSVLSFFSTTEDGDLPVCLPCLPDSISSNSGVVSSAVRRLAKHLNVADLFNFHEIRT
        TIISSAFSEWEIILCTSTSLN+VWAQVL DHFLRRLILRF+FCRSVLSFF+T ED DLP+CLPCLPDSISSN GVV SA+RRLA HLN            
Subjt:  TIISSAFSEWEIILCTSTSLNLVWAQVLCDHFLRRLILRFMFCRSVLSFFSTTEDGDLPVCLPCLPDSISSNSGVVSSAVRRLAKHLNVADLFNFHEIRT

Query:  PKNTWICFLNRPIPRLSLRTRRYKYALTALVVKLFGFFDGGERDQPGREEAPRLEMARTKQTARKSTGGKAPRKQLATKAARKSAPTTGGVKKPHRYRPG
                                                            R +MARTKQTARKSTGGKAPRKQLATKAARKSAPTTGGVKKPHRYRPG
Subjt:  PKNTWICFLNRPIPRLSLRTRRYKYALTALVVKLFGFFDGGERDQPGREEAPRLEMARTKQTARKSTGGKAPRKQLATKAARKSAPTTGGVKKPHRYRPG

Query:  TVALREIRKYQKSTELLIRKLPFQRLVREIAQDFKTDLRFQSHAVLALQEAAEAYLVGLFEDTNLCAIHAKRVTIMPKDIQLARRIRGERA
        TVALREIRKYQKSTELLIRKLPFQRLVREIAQDFKTDLRFQSHAVLALQEAAEAYLVGLFEDTNLCAIHAKRVTIMPKDIQLARRIRGERA
Subjt:  TVALREIRKYQKSTELLIRKLPFQRLVREIAQDFKTDLRFQSHAVLALQEAAEAYLVGLFEDTNLCAIHAKRVTIMPKDIQLARRIRGERA

KAG6600643.1 hypothetical protein SDJN03_05876, partial [Cucurbita argyrosperma subsp. sororia]0.0e+0080.81Show/hide
Query:  MTDNDSVAKTFRALVESANRKFARVQDVPAYGHVDNHHYFHKVFKAYMRLWKFQQEFRTKLVESGLNRWEIGEIASRIGQLYFGHYMRTSEARFLIEAYV
        MTD++SVAKTFRALVESA+RKFARVQDVPAYG VDNHHYFHKVFKAYMRLWKFQQEFR KLVESGLNR EIGEIASRIGQLYFGHYMRTSEARFLIEAYV
Subjt:  MTDNDSVAKTFRALVESANRKFARVQDVPAYGHVDNHHYFHKVFKAYMRLWKFQQEFRTKLVESGLNRWEIGEIASRIGQLYFGHYMRTSEARFLIEAYV

Query:  FYEAILNRSYFEGSKNSRKDLGARFKELRFYARFLLVSLFLNRTDTVQVLAERLKALVDDSKAAFRGTDFKEWRLVVQEIFCFMKVASVSMNIRPLRYSA
        FYEAILNR+YFE SKNSRKDLGARFK+LRFYARFLLVSL LNRT TVQVLAERLKALVDDSKAAFRGTDFKEWRLVVQEIFCFMKVA++SMN        
Subjt:  FYEAILNRSYFEGSKNSRKDLGARFKELRFYARFLLVSLFLNRTDTVQVLAERLKALVDDSKAAFRGTDFKEWRLVVQEIFCFMKVASVSMNIRPLRYSA

Query:  PFDSHPSSLPCVARFHAKRVLKFRDAVLTSYHRNEVKFAEITLDTYRMLQCLEWEPGFFYQRHPVEPNENGASIDHSGASGIIDINLATDITDPSLPPNP
                                           VKFAEITLDT+RMLQCLEWEPGFFYQ+HPVEPNENGA+ID+SGASGIIDINLATD++DPSLPPNP
Subjt:  PFDSHPSSLPCVARFHAKRVLKFRDAVLTSYHRNEVKFAEITLDTYRMLQCLEWEPGFFYQRHPVEPNENGASIDHSGASGIIDINLATDITDPSLPPNP

Query:  RKAILYRPSVTHLIAVMATICEELLPDSIMLIYLSAAGKCCQNSVNQMASHGESRKSLKHKVISQNSRENCNALPESCKSEKPGSSDLYDEYLWFGHRGN
        +KAILYRPSVTHLIAVMAT+CEELLPDSIMLIYLSAAGKCCQNSVNQM S+GESRKS++ KVI+QNSRENCN+LPESCKSEK GSSDLYDEYLWFGHR N
Subjt:  RKAILYRPSVTHLIAVMATICEELLPDSIMLIYLSAAGKCCQNSVNQMASHGESRKSLKHKVISQNSRENCNALPESCKSEKPGSSDLYDEYLWFGHRGN

Query:  GGPNVLYPGDIIPFTRRPVFLIVDSNNSHAFKVLHGAEREETAAILLSHLRPVFKNPLDVDTIQSGSQFTFFLTAPLPAFCEMVGLSSANLDLDVYNDAE
        GGPNVLYPGDIIPFTRRPVFLIVDSNNSHAFKVLHGAER ETAAILLS LRP FKNPL+VDTIQSGSQFTFFLTAPLPAF EMVGL SAN+D DVYNDAE
Subjt:  GGPNVLYPGDIIPFTRRPVFLIVDSNNSHAFKVLHGAEREETAAILLSHLRPVFKNPLDVDTIQSGSQFTFFLTAPLPAFCEMVGLSSANLDLDVYNDAE

Query:  TIISSAFSEWEIILCTSTSLNLVWAQVLCDHFLRRLILRFMFCRSVLSFFSTTEDGDLPVCLPCLPDSISSNSGVVSSAVRRLAKHLNVADLFNFHEIRT
        TI+SSA SEWE +LCTSTSLN+VWAQVL D+FLRRLILRF+FCRSVLSFFST ED DLP+CLPCLPDS++SNSGVV SA+RRLAKHLNVADLFNFHE   
Subjt:  TIISSAFSEWEIILCTSTSLNLVWAQVLCDHFLRRLILRFMFCRSVLSFFSTTEDGDLPVCLPCLPDSISSNSGVVSSAVRRLAKHLNVADLFNFHEIRT

Query:  PKNTWICFLNRPIPRLSLRTRRYKYALTALVVKLFGFFDGGERDQPGREEAPRLEMARTKQTARKSTGGKAPRKQLATKAARKSAPTTGGVKKPHRYRPG
                                                       R  A + EMARTKQTARKSTGGKAPRKQLATKAARKSAPTTGGVKKPHRYRPG
Subjt:  PKNTWICFLNRPIPRLSLRTRRYKYALTALVVKLFGFFDGGERDQPGREEAPRLEMARTKQTARKSTGGKAPRKQLATKAARKSAPTTGGVKKPHRYRPG

Query:  TVALREIRKYQKSTELLIRKLPFQRLVREIAQDFKTDLRFQSHAVLALQEAAEAYLVGLFEDTNLCAIHAKRVTIMPKDIQLARRIR
        TVALREIRKYQKSTELLIRKLPFQRLVREIAQDFKTDLRFQSHAVLALQEAAEAYLVGLFEDTNLCAIHAKRVTIMPKDIQLARRIR
Subjt:  TVALREIRKYQKSTELLIRKLPFQRLVREIAQDFKTDLRFQSHAVLALQEAAEAYLVGLFEDTNLCAIHAKRVTIMPKDIQLARRIR

XP_004146874.2 LOW QUALITY PROTEIN: protein SCAI [Cucumis sativus]0.0e+0091.3Show/hide
Query:  MTDNDSVAKTFRALVESANRKFARVQDVPAYGHVDNHHYFHKVFKAYMRLWKFQQEFRTKLVESGLNRWEIGEIASRIGQLYFGHYMRTSEARFLIEAYV
        MTD+D  AKTFRA+VE+ANRKFARVQDVPAYG VDNHHYFHKVFKAYMRLWK+QQEFR KLVESGLNRWEIGEIASRIGQLYFGHYMRTSEARFLIEAYV
Subjt:  MTDNDSVAKTFRALVESANRKFARVQDVPAYGHVDNHHYFHKVFKAYMRLWKFQQEFRTKLVESGLNRWEIGEIASRIGQLYFGHYMRTSEARFLIEAYV

Query:  FYEAILNRSYFEGSKNSRKDLGARFKELRFYARFLLVSLFLNRTDTVQVLAERLKALVDDSKAAFRGTDFKEWRLVVQEIFCFMKVASVSMNIRPLRYSA
        FYEAILNRSYFEGSKNSRKDLGA FKELRFYARFLLVSL LNRTDTVQVLAERLKALVDDSKA FR TDFKEWRLVVQEIFCFM +A+ S N+RPLRYS 
Subjt:  FYEAILNRSYFEGSKNSRKDLGARFKELRFYARFLLVSLFLNRTDTVQVLAERLKALVDDSKAAFRGTDFKEWRLVVQEIFCFMKVASVSMNIRPLRYSA

Query:  PFDSHPSSLPCVARFHAKRVLKFRDAVLTSYHRNEVKFAEITLDTYRMLQCLEWEPGFFYQRHPVEPNENGASIDHSGASGIIDINLATDITDPSLPPNP
         FDSHP SLP V RFHAKRVLKFRDAVLTSYHRNEVKFAEITLDTYRMLQCLEWEPGFFYQ+HPVEPNENGA IDHSGASGIIDINLATD+TDPSLPPNP
Subjt:  PFDSHPSSLPCVARFHAKRVLKFRDAVLTSYHRNEVKFAEITLDTYRMLQCLEWEPGFFYQRHPVEPNENGASIDHSGASGIIDINLATDITDPSLPPNP

Query:  RKAILYRPSVTHLIAVMATICEELLPDSIMLIYLSAAGKCCQNSVNQMASHGESRKSLKHKVISQNSRENCNALPESCKSEKPGSSDLYDEYLWFGHRGN
        +KAIL+RPSVTHLIAVMAT+CEELLPDSIMLIYLSAAGKCCQNSVNQMAS GESRKSLK+KV +QNSRENCNAL ESCKSEKPGSSDLYDEYLWFGHRG+
Subjt:  RKAILYRPSVTHLIAVMATICEELLPDSIMLIYLSAAGKCCQNSVNQMASHGESRKSLKHKVISQNSRENCNALPESCKSEKPGSSDLYDEYLWFGHRGN

Query:  GGPNVLYPGDIIPFTRRPVFLIVDSNNSHAFKVLHGAEREETAAILLSHLRPVFKNPLDVDTIQSGSQFTFFLTAPLPAFCEMVGLSSANLDLDVYNDAE
        GGPNVLYPGDIIPFTRRPVFLIVDSNNSHAFKVLHGAER ETAAILLS LRP FKNPL+VDTIQSGSQFTFFLTAPLPAFCEMVGLSSANLD+DVYNDA+
Subjt:  GGPNVLYPGDIIPFTRRPVFLIVDSNNSHAFKVLHGAEREETAAILLSHLRPVFKNPLDVDTIQSGSQFTFFLTAPLPAFCEMVGLSSANLDLDVYNDAE

Query:  TIISSAFSEWEIILCTSTSLNLVWAQVLCDHFLRRLILRFMFCRSVLSFFSTTEDGDLPVCLPCLPDSISSNSGVVSSAVRRLAKHLNVADLFNFHEI
        TI+SSAFS+WEIILCTSTSLN+VWAQVL DHFLRRLILRF+FCRSVLSFF+T ED DLPVCLPCLPDS+SSNSGVVSSA+RRLAKHLNVADLFNFHE+
Subjt:  TIISSAFSEWEIILCTSTSLNLVWAQVLCDHFLRRLILRFMFCRSVLSFFSTTEDGDLPVCLPCLPDSISSNSGVVSSAVRRLAKHLNVADLFNFHEI

XP_022136688.1 protein SCAI isoform X1 [Momordica charantia]0.0e+0094.14Show/hide
Query:  MTDNDSVAKTFRALVESANRKFARVQDVPAYGHVDNHHYFHKVFKAYMRLWKFQQEFRTKLVESGLNRWEIGEIASRIGQLYFGHYMRTSEARFLIEAYV
        MTD DSVAKTFRALV+SANRKFARVQDVPAYG VDNHHYFHKVFKA+MRLWKFQQEFRTKLVESGLNRWEIGEIASRIGQLYFGHY+RTSEARFLIEAYV
Subjt:  MTDNDSVAKTFRALVESANRKFARVQDVPAYGHVDNHHYFHKVFKAYMRLWKFQQEFRTKLVESGLNRWEIGEIASRIGQLYFGHYMRTSEARFLIEAYV

Query:  FYEAILNRSYFEGSKNSRKDLGARFKELRFYARFLLVSLFLNRTDTVQVLAERLKALVDDSKAAFRGTDFKEWRLVVQEIFCFMKVASVSMNIRPLRYSA
        FYEAILNRSYFEGSKNSRKDLGARFKELRFYARFL VSLFLNRTDTVQVLAERLKALVDDSKA F GTDFKEWRLVVQEIFCFMKVA+VSMN+RPLRYSA
Subjt:  FYEAILNRSYFEGSKNSRKDLGARFKELRFYARFLLVSLFLNRTDTVQVLAERLKALVDDSKAAFRGTDFKEWRLVVQEIFCFMKVASVSMNIRPLRYSA

Query:  PFDSHPSSLPCVARFHAKRVLKFRDAVLTSYHRNEVKFAEITLDTYRMLQCLEWEPGFFYQRHPVEPNENGASIDHSGASGIIDINLATDITDPSLPPNP
         FDSHPSSLP VARFHAKRVLKFRDAVLTSYHR+EVKFAEITLDTYRMLQCLEWEPGFF+Q+HPVEPNENGA+IDHSGASGIIDINLATDI+DPSLPPNP
Subjt:  PFDSHPSSLPCVARFHAKRVLKFRDAVLTSYHRNEVKFAEITLDTYRMLQCLEWEPGFFYQRHPVEPNENGASIDHSGASGIIDINLATDITDPSLPPNP

Query:  RKAILYRPSVTHLIAVMATICEELLPDSIMLIYLSAAGKCCQNSVNQMASHGESRKSLKHKVISQNSRENCNALPESCKSEKPGSSDLYDEYLWFGHRGN
        +KAILYRPSVTHLIAVMATICEELLPDSIMLIYLSAAGKCCQNSVNQMAS GESRKSLK+KVI+QNSRENCNALPESCKSEK GSSDLYDEYLWFGHRGN
Subjt:  RKAILYRPSVTHLIAVMATICEELLPDSIMLIYLSAAGKCCQNSVNQMASHGESRKSLKHKVISQNSRENCNALPESCKSEKPGSSDLYDEYLWFGHRGN

Query:  GGPNVLYPGDIIPFTRRPVFLIVDSNNSHAFKVLHGAEREETAAILLSHLRPVFKNPLDVDTIQSGSQFTFFLTAPLPAFCEMVGLSSANLDLDVYNDAE
        GGPNVLYPGDIIPFTRRPVFLIVDSNNSHAFKVLHGAER ETAAILLS LRPVFKNPLDVDT++SGSQFTFFLTAPLPAFCEMVGLS ANLDLDVYNDAE
Subjt:  GGPNVLYPGDIIPFTRRPVFLIVDSNNSHAFKVLHGAEREETAAILLSHLRPVFKNPLDVDTIQSGSQFTFFLTAPLPAFCEMVGLSSANLDLDVYNDAE

Query:  TIISSAFSEWEIILCTSTSLNLVWAQVLCDHFLRRLILRFMFCRSVLSFFSTTEDGDLPVCLPCLPDSISSNSGVVSSAVRRLAKHLNVADLFNFHE
        TIISSAFSEWEIILCTSTSLNLVWAQVL DHFLRRLILRF+FCRSVLSFFSTTED +LPVCLPCLP+SI+SNSGVVSS VRR+AKHLNVADLFNFHE
Subjt:  TIISSAFSEWEIILCTSTSLNLVWAQVLCDHFLRRLILRFMFCRSVLSFFSTTEDGDLPVCLPCLPDSISSNSGVVSSAVRRLAKHLNVADLFNFHE

XP_038877443.1 protein SCAI [Benincasa hispida]0.0e+0092.31Show/hide
Query:  MTDNDSVAKTFRALVESANRKFARVQDVPAYGHVDNHHYFHKVFKAYMRLWKFQQEFRTKLVESGLNRWEIGEIASRIGQLYFGHYMRTSEARFLIEAYV
        MTDNDS AK+FRA+VESANRKFARVQDVPAYG VDNHHYFHKVFKAYMRLWK+QQEFR KLVESGLNRWEIGEIASRIGQLYFGHYMRTSEARFLIEAYV
Subjt:  MTDNDSVAKTFRALVESANRKFARVQDVPAYGHVDNHHYFHKVFKAYMRLWKFQQEFRTKLVESGLNRWEIGEIASRIGQLYFGHYMRTSEARFLIEAYV

Query:  FYEAILNRSYFEGSKNSRKDLGARFKELRFYARFLLVSLFLNRTDTVQVLAERLKALVDDSKAAFRGTDFKEWRLVVQEIFCFMKVASVSMNIRPLRYSA
        FYEAILNRSYFEGSKNSRKDLGARFKELRFYARFLLVSLFLNRTDTVQVLAERLKALVDDSKA FR TDFKEWRLVVQEIFCFM VA+ S N+RPLRYSA
Subjt:  FYEAILNRSYFEGSKNSRKDLGARFKELRFYARFLLVSLFLNRTDTVQVLAERLKALVDDSKAAFRGTDFKEWRLVVQEIFCFMKVASVSMNIRPLRYSA

Query:  PFDSHPSSLPCVARFHAKRVLKFRDAVLTSYHRNEVKFAEITLDTYRMLQCLEWEPGFFYQRHPVEPNENGASIDHSGASGIIDINLATDITDPSLPPNP
         FDSHPSSLP VARFHAKRVLKFRDAVLTSYHRNEVKFAEITLDTYRMLQC EWEPGFFYQ+HPVEPNENGA+IDHSGASGIIDINLATD TD SLPPNP
Subjt:  PFDSHPSSLPCVARFHAKRVLKFRDAVLTSYHRNEVKFAEITLDTYRMLQCLEWEPGFFYQRHPVEPNENGASIDHSGASGIIDINLATDITDPSLPPNP

Query:  RKAILYRPSVTHLIAVMATICEELLPDSIMLIYLSAAGKCCQNSVNQMASHGESRKSLKHKVISQNSRENCNALPESCKSEKPGSSDLYDEYLWFGHRGN
        +KAILYRPSVTHLIAVMATICEELLPDSIMLIYLSAAGKCCQNSVNQMAS+GESRKSLK+KV  QNSRENCNALPESCKSEKPGSSDLYDEYLWFGHRGN
Subjt:  RKAILYRPSVTHLIAVMATICEELLPDSIMLIYLSAAGKCCQNSVNQMASHGESRKSLKHKVISQNSRENCNALPESCKSEKPGSSDLYDEYLWFGHRGN

Query:  GGPNVLYPGDIIPFTRRPVFLIVDSNNSHAFKVLHGAEREETAAILLSHLRPVFKNPLDVDTIQSGSQFTFFLTAPLPAFCEMVGLSSANLDLDVYNDAE
        GGPNVLYPGDIIPFTRRPVFLIVDSNNSHAFKVLHGAER ETAAILLS LRP FKNPL+VDT QSGSQFTFFLTAPLPAFCE+VGL SANLD+DVYNDAE
Subjt:  GGPNVLYPGDIIPFTRRPVFLIVDSNNSHAFKVLHGAEREETAAILLSHLRPVFKNPLDVDTIQSGSQFTFFLTAPLPAFCEMVGLSSANLDLDVYNDAE

Query:  TIISSAFSEWEIILCTSTSLNLVWAQVLCDHFLRRLILRFMFCRSVLSFFSTTEDGDLPVCLPCLPDSISSNSGVVSSAVRRLAKHLNVADLFNFHEI
        TIISSAFS+WEIILCTS SLN+VWAQVL DHFLRRLILRF+F RSVLSFF+T ED DLPVCLPCLPDS+SSNSGVVSSA+RRLAKHLNVADLFNFH +
Subjt:  TIISSAFSEWEIILCTSTSLNLVWAQVLCDHFLRRLILRFMFCRSVLSFFSTTEDGDLPVCLPCLPDSISSNSGVVSSAVRRLAKHLNVADLFNFHEI

TrEMBL top hitse value%identityAlignment
A0A0A0KU99 Uncharacterized protein0.0e+0091.47Show/hide
Query:  MTDNDSVAKTFRALVESANRKFARVQDVPAYGHVDNHHYFHKVFKAYMRLWKFQQEFRTKLVESGLNRWEIGEIASRIGQLYFGHYMRTSEARFLIEAYV
        MTD+D  AKTFRA+VE+ANRKFARVQDVPAYG VDNHHYFHKVFKAYMRLWK+QQEFR KLVESGLNRWEIGEIASRIGQLYFGHYMRTSEARFLIEAYV
Subjt:  MTDNDSVAKTFRALVESANRKFARVQDVPAYGHVDNHHYFHKVFKAYMRLWKFQQEFRTKLVESGLNRWEIGEIASRIGQLYFGHYMRTSEARFLIEAYV

Query:  FYEAILNRSYFEGSKNSRKDLGARFKELRFYARFLLVSLFLNRTDTVQVLAERLKALVDDSKAAFRGTDFKEWRLVVQEIFCFMKVASVSMNIRPLRYSA
        FYEAILNRSYFEGSKNSRKDLGARFKELRFYARFLLVSL LNRTDTVQVLAERLKALVDDSKA FR TDFKEWRLVVQEIFCFM +A+ S N+RPLRYS 
Subjt:  FYEAILNRSYFEGSKNSRKDLGARFKELRFYARFLLVSLFLNRTDTVQVLAERLKALVDDSKAAFRGTDFKEWRLVVQEIFCFMKVASVSMNIRPLRYSA

Query:  PFDSHPSSLPCVARFHAKRVLKFRDAVLTSYHRNEVKFAEITLDTYRMLQCLEWEPGFFYQRHPVEPNENGASIDHSGASGIIDINLATDITDPSLPPNP
         FDSHP SLP V RFHAKRVLKFRDAVLTSYHRNEVKFAEITLDTYRMLQCLEWEPGFFYQ+HPVEPNENGA IDHSGASGIIDINLATD+TDPSLPPNP
Subjt:  PFDSHPSSLPCVARFHAKRVLKFRDAVLTSYHRNEVKFAEITLDTYRMLQCLEWEPGFFYQRHPVEPNENGASIDHSGASGIIDINLATDITDPSLPPNP

Query:  RKAILYRPSVTHLIAVMATICEELLPDSIMLIYLSAAGKCCQNSVNQMASHGESRKSLKHKVISQNSRENCNALPESCKSEKPGSSDLYDEYLWFGHRGN
        +KAIL+RPSVTHLIAVMAT+CEELLPDSIMLIYLSAAGKCCQNSVNQMAS GESRKSLK+KV +QNSRENCNAL ESCKSEKPGSSDLYDEYLWFGHRG+
Subjt:  RKAILYRPSVTHLIAVMATICEELLPDSIMLIYLSAAGKCCQNSVNQMASHGESRKSLKHKVISQNSRENCNALPESCKSEKPGSSDLYDEYLWFGHRGN

Query:  GGPNVLYPGDIIPFTRRPVFLIVDSNNSHAFKVLHGAEREETAAILLSHLRPVFKNPLDVDTIQSGSQFTFFLTAPLPAFCEMVGLSSANLDLDVYNDAE
        GGPNVLYPGDIIPFTRRPVFLIVDSNNSHAFKVLHGAER ETAAILLS LRP FKNPL+VDTIQSGSQFTFFLTAPLPAFCEMVGLSSANLD+DVYNDA+
Subjt:  GGPNVLYPGDIIPFTRRPVFLIVDSNNSHAFKVLHGAEREETAAILLSHLRPVFKNPLDVDTIQSGSQFTFFLTAPLPAFCEMVGLSSANLDLDVYNDAE

Query:  TIISSAFSEWEIILCTSTSLNLVWAQVLCDHFLRRLILRFMFCRSVLSFFSTTEDGDLPVCLPCLPDSISSNSGVVSSAVRRLAKHLNVADLFNFHEI
        TI+SSAFS+WEIILCTSTSLN+VWAQVL DHFLRRLILRF+FCRSVLSFF+T ED DLPVCLPCLPDS+SSNSGVVSSA+RRLAKHLNVADLFNFHE+
Subjt:  TIISSAFSEWEIILCTSTSLNLVWAQVLCDHFLRRLILRFMFCRSVLSFFSTTEDGDLPVCLPCLPDSISSNSGVVSSAVRRLAKHLNVADLFNFHEI

A0A1S3BX87 protein SCAI0.0e+0091.64Show/hide
Query:  MTDNDSVAKTFRALVESANRKFARVQDVPAYGHVDNHHYFHKVFKAYMRLWKFQQEFRTKLVESGLNRWEIGEIASRIGQLYFGHYMRTSEARFLIEAYV
        MTDNDS AKTFRA+VE+ANRKFARVQDVPAYG VDNHHYFHKVFKAYMRLWK+QQEFR KLVESGLNRWEIGEIASRIGQLYFGHYMRTSEARFLIEAYV
Subjt:  MTDNDSVAKTFRALVESANRKFARVQDVPAYGHVDNHHYFHKVFKAYMRLWKFQQEFRTKLVESGLNRWEIGEIASRIGQLYFGHYMRTSEARFLIEAYV

Query:  FYEAILNRSYFEGSKNSRKDLGARFKELRFYARFLLVSLFLNRTDTVQVLAERLKALVDDSKAAFRGTDFKEWRLVVQEIFCFMKVASVSMNIRPLRYSA
        FYEAILNRSYFEGSKNSRKDLGARFKELRFYARFLLVSL LNRTDTVQVLAERLKALVDDSKA FR TDFKEWRLVVQEIFCFM VA+ S N+RPLRYS 
Subjt:  FYEAILNRSYFEGSKNSRKDLGARFKELRFYARFLLVSLFLNRTDTVQVLAERLKALVDDSKAAFRGTDFKEWRLVVQEIFCFMKVASVSMNIRPLRYSA

Query:  PFDSHPSSLPCVARFHAKRVLKFRDAVLTSYHRNEVKFAEITLDTYRMLQCLEWEPGFFYQRHPVEPNENGASIDHSGASGIIDINLATDITDPSLPPNP
         FDSHP SLP V RFHAKRVLKFRDAVLTSYHRNEVKFAEITLDTYRMLQCLEWEPGFFYQ+HPVEPNENGA IDHSGASGIIDINLATD+TDPSLPPNP
Subjt:  PFDSHPSSLPCVARFHAKRVLKFRDAVLTSYHRNEVKFAEITLDTYRMLQCLEWEPGFFYQRHPVEPNENGASIDHSGASGIIDINLATDITDPSLPPNP

Query:  RKAILYRPSVTHLIAVMATICEELLPDSIMLIYLSAAGKCCQNSVNQMASHGESRKSLKHKVISQNSRENCNALPESCKSEKPGSSDLYDEYLWFGHRGN
        +KAILYRPSVTHLIAVMAT+CEELLPDSIMLIYLSAAGKCCQNSVNQM S GESRKSLK+KV +QNSRENCNAL ESCK EKPGSSDLYDEYLWFGHRGN
Subjt:  RKAILYRPSVTHLIAVMATICEELLPDSIMLIYLSAAGKCCQNSVNQMASHGESRKSLKHKVISQNSRENCNALPESCKSEKPGSSDLYDEYLWFGHRGN

Query:  GGPNVLYPGDIIPFTRRPVFLIVDSNNSHAFKVLHGAEREETAAILLSHLRPVFKNPLDVDTIQSGSQFTFFLTAPLPAFCEMVGLSSANLDLDVYNDAE
        GGPNVLYPGDIIPFTRRPVFLIVDSNNSHAFKVLHGAER ETAAILLS LRP FKNPL+VDTIQSGSQFTFFLTAPLPAFCEMVGLSSANLD+DVYNDA+
Subjt:  GGPNVLYPGDIIPFTRRPVFLIVDSNNSHAFKVLHGAEREETAAILLSHLRPVFKNPLDVDTIQSGSQFTFFLTAPLPAFCEMVGLSSANLDLDVYNDAE

Query:  TIISSAFSEWEIILCTSTSLNLVWAQVLCDHFLRRLILRFMFCRSVLSFFSTTEDGDLPVCLPCLPDSISSNSGVVSSAVRRLAKHLNVADLFNFHEI
        TI+SSAFS+WEIILCTSTSLN+VWAQVL DHFLRRLILRF+FCR+VLSFF+T ED DLP CLPCLPDS+SSNSGVVSSA+RRLAKHLNVADLFNFHE+
Subjt:  TIISSAFSEWEIILCTSTSLNLVWAQVLCDHFLRRLILRFMFCRSVLSFFSTTEDGDLPVCLPCLPDSISSNSGVVSSAVRRLAKHLNVADLFNFHEI

A0A6J1C886 protein SCAI isoform X10.0e+0094.14Show/hide
Query:  MTDNDSVAKTFRALVESANRKFARVQDVPAYGHVDNHHYFHKVFKAYMRLWKFQQEFRTKLVESGLNRWEIGEIASRIGQLYFGHYMRTSEARFLIEAYV
        MTD DSVAKTFRALV+SANRKFARVQDVPAYG VDNHHYFHKVFKA+MRLWKFQQEFRTKLVESGLNRWEIGEIASRIGQLYFGHY+RTSEARFLIEAYV
Subjt:  MTDNDSVAKTFRALVESANRKFARVQDVPAYGHVDNHHYFHKVFKAYMRLWKFQQEFRTKLVESGLNRWEIGEIASRIGQLYFGHYMRTSEARFLIEAYV

Query:  FYEAILNRSYFEGSKNSRKDLGARFKELRFYARFLLVSLFLNRTDTVQVLAERLKALVDDSKAAFRGTDFKEWRLVVQEIFCFMKVASVSMNIRPLRYSA
        FYEAILNRSYFEGSKNSRKDLGARFKELRFYARFL VSLFLNRTDTVQVLAERLKALVDDSKA F GTDFKEWRLVVQEIFCFMKVA+VSMN+RPLRYSA
Subjt:  FYEAILNRSYFEGSKNSRKDLGARFKELRFYARFLLVSLFLNRTDTVQVLAERLKALVDDSKAAFRGTDFKEWRLVVQEIFCFMKVASVSMNIRPLRYSA

Query:  PFDSHPSSLPCVARFHAKRVLKFRDAVLTSYHRNEVKFAEITLDTYRMLQCLEWEPGFFYQRHPVEPNENGASIDHSGASGIIDINLATDITDPSLPPNP
         FDSHPSSLP VARFHAKRVLKFRDAVLTSYHR+EVKFAEITLDTYRMLQCLEWEPGFF+Q+HPVEPNENGA+IDHSGASGIIDINLATDI+DPSLPPNP
Subjt:  PFDSHPSSLPCVARFHAKRVLKFRDAVLTSYHRNEVKFAEITLDTYRMLQCLEWEPGFFYQRHPVEPNENGASIDHSGASGIIDINLATDITDPSLPPNP

Query:  RKAILYRPSVTHLIAVMATICEELLPDSIMLIYLSAAGKCCQNSVNQMASHGESRKSLKHKVISQNSRENCNALPESCKSEKPGSSDLYDEYLWFGHRGN
        +KAILYRPSVTHLIAVMATICEELLPDSIMLIYLSAAGKCCQNSVNQMAS GESRKSLK+KVI+QNSRENCNALPESCKSEK GSSDLYDEYLWFGHRGN
Subjt:  RKAILYRPSVTHLIAVMATICEELLPDSIMLIYLSAAGKCCQNSVNQMASHGESRKSLKHKVISQNSRENCNALPESCKSEKPGSSDLYDEYLWFGHRGN

Query:  GGPNVLYPGDIIPFTRRPVFLIVDSNNSHAFKVLHGAEREETAAILLSHLRPVFKNPLDVDTIQSGSQFTFFLTAPLPAFCEMVGLSSANLDLDVYNDAE
        GGPNVLYPGDIIPFTRRPVFLIVDSNNSHAFKVLHGAER ETAAILLS LRPVFKNPLDVDT++SGSQFTFFLTAPLPAFCEMVGLS ANLDLDVYNDAE
Subjt:  GGPNVLYPGDIIPFTRRPVFLIVDSNNSHAFKVLHGAEREETAAILLSHLRPVFKNPLDVDTIQSGSQFTFFLTAPLPAFCEMVGLSSANLDLDVYNDAE

Query:  TIISSAFSEWEIILCTSTSLNLVWAQVLCDHFLRRLILRFMFCRSVLSFFSTTEDGDLPVCLPCLPDSISSNSGVVSSAVRRLAKHLNVADLFNFHE
        TIISSAFSEWEIILCTSTSLNLVWAQVL DHFLRRLILRF+FCRSVLSFFSTTED +LPVCLPCLP+SI+SNSGVVSS VRR+AKHLNVADLFNFHE
Subjt:  TIISSAFSEWEIILCTSTSLNLVWAQVLCDHFLRRLILRFMFCRSVLSFFSTTEDGDLPVCLPCLPDSISSNSGVVSSAVRRLAKHLNVADLFNFHE

A0A6J1EMW7 protein SCAI-like0.0e+0091.64Show/hide
Query:  MTDNDSVAKTFRALVESANRKFARVQDVPAYGHVDNHHYFHKVFKAYMRLWKFQQEFRTKLVESGLNRWEIGEIASRIGQLYFGHYMRTSEARFLIEAYV
        MTDN+S AKTFRALVESANRKFARVQDVPAYG +D++HYFHKVFKAYMRLWK QQE+R KLVESGLNRWEIGEIASRIGQLYFGHYMRTSEARFLIEAY+
Subjt:  MTDNDSVAKTFRALVESANRKFARVQDVPAYGHVDNHHYFHKVFKAYMRLWKFQQEFRTKLVESGLNRWEIGEIASRIGQLYFGHYMRTSEARFLIEAYV

Query:  FYEAILNRSYFEGSKNSRKDLGARFKELRFYARFLLVSLFLNRTDTVQVLAERLKALVDDSKAAFRGTDFKEWRLVVQEIFCFMKVASVSMNIRPLRYSA
        FYEAILNRSYFEGSKNSRKDLGARFKELRFYARFL+VSL LNRTDTVQVLAERLKALVDDSKA FR TDFKEWRLVVQEIFCFM VA+ SMN+RPLRYS 
Subjt:  FYEAILNRSYFEGSKNSRKDLGARFKELRFYARFLLVSLFLNRTDTVQVLAERLKALVDDSKAAFRGTDFKEWRLVVQEIFCFMKVASVSMNIRPLRYSA

Query:  PFDSHPSSLPCVARFHAKRVLKFRDAVLTSYHRNEVKFAEITLDTYRMLQCLEWEPGFFYQRHPVEPNENGASIDHSGASGIIDINLATDITDPSLPPNP
         FDSHPSSLP VARFHAKRVLKFRDAVLTSYHRNEVKFAEITLDTYRMLQCLEWEPGFFYQ+HPVEPNENGA+ID+SGASGIIDINL+TDITDPSLPPNP
Subjt:  PFDSHPSSLPCVARFHAKRVLKFRDAVLTSYHRNEVKFAEITLDTYRMLQCLEWEPGFFYQRHPVEPNENGASIDHSGASGIIDINLATDITDPSLPPNP

Query:  RKAILYRPSVTHLIAVMATICEELLPDSIMLIYLSAAGKCCQNSVNQMASH-GESRKSLKHKVISQNSRENCNALPESCKSEKPGSSDLYDEYLWFGHRG
        +KAILYRPSVTHLIAVMATICEELLPDSIMLIYLSAAGKCCQNSV+Q AS+ GESRKSLK KVI+QNSRENCNALPESCKS+KPGSSDLYDEYLWFGHRG
Subjt:  RKAILYRPSVTHLIAVMATICEELLPDSIMLIYLSAAGKCCQNSVNQMASH-GESRKSLKHKVISQNSRENCNALPESCKSEKPGSSDLYDEYLWFGHRG

Query:  NGGPNVLYPGDIIPFTRRPVFLIVDSNNSHAFKVLHGAEREETAAILLSHLRPVFKNPLDVDTIQSGSQFTFFLTAPLPAFCEMVGLSSANLDLDVYNDA
        NGGPNVLYPGDIIPFTRRPVFLIVDSNNSHAFK LHGAER ETAAILLS LRP FKNPL+VDTIQSGSQFTFFLTAPLPAFCEMVGLSSANLD+DVYNDA
Subjt:  NGGPNVLYPGDIIPFTRRPVFLIVDSNNSHAFKVLHGAEREETAAILLSHLRPVFKNPLDVDTIQSGSQFTFFLTAPLPAFCEMVGLSSANLDLDVYNDA

Query:  ETIISSAFSEWEIILCTSTSLNLVWAQVLCDHFLRRLILRFMFCRSVLSFFSTTEDGDLPVCLPCLPDSISSNSGVVSSAVRRLAKHLNVADLFNFHE
        ETIISSAFSEWEIILCTSTSLN+VWAQVL DHFLRRLILRF+FCRSVLSFF+T ED DLP+CLPCLPDSISSN GVV SA+RRLA HLNVADLFNFHE
Subjt:  ETIISSAFSEWEIILCTSTSLNLVWAQVLCDHFLRRLILRFMFCRSVLSFFSTTEDGDLPVCLPCLPDSISSNSGVVSSAVRRLAKHLNVADLFNFHE

A0A6J1JDH9 protein SCAI-like0.0e+0091.64Show/hide
Query:  MTDNDSVAKTFRALVESANRKFARVQDVPAYGHVDNHHYFHKVFKAYMRLWKFQQEFRTKLVESGLNRWEIGEIASRIGQLYFGHYMRTSEARFLIEAYV
        MTDN+S AKTFRALVESANRKFARVQDVPAYG +D++HYFHKVFKAYMRLWK QQE+R KLVESGLNRWEIGEIASRIGQLYFGHYMRTSEARFLIEAY+
Subjt:  MTDNDSVAKTFRALVESANRKFARVQDVPAYGHVDNHHYFHKVFKAYMRLWKFQQEFRTKLVESGLNRWEIGEIASRIGQLYFGHYMRTSEARFLIEAYV

Query:  FYEAILNRSYFEGSKNSRKDLGARFKELRFYARFLLVSLFLNRTDTVQVLAERLKALVDDSKAAFRGTDFKEWRLVVQEIFCFMKVASVSMNIRPLRYSA
        FYEAILNRSYFEGSKNSRKDLGARFKELRFYARFL+VSLFLNRTDTVQVLAERLKALVDDSKAAFR TDFKEWRLVVQEIFCFM VA+ SMN+RPLRYS 
Subjt:  FYEAILNRSYFEGSKNSRKDLGARFKELRFYARFLLVSLFLNRTDTVQVLAERLKALVDDSKAAFRGTDFKEWRLVVQEIFCFMKVASVSMNIRPLRYSA

Query:  PFDSHPSSLPCVARFHAKRVLKFRDAVLTSYHRNEVKFAEITLDTYRMLQCLEWEPGFFYQRHPVEPNENGASIDHSGASGIIDINLATDITDPSLPPNP
         FDSH SSLP VARFHAKRVLKFRDAVLTSYHRNEVKFAEITLDTYRMLQCLEWEPGFFYQ+HPVEPNENGA+ID+SGASGIIDINL+TDITDPSLPPNP
Subjt:  PFDSHPSSLPCVARFHAKRVLKFRDAVLTSYHRNEVKFAEITLDTYRMLQCLEWEPGFFYQRHPVEPNENGASIDHSGASGIIDINLATDITDPSLPPNP

Query:  RKAILYRPSVTHLIAVMATICEELLPDSIMLIYLSAAGKCCQNSVNQMASH-GESRKSLKHKVISQNSRENCNALPESCKSEKPGSSDLYDEYLWFGHRG
        +KAILYRPSVTHLIAVMATICEELLPDSIMLIYLSAAGKCCQNSV+Q AS+ GESRKSLK KVI+QNSRENCNALPESCKS+KPGSSDLYDEYLWFGHRG
Subjt:  RKAILYRPSVTHLIAVMATICEELLPDSIMLIYLSAAGKCCQNSVNQMASH-GESRKSLKHKVISQNSRENCNALPESCKSEKPGSSDLYDEYLWFGHRG

Query:  NGGPNVLYPGDIIPFTRRPVFLIVDSNNSHAFKVLHGAEREETAAILLSHLRPVFKNPLDVDTIQSGSQFTFFLTAPLPAFCEMVGLSSANLDLDVYNDA
        NGGPNVLYPGDIIPFTRRPVFLIVDSNNSHAFK LHGAER ETAAILLS LRP FKNPL+VDTIQSGSQFTFFLTAPLPAFCEMVGLSSANLD+DVYNDA
Subjt:  NGGPNVLYPGDIIPFTRRPVFLIVDSNNSHAFKVLHGAEREETAAILLSHLRPVFKNPLDVDTIQSGSQFTFFLTAPLPAFCEMVGLSSANLDLDVYNDA

Query:  ETIISSAFSEWEIILCTSTSLNLVWAQVLCDHFLRRLILRFMFCRSVLSFFSTTEDGDLPVCLPCLPDSISSNSGVVSSAVRRLAKHLNVADLFNFHE
        ETIISSAFSEWEIILCTSTSLN+VWAQVL DHFLRRLILRF+FCRSVLSFF+T ED DLP+CLPCLPD+ISSN GVV SA+RRLA HLNVADLFNFHE
Subjt:  ETIISSAFSEWEIILCTSTSLNLVWAQVLCDHFLRRLILRFMFCRSVLSFFSTTEDGDLPVCLPCLPDSISSNSGVVSSAVRRLAKHLNVADLFNFHE

SwissProt top hitse value%identityAlignment
P59169 Histone H3.35.7e-67100Show/hide
Query:  MARTKQTARKSTGGKAPRKQLATKAARKSAPTTGGVKKPHRYRPGTVALREIRKYQKSTELLIRKLPFQRLVREIAQDFKTDLRFQSHAVLALQEAAEAY
        MARTKQTARKSTGGKAPRKQLATKAARKSAPTTGGVKKPHRYRPGTVALREIRKYQKSTELLIRKLPFQRLVREIAQDFKTDLRFQSHAVLALQEAAEAY
Subjt:  MARTKQTARKSTGGKAPRKQLATKAARKSAPTTGGVKKPHRYRPGTVALREIRKYQKSTELLIRKLPFQRLVREIAQDFKTDLRFQSHAVLALQEAAEAY

Query:  LVGLFEDTNLCAIHAKRVTIMPKDIQLARRIRGERA
        LVGLFEDTNLCAIHAKRVTIMPKDIQLARRIRGERA
Subjt:  LVGLFEDTNLCAIHAKRVTIMPKDIQLARRIRGERA

P69245 Histone H3.35.7e-67100Show/hide
Query:  MARTKQTARKSTGGKAPRKQLATKAARKSAPTTGGVKKPHRYRPGTVALREIRKYQKSTELLIRKLPFQRLVREIAQDFKTDLRFQSHAVLALQEAAEAY
        MARTKQTARKSTGGKAPRKQLATKAARKSAPTTGGVKKPHRYRPGTVALREIRKYQKSTELLIRKLPFQRLVREIAQDFKTDLRFQSHAVLALQEAAEAY
Subjt:  MARTKQTARKSTGGKAPRKQLATKAARKSAPTTGGVKKPHRYRPGTVALREIRKYQKSTELLIRKLPFQRLVREIAQDFKTDLRFQSHAVLALQEAAEAY

Query:  LVGLFEDTNLCAIHAKRVTIMPKDIQLARRIRGERA
        LVGLFEDTNLCAIHAKRVTIMPKDIQLARRIRGERA
Subjt:  LVGLFEDTNLCAIHAKRVTIMPKDIQLARRIRGERA

Q3C2E5 Histone H3.35.7e-67100Show/hide
Query:  MARTKQTARKSTGGKAPRKQLATKAARKSAPTTGGVKKPHRYRPGTVALREIRKYQKSTELLIRKLPFQRLVREIAQDFKTDLRFQSHAVLALQEAAEAY
        MARTKQTARKSTGGKAPRKQLATKAARKSAPTTGGVKKPHRYRPGTVALREIRKYQKSTELLIRKLPFQRLVREIAQDFKTDLRFQSHAVLALQEAAEAY
Subjt:  MARTKQTARKSTGGKAPRKQLATKAARKSAPTTGGVKKPHRYRPGTVALREIRKYQKSTELLIRKLPFQRLVREIAQDFKTDLRFQSHAVLALQEAAEAY

Query:  LVGLFEDTNLCAIHAKRVTIMPKDIQLARRIRGERA
        LVGLFEDTNLCAIHAKRVTIMPKDIQLARRIRGERA
Subjt:  LVGLFEDTNLCAIHAKRVTIMPKDIQLARRIRGERA

Q6RUR1 Histone H3.35.7e-67100Show/hide
Query:  MARTKQTARKSTGGKAPRKQLATKAARKSAPTTGGVKKPHRYRPGTVALREIRKYQKSTELLIRKLPFQRLVREIAQDFKTDLRFQSHAVLALQEAAEAY
        MARTKQTARKSTGGKAPRKQLATKAARKSAPTTGGVKKPHRYRPGTVALREIRKYQKSTELLIRKLPFQRLVREIAQDFKTDLRFQSHAVLALQEAAEAY
Subjt:  MARTKQTARKSTGGKAPRKQLATKAARKSAPTTGGVKKPHRYRPGTVALREIRKYQKSTELLIRKLPFQRLVREIAQDFKTDLRFQSHAVLALQEAAEAY

Query:  LVGLFEDTNLCAIHAKRVTIMPKDIQLARRIRGERA
        LVGLFEDTNLCAIHAKRVTIMPKDIQLARRIRGERA
Subjt:  LVGLFEDTNLCAIHAKRVTIMPKDIQLARRIRGERA

Q71V89 Histone H3.35.7e-67100Show/hide
Query:  MARTKQTARKSTGGKAPRKQLATKAARKSAPTTGGVKKPHRYRPGTVALREIRKYQKSTELLIRKLPFQRLVREIAQDFKTDLRFQSHAVLALQEAAEAY
        MARTKQTARKSTGGKAPRKQLATKAARKSAPTTGGVKKPHRYRPGTVALREIRKYQKSTELLIRKLPFQRLVREIAQDFKTDLRFQSHAVLALQEAAEAY
Subjt:  MARTKQTARKSTGGKAPRKQLATKAARKSAPTTGGVKKPHRYRPGTVALREIRKYQKSTELLIRKLPFQRLVREIAQDFKTDLRFQSHAVLALQEAAEAY

Query:  LVGLFEDTNLCAIHAKRVTIMPKDIQLARRIRGERA
        LVGLFEDTNLCAIHAKRVTIMPKDIQLARRIRGERA
Subjt:  LVGLFEDTNLCAIHAKRVTIMPKDIQLARRIRGERA

Arabidopsis top hitse value%identityAlignment
AT3G03570.1 Protein of unknown function (DUF3550/UPF0682)1.5e-13644.87Show/hide
Query:  DNDSVAKTFRALVESANRKFARVQDVPAYGHVDNHHYFHKVFKAYMRLWKFQQEFRTKLVESGLNRWEIGEIASRIGQLYFGHYMRTSEARFLIEAYVFY
        +N  +++ + +LV  A++KF++++D+P Y      +YF KVFK Y +LWKFQQE R KLVE+GL RWEIGEIASRI QLY+GHYMRTS+A +L E+YVFY
Subjt:  DNDSVAKTFRALVESANRKFARVQDVPAYGHVDNHHYFHKVFKAYMRLWKFQQEFRTKLVESGLNRWEIGEIASRIGQLYFGHYMRTSEARFLIEAYVFY

Query:  EAILNRSYFEGSKNSRKDLGARFKELRFYARFLLVSLFLNRTDTVQVLAERLKALVDDSKAAFRGTDFKEWRLVVQEIFCFMKVASVSMNIRPLRYSAPF
        EAIL R YF+      +DL    K+LRF ARFL+V L L R + V  L ++ K L+D+ K  F+ TDFKEW++V QEI  F+K  +  MNIRPLRYS   
Subjt:  EAILNRSYFEGSKNSRKDLGARFKELRFYARFLLVSLFLNRTDTVQVLAERLKALVDDSKAAFRGTDFKEWRLVVQEIFCFMKVASVSMNIRPLRYSAPF

Query:  DSHPSSLPCVARFHAKRVLKFRDAVLTSYHRNEVKFAEITLDTYRMLQCLEWEP-GFFYQRHPVEPNENGASIDHSGASGIIDINLATDITDPSLPPNPR
        D +  +        A R L+  DA+L+SY+ NEVK++E+TLD++RMLQCLEWEP G  YQ         GA +  +   G+  IN +  + DP+LPPNPR
Subjt:  DSHPSSLPCVARFHAKRVLKFRDAVLTSYHRNEVKFAEITLDTYRMLQCLEWEP-GFFYQRHPVEPNENGASIDHSGASGIIDINLATDITDPSLPPNPR

Query:  KAILYRPSVTHLIAVMATICEELLPDSIMLIYLSAAGKCCQNSVNQMASHGESR------KSLKHKVISQNSRENCNALPESCKSEKPGSSDLYDE--YL
        KA+LYRPS+TH +AV+ATICEEL    I+L+YLSA+GK  Q S + +++   +       +  +   I Q +  +    P S +S +  S D       L
Subjt:  KAILYRPSVTHLIAVMATICEELLPDSIMLIYLSAAGKCCQNSVNQMASHGESR------KSLKHKVISQNSRENCNALPESCKSEKPGSSDLYDE--YL

Query:  WFGHRGNGGPNVLYPGDIIPFTRRPVFLIVDSNNSHAFKVLHGAEREETAAILL--SHLRPVFKNPLDVDTIQSGSQFTFFLTAPLPAFCEMVGLSSANL
         FG  G  G + +YP D++PFTR+P+F+I+DS++S  FK + GAE+ E AA+LL  SH  P+     D     SGS FT FLT+P+ AFC +  +S++++
Subjt:  WFGHRGNGGPNVLYPGDIIPFTRRPVFLIVDSNNSHAFKVLHGAEREETAAILL--SHLRPVFKNPLDVDTIQSGSQFTFFLTAPLPAFCEMVGLSSANL

Query:  DLDVYNDAETIISSAFSEWEIILCTSTSLNLVWAQVLCDHFLRRLILRFMFCRSVLSFFSTTEDG--DLPVCLPCLPDSISSNSGVVSSAVRRLAKHLNV
        + D++  AE ++SS+ +EW   L TS +L+ VW+Q+L D FLRRL+LRF+FCR+VL+ ++   +   + P C P LP+S+   +  V SAV ++A     
Subjt:  DLDVYNDAETIISSAFSEWEIILCTSTSLNLVWAQVLCDHFLRRLILRFMFCRSVLSFFSTTEDG--DLPVCLPCLPDSISSNSGVVSSAVRRLAKHLNV

Query:  ADLF
           F
Subjt:  ADLF

AT4G40030.1 Histone superfamily protein4.0e-68100Show/hide
Query:  MARTKQTARKSTGGKAPRKQLATKAARKSAPTTGGVKKPHRYRPGTVALREIRKYQKSTELLIRKLPFQRLVREIAQDFKTDLRFQSHAVLALQEAAEAY
        MARTKQTARKSTGGKAPRKQLATKAARKSAPTTGGVKKPHRYRPGTVALREIRKYQKSTELLIRKLPFQRLVREIAQDFKTDLRFQSHAVLALQEAAEAY
Subjt:  MARTKQTARKSTGGKAPRKQLATKAARKSAPTTGGVKKPHRYRPGTVALREIRKYQKSTELLIRKLPFQRLVREIAQDFKTDLRFQSHAVLALQEAAEAY

Query:  LVGLFEDTNLCAIHAKRVTIMPKDIQLARRIRGERA
        LVGLFEDTNLCAIHAKRVTIMPKDIQLARRIRGERA
Subjt:  LVGLFEDTNLCAIHAKRVTIMPKDIQLARRIRGERA

AT4G40030.2 Histone superfamily protein1.4e-6897.84Show/hide
Query:  RLEMARTKQTARKSTGGKAPRKQLATKAARKSAPTTGGVKKPHRYRPGTVALREIRKYQKSTELLIRKLPFQRLVREIAQDFKTDLRFQSHAVLALQEAA
        +++MARTKQTARKSTGGKAPRKQLATKAARKSAPTTGGVKKPHRYRPGTVALREIRKYQKSTELLIRKLPFQRLVREIAQDFKTDLRFQSHAVLALQEAA
Subjt:  RLEMARTKQTARKSTGGKAPRKQLATKAARKSAPTTGGVKKPHRYRPGTVALREIRKYQKSTELLIRKLPFQRLVREIAQDFKTDLRFQSHAVLALQEAA

Query:  EAYLVGLFEDTNLCAIHAKRVTIMPKDIQLARRIRGERA
        EAYLVGLFEDTNLCAIHAKRVTIMPKDIQLARRIRGERA
Subjt:  EAYLVGLFEDTNLCAIHAKRVTIMPKDIQLARRIRGERA

AT4G40040.1 Histone superfamily protein4.0e-68100Show/hide
Query:  MARTKQTARKSTGGKAPRKQLATKAARKSAPTTGGVKKPHRYRPGTVALREIRKYQKSTELLIRKLPFQRLVREIAQDFKTDLRFQSHAVLALQEAAEAY
        MARTKQTARKSTGGKAPRKQLATKAARKSAPTTGGVKKPHRYRPGTVALREIRKYQKSTELLIRKLPFQRLVREIAQDFKTDLRFQSHAVLALQEAAEAY
Subjt:  MARTKQTARKSTGGKAPRKQLATKAARKSAPTTGGVKKPHRYRPGTVALREIRKYQKSTELLIRKLPFQRLVREIAQDFKTDLRFQSHAVLALQEAAEAY

Query:  LVGLFEDTNLCAIHAKRVTIMPKDIQLARRIRGERA
        LVGLFEDTNLCAIHAKRVTIMPKDIQLARRIRGERA
Subjt:  LVGLFEDTNLCAIHAKRVTIMPKDIQLARRIRGERA

AT4G40050.1 Protein of unknown function (DUF3550/UPF0682)2.6e-20061.24Show/hide
Query:  DSVAKTFRALVESANRKFARVQDVPAYGHVDNHHYFHKVFKAYMRLWKFQQEFRTKLVESGLNRWEIGEIASRIGQLYFGHYMRTSEARFLIEAYVFYEA
        + V+  FRALVE+A+RKFARV+D+PA+G   + HYF KVFKAYM+LW +QQ  R+KLVESGLNRWEIGEIASRIGQLYF  YMRTSEARFL+EA+VFYEA
Subjt:  DSVAKTFRALVESANRKFARVQDVPAYGHVDNHHYFHKVFKAYMRLWKFQQEFRTKLVESGLNRWEIGEIASRIGQLYFGHYMRTSEARFLIEAYVFYEA

Query:  ILNRSYFEGSKNSRKDLGARFKELRFYARFLLVSLFLNRTDTVQVLAERLKALVDDSKAAFRGTDFKEWRLVVQEIFCFMKVASVSMNIRPLRYSAPFDS
        IL RSYF+ ++   KDLGARFKELRFYARFLLVSL ++R   +  LA++L+ LVD S + FR T+FKEWRLVVQEI  F++  +    +RPLRY A  DS
Subjt:  ILNRSYFEGSKNSRKDLGARFKELRFYARFLLVSLFLNRTDTVQVLAERLKALVDDSKAAFRGTDFKEWRLVVQEIFCFMKVASVSMNIRPLRYSAPFDS

Query:  HPSSLPCVARFHAKRVLKFRDAVLTSYHRNEVKFAEITLDTYRMLQCLEWEP-GFFYQRHPVEPNENGASIDHSGASGIIDINLATDITDPSLPPNPRKA
        +P+S   +ARFHAK++ KFRDA+L SYHRNEVK+AE+TLDTYRM+QCLEWEP G FYQ+ PVE  ENG  +DH+  SG+ID+NLA D+ DPSLPPNPRKA
Subjt:  HPSSLPCVARFHAKRVLKFRDAVLTSYHRNEVKFAEITLDTYRMLQCLEWEP-GFFYQRHPVEPNENGASIDHSGASGIIDINLATDITDPSLPPNPRKA

Query:  ILYRPSVTHLIAVMATICEELLPDSIMLIYLSAAGKCCQNSVNQMASHGESRKSLKHKVISQNSRENCNALPESCKSEKPGSSDLYDEYLWFGHR-GNGG
        ILYRP+V+HL+AV+A IC+EL P+++ML+YLSA+G   + +V Q  +   S ++ K K++++ S+E  +   E   + K  S++ Y+ +LW G R G+ G
Subjt:  ILYRPSVTHLIAVMATICEELLPDSIMLIYLSAAGKCCQNSVNQMASHGESRKSLKHKVISQNSRENCNALPESCKSEKPGSSDLYDEYLWFGHR-GNGG

Query:  PNVLYPGDIIPFTRRPVFLIVDSNNSHAFKVLHGAEREETAAILLSHLRPVFKNPLDVDT-IQSGSQFTFFLTAPLPAFCEMVGLSSANLDLDVYNDAET
         N LYPGD+IPFTR+P+FLI+DS+ S AFKVL GAER E  A+LLS L+P F+NP   DT   +GSQFTFFLTAPL AFC+M+GLS+   D ++ ++AE+
Subjt:  PNVLYPGDIIPFTRRPVFLIVDSNNSHAFKVLHGAEREETAAILLSHLRPVFKNPLDVDT-IQSGSQFTFFLTAPLPAFCEMVGLSSANLDLDVYNDAET

Query:  IISSAFSEWEIILCTSTSLNLVWAQVLCDHFLRRLILRFMFCRSVLSFFSTTEDGD--LPVCLPCLPDSISSNSGVVSSAVRRLAKHLNVADLFNF
        I+S++FSEWE IL TS  LNLVWAQVL D FLRRLILRF+FCRSVL+ FS TED D  LP C P LP+ +SS S  V S+V+RLA+HL VA  F+F
Subjt:  IISSAFSEWEIILCTSTSLNLVWAQVLCDHFLRRLILRFMFCRSVLSFFSTTEDGD--LPVCLPCLPDSISSNSGVVSSAVRRLAKHLNVADLFNF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACCGACAACGACTCTGTAGCGAAGACCTTCCGAGCTCTGGTGGAGAGCGCGAACCGGAAGTTCGCTAGGGTCCAAGACGTCCCGGCTTACGGACATGTGGACAACCA
CCACTATTTTCATAAGGTTTTCAAGGCTTATATGCGTCTCTGGAAGTTCCAGCAGGAATTCCGCACCAAGCTTGTTGAATCTGGTCTTAACCGCTGGGAAATTGGCGAGA
TCGCTAGCCGGATCGGTCAGCTTTATTTCGGGCATTACATGAGAACCAGCGAGGCCAGGTTTTTGATTGAAGCTTATGTCTTCTACGAAGCCATTCTTAATCGAAGCTAT
TTTGAGGGATCCAAGAATTCGAGGAAGGATCTGGGAGCAAGGTTCAAGGAGCTGAGGTTTTACGCAAGGTTTTTGCTTGTTTCTTTGTTCCTGAATCGTACGGACACGGT
TCAGGTCCTCGCGGAACGATTGAAGGCTTTGGTAGACGATAGCAAGGCCGCTTTTCGGGGTACTGACTTTAAAGAATGGAGGTTAGTTGTACAAGAAATTTTCTGCTTCA
TGAAAGTAGCATCAGTCTCAATGAATATCAGACCTTTGCGTTATTCTGCTCCATTTGATTCCCATCCGTCATCCCTTCCGTGTGTGGCTCGTTTCCATGCAAAGAGGGTT
CTTAAATTCCGAGATGCTGTTTTGACAAGCTACCACCGAAATGAGGTTAAATTTGCAGAAATTACTTTGGACACTTATAGAATGCTGCAATGTCTAGAATGGGAGCCTGG
TTTCTTCTACCAAAGGCATCCAGTTGAACCAAATGAAAATGGTGCTTCCATTGATCATTCTGGGGCGTCTGGAATAATTGATATTAACTTAGCTACTGATATTACTGATC
CATCTTTACCTCCAAATCCAAGGAAAGCTATACTCTATCGGCCTTCTGTGACTCATTTGATAGCTGTCATGGCTACAATTTGCGAGGAGCTCCTTCCAGATAGTATCATG
CTGATTTATCTTTCTGCAGCAGGGAAATGCTGTCAAAACAGTGTTAATCAAATGGCAAGTCATGGGGAATCCAGAAAATCCCTTAAACATAAAGTCATCTCCCAGAACTC
ACGAGAAAATTGTAATGCTCTGCCTGAGTCCTGTAAGAGTGAGAAGCCAGGGTCAAGTGACCTTTATGATGAGTATTTGTGGTTTGGGCATAGGGGTAATGGAGGTCCAA
ACGTTCTATACCCTGGTGATATAATACCTTTCACACGTAGACCTGTTTTCTTGATAGTTGACAGTAATAACAGCCATGCATTCAAGGTTCTACATGGGGCAGAAAGAGAA
GAGACTGCCGCTATACTTCTTTCACATTTGAGGCCTGTATTCAAGAATCCCTTAGATGTTGATACAATTCAATCAGGAAGTCAGTTTACTTTTTTCTTGACTGCTCCTCT
GCCTGCATTTTGCGAAATGGTTGGCCTGTCCTCGGCCAATTTGGATCTAGATGTTTACAATGATGCCGAGACGATAATCTCTTCTGCGTTTTCCGAGTGGGAAATAATTC
TTTGTACATCAACTAGCTTAAACCTCGTCTGGGCCCAAGTTTTGTGTGATCATTTTTTACGCCGTCTCATTCTCAGATTTATGTTCTGCCGATCCGTGCTATCTTTCTTC
AGTACTACAGAAGACGGCGACCTTCCTGTTTGCCTGCCTTGTCTTCCCGACTCCATCTCTTCGAATTCTGGAGTTGTCAGTTCAGCAGTTCGCCGTCTCGCAAAGCACCT
AAACGTTGCTGACTTATTTAACTTCCACGAAATTCGTACACCGAAAAACACATGGATATGCTTTCTTAACCGTCCGATCCCCAGATTATCTCTTCGCACACGACGCTATA
AATATGCCCTAACTGCTCTTGTTGTTAAACTTTTTGGCTTCTTTGACGGAGGAGAGAGGGATCAGCCCGGAAGAGAAGAAGCGCCACGACTCGAGATGGCCCGTACCAAG
CAAACTGCCCGTAAGTCTACTGGAGGGAAGGCTCCCAGGAAGCAGCTGGCCACAAAGGCTGCCCGTAAGTCAGCCCCAACTACAGGAGGGGTGAAGAAGCCTCACAGATA
CCGTCCTGGTACTGTTGCTCTTCGTGAAATTCGTAAGTATCAGAAGAGTACTGAGCTATTGATTAGGAAGTTGCCGTTCCAGAGGTTGGTTCGTGAAATTGCCCAGGACT
TCAAGACTGATCTGCGTTTCCAGAGCCATGCTGTTCTTGCCTTGCAAGAGGCCGCCGAGGCATACCTTGTTGGGTTGTTTGAGGACACCAATCTGTGTGCCATTCATGCC
AAGCGTGTTACCATAATGCCTAAAGATATCCAGTTGGCTCGGAGAATCCGGGGTGAACGTGCTTAG
mRNA sequenceShow/hide mRNA sequence
ATGACCGACAACGACTCTGTAGCGAAGACCTTCCGAGCTCTGGTGGAGAGCGCGAACCGGAAGTTCGCTAGGGTCCAAGACGTCCCGGCTTACGGACATGTGGACAACCA
CCACTATTTTCATAAGGTTTTCAAGGCTTATATGCGTCTCTGGAAGTTCCAGCAGGAATTCCGCACCAAGCTTGTTGAATCTGGTCTTAACCGCTGGGAAATTGGCGAGA
TCGCTAGCCGGATCGGTCAGCTTTATTTCGGGCATTACATGAGAACCAGCGAGGCCAGGTTTTTGATTGAAGCTTATGTCTTCTACGAAGCCATTCTTAATCGAAGCTAT
TTTGAGGGATCCAAGAATTCGAGGAAGGATCTGGGAGCAAGGTTCAAGGAGCTGAGGTTTTACGCAAGGTTTTTGCTTGTTTCTTTGTTCCTGAATCGTACGGACACGGT
TCAGGTCCTCGCGGAACGATTGAAGGCTTTGGTAGACGATAGCAAGGCCGCTTTTCGGGGTACTGACTTTAAAGAATGGAGGTTAGTTGTACAAGAAATTTTCTGCTTCA
TGAAAGTAGCATCAGTCTCAATGAATATCAGACCTTTGCGTTATTCTGCTCCATTTGATTCCCATCCGTCATCCCTTCCGTGTGTGGCTCGTTTCCATGCAAAGAGGGTT
CTTAAATTCCGAGATGCTGTTTTGACAAGCTACCACCGAAATGAGGTTAAATTTGCAGAAATTACTTTGGACACTTATAGAATGCTGCAATGTCTAGAATGGGAGCCTGG
TTTCTTCTACCAAAGGCATCCAGTTGAACCAAATGAAAATGGTGCTTCCATTGATCATTCTGGGGCGTCTGGAATAATTGATATTAACTTAGCTACTGATATTACTGATC
CATCTTTACCTCCAAATCCAAGGAAAGCTATACTCTATCGGCCTTCTGTGACTCATTTGATAGCTGTCATGGCTACAATTTGCGAGGAGCTCCTTCCAGATAGTATCATG
CTGATTTATCTTTCTGCAGCAGGGAAATGCTGTCAAAACAGTGTTAATCAAATGGCAAGTCATGGGGAATCCAGAAAATCCCTTAAACATAAAGTCATCTCCCAGAACTC
ACGAGAAAATTGTAATGCTCTGCCTGAGTCCTGTAAGAGTGAGAAGCCAGGGTCAAGTGACCTTTATGATGAGTATTTGTGGTTTGGGCATAGGGGTAATGGAGGTCCAA
ACGTTCTATACCCTGGTGATATAATACCTTTCACACGTAGACCTGTTTTCTTGATAGTTGACAGTAATAACAGCCATGCATTCAAGGTTCTACATGGGGCAGAAAGAGAA
GAGACTGCCGCTATACTTCTTTCACATTTGAGGCCTGTATTCAAGAATCCCTTAGATGTTGATACAATTCAATCAGGAAGTCAGTTTACTTTTTTCTTGACTGCTCCTCT
GCCTGCATTTTGCGAAATGGTTGGCCTGTCCTCGGCCAATTTGGATCTAGATGTTTACAATGATGCCGAGACGATAATCTCTTCTGCGTTTTCCGAGTGGGAAATAATTC
TTTGTACATCAACTAGCTTAAACCTCGTCTGGGCCCAAGTTTTGTGTGATCATTTTTTACGCCGTCTCATTCTCAGATTTATGTTCTGCCGATCCGTGCTATCTTTCTTC
AGTACTACAGAAGACGGCGACCTTCCTGTTTGCCTGCCTTGTCTTCCCGACTCCATCTCTTCGAATTCTGGAGTTGTCAGTTCAGCAGTTCGCCGTCTCGCAAAGCACCT
AAACGTTGCTGACTTATTTAACTTCCACGAAATTCGTACACCGAAAAACACATGGATATGCTTTCTTAACCGTCCGATCCCCAGATTATCTCTTCGCACACGACGCTATA
AATATGCCCTAACTGCTCTTGTTGTTAAACTTTTTGGCTTCTTTGACGGAGGAGAGAGGGATCAGCCCGGAAGAGAAGAAGCGCCACGACTCGAGATGGCCCGTACCAAG
CAAACTGCCCGTAAGTCTACTGGAGGGAAGGCTCCCAGGAAGCAGCTGGCCACAAAGGCTGCCCGTAAGTCAGCCCCAACTACAGGAGGGGTGAAGAAGCCTCACAGATA
CCGTCCTGGTACTGTTGCTCTTCGTGAAATTCGTAAGTATCAGAAGAGTACTGAGCTATTGATTAGGAAGTTGCCGTTCCAGAGGTTGGTTCGTGAAATTGCCCAGGACT
TCAAGACTGATCTGCGTTTCCAGAGCCATGCTGTTCTTGCCTTGCAAGAGGCCGCCGAGGCATACCTTGTTGGGTTGTTTGAGGACACCAATCTGTGTGCCATTCATGCC
AAGCGTGTTACCATAATGCCTAAAGATATCCAGTTGGCTCGGAGAATCCGGGGTGAACGTGCTTAG
Protein sequenceShow/hide protein sequence
MTDNDSVAKTFRALVESANRKFARVQDVPAYGHVDNHHYFHKVFKAYMRLWKFQQEFRTKLVESGLNRWEIGEIASRIGQLYFGHYMRTSEARFLIEAYVFYEAILNRSY
FEGSKNSRKDLGARFKELRFYARFLLVSLFLNRTDTVQVLAERLKALVDDSKAAFRGTDFKEWRLVVQEIFCFMKVASVSMNIRPLRYSAPFDSHPSSLPCVARFHAKRV
LKFRDAVLTSYHRNEVKFAEITLDTYRMLQCLEWEPGFFYQRHPVEPNENGASIDHSGASGIIDINLATDITDPSLPPNPRKAILYRPSVTHLIAVMATICEELLPDSIM
LIYLSAAGKCCQNSVNQMASHGESRKSLKHKVISQNSRENCNALPESCKSEKPGSSDLYDEYLWFGHRGNGGPNVLYPGDIIPFTRRPVFLIVDSNNSHAFKVLHGAERE
ETAAILLSHLRPVFKNPLDVDTIQSGSQFTFFLTAPLPAFCEMVGLSSANLDLDVYNDAETIISSAFSEWEIILCTSTSLNLVWAQVLCDHFLRRLILRFMFCRSVLSFF
STTEDGDLPVCLPCLPDSISSNSGVVSSAVRRLAKHLNVADLFNFHEIRTPKNTWICFLNRPIPRLSLRTRRYKYALTALVVKLFGFFDGGERDQPGREEAPRLEMARTK
QTARKSTGGKAPRKQLATKAARKSAPTTGGVKKPHRYRPGTVALREIRKYQKSTELLIRKLPFQRLVREIAQDFKTDLRFQSHAVLALQEAAEAYLVGLFEDTNLCAIHA
KRVTIMPKDIQLARRIRGERA