; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10001918 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10001918
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionProtein SCAI
Genome locationChr11:1650637..1660534
RNA-Seq ExpressionHG10001918
SyntenyHG10001918
Gene Ontology termsGO:0009873 - ethylene-activated signaling pathway (biological process)
GO:0045892 - negative regulation of transcription, DNA-templated (biological process)
GO:0000786 - nucleosome (cellular component)
GO:0005634 - nucleus (cellular component)
GO:0003677 - DNA binding (molecular function)
GO:0003700 - DNA-binding transcription factor activity (molecular function)
GO:0003714 - transcription corepressor activity (molecular function)
GO:0046982 - protein heterodimerization activity (molecular function)
InterPro domainsIPR000164 - Histone H3/CENP-A
IPR007125 - Histone H2A/H2B/H3
IPR009072 - Histone-fold
IPR022709 - Protein SCAI


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6577288.1 Protein SCAI, partial [Cucurbita argyrosperma subsp. sororia]0.0e+0090.77Show/hide
Query:  MTDNDSEAKSFRAMVESANRKFARVQDVPAYGRVDNHHYFHKVFKAYMRLWKYQQEFRAKLVESGLNRWEIGEIASRIGQLYFGHYMRTSEARFLIEAYV
        MTDN+S AK+FRA+VESANRKFARVQDVPAYGR+D++HYFHKVFKAYMRLWK+QQE+RAKLVESGLNRWEIGEIASRIGQLYFGHYMRTSEARFLIEAY+
Subjt:  MTDNDSEAKSFRAMVESANRKFARVQDVPAYGRVDNHHYFHKVFKAYMRLWKYQQEFRAKLVESGLNRWEIGEIASRIGQLYFGHYMRTSEARFLIEAYV

Query:  FYEAILNRSYFEGSKNSRKDLGARFKELRFYARFLLVSLFLNRTDTVQVLAERLKALVDDSKATFRATDFKEWRLVVQEIFCFMNVATASMNVRPLRYSA
        FYEAILNRSYFEGSKNSRKDLGARFKELRFYARFL+VSLFLNRTDTVQVLAERLKALVDDSKATFRATDFKEWRLVVQEIFCFMN+ATASMNVRPLRYS 
Subjt:  FYEAILNRSYFEGSKNSRKDLGARFKELRFYARFLLVSLFLNRTDTVQVLAERLKALVDDSKATFRATDFKEWRLVVQEIFCFMNVATASMNVRPLRYSA

Query:  AFDSHPSSLPFVARFHAKRVLKFRDAVLTSYHRNEVKFAEITLDTYRMLQCLEWEPGFFYQKHPVEPNQNGAAIDHSGASGIIDINLATDITDPSLPPNP
        AFDSHPSSLPFVARFHAKRVLKFRDAVLTSYHRNEVKFAEITLDTYRMLQCLEWEPGFFYQKHPVEPN+NGAAID+SGASGIIDINL+TDITDPSLPPNP
Subjt:  AFDSHPSSLPFVARFHAKRVLKFRDAVLTSYHRNEVKFAEITLDTYRMLQCLEWEPGFFYQKHPVEPNQNGAAIDHSGASGIIDINLATDITDPSLPPNP

Query:  KKAILYRPSVTHLIAVMATICEELHPDSIMLIYLSATGKCCQNSVNQMASYGESRKSLKNKVIAQNSRENCNALPESCKSEKPGSSDLYDEYLWFGHRGN
        KKAILYRPSVTHLIAVMATICEEL PDSIMLIYLSA GKCCQNSV+Q ASYGESRKSLK+KVI QNSRENCNALPESCKS+KPGSSDLYDEYLWFGHRGN
Subjt:  KKAILYRPSVTHLIAVMATICEELHPDSIMLIYLSATGKCCQNSVNQMASYGESRKSLKNKVIAQNSRENCNALPESCKSEKPGSSDLYDEYLWFGHRGN

Query:  GGPNVLYPGDIIPFTRRPVFLIVDSNNSHAFK----AERGETAAVLLSPLRPAFKNPLNVDTIQSGSQFTFFLTAPLPAFCEMVGLSSANLDIDVYNDAE
        GGPNVLYPGDIIPFTRRPVFLIVDSNNSHAFK    AERGETAA+LLSPLRPAFKNPLNVDTIQSGSQFTFFLTAPLPAFCEMVGLSSANLDIDVYNDAE
Subjt:  GGPNVLYPGDIIPFTRRPVFLIVDSNNSHAFK----AERGETAAVLLSPLRPAFKNPLNVDTIQSGSQFTFFLTAPLPAFCEMVGLSSANLDIDVYNDAE

Query:  TIISSAFSDWEIILCTSTSLNIVWAQVLSDHFLRRLILRFIFCRSVLSFFNTKEDDDLPVCLPCLPDSVSSNSGVVSSAIRRLAKHLN------------
        TIISSAFS+WEIILCTSTSLNIVWAQVLSDHFLRRLILRFIFCRSVLSFFNTKEDDDLP+CLPCLPDS+SSN GVV SAIRRLA HLN            
Subjt:  TIISSAFSDWEIILCTSTSLNIVWAQVLSDHFLRRLILRFIFCRSVLSFFNTKEDDDLPVCLPCLPDSVSSNSGVVSSAIRRLAKHLN------------

Query:  ----------------AARKSAPTTGGVKKPHRYRPGTVALREIRKYQKSTELLIRKLPFQRLVREIAQDFK
                        AARKSAPTTGGVKKPHRYRPGTVALREIRKYQKSTELLIRKLPFQRLVREIAQDFK
Subjt:  ----------------AARKSAPTTGGVKKPHRYRPGTVALREIRKYQKSTELLIRKLPFQRLVREIAQDFK

TYK16871.1 protein SCAI [Cucumis melo var. makuwa]0.0e+0095.9Show/hide
Query:  MTDNDSEAKSFRAMVESANRKFARVQDVPAYGRVDNHHYFHKVFKAYMRLWKYQQEFRAKLVESGLNRWEIGEIASRIGQLYFGHYMRTSEARFLIEAYV
        MTDNDSEAK+FRAMVE+ANRKFARVQDVPAYGRVDNHHYFHKVFKAYMRLWKYQQEFRAKLVESGLNRWEIGEIASRIGQLYFGHYMRTSEARFLIEAYV
Subjt:  MTDNDSEAKSFRAMVESANRKFARVQDVPAYGRVDNHHYFHKVFKAYMRLWKYQQEFRAKLVESGLNRWEIGEIASRIGQLYFGHYMRTSEARFLIEAYV

Query:  FYEAILNRSYFEGSKNSRKDLGARFKELRFYARFLLVSLFLNRTDTVQVLAERLKALVDDSKATFRATDFKEWRLVVQEIFCFMNVATASMNVRPLRYSA
        FYEAILNRSYFEGSKNSRKDLGARFKELRFYARFLLVSL LNRTDTVQVLAERLKALVDDSKATFRATDFKEWRLVVQEIFCFMNVATAS NVRPLRYS 
Subjt:  FYEAILNRSYFEGSKNSRKDLGARFKELRFYARFLLVSLFLNRTDTVQVLAERLKALVDDSKATFRATDFKEWRLVVQEIFCFMNVATASMNVRPLRYSA

Query:  AFDSHPSSLPFVARFHAKRVLKFRDAVLTSYHRNEVKFAEITLDTYRMLQCLEWEPGFFYQKHPVEPNQNGAAIDHSGASGIIDINLATDITDPSLPPNP
        AFDSHP SLPFV RFHAKRVLKFRDAVLTSYHRNEVKFAEITLDTYRMLQCLEWEPGFFYQKHPVEPN+NGA IDHSGASGIIDINLATD+TDPSLPPNP
Subjt:  AFDSHPSSLPFVARFHAKRVLKFRDAVLTSYHRNEVKFAEITLDTYRMLQCLEWEPGFFYQKHPVEPNQNGAAIDHSGASGIIDINLATDITDPSLPPNP

Query:  KKAILYRPSVTHLIAVMATICEELHPDSIMLIYLSATGKCCQNSVNQMASYGESRKSLKNKVIAQNSRENCNALPESCKSEKPGSSDLYDEYLWFGHRGN
        KKAILYRPSVTHLIAVMAT+CEEL PDSIMLIYLSA GKCCQNSVNQM S+GESRKSLKNKV AQNSRENCNAL ESCK EKPGSSDLYDEYLWFGHRGN
Subjt:  KKAILYRPSVTHLIAVMATICEELHPDSIMLIYLSATGKCCQNSVNQMASYGESRKSLKNKVIAQNSRENCNALPESCKSEKPGSSDLYDEYLWFGHRGN

Query:  GGPNVLYPGDIIPFTRRPVFLIVDSNNSHAFKAERGETAAVLLSPLRPAFKNPLNVDTIQSGSQFTFFLTAPLPAFCEMVGLSSANLDIDVYNDAETIIS
        GGPNVLYPGDIIPFTRRPVFLIVDSNNSHAFKAERGETAA+LLSPLRPAFKNPLNVDTIQSGSQFTFFLTAPLPAFCEMVGLSSANLDIDVYNDA+TI+S
Subjt:  GGPNVLYPGDIIPFTRRPVFLIVDSNNSHAFKAERGETAAVLLSPLRPAFKNPLNVDTIQSGSQFTFFLTAPLPAFCEMVGLSSANLDIDVYNDAETIIS

Query:  SAFSDWEIILCTSTSLNIVWAQVLSDHFLRRLILRFIFCRSVLSFFNTKEDDDLPVCLPCLPDSVSSNSGVVSSAIRRLAKHLNAA
        SAFSDWEIILCTSTSLNIVWAQVLSDHFLRRLILRFIFCR+VLSFFNTKEDDDLP CLPCLPDSVSSNSGVVSSAIRRLAKHLN A
Subjt:  SAFSDWEIILCTSTSLNIVWAQVLSDHFLRRLILRFIFCRSVLSFFNTKEDDDLPVCLPCLPDSVSSNSGVVSSAIRRLAKHLNAA

XP_004146874.2 LOW QUALITY PROTEIN: protein SCAI [Cucumis sativus]0.0e+0094.92Show/hide
Query:  MTDNDSEAKSFRAMVESANRKFARVQDVPAYGRVDNHHYFHKVFKAYMRLWKYQQEFRAKLVESGLNRWEIGEIASRIGQLYFGHYMRTSEARFLIEAYV
        MTD+D EAK+FRAMVE+ANRKFARVQDVPAYGRVDNHHYFHKVFKAYMRLWKYQQEFRAKLVESGLNRWEIGEIASRIGQLYFGHYMRTSEARFLIEAYV
Subjt:  MTDNDSEAKSFRAMVESANRKFARVQDVPAYGRVDNHHYFHKVFKAYMRLWKYQQEFRAKLVESGLNRWEIGEIASRIGQLYFGHYMRTSEARFLIEAYV

Query:  FYEAILNRSYFEGSKNSRKDLGARFKELRFYARFLLVSLFLNRTDTVQVLAERLKALVDDSKATFRATDFKEWRLVVQEIFCFMNVATASMNVRPLRYSA
        FYEAILNRSYFEGSKNSRKDLGA FKELRFYARFLLVSL LNRTDTVQVLAERLKALVDDSKATFRATDFKEWRLVVQEIFCFMN+ATAS NVRPLRYS 
Subjt:  FYEAILNRSYFEGSKNSRKDLGARFKELRFYARFLLVSLFLNRTDTVQVLAERLKALVDDSKATFRATDFKEWRLVVQEIFCFMNVATASMNVRPLRYSA

Query:  AFDSHPSSLPFVARFHAKRVLKFRDAVLTSYHRNEVKFAEITLDTYRMLQCLEWEPGFFYQKHPVEPNQNGAAIDHSGASGIIDINLATDITDPSLPPNP
        AFDSHP SLPFV RFHAKRVLKFRDAVLTSYHRNEVKFAEITLDTYRMLQCLEWEPGFFYQKHPVEPN+NGA IDHSGASGIIDINLATD+TDPSLPPNP
Subjt:  AFDSHPSSLPFVARFHAKRVLKFRDAVLTSYHRNEVKFAEITLDTYRMLQCLEWEPGFFYQKHPVEPNQNGAAIDHSGASGIIDINLATDITDPSLPPNP

Query:  KKAILYRPSVTHLIAVMATICEELHPDSIMLIYLSATGKCCQNSVNQMASYGESRKSLKNKVIAQNSRENCNALPESCKSEKPGSSDLYDEYLWFGHRGN
        KKAIL+RPSVTHLIAVMAT+CEEL PDSIMLIYLSA GKCCQNSVNQMAS GESRKSLKNKV AQNSRENCNAL ESCKSEKPGSSDLYDEYLWFGHRG+
Subjt:  KKAILYRPSVTHLIAVMATICEELHPDSIMLIYLSATGKCCQNSVNQMASYGESRKSLKNKVIAQNSRENCNALPESCKSEKPGSSDLYDEYLWFGHRGN

Query:  GGPNVLYPGDIIPFTRRPVFLIVDSNNSHAFK----AERGETAAVLLSPLRPAFKNPLNVDTIQSGSQFTFFLTAPLPAFCEMVGLSSANLDIDVYNDAE
        GGPNVLYPGDIIPFTRRPVFLIVDSNNSHAFK    AERGETAA+LLSPLRPAFKNPLNVDTIQSGSQFTFFLTAPLPAFCEMVGLSSANLDIDVYNDA+
Subjt:  GGPNVLYPGDIIPFTRRPVFLIVDSNNSHAFK----AERGETAAVLLSPLRPAFKNPLNVDTIQSGSQFTFFLTAPLPAFCEMVGLSSANLDIDVYNDAE

Query:  TIISSAFSDWEIILCTSTSLNIVWAQVLSDHFLRRLILRFIFCRSVLSFFNTKEDDDLPVCLPCLPDSVSSNSGVVSSAIRRLAKHLNAA
        TI+SSAFSDWEIILCTSTSLNIVWAQVLSDHFLRRLILRFIFCRSVLSFFNTKEDDDLPVCLPCLPDSVSSNSGVVSSAIRRLAKHLN A
Subjt:  TIISSAFSDWEIILCTSTSLNIVWAQVLSDHFLRRLILRFIFCRSVLSFFNTKEDDDLPVCLPCLPDSVSSNSGVVSSAIRRLAKHLNAA

XP_008453803.1 PREDICTED: protein SCAI [Cucumis melo]0.0e+0095.25Show/hide
Query:  MTDNDSEAKSFRAMVESANRKFARVQDVPAYGRVDNHHYFHKVFKAYMRLWKYQQEFRAKLVESGLNRWEIGEIASRIGQLYFGHYMRTSEARFLIEAYV
        MTDNDSEAK+FRAMVE+ANRKFARVQDVPAYGRVDNHHYFHKVFKAYMRLWKYQQEFRAKLVESGLNRWEIGEIASRIGQLYFGHYMRTSEARFLIEAYV
Subjt:  MTDNDSEAKSFRAMVESANRKFARVQDVPAYGRVDNHHYFHKVFKAYMRLWKYQQEFRAKLVESGLNRWEIGEIASRIGQLYFGHYMRTSEARFLIEAYV

Query:  FYEAILNRSYFEGSKNSRKDLGARFKELRFYARFLLVSLFLNRTDTVQVLAERLKALVDDSKATFRATDFKEWRLVVQEIFCFMNVATASMNVRPLRYSA
        FYEAILNRSYFEGSKNSRKDLGARFKELRFYARFLLVSL LNRTDTVQVLAERLKALVDDSKATFRATDFKEWRLVVQEIFCFMNVATAS NVRPLRYS 
Subjt:  FYEAILNRSYFEGSKNSRKDLGARFKELRFYARFLLVSLFLNRTDTVQVLAERLKALVDDSKATFRATDFKEWRLVVQEIFCFMNVATASMNVRPLRYSA

Query:  AFDSHPSSLPFVARFHAKRVLKFRDAVLTSYHRNEVKFAEITLDTYRMLQCLEWEPGFFYQKHPVEPNQNGAAIDHSGASGIIDINLATDITDPSLPPNP
        AFDSHP SLPFV RFHAKRVLKFRDAVLTSYHRNEVKFAEITLDTYRMLQCLEWEPGFFYQKHPVEPN+NGA IDHSGASGIIDINLATD+TDPSLPPNP
Subjt:  AFDSHPSSLPFVARFHAKRVLKFRDAVLTSYHRNEVKFAEITLDTYRMLQCLEWEPGFFYQKHPVEPNQNGAAIDHSGASGIIDINLATDITDPSLPPNP

Query:  KKAILYRPSVTHLIAVMATICEELHPDSIMLIYLSATGKCCQNSVNQMASYGESRKSLKNKVIAQNSRENCNALPESCKSEKPGSSDLYDEYLWFGHRGN
        KKAILYRPSVTHLIAVMAT+CEEL PDSIMLIYLSA GKCCQNSVNQM S+GESRKSLKNKV AQNSRENCNAL ESCK EKPGSSDLYDEYLWFGHRGN
Subjt:  KKAILYRPSVTHLIAVMATICEELHPDSIMLIYLSATGKCCQNSVNQMASYGESRKSLKNKVIAQNSRENCNALPESCKSEKPGSSDLYDEYLWFGHRGN

Query:  GGPNVLYPGDIIPFTRRPVFLIVDSNNSHAFK----AERGETAAVLLSPLRPAFKNPLNVDTIQSGSQFTFFLTAPLPAFCEMVGLSSANLDIDVYNDAE
        GGPNVLYPGDIIPFTRRPVFLIVDSNNSHAFK    AERGETAA+LLSPLRPAFKNPLNVDTIQSGSQFTFFLTAPLPAFCEMVGLSSANLDIDVYNDA+
Subjt:  GGPNVLYPGDIIPFTRRPVFLIVDSNNSHAFK----AERGETAAVLLSPLRPAFKNPLNVDTIQSGSQFTFFLTAPLPAFCEMVGLSSANLDIDVYNDAE

Query:  TIISSAFSDWEIILCTSTSLNIVWAQVLSDHFLRRLILRFIFCRSVLSFFNTKEDDDLPVCLPCLPDSVSSNSGVVSSAIRRLAKHLNAA
        TI+SSAFSDWEIILCTSTSLNIVWAQVLSDHFLRRLILRFIFCR+VLSFFNTKEDDDLP CLPCLPDSVSSNSGVVSSAIRRLAKHLN A
Subjt:  TIISSAFSDWEIILCTSTSLNIVWAQVLSDHFLRRLILRFIFCRSVLSFFNTKEDDDLPVCLPCLPDSVSSNSGVVSSAIRRLAKHLNAA

XP_038877443.1 protein SCAI [Benincasa hispida]0.0e+0096.44Show/hide
Query:  MTDNDSEAKSFRAMVESANRKFARVQDVPAYGRVDNHHYFHKVFKAYMRLWKYQQEFRAKLVESGLNRWEIGEIASRIGQLYFGHYMRTSEARFLIEAYV
        MTDNDSEAKSFRAMVESANRKFARVQDVPAYGRVDNHHYFHKVFKAYMRLWKYQQEFRAKLVESGLNRWEIGEIASRIGQLYFGHYMRTSEARFLIEAYV
Subjt:  MTDNDSEAKSFRAMVESANRKFARVQDVPAYGRVDNHHYFHKVFKAYMRLWKYQQEFRAKLVESGLNRWEIGEIASRIGQLYFGHYMRTSEARFLIEAYV

Query:  FYEAILNRSYFEGSKNSRKDLGARFKELRFYARFLLVSLFLNRTDTVQVLAERLKALVDDSKATFRATDFKEWRLVVQEIFCFMNVATASMNVRPLRYSA
        FYEAILNRSYFEGSKNSRKDLGARFKELRFYARFLLVSLFLNRTDTVQVLAERLKALVDDSKATFRATDFKEWRLVVQEIFCFMNVATAS NVRPLRYSA
Subjt:  FYEAILNRSYFEGSKNSRKDLGARFKELRFYARFLLVSLFLNRTDTVQVLAERLKALVDDSKATFRATDFKEWRLVVQEIFCFMNVATASMNVRPLRYSA

Query:  AFDSHPSSLPFVARFHAKRVLKFRDAVLTSYHRNEVKFAEITLDTYRMLQCLEWEPGFFYQKHPVEPNQNGAAIDHSGASGIIDINLATDITDPSLPPNP
        AFDSHPSSLPFVARFHAKRVLKFRDAVLTSYHRNEVKFAEITLDTYRMLQC EWEPGFFYQKHPVEPN+NGAAIDHSGASGIIDINLATD TD SLPPNP
Subjt:  AFDSHPSSLPFVARFHAKRVLKFRDAVLTSYHRNEVKFAEITLDTYRMLQCLEWEPGFFYQKHPVEPNQNGAAIDHSGASGIIDINLATDITDPSLPPNP

Query:  KKAILYRPSVTHLIAVMATICEELHPDSIMLIYLSATGKCCQNSVNQMASYGESRKSLKNKVIAQNSRENCNALPESCKSEKPGSSDLYDEYLWFGHRGN
        KKAILYRPSVTHLIAVMATICEEL PDSIMLIYLSA GKCCQNSVNQMASYGESRKSLKNKV  QNSRENCNALPESCKSEKPGSSDLYDEYLWFGHRGN
Subjt:  KKAILYRPSVTHLIAVMATICEELHPDSIMLIYLSATGKCCQNSVNQMASYGESRKSLKNKVIAQNSRENCNALPESCKSEKPGSSDLYDEYLWFGHRGN

Query:  GGPNVLYPGDIIPFTRRPVFLIVDSNNSHAFK----AERGETAAVLLSPLRPAFKNPLNVDTIQSGSQFTFFLTAPLPAFCEMVGLSSANLDIDVYNDAE
        GGPNVLYPGDIIPFTRRPVFLIVDSNNSHAFK    AERGETAA+LLSPLRPAFKNPLNVDT QSGSQFTFFLTAPLPAFCE+VGL SANLDIDVYNDAE
Subjt:  GGPNVLYPGDIIPFTRRPVFLIVDSNNSHAFK----AERGETAAVLLSPLRPAFKNPLNVDTIQSGSQFTFFLTAPLPAFCEMVGLSSANLDIDVYNDAE

Query:  TIISSAFSDWEIILCTSTSLNIVWAQVLSDHFLRRLILRFIFCRSVLSFFNTKEDDDLPVCLPCLPDSVSSNSGVVSSAIRRLAKHLNAA
        TIISSAFSDWEIILCTS SLNIVWAQVLSDHFLRRLILRFIF RSVLSFFNTKEDDDLPVCLPCLPDS+SSNSGVVSSAIRRLAKHLN A
Subjt:  TIISSAFSDWEIILCTSTSLNIVWAQVLSDHFLRRLILRFIFCRSVLSFFNTKEDDDLPVCLPCLPDSVSSNSGVVSSAIRRLAKHLNAA

TrEMBL top hitse value%identityAlignment
A0A0A0KU99 Uncharacterized protein0.0e+0095.08Show/hide
Query:  MTDNDSEAKSFRAMVESANRKFARVQDVPAYGRVDNHHYFHKVFKAYMRLWKYQQEFRAKLVESGLNRWEIGEIASRIGQLYFGHYMRTSEARFLIEAYV
        MTD+D EAK+FRAMVE+ANRKFARVQDVPAYGRVDNHHYFHKVFKAYMRLWKYQQEFRAKLVESGLNRWEIGEIASRIGQLYFGHYMRTSEARFLIEAYV
Subjt:  MTDNDSEAKSFRAMVESANRKFARVQDVPAYGRVDNHHYFHKVFKAYMRLWKYQQEFRAKLVESGLNRWEIGEIASRIGQLYFGHYMRTSEARFLIEAYV

Query:  FYEAILNRSYFEGSKNSRKDLGARFKELRFYARFLLVSLFLNRTDTVQVLAERLKALVDDSKATFRATDFKEWRLVVQEIFCFMNVATASMNVRPLRYSA
        FYEAILNRSYFEGSKNSRKDLGARFKELRFYARFLLVSL LNRTDTVQVLAERLKALVDDSKATFRATDFKEWRLVVQEIFCFMN+ATAS NVRPLRYS 
Subjt:  FYEAILNRSYFEGSKNSRKDLGARFKELRFYARFLLVSLFLNRTDTVQVLAERLKALVDDSKATFRATDFKEWRLVVQEIFCFMNVATASMNVRPLRYSA

Query:  AFDSHPSSLPFVARFHAKRVLKFRDAVLTSYHRNEVKFAEITLDTYRMLQCLEWEPGFFYQKHPVEPNQNGAAIDHSGASGIIDINLATDITDPSLPPNP
        AFDSHP SLPFV RFHAKRVLKFRDAVLTSYHRNEVKFAEITLDTYRMLQCLEWEPGFFYQKHPVEPN+NGA IDHSGASGIIDINLATD+TDPSLPPNP
Subjt:  AFDSHPSSLPFVARFHAKRVLKFRDAVLTSYHRNEVKFAEITLDTYRMLQCLEWEPGFFYQKHPVEPNQNGAAIDHSGASGIIDINLATDITDPSLPPNP

Query:  KKAILYRPSVTHLIAVMATICEELHPDSIMLIYLSATGKCCQNSVNQMASYGESRKSLKNKVIAQNSRENCNALPESCKSEKPGSSDLYDEYLWFGHRGN
        KKAIL+RPSVTHLIAVMAT+CEEL PDSIMLIYLSA GKCCQNSVNQMAS GESRKSLKNKV AQNSRENCNAL ESCKSEKPGSSDLYDEYLWFGHRG+
Subjt:  KKAILYRPSVTHLIAVMATICEELHPDSIMLIYLSATGKCCQNSVNQMASYGESRKSLKNKVIAQNSRENCNALPESCKSEKPGSSDLYDEYLWFGHRGN

Query:  GGPNVLYPGDIIPFTRRPVFLIVDSNNSHAFK----AERGETAAVLLSPLRPAFKNPLNVDTIQSGSQFTFFLTAPLPAFCEMVGLSSANLDIDVYNDAE
        GGPNVLYPGDIIPFTRRPVFLIVDSNNSHAFK    AERGETAA+LLSPLRPAFKNPLNVDTIQSGSQFTFFLTAPLPAFCEMVGLSSANLDIDVYNDA+
Subjt:  GGPNVLYPGDIIPFTRRPVFLIVDSNNSHAFK----AERGETAAVLLSPLRPAFKNPLNVDTIQSGSQFTFFLTAPLPAFCEMVGLSSANLDIDVYNDAE

Query:  TIISSAFSDWEIILCTSTSLNIVWAQVLSDHFLRRLILRFIFCRSVLSFFNTKEDDDLPVCLPCLPDSVSSNSGVVSSAIRRLAKHLNAA
        TI+SSAFSDWEIILCTSTSLNIVWAQVLSDHFLRRLILRFIFCRSVLSFFNTKEDDDLPVCLPCLPDSVSSNSGVVSSAIRRLAKHLN A
Subjt:  TIISSAFSDWEIILCTSTSLNIVWAQVLSDHFLRRLILRFIFCRSVLSFFNTKEDDDLPVCLPCLPDSVSSNSGVVSSAIRRLAKHLNAA

A0A1S3BX87 protein SCAI0.0e+0095.25Show/hide
Query:  MTDNDSEAKSFRAMVESANRKFARVQDVPAYGRVDNHHYFHKVFKAYMRLWKYQQEFRAKLVESGLNRWEIGEIASRIGQLYFGHYMRTSEARFLIEAYV
        MTDNDSEAK+FRAMVE+ANRKFARVQDVPAYGRVDNHHYFHKVFKAYMRLWKYQQEFRAKLVESGLNRWEIGEIASRIGQLYFGHYMRTSEARFLIEAYV
Subjt:  MTDNDSEAKSFRAMVESANRKFARVQDVPAYGRVDNHHYFHKVFKAYMRLWKYQQEFRAKLVESGLNRWEIGEIASRIGQLYFGHYMRTSEARFLIEAYV

Query:  FYEAILNRSYFEGSKNSRKDLGARFKELRFYARFLLVSLFLNRTDTVQVLAERLKALVDDSKATFRATDFKEWRLVVQEIFCFMNVATASMNVRPLRYSA
        FYEAILNRSYFEGSKNSRKDLGARFKELRFYARFLLVSL LNRTDTVQVLAERLKALVDDSKATFRATDFKEWRLVVQEIFCFMNVATAS NVRPLRYS 
Subjt:  FYEAILNRSYFEGSKNSRKDLGARFKELRFYARFLLVSLFLNRTDTVQVLAERLKALVDDSKATFRATDFKEWRLVVQEIFCFMNVATASMNVRPLRYSA

Query:  AFDSHPSSLPFVARFHAKRVLKFRDAVLTSYHRNEVKFAEITLDTYRMLQCLEWEPGFFYQKHPVEPNQNGAAIDHSGASGIIDINLATDITDPSLPPNP
        AFDSHP SLPFV RFHAKRVLKFRDAVLTSYHRNEVKFAEITLDTYRMLQCLEWEPGFFYQKHPVEPN+NGA IDHSGASGIIDINLATD+TDPSLPPNP
Subjt:  AFDSHPSSLPFVARFHAKRVLKFRDAVLTSYHRNEVKFAEITLDTYRMLQCLEWEPGFFYQKHPVEPNQNGAAIDHSGASGIIDINLATDITDPSLPPNP

Query:  KKAILYRPSVTHLIAVMATICEELHPDSIMLIYLSATGKCCQNSVNQMASYGESRKSLKNKVIAQNSRENCNALPESCKSEKPGSSDLYDEYLWFGHRGN
        KKAILYRPSVTHLIAVMAT+CEEL PDSIMLIYLSA GKCCQNSVNQM S+GESRKSLKNKV AQNSRENCNAL ESCK EKPGSSDLYDEYLWFGHRGN
Subjt:  KKAILYRPSVTHLIAVMATICEELHPDSIMLIYLSATGKCCQNSVNQMASYGESRKSLKNKVIAQNSRENCNALPESCKSEKPGSSDLYDEYLWFGHRGN

Query:  GGPNVLYPGDIIPFTRRPVFLIVDSNNSHAFK----AERGETAAVLLSPLRPAFKNPLNVDTIQSGSQFTFFLTAPLPAFCEMVGLSSANLDIDVYNDAE
        GGPNVLYPGDIIPFTRRPVFLIVDSNNSHAFK    AERGETAA+LLSPLRPAFKNPLNVDTIQSGSQFTFFLTAPLPAFCEMVGLSSANLDIDVYNDA+
Subjt:  GGPNVLYPGDIIPFTRRPVFLIVDSNNSHAFK----AERGETAAVLLSPLRPAFKNPLNVDTIQSGSQFTFFLTAPLPAFCEMVGLSSANLDIDVYNDAE

Query:  TIISSAFSDWEIILCTSTSLNIVWAQVLSDHFLRRLILRFIFCRSVLSFFNTKEDDDLPVCLPCLPDSVSSNSGVVSSAIRRLAKHLNAA
        TI+SSAFSDWEIILCTSTSLNIVWAQVLSDHFLRRLILRFIFCR+VLSFFNTKEDDDLP CLPCLPDSVSSNSGVVSSAIRRLAKHLN A
Subjt:  TIISSAFSDWEIILCTSTSLNIVWAQVLSDHFLRRLILRFIFCRSVLSFFNTKEDDDLPVCLPCLPDSVSSNSGVVSSAIRRLAKHLNAA

A0A5A7TRF5 Protein SCAI0.0e+0093.65Show/hide
Query:  MTDNDSEAKSFRAMVESANRKFARVQDVPAYGRVDNHHYFHKVFKAYMRLWKYQQEFRAKLVESGLNRWEIGEIASRIGQLYFGHYMRTSEARFLIEAYV
        MTDNDSEAK+FRAMVE+ANRKFARVQDVPAYGRVDNHHYFHKVFKAYMRLWKYQQEFRAKLVESGLNRWEIGEIASRIGQLYFGHYMRTSEARFLIEAYV
Subjt:  MTDNDSEAKSFRAMVESANRKFARVQDVPAYGRVDNHHYFHKVFKAYMRLWKYQQEFRAKLVESGLNRWEIGEIASRIGQLYFGHYMRTSEARFLIEAYV

Query:  FYEAILNRSYFEGSKNSRKDLGARFKELRFYARFLLVSLFLNRTDTVQVLAERLKALVDDSKATFRATDFKEWRLVVQEIFCFMNVATASMNVRPLRYSA
        FYEAILNRSYFEGSKNSRKDLGARFKELRFYARFLLVSL LNRTDTVQVLAERLKALVDDSKATFRATDFKEWRLVVQEIFCFMNVATAS NVRPLRYS 
Subjt:  FYEAILNRSYFEGSKNSRKDLGARFKELRFYARFLLVSLFLNRTDTVQVLAERLKALVDDSKATFRATDFKEWRLVVQEIFCFMNVATASMNVRPLRYSA

Query:  AFDSHPSSLPFVARFHAKRVLKFRDAVLTSYHRNEVKFAEITLDTYRMLQCLEWEPGFFYQKHPVEPNQNGAAIDHSGASGIIDINLATDITDPSLPPNP
        AFDSHP SLPFV RFHAKRVLKFRDAVLTSYHRNEVKFAEITLDTYRMLQCLEWEPGFFYQKHPVEPN+NGA IDHSGASGIIDINLATD+TDPSLPPNP
Subjt:  AFDSHPSSLPFVARFHAKRVLKFRDAVLTSYHRNEVKFAEITLDTYRMLQCLEWEPGFFYQKHPVEPNQNGAAIDHSGASGIIDINLATDITDPSLPPNP

Query:  KKAILYRPSVTHLIAVMATICEELHPDSIMLIYLSATGKCCQNSVNQMASYGESRKSLKNKVIAQNSRENCNALPESCKSEKPGSSDLYDEYLWFGHRGN
        KKAILYRPSVTHLIAVMAT+CEEL PDSIMLIYLSA GKCCQNSVNQM S+GESRKSLKNKV AQNSRENCNA  ESCK EKPGSSDLYDEYLWFGHRGN
Subjt:  KKAILYRPSVTHLIAVMATICEELHPDSIMLIYLSATGKCCQNSVNQMASYGESRKSLKNKVIAQNSRENCNALPESCKSEKPGSSDLYDEYLWFGHRGN

Query:  GGPNVLYPGDIIPFTRRPVFLIVDSNNSHAFK------------AERGETAAVLLSPLRPAFKNPLNVDTIQSGSQFTFFLTAPLPAFCEMVGLSSANLD
        GGPNVLYPGDIIPFTRRPVFLIVDSNNSHAFK            AERGETAA+LLSPLRPAFKNPLNVDTIQSGSQFTFFLTAPLPAFCEMVGLSSANLD
Subjt:  GGPNVLYPGDIIPFTRRPVFLIVDSNNSHAFK------------AERGETAAVLLSPLRPAFKNPLNVDTIQSGSQFTFFLTAPLPAFCEMVGLSSANLD

Query:  IDVYNDAETIISSAFSDWEIILCTSTSLNIVWAQVLSDHFLRRLILRFIFCRSVLSFFNTKEDDDLPVCLPCLPDSVSSNSGVVSSAIRRLAKHLNAA
        IDVYNDA+TI+SSAFSDWEIILCTSTSLNIVWAQVLSDHFLRRLILRFIFCR+VLSFFN KEDDDLP CLPCLPDSVSSNSGVVSSAIRRLAKHLN A
Subjt:  IDVYNDAETIISSAFSDWEIILCTSTSLNIVWAQVLSDHFLRRLILRFIFCRSVLSFFNTKEDDDLPVCLPCLPDSVSSNSGVVSSAIRRLAKHLNAA

A0A5D3CY13 Protein SCAI0.0e+0095.9Show/hide
Query:  MTDNDSEAKSFRAMVESANRKFARVQDVPAYGRVDNHHYFHKVFKAYMRLWKYQQEFRAKLVESGLNRWEIGEIASRIGQLYFGHYMRTSEARFLIEAYV
        MTDNDSEAK+FRAMVE+ANRKFARVQDVPAYGRVDNHHYFHKVFKAYMRLWKYQQEFRAKLVESGLNRWEIGEIASRIGQLYFGHYMRTSEARFLIEAYV
Subjt:  MTDNDSEAKSFRAMVESANRKFARVQDVPAYGRVDNHHYFHKVFKAYMRLWKYQQEFRAKLVESGLNRWEIGEIASRIGQLYFGHYMRTSEARFLIEAYV

Query:  FYEAILNRSYFEGSKNSRKDLGARFKELRFYARFLLVSLFLNRTDTVQVLAERLKALVDDSKATFRATDFKEWRLVVQEIFCFMNVATASMNVRPLRYSA
        FYEAILNRSYFEGSKNSRKDLGARFKELRFYARFLLVSL LNRTDTVQVLAERLKALVDDSKATFRATDFKEWRLVVQEIFCFMNVATAS NVRPLRYS 
Subjt:  FYEAILNRSYFEGSKNSRKDLGARFKELRFYARFLLVSLFLNRTDTVQVLAERLKALVDDSKATFRATDFKEWRLVVQEIFCFMNVATASMNVRPLRYSA

Query:  AFDSHPSSLPFVARFHAKRVLKFRDAVLTSYHRNEVKFAEITLDTYRMLQCLEWEPGFFYQKHPVEPNQNGAAIDHSGASGIIDINLATDITDPSLPPNP
        AFDSHP SLPFV RFHAKRVLKFRDAVLTSYHRNEVKFAEITLDTYRMLQCLEWEPGFFYQKHPVEPN+NGA IDHSGASGIIDINLATD+TDPSLPPNP
Subjt:  AFDSHPSSLPFVARFHAKRVLKFRDAVLTSYHRNEVKFAEITLDTYRMLQCLEWEPGFFYQKHPVEPNQNGAAIDHSGASGIIDINLATDITDPSLPPNP

Query:  KKAILYRPSVTHLIAVMATICEELHPDSIMLIYLSATGKCCQNSVNQMASYGESRKSLKNKVIAQNSRENCNALPESCKSEKPGSSDLYDEYLWFGHRGN
        KKAILYRPSVTHLIAVMAT+CEEL PDSIMLIYLSA GKCCQNSVNQM S+GESRKSLKNKV AQNSRENCNAL ESCK EKPGSSDLYDEYLWFGHRGN
Subjt:  KKAILYRPSVTHLIAVMATICEELHPDSIMLIYLSATGKCCQNSVNQMASYGESRKSLKNKVIAQNSRENCNALPESCKSEKPGSSDLYDEYLWFGHRGN

Query:  GGPNVLYPGDIIPFTRRPVFLIVDSNNSHAFKAERGETAAVLLSPLRPAFKNPLNVDTIQSGSQFTFFLTAPLPAFCEMVGLSSANLDIDVYNDAETIIS
        GGPNVLYPGDIIPFTRRPVFLIVDSNNSHAFKAERGETAA+LLSPLRPAFKNPLNVDTIQSGSQFTFFLTAPLPAFCEMVGLSSANLDIDVYNDA+TI+S
Subjt:  GGPNVLYPGDIIPFTRRPVFLIVDSNNSHAFKAERGETAAVLLSPLRPAFKNPLNVDTIQSGSQFTFFLTAPLPAFCEMVGLSSANLDIDVYNDAETIIS

Query:  SAFSDWEIILCTSTSLNIVWAQVLSDHFLRRLILRFIFCRSVLSFFNTKEDDDLPVCLPCLPDSVSSNSGVVSSAIRRLAKHLNAA
        SAFSDWEIILCTSTSLNIVWAQVLSDHFLRRLILRFIFCR+VLSFFNTKEDDDLP CLPCLPDSVSSNSGVVSSAIRRLAKHLN A
Subjt:  SAFSDWEIILCTSTSLNIVWAQVLSDHFLRRLILRFIFCRSVLSFFNTKEDDDLPVCLPCLPDSVSSNSGVVSSAIRRLAKHLNAA

A0A6J1EMW7 protein SCAI-like0.0e+0094.08Show/hide
Query:  MTDNDSEAKSFRAMVESANRKFARVQDVPAYGRVDNHHYFHKVFKAYMRLWKYQQEFRAKLVESGLNRWEIGEIASRIGQLYFGHYMRTSEARFLIEAYV
        MTDN+S AK+FRA+VESANRKFARVQDVPAYGR+D++HYFHKVFKAYMRLWK+QQE+RAKLVESGLNRWEIGEIASRIGQLYFGHYMRTSEARFLIEAY+
Subjt:  MTDNDSEAKSFRAMVESANRKFARVQDVPAYGRVDNHHYFHKVFKAYMRLWKYQQEFRAKLVESGLNRWEIGEIASRIGQLYFGHYMRTSEARFLIEAYV

Query:  FYEAILNRSYFEGSKNSRKDLGARFKELRFYARFLLVSLFLNRTDTVQVLAERLKALVDDSKATFRATDFKEWRLVVQEIFCFMNVATASMNVRPLRYSA
        FYEAILNRSYFEGSKNSRKDLGARFKELRFYARFL+VSL LNRTDTVQVLAERLKALVDDSKATFRATDFKEWRLVVQEIFCFMNVATASMNVRPLRYS 
Subjt:  FYEAILNRSYFEGSKNSRKDLGARFKELRFYARFLLVSLFLNRTDTVQVLAERLKALVDDSKATFRATDFKEWRLVVQEIFCFMNVATASMNVRPLRYSA

Query:  AFDSHPSSLPFVARFHAKRVLKFRDAVLTSYHRNEVKFAEITLDTYRMLQCLEWEPGFFYQKHPVEPNQNGAAIDHSGASGIIDINLATDITDPSLPPNP
        AFDSHPSSLPFVARFHAKRVLKFRDAVLTSYHRNEVKFAEITLDTYRMLQCLEWEPGFFYQKHPVEPN+NGAAID+SGASGIIDINL+TDITDPSLPPNP
Subjt:  AFDSHPSSLPFVARFHAKRVLKFRDAVLTSYHRNEVKFAEITLDTYRMLQCLEWEPGFFYQKHPVEPNQNGAAIDHSGASGIIDINLATDITDPSLPPNP

Query:  KKAILYRPSVTHLIAVMATICEELHPDSIMLIYLSATGKCCQNSVNQMASY-GESRKSLKNKVIAQNSRENCNALPESCKSEKPGSSDLYDEYLWFGHRG
        KKAILYRPSVTHLIAVMATICEEL PDSIMLIYLSA GKCCQNSV+Q ASY GESRKSLK+KVIAQNSRENCNALPESCKS+KPGSSDLYDEYLWFGHRG
Subjt:  KKAILYRPSVTHLIAVMATICEELHPDSIMLIYLSATGKCCQNSVNQMASY-GESRKSLKNKVIAQNSRENCNALPESCKSEKPGSSDLYDEYLWFGHRG

Query:  NGGPNVLYPGDIIPFTRRPVFLIVDSNNSHAFK----AERGETAAVLLSPLRPAFKNPLNVDTIQSGSQFTFFLTAPLPAFCEMVGLSSANLDIDVYNDA
        NGGPNVLYPGDIIPFTRRPVFLIVDSNNSHAFK    AERGETAA+LLSPLRPAFKNPLNVDTIQSGSQFTFFLTAPLPAFCEMVGLSSANLDIDVYNDA
Subjt:  NGGPNVLYPGDIIPFTRRPVFLIVDSNNSHAFK----AERGETAAVLLSPLRPAFKNPLNVDTIQSGSQFTFFLTAPLPAFCEMVGLSSANLDIDVYNDA

Query:  ETIISSAFSDWEIILCTSTSLNIVWAQVLSDHFLRRLILRFIFCRSVLSFFNTKEDDDLPVCLPCLPDSVSSNSGVVSSAIRRLAKHLNAA
        ETIISSAFS+WEIILCTSTSLNIVWAQVLSDHFLRRLILRFIFCRSVLSFFNTKEDDDLP+CLPCLPDS+SSN GVV SAIRRLA HLN A
Subjt:  ETIISSAFSDWEIILCTSTSLNIVWAQVLSDHFLRRLILRFIFCRSVLSFFNTKEDDDLPVCLPCLPDSVSSNSGVVSSAIRRLAKHLNAA

SwissProt top hitse value%identityAlignment
P59169 Histone H3.34.1e-23100Show/hide
Query:  AARKSAPTTGGVKKPHRYRPGTVALREIRKYQKSTELLIRKLPFQRLVREIAQDFK
        AARKSAPTTGGVKKPHRYRPGTVALREIRKYQKSTELLIRKLPFQRLVREIAQDFK
Subjt:  AARKSAPTTGGVKKPHRYRPGTVALREIRKYQKSTELLIRKLPFQRLVREIAQDFK

Q54YY1 Protein SCAI homolog1.7e-5327.74Show/hide
Query:  KSFRAMVESANRKFARVQDVPAYGRVDNHHYFHKVFKAYMRLWKYQQEFRAKLVES---GLNRWEIGEIASRIGQLYFGHYMRTSEARFLIEAYVFYEAI
        K+F  ++  + R F  ++D+P +GR     +F K F+ Y +LWK+QQ++R+ L +    GL R EIGEIAS+IGQLY+ +Y+RTS+  +L E+Y+FYEAI
Subjt:  KSFRAMVESANRKFARVQDVPAYGRVDNHHYFHKVFKAYMRLWKYQQEFRAKLVES---GLNRWEIGEIASRIGQLYFGHYMRTSEARFLIEAYVFYEAI

Query:  LNRSYFEG-SKNSRKDLGARFKELRFYARFLLVSLFLNRTDTVQVLAERLKALVDDSKATFRATDFKEWRLVVQEIFCFMNV---ATAS-MNVRP-LRYS
          RSYF+  S +   D+    K+LR+YARF++V L LN+   V  L E L   V+D    ++ +D +EW LV+QEIF F+     AT S  N  P L  S
Subjt:  LNRSYFEG-SKNSRKDLGARFKELRFYARFLLVSLFLNRTDTVQVLAERLKALVDDSKATFRATDFKEWRLVVQEIFCFMNV---ATAS-MNVRP-LRYS

Query:  AAFDSHPSSLPFVARFHAKRVLK---------------------FRDAVLTSYHRNEVKFAEITLDTYRMLQCLEWEPGFFYQKHPVEPNQNGAAI---D
            ++ S+       H    ++                      + A+L    +N++KF+EITLD +RM Q LE+EP    +++ ++  Q   A+    
Subjt:  AAFDSHPSSLPFVARFHAKRVLK---------------------FRDAVLTSYHRNEVKFAEITLDTYRMLQCLEWEPGFFYQKHPVEPNQNGAAI---D

Query:  HSGASGIIDINLATDITDPSLPP-------------------------NPKKAILYRPSVTHLIAVMATICEELHPDSIMLIYLSATGKCCQ-NSVNQ--
           A+   + N  TD  + ++P                          NP K +LYRP+++ ++  ++   +EL  +  ML+Y+ A G   + N VNQ  
Subjt:  HSGASGIIDINLATDITDPSLPP-------------------------NPKKAILYRPSVTHLIAVMATICEELHPDSIMLIYLSATGKCCQ-NSVNQ--

Query:  -----------------------MASYGESRKSLKNKVI---------------------------------------AQNSRENCNALPESCKSEKPGS
                               + +   +   L NK+                                        A N          +  S    +
Subjt:  -----------------------MASYGESRKSLKNKVI---------------------------------------AQNSRENCNALPESCKSEKPGS

Query:  SDLYDEYLWFGHRGNGGPNVLYPGDIIPFTRRPVFLIVDSNNSHAFKAERGETAAVLLSPLRPAFKNPLNVDTIQSGSQFTFFLTAPLPAFCEMVGLSSA
        +     Y +     N     LYP D++PF R+P FLIV+S +S  F  E        +S L P       V  ++ G+ FTFFL  P+ AFC+   +S  
Subjt:  SDLYDEYLWFGHRGNGGPNVLYPGDIIPFTRRPVFLIVDSNNSHAFKAERGETAAVLLSPLRPAFKNPLNVDTIQSGSQFTFFLTAPLPAFCEMVGLSSA

Query:  N-LDIDVYNDAETIISSAFSDWEIILCTSTSLNIVWAQVLSDHFLRRLILRFIFCRSVLSFFNTKEDDDLPV-CLPCLPDSVSSNSGVVSSAIRRLAKHL
        N +    +N+   +  ++      +L     L+  ++  L D FLR  I+RFIFC +        +D+   V   P LP S+  N  ++ S I +L   L
Subjt:  N-LDIDVYNDAETIISSAFSDWEIILCTSTSLNIVWAQVLSDHFLRRLILRFIFCRSVLSFFNTKEDDDLPV-CLPCLPDSVSSNSGVVSSAIRRLAKHL

Query:  NAA
        + +
Subjt:  NAA

Q6RUR1 Histone H3.34.1e-23100Show/hide
Query:  AARKSAPTTGGVKKPHRYRPGTVALREIRKYQKSTELLIRKLPFQRLVREIAQDFK
        AARKSAPTTGGVKKPHRYRPGTVALREIRKYQKSTELLIRKLPFQRLVREIAQDFK
Subjt:  AARKSAPTTGGVKKPHRYRPGTVALREIRKYQKSTELLIRKLPFQRLVREIAQDFK

Q8C8N2 Protein SCAI3.7e-6432.08Show/hide
Query:  FRAMVESANRKFARVQDVPAYGRVDNHHYFHKVFKAYMRLWKYQQEFRAKLVES-GLNRWEIGEIASRIGQLYFGHYMRTSEARFLIEAYVFYEAILNRS
        F  +++ + + F  ++D+P YG+     YF + F  Y +LWK+QQ+ R  L    GL RW+IGEIAS+IGQLY+ +Y+RTSE  +L EA+ FY AI  RS
Subjt:  FRAMVESANRKFARVQDVPAYGRVDNHHYFHKVFKAYMRLWKYQQEFRAKLVES-GLNRWEIGEIASRIGQLYFGHYMRTSEARFLIEAYVFYEAILNRS

Query:  YF-EGSKNSRKDLGARFKELRFYARFLLVSLFLNRTDTVQVLAERLKALVDDSKATFRATDFKEWRLVVQEIFCFMNV-ATASMNVRPLRYSAAFDSHPS
        Y+ + +K  R +L    K+LR+YARF++V L LN+ D V+ L + L   ++D    F   D  EW LV+QE+  F+       +N        +     +
Subjt:  YF-EGSKNSRKDLGARFKELRFYARFLLVSLFLNRTDTVQVLAERLKALVDDSKATFRATDFKEWRLVVQEIFCFMNV-ATASMNVRPLRYSAAFDSHPS

Query:  SLPFVARFHAKRVLKFRDAVLTSYHRNEVKFAEITLDTYRMLQCLEWEPGFFYQKHPVEPNQNGAAIDHSGASGIIDINLATDITDPSLPPNPKKAILYR
          P + +      L   DA++     N+VKF+E+T+D +RMLQ LE EP                    + AS +    +      P+   NP K +LY+
Subjt:  SLPFVARFHAKRVLKFRDAVLTSYHRNEVKFAEITLDTYRMLQCLEWEPGFFYQKHPVEPNQNGAAIDHSGASGIIDINLATDITDPSLPPNPKKAILYR

Query:  PSVTHLIAVMATICEELHPDSIMLIYLSATGKCCQNSVNQMASYGESRKSLKNKVIAQNSRE--NCNALPESCKSEKPGSSDLYDEYLWFGHRGNGGPNV
        P+ + L   +A   +EL  +S++LIYLSATG       +    Y          V+  ++R+  N +A+ +  +S K                     + 
Subjt:  PSVTHLIAVMATICEELHPDSIMLIYLSATGKCCQNSVNQMASYGESRKSLKNKVIAQNSRE--NCNALPESCKSEKPGSSDLYDEYLWFGHRGNGGPNV

Query:  LYPGDIIPFTRRPVFLIVDSNNSHAFKAER---GETAAVLLSPLRPAFKNPLNVDTIQSGSQFTFFLTAPLPAFCEMVGLSSANLDIDVYNDAETIISSA
        L+PGD+ PFTR+P+F++VDS+NS A+K      G+    LLSP   A+   L  D  Q GS FT FL  PL AF  + GLSS    +  +   +  +   
Subjt:  LYPGDIIPFTRRPVFLIVDSNNSHAFKAER---GETAAVLLSPLRPAFKNPLNVDTIQSGSQFTFFLTAPLPAFCEMVGLSSANLDIDVYNDAETIISSA

Query:  FSDWEIILCTSTSLNIVWAQVLSDHFLRRLILRFIFCRSVLSFFNT-KEDDDLPVCLPCLPDSVSSNSGVVSSAIRRLAKHLN
          D   +L  S S++  + Q   D FLR L+ RF+FC + +      +E  + P   P LP   +  +  +   I  LA  L+
Subjt:  FSDWEIILCTSTSLNIVWAQVLSDHFLRRLILRFIFCRSVLSFFNT-KEDDDLPVCLPCLPDSVSSNSGVVSSAIRRLAKHLN

Q8N9R8 Protein SCAI1.3e-6432.42Show/hide
Query:  FRAMVESANRKFARVQDVPAYGRVDNHHYFHKVFKAYMRLWKYQQEFRAKLVES-GLNRWEIGEIASRIGQLYFGHYMRTSEARFLIEAYVFYEAILNRS
        F  +++ + + F  ++D+P YG+     YF + F  Y +LWK+QQ+ R  L    GL RW+IGEIAS+IGQLY+ +Y+RTSE  +L EA+ FY AI  RS
Subjt:  FRAMVESANRKFARVQDVPAYGRVDNHHYFHKVFKAYMRLWKYQQEFRAKLVES-GLNRWEIGEIASRIGQLYFGHYMRTSEARFLIEAYVFYEAILNRS

Query:  YF-EGSKNSRKDLGARFKELRFYARFLLVSLFLNRTDTVQVLAERLKALVDDSKATFRATDFKEWRLVVQEIFCFMNV-ATASMNVRPLRYSAAFDSHPS
        Y+ + +K  R +L    K+LR+YARF++V L LN+ D V+ L + L   ++D    F   D  EW LV+QE+  F+       +N        +     +
Subjt:  YF-EGSKNSRKDLGARFKELRFYARFLLVSLFLNRTDTVQVLAERLKALVDDSKATFRATDFKEWRLVVQEIFCFMNV-ATASMNVRPLRYSAAFDSHPS

Query:  SLPFVARFHAKRVLKFRDAVLTSYHRNEVKFAEITLDTYRMLQCLEWEPGFFYQKHPVEPNQNGAAIDHSGASGIIDINLATDITDPSLPPNPKKAILYR
          P + +      L   DA++     N+VKF+E+T+D +RMLQ LE EP                    + AS +    +      P+   NP K +LY+
Subjt:  SLPFVARFHAKRVLKFRDAVLTSYHRNEVKFAEITLDTYRMLQCLEWEPGFFYQKHPVEPNQNGAAIDHSGASGIIDINLATDITDPSLPPNPKKAILYR

Query:  PSVTHLIAVMATICEELHPDSIMLIYLSATGKCCQNSVNQMASYGESRKSLKNKVIAQNSRE--NCNALPESCKSEKPGSSDLYDEYLWFGHRGNGGPNV
        P+ + L   +A   +EL  +S++LIYLSATG       +    Y          V+  ++R+  N +A+ +  +S K                     + 
Subjt:  PSVTHLIAVMATICEELHPDSIMLIYLSATGKCCQNSVNQMASYGESRKSLKNKVIAQNSRE--NCNALPESCKSEKPGSSDLYDEYLWFGHRGNGGPNV

Query:  LYPGDIIPFTRRPVFLIVDSNNSHAFKAER---GETAAVLLSPLRPAFKNPLNVDTIQSGSQFTFFLTAPLPAFCEMVGLSSANLDIDVYNDAETIISSA
        L+PGD+ PFTR+P+F+IVDS+NS A+K      G+    LLSP   A+   L  D  Q GS FT FL  PL AF  + GLSS    +  +   +  +   
Subjt:  LYPGDIIPFTRRPVFLIVDSNNSHAFKAER---GETAAVLLSPLRPAFKNPLNVDTIQSGSQFTFFLTAPLPAFCEMVGLSSANLDIDVYNDAETIISSA

Query:  FSDWEIILCTSTSLNIVWAQVLSDHFLRRLILRFIFCRSVLSFFNT-KEDDDLPVCLPCLPDSVSSNSGVVSSAIRRLAKHLN
          D   +L  S S++  + Q   D FLR L+ RFIFC + +      +E  + P   P LP   +  +  +   I  LA  L+
Subjt:  FSDWEIILCTSTSLNIVWAQVLSDHFLRRLILRFIFCRSVLSFFNT-KEDDDLPVCLPCLPDSVSSNSGVVSSAIRRLAKHLN

Arabidopsis top hitse value%identityAlignment
AT3G03570.1 Protein of unknown function (DUF3550/UPF0682)4.8e-13645.53Show/hide
Query:  AMVESANRKFARVQDVPAYGRVDNHHYFHKVFKAYMRLWKYQQEFRAKLVESGLNRWEIGEIASRIGQLYFGHYMRTSEARFLIEAYVFYEAILNRSYFE
        ++V  A++KF++++D+P Y R    +YF KVFK Y +LWK+QQE R KLVE+GL RWEIGEIASRI QLY+GHYMRTS+A +L E+YVFYEAIL R YF+
Subjt:  AMVESANRKFARVQDVPAYGRVDNHHYFHKVFKAYMRLWKYQQEFRAKLVESGLNRWEIGEIASRIGQLYFGHYMRTSEARFLIEAYVFYEAILNRSYFE

Query:  GSKNSRKDLGARFKELRFYARFLLVSLFLNRTDTVQVLAERLKALVDDSKATFRATDFKEWRLVVQEIFCFMNVATASMNVRPLRYSAAFDSHPSSLPFV
              +DL    K+LRF ARFL+V L L R + V  L ++ K L+D+ K TF+ TDFKEW++V QEI  F+   TA MN+RPLRYS   D +  +    
Subjt:  GSKNSRKDLGARFKELRFYARFLLVSLFLNRTDTVQVLAERLKALVDDSKATFRATDFKEWRLVVQEIFCFMNVATASMNVRPLRYSAAFDSHPSSLPFV

Query:  ARFHAKRVLKFRDAVLTSYHRNEVKFAEITLDTYRMLQCLEWEP-GFFYQKHPVEPNQNGAAIDHSGASGIIDINLATDITDPSLPPNPKKAILYRPSVT
            A R L+  DA+L+SY+ NEVK++E+TLD++RMLQCLEWEP G  YQ         GA +  +   G+  IN +  + DP+LPPNP+KA+LYRPS+T
Subjt:  ARFHAKRVLKFRDAVLTSYHRNEVKFAEITLDTYRMLQCLEWEP-GFFYQKHPVEPNQNGAAIDHSGASGIIDINLATDITDPSLPPNPKKAILYRPSVT

Query:  HLIAVMATICEELHPDSIMLIYLSATGKCCQNSVNQMASYGESR------KSLKNKVIAQNSRENCNALPESCKSEKPGSSDLYDE--YLWFGHRGNGGP
        H +AV+ATICEEL    I+L+YLSA+GK  Q S + +++   +       +  ++  I Q +  +    P S +S +  S D       L FG  G  G 
Subjt:  HLIAVMATICEELHPDSIMLIYLSATGKCCQNSVNQMASYGESR------KSLKNKVIAQNSRENCNALPESCKSEKPGSSDLYDE--YLWFGHRGNGGP

Query:  NVLYPGDIIPFTRRPVFLIVDSNNSHAFK----AERGETAAVLLSPLRPAFKNPLNVDTIQSGSQFTFFLTAPLPAFCEMVGLSSANLDIDVYNDAETII
        + +YP D++PFTR+P+F+I+DS++S  FK    AE+GE AA+LLSP         +     SGS FT FLT+P+ AFC +  +S+++++ D++  AE ++
Subjt:  NVLYPGDIIPFTRRPVFLIVDSNNSHAFK----AERGETAAVLLSPLRPAFKNPLNVDTIQSGSQFTFFLTAPLPAFCEMVGLSSANLDIDVYNDAETII

Query:  SSAFSDWEIILCTSTSLNIVWAQVLSDHFLRRLILRFIFCRSVLSF----FNTKEDDDLPVCLPCLPDSVSSNSGVVSSAIRRLAKHLNAARK
        SS+ ++W   L TS +L+ VW+Q+L D FLRRL+LRFIFCR+VL+     FN K++   P C P LP+S+   +  V SA+ ++A    A  K
Subjt:  SSAFSDWEIILCTSTSLNIVWAQVLSDHFLRRLILRFIFCRSVLSF----FNTKEDDDLPVCLPCLPDSVSSNSGVVSSAIRRLAKHLNAARK

AT4G40030.1 Histone superfamily protein2.9e-24100Show/hide
Query:  AARKSAPTTGGVKKPHRYRPGTVALREIRKYQKSTELLIRKLPFQRLVREIAQDFK
        AARKSAPTTGGVKKPHRYRPGTVALREIRKYQKSTELLIRKLPFQRLVREIAQDFK
Subjt:  AARKSAPTTGGVKKPHRYRPGTVALREIRKYQKSTELLIRKLPFQRLVREIAQDFK

AT4G40040.1 Histone superfamily protein2.9e-24100Show/hide
Query:  AARKSAPTTGGVKKPHRYRPGTVALREIRKYQKSTELLIRKLPFQRLVREIAQDFK
        AARKSAPTTGGVKKPHRYRPGTVALREIRKYQKSTELLIRKLPFQRLVREIAQDFK
Subjt:  AARKSAPTTGGVKKPHRYRPGTVALREIRKYQKSTELLIRKLPFQRLVREIAQDFK

AT4G40040.2 Histone superfamily protein2.9e-24100Show/hide
Query:  AARKSAPTTGGVKKPHRYRPGTVALREIRKYQKSTELLIRKLPFQRLVREIAQDFK
        AARKSAPTTGGVKKPHRYRPGTVALREIRKYQKSTELLIRKLPFQRLVREIAQDFK
Subjt:  AARKSAPTTGGVKKPHRYRPGTVALREIRKYQKSTELLIRKLPFQRLVREIAQDFK

AT4G40050.1 Protein of unknown function (DUF3550/UPF0682)4.0e-19960.95Show/hide
Query:  AKSFRAMVESANRKFARVQDVPAYGRVDNHHYFHKVFKAYMRLWKYQQEFRAKLVESGLNRWEIGEIASRIGQLYFGHYMRTSEARFLIEAYVFYEAILN
        + +FRA+VE+A+RKFARV+D+PA+GR  + HYF KVFKAYM+LW YQQ  R+KLVESGLNRWEIGEIASRIGQLYF  YMRTSEARFL+EA+VFYEAIL 
Subjt:  AKSFRAMVESANRKFARVQDVPAYGRVDNHHYFHKVFKAYMRLWKYQQEFRAKLVESGLNRWEIGEIASRIGQLYFGHYMRTSEARFLIEAYVFYEAILN

Query:  RSYFEGSKNSRKDLGARFKELRFYARFLLVSLFLNRTDTVQVLAERLKALVDDSKATFRATDFKEWRLVVQEIFCFMNVATASMNVRPLRYSAAFDSHPS
        RSYF+ ++   KDLGARFKELRFYARFLLVSL ++R   +  LA++L+ LVD S + FR T+FKEWRLVVQEI  F+   T    +RPLRY A  DS+P+
Subjt:  RSYFEGSKNSRKDLGARFKELRFYARFLLVSLFLNRTDTVQVLAERLKALVDDSKATFRATDFKEWRLVVQEIFCFMNVATASMNVRPLRYSAAFDSHPS

Query:  SLPFVARFHAKRVLKFRDAVLTSYHRNEVKFAEITLDTYRMLQCLEWEP-GFFYQKHPVEPNQNGAAIDHSGASGIIDINLATDITDPSLPPNPKKAILY
        S  ++ARFHAK++ KFRDA+L SYHRNEVK+AE+TLDTYRM+QCLEWEP G FYQK PVE  +NG  +DH+  SG+ID+NLA D+ DPSLPPNP+KAILY
Subjt:  SLPFVARFHAKRVLKFRDAVLTSYHRNEVKFAEITLDTYRMLQCLEWEP-GFFYQKHPVEPNQNGAAIDHSGASGIIDINLATDITDPSLPPNPKKAILY

Query:  RPSVTHLIAVMATICEELHPDSIMLIYLSATGKCCQNSVNQMASYGESRKSLKNKVIAQNSRENCNALPESCKSEKPGSSDLYDEYLWFGHR-GNGGPNV
        RP+V+HL+AV+A IC+EL P+++ML+YLSA+G   + +V Q  +   S ++ K+K++A+ S+E  +   E   + K  S++ Y+ +LW G R G+ G N 
Subjt:  RPSVTHLIAVMATICEELHPDSIMLIYLSATGKCCQNSVNQMASYGESRKSLKNKVIAQNSRENCNALPESCKSEKPGSSDLYDEYLWFGHR-GNGGPNV

Query:  LYPGDIIPFTRRPVFLIVDSNNSHAFK----AERGETAAVLLSPLRPAFKNPLNVDT-IQSGSQFTFFLTAPLPAFCEMVGLSSANLDIDVYNDAETIIS
        LYPGD+IPFTR+P+FLI+DS+ S AFK    AERGE  A+LLSPL+P+F+NP   DT   +GSQFTFFLTAPL AFC+M+GLS+   D ++ ++AE+I+S
Subjt:  LYPGDIIPFTRRPVFLIVDSNNSHAFK----AERGETAAVLLSPLRPAFKNPLNVDT-IQSGSQFTFFLTAPLPAFCEMVGLSSANLDIDVYNDAETIIS

Query:  SAFSDWEIILCTSTSLNIVWAQVLSDHFLRRLILRFIFCRSVLSFFNTKEDDD--LPVCLPCLPDSVSSNSGVVSSAIRRLAKHLNAAR
        ++FS+WE IL TS  LN+VWAQVL D FLRRLILRFIFCRSVL+ F+  EDDD  LP C P LP+ +SS S  V S+++RLA+HL  A+
Subjt:  SAFSDWEIILCTSTSLNIVWAQVLSDHFLRRLILRFIFCRSVLSFFNTKEDDD--LPVCLPCLPDSVSSNSGVVSSAIRRLAKHLNAAR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACCGACAACGACTCTGAAGCGAAGAGCTTCCGAGCTATGGTGGAGAGCGCCAACCGGAAGTTCGCTAGGGTCCAAGACGTTCCGGCCTACGGACGTGTGGACAACCA
CCACTACTTTCATAAGGTTTTCAAGGCTTATATGCGTCTCTGGAAGTACCAGCAGGAATTCCGCGCCAAGCTTGTTGAATCTGGTCTCAACCGCTGGGAAATTGGCGAGA
TCGCTAGCCGGATCGGTCAGCTTTACTTTGGGCATTATATGAGAACTAGTGAGGCCAGGTTCTTGATTGAAGCCTATGTCTTCTATGAAGCGATTCTTAATCGAAGCTAT
TTTGAGGGATCGAAGAATTCGAGAAAGGATCTGGGGGCGAGGTTCAAGGAACTGAGGTTTTACGCGAGGTTTTTGCTGGTTTCGTTGTTTTTGAATCGTACGGATACCGT
TCAGGTTCTCGCGGAACGATTGAAGGCTCTGGTCGACGATAGCAAGGCCACTTTTCGGGCTACTGACTTTAAAGAGTGGAGGCTAGTTGTACAAGAAATTTTCTGCTTCA
TGAATGTAGCAACAGCCTCGATGAATGTCAGACCTTTACGTTACTCTGCTGCATTTGATTCCCATCCGTCATCCCTTCCATTTGTGGCACGTTTCCATGCAAAGAGGGTT
CTTAAATTCCGAGATGCCGTTCTGACTAGCTACCACCGAAATGAGGTTAAATTTGCGGAAATTACTTTGGACACTTATAGAATGCTACAATGTCTAGAATGGGAGCCTGG
TTTCTTCTACCAAAAGCATCCAGTTGAACCAAATCAAAATGGGGCTGCCATTGATCATTCTGGTGCGTCTGGAATAATTGATATTAACCTAGCTACTGATATTACCGATC
CATCTTTACCTCCAAATCCAAAGAAAGCTATCCTCTATCGACCTTCTGTGACTCATTTGATAGCTGTCATGGCCACAATTTGTGAGGAGCTCCATCCAGATAGTATCATG
CTGATTTATCTATCTGCAACAGGGAAATGCTGTCAAAACAGTGTCAATCAAATGGCGAGTTATGGGGAATCAAGAAAATCCTTGAAAAATAAAGTCATCGCCCAGAATTC
ACGAGAAAATTGTAATGCTCTACCCGAATCTTGTAAGAGTGAGAAGCCAGGATCGAGCGACCTTTATGATGAGTATTTGTGGTTTGGGCATAGGGGTAATGGAGGTCCAA
ACGTTTTATACCCTGGTGATATAATACCTTTTACACGTAGACCTGTTTTCTTGATAGTTGACAGTAACAACAGCCATGCATTCAAGGCAGAGAGAGGAGAAACTGCCGCC
GTACTTCTTTCACCATTGAGGCCTGCATTCAAGAATCCCTTAAATGTTGATACAATTCAATCAGGAAGTCAGTTTACTTTTTTCTTGACTGCCCCTCTGCCGGCATTTTG
TGAAATGGTTGGCCTGTCCTCGGCCAATTTGGATATAGATGTTTACAATGATGCTGAGACCATAATCTCCTCCGCATTTTCCGATTGGGAAATAATTCTTTGTACATCAA
CTAGCTTAAATATCGTTTGGGCCCAAGTTTTGTCTGATCATTTTTTACGCCGTCTCATTCTCAGATTTATATTCTGCCGATCCGTGCTATCGTTCTTCAACACTAAAGAA
GACGACGACCTTCCTGTTTGCCTGCCTTGTCTTCCCGACTCTGTTTCTTCAAATTCTGGAGTTGTCAGCTCAGCAATTCGCCGTCTCGCAAAGCACCTTAACGCTGCTCG
TAAGTCTGCCCCAACCACAGGAGGAGTGAAGAAGCCCCACAGATACCGTCCTGGTACCGTTGCTCTTCGTGAAATTCGTAAGTATCAGAAGAGTACTGAGCTTTTGATCA
GGAAGTTGCCCTTCCAGAGGTTGGTTCGTGAAATTGCCCAGGACTTTAAGGATGGAGGAGCTTAA
mRNA sequenceShow/hide mRNA sequence
ATGACCGACAACGACTCTGAAGCGAAGAGCTTCCGAGCTATGGTGGAGAGCGCCAACCGGAAGTTCGCTAGGGTCCAAGACGTTCCGGCCTACGGACGTGTGGACAACCA
CCACTACTTTCATAAGGTTTTCAAGGCTTATATGCGTCTCTGGAAGTACCAGCAGGAATTCCGCGCCAAGCTTGTTGAATCTGGTCTCAACCGCTGGGAAATTGGCGAGA
TCGCTAGCCGGATCGGTCAGCTTTACTTTGGGCATTATATGAGAACTAGTGAGGCCAGGTTCTTGATTGAAGCCTATGTCTTCTATGAAGCGATTCTTAATCGAAGCTAT
TTTGAGGGATCGAAGAATTCGAGAAAGGATCTGGGGGCGAGGTTCAAGGAACTGAGGTTTTACGCGAGGTTTTTGCTGGTTTCGTTGTTTTTGAATCGTACGGATACCGT
TCAGGTTCTCGCGGAACGATTGAAGGCTCTGGTCGACGATAGCAAGGCCACTTTTCGGGCTACTGACTTTAAAGAGTGGAGGCTAGTTGTACAAGAAATTTTCTGCTTCA
TGAATGTAGCAACAGCCTCGATGAATGTCAGACCTTTACGTTACTCTGCTGCATTTGATTCCCATCCGTCATCCCTTCCATTTGTGGCACGTTTCCATGCAAAGAGGGTT
CTTAAATTCCGAGATGCCGTTCTGACTAGCTACCACCGAAATGAGGTTAAATTTGCGGAAATTACTTTGGACACTTATAGAATGCTACAATGTCTAGAATGGGAGCCTGG
TTTCTTCTACCAAAAGCATCCAGTTGAACCAAATCAAAATGGGGCTGCCATTGATCATTCTGGTGCGTCTGGAATAATTGATATTAACCTAGCTACTGATATTACCGATC
CATCTTTACCTCCAAATCCAAAGAAAGCTATCCTCTATCGACCTTCTGTGACTCATTTGATAGCTGTCATGGCCACAATTTGTGAGGAGCTCCATCCAGATAGTATCATG
CTGATTTATCTATCTGCAACAGGGAAATGCTGTCAAAACAGTGTCAATCAAATGGCGAGTTATGGGGAATCAAGAAAATCCTTGAAAAATAAAGTCATCGCCCAGAATTC
ACGAGAAAATTGTAATGCTCTACCCGAATCTTGTAAGAGTGAGAAGCCAGGATCGAGCGACCTTTATGATGAGTATTTGTGGTTTGGGCATAGGGGTAATGGAGGTCCAA
ACGTTTTATACCCTGGTGATATAATACCTTTTACACGTAGACCTGTTTTCTTGATAGTTGACAGTAACAACAGCCATGCATTCAAGGCAGAGAGAGGAGAAACTGCCGCC
GTACTTCTTTCACCATTGAGGCCTGCATTCAAGAATCCCTTAAATGTTGATACAATTCAATCAGGAAGTCAGTTTACTTTTTTCTTGACTGCCCCTCTGCCGGCATTTTG
TGAAATGGTTGGCCTGTCCTCGGCCAATTTGGATATAGATGTTTACAATGATGCTGAGACCATAATCTCCTCCGCATTTTCCGATTGGGAAATAATTCTTTGTACATCAA
CTAGCTTAAATATCGTTTGGGCCCAAGTTTTGTCTGATCATTTTTTACGCCGTCTCATTCTCAGATTTATATTCTGCCGATCCGTGCTATCGTTCTTCAACACTAAAGAA
GACGACGACCTTCCTGTTTGCCTGCCTTGTCTTCCCGACTCTGTTTCTTCAAATTCTGGAGTTGTCAGCTCAGCAATTCGCCGTCTCGCAAAGCACCTTAACGCTGCTCG
TAAGTCTGCCCCAACCACAGGAGGAGTGAAGAAGCCCCACAGATACCGTCCTGGTACCGTTGCTCTTCGTGAAATTCGTAAGTATCAGAAGAGTACTGAGCTTTTGATCA
GGAAGTTGCCCTTCCAGAGGTTGGTTCGTGAAATTGCCCAGGACTTTAAGGATGGAGGAGCTTAA
Protein sequenceShow/hide protein sequence
MTDNDSEAKSFRAMVESANRKFARVQDVPAYGRVDNHHYFHKVFKAYMRLWKYQQEFRAKLVESGLNRWEIGEIASRIGQLYFGHYMRTSEARFLIEAYVFYEAILNRSY
FEGSKNSRKDLGARFKELRFYARFLLVSLFLNRTDTVQVLAERLKALVDDSKATFRATDFKEWRLVVQEIFCFMNVATASMNVRPLRYSAAFDSHPSSLPFVARFHAKRV
LKFRDAVLTSYHRNEVKFAEITLDTYRMLQCLEWEPGFFYQKHPVEPNQNGAAIDHSGASGIIDINLATDITDPSLPPNPKKAILYRPSVTHLIAVMATICEELHPDSIM
LIYLSATGKCCQNSVNQMASYGESRKSLKNKVIAQNSRENCNALPESCKSEKPGSSDLYDEYLWFGHRGNGGPNVLYPGDIIPFTRRPVFLIVDSNNSHAFKAERGETAA
VLLSPLRPAFKNPLNVDTIQSGSQFTFFLTAPLPAFCEMVGLSSANLDIDVYNDAETIISSAFSDWEIILCTSTSLNIVWAQVLSDHFLRRLILRFIFCRSVLSFFNTKE
DDDLPVCLPCLPDSVSSNSGVVSSAIRRLAKHLNAARKSAPTTGGVKKPHRYRPGTVALREIRKYQKSTELLIRKLPFQRLVREIAQDFKDGGA