; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lsi07G001620 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene IDLsi07G001620
OrganismLagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
DescriptionHistone H3.2
Genome locationchr07:1716284..1727137
RNA-Seq ExpressionLsi07G001620
SyntenyLsi07G001620
Gene Ontology termsGO:0045892 - negative regulation of transcription, DNA-templated (biological process)
GO:0000786 - nucleosome (cellular component)
GO:0005634 - nucleus (cellular component)
GO:0003677 - DNA binding (molecular function)
GO:0003700 - DNA-binding transcription factor activity (molecular function)
GO:0003714 - transcription corepressor activity (molecular function)
GO:0046982 - protein heterodimerization activity (molecular function)
InterPro domainsIPR000164 - Histone H3/CENP-A
IPR007125 - Histone H2A/H2B/H3
IPR009072 - Histone-fold
IPR022709 - Protein SCAI


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6577288.1 Protein SCAI, partial [Cucurbita argyrosperma subsp. sororia]0.0e+0085.29Show/hide
Query:  MTDNDSEAKSFRAMVESANRKFARVQDVPAYGRVDNHHYFHKVFKAYMRLWKYQQEFRAKLVESGLNRWEIGEIASRIGQLYFGHYMRTSEARFLIEAYV
        MTDN+S AK+FRA+VESANRKFARVQDVPAYGR+D++HYFHKVFKAYMRLWK+QQE+RAKLVESGLNRWEIGEIASRIGQLYFGHYMRTSEARFLIEAY+
Subjt:  MTDNDSEAKSFRAMVESANRKFARVQDVPAYGRVDNHHYFHKVFKAYMRLWKYQQEFRAKLVESGLNRWEIGEIASRIGQLYFGHYMRTSEARFLIEAYV

Query:  FYEAILNRSYFEGSKNSRKDLGARFKELRFYARFLLVSLFLNRTDTVQVLAERLKALVDDSKATFRATDFKEWRLVVQEIFCFMNVATASMNVRPLRYSA
        FYEAILNRSYFEGSKNSRKDLGARFKELRFYARFL+VSLFLNRTDTVQVLAERLKALVDDSKATFRATDFKEWRLVVQEIFCFMN+ATASMNVRPLRYS 
Subjt:  FYEAILNRSYFEGSKNSRKDLGARFKELRFYARFLLVSLFLNRTDTVQVLAERLKALVDDSKATFRATDFKEWRLVVQEIFCFMNVATASMNVRPLRYSA

Query:  AFDSHPSSLPFVARFHAKRVLKFRDAVLTSYHRNEVKFAEITLDTYRMLQCLEWEPGFFYQKHPVEPNQNGAAIDHSGASGIIDINLATDITDPSLPPNP
        AFDSHPSSLPFVARFHAKRVLKFRDAVLTSYHRNEVKFAEITLDTYRMLQCLEWEPGFFYQKHPVEPN+NGAAID+SGASGIIDINL+TDITDPSLPPNP
Subjt:  AFDSHPSSLPFVARFHAKRVLKFRDAVLTSYHRNEVKFAEITLDTYRMLQCLEWEPGFFYQKHPVEPNQNGAAIDHSGASGIIDINLATDITDPSLPPNP

Query:  KKAILYRPSVTHLIAPARGNINSSFSELSTSICHHFWKPQFFSIAIHMLTKAFTVNSNAYHHPLLRTRICPNIFSVAIVTSGTNVYYGKERIKVEKVMAT
        KKAILYRPSVTHLIA                                                                                 VMAT
Subjt:  KKAILYRPSVTHLIAPARGNINSSFSELSTSICHHFWKPQFFSIAIHMLTKAFTVNSNAYHHPLLRTRICPNIFSVAIVTSGTNVYYGKERIKVEKVMAT

Query:  ICEELHPDSIMLIYLSATGKCCQNSVNQMASYGESRKSLKNKVIAQNSRENCNALPESCKSEKPGSSDLYDEYLWFGHRGNGGPNVLYPGDIIPFTRRPV
        ICEEL PDSIMLIYLSA GKCCQNSV+Q ASYGESRKSLK+KVI QNSRENCNALPESCKS+KPGSSDLYDEYLWFGHRGNGGPNVLYPGDIIPFTRRPV
Subjt:  ICEELHPDSIMLIYLSATGKCCQNSVNQMASYGESRKSLKNKVIAQNSRENCNALPESCKSEKPGSSDLYDEYLWFGHRGNGGPNVLYPGDIIPFTRRPV

Query:  FLIVDSNNSHAFK----AERGETAAVLLSPLRPAFKNPLNVDTIQSGSQFTFFLTAPLPAFCEMVGLSSANLDIDVYNDAETIISSAFSDWEIILCTSTS
        FLIVDSNNSHAFK    AERGETAA+LLSPLRPAFKNPLNVDTIQSGSQFTFFLTAPLPAFCEMVGLSSANLDIDVYNDAETIISSAFS+WEIILCTSTS
Subjt:  FLIVDSNNSHAFK----AERGETAAVLLSPLRPAFKNPLNVDTIQSGSQFTFFLTAPLPAFCEMVGLSSANLDIDVYNDAETIISSAFSDWEIILCTSTS

Query:  LNIVWAQVLSDHFLRRLILRFIFCRSVLSFFNTKEDDDLPVCLPCLPDSVSSNSGVVSSAIRRLAKHLN----MARTKQTARKSTGGKAPRKQLATKAAR
        LNIVWAQVLSDHFLRRLILRFIFCRSVLSFFNTKEDDDLP+CLPCLPDS+SSN GVV SAIRRLA HLN    MARTKQTARKSTGGKAPRKQLATKAAR
Subjt:  LNIVWAQVLSDHFLRRLILRFIFCRSVLSFFNTKEDDDLPVCLPCLPDSVSSNSGVVSSAIRRLAKHLN----MARTKQTARKSTGGKAPRKQLATKAAR

Query:  KSAPTTGGVKKPHRYRPGTVALREIRKYQKSTELLIRKLPFQRLVREIAQDFKTDLRFQSHAVLALQEAAEAYLVGLFEDTNLCAIHAKRVTIMPKDIQL
        KSAPTTGGVKKPHRYRPGTVALREIRKYQKSTELLIRKLPFQRLVREIAQDFKTDLRFQSHAVLALQEAAEAYLVGLFEDTNLCAIHAKRVTIMPKDIQL
Subjt:  KSAPTTGGVKKPHRYRPGTVALREIRKYQKSTELLIRKLPFQRLVREIAQDFKTDLRFQSHAVLALQEAAEAYLVGLFEDTNLCAIHAKRVTIMPKDIQL

Query:  ARRIRGERA
        ARRIRGERA
Subjt:  ARRIRGERA

KAG6600643.1 hypothetical protein SDJN03_05876, partial [Cucurbita argyrosperma subsp. sororia]0.0e+0076.19Show/hide
Query:  MTDNDSEAKSFRAMVESANRKFARVQDVPAYGRVDNHHYFHKVFKAYMRLWKYQQEFRAKLVESGLNRWEIGEIASRIGQLYFGHYMRTSEARFLIEAYV
        MTD++S AK+FRA+VESA+RKFARVQDVPAYGRVDNHHYFHKVFKAYMRLWK+QQEFRAKLVESGLNR EIGEIASRIGQLYFGHYMRTSEARFLIEAYV
Subjt:  MTDNDSEAKSFRAMVESANRKFARVQDVPAYGRVDNHHYFHKVFKAYMRLWKYQQEFRAKLVESGLNRWEIGEIASRIGQLYFGHYMRTSEARFLIEAYV

Query:  FYEAILNRSYFEGSKNSRKDLGARFKELRFYARFLLVSLFLNRTDTVQVLAERLKALVDDSKATFRATDFKEWRLVVQEIFCFMNVATASMNVRPLRYSA
        FYEAILNR+YFE SKNSRKDLGARFK+LRFYARFLLVSL LNRT TVQVLAERLKALVDDSKA FR TDFKEWRLVVQEIFCFM VAT SMN        
Subjt:  FYEAILNRSYFEGSKNSRKDLGARFKELRFYARFLLVSLFLNRTDTVQVLAERLKALVDDSKATFRATDFKEWRLVVQEIFCFMNVATASMNVRPLRYSA

Query:  AFDSHPSSLPFVARFHAKRVLKFRDAVLTSYHRNEVKFAEITLDTYRMLQCLEWEPGFFYQKHPVEPNQNGAAIDHSGASGIIDINLATDITDPSLPPNP
                                           VKFAEITLDT+RMLQCLEWEPGFFYQKHPVEPN+NGA ID+SGASGIIDINLATD++DPSLPPNP
Subjt:  AFDSHPSSLPFVARFHAKRVLKFRDAVLTSYHRNEVKFAEITLDTYRMLQCLEWEPGFFYQKHPVEPNQNGAAIDHSGASGIIDINLATDITDPSLPPNP

Query:  KKAILYRPSVTHLIAPARGNINSSFSELSTSICHHFWKPQFFSIAIHMLTKAFTVNSNAYHHPLLRTRICPNIFSVAIVTSGTNVYYGKERIKVEKVMAT
        KKAILYRPSVTHLIA                                                                                 VMAT
Subjt:  KKAILYRPSVTHLIAPARGNINSSFSELSTSICHHFWKPQFFSIAIHMLTKAFTVNSNAYHHPLLRTRICPNIFSVAIVTSGTNVYYGKERIKVEKVMAT

Query:  ICEELHPDSIMLIYLSATGKCCQNSVNQMASYGESRKSLKNKVIAQNSRENCNALPESCKSEKPGSSDLYDEYLWFGHRGNGGPNVLYPGDIIPFTRRPV
        +CEEL PDSIMLIYLSA GKCCQNSVNQM SYGESRKS+++KVI QNSRENCN+LPESCKSEK GSSDLYDEYLWFGHR NGGPNVLYPGDIIPFTRRPV
Subjt:  ICEELHPDSIMLIYLSATGKCCQNSVNQMASYGESRKSLKNKVIAQNSRENCNALPESCKSEKPGSSDLYDEYLWFGHRGNGGPNVLYPGDIIPFTRRPV

Query:  FLIVDSNNSHAFK----AERGETAAVLLSPLRPAFKNPLNVDTIQSGSQFTFFLTAPLPAFCEMVGLSSANLDIDVYNDAETIISSAFSDWEIILCTSTS
        FLIVDSNNSHAFK    AERGETAA+LLSPLRPAFKNPLNVDTIQSGSQFTFFLTAPLPAF EMVGL SAN+D DVYNDAETI+SSA S+WE +LCTSTS
Subjt:  FLIVDSNNSHAFK----AERGETAAVLLSPLRPAFKNPLNVDTIQSGSQFTFFLTAPLPAFCEMVGLSSANLDIDVYNDAETIISSAFSDWEIILCTSTS

Query:  LNIVWAQVLSDHFLRRLILRFIFCRSVLSFFNTKEDDDLPVCLPCLPDSVSSNSGVVSSAIRRLAKHLN------------------MARTKQTARKSTG
        LNIVWAQVLSD+FLRRLILRFIFCRSVLSFF+TKEDDDLP+CLPCLPDSV+SNSGVV SAIRRLAKHLN                  MARTKQTARKSTG
Subjt:  LNIVWAQVLSDHFLRRLILRFIFCRSVLSFFNTKEDDDLPVCLPCLPDSVSSNSGVVSSAIRRLAKHLN------------------MARTKQTARKSTG

Query:  GKAPRKQLATKAARKSAPTTGGVKKPHRYRPGTVALREIRKYQKSTELLIRKLPFQRLVREIAQDFKTDLRFQSHAVLALQEAAEAYLVGLFEDTNLCAI
        GKAPRKQLATKAARKSAPTTGGVKKPHRYRPGTVALREIRKYQKSTELLIRKLPFQRLVREIAQDFKTDLRFQSHAVLALQEAAEAYLVGLFEDTNLCAI
Subjt:  GKAPRKQLATKAARKSAPTTGGVKKPHRYRPGTVALREIRKYQKSTELLIRKLPFQRLVREIAQDFKTDLRFQSHAVLALQEAAEAYLVGLFEDTNLCAI

Query:  HAKRVTIMPKDIQLARRIR
        HAKRVTIMPKDIQLARRIR
Subjt:  HAKRVTIMPKDIQLARRIR

TYK16871.1 protein SCAI [Cucumis melo var. makuwa]0.0e+0084.26Show/hide
Query:  MTDNDSEAKSFRAMVESANRKFARVQDVPAYGRVDNHHYFHKVFKAYMRLWKYQQEFRAKLVESGLNRWEIGEIASRIGQLYFGHYMRTSEARFLIEAYV
        MTDNDSEAK+FRAMVE+ANRKFARVQDVPAYGRVDNHHYFHKVFKAYMRLWKYQQEFRAKLVESGLNRWEIGEIASRIGQLYFGHYMRTSEARFLIEAYV
Subjt:  MTDNDSEAKSFRAMVESANRKFARVQDVPAYGRVDNHHYFHKVFKAYMRLWKYQQEFRAKLVESGLNRWEIGEIASRIGQLYFGHYMRTSEARFLIEAYV

Query:  FYEAILNRSYFEGSKNSRKDLGARFKELRFYARFLLVSLFLNRTDTVQVLAERLKALVDDSKATFRATDFKEWRLVVQEIFCFMNVATASMNVRPLRYSA
        FYEAILNRSYFEGSKNSRKDLGARFKELRFYARFLLVSL LNRTDTVQVLAERLKALVDDSKATFRATDFKEWRLVVQEIFCFMNVATAS NVRPLRYS 
Subjt:  FYEAILNRSYFEGSKNSRKDLGARFKELRFYARFLLVSLFLNRTDTVQVLAERLKALVDDSKATFRATDFKEWRLVVQEIFCFMNVATASMNVRPLRYSA

Query:  AFDSHPSSLPFVARFHAKRVLKFRDAVLTSYHRNEVKFAEITLDTYRMLQCLEWEPGFFYQKHPVEPNQNGAAIDHSGASGIIDINLATDITDPSLPPNP
        AFDSHP SLPFV RFHAKRVLKFRDAVLTSYHRNEVKFAEITLDTYRMLQCLEWEPGFFYQKHPVEPN+NGA IDHSGASGIIDINLATD+TDPSLPPNP
Subjt:  AFDSHPSSLPFVARFHAKRVLKFRDAVLTSYHRNEVKFAEITLDTYRMLQCLEWEPGFFYQKHPVEPNQNGAAIDHSGASGIIDINLATDITDPSLPPNP

Query:  KKAILYRPSVTHLIAPARGNINSSFSELSTSICHHFWKPQFFSIAIHMLTKAFTVNSNAYHHPLLRTRICPNIFSVAIVTSGTNVYYGKERIKVEKVMAT
        KKAILYRPSVTHLIA                                                                                 VMAT
Subjt:  KKAILYRPSVTHLIAPARGNINSSFSELSTSICHHFWKPQFFSIAIHMLTKAFTVNSNAYHHPLLRTRICPNIFSVAIVTSGTNVYYGKERIKVEKVMAT

Query:  ICEELHPDSIMLIYLSATGKCCQNSVNQMASYGESRKSLKNKVIAQNSRENCNALPESCKSEKPGSSDLYDEYLWFGHRGNGGPNVLYPGDIIPFTRRPV
        +CEEL PDSIMLIYLSA GKCCQNSVNQM S+GESRKSLKNKV AQNSRENCNAL ESCK EKPGSSDLYDEYLWFGHRGNGGPNVLYPGDIIPFTRRPV
Subjt:  ICEELHPDSIMLIYLSATGKCCQNSVNQMASYGESRKSLKNKVIAQNSRENCNALPESCKSEKPGSSDLYDEYLWFGHRGNGGPNVLYPGDIIPFTRRPV

Query:  FLIVDSNNSHAFKAERGETAAVLLSPLRPAFKNPLNVDTIQSGSQFTFFLTAPLPAFCEMVGLSSANLDIDVYNDAETIISSAFSDWEIILCTSTSLNIV
        FLIVDSNNSHAFKAERGETAA+LLSPLRPAFKNPLNVDTIQSGSQFTFFLTAPLPAFCEMVGLSSANLDIDVYNDA+TI+SSAFSDWEIILCTSTSLNIV
Subjt:  FLIVDSNNSHAFKAERGETAAVLLSPLRPAFKNPLNVDTIQSGSQFTFFLTAPLPAFCEMVGLSSANLDIDVYNDAETIISSAFSDWEIILCTSTSLNIV

Query:  WAQVLSDHFLRRLILRFIFCRSVLSFFNTKEDDDLPVCLPCLPDSVSSNSGVVSSAIRRLAKHLNMA
        WAQVLSDHFLRRLILRFIFCR+VLSFFNTKEDDDLP CLPCLPDSVSSNSGVVSSAIRRLAKHLN+A
Subjt:  WAQVLSDHFLRRLILRFIFCRSVLSFFNTKEDDDLPVCLPCLPDSVSSNSGVVSSAIRRLAKHLNMA

XP_008453803.1 PREDICTED: protein SCAI [Cucumis melo]0.0e+0083.76Show/hide
Query:  MTDNDSEAKSFRAMVESANRKFARVQDVPAYGRVDNHHYFHKVFKAYMRLWKYQQEFRAKLVESGLNRWEIGEIASRIGQLYFGHYMRTSEARFLIEAYV
        MTDNDSEAK+FRAMVE+ANRKFARVQDVPAYGRVDNHHYFHKVFKAYMRLWKYQQEFRAKLVESGLNRWEIGEIASRIGQLYFGHYMRTSEARFLIEAYV
Subjt:  MTDNDSEAKSFRAMVESANRKFARVQDVPAYGRVDNHHYFHKVFKAYMRLWKYQQEFRAKLVESGLNRWEIGEIASRIGQLYFGHYMRTSEARFLIEAYV

Query:  FYEAILNRSYFEGSKNSRKDLGARFKELRFYARFLLVSLFLNRTDTVQVLAERLKALVDDSKATFRATDFKEWRLVVQEIFCFMNVATASMNVRPLRYSA
        FYEAILNRSYFEGSKNSRKDLGARFKELRFYARFLLVSL LNRTDTVQVLAERLKALVDDSKATFRATDFKEWRLVVQEIFCFMNVATAS NVRPLRYS 
Subjt:  FYEAILNRSYFEGSKNSRKDLGARFKELRFYARFLLVSLFLNRTDTVQVLAERLKALVDDSKATFRATDFKEWRLVVQEIFCFMNVATASMNVRPLRYSA

Query:  AFDSHPSSLPFVARFHAKRVLKFRDAVLTSYHRNEVKFAEITLDTYRMLQCLEWEPGFFYQKHPVEPNQNGAAIDHSGASGIIDINLATDITDPSLPPNP
        AFDSHP SLPFV RFHAKRVLKFRDAVLTSYHRNEVKFAEITLDTYRMLQCLEWEPGFFYQKHPVEPN+NGA IDHSGASGIIDINLATD+TDPSLPPNP
Subjt:  AFDSHPSSLPFVARFHAKRVLKFRDAVLTSYHRNEVKFAEITLDTYRMLQCLEWEPGFFYQKHPVEPNQNGAAIDHSGASGIIDINLATDITDPSLPPNP

Query:  KKAILYRPSVTHLIAPARGNINSSFSELSTSICHHFWKPQFFSIAIHMLTKAFTVNSNAYHHPLLRTRICPNIFSVAIVTSGTNVYYGKERIKVEKVMAT
        KKAILYRPSVTHLIA                                                                                 VMAT
Subjt:  KKAILYRPSVTHLIAPARGNINSSFSELSTSICHHFWKPQFFSIAIHMLTKAFTVNSNAYHHPLLRTRICPNIFSVAIVTSGTNVYYGKERIKVEKVMAT

Query:  ICEELHPDSIMLIYLSATGKCCQNSVNQMASYGESRKSLKNKVIAQNSRENCNALPESCKSEKPGSSDLYDEYLWFGHRGNGGPNVLYPGDIIPFTRRPV
        +CEEL PDSIMLIYLSA GKCCQNSVNQM S+GESRKSLKNKV AQNSRENCNAL ESCK EKPGSSDLYDEYLWFGHRGNGGPNVLYPGDIIPFTRRPV
Subjt:  ICEELHPDSIMLIYLSATGKCCQNSVNQMASYGESRKSLKNKVIAQNSRENCNALPESCKSEKPGSSDLYDEYLWFGHRGNGGPNVLYPGDIIPFTRRPV

Query:  FLIVDSNNSHAFK----AERGETAAVLLSPLRPAFKNPLNVDTIQSGSQFTFFLTAPLPAFCEMVGLSSANLDIDVYNDAETIISSAFSDWEIILCTSTS
        FLIVDSNNSHAFK    AERGETAA+LLSPLRPAFKNPLNVDTIQSGSQFTFFLTAPLPAFCEMVGLSSANLDIDVYNDA+TI+SSAFSDWEIILCTSTS
Subjt:  FLIVDSNNSHAFK----AERGETAAVLLSPLRPAFKNPLNVDTIQSGSQFTFFLTAPLPAFCEMVGLSSANLDIDVYNDAETIISSAFSDWEIILCTSTS

Query:  LNIVWAQVLSDHFLRRLILRFIFCRSVLSFFNTKEDDDLPVCLPCLPDSVSSNSGVVSSAIRRLAKHLNMA
        LNIVWAQVLSDHFLRRLILRFIFCR+VLSFFNTKEDDDLP CLPCLPDSVSSNSGVVSSAIRRLAKHLN+A
Subjt:  LNIVWAQVLSDHFLRRLILRFIFCRSVLSFFNTKEDDDLPVCLPCLPDSVSSNSGVVSSAIRRLAKHLNMA

XP_038877443.1 protein SCAI [Benincasa hispida]0.0e+0084.8Show/hide
Query:  MTDNDSEAKSFRAMVESANRKFARVQDVPAYGRVDNHHYFHKVFKAYMRLWKYQQEFRAKLVESGLNRWEIGEIASRIGQLYFGHYMRTSEARFLIEAYV
        MTDNDSEAKSFRAMVESANRKFARVQDVPAYGRVDNHHYFHKVFKAYMRLWKYQQEFRAKLVESGLNRWEIGEIASRIGQLYFGHYMRTSEARFLIEAYV
Subjt:  MTDNDSEAKSFRAMVESANRKFARVQDVPAYGRVDNHHYFHKVFKAYMRLWKYQQEFRAKLVESGLNRWEIGEIASRIGQLYFGHYMRTSEARFLIEAYV

Query:  FYEAILNRSYFEGSKNSRKDLGARFKELRFYARFLLVSLFLNRTDTVQVLAERLKALVDDSKATFRATDFKEWRLVVQEIFCFMNVATASMNVRPLRYSA
        FYEAILNRSYFEGSKNSRKDLGARFKELRFYARFLLVSLFLNRTDTVQVLAERLKALVDDSKATFRATDFKEWRLVVQEIFCFMNVATAS NVRPLRYSA
Subjt:  FYEAILNRSYFEGSKNSRKDLGARFKELRFYARFLLVSLFLNRTDTVQVLAERLKALVDDSKATFRATDFKEWRLVVQEIFCFMNVATASMNVRPLRYSA

Query:  AFDSHPSSLPFVARFHAKRVLKFRDAVLTSYHRNEVKFAEITLDTYRMLQCLEWEPGFFYQKHPVEPNQNGAAIDHSGASGIIDINLATDITDPSLPPNP
        AFDSHPSSLPFVARFHAKRVLKFRDAVLTSYHRNEVKFAEITLDTYRMLQC EWEPGFFYQKHPVEPN+NGAAIDHSGASGIIDINLATD TD SLPPNP
Subjt:  AFDSHPSSLPFVARFHAKRVLKFRDAVLTSYHRNEVKFAEITLDTYRMLQCLEWEPGFFYQKHPVEPNQNGAAIDHSGASGIIDINLATDITDPSLPPNP

Query:  KKAILYRPSVTHLIAPARGNINSSFSELSTSICHHFWKPQFFSIAIHMLTKAFTVNSNAYHHPLLRTRICPNIFSVAIVTSGTNVYYGKERIKVEKVMAT
        KKAILYRPSVTHLIA                                                                                 VMAT
Subjt:  KKAILYRPSVTHLIAPARGNINSSFSELSTSICHHFWKPQFFSIAIHMLTKAFTVNSNAYHHPLLRTRICPNIFSVAIVTSGTNVYYGKERIKVEKVMAT

Query:  ICEELHPDSIMLIYLSATGKCCQNSVNQMASYGESRKSLKNKVIAQNSRENCNALPESCKSEKPGSSDLYDEYLWFGHRGNGGPNVLYPGDIIPFTRRPV
        ICEEL PDSIMLIYLSA GKCCQNSVNQMASYGESRKSLKNKV  QNSRENCNALPESCKSEKPGSSDLYDEYLWFGHRGNGGPNVLYPGDIIPFTRRPV
Subjt:  ICEELHPDSIMLIYLSATGKCCQNSVNQMASYGESRKSLKNKVIAQNSRENCNALPESCKSEKPGSSDLYDEYLWFGHRGNGGPNVLYPGDIIPFTRRPV

Query:  FLIVDSNNSHAFK----AERGETAAVLLSPLRPAFKNPLNVDTIQSGSQFTFFLTAPLPAFCEMVGLSSANLDIDVYNDAETIISSAFSDWEIILCTSTS
        FLIVDSNNSHAFK    AERGETAA+LLSPLRPAFKNPLNVDT QSGSQFTFFLTAPLPAFCE+VGL SANLDIDVYNDAETIISSAFSDWEIILCTS S
Subjt:  FLIVDSNNSHAFK----AERGETAAVLLSPLRPAFKNPLNVDTIQSGSQFTFFLTAPLPAFCEMVGLSSANLDIDVYNDAETIISSAFSDWEIILCTSTS

Query:  LNIVWAQVLSDHFLRRLILRFIFCRSVLSFFNTKEDDDLPVCLPCLPDSVSSNSGVVSSAIRRLAKHLNMA
        LNIVWAQVLSDHFLRRLILRFIF RSVLSFFNTKEDDDLPVCLPCLPDS+SSNSGVVSSAIRRLAKHLN+A
Subjt:  LNIVWAQVLSDHFLRRLILRFIFCRSVLSFFNTKEDDDLPVCLPCLPDSVSSNSGVVSSAIRRLAKHLNMA

TrEMBL top hitse value%identityAlignment
A0A0A0KU99 Uncharacterized protein0.0e+0083.61Show/hide
Query:  MTDNDSEAKSFRAMVESANRKFARVQDVPAYGRVDNHHYFHKVFKAYMRLWKYQQEFRAKLVESGLNRWEIGEIASRIGQLYFGHYMRTSEARFLIEAYV
        MTD+D EAK+FRAMVE+ANRKFARVQDVPAYGRVDNHHYFHKVFKAYMRLWKYQQEFRAKLVESGLNRWEIGEIASRIGQLYFGHYMRTSEARFLIEAYV
Subjt:  MTDNDSEAKSFRAMVESANRKFARVQDVPAYGRVDNHHYFHKVFKAYMRLWKYQQEFRAKLVESGLNRWEIGEIASRIGQLYFGHYMRTSEARFLIEAYV

Query:  FYEAILNRSYFEGSKNSRKDLGARFKELRFYARFLLVSLFLNRTDTVQVLAERLKALVDDSKATFRATDFKEWRLVVQEIFCFMNVATASMNVRPLRYSA
        FYEAILNRSYFEGSKNSRKDLGARFKELRFYARFLLVSL LNRTDTVQVLAERLKALVDDSKATFRATDFKEWRLVVQEIFCFMN+ATAS NVRPLRYS 
Subjt:  FYEAILNRSYFEGSKNSRKDLGARFKELRFYARFLLVSLFLNRTDTVQVLAERLKALVDDSKATFRATDFKEWRLVVQEIFCFMNVATASMNVRPLRYSA

Query:  AFDSHPSSLPFVARFHAKRVLKFRDAVLTSYHRNEVKFAEITLDTYRMLQCLEWEPGFFYQKHPVEPNQNGAAIDHSGASGIIDINLATDITDPSLPPNP
        AFDSHP SLPFV RFHAKRVLKFRDAVLTSYHRNEVKFAEITLDTYRMLQCLEWEPGFFYQKHPVEPN+NGA IDHSGASGIIDINLATD+TDPSLPPNP
Subjt:  AFDSHPSSLPFVARFHAKRVLKFRDAVLTSYHRNEVKFAEITLDTYRMLQCLEWEPGFFYQKHPVEPNQNGAAIDHSGASGIIDINLATDITDPSLPPNP

Query:  KKAILYRPSVTHLIAPARGNINSSFSELSTSICHHFWKPQFFSIAIHMLTKAFTVNSNAYHHPLLRTRICPNIFSVAIVTSGTNVYYGKERIKVEKVMAT
        KKAIL+RPSVTHLIA                                                                                 VMAT
Subjt:  KKAILYRPSVTHLIAPARGNINSSFSELSTSICHHFWKPQFFSIAIHMLTKAFTVNSNAYHHPLLRTRICPNIFSVAIVTSGTNVYYGKERIKVEKVMAT

Query:  ICEELHPDSIMLIYLSATGKCCQNSVNQMASYGESRKSLKNKVIAQNSRENCNALPESCKSEKPGSSDLYDEYLWFGHRGNGGPNVLYPGDIIPFTRRPV
        +CEEL PDSIMLIYLSA GKCCQNSVNQMAS GESRKSLKNKV AQNSRENCNAL ESCKSEKPGSSDLYDEYLWFGHRG+GGPNVLYPGDIIPFTRRPV
Subjt:  ICEELHPDSIMLIYLSATGKCCQNSVNQMASYGESRKSLKNKVIAQNSRENCNALPESCKSEKPGSSDLYDEYLWFGHRGNGGPNVLYPGDIIPFTRRPV

Query:  FLIVDSNNSHAFK----AERGETAAVLLSPLRPAFKNPLNVDTIQSGSQFTFFLTAPLPAFCEMVGLSSANLDIDVYNDAETIISSAFSDWEIILCTSTS
        FLIVDSNNSHAFK    AERGETAA+LLSPLRPAFKNPLNVDTIQSGSQFTFFLTAPLPAFCEMVGLSSANLDIDVYNDA+TI+SSAFSDWEIILCTSTS
Subjt:  FLIVDSNNSHAFK----AERGETAAVLLSPLRPAFKNPLNVDTIQSGSQFTFFLTAPLPAFCEMVGLSSANLDIDVYNDAETIISSAFSDWEIILCTSTS

Query:  LNIVWAQVLSDHFLRRLILRFIFCRSVLSFFNTKEDDDLPVCLPCLPDSVSSNSGVVSSAIRRLAKHLNMA
        LNIVWAQVLSDHFLRRLILRFIFCRSVLSFFNTKEDDDLPVCLPCLPDSVSSNSGVVSSAIRRLAKHLN+A
Subjt:  LNIVWAQVLSDHFLRRLILRFIFCRSVLSFFNTKEDDDLPVCLPCLPDSVSSNSGVVSSAIRRLAKHLNMA

A0A1S3BX87 protein SCAI0.0e+0083.76Show/hide
Query:  MTDNDSEAKSFRAMVESANRKFARVQDVPAYGRVDNHHYFHKVFKAYMRLWKYQQEFRAKLVESGLNRWEIGEIASRIGQLYFGHYMRTSEARFLIEAYV
        MTDNDSEAK+FRAMVE+ANRKFARVQDVPAYGRVDNHHYFHKVFKAYMRLWKYQQEFRAKLVESGLNRWEIGEIASRIGQLYFGHYMRTSEARFLIEAYV
Subjt:  MTDNDSEAKSFRAMVESANRKFARVQDVPAYGRVDNHHYFHKVFKAYMRLWKYQQEFRAKLVESGLNRWEIGEIASRIGQLYFGHYMRTSEARFLIEAYV

Query:  FYEAILNRSYFEGSKNSRKDLGARFKELRFYARFLLVSLFLNRTDTVQVLAERLKALVDDSKATFRATDFKEWRLVVQEIFCFMNVATASMNVRPLRYSA
        FYEAILNRSYFEGSKNSRKDLGARFKELRFYARFLLVSL LNRTDTVQVLAERLKALVDDSKATFRATDFKEWRLVVQEIFCFMNVATAS NVRPLRYS 
Subjt:  FYEAILNRSYFEGSKNSRKDLGARFKELRFYARFLLVSLFLNRTDTVQVLAERLKALVDDSKATFRATDFKEWRLVVQEIFCFMNVATASMNVRPLRYSA

Query:  AFDSHPSSLPFVARFHAKRVLKFRDAVLTSYHRNEVKFAEITLDTYRMLQCLEWEPGFFYQKHPVEPNQNGAAIDHSGASGIIDINLATDITDPSLPPNP
        AFDSHP SLPFV RFHAKRVLKFRDAVLTSYHRNEVKFAEITLDTYRMLQCLEWEPGFFYQKHPVEPN+NGA IDHSGASGIIDINLATD+TDPSLPPNP
Subjt:  AFDSHPSSLPFVARFHAKRVLKFRDAVLTSYHRNEVKFAEITLDTYRMLQCLEWEPGFFYQKHPVEPNQNGAAIDHSGASGIIDINLATDITDPSLPPNP

Query:  KKAILYRPSVTHLIAPARGNINSSFSELSTSICHHFWKPQFFSIAIHMLTKAFTVNSNAYHHPLLRTRICPNIFSVAIVTSGTNVYYGKERIKVEKVMAT
        KKAILYRPSVTHLIA                                                                                 VMAT
Subjt:  KKAILYRPSVTHLIAPARGNINSSFSELSTSICHHFWKPQFFSIAIHMLTKAFTVNSNAYHHPLLRTRICPNIFSVAIVTSGTNVYYGKERIKVEKVMAT

Query:  ICEELHPDSIMLIYLSATGKCCQNSVNQMASYGESRKSLKNKVIAQNSRENCNALPESCKSEKPGSSDLYDEYLWFGHRGNGGPNVLYPGDIIPFTRRPV
        +CEEL PDSIMLIYLSA GKCCQNSVNQM S+GESRKSLKNKV AQNSRENCNAL ESCK EKPGSSDLYDEYLWFGHRGNGGPNVLYPGDIIPFTRRPV
Subjt:  ICEELHPDSIMLIYLSATGKCCQNSVNQMASYGESRKSLKNKVIAQNSRENCNALPESCKSEKPGSSDLYDEYLWFGHRGNGGPNVLYPGDIIPFTRRPV

Query:  FLIVDSNNSHAFK----AERGETAAVLLSPLRPAFKNPLNVDTIQSGSQFTFFLTAPLPAFCEMVGLSSANLDIDVYNDAETIISSAFSDWEIILCTSTS
        FLIVDSNNSHAFK    AERGETAA+LLSPLRPAFKNPLNVDTIQSGSQFTFFLTAPLPAFCEMVGLSSANLDIDVYNDA+TI+SSAFSDWEIILCTSTS
Subjt:  FLIVDSNNSHAFK----AERGETAAVLLSPLRPAFKNPLNVDTIQSGSQFTFFLTAPLPAFCEMVGLSSANLDIDVYNDAETIISSAFSDWEIILCTSTS

Query:  LNIVWAQVLSDHFLRRLILRFIFCRSVLSFFNTKEDDDLPVCLPCLPDSVSSNSGVVSSAIRRLAKHLNMA
        LNIVWAQVLSDHFLRRLILRFIFCR+VLSFFNTKEDDDLP CLPCLPDSVSSNSGVVSSAIRRLAKHLN+A
Subjt:  LNIVWAQVLSDHFLRRLILRFIFCRSVLSFFNTKEDDDLPVCLPCLPDSVSSNSGVVSSAIRRLAKHLNMA

A0A5A7TRF5 Protein SCAI0.0e+0082.47Show/hide
Query:  MTDNDSEAKSFRAMVESANRKFARVQDVPAYGRVDNHHYFHKVFKAYMRLWKYQQEFRAKLVESGLNRWEIGEIASRIGQLYFGHYMRTSEARFLIEAYV
        MTDNDSEAK+FRAMVE+ANRKFARVQDVPAYGRVDNHHYFHKVFKAYMRLWKYQQEFRAKLVESGLNRWEIGEIASRIGQLYFGHYMRTSEARFLIEAYV
Subjt:  MTDNDSEAKSFRAMVESANRKFARVQDVPAYGRVDNHHYFHKVFKAYMRLWKYQQEFRAKLVESGLNRWEIGEIASRIGQLYFGHYMRTSEARFLIEAYV

Query:  FYEAILNRSYFEGSKNSRKDLGARFKELRFYARFLLVSLFLNRTDTVQVLAERLKALVDDSKATFRATDFKEWRLVVQEIFCFMNVATASMNVRPLRYSA
        FYEAILNRSYFEGSKNSRKDLGARFKELRFYARFLLVSL LNRTDTVQVLAERLKALVDDSKATFRATDFKEWRLVVQEIFCFMNVATAS NVRPLRYS 
Subjt:  FYEAILNRSYFEGSKNSRKDLGARFKELRFYARFLLVSLFLNRTDTVQVLAERLKALVDDSKATFRATDFKEWRLVVQEIFCFMNVATASMNVRPLRYSA

Query:  AFDSHPSSLPFVARFHAKRVLKFRDAVLTSYHRNEVKFAEITLDTYRMLQCLEWEPGFFYQKHPVEPNQNGAAIDHSGASGIIDINLATDITDPSLPPNP
        AFDSHP SLPFV RFHAKRVLKFRDAVLTSYHRNEVKFAEITLDTYRMLQCLEWEPGFFYQKHPVEPN+NGA IDHSGASGIIDINLATD+TDPSLPPNP
Subjt:  AFDSHPSSLPFVARFHAKRVLKFRDAVLTSYHRNEVKFAEITLDTYRMLQCLEWEPGFFYQKHPVEPNQNGAAIDHSGASGIIDINLATDITDPSLPPNP

Query:  KKAILYRPSVTHLIAPARGNINSSFSELSTSICHHFWKPQFFSIAIHMLTKAFTVNSNAYHHPLLRTRICPNIFSVAIVTSGTNVYYGKERIKVEKVMAT
        KKAILYRPSVTHLIA                                                                                 VMAT
Subjt:  KKAILYRPSVTHLIAPARGNINSSFSELSTSICHHFWKPQFFSIAIHMLTKAFTVNSNAYHHPLLRTRICPNIFSVAIVTSGTNVYYGKERIKVEKVMAT

Query:  ICEELHPDSIMLIYLSATGKCCQNSVNQMASYGESRKSLKNKVIAQNSRENCNALPESCKSEKPGSSDLYDEYLWFGHRGNGGPNVLYPGDIIPFTRRPV
        +CEEL PDSIMLIYLSA GKCCQNSVNQM S+GESRKSLKNKV AQNSRENCNA  ESCK EKPGSSDLYDEYLWFGHRGNGGPNVLYPGDIIPFTRRPV
Subjt:  ICEELHPDSIMLIYLSATGKCCQNSVNQMASYGESRKSLKNKVIAQNSRENCNALPESCKSEKPGSSDLYDEYLWFGHRGNGGPNVLYPGDIIPFTRRPV

Query:  FLIVDSNNSHAFK------------AERGETAAVLLSPLRPAFKNPLNVDTIQSGSQFTFFLTAPLPAFCEMVGLSSANLDIDVYNDAETIISSAFSDWE
        FLIVDSNNSHAFK            AERGETAA+LLSPLRPAFKNPLNVDTIQSGSQFTFFLTAPLPAFCEMVGLSSANLDIDVYNDA+TI+SSAFSDWE
Subjt:  FLIVDSNNSHAFK------------AERGETAAVLLSPLRPAFKNPLNVDTIQSGSQFTFFLTAPLPAFCEMVGLSSANLDIDVYNDAETIISSAFSDWE

Query:  IILCTSTSLNIVWAQVLSDHFLRRLILRFIFCRSVLSFFNTKEDDDLPVCLPCLPDSVSSNSGVVSSAIRRLAKHLNMA
        IILCTSTSLNIVWAQVLSDHFLRRLILRFIFCR+VLSFFN KEDDDLP CLPCLPDSVSSNSGVVSSAIRRLAKHLN+A
Subjt:  IILCTSTSLNIVWAQVLSDHFLRRLILRFIFCRSVLSFFNTKEDDDLPVCLPCLPDSVSSNSGVVSSAIRRLAKHLNMA

A0A5D3CY13 Protein SCAI0.0e+0084.26Show/hide
Query:  MTDNDSEAKSFRAMVESANRKFARVQDVPAYGRVDNHHYFHKVFKAYMRLWKYQQEFRAKLVESGLNRWEIGEIASRIGQLYFGHYMRTSEARFLIEAYV
        MTDNDSEAK+FRAMVE+ANRKFARVQDVPAYGRVDNHHYFHKVFKAYMRLWKYQQEFRAKLVESGLNRWEIGEIASRIGQLYFGHYMRTSEARFLIEAYV
Subjt:  MTDNDSEAKSFRAMVESANRKFARVQDVPAYGRVDNHHYFHKVFKAYMRLWKYQQEFRAKLVESGLNRWEIGEIASRIGQLYFGHYMRTSEARFLIEAYV

Query:  FYEAILNRSYFEGSKNSRKDLGARFKELRFYARFLLVSLFLNRTDTVQVLAERLKALVDDSKATFRATDFKEWRLVVQEIFCFMNVATASMNVRPLRYSA
        FYEAILNRSYFEGSKNSRKDLGARFKELRFYARFLLVSL LNRTDTVQVLAERLKALVDDSKATFRATDFKEWRLVVQEIFCFMNVATAS NVRPLRYS 
Subjt:  FYEAILNRSYFEGSKNSRKDLGARFKELRFYARFLLVSLFLNRTDTVQVLAERLKALVDDSKATFRATDFKEWRLVVQEIFCFMNVATASMNVRPLRYSA

Query:  AFDSHPSSLPFVARFHAKRVLKFRDAVLTSYHRNEVKFAEITLDTYRMLQCLEWEPGFFYQKHPVEPNQNGAAIDHSGASGIIDINLATDITDPSLPPNP
        AFDSHP SLPFV RFHAKRVLKFRDAVLTSYHRNEVKFAEITLDTYRMLQCLEWEPGFFYQKHPVEPN+NGA IDHSGASGIIDINLATD+TDPSLPPNP
Subjt:  AFDSHPSSLPFVARFHAKRVLKFRDAVLTSYHRNEVKFAEITLDTYRMLQCLEWEPGFFYQKHPVEPNQNGAAIDHSGASGIIDINLATDITDPSLPPNP

Query:  KKAILYRPSVTHLIAPARGNINSSFSELSTSICHHFWKPQFFSIAIHMLTKAFTVNSNAYHHPLLRTRICPNIFSVAIVTSGTNVYYGKERIKVEKVMAT
        KKAILYRPSVTHLIA                                                                                 VMAT
Subjt:  KKAILYRPSVTHLIAPARGNINSSFSELSTSICHHFWKPQFFSIAIHMLTKAFTVNSNAYHHPLLRTRICPNIFSVAIVTSGTNVYYGKERIKVEKVMAT

Query:  ICEELHPDSIMLIYLSATGKCCQNSVNQMASYGESRKSLKNKVIAQNSRENCNALPESCKSEKPGSSDLYDEYLWFGHRGNGGPNVLYPGDIIPFTRRPV
        +CEEL PDSIMLIYLSA GKCCQNSVNQM S+GESRKSLKNKV AQNSRENCNAL ESCK EKPGSSDLYDEYLWFGHRGNGGPNVLYPGDIIPFTRRPV
Subjt:  ICEELHPDSIMLIYLSATGKCCQNSVNQMASYGESRKSLKNKVIAQNSRENCNALPESCKSEKPGSSDLYDEYLWFGHRGNGGPNVLYPGDIIPFTRRPV

Query:  FLIVDSNNSHAFKAERGETAAVLLSPLRPAFKNPLNVDTIQSGSQFTFFLTAPLPAFCEMVGLSSANLDIDVYNDAETIISSAFSDWEIILCTSTSLNIV
        FLIVDSNNSHAFKAERGETAA+LLSPLRPAFKNPLNVDTIQSGSQFTFFLTAPLPAFCEMVGLSSANLDIDVYNDA+TI+SSAFSDWEIILCTSTSLNIV
Subjt:  FLIVDSNNSHAFKAERGETAAVLLSPLRPAFKNPLNVDTIQSGSQFTFFLTAPLPAFCEMVGLSSANLDIDVYNDAETIISSAFSDWEIILCTSTSLNIV

Query:  WAQVLSDHFLRRLILRFIFCRSVLSFFNTKEDDDLPVCLPCLPDSVSSNSGVVSSAIRRLAKHLNMA
        WAQVLSDHFLRRLILRFIFCR+VLSFFNTKEDDDLP CLPCLPDSVSSNSGVVSSAIRRLAKHLN+A
Subjt:  WAQVLSDHFLRRLILRFIFCRSVLSFFNTKEDDDLPVCLPCLPDSVSSNSGVVSSAIRRLAKHLNMA

A0A6J1EMW7 protein SCAI-like0.0e+0082.74Show/hide
Query:  MTDNDSEAKSFRAMVESANRKFARVQDVPAYGRVDNHHYFHKVFKAYMRLWKYQQEFRAKLVESGLNRWEIGEIASRIGQLYFGHYMRTSEARFLIEAYV
        MTDN+S AK+FRA+VESANRKFARVQDVPAYGR+D++HYFHKVFKAYMRLWK+QQE+RAKLVESGLNRWEIGEIASRIGQLYFGHYMRTSEARFLIEAY+
Subjt:  MTDNDSEAKSFRAMVESANRKFARVQDVPAYGRVDNHHYFHKVFKAYMRLWKYQQEFRAKLVESGLNRWEIGEIASRIGQLYFGHYMRTSEARFLIEAYV

Query:  FYEAILNRSYFEGSKNSRKDLGARFKELRFYARFLLVSLFLNRTDTVQVLAERLKALVDDSKATFRATDFKEWRLVVQEIFCFMNVATASMNVRPLRYSA
        FYEAILNRSYFEGSKNSRKDLGARFKELRFYARFL+VSL LNRTDTVQVLAERLKALVDDSKATFRATDFKEWRLVVQEIFCFMNVATASMNVRPLRYS 
Subjt:  FYEAILNRSYFEGSKNSRKDLGARFKELRFYARFLLVSLFLNRTDTVQVLAERLKALVDDSKATFRATDFKEWRLVVQEIFCFMNVATASMNVRPLRYSA

Query:  AFDSHPSSLPFVARFHAKRVLKFRDAVLTSYHRNEVKFAEITLDTYRMLQCLEWEPGFFYQKHPVEPNQNGAAIDHSGASGIIDINLATDITDPSLPPNP
        AFDSHPSSLPFVARFHAKRVLKFRDAVLTSYHRNEVKFAEITLDTYRMLQCLEWEPGFFYQKHPVEPN+NGAAID+SGASGIIDINL+TDITDPSLPPNP
Subjt:  AFDSHPSSLPFVARFHAKRVLKFRDAVLTSYHRNEVKFAEITLDTYRMLQCLEWEPGFFYQKHPVEPNQNGAAIDHSGASGIIDINLATDITDPSLPPNP

Query:  KKAILYRPSVTHLIAPARGNINSSFSELSTSICHHFWKPQFFSIAIHMLTKAFTVNSNAYHHPLLRTRICPNIFSVAIVTSGTNVYYGKERIKVEKVMAT
        KKAILYRPSVTHLIA                                                                                 VMAT
Subjt:  KKAILYRPSVTHLIAPARGNINSSFSELSTSICHHFWKPQFFSIAIHMLTKAFTVNSNAYHHPLLRTRICPNIFSVAIVTSGTNVYYGKERIKVEKVMAT

Query:  ICEELHPDSIMLIYLSATGKCCQNSVNQMASY-GESRKSLKNKVIAQNSRENCNALPESCKSEKPGSSDLYDEYLWFGHRGNGGPNVLYPGDIIPFTRRP
        ICEEL PDSIMLIYLSA GKCCQNSV+Q ASY GESRKSLK+KVIAQNSRENCNALPESCKS+KPGSSDLYDEYLWFGHRGNGGPNVLYPGDIIPFTRRP
Subjt:  ICEELHPDSIMLIYLSATGKCCQNSVNQMASY-GESRKSLKNKVIAQNSRENCNALPESCKSEKPGSSDLYDEYLWFGHRGNGGPNVLYPGDIIPFTRRP

Query:  VFLIVDSNNSHAFK----AERGETAAVLLSPLRPAFKNPLNVDTIQSGSQFTFFLTAPLPAFCEMVGLSSANLDIDVYNDAETIISSAFSDWEIILCTST
        VFLIVDSNNSHAFK    AERGETAA+LLSPLRPAFKNPLNVDTIQSGSQFTFFLTAPLPAFCEMVGLSSANLDIDVYNDAETIISSAFS+WEIILCTST
Subjt:  VFLIVDSNNSHAFK----AERGETAAVLLSPLRPAFKNPLNVDTIQSGSQFTFFLTAPLPAFCEMVGLSSANLDIDVYNDAETIISSAFSDWEIILCTST

Query:  SLNIVWAQVLSDHFLRRLILRFIFCRSVLSFFNTKEDDDLPVCLPCLPDSVSSNSGVVSSAIRRLAKHLNMA
        SLNIVWAQVLSDHFLRRLILRFIFCRSVLSFFNTKEDDDLP+CLPCLPDS+SSN GVV SAIRRLA HLN+A
Subjt:  SLNIVWAQVLSDHFLRRLILRFIFCRSVLSFFNTKEDDDLPVCLPCLPDSVSSNSGVVSSAIRRLAKHLNMA

SwissProt top hitse value%identityAlignment
P59169 Histone H3.37.5e-67100Show/hide
Query:  MARTKQTARKSTGGKAPRKQLATKAARKSAPTTGGVKKPHRYRPGTVALREIRKYQKSTELLIRKLPFQRLVREIAQDFKTDLRFQSHAVLALQEAAEAY
        MARTKQTARKSTGGKAPRKQLATKAARKSAPTTGGVKKPHRYRPGTVALREIRKYQKSTELLIRKLPFQRLVREIAQDFKTDLRFQSHAVLALQEAAEAY
Subjt:  MARTKQTARKSTGGKAPRKQLATKAARKSAPTTGGVKKPHRYRPGTVALREIRKYQKSTELLIRKLPFQRLVREIAQDFKTDLRFQSHAVLALQEAAEAY

Query:  LVGLFEDTNLCAIHAKRVTIMPKDIQLARRIRGERA
        LVGLFEDTNLCAIHAKRVTIMPKDIQLARRIRGERA
Subjt:  LVGLFEDTNLCAIHAKRVTIMPKDIQLARRIRGERA

P69245 Histone H3.37.5e-67100Show/hide
Query:  MARTKQTARKSTGGKAPRKQLATKAARKSAPTTGGVKKPHRYRPGTVALREIRKYQKSTELLIRKLPFQRLVREIAQDFKTDLRFQSHAVLALQEAAEAY
        MARTKQTARKSTGGKAPRKQLATKAARKSAPTTGGVKKPHRYRPGTVALREIRKYQKSTELLIRKLPFQRLVREIAQDFKTDLRFQSHAVLALQEAAEAY
Subjt:  MARTKQTARKSTGGKAPRKQLATKAARKSAPTTGGVKKPHRYRPGTVALREIRKYQKSTELLIRKLPFQRLVREIAQDFKTDLRFQSHAVLALQEAAEAY

Query:  LVGLFEDTNLCAIHAKRVTIMPKDIQLARRIRGERA
        LVGLFEDTNLCAIHAKRVTIMPKDIQLARRIRGERA
Subjt:  LVGLFEDTNLCAIHAKRVTIMPKDIQLARRIRGERA

Q3C2E5 Histone H3.37.5e-67100Show/hide
Query:  MARTKQTARKSTGGKAPRKQLATKAARKSAPTTGGVKKPHRYRPGTVALREIRKYQKSTELLIRKLPFQRLVREIAQDFKTDLRFQSHAVLALQEAAEAY
        MARTKQTARKSTGGKAPRKQLATKAARKSAPTTGGVKKPHRYRPGTVALREIRKYQKSTELLIRKLPFQRLVREIAQDFKTDLRFQSHAVLALQEAAEAY
Subjt:  MARTKQTARKSTGGKAPRKQLATKAARKSAPTTGGVKKPHRYRPGTVALREIRKYQKSTELLIRKLPFQRLVREIAQDFKTDLRFQSHAVLALQEAAEAY

Query:  LVGLFEDTNLCAIHAKRVTIMPKDIQLARRIRGERA
        LVGLFEDTNLCAIHAKRVTIMPKDIQLARRIRGERA
Subjt:  LVGLFEDTNLCAIHAKRVTIMPKDIQLARRIRGERA

Q6RUR1 Histone H3.37.5e-67100Show/hide
Query:  MARTKQTARKSTGGKAPRKQLATKAARKSAPTTGGVKKPHRYRPGTVALREIRKYQKSTELLIRKLPFQRLVREIAQDFKTDLRFQSHAVLALQEAAEAY
        MARTKQTARKSTGGKAPRKQLATKAARKSAPTTGGVKKPHRYRPGTVALREIRKYQKSTELLIRKLPFQRLVREIAQDFKTDLRFQSHAVLALQEAAEAY
Subjt:  MARTKQTARKSTGGKAPRKQLATKAARKSAPTTGGVKKPHRYRPGTVALREIRKYQKSTELLIRKLPFQRLVREIAQDFKTDLRFQSHAVLALQEAAEAY

Query:  LVGLFEDTNLCAIHAKRVTIMPKDIQLARRIRGERA
        LVGLFEDTNLCAIHAKRVTIMPKDIQLARRIRGERA
Subjt:  LVGLFEDTNLCAIHAKRVTIMPKDIQLARRIRGERA

Q71V89 Histone H3.37.5e-67100Show/hide
Query:  MARTKQTARKSTGGKAPRKQLATKAARKSAPTTGGVKKPHRYRPGTVALREIRKYQKSTELLIRKLPFQRLVREIAQDFKTDLRFQSHAVLALQEAAEAY
        MARTKQTARKSTGGKAPRKQLATKAARKSAPTTGGVKKPHRYRPGTVALREIRKYQKSTELLIRKLPFQRLVREIAQDFKTDLRFQSHAVLALQEAAEAY
Subjt:  MARTKQTARKSTGGKAPRKQLATKAARKSAPTTGGVKKPHRYRPGTVALREIRKYQKSTELLIRKLPFQRLVREIAQDFKTDLRFQSHAVLALQEAAEAY

Query:  LVGLFEDTNLCAIHAKRVTIMPKDIQLARRIRGERA
        LVGLFEDTNLCAIHAKRVTIMPKDIQLARRIRGERA
Subjt:  LVGLFEDTNLCAIHAKRVTIMPKDIQLARRIRGERA

Arabidopsis top hitse value%identityAlignment
AT3G03570.1 Protein of unknown function (DUF3550/UPF0682)4.7e-12540.24Show/hide
Query:  AMVESANRKFARVQDVPAYGRVDNHHYFHKVFKAYMRLWKYQQEFRAKLVESGLNRWEIGEIASRIGQLYFGHYMRTSEARFLIEAYVFYEAILNRSYFE
        ++V  A++KF++++D+P Y R    +YF KVFK Y +LWK+QQE R KLVE+GL RWEIGEIASRI QLY+GHYMRTS+A +L E+YVFYEAIL R YF+
Subjt:  AMVESANRKFARVQDVPAYGRVDNHHYFHKVFKAYMRLWKYQQEFRAKLVESGLNRWEIGEIASRIGQLYFGHYMRTSEARFLIEAYVFYEAILNRSYFE

Query:  GSKNSRKDLGARFKELRFYARFLLVSLFLNRTDTVQVLAERLKALVDDSKATFRATDFKEWRLVVQEIFCFMNVATASMNVRPLRYSAAFDSHPSSLPFV
              +DL    K+LRF ARFL+V L L R + V  L ++ K L+D+ K TF+ TDFKEW++V QEI  F+   TA MN+RPLRYS   D +  +    
Subjt:  GSKNSRKDLGARFKELRFYARFLLVSLFLNRTDTVQVLAERLKALVDDSKATFRATDFKEWRLVVQEIFCFMNVATASMNVRPLRYSAAFDSHPSSLPFV

Query:  ARFHAKRVLKFRDAVLTSYHRNEVKFAEITLDTYRMLQCLEWEP-GFFYQKHPVEPNQNGAAIDHSGASGIIDINLATDITDPSLPPNPKKAILYRPSVT
            A R L+  DA+L+SY+ NEVK++E+TLD++RMLQCLEWEP G  YQ         GA +  +   G+  IN +  + DP+LPPNP+KA+LYRPS+T
Subjt:  ARFHAKRVLKFRDAVLTSYHRNEVKFAEITLDTYRMLQCLEWEP-GFFYQKHPVEPNQNGAAIDHSGASGIIDINLATDITDPSLPPNPKKAILYRPSVT

Query:  HLIAPARGNINSSFSELSTSICHHFWKPQFFSIAIHMLTKAFTVNSNAYHHPLLRTRICPNIFSVAIVTSGTNVYYGKERIKVEKVMATICEELHPDSIM
        H +A                                                                                 V+ATICEEL    I+
Subjt:  HLIAPARGNINSSFSELSTSICHHFWKPQFFSIAIHMLTKAFTVNSNAYHHPLLRTRICPNIFSVAIVTSGTNVYYGKERIKVEKVMATICEELHPDSIM

Query:  LIYLSATGKCCQNSVNQMASYGESR------KSLKNKVIAQNSRENCNALPESCKSEKPGSSDLYDE--YLWFGHRGNGGPNVLYPGDIIPFTRRPVFLI
        L+YLSA+GK  Q S + +++   +       +  ++  I Q +  +    P S +S +  S D       L FG  G  G + +YP D++PFTR+P+F+I
Subjt:  LIYLSATGKCCQNSVNQMASYGESR------KSLKNKVIAQNSRENCNALPESCKSEKPGSSDLYDE--YLWFGHRGNGGPNVLYPGDIIPFTRRPVFLI

Query:  VDSNNSHAFK----AERGETAAVLLSPLRPAFKNPLNVDTIQSGSQFTFFLTAPLPAFCEMVGLSSANLDIDVYNDAETIISSAFSDWEIILCTSTSLNI
        +DS++S  FK    AE+GE AA+LLSP         +     SGS FT FLT+P+ AFC +  +S+++++ D++  AE ++SS+ ++W   L TS +L+ 
Subjt:  VDSNNSHAFK----AERGETAAVLLSPLRPAFKNPLNVDTIQSGSQFTFFLTAPLPAFCEMVGLSSANLDIDVYNDAETIISSAFSDWEIILCTSTSLNI

Query:  VWAQVLSDHFLRRLILRFIFCRSVLSF----FNTKEDDDLPVCLPCLPDSVSSNSGVVSSAIRRLA
        VW+Q+L D FLRRL+LRFIFCR+VL+     FN K++   P C P LP+S+   +  V SA+ ++A
Subjt:  VWAQVLSDHFLRRLILRFIFCRSVLSF----FNTKEDDDLPVCLPCLPDSVSSNSGVVSSAIRRLA

AT4G40030.1 Histone superfamily protein5.3e-68100Show/hide
Query:  MARTKQTARKSTGGKAPRKQLATKAARKSAPTTGGVKKPHRYRPGTVALREIRKYQKSTELLIRKLPFQRLVREIAQDFKTDLRFQSHAVLALQEAAEAY
        MARTKQTARKSTGGKAPRKQLATKAARKSAPTTGGVKKPHRYRPGTVALREIRKYQKSTELLIRKLPFQRLVREIAQDFKTDLRFQSHAVLALQEAAEAY
Subjt:  MARTKQTARKSTGGKAPRKQLATKAARKSAPTTGGVKKPHRYRPGTVALREIRKYQKSTELLIRKLPFQRLVREIAQDFKTDLRFQSHAVLALQEAAEAY

Query:  LVGLFEDTNLCAIHAKRVTIMPKDIQLARRIRGERA
        LVGLFEDTNLCAIHAKRVTIMPKDIQLARRIRGERA
Subjt:  LVGLFEDTNLCAIHAKRVTIMPKDIQLARRIRGERA

AT4G40030.2 Histone superfamily protein3.1e-6898.55Show/hide
Query:  LNMARTKQTARKSTGGKAPRKQLATKAARKSAPTTGGVKKPHRYRPGTVALREIRKYQKSTELLIRKLPFQRLVREIAQDFKTDLRFQSHAVLALQEAAE
        + MARTKQTARKSTGGKAPRKQLATKAARKSAPTTGGVKKPHRYRPGTVALREIRKYQKSTELLIRKLPFQRLVREIAQDFKTDLRFQSHAVLALQEAAE
Subjt:  LNMARTKQTARKSTGGKAPRKQLATKAARKSAPTTGGVKKPHRYRPGTVALREIRKYQKSTELLIRKLPFQRLVREIAQDFKTDLRFQSHAVLALQEAAE

Query:  AYLVGLFEDTNLCAIHAKRVTIMPKDIQLARRIRGERA
        AYLVGLFEDTNLCAIHAKRVTIMPKDIQLARRIRGERA
Subjt:  AYLVGLFEDTNLCAIHAKRVTIMPKDIQLARRIRGERA

AT4G40040.1 Histone superfamily protein5.3e-68100Show/hide
Query:  MARTKQTARKSTGGKAPRKQLATKAARKSAPTTGGVKKPHRYRPGTVALREIRKYQKSTELLIRKLPFQRLVREIAQDFKTDLRFQSHAVLALQEAAEAY
        MARTKQTARKSTGGKAPRKQLATKAARKSAPTTGGVKKPHRYRPGTVALREIRKYQKSTELLIRKLPFQRLVREIAQDFKTDLRFQSHAVLALQEAAEAY
Subjt:  MARTKQTARKSTGGKAPRKQLATKAARKSAPTTGGVKKPHRYRPGTVALREIRKYQKSTELLIRKLPFQRLVREIAQDFKTDLRFQSHAVLALQEAAEAY

Query:  LVGLFEDTNLCAIHAKRVTIMPKDIQLARRIRGERA
        LVGLFEDTNLCAIHAKRVTIMPKDIQLARRIRGERA
Subjt:  LVGLFEDTNLCAIHAKRVTIMPKDIQLARRIRGERA

AT4G40050.1 Protein of unknown function (DUF3550/UPF0682)7.9e-18953.5Show/hide
Query:  AKSFRAMVESANRKFARVQDVPAYGRVDNHHYFHKVFKAYMRLWKYQQEFRAKLVESGLNRWEIGEIASRIGQLYFGHYMRTSEARFLIEAYVFYEAILN
        + +FRA+VE+A+RKFARV+D+PA+GR  + HYF KVFKAYM+LW YQQ  R+KLVESGLNRWEIGEIASRIGQLYF  YMRTSEARFL+EA+VFYEAIL 
Subjt:  AKSFRAMVESANRKFARVQDVPAYGRVDNHHYFHKVFKAYMRLWKYQQEFRAKLVESGLNRWEIGEIASRIGQLYFGHYMRTSEARFLIEAYVFYEAILN

Query:  RSYFEGSKNSRKDLGARFKELRFYARFLLVSLFLNRTDTVQVLAERLKALVDDSKATFRATDFKEWRLVVQEIFCFMNVATASMNVRPLRYSAAFDSHPS
        RSYF+ ++   KDLGARFKELRFYARFLLVSL ++R   +  LA++L+ LVD S + FR T+FKEWRLVVQEI  F+   T    +RPLRY A  DS+P+
Subjt:  RSYFEGSKNSRKDLGARFKELRFYARFLLVSLFLNRTDTVQVLAERLKALVDDSKATFRATDFKEWRLVVQEIFCFMNVATASMNVRPLRYSAAFDSHPS

Query:  SLPFVARFHAKRVLKFRDAVLTSYHRNEVKFAEITLDTYRMLQCLEWEP-GFFYQKHPVEPNQNGAAIDHSGASGIIDINLATDITDPSLPPNPKKAILY
        S  ++ARFHAK++ KFRDA+L SYHRNEVK+AE+TLDTYRM+QCLEWEP G FYQK PVE  +NG  +DH+  SG+ID+NLA D+ DPSLPPNP+KAILY
Subjt:  SLPFVARFHAKRVLKFRDAVLTSYHRNEVKFAEITLDTYRMLQCLEWEP-GFFYQKHPVEPNQNGAAIDHSGASGIIDINLATDITDPSLPPNPKKAILY

Query:  RPSVTHLIAPARGNINSSFSELSTSICHHFWKPQFFSIAIHMLTKAFTVNSNAYHHPLLRTRICPNIFSVAIVTSGTNVYYGKERIKVEKVMATICEELH
        RP+V+HL+A                                                                                 V+A IC+EL 
Subjt:  RPSVTHLIAPARGNINSSFSELSTSICHHFWKPQFFSIAIHMLTKAFTVNSNAYHHPLLRTRICPNIFSVAIVTSGTNVYYGKERIKVEKVMATICEELH

Query:  PDSIMLIYLSATGKCCQNSVNQMASYGESRKSLKNKVIAQNSRENCNALPESCKSEKPGSSDLYDEYLWFGHR-GNGGPNVLYPGDIIPFTRRPVFLIVD
        P+++ML+YLSA+G   + +V Q  +   S ++ K+K++A+ S+E  +   E   + K  S++ Y+ +LW G R G+ G N LYPGD+IPFTR+P+FLI+D
Subjt:  PDSIMLIYLSATGKCCQNSVNQMASYGESRKSLKNKVIAQNSRENCNALPESCKSEKPGSSDLYDEYLWFGHR-GNGGPNVLYPGDIIPFTRRPVFLIVD

Query:  SNNSHAFK----AERGETAAVLLSPLRPAFKNPLNVDT-IQSGSQFTFFLTAPLPAFCEMVGLSSANLDIDVYNDAETIISSAFSDWEIILCTSTSLNIV
        S+ S AFK    AERGE  A+LLSPL+P+F+NP   DT   +GSQFTFFLTAPL AFC+M+GLS+   D ++ ++AE+I+S++FS+WE IL TS  LN+V
Subjt:  SNNSHAFK----AERGETAAVLLSPLRPAFKNPLNVDT-IQSGSQFTFFLTAPLPAFCEMVGLSSANLDIDVYNDAETIISSAFSDWEIILCTSTSLNIV

Query:  WAQVLSDHFLRRLILRFIFCRSVLSFFNTKEDDD--LPVCLPCLPDSVSSNSGVVSSAIRRLAKHLNMART
        WAQVL D FLRRLILRFIFCRSVL+ F+  EDDD  LP C P LP+ +SS S  V S+++RLA+HL +A++
Subjt:  WAQVLSDHFLRRLILRFIFCRSVLSFFNTKEDDD--LPVCLPCLPDSVSSNSGVVSSAIRRLAKHLNMART


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACCGACAACGACTCTGAAGCGAAGAGCTTCCGAGCTATGGTGGAGAGCGCCAACCGGAAGTTCGCTAGGGTCCAAGACGTTCCGGCCTACGGACGTGTGGACAACCA
CCACTACTTTCATAAGGTTTTCAAGGCTTATATGCGTCTCTGGAAGTACCAGCAGGAATTCCGCGCCAAGCTTGTTGAATCTGGTCTCAACCGCTGGGAAATTGGCGAGA
TCGCTAGCCGGATCGGTCAGCTTTACTTTGGGCATTATATGAGAACTAGTGAGGCCAGGTTCTTGATTGAAGCCTATGTCTTCTATGAAGCGATTCTTAATCGAAGCTAT
TTTGAGGGATCGAAGAATTCGAGAAAGGATCTGGGGGCGAGGTTCAAGGAACTGAGGTTTTACGCGAGGTTTTTGCTGGTTTCGTTGTTTTTGAATCGTACGGATACCGT
TCAGGTTCTCGCGGAACGATTGAAGGCTCTGGTCGACGATAGCAAGGCCACTTTTCGGGCTACTGACTTTAAAGAGTGGAGGCTAGTTGTACAAGAAATTTTCTGCTTCA
TGAATGTAGCAACAGCCTCGATGAATGTCAGACCTTTACGTTACTCTGCTGCATTTGATTCCCATCCGTCATCCCTTCCATTTGTGGCACGTTTCCATGCAAAGAGGGTT
CTTAAATTCCGAGATGCCGTTCTGACTAGCTACCACCGAAATGAGGTTAAATTTGCGGAAATTACTTTGGACACTTATAGAATGCTACAATGTCTAGAATGGGAGCCTGG
TTTCTTCTACCAAAAGCATCCAGTTGAACCAAATCAAAATGGGGCTGCCATTGATCATTCTGGTGCGTCTGGAATAATTGATATTAACCTAGCTACTGATATTACCGATC
CATCTTTACCTCCAAATCCAAAGAAAGCTATCCTCTATCGACCTTCTGTGACTCATTTGATAGCTCCTGCAAGGGGAAATATCAACAGTTCCTTCAGTGAACTCTCCACT
TCTATTTGTCATCACTTTTGGAAGCCCCAATTCTTCTCCATCGCTATTCACATGCTAACCAAAGCCTTCACCGTAAACTCCAATGCCTATCACCACCCGCTACTTAGAAC
TAGGATATGCCCCAATATATTCTCAGTGGCCATTGTGACTTCAGGGACCAATGTATATTATGGAAAGGAACGTATAAAGGTTGAAAAGGTCATGGCCACAATTTGTGAGG
AGCTCCATCCAGATAGTATCATGCTGATTTATCTATCTGCAACAGGGAAATGCTGTCAAAACAGTGTCAATCAAATGGCGAGTTATGGGGAATCAAGAAAATCCTTGAAA
AATAAAGTCATCGCCCAGAATTCACGAGAAAATTGTAATGCTCTACCCGAATCTTGTAAGAGTGAGAAGCCAGGATCGAGCGACCTTTATGATGAGTATTTGTGGTTTGG
GCATAGGGGTAATGGAGGTCCAAACGTTTTATACCCTGGTGATATAATACCTTTTACACGTAGACCTGTTTTCTTGATAGTTGACAGTAACAACAGCCATGCATTCAAGG
CAGAGAGAGGAGAAACTGCCGCCGTACTTCTTTCACCATTGAGGCCTGCATTCAAGAATCCCTTAAATGTTGATACAATTCAATCAGGAAGTCAGTTTACTTTTTTCTTG
ACTGCCCCTCTGCCGGCATTTTGTGAAATGGTTGGCCTGTCCTCGGCCAATTTGGATATAGATGTTTACAATGATGCTGAGACCATAATCTCCTCCGCATTTTCCGATTG
GGAAATAATTCTTTGTACATCAACTAGCTTAAATATCGTTTGGGCCCAAGTTTTGTCTGATCATTTTTTACGCCGTCTCATTCTCAGATTTATATTCTGCCGATCCGTGC
TATCGTTCTTCAACACTAAAGAAGACGACGACCTTCCTGTTTGCCTGCCTTGTCTTCCCGACTCTGTTTCTTCAAATTCTGGAGTTGTCAGCTCAGCAATTCGCCGTCTC
GCAAAGCACCTTAACATGGCTCGTACCAAGCAAACTGCCCGTAAGTCCACCGGAGGAAAGGCTCCAAGGAAGCAGCTGGCCACAAAGGCTGCTCGTAAGTCTGCCCCAAC
CACAGGAGGAGTGAAGAAGCCCCACAGATACCGTCCTGGTACCGTTGCTCTTCGTGAAATTCGTAAGTATCAGAAGAGTACTGAGCTTTTGATCAGGAAGTTGCCCTTCC
AGAGGTTGGTTCGTGAAATTGCCCAGGACTTTAAGACTGATCTGCGTTTCCAGAGCCATGCTGTTCTGGCATTGCAAGAGGCAGCCGAGGCCTACCTTGTTGGGTTATTC
GAGGACACCAATCTCTGTGCCATTCACGCAAAGCGTGTTACCATAATGCCTAAAGATATCCAGTTGGCTCGGAGAATCCGGGGTGAACGTGCTTAG
mRNA sequenceShow/hide mRNA sequence
CAAAAAACACACCTCTCCCTCCCACTTCTTCTCATCCTTCTCCTTTCTCCGCCATAGCCGCCAAAATTCCTGCCATTTCCGGCCCCTCTCCGCCGCCTCTGCAACCCGCT
TTTACCGGATTTCAAATCCAAATTCAGTTTCACACGACAAAGTAAAAGCGAAGGGAGATTAGAGGAGGAAGAAGAAGAAGAAGGATGAAAGAGAAAAAGTGGTTTGATTT
ATGACGCCACAATAGAACCAAACTCCACACGAGTCCCCAAGCAAAAAAGTCCAACAGCTTCTCTTTCTCTCTTTCTCCCTGTTCCTCTGAATCGTATAAGCTCGTAGACG
TACTCGAGCGCGCGCACTCTCACCTCTCTTTTTTTTCCCCCTTTTTTCTCTGGTAGTTCACTTTGCGCTTTCGTTTGTTCTGTTTTTGTTCGATGAGAGCCGGAGATTGT
GACTGCCGGTCTAGGAATTGAGACTTCCGGAATGACCGACAACGACTCTGAAGCGAAGAGCTTCCGAGCTATGGTGGAGAGCGCCAACCGGAAGTTCGCTAGGGTCCAAG
ACGTTCCGGCCTACGGACGTGTGGACAACCACCACTACTTTCATAAGGTTTTCAAGGCTTATATGCGTCTCTGGAAGTACCAGCAGGAATTCCGCGCCAAGCTTGTTGAA
TCTGGTCTCAACCGCTGGGAAATTGGCGAGATCGCTAGCCGGATCGGTCAGCTTTACTTTGGGCATTATATGAGAACTAGTGAGGCCAGGTTCTTGATTGAAGCCTATGT
CTTCTATGAAGCGATTCTTAATCGAAGCTATTTTGAGGGATCGAAGAATTCGAGAAAGGATCTGGGGGCGAGGTTCAAGGAACTGAGGTTTTACGCGAGGTTTTTGCTGG
TTTCGTTGTTTTTGAATCGTACGGATACCGTTCAGGTTCTCGCGGAACGATTGAAGGCTCTGGTCGACGATAGCAAGGCCACTTTTCGGGCTACTGACTTTAAAGAGTGG
AGGCTAGTTGTACAAGAAATTTTCTGCTTCATGAATGTAGCAACAGCCTCGATGAATGTCAGACCTTTACGTTACTCTGCTGCATTTGATTCCCATCCGTCATCCCTTCC
ATTTGTGGCACGTTTCCATGCAAAGAGGGTTCTTAAATTCCGAGATGCCGTTCTGACTAGCTACCACCGAAATGAGGTTAAATTTGCGGAAATTACTTTGGACACTTATA
GAATGCTACAATGTCTAGAATGGGAGCCTGGTTTCTTCTACCAAAAGCATCCAGTTGAACCAAATCAAAATGGGGCTGCCATTGATCATTCTGGTGCGTCTGGAATAATT
GATATTAACCTAGCTACTGATATTACCGATCCATCTTTACCTCCAAATCCAAAGAAAGCTATCCTCTATCGACCTTCTGTGACTCATTTGATAGCTCCTGCAAGGGGAAA
TATCAACAGTTCCTTCAGTGAACTCTCCACTTCTATTTGTCATCACTTTTGGAAGCCCCAATTCTTCTCCATCGCTATTCACATGCTAACCAAAGCCTTCACCGTAAACT
CCAATGCCTATCACCACCCGCTACTTAGAACTAGGATATGCCCCAATATATTCTCAGTGGCCATTGTGACTTCAGGGACCAATGTATATTATGGAAAGGAACGTATAAAG
GTTGAAAAGGTCATGGCCACAATTTGTGAGGAGCTCCATCCAGATAGTATCATGCTGATTTATCTATCTGCAACAGGGAAATGCTGTCAAAACAGTGTCAATCAAATGGC
GAGTTATGGGGAATCAAGAAAATCCTTGAAAAATAAAGTCATCGCCCAGAATTCACGAGAAAATTGTAATGCTCTACCCGAATCTTGTAAGAGTGAGAAGCCAGGATCGA
GCGACCTTTATGATGAGTATTTGTGGTTTGGGCATAGGGGTAATGGAGGTCCAAACGTTTTATACCCTGGTGATATAATACCTTTTACACGTAGACCTGTTTTCTTGATA
GTTGACAGTAACAACAGCCATGCATTCAAGGCAGAGAGAGGAGAAACTGCCGCCGTACTTCTTTCACCATTGAGGCCTGCATTCAAGAATCCCTTAAATGTTGATACAAT
TCAATCAGGAAGTCAGTTTACTTTTTTCTTGACTGCCCCTCTGCCGGCATTTTGTGAAATGGTTGGCCTGTCCTCGGCCAATTTGGATATAGATGTTTACAATGATGCTG
AGACCATAATCTCCTCCGCATTTTCCGATTGGGAAATAATTCTTTGTACATCAACTAGCTTAAATATCGTTTGGGCCCAAGTTTTGTCTGATCATTTTTTACGCCGTCTC
ATTCTCAGATTTATATTCTGCCGATCCGTGCTATCGTTCTTCAACACTAAAGAAGACGACGACCTTCCTGTTTGCCTGCCTTGTCTTCCCGACTCTGTTTCTTCAAATTC
TGGAGTTGTCAGCTCAGCAATTCGCCGTCTCGCAAAGCACCTTAACATGGCTCGTACCAAGCAAACTGCCCGTAAGTCCACCGGAGGAAAGGCTCCAAGGAAGCAGCTGG
CCACAAAGGCTGCTCGTAAGTCTGCCCCAACCACAGGAGGAGTGAAGAAGCCCCACAGATACCGTCCTGGTACCGTTGCTCTTCGTGAAATTCGTAAGTATCAGAAGAGT
ACTGAGCTTTTGATCAGGAAGTTGCCCTTCCAGAGGTTGGTTCGTGAAATTGCCCAGGACTTTAAGACTGATCTGCGTTTCCAGAGCCATGCTGTTCTGGCATTGCAAGA
GGCAGCCGAGGCCTACCTTGTTGGGTTATTCGAGGACACCAATCTCTGTGCCATTCACGCAAAGCGTGTTACCATAATGCCTAAAGATATCCAGTTGGCTCGGAGAATCC
GGGGTGAACGTGCTTAGAAGATGATGTATTGTTGTTTGGTGGGATGGTAGTGTTGGCTGCAAGGAGGTGATCAAAGACCTAATTGATGGATGAAGATGATGGGTGGTTCT
TATTTTTGTTTTTAGCAAGGTTGGTAGATAGTTATACTAGAAGAAACTTGTAGTGAATATGGTAAAAAGGCATAATGCTTTGCTTGTGCATGTTAATGTTTAGATGTGGA
TGATTTTGTTTTTTGTTCTTTTTAAAGGATGGAGGAGCTTAATTCTGTATTTCATTGAATTGTGTTTTTGGGGTATGCAGCCAAACTAATATTTGTTGGATAACCTATCA
CATCTAATTTTTAGTTCAGATTGATTTGTAAGTTCATGAATTATTTCCACCTATTATTTGTGGTGTTGTGTTTGTTTCATGTGGAAGAAGATTGTTGTTACAGCTACGCG
GTGGGGAAAAGTGAATTGTTTGTTAGGAGGGTTTTGAACCTTAGCACCCCTAATAAATCAAATGATCTTTACTCGTCCATTGGCTAACATTTGATCTTCTACTGATTAAA
CCCATACTTTATTTCACGTTTTGCTGACTGCTTCGGTGTTTTTTAAAAGAAAGCGTGGGGTACTGTTGTTCAATGTGTGGAGTATCAGTTTGTATGTTGGGGTTCATGGT
TCAGGTTGTTGGGAAACATTGTTTGGGGGACAGTGTCGTCTGTTCGTAGTGTTGGGTCTTTTAATGAAGTTTTGTAATTCTTTATGC
Protein sequenceShow/hide protein sequence
MTDNDSEAKSFRAMVESANRKFARVQDVPAYGRVDNHHYFHKVFKAYMRLWKYQQEFRAKLVESGLNRWEIGEIASRIGQLYFGHYMRTSEARFLIEAYVFYEAILNRSY
FEGSKNSRKDLGARFKELRFYARFLLVSLFLNRTDTVQVLAERLKALVDDSKATFRATDFKEWRLVVQEIFCFMNVATASMNVRPLRYSAAFDSHPSSLPFVARFHAKRV
LKFRDAVLTSYHRNEVKFAEITLDTYRMLQCLEWEPGFFYQKHPVEPNQNGAAIDHSGASGIIDINLATDITDPSLPPNPKKAILYRPSVTHLIAPARGNINSSFSELST
SICHHFWKPQFFSIAIHMLTKAFTVNSNAYHHPLLRTRICPNIFSVAIVTSGTNVYYGKERIKVEKVMATICEELHPDSIMLIYLSATGKCCQNSVNQMASYGESRKSLK
NKVIAQNSRENCNALPESCKSEKPGSSDLYDEYLWFGHRGNGGPNVLYPGDIIPFTRRPVFLIVDSNNSHAFKAERGETAAVLLSPLRPAFKNPLNVDTIQSGSQFTFFL
TAPLPAFCEMVGLSSANLDIDVYNDAETIISSAFSDWEIILCTSTSLNIVWAQVLSDHFLRRLILRFIFCRSVLSFFNTKEDDDLPVCLPCLPDSVSSNSGVVSSAIRRL
AKHLNMARTKQTARKSTGGKAPRKQLATKAARKSAPTTGGVKKPHRYRPGTVALREIRKYQKSTELLIRKLPFQRLVREIAQDFKTDLRFQSHAVLALQEAAEAYLVGLF
EDTNLCAIHAKRVTIMPKDIQLARRIRGERA