; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cucsat.G833 (gene) of Cucumber (B10) v3 genome

Gene IDCucsat.G833
OrganismCucumis sativus L. var. sativus cv. B10 (Cucumber (B10) v3)
DescriptionHAT transposon superfamily
Genome locationctg1:2526676..2532611
RNA-Seq ExpressionCucsat.G833
SyntenyCucsat.G833
Gene Ontology termsGO:0005634 - nucleus (cellular component)
GO:0003677 - DNA binding (molecular function)
GO:0046872 - metal ion binding (molecular function)
GO:0046983 - protein dimerization activity (molecular function)
InterPro domainsIPR003656 - Zinc finger, BED-type
IPR007021 - Domain of unknown function DUF659
IPR008906 - HAT, C-terminal dimerisation domain
IPR012337 - Ribonuclease H-like superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008448901.1 PREDICTED: uncharacterized protein LOC103490927 [Cucumis melo]0.097.77Show/hide
Query:  MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPCTEVPTDVRDHIQGILSTPKKQKAPKKPKVDMETATNGQQHSSSASGG
        MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPCTEVPTDVRDHIQGILSTPKKQKAPKKPKVDMETATNGQQHSSSASGG
Subjt:  MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPCTEVPTDVRDHIQGILSTPKKQKAPKKPKVDMETATNGQQHSSSASGG

Query:  IHHGSSGQNESNCPSTFPCLSPSAQPPIDDAQKQKKDETDKKVAIFFFHNSIPFSAAKSLYYQEMVDAIAEYGGGYKAPSYEKLKSTLLDKVKGDIHSSY
        IHHGSSGQNESNCPSTFPCLSPSAQPPIDDAQKQKKDETDKKVAIFFFHNSIPFSAAKSLYYQEMVDAIAEYGGGYKAPSYEKLKSTLLDKVKGDIHSSY
Subjt:  IHHGSSGQNESNCPSTFPCLSPSAQPPIDDAQKQKKDETDKKVAIFFFHNSIPFSAAKSLYYQEMVDAIAEYGGGYKAPSYEKLKSTLLDKVKGDIHSSY

Query:  KKHRDEWKETGCTILCDSWSDGQTKSFLVISVTCSKGTLFLKSVDISGHEDDATYLSDLLETIILEVGVENVVQIITDATASYVYAGRLLMTKYTSLFWS
        KK RDEWKETGCTILCDSWSDG+TKSFLVISVTCSKGTLFLKSVD SGHEDDATYLSDLLETI+LEVGVENVVQIITDATASYVYAGRLLMTKYTSLFWS
Subjt:  KKHRDEWKETGCTILCDSWSDGQTKSFLVISVTCSKGTLFLKSVDISGHEDDATYLSDLLETIILEVGVENVVQIITDATASYVYAGRLLMTKYTSLFWS

Query:  PCVSYCVNQMLEDISKIEWVSAVLEEAKIITRYIYSHASILNTMRKFTGGKELIRPRITRFVTNFLSLRSIVILEDNLKHMFAHSEWLSSIYSRRPDAQA
        PCVSYCVNQMLEDISKIEWVS VLEEAKIITRYIYSHASILNTMRKFTGGKELIRPRITRFVTNFLSLRSIVILE+NLKHMFAHSEWLSSIYSRRPDAQA
Subjt:  PCVSYCVNQMLEDISKIEWVSAVLEEAKIITRYIYSHASILNTMRKFTGGKELIRPRITRFVTNFLSLRSIVILEDNLKHMFAHSEWLSSIYSRRPDAQA

Query:  IISLLYLDRFWKDAHEAINICEPLIRILRIVDGDMPAMGYIFEGIERAKVEIKTYYNGFEDKYMPIWETIDRRWNLQLHTTLHTAAAFLNPSVFYNPNFK
        IISLLYLDRFWKDAHEAINICEPLIRILRIVDGDMPAMGYIFEGIERAKVEIKTYYNGFEDKYMPIWETIDRRWNLQLHTTLHTAAAFLNPSVFYNPNFK
Subjt:  IISLLYLDRFWKDAHEAINICEPLIRILRIVDGDMPAMGYIFEGIERAKVEIKTYYNGFEDKYMPIWETIDRRWNLQLHTTLHTAAAFLNPSVFYNPNFK

Query:  IDLRIRNGFQEAMLKMATTDKDKMEITREHPAYVNGQGALGTDFAILGRTINAPGDWWSGYGYEIPTLQRAAVRILSQPCSSYGCSGWNWSTFETLHSKK
        IDLRIRNGFQEAMLKMATTDKDKMEITREHPAYVNGQGALGTDFAILGRTIN+PGDWWSGYGYEIPTLQRAAVRILSQPCSSYGCS WNWSTFETLHSKK
Subjt:  IDLRIRNGFQEAMLKMATTDKDKMEITREHPAYVNGQGALGTDFAILGRTINAPGDWWSGYGYEIPTLQRAAVRILSQPCSSYGCSGWNWSTFETLHSKK

Query:  HSRAEQEKLTDLVFVQCNLWLQHVCLTRDSKYKPVVFDDVDVSLEWPSELECSAHVLDDSWLDN-LPLEGRGSP
         SRAEQEKLTDLVFVQCNLWLQH+C TRDSKYKP+VFDD+DVSLEWPSELECSAHVLDDSWLDN LPLE RGSP
Subjt:  HSRAEQEKLTDLVFVQCNLWLQHVCLTRDSKYKPVVFDDVDVSLEWPSELECSAHVLDDSWLDN-LPLEGRGSP

XP_011650424.1 uncharacterized protein LOC101222344 [Cucumis sativus]0.099.85Show/hide
Query:  MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPCTEVPTDVRDHIQGILSTPKKQKAPKKPKVDMETATNGQQHSSSASGG
        MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPCTEVPTDVRDHIQGILSTPKKQKAPKKPKVDMETATNGQQHSSSASGG
Subjt:  MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPCTEVPTDVRDHIQGILSTPKKQKAPKKPKVDMETATNGQQHSSSASGG

Query:  IHHGSSGQNESNCPSTFPCLSPSAQPPIDDAQKQKKDETDKKVAIFFFHNSIPFSAAKSLYYQEMVDAIAEYGGGYKAPSYEKLKSTLLDKVKGDIHSSY
        IHHGSSGQNESNCPST+PCLSPSAQPPIDDAQKQKKDETDKKVAIFFFHNSIPFSAAKSLYYQEMVDAIAEYGGGYKAPSYEKLKSTLLDKVKGDIHSSY
Subjt:  IHHGSSGQNESNCPSTFPCLSPSAQPPIDDAQKQKKDETDKKVAIFFFHNSIPFSAAKSLYYQEMVDAIAEYGGGYKAPSYEKLKSTLLDKVKGDIHSSY

Query:  KKHRDEWKETGCTILCDSWSDGQTKSFLVISVTCSKGTLFLKSVDISGHEDDATYLSDLLETIILEVGVENVVQIITDATASYVYAGRLLMTKYTSLFWS
        KKHRDEWKETGCTILCDSWSDGQTKSFLVISVTCSKGTLFLKSVDISGHEDDATYLSDLLETIILEVGVENVVQIITDATASYVYAGRLLMTKYTSLFWS
Subjt:  KKHRDEWKETGCTILCDSWSDGQTKSFLVISVTCSKGTLFLKSVDISGHEDDATYLSDLLETIILEVGVENVVQIITDATASYVYAGRLLMTKYTSLFWS

Query:  PCVSYCVNQMLEDISKIEWVSAVLEEAKIITRYIYSHASILNTMRKFTGGKELIRPRITRFVTNFLSLRSIVILEDNLKHMFAHSEWLSSIYSRRPDAQA
        PCVSYCVNQMLEDISKIEWVSAVLEEAKIITRYIYSHASILNTMRKFTGGKELIRPRITRFVTNFLSLRSIVILEDNLKHMFAHSEWLSSIYSRRPDAQA
Subjt:  PCVSYCVNQMLEDISKIEWVSAVLEEAKIITRYIYSHASILNTMRKFTGGKELIRPRITRFVTNFLSLRSIVILEDNLKHMFAHSEWLSSIYSRRPDAQA

Query:  IISLLYLDRFWKDAHEAINICEPLIRILRIVDGDMPAMGYIFEGIERAKVEIKTYYNGFEDKYMPIWETIDRRWNLQLHTTLHTAAAFLNPSVFYNPNFK
        IISLLYLDRFWKDAHEAINICEPLIRILRIVDGDMPAMGYIFEGIERAKVEIKTYYNGFEDKYMPIWETIDRRWNLQLHTTLHTAAAFLNPSVFYNPNFK
Subjt:  IISLLYLDRFWKDAHEAINICEPLIRILRIVDGDMPAMGYIFEGIERAKVEIKTYYNGFEDKYMPIWETIDRRWNLQLHTTLHTAAAFLNPSVFYNPNFK

Query:  IDLRIRNGFQEAMLKMATTDKDKMEITREHPAYVNGQGALGTDFAILGRTINAPGDWWSGYGYEIPTLQRAAVRILSQPCSSYGCSGWNWSTFETLHSKK
        IDLRIRNGFQEAMLKMATTDKDKMEITREHPAYVNGQGALGTDFAILGRTINAPGDWWSGYGYEIPTLQRAAVRILSQPCSSYGCSGWNWSTFETLHSKK
Subjt:  IDLRIRNGFQEAMLKMATTDKDKMEITREHPAYVNGQGALGTDFAILGRTINAPGDWWSGYGYEIPTLQRAAVRILSQPCSSYGCSGWNWSTFETLHSKK

Query:  HSRAEQEKLTDLVFVQCNLWLQHVCLTRDSKYKPVVFDDVDVSLEWPSELECSAHVLDDSWLDNLPLEGRGSP
        HSRAEQEKLTDLVFVQCNLWLQHVCLTRDSKYKPVVFDDVDVSLEWPSELECSAHVLDDSWLDNLPLEGRGSP
Subjt:  HSRAEQEKLTDLVFVQCNLWLQHVCLTRDSKYKPVVFDDVDVSLEWPSELECSAHVLDDSWLDNLPLEGRGSP

XP_022923437.1 uncharacterized protein LOC111431132 [Cucurbita moschata]0.090.94Show/hide
Query:  MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPCTEVPTDVRDHIQGILSTPKKQKAPKKPKVDMETATNGQQHSSSASGG
        MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPC EVP DVRDHIQGILSTPKKQ+APKKPKVDMETATNGQQHSSSASGG
Subjt:  MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPCTEVPTDVRDHIQGILSTPKKQKAPKKPKVDMETATNGQQHSSSASGG

Query:  IHHGSSGQNESNCPSTFPCLSPSAQPPIDDAQKQKKDETDKKVAIFFFHNSIPFSAAKSLYYQEMVDAIAEYGGGYKAPSYEKLKSTLLDKVKGDIHSSY
        IHHGSSGQNESNCPSTFP  SPSAQPPIDDAQKQKKDETDKKVA+FFFHNSIPFSAAKSLYYQEMV+AIAEYG GY+APSYEKLKSTLL KVKGDI +SY
Subjt:  IHHGSSGQNESNCPSTFPCLSPSAQPPIDDAQKQKKDETDKKVAIFFFHNSIPFSAAKSLYYQEMVDAIAEYGGGYKAPSYEKLKSTLLDKVKGDIHSSY

Query:  KKHRDEWKETGCTILCDSWSDGQTKSFLVISVTCSKGTLFLKSVDISGHEDDATYLSDLLETIILEVGVENVVQIITDATASYVYAGRLLMTKYTSLFWS
        KK+RDEWKETGCTILC+SWSDG+TKSFL+IS+TCSKGTLFLKSV+ISG EDDATYLSDLLETI+LEVGVENVVQ+ITDATASYVYAGRLLMTKYTSLFWS
Subjt:  KKHRDEWKETGCTILCDSWSDGQTKSFLVISVTCSKGTLFLKSVDISGHEDDATYLSDLLETIILEVGVENVVQIITDATASYVYAGRLLMTKYTSLFWS

Query:  PCVSYCVNQMLEDISKIEWVSAVLEEAKIITRYIYSHASILNTMRKFTGGKELIRPRITRFVTNFLSLRSIVILEDNLKHMFAHSEWLSSIYSRRPDAQA
        PCVSYCVNQMLED+SKIEWV  VL+EAKII RY+YSHA ILNTMRKFT GKELIRPRITRFVTNFLSLRSIV LED LKHMFAHSEW SSIYSRRPDAQA
Subjt:  PCVSYCVNQMLEDISKIEWVSAVLEEAKIITRYIYSHASILNTMRKFTGGKELIRPRITRFVTNFLSLRSIVILEDNLKHMFAHSEWLSSIYSRRPDAQA

Query:  IISLLYLDRFWKDAHEAINICEPLIRILRIVDGDMPAMGYIFEGIERAKVEIKTYYNGFEDKYMPIWETIDRRWNLQLHTTLHTAAAFLNPSVFYNPNFK
        I+S LYLDRFWKDA EA+NI EPLIRILR+VDGDMPAMGYI+EGIERAKVE+K YYNG EDKYMPIW+TIDRRWNLQLHTTLHTAAAFLNPS+FYNPNFK
Subjt:  IISLLYLDRFWKDAHEAINICEPLIRILRIVDGDMPAMGYIFEGIERAKVEIKTYYNGFEDKYMPIWETIDRRWNLQLHTTLHTAAAFLNPSVFYNPNFK

Query:  IDLRIRNGFQEAMLKMATTDKDKMEITREHPAYVNGQGALGTDFAILGRTINAPGDWWSGYGYEIPTLQRAAVRILSQPCSSYGCSGWNWSTFETLHSKK
        IDLRIRNGFQEAMLKMATTDKDKMEITREHPAYVN QGALGTDFAILGRTIN PGDWWSGYGYEIPTLQRAA+RILSQPCSSYGCS WNWSTFETLHSKK
Subjt:  IDLRIRNGFQEAMLKMATTDKDKMEITREHPAYVNGQGALGTDFAILGRTINAPGDWWSGYGYEIPTLQRAAVRILSQPCSSYGCSGWNWSTFETLHSKK

Query:  HSRAEQEKLTDLVFVQCNLWLQHVCLTRDSKYKPVVFDDVDVSLEWPSELECSAHVLDDSWLDNLPLEGRGSP
         S  EQEKL DLVFVQCNLWLQH+  TRD KYKPVVFDD+DVSLEWP+ELE SAHVLDDSWLDNLPLE  GSP
Subjt:  HSRAEQEKLTDLVFVQCNLWLQHVCLTRDSKYKPVVFDDVDVSLEWPSELECSAHVLDDSWLDNLPLEGRGSP

XP_038876874.1 uncharacterized protein LOC120069237 isoform X1 [Benincasa hispida]0.094.54Show/hide
Query:  LSSQTMVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPCTEVPTDVRDHIQGILSTPKKQKAPKKPKVDMETATNGQQHSS
        L  +TMVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPC EVPTDVRDHIQGILSTPKKQKAPKKPKVDMETATNGQQHSS
Subjt:  LSSQTMVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPCTEVPTDVRDHIQGILSTPKKQKAPKKPKVDMETATNGQQHSS

Query:  SASGGIHHGSSGQNESNCPSTFPCLSPSAQPPIDDAQKQKKDETDKKVAIFFFHNSIPFSAAKSLYYQEMVDAIAEYGGGYKAPSYEKLKSTLLDKVKGD
        SASGGIHHGSSGQNESNCPSTFPC SPSAQPPIDD QKQKKDETDKKVAIFFFHNSIPFSAAKSLYYQEMV+AIAEYG GYKAPSYEKLKSTLLDKVKGD
Subjt:  SASGGIHHGSSGQNESNCPSTFPCLSPSAQPPIDDAQKQKKDETDKKVAIFFFHNSIPFSAAKSLYYQEMVDAIAEYGGGYKAPSYEKLKSTLLDKVKGD

Query:  IHSSYKKHRDEWKETGCTILCDSWSDGQTKSFLVISVTCSKGTLFLKSVDISGHEDDATYLSDLLETIILEVGVENVVQIITDATASYVYAGRLLMTKYT
        IH+SYKK+ DEWKETGCTILCDSWSDG+TKSFLVIS+TCSKG LFLKSVDISGHEDDATYLSDLLETI+LEVGVENVVQ+ITDATASYVYAGRLLMTKYT
Subjt:  IHSSYKKHRDEWKETGCTILCDSWSDGQTKSFLVISVTCSKGTLFLKSVDISGHEDDATYLSDLLETIILEVGVENVVQIITDATASYVYAGRLLMTKYT

Query:  SLFWSPCVSYCVNQMLEDISKIEWVSAVLEEAKIITRYIYSHASILNTMRKFTGGKELIRPRITRFVTNFLSLRSIVILEDNLKHMFAHSEWLSSIYSRR
        SLFWSPCVSYCVNQMLEDISKIEWV  VLEEAKIITRYIYSHASILNTMRKFT GKELIRPRITRFVTNFLSLRSIVI EDNLKHMFAHSEWLSSIYSRR
Subjt:  SLFWSPCVSYCVNQMLEDISKIEWVSAVLEEAKIITRYIYSHASILNTMRKFTGGKELIRPRITRFVTNFLSLRSIVILEDNLKHMFAHSEWLSSIYSRR

Query:  PDAQAIISLLYLDRFWKDAHEAINICEPLIRILRIVDGDMPAMGYIFEGIERAKVEIKTYYNGFEDKYMPIWETIDRRWNLQLHTTLHTAAAFLNPSVFY
        PDAQAIISLLYLDRFWKDA EA+NI EPLIRILRIVDGDMPAMGYIFEGIERAKVEIKTYYNG EDKY+PIWETIDRRWNLQLHTTLHTAAAFLNPSVFY
Subjt:  PDAQAIISLLYLDRFWKDAHEAINICEPLIRILRIVDGDMPAMGYIFEGIERAKVEIKTYYNGFEDKYMPIWETIDRRWNLQLHTTLHTAAAFLNPSVFY

Query:  NPNFKIDLRIRNGFQEAMLKMATTDKDKMEITREHPAYVNGQGALGTDFAILGRTINAPGDWWSGYGYEIPTLQRAAVRILSQPCSSYGCSGWNWSTFET
        NPNFKIDLRIRNGFQEAMLKMATTDKDKMEITREHPAYVNGQGALGTDFAILGRTINAPGDWWSGYGYEIPTLQRAA+RIL+QPCSSYGCS WNWSTFET
Subjt:  NPNFKIDLRIRNGFQEAMLKMATTDKDKMEITREHPAYVNGQGALGTDFAILGRTINAPGDWWSGYGYEIPTLQRAAVRILSQPCSSYGCSGWNWSTFET

Query:  LHSKKHSRAEQEKLTDLVFVQCNLWLQHVCLTRDSKYKPVVFDDVDVSLEWPSELECSAHVLDDSWLDNLPLEGRGSP
        LHSKK SRAEQEKL DLVFVQCNLWLQH+CLTRD KYKPVVFDD+DVSLEWP+E E SAHVLDDSWLDNLPLE RGSP
Subjt:  LHSKKHSRAEQEKLTDLVFVQCNLWLQHVCLTRDSKYKPVVFDDVDVSLEWPSELECSAHVLDDSWLDNLPLEGRGSP

XP_038876877.1 uncharacterized protein LOC120069237 isoform X2 [Benincasa hispida]0.094.95Show/hide
Query:  MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPCTEVPTDVRDHIQGILSTPKKQKAPKKPKVDMETATNGQQHSSSASGG
        MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPC EVPTDVRDHIQGILSTPKKQKAPKKPKVDMETATNGQQHSSSASGG
Subjt:  MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPCTEVPTDVRDHIQGILSTPKKQKAPKKPKVDMETATNGQQHSSSASGG

Query:  IHHGSSGQNESNCPSTFPCLSPSAQPPIDDAQKQKKDETDKKVAIFFFHNSIPFSAAKSLYYQEMVDAIAEYGGGYKAPSYEKLKSTLLDKVKGDIHSSY
        IHHGSSGQNESNCPSTFPC SPSAQPPIDD QKQKKDETDKKVAIFFFHNSIPFSAAKSLYYQEMV+AIAEYG GYKAPSYEKLKSTLLDKVKGDIH+SY
Subjt:  IHHGSSGQNESNCPSTFPCLSPSAQPPIDDAQKQKKDETDKKVAIFFFHNSIPFSAAKSLYYQEMVDAIAEYGGGYKAPSYEKLKSTLLDKVKGDIHSSY

Query:  KKHRDEWKETGCTILCDSWSDGQTKSFLVISVTCSKGTLFLKSVDISGHEDDATYLSDLLETIILEVGVENVVQIITDATASYVYAGRLLMTKYTSLFWS
        KK+ DEWKETGCTILCDSWSDG+TKSFLVIS+TCSKG LFLKSVDISGHEDDATYLSDLLETI+LEVGVENVVQ+ITDATASYVYAGRLLMTKYTSLFWS
Subjt:  KKHRDEWKETGCTILCDSWSDGQTKSFLVISVTCSKGTLFLKSVDISGHEDDATYLSDLLETIILEVGVENVVQIITDATASYVYAGRLLMTKYTSLFWS

Query:  PCVSYCVNQMLEDISKIEWVSAVLEEAKIITRYIYSHASILNTMRKFTGGKELIRPRITRFVTNFLSLRSIVILEDNLKHMFAHSEWLSSIYSRRPDAQA
        PCVSYCVNQMLEDISKIEWV  VLEEAKIITRYIYSHASILNTMRKFT GKELIRPRITRFVTNFLSLRSIVI EDNLKHMFAHSEWLSSIYSRRPDAQA
Subjt:  PCVSYCVNQMLEDISKIEWVSAVLEEAKIITRYIYSHASILNTMRKFTGGKELIRPRITRFVTNFLSLRSIVILEDNLKHMFAHSEWLSSIYSRRPDAQA

Query:  IISLLYLDRFWKDAHEAINICEPLIRILRIVDGDMPAMGYIFEGIERAKVEIKTYYNGFEDKYMPIWETIDRRWNLQLHTTLHTAAAFLNPSVFYNPNFK
        IISLLYLDRFWKDA EA+NI EPLIRILRIVDGDMPAMGYIFEGIERAKVEIKTYYNG EDKY+PIWETIDRRWNLQLHTTLHTAAAFLNPSVFYNPNFK
Subjt:  IISLLYLDRFWKDAHEAINICEPLIRILRIVDGDMPAMGYIFEGIERAKVEIKTYYNGFEDKYMPIWETIDRRWNLQLHTTLHTAAAFLNPSVFYNPNFK

Query:  IDLRIRNGFQEAMLKMATTDKDKMEITREHPAYVNGQGALGTDFAILGRTINAPGDWWSGYGYEIPTLQRAAVRILSQPCSSYGCSGWNWSTFETLHSKK
        IDLRIRNGFQEAMLKMATTDKDKMEITREHPAYVNGQGALGTDFAILGRTINAPGDWWSGYGYEIPTLQRAA+RIL+QPCSSYGCS WNWSTFETLHSKK
Subjt:  IDLRIRNGFQEAMLKMATTDKDKMEITREHPAYVNGQGALGTDFAILGRTINAPGDWWSGYGYEIPTLQRAAVRILSQPCSSYGCSGWNWSTFETLHSKK

Query:  HSRAEQEKLTDLVFVQCNLWLQHVCLTRDSKYKPVVFDDVDVSLEWPSELECSAHVLDDSWLDNLPLEGRGSP
         SRAEQEKL DLVFVQCNLWLQH+CLTRD KYKPVVFDD+DVSLEWP+E E SAHVLDDSWLDNLPLE RGSP
Subjt:  HSRAEQEKLTDLVFVQCNLWLQHVCLTRDSKYKPVVFDDVDVSLEWPSELECSAHVLDDSWLDNLPLEGRGSP

TrEMBL top hitse value%identityAlignment
A0A0A0L2E4 BED-type domain-containing protein0.099.85Show/hide
Query:  MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPCTEVPTDVRDHIQGILSTPKKQKAPKKPKVDMETATNGQQHSSSASGG
        MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPCTEVPTDVRDHIQGILSTPKKQKAPKKPKVDMETATNGQQHSSSASGG
Subjt:  MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPCTEVPTDVRDHIQGILSTPKKQKAPKKPKVDMETATNGQQHSSSASGG

Query:  IHHGSSGQNESNCPSTFPCLSPSAQPPIDDAQKQKKDETDKKVAIFFFHNSIPFSAAKSLYYQEMVDAIAEYGGGYKAPSYEKLKSTLLDKVKGDIHSSY
        IHHGSSGQNESNCPST+PCLSPSAQPPIDDAQKQKKDETDKKVAIFFFHNSIPFSAAKSLYYQEMVDAIAEYGGGYKAPSYEKLKSTLLDKVKGDIHSSY
Subjt:  IHHGSSGQNESNCPSTFPCLSPSAQPPIDDAQKQKKDETDKKVAIFFFHNSIPFSAAKSLYYQEMVDAIAEYGGGYKAPSYEKLKSTLLDKVKGDIHSSY

Query:  KKHRDEWKETGCTILCDSWSDGQTKSFLVISVTCSKGTLFLKSVDISGHEDDATYLSDLLETIILEVGVENVVQIITDATASYVYAGRLLMTKYTSLFWS
        KKHRDEWKETGCTILCDSWSDGQTKSFLVISVTCSKGTLFLKSVDISGHEDDATYLSDLLETIILEVGVENVVQIITDATASYVYAGRLLMTKYTSLFWS
Subjt:  KKHRDEWKETGCTILCDSWSDGQTKSFLVISVTCSKGTLFLKSVDISGHEDDATYLSDLLETIILEVGVENVVQIITDATASYVYAGRLLMTKYTSLFWS

Query:  PCVSYCVNQMLEDISKIEWVSAVLEEAKIITRYIYSHASILNTMRKFTGGKELIRPRITRFVTNFLSLRSIVILEDNLKHMFAHSEWLSSIYSRRPDAQA
        PCVSYCVNQMLEDISKIEWVSAVLEEAKIITRYIYSHASILNTMRKFTGGKELIRPRITRFVTNFLSLRSIVILEDNLKHMFAHSEWLSSIYSRRPDAQA
Subjt:  PCVSYCVNQMLEDISKIEWVSAVLEEAKIITRYIYSHASILNTMRKFTGGKELIRPRITRFVTNFLSLRSIVILEDNLKHMFAHSEWLSSIYSRRPDAQA

Query:  IISLLYLDRFWKDAHEAINICEPLIRILRIVDGDMPAMGYIFEGIERAKVEIKTYYNGFEDKYMPIWETIDRRWNLQLHTTLHTAAAFLNPSVFYNPNFK
        IISLLYLDRFWKDAHEAINICEPLIRILRIVDGDMPAMGYIFEGIERAKVEIKTYYNGFEDKYMPIWETIDRRWNLQLHTTLHTAAAFLNPSVFYNPNFK
Subjt:  IISLLYLDRFWKDAHEAINICEPLIRILRIVDGDMPAMGYIFEGIERAKVEIKTYYNGFEDKYMPIWETIDRRWNLQLHTTLHTAAAFLNPSVFYNPNFK

Query:  IDLRIRNGFQEAMLKMATTDKDKMEITREHPAYVNGQGALGTDFAILGRTINAPGDWWSGYGYEIPTLQRAAVRILSQPCSSYGCSGWNWSTFETLHSKK
        IDLRIRNGFQEAMLKMATTDKDKMEITREHPAYVNGQGALGTDFAILGRTINAPGDWWSGYGYEIPTLQRAAVRILSQPCSSYGCSGWNWSTFETLHSKK
Subjt:  IDLRIRNGFQEAMLKMATTDKDKMEITREHPAYVNGQGALGTDFAILGRTINAPGDWWSGYGYEIPTLQRAAVRILSQPCSSYGCSGWNWSTFETLHSKK

Query:  HSRAEQEKLTDLVFVQCNLWLQHVCLTRDSKYKPVVFDDVDVSLEWPSELECSAHVLDDSWLDNLPLEGRGSP
        HSRAEQEKLTDLVFVQCNLWLQHVCLTRDSKYKPVVFDDVDVSLEWPSELECSAHVLDDSWLDNLPLEGRGSP
Subjt:  HSRAEQEKLTDLVFVQCNLWLQHVCLTRDSKYKPVVFDDVDVSLEWPSELECSAHVLDDSWLDNLPLEGRGSP

A0A1S3BLP8 uncharacterized protein LOC1034909270.097.77Show/hide
Query:  MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPCTEVPTDVRDHIQGILSTPKKQKAPKKPKVDMETATNGQQHSSSASGG
        MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPCTEVPTDVRDHIQGILSTPKKQKAPKKPKVDMETATNGQQHSSSASGG
Subjt:  MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPCTEVPTDVRDHIQGILSTPKKQKAPKKPKVDMETATNGQQHSSSASGG

Query:  IHHGSSGQNESNCPSTFPCLSPSAQPPIDDAQKQKKDETDKKVAIFFFHNSIPFSAAKSLYYQEMVDAIAEYGGGYKAPSYEKLKSTLLDKVKGDIHSSY
        IHHGSSGQNESNCPSTFPCLSPSAQPPIDDAQKQKKDETDKKVAIFFFHNSIPFSAAKSLYYQEMVDAIAEYGGGYKAPSYEKLKSTLLDKVKGDIHSSY
Subjt:  IHHGSSGQNESNCPSTFPCLSPSAQPPIDDAQKQKKDETDKKVAIFFFHNSIPFSAAKSLYYQEMVDAIAEYGGGYKAPSYEKLKSTLLDKVKGDIHSSY

Query:  KKHRDEWKETGCTILCDSWSDGQTKSFLVISVTCSKGTLFLKSVDISGHEDDATYLSDLLETIILEVGVENVVQIITDATASYVYAGRLLMTKYTSLFWS
        KK RDEWKETGCTILCDSWSDG+TKSFLVISVTCSKGTLFLKSVD SGHEDDATYLSDLLETI+LEVGVENVVQIITDATASYVYAGRLLMTKYTSLFWS
Subjt:  KKHRDEWKETGCTILCDSWSDGQTKSFLVISVTCSKGTLFLKSVDISGHEDDATYLSDLLETIILEVGVENVVQIITDATASYVYAGRLLMTKYTSLFWS

Query:  PCVSYCVNQMLEDISKIEWVSAVLEEAKIITRYIYSHASILNTMRKFTGGKELIRPRITRFVTNFLSLRSIVILEDNLKHMFAHSEWLSSIYSRRPDAQA
        PCVSYCVNQMLEDISKIEWVS VLEEAKIITRYIYSHASILNTMRKFTGGKELIRPRITRFVTNFLSLRSIVILE+NLKHMFAHSEWLSSIYSRRPDAQA
Subjt:  PCVSYCVNQMLEDISKIEWVSAVLEEAKIITRYIYSHASILNTMRKFTGGKELIRPRITRFVTNFLSLRSIVILEDNLKHMFAHSEWLSSIYSRRPDAQA

Query:  IISLLYLDRFWKDAHEAINICEPLIRILRIVDGDMPAMGYIFEGIERAKVEIKTYYNGFEDKYMPIWETIDRRWNLQLHTTLHTAAAFLNPSVFYNPNFK
        IISLLYLDRFWKDAHEAINICEPLIRILRIVDGDMPAMGYIFEGIERAKVEIKTYYNGFEDKYMPIWETIDRRWNLQLHTTLHTAAAFLNPSVFYNPNFK
Subjt:  IISLLYLDRFWKDAHEAINICEPLIRILRIVDGDMPAMGYIFEGIERAKVEIKTYYNGFEDKYMPIWETIDRRWNLQLHTTLHTAAAFLNPSVFYNPNFK

Query:  IDLRIRNGFQEAMLKMATTDKDKMEITREHPAYVNGQGALGTDFAILGRTINAPGDWWSGYGYEIPTLQRAAVRILSQPCSSYGCSGWNWSTFETLHSKK
        IDLRIRNGFQEAMLKMATTDKDKMEITREHPAYVNGQGALGTDFAILGRTIN+PGDWWSGYGYEIPTLQRAAVRILSQPCSSYGCS WNWSTFETLHSKK
Subjt:  IDLRIRNGFQEAMLKMATTDKDKMEITREHPAYVNGQGALGTDFAILGRTINAPGDWWSGYGYEIPTLQRAAVRILSQPCSSYGCSGWNWSTFETLHSKK

Query:  HSRAEQEKLTDLVFVQCNLWLQHVCLTRDSKYKPVVFDDVDVSLEWPSELECSAHVLDDSWLDN-LPLEGRGSP
         SRAEQEKLTDLVFVQCNLWLQH+C TRDSKYKP+VFDD+DVSLEWPSELECSAHVLDDSWLDN LPLE RGSP
Subjt:  HSRAEQEKLTDLVFVQCNLWLQHVCLTRDSKYKPVVFDDVDVSLEWPSELECSAHVLDDSWLDN-LPLEGRGSP

A0A5D3D7G5 HAT transposon superfamily0.097.77Show/hide
Query:  MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPCTEVPTDVRDHIQGILSTPKKQKAPKKPKVDMETATNGQQHSSSASGG
        MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPCTEVPTDVRDHIQGILSTPKKQKAPKKPKVDMETATNGQQHSSSASGG
Subjt:  MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPCTEVPTDVRDHIQGILSTPKKQKAPKKPKVDMETATNGQQHSSSASGG

Query:  IHHGSSGQNESNCPSTFPCLSPSAQPPIDDAQKQKKDETDKKVAIFFFHNSIPFSAAKSLYYQEMVDAIAEYGGGYKAPSYEKLKSTLLDKVKGDIHSSY
        IHHGSSGQNESNCPSTFPCLSPSAQPPIDDAQKQKKDETDKKVAIFFFHNSIPFSAAKSLYYQEMVDAIAEYGGGYKAPSYEKLKSTLLDKVKGDIHSSY
Subjt:  IHHGSSGQNESNCPSTFPCLSPSAQPPIDDAQKQKKDETDKKVAIFFFHNSIPFSAAKSLYYQEMVDAIAEYGGGYKAPSYEKLKSTLLDKVKGDIHSSY

Query:  KKHRDEWKETGCTILCDSWSDGQTKSFLVISVTCSKGTLFLKSVDISGHEDDATYLSDLLETIILEVGVENVVQIITDATASYVYAGRLLMTKYTSLFWS
        KK RDEWKETGCTILCDSWSDG+TKSFLVISVTCSKGTLFLKSVD SGHEDDATYLSDLLETI+LEVGVENVVQIITDATASYVYAGRLLMTKYTSLFWS
Subjt:  KKHRDEWKETGCTILCDSWSDGQTKSFLVISVTCSKGTLFLKSVDISGHEDDATYLSDLLETIILEVGVENVVQIITDATASYVYAGRLLMTKYTSLFWS

Query:  PCVSYCVNQMLEDISKIEWVSAVLEEAKIITRYIYSHASILNTMRKFTGGKELIRPRITRFVTNFLSLRSIVILEDNLKHMFAHSEWLSSIYSRRPDAQA
        PCVSYCVNQMLEDISKIEWVS VLEEAKIITRYIYSHASILNTMRKFTGGKELIRPRITRFVTNFLSLRSIVILE+NLKHMFAHSEWLSSIYSRRPDAQA
Subjt:  PCVSYCVNQMLEDISKIEWVSAVLEEAKIITRYIYSHASILNTMRKFTGGKELIRPRITRFVTNFLSLRSIVILEDNLKHMFAHSEWLSSIYSRRPDAQA

Query:  IISLLYLDRFWKDAHEAINICEPLIRILRIVDGDMPAMGYIFEGIERAKVEIKTYYNGFEDKYMPIWETIDRRWNLQLHTTLHTAAAFLNPSVFYNPNFK
        IISLLYLDRFWKDAHEAINICEPLIRILRIVDGDMPAMGYIFEGIERAKVEIKTYYNGFEDKYMPIWETIDRRWNLQLHTTLHTAAAFLNPSVFYNPNFK
Subjt:  IISLLYLDRFWKDAHEAINICEPLIRILRIVDGDMPAMGYIFEGIERAKVEIKTYYNGFEDKYMPIWETIDRRWNLQLHTTLHTAAAFLNPSVFYNPNFK

Query:  IDLRIRNGFQEAMLKMATTDKDKMEITREHPAYVNGQGALGTDFAILGRTINAPGDWWSGYGYEIPTLQRAAVRILSQPCSSYGCSGWNWSTFETLHSKK
        IDLRIRNGFQEAMLKMATTDKDKMEITREHPAYVNGQGALGTDFAILGRTIN+PGDWWSGYGYEIPTLQRAAVRILSQPCSSYGCS WNWSTFETLHSKK
Subjt:  IDLRIRNGFQEAMLKMATTDKDKMEITREHPAYVNGQGALGTDFAILGRTINAPGDWWSGYGYEIPTLQRAAVRILSQPCSSYGCSGWNWSTFETLHSKK

Query:  HSRAEQEKLTDLVFVQCNLWLQHVCLTRDSKYKPVVFDDVDVSLEWPSELECSAHVLDDSWLDN-LPLEGRGSP
         SRAEQEKLTDLVFVQCNLWLQH+C TRDSKYKP+VFDD+DVSLEWPSELECSAHVLDDSWLDN LPLE RGSP
Subjt:  HSRAEQEKLTDLVFVQCNLWLQHVCLTRDSKYKPVVFDDVDVSLEWPSELECSAHVLDDSWLDN-LPLEGRGSP

A0A6J1E9N1 uncharacterized protein LOC1114311320.090.94Show/hide
Query:  MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPCTEVPTDVRDHIQGILSTPKKQKAPKKPKVDMETATNGQQHSSSASGG
        MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPC EVP DVRDHIQGILSTPKKQ+APKKPKVDMETATNGQQHSSSASGG
Subjt:  MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPCTEVPTDVRDHIQGILSTPKKQKAPKKPKVDMETATNGQQHSSSASGG

Query:  IHHGSSGQNESNCPSTFPCLSPSAQPPIDDAQKQKKDETDKKVAIFFFHNSIPFSAAKSLYYQEMVDAIAEYGGGYKAPSYEKLKSTLLDKVKGDIHSSY
        IHHGSSGQNESNCPSTFP  SPSAQPPIDDAQKQKKDETDKKVA+FFFHNSIPFSAAKSLYYQEMV+AIAEYG GY+APSYEKLKSTLL KVKGDI +SY
Subjt:  IHHGSSGQNESNCPSTFPCLSPSAQPPIDDAQKQKKDETDKKVAIFFFHNSIPFSAAKSLYYQEMVDAIAEYGGGYKAPSYEKLKSTLLDKVKGDIHSSY

Query:  KKHRDEWKETGCTILCDSWSDGQTKSFLVISVTCSKGTLFLKSVDISGHEDDATYLSDLLETIILEVGVENVVQIITDATASYVYAGRLLMTKYTSLFWS
        KK+RDEWKETGCTILC+SWSDG+TKSFL+IS+TCSKGTLFLKSV+ISG EDDATYLSDLLETI+LEVGVENVVQ+ITDATASYVYAGRLLMTKYTSLFWS
Subjt:  KKHRDEWKETGCTILCDSWSDGQTKSFLVISVTCSKGTLFLKSVDISGHEDDATYLSDLLETIILEVGVENVVQIITDATASYVYAGRLLMTKYTSLFWS

Query:  PCVSYCVNQMLEDISKIEWVSAVLEEAKIITRYIYSHASILNTMRKFTGGKELIRPRITRFVTNFLSLRSIVILEDNLKHMFAHSEWLSSIYSRRPDAQA
        PCVSYCVNQMLED+SKIEWV  VL+EAKII RY+YSHA ILNTMRKFT GKELIRPRITRFVTNFLSLRSIV LED LKHMFAHSEW SSIYSRRPDAQA
Subjt:  PCVSYCVNQMLEDISKIEWVSAVLEEAKIITRYIYSHASILNTMRKFTGGKELIRPRITRFVTNFLSLRSIVILEDNLKHMFAHSEWLSSIYSRRPDAQA

Query:  IISLLYLDRFWKDAHEAINICEPLIRILRIVDGDMPAMGYIFEGIERAKVEIKTYYNGFEDKYMPIWETIDRRWNLQLHTTLHTAAAFLNPSVFYNPNFK
        I+S LYLDRFWKDA EA+NI EPLIRILR+VDGDMPAMGYI+EGIERAKVE+K YYNG EDKYMPIW+TIDRRWNLQLHTTLHTAAAFLNPS+FYNPNFK
Subjt:  IISLLYLDRFWKDAHEAINICEPLIRILRIVDGDMPAMGYIFEGIERAKVEIKTYYNGFEDKYMPIWETIDRRWNLQLHTTLHTAAAFLNPSVFYNPNFK

Query:  IDLRIRNGFQEAMLKMATTDKDKMEITREHPAYVNGQGALGTDFAILGRTINAPGDWWSGYGYEIPTLQRAAVRILSQPCSSYGCSGWNWSTFETLHSKK
        IDLRIRNGFQEAMLKMATTDKDKMEITREHPAYVN QGALGTDFAILGRTIN PGDWWSGYGYEIPTLQRAA+RILSQPCSSYGCS WNWSTFETLHSKK
Subjt:  IDLRIRNGFQEAMLKMATTDKDKMEITREHPAYVNGQGALGTDFAILGRTINAPGDWWSGYGYEIPTLQRAAVRILSQPCSSYGCSGWNWSTFETLHSKK

Query:  HSRAEQEKLTDLVFVQCNLWLQHVCLTRDSKYKPVVFDDVDVSLEWPSELECSAHVLDDSWLDNLPLEGRGSP
         S  EQEKL DLVFVQCNLWLQH+  TRD KYKPVVFDD+DVSLEWP+ELE SAHVLDDSWLDNLPLE  GSP
Subjt:  HSRAEQEKLTDLVFVQCNLWLQHVCLTRDSKYKPVVFDDVDVSLEWPSELECSAHVLDDSWLDNLPLEGRGSP

A0A6J1KZI0 uncharacterized protein LOC1115002590.090.34Show/hide
Query:  MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPCTEVPTDVRDHIQGILSTPKKQKAPKKPKVDMETATNGQQHSSSASGG
        MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPC EVP DVRD IQGILSTPKKQ+APKKPKVDMETATNGQQHSSSASGG
Subjt:  MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPCTEVPTDVRDHIQGILSTPKKQKAPKKPKVDMETATNGQQHSSSASGG

Query:  IHHGSSGQNESNCPSTFPCLSPSAQPPIDDAQKQKKDETDKKVAIFFFHNSIPFSAAKSLYYQEMVDAIAEYGGGYKAPSYEKLKSTLLDKVKGDIHSSY
        IHHGSSGQNESNCPST PC SPSAQP IDDAQKQKKDETDKKVA+FFFHNSIPFSAAKSLYYQEMV+AIAEYG GY+APSY+KLKSTLLDKVKGDI +SY
Subjt:  IHHGSSGQNESNCPSTFPCLSPSAQPPIDDAQKQKKDETDKKVAIFFFHNSIPFSAAKSLYYQEMVDAIAEYGGGYKAPSYEKLKSTLLDKVKGDIHSSY

Query:  KKHRDEWKETGCTILCDSWSDGQTKSFLVISVTCSKGTLFLKSVDISGHEDDATYLSDLLETIILEVGVENVVQIITDATASYVYAGRLLMTKYTSLFWS
        KK+RDEWKETGCTILC+SWSDG+TKSFL+IS+TCSKGTLFLKSV+ISG EDDATYLSDLLETI+LEVGVENVVQ+ITDATASYVYAGRLLMTKYTSLFWS
Subjt:  KKHRDEWKETGCTILCDSWSDGQTKSFLVISVTCSKGTLFLKSVDISGHEDDATYLSDLLETIILEVGVENVVQIITDATASYVYAGRLLMTKYTSLFWS

Query:  PCVSYCVNQMLEDISKIEWVSAVLEEAKIITRYIYSHASILNTMRKFTGGKELIRPRITRFVTNFLSLRSIVILEDNLKHMFAHSEWLSSIYSRRPDAQA
        PCVSYCVNQMLED+SKIEWV  VL+EAKII RY+YSHA IL+TMRKFT GKELIRPRITRFVTNFLSLRSIV LED LKHMFAHSEW SSIYSRRPDAQA
Subjt:  PCVSYCVNQMLEDISKIEWVSAVLEEAKIITRYIYSHASILNTMRKFTGGKELIRPRITRFVTNFLSLRSIVILEDNLKHMFAHSEWLSSIYSRRPDAQA

Query:  IISLLYLDRFWKDAHEAINICEPLIRILRIVDGDMPAMGYIFEGIERAKVEIKTYYNGFEDKYMPIWETIDRRWNLQLHTTLHTAAAFLNPSVFYNPNFK
        I+S LYLDRFWKDA EA+NI EPLIRILR+VDGDMPAMGYI+EGIERAKVE+K YYNG EDKYMPIW+TIDRRWNLQLHTTLHTAAAFLNPS+FYNPNFK
Subjt:  IISLLYLDRFWKDAHEAINICEPLIRILRIVDGDMPAMGYIFEGIERAKVEIKTYYNGFEDKYMPIWETIDRRWNLQLHTTLHTAAAFLNPSVFYNPNFK

Query:  IDLRIRNGFQEAMLKMATTDKDKMEITREHPAYVNGQGALGTDFAILGRTINAPGDWWSGYGYEIPTLQRAAVRILSQPCSSYGCSGWNWSTFETLHSKK
        IDLRIRNGFQEAMLKMATTDKDKMEITREHPAYVN QGALGTDFAILGRTIN PGDWWSGYGYEIPTLQR A+RILSQPCSSYGCS WNWSTFETLHSKK
Subjt:  IDLRIRNGFQEAMLKMATTDKDKMEITREHPAYVNGQGALGTDFAILGRTINAPGDWWSGYGYEIPTLQRAAVRILSQPCSSYGCSGWNWSTFETLHSKK

Query:  HSRAEQEKLTDLVFVQCNLWLQHVCLTRDSKYKPVVFDDVDVSLEWPSELECSAHVLDDSWLDNLPLEGRGSP
         SR EQEKL DLVFVQCNLWLQH+  TRD KYKPVVFDD+DVSLEWP+ELE SA VLDDSWLDNLPLE  GSP
Subjt:  HSRAEQEKLTDLVFVQCNLWLQHVCLTRDSKYKPVVFDDVDVSLEWPSELECSAHVLDDSWLDNLPLEGRGSP

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G79740.1 hAT transposon superfamily5.1e-11834.19Show/hide
Query:  MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPCTEVPTDVRDHIQGILSTPKKQKAPKKPKVDMETATNGQQHSSSASGG
        MVR +D CWE+   +D    KV+C +C R  +GG+ R+K HL+++ +K + PC +V  DV D ++ ILS         K K     +      +S     
Subjt:  MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPCTEVPTDVRDHIQGILSTPKKQKAPKKPKVDMETATNGQQHSSSASGG

Query:  IHHGSSGQNESNCPSTFPCLSPSAQPPIDDAQKQKKDETDKKVAIFFFHNSIPFSAAKSLYYQEMVDAIAEYGGGYKAPSYEKLKSTLLDKVKGDIHSSY
                        FP   P+AQ           D  ++ +++FFF N I F+ A+S  Y  M+DA+A+ G G+ APS    K+  LD+VK DI    
Subjt:  IHHGSSGQNESNCPSTFPCLSPSAQPPIDDAQKQKKDETDKKVAIFFFHNSIPFSAAKSLYYQEMVDAIAEYGGGYKAPSYEKLKSTLLDKVKGDIHSSY

Query:  KKHRDEWKETGCTILCDSWSDGQTKSFLVISVTCSKGTLFLKSVDISGHEDDATYLSDLLETIILEVGVENVVQIITDATASYVYAGRLLMTKYTSLFWS
        K    EW  TGCTI+ ++W+D ++++ +  SV+      F KSVD S +  ++  L+DL +++I ++G E++VQII D +  Y      L+  Y ++F S
Subjt:  KKHRDEWKETGCTILCDSWSDGQTKSFLVISVTCSKGTLFLKSVDISGHEDDATYLSDLLETIILEVGVENVVQIITDATASYVYAGRLLMTKYTSLFWS

Query:  PCVSYCVNQMLEDISKIEWVSAVLEEAKIITRYIYSHASILNTMRKFTGGKELIRPRITRFVTNFLSLRSIVILEDNLKHMFAHSEWLSSIYSRRPDAQA
        PC S C+N +LE+ SK++WV+  + +A++I++++Y+++ +L+ +RK TGG+++IR  +TR V+NFLSL+S++  +  LKHMF   E+ ++  + +P + +
Subjt:  PCVSYCVNQMLEDISKIEWVSAVLEEAKIITRYIYSHASILNTMRKFTGGKELIRPRITRFVTNFLSLRSIVILEDNLKHMFAHSEWLSSIYSRRPDAQA

Query:  IISLLYLDRFWKDAHEAINICEPLIRILRIVDGDMPAMGYIFEGIERAKVEIKTYYNGFEDKYMPIWETIDRRWNLQLHTTLHTAAAFLNPSVFYNPNFK
         +++L  + FW+   E++ I EP++++LR V    PA+G I+E + +AK  I+TYY   E+K+    + +D  W   LH+ LH AAAFLNPS+ YNP  K
Subjt:  IISLLYLDRFWKDAHEAINICEPLIRILRIVDGDMPAMGYIFEGIERAKVEIKTYYNGFEDKYMPIWETIDRRWNLQLHTTLHTAAAFLNPSVFYNPNFK

Query:  IDLRIRNGFQEAMLKMATTDKDKMEITREHPAYVNGQGALGTDFAILGRTINAPGDWWSGYGYEIPTLQRAAVRILSQPCSSYGCSGWNWSTFETLHSKK
            ++  F + + K+  T   + +IT +   +   +G  G + A+  R   +PG WW  +G   P LQR A+RILSQ CS Y      WSTF+ +H ++
Subjt:  IDLRIRNGFQEAMLKMATTDKDKMEITREHPAYVNGQGALGTDFAILGRTINAPGDWWSGYGYEIPTLQRAAVRILSQPCSSYGCSGWNWSTFETLHSKK

Query:  HSRAEQEKLTDLVFVQCNLWL-QHVCLTRDSKYKPVVFDDVDVSLEWPSELECSAHVLDDSWLD
         ++ ++E L  L +V  NL L + + L  D    P+  +D+D+  EW  E E  +      WLD
Subjt:  HSRAEQEKLTDLVFVQCNLWL-QHVCLTRDSKYKPVVFDDVDVSLEWPSELECSAHVLDDSWLD

AT3G17450.1 hAT dimerisation domain-containing protein4.4e-9330.3Show/hide
Query:  DACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPCTEVPTDVRDHIQGILSTPKKQKAPKKPKVDMETAT-----------------
        D  WEH +  D  ++KV+CNYC +  SGG+ R K HLA+I   ++ PC   P +V   I+  +   +  K   +P  +M   T                 
Subjt:  DACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPCTEVPTDVRDHIQGILSTPKKQKAPKKPKVDMETAT-----------------

Query:  -------------NGQ--QHSSSASGGIHHGSSGQNESNCPSTFPCLSPSAQ------PPIDDAQKQKKDETDKKVAIFFFHNSIPFSAAKSLYYQEMVD
                     NG+  +    +    +  S  + ++      P  SPS+           +    +KD T   ++ F  H  +P  AA SLY+Q+M++
Subjt:  -------------NGQ--QHSSSASGGIHHGSSGQNESNCPSTFPCLSPSAQ------PPIDDAQKQKKDETDKKVAIFFFHNSIPFSAAKSLYYQEMVD

Query:  AIAEYGGGYKAPSYEKLKSTLLDKVKGDIHSSYKKHRDEWKETGCTILCDSWSDGQTKSFLVISVTCSKGTLFLKSVDISGHEDDATYLSDLLETIILEV
         I  YG G+  PS +     LL +    I S  +++R  W  TGC+I+ D+W++ + K  +   V+C +G  F  S+D +   +DA  L   L+ ++ ++
Subjt:  AIAEYGGGYKAPSYEKLKSTLLDKVKGDIHSSYKKHRDEWKETGCTILCDSWSDGQTKSFLVISVTCSKGTLFLKSVDISGHEDDATYLSDLLETIILEV

Query:  GVENVVQIITDATASYVYAGRLLMTKYTSLFWSPCVSYCVNQMLEDISKIEWVSAVLEEAKIITRYIYSHASILNTMR-KFTGGKELIRPRITRFVTNFL
        G ENVVQ+IT  TA +  AG+LL  K  +L+W+PC  +C   +LED SK+E+VS  LE+A+ ITR+IY+   +LN M+ +FT G +L+RP + R  + F 
Subjt:  GVENVVQIITDATASYVYAGRLLMTKYTSLFWSPCVSYCVNQMLEDISKIEWVSAVLEEAKIITRYIYSHASILNTMR-KFTGGKELIRPRITRFVTNFL

Query:  SLRSIVILEDNLKHMFAHSEW-LSSIYSRRPDAQAIISLLYLDRFWKDAHEAINICEPLIRILRIVD--GDMPAMGYIFEGIERAKVEIKTYYNGFEDKY
        +L+S++  + +L+ +F    W LS   ++  + + +  ++    FWK     +   +P+++++ +++  GD  +M Y +  +  AK+ IK+ ++    KY
Subjt:  SLRSIVILEDNLKHMFAHSEW-LSSIYSRRPDAQAIISLLYLDRFWKDAHEAINICEPLIRILRIVD--GDMPAMGYIFEGIERAKVEIKTYYNGFEDKY

Query:  MPIWETIDRRWNLQLHTTLHTAAAFLNPSVFYNPNFKIDLRIRNGFQEAMLKMATTDKDKMEITREHPAYVNGQGALGTDFAILGRTINAPGDWWSGYGY
         P W  I+ RWN   H  L+ AA F NP+  Y P+F     +  G  E ++++   +  ++    + P Y   +   GTD AI  RT   P  WW  +G 
Subjt:  MPIWETIDRRWNLQLHTTLHTAAAFLNPSVFYNPNFKIDLRIRNGFQEAMLKMATTDKDKMEITREHPAYVNGQGALGTDFAILGRTINAPGDWWSGYGY

Query:  EIPTLQRAAVRILSQPCSSYGCSGWNWSTFETLHSKKHSRAEQEKLTDLVFVQCNLWLQHVCLTRDSKYK
            LQR AVRILS  CSS GC    WS ++ ++S+  S+  ++   DL +V  NL L+   L +   Y+
Subjt:  EIPTLQRAAVRILSQPCSSYGCSGWNWSTFETLHSKKHSRAEQEKLTDLVFVQCNLWLQHVCLTRDSKYK

AT3G22220.1 hAT transposon superfamily5.9e-9030.91Show/hide
Query:  RDACWEHC-VLVDATRQKVRCNYCQREF-SGGVYRMKFHLAQIKNKDIVPCTEVPTDVRDHIQGIL-STPKKQKAPKK-----------PKVDMET----
        +D+ W+HC V     R ++RC YC++ F  GG+ R+K HLA  K +  + C +VP +VR  +Q  +  T ++Q+  +K           P  ++ET    
Subjt:  RDACWEHC-VLVDATRQKVRCNYCQREF-SGGVYRMKFHLAQIKNKDIVPCTEVPTDVRDHIQGIL-STPKKQKAPKK-----------PKVDMET----

Query:  ---ATNG-QQHSSSASGGIHHGSSGQN--------------------ESNCPSTFPCLSPSAQPPIDDAQKQKKDETDKKVAIFFFHNSIPFSAAKSLYY
             NG +  SS    G   G + Q                     + +  +  P    S +  +    K+++      +  F F     F AA S+  
Subjt:  ---ATNG-QQHSSSASGGIHHGSSGQN--------------------ESNCPSTFPCLSPSAQPPIDDAQKQKKDETDKKVAIFFFHNSIPFSAAKSLYY

Query:  QEMVDAIAEYGGGYKAPSYEKLKSTLLDKVKGDIHSSYKKHRDEWKETGCTILCDSWSDGQTKSFLVISVTCSKGTLFLKSVDISGHEDDATYLSDLLET
        Q  +DAI   G G   P++E L+  +L     ++     + +  WK TGC++L    +  +    L   V C +  +FLKSVD S   D    L +LL+ 
Subjt:  QEMVDAIAEYGGGYKAPSYEKLKSTLLDKVKGDIHSSYKKHRDEWKETGCTILCDSWSDGQTKSFLVISVTCSKGTLFLKSVDISGHEDDATYLSDLLET

Query:  IILEVGVENVVQIITDATASYVYAGRLLMTKYTSLFWSPCVSYCVNQMLEDISKIEWVSAVLEEAKIITRYIYSHASILNTMRKFTGGKELIRPRITRFV
        ++ E+G  NVVQ+IT     Y  AG+ LM  Y SL+W PC ++C+++MLE+  K++W+  ++E+A+ +TR IY+H+ +LN MRKFT G ++++P  T   
Subjt:  IILEVGVENVVQIITDATASYVYAGRLLMTKYTSLFWSPCVSYCVNQMLEDISKIEWVSAVLEEAKIITRYIYSHASILNTMRKFTGGKELIRPRITRFV

Query:  TNFLSLRSIVILEDNLKHMFAHSEWLSSIYSRRPDAQAIISLLYLDRFWKDAHEAINICEPLIRILRIVDGD-MPAMGYIFEGIERAKVEIKTYYNGFED
        TNF ++  I  L+  L+ M   SEW    YS+     A+   +  + FWK    A +I  P++R+LRIV  +  PAMGY++  + RAK  IKT      +
Subjt:  TNFLSLRSIVILEDNLKHMFAHSEWLSSIYSRRPDAQAIISLLYLDRFWKDAHEAINICEPLIRILRIVDGD-MPAMGYIFEGIERAKVEIKTYYNGFED

Query:  KYMPIWETIDRRWNLQLHTTLHTAAAFLNPSVFYNPNFKIDLRIRNGFQEAMLKMATTDKDKMEITREHPAYVNGQGALGTDFAILGRTINAPGDWWSGY
        +Y+  W+ IDR W   L   L+ A  +LNP  FY+ + ++   I     + + K+      +  + ++  +Y N  G  G + AI  R    P +WWS Y
Subjt:  KYMPIWETIDRRWNLQLHTTLHTAAAFLNPSVFYNPNFKIDLRIRNGFQEAMLKMATTDKDKMEITREHPAYVNGQGALGTDFAILGRTINAPGDWWSGY

Query:  GYEIPTLQRAAVRILSQPCSSYGCSGWNWSTFETLHSKKHSRAEQEKLTDLVFVQCNLWLQHVC--LTRDSKYKPVVFDDVDVSLEWPS
        G     L R A+RILSQ CSS   S  N ++   ++  K+S  E+++L DLVFVQ N+ L+ +    + D    P+   +++V  +W S
Subjt:  GYEIPTLQRAAVRILSQPCSSYGCSGWNWSTFETLHSKKHSRAEQEKLTDLVFVQCNLWLQHVC--LTRDSKYKPVVFDDVDVSLEWPS

AT4G15020.1 hAT transposon superfamily2.5e-9632.66Show/hide
Query:  RDACWEHCVLVD-ATRQKVRCNYCQREF-SGGVYRMKFHLAQIKNKDIVPCTEVPTDVRDHIQGILS-----TPKKQKAPKKP-----------------
        +D  W+HC +     R ++RC YC++ F  GG+ R+K HLA  K +  + C +VP DVR  +Q  +        K+ K+  +P                 
Subjt:  RDACWEHCVLVD-ATRQKVRCNYCQREF-SGGVYRMKFHLAQIKNKDIVPCTEVPTDVRDHIQGILS-----TPKKQKAPKKP-----------------

Query:  ---------------KVDMETATNG---QQHSSSASGGIHHGSSGQNES----NCPSTFPCLSPSAQPPIDDAQKQKKDETDKKVAIFFFHNSIPFSAAK
                        V  E+  +G   Q+   S      +GS+  N      +  +  P    S +  +  + + +++     +  F F     F A  
Subjt:  ---------------KVDMETATNG---QQHSSSASGGIHHGSSGQNES----NCPSTFPCLSPSAQPPIDDAQKQKKDETDKKVAIFFFHNSIPFSAAK

Query:  SLYYQEMVDAIAEYGGGYKAPSYEKLKSTLLDKVKGDIHSSYKKHRDEWKETGCTILCDSWSDGQTKSFLVISVTCSKGTLFLKSVDISGHEDDATYLSD
        S+ +Q M+DAIA  G G  AP+++ L+  +L     ++     + +  WK TGC+IL +  +  +    L   V C +  +FLKSVD S     A  L +
Subjt:  SLYYQEMVDAIAEYGGGYKAPSYEKLKSTLLDKVKGDIHSSYKKHRDEWKETGCTILCDSWSDGQTKSFLVISVTCSKGTLFLKSVDISGHEDDATYLSD

Query:  LLETIILEVGVENVVQIITDATASYVYAGRLLMTKYTSLFWSPCVSYCVNQMLEDISKIEWVSAVLEEAKIITRYIYSHASILNTMRKFTGGKELIRPRI
        LL  ++ EVG  NVVQ+IT     YV AG+ LM  Y SL+W PC ++C++QMLE+  K+ W+S  +E+A+ ITR++Y+H+ +LN M KFT G +++ P  
Subjt:  LLETIILEVGVENVVQIITDATASYVYAGRLLMTKYTSLFWSPCVSYCVNQMLEDISKIEWVSAVLEEAKIITRYIYSHASILNTMRKFTGGKELIRPRI

Query:  TRFVTNFLSLRSIVILEDNLKHMFAHSEWLSSIYSRRPDAQAIISLLYLDRFWKDAHEAINICEPLIRILRIVDGD-MPAMGYIFEGIERAKVEIKTYYN
        +   TNF +L  I  L+ NL+ M   +EW    YS  P    +++ L  + FWK      ++  PL+R LRIV  +  PAMGY++  + RAK  IKT+  
Subjt:  TRFVTNFLSLRSIVILEDNLKHMFAHSEWLSSIYSRRPDAQAIISLLYLDRFWKDAHEAINICEPLIRILRIVDGD-MPAMGYIFEGIERAKVEIKTYYN

Query:  GFEDKYMPIWETIDRRWNLQLHTTLHTAAAFLNPSVFYNPNFKIDLRIRNGFQEAMLKMATTDKDKMEITREHPAYVNGQGALGTDFAILGRTINAPGDW
          ED Y+  W+ IDR W  Q H  L  A  FLNP +FYN N +I   +     + + ++   DK + +I +E  +Y    G  G + AI  R    P +W
Subjt:  GFEDKYMPIWETIDRRWNLQLHTTLHTAAAFLNPSVFYNPNFKIDLRIRNGFQEAMLKMATTDKDKMEITREHPAYVNGQGALGTDFAILGRTINAPGDW

Query:  WSGYGYEIPTLQRAAVRILSQPCSSYGCSGWNWSTFETLHSKKHSRAEQEKLTDLVFVQCNLWLQHVCL-TRDSKYKPVVFDDVDVSLEWPS
        WS YG     L R A+RILSQ CSS      N    E ++  K+S  EQ++L+DLVFVQ N+ L+ +   + D    P+  + +DV  EW S
Subjt:  WSGYGYEIPTLQRAAVRILSQPCSSYGCSGWNWSTFETLHSKKHSRAEQEKLTDLVFVQCNLWLQHVCL-TRDSKYKPVVFDDVDVSLEWPS

AT4G15020.2 hAT transposon superfamily2.5e-9632.66Show/hide
Query:  RDACWEHCVLVD-ATRQKVRCNYCQREF-SGGVYRMKFHLAQIKNKDIVPCTEVPTDVRDHIQGILS-----TPKKQKAPKKP-----------------
        +D  W+HC +     R ++RC YC++ F  GG+ R+K HLA  K +  + C +VP DVR  +Q  +        K+ K+  +P                 
Subjt:  RDACWEHCVLVD-ATRQKVRCNYCQREF-SGGVYRMKFHLAQIKNKDIVPCTEVPTDVRDHIQGILS-----TPKKQKAPKKP-----------------

Query:  ---------------KVDMETATNG---QQHSSSASGGIHHGSSGQNES----NCPSTFPCLSPSAQPPIDDAQKQKKDETDKKVAIFFFHNSIPFSAAK
                        V  E+  +G   Q+   S      +GS+  N      +  +  P    S +  +  + + +++     +  F F     F A  
Subjt:  ---------------KVDMETATNG---QQHSSSASGGIHHGSSGQNES----NCPSTFPCLSPSAQPPIDDAQKQKKDETDKKVAIFFFHNSIPFSAAK

Query:  SLYYQEMVDAIAEYGGGYKAPSYEKLKSTLLDKVKGDIHSSYKKHRDEWKETGCTILCDSWSDGQTKSFLVISVTCSKGTLFLKSVDISGHEDDATYLSD
        S+ +Q M+DAIA  G G  AP+++ L+  +L     ++     + +  WK TGC+IL +  +  +    L   V C +  +FLKSVD S     A  L +
Subjt:  SLYYQEMVDAIAEYGGGYKAPSYEKLKSTLLDKVKGDIHSSYKKHRDEWKETGCTILCDSWSDGQTKSFLVISVTCSKGTLFLKSVDISGHEDDATYLSD

Query:  LLETIILEVGVENVVQIITDATASYVYAGRLLMTKYTSLFWSPCVSYCVNQMLEDISKIEWVSAVLEEAKIITRYIYSHASILNTMRKFTGGKELIRPRI
        LL  ++ EVG  NVVQ+IT     YV AG+ LM  Y SL+W PC ++C++QMLE+  K+ W+S  +E+A+ ITR++Y+H+ +LN M KFT G +++ P  
Subjt:  LLETIILEVGVENVVQIITDATASYVYAGRLLMTKYTSLFWSPCVSYCVNQMLEDISKIEWVSAVLEEAKIITRYIYSHASILNTMRKFTGGKELIRPRI

Query:  TRFVTNFLSLRSIVILEDNLKHMFAHSEWLSSIYSRRPDAQAIISLLYLDRFWKDAHEAINICEPLIRILRIVDGD-MPAMGYIFEGIERAKVEIKTYYN
        +   TNF +L  I  L+ NL+ M   +EW    YS  P    +++ L  + FWK      ++  PL+R LRIV  +  PAMGY++  + RAK  IKT+  
Subjt:  TRFVTNFLSLRSIVILEDNLKHMFAHSEWLSSIYSRRPDAQAIISLLYLDRFWKDAHEAINICEPLIRILRIVDGD-MPAMGYIFEGIERAKVEIKTYYN

Query:  GFEDKYMPIWETIDRRWNLQLHTTLHTAAAFLNPSVFYNPNFKIDLRIRNGFQEAMLKMATTDKDKMEITREHPAYVNGQGALGTDFAILGRTINAPGDW
          ED Y+  W+ IDR W  Q H  L  A  FLNP +FYN N +I   +     + + ++   DK + +I +E  +Y    G  G + AI  R    P +W
Subjt:  GFEDKYMPIWETIDRRWNLQLHTTLHTAAAFLNPSVFYNPNFKIDLRIRNGFQEAMLKMATTDKDKMEITREHPAYVNGQGALGTDFAILGRTINAPGDW

Query:  WSGYGYEIPTLQRAAVRILSQPCSSYGCSGWNWSTFETLHSKKHSRAEQEKLTDLVFVQCNLWLQHVCL-TRDSKYKPVVFDDVDVSLEWPS
        WS YG     L R A+RILSQ CSS      N    E ++  K+S  EQ++L+DLVFVQ N+ L+ +   + D    P+  + +DV  EW S
Subjt:  WSGYGYEIPTLQRAAVRILSQPCSSYGCSGWNWSTFETLHSKKHSRAEQEKLTDLVFVQCNLWLQHVCL-TRDSKYKPVVFDDVDVSLEWPS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
TTGCCGTTGTATTTCAATTACTCGGCTCCTTTCCCTTTCCTCCTATCGGCCGTCATTTTCTTCCCGGAGTCCCGGCGGCCGGAATCTCTTCCCTCCAACTTCAGACCTTC
CGATGTTACTAACCTTCTATTTCTGATCTTCTCTTTCTCACTAATTCTATCTTCTCAGACAATGGTTCGAGGAAGGGATGCTTGTTGGGAGCATTGTGTTCTTGTTGATG
CAACACGACAGAAGGTTCGATGTAATTATTGTCAGCGGGAATTCAGTGGAGGTGTATACAGGATGAAATTTCATTTGGCTCAAATAAAAAACAAAGATATTGTTCCATGT
ACTGAAGTACCAACCGATGTTCGAGACCACATTCAAGGTATATTAAGCACTCCTAAGAAACAGAAGGCACCTAAGAAACCAAAGGTGGATATGGAAACTGCAACAAATGG
ACAGCAACATAGCTCCTCTGCTAGTGGTGGCATCCATCATGGATCCAGTGGTCAGAATGAAAGCAACTGCCCATCGACGTTTCCGTGCCTTTCACCAAGTGCACAACCAC
CAATTGATGATGCTCAAAAGCAGAAGAAGGATGAGACTGATAAAAAAGTTGCCATCTTTTTCTTCCATAATTCTATTCCTTTCAGTGCTGCCAAGTCTTTGTATTATCAG
GAAATGGTGGATGCAATAGCAGAATATGGAGGAGGATACAAAGCACCAAGTTATGAGAAATTAAAATCTACTCTTTTGGATAAAGTGAAAGGTGACATACATAGTTCTTA
CAAAAAGCATAGAGATGAATGGAAAGAAACAGGCTGTACTATCCTGTGTGATAGTTGGTCCGATGGACAGACCAAATCATTTCTAGTCATTTCTGTTACTTGTTCTAAAG
GAACACTGTTTCTGAAGTCGGTCGATATATCAGGTCATGAAGATGATGCAACTTACCTGTCCGACTTGCTTGAGACCATCATCCTTGAGGTTGGAGTGGAGAATGTTGTC
CAAATTATAACAGATGCTACTGCCAGTTATGTCTATGCTGGGAGGCTTCTCATGACCAAGTACACTTCCTTATTTTGGTCTCCATGTGTTTCTTATTGTGTTAATCAGAT
GTTGGAAGACATCAGTAAAATCGAGTGGGTCAGTGCAGTATTGGAGGAGGCAAAGATCATCACCCGGTACATTTATAGTCATGCGTCAATTTTGAATACCATGCGAAAAT
TCACTGGGGGAAAGGAATTAATCAGGCCAAGAATTACTAGATTTGTGACTAATTTTCTCTCTTTGAGGTCCATTGTGATTCTTGAGGACAATCTCAAACACATGTTTGCT
CATTCAGAGTGGCTGTCCTCAATTTATAGCAGGCGTCCTGATGCACAAGCAATTATTTCCTTGCTGTATTTGGATAGATTTTGGAAGGATGCACATGAAGCTATCAACAT
TTGTGAACCACTTATTAGAATTCTGAGAATTGTCGATGGAGACATGCCTGCCATGGGCTATATATTTGAAGGAATAGAGAGGGCAAAGGTTGAAATCAAAACATATTACA
ATGGCTTTGAGGATAAATATATGCCTATTTGGGAAACAATCGACCGGAGATGGAATTTGCAGCTTCACACAACGTTGCACACAGCAGCAGCGTTTCTTAACCCGTCTGTT
TTTTACAATCCAAACTTTAAGATTGATCTGAGAATTAGAAATGGATTTCAAGAAGCTATGTTGAAGATGGCGACAACTGATAAAGATAAAATGGAGATCACTAGAGAACA
TCCTGCTTATGTAAATGGGCAAGGTGCTCTTGGTACCGACTTCGCTATCTTGGGGAGAACTATAAATGCCCCAGGTGATTGGTGGTCTGGGTACGGTTATGAGATCCCCA
CACTCCAGAGAGCGGCGGTACGAATACTAAGCCAACCTTGTAGTTCTTATGGGTGCAGTGGATGGAACTGGAGCACATTCGAAACCTTACATTCAAAGAAGCATAGTAGA
GCCGAACAGGAAAAGTTGACTGATTTAGTGTTTGTACAGTGCAATCTTTGGTTGCAACACGTTTGTTTGACTCGGGATAGTAAATATAAACCCGTTGTATTTGATGATGT
AGATGTGAGTTTAGAATGGCCTTCCGAGTTGGAATGCTCAGCTCATGTTTTAGATGATTCATGGTTGGATAATCTGCCTCTTGAAGGTAGAGGCAGTCCTTAA
mRNA sequenceShow/hide mRNA sequence
TTGCCGTTGTATTTCAATTACTCGGCTCCTTTCCCTTTCCTCCTATCGGCCGTCATTTTCTTCCCGGAGTCCCGGCGGCCGGAATCTCTTCCCTCCAACTTCAGACCTTC
CGATGTTACTAACCTTCTATTTCTGATCTTCTCTTTCTCACTAATTCTATCTTCTCAGACAATGGTTCGAGGAAGGGATGCTTGTTGGGAGCATTGTGTTCTTGTTGATG
CAACACGACAGAAGGTTCGATGTAATTATTGTCAGCGGGAATTCAGTGGAGGTGTATACAGGATGAAATTTCATTTGGCTCAAATAAAAAACAAAGATATTGTTCCATGT
ACTGAAGTACCAACCGATGTTCGAGACCACATTCAAGGTATATTAAGCACTCCTAAGAAACAGAAGGCACCTAAGAAACCAAAGGTGGATATGGAAACTGCAACAAATGG
ACAGCAACATAGCTCCTCTGCTAGTGGTGGCATCCATCATGGATCCAGTGGTCAGAATGAAAGCAACTGCCCATCGACGTTTCCGTGCCTTTCACCAAGTGCACAACCAC
CAATTGATGATGCTCAAAAGCAGAAGAAGGATGAGACTGATAAAAAAGTTGCCATCTTTTTCTTCCATAATTCTATTCCTTTCAGTGCTGCCAAGTCTTTGTATTATCAG
GAAATGGTGGATGCAATAGCAGAATATGGAGGAGGATACAAAGCACCAAGTTATGAGAAATTAAAATCTACTCTTTTGGATAAAGTGAAAGGTGACATACATAGTTCTTA
CAAAAAGCATAGAGATGAATGGAAAGAAACAGGCTGTACTATCCTGTGTGATAGTTGGTCCGATGGACAGACCAAATCATTTCTAGTCATTTCTGTTACTTGTTCTAAAG
GAACACTGTTTCTGAAGTCGGTCGATATATCAGGTCATGAAGATGATGCAACTTACCTGTCCGACTTGCTTGAGACCATCATCCTTGAGGTTGGAGTGGAGAATGTTGTC
CAAATTATAACAGATGCTACTGCCAGTTATGTCTATGCTGGGAGGCTTCTCATGACCAAGTACACTTCCTTATTTTGGTCTCCATGTGTTTCTTATTGTGTTAATCAGAT
GTTGGAAGACATCAGTAAAATCGAGTGGGTCAGTGCAGTATTGGAGGAGGCAAAGATCATCACCCGGTACATTTATAGTCATGCGTCAATTTTGAATACCATGCGAAAAT
TCACTGGGGGAAAGGAATTAATCAGGCCAAGAATTACTAGATTTGTGACTAATTTTCTCTCTTTGAGGTCCATTGTGATTCTTGAGGACAATCTCAAACACATGTTTGCT
CATTCAGAGTGGCTGTCCTCAATTTATAGCAGGCGTCCTGATGCACAAGCAATTATTTCCTTGCTGTATTTGGATAGATTTTGGAAGGATGCACATGAAGCTATCAACAT
TTGTGAACCACTTATTAGAATTCTGAGAATTGTCGATGGAGACATGCCTGCCATGGGCTATATATTTGAAGGAATAGAGAGGGCAAAGGTTGAAATCAAAACATATTACA
ATGGCTTTGAGGATAAATATATGCCTATTTGGGAAACAATCGACCGGAGATGGAATTTGCAGCTTCACACAACGTTGCACACAGCAGCAGCGTTTCTTAACCCGTCTGTT
TTTTACAATCCAAACTTTAAGATTGATCTGAGAATTAGAAATGGATTTCAAGAAGCTATGTTGAAGATGGCGACAACTGATAAAGATAAAATGGAGATCACTAGAGAACA
TCCTGCTTATGTAAATGGGCAAGGTGCTCTTGGTACCGACTTCGCTATCTTGGGGAGAACTATAAATGCCCCAGGTGATTGGTGGTCTGGGTACGGTTATGAGATCCCCA
CACTCCAGAGAGCGGCGGTACGAATACTAAGCCAACCTTGTAGTTCTTATGGGTGCAGTGGATGGAACTGGAGCACATTCGAAACCTTACATTCAAAGAAGCATAGTAGA
GCCGAACAGGAAAAGTTGACTGATTTAGTGTTTGTACAGTGCAATCTTTGGTTGCAACACGTTTGTTTGACTCGGGATAGTAAATATAAACCCGTTGTATTTGATGATGT
AGATGTGAGTTTAGAATGGCCTTCCGAGTTGGAATGCTCAGCTCATGTTTTAGATGATTCATGGTTGGATAATCTGCCTCTTGAAGGTAGAGGCAGTCCTTAA
Protein sequenceShow/hide protein sequence
LPLYFNYSAPFPFLLSAVIFFPESRRPESLPSNFRPSDVTNLLFLIFSFSLILSSQTMVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPC
TEVPTDVRDHIQGILSTPKKQKAPKKPKVDMETATNGQQHSSSASGGIHHGSSGQNESNCPSTFPCLSPSAQPPIDDAQKQKKDETDKKVAIFFFHNSIPFSAAKSLYYQ
EMVDAIAEYGGGYKAPSYEKLKSTLLDKVKGDIHSSYKKHRDEWKETGCTILCDSWSDGQTKSFLVISVTCSKGTLFLKSVDISGHEDDATYLSDLLETIILEVGVENVV
QIITDATASYVYAGRLLMTKYTSLFWSPCVSYCVNQMLEDISKIEWVSAVLEEAKIITRYIYSHASILNTMRKFTGGKELIRPRITRFVTNFLSLRSIVILEDNLKHMFA
HSEWLSSIYSRRPDAQAIISLLYLDRFWKDAHEAINICEPLIRILRIVDGDMPAMGYIFEGIERAKVEIKTYYNGFEDKYMPIWETIDRRWNLQLHTTLHTAAAFLNPSV
FYNPNFKIDLRIRNGFQEAMLKMATTDKDKMEITREHPAYVNGQGALGTDFAILGRTINAPGDWWSGYGYEIPTLQRAAVRILSQPCSSYGCSGWNWSTFETLHSKKHSR
AEQEKLTDLVFVQCNLWLQHVCLTRDSKYKPVVFDDVDVSLEWPSELECSAHVLDDSWLDNLPLEGRGSP