; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI03G03090 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI03G03090
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionHAT transposon superfamily
Genome locationChr3:2518205..2524123
RNA-Seq ExpressionCSPI03G03090
SyntenyCSPI03G03090
Gene Ontology termsGO:0005634 - nucleus (cellular component)
GO:0003677 - DNA binding (molecular function)
GO:0046872 - metal ion binding (molecular function)
GO:0046983 - protein dimerization activity (molecular function)
InterPro domainsIPR003656 - Zinc finger, BED-type
IPR007021 - Domain of unknown function DUF659
IPR008906 - HAT, C-terminal dimerisation domain
IPR012337 - Ribonuclease H-like superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008448901.1 PREDICTED: uncharacterized protein LOC103490927 [Cucumis melo]0.0e+0097.92Show/hide
Query:  MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPCTEVPTDVRDHIQGILSTPKKQKAPKKPKVDMETATNGQQHSSSASGG
        MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPCTEVPTDVRDHIQGILSTPKKQKAPKKPKVDMETATNGQQHSSSASGG
Subjt:  MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPCTEVPTDVRDHIQGILSTPKKQKAPKKPKVDMETATNGQQHSSSASGG

Query:  IHHGSSGQNESNCPSTFPCLSPSAQPPIDDAQKQKRDETDKKVAIFFFHNSIPFSAAKSLYYQEMVDAIAEYGGGYKAPSYEKLKSTLLDKVKGDIHSSY
        IHHGSSGQNESNCPSTFPCLSPSAQPPIDDAQKQK+DETDKKVAIFFFHNSIPFSAAKSLYYQEMVDAIAEYGGGYKAPSYEKLKSTLLDKVKGDIHSSY
Subjt:  IHHGSSGQNESNCPSTFPCLSPSAQPPIDDAQKQKRDETDKKVAIFFFHNSIPFSAAKSLYYQEMVDAIAEYGGGYKAPSYEKLKSTLLDKVKGDIHSSY

Query:  KKHRDEWKETGCTILCDSWSDGQTKSFLVISVTCSKGTLFLKSVDISGHEDDATYLSDLLETIVLEVGVENVVQIITDATASYVYAGRLLMTKYTSLFWS
        KK RDEWKETGCTILCDSWSDG+TKSFLVISVTCSKGTLFLKSVD SGHEDDATYLSDLLETIVLEVGVENVVQIITDATASYVYAGRLLMTKYTSLFWS
Subjt:  KKHRDEWKETGCTILCDSWSDGQTKSFLVISVTCSKGTLFLKSVDISGHEDDATYLSDLLETIVLEVGVENVVQIITDATASYVYAGRLLMTKYTSLFWS

Query:  PCVSYCVNQMLEDISKIEWVSAVLEEAKIITRYIYSHASILNTMRKFTGGKELIRPRITRFVTNFLSLRSIVILEDNLKHMFAHSEWLSSIYSRRPDAQA
        PCVSYCVNQMLEDISKIEWVS VLEEAKIITRYIYSHASILNTMRKFTGGKELIRPRITRFVTNFLSLRSIVILE+NLKHMFAHSEWLSSIYSRRPDAQA
Subjt:  PCVSYCVNQMLEDISKIEWVSAVLEEAKIITRYIYSHASILNTMRKFTGGKELIRPRITRFVTNFLSLRSIVILEDNLKHMFAHSEWLSSIYSRRPDAQA

Query:  IISLLYLDRFWKDAHEAINICEPLIRILRIVDGDMPAMGYIFEGIERAKVEIKTYYNGFEDKYMPIWETIDRRWNLQLHTTLHTAAAFLNPSVFYNPNFK
        IISLLYLDRFWKDAHEAINICEPLIRILRIVDGDMPAMGYIFEGIERAKVEIKTYYNGFEDKYMPIWETIDRRWNLQLHTTLHTAAAFLNPSVFYNPNFK
Subjt:  IISLLYLDRFWKDAHEAINICEPLIRILRIVDGDMPAMGYIFEGIERAKVEIKTYYNGFEDKYMPIWETIDRRWNLQLHTTLHTAAAFLNPSVFYNPNFK

Query:  IDLRIRNGFQEAMLKMATTDKDKMEITREHPAYVNGQGALGTDFAILGRTINAPGDWWSGYGYEIPTLQRAAVRILSQPCSSYGCSGWNWSTFETLHSKK
        IDLRIRNGFQEAMLKMATTDKDKMEITREHPAYVNGQGALGTDFAILGRTIN+PGDWWSGYGYEIPTLQRAAVRILSQPCSSYGCS WNWSTFETLHSKK
Subjt:  IDLRIRNGFQEAMLKMATTDKDKMEITREHPAYVNGQGALGTDFAILGRTINAPGDWWSGYGYEIPTLQRAAVRILSQPCSSYGCSGWNWSTFETLHSKK

Query:  HSRAEQEKLTDLVFVQCNLWLQHVCLTRDSKYKPVVFDDVDVSLEWPSELECSAHVLDDSWLD-NLPLECRGSP
         SRAEQEKLTDLVFVQCNLWLQH+C TRDSKYKP+VFDD+DVSLEWPSELECSAHVLDDSWLD NLPLECRGSP
Subjt:  HSRAEQEKLTDLVFVQCNLWLQHVCLTRDSKYKPVVFDDVDVSLEWPSELECSAHVLDDSWLD-NLPLECRGSP

XP_011650424.1 uncharacterized protein LOC101222344 [Cucumis sativus]0.0e+0099.41Show/hide
Query:  MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPCTEVPTDVRDHIQGILSTPKKQKAPKKPKVDMETATNGQQHSSSASGG
        MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPCTEVPTDVRDHIQGILSTPKKQKAPKKPKVDMETATNGQQHSSSASGG
Subjt:  MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPCTEVPTDVRDHIQGILSTPKKQKAPKKPKVDMETATNGQQHSSSASGG

Query:  IHHGSSGQNESNCPSTFPCLSPSAQPPIDDAQKQKRDETDKKVAIFFFHNSIPFSAAKSLYYQEMVDAIAEYGGGYKAPSYEKLKSTLLDKVKGDIHSSY
        IHHGSSGQNESNCPST+PCLSPSAQPPIDDAQKQK+DETDKKVAIFFFHNSIPFSAAKSLYYQEMVDAIAEYGGGYKAPSYEKLKSTLLDKVKGDIHSSY
Subjt:  IHHGSSGQNESNCPSTFPCLSPSAQPPIDDAQKQKRDETDKKVAIFFFHNSIPFSAAKSLYYQEMVDAIAEYGGGYKAPSYEKLKSTLLDKVKGDIHSSY

Query:  KKHRDEWKETGCTILCDSWSDGQTKSFLVISVTCSKGTLFLKSVDISGHEDDATYLSDLLETIVLEVGVENVVQIITDATASYVYAGRLLMTKYTSLFWS
        KKHRDEWKETGCTILCDSWSDGQTKSFLVISVTCSKGTLFLKSVDISGHEDDATYLSDLLETI+LEVGVENVVQIITDATASYVYAGRLLMTKYTSLFWS
Subjt:  KKHRDEWKETGCTILCDSWSDGQTKSFLVISVTCSKGTLFLKSVDISGHEDDATYLSDLLETIVLEVGVENVVQIITDATASYVYAGRLLMTKYTSLFWS

Query:  PCVSYCVNQMLEDISKIEWVSAVLEEAKIITRYIYSHASILNTMRKFTGGKELIRPRITRFVTNFLSLRSIVILEDNLKHMFAHSEWLSSIYSRRPDAQA
        PCVSYCVNQMLEDISKIEWVSAVLEEAKIITRYIYSHASILNTMRKFTGGKELIRPRITRFVTNFLSLRSIVILEDNLKHMFAHSEWLSSIYSRRPDAQA
Subjt:  PCVSYCVNQMLEDISKIEWVSAVLEEAKIITRYIYSHASILNTMRKFTGGKELIRPRITRFVTNFLSLRSIVILEDNLKHMFAHSEWLSSIYSRRPDAQA

Query:  IISLLYLDRFWKDAHEAINICEPLIRILRIVDGDMPAMGYIFEGIERAKVEIKTYYNGFEDKYMPIWETIDRRWNLQLHTTLHTAAAFLNPSVFYNPNFK
        IISLLYLDRFWKDAHEAINICEPLIRILRIVDGDMPAMGYIFEGIERAKVEIKTYYNGFEDKYMPIWETIDRRWNLQLHTTLHTAAAFLNPSVFYNPNFK
Subjt:  IISLLYLDRFWKDAHEAINICEPLIRILRIVDGDMPAMGYIFEGIERAKVEIKTYYNGFEDKYMPIWETIDRRWNLQLHTTLHTAAAFLNPSVFYNPNFK

Query:  IDLRIRNGFQEAMLKMATTDKDKMEITREHPAYVNGQGALGTDFAILGRTINAPGDWWSGYGYEIPTLQRAAVRILSQPCSSYGCSGWNWSTFETLHSKK
        IDLRIRNGFQEAMLKMATTDKDKMEITREHPAYVNGQGALGTDFAILGRTINAPGDWWSGYGYEIPTLQRAAVRILSQPCSSYGCSGWNWSTFETLHSKK
Subjt:  IDLRIRNGFQEAMLKMATTDKDKMEITREHPAYVNGQGALGTDFAILGRTINAPGDWWSGYGYEIPTLQRAAVRILSQPCSSYGCSGWNWSTFETLHSKK

Query:  HSRAEQEKLTDLVFVQCNLWLQHVCLTRDSKYKPVVFDDVDVSLEWPSELECSAHVLDDSWLDNLPLECRGSP
        HSRAEQEKLTDLVFVQCNLWLQHVCLTRDSKYKPVVFDDVDVSLEWPSELECSAHVLDDSWLDNLPLE RGSP
Subjt:  HSRAEQEKLTDLVFVQCNLWLQHVCLTRDSKYKPVVFDDVDVSLEWPSELECSAHVLDDSWLDNLPLECRGSP

XP_022923437.1 uncharacterized protein LOC111431132 [Cucurbita moschata]0.0e+0091.08Show/hide
Query:  MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPCTEVPTDVRDHIQGILSTPKKQKAPKKPKVDMETATNGQQHSSSASGG
        MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPC EVP DVRDHIQGILSTPKKQ+APKKPKVDMETATNGQQHSSSASGG
Subjt:  MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPCTEVPTDVRDHIQGILSTPKKQKAPKKPKVDMETATNGQQHSSSASGG

Query:  IHHGSSGQNESNCPSTFPCLSPSAQPPIDDAQKQKRDETDKKVAIFFFHNSIPFSAAKSLYYQEMVDAIAEYGGGYKAPSYEKLKSTLLDKVKGDIHSSY
        IHHGSSGQNESNCPSTFP  SPSAQPPIDDAQKQK+DETDKKVA+FFFHNSIPFSAAKSLYYQEMV+AIAEYG GY+APSYEKLKSTLL KVKGDI +SY
Subjt:  IHHGSSGQNESNCPSTFPCLSPSAQPPIDDAQKQKRDETDKKVAIFFFHNSIPFSAAKSLYYQEMVDAIAEYGGGYKAPSYEKLKSTLLDKVKGDIHSSY

Query:  KKHRDEWKETGCTILCDSWSDGQTKSFLVISVTCSKGTLFLKSVDISGHEDDATYLSDLLETIVLEVGVENVVQIITDATASYVYAGRLLMTKYTSLFWS
        KK+RDEWKETGCTILC+SWSDG+TKSFL+IS+TCSKGTLFLKSV+ISG EDDATYLSDLLETIVLEVGVENVVQ+ITDATASYVYAGRLLMTKYTSLFWS
Subjt:  KKHRDEWKETGCTILCDSWSDGQTKSFLVISVTCSKGTLFLKSVDISGHEDDATYLSDLLETIVLEVGVENVVQIITDATASYVYAGRLLMTKYTSLFWS

Query:  PCVSYCVNQMLEDISKIEWVSAVLEEAKIITRYIYSHASILNTMRKFTGGKELIRPRITRFVTNFLSLRSIVILEDNLKHMFAHSEWLSSIYSRRPDAQA
        PCVSYCVNQMLED+SKIEWV  VL+EAKII RY+YSHA ILNTMRKFT GKELIRPRITRFVTNFLSLRSIV LED LKHMFAHSEW SSIYSRRPDAQA
Subjt:  PCVSYCVNQMLEDISKIEWVSAVLEEAKIITRYIYSHASILNTMRKFTGGKELIRPRITRFVTNFLSLRSIVILEDNLKHMFAHSEWLSSIYSRRPDAQA

Query:  IISLLYLDRFWKDAHEAINICEPLIRILRIVDGDMPAMGYIFEGIERAKVEIKTYYNGFEDKYMPIWETIDRRWNLQLHTTLHTAAAFLNPSVFYNPNFK
        I+S LYLDRFWKDA EA+NI EPLIRILR+VDGDMPAMGYI+EGIERAKVE+K YYNG EDKYMPIW+TIDRRWNLQLHTTLHTAAAFLNPS+FYNPNFK
Subjt:  IISLLYLDRFWKDAHEAINICEPLIRILRIVDGDMPAMGYIFEGIERAKVEIKTYYNGFEDKYMPIWETIDRRWNLQLHTTLHTAAAFLNPSVFYNPNFK

Query:  IDLRIRNGFQEAMLKMATTDKDKMEITREHPAYVNGQGALGTDFAILGRTINAPGDWWSGYGYEIPTLQRAAVRILSQPCSSYGCSGWNWSTFETLHSKK
        IDLRIRNGFQEAMLKMATTDKDKMEITREHPAYVN QGALGTDFAILGRTIN PGDWWSGYGYEIPTLQRAA+RILSQPCSSYGCS WNWSTFETLHSKK
Subjt:  IDLRIRNGFQEAMLKMATTDKDKMEITREHPAYVNGQGALGTDFAILGRTINAPGDWWSGYGYEIPTLQRAAVRILSQPCSSYGCSGWNWSTFETLHSKK

Query:  HSRAEQEKLTDLVFVQCNLWLQHVCLTRDSKYKPVVFDDVDVSLEWPSELECSAHVLDDSWLDNLPLECRGSP
         S  EQEKL DLVFVQCNLWLQH+  TRD KYKPVVFDD+DVSLEWP+ELE SAHVLDDSWLDNLPLEC GSP
Subjt:  HSRAEQEKLTDLVFVQCNLWLQHVCLTRDSKYKPVVFDDVDVSLEWPSELECSAHVLDDSWLDNLPLECRGSP

XP_038876874.1 uncharacterized protein LOC120069237 isoform X1 [Benincasa hispida]0.0e+0095.1Show/hide
Query:  MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPCTEVPTDVRDHIQGILSTPKKQKAPKKPKVDMETATNGQQHSSSASGG
        MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPC EVPTDVRDHIQGILSTPKKQKAPKKPKVDMETATNGQQHSSSASGG
Subjt:  MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPCTEVPTDVRDHIQGILSTPKKQKAPKKPKVDMETATNGQQHSSSASGG

Query:  IHHGSSGQNESNCPSTFPCLSPSAQPPIDDAQKQKRDETDKKVAIFFFHNSIPFSAAKSLYYQEMVDAIAEYGGGYKAPSYEKLKSTLLDKVKGDIHSSY
        IHHGSSGQNESNCPSTFPC SPSAQPPIDD QKQK+DETDKKVAIFFFHNSIPFSAAKSLYYQEMV+AIAEYG GYKAPSYEKLKSTLLDKVKGDIH+SY
Subjt:  IHHGSSGQNESNCPSTFPCLSPSAQPPIDDAQKQKRDETDKKVAIFFFHNSIPFSAAKSLYYQEMVDAIAEYGGGYKAPSYEKLKSTLLDKVKGDIHSSY

Query:  KKHRDEWKETGCTILCDSWSDGQTKSFLVISVTCSKGTLFLKSVDISGHEDDATYLSDLLETIVLEVGVENVVQIITDATASYVYAGRLLMTKYTSLFWS
        KK+ DEWKETGCTILCDSWSDG+TKSFLVIS+TCSKG LFLKSVDISGHEDDATYLSDLLETIVLEVGVENVVQ+ITDATASYVYAGRLLMTKYTSLFWS
Subjt:  KKHRDEWKETGCTILCDSWSDGQTKSFLVISVTCSKGTLFLKSVDISGHEDDATYLSDLLETIVLEVGVENVVQIITDATASYVYAGRLLMTKYTSLFWS

Query:  PCVSYCVNQMLEDISKIEWVSAVLEEAKIITRYIYSHASILNTMRKFTGGKELIRPRITRFVTNFLSLRSIVILEDNLKHMFAHSEWLSSIYSRRPDAQA
        PCVSYCVNQMLEDISKIEWV  VLEEAKIITRYIYSHASILNTMRKFT GKELIRPRITRFVTNFLSLRSIVI EDNLKHMFAHSEWLSSIYSRRPDAQA
Subjt:  PCVSYCVNQMLEDISKIEWVSAVLEEAKIITRYIYSHASILNTMRKFTGGKELIRPRITRFVTNFLSLRSIVILEDNLKHMFAHSEWLSSIYSRRPDAQA

Query:  IISLLYLDRFWKDAHEAINICEPLIRILRIVDGDMPAMGYIFEGIERAKVEIKTYYNGFEDKYMPIWETIDRRWNLQLHTTLHTAAAFLNPSVFYNPNFK
        IISLLYLDRFWKDA EA+NI EPLIRILRIVDGDMPAMGYIFEGIERAKVEIKTYYNG EDKY+PIWETIDRRWNLQLHTTLHTAAAFLNPSVFYNPNFK
Subjt:  IISLLYLDRFWKDAHEAINICEPLIRILRIVDGDMPAMGYIFEGIERAKVEIKTYYNGFEDKYMPIWETIDRRWNLQLHTTLHTAAAFLNPSVFYNPNFK

Query:  IDLRIRNGFQEAMLKMATTDKDKMEITREHPAYVNGQGALGTDFAILGRTINAPGDWWSGYGYEIPTLQRAAVRILSQPCSSYGCSGWNWSTFETLHSKK
        IDLRIRNGFQEAMLKMATTDKDKMEITREHPAYVNGQGALGTDFAILGRTINAPGDWWSGYGYEIPTLQRAA+RIL+QPCSSYGCS WNWSTFETLHSKK
Subjt:  IDLRIRNGFQEAMLKMATTDKDKMEITREHPAYVNGQGALGTDFAILGRTINAPGDWWSGYGYEIPTLQRAAVRILSQPCSSYGCSGWNWSTFETLHSKK

Query:  HSRAEQEKLTDLVFVQCNLWLQHVCLTRDSKYKPVVFDDVDVSLEWPSELECSAHVLDDSWLDNLPLECRGSP
         SRAEQEKL DLVFVQCNLWLQH+CLTRD KYKPVVFDD+DVSLEWP+E E SAHVLDDSWLDNLPLECRGSP
Subjt:  HSRAEQEKLTDLVFVQCNLWLQHVCLTRDSKYKPVVFDDVDVSLEWPSELECSAHVLDDSWLDNLPLECRGSP

XP_038876877.1 uncharacterized protein LOC120069237 isoform X2 [Benincasa hispida]0.0e+0095.1Show/hide
Query:  MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPCTEVPTDVRDHIQGILSTPKKQKAPKKPKVDMETATNGQQHSSSASGG
        MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPC EVPTDVRDHIQGILSTPKKQKAPKKPKVDMETATNGQQHSSSASGG
Subjt:  MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPCTEVPTDVRDHIQGILSTPKKQKAPKKPKVDMETATNGQQHSSSASGG

Query:  IHHGSSGQNESNCPSTFPCLSPSAQPPIDDAQKQKRDETDKKVAIFFFHNSIPFSAAKSLYYQEMVDAIAEYGGGYKAPSYEKLKSTLLDKVKGDIHSSY
        IHHGSSGQNESNCPSTFPC SPSAQPPIDD QKQK+DETDKKVAIFFFHNSIPFSAAKSLYYQEMV+AIAEYG GYKAPSYEKLKSTLLDKVKGDIH+SY
Subjt:  IHHGSSGQNESNCPSTFPCLSPSAQPPIDDAQKQKRDETDKKVAIFFFHNSIPFSAAKSLYYQEMVDAIAEYGGGYKAPSYEKLKSTLLDKVKGDIHSSY

Query:  KKHRDEWKETGCTILCDSWSDGQTKSFLVISVTCSKGTLFLKSVDISGHEDDATYLSDLLETIVLEVGVENVVQIITDATASYVYAGRLLMTKYTSLFWS
        KK+ DEWKETGCTILCDSWSDG+TKSFLVIS+TCSKG LFLKSVDISGHEDDATYLSDLLETIVLEVGVENVVQ+ITDATASYVYAGRLLMTKYTSLFWS
Subjt:  KKHRDEWKETGCTILCDSWSDGQTKSFLVISVTCSKGTLFLKSVDISGHEDDATYLSDLLETIVLEVGVENVVQIITDATASYVYAGRLLMTKYTSLFWS

Query:  PCVSYCVNQMLEDISKIEWVSAVLEEAKIITRYIYSHASILNTMRKFTGGKELIRPRITRFVTNFLSLRSIVILEDNLKHMFAHSEWLSSIYSRRPDAQA
        PCVSYCVNQMLEDISKIEWV  VLEEAKIITRYIYSHASILNTMRKFT GKELIRPRITRFVTNFLSLRSIVI EDNLKHMFAHSEWLSSIYSRRPDAQA
Subjt:  PCVSYCVNQMLEDISKIEWVSAVLEEAKIITRYIYSHASILNTMRKFTGGKELIRPRITRFVTNFLSLRSIVILEDNLKHMFAHSEWLSSIYSRRPDAQA

Query:  IISLLYLDRFWKDAHEAINICEPLIRILRIVDGDMPAMGYIFEGIERAKVEIKTYYNGFEDKYMPIWETIDRRWNLQLHTTLHTAAAFLNPSVFYNPNFK
        IISLLYLDRFWKDA EA+NI EPLIRILRIVDGDMPAMGYIFEGIERAKVEIKTYYNG EDKY+PIWETIDRRWNLQLHTTLHTAAAFLNPSVFYNPNFK
Subjt:  IISLLYLDRFWKDAHEAINICEPLIRILRIVDGDMPAMGYIFEGIERAKVEIKTYYNGFEDKYMPIWETIDRRWNLQLHTTLHTAAAFLNPSVFYNPNFK

Query:  IDLRIRNGFQEAMLKMATTDKDKMEITREHPAYVNGQGALGTDFAILGRTINAPGDWWSGYGYEIPTLQRAAVRILSQPCSSYGCSGWNWSTFETLHSKK
        IDLRIRNGFQEAMLKMATTDKDKMEITREHPAYVNGQGALGTDFAILGRTINAPGDWWSGYGYEIPTLQRAA+RIL+QPCSSYGCS WNWSTFETLHSKK
Subjt:  IDLRIRNGFQEAMLKMATTDKDKMEITREHPAYVNGQGALGTDFAILGRTINAPGDWWSGYGYEIPTLQRAAVRILSQPCSSYGCSGWNWSTFETLHSKK

Query:  HSRAEQEKLTDLVFVQCNLWLQHVCLTRDSKYKPVVFDDVDVSLEWPSELECSAHVLDDSWLDNLPLECRGSP
         SRAEQEKL DLVFVQCNLWLQH+CLTRD KYKPVVFDD+DVSLEWP+E E SAHVLDDSWLDNLPLECRGSP
Subjt:  HSRAEQEKLTDLVFVQCNLWLQHVCLTRDSKYKPVVFDDVDVSLEWPSELECSAHVLDDSWLDNLPLECRGSP

TrEMBL top hitse value%identityAlignment
A0A0A0L2E4 BED-type domain-containing protein0.0e+0099.41Show/hide
Query:  MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPCTEVPTDVRDHIQGILSTPKKQKAPKKPKVDMETATNGQQHSSSASGG
        MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPCTEVPTDVRDHIQGILSTPKKQKAPKKPKVDMETATNGQQHSSSASGG
Subjt:  MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPCTEVPTDVRDHIQGILSTPKKQKAPKKPKVDMETATNGQQHSSSASGG

Query:  IHHGSSGQNESNCPSTFPCLSPSAQPPIDDAQKQKRDETDKKVAIFFFHNSIPFSAAKSLYYQEMVDAIAEYGGGYKAPSYEKLKSTLLDKVKGDIHSSY
        IHHGSSGQNESNCPST+PCLSPSAQPPIDDAQKQK+DETDKKVAIFFFHNSIPFSAAKSLYYQEMVDAIAEYGGGYKAPSYEKLKSTLLDKVKGDIHSSY
Subjt:  IHHGSSGQNESNCPSTFPCLSPSAQPPIDDAQKQKRDETDKKVAIFFFHNSIPFSAAKSLYYQEMVDAIAEYGGGYKAPSYEKLKSTLLDKVKGDIHSSY

Query:  KKHRDEWKETGCTILCDSWSDGQTKSFLVISVTCSKGTLFLKSVDISGHEDDATYLSDLLETIVLEVGVENVVQIITDATASYVYAGRLLMTKYTSLFWS
        KKHRDEWKETGCTILCDSWSDGQTKSFLVISVTCSKGTLFLKSVDISGHEDDATYLSDLLETI+LEVGVENVVQIITDATASYVYAGRLLMTKYTSLFWS
Subjt:  KKHRDEWKETGCTILCDSWSDGQTKSFLVISVTCSKGTLFLKSVDISGHEDDATYLSDLLETIVLEVGVENVVQIITDATASYVYAGRLLMTKYTSLFWS

Query:  PCVSYCVNQMLEDISKIEWVSAVLEEAKIITRYIYSHASILNTMRKFTGGKELIRPRITRFVTNFLSLRSIVILEDNLKHMFAHSEWLSSIYSRRPDAQA
        PCVSYCVNQMLEDISKIEWVSAVLEEAKIITRYIYSHASILNTMRKFTGGKELIRPRITRFVTNFLSLRSIVILEDNLKHMFAHSEWLSSIYSRRPDAQA
Subjt:  PCVSYCVNQMLEDISKIEWVSAVLEEAKIITRYIYSHASILNTMRKFTGGKELIRPRITRFVTNFLSLRSIVILEDNLKHMFAHSEWLSSIYSRRPDAQA

Query:  IISLLYLDRFWKDAHEAINICEPLIRILRIVDGDMPAMGYIFEGIERAKVEIKTYYNGFEDKYMPIWETIDRRWNLQLHTTLHTAAAFLNPSVFYNPNFK
        IISLLYLDRFWKDAHEAINICEPLIRILRIVDGDMPAMGYIFEGIERAKVEIKTYYNGFEDKYMPIWETIDRRWNLQLHTTLHTAAAFLNPSVFYNPNFK
Subjt:  IISLLYLDRFWKDAHEAINICEPLIRILRIVDGDMPAMGYIFEGIERAKVEIKTYYNGFEDKYMPIWETIDRRWNLQLHTTLHTAAAFLNPSVFYNPNFK

Query:  IDLRIRNGFQEAMLKMATTDKDKMEITREHPAYVNGQGALGTDFAILGRTINAPGDWWSGYGYEIPTLQRAAVRILSQPCSSYGCSGWNWSTFETLHSKK
        IDLRIRNGFQEAMLKMATTDKDKMEITREHPAYVNGQGALGTDFAILGRTINAPGDWWSGYGYEIPTLQRAAVRILSQPCSSYGCSGWNWSTFETLHSKK
Subjt:  IDLRIRNGFQEAMLKMATTDKDKMEITREHPAYVNGQGALGTDFAILGRTINAPGDWWSGYGYEIPTLQRAAVRILSQPCSSYGCSGWNWSTFETLHSKK

Query:  HSRAEQEKLTDLVFVQCNLWLQHVCLTRDSKYKPVVFDDVDVSLEWPSELECSAHVLDDSWLDNLPLECRGSP
        HSRAEQEKLTDLVFVQCNLWLQHVCLTRDSKYKPVVFDDVDVSLEWPSELECSAHVLDDSWLDNLPLE RGSP
Subjt:  HSRAEQEKLTDLVFVQCNLWLQHVCLTRDSKYKPVVFDDVDVSLEWPSELECSAHVLDDSWLDNLPLECRGSP

A0A1S3BLP8 uncharacterized protein LOC1034909270.0e+0097.92Show/hide
Query:  MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPCTEVPTDVRDHIQGILSTPKKQKAPKKPKVDMETATNGQQHSSSASGG
        MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPCTEVPTDVRDHIQGILSTPKKQKAPKKPKVDMETATNGQQHSSSASGG
Subjt:  MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPCTEVPTDVRDHIQGILSTPKKQKAPKKPKVDMETATNGQQHSSSASGG

Query:  IHHGSSGQNESNCPSTFPCLSPSAQPPIDDAQKQKRDETDKKVAIFFFHNSIPFSAAKSLYYQEMVDAIAEYGGGYKAPSYEKLKSTLLDKVKGDIHSSY
        IHHGSSGQNESNCPSTFPCLSPSAQPPIDDAQKQK+DETDKKVAIFFFHNSIPFSAAKSLYYQEMVDAIAEYGGGYKAPSYEKLKSTLLDKVKGDIHSSY
Subjt:  IHHGSSGQNESNCPSTFPCLSPSAQPPIDDAQKQKRDETDKKVAIFFFHNSIPFSAAKSLYYQEMVDAIAEYGGGYKAPSYEKLKSTLLDKVKGDIHSSY

Query:  KKHRDEWKETGCTILCDSWSDGQTKSFLVISVTCSKGTLFLKSVDISGHEDDATYLSDLLETIVLEVGVENVVQIITDATASYVYAGRLLMTKYTSLFWS
        KK RDEWKETGCTILCDSWSDG+TKSFLVISVTCSKGTLFLKSVD SGHEDDATYLSDLLETIVLEVGVENVVQIITDATASYVYAGRLLMTKYTSLFWS
Subjt:  KKHRDEWKETGCTILCDSWSDGQTKSFLVISVTCSKGTLFLKSVDISGHEDDATYLSDLLETIVLEVGVENVVQIITDATASYVYAGRLLMTKYTSLFWS

Query:  PCVSYCVNQMLEDISKIEWVSAVLEEAKIITRYIYSHASILNTMRKFTGGKELIRPRITRFVTNFLSLRSIVILEDNLKHMFAHSEWLSSIYSRRPDAQA
        PCVSYCVNQMLEDISKIEWVS VLEEAKIITRYIYSHASILNTMRKFTGGKELIRPRITRFVTNFLSLRSIVILE+NLKHMFAHSEWLSSIYSRRPDAQA
Subjt:  PCVSYCVNQMLEDISKIEWVSAVLEEAKIITRYIYSHASILNTMRKFTGGKELIRPRITRFVTNFLSLRSIVILEDNLKHMFAHSEWLSSIYSRRPDAQA

Query:  IISLLYLDRFWKDAHEAINICEPLIRILRIVDGDMPAMGYIFEGIERAKVEIKTYYNGFEDKYMPIWETIDRRWNLQLHTTLHTAAAFLNPSVFYNPNFK
        IISLLYLDRFWKDAHEAINICEPLIRILRIVDGDMPAMGYIFEGIERAKVEIKTYYNGFEDKYMPIWETIDRRWNLQLHTTLHTAAAFLNPSVFYNPNFK
Subjt:  IISLLYLDRFWKDAHEAINICEPLIRILRIVDGDMPAMGYIFEGIERAKVEIKTYYNGFEDKYMPIWETIDRRWNLQLHTTLHTAAAFLNPSVFYNPNFK

Query:  IDLRIRNGFQEAMLKMATTDKDKMEITREHPAYVNGQGALGTDFAILGRTINAPGDWWSGYGYEIPTLQRAAVRILSQPCSSYGCSGWNWSTFETLHSKK
        IDLRIRNGFQEAMLKMATTDKDKMEITREHPAYVNGQGALGTDFAILGRTIN+PGDWWSGYGYEIPTLQRAAVRILSQPCSSYGCS WNWSTFETLHSKK
Subjt:  IDLRIRNGFQEAMLKMATTDKDKMEITREHPAYVNGQGALGTDFAILGRTINAPGDWWSGYGYEIPTLQRAAVRILSQPCSSYGCSGWNWSTFETLHSKK

Query:  HSRAEQEKLTDLVFVQCNLWLQHVCLTRDSKYKPVVFDDVDVSLEWPSELECSAHVLDDSWLD-NLPLECRGSP
         SRAEQEKLTDLVFVQCNLWLQH+C TRDSKYKP+VFDD+DVSLEWPSELECSAHVLDDSWLD NLPLECRGSP
Subjt:  HSRAEQEKLTDLVFVQCNLWLQHVCLTRDSKYKPVVFDDVDVSLEWPSELECSAHVLDDSWLD-NLPLECRGSP

A0A5D3D7G5 HAT transposon superfamily0.0e+0097.92Show/hide
Query:  MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPCTEVPTDVRDHIQGILSTPKKQKAPKKPKVDMETATNGQQHSSSASGG
        MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPCTEVPTDVRDHIQGILSTPKKQKAPKKPKVDMETATNGQQHSSSASGG
Subjt:  MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPCTEVPTDVRDHIQGILSTPKKQKAPKKPKVDMETATNGQQHSSSASGG

Query:  IHHGSSGQNESNCPSTFPCLSPSAQPPIDDAQKQKRDETDKKVAIFFFHNSIPFSAAKSLYYQEMVDAIAEYGGGYKAPSYEKLKSTLLDKVKGDIHSSY
        IHHGSSGQNESNCPSTFPCLSPSAQPPIDDAQKQK+DETDKKVAIFFFHNSIPFSAAKSLYYQEMVDAIAEYGGGYKAPSYEKLKSTLLDKVKGDIHSSY
Subjt:  IHHGSSGQNESNCPSTFPCLSPSAQPPIDDAQKQKRDETDKKVAIFFFHNSIPFSAAKSLYYQEMVDAIAEYGGGYKAPSYEKLKSTLLDKVKGDIHSSY

Query:  KKHRDEWKETGCTILCDSWSDGQTKSFLVISVTCSKGTLFLKSVDISGHEDDATYLSDLLETIVLEVGVENVVQIITDATASYVYAGRLLMTKYTSLFWS
        KK RDEWKETGCTILCDSWSDG+TKSFLVISVTCSKGTLFLKSVD SGHEDDATYLSDLLETIVLEVGVENVVQIITDATASYVYAGRLLMTKYTSLFWS
Subjt:  KKHRDEWKETGCTILCDSWSDGQTKSFLVISVTCSKGTLFLKSVDISGHEDDATYLSDLLETIVLEVGVENVVQIITDATASYVYAGRLLMTKYTSLFWS

Query:  PCVSYCVNQMLEDISKIEWVSAVLEEAKIITRYIYSHASILNTMRKFTGGKELIRPRITRFVTNFLSLRSIVILEDNLKHMFAHSEWLSSIYSRRPDAQA
        PCVSYCVNQMLEDISKIEWVS VLEEAKIITRYIYSHASILNTMRKFTGGKELIRPRITRFVTNFLSLRSIVILE+NLKHMFAHSEWLSSIYSRRPDAQA
Subjt:  PCVSYCVNQMLEDISKIEWVSAVLEEAKIITRYIYSHASILNTMRKFTGGKELIRPRITRFVTNFLSLRSIVILEDNLKHMFAHSEWLSSIYSRRPDAQA

Query:  IISLLYLDRFWKDAHEAINICEPLIRILRIVDGDMPAMGYIFEGIERAKVEIKTYYNGFEDKYMPIWETIDRRWNLQLHTTLHTAAAFLNPSVFYNPNFK
        IISLLYLDRFWKDAHEAINICEPLIRILRIVDGDMPAMGYIFEGIERAKVEIKTYYNGFEDKYMPIWETIDRRWNLQLHTTLHTAAAFLNPSVFYNPNFK
Subjt:  IISLLYLDRFWKDAHEAINICEPLIRILRIVDGDMPAMGYIFEGIERAKVEIKTYYNGFEDKYMPIWETIDRRWNLQLHTTLHTAAAFLNPSVFYNPNFK

Query:  IDLRIRNGFQEAMLKMATTDKDKMEITREHPAYVNGQGALGTDFAILGRTINAPGDWWSGYGYEIPTLQRAAVRILSQPCSSYGCSGWNWSTFETLHSKK
        IDLRIRNGFQEAMLKMATTDKDKMEITREHPAYVNGQGALGTDFAILGRTIN+PGDWWSGYGYEIPTLQRAAVRILSQPCSSYGCS WNWSTFETLHSKK
Subjt:  IDLRIRNGFQEAMLKMATTDKDKMEITREHPAYVNGQGALGTDFAILGRTINAPGDWWSGYGYEIPTLQRAAVRILSQPCSSYGCSGWNWSTFETLHSKK

Query:  HSRAEQEKLTDLVFVQCNLWLQHVCLTRDSKYKPVVFDDVDVSLEWPSELECSAHVLDDSWLD-NLPLECRGSP
         SRAEQEKLTDLVFVQCNLWLQH+C TRDSKYKP+VFDD+DVSLEWPSELECSAHVLDDSWLD NLPLECRGSP
Subjt:  HSRAEQEKLTDLVFVQCNLWLQHVCLTRDSKYKPVVFDDVDVSLEWPSELECSAHVLDDSWLD-NLPLECRGSP

A0A6J1E9N1 uncharacterized protein LOC1114311320.0e+0091.08Show/hide
Query:  MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPCTEVPTDVRDHIQGILSTPKKQKAPKKPKVDMETATNGQQHSSSASGG
        MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPC EVP DVRDHIQGILSTPKKQ+APKKPKVDMETATNGQQHSSSASGG
Subjt:  MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPCTEVPTDVRDHIQGILSTPKKQKAPKKPKVDMETATNGQQHSSSASGG

Query:  IHHGSSGQNESNCPSTFPCLSPSAQPPIDDAQKQKRDETDKKVAIFFFHNSIPFSAAKSLYYQEMVDAIAEYGGGYKAPSYEKLKSTLLDKVKGDIHSSY
        IHHGSSGQNESNCPSTFP  SPSAQPPIDDAQKQK+DETDKKVA+FFFHNSIPFSAAKSLYYQEMV+AIAEYG GY+APSYEKLKSTLL KVKGDI +SY
Subjt:  IHHGSSGQNESNCPSTFPCLSPSAQPPIDDAQKQKRDETDKKVAIFFFHNSIPFSAAKSLYYQEMVDAIAEYGGGYKAPSYEKLKSTLLDKVKGDIHSSY

Query:  KKHRDEWKETGCTILCDSWSDGQTKSFLVISVTCSKGTLFLKSVDISGHEDDATYLSDLLETIVLEVGVENVVQIITDATASYVYAGRLLMTKYTSLFWS
        KK+RDEWKETGCTILC+SWSDG+TKSFL+IS+TCSKGTLFLKSV+ISG EDDATYLSDLLETIVLEVGVENVVQ+ITDATASYVYAGRLLMTKYTSLFWS
Subjt:  KKHRDEWKETGCTILCDSWSDGQTKSFLVISVTCSKGTLFLKSVDISGHEDDATYLSDLLETIVLEVGVENVVQIITDATASYVYAGRLLMTKYTSLFWS

Query:  PCVSYCVNQMLEDISKIEWVSAVLEEAKIITRYIYSHASILNTMRKFTGGKELIRPRITRFVTNFLSLRSIVILEDNLKHMFAHSEWLSSIYSRRPDAQA
        PCVSYCVNQMLED+SKIEWV  VL+EAKII RY+YSHA ILNTMRKFT GKELIRPRITRFVTNFLSLRSIV LED LKHMFAHSEW SSIYSRRPDAQA
Subjt:  PCVSYCVNQMLEDISKIEWVSAVLEEAKIITRYIYSHASILNTMRKFTGGKELIRPRITRFVTNFLSLRSIVILEDNLKHMFAHSEWLSSIYSRRPDAQA

Query:  IISLLYLDRFWKDAHEAINICEPLIRILRIVDGDMPAMGYIFEGIERAKVEIKTYYNGFEDKYMPIWETIDRRWNLQLHTTLHTAAAFLNPSVFYNPNFK
        I+S LYLDRFWKDA EA+NI EPLIRILR+VDGDMPAMGYI+EGIERAKVE+K YYNG EDKYMPIW+TIDRRWNLQLHTTLHTAAAFLNPS+FYNPNFK
Subjt:  IISLLYLDRFWKDAHEAINICEPLIRILRIVDGDMPAMGYIFEGIERAKVEIKTYYNGFEDKYMPIWETIDRRWNLQLHTTLHTAAAFLNPSVFYNPNFK

Query:  IDLRIRNGFQEAMLKMATTDKDKMEITREHPAYVNGQGALGTDFAILGRTINAPGDWWSGYGYEIPTLQRAAVRILSQPCSSYGCSGWNWSTFETLHSKK
        IDLRIRNGFQEAMLKMATTDKDKMEITREHPAYVN QGALGTDFAILGRTIN PGDWWSGYGYEIPTLQRAA+RILSQPCSSYGCS WNWSTFETLHSKK
Subjt:  IDLRIRNGFQEAMLKMATTDKDKMEITREHPAYVNGQGALGTDFAILGRTINAPGDWWSGYGYEIPTLQRAAVRILSQPCSSYGCSGWNWSTFETLHSKK

Query:  HSRAEQEKLTDLVFVQCNLWLQHVCLTRDSKYKPVVFDDVDVSLEWPSELECSAHVLDDSWLDNLPLECRGSP
         S  EQEKL DLVFVQCNLWLQH+  TRD KYKPVVFDD+DVSLEWP+ELE SAHVLDDSWLDNLPLEC GSP
Subjt:  HSRAEQEKLTDLVFVQCNLWLQHVCLTRDSKYKPVVFDDVDVSLEWPSELECSAHVLDDSWLDNLPLECRGSP

A0A6J1KZI0 uncharacterized protein LOC1115002590.0e+0090.49Show/hide
Query:  MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPCTEVPTDVRDHIQGILSTPKKQKAPKKPKVDMETATNGQQHSSSASGG
        MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPC EVP DVRD IQGILSTPKKQ+APKKPKVDMETATNGQQHSSSASGG
Subjt:  MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPCTEVPTDVRDHIQGILSTPKKQKAPKKPKVDMETATNGQQHSSSASGG

Query:  IHHGSSGQNESNCPSTFPCLSPSAQPPIDDAQKQKRDETDKKVAIFFFHNSIPFSAAKSLYYQEMVDAIAEYGGGYKAPSYEKLKSTLLDKVKGDIHSSY
        IHHGSSGQNESNCPST PC SPSAQP IDDAQKQK+DETDKKVA+FFFHNSIPFSAAKSLYYQEMV+AIAEYG GY+APSY+KLKSTLLDKVKGDI +SY
Subjt:  IHHGSSGQNESNCPSTFPCLSPSAQPPIDDAQKQKRDETDKKVAIFFFHNSIPFSAAKSLYYQEMVDAIAEYGGGYKAPSYEKLKSTLLDKVKGDIHSSY

Query:  KKHRDEWKETGCTILCDSWSDGQTKSFLVISVTCSKGTLFLKSVDISGHEDDATYLSDLLETIVLEVGVENVVQIITDATASYVYAGRLLMTKYTSLFWS
        KK+RDEWKETGCTILC+SWSDG+TKSFL+IS+TCSKGTLFLKSV+ISG EDDATYLSDLLETIVLEVGVENVVQ+ITDATASYVYAGRLLMTKYTSLFWS
Subjt:  KKHRDEWKETGCTILCDSWSDGQTKSFLVISVTCSKGTLFLKSVDISGHEDDATYLSDLLETIVLEVGVENVVQIITDATASYVYAGRLLMTKYTSLFWS

Query:  PCVSYCVNQMLEDISKIEWVSAVLEEAKIITRYIYSHASILNTMRKFTGGKELIRPRITRFVTNFLSLRSIVILEDNLKHMFAHSEWLSSIYSRRPDAQA
        PCVSYCVNQMLED+SKIEWV  VL+EAKII RY+YSHA IL+TMRKFT GKELIRPRITRFVTNFLSLRSIV LED LKHMFAHSEW SSIYSRRPDAQA
Subjt:  PCVSYCVNQMLEDISKIEWVSAVLEEAKIITRYIYSHASILNTMRKFTGGKELIRPRITRFVTNFLSLRSIVILEDNLKHMFAHSEWLSSIYSRRPDAQA

Query:  IISLLYLDRFWKDAHEAINICEPLIRILRIVDGDMPAMGYIFEGIERAKVEIKTYYNGFEDKYMPIWETIDRRWNLQLHTTLHTAAAFLNPSVFYNPNFK
        I+S LYLDRFWKDA EA+NI EPLIRILR+VDGDMPAMGYI+EGIERAKVE+K YYNG EDKYMPIW+TIDRRWNLQLHTTLHTAAAFLNPS+FYNPNFK
Subjt:  IISLLYLDRFWKDAHEAINICEPLIRILRIVDGDMPAMGYIFEGIERAKVEIKTYYNGFEDKYMPIWETIDRRWNLQLHTTLHTAAAFLNPSVFYNPNFK

Query:  IDLRIRNGFQEAMLKMATTDKDKMEITREHPAYVNGQGALGTDFAILGRTINAPGDWWSGYGYEIPTLQRAAVRILSQPCSSYGCSGWNWSTFETLHSKK
        IDLRIRNGFQEAMLKMATTDKDKMEITREHPAYVN QGALGTDFAILGRTIN PGDWWSGYGYEIPTLQR A+RILSQPCSSYGCS WNWSTFETLHSKK
Subjt:  IDLRIRNGFQEAMLKMATTDKDKMEITREHPAYVNGQGALGTDFAILGRTINAPGDWWSGYGYEIPTLQRAAVRILSQPCSSYGCSGWNWSTFETLHSKK

Query:  HSRAEQEKLTDLVFVQCNLWLQHVCLTRDSKYKPVVFDDVDVSLEWPSELECSAHVLDDSWLDNLPLECRGSP
         SR EQEKL DLVFVQCNLWLQH+  TRD KYKPVVFDD+DVSLEWP+ELE SA VLDDSWLDNLPLEC GSP
Subjt:  HSRAEQEKLTDLVFVQCNLWLQHVCLTRDSKYKPVVFDDVDVSLEWPSELECSAHVLDDSWLDNLPLECRGSP

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G79740.1 hAT transposon superfamily4.7e-11834.04Show/hide
Query:  MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPCTEVPTDVRDHIQGILSTPKKQKAPKKPKVDMETATNGQQHSSSASGG
        MVR +D CWE+   +D    KV+C +C R  +GG+ R+K HL+++ +K + PC +V  DV D ++ ILS         K K     +      +S     
Subjt:  MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPCTEVPTDVRDHIQGILSTPKKQKAPKKPKVDMETATNGQQHSSSASGG

Query:  IHHGSSGQNESNCPSTFPCLSPSAQPPIDDAQKQKRDETDKKVAIFFFHNSIPFSAAKSLYYQEMVDAIAEYGGGYKAPSYEKLKSTLLDKVKGDIHSSY
                        FP   P+AQ           D  ++ +++FFF N I F+ A+S  Y  M+DA+A+ G G+ APS    K+  LD+VK DI    
Subjt:  IHHGSSGQNESNCPSTFPCLSPSAQPPIDDAQKQKRDETDKKVAIFFFHNSIPFSAAKSLYYQEMVDAIAEYGGGYKAPSYEKLKSTLLDKVKGDIHSSY

Query:  KKHRDEWKETGCTILCDSWSDGQTKSFLVISVTCSKGTLFLKSVDISGHEDDATYLSDLLETIVLEVGVENVVQIITDATASYVYAGRLLMTKYTSLFWS
        K    EW  TGCTI+ ++W+D ++++ +  SV+      F KSVD S +  ++  L+DL ++++ ++G E++VQII D +  Y      L+  Y ++F S
Subjt:  KKHRDEWKETGCTILCDSWSDGQTKSFLVISVTCSKGTLFLKSVDISGHEDDATYLSDLLETIVLEVGVENVVQIITDATASYVYAGRLLMTKYTSLFWS

Query:  PCVSYCVNQMLEDISKIEWVSAVLEEAKIITRYIYSHASILNTMRKFTGGKELIRPRITRFVTNFLSLRSIVILEDNLKHMFAHSEWLSSIYSRRPDAQA
        PC S C+N +LE+ SK++WV+  + +A++I++++Y+++ +L+ +RK TGG+++IR  +TR V+NFLSL+S++  +  LKHMF   E+ ++  + +P + +
Subjt:  PCVSYCVNQMLEDISKIEWVSAVLEEAKIITRYIYSHASILNTMRKFTGGKELIRPRITRFVTNFLSLRSIVILEDNLKHMFAHSEWLSSIYSRRPDAQA

Query:  IISLLYLDRFWKDAHEAINICEPLIRILRIVDGDMPAMGYIFEGIERAKVEIKTYYNGFEDKYMPIWETIDRRWNLQLHTTLHTAAAFLNPSVFYNPNFK
         +++L  + FW+   E++ I EP++++LR V    PA+G I+E + +AK  I+TYY   E+K+    + +D  W   LH+ LH AAAFLNPS+ YNP  K
Subjt:  IISLLYLDRFWKDAHEAINICEPLIRILRIVDGDMPAMGYIFEGIERAKVEIKTYYNGFEDKYMPIWETIDRRWNLQLHTTLHTAAAFLNPSVFYNPNFK

Query:  IDLRIRNGFQEAMLKMATTDKDKMEITREHPAYVNGQGALGTDFAILGRTINAPGDWWSGYGYEIPTLQRAAVRILSQPCSSYGCSGWNWSTFETLHSKK
            ++  F + + K+  T   + +IT +   +   +G  G + A+  R   +PG WW  +G   P LQR A+RILSQ CS Y      WSTF+ +H ++
Subjt:  IDLRIRNGFQEAMLKMATTDKDKMEITREHPAYVNGQGALGTDFAILGRTINAPGDWWSGYGYEIPTLQRAAVRILSQPCSSYGCSGWNWSTFETLHSKK

Query:  HSRAEQEKLTDLVFVQCNLWL-QHVCLTRDSKYKPVVFDDVDVSLEWPSELECSAHVLDDSWLD
         ++ ++E L  L +V  NL L + + L  D    P+  +D+D+  EW  E E  +      WLD
Subjt:  HSRAEQEKLTDLVFVQCNLWL-QHVCLTRDSKYKPVVFDDVDVSLEWPSELECSAHVLDDSWLD

AT3G17450.1 hAT dimerisation domain-containing protein1.4e-9330.49Show/hide
Query:  DACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPCTEVPTDVRDHIQGILSTPKKQKAPKKPKVDMETAT-----------------
        D  WEH +  D  ++KV+CNYC +  SGG+ R K HLA+I   ++ PC   P +V   I+  +   +  K   +P  +M   T                 
Subjt:  DACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPCTEVPTDVRDHIQGILSTPKKQKAPKKPKVDMETAT-----------------

Query:  -------------NGQ----QHSSSASGGIHHGSSGQNESNCPSTFPCLSPSAQPPIDDAQKQK---RDETDKKVAIFFFHNSIPFSAAKSLYYQEMVDA
                     NG+    +  S  S  +   S  + +      F   S S Q  +  +   +   R +    ++ F  H  +P  AA SLY+Q+M++ 
Subjt:  -------------NGQ----QHSSSASGGIHHGSSGQNESNCPSTFPCLSPSAQPPIDDAQKQK---RDETDKKVAIFFFHNSIPFSAAKSLYYQEMVDA

Query:  IAEYGGGYKAPSYEKLKSTLLDKVKGDIHSSYKKHRDEWKETGCTILCDSWSDGQTKSFLVISVTCSKGTLFLKSVDISGHEDDATYLSDLLETIVLEVG
        I  YG G+  PS +     LL +    I S  +++R  W  TGC+I+ D+W++ + K  +   V+C +G  F  S+D +   +DA  L   L+ +V ++G
Subjt:  IAEYGGGYKAPSYEKLKSTLLDKVKGDIHSSYKKHRDEWKETGCTILCDSWSDGQTKSFLVISVTCSKGTLFLKSVDISGHEDDATYLSDLLETIVLEVG

Query:  VENVVQIITDATASYVYAGRLLMTKYTSLFWSPCVSYCVNQMLEDISKIEWVSAVLEEAKIITRYIYSHASILNTMR-KFTGGKELIRPRITRFVTNFLS
         ENVVQ+IT  TA +  AG+LL  K  +L+W+PC  +C   +LED SK+E+VS  LE+A+ ITR+IY+   +LN M+ +FT G +L+RP + R  + F +
Subjt:  VENVVQIITDATASYVYAGRLLMTKYTSLFWSPCVSYCVNQMLEDISKIEWVSAVLEEAKIITRYIYSHASILNTMR-KFTGGKELIRPRITRFVTNFLS

Query:  LRSIVILEDNLKHMFAHSEW-LSSIYSRRPDAQAIISLLYLDRFWKDAHEAINICEPLIRILRIVD--GDMPAMGYIFEGIERAKVEIKTYYNGFEDKYM
        L+S++  + +L+ +F    W LS   ++  + + +  ++    FWK     +   +P+++++ +++  GD  +M Y +  +  AK+ IK+ ++    KY 
Subjt:  LRSIVILEDNLKHMFAHSEW-LSSIYSRRPDAQAIISLLYLDRFWKDAHEAINICEPLIRILRIVD--GDMPAMGYIFEGIERAKVEIKTYYNGFEDKYM

Query:  PIWETIDRRWNLQLHTTLHTAAAFLNPSVFYNPNFKIDLRIRNGFQEAMLKMATTDKDKMEITREHPAYVNGQGALGTDFAILGRTINAPGDWWSGYGYE
        P W  I+ RWN   H  L+ AA F NP+  Y P+F     +  G  E ++++   +  ++    + P Y   +   GTD AI  RT   P  WW  +G  
Subjt:  PIWETIDRRWNLQLHTTLHTAAAFLNPSVFYNPNFKIDLRIRNGFQEAMLKMATTDKDKMEITREHPAYVNGQGALGTDFAILGRTINAPGDWWSGYGYE

Query:  IPTLQRAAVRILSQPCSSYGCSGWNWSTFETLHSKKHSRAEQEKLTDLVFVQCNLWLQHVCLTRDSKYK
           LQR AVRILS  CSS GC    WS ++ ++S+  S+  ++   DL +V  NL L+   L +   Y+
Subjt:  IPTLQRAAVRILSQPCSSYGCSGWNWSTFETLHSKKHSRAEQEKLTDLVFVQCNLWLQHVCLTRDSKYK

AT3G22220.1 hAT transposon superfamily4.2e-9031.06Show/hide
Query:  RDACWEHC-VLVDATRQKVRCNYCQREF-SGGVYRMKFHLAQIKNKDIVPCTEVPTDVRDHIQGIL-STPKKQKAPKK-----------PKVDMET----
        +D+ W+HC V     R ++RC YC++ F  GG+ R+K HLA  K +  + C +VP +VR  +Q  +  T ++Q+  +K           P  ++ET    
Subjt:  RDACWEHC-VLVDATRQKVRCNYCQREF-SGGVYRMKFHLAQIKNKDIVPCTEVPTDVRDHIQGIL-STPKKQKAPKK-----------PKVDMET----

Query:  ---ATNG-QQHSSSASGGIHHGSSGQN--------------------ESNCPSTFPCLSPSAQPPIDDAQKQKRDETDKKVAIFFFHNSIPFSAAKSLYY
             NG +  SS    G   G + Q                     + +  +  P    S +  +    K++       +  F F     F AA S+  
Subjt:  ---ATNG-QQHSSSASGGIHHGSSGQN--------------------ESNCPSTFPCLSPSAQPPIDDAQKQKRDETDKKVAIFFFHNSIPFSAAKSLYY

Query:  QEMVDAIAEYGGGYKAPSYEKLKSTLLDKVKGDIHSSYKKHRDEWKETGCTILCDSWSDGQTKSFLVISVTCSKGTLFLKSVDISGHEDDATYLSDLLET
        Q  +DAI   G G   P++E L+  +L     ++     + +  WK TGC++L    +  +    L   V C +  +FLKSVD S   D    L +LL+ 
Subjt:  QEMVDAIAEYGGGYKAPSYEKLKSTLLDKVKGDIHSSYKKHRDEWKETGCTILCDSWSDGQTKSFLVISVTCSKGTLFLKSVDISGHEDDATYLSDLLET

Query:  IVLEVGVENVVQIITDATASYVYAGRLLMTKYTSLFWSPCVSYCVNQMLEDISKIEWVSAVLEEAKIITRYIYSHASILNTMRKFTGGKELIRPRITRFV
        +V E+G  NVVQ+IT     Y  AG+ LM  Y SL+W PC ++C+++MLE+  K++W+  ++E+A+ +TR IY+H+ +LN MRKFT G ++++P  T   
Subjt:  IVLEVGVENVVQIITDATASYVYAGRLLMTKYTSLFWSPCVSYCVNQMLEDISKIEWVSAVLEEAKIITRYIYSHASILNTMRKFTGGKELIRPRITRFV

Query:  TNFLSLRSIVILEDNLKHMFAHSEWLSSIYSRRPDAQAIISLLYLDRFWKDAHEAINICEPLIRILRIVDGD-MPAMGYIFEGIERAKVEIKTYYNGFED
        TNF ++  I  L+  L+ M   SEW    YS+     A+   +  + FWK    A +I  P++R+LRIV  +  PAMGY++  + RAK  IKT      +
Subjt:  TNFLSLRSIVILEDNLKHMFAHSEWLSSIYSRRPDAQAIISLLYLDRFWKDAHEAINICEPLIRILRIVDGD-MPAMGYIFEGIERAKVEIKTYYNGFED

Query:  KYMPIWETIDRRWNLQLHTTLHTAAAFLNPSVFYNPNFKIDLRIRNGFQEAMLKMATTDKDKMEITREHPAYVNGQGALGTDFAILGRTINAPGDWWSGY
        +Y+  W+ IDR W   L   L+ A  +LNP  FY+ + ++   I     + + K+      +  + ++  +Y N  G  G + AI  R    P +WWS Y
Subjt:  KYMPIWETIDRRWNLQLHTTLHTAAAFLNPSVFYNPNFKIDLRIRNGFQEAMLKMATTDKDKMEITREHPAYVNGQGALGTDFAILGRTINAPGDWWSGY

Query:  GYEIPTLQRAAVRILSQPCSSYGCSGWNWSTFETLHSKKHSRAEQEKLTDLVFVQCNLWLQHVC--LTRDSKYKPVVFDDVDVSLEWPS
        G     L R A+RILSQ CSS   S  N ++   ++  K+S  E+++L DLVFVQ N+ L+ +    + D    P+   +++V  +W S
Subjt:  GYEIPTLQRAAVRILSQPCSSYGCSGWNWSTFETLHSKKHSRAEQEKLTDLVFVQCNLWLQHVC--LTRDSKYKPVVFDDVDVSLEWPS

AT4G15020.1 hAT transposon superfamily1.7e-9632.8Show/hide
Query:  RDACWEHCVLVD-ATRQKVRCNYCQREF-SGGVYRMKFHLAQIKNKDIVPCTEVPTDVRDHIQGILS-----TPKKQKAPKKP-----------------
        +D  W+HC +     R ++RC YC++ F  GG+ R+K HLA  K +  + C +VP DVR  +Q  +        K+ K+  +P                 
Subjt:  RDACWEHCVLVD-ATRQKVRCNYCQREF-SGGVYRMKFHLAQIKNKDIVPCTEVPTDVRDHIQGILS-----TPKKQKAPKKP-----------------

Query:  ---------------KVDMETATNG---QQHSSSASGGIHHGSSGQNES----NCPSTFPCLSPSAQPPIDDAQKQKRDETDKKVAIFFFHNSIPFSAAK
                        V  E+  +G   Q+   S      +GS+  N      +  +  P    S +  +  + + + +     +  F F     F A  
Subjt:  ---------------KVDMETATNG---QQHSSSASGGIHHGSSGQNES----NCPSTFPCLSPSAQPPIDDAQKQKRDETDKKVAIFFFHNSIPFSAAK

Query:  SLYYQEMVDAIAEYGGGYKAPSYEKLKSTLLDKVKGDIHSSYKKHRDEWKETGCTILCDSWSDGQTKSFLVISVTCSKGTLFLKSVDISGHEDDATYLSD
        S+ +Q M+DAIA  G G  AP+++ L+  +L     ++     + +  WK TGC+IL +  +  +    L   V C +  +FLKSVD S     A  L +
Subjt:  SLYYQEMVDAIAEYGGGYKAPSYEKLKSTLLDKVKGDIHSSYKKHRDEWKETGCTILCDSWSDGQTKSFLVISVTCSKGTLFLKSVDISGHEDDATYLSD

Query:  LLETIVLEVGVENVVQIITDATASYVYAGRLLMTKYTSLFWSPCVSYCVNQMLEDISKIEWVSAVLEEAKIITRYIYSHASILNTMRKFTGGKELIRPRI
        LL  +V EVG  NVVQ+IT     YV AG+ LM  Y SL+W PC ++C++QMLE+  K+ W+S  +E+A+ ITR++Y+H+ +LN M KFT G +++ P  
Subjt:  LLETIVLEVGVENVVQIITDATASYVYAGRLLMTKYTSLFWSPCVSYCVNQMLEDISKIEWVSAVLEEAKIITRYIYSHASILNTMRKFTGGKELIRPRI

Query:  TRFVTNFLSLRSIVILEDNLKHMFAHSEWLSSIYSRRPDAQAIISLLYLDRFWKDAHEAINICEPLIRILRIVDGD-MPAMGYIFEGIERAKVEIKTYYN
        +   TNF +L  I  L+ NL+ M   +EW    YS  P    +++ L  + FWK      ++  PL+R LRIV  +  PAMGY++  + RAK  IKT+  
Subjt:  TRFVTNFLSLRSIVILEDNLKHMFAHSEWLSSIYSRRPDAQAIISLLYLDRFWKDAHEAINICEPLIRILRIVDGD-MPAMGYIFEGIERAKVEIKTYYN

Query:  GFEDKYMPIWETIDRRWNLQLHTTLHTAAAFLNPSVFYNPNFKIDLRIRNGFQEAMLKMATTDKDKMEITREHPAYVNGQGALGTDFAILGRTINAPGDW
          ED Y+  W+ IDR W  Q H  L  A  FLNP +FYN N +I   +     + + ++   DK + +I +E  +Y    G  G + AI  R    P +W
Subjt:  GFEDKYMPIWETIDRRWNLQLHTTLHTAAAFLNPSVFYNPNFKIDLRIRNGFQEAMLKMATTDKDKMEITREHPAYVNGQGALGTDFAILGRTINAPGDW

Query:  WSGYGYEIPTLQRAAVRILSQPCSSYGCSGWNWSTFETLHSKKHSRAEQEKLTDLVFVQCNLWLQHVCL-TRDSKYKPVVFDDVDVSLEWPS
        WS YG     L R A+RILSQ CSS      N    E ++  K+S  EQ++L+DLVFVQ N+ L+ +   + D    P+  + +DV  EW S
Subjt:  WSGYGYEIPTLQRAAVRILSQPCSSYGCSGWNWSTFETLHSKKHSRAEQEKLTDLVFVQCNLWLQHVCL-TRDSKYKPVVFDDVDVSLEWPS

AT4G15020.2 hAT transposon superfamily1.7e-9632.8Show/hide
Query:  RDACWEHCVLVD-ATRQKVRCNYCQREF-SGGVYRMKFHLAQIKNKDIVPCTEVPTDVRDHIQGILS-----TPKKQKAPKKP-----------------
        +D  W+HC +     R ++RC YC++ F  GG+ R+K HLA  K +  + C +VP DVR  +Q  +        K+ K+  +P                 
Subjt:  RDACWEHCVLVD-ATRQKVRCNYCQREF-SGGVYRMKFHLAQIKNKDIVPCTEVPTDVRDHIQGILS-----TPKKQKAPKKP-----------------

Query:  ---------------KVDMETATNG---QQHSSSASGGIHHGSSGQNES----NCPSTFPCLSPSAQPPIDDAQKQKRDETDKKVAIFFFHNSIPFSAAK
                        V  E+  +G   Q+   S      +GS+  N      +  +  P    S +  +  + + + +     +  F F     F A  
Subjt:  ---------------KVDMETATNG---QQHSSSASGGIHHGSSGQNES----NCPSTFPCLSPSAQPPIDDAQKQKRDETDKKVAIFFFHNSIPFSAAK

Query:  SLYYQEMVDAIAEYGGGYKAPSYEKLKSTLLDKVKGDIHSSYKKHRDEWKETGCTILCDSWSDGQTKSFLVISVTCSKGTLFLKSVDISGHEDDATYLSD
        S+ +Q M+DAIA  G G  AP+++ L+  +L     ++     + +  WK TGC+IL +  +  +    L   V C +  +FLKSVD S     A  L +
Subjt:  SLYYQEMVDAIAEYGGGYKAPSYEKLKSTLLDKVKGDIHSSYKKHRDEWKETGCTILCDSWSDGQTKSFLVISVTCSKGTLFLKSVDISGHEDDATYLSD

Query:  LLETIVLEVGVENVVQIITDATASYVYAGRLLMTKYTSLFWSPCVSYCVNQMLEDISKIEWVSAVLEEAKIITRYIYSHASILNTMRKFTGGKELIRPRI
        LL  +V EVG  NVVQ+IT     YV AG+ LM  Y SL+W PC ++C++QMLE+  K+ W+S  +E+A+ ITR++Y+H+ +LN M KFT G +++ P  
Subjt:  LLETIVLEVGVENVVQIITDATASYVYAGRLLMTKYTSLFWSPCVSYCVNQMLEDISKIEWVSAVLEEAKIITRYIYSHASILNTMRKFTGGKELIRPRI

Query:  TRFVTNFLSLRSIVILEDNLKHMFAHSEWLSSIYSRRPDAQAIISLLYLDRFWKDAHEAINICEPLIRILRIVDGD-MPAMGYIFEGIERAKVEIKTYYN
        +   TNF +L  I  L+ NL+ M   +EW    YS  P    +++ L  + FWK      ++  PL+R LRIV  +  PAMGY++  + RAK  IKT+  
Subjt:  TRFVTNFLSLRSIVILEDNLKHMFAHSEWLSSIYSRRPDAQAIISLLYLDRFWKDAHEAINICEPLIRILRIVDGD-MPAMGYIFEGIERAKVEIKTYYN

Query:  GFEDKYMPIWETIDRRWNLQLHTTLHTAAAFLNPSVFYNPNFKIDLRIRNGFQEAMLKMATTDKDKMEITREHPAYVNGQGALGTDFAILGRTINAPGDW
          ED Y+  W+ IDR W  Q H  L  A  FLNP +FYN N +I   +     + + ++   DK + +I +E  +Y    G  G + AI  R    P +W
Subjt:  GFEDKYMPIWETIDRRWNLQLHTTLHTAAAFLNPSVFYNPNFKIDLRIRNGFQEAMLKMATTDKDKMEITREHPAYVNGQGALGTDFAILGRTINAPGDW

Query:  WSGYGYEIPTLQRAAVRILSQPCSSYGCSGWNWSTFETLHSKKHSRAEQEKLTDLVFVQCNLWLQHVCL-TRDSKYKPVVFDDVDVSLEWPS
        WS YG     L R A+RILSQ CSS      N    E ++  K+S  EQ++L+DLVFVQ N+ L+ +   + D    P+  + +DV  EW S
Subjt:  WSGYGYEIPTLQRAAVRILSQPCSSYGCSGWNWSTFETLHSKKHSRAEQEKLTDLVFVQCNLWLQHVCL-TRDSKYKPVVFDDVDVSLEWPS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTTCGAGGAAGGGATGCTTGTTGGGAGCATTGTGTTCTTGTTGATGCAACACGACAGAAGGTTCGATGTAATTATTGTCAGCGGGAATTCAGTGGAGGTGTATACAG
GATGAAATTTCATTTGGCTCAAATAAAAAACAAAGATATTGTTCCATGTACTGAAGTACCAACCGATGTTCGAGACCACATTCAAGGTATATTAAGCACTCCTAAGAAAC
AGAAGGCACCTAAGAAACCAAAGGTGGATATGGAAACTGCAACAAATGGACAGCAACATAGCTCCTCTGCTAGTGGTGGCATCCATCATGGATCCAGTGGTCAGAATGAA
AGCAACTGCCCATCGACGTTTCCGTGCCTTTCACCAAGTGCACAACCACCAATTGATGATGCTCAAAAGCAGAAGAGGGATGAGACTGATAAAAAAGTTGCCATCTTTTT
CTTCCATAATTCTATTCCTTTCAGTGCTGCCAAGTCTTTGTATTATCAGGAAATGGTGGATGCAATAGCAGAATATGGAGGAGGATACAAAGCACCAAGTTATGAGAAAT
TAAAATCTACTCTTTTGGATAAAGTGAAAGGTGACATACATAGTTCTTACAAAAAGCATAGAGATGAATGGAAAGAAACAGGCTGTACTATCCTGTGTGATAGTTGGTCC
GATGGACAGACCAAATCATTTCTAGTCATTTCTGTTACTTGTTCTAAAGGAACACTGTTTCTGAAGTCGGTCGATATATCAGGTCATGAAGATGATGCAACTTACCTGTC
CGACTTGCTTGAGACCATCGTCCTTGAGGTTGGAGTGGAGAATGTTGTCCAAATTATAACAGATGCTACTGCCAGTTATGTCTATGCTGGGAGGCTTCTCATGACCAAGT
ACACTTCCTTATTTTGGTCTCCATGTGTTTCTTATTGTGTTAATCAGATGTTGGAAGACATCAGTAAAATCGAGTGGGTCAGTGCAGTATTGGAGGAGGCAAAGATCATC
ACCCGGTACATTTATAGTCATGCGTCAATTTTGAATACCATGCGAAAATTCACTGGGGGAAAGGAATTAATCAGGCCAAGAATTACTAGATTTGTGACTAATTTTCTCTC
TTTGAGGTCCATTGTGATTCTTGAGGACAATCTCAAACACATGTTTGCTCATTCAGAGTGGCTGTCCTCAATTTATAGCAGGCGTCCTGATGCACAAGCAATTATTTCCT
TGCTGTATTTGGATAGATTTTGGAAGGATGCACATGAAGCTATCAACATTTGTGAACCACTTATTAGAATTCTGAGAATTGTCGATGGAGACATGCCTGCCATGGGCTAT
ATATTTGAAGGAATAGAGAGGGCAAAGGTTGAAATCAAAACATATTACAATGGCTTTGAGGATAAATATATGCCTATTTGGGAAACAATCGACCGGAGATGGAATTTGCA
GCTTCACACAACGTTGCACACAGCAGCAGCGTTTCTTAACCCGTCTGTTTTTTACAATCCAAACTTTAAGATTGATCTGAGAATTAGAAATGGATTTCAAGAAGCTATGT
TGAAGATGGCGACAACTGATAAAGATAAAATGGAGATCACTAGAGAACATCCTGCTTATGTAAATGGGCAAGGTGCTCTTGGTACCGACTTCGCTATCTTGGGGAGAACT
ATAAATGCCCCAGGTGATTGGTGGTCTGGGTACGGTTATGAGATCCCCACACTCCAGAGAGCGGCGGTACGAATACTAAGCCAACCTTGTAGTTCTTATGGGTGCAGTGG
ATGGAACTGGAGCACATTCGAAACCTTACATTCAAAGAAGCATAGTAGAGCCGAACAGGAAAAGTTGACTGATTTAGTGTTTGTACAGTGCAATCTTTGGTTGCAACACG
TTTGTTTGACTCGGGATAGTAAATATAAACCCGTTGTATTTGATGATGTAGATGTGAGTTTAGAATGGCCTTCCGAGTTGGAATGCTCAGCTCATGTTTTAGATGATTCA
TGGTTGGATAATCTGCCTCTTGAATGTAGAGGCAGTCCTTAA
mRNA sequenceShow/hide mRNA sequence
TTTTCTTCCCGGAGTCCCGGCGGCCGGAATCTCTTCCCTCCAACTTCAGACTTTCCGATGTTACTAACCTTCTATTTCTGATCTTCTCTTTCTCACTAATTCTATCTTCT
CAGGGGCACAAAATATTGTATACCTTCTGCGATGGTAAAGGCATTCTACTAAACCATATATTCACAGATTGTGATATTTCCTATAGACAATGGTTCGAGGAAGGGATGCT
TGTTGGGAGCATTGTGTTCTTGTTGATGCAACACGACAGAAGGTTCGATGTAATTATTGTCAGCGGGAATTCAGTGGAGGTGTATACAGGATGAAATTTCATTTGGCTCA
AATAAAAAACAAAGATATTGTTCCATGTACTGAAGTACCAACCGATGTTCGAGACCACATTCAAGGTATATTAAGCACTCCTAAGAAACAGAAGGCACCTAAGAAACCAA
AGGTGGATATGGAAACTGCAACAAATGGACAGCAACATAGCTCCTCTGCTAGTGGTGGCATCCATCATGGATCCAGTGGTCAGAATGAAAGCAACTGCCCATCGACGTTT
CCGTGCCTTTCACCAAGTGCACAACCACCAATTGATGATGCTCAAAAGCAGAAGAGGGATGAGACTGATAAAAAAGTTGCCATCTTTTTCTTCCATAATTCTATTCCTTT
CAGTGCTGCCAAGTCTTTGTATTATCAGGAAATGGTGGATGCAATAGCAGAATATGGAGGAGGATACAAAGCACCAAGTTATGAGAAATTAAAATCTACTCTTTTGGATA
AAGTGAAAGGTGACATACATAGTTCTTACAAAAAGCATAGAGATGAATGGAAAGAAACAGGCTGTACTATCCTGTGTGATAGTTGGTCCGATGGACAGACCAAATCATTT
CTAGTCATTTCTGTTACTTGTTCTAAAGGAACACTGTTTCTGAAGTCGGTCGATATATCAGGTCATGAAGATGATGCAACTTACCTGTCCGACTTGCTTGAGACCATCGT
CCTTGAGGTTGGAGTGGAGAATGTTGTCCAAATTATAACAGATGCTACTGCCAGTTATGTCTATGCTGGGAGGCTTCTCATGACCAAGTACACTTCCTTATTTTGGTCTC
CATGTGTTTCTTATTGTGTTAATCAGATGTTGGAAGACATCAGTAAAATCGAGTGGGTCAGTGCAGTATTGGAGGAGGCAAAGATCATCACCCGGTACATTTATAGTCAT
GCGTCAATTTTGAATACCATGCGAAAATTCACTGGGGGAAAGGAATTAATCAGGCCAAGAATTACTAGATTTGTGACTAATTTTCTCTCTTTGAGGTCCATTGTGATTCT
TGAGGACAATCTCAAACACATGTTTGCTCATTCAGAGTGGCTGTCCTCAATTTATAGCAGGCGTCCTGATGCACAAGCAATTATTTCCTTGCTGTATTTGGATAGATTTT
GGAAGGATGCACATGAAGCTATCAACATTTGTGAACCACTTATTAGAATTCTGAGAATTGTCGATGGAGACATGCCTGCCATGGGCTATATATTTGAAGGAATAGAGAGG
GCAAAGGTTGAAATCAAAACATATTACAATGGCTTTGAGGATAAATATATGCCTATTTGGGAAACAATCGACCGGAGATGGAATTTGCAGCTTCACACAACGTTGCACAC
AGCAGCAGCGTTTCTTAACCCGTCTGTTTTTTACAATCCAAACTTTAAGATTGATCTGAGAATTAGAAATGGATTTCAAGAAGCTATGTTGAAGATGGCGACAACTGATA
AAGATAAAATGGAGATCACTAGAGAACATCCTGCTTATGTAAATGGGCAAGGTGCTCTTGGTACCGACTTCGCTATCTTGGGGAGAACTATAAATGCCCCAGGTGATTGG
TGGTCTGGGTACGGTTATGAGATCCCCACACTCCAGAGAGCGGCGGTACGAATACTAAGCCAACCTTGTAGTTCTTATGGGTGCAGTGGATGGAACTGGAGCACATTCGA
AACCTTACATTCAAAGAAGCATAGTAGAGCCGAACAGGAAAAGTTGACTGATTTAGTGTTTGTACAGTGCAATCTTTGGTTGCAACACGTTTGTTTGACTCGGGATAGTA
AATATAAACCCGTTGTATTTGATGATGTAGATGTGAGTTTAGAATGGCCTTCCGAGTTGGAATGCTCAGCTCATGTTTTAGATGATTCATGGTTGGATAATCTGCCTCTT
GAATGTAGAGGCAGTCCTTAAATATTTTTAAAGCAATCAGAAACAGATAGTTGATCCCTCTGATTGCATATCATTTCCAAATTTTCCATTCAAGTGTTCATAGATATCAT
CAAATAGAAGTTGTAAATTAGATAATCAAGCTCAAACTTGATATGTAAATTAGAGGGCACAACAATGGGCAATTTCCAAAATATGCTGTGGAAATCTTAGATAAATTCCC
AAGG
Protein sequenceShow/hide protein sequence
MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPCTEVPTDVRDHIQGILSTPKKQKAPKKPKVDMETATNGQQHSSSASGGIHHGSSGQNE
SNCPSTFPCLSPSAQPPIDDAQKQKRDETDKKVAIFFFHNSIPFSAAKSLYYQEMVDAIAEYGGGYKAPSYEKLKSTLLDKVKGDIHSSYKKHRDEWKETGCTILCDSWS
DGQTKSFLVISVTCSKGTLFLKSVDISGHEDDATYLSDLLETIVLEVGVENVVQIITDATASYVYAGRLLMTKYTSLFWSPCVSYCVNQMLEDISKIEWVSAVLEEAKII
TRYIYSHASILNTMRKFTGGKELIRPRITRFVTNFLSLRSIVILEDNLKHMFAHSEWLSSIYSRRPDAQAIISLLYLDRFWKDAHEAINICEPLIRILRIVDGDMPAMGY
IFEGIERAKVEIKTYYNGFEDKYMPIWETIDRRWNLQLHTTLHTAAAFLNPSVFYNPNFKIDLRIRNGFQEAMLKMATTDKDKMEITREHPAYVNGQGALGTDFAILGRT
INAPGDWWSGYGYEIPTLQRAAVRILSQPCSSYGCSGWNWSTFETLHSKKHSRAEQEKLTDLVFVQCNLWLQHVCLTRDSKYKPVVFDDVDVSLEWPSELECSAHVLDDS
WLDNLPLECRGSP