; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

ClCG10G003060 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene IDClCG10G003060
OrganismCitrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
DescriptionHAT transposon superfamily
Genome locationCG_Chr10:3817546..3833769
RNA-Seq ExpressionClCG10G003060
SyntenyClCG10G003060
Gene Ontology termsGO:0005634 - nucleus (cellular component)
GO:0003677 - DNA binding (molecular function)
GO:0046872 - metal ion binding (molecular function)
GO:0046983 - protein dimerization activity (molecular function)
InterPro domainsIPR007021 - Domain of unknown function DUF659
IPR008906 - HAT, C-terminal dimerisation domain
IPR012337 - Ribonuclease H-like superfamily
IPR016140 - Bifunctional inhibitor/plant lipid transfer protein/seed storage helical domain
IPR036312 - Bifunctional inhibitor/plant lipid transfer protein/seed storage helical domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008448901.1 PREDICTED: uncharacterized protein LOC103490927 [Cucumis melo]0.0e+0093.42Show/hide
Query:  IVSGNSVEKAPKKPKVDMETATNGQQHSSSASGGIHHGSSGQNESNCPSTFPCPSPSAQPPIDDAQKQKKDETDKKVAIFFFHNSIPFSAAKSLYYQEMV
        I+S    +KAPKKPKVDMETATNGQQHSSSASGGIHHGSSGQNESNCPSTFPC SPSAQPPIDDAQKQKKDETDKKVAIFFFHNSIPFSAAKSLYYQEMV
Subjt:  IVSGNSVEKAPKKPKVDMETATNGQQHSSSASGGIHHGSSGQNESNCPSTFPCPSPSAQPPIDDAQKQKKDETDKKVAIFFFHNSIPFSAAKSLYYQEMV

Query:  DAIAEHGVGYKAPSYEKLKSTLLDKVKGDIHNSYRKYRDEWKETGCTILCDSWSDGRTKSFLVISITCSKGTLFLKSVDISGREDDATYLSDLLETIVLE
        DAIAE+G GYKAPSYEKLKSTLLDKVKGDIH+SY+K RDEWKETGCTILCDSWSDGRTKSFLVIS+TCSKGTLFLKSVD SG EDDATYLSDLLETIVLE
Subjt:  DAIAEHGVGYKAPSYEKLKSTLLDKVKGDIHNSYRKYRDEWKETGCTILCDSWSDGRTKSFLVISITCSKGTLFLKSVDISGREDDATYLSDLLETIVLE

Query:  VGVENVVQVITDATASYVYAGRLLMTKYTSLFWSPCVSYCVNQMLEDISKIEWVGTVLEEAKIITRYIYSHASILNTMRKFTSGKELIRPKITRFVTNFL
        VGVENVVQ+ITDATASYVYAGRLLMTKYTSLFWSPCVSYCVNQMLEDISKIEWV TVLEEAKIITRYIYSHASILNTMRKFT GKELIRP+ITRFVTNFL
Subjt:  VGVENVVQVITDATASYVYAGRLLMTKYTSLFWSPCVSYCVNQMLEDISKIEWVGTVLEEAKIITRYIYSHASILNTMRKFTSGKELIRPKITRFVTNFL

Query:  SLRSIVILEDNLKHMFAHSEWLSSIYSRRPDAQTIISLLYLDRFWKDAQEAVNISEPLIRILRIVDGDMPAMGYIFEGIERAKVEIKTYYNGIEDKYMPI
        SLRSIVILE+NLKHMFAHSEWLSSIYSRRPDAQ IISLLYLDRFWKDA EA+NI EPLIRILRIVDGDMPAMGYIFEGIERAKVEIKTYYNG EDKYMPI
Subjt:  SLRSIVILEDNLKHMFAHSEWLSSIYSRRPDAQTIISLLYLDRFWKDAQEAVNISEPLIRILRIVDGDMPAMGYIFEGIERAKVEIKTYYNGIEDKYMPI

Query:  WDTIDGRWNLQLHTTLHTAAAFLNPSVFYNPNFKIDLRIRNGFQEAMLKMATTDKDKMEITREHPAYVNGQGALGTDFAILGRTINAPGDWWSGYGYEIP
        W+TID RWNLQLHTTLHTAAAFLNPSVFYNPNFKIDLRIRNGFQEAMLKMATTDKDKMEITREHPAYVNGQGALGTDFAILGRTIN+PGDWWSGYGYEIP
Subjt:  WDTIDGRWNLQLHTTLHTAAAFLNPSVFYNPNFKIDLRIRNGFQEAMLKMATTDKDKMEITREHPAYVNGQGALGTDFAILGRTINAPGDWWSGYGYEIP

Query:  TLQRAAIRILSQPCSSYWRSRWNWSTFETLHSKKRSRAEQEKLNDLVFVQCNLWLQHICLTRDGKYKPVVFDDIDVSLEWPTEFETSAHVLDDSWLD-NL
        TLQRAA+RILSQPCSSY  SRWNWSTFETLHSKKRSRAEQEKL DLVFVQCNLWLQHIC TRD KYKP+VFDDIDVSLEWP+E E SAHVLDDSWLD NL
Subjt:  TLQRAAIRILSQPCSSYWRSRWNWSTFETLHSKKRSRAEQEKLNDLVFVQCNLWLQHICLTRDGKYKPVVFDDIDVSLEWPTEFETSAHVLDDSWLD-NL

Query:  PLECRGNP
        PLECRG+P
Subjt:  PLECRGNP

XP_011650424.1 uncharacterized protein LOC101222344 [Cucumis sativus]0.0e+0092.92Show/hide
Query:  IVSGNSVEKAPKKPKVDMETATNGQQHSSSASGGIHHGSSGQNESNCPSTFPCPSPSAQPPIDDAQKQKKDETDKKVAIFFFHNSIPFSAAKSLYYQEMV
        I+S    +KAPKKPKVDMETATNGQQHSSSASGGIHHGSSGQNESNCPST+PC SPSAQPPIDDAQKQKKDETDKKVAIFFFHNSIPFSAAKSLYYQEMV
Subjt:  IVSGNSVEKAPKKPKVDMETATNGQQHSSSASGGIHHGSSGQNESNCPSTFPCPSPSAQPPIDDAQKQKKDETDKKVAIFFFHNSIPFSAAKSLYYQEMV

Query:  DAIAEHGVGYKAPSYEKLKSTLLDKVKGDIHNSYRKYRDEWKETGCTILCDSWSDGRTKSFLVISITCSKGTLFLKSVDISGREDDATYLSDLLETIVLE
        DAIAE+G GYKAPSYEKLKSTLLDKVKGDIH+SY+K+RDEWKETGCTILCDSWSDG+TKSFLVIS+TCSKGTLFLKSVDISG EDDATYLSDLLETI+LE
Subjt:  DAIAEHGVGYKAPSYEKLKSTLLDKVKGDIHNSYRKYRDEWKETGCTILCDSWSDGRTKSFLVISITCSKGTLFLKSVDISGREDDATYLSDLLETIVLE

Query:  VGVENVVQVITDATASYVYAGRLLMTKYTSLFWSPCVSYCVNQMLEDISKIEWVGTVLEEAKIITRYIYSHASILNTMRKFTSGKELIRPKITRFVTNFL
        VGVENVVQ+ITDATASYVYAGRLLMTKYTSLFWSPCVSYCVNQMLEDISKIEWV  VLEEAKIITRYIYSHASILNTMRKFT GKELIRP+ITRFVTNFL
Subjt:  VGVENVVQVITDATASYVYAGRLLMTKYTSLFWSPCVSYCVNQMLEDISKIEWVGTVLEEAKIITRYIYSHASILNTMRKFTSGKELIRPKITRFVTNFL

Query:  SLRSIVILEDNLKHMFAHSEWLSSIYSRRPDAQTIISLLYLDRFWKDAQEAVNISEPLIRILRIVDGDMPAMGYIFEGIERAKVEIKTYYNGIEDKYMPI
        SLRSIVILEDNLKHMFAHSEWLSSIYSRRPDAQ IISLLYLDRFWKDA EA+NI EPLIRILRIVDGDMPAMGYIFEGIERAKVEIKTYYNG EDKYMPI
Subjt:  SLRSIVILEDNLKHMFAHSEWLSSIYSRRPDAQTIISLLYLDRFWKDAQEAVNISEPLIRILRIVDGDMPAMGYIFEGIERAKVEIKTYYNGIEDKYMPI

Query:  WDTIDGRWNLQLHTTLHTAAAFLNPSVFYNPNFKIDLRIRNGFQEAMLKMATTDKDKMEITREHPAYVNGQGALGTDFAILGRTINAPGDWWSGYGYEIP
        W+TID RWNLQLHTTLHTAAAFLNPSVFYNPNFKIDLRIRNGFQEAMLKMATTDKDKMEITREHPAYVNGQGALGTDFAILGRTINAPGDWWSGYGYEIP
Subjt:  WDTIDGRWNLQLHTTLHTAAAFLNPSVFYNPNFKIDLRIRNGFQEAMLKMATTDKDKMEITREHPAYVNGQGALGTDFAILGRTINAPGDWWSGYGYEIP

Query:  TLQRAAIRILSQPCSSYWRSRWNWSTFETLHSKKRSRAEQEKLNDLVFVQCNLWLQHICLTRDGKYKPVVFDDIDVSLEWPTEFETSAHVLDDSWLDNLP
        TLQRAA+RILSQPCSSY  S WNWSTFETLHSKK SRAEQEKL DLVFVQCNLWLQH+CLTRD KYKPVVFDD+DVSLEWP+E E SAHVLDDSWLDNLP
Subjt:  TLQRAAIRILSQPCSSYWRSRWNWSTFETLHSKKRSRAEQEKLNDLVFVQCNLWLQHICLTRDGKYKPVVFDDIDVSLEWPTEFETSAHVLDDSWLDNLP

Query:  LECRGNP
        LE RG+P
Subjt:  LECRGNP

XP_022923437.1 uncharacterized protein LOC111431132 [Cucurbita moschata]0.0e+0091.76Show/hide
Query:  IVSGNSVEKAPKKPKVDMETATNGQQHSSSASGGIHHGSSGQNESNCPSTFPCPSPSAQPPIDDAQKQKKDETDKKVAIFFFHNSIPFSAAKSLYYQEMV
        I+S    ++APKKPKVDMETATNGQQHSSSASGGIHHGSSGQNESNCPSTFP  SPSAQPPIDDAQKQKKDETDKKVA+FFFHNSIPFSAAKSLYYQEMV
Subjt:  IVSGNSVEKAPKKPKVDMETATNGQQHSSSASGGIHHGSSGQNESNCPSTFPCPSPSAQPPIDDAQKQKKDETDKKVAIFFFHNSIPFSAAKSLYYQEMV

Query:  DAIAEHGVGYKAPSYEKLKSTLLDKVKGDIHNSYRKYRDEWKETGCTILCDSWSDGRTKSFLVISITCSKGTLFLKSVDISGREDDATYLSDLLETIVLE
        +AIAE+GVGY+APSYEKLKSTLL KVKGDI NSY+KYRDEWKETGCTILC+SWSDGRTKSFL+ISITCSKGTLFLKSV+ISGREDDATYLSDLLETIVLE
Subjt:  DAIAEHGVGYKAPSYEKLKSTLLDKVKGDIHNSYRKYRDEWKETGCTILCDSWSDGRTKSFLVISITCSKGTLFLKSVDISGREDDATYLSDLLETIVLE

Query:  VGVENVVQVITDATASYVYAGRLLMTKYTSLFWSPCVSYCVNQMLEDISKIEWVGTVLEEAKIITRYIYSHASILNTMRKFTSGKELIRPKITRFVTNFL
        VGVENVVQVITDATASYVYAGRLLMTKYTSLFWSPCVSYCVNQMLED+SKIEWVGTVL+EAKII RY+YSHA ILNTMRKFTSGKELIRP+ITRFVTNFL
Subjt:  VGVENVVQVITDATASYVYAGRLLMTKYTSLFWSPCVSYCVNQMLEDISKIEWVGTVLEEAKIITRYIYSHASILNTMRKFTSGKELIRPKITRFVTNFL

Query:  SLRSIVILEDNLKHMFAHSEWLSSIYSRRPDAQTIISLLYLDRFWKDAQEAVNISEPLIRILRIVDGDMPAMGYIFEGIERAKVEIKTYYNGIEDKYMPI
        SLRSIV LED LKHMFAHSEW SSIYSRRPDAQ I+S LYLDRFWKDA+EAVNISEPLIRILR+VDGDMPAMGYI+EGIERAKVE+K YYNGIEDKYMPI
Subjt:  SLRSIVILEDNLKHMFAHSEWLSSIYSRRPDAQTIISLLYLDRFWKDAQEAVNISEPLIRILRIVDGDMPAMGYIFEGIERAKVEIKTYYNGIEDKYMPI

Query:  WDTIDGRWNLQLHTTLHTAAAFLNPSVFYNPNFKIDLRIRNGFQEAMLKMATTDKDKMEITREHPAYVNGQGALGTDFAILGRTINAPGDWWSGYGYEIP
        WDTID RWNLQLHTTLHTAAAFLNPS+FYNPNFKIDLRIRNGFQEAMLKMATTDKDKMEITREHPAYVN QGALGTDFAILGRTIN PGDWWSGYGYEIP
Subjt:  WDTIDGRWNLQLHTTLHTAAAFLNPSVFYNPNFKIDLRIRNGFQEAMLKMATTDKDKMEITREHPAYVNGQGALGTDFAILGRTINAPGDWWSGYGYEIP

Query:  TLQRAAIRILSQPCSSYWRSRWNWSTFETLHSKKRSRAEQEKLNDLVFVQCNLWLQHICLTRDGKYKPVVFDDIDVSLEWPTEFETSAHVLDDSWLDNLP
        TLQRAAIRILSQPCSSY  SRWNWSTFETLHSKKRS  EQEKLNDLVFVQCNLWLQHI  TRDGKYKPVVFDDIDVSLEWPTE E+SAHVLDDSWLDNLP
Subjt:  TLQRAAIRILSQPCSSYWRSRWNWSTFETLHSKKRSRAEQEKLNDLVFVQCNLWLQHICLTRDGKYKPVVFDDIDVSLEWPTEFETSAHVLDDSWLDNLP

Query:  LECRGNP
        LEC G+P
Subjt:  LECRGNP

XP_038876874.1 uncharacterized protein LOC120069237 isoform X1 [Benincasa hispida]0.0e+0096.05Show/hide
Query:  IVSGNSVEKAPKKPKVDMETATNGQQHSSSASGGIHHGSSGQNESNCPSTFPCPSPSAQPPIDDAQKQKKDETDKKVAIFFFHNSIPFSAAKSLYYQEMV
        I+S    +KAPKKPKVDMETATNGQQHSSSASGGIHHGSSGQNESNCPSTFPCPSPSAQPPIDD QKQKKDETDKKVAIFFFHNSIPFSAAKSLYYQEMV
Subjt:  IVSGNSVEKAPKKPKVDMETATNGQQHSSSASGGIHHGSSGQNESNCPSTFPCPSPSAQPPIDDAQKQKKDETDKKVAIFFFHNSIPFSAAKSLYYQEMV

Query:  DAIAEHGVGYKAPSYEKLKSTLLDKVKGDIHNSYRKYRDEWKETGCTILCDSWSDGRTKSFLVISITCSKGTLFLKSVDISGREDDATYLSDLLETIVLE
        +AIAE+GVGYKAPSYEKLKSTLLDKVKGDIHNSY+KY DEWKETGCTILCDSWSDGRTKSFLVISITCSKG LFLKSVDISG EDDATYLSDLLETIVLE
Subjt:  DAIAEHGVGYKAPSYEKLKSTLLDKVKGDIHNSYRKYRDEWKETGCTILCDSWSDGRTKSFLVISITCSKGTLFLKSVDISGREDDATYLSDLLETIVLE

Query:  VGVENVVQVITDATASYVYAGRLLMTKYTSLFWSPCVSYCVNQMLEDISKIEWVGTVLEEAKIITRYIYSHASILNTMRKFTSGKELIRPKITRFVTNFL
        VGVENVVQVITDATASYVYAGRLLMTKYTSLFWSPCVSYCVNQMLEDISKIEWVGTVLEEAKIITRYIYSHASILNTMRKFTSGKELIRP+ITRFVTNFL
Subjt:  VGVENVVQVITDATASYVYAGRLLMTKYTSLFWSPCVSYCVNQMLEDISKIEWVGTVLEEAKIITRYIYSHASILNTMRKFTSGKELIRPKITRFVTNFL

Query:  SLRSIVILEDNLKHMFAHSEWLSSIYSRRPDAQTIISLLYLDRFWKDAQEAVNISEPLIRILRIVDGDMPAMGYIFEGIERAKVEIKTYYNGIEDKYMPI
        SLRSIVI EDNLKHMFAHSEWLSSIYSRRPDAQ IISLLYLDRFWKDAQEAVNI+EPLIRILRIVDGDMPAMGYIFEGIERAKVEIKTYYNGIEDKY+PI
Subjt:  SLRSIVILEDNLKHMFAHSEWLSSIYSRRPDAQTIISLLYLDRFWKDAQEAVNISEPLIRILRIVDGDMPAMGYIFEGIERAKVEIKTYYNGIEDKYMPI

Query:  WDTIDGRWNLQLHTTLHTAAAFLNPSVFYNPNFKIDLRIRNGFQEAMLKMATTDKDKMEITREHPAYVNGQGALGTDFAILGRTINAPGDWWSGYGYEIP
        W+TID RWNLQLHTTLHTAAAFLNPSVFYNPNFKIDLRIRNGFQEAMLKMATTDKDKMEITREHPAYVNGQGALGTDFAILGRTINAPGDWWSGYGYEIP
Subjt:  WDTIDGRWNLQLHTTLHTAAAFLNPSVFYNPNFKIDLRIRNGFQEAMLKMATTDKDKMEITREHPAYVNGQGALGTDFAILGRTINAPGDWWSGYGYEIP

Query:  TLQRAAIRILSQPCSSYWRSRWNWSTFETLHSKKRSRAEQEKLNDLVFVQCNLWLQHICLTRDGKYKPVVFDDIDVSLEWPTEFETSAHVLDDSWLDNLP
        TLQRAAIRIL+QPCSSY  SRWNWSTFETLHSKKRSRAEQEKLNDLVFVQCNLWLQHICLTRDGKYKPVVFDDIDVSLEWPTEFETSAHVLDDSWLDNLP
Subjt:  TLQRAAIRILSQPCSSYWRSRWNWSTFETLHSKKRSRAEQEKLNDLVFVQCNLWLQHICLTRDGKYKPVVFDDIDVSLEWPTEFETSAHVLDDSWLDNLP

Query:  LECRGNP
        LECRG+P
Subjt:  LECRGNP

XP_038876877.1 uncharacterized protein LOC120069237 isoform X2 [Benincasa hispida]0.0e+0096.05Show/hide
Query:  IVSGNSVEKAPKKPKVDMETATNGQQHSSSASGGIHHGSSGQNESNCPSTFPCPSPSAQPPIDDAQKQKKDETDKKVAIFFFHNSIPFSAAKSLYYQEMV
        I+S    +KAPKKPKVDMETATNGQQHSSSASGGIHHGSSGQNESNCPSTFPCPSPSAQPPIDD QKQKKDETDKKVAIFFFHNSIPFSAAKSLYYQEMV
Subjt:  IVSGNSVEKAPKKPKVDMETATNGQQHSSSASGGIHHGSSGQNESNCPSTFPCPSPSAQPPIDDAQKQKKDETDKKVAIFFFHNSIPFSAAKSLYYQEMV

Query:  DAIAEHGVGYKAPSYEKLKSTLLDKVKGDIHNSYRKYRDEWKETGCTILCDSWSDGRTKSFLVISITCSKGTLFLKSVDISGREDDATYLSDLLETIVLE
        +AIAE+GVGYKAPSYEKLKSTLLDKVKGDIHNSY+KY DEWKETGCTILCDSWSDGRTKSFLVISITCSKG LFLKSVDISG EDDATYLSDLLETIVLE
Subjt:  DAIAEHGVGYKAPSYEKLKSTLLDKVKGDIHNSYRKYRDEWKETGCTILCDSWSDGRTKSFLVISITCSKGTLFLKSVDISGREDDATYLSDLLETIVLE

Query:  VGVENVVQVITDATASYVYAGRLLMTKYTSLFWSPCVSYCVNQMLEDISKIEWVGTVLEEAKIITRYIYSHASILNTMRKFTSGKELIRPKITRFVTNFL
        VGVENVVQVITDATASYVYAGRLLMTKYTSLFWSPCVSYCVNQMLEDISKIEWVGTVLEEAKIITRYIYSHASILNTMRKFTSGKELIRP+ITRFVTNFL
Subjt:  VGVENVVQVITDATASYVYAGRLLMTKYTSLFWSPCVSYCVNQMLEDISKIEWVGTVLEEAKIITRYIYSHASILNTMRKFTSGKELIRPKITRFVTNFL

Query:  SLRSIVILEDNLKHMFAHSEWLSSIYSRRPDAQTIISLLYLDRFWKDAQEAVNISEPLIRILRIVDGDMPAMGYIFEGIERAKVEIKTYYNGIEDKYMPI
        SLRSIVI EDNLKHMFAHSEWLSSIYSRRPDAQ IISLLYLDRFWKDAQEAVNI+EPLIRILRIVDGDMPAMGYIFEGIERAKVEIKTYYNGIEDKY+PI
Subjt:  SLRSIVILEDNLKHMFAHSEWLSSIYSRRPDAQTIISLLYLDRFWKDAQEAVNISEPLIRILRIVDGDMPAMGYIFEGIERAKVEIKTYYNGIEDKYMPI

Query:  WDTIDGRWNLQLHTTLHTAAAFLNPSVFYNPNFKIDLRIRNGFQEAMLKMATTDKDKMEITREHPAYVNGQGALGTDFAILGRTINAPGDWWSGYGYEIP
        W+TID RWNLQLHTTLHTAAAFLNPSVFYNPNFKIDLRIRNGFQEAMLKMATTDKDKMEITREHPAYVNGQGALGTDFAILGRTINAPGDWWSGYGYEIP
Subjt:  WDTIDGRWNLQLHTTLHTAAAFLNPSVFYNPNFKIDLRIRNGFQEAMLKMATTDKDKMEITREHPAYVNGQGALGTDFAILGRTINAPGDWWSGYGYEIP

Query:  TLQRAAIRILSQPCSSYWRSRWNWSTFETLHSKKRSRAEQEKLNDLVFVQCNLWLQHICLTRDGKYKPVVFDDIDVSLEWPTEFETSAHVLDDSWLDNLP
        TLQRAAIRIL+QPCSSY  SRWNWSTFETLHSKKRSRAEQEKLNDLVFVQCNLWLQHICLTRDGKYKPVVFDDIDVSLEWPTEFETSAHVLDDSWLDNLP
Subjt:  TLQRAAIRILSQPCSSYWRSRWNWSTFETLHSKKRSRAEQEKLNDLVFVQCNLWLQHICLTRDGKYKPVVFDDIDVSLEWPTEFETSAHVLDDSWLDNLP

Query:  LECRGNP
        LECRG+P
Subjt:  LECRGNP

TrEMBL top hitse value%identityAlignment
A0A0A0L2E4 BED-type domain-containing protein0.0e+0092.92Show/hide
Query:  IVSGNSVEKAPKKPKVDMETATNGQQHSSSASGGIHHGSSGQNESNCPSTFPCPSPSAQPPIDDAQKQKKDETDKKVAIFFFHNSIPFSAAKSLYYQEMV
        I+S    +KAPKKPKVDMETATNGQQHSSSASGGIHHGSSGQNESNCPST+PC SPSAQPPIDDAQKQKKDETDKKVAIFFFHNSIPFSAAKSLYYQEMV
Subjt:  IVSGNSVEKAPKKPKVDMETATNGQQHSSSASGGIHHGSSGQNESNCPSTFPCPSPSAQPPIDDAQKQKKDETDKKVAIFFFHNSIPFSAAKSLYYQEMV

Query:  DAIAEHGVGYKAPSYEKLKSTLLDKVKGDIHNSYRKYRDEWKETGCTILCDSWSDGRTKSFLVISITCSKGTLFLKSVDISGREDDATYLSDLLETIVLE
        DAIAE+G GYKAPSYEKLKSTLLDKVKGDIH+SY+K+RDEWKETGCTILCDSWSDG+TKSFLVIS+TCSKGTLFLKSVDISG EDDATYLSDLLETI+LE
Subjt:  DAIAEHGVGYKAPSYEKLKSTLLDKVKGDIHNSYRKYRDEWKETGCTILCDSWSDGRTKSFLVISITCSKGTLFLKSVDISGREDDATYLSDLLETIVLE

Query:  VGVENVVQVITDATASYVYAGRLLMTKYTSLFWSPCVSYCVNQMLEDISKIEWVGTVLEEAKIITRYIYSHASILNTMRKFTSGKELIRPKITRFVTNFL
        VGVENVVQ+ITDATASYVYAGRLLMTKYTSLFWSPCVSYCVNQMLEDISKIEWV  VLEEAKIITRYIYSHASILNTMRKFT GKELIRP+ITRFVTNFL
Subjt:  VGVENVVQVITDATASYVYAGRLLMTKYTSLFWSPCVSYCVNQMLEDISKIEWVGTVLEEAKIITRYIYSHASILNTMRKFTSGKELIRPKITRFVTNFL

Query:  SLRSIVILEDNLKHMFAHSEWLSSIYSRRPDAQTIISLLYLDRFWKDAQEAVNISEPLIRILRIVDGDMPAMGYIFEGIERAKVEIKTYYNGIEDKYMPI
        SLRSIVILEDNLKHMFAHSEWLSSIYSRRPDAQ IISLLYLDRFWKDA EA+NI EPLIRILRIVDGDMPAMGYIFEGIERAKVEIKTYYNG EDKYMPI
Subjt:  SLRSIVILEDNLKHMFAHSEWLSSIYSRRPDAQTIISLLYLDRFWKDAQEAVNISEPLIRILRIVDGDMPAMGYIFEGIERAKVEIKTYYNGIEDKYMPI

Query:  WDTIDGRWNLQLHTTLHTAAAFLNPSVFYNPNFKIDLRIRNGFQEAMLKMATTDKDKMEITREHPAYVNGQGALGTDFAILGRTINAPGDWWSGYGYEIP
        W+TID RWNLQLHTTLHTAAAFLNPSVFYNPNFKIDLRIRNGFQEAMLKMATTDKDKMEITREHPAYVNGQGALGTDFAILGRTINAPGDWWSGYGYEIP
Subjt:  WDTIDGRWNLQLHTTLHTAAAFLNPSVFYNPNFKIDLRIRNGFQEAMLKMATTDKDKMEITREHPAYVNGQGALGTDFAILGRTINAPGDWWSGYGYEIP

Query:  TLQRAAIRILSQPCSSYWRSRWNWSTFETLHSKKRSRAEQEKLNDLVFVQCNLWLQHICLTRDGKYKPVVFDDIDVSLEWPTEFETSAHVLDDSWLDNLP
        TLQRAA+RILSQPCSSY  S WNWSTFETLHSKK SRAEQEKL DLVFVQCNLWLQH+CLTRD KYKPVVFDD+DVSLEWP+E E SAHVLDDSWLDNLP
Subjt:  TLQRAAIRILSQPCSSYWRSRWNWSTFETLHSKKRSRAEQEKLNDLVFVQCNLWLQHICLTRDGKYKPVVFDDIDVSLEWPTEFETSAHVLDDSWLDNLP

Query:  LECRGNP
        LE RG+P
Subjt:  LECRGNP

A0A1S3BLP8 uncharacterized protein LOC1034909270.0e+0093.42Show/hide
Query:  IVSGNSVEKAPKKPKVDMETATNGQQHSSSASGGIHHGSSGQNESNCPSTFPCPSPSAQPPIDDAQKQKKDETDKKVAIFFFHNSIPFSAAKSLYYQEMV
        I+S    +KAPKKPKVDMETATNGQQHSSSASGGIHHGSSGQNESNCPSTFPC SPSAQPPIDDAQKQKKDETDKKVAIFFFHNSIPFSAAKSLYYQEMV
Subjt:  IVSGNSVEKAPKKPKVDMETATNGQQHSSSASGGIHHGSSGQNESNCPSTFPCPSPSAQPPIDDAQKQKKDETDKKVAIFFFHNSIPFSAAKSLYYQEMV

Query:  DAIAEHGVGYKAPSYEKLKSTLLDKVKGDIHNSYRKYRDEWKETGCTILCDSWSDGRTKSFLVISITCSKGTLFLKSVDISGREDDATYLSDLLETIVLE
        DAIAE+G GYKAPSYEKLKSTLLDKVKGDIH+SY+K RDEWKETGCTILCDSWSDGRTKSFLVIS+TCSKGTLFLKSVD SG EDDATYLSDLLETIVLE
Subjt:  DAIAEHGVGYKAPSYEKLKSTLLDKVKGDIHNSYRKYRDEWKETGCTILCDSWSDGRTKSFLVISITCSKGTLFLKSVDISGREDDATYLSDLLETIVLE

Query:  VGVENVVQVITDATASYVYAGRLLMTKYTSLFWSPCVSYCVNQMLEDISKIEWVGTVLEEAKIITRYIYSHASILNTMRKFTSGKELIRPKITRFVTNFL
        VGVENVVQ+ITDATASYVYAGRLLMTKYTSLFWSPCVSYCVNQMLEDISKIEWV TVLEEAKIITRYIYSHASILNTMRKFT GKELIRP+ITRFVTNFL
Subjt:  VGVENVVQVITDATASYVYAGRLLMTKYTSLFWSPCVSYCVNQMLEDISKIEWVGTVLEEAKIITRYIYSHASILNTMRKFTSGKELIRPKITRFVTNFL

Query:  SLRSIVILEDNLKHMFAHSEWLSSIYSRRPDAQTIISLLYLDRFWKDAQEAVNISEPLIRILRIVDGDMPAMGYIFEGIERAKVEIKTYYNGIEDKYMPI
        SLRSIVILE+NLKHMFAHSEWLSSIYSRRPDAQ IISLLYLDRFWKDA EA+NI EPLIRILRIVDGDMPAMGYIFEGIERAKVEIKTYYNG EDKYMPI
Subjt:  SLRSIVILEDNLKHMFAHSEWLSSIYSRRPDAQTIISLLYLDRFWKDAQEAVNISEPLIRILRIVDGDMPAMGYIFEGIERAKVEIKTYYNGIEDKYMPI

Query:  WDTIDGRWNLQLHTTLHTAAAFLNPSVFYNPNFKIDLRIRNGFQEAMLKMATTDKDKMEITREHPAYVNGQGALGTDFAILGRTINAPGDWWSGYGYEIP
        W+TID RWNLQLHTTLHTAAAFLNPSVFYNPNFKIDLRIRNGFQEAMLKMATTDKDKMEITREHPAYVNGQGALGTDFAILGRTIN+PGDWWSGYGYEIP
Subjt:  WDTIDGRWNLQLHTTLHTAAAFLNPSVFYNPNFKIDLRIRNGFQEAMLKMATTDKDKMEITREHPAYVNGQGALGTDFAILGRTINAPGDWWSGYGYEIP

Query:  TLQRAAIRILSQPCSSYWRSRWNWSTFETLHSKKRSRAEQEKLNDLVFVQCNLWLQHICLTRDGKYKPVVFDDIDVSLEWPTEFETSAHVLDDSWLD-NL
        TLQRAA+RILSQPCSSY  SRWNWSTFETLHSKKRSRAEQEKL DLVFVQCNLWLQHIC TRD KYKP+VFDDIDVSLEWP+E E SAHVLDDSWLD NL
Subjt:  TLQRAAIRILSQPCSSYWRSRWNWSTFETLHSKKRSRAEQEKLNDLVFVQCNLWLQHICLTRDGKYKPVVFDDIDVSLEWPTEFETSAHVLDDSWLD-NL

Query:  PLECRGNP
        PLECRG+P
Subjt:  PLECRGNP

A0A5D3D7G5 HAT transposon superfamily0.0e+0093.42Show/hide
Query:  IVSGNSVEKAPKKPKVDMETATNGQQHSSSASGGIHHGSSGQNESNCPSTFPCPSPSAQPPIDDAQKQKKDETDKKVAIFFFHNSIPFSAAKSLYYQEMV
        I+S    +KAPKKPKVDMETATNGQQHSSSASGGIHHGSSGQNESNCPSTFPC SPSAQPPIDDAQKQKKDETDKKVAIFFFHNSIPFSAAKSLYYQEMV
Subjt:  IVSGNSVEKAPKKPKVDMETATNGQQHSSSASGGIHHGSSGQNESNCPSTFPCPSPSAQPPIDDAQKQKKDETDKKVAIFFFHNSIPFSAAKSLYYQEMV

Query:  DAIAEHGVGYKAPSYEKLKSTLLDKVKGDIHNSYRKYRDEWKETGCTILCDSWSDGRTKSFLVISITCSKGTLFLKSVDISGREDDATYLSDLLETIVLE
        DAIAE+G GYKAPSYEKLKSTLLDKVKGDIH+SY+K RDEWKETGCTILCDSWSDGRTKSFLVIS+TCSKGTLFLKSVD SG EDDATYLSDLLETIVLE
Subjt:  DAIAEHGVGYKAPSYEKLKSTLLDKVKGDIHNSYRKYRDEWKETGCTILCDSWSDGRTKSFLVISITCSKGTLFLKSVDISGREDDATYLSDLLETIVLE

Query:  VGVENVVQVITDATASYVYAGRLLMTKYTSLFWSPCVSYCVNQMLEDISKIEWVGTVLEEAKIITRYIYSHASILNTMRKFTSGKELIRPKITRFVTNFL
        VGVENVVQ+ITDATASYVYAGRLLMTKYTSLFWSPCVSYCVNQMLEDISKIEWV TVLEEAKIITRYIYSHASILNTMRKFT GKELIRP+ITRFVTNFL
Subjt:  VGVENVVQVITDATASYVYAGRLLMTKYTSLFWSPCVSYCVNQMLEDISKIEWVGTVLEEAKIITRYIYSHASILNTMRKFTSGKELIRPKITRFVTNFL

Query:  SLRSIVILEDNLKHMFAHSEWLSSIYSRRPDAQTIISLLYLDRFWKDAQEAVNISEPLIRILRIVDGDMPAMGYIFEGIERAKVEIKTYYNGIEDKYMPI
        SLRSIVILE+NLKHMFAHSEWLSSIYSRRPDAQ IISLLYLDRFWKDA EA+NI EPLIRILRIVDGDMPAMGYIFEGIERAKVEIKTYYNG EDKYMPI
Subjt:  SLRSIVILEDNLKHMFAHSEWLSSIYSRRPDAQTIISLLYLDRFWKDAQEAVNISEPLIRILRIVDGDMPAMGYIFEGIERAKVEIKTYYNGIEDKYMPI

Query:  WDTIDGRWNLQLHTTLHTAAAFLNPSVFYNPNFKIDLRIRNGFQEAMLKMATTDKDKMEITREHPAYVNGQGALGTDFAILGRTINAPGDWWSGYGYEIP
        W+TID RWNLQLHTTLHTAAAFLNPSVFYNPNFKIDLRIRNGFQEAMLKMATTDKDKMEITREHPAYVNGQGALGTDFAILGRTIN+PGDWWSGYGYEIP
Subjt:  WDTIDGRWNLQLHTTLHTAAAFLNPSVFYNPNFKIDLRIRNGFQEAMLKMATTDKDKMEITREHPAYVNGQGALGTDFAILGRTINAPGDWWSGYGYEIP

Query:  TLQRAAIRILSQPCSSYWRSRWNWSTFETLHSKKRSRAEQEKLNDLVFVQCNLWLQHICLTRDGKYKPVVFDDIDVSLEWPTEFETSAHVLDDSWLD-NL
        TLQRAA+RILSQPCSSY  SRWNWSTFETLHSKKRSRAEQEKL DLVFVQCNLWLQHIC TRD KYKP+VFDDIDVSLEWP+E E SAHVLDDSWLD NL
Subjt:  TLQRAAIRILSQPCSSYWRSRWNWSTFETLHSKKRSRAEQEKLNDLVFVQCNLWLQHICLTRDGKYKPVVFDDIDVSLEWPTEFETSAHVLDDSWLD-NL

Query:  PLECRGNP
        PLECRG+P
Subjt:  PLECRGNP

A0A6J1E9N1 uncharacterized protein LOC1114311320.0e+0091.76Show/hide
Query:  IVSGNSVEKAPKKPKVDMETATNGQQHSSSASGGIHHGSSGQNESNCPSTFPCPSPSAQPPIDDAQKQKKDETDKKVAIFFFHNSIPFSAAKSLYYQEMV
        I+S    ++APKKPKVDMETATNGQQHSSSASGGIHHGSSGQNESNCPSTFP  SPSAQPPIDDAQKQKKDETDKKVA+FFFHNSIPFSAAKSLYYQEMV
Subjt:  IVSGNSVEKAPKKPKVDMETATNGQQHSSSASGGIHHGSSGQNESNCPSTFPCPSPSAQPPIDDAQKQKKDETDKKVAIFFFHNSIPFSAAKSLYYQEMV

Query:  DAIAEHGVGYKAPSYEKLKSTLLDKVKGDIHNSYRKYRDEWKETGCTILCDSWSDGRTKSFLVISITCSKGTLFLKSVDISGREDDATYLSDLLETIVLE
        +AIAE+GVGY+APSYEKLKSTLL KVKGDI NSY+KYRDEWKETGCTILC+SWSDGRTKSFL+ISITCSKGTLFLKSV+ISGREDDATYLSDLLETIVLE
Subjt:  DAIAEHGVGYKAPSYEKLKSTLLDKVKGDIHNSYRKYRDEWKETGCTILCDSWSDGRTKSFLVISITCSKGTLFLKSVDISGREDDATYLSDLLETIVLE

Query:  VGVENVVQVITDATASYVYAGRLLMTKYTSLFWSPCVSYCVNQMLEDISKIEWVGTVLEEAKIITRYIYSHASILNTMRKFTSGKELIRPKITRFVTNFL
        VGVENVVQVITDATASYVYAGRLLMTKYTSLFWSPCVSYCVNQMLED+SKIEWVGTVL+EAKII RY+YSHA ILNTMRKFTSGKELIRP+ITRFVTNFL
Subjt:  VGVENVVQVITDATASYVYAGRLLMTKYTSLFWSPCVSYCVNQMLEDISKIEWVGTVLEEAKIITRYIYSHASILNTMRKFTSGKELIRPKITRFVTNFL

Query:  SLRSIVILEDNLKHMFAHSEWLSSIYSRRPDAQTIISLLYLDRFWKDAQEAVNISEPLIRILRIVDGDMPAMGYIFEGIERAKVEIKTYYNGIEDKYMPI
        SLRSIV LED LKHMFAHSEW SSIYSRRPDAQ I+S LYLDRFWKDA+EAVNISEPLIRILR+VDGDMPAMGYI+EGIERAKVE+K YYNGIEDKYMPI
Subjt:  SLRSIVILEDNLKHMFAHSEWLSSIYSRRPDAQTIISLLYLDRFWKDAQEAVNISEPLIRILRIVDGDMPAMGYIFEGIERAKVEIKTYYNGIEDKYMPI

Query:  WDTIDGRWNLQLHTTLHTAAAFLNPSVFYNPNFKIDLRIRNGFQEAMLKMATTDKDKMEITREHPAYVNGQGALGTDFAILGRTINAPGDWWSGYGYEIP
        WDTID RWNLQLHTTLHTAAAFLNPS+FYNPNFKIDLRIRNGFQEAMLKMATTDKDKMEITREHPAYVN QGALGTDFAILGRTIN PGDWWSGYGYEIP
Subjt:  WDTIDGRWNLQLHTTLHTAAAFLNPSVFYNPNFKIDLRIRNGFQEAMLKMATTDKDKMEITREHPAYVNGQGALGTDFAILGRTINAPGDWWSGYGYEIP

Query:  TLQRAAIRILSQPCSSYWRSRWNWSTFETLHSKKRSRAEQEKLNDLVFVQCNLWLQHICLTRDGKYKPVVFDDIDVSLEWPTEFETSAHVLDDSWLDNLP
        TLQRAAIRILSQPCSSY  SRWNWSTFETLHSKKRS  EQEKLNDLVFVQCNLWLQHI  TRDGKYKPVVFDDIDVSLEWPTE E+SAHVLDDSWLDNLP
Subjt:  TLQRAAIRILSQPCSSYWRSRWNWSTFETLHSKKRSRAEQEKLNDLVFVQCNLWLQHICLTRDGKYKPVVFDDIDVSLEWPTEFETSAHVLDDSWLDNLP

Query:  LECRGNP
        LEC G+P
Subjt:  LECRGNP

A0A6J1KZI0 uncharacterized protein LOC1115002590.0e+0091.27Show/hide
Query:  IVSGNSVEKAPKKPKVDMETATNGQQHSSSASGGIHHGSSGQNESNCPSTFPCPSPSAQPPIDDAQKQKKDETDKKVAIFFFHNSIPFSAAKSLYYQEMV
        I+S    ++APKKPKVDMETATNGQQHSSSASGGIHHGSSGQNESNCPST PC SPSAQP IDDAQKQKKDETDKKVA+FFFHNSIPFSAAKSLYYQEMV
Subjt:  IVSGNSVEKAPKKPKVDMETATNGQQHSSSASGGIHHGSSGQNESNCPSTFPCPSPSAQPPIDDAQKQKKDETDKKVAIFFFHNSIPFSAAKSLYYQEMV

Query:  DAIAEHGVGYKAPSYEKLKSTLLDKVKGDIHNSYRKYRDEWKETGCTILCDSWSDGRTKSFLVISITCSKGTLFLKSVDISGREDDATYLSDLLETIVLE
        +AIAE+GVGY+APSY+KLKSTLLDKVKGDI NSY+KYRDEWKETGCTILC+SWSDGRTKSFL+ISITCSKGTLFLKSV+ISGREDDATYLSDLLETIVLE
Subjt:  DAIAEHGVGYKAPSYEKLKSTLLDKVKGDIHNSYRKYRDEWKETGCTILCDSWSDGRTKSFLVISITCSKGTLFLKSVDISGREDDATYLSDLLETIVLE

Query:  VGVENVVQVITDATASYVYAGRLLMTKYTSLFWSPCVSYCVNQMLEDISKIEWVGTVLEEAKIITRYIYSHASILNTMRKFTSGKELIRPKITRFVTNFL
        VGVENVVQVITDATASYVYAGRLLMTKYTSLFWSPCVSYCVNQMLED+SKIEWVGTVL+EAKII RY+YSHA IL+TMRKFTSGKELIRP+ITRFVTNFL
Subjt:  VGVENVVQVITDATASYVYAGRLLMTKYTSLFWSPCVSYCVNQMLEDISKIEWVGTVLEEAKIITRYIYSHASILNTMRKFTSGKELIRPKITRFVTNFL

Query:  SLRSIVILEDNLKHMFAHSEWLSSIYSRRPDAQTIISLLYLDRFWKDAQEAVNISEPLIRILRIVDGDMPAMGYIFEGIERAKVEIKTYYNGIEDKYMPI
        SLRSIV LED LKHMFAHSEW SSIYSRRPDAQ I+S LYLDRFWKDA+EAVNISEPLIRILR+VDGDMPAMGYI+EGIERAKVE+K YYNGIEDKYMPI
Subjt:  SLRSIVILEDNLKHMFAHSEWLSSIYSRRPDAQTIISLLYLDRFWKDAQEAVNISEPLIRILRIVDGDMPAMGYIFEGIERAKVEIKTYYNGIEDKYMPI

Query:  WDTIDGRWNLQLHTTLHTAAAFLNPSVFYNPNFKIDLRIRNGFQEAMLKMATTDKDKMEITREHPAYVNGQGALGTDFAILGRTINAPGDWWSGYGYEIP
        WDTID RWNLQLHTTLHTAAAFLNPS+FYNPNFKIDLRIRNGFQEAMLKMATTDKDKMEITREHPAYVN QGALGTDFAILGRTIN PGDWWSGYGYEIP
Subjt:  WDTIDGRWNLQLHTTLHTAAAFLNPSVFYNPNFKIDLRIRNGFQEAMLKMATTDKDKMEITREHPAYVNGQGALGTDFAILGRTINAPGDWWSGYGYEIP

Query:  TLQRAAIRILSQPCSSYWRSRWNWSTFETLHSKKRSRAEQEKLNDLVFVQCNLWLQHICLTRDGKYKPVVFDDIDVSLEWPTEFETSAHVLDDSWLDNLP
        TLQR AIRILSQPCSSY  SRWNWSTFETLHSKKRSR EQEKLNDLVFVQCNLWLQHI  TRDGKYKPVVFDDIDVSLEWPTE E+SA VLDDSWLDNLP
Subjt:  TLQRAAIRILSQPCSSYWRSRWNWSTFETLHSKKRSRAEQEKLNDLVFVQCNLWLQHICLTRDGKYKPVVFDDIDVSLEWPTEFETSAHVLDDSWLDNLP

Query:  LECRGNP
        LEC G+P
Subjt:  LECRGNP

SwissProt top hitse value%identityAlignment
B3H587 Non-specific lipid transfer protein GPI-anchored 84.0e-0428.83Show/hide
Query:  VSVAVAVVAVVVVVVAVGFVEAQVPDCASKLVPCADFLK-SNNPPASCCDPIKEAVATQLQCLCNLYTTPGFLESIGVNVSDAIHLTKACGVPVELSKCK
        V+  V + +V V     G V+ Q   C +KL+PC  ++  S  PP  CC+P+K+     + CLC  +  P  L  + +   +AI +  +CG+  + S C 
Subjt:  VSVAVAVVAVVVVVVAVGFVEAQVPDCASKLVPCADFLK-SNNPPASCCDPIKEAVATQLQCLCNLYTTPGFLESIGVNVSDAIHLTKACGVPVELSKCK

Query:  TAAPAPQHGAP
           P+     P
Subjt:  TAAPAPQHGAP

Q1G2Y5 Non-specific lipid transfer protein GPI-anchored 212.8e-0535.45Show/hide
Query:  PDCASKLVPCADFLKSNN--PPASCCDPIKEAVATQLQCLCNLYTTPGFLESIGVNVSDAIHLTKACGVPVELSKCK-TAAPAPQHGAPS-PPGVPGNDA
        P   S +  C  FL      P + CC  +K    T + CLC L  T G   SI +N + AI L +ACG+P    +CK +AAP P  G  S  P     D+
Subjt:  PDCASKLVPCADFLKSNN--PPASCCDPIKEAVATQLQCLCNLYTTPGFLESIGVNVSDAIHLTKACGVPVELSKCK-TAAPAPQHGAPS-PPGVPGNDA

Query:  QKFASTGFTS
        Q     G  S
Subjt:  QKFASTGFTS

Q1PFD8 Non-specific lipid transfer protein GPI-anchored 92.2e-1035.46Show/hide
Query:  VAVVVVVVAVGFVEA-------QVPDCASKLVPCADFLKSNN--PPASCCDPIKEAVATQLQCLCNLYTTPGFLESIGVNVSDAIHLTKACGVPVELSKC
        +AVVV VV +  VEA       Q+  C  KL+PC  ++ S N  PP SCC P+KE V     CLC  +  P  L+++ +   +A+ L KACGV  ++S C
Subjt:  VAVVVVVVAVGFVEA-------QVPDCASKLVPCADFLKSNN--PPASCCDPIKEAVATQLQCLCNLYTTPGFLESIGVNVSDAIHLTKACGVPVELSKC

Query:  -KTAAPAPQHGAPSPPGVPGNDAQKFAS----TGFTSLLFF
         K A P+P     S  G        F      + F +++FF
Subjt:  -KTAAPAPQHGAPSPPGVPGNDAQKFAS----TGFTSLLFF

Q6NLF7 Non-specific lipid transfer protein GPI-anchored 71.9e-1441.46Show/hide
Query:  AVVAVVVVVVAVGFVEAQVPDCASKLVPCADFLKSNNPPA-SCCDPIKEAVATQLQCLCNLYTTPGFLESIGVNVSDAIHLTKACGVPVELSKCKTAAPA
        A    V+ ++ V  +EAQ  +C SKLVPC + L +   P   CCD IKEAV  +L CLC +YT+PG L    V    A+ L++ C V  +LS C TA  A
Subjt:  AVVAVVVVVVAVGFVEAQVPDCASKLVPCADFLKSNNPPA-SCCDPIKEAVATQLQCLCNLYTTPGFLESIGVNVSDAIHLTKACGVPVELSKCKTAAPA

Query:  PQHGAPSPPGVPGNDAQKFASTG
        P   A  PP  P  + +K A  G
Subjt:  PQHGAPSPPGVPGNDAQKFASTG

Q9LE56 Non-specific lipid transfer protein GPI-anchored 39.2e-0931.95Show/hide
Query:  VSVAVAVVAVVVVVVAVGFVEAQVPD-----------CASKLVPCADFLK-SNNPPASCCDPIKEAVATQLQCLCNLYTTPGFLESIGVNVSDAIHLTKA
        V  AVAVV V   V +        P            C  KL+PC  +L  +  PPA+CC P+ E VA    CLC ++     L+S+ +   +A+ L KA
Subjt:  VSVAVAVVAVVVVVVAVGFVEAQVPD-----------CASKLVPCADFLK-SNNPPASCCDPIKEAVATQLQCLCNLYTTPGFLESIGVNVSDAIHLTKA

Query:  CGVPVELSKCKTAAPAPQHGAPSPPGVPGNDAQKFASTGFTSLLFFESAADGAELVTGSLPRSIMGDFS
        CG   ++S CKT+A    + + +PP  P        STG T      +AA      T S P    G  S
Subjt:  CGVPVELSKCKTAAPAPQHGAPSPPGVPGNDAQKFASTGFTSLLFFESAADGAELVTGSLPRSIMGDFS

Arabidopsis top hitse value%identityAlignment
AT1G79740.1 hAT transposon superfamily3.2e-10535.75Show/hide
Query:  PSTFPCPSPSAQPPIDDAQKQKKDETDKKVAIFFFHNSIPFSAAKSLYYQEMVDAIAEHGVGYKAPSYEKLKSTLLDKVKGDIHNSYRKYRDEWKETGCT
        P + P  +P+++     +    +D  ++ +++FFF N I F+ A+S  Y  M+DA+A+ G G+ APS    K+  LD+VK DI    +    EW  TGCT
Subjt:  PSTFPCPSPSAQPPIDDAQKQKKDETDKKVAIFFFHNSIPFSAAKSLYYQEMVDAIAEHGVGYKAPSYEKLKSTLLDKVKGDIHNSYRKYRDEWKETGCT

Query:  ILCDSWSDGRTKSFLVISITCSKGTLFLKSVDISGREDDATYLSDLLETIVLEVGVENVVQVITDATASYVYAGRLLMTKYTSLFWSPCVSYCVNQMLED
        I+ ++W+D ++++ +  S++      F KSVD S    ++  L+DL ++++ ++G E++VQ+I D +  Y      L+  Y ++F SPC S C+N +LE+
Subjt:  ILCDSWSDGRTKSFLVISITCSKGTLFLKSVDISGREDDATYLSDLLETIVLEVGVENVVQVITDATASYVYAGRLLMTKYTSLFWSPCVSYCVNQMLED

Query:  ISKIEWVGTVLEEAKIITRYIYSHASILNTMRKFTSGKELIRPKITRFVTNFLSLRSIVILEDNLKHMFAHSEWLSSIYSRRPDAQTIISLLYLDRFWKD
         SK++WV   + +A++I++++Y+++ +L+ +RK T G+++IR  +TR V+NFLSL+S++  +  LKHMF   E+ ++  + +P + + +++L  + FW+ 
Subjt:  ISKIEWVGTVLEEAKIITRYIYSHASILNTMRKFTSGKELIRPKITRFVTNFLSLRSIVILEDNLKHMFAHSEWLSSIYSRRPDAQTIISLLYLDRFWKD

Query:  AQEAVNISEPLIRILRIVDGDMPAMGYIFEGIERAKVEIKTYYNGIEDKYMPIWDTIDGRWNLQLHTTLHTAAAFLNPSVFYNPNFKIDLRIRNGFQEAM
         +E+V ISEP++++LR V    PA+G I+E + +AK  I+TYY   E+K+    D +D  W   LH+ LH AAAFLNPS+ YNP  K    ++  F + +
Subjt:  AQEAVNISEPLIRILRIVDGDMPAMGYIFEGIERAKVEIKTYYNGIEDKYMPIWDTIDGRWNLQLHTTLHTAAAFLNPSVFYNPNFKIDLRIRNGFQEAM

Query:  LKMATTDKDKMEITREHPAYVNGQGALGTDFAILGRTINAPGDWWSGYGYEIPTLQRAAIRILSQPCSSYWRSRWNWSTFETLHSKKRSRAEQEKLNDLV
         K+  T   + +IT +   +   +G  G + A+  R   +PG WW  +G   P LQR AIRILSQ CS Y   R  WSTF+ +H ++R++ ++E LN L 
Subjt:  LKMATTDKDKMEITREHPAYVNGQGALGTDFAILGRTINAPGDWWSGYGYEIPTLQRAAIRILSQPCSSYWRSRWNWSTFETLHSKKRSRAEQEKLNDLV

Query:  FVQCNLWL-QHICLTRDGKYKPVVFDDIDVSLEWPTEFETSAHVLDDSWLD
        +V  NL L + I L  D    P+  +DID+  EW  E E  +      WLD
Subjt:  FVQCNLWL-QHICLTRDGKYKPVVFDDIDVSLEWPTEFETSAHVLDDSWLD

AT3G22220.1 hAT transposon superfamily1.7e-8233.08Show/hide
Query:  PCPSPSAQPPIDDAQKQKKDETDKKVAIFFFHNSIPFSAAKSLYYQEMVDAIAEHGVGYKAPSYEKLKSTLLDKVKGDIHNSYRKYRDEWKETGCTILCD
        P    S +  +    K+++      +  F F     F AA S+  Q  +DAI   G G   P++E L+  +L     ++     + +  WK TGC++L  
Subjt:  PCPSPSAQPPIDDAQKQKKDETDKKVAIFFFHNSIPFSAAKSLYYQEMVDAIAEHGVGYKAPSYEKLKSTLLDKVKGDIHNSYRKYRDEWKETGCTILCD

Query:  SWSDGRTKSFLVISITCSKGTLFLKSVDISGREDDATYLSDLLETIVLEVGVENVVQVITDATASYVYAGRLLMTKYTSLFWSPCVSYCVNQMLEDISKI
          +       L   + C +  +FLKSVD S   D    L +LL+ +V E+G  NVVQVIT     Y  AG+ LM  Y SL+W PC ++C+++MLE+  K+
Subjt:  SWSDGRTKSFLVISITCSKGTLFLKSVDISGREDDATYLSDLLETIVLEVGVENVVQVITDATASYVYAGRLLMTKYTSLFWSPCVSYCVNQMLEDISKI

Query:  EWVGTVLEEAKIITRYIYSHASILNTMRKFTSGKELIRPKITRFVTNFLSLRSIVILEDNLKHMFAHSEWLSSIYSRRPDAQTIISLLYLDRFWKDAQEA
        +W+  ++E+A+ +TR IY+H+ +LN MRKFT G ++++P  T   TNF ++  I  L+  L+ M   SEW    YS+      +   +  + FWK    A
Subjt:  EWVGTVLEEAKIITRYIYSHASILNTMRKFTSGKELIRPKITRFVTNFLSLRSIVILEDNLKHMFAHSEWLSSIYSRRPDAQTIISLLYLDRFWKDAQEA

Query:  VNISEPLIRILRIVDGD-MPAMGYIFEGIERAKVEIKTYYNGIEDKYMPIWDTIDGRWNLQLHTTLHTAAAFLNPSVFYNPNFKIDLRIRNGFQEAMLKM
         +I+ P++R+LRIV  +  PAMGY++  + RAK  IKT      ++Y+  W  ID  W   L   L+ A  +LNP  FY+ + ++   I     + + K+
Subjt:  VNISEPLIRILRIVDGD-MPAMGYIFEGIERAKVEIKTYYNGIEDKYMPIWDTIDGRWNLQLHTTLHTAAAFLNPSVFYNPNFKIDLRIRNGFQEAMLKM

Query:  ATTDKDKMEITREHPAYVNGQGALGTDFAILGRTINAPGDWWSGYGYEIPTLQRAAIRILSQPCSSYWRSRWNWSTFETLHSKKRSRAEQEKLNDLVFVQ
              +  + ++  +Y N  G  G + AI  R    P +WWS YG     L R AIRILSQ CSS   S  N ++   ++  K S  E+++LNDLVFVQ
Subjt:  ATTDKDKMEITREHPAYVNGQGALGTDFAILGRTINAPGDWWSGYGYEIPTLQRAAIRILSQPCSSYWRSRWNWSTFETLHSKKRSRAEQEKLNDLVFVQ

Query:  CNLWLQHIC--LTRDGKYKPVVFDDIDVSLEW
         N+ L+ I    + D    P+   +++V  +W
Subjt:  CNLWLQHIC--LTRDGKYKPVVFDDIDVSLEW

AT3G22220.2 hAT transposon superfamily1.7e-8233.08Show/hide
Query:  PCPSPSAQPPIDDAQKQKKDETDKKVAIFFFHNSIPFSAAKSLYYQEMVDAIAEHGVGYKAPSYEKLKSTLLDKVKGDIHNSYRKYRDEWKETGCTILCD
        P    S +  +    K+++      +  F F     F AA S+  Q  +DAI   G G   P++E L+  +L     ++     + +  WK TGC++L  
Subjt:  PCPSPSAQPPIDDAQKQKKDETDKKVAIFFFHNSIPFSAAKSLYYQEMVDAIAEHGVGYKAPSYEKLKSTLLDKVKGDIHNSYRKYRDEWKETGCTILCD

Query:  SWSDGRTKSFLVISITCSKGTLFLKSVDISGREDDATYLSDLLETIVLEVGVENVVQVITDATASYVYAGRLLMTKYTSLFWSPCVSYCVNQMLEDISKI
          +       L   + C +  +FLKSVD S   D    L +LL+ +V E+G  NVVQVIT     Y  AG+ LM  Y SL+W PC ++C+++MLE+  K+
Subjt:  SWSDGRTKSFLVISITCSKGTLFLKSVDISGREDDATYLSDLLETIVLEVGVENVVQVITDATASYVYAGRLLMTKYTSLFWSPCVSYCVNQMLEDISKI

Query:  EWVGTVLEEAKIITRYIYSHASILNTMRKFTSGKELIRPKITRFVTNFLSLRSIVILEDNLKHMFAHSEWLSSIYSRRPDAQTIISLLYLDRFWKDAQEA
        +W+  ++E+A+ +TR IY+H+ +LN MRKFT G ++++P  T   TNF ++  I  L+  L+ M   SEW    YS+      +   +  + FWK    A
Subjt:  EWVGTVLEEAKIITRYIYSHASILNTMRKFTSGKELIRPKITRFVTNFLSLRSIVILEDNLKHMFAHSEWLSSIYSRRPDAQTIISLLYLDRFWKDAQEA

Query:  VNISEPLIRILRIVDGD-MPAMGYIFEGIERAKVEIKTYYNGIEDKYMPIWDTIDGRWNLQLHTTLHTAAAFLNPSVFYNPNFKIDLRIRNGFQEAMLKM
         +I+ P++R+LRIV  +  PAMGY++  + RAK  IKT      ++Y+  W  ID  W   L   L+ A  +LNP  FY+ + ++   I     + + K+
Subjt:  VNISEPLIRILRIVDGD-MPAMGYIFEGIERAKVEIKTYYNGIEDKYMPIWDTIDGRWNLQLHTTLHTAAAFLNPSVFYNPNFKIDLRIRNGFQEAMLKM

Query:  ATTDKDKMEITREHPAYVNGQGALGTDFAILGRTINAPGDWWSGYGYEIPTLQRAAIRILSQPCSSYWRSRWNWSTFETLHSKKRSRAEQEKLNDLVFVQ
              +  + ++  +Y N  G  G + AI  R    P +WWS YG     L R AIRILSQ CSS   S  N ++   ++  K S  E+++LNDLVFVQ
Subjt:  ATTDKDKMEITREHPAYVNGQGALGTDFAILGRTINAPGDWWSGYGYEIPTLQRAAIRILSQPCSSYWRSRWNWSTFETLHSKKRSRAEQEKLNDLVFVQ

Query:  CNLWLQHIC--LTRDGKYKPVVFDDIDVSLEW
         N+ L+ I    + D    P+   +++V  +W
Subjt:  CNLWLQHIC--LTRDGKYKPVVFDDIDVSLEW

AT4G15020.1 hAT transposon superfamily2.4e-8935.05Show/hide
Query:  QQHSSSASGGIHHGSSGQNES----NCPSTFPCPSPSAQPPIDDAQKQKKDETDKKVAIFFFHNSIPFSAAKSLYYQEMVDAIAEHGVGYKAPSYEKLKS
        Q+   S      +GS+  N      +  +  P    S +  +  + + +++     +  F F     F A  S+ +Q M+DAIA  G G  AP+++ L+ 
Subjt:  QQHSSSASGGIHHGSSGQNES----NCPSTFPCPSPSAQPPIDDAQKQKKDETDKKVAIFFFHNSIPFSAAKSLYYQEMVDAIAEHGVGYKAPSYEKLKS

Query:  TLLDKVKGDIHNSYRKYRDEWKETGCTILCDSWSDGRTKSFLVISITCSKGTLFLKSVDISGREDDATYLSDLLETIVLEVGVENVVQVITDATASYVYA
         +L     ++     + +  WK TGC+IL +  +  +    L   + C +  +FLKSVD S     A  L +LL  +V EVG  NVVQVIT     YV A
Subjt:  TLLDKVKGDIHNSYRKYRDEWKETGCTILCDSWSDGRTKSFLVISITCSKGTLFLKSVDISGREDDATYLSDLLETIVLEVGVENVVQVITDATASYVYA

Query:  GRLLMTKYTSLFWSPCVSYCVNQMLEDISKIEWVGTVLEEAKIITRYIYSHASILNTMRKFTSGKELIRPKITRFVTNFLSLRSIVILEDNLKHMFAHSE
        G+ LM  Y SL+W PC ++C++QMLE+  K+ W+   +E+A+ ITR++Y+H+ +LN M KFTSG +++ P  +   TNF +L  I  L+ NL+ M   +E
Subjt:  GRLLMTKYTSLFWSPCVSYCVNQMLEDISKIEWVGTVLEEAKIITRYIYSHASILNTMRKFTSGKELIRPKITRFVTNFLSLRSIVILEDNLKHMFAHSE

Query:  WLSSIYSRRPDAQTIISLLYLDRFWKDAQEAVNISEPLIRILRIVDGD-MPAMGYIFEGIERAKVEIKTYYNGIEDKYMPIWDTIDGRWNLQLHTTLHTA
        W    YS  P    +++ L  + FWK      +++ PL+R LRIV  +  PAMGY++  + RAK  IKT+    ED Y+  W  ID  W  Q H  L  A
Subjt:  WLSSIYSRRPDAQTIISLLYLDRFWKDAQEAVNISEPLIRILRIVDGD-MPAMGYIFEGIERAKVEIKTYYNGIEDKYMPIWDTIDGRWNLQLHTTLHTA

Query:  AAFLNPSVFYNPNFKIDLRIRNGFQEAMLKMATTDKDKMEITREHPAYVNGQGALGTDFAILGRTINAPGDWWSGYGYEIPTLQRAAIRILSQPCSSYWR
          FLNP +FYN N +I   +     + + ++   DK + +I +E  +Y    G  G + AI  R    P +WWS YG     L R AIRILSQ CSS   
Subjt:  AAFLNPSVFYNPNFKIDLRIRNGFQEAMLKMATTDKDKMEITREHPAYVNGQGALGTDFAILGRTINAPGDWWSGYGYEIPTLQRAAIRILSQPCSSYWR

Query:  SRWNWSTFETLHSKKRSRAEQEKLNDLVFVQCNLWLQHICL-TRDGKYKPVVFDDIDVSLEW
         R N    E ++  K S  EQ++L+DLVFVQ N+ L+ +   + D    P+  + IDV  EW
Subjt:  SRWNWSTFETLHSKKRSRAEQEKLNDLVFVQCNLWLQHICL-TRDGKYKPVVFDDIDVSLEW

AT4G15020.2 hAT transposon superfamily2.4e-8935.05Show/hide
Query:  QQHSSSASGGIHHGSSGQNES----NCPSTFPCPSPSAQPPIDDAQKQKKDETDKKVAIFFFHNSIPFSAAKSLYYQEMVDAIAEHGVGYKAPSYEKLKS
        Q+   S      +GS+  N      +  +  P    S +  +  + + +++     +  F F     F A  S+ +Q M+DAIA  G G  AP+++ L+ 
Subjt:  QQHSSSASGGIHHGSSGQNES----NCPSTFPCPSPSAQPPIDDAQKQKKDETDKKVAIFFFHNSIPFSAAKSLYYQEMVDAIAEHGVGYKAPSYEKLKS

Query:  TLLDKVKGDIHNSYRKYRDEWKETGCTILCDSWSDGRTKSFLVISITCSKGTLFLKSVDISGREDDATYLSDLLETIVLEVGVENVVQVITDATASYVYA
         +L     ++     + +  WK TGC+IL +  +  +    L   + C +  +FLKSVD S     A  L +LL  +V EVG  NVVQVIT     YV A
Subjt:  TLLDKVKGDIHNSYRKYRDEWKETGCTILCDSWSDGRTKSFLVISITCSKGTLFLKSVDISGREDDATYLSDLLETIVLEVGVENVVQVITDATASYVYA

Query:  GRLLMTKYTSLFWSPCVSYCVNQMLEDISKIEWVGTVLEEAKIITRYIYSHASILNTMRKFTSGKELIRPKITRFVTNFLSLRSIVILEDNLKHMFAHSE
        G+ LM  Y SL+W PC ++C++QMLE+  K+ W+   +E+A+ ITR++Y+H+ +LN M KFTSG +++ P  +   TNF +L  I  L+ NL+ M   +E
Subjt:  GRLLMTKYTSLFWSPCVSYCVNQMLEDISKIEWVGTVLEEAKIITRYIYSHASILNTMRKFTSGKELIRPKITRFVTNFLSLRSIVILEDNLKHMFAHSE

Query:  WLSSIYSRRPDAQTIISLLYLDRFWKDAQEAVNISEPLIRILRIVDGD-MPAMGYIFEGIERAKVEIKTYYNGIEDKYMPIWDTIDGRWNLQLHTTLHTA
        W    YS  P    +++ L  + FWK      +++ PL+R LRIV  +  PAMGY++  + RAK  IKT+    ED Y+  W  ID  W  Q H  L  A
Subjt:  WLSSIYSRRPDAQTIISLLYLDRFWKDAQEAVNISEPLIRILRIVDGD-MPAMGYIFEGIERAKVEIKTYYNGIEDKYMPIWDTIDGRWNLQLHTTLHTA

Query:  AAFLNPSVFYNPNFKIDLRIRNGFQEAMLKMATTDKDKMEITREHPAYVNGQGALGTDFAILGRTINAPGDWWSGYGYEIPTLQRAAIRILSQPCSSYWR
          FLNP +FYN N +I   +     + + ++   DK + +I +E  +Y    G  G + AI  R    P +WWS YG     L R AIRILSQ CSS   
Subjt:  AAFLNPSVFYNPNFKIDLRIRNGFQEAMLKMATTDKDKMEITREHPAYVNGQGALGTDFAILGRTINAPGDWWSGYGYEIPTLQRAAIRILSQPCSSYWR

Query:  SRWNWSTFETLHSKKRSRAEQEKLNDLVFVQCNLWLQHICL-TRDGKYKPVVFDDIDVSLEW
         R N    E ++  K S  EQ++L+DLVFVQ N+ L+ +   + D    P+  + IDV  EW
Subjt:  SRWNWSTFETLHSKKRSRAEQEKLNDLVFVQCNLWLQHICL-TRDGKYKPVVFDDIDVSLEW


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTCGTAGTCAGGTCTCCGTCGCCGTCGCCGTCGTCGCGGTGGTGGTTGTAGTGGTGGCCGTCGGCTTCGTGGAGGCGCAGGTGCCGGACTGTGCTTCGAAG
CTAGTGCCATGTGCGGATTTCCTGAAGTCCAATAATCCTCCAGCGTCGTGCTGCGATCCGATCAAGGAAGCGGTGGCGACGCAGCTTCAGTGCCTTTGCAACCTC
TATACCACGCCGGGTTTCTTGGAGTCGATCGGCGTTAATGTCTCCGACGCTATCCATCTCACTAAAGCCTGCGGCGTCCCTGTCGAACTCTCCAAGTGCAAAACT
GCTGCTCCGGCTCCGCAGCATGGTGCTCCATCGCCGCCGGGAGTTCCTGGAAATGATGCACAAAAATTTGCATCTACTGGATTTACATCTCTGCTCTTCTTTGAA
TCTGCTGCAGACGGAGCCGAGCTTGTCACCGGTAGTCTGCCTAGGAGCATAATGGGTGATTTTTCCATTCAGATTAGTTCAAACCTTGTTAATATGTTAGTTGAT
GATACTGAAAAACCAAAAAGAAAATCGAGAAGAAATAGATCCAAGGTACCCCAAGAGAGCAAAAAGCCCCAAACAAAGGTTGATCAGAAACATACATCTGATGAT
TCTGGAATGCTCAAGGGAAGCACAAGTGATGGATGGCCACATCAACCTCCCCCGCTCTTCCTTCCTATAATTCCTCCAGTTCATTCTGCCAATGCAGAGCTAGAT
GCAATCCGTTCCGTCCTGCAAGAGAGCGAAAGGGTTGTGGAGAAGTTGCAGAAGCAGGAGGATAACATGCTGAAGGAAGTTACTCAAAGGGCAAAAGATCTCCAT
GACAAGGAGTTCAAGCTCCCTTACCAGAAGCCTATGCCATGCTTGGCCGAGAGCCAAGCTTGCTTTCAATGCTACAAAGATCATCCCAATGACCTTTTGAAATGT
GCTCGTCTGTCAACTAATATATCGCCGTTGTATTTCAATTACTCGGCTCCTTTCCCTTTCCTCCAATCGGCCGTCACTTTCTTCCCGGAGCCCCGGCGGCCGGAA
TCTCATTCTTCCAACTTCGGACCTTCCGATGATACTAAACTTCTATTTCAGATCTTCTCCTTCTTACTAATTCTGTCTCCTCAGGGGCATGAAATATTGTATACC
TTCTGCAATGGTAAAGGCATTCTACTAAATCAAAGTTTCACAGATTGTGATACTTCCCATATACAATGGTTCGAGGAAGGGATGCTTGCTGGGAACATTGCGTCC
TTGTTGATGCAACACGACAGAAGGTTCGATGTAATTATTGTCAGCGGGAATTCAGTGGAGAAGGCACCTAAGAAACCAAAGGTGGATATGGAAACAGCAACAAAT
GGACAGCAACATAGCTCCTCTGCTAGTGGTGGCATCCATCATGGATCCAGTGGTCAGAATGAAAGCAACTGCCCATCAACATTTCCGTGCCCTTCACCCAGTGCA
CAACCACCAATTGATGATGCTCAAAAGCAGAAGAAGGATGAGACTGATAAAAAAGTTGCCATCTTTTTTTTCCATAATTCTATTCCTTTCAGTGCTGCCAAGTCC
TTGTATTATCAGGAAATGGTCGATGCAATAGCAGAACATGGAGTAGGATACAAAGCACCAAGTTATGAGAAATTAAAATCTACTCTTTTGGATAAAGTAAAAGGT
GACATACATAATTCTTACAGAAAGTATAGAGATGAATGGAAAGAAACAGGCTGCACTATCCTGTGTGATAGCTGGTCAGATGGACGGACCAAATCATTTCTAGTC
ATTTCCATTACGTGTTCTAAAGGAACACTGTTTCTGAAGTCAGTCGATATATCAGGTCGTGAAGATGATGCGACTTACCTGTCTGACTTGCTCGAGACCATCGTC
CTCGAGGTTGGAGTGGAGAATGTTGTCCAAGTTATTACAGATGCTACTGCCAGTTATGTCTATGCTGGGAGGCTTCTCATGACCAAGTACACTTCCTTATTTTGG
TCTCCATGTGTTTCTTATTGTGTTAATCAGATGTTGGAGGACATCAGTAAAATCGAGTGGGTCGGTACAGTTTTGGAGGAGGCAAAGATCATCACGCGCTACATT
TATAGTCATGCATCGATTTTGAATACAATGCGAAAGTTCACCAGTGGGAAGGAATTGATCAGGCCAAAAATTACTAGATTTGTGACAAATTTTCTCTCCTTGAGG
TCCATTGTGATTCTTGAGGACAATCTCAAACACATGTTTGCTCATTCAGAGTGGCTGTCCTCGATTTATAGCAGGCGTCCCGATGCACAGACAATTATTTCCTTG
CTGTATTTGGATAGATTTTGGAAGGATGCACAAGAAGCTGTCAACATTTCTGAACCACTTATTAGAATTCTGAGAATTGTTGATGGAGACATGCCTGCCATGGGC
TATATATTTGAAGGAATAGAGAGGGCAAAGGTTGAAATCAAAACATATTACAATGGCATCGAGGATAAATATATGCCTATTTGGGACACAATCGACGGGAGATGG
AATTTGCAGCTTCACACAACGTTGCACACAGCAGCAGCATTCCTTAATCCGTCTGTTTTTTACAATCCGAACTTCAAGATTGATCTGAGAATTCGAAACGGATTT
CAGGAAGCCATGTTGAAGATGGCAACGACTGATAAAGATAAAATGGAGATCACTAGAGAACATCCTGCTTATGTAAATGGACAAGGTGCTCTTGGTACTGACTTT
GCGATCTTGGGGAGAACTATAAATGCCCCAGGTGATTGGTGGTCCGGGTACGGTTACGAGATCCCCACGCTTCAGAGAGCGGCGATACGAATACTAAGCCAACCC
TGTAGTTCTTATTGGCGCAGCAGATGGAACTGGAGCACATTCGAGACCTTACATTCGAAGAAGCGTAGTAGAGCCGAACAAGAAAAGTTGAATGATTTAGTGTTT
GTACAGTGCAATCTTTGGTTGCAACACATTTGTTTGACTCGGGATGGTAAATATAAACCCGTCGTATTTGATGATATAGATGTGAGTTTAGAATGGCCTACCGAG
TTCGAAACCTCAGCTCATGTTTTAGATGATTCATGGTTGGATAATCTGCCTCTTGAATGTAGAGGCAACCCTTAA
mRNA sequenceShow/hide mRNA sequence
ATGGCTCGTAGTCAGGTCTCCGTCGCCGTCGCCGTCGTCGCGGTGGTGGTTGTAGTGGTGGCCGTCGGCTTCGTGGAGGCGCAGGTGCCGGACTGTGCTTCGAAG
CTAGTGCCATGTGCGGATTTCCTGAAGTCCAATAATCCTCCAGCGTCGTGCTGCGATCCGATCAAGGAAGCGGTGGCGACGCAGCTTCAGTGCCTTTGCAACCTC
TATACCACGCCGGGTTTCTTGGAGTCGATCGGCGTTAATGTCTCCGACGCTATCCATCTCACTAAAGCCTGCGGCGTCCCTGTCGAACTCTCCAAGTGCAAAACT
GCTGCTCCGGCTCCGCAGCATGGTGCTCCATCGCCGCCGGGAGTTCCTGGAAATGATGCACAAAAATTTGCATCTACTGGATTTACATCTCTGCTCTTCTTTGAA
TCTGCTGCAGACGGAGCCGAGCTTGTCACCGGTAGTCTGCCTAGGAGCATAATGGGTGATTTTTCCATTCAGATTAGTTCAAACCTTGTTAATATGTTAGTTGAT
GATACTGAAAAACCAAAAAGAAAATCGAGAAGAAATAGATCCAAGGTACCCCAAGAGAGCAAAAAGCCCCAAACAAAGGTTGATCAGAAACATACATCTGATGAT
TCTGGAATGCTCAAGGGAAGCACAAGTGATGGATGGCCACATCAACCTCCCCCGCTCTTCCTTCCTATAATTCCTCCAGTTCATTCTGCCAATGCAGAGCTAGAT
GCAATCCGTTCCGTCCTGCAAGAGAGCGAAAGGGTTGTGGAGAAGTTGCAGAAGCAGGAGGATAACATGCTGAAGGAAGTTACTCAAAGGGCAAAAGATCTCCAT
GACAAGGAGTTCAAGCTCCCTTACCAGAAGCCTATGCCATGCTTGGCCGAGAGCCAAGCTTGCTTTCAATGCTACAAAGATCATCCCAATGACCTTTTGAAATGT
GCTCGTCTGTCAACTAATATATCGCCGTTGTATTTCAATTACTCGGCTCCTTTCCCTTTCCTCCAATCGGCCGTCACTTTCTTCCCGGAGCCCCGGCGGCCGGAA
TCTCATTCTTCCAACTTCGGACCTTCCGATGATACTAAACTTCTATTTCAGATCTTCTCCTTCTTACTAATTCTGTCTCCTCAGGGGCATGAAATATTGTATACC
TTCTGCAATGGTAAAGGCATTCTACTAAATCAAAGTTTCACAGATTGTGATACTTCCCATATACAATGGTTCGAGGAAGGGATGCTTGCTGGGAACATTGCGTCC
TTGTTGATGCAACACGACAGAAGGTTCGATGTAATTATTGTCAGCGGGAATTCAGTGGAGAAGGCACCTAAGAAACCAAAGGTGGATATGGAAACAGCAACAAAT
GGACAGCAACATAGCTCCTCTGCTAGTGGTGGCATCCATCATGGATCCAGTGGTCAGAATGAAAGCAACTGCCCATCAACATTTCCGTGCCCTTCACCCAGTGCA
CAACCACCAATTGATGATGCTCAAAAGCAGAAGAAGGATGAGACTGATAAAAAAGTTGCCATCTTTTTTTTCCATAATTCTATTCCTTTCAGTGCTGCCAAGTCC
TTGTATTATCAGGAAATGGTCGATGCAATAGCAGAACATGGAGTAGGATACAAAGCACCAAGTTATGAGAAATTAAAATCTACTCTTTTGGATAAAGTAAAAGGT
GACATACATAATTCTTACAGAAAGTATAGAGATGAATGGAAAGAAACAGGCTGCACTATCCTGTGTGATAGCTGGTCAGATGGACGGACCAAATCATTTCTAGTC
ATTTCCATTACGTGTTCTAAAGGAACACTGTTTCTGAAGTCAGTCGATATATCAGGTCGTGAAGATGATGCGACTTACCTGTCTGACTTGCTCGAGACCATCGTC
CTCGAGGTTGGAGTGGAGAATGTTGTCCAAGTTATTACAGATGCTACTGCCAGTTATGTCTATGCTGGGAGGCTTCTCATGACCAAGTACACTTCCTTATTTTGG
TCTCCATGTGTTTCTTATTGTGTTAATCAGATGTTGGAGGACATCAGTAAAATCGAGTGGGTCGGTACAGTTTTGGAGGAGGCAAAGATCATCACGCGCTACATT
TATAGTCATGCATCGATTTTGAATACAATGCGAAAGTTCACCAGTGGGAAGGAATTGATCAGGCCAAAAATTACTAGATTTGTGACAAATTTTCTCTCCTTGAGG
TCCATTGTGATTCTTGAGGACAATCTCAAACACATGTTTGCTCATTCAGAGTGGCTGTCCTCGATTTATAGCAGGCGTCCCGATGCACAGACAATTATTTCCTTG
CTGTATTTGGATAGATTTTGGAAGGATGCACAAGAAGCTGTCAACATTTCTGAACCACTTATTAGAATTCTGAGAATTGTTGATGGAGACATGCCTGCCATGGGC
TATATATTTGAAGGAATAGAGAGGGCAAAGGTTGAAATCAAAACATATTACAATGGCATCGAGGATAAATATATGCCTATTTGGGACACAATCGACGGGAGATGG
AATTTGCAGCTTCACACAACGTTGCACACAGCAGCAGCATTCCTTAATCCGTCTGTTTTTTACAATCCGAACTTCAAGATTGATCTGAGAATTCGAAACGGATTT
CAGGAAGCCATGTTGAAGATGGCAACGACTGATAAAGATAAAATGGAGATCACTAGAGAACATCCTGCTTATGTAAATGGACAAGGTGCTCTTGGTACTGACTTT
GCGATCTTGGGGAGAACTATAAATGCCCCAGGTGATTGGTGGTCCGGGTACGGTTACGAGATCCCCACGCTTCAGAGAGCGGCGATACGAATACTAAGCCAACCC
TGTAGTTCTTATTGGCGCAGCAGATGGAACTGGAGCACATTCGAGACCTTACATTCGAAGAAGCGTAGTAGAGCCGAACAAGAAAAGTTGAATGATTTAGTGTTT
GTACAGTGCAATCTTTGGTTGCAACACATTTGTTTGACTCGGGATGGTAAATATAAACCCGTCGTATTTGATGATATAGATGTGAGTTTAGAATGGCCTACCGAG
TTCGAAACCTCAGCTCATGTTTTAGATGATTCATGGTTGGATAATCTGCCTCTTGAATGTAGAGGCAACCCTTAA
Protein sequenceShow/hide protein sequence
MARSQVSVAVAVVAVVVVVVAVGFVEAQVPDCASKLVPCADFLKSNNPPASCCDPIKEAVATQLQCLCNLYTTPGFLESIGVNVSDAIHLTKACGVPVELSKCKT
AAPAPQHGAPSPPGVPGNDAQKFASTGFTSLLFFESAADGAELVTGSLPRSIMGDFSIQISSNLVNMLVDDTEKPKRKSRRNRSKVPQESKKPQTKVDQKHTSDD
SGMLKGSTSDGWPHQPPPLFLPIIPPVHSANAELDAIRSVLQESERVVEKLQKQEDNMLKEVTQRAKDLHDKEFKLPYQKPMPCLAESQACFQCYKDHPNDLLKC
ARLSTNISPLYFNYSAPFPFLQSAVTFFPEPRRPESHSSNFGPSDDTKLLFQIFSFLLILSPQGHEILYTFCNGKGILLNQSFTDCDTSHIQWFEEGMLAGNIAS
LLMQHDRRFDVIIVSGNSVEKAPKKPKVDMETATNGQQHSSSASGGIHHGSSGQNESNCPSTFPCPSPSAQPPIDDAQKQKKDETDKKVAIFFFHNSIPFSAAKS
LYYQEMVDAIAEHGVGYKAPSYEKLKSTLLDKVKGDIHNSYRKYRDEWKETGCTILCDSWSDGRTKSFLVISITCSKGTLFLKSVDISGREDDATYLSDLLETIV
LEVGVENVVQVITDATASYVYAGRLLMTKYTSLFWSPCVSYCVNQMLEDISKIEWVGTVLEEAKIITRYIYSHASILNTMRKFTSGKELIRPKITRFVTNFLSLR
SIVILEDNLKHMFAHSEWLSSIYSRRPDAQTIISLLYLDRFWKDAQEAVNISEPLIRILRIVDGDMPAMGYIFEGIERAKVEIKTYYNGIEDKYMPIWDTIDGRW
NLQLHTTLHTAAAFLNPSVFYNPNFKIDLRIRNGFQEAMLKMATTDKDKMEITREHPAYVNGQGALGTDFAILGRTINAPGDWWSGYGYEIPTLQRAAIRILSQP
CSSYWRSRWNWSTFETLHSKKRSRAEQEKLNDLVFVQCNLWLQHICLTRDGKYKPVVFDDIDVSLEWPTEFETSAHVLDDSWLDNLPLECRGNP