; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0017570 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0017570
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionHAT transposon superfamily
Genome locationchr5:5475374..5478460
RNA-Seq ExpressionLag0017570
SyntenyLag0017570
Gene Ontology termsGO:0005634 - nucleus (cellular component)
GO:0003677 - DNA binding (molecular function)
GO:0046872 - metal ion binding (molecular function)
GO:0046983 - protein dimerization activity (molecular function)
InterPro domainsIPR003656 - Zinc finger, BED-type
IPR007021 - Domain of unknown function DUF659
IPR008906 - HAT, C-terminal dimerisation domain
IPR012337 - Ribonuclease H-like superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7015651.1 hypothetical protein SDJN02_23288, partial [Cucurbita argyrosperma subsp. argyrosperma]0.0e+0093.31Show/hide
Query:  MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPCAEVPTDVRDHIQGILSTPKKQRAPKKPKVDMETATNGQQHSSSASGG
        MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPCAEVP DVRDHIQGILSTPKKQRAPKKPKVDMETATNGQQHSSSASGG
Subjt:  MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPCAEVPTDVRDHIQGILSTPKKQRAPKKPKVDMETATNGQQHSSSASGG

Query:  IHHGSSGQNESNCPSTFPCPSPSAQPPIDDAQKQKKDETDKKVAIFFFHNSIPFSAAKSLYYQEMVNAIAEYGVGYKAPSYEKLKSTLLDKVKGDIQTSY
        IHHGSSGQNESNCPSTFP  SPSAQPPIDDAQKQKKDETDKKVA+FFFHNSIPFSAAKSLYYQEMVNAIAEYGVGY+APSYEKLKSTLL KVKGDIQ SY
Subjt:  IHHGSSGQNESNCPSTFPCPSPSAQPPIDDAQKQKKDETDKKVAIFFFHNSIPFSAAKSLYYQEMVNAIAEYGVGYKAPSYEKLKSTLLDKVKGDIQTSY

Query:  KKYRDEWKETGCTILCDSWSDGRTKSFLVISITCSKGTLFLKSVDVSGREDDATYLSDLLETIVIEVGVENVVQVITDATASYVYAGRLLMTKYTSLFWS
        KKYRDEWKETGCTILC+SWSDGRTKSFL+ISITCSKGTLFLKSV++SGREDDATYLSDLLETIV+EVGVENVVQVITDATASYVYAGRLLMTKYTSLFWS
Subjt:  KKYRDEWKETGCTILCDSWSDGRTKSFLVISITCSKGTLFLKSVDVSGREDDATYLSDLLETIVIEVGVENVVQVITDATASYVYAGRLLMTKYTSLFWS

Query:  PCVSYCVNQMLEDISKIEWVGTVLEEAKIITRYIYSHAWILNTMRKFTSGKELIRPRISRFVTNFLSLRSIVILEDSLKHMFNHSEWLSSIYSRRPDVQE
        PCVSYCVNQMLED+SKIEWVGTVL+EAKII RY+YSHAWILNTMRKFTSGKELIRPRI+RFVTNFLSLRSIV LED LKHMF HSEW SSIYSRRPD Q 
Subjt:  PCVSYCVNQMLEDISKIEWVGTVLEEAKIITRYIYSHAWILNTMRKFTSGKELIRPRISRFVTNFLSLRSIVILEDSLKHMFNHSEWLSSIYSRRPDVQE

Query:  IISLLYLDRFWKDAHEAVNISEPLIRILRVVDGDMPAMGYIYEGIERAKVEIKTYYNGIEDKYMPIWDTIDRRWNLQLHTTLHAAAAFLNPSIFFNPNFK
        I+S LYLDRFWKDA EAVNISEPLIRILR+VDGDMPAMGYIYEGIERAKVE+K YYNGIEDKYMPIWDTIDRRWNLQLHTTLH AAAFLNPSIF+NPNFK
Subjt:  IISLLYLDRFWKDAHEAVNISEPLIRILRVVDGDMPAMGYIYEGIERAKVEIKTYYNGIEDKYMPIWDTIDRRWNLQLHTTLHAAAAFLNPSIFFNPNFK

Query:  IDLRIRNGFQEAMLKMVTTDKDKMEITREHPVYVNGQGALGTDFAVLGRTINTPGDWWSGYGYEIPTLQRAAIRILSQPCSSYGC-RWNWSTFESLHSKK
        IDLRIRNGFQEAMLKM TTDKDKMEITREHP YVN QGALGTDFA+LGRTINTPGDWWSGYGYEIPTLQR AIRILSQPCSSYGC RWNWSTFE+LHSKK
Subjt:  IDLRIRNGFQEAMLKMVTTDKDKMEITREHPVYVNGQGALGTDFAVLGRTINTPGDWWSGYGYEIPTLQRAAIRILSQPCSSYGC-RWNWSTFESLHSKK

Query:  RSRSEQEKLNDLVFVQCNLWLQHICWTRDGKYKPVVFDDIDVSLEWPTEFESSAPVLDDSWLDNLPLECRGSP
        RS +EQEKLNDLVFVQCNLWLQHI WTRDGKYKPVVFDDIDVSLEWPTE ESSA VLDDSWLDNL LEC GSP
Subjt:  RSRSEQEKLNDLVFVQCNLWLQHICWTRDGKYKPVVFDDIDVSLEWPTEFESSAPVLDDSWLDNLPLECRGSP

XP_022923437.1 uncharacterized protein LOC111431132 [Cucurbita moschata]0.0e+0093.61Show/hide
Query:  MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPCAEVPTDVRDHIQGILSTPKKQRAPKKPKVDMETATNGQQHSSSASGG
        MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPCAEVP DVRDHIQGILSTPKKQRAPKKPKVDMETATNGQQHSSSASGG
Subjt:  MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPCAEVPTDVRDHIQGILSTPKKQRAPKKPKVDMETATNGQQHSSSASGG

Query:  IHHGSSGQNESNCPSTFPCPSPSAQPPIDDAQKQKKDETDKKVAIFFFHNSIPFSAAKSLYYQEMVNAIAEYGVGYKAPSYEKLKSTLLDKVKGDIQTSY
        IHHGSSGQNESNCPSTFP  SPSAQPPIDDAQKQKKDETDKKVA+FFFHNSIPFSAAKSLYYQEMVNAIAEYGVGY+APSYEKLKSTLL KVKGDIQ SY
Subjt:  IHHGSSGQNESNCPSTFPCPSPSAQPPIDDAQKQKKDETDKKVAIFFFHNSIPFSAAKSLYYQEMVNAIAEYGVGYKAPSYEKLKSTLLDKVKGDIQTSY

Query:  KKYRDEWKETGCTILCDSWSDGRTKSFLVISITCSKGTLFLKSVDVSGREDDATYLSDLLETIVIEVGVENVVQVITDATASYVYAGRLLMTKYTSLFWS
        KKYRDEWKETGCTILC+SWSDGRTKSFL+ISITCSKGTLFLKSV++SGREDDATYLSDLLETIV+EVGVENVVQVITDATASYVYAGRLLMTKYTSLFWS
Subjt:  KKYRDEWKETGCTILCDSWSDGRTKSFLVISITCSKGTLFLKSVDVSGREDDATYLSDLLETIVIEVGVENVVQVITDATASYVYAGRLLMTKYTSLFWS

Query:  PCVSYCVNQMLEDISKIEWVGTVLEEAKIITRYIYSHAWILNTMRKFTSGKELIRPRISRFVTNFLSLRSIVILEDSLKHMFNHSEWLSSIYSRRPDVQE
        PCVSYCVNQMLED+SKIEWVGTVL+EAKII RY+YSHAWILNTMRKFTSGKELIRPRI+RFVTNFLSLRSIV LED LKHMF HSEW SSIYSRRPD Q 
Subjt:  PCVSYCVNQMLEDISKIEWVGTVLEEAKIITRYIYSHAWILNTMRKFTSGKELIRPRISRFVTNFLSLRSIVILEDSLKHMFNHSEWLSSIYSRRPDVQE

Query:  IISLLYLDRFWKDAHEAVNISEPLIRILRVVDGDMPAMGYIYEGIERAKVEIKTYYNGIEDKYMPIWDTIDRRWNLQLHTTLHAAAAFLNPSIFFNPNFK
        I+S LYLDRFWKDA EAVNISEPLIRILR+VDGDMPAMGYIYEGIERAKVE+K YYNGIEDKYMPIWDTIDRRWNLQLHTTLH AAAFLNPSIF+NPNFK
Subjt:  IISLLYLDRFWKDAHEAVNISEPLIRILRVVDGDMPAMGYIYEGIERAKVEIKTYYNGIEDKYMPIWDTIDRRWNLQLHTTLHAAAAFLNPSIFFNPNFK

Query:  IDLRIRNGFQEAMLKMVTTDKDKMEITREHPVYVNGQGALGTDFAVLGRTINTPGDWWSGYGYEIPTLQRAAIRILSQPCSSYGC-RWNWSTFESLHSKK
        IDLRIRNGFQEAMLKM TTDKDKMEITREHP YVN QGALGTDFA+LGRTINTPGDWWSGYGYEIPTLQRAAIRILSQPCSSYGC RWNWSTFE+LHSKK
Subjt:  IDLRIRNGFQEAMLKMVTTDKDKMEITREHPVYVNGQGALGTDFAVLGRTINTPGDWWSGYGYEIPTLQRAAIRILSQPCSSYGC-RWNWSTFESLHSKK

Query:  RSRSEQEKLNDLVFVQCNLWLQHICWTRDGKYKPVVFDDIDVSLEWPTEFESSAPVLDDSWLDNLPLECRGSP
        RS +EQEKLNDLVFVQCNLWLQHI WTRDGKYKPVVFDDIDVSLEWPTE ESSA VLDDSWLDNLPLEC GSP
Subjt:  RSRSEQEKLNDLVFVQCNLWLQHICWTRDGKYKPVVFDDIDVSLEWPTEFESSAPVLDDSWLDNLPLECRGSP

XP_023007736.1 uncharacterized protein LOC111500259 [Cucurbita maxima]0.0e+0093.16Show/hide
Query:  MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPCAEVPTDVRDHIQGILSTPKKQRAPKKPKVDMETATNGQQHSSSASGG
        MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPCAEVP DVRD IQGILSTPKKQRAPKKPKVDMETATNGQQHSSSASGG
Subjt:  MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPCAEVPTDVRDHIQGILSTPKKQRAPKKPKVDMETATNGQQHSSSASGG

Query:  IHHGSSGQNESNCPSTFPCPSPSAQPPIDDAQKQKKDETDKKVAIFFFHNSIPFSAAKSLYYQEMVNAIAEYGVGYKAPSYEKLKSTLLDKVKGDIQTSY
        IHHGSSGQNESNCPST PC SPSAQP IDDAQKQKKDETDKKVA+FFFHNSIPFSAAKSLYYQEMVNAIAEYGVGY+APSY+KLKSTLLDKVKGDIQ SY
Subjt:  IHHGSSGQNESNCPSTFPCPSPSAQPPIDDAQKQKKDETDKKVAIFFFHNSIPFSAAKSLYYQEMVNAIAEYGVGYKAPSYEKLKSTLLDKVKGDIQTSY

Query:  KKYRDEWKETGCTILCDSWSDGRTKSFLVISITCSKGTLFLKSVDVSGREDDATYLSDLLETIVIEVGVENVVQVITDATASYVYAGRLLMTKYTSLFWS
        KKYRDEWKETGCTILC+SWSDGRTKSFL+ISITCSKGTLFLKSV++SGREDDATYLSDLLETIV+EVGVENVVQVITDATASYVYAGRLLMTKYTSLFWS
Subjt:  KKYRDEWKETGCTILCDSWSDGRTKSFLVISITCSKGTLFLKSVDVSGREDDATYLSDLLETIVIEVGVENVVQVITDATASYVYAGRLLMTKYTSLFWS

Query:  PCVSYCVNQMLEDISKIEWVGTVLEEAKIITRYIYSHAWILNTMRKFTSGKELIRPRISRFVTNFLSLRSIVILEDSLKHMFNHSEWLSSIYSRRPDVQE
        PCVSYCVNQMLED+SKIEWVGTVL+EAKII RY+YSHAWIL+TMRKFTSGKELIRPRI+RFVTNFLSLRSIV LED LKHMF HSEW SSIYSRRPD Q 
Subjt:  PCVSYCVNQMLEDISKIEWVGTVLEEAKIITRYIYSHAWILNTMRKFTSGKELIRPRISRFVTNFLSLRSIVILEDSLKHMFNHSEWLSSIYSRRPDVQE

Query:  IISLLYLDRFWKDAHEAVNISEPLIRILRVVDGDMPAMGYIYEGIERAKVEIKTYYNGIEDKYMPIWDTIDRRWNLQLHTTLHAAAAFLNPSIFFNPNFK
        I+S LYLDRFWKDA EAVNISEPLIRILR+VDGDMPAMGYIYEGIERAKVE+K YYNGIEDKYMPIWDTIDRRWNLQLHTTLH AAAFLNPSIF+NPNFK
Subjt:  IISLLYLDRFWKDAHEAVNISEPLIRILRVVDGDMPAMGYIYEGIERAKVEIKTYYNGIEDKYMPIWDTIDRRWNLQLHTTLHAAAAFLNPSIFFNPNFK

Query:  IDLRIRNGFQEAMLKMVTTDKDKMEITREHPVYVNGQGALGTDFAVLGRTINTPGDWWSGYGYEIPTLQRAAIRILSQPCSSYGC-RWNWSTFESLHSKK
        IDLRIRNGFQEAMLKM TTDKDKMEITREHP YVN QGALGTDFA+LGRTINTPGDWWSGYGYEIPTLQR AIRILSQPCSSYGC RWNWSTFE+LHSKK
Subjt:  IDLRIRNGFQEAMLKMVTTDKDKMEITREHPVYVNGQGALGTDFAVLGRTINTPGDWWSGYGYEIPTLQRAAIRILSQPCSSYGC-RWNWSTFESLHSKK

Query:  RSRSEQEKLNDLVFVQCNLWLQHICWTRDGKYKPVVFDDIDVSLEWPTEFESSAPVLDDSWLDNLPLECRGSP
        RSR+EQEKLNDLVFVQCNLWLQHI WTRDGKYKPVVFDDIDVSLEWPTE ESSA VLDDSWLDNLPLEC GSP
Subjt:  RSRSEQEKLNDLVFVQCNLWLQHICWTRDGKYKPVVFDDIDVSLEWPTEFESSAPVLDDSWLDNLPLECRGSP

XP_038876874.1 uncharacterized protein LOC120069237 isoform X1 [Benincasa hispida]0.0e+0094.65Show/hide
Query:  MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPCAEVPTDVRDHIQGILSTPKKQRAPKKPKVDMETATNGQQHSSSASGG
        MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPCAEVPTDVRDHIQGILSTPKKQ+APKKPKVDMETATNGQQHSSSASGG
Subjt:  MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPCAEVPTDVRDHIQGILSTPKKQRAPKKPKVDMETATNGQQHSSSASGG

Query:  IHHGSSGQNESNCPSTFPCPSPSAQPPIDDAQKQKKDETDKKVAIFFFHNSIPFSAAKSLYYQEMVNAIAEYGVGYKAPSYEKLKSTLLDKVKGDIQTSY
        IHHGSSGQNESNCPSTFPCPSPSAQPPIDD QKQKKDETDKKVAIFFFHNSIPFSAAKSLYYQEMVNAIAEYGVGYKAPSYEKLKSTLLDKVKGDI  SY
Subjt:  IHHGSSGQNESNCPSTFPCPSPSAQPPIDDAQKQKKDETDKKVAIFFFHNSIPFSAAKSLYYQEMVNAIAEYGVGYKAPSYEKLKSTLLDKVKGDIQTSY

Query:  KKYRDEWKETGCTILCDSWSDGRTKSFLVISITCSKGTLFLKSVDVSGREDDATYLSDLLETIVIEVGVENVVQVITDATASYVYAGRLLMTKYTSLFWS
        KKY DEWKETGCTILCDSWSDGRTKSFLVISITCSKG LFLKSVD+SG EDDATYLSDLLETIV+EVGVENVVQVITDATASYVYAGRLLMTKYTSLFWS
Subjt:  KKYRDEWKETGCTILCDSWSDGRTKSFLVISITCSKGTLFLKSVDVSGREDDATYLSDLLETIVIEVGVENVVQVITDATASYVYAGRLLMTKYTSLFWS

Query:  PCVSYCVNQMLEDISKIEWVGTVLEEAKIITRYIYSHAWILNTMRKFTSGKELIRPRISRFVTNFLSLRSIVILEDSLKHMFNHSEWLSSIYSRRPDVQE
        PCVSYCVNQMLEDISKIEWVGTVLEEAKIITRYIYSHA ILNTMRKFTSGKELIRPRI+RFVTNFLSLRSIVI ED+LKHMF HSEWLSSIYSRRPD Q 
Subjt:  PCVSYCVNQMLEDISKIEWVGTVLEEAKIITRYIYSHAWILNTMRKFTSGKELIRPRISRFVTNFLSLRSIVILEDSLKHMFNHSEWLSSIYSRRPDVQE

Query:  IISLLYLDRFWKDAHEAVNISEPLIRILRVVDGDMPAMGYIYEGIERAKVEIKTYYNGIEDKYMPIWDTIDRRWNLQLHTTLHAAAAFLNPSIFFNPNFK
        IISLLYLDRFWKDA EAVNI+EPLIRILR+VDGDMPAMGYI+EGIERAKVEIKTYYNGIEDKY+PIW+TIDRRWNLQLHTTLH AAAFLNPS+F+NPNFK
Subjt:  IISLLYLDRFWKDAHEAVNISEPLIRILRVVDGDMPAMGYIYEGIERAKVEIKTYYNGIEDKYMPIWDTIDRRWNLQLHTTLHAAAAFLNPSIFFNPNFK

Query:  IDLRIRNGFQEAMLKMVTTDKDKMEITREHPVYVNGQGALGTDFAVLGRTINTPGDWWSGYGYEIPTLQRAAIRILSQPCSSYGC-RWNWSTFESLHSKK
        IDLRIRNGFQEAMLKM TTDKDKMEITREHP YVNGQGALGTDFA+LGRTIN PGDWWSGYGYEIPTLQRAAIRIL+QPCSSYGC RWNWSTFE+LHSKK
Subjt:  IDLRIRNGFQEAMLKMVTTDKDKMEITREHPVYVNGQGALGTDFAVLGRTINTPGDWWSGYGYEIPTLQRAAIRILSQPCSSYGC-RWNWSTFESLHSKK

Query:  RSRSEQEKLNDLVFVQCNLWLQHICWTRDGKYKPVVFDDIDVSLEWPTEFESSAPVLDDSWLDNLPLECRGSP
        RSR+EQEKLNDLVFVQCNLWLQHIC TRDGKYKPVVFDDIDVSLEWPTEFE+SA VLDDSWLDNLPLECRGSP
Subjt:  RSRSEQEKLNDLVFVQCNLWLQHICWTRDGKYKPVVFDDIDVSLEWPTEFESSAPVLDDSWLDNLPLECRGSP

XP_038876877.1 uncharacterized protein LOC120069237 isoform X2 [Benincasa hispida]0.0e+0094.65Show/hide
Query:  MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPCAEVPTDVRDHIQGILSTPKKQRAPKKPKVDMETATNGQQHSSSASGG
        MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPCAEVPTDVRDHIQGILSTPKKQ+APKKPKVDMETATNGQQHSSSASGG
Subjt:  MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPCAEVPTDVRDHIQGILSTPKKQRAPKKPKVDMETATNGQQHSSSASGG

Query:  IHHGSSGQNESNCPSTFPCPSPSAQPPIDDAQKQKKDETDKKVAIFFFHNSIPFSAAKSLYYQEMVNAIAEYGVGYKAPSYEKLKSTLLDKVKGDIQTSY
        IHHGSSGQNESNCPSTFPCPSPSAQPPIDD QKQKKDETDKKVAIFFFHNSIPFSAAKSLYYQEMVNAIAEYGVGYKAPSYEKLKSTLLDKVKGDI  SY
Subjt:  IHHGSSGQNESNCPSTFPCPSPSAQPPIDDAQKQKKDETDKKVAIFFFHNSIPFSAAKSLYYQEMVNAIAEYGVGYKAPSYEKLKSTLLDKVKGDIQTSY

Query:  KKYRDEWKETGCTILCDSWSDGRTKSFLVISITCSKGTLFLKSVDVSGREDDATYLSDLLETIVIEVGVENVVQVITDATASYVYAGRLLMTKYTSLFWS
        KKY DEWKETGCTILCDSWSDGRTKSFLVISITCSKG LFLKSVD+SG EDDATYLSDLLETIV+EVGVENVVQVITDATASYVYAGRLLMTKYTSLFWS
Subjt:  KKYRDEWKETGCTILCDSWSDGRTKSFLVISITCSKGTLFLKSVDVSGREDDATYLSDLLETIVIEVGVENVVQVITDATASYVYAGRLLMTKYTSLFWS

Query:  PCVSYCVNQMLEDISKIEWVGTVLEEAKIITRYIYSHAWILNTMRKFTSGKELIRPRISRFVTNFLSLRSIVILEDSLKHMFNHSEWLSSIYSRRPDVQE
        PCVSYCVNQMLEDISKIEWVGTVLEEAKIITRYIYSHA ILNTMRKFTSGKELIRPRI+RFVTNFLSLRSIVI ED+LKHMF HSEWLSSIYSRRPD Q 
Subjt:  PCVSYCVNQMLEDISKIEWVGTVLEEAKIITRYIYSHAWILNTMRKFTSGKELIRPRISRFVTNFLSLRSIVILEDSLKHMFNHSEWLSSIYSRRPDVQE

Query:  IISLLYLDRFWKDAHEAVNISEPLIRILRVVDGDMPAMGYIYEGIERAKVEIKTYYNGIEDKYMPIWDTIDRRWNLQLHTTLHAAAAFLNPSIFFNPNFK
        IISLLYLDRFWKDA EAVNI+EPLIRILR+VDGDMPAMGYI+EGIERAKVEIKTYYNGIEDKY+PIW+TIDRRWNLQLHTTLH AAAFLNPS+F+NPNFK
Subjt:  IISLLYLDRFWKDAHEAVNISEPLIRILRVVDGDMPAMGYIYEGIERAKVEIKTYYNGIEDKYMPIWDTIDRRWNLQLHTTLHAAAAFLNPSIFFNPNFK

Query:  IDLRIRNGFQEAMLKMVTTDKDKMEITREHPVYVNGQGALGTDFAVLGRTINTPGDWWSGYGYEIPTLQRAAIRILSQPCSSYGC-RWNWSTFESLHSKK
        IDLRIRNGFQEAMLKM TTDKDKMEITREHP YVNGQGALGTDFA+LGRTIN PGDWWSGYGYEIPTLQRAAIRIL+QPCSSYGC RWNWSTFE+LHSKK
Subjt:  IDLRIRNGFQEAMLKMVTTDKDKMEITREHPVYVNGQGALGTDFAVLGRTINTPGDWWSGYGYEIPTLQRAAIRILSQPCSSYGC-RWNWSTFESLHSKK

Query:  RSRSEQEKLNDLVFVQCNLWLQHICWTRDGKYKPVVFDDIDVSLEWPTEFESSAPVLDDSWLDNLPLECRGSP
        RSR+EQEKLNDLVFVQCNLWLQHIC TRDGKYKPVVFDDIDVSLEWPTEFE+SA VLDDSWLDNLPLECRGSP
Subjt:  RSRSEQEKLNDLVFVQCNLWLQHICWTRDGKYKPVVFDDIDVSLEWPTEFESSAPVLDDSWLDNLPLECRGSP

TrEMBL top hitse value%identityAlignment
A0A0A0L2E4 BED-type domain-containing protein0.0e+0091.98Show/hide
Query:  MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPCAEVPTDVRDHIQGILSTPKKQRAPKKPKVDMETATNGQQHSSSASGG
        MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPC EVPTDVRDHIQGILSTPKKQ+APKKPKVDMETATNGQQHSSSASGG
Subjt:  MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPCAEVPTDVRDHIQGILSTPKKQRAPKKPKVDMETATNGQQHSSSASGG

Query:  IHHGSSGQNESNCPSTFPCPSPSAQPPIDDAQKQKKDETDKKVAIFFFHNSIPFSAAKSLYYQEMVNAIAEYGVGYKAPSYEKLKSTLLDKVKGDIQTSY
        IHHGSSGQNESNCPST+PC SPSAQPPIDDAQKQKKDETDKKVAIFFFHNSIPFSAAKSLYYQEMV+AIAEYG GYKAPSYEKLKSTLLDKVKGDI +SY
Subjt:  IHHGSSGQNESNCPSTFPCPSPSAQPPIDDAQKQKKDETDKKVAIFFFHNSIPFSAAKSLYYQEMVNAIAEYGVGYKAPSYEKLKSTLLDKVKGDIQTSY

Query:  KKYRDEWKETGCTILCDSWSDGRTKSFLVISITCSKGTLFLKSVDVSGREDDATYLSDLLETIVIEVGVENVVQVITDATASYVYAGRLLMTKYTSLFWS
        KK+RDEWKETGCTILCDSWSDG+TKSFLVIS+TCSKGTLFLKSVD+SG EDDATYLSDLLETI++EVGVENVVQ+ITDATASYVYAGRLLMTKYTSLFWS
Subjt:  KKYRDEWKETGCTILCDSWSDGRTKSFLVISITCSKGTLFLKSVDVSGREDDATYLSDLLETIVIEVGVENVVQVITDATASYVYAGRLLMTKYTSLFWS

Query:  PCVSYCVNQMLEDISKIEWVGTVLEEAKIITRYIYSHAWILNTMRKFTSGKELIRPRISRFVTNFLSLRSIVILEDSLKHMFNHSEWLSSIYSRRPDVQE
        PCVSYCVNQMLEDISKIEWV  VLEEAKIITRYIYSHA ILNTMRKFT GKELIRPRI+RFVTNFLSLRSIVILED+LKHMF HSEWLSSIYSRRPD Q 
Subjt:  PCVSYCVNQMLEDISKIEWVGTVLEEAKIITRYIYSHAWILNTMRKFTSGKELIRPRISRFVTNFLSLRSIVILEDSLKHMFNHSEWLSSIYSRRPDVQE

Query:  IISLLYLDRFWKDAHEAVNISEPLIRILRVVDGDMPAMGYIYEGIERAKVEIKTYYNGIEDKYMPIWDTIDRRWNLQLHTTLHAAAAFLNPSIFFNPNFK
        IISLLYLDRFWKDAHEA+NI EPLIRILR+VDGDMPAMGYI+EGIERAKVEIKTYYNG EDKYMPIW+TIDRRWNLQLHTTLH AAAFLNPS+F+NPNFK
Subjt:  IISLLYLDRFWKDAHEAVNISEPLIRILRVVDGDMPAMGYIYEGIERAKVEIKTYYNGIEDKYMPIWDTIDRRWNLQLHTTLHAAAAFLNPSIFFNPNFK

Query:  IDLRIRNGFQEAMLKMVTTDKDKMEITREHPVYVNGQGALGTDFAVLGRTINTPGDWWSGYGYEIPTLQRAAIRILSQPCSSYGCR-WNWSTFESLHSKK
        IDLRIRNGFQEAMLKM TTDKDKMEITREHP YVNGQGALGTDFA+LGRTIN PGDWWSGYGYEIPTLQRAA+RILSQPCSSYGC  WNWSTFE+LHSKK
Subjt:  IDLRIRNGFQEAMLKMVTTDKDKMEITREHPVYVNGQGALGTDFAVLGRTINTPGDWWSGYGYEIPTLQRAAIRILSQPCSSYGCR-WNWSTFESLHSKK

Query:  RSRSEQEKLNDLVFVQCNLWLQHICWTRDGKYKPVVFDDIDVSLEWPTEFESSAPVLDDSWLDNLPLECRGSP
         SR+EQEKL DLVFVQCNLWLQH+C TRD KYKPVVFDD+DVSLEWP+E E SA VLDDSWLDNLPLE RGSP
Subjt:  RSRSEQEKLNDLVFVQCNLWLQHICWTRDGKYKPVVFDDIDVSLEWPTEFESSAPVLDDSWLDNLPLECRGSP

A0A1S3BLP8 uncharacterized protein LOC1034909270.0e+0092.88Show/hide
Query:  MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPCAEVPTDVRDHIQGILSTPKKQRAPKKPKVDMETATNGQQHSSSASGG
        MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPC EVPTDVRDHIQGILSTPKKQ+APKKPKVDMETATNGQQHSSSASGG
Subjt:  MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPCAEVPTDVRDHIQGILSTPKKQRAPKKPKVDMETATNGQQHSSSASGG

Query:  IHHGSSGQNESNCPSTFPCPSPSAQPPIDDAQKQKKDETDKKVAIFFFHNSIPFSAAKSLYYQEMVNAIAEYGVGYKAPSYEKLKSTLLDKVKGDIQTSY
        IHHGSSGQNESNCPSTFPC SPSAQPPIDDAQKQKKDETDKKVAIFFFHNSIPFSAAKSLYYQEMV+AIAEYG GYKAPSYEKLKSTLLDKVKGDI +SY
Subjt:  IHHGSSGQNESNCPSTFPCPSPSAQPPIDDAQKQKKDETDKKVAIFFFHNSIPFSAAKSLYYQEMVNAIAEYGVGYKAPSYEKLKSTLLDKVKGDIQTSY

Query:  KKYRDEWKETGCTILCDSWSDGRTKSFLVISITCSKGTLFLKSVDVSGREDDATYLSDLLETIVIEVGVENVVQVITDATASYVYAGRLLMTKYTSLFWS
        KK RDEWKETGCTILCDSWSDGRTKSFLVIS+TCSKGTLFLKSVD SG EDDATYLSDLLETIV+EVGVENVVQ+ITDATASYVYAGRLLMTKYTSLFWS
Subjt:  KKYRDEWKETGCTILCDSWSDGRTKSFLVISITCSKGTLFLKSVDVSGREDDATYLSDLLETIVIEVGVENVVQVITDATASYVYAGRLLMTKYTSLFWS

Query:  PCVSYCVNQMLEDISKIEWVGTVLEEAKIITRYIYSHAWILNTMRKFTSGKELIRPRISRFVTNFLSLRSIVILEDSLKHMFNHSEWLSSIYSRRPDVQE
        PCVSYCVNQMLEDISKIEWV TVLEEAKIITRYIYSHA ILNTMRKFT GKELIRPRI+RFVTNFLSLRSIVILE++LKHMF HSEWLSSIYSRRPD Q 
Subjt:  PCVSYCVNQMLEDISKIEWVGTVLEEAKIITRYIYSHAWILNTMRKFTSGKELIRPRISRFVTNFLSLRSIVILEDSLKHMFNHSEWLSSIYSRRPDVQE

Query:  IISLLYLDRFWKDAHEAVNISEPLIRILRVVDGDMPAMGYIYEGIERAKVEIKTYYNGIEDKYMPIWDTIDRRWNLQLHTTLHAAAAFLNPSIFFNPNFK
        IISLLYLDRFWKDAHEA+NI EPLIRILR+VDGDMPAMGYI+EGIERAKVEIKTYYNG EDKYMPIW+TIDRRWNLQLHTTLH AAAFLNPS+F+NPNFK
Subjt:  IISLLYLDRFWKDAHEAVNISEPLIRILRVVDGDMPAMGYIYEGIERAKVEIKTYYNGIEDKYMPIWDTIDRRWNLQLHTTLHAAAAFLNPSIFFNPNFK

Query:  IDLRIRNGFQEAMLKMVTTDKDKMEITREHPVYVNGQGALGTDFAVLGRTINTPGDWWSGYGYEIPTLQRAAIRILSQPCSSYGC-RWNWSTFESLHSKK
        IDLRIRNGFQEAMLKM TTDKDKMEITREHP YVNGQGALGTDFA+LGRTIN+PGDWWSGYGYEIPTLQRAA+RILSQPCSSYGC RWNWSTFE+LHSKK
Subjt:  IDLRIRNGFQEAMLKMVTTDKDKMEITREHPVYVNGQGALGTDFAVLGRTINTPGDWWSGYGYEIPTLQRAAIRILSQPCSSYGC-RWNWSTFESLHSKK

Query:  RSRSEQEKLNDLVFVQCNLWLQHICWTRDGKYKPVVFDDIDVSLEWPTEFESSAPVLDDSWLD-NLPLECRGSP
        RSR+EQEKL DLVFVQCNLWLQHIC TRD KYKP+VFDDIDVSLEWP+E E SA VLDDSWLD NLPLECRGSP
Subjt:  RSRSEQEKLNDLVFVQCNLWLQHICWTRDGKYKPVVFDDIDVSLEWPTEFESSAPVLDDSWLD-NLPLECRGSP

A0A5D3D7G5 HAT transposon superfamily0.0e+0092.88Show/hide
Query:  MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPCAEVPTDVRDHIQGILSTPKKQRAPKKPKVDMETATNGQQHSSSASGG
        MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPC EVPTDVRDHIQGILSTPKKQ+APKKPKVDMETATNGQQHSSSASGG
Subjt:  MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPCAEVPTDVRDHIQGILSTPKKQRAPKKPKVDMETATNGQQHSSSASGG

Query:  IHHGSSGQNESNCPSTFPCPSPSAQPPIDDAQKQKKDETDKKVAIFFFHNSIPFSAAKSLYYQEMVNAIAEYGVGYKAPSYEKLKSTLLDKVKGDIQTSY
        IHHGSSGQNESNCPSTFPC SPSAQPPIDDAQKQKKDETDKKVAIFFFHNSIPFSAAKSLYYQEMV+AIAEYG GYKAPSYEKLKSTLLDKVKGDI +SY
Subjt:  IHHGSSGQNESNCPSTFPCPSPSAQPPIDDAQKQKKDETDKKVAIFFFHNSIPFSAAKSLYYQEMVNAIAEYGVGYKAPSYEKLKSTLLDKVKGDIQTSY

Query:  KKYRDEWKETGCTILCDSWSDGRTKSFLVISITCSKGTLFLKSVDVSGREDDATYLSDLLETIVIEVGVENVVQVITDATASYVYAGRLLMTKYTSLFWS
        KK RDEWKETGCTILCDSWSDGRTKSFLVIS+TCSKGTLFLKSVD SG EDDATYLSDLLETIV+EVGVENVVQ+ITDATASYVYAGRLLMTKYTSLFWS
Subjt:  KKYRDEWKETGCTILCDSWSDGRTKSFLVISITCSKGTLFLKSVDVSGREDDATYLSDLLETIVIEVGVENVVQVITDATASYVYAGRLLMTKYTSLFWS

Query:  PCVSYCVNQMLEDISKIEWVGTVLEEAKIITRYIYSHAWILNTMRKFTSGKELIRPRISRFVTNFLSLRSIVILEDSLKHMFNHSEWLSSIYSRRPDVQE
        PCVSYCVNQMLEDISKIEWV TVLEEAKIITRYIYSHA ILNTMRKFT GKELIRPRI+RFVTNFLSLRSIVILE++LKHMF HSEWLSSIYSRRPD Q 
Subjt:  PCVSYCVNQMLEDISKIEWVGTVLEEAKIITRYIYSHAWILNTMRKFTSGKELIRPRISRFVTNFLSLRSIVILEDSLKHMFNHSEWLSSIYSRRPDVQE

Query:  IISLLYLDRFWKDAHEAVNISEPLIRILRVVDGDMPAMGYIYEGIERAKVEIKTYYNGIEDKYMPIWDTIDRRWNLQLHTTLHAAAAFLNPSIFFNPNFK
        IISLLYLDRFWKDAHEA+NI EPLIRILR+VDGDMPAMGYI+EGIERAKVEIKTYYNG EDKYMPIW+TIDRRWNLQLHTTLH AAAFLNPS+F+NPNFK
Subjt:  IISLLYLDRFWKDAHEAVNISEPLIRILRVVDGDMPAMGYIYEGIERAKVEIKTYYNGIEDKYMPIWDTIDRRWNLQLHTTLHAAAAFLNPSIFFNPNFK

Query:  IDLRIRNGFQEAMLKMVTTDKDKMEITREHPVYVNGQGALGTDFAVLGRTINTPGDWWSGYGYEIPTLQRAAIRILSQPCSSYGC-RWNWSTFESLHSKK
        IDLRIRNGFQEAMLKM TTDKDKMEITREHP YVNGQGALGTDFA+LGRTIN+PGDWWSGYGYEIPTLQRAA+RILSQPCSSYGC RWNWSTFE+LHSKK
Subjt:  IDLRIRNGFQEAMLKMVTTDKDKMEITREHPVYVNGQGALGTDFAVLGRTINTPGDWWSGYGYEIPTLQRAAIRILSQPCSSYGC-RWNWSTFESLHSKK

Query:  RSRSEQEKLNDLVFVQCNLWLQHICWTRDGKYKPVVFDDIDVSLEWPTEFESSAPVLDDSWLD-NLPLECRGSP
        RSR+EQEKL DLVFVQCNLWLQHIC TRD KYKP+VFDDIDVSLEWP+E E SA VLDDSWLD NLPLECRGSP
Subjt:  RSRSEQEKLNDLVFVQCNLWLQHICWTRDGKYKPVVFDDIDVSLEWPTEFESSAPVLDDSWLD-NLPLECRGSP

A0A6J1E9N1 uncharacterized protein LOC1114311320.0e+0093.61Show/hide
Query:  MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPCAEVPTDVRDHIQGILSTPKKQRAPKKPKVDMETATNGQQHSSSASGG
        MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPCAEVP DVRDHIQGILSTPKKQRAPKKPKVDMETATNGQQHSSSASGG
Subjt:  MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPCAEVPTDVRDHIQGILSTPKKQRAPKKPKVDMETATNGQQHSSSASGG

Query:  IHHGSSGQNESNCPSTFPCPSPSAQPPIDDAQKQKKDETDKKVAIFFFHNSIPFSAAKSLYYQEMVNAIAEYGVGYKAPSYEKLKSTLLDKVKGDIQTSY
        IHHGSSGQNESNCPSTFP  SPSAQPPIDDAQKQKKDETDKKVA+FFFHNSIPFSAAKSLYYQEMVNAIAEYGVGY+APSYEKLKSTLL KVKGDIQ SY
Subjt:  IHHGSSGQNESNCPSTFPCPSPSAQPPIDDAQKQKKDETDKKVAIFFFHNSIPFSAAKSLYYQEMVNAIAEYGVGYKAPSYEKLKSTLLDKVKGDIQTSY

Query:  KKYRDEWKETGCTILCDSWSDGRTKSFLVISITCSKGTLFLKSVDVSGREDDATYLSDLLETIVIEVGVENVVQVITDATASYVYAGRLLMTKYTSLFWS
        KKYRDEWKETGCTILC+SWSDGRTKSFL+ISITCSKGTLFLKSV++SGREDDATYLSDLLETIV+EVGVENVVQVITDATASYVYAGRLLMTKYTSLFWS
Subjt:  KKYRDEWKETGCTILCDSWSDGRTKSFLVISITCSKGTLFLKSVDVSGREDDATYLSDLLETIVIEVGVENVVQVITDATASYVYAGRLLMTKYTSLFWS

Query:  PCVSYCVNQMLEDISKIEWVGTVLEEAKIITRYIYSHAWILNTMRKFTSGKELIRPRISRFVTNFLSLRSIVILEDSLKHMFNHSEWLSSIYSRRPDVQE
        PCVSYCVNQMLED+SKIEWVGTVL+EAKII RY+YSHAWILNTMRKFTSGKELIRPRI+RFVTNFLSLRSIV LED LKHMF HSEW SSIYSRRPD Q 
Subjt:  PCVSYCVNQMLEDISKIEWVGTVLEEAKIITRYIYSHAWILNTMRKFTSGKELIRPRISRFVTNFLSLRSIVILEDSLKHMFNHSEWLSSIYSRRPDVQE

Query:  IISLLYLDRFWKDAHEAVNISEPLIRILRVVDGDMPAMGYIYEGIERAKVEIKTYYNGIEDKYMPIWDTIDRRWNLQLHTTLHAAAAFLNPSIFFNPNFK
        I+S LYLDRFWKDA EAVNISEPLIRILR+VDGDMPAMGYIYEGIERAKVE+K YYNGIEDKYMPIWDTIDRRWNLQLHTTLH AAAFLNPSIF+NPNFK
Subjt:  IISLLYLDRFWKDAHEAVNISEPLIRILRVVDGDMPAMGYIYEGIERAKVEIKTYYNGIEDKYMPIWDTIDRRWNLQLHTTLHAAAAFLNPSIFFNPNFK

Query:  IDLRIRNGFQEAMLKMVTTDKDKMEITREHPVYVNGQGALGTDFAVLGRTINTPGDWWSGYGYEIPTLQRAAIRILSQPCSSYGC-RWNWSTFESLHSKK
        IDLRIRNGFQEAMLKM TTDKDKMEITREHP YVN QGALGTDFA+LGRTINTPGDWWSGYGYEIPTLQRAAIRILSQPCSSYGC RWNWSTFE+LHSKK
Subjt:  IDLRIRNGFQEAMLKMVTTDKDKMEITREHPVYVNGQGALGTDFAVLGRTINTPGDWWSGYGYEIPTLQRAAIRILSQPCSSYGC-RWNWSTFESLHSKK

Query:  RSRSEQEKLNDLVFVQCNLWLQHICWTRDGKYKPVVFDDIDVSLEWPTEFESSAPVLDDSWLDNLPLECRGSP
        RS +EQEKLNDLVFVQCNLWLQHI WTRDGKYKPVVFDDIDVSLEWPTE ESSA VLDDSWLDNLPLEC GSP
Subjt:  RSRSEQEKLNDLVFVQCNLWLQHICWTRDGKYKPVVFDDIDVSLEWPTEFESSAPVLDDSWLDNLPLECRGSP

A0A6J1KZI0 uncharacterized protein LOC1115002590.0e+0093.16Show/hide
Query:  MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPCAEVPTDVRDHIQGILSTPKKQRAPKKPKVDMETATNGQQHSSSASGG
        MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPCAEVP DVRD IQGILSTPKKQRAPKKPKVDMETATNGQQHSSSASGG
Subjt:  MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPCAEVPTDVRDHIQGILSTPKKQRAPKKPKVDMETATNGQQHSSSASGG

Query:  IHHGSSGQNESNCPSTFPCPSPSAQPPIDDAQKQKKDETDKKVAIFFFHNSIPFSAAKSLYYQEMVNAIAEYGVGYKAPSYEKLKSTLLDKVKGDIQTSY
        IHHGSSGQNESNCPST PC SPSAQP IDDAQKQKKDETDKKVA+FFFHNSIPFSAAKSLYYQEMVNAIAEYGVGY+APSY+KLKSTLLDKVKGDIQ SY
Subjt:  IHHGSSGQNESNCPSTFPCPSPSAQPPIDDAQKQKKDETDKKVAIFFFHNSIPFSAAKSLYYQEMVNAIAEYGVGYKAPSYEKLKSTLLDKVKGDIQTSY

Query:  KKYRDEWKETGCTILCDSWSDGRTKSFLVISITCSKGTLFLKSVDVSGREDDATYLSDLLETIVIEVGVENVVQVITDATASYVYAGRLLMTKYTSLFWS
        KKYRDEWKETGCTILC+SWSDGRTKSFL+ISITCSKGTLFLKSV++SGREDDATYLSDLLETIV+EVGVENVVQVITDATASYVYAGRLLMTKYTSLFWS
Subjt:  KKYRDEWKETGCTILCDSWSDGRTKSFLVISITCSKGTLFLKSVDVSGREDDATYLSDLLETIVIEVGVENVVQVITDATASYVYAGRLLMTKYTSLFWS

Query:  PCVSYCVNQMLEDISKIEWVGTVLEEAKIITRYIYSHAWILNTMRKFTSGKELIRPRISRFVTNFLSLRSIVILEDSLKHMFNHSEWLSSIYSRRPDVQE
        PCVSYCVNQMLED+SKIEWVGTVL+EAKII RY+YSHAWIL+TMRKFTSGKELIRPRI+RFVTNFLSLRSIV LED LKHMF HSEW SSIYSRRPD Q 
Subjt:  PCVSYCVNQMLEDISKIEWVGTVLEEAKIITRYIYSHAWILNTMRKFTSGKELIRPRISRFVTNFLSLRSIVILEDSLKHMFNHSEWLSSIYSRRPDVQE

Query:  IISLLYLDRFWKDAHEAVNISEPLIRILRVVDGDMPAMGYIYEGIERAKVEIKTYYNGIEDKYMPIWDTIDRRWNLQLHTTLHAAAAFLNPSIFFNPNFK
        I+S LYLDRFWKDA EAVNISEPLIRILR+VDGDMPAMGYIYEGIERAKVE+K YYNGIEDKYMPIWDTIDRRWNLQLHTTLH AAAFLNPSIF+NPNFK
Subjt:  IISLLYLDRFWKDAHEAVNISEPLIRILRVVDGDMPAMGYIYEGIERAKVEIKTYYNGIEDKYMPIWDTIDRRWNLQLHTTLHAAAAFLNPSIFFNPNFK

Query:  IDLRIRNGFQEAMLKMVTTDKDKMEITREHPVYVNGQGALGTDFAVLGRTINTPGDWWSGYGYEIPTLQRAAIRILSQPCSSYGC-RWNWSTFESLHSKK
        IDLRIRNGFQEAMLKM TTDKDKMEITREHP YVN QGALGTDFA+LGRTINTPGDWWSGYGYEIPTLQR AIRILSQPCSSYGC RWNWSTFE+LHSKK
Subjt:  IDLRIRNGFQEAMLKMVTTDKDKMEITREHPVYVNGQGALGTDFAVLGRTINTPGDWWSGYGYEIPTLQRAAIRILSQPCSSYGC-RWNWSTFESLHSKK

Query:  RSRSEQEKLNDLVFVQCNLWLQHICWTRDGKYKPVVFDDIDVSLEWPTEFESSAPVLDDSWLDNLPLECRGSP
        RSR+EQEKLNDLVFVQCNLWLQHI WTRDGKYKPVVFDDIDVSLEWPTE ESSA VLDDSWLDNLPLEC GSP
Subjt:  RSRSEQEKLNDLVFVQCNLWLQHICWTRDGKYKPVVFDDIDVSLEWPTEFESSAPVLDDSWLDNLPLECRGSP

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G79740.1 hAT transposon superfamily9.2e-12234.44Show/hide
Query:  MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPCAEVPTDVRDHIQGILSTPKKQRAPKKPKVDMETATNGQQHSSSASGG
        MVR +D CWE+   +D    KV+C +C R  +GG+ R+K HL+++ +K + PCA+V  DV D ++ ILS      A   P +                  
Subjt:  MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPCAEVPTDVRDHIQGILSTPKKQRAPKKPKVDMETATNGQQHSSSASGG

Query:  IHHGSSGQNESNCPSTFPCPSPSAQPPIDDAQKQKKDETDKKVAIFFFHNSIPFSAAKSLYYQEMVNAIAEYGVGYKAPSYEKLKSTLLDKVKGDIQTSY
             + + +   P + P  +P+++     +    +D  ++ +++FFF N I F+ A+S  Y  M++A+A+ G G+ APS    K+  LD+VK DI    
Subjt:  IHHGSSGQNESNCPSTFPCPSPSAQPPIDDAQKQKKDETDKKVAIFFFHNSIPFSAAKSLYYQEMVNAIAEYGVGYKAPSYEKLKSTLLDKVKGDIQTSY

Query:  KKYRDEWKETGCTILCDSWSDGRTKSFLVISITCSKGTLFLKSVDVSGREDDATYLSDLLETIVIEVGVENVVQVITDATASYVYAGRLLMTKYTSLFWS
        K    EW  TGCTI+ ++W+D ++++ +  S++      F KSVD S    ++  L+DL ++++ ++G E++VQ+I D +  Y      L+  Y ++F S
Subjt:  KKYRDEWKETGCTILCDSWSDGRTKSFLVISITCSKGTLFLKSVDVSGREDDATYLSDLLETIVIEVGVENVVQVITDATASYVYAGRLLMTKYTSLFWS

Query:  PCVSYCVNQMLEDISKIEWVGTVLEEAKIITRYIYSHAWILNTMRKFTSGKELIRPRISRFVTNFLSLRSIVILEDSLKHMFNHSEWLSSIYSRRPDVQE
        PC S C+N +LE+ SK++WV   + +A++I++++Y+++ +L+ +RK T G+++IR  ++R V+NFLSL+S++  +  LKHMFN  E+ ++  + +P    
Subjt:  PCVSYCVNQMLEDISKIEWVGTVLEEAKIITRYIYSHAWILNTMRKFTSGKELIRPRISRFVTNFLSLRSIVILEDSLKHMFNHSEWLSSIYSRRPDVQE

Query:  IISLLYLDRFWKDAHEAVNISEPLIRILRVVDGDMPAMGYIYEGIERAKVEIKTYYNGIEDKYMPIWDTIDRRWNLQLHTTLHAAAAFLNPSIFFNPNFK
         +++L  + FW+   E+V ISEP++++LR V    PA+G IYE + +AK  I+TYY   E+K+    D +D  W   LH+ LHAAAAFLNPSI +NP  K
Subjt:  IISLLYLDRFWKDAHEAVNISEPLIRILRVVDGDMPAMGYIYEGIERAKVEIKTYYNGIEDKYMPIWDTIDRRWNLQLHTTLHAAAAFLNPSIFFNPNFK

Query:  IDLRIRNGFQEAMLKMVTTDKDKMEITREHPVYVNGQGALGTDFAVLGRTINTPGDWWSGYGYEIPTLQRAAIRILSQPCSSYGCRWNWSTFESLHSKKR
            ++  F + + K++ T   + +IT +   +   +G  G + A+  R   +PG WW  +G   P LQR AIRILSQ CS Y     WSTF+ +H ++R
Subjt:  IDLRIRNGFQEAMLKMVTTDKDKMEITREHPVYVNGQGALGTDFAVLGRTINTPGDWWSGYGYEIPTLQRAAIRILSQPCSSYGCRWNWSTFESLHSKKR

Query:  SRSEQEKLNDLVFVQCNLWLQHICWTRDGKYKPVVFDDIDVSLEWPTEFESSAPVLDDSWLD
        ++ ++E LN L +V  NL L  +      +  P+  +DID+  EW  E E+ +P     WLD
Subjt:  SRSEQEKLNDLVFVQCNLWLQHICWTRDGKYKPVVFDDIDVSLEWPTEFESSAPVLDDSWLD

AT3G17450.1 hAT dimerisation domain-containing protein3.9e-9630.59Show/hide
Query:  DACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPCAEVPTDVRDHIQGILSTPKKQRAPKKPKVDMETAT-----------------
        D  WEH +  D  ++KV+CNYC +  SGG+ R K HLA+I   ++ PC   P +V   I+  +   +  +   +P  +M   T                 
Subjt:  DACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPCAEVPTDVRDHIQGILSTPKKQRAPKKPKVDMETAT-----------------

Query:  -------------NGQ----QHSSSASGGIHHGSSGQNESNCPSTFPCPSPSAQPPIDDAQKQK---KDETDKKVAIFFFHNSIPFSAAKSLYYQEMVNA
                     NG+    +  S  S  +   S  + +      F  PS S Q  +  +   +   + +    ++ F  H  +P  AA SLY+Q+M+  
Subjt:  -------------NGQ----QHSSSASGGIHHGSSGQNESNCPSTFPCPSPSAQPPIDDAQKQK---KDETDKKVAIFFFHNSIPFSAAKSLYYQEMVNA

Query:  IAEYGVGYKAPSYEKLKSTLLDKVKGDIQTSYKKYRDEWKETGCTILCDSWSDGRTKSFLVISITCSKGTLFLKSVDVSGREDDATYLSDLLETIVIEVG
        I  YG G+  PS +     LL +    I++  ++YR  W  TGC+I+ D+W++   K  +   ++C +G  F  S+D +   +DA  L   L+ +V ++G
Subjt:  IAEYGVGYKAPSYEKLKSTLLDKVKGDIQTSYKKYRDEWKETGCTILCDSWSDGRTKSFLVISITCSKGTLFLKSVDVSGREDDATYLSDLLETIVIEVG

Query:  VENVVQVITDATASYVYAGRLLMTKYTSLFWSPCVSYCVNQMLEDISKIEWVGTVLEEAKIITRYIYSHAWILNTMR-KFTSGKELIRPRISRFVTNFLS
         ENVVQVIT  TA +  AG+LL  K  +L+W+PC  +C   +LED SK+E+V   LE+A+ ITR+IY+  W+LN M+ +FT G +L+RP + R  + F +
Subjt:  VENVVQVITDATASYVYAGRLLMTKYTSLFWSPCVSYCVNQMLEDISKIEWVGTVLEEAKIITRYIYSHAWILNTMR-KFTSGKELIRPRISRFVTNFLS

Query:  LRSIVILEDSLKHMFNHSEW-LSSIYSRRPDVQEIISLLYLDRFWKDAHEAVNISEPLIRILRVVD--GDMPAMGYIYEGIERAKVEIKTYYNGIEDKYM
        L+S++  + SL+ +F    W LS   ++  + +E+  ++    FWK     +   +P+++++ +++  GD  +M Y Y  +  AK+ IK+ ++    KY 
Subjt:  LRSIVILEDSLKHMFNHSEW-LSSIYSRRPDVQEIISLLYLDRFWKDAHEAVNISEPLIRILRVVD--GDMPAMGYIYEGIERAKVEIKTYYNGIEDKYM

Query:  PIWDTIDRRWNLQLHTTLHAAAAFLNPSIFFNPNFKIDLRIRNGFQEAMLKMVTTDKDKMEITREHPVYVNGQGALGTDFAVLGRTINTPGDWWSGYGYE
        P W  I+ RWN   H  L+ AA F NP+  + P+F     +  G  E ++++   +  ++    + P Y   +   GTD A+  RT   P  WW  +G  
Subjt:  PIWDTIDRRWNLQLHTTLHAAAAFLNPSIFFNPNFKIDLRIRNGFQEAMLKMVTTDKDKMEITREHPVYVNGQGALGTDFAVLGRTINTPGDWWSGYGYE

Query:  IPTLQRAAIRILSQPCSSYGCRWNWSTFESLHSKKRSRSEQEKLNDLVFVQCNLWLQ
           LQR A+RILS  CSS GC   WS ++ ++S+ +S+  ++   DL +V  NL L+
Subjt:  IPTLQRAAIRILSQPCSSYGCRWNWSTFESLHSKKRSRSEQEKLNDLVFVQCNLWLQ

AT3G22220.1 hAT transposon superfamily1.7e-8831.15Show/hide
Query:  RDACWEHC-VLVDATRQKVRCNYCQREF-SGGVYRMKFHLAQIKNKDIVPCAEVPTDVRDHIQGIL-STPKKQRAPKK-----------PKVDMET----
        +D+ W+HC V     R ++RC YC++ F  GG+ R+K HLA  K +  + C +VP +VR  +Q  +  T ++QR  +K           P  ++ET    
Subjt:  RDACWEHC-VLVDATRQKVRCNYCQREF-SGGVYRMKFHLAQIKNKDIVPCAEVPTDVRDHIQGIL-STPKKQRAPKK-----------PKVDMET----

Query:  ---ATNG-QQHSSSASGGIHHGSSGQN--------------------ESNCPSTFPCPSPSAQPPIDDAQKQKKDETDKKVAIFFFHNSIPFSAAKSLYY
             NG +  SS    G   G + Q                     + +  +  P    S +  +    K+++      +  F F     F AA S+  
Subjt:  ---ATNG-QQHSSSASGGIHHGSSGQN--------------------ESNCPSTFPCPSPSAQPPIDDAQKQKKDETDKKVAIFFFHNSIPFSAAKSLYY

Query:  QEMVNAIAEYGVGYKAPSYEKLKSTLLDKVKGDIQTSYKKYRDEWKETGCTILCDSWSDGRTKSFLVISITCSKGTLFLKSVDVSGREDDATYLSDLLET
        Q  ++AI   G G   P++E L+  +L     +++    + +  WK TGC++L    +       L   + C +  +FLKSVD S   D    L +LL+ 
Subjt:  QEMVNAIAEYGVGYKAPSYEKLKSTLLDKVKGDIQTSYKKYRDEWKETGCTILCDSWSDGRTKSFLVISITCSKGTLFLKSVDVSGREDDATYLSDLLET

Query:  IVIEVGVENVVQVITDATASYVYAGRLLMTKYTSLFWSPCVSYCVNQMLEDISKIEWVGTVLEEAKIITRYIYSHAWILNTMRKFTSGKELIRPRISRFV
        +V E+G  NVVQVIT     Y  AG+ LM  Y SL+W PC ++C+++MLE+  K++W+  ++E+A+ +TR IY+H+ +LN MRKFT G ++++P  +   
Subjt:  IVIEVGVENVVQVITDATASYVYAGRLLMTKYTSLFWSPCVSYCVNQMLEDISKIEWVGTVLEEAKIITRYIYSHAWILNTMRKFTSGKELIRPRISRFV

Query:  TNFLSLRSIVILEDSLKHMFNHSEWLSSIYSRRPDVQEIISLLYLDRFWKDAHEAVNISEPLIRILRVVDGD-MPAMGYIYEGIERAKVEIKTYYNGIED
        TNF ++  I  L+  L+ M   SEW    YS+      +   +  + FWK    A +I+ P++R+LR+V  +  PAMGY+Y  + RAK  IKT      +
Subjt:  TNFLSLRSIVILEDSLKHMFNHSEWLSSIYSRRPDVQEIISLLYLDRFWKDAHEAVNISEPLIRILRVVDGD-MPAMGYIYEGIERAKVEIKTYYNGIED

Query:  KYMPIWDTIDRRWNLQLHTTLHAAAAFLNPSIFFNPNFKIDLRIRNGFQEAMLKMVTTDKDKMEITREHPVYVNGQGALGTDFAVLGRTINTPGDWWSGY
        +Y+  W  IDR W   L   L+AA  +LNP  F++ + ++   I     + + K+V     +  + ++   Y N  G  G + A+  R    P +WWS Y
Subjt:  KYMPIWDTIDRRWNLQLHTTLHAAAAFLNPSIFFNPNFKIDLRIRNGFQEAMLKMVTTDKDKMEITREHPVYVNGQGALGTDFAVLGRTINTPGDWWSGY

Query:  GYEIPTLQRAAIRILSQPC-SSYGCRWNWSTFESLHSKKRSRSEQEKLNDLVFVQCNLWLQHIC--WTRDGKYKPVVFDDIDVSLEW
        G     L R AIRILSQ C SS G   N ++   ++  K S  E+++LNDLVFVQ N+ L+ I    + D    P+   +++V  +W
Subjt:  GYEIPTLQRAAIRILSQPC-SSYGCRWNWSTFESLHSKKRSRSEQEKLNDLVFVQCNLWLQHIC--WTRDGKYKPVVFDDIDVSLEW

AT4G15020.1 hAT transposon superfamily6.0e-9733.33Show/hide
Query:  RDACWEHCVLVD-ATRQKVRCNYCQREF-SGGVYRMKFHLAQIKNKDIVPCAEVPTDVRDHIQGIL-STPKKQRAPKKPK--------------------
        +D  W+HC +     R ++RC YC++ F  GG+ R+K HLA  K +  + C +VP DVR  +Q  +  T ++QR   K                      
Subjt:  RDACWEHCVLVD-ATRQKVRCNYCQREF-SGGVYRMKFHLAQIKNKDIVPCAEVPTDVRDHIQGIL-STPKKQRAPKKPK--------------------

Query:  ----------------VDMETATNG---QQHSSSASGGIHHGSSGQNES----NCPSTFPCPSPSAQPPIDDAQKQKKDETDKKVAIFFFHNSIPFSAAK
                        V  E+  +G   Q+   S      +GS+  N      +  +  P    S +  +  + + +++     +  F F     F A  
Subjt:  ----------------VDMETATNG---QQHSSSASGGIHHGSSGQNES----NCPSTFPCPSPSAQPPIDDAQKQKKDETDKKVAIFFFHNSIPFSAAK

Query:  SLYYQEMVNAIAEYGVGYKAPSYEKLKSTLLDKVKGDIQTSYKKYRDEWKETGCTILCDSWSDGRTKSFLVISITCSKGTLFLKSVDVSGREDDATYLSD
        S+ +Q M++AIA  G G  AP+++ L+  +L     ++     + +  WK TGC+IL +  +  +    L   + C +  +FLKSVD S     A  L +
Subjt:  SLYYQEMVNAIAEYGVGYKAPSYEKLKSTLLDKVKGDIQTSYKKYRDEWKETGCTILCDSWSDGRTKSFLVISITCSKGTLFLKSVDVSGREDDATYLSD

Query:  LLETIVIEVGVENVVQVITDATASYVYAGRLLMTKYTSLFWSPCVSYCVNQMLEDISKIEWVGTVLEEAKIITRYIYSHAWILNTMRKFTSGKELIRPRI
        LL  +V EVG  NVVQVIT     YV AG+ LM  Y SL+W PC ++C++QMLE+  K+ W+   +E+A+ ITR++Y+H+ +LN M KFTSG +++ P  
Subjt:  LLETIVIEVGVENVVQVITDATASYVYAGRLLMTKYTSLFWSPCVSYCVNQMLEDISKIEWVGTVLEEAKIITRYIYSHAWILNTMRKFTSGKELIRPRI

Query:  SRFVTNFLSLRSIVILEDSLKHMFNHSEWLSSIYSRRPDVQEIISLLYLDRFWKDAHEAVNISEPLIRILRVVDGD-MPAMGYIYEGIERAKVEIKTYYN
        S   TNF +L  I  L+ +L+ M   +EW    YS  P    +++ L  + FWK      +++ PL+R LR+V  +  PAMGY+Y  + RAK  IKT+  
Subjt:  SRFVTNFLSLRSIVILEDSLKHMFNHSEWLSSIYSRRPDVQEIISLLYLDRFWKDAHEAVNISEPLIRILRVVDGD-MPAMGYIYEGIERAKVEIKTYYN

Query:  GIEDKYMPIWDTIDRRWNLQLHTTLHAAAAFLNPSIFFNPNFKIDLRIRNGFQEAMLKMVTTDKDKMEITREHPVYVNGQGALGTDFAVLGRTINTPGDW
          ED Y+  W  IDR W  Q H  L AA  FLNP +F+N N +I   +     + + ++V  DK + +I +E   Y    G  G + A+  R    P +W
Subjt:  GIEDKYMPIWDTIDRRWNLQLHTTLHAAAAFLNPSIFFNPNFKIDLRIRNGFQEAMLKMVTTDKDKMEITREHPVYVNGQGALGTDFAVLGRTINTPGDW

Query:  WSGYGYEIPTLQRAAIRILSQPC-SSYGCRWNWSTFESLHSKKRSRSEQEKLNDLVFVQCNLWLQHI-CWTRDGKYKPVVFDDIDVSLEW
        WS YG     L R AIRILSQ C SS  CR N    E ++  K S  EQ++L+DLVFVQ N+ L+ +   + D    P+  + IDV  EW
Subjt:  WSGYGYEIPTLQRAAIRILSQPC-SSYGCRWNWSTFESLHSKKRSRSEQEKLNDLVFVQCNLWLQHI-CWTRDGKYKPVVFDDIDVSLEW

AT4G15020.2 hAT transposon superfamily6.0e-9733.33Show/hide
Query:  RDACWEHCVLVD-ATRQKVRCNYCQREF-SGGVYRMKFHLAQIKNKDIVPCAEVPTDVRDHIQGIL-STPKKQRAPKKPK--------------------
        +D  W+HC +     R ++RC YC++ F  GG+ R+K HLA  K +  + C +VP DVR  +Q  +  T ++QR   K                      
Subjt:  RDACWEHCVLVD-ATRQKVRCNYCQREF-SGGVYRMKFHLAQIKNKDIVPCAEVPTDVRDHIQGIL-STPKKQRAPKKPK--------------------

Query:  ----------------VDMETATNG---QQHSSSASGGIHHGSSGQNES----NCPSTFPCPSPSAQPPIDDAQKQKKDETDKKVAIFFFHNSIPFSAAK
                        V  E+  +G   Q+   S      +GS+  N      +  +  P    S +  +  + + +++     +  F F     F A  
Subjt:  ----------------VDMETATNG---QQHSSSASGGIHHGSSGQNES----NCPSTFPCPSPSAQPPIDDAQKQKKDETDKKVAIFFFHNSIPFSAAK

Query:  SLYYQEMVNAIAEYGVGYKAPSYEKLKSTLLDKVKGDIQTSYKKYRDEWKETGCTILCDSWSDGRTKSFLVISITCSKGTLFLKSVDVSGREDDATYLSD
        S+ +Q M++AIA  G G  AP+++ L+  +L     ++     + +  WK TGC+IL +  +  +    L   + C +  +FLKSVD S     A  L +
Subjt:  SLYYQEMVNAIAEYGVGYKAPSYEKLKSTLLDKVKGDIQTSYKKYRDEWKETGCTILCDSWSDGRTKSFLVISITCSKGTLFLKSVDVSGREDDATYLSD

Query:  LLETIVIEVGVENVVQVITDATASYVYAGRLLMTKYTSLFWSPCVSYCVNQMLEDISKIEWVGTVLEEAKIITRYIYSHAWILNTMRKFTSGKELIRPRI
        LL  +V EVG  NVVQVIT     YV AG+ LM  Y SL+W PC ++C++QMLE+  K+ W+   +E+A+ ITR++Y+H+ +LN M KFTSG +++ P  
Subjt:  LLETIVIEVGVENVVQVITDATASYVYAGRLLMTKYTSLFWSPCVSYCVNQMLEDISKIEWVGTVLEEAKIITRYIYSHAWILNTMRKFTSGKELIRPRI

Query:  SRFVTNFLSLRSIVILEDSLKHMFNHSEWLSSIYSRRPDVQEIISLLYLDRFWKDAHEAVNISEPLIRILRVVDGD-MPAMGYIYEGIERAKVEIKTYYN
        S   TNF +L  I  L+ +L+ M   +EW    YS  P    +++ L  + FWK      +++ PL+R LR+V  +  PAMGY+Y  + RAK  IKT+  
Subjt:  SRFVTNFLSLRSIVILEDSLKHMFNHSEWLSSIYSRRPDVQEIISLLYLDRFWKDAHEAVNISEPLIRILRVVDGD-MPAMGYIYEGIERAKVEIKTYYN

Query:  GIEDKYMPIWDTIDRRWNLQLHTTLHAAAAFLNPSIFFNPNFKIDLRIRNGFQEAMLKMVTTDKDKMEITREHPVYVNGQGALGTDFAVLGRTINTPGDW
          ED Y+  W  IDR W  Q H  L AA  FLNP +F+N N +I   +     + + ++V  DK + +I +E   Y    G  G + A+  R    P +W
Subjt:  GIEDKYMPIWDTIDRRWNLQLHTTLHAAAAFLNPSIFFNPNFKIDLRIRNGFQEAMLKMVTTDKDKMEITREHPVYVNGQGALGTDFAVLGRTINTPGDW

Query:  WSGYGYEIPTLQRAAIRILSQPC-SSYGCRWNWSTFESLHSKKRSRSEQEKLNDLVFVQCNLWLQHI-CWTRDGKYKPVVFDDIDVSLEW
        WS YG     L R AIRILSQ C SS  CR N    E ++  K S  EQ++L+DLVFVQ N+ L+ +   + D    P+  + IDV  EW
Subjt:  WSGYGYEIPTLQRAAIRILSQPC-SSYGCRWNWSTFESLHSKKRSRSEQEKLNDLVFVQCNLWLQHI-CWTRDGKYKPVVFDDIDVSLEW


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTTCGAGGAAGGGATGCTTGCTGGGAACATTGTGTCCTTGTTGATGCAACGAGACAGAAAGTTCGATGTAATTATTGCCAACGGGAATTCAGTGGAGGTGTATACAG
GATGAAATTTCATTTGGCTCAAATTAAAAATAAAGATATAGTTCCATGTGCTGAAGTCCCGACCGATGTTCGAGACCATATTCAAGGTATATTAAGCACTCCTAAGAAAC
AGAGGGCACCCAAGAAACCAAAGGTGGATATGGAAACTGCAACAAATGGACAGCAACATAGCTCCTCTGCTAGTGGTGGCATTCATCATGGATCCAGTGGACAGAATGAA
AGCAACTGCCCATCAACGTTTCCGTGCCCTTCACCAAGTGCACAACCACCAATTGATGATGCTCAAAAGCAGAAGAAGGATGAGACTGATAAAAAAGTTGCCATTTTTTT
CTTCCATAATTCTATTCCTTTCAGTGCTGCCAAGTCCTTGTATTATCAGGAAATGGTGAATGCAATAGCAGAATATGGAGTAGGATACAAAGCACCGAGTTATGAGAAAT
TAAAATCTACTCTTTTGGATAAAGTGAAAGGTGACATCCAGACTTCTTACAAAAAATATAGGGATGAATGGAAAGAAACAGGCTGCACTATCCTGTGTGATAGCTGGTCA
GATGGAAGGACTAAATCATTTCTAGTCATTTCCATTACGTGTTCTAAAGGAACGCTGTTTTTGAAGTCAGTTGATGTATCAGGTCGTGAAGATGATGCAACTTACCTGTC
CGACTTGCTTGAGACTATCGTCATCGAGGTTGGAGTGGAGAATGTTGTCCAAGTTATTACAGATGCTACTGCCAGTTATGTCTATGCTGGGAGGCTTCTCATGACCAAGT
ACACTTCCTTATTTTGGTCTCCATGTGTTTCTTATTGTGTCAATCAGATGTTGGAGGACATTAGTAAAATCGAGTGGGTCGGTACCGTGCTGGAGGAGGCAAAGATCATC
ACCCGCTACATTTATAGTCATGCGTGGATTTTGAATACAATGCGAAAATTCACCAGCGGGAAGGAATTGATCAGGCCGAGAATTAGTAGATTTGTGACTAATTTTCTCTC
CTTGAGGTCTATTGTGATTCTTGAGGACAGTCTCAAGCACATGTTCAATCATTCAGAGTGGCTGTCCTCGATTTATAGCAGGCGTCCCGACGTGCAGGAAATTATTTCCT
TGCTGTATTTGGATAGATTTTGGAAGGATGCACATGAAGCTGTCAACATTTCTGAACCACTTATTAGAATTCTGAGAGTCGTTGATGGAGACATGCCTGCCATGGGCTAT
ATATATGAAGGAATAGAGAGGGCAAAGGTTGAAATCAAAACATATTACAATGGCATTGAGGATAAGTACATGCCTATTTGGGACACAATCGACCGGAGATGGAATTTGCA
GCTTCACACGACATTGCACGCAGCAGCAGCGTTCCTTAACCCGTCCATTTTTTTCAATCCAAACTTTAAGATTGATCTGAGAATTAGGAATGGATTTCAGGAAGCCATGT
TGAAGATGGTGACGACGGATAAAGATAAAATGGAGATCACCAGAGAACATCCTGTCTATGTAAATGGACAAGGTGCTCTTGGAACTGACTTTGCTGTCTTGGGGAGAACT
ATAAATACCCCAGGTGATTGGTGGTCCGGGTACGGTTACGAAATCCCGACGCTCCAGAGAGCAGCGATACGAATACTAAGCCAACCCTGTAGTTCTTATGGGTGCAGGTG
GAACTGGAGCACGTTCGAAAGTTTACATTCGAAGAAGCGCAGTAGATCCGAACAGGAAAAGTTGAATGATTTAGTGTTTGTACAGTGCAATCTTTGGTTGCAACACATTT
GTTGGACTCGGGACGGTAAATATAAACCTGTTGTATTTGATGATATAGATGTAAGTTTAGAATGGCCTACGGAGTTTGAATCCTCAGCTCCTGTTTTAGATGATTCATGG
TTGGATAATTTGCCTCTTGAATGTAGGGGTAGCCCTTAA
mRNA sequenceShow/hide mRNA sequence
ATGGTTCGAGGAAGGGATGCTTGCTGGGAACATTGTGTCCTTGTTGATGCAACGAGACAGAAAGTTCGATGTAATTATTGCCAACGGGAATTCAGTGGAGGTGTATACAG
GATGAAATTTCATTTGGCTCAAATTAAAAATAAAGATATAGTTCCATGTGCTGAAGTCCCGACCGATGTTCGAGACCATATTCAAGGTATATTAAGCACTCCTAAGAAAC
AGAGGGCACCCAAGAAACCAAAGGTGGATATGGAAACTGCAACAAATGGACAGCAACATAGCTCCTCTGCTAGTGGTGGCATTCATCATGGATCCAGTGGACAGAATGAA
AGCAACTGCCCATCAACGTTTCCGTGCCCTTCACCAAGTGCACAACCACCAATTGATGATGCTCAAAAGCAGAAGAAGGATGAGACTGATAAAAAAGTTGCCATTTTTTT
CTTCCATAATTCTATTCCTTTCAGTGCTGCCAAGTCCTTGTATTATCAGGAAATGGTGAATGCAATAGCAGAATATGGAGTAGGATACAAAGCACCGAGTTATGAGAAAT
TAAAATCTACTCTTTTGGATAAAGTGAAAGGTGACATCCAGACTTCTTACAAAAAATATAGGGATGAATGGAAAGAAACAGGCTGCACTATCCTGTGTGATAGCTGGTCA
GATGGAAGGACTAAATCATTTCTAGTCATTTCCATTACGTGTTCTAAAGGAACGCTGTTTTTGAAGTCAGTTGATGTATCAGGTCGTGAAGATGATGCAACTTACCTGTC
CGACTTGCTTGAGACTATCGTCATCGAGGTTGGAGTGGAGAATGTTGTCCAAGTTATTACAGATGCTACTGCCAGTTATGTCTATGCTGGGAGGCTTCTCATGACCAAGT
ACACTTCCTTATTTTGGTCTCCATGTGTTTCTTATTGTGTCAATCAGATGTTGGAGGACATTAGTAAAATCGAGTGGGTCGGTACCGTGCTGGAGGAGGCAAAGATCATC
ACCCGCTACATTTATAGTCATGCGTGGATTTTGAATACAATGCGAAAATTCACCAGCGGGAAGGAATTGATCAGGCCGAGAATTAGTAGATTTGTGACTAATTTTCTCTC
CTTGAGGTCTATTGTGATTCTTGAGGACAGTCTCAAGCACATGTTCAATCATTCAGAGTGGCTGTCCTCGATTTATAGCAGGCGTCCCGACGTGCAGGAAATTATTTCCT
TGCTGTATTTGGATAGATTTTGGAAGGATGCACATGAAGCTGTCAACATTTCTGAACCACTTATTAGAATTCTGAGAGTCGTTGATGGAGACATGCCTGCCATGGGCTAT
ATATATGAAGGAATAGAGAGGGCAAAGGTTGAAATCAAAACATATTACAATGGCATTGAGGATAAGTACATGCCTATTTGGGACACAATCGACCGGAGATGGAATTTGCA
GCTTCACACGACATTGCACGCAGCAGCAGCGTTCCTTAACCCGTCCATTTTTTTCAATCCAAACTTTAAGATTGATCTGAGAATTAGGAATGGATTTCAGGAAGCCATGT
TGAAGATGGTGACGACGGATAAAGATAAAATGGAGATCACCAGAGAACATCCTGTCTATGTAAATGGACAAGGTGCTCTTGGAACTGACTTTGCTGTCTTGGGGAGAACT
ATAAATACCCCAGGTGATTGGTGGTCCGGGTACGGTTACGAAATCCCGACGCTCCAGAGAGCAGCGATACGAATACTAAGCCAACCCTGTAGTTCTTATGGGTGCAGGTG
GAACTGGAGCACGTTCGAAAGTTTACATTCGAAGAAGCGCAGTAGATCCGAACAGGAAAAGTTGAATGATTTAGTGTTTGTACAGTGCAATCTTTGGTTGCAACACATTT
GTTGGACTCGGGACGGTAAATATAAACCTGTTGTATTTGATGATATAGATGTAAGTTTAGAATGGCCTACGGAGTTTGAATCCTCAGCTCCTGTTTTAGATGATTCATGG
TTGGATAATTTGCCTCTTGAATGTAGGGGTAGCCCTTAA
Protein sequenceShow/hide protein sequence
MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPCAEVPTDVRDHIQGILSTPKKQRAPKKPKVDMETATNGQQHSSSASGGIHHGSSGQNE
SNCPSTFPCPSPSAQPPIDDAQKQKKDETDKKVAIFFFHNSIPFSAAKSLYYQEMVNAIAEYGVGYKAPSYEKLKSTLLDKVKGDIQTSYKKYRDEWKETGCTILCDSWS
DGRTKSFLVISITCSKGTLFLKSVDVSGREDDATYLSDLLETIVIEVGVENVVQVITDATASYVYAGRLLMTKYTSLFWSPCVSYCVNQMLEDISKIEWVGTVLEEAKII
TRYIYSHAWILNTMRKFTSGKELIRPRISRFVTNFLSLRSIVILEDSLKHMFNHSEWLSSIYSRRPDVQEIISLLYLDRFWKDAHEAVNISEPLIRILRVVDGDMPAMGY
IYEGIERAKVEIKTYYNGIEDKYMPIWDTIDRRWNLQLHTTLHAAAAFLNPSIFFNPNFKIDLRIRNGFQEAMLKMVTTDKDKMEITREHPVYVNGQGALGTDFAVLGRT
INTPGDWWSGYGYEIPTLQRAAIRILSQPCSSYGCRWNWSTFESLHSKKRSRSEQEKLNDLVFVQCNLWLQHICWTRDGKYKPVVFDDIDVSLEWPTEFESSAPVLDDSW
LDNLPLECRGSP