; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cla97C10G191514 (gene) of Watermelon (97103) v2.5 genome

Gene IDCla97C10G191514
OrganismCitrullus lanatus subsp. vulgaris cv. 97103 (Watermelon (97103) v2.5)
DescriptionTransposon Tf2-8 polyprotein
Genome locationCla97Chr10:12144441..12146468
RNA-Seq ExpressionCla97C10G191514
SyntenyCla97C10G191514
Gene Ontology termsGO:0006278 - RNA-dependent DNA biosynthetic process (biological process)
GO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0090305 - nucleic acid phosphodiester bond hydrolysis (biological process)
GO:0005737 - cytoplasm (cellular component)
GO:0043231 - intracellular membrane-bounded organelle (cellular component)
GO:0003676 - nucleic acid binding (molecular function)
GO:0003964 - RNA-directed DNA polymerase activity (molecular function)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
GO:0004519 - endonuclease activity (molecular function)
InterPro domainsIPR000953 - Chromo/chromo shadow domain
IPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR016197 - Chromo-like domain superfamily
IPR023780 - Chromo domain
IPR036397 - Ribonuclease H superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0033203.1 polyprotein [Cucumis melo var. makuwa]2.5e-16574.8Show/hide
Query:  LPAGLLEPLPVAEEPWESVTMDFIVALPKSRSCGSIMVVVDRFSKYATFIPCSANVKVDEAARLFFKNIVKLWGIPRSIISDRDPRFTGKFWRELFKLMG
        LPAGLLEPLP+AE+PW+SVTMDFIVALPKS   G+IMVVVDRFSKYATFIPCS ++KVDEAARLFFKN+VKLWGIP+SIISDRDPRFTGKFWRELFKL+G
Subjt:  LPAGLLEPLPVAEEPWESVTMDFIVALPKSRSCGSIMVVVDRFSKYATFIPCSANVKVDEAARLFFKNIVKLWGIPRSIISDRDPRFTGKFWRELFKLMG

Query:  TELNFSTSFHPQSDGQTERVNALLEQYLRHYASAHQKDWADLLDIAQFSYNLQRSESTGKSPFEIVNWRQPNTPKDLTSDYSGPNPSAYKMAKEWKEKQD
        T+LNFSTSFHPQSDGQTER+NALLEQYLRHY SAHQ DWA LLD+AQFSYNLQRSESTGKSPFEI+  +QPNTP  L + Y GPNPSA+  AK+W E+QD
Subjt:  TELNFSTSFHPQSDGQTERVNALLEQYLRHYASAHQKDWADLLDIAQFSYNLQRSESTGKSPFEIVNWRQPNTPKDLTSDYSGPNPSAYKMAKEWKEKQD

Query:  IARACLYKAAKKMKKWADLKRRPKKYNEGEQVMVKLLPNQFKSLRQVHKGLVRKYEGPFPILERVGKAAYRVQLPARLKIHDVFHVSMLKPFHQDMEDPD
        I+RACL KAA++MKKWAD KRRPK+Y  G++V+VKLLPNQFKSLR+VHKGLVR+YEGPF I+ERVGKAAY+V+LP +LKIH+VFHVSMLKPFH+D EDP+
Subjt:  IARACLYKAAKKMKKWADLKRRPKKYNEGEQVMVKLLPNQFKSLRQVHKGLVRKYEGPFPILERVGKAAYRVQLPARLKIHDVFHVSMLKPFHQDMEDPD

Query:  RAISNRAPMGVTTEYDKNVETILAHRRVSSKGVPSHFEYLVKWTRLPDSEASWEKEGHLWQFADKVSNY
        R+ ++RAP GV TE+D+ ++ ILA R++  +GVPSH EYL+ W  LP+SEASWE++  LWQF  ++  +
Subjt:  RAISNRAPMGVTTEYDKNVETILAHRRVSSKGVPSHFEYLVKWTRLPDSEASWEKEGHLWQFADKVSNY

KAA0051289.1 reverse transcriptase [Cucumis melo var. makuwa]2.7e-16776.15Show/hide
Query:  LPAGLLEPLPVAEEPWESVTMDFIVALPKSRSCGSIMVVVDRFSKYATFIPCSANVKVDEAARLFFKNIVKLWGIPRSIISDRDPRFTGKFWRELFKLMG
        LPAGLLEPLP+AE+PW+SVTMDFIVALPKS   G+IMVVVDRFSKYATFIPCS +VKVDEAARLFFKN+VKLWGIP+SIISDRDPRFTGKFWRELFKLMG
Subjt:  LPAGLLEPLPVAEEPWESVTMDFIVALPKSRSCGSIMVVVDRFSKYATFIPCSANVKVDEAARLFFKNIVKLWGIPRSIISDRDPRFTGKFWRELFKLMG

Query:  TELNFSTSFHPQSDGQTERVNALLEQYLRHYASAHQKDWADLLDIAQFSYNLQRSESTGKSPFEIVNWRQPNTPKDLTSDYSGPNPSAYKMAKEWKEKQD
        T+LNFSTSFHPQSDGQTER+NALLEQYLRHY SAHQKDWA LLD+AQFSYNLQRSESTGKSPFEI+  +QPNTP  L + Y GPNPSA+  AK+W E+QD
Subjt:  TELNFSTSFHPQSDGQTERVNALLEQYLRHYASAHQKDWADLLDIAQFSYNLQRSESTGKSPFEIVNWRQPNTPKDLTSDYSGPNPSAYKMAKEWKEKQD

Query:  IARACLYKAAKKMKKWADLKRRPKKYNEGEQVMVKLLPNQFKSLRQVHKGLVRKYEGPFPILERVGKAAYRVQLPARLKIHDVFHVSMLKPFHQDMEDPD
        I+RACL KAA++MKKWAD KRRPK+Y  G++V+VKLLPNQFKSLR+VHKGLVR+YEGPF I+ERVGKAAY+V+LP RLKIH+VFHVSMLKPFH+D EDP+
Subjt:  IARACLYKAAKKMKKWADLKRRPKKYNEGEQVMVKLLPNQFKSLRQVHKGLVRKYEGPFPILERVGKAAYRVQLPARLKIHDVFHVSMLKPFHQDMEDPD

Query:  RAISNRAPMGVTTEYDKNVETILAHRRVSSKGVPSHFEYLVKWTRLPDSEASWEKEGHLWQFADKVSNY
        R+ ++RAP GV TE+D+ ++ ILA R++  +GVPSH EYL+ W  LP+SEASWE+E  LWQF  ++  +
Subjt:  RAISNRAPMGVTTEYDKNVETILAHRRVSSKGVPSHFEYLVKWTRLPDSEASWEKEGHLWQFADKVSNY

KAA0054276.1 polyprotein [Cucumis melo var. makuwa]3.3e-16574.8Show/hide
Query:  LPAGLLEPLPVAEEPWESVTMDFIVALPKSRSCGSIMVVVDRFSKYATFIPCSANVKVDEAARLFFKNIVKLWGIPRSIISDRDPRFTGKFWRELFKLMG
        LPAGLLEPLP+AE+PW+S+TMDFIVALPKS   G+IMVVVDRFSKYATFIPCS +VKVDEAARLFFKN+VKLWGIP+SIISDRDPRFTGKFWRELFKLMG
Subjt:  LPAGLLEPLPVAEEPWESVTMDFIVALPKSRSCGSIMVVVDRFSKYATFIPCSANVKVDEAARLFFKNIVKLWGIPRSIISDRDPRFTGKFWRELFKLMG

Query:  TELNFSTSFHPQSDGQTERVNALLEQYLRHYASAHQKDWADLLDIAQFSYNLQRSESTGKSPFEIVNWRQPNTPKDLTSDYSGPNPSAYKMAKEWKEKQD
        T+LNFSTSFHPQSDGQTER+NALLEQYLRHY SAHQKDW  LLD+AQFSYNLQRSE+TGKSPFE++  +QPNTP  L + Y GPNPSA+  AK+W E+QD
Subjt:  TELNFSTSFHPQSDGQTERVNALLEQYLRHYASAHQKDWADLLDIAQFSYNLQRSESTGKSPFEIVNWRQPNTPKDLTSDYSGPNPSAYKMAKEWKEKQD

Query:  IARACLYKAAKKMKKWADLKRRPKKYNEGEQVMVKLLPNQFKSLRQVHKGLVRKYEGPFPILERVGKAAYRVQLPARLKIHDVFHVSMLKPFHQDMEDPD
        I+RACL KAA++MKKWAD KRRPK+Y  G++V+VKLLPNQFKSLR+VHKGLVR+YEGPF I+ERVGKAAY+V+LP RLKIH+VFHVSMLKPFH+D EDP+
Subjt:  IARACLYKAAKKMKKWADLKRRPKKYNEGEQVMVKLLPNQFKSLRQVHKGLVRKYEGPFPILERVGKAAYRVQLPARLKIHDVFHVSMLKPFHQDMEDPD

Query:  RAISNRAPMGVTTEYDKNVETILAHRRVSSKGVPSHFEYLVKWTRLPDSEASWEKEGHLWQFADKVSNY
        R+ ++RAP GV  E+D+ ++ ILA R++  +GVPSH EYL+ W  LP+SEASWE+E  LWQF  ++  +
Subjt:  RAISNRAPMGVTTEYDKNVETILAHRRVSSKGVPSHFEYLVKWTRLPDSEASWEKEGHLWQFADKVSNY

KAA0065760.1 polyprotein [Cucumis melo var. makuwa]2.7e-16775.88Show/hide
Query:  LPAGLLEPLPVAEEPWESVTMDFIVALPKSRSCGSIMVVVDRFSKYATFIPCSANVKVDEAARLFFKNIVKLWGIPRSIISDRDPRFTGKFWRELFKLMG
        LPAGLLEPLP+AE+PW+SVTMDFIVALPKS   G+IMVVVDRFSKYATFIPCS +VKVDEAARLFFKN+VKLWGIP+SIISDRDPRFTGKFWRELFKLMG
Subjt:  LPAGLLEPLPVAEEPWESVTMDFIVALPKSRSCGSIMVVVDRFSKYATFIPCSANVKVDEAARLFFKNIVKLWGIPRSIISDRDPRFTGKFWRELFKLMG

Query:  TELNFSTSFHPQSDGQTERVNALLEQYLRHYASAHQKDWADLLDIAQFSYNLQRSESTGKSPFEIVNWRQPNTPKDLTSDYSGPNPSAYKMAKEWKEKQD
        T+LNFSTSFHPQSDGQTER+NALLEQYLRHY SAHQKDWA LLD+AQFSYNLQRSESTGKSPFEI+  +QPNTP  L + Y GPNPSA+  AK+W E+QD
Subjt:  TELNFSTSFHPQSDGQTERVNALLEQYLRHYASAHQKDWADLLDIAQFSYNLQRSESTGKSPFEIVNWRQPNTPKDLTSDYSGPNPSAYKMAKEWKEKQD

Query:  IARACLYKAAKKMKKWADLKRRPKKYNEGEQVMVKLLPNQFKSLRQVHKGLVRKYEGPFPILERVGKAAYRVQLPARLKIHDVFHVSMLKPFHQDMEDPD
        I+RACL KAA++MKKWAD KRRPK+Y  G++V+VKLLPNQFKSLR+VHKGLVR+YEGPF I+ERVGKAAY+V+LP RLKIH+VFH+SMLKPFH+D EDP+
Subjt:  IARACLYKAAKKMKKWADLKRRPKKYNEGEQVMVKLLPNQFKSLRQVHKGLVRKYEGPFPILERVGKAAYRVQLPARLKIHDVFHVSMLKPFHQDMEDPD

Query:  RAISNRAPMGVTTEYDKNVETILAHRRVSSKGVPSHFEYLVKWTRLPDSEASWEKEGHLWQFADKVSNY
        R+ ++RAP GV TE+D+ ++ ILA R++  +GVPSH EYL+ W  LP+SEASWE+E  LWQF  ++  +
Subjt:  RAISNRAPMGVTTEYDKNVETILAHRRVSSKGVPSHFEYLVKWTRLPDSEASWEKEGHLWQFADKVSNY

XP_008460615.1 PREDICTED: uncharacterized protein LOC103499392 [Cucumis melo]7.3e-16575.41Show/hide
Query:  LPAGLLEPLPVAEEPWESVTMDFIVALPKSRSCGSIMVVVDRFSKYATFIPCSANVKVDEAARLFFKNIVKLWGIPRSIISDRDPRFTGKFWRELFKLMG
        LPAGLLEPLP+AE+PW+S+TMDFIVALPKS   G+IMVVVDRFSKYATFIPCS +VKVDEAARLFFKN+VKLWGIP+SIISDRDPRFTGKFWRELFKLMG
Subjt:  LPAGLLEPLPVAEEPWESVTMDFIVALPKSRSCGSIMVVVDRFSKYATFIPCSANVKVDEAARLFFKNIVKLWGIPRSIISDRDPRFTGKFWRELFKLMG

Query:  TELNFSTSFHPQSDGQTERVNALLEQYLRHYASAHQKDWADLLDIAQFSYNLQRSESTGKSPFEIVNWRQPNTPKDLTSDYSGPNPSAYKMAKEWKEKQD
        T+LNFSTSFHPQSDGQTER+NALLEQYLRHY SAHQKDW  LLD+AQFSYNLQRSE+TGKSPFE++  +QPNTP  L + Y GPNPSA+  AK+W E+QD
Subjt:  TELNFSTSFHPQSDGQTERVNALLEQYLRHYASAHQKDWADLLDIAQFSYNLQRSESTGKSPFEIVNWRQPNTPKDLTSDYSGPNPSAYKMAKEWKEKQD

Query:  IARACLYKAAKKMKKWADLKRRPKKYNEGEQVMVKLLPNQFKSLRQVHKGLVRKYEGPFPILERVGKAAYRVQLPARLKIHDVFHVSMLKPFHQDMEDPD
        I+RACL KAA++MKKWAD KRRPK+Y  G++V+VKLLPNQFKSLR+VHKGLVR+YEGPF I+ERVGKAAY+V+LP RLKIH+VFHVSMLKPFH+D EDP+
Subjt:  IARACLYKAAKKMKKWADLKRRPKKYNEGEQVMVKLLPNQFKSLRQVHKGLVRKYEGPFPILERVGKAAYRVQLPARLKIHDVFHVSMLKPFHQDMEDPD

Query:  RAISNRAPMGVTTEYDKNVETILAHRRVSSKGVPSHFEYLVKWTRLPDSEASWEKEGHLWQFADKV
        R+ ++RAP GV  E+D+ ++ ILA R++  +GVPSH EYL+ W  LP+SEASWE+E  LWQF  ++
Subjt:  RAISNRAPMGVTTEYDKNVETILAHRRVSSKGVPSHFEYLVKWTRLPDSEASWEKEGHLWQFADKV

TrEMBL top hitse value%identityAlignment
A0A1S3CE17 uncharacterized protein LOC1034993923.5e-16575.41Show/hide
Query:  LPAGLLEPLPVAEEPWESVTMDFIVALPKSRSCGSIMVVVDRFSKYATFIPCSANVKVDEAARLFFKNIVKLWGIPRSIISDRDPRFTGKFWRELFKLMG
        LPAGLLEPLP+AE+PW+S+TMDFIVALPKS   G+IMVVVDRFSKYATFIPCS +VKVDEAARLFFKN+VKLWGIP+SIISDRDPRFTGKFWRELFKLMG
Subjt:  LPAGLLEPLPVAEEPWESVTMDFIVALPKSRSCGSIMVVVDRFSKYATFIPCSANVKVDEAARLFFKNIVKLWGIPRSIISDRDPRFTGKFWRELFKLMG

Query:  TELNFSTSFHPQSDGQTERVNALLEQYLRHYASAHQKDWADLLDIAQFSYNLQRSESTGKSPFEIVNWRQPNTPKDLTSDYSGPNPSAYKMAKEWKEKQD
        T+LNFSTSFHPQSDGQTER+NALLEQYLRHY SAHQKDW  LLD+AQFSYNLQRSE+TGKSPFE++  +QPNTP  L + Y GPNPSA+  AK+W E+QD
Subjt:  TELNFSTSFHPQSDGQTERVNALLEQYLRHYASAHQKDWADLLDIAQFSYNLQRSESTGKSPFEIVNWRQPNTPKDLTSDYSGPNPSAYKMAKEWKEKQD

Query:  IARACLYKAAKKMKKWADLKRRPKKYNEGEQVMVKLLPNQFKSLRQVHKGLVRKYEGPFPILERVGKAAYRVQLPARLKIHDVFHVSMLKPFHQDMEDPD
        I+RACL KAA++MKKWAD KRRPK+Y  G++V+VKLLPNQFKSLR+VHKGLVR+YEGPF I+ERVGKAAY+V+LP RLKIH+VFHVSMLKPFH+D EDP+
Subjt:  IARACLYKAAKKMKKWADLKRRPKKYNEGEQVMVKLLPNQFKSLRQVHKGLVRKYEGPFPILERVGKAAYRVQLPARLKIHDVFHVSMLKPFHQDMEDPD

Query:  RAISNRAPMGVTTEYDKNVETILAHRRVSSKGVPSHFEYLVKWTRLPDSEASWEKEGHLWQFADKV
        R+ ++RAP GV  E+D+ ++ ILA R++  +GVPSH EYL+ W  LP+SEASWE+E  LWQF  ++
Subjt:  RAISNRAPMGVTTEYDKNVETILAHRRVSSKGVPSHFEYLVKWTRLPDSEASWEKEGHLWQFADKV

A0A5A7SS05 Polyprotein1.2e-16574.8Show/hide
Query:  LPAGLLEPLPVAEEPWESVTMDFIVALPKSRSCGSIMVVVDRFSKYATFIPCSANVKVDEAARLFFKNIVKLWGIPRSIISDRDPRFTGKFWRELFKLMG
        LPAGLLEPLP+AE+PW+SVTMDFIVALPKS   G+IMVVVDRFSKYATFIPCS ++KVDEAARLFFKN+VKLWGIP+SIISDRDPRFTGKFWRELFKL+G
Subjt:  LPAGLLEPLPVAEEPWESVTMDFIVALPKSRSCGSIMVVVDRFSKYATFIPCSANVKVDEAARLFFKNIVKLWGIPRSIISDRDPRFTGKFWRELFKLMG

Query:  TELNFSTSFHPQSDGQTERVNALLEQYLRHYASAHQKDWADLLDIAQFSYNLQRSESTGKSPFEIVNWRQPNTPKDLTSDYSGPNPSAYKMAKEWKEKQD
        T+LNFSTSFHPQSDGQTER+NALLEQYLRHY SAHQ DWA LLD+AQFSYNLQRSESTGKSPFEI+  +QPNTP  L + Y GPNPSA+  AK+W E+QD
Subjt:  TELNFSTSFHPQSDGQTERVNALLEQYLRHYASAHQKDWADLLDIAQFSYNLQRSESTGKSPFEIVNWRQPNTPKDLTSDYSGPNPSAYKMAKEWKEKQD

Query:  IARACLYKAAKKMKKWADLKRRPKKYNEGEQVMVKLLPNQFKSLRQVHKGLVRKYEGPFPILERVGKAAYRVQLPARLKIHDVFHVSMLKPFHQDMEDPD
        I+RACL KAA++MKKWAD KRRPK+Y  G++V+VKLLPNQFKSLR+VHKGLVR+YEGPF I+ERVGKAAY+V+LP +LKIH+VFHVSMLKPFH+D EDP+
Subjt:  IARACLYKAAKKMKKWADLKRRPKKYNEGEQVMVKLLPNQFKSLRQVHKGLVRKYEGPFPILERVGKAAYRVQLPARLKIHDVFHVSMLKPFHQDMEDPD

Query:  RAISNRAPMGVTTEYDKNVETILAHRRVSSKGVPSHFEYLVKWTRLPDSEASWEKEGHLWQFADKVSNY
        R+ ++RAP GV TE+D+ ++ ILA R++  +GVPSH EYL+ W  LP+SEASWE++  LWQF  ++  +
Subjt:  RAISNRAPMGVTTEYDKNVETILAHRRVSSKGVPSHFEYLVKWTRLPDSEASWEKEGHLWQFADKVSNY

A0A5A7UCY6 Reverse transcriptase1.3e-16776.15Show/hide
Query:  LPAGLLEPLPVAEEPWESVTMDFIVALPKSRSCGSIMVVVDRFSKYATFIPCSANVKVDEAARLFFKNIVKLWGIPRSIISDRDPRFTGKFWRELFKLMG
        LPAGLLEPLP+AE+PW+SVTMDFIVALPKS   G+IMVVVDRFSKYATFIPCS +VKVDEAARLFFKN+VKLWGIP+SIISDRDPRFTGKFWRELFKLMG
Subjt:  LPAGLLEPLPVAEEPWESVTMDFIVALPKSRSCGSIMVVVDRFSKYATFIPCSANVKVDEAARLFFKNIVKLWGIPRSIISDRDPRFTGKFWRELFKLMG

Query:  TELNFSTSFHPQSDGQTERVNALLEQYLRHYASAHQKDWADLLDIAQFSYNLQRSESTGKSPFEIVNWRQPNTPKDLTSDYSGPNPSAYKMAKEWKEKQD
        T+LNFSTSFHPQSDGQTER+NALLEQYLRHY SAHQKDWA LLD+AQFSYNLQRSESTGKSPFEI+  +QPNTP  L + Y GPNPSA+  AK+W E+QD
Subjt:  TELNFSTSFHPQSDGQTERVNALLEQYLRHYASAHQKDWADLLDIAQFSYNLQRSESTGKSPFEIVNWRQPNTPKDLTSDYSGPNPSAYKMAKEWKEKQD

Query:  IARACLYKAAKKMKKWADLKRRPKKYNEGEQVMVKLLPNQFKSLRQVHKGLVRKYEGPFPILERVGKAAYRVQLPARLKIHDVFHVSMLKPFHQDMEDPD
        I+RACL KAA++MKKWAD KRRPK+Y  G++V+VKLLPNQFKSLR+VHKGLVR+YEGPF I+ERVGKAAY+V+LP RLKIH+VFHVSMLKPFH+D EDP+
Subjt:  IARACLYKAAKKMKKWADLKRRPKKYNEGEQVMVKLLPNQFKSLRQVHKGLVRKYEGPFPILERVGKAAYRVQLPARLKIHDVFHVSMLKPFHQDMEDPD

Query:  RAISNRAPMGVTTEYDKNVETILAHRRVSSKGVPSHFEYLVKWTRLPDSEASWEKEGHLWQFADKVSNY
        R+ ++RAP GV TE+D+ ++ ILA R++  +GVPSH EYL+ W  LP+SEASWE+E  LWQF  ++  +
Subjt:  RAISNRAPMGVTTEYDKNVETILAHRRVSSKGVPSHFEYLVKWTRLPDSEASWEKEGHLWQFADKVSNY

A0A5A7UJ25 Polyprotein1.6e-16574.8Show/hide
Query:  LPAGLLEPLPVAEEPWESVTMDFIVALPKSRSCGSIMVVVDRFSKYATFIPCSANVKVDEAARLFFKNIVKLWGIPRSIISDRDPRFTGKFWRELFKLMG
        LPAGLLEPLP+AE+PW+S+TMDFIVALPKS   G+IMVVVDRFSKYATFIPCS +VKVDEAARLFFKN+VKLWGIP+SIISDRDPRFTGKFWRELFKLMG
Subjt:  LPAGLLEPLPVAEEPWESVTMDFIVALPKSRSCGSIMVVVDRFSKYATFIPCSANVKVDEAARLFFKNIVKLWGIPRSIISDRDPRFTGKFWRELFKLMG

Query:  TELNFSTSFHPQSDGQTERVNALLEQYLRHYASAHQKDWADLLDIAQFSYNLQRSESTGKSPFEIVNWRQPNTPKDLTSDYSGPNPSAYKMAKEWKEKQD
        T+LNFSTSFHPQSDGQTER+NALLEQYLRHY SAHQKDW  LLD+AQFSYNLQRSE+TGKSPFE++  +QPNTP  L + Y GPNPSA+  AK+W E+QD
Subjt:  TELNFSTSFHPQSDGQTERVNALLEQYLRHYASAHQKDWADLLDIAQFSYNLQRSESTGKSPFEIVNWRQPNTPKDLTSDYSGPNPSAYKMAKEWKEKQD

Query:  IARACLYKAAKKMKKWADLKRRPKKYNEGEQVMVKLLPNQFKSLRQVHKGLVRKYEGPFPILERVGKAAYRVQLPARLKIHDVFHVSMLKPFHQDMEDPD
        I+RACL KAA++MKKWAD KRRPK+Y  G++V+VKLLPNQFKSLR+VHKGLVR+YEGPF I+ERVGKAAY+V+LP RLKIH+VFHVSMLKPFH+D EDP+
Subjt:  IARACLYKAAKKMKKWADLKRRPKKYNEGEQVMVKLLPNQFKSLRQVHKGLVRKYEGPFPILERVGKAAYRVQLPARLKIHDVFHVSMLKPFHQDMEDPD

Query:  RAISNRAPMGVTTEYDKNVETILAHRRVSSKGVPSHFEYLVKWTRLPDSEASWEKEGHLWQFADKVSNY
        R+ ++RAP GV  E+D+ ++ ILA R++  +GVPSH EYL+ W  LP+SEASWE+E  LWQF  ++  +
Subjt:  RAISNRAPMGVTTEYDKNVETILAHRRVSSKGVPSHFEYLVKWTRLPDSEASWEKEGHLWQFADKVSNY

A0A5A7VEX8 Polyprotein1.3e-16775.88Show/hide
Query:  LPAGLLEPLPVAEEPWESVTMDFIVALPKSRSCGSIMVVVDRFSKYATFIPCSANVKVDEAARLFFKNIVKLWGIPRSIISDRDPRFTGKFWRELFKLMG
        LPAGLLEPLP+AE+PW+SVTMDFIVALPKS   G+IMVVVDRFSKYATFIPCS +VKVDEAARLFFKN+VKLWGIP+SIISDRDPRFTGKFWRELFKLMG
Subjt:  LPAGLLEPLPVAEEPWESVTMDFIVALPKSRSCGSIMVVVDRFSKYATFIPCSANVKVDEAARLFFKNIVKLWGIPRSIISDRDPRFTGKFWRELFKLMG

Query:  TELNFSTSFHPQSDGQTERVNALLEQYLRHYASAHQKDWADLLDIAQFSYNLQRSESTGKSPFEIVNWRQPNTPKDLTSDYSGPNPSAYKMAKEWKEKQD
        T+LNFSTSFHPQSDGQTER+NALLEQYLRHY SAHQKDWA LLD+AQFSYNLQRSESTGKSPFEI+  +QPNTP  L + Y GPNPSA+  AK+W E+QD
Subjt:  TELNFSTSFHPQSDGQTERVNALLEQYLRHYASAHQKDWADLLDIAQFSYNLQRSESTGKSPFEIVNWRQPNTPKDLTSDYSGPNPSAYKMAKEWKEKQD

Query:  IARACLYKAAKKMKKWADLKRRPKKYNEGEQVMVKLLPNQFKSLRQVHKGLVRKYEGPFPILERVGKAAYRVQLPARLKIHDVFHVSMLKPFHQDMEDPD
        I+RACL KAA++MKKWAD KRRPK+Y  G++V+VKLLPNQFKSLR+VHKGLVR+YEGPF I+ERVGKAAY+V+LP RLKIH+VFH+SMLKPFH+D EDP+
Subjt:  IARACLYKAAKKMKKWADLKRRPKKYNEGEQVMVKLLPNQFKSLRQVHKGLVRKYEGPFPILERVGKAAYRVQLPARLKIHDVFHVSMLKPFHQDMEDPD

Query:  RAISNRAPMGVTTEYDKNVETILAHRRVSSKGVPSHFEYLVKWTRLPDSEASWEKEGHLWQFADKVSNY
        R+ ++RAP GV TE+D+ ++ ILA R++  +GVPSH EYL+ W  LP+SEASWE+E  LWQF  ++  +
Subjt:  RAISNRAPMGVTTEYDKNVETILAHRRVSSKGVPSHFEYLVKWTRLPDSEASWEKEGHLWQFADKVSNY

SwissProt top hitse value%identityAlignment
P0CT34 Transposon Tf2-1 polyprotein6.6e-4434.42Show/hide
Query:  PAGLLEPLPVAEEPWESVTMDFIVALPKSRSCGSIMVVVDRFSKYATFIPCSANVKVDEAARLFFKNIVKLWGIPRSIISDRDPRFTGKFWRELFKLMGT
        P G L+P+P +E PWES++MDFI ALP+S    ++ VVVDRFSK A  +PC+ ++  ++ AR+F + ++  +G P+ II+D D  FT + W++       
Subjt:  PAGLLEPLPVAEEPWESVTMDFIVALPKSRSCGSIMVVVDRFSKYATFIPCSANVKVDEAARLFFKNIVKLWGIPRSIISDRDPRFTGKFWRELFKLMGT

Query:  ELNFSTSFHPQSDGQTERVNALLEQYLRHYASAHQKDWADLLDIAQFSYNLQRSESTGKSPFEIVNWRQPNTP-------KDLTSDYSGPNPSAYKMAKE
         + FS  + PQ+DGQTER N  +E+ LR   S H   W D + + Q SYN     +T  +PFEIV+   P           D T + S      ++  KE
Subjt:  ELNFSTSFHPQSDGQTERVNALLEQYLRHYASAHQKDWADLLDIAQFSYNLQRSESTGKSPFEIVNWRQPNTP-------KDLTSDYSGPNPSAYKMAKE

Query:  WKEKQDIARACLYKAAKKMKKWADLK-RRPKKYNEGEQVMVKLLPNQFKSLRQVHKG--LVRKYEGPFPILERVGKAAYRVQLPARLK--IHDVFHVSML
             +I          KMKK+ D+K +  +++  G+ VMVK     F     +HK   L   + GPF +L++ G   Y + LP  +K      FHVS L
Subjt:  WKEKQDIARACLYKAAKKMKKWADLK-RRPKKYNEGEQVMVKLLPNQFKSLRQVHKG--LVRKYEGPFPILERVGKAAYRVQLPARLK--IHDVFHVSML

Query:  KPFHQDME
        + +  + E
Subjt:  KPFHQDME

P0CT35 Transposon Tf2-2 polyprotein6.6e-4434.42Show/hide
Query:  PAGLLEPLPVAEEPWESVTMDFIVALPKSRSCGSIMVVVDRFSKYATFIPCSANVKVDEAARLFFKNIVKLWGIPRSIISDRDPRFTGKFWRELFKLMGT
        P G L+P+P +E PWES++MDFI ALP+S    ++ VVVDRFSK A  +PC+ ++  ++ AR+F + ++  +G P+ II+D D  FT + W++       
Subjt:  PAGLLEPLPVAEEPWESVTMDFIVALPKSRSCGSIMVVVDRFSKYATFIPCSANVKVDEAARLFFKNIVKLWGIPRSIISDRDPRFTGKFWRELFKLMGT

Query:  ELNFSTSFHPQSDGQTERVNALLEQYLRHYASAHQKDWADLLDIAQFSYNLQRSESTGKSPFEIVNWRQPNTP-------KDLTSDYSGPNPSAYKMAKE
         + FS  + PQ+DGQTER N  +E+ LR   S H   W D + + Q SYN     +T  +PFEIV+   P           D T + S      ++  KE
Subjt:  ELNFSTSFHPQSDGQTERVNALLEQYLRHYASAHQKDWADLLDIAQFSYNLQRSESTGKSPFEIVNWRQPNTP-------KDLTSDYSGPNPSAYKMAKE

Query:  WKEKQDIARACLYKAAKKMKKWADLK-RRPKKYNEGEQVMVKLLPNQFKSLRQVHKG--LVRKYEGPFPILERVGKAAYRVQLPARLK--IHDVFHVSML
             +I          KMKK+ D+K +  +++  G+ VMVK     F     +HK   L   + GPF +L++ G   Y + LP  +K      FHVS L
Subjt:  WKEKQDIARACLYKAAKKMKKWADLK-RRPKKYNEGEQVMVKLLPNQFKSLRQVHKG--LVRKYEGPFPILERVGKAAYRVQLPARLK--IHDVFHVSML

Query:  KPFHQDME
        + +  + E
Subjt:  KPFHQDME

P0CT36 Transposon Tf2-3 polyprotein6.6e-4434.42Show/hide
Query:  PAGLLEPLPVAEEPWESVTMDFIVALPKSRSCGSIMVVVDRFSKYATFIPCSANVKVDEAARLFFKNIVKLWGIPRSIISDRDPRFTGKFWRELFKLMGT
        P G L+P+P +E PWES++MDFI ALP+S    ++ VVVDRFSK A  +PC+ ++  ++ AR+F + ++  +G P+ II+D D  FT + W++       
Subjt:  PAGLLEPLPVAEEPWESVTMDFIVALPKSRSCGSIMVVVDRFSKYATFIPCSANVKVDEAARLFFKNIVKLWGIPRSIISDRDPRFTGKFWRELFKLMGT

Query:  ELNFSTSFHPQSDGQTERVNALLEQYLRHYASAHQKDWADLLDIAQFSYNLQRSESTGKSPFEIVNWRQPNTP-------KDLTSDYSGPNPSAYKMAKE
         + FS  + PQ+DGQTER N  +E+ LR   S H   W D + + Q SYN     +T  +PFEIV+   P           D T + S      ++  KE
Subjt:  ELNFSTSFHPQSDGQTERVNALLEQYLRHYASAHQKDWADLLDIAQFSYNLQRSESTGKSPFEIVNWRQPNTP-------KDLTSDYSGPNPSAYKMAKE

Query:  WKEKQDIARACLYKAAKKMKKWADLK-RRPKKYNEGEQVMVKLLPNQFKSLRQVHKG--LVRKYEGPFPILERVGKAAYRVQLPARLK--IHDVFHVSML
             +I          KMKK+ D+K +  +++  G+ VMVK     F     +HK   L   + GPF +L++ G   Y + LP  +K      FHVS L
Subjt:  WKEKQDIARACLYKAAKKMKKWADLK-RRPKKYNEGEQVMVKLLPNQFKSLRQVHKG--LVRKYEGPFPILERVGKAAYRVQLPARLK--IHDVFHVSML

Query:  KPFHQDME
        + +  + E
Subjt:  KPFHQDME

P0CT41 Transposon Tf2-12 polyprotein6.6e-4434.42Show/hide
Query:  PAGLLEPLPVAEEPWESVTMDFIVALPKSRSCGSIMVVVDRFSKYATFIPCSANVKVDEAARLFFKNIVKLWGIPRSIISDRDPRFTGKFWRELFKLMGT
        P G L+P+P +E PWES++MDFI ALP+S    ++ VVVDRFSK A  +PC+ ++  ++ AR+F + ++  +G P+ II+D D  FT + W++       
Subjt:  PAGLLEPLPVAEEPWESVTMDFIVALPKSRSCGSIMVVVDRFSKYATFIPCSANVKVDEAARLFFKNIVKLWGIPRSIISDRDPRFTGKFWRELFKLMGT

Query:  ELNFSTSFHPQSDGQTERVNALLEQYLRHYASAHQKDWADLLDIAQFSYNLQRSESTGKSPFEIVNWRQPNTP-------KDLTSDYSGPNPSAYKMAKE
         + FS  + PQ+DGQTER N  +E+ LR   S H   W D + + Q SYN     +T  +PFEIV+   P           D T + S      ++  KE
Subjt:  ELNFSTSFHPQSDGQTERVNALLEQYLRHYASAHQKDWADLLDIAQFSYNLQRSESTGKSPFEIVNWRQPNTP-------KDLTSDYSGPNPSAYKMAKE

Query:  WKEKQDIARACLYKAAKKMKKWADLK-RRPKKYNEGEQVMVKLLPNQFKSLRQVHKG--LVRKYEGPFPILERVGKAAYRVQLPARLK--IHDVFHVSML
             +I          KMKK+ D+K +  +++  G+ VMVK     F     +HK   L   + GPF +L++ G   Y + LP  +K      FHVS L
Subjt:  WKEKQDIARACLYKAAKKMKKWADLK-RRPKKYNEGEQVMVKLLPNQFKSLRQVHKG--LVRKYEGPFPILERVGKAAYRVQLPARLK--IHDVFHVSML

Query:  KPFHQDME
        + +  + E
Subjt:  KPFHQDME

Q9UR07 Transposon Tf2-11 polyprotein6.6e-4434.42Show/hide
Query:  PAGLLEPLPVAEEPWESVTMDFIVALPKSRSCGSIMVVVDRFSKYATFIPCSANVKVDEAARLFFKNIVKLWGIPRSIISDRDPRFTGKFWRELFKLMGT
        P G L+P+P +E PWES++MDFI ALP+S    ++ VVVDRFSK A  +PC+ ++  ++ AR+F + ++  +G P+ II+D D  FT + W++       
Subjt:  PAGLLEPLPVAEEPWESVTMDFIVALPKSRSCGSIMVVVDRFSKYATFIPCSANVKVDEAARLFFKNIVKLWGIPRSIISDRDPRFTGKFWRELFKLMGT

Query:  ELNFSTSFHPQSDGQTERVNALLEQYLRHYASAHQKDWADLLDIAQFSYNLQRSESTGKSPFEIVNWRQPNTP-------KDLTSDYSGPNPSAYKMAKE
         + FS  + PQ+DGQTER N  +E+ LR   S H   W D + + Q SYN     +T  +PFEIV+   P           D T + S      ++  KE
Subjt:  ELNFSTSFHPQSDGQTERVNALLEQYLRHYASAHQKDWADLLDIAQFSYNLQRSESTGKSPFEIVNWRQPNTP-------KDLTSDYSGPNPSAYKMAKE

Query:  WKEKQDIARACLYKAAKKMKKWADLK-RRPKKYNEGEQVMVKLLPNQFKSLRQVHKG--LVRKYEGPFPILERVGKAAYRVQLPARLK--IHDVFHVSML
             +I          KMKK+ D+K +  +++  G+ VMVK     F     +HK   L   + GPF +L++ G   Y + LP  +K      FHVS L
Subjt:  WKEKQDIARACLYKAAKKMKKWADLK-RRPKKYNEGEQVMVKLLPNQFKSLRQVHKG--LVRKYEGPFPILERVGKAAYRVQLPARLK--IHDVFHVSML

Query:  KPFHQDME
        + +  + E
Subjt:  KPFHQDME

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGGCCCATCACATATGAAGCATGGTTTAGGGGGTTTGGAGGACTCACCTTTCCCTTTCCCGCGTTTATCTTGGGAAGATTTGTTCTTGTCTCGGCGGGTCTGAATCG
GACTTCTTTCCTTACGTTGCCAGCTGGTCTCCTAGAACCACTGCCTGTGGCGGAGGAACCTTGGGAGAGTGTCACTATGGATTTCATCGTGGCACTACCAAAATCCAGAA
GCTGCGGCAGCATAATGGTGGTTGTAGACCGATTCAGCAAATATGCGACCTTCATACCTTGCTCTGCAAATGTCAAGGTAGACGAAGCTGCTCGGTTGTTCTTCAAAAAC
ATTGTGAAACTGTGGGGGATTCCGCGAAGCATTATCAGTGACCGAGACCCACGGTTCACAGGCAAGTTCTGGAGGGAGCTATTCAAGTTGATGGGGACCGAGTTGAACTT
CTCAACCAGCTTCCATCCTCAAAGTGACGGGCAAACCGAAAGGGTGAACGCTCTCTTAGAGCAATATCTGAGGCACTATGCTAGTGCTCATCAGAAGGATTGGGCAGACC
TACTAGACATCGCCCAATTCTCCTACAATCTCCAAAGGAGCGAGTCGACAGGAAAAAGTCCTTTCGAAATTGTAAATTGGCGGCAACCTAACACGCCCAAGGACCTGACC
TCTGACTATTCTGGACCAAACCCTTCGGCATATAAGATGGCCAAGGAATGGAAAGAAAAACAAGATATTGCAAGGGCATGCCTCTACAAAGCTGCCAAGAAAATGAAGAA
ATGGGCTGACCTAAAGAGGCGCCCTAAGAAGTACAACGAAGGGGAACAGGTGATGGTTAAGCTATTGCCTAACCAGTTCAAATCTCTTCGACAAGTACACAAGGGGTTAG
TCAGGAAATACGAAGGCCCATTTCCCATTCTCGAAAGAGTTGGAAAGGCAGCTTATCGAGTACAGTTGCCCGCAAGACTCAAAATCCACGATGTCTTCCATGTGAGCATG
CTAAAGCCATTCCATCAAGACATGGAAGACCCTGACCGAGCCATCTCGAATAGGGCCCCAATGGGTGTGACAACTGAGTATGACAAGAATGTTGAAACCATTTTGGCCCA
CAGAAGGGTAAGTAGCAAGGGTGTCCCAAGTCACTTCGAATATCTTGTGAAATGGACAAGACTCCCAGACTCGGAGGCAAGCTGGGAAAAGGAAGGACACCTATGGCAAT
TCGCCGACAAGGTCTCAAACTATTGGCAGGAGGTCCGCGACGAGGGCGTCGCTAGCATCAGTGGGGGAGGTGCCCATCCCCTTAGCATGGCCAAGCAAGGATGGGGAAAC
CGCTTGCCGCAGCCCCTCGGGAGGCCCATGCGTCTGCCGCACCCCACCGGCACGCATGGCCTAAGGCGCACAATAATGGGCTGCGCGCATAAGGGAGGAAAGGCGAGAGG
TAGACGCGTCGACCCCAGGCGTTGGCATGGGCGACGCATCGCCAGGCAAGGCTGCCAGGCGCAAAGGAGCTTGACGCTAGGCTGCGCGACACCAGGATGCTCGACGCAGG
GCTGCTCGACGCACAGTGCCGCAAGCCATACCCGGGAGACGCATAGACCCGGACACAAGGATCCCGATCACCCGGGAAAAGTCAGGGCGCGCCTAGAGAGACCCGGACAC
GGCTGGAAGCCTCCAGAGTCGGCTGACACGCGCGGCCATGGCCTAGACAGACCCCGATGGCGCTGGAATGGCCTAGATGGTTCTCGAATGTTCCTCCCAATACCTTTGCC
GGTTAAAGTCGGTGTTGGGGCCCTCTATAAATAG
mRNA sequenceShow/hide mRNA sequence
ATGAGGCCCATCACATATGAAGCATGGTTTAGGGGGTTTGGAGGACTCACCTTTCCCTTTCCCGCGTTTATCTTGGGAAGATTTGTTCTTGTCTCGGCGGGTCTGAATCG
GACTTCTTTCCTTACGTTGCCAGCTGGTCTCCTAGAACCACTGCCTGTGGCGGAGGAACCTTGGGAGAGTGTCACTATGGATTTCATCGTGGCACTACCAAAATCCAGAA
GCTGCGGCAGCATAATGGTGGTTGTAGACCGATTCAGCAAATATGCGACCTTCATACCTTGCTCTGCAAATGTCAAGGTAGACGAAGCTGCTCGGTTGTTCTTCAAAAAC
ATTGTGAAACTGTGGGGGATTCCGCGAAGCATTATCAGTGACCGAGACCCACGGTTCACAGGCAAGTTCTGGAGGGAGCTATTCAAGTTGATGGGGACCGAGTTGAACTT
CTCAACCAGCTTCCATCCTCAAAGTGACGGGCAAACCGAAAGGGTGAACGCTCTCTTAGAGCAATATCTGAGGCACTATGCTAGTGCTCATCAGAAGGATTGGGCAGACC
TACTAGACATCGCCCAATTCTCCTACAATCTCCAAAGGAGCGAGTCGACAGGAAAAAGTCCTTTCGAAATTGTAAATTGGCGGCAACCTAACACGCCCAAGGACCTGACC
TCTGACTATTCTGGACCAAACCCTTCGGCATATAAGATGGCCAAGGAATGGAAAGAAAAACAAGATATTGCAAGGGCATGCCTCTACAAAGCTGCCAAGAAAATGAAGAA
ATGGGCTGACCTAAAGAGGCGCCCTAAGAAGTACAACGAAGGGGAACAGGTGATGGTTAAGCTATTGCCTAACCAGTTCAAATCTCTTCGACAAGTACACAAGGGGTTAG
TCAGGAAATACGAAGGCCCATTTCCCATTCTCGAAAGAGTTGGAAAGGCAGCTTATCGAGTACAGTTGCCCGCAAGACTCAAAATCCACGATGTCTTCCATGTGAGCATG
CTAAAGCCATTCCATCAAGACATGGAAGACCCTGACCGAGCCATCTCGAATAGGGCCCCAATGGGTGTGACAACTGAGTATGACAAGAATGTTGAAACCATTTTGGCCCA
CAGAAGGGTAAGTAGCAAGGGTGTCCCAAGTCACTTCGAATATCTTGTGAAATGGACAAGACTCCCAGACTCGGAGGCAAGCTGGGAAAAGGAAGGACACCTATGGCAAT
TCGCCGACAAGGTCTCAAACTATTGGCAGGAGGTCCGCGACGAGGGCGTCGCTAGCATCAGTGGGGGAGGTGCCCATCCCCTTAGCATGGCCAAGCAAGGATGGGGAAAC
CGCTTGCCGCAGCCCCTCGGGAGGCCCATGCGTCTGCCGCACCCCACCGGCACGCATGGCCTAAGGCGCACAATAATGGGCTGCGCGCATAAGGGAGGAAAGGCGAGAGG
TAGACGCGTCGACCCCAGGCGTTGGCATGGGCGACGCATCGCCAGGCAAGGCTGCCAGGCGCAAAGGAGCTTGACGCTAGGCTGCGCGACACCAGGATGCTCGACGCAGG
GCTGCTCGACGCACAGTGCCGCAAGCCATACCCGGGAGACGCATAGACCCGGACACAAGGATCCCGATCACCCGGGAAAAGTCAGGGCGCGCCTAGAGAGACCCGGACAC
GGCTGGAAGCCTCCAGAGTCGGCTGACACGCGCGGCCATGGCCTAGACAGACCCCGATGGCGCTGGAATGGCCTAGATGGTTCTCGAATGTTCCTCCCAATACCTTTGCC
GGTTAAAGTCGGTGTTGGGGCCCTCTATAAATAG
Protein sequenceShow/hide protein sequence
MRPITYEAWFRGFGGLTFPFPAFILGRFVLVSAGLNRTSFLTLPAGLLEPLPVAEEPWESVTMDFIVALPKSRSCGSIMVVVDRFSKYATFIPCSANVKVDEAARLFFKN
IVKLWGIPRSIISDRDPRFTGKFWRELFKLMGTELNFSTSFHPQSDGQTERVNALLEQYLRHYASAHQKDWADLLDIAQFSYNLQRSESTGKSPFEIVNWRQPNTPKDLT
SDYSGPNPSAYKMAKEWKEKQDIARACLYKAAKKMKKWADLKRRPKKYNEGEQVMVKLLPNQFKSLRQVHKGLVRKYEGPFPILERVGKAAYRVQLPARLKIHDVFHVSM
LKPFHQDMEDPDRAISNRAPMGVTTEYDKNVETILAHRRVSSKGVPSHFEYLVKWTRLPDSEASWEKEGHLWQFADKVSNYWQEVRDEGVASISGGGAHPLSMAKQGWGN
RLPQPLGRPMRLPHPTGTHGLRRTIMGCAHKGGKARGRRVDPRRWHGRRIARQGCQAQRSLTLGCATPGCSTQGCSTHSAASHTRETHRPGHKDPDHPGKVRARLERPGH
GWKPPESADTRGHGLDRPRWRWNGLDGSRMFLPIPLPVKVGVGALYK