; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cmc05g0129771 (gene) of Melon (Charmono) v1.1 genome

Gene IDCmc05g0129771
OrganismCucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
DescriptionReverse transcriptase
Genome locationCMiso1.1chr05:7914319..7916151
RNA-Seq ExpressionCmc05g0129771
SyntenyCmc05g0129771
Gene Ontology termsGO:0006278 - RNA-dependent DNA biosynthetic process (biological process)
GO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0003700 - DNA-binding transcription factor activity (molecular function)
GO:0003964 - RNA-directed DNA polymerase activity (molecular function)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
GO:0043565 - sequence-specific DNA binding (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR041577 - Reverse transcriptase/retrotransposon-derived protein, RNase H-like domain
IPR041588 - Integrase zinc-binding domain
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0040689.1 pol protein [Cucumis melo var. makuwa]1.3e-30989.67Show/hide
Query:  MVDTREVDVSLSSEPVVRDYLDVFPKELPGLPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRSSVSPWGVPVLFVKKKDGSMR
        +VDTREVDVSLSSEPVVRDY DVFP+ELPGLPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIR S+SPWG PVLFVKKKDGSMR
Subjt:  MVDTREVDVSLSSEPVVRDYLDVFPKELPGLPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRSSVSPWGVPVLFVKKKDGSMR

Query:  LCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSRYGHYEFIVMSFGLTNAPTVFMDLMNRVFREFLDTFVIV
        LCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKD DVPKTAFRSRYGHYEFIVMSFGLTNAP VFMDLMNRVFREFLDTFVIV
Subjt:  LCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSRYGHYEFIVMSFGLTNAPTVFMDLMNRVFREFLDTFVIV

Query:  FIDDILIYSKTEAEHKEHLRIVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIA
        FIDDILIYSKTEAEH+EHLR+VLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDP KIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIA
Subjt:  FIDDILIYSKTEAEHKEHLRIVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIA

Query:  TPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAV---------
        TPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVL VPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAV         
Subjt:  TPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAV---------

Query:  -------------------------------------DYDCEILYHPGKANVAADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTIQLAQLTVQ
                                             DYDCEILYHPGKANV ADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVT+QLAQLTVQ
Subjt:  -------------------------------------DYDCEILYHPGKANVAADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTIQLAQLTVQ

Query:  PTLRQRIIDAQSNDPYLVEKRGLAEAGQAVEFSISSDGGLLFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQNLKRVYWWRNMKREVAEFVSRC
        PTLRQRIIDAQSNDPYLVEKRGLAEAGQAVEFSISSDGGLLFERRLCVPSDSA+KTELLSEAHSSPFSMHPGSTKMYQ+LKRVYWWRNMKREVAEFVSRC
Subjt:  PTLRQRIIDAQSNDPYLVEKRGLAEAGQAVEFSISSDGGLLFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQNLKRVYWWRNMKREVAEFVSRC

Query:  LVCQQVKAQR
        LVCQQVKA R
Subjt:  LVCQQVKAQR

KAA0047433.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa]0.0e+0096.15Show/hide
Query:  MVDTREVDVSLSSEPVVRDYLDVFPKELPGLPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRSSVSPWGVPVLFVKKKDGSMR
        +VDTREVDVSLSSEPVVRDY DVFP+ELPGLPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIR SVSPWG PVLFVKKKDGSMR
Subjt:  MVDTREVDVSLSSEPVVRDYLDVFPKELPGLPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRSSVSPWGVPVLFVKKKDGSMR

Query:  LCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSRYGHYEFIVMSFGLTNAPTVFMDLMNRVFREFLDTFVIV
        LCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSRYGHYEFIVMSFGLTNAP VFMDLMNRVFREFLDTFVIV
Subjt:  LCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSRYGHYEFIVMSFGLTNAPTVFMDLMNRVFREFLDTFVIV

Query:  FIDDILIYSKTEAEHKEHLRIVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIA
        FIDDILIYSKTEAEH+EHLRIVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIA
Subjt:  FIDDILIYSKTEAEHKEHLRIVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIA

Query:  TPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAV--------D
        TPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAV        D
Subjt:  TPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAV--------D

Query:  YDCEILYHPGKANVAADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTIQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQAVEFSISSDG
        YDCEILYHPGKANV ADALSRKVSHSAALITRQAPLHRDLERAEIAVSVG VT+QLAQLTVQPTLRQRIIDAQ NDPYLVEKRGLAEAGQAVEFSISSDG
Subjt:  YDCEILYHPGKANVAADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTIQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQAVEFSISSDG

Query:  GLLFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQNLKRVYWWRNMKREVAEFVSRCLVCQQVKAQR
        GLLFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQ+LKRVYWWRNMKREVA+FVSRCLVCQQVKA R
Subjt:  GLLFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQNLKRVYWWRNMKREVAEFVSRCLVCQQVKAQR

KAA0048687.1 pol protein [Cucumis melo var. makuwa]3.0e-30989.51Show/hide
Query:  MVDTREVDVSLSSEPVVRDYLDVFPKELPGLPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRSSVSPWGVPVLFVKKKDGSMR
        +VDTRE DVSLSSEPVVRDY DVFP+ELPGLPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIR SVSPWG PVLFVKKKDGSMR
Subjt:  MVDTREVDVSLSSEPVVRDYLDVFPKELPGLPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRSSVSPWGVPVLFVKKKDGSMR

Query:  LCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSRYGHYEFIVMSFGLTNAPTVFMDLMNRVFREFLDTFVIV
        LCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKD DVPKTAFRSRYGHYEFIVMSFGLTNAP VFMDLMNRVFREFLDTFVIV
Subjt:  LCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSRYGHYEFIVMSFGLTNAPTVFMDLMNRVFREFLDTFVIV

Query:  FIDDILIYSKTEAEHKEHLRIVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIA
        FIDDILIYSKTEAEH+EHLR+VLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIA
Subjt:  FIDDILIYSKTEAEHKEHLRIVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIA

Query:  TPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAV---------
        TPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAV         
Subjt:  TPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAV---------

Query:  -------------------------------------DYDCEILYHPGKANVAADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTIQLAQLTVQ
                                             DYDCEILYHPGKANV ADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVT+QLAQLTVQ
Subjt:  -------------------------------------DYDCEILYHPGKANVAADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTIQLAQLTVQ

Query:  PTLRQRIIDAQSNDPYLVEKRGLAEAGQAVEFSISSDGGLLFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQNLKRVYWWRNMKREVAEFVSRC
        PTLRQRIIDAQSNDPYLVEKRGLAEAGQAVEFS+SSDGGLLFERRLCVPSDS VKTELLSEAHSSPFSMHPGSTKMY+++KRVYWWRNMKREVAEFVSRC
Subjt:  PTLRQRIIDAQSNDPYLVEKRGLAEAGQAVEFSISSDGGLLFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQNLKRVYWWRNMKREVAEFVSRC

Query:  LVCQQVKAQR
        LVCQQVKA R
Subjt:  LVCQQVKAQR

KAA0062245.1 pol protein [Cucumis melo var. makuwa]0.0e+0095.09Show/hide
Query:  MVDTREVDVSLSSEPVVRDYLDVFPKELPGLPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRSSVSPWGVPVLFVKKKDGSMR
        +VDTRE DVSLSSEPVVRDY DVFP+ELPGLP HREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIR SVSPWG PVLFVKKKDGSMR
Subjt:  MVDTREVDVSLSSEPVVRDYLDVFPKELPGLPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRSSVSPWGVPVLFVKKKDGSMR

Query:  LCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSRYGHYEFIVMSFGLTNAPTVFMDLMNRVFREFLDTFVIV
        LCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKD DVPKTAFRSRYGHYEFIVMSFGLTNAP VFMDLMNRVFREFLDTFVIV
Subjt:  LCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSRYGHYEFIVMSFGLTNAPTVFMDLMNRVFREFLDTFVIV

Query:  FIDDILIYSKTEAEHKEHLRIVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIA
        FIDDILIYSKTEAEH+EHLR+VLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPST+SEVRSFLGLAGYYRRFVENFSRIA
Subjt:  FIDDILIYSKTEAEHKEHLRIVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIA

Query:  TPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAV--------D
        TPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDA KKGLGCVLMQQGKVV YASRQLKSHEQNYPTHDLELAAV        D
Subjt:  TPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAV--------D

Query:  YDCEILYHPGKANVAADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTIQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQAVEFSISSDG
        YDCEILYHPGKANV ADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVT+QLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQA EFS+SSDG
Subjt:  YDCEILYHPGKANVAADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTIQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQAVEFSISSDG

Query:  GLLFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQNLKRVYWWRNMKREVAEFVSRCLVCQQVKA
        GLLFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQ+LKRVYWWRNMKREVAEFVS+CLVCQQVKA
Subjt:  GLLFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQNLKRVYWWRNMKREVAEFVSRCLVCQQVKA

TYK01613.1 pol protein [Cucumis melo var. makuwa]1.5e-30889.34Show/hide
Query:  MVDTREVDVSLSSEPVVRDYLDVFPKELPGLPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRSSVSPWGVPVLFVKKKDGSMR
        +VDTRE DVSLSSEPVVRDY DVFP+ELPGLPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIR SVSPWG PVLFVKKKDGSMR
Subjt:  MVDTREVDVSLSSEPVVRDYLDVFPKELPGLPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRSSVSPWGVPVLFVKKKDGSMR

Query:  LCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSRYGHYEFIVMSFGLTNAPTVFMDLMNRVFREFLDTFVIV
        LCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKD DVPKTAFRSRYGHYEFIVMSFGLTNAP VFMDLMNRVFREFLDTFVIV
Subjt:  LCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSRYGHYEFIVMSFGLTNAPTVFMDLMNRVFREFLDTFVIV

Query:  FIDDILIYSKTEAEHKEHLRIVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIA
        FIDDILIYSKTEAEH+EHLR+VLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIA
Subjt:  FIDDILIYSKTEAEHKEHLRIVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIA

Query:  TPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAV---------
        TPLTQLTRKGAPFVWSKACEDSFQ LKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAV         
Subjt:  TPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAV---------

Query:  -------------------------------------DYDCEILYHPGKANVAADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTIQLAQLTVQ
                                             DYDCEILYHPGKANV ADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVT+QLAQLTVQ
Subjt:  -------------------------------------DYDCEILYHPGKANVAADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTIQLAQLTVQ

Query:  PTLRQRIIDAQSNDPYLVEKRGLAEAGQAVEFSISSDGGLLFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQNLKRVYWWRNMKREVAEFVSRC
        PTLRQRIIDAQSNDPYLVEKRGLAEAGQ  EFS+SSDGGLLFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQ+LKRVYWWRNMKREVAEFVS+C
Subjt:  PTLRQRIIDAQSNDPYLVEKRGLAEAGQAVEFSISSDGGLLFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQNLKRVYWWRNMKREVAEFVSRC

Query:  LVCQQVKAQR
        LVCQQVKA R
Subjt:  LVCQQVKAQR

TrEMBL top hitse value%identityAlignment
A0A5A7THE6 Reverse transcriptase6.3e-31089.67Show/hide
Query:  MVDTREVDVSLSSEPVVRDYLDVFPKELPGLPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRSSVSPWGVPVLFVKKKDGSMR
        +VDTREVDVSLSSEPVVRDY DVFP+ELPGLPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIR S+SPWG PVLFVKKKDGSMR
Subjt:  MVDTREVDVSLSSEPVVRDYLDVFPKELPGLPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRSSVSPWGVPVLFVKKKDGSMR

Query:  LCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSRYGHYEFIVMSFGLTNAPTVFMDLMNRVFREFLDTFVIV
        LCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKD DVPKTAFRSRYGHYEFIVMSFGLTNAP VFMDLMNRVFREFLDTFVIV
Subjt:  LCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSRYGHYEFIVMSFGLTNAPTVFMDLMNRVFREFLDTFVIV

Query:  FIDDILIYSKTEAEHKEHLRIVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIA
        FIDDILIYSKTEAEH+EHLR+VLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDP KIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIA
Subjt:  FIDDILIYSKTEAEHKEHLRIVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIA

Query:  TPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAV---------
        TPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVL VPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAV         
Subjt:  TPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAV---------

Query:  -------------------------------------DYDCEILYHPGKANVAADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTIQLAQLTVQ
                                             DYDCEILYHPGKANV ADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVT+QLAQLTVQ
Subjt:  -------------------------------------DYDCEILYHPGKANVAADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTIQLAQLTVQ

Query:  PTLRQRIIDAQSNDPYLVEKRGLAEAGQAVEFSISSDGGLLFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQNLKRVYWWRNMKREVAEFVSRC
        PTLRQRIIDAQSNDPYLVEKRGLAEAGQAVEFSISSDGGLLFERRLCVPSDSA+KTELLSEAHSSPFSMHPGSTKMYQ+LKRVYWWRNMKREVAEFVSRC
Subjt:  PTLRQRIIDAQSNDPYLVEKRGLAEAGQAVEFSISSDGGLLFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQNLKRVYWWRNMKREVAEFVSRC

Query:  LVCQQVKAQR
        LVCQQVKA R
Subjt:  LVCQQVKAQR

A0A5A7TV57 Ty3-gypsy retrotransposon protein0.0e+0096.15Show/hide
Query:  MVDTREVDVSLSSEPVVRDYLDVFPKELPGLPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRSSVSPWGVPVLFVKKKDGSMR
        +VDTREVDVSLSSEPVVRDY DVFP+ELPGLPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIR SVSPWG PVLFVKKKDGSMR
Subjt:  MVDTREVDVSLSSEPVVRDYLDVFPKELPGLPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRSSVSPWGVPVLFVKKKDGSMR

Query:  LCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSRYGHYEFIVMSFGLTNAPTVFMDLMNRVFREFLDTFVIV
        LCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSRYGHYEFIVMSFGLTNAP VFMDLMNRVFREFLDTFVIV
Subjt:  LCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSRYGHYEFIVMSFGLTNAPTVFMDLMNRVFREFLDTFVIV

Query:  FIDDILIYSKTEAEHKEHLRIVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIA
        FIDDILIYSKTEAEH+EHLRIVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIA
Subjt:  FIDDILIYSKTEAEHKEHLRIVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIA

Query:  TPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAV--------D
        TPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAV        D
Subjt:  TPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAV--------D

Query:  YDCEILYHPGKANVAADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTIQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQAVEFSISSDG
        YDCEILYHPGKANV ADALSRKVSHSAALITRQAPLHRDLERAEIAVSVG VT+QLAQLTVQPTLRQRIIDAQ NDPYLVEKRGLAEAGQAVEFSISSDG
Subjt:  YDCEILYHPGKANVAADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTIQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQAVEFSISSDG

Query:  GLLFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQNLKRVYWWRNMKREVAEFVSRCLVCQQVKAQR
        GLLFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQ+LKRVYWWRNMKREVA+FVSRCLVCQQVKA R
Subjt:  GLLFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQNLKRVYWWRNMKREVAEFVSRCLVCQQVKAQR

A0A5A7U330 Reverse transcriptase1.5e-30989.51Show/hide
Query:  MVDTREVDVSLSSEPVVRDYLDVFPKELPGLPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRSSVSPWGVPVLFVKKKDGSMR
        +VDTRE DVSLSSEPVVRDY DVFP+ELPGLPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIR SVSPWG PVLFVKKKDGSMR
Subjt:  MVDTREVDVSLSSEPVVRDYLDVFPKELPGLPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRSSVSPWGVPVLFVKKKDGSMR

Query:  LCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSRYGHYEFIVMSFGLTNAPTVFMDLMNRVFREFLDTFVIV
        LCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKD DVPKTAFRSRYGHYEFIVMSFGLTNAP VFMDLMNRVFREFLDTFVIV
Subjt:  LCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSRYGHYEFIVMSFGLTNAPTVFMDLMNRVFREFLDTFVIV

Query:  FIDDILIYSKTEAEHKEHLRIVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIA
        FIDDILIYSKTEAEH+EHLR+VLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIA
Subjt:  FIDDILIYSKTEAEHKEHLRIVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIA

Query:  TPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAV---------
        TPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAV         
Subjt:  TPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAV---------

Query:  -------------------------------------DYDCEILYHPGKANVAADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTIQLAQLTVQ
                                             DYDCEILYHPGKANV ADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVT+QLAQLTVQ
Subjt:  -------------------------------------DYDCEILYHPGKANVAADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTIQLAQLTVQ

Query:  PTLRQRIIDAQSNDPYLVEKRGLAEAGQAVEFSISSDGGLLFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQNLKRVYWWRNMKREVAEFVSRC
        PTLRQRIIDAQSNDPYLVEKRGLAEAGQAVEFS+SSDGGLLFERRLCVPSDS VKTELLSEAHSSPFSMHPGSTKMY+++KRVYWWRNMKREVAEFVSRC
Subjt:  PTLRQRIIDAQSNDPYLVEKRGLAEAGQAVEFSISSDGGLLFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQNLKRVYWWRNMKREVAEFVSRC

Query:  LVCQQVKAQR
        LVCQQVKA R
Subjt:  LVCQQVKAQR

A0A5A7V8L8 Pol protein0.0e+0095.09Show/hide
Query:  MVDTREVDVSLSSEPVVRDYLDVFPKELPGLPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRSSVSPWGVPVLFVKKKDGSMR
        +VDTRE DVSLSSEPVVRDY DVFP+ELPGLP HREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIR SVSPWG PVLFVKKKDGSMR
Subjt:  MVDTREVDVSLSSEPVVRDYLDVFPKELPGLPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRSSVSPWGVPVLFVKKKDGSMR

Query:  LCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSRYGHYEFIVMSFGLTNAPTVFMDLMNRVFREFLDTFVIV
        LCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKD DVPKTAFRSRYGHYEFIVMSFGLTNAP VFMDLMNRVFREFLDTFVIV
Subjt:  LCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSRYGHYEFIVMSFGLTNAPTVFMDLMNRVFREFLDTFVIV

Query:  FIDDILIYSKTEAEHKEHLRIVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIA
        FIDDILIYSKTEAEH+EHLR+VLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPST+SEVRSFLGLAGYYRRFVENFSRIA
Subjt:  FIDDILIYSKTEAEHKEHLRIVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIA

Query:  TPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAV--------D
        TPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDA KKGLGCVLMQQGKVV YASRQLKSHEQNYPTHDLELAAV        D
Subjt:  TPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAV--------D

Query:  YDCEILYHPGKANVAADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTIQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQAVEFSISSDG
        YDCEILYHPGKANV ADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVT+QLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQA EFS+SSDG
Subjt:  YDCEILYHPGKANVAADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTIQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQAVEFSISSDG

Query:  GLLFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQNLKRVYWWRNMKREVAEFVSRCLVCQQVKA
        GLLFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQ+LKRVYWWRNMKREVAEFVS+CLVCQQVKA
Subjt:  GLLFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQNLKRVYWWRNMKREVAEFVSRCLVCQQVKA

A0A5D3BPI1 Reverse transcriptase7.1e-30989.34Show/hide
Query:  MVDTREVDVSLSSEPVVRDYLDVFPKELPGLPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRSSVSPWGVPVLFVKKKDGSMR
        +VDTRE DVSLSSEPVVRDY DVFP+ELPGLPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIR SVSPWG PVLFVKKKDGSMR
Subjt:  MVDTREVDVSLSSEPVVRDYLDVFPKELPGLPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRSSVSPWGVPVLFVKKKDGSMR

Query:  LCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSRYGHYEFIVMSFGLTNAPTVFMDLMNRVFREFLDTFVIV
        LCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKD DVPKTAFRSRYGHYEFIVMSFGLTNAP VFMDLMNRVFREFLDTFVIV
Subjt:  LCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSRYGHYEFIVMSFGLTNAPTVFMDLMNRVFREFLDTFVIV

Query:  FIDDILIYSKTEAEHKEHLRIVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIA
        FIDDILIYSKTEAEH+EHLR+VLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIA
Subjt:  FIDDILIYSKTEAEHKEHLRIVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIA

Query:  TPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAV---------
        TPLTQLTRKGAPFVWSKACEDSFQ LKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAV         
Subjt:  TPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAV---------

Query:  -------------------------------------DYDCEILYHPGKANVAADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTIQLAQLTVQ
                                             DYDCEILYHPGKANV ADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVT+QLAQLTVQ
Subjt:  -------------------------------------DYDCEILYHPGKANVAADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTIQLAQLTVQ

Query:  PTLRQRIIDAQSNDPYLVEKRGLAEAGQAVEFSISSDGGLLFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQNLKRVYWWRNMKREVAEFVSRC
        PTLRQRIIDAQSNDPYLVEKRGLAEAGQ  EFS+SSDGGLLFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQ+LKRVYWWRNMKREVAEFVS+C
Subjt:  PTLRQRIIDAQSNDPYLVEKRGLAEAGQAVEFSISSDGGLLFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQNLKRVYWWRNMKREVAEFVSRC

Query:  LVCQQVKAQR
        LVCQQVKA R
Subjt:  LVCQQVKAQR

SwissProt top hitse value%identityAlignment
P0CT34 Transposon Tf2-1 polyprotein3.3e-7729.25Show/hide
Query:  PHREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRSSVSPWGVPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGA
        P + +EF +EL      +    Y + P +++ +  ++ + L  G IR S +    PV+FV KK+G++R+ +DY+ LNK    N YPLP I+ L  ++QG+
Subjt:  PHREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRSSVSPWGVPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGA

Query:  TVFSKIDLRSGYHQLRIKDGDVPKTAFRSRYGHYEFIVMSFGLTNAPTVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHKEHLRIVLQTLRDNKLY
        T+F+K+DL+S YH +R++ GD  K AFR   G +E++VM +G++ AP  F   +N +  E  ++ V+ ++DDILI+SK+E+EH +H++ VLQ L++  L 
Subjt:  TVFSKIDLRSGYHQLRIKDGDVPKTAFRSRYGHYEFIVMSFGLTNAPTVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHKEHLRIVLQTLRDNKLY

Query:  AKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVT
           +KCEF   QV F+G+ +S+ G +     I+ V  W +P    E+R FLG   Y R+F+   S++  PL  L +K   + W+     + +N+KQ LV+
Subjt:  AKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVT

Query:  APVLTVPDGSGSFVIYSDASKKGLGCVLMQQGK-----VVAYASRQLKSHEQNYPTHDLELAAV------------------------------------
         PVL   D S   ++ +DAS   +G VL Q+        V Y S ++   + NY   D E+ A+                                    
Subjt:  APVLTVPDGSGSFVIYSDASKKGLGCVLMQQGK-----VVAYASRQLKSHEQNYPTHDLELAAV------------------------------------

Query:  --------------DYDCEILYHPGKANVAADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTIQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGL
                      D++ EI Y PG AN  ADALSR       ++    P+ +D E   I          + Q+++    + +++   +ND  L+    L
Subjt:  --------------DYDCEILYHPGKANVAADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTIQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGL

Query:  AEAGQAVEFSISSDGGLLFERR--LCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQNLKRVYWWRNMKREVAEFVSRCLVCQQVKAQ
            + VE +I    GLL   +  + +P+D+ +   ++ + H     +HPG   +   + R + W+ +++++ E+V  C  CQ  K++
Subjt:  AEAGQAVEFSISSDGGLLFERR--LCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQNLKRVYWWRNMKREVAEFVSRCLVCQQVKAQ

P0CT35 Transposon Tf2-2 polyprotein3.3e-7729.25Show/hide
Query:  PHREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRSSVSPWGVPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGA
        P + +EF +EL      +    Y + P +++ +  ++ + L  G IR S +    PV+FV KK+G++R+ +DY+ LNK    N YPLP I+ L  ++QG+
Subjt:  PHREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRSSVSPWGVPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGA

Query:  TVFSKIDLRSGYHQLRIKDGDVPKTAFRSRYGHYEFIVMSFGLTNAPTVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHKEHLRIVLQTLRDNKLY
        T+F+K+DL+S YH +R++ GD  K AFR   G +E++VM +G++ AP  F   +N +  E  ++ V+ ++DDILI+SK+E+EH +H++ VLQ L++  L 
Subjt:  TVFSKIDLRSGYHQLRIKDGDVPKTAFRSRYGHYEFIVMSFGLTNAPTVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHKEHLRIVLQTLRDNKLY

Query:  AKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVT
           +KCEF   QV F+G+ +S+ G +     I+ V  W +P    E+R FLG   Y R+F+   S++  PL  L +K   + W+     + +N+KQ LV+
Subjt:  AKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVT

Query:  APVLTVPDGSGSFVIYSDASKKGLGCVLMQQGK-----VVAYASRQLKSHEQNYPTHDLELAAV------------------------------------
         PVL   D S   ++ +DAS   +G VL Q+        V Y S ++   + NY   D E+ A+                                    
Subjt:  APVLTVPDGSGSFVIYSDASKKGLGCVLMQQGK-----VVAYASRQLKSHEQNYPTHDLELAAV------------------------------------

Query:  --------------DYDCEILYHPGKANVAADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTIQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGL
                      D++ EI Y PG AN  ADALSR       ++    P+ +D E   I          + Q+++    + +++   +ND  L+    L
Subjt:  --------------DYDCEILYHPGKANVAADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTIQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGL

Query:  AEAGQAVEFSISSDGGLLFERR--LCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQNLKRVYWWRNMKREVAEFVSRCLVCQQVKAQ
            + VE +I    GLL   +  + +P+D+ +   ++ + H     +HPG   +   + R + W+ +++++ E+V  C  CQ  K++
Subjt:  AEAGQAVEFSISSDGGLLFERR--LCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQNLKRVYWWRNMKREVAEFVSRCLVCQQVKAQ

P0CT36 Transposon Tf2-3 polyprotein3.3e-7729.25Show/hide
Query:  PHREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRSSVSPWGVPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGA
        P + +EF +EL      +    Y + P +++ +  ++ + L  G IR S +    PV+FV KK+G++R+ +DY+ LNK    N YPLP I+ L  ++QG+
Subjt:  PHREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRSSVSPWGVPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGA

Query:  TVFSKIDLRSGYHQLRIKDGDVPKTAFRSRYGHYEFIVMSFGLTNAPTVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHKEHLRIVLQTLRDNKLY
        T+F+K+DL+S YH +R++ GD  K AFR   G +E++VM +G++ AP  F   +N +  E  ++ V+ ++DDILI+SK+E+EH +H++ VLQ L++  L 
Subjt:  TVFSKIDLRSGYHQLRIKDGDVPKTAFRSRYGHYEFIVMSFGLTNAPTVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHKEHLRIVLQTLRDNKLY

Query:  AKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVT
           +KCEF   QV F+G+ +S+ G +     I+ V  W +P    E+R FLG   Y R+F+   S++  PL  L +K   + W+     + +N+KQ LV+
Subjt:  AKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVT

Query:  APVLTVPDGSGSFVIYSDASKKGLGCVLMQQGK-----VVAYASRQLKSHEQNYPTHDLELAAV------------------------------------
         PVL   D S   ++ +DAS   +G VL Q+        V Y S ++   + NY   D E+ A+                                    
Subjt:  APVLTVPDGSGSFVIYSDASKKGLGCVLMQQGK-----VVAYASRQLKSHEQNYPTHDLELAAV------------------------------------

Query:  --------------DYDCEILYHPGKANVAADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTIQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGL
                      D++ EI Y PG AN  ADALSR       ++    P+ +D E   I          + Q+++    + +++   +ND  L+    L
Subjt:  --------------DYDCEILYHPGKANVAADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTIQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGL

Query:  AEAGQAVEFSISSDGGLLFERR--LCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQNLKRVYWWRNMKREVAEFVSRCLVCQQVKAQ
            + VE +I    GLL   +  + +P+D+ +   ++ + H     +HPG   +   + R + W+ +++++ E+V  C  CQ  K++
Subjt:  AEAGQAVEFSISSDGGLLFERR--LCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQNLKRVYWWRNMKREVAEFVSRCLVCQQVKAQ

P0CT37 Transposon Tf2-4 polyprotein3.3e-7729.25Show/hide
Query:  PHREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRSSVSPWGVPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGA
        P + +EF +EL      +    Y + P +++ +  ++ + L  G IR S +    PV+FV KK+G++R+ +DY+ LNK    N YPLP I+ L  ++QG+
Subjt:  PHREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRSSVSPWGVPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGA

Query:  TVFSKIDLRSGYHQLRIKDGDVPKTAFRSRYGHYEFIVMSFGLTNAPTVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHKEHLRIVLQTLRDNKLY
        T+F+K+DL+S YH +R++ GD  K AFR   G +E++VM +G++ AP  F   +N +  E  ++ V+ ++DDILI+SK+E+EH +H++ VLQ L++  L 
Subjt:  TVFSKIDLRSGYHQLRIKDGDVPKTAFRSRYGHYEFIVMSFGLTNAPTVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHKEHLRIVLQTLRDNKLY

Query:  AKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVT
           +KCEF   QV F+G+ +S+ G +     I+ V  W +P    E+R FLG   Y R+F+   S++  PL  L +K   + W+     + +N+KQ LV+
Subjt:  AKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVT

Query:  APVLTVPDGSGSFVIYSDASKKGLGCVLMQQGK-----VVAYASRQLKSHEQNYPTHDLELAAV------------------------------------
         PVL   D S   ++ +DAS   +G VL Q+        V Y S ++   + NY   D E+ A+                                    
Subjt:  APVLTVPDGSGSFVIYSDASKKGLGCVLMQQGK-----VVAYASRQLKSHEQNYPTHDLELAAV------------------------------------

Query:  --------------DYDCEILYHPGKANVAADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTIQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGL
                      D++ EI Y PG AN  ADALSR       ++    P+ +D E   I          + Q+++    + +++   +ND  L+    L
Subjt:  --------------DYDCEILYHPGKANVAADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTIQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGL

Query:  AEAGQAVEFSISSDGGLLFERR--LCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQNLKRVYWWRNMKREVAEFVSRCLVCQQVKAQ
            + VE +I    GLL   +  + +P+D+ +   ++ + H     +HPG   +   + R + W+ +++++ E+V  C  CQ  K++
Subjt:  AEAGQAVEFSISSDGGLLFERR--LCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQNLKRVYWWRNMKREVAEFVSRCLVCQQVKAQ

P0CT41 Transposon Tf2-12 polyprotein3.3e-7729.25Show/hide
Query:  PHREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRSSVSPWGVPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGA
        P + +EF +EL      +    Y + P +++ +  ++ + L  G IR S +    PV+FV KK+G++R+ +DY+ LNK    N YPLP I+ L  ++QG+
Subjt:  PHREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRSSVSPWGVPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGA

Query:  TVFSKIDLRSGYHQLRIKDGDVPKTAFRSRYGHYEFIVMSFGLTNAPTVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHKEHLRIVLQTLRDNKLY
        T+F+K+DL+S YH +R++ GD  K AFR   G +E++VM +G++ AP  F   +N +  E  ++ V+ ++DDILI+SK+E+EH +H++ VLQ L++  L 
Subjt:  TVFSKIDLRSGYHQLRIKDGDVPKTAFRSRYGHYEFIVMSFGLTNAPTVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHKEHLRIVLQTLRDNKLY

Query:  AKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVT
           +KCEF   QV F+G+ +S+ G +     I+ V  W +P    E+R FLG   Y R+F+   S++  PL  L +K   + W+     + +N+KQ LV+
Subjt:  AKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVT

Query:  APVLTVPDGSGSFVIYSDASKKGLGCVLMQQGK-----VVAYASRQLKSHEQNYPTHDLELAAV------------------------------------
         PVL   D S   ++ +DAS   +G VL Q+        V Y S ++   + NY   D E+ A+                                    
Subjt:  APVLTVPDGSGSFVIYSDASKKGLGCVLMQQGK-----VVAYASRQLKSHEQNYPTHDLELAAV------------------------------------

Query:  --------------DYDCEILYHPGKANVAADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTIQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGL
                      D++ EI Y PG AN  ADALSR       ++    P+ +D E   I          + Q+++    + +++   +ND  L+    L
Subjt:  --------------DYDCEILYHPGKANVAADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTIQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGL

Query:  AEAGQAVEFSISSDGGLLFERR--LCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQNLKRVYWWRNMKREVAEFVSRCLVCQQVKAQ
            + VE +I    GLL   +  + +P+D+ +   ++ + H     +HPG   +   + R + W+ +++++ E+V  C  CQ  K++
Subjt:  AEAGQAVEFSISSDGGLLFERR--LCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQNLKRVYWWRNMKREVAEFVSRCLVCQQVKAQ

Arabidopsis top hitse value%identityAlignment
ATMG00860.1 DNA/RNA polymerases superfamily protein3.2e-2745.04Show/hide
Query:  HLRIVLQTLRDNKLYAKFSKCEFWLKQVSFLG--HVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVW
        HL +VLQ    ++ YA   KC F   Q+++LG  H++S  GVS DPAK+EA+ GW  P   +E+R FLGL GYYRRFV+N+ +I  PLT+L +K +   W
Subjt:  HLRIVLQTLRDNKLYAKFSKCEFWLKQVSFLG--HVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVW

Query:  SKACEDSFQNLKQKLVTAPVLTVPDGSGSFV
        ++    +F+ LK  + T PVL +PD    FV
Subjt:  SKACEDSFQNLKQKLVTAPVLTVPDGSGSFV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTGGATACTAGAGAGGTCGATGTATCCCTGTCATCAGAACCGGTGGTGAGGGACTATCTGGATGTCTTTCCTAAAGAACTTCCAGGGTTACCTCCTCACAGA
GAGGTTGAGTTTGCCATAGAGTTGGAGCCGGGCACGGTTCCTATATCCAGAGCCCCTTACAGAATGGCCCCCGCAGAGCTGAAAGAACTGAAGGTGCAGTTACAA
GAATTGCTTGATAAGGGATTCATTCGATCGAGCGTGTCACCTTGGGGTGTGCCAGTTTTATTTGTTAAGAAGAAGGATGGATCAATGCGTCTATGCATTGACTAT
AGGGAGTTGAACAAAGTAACCGTTAAGAACAGATATCCCTTGCCCAGGATCGACGATCTATTTGACCAGTTACAGGGAGCCACAGTGTTCTCTAAGATTGATCTT
CGGTCGGGATATCATCAGCTGAGGATTAAGGATGGTGATGTACCGAAGACAGCATTTCGTTCCAGATATGGACACTACGAGTTTATTGTGATGTCTTTTGGTTTG
ACGAACGCTCCGACAGTATTTATGGACTTGATGAACAGAGTGTTTAGGGAGTTCCTAGATACTTTTGTGATCGTGTTTATTGATGATATCTTGATATACTCCAAG
ACGGAGGCCGAACATAAGGAGCATTTACGCATAGTTTTGCAAACACTTCGGGATAATAAGTTGTATGCAAAGTTCTCGAAATGCGAGTTTTGGCTGAAGCAGGTG
TCCTTTCTGGGCCACGTGGTTTCTAAGGCTGGAGTCTCTGTGGATCCAGCTAAGATAGAGGCAGTCACCGGTTGGACCCGACCTTCCACAGTCAGTGAGGTTCGT
AGCTTTCTGGGTTTAGCAGGTTATTATCGACGGTTTGTGGAGAACTTTTCTCGTATAGCTACTCCTCTTACTCAGTTGACCAGGAAGGGAGCTCCTTTTGTTTGG
AGCAAGGCATGTGAGGACAGTTTCCAGAACCTTAAACAGAAGCTAGTTACCGCACCGGTTCTTACTGTACCTGATGGTTCTGGCAGTTTTGTGATTTATAGTGAT
GCTTCCAAGAAGGGTTTGGGTTGTGTTTTGATGCAACAAGGTAAGGTGGTCGCTTATGCTTCTCGTCAGTTGAAGAGTCATGAGCAGAACTACCCTACACATGAT
TTAGAGTTGGCAGCAGTGGATTACGATTGTGAGATACTGTATCATCCAGGCAAGGCAAATGTGGCAGCTGATGCTCTTAGTAGAAAGGTATCACATTCAGCAGCA
CTTATTACCCGACAGGCCCCATTGCATCGAGATCTTGAGCGGGCTGAGATTGCAGTGTCAGTGGGGGCAGTCACTATACAGTTAGCCCAGTTGACGGTACAGCCG
ACTTTGAGGCAAAGGATCATTGATGCTCAGAGTAACGATCCTTATTTGGTTGAGAAACGTGGCCTAGCAGAGGCAGGGCAAGCGGTTGAGTTCTCCATATCCTCT
GATGGTGGACTTTTGTTTGAGAGACGCCTCTGTGTGCCGTCAGATAGTGCGGTTAAGACAGAATTATTATCTGAGGCTCACAGTTCCCCATTTTCCATGCACCCA
GGTAGTACGAAGATGTATCAGAACCTGAAGCGGGTTTATTGGTGGCGTAACATGAAGAGGGAGGTAGCAGAATTTGTTAGTAGATGCTTGGTGTGTCAGCAGGTT
AAGGCACAAAGGTAG
mRNA sequenceShow/hide mRNA sequence
ATGGTGGATACTAGAGAGGTCGATGTATCCCTGTCATCAGAACCGGTGGTGAGGGACTATCTGGATGTCTTTCCTAAAGAACTTCCAGGGTTACCTCCTCACAGA
GAGGTTGAGTTTGCCATAGAGTTGGAGCCGGGCACGGTTCCTATATCCAGAGCCCCTTACAGAATGGCCCCCGCAGAGCTGAAAGAACTGAAGGTGCAGTTACAA
GAATTGCTTGATAAGGGATTCATTCGATCGAGCGTGTCACCTTGGGGTGTGCCAGTTTTATTTGTTAAGAAGAAGGATGGATCAATGCGTCTATGCATTGACTAT
AGGGAGTTGAACAAAGTAACCGTTAAGAACAGATATCCCTTGCCCAGGATCGACGATCTATTTGACCAGTTACAGGGAGCCACAGTGTTCTCTAAGATTGATCTT
CGGTCGGGATATCATCAGCTGAGGATTAAGGATGGTGATGTACCGAAGACAGCATTTCGTTCCAGATATGGACACTACGAGTTTATTGTGATGTCTTTTGGTTTG
ACGAACGCTCCGACAGTATTTATGGACTTGATGAACAGAGTGTTTAGGGAGTTCCTAGATACTTTTGTGATCGTGTTTATTGATGATATCTTGATATACTCCAAG
ACGGAGGCCGAACATAAGGAGCATTTACGCATAGTTTTGCAAACACTTCGGGATAATAAGTTGTATGCAAAGTTCTCGAAATGCGAGTTTTGGCTGAAGCAGGTG
TCCTTTCTGGGCCACGTGGTTTCTAAGGCTGGAGTCTCTGTGGATCCAGCTAAGATAGAGGCAGTCACCGGTTGGACCCGACCTTCCACAGTCAGTGAGGTTCGT
AGCTTTCTGGGTTTAGCAGGTTATTATCGACGGTTTGTGGAGAACTTTTCTCGTATAGCTACTCCTCTTACTCAGTTGACCAGGAAGGGAGCTCCTTTTGTTTGG
AGCAAGGCATGTGAGGACAGTTTCCAGAACCTTAAACAGAAGCTAGTTACCGCACCGGTTCTTACTGTACCTGATGGTTCTGGCAGTTTTGTGATTTATAGTGAT
GCTTCCAAGAAGGGTTTGGGTTGTGTTTTGATGCAACAAGGTAAGGTGGTCGCTTATGCTTCTCGTCAGTTGAAGAGTCATGAGCAGAACTACCCTACACATGAT
TTAGAGTTGGCAGCAGTGGATTACGATTGTGAGATACTGTATCATCCAGGCAAGGCAAATGTGGCAGCTGATGCTCTTAGTAGAAAGGTATCACATTCAGCAGCA
CTTATTACCCGACAGGCCCCATTGCATCGAGATCTTGAGCGGGCTGAGATTGCAGTGTCAGTGGGGGCAGTCACTATACAGTTAGCCCAGTTGACGGTACAGCCG
ACTTTGAGGCAAAGGATCATTGATGCTCAGAGTAACGATCCTTATTTGGTTGAGAAACGTGGCCTAGCAGAGGCAGGGCAAGCGGTTGAGTTCTCCATATCCTCT
GATGGTGGACTTTTGTTTGAGAGACGCCTCTGTGTGCCGTCAGATAGTGCGGTTAAGACAGAATTATTATCTGAGGCTCACAGTTCCCCATTTTCCATGCACCCA
GGTAGTACGAAGATGTATCAGAACCTGAAGCGGGTTTATTGGTGGCGTAACATGAAGAGGGAGGTAGCAGAATTTGTTAGTAGATGCTTGGTGTGTCAGCAGGTT
AAGGCACAAAGGTAG
Protein sequenceShow/hide protein sequence
MVDTREVDVSLSSEPVVRDYLDVFPKELPGLPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRSSVSPWGVPVLFVKKKDGSMRLCIDY
RELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSRYGHYEFIVMSFGLTNAPTVFMDLMNRVFREFLDTFVIVFIDDILIYSK
TEAEHKEHLRIVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVW
SKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVDYDCEILYHPGKANVAADALSRKVSHSAA
LITRQAPLHRDLERAEIAVSVGAVTIQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQAVEFSISSDGGLLFERRLCVPSDSAVKTELLSEAHSSPFSMHP
GSTKMYQNLKRVYWWRNMKREVAEFVSRCLVCQQVKAQR