; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cmc04g0096871 (gene) of Melon (Charmono) v1.1 genome

Gene IDCmc04g0096871
OrganismCucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
DescriptionReverse transcriptase
Genome locationCMiso1.1chr04:11457730..11458901
RNA-Seq ExpressionCmc04g0096871
SyntenyCmc04g0096871
Gene Ontology termsGO:0006278 - RNA-dependent DNA biosynthetic process (biological process)
GO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0016020 - membrane (cellular component)
GO:0003700 - DNA-binding transcription factor activity (molecular function)
GO:0003964 - RNA-directed DNA polymerase activity (molecular function)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
GO:0043565 - sequence-specific DNA binding (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR041373 - Reverse transcriptase, RNase H-like domain
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0040689.1 pol protein [Cucumis melo var. makuwa]7.7e-19791.28Show/hide
Query:  MSFGLTNAPVVFMDLMNKVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRIVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSEAGVSVDPAKIEAVTGW
        MSFGLTNAP VFMDLMN+VFREFLDTFVIVFIDDILIYSKTEAEHEEHLR+VLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVS+AGVSVDP KIEAVTGW
Subjt:  MSFGLTNAPVVFMDLMNKVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRIVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSEAGVSVDPAKIEAVTGW

Query:  TRPSTVSEVRSFLGLA-------------------VDHKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYA
        TRPSTVSEVRSFLGLA                   +  KGAPFVWSKACEDSFQNLKQKLVTAPVL VPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYA
Subjt:  TRPSTVSEVRSFLGLA-------------------VDHKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYA

Query:  SRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITR
        SRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITR
Subjt:  SRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITR

Query:  QAPLHRDLERAEVAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLVEAGQAEGFSISSDGGLVFERRLCVPSDSVIKTELLS
        QAPLHRDLERAE+AVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGL EAGQA  FSISSDGGL+FERRLCVPSDS IKTELLS
Subjt:  QAPLHRDLERAEVAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLVEAGQAEGFSISSDGGLVFERRLCVPSDSVIKTELLS

KAA0048687.1 pol protein [Cucumis melo var. makuwa]4.1e-19891.54Show/hide
Query:  MSFGLTNAPVVFMDLMNKVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRIVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSEAGVSVDPAKIEAVTGW
        MSFGLTNAP VFMDLMN+VFREFLDTFVIVFIDDILIYSKTEAEHEEHLR+VLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVS+AGVSVDPAKIEAVTGW
Subjt:  MSFGLTNAPVVFMDLMNKVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRIVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSEAGVSVDPAKIEAVTGW

Query:  TRPSTVSEVRSFLGLA-------------------VDHKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYA
        TRPSTVSEVRSFLGLA                   +  KGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYA
Subjt:  TRPSTVSEVRSFLGLA-------------------VDHKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYA

Query:  SRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITR
        SRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITR
Subjt:  SRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITR

Query:  QAPLHRDLERAEVAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLVEAGQAEGFSISSDGGLVFERRLCVPSDSVIKTELLS
        QAPLHRDLERAE+AVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGL EAGQA  FS+SSDGGL+FERRLCVPSDSV+KTELLS
Subjt:  QAPLHRDLERAEVAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLVEAGQAEGFSISSDGGLVFERRLCVPSDSVIKTELLS

KAA0050009.1 pol protein [Cucumis melo var. makuwa]2.0e-19791.03Show/hide
Query:  MSFGLTNAPVVFMDLMNKVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRIVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSEAGVSVDPAKIEAVTGW
        MSF LTNAP VFMDLMN+VFREFLDTFVIVFIDDILIYSKTEAEHEEHLRIVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVS+AGVSVDPAKIEAVTGW
Subjt:  MSFGLTNAPVVFMDLMNKVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRIVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSEAGVSVDPAKIEAVTGW

Query:  TRPSTVSEVRSFLGLA-------------------VDHKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYA
        TRPSTVSEVRSFLGLA                   +  KGAPFVWSKACEDSFQNLKQKL+TAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYA
Subjt:  TRPSTVSEVRSFLGLA-------------------VDHKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYA

Query:  SRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITR
        SRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITR
Subjt:  SRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITR

Query:  QAPLHRDLERAEVAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLVEAGQAEGFSISSDGGLVFERRLCVPSDSVIKTELLS
        QAPLHRDLERAE+AVSVGAVT+QLAQLTVQPTLRQRIIDAQSNDPYLVEKRGL EAGQA GFSISS+GGL+FERRLCVPSDS +KTELLS
Subjt:  QAPLHRDLERAEVAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLVEAGQAEGFSISSDGGLVFERRLCVPSDSVIKTELLS

KAA0057672.1 pol protein [Cucumis melo var. makuwa]2.6e-19791.03Show/hide
Query:  MSFGLTNAPVVFMDLMNKVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRIVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSEAGVSVDPAKIEAVTGW
        MSFGLTNAP VFMDLMN+VFREFLDTFVIVFIDDILIYSKTEAEHEEHLR+VLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVS+AGVSVDPAKIEAVTGW
Subjt:  MSFGLTNAPVVFMDLMNKVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRIVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSEAGVSVDPAKIEAVTGW

Query:  TRPSTVSEVRSFLGLA-------------------VDHKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYA
        TRPSTVSEVRSFLGLA                   +  KGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYA
Subjt:  TRPSTVSEVRSFLGLA-------------------VDHKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYA

Query:  SRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITR
        SRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITR
Subjt:  SRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITR

Query:  QAPLHRDLERAEVAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLVEAGQAEGFSISSDGGLVFERRLCVPSDSVIKTELLS
        QAPLHRDLERAE+AVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGL EAGQA  FS+SSDGGL+FERRLCVPSDS +KTELL+
Subjt:  QAPLHRDLERAEVAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLVEAGQAEGFSISSDGGLVFERRLCVPSDSVIKTELLS

TYK01613.1 pol protein [Cucumis melo var. makuwa]1.3e-19690.77Show/hide
Query:  MSFGLTNAPVVFMDLMNKVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRIVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSEAGVSVDPAKIEAVTGW
        MSFGLTNAP VFMDLMN+VFREFLDTFVIVFIDDILIYSKTEAEHEEHLR+VLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVS+AGVSVDPAKIEAVTGW
Subjt:  MSFGLTNAPVVFMDLMNKVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRIVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSEAGVSVDPAKIEAVTGW

Query:  TRPSTVSEVRSFLGLA-------------------VDHKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYA
        TRPSTVSEVRSFLGLA                   +  KGAPFVWSKACEDSFQ LKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYA
Subjt:  TRPSTVSEVRSFLGLA-------------------VDHKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYA

Query:  SRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITR
        SRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITR
Subjt:  SRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITR

Query:  QAPLHRDLERAEVAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLVEAGQAEGFSISSDGGLVFERRLCVPSDSVIKTELLS
        QAPLHRDLERAE+AVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGL EAGQ   FS+SSDGGL+FERRLCVPSDS +KTELLS
Subjt:  QAPLHRDLERAEVAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLVEAGQAEGFSISSDGGLVFERRLCVPSDSVIKTELLS

TrEMBL top hitse value%identityAlignment
A0A5A7THE6 Reverse transcriptase3.7e-19791.28Show/hide
Query:  MSFGLTNAPVVFMDLMNKVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRIVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSEAGVSVDPAKIEAVTGW
        MSFGLTNAP VFMDLMN+VFREFLDTFVIVFIDDILIYSKTEAEHEEHLR+VLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVS+AGVSVDP KIEAVTGW
Subjt:  MSFGLTNAPVVFMDLMNKVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRIVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSEAGVSVDPAKIEAVTGW

Query:  TRPSTVSEVRSFLGLA-------------------VDHKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYA
        TRPSTVSEVRSFLGLA                   +  KGAPFVWSKACEDSFQNLKQKLVTAPVL VPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYA
Subjt:  TRPSTVSEVRSFLGLA-------------------VDHKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYA

Query:  SRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITR
        SRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITR
Subjt:  SRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITR

Query:  QAPLHRDLERAEVAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLVEAGQAEGFSISSDGGLVFERRLCVPSDSVIKTELLS
        QAPLHRDLERAE+AVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGL EAGQA  FSISSDGGL+FERRLCVPSDS IKTELLS
Subjt:  QAPLHRDLERAEVAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLVEAGQAEGFSISSDGGLVFERRLCVPSDSVIKTELLS

A0A5A7TSQ8 Reverse transcriptase6.3e-19790.51Show/hide
Query:  MSFGLTNAPVVFMDLMNKVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRIVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSEAGVSVDPAKIEAVTGW
        MSFGLTNAP VFM+LMN+VFREFLDTFVIVFIDDILIYSKTEAEHEEHLR+VLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVS+AGVSVDPAKIEAVTGW
Subjt:  MSFGLTNAPVVFMDLMNKVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRIVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSEAGVSVDPAKIEAVTGW

Query:  TRPSTVSEVRSFLGLA-------------------VDHKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYA
        TRPSTVSEVRSFLGLA                   +  KGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYA
Subjt:  TRPSTVSEVRSFLGLA-------------------VDHKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYA

Query:  SRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITR
        SRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITR
Subjt:  SRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITR

Query:  QAPLHRDLERAEVAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLVEAGQAEGFSISSDGGLVFERRLCVPSDSVIKTELLS
        QAPLHRDLER E+AVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGL EAGQ + FS+SSDGGL+FERRLCVPSDS +KTELLS
Subjt:  QAPLHRDLERAEVAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLVEAGQAEGFSISSDGGLVFERRLCVPSDSVIKTELLS

A0A5A7U330 Reverse transcriptase2.0e-19891.54Show/hide
Query:  MSFGLTNAPVVFMDLMNKVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRIVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSEAGVSVDPAKIEAVTGW
        MSFGLTNAP VFMDLMN+VFREFLDTFVIVFIDDILIYSKTEAEHEEHLR+VLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVS+AGVSVDPAKIEAVTGW
Subjt:  MSFGLTNAPVVFMDLMNKVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRIVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSEAGVSVDPAKIEAVTGW

Query:  TRPSTVSEVRSFLGLA-------------------VDHKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYA
        TRPSTVSEVRSFLGLA                   +  KGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYA
Subjt:  TRPSTVSEVRSFLGLA-------------------VDHKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYA

Query:  SRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITR
        SRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITR
Subjt:  SRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITR

Query:  QAPLHRDLERAEVAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLVEAGQAEGFSISSDGGLVFERRLCVPSDSVIKTELLS
        QAPLHRDLERAE+AVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGL EAGQA  FS+SSDGGL+FERRLCVPSDSV+KTELLS
Subjt:  QAPLHRDLERAEVAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLVEAGQAEGFSISSDGGLVFERRLCVPSDSVIKTELLS

A0A5A7U943 Pol protein9.7e-19891.03Show/hide
Query:  MSFGLTNAPVVFMDLMNKVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRIVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSEAGVSVDPAKIEAVTGW
        MSF LTNAP VFMDLMN+VFREFLDTFVIVFIDDILIYSKTEAEHEEHLRIVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVS+AGVSVDPAKIEAVTGW
Subjt:  MSFGLTNAPVVFMDLMNKVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRIVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSEAGVSVDPAKIEAVTGW

Query:  TRPSTVSEVRSFLGLA-------------------VDHKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYA
        TRPSTVSEVRSFLGLA                   +  KGAPFVWSKACEDSFQNLKQKL+TAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYA
Subjt:  TRPSTVSEVRSFLGLA-------------------VDHKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYA

Query:  SRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITR
        SRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITR
Subjt:  SRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITR

Query:  QAPLHRDLERAEVAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLVEAGQAEGFSISSDGGLVFERRLCVPSDSVIKTELLS
        QAPLHRDLERAE+AVSVGAVT+QLAQLTVQPTLRQRIIDAQSNDPYLVEKRGL EAGQA GFSISS+GGL+FERRLCVPSDS +KTELLS
Subjt:  QAPLHRDLERAEVAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLVEAGQAEGFSISSDGGLVFERRLCVPSDSVIKTELLS

A0A5A7UP94 Pol protein1.3e-19791.03Show/hide
Query:  MSFGLTNAPVVFMDLMNKVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRIVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSEAGVSVDPAKIEAVTGW
        MSFGLTNAP VFMDLMN+VFREFLDTFVIVFIDDILIYSKTEAEHEEHLR+VLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVS+AGVSVDPAKIEAVTGW
Subjt:  MSFGLTNAPVVFMDLMNKVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRIVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSEAGVSVDPAKIEAVTGW

Query:  TRPSTVSEVRSFLGLA-------------------VDHKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYA
        TRPSTVSEVRSFLGLA                   +  KGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYA
Subjt:  TRPSTVSEVRSFLGLA-------------------VDHKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYA

Query:  SRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITR
        SRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITR
Subjt:  SRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITR

Query:  QAPLHRDLERAEVAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLVEAGQAEGFSISSDGGLVFERRLCVPSDSVIKTELLS
        QAPLHRDLERAE+AVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGL EAGQA  FS+SSDGGL+FERRLCVPSDS +KTELL+
Subjt:  QAPLHRDLERAEVAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLVEAGQAEGFSISSDGGLVFERRLCVPSDSVIKTELLS

SwissProt top hitse value%identityAlignment
P04323 Retrovirus-related Pol polyprotein from transposon 17.66.8e-4735.86Show/hide
Query:  MSFGLTNAPVVFMDLMNKVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRIVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSEAGVSVDPAKIEAVTGW
        M FGL NAP  F   MN + R  L+   +V++DDI+++S +  EH + L +V + L    L  +  KCEF  ++ +FLGHV++  G+  +P KIEA+  +
Subjt:  MSFGLTNAPVVFMDLMNKVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRIVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSEAGVSVDPAKIEAVTGW

Query:  TRPSTVSEVRSFLGLAVDHK---------GAPF-----------VWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAY
          P+   E+++FLGL   ++           P              +   + +F+ LK  +   P+L VPD +  F + +DAS   LG VL Q G  ++Y
Subjt:  TRPSTVSEVRSFLGLAVDHK---------GAPF-----------VWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAY

Query:  ASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSR
         SR L  HE NY T + EL A+V+A K +RHYL G   +I +DH+ L + +  K+ N +  RW   + ++D +I Y  GK N VADALSR
Subjt:  ASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSR

P0CT41 Transposon Tf2-12 polyprotein1.1e-4129.32Show/hide
Query:  MSFGLTNAPVVFMDLMNKVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRIVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSEAGVSVDPAKIEAVTGW
        M +G++ AP  F   +N +  E  ++ V+ ++DDILI+SK+E+EH +H++ VLQ L++  L    +KCEF   QV F+G+ +SE G +     I+ V  W
Subjt:  MSFGLTNAPVVFMDLMNKVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRIVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSEAGVSVDPAKIEAVTGW

Query:  TRPSTVSEVRSFLGLA------------VDH-------KGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGK-----
         +P    E+R FLG              + H       K   + W+     + +N+KQ LV+ PVL   D S   ++ +DAS   +G VL Q+       
Subjt:  TRPSTVSEVRSFLGLA------------VDH-------KGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGK-----

Query:  VVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYG--EKIQIFTDHKSLKYFFTQKE--LNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKV
         V Y S ++   + NY   D E+ A++ +LK WRHYL    E  +I TDH++L    T +    N R  RW   ++D++ EI Y PG AN +ADALSR  
Subjt:  VVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYG--EKIQIFTDHKSLKYFFTQKE--LNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKV

Query:  SHSAALITRQAPLHRDLERAEVAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLVEAGQAEGFSISSDGGLVFER-RLCVPSDSVIKTELL
             ++    P+ +D E   +          + Q+++    + +++   +ND  L+      +    E   +  DG L+  + ++ +P+D+ +   ++
Subjt:  SHSAALITRQAPLHRDLERAEVAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLVEAGQAEGFSISSDGGLVFER-RLCVPSDSVIKTELL

P10401 Retrovirus-related Pol polyprotein from transposon gypsy2.4e-4433.99Show/hide
Query:  MSFGLTNAPVVFMDLMNKVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRIVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSEAGVSVDPAKIEAVTGW
        + FGL NA  +F   ++ V RE +     V++DD++I+S+ E++H  H+  VL+ L D  +     K  F+ + V +LG +VS+ G   DP K++A+  +
Subjt:  MSFGLTNAPVVFMDLMNKVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRIVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSEAGVSVDPAKIEAVTGW

Query:  TRPSTVSEVRSFLGLAVDH------------------------------KGAPFVWSKACEDSFQNLKQKLVTAPV-LTVPDGSGSFVIYSDASKKGLGC
          P  V +VRSFLGLA  +                              K  P  +++   ++FQ L+  L +  V L  PD    F + +DAS  G+G 
Subjt:  TRPSTVSEVRSFLGLAVDH------------------------------KGAPFVWSKACEDSFQNLKQKLVTAPV-LTVPDGSGSFVIYSDASKKGLGC

Query:  VLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEK-IQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADAL
        VL Q+G+ +   SR LK  EQNY T++ EL A+V+AL   +++LYG + I IFTDH+ L +    +  N + +RW   +  ++ ++ Y PGK N VADAL
Subjt:  VLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEK-IQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADAL

Query:  SRK
        SR+
Subjt:  SRK

P20825 Retrovirus-related Pol polyprotein from transposon 2976.8e-4735.86Show/hide
Query:  MSFGLTNAPVVFMDLMNKVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRIVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSEAGVSVDPAKIEAVTGW
        M FGL NAP  F   MN + R  L+   +V++DDI+I+S +  EH   +++V   L D  L  +  KCEF  K+ +FLGH+V+  G+  +P K++A+  +
Subjt:  MSFGLTNAPVVFMDLMNKVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRIVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSEAGVSVDPAKIEAVTGW

Query:  TRPSTVSEVRSFLGLAVDH------------------KGAPFVWSKACE--DSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAY
          P+   E+R+FLGL   +                  K    + ++  E  ++F+ LK  ++  P+L +PD    FV+ +DAS   LG VL Q G  +++
Subjt:  TRPSTVSEVRSFLGLAVDH------------------KGAPFVWSKACE--DSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAY

Query:  ASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSR
         SR L  HE NY   + EL A+V+A K +RHYL G +  I +DH+ L++    KE   +  RW   + +Y  +I Y  GK N VADALSR
Subjt:  ASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSR

Q8I7P9 Retrovirus-related Pol polyprotein from transposon opus2.7e-4332.13Show/hide
Query:  MSFGLTNAPVVFMDLMNKVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRIVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSEAGVSVDPAKIEAVTGW
        + FGL NAP +F  +++ + RE +     V+IDDI+++S+    H ++LR+VL +L    L     K  F   QV FLG++V+  G+  DP K+ A++  
Subjt:  MSFGLTNAPVVFMDLMNKVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRIVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSEAGVSVDPAKIEAVTGW

Query:  TRPSTVSEVRSFLGLAVDHK------------------------------GAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCV
          P++V E++ FLG+   ++                                P    +    SF +LK  L ++ +L  P  +  F + +DAS   +G V
Subjt:  TRPSTVSEVRSFLGLAVDHK------------------------------GAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCV

Query:  LMQ----QGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGE-KIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVA
        L Q    + + +AY SR L   E+NY T + E+ A++++L   R YLYG   I+++TDH+ L +    +  N + +RW   +++Y+CE++Y PGK+NVVA
Subjt:  LMQ----QGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGE-KIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVA

Query:  DALSR
        DALSR
Subjt:  DALSR

Arabidopsis top hitse value%identityAlignment
ATMG00860.1 DNA/RNA polymerases superfamily protein2.4e-1535.38Show/hide
Query:  HLRIVLQTLRDNKLYAKFSKCEFWLKQVSFLG--HVVSEAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAVDH------------------KGAPFVWS
        HL +VLQ    ++ YA   KC F   Q+++LG  H++S  GVS DPAK+EA+ GW  P   +E+R FLGL   +                  K     W+
Subjt:  HLRIVLQTLRDNKLYAKFSKCEFWLKQVSFLG--HVVSEAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAVDH------------------KGAPFVWS

Query:  KACEDSFQNLKQKLVTAPVLTVPDGSGSFV
        +    +F+ LK  + T PVL +PD    FV
Subjt:  KACEDSFQNLKQKLVTAPVLTVPDGSGSFV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCTTTTGGTTTGACGAATGCTCCGGTAGTGTTTATGGACTTGATGAACAAAGTGTTTAGGGAGTTCCTAGATACTTTTGTGATCGTGTTTATCGACGATATCTTGAT
ATACTCCAAGACGGAGGCCGAACATGAGGAGCATTTACGTATAGTTCTGCAAACACTTCGGGATAATAAGTTGTATGCAAAGTTCTCGAAATGCGAGTTTTGGCTGAAGC
AGGTGTCTTTTCTGGGCCACGTGGTTTCTGAGGCTGGAGTCTCTGTGGATCCAGCTAAGATAGAGGCAGTCACCGGTTGGACCCGACCTTCCACAGTCAGTGAGGTTCGT
AGCTTTCTGGGTTTAGCAGTTGACCATAAGGGAGCTCCTTTTGTTTGGAGCAAGGCATGTGAGGACAGTTTCCAGAACCTTAAACAGAAGCTAGTTACCGCACCGGTTCT
TACTGTACCTGATGGTTCTGGCAGTTTCGTGATTTATAGTGATGCTTCCAAGAAGGGTTTGGGTTGTGTTTTGATGCAACAGGGTAAGGTGGTCGCTTATGCGTCTCGTC
AGTTGAAGAGTCATGAGCAGAACTACCCTACACATGATTTGGAGTTGGCAGCGGTGGTTTTTGCTTTGAAAATATGGAGGCATTATTTATATGGTGAAAAGATACAGATA
TTCACGGATCATAAGAGCTTGAAATACTTCTTTACTCAGAAAGAATTGAATATGAGACAGCGAAGGTGGCTTGAGTTAGTGAAGGATTACGATTGTGAGATACTGTATCA
TCCAGGCAAGGCAAATGTGGTAGCTGATGCTCTTAGTAGAAAGGTATCACATTCAGCAGCACTTATTACCCGACAGGCCCCATTGCATCGGGATCTCGAGCGGGCTGAGG
TTGCAGTGTCGGTGGGGGCAGTTACTATGCAGTTAGCCCAGTTGACGGTACAGCCGACTTTGAGGCAAAGGATCATTGATGCTCAGAGTAACGATCCTTATTTGGTCGAG
AAACGTGGCCTAGTAGAGGCAGGGCAAGCGGAGGGATTCTCCATATCCTCTGATGGTGGACTTGTGTTTGAGAGACGCCTCTGTGTGCCGTCAGACAGTGTGATTAAGAC
AGAATTATTATCTTAG
mRNA sequenceShow/hide mRNA sequence
ATGTCTTTTGGTTTGACGAATGCTCCGGTAGTGTTTATGGACTTGATGAACAAAGTGTTTAGGGAGTTCCTAGATACTTTTGTGATCGTGTTTATCGACGATATCTTGAT
ATACTCCAAGACGGAGGCCGAACATGAGGAGCATTTACGTATAGTTCTGCAAACACTTCGGGATAATAAGTTGTATGCAAAGTTCTCGAAATGCGAGTTTTGGCTGAAGC
AGGTGTCTTTTCTGGGCCACGTGGTTTCTGAGGCTGGAGTCTCTGTGGATCCAGCTAAGATAGAGGCAGTCACCGGTTGGACCCGACCTTCCACAGTCAGTGAGGTTCGT
AGCTTTCTGGGTTTAGCAGTTGACCATAAGGGAGCTCCTTTTGTTTGGAGCAAGGCATGTGAGGACAGTTTCCAGAACCTTAAACAGAAGCTAGTTACCGCACCGGTTCT
TACTGTACCTGATGGTTCTGGCAGTTTCGTGATTTATAGTGATGCTTCCAAGAAGGGTTTGGGTTGTGTTTTGATGCAACAGGGTAAGGTGGTCGCTTATGCGTCTCGTC
AGTTGAAGAGTCATGAGCAGAACTACCCTACACATGATTTGGAGTTGGCAGCGGTGGTTTTTGCTTTGAAAATATGGAGGCATTATTTATATGGTGAAAAGATACAGATA
TTCACGGATCATAAGAGCTTGAAATACTTCTTTACTCAGAAAGAATTGAATATGAGACAGCGAAGGTGGCTTGAGTTAGTGAAGGATTACGATTGTGAGATACTGTATCA
TCCAGGCAAGGCAAATGTGGTAGCTGATGCTCTTAGTAGAAAGGTATCACATTCAGCAGCACTTATTACCCGACAGGCCCCATTGCATCGGGATCTCGAGCGGGCTGAGG
TTGCAGTGTCGGTGGGGGCAGTTACTATGCAGTTAGCCCAGTTGACGGTACAGCCGACTTTGAGGCAAAGGATCATTGATGCTCAGAGTAACGATCCTTATTTGGTCGAG
AAACGTGGCCTAGTAGAGGCAGGGCAAGCGGAGGGATTCTCCATATCCTCTGATGGTGGACTTGTGTTTGAGAGACGCCTCTGTGTGCCGTCAGACAGTGTGATTAAGAC
AGAATTATTATCTTAG
Protein sequenceShow/hide protein sequence
MSFGLTNAPVVFMDLMNKVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRIVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSEAGVSVDPAKIEAVTGWTRPSTVSEVR
SFLGLAVDHKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQI
FTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLERAEVAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVE
KRGLVEAGQAEGFSISSDGGLVFERRLCVPSDSVIKTELLS