; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cmc04g0094211 (gene) of Melon (Charmono) v1.1 genome

Gene IDCmc04g0094211
OrganismCucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
DescriptionReverse transcriptase
Genome locationCMiso1.1chr04:6713967..6715214
RNA-Seq ExpressionCmc04g0094211
SyntenyCmc04g0094211
Gene Ontology termsGO:0006278 - RNA-dependent DNA biosynthetic process (biological process)
GO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0003964 - RNA-directed DNA polymerase activity (molecular function)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR041373 - Reverse transcriptase, RNase H-like domain
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0032248.1 pol protein [Cucumis melo var. makuwa]5.6e-16880Show/hide
Query:  MSFGLTNAPAVFMDFMNRVYKDFLDMFVIVFIDDILIYSKTEAEDKEHLHRVLKTLRANKLYAKFPKCEFWLKKVTFLSNVVSSEGVSVDPAKTEAVTSL
        MSFGLTNAPAVFMD MN+V+KDFLD FVIVFIDDILIYSKTEAE +EHLH+VLKTLRANKLYAKF KCEFWL+KVTFL +VVSSEGVSVDPAK EAVT+ 
Subjt:  MSFGLTNAPAVFMDFMNRVYKDFLDMFVIVFIDDILIYSKTEAEDKEHLHRVLKTLRANKLYAKFPKCEFWLKKVTFLSNVVSSEGVSVDPAKTEAVTSL

Query:  PRPSTVSEIHSFLGLT----------------------------DGSRSFMIYSDASKKGLGCVLMQQGRVVAYASRQLKSHEQNYPTHDLMLTAVVFAL
        PRPSTVSEI SFLGL                             DGS +F+IYSDASKKGLGCVLMQQG+VVAYASRQLK HEQNYPTHDL L AVVFAL
Subjt:  PRPSTVSEIHSFLGLT----------------------------DGSRSFMIYSDASKKGLGCVLMQQGRVVAYASRQLKSHEQNYPTHDLMLTAVVFAL

Query:  KILRHYLYVEKIQIFTDHTSLKYFFTQKELNMRQRRWLELVKDDDCKILYHPGKANVVADALSRKIAHSAALITKQAPLLRDFERAGIVVSVGDVTSQLA
        KI RHYLY EKIQI+TDH SLKYFFTQKELNMRQRRWLELVKD DC+ILYHPGKANVVADALSRK+AHSAALITKQ PLLRDFERA I VSVG+VTSQLA
Subjt:  KILRHYLYVEKIQIFTDHTSLKYFFTQKELNMRQRRWLELVKDDDCKILYHPGKANVVADALSRKIAHSAALITKQAPLLRDFERAGIVVSVGDVTSQLA

Query:  WLSIQPTLRQRIIVPQLKDPYLVEKRHLVETGRGEDFSISSDDGLTFEGRLCVPEDSAVKAELLTEAHSSPFTMHPRSTKMYQDLRCVYW
         LS+QPTLRQ+IIV QL DPYLV KR +VETG+GEDFSISSDDGL FEGRLCV EDSAV+ ELLTEAHSSPFTMHP STKMYQ+LR VYW
Subjt:  WLSIQPTLRQRIIVPQLKDPYLVEKRHLVETGRGEDFSISSDDGLTFEGRLCVPEDSAVKAELLTEAHSSPFTMHPRSTKMYQDLRCVYW

KAA0042241.1 pol protein [Cucumis melo var. makuwa]4.0e-16679.49Show/hide
Query:  MSFGLTNAPAVFMDFMNRVYKDFLDMFVIVFIDDILIYSKTEAEDKEHLHRVLKTLRANKLYAKFPKCEFWLKKVTFLSNVVSSEGVSVDPAKTEAVTSL
        MSFGLTNAPAVFMD MNRV+KDFLD FVIVFIDDILIYSKTE E +EHLH+VL+TLRANKLYAKF KCEFWLKKVTFL +VVSSE VS+DPAK EAVT+ 
Subjt:  MSFGLTNAPAVFMDFMNRVYKDFLDMFVIVFIDDILIYSKTEAEDKEHLHRVLKTLRANKLYAKFPKCEFWLKKVTFLSNVVSSEGVSVDPAKTEAVTSL

Query:  PRPSTVSEIHSFLGLT----------------------------DGSRSFMIYSDASKKGLGCVLMQQGRVVAYASRQLKSHEQNYPTHDLMLTAVVFAL
        PR STVSEI SFLGL                             DGS+SF+IYSDASKKGLGCVLMQQG+VV YASRQLKSHEQNYPTHDL L AVVFAL
Subjt:  PRPSTVSEIHSFLGLT----------------------------DGSRSFMIYSDASKKGLGCVLMQQGRVVAYASRQLKSHEQNYPTHDLMLTAVVFAL

Query:  KILRHYLYVEKIQIFTDHTSLKYFFTQKELNMRQRRWLELVKDDDCKILYHPGKANVVADALSRKIAHSAALITKQAPLLRDFERAGIVVSVGDVTSQLA
        KI RHYLY EKIQI+TDH SLK+FFT KELNMRQRRWLELVKD DC+ILYHPGKANVVADAL+RK+AHSA LITKQAPLLRDFERA I VSVG+VTSQLA
Subjt:  KILRHYLYVEKIQIFTDHTSLKYFFTQKELNMRQRRWLELVKDDDCKILYHPGKANVVADALSRKIAHSAALITKQAPLLRDFERAGIVVSVGDVTSQLA

Query:  WLSIQPTLRQRIIVPQLKDPYLVEKRHLVETGRGEDFSISSDDGLTFEGRLCVPEDSAVKAELLTEAHSSPFTMHPRSTKMYQDLRCVYW
         LS+QPTLRQRIIV QL DPYLV KR LVETG+GEDFSISSDDGL F+G LCVPEDSAVK ELLTEAHSSPFTMHP STKMYQDLR VYW
Subjt:  WLSIQPTLRQRIIVPQLKDPYLVEKRHLVETGRGEDFSISSDDGLTFEGRLCVPEDSAVKAELLTEAHSSPFTMHPRSTKMYQDLRCVYW

KAA0051719.1 pol protein [Cucumis melo var. makuwa]2.5e-16876.14Show/hide
Query:  MSFGLTNAPAVFMDFMNRVYKDFLDMFVIVFIDDILIYSKTEAEDKEHLHRVLKTLRANKLYAKFPKCEFWLKKVTFLSNVVSSEGVSVDPAKTEAVTSL
        MSFGLTNAPAVFMD MNRV+KDFLD FVIVFIDDILIYSKTEAE +EHLH+VL+TLRANKLYAKF KCEFWL+KVTFL +VVSSEGVSVDPAK EAVT+ 
Subjt:  MSFGLTNAPAVFMDFMNRVYKDFLDMFVIVFIDDILIYSKTEAEDKEHLHRVLKTLRANKLYAKFPKCEFWLKKVTFLSNVVSSEGVSVDPAKTEAVTSL

Query:  PRPSTVSEIHSFLGLT-----------------------------------------------------DGSRSFMIYSDASKKGLGCVLMQQGRVVAYA
        PRPSTVSEI SFLGL                                                      DGS SF+IYSDASKKGLGCVLMQQG+VVAYA
Subjt:  PRPSTVSEIHSFLGLT-----------------------------------------------------DGSRSFMIYSDASKKGLGCVLMQQGRVVAYA

Query:  SRQLKSHEQNYPTHDLMLTAVVFALKILRHYLYVEKIQIFTDHTSLKYFFTQKELNMRQRRWLELVKDDDCKILYHPGKANVVADALSRKIAHSAALITK
        SRQLKSHEQNYPTHDL L AVVFALKI RHYLY EKIQIFTDH SLKYFFTQKELNMRQRRWLELVKD DC+ILYHPGK NVVADAL+RK+AHSAALITK
Subjt:  SRQLKSHEQNYPTHDLMLTAVVFALKILRHYLYVEKIQIFTDHTSLKYFFTQKELNMRQRRWLELVKDDDCKILYHPGKANVVADALSRKIAHSAALITK

Query:  QAPLLRDFERAGIVVSVGDVTSQLAWLSIQPTLRQRIIVPQLKDPYLVEKRHLVETGRGEDFSISSDDGLTFEGRLCVPEDSAVKAELLTEAHSSPFTMH
        Q PLLRDFERAGIV+S G+VTSQLA LS+QPTLRQ+IIV QL DPYLVEKR +VETG+G DFSISSDDGL FEGRLCVPEDSAVK ELLTEAHSSPFTMH
Subjt:  QAPLLRDFERAGIVVSVGDVTSQLAWLSIQPTLRQRIIVPQLKDPYLVEKRHLVETGRGEDFSISSDDGLTFEGRLCVPEDSAVKAELLTEAHSSPFTMH

Query:  PRSTKMYQDLRCVYW
        P STKMYQDLR VYW
Subjt:  PRSTKMYQDLRCVYW

KAA0066451.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa]4.0e-16682.88Show/hide
Query:  MSFGLTNAPAVFMDFMNRVYKDFLDMFVIVFIDDILIYSKTEAEDKEHLHRVLKTLRANKLYAKFPKCEFWLKKVTFLSNVVSSEGVSVDPAKTEAVTSL
        MSFGLTNAPAVFMD MNRV+KDFLD FVIVFIDDILIYSKTEAE +EHLH+VL+TLRANKLYAKF KCEFW++KVTFL +VVSSEGVSVDPAK EA+T+ 
Subjt:  MSFGLTNAPAVFMDFMNRVYKDFLDMFVIVFIDDILIYSKTEAEDKEHLHRVLKTLRANKLYAKFPKCEFWLKKVTFLSNVVSSEGVSVDPAKTEAVTSL

Query:  PRPSTVSEIHSFLGLTDGSRS------FMIYSDASKKGLGCVLMQQGRVVAYASRQLKSHEQNYPTHDLMLTAVVFALKILRHYLYVEKIQIFTDHTSLK
        PRPS VSEI SFLGL    RS      F+IYSDAS+KGLGCVLMQQG+VVAYA RQLKSHEQNYPTHDL L AVVFALKI RHYLY EKIQI+TDH S K
Subjt:  PRPSTVSEIHSFLGLTDGSRS------FMIYSDASKKGLGCVLMQQGRVVAYASRQLKSHEQNYPTHDLMLTAVVFALKILRHYLYVEKIQIFTDHTSLK

Query:  YFFTQKELNMRQRRWLELVKDDDCKILYHPGKANVVADALSRKIAHSAALITKQAPLLRDFERAGIVVSVGDVTSQLAWLSIQPTLRQRIIVPQLKDPYL
        YFFTQKELNMRQRRWLELVKD DC+ILYHPGKANVVAD LSRK+AHSAALITKQ PLL DFER  I VSVG+VTSQLA LS+QPTLRQ IIV QL DPYL
Subjt:  YFFTQKELNMRQRRWLELVKDDDCKILYHPGKANVVADALSRKIAHSAALITKQAPLLRDFERAGIVVSVGDVTSQLAWLSIQPTLRQRIIVPQLKDPYL

Query:  VEKRHLVETGRGEDFSISSDDGLTFEGRLCVPEDSAVKAELLTEAHSSPFTMHPRSTKMYQDLRCVYW
        VEKR +VETG+GE+FSISS+DGL FEGRLCVPEDSAVK ELLTEAHSSPFTMHP STKMYQDLR VYW
Subjt:  VEKRHLVETGRGEDFSISSDDGLTFEGRLCVPEDSAVKAELLTEAHSSPFTMHPRSTKMYQDLRCVYW

TYK01306.1 pol protein [Cucumis melo var. makuwa]2.5e-16876.14Show/hide
Query:  MSFGLTNAPAVFMDFMNRVYKDFLDMFVIVFIDDILIYSKTEAEDKEHLHRVLKTLRANKLYAKFPKCEFWLKKVTFLSNVVSSEGVSVDPAKTEAVTSL
        MSFGLTNAPAVFMD MNRV+KDFLD FVIVFIDDILIYSKTEAE +EHLH+VL+TLRANKLYAKF KCEFWL+KVTFL +VVSSEGVSVDPAK EAVT+ 
Subjt:  MSFGLTNAPAVFMDFMNRVYKDFLDMFVIVFIDDILIYSKTEAEDKEHLHRVLKTLRANKLYAKFPKCEFWLKKVTFLSNVVSSEGVSVDPAKTEAVTSL

Query:  PRPSTVSEIHSFLGLT-----------------------------------------------------DGSRSFMIYSDASKKGLGCVLMQQGRVVAYA
        PRPSTVSEI SFLGL                                                      DGS SF+IYSDASKKGLGCVLMQQG+VVAYA
Subjt:  PRPSTVSEIHSFLGLT-----------------------------------------------------DGSRSFMIYSDASKKGLGCVLMQQGRVVAYA

Query:  SRQLKSHEQNYPTHDLMLTAVVFALKILRHYLYVEKIQIFTDHTSLKYFFTQKELNMRQRRWLELVKDDDCKILYHPGKANVVADALSRKIAHSAALITK
        SRQLKSHEQNYPTHDL L AVVFALKI RHYLY EKIQIFTDH SLKYFFTQKELNMRQRRWLELVKD DC+ILYHPGK NVVADAL+RK+AHSAALITK
Subjt:  SRQLKSHEQNYPTHDLMLTAVVFALKILRHYLYVEKIQIFTDHTSLKYFFTQKELNMRQRRWLELVKDDDCKILYHPGKANVVADALSRKIAHSAALITK

Query:  QAPLLRDFERAGIVVSVGDVTSQLAWLSIQPTLRQRIIVPQLKDPYLVEKRHLVETGRGEDFSISSDDGLTFEGRLCVPEDSAVKAELLTEAHSSPFTMH
        Q PLLRDFERAGIV+S G+VTSQLA LS+QPTLRQ+IIV QL DPYLVEKR +VETG+G DFSISSDDGL FEGRLCVPEDSAVK ELLTEAHSSPFTMH
Subjt:  QAPLLRDFERAGIVVSVGDVTSQLAWLSIQPTLRQRIIVPQLKDPYLVEKRHLVETGRGEDFSISSDDGLTFEGRLCVPEDSAVKAELLTEAHSSPFTMH

Query:  PRSTKMYQDLRCVYW
        P STKMYQDLR VYW
Subjt:  PRSTKMYQDLRCVYW

TrEMBL top hitse value%identityAlignment
A0A5A7SNR3 Reverse transcriptase2.7e-16880Show/hide
Query:  MSFGLTNAPAVFMDFMNRVYKDFLDMFVIVFIDDILIYSKTEAEDKEHLHRVLKTLRANKLYAKFPKCEFWLKKVTFLSNVVSSEGVSVDPAKTEAVTSL
        MSFGLTNAPAVFMD MN+V+KDFLD FVIVFIDDILIYSKTEAE +EHLH+VLKTLRANKLYAKF KCEFWL+KVTFL +VVSSEGVSVDPAK EAVT+ 
Subjt:  MSFGLTNAPAVFMDFMNRVYKDFLDMFVIVFIDDILIYSKTEAEDKEHLHRVLKTLRANKLYAKFPKCEFWLKKVTFLSNVVSSEGVSVDPAKTEAVTSL

Query:  PRPSTVSEIHSFLGLT----------------------------DGSRSFMIYSDASKKGLGCVLMQQGRVVAYASRQLKSHEQNYPTHDLMLTAVVFAL
        PRPSTVSEI SFLGL                             DGS +F+IYSDASKKGLGCVLMQQG+VVAYASRQLK HEQNYPTHDL L AVVFAL
Subjt:  PRPSTVSEIHSFLGLT----------------------------DGSRSFMIYSDASKKGLGCVLMQQGRVVAYASRQLKSHEQNYPTHDLMLTAVVFAL

Query:  KILRHYLYVEKIQIFTDHTSLKYFFTQKELNMRQRRWLELVKDDDCKILYHPGKANVVADALSRKIAHSAALITKQAPLLRDFERAGIVVSVGDVTSQLA
        KI RHYLY EKIQI+TDH SLKYFFTQKELNMRQRRWLELVKD DC+ILYHPGKANVVADALSRK+AHSAALITKQ PLLRDFERA I VSVG+VTSQLA
Subjt:  KILRHYLYVEKIQIFTDHTSLKYFFTQKELNMRQRRWLELVKDDDCKILYHPGKANVVADALSRKIAHSAALITKQAPLLRDFERAGIVVSVGDVTSQLA

Query:  WLSIQPTLRQRIIVPQLKDPYLVEKRHLVETGRGEDFSISSDDGLTFEGRLCVPEDSAVKAELLTEAHSSPFTMHPRSTKMYQDLRCVYW
         LS+QPTLRQ+IIV QL DPYLV KR +VETG+GEDFSISSDDGL FEGRLCV EDSAV+ ELLTEAHSSPFTMHP STKMYQ+LR VYW
Subjt:  WLSIQPTLRQRIIVPQLKDPYLVEKRHLVETGRGEDFSISSDDGLTFEGRLCVPEDSAVKAELLTEAHSSPFTMHPRSTKMYQDLRCVYW

A0A5A7TKY8 Pol protein1.9e-16679.49Show/hide
Query:  MSFGLTNAPAVFMDFMNRVYKDFLDMFVIVFIDDILIYSKTEAEDKEHLHRVLKTLRANKLYAKFPKCEFWLKKVTFLSNVVSSEGVSVDPAKTEAVTSL
        MSFGLTNAPAVFMD MNRV+KDFLD FVIVFIDDILIYSKTE E +EHLH+VL+TLRANKLYAKF KCEFWLKKVTFL +VVSSE VS+DPAK EAVT+ 
Subjt:  MSFGLTNAPAVFMDFMNRVYKDFLDMFVIVFIDDILIYSKTEAEDKEHLHRVLKTLRANKLYAKFPKCEFWLKKVTFLSNVVSSEGVSVDPAKTEAVTSL

Query:  PRPSTVSEIHSFLGLT----------------------------DGSRSFMIYSDASKKGLGCVLMQQGRVVAYASRQLKSHEQNYPTHDLMLTAVVFAL
        PR STVSEI SFLGL                             DGS+SF+IYSDASKKGLGCVLMQQG+VV YASRQLKSHEQNYPTHDL L AVVFAL
Subjt:  PRPSTVSEIHSFLGLT----------------------------DGSRSFMIYSDASKKGLGCVLMQQGRVVAYASRQLKSHEQNYPTHDLMLTAVVFAL

Query:  KILRHYLYVEKIQIFTDHTSLKYFFTQKELNMRQRRWLELVKDDDCKILYHPGKANVVADALSRKIAHSAALITKQAPLLRDFERAGIVVSVGDVTSQLA
        KI RHYLY EKIQI+TDH SLK+FFT KELNMRQRRWLELVKD DC+ILYHPGKANVVADAL+RK+AHSA LITKQAPLLRDFERA I VSVG+VTSQLA
Subjt:  KILRHYLYVEKIQIFTDHTSLKYFFTQKELNMRQRRWLELVKDDDCKILYHPGKANVVADALSRKIAHSAALITKQAPLLRDFERAGIVVSVGDVTSQLA

Query:  WLSIQPTLRQRIIVPQLKDPYLVEKRHLVETGRGEDFSISSDDGLTFEGRLCVPEDSAVKAELLTEAHSSPFTMHPRSTKMYQDLRCVYW
         LS+QPTLRQRIIV QL DPYLV KR LVETG+GEDFSISSDDGL F+G LCVPEDSAVK ELLTEAHSSPFTMHP STKMYQDLR VYW
Subjt:  WLSIQPTLRQRIIVPQLKDPYLVEKRHLVETGRGEDFSISSDDGLTFEGRLCVPEDSAVKAELLTEAHSSPFTMHPRSTKMYQDLRCVYW

A0A5A7UE01 Reverse transcriptase1.2e-16876.14Show/hide
Query:  MSFGLTNAPAVFMDFMNRVYKDFLDMFVIVFIDDILIYSKTEAEDKEHLHRVLKTLRANKLYAKFPKCEFWLKKVTFLSNVVSSEGVSVDPAKTEAVTSL
        MSFGLTNAPAVFMD MNRV+KDFLD FVIVFIDDILIYSKTEAE +EHLH+VL+TLRANKLYAKF KCEFWL+KVTFL +VVSSEGVSVDPAK EAVT+ 
Subjt:  MSFGLTNAPAVFMDFMNRVYKDFLDMFVIVFIDDILIYSKTEAEDKEHLHRVLKTLRANKLYAKFPKCEFWLKKVTFLSNVVSSEGVSVDPAKTEAVTSL

Query:  PRPSTVSEIHSFLGLT-----------------------------------------------------DGSRSFMIYSDASKKGLGCVLMQQGRVVAYA
        PRPSTVSEI SFLGL                                                      DGS SF+IYSDASKKGLGCVLMQQG+VVAYA
Subjt:  PRPSTVSEIHSFLGLT-----------------------------------------------------DGSRSFMIYSDASKKGLGCVLMQQGRVVAYA

Query:  SRQLKSHEQNYPTHDLMLTAVVFALKILRHYLYVEKIQIFTDHTSLKYFFTQKELNMRQRRWLELVKDDDCKILYHPGKANVVADALSRKIAHSAALITK
        SRQLKSHEQNYPTHDL L AVVFALKI RHYLY EKIQIFTDH SLKYFFTQKELNMRQRRWLELVKD DC+ILYHPGK NVVADAL+RK+AHSAALITK
Subjt:  SRQLKSHEQNYPTHDLMLTAVVFALKILRHYLYVEKIQIFTDHTSLKYFFTQKELNMRQRRWLELVKDDDCKILYHPGKANVVADALSRKIAHSAALITK

Query:  QAPLLRDFERAGIVVSVGDVTSQLAWLSIQPTLRQRIIVPQLKDPYLVEKRHLVETGRGEDFSISSDDGLTFEGRLCVPEDSAVKAELLTEAHSSPFTMH
        Q PLLRDFERAGIV+S G+VTSQLA LS+QPTLRQ+IIV QL DPYLVEKR +VETG+G DFSISSDDGL FEGRLCVPEDSAVK ELLTEAHSSPFTMH
Subjt:  QAPLLRDFERAGIVVSVGDVTSQLAWLSIQPTLRQRIIVPQLKDPYLVEKRHLVETGRGEDFSISSDDGLTFEGRLCVPEDSAVKAELLTEAHSSPFTMH

Query:  PRSTKMYQDLRCVYW
        P STKMYQDLR VYW
Subjt:  PRSTKMYQDLRCVYW

A0A5A7VLF2 Reverse transcriptase1.9e-16682.88Show/hide
Query:  MSFGLTNAPAVFMDFMNRVYKDFLDMFVIVFIDDILIYSKTEAEDKEHLHRVLKTLRANKLYAKFPKCEFWLKKVTFLSNVVSSEGVSVDPAKTEAVTSL
        MSFGLTNAPAVFMD MNRV+KDFLD FVIVFIDDILIYSKTEAE +EHLH+VL+TLRANKLYAKF KCEFW++KVTFL +VVSSEGVSVDPAK EA+T+ 
Subjt:  MSFGLTNAPAVFMDFMNRVYKDFLDMFVIVFIDDILIYSKTEAEDKEHLHRVLKTLRANKLYAKFPKCEFWLKKVTFLSNVVSSEGVSVDPAKTEAVTSL

Query:  PRPSTVSEIHSFLGLTDGSRS------FMIYSDASKKGLGCVLMQQGRVVAYASRQLKSHEQNYPTHDLMLTAVVFALKILRHYLYVEKIQIFTDHTSLK
        PRPS VSEI SFLGL    RS      F+IYSDAS+KGLGCVLMQQG+VVAYA RQLKSHEQNYPTHDL L AVVFALKI RHYLY EKIQI+TDH S K
Subjt:  PRPSTVSEIHSFLGLTDGSRS------FMIYSDASKKGLGCVLMQQGRVVAYASRQLKSHEQNYPTHDLMLTAVVFALKILRHYLYVEKIQIFTDHTSLK

Query:  YFFTQKELNMRQRRWLELVKDDDCKILYHPGKANVVADALSRKIAHSAALITKQAPLLRDFERAGIVVSVGDVTSQLAWLSIQPTLRQRIIVPQLKDPYL
        YFFTQKELNMRQRRWLELVKD DC+ILYHPGKANVVAD LSRK+AHSAALITKQ PLL DFER  I VSVG+VTSQLA LS+QPTLRQ IIV QL DPYL
Subjt:  YFFTQKELNMRQRRWLELVKDDDCKILYHPGKANVVADALSRKIAHSAALITKQAPLLRDFERAGIVVSVGDVTSQLAWLSIQPTLRQRIIVPQLKDPYL

Query:  VEKRHLVETGRGEDFSISSDDGLTFEGRLCVPEDSAVKAELLTEAHSSPFTMHPRSTKMYQDLRCVYW
        VEKR +VETG+GE+FSISS+DGL FEGRLCVPEDSAVK ELLTEAHSSPFTMHP STKMYQDLR VYW
Subjt:  VEKRHLVETGRGEDFSISSDDGLTFEGRLCVPEDSAVKAELLTEAHSSPFTMHPRSTKMYQDLRCVYW

A0A5D3BSV9 Reverse transcriptase1.2e-16876.14Show/hide
Query:  MSFGLTNAPAVFMDFMNRVYKDFLDMFVIVFIDDILIYSKTEAEDKEHLHRVLKTLRANKLYAKFPKCEFWLKKVTFLSNVVSSEGVSVDPAKTEAVTSL
        MSFGLTNAPAVFMD MNRV+KDFLD FVIVFIDDILIYSKTEAE +EHLH+VL+TLRANKLYAKF KCEFWL+KVTFL +VVSSEGVSVDPAK EAVT+ 
Subjt:  MSFGLTNAPAVFMDFMNRVYKDFLDMFVIVFIDDILIYSKTEAEDKEHLHRVLKTLRANKLYAKFPKCEFWLKKVTFLSNVVSSEGVSVDPAKTEAVTSL

Query:  PRPSTVSEIHSFLGLT-----------------------------------------------------DGSRSFMIYSDASKKGLGCVLMQQGRVVAYA
        PRPSTVSEI SFLGL                                                      DGS SF+IYSDASKKGLGCVLMQQG+VVAYA
Subjt:  PRPSTVSEIHSFLGLT-----------------------------------------------------DGSRSFMIYSDASKKGLGCVLMQQGRVVAYA

Query:  SRQLKSHEQNYPTHDLMLTAVVFALKILRHYLYVEKIQIFTDHTSLKYFFTQKELNMRQRRWLELVKDDDCKILYHPGKANVVADALSRKIAHSAALITK
        SRQLKSHEQNYPTHDL L AVVFALKI RHYLY EKIQIFTDH SLKYFFTQKELNMRQRRWLELVKD DC+ILYHPGK NVVADAL+RK+AHSAALITK
Subjt:  SRQLKSHEQNYPTHDLMLTAVVFALKILRHYLYVEKIQIFTDHTSLKYFFTQKELNMRQRRWLELVKDDDCKILYHPGKANVVADALSRKIAHSAALITK

Query:  QAPLLRDFERAGIVVSVGDVTSQLAWLSIQPTLRQRIIVPQLKDPYLVEKRHLVETGRGEDFSISSDDGLTFEGRLCVPEDSAVKAELLTEAHSSPFTMH
        Q PLLRDFERAGIV+S G+VTSQLA LS+QPTLRQ+IIV QL DPYLVEKR +VETG+G DFSISSDDGL FEGRLCVPEDSAVK ELLTEAHSSPFTMH
Subjt:  QAPLLRDFERAGIVVSVGDVTSQLAWLSIQPTLRQRIIVPQLKDPYLVEKRHLVETGRGEDFSISSDDGLTFEGRLCVPEDSAVKAELLTEAHSSPFTMH

Query:  PRSTKMYQDLRCVYW
        P STKMYQDLR VYW
Subjt:  PRSTKMYQDLRCVYW

SwissProt top hitse value%identityAlignment
P04323 Retrovirus-related Pol polyprotein from transposon 17.68.4e-3432.41Show/hide
Query:  MSFGLTNAPAVFMDFMNRVYKDFLDMFVIVFIDDILIYSKTEAEDKEHLHRVLKTLRANKLYAKFPKCEFWLKKVTFLSNVVSSEGVSVDPAKTEAVTSL
        M FGL NAPA F   MN + +  L+   +V++DDI+++S +  E  + L  V + L    L  +  KCEF  ++ TFL +V++ +G+  +P K EA+   
Subjt:  MSFGLTNAPAVFMDFMNRVYKDFLDMFVIVFIDDILIYSKTEAEDKEHLHRVLKTLRANKLYAKFPKCEFWLKKVTFLSNVVSSEGVSVDPAKTEAVTSL

Query:  PRPSTVSEIHSFLGLT------------------------------------------------------DGSRSFMIYSDASKKGLGCVLMQQGRVVAY
        P P+   EI +FLGLT                                                      D ++ F + +DAS   LG VL Q G  ++Y
Subjt:  PRPSTVSEIHSFLGLT------------------------------------------------------DGSRSFMIYSDASKKGLGCVLMQQGRVVAY

Query:  ASRQLKSHEQNYPTHDLMLTAVVFALKILRHYLYVEKIQIFTDHTSLKYFFTQKELNMRQRRWLELVKDDDCKILYHPGKANVVADALSR
         SR L  HE NY T +  L A+V+A K  RHYL     +I +DH  L + +  K+ N +  RW   + + D  I Y  GK N VADALSR
Subjt:  ASRQLKSHEQNYPTHDLMLTAVVFALKILRHYLYVEKIQIFTDHTSLKYFFTQKELNMRQRRWLELVKDDDCKILYHPGKANVVADALSR

P10401 Retrovirus-related Pol polyprotein from transposon gypsy4.6e-3229.37Show/hide
Query:  MSFGLTNAPAVFMDFMNRVYKDFLDMFVIVFIDDILIYSKTEAEDKEHLHRVLKTLRANKLYAKFPKCEFWLKKVTFLSNVVSSEGVSVDPAKTEAVTSL
        + FGL NA ++F   ++ V ++ +     V++DD++I+S+ E++   H+  VLK L    +     K  F+ + V +L  +VS +G   DP K +A+   
Subjt:  MSFGLTNAPAVFMDFMNRVYKDFLDMFVIVFIDDILIYSKTEAEDKEHLHRVLKTLRANKLYAKFPKCEFWLKKVTFLSNVVSSEGVSVDPAKTEAVTSL

Query:  PRPSTVSEIHSFLGLT-----------------------------------------------------------------DGSRSFMIYSDASKKGLGC
        P P  V ++ SFLGL                                                                  D  + F + +DAS  G+G 
Subjt:  PRPSTVSEIHSFLGLT-----------------------------------------------------------------DGSRSFMIYSDASKKGLGC

Query:  VLMQQGRVVAYASRQLKSHEQNYPTHDLMLTAVVFALKILRHYLY-VEKIQIFTDHTSLKYFFTQKELNMRQRRWLELVKDDDCKILYHPGKANVVADAL
        VL Q+GR +   SR LK  EQNY T++  L A+V+AL  L+++LY   +I IFTDH  L +    +  N + +RW   +   + K+ Y PGK N VADAL
Subjt:  VLMQQGRVVAYASRQLKSHEQNYPTHDLMLTAVVFALKILRHYLY-VEKIQIFTDHTSLKYFFTQKELNMRQRRWLELVKDDDCKILYHPGKANVVADAL

Query:  SRK
        SR+
Subjt:  SRK

P20825 Retrovirus-related Pol polyprotein from transposon 2971.6e-3231.72Show/hide
Query:  MSFGLTNAPAVFMDFMNRVYKDFLDMFVIVFIDDILIYSKTEAEDKEHLHRVLKTLRANKLYAKFPKCEFWLKKVTFLSNVVSSEGVSVDPAKTEAVTSL
        M FGL NAPA F   MN + +  L+   +V++DDI+I+S +  E    +  V   L    L  +  KCEF  K+  FL ++V+ +G+  +P K +A+ S 
Subjt:  MSFGLTNAPAVFMDFMNRVYKDFLDMFVIVFIDDILIYSKTEAEDKEHLHRVLKTLRANKLYAKFPKCEFWLKKVTFLSNVVSSEGVSVDPAKTEAVTSL

Query:  PRPSTVSEIHSFLGLT------------------------------------------------------DGSRSFMIYSDASKKGLGCVLMQQGRVVAY
        P P+   EI +FLGLT                                                      D  + F++ +DAS   LG VL Q G  +++
Subjt:  PRPSTVSEIHSFLGLT------------------------------------------------------DGSRSFMIYSDASKKGLGCVLMQQGRVVAY

Query:  ASRQLKSHEQNYPTHDLMLTAVVFALKILRHYLYVEKIQIFTDHTSLKYFFTQKELNMRQRRWLELVKDDDCKILYHPGKANVVADALSR
         SR L  HE NY   +  L A+V+A K  RHYL   +  I +DH  L++    KE   +  RW   + +   KI Y  GK N VADALSR
Subjt:  ASRQLKSHEQNYPTHDLMLTAVVFALKILRHYLYVEKIQIFTDHTSLKYFFTQKELNMRQRRWLELVKDDDCKILYHPGKANVVADALSR

Q7LHG5 Transposon Ty3-I Gag-Pol polyprotein2.5e-2529.83Show/hide
Query:  MSFGLTNAPAVFMDFMNRVYKDFLDMFVIVFIDDILIYSKTEAEDKEHLHRVLKTLRANKLYAKFPKCEFWLKKVTFLSNVVSSEGVSVDPAKTEAVTSL
        M FGL NAP+ F  +M   ++D    FV V++DDILI+S++  E  +HL  VL+ L+   L  K  KC+F  ++  FL   +  + ++    K  A+   
Subjt:  MSFGLTNAPAVFMDFMNRVYKDFLDMFVIVFIDDILIYSKTEAEDKEHLHRVLKTLRANKLYAKFPKCEFWLKKVTFLSNVVSSEGVSVDPAKTEAVTSL

Query:  PRPSTVSEIHSFLGLTDGSRSFM---------------------------------------------------IYSDASKKGLGCVLMQQGR------V
        P P TV +   FLG+ +  R F+                                                   + +DASK G+G VL +         V
Subjt:  PRPSTVSEIHSFLGLTDGSRSFM---------------------------------------------------IYSDASKKGLGCVLMQQGR------V

Query:  VAYASRQLKSHEQNYPTHDLMLTAVVFALKILRHYLYVEKIQIFTDHTSLKYFFTQKELNMRQRRWLELVKDDDCKILYHPGKANVVADALSRKI
        V Y S+ L+S ++NYP  +L L  ++ AL   R+ L+ +   + TDH SL     + E   R +RWL+ +   D  + Y  G  NVVADA+SR I
Subjt:  VAYASRQLKSHEQNYPTHDLMLTAVVFALKILRHYLYVEKIQIFTDHTSLKYFFTQKELNMRQRRWLELVKDDDCKILYHPGKANVVADALSRKI

Q8I7P9 Retrovirus-related Pol polyprotein from transposon opus1.9e-3028.2Show/hide
Query:  MSFGLTNAPAVFMDFMNRVYKDFLDMFVIVFIDDILIYSKTEAEDKEHLHRVLKTLRANKLYAKFPKCEFWLKKVTFLSNVVSSEGVSVDPAKTEAVTSL
        + FGL NAPA+F   ++ + ++ +     V+IDDI+++S+      ++L  VL +L    L     K  F   +V FL  +V+++G+  DP K  A++ +
Subjt:  MSFGLTNAPAVFMDFMNRVYKDFLDMFVIVFIDDILIYSKTEAEDKEHLHRVLKTLRANKLYAKFPKCEFWLKKVTFLSNVVSSEGVSVDPAKTEAVTSL

Query:  PRPSTVSEIHSFLGLTDGSRSFM----------------------------------------------------------------IYSDASKKGLGCV
        P P++V E+  FLG+T   R F+                                                                + +DAS   +G V
Subjt:  PRPSTVSEIHSFLGLTDGSRSFM----------------------------------------------------------------IYSDASKKGLGCV

Query:  LMQ----QGRVVAYASRQLKSHEQNYPTHDLMLTAVVFALKILRHYLY-VEKIQIFTDHTSLKYFFTQKELNMRQRRWLELVKDDDCKILYHPGKANVVA
        L Q    + R +AY SR L   E+NY T +  + A++++L  LR YLY    I+++TDH  L +    +  N + +RW   +++ +C+++Y PGK+NVVA
Subjt:  LMQ----QGRVVAYASRQLKSHEQNYPTHDLMLTAVVFALKILRHYLY-VEKIQIFTDHTSLKYFFTQKELNMRQRRWLELVKDDDCKILYHPGKANVVA

Query:  DALSR
        DALSR
Subjt:  DALSR

Arabidopsis top hitse value%identityAlignment
ATMG00860.1 DNA/RNA polymerases superfamily protein7.4e-0939.74Show/hide
Query:  HLHRVLKTLRANKLYAKFPKCEFWLKKVTFLS--NVVSSEGVSVDPAKTEAVTSLPRPSTVSEIHSFLGLTDGSRSFM
        HL  VL+    ++ YA   KC F   ++ +L   +++S EGVS DPAK EA+   P P   +E+  FLGLT   R F+
Subjt:  HLHRVLKTLRANKLYAKFPKCEFWLKKVTFLS--NVVSSEGVSVDPAKTEAVTSLPRPSTVSEIHSFLGLTDGSRSFM


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCGTTTGGTTTGACTAATGCTCCTGCGGTATTCATGGACTTCATGAACAGGGTGTATAAGGATTTCTTAGACATGTTTGTTATAGTTTTCATTGACGACATTTTGAT
TTACTCCAAGACTGAGGCTGAGGATAAGGAGCATTTGCACCGAGTTTTGAAGACTCTTCGAGCCAATAAGCTATATGCCAAGTTCCCTAAGTGTGAGTTCTGGCTGAAGA
AGGTGACCTTCCTCAGCAACGTGGTTTCCAGTGAGGGGGTTTCTGTGGACCCAGCAAAGACCGAAGCGGTTACTAGTTTGCCTCGACCGTCTACAGTTAGCGAGATTCAT
AGTTTCCTGGGTTTGACAGATGGATCGAGGAGCTTTATGATCTACAGTGATGCTTCCAAGAAAGGACTGGGTTGTGTGCTGATGCAGCAGGGCAGGGTAGTTGCTTACGC
CTCTCGTCAGTTGAAGAGTCACGAGCAGAACTATCCTACCCATGACCTAATGTTGACAGCAGTGGTTTTTGCACTGAAGATATTGAGACATTACCTGTACGTTGAGAAGA
TACAAATTTTCACTGACCATACGAGCCTAAAGTACTTCTTCACCCAGAAGGAGCTGAACATGAGACAGAGAAGATGGCTTGAGTTGGTGAAGGATGATGACTGCAAGATT
CTGTACCACCCGGGTAAGGCAAATGTAGTAGCTGATGCGTTGAGCAGGAAGATTGCACATTCAGCTGCGCTTATCACGAAGCAGGCCCCCTTACTCAGAGATTTTGAGAG
AGCCGGGATTGTAGTCTCAGTAGGGGACGTTACCTCACAGTTGGCTTGGTTGTCAATACAGCCGACCTTGAGACAGAGGATTATTGTCCCTCAGCTAAAGGATCCTTATC
TGGTCGAGAAGCGTCATTTGGTAGAGACAGGACGAGGTGAGGATTTCTCCATATCCTCTGACGACGGCCTTACGTTTGAGGGACGTTTGTGTGTGCCGGAAGACAGTGCA
GTCAAGGCAGAGCTTTTGACTGAGGCTCATAGTTCTCCATTCACTATGCACCCTAGAAGTACGAAGATGTACCAAGACTTGAGGTGTGTCTATTGGTAG
mRNA sequenceShow/hide mRNA sequence
ATGTCGTTTGGTTTGACTAATGCTCCTGCGGTATTCATGGACTTCATGAACAGGGTGTATAAGGATTTCTTAGACATGTTTGTTATAGTTTTCATTGACGACATTTTGAT
TTACTCCAAGACTGAGGCTGAGGATAAGGAGCATTTGCACCGAGTTTTGAAGACTCTTCGAGCCAATAAGCTATATGCCAAGTTCCCTAAGTGTGAGTTCTGGCTGAAGA
AGGTGACCTTCCTCAGCAACGTGGTTTCCAGTGAGGGGGTTTCTGTGGACCCAGCAAAGACCGAAGCGGTTACTAGTTTGCCTCGACCGTCTACAGTTAGCGAGATTCAT
AGTTTCCTGGGTTTGACAGATGGATCGAGGAGCTTTATGATCTACAGTGATGCTTCCAAGAAAGGACTGGGTTGTGTGCTGATGCAGCAGGGCAGGGTAGTTGCTTACGC
CTCTCGTCAGTTGAAGAGTCACGAGCAGAACTATCCTACCCATGACCTAATGTTGACAGCAGTGGTTTTTGCACTGAAGATATTGAGACATTACCTGTACGTTGAGAAGA
TACAAATTTTCACTGACCATACGAGCCTAAAGTACTTCTTCACCCAGAAGGAGCTGAACATGAGACAGAGAAGATGGCTTGAGTTGGTGAAGGATGATGACTGCAAGATT
CTGTACCACCCGGGTAAGGCAAATGTAGTAGCTGATGCGTTGAGCAGGAAGATTGCACATTCAGCTGCGCTTATCACGAAGCAGGCCCCCTTACTCAGAGATTTTGAGAG
AGCCGGGATTGTAGTCTCAGTAGGGGACGTTACCTCACAGTTGGCTTGGTTGTCAATACAGCCGACCTTGAGACAGAGGATTATTGTCCCTCAGCTAAAGGATCCTTATC
TGGTCGAGAAGCGTCATTTGGTAGAGACAGGACGAGGTGAGGATTTCTCCATATCCTCTGACGACGGCCTTACGTTTGAGGGACGTTTGTGTGTGCCGGAAGACAGTGCA
GTCAAGGCAGAGCTTTTGACTGAGGCTCATAGTTCTCCATTCACTATGCACCCTAGAAGTACGAAGATGTACCAAGACTTGAGGTGTGTCTATTGGTAG
Protein sequenceShow/hide protein sequence
MSFGLTNAPAVFMDFMNRVYKDFLDMFVIVFIDDILIYSKTEAEDKEHLHRVLKTLRANKLYAKFPKCEFWLKKVTFLSNVVSSEGVSVDPAKTEAVTSLPRPSTVSEIH
SFLGLTDGSRSFMIYSDASKKGLGCVLMQQGRVVAYASRQLKSHEQNYPTHDLMLTAVVFALKILRHYLYVEKIQIFTDHTSLKYFFTQKELNMRQRRWLELVKDDDCKI
LYHPGKANVVADALSRKIAHSAALITKQAPLLRDFERAGIVVSVGDVTSQLAWLSIQPTLRQRIIVPQLKDPYLVEKRHLVETGRGEDFSISSDDGLTFEGRLCVPEDSA
VKAELLTEAHSSPFTMHPRSTKMYQDLRCVYW