; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cmc01g0019621 (gene) of Melon (Charmono) v1.1 genome

Gene IDCmc01g0019621
OrganismCucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
DescriptionReverse transcriptase
Genome locationCMiso1.1chr01:17518632..17519642
RNA-Seq ExpressionCmc01g0019621
SyntenyCmc01g0019621
Gene Ontology termsGO:0006278 - RNA-dependent DNA biosynthetic process (biological process)
GO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0016020 - membrane (cellular component)
GO:0003700 - DNA-binding transcription factor activity (molecular function)
GO:0003964 - RNA-directed DNA polymerase activity (molecular function)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
GO:0043565 - sequence-specific DNA binding (molecular function)
InterPro domainsIPR041373 - Reverse transcriptase, RNase H-like domain
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0043661.1 pol protein [Cucumis melo var. makuwa]8.0e-17791.96Show/hide
Query:  MVLETLRDNKLYAKFSKCEFWLKQVSFLGHVISKAGVSVDPAKIEEVTSWPRPFTVSEVHSFLGLAGYYQRFVENFSRIATPLTQLTRKGAPFVWSKACE
        MVL+TLRDNKLYAKFSKCEFWLKQVSFLGHV+SKAGVSVDPAKIE VT W RP TVSEV SFLGLAGYY+RFVENFSRIATPLTQLTRKGAPFVWSKACE
Subjt:  MVLETLRDNKLYAKFSKCEFWLKQVSFLGHVISKAGVSVDPAKIEEVTSWPRPFTVSEVHSFLGLAGYYQRFVENFSRIATPLTQLTRKGAPFVWSKACE

Query:  DSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMEQGKVVAYASRQLKSHEQNYPTHDLELAAVVFAMKIWRHYLYGEKIQIYTDHKSLKYFF
        DSFQNLKQKLVTAPVLTVP+GS SFVIYSDASKKGLGCVLM+QGKVVAYASRQLKSHEQNYPTHDLELAAVVFA+KIWRHYLYGEKIQI+TDHKSLKYFF
Subjt:  DSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMEQGKVVAYASRQLKSHEQNYPTHDLELAAVVFAMKIWRHYLYGEKIQIYTDHKSLKYFF

Query:  TQKELNMRQRKWLELVKDYDCEILYHPGKANVVVDSLSRKVSHSTALITRQTPLHRDFERAEIAVSVGAITMQLAQLTVQSILRQKIIDAQSNDPYLVKK
        TQKELNMRQR+WLELVKDYDCEILYHPGKANVV D+LSRKVSHSTALITRQ PLHRD ERAEIAVSVGA+TMQLAQLTVQ  LRQ+IIDAQSNDPYLV+K
Subjt:  TQKELNMRQRKWLELVKDYDCEILYHPGKANVVVDSLSRKVSHSTALITRQTPLHRDFERAEIAVSVGAITMQLAQLTVQSILRQKIIDAQSNDPYLVKK

Query:  RGLAEAGQVVEFSISSDGGILFGRRLCVPSDSAVKT
        RGLAEAGQ VEFS+SSDGG+LF RRLCVPSDSAVKT
Subjt:  RGLAEAGQVVEFSISSDGGILFGRRLCVPSDSAVKT

KAA0045479.1 pol protein [Cucumis melo var. makuwa]6.1e-17791.96Show/hide
Query:  MVLETLRDNKLYAKFSKCEFWLKQVSFLGHVISKAGVSVDPAKIEEVTSWPRPFTVSEVHSFLGLAGYYQRFVENFSRIATPLTQLTRKGAPFVWSKACE
        MVL+TLRDNKLYAKFSKCEFWLKQVSFLGHV+SKAGVSVDPAKIE VT W RP TVSEV SFLGLAGYY+RFVENFSRIATPLTQLTRKGAPFVWSKACE
Subjt:  MVLETLRDNKLYAKFSKCEFWLKQVSFLGHVISKAGVSVDPAKIEEVTSWPRPFTVSEVHSFLGLAGYYQRFVENFSRIATPLTQLTRKGAPFVWSKACE

Query:  DSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMEQGKVVAYASRQLKSHEQNYPTHDLELAAVVFAMKIWRHYLYGEKIQIYTDHKSLKYFF
        DSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLM+QGKVVAYASRQLKSHEQNYPTHDLELAAVVFA+KIWRHYLYGEKIQI+TDHKSLKYFF
Subjt:  DSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMEQGKVVAYASRQLKSHEQNYPTHDLELAAVVFAMKIWRHYLYGEKIQIYTDHKSLKYFF

Query:  TQKELNMRQRKWLELVKDYDCEILYHPGKANVVVDSLSRKVSHSTALITRQTPLHRDFERAEIAVSVGAITMQLAQLTVQSILRQKIIDAQSNDPYLVKK
        TQKELNMRQR+WLELVKDYDCEILYHPGKANVV D+LSRKVSHS ALITRQ PLHRD ER EIAVSVGA+TMQLAQLTVQ  LRQ+IIDAQSNDPYLV+K
Subjt:  TQKELNMRQRKWLELVKDYDCEILYHPGKANVVVDSLSRKVSHSTALITRQTPLHRDFERAEIAVSVGAITMQLAQLTVQSILRQKIIDAQSNDPYLVKK

Query:  RGLAEAGQVVEFSISSDGGILFGRRLCVPSDSAVKT
        RGLAEAGQ VEFS+SSDGG+LF RRLCVPSDSAVKT
Subjt:  RGLAEAGQVVEFSISSDGGILFGRRLCVPSDSAVKT

KAA0048687.1 pol protein [Cucumis melo var. makuwa]6.1e-17791.96Show/hide
Query:  MVLETLRDNKLYAKFSKCEFWLKQVSFLGHVISKAGVSVDPAKIEEVTSWPRPFTVSEVHSFLGLAGYYQRFVENFSRIATPLTQLTRKGAPFVWSKACE
        MVL+TLRDNKLYAKFSKCEFWLKQVSFLGHV+SKAGVSVDPAKIE VT W RP TVSEV SFLGLAGYY+RFVENFSRIATPLTQLTRKGAPFVWSKACE
Subjt:  MVLETLRDNKLYAKFSKCEFWLKQVSFLGHVISKAGVSVDPAKIEEVTSWPRPFTVSEVHSFLGLAGYYQRFVENFSRIATPLTQLTRKGAPFVWSKACE

Query:  DSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMEQGKVVAYASRQLKSHEQNYPTHDLELAAVVFAMKIWRHYLYGEKIQIYTDHKSLKYFF
        DSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLM+QGKVVAYASRQLKSHEQNYPTHDLELAAVVFA+KIWRHYLYGEKIQI+TDHKSLKYFF
Subjt:  DSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMEQGKVVAYASRQLKSHEQNYPTHDLELAAVVFAMKIWRHYLYGEKIQIYTDHKSLKYFF

Query:  TQKELNMRQRKWLELVKDYDCEILYHPGKANVVVDSLSRKVSHSTALITRQTPLHRDFERAEIAVSVGAITMQLAQLTVQSILRQKIIDAQSNDPYLVKK
        TQKELNMRQR+WLELVKDYDCEILYHPGKANVV D+LSRKVSHS ALITRQ PLHRD ERAEIAVSVGA+TMQLAQLTVQ  LRQ+IIDAQSNDPYLV+K
Subjt:  TQKELNMRQRKWLELVKDYDCEILYHPGKANVVVDSLSRKVSHSTALITRQTPLHRDFERAEIAVSVGAITMQLAQLTVQSILRQKIIDAQSNDPYLVKK

Query:  RGLAEAGQVVEFSISSDGGILFGRRLCVPSDSAVKT
        RGLAEAGQ VEFS+SSDGG+LF RRLCVPSDS VKT
Subjt:  RGLAEAGQVVEFSISSDGGILFGRRLCVPSDSAVKT

KAA0057672.1 pol protein [Cucumis melo var. makuwa]8.0e-17791.96Show/hide
Query:  MVLETLRDNKLYAKFSKCEFWLKQVSFLGHVISKAGVSVDPAKIEEVTSWPRPFTVSEVHSFLGLAGYYQRFVENFSRIATPLTQLTRKGAPFVWSKACE
        MVL+TLRDNKLYAKFSKCEFWLKQVSFLGHV+SKAGVSVDPAKIE VT W RP TVSEV SFLGLAGYY+RFVENFSR ATPLTQLTRKGAPFVWSKACE
Subjt:  MVLETLRDNKLYAKFSKCEFWLKQVSFLGHVISKAGVSVDPAKIEEVTSWPRPFTVSEVHSFLGLAGYYQRFVENFSRIATPLTQLTRKGAPFVWSKACE

Query:  DSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMEQGKVVAYASRQLKSHEQNYPTHDLELAAVVFAMKIWRHYLYGEKIQIYTDHKSLKYFF
        DSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLM+QGKVVAYASRQLKSHEQNYPTHDLELAAVVFA+KIWRHYLYGEKIQI+TDHKSLKYFF
Subjt:  DSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMEQGKVVAYASRQLKSHEQNYPTHDLELAAVVFAMKIWRHYLYGEKIQIYTDHKSLKYFF

Query:  TQKELNMRQRKWLELVKDYDCEILYHPGKANVVVDSLSRKVSHSTALITRQTPLHRDFERAEIAVSVGAITMQLAQLTVQSILRQKIIDAQSNDPYLVKK
        TQKELNMRQR+WLELVKDYDCEILYHPGKANVV D+LSRKVSHS ALITRQ PLHRD ERAEIAVSVGA+TMQLAQLTVQ  LRQ+IIDAQSNDPYLV+K
Subjt:  TQKELNMRQRKWLELVKDYDCEILYHPGKANVVVDSLSRKVSHSTALITRQTPLHRDFERAEIAVSVGAITMQLAQLTVQSILRQKIIDAQSNDPYLVKK

Query:  RGLAEAGQVVEFSISSDGGILFGRRLCVPSDSAVKT
        RGLAEAGQ VEFS+SSDGG+LF RRLCVPSDSAVKT
Subjt:  RGLAEAGQVVEFSISSDGGILFGRRLCVPSDSAVKT

KAA0062719.1 pol protein [Cucumis melo var. makuwa]2.7e-17791.96Show/hide
Query:  MVLETLRDNKLYAKFSKCEFWLKQVSFLGHVISKAGVSVDPAKIEEVTSWPRPFTVSEVHSFLGLAGYYQRFVENFSRIATPLTQLTRKGAPFVWSKACE
        MVL+TLRDNKLYAKFSKCEFWLKQVSFLGHV+SKAGVSVDPAKIE VTSW RP TVSEV SFLGLAGYY+RFVENFSRIATPLTQLTRKGAPFVWSKACE
Subjt:  MVLETLRDNKLYAKFSKCEFWLKQVSFLGHVISKAGVSVDPAKIEEVTSWPRPFTVSEVHSFLGLAGYYQRFVENFSRIATPLTQLTRKGAPFVWSKACE

Query:  DSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMEQGKVVAYASRQLKSHEQNYPTHDLELAAVVFAMKIWRHYLYGEKIQIYTDHKSLKYFF
        DSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVL++QGKVVAYASRQLKSHEQNYPTHDLELAAVVFA+KIWRHYLYGEKIQI+TDHKSLKYFF
Subjt:  DSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMEQGKVVAYASRQLKSHEQNYPTHDLELAAVVFAMKIWRHYLYGEKIQIYTDHKSLKYFF

Query:  TQKELNMRQRKWLELVKDYDCEILYHPGKANVVVDSLSRKVSHSTALITRQTPLHRDFERAEIAVSVGAITMQLAQLTVQSILRQKIIDAQSNDPYLVKK
        TQKELNMRQR+WLELVKDYDCEILYHPGKANVV D+LSRKVSHS ALITRQ PLHRD ERAEIAVSVGA+TMQLAQLTVQ  LRQ+IIDAQSNDPYLV+K
Subjt:  TQKELNMRQRKWLELVKDYDCEILYHPGKANVVVDSLSRKVSHSTALITRQTPLHRDFERAEIAVSVGAITMQLAQLTVQSILRQKIIDAQSNDPYLVKK

Query:  RGLAEAGQVVEFSISSDGGILFGRRLCVPSDSAVKT
        RGLAEAGQ VEFS+SSDGG+LF RRLCVPSDSA+KT
Subjt:  RGLAEAGQVVEFSISSDGGILFGRRLCVPSDSAVKT

TrEMBL top hitse value%identityAlignment
A0A5A7TQ36 Reverse transcriptase3.9e-17791.96Show/hide
Query:  MVLETLRDNKLYAKFSKCEFWLKQVSFLGHVISKAGVSVDPAKIEEVTSWPRPFTVSEVHSFLGLAGYYQRFVENFSRIATPLTQLTRKGAPFVWSKACE
        MVL+TLRDNKLYAKFSKCEFWLKQVSFLGHV+SKAGVSVDPAKIE VT W RP TVSEV SFLGLAGYY+RFVENFSRIATPLTQLTRKGAPFVWSKACE
Subjt:  MVLETLRDNKLYAKFSKCEFWLKQVSFLGHVISKAGVSVDPAKIEEVTSWPRPFTVSEVHSFLGLAGYYQRFVENFSRIATPLTQLTRKGAPFVWSKACE

Query:  DSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMEQGKVVAYASRQLKSHEQNYPTHDLELAAVVFAMKIWRHYLYGEKIQIYTDHKSLKYFF
        DSFQNLKQKLVTAPVLTVP+GS SFVIYSDASKKGLGCVLM+QGKVVAYASRQLKSHEQNYPTHDLELAAVVFA+KIWRHYLYGEKIQI+TDHKSLKYFF
Subjt:  DSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMEQGKVVAYASRQLKSHEQNYPTHDLELAAVVFAMKIWRHYLYGEKIQIYTDHKSLKYFF

Query:  TQKELNMRQRKWLELVKDYDCEILYHPGKANVVVDSLSRKVSHSTALITRQTPLHRDFERAEIAVSVGAITMQLAQLTVQSILRQKIIDAQSNDPYLVKK
        TQKELNMRQR+WLELVKDYDCEILYHPGKANVV D+LSRKVSHSTALITRQ PLHRD ERAEIAVSVGA+TMQLAQLTVQ  LRQ+IIDAQSNDPYLV+K
Subjt:  TQKELNMRQRKWLELVKDYDCEILYHPGKANVVVDSLSRKVSHSTALITRQTPLHRDFERAEIAVSVGAITMQLAQLTVQSILRQKIIDAQSNDPYLVKK

Query:  RGLAEAGQVVEFSISSDGGILFGRRLCVPSDSAVKT
        RGLAEAGQ VEFS+SSDGG+LF RRLCVPSDSAVKT
Subjt:  RGLAEAGQVVEFSISSDGGILFGRRLCVPSDSAVKT

A0A5A7TW75 Pol protein3.0e-17791.96Show/hide
Query:  MVLETLRDNKLYAKFSKCEFWLKQVSFLGHVISKAGVSVDPAKIEEVTSWPRPFTVSEVHSFLGLAGYYQRFVENFSRIATPLTQLTRKGAPFVWSKACE
        MVL+TLRDNKLYAKFSKCEFWLKQVSFLGHV+SKAGVSVDPAKIE VT W RP TVSEV SFLGLAGYY+RFVENFSRIATPLTQLTRKGAPFVWSKACE
Subjt:  MVLETLRDNKLYAKFSKCEFWLKQVSFLGHVISKAGVSVDPAKIEEVTSWPRPFTVSEVHSFLGLAGYYQRFVENFSRIATPLTQLTRKGAPFVWSKACE

Query:  DSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMEQGKVVAYASRQLKSHEQNYPTHDLELAAVVFAMKIWRHYLYGEKIQIYTDHKSLKYFF
        DSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLM+QGKVVAYASRQLKSHEQNYPTHDLELAAVVFA+KIWRHYLYGEKIQI+TDHKSLKYFF
Subjt:  DSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMEQGKVVAYASRQLKSHEQNYPTHDLELAAVVFAMKIWRHYLYGEKIQIYTDHKSLKYFF

Query:  TQKELNMRQRKWLELVKDYDCEILYHPGKANVVVDSLSRKVSHSTALITRQTPLHRDFERAEIAVSVGAITMQLAQLTVQSILRQKIIDAQSNDPYLVKK
        TQKELNMRQR+WLELVKDYDCEILYHPGKANVV D+LSRKVSHS ALITRQ PLHRD ER EIAVSVGA+TMQLAQLTVQ  LRQ+IIDAQSNDPYLV+K
Subjt:  TQKELNMRQRKWLELVKDYDCEILYHPGKANVVVDSLSRKVSHSTALITRQTPLHRDFERAEIAVSVGAITMQLAQLTVQSILRQKIIDAQSNDPYLVKK

Query:  RGLAEAGQVVEFSISSDGGILFGRRLCVPSDSAVKT
        RGLAEAGQ VEFS+SSDGG+LF RRLCVPSDSAVKT
Subjt:  RGLAEAGQVVEFSISSDGGILFGRRLCVPSDSAVKT

A0A5A7U330 Reverse transcriptase3.0e-17791.96Show/hide
Query:  MVLETLRDNKLYAKFSKCEFWLKQVSFLGHVISKAGVSVDPAKIEEVTSWPRPFTVSEVHSFLGLAGYYQRFVENFSRIATPLTQLTRKGAPFVWSKACE
        MVL+TLRDNKLYAKFSKCEFWLKQVSFLGHV+SKAGVSVDPAKIE VT W RP TVSEV SFLGLAGYY+RFVENFSRIATPLTQLTRKGAPFVWSKACE
Subjt:  MVLETLRDNKLYAKFSKCEFWLKQVSFLGHVISKAGVSVDPAKIEEVTSWPRPFTVSEVHSFLGLAGYYQRFVENFSRIATPLTQLTRKGAPFVWSKACE

Query:  DSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMEQGKVVAYASRQLKSHEQNYPTHDLELAAVVFAMKIWRHYLYGEKIQIYTDHKSLKYFF
        DSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLM+QGKVVAYASRQLKSHEQNYPTHDLELAAVVFA+KIWRHYLYGEKIQI+TDHKSLKYFF
Subjt:  DSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMEQGKVVAYASRQLKSHEQNYPTHDLELAAVVFAMKIWRHYLYGEKIQIYTDHKSLKYFF

Query:  TQKELNMRQRKWLELVKDYDCEILYHPGKANVVVDSLSRKVSHSTALITRQTPLHRDFERAEIAVSVGAITMQLAQLTVQSILRQKIIDAQSNDPYLVKK
        TQKELNMRQR+WLELVKDYDCEILYHPGKANVV D+LSRKVSHS ALITRQ PLHRD ERAEIAVSVGA+TMQLAQLTVQ  LRQ+IIDAQSNDPYLV+K
Subjt:  TQKELNMRQRKWLELVKDYDCEILYHPGKANVVVDSLSRKVSHSTALITRQTPLHRDFERAEIAVSVGAITMQLAQLTVQSILRQKIIDAQSNDPYLVKK

Query:  RGLAEAGQVVEFSISSDGGILFGRRLCVPSDSAVKT
        RGLAEAGQ VEFS+SSDGG+LF RRLCVPSDS VKT
Subjt:  RGLAEAGQVVEFSISSDGGILFGRRLCVPSDSAVKT

A0A5A7UP94 Pol protein3.9e-17791.96Show/hide
Query:  MVLETLRDNKLYAKFSKCEFWLKQVSFLGHVISKAGVSVDPAKIEEVTSWPRPFTVSEVHSFLGLAGYYQRFVENFSRIATPLTQLTRKGAPFVWSKACE
        MVL+TLRDNKLYAKFSKCEFWLKQVSFLGHV+SKAGVSVDPAKIE VT W RP TVSEV SFLGLAGYY+RFVENFSR ATPLTQLTRKGAPFVWSKACE
Subjt:  MVLETLRDNKLYAKFSKCEFWLKQVSFLGHVISKAGVSVDPAKIEEVTSWPRPFTVSEVHSFLGLAGYYQRFVENFSRIATPLTQLTRKGAPFVWSKACE

Query:  DSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMEQGKVVAYASRQLKSHEQNYPTHDLELAAVVFAMKIWRHYLYGEKIQIYTDHKSLKYFF
        DSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLM+QGKVVAYASRQLKSHEQNYPTHDLELAAVVFA+KIWRHYLYGEKIQI+TDHKSLKYFF
Subjt:  DSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMEQGKVVAYASRQLKSHEQNYPTHDLELAAVVFAMKIWRHYLYGEKIQIYTDHKSLKYFF

Query:  TQKELNMRQRKWLELVKDYDCEILYHPGKANVVVDSLSRKVSHSTALITRQTPLHRDFERAEIAVSVGAITMQLAQLTVQSILRQKIIDAQSNDPYLVKK
        TQKELNMRQR+WLELVKDYDCEILYHPGKANVV D+LSRKVSHS ALITRQ PLHRD ERAEIAVSVGA+TMQLAQLTVQ  LRQ+IIDAQSNDPYLV+K
Subjt:  TQKELNMRQRKWLELVKDYDCEILYHPGKANVVVDSLSRKVSHSTALITRQTPLHRDFERAEIAVSVGAITMQLAQLTVQSILRQKIIDAQSNDPYLVKK

Query:  RGLAEAGQVVEFSISSDGGILFGRRLCVPSDSAVKT
        RGLAEAGQ VEFS+SSDGG+LF RRLCVPSDSAVKT
Subjt:  RGLAEAGQVVEFSISSDGGILFGRRLCVPSDSAVKT

A0A5A7VAL8 Pol protein1.3e-17791.96Show/hide
Query:  MVLETLRDNKLYAKFSKCEFWLKQVSFLGHVISKAGVSVDPAKIEEVTSWPRPFTVSEVHSFLGLAGYYQRFVENFSRIATPLTQLTRKGAPFVWSKACE
        MVL+TLRDNKLYAKFSKCEFWLKQVSFLGHV+SKAGVSVDPAKIE VTSW RP TVSEV SFLGLAGYY+RFVENFSRIATPLTQLTRKGAPFVWSKACE
Subjt:  MVLETLRDNKLYAKFSKCEFWLKQVSFLGHVISKAGVSVDPAKIEEVTSWPRPFTVSEVHSFLGLAGYYQRFVENFSRIATPLTQLTRKGAPFVWSKACE

Query:  DSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMEQGKVVAYASRQLKSHEQNYPTHDLELAAVVFAMKIWRHYLYGEKIQIYTDHKSLKYFF
        DSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVL++QGKVVAYASRQLKSHEQNYPTHDLELAAVVFA+KIWRHYLYGEKIQI+TDHKSLKYFF
Subjt:  DSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMEQGKVVAYASRQLKSHEQNYPTHDLELAAVVFAMKIWRHYLYGEKIQIYTDHKSLKYFF

Query:  TQKELNMRQRKWLELVKDYDCEILYHPGKANVVVDSLSRKVSHSTALITRQTPLHRDFERAEIAVSVGAITMQLAQLTVQSILRQKIIDAQSNDPYLVKK
        TQKELNMRQR+WLELVKDYDCEILYHPGKANVV D+LSRKVSHS ALITRQ PLHRD ERAEIAVSVGA+TMQLAQLTVQ  LRQ+IIDAQSNDPYLV+K
Subjt:  TQKELNMRQRKWLELVKDYDCEILYHPGKANVVVDSLSRKVSHSTALITRQTPLHRDFERAEIAVSVGAITMQLAQLTVQSILRQKIIDAQSNDPYLVKK

Query:  RGLAEAGQVVEFSISSDGGILFGRRLCVPSDSAVKT
        RGLAEAGQ VEFS+SSDGG+LF RRLCVPSDSA+KT
Subjt:  RGLAEAGQVVEFSISSDGGILFGRRLCVPSDSAVKT

SwissProt top hitse value%identityAlignment
P04323 Retrovirus-related Pol polyprotein from transposon 17.68.9e-4638.75Show/hide
Query:  MVLETLRDNKLYAKFSKCEFWLKQVSFLGHVISKAGVSVDPAKIEEVTSWPRPFTVSEVHSFLGLAGYYQRFVENFSRIATPLTQLTRKGAPFVWSKACE
        +V E L    L  +  KCEF  ++ +FLGHV++  G+  +P KIE +  +P P    E+ +FLGL GYY++F+ NF+ IA P+T+  +K      +    
Subjt:  MVLETLRDNKLYAKFSKCEFWLKQVSFLGHVISKAGVSVDPAKIEEVTSWPRPFTVSEVHSFLGLAGYYQRFVENFSRIATPLTQLTRKGAPFVWSKACE

Query:  DS-FQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMEQGKVVAYASRQLKSHEQNYPTHDLELAAVVFAMKIWRHYLYGEKIQIYTDHKSLKYF
        DS F+ LK  +   P+L VPD +  F + +DAS   LG VL + G  ++Y SR L  HE NY T + EL A+V+A K +RHYL G   +I +DH+ L + 
Subjt:  DS-FQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMEQGKVVAYASRQLKSHEQNYPTHDLELAAVVFAMKIWRHYLYGEKIQIYTDHKSLKYF

Query:  FTQKELNMRQRKWLELVKDYDCEILYHPGKANVVVDSLSR
        +  K+ N +  +W   + ++D +I Y  GK N V D+LSR
Subjt:  FTQKELNMRQRKWLELVKDYDCEILYHPGKANVVVDSLSR

P10394 Retrovirus-related Pol polyprotein from transposon 4122.9e-3633.76Show/hide
Query:  RDNKLYAKFSKCEFWLKQVSFLGHVISKAGVSVDPAKIEEVTSWPRPFTVSEVHSFLGLAGYYQRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNL
        R+  L     KC F++ +V+FLGH  +  G+  D  K + + ++P P        F+    YY+RF++NF+  +  +T+L +K  PF W+  C+ +F +L
Subjt:  RDNKLYAKFSKCEFWLKQVSFLGHVISKAGVSVDPAKIEEVTSWPRPFTVSEVHSFLGLAGYYQRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNL

Query:  KQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMEQGK----VVAYASRQLKSHEQNYPTHDLELAAVVFAMKIWRHYLYGEKIQIYTDHKSLKYFFTQ
        K +L+   +L  PD S  F I +DASK+  G VL +        VAYASR     E N  T + ELAA+ +A+  +R Y+YG+   + TDH+ L Y F+ 
Subjt:  KQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMEQGK----VVAYASRQLKSHEQNYPTHDLELAAVVFAMKIWRHYLYGEKIQIYTDHKSLKYFFTQ

Query:  KELNMRQRKWLELVKDYDCEILYHPGKANVVVDSLSR
           + +  +    +++Y+  + Y  GK N V D+LSR
Subjt:  KELNMRQRKWLELVKDYDCEILYHPGKANVVVDSLSR

P10401 Retrovirus-related Pol polyprotein from transposon gypsy1.1e-4034.52Show/hide
Query:  VLETLRDNKLYAKFSKCEFWLKQVSFLGHVISKAGVSVDPAKIEEVTSWPRPFTVSEVHSFLGLAGYYQRFVENFSRIATPLTQLTR-----------KG
        VL+ L D  +     K  F+ + V +LG ++SK G   DP K++ +  +P P  V +V SFLGLA YY+ F+++F+ IA P+T + +           K 
Subjt:  VLETLRDNKLYAKFSKCEFWLKQVSFLGHVISKAGVSVDPAKIEEVTSWPRPFTVSEVHSFLGLAGYYQRFVENFSRIATPLTQLTR-----------KG

Query:  APFVWSKACEDSFQNLKQKLVTAPV-LTVPDGSGSFVIYSDASKKGLGCVLMEQGKVVAYASRQLKSHEQNYPTHDLELAAVVFAMKIWRHYLYGEK-IQ
         P  +++   ++FQ L+  L +  V L  PD    F + +DAS  G+G VL ++G+ +   SR LK  EQNY T++ EL A+V+A+   +++LYG + I 
Subjt:  APFVWSKACEDSFQNLKQKLVTAPV-LTVPDGSGSFVIYSDASKKGLGCVLMEQGKVVAYASRQLKSHEQNYPTHDLELAAVVFAMKIWRHYLYGEK-IQ

Query:  IYTDHKSLKYFFTQKELNMRQRKWLELVKDYDCEILYHPGKANVVVDSLSRK
        I+TDH+ L +    +  N + ++W   +  ++ ++ Y PGK N V D+LSR+
Subjt:  IYTDHKSLKYFFTQKELNMRQRKWLELVKDYDCEILYHPGKANVVVDSLSRK

P20825 Retrovirus-related Pol polyprotein from transposon 2973.7e-4436.67Show/hide
Query:  MVLETLRDNKLYAKFSKCEFWLKQVSFLGHVISKAGVSVDPAKIEEVTSWPRPFTVSEVHSFLGLAGYYQRFVENFSRIATPLTQLTRKGAPFVWSK-AC
        +V   L D  L  +  KCEF  K+ +FLGH+++  G+  +P K++ + S+P P    E+ +FLGL GYY++F+ N++ IA P+T   +K       K   
Subjt:  MVLETLRDNKLYAKFSKCEFWLKQVSFLGHVISKAGVSVDPAKIEEVTSWPRPFTVSEVHSFLGLAGYYQRFVENFSRIATPLTQLTRKGAPFVWSK-AC

Query:  EDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMEQGKVVAYASRQLKSHEQNYPTHDLELAAVVFAMKIWRHYLYGEKIQIYTDHKSLKYF
         ++F+ LK  ++  P+L +PD    FV+ +DAS   LG VL + G  +++ SR L  HE NY   + EL A+V+A K +RHYL G +  I +DH+ L++ 
Subjt:  EDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMEQGKVVAYASRQLKSHEQNYPTHDLELAAVVFAMKIWRHYLYGEKIQIYTDHKSLKYF

Query:  FTQKELNMRQRKWLELVKDYDCEILYHPGKANVVVDSLSR
           KE   +  +W   + +Y  +I Y  GK N V D+LSR
Subjt:  FTQKELNMRQRKWLELVKDYDCEILYHPGKANVVVDSLSR

Q8I7P9 Retrovirus-related Pol polyprotein from transposon opus7.8e-4233.33Show/hide
Query:  MVLETLRDNKLYAKFSKCEFWLKQVSFLGHVISKAGVSVDPAKIEEVTSWPRPFTVSEVHSFLGLAGYYQRFVENFSRIATPLTQLTR-----------K
        +VL +L    L     K  F   QV FLG++++  G+  DP K+  ++  P P +V E+  FLG+  YY++F+++++++A PLT LTR            
Subjt:  MVLETLRDNKLYAKFSKCEFWLKQVSFLGHVISKAGVSVDPAKIEEVTSWPRPFTVSEVHSFLGLAGYYQRFVENFSRIATPLTQLTR-----------K

Query:  GAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLME----QGKVVAYASRQLKSHEQNYPTHDLELAAVVFAMKIWRHYLYGE
          P    +    SF +LK  L ++ +L  P  +  F + +DAS   +G VL +    + + +AY SR L   E+NY T + E+ A+++++   R YLYG 
Subjt:  GAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLME----QGKVVAYASRQLKSHEQNYPTHDLELAAVVFAMKIWRHYLYGE

Query:  -KIQIYTDHKSLKYFFTQKELNMRQRKWLELVKDYDCEILYHPGKANVVVDSLSR
          I++YTDH+ L +    +  N + ++W   +++Y+CE++Y PGK+NVV D+LSR
Subjt:  -KIQIYTDHKSLKYFFTQKELNMRQRKWLELVKDYDCEILYHPGKANVVVDSLSR

Arabidopsis top hitse value%identityAlignment
ATMG00860.1 DNA/RNA polymerases superfamily protein2.3e-2542.97Show/hide
Query:  MVLETLRDNKLYAKFSKCEFWLKQVSFLG--HVISKAGVSVDPAKIEEVTSWPRPFTVSEVHSFLGLAGYYQRFVENFSRIATPLTQLTRKGAPFVWSKA
        MVL+    ++ YA   KC F   Q+++LG  H+IS  GVS DPAK+E +  WP P   +E+  FLGL GYY+RFV+N+ +I  PLT+L +K +   W++ 
Subjt:  MVLETLRDNKLYAKFSKCEFWLKQVSFLG--HVISKAGVSVDPAKIEEVTSWPRPFTVSEVHSFLGLAGYYQRFVENFSRIATPLTQLTRKGAPFVWSKA

Query:  CEDSFQNLKQKLVTAPVLTVPDGSGSFV
           +F+ LK  + T PVL +PD    FV
Subjt:  CEDSFQNLKQKLVTAPVLTVPDGSGSFV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTTCTAGAAACCCTTCGAGATAATAAATTGTATGCAAAGTTCTCGAAATGTGAGTTTTGGTTGAAGCAGGTATCCTTTCTAGGCCATGTGATTTCTAAGGCTGGAGT
TTCTGTGGATCCAGCTAAGATAGAGGAAGTCACCAGTTGGCCCCGACCTTTTACAGTCAGTGAGGTTCATAGCTTTCTGGGTTTAGCAGGTTATTATCAACGGTTTGTGG
AGAACTTTTCCCGTATAGCTACTCCTCTTACTCAGTTGACCAGGAAGGGAGCTCCTTTTGTTTGGAGCAAGGCATGTGAGGACAGTTTCCAGAACCTTAAACAGAAGCTA
GTTACTGCACCGGTTCTTACGGTACCTGATGGTTCAGGGAGTTTTGTGATTTACAGTGATGCTTCTAAGAAAGGTTTGGGTTGTGTGTTGATGGAGCAAGGTAAGGTAGT
CGCTTATGCTTCTCGTCAGTTGAAGAGTCATGAGCAGAATTACCCTACACATGATTTAGAGTTGGCAGCAGTAGTTTTTGCAATGAAGATATGGAGGCACTACTTGTATG
GTGAAAAGATACAGATCTACACGGATCATAAGAGCTTGAAATATTTCTTTACTCAGAAGGAATTGAATATGAGACAGCGAAAATGGCTTGAATTAGTGAAGGATTACGAT
TGTGAGATATTATATCATCCAGGTAAGGCGAATGTGGTAGTTGATTCTCTTAGTAGAAAGGTATCACATTCAACAGCACTTATTACCCGACAGACCCCATTGCATCGAGA
TTTTGAGAGGGCTGAGATTGCAGTGTCAGTAGGGGCAATCACTATGCAGTTAGCCCAGTTGACGGTGCAGTCGATTTTGAGGCAGAAGATCATTGATGCTCAGAGTAACG
ATCCTTATTTGGTTAAGAAGCGTGGCCTAGCAGAGGCAGGGCAAGTTGTTGAGTTTTCCATATCCTCTGATGGTGGAATTTTGTTTGGAAGGCGCCTCTGTGTGCCATCA
GATAGTGCGGTTAAAACATAA
mRNA sequenceShow/hide mRNA sequence
ATGGTTCTAGAAACCCTTCGAGATAATAAATTGTATGCAAAGTTCTCGAAATGTGAGTTTTGGTTGAAGCAGGTATCCTTTCTAGGCCATGTGATTTCTAAGGCTGGAGT
TTCTGTGGATCCAGCTAAGATAGAGGAAGTCACCAGTTGGCCCCGACCTTTTACAGTCAGTGAGGTTCATAGCTTTCTGGGTTTAGCAGGTTATTATCAACGGTTTGTGG
AGAACTTTTCCCGTATAGCTACTCCTCTTACTCAGTTGACCAGGAAGGGAGCTCCTTTTGTTTGGAGCAAGGCATGTGAGGACAGTTTCCAGAACCTTAAACAGAAGCTA
GTTACTGCACCGGTTCTTACGGTACCTGATGGTTCAGGGAGTTTTGTGATTTACAGTGATGCTTCTAAGAAAGGTTTGGGTTGTGTGTTGATGGAGCAAGGTAAGGTAGT
CGCTTATGCTTCTCGTCAGTTGAAGAGTCATGAGCAGAATTACCCTACACATGATTTAGAGTTGGCAGCAGTAGTTTTTGCAATGAAGATATGGAGGCACTACTTGTATG
GTGAAAAGATACAGATCTACACGGATCATAAGAGCTTGAAATATTTCTTTACTCAGAAGGAATTGAATATGAGACAGCGAAAATGGCTTGAATTAGTGAAGGATTACGAT
TGTGAGATATTATATCATCCAGGTAAGGCGAATGTGGTAGTTGATTCTCTTAGTAGAAAGGTATCACATTCAACAGCACTTATTACCCGACAGACCCCATTGCATCGAGA
TTTTGAGAGGGCTGAGATTGCAGTGTCAGTAGGGGCAATCACTATGCAGTTAGCCCAGTTGACGGTGCAGTCGATTTTGAGGCAGAAGATCATTGATGCTCAGAGTAACG
ATCCTTATTTGGTTAAGAAGCGTGGCCTAGCAGAGGCAGGGCAAGTTGTTGAGTTTTCCATATCCTCTGATGGTGGAATTTTGTTTGGAAGGCGCCTCTGTGTGCCATCA
GATAGTGCGGTTAAAACATAA
Protein sequenceShow/hide protein sequence
MVLETLRDNKLYAKFSKCEFWLKQVSFLGHVISKAGVSVDPAKIEEVTSWPRPFTVSEVHSFLGLAGYYQRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKL
VTAPVLTVPDGSGSFVIYSDASKKGLGCVLMEQGKVVAYASRQLKSHEQNYPTHDLELAAVVFAMKIWRHYLYGEKIQIYTDHKSLKYFFTQKELNMRQRKWLELVKDYD
CEILYHPGKANVVVDSLSRKVSHSTALITRQTPLHRDFERAEIAVSVGAITMQLAQLTVQSILRQKIIDAQSNDPYLVKKRGLAEAGQVVEFSISSDGGILFGRRLCVPS
DSAVKT