CuGenDBv2

Cmc05g0129681 (gene) of Melon (Charmono) v1.1 genome

Gene ID: Cmc05g0129681
Organism: Cucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
Description: Reverse transcriptase
Genome location: CMiso1.1chr05:7813673..7814996
RNA-Seq Expression: Cmc05g0129681
Synteny: Cmc05g0129681
Gene Ontology terms:
GO:0006278 - RNA-dependent DNA biosynthetic process (biological process)
GO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0016020 - membrane (cellular component)
GO:0043227 - membrane-bounded organelle (cellular component)
GO:0003676 - nucleic acid binding (molecular function)
GO:0003964 - RNA-directed DNA polymerase activity (molecular function)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domains:
IPR000477 - Reverse transcriptase domain
IPR041373 - Reverse transcriptase, RNase H-like domain
IPR041588 - Integrase zinc-binding domain
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily
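
The genome location in this record follows a `name:start..end` layout, so it can be split into a sequence name and a coordinate pair. The following is a minimal Python sketch under two assumptions that the record itself does not state: that the layout is always `name:start..end`, and that the coordinates are 1-based and inclusive. `parse_location` is a hypothetical helper name, not part of any database tool.

```python
def parse_location(location):
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    # rpartition keeps any earlier ':' characters inside the sequence name.
    seqid, _, span = location.rpartition(":")
    start_text, _, end_text = span.partition("..")
    return seqid, int(start_text), int(end_text)


seqid, start, end = parse_location("CMiso1.1chr05:7813673..7814996")
# Assumption: 1-based inclusive coordinates, so the span is end - start + 1.
span_length = end - start + 1
print(seqid, start, end, span_length)
```

Under the inclusive-coordinate assumption, the span covers 1324 positions on CMiso1.1chr05.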


Homology
GenBank top hits (e value, % identity, alignment)
KAA0026055.1 DNA/RNA polymerases superfamily protein [Cucumis melo var. makuwa] | e value: 3.8e-163 | % identity: 80
Query:  MSFGLTNAPAVFMDLMNIVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLQDNKLYAKFSKCEFWLKQVSFLGHMVSKAGVFMDPAKIEAVTSW
        MSFGLTNAP VFMDLMN VFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLQ+NKLY KFSKCEFWLKQVSFLGH+VSKAGV +DPAKIEAVTSW
Subjt:  MSFGLTNAPAVFMDLMNIVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLQDNKLYAKFSKCEFWLKQVSFLGHMVSKAGVFMDPAKIEAVTSW

Query:  PRPFTVSEVRSFLGLAGYYRRFVENFSRIATPLTH--------------------------------------------DASKKGLGCVLMQQGKVVAYA
         RP TVSEVRSFLGLAGYYRRFVENFSRIA PLT                                             +ASKKGLGCVLMQQGKVVAYA
Subjt:  PRPFTVSEVRSFLGLAGYYRRFVENFSRIATPLTH--------------------------------------------DASKKGLGCVLMQQGKVVAYA

Query:  SRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGENIQIFMDHKSLKYFFTQKELNMRQRKWLELVKDYDCEILYHPSKANVVADALSRK---AEAGQAV
        SRQLKSHEQNYPTHDLEL AVVF LKIWRHYLYGE IQIF DHKSLKYFFTQKELNMRQR+WLELVKDYDCEILYHP KANVVADALSRK   AEAGQ V
Subjt:  SRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGENIQIFMDHKSLKYFFTQKELNMRQRKWLELVKDYDCEILYHPSKANVVADALSRK---AEAGQAV

Query:  EFSVSSDSGLLFERRLCVPSDSAVKTELLSKAHSSPFSMHPGSTKMYQHPKRVYWWRNMKREVEKFVSKCLVCQQVKAPR
        EFS+SSD GLLFERRLCVPSDSA+KT LLS+AH SPFSMHPGSTKMYQ  KRVYWWRNMKREV +FVSKCLVCQQVKAPR
Subjt:  EFSVSSDSGLLFERRLCVPSDSAVKTELLSKAHSSPFSMHPGSTKMYQHPKRVYWWRNMKREVEKFVSKCLVCQQVKAPR

KAA0051051.1 reverse transcriptase [Cucumis melo var. makuwa] | e value: 3.2e-162 | % identity: 75.68
Query:  MSFGLTNAPAVFMDLMNIVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLQDNKLYAKFSKCEFWLKQVSFLGHMVSKAGVFMDPAKIEAVTSW
        MSFGLTNAPAVFMDLMN VFREFLDTFVIVFIDDILIYSKTEAEHEEHLR+VLQTL+DNKLYAKFSKCEFWLKQVSFLGH+VSKAGV +DPAKIEAVT W
Subjt:  MSFGLTNAPAVFMDLMNIVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLQDNKLYAKFSKCEFWLKQVSFLGHMVSKAGVFMDPAKIEAVTSW

Query:  PRPFTVSEVRSFLGLAGYYRRFVENFSRIATPLTH--------------------------------------------DASKKGLGCVLMQQGKVVAYA
         RP TVSEVRSFLGLAGYYRRFVENFSRIATPLT                                             DASKKGLGCVLMQQGKVVAYA
Subjt:  PRPFTVSEVRSFLGLAGYYRRFVENFSRIATPLTH--------------------------------------------DASKKGLGCVLMQQGKVVAYA

Query:  SRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGENIQIFMDHKSLKYFFTQKELNMRQRKWLELVKDYDCEILYHPSKANVVADALSRK----------
        SRQLKSHEQ YPTHDLELAAVVFALKIWRHYLYGE IQIF DHKSLKYFFT KELN+RQR+WLELVKDYDCEILYHP KANVVADALSRK          
Subjt:  SRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGENIQIFMDHKSLKYFFTQKELNMRQRKWLELVKDYDCEILYHPSKANVVADALSRK----------

Query:  --------------------AEAGQAVEFSVSSDSGLLFERRLCVPSDSAVKTELLSKAHSSPFSMHPGSTKMYQHPKRVYWWRNMKREVEKFVSKCLVC
                            AEAGQAVEFS+SSD GLLFERRLCVPSDSAVKTELLS+AHSSPFSMHPGSTKMYQ  KRVYWWRNMKREV +FVS+CLVC
Subjt:  --------------------AEAGQAVEFSVSSDSGLLFERRLCVPSDSAVKTELLSKAHSSPFSMHPGSTKMYQHPKRVYWWRNMKREVEKFVSKCLVC

Query:  QQVKAPR
        QQVKAPR
Subjt:  QQVKAPR

KAA0051724.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa] | e value: 3.8e-163 | % identity: 76.17
Query:  MSFGLTNAPAVFMDLMNIVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLQDNKLYAKFSKCEFWLKQVSFLGHMVSKAGVFMDPAKIEAVTSW
        MSFGLTNAPAVFMDLMN VFREFLDTFVIVFIDDILIYSKTEAEHEEHLR+VLQTL+DNKLYAKFSKCEFWLKQVSFLGH+VSK GV +DPAKIEAVT W
Subjt:  MSFGLTNAPAVFMDLMNIVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLQDNKLYAKFSKCEFWLKQVSFLGHMVSKAGVFMDPAKIEAVTSW

Query:  PRPFTVSEVRSFLGLAGYYRRFVENFSRIATPLTH--------------------------------------------DASKKGLGCVLMQQGKVVAYA
         RP TVSEVRSFLGLAGYYRRFVENFSRIATPLT                                             DASKKGLGCVLMQQGKVVAYA
Subjt:  PRPFTVSEVRSFLGLAGYYRRFVENFSRIATPLTH--------------------------------------------DASKKGLGCVLMQQGKVVAYA

Query:  SRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGENIQIFMDHKSLKYFFTQKELNMRQRKWLELVKDYDCEILYHPSKANVVADALSRK----------
        SRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGE IQIF DHKSLKYFFTQKELNMRQR+WLELVKD DCEILYHPSKANVVADALSRK          
Subjt:  SRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGENIQIFMDHKSLKYFFTQKELNMRQRKWLELVKDYDCEILYHPSKANVVADALSRK----------

Query:  --------------------AEAGQAVEFSVSSDSGLLFERRLCVPSDSAVKTELLSKAHSSPFSMHPGSTKMYQHPKRVYWWRNMKREVEKFVSKCLVC
                            AEAGQAVEFS+SSD GLLFERRLCVPSDSAVKTELLS+AHSSPFSMHPGSTKMYQ  KRVYWWRNMKREV +FVS+CLVC
Subjt:  --------------------AEAGQAVEFSVSSDSGLLFERRLCVPSDSAVKTELLSKAHSSPFSMHPGSTKMYQHPKRVYWWRNMKREVEKFVSKCLVC

Query:  QQVKAPR
        QQVKAPR
Subjt:  QQVKAPR

KAA0053234.1 DNA/RNA polymerases superfamily protein [Cucumis melo var. makuwa] | e value: 4.2e-162 | % identity: 75.92
Query:  MSFGLTNAPAVFMDLMNIVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLQDNKLYAKFSKCEFWLKQVSFLGHMVSKAGVFMDPAKIEAVTSW
        MSFGLTNAPAVFMDLMN VFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTL+DNKLYAKFSKCEFWLKQVSFLGH+VSKAGV +DPAKIEAVT W
Subjt:  MSFGLTNAPAVFMDLMNIVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLQDNKLYAKFSKCEFWLKQVSFLGHMVSKAGVFMDPAKIEAVTSW

Query:  PRPFTVSEVRSFLGLAGYYRRFVENFSRIATPLTH--------------------------------------------DASKKGLGCVLMQQGKVVAYA
         RP TVSEVRSFLGLAGYYRRFVENFSRIATPLT                                             DA KKGLGCVLMQQGKVVAYA
Subjt:  PRPFTVSEVRSFLGLAGYYRRFVENFSRIATPLTH--------------------------------------------DASKKGLGCVLMQQGKVVAYA

Query:  SRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGENIQIFMDHKSLKYFFTQKELNMRQRKWLELVKDYDCEILYHPSKANVVADALSRK----------
        SRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGE IQIF DHKSLKYFFTQKELNMRQR+WLELVKDYDCEILYHP KANVVADALSRK          
Subjt:  SRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGENIQIFMDHKSLKYFFTQKELNMRQRKWLELVKDYDCEILYHPSKANVVADALSRK----------

Query:  --------------------AEAGQAVEFSVSSDSGLLFERRLCVPSDSAVKTELLSKAHSSPFSMHPGSTKMYQHPKRVYWWRNMKREVEKFVSKCLVC
                            AE  QAVEFS+SSD GLLFERRLCVPSD AVKTELLS+AHSSPFSMHPGSTKMYQ  KRVYWWRNMKREV +FVSKCLVC
Subjt:  --------------------AEAGQAVEFSVSSDSGLLFERRLCVPSDSAVKTELLSKAHSSPFSMHPGSTKMYQHPKRVYWWRNMKREVEKFVSKCLVC

Query:  QQVKAPR
        QQVKAPR
Subjt:  QQVKAPR

TYK01903.1 putative gag-pol polyprotein [Cucumis melo var. makuwa] | e value: 1.9e-162 | % identity: 74.94
Query:  MSFGLTNAPAVFMDLMNIVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLQDNKLYAKFSKCEFWLKQVSFLGHMVSKAGVFMDPAKIEAVTSW
        MSFGLTNAP VFMDLMN VFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTL+DNKLYAKFSKCEFWLKQVSFLGH+VSKAGV++DPAKIEA+TSW
Subjt:  MSFGLTNAPAVFMDLMNIVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLQDNKLYAKFSKCEFWLKQVSFLGHMVSKAGVFMDPAKIEAVTSW

Query:  PRPFTVSEVRSFLGLAGYYRRFVENFSRIATPLTH--------------------------------------------DASKKGLGCVLMQQGKVVAYA
        PRPFTVS+V SFLGLAGYYR+FVENFSRIATPLT                                             DASKKGLGCVLMQQGKVVAYA
Subjt:  PRPFTVSEVRSFLGLAGYYRRFVENFSRIATPLTH--------------------------------------------DASKKGLGCVLMQQGKVVAYA

Query:  SRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGENIQIFMDHKSLKYFFTQKELNMRQRKWLELVKDYDCEILYHPSKANVVADALSRK----------
        S QLKSHEQNYPTHDLELAAVVFALKIWRHYLYGE IQIF DHKSLKYFFTQKELNMRQR+WLELVKDYDCEILYHP KAN+VADALSRK          
Subjt:  SRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGENIQIFMDHKSLKYFFTQKELNMRQRKWLELVKDYDCEILYHPSKANVVADALSRK----------

Query:  --------------------AEAGQAVEFSVSSDSGLLFERRLCVPSDSAVKTELLSKAHSSPFSMHPGSTKMYQHPKRVYWWRNMKREVEKFVSKCLVC
                            AEA QAVEFS+SSD GLLFERRLCVPSDSA+KTELLS+AHSSPF MHPGSTKMYQ  KRVYWWRNMKREV + VSKCLVC
Subjt:  --------------------AEAGQAVEFSVSSDSGLLFERRLCVPSDSAVKTELLSKAHSSPFSMHPGSTKMYQHPKRVYWWRNMKREVEKFVSKCLVC

Query:  QQVKAPR
        QQVKAPR
Subjt:  QQVKAPR

TrEMBL top hits (e value, % identity, alignment)
A0A5A7SNM4 Reverse transcriptase | e value: 1.8e-163 | % identity: 80
Query:  MSFGLTNAPAVFMDLMNIVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLQDNKLYAKFSKCEFWLKQVSFLGHMVSKAGVFMDPAKIEAVTSW
        MSFGLTNAP VFMDLMN VFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLQ+NKLY KFSKCEFWLKQVSFLGH+VSKAGV +DPAKIEAVTSW
Subjt:  MSFGLTNAPAVFMDLMNIVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLQDNKLYAKFSKCEFWLKQVSFLGHMVSKAGVFMDPAKIEAVTSW

Query:  PRPFTVSEVRSFLGLAGYYRRFVENFSRIATPLTH--------------------------------------------DASKKGLGCVLMQQGKVVAYA
         RP TVSEVRSFLGLAGYYRRFVENFSRIA PLT                                             +ASKKGLGCVLMQQGKVVAYA
Subjt:  PRPFTVSEVRSFLGLAGYYRRFVENFSRIATPLTH--------------------------------------------DASKKGLGCVLMQQGKVVAYA

Query:  SRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGENIQIFMDHKSLKYFFTQKELNMRQRKWLELVKDYDCEILYHPSKANVVADALSRK---AEAGQAV
        SRQLKSHEQNYPTHDLEL AVVF LKIWRHYLYGE IQIF DHKSLKYFFTQKELNMRQR+WLELVKDYDCEILYHP KANVVADALSRK   AEAGQ V
Subjt:  SRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGENIQIFMDHKSLKYFFTQKELNMRQRKWLELVKDYDCEILYHPSKANVVADALSRK---AEAGQAV

Query:  EFSVSSDSGLLFERRLCVPSDSAVKTELLSKAHSSPFSMHPGSTKMYQHPKRVYWWRNMKREVEKFVSKCLVCQQVKAPR
        EFS+SSD GLLFERRLCVPSDSA+KT LLS+AH SPFSMHPGSTKMYQ  KRVYWWRNMKREV +FVSKCLVCQQVKAPR
Subjt:  EFSVSSDSGLLFERRLCVPSDSAVKTELLSKAHSSPFSMHPGSTKMYQHPKRVYWWRNMKREVEKFVSKCLVCQQVKAPR

A0A5A7U8R8 Reverse transcriptase | e value: 1.8e-163 | % identity: 76.17
Query:  MSFGLTNAPAVFMDLMNIVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLQDNKLYAKFSKCEFWLKQVSFLGHMVSKAGVFMDPAKIEAVTSW
        MSFGLTNAPAVFMDLMN VFREFLDTFVIVFIDDILIYSKTEAEHEEHLR+VLQTL+DNKLYAKFSKCEFWLKQVSFLGH+VSK GV +DPAKIEAVT W
Subjt:  MSFGLTNAPAVFMDLMNIVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLQDNKLYAKFSKCEFWLKQVSFLGHMVSKAGVFMDPAKIEAVTSW

Query:  PRPFTVSEVRSFLGLAGYYRRFVENFSRIATPLTH--------------------------------------------DASKKGLGCVLMQQGKVVAYA
         RP TVSEVRSFLGLAGYYRRFVENFSRIATPLT                                             DASKKGLGCVLMQQGKVVAYA
Subjt:  PRPFTVSEVRSFLGLAGYYRRFVENFSRIATPLTH--------------------------------------------DASKKGLGCVLMQQGKVVAYA

Query:  SRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGENIQIFMDHKSLKYFFTQKELNMRQRKWLELVKDYDCEILYHPSKANVVADALSRK----------
        SRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGE IQIF DHKSLKYFFTQKELNMRQR+WLELVKD DCEILYHPSKANVVADALSRK          
Subjt:  SRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGENIQIFMDHKSLKYFFTQKELNMRQRKWLELVKDYDCEILYHPSKANVVADALSRK----------

Query:  --------------------AEAGQAVEFSVSSDSGLLFERRLCVPSDSAVKTELLSKAHSSPFSMHPGSTKMYQHPKRVYWWRNMKREVEKFVSKCLVC
                            AEAGQAVEFS+SSD GLLFERRLCVPSDSAVKTELLS+AHSSPFSMHPGSTKMYQ  KRVYWWRNMKREV +FVS+CLVC
Subjt:  --------------------AEAGQAVEFSVSSDSGLLFERRLCVPSDSAVKTELLSKAHSSPFSMHPGSTKMYQHPKRVYWWRNMKREVEKFVSKCLVC

Query:  QQVKAPR
        QQVKAPR
Subjt:  QQVKAPR

A0A5A7UC07 Reverse transcriptase | e value: 1.6e-162 | % identity: 75.68
Query:  MSFGLTNAPAVFMDLMNIVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLQDNKLYAKFSKCEFWLKQVSFLGHMVSKAGVFMDPAKIEAVTSW
        MSFGLTNAPAVFMDLMN VFREFLDTFVIVFIDDILIYSKTEAEHEEHLR+VLQTL+DNKLYAKFSKCEFWLKQVSFLGH+VSKAGV +DPAKIEAVT W
Subjt:  MSFGLTNAPAVFMDLMNIVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLQDNKLYAKFSKCEFWLKQVSFLGHMVSKAGVFMDPAKIEAVTSW

Query:  PRPFTVSEVRSFLGLAGYYRRFVENFSRIATPLTH--------------------------------------------DASKKGLGCVLMQQGKVVAYA
         RP TVSEVRSFLGLAGYYRRFVENFSRIATPLT                                             DASKKGLGCVLMQQGKVVAYA
Subjt:  PRPFTVSEVRSFLGLAGYYRRFVENFSRIATPLTH--------------------------------------------DASKKGLGCVLMQQGKVVAYA

Query:  SRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGENIQIFMDHKSLKYFFTQKELNMRQRKWLELVKDYDCEILYHPSKANVVADALSRK----------
        SRQLKSHEQ YPTHDLELAAVVFALKIWRHYLYGE IQIF DHKSLKYFFT KELN+RQR+WLELVKDYDCEILYHP KANVVADALSRK          
Subjt:  SRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGENIQIFMDHKSLKYFFTQKELNMRQRKWLELVKDYDCEILYHPSKANVVADALSRK----------

Query:  --------------------AEAGQAVEFSVSSDSGLLFERRLCVPSDSAVKTELLSKAHSSPFSMHPGSTKMYQHPKRVYWWRNMKREVEKFVSKCLVC
                            AEAGQAVEFS+SSD GLLFERRLCVPSDSAVKTELLS+AHSSPFSMHPGSTKMYQ  KRVYWWRNMKREV +FVS+CLVC
Subjt:  --------------------AEAGQAVEFSVSSDSGLLFERRLCVPSDSAVKTELLSKAHSSPFSMHPGSTKMYQHPKRVYWWRNMKREVEKFVSKCLVC

Query:  QQVKAPR
        QQVKAPR
Subjt:  QQVKAPR

A0A5A7UDB1 Reverse transcriptase | e value: 2.0e-162 | % identity: 75.92
Query:  MSFGLTNAPAVFMDLMNIVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLQDNKLYAKFSKCEFWLKQVSFLGHMVSKAGVFMDPAKIEAVTSW
        MSFGLTNAPAVFMDLMN VFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTL+DNKLYAKFSKCEFWLKQVSFLGH+VSKAGV +DPAKIEAVT W
Subjt:  MSFGLTNAPAVFMDLMNIVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLQDNKLYAKFSKCEFWLKQVSFLGHMVSKAGVFMDPAKIEAVTSW

Query:  PRPFTVSEVRSFLGLAGYYRRFVENFSRIATPLTH--------------------------------------------DASKKGLGCVLMQQGKVVAYA
         RP TVSEVRSFLGLAGYYRRFVENFSRIATPLT                                             DA KKGLGCVLMQQGKVVAYA
Subjt:  PRPFTVSEVRSFLGLAGYYRRFVENFSRIATPLTH--------------------------------------------DASKKGLGCVLMQQGKVVAYA

Query:  SRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGENIQIFMDHKSLKYFFTQKELNMRQRKWLELVKDYDCEILYHPSKANVVADALSRK----------
        SRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGE IQIF DHKSLKYFFTQKELNMRQR+WLELVKDYDCEILYHP KANVVADALSRK          
Subjt:  SRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGENIQIFMDHKSLKYFFTQKELNMRQRKWLELVKDYDCEILYHPSKANVVADALSRK----------

Query:  --------------------AEAGQAVEFSVSSDSGLLFERRLCVPSDSAVKTELLSKAHSSPFSMHPGSTKMYQHPKRVYWWRNMKREVEKFVSKCLVC
                            AE  QAVEFS+SSD GLLFERRLCVPSD AVKTELLS+AHSSPFSMHPGSTKMYQ  KRVYWWRNMKREV +FVSKCLVC
Subjt:  --------------------AEAGQAVEFSVSSDSGLLFERRLCVPSDSAVKTELLSKAHSSPFSMHPGSTKMYQHPKRVYWWRNMKREVEKFVSKCLVC

Query:  QQVKAPR
        QQVKAPR
Subjt:  QQVKAPR

A0A5D3BQD5 Reverse transcriptase | e value: 9.1e-163 | % identity: 74.94
Query:  MSFGLTNAPAVFMDLMNIVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLQDNKLYAKFSKCEFWLKQVSFLGHMVSKAGVFMDPAKIEAVTSW
        MSFGLTNAP VFMDLMN VFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTL+DNKLYAKFSKCEFWLKQVSFLGH+VSKAGV++DPAKIEA+TSW
Subjt:  MSFGLTNAPAVFMDLMNIVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLQDNKLYAKFSKCEFWLKQVSFLGHMVSKAGVFMDPAKIEAVTSW

Query:  PRPFTVSEVRSFLGLAGYYRRFVENFSRIATPLTH--------------------------------------------DASKKGLGCVLMQQGKVVAYA
        PRPFTVS+V SFLGLAGYYR+FVENFSRIATPLT                                             DASKKGLGCVLMQQGKVVAYA
Subjt:  PRPFTVSEVRSFLGLAGYYRRFVENFSRIATPLTH--------------------------------------------DASKKGLGCVLMQQGKVVAYA

Query:  SRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGENIQIFMDHKSLKYFFTQKELNMRQRKWLELVKDYDCEILYHPSKANVVADALSRK----------
        S QLKSHEQNYPTHDLELAAVVFALKIWRHYLYGE IQIF DHKSLKYFFTQKELNMRQR+WLELVKDYDCEILYHP KAN+VADALSRK          
Subjt:  SRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGENIQIFMDHKSLKYFFTQKELNMRQRKWLELVKDYDCEILYHPSKANVVADALSRK----------

Query:  --------------------AEAGQAVEFSVSSDSGLLFERRLCVPSDSAVKTELLSKAHSSPFSMHPGSTKMYQHPKRVYWWRNMKREVEKFVSKCLVC
                            AEA QAVEFS+SSD GLLFERRLCVPSDSA+KTELLS+AHSSPF MHPGSTKMYQ  KRVYWWRNMKREV + VSKCLVC
Subjt:  --------------------AEAGQAVEFSVSSDSGLLFERRLCVPSDSAVKTELLSKAHSSPFSMHPGSTKMYQHPKRVYWWRNMKREVEKFVSKCLVC

Query:  QQVKAPR
        QQVKAPR
Subjt:  QQVKAPR

SwissProt top hits (e value, % identity, alignment)
P04323 Retrovirus-related Pol polyprotein from transposon 17.6 | e value: 2.3e-46 | % identity: 36.21
Query:  MSFGLTNAPAVFMDLMNIVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLQDNKLYAKFSKCEFWLKQVSFLGHMVSKAGVFMDPAKIEAVTSW
        M FGL NAPA F   MN + R  L+   +V++DDI+++S +  EH + L +V + L    L  +  KCEF  ++ +FLGH+++  G+  +P KIEA+  +
Subjt:  MSFGLTNAPAVFMDLMNIVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLQDNKLYAKFSKCEFWLKQVSFLGHMVSKAGVFMDPAKIEAVTSW

Query:  PRPFTVSEVRSFLGLAGYYRRFVENFSRIATP---------------------------------------------LTHDASKKGLGCVLMQQGKVVAY
        P P    E+++FLGL GYYR+F+ NF+ IA P                                             LT DAS   LG VL Q G  ++Y
Subjt:  PRPFTVSEVRSFLGLAGYYRRFVENFSRIATP---------------------------------------------LTHDASKKGLGCVLMQQGKVVAY

Query:  ASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGENIQIFMDHKSLKYFFTQKELNMRQRKWLELVKDYDCEILYHPSKANVVADALSR
         SR L  HE NY T + EL A+V+A K +RHYL G + +I  DH+ L + +  K+ N +  +W   + ++D +I Y   K N VADALSR
Subjt:  ASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGENIQIFMDHKSLKYFFTQKELNMRQRKWLELVKDYDCEILYHPSKANVVADALSR

P0CT41 Transposon Tf2-12 polyprotein | e value: 4.4e-37 | % identity: 26.17
Query:  MSFGLTNAPAVFMDLMNIVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLQDNKLYAKFSKCEFWLKQVSFLGHMVSKAGVFMDPAKIEAVTSW
        M +G++ APA F   +N +  E  ++ V+ ++DDILI+SK+E+EH +H++ VLQ L++  L    +KCEF   QV F+G+ +S+ G       I+ V  W
Subjt:  MSFGLTNAPAVFMDLMNIVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLQDNKLYAKFSKCEFWLKQVSFLGHMVSKAGVFMDPAKIEAVTSW

Query:  PRPFTVSEVRSFLGLAGYYRRFVENFSRIATPLTH--------------------------------------------DASKKGLGCVLMQQGK-----
         +P    E+R FLG   Y R+F+   S++  PL +                                            DAS   +G VL Q+       
Subjt:  PRPFTVSEVRSFLGLAGYYRRFVENFSRIATPLTH--------------------------------------------DASKKGLGCVLMQQGK-----

Query:  VVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYG--ENIQIFMDHKSLKYFFTQKE--LNMRQRKWLELVKDYDCEILYHPSKANVVADALSRKA
         V Y S ++   + NY   D E+ A++ +LK WRHYL    E  +I  DH++L    T +    N R  +W   ++D++ EI Y P  AN +ADALSR  
Subjt:  VVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYG--ENIQIFMDHKSLKYFFTQKE--LNMRQRKWLELVKDYDCEILYHPSKANVVADALSRKA

Query:  EAGQAV-------------EFSVSSD---------------------------------SGLLFERR--LCVPSDSAVKTELLSKAHSSPFSMHPGSTKM
        +  + +             + S++ D                                  GLL   +  + +P+D+ +   ++ K H     +HPG   +
Subjt:  EAGQAV-------------EFSVSSD---------------------------------SGLLFERR--LCVPSDSAVKTELLSKAHSSPFSMHPGSTKM

Query:  YQHPKRVYWWRNMKREVEKFVSKCLVCQ
             R + W+ ++++++++V  C  CQ
Subjt:  YQHPKRVYWWRNMKREVEKFVSKCLVCQ

P10401 Retrovirus-related Pol polyprotein from transposon gypsy | e value: 1.2e-42 | % identity: 33.33
Query:  MSFGLTNAPAVFMDLMNIVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLQDNKLYAKFSKCEFWLKQVSFLGHMVSKAGVFMDPAKIEAVTSW
        + FGL NA ++F   ++ V RE +     V++DD++I+S+ E++H  H+  VL+ L D  +     K  F+ + V +LG +VSK G   DP K++A+  +
Subjt:  MSFGLTNAPAVFMDLMNIVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLQDNKLYAKFSKCEFWLKQVSFLGHMVSKAGVFMDPAKIEAVTSW

Query:  PRPFTVSEVRSFLGLAGYYRRFVENFSRIATP--------------------------------------------------------LTHDASKKGLGC
        P P  V +VRSFLGLA YYR F+++F+ IA P                                                        LT DAS  G+G 
Subjt:  PRPFTVSEVRSFLGLAGYYRRFVENFSRIATP--------------------------------------------------------LTHDASKKGLGC

Query:  VLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYG-ENIQIFMDHKSLKYFFTQKELNMRQRKWLELVKDYDCEILYHPSKANVVADAL
        VL Q+G+ +   SR LK  EQNY T++ EL A+V+AL   +++LYG   I IF DH+ L +    +  N + ++W   +  ++ ++ Y P K N VADAL
Subjt:  VLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYG-ENIQIFMDHKSLKYFFTQKELNMRQRKWLELVKDYDCEILYHPSKANVVADAL

Query:  SRK
        SR+
Subjt:  SRK

P20825 Retrovirus-related Pol polyprotein from transposon 297 | e value: 3.3e-45 | % identity: 34.82
Query:  MSFGLTNAPAVFMDLMNIVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLQDNKLYAKFSKCEFWLKQVSFLGHMVSKAGVFMDPAKIEAVTSW
        M FGL NAPA F   MN + R  L+   +V++DDI+I+S +  EH   +++V   L D  L  +  KCEF  K+ +FLGH+V+  G+  +P K++A+ S+
Subjt:  MSFGLTNAPAVFMDLMNIVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLQDNKLYAKFSKCEFWLKQVSFLGHMVSKAGVFMDPAKIEAVTSW

Query:  PRPFTVSEVRSFLGLAGYYRRFVENFSRIATP---------------------------------------------LTHDASKKGLGCVLMQQGKVVAY
        P P    E+R+FLGL GYYR+F+ N++ IA P                                             LT DAS   LG VL Q G  +++
Subjt:  PRPFTVSEVRSFLGLAGYYRRFVENFSRIATP---------------------------------------------LTHDASKKGLGCVLMQQGKVVAY

Query:  ASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGENIQIFMDHKSLKYFFTQKELNMRQRKWLELVKDYDCEILYHPSKANVVADALSR----KAEAGQ
         SR L  HE NY   + EL A+V+A K +RHYL G    I  DH+ L++    KE   +  +W   + +Y  +I Y   K N VADALSR    +    +
Subjt:  ASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGENIQIFMDHKSLKYFFTQKELNMRQRKWLELVKDYDCEILYHPSKANVVADALSR----KAEAGQ

Query:  AVEFSVSSDSGLL
        A + S   D+  L
Subjt:  AVEFSVSSDSGLL

Q8I7P9 Retrovirus-related Pol polyprotein from transposon opus | e value: 3.5e-42 | % identity: 31.48
Query:  MSFGLTNAPAVFMDLMNIVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLQDNKLYAKFSKCEFWLKQVSFLGHMVSKAGVFMDPAKIEAVTSW
        + FGL NAPA+F  +++ + RE +     V+IDDI+++S+    H ++LR+VL +L    L     K  F   QV FLG++V+  G+  DP K+ A++  
Subjt:  MSFGLTNAPAVFMDLMNIVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLQDNKLYAKFSKCEFWLKQVSFLGHMVSKAGVFMDPAKIEAVTSW

Query:  PRPFTVSEVRSFLGLAGYYRRFVENFSRIATPLTH-------------------------------------------------------DASKKGLGCV
        P P +V E++ FLG+  YYR+F+++++++A PLT+                                                       DAS   +G V
Subjt:  PRPFTVSEVRSFLGLAGYYRRFVENFSRIATPLTH-------------------------------------------------------DASKKGLGCV

Query:  LMQ----QGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGE-NIQIFMDHKSLKYFFTQKELNMRQRKWLELVKDYDCEILYHPSKANVVA
        L Q    + + +AY SR L   E+NY T + E+ A++++L   R YLYG   I+++ DH+ L +    +  N + ++W   +++Y+CE++Y P K+NVVA
Subjt:  LMQ----QGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGE-NIQIFMDHKSLKYFFTQKELNMRQRKWLELVKDYDCEILYHPSKANVVA

Query:  DALSR
        DALSR
Subjt:  DALSR

Arabidopsis top hits (e value, % identity, alignment)
ATMG00860.1 DNA/RNA polymerases superfamily protein | e value: 4.5e-21 | % identity: 48.45
Query:  HLRMVLQTLQDNKLYAKFSKCEFWLKQVSFLG--HMVSKAGVFMDPAKIEAVTSWPRPFTVSEVRSFLGLAGYYRRFVENFSRIATPLTHDASKKGL
        HL MVLQ  + ++ YA   KC F   Q+++LG  H++S  GV  DPAK+EA+  WP P   +E+R FLGL GYYRRFV+N+ +I  PLT    K  L
Subjt:  HLRMVLQTLQDNKLYAKFSKCEFWLKQVSFLG--HMVSKAGVFMDPAKIEAVTSWPRPFTVSEVRSFLGLAGYYRRFVENFSRIATPLTHDASKKGL


Sequences
CDS sequence
ATGTCTTTTGGTTTGACGAATGCTCCGGCAGTGTTTATGGACTTGATGAACATAGTGTTTAGGGAGTTCCTAGACACTTTTGTGATCGTGTTTATTGATGACATCTTGAT
ATATTCCAAGACGGAGGCTGAGCATGAGGAACATTTACGTATGGTTCTGCAAACCCTTCAGGATAATAAATTGTATGCAAAGTTCTCGAAATGCGAGTTTTGGCTGAAAC
AGGTGTCCTTTCTAGGCCATATGGTGTCTAAGGCTGGAGTTTTTATGGATCCAGCTAAGATAGAGGCAGTCACCAGTTGGCCCCGACCTTTCACAGTCAGTGAGGTCCGT
AGCTTTCTGGGCTTAGCAGGTTATTACCGGCGGTTTGTGGAGAACTTTTCTCGTATAGCTACTCCTCTTACTCATGATGCTTCTAAGAAGGGTTTGGGTTGTGTATTGAT
GCAGCAAGGTAAGGTAGTCGCTTATGCTTCTCGTCAGTTGAAGAGTCATGAGCAGAATTACCCTACACATGATTTAGAGTTGGCAGCAGTGGTTTTTGCATTGAAGATAT
GGAGGCATTACTTGTATGGTGAAAATATACAGATCTTCATGGATCATAAGAGCTTGAAATACTTCTTTACTCAGAAGGAATTGAATATGAGACAGCGAAAATGGCTTGAG
TTAGTGAAGGATTACGATTGTGAGATACTGTATCATCCAAGCAAGGCAAATGTGGTAGCTGATGCTCTTAGTAGAAAGGCAGAGGCAGGACAAGCTGTTGAGTTCTCCGT
ATCCTCTGATAGTGGACTGTTGTTTGAGAGGCGTCTCTGTGTGCCGTCAGATAGTGCGGTTAAAACAGAATTATTATCTAAGGCTCATAGTTCCCCATTTTCCATGCACC
CGGGAAGTACGAAGATGTATCAGCACCCGAAGCGGGTTTATTGGTGGCGTAATATGAAGAGAGAGGTGGAAAAATTTGTTAGTAAATGCTTGGTGTGTCAGCAGGTTAAG
GCACCAAGGTAG
mRNA sequence
ATGTCTTTTGGTTTGACGAATGCTCCGGCAGTGTTTATGGACTTGATGAACATAGTGTTTAGGGAGTTCCTAGACACTTTTGTGATCGTGTTTATTGATGACATCTTGAT
ATATTCCAAGACGGAGGCTGAGCATGAGGAACATTTACGTATGGTTCTGCAAACCCTTCAGGATAATAAATTGTATGCAAAGTTCTCGAAATGCGAGTTTTGGCTGAAAC
AGGTGTCCTTTCTAGGCCATATGGTGTCTAAGGCTGGAGTTTTTATGGATCCAGCTAAGATAGAGGCAGTCACCAGTTGGCCCCGACCTTTCACAGTCAGTGAGGTCCGT
AGCTTTCTGGGCTTAGCAGGTTATTACCGGCGGTTTGTGGAGAACTTTTCTCGTATAGCTACTCCTCTTACTCATGATGCTTCTAAGAAGGGTTTGGGTTGTGTATTGAT
GCAGCAAGGTAAGGTAGTCGCTTATGCTTCTCGTCAGTTGAAGAGTCATGAGCAGAATTACCCTACACATGATTTAGAGTTGGCAGCAGTGGTTTTTGCATTGAAGATAT
GGAGGCATTACTTGTATGGTGAAAATATACAGATCTTCATGGATCATAAGAGCTTGAAATACTTCTTTACTCAGAAGGAATTGAATATGAGACAGCGAAAATGGCTTGAG
TTAGTGAAGGATTACGATTGTGAGATACTGTATCATCCAAGCAAGGCAAATGTGGTAGCTGATGCTCTTAGTAGAAAGGCAGAGGCAGGACAAGCTGTTGAGTTCTCCGT
ATCCTCTGATAGTGGACTGTTGTTTGAGAGGCGTCTCTGTGTGCCGTCAGATAGTGCGGTTAAAACAGAATTATTATCTAAGGCTCATAGTTCCCCATTTTCCATGCACC
CGGGAAGTACGAAGATGTATCAGCACCCGAAGCGGGTTTATTGGTGGCGTAATATGAAGAGAGAGGTGGAAAAATTTGTTAGTAAATGCTTGGTGTGTCAGCAGGTTAAG
GCACCAAGGTAG
Protein sequence
MSFGLTNAPAVFMDLMNIVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLQDNKLYAKFSKCEFWLKQVSFLGHMVSKAGVFMDPAKIEAVTSWPRPFTVSEVR
SFLGLAGYYRRFVENFSRIATPLTHDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGENIQIFMDHKSLKYFFTQKELNMRQRKWLE
LVKDYDCEILYHPSKANVVADALSRKAEAGQAVEFSVSSDSGLLFERRLCVPSDSAVKTELLSKAHSSPFSMHPGSTKMYQHPKRVYWWRNMKREVEKFVSKCLVCQQVK
APR
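
The protein sequence can be cross-checked against the CDS by translating codons with the standard genetic code. The following is a minimal Python sketch, not the database's own pipeline. `translate` is a hypothetical helper, the choice of the standard code (rather than an organelle-specific table) is an assumption, and only the first 30 and last 12 nucleotides of the CDS are used to keep the example short.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order ('*' = stop).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a DNA coding sequence codon by codon; trailing partial codons are ignored."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))


# First 30 and last 12 nucleotides of the CDS listed in this record.
cds_start = "ATGTCTTTTGGTTTGACGAATGCTCCGGCA"
cds_end = "GCACCAAGGTAG"

print(translate(cds_start))
print(translate(cds_end))
```

The first call should reproduce the opening residues of the protein sequence, and the second should give its final three residues followed by the stop symbol `*`. Running `translate` on the full CDS and stripping the trailing `*` gives a direct comparison with the listed protein.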