; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cmc02g0050101 (gene) of Melon (Charmono) v1.1 genome

Gene IDCmc02g0050101
OrganismCucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
DescriptionIntegrase
Genome locationCMiso1.1chr02:16084056..16085081
RNA-Seq ExpressionCmc02g0050101
SyntenyCmc02g0050101
Gene Ontology termsGO:0006397 - mRNA processing (biological process)
GO:0015074 - DNA integration (biological process)
GO:0003729 - mRNA binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0039947.1 integrase [Cucumis melo var. makuwa]4.6e-16486.84Show/hide
Query:  MRHIQEIYNATDIINDDHFANFALFAGVDPVTFDEVIQDEKWKIAMDQEVDVIRRNETWELIELLTKKRALGVKWVYRIKLKSDGNVEKYKAILVVKGYK
        MR IQEIYN T+ INDDHFANFALFAGVDPVTFDE IQDEKWKIAMDQE+D IRRNETWEL+EL T K+ALGVKWVYR KLKSDGNVEKYKA LVVKGYK
Subjt:  MRHIQEIYNATDIINDDHFANFALFAGVDPVTFDEVIQDEKWKIAMDQEVDVIRRNETWELIELLTKKRALGVKWVYRIKLKSDGNVEKYKAILVVKGYK

Query:  QEYGVDYEEIFAPVTRIEIIRLILSLVAQNGWKVYQMDVKSAFSNGQLKEKILLVQPLGYVKRGEEEKVYKLKKALYGLKQAPRAWYNHIDNFFLKTGFR
        QEYGVDYEEIFAPVTRIE IRLILSL AQNGWKVYQMDVKSAF NG LKE+I + QPLGYV+RGEEEKVYKLKKALYGLKQAPRAWY+ ID+FFLKTGFR
Subjt:  QEYGVDYEEIFAPVTRIEIIRLILSLVAQNGWKVYQMDVKSAFSNGQLKEKILLVQPLGYVKRGEEEKVYKLKKALYGLKQAPRAWYNHIDNFFLKTGFR

Query:  RCPYEHELYVKEDKYDKFLIVSLYVDDLLFTGNDKFLCYDFKNFMKKKFEMSNMGLIFYFLGIEVNQN-GEIVISQQN-AHDLLKKILIENALPCNTPMD
        RCPYEH LYVKEDKY KFLIVSLYVDDLLFTGNDKFLC DFKN MK +FEMS+MGLI YFLGIEVNQN GEIVISQQ  AHDLLKK  +ENA PCNTPMD
Subjt:  RCPYEHELYVKEDKYDKFLIVSLYVDDLLFTGNDKFLCYDFKNFMKKKFEMSNMGLIFYFLGIEVNQN-GEIVISQQN-AHDLLKKILIENALPCNTPMD

Query:  ANLKLCKDDIGEAINPSLYRSLVGSLMYLTTTRLDILFAVSI
        ANLKLCKDDIGEA++PSLYRSLVGSLMYLT TR DILFAVS+
Subjt:  ANLKLCKDDIGEAINPSLYRSLVGSLMYLTTTRLDILFAVSI

KAA0056210.1 integrase [Cucumis melo var. makuwa]1.1e-19299.71Show/hide
Query:  MRHIQEIYNATDIINDDHFANFALFAGVDPVTFDEVIQDEKWKIAMDQEVDVIRRNETWELIELLTKKRALGVKWVYRIKLKSDGNVEKYKAILVVKGYK
        MRHIQEIYNATDIINDDHFANFALFAGVDPVTFDEVIQDEKWKIAMDQEVDVIRRNETWELIELLTKKRALGVKWVYRIKLKSDGNVEKYKAILVVKGYK
Subjt:  MRHIQEIYNATDIINDDHFANFALFAGVDPVTFDEVIQDEKWKIAMDQEVDVIRRNETWELIELLTKKRALGVKWVYRIKLKSDGNVEKYKAILVVKGYK

Query:  QEYGVDYEEIFAPVTRIEIIRLILSLVAQNGWKVYQMDVKSAFSNGQLKEKILLVQPLGYVKRGEEEKVYKLKKALYGLKQAPRAWYNHIDNFFLKTGFR
        QEYGVDYEEIFAPVTRIEIIRLILSLVAQNGWKVYQMDVKSAFSNGQLKEKILLVQPLGYVKRGEEEKVYKLKKALYGLKQAPRAWYNHIDNFFLKTGFR
Subjt:  QEYGVDYEEIFAPVTRIEIIRLILSLVAQNGWKVYQMDVKSAFSNGQLKEKILLVQPLGYVKRGEEEKVYKLKKALYGLKQAPRAWYNHIDNFFLKTGFR

Query:  RCPYEHELYVKEDKYDKFLIVSLYVDDLLFTGNDKFLCYDFKNFMKKKFEMSNMGLIFYFLGIEVNQNGEIVISQQNAHDLLKKILIENALPCNTPMDAN
        RCPYEH LYVKEDKYDKFLIVSLYVDDLLFTGNDKFLCYDFKNFMKKKFEMSNMGLIFYFLGIEVNQNGEIVISQQNAHDLLKKILIENALPCNTPMDAN
Subjt:  RCPYEHELYVKEDKYDKFLIVSLYVDDLLFTGNDKFLCYDFKNFMKKKFEMSNMGLIFYFLGIEVNQNGEIVISQQNAHDLLKKILIENALPCNTPMDAN

Query:  LKLCKDDIGEAINPSLYRSLVGSLMYLTTTRLDILFAVSIC
        LKLCKDDIGEAINPSLYRSLVGSLMYLTTTRLDILFAVSIC
Subjt:  LKLCKDDIGEAINPSLYRSLVGSLMYLTTTRLDILFAVSIC

KAA0060377.1 integrase [Cucumis melo var. makuwa]4.6e-16486.84Show/hide
Query:  MRHIQEIYNATDIINDDHFANFALFAGVDPVTFDEVIQDEKWKIAMDQEVDVIRRNETWELIELLTKKRALGVKWVYRIKLKSDGNVEKYKAILVVKGYK
        MR IQEIYN T+ INDDHFANFALFAGVDPVTFDE IQDEKWKIAMDQE+D IRRNETWEL+EL T K+ALGVKWVYR KLKSDGNVEKYKA LVVKGYK
Subjt:  MRHIQEIYNATDIINDDHFANFALFAGVDPVTFDEVIQDEKWKIAMDQEVDVIRRNETWELIELLTKKRALGVKWVYRIKLKSDGNVEKYKAILVVKGYK

Query:  QEYGVDYEEIFAPVTRIEIIRLILSLVAQNGWKVYQMDVKSAFSNGQLKEKILLVQPLGYVKRGEEEKVYKLKKALYGLKQAPRAWYNHIDNFFLKTGFR
        QEYGVDYEEIFAPVTRIE IRLILSL AQNGWKVYQMDVKSAF NG LKE+I + QPLGYV+RGEEEKVYKLKKALYGLKQAPRAWY+ ID+FFLKTGFR
Subjt:  QEYGVDYEEIFAPVTRIEIIRLILSLVAQNGWKVYQMDVKSAFSNGQLKEKILLVQPLGYVKRGEEEKVYKLKKALYGLKQAPRAWYNHIDNFFLKTGFR

Query:  RCPYEHELYVKEDKYDKFLIVSLYVDDLLFTGNDKFLCYDFKNFMKKKFEMSNMGLIFYFLGIEVNQN-GEIVISQQN-AHDLLKKILIENALPCNTPMD
        RCPYEH LYVKEDKY KFLIVSLYVDDLLFTGNDKFLC DFKN MK +FEMS+MGLI YFLGIEVNQN GEIVISQQ  AHDLLKK  +ENA PCNTPMD
Subjt:  RCPYEHELYVKEDKYDKFLIVSLYVDDLLFTGNDKFLCYDFKNFMKKKFEMSNMGLIFYFLGIEVNQN-GEIVISQQN-AHDLLKKILIENALPCNTPMD

Query:  ANLKLCKDDIGEAINPSLYRSLVGSLMYLTTTRLDILFAVSI
        ANLKLCKDDIGEA++PSLYRSLVGSLMYLT TR DILFAVS+
Subjt:  ANLKLCKDDIGEAINPSLYRSLVGSLMYLTTTRLDILFAVSI

TYJ95504.1 integrase [Cucumis melo var. makuwa]4.6e-16486.84Show/hide
Query:  MRHIQEIYNATDIINDDHFANFALFAGVDPVTFDEVIQDEKWKIAMDQEVDVIRRNETWELIELLTKKRALGVKWVYRIKLKSDGNVEKYKAILVVKGYK
        MR IQEIYN T+ INDDHFANFALFAGVDPVTFDE IQDEKWKIAMDQE+D IRRNETWEL+EL T K+ALGVKWVYR KLKSDGNVEKYKA LVVKGYK
Subjt:  MRHIQEIYNATDIINDDHFANFALFAGVDPVTFDEVIQDEKWKIAMDQEVDVIRRNETWELIELLTKKRALGVKWVYRIKLKSDGNVEKYKAILVVKGYK

Query:  QEYGVDYEEIFAPVTRIEIIRLILSLVAQNGWKVYQMDVKSAFSNGQLKEKILLVQPLGYVKRGEEEKVYKLKKALYGLKQAPRAWYNHIDNFFLKTGFR
        QEYGVDYEEIFAPVTRIE IRLILSL AQNGWKVYQMDVKSAF NG LKE+I + QPLGYV+RGEEEKVYKLKKALYGLKQAPRAWY+ ID+FFLKTGFR
Subjt:  QEYGVDYEEIFAPVTRIEIIRLILSLVAQNGWKVYQMDVKSAFSNGQLKEKILLVQPLGYVKRGEEEKVYKLKKALYGLKQAPRAWYNHIDNFFLKTGFR

Query:  RCPYEHELYVKEDKYDKFLIVSLYVDDLLFTGNDKFLCYDFKNFMKKKFEMSNMGLIFYFLGIEVNQN-GEIVISQQN-AHDLLKKILIENALPCNTPMD
        RCPYEH LYVKEDKY KFLIVSLYVDDLLFTGNDKFLC DFKN MK +FEMS+MGLI YFLGIEVNQN GEIVISQQ  AHDLLKK  +ENA PCNTPMD
Subjt:  RCPYEHELYVKEDKYDKFLIVSLYVDDLLFTGNDKFLCYDFKNFMKKKFEMSNMGLIFYFLGIEVNQN-GEIVISQQN-AHDLLKKILIENALPCNTPMD

Query:  ANLKLCKDDIGEAINPSLYRSLVGSLMYLTTTRLDILFAVSI
        ANLKLCKDDIGEA++PSLYRSLVGSLMYLT TR DILFAVS+
Subjt:  ANLKLCKDDIGEAINPSLYRSLVGSLMYLTTTRLDILFAVSI

TYK30104.1 integrase [Cucumis melo var. makuwa]4.6e-16486.55Show/hide
Query:  MRHIQEIYNATDIINDDHFANFALFAGVDPVTFDEVIQDEKWKIAMDQEVDVIRRNETWELIELLTKKRALGVKWVYRIKLKSDGNVEKYKAILVVKGYK
        MR IQEIYN T+ INDDHFANFALFAGVDPVTFDE IQDEKWKIAMDQE+D IRRNETWEL+EL T K+ALGVKWVYR KLKSDGNVEKYKA LVVKGYK
Subjt:  MRHIQEIYNATDIINDDHFANFALFAGVDPVTFDEVIQDEKWKIAMDQEVDVIRRNETWELIELLTKKRALGVKWVYRIKLKSDGNVEKYKAILVVKGYK

Query:  QEYGVDYEEIFAPVTRIEIIRLILSLVAQNGWKVYQMDVKSAFSNGQLKEKILLVQPLGYVKRGEEEKVYKLKKALYGLKQAPRAWYNHIDNFFLKTGFR
        QEYGVDYEEIFAPVTRIE IRLILSL AQNGWKVYQMDVKSAF NG LKE+I + QPLGYV+RGEEEKVYKLKKALYGLKQAPRAWY+ ID+FFLKTGFR
Subjt:  QEYGVDYEEIFAPVTRIEIIRLILSLVAQNGWKVYQMDVKSAFSNGQLKEKILLVQPLGYVKRGEEEKVYKLKKALYGLKQAPRAWYNHIDNFFLKTGFR

Query:  RCPYEHELYVKEDKYDKFLIVSLYVDDLLFTGNDKFLCYDFKNFMKKKFEMSNMGLIFYFLGIEVNQN-GEIVISQQN-AHDLLKKILIENALPCNTPMD
        RCPYEH LYVKEDKY KFLIVSLYVDDLLFTGNDKFLC DFKN MKK+FEMS+MGLI YFLGIEVNQN GEIVISQQ  AHDLLKK  +ENA PCNTPMD
Subjt:  RCPYEHELYVKEDKYDKFLIVSLYVDDLLFTGNDKFLCYDFKNFMKKKFEMSNMGLIFYFLGIEVNQN-GEIVISQQN-AHDLLKKILIENALPCNTPMD

Query:  ANLKLCKDDIGEAINPSLYRSLVGSLMYLTTTRLDILFAVSI
        ANL+LCKDDIGEA++PSLYRSLVGSLMYLT TR DILFA+S+
Subjt:  ANLKLCKDDIGEAINPSLYRSLVGSLMYLTTTRLDILFAVSI

TrEMBL top hitse value%identityAlignment
A0A5A7UM47 Integrase5.1e-19399.71Show/hide
Query:  MRHIQEIYNATDIINDDHFANFALFAGVDPVTFDEVIQDEKWKIAMDQEVDVIRRNETWELIELLTKKRALGVKWVYRIKLKSDGNVEKYKAILVVKGYK
        MRHIQEIYNATDIINDDHFANFALFAGVDPVTFDEVIQDEKWKIAMDQEVDVIRRNETWELIELLTKKRALGVKWVYRIKLKSDGNVEKYKAILVVKGYK
Subjt:  MRHIQEIYNATDIINDDHFANFALFAGVDPVTFDEVIQDEKWKIAMDQEVDVIRRNETWELIELLTKKRALGVKWVYRIKLKSDGNVEKYKAILVVKGYK

Query:  QEYGVDYEEIFAPVTRIEIIRLILSLVAQNGWKVYQMDVKSAFSNGQLKEKILLVQPLGYVKRGEEEKVYKLKKALYGLKQAPRAWYNHIDNFFLKTGFR
        QEYGVDYEEIFAPVTRIEIIRLILSLVAQNGWKVYQMDVKSAFSNGQLKEKILLVQPLGYVKRGEEEKVYKLKKALYGLKQAPRAWYNHIDNFFLKTGFR
Subjt:  QEYGVDYEEIFAPVTRIEIIRLILSLVAQNGWKVYQMDVKSAFSNGQLKEKILLVQPLGYVKRGEEEKVYKLKKALYGLKQAPRAWYNHIDNFFLKTGFR

Query:  RCPYEHELYVKEDKYDKFLIVSLYVDDLLFTGNDKFLCYDFKNFMKKKFEMSNMGLIFYFLGIEVNQNGEIVISQQNAHDLLKKILIENALPCNTPMDAN
        RCPYEH LYVKEDKYDKFLIVSLYVDDLLFTGNDKFLCYDFKNFMKKKFEMSNMGLIFYFLGIEVNQNGEIVISQQNAHDLLKKILIENALPCNTPMDAN
Subjt:  RCPYEHELYVKEDKYDKFLIVSLYVDDLLFTGNDKFLCYDFKNFMKKKFEMSNMGLIFYFLGIEVNQNGEIVISQQNAHDLLKKILIENALPCNTPMDAN

Query:  LKLCKDDIGEAINPSLYRSLVGSLMYLTTTRLDILFAVSIC
        LKLCKDDIGEAINPSLYRSLVGSLMYLTTTRLDILFAVSIC
Subjt:  LKLCKDDIGEAINPSLYRSLVGSLMYLTTTRLDILFAVSIC

A0A5A7UZJ8 Integrase2.2e-16486.84Show/hide
Query:  MRHIQEIYNATDIINDDHFANFALFAGVDPVTFDEVIQDEKWKIAMDQEVDVIRRNETWELIELLTKKRALGVKWVYRIKLKSDGNVEKYKAILVVKGYK
        MR IQEIYN T+ INDDHFANFALFAGVDPVTFDE IQDEKWKIAMDQE+D IRRNETWEL+EL T K+ALGVKWVYR KLKSDGNVEKYKA LVVKGYK
Subjt:  MRHIQEIYNATDIINDDHFANFALFAGVDPVTFDEVIQDEKWKIAMDQEVDVIRRNETWELIELLTKKRALGVKWVYRIKLKSDGNVEKYKAILVVKGYK

Query:  QEYGVDYEEIFAPVTRIEIIRLILSLVAQNGWKVYQMDVKSAFSNGQLKEKILLVQPLGYVKRGEEEKVYKLKKALYGLKQAPRAWYNHIDNFFLKTGFR
        QEYGVDYEEIFAPVTRIE IRLILSL AQNGWKVYQMDVKSAF NG LKE+I + QPLGYV+RGEEEKVYKLKKALYGLKQAPRAWY+ ID+FFLKTGFR
Subjt:  QEYGVDYEEIFAPVTRIEIIRLILSLVAQNGWKVYQMDVKSAFSNGQLKEKILLVQPLGYVKRGEEEKVYKLKKALYGLKQAPRAWYNHIDNFFLKTGFR

Query:  RCPYEHELYVKEDKYDKFLIVSLYVDDLLFTGNDKFLCYDFKNFMKKKFEMSNMGLIFYFLGIEVNQN-GEIVISQQN-AHDLLKKILIENALPCNTPMD
        RCPYEH LYVKEDKY KFLIVSLYVDDLLFTGNDKFLC DFKN MK +FEMS+MGLI YFLGIEVNQN GEIVISQQ  AHDLLKK  +ENA PCNTPMD
Subjt:  RCPYEHELYVKEDKYDKFLIVSLYVDDLLFTGNDKFLCYDFKNFMKKKFEMSNMGLIFYFLGIEVNQN-GEIVISQQN-AHDLLKKILIENALPCNTPMD

Query:  ANLKLCKDDIGEAINPSLYRSLVGSLMYLTTTRLDILFAVSI
        ANLKLCKDDIGEA++PSLYRSLVGSLMYLT TR DILFAVS+
Subjt:  ANLKLCKDDIGEAINPSLYRSLVGSLMYLTTTRLDILFAVSI

A0A5D3BQ81 Integrase2.2e-16486.84Show/hide
Query:  MRHIQEIYNATDIINDDHFANFALFAGVDPVTFDEVIQDEKWKIAMDQEVDVIRRNETWELIELLTKKRALGVKWVYRIKLKSDGNVEKYKAILVVKGYK
        MR IQEIYN T+ INDDHFANFALFAGVDPVTFDE IQDEKWKIAMDQE+D IRRNETWEL+EL T K+ALGVKWVYR KLKSDGNVEKYKA LVVKGYK
Subjt:  MRHIQEIYNATDIINDDHFANFALFAGVDPVTFDEVIQDEKWKIAMDQEVDVIRRNETWELIELLTKKRALGVKWVYRIKLKSDGNVEKYKAILVVKGYK

Query:  QEYGVDYEEIFAPVTRIEIIRLILSLVAQNGWKVYQMDVKSAFSNGQLKEKILLVQPLGYVKRGEEEKVYKLKKALYGLKQAPRAWYNHIDNFFLKTGFR
        QEYGVDYEEIFAPVTRIE IRLILSL AQNGWKVYQMDVKSAF NG LKE+I + QPLGYV+RGEEEKVYKLKKALYGLKQAPRAWY+ ID+FFLKTGFR
Subjt:  QEYGVDYEEIFAPVTRIEIIRLILSLVAQNGWKVYQMDVKSAFSNGQLKEKILLVQPLGYVKRGEEEKVYKLKKALYGLKQAPRAWYNHIDNFFLKTGFR

Query:  RCPYEHELYVKEDKYDKFLIVSLYVDDLLFTGNDKFLCYDFKNFMKKKFEMSNMGLIFYFLGIEVNQN-GEIVISQQN-AHDLLKKILIENALPCNTPMD
        RCPYEH LYVKEDKY KFLIVSLYVDDLLFTGNDKFLC DFKN MK +FEMS+MGLI YFLGIEVNQN GEIVISQQ  AHDLLKK  +ENA PCNTPMD
Subjt:  RCPYEHELYVKEDKYDKFLIVSLYVDDLLFTGNDKFLCYDFKNFMKKKFEMSNMGLIFYFLGIEVNQN-GEIVISQQN-AHDLLKKILIENALPCNTPMD

Query:  ANLKLCKDDIGEAINPSLYRSLVGSLMYLTTTRLDILFAVSI
        ANLKLCKDDIGEA++PSLYRSLVGSLMYLT TR DILFAVS+
Subjt:  ANLKLCKDDIGEAINPSLYRSLVGSLMYLTTTRLDILFAVSI

A0A5D3CLV1 Integrase2.2e-16486.84Show/hide
Query:  MRHIQEIYNATDIINDDHFANFALFAGVDPVTFDEVIQDEKWKIAMDQEVDVIRRNETWELIELLTKKRALGVKWVYRIKLKSDGNVEKYKAILVVKGYK
        MR IQEIYN T+ INDDHFANFALFAGVDPVTFDE IQDEKWKIAMDQE+D IRRNETWEL+EL T K+ALGVKWVYR KLKSDGNVEKYKA LVVKGYK
Subjt:  MRHIQEIYNATDIINDDHFANFALFAGVDPVTFDEVIQDEKWKIAMDQEVDVIRRNETWELIELLTKKRALGVKWVYRIKLKSDGNVEKYKAILVVKGYK

Query:  QEYGVDYEEIFAPVTRIEIIRLILSLVAQNGWKVYQMDVKSAFSNGQLKEKILLVQPLGYVKRGEEEKVYKLKKALYGLKQAPRAWYNHIDNFFLKTGFR
        QEYGVDYEEIFAPVTRIE IRLILSL AQNGWKVYQMDVKSAF NG LKE+I + QPLGYV+RGEEEKVYKLKKALYGLKQAPRAWY+ ID+FFLKTGFR
Subjt:  QEYGVDYEEIFAPVTRIEIIRLILSLVAQNGWKVYQMDVKSAFSNGQLKEKILLVQPLGYVKRGEEEKVYKLKKALYGLKQAPRAWYNHIDNFFLKTGFR

Query:  RCPYEHELYVKEDKYDKFLIVSLYVDDLLFTGNDKFLCYDFKNFMKKKFEMSNMGLIFYFLGIEVNQN-GEIVISQQN-AHDLLKKILIENALPCNTPMD
        RCPYEH LYVKEDKY KFLIVSLYVDDLLFTGNDKFLC DFKN MK +FEMS+MGLI YFLGIEVNQN GEIVISQQ  AHDLLKK  +ENA PCNTPMD
Subjt:  RCPYEHELYVKEDKYDKFLIVSLYVDDLLFTGNDKFLCYDFKNFMKKKFEMSNMGLIFYFLGIEVNQN-GEIVISQQN-AHDLLKKILIENALPCNTPMD

Query:  ANLKLCKDDIGEAINPSLYRSLVGSLMYLTTTRLDILFAVSI
        ANLKLCKDDIGEA++PSLYRSLVGSLMYLT TR DILFAVS+
Subjt:  ANLKLCKDDIGEAINPSLYRSLVGSLMYLTTTRLDILFAVSI

A0A5D3E3T2 Integrase2.2e-16486.84Show/hide
Query:  MRHIQEIYNATDIINDDHFANFALFAGVDPVTFDEVIQDEKWKIAMDQEVDVIRRNETWELIELLTKKRALGVKWVYRIKLKSDGNVEKYKAILVVKGYK
        MR IQEIYN T+ INDDHFANFALFAGVDPVTFDE IQDEKWKIAMDQE+D IRRNETWEL+EL T K+ALGVKWVYR KLKSDGNVEKYKA LVVKGYK
Subjt:  MRHIQEIYNATDIINDDHFANFALFAGVDPVTFDEVIQDEKWKIAMDQEVDVIRRNETWELIELLTKKRALGVKWVYRIKLKSDGNVEKYKAILVVKGYK

Query:  QEYGVDYEEIFAPVTRIEIIRLILSLVAQNGWKVYQMDVKSAFSNGQLKEKILLVQPLGYVKRGEEEKVYKLKKALYGLKQAPRAWYNHIDNFFLKTGFR
        QEYGVDYEEIFAPVTRIE IRLILSL AQNGWKVYQMDVKSAF NG LKE+I + QPLGYV+RGEEEKVYKLKKALYGLKQAPRAWY+ ID+FFLKTGFR
Subjt:  QEYGVDYEEIFAPVTRIEIIRLILSLVAQNGWKVYQMDVKSAFSNGQLKEKILLVQPLGYVKRGEEEKVYKLKKALYGLKQAPRAWYNHIDNFFLKTGFR

Query:  RCPYEHELYVKEDKYDKFLIVSLYVDDLLFTGNDKFLCYDFKNFMKKKFEMSNMGLIFYFLGIEVNQN-GEIVISQQN-AHDLLKKILIENALPCNTPMD
        RCPYEH LYVKEDKY KFLIVSLYVDDLLFTGNDKFLC DFKN MK +FEMS+MGLI YFLGIEVNQN GEIVISQQ  AHDLLKK  +ENA PCNTPMD
Subjt:  RCPYEHELYVKEDKYDKFLIVSLYVDDLLFTGNDKFLCYDFKNFMKKKFEMSNMGLIFYFLGIEVNQN-GEIVISQQN-AHDLLKKILIENALPCNTPMD

Query:  ANLKLCKDDIGEAINPSLYRSLVGSLMYLTTTRLDILFAVSI
        ANLKLCKDDIGEA++PSLYRSLVGSLMYLT TR DILFAVS+
Subjt:  ANLKLCKDDIGEAINPSLYRSLVGSLMYLTTTRLDILFAVSI

SwissProt top hitse value%identityAlignment
P04146 Copia protein4.9e-4435.85Show/hide
Query:  PVTFDEV-IQDEK--WKIAMDQEVDVIRRNETWELIELLTKKRALGVKWVYRIKLKSDGNVEKYKAILVVKGYKQEYGVDYEEIFAPVTRIEIIRLILSL
        P +FDE+  +D+K  W+ A++ E++  + N TW + +    K  +  +WV+ +K    GN  +YKA LV +G+ Q+Y +DYEE FAPV RI   R ILSL
Subjt:  PVTFDEV-IQDEK--WKIAMDQEVDVIRRNETWELIELLTKKRALGVKWVYRIKLKSDGNVEKYKAILVVKGYKQEYGVDYEEIFAPVTRIEIIRLILSL

Query:  VAQNGWKVYQMDVKSAFSNGQLKEKILLVQPLGYVKRGEEEKVYKLKKALYGLKQAPRAWYNHIDNFFLKTGFRRCPYEHELYV-KEDKYDKFLIVSLYV
        V Q   KV+QMDVK+AF NG LKE+I +  P G       + V KL KA+YGLKQA R W+   +    +  F     +  +Y+  +   ++ + V LYV
Subjt:  VAQNGWKVYQMDVKSAFSNGQLKEKILLVQPLGYVKRGEEEKVYKLKKALYGLKQAPRAWYNHIDNFFLKTGFRRCPYEHELYV-KEDKYDKFLIVSLYV

Query:  DDLLFTGNDKFLCYDFKNFMKKKFEMSNMGLIFYFLGIEVNQNGEIVISQQNAH--DLLKKILIENALPCNTPMDANLKLCKDDIGEAINPSLYRSLVGS
        DD++    D     +FK ++ +KF M+++  I +F+GI +    + +   Q+A+   +L K  +EN    +TP+ + +     +  E  N    RSL+G 
Subjt:  DDLLFTGNDKFLCYDFKNFMKKKFEMSNMGLIFYFLGIEVNQNGEIVISQQNAH--DLLKKILIENALPCNTPMDANLKLCKDDIGEAINPSLYRSLVGS

Query:  LMY-LTTTRLDILFAVSI
        LMY +  TR D+  AV+I
Subjt:  LMY-LTTTRLDILFAVSI

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-941.6e-5038.34Show/hide
Query:  DPVTFDEVI-QDEKWKI--AMDQEVDVIRRNETWELIELLTKKRALGVKWVYRIKLKSDGNVEKYKAILVVKGYKQEYGVDYEEIFAPVTRIEIIRLILS
        +P +  EV+   EK ++  AM +E++ +++N T++L+EL   KR L  KWV+++K   D  + +YKA LVVKG++Q+ G+D++EIF+PV ++  IR ILS
Subjt:  DPVTFDEVI-QDEKWKI--AMDQEVDVIRRNETWELIELLTKKRALGVKWVYRIKLKSDGNVEKYKAILVVKGYKQEYGVDYEEIFAPVTRIEIIRLILS

Query:  LVAQNGWKVYQMDVKSAFSNGQLKEKILLVQPLGYVKRGEEEKVYKLKKALYGLKQAPRAWYNHIDNFFLKTGFRRCPYEHELYVKEDKYDKFLIVSLYV
        L A    +V Q+DVK+AF +G L+E+I + QP G+   G++  V KL K+LYGLKQAPR WY   D+F     + +   +  +Y K    + F+I+ LYV
Subjt:  LVAQNGWKVYQMDVKSAFSNGQLKEKILLVQPLGYVKRGEEEKVYKLKKALYGLKQAPRAWYNHIDNFFLKTGFRRCPYEHELYVKEDKYDKFLIVSLYV

Query:  DDLLFTGNDKFLCYDFKNFMKKKFEMSNMGLIFYFLGIEV---NQNGEIVISQQN-AHDLLKKILIENALPCNTPMDANLKL----CKDDIGEAINPSL-
        DD+L  G DK L    K  + K F+M ++G     LG+++     + ++ +SQ+     +L++  ++NA P +TP+  +LKL    C   + E  N +  
Subjt:  DDLLFTGNDKFLCYDFKNFMKKKFEMSNMGLIFYFLGIEV---NQNGEIVISQQN-AHDLLKKILIENALPCNTPMDANLKL----CKDDIGEAINPSL-

Query:  -YRSLVGSLMY-LTTTRLDILFAVSI
         Y S VGSLMY +  TR DI  AV +
Subjt:  -YRSLVGSLMY-LTTTRLDILFAVSI

P25600 Putative transposon Ty5-1 protein YCL074W2.2e-2034.74Show/hide
Query:  MDVKSAFSNGQLKEKILLVQPLGYVKRGEEEKVYKLKKALYGLKQAPRAWYNHIDNFFLKTGFRRCPYEHELYVKEDKYDKFLIVSLYVDDLLFTGNDKF
        MDV +AF N  + E I + QP G+V     + V++L   +YGLKQAP  W  HI+N   K GF R   EH LY +    D  + +++YVDDLL       
Subjt:  MDVKSAFSNGQLKEKILLVQPLGYVKRGEEEKVYKLKKALYGLKQAPRAWYNHIDNFFLKTGFRRCPYEHELYVKEDKYDKFLIVSLYVDDLLFTGNDKF

Query:  LCYDFKNFMKKKFEMSNMGLIFYFLGIEVNQ--NGEIVISQQNAHDLLKKILIENALPCNTPMDANLKLCKD-DIGEAINPSL-----YRSLVGSLMYLT
        +    K  + K + M ++G +  FLG+ ++Q  NG+I +S Q   D + K   E+ +  NT       LC    + E  +P L     Y+S+VG L++  
Subjt:  LCYDFKNFMKKKFEMSNMGLIFYFLGIEVNQ--NGEIVISQQNAHDLLKKILIENALPCNTPMDANLKLCKD-DIGEAINPSL-----YRSLVGSLMYLT

Query:  TT-RLDILFAVSI
         T R DI + VS+
Subjt:  TT-RLDILFAVSI

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE11.4e-5136.25Show/hide
Query:  ALFAGVDPVTFDEVIQDEKWKIAMDQEVDVIRRNETWELI-ELLTKKRALGVKWVYRIKLKSDGNVEKYKAILVVKGYKQEYGVDYEEIFAPVTRIEIIR
        +L A  +P T  + ++DE+W+ AM  E++    N TW+L+    +    +G +W++  K  SDG++ +YKA LV KGY Q  G+DY E F+PV +   IR
Subjt:  ALFAGVDPVTFDEVIQDEKWKIAMDQEVDVIRRNETWELI-ELLTKKRALGVKWVYRIKLKSDGNVEKYKAILVVKGYKQEYGVDYEEIFAPVTRIEIIR

Query:  LILSLVAQNGWKVYQMDVKSAFSNGQLKEKILLVQPLGYVKRGEEEKVYKLKKALYGLKQAPRAWYNHIDNFFLKTGFRRCPYEHELYVKEDKYDKFLIV
        ++L +     W + Q+DV +AF  G L + + + QP G++ +     V KL+KALYGLKQAPRAWY  + N+ L  GF     +  L+V + +    + +
Subjt:  LILSLVAQNGWKVYQMDVKSAFSNGQLKEKILLVQPLGYVKRGEEEKVYKLKKALYGLKQAPRAWYNHIDNFFLKTGFRRCPYEHELYVKEDKYDKFLIV

Query:  SLYVDDLLFTGNDKFLCYDFKNFMKKKFEMSNMGLIFYFLGIEVNQ--NGEIVISQQNAHDLLKKILIENALPCNTPMDANLKLCKDDIGEAINPSLYRS
         +YVDD+L TGND  L ++  + + ++F + +   + YFLGIE  +   G  +  ++   DLL +  +  A P  TPM  + KL      +  +P+ YR 
Subjt:  SLYVDDLLFTGNDKFLCYDFKNFMKKKFEMSNMGLIFYFLGIEVNQ--NGEIVISQQNAHDLLKKILIENALPCNTPMDANLKLCKDDIGEAINPSLYRS

Query:  LVGSLMYLTTTRLDILFAVS
        +VGSL YL  TR DI +AV+
Subjt:  LVGSLMYLTTTRLDILFAVS

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE27.9e-5035Show/hide
Query:  ALFAGVDPVTFDEVIQDEKWKIAMDQEVDVIRRNETWELIELLTKK-RALGVKWVYRIKLKSDGNVEKYKAILVVKGYKQEYGVDYEEIFAPVTRIEIIR
        +L A  +P T  + ++D++W+ AM  E++    N TW+L+         +G +W++  K  SDG++ +YKA LV KGY Q  G+DY E F+PV +   IR
Subjt:  ALFAGVDPVTFDEVIQDEKWKIAMDQEVDVIRRNETWELIELLTKK-RALGVKWVYRIKLKSDGNVEKYKAILVVKGYKQEYGVDYEEIFAPVTRIEIIR

Query:  LILSLVAQNGWKVYQMDVKSAFSNGQLKEKILLVQPLGYVKRGEEEKVYKLKKALYGLKQAPRAWYNHIDNFFLKTGFRRCPYEHELYVKEDKYDKFLIV
        ++L +     W + Q+DV +AF  G L +++ + QP G+V +   + V +L+KA+YGLKQAPRAWY  +  + L  GF     +  L+V + +    + +
Subjt:  LILSLVAQNGWKVYQMDVKSAFSNGQLKEKILLVQPLGYVKRGEEEKVYKLKKALYGLKQAPRAWYNHIDNFFLKTGFRRCPYEHELYVKEDKYDKFLIV

Query:  SLYVDDLLFTGNDKFLCYDFKNFMKKKFEMSNMGLIFYFLGIEVNQ--NGEIVISQQNAHDLLKKILIENALPCNTPMDANLKLCKDDIGEAINPSLYRS
         +YVDD+L TGND  L     + + ++F +     + YFLGIE  +   G  +  ++   DLL +  +  A P  TPM  + KL      +  +P+ YR 
Subjt:  SLYVDDLLFTGNDKFLCYDFKNFMKKKFEMSNMGLIFYFLGIEVNQ--NGEIVISQQNAHDLLKKILIENALPCNTPMDANLKLCKDDIGEAINPSLYRS

Query:  LVGSLMYLTTTRLDILFAVS
        +VGSL YL  TR D+ +AV+
Subjt:  LVGSLMYLTTTRLDILFAVS

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 87.8e-5336.28Show/hide
Query:  DPVTFDEVIQDEKWKIAMDQEVDVIRRNETWELIELLTKKRALGVKWVYRIKLKSDGNVEKYKAILVVKGYKQEYGVDYEEIFAPVTRIEIIRLILSLVA
        +P T++E  +   W  AMD E+  +    TWE+  L   K+ +G KWVY+IK  SDG +E+YKA LV KGY Q+ G+D+ E F+PV ++  ++LIL++ A
Subjt:  DPVTFDEVIQDEKWKIAMDQEVDVIRRNETWELIELLTKKRALGVKWVYRIKLKSDGNVEKYKAILVVKGYKQEYGVDYEEIFAPVTRIEIIRLILSLVA

Query:  QNGWKVYQMDVKSAFSNGQLKEKILLVQPLGYVKRGEE----EKVYKLKKALYGLKQAPRAWYNHIDNFFLKTGFRRCPYEHELYVKEDKYDKFLIVSLY
           + ++Q+D+ +AF NG L E+I +  P GY  R  +      V  LKK++YGLKQA R W+       +  GF +   +H  ++K      FL V +Y
Subjt:  QNGWKVYQMDVKSAFSNGQLKEKILLVQPLGYVKRGEE----EKVYKLKKALYGLKQAPRAWYNHIDNFFLKTGFRRCPYEHELYVKEDKYDKFLIVSLY

Query:  VDDLLFTGNDKFLCYDFKNFMKKKFEMSNMGLIFYFLGIEVNQN--GEIVISQQNAHDLLKKILIENALPCNTPMDANLKLCKDDIGEAINPSLYRSLVG
        VDD++   N+     + K+ +K  F++ ++G + YFLG+E+ ++  G  +  ++ A DLL +  +    P + PMD ++       G+ ++   YR L+G
Subjt:  VDDLLFTGNDKFLCYDFKNFMKKKFEMSNMGLIFYFLGIEVNQN--GEIVISQQNAHDLLKKILIENALPCNTPMDANLKLCKDDIGEAINPSLYRSLVG

Query:  SLMYLTTTRLDILFAVS
         LMYL  TRLDI FAV+
Subjt:  SLMYLTTTRLDILFAVS

ATMG00810.1 DNA/RNA polymerases superfamily protein3.7e-1034.17Show/hide
Query:  LYVDDLLFTGNDKFLCYDFKNFMKKKFEMSNMGLIFYFLGIEV--NQNGEIVISQQNAHDLLKKILIENALPCNTPMDANLKLCKDDIGEAINPSLYRSL
        LYVDD+L TG+   L       +   F M ++G + YFLGI++  + +G  +   + A  +L    + +  P +TP+   L        +  +PS +RS+
Subjt:  LYVDDLLFTGNDKFLCYDFKNFMKKKFEMSNMGLIFYFLGIEV--NQNGEIVISQQNAHDLLKKILIENALPCNTPMDANLKLCKDDIGEAINPSLYRSL

Query:  VGSLMYLTTTRLDILFAVSI
        VG+L YLT TR DI +AV+I
Subjt:  VGSLMYLTTTRLDILFAVSI

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)1.5e-1637.12Show/hide
Query:  DPVTFDEVIQDEKWKIAMDQEVDVIRRNETWELIELLTKKRALGVKWVYRIKLKSDGNVEKYKAILVVKGYKQEYGVDYEEIFAPVTRIEIIRLILSLVA
        +P +    ++D  W  AM +E+D + RN+TW L+     +  LG KWV++ KL SDG +++ KA LV KG+ QE G+ + E ++PV R   IR IL++  
Subjt:  DPVTFDEVIQDEKWKIAMDQEVDVIRRNETWELIELLTKKRALGVKWVYRIKLKSDGNVEKYKAILVVKGYKQEYGVDYEEIFAPVTRIEIIRLILSLVA

Query:  Q--NGWKVYQMDVKSAFSNGQLKEKILLVQPL
        Q   G  +  M  K  FS G  K+K + +  L
Subjt:  Q--NGWKVYQMDVKSAFSNGQLKEKILLVQPL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGGCATATTCAAGAAATTTATAATGCTACTGACATAATTAATGATGATCATTTTGCTAATTTTGCATTGTTTGCTGGTGTTGATCCTGTAACTTTTGATGAAGTCAT
CCAAGATGAAAAATGGAAGATTGCAATGGATCAAGAAGTTGATGTGATAAGAAGAAATGAAACATGGGAGTTGATAGAGCTTCTGACAAAAAAACGAGCTCTTGGAGTAA
AATGGGTGTACAGAATAAAGTTGAAGTCAGATGGTAATGTTGAAAAATACAAGGCAATACTTGTTGTAAAAGGCTACAAGCAGGAATATGGTGTGGATTATGAAGAAATA
TTTGCCCCTGTGACAAGAATTGAGATCATTCGATTGATTTTGTCATTAGTTGCTCAAAATGGATGGAAAGTTTATCAAATGGATGTAAAATCCGCTTTTTCTAATGGACA
ATTGAAGGAAAAGATATTGCTTGTACAACCTTTGGGCTATGTGAAAAGGGGAGAAGAAGAAAAAGTGTACAAGTTGAAAAAGGCCTTGTATGGGTTGAAGCAAGCTCCGC
GAGCTTGGTACAATCATATCGACAATTTTTTTCTAAAGACAGGATTTCGAAGATGTCCATATGAGCATGAACTCTATGTCAAAGAAGACAAGTATGACAAATTTCTTATC
GTTTCTCTTTATGTTGATGATTTACTTTTTACTGGAAATGATAAATTTTTGTGTTATGATTTTAAAAATTTCATGAAAAAGAAATTTGAGATGAGTAATATGGGTCTCAT
CTTCTACTTTCTCGGAATTGAAGTTAATCAAAATGGAGAAATTGTTATTTCACAACAAAACGCTCATGATTTACTAAAAAAAATTCTAATTGAAAATGCTTTACCTTGCA
ACACTCCCATGGATGCAAATTTGAAATTATGCAAGGATGATATTGGAGAAGCTATCAATCCAAGTTTGTATCGAAGTTTAGTTGGAAGCTTAATGTACTTGACAACAACA
AGACTTGATATTTTATTTGCTGTAAGTATATGTTAA
mRNA sequenceShow/hide mRNA sequence
ATGAGGCATATTCAAGAAATTTATAATGCTACTGACATAATTAATGATGATCATTTTGCTAATTTTGCATTGTTTGCTGGTGTTGATCCTGTAACTTTTGATGAAGTCAT
CCAAGATGAAAAATGGAAGATTGCAATGGATCAAGAAGTTGATGTGATAAGAAGAAATGAAACATGGGAGTTGATAGAGCTTCTGACAAAAAAACGAGCTCTTGGAGTAA
AATGGGTGTACAGAATAAAGTTGAAGTCAGATGGTAATGTTGAAAAATACAAGGCAATACTTGTTGTAAAAGGCTACAAGCAGGAATATGGTGTGGATTATGAAGAAATA
TTTGCCCCTGTGACAAGAATTGAGATCATTCGATTGATTTTGTCATTAGTTGCTCAAAATGGATGGAAAGTTTATCAAATGGATGTAAAATCCGCTTTTTCTAATGGACA
ATTGAAGGAAAAGATATTGCTTGTACAACCTTTGGGCTATGTGAAAAGGGGAGAAGAAGAAAAAGTGTACAAGTTGAAAAAGGCCTTGTATGGGTTGAAGCAAGCTCCGC
GAGCTTGGTACAATCATATCGACAATTTTTTTCTAAAGACAGGATTTCGAAGATGTCCATATGAGCATGAACTCTATGTCAAAGAAGACAAGTATGACAAATTTCTTATC
GTTTCTCTTTATGTTGATGATTTACTTTTTACTGGAAATGATAAATTTTTGTGTTATGATTTTAAAAATTTCATGAAAAAGAAATTTGAGATGAGTAATATGGGTCTCAT
CTTCTACTTTCTCGGAATTGAAGTTAATCAAAATGGAGAAATTGTTATTTCACAACAAAACGCTCATGATTTACTAAAAAAAATTCTAATTGAAAATGCTTTACCTTGCA
ACACTCCCATGGATGCAAATTTGAAATTATGCAAGGATGATATTGGAGAAGCTATCAATCCAAGTTTGTATCGAAGTTTAGTTGGAAGCTTAATGTACTTGACAACAACA
AGACTTGATATTTTATTTGCTGTAAGTATATGTTAA
Protein sequenceShow/hide protein sequence
MRHIQEIYNATDIINDDHFANFALFAGVDPVTFDEVIQDEKWKIAMDQEVDVIRRNETWELIELLTKKRALGVKWVYRIKLKSDGNVEKYKAILVVKGYKQEYGVDYEEI
FAPVTRIEIIRLILSLVAQNGWKVYQMDVKSAFSNGQLKEKILLVQPLGYVKRGEEEKVYKLKKALYGLKQAPRAWYNHIDNFFLKTGFRRCPYEHELYVKEDKYDKFLI
VSLYVDDLLFTGNDKFLCYDFKNFMKKKFEMSNMGLIFYFLGIEVNQNGEIVISQQNAHDLLKKILIENALPCNTPMDANLKLCKDDIGEAINPSLYRSLVGSLMYLTTT
RLDILFAVSIC