; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cmc06g0165151 (gene) of Melon (Charmono) v1.1 genome

Gene IDCmc06g0165151
OrganismCucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
DescriptionGag/pol protein
Genome locationCMiso1.1chr06:16517162..16518266
RNA-Seq ExpressionCmc06g0165151
SyntenyCmc06g0165151
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0025945.1 gag/pol protein [Cucumis melo var. makuwa]1.0e-15277.81Show/hide
Query:  MEVKIAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYRLKQASRSWNIRFDTAIKSYGFDQNVDEPYVYKKINKGKVAFLVLYVDDILLIGNDVGY
        M+VK AFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIY LKQASRSWNIRFDTAIKSYGFDQNVDEP VYKKINKGKVAFLVLYVDDILLIGNDVGY
Subjt:  MEVKIAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYRLKQASRSWNIRFDTAIKSYGFDQNVDEPYVYKKINKGKVAFLVLYVDDILLIGNDVGY

Query:  LTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDHKNKMLALSQATYIDKILVRYSMQNSKKDLLPFRHGVHLSMEQCPKTPQEVEDMRCISLCLSCGQL---
        LTDVKAWLAAQFQMKDLGEAQYVLGIQIIRD KNK LALSQATYIDK+LVRYSMQNSKK LLPFRHGVHLS EQ PKTPQEVEDMR I    + G L   
Subjt:  LTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDHKNKMLALSQATYIDKILVRYSMQNSKKDLLPFRHGVHLSMEQCPKTPQEVEDMRCISLCLSCGQL---

Query:  -------------------------------------------------NDLIITGYNDSDFQTDKDSRKSTSGSVFTLNGGAVVWHGINASAWRSIKQG
                                                          DLI+TGY DSDFQTDKDSRKSTSGSVFTLNGGAVV        WRSIKQG
Subjt:  -------------------------------------------------NDLIITGYNDSDFQTDKDSRKSTSGSVFTLNGGAVVWHGINASAWRSIKQG

Query:  CIADSTMEAEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVTNSKEPRSHKRGKYIERKYHLIR
        CIADSTMEAEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAV NSKEPRSHKRGK+IERKYHLIR
Subjt:  CIADSTMEAEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVTNSKEPRSHKRGKYIERKYHLIR

KAA0033121.1 gag/pol protein [Cucumis melo var. makuwa]3.0e-14975.94Show/hide
Query:  MEVKIAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYRLKQASRSWNIRFDTAIKSYGFDQNVDEPYVYKKINKGKVAFLVLYVDDILLIGNDVGY
        M+V  AFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIY LKQASRSWNIRFDTAIKSYGF+QNVDEP VYKKINKGKV FLVLYVDDILLIGNDVGY
Subjt:  MEVKIAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYRLKQASRSWNIRFDTAIKSYGFDQNVDEPYVYKKINKGKVAFLVLYVDDILLIGNDVGY

Query:  LTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDHKNKMLALSQATYIDKILVRYSMQNSKKDLLPFRHGVHLSMEQCPKTPQEVEDMRCISLCLSCGQL---
        LTDVKAWLAAQFQMKDLGEAQYVLGIQIIRD KNK LALSQATYIDK+LVRYSMQNSKK LLPFRHGVHLS EQCPKTPQEVEDMR I    + G L   
Subjt:  LTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDHKNKMLALSQATYIDKILVRYSMQNSKKDLLPFRHGVHLSMEQCPKTPQEVEDMRCISLCLSCGQL---

Query:  -------------------------------------------------NDLIITGYNDSDFQTDKDSRKSTSGSVFTLNGGAVVWHGINASAWRSIKQG
                                                          DLI+TGY DSDFQT+KDSRKSTS SVFTLNGGA+V        WRSIKQG
Subjt:  -------------------------------------------------NDLIITGYNDSDFQTDKDSRKSTSGSVFTLNGGAVVWHGINASAWRSIKQG

Query:  CIADSTMEAEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVTNSKEPRSHKRGKYIERKYHLIR
        CIADSTMEAEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAV NSKEPRSHKR K+IERKYHLI+
Subjt:  CIADSTMEAEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVTNSKEPRSHKRGKYIERKYHLIR

KAA0035907.1 gag/pol protein [Cucumis melo var. makuwa]2.1e-15076.74Show/hide
Query:  MEVKIAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYRLKQASRSWNIRFDTAIKSYGFDQNVDEPYVYKKINKGKVAFLVLYVDDILLIGNDVGY
        M+VK AFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIY LKQASRSWNIRFDTAIKSYGFDQNVDEP VYKKINKGKVAFLVLYVDDILLIGNDVGY
Subjt:  MEVKIAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYRLKQASRSWNIRFDTAIKSYGFDQNVDEPYVYKKINKGKVAFLVLYVDDILLIGNDVGY

Query:  LTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDHKNKMLALSQATYIDKILVRYSMQNSKKDLLPFRHGVHLSMEQCPKTPQEVEDMRCISLCLSCGQL---
        LTDVKAWLAAQFQMKDLGE QYVLGIQIIRD KNK LALSQATYIDK+LVRYSMQNSKK LLPFRHGVHLS EQ PKTPQEVEDMR I    + G L   
Subjt:  LTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDHKNKMLALSQATYIDKILVRYSMQNSKKDLLPFRHGVHLSMEQCPKTPQEVEDMRCISLCLSCGQL---

Query:  -------------------------------------------------NDLIITGYNDSDFQTDKDSRKSTSGSVFTLNGGAVVWHGINASAWRSIKQG
                                                          DLI+TGY +SDFQTDKDSRKSTS SVFTLNGGAVV        WRSIKQG
Subjt:  -------------------------------------------------NDLIITGYNDSDFQTDKDSRKSTSGSVFTLNGGAVVWHGINASAWRSIKQG

Query:  CIADSTMEAEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVTNSKEPRSHKRGKYIERKYHLIR
        CIADSTMEAEYVAACEAAKEAVWL+KFLHDLEVVPNMNLPITLYCDNSGAV NSKEPRSHKRGK+IERKYHLIR
Subjt:  CIADSTMEAEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVTNSKEPRSHKRGKYIERKYHLIR

KAA0040367.1 gag/pol protein [Cucumis melo var. makuwa]1.8e-14975.4Show/hide
Query:  MEVKIAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYRLKQASRSWNIRFDTAIKSYGFDQNVDEPYVYKKINKGKVAFLVLYVDDILLIGNDVGY
        M+VK AFLN NLEESIFMSQPEGFITQGQEQKVCKLNRSIY LKQ+SRSWN+RFDTAIKSYGFDQNVDEP VYKKINKGKVAFLVLYVDDILLIGNDVGY
Subjt:  MEVKIAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYRLKQASRSWNIRFDTAIKSYGFDQNVDEPYVYKKINKGKVAFLVLYVDDILLIGNDVGY

Query:  LTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDHKNKMLALSQATYIDKILVRYSMQNSKKDLLPFRHGVHLSMEQCPKTPQEVEDMRCISLCLSCGQL---
        LTDVKAWLA QFQMKDLGE QYVLGIQIIRD KNK LALSQATYIDK+LVRYSMQNSKKDLLPFRHGVHLS EQCPKTPQE+EDMR I    + G L   
Subjt:  LTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDHKNKMLALSQATYIDKILVRYSMQNSKKDLLPFRHGVHLSMEQCPKTPQEVEDMRCISLCLSCGQL---

Query:  -------------------------------------------------NDLIITGYNDSDFQTDKDSRKSTSGSVFTLNGGAVVWHGINASAWRSIKQG
                                                          DLI+TGY DSDFQTDKDSRKSTSGSVFTLN GAVVWH        SIKQG
Subjt:  -------------------------------------------------NDLIITGYNDSDFQTDKDSRKSTSGSVFTLNGGAVVWHGINASAWRSIKQG

Query:  CIADSTMEAEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVTNSKEPRSHKRGKYIERKYHLIR
        CIADSTMEAEY+AACEAAKE VWLRKFLHDLEVVPNMNL ITLYCDNSGAV NSKEPR+HKRGK+IERKYHLIR
Subjt:  CIADSTMEAEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVTNSKEPRSHKRGKYIERKYHLIR

KAA0059226.1 gag/pol protein [Cucumis melo var. makuwa]1.0e-15277.81Show/hide
Query:  MEVKIAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYRLKQASRSWNIRFDTAIKSYGFDQNVDEPYVYKKINKGKVAFLVLYVDDILLIGNDVGY
        M+VK AFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIY LKQASRSWNIRFDTAIKSYGFDQNVDEP VYKKINKGKVAFLVLYVDDILLIGNDVGY
Subjt:  MEVKIAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYRLKQASRSWNIRFDTAIKSYGFDQNVDEPYVYKKINKGKVAFLVLYVDDILLIGNDVGY

Query:  LTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDHKNKMLALSQATYIDKILVRYSMQNSKKDLLPFRHGVHLSMEQCPKTPQEVEDMRCISLCLSCGQL---
        LTDVKAWLAAQFQMKDLGEAQYVLGIQIIRD KNK LALSQATYIDK+LVRYSMQNSKK LLPFRHGVHLS EQ PKTPQEVEDMR I    + G L   
Subjt:  LTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDHKNKMLALSQATYIDKILVRYSMQNSKKDLLPFRHGVHLSMEQCPKTPQEVEDMRCISLCLSCGQL---

Query:  -------------------------------------------------NDLIITGYNDSDFQTDKDSRKSTSGSVFTLNGGAVVWHGINASAWRSIKQG
                                                          DLI+TGY DSDFQTDKDSRKSTSGSVFTLNGGAVV        WRSIKQG
Subjt:  -------------------------------------------------NDLIITGYNDSDFQTDKDSRKSTSGSVFTLNGGAVVWHGINASAWRSIKQG

Query:  CIADSTMEAEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVTNSKEPRSHKRGKYIERKYHLIR
        CIADSTMEAEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAV NSKEPRSHKRGK+IERKYHLIR
Subjt:  CIADSTMEAEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVTNSKEPRSHKRGKYIERKYHLIR

TrEMBL top hitse value%identityAlignment
A0A5A7T2V9 Gag/pol protein1.0e-15076.74Show/hide
Query:  MEVKIAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYRLKQASRSWNIRFDTAIKSYGFDQNVDEPYVYKKINKGKVAFLVLYVDDILLIGNDVGY
        M+VK AFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIY LKQASRSWNIRFDTAIKSYGFDQNVDEP VYKKINKGKVAFLVLYVDDILLIGNDVGY
Subjt:  MEVKIAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYRLKQASRSWNIRFDTAIKSYGFDQNVDEPYVYKKINKGKVAFLVLYVDDILLIGNDVGY

Query:  LTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDHKNKMLALSQATYIDKILVRYSMQNSKKDLLPFRHGVHLSMEQCPKTPQEVEDMRCISLCLSCGQL---
        LTDVKAWLAAQFQMKDLGE QYVLGIQIIRD KNK LALSQATYIDK+LVRYSMQNSKK LLPFRHGVHLS EQ PKTPQEVEDMR I    + G L   
Subjt:  LTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDHKNKMLALSQATYIDKILVRYSMQNSKKDLLPFRHGVHLSMEQCPKTPQEVEDMRCISLCLSCGQL---

Query:  -------------------------------------------------NDLIITGYNDSDFQTDKDSRKSTSGSVFTLNGGAVVWHGINASAWRSIKQG
                                                          DLI+TGY +SDFQTDKDSRKSTS SVFTLNGGAVV        WRSIKQG
Subjt:  -------------------------------------------------NDLIITGYNDSDFQTDKDSRKSTSGSVFTLNGGAVVWHGINASAWRSIKQG

Query:  CIADSTMEAEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVTNSKEPRSHKRGKYIERKYHLIR
        CIADSTMEAEYVAACEAAKEAVWL+KFLHDLEVVPNMNLPITLYCDNSGAV NSKEPRSHKRGK+IERKYHLIR
Subjt:  CIADSTMEAEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVTNSKEPRSHKRGKYIERKYHLIR

A0A5A7TZD0 Gag/pol protein4.9e-15377.81Show/hide
Query:  MEVKIAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYRLKQASRSWNIRFDTAIKSYGFDQNVDEPYVYKKINKGKVAFLVLYVDDILLIGNDVGY
        M+VK AFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIY LKQASRSWNIRFDTAIKSYGFDQNVDEP VYKKINKGKVAFLVLYVDDILLIGNDVGY
Subjt:  MEVKIAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYRLKQASRSWNIRFDTAIKSYGFDQNVDEPYVYKKINKGKVAFLVLYVDDILLIGNDVGY

Query:  LTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDHKNKMLALSQATYIDKILVRYSMQNSKKDLLPFRHGVHLSMEQCPKTPQEVEDMRCISLCLSCGQL---
        LTDVKAWLAAQFQMKDLGEAQYVLGIQIIRD KNK LALSQATYIDK+LVRYSMQNSKK LLPFRHGVHLS EQ PKTPQEVEDMR I    + G L   
Subjt:  LTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDHKNKMLALSQATYIDKILVRYSMQNSKKDLLPFRHGVHLSMEQCPKTPQEVEDMRCISLCLSCGQL---

Query:  -------------------------------------------------NDLIITGYNDSDFQTDKDSRKSTSGSVFTLNGGAVVWHGINASAWRSIKQG
                                                          DLI+TGY DSDFQTDKDSRKSTSGSVFTLNGGAVV        WRSIKQG
Subjt:  -------------------------------------------------NDLIITGYNDSDFQTDKDSRKSTSGSVFTLNGGAVVWHGINASAWRSIKQG

Query:  CIADSTMEAEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVTNSKEPRSHKRGKYIERKYHLIR
        CIADSTMEAEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAV NSKEPRSHKRGK+IERKYHLIR
Subjt:  CIADSTMEAEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVTNSKEPRSHKRGKYIERKYHLIR

A0A5A7UYE8 Gag/pol protein4.9e-15377.81Show/hide
Query:  MEVKIAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYRLKQASRSWNIRFDTAIKSYGFDQNVDEPYVYKKINKGKVAFLVLYVDDILLIGNDVGY
        M+VK AFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIY LKQASRSWNIRFDTAIKSYGFDQNVDEP VYKKINKGKVAFLVLYVDDILLIGNDVGY
Subjt:  MEVKIAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYRLKQASRSWNIRFDTAIKSYGFDQNVDEPYVYKKINKGKVAFLVLYVDDILLIGNDVGY

Query:  LTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDHKNKMLALSQATYIDKILVRYSMQNSKKDLLPFRHGVHLSMEQCPKTPQEVEDMRCISLCLSCGQL---
        LTDVKAWLAAQFQMKDLGEAQYVLGIQIIRD KNK LALSQATYIDK+LVRYSMQNSKK LLPFRHGVHLS EQ PKTPQEVEDMR I    + G L   
Subjt:  LTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDHKNKMLALSQATYIDKILVRYSMQNSKKDLLPFRHGVHLSMEQCPKTPQEVEDMRCISLCLSCGQL---

Query:  -------------------------------------------------NDLIITGYNDSDFQTDKDSRKSTSGSVFTLNGGAVVWHGINASAWRSIKQG
                                                          DLI+TGY DSDFQTDKDSRKSTSGSVFTLNGGAVV        WRSIKQG
Subjt:  -------------------------------------------------NDLIITGYNDSDFQTDKDSRKSTSGSVFTLNGGAVVWHGINASAWRSIKQG

Query:  CIADSTMEAEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVTNSKEPRSHKRGKYIERKYHLIR
        CIADSTMEAEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAV NSKEPRSHKRGK+IERKYHLIR
Subjt:  CIADSTMEAEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVTNSKEPRSHKRGKYIERKYHLIR

A0A5D3CZY3 Gag/pol protein1.5e-14975.94Show/hide
Query:  MEVKIAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYRLKQASRSWNIRFDTAIKSYGFDQNVDEPYVYKKINKGKVAFLVLYVDDILLIGNDVGY
        M+V  AFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIY LKQASRSWNIRFDTAIKSYGF+QNVDEP VYKKINKGKV FLVLYVDDILLIGNDVGY
Subjt:  MEVKIAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYRLKQASRSWNIRFDTAIKSYGFDQNVDEPYVYKKINKGKVAFLVLYVDDILLIGNDVGY

Query:  LTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDHKNKMLALSQATYIDKILVRYSMQNSKKDLLPFRHGVHLSMEQCPKTPQEVEDMRCISLCLSCGQL---
        LTDVKAWLAAQFQMKDLGEAQYVLGIQIIRD KNK LALSQATYIDK+LVRYSMQNSKK LLPFRHGVHLS EQCPKTPQEVEDMR I    + G L   
Subjt:  LTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDHKNKMLALSQATYIDKILVRYSMQNSKKDLLPFRHGVHLSMEQCPKTPQEVEDMRCISLCLSCGQL---

Query:  -------------------------------------------------NDLIITGYNDSDFQTDKDSRKSTSGSVFTLNGGAVVWHGINASAWRSIKQG
                                                          DLI+TGY DSDFQT+KDSRKSTS SVFTLNGGA+V        WRSIKQG
Subjt:  -------------------------------------------------NDLIITGYNDSDFQTDKDSRKSTSGSVFTLNGGAVVWHGINASAWRSIKQG

Query:  CIADSTMEAEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVTNSKEPRSHKRGKYIERKYHLIR
        CIADSTMEAEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAV NSKEPRSHKR K+IERKYHLI+
Subjt:  CIADSTMEAEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVTNSKEPRSHKRGKYIERKYHLIR

A0A5D3DI92 Gag/pol protein8.6e-15075.4Show/hide
Query:  MEVKIAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYRLKQASRSWNIRFDTAIKSYGFDQNVDEPYVYKKINKGKVAFLVLYVDDILLIGNDVGY
        M+VK AFLN NLEESIFMSQPEGFITQGQEQKVCKLNRSIY LKQ+SRSWN+RFDTAIKSYGFDQNVDEP VYKKINKGKVAFLVLYVDDILLIGNDVGY
Subjt:  MEVKIAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYRLKQASRSWNIRFDTAIKSYGFDQNVDEPYVYKKINKGKVAFLVLYVDDILLIGNDVGY

Query:  LTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDHKNKMLALSQATYIDKILVRYSMQNSKKDLLPFRHGVHLSMEQCPKTPQEVEDMRCISLCLSCGQL---
        LTDVKAWLA QFQMKDLGE QYVLGIQIIRD KNK LALSQATYIDK+LVRYSMQNSKKDLLPFRHGVHLS EQCPKTPQE+EDMR I    + G L   
Subjt:  LTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDHKNKMLALSQATYIDKILVRYSMQNSKKDLLPFRHGVHLSMEQCPKTPQEVEDMRCISLCLSCGQL---

Query:  -------------------------------------------------NDLIITGYNDSDFQTDKDSRKSTSGSVFTLNGGAVVWHGINASAWRSIKQG
                                                          DLI+TGY DSDFQTDKDSRKSTSGSVFTLN GAVVWH        SIKQG
Subjt:  -------------------------------------------------NDLIITGYNDSDFQTDKDSRKSTSGSVFTLNGGAVVWHGINASAWRSIKQG

Query:  CIADSTMEAEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVTNSKEPRSHKRGKYIERKYHLIR
        CIADSTMEAEY+AACEAAKE VWLRKFLHDLEVVPNMNL ITLYCDNSGAV NSKEPR+HKRGK+IERKYHLIR
Subjt:  CIADSTMEAEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVTNSKEPRSHKRGKYIERKYHLIR

SwissProt top hitse value%identityAlignment
P04146 Copia protein7.7e-3929.95Show/hide
Query:  MEVKIAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYRLKQASRSWNIRFDTAIKSYGF-DQNVDE-PYVYKKINKGKVAFLVLYVDDILLIGNDV
        M+VK AFLNG L+E I+M  P+G         VCKLN++IY LKQA+R W   F+ A+K   F + +VD   Y+  K N  +  +++LYVDD+++   D+
Subjt:  MEVKIAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYRLKQASRSWNIRFDTAIKSYGF-DQNVDE-PYVYKKINKGKVAFLVLYVDDILLIGNDV

Query:  GYLTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDHKNKMLALSQATYIDKILVRYSMQNSKKDLLPFRHGVHLSM----EQCPKTPQEVEDMRCISLCLSC
          + + K +L  +F+M DL E ++ +GI+I  + +   + LSQ+ Y+ KIL +++M+N      P    ++  +    E C    + +  + C+   + C
Subjt:  GYLTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDHKNKMLALSQATYIDKILVRYSMQNSKKDLLPFRHGVHLSM----EQCPKTPQEVEDMRCISLCLSC

Query:  -----------------------------------GQLNDLI-----------ITGYNDSDFQTDKDSRKSTSGSVFTLNGGAVVWHGINASAWRSIKQG
                                           G ++  +           I GY DSD+   +  RKST+G +F +          N   W + +Q 
Subjt:  -----------------------------------GQLNDLI-----------ITGYNDSDFQTDKDSRKSTSGSVFTLNGGAVVWHGINASAWRSIKQG

Query:  CIADSTMEAEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVTNSKEPRSHKRGKYIERKYHLIR
         +A S+ EAEY+A  EA +EA+WL+  L  + +   +  PI +Y DN G ++ +  P  HKR K+I+ KYH  R
Subjt:  CIADSTMEAEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVTNSKEPRSHKRGKYIERKYHLIR

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-948.5e-6236Show/hide
Query:  MEVKIAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYRLKQASRSWNIRFDTAIKSYGFDQNVDEPYVY-KKINKGKVAFLVLYVDDILLIGNDVG
        ++VK AFL+G+LEE I+M QPEGF   G++  VCKLN+S+Y LKQA R W ++FD+ +KS  + +   +P VY K+ ++     L+LYVDD+L++G D G
Subjt:  MEVKIAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYRLKQASRSWNIRFDTAIKSYGFDQNVDEPYVY-KKINKGKVAFLVLYVDDILLIGNDVG

Query:  YLTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDHKNKMLALSQATYIDKILVRYSMQNSKKDLLPFRHGVHLSMEQCPKTPQEVEDMRCISLCLSCGQL--
         +  +K  L+  F MKDLG AQ +LG++I+R+  ++ L LSQ  YI+++L R++M+N+K    P    + LS + CP T +E  +M  +    + G L  
Subjt:  YLTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDHKNKMLALSQATYIDKILVRYSMQNSKKDLLPFRHGVHLSMEQCPKTPQEVEDMRCISLCLSCGQL--

Query:  --------------------------------------------------NDLIITGYNDSDFQTDKDSRKSTSGSVFTLNGGAVVWHGINASAWRSIKQ
                                                          +D I+ GY D+D   D D+RKS++G +FT +GGA+        +W+S  Q
Subjt:  --------------------------------------------------NDLIITGYNDSDFQTDKDSRKSTSGSVFTLNGGAVVWHGINASAWRSIKQ

Query:  GCIADSTMEAEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVTNSKEPRSHKRGKYIERKYHLIR
         C+A ST EAEY+AA E  KE +WL++FL +L +         +YCD+  A+  SK    H R K+I+ +YH IR
Subjt:  GCIADSTMEAEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVTNSKEPRSHKRGKYIERKYHLIR

P25600 Putative transposon Ty5-1 protein YCL074W1.3e-1727.22Show/hide
Query:  MEVKIAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYRLKQASRSWNIRFDTAIKSYGFDQNVDEPYVYKKINKGKVAFLVLYVDDILLIGNDVGY
        M+V  AFLN  ++E I++ QP GF+ +     V +L   +Y LKQA   WN   +  +K  GF ++  E  +Y +       ++ +YVDD+L+       
Subjt:  MEVKIAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYRLKQASRSWNIRFDTAIKSYGFDQNVDEPYVYKKINKGKVAFLVLYVDDILLIGNDVGY

Query:  LTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDHKNKMLALSQATYIDKI----------LVRYSMQNSK----------KDLLPFRHGVHLSMEQCPKTPQ
           VK  L   + MKDLG+    LG+  I    N  + LS   YI K           L +  + NSK          KD+ P++  V   +  C  T +
Subjt:  LTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDHKNKMLALSQATYIDKI----------LVRYSMQNSK----------KDLLPFRHGVHLSMEQCPKTPQ

Query:  --------------------EVEDMRCI--------SLCLSCGQLNDLIITGYNDSDFQTDKDSRKSTSGSVFTLNGGAVVWHGINASAWRSIKQGCIAD
                             +E  R +        S+CL     + L +T Y D+      D   ST G V  L G  V W        + +K G I  
Subjt:  --------------------EVEDMRCI--------SLCLSCGQLNDLIITGYNDSDFQTDKDSRKSTSGSVFTLNGGAVVWHGINASAWRSIKQGCIAD

Query:  STMEAEYVAACEAAKE
         + EAEY+ A E   E
Subjt:  STMEAEYVAACEAAKE

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE13.7e-3328.8Show/hide
Query:  MEVKIAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYRLKQASRSWNIRFDTAIKSYGFDQNVDEPYVYKKINKGKVAFLVLYVDDILLIGNDVGY
        ++V  AFL G L + ++MSQP GFI + +   VCKL +++Y LKQA R+W +     + + GF  +V +  ++       + ++++YVDDIL+ GND   
Subjt:  MEVKIAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYRLKQASRSWNIRFDTAIKSYGFDQNVDEPYVYKKINKGKVAFLVLYVDDILLIGNDVGY

Query:  LTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDHKNKMLALSQATYIDKILVRYSMQNSKKDLLPFRHGVHLSMEQCPK-----------------------
        L +    L+ +F +KD  E  Y LGI+  R      L LSQ  YI  +L R +M  +K    P      LS+    K                       
Subjt:  LTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDHKNKMLALSQATYIDKILVRYSMQNSKKDLLPFRHGVHLSMEQCPK-----------------------

Query:  ---------------TPQEVEDMRCISLCLS--------CGQLNDLIITGYNDSDFQTDKDSRKSTSGSVFTLNGGAVVWHGINASAWRSIKQGCIADST
                       T + ++ ++ I   L+          + N L +  Y+D+D+  DKD   ST+        G +V+ G +  +W S KQ  +  S+
Subjt:  ---------------TPQEVEDMRCISLCLS--------CGQLNDLIITGYNDSDFQTDKDSRKSTSGSVFTLNGGAVVWHGINASAWRSIKQGCIADST

Query:  MEAEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVTNSKEPRSHKRGKYIERKYHLIR
         EAEY +    + E  W+   L +L +   +  P  +YCDN GA      P  H R K+I   YH IR
Subjt:  MEAEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVTNSKEPRSHKRGKYIERKYHLIR

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE21.4e-3227.45Show/hide
Query:  MEVKIAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYRLKQASRSWNIRFDTAIKSYGFDQNVDEPYVYKKINKGKVAFLVLYVDDILLIGNDVGY
        ++V  AFL G L + ++MSQP GF+ + +   VC+L ++IY LKQA R+W +   T + + GF  ++ +  ++       + ++++YVDDIL+ GND   
Subjt:  MEVKIAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYRLKQASRSWNIRFDTAIKSYGFDQNVDEPYVYKKINKGKVAFLVLYVDDILLIGNDVGY

Query:  LTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDHKNKMLALSQATYIDKILVRYSMQNSKKDLLPFRHGVHLSMEQCPKTPQEVEDMRCI-----------S
        L      L+ +F +K+  +  Y LGI+  R  +   L LSQ  Y   +L R +M  +K    P      L++    K P   E    +            
Subjt:  LTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDHKNKMLALSQATYIDKILVRYSMQNSKKDLLPFRHGVHLSMEQCPKTPQEVEDMRCI-----------S

Query:  LCLSCGQL-----------------------------------NDLIITGYNDSDFQTDKDSRKSTSGSVFTLNGGAVVWHGINASAWRSIKQGCIADST
        L  +  +L                                   N L +  Y+D+D+  D D   ST+        G +V+ G +  +W S KQ  +  S+
Subjt:  LCLSCGQL-----------------------------------NDLIITGYNDSDFQTDKDSRKSTSGSVFTLNGGAVVWHGINASAWRSIKQGCIADST

Query:  MEAEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVTNSKEPRSHKRGKYIERKYHLIR
         EAEY +    + E  W+   L +L +   ++ P  +YCDN GA      P  H R K+I   YH IR
Subjt:  MEAEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVTNSKEPRSHKRGKYIERKYHLIR

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 88.5e-3326.61Show/hide
Query:  MEVKIAFLNGNLEESIFMSQPEGFIT-QGQE---QKVCKLNRSIYRLKQASRSWNIRFDTAIKSYGFDQNVDEPYVYKKINKGKVAFLVLYVDDILLIGN
        +++  AFLNG+L+E I+M  P G+   QG       VC L +SIY LKQASR W ++F   +  +GF Q+  +   + KI       +++YVDDI++  N
Subjt:  MEVKIAFLNGNLEESIFMSQPEGFIT-QGQE---QKVCKLNRSIYRLKQASRSWNIRFDTAIKSYGFDQNVDEPYVYKKINKGKVAFLVLYVDDILLIGN

Query:  DVGYLTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDHKNKMLALSQATYIDKILVRYSMQNSKKDLLPFRHGVHLS-------------------------
        +   + ++K+ L + F+++DLG  +Y LG++I R      + + Q  Y   +L    +   K   +P    V  S                         
Subjt:  DVGYLTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDHKNKMLALSQATYIDKILVRYSMQNSKKDLLPFRHGVHLS-------------------------

Query:  --------------MEQCPKTPQEVEDMRCISLC-------LSCGQLNDLIITGYNDSDFQTDKDSRKSTSGSVFTLNGGAVVWHGINASAWRSIKQGCI
                        + P+   +   M+ +          L      ++ +  ++D+ FQ+ KD+R+ST+G    L        G +  +W+S KQ  +
Subjt:  --------------MEQCPKTPQEVEDMRCISLC-------LSCGQLNDLIITGYNDSDFQTDKDSRKSTSGSVFTLNGGAVVWHGINASAWRSIKQGCI

Query:  ADSTMEAEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVTNSKEPRSHKRGKYIERKYHLIR
        + S+ EAEY A   A  E +WL +F  +L++   ++ P  L+CDN+ A+  +     H+R K+IE   H +R
Subjt:  ADSTMEAEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVTNSKEPRSHKRGKYIERKYHLIR

ATMG00810.1 DNA/RNA polymerases superfamily protein8.0e-0744.74Show/hide
Query:  FLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDHKNKMLALSQATYIDKILVRYSMQNSK
        +L+LYVDDILL G+    L  +   L++ F MKDLG   Y LGIQ I+ H +  L LSQ  Y ++IL    M + K
Subjt:  FLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDHKNKMLALSQATYIDKILVRYSMQNSK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAGTCAAGATTGCTTTTCTGAATGGCAATCTTGAAGAGAGTATCTTTATGTCTCAGCCCGAGGGGTTCATAACCCAAGGTCAAGAGCAAAAAGTTTGCAAGTTGAA
TCGATCCATTTATAGGTTGAAACAAGCATCTAGATCTTGGAACATTAGGTTTGATACTGCGATCAAATCCTACGGTTTTGACCAGAACGTTGATGAACCTTATGTATATA
AGAAAATCAACAAAGGAAAAGTAGCTTTCTTAGTACTTTATGTGGACGATATCCTCCTCATTGGGAATGATGTGGGATACCTTACTGACGTTAAAGCTTGGCTAGCAGCC
CAATTCCAAATGAAAGATTTAGGAGAGGCACAATATGTTCTTGGGATCCAAATCATAAGGGATCATAAGAACAAAATGCTAGCATTGTCTCAAGCAACCTATATCGACAA
AATATTGGTTCGATATTCGATGCAGAACTCTAAGAAGGATTTATTACCTTTCAGGCATGGGGTTCACTTGTCTATGGAACAGTGTCCTAAGACACCTCAAGAAGTTGAGG
ATATGAGATGTATTTCCCTATGCCTCAGCTGTGGGCAGCTTAATGATTTGATCATTACAGGATACAATGACTCTGATTTCCAAACTGATAAGGATTCTAGAAAATCTACG
TCAGGATCAGTGTTCACCCTAAATGGGGGAGCTGTAGTATGGCATGGCATCAATGCATCAGCATGGCGTAGCATCAAGCAAGGATGCATTGCAGACTCTACAATGGAGGC
TGAATACGTCGCTGCTTGTGAAGCAGCAAAAGAAGCAGTTTGGCTTAGGAAGTTCTTACATGATTTGGAAGTTGTTCCAAACATGAACTTGCCCATCACTCTATATTGCG
ATAACAGTGGGGCAGTAACCAATTCTAAAGAACCTCGCAGCCATAAACGAGGGAAATACATAGAGAGGAAGTATCACCTGATACGGTAG
mRNA sequenceShow/hide mRNA sequence
ATGGAAGTCAAGATTGCTTTTCTGAATGGCAATCTTGAAGAGAGTATCTTTATGTCTCAGCCCGAGGGGTTCATAACCCAAGGTCAAGAGCAAAAAGTTTGCAAGTTGAA
TCGATCCATTTATAGGTTGAAACAAGCATCTAGATCTTGGAACATTAGGTTTGATACTGCGATCAAATCCTACGGTTTTGACCAGAACGTTGATGAACCTTATGTATATA
AGAAAATCAACAAAGGAAAAGTAGCTTTCTTAGTACTTTATGTGGACGATATCCTCCTCATTGGGAATGATGTGGGATACCTTACTGACGTTAAAGCTTGGCTAGCAGCC
CAATTCCAAATGAAAGATTTAGGAGAGGCACAATATGTTCTTGGGATCCAAATCATAAGGGATCATAAGAACAAAATGCTAGCATTGTCTCAAGCAACCTATATCGACAA
AATATTGGTTCGATATTCGATGCAGAACTCTAAGAAGGATTTATTACCTTTCAGGCATGGGGTTCACTTGTCTATGGAACAGTGTCCTAAGACACCTCAAGAAGTTGAGG
ATATGAGATGTATTTCCCTATGCCTCAGCTGTGGGCAGCTTAATGATTTGATCATTACAGGATACAATGACTCTGATTTCCAAACTGATAAGGATTCTAGAAAATCTACG
TCAGGATCAGTGTTCACCCTAAATGGGGGAGCTGTAGTATGGCATGGCATCAATGCATCAGCATGGCGTAGCATCAAGCAAGGATGCATTGCAGACTCTACAATGGAGGC
TGAATACGTCGCTGCTTGTGAAGCAGCAAAAGAAGCAGTTTGGCTTAGGAAGTTCTTACATGATTTGGAAGTTGTTCCAAACATGAACTTGCCCATCACTCTATATTGCG
ATAACAGTGGGGCAGTAACCAATTCTAAAGAACCTCGCAGCCATAAACGAGGGAAATACATAGAGAGGAAGTATCACCTGATACGGTAG
Protein sequenceShow/hide protein sequence
MEVKIAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYRLKQASRSWNIRFDTAIKSYGFDQNVDEPYVYKKINKGKVAFLVLYVDDILLIGNDVGYLTDVKAWLAA
QFQMKDLGEAQYVLGIQIIRDHKNKMLALSQATYIDKILVRYSMQNSKKDLLPFRHGVHLSMEQCPKTPQEVEDMRCISLCLSCGQLNDLIITGYNDSDFQTDKDSRKST
SGSVFTLNGGAVVWHGINASAWRSIKQGCIADSTMEAEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVTNSKEPRSHKRGKYIERKYHLIR