; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmoCh04G012790 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene IDCmoCh04G012790
OrganismCucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon RE1
Genome locationCmo_Chr04:6485651..6486760
RNA-Seq ExpressionCmoCh04G012790
SyntenyCmoCh04G012790
Gene Ontology termsNA
InterPro domainsIPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAE8655616.1 hypothetical protein F3Y22_tig00117021pilonHSYRG00028 [Hibiscus syriacus]2.8e-9845.05Show/hide
Query:  NLEYAN-AVEDEVNEPEIYEEASQNSAWQKAMEEETIALEQNQTWELVSRPRDTKLISCKWAYEIMCTPDGSIVRYKAQTIDSGFSQEYGLDYDETIRPV
        N +YAN A+ +E  EPE +EEAS++S W  AM+EE  AL+QNQTW++V + +D K ISCKW Y+I   PDGSI RYKA+ +  GFSQ+YGLDYDET  PV
Subjt:  NLEYAN-AVEDEVNEPEIYEEASQNSAWQKAMEEETIALEQNQTWELVSRPRDTKLISCKWAYEIMCTPDGSIVRYKAQTIDSGFSQEYGLDYDETIRPV

Query:  AKIIIVQVPPVLTVNKDWKLWQMDMNNAFLQGELNREFYMDQPKKFENE---------------------------------------------------
        AK+  V+V   L  NKDW LWQMD+ NAFL GEL+RE YM QP  F+++                                                   
Subjt:  AKIIIVQVPPVLTVNKDWKLWQMDMNNAFLQGELNREFYMDQPKKFENE---------------------------------------------------

Query:  --------------------------------------------------------------------VGVFDQYMQNLKKPYLDTARPTLRYVK-----
                                                                             GV  +YMQN KKP+L+  R  LRYVK     
Subjt:  --------------------------------------------------------------------VGVFDQYMQNLKKPYLDTARPTLRYVK-----

Query:  ---------------------GDHDTRRSITGYVFKLGSRTIFWCNKRQQTISLSTKEAEYRAAAGATQENTCLKLLMEDWHQKIEYPIPLYCNNQSAIR
                             GDHDTRRS TGYVFKLGS TI WC+KRQ T+SLST EAEYRAAA A QE+T L  LM + HQ ++Y IPLYC+NQSAIR
Subjt:  ---------------------GDHDTRRSITGYVFKLGSRTIFWCNKRQQTISLSTKEAEYRAAAGATQENTCLKLLMEDWHQKIEYPIPLYCNNQSAIR

Query:  LAENPVFHARTKHVEVHNHFIREKVLKEEIKMQQIKTDVQVADLFTKGLNTGKHESFRCHLNMVQRMRTSAEGEC
        LAENPVFHARTKHVEVH HF+REKVL+EEI+M+QIKTD Q+ADLFTK L+ GK E FR    ++QRM  + EGEC
Subjt:  LAENPVFHARTKHVEVHNHFIREKVLKEEIKMQQIKTDVQVADLFTKGLNTGKHESFRCHLNMVQRMRTSAEGEC

KAE8667231.1 tir-nbs resistance protein [Hibiscus syriacus]1.2e-10148.89Show/hide
Query:  DSSDTSVGEQEVTQPSEPSEN--------EMTPQQLRHSEIILKENLEYANA-VEDEVNEPEIYEEASQNSAWQKAMEEETIALEQNQTWELVSRPRDTK
        D     +GE +V   +   E+        E    Q   S  I + N +YANA + ++  EPE +EEAS++S W  AM+EE  AL+QNQTW++V + +D K
Subjt:  DSSDTSVGEQEVTQPSEPSEN--------EMTPQQLRHSEIILKENLEYANA-VEDEVNEPEIYEEASQNSAWQKAMEEETIALEQNQTWELVSRPRDTK

Query:  LISCKWAYEIMCTPDGSIVRYKAQTIDSGFSQEYGLDYDETIRPVAKIIIVQVPPVLTVNKDWKLWQMDMNNAFLQGELNREFYMDQPKKF---------
         ISCKW Y+I   PDGSI RYKA+ +  GFSQ+YGLDYDET  PVAK+  V+V   L  NKDW LWQMD+ NAFL GEL+RE YM QP +          
Subjt:  LISCKWAYEIMCTPDGSIVRYKAQTIDSGFSQEYGLDYDETIRPVAKIIIVQVPPVLTVNKDWKLWQMDMNNAFLQGELNREFYMDQPKKF---------

Query:  ----------------------------------------ENEVGVFDQYMQNLKKPYLDTARPTLRYVK--------------------------GDHD
                                                +NEVGV   YMQN KKP+L+  R  LRYV                           GDHD
Subjt:  ----------------------------------------ENEVGVFDQYMQNLKKPYLDTARPTLRYVK--------------------------GDHD

Query:  TRRSITGYVFKLGSRTIFWCNKRQQTISLSTKEAEYRAAAGATQENTCLKLLMEDWHQKIEYPIPLYCNNQSAIRLAENPVFHARTKHVEVHNHFIREKV
        TRRS TGYVFKLGS TI WC KRQ T+SLST EAEYRAAA A QE+T L  LM + HQ ++Y I LYC+NQS IRLAENPVFHARTKHVEVH HF+REKV
Subjt:  TRRSITGYVFKLGSRTIFWCNKRQQTISLSTKEAEYRAAAGATQENTCLKLLMEDWHQKIEYPIPLYCNNQSAIRLAENPVFHARTKHVEVHNHFIREKV

Query:  LKEEIKMQQIKTDVQVADLFTKGLNTGKHESFRCHLNMVQRMRTSAEGEC
        L+EEI+M+QIKTD Q+ADLFTK L+  K E FR    ++QRM  + EGEC
Subjt:  LKEEIKMQQIKTDVQVADLFTKGLNTGKHESFRCHLNMVQRMRTSAEGEC

KAE8676439.1 hypothetical protein F3Y22_tig00111614pilonHSYRG00169 [Hibiscus syriacus]1.8e-9742.78Show/hide
Query:  QEVTQPSEPSENEMT-PQ-QLRHSEIILKENLEYAN-AVEDEVNEPEIYEEASQNSAWQKAMEEETIALEQNQTWELVSRPRDTKLISCKWAYEIMCTPD
        Q+  +   PSE E + PQ QLR S  I + N +YAN A+ +E  EPE +EEAS++S W  AM+EE  AL+QNQTW++V + +D K ISCKW Y+I   PD
Subjt:  QEVTQPSEPSENEMT-PQ-QLRHSEIILKENLEYAN-AVEDEVNEPEIYEEASQNSAWQKAMEEETIALEQNQTWELVSRPRDTKLISCKWAYEIMCTPD

Query:  GSIVRYKAQTIDSGFSQEYGLDYDETIRPVAKIIIVQVPPVLTVNKDWKLWQMDMNNAFLQGELNREFYMDQPKKFENE---------------------
        GSI RYKA+ +  GFSQ+YGLDYDET  PVAK+  V+V   L  NKDW LWQMD+ NAFL GEL+RE YM QP  F+++                     
Subjt:  GSIVRYKAQTIDSGFSQEYGLDYDETIRPVAKIIIVQVPPVLTVNKDWKLWQMDMNNAFLQGELNREFYMDQPKKFENE---------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -------------------VGVFDQYMQNLKKPYLDTARPTLRYVK--------------------------GDHDTRRSITGYVFKLGSRTIFWCNKRQ
                            GV  +YMQN KKP+L+  +  LRYVK                          GDHDTRRS TGYVFKLGS TI WC+KRQ
Subjt:  -------------------VGVFDQYMQNLKKPYLDTARPTLRYVK--------------------------GDHDTRRSITGYVFKLGSRTIFWCNKRQ

Query:  QTISLSTKEAEYRAAAGATQENTCLKLLMEDWHQKIEYPIPLYCNNQSAIRLAENPVFHARTKHVEVHNHFIREKVLKEEIKMQQIKTDVQVADLFTKGL
         T+SLST EAEYRAAA A QE+T L  LM + HQ ++Y IPLYC+NQSAIRLAENPVFHARTKHVEVH HF+REKVL+EEI+M+QIKTD Q+ADLFTK L
Subjt:  QTISLSTKEAEYRAAAGATQENTCLKLLMEDWHQKIEYPIPLYCNNQSAIRLAENPVFHARTKHVEVHNHFIREKVLKEEIKMQQIKTDVQVADLFTKGL

Query:  NTGKHESFRCHLNMVQRMRTSAEGEC
        + GK E FR    ++QRM  + EGEC
Subjt:  NTGKHESFRCHLNMVQRMRTSAEGEC

KAE8684576.1 hypothetical protein F3Y22_tig00111127pilonHSYRG00074 [Hibiscus syriacus]3.6e-9842.83Show/hide
Query:  QEVTQPSEPSENEMT-PQ-QLRHSEIILKENLEYAN-AVEDEVNEPEIYEEASQNSAWQKAMEEETIALEQNQTWELVSRPRDTKLISCKWAYEIMCTPD
        Q+  +   PSE E + PQ QLR S  I + N +YAN A+ +E  EPE +EEAS++S W  AM+EE  AL+QNQTW++V + +D KLISCKW Y+I   PD
Subjt:  QEVTQPSEPSENEMT-PQ-QLRHSEIILKENLEYAN-AVEDEVNEPEIYEEASQNSAWQKAMEEETIALEQNQTWELVSRPRDTKLISCKWAYEIMCTPD

Query:  GSIVRYKAQTIDSGFSQEYGLDYDETIRPVAKIIIVQVPPVLTVNKDWKLWQMDMNNAFLQGELNREFYMDQPKKFENE---------------------
        GSI RYKA+ +  GFSQ+YGLDYDET  PVAK+  V+V   L  NKDW LWQMD+ NAFL GEL+RE YM QP  F+++                     
Subjt:  GSIVRYKAQTIDSGFSQEYGLDYDETIRPVAKIIIVQVPPVLTVNKDWKLWQMDMNNAFLQGELNREFYMDQPKKFENE---------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -----------------------VGVFDQYMQNLKKPYLDTARPTLRYVK--------------------------GDHDTRRSITGYVFKLGSRTIFWC
                               VGV  +YMQN KKP+L+  R  LRYVK                          GDHDTRRS TGYVFKLGS TI WC
Subjt:  -----------------------VGVFDQYMQNLKKPYLDTARPTLRYVK--------------------------GDHDTRRSITGYVFKLGSRTIFWC

Query:  NKRQQTISLSTKEAEYRAAAGATQENTCLKLLMEDWHQKIEYPIPLYCNNQSAIRLAENPVFHARTKHVEVHNHFIREKVLKEEIKMQQIKTDVQVADLF
        +KRQ T+SLST E EYRAAA A QE+T L  LM + HQ ++Y IPLYC+NQSAIRLAENPVFHARTKHVEVH HF+REKVL+EEI+M+QIKTD Q+ADLF
Subjt:  NKRQQTISLSTKEAEYRAAAGATQENTCLKLLMEDWHQKIEYPIPLYCNNQSAIRLAENPVFHARTKHVEVHNHFIREKVLKEEIKMQQIKTDVQVADLF

Query:  TKGLNTGKHESFRCHLNMVQRMRTSAEGEC
        TK L+ GK E FR    ++QRM  + EGEC
Subjt:  TKGLNTGKHESFRCHLNMVQRMRTSAEGEC

KAE8704364.1 PLAC8 family protein [Hibiscus syriacus]1.4e-10246.89Show/hide
Query:  QEVTQPSEPSENEMT-PQ-QLRHSEIILKENLEYAN-AVEDEVNEPEIYEEASQNSAWQKAMEEETIALEQNQTWELVSRPRDTKLISCKWAYEIMCTPD
        Q+  +   PSE E + PQ QLR S  I + N +YAN A+ +EV E E +EEASQ+S W  AM+EE  AL+QNQTW+LV + +D K ISCKW Y+I   PD
Subjt:  QEVTQPSEPSENEMT-PQ-QLRHSEIILKENLEYAN-AVEDEVNEPEIYEEASQNSAWQKAMEEETIALEQNQTWELVSRPRDTKLISCKWAYEIMCTPD

Query:  GSIVRYKAQTIDSGFSQEYGLDYDETIRPVAKIIIVQVPPVLTVNKDWKLWQMDMNNAFLQGELNREFYMDQPKKFEN----------------------
        GSI RYKA+ +  GFSQ+YGLDYDET  PVAK+  V+V  VL  NKDW LWQMD+ NAFL GEL+RE YM QP  F++                      
Subjt:  GSIVRYKAQTIDSGFSQEYGLDYDETIRPVAKIIIVQVPPVLTVNKDWKLWQMDMNNAFLQGELNREFYMDQPKKFEN----------------------

Query:  --------------------------------------------------------------------------EVGVFDQYMQNLKKPYLDTARPTLRY
                                                                                  E G+  +YMQN KKP+L+  R  LRY
Subjt:  --------------------------------------------------------------------------EVGVFDQYMQNLKKPYLDTARPTLRY

Query:  VK--------------------------GDHDTRRSITGYVFKLGSRTIFWCNKRQQTISLSTKEAEYRAAAGATQENTCLKLLMEDWHQKIEYPIPLYC
        VK                          GDHDTRRS TGYVFKLGSRTI WC+KRQ T+SLST EA+YRA A A QE+T L  LM + HQ ++Y IPLYC
Subjt:  VK--------------------------GDHDTRRSITGYVFKLGSRTIFWCNKRQQTISLSTKEAEYRAAAGATQENTCLKLLMEDWHQKIEYPIPLYC

Query:  NNQSAIRLAENPVFHARTKHVEVHNHFIREKVLKEEIKMQQIKTDVQVADLFTKGLNTGKHESFRCHLNMVQRMRTSAEGEC
        +NQS IRLAENPVFHARTKHVEVH HF+REKVL+EEI+M+QIKTD Q+ADLFTK L+ GK + FR    ++QRM  + EGEC
Subjt:  NNQSAIRLAENPVFHARTKHVEVHNHFIREKVLKEEIKMQQIKTDVQVADLFTKGLNTGKHESFRCHLNMVQRMRTSAEGEC

TrEMBL top hitse value%identityAlignment
A0A2N9ELW1 Uncharacterized protein2.7e-9946.46Show/hide
Query:  TSVGEQEVTQPSEPSENEMTPQ-QLRHSEIILKENLEYANAV---EDEVNEPEIYEEASQNSAWQKAMEEETIALEQNQTWELVSRPRDTKLISCKWAYE
        T V +Q   +       E+TPQ QLR S    + N +YANA    E  + EPE +EEASQ+S W KAMEEE  AL+QNQTW+L+ +PRD K ISCKW Y+
Subjt:  TSVGEQEVTQPSEPSENEMTPQ-QLRHSEIILKENLEYANAV---EDEVNEPEIYEEASQNSAWQKAMEEETIALEQNQTWELVSRPRDTKLISCKWAYE

Query:  IMCTPDGSIVRYKAQTIDSGFSQEYGLDYDETIRPVAKIIIVQVPPVLTVNKDWKLWQMDMNNAFLQGELNREFYMDQPKKFENE---------------
        I   PDGSI RYKA+ +  GFSQ+YGLDYDET  PVAK+  V+V   L  NKDW LWQMD+ NAFL GEL+RE YM QP  F+N+               
Subjt:  IMCTPDGSIVRYKAQTIDSGFSQEYGLDYDETIRPVAKIIIVQVPPVLTVNKDWKLWQMDMNNAFLQGELNREFYMDQPKKFENE---------------

Query:  ----------------------------------------------------------------------------------VGVFDQYMQNLKKPYLDT
                                                                                          V V  +YMQN KKP+L+ 
Subjt:  ----------------------------------------------------------------------------------VGVFDQYMQNLKKPYLDT

Query:  ARPTLRYVK--------------------------GDHDTRRSITGYVFKLGSRTIFWCNKRQQTISLSTKEAEYRAAAGATQENTCLKLLMEDWHQKIE
         R  LRYVK                          GDHDTRRS TGYVFKLGS TI WC+KRQ T+SLST EAEYRAAA A QE+T L  LM D HQ I+
Subjt:  ARPTLRYVK--------------------------GDHDTRRSITGYVFKLGSRTIFWCNKRQQTISLSTKEAEYRAAAGATQENTCLKLLMEDWHQKIE

Query:  YPIPLYCNNQSAIRLAENPVFHARTKHVEVHNHFIREKVLKEEIKMQQIKTDVQVADLFTKGLNTGKHESFRCHLNMVQR
        Y + L+C+NQSAIRLAENP FHARTKHVEVH HF+REKVL+ +IKM Q KT+ QVAD+FTKGLNT K   FR  L MV +
Subjt:  YPIPLYCNNQSAIRLAENPVFHARTKHVEVHNHFIREKVLKEEIKMQQIKTDVQVADLFTKGLNTGKHESFRCHLNMVQR

A0A2N9GCM1 Uncharacterized protein4.6e-9946.46Show/hide
Query:  TSVGEQEVTQPSEPSENEMTPQ-QLRHSEIILKENLEYANAV---EDEVNEPEIYEEASQNSAWQKAMEEETIALEQNQTWELVSRPRDTKLISCKWAYE
        T V +Q   +       E+TPQ QLR S    + N +YANA    E  + EPE +EEASQ+S W KAMEEE  AL+QNQTW+L+ +PRD K ISCKW Y+
Subjt:  TSVGEQEVTQPSEPSENEMTPQ-QLRHSEIILKENLEYANAV---EDEVNEPEIYEEASQNSAWQKAMEEETIALEQNQTWELVSRPRDTKLISCKWAYE

Query:  IMCTPDGSIVRYKAQTIDSGFSQEYGLDYDETIRPVAKIIIVQVPPVLTVNKDWKLWQMDMNNAFLQGELNREFYMDQPKKFENE---------------
        I   PDGSI RYKA+ +  GFSQ+YGLDYDET  PVAK+  V+V   L  NKDW LWQMD+ NAFL GEL+RE YM QP  F+N+               
Subjt:  IMCTPDGSIVRYKAQTIDSGFSQEYGLDYDETIRPVAKIIIVQVPPVLTVNKDWKLWQMDMNNAFLQGELNREFYMDQPKKFENE---------------

Query:  ----------------------------------------------------------------------------------VGVFDQYMQNLKKPYLDT
                                                                                          V V  +YMQN KKP+L+ 
Subjt:  ----------------------------------------------------------------------------------VGVFDQYMQNLKKPYLDT

Query:  ARPTLRYVK--------------------------GDHDTRRSITGYVFKLGSRTIFWCNKRQQTISLSTKEAEYRAAAGATQENTCLKLLMEDWHQKIE
         R  LRYVK                          GDHDTRRS TGYVFKLGS TI WC+KRQ T+SLST EAEYRAAA A QE+T L  LM D HQ I+
Subjt:  ARPTLRYVK--------------------------GDHDTRRSITGYVFKLGSRTIFWCNKRQQTISLSTKEAEYRAAAGATQENTCLKLLMEDWHQKIE

Query:  YPIPLYCNNQSAIRLAENPVFHARTKHVEVHNHFIREKVLKEEIKMQQIKTDVQVADLFTKGLNTGKHESFRCHLNMVQR
        Y + L+C+NQSAIRLAENP FHARTKHVEVH HF+REKVL+ +IKM Q KT+ QVAD+FTKGLNT K   FR  L MV +
Subjt:  YPIPLYCNNQSAIRLAENPVFHARTKHVEVHNHFIREKVLKEEIKMQQIKTDVQVADLFTKGLNTGKHESFRCHLNMVQR

A0A2N9HDK6 Uncharacterized protein2.7e-9946.46Show/hide
Query:  TSVGEQEVTQPSEPSENEMTPQ-QLRHSEIILKENLEYANAV---EDEVNEPEIYEEASQNSAWQKAMEEETIALEQNQTWELVSRPRDTKLISCKWAYE
        T V +Q   +       E+TPQ QLR S    + N +YANA    E  + EPE +EEASQ+S W KAMEEE  AL+QNQTW+L+ +PRD K ISCKW Y+
Subjt:  TSVGEQEVTQPSEPSENEMTPQ-QLRHSEIILKENLEYANAV---EDEVNEPEIYEEASQNSAWQKAMEEETIALEQNQTWELVSRPRDTKLISCKWAYE

Query:  IMCTPDGSIVRYKAQTIDSGFSQEYGLDYDETIRPVAKIIIVQVPPVLTVNKDWKLWQMDMNNAFLQGELNREFYMDQPKKFENE---------------
        I   PDGSI RYKA+ +  GFSQ+YGLDYDET  PVAK+  V+V   L  NKDW LWQMD+ NAFL GEL+RE YM QP  F+N+               
Subjt:  IMCTPDGSIVRYKAQTIDSGFSQEYGLDYDETIRPVAKIIIVQVPPVLTVNKDWKLWQMDMNNAFLQGELNREFYMDQPKKFENE---------------

Query:  ----------------------------------------------------------------------------------VGVFDQYMQNLKKPYLDT
                                                                                          V V  +YMQN KKP+L+ 
Subjt:  ----------------------------------------------------------------------------------VGVFDQYMQNLKKPYLDT

Query:  ARPTLRYVK--------------------------GDHDTRRSITGYVFKLGSRTIFWCNKRQQTISLSTKEAEYRAAAGATQENTCLKLLMEDWHQKIE
         R  LRYVK                          GDHDTRRS TGYVFKLGS TI WC+KRQ T+SLST EAEYRAAA A QE+T L  LM D HQ I+
Subjt:  ARPTLRYVK--------------------------GDHDTRRSITGYVFKLGSRTIFWCNKRQQTISLSTKEAEYRAAAGATQENTCLKLLMEDWHQKIE

Query:  YPIPLYCNNQSAIRLAENPVFHARTKHVEVHNHFIREKVLKEEIKMQQIKTDVQVADLFTKGLNTGKHESFRCHLNMVQR
        Y + L+C+NQSAIRLAENP FHARTKHVEVH HF+REKVL+ +IKM Q KT+ QVAD+FTKGLNT K   FR  L MV +
Subjt:  YPIPLYCNNQSAIRLAENPVFHARTKHVEVHNHFIREKVLKEEIKMQQIKTDVQVADLFTKGLNTGKHESFRCHLNMVQR

A0A6A2WZM7 Tir-nbs resistance protein5.8e-10248.89Show/hide
Query:  DSSDTSVGEQEVTQPSEPSEN--------EMTPQQLRHSEIILKENLEYANA-VEDEVNEPEIYEEASQNSAWQKAMEEETIALEQNQTWELVSRPRDTK
        D     +GE +V   +   E+        E    Q   S  I + N +YANA + ++  EPE +EEAS++S W  AM+EE  AL+QNQTW++V + +D K
Subjt:  DSSDTSVGEQEVTQPSEPSEN--------EMTPQQLRHSEIILKENLEYANA-VEDEVNEPEIYEEASQNSAWQKAMEEETIALEQNQTWELVSRPRDTK

Query:  LISCKWAYEIMCTPDGSIVRYKAQTIDSGFSQEYGLDYDETIRPVAKIIIVQVPPVLTVNKDWKLWQMDMNNAFLQGELNREFYMDQPKKF---------
         ISCKW Y+I   PDGSI RYKA+ +  GFSQ+YGLDYDET  PVAK+  V+V   L  NKDW LWQMD+ NAFL GEL+RE YM QP +          
Subjt:  LISCKWAYEIMCTPDGSIVRYKAQTIDSGFSQEYGLDYDETIRPVAKIIIVQVPPVLTVNKDWKLWQMDMNNAFLQGELNREFYMDQPKKF---------

Query:  ----------------------------------------ENEVGVFDQYMQNLKKPYLDTARPTLRYVK--------------------------GDHD
                                                +NEVGV   YMQN KKP+L+  R  LRYV                           GDHD
Subjt:  ----------------------------------------ENEVGVFDQYMQNLKKPYLDTARPTLRYVK--------------------------GDHD

Query:  TRRSITGYVFKLGSRTIFWCNKRQQTISLSTKEAEYRAAAGATQENTCLKLLMEDWHQKIEYPIPLYCNNQSAIRLAENPVFHARTKHVEVHNHFIREKV
        TRRS TGYVFKLGS TI WC KRQ T+SLST EAEYRAAA A QE+T L  LM + HQ ++Y I LYC+NQS IRLAENPVFHARTKHVEVH HF+REKV
Subjt:  TRRSITGYVFKLGSRTIFWCNKRQQTISLSTKEAEYRAAAGATQENTCLKLLMEDWHQKIEYPIPLYCNNQSAIRLAENPVFHARTKHVEVHNHFIREKV

Query:  LKEEIKMQQIKTDVQVADLFTKGLNTGKHESFRCHLNMVQRMRTSAEGEC
        L+EEI+M+QIKTD Q+ADLFTK L+  K E FR    ++QRM  + EGEC
Subjt:  LKEEIKMQQIKTDVQVADLFTKGLNTGKHESFRCHLNMVQRMRTSAEGEC

A0A6A3AIG6 PLAC8 family protein6.9e-10346.89Show/hide
Query:  QEVTQPSEPSENEMT-PQ-QLRHSEIILKENLEYAN-AVEDEVNEPEIYEEASQNSAWQKAMEEETIALEQNQTWELVSRPRDTKLISCKWAYEIMCTPD
        Q+  +   PSE E + PQ QLR S  I + N +YAN A+ +EV E E +EEASQ+S W  AM+EE  AL+QNQTW+LV + +D K ISCKW Y+I   PD
Subjt:  QEVTQPSEPSENEMT-PQ-QLRHSEIILKENLEYAN-AVEDEVNEPEIYEEASQNSAWQKAMEEETIALEQNQTWELVSRPRDTKLISCKWAYEIMCTPD

Query:  GSIVRYKAQTIDSGFSQEYGLDYDETIRPVAKIIIVQVPPVLTVNKDWKLWQMDMNNAFLQGELNREFYMDQPKKFEN----------------------
        GSI RYKA+ +  GFSQ+YGLDYDET  PVAK+  V+V  VL  NKDW LWQMD+ NAFL GEL+RE YM QP  F++                      
Subjt:  GSIVRYKAQTIDSGFSQEYGLDYDETIRPVAKIIIVQVPPVLTVNKDWKLWQMDMNNAFLQGELNREFYMDQPKKFEN----------------------

Query:  --------------------------------------------------------------------------EVGVFDQYMQNLKKPYLDTARPTLRY
                                                                                  E G+  +YMQN KKP+L+  R  LRY
Subjt:  --------------------------------------------------------------------------EVGVFDQYMQNLKKPYLDTARPTLRY

Query:  VK--------------------------GDHDTRRSITGYVFKLGSRTIFWCNKRQQTISLSTKEAEYRAAAGATQENTCLKLLMEDWHQKIEYPIPLYC
        VK                          GDHDTRRS TGYVFKLGSRTI WC+KRQ T+SLST EA+YRA A A QE+T L  LM + HQ ++Y IPLYC
Subjt:  VK--------------------------GDHDTRRSITGYVFKLGSRTIFWCNKRQQTISLSTKEAEYRAAAGATQENTCLKLLMEDWHQKIEYPIPLYC

Query:  NNQSAIRLAENPVFHARTKHVEVHNHFIREKVLKEEIKMQQIKTDVQVADLFTKGLNTGKHESFRCHLNMVQRMRTSAEGEC
        +NQS IRLAENPVFHARTKHVEVH HF+REKVL+EEI+M+QIKTD Q+ADLFTK L+ GK + FR    ++QRM  + EGEC
Subjt:  NNQSAIRLAENPVFHARTKHVEVHNHFIREKVLKEEIKMQQIKTDVQVADLFTKGLNTGKHESFRCHLNMVQRMRTSAEGEC

SwissProt top hitse value%identityAlignment
P04146 Copia protein8.6e-2622.83Show/hide
Query:  SENEMTPQQLRHSEIILKENLEYANA---VEDEVNEPEIYEEASQNSAWQKAMEEETIALEQNQTWELVSRPRDTKLISCKWAYEIMCTPDGSIVRYKAQ
        SE   T  Q+ ++E     N    NA     D  N  +  +     S+W++A+  E  A + N TW +  RP +  ++  +W + +     G+ +RYKA+
Subjt:  SENEMTPQQLRHSEIILKENLEYANA---VEDEVNEPEIYEEASQNSAWQKAMEEETIALEQNQTWELVSRPRDTKLISCKWAYEIMCTPDGSIVRYKAQ

Query:  TIDSGFSQEYGLDYDETIRPVAKIIIVQVPPVLTVNKDWKLWQMDMNNAFLQGELNREFYMDQP------------------------------------
         +  GF+Q+Y +DY+ET  PVA+I   +    L +  + K+ QMD+  AFL G L  E YM  P                                    
Subjt:  TIDSGFSQEYGLDYDETIRPVAKIIIVQVPPVLTVNKDWKLWQMDMNNAFLQGELNREFYMDQP------------------------------------

Query:  ----------------------------------------------------KKFE----NEVGVFD---------------------------------
                                                            +KF     NE+  F                                  
Subjt:  ----------------------------------------------------KKFE----NEVGVFD---------------------------------

Query:  -------------------------------QYMQNLKKPYLDTA-------------------RPTLRYVKGDHD------------------------
                                        Y+    +P L TA                   +  LRY+KG  D                        
Subjt:  -------------------------------QYMQNLKKPYLDTA-------------------RPTLRYVKGDHD------------------------

Query:  ----TRRSITGYVFKL-GSRTIFWCNKRQQTISLSTKEAEYRAAAGATQENTCLKLLMEDWHQKIEYPIPLYCNNQSAIRLAENPVFHARTKHVEVHNHF
             R+S TGY+FK+     I W  KRQ +++ S+ EAEY A   A +E   LK L+   + K+E PI +Y +NQ  I +A NP  H R KH+++  HF
Subjt:  ----TRRSITGYVFKL-GSRTIFWCNKRQQTISLSTKEAEYRAAAGATQENTCLKLLMEDWHQKIEYPIPLYCNNQSAIRLAENPVFHARTKHVEVHNHF

Query:  IREKVLKEEIKMQQIKTDVQVADLFTKGLNTGKHESFRCHLNMVQRMRTSAE
         RE+V    I ++ I T+ Q+AD+FTK L   +    R  L ++Q  +++AE
Subjt:  IREKVLKEEIKMQQIKTDVQVADLFTKGLNTGKHESFRCHLNMVQRMRTSAE

P0CV72 Secreted RxLR effector protein 1614.4e-0648.98Show/hide
Query:  GDHDTRRSITGYVFKLGSRTIFWCNKRQQTISLSTKEAEYRAAAGATQE
        GD ++RRS +GY+FKL    + W +K+Q+T++LS+ E EY A + ATQE
Subjt:  GDHDTRRSITGYVFKLGSRTIFWCNKRQQTISLSTKEAEYRAAAGATQE

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-943.7e-2923.16Show/hide
Query:  EQEVTQPSEPSENEMTPQQLRHSEIILKENLEYANA----VEDEVNEPEIYEEA---SQNSAWQKAMEEETIALEQNQTWELVSRPRDTKLISCKWAYEI
        ++ V +   P++ E   Q LR SE    E+  Y +     + D+  EPE  +E     + +   KAM+EE  +L++N T++LV  P+  + + CKW +++
Subjt:  EQEVTQPSEPSENEMTPQQLRHSEIILKENLEYANA----VEDEVNEPEIYEEA---SQNSAWQKAMEEETIALEQNQTWELVSRPRDTKLISCKWAYEI

Query:  MCTPDGSIVRYKAQTIDSGFSQEYGLDYDETIRPVAKIIIVQVPPVLTVNKDWKLWQMDMNNAFLQGELNREFYMDQPKKFE------------------
            D  +VRYKA+ +  GF Q+ G+D+DE   PV K+  ++    L  + D ++ Q+D+  AFL G+L  E YM+QP+ FE                  
Subjt:  MCTPDGSIVRYKAQTIDSGFSQEYGLDYDETIRPVAKIIIVQVPPVLTVNKDWKLWQMDMNNAFLQGELNREFYMDQPKKFE------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -------------------------------------------------------------------NEVGVFDQYMQNLKKPYLDTARPTLRYVK----
                                                                           + VGV  ++++N  K + +  +  LRY++    
Subjt:  -------------------------------------------------------------------NEVGVFDQYMQNLKKPYLDTARPTLRYVK----

Query:  ---------------------GDHDTRRSITGYVFKLGSRTIFWCNKRQQTISLSTKEAEYRAAAGATQENTCLKLLMED--WHQKIEYPIPLYCNNQSA
                             GD D R+S TGY+F      I W +K Q+ ++LST EAEY AA    +E   LK  +++   HQK EY +  YC++QSA
Subjt:  ---------------------GDHDTRRSITGYVFKLGSRTIFWCNKRQQTISLSTKEAEYRAAAGATQENTCLKLLMED--WHQKIEYPIPLYCNNQSA

Query:  IRLAENPVFHARTKHVEVHNHFIREKVLKEEIKMQQIKTDVQVADLFTKGLNTGKHE
        I L++N ++HARTKH++V  H+IRE V  E +K+ +I T+   AD+ TK +   K E
Subjt:  IRLAENPVFHARTKHVEVHNHFIREKVLKEEIKMQQIKTDVQVADLFTKGLNTGKHE

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE12.5e-2521.29Show/hide
Query:  ILKENLEYANAVE-DEVNEPEIYEEASQNSAWQKAMEEETIALEQNQTWELV-SRPRDTKLISCKWAYEIMCTPDGSIVRYKAQTIDSGFSQEYGLDYDE
        I+K N +Y+ AV     +EP    +A ++  W+ AM  E  A   N TW+LV   P    ++ C+W +      DGS+ RYKA+ +  G++Q  GLDY E
Subjt:  ILKENLEYANAVE-DEVNEPEIYEEASQNSAWQKAMEEETIALEQNQTWELV-SRPRDTKLISCKWAYEIMCTPDGSIVRYKAQTIDSGFSQEYGLDYDE

Query:  TIRPVAKIIIVQVPPVLTVNKDWKLWQMDMNNAFLQGELNREFYMDQPKKFENE----------------------------------------------
        T  PV K   +++   + V++ W + Q+D+NNAFLQG L  + YM QP  F ++                                              
Subjt:  TIRPVAKIIIVQVPPVLTVNKDWKLWQMDMNNAFLQGELNREFYMDQPKKFENE----------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -----------------------------VGVFDQYMQNLKKPYLDTARPTLRYV--------------------------KGDHDTRRSITGYVFKLGS
                                     V    Q+M    + +L   +  LRY+                           GD D   S  GY+  LG 
Subjt:  -----------------------------VGVFDQYMQNLKKPYLDTARPTLRYV--------------------------KGDHDTRRSITGYVFKLGS

Query:  RTIFWCNKRQQTISLSTKEAEYRAAAGATQENTCLKLLMEDWHQKIEYPIPLYCNNQSAIRLAENPVFHARTKHVEVHNHFIREKVLKEEIKMQQIKTDV
          I W +K+Q+ +  S+ EAEYR+ A  + E   +  L+ +   ++  P  +YC+N  A  L  NPVFH+R KH+ +  HFIR +V    +++  + T  
Subjt:  RTIFWCNKRQQTISLSTKEAEYRAAAGATQENTCLKLLMEDWHQKIEYPIPLYCNNQSAIRLAENPVFHARTKHVEVHNHFIREKVLKEEIKMQQIKTDV

Query:  QVADLFTKGLNTGKHESFRCHLNMVQ
        Q+AD  TK L+    ++F   + + +
Subjt:  QVADLFTKGLNTGKHESFRCHLNMVQ

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE21.1e-2521.85Show/hide
Query:  RHSEIILKENLEYANAVEDEVN-EPEIYEEASQNSAWQKAMEEETIALEQNQTWELV-SRPRDTKLISCKWAYEIMCTPDGSIVRYKAQTIDSGFSQEYG
        R  + I K N +Y+ A     N EP    +A ++  W++AM  E  A   N TW+LV   P    ++ C+W +      DGS+ RYKA+ +  G++Q  G
Subjt:  RHSEIILKENLEYANAVEDEVN-EPEIYEEASQNSAWQKAMEEETIALEQNQTWELV-SRPRDTKLISCKWAYEIMCTPDGSIVRYKAQTIDSGFSQEYG

Query:  LDYDETIRPVAKIIIVQVPPVLTVNKDWKLWQMDMNNAFLQGELNREFYMDQPKKF--------------------------------------------
        LDY ET  PV K   +++   + V++ W + Q+D+NNAFLQG L  E YM QP  F                                            
Subjt:  LDYDETIRPVAKIIIVQVPPVLTVNKDWKLWQMDMNNAFLQGELNREFYMDQPKKF--------------------------------------------

Query:  -----------------------------------------------ENE--------------------------------------------------
                                                       E+E                                                  
Subjt:  -----------------------------------------------ENE--------------------------------------------------

Query:  ----------------------------------VGVFDQYMQNLKKPYLDTARPTLRYV--------------------------KGDHDTRRSITGYV
                                          V    QYM      + +  +  LRY+                           GD D   S  GY+
Subjt:  ----------------------------------VGVFDQYMQNLKKPYLDTARPTLRYV--------------------------KGDHDTRRSITGYV

Query:  FKLGSRTIFWCNKRQQTISLSTKEAEYRAAAGATQENTCLKLLMEDWHQKIEYPIPLYCNNQSAIRLAENPVFHARTKHVEVHNHFIREKVLKEEIKMQQ
          LG   I W +K+Q+ +  S+ EAEYR+ A  + E   +  L+ +   ++ +P  +YC+N  A  L  NPVFH+R KH+ +  HFIR +V    +++  
Subjt:  FKLGSRTIFWCNKRQQTISLSTKEAEYRAAAGATQENTCLKLLMEDWHQKIEYPIPLYCNNQSAIRLAENPVFHARTKHVEVHNHFIREKVLKEEIKMQQ

Query:  IKTDVQVADLFTKGLNTGKHESFRCHLNMVQ
        + T  Q+AD  TK L+    ++F   + +++
Subjt:  IKTDVQVADLFTKGLNTGKHESFRCHLNMVQ

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 88.0e-2723.26Show/hide
Query:  EVNEPEIYEEASQNSAWQKAMEEETIALEQNQTWELVSRPRDTKLISCKWAYEIMCTPDGSIVRYKAQTIDSGFSQEYGLDYDETIRPVAKIIIVQVPPV
        +  EP  Y EA +   W  AM++E  A+E   TWE+ + P + K I CKW Y+I    DG+I RYKA+ +  G++Q+ G+D+ ET  PV K+  V++   
Subjt:  EVNEPEIYEEASQNSAWQKAMEEETIALEQNQTWELVSRPRDTKLISCKWAYEIMCTPDGSIVRYKAQTIDSGFSQEYGLDYDETIRPVAKIIIVQVPPV

Query:  LTVNKDWKLWQMDMNNAFLQGELNREFYMDQPKKF-----------------------------------------------------------------
        ++   ++ L Q+D++NAFL G+L+ E YM  P  +                                                                 
Subjt:  LTVNKDWKLWQMDMNNAFLQGELNREFYMDQPKKF-----------------------------------------------------------------

Query:  ---------ENEVGVFDQYMQNLK----------------------------------------------KP----------------------------
                  N     D+    LK                                              KP                            
Subjt:  ---------ENEVGVFDQYMQNLK----------------------------------------------KP----------------------------

Query:  -----YLDTAR--------------------------PTLRYVKGD--------------------------HDTRRSITGYVFKLGSRTIFWCNKRQQT
             YL   R                            L Y+KG                            DTRRS  GY   LG+  I W +K+QQ 
Subjt:  -----YLDTAR--------------------------PTLRYVKGD--------------------------HDTRRSITGYVFKLGSRTIFWCNKRQQT

Query:  ISLSTKEAEYRAAAGATQENTCLKLLMEDWHQKIEYPIPLYCNNQSAIRLAENPVFHARTKHVEVHNHFIREK
        +S S+ EAEYRA + AT E   L     +    +  P  L+C+N +AI +A N VFH RTKH+E   H +RE+
Subjt:  ISLSTKEAEYRAAAGATQENTCLKLLMEDWHQKIEYPIPLYCNNQSAIRLAENPVFHARTKHVEVHNHFIREK

ATMG00810.1 DNA/RNA polymerases superfamily protein1.5e-0449.02Show/hide
Query:  GDHDTRRSITGYVFKLGSRTIFWCNKRQQTISLSTKEAEYRAAAGATQENT
        G   TRRS TG+   LG   I W  KRQ T+S S+ E EYRA A    E T
Subjt:  GDHDTRRSITGYVFKLGSRTIFWCNKRQQTISLSTKEAEYRAAAGATQENT


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCTGAAGATTCAAGTGACACTAGTGTTGGCGAGCAAGAAGTGACTCAACCAAGCGAGCCTAGTGAAAATGAAATGACACCTCAACAACTCAGACATTCAGAAATAAT
CCTAAAGGAAAATCTAGAGTATGCCAACGCAGTAGAAGACGAAGTTAATGAGCCAGAGATATATGAAGAAGCATCACAAAACTCAGCTTGGCAGAAAGCAATGGAGGAAG
AAACTATAGCCCTAGAACAAAATCAGACTTGGGAATTAGTGTCAAGACCAAGAGATACCAAACTCATCTCTTGCAAGTGGGCTTACGAAATAATGTGTACCCCGGATGGA
TCAATCGTGAGATACAAAGCTCAGACTATAGATTCAGGGTTCTCTCAAGAATATGGACTAGATTATGATGAAACGATTAGGCCAGTGGCAAAGATCATTATCGTACAAGT
TCCTCCAGTACTTACGGTAAATAAAGATTGGAAATTATGGCAGATGGATATGAATAATGCTTTCTTGCAGGGAGAGTTAAACAGAGAGTTCTACATGGACCAACCGAAAA
AATTCGAAAATGAAGTTGGAGTCTTTGATCAATATATGCAAAATCTGAAGAAGCCTTATTTAGATACAGCTCGACCGACCTTAAGATATGTCAAAGGAGACCACGATACC
CGAAGATCAATCACTGGGTATGTGTTCAAGCTCGGTTCAAGAACAATTTTTTGGTGTAACAAAAGACAACAAACAATATCATTGTCAACTAAAGAAGCAGAGTACAGAGC
AGCAGCTGGAGCAACTCAGGAAAATACATGCTTAAAACTTTTGATGGAAGATTGGCACCAGAAAATTGAGTATCCAATACCACTTTATTGCAACAATCAATCTGCGATTC
GCCTTGCAGAAAATCCGGTGTTTCATGCTAGAACAAAGCATGTGGAGGTGCACAACCACTTCATTAGAGAGAAGGTCCTAAAGGAAGAGATCAAGATGCAGCAAATCAAG
ACAGATGTCCAAGTGGCAGACTTGTTTACAAAAGGGCTGAATACTGGCAAACATGAGAGTTTTCGCTGTCACCTCAACATGGTGCAGCGAATGAGGACTAGTGCTGAGGG
GGAGTGTTGA
mRNA sequenceShow/hide mRNA sequence
ATGCCTGAAGATTCAAGTGACACTAGTGTTGGCGAGCAAGAAGTGACTCAACCAAGCGAGCCTAGTGAAAATGAAATGACACCTCAACAACTCAGACATTCAGAAATAAT
CCTAAAGGAAAATCTAGAGTATGCCAACGCAGTAGAAGACGAAGTTAATGAGCCAGAGATATATGAAGAAGCATCACAAAACTCAGCTTGGCAGAAAGCAATGGAGGAAG
AAACTATAGCCCTAGAACAAAATCAGACTTGGGAATTAGTGTCAAGACCAAGAGATACCAAACTCATCTCTTGCAAGTGGGCTTACGAAATAATGTGTACCCCGGATGGA
TCAATCGTGAGATACAAAGCTCAGACTATAGATTCAGGGTTCTCTCAAGAATATGGACTAGATTATGATGAAACGATTAGGCCAGTGGCAAAGATCATTATCGTACAAGT
TCCTCCAGTACTTACGGTAAATAAAGATTGGAAATTATGGCAGATGGATATGAATAATGCTTTCTTGCAGGGAGAGTTAAACAGAGAGTTCTACATGGACCAACCGAAAA
AATTCGAAAATGAAGTTGGAGTCTTTGATCAATATATGCAAAATCTGAAGAAGCCTTATTTAGATACAGCTCGACCGACCTTAAGATATGTCAAAGGAGACCACGATACC
CGAAGATCAATCACTGGGTATGTGTTCAAGCTCGGTTCAAGAACAATTTTTTGGTGTAACAAAAGACAACAAACAATATCATTGTCAACTAAAGAAGCAGAGTACAGAGC
AGCAGCTGGAGCAACTCAGGAAAATACATGCTTAAAACTTTTGATGGAAGATTGGCACCAGAAAATTGAGTATCCAATACCACTTTATTGCAACAATCAATCTGCGATTC
GCCTTGCAGAAAATCCGGTGTTTCATGCTAGAACAAAGCATGTGGAGGTGCACAACCACTTCATTAGAGAGAAGGTCCTAAAGGAAGAGATCAAGATGCAGCAAATCAAG
ACAGATGTCCAAGTGGCAGACTTGTTTACAAAAGGGCTGAATACTGGCAAACATGAGAGTTTTCGCTGTCACCTCAACATGGTGCAGCGAATGAGGACTAGTGCTGAGGG
GGAGTGTTGA
Protein sequenceShow/hide protein sequence
MPEDSSDTSVGEQEVTQPSEPSENEMTPQQLRHSEIILKENLEYANAVEDEVNEPEIYEEASQNSAWQKAMEEETIALEQNQTWELVSRPRDTKLISCKWAYEIMCTPDG
SIVRYKAQTIDSGFSQEYGLDYDETIRPVAKIIIVQVPPVLTVNKDWKLWQMDMNNAFLQGELNREFYMDQPKKFENEVGVFDQYMQNLKKPYLDTARPTLRYVKGDHDT
RRSITGYVFKLGSRTIFWCNKRQQTISLSTKEAEYRAAAGATQENTCLKLLMEDWHQKIEYPIPLYCNNQSAIRLAENPVFHARTKHVEVHNHFIREKVLKEEIKMQQIK
TDVQVADLFTKGLNTGKHESFRCHLNMVQRMRTSAEGEC