; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmoCh08G004290 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene IDCmoCh08G004290
OrganismCucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
DescriptionIntegrase catalytic domain-containing protein
Genome locationCmo_Chr08:2654130..2657159
RNA-Seq ExpressionCmoCh08G004290
SyntenyCmoCh08G004290
Gene Ontology termsNA
InterPro domainsIPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR025724 - GAG-pre-integrase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
CAN61340.1 hypothetical protein VITISV_007301 [Vitis vinifera]4.4e-16948.21Show/hide
Query:  MLEHIWDDKTPKEAWDTFVMLFSKKNDTRLQLLENELLSISQHDMTIAQYFHKVKLICRKITELDPKSAIVESRMKRIIIHGLRSEYRSFIIAVQGWPTQ
        +LEHI D KTP EAW+TF  LFSKKNDTRLQLLE+ELL ++Q ++TIAQYFHKVK +CR+I ELD ++ I E+RMKRIIIHGLR E R F+ AVQGW  Q
Subjt:  MLEHIWDDKTPKEAWDTFVMLFSKKNDTRLQLLENELLSISQHDMTIAQYFHKVKLICRKITELDPKSAIVESRMKRIIIHGLRSEYRSFIIAVQGWPTQ

Query:  PSLVEFENLLSSQEAMAKQMGGITLKGEEEALYTSESRSNNRPST--KRGYNGDKRRSHQGTS----QPERAQMNDNKSFQEKRF---------------
        PSLVEFENLL+ QEA+AKQMGG++LKGEE+ALY  + R N++  T  +   N DK +S QG      + +   +   K FQ + +               
Subjt:  PSLVEFENLLSSQEAMAKQMGGITLKGEEEALYTSESRSNNRPST--KRGYNGDKRRSHQGTS----QPERAQMNDNKSFQEKRF---------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ---------------------------------------GESAYVDKIRKNETTDLSHARLGHISYHKLKLIMEKSMLKGLPQLEVKAD-----CSNKH-
                                                E+AYV+K RKNET +L H RL HISY KL ++M+KSMLKGLP+LE++ D     C  ++ 
Subjt:  ---------------------------------------GESAYVDKIRKNETTDLSHARLGHISYHKLKLIMEKSMLKGLPQLEVKAD-----CSNKH-

Query:  ------------------------------------------------QSAKCAIVED-RVYEPETYEEASQNSVWQKAMEEEIIALEQNQTWELVPRLG
                                                        + A  AIVED    EPET+ EA QN  W KA++EEI AL+QNQTWELVP+  
Subjt:  ------------------------------------------------QSAKCAIVED-RVYEPETYEEASQNSVWQKAMEEEIIALEQNQTWELVPRLG

Query:  DVKPISCKWVYKIKRRPDGSIERYKARLVARGFSQQYGLDYDETFSLVAKITTVRVLLALAASKDWKPWQMDVKNAFLHGELDREIYMNQPKGFESAANP
        DV+PISCKWVYKIKRR DGSIER+KARLVARGFSQQYGLDYDETFS VAK+TTVRVLLALAA+KDW  WQMDVKNAFLHGELDREIYMNQP GF S  +P
Subjt:  DVKPISCKWVYKIKRRPDGSIERYKARLVARGFSQQYGLDYDETFSLVAKITTVRVLLALAASKDWKPWQMDVKNAFLHGELDREIYMNQPKGFESAANP

Query:  NYVCKLRKALYGLKQAPRAWYGKIAEFLTQSGYSVAHTDSSLFIKEREGNLTIVLVYVDDLIITGDDEREIYQTRENLSVRFQMKELGELKHFLGLEVDR
         YVCKLRKALYGLKQAPRAWY                 DSSLF+K   G L IVLVYVDDLIIT DD  EI++T ENLSVRF+MKELG+LKHFLGLEVD 
Subjt:  NYVCKLRKALYGLKQAPRAWYGKIAEFLTQSGYSVAHTDSSLFIKEREGNLTIVLVYVDDLIITGDDEREIYQTRENLSVRFQMKELGELKHFLGLEVDR

Query:  TDEGLFLCQQKYTRDMLQKFNMLECKQVSTPMETNAKICAHEGKELNDETTYR
        T EG+FLCQQKY +D+L+KF MLECK +STPME NAK+C HEGK+L D T YR
Subjt:  TDEGLFLCQQKYTRDMLQKFNMLECKQVSTPMETNAKICAHEGKELNDETTYR

KAE8679378.1 hypothetical protein F3Y22_tig00111402pilonHSYRG01323 [Hibiscus syriacus]8.6e-16538.59Show/hide
Query:  MLEHIWDDKTPKEAWDTFVMLFSKKNDTRLQLLENELLSISQHDMTIAQYFHKVKLICRKITELDPKSAIVESRMKRIIIHGLRSEYRSFIIAVQGWPTQ
        +LEHI D KTPKEAWDTFV LFSK+NDT+LQLLENELLS++Q DM +AQYFHKVK ICR+I+ELDP +AI E+ +KRII+HGLR EYR F+ AVQGWPTQ
Subjt:  MLEHIWDDKTPKEAWDTFVMLFSKKNDTRLQLLENELLSISQHDMTIAQYFHKVKLICRKITELDPKSAIVESRMKRIIIHGLRSEYRSFIIAVQGWPTQ

Query:  PSLVEFENLLSSQEAMAKQMGGITLKGEEEALYTSESRSNNRPSTKRG--YNGDKRRSHQGTSQPERAQMNDNKSFQEKRFG------------------
        PSLVEF+NLL+ QEAMAKQMGG++LKGEEEALYTS+SR   +  T  G   +GDK +++QG   P     + N+    K  G                  
Subjt:  PSLVEFENLLSSQEAMAKQMGGITLKGEEEALYTSESRSNNRPSTKRG--YNGDKRRSHQGTSQPERAQMNDNKSFQEKRFG------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -----------------------------------------ESAYVDKIRKNETTDLSHARLGHISYHKLKLIMEKSMLKGLPQLEVKAD----------
                                                 ESAYVD+ RKNET+DL H RLGHISY KL ++++KSMLKGLPQL+V+ D          
Subjt:  -----------------------------------------ESAYVDKIRKNETTDLSHARLGHISYHKLKLIMEKSMLKGLPQLEVKAD----------

Query:  -----------------------------------------------------------------------------------CSNKHQ-----------
                                                                                           C  +HQ           
Subjt:  -----------------------------------------------------------------------------------CSNKHQ-----------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ---------SAKC---------------------------------------------------------------------------------------
                 S +C                                                                                       
Subjt:  ---------SAKC---------------------------------------------------------------------------------------

Query:  ----------------------AIVEDRVYEPETYEEASQNSVWQKAMEEEIIALEQNQTWELVPRLGDVKPISCKWVYKIKRRPDGSIERYKARLVARG
                              AI+E+   EPET+EEAS++S W  AM+EEI AL+QN+TW++VP++ DVKPISCKWVYKIKRRPDGSIERYKARLVARG
Subjt:  ----------------------AIVEDRVYEPETYEEASQNSVWQKAMEEEIIALEQNQTWELVPRLGDVKPISCKWVYKIKRRPDGSIERYKARLVARG

Query:  FSQQYGLDYDETFSLVAKITTVRVLLALAASKDWKPWQMDVKNAFLHGELDREIYMNQPKGFESAANPNYVCKLRKALYGLKQAPRAWYGKIAEFLTQSG
        FSQQYGLDYDETFS VAK+TTVRVLLALAA+KDW  WQMDVKNAFLHGELDREIYM QP GF+S  +P YVCKLRKALYGLKQAPRAWYGKIAEFLT+SG
Subjt:  FSQQYGLDYDETFSLVAKITTVRVLLALAASKDWKPWQMDVKNAFLHGELDREIYMNQPKGFESAANPNYVCKLRKALYGLKQAPRAWYGKIAEFLTQSG

Query:  YSVAHTDSSLFIKEREGNLTIVLVYVDDLIITGDDEREIYQTRENLSVRFQMKELGELKHFLGLEVDRTDEGLFLCQQKYTRDMLQKFNMLECKQVSTPM
        YSV   DSSLF+K  EG L IVLVYVDDLIITGDDE +I QT+ENLSVRFQMKELG+LKHFLGLEVDRT EG+FLCQQKY +D+L++F MLECK  STPM
Subjt:  YSVAHTDSSLFIKEREGNLTIVLVYVDDLIITGDDEREIYQTRENLSVRFQMKELGELKHFLGLEVDRTDEGLFLCQQKYTRDMLQKFNMLECKQVSTPM

Query:  ETNAKICAHEGKELNDETTYR
        E N K+CAHEGK+L D T YR
Subjt:  ETNAKICAHEGKELNDETTYR

KAE8704478.1 hypothetical protein F3Y22_tig00110450pilonHSYRG00264 [Hibiscus syriacus]2.0e-16639.35Show/hide
Query:  MLEHIWDDKTPKEAWDTFVMLFSKKNDTRLQLLENELLSISQHDMTIAQYFHKVKLICRKITELDPKSAIVESRMKRIIIHGLRSEYRSFIIAVQGWPTQ
        MLEHI D KTPKEAWDTFV LFSK+NDT+LQLLENELLS++Q DM +AQYFHKVK ICR+I+ELDP +AI E+R+KRII+HGLR EYR F+ AVQGWPTQ
Subjt:  MLEHIWDDKTPKEAWDTFVMLFSKKNDTRLQLLENELLSISQHDMTIAQYFHKVKLICRKITELDPKSAIVESRMKRIIIHGLRSEYRSFIIAVQGWPTQ

Query:  PSLVEFENLLSSQEAMAKQMGGITLKGEEEALYTSESRSNNRPSTKRG--YNGDKRRSHQGTSQPERAQMNDNKSFQEKRFG------------------
        PSLVEFENLL+ QEAMAKQMGG++LKGEEEALYTS+SR   +  T  G   +GDK +++QG   P     + N+    K  G                  
Subjt:  PSLVEFENLLSSQEAMAKQMGGITLKGEEEALYTSESRSNNRPSTKRG--YNGDKRRSHQGTSQPERAQMNDNKSFQEKRFG------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ------------------------------ESAYVDKIRKNETTDLSHARLGHISYHKLKLIMEKSMLKGLPQLEVKAD---------------------
                                      ESAYVD+ RKNET+DL H RLGH+SY KL ++++KSMLKGLPQL+V+ D                     
Subjt:  ------------------------------ESAYVDKIRKNETTDLSHARLGHISYHKLKLIMEKSMLKGLPQLEVKAD---------------------

Query:  -------------------------------------------------------------------------------------------CSNKHQ---
                                                                                                   C  +HQ   
Subjt:  -------------------------------------------------------------------------------------------CSNKHQ---

Query:  --------------------------------------------------------------------------------SAKCAIVEDRVY--------
                                                                                        S +C  + + V+        
Subjt:  --------------------------------------------------------------------------------SAKCAIVEDRVY--------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  EPETYEEASQNSVWQKAMEEEIIALEQNQTWELVPRLGDVKPISCKWVYKIKRRPDGSIERYKARLVARGFSQQYGLDYDETFSLVAKITTVRVLLALAA
        EPET+EEAS++S W  +M+EEI AL+QNQTW++VP++ DVKPISCKWV KIKRRPDGSIERYKARLVARGFSQQYGLDYDETFS VAK+T VRVLLAL A
Subjt:  EPETYEEASQNSVWQKAMEEEIIALEQNQTWELVPRLGDVKPISCKWVYKIKRRPDGSIERYKARLVARGFSQQYGLDYDETFSLVAKITTVRVLLALAA

Query:  SKDWKPWQMDVKNAFLHGELDREIYMNQPKGFESAANPNYVCKLRKALYGLKQAPRAWYGKIAEFLTQSGYSVAHTDSSLFIKEREGNLTIVLVYVDDLI
        +KDW  WQMDVKNAFLHGELDREIYM QP GF+S  +P YVCKLRKALYGLKQAPRAWYGKIAEFLT+SGY V   DSSLF+K  EG L+IVLVYVDDLI
Subjt:  SKDWKPWQMDVKNAFLHGELDREIYMNQPKGFESAANPNYVCKLRKALYGLKQAPRAWYGKIAEFLTQSGYSVAHTDSSLFIKEREGNLTIVLVYVDDLI

Query:  ITGDDEREIYQTRENLSVRFQMKELGELKHFLGLEVDRTDEGLFLCQQKYTRDMLQKFNMLECKQVSTPMETNAKICAHEGKELNDETTYR
        ITGDDE EI QT+ENLSVRFQMKELG+LKHFLGLEVDRT EG+FLCQQKY +D+L++F MLECK  STPME N K+CAHE K+L D T YR
Subjt:  ITGDDEREIYQTRENLSVRFQMKELGELKHFLGLEVDRTDEGLFLCQQKYTRDMLQKFNMLECKQVSTPMETNAKICAHEGKELNDETTYR

KAE8725434.1 Indole-3-acetic acid-amido synthetase GH3.17 [Hibiscus syriacus]1.6e-16638.92Show/hide
Query:  MLEHIWDDKTPKEAWDTFVMLFSKKNDTRLQLLENELLSISQHDMTIAQYFHKVKLICRKITELDPKSAIVESRMKRIIIHGLRSEYRSFIIAVQGWPTQ
        MLEHI D KTPKEAWDTFV LFSK+NDT+LQLLENELLS++Q DM +AQYFHKVK ICR+I+ELDP +AI E+R+KRII+HGLR EYR F+ AVQGWPTQ
Subjt:  MLEHIWDDKTPKEAWDTFVMLFSKKNDTRLQLLENELLSISQHDMTIAQYFHKVKLICRKITELDPKSAIVESRMKRIIIHGLRSEYRSFIIAVQGWPTQ

Query:  PSLVEFENLLSSQEAMAKQMGGITLKGEEEALYTSESRSNNRPSTKRG--YNGDKRRSHQGTSQPERAQMNDNKSFQEKRFG------------------
        PSLVEFENLL+ QEAMAKQMGG++LKGEEEALYTS+SR   +  T  G   +GDK +++QG   P     + N+    K  G                  
Subjt:  PSLVEFENLLSSQEAMAKQMGGITLKGEEEALYTSESRSNNRPSTKRG--YNGDKRRSHQGTSQPERAQMNDNKSFQEKRFG------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ---------------------ESAYVDKIRKNETTDLSHARLGHISYHKLKLIMEKSMLKGLPQLEVKAD------------------------------
                             ESAYVD+ RKNET+DL H RLGH+SY KL ++++KSMLKGLPQL+V+ D                              
Subjt:  ---------------------ESAYVDKIRKNETTDLSHARLGHISYHKLKLIMEKSMLKGLPQLEVKAD------------------------------

Query:  ----------------------------------------------------------------------------------CSNKHQ------------
                                                                                          C  +HQ            
Subjt:  ----------------------------------------------------------------------------------CSNKHQ------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  --------SAKC----------------------------------------------------------------------------------------
                S +C                                                                                        
Subjt:  --------SAKC----------------------------------------------------------------------------------------

Query:  ---------------------AIVEDRVYEPETYEEASQNSVWQKAMEEEIIALEQNQTWELVPRLGDVKPISCKWVYKIKRRPDGSIERYKARLVARGF
                             AI+E+   EPET+EEAS++S W   M+EEI AL+QNQTW++VP++ DVKPISCKWVYKIKRRPDGSIERYKARLVARGF
Subjt:  ---------------------AIVEDRVYEPETYEEASQNSVWQKAMEEEIIALEQNQTWELVPRLGDVKPISCKWVYKIKRRPDGSIERYKARLVARGF

Query:  SQQYGLDYDETFSLVAKITTVRVLLALAASKDWKPWQMDVKNAFLHGELDREIYMNQPKGFESAANPNYVCKLRKALYGLKQAPRAWYGKIAEFLTQSGY
        SQQYGLDYDETFS VAK+TTVRVLLALAA+KDW  WQMDVKNAFLHGELDREIYM QP GF+S  +P YVCKLRKALYGLKQAPRAWYGKIAEFLT+SGY
Subjt:  SQQYGLDYDETFSLVAKITTVRVLLALAASKDWKPWQMDVKNAFLHGELDREIYMNQPKGFESAANPNYVCKLRKALYGLKQAPRAWYGKIAEFLTQSGY

Query:  SVAHTDSSLFIKEREGNLTIVLVYVDDLIITGDDEREIYQTRENLSVRFQMKELGELKHFLGLEVDRTDEGLFLCQQKYTRDMLQKFNMLECKQVSTPME
        SV   DSSLF+K  EG L IVLVYVDDLIITGDDE EI QT+ENLSVRFQMKELG+LKHFLGLEVDRT EG+FLCQQKY +D+L++F MLECK  STPME
Subjt:  SVAHTDSSLFIKEREGNLTIVLVYVDDLIITGDDEREIYQTRENLSVRFQMKELGELKHFLGLEVDRTDEGLFLCQQKYTRDMLQKFNMLECKQVSTPME

Query:  TNAKICAHEGKELNDETTYR
         N K+CAHEGK+L D T YR
Subjt:  TNAKICAHEGKELNDETTYR

KAE8733549.1 hypothetical protein F3Y22_tig00001120pilonHSYRG00173 [Hibiscus syriacus]5.0e-16538.31Show/hide
Query:  MLEHIWDDKTPKEAWDTFVMLFSKKNDTRLQLLENELLSISQHDMTIAQYFHKVKLICRKITELDPKSAIVESRMKRIIIHGLRSEYRSFIIAVQGWPTQ
        MLEHI D KTPKEAWDTFV LFSK+NDT+LQLLENELLS++Q DM +AQYFHKVK ICR+I+ELDP +AI E+R+KRII+HGLR EYR F+ AVQGWPTQ
Subjt:  MLEHIWDDKTPKEAWDTFVMLFSKKNDTRLQLLENELLSISQHDMTIAQYFHKVKLICRKITELDPKSAIVESRMKRIIIHGLRSEYRSFIIAVQGWPTQ

Query:  PSLVEFENLLSSQEAMAKQMGGITLKGEEEALYTSESRSNNRPSTKRG--YNGDKRRSHQGTSQPERAQMNDNKSFQEKRFG------------------
        PSLVEFENLL+ QEAMAKQMGG++LKGEEEALYTS+SR   +  T  G   +GDK +++QG   P     + N+    K  G                  
Subjt:  PSLVEFENLLSSQEAMAKQMGGITLKGEEEALYTSESRSNNRPSTKRG--YNGDKRRSHQGTSQPERAQMNDNKSFQEKRFG------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  --------------------------------------------------------------------ESAYVDKIRKNETTDLSHARLGHISYHKLKLI
                                                                            ESAYVD+ RKNET+DL H RLGH+SY KL ++
Subjt:  --------------------------------------------------------------------ESAYVDKIRKNETTDLSHARLGHISYHKLKLI

Query:  MEKSMLKGLPQLEVKAD-----------------------------------------------------------------------------------
        ++KSMLKGLPQL+V+ D                                                                                   
Subjt:  MEKSMLKGLPQLEVKAD-----------------------------------------------------------------------------------

Query:  -----------------------------CSNKHQ-----------------------------------------------------------------
                                     C  +HQ                                                                 
Subjt:  -----------------------------CSNKHQ-----------------------------------------------------------------

Query:  ---------------------------SAKC---------------------------------------------------------------------
                                   S +C                                                                     
Subjt:  ---------------------------SAKC---------------------------------------------------------------------

Query:  ----------------------------------------AIVEDRVYEPETYEEASQNSVWQKAMEEEIIALEQNQTWELVPRLGDVKPISCKWVYKIK
                                                AI+E+   EPET+EEAS++S W  AM+EEI AL+QNQTW++VP++ DVKPISCKWVYKIK
Subjt:  ----------------------------------------AIVEDRVYEPETYEEASQNSVWQKAMEEEIIALEQNQTWELVPRLGDVKPISCKWVYKIK

Query:  RRPDGSIERYKARLVARGFSQQYGLDYDETFSLVAKITTVRVLLALAASKDWKPWQMDVKNAFLHGELDREIYMNQPKGFESAANPNYVCKLRKALYGLK
        RRPDGSIERYKARLVARGFSQQYGLDYDETFS VAK+TTVRVLLALAA+KDW  WQMDVKNAFLHGELDREIYM QP GF+S  +P YVCKLRKALYGLK
Subjt:  RRPDGSIERYKARLVARGFSQQYGLDYDETFSLVAKITTVRVLLALAASKDWKPWQMDVKNAFLHGELDREIYMNQPKGFESAANPNYVCKLRKALYGLK

Query:  QAPRAWYGKIAEFLTQSGYSVAHTDSSLFIKEREGNLTIVLVYVDDLIITGDDEREIYQTRENLSVRFQMKELGELKHFLGLEVDRTDEGLFLCQQKYTR
        QAPRAWYGKIAEFLT+SGYSV   DSSLF+K  EG L IVLVYVDDLIITGDDE EI QT+ENLSVRFQMKELG+LKHFLGLEVDRT EG+FLCQQKY +
Subjt:  QAPRAWYGKIAEFLTQSGYSVAHTDSSLFIKEREGNLTIVLVYVDDLIITGDDEREIYQTRENLSVRFQMKELGELKHFLGLEVDRTDEGLFLCQQKYTR

Query:  DMLQKFNMLECKQVSTPMETNAKICAHEGKELNDETTYR
        D+L++F MLECK  STPME N K+CAHEGK+L D T YR
Subjt:  DMLQKFNMLECKQVSTPMETNAKICAHEGKELNDETTYR

TrEMBL top hitse value%identityAlignment
A0A2N9EQ78 Uncharacterized protein6.2e-16939.67Show/hide
Query:  MLEHIWDDKTPKEAWDTFVMLFSKKNDTRLQLLENELLSISQHDMTIAQYFHKVKLICRKITELDPKSAIVESRMKRIIIHGLRSEYRSFIIAVQGWPTQ
        MLEHI D KTPKEAWDTFV LFSKKNDTRLQLLENELLS++Q DMTIAQYFHKVK ICR+I++LDP + I ESR+KRIIIHGLR EYR F+ A+QGWPTQ
Subjt:  MLEHIWDDKTPKEAWDTFVMLFSKKNDTRLQLLENELLSISQHDMTIAQYFHKVKLICRKITELDPKSAIVESRMKRIIIHGLRSEYRSFIIAVQGWPTQ

Query:  PSLVEFENLLSSQEAMAKQMGGITLKGEEEALYTSESRSNNRPST--KRGYNGDKRRSHQGT--SQPERAQMN-------DNKSFQEKRFG---------
        PSLVEFENLL+ QEAMAKQMGG++LKGEEEALYTS++R   + +T  +   +GDK +SHQG   S P     N       D K +   + G         
Subjt:  PSLVEFENLLSSQEAMAKQMGGITLKGEEEALYTSESRSNNRPST--KRGYNGDKRRSHQGT--SQPERAQMN-------DNKSFQEKRFG---------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  --------------------------------------------------------------------ESAYVDKIRKNETTDLSHARLGHISYHKLKLI
                                                                            ESAYVD+ RKNETTDL H RLGH+SY KL ++
Subjt:  --------------------------------------------------------------------ESAYVDKIRKNETTDLSHARLGHISYHKLKLI

Query:  MEKSMLKGLPQLEVKAD-----------------------------------------------------------------------------------
        ++KSMLKGLPQL+V+ D                                                                                   
Subjt:  MEKSMLKGLPQLEVKAD-----------------------------------------------------------------------------------

Query:  -----------------------------CSNKHQ-----------------------------------------------------------------
                                     C  +HQ                                                                 
Subjt:  -----------------------------CSNKHQ-----------------------------------------------------------------

Query:  --------SAKC----------------------------------------------------------------------------------------
                S +C                                                                                        
Subjt:  --------SAKC----------------------------------------------------------------------------------------

Query:  ---------------------AIVED-RVYEPETYEEASQNSVWQKAMEEEIIALEQNQTWELVPRLGDVKPISCKWVYKIKRRPDGSIERYKARLVARG
                             AIVE+  + EPET+EEASQ+S W KAMEEEI AL+QNQTW+L+P+  DVKPISCKWVYKIKRRPDGSIERYKARLVARG
Subjt:  ---------------------AIVED-RVYEPETYEEASQNSVWQKAMEEEIIALEQNQTWELVPRLGDVKPISCKWVYKIKRRPDGSIERYKARLVARG

Query:  FSQQYGLDYDETFSLVAKITTVRVLLALAASKDWKPWQMDVKNAFLHGELDREIYMNQPKGFESAANPNYVCKLRKALYGLKQAPRAWYGKIAEFLTQSG
        FSQQYGLDYDETFS VAK+TTVRVLLALAA+KDW  WQMDVKNAFLHGELDREIYM QP GF++  +P YVCKLRKALYGLKQAPRAWYGKIAEFLTQSG
Subjt:  FSQQYGLDYDETFSLVAKITTVRVLLALAASKDWKPWQMDVKNAFLHGELDREIYMNQPKGFESAANPNYVCKLRKALYGLKQAPRAWYGKIAEFLTQSG

Query:  YSVAHTDSSLFIKEREGNLTIVLVYVDDLIITGDDEREIYQTRENLSVRFQMKELGELKHFLGLEVDRTDEGLFLCQQKYTRDMLQKFNMLECKQVSTPM
        YSV   D+SLF+K  EG L IVLVYVDDL+ITGDDE EI +T+ENLSVRFQMKELG+LKHFLGLEVDRT EG+FLCQQKY++D+L++F MLECK +S PM
Subjt:  YSVAHTDSSLFIKEREGNLTIVLVYVDDLIITGDDEREIYQTRENLSVRFQMKELGELKHFLGLEVDRTDEGLFLCQQKYTRDMLQKFNMLECKQVSTPM

Query:  ETNAKICAHEGKELNDETTYR
        E NAK+CAHEGK+L D T YR
Subjt:  ETNAKICAHEGKELNDETTYR

A0A2N9FUR0 CCHC-type domain-containing protein5.2e-16848.05Show/hide
Query:  MLEHIWDDKTPKEAWDTFVMLFSKKNDTRLQLLENELLSISQHDMTIAQYFHKVKLICRKITELDPKSAIVESRMKRIIIHGLRSEYRSFIIAVQGWPTQ
        MLEHI D KTPKEAWDTFV LFSKKNDTRLQLLENELLS++Q DMTIAQYFHKVK ICR+I+ELDP + I E+R+KRIIIH                   
Subjt:  MLEHIWDDKTPKEAWDTFVMLFSKKNDTRLQLLENELLSISQHDMTIAQYFHKVKLICRKITELDPKSAIVESRMKRIIIHGLRSEYRSFIIAVQGWPTQ

Query:  PSLVEFENLLSSQEAMAKQMGGITLKGEEEALYTSESRSNNRPST--KRGYNGDKRRSHQGT--SQPERAQMN-------DNKSFQEKRFG---------
                    QEAMAKQMGG++LKGEEEALYTS++R   + +T  +   +GDK +SHQG   S P     N       D K +   + G         
Subjt:  PSLVEFENLLSSQEAMAKQMGGITLKGEEEALYTSESRSNNRPST--KRGYNGDKRRSHQGT--SQPERAQMN-------DNKSFQEKRFG---------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  --------------------------------------------------------------------ESAYVDKIRKNETTDLSHARLGHISYHKLKLI
                                                                            ESAYVDK RKNET DL HARLGH+SYHKLK++
Subjt:  --------------------------------------------------------------------ESAYVDKIRKNETTDLSHARLGHISYHKLKLI

Query:  MEKSMLKGLPQLEVKAD--CSN-------KHQSAKC-------------AIVEDRVYEPETYEEASQ--NSVWQKAM------------------EEEII
        M KS++KGLPQLE + D  C+         +Q +K              A+ E ++ E E   E+ +   S WQ  +                  EE+I 
Subjt:  MEKSMLKGLPQLEVKAD--CSN-------KHQSAKC-------------AIVEDRVYEPETYEEASQ--NSVWQKAM------------------EEEII

Query:  ALEQNQTWELVPRLGDVKPISCKWVYKIKRRPDGSIERYKARLVARGFSQQYGLDYDETFSLVAKITTVRVLLALAASKDWKPWQMDVKNAFLHGELDRE
        AL+QNQTW+L+P+  DVKPISCKWVYKIKRR DGSIERYKARLVARGFSQQYGLDYDETFS VAK+TTVRVLLALAA+KDW  WQMDVKNAFLHGELDRE
Subjt:  ALEQNQTWELVPRLGDVKPISCKWVYKIKRRPDGSIERYKARLVARGFSQQYGLDYDETFSLVAKITTVRVLLALAASKDWKPWQMDVKNAFLHGELDRE

Query:  IYMNQPKGFESAANPNYVCKLRKALYGLKQAPRAWYGKIAEFLTQSGYSVAHTDSSLFIKEREGNLTIVLVYVDDLIITGDDEREIYQTRENLSVRFQMK
        IYM QP GF++  +P YVCKLRKALYGLKQAPRAWYGKIAEFLTQSGYSV   D+SLF+K  EG L IVLVYVDDL+ITGDDE EI +T+ENLSVRFQMK
Subjt:  IYMNQPKGFESAANPNYVCKLRKALYGLKQAPRAWYGKIAEFLTQSGYSVAHTDSSLFIKEREGNLTIVLVYVDDLIITGDDEREIYQTRENLSVRFQMK

Query:  ELGELKHFLGLEVDRTDEGLFLCQQKYTRDMLQKFNMLECKQVSTPMETNAKICAHEGKELNDETTYR
        ELG+LKHFLGLEVDRT EG+FLCQQKY++D+L++F MLECK +S PME+NAK+CAHEGK+L D T YR
Subjt:  ELGELKHFLGLEVDRTDEGLFLCQQKYTRDMLQKFNMLECKQVSTPMETNAKICAHEGKELNDETTYR

A0A2N9I4M6 Integrase catalytic domain-containing protein2.5e-17040.12Show/hide
Query:  MLEHIWDDKTPKEAWDTFVMLFSKKNDTRLQLLENELLSISQHDMTIAQYFHKVKLICRKITELDPKSAIVESRMKRIIIHGLRSEYRSFIIAVQGWPTQ
        MLEHI D KTPKEAWDTFV LFSKKNDTRLQLLENELLS++Q DMTIAQYFHKVK ICR+I++LDP + I ESR+KRIIIHGLR EYR F+ A+QGWPTQ
Subjt:  MLEHIWDDKTPKEAWDTFVMLFSKKNDTRLQLLENELLSISQHDMTIAQYFHKVKLICRKITELDPKSAIVESRMKRIIIHGLRSEYRSFIIAVQGWPTQ

Query:  PSLVEFENLLSSQEAMAKQMGGITLKGEEEALYTSESRSNNRPST--KRGYNGDKRRSHQGT--SQPERAQMN---------------------------
        PSLVEFENLL+ QEAMAKQMGG++LKGEEEALYTS++R   + +T  +   +GDK +SHQG   S P     N                           
Subjt:  PSLVEFENLLSSQEAMAKQMGGITLKGEEEALYTSESRSNNRPST--KRGYNGDKRRSHQGT--SQPERAQMN---------------------------

Query:  ------------------------------------------------------------------------------DNKSFQEKRF------------
                                                                                      D K +++ +             
Subjt:  ------------------------------------------------------------------------------DNKSFQEKRF------------

Query:  ------GESAYVDKIRKNETTDLSHARLGHISYHKLKLIMEKSMLKGLPQLEVKAD--------------------------------------------
               ESAYVD+ RKNETTDL H RLGH+SY KL ++++KSMLKGLPQL+V+ D                                            
Subjt:  ------GESAYVDKIRKNETTDLSHARLGHISYHKLKLIMEKSMLKGLPQLEVKAD--------------------------------------------

Query:  --------------------------------------------------------------------CSNKHQ--------------------------
                                                                            C  +HQ                          
Subjt:  --------------------------------------------------------------------CSNKHQ--------------------------

Query:  ----------------------------------------------------------------------------------------------SAKC--
                                                                                                      S +C  
Subjt:  ----------------------------------------------------------------------------------------------SAKC--

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -------AIVED-RVYEPETYEEASQNSVWQKAMEEEIIALEQNQTWELVPRLGDVKPISCKWVYKIKRRPDGSIERYKARLVARGFSQQYGLDYDETFS
               AIVE+  + EPET+EEASQ+S W KAMEEEI AL+QNQTW+L+P+  DVKPISCKWVYKIKRRPDGSIERYKARLVARGFSQQYGLDYDETFS
Subjt:  -------AIVED-RVYEPETYEEASQNSVWQKAMEEEIIALEQNQTWELVPRLGDVKPISCKWVYKIKRRPDGSIERYKARLVARGFSQQYGLDYDETFS

Query:  LVAKITTVRVLLALAASKDWKPWQMDVKNAFLHGELDREIYMNQPKGFESAANPNYVCKLRKALYGLKQAPRAWYGKIAEFLTQSGYSVAHTDSSLFIKE
         VAK+TTVRVLLALAA+KDW  WQMDVKNAFLHGELDREIYM QP GF++  +P YVCKLRKALYGLKQAPRAWYGKIAEFLTQSGYSV   D+SLF+K 
Subjt:  LVAKITTVRVLLALAASKDWKPWQMDVKNAFLHGELDREIYMNQPKGFESAANPNYVCKLRKALYGLKQAPRAWYGKIAEFLTQSGYSVAHTDSSLFIKE

Query:  REGNLTIVLVYVDDLIITGDDEREIYQTRENLSVRFQMKELGELKHFLGLEVDRTDEGLFLCQQKYTRDMLQKFNMLECKQVSTPMETNAKICAHEGKEL
         EG L IVLVYVDDL+ITGDDE EI +T+ENLSVRFQMKELG+LKHFLGLEVDRT EG+FLCQQKY++D+L++F MLECK +S PME NAK+CAHEGK+L
Subjt:  REGNLTIVLVYVDDLIITGDDEREIYQTRENLSVRFQMKELGELKHFLGLEVDRTDEGLFLCQQKYTRDMLQKFNMLECKQVSTPMETNAKICAHEGKEL

Query:  NDETTYR
         D T YR
Subjt:  NDETTYR

A0A2N9IS08 Uncharacterized protein1.1e-16839.57Show/hide
Query:  MLEHIWDDKTPKEAWDTFVMLFSKKNDTRLQLLENELLSISQHDMTIAQYFHKVKLICRKITELDPKSAIVESRMKRIIIHGLRSEYRSFIIAVQGWPTQ
        MLEHI D KTPKEAWDTFV LFSKKNDTRLQLLENELLS++Q DMTIAQYFHKVK ICR+I++LDP + I ESR+KRIIIHGLR EYR F+ A+QGWPTQ
Subjt:  MLEHIWDDKTPKEAWDTFVMLFSKKNDTRLQLLENELLSISQHDMTIAQYFHKVKLICRKITELDPKSAIVESRMKRIIIHGLRSEYRSFIIAVQGWPTQ

Query:  PSLVEFENLLSSQEAMAKQMGGITLKGEEEALYTSESRSNNRPST--KRGYNGDKRRSHQGT--SQPERAQMN-------DNKSFQEKRFG---------
        PSLVEFENLL+ QEAMAKQMGG++LKGEEEALYTS++R   + +T  +   +GDK +SHQG   S P     N       D K +   + G         
Subjt:  PSLVEFENLLSSQEAMAKQMGGITLKGEEEALYTSESRSNNRPST--KRGYNGDKRRSHQGT--SQPERAQMN-------DNKSFQEKRFG---------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ---------------------------------------ESAYVDKIRKNETTDLSHARLGHISYHKLKLIMEKSMLKGLPQLEVKAD------------
                                               ESAYVD+ RKNETTDL H RLGH+SY KL ++++KSMLKGLPQL+V+ D            
Subjt:  ---------------------------------------ESAYVDKIRKNETTDLSHARLGHISYHKLKLIMEKSMLKGLPQLEVKAD------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  CSNKHQ----------------------------------------------------------------------------------------------
        C  +HQ                                                                                              
Subjt:  CSNKHQ----------------------------------------------------------------------------------------------

Query:  --------------------------SAKCAIVEDRVYE-------------------------------------------------------------
                                  S +C    D V++                                                             
Subjt:  --------------------------SAKCAIVEDRVYE-------------------------------------------------------------

Query:  --------------------------PETYEEASQNSVWQKAMEEEIIALEQNQTWELVPRLGDVKPISCKWVYKIKRRPDGSIERYKARLVARGFSQQY
                                  PET+EEASQ+S W KAMEEEI AL+QNQTW+L+P+  DVKPISCKWVYKIKRRPDGSIERYKARLVARGFSQQY
Subjt:  --------------------------PETYEEASQNSVWQKAMEEEIIALEQNQTWELVPRLGDVKPISCKWVYKIKRRPDGSIERYKARLVARGFSQQY

Query:  GLDYDETFSLVAKITTVRVLLALAASKDWKPWQMDVKNAFLHGELDREIYMNQPKGFESAANPNYVCKLRKALYGLKQAPRAWYGKIAEFLTQSGYSVAH
        GLDYDETFS VAK+TTVRVLLALAA+KDW  WQMDVKNAFLHGELDREIYM QP GF++  +P YVCKLRKALYGLKQAPRAWYGKIAEFLTQSGYSV  
Subjt:  GLDYDETFSLVAKITTVRVLLALAASKDWKPWQMDVKNAFLHGELDREIYMNQPKGFESAANPNYVCKLRKALYGLKQAPRAWYGKIAEFLTQSGYSVAH

Query:  TDSSLFIKEREGNLTIVLVYVDDLIITGDDEREIYQTRENLSVRFQMKELGELKHFLGLEVDRTDEGLFLCQQKYTRDMLQKFNMLECKQVSTPMETNAK
         D+SLF+K  EG L IVLVYVDDL+ITGDDE EI +T+ENLSVRFQMKELG+LKHFLGLEVDRT EG+FLCQQKY++D+L++F MLECK +S PME NAK
Subjt:  TDSSLFIKEREGNLTIVLVYVDDLIITGDDEREIYQTRENLSVRFQMKELGELKHFLGLEVDRTDEGLFLCQQKYTRDMLQKFNMLECKQVSTPMETNAK

Query:  ICAHEGKELNDETTYR
        +CAHEGK+L D T YR
Subjt:  ICAHEGKELNDETTYR

A5BGK7 CCHC-type domain-containing protein2.1e-16948.21Show/hide
Query:  MLEHIWDDKTPKEAWDTFVMLFSKKNDTRLQLLENELLSISQHDMTIAQYFHKVKLICRKITELDPKSAIVESRMKRIIIHGLRSEYRSFIIAVQGWPTQ
        +LEHI D KTP EAW+TF  LFSKKNDTRLQLLE+ELL ++Q ++TIAQYFHKVK +CR+I ELD ++ I E+RMKRIIIHGLR E R F+ AVQGW  Q
Subjt:  MLEHIWDDKTPKEAWDTFVMLFSKKNDTRLQLLENELLSISQHDMTIAQYFHKVKLICRKITELDPKSAIVESRMKRIIIHGLRSEYRSFIIAVQGWPTQ

Query:  PSLVEFENLLSSQEAMAKQMGGITLKGEEEALYTSESRSNNRPST--KRGYNGDKRRSHQGTS----QPERAQMNDNKSFQEKRF---------------
        PSLVEFENLL+ QEA+AKQMGG++LKGEE+ALY  + R N++  T  +   N DK +S QG      + +   +   K FQ + +               
Subjt:  PSLVEFENLLSSQEAMAKQMGGITLKGEEEALYTSESRSNNRPST--KRGYNGDKRRSHQGTS----QPERAQMNDNKSFQEKRF---------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ---------------------------------------GESAYVDKIRKNETTDLSHARLGHISYHKLKLIMEKSMLKGLPQLEVKAD-----CSNKH-
                                                E+AYV+K RKNET +L H RL HISY KL ++M+KSMLKGLP+LE++ D     C  ++ 
Subjt:  ---------------------------------------GESAYVDKIRKNETTDLSHARLGHISYHKLKLIMEKSMLKGLPQLEVKAD-----CSNKH-

Query:  ------------------------------------------------QSAKCAIVED-RVYEPETYEEASQNSVWQKAMEEEIIALEQNQTWELVPRLG
                                                        + A  AIVED    EPET+ EA QN  W KA++EEI AL+QNQTWELVP+  
Subjt:  ------------------------------------------------QSAKCAIVED-RVYEPETYEEASQNSVWQKAMEEEIIALEQNQTWELVPRLG

Query:  DVKPISCKWVYKIKRRPDGSIERYKARLVARGFSQQYGLDYDETFSLVAKITTVRVLLALAASKDWKPWQMDVKNAFLHGELDREIYMNQPKGFESAANP
        DV+PISCKWVYKIKRR DGSIER+KARLVARGFSQQYGLDYDETFS VAK+TTVRVLLALAA+KDW  WQMDVKNAFLHGELDREIYMNQP GF S  +P
Subjt:  DVKPISCKWVYKIKRRPDGSIERYKARLVARGFSQQYGLDYDETFSLVAKITTVRVLLALAASKDWKPWQMDVKNAFLHGELDREIYMNQPKGFESAANP

Query:  NYVCKLRKALYGLKQAPRAWYGKIAEFLTQSGYSVAHTDSSLFIKEREGNLTIVLVYVDDLIITGDDEREIYQTRENLSVRFQMKELGELKHFLGLEVDR
         YVCKLRKALYGLKQAPRAWY                 DSSLF+K   G L IVLVYVDDLIIT DD  EI++T ENLSVRF+MKELG+LKHFLGLEVD 
Subjt:  NYVCKLRKALYGLKQAPRAWYGKIAEFLTQSGYSVAHTDSSLFIKEREGNLTIVLVYVDDLIITGDDEREIYQTRENLSVRFQMKELGELKHFLGLEVDR

Query:  TDEGLFLCQQKYTRDMLQKFNMLECKQVSTPMETNAKICAHEGKELNDETTYR
        T EG+FLCQQKY +D+L+KF MLECK +STPME NAK+C HEGK+L D T YR
Subjt:  TDEGLFLCQQKYTRDMLQKFNMLECKQVSTPMETNAKICAHEGKELNDETTYR

SwissProt top hitse value%identityAlignment
P04146 Copia protein1.1e-4832.21Show/hide
Query:  NNRPSTKRGYNGDKRRSHQGTSQPERAQMNDNKSFQEK--RFGESAYVDKIRKNETTDLSHARLGHISYHKLKLIMEKSMLKGLPQLEVKADCS--NKHQ
        +++ S K   N  K+R         +   N N+S + +     +   +D   KN+  ++ + R                 LK  PQ+    + +  NK  
Subjt:  NNRPSTKRGYNGDKRRSHQGTSQPERAQMNDNKSFQEK--RFGESAYVDKIRKNETTDLSHARLGHISYHKLKLIMEKSMLKGLPQLEVKADCS--NKHQ

Query:  SAKCAIVEDRVYEPETYEEASQNSVWQKAMEEEIIALEQNQTWELVPRLGDVKPISCKWVYKIKRRPDGSIERYKARLVARGFSQQYGLDYDETFSLVAK
             I  D     +  +     S W++A+  E+ A + N TW +  R  +   +  +WV+ +K    G+  RYKARLVARGF+Q+Y +DY+ETF+ VA+
Subjt:  SAKCAIVEDRVYEPETYEEASQNSVWQKAMEEEIIALEQNQTWELVPRLGDVKPISCKWVYKIKRRPDGSIERYKARLVARGFSQQYGLDYDETFSLVAK

Query:  ITTVRVLLALAASKDWKPWQMDVKNAFLHGELDREIYMNQPKGFESAANPNYVCKLRKALYGLKQAPRAWYGKIAEFLTQSGYSVAHTDSSLFIKEREGN
        I++ R +L+L    + K  QMDVK AFL+G L  EIYM  P+G   + N + VCKL KA+YGLKQA R W+    + L +  +  +  D  ++I ++ GN
Subjt:  ITTVRVLLALAASKDWKPWQMDVKNAFLHGELDREIYMNQPKGFESAANPNYVCKLRKALYGLKQAPRAWYGKIAEFLTQSGYSVAHTDSSLFIKEREGN

Query:  LT---IVLVYVDDLIITGDDEREIYQTRENLSVRFQMKELGELKHFLGLEVDRTDEGLFLCQQKYTRDMLQKFNMLECKQVSTPM
        +     VL+YVDD++I   D   +   +  L  +F+M +L E+KHF+G+ ++  ++ ++L Q  Y + +L KFNM  C  VSTP+
Subjt:  LT---IVLVYVDDLIITGDDEREIYQTRENLSVRFQMKELGELKHFLGLEVDRTDEGLFLCQQKYTRDMLQKFNMLECKQVSTPM

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-948.0e-6543.48Show/hide
Query:  SNKHQSAKCAIVEDRVYEPETYEEA---SQNSVWQKAMEEEIIALEQNQTWELVPRLGDVKPISCKWVYKIKRRPDGSIERYKARLVARGFSQQYGLDYD
        S ++ S +  ++ D   EPE+ +E     + +   KAM+EE+ +L++N T++LV      +P+ CKWV+K+K+  D  + RYKARLV +GF Q+ G+D+D
Subjt:  SNKHQSAKCAIVEDRVYEPETYEEA---SQNSVWQKAMEEEIIALEQNQTWELVPRLGDVKPISCKWVYKIKRRPDGSIERYKARLVARGFSQQYGLDYD

Query:  ETFSLVAKITTVRVLLALAASKDWKPWQMDVKNAFLHGELDREIYMNQPKGFESAANPNYVCKLRKALYGLKQAPRAWYGKIAEFLTQSGYSVAHTDSSL
        E FS V K+T++R +L+LAAS D +  Q+DVK AFLHG+L+ EIYM QP+GFE A   + VCKL K+LYGLKQAPR WY K   F+    Y   ++D  +
Subjt:  ETFSLVAKITTVRVLLALAASKDWKPWQMDVKNAFLHGELDREIYMNQPKGFESAANPNYVCKLRKALYGLKQAPRAWYGKIAEFLTQSGYSVAHTDSSL

Query:  FIKE-REGNLTIVLVYVDDLIITGDDEREIYQTRENLSVRFQMKELGELKHFLGLEV--DRTDEGLFLCQQKYTRDMLQKFNMLECKQVSTPMETNAKI
        + K   E N  I+L+YVDD++I G D+  I + + +LS  F MK+LG  +  LG+++  +RT   L+L Q+KY   +L++FNM   K VSTP+  + K+
Subjt:  FIKE-REGNLTIVLVYVDDLIITGDDEREIYQTRENLSVRFQMKELGELKHFLGLEV--DRTDEGLFLCQQKYTRDMLQKFNMLECKQVSTPMETNAKI

P25600 Putative transposon Ty5-1 protein YCL074W6.0e-2030.43Show/hide
Query:  MDVKNAFLHGELDREIYMNQPKGFESAANPNYVCKLRKALYGLKQAPRAWYGKIAEFLTQSGYSVAHTDSSLFIKEREGNLTIVLVYVDDLIITGDDERE
        MDV  AFL+  +D  IY+ QP GF +  NP+YV +L   +YGLKQAP  W   I   L + G+     +  L+ +        + VYVDDL++     + 
Subjt:  MDVKNAFLHGELDREIYMNQPKGFESAANPNYVCKLRKALYGLKQAPRAWYGKIAEFLTQSGYSVAHTDSSLFIKEREGNLTIVLVYVDDLIITGDDERE

Query:  IYQTRENLSVRFQMKELGELKHFLGLEVDRTDEG-LFLCQQKYTRDMLQKFNMLECKQVSTPMETNAKICAHEGKELNDETTYR
          + ++ L+  + MK+LG++  FLGL + ++  G + L  Q Y      +  +   K   TP+  +  +       L D T Y+
Subjt:  IYQTRENLSVRFQMKELGELKHFLGLEVDRTDEG-LFLCQQKYTRDMLQKFNMLECKQVSTPMETNAKICAHEGKELNDETTYR

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE11.8e-7246.92Show/hide
Query:  EPETYEEASQNSVWQKAMEEEIIALEQNQTWELV-PRLGDVKPISCKWVYKIKRRPDGSIERYKARLVARGFSQQYGLDYDETFSLVAKITTVRVLLALA
        EP T  +A ++  W+ AM  EI A   N TW+LV P    V  + C+W++  K   DGS+ RYKARLVA+G++Q+ GLDY ETFS V K T++R++L +A
Subjt:  EPETYEEASQNSVWQKAMEEEIIALEQNQTWELV-PRLGDVKPISCKWVYKIKRRPDGSIERYKARLVARGFSQQYGLDYDETFSLVAKITTVRVLLALA

Query:  ASKDWKPWQMDVKNAFLHGELDREIYMNQPKGFESAANPNYVCKLRKALYGLKQAPRAWYGKIAEFLTQSGYSVAHTDSSLFIKEREGNLTIVLVYVDDL
          + W   Q+DV NAFL G L  ++YM+QP GF     PNYVCKLRKALYGLKQAPRAWY ++  +L   G+  + +D+SLF+ +R  ++  +LVYVDD+
Subjt:  ASKDWKPWQMDVKNAFLHGELDREIYMNQPKGFESAANPNYVCKLRKALYGLKQAPRAWYGKIAEFLTQSGYSVAHTDSSLFIKEREGNLTIVLVYVDDL

Query:  IITGDDEREIYQTRENLSVRFQMKELGELKHFLGLEVDRTDEGLFLCQQKYTRDMLQKFNMLECKQVSTPMETNAKICAHEGKELNDETTYR
        +ITG+D   ++ T +NLS RF +K+  EL +FLG+E  R   GL L Q++Y  D+L + NM+  K V+TPM  + K+  + G +L D T YR
Subjt:  IITGDDEREIYQTRENLSVRFQMKELGELKHFLGLEVDRTDEGLFLCQQKYTRDMLQKFNMLECKQVSTPMETNAKICAHEGKELNDETTYR

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE22.3e-7247.26Show/hide
Query:  EPETYEEASQNSVWQKAMEEEIIALEQNQTWELV-PRLGDVKPISCKWVYKIKRRPDGSIERYKARLVARGFSQQYGLDYDETFSLVAKITTVRVLLALA
        EP T  +A ++  W++AM  EI A   N TW+LV P    V  + C+W++  K   DGS+ RYKARLVA+G++Q+ GLDY ETFS V K T++R++L +A
Subjt:  EPETYEEASQNSVWQKAMEEEIIALEQNQTWELV-PRLGDVKPISCKWVYKIKRRPDGSIERYKARLVARGFSQQYGLDYDETFSLVAKITTVRVLLALA

Query:  ASKDWKPWQMDVKNAFLHGELDREIYMNQPKGFESAANPNYVCKLRKALYGLKQAPRAWYGKIAEFLTQSGYSVAHTDSSLFIKEREGNLTIVLVYVDDL
          + W   Q+DV NAFL G L  E+YM+QP GF     P+YVC+LRKA+YGLKQAPRAWY ++  +L   G+  + +D+SLF+ +R  ++  +LVYVDD+
Subjt:  ASKDWKPWQMDVKNAFLHGELDREIYMNQPKGFESAANPNYVCKLRKALYGLKQAPRAWYGKIAEFLTQSGYSVAHTDSSLFIKEREGNLTIVLVYVDDL

Query:  IITGDDEREIYQTRENLSVRFQMKELGELKHFLGLEVDRTDEGLFLCQQKYTRDMLQKFNMLECKQVSTPMETNAKICAHEGKELNDETTYR
        +ITG+D   +  T + LS RF +KE  +L +FLG+E  R  +GL L Q++YT D+L + NML  K V+TPM T+ K+  H G +L D T YR
Subjt:  IITGDDEREIYQTRENLSVRFQMKELGELKHFLGLEVDRTDEGLFLCQQKYTRDMLQKFNMLECKQVSTPMETNAKICAHEGKELNDETTYR

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 81.0e-7044.75Show/hide
Query:  EPETYEEASQNSVWQKAMEEEIIALEQNQTWELVPRLGDVKPISCKWVYKIKRRPDGSIERYKARLVARGFSQQYGLDYDETFSLVAKITTVRVLLALAA
        EP TY EA +  VW  AM++EI A+E   TWE+     + KPI CKWVYKIK   DG+IERYKARLVA+G++QQ G+D+ ETFS V K+T+V+++LA++A
Subjt:  EPETYEEASQNSVWQKAMEEEIIALEQNQTWELVPRLGDVKPISCKWVYKIKRRPDGSIERYKARLVARGFSQQYGLDYDETFSLVAKITTVRVLLALAA

Query:  SKDWKPWQMDVKNAFLHGELDREIYMNQPKGFES----AANPNYVCKLRKALYGLKQAPRAWYGKIAEFLTQSGYSVAHTDSSLFIKEREGNLTIVLVYV
          ++   Q+D+ NAFL+G+LD EIYM  P G+ +    +  PN VC L+K++YGLKQA R W+ K +  L   G+  +H+D + F+K        VLVYV
Subjt:  SKDWKPWQMDVKNAFLHGELDREIYMNQPKGFES----AANPNYVCKLRKALYGLKQAPRAWYGKIAEFLTQSGYSVAHTDSSLFIKEREGNLTIVLVYV

Query:  DDLIITGDDEREIYQTRENLSVRFQMKELGELKHFLGLEVDRTDEGLFLCQQKYTRDMLQKFNMLECKQVSTPMETNAKICAHEGKELNDETTYR
        DD+II  +++  + + +  L   F++++LG LK+FLGLE+ R+  G+ +CQ+KY  D+L +  +L CK  S PM+ +    AH G +  D   YR
Subjt:  DDLIITGDDEREIYQTRENLSVRFQMKELGELKHFLGLEVDRTDEGLFLCQQKYTRDMLQKFNMLECKQVSTPMETNAKICAHEGKELNDETTYR

ATMG00810.1 DNA/RNA polymerases superfamily protein1.8e-1140.51Show/hide
Query:  VLVYVDDLIITGDDEREIYQTRENLSVRFQMKELGELKHFLGLEVDRTDEGLFLCQQKYTRDMLQKFNMLECKQVSTPM
        +L+YVDD+++TG     +      LS  F MK+LG + +FLG+++     GLFL Q KY   +L    ML+CK +STP+
Subjt:  VLVYVDDLIITGDDEREIYQTRENLSVRFQMKELGELKHFLGLEVDRTDEGLFLCQQKYTRDMLQKFNMLECKQVSTPM

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)3.7e-1743.43Show/hide
Query:  EPETYEEASQNSVWQKAMEEEIIALEQNQTWELVPRLGDVKPISCKWVYKIKRRPDGSIERYKARLVARGFSQQYGLDYDETFSLVAKITTVRVLLALA
        EP++   A ++  W +AM+EE+ AL +N+TW LVP   +   + CKWV+K K   DG+++R KARLVA+GF Q+ G+ + ET+S V +  T+R +L +A
Subjt:  EPETYEEASQNSVWQKAMEEEIIALEQNQTWELVPRLGDVKPISCKWVYKIKRRPDGSIERYKARLVARGFSQQYGLDYDETFSLVAKITTVRVLLALA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTAGAGCATATTTGGGATGACAAGACACCGAAAGAAGCATGGGACACGTTCGTGATGTTGTTCTCAAAGAAGAACGATACAAGGCTACAACTTCTGGAGAATGAGTT
GTTGTCAATTTCACAACATGACATGACGATTGCTCAGTACTTCCACAAGGTCAAGTTGATCTGTCGGAAGATTACTGAACTAGACCCAAAGTCTGCCATTGTAGAATCTC
GAATGAAGAGGATTATAATCCACGGATTGCGATCGGAATATCGAAGCTTCATTATTGCTGTACAAGGATGGCCCACTCAACCATCACTTGTAGAGTTCGAAAATTTGTTA
TCCAGCCAAGAAGCTATGGCTAAACAAATGGGAGGCATCACATTGAAGGGTGAAGAAGAAGCACTCTACACAAGTGAAAGTCGGAGCAATAATAGGCCGTCTACCAAACG
TGGATACAATGGTGACAAAAGAAGAAGCCACCAAGGAACTTCACAACCCGAGAGAGCTCAAATGAACGACAACAAGAGTTTCCAAGAAAAGAGATTTGGAGAGTCTGCCT
ATGTTGACAAGATCCGAAAGAATGAGACGACAGATCTATCGCATGCAAGATTGGGACATATTAGCTACCATAAGTTGAAGCTGATTATGGAGAAATCCATGCTCAAAGGT
CTACCTCAACTTGAAGTCAAAGCAGACTGCTCAAACAAGCATCAATCAGCGAAATGCGCTATTGTAGAAGATAGAGTTTATGAACCAGAGACATATGAAGAAGCATCACA
AAACTCGGTTTGGCAAAAAGCGATGGAGGAAGAAATTATAGCCCTGGAGCAGAATCAAACTTGGGAACTAGTGCCAAGACTAGGAGATGTCAAACCCATCTCTTGCAAGT
GGGTCTACAAAATAAAGCGTCGACCGGATGGATCAATCGAGAGATACAAGGCTCGACTCGTGGCTCGAGGATTTTCTCAACAATATGGACTAGACTATGATGAAACATTC
AGCCTAGTGGCAAAGATTACTACCGTACGAGTCCTGCTAGCACTCGCAGCAAGTAAAGATTGGAAACCGTGGCAAATGGATGTGAAGAATGCCTTCTTGCACGGAGAGTT
AGACAGGGAGATTTATATGAACCAACCAAAGGGATTTGAGAGTGCAGCTAATCCTAATTATGTATGCAAGCTTAGAAAAGCTCTTTATGGACTGAAGCAAGCACCGAGAG
CTTGGTATGGTAAGATTGCTGAATTTCTTACCCAAAGTGGTTATTCAGTTGCGCATACAGACTCAAGCCTATTCATCAAAGAAAGAGAAGGAAATTTGACAATTGTATTG
GTCTACGTGGACGATTTGATTATCACCGGGGACGACGAAAGAGAAATTTATCAAACAAGAGAAAATTTATCAGTACGCTTTCAGATGAAAGAGCTAGGAGAGCTTAAACA
CTTCTTAGGCCTAGAAGTTGATCGCACAGATGAAGGATTGTTTCTCTGCCAACAAAAGTATACAAGAGATATGCTTCAGAAGTTCAACATGTTAGAGTGCAAGCAAGTTT
CAACACCGATGGAGACAAATGCCAAGATTTGTGCACATGAAGGCAAAGAGTTGAACGATGAAACAACATACCGATAA
mRNA sequenceShow/hide mRNA sequence
ATGTTAGAGCATATTTGGGATGACAAGACACCGAAAGAAGCATGGGACACGTTCGTGATGTTGTTCTCAAAGAAGAACGATACAAGGCTACAACTTCTGGAGAATGAGTT
GTTGTCAATTTCACAACATGACATGACGATTGCTCAGTACTTCCACAAGGTCAAGTTGATCTGTCGGAAGATTACTGAACTAGACCCAAAGTCTGCCATTGTAGAATCTC
GAATGAAGAGGATTATAATCCACGGATTGCGATCGGAATATCGAAGCTTCATTATTGCTGTACAAGGATGGCCCACTCAACCATCACTTGTAGAGTTCGAAAATTTGTTA
TCCAGCCAAGAAGCTATGGCTAAACAAATGGGAGGCATCACATTGAAGGGTGAAGAAGAAGCACTCTACACAAGTGAAAGTCGGAGCAATAATAGGCCGTCTACCAAACG
TGGATACAATGGTGACAAAAGAAGAAGCCACCAAGGAACTTCACAACCCGAGAGAGCTCAAATGAACGACAACAAGAGTTTCCAAGAAAAGAGATTTGGAGAGTCTGCCT
ATGTTGACAAGATCCGAAAGAATGAGACGACAGATCTATCGCATGCAAGATTGGGACATATTAGCTACCATAAGTTGAAGCTGATTATGGAGAAATCCATGCTCAAAGGT
CTACCTCAACTTGAAGTCAAAGCAGACTGCTCAAACAAGCATCAATCAGCGAAATGCGCTATTGTAGAAGATAGAGTTTATGAACCAGAGACATATGAAGAAGCATCACA
AAACTCGGTTTGGCAAAAAGCGATGGAGGAAGAAATTATAGCCCTGGAGCAGAATCAAACTTGGGAACTAGTGCCAAGACTAGGAGATGTCAAACCCATCTCTTGCAAGT
GGGTCTACAAAATAAAGCGTCGACCGGATGGATCAATCGAGAGATACAAGGCTCGACTCGTGGCTCGAGGATTTTCTCAACAATATGGACTAGACTATGATGAAACATTC
AGCCTAGTGGCAAAGATTACTACCGTACGAGTCCTGCTAGCACTCGCAGCAAGTAAAGATTGGAAACCGTGGCAAATGGATGTGAAGAATGCCTTCTTGCACGGAGAGTT
AGACAGGGAGATTTATATGAACCAACCAAAGGGATTTGAGAGTGCAGCTAATCCTAATTATGTATGCAAGCTTAGAAAAGCTCTTTATGGACTGAAGCAAGCACCGAGAG
CTTGGTATGGTAAGATTGCTGAATTTCTTACCCAAAGTGGTTATTCAGTTGCGCATACAGACTCAAGCCTATTCATCAAAGAAAGAGAAGGAAATTTGACAATTGTATTG
GTCTACGTGGACGATTTGATTATCACCGGGGACGACGAAAGAGAAATTTATCAAACAAGAGAAAATTTATCAGTACGCTTTCAGATGAAAGAGCTAGGAGAGCTTAAACA
CTTCTTAGGCCTAGAAGTTGATCGCACAGATGAAGGATTGTTTCTCTGCCAACAAAAGTATACAAGAGATATGCTTCAGAAGTTCAACATGTTAGAGTGCAAGCAAGTTT
CAACACCGATGGAGACAAATGCCAAGATTTGTGCACATGAAGGCAAAGAGTTGAACGATGAAACAACATACCGATAA
Protein sequenceShow/hide protein sequence
MLEHIWDDKTPKEAWDTFVMLFSKKNDTRLQLLENELLSISQHDMTIAQYFHKVKLICRKITELDPKSAIVESRMKRIIIHGLRSEYRSFIIAVQGWPTQPSLVEFENLL
SSQEAMAKQMGGITLKGEEEALYTSESRSNNRPSTKRGYNGDKRRSHQGTSQPERAQMNDNKSFQEKRFGESAYVDKIRKNETTDLSHARLGHISYHKLKLIMEKSMLKG
LPQLEVKADCSNKHQSAKCAIVEDRVYEPETYEEASQNSVWQKAMEEEIIALEQNQTWELVPRLGDVKPISCKWVYKIKRRPDGSIERYKARLVARGFSQQYGLDYDETF
SLVAKITTVRVLLALAASKDWKPWQMDVKNAFLHGELDREIYMNQPKGFESAANPNYVCKLRKALYGLKQAPRAWYGKIAEFLTQSGYSVAHTDSSLFIKEREGNLTIVL
VYVDDLIITGDDEREIYQTRENLSVRFQMKELGELKHFLGLEVDRTDEGLFLCQQKYTRDMLQKFNMLECKQVSTPMETNAKICAHEGKELNDETTYR