; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmoCh00G000930 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene IDCmoCh00G000930
OrganismCucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
DescriptionReverse transcriptase
Genome locationCmo_Chr00:17523483..17527912
RNA-Seq ExpressionCmoCh00G000930
SyntenyCmoCh00G000930
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0090304 - nucleic acid metabolic process (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
GO:0016740 - transferase activity (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR021109 - Aspartic peptidase domain superfamily
IPR041577 - Reverse transcriptase/retrotransposon-derived protein, RNase H-like domain
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0033251.1 pol protein [Cucumis melo var. makuwa]3.9e-17744.01Show/hide
Query:  HALSVGTPAGVDLVTKDRVRDGQVVIAGQTIHVDLKVVDMMDFDVILGMDWLAENFATIDCHKKEVIFTPPNGLTFKFKGTSTGTTPKIISMMKARRLIQ
        H LSV TP+G  +++K++V+  Q+ IAG  I V L V+DM+DFDVILGMDWLA N A IDC +KEV F PP+  +FKFKG  +   P++IS ++A +L+ 
Subjt:  HALSVGTPAGVDLVTKDRVRDGQVVIAGQTIHVDLKVVDMMDFDVILGMDWLAENFATIDCHKKEVIFTPPNGLTFKFKGTSTGTTPKIISMMKARRLIQ

Query:  QGGWAFLAYAVNTKGKEKPIDTIPVVNEFMDVFREDLPGIPPSREVDFGIDLEPGTGPISKAPYRMAPAELKELKTQLQDLLDKGFIRPSVSPWGALVLF
        QG W  LA  V+T+  +  + + PVV +++DVF ++LPG+PP REV+F I+LEPGT PIS+APYRMAP ELKELK QLQ+LLDKGFIR SVSPWGA VLF
Subjt:  QGGWAFLAYAVNTKGKEKPIDTIPVVNEFMDVFREDLPGIPPSREVDFGIDLEPGTGPISKAPYRMAPAELKELKTQLQDLLDKGFIRPSVSPWGALVLF

Query:  VKKKDGSMRLCIDYRELNKRTVKNKYPLPRIEDLFDQLRGATVFSKIDLRSGYHQIKIKNEDIPKTAFRTRYGHYEFVVMSFGLTNAP------------
        VKKKDG MRLCIDYRELNK TVKN+YPLPRI+DLFDQL+GATVFSKIDLRSGYHQ++IK+ D+PKTAFR+RYGHYEF+VMSFGLTNAP            
Subjt:  VKKKDGSMRLCIDYRELNKRTVKNKYPLPRIEDLFDQLRGATVFSKIDLRSGYHQIKIKNEDIPKTAFRTRYGHYEFVVMSFGLTNAP------------

Query:  ----------------------------------------------------MVSFLGHIVSKDGIFVDPNKIEAVTKWKRPTTVTEIRSFLGLAGYYRR
                                                             VSFLGH+VSK G+ VDP KIEAVT W RP+TV+E+RSFLGLAGYYRR
Subjt:  ----------------------------------------------------MVSFLGHIVSKDGIFVDPNKIEAVTKWKRPTTVTEIRSFLGLAGYYRR

Query:  FVQDFARIATPLTQLTKKGVPFVWDDTCEVSFQELKQRLVSAPVLTVPESSVGYAIYSDASKKGLGCVLMQHGKVVAW----------------------
        FV++F+RIATPLTQLT+KG PFVW   CE SFQ LKQ+LV+APVLTVP+ S  + IYSDASKKGLGCVLMQ GKVVA+                      
Subjt:  FVQDFARIATPLTQLTKKGVPFVWDDTCEVSFQELKQRLVSAPVLTVPESSVGYAIYSDASKKGLGCVLMQHGKVVAW----------------------

Query:  ------------------------------NKDVS------------------------------------------RLEKTLLVAKYEEGY----RGVR
                                       K+++                                           L + ++ A+  + Y    RG+ 
Subjt:  ------------------------------NKDVS------------------------------------------RLEKTLLVAKYEEGY----RGVR

Query:  ELTQNIKGLYGNLGG-----------------------RGSP---------------------------------DKVGSLP------------------
        E  Q ++    + GG                         SP                                 D +  LP                  
Subjt:  ELTQNIKGLYGNLGG-----------------------RGSP---------------------------------DKVGSLP------------------

Query:  -------------------------------------------------------TGFQATIGMAPFEALYGKRCRSPLCWDEVGERELIGPELVHVTNE
                                                               T +QATIGMAPFEALYGK CRSP+CW EVGE+ L+GPELV  TNE
Subjt:  -------------------------------------------------------TGFQATIGMAPFEALYGKRCRSPLCWDEVGERELIGPELVHVTNE

Query:  AIQKIRVRMRITQSRQKSYADVRRRNLEFEEGDPVFLKVAPMK
        AI+KIR RM   QSRQKSYADVRR++LEFE GD VFLKVAPM+
Subjt:  AIQKIRVRMRITQSRQKSYADVRRRNLEFEEGDPVFLKVAPMK

XP_022933231.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111440131 [Cucurbita moschata]1.1e-22452.65Show/hide
Query:  MHALSVGTPAGVDLVTKDRVRDGQVVIAGQTIHVDLKVVDMMDFDVILGMDWLAENFATIDCHKKEVIFTPPNGLTFKFKGTSTGTTPKIISMMKARRLI
        MHALSVGTPAGVDLVTKDRVRDGQVVIAGQTIHVDLKVVDM DFDVILGMDWLAENFATIDCHKKEVIFTPPNGLTFKFKGTSTGTTPKIISMMKARRLI
Subjt:  MHALSVGTPAGVDLVTKDRVRDGQVVIAGQTIHVDLKVVDMMDFDVILGMDWLAENFATIDCHKKEVIFTPPNGLTFKFKGTSTGTTPKIISMMKARRLI

Query:  QQGGWAFLAYAVNTKGKEKPIDTIPVVNEFMDVFREDLPGIPPSREVDFGIDLEPGTGPISKAPYRMAPAELKELKTQLQDLLDKGFIRPSVSPWGALVL
        QQGGWAFLAYAVNTKGKEKPIDTIPVVNEFMDVF EDLPGIPPSREVDFGIDLE GTGPISKAPYRMAPAELKELKTQLQDLLD                
Subjt:  QQGGWAFLAYAVNTKGKEKPIDTIPVVNEFMDVFREDLPGIPPSREVDFGIDLEPGTGPISKAPYRMAPAELKELKTQLQDLLDKGFIRPSVSPWGALVL

Query:  FVKKKDGSMRLCIDYRELNKRTVKNKYPLPRIEDLFDQLRGATVFSKIDLRSGYHQIKIKNEDIPKTAFRTRYGHYEFVVMSFGLTNAP-----------
            KD SMRLCI YRELNKRTVKNKYPLPRIEDLFDQLRGATVFSKIDLRSGYHQIKIKNEDIPKTAFRTRYGHYEFVVMSFGLTNAP           
Subjt:  FVKKKDGSMRLCIDYRELNKRTVKNKYPLPRIEDLFDQLRGATVFSKIDLRSGYHQIKIKNEDIPKTAFRTRYGHYEFVVMSFGLTNAP-----------

Query:  -----------------------------------------------------MVSFLGHIVSKDGIFVDPNKIEAVTKWKRPTTVTEIRSFLGLAGYYR
                                                              VSFLGHIVSKDGIFVDPNKIEAVTK KRPTTVTEIRSFLGL GYYR
Subjt:  -----------------------------------------------------MVSFLGHIVSKDGIFVDPNKIEAVTKWKRPTTVTEIRSFLGLAGYYR

Query:  RFVQDFARIATPLTQLTKKGVPFVWDDTCEVSFQELKQRLVSAPVLTVPESSVGYAIYSDASKKGLGCVLMQHGKVVAW---------------------
        RFV DFARIATPLTQLTKKGVPFVWDDTCEVSFQELKQRLVSAPVLTVPESSVGYAIYSDASKKGLGCVLMQHGKVVA+                     
Subjt:  RFVQDFARIATPLTQLTKKGVPFVWDDTCEVSFQELKQRLVSAPVLTVPESSVGYAIYSDASKKGLGCVLMQHGKVVAW---------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ---------------------------------------------------------------------------------------------------N
                                                                                                            
Subjt:  ---------------------------------------------------------------------------------------------------N

Query:  KDVSRLEKTLLVA----------------------KYE--------------EGYR----GVRELTQNIKGLYGN----------------LGGRGSPDK
        KD++      LV                       K+E              +GY      V  LT++   L G                 +   G P  
Subjt:  KDVSRLEKTLLVA----------------------KYE--------------EGYR----GVRELTQNIKGLYGN----------------LGGRGSPDK

Query:  V--------------------GSLPT-----------GFQATIGMAPFEALYGKRCRSPLCWDEVGERELIGPELVHVTNEAIQKIRVRMRITQSRQKSY
        +                    GS  T            FQATIGMAPFEALYGKRCRSPLCWDEVGERELIGPELVHVTNEAIQKIRVRMR TQSRQKSY
Subjt:  V--------------------GSLPT-----------GFQATIGMAPFEALYGKRCRSPLCWDEVGERELIGPELVHVTNEAIQKIRVRMRITQSRQKSY

Query:  ADVRRRNLEFEEGDPVFLKVAPMKE
        ADVRRRNLEFEEGDPVFLK+APMK+
Subjt:  ADVRRRNLEFEEGDPVFLKVAPMKE

XP_022951914.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111454654 [Cucurbita moschata]5.3e-21982.05Show/hide
Query:  MHALSVGTPAGVDLVTKDRVRDGQVVIAGQTIHVDLKVVDMMDFDVILGMDWLAENFATIDCHKKEVIFTPPNGLTFKFKGTSTGTTPKIISMMKARRLI
        MHALSVGTPAGVDLVTK+RV+DGQVVIAGQTIH+DLKVVDM  FD ILGMDWLAENFATIDCHKKEVIFTPPN LTFKFKGTSTGTTPKIISMMKARRLI
Subjt:  MHALSVGTPAGVDLVTKDRVRDGQVVIAGQTIHVDLKVVDMMDFDVILGMDWLAENFATIDCHKKEVIFTPPNGLTFKFKGTSTGTTPKIISMMKARRLI

Query:  QQGGWAFLAYAVNTKGKEKPIDTIPVVNEFMDVFREDLPGIPPSREVDFGIDLEPGTGPISKAPYRMAPAELKELKTQLQDLLDKGFIRPSVSPWGALVL
        QQG  AFLA AVNTKGKEKPIDT+PVVNEFMDVF EDLPGIPPS+EVDFGIDLEP TGPISKAPYRMAPAELKELKTQLQDLLDKGFIRPSVSPWGA VL
Subjt:  QQGGWAFLAYAVNTKGKEKPIDTIPVVNEFMDVFREDLPGIPPSREVDFGIDLEPGTGPISKAPYRMAPAELKELKTQLQDLLDKGFIRPSVSPWGALVL

Query:  FVKKKDGSMRLCIDYRELNKRTVKNKYPLPRIEDLFDQLRGATVFSKIDLRSGYHQIKIKNEDIPKTAFRTRYGHYEFVVMSFGLTNAP-----------
        FVKKKDGSMRLCIDYRELNKRTVKNKYPLPRIEDLFDQLRGATVFSKIDLR GYHQIKIKNEDIPKTAFRTRYGHYEFVVMSFGLTNAP           
Subjt:  FVKKKDGSMRLCIDYRELNKRTVKNKYPLPRIEDLFDQLRGATVFSKIDLRSGYHQIKIKNEDIPKTAFRTRYGHYEFVVMSFGLTNAP-----------

Query:  -----------------------------------------------------MVSFLGHIVSKDGIFVDPNKIEAVTKWKRPTTVTEIRSFLGLAGYYR
                                                              VSFLGH+VSK GIFVDPNKIEAVTKWKRPTTVTEIRSFLGLAGYYR
Subjt:  -----------------------------------------------------MVSFLGHIVSKDGIFVDPNKIEAVTKWKRPTTVTEIRSFLGLAGYYR

Query:  RFVQDFARIATPLTQLTKKGVPFVWDDTCEVSFQELKQRLVSAPVLTVPESSVGYAIYSDASKKGLGCVLMQHGKVVAW
        RFVQDFARIATPLTQLTKKGVPFVWDDTCE SFQELKQRLVSAPVLTVPESSVGY IYSDASKKGLGCVLMQHGKVVA+
Subjt:  RFVQDFARIATPLTQLTKKGVPFVWDDTCEVSFQELKQRLVSAPVLTVPESSVGYAIYSDASKKGLGCVLMQHGKVVAW

XP_022951914.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111454654 [Cucurbita moschata]3.2e-0679.49Show/hide
Query:  IRVRMRITQSRQKSYADVRRRNLEFEEGDPVFLKVAPMK
        + + MR  QSRQK+YADVRRRNLEFEEGDPVFL VAPMK
Subjt:  IRVRMRITQSRQKSYADVRRRNLEFEEGDPVFLKVAPMK

XP_022951914.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111454654 [Cucurbita moschata]4.4e-18154.56Show/hide
Query:  HALSVGTPAGVDLVTKDRVRDGQVVIAGQTIHVDLKVVDMMDFDVILGMDWLAENFATIDCHKKEVIFTPPNGLTFKFKGTSTGTTPKIISMMKARRLIQ
        H LSV TP+G  +++K++V+  Q+ IAG  I V L V+DM+DFDVILGMDWLA N A+IDC +KEV F PP+  +FKFKG  + + P++IS ++A + + 
Subjt:  HALSVGTPAGVDLVTKDRVRDGQVVIAGQTIHVDLKVVDMMDFDVILGMDWLAENFATIDCHKKEVIFTPPNGLTFKFKGTSTGTTPKIISMMKARRLIQ

Query:  QGGWAFLAYAVNTKGKEKPIDTIPVVNEFMDVFREDLPGIPPSREVDFGIDLEPGTGPISKAPYRMAPAELKELKTQLQDLLDKGFIRPSVSPWGALVLF
        QG W  LA  V+T+  +  + + PVV ++ DVF E+LPG+PP REV+F I+LEPGT PIS+APYRMAPAELKELK QLQ+LLDKGFIRPSVSPWGA VLF
Subjt:  QGGWAFLAYAVNTKGKEKPIDTIPVVNEFMDVFREDLPGIPPSREVDFGIDLEPGTGPISKAPYRMAPAELKELKTQLQDLLDKGFIRPSVSPWGALVLF

Query:  VKKKDGSMRLCIDYRELNKRTVKNKYPLPRIEDLFDQLRGATVFSKIDLRSGYHQIKIKNEDIPKTAFRTRYGHYEFVVMSFGLTNAP------------
        VKKKDGSMRLCIDY+ELNK TVKN+YPLPRI+DLFDQL+GA VFSKI LRSGYHQ++IK+ D+PKTAFR+RYGHYEF+VMSFGLTNAP            
Subjt:  VKKKDGSMRLCIDYRELNKRTVKNKYPLPRIEDLFDQLRGATVFSKIDLRSGYHQIKIKNEDIPKTAFRTRYGHYEFVVMSFGLTNAP------------

Query:  ----------------------------------------------------MVSFLGHIVSKDGIFVDPNKIEAVTKWKRPTTVTEIRSFLGLAGYYRR
                                                             VSF+ H+VSK G+ VDP KIEAVT W RP+TV+E+RSFLGLAGYYRR
Subjt:  ----------------------------------------------------MVSFLGHIVSKDGIFVDPNKIEAVTKWKRPTTVTEIRSFLGLAGYYRR

Query:  FVQDFARIATPLTQLTKKGVPFVWDDTCEVSFQELKQRLVSAPVLTVPE----SSVGYAIYSDASKKGLGCVLMQHGKVVAWNKDVSRLEKTLLVAKYE-
        FV++F+RIATPLTQLT+KG PFVW  TCE SFQ LKQ+LV+A VLTVP+    +  G+ +      +         GK        ++L  + +V  +  
Subjt:  FVQDFARIATPLTQLTKKGVPFVWDDTCEVSFQELKQRLVSAPVLTVPE----SSVGYAIYSDASKKGLGCVLMQHGKVVAWNKDVSRLEKTLLVAKYE-

Query:  ----EGYRGVRELTQNIKGLYGNLGGRGSPDKVGSLPTGFQATIGMAPFEALYGKRCRSPLCWDEVGERELIGPELVHVTNEAIQKIRVRMRITQSRQKS
               R  R  ++  KGL   +G            T + ATIGMAPFEALYGK CRSP+CW EVGE+ L+GPELV  TN+AIQKIR RM   QSRQ S
Subjt:  ----EGYRGVRELTQNIKGLYGNLGGRGSPDKVGSLPTGFQATIGMAPFEALYGKRCRSPLCWDEVGERELIGPELVHVTNEAIQKIRVRMRITQSRQKS

Query:  YADVRRRNLEFEEGDPVFLKVAPMK
        YADVRR++LEFE GD VFLKVAPM+
Subjt:  YADVRRRNLEFEEGDPVFLKVAPMK

XP_023520277.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111783585 [Cucurbita pepo subsp. pepo]9.5e-17664.99Show/hide
Query:  ALSVGTPAGVDLVTKDRVRDGQVVIAGQTIHVDLKVVDMMDFDVILGMDWLAENFATIDCHKKEVIFTPPNGLTFKFKGTSTGTTPKIISMMKARRLIQQ
        A   GTPAGV+++   RV+D  V+++G  I VDL V+DM  +DVILGMDWLA+N A+IDCHKKEV+FTPP+   FKF GTS GT PK+IS+MKA++L+Q 
Subjt:  ALSVGTPAGVDLVTKDRVRDGQVVIAGQTIHVDLKVVDMMDFDVILGMDWLAENFATIDCHKKEVIFTPPNGLTFKFKGTSTGTTPKIISMMKARRLIQQ

Query:  GGWAFLAYAVNTKGKEKPIDTIPVVNEFMDVFREDLPGIPPSREVDFGIDLEPGTGPISKAPYRMAPAELKELKTQLQDLLDKGFIRPSVSPWGALVLFV
        G WA LA  V+T+ +E   DT+PVV+EF DVF ED PGIPP+R+VDF I LEPGTGPISKAPYRMA AELKELKTQLQ+LLDKGFIRPSVSPWGA VLFV
Subjt:  GGWAFLAYAVNTKGKEKPIDTIPVVNEFMDVFREDLPGIPPSREVDFGIDLEPGTGPISKAPYRMAPAELKELKTQLQDLLDKGFIRPSVSPWGALVLFV

Query:  KKKDGSMRLCIDYRELNKRTVKNKYPLPRIEDLFDQLRGATVFSKIDLRSGYHQIKIKNEDIPKTAFRTRYGHYEFVVMSFGLTNAPMV-----------
        KK DGSMRLCIDYRELNKRTVKNKYPLPRIEDLFDQL+ ATVFSKIDLRSGYHQIKIKNEDIPKTAFRTRYGHYEFVVMSFGLTNAP+V           
Subjt:  KKKDGSMRLCIDYRELNKRTVKNKYPLPRIEDLFDQLRGATVFSKIDLRSGYHQIKIKNEDIPKTAFRTRYGHYEFVVMSFGLTNAPMV-----------

Query:  -----------------------------------------------------SFLGHIVSKDGIFVDPNKIEAVTKWKRPTTVTEIRSFLGLAGYYRRF
                                                             +FLGH++S+ G+ VDP K+EAVTKW RP TVTE+RSFLGLAGYYRRF
Subjt:  -----------------------------------------------------SFLGHIVSKDGIFVDPNKIEAVTKWKRPTTVTEIRSFLGLAGYYRRF

Query:  VQDFARIATPLTQLTKKGVPFVWDDTCEVSFQELKQRLVSAPVLTVPESSVGYAIYSDASKKGLGCVLMQHGKVVAW
        ++DF++IA PLTQLT+K VPFVW + CE SFQELK +LV+APVLT+P+ +  Y IYSDASKKGLGCVLMQ GKV+A+
Subjt:  VQDFARIATPLTQLTKKGVPFVWDDTCEVSFQELKQRLVSAPVLTVPESSVGYAIYSDASKKGLGCVLMQHGKVVAW

TrEMBL top hitse value%identityAlignment
A0A5A7SUW2 Reverse transcriptase1.9e-17744.01Show/hide
Query:  HALSVGTPAGVDLVTKDRVRDGQVVIAGQTIHVDLKVVDMMDFDVILGMDWLAENFATIDCHKKEVIFTPPNGLTFKFKGTSTGTTPKIISMMKARRLIQ
        H LSV TP+G  +++K++V+  Q+ IAG  I V L V+DM+DFDVILGMDWLA N A IDC +KEV F PP+  +FKFKG  +   P++IS ++A +L+ 
Subjt:  HALSVGTPAGVDLVTKDRVRDGQVVIAGQTIHVDLKVVDMMDFDVILGMDWLAENFATIDCHKKEVIFTPPNGLTFKFKGTSTGTTPKIISMMKARRLIQ

Query:  QGGWAFLAYAVNTKGKEKPIDTIPVVNEFMDVFREDLPGIPPSREVDFGIDLEPGTGPISKAPYRMAPAELKELKTQLQDLLDKGFIRPSVSPWGALVLF
        QG W  LA  V+T+  +  + + PVV +++DVF ++LPG+PP REV+F I+LEPGT PIS+APYRMAP ELKELK QLQ+LLDKGFIR SVSPWGA VLF
Subjt:  QGGWAFLAYAVNTKGKEKPIDTIPVVNEFMDVFREDLPGIPPSREVDFGIDLEPGTGPISKAPYRMAPAELKELKTQLQDLLDKGFIRPSVSPWGALVLF

Query:  VKKKDGSMRLCIDYRELNKRTVKNKYPLPRIEDLFDQLRGATVFSKIDLRSGYHQIKIKNEDIPKTAFRTRYGHYEFVVMSFGLTNAP------------
        VKKKDG MRLCIDYRELNK TVKN+YPLPRI+DLFDQL+GATVFSKIDLRSGYHQ++IK+ D+PKTAFR+RYGHYEF+VMSFGLTNAP            
Subjt:  VKKKDGSMRLCIDYRELNKRTVKNKYPLPRIEDLFDQLRGATVFSKIDLRSGYHQIKIKNEDIPKTAFRTRYGHYEFVVMSFGLTNAP------------

Query:  ----------------------------------------------------MVSFLGHIVSKDGIFVDPNKIEAVTKWKRPTTVTEIRSFLGLAGYYRR
                                                             VSFLGH+VSK G+ VDP KIEAVT W RP+TV+E+RSFLGLAGYYRR
Subjt:  ----------------------------------------------------MVSFLGHIVSKDGIFVDPNKIEAVTKWKRPTTVTEIRSFLGLAGYYRR

Query:  FVQDFARIATPLTQLTKKGVPFVWDDTCEVSFQELKQRLVSAPVLTVPESSVGYAIYSDASKKGLGCVLMQHGKVVAW----------------------
        FV++F+RIATPLTQLT+KG PFVW   CE SFQ LKQ+LV+APVLTVP+ S  + IYSDASKKGLGCVLMQ GKVVA+                      
Subjt:  FVQDFARIATPLTQLTKKGVPFVWDDTCEVSFQELKQRLVSAPVLTVPESSVGYAIYSDASKKGLGCVLMQHGKVVAW----------------------

Query:  ------------------------------NKDVS------------------------------------------RLEKTLLVAKYEEGY----RGVR
                                       K+++                                           L + ++ A+  + Y    RG+ 
Subjt:  ------------------------------NKDVS------------------------------------------RLEKTLLVAKYEEGY----RGVR

Query:  ELTQNIKGLYGNLGG-----------------------RGSP---------------------------------DKVGSLP------------------
        E  Q ++    + GG                         SP                                 D +  LP                  
Subjt:  ELTQNIKGLYGNLGG-----------------------RGSP---------------------------------DKVGSLP------------------

Query:  -------------------------------------------------------TGFQATIGMAPFEALYGKRCRSPLCWDEVGERELIGPELVHVTNE
                                                               T +QATIGMAPFEALYGK CRSP+CW EVGE+ L+GPELV  TNE
Subjt:  -------------------------------------------------------TGFQATIGMAPFEALYGKRCRSPLCWDEVGERELIGPELVHVTNE

Query:  AIQKIRVRMRITQSRQKSYADVRRRNLEFEEGDPVFLKVAPMK
        AI+KIR RM   QSRQKSYADVRR++LEFE GD VFLKVAPM+
Subjt:  AIQKIRVRMRITQSRQKSYADVRRRNLEFEEGDPVFLKVAPMK

A0A5A7UP12 Ty3-gypsy retrotransposon protein2.5e-17452.57Show/hide
Query:  HALSVGTPAGVDLVTKDRVRDGQVVIAGQTIHVDLKVVDMMDFDVILGMDWLAENFATIDCHKKEVIFTPPNGLTFKFKGTSTGTTPKIISMMKARRLIQ
        H LSV TP+G  +++K++V+  Q+ I G  I V L V+DM++FDVILG DWLA N A+ID   K+V F PP+  +FKFKG  + + P++IS M+A +L+ 
Subjt:  HALSVGTPAGVDLVTKDRVRDGQVVIAGQTIHVDLKVVDMMDFDVILGMDWLAENFATIDCHKKEVIFTPPNGLTFKFKGTSTGTTPKIISMMKARRLIQ

Query:  QGGWAFLAYAVNTKGKEKPIDTIPVVNEFMDVFREDLPGIPPSREVDFGIDLEPGTGPISKAPYRMAPAELKELKTQLQDLLDKGFIRPSVSPWGALVLF
        QG W+ LA  V+T+  + P+ + PVV ++ DVF E+LPG+PP RE +F I+LEPGT PIS+ PY MAPAELKELK QLQ+LLDKGFIRPSVSPWGA VLF
Subjt:  QGGWAFLAYAVNTKGKEKPIDTIPVVNEFMDVFREDLPGIPPSREVDFGIDLEPGTGPISKAPYRMAPAELKELKTQLQDLLDKGFIRPSVSPWGALVLF

Query:  VKKKDGSMRLCIDYRELNKRTVKNKYPLPRIEDLFDQLRGATVFSKIDLRSGYHQIKIKNEDIPKTAFRTRYGHYEFVVMSFGLTN--------------
        VKK DGSMRLCIDYRELNK TVKN+YPLPRI+DLFDQL+GATVF KIDLRSGYHQ++IK+ D+PK AF +RYGHYEF+VMSF   +              
Subjt:  VKKKDGSMRLCIDYRELNKRTVKNKYPLPRIEDLFDQLRGATVFSKIDLRSGYHQIKIKNEDIPKTAFRTRYGHYEFVVMSFGLTN--------------

Query:  ------------APMVSFLGHIVSKDGIFVDPNKIEAVTKWKRPTTVTEIRSFLGLAGYYRRFVQDFARIATPLTQLTKKGVPFVWDDTCEVSFQELKQR
                       VSFLGH+VSK G++VDP KIE V  W RP+T +E+ SFLGLAGYYRRFV++F+RIATPLTQLT+KG PFVW   CE SFQ LKQ+
Subjt:  ------------APMVSFLGHIVSKDGIFVDPNKIEAVTKWKRPTTVTEIRSFLGLAGYYRRFVQDFARIATPLTQLTKKGVPFVWDDTCEVSFQELKQR

Query:  LVSAPVLTVPESSVGYAIYSDASKKGLGCVLMQHGKVVAW----------------------------------------------------NKDVS-RL
        LV+A VLT+P+ S  + IYSDASKKGLGCVLMQ GKVVA+                                                     K+++ R 
Subjt:  LVSAPVLTVPESSVGYAIYSDASKKGLGCVLMQHGKVVAW----------------------------------------------------NKDVS-RL

Query:  EKTLLVAK-------YEEGYRGVRELTQNIKGLYGNLGGRGSPDKVGSLPTGFQ-ATIGMAP--FEALYGKRCRSPLCWDEVGERELIGPELVHVTNEAI
         + L + K       Y  G   V     + KGL   +G R        + T F   T G      + L      SP+CW EVGE+ L+GPELV  TNEAI
Subjt:  EKTLLVAK-------YEEGYRGVRELTQNIKGLYGNLGGRGSPDKVGSLPTGFQ-ATIGMAP--FEALYGKRCRSPLCWDEVGERELIGPELVHVTNEAI

Query:  QKIRVRMRITQSRQKSYADVRRRNLEFEEGDPVFLKVAPMK
        QKIR RM   QSRQKSY DVRR++LEF+ GD VFLKVAPMK
Subjt:  QKIRVRMRITQSRQKSYADVRRRNLEFEEGDPVFLKVAPMK

A0A6J1EYH9 Reverse transcriptase5.4e-22552.65Show/hide
Query:  MHALSVGTPAGVDLVTKDRVRDGQVVIAGQTIHVDLKVVDMMDFDVILGMDWLAENFATIDCHKKEVIFTPPNGLTFKFKGTSTGTTPKIISMMKARRLI
        MHALSVGTPAGVDLVTKDRVRDGQVVIAGQTIHVDLKVVDM DFDVILGMDWLAENFATIDCHKKEVIFTPPNGLTFKFKGTSTGTTPKIISMMKARRLI
Subjt:  MHALSVGTPAGVDLVTKDRVRDGQVVIAGQTIHVDLKVVDMMDFDVILGMDWLAENFATIDCHKKEVIFTPPNGLTFKFKGTSTGTTPKIISMMKARRLI

Query:  QQGGWAFLAYAVNTKGKEKPIDTIPVVNEFMDVFREDLPGIPPSREVDFGIDLEPGTGPISKAPYRMAPAELKELKTQLQDLLDKGFIRPSVSPWGALVL
        QQGGWAFLAYAVNTKGKEKPIDTIPVVNEFMDVF EDLPGIPPSREVDFGIDLE GTGPISKAPYRMAPAELKELKTQLQDLLD                
Subjt:  QQGGWAFLAYAVNTKGKEKPIDTIPVVNEFMDVFREDLPGIPPSREVDFGIDLEPGTGPISKAPYRMAPAELKELKTQLQDLLDKGFIRPSVSPWGALVL

Query:  FVKKKDGSMRLCIDYRELNKRTVKNKYPLPRIEDLFDQLRGATVFSKIDLRSGYHQIKIKNEDIPKTAFRTRYGHYEFVVMSFGLTNAP-----------
            KD SMRLCI YRELNKRTVKNKYPLPRIEDLFDQLRGATVFSKIDLRSGYHQIKIKNEDIPKTAFRTRYGHYEFVVMSFGLTNAP           
Subjt:  FVKKKDGSMRLCIDYRELNKRTVKNKYPLPRIEDLFDQLRGATVFSKIDLRSGYHQIKIKNEDIPKTAFRTRYGHYEFVVMSFGLTNAP-----------

Query:  -----------------------------------------------------MVSFLGHIVSKDGIFVDPNKIEAVTKWKRPTTVTEIRSFLGLAGYYR
                                                              VSFLGHIVSKDGIFVDPNKIEAVTK KRPTTVTEIRSFLGL GYYR
Subjt:  -----------------------------------------------------MVSFLGHIVSKDGIFVDPNKIEAVTKWKRPTTVTEIRSFLGLAGYYR

Query:  RFVQDFARIATPLTQLTKKGVPFVWDDTCEVSFQELKQRLVSAPVLTVPESSVGYAIYSDASKKGLGCVLMQHGKVVAW---------------------
        RFV DFARIATPLTQLTKKGVPFVWDDTCEVSFQELKQRLVSAPVLTVPESSVGYAIYSDASKKGLGCVLMQHGKVVA+                     
Subjt:  RFVQDFARIATPLTQLTKKGVPFVWDDTCEVSFQELKQRLVSAPVLTVPESSVGYAIYSDASKKGLGCVLMQHGKVVAW---------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ---------------------------------------------------------------------------------------------------N
                                                                                                            
Subjt:  ---------------------------------------------------------------------------------------------------N

Query:  KDVSRLEKTLLVA----------------------KYE--------------EGYR----GVRELTQNIKGLYGN----------------LGGRGSPDK
        KD++      LV                       K+E              +GY      V  LT++   L G                 +   G P  
Subjt:  KDVSRLEKTLLVA----------------------KYE--------------EGYR----GVRELTQNIKGLYGN----------------LGGRGSPDK

Query:  V--------------------GSLPT-----------GFQATIGMAPFEALYGKRCRSPLCWDEVGERELIGPELVHVTNEAIQKIRVRMRITQSRQKSY
        +                    GS  T            FQATIGMAPFEALYGKRCRSPLCWDEVGERELIGPELVHVTNEAIQKIRVRMR TQSRQKSY
Subjt:  V--------------------GSLPT-----------GFQATIGMAPFEALYGKRCRSPLCWDEVGERELIGPELVHVTNEAIQKIRVRMRITQSRQKSY

Query:  ADVRRRNLEFEEGDPVFLKVAPMKE
        ADVRRRNLEFEEGDPVFLK+APMK+
Subjt:  ADVRRRNLEFEEGDPVFLKVAPMKE

A0A6J1GK52 Reverse transcriptase2.6e-21982.05Show/hide
Query:  MHALSVGTPAGVDLVTKDRVRDGQVVIAGQTIHVDLKVVDMMDFDVILGMDWLAENFATIDCHKKEVIFTPPNGLTFKFKGTSTGTTPKIISMMKARRLI
        MHALSVGTPAGVDLVTK+RV+DGQVVIAGQTIH+DLKVVDM  FD ILGMDWLAENFATIDCHKKEVIFTPPN LTFKFKGTSTGTTPKIISMMKARRLI
Subjt:  MHALSVGTPAGVDLVTKDRVRDGQVVIAGQTIHVDLKVVDMMDFDVILGMDWLAENFATIDCHKKEVIFTPPNGLTFKFKGTSTGTTPKIISMMKARRLI

Query:  QQGGWAFLAYAVNTKGKEKPIDTIPVVNEFMDVFREDLPGIPPSREVDFGIDLEPGTGPISKAPYRMAPAELKELKTQLQDLLDKGFIRPSVSPWGALVL
        QQG  AFLA AVNTKGKEKPIDT+PVVNEFMDVF EDLPGIPPS+EVDFGIDLEP TGPISKAPYRMAPAELKELKTQLQDLLDKGFIRPSVSPWGA VL
Subjt:  QQGGWAFLAYAVNTKGKEKPIDTIPVVNEFMDVFREDLPGIPPSREVDFGIDLEPGTGPISKAPYRMAPAELKELKTQLQDLLDKGFIRPSVSPWGALVL

Query:  FVKKKDGSMRLCIDYRELNKRTVKNKYPLPRIEDLFDQLRGATVFSKIDLRSGYHQIKIKNEDIPKTAFRTRYGHYEFVVMSFGLTNAP-----------
        FVKKKDGSMRLCIDYRELNKRTVKNKYPLPRIEDLFDQLRGATVFSKIDLR GYHQIKIKNEDIPKTAFRTRYGHYEFVVMSFGLTNAP           
Subjt:  FVKKKDGSMRLCIDYRELNKRTVKNKYPLPRIEDLFDQLRGATVFSKIDLRSGYHQIKIKNEDIPKTAFRTRYGHYEFVVMSFGLTNAP-----------

Query:  -----------------------------------------------------MVSFLGHIVSKDGIFVDPNKIEAVTKWKRPTTVTEIRSFLGLAGYYR
                                                              VSFLGH+VSK GIFVDPNKIEAVTKWKRPTTVTEIRSFLGLAGYYR
Subjt:  -----------------------------------------------------MVSFLGHIVSKDGIFVDPNKIEAVTKWKRPTTVTEIRSFLGLAGYYR

Query:  RFVQDFARIATPLTQLTKKGVPFVWDDTCEVSFQELKQRLVSAPVLTVPESSVGYAIYSDASKKGLGCVLMQHGKVVAW
        RFVQDFARIATPLTQLTKKGVPFVWDDTCE SFQELKQRLVSAPVLTVPESSVGY IYSDASKKGLGCVLMQHGKVVA+
Subjt:  RFVQDFARIATPLTQLTKKGVPFVWDDTCEVSFQELKQRLVSAPVLTVPESSVGYAIYSDASKKGLGCVLMQHGKVVAW

A0A6J1GK52 Reverse transcriptase1.6e-0679.49Show/hide
Query:  IRVRMRITQSRQKSYADVRRRNLEFEEGDPVFLKVAPMK
        + + MR  QSRQK+YADVRRRNLEFEEGDPVFL VAPMK
Subjt:  IRVRMRITQSRQKSYADVRRRNLEFEEGDPVFLKVAPMK

A0A6J1GK52 Reverse transcriptase2.1e-18154.56Show/hide
Query:  HALSVGTPAGVDLVTKDRVRDGQVVIAGQTIHVDLKVVDMMDFDVILGMDWLAENFATIDCHKKEVIFTPPNGLTFKFKGTSTGTTPKIISMMKARRLIQ
        H LSV TP+G  +++K++V+  Q+ IAG  I V L V+DM+DFDVILGMDWLA N A+IDC +KEV F PP+  +FKFKG  + + P++IS ++A + + 
Subjt:  HALSVGTPAGVDLVTKDRVRDGQVVIAGQTIHVDLKVVDMMDFDVILGMDWLAENFATIDCHKKEVIFTPPNGLTFKFKGTSTGTTPKIISMMKARRLIQ

Query:  QGGWAFLAYAVNTKGKEKPIDTIPVVNEFMDVFREDLPGIPPSREVDFGIDLEPGTGPISKAPYRMAPAELKELKTQLQDLLDKGFIRPSVSPWGALVLF
        QG W  LA  V+T+  +  + + PVV ++ DVF E+LPG+PP REV+F I+LEPGT PIS+APYRMAPAELKELK QLQ+LLDKGFIRPSVSPWGA VLF
Subjt:  QGGWAFLAYAVNTKGKEKPIDTIPVVNEFMDVFREDLPGIPPSREVDFGIDLEPGTGPISKAPYRMAPAELKELKTQLQDLLDKGFIRPSVSPWGALVLF

Query:  VKKKDGSMRLCIDYRELNKRTVKNKYPLPRIEDLFDQLRGATVFSKIDLRSGYHQIKIKNEDIPKTAFRTRYGHYEFVVMSFGLTNAP------------
        VKKKDGSMRLCIDY+ELNK TVKN+YPLPRI+DLFDQL+GA VFSKI LRSGYHQ++IK+ D+PKTAFR+RYGHYEF+VMSFGLTNAP            
Subjt:  VKKKDGSMRLCIDYRELNKRTVKNKYPLPRIEDLFDQLRGATVFSKIDLRSGYHQIKIKNEDIPKTAFRTRYGHYEFVVMSFGLTNAP------------

Query:  ----------------------------------------------------MVSFLGHIVSKDGIFVDPNKIEAVTKWKRPTTVTEIRSFLGLAGYYRR
                                                             VSF+ H+VSK G+ VDP KIEAVT W RP+TV+E+RSFLGLAGYYRR
Subjt:  ----------------------------------------------------MVSFLGHIVSKDGIFVDPNKIEAVTKWKRPTTVTEIRSFLGLAGYYRR

Query:  FVQDFARIATPLTQLTKKGVPFVWDDTCEVSFQELKQRLVSAPVLTVPE----SSVGYAIYSDASKKGLGCVLMQHGKVVAWNKDVSRLEKTLLVAKYE-
        FV++F+RIATPLTQLT+KG PFVW  TCE SFQ LKQ+LV+A VLTVP+    +  G+ +      +         GK        ++L  + +V  +  
Subjt:  FVQDFARIATPLTQLTKKGVPFVWDDTCEVSFQELKQRLVSAPVLTVPE----SSVGYAIYSDASKKGLGCVLMQHGKVVAWNKDVSRLEKTLLVAKYE-

Query:  ----EGYRGVRELTQNIKGLYGNLGGRGSPDKVGSLPTGFQATIGMAPFEALYGKRCRSPLCWDEVGERELIGPELVHVTNEAIQKIRVRMRITQSRQKS
               R  R  ++  KGL   +G            T + ATIGMAPFEALYGK CRSP+CW EVGE+ L+GPELV  TN+AIQKIR RM   QSRQ S
Subjt:  ----EGYRGVRELTQNIKGLYGNLGGRGSPDKVGSLPTGFQATIGMAPFEALYGKRCRSPLCWDEVGERELIGPELVHVTNEAIQKIRVRMRITQSRQKS

Query:  YADVRRRNLEFEEGDPVFLKVAPMK
        YADVRR++LEFE GD VFLKVAPM+
Subjt:  YADVRRRNLEFEEGDPVFLKVAPMK

SwissProt top hitse value%identityAlignment
P04323 Retrovirus-related Pol polyprotein from transposon 17.63.3e-4628.19Show/hide
Query:  DFDVILGMDWLAENFATIDCHKKEVIFTPPNGLTFKFKGTSTGTTPKIISMMKARRLIQQGGWAFL----AYAVNTKGKEKPIDTIPVVNEFMDVFREDL
        ++D++LG   LAE  ATI    +EV          +   T   +  + ++M+    L Q    + +     Y +     E+      ++ ++ D+   + 
Subjt:  DFDVILGMDWLAENFATIDCHKKEVIFTPPNGLTFKFKGTSTGTTPKIISMMKARRLIQQGGWAFL----AYAVNTKGKEKPIDTIPVVNEFMDVFREDL

Query:  PGIPPSREVDFGIDLEPGTGPISKAPYRMAPAELKELKTQLQDLLDKGFIRPSVSPWGALVLFV-KKKDGS----MRLCIDYRELNKRTVKNKYPLPRIE
          +  + +    I+ +      SK  Y  A  +  E+++Q+QD+L++G IR S SP+ + +  V KK+D S     R+ IDYR+LN+ TV +++P+P ++
Subjt:  PGIPPSREVDFGIDLEPGTGPISKAPYRMAPAELKELKTQLQDLLDKGFIRPSVSPWGALVLFV-KKKDGS----MRLCIDYRELNKRTVKNKYPLPRIE

Query:  DLFDQLRGATVFSKIDLRSGYHQIKIKNEDIPKTAFRTRYGHYEFVVMSFGLTNAPM-------------------------------------------
        ++  +L     F+ IDL  G+HQI++  E + KTAF T++GHYE++ M FGL NAP                                            
Subjt:  DLFDQLRGATVFSKIDLRSGYHQIKIKNEDIPKTAFRTRYGHYEFVVMSFGLTNAPM-------------------------------------------

Query:  ---------------------VSFLGHIVSKDGIFVDPNKIEAVTKWKRPTTVTEIRSFLGLAGYYRRFVQDFARIATPLTQLTKKGVPF-VWDDTCEVS
                              +FLGH+++ DGI  +P KIEA+ K+  PT   EI++FLGL GYYR+F+ +FA IA P+T+  KK +     +   + +
Subjt:  ---------------------VSFLGHIVSKDGIFVDPNKIEAVTKWKRPTTVTEIRSFLGLAGYYRRFVQDFARIATPLTQLTKKGVPF-VWDDTCEVS

Query:  FQELKQRLVSAPVLTVPESSVGYAIYSDASKKGLGCVLMQHGKVVAW
        F++LK  +   P+L VP+ +  + + +DAS   LG VL Q G  +++
Subjt:  FQELKQRLVSAPVLTVPESSVGYAIYSDASKKGLGCVLMQHGKVVAW

P20825 Retrovirus-related Pol polyprotein from transposon 2979.0e-4431.43Show/hide
Query:  PISKAPYRMAPAELKELKTQLQDLLDKGFIRPSVSPWGALVLFVKKKD-----GSMRLCIDYRELNKRTVKNKYPLPRIEDLFDQLRGATVFSKIDLRSG
        PI    Y +A     E++ Q+Q++L++G IR S SP+ +    V KK         R+ IDYR+LN+ T+ ++YP+P ++++  +L     F+ IDL  G
Subjt:  PISKAPYRMAPAELKELKTQLQDLLDKGFIRPSVSPWGALVLFVKKKD-----GSMRLCIDYRELNKRTVKNKYPLPRIEDLFDQLRGATVFSKIDLRSG

Query:  YHQIKIKNEDIPKTAFRTRYGHYEFVVMSFGLTNAPM---------------------------------------------------------------
        +HQI++  E I KTAF T+ GHYE++ M FGL NAP                                                                
Subjt:  YHQIKIKNEDIPKTAFRTRYGHYEFVVMSFGLTNAPM---------------------------------------------------------------

Query:  -VSFLGHIVSKDGIFVDPNKIEAVTKWKRPTTVTEIRSFLGLAGYYRRFVQDFARIATPLTQLTKKGVPFVWDDTCEV----SFQELKQRLVSAPVLTVP
          +FLGHIV+ DGI  +P K++A+  +  PT   EIR+FLGL GYYR+F+ ++A IA P+T   KK       DT ++    +F++LK  ++  P+L +P
Subjt:  -VSFLGHIVSKDGIFVDPNKIEAVTKWKRPTTVTEIRSFLGLAGYYRRFVQDFARIATPLTQLTKKGVPFVWDDTCEV----SFQELKQRLVSAPVLTVP

Query:  ESSVGYAIYSDASKKGLGCVLMQHGKVVAW--------NKDVSRLEKTLL
        +    + + +DAS   LG VL Q+G  +++          + S +EK LL
Subjt:  ESSVGYAIYSDASKKGLGCVLMQHGKVVAW--------NKDVSRLEKTLL

Q7LHG5 Transposon Ty3-I Gag-Pol polyprotein3.4e-4331.34Show/hide
Query:  TIPV--VNEFMDVFREDLPGIPP---SREVDFGIDLEPGTGPISKAPYRMAPAELKELKTQLQDLLDKGFIRPSVSPWGALVLFVKKKDGSMRLCIDYRE
        T+PV    ++ ++ R DLP  P    +  V   I+++PG       PY +     +E+   +Q LLD  FI PS SP  + V+ V KKDG+ RLC+DYR 
Subjt:  TIPV--VNEFMDVFREDLPGIPP---SREVDFGIDLEPGTGPISKAPYRMAPAELKELKTQLQDLLDKGFIRPSVSPWGALVLFVKKKDGSMRLCIDYRE

Query:  LNKRTVKNKYPLPRIEDLFDQLRGATVFSKIDLRSGYHQIKIKNEDIPKTAFRTRYGHYEFVVMSFGLTNAPM---------------------------
        LNK T+ + +PLPRI++L  ++  A +F+ +DL SGYHQI ++ +D  KTAF T  G YE+ VM FGL NAP                            
Subjt:  LNKRTVKNKYPLPRIEDLFDQLRGATVFSKIDLRSGYHQIKIKNEDIPKTAFRTRYGHYEFVVMSFGLTNAPM---------------------------

Query:  -----------------------------------VSFLGHIVSKDGIFVDPNKIEAVTKWKRPTTVTEIRSFLGLAGYYRRFVQDFARIATPLTQLTKK
                                             FLG+ +    I    +K  A+  +  P TV + + FLG+  YYRRF+ + ++IA P+      
Subjt:  -----------------------------------VSFLGHIVSKDGIFVDPNKIEAVTKWKRPTTVTEIRSFLGLAGYYRRFVQDFARIATPLTQLTKK

Query:  GVPFVWDDTCEVSFQELKQRLVSAPVLTVPESSVGYAIYSDASKKGLGCVL
             W +  + + ++LK  L ++PVL    +   Y + +DASK G+G VL
Subjt:  GVPFVWDDTCEVSFQELKQRLVSAPVLTVPESSVGYAIYSDASKKGLGCVL

Q8I7P9 Retrovirus-related Pol polyprotein from transposon opus1.6e-4528.63Show/hide
Query:  DMMDFDVILGMDWLAENFATIDCHKKEVIFTPPNGLTFKFKGTSTGTTPKIISMMKARRLIQQGGWAFLAYAVNTKGKEKPIDTIPVVNEFMDVFREDLP
        ++  FD I+G D L +  A +D     +I TP  G+             KI  + +A   +       L  A +  G ++ ++++  + EF  +F   L 
Subjt:  DMMDFDVILGMDWLAENFATIDCHKKEVIFTPPNGLTFKFKGTSTGTTPKIISMMKARRLIQQGGWAFLAYAVNTKGKEKPIDTIPVVNEFMDVFREDLP

Query:  GIPPSREVDFGIDLEPGT-GPISKAPYRMAPAELKELKTQLQDLLDKGFIRPSVSPWGALVLFVKKK-----DGSMRLCIDYRELNKRTVKNKYPLPRIE
        G+  S E     ++   T  PI    Y        E++ Q+ +LL  G IRPS SP+ + +  V KK     +   R+ +D++ LN  T+ + YP+P I 
Subjt:  GIPPSREVDFGIDLEPGT-GPISKAPYRMAPAELKELKTQLQDLLDKGFIRPSVSPWGALVLFVKKK-----DGSMRLCIDYRELNKRTVKNKYPLPRIE

Query:  DLFDQLRGATVFSKIDLRSGYHQIKIKNEDIPKTAFRTRYGHYEFVVMSFGLTNAP--------------------------------------------
             L  A  F+ +DL SG+HQI +K  DIPKTAF T  G YEF+ + FGL NAP                                            
Subjt:  DLFDQLRGATVFSKIDLRSGYHQIKIKNEDIPKTAFRTRYGHYEFVVMSFGLTNAP--------------------------------------------

Query:  --------------------MVSFLGHIVSKDGIFVDPNKIEAVTKWKRPTTVTEIRSFLGLAGYYRRFVQDFARIATPLTQLTK-----------KGVP
                             V FLG+IV+ DGI  DP K+ A+++   PT+V E++ FLG+  YYR+F+QD+A++A PLT LT+             VP
Subjt:  --------------------MVSFLGHIVSKDGIFVDPNKIEAVTKWKRPTTVTEIRSFLGLAGYYRRFVQDFARIATPLTQLTK-----------KGVP

Query:  FVWDDTCEVSFQELKQRLVSAPVLTVPESSVGYAIYSDASKKGLGCVLMQHGKVVAWNKDVSRLEKTLLVAKYEEGYRGVRE
           D+T   SF +LK  L S+ +L  P  +  + + +DAS   +G VL Q  +    ++ ++ + ++L   K EE Y  + +
Subjt:  FVWDDTCEVSFQELKQRLVSAPVLTVPESSVGYAIYSDASKKGLGCVLMQHGKVVAWNKDVSRLEKTLLVAKYEEGYRGVRE

Q99315 Transposon Ty3-G Gag-Pol polyprotein4.5e-4331.34Show/hide
Query:  TIPV--VNEFMDVFREDLPGIPP---SREVDFGIDLEPGTGPISKAPYRMAPAELKELKTQLQDLLDKGFIRPSVSPWGALVLFVKKKDGSMRLCIDYRE
        T+PV    ++ ++ R DLP  P    +  V   I+++PG       PY +     +E+   +Q LLD  FI PS SP  + V+ V KKDG+ RLC+DYR 
Subjt:  TIPV--VNEFMDVFREDLPGIPP---SREVDFGIDLEPGTGPISKAPYRMAPAELKELKTQLQDLLDKGFIRPSVSPWGALVLFVKKKDGSMRLCIDYRE

Query:  LNKRTVKNKYPLPRIEDLFDQLRGATVFSKIDLRSGYHQIKIKNEDIPKTAFRTRYGHYEFVVMSFGLTNAPM---------------------------
        LNK T+ + +PLPRI++L  ++  A +F+ +DL SGYHQI ++ +D  KTAF T  G YE+ VM FGL NAP                            
Subjt:  LNKRTVKNKYPLPRIEDLFDQLRGATVFSKIDLRSGYHQIKIKNEDIPKTAFRTRYGHYEFVVMSFGLTNAPM---------------------------

Query:  -----------------------------------VSFLGHIVSKDGIFVDPNKIEAVTKWKRPTTVTEIRSFLGLAGYYRRFVQDFARIATPLTQLTKK
                                             FLG+ +    I    +K  A+  +  P TV + + FLG+  YYRRF+ + ++IA P+      
Subjt:  -----------------------------------VSFLGHIVSKDGIFVDPNKIEAVTKWKRPTTVTEIRSFLGLAGYYRRFVQDFARIATPLTQLTKK

Query:  GVPFVWDDTCEVSFQELKQRLVSAPVLTVPESSVGYAIYSDASKKGLGCVL
             W +  + +  +LK  L ++PVL    +   Y + +DASK G+G VL
Subjt:  GVPFVWDDTCEVSFQELKQRLVSAPVLTVPESSVGYAIYSDASKKGLGCVL

Arabidopsis top hitse value%identityAlignment
ATMG00860.1 DNA/RNA polymerases superfamily protein4.2e-2043Show/hide
Query:  PMVSFLG--HIVSKDGIFVDPNKIEAVTKWKRPTTVTEIRSFLGLAGYYRRFVQDFARIATPLTQLTKKGVPFVWDDTCEVSFQELKQRLVSAPVLTVPE
        P +++LG  HI+S +G+  DP K+EA+  W  P   TE+R FLGL GYYRRFV+++ +I  PLT+L KK     W +   ++F+ LK  + + PVL +P+
Subjt:  PMVSFLG--HIVSKDGIFVDPNKIEAVTKWKRPTTVTEIRSFLGLAGYYRRFVQDFARIATPLTQLTKKGVPFVWDDTCEVSFQELKQRLVSAPVLTVPE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCATGCGTTGTCGGTCGGTACCCCGGCAGGGGTAGACCTAGTTACGAAAGATAGAGTAAGGGACGGACAAGTGGTAATAGCTGGACAAACCATCCACGTAGACTTAAA
GGTAGTGGATATGATGGATTTTGACGTCATACTAGGAATGGACTGGTTAGCAGAAAACTTTGCTACCATAGACTGCCATAAGAAGGAGGTGATATTCACACCTCCTAACG
GACTTACCTTTAAGTTTAAAGGAACCTCCACAGGTACCACCCCAAAAATAATATCGATGATGAAAGCGAGGCGCCTGATACAACAAGGGGGGTGGGCGTTTCTAGCCTAT
GCAGTAAATACAAAAGGAAAGGAAAAACCCATAGATACAATACCAGTAGTAAACGAATTTATGGATGTCTTTCGGGAGGACCTCCCGGGAATTCCTCCATCGCGAGAGGT
AGACTTTGGAATAGATCTAGAACCAGGAACGGGACCCATCTCTAAGGCCCCGTACCGCATGGCACCAGCAGAACTAAAAGAGCTTAAAACACAACTGCAAGACCTACTAG
ATAAGGGCTTCATTCGACCTAGCGTGTCCCCTTGGGGTGCGCTAGTGTTGTTTGTTAAAAAGAAAGACGGCTCAATGCGTTTATGCATTGATTATAGGGAATTAAACAAG
AGGACGGTAAAGAACAAATACCCATTACCCCGTATTGAGGACCTATTTGATCAACTACGTGGGGCGACAGTGTTTTCTAAAATAGATCTTCGATCAGGATACCATCAAAT
TAAGATTAAAAATGAAGACATACCGAAAACAGCCTTTCGAACCAGATATGGCCACTACGAGTTTGTGGTGATGTCTTTTGGTCTCACCAATGCCCCAATGGTTTCGTTTC
TGGGGCACATAGTGTCTAAAGACGGAATCTTCGTAGATCCCAATAAGATAGAGGCGGTCACGAAATGGAAACGCCCAACAACGGTCACCGAGATACGAAGTTTCTTGGGA
TTGGCGGGTTATTATCGAAGGTTTGTCCAGGACTTCGCTAGAATAGCCACGCCTCTCACCCAATTAACCAAAAAAGGTGTACCTTTTGTTTGGGACGATACTTGTGAGGT
CAGCTTTCAAGAACTAAAACAACGACTAGTATCCGCCCCAGTGCTCACTGTTCCAGAAAGTTCTGTGGGATACGCGATTTACAGTGACGCATCCAAAAAGGGATTAGGTT
GTGTACTGATGCAACATGGCAAGGTTGTTGCGTGGAACAAAGATGTATCAAGACTTGAAAAAACACTTCTGGTGGCGAAGTATGAAGAAGGATATCGCGGAGTACGTGAG
CTTACCCAAAACATTAAAGGGCTATACGGTAATCTGGGTGGTCGTGGATCGCCTGACAAAGTCGGCTCACTTCCTACTGGCTTCCAAGCAACCATCGGAATGGCACCCTT
TGAGGCGTTGTATGGAAAACGATGTAGATCCCCATTGTGTTGGGACGAAGTAGGAGAACGAGAATTGATAGGACCCGAGCTGGTCCATGTCACCAATGAAGCAATCCAGA
AAATCCGAGTAAGAATGCGTATCACGCAGAGTAGGCAAAAGAGCTACGCCGACGTTAGGCGTAGGAATTTAGAGTTTGAAGAGGGGGACCCAGTGTTCCTAAAAGTAGCC
CCCATGAAAGAGCCGAAAACTCCGGCGAACGCCGGAGTTTTCGCTCACACACGACCCACAAAGGGCGGCCGAGGGCTGGGATGCCTCAAAATCGCGTCCCCGACCTTCAG
CCACTCTGAAAGAACTTGCAAAGAAACGAAAACGATAAGGGCAACGCACCCATGTGACAACGACGACGGCATCGTACCCCCCAGGATCGTGGCACATGCATAG
mRNA sequenceShow/hide mRNA sequence
ATGCATGCGTTGTCGGTCGGTACCCCGGCAGGGGTAGACCTAGTTACGAAAGATAGAGTAAGGGACGGACAAGTGGTAATAGCTGGACAAACCATCCACGTAGACTTAAA
GGTAGTGGATATGATGGATTTTGACGTCATACTAGGAATGGACTGGTTAGCAGAAAACTTTGCTACCATAGACTGCCATAAGAAGGAGGTGATATTCACACCTCCTAACG
GACTTACCTTTAAGTTTAAAGGAACCTCCACAGGTACCACCCCAAAAATAATATCGATGATGAAAGCGAGGCGCCTGATACAACAAGGGGGGTGGGCGTTTCTAGCCTAT
GCAGTAAATACAAAAGGAAAGGAAAAACCCATAGATACAATACCAGTAGTAAACGAATTTATGGATGTCTTTCGGGAGGACCTCCCGGGAATTCCTCCATCGCGAGAGGT
AGACTTTGGAATAGATCTAGAACCAGGAACGGGACCCATCTCTAAGGCCCCGTACCGCATGGCACCAGCAGAACTAAAAGAGCTTAAAACACAACTGCAAGACCTACTAG
ATAAGGGCTTCATTCGACCTAGCGTGTCCCCTTGGGGTGCGCTAGTGTTGTTTGTTAAAAAGAAAGACGGCTCAATGCGTTTATGCATTGATTATAGGGAATTAAACAAG
AGGACGGTAAAGAACAAATACCCATTACCCCGTATTGAGGACCTATTTGATCAACTACGTGGGGCGACAGTGTTTTCTAAAATAGATCTTCGATCAGGATACCATCAAAT
TAAGATTAAAAATGAAGACATACCGAAAACAGCCTTTCGAACCAGATATGGCCACTACGAGTTTGTGGTGATGTCTTTTGGTCTCACCAATGCCCCAATGGTTTCGTTTC
TGGGGCACATAGTGTCTAAAGACGGAATCTTCGTAGATCCCAATAAGATAGAGGCGGTCACGAAATGGAAACGCCCAACAACGGTCACCGAGATACGAAGTTTCTTGGGA
TTGGCGGGTTATTATCGAAGGTTTGTCCAGGACTTCGCTAGAATAGCCACGCCTCTCACCCAATTAACCAAAAAAGGTGTACCTTTTGTTTGGGACGATACTTGTGAGGT
CAGCTTTCAAGAACTAAAACAACGACTAGTATCCGCCCCAGTGCTCACTGTTCCAGAAAGTTCTGTGGGATACGCGATTTACAGTGACGCATCCAAAAAGGGATTAGGTT
GTGTACTGATGCAACATGGCAAGGTTGTTGCGTGGAACAAAGATGTATCAAGACTTGAAAAAACACTTCTGGTGGCGAAGTATGAAGAAGGATATCGCGGAGTACGTGAG
CTTACCCAAAACATTAAAGGGCTATACGGTAATCTGGGTGGTCGTGGATCGCCTGACAAAGTCGGCTCACTTCCTACTGGCTTCCAAGCAACCATCGGAATGGCACCCTT
TGAGGCGTTGTATGGAAAACGATGTAGATCCCCATTGTGTTGGGACGAAGTAGGAGAACGAGAATTGATAGGACCCGAGCTGGTCCATGTCACCAATGAAGCAATCCAGA
AAATCCGAGTAAGAATGCGTATCACGCAGAGTAGGCAAAAGAGCTACGCCGACGTTAGGCGTAGGAATTTAGAGTTTGAAGAGGGGGACCCAGTGTTCCTAAAAGTAGCC
CCCATGAAAGAGCCGAAAACTCCGGCGAACGCCGGAGTTTTCGCTCACACACGACCCACAAAGGGCGGCCGAGGGCTGGGATGCCTCAAAATCGCGTCCCCGACCTTCAG
CCACTCTGAAAGAACTTGCAAAGAAACGAAAACGATAAGGGCAACGCACCCATGTGACAACGACGACGGCATCGTACCCCCCAGGATCGTGGCACATGCATAG
Protein sequenceShow/hide protein sequence
MHALSVGTPAGVDLVTKDRVRDGQVVIAGQTIHVDLKVVDMMDFDVILGMDWLAENFATIDCHKKEVIFTPPNGLTFKFKGTSTGTTPKIISMMKARRLIQQGGWAFLAY
AVNTKGKEKPIDTIPVVNEFMDVFREDLPGIPPSREVDFGIDLEPGTGPISKAPYRMAPAELKELKTQLQDLLDKGFIRPSVSPWGALVLFVKKKDGSMRLCIDYRELNK
RTVKNKYPLPRIEDLFDQLRGATVFSKIDLRSGYHQIKIKNEDIPKTAFRTRYGHYEFVVMSFGLTNAPMVSFLGHIVSKDGIFVDPNKIEAVTKWKRPTTVTEIRSFLG
LAGYYRRFVQDFARIATPLTQLTKKGVPFVWDDTCEVSFQELKQRLVSAPVLTVPESSVGYAIYSDASKKGLGCVLMQHGKVVAWNKDVSRLEKTLLVAKYEEGYRGVRE
LTQNIKGLYGNLGGRGSPDKVGSLPTGFQATIGMAPFEALYGKRCRSPLCWDEVGERELIGPELVHVTNEAIQKIRVRMRITQSRQKSYADVRRRNLEFEEGDPVFLKVA
PMKEPKTPANAGVFAHTRPTKGGRGLGCLKIASPTFSHSERTCKETKTIRATHPCDNDDGIVPPRIVAHA