CuGenDBv2

Lag0036492 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0036492
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Retrovirus-related Pol polyprotein from transposon TNT 1-94
Genome location: chr3:47299231..47301582
RNA-Seq Expression: Lag0036492
Synteny: Lag0036492
Gene Ontology terms:
    GO:0003676 - nucleic acid binding (molecular function)
    GO:0008270 - zinc ion binding (molecular function)
InterPro domains:
    IPR001878 - Zinc finger, CCHC-type
    IPR012337 - Ribonuclease H-like superfamily
    IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
    IPR036875 - Zinc finger, CCHC-type superfamily


Homology
GenBank top hits (e value, % identity, alignment)
KAA0050719.1 putative gag-pol polyprotein [Cucumis melo var. makuwa] | e value: 9.8e-82 | % identity: 29.21
Query:  VDDFKKMISEFKNMNEKVGEENEAFVLLNSLPKAYKEVKNALKYGIDNLTTEIIISAIKTKVLEILSQKNETNEGLFVKGKS-----KGRENKHQIEEKN
        +D+F+K++ +  N+ EK+ +EN+A +LLNSLP+ Y+EVK A+KYG D+LT  I++ A+KT+ LEI  ++ +  E L  +G+S     KG+E   + + K 
Subjt:  VDDFKKMISEFKNMNEKVGEENEAFVLLNSLPKAYKEVKNALKYGIDNLTTEIIISAIKTKVLEILSQKNETNEGLFVKGKS-----KGRENKHQIEEKN

Query:  KAKIRCNYCHKEGHLKRDCYSLKRKNQNQRYKKNKQPEASVCENSIT--YS--------DALATSYQCSQDQSSTEK---HDWVIDSGCSFHMTPSKGWF
        K++ +C  CHKEGH K++C              NK  EAS  E ++T  Y+        D+  T Y+ ++    + +     W++DSGC+FHMTP + + 
Subjt:  KAKIRCNYCHKEGHLKRDCYSLKRKNQNQRYKKNKQPEASVCENSIT--YS--------DALATSYQCSQDQSSTEK---HDWVIDSGCSFHMTPSKGWF

Query:  NTYLEWDGGIVYMGNNNTCRVNGIRSVSLKLKDGSTKLLRNVRHVPNLKRNLISLGCLTQ-----------LAARMGEVVDVE-MIQHAL----------
          + + DGG V +G+N TC V G  SV +   DG  ++L NVR+VP LKRNLISLG L +           +    G +V +   ++H L          
Subjt:  NTYLEWDGGIVYMGNNNTCRVNGIRSVSLKLKDGSTKLLRNVRHVPNLKRNLISLGCLTQ-----------LAARMGEVVDVE-MIQHAL----------

Query:  ---VVSDKSSTESDLWHKRMSHISEK--------------------------------------------------------------------------
           + S K +  S LWHKR++H+SE+                                                                          
Subjt:  ---VVSDKSSTESDLWHKRMSHISEK--------------------------------------------------------------------------

Query:  --------------------------------------------------------------------------------------------DAMLSEKF
                                                                                                    +A L  KF
Subjt:  --------------------------------------------------------------------------------------------DAMLSEKF

Query:  WVEAASYTVYTLNRCPHTSINFLTPEEKWSGKPPKLQHLKVFGCTGFIHQNQGKLKARAAKCMFLGFTEGVKGYRMWHPVEK---RCVNSRDVIFREQDM
        W EAA    Y +NR P T++N  TP+E W+GK P L+HL+VFGCT + H   GKL  RA KCMF+G+ +GVKGY++W  +EK   +C+ SRDV F E +M
Subjt:  WVEAASYTVYTLNRCPHTSINFLTPEEKWSGKPPKLQHLKVFGCTGFIHQNQGKLKARAAKCMFLGFTEGVKGYRMWHPVEK---RCVNSRDVIFREQDM

Query:  -----------------FMLQTHTEDKPSYEPN-------------------------------------------------TRDRQRRTIVPPSRFSEA
                           ++  +E +PS + +                                                 TRDR +R    P R+  A
Subjt:  -----------------FMLQTHTEDKPSYEPN-------------------------------------------------TRDRQRRTIVPPSRFSEA

Query:  DCISLALNVADSLNIEEPSSFDEAVNGPNARDWIEAMNEEMKSLEENFTWTLKPLPNGYKPITSKWIFKIKEGITGVLKPRFKARLVAKGFTQKEGIDYN
        D ++ AL  A      EP +F+EA+   + + W +AM EE+ SL +N TW+L P P   K I SKWI+KIK G  G  KPR+KARLVAKG+TQKEG+D++
Subjt:  DCISLALNVADSLNIEEPSSFDEAVNGPNARDWIEAMNEEMKSLEENFTWTLKPLPNGYKPITSKWIFKIKEGITGVLKPRFKARLVAKGFTQKEGIDYN

Query:  K
        +
Subjt:  K

PNX96445.1 copia LTR rider [Trifolium pratense] | e value: 1.8e-80 | % identity: 29.48
Query:  VDDFKKMISEFKNMNEKVGEENEAFVLLNSLPKAYKEVKNALKYGIDNLTTEIIISAIKTKVL-EILSQKNET-NEGLFVKGKSKGRENKHQIEEKNKAK
        +D F K+I + +N++ K+ +E++A +LL +LP+++   K  L YG ++LT E + SA+ +K L E    K  T  EGL VKGK   +  K   + K+++K
Subjt:  VDDFKKMISEFKNMNEKVGEENEAFVLLNSLPKAYKEVKNALKYGIDNLTTEIIISAIKTKVL-EILSQKNET-NEGLFVKGKSKGRENKHQIEEKNKAK

Query:  --------IRCNYCHKEGHLKRDCYSLKRKNQNQRYKKN-KQPEASVCENSITYSDALATSYQCSQDQSSTEKHDWVIDSGCSFHMTPSKGWFNTYLEWD
                IRC +C KEGH ++ C         +R K +     A++ ++    SD L  S       SS  + +W++DSGC++HMTP+K  F    + D
Subjt:  --------IRCNYCHKEGHLKRDCYSLKRKNQNQRYKKN-KQPEASVCENSITYSDALATSYQCSQDQSSTEKHDWVIDSGCSFHMTPSKGWFNTYLEWD

Query:  GGIVYMGNNNTCRVNGIRSVSLKLKDGSTKLLRNVRHVPNLKRNLISLG-------------CLTQLAARMGEV-----------VDVEMIQHAL-VVSD
        GG V +GNN  C++ G+ SV  KL D S +LL  VR+VP+LKRNL+SLG              + ++     EV           ++ E++  +  VVS 
Subjt:  GGIVYMGNNNTCRVNGIRSVSLKLKDGSTKLLRNVRHVPNLKRNLISLG-------------CLTQLAARMGEV-----------VDVEMIQHAL-VVSD

Query:  KSSTESDLWHKRMSHISEK---------------------------------------------------------------------------------
        K  +++++WH R+ H+SE+                                                                                 
Subjt:  KSSTESDLWHKRMSHISEK---------------------------------------------------------------------------------

Query:  -------------------------------------------------------------------------------------DAMLSEKFWVEAASY
                                                                                              A L + FW EA S 
Subjt:  -------------------------------------------------------------------------------------DAMLSEKFWVEAASY

Query:  TVYTLNRCPHTSINFLTPEEKWSGKPPKLQHLKVFGCTGFIHQNQGKLKARAAKCMFLGFTEGVKGYRMW--HPVEKRCVNSRDVIFREQDMFMLQT---
          Y +NRCP T+++  TPEE WSG PP L  L+VFGC  + H  Q K++ RA KCMF+G+ EGVK YR+W   P  KRC+ SRDV+F E +M   +T   
Subjt:  TVYTLNRCPHTSINFLTPEEKWSGKPPKLQHLKVFGCTGFIHQNQGKLKARAAKCMFLGFTEGVKGYRMW--HPVEKRCVNSRDVIFREQDMFMLQT---

Query:  ---------------------------HTEDKPSYEPN--------------TRDRQRRTIVPPSRFSEADCISLALNVADSLNIEEPSSFDEAVNGPNA
                                   H  D+   E                +RDR RR I  P R   AD I+ AL  A  +  EEP  + E +   N 
Subjt:  ---------------------------HTEDKPSYEPN--------------TRDRQRRTIVPPSRFSEADCISLALNVADSLNIEEPSSFDEAVNGPNA

Query:  RDWIEAMNEEMKSLEENFTWTLKPLPNGYKPITSKWIFKIKEGITGVLKPRFKARLVAKGFTQKEGIDYN
         +W++AM++EMKSL +N TW L   P G + ++ KWIFK+KEGI GV   R+KARLVA+GFTQKEG+D+N
Subjt:  RDWIEAMNEEMKSLEENFTWTLKPLPNGYKPITSKWIFKIKEGITGVLKPRFKARLVAKGFTQKEGIDYN

RZB42800.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Glycine soja] | e value: 3.7e-81 | % identity: 30.07
Query:  LSDFSLSRLLPRSHARFSLS----AVSALSLFCLCTVVAVNCRLRPVIAVVDDFKKMISEFKNMNEKVGEENEAFVLLNSLPKAYKEVKNALKYGIDNLT
        ++D     LL ++H+   LS     +  +S       V    + R V   +D F K+I + +N++  + +E++A +LL SLPK+Y   K  L +G D+++
Subjt:  LSDFSLSRLLPRSHARFSLS----AVSALSLFCLCTVVAVNCRLRPVIAVVDDFKKMISEFKNMNEKVGEENEAFVLLNSLPKAYKEVKNALKYGIDNLT

Query:  TEIIISAIKTKVLEILSQK--NETNEGLFVKGKSKGRENKHQIE----------EKNKAKIRCNYCHKEGHLKRDCYSLKRKNQNQRYKKNKQPEASVCE
         + + +A+ +K      +K  + + EGL  +GK+  +++K   +          E N  KIRC +C KEGH ++ C   ++   +   KK+    A V +
Subjt:  TEIIISAIKTKVLEILSQK--NETNEGLFVKGKSKGRENKHQIE----------EKNKAKIRCNYCHKEGHLKRDCYSLKRKNQNQRYKKNKQPEASVCE

Query:  NSITYSDALATSYQCSQDQSSTEKHDWVIDSGCSFHMTPSKGWFNTYLEWDGGIVYMGNNNTCRVNGIRSVSLKLKDGSTKLLRNVRHVPNLKRNLISLG
        +    ++ L  S           +  W++DSGCS+HMTP++ WF  + +   G+V +G+N  C++ GI S+  K  DG+ ++L  VR+VP LKRNLISLG
Subjt:  NSITYSDALATSYQCSQDQSSTEKHDWVIDSGCSFHMTPSKGWFNTYLEWDGGIVYMGNNNTCRVNGIRSVSLKLKDGSTKLLRNVRHVPNLKRNLISLG

Query:  -----------------------------------------CLTQLAARMGEV-VDVEMIQHALVVSDKSSTESDLW-----------------------
                                                  +   A   G V +   ++ H +         +D W                       
Subjt:  -----------------------------------------CLTQLAARMGEV-VDVEMIQHALVVSDKSSTESDLW-----------------------

Query:  ------------------HKRMSHISEKDAM--------------------LSEKFWVEAASYTVYTLNRCPHTSINFLTPEEKWSGKPPKLQHLKVFGC
                          H+ ++   +++ +                    L + FW EAA   VY +N+CP T++NF TPEE WSG PP L+ LKVFGC
Subjt:  ------------------HKRMSHISEKDAM--------------------LSEKFWVEAASYTVYTLNRCPHTSINFLTPEEKWSGKPPKLQHLKVFGC

Query:  TGFIHQNQGKLKARAAKCMFLGFTEGVKGYRMW--HPVEKRCVNSRDVIFREQDMF------MLQTHTE-------DKPSYEPNT---------------
          + H  Q KL+ RA KC+FLG+ EGVKGY++W      KRC+ S DV+F E +M       M+Q+ T+       +K ++E  T               
Subjt:  TGFIHQNQGKLKARAAKCMFLGFTEGVKGYRMW--HPVEKRCVNSRDVIFREQDMF------MLQTHTE-------DKPSYEPNT---------------

Query:  -----------------RDRQRRTIVPPSRFSEADCISLALNVADSLNIEEPSSFDEAVNGPNARDWIEAMNEEMKSLEENFTWTLKPLPNGYKPITSKW
                         RDR RR I  P ++  AD I+ AL  A  +  E+P +    +       W+ AMNEE+KSL +N TW L   P G + ++ KW
Subjt:  -----------------RDRQRRTIVPPSRFSEADCISLALNVADSLNIEEPSSFDEAVNGPNARDWIEAMNEEMKSLEENFTWTLKPLPNGYKPITSKW

Query:  IFKIKEGITGVLKPRFKARLVAKGFTQKEGIDYNK
        IFK KEGI GV   RFKARLVA+GFTQKEGID+N+
Subjt:  IFKIKEGITGVLKPRFKARLVAKGFTQKEGIDYNK

RZB91070.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94, partial [Glycine soja] | e value: 6.1e-84 | % identity: 33.11
Query:  VDDFKKMISEFKNMNEKVGEENEAFVLLNSLPKAYKEVKNALKYGIDNLTTEIIISAIKTKVLEILSQKNETNEGLFVKGKSKGRENKHQIEEKNKAKIR
        +D+   ++ E ++++ K+ +E+ A +LL SLP +Y+         +++L+ E   S +      + S  N+  +    KGK K   N   I         
Subjt:  VDDFKKMISEFKNMNEKVGEENEAFVLLNSLPKAYKEVKNALKYGIDNLTTEIIISAIKTKVLEILSQKNETNEGLFVKGKSKGRENKHQIEEKNKAKIR

Query:  CNYCHKEGHLKRDCYSLKRKNQNQRYKKNKQPEASVCENSITYSDALATSYQCSQDQSSTEKHDWVIDSGCSFHMTPSKGWFNTYLEWDGGIVYMGNNNT
        CNYC + GH K+DC            KK  +P A V +   T  + L  S     D     ++ W++DSGCSFHM P+K WF+TY E  GG V+MGN+ +
Subjt:  CNYCHKEGHLKRDCYSLKRKNQNQRYKKNKQPEASVCENSITYSDALATSYQCSQDQSSTEKHDWVIDSGCSFHMTPSKGWFNTYLEWDGGIVYMGNNNT

Query:  CRVNGIRSVSLKLKDGSTKLLRNVRHVPNLKRNLISLG--------CLTQ---LAARMGEVVDVEMIQHA---------------LVVSDKSSTE----S
        C+  GI ++ +K+ DG  + L  VRHVP LK+NLIS+G        C T+   +  + G  + ++ I+                 + V+ +S+      +
Subjt:  CRVNGIRSVSLKLKDGSTKLLRNVRHVPNLKRNLISLG--------CLTQ---LAARMGEVVDVEMIQHA---------------LVVSDKSSTE----S

Query:  DLWHKRMSHISEKDAMLSE-----------------------KFWVEAASYTVYTLNRCPHTSINFLTPEEKWSGKPPKLQHLKVFGCTGFIHQNQGKLK
         LWH R+SH+SEK    ++                        FW EA + T + +NR P T+I    P E W+GK P   +L+VFGC  + H N+GKL 
Subjt:  DLWHKRMSHISEKDAMLSE-----------------------KFWVEAASYTVYTLNRCPHTSINFLTPEEKWSGKPPKLQHLKVFGCTGFIHQNQGKLK

Query:  ARAAKCMFLGFTEGVKGYRMWHPVEKRCVNSRDVIFREQDMF----------MLQTHTEDKPSYEPNTRD------------------RQRRTIVPPSRF
         R+ K +F+G+ +GVKGYR+W P EK+ + SRDVIF E  +F              H+  +   E   +D                  R +R   PP R+
Subjt:  ARAAKCMFLGFTEGVKGYRMWHPVEKRCVNSRDVIFREQDMF----------MLQTHTEDKPSYEPNTRD------------------RQRRTIVPPSRF

Query:  SEADCISLALNVADSLNIEEPSSFDEAVNGPNARDWIEAMNEEMKSLEENFTWTLKPLPNGYKPITSKWIFKIKEGITGVLKPRFKARLVAKGFTQKEGI
           D  + AL  A+ ++  EP+++ EA+N P A +W+ AM EEM+SL +N TW L  LP G   +  KWI+K K G++     R+KARLVAKGF+QKEG+
Subjt:  SEADCISLALNVADSLNIEEPSSFDEAVNGPNARDWIEAMNEEMKSLEENFTWTLKPLPNGYKPITSKWIFKIKEGITGVLKPRFKARLVAKGFTQKEGI

Query:  DYNK
        D+N+
Subjt:  DYNK

TYK25306.1 putative gag-pol polyprotein [Cucumis melo var. makuwa] | e value: 9.8e-82 | % identity: 29.21
Query:  VDDFKKMISEFKNMNEKVGEENEAFVLLNSLPKAYKEVKNALKYGIDNLTTEIIISAIKTKVLEILSQKNETNEGLFVKGKS-----KGRENKHQIEEKN
        +D+F+K++ +  N+ EK+ +EN+A +LLNSLP+ Y+EVK A+KYG D+LT  I++ A+KT+ LEI  ++ +  E L  +G+S     KG+E   + + K 
Subjt:  VDDFKKMISEFKNMNEKVGEENEAFVLLNSLPKAYKEVKNALKYGIDNLTTEIIISAIKTKVLEILSQKNETNEGLFVKGKS-----KGRENKHQIEEKN

Query:  KAKIRCNYCHKEGHLKRDCYSLKRKNQNQRYKKNKQPEASVCENSIT--YS--------DALATSYQCSQDQSSTEK---HDWVIDSGCSFHMTPSKGWF
        K++ +C  CHKEGH K++C              NK  EAS  E ++T  Y+        D+  T Y+ ++    + +     W++DSGC+FHMTP + + 
Subjt:  KAKIRCNYCHKEGHLKRDCYSLKRKNQNQRYKKNKQPEASVCENSIT--YS--------DALATSYQCSQDQSSTEK---HDWVIDSGCSFHMTPSKGWF

Query:  NTYLEWDGGIVYMGNNNTCRVNGIRSVSLKLKDGSTKLLRNVRHVPNLKRNLISLGCLTQ-----------LAARMGEVVDVE-MIQHAL----------
          + + DGG V +G+N TC V G  SV +   DG  ++L NVR+VP LKRNLISLG L +           +    G +V +   ++H L          
Subjt:  NTYLEWDGGIVYMGNNNTCRVNGIRSVSLKLKDGSTKLLRNVRHVPNLKRNLISLGCLTQ-----------LAARMGEVVDVE-MIQHAL----------

Query:  ---VVSDKSSTESDLWHKRMSHISEK--------------------------------------------------------------------------
           + S K +  S LWHKR++H+SE+                                                                          
Subjt:  ---VVSDKSSTESDLWHKRMSHISEK--------------------------------------------------------------------------

Query:  --------------------------------------------------------------------------------------------DAMLSEKF
                                                                                                    +A L  KF
Subjt:  --------------------------------------------------------------------------------------------DAMLSEKF

Query:  WVEAASYTVYTLNRCPHTSINFLTPEEKWSGKPPKLQHLKVFGCTGFIHQNQGKLKARAAKCMFLGFTEGVKGYRMWHPVEK---RCVNSRDVIFREQDM
        W EAA    Y +NR P T++N  TP+E W+GK P L+HL+VFGCT + H   GKL  RA KCMF+G+ +GVKGY++W  +EK   +C+ SRDV F E +M
Subjt:  WVEAASYTVYTLNRCPHTSINFLTPEEKWSGKPPKLQHLKVFGCTGFIHQNQGKLKARAAKCMFLGFTEGVKGYRMWHPVEK---RCVNSRDVIFREQDM

Query:  -----------------FMLQTHTEDKPSYEPN-------------------------------------------------TRDRQRRTIVPPSRFSEA
                           ++  +E +PS + +                                                 TRDR +R    P R+  A
Subjt:  -----------------FMLQTHTEDKPSYEPN-------------------------------------------------TRDRQRRTIVPPSRFSEA

Query:  DCISLALNVADSLNIEEPSSFDEAVNGPNARDWIEAMNEEMKSLEENFTWTLKPLPNGYKPITSKWIFKIKEGITGVLKPRFKARLVAKGFTQKEGIDYN
        D ++ AL  A      EP +F+EA+   + + W +AM EE+ SL +N TW+L P P   K I SKWI+KIK G  G  KPR+KARLVAKG+TQKEG+D++
Subjt:  DCISLALNVADSLNIEEPSSFDEAVNGPNARDWIEAMNEEMKSLEENFTWTLKPLPNGYKPITSKWIFKIKEGITGVLKPRFKARLVAKGFTQKEGIDYN

Query:  K
        +
Subjt:  K

TrEMBL top hits (e value, % identity, alignment)
A0A2K3N065 Copia LTR rider | e value: 9.0e-81 | % identity: 29.48
Query:  VDDFKKMISEFKNMNEKVGEENEAFVLLNSLPKAYKEVKNALKYGIDNLTTEIIISAIKTKVL-EILSQKNET-NEGLFVKGKSKGRENKHQIEEKNKAK
        +D F K+I + +N++ K+ +E++A +LL +LP+++   K  L YG ++LT E + SA+ +K L E    K  T  EGL VKGK   +  K   + K+++K
Subjt:  VDDFKKMISEFKNMNEKVGEENEAFVLLNSLPKAYKEVKNALKYGIDNLTTEIIISAIKTKVL-EILSQKNET-NEGLFVKGKSKGRENKHQIEEKNKAK

Query:  --------IRCNYCHKEGHLKRDCYSLKRKNQNQRYKKN-KQPEASVCENSITYSDALATSYQCSQDQSSTEKHDWVIDSGCSFHMTPSKGWFNTYLEWD
                IRC +C KEGH ++ C         +R K +     A++ ++    SD L  S       SS  + +W++DSGC++HMTP+K  F    + D
Subjt:  --------IRCNYCHKEGHLKRDCYSLKRKNQNQRYKKN-KQPEASVCENSITYSDALATSYQCSQDQSSTEKHDWVIDSGCSFHMTPSKGWFNTYLEWD

Query:  GGIVYMGNNNTCRVNGIRSVSLKLKDGSTKLLRNVRHVPNLKRNLISLG-------------CLTQLAARMGEV-----------VDVEMIQHAL-VVSD
        GG V +GNN  C++ G+ SV  KL D S +LL  VR+VP+LKRNL+SLG              + ++     EV           ++ E++  +  VVS 
Subjt:  GGIVYMGNNNTCRVNGIRSVSLKLKDGSTKLLRNVRHVPNLKRNLISLG-------------CLTQLAARMGEV-----------VDVEMIQHAL-VVSD

Query:  KSSTESDLWHKRMSHISEK---------------------------------------------------------------------------------
        K  +++++WH R+ H+SE+                                                                                 
Subjt:  KSSTESDLWHKRMSHISEK---------------------------------------------------------------------------------

Query:  -------------------------------------------------------------------------------------DAMLSEKFWVEAASY
                                                                                              A L + FW EA S 
Subjt:  -------------------------------------------------------------------------------------DAMLSEKFWVEAASY

Query:  TVYTLNRCPHTSINFLTPEEKWSGKPPKLQHLKVFGCTGFIHQNQGKLKARAAKCMFLGFTEGVKGYRMW--HPVEKRCVNSRDVIFREQDMFMLQT---
          Y +NRCP T+++  TPEE WSG PP L  L+VFGC  + H  Q K++ RA KCMF+G+ EGVK YR+W   P  KRC+ SRDV+F E +M   +T   
Subjt:  TVYTLNRCPHTSINFLTPEEKWSGKPPKLQHLKVFGCTGFIHQNQGKLKARAAKCMFLGFTEGVKGYRMW--HPVEKRCVNSRDVIFREQDMFMLQT---

Query:  ---------------------------HTEDKPSYEPN--------------TRDRQRRTIVPPSRFSEADCISLALNVADSLNIEEPSSFDEAVNGPNA
                                   H  D+   E                +RDR RR I  P R   AD I+ AL  A  +  EEP  + E +   N 
Subjt:  ---------------------------HTEDKPSYEPN--------------TRDRQRRTIVPPSRFSEADCISLALNVADSLNIEEPSSFDEAVNGPNA

Query:  RDWIEAMNEEMKSLEENFTWTLKPLPNGYKPITSKWIFKIKEGITGVLKPRFKARLVAKGFTQKEGIDYN
         +W++AM++EMKSL +N TW L   P G + ++ KWIFK+KEGI GV   R+KARLVA+GFTQKEG+D+N
Subjt:  RDWIEAMNEEMKSLEENFTWTLKPLPNGYKPITSKWIFKIKEGITGVLKPRFKARLVAKGFTQKEGIDYN

A0A445F227 Retrovirus-related Pol polyprotein from transposon TNT 1-94 | e value: 1.8e-81 | % identity: 30.07
Query:  LSDFSLSRLLPRSHARFSLS----AVSALSLFCLCTVVAVNCRLRPVIAVVDDFKKMISEFKNMNEKVGEENEAFVLLNSLPKAYKEVKNALKYGIDNLT
        ++D     LL ++H+   LS     +  +S       V    + R V   +D F K+I + +N++  + +E++A +LL SLPK+Y   K  L +G D+++
Subjt:  LSDFSLSRLLPRSHARFSLS----AVSALSLFCLCTVVAVNCRLRPVIAVVDDFKKMISEFKNMNEKVGEENEAFVLLNSLPKAYKEVKNALKYGIDNLT

Query:  TEIIISAIKTKVLEILSQK--NETNEGLFVKGKSKGRENKHQIE----------EKNKAKIRCNYCHKEGHLKRDCYSLKRKNQNQRYKKNKQPEASVCE
         + + +A+ +K      +K  + + EGL  +GK+  +++K   +          E N  KIRC +C KEGH ++ C   ++   +   KK+    A V +
Subjt:  TEIIISAIKTKVLEILSQK--NETNEGLFVKGKSKGRENKHQIE----------EKNKAKIRCNYCHKEGHLKRDCYSLKRKNQNQRYKKNKQPEASVCE

Query:  NSITYSDALATSYQCSQDQSSTEKHDWVIDSGCSFHMTPSKGWFNTYLEWDGGIVYMGNNNTCRVNGIRSVSLKLKDGSTKLLRNVRHVPNLKRNLISLG
        +    ++ L  S           +  W++DSGCS+HMTP++ WF  + +   G+V +G+N  C++ GI S+  K  DG+ ++L  VR+VP LKRNLISLG
Subjt:  NSITYSDALATSYQCSQDQSSTEKHDWVIDSGCSFHMTPSKGWFNTYLEWDGGIVYMGNNNTCRVNGIRSVSLKLKDGSTKLLRNVRHVPNLKRNLISLG

Query:  -----------------------------------------CLTQLAARMGEV-VDVEMIQHALVVSDKSSTESDLW-----------------------
                                                  +   A   G V +   ++ H +         +D W                       
Subjt:  -----------------------------------------CLTQLAARMGEV-VDVEMIQHALVVSDKSSTESDLW-----------------------

Query:  ------------------HKRMSHISEKDAM--------------------LSEKFWVEAASYTVYTLNRCPHTSINFLTPEEKWSGKPPKLQHLKVFGC
                          H+ ++   +++ +                    L + FW EAA   VY +N+CP T++NF TPEE WSG PP L+ LKVFGC
Subjt:  ------------------HKRMSHISEKDAM--------------------LSEKFWVEAASYTVYTLNRCPHTSINFLTPEEKWSGKPPKLQHLKVFGC

Query:  TGFIHQNQGKLKARAAKCMFLGFTEGVKGYRMW--HPVEKRCVNSRDVIFREQDMF------MLQTHTE-------DKPSYEPNT---------------
          + H  Q KL+ RA KC+FLG+ EGVKGY++W      KRC+ S DV+F E +M       M+Q+ T+       +K ++E  T               
Subjt:  TGFIHQNQGKLKARAAKCMFLGFTEGVKGYRMW--HPVEKRCVNSRDVIFREQDMF------MLQTHTE-------DKPSYEPNT---------------

Query:  -----------------RDRQRRTIVPPSRFSEADCISLALNVADSLNIEEPSSFDEAVNGPNARDWIEAMNEEMKSLEENFTWTLKPLPNGYKPITSKW
                         RDR RR I  P ++  AD I+ AL  A  +  E+P +    +       W+ AMNEE+KSL +N TW L   P G + ++ KW
Subjt:  -----------------RDRQRRTIVPPSRFSEADCISLALNVADSLNIEEPSSFDEAVNGPNARDWIEAMNEEMKSLEENFTWTLKPLPNGYKPITSKW

Query:  IFKIKEGITGVLKPRFKARLVAKGFTQKEGIDYNK
        IFK KEGI GV   RFKARLVA+GFTQKEGID+N+
Subjt:  IFKIKEGITGVLKPRFKARLVAKGFTQKEGIDYNK

A0A445IYF4 Retrovirus-related Pol polyprotein from transposon TNT 1-94 (Fragment) | e value: 3.0e-84 | % identity: 33.11
Query:  VDDFKKMISEFKNMNEKVGEENEAFVLLNSLPKAYKEVKNALKYGIDNLTTEIIISAIKTKVLEILSQKNETNEGLFVKGKSKGRENKHQIEEKNKAKIR
        +D+   ++ E ++++ K+ +E+ A +LL SLP +Y+         +++L+ E   S +      + S  N+  +    KGK K   N   I         
Subjt:  VDDFKKMISEFKNMNEKVGEENEAFVLLNSLPKAYKEVKNALKYGIDNLTTEIIISAIKTKVLEILSQKNETNEGLFVKGKSKGRENKHQIEEKNKAKIR

Query:  CNYCHKEGHLKRDCYSLKRKNQNQRYKKNKQPEASVCENSITYSDALATSYQCSQDQSSTEKHDWVIDSGCSFHMTPSKGWFNTYLEWDGGIVYMGNNNT
        CNYC + GH K+DC            KK  +P A V +   T  + L  S     D     ++ W++DSGCSFHM P+K WF+TY E  GG V+MGN+ +
Subjt:  CNYCHKEGHLKRDCYSLKRKNQNQRYKKNKQPEASVCENSITYSDALATSYQCSQDQSSTEKHDWVIDSGCSFHMTPSKGWFNTYLEWDGGIVYMGNNNT

Query:  CRVNGIRSVSLKLKDGSTKLLRNVRHVPNLKRNLISLG--------CLTQ---LAARMGEVVDVEMIQHA---------------LVVSDKSSTE----S
        C+  GI ++ +K+ DG  + L  VRHVP LK+NLIS+G        C T+   +  + G  + ++ I+                 + V+ +S+      +
Subjt:  CRVNGIRSVSLKLKDGSTKLLRNVRHVPNLKRNLISLG--------CLTQ---LAARMGEVVDVEMIQHA---------------LVVSDKSSTE----S

Query:  DLWHKRMSHISEKDAMLSE-----------------------KFWVEAASYTVYTLNRCPHTSINFLTPEEKWSGKPPKLQHLKVFGCTGFIHQNQGKLK
         LWH R+SH+SEK    ++                        FW EA + T + +NR P T+I    P E W+GK P   +L+VFGC  + H N+GKL 
Subjt:  DLWHKRMSHISEKDAMLSE-----------------------KFWVEAASYTVYTLNRCPHTSINFLTPEEKWSGKPPKLQHLKVFGCTGFIHQNQGKLK

Query:  ARAAKCMFLGFTEGVKGYRMWHPVEKRCVNSRDVIFREQDMF----------MLQTHTEDKPSYEPNTRD------------------RQRRTIVPPSRF
         R+ K +F+G+ +GVKGYR+W P EK+ + SRDVIF E  +F              H+  +   E   +D                  R +R   PP R+
Subjt:  ARAAKCMFLGFTEGVKGYRMWHPVEKRCVNSRDVIFREQDMF----------MLQTHTEDKPSYEPNTRD------------------RQRRTIVPPSRF

Query:  SEADCISLALNVADSLNIEEPSSFDEAVNGPNARDWIEAMNEEMKSLEENFTWTLKPLPNGYKPITSKWIFKIKEGITGVLKPRFKARLVAKGFTQKEGI
           D  + AL  A+ ++  EP+++ EA+N P A +W+ AM EEM+SL +N TW L  LP G   +  KWI+K K G++     R+KARLVAKGF+QKEG+
Subjt:  SEADCISLALNVADSLNIEEPSSFDEAVNGPNARDWIEAMNEEMKSLEENFTWTLKPLPNGYKPITSKWIFKIKEGITGVLKPRFKARLVAKGFTQKEGI

Query:  DYNK
        D+N+
Subjt:  DYNK

A0A5A7UB25 Putative gag-pol polyprotein | e value: 4.7e-82 | % identity: 29.21
Query:  VDDFKKMISEFKNMNEKVGEENEAFVLLNSLPKAYKEVKNALKYGIDNLTTEIIISAIKTKVLEILSQKNETNEGLFVKGKS-----KGRENKHQIEEKN
        +D+F+K++ +  N+ EK+ +EN+A +LLNSLP+ Y+EVK A+KYG D+LT  I++ A+KT+ LEI  ++ +  E L  +G+S     KG+E   + + K 
Subjt:  VDDFKKMISEFKNMNEKVGEENEAFVLLNSLPKAYKEVKNALKYGIDNLTTEIIISAIKTKVLEILSQKNETNEGLFVKGKS-----KGRENKHQIEEKN

Query:  KAKIRCNYCHKEGHLKRDCYSLKRKNQNQRYKKNKQPEASVCENSIT--YS--------DALATSYQCSQDQSSTEK---HDWVIDSGCSFHMTPSKGWF
        K++ +C  CHKEGH K++C              NK  EAS  E ++T  Y+        D+  T Y+ ++    + +     W++DSGC+FHMTP + + 
Subjt:  KAKIRCNYCHKEGHLKRDCYSLKRKNQNQRYKKNKQPEASVCENSIT--YS--------DALATSYQCSQDQSSTEK---HDWVIDSGCSFHMTPSKGWF

Query:  NTYLEWDGGIVYMGNNNTCRVNGIRSVSLKLKDGSTKLLRNVRHVPNLKRNLISLGCLTQ-----------LAARMGEVVDVE-MIQHAL----------
          + + DGG V +G+N TC V G  SV +   DG  ++L NVR+VP LKRNLISLG L +           +    G +V +   ++H L          
Subjt:  NTYLEWDGGIVYMGNNNTCRVNGIRSVSLKLKDGSTKLLRNVRHVPNLKRNLISLGCLTQ-----------LAARMGEVVDVE-MIQHAL----------

Query:  ---VVSDKSSTESDLWHKRMSHISEK--------------------------------------------------------------------------
           + S K +  S LWHKR++H+SE+                                                                          
Subjt:  ---VVSDKSSTESDLWHKRMSHISEK--------------------------------------------------------------------------

Query:  --------------------------------------------------------------------------------------------DAMLSEKF
                                                                                                    +A L  KF
Subjt:  --------------------------------------------------------------------------------------------DAMLSEKF

Query:  WVEAASYTVYTLNRCPHTSINFLTPEEKWSGKPPKLQHLKVFGCTGFIHQNQGKLKARAAKCMFLGFTEGVKGYRMWHPVEK---RCVNSRDVIFREQDM
        W EAA    Y +NR P T++N  TP+E W+GK P L+HL+VFGCT + H   GKL  RA KCMF+G+ +GVKGY++W  +EK   +C+ SRDV F E +M
Subjt:  WVEAASYTVYTLNRCPHTSINFLTPEEKWSGKPPKLQHLKVFGCTGFIHQNQGKLKARAAKCMFLGFTEGVKGYRMWHPVEK---RCVNSRDVIFREQDM

Query:  -----------------FMLQTHTEDKPSYEPN-------------------------------------------------TRDRQRRTIVPPSRFSEA
                           ++  +E +PS + +                                                 TRDR +R    P R+  A
Subjt:  -----------------FMLQTHTEDKPSYEPN-------------------------------------------------TRDRQRRTIVPPSRFSEA

Query:  DCISLALNVADSLNIEEPSSFDEAVNGPNARDWIEAMNEEMKSLEENFTWTLKPLPNGYKPITSKWIFKIKEGITGVLKPRFKARLVAKGFTQKEGIDYN
        D ++ AL  A      EP +F+EA+   + + W +AM EE+ SL +N TW+L P P   K I SKWI+KIK G  G  KPR+KARLVAKG+TQKEG+D++
Subjt:  DCISLALNVADSLNIEEPSSFDEAVNGPNARDWIEAMNEEMKSLEENFTWTLKPLPNGYKPITSKWIFKIKEGITGVLKPRFKARLVAKGFTQKEGIDYN

Query:  K
        +
Subjt:  K

A0A5D3DNU1 Putative gag-pol polyprotein | e value: 4.7e-82 | % identity: 29.21
Query:  VDDFKKMISEFKNMNEKVGEENEAFVLLNSLPKAYKEVKNALKYGIDNLTTEIIISAIKTKVLEILSQKNETNEGLFVKGKS-----KGRENKHQIEEKN
        +D+F+K++ +  N+ EK+ +EN+A +LLNSLP+ Y+EVK A+KYG D+LT  I++ A+KT+ LEI  ++ +  E L  +G+S     KG+E   + + K 
Subjt:  VDDFKKMISEFKNMNEKVGEENEAFVLLNSLPKAYKEVKNALKYGIDNLTTEIIISAIKTKVLEILSQKNETNEGLFVKGKS-----KGRENKHQIEEKN

Query:  KAKIRCNYCHKEGHLKRDCYSLKRKNQNQRYKKNKQPEASVCENSIT--YS--------DALATSYQCSQDQSSTEK---HDWVIDSGCSFHMTPSKGWF
        K++ +C  CHKEGH K++C              NK  EAS  E ++T  Y+        D+  T Y+ ++    + +     W++DSGC+FHMTP + + 
Subjt:  KAKIRCNYCHKEGHLKRDCYSLKRKNQNQRYKKNKQPEASVCENSIT--YS--------DALATSYQCSQDQSSTEK---HDWVIDSGCSFHMTPSKGWF

Query:  NTYLEWDGGIVYMGNNNTCRVNGIRSVSLKLKDGSTKLLRNVRHVPNLKRNLISLGCLTQ-----------LAARMGEVVDVE-MIQHAL----------
          + + DGG V +G+N TC V G  SV +   DG  ++L NVR+VP LKRNLISLG L +           +    G +V +   ++H L          
Subjt:  NTYLEWDGGIVYMGNNNTCRVNGIRSVSLKLKDGSTKLLRNVRHVPNLKRNLISLGCLTQ-----------LAARMGEVVDVE-MIQHAL----------

Query:  ---VVSDKSSTESDLWHKRMSHISEK--------------------------------------------------------------------------
           + S K +  S LWHKR++H+SE+                                                                          
Subjt:  ---VVSDKSSTESDLWHKRMSHISEK--------------------------------------------------------------------------

Query:  --------------------------------------------------------------------------------------------DAMLSEKF
                                                                                                    +A L  KF
Subjt:  --------------------------------------------------------------------------------------------DAMLSEKF

Query:  WVEAASYTVYTLNRCPHTSINFLTPEEKWSGKPPKLQHLKVFGCTGFIHQNQGKLKARAAKCMFLGFTEGVKGYRMWHPVEK---RCVNSRDVIFREQDM
        W EAA    Y +NR P T++N  TP+E W+GK P L+HL+VFGCT + H   GKL  RA KCMF+G+ +GVKGY++W  +EK   +C+ SRDV F E +M
Subjt:  WVEAASYTVYTLNRCPHTSINFLTPEEKWSGKPPKLQHLKVFGCTGFIHQNQGKLKARAAKCMFLGFTEGVKGYRMWHPVEK---RCVNSRDVIFREQDM

Query:  -----------------FMLQTHTEDKPSYEPN-------------------------------------------------TRDRQRRTIVPPSRFSEA
                           ++  +E +PS + +                                                 TRDR +R    P R+  A
Subjt:  -----------------FMLQTHTEDKPSYEPN-------------------------------------------------TRDRQRRTIVPPSRFSEA

Query:  DCISLALNVADSLNIEEPSSFDEAVNGPNARDWIEAMNEEMKSLEENFTWTLKPLPNGYKPITSKWIFKIKEGITGVLKPRFKARLVAKGFTQKEGIDYN
        D ++ AL  A      EP +F+EA+   + + W +AM EE+ SL +N TW+L P P   K I SKWI+KIK G  G  KPR+KARLVAKG+TQKEG+D++
Subjt:  DCISLALNVADSLNIEEPSSFDEAVNGPNARDWIEAMNEEMKSLEENFTWTLKPLPNGYKPITSKWIFKIKEGITGVLKPRFKARLVAKGFTQKEGIDYN

Query:  K
        +
Subjt:  K

SwissProt top hits (e value, % identity, alignment)
P04146 Copia protein | e value: 1.4e-17 | % identity: 23.14
Query:  AMLSEKFWVEAASYTVYTLNRCPHTSI--NFLTPEEKWSGKPPKLQHLKVFGCTGFIH--QNQGKLKARAAKCMFLGFTEGVKGYRMWHPVEKRCVNSRD
        A L + FW EA     Y +NR P  ++  +  TP E W  K P L+HL+VFG T ++H    QGK   ++ K +F+G+     G+++W  V ++ + +RD
Subjt:  AMLSEKFWVEAASYTVYTLNRCPHTSI--NFLTPEEKWSGKPPKLQHLKVFGCTGFIH--QNQGKLKARAAKCMFLGFTEGVKGYRMWHPVEKRCVNSRD

Query:  VIFREQDMF-----------------------------------------------------------------MLQTH---------------------
        V+  E +M                                                                  ++QT                      
Subjt:  VIFREQDMF-----------------------------------------------------------------MLQTH---------------------

Query:  ----------------TEDKPSYEPN----------------------------TRDRQRRTIVPPSRFSEAD-CISLALNVADSLNIEEPSSFDEAVNG
                         E K S  PN                             R  +R    P   ++E D  ++  +  A ++  + P+SFDE    
Subjt:  ----------------TEDKPSYEPN----------------------------TRDRQRRTIVPPSRFSEAD-CISLALNVADSLNIEEPSSFDEAVNG

Query:  PNARDWIEAMNEEMKSLEENFTWTLKPLPNGYKPITSKWIFKIKEGITGVLKP-RFKARLVAKGFTQKEGIDYNKS
         +   W EA+N E+ + + N TWT+   P     + S+W+F +K    G   P R+KARLVA+GFTQK  IDY ++
Subjt:  PNARDWIEAMNEEMKSLEENFTWTLKPLPNGYKPITSKWIFKIKEGITGVLKP-RFKARLVAKGFTQKEGIDYNKS

P04146 Copia protein | e value: 2.4e-06 | % identity: 21.9
Query:  FKKMISEFKNMNEKVGEENEAFVLLNSLPKAYKEVKNALK-YGIDNLTTEIIISAIKTKVLEILSQKNET-----------NEGLFVKGKSKGRENKHQ-
        F ++ISE      K+ E ++   LL +LP  Y  +  A++    +NLT   + + +  + ++I +  N+T           N   +     K R  K + 
Subjt:  FKKMISEFKNMNEKVGEENEAFVLLNSLPKAYKEVKNALK-YGIDNLTTEIIISAIKTKVLEILSQKNET-----------NEGLFVKGKSKGRENKHQ-

Query:  -IEEKNKAKIRCNYCHKEGHLKRDCYSLKRKNQNQRYKKNKQPEASVCENSITYSDALATSYQCSQDQSSTEKHDWVIDSGCSFHMTPSKGWFNTYLE--
          +  +K K++C++C +EGH+K+DC+  KR   N+  +  KQ + +        S  +A   +   + S  +   +V+DSG S H+   +  +   +E  
Subjt:  -IEEKNKAKIRCNYCHKEGHLKRDCYSLKRKNQNQRYKKNKQPEASVCENSITYSDALATSYQCSQDQSSTEKHDWVIDSGCSFHMTPSKGWFNTYLE--

Query:  --------WDGGIVYMGNNNTCRVNGIRSVSLKLKDGSTKLLRNVRHVPNLKRNLISL------------GCLTQLAARMGEVVDVEMIQHALVVSDKSS
                  G  +Y       R+     ++L+      +   N+  V  L+   +S+            G +    + M   V V   Q A  ++ K  
Subjt:  --------WDGGIVYMGNNNTCRVNGIRSVSLKLKDGSTKLLRNVRHVPNLKRNLISL------------GCLTQLAARMGEVVDVEMIQHALVVSDKSS

Query:  TESDLWHKRMSHISE
            LWH+R  HIS+
Subjt:  TESDLWHKRMSHISE

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94 | e value: 1.2e-45 | % identity: 22.99
Query:  FKKMISEFKNMNEKVGEENEAFVLLNSLPKAYKEVKNALKYGIDNLTTEIIISAIKTKVLEILSQKNETNEGLFVKGKSKGRENKHQI----------EE
        F  +I++  N+  K+ EE++A +LLNSLP +Y  +   + +G   +  + + SA+   +L    +K   N+G  +  + +GR  +             + 
Subjt:  FKKMISEFKNMNEKVGEENEAFVLLNSLPKAYKEVKNALKYGIDNLTTEIIISAIKTKVLEILSQKNETNEGLFVKGKSKGRENKHQI----------EE

Query:  KNKAKIR---CNYCHKEGHLKRDCYSLKRKNQNQRYKKNKQPEASVCENSITYSDALATSYQCSQDQSSTEKHDWVIDSGCSFHMTPSKGWFNTYLEWDG
        KN++K R   C  C++ GH KRDC + ++       +KN    A++ +N+      +    +C     S  + +WV+D+  S H TP +  F  Y+  D 
Subjt:  KNKAKIR---CNYCHKEGHLKRDCYSLKRKNQNQRYKKNKQPEASVCENSITYSDALATSYQCSQDQSSTEKHDWVIDSGCSFHMTPSKGWFNTYLEWDG

Query:  GIVYMGNNNTCRVNGIRSVSLKLKDGSTKLLRNVRHVPNLKRNLISLGCLTQ------------------LAARMGEV------VDVEMIQHALVVSDKS
        G V MGN +  ++ GI  + +K   G T +L++VRHVP+L+ NLIS   L +                  L    G         + E+ Q  L  + + 
Subjt:  GIVYMGNNNTCRVNGIRSVSLKLKDGSTKLLRNVRHVPNLKRNLISLGCLTQ------------------LAARMGEV------VDVEMIQHALVVSDKS

Query:  STESDLWHKRMSHISEKD----------------------------------------------------------------------------------
            DLWHKRM H+SEK                                                                                   
Subjt:  STESDLWHKRMSHISEKD----------------------------------------------------------------------------------

Query:  ------------------------------------------------------------------------------------AMLSEKFWVEAASYTV
                                                                                            A L + FW EA     
Subjt:  ------------------------------------------------------------------------------------AMLSEKFWVEAASYTV

Query:  YTLNRCPHTSINFLTPEEKWSGKPPKLQHLKVFGCTGFIH---QNQGKLKARAAKCMFLGFTEGVKGYRMWHPVEKRCVNSRDVIFREQDM---------
        Y +NR P   + F  PE  W+ K     HLKVFGC  F H   + + KL  ++  C+F+G+ +   GYR+W PV+K+ + SRDV+FRE ++         
Subjt:  YTLNRCPHTSINFLTPEEKWSGKPPKLQHLKVFGCTGFIH---QNQGKLKARAAKCMFLGFTEGVKGYRMWHPVEKRCVNSRDVIFREQDM---------

Query:  --------FMLQTHTEDKPSYEPNTRD----------------------------------------RQRRTIVPPSRFSEADCISLALNVADSLNIEEP
                F+    T + P+   +T D                                        R  R  V   R+   + + ++       +  EP
Subjt:  --------FMLQTHTEDKPSYEPNTRD----------------------------------------RQRRTIVPPSRFSEADCISLALNVADSLNIEEP

Query:  SSFDEAVNGPNARDWIEAMNEEMKSLEENFTWTLKPLPNGYKPITSKWIFKIKEGITGVLKPRFKARLVAKGFTQKEGIDYNK
         S  E ++ P     ++AM EEM+SL++N T+ L  LP G +P+  KW+FK+K+     L  R+KARLV KGF QK+GID+++
Subjt:  SSFDEAVNGPNARDWIEAMNEEMKSLEENFTWTLKPLPNGYKPITSKWIFKIKEGITGVLKPRFKARLVAKGFTQKEGIDYNK

P92512 Uncharacterized mitochondrial protein AtMg00710 | e value: 5.2e-09 | % identity: 42.42
Query:  LSEKFWVEAASYTVYTLNRCPHTSINFLTPEEKWSGKPPKLQHLKVFGCTGFIHQNQGKLKARAAK
        L + F  +AA+  V+ +N+ P T+INF  P+E W    P   +L+ FGC  +IH ++GKLK RA K
Subjt:  LSEKFWVEAASYTVYTLNRCPHTSINFLTPEEKWSGKPPKLQHLKVFGCTGFIHQNQGKLKARAAK

P92520 Uncharacterized mitochondrial protein AtMg00820 | e value: 2.3e-09 | % identity: 46.34
Query:  EEPSSFDEAVNGPNARDWIEAMNEEMKSLEENFTWTLKPLPNGYKPITSKWIFKIKEGITGVLKPRFKARLVAKGFTQKEGI
        +EP S   A+  P    W +AM EE+ +L  N TW L P P     +  KW+FK K    G L  R KARLVAKGF Q+EGI
Subjt:  EEPSSFDEAVNGPNARDWIEAMNEEMKSLEENFTWTLKPLPNGYKPITSKWIFKIKEGITGVLKPRFKARLVAKGFTQKEGI

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2 | e value: 4.0e-09 | % identity: 30.23
Query:  AMLSEKFWVEAASYTVYTLNRCPHTSINFLTPEEKWSGKPPKLQHLKVFGCTGF---IHQNQGKLKARAAKCMFLGFTEGVKGYRMWHPVEKRCVNSRDV
        A + + +W  A S  VY +NR P   +   +P +K  G+PP  + LKVFGC  +      N+ KL+ ++ +C F+G++     Y   H    R   SR V
Subjt:  AMLSEKFWVEAASYTVYTLNRCPHTSINFLTPEEKWSGKPPKLQHLKVFGCTGF---IHQNQGKLKARAAKCMFLGFTEGVKGYRMWHPVEKRCVNSRDV

Query:  IFREQ------DMFMLQTHTEDKPSYEPN
         F E+        F + T  E +    PN
Subjt:  IFREQ------DMFMLQTHTEDKPSYEPN

Arabidopsis top hits    e value    %identity    Alignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 8    1.6e-13    45.24
Query:  EEPSSFDEAVNGPNARDWIEAMNEEMKSLEENFTWTLKPLPNGYKPITSKWIFKIKEGITGVLKPRFKARLVAKGFTQKEGIDY
        +EPS+++EA        W  AM++E+ ++E   TW +  LP   KPI  KW++KIK    G ++ R+KARLVAKG+TQ+EGID+
Subjt:  EEPSSFDEAVNGPNARDWIEAMNEEMKSLEENFTWTLKPLPNGYKPITSKWIFKIKEGITGVLKPRFKARLVAKGFTQKEGIDY

ATMG00710.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein    3.7e-10    42.42
Query:  LSEKFWVEAASYTVYTLNRCPHTSINFLTPEEKWSGKPPKLQHLKVFGCTGFIHQNQGKLKARAAK
        L + F  +AA+  V+ +N+ P T+INF  P+E W    P   +L+ FGC  +IH ++GKLK RA K
Subjt:  LSEKFWVEAASYTVYTLNRCPHTSINFLTPEEKWSGKPPKLQHLKVFGCTGFIHQNQGKLKARAAK

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)    1.6e-10    46.34
Query:  EEPSSFDEAVNGPNARDWIEAMNEEMKSLEENFTWTLKPLPNGYKPITSKWIFKIKEGITGVLKPRFKARLVAKGFTQKEGI
        +EP S   A+  P    W +AM EE+ +L  N TW L P P     +  KW+FK K    G L  R KARLVAKGF Q+EGI
Subjt:  EEPSSFDEAVNGPNARDWIEAMNEEMKSLEENFTWTLKPLPNGYKPITSKWIFKIKEGITGVLKPRFKARLVAKGFTQKEGI


Sequences
CDS sequence
ATGCCGGCGCCGGTAACGCCCTTGTCTCTCTCAGATTTCTCTCTCTCTCGTTTACTCCCTCGGTCTCATGCTCGATTTTCTCTCTCTGCCGTCTCAGCTCTCTCTCTCTT
CTGTCTATGTACCGTCGTCGCCGTGAATTGCCGCCTCAGACCCGTCATCGCCGTCGTCGATGATTTTAAGAAGATGATTTCAGAGTTCAAGAACATGAATGAAAAAGTTG
GTGAAGAAAATGAAGCCTTTGTATTGCTAAACTCCCTTCCAAAAGCATACAAGGAAGTAAAAAATGCTTTGAAATATGGTATAGATAATCTTACCACAGAAATCATAATA
TCAGCTATCAAGACCAAAGTGCTGGAGATTTTGTCTCAAAAGAACGAGACAAATGAAGGCCTTTTTGTCAAGGGAAAATCCAAAGGCAGAGAAAACAAACATCAAATAGA
AGAAAAGAATAAAGCAAAAATCAGATGTAATTACTGCCATAAAGAAGGCCATCTCAAAAGGGACTGCTACTCACTTAAAAGAAAGAATCAAAATCAGAGGTACAAAAAGA
ATAAGCAGCCAGAAGCGTCAGTGTGTGAAAATTCAATTACATACTCTGATGCATTGGCTACTTCATATCAGTGCAGTCAAGATCAGTCATCCACTGAAAAACACGATTGG
GTAATTGACTCGGGTTGTTCATTCCATATGACACCCTCCAAAGGTTGGTTTAACACTTACCTAGAGTGGGATGGAGGGATAGTTTATATGGGAAACAACAATACTTGCAG
GGTAAATGGAATCAGATCAGTTTCTCTGAAATTGAAAGATGGCTCTACTAAACTCTTGCGAAATGTGAGACATGTACCGAACCTAAAGAGGAATTTAATCTCTTTAGGAT
GCTTGACTCAATTGGCTGCACGTATGGGGGAAGTGGTGGATGTAGAGATGATTCAACATGCCCTTGTAGTCTCAGACAAAAGCTCAACCGAGAGTGATCTTTGGCACAAA
AGAATGTCACACATCAGTGAAAAAGACGCAATGCTTTCTGAAAAATTCTGGGTAGAGGCTGCATCTTACACTGTTTACACCTTGAATAGGTGCCCTCACACCTCCATCAA
CTTTCTAACACCAGAGGAGAAATGGTCAGGTAAACCCCCTAAACTTCAACACCTTAAAGTTTTTGGTTGTACAGGATTCATACACCAAAACCAAGGGAAGCTGAAAGCAA
GGGCTGCTAAGTGTATGTTCCTAGGCTTTACCGAAGGGGTAAAAGGGTATAGAATGTGGCATCCAGTTGAGAAGAGATGTGTCAATAGTAGAGATGTAATCTTCAGGGAA
CAAGATATGTTTATGCTCCAAACTCATACAGAAGACAAACCATCTTATGAACCTAATACAAGAGACAGACAAAGGAGAACTATAGTTCCTCCTTCAAGGTTTAGTGAGGC
CGACTGCATTTCATTAGCCTTAAATGTTGCAGACTCACTAAACATTGAAGAACCTAGCAGTTTTGATGAAGCAGTAAATGGCCCAAATGCTAGAGACTGGATTGAGGCCA
TGAACGAAGAGATGAAATCCCTTGAAGAAAATTTCACTTGGACACTGAAGCCTCTTCCAAACGGTTACAAGCCTATAACATCCAAATGGATCTTCAAAATTAAGGAAGGG
ATAACTGGAGTGTTGAAGCCTAGATTCAAAGCAAGGCTTGTGGCAAAAGGATTCACACAGAAAGAGGGCATTGATTACAACAAATCTTCTCTCTAG
mRNA sequence
ATGCCGGCGCCGGTAACGCCCTTGTCTCTCTCAGATTTCTCTCTCTCTCGTTTACTCCCTCGGTCTCATGCTCGATTTTCTCTCTCTGCCGTCTCAGCTCTCTCTCTCTT
CTGTCTATGTACCGTCGTCGCCGTGAATTGCCGCCTCAGACCCGTCATCGCCGTCGTCGATGATTTTAAGAAGATGATTTCAGAGTTCAAGAACATGAATGAAAAAGTTG
GTGAAGAAAATGAAGCCTTTGTATTGCTAAACTCCCTTCCAAAAGCATACAAGGAAGTAAAAAATGCTTTGAAATATGGTATAGATAATCTTACCACAGAAATCATAATA
TCAGCTATCAAGACCAAAGTGCTGGAGATTTTGTCTCAAAAGAACGAGACAAATGAAGGCCTTTTTGTCAAGGGAAAATCCAAAGGCAGAGAAAACAAACATCAAATAGA
AGAAAAGAATAAAGCAAAAATCAGATGTAATTACTGCCATAAAGAAGGCCATCTCAAAAGGGACTGCTACTCACTTAAAAGAAAGAATCAAAATCAGAGGTACAAAAAGA
ATAAGCAGCCAGAAGCGTCAGTGTGTGAAAATTCAATTACATACTCTGATGCATTGGCTACTTCATATCAGTGCAGTCAAGATCAGTCATCCACTGAAAAACACGATTGG
GTAATTGACTCGGGTTGTTCATTCCATATGACACCCTCCAAAGGTTGGTTTAACACTTACCTAGAGTGGGATGGAGGGATAGTTTATATGGGAAACAACAATACTTGCAG
GGTAAATGGAATCAGATCAGTTTCTCTGAAATTGAAAGATGGCTCTACTAAACTCTTGCGAAATGTGAGACATGTACCGAACCTAAAGAGGAATTTAATCTCTTTAGGAT
GCTTGACTCAATTGGCTGCACGTATGGGGGAAGTGGTGGATGTAGAGATGATTCAACATGCCCTTGTAGTCTCAGACAAAAGCTCAACCGAGAGTGATCTTTGGCACAAA
AGAATGTCACACATCAGTGAAAAAGACGCAATGCTTTCTGAAAAATTCTGGGTAGAGGCTGCATCTTACACTGTTTACACCTTGAATAGGTGCCCTCACACCTCCATCAA
CTTTCTAACACCAGAGGAGAAATGGTCAGGTAAACCCCCTAAACTTCAACACCTTAAAGTTTTTGGTTGTACAGGATTCATACACCAAAACCAAGGGAAGCTGAAAGCAA
GGGCTGCTAAGTGTATGTTCCTAGGCTTTACCGAAGGGGTAAAAGGGTATAGAATGTGGCATCCAGTTGAGAAGAGATGTGTCAATAGTAGAGATGTAATCTTCAGGGAA
CAAGATATGTTTATGCTCCAAACTCATACAGAAGACAAACCATCTTATGAACCTAATACAAGAGACAGACAAAGGAGAACTATAGTTCCTCCTTCAAGGTTTAGTGAGGC
CGACTGCATTTCATTAGCCTTAAATGTTGCAGACTCACTAAACATTGAAGAACCTAGCAGTTTTGATGAAGCAGTAAATGGCCCAAATGCTAGAGACTGGATTGAGGCCA
TGAACGAAGAGATGAAATCCCTTGAAGAAAATTTCACTTGGACACTGAAGCCTCTTCCAAACGGTTACAAGCCTATAACATCCAAATGGATCTTCAAAATTAAGGAAGGG
ATAACTGGAGTGTTGAAGCCTAGATTCAAAGCAAGGCTTGTGGCAAAAGGATTCACACAGAAAGAGGGCATTGATTACAACAAATCTTCTCTCTAG
Protein sequence
MPAPVTPLSLSDFSLSRLLPRSHARFSLSAVSALSLFCLCTVVAVNCRLRPVIAVVDDFKKMISEFKNMNEKVGEENEAFVLLNSLPKAYKEVKNALKYGIDNLTTEIII
SAIKTKVLEILSQKNETNEGLFVKGKSKGRENKHQIEEKNKAKIRCNYCHKEGHLKRDCYSLKRKNQNQRYKKNKQPEASVCENSITYSDALATSYQCSQDQSSTEKHDW
VIDSGCSFHMTPSKGWFNTYLEWDGGIVYMGNNNTCRVNGIRSVSLKLKDGSTKLLRNVRHVPNLKRNLISLGCLTQLAARMGEVVDVEMIQHALVVSDKSSTESDLWHK
RMSHISEKDAMLSEKFWVEAASYTVYTLNRCPHTSINFLTPEEKWSGKPPKLQHLKVFGCTGFIHQNQGKLKARAAKCMFLGFTEGVKGYRMWHPVEKRCVNSRDVIFRE
QDMFMLQTHTEDKPSYEPNTRDRQRRTIVPPSRFSEADCISLALNVADSLNIEEPSSFDEAVNGPNARDWIEAMNEEMKSLEENFTWTLKPLPNGYKPITSKWIFKIKEG
ITGVLKPRFKARLVAKGFTQKEGIDYNKSSL