; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0018228 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0018228
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationchr5:19784211..19794616
RNA-Seq ExpressionLag0018228
SyntenyLag0018228
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR032675 - Leucine-rich repeat domain superfamily
IPR036397 - Ribonuclease H superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAE8700517.1 hypothetical protein F3Y22_tig00110556pilonHSYRG00215 [Hibiscus syriacus]4.4e-8129.96Show/hide
Query:  EVERFDGNGDFGLWKIKMNVVLTQQKVDIAIS--KIIPKTVTETELKEMDKIAHSLIILHLADNVFRKVSGE----------------------------
        ++E+F G  DFGLW++KM  +L QQ +  A+     +P  + E E   + + AHS IIL L D   R+VS E                            
Subjt:  EVERFDGNGDFGLWKIKMNVVLTQQKVDIAIS--KIIPKTVTETELKEMDKIAHSLIILHLADNVFRKVSGE----------------------------

Query:  ----------------DTF---------------DENQSIILLNSLSDSYKDVKAAI-------------------------------------------
                        D F               DE+++++LLNS+  SY+  K A+                                           
Subjt:  ----------------DTF---------------DENQSIILLNSLSDSYKDVKAAI-------------------------------------------

Query:  ------------------------------------KGNIEK----------------GEVSGVNLGEACDTAEVLMVSENQEDEAWILDSGCSFHMTPH
                                            +G+ +K                GEVS V+  +  D+ E L+++E++  + WILDSGCSFHM PH
Subjt:  ------------------------------------KGNIEK----------------GEVSGVNLGEACDTAEVLMVSENQEDEAWILDSGCSFHMTPH

Query:  KNWLEDFQEMSGGKVLLGNNQHCEVEGQGTVRIK------------TSDGAVRLLTGV------------------------------------------
        K+W E  Q +SGG VLLG+N+ C V G GT+RI+             + G++  + G+                                          
Subjt:  KNWLEDFQEMSGGKVLLGNNQHCEVEGQGTVRIK------------TSDGAVRLLTGV------------------------------------------

Query:  -----------------------------------------------------------------------SSRLKKLRTDNGLEFCNHEFNNFCKKKGI
                                                                                ++LK LRTDNGLE+ + +FN  C+K GI
Subjt:  -----------------------------------------------------------------------SSRLKKLRTDNGLEFCNHEFNNFCKKKGI

Query:  QRRLTVVGTPQQNGVAERMNRTLMEKVRCMLFNAQLPKSFWGEAVVTTCYIVNRSPSTAIGSYAHSNIGKLEPRAKKCIFLGYPEGIKGYKLWCLEPDSA
         R  TV  TPQQNG+AERMNRTL+E+VRCML NA LPKSFWGEAV T CY++NR               KLE RA +CIF+GYP G KGYKLWC+EP   
Subjt:  QRRLTVVGTPQQNGVAERMNRTLMEKVRCMLFNAQLPKSFWGEAVVTTCYIVNRSPSTAIGSYAHSNIGKLEPRAKKCIFLGYPEGIKGYKLWCLEPDSA

Query:  KTIISRDVVFDESKMGYVENTTESNPIENPKIPLEEEVVQ----------------EDEEEQDTQD---LTDYQLSRDRIRRT-----------------
        K IISRDVVFDESKM  + +    N +    IP+E E+ Q                E E +Q+T+    L DY L RDR RRT                 
Subjt:  KTIISRDVVFDESKMGYVENTTESNPIENPKIPLEEEVVQ----------------EDEEEQDTQD---LTDYQLSRDRIRRT-----------------

Query:  -----------------------------------------TGHLLVRHTSIRILLAFVAHFDIELEQMDVKTTFLHGELDEMIYMDQPLGFVVPGQENK
                                                 T  L+ R  + R++ + VA +D+ELEQ+DVKT FLHGEL+E IYMDQP G+     ++K
Subjt:  -----------------------------------------TGHLLVRHTSIRILLAFVAHFDIELEQMDVKTTFLHGELDEMIYMDQPLGFVVPGQENK

Query:  VCLLMKSLYGLKQSSRQWYKR
        VCLL KSLYGLKQS RQWYKR
Subjt:  VCLLMKSLYGLKQSSRQWYKR

KAE8700517.1 hypothetical protein F3Y22_tig00110556pilonHSYRG00215 [Hibiscus syriacus]5.8e-0951.81Show/hide
Query:  LGFVVPGQEN---KVCLLMKSLYGLKQSSRQWYKRSLTGYVFKFLDCTISWKTNLQSVVALSTTESEYIALTEAIKESIWLKG
        LG +   QEN   KV   + S Y     +R    +SLTG++F      +SWK+NLQSVVALSTTE+EYIA+TEAIKE++WL+G
Subjt:  LGFVVPGQEN---KVCLLMKSLYGLKQSSRQWYKRSLTGYVFKFLDCTISWKTNLQSVVALSTTESEYIALTEAIKESIWLKG

KAE8700517.1 hypothetical protein F3Y22_tig00110556pilonHSYRG00215 [Hibiscus syriacus]6.3e-8032.09Show/hide
Query:  GTHLEVERFDGNGDFGLWKIKMNVVLTQQKVDIAISKIIPKTVTETELKEMDKIAHSLIILHLADNVFRKVSGEDTF----------------------D
        G   ++E+FDG GDFGLW++KM  +L Q + + A+  ++P  +      E++K AHS +IL L + V R+V+GE T                       D
Subjt:  GTHLEVERFDGNGDFGLWKIKMNVVLTQQKVDIAISKIIPKTVTETELKEMDKIAHSLIILHLADNVFRKVSGEDTF----------------------D

Query:  ENQSIILLNSLSDSY-------------------------KDVKAAIKGNIEKGEVSGVNLGEAC-------------------------DTAEVLMVSE
        E+ ++ LL SL   Y                         K++K   K   + GE    +L   C                         D +EV+MV  
Subjt:  ENQSIILLNSLSDSY-------------------------KDVKAAIKGNIEKGEVSGVNLGEAC-------------------------DTAEVLMVSE

Query:  NQEDEAWILDSGCSFHMTPHKNWLEDFQEMSGGKVLLGNNQHCEVEGQGTVRIKTSDG------------------------------------------
         +    WI+DSG S+HMTP  +   DF E  G +VLLG+N+ C++ G G VR++  DG                                          
Subjt:  NQEDEAWILDSGCSFHMTPHKNWLEDFQEMSGGKVLLGNNQHCEVEGQGTVRIKTSDG------------------------------------------

Query:  -------------------------------------------------------------------------AVRLLTGVSS-----------------
                                                                                 + RL+  VS                  
Subjt:  -------------------------------------------------------------------------AVRLLTGVSS-----------------

Query:  ----------RLKKLRTDNGLEFCNHEFNNFCKKKGIQRRLTVVGTPQQNGVAERMNRTLMEKVRCMLFNAQLPKSFWGEAVVTTCYIVNRSPSTAIG--
                   +KKLRTDNGLEFCN EF   C + GI R L V GTPQQNG+ ERMNRTLM+KVRC+L  + LPK+FW EA  T  Y++NRSPSTAI   
Subjt:  ----------RLKKLRTDNGLEFCNHEFNNFCKKKGIQRRLTVVGTPQQNGVAERMNRTLMEKVRCMLFNAQLPKSFWGEAVVTTCYIVNRSPSTAIG--

Query:  ----------------------SYAHSNIGKLEPRAKKCIFLGYPEGIKGYKLWCLEPDSAKTIISRDVVFDESKMGYVENTTESNPI-----ENPKIPL
                              +Y+H   GKLEPRA KC+ LGYPEG+KGY+L+ L+ +S K + SR+VVF+ES M Y +   +S  +        K  +
Subjt:  ----------------------SYAHSNIGKLEPRAKKCIFLGYPEGIKGYKLWCLEPDSAKTIISRDVVFDESKMGYVENTTESNPI-----ENPKIPL

Query:  EEEVVQEDEEEQDTQDLTDYQ-------LSRDRIRRT------TGHLLVRHTSIRILLAFVAHFDIELEQMDVKTTFLHGELDEMIYMDQPLGFVVPGQE
        +EE+  +   +  T +L D+Q       ++R   +R           +VRHTSIR++LA  A  D ELEQ+DVKT FLHG ++E+IYM QP G+    Q 
Subjt:  EEEVVQEDEEEQDTQDLTDYQ-------LSRDRIRRT------TGHLLVRHTSIRILLAFVAHFDIELEQMDVKTTFLHGELDEMIYMDQPLGFVVPGQE

Query:  NKVCLLMKSLYGLKQSSRQWYKR
        NKVCLL KSLYGLKQS RQWY+R
Subjt:  NKVCLLMKSLYGLKQSSRQWYKR

KAG8473450.1 hypothetical protein CXB51_035780 [Gossypium anomalum]7.0e-7931.97Show/hide
Query:  EVERFDGNGDFGLWKIKMNVVLTQQKVDIAISKI--IPKTVTETELKEMDKIAHSLIILHLA------DNVFRKVSGEDT--------------------
        E+   D N  F LW+IKM  VL Q  ++ A+  I  +P T+T+ E K  D+      I+  A      +N  ++V+ E                      
Subjt:  EVERFDGNGDFGLWKIKMNVVLTQQKVDIAISKI--IPKTVTETELKEMDKIAHSLIILHLA------DNVFRKVSGEDT--------------------

Query:  ------------FD-ENQSIILLNSLSDSYKDVKAAI----------------------------KGNIEKGEVSGVNLG-------EACDTAEVLMVSE
                    +D E+  +ILL SL  SY   K  I                            +G  ++    G + G       E     E+L+ S 
Subjt:  ------------FD-ENQSIILLNSLSDSYKDVKAAI----------------------------KGNIEKGEVSGVNLG-------EACDTAEVLMVSE

Query:  NQE--DEAWILDSGCSFHMTPHKNWLEDFQEMSGGKVLLGNNQHCEVEGQGTVRIKTSDGAVRLLTGVSS------------------------------
        N     E WILDSGC+FHM+P+++W   ++ +S G VL+GNN  C++ G GT+++K  DG VR L+  S+                              
Subjt:  NQE--DEAWILDSGCSFHMTPHKNWLEDFQEMSGGKVLLGNNQHCEVEGQGTVRIKTSDGAVRLLTGVSS------------------------------

Query:  -------------------------------------------------------------------------------------RLKKLRTDNGLEFCN
                                                                                             ++K LRTDNGLEFC+
Subjt:  -------------------------------------------------------------------------------------RLKKLRTDNGLEFCN

Query:  HEFNNFCKKKGIQRRLTVVGTPQQNGVAERMNRTLMEKVRCMLFNAQLPKSFWGEAVVTTCYIVNRSPSTAI------------------------GSYA
         EFN  CK +GI R LTV  TPQQNGVAERMNRT+MEKVRCML N+ LPKSFW EA  T C+++NRSPS AI                         +Y 
Subjt:  HEFNNFCKKKGIQRRLTVVGTPQQNGVAERMNRTLMEKVRCMLFNAQLPKSFWGEAVVTTCYIVNRSPSTAI------------------------GSYA

Query:  HSNIGKLEPRAKKCIFLGYPEGIKGYKLWCLEPDSAKTIISRDVVFDESKM------------------------------------GYVENTTESNPIE
        H N GKLEPR+ KC+FLGY  G+KGYKLWC  P++ K +ISRDVVFDE+ M                                     Y  N  E     
Subjt:  HSNIGKLEPRAKKCIFLGYPEGIKGYKLWCLEPDSAKTIISRDVVFDESKM------------------------------------GYVENTTESNPIE

Query:  NPKIPLEEEVVQEDEEE-----QDTQDLTDYQLSRDRIRRTTGHLL--------------VRHTSIRILLAFVAHFDIELEQMDVKTTFLHGELDEMIYM
               E +  ED E+     Q+  +      + D ++   G                 V+H SIR LL  VA  D+ELEQ+DVKT FLHGEL+E IYM
Subjt:  NPKIPLEEEVVQEDEEE-----QDTQDLTDYQLSRDRIRRTTGHLL--------------VRHTSIRILLAFVAHFDIELEQMDVKTTFLHGELDEMIYM

Query:  DQPLGFVVPGQENKVCLLMKSLYGLKQSSRQWYKR
         QP GF V  +E+ VCLL KSLYGLKQS RQWYKR
Subjt:  DQPLGFVVPGQENKVCLLMKSLYGLKQSSRQWYKR

KAG8481491.1 hypothetical protein CXB51_026341 [Gossypium anomalum]7.0e-7930.91Show/hide
Query:  EVERFDGNGDFGLWKIKMNVVLTQQKVDIAISKI--IPKTVTETELKEMDKIAHSLIILHLADNVFRKVSGEDT--------------------------
        E+   D N  F LW+IKM  VL Q  ++ A+  I  +P T+T+ E K  D+ A + + LHL++ + + V  E T                          
Subjt:  EVERFDGNGDFGLWKIKMNVVLTQQKVDIAISKI--IPKTVTETELKEMDKIAHSLIILHLADNVFRKVSGEDT--------------------------

Query:  --------------------------------FD-ENQSIILLNSLSDSYKDVKAAI---------------------------------KGNI------
                                        +D E+  +ILL SL  SY   +  I                                 +G I      
Subjt:  --------------------------------FD-ENQSIILLNSLSDSYKDVKAAI---------------------------------KGNI------

Query:  -----EKGEVSGVNLGEACDTAEVLMVSENQE--DEAWILDSGCSFHMTPHKNWLEDFQEMSGGKVLLGNNQHCEVEGQGTVRIKTSDGAVRLLTGV---
             ++G     N  E     E+L+ S N     E WILDSGC+FHM+P+ +W   ++ +S G +L+GNN  C++ G GT+++K  DG VR L+ V   
Subjt:  -----EKGEVSGVNLGEACDTAEVLMVSENQE--DEAWILDSGCSFHMTPHKNWLEDFQEMSGGKVLLGNNQHCEVEGQGTVRIKTSDGAVRLLTGV---

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ---------SSRLKKLRTDNGLEFCNHEFNNFCKKKGIQRRLTVVGTPQQNGVAERMNRTLMEKVRCMLFNAQLPKSFWGEAVVTTCYIVNRSPSTAI--
                   ++K LRTDNGLEFC+ EFN  CK +GI R LTV  TPQQNGVAERMNRT+MEKVRCML NA LPKSFW EA  T C+++NRSPS AI  
Subjt:  ---------SSRLKKLRTDNGLEFCNHEFNNFCKKKGIQRRLTVVGTPQQNGVAERMNRTLMEKVRCMLFNAQLPKSFWGEAVVTTCYIVNRSPSTAI--

Query:  ---------GSYAHSNIGKLEPRAKKCIFLGYPEGIKGYKLWCLEPDSAKTIISRDVVFDESKMGYVENTTESNPIENPKIPLEEEVVQEDEEEQDTQ--
                  +YAH N GKLEPR+ KC+FLGY  G+KGYKLWC  P++ K +ISRDVVFDE+ M    +  +S+  EN K  +E ++  E   +  T+  
Subjt:  ---------GSYAHSNIGKLEPRAKKCIFLGYPEGIKGYKLWCLEPDSAKTIISRDVVFDESKMGYVENTTESNPIENPKIPLEEEVVQEDEEEQDTQ--

Query:  ----DLTDYQLSRDRIRR------------------------------------------------------TTG-------------------------
                Y ++++R +R                                                      T G                         
Subjt:  ----DLTDYQLSRDRIRR------------------------------------------------------TTG-------------------------

Query:  --HLLVRHTSIRILLAFVAHFDIELEQMDVKTTFLHGELDEMIYMDQPLGFVVPGQENKVCLLMKSLYGLKQSSRQWYKR---SLTGYVFK---FLDCTI
            +V+H+SIR LL  VA  D+ELEQ+DVKT FLHGEL+E IYM QP GF V  +EN VCLL KSLYGLKQS RQWYKR    +T + FK   F  C  
Subjt:  --HLLVRHTSIRILLAFVAHFDIELEQMDVKTTFLHGELDEMIYMDQPLGFVVPGQENKVCLLMKSLYGLKQSSRQWYKR---SLTGYVFK---FLDCTI

Query:  SWKTNLQSVVAL
          K N  S V L
Subjt:  SWKTNLQSVVAL

RZB42800.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Glycine soja]3.5e-7829.23Show/hide
Query:  EVERFDGNGDFGLWKIKMNVVLTQQKVDIAI--SKIIPKTVTETELKEMDKIAHSLIILHLADNVFRKVSGEDTF-------------------------
        EVE+F    DFGLW++KM  +L QQ +  A+     + K + + + K + + AHS IIL L D V R+VS E T                          
Subjt:  EVERFDGNGDFGLWKIKMNVVLTQQKVDIAI--SKIIPKTVTETELKEMDKIAHSLIILHLADNVFRKVSGEDTF-------------------------

Query:  ---------DENQSIILLNSLSDSYK----------------DVKAAI----------------------------------------------KGNI--
                 DE+Q+++LL SL  SY                 +V+AA+                                              +GNI  
Subjt:  ---------DENQSIILLNSLSDSYK----------------DVKAAI----------------------------------------------KGNI--

Query:  -----------------EKGEVSGVN------------LGEACDTAEVLMVSENQEDEAWILDSGCSFHMTPHKNWLEDFQEMSGGKVLLGNNQHCEVEG
                         E+ +  G N              +  ++AE LMVSE   +  WI+DSGCS+HMTP+++W E F + + G VLLG+N+ C++EG
Subjt:  -----------------EKGEVSGVN------------LGEACDTAEVLMVSENQEDEAWILDSGCSFHMTPHKNWLEDFQEMSGGKVLLGNNQHCEVEG

Query:  QGTVRIKTSDGAVRLLTGV---------------------------------------------------------------------------------
         G++R K  DGA R+LT V                                                                                 
Subjt:  QGTVRIKTSDGAVRLLTGV---------------------------------------------------------------------------------

Query:  --------------------SSRLKKLRTDNGLEFCNHEFNNFCKKKGIQRRLTVVGTPQQNGVAERMNRTLMEKVRCMLFNAQLPKSFWGEAVVTTCYI
                            + ++K+LRTDNGLEFC+  FN+FCK+    R  TV GTPQQNG+AER NRT++E+VRCML +A LPK FW EA +   Y+
Subjt:  --------------------SSRLKKLRTDNGLEFCNHEFNNFCKKKGIQRRLTVVGTPQQNGVAERMNRTLMEKVRCMLFNAQLPKSFWGEAVVTTCYI

Query:  VNRSPSTAIG------------------------SYAHSNIGKLEPRAKKCIFLGYPEGIKGYKLWCLEPDSAKTIISRDVVFDESKMGY-------VEN
        +N+ PST +                         +YAH    KLEPRA KCIFLGYPEG+KGYKLWCLE    + ++S DVVF+E++M Y         +
Subjt:  VNRSPSTAIG------------------------SYAHSNIGKLEPRAKKCIFLGYPEGIKGYKLWCLEPDSAKTIISRDVVFDESKMGY-------VEN

Query:  TTESNPIENPKIPLEEEV-------------VQEDEEEQDTQDLTDYQLSRDRIRRT-------------------------------------------
        T +S  I+  K+  E E              + E++ E++ Q+  DY L+RDRIRR                                            
Subjt:  TTESNPIENPKIPLEEEV-------------VQEDEEEQDTQDLTDYQLSRDRIRRT-------------------------------------------

Query:  ---------------TGHLL-----------------------------------------------------VRHTSIRILLAFVAHFDIELEQMDVKT
                       T  L+                                                     V+H SIRIL+A VA FD+ LEQMDVKT
Subjt:  ---------------TGHLL-----------------------------------------------------VRHTSIRILLAFVAHFDIELEQMDVKT

Query:  TFLHGELDEMIYMDQPLGFVVPGQENKVCLLMKSLYGLKQSSRQW
        TFL+G+LDE+I M QP GF V G+E+ VC L KSLYGLKQS RQW
Subjt:  TFLHGELDEMIYMDQPLGFVVPGQENKVCLLMKSLYGLKQSSRQW

RZB42800.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Glycine soja]9.3e-0766Show/hide
Query:  KRSLTGYVFKFLDCTISWKTNLQSVVALSTTESEYIALTEAIKESIWLKG
        ++SLTG+VF      ISWK  LQ VVALSTTE EYIAL EA+KES+WL+G
Subjt:  KRSLTGYVFKFLDCTISWKTNLQSVVALSTTESEYIALTEAIKESIWLKG

TrEMBL top hitse value%identityAlignment
A0A445F227 Retrovirus-related Pol polyprotein from transposon TNT 1-944.5e-0766Show/hide
Query:  KRSLTGYVFKFLDCTISWKTNLQSVVALSTTESEYIALTEAIKESIWLKG
        ++SLTG+VF      ISWK  LQ VVALSTTE EYIAL EA+KES+WL+G
Subjt:  KRSLTGYVFKFLDCTISWKTNLQSVVALSTTESEYIALTEAIKESIWLKG

A0A445F227 Retrovirus-related Pol polyprotein from transposon TNT 1-945.4e-7726.67Show/hide
Query:  THLEVERFDGNGDFGLWKIKMNVVLTQQKV-DIAISKIIPKTVTETELKEMDKIAHSLIILHLADNVFRKVS----------------------------
        T  EV +F+G+GDF LW+ K+  +L Q KV  I   + +P  +TE+E ++MD++A+S I+L+L+D V R V                             
Subjt:  THLEVERFDGNGDFGLWKIKMNVVLTQQKV-DIAISKIIPKTVTETELKEMDKIAHSLIILHLADNVFRKVS----------------------------

Query:  -------------------------------GEDTFDENQSIILLNSLSDSYKDVKAAIK-----------------------------------GNIEK
                                       GE   DENQ++ILLNSL ++Y++VKAAIK                                   G  EK
Subjt:  -------------------------------GEDTFDENQSIILLNSLSDSYKDVKAAIK-----------------------------------GNIEK

Query:  -----------------------------------------------GEVSGVNLGEACD----------TAEVLMVSENQEDEAWILDSGCSFHMTPHK
                                                           G N  E  D          +AEVLMVS     +AWI+DSGC+FHMTPH+
Subjt:  -----------------------------------------------GEVSGVNLGEACD----------TAEVLMVSENQEDEAWILDSGCSFHMTPHK

Query:  NWLEDFQEMSGGKVLLGNNQHCEVEGQGTVRIKTSDGAVRLLTGV-------------------------------------------------------
        ++L +FQ++ GGKVLLG+N  C+V+G G+V+I T DG VR+LT V                                                       
Subjt:  NWLEDFQEMSGGKVLLGNNQHCEVEGQGTVRIKTSDGAVRLLTGV-------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------SSRLKKLRTDNGLEFCNHEFNNFCKKKGIQRRLTVVGTPQQNGVAERMNRTLMEKVRCMLFNAQLP
                                            ++K LRTDNGLEF N++FN FCK +GI R  TV  TPQQNG+AER NRT+ME+ RC+L NA LP
Subjt:  ----------------------------------SSRLKKLRTDNGLEFCNHEFNNFCKKKGIQRRLTVVGTPQQNGVAERMNRTLMEKVRCMLFNAQLP

Query:  KSFWGEAVVTTCYIVNRSPSTAIG------------------------SYAHSNIGKLEPRAKKCIFLGYPEGIKGYKLWCLEPDSAKTIISRDVVFDES
          FWGEA  T CY++NRSPSTA+                         +YAH   GKL  RA KC+F+GYP+G+KGYKLWC+E    K IISRDV F+E+
Subjt:  KSFWGEAVVTTCYIVNRSPSTAIG------------------------SYAHSNIGKLEPRAKKCIFLGYPEGIKGYKLWCLEPDSAKTIISRDVVFDES

Query:  KMGY-----------------VENTTESNP---IEN-PKIPLEEEVVQED--------------------EEEQDTQDLTDYQLSRDRIRR---------
        +M Y                 V   +E  P   ++N P +  E E  Q+                     EE     DL +YQL+RDR++R         
Subjt:  KMGY-----------------VENTTESNP---IEN-PKIPLEEEVVQED--------------------EEEQDTQDLTDYQLSRDRIRR---------

Query:  ---------------------------------------------------------------------------TTGHL--------------------
                                                                                   T G+                     
Subjt:  ---------------------------------------------------------------------------TTGHL--------------------

Query:  -------LVRHTSIRILLAFVAHFDIELEQMDVKTTFLHGELDEMIYMDQPLGFVVPGQENKVCLLMKSLYGLKQSSRQWYKRSLT-----GYVFKFLDC
               +VRH+SIR++L+   HFD+ +EQMDV T FLHGEL+E+IYM QP G+ V G+E+ VC L KSLYGLKQS RQWY R  T     G+     D 
Subjt:  -------LVRHTSIRILLAFVAHFDIELEQMDVKTTFLHGELDEMIYMDQPLGFVVPGQENKVCLLMKSLYGLKQSSRQWYKRSLT-----GYVFKFLDC

Query:  TISWK
         + WK
Subjt:  TISWK

A0A5A7U2U7 Retrotransposon protein, putative, Ty1-copia sub-class1.6e-7626.79Show/hide
Query:  THLEVERFDGNGDFGLWKIKMNVVLTQQKV-DIAISKIIPKTVTETELKEMDKIAHSLIILHLADNVFRKVS----------------------------
        T  EV +F+G+GDF LW+ K+  +L Q KV  I   + +P  +TE+E ++MD++A+  I+L+L+D V R V                             
Subjt:  THLEVERFDGNGDFGLWKIKMNVVLTQQKV-DIAISKIIPKTVTETELKEMDKIAHSLIILHLADNVFRKVS----------------------------

Query:  -------------------------------GEDTFDENQSIILLNSLSDSYKDVKAAIK-----------------------------------GNIEK
                                       GE   DENQ++ILLNSL ++Y++VKAAIK                                   G  EK
Subjt:  -------------------------------GEDTFDENQSIILLNSLSDSYKDVKAAIK-----------------------------------GNIEK

Query:  GEVSG--------------------------------------------------VNLGEACD-------TAEVLMVSENQEDEAWILDSGCSFHMTPHK
            G                                                    + + CD       +AEVLMVS     +AWI+DSGC+FHMTPH+
Subjt:  GEVSG--------------------------------------------------VNLGEACD-------TAEVLMVSENQEDEAWILDSGCSFHMTPHK

Query:  NWLEDFQEMSGGKVLLGNNQHCEVEGQGTVRIKTSDGAVRLLTGV-------------------------------------------------------
        ++L +FQ++ GGKVLLG+N  C+V+G G+V+I T DG VR+LT V                                                       
Subjt:  NWLEDFQEMSGGKVLLGNNQHCEVEGQGTVRIKTSDGAVRLLTGV-------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------SSRLKKLRTDNGLEFCNHEFNNFCKKKGIQRRLTVVGTPQQNGVAERMNRTLMEKVRCMLFNAQLP
                                            ++K LRTDNGLEF N++FN FCK +GI R  TV  TPQQNG+AER NRT+ME+ RC+L NA LP
Subjt:  ----------------------------------SSRLKKLRTDNGLEFCNHEFNNFCKKKGIQRRLTVVGTPQQNGVAERMNRTLMEKVRCMLFNAQLP

Query:  KSFWGEAVVTTCYIVNRSPSTAIG------------------------SYAHSNIGKLEPRAKKCIFLGYPEGIKGYKLWCLEPDSAKTIISRDVVFDES
          FWGEA  T CY++NRSPSTA+                         +YAH   GKL  RA KC+F+GYP+G+KGYKLWC+E    K IISRDV F+E+
Subjt:  KSFWGEAVVTTCYIVNRSPSTAIG------------------------SYAHSNIGKLEPRAKKCIFLGYPEGIKGYKLWCLEPDSAKTIISRDVVFDES

Query:  KMGY-----------------VENTTESNP---IEN-PKIPLEEEVVQED--------------------EEEQDTQDLTDYQLSRDRIRR---------
        +M Y                 V   +E  P   ++N P +  E E  Q+                     EE     DL +YQL+RDR++R         
Subjt:  KMGY-----------------VENTTESNP---IEN-PKIPLEEEVVQED--------------------EEEQDTQDLTDYQLSRDRIRR---------

Query:  ---------------------------------------------------------------------------TTGHLLVRHTSIRILLAFVAHFDIE
                                                                                   T G+   R+ + R++L+   HFD+ 
Subjt:  ---------------------------------------------------------------------------TTGHLLVRHTSIRILLAFVAHFDIE

Query:  LEQMDVKTTFLHGELDEMIYMDQPLGFVVPGQENKVCLLMKSLYGLKQSSRQWYKRSLT-----GYVFKFLDCTISWK
        +EQMDV TTFLHGEL+E+IYM QP G+ V G+E+ VC L KSLYGLKQS RQWY    T     G+     D  + WK
Subjt:  LEQMDVKTTFLHGELDEMIYMDQPLGFVVPGQENKVCLLMKSLYGLKQSSRQWYKRSLT-----GYVFKFLDCTISWK

A0A5A7UB25 Putative gag-pol polyprotein5.4e-7726.67Show/hide
Query:  THLEVERFDGNGDFGLWKIKMNVVLTQQKV-DIAISKIIPKTVTETELKEMDKIAHSLIILHLADNVFRKVS----------------------------
        T  EV +F+G+GDF LW+ K+  +L Q KV  I   + +P  +TE+E ++MD++A+S I+L+L+D V R V                             
Subjt:  THLEVERFDGNGDFGLWKIKMNVVLTQQKV-DIAISKIIPKTVTETELKEMDKIAHSLIILHLADNVFRKVS----------------------------

Query:  -------------------------------GEDTFDENQSIILLNSLSDSYKDVKAAIK-----------------------------------GNIEK
                                       GE   DENQ++ILLNSL ++Y++VKAAIK                                   G  EK
Subjt:  -------------------------------GEDTFDENQSIILLNSLSDSYKDVKAAIK-----------------------------------GNIEK

Query:  -----------------------------------------------GEVSGVNLGEACD----------TAEVLMVSENQEDEAWILDSGCSFHMTPHK
                                                           G N  E  D          +AEVLMVS     +AWI+DSGC+FHMTPH+
Subjt:  -----------------------------------------------GEVSGVNLGEACD----------TAEVLMVSENQEDEAWILDSGCSFHMTPHK

Query:  NWLEDFQEMSGGKVLLGNNQHCEVEGQGTVRIKTSDGAVRLLTGV-------------------------------------------------------
        ++L +FQ++ GGKVLLG+N  C+V+G G+V+I T DG VR+LT V                                                       
Subjt:  NWLEDFQEMSGGKVLLGNNQHCEVEGQGTVRIKTSDGAVRLLTGV-------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------SSRLKKLRTDNGLEFCNHEFNNFCKKKGIQRRLTVVGTPQQNGVAERMNRTLMEKVRCMLFNAQLP
                                            ++K LRTDNGLEF N++FN FCK +GI R  TV  TPQQNG+AER NRT+ME+ RC+L NA LP
Subjt:  ----------------------------------SSRLKKLRTDNGLEFCNHEFNNFCKKKGIQRRLTVVGTPQQNGVAERMNRTLMEKVRCMLFNAQLP

Query:  KSFWGEAVVTTCYIVNRSPSTAIG------------------------SYAHSNIGKLEPRAKKCIFLGYPEGIKGYKLWCLEPDSAKTIISRDVVFDES
          FWGEA  T CY++NRSPSTA+                         +YAH   GKL  RA KC+F+GYP+G+KGYKLWC+E    K IISRDV F+E+
Subjt:  KSFWGEAVVTTCYIVNRSPSTAIG------------------------SYAHSNIGKLEPRAKKCIFLGYPEGIKGYKLWCLEPDSAKTIISRDVVFDES

Query:  KMGY-----------------VENTTESNP---IEN-PKIPLEEEVVQED--------------------EEEQDTQDLTDYQLSRDRIRR---------
        +M Y                 V   +E  P   ++N P +  E E  Q+                     EE     DL +YQL+RDR++R         
Subjt:  KMGY-----------------VENTTESNP---IEN-PKIPLEEEVVQED--------------------EEEQDTQDLTDYQLSRDRIRR---------

Query:  ---------------------------------------------------------------------------TTGHL--------------------
                                                                                   T G+                     
Subjt:  ---------------------------------------------------------------------------TTGHL--------------------

Query:  -------LVRHTSIRILLAFVAHFDIELEQMDVKTTFLHGELDEMIYMDQPLGFVVPGQENKVCLLMKSLYGLKQSSRQWYKRSLT-----GYVFKFLDC
               +VRH+SIR++L+   HFD+ +EQMDV T FLHGEL+E+IYM QP G+ V G+E+ VC L KSLYGLKQS RQWY R  T     G+     D 
Subjt:  -------LVRHTSIRILLAFVAHFDIELEQMDVKTTFLHGELDEMIYMDQPLGFVVPGQENKVCLLMKSLYGLKQSSRQWYKRSLT-----GYVFKFLDC

Query:  TISWK
         + WK
Subjt:  TISWK

A0A6A3A9V0 Uncharacterized protein2.1e-8129.96Show/hide
Query:  EVERFDGNGDFGLWKIKMNVVLTQQKVDIAIS--KIIPKTVTETELKEMDKIAHSLIILHLADNVFRKVSGE----------------------------
        ++E+F G  DFGLW++KM  +L QQ +  A+     +P  + E E   + + AHS IIL L D   R+VS E                            
Subjt:  EVERFDGNGDFGLWKIKMNVVLTQQKVDIAIS--KIIPKTVTETELKEMDKIAHSLIILHLADNVFRKVSGE----------------------------

Query:  ----------------DTF---------------DENQSIILLNSLSDSYKDVKAAI-------------------------------------------
                        D F               DE+++++LLNS+  SY+  K A+                                           
Subjt:  ----------------DTF---------------DENQSIILLNSLSDSYKDVKAAI-------------------------------------------

Query:  ------------------------------------KGNIEK----------------GEVSGVNLGEACDTAEVLMVSENQEDEAWILDSGCSFHMTPH
                                            +G+ +K                GEVS V+  +  D+ E L+++E++  + WILDSGCSFHM PH
Subjt:  ------------------------------------KGNIEK----------------GEVSGVNLGEACDTAEVLMVSENQEDEAWILDSGCSFHMTPH

Query:  KNWLEDFQEMSGGKVLLGNNQHCEVEGQGTVRIK------------TSDGAVRLLTGV------------------------------------------
        K+W E  Q +SGG VLLG+N+ C V G GT+RI+             + G++  + G+                                          
Subjt:  KNWLEDFQEMSGGKVLLGNNQHCEVEGQGTVRIK------------TSDGAVRLLTGV------------------------------------------

Query:  -----------------------------------------------------------------------SSRLKKLRTDNGLEFCNHEFNNFCKKKGI
                                                                                ++LK LRTDNGLE+ + +FN  C+K GI
Subjt:  -----------------------------------------------------------------------SSRLKKLRTDNGLEFCNHEFNNFCKKKGI

Query:  QRRLTVVGTPQQNGVAERMNRTLMEKVRCMLFNAQLPKSFWGEAVVTTCYIVNRSPSTAIGSYAHSNIGKLEPRAKKCIFLGYPEGIKGYKLWCLEPDSA
         R  TV  TPQQNG+AERMNRTL+E+VRCML NA LPKSFWGEAV T CY++NR               KLE RA +CIF+GYP G KGYKLWC+EP   
Subjt:  QRRLTVVGTPQQNGVAERMNRTLMEKVRCMLFNAQLPKSFWGEAVVTTCYIVNRSPSTAIGSYAHSNIGKLEPRAKKCIFLGYPEGIKGYKLWCLEPDSA

Query:  KTIISRDVVFDESKMGYVENTTESNPIENPKIPLEEEVVQ----------------EDEEEQDTQD---LTDYQLSRDRIRRT-----------------
        K IISRDVVFDESKM  + +    N +    IP+E E+ Q                E E +Q+T+    L DY L RDR RRT                 
Subjt:  KTIISRDVVFDESKMGYVENTTESNPIENPKIPLEEEVVQ----------------EDEEEQDTQD---LTDYQLSRDRIRRT-----------------

Query:  -----------------------------------------TGHLLVRHTSIRILLAFVAHFDIELEQMDVKTTFLHGELDEMIYMDQPLGFVVPGQENK
                                                 T  L+ R  + R++ + VA +D+ELEQ+DVKT FLHGEL+E IYMDQP G+     ++K
Subjt:  -----------------------------------------TGHLLVRHTSIRILLAFVAHFDIELEQMDVKTTFLHGELDEMIYMDQPLGFVVPGQENK

Query:  VCLLMKSLYGLKQSSRQWYKR
        VCLL KSLYGLKQS RQWYKR
Subjt:  VCLLMKSLYGLKQSSRQWYKR

A0A6A3A9V0 Uncharacterized protein2.8e-0951.81Show/hide
Query:  LGFVVPGQEN---KVCLLMKSLYGLKQSSRQWYKRSLTGYVFKFLDCTISWKTNLQSVVALSTTESEYIALTEAIKESIWLKG
        LG +   QEN   KV   + S Y     +R    +SLTG++F      +SWK+NLQSVVALSTTE+EYIA+TEAIKE++WL+G
Subjt:  LGFVVPGQEN---KVCLLMKSLYGLKQSSRQWYKRSLTGYVFKFLDCTISWKTNLQSVVALSTTESEYIALTEAIKESIWLKG

A0A6A3A9V0 Uncharacterized protein1.7e-7829.23Show/hide
Query:  EVERFDGNGDFGLWKIKMNVVLTQQKVDIAI--SKIIPKTVTETELKEMDKIAHSLIILHLADNVFRKVSGEDTF-------------------------
        EVE+F    DFGLW++KM  +L QQ +  A+     + K + + + K + + AHS IIL L D V R+VS E T                          
Subjt:  EVERFDGNGDFGLWKIKMNVVLTQQKVDIAI--SKIIPKTVTETELKEMDKIAHSLIILHLADNVFRKVSGEDTF-------------------------

Query:  ---------DENQSIILLNSLSDSYK----------------DVKAAI----------------------------------------------KGNI--
                 DE+Q+++LL SL  SY                 +V+AA+                                              +GNI  
Subjt:  ---------DENQSIILLNSLSDSYK----------------DVKAAI----------------------------------------------KGNI--

Query:  -----------------EKGEVSGVN------------LGEACDTAEVLMVSENQEDEAWILDSGCSFHMTPHKNWLEDFQEMSGGKVLLGNNQHCEVEG
                         E+ +  G N              +  ++AE LMVSE   +  WI+DSGCS+HMTP+++W E F + + G VLLG+N+ C++EG
Subjt:  -----------------EKGEVSGVN------------LGEACDTAEVLMVSENQEDEAWILDSGCSFHMTPHKNWLEDFQEMSGGKVLLGNNQHCEVEG

Query:  QGTVRIKTSDGAVRLLTGV---------------------------------------------------------------------------------
         G++R K  DGA R+LT V                                                                                 
Subjt:  QGTVRIKTSDGAVRLLTGV---------------------------------------------------------------------------------

Query:  --------------------SSRLKKLRTDNGLEFCNHEFNNFCKKKGIQRRLTVVGTPQQNGVAERMNRTLMEKVRCMLFNAQLPKSFWGEAVVTTCYI
                            + ++K+LRTDNGLEFC+  FN+FCK+    R  TV GTPQQNG+AER NRT++E+VRCML +A LPK FW EA +   Y+
Subjt:  --------------------SSRLKKLRTDNGLEFCNHEFNNFCKKKGIQRRLTVVGTPQQNGVAERMNRTLMEKVRCMLFNAQLPKSFWGEAVVTTCYI

Query:  VNRSPSTAIG------------------------SYAHSNIGKLEPRAKKCIFLGYPEGIKGYKLWCLEPDSAKTIISRDVVFDESKMGY-------VEN
        +N+ PST +                         +YAH    KLEPRA KCIFLGYPEG+KGYKLWCLE    + ++S DVVF+E++M Y         +
Subjt:  VNRSPSTAIG------------------------SYAHSNIGKLEPRAKKCIFLGYPEGIKGYKLWCLEPDSAKTIISRDVVFDESKMGY-------VEN

Query:  TTESNPIENPKIPLEEEV-------------VQEDEEEQDTQDLTDYQLSRDRIRRT-------------------------------------------
        T +S  I+  K+  E E              + E++ E++ Q+  DY L+RDRIRR                                            
Subjt:  TTESNPIENPKIPLEEEV-------------VQEDEEEQDTQDLTDYQLSRDRIRRT-------------------------------------------

Query:  ---------------TGHLL-----------------------------------------------------VRHTSIRILLAFVAHFDIELEQMDVKT
                       T  L+                                                     V+H SIRIL+A VA FD+ LEQMDVKT
Subjt:  ---------------TGHLL-----------------------------------------------------VRHTSIRILLAFVAHFDIELEQMDVKT

Query:  TFLHGELDEMIYMDQPLGFVVPGQENKVCLLMKSLYGLKQSSRQW
        TFL+G+LDE+I M QP GF V G+E+ VC L KSLYGLKQS RQW
Subjt:  TFLHGELDEMIYMDQPLGFVVPGQENKVCLLMKSLYGLKQSSRQW

SwissProt top hitse value%identityAlignment
P04146 Copia protein1.1e-2335Show/hide
Query:  LRTDNGLEFCNHEFNNFCKKKGIQRRLTVVGTPQQNGVAERMNRTLMEKVRCMLFNAQLPKSFWGEAVVTTCYIVNRSPSTAIGS---------------
        L  DNG E+ ++E   FC KKGI   LTV  TPQ NGV+ERM RT+ EK R M+  A+L KSFWGEAV+T  Y++NR PS A+                 
Subjt:  LRTDNGLEFCNHEFNNFCKKKGIQRRLTVVGTPQQNGVAERMNRTLMEKVRCMLFNAQLPKSFWGEAVVTTCYIVNRSPSTAIGS---------------

Query:  -----------YAH--SNIGKLEPRAKKCIFLGYPEGIKGYKLWCLEPDSAKTIISRDVVFDESKM----------GYVENTTESNPIENPKIPLEEEVV
                   Y H  +  GK + ++ K IF+GY     G+KLW  +  + K I++RDVV DE+ M           +++++ ES   EN   P +   +
Subjt:  -----------YAH--SNIGKLEPRAKKCIFLGYPEGIKGYKLWCLEPDSAKTIISRDVVFDESKM----------GYVENTTESNPIENPKIPLEEEVV

Query:  QEDEEEQDTQDLTDYQLSRD
         + E   ++++  + Q  +D
Subjt:  QEDEEEQDTQDLTDYQLSRD

P04146 Copia protein3.3e-0733.88Show/hide
Query:  TSIRILLAFVAHFDIEL-EQMDVKTTFLHGELDEMIYMDQPLGFVVPGQENKVCLLMKSLYGLKQSSRQWYKRSLTGYVFKFLDCT-ISWKTNLQSVVAL
        T++ IL  + +  + EL + +     +L G +D  +   + L F     ENK+   + S +   +  R    +S TGY+FK  D   I W T  Q+ VA 
Subjt:  TSIRILLAFVAHFDIEL-EQMDVKTTFLHGELDEMIYMDQPLGFVVPGQENKVCLLMKSLYGLKQSSRQWYKRSLTGYVFKFLDCT-ISWKTNLQSVVAL

Query:  STTESEYIALTEAIKESIWLK
        S+TE+EY+AL EA++E++WLK
Subjt:  STTESEYIALTEAIKESIWLK

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-941.6e-4632.79Show/hide
Query:  RLKKLRTDNGLEFCNHEFNNFCKKKGIQRRLTVVGTPQQNGVAERMNRTLMEKVRCMLFNAQLPKSFWGEAVVTTCYIVNRSPSTAIG------------
        +LK+LR+DNG E+ + EF  +C   GI+   TV GTPQ NGVAERMNRT++EKVR ML  A+LPKSFWGEAV T CY++NRSPS  +             
Subjt:  RLKKLRTDNGLEFCNHEFNNFCKKKGIQRRLTVVGTPQQNGVAERMNRTLMEKVRCMLFNAQLPKSFWGEAVVTTCYIVNRSPSTAIG------------

Query:  -SYAHSNI--------------GKLEPRAKKCIFLGYPEGIKGYKLWCLEPDSAKTIISRDVVFDES------------KMGYVEN-----TTESNP---
         SY+H  +               KL+ ++  CIF+GY +   GY+LW  +P   K I SRDVVF ES            K G + N     +T +NP   
Subjt:  -SYAHSNI--------------GKLEPRAKKCIFLGYPEGIKGYKLWCLEPDSAKTIISRDVVFDES------------KMGYVEN-----TTESNP---

Query:  ---------------------------------------------------IENPKIPLEEEVVQEDEEEQD----------------------------
                                                           +E+ + P  E V+  D+ E +                            
Subjt:  ---------------------------------------------------IENPKIPLEEEVVQEDEEEQD----------------------------

Query:  -TQDLTD-------------YQLSRD---RIRRTTGHL-------------------LVRHTSIRILLAFVAHFDIELEQMDVKTTFLHGELDEMIYMDQ
         T  L +             ++L +D   ++ R    L                   +V+ TSIR +L+  A  D+E+EQ+DVKT FLHG+L+E IYM+Q
Subjt:  -TQDLTD-------------YQLSRD---RIRRTTGHL-------------------LVRHTSIRILLAFVAHFDIELEQMDVKTTFLHGELDEMIYMDQ

Query:  PLGFVVPGQENKVCLLMKSLYGLKQSSRQWYKR
        P GF V G+++ VC L KSLYGLKQ+ RQWY +
Subjt:  PLGFVVPGQENKVCLLMKSLYGLKQSSRQWYKR

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-941.1e-0761.22Show/hide
Query:  KRSLTGYVFKFLDCTISWKTNLQSVVALSTTESEYIALTEAIKESIWLK
        ++S TGY+F F    ISW++ LQ  VALSTTE+EYIA TE  KE IWLK
Subjt:  KRSLTGYVFKFLDCTISWKTNLQSVVALSTTESEYIALTEAIKESIWLK

Q12472 Transposon Ty2-DR1 Gag-Pol polyprotein3.7e-0628.89Show/hide
Query:  SSRLKKLRTDNGLEFCNHEFNNFCKKKGIQRRLTVVGTPQQNGVAERMNRTLMEKVRCMLFNAQLPKSFWGEAVVTTCYIVNR--SPSTAIGSYAHSNIG
        ++R+  ++ D G E+ N   + F   +GI    T     + +GVAER+NRTL+   R +L  + LP   W  AV  +  I N   SP     +  H+ + 
Subjt:  SSRLKKLRTDNGLEFCNHEFNNFCKKKGIQRRLTVVGTPQQNGVAERMNRTLMEKVRCMLFNAQLPKSFWGEAVVTTCYIVNR--SPSTAIGSYAHSNIG

Query:  KLE-----PRAKKCIFLG-------YPEGIKGYKL
         L+     P  +  I          +P GI GY L
Subjt:  KLE-----PRAKKCIFLG-------YPEGIKGYKL

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE13.2e-1044.59Show/hide
Query:  LVRHTSIRILLAFVAHFDIELEQMDVKTTFLHGELDEMIYMDQPLGFVVPGQENKVCLLMKSLYGLKQSSRQWY
        +++ TSIRI+L         + Q+DV   FL G L + +YM QP GF+   + N VC L K+LYGLKQ+ R WY
Subjt:  LVRHTSIRILLAFVAHFDIELEQMDVKTTFLHGELDEMIYMDQPLGFVVPGQENKVCLLMKSLYGLKQSSRQWY

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE19.1e-0530.59Show/hide
Query:  SRLKKLRTDNGLEFCNHEFNNFCKKKGIQRRLTVVGTPQQNGVAERMNRTLMEKVRCMLFNAQLPKSFWGEAVVTTCYIVNRSPS
        +R+    +DNG EF       +  + GI    +   TP+ NG++ER +R ++E    +L +A +PK++W  A     Y++NR P+
Subjt:  SRLKKLRTDNGLEFCNHEFNNFCKKKGIQRRLTVVGTPQQNGVAERMNRTLMEKVRCMLFNAQLPKSFWGEAVVTTCYIVNRSPS

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE21.2e-0943.24Show/hide
Query:  LVRHTSIRILLAFVAHFDIELEQMDVKTTFLHGELDEMIYMDQPLGFVVPGQENKVCLLMKSLYGLKQSSRQWY
        +++ TSIRI+L         + Q+DV   FL G L + +YM QP GFV   + + VC L K++YGLKQ+ R WY
Subjt:  LVRHTSIRILLAFVAHFDIELEQMDVKTTFLHGELDEMIYMDQPLGFVVPGQENKVCLLMKSLYGLKQSSRQWY

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE22.4e-0531.76Show/hide
Query:  SRLKKLRTDNGLEFCNHEFNNFCKKKGIQRRLTVVGTPQQNGVAERMNRTLMEKVRCMLFNAQLPKSFWGEAVVTTCYIVNRSPS
        +R+  L +DNG EF      ++  + GI    +   TP+ NG++ER +R ++E    +L +A +PK++W  A     Y++NR P+
Subjt:  SRLKKLRTDNGLEFCNHEFNNFCKKKGIQRRLTVVGTPQQNGVAERMNRTLMEKVRCMLFNAQLPKSFWGEAVVTTCYIVNRSPS

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 82.7e-1244.83Show/hide
Query:  TSIRILLAFVAHFDIELEQMDVKTTFLHGELDEMIYMDQPLGFVVPGQE----NKVCLLMKSLYGLKQSSRQWY-KRSLTGYVFKFL
        TS++++LA  A ++  L Q+D+   FL+G+LDE IYM  P G+     +    N VC L KS+YGLKQ+SRQW+ K S+T   F F+
Subjt:  TSIRILLAFVAHFDIELEQMDVKTTFLHGELDEMIYMDQPLGFVVPGQE----NKVCLLMKSLYGLKQSSRQWY-KRSLTGYVFKFL

AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 81.5e-0243.75Show/hide
Query:  KRSLTGYVFKFLDCTISWKTNLQSVVALSTTESEYIALTEAIKESIWL
        +RS  GY        ISWK+  Q VV+ S+ E+EY AL+ A  E +WL
Subjt:  KRSLTGYVFKFLDCTISWKTNLQSVVALSTTESEYIALTEAIKESIWL

ATMG00710.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein2.0e-0739.76Show/hide
Query:  MNRTLMEKVRCMLFNAQLPKSFWGEAVVTTCYIVNRSPSTAIG------------------------SYAHSNIGKLEPRAKK
        MNRT++EKVR ML    LPK+F  +A  T  +I+N+ PSTAI                         +Y H + GKL+PRAKK
Subjt:  MNRTLMEKVRCMLFNAQLPKSFWGEAVVTTCYIVNRSPSTAIG------------------------SYAHSNIGKLEPRAKK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGAACGCATTTGGAAGTAGAACGGTTTGATGGGAATGGTGATTTTGGTCTATGGAAGATCAAAATGAATGTTGTCTTAACACAACAAAAGGTGGATATTGCAATTTC
CAAAATCATCCCAAAGACTGTGACAGAAACTGAACTTAAGGAAATGGACAAGATCGCTCATAGTTTGATTATTTTGCATCTTGCCGACAATGTCTTCCGCAAGGTTAGTG
GAGAAGACACATTCGATGAAAATCAATCCATTATTTTGTTGAATTCTTTGTCTGATTCTTATAAGGATGTTAAAGCTGCTATTAAGGGTAATATAGAGAAGGGAGAGGTG
AGCGGAGTTAACCTAGGAGAGGCTTGTGACACCGCTGAAGTCCTTATGGTTAGTGAGAACCAAGAGGATGAAGCTTGGATATTAGATTCAGGTTGTTCCTTCCATATGAC
TCCCCATAAAAACTGGTTAGAAGACTTCCAAGAAATGAGTGGTGGTAAAGTCTTGTTAGGTAATAACCAACATTGTGAGGTTGAAGGACAAGGTACTGTTAGGATTAAAA
CTAGTGATGGGGCCGTGAGACTCTTGACGGGGGTAAGCTCTAGACTCAAGAAACTTAGAACTGATAATGGTCTTGAATTTTGCAATCATGAGTTTAATAATTTTTGTAAG
AAAAAGGGTATTCAAAGGCGTTTGACAGTTGTAGGTACTCCTCAACAAAATGGGGTTGCTGAAAGAATGAACCGTACTCTAATGGAGAAAGTTAGATGCATGTTGTTTAA
TGCACAGCTTCCCAAAAGTTTCTGGGGAGAAGCTGTAGTGACAACATGCTATATTGTTAATAGGAGTCCTTCTACAGCTATAGGATCCTATGCTCATAGTAACATAGGAA
AACTAGAACCTAGAGCTAAGAAGTGTATCTTCTTAGGGTATCCCGAGGGCATTAAGGGGTACAAACTTTGGTGCCTAGAACCAGATAGTGCTAAAACAATAATTAGTAGG
GATGTTGTCTTTGATGAGTCTAAGATGGGCTATGTAGAAAATACCACTGAGTCTAACCCCATTGAAAATCCTAAGATACCACTTGAAGAAGAAGTTGTTCAAGAGGATGA
GGAAGAACAAGACACCCAAGATCTTACTGATTATCAACTGTCTAGGGATAGGATTAGGAGAACTACTGGCCACCTACTAGTAAGACATACATCTATTAGAATTTTGCTTG
CTTTTGTTGCTCATTTTGATATTGAACTTGAACAAATGGATGTTAAAACAACTTTCTTGCATGGTGAACTTGATGAGATGATCTATATGGATCAACCTCTTGGATTTGTA
GTTCCGGGACAAGAAAATAAAGTTTGTTTACTAATGAAATCCCTTTATGGCTTAAAGCAATCTTCTAGACAATGGTATAAAAGATCTTTGACTGGTTATGTATTTAAATT
TCTTGATTGTACTATTAGTTGGAAAACCAATTTACAATCTGTGGTTGCTTTATCTACAACTGAATCAGAATACATTGCCTTAACTGAAGCAATAAAGGAATCAATATGGC
TTAAAGGCAAAAATCTTATCATATCTGCCATGCCACAGGGCTCAAGAGCTTTACATACAACCAACGATGTAGATGAATGGATGTGGTCATGTTTGTCGGCTTTCAAACTG
CTCACTGTCCTCAACATTACACGTCCAAATGTAGATAACTTGGGATCCTTGGTGTTACCTTGTTTGAGAATTTTAGAGATCAAAAGGGTATTCAATTCCTGGCATGATGA
CTTCAATGAGTACTACAACAATTTGGGAAACGACAGCGACGTACCATTGAATGAAAAGACATTCTCAGGTTGTCCAAATTTGGAATGTTTAGTTTTAATTGATTGTCTAT
TCGAAAGCGACCGTATATCTGCTCCAAAACTTGAGAGCTTGGAGCTAAGTACTGTAGAATTTCTACAAGCTTCCCCGACAGTTGAGTTGTGTACACCAAATCTGAAAACT
GTCAAGCTTAGCAATGTTGTACCTTGTGTGAAGTCTGCTTCTGACTTACTCTACATACACAAAGTAACCTTAGCAATTATTGGGTTAATTGATGACTATGAATTTGAAAG
GGAAAAAATCTCTCAACTGCTTAGAGTTTTTCATAATGCAGTGTCTCTTACGCTGCCTGAAAATGCAACACAGATGGACAAGGAGCTGACGAGGACAATCGGGCAGAGGT
GGGACCAAAAGACCGACCCAGAGGAAGACCGGGCCAAAGGGTCGGGCCAAAATGGCCAGACCCATATGGTCGGCCTCGGCCTTTTGCCAAGGCCGAGCATATGGCATCGG
AGGCGGTGTGGCCTACACCACGCCGGTGTGCAGCGGTTTTTGCTGGTCTTGCAGGTCACGTCTTCCCCGGTTTCTACAAATTCACTGTTTGTGTCACGTGAAGGTCAGGT
GAGTCTTCTGTCCGGATTTTGGCATCAACAGTTGGCGCCGTCTGTGGGGAGGAGTGCTTACCAGTTCAAACCGCGCATCGGTGGTGGTCTGAAGGATGGATCGGGTTCAG
AAACTAGAGGGAAGCCTCGGGTTCCGTGTGCCCACGATGTTGTGGACCTAACCCAAGAGGACAACCAGCAGAAGGAGAGGAGAGCCGGGAAGCTTTTGTATCTGACCAGC
ATCAAGAAGACAGAGGAAAAGGACGTCGGGTAG
mRNA sequenceShow/hide mRNA sequence
ATGGGAACGCATTTGGAAGTAGAACGGTTTGATGGGAATGGTGATTTTGGTCTATGGAAGATCAAAATGAATGTTGTCTTAACACAACAAAAGGTGGATATTGCAATTTC
CAAAATCATCCCAAAGACTGTGACAGAAACTGAACTTAAGGAAATGGACAAGATCGCTCATAGTTTGATTATTTTGCATCTTGCCGACAATGTCTTCCGCAAGGTTAGTG
GAGAAGACACATTCGATGAAAATCAATCCATTATTTTGTTGAATTCTTTGTCTGATTCTTATAAGGATGTTAAAGCTGCTATTAAGGGTAATATAGAGAAGGGAGAGGTG
AGCGGAGTTAACCTAGGAGAGGCTTGTGACACCGCTGAAGTCCTTATGGTTAGTGAGAACCAAGAGGATGAAGCTTGGATATTAGATTCAGGTTGTTCCTTCCATATGAC
TCCCCATAAAAACTGGTTAGAAGACTTCCAAGAAATGAGTGGTGGTAAAGTCTTGTTAGGTAATAACCAACATTGTGAGGTTGAAGGACAAGGTACTGTTAGGATTAAAA
CTAGTGATGGGGCCGTGAGACTCTTGACGGGGGTAAGCTCTAGACTCAAGAAACTTAGAACTGATAATGGTCTTGAATTTTGCAATCATGAGTTTAATAATTTTTGTAAG
AAAAAGGGTATTCAAAGGCGTTTGACAGTTGTAGGTACTCCTCAACAAAATGGGGTTGCTGAAAGAATGAACCGTACTCTAATGGAGAAAGTTAGATGCATGTTGTTTAA
TGCACAGCTTCCCAAAAGTTTCTGGGGAGAAGCTGTAGTGACAACATGCTATATTGTTAATAGGAGTCCTTCTACAGCTATAGGATCCTATGCTCATAGTAACATAGGAA
AACTAGAACCTAGAGCTAAGAAGTGTATCTTCTTAGGGTATCCCGAGGGCATTAAGGGGTACAAACTTTGGTGCCTAGAACCAGATAGTGCTAAAACAATAATTAGTAGG
GATGTTGTCTTTGATGAGTCTAAGATGGGCTATGTAGAAAATACCACTGAGTCTAACCCCATTGAAAATCCTAAGATACCACTTGAAGAAGAAGTTGTTCAAGAGGATGA
GGAAGAACAAGACACCCAAGATCTTACTGATTATCAACTGTCTAGGGATAGGATTAGGAGAACTACTGGCCACCTACTAGTAAGACATACATCTATTAGAATTTTGCTTG
CTTTTGTTGCTCATTTTGATATTGAACTTGAACAAATGGATGTTAAAACAACTTTCTTGCATGGTGAACTTGATGAGATGATCTATATGGATCAACCTCTTGGATTTGTA
GTTCCGGGACAAGAAAATAAAGTTTGTTTACTAATGAAATCCCTTTATGGCTTAAAGCAATCTTCTAGACAATGGTATAAAAGATCTTTGACTGGTTATGTATTTAAATT
TCTTGATTGTACTATTAGTTGGAAAACCAATTTACAATCTGTGGTTGCTTTATCTACAACTGAATCAGAATACATTGCCTTAACTGAAGCAATAAAGGAATCAATATGGC
TTAAAGGCAAAAATCTTATCATATCTGCCATGCCACAGGGCTCAAGAGCTTTACATACAACCAACGATGTAGATGAATGGATGTGGTCATGTTTGTCGGCTTTCAAACTG
CTCACTGTCCTCAACATTACACGTCCAAATGTAGATAACTTGGGATCCTTGGTGTTACCTTGTTTGAGAATTTTAGAGATCAAAAGGGTATTCAATTCCTGGCATGATGA
CTTCAATGAGTACTACAACAATTTGGGAAACGACAGCGACGTACCATTGAATGAAAAGACATTCTCAGGTTGTCCAAATTTGGAATGTTTAGTTTTAATTGATTGTCTAT
TCGAAAGCGACCGTATATCTGCTCCAAAACTTGAGAGCTTGGAGCTAAGTACTGTAGAATTTCTACAAGCTTCCCCGACAGTTGAGTTGTGTACACCAAATCTGAAAACT
GTCAAGCTTAGCAATGTTGTACCTTGTGTGAAGTCTGCTTCTGACTTACTCTACATACACAAAGTAACCTTAGCAATTATTGGGTTAATTGATGACTATGAATTTGAAAG
GGAAAAAATCTCTCAACTGCTTAGAGTTTTTCATAATGCAGTGTCTCTTACGCTGCCTGAAAATGCAACACAGATGGACAAGGAGCTGACGAGGACAATCGGGCAGAGGT
GGGACCAAAAGACCGACCCAGAGGAAGACCGGGCCAAAGGGTCGGGCCAAAATGGCCAGACCCATATGGTCGGCCTCGGCCTTTTGCCAAGGCCGAGCATATGGCATCGG
AGGCGGTGTGGCCTACACCACGCCGGTGTGCAGCGGTTTTTGCTGGTCTTGCAGGTCACGTCTTCCCCGGTTTCTACAAATTCACTGTTTGTGTCACGTGAAGGTCAGGT
GAGTCTTCTGTCCGGATTTTGGCATCAACAGTTGGCGCCGTCTGTGGGGAGGAGTGCTTACCAGTTCAAACCGCGCATCGGTGGTGGTCTGAAGGATGGATCGGGTTCAG
AAACTAGAGGGAAGCCTCGGGTTCCGTGTGCCCACGATGTTGTGGACCTAACCCAAGAGGACAACCAGCAGAAGGAGAGGAGAGCCGGGAAGCTTTTGTATCTGACCAGC
ATCAAGAAGACAGAGGAAAAGGACGTCGGGTAG
Protein sequenceShow/hide protein sequence
MGTHLEVERFDGNGDFGLWKIKMNVVLTQQKVDIAISKIIPKTVTETELKEMDKIAHSLIILHLADNVFRKVSGEDTFDENQSIILLNSLSDSYKDVKAAIKGNIEKGEV
SGVNLGEACDTAEVLMVSENQEDEAWILDSGCSFHMTPHKNWLEDFQEMSGGKVLLGNNQHCEVEGQGTVRIKTSDGAVRLLTGVSSRLKKLRTDNGLEFCNHEFNNFCK
KKGIQRRLTVVGTPQQNGVAERMNRTLMEKVRCMLFNAQLPKSFWGEAVVTTCYIVNRSPSTAIGSYAHSNIGKLEPRAKKCIFLGYPEGIKGYKLWCLEPDSAKTIISR
DVVFDESKMGYVENTTESNPIENPKIPLEEEVVQEDEEEQDTQDLTDYQLSRDRIRRTTGHLLVRHTSIRILLAFVAHFDIELEQMDVKTTFLHGELDEMIYMDQPLGFV
VPGQENKVCLLMKSLYGLKQSSRQWYKRSLTGYVFKFLDCTISWKTNLQSVVALSTTESEYIALTEAIKESIWLKGKNLIISAMPQGSRALHTTNDVDEWMWSCLSAFKL
LTVLNITRPNVDNLGSLVLPCLRILEIKRVFNSWHDDFNEYYNNLGNDSDVPLNEKTFSGCPNLECLVLIDCLFESDRISAPKLESLELSTVEFLQASPTVELCTPNLKT
VKLSNVVPCVKSASDLLYIHKVTLAIIGLIDDYEFEREKISQLLRVFHNAVSLTLPENATQMDKELTRTIGQRWDQKTDPEEDRAKGSGQNGQTHMVGLGLLPRPSIWHR
RRCGLHHAGVQRFLLVLQVTSSPVSTNSLFVSREGQVSLLSGFWHQQLAPSVGRSAYQFKPRIGGGLKDGSGSETRGKPRVPCAHDVVDLTQEDNQQKERRAGKLLYLTS
IKKTEEKDVG