; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc01G15865 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc01G15865
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionRetrovirus-related Pol polyprotein from transposon RE1
Genome locationClcChr01:28649399..28652068
RNA-Seq ExpressionClc01G15865
SyntenyClc01G15865
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
GAU19483.1 hypothetical protein TSUD_77270 [Trifolium subterraneum]1.9e-7734.35Show/hide
Query:  ANATNNFPAATRMP-NFNSPPL--NQLLNQITTIKIEGYLLGHKICPPMFIRQNPSNASNETSIAEASSSQVTTSDPVTFATTEIINPEYEAWLVVDQLL
        +N  N+ P++  +  + N+ PL  + +L  I   K++GY+LG + CP  FI  + S+ +  ++  E                          W   DQ L
Subjt:  ANATNNFPAATRMP-NFNSPPL--NQLLNQITTIKIEGYLLGHKICPPMFIRQNPSNASNETSIAEASSSQVTTSDPVTFATTEIINPEYEAWLVVDQLL

Query:  LGWLYNSITPEIATQLMGFERSKDLWDAIQELFGVQSRAEEDYLRQTFQQTRKGNNTMSEYLRLMKMHSNNLGQAGSPGPSKALISQVLLGLDENYNAIV
        LGW+ NS+T EIATQL+  E SK LWD  Q L G  +R++  YL+  F   RKG   M +YL  MK   + L  AG+P  +  LI Q L GLD  YN +V
Subjt:  LGWLYNSITPEIATQLMGFERSKDLWDAIQELFGVQSRAEEDYLRQTFQQTRKGNNTMSEYLRLMKMHSNNLGQAGSPGPSKALISQVLLGLDENYNAIV

Query:  ATLQRKPDISWLHHNTQRHKIRVAAEINTL-NVEAISVEIEVRAEEEIDPPAKYVE----------------RLVIQLMSATIASTKNLILI--------
          L  +  +SW+  + Q   +   + I  L N+  +++          D   K                   R         +    N I I        
Subjt:  ATLQRKPDISWLHHNTQRHKIRVAAEINTL-NVEAISVEIEVRAEEEIDPPAKYVE----------------RLVIQLMSATIASTKNLILI--------

Query:  -------------------------SPKVKNWYADSGATNHVTSDYTNLTHPTEYGGTKLVTVGNGNTLKIASVGNTVLTNGKHLLNLKNILHVPDIAKN
                                 S +  +WY DSGA+NHVT         TE+ G   + VGNG  L I + G++ L +    LNL +IL+VP+I KN
Subjt:  -------------------------SPKVKNWYADSGATNHVTSDYTNLTHPTEYGGTKLVTVGNGNTLKIASVGNTVLTNGKHLLNLKNILHVPDIAKN

Query:  LVSVSKLAKDNNVFVEFHEDYCLVKDKGSEQTVLKGTLKDGLYHLEETTVASHDAVPVNSAACPLKEAVNINKENESALSPSRISINSAVTKTTSHRRLG
        L+SVSKLA DNN+ VEF E+ C VKDK + + +LKG LKDGLY L  T                        K N SA     +S+     K + HRRLG
Subjt:  LVSVSKLAKDNNVFVEFHEDYCLVKDKGSEQTVLKGTLKDGLYHLEETTVASHDAVPVNSAACPLKEAVNINKENESALSPSRISINSAVTKTTSHRRLG

Query:  HSSSRIFYLITKGCNISF-KSDATEFCHACQLGKSHRLLFSSSESRALKAFELIHTDVWGPTSMLSSAGFRYYVLFLDDFSRYVWIYPLRLKGDTMVAFE
        H ++++   + + C +    SD   FC ACQ GK H L F SS S A +  EL+HTDVWGP  +++S+GF+YYV F+DDFSR+ WIYPL+ K +T+ AF 
Subjt:  HSSSRIFYLITKGCNISF-KSDATEFCHACQLGKSHRLLFSSSESRALKAFELIHTDVWGPTSMLSSAGFRYYVLFLDDFSRYVWIYPLRLKGDTMVAFE

Query:  HFMQMMHTQFNSSIKAIQSD
         F  +   QFN  IK IQ D
Subjt:  HFMQMMHTQFNSSIKAIQSD

PNX76291.1 gag/pol polyprotein - maize retrotransposon Hopscotch, partial [Trifolium pratense]5.1e-7832.91Show/hide
Query:  ANATNNFPA--ATRMPNFNSPP-LNQLLNQITTIKIEGYLLGHKICPPMFIRQNPSNASNETSIAEASSSQVTTSDPVTFATTEIINPEYEAWLVVDQLL
        +N  N+ P+  + ++   N P   + +L  I   +++GY+LG K CP  FI                     T +D     +++  NPE+E W   DQ L
Subjt:  ANATNNFPA--ATRMPNFNSPP-LNQLLNQITTIKIEGYLLGHKICPPMFIRQNPSNASNETSIAEASSSQVTTSDPVTFATTEIINPEYEAWLVVDQLL

Query:  LGWLYNSITPEIATQLMGFERSKDLWDAIQELFGVQSRAEEDYLRQTFQQTRKGNNTMSEYLRLMKMHSNNLGQAGSPGPSKALISQVLLGLDENYNAIV
        LGWL NS+T  IATQL+  E S  LWD  Q L G  +R++  YL+  F  TRKG   M +YL  MK  ++ L  AG+P  +  LI Q L GLD  YN +V
Subjt:  LGWLYNSITPEIATQLMGFERSKDLWDAIQELFGVQSRAEEDYLRQTFQQTRKGNNTMSEYLRLMKMHSNNLGQAGSPGPSKALISQVLLGLDENYNAIV

Query:  ATLQRKPDISWLHHNTQ----RHKIRVAAEINTLNVEA----------------------------------------------------------ISVE
          L  +  +SW+    Q     ++I     +  L + A                                                          I+++
Subjt:  ATLQRKPDISWLHHNTQ----RHKIRVAAEINTLNVEA----------------------------------------------------------ISVE

Query:  IEVRAEEEIDPPAKYVERLVIQLMSATIASTKNLILISPKVKNWYADSGATNHVTSDYTNLTHPTEYGGTKLVTVGNGNTLKIASVGNTVLTNGKHLLNL
           R ++                 +A +AS  ++     +  +WY DSGA+NHVT       + +E+ G   + VGNG  L+I + G++ L +    LNL
Subjt:  IEVRAEEEIDPPAKYVERLVIQLMSATIASTKNLILISPKVKNWYADSGATNHVTSDYTNLTHPTEYGGTKLVTVGNGNTLKIASVGNTVLTNGKHLLNL

Query:  KNILHVPDIAKNLVSVSKLAKDNNVFVEFHEDYCLVKDKGSEQTVLKGTLKDGLYHLEETTVASHDAVPVNSAACPLKEAVNINKENESALSPSRISINS
         +IL+VP I KNL+SVSKLA DNN+ VEF E+ C VKDK + + +L+G LKDGLY L E   +++ ++                                
Subjt:  KNILHVPDIAKNLVSVSKLAKDNNVFVEFHEDYCLVKDKGSEQTVLKGTLKDGLYHLEETTVASHDAVPVNSAACPLKEAVNINKENESALSPSRISINS

Query:  AVTKTTSHRRLGHSSSRIFYLITKGCNISFK-SDATEFCHACQLGKSHRLLFSSSESRALKAFELIHTDVWGPTSMLSSAGFRYYVLFLDDFSRYVWIYP
           K + HR+LGH ++++  ++ K CN+    SD   FC ACQ GK H L F +S S A +  EL+HTDVWGP  ++SS+GF+YYV F+DDF+R+ WIYP
Subjt:  AVTKTTSHRRLGHSSSRIFYLITKGCNISFK-SDATEFCHACQLGKSHRLLFSSSESRALKAFELIHTDVWGPTSMLSSAGFRYYVLFLDDFSRYVWIYP

Query:  LRLKGDTMVAFEHFMQMMHTQFNSSIKAIQSD
        L+ K DT  AF  F  M+  QF+  IK IQ D
Subjt:  LRLKGDTMVAFEHFMQMMHTQFNSSIKAIQSD

PNX78574.1 retrovirus-related Pol polyprotein from transposon TNT 1-94 [Trifolium pratense]8.7e-7835.43Show/hide
Query:  ANATNNFPA--ATRMPNFNSPPLNQL-LNQITTIKIEGYLLGHKICPPMFIRQNPSNASNETSIAEASSSQVTTSDPVTFATTEIINPEYEAWLVVDQLL
        +N  N+ P+  +  +   N P    L L  I   +++GY+LG K CP  FI           + AEAS  +              INP++  W   DQ +
Subjt:  ANATNNFPA--ATRMPNFNSPPLNQL-LNQITTIKIEGYLLGHKICPPMFIRQNPSNASNETSIAEASSSQVTTSDPVTFATTEIINPEYEAWLVVDQLL

Query:  LGWLYNSITPEIATQLMGFERSKDLWDAIQELFGVQSRAEEDYLRQTFQQTRKGNNTMSEYLRLMKMHSNNLGQAGSPGPSKALISQVLLGLDENYNAIV
        LGWL N++T   A+QL+  E SK LW+  Q L    +R+   YLR  F  TRKG   M +YL  MK  ++ L  AGSP  +  LI Q L GLD +YN IV
Subjt:  LGWLYNSITPEIATQLMGFERSKDLWDAIQELFGVQSRAEEDYLRQTFQQTRKGNNTMSEYLRLMKMHSNNLGQAGSPGPSKALISQVLLGLDENYNAIV

Query:  ATLQRKPDISWLHHNTQ----RHKIRVAAEINTLNVEAISVEI-----------------------------EVRAEEEIDPPAKYVERLVIQL-----M
          L  + ++SW+    Q      ++      N LN  A +                                + R   +I           I+       
Subjt:  ATLQRKPDISWLHHNTQ----RHKIRVAAEINTLNVEAISVEI-----------------------------EVRAEEEIDPPAKYVERLVIQL-----M

Query:  SATIASTKNLILISPKVKN-------------WYADSGATNHVTSDYTNLTHPTEYGGTKLVTVGNGNTLKIASVGNTVLTNGKHLLNLKNILHVPDIAK
        S T +S  N  +   +  N             WY DSGA+NHVT         TE  G   + VGNG  LKI + G++ L N    LNL ++L+VP I K
Subjt:  SATIASTKNLILISPKVKN-------------WYADSGATNHVTSDYTNLTHPTEYGGTKLVTVGNGNTLKIASVGNTVLTNGKHLLNLKNILHVPDIAK

Query:  NLVSVSKLAKDNNVFVEFHEDYCLVKDKGSEQTVLKGTLKDGLYHLEETTVASHDAVPVNSAACPLKEAVNINKENESALSPSRISINSAVTKTTSHRRL
        NL+SVSKL  DNN+ VEF  D C VKDK + + +L+G LKDGLY L      S+ +   N   C                            K + HR+L
Subjt:  NLVSVSKLAKDNNVFVEFHEDYCLVKDKGSEQTVLKGTLKDGLYHLEETTVASHDAVPVNSAACPLKEAVNINKENESALSPSRISINSAVTKTTSHRRL

Query:  GHSSSRIFYLITKGCNI-SFKSDATEFCHACQLGKSHRLLFSSSESRALKAFELIHTDVWGPTSMLSSAGFRYYVLFLDDFSRYVWIYPLRLKGDTMVAF
        GH S+ +   + K CN+ +  SD  +FC ACQLGKSH L F SS S A +  ELIHTDVWGP  + S +GF+YYV F+DD SR+ WIYPL+ K DT+ AF
Subjt:  GHSSSRIFYLITKGCNI-SFKSDATEFCHACQLGKSHRLLFSSSESRALKAFELIHTDVWGPTSMLSSAGFRYYVLFLDDFSRYVWIYPLRLKGDTMVAF

Query:  EHFMQMMHTQFNSSIKAIQSD
          F  M+  QFN  IK IQ D
Subjt:  EHFMQMMHTQFNSSIKAIQSD

PNX94503.1 putative retrotransposon Ty1-copia subclass protein, partial [Trifolium pratense]2.7e-7935.1Show/hide
Query:  NATNNFPA--ATRMPNFNSPPLNQL-LNQITTIKIEGYLLGHKICPPMFIRQNPSNASNETSIAEASSSQVTTSDPVTFATTEIINPEYEAWLVVDQLLL
        N  N+ P+  + ++   N P    L L  I   K +GY+LG K CP  F+          TSI                  TE INP+Y+ W   DQ LL
Subjt:  NATNNFPA--ATRMPNFNSPPLNQL-LNQITTIKIEGYLLGHKICPPMFIRQNPSNASNETSIAEASSSQVTTSDPVTFATTEIINPEYEAWLVVDQLLL

Query:  GWLYNSITPEIATQLMGFERSKDLWDAIQELFGVQSRAEEDYLRQTFQQTRKGNNTMSEYLRLMKMHSNNLGQAGSPGPSKALISQVLLGLDENYNAIVA
        GWL NS+T +IATQ++  E SK LWD  Q L G  +R+   YL+  F  T K    M +YL  MK  ++ L  AGSP  S  L+ Q L GLD  YN +V 
Subjt:  GWLYNSITPEIATQLMGFERSKDLWDAIQELFGVQSRAEEDYLRQTFQQTRKGNNTMSEYLRLMKMHSNNLGQAGSPGPSKALISQVLLGLDENYNAIVA

Query:  TLQRKPDISWLHHNTQ----RHKIRVAAEINTLNVEAISVEIEVRAE---------------------------EEIDPPA------------------K
         L  + +ISW+    Q      ++      N +N+ A S     + E                               PP                   +
Subjt:  TLQRKPDISWLHHNTQ----RHKIRVAAEINTLNVEAISVEIEVRAE---------------------------EEIDPPA------------------K

Query:  YVERLVIQLMSATIASTKNLILISP---KVKNWYADSGATNHVTSDYTNLTHPTEYGGTKLVTVGNGNTLKIASVGNTVLTNGKHLLNLKNILHVPDIAK
        + +    +   A    + +  + SP   +   WY DSGA+NHVT     L    E  G   + VGNG  LKI + G+T L +    +NL+N+L+VP+I K
Subjt:  YVERLVIQLMSATIASTKNLILISP---KVKNWYADSGATNHVTSDYTNLTHPTEYGGTKLVTVGNGNTLKIASVGNTVLTNGKHLLNLKNILHVPDIAK

Query:  NLVSVSKLAKDNNVFVEFHEDYCLVKDKGSEQTVLKGTLKDGLYHLEETTVASHDAVPVNSAACPLKEAVNINKENESALSPSRISINSAVTKTTSHRRL
        NL+SVSKL  DNN  VEF E+YC VKDK + + +LKG LKDGLY L     +++   P N   C                  + IS+     K   HR+L
Subjt:  NLVSVSKLAKDNNVFVEFHEDYCLVKDKGSEQTVLKGTLKDGLYHLEETTVASHDAVPVNSAACPLKEAVNINKENESALSPSRISINSAVTKTTSHRRL

Query:  GHSSSRIFYLITKGCNISFK-SDATEFCHACQLGKSHRLLFSSSESRALKAFELIHTDVWGPTSMLSSAGFRYYVLFLDDFSRYVWIYPLRLKGDTMVAF
        GH ++++   + K  N+    SD   FC ACQ GK H L F +S S A +  +LIHTDVWGP  +LS + F+YYV FLDDFSR+ WI+PL+ K +T+ AF
Subjt:  GHSSSRIFYLITKGCNISFK-SDATEFCHACQLGKSHRLLFSSSESRALKAFELIHTDVWGPTSMLSSAGFRYYVLFLDDFSRYVWIYPLRLKGDTMVAF

Query:  EHFMQMMHTQFNSSIKAIQSD
          F  ++  QFN  IK I+ D
Subjt:  EHFMQMMHTQFNSSIKAIQSD

PNY01489.1 copia-like polyprotein, partial [Trifolium pratense]1.8e-7834.73Show/hide
Query:  ANATNNFPA--ATRMPNFNSPPLNQL-LNQITTIKIEGYLLGHKICPPMFIRQNPSNASNETSIAEASSSQVTTSDPVTFATTEIINPEYEAWLVVDQLL
        +N  N+ P+  + ++   N P    L L  I   K +GY+LG K CP  F                     VT++D      ++ +NP+++ W+  DQ L
Subjt:  ANATNNFPA--ATRMPNFNSPPLNQL-LNQITTIKIEGYLLGHKICPPMFIRQNPSNASNETSIAEASSSQVTTSDPVTFATTEIINPEYEAWLVVDQLL

Query:  LGWLYNSITPEIATQLMGFERSKDLWDAIQELFGVQSRAEEDYLRQTFQQTRKGNNTMSEYLRLMKMHSNNLGQAGSPGPSKALISQVLLGLDENYNAIV
        LGWL NS+  +IATQL+  E SK LWD  Q L G  +++   YL+  F  TRKG   M EYL  MK  S+ L  +GSP  +  L+ Q L GLD  YN +V
Subjt:  LGWLYNSITPEIATQLMGFERSKDLWDAIQELFGVQSRAEEDYLRQTFQQTRKGNNTMSEYLRLMKMHSNNLGQAGSPGPSKALISQVLLGLDENYNAIV

Query:  ATLQRKPDISWLHHNTQRHKIRVAAEIN--------TLNVEA-ISVEIEVRAEE--------------------------------------EIDPPAKY
          L  + ++SW+  + Q   +   + ++        TLN  A  + + E R  +                                       +D   ++
Subjt:  ATLQRKPDISWLHHNTQRHKIRVAAEIN--------TLNVEA-ISVEIEVRAEE--------------------------------------EIDPPAKY

Query:  VERLVIQLMS--ATIASTKNLILISP---KVKNWYADSGATNHVTSDYTNLTHPTEYGGTKLVTVGNGNTLKIASVGNTVLTNGKHLLNLKNILHVPDIA
              +  S  A    + +  + SP   +   WY DSGA+NHVT          E+ G   + VGNG  LKI + G+T L    + LNL ++L+VP I 
Subjt:  VERLVIQLMS--ATIASTKNLILISP---KVKNWYADSGATNHVTSDYTNLTHPTEYGGTKLVTVGNGNTLKIASVGNTVLTNGKHLLNLKNILHVPDIA

Query:  KNLVSVSKLAKDNNVFVEFHEDYCLVKDKGSEQTVLKGTLKDGLYHLEETTVASHDAVPVNSAACPLKEAVNINKENESALSPSRISINSAVTKTTSHRR
        KNL+SVSKL  DNN+FVEF  + C VKDK + QT+LKG LKDGLY L + +  S                   NK+    +S           K + HR+
Subjt:  KNLVSVSKLAKDNNVFVEFHEDYCLVKDKGSEQTVLKGTLKDGLYHLEETTVASHDAVPVNSAACPLKEAVNINKENESALSPSRISINSAVTKTTSHRR

Query:  LGHSSSRIFYLITKGCNISFK-SDATEFCHACQLGKSHRLLFSSSESRALKAFELIHTDVWGPTSMLSSAGFRYYVLFLDDFSRYVWIYPLRLKGDTMVA
        LGH ++++   + K CN+    SD   FC ACQ GK H L F SS S   +   LIH+DVWGP  +LS +GF+YYV F+DDFSR+ WI+PL+ K DT+ A
Subjt:  LGHSSSRIFYLITKGCNISFK-SDATEFCHACQLGKSHRLLFSSSESRALKAFELIHTDVWGPTSMLSSAGFRYYVLFLDDFSRYVWIYPLRLKGDTMVA

Query:  FEHFMQMMHTQFNSSIKAIQSD
        F  F  +   QFN  IK IQ D
Subjt:  FEHFMQMMHTQFNSSIKAIQSD

TrEMBL top hitse value%identityAlignment
A0A2K3LCM1 Gag/pol polyprotein-maize retrotransposon Hopscotch (Fragment)2.5e-7832.91Show/hide
Query:  ANATNNFPA--ATRMPNFNSPP-LNQLLNQITTIKIEGYLLGHKICPPMFIRQNPSNASNETSIAEASSSQVTTSDPVTFATTEIINPEYEAWLVVDQLL
        +N  N+ P+  + ++   N P   + +L  I   +++GY+LG K CP  FI                     T +D     +++  NPE+E W   DQ L
Subjt:  ANATNNFPA--ATRMPNFNSPP-LNQLLNQITTIKIEGYLLGHKICPPMFIRQNPSNASNETSIAEASSSQVTTSDPVTFATTEIINPEYEAWLVVDQLL

Query:  LGWLYNSITPEIATQLMGFERSKDLWDAIQELFGVQSRAEEDYLRQTFQQTRKGNNTMSEYLRLMKMHSNNLGQAGSPGPSKALISQVLLGLDENYNAIV
        LGWL NS+T  IATQL+  E S  LWD  Q L G  +R++  YL+  F  TRKG   M +YL  MK  ++ L  AG+P  +  LI Q L GLD  YN +V
Subjt:  LGWLYNSITPEIATQLMGFERSKDLWDAIQELFGVQSRAEEDYLRQTFQQTRKGNNTMSEYLRLMKMHSNNLGQAGSPGPSKALISQVLLGLDENYNAIV

Query:  ATLQRKPDISWLHHNTQ----RHKIRVAAEINTLNVEA----------------------------------------------------------ISVE
          L  +  +SW+    Q     ++I     +  L + A                                                          I+++
Subjt:  ATLQRKPDISWLHHNTQ----RHKIRVAAEINTLNVEA----------------------------------------------------------ISVE

Query:  IEVRAEEEIDPPAKYVERLVIQLMSATIASTKNLILISPKVKNWYADSGATNHVTSDYTNLTHPTEYGGTKLVTVGNGNTLKIASVGNTVLTNGKHLLNL
           R ++                 +A +AS  ++     +  +WY DSGA+NHVT       + +E+ G   + VGNG  L+I + G++ L +    LNL
Subjt:  IEVRAEEEIDPPAKYVERLVIQLMSATIASTKNLILISPKVKNWYADSGATNHVTSDYTNLTHPTEYGGTKLVTVGNGNTLKIASVGNTVLTNGKHLLNL

Query:  KNILHVPDIAKNLVSVSKLAKDNNVFVEFHEDYCLVKDKGSEQTVLKGTLKDGLYHLEETTVASHDAVPVNSAACPLKEAVNINKENESALSPSRISINS
         +IL+VP I KNL+SVSKLA DNN+ VEF E+ C VKDK + + +L+G LKDGLY L E   +++ ++                                
Subjt:  KNILHVPDIAKNLVSVSKLAKDNNVFVEFHEDYCLVKDKGSEQTVLKGTLKDGLYHLEETTVASHDAVPVNSAACPLKEAVNINKENESALSPSRISINS

Query:  AVTKTTSHRRLGHSSSRIFYLITKGCNISFK-SDATEFCHACQLGKSHRLLFSSSESRALKAFELIHTDVWGPTSMLSSAGFRYYVLFLDDFSRYVWIYP
           K + HR+LGH ++++  ++ K CN+    SD   FC ACQ GK H L F +S S A +  EL+HTDVWGP  ++SS+GF+YYV F+DDF+R+ WIYP
Subjt:  AVTKTTSHRRLGHSSSRIFYLITKGCNISFK-SDATEFCHACQLGKSHRLLFSSSESRALKAFELIHTDVWGPTSMLSSAGFRYYVLFLDDFSRYVWIYP

Query:  LRLKGDTMVAFEHFMQMMHTQFNSSIKAIQSD
        L+ K DT  AF  F  M+  QF+  IK IQ D
Subjt:  LRLKGDTMVAFEHFMQMMHTQFNSSIKAIQSD

A0A2K3MUJ9 Putative retrotransposon Ty1-copia subclass protein (Fragment)1.3e-7935.1Show/hide
Query:  NATNNFPA--ATRMPNFNSPPLNQL-LNQITTIKIEGYLLGHKICPPMFIRQNPSNASNETSIAEASSSQVTTSDPVTFATTEIINPEYEAWLVVDQLLL
        N  N+ P+  + ++   N P    L L  I   K +GY+LG K CP  F+          TSI                  TE INP+Y+ W   DQ LL
Subjt:  NATNNFPA--ATRMPNFNSPPLNQL-LNQITTIKIEGYLLGHKICPPMFIRQNPSNASNETSIAEASSSQVTTSDPVTFATTEIINPEYEAWLVVDQLLL

Query:  GWLYNSITPEIATQLMGFERSKDLWDAIQELFGVQSRAEEDYLRQTFQQTRKGNNTMSEYLRLMKMHSNNLGQAGSPGPSKALISQVLLGLDENYNAIVA
        GWL NS+T +IATQ++  E SK LWD  Q L G  +R+   YL+  F  T K    M +YL  MK  ++ L  AGSP  S  L+ Q L GLD  YN +V 
Subjt:  GWLYNSITPEIATQLMGFERSKDLWDAIQELFGVQSRAEEDYLRQTFQQTRKGNNTMSEYLRLMKMHSNNLGQAGSPGPSKALISQVLLGLDENYNAIVA

Query:  TLQRKPDISWLHHNTQ----RHKIRVAAEINTLNVEAISVEIEVRAE---------------------------EEIDPPA------------------K
         L  + +ISW+    Q      ++      N +N+ A S     + E                               PP                   +
Subjt:  TLQRKPDISWLHHNTQ----RHKIRVAAEINTLNVEAISVEIEVRAE---------------------------EEIDPPA------------------K

Query:  YVERLVIQLMSATIASTKNLILISP---KVKNWYADSGATNHVTSDYTNLTHPTEYGGTKLVTVGNGNTLKIASVGNTVLTNGKHLLNLKNILHVPDIAK
        + +    +   A    + +  + SP   +   WY DSGA+NHVT     L    E  G   + VGNG  LKI + G+T L +    +NL+N+L+VP+I K
Subjt:  YVERLVIQLMSATIASTKNLILISP---KVKNWYADSGATNHVTSDYTNLTHPTEYGGTKLVTVGNGNTLKIASVGNTVLTNGKHLLNLKNILHVPDIAK

Query:  NLVSVSKLAKDNNVFVEFHEDYCLVKDKGSEQTVLKGTLKDGLYHLEETTVASHDAVPVNSAACPLKEAVNINKENESALSPSRISINSAVTKTTSHRRL
        NL+SVSKL  DNN  VEF E+YC VKDK + + +LKG LKDGLY L     +++   P N   C                  + IS+     K   HR+L
Subjt:  NLVSVSKLAKDNNVFVEFHEDYCLVKDKGSEQTVLKGTLKDGLYHLEETTVASHDAVPVNSAACPLKEAVNINKENESALSPSRISINSAVTKTTSHRRL

Query:  GHSSSRIFYLITKGCNISFK-SDATEFCHACQLGKSHRLLFSSSESRALKAFELIHTDVWGPTSMLSSAGFRYYVLFLDDFSRYVWIYPLRLKGDTMVAF
        GH ++++   + K  N+    SD   FC ACQ GK H L F +S S A +  +LIHTDVWGP  +LS + F+YYV FLDDFSR+ WI+PL+ K +T+ AF
Subjt:  GHSSSRIFYLITKGCNISFK-SDATEFCHACQLGKSHRLLFSSSESRALKAFELIHTDVWGPTSMLSSAGFRYYVLFLDDFSRYVWIYPLRLKGDTMVAF

Query:  EHFMQMMHTQFNSSIKAIQSD
          F  ++  QFN  IK I+ D
Subjt:  EHFMQMMHTQFNSSIKAIQSD

A0A2K3NEN7 Copia-like polyprotein (Fragment)8.5e-7934.73Show/hide
Query:  ANATNNFPA--ATRMPNFNSPPLNQL-LNQITTIKIEGYLLGHKICPPMFIRQNPSNASNETSIAEASSSQVTTSDPVTFATTEIINPEYEAWLVVDQLL
        +N  N+ P+  + ++   N P    L L  I   K +GY+LG K CP  F                     VT++D      ++ +NP+++ W+  DQ L
Subjt:  ANATNNFPA--ATRMPNFNSPPLNQL-LNQITTIKIEGYLLGHKICPPMFIRQNPSNASNETSIAEASSSQVTTSDPVTFATTEIINPEYEAWLVVDQLL

Query:  LGWLYNSITPEIATQLMGFERSKDLWDAIQELFGVQSRAEEDYLRQTFQQTRKGNNTMSEYLRLMKMHSNNLGQAGSPGPSKALISQVLLGLDENYNAIV
        LGWL NS+  +IATQL+  E SK LWD  Q L G  +++   YL+  F  TRKG   M EYL  MK  S+ L  +GSP  +  L+ Q L GLD  YN +V
Subjt:  LGWLYNSITPEIATQLMGFERSKDLWDAIQELFGVQSRAEEDYLRQTFQQTRKGNNTMSEYLRLMKMHSNNLGQAGSPGPSKALISQVLLGLDENYNAIV

Query:  ATLQRKPDISWLHHNTQRHKIRVAAEIN--------TLNVEA-ISVEIEVRAEE--------------------------------------EIDPPAKY
          L  + ++SW+  + Q   +   + ++        TLN  A  + + E R  +                                       +D   ++
Subjt:  ATLQRKPDISWLHHNTQRHKIRVAAEIN--------TLNVEA-ISVEIEVRAEE--------------------------------------EIDPPAKY

Query:  VERLVIQLMS--ATIASTKNLILISP---KVKNWYADSGATNHVTSDYTNLTHPTEYGGTKLVTVGNGNTLKIASVGNTVLTNGKHLLNLKNILHVPDIA
              +  S  A    + +  + SP   +   WY DSGA+NHVT          E+ G   + VGNG  LKI + G+T L    + LNL ++L+VP I 
Subjt:  VERLVIQLMS--ATIASTKNLILISP---KVKNWYADSGATNHVTSDYTNLTHPTEYGGTKLVTVGNGNTLKIASVGNTVLTNGKHLLNLKNILHVPDIA

Query:  KNLVSVSKLAKDNNVFVEFHEDYCLVKDKGSEQTVLKGTLKDGLYHLEETTVASHDAVPVNSAACPLKEAVNINKENESALSPSRISINSAVTKTTSHRR
        KNL+SVSKL  DNN+FVEF  + C VKDK + QT+LKG LKDGLY L + +  S                   NK+    +S           K + HR+
Subjt:  KNLVSVSKLAKDNNVFVEFHEDYCLVKDKGSEQTVLKGTLKDGLYHLEETTVASHDAVPVNSAACPLKEAVNINKENESALSPSRISINSAVTKTTSHRR

Query:  LGHSSSRIFYLITKGCNISFK-SDATEFCHACQLGKSHRLLFSSSESRALKAFELIHTDVWGPTSMLSSAGFRYYVLFLDDFSRYVWIYPLRLKGDTMVA
        LGH ++++   + K CN+    SD   FC ACQ GK H L F SS S   +   LIH+DVWGP  +LS +GF+YYV F+DDFSR+ WI+PL+ K DT+ A
Subjt:  LGHSSSRIFYLITKGCNISFK-SDATEFCHACQLGKSHRLLFSSSESRALKAFELIHTDVWGPTSMLSSAGFRYYVLFLDDFSRYVWIYPLRLKGDTMVA

Query:  FEHFMQMMHTQFNSSIKAIQSD
        F  F  +   QFN  IK IQ D
Subjt:  FEHFMQMMHTQFNSSIKAIQSD

A0A803PEH4 Uncharacterized protein2.3e-8433.63Show/hide
Query:  ANATNNFPAATRMPNFNSPP-LNQLLN-------------QITTI----KIEGYLLGHKICPPMFIRQNPSNASNETSIAEASSSQVTTSDPVTFATTEI
        A+++NN   A+++PN  +PP LNQ  +              ++TI    ++ GYL G  +CPP F                           V    T++
Subjt:  ANATNNFPAATRMPNFNSPP-LNQLLN-------------QITTI----KIEGYLLGHKICPPMFIRQNPSNASNETSIAEASSSQVTTSDPVTFATTEI

Query:  INPEYEAWLVVDQLLLGWLYNSITPEIATQLMGFERSKDLWDAIQELFGVQSRAEEDYLRQTFQQTRKGNNTMSEYLRLMKMHSNNLGQAGSPGPSKALI
         NPEYE W++ DQLL+GWLY+S+T  IAT++MG   + +L   ++ L+G  S+++ D  R   Q TRKG+  MSEYLR  K  SN L  AG P P   L+
Subjt:  INPEYEAWLVVDQLLLGWLYNSITPEIATQLMGFERSKDLWDAIQELFGVQSRAEEDYLRQTFQQTRKGNNTMSEYLRLMKMHSNNLGQAGSPGPSKALI

Query:  SQVLLGLDENYNAIVATLQRKPDISW---------------------LHHN---TQRHKIRVAAEINT--------------------LNVEAISVEIEV
        + VL GLD  Y +IV  ++ + + +W                     L+ N   +   +  +AA+ N                      N    S     
Subjt:  SQVLLGLDENYNAIVATLQRKPDISW---------------------LHHN---TQRHKIRVAAEINT--------------------LNVEAISVEIEV

Query:  R-------AEEEIDPPAKYVERLVI-------------------QLMSATIASTKNLILISPKVKN---WYADSGATNHVTSDYTNLTHPTEYGGTKLVT
        R       +        KY     +                   Q  +    +  +  + +P+V     W+ADSGA+NH+TSD  NLT   +Y G + V 
Subjt:  R-------AEEEIDPPAKYVERLVI-------------------QLMSATIASTKNLILISPKVKN---WYADSGATNHVTSDYTNLTHPTEYGGTKLVT

Query:  VGNGNTLKIASVGNTVLT--NGKHLLNLKNILHVPDIAKNLVSVSKLAKDNNVFVEFHEDYCLVKDKGSEQTVLKGTLKDGLYHLEET-TVASHDAVPVN
        VGNG+ L+I  +GN  L   +G +LL LK++L VP IAKNLVSVSKLA DNNV +EF+ ++CLVKDK +++ +L G LKD LY L+   T +SH     N
Subjt:  VGNGNTLKIASVGNTVLT--NGKHLLNLKNILHVPDIAKNLVSVSKLAKDNNVFVEFHEDYCLVKDKGSEQTVLKGTLKDGLYHLEET-TVASHDAVPVN

Query:  -SAACPLKEAVNINKENESALSPSRISINSAVTKTTSHRRLGHSSSRIFYLITKGCNISFKSDATE-FCHACQLGKSHRLLFSSSESRALKAFELIHTDV
          +A  +    N+N+    +L  S++ +         HRRLGH S ++   + +  N+S   +A +  C ACQ GK+H L F SS +RA    +LIHTD+
Subjt:  -SAACPLKEAVNINKENESALSPSRISINSAVTKTTSHRRLGHSSSRIFYLITKGCNISFKSDATE-FCHACQLGKSHRLLFSSSESRALKAFELIHTDV

Query:  WGPTSMLSSAGFRYYVLFLDDFSRYVWIYPLRLKGDTMVAFEHFMQMMHTQFNSSIKAIQSDN
        WGP  + S+    YY+ F+DD+SRY W+YPL+LK D + AF  F  ++  QF   IK+++SD+
Subjt:  WGPTSMLSSAGFRYYVLFLDDFSRYVWIYPLRLKGDTMVAFEHFMQMMHTQFNSSIKAIQSDN

A0A803PM38 Uncharacterized protein2.5e-7836.74Show/hide
Query:  INPEYEAWLVVDQLLLGWLYNSITPEIATQLMGFERSKDLWDAIQELFGVQSRAEEDYLRQTFQQTRKGNNTMSEYLRLMKMHSNNLGQAGSPGPSKALI
        +NP +E W+V DQLLLGWLY S+T  IA ++MG + S  LW A++ELFG  S+A+ D  R   Q  RKG  +M++YLR  +  ++ L  AG P P   L+
Subjt:  INPEYEAWLVVDQLLLGWLYNSITPEIATQLMGFERSKDLWDAIQELFGVQSRAEEDYLRQTFQQTRKGNNTMSEYLRLMKMHSNNLGQAGSPGPSKALI

Query:  SQVLLGLDENYNAIVATLQRKPDISW-------LHHNTQRHKIRVAAEINTLNVEAISVEIEV----------RAEEEIDPPAKYVERLVIQLMSATIAS
        S VL GLD  Y  +V  ++ +   +W       L  +++  ++   +  + L    ++    +          R     +    +         S     
Subjt:  SQVLLGLDENYNAIVATLQRKPDISW-------LHHNTQRHKIRVAAEINTLNVEAISVEIEV----------RAEEEIDPPAKYVERLVIQLMSATIAS

Query:  TKNLILISPKVKNWYADS-------GATNHVTSDYTNLTHPTEYGGTKLVTVGNGNTLKIASVG-NTVLTNGKHLLNLKNILHVPDIAKNLVSVSKLAKD
          +    + +V   Y  S       GA+NH+TS+   +    EY G + VTV NGN L I  +G  ++ T     L LK ILHVP I KNL+S+SKL  D
Subjt:  TKNLILISPKVKNWYADS-------GATNHVTSDYTNLTHPTEYGGTKLVTVGNGNTLKIASVG-NTVLTNGKHLLNLKNILHVPDIAKNLVSVSKLAKD

Query:  NNVFVEFHEDYCLVKDKGSEQTVLKGTLKDGLYHLEETTVASHDAVPVNSAACPLKEAVNINKENESALSPSRISINSAVTKTTSHRRLGHSSSRIFYLI
        NNV VEF  D C VKDK + Q VLKG LKDGLY  +  T ++       S +CP   +  +    ES ++    +      K   HRRLGH S R+   +
Subjt:  NNVFVEFHEDYCLVKDKGSEQTVLKGTLKDGLYHLEETTVASHDAVPVNSAACPLKEAVNINKENESALSPSRISINSAVTKTTSHRRLGHSSSRIFYLI

Query:  TKGCNISFKSDATEFCHACQLGKSHRLLFSSSESRALKAFELIHTDVWGPTSMLSSAGFRYYVLFLDDFSRYVWIYPLRLKGDTMVAFEHFMQMMHTQFN
            N+   + +  FC ACQLGKSH L F  +  RA    EL+HTD+WGP+ ++S+  FRYY+ F+DDFSRY WIYPL+ K + + AF  F  ++  QFN
Subjt:  TKGCNISFKSDATEFCHACQLGKSHRLLFSSSESRALKAFELIHTDVWGPTSMLSSAGFRYYVLFLDDFSRYVWIYPLRLKGDTMVAFEHFMQMMHTQFN

Query:  SSIKAIQSD
        S +K +Q+D
Subjt:  SSIKAIQSD

SwissProt top hitse value%identityAlignment
P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-941.9e-2227.85Show/hide
Query:  ISPKVKNWYADSGATNHVTSDYTNLTHPTEYGGTKLVTVGNGNTLKIASVGN-TVLTNGKHLLNLKNILHVPDIAKNLVSVSKLAKDNNVFVEFHEDYCL
        +S     W  D+ A++H T    +L      G    V +GN +  KIA +G+  + TN    L LK++ HVPD+  NL+S   L +D       ++ + L
Subjt:  ISPKVKNWYADSGATNHVTSDYTNLTHPTEYGGTKLVTVGNGNTLKIASVGN-TVLTNGKHLLNLKNILHVPDIAKNLVSVSKLAKDNNVFVEFHEDYCL

Query:  VKDKGSEQTVLKGTLKDGLYHLEETTVASHDAVPVNSAACPLKEAVNINKENESALSPSRISINSAVTKTTSHRRLGHSSSRIFYLITKGCNISF-KSDA
         K       + KG  +  LY               N+  C          + E   +   IS++        H+R+GH S +   ++ K   IS+ K   
Subjt:  VKDKGSEQTVLKGTLKDGLYHLEETTVASHDAVPVNSAACPLKEAVNINKENESALSPSRISINSAVTKTTSHRRLGHSSSRIFYLITKGCNISF-KSDA

Query:  TEFCHACQLGKSHRLLFSSSESRALKAFELIHTDVWGPTSMLSSAGFRYYVLFLDDFSRYVWIYPLRLKGDTMVAFEHFMQMMHTQFNSSIKAIQSDN
         + C  C  GK HR+ F +S  R L   +L+++DV GP  + S  G +Y+V F+DD SR +W+Y L+ K      F+ F  ++  +    +K ++SDN
Subjt:  TEFCHACQLGKSHRLLFSSSESRALKAFELIHTDVWGPTSMLSSAGFRYYVLFLDDFSRYVWIYPLRLKGDTMVAFEHFMQMMHTQFNSSIKAIQSDN

Q07791 Transposon Ty2-DR3 Gag-Pol polyprotein1.9e-1126.09Show/hide
Query:  DSGATNHVTSDYTNLTHPTEYGGTKLVTVGNGNTLKIASVGNTVLTNGKHLLNLKNILHVPDIAKNLVSVSKLAKDNNVFVEFHEDYCLVKD--KGSEQT
        DSGA+  +      L H T      +V       + I ++GN               LH P+IA +L+S+S+LA  N          C  ++  + S+ T
Subjt:  DSGATNHVTSDYTNLTHPTEYGGTKLVTVGNGNTLKIASVGNTVLTNGKHLLNLKNILHVPDIAKNLVSVSKLAKDNNVFVEFHEDYCLVKD--KGSEQT

Query:  VLKGTLKDGLYH--LEETTVASHDAVPVNSAACPLKEAVNINKENESALSPSRISINSAVTKTTSHRRLGHSSSR----------IFYLITKGCNISFKS
        VL   +K G ++   ++  + SH +             + IN  N+S       S+N        HR LGH++ R          + YL  K  +I + +
Subjt:  VLKGTLKDGLYH--LEETTVASHDAVPVNSAACPLKEAVNINKENESALSPSRISINSAVTKTTSHRRLGHSSSR----------IFYLITKGCNISFKS

Query:  DATEFCHACQLGKS--HRLLFSS--SESRALKAFELIHTDVWGPTSMLSSAGFRYYVLFLDDFSRYVWIYPL--RLKGDTMVAFEHFMQMMHTQFNSSIK
         +T  C  C +GKS  HR +  S      + + F+ +HTD++GP   L  +   Y++ F D+ +R+ W+YPL  R +   +  F   +  +  QFN+ + 
Subjt:  DATEFCHACQLGKS--HRLLFSS--SESRALKAFELIHTDVWGPTSMLSSAGFRYYVLFLDDFSRYVWIYPL--RLKGDTMVAFEHFMQMMHTQFNSSIK

Query:  AIQSDNES---NRYLAPFSGSR
         IQ D  S   N+ L  F  +R
Subjt:  AIQSDNES---NRYLAPFSGSR

Q12491 Transposon Ty2-B Gag-Pol polyprotein2.5e-1126.09Show/hide
Query:  DSGATNHVTSDYTNLTHPTEYGGTKLVTVGNGNTLKIASVGNTVLTNGKHLLNLKNILHVPDIAKNLVSVSKLAKDNNVFVEFHEDYCLVKD--KGSEQT
        DSGA+  +      L H T      +V       + I ++GN               LH P+IA +L+S+S+LA  N          C  ++  + S+ T
Subjt:  DSGATNHVTSDYTNLTHPTEYGGTKLVTVGNGNTLKIASVGNTVLTNGKHLLNLKNILHVPDIAKNLVSVSKLAKDNNVFVEFHEDYCLVKD--KGSEQT

Query:  VLKGTLKDGLYH--LEETTVASHDAVPVNSAACPLKEAVNINKENESALSPSRISINSAVTKTTSHRRLGHSSSR----------IFYLITKGCNISFKS
        VL   +K G ++   ++  + SH +             + IN  N+S       S+N        HR LGH++ R          + YL  K  +I + +
Subjt:  VLKGTLKDGLYH--LEETTVASHDAVPVNSAACPLKEAVNINKENESALSPSRISINSAVTKTTSHRRLGHSSSR----------IFYLITKGCNISFKS

Query:  DATEFCHACQLGKS--HRLLFSS--SESRALKAFELIHTDVWGPTSMLSSAGFRYYVLFLDDFSRYVWIYPL--RLKGDTMVAFEHFMQMMHTQFNSSIK
         +T  C  C +GKS  HR +  S      + + F+ +HTD++GP   L  +   Y++ F D+ +R+ W+YPL  R +   +  F   +  +  QFN+ + 
Subjt:  DATEFCHACQLGKS--HRLLFSS--SESRALKAFELIHTDVWGPTSMLSSAGFRYYVLFLDDFSRYVWIYPL--RLKGDTMVAFEHFMQMMHTQFNSSIK

Query:  AIQSDNES---NRYLAPFSGSR
         IQ D  S   N+ L  F  +R
Subjt:  AIQSDNES---NRYLAPFSGSR

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE15.0e-4426.81Show/hide
Query:  TTSDPVTFATTEI--INPEYEAWLVVDQLLLGWLYNSITPEIATQLMGFERSKDLWDAIQELFGVQSRAEEDYLRQTFQQTRKGNNTMSEYLRLMKMHSN
        TT  P T  T     +NP+Y  W   D+L+   +  +I+  +   +     +  +W+ +++++   S      LR   +Q  KG  T+ +Y++ +    +
Subjt:  TTSDPVTFATTEI--INPEYEAWLVVDQLLLGWLYNSITPEIATQLMGFERSKDLWDAIQELFGVQSRAEEDYLRQTFQQTRKGNNTMSEYLRLMKMHSN

Query:  NLGQAGSPGPSKALISQVLLGLDENYNAIVATLQRK---PDISWLHHNTQRHKIRV----AAEINTLNVEAIS----------------VEIEVRAEEEI
         L   G P      + +VL  L E Y  ++  +  K   P ++ +H     H+ ++    +A +  +   A+S                   + R     
Subjt:  NLGQAGSPGPSKALISQVLLGLDENYNAIVATLQRK---PDISWLHHNTQRHKIRV----AAEINTLNVEAIS----------------VEIEVRAEEEI

Query:  DPP-------------------------------AKYVERLVIQLMSATIAS----------TKNLILISP-KVKNWYADSGATNHVTSDYTNLTHPTEY
          P                               AK   +L   L S                 NL L SP    NW  DSGAT+H+TSD+ NL+    Y
Subjt:  DPP-------------------------------AKYVERLVIQLMSATIAS----------TKNLILISP-KVKNWYADSGATNHVTSDYTNLTHPTEY

Query:  GGTKLVTVGNGNTLKIASVGNTVLTNGKHLLNLKNILHVPDIAKNLVSVSKLAKDNNVFVEFHEDYCLVKDKGSEQTVLKGTLKDGLYHLEETTVASHDA
         G   V V +G+T+ I+  G+T L+     LNL NIL+VP+I KNL+SV +L   N V VEF      VKD  +   +L+G  KD LY   E  +AS  +
Subjt:  GGTKLVTVGNGNTLKIASVGNTVLTNGKHLLNLKNILHVPDIAKNLVSVSKLAKDNNVFVEFHEDYCLVKDKGSEQTVLKGTLKDGLYHLEETTVASHDA

Query:  VPVNSAACPLKEAVNINKENESALSPSRISINSAVTKTTSHRRLGHSSSRIFYLITKGCNISFKSDATEF--CHACQLGKSHRLLFSSSESRALKAFELI
         PV+  A P                      +S  T ++ H RLGH +  I   +    ++S  + + +F  C  C + KS+++ FS S   + +  E I
Subjt:  VPVNSAACPLKEAVNINKENESALSPSRISINSAVTKTTSHRRLGHSSSRIFYLITKGCNISFKSDATEF--CHACQLGKSHRLLFSSSESRALKAFELI

Query:  HTDVWGPTSMLSSAGFRYYVLFLDDFSRYVWIYPLRLKGDTMVAFEHFMQMMHTQFNSSIKAIQSDN
        ++DVW  + +LS   +RYYV+F+D F+RY W+YPL+ K      F  F  ++  +F + I    SDN
Subjt:  HTDVWGPTSMLSSAGFRYYVLFLDDFSRYVWIYPLRLKGDTMVAFEHFMQMMHTQFNSSIKAIQSDN

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE25.4e-3833.99Show/hide
Query:  NLILISP-KVKNWYADSGATNHVTSDYTNLTHPTEYGGTKLVTVGNGNTLKIASVGNTVLTNGKHLLNLKNILHVPDIAKNLVSVSKLAKDNNVFVEFHE
        NL + SP    NW  DSGAT+H+TSD+ NL+    Y G   V + +G+T+ I   G+  L      L+L  +L+VP+I KNL+SV +L   N V VEF  
Subjt:  NLILISP-KVKNWYADSGATNHVTSDYTNLTHPTEYGGTKLVTVGNGNTLKIASVGNTVLTNGKHLLNLKNILHVPDIAKNLVSVSKLAKDNNVFVEFHE

Query:  DYCLVKDKGSEQTVLKGTLKDGLYHLEETTVASHDAVPVNSAACPLKEAVNINKENESALSPSRISINSAVTKTTSHRRLGHSSSRIFYLITKGCNISFK
            VKD  +   +L+G  KD LY   E  +AS  AV + ++ C                        S  T ++ H RLGH S  I   +    ++   
Subjt:  DYCLVKDKGSEQTVLKGTLKDGLYHLEETTVASHDAVPVNSAACPLKEAVNINKENESALSPSRISINSAVTKTTSHRRLGHSSSRIFYLITKGCNISFK

Query:  SDATEF--CHACQLGKSHRLLFSSSESRALKAFELIHTDVWGPTSMLSSAGFRYYVLFLDDFSRYVWIYPLRLKGDTMVAFEHFMQMMHTQFNSSIKAIQ
        + + +   C  C + KSH++ FS+S   + K  E I++DVW  + +LS   +RYYV+F+D F+RY W+YPL+ K      F  F  ++  +F + I  + 
Subjt:  SDATEF--CHACQLGKSHRLLFSSSESRALKAFELIHTDVWGPTSMLSSAGFRYYVLFLDDFSRYVWIYPLRLKGDTMVAFEHFMQMMHTQFNSSIKAIQ

Query:  SDN
        SDN
Subjt:  SDN

Arabidopsis top hitse value%identityAlignment
ATMG00300.1 Gag-Pol-related retrotransposon family protein3.3e-0636.23Show/hide
Query:  HRRLGHSSSR-IFYLITKGCNISFKSDATEFCHACQLGKSHRLLFSSSESRALKAFELIHTDVWGPTSM
        H RL H S R +  L+ KG   S K  + +FC  C  GK+HR+ FS+ +       + +H+D+WG  S+
Subjt:  HRRLGHSSSR-IFYLITKGCNISFKSDATEFCHACQLGKSHRLLFSSSESRALKAFELIHTDVWGPTSM


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCCAACGCTACCAACAATTTTCCAGCAGCCACCAGAATGCCCAACTTCAACAGTCCTCCTTTAAATCAATTGCTGAACCAAATTACCACTATTAAAATTGAAGGATA
CCTTTTAGGGCACAAAATCTGCCCACCCATGTTTATTCGACAGAACCCCAGCAATGCAAGCAATGAAACATCTATTGCTGAAGCATCCAGCTCCCAAGTCACAACCAGCG
ACCCAGTGACATTCGCGACAACCGAAATCATCAATCCAGAGTATGAAGCCTGGCTTGTCGTTGATCAGTTGCTCCTCGGATGGCTTTACAACTCCATAACACCAGAAATT
GCCACTCAACTCATGGGGTTTGAACGATCGAAAGATTTATGGGATGCTATCCAAGAATTATTCGGGGTTCAGTCTAGAGCAGAGGAAGACTACCTCCGCCAGACATTCCA
ACAGACCAGAAAAGGTAACAATACTATGTCTGAGTATTTACGCTTGATGAAAATGCACTCTAACAATTTAGGGCAAGCTGGAAGTCCTGGCCCATCAAAAGCATTAATAT
CCCAGGTTCTGCTTGGGCTTGATGAAAATTACAATGCCATCGTTGCCACCCTTCAAAGGAAACCTGACATCAGTTGGCTCCACCACAACACCCAACGTCACAAAATCAGG
GTCGCGGCGGAAATCAACACACTCAACGTGGAGGCAATCAGTGTGGAAATAGAGGTCAGGGCAGAGGAAGAAATTGACCCACCTGCCAAGTATGTGGAAAGATTGGTCAT
ACAGCTGATGTCTGCTACAATCGCTTCAACAAAGAATTTAATCCTAATCAGCCCCAAAGTCAAGAATTGGTATGCCGACAGTGGAGCAACCAACCATGTGACATCCGACT
ACACCAATCTCACCCACCCAACTGAGTATGGAGGTACGAAGTTAGTTACTGTTGGCAATGGTAATACACTTAAAATTGCATCTGTTGGTAATACTGTTCTGACTAATGGG
AAACACTTGTTGAACTTAAAGAATATACTGCATGTTCCGGATATAGCTAAGAACCTTGTTAGTGTATCAAAACTTGCTAAGGACAATAACGTTTTTGTTGAATTTCATGA
GGATTATTGTCTTGTAAAGGACAAGGGTTCAGAGCAAACAGTTTTGAAGGGCACACTTAAAGATGGACTTTATCATTTAGAGGAAACCACGGTGGCGTCTCATGATGCGG
TTCCGGTGAATTCTGCTGCTTGCCCACTCAAGGAGGCTGTGAACATAAATAAGGAGAATGAATCTGCCTTATCTCCTTCTCGTATTTCTATTAATAGTGCTGTAACTAAA
ACTACTTCGCATCGACGGTTGGGACATTCTTCCTCGAGAATTTTTTATTTGATCACCAAAGGTTGTAATATCTCGTTTAAAAGCGATGCAACTGAATTTTGTCATGCATG
TCAATTGGGTAAATCACACCGTCTTCTATTTTCTTCATCTGAATCTCGTGCTTTGAAAGCTTTTGAGTTAATTCATACAGATGTGTGGGGCCCTACTTCTATGCTATCTT
CTGCTGGTTTTCGATATTATGTATTGTTTCTTGATGACTTCAGCCGATATGTCTGGATTTACCCTCTTCGTTTAAAAGGCGATACAATGGTAGCTTTTGAACACTTCATG
CAGATGATGCACACTCAGTTTAATAGCAGCATCAAAGCTATACAATCTGATAACGAGTCGAATCGCTACCTTGCTCCTTTTTCGGGTTCAAGAGCAAACCCGATCCTTTC
CACGGGTTAG
mRNA sequenceShow/hide mRNA sequence
ATGGCCAACGCTACCAACAATTTTCCAGCAGCCACCAGAATGCCCAACTTCAACAGTCCTCCTTTAAATCAATTGCTGAACCAAATTACCACTATTAAAATTGAAGGATA
CCTTTTAGGGCACAAAATCTGCCCACCCATGTTTATTCGACAGAACCCCAGCAATGCAAGCAATGAAACATCTATTGCTGAAGCATCCAGCTCCCAAGTCACAACCAGCG
ACCCAGTGACATTCGCGACAACCGAAATCATCAATCCAGAGTATGAAGCCTGGCTTGTCGTTGATCAGTTGCTCCTCGGATGGCTTTACAACTCCATAACACCAGAAATT
GCCACTCAACTCATGGGGTTTGAACGATCGAAAGATTTATGGGATGCTATCCAAGAATTATTCGGGGTTCAGTCTAGAGCAGAGGAAGACTACCTCCGCCAGACATTCCA
ACAGACCAGAAAAGGTAACAATACTATGTCTGAGTATTTACGCTTGATGAAAATGCACTCTAACAATTTAGGGCAAGCTGGAAGTCCTGGCCCATCAAAAGCATTAATAT
CCCAGGTTCTGCTTGGGCTTGATGAAAATTACAATGCCATCGTTGCCACCCTTCAAAGGAAACCTGACATCAGTTGGCTCCACCACAACACCCAACGTCACAAAATCAGG
GTCGCGGCGGAAATCAACACACTCAACGTGGAGGCAATCAGTGTGGAAATAGAGGTCAGGGCAGAGGAAGAAATTGACCCACCTGCCAAGTATGTGGAAAGATTGGTCAT
ACAGCTGATGTCTGCTACAATCGCTTCAACAAAGAATTTAATCCTAATCAGCCCCAAAGTCAAGAATTGGTATGCCGACAGTGGAGCAACCAACCATGTGACATCCGACT
ACACCAATCTCACCCACCCAACTGAGTATGGAGGTACGAAGTTAGTTACTGTTGGCAATGGTAATACACTTAAAATTGCATCTGTTGGTAATACTGTTCTGACTAATGGG
AAACACTTGTTGAACTTAAAGAATATACTGCATGTTCCGGATATAGCTAAGAACCTTGTTAGTGTATCAAAACTTGCTAAGGACAATAACGTTTTTGTTGAATTTCATGA
GGATTATTGTCTTGTAAAGGACAAGGGTTCAGAGCAAACAGTTTTGAAGGGCACACTTAAAGATGGACTTTATCATTTAGAGGAAACCACGGTGGCGTCTCATGATGCGG
TTCCGGTGAATTCTGCTGCTTGCCCACTCAAGGAGGCTGTGAACATAAATAAGGAGAATGAATCTGCCTTATCTCCTTCTCGTATTTCTATTAATAGTGCTGTAACTAAA
ACTACTTCGCATCGACGGTTGGGACATTCTTCCTCGAGAATTTTTTATTTGATCACCAAAGGTTGTAATATCTCGTTTAAAAGCGATGCAACTGAATTTTGTCATGCATG
TCAATTGGGTAAATCACACCGTCTTCTATTTTCTTCATCTGAATCTCGTGCTTTGAAAGCTTTTGAGTTAATTCATACAGATGTGTGGGGCCCTACTTCTATGCTATCTT
CTGCTGGTTTTCGATATTATGTATTGTTTCTTGATGACTTCAGCCGATATGTCTGGATTTACCCTCTTCGTTTAAAAGGCGATACAATGGTAGCTTTTGAACACTTCATG
CAGATGATGCACACTCAGTTTAATAGCAGCATCAAAGCTATACAATCTGATAACGAGTCGAATCGCTACCTTGCTCCTTTTTCGGGTTCAAGAGCAAACCCGATCCTTTC
CACGGGTTAG
Protein sequenceShow/hide protein sequence
MANATNNFPAATRMPNFNSPPLNQLLNQITTIKIEGYLLGHKICPPMFIRQNPSNASNETSIAEASSSQVTTSDPVTFATTEIINPEYEAWLVVDQLLLGWLYNSITPEI
ATQLMGFERSKDLWDAIQELFGVQSRAEEDYLRQTFQQTRKGNNTMSEYLRLMKMHSNNLGQAGSPGPSKALISQVLLGLDENYNAIVATLQRKPDISWLHHNTQRHKIR
VAAEINTLNVEAISVEIEVRAEEEIDPPAKYVERLVIQLMSATIASTKNLILISPKVKNWYADSGATNHVTSDYTNLTHPTEYGGTKLVTVGNGNTLKIASVGNTVLTNG
KHLLNLKNILHVPDIAKNLVSVSKLAKDNNVFVEFHEDYCLVKDKGSEQTVLKGTLKDGLYHLEETTVASHDAVPVNSAACPLKEAVNINKENESALSPSRISINSAVTK
TTSHRRLGHSSSRIFYLITKGCNISFKSDATEFCHACQLGKSHRLLFSSSESRALKAFELIHTDVWGPTSMLSSAGFRYYVLFLDDFSRYVWIYPLRLKGDTMVAFEHFM
QMMHTQFNSSIKAIQSDNESNRYLAPFSGSRANPILSTG