; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0005120 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0005120
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon RE1
Genome locationchr6:10933013..10934675
RNA-Seq ExpressionLag0005120
SyntenyLag0005120
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR025724 - GAG-pre-integrase domain
IPR036397 - Ribonuclease H superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
GAU19483.1 hypothetical protein TSUD_77270 [Trifolium subterraneum]1.2e-9740.95Show/hide
Query:  DNAKDLWEAIQELFGVQSRAEEDYLRQIFQQTRKGSSKMADYLRIMKTHADNLGQAGSPVTTRSLISQVLLGLDEEYNPVVATLQGRVGISWAELQADLL
        + +K LW+  Q L G  +R++  YL+  F   RKG  KM DYL  MK   D L  AG+PV+T  LI Q L GLD EYNPVV  L  +  +SW +LQA LL
Subjt:  DNAKDLWEAIQELFGVQSRAEEDYLRQIFQQTRKGSSKMADYLRIMKTHADNLGQAGSPVTTRSLISQVLLGLDEEYNPVVATLQGRVGISWAELQADLL

Query:  VFEKRLEFQNSHKTNLSFSHNAAV-NMVNTRGQNSGNQRQSQQFNGNRSTFSNYPNQRGGNGGRNRGRGRWQGNNRPICQVCGKTGHSALICYHRFDKEY
         FE R+E Q ++ TNL+ +  A V N  + RG++S N  +               N RG  GGR RG+    G N   CQVCG + H A+ C+HRFDK Y
Subjt:  VFEKRLEFQNSHKTNLSFSHNAAV-NMVNTRGQNSGNQRQSQQFNGNRSTFSNYPNQRGGNGGRNRGRGRWQGNNRPICQVCGKTGHSALICYHRFDKEY

Query:  GNGVARNSNHNARTENSKVITSVGNSNPTPHAFIAGQSSIPFVASPETVVDPHWYVDSGASSHVTGNHNNISHPVEYGGKDIVVIGNGHQLNI-------
               SNH+A  +        G+ N    AF+A Q+S         V D  WY DSGAS+HVT          E+ GK+ +V+GNG +L I       
Subjt:  GNGVARNSNHNARTENSKVITSVGNSNPTPHAFIAGQSSIPFVASPETVVDPHWYVDSGASSHVTGNHNNISHPVEYGGKDIVVIGNGHQLNI-------

Query:  ----------------SHVVSVSKLAQDNVVFLEFHVDSCLVKDTHTGKVLLTGVLKDGLYKLKTSGAVTGSALEFLENSSKSANNIVCSTSVRSGSNVN
                         +++SVSKLA DN + +EF  + C VKD  TGKV+L G+LKDGLY+L                               SG+  N
Subjt:  ----------------SHVVSVSKLAQDNVVFLEFHVDSCLVKDTHTGKVLLTGVLKDGLYKLKTSGAVTGSALEFLENSSKSANNIVCSTSVRSGSNVN

Query:  VNV-VESKAVWHCRLGHPSFKVFDDIVKRCKLPVRVNDIFLFCEACKFGKLHALPFQNSDSRAEVSFALVHSDLWGPAPVLSSDGFRYYVHFLDDYSRHI
         +  V  K  WH RLGHP+ KV D +++ CK+ V  +D F FCEAC++GK+H LPF++S S A+    LVH+D+WGPAP+++S GF+YYVHF+DD+SR  
Subjt:  VNV-VESKAVWHCRLGHPSFKVFDDIVKRCKLPVRVNDIFLFCEACKFGKLHALPFQNSDSRAEVSFALVHSDLWGPAPVLSSDGFRYYVHFLDDYSRHI

Query:  WVYPLKQKSETLSAFLHFLNIVKNQFSSTIKIFRSDNRGEFTKVHQV
        W+YPLKQKSET+ AF+ F N+ +NQF+  IK+ + D  GE+  V ++
Subjt:  WVYPLKQKSETLSAFLHFLNIVKNQFSSTIKIFRSDNRGEFTKVHQV

KYP50444.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan]4.1e-9539.45Show/hide
Query:  DNAKDLWEAIQELFGVQSRAEEDYLRQIFQQTRKGSSKMADYLRIMKTHADNLGQAGSPVTTRSLISQVLLGLDEEYNPVVATLQGRVGISWAELQADLL
        + ++ +WE  Q L G  +R+   +L+  F +TRKG  KM +YL  MK  AD+L  AGS V+T  L++Q L GLD EYNP+V  L  +  ++W E+QA LL
Subjt:  DNAKDLWEAIQELFGVQSRAEEDYLRQIFQQTRKGSSKMADYLRIMKTHADNLGQAGSPVTTRSLISQVLLGLDEEYNPVVATLQGRVGISWAELQADLL

Query:  VFEKRLE-FQNSHKTNLSFSHNAAVNMVNTRGQNSGNQRQSQQFNGNRSTFSNYPNQRGGNGGRNRGRGRWQGNNRPICQVCGKTGHSALICYHRFDKEY
         +E RLE   N     L+ S N +  + N RG       +S  F G R    N    RG  GGR RGR      +R +CQVC K GH+A  CYHRF+K Y
Subjt:  VFEKRLE-FQNSHKTNLSFSHNAAVNMVNTRGQNSGNQRQSQQFNGNRSTFSNYPNQRGGNGGRNRGRGRWQGNNRPICQVCGKTGHSALICYHRFDKEY

Query:  GNGVARNSNHNARTENSKVITSVGNSNPTPHAFIAGQSSIPFVASPETVVDPHWYVDSGASSHVTGNHNNISHPVEYGGKDIVVIGNGHQLNI-------
           + +NS+   ++E  K      N+               +VASP TV D  WY DSGAS+HVT + N +    E  GK  + +GNG  L I       
Subjt:  GNGVARNSNHNARTENSKVITSVGNSNPTPHAFIAGQSSIPFVASPETVVDPHWYVDSGASSHVTGNHNNISHPVEYGGKDIVVIGNGHQLNI-------

Query:  --------------------SHVVSVSKLAQDNVVFLEFHVDSCLVKDTHTGKVLLTGVLKDGLYKLKTSGAVTGSALEFLENSSKSANNIVCSTSVRSG
                             +++S+SKL  DN +++EFH  +C VKD  TG++LL G +KDGLY+L                          STS    
Subjt:  --------------------SHVVSVSKLAQDNVVFLEFHVDSCLVKDTHTGKVLLTGVLKDGLYKLKTSGAVTGSALEFLENSSKSANNIVCSTSVRSG

Query:  SNVNVNVVESKAVWHCRLGHPSFKVFDDIVKRCKLPVRVNDIFLFCEACKFGKLHALPFQNSDSRAEVSFALVHSDLWGPAPVLSSDGFRYYVHFLDDYS
         +V  ++   K  WH +LGHP+ KV ++++K C +     + F FCEAC+FGK H LPFQNS S A+    LVHSD+WGPAP+ S  GF+YYV FLDD+S
Subjt:  SNVNVNVVESKAVWHCRLGHPSFKVFDDIVKRCKLPVRVNDIFLFCEACKFGKLHALPFQNSDSRAEVSFALVHSDLWGPAPVLSSDGFRYYVHFLDDYS

Query:  RHIWVYPLKQKSETLSAFLHFLNIVKNQFSSTIKIFRSDNRGEFTKVHQV
        R  W+YPLKQKS+   AF+ F N+V+NQF+  IK  + D  GEF  + +V
Subjt:  RHIWVYPLKQKSETLSAFLHFLNIVKNQFSSTIKIFRSDNRGEFTKVHQV

PNX76291.1 gag/pol polyprotein - maize retrotransposon Hopscotch, partial [Trifolium pratense]4.9e-9640.07Show/hide
Query:  DNAKDLWEAIQELFGVQSRAEEDYLRQIFQQTRKGSSKMADYLRIMKTHADNLGQAGSPVTTRSLISQVLLGLDEEYNPVVATLQGRVGISWAELQADLL
        + +  LW+  Q L G  +R++  YL+  F  TRKG  KM DYL  MK  AD L  AG+P++T  LI Q L GLD EYNPVV  L  +  +SW +LQA LL
Subjt:  DNAKDLWEAIQELFGVQSRAEEDYLRQIFQQTRKGSSKMADYLRIMKTHADNLGQAGSPVTTRSLISQVLLGLDEEYNPVVATLQGRVGISWAELQADLL

Query:  VFEKRLEFQNSHKTNLSFSHNAAVNMVNTRGQNSGNQRQSQQFNGNRSTFSNYPNQRGGNGGRNRGRGRWQGNNRPICQVCGKTGHSALICYHRFDKEYG
         FE R+E  NS  TNL+ +  A V   +    N  N   + + + N    SN+   RG  GGR RGR     + +  CQVCG   H A+ C++RFDK Y 
Subjt:  VFEKRLEFQNSHKTNLSFSHNAAVNMVNTRGQNSGNQRQSQQFNGNRSTFSNYPNQRGGNGGRNRGRGRWQGNNRPICQVCGKTGHSALICYHRFDKEYG

Query:  NGVARNSNHNARTENSKVITSVGNSNPTPHAFIAGQSSIPFVASPETVVDPHWYVDSGASSHVTGNHNNISHPVEYGGKDIVVIGNGHQLNI--------
              SNH+A  +        G+ N    AF+A Q+SI          D  WY DSGAS+HVT   +   +  E+ GK+ +++GNG +L I        
Subjt:  NGVARNSNHNARTENSKVITSVGNSNPTPHAFIAGQSSIPFVASPETVVDPHWYVDSGASSHVTGNHNNISHPVEYGGKDIVVIGNGHQLNI--------

Query:  ---------------SHVVSVSKLAQDNVVFLEFHVDSCLVKDTHTGKVLLTGVLKDGLYKLKTSGAVTGSALEFLENSSKSANNIVCSTSVRSGSNVNV
                        +++SVSKLA DN + +EF  + C VKD  TGK +L G+LKDGLY+L                                 S+  V
Subjt:  ---------------SHVVSVSKLAQDNVVFLEFHVDSCLVKDTHTGKVLLTGVLKDGLYKLKTSGAVTGSALEFLENSSKSANNIVCSTSVRSGSNVNV

Query:  NVVESKAVWHCRLGHPSFKVFDDIVKRCKLPVRVNDIFLFCEACKFGKLHALPFQNSDSRAEVSFALVHSDLWGPAPVLSSDGFRYYVHFLDDYSRHIWV
        ++ ES   WH +LGHP+ KV D ++K C + +  +D F FCEAC++GK+H LPF+ S S A+    LVH+D+WGPAP++SS GF+YYVHF+DD++R  W+
Subjt:  NVVESKAVWHCRLGHPSFKVFDDIVKRCKLPVRVNDIFLFCEACKFGKLHALPFQNSDSRAEVSFALVHSDLWGPAPVLSSDGFRYYVHFLDDYSRHIWV

Query:  YPLKQKSETLSAFLHFLNIVKNQFSSTIKIFRSDNRGEFTKVHQ
        YPLKQKS+T  AF+ F N+V+NQFS  IK  + D  GE+  V +
Subjt:  YPLKQKSETLSAFLHFLNIVKNQFSSTIKIFRSDNRGEFTKVHQ

PNX78574.1 retrovirus-related Pol polyprotein from transposon TNT 1-94 [Trifolium pratense]2.8e-9640.69Show/hide
Query:  DNAKDLWEAIQELFGVQSRAEEDYLRQIFQQTRKGSSKMADYLRIMKTHADNLGQAGSPVTTRSLISQVLLGLDEEYNPVVATLQGRVGISWAELQADLL
        + +K LWE  Q L    +R+   YLR  F  TRKG  KM DYL  MK  AD L  AGSP+T   LI Q L GLD +YNP+V  L  ++ +SW +LQA LL
Subjt:  DNAKDLWEAIQELFGVQSRAEEDYLRQIFQQTRKGSSKMADYLRIMKTHADNLGQAGSPVTTRSLISQVLLGLDEEYNPVVATLQGRVGISWAELQADLL

Query:  VFEKRLEFQNSHKTNLSFSHNAAVNMVNTRGQNSGNQRQSQQFNGNRSTF--SNYPNQRGGNGGRNRGRGRWQGNNRPICQVCGKTGHSALICYHRFDKE
         FE RL+  NS     + + NA  N+ N + Q  GN         +R ++  S++ N RGG     RG+GR    +  ICQVC K GH+A+ C HR+DK 
Subjt:  VFEKRLEFQNSHKTNLSFSHNAAVNMVNTRGQNSGNQRQSQQFNGNRSTF--SNYPNQRGGNGGRNRGRGRWQGNNRPICQVCGKTGHSALICYHRFDKE

Query:  YGNGVARNSN-HNARTENSKVITSVGNSNPTPHAFIAGQSSIPFVASPETVVDPHWYVDSGASSHVTGNHNNISHPVEYGGKDIVVIGNGHQLNI-----
        Y      N+N    RT N+                        F+AS     D  WY DSGAS+HVT   +      E  GK+ +++GNG +L I     
Subjt:  YGNGVARNSN-HNARTENSKVITSVGNSNPTPHAFIAGQSSIPFVASPETVVDPHWYVDSGASSHVTGNHNNISHPVEYGGKDIVVIGNGHQLNI-----

Query:  ------------------SHVVSVSKLAQDNVVFLEFHVDSCLVKDTHTGKVLLTGVLKDGLYKLKTSGAVTGSALEFLENSSKSANNIVCSTSVRSGSN
                           +++SVSKL  DN + +EF  D C VKD  TGKVLL G+LKDGLY+              L N S   N   C         
Subjt:  ------------------SHVVSVSKLAQDNVVFLEFHVDSCLVKDTHTGKVLLTGVLKDGLYKLKTSGAVTGSALEFLENSSKSANNIVCSTSVRSGSN

Query:  VNVNVVESKAVWHCRLGHPSFKVFDDIVKRCKLPVRVNDIFLFCEACKFGKLHALPFQNSDSRAEVSFALVHSDLWGPAPVLSSDGFRYYVHFLDDYSRH
        V ++V ES   WH +LGHPS  V D ++K C +    +D F FCEAC+ GK H LPF++S S A+    L+H+D+WGPAP+ S  GF+YYVHF+DD SR 
Subjt:  VNVNVVESKAVWHCRLGHPSFKVFDDIVKRCKLPVRVNDIFLFCEACKFGKLHALPFQNSDSRAEVSFALVHSDLWGPAPVLSSDGFRYYVHFLDDYSRH

Query:  IWVYPLKQKSETLSAFLHFLNIVKNQFSSTIKIFRSDNRGEFTKVHQV
         W+YPLKQKS+T+ AF+ F N+V+NQF+  IKI + D  GEF  V +V
Subjt:  IWVYPLKQKSETLSAFLHFLNIVKNQFSSTIKIFRSDNRGEFTKVHQV

PNX94503.1 putative retrotransposon Ty1-copia subclass protein, partial [Trifolium pratense]3.9e-9340.18Show/hide
Query:  DNAKDLWEAIQELFGVQSRAEEDYLRQIFQQTRKGSSKMADYLRIMKTHADNLGQAGSPVTTRSLISQVLLGLDEEYNPVVATLQGRVGISWAELQADLL
        + +K LW+  Q L G  +R+   YL+  F  T K   KM  YL  MK  AD L  AGSP+++  L+ Q L GLD EYNPVV  L  +  ISW + QA LL
Subjt:  DNAKDLWEAIQELFGVQSRAEEDYLRQIFQQTRKGSSKMADYLRIMKTHADNLGQAGSPVTTRSLISQVLLGLDEEYNPVVATLQGRVGISWAELQADLL

Query:  VFEKRL-EFQNSHKTNLSFSHNAAVNMVNTRGQNSGNQRQSQQFNGNRSTFSNYPNQRGGNGGRNRGRGRWQGNNRPICQVCGKTGHSALICYHRFDKEY
         FE RL +  N +  NL+ S N A     ++ ++ GN+  S+   G R +     N RG  GG  RGR R     RPICQ+CGK GH+A  CY+RFDK Y
Subjt:  VFEKRL-EFQNSHKTNLSFSHNAAVNMVNTRGQNSGNQRQSQQFNGNRSTFSNYPNQRGGNGGRNRGRGRWQGNNRPICQVCGKTGHSALICYHRFDKEY

Query:  GNGVARNSNHNARTENSKVITSVGNSNPTPHAFIAGQSSIPFVASPETVVDPHWYVDSGASSHVTGNHNNISHPVEYGGKDIVVIGNGHQLNI-------
                NH A  E S             H+         FVASP    D  WY DSGAS+HVT     +    E  GK+ +++GNG +L I       
Subjt:  GNGVARNSNHNARTENSKVITSVGNSNPTPHAFIAGQSSIPFVASPETVVDPHWYVDSGASSHVTGNHNNISHPVEYGGKDIVVIGNGHQLNI-------

Query:  ----------------SHVVSVSKLAQDNVVFLEFHVDSCLVKDTHTGKVLLTGVLKDGLYKLKTSGAVTGSALEFLENSSKSANNIVCSTSVRSGSNVN
                         +++SVSKL  DN   +EF  + C VKD  TGK LL G LKDGLY+L               N     N   C+          
Subjt:  ----------------SHVVSVSKLAQDNVVFLEFHVDSCLVKDTHTGKVLLTGVLKDGLYKLKTSGAVTGSALEFLENSSKSANNIVCSTSVRSGSNVN

Query:  VNVVESKAVWHCRLGHPSFKVFDDIVKRCKLPVRVNDIFLFCEACKFGKLHALPFQNSDSRAEVSFALVHSDLWGPAPVLSSDGFRYYVHFLDDYSRHIW
           +  K +WH +LGHP+ KV + ++K   + +  +D F FCEAC+FGKLH LPF+ S S A+    L+H+D+WGPAP+LS   F+YYVHFLDD+SR  W
Subjt:  VNVVESKAVWHCRLGHPSFKVFDDIVKRCKLPVRVNDIFLFCEACKFGKLHALPFQNSDSRAEVSFALVHSDLWGPAPVLSSDGFRYYVHFLDDYSRHIW

Query:  VYPLKQKSETLSAFLHFLNIVKNQFSSTIKIFRSDNRGEFTKVHQ
        ++PLKQKSET+ AF  F N+V+NQF+  IK+ R D  GE+  V +
Subjt:  VYPLKQKSETLSAFLHFLNIVKNQFSSTIKIFRSDNRGEFTKVHQ

TrEMBL top hitse value%identityAlignment
A0A151S6M8 Retrovirus-related Pol polyprotein from transposon TNT 1-942.0e-9539.45Show/hide
Query:  DNAKDLWEAIQELFGVQSRAEEDYLRQIFQQTRKGSSKMADYLRIMKTHADNLGQAGSPVTTRSLISQVLLGLDEEYNPVVATLQGRVGISWAELQADLL
        + ++ +WE  Q L G  +R+   +L+  F +TRKG  KM +YL  MK  AD+L  AGS V+T  L++Q L GLD EYNP+V  L  +  ++W E+QA LL
Subjt:  DNAKDLWEAIQELFGVQSRAEEDYLRQIFQQTRKGSSKMADYLRIMKTHADNLGQAGSPVTTRSLISQVLLGLDEEYNPVVATLQGRVGISWAELQADLL

Query:  VFEKRLE-FQNSHKTNLSFSHNAAVNMVNTRGQNSGNQRQSQQFNGNRSTFSNYPNQRGGNGGRNRGRGRWQGNNRPICQVCGKTGHSALICYHRFDKEY
         +E RLE   N     L+ S N +  + N RG       +S  F G R    N    RG  GGR RGR      +R +CQVC K GH+A  CYHRF+K Y
Subjt:  VFEKRLE-FQNSHKTNLSFSHNAAVNMVNTRGQNSGNQRQSQQFNGNRSTFSNYPNQRGGNGGRNRGRGRWQGNNRPICQVCGKTGHSALICYHRFDKEY

Query:  GNGVARNSNHNARTENSKVITSVGNSNPTPHAFIAGQSSIPFVASPETVVDPHWYVDSGASSHVTGNHNNISHPVEYGGKDIVVIGNGHQLNI-------
           + +NS+   ++E  K      N+               +VASP TV D  WY DSGAS+HVT + N +    E  GK  + +GNG  L I       
Subjt:  GNGVARNSNHNARTENSKVITSVGNSNPTPHAFIAGQSSIPFVASPETVVDPHWYVDSGASSHVTGNHNNISHPVEYGGKDIVVIGNGHQLNI-------

Query:  --------------------SHVVSVSKLAQDNVVFLEFHVDSCLVKDTHTGKVLLTGVLKDGLYKLKTSGAVTGSALEFLENSSKSANNIVCSTSVRSG
                             +++S+SKL  DN +++EFH  +C VKD  TG++LL G +KDGLY+L                          STS    
Subjt:  --------------------SHVVSVSKLAQDNVVFLEFHVDSCLVKDTHTGKVLLTGVLKDGLYKLKTSGAVTGSALEFLENSSKSANNIVCSTSVRSG

Query:  SNVNVNVVESKAVWHCRLGHPSFKVFDDIVKRCKLPVRVNDIFLFCEACKFGKLHALPFQNSDSRAEVSFALVHSDLWGPAPVLSSDGFRYYVHFLDDYS
         +V  ++   K  WH +LGHP+ KV ++++K C +     + F FCEAC+FGK H LPFQNS S A+    LVHSD+WGPAP+ S  GF+YYV FLDD+S
Subjt:  SNVNVNVVESKAVWHCRLGHPSFKVFDDIVKRCKLPVRVNDIFLFCEACKFGKLHALPFQNSDSRAEVSFALVHSDLWGPAPVLSSDGFRYYVHFLDDYS

Query:  RHIWVYPLKQKSETLSAFLHFLNIVKNQFSSTIKIFRSDNRGEFTKVHQV
        R  W+YPLKQKS+   AF+ F N+V+NQF+  IK  + D  GEF  + +V
Subjt:  RHIWVYPLKQKSETLSAFLHFLNIVKNQFSSTIKIFRSDNRGEFTKVHQV

A0A2K3LCM1 Gag/pol polyprotein-maize retrotransposon Hopscotch (Fragment)2.4e-9640.07Show/hide
Query:  DNAKDLWEAIQELFGVQSRAEEDYLRQIFQQTRKGSSKMADYLRIMKTHADNLGQAGSPVTTRSLISQVLLGLDEEYNPVVATLQGRVGISWAELQADLL
        + +  LW+  Q L G  +R++  YL+  F  TRKG  KM DYL  MK  AD L  AG+P++T  LI Q L GLD EYNPVV  L  +  +SW +LQA LL
Subjt:  DNAKDLWEAIQELFGVQSRAEEDYLRQIFQQTRKGSSKMADYLRIMKTHADNLGQAGSPVTTRSLISQVLLGLDEEYNPVVATLQGRVGISWAELQADLL

Query:  VFEKRLEFQNSHKTNLSFSHNAAVNMVNTRGQNSGNQRQSQQFNGNRSTFSNYPNQRGGNGGRNRGRGRWQGNNRPICQVCGKTGHSALICYHRFDKEYG
         FE R+E  NS  TNL+ +  A V   +    N  N   + + + N    SN+   RG  GGR RGR     + +  CQVCG   H A+ C++RFDK Y 
Subjt:  VFEKRLEFQNSHKTNLSFSHNAAVNMVNTRGQNSGNQRQSQQFNGNRSTFSNYPNQRGGNGGRNRGRGRWQGNNRPICQVCGKTGHSALICYHRFDKEYG

Query:  NGVARNSNHNARTENSKVITSVGNSNPTPHAFIAGQSSIPFVASPETVVDPHWYVDSGASSHVTGNHNNISHPVEYGGKDIVVIGNGHQLNI--------
              SNH+A  +        G+ N    AF+A Q+SI          D  WY DSGAS+HVT   +   +  E+ GK+ +++GNG +L I        
Subjt:  NGVARNSNHNARTENSKVITSVGNSNPTPHAFIAGQSSIPFVASPETVVDPHWYVDSGASSHVTGNHNNISHPVEYGGKDIVVIGNGHQLNI--------

Query:  ---------------SHVVSVSKLAQDNVVFLEFHVDSCLVKDTHTGKVLLTGVLKDGLYKLKTSGAVTGSALEFLENSSKSANNIVCSTSVRSGSNVNV
                        +++SVSKLA DN + +EF  + C VKD  TGK +L G+LKDGLY+L                                 S+  V
Subjt:  ---------------SHVVSVSKLAQDNVVFLEFHVDSCLVKDTHTGKVLLTGVLKDGLYKLKTSGAVTGSALEFLENSSKSANNIVCSTSVRSGSNVNV

Query:  NVVESKAVWHCRLGHPSFKVFDDIVKRCKLPVRVNDIFLFCEACKFGKLHALPFQNSDSRAEVSFALVHSDLWGPAPVLSSDGFRYYVHFLDDYSRHIWV
        ++ ES   WH +LGHP+ KV D ++K C + +  +D F FCEAC++GK+H LPF+ S S A+    LVH+D+WGPAP++SS GF+YYVHF+DD++R  W+
Subjt:  NVVESKAVWHCRLGHPSFKVFDDIVKRCKLPVRVNDIFLFCEACKFGKLHALPFQNSDSRAEVSFALVHSDLWGPAPVLSSDGFRYYVHFLDDYSRHIWV

Query:  YPLKQKSETLSAFLHFLNIVKNQFSSTIKIFRSDNRGEFTKVHQ
        YPLKQKS+T  AF+ F N+V+NQFS  IK  + D  GE+  V +
Subjt:  YPLKQKSETLSAFLHFLNIVKNQFSSTIKIFRSDNRGEFTKVHQ

A0A2K3LJ49 Retrovirus-related Pol polyprotein from transposon TNT 1-941.4e-9640.69Show/hide
Query:  DNAKDLWEAIQELFGVQSRAEEDYLRQIFQQTRKGSSKMADYLRIMKTHADNLGQAGSPVTTRSLISQVLLGLDEEYNPVVATLQGRVGISWAELQADLL
        + +K LWE  Q L    +R+   YLR  F  TRKG  KM DYL  MK  AD L  AGSP+T   LI Q L GLD +YNP+V  L  ++ +SW +LQA LL
Subjt:  DNAKDLWEAIQELFGVQSRAEEDYLRQIFQQTRKGSSKMADYLRIMKTHADNLGQAGSPVTTRSLISQVLLGLDEEYNPVVATLQGRVGISWAELQADLL

Query:  VFEKRLEFQNSHKTNLSFSHNAAVNMVNTRGQNSGNQRQSQQFNGNRSTF--SNYPNQRGGNGGRNRGRGRWQGNNRPICQVCGKTGHSALICYHRFDKE
         FE RL+  NS     + + NA  N+ N + Q  GN         +R ++  S++ N RGG     RG+GR    +  ICQVC K GH+A+ C HR+DK 
Subjt:  VFEKRLEFQNSHKTNLSFSHNAAVNMVNTRGQNSGNQRQSQQFNGNRSTF--SNYPNQRGGNGGRNRGRGRWQGNNRPICQVCGKTGHSALICYHRFDKE

Query:  YGNGVARNSN-HNARTENSKVITSVGNSNPTPHAFIAGQSSIPFVASPETVVDPHWYVDSGASSHVTGNHNNISHPVEYGGKDIVVIGNGHQLNI-----
        Y      N+N    RT N+                        F+AS     D  WY DSGAS+HVT   +      E  GK+ +++GNG +L I     
Subjt:  YGNGVARNSN-HNARTENSKVITSVGNSNPTPHAFIAGQSSIPFVASPETVVDPHWYVDSGASSHVTGNHNNISHPVEYGGKDIVVIGNGHQLNI-----

Query:  ------------------SHVVSVSKLAQDNVVFLEFHVDSCLVKDTHTGKVLLTGVLKDGLYKLKTSGAVTGSALEFLENSSKSANNIVCSTSVRSGSN
                           +++SVSKL  DN + +EF  D C VKD  TGKVLL G+LKDGLY+              L N S   N   C         
Subjt:  ------------------SHVVSVSKLAQDNVVFLEFHVDSCLVKDTHTGKVLLTGVLKDGLYKLKTSGAVTGSALEFLENSSKSANNIVCSTSVRSGSN

Query:  VNVNVVESKAVWHCRLGHPSFKVFDDIVKRCKLPVRVNDIFLFCEACKFGKLHALPFQNSDSRAEVSFALVHSDLWGPAPVLSSDGFRYYVHFLDDYSRH
        V ++V ES   WH +LGHPS  V D ++K C +    +D F FCEAC+ GK H LPF++S S A+    L+H+D+WGPAP+ S  GF+YYVHF+DD SR 
Subjt:  VNVNVVESKAVWHCRLGHPSFKVFDDIVKRCKLPVRVNDIFLFCEACKFGKLHALPFQNSDSRAEVSFALVHSDLWGPAPVLSSDGFRYYVHFLDDYSRH

Query:  IWVYPLKQKSETLSAFLHFLNIVKNQFSSTIKIFRSDNRGEFTKVHQV
         W+YPLKQKS+T+ AF+ F N+V+NQF+  IKI + D  GEF  V +V
Subjt:  IWVYPLKQKSETLSAFLHFLNIVKNQFSSTIKIFRSDNRGEFTKVHQV

A0A2K3MUJ9 Putative retrotransposon Ty1-copia subclass protein (Fragment)1.9e-9340.18Show/hide
Query:  DNAKDLWEAIQELFGVQSRAEEDYLRQIFQQTRKGSSKMADYLRIMKTHADNLGQAGSPVTTRSLISQVLLGLDEEYNPVVATLQGRVGISWAELQADLL
        + +K LW+  Q L G  +R+   YL+  F  T K   KM  YL  MK  AD L  AGSP+++  L+ Q L GLD EYNPVV  L  +  ISW + QA LL
Subjt:  DNAKDLWEAIQELFGVQSRAEEDYLRQIFQQTRKGSSKMADYLRIMKTHADNLGQAGSPVTTRSLISQVLLGLDEEYNPVVATLQGRVGISWAELQADLL

Query:  VFEKRL-EFQNSHKTNLSFSHNAAVNMVNTRGQNSGNQRQSQQFNGNRSTFSNYPNQRGGNGGRNRGRGRWQGNNRPICQVCGKTGHSALICYHRFDKEY
         FE RL +  N +  NL+ S N A     ++ ++ GN+  S+   G R +     N RG  GG  RGR R     RPICQ+CGK GH+A  CY+RFDK Y
Subjt:  VFEKRL-EFQNSHKTNLSFSHNAAVNMVNTRGQNSGNQRQSQQFNGNRSTFSNYPNQRGGNGGRNRGRGRWQGNNRPICQVCGKTGHSALICYHRFDKEY

Query:  GNGVARNSNHNARTENSKVITSVGNSNPTPHAFIAGQSSIPFVASPETVVDPHWYVDSGASSHVTGNHNNISHPVEYGGKDIVVIGNGHQLNI-------
                NH A  E S             H+         FVASP    D  WY DSGAS+HVT     +    E  GK+ +++GNG +L I       
Subjt:  GNGVARNSNHNARTENSKVITSVGNSNPTPHAFIAGQSSIPFVASPETVVDPHWYVDSGASSHVTGNHNNISHPVEYGGKDIVVIGNGHQLNI-------

Query:  ----------------SHVVSVSKLAQDNVVFLEFHVDSCLVKDTHTGKVLLTGVLKDGLYKLKTSGAVTGSALEFLENSSKSANNIVCSTSVRSGSNVN
                         +++SVSKL  DN   +EF  + C VKD  TGK LL G LKDGLY+L               N     N   C+          
Subjt:  ----------------SHVVSVSKLAQDNVVFLEFHVDSCLVKDTHTGKVLLTGVLKDGLYKLKTSGAVTGSALEFLENSSKSANNIVCSTSVRSGSNVN

Query:  VNVVESKAVWHCRLGHPSFKVFDDIVKRCKLPVRVNDIFLFCEACKFGKLHALPFQNSDSRAEVSFALVHSDLWGPAPVLSSDGFRYYVHFLDDYSRHIW
           +  K +WH +LGHP+ KV + ++K   + +  +D F FCEAC+FGKLH LPF+ S S A+    L+H+D+WGPAP+LS   F+YYVHFLDD+SR  W
Subjt:  VNVVESKAVWHCRLGHPSFKVFDDIVKRCKLPVRVNDIFLFCEACKFGKLHALPFQNSDSRAEVSFALVHSDLWGPAPVLSSDGFRYYVHFLDDYSRHIW

Query:  VYPLKQKSETLSAFLHFLNIVKNQFSSTIKIFRSDNRGEFTKVHQ
        ++PLKQKSET+ AF  F N+V+NQF+  IK+ R D  GE+  V +
Subjt:  VYPLKQKSETLSAFLHFLNIVKNQFSSTIKIFRSDNRGEFTKVHQ

A0A2Z6MBG6 Integrase catalytic domain-containing protein5.6e-9840.95Show/hide
Query:  DNAKDLWEAIQELFGVQSRAEEDYLRQIFQQTRKGSSKMADYLRIMKTHADNLGQAGSPVTTRSLISQVLLGLDEEYNPVVATLQGRVGISWAELQADLL
        + +K LW+  Q L G  +R++  YL+  F   RKG  KM DYL  MK   D L  AG+PV+T  LI Q L GLD EYNPVV  L  +  +SW +LQA LL
Subjt:  DNAKDLWEAIQELFGVQSRAEEDYLRQIFQQTRKGSSKMADYLRIMKTHADNLGQAGSPVTTRSLISQVLLGLDEEYNPVVATLQGRVGISWAELQADLL

Query:  VFEKRLEFQNSHKTNLSFSHNAAV-NMVNTRGQNSGNQRQSQQFNGNRSTFSNYPNQRGGNGGRNRGRGRWQGNNRPICQVCGKTGHSALICYHRFDKEY
         FE R+E Q ++ TNL+ +  A V N  + RG++S N  +               N RG  GGR RG+    G N   CQVCG + H A+ C+HRFDK Y
Subjt:  VFEKRLEFQNSHKTNLSFSHNAAV-NMVNTRGQNSGNQRQSQQFNGNRSTFSNYPNQRGGNGGRNRGRGRWQGNNRPICQVCGKTGHSALICYHRFDKEY

Query:  GNGVARNSNHNARTENSKVITSVGNSNPTPHAFIAGQSSIPFVASPETVVDPHWYVDSGASSHVTGNHNNISHPVEYGGKDIVVIGNGHQLNI-------
               SNH+A  +        G+ N    AF+A Q+S         V D  WY DSGAS+HVT          E+ GK+ +V+GNG +L I       
Subjt:  GNGVARNSNHNARTENSKVITSVGNSNPTPHAFIAGQSSIPFVASPETVVDPHWYVDSGASSHVTGNHNNISHPVEYGGKDIVVIGNGHQLNI-------

Query:  ----------------SHVVSVSKLAQDNVVFLEFHVDSCLVKDTHTGKVLLTGVLKDGLYKLKTSGAVTGSALEFLENSSKSANNIVCSTSVRSGSNVN
                         +++SVSKLA DN + +EF  + C VKD  TGKV+L G+LKDGLY+L                               SG+  N
Subjt:  ----------------SHVVSVSKLAQDNVVFLEFHVDSCLVKDTHTGKVLLTGVLKDGLYKLKTSGAVTGSALEFLENSSKSANNIVCSTSVRSGSNVN

Query:  VNV-VESKAVWHCRLGHPSFKVFDDIVKRCKLPVRVNDIFLFCEACKFGKLHALPFQNSDSRAEVSFALVHSDLWGPAPVLSSDGFRYYVHFLDDYSRHI
         +  V  K  WH RLGHP+ KV D +++ CK+ V  +D F FCEAC++GK+H LPF++S S A+    LVH+D+WGPAP+++S GF+YYVHF+DD+SR  
Subjt:  VNV-VESKAVWHCRLGHPSFKVFDDIVKRCKLPVRVNDIFLFCEACKFGKLHALPFQNSDSRAEVSFALVHSDLWGPAPVLSSDGFRYYVHFLDDYSRHI

Query:  WVYPLKQKSETLSAFLHFLNIVKNQFSSTIKIFRSDNRGEFTKVHQV
        W+YPLKQKSET+ AF+ F N+ +NQF+  IK+ + D  GE+  V ++
Subjt:  WVYPLKQKSETLSAFLHFLNIVKNQFSSTIKIFRSDNRGEFTKVHQV

SwissProt top hitse value%identityAlignment
P04146 Copia protein2.9e-1121.11Show/hide
Query:  AKDLWEAIQELFGVQSRAEEDYLRQIFQQTRKGSS-KMADYLRIMKTHADNLGQAGSPVTTRSLISQVLLGLDEEYNPVVATLQ--GRVGISWAELQADL
        A+ + E +  ++  +S A +  LR+     +  S   +  +  I       L  AG+ +     IS +L+ L   Y+ ++  ++      ++ A ++  L
Subjt:  AKDLWEAIQELFGVQSRAEEDYLRQIFQQTRKGSS-KMADYLRIMKTHADNLGQAGSPVTTRSLISQVLLGLDEEYNPVVATLQ--GRVGISWAELQADL

Query:  LVFEKRLEFQNSHKTNLSFSHNAAVNMVNTRGQNSGNQRQSQQFNGNRSTFSNYPNQRGGNGGRNRGRGRWQGNN--RPICQVCGKTGHSALICYHRFDK
        L  ++ ++ +N H        NA V+       N+ N  ++  F  NR T               + +  ++GN+  +  C  CG+ GH    C+H    
Subjt:  LVFEKRLEFQNSHKTNLSFSHNAAVNMVNTRGQNSGNQRQSQQFNGNRSTFSNYPNQRGGNGGRNRGRGRWQGNN--RPICQVCGKTGHSALICYHRFDK

Query:  EYGNGVARNSNHNARTENSKVITSVGNSNPTPH--AFIAGQSSIPFVASPETVVDPHWYVDSGASSHVTGNHNNISHPVEYGGKDIVVIGNGHQLNISHV
               +   +N   EN K + +      T H  AF+  +     V +   + +  + +DSGAS H+  + +  +  VE      + +    +   +  
Subjt:  EYGNGVARNSNHNARTENSKVITSVGNSNPTPH--AFIAGQSSIPFVASPETVVDPHWYVDSGASSHVTGNHNNISHPVEYGGKDIVVIGNGHQLNISHV

Query:  VSVSKLAQDNVVFLEFHVDSCLVKDTHTGKVLLTGVLKDGLYKL---KTSGAVTGSALEFLENSSKSANNIVCSTSVRSGSNVNVNVVESKAVWHCRLGH
          + +L  D+ + LE   D    K+   G ++    L++    +   K+   ++ + L  ++NS    N  V +    S   +N     +  +WH R GH
Subjt:  VSVSKLAQDNVVFLEFHVDSCLVKDTHTGKVLLTGVLKDGLYKL---KTSGAVTGSALEFLENSSKSANNIVCSTSVRSGSNVNVNVVESKAVWHCRLGH

Query:  PSFKVFDDIVKRCKLPVR--VNDIFLFCEACK---FGKLHALPFQNSDSRAEVSFAL--VHSDLWGPAPVLSSDGFRYYVHFLDDYSRHIWVYPLKQKSE
         S     +I ++     +  +N++ L CE C+    GK   LPF+    +  +   L  VHSD+ GP   ++ D   Y+V F+D ++ +   Y +K KS+
Subjt:  PSFKVFDDIVKRCKLPVR--VNDIFLFCEACK---FGKLHALPFQNSDSRAEVSFAL--VHSDLWGPAPVLSSDGFRYYVHFLDDYSRHIWVYPLKQKSE

Query:  TLSAFLHFLNIVKNQFSSTIKIFRSDNRGEF--TKVHQVC
          S F  F+   +  F+  +     DN  E+   ++ Q C
Subjt:  TLSAFLHFLNIVKNQFSSTIKIFRSDNRGEF--TKVHQVC

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-942.8e-2223.84Show/hide
Query:  DNAKDLWEAIQELFGVQSRAEEDYL-RQIFQQTRKGSSKMADYLRIMKTHADNLGQAGSPVTTRSLISQVLLGLDEEYNPVVAT-LQGRVGISWAELQAD
        D A+ +W  ++ L+  ++   + YL +Q++       +    +L +       L   G  +        +L  L   Y+ +  T L G+  I   ++ + 
Subjt:  DNAKDLWEAIQELFGVQSRAEEDYL-RQIFQQTRKGSSKMADYLRIMKTHADNLGQAGSPVTTRSLISQVLLGLDEEYNPVVAT-LQGRVGISWAELQAD

Query:  LLVFEKRLEFQNSHKTNLSFSHNAAVNMVNTRGQNSGNQRQSQQFNGNRSTFSNYPNQRGGNGGRNRGRGRWQGNNRPICQVCGKTGHSALICYH-RFDK
        LL+ EK                           +   NQ Q+    G   ++    N  G +G R + + R +   R  C  C + GH    C + R  K
Subjt:  LLVFEKRLEFQNSHKTNLSFSHNAAVNMVNTRGQNSGNQRQSQQFNGNRSTFSNYPNQRGGNGGRNRGRGRWQGNNRPICQVCGKTGHSALICYH-RFDK

Query:  EYGNGVARNSNHNARTENSKVITSVGNSNPTPHAFIAGQSSIPFVASPETVVDPHWYVDSGASSHVTGNHNNISHPVEYGGKDIVVIGNGHQLNISHVVS
           +G   + N  A  +N          N     FI  +     ++ PE+     W VD+ AS H T   +     V  G    V +GN          S
Subjt:  EYGNGVARNSNHNARTENSKVITSVGNSNPTPHAFIAGQSSIPFVASPETVVDPHWYVDSGASSHVTGNHNNISHPVEYGGKDIVVIGNGHQLNISHVVS

Query:  VSKLAQDNVVFLEFHVDSCLV-KDT-HTGKVLLTGVLKDGLYKLKTSGAVTGSALEFLENSSKSANNIVCSTSVRSGSNV---NVNVVE---SKAVWHCR
         SK+A    + ++ +V   LV KD  H   + +  +    L +               + S   A  +   T  R+ + +    +N  +   S  +WH R
Subjt:  VSKLAQDNVVFLEFHVDSCLV-KDT-HTGKVLLTGVLKDGLYKLKTSGAVTGSALEFLENSSKSANNIVCSTSVRSGSNV---NVNVVE---SKAVWHCR

Query:  LGHPSFKVFDDIVKRCKLPVRVNDIFLFCEACKFGKLHALPFQNSDSRAEVSFALVHSDLWGPAPVLSSDGFRYYVHFLDDYSRHIWVYPLKQKSETLSA
        +GH S K    + K+  +          C+ C FGK H + FQ S  R      LV+SD+ GP  + S  G +Y+V F+DD SR +WVY LK K +    
Subjt:  LGHPSFKVFDDIVKRCKLPVRVNDIFLFCEACKFGKLHALPFQNSDSRAEVSFALVHSDLWGPAPVLSSDGFRYYVHFLDDYSRHIWVYPLKQKSETLSA

Query:  FLHFLNIVKNQFSSTIKIFRSDNRGEFT--KVHQVCS
        F  F  +V+ +    +K  RSDN GE+T  +  + CS
Subjt:  FLHFLNIVKNQFSSTIKIFRSDNRGEFT--KVHQVCS

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE19.7e-4728.75Show/hide
Query:  AKDLWEAIQELFGVQSRAEEDYLRQIFQQTRKGSSKMADYLRIMKTHADNLGQAGSPVTTRSLISQVLLGLDEEYNPVVATLQGR-VGISWAELQADLLV
        A  +WE +++++   S      LR   +Q  KG+  + DY++ + T  D L   G P+     + +VL  L EEY PV+  +  +    +  E+   LL 
Subjt:  AKDLWEAIQELFGVQSRAEEDYLRQIFQQTRKGSSKMADYLRIMKTHADNLGQAGSPVTTRSLISQVLLGLDEEYNPVVATLQGR-VGISWAELQADLLV

Query:  FEKRLEFQNSHKTNLSFSHNAAVNMVNTRGQNSGNQRQSQQFNGNRSTFSNYPNQRGGNGGR--NRGRGRWQGNN---RPI---CQVCGKTGHSALICYH
         E ++    S  T +  + N AV+  NT   N+ N       NGNR+  + Y N+   N  +   +    +  NN   +P    CQ+CG  GHSA  C  
Subjt:  FEKRLEFQNSHKTNLSFSHNAAVNMVNTRGQNSGNQRQSQQFNGNRSTFSNYPNQRGGNGGR--NRGRGRWQGNN---RPI---CQVCGKTGHSALICYH

Query:  RFDKEYGNGVARNSNHNARTENSKVITSVGNSNPTPHAFIAGQSSIPF-VASPETVVDPHWYVDSGASSHVTGNHNNISHPVEYGGKDIVVIGNGHQLNI
                           ++    ++SV NS   P  F   Q      + SP +    +W +DSGA+ H+T + NN+S    Y G D V++ +G  + I
Subjt:  RFDKEYGNGVARNSNHNARTENSKVITSVGNSNPTPHAFIAGQSSIPF-VASPETVVDPHWYVDSGASSHVTGNHNNISHPVEYGGKDIVVIGNGHQLNI

Query:  SH---------------------------VVSVSKLAQDNVVFLEFHVDSCLVKDTHTGKVLLTGVLKDGLYKLKTSGAVTGSALEFLENSSKSANNIVC
        SH                           ++SV +L   N V +EF   S  VKD +TG  LL G  KD LY+   + +   S   F   SSK+ +    
Subjt:  SH---------------------------VVSVSKLAQDNVVFLEFHVDSCLVKDTHTGKVLLTGVLKDGLYKLKTSGAVTGSALEFLENSSKSANNIVC

Query:  STSVRSGSNVNVNVVESKAVWHCRLGHPSFKVFDDIVKRCKLPV-RVNDIFLFCEACKFGKLHALPFQNSDSRAEVSFALVHSDLWGPAPVLSSDGFRYY
                          + WH RLGHP+  + + ++    L V   +  FL C  C   K + +PF  S   +      ++SD+W  +P+LS D +RYY
Subjt:  STSVRSGSNVNVNVVESKAVWHCRLGHPSFKVFDDIVKRCKLPV-RVNDIFLFCEACKFGKLHALPFQNSDSRAEVSFALVHSDLWGPAPVLSSDGFRYY

Query:  VHFLDDYSRHIWVYPLKQKSETLSAFLHFLNIVKNQFSSTIKIFRSDNRGEFTKVHQVCS
        V F+D ++R+ W+YPLKQKS+    F+ F N+++N+F + I  F SDN GEF  + +  S
Subjt:  VHFLDDYSRHIWVYPLKQKSETLSAFLHFLNIVKNQFSSTIKIFRSDNRGEFTKVHQVCS

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE26.5e-4327.75Show/hide
Query:  AKDLWEAIQELFGVQSRAEEDYLRQIFQQTRKGSSKMADYLRIMKTHADNLGQAGSPVTTRSLISQVLLGLDEEYNPVVATLQGR-VGISWAELQADLLV
        A  +WE +++++   S      LR I                   T  D L   G P+     + +VL  L ++Y PV+  +  +    S  E+   L+ 
Subjt:  AKDLWEAIQELFGVQSRAEEDYLRQIFQQTRKGSSKMADYLRIMKTHADNLGQAGSPVTTRSLISQVLLGLDEEYNPVVATLQGR-VGISWAELQADLLV

Query:  FEKRLEFQNSHKTNLSFSHNAAVNMVNTRGQNSGNQRQSQQFNGNRSTFSNYPNQRGGNGGRNRGRGRWQGNNRPICQVCGKTGHSALIC--YHRFDKEY
         E +L   NS +  +  + N   +      +N  N+  ++ +N N +  +++     G+   NR    + G     CQ+C   GHSA  C   H+F    
Subjt:  FEKRLEFQNSHKTNLSFSHNAAVNMVNTRGQNSGNQRQSQQFNGNRSTFSNYPNQRGGNGGRNRGRGRWQGNNRPICQVCGKTGHSALIC--YHRFDKEY

Query:  GNGVARNSNHNARTENSKVITSVGNSNPTPHAFIAGQSSIPFVASPETVVDPHWYVDSGASSHVTGNHNNISHPVEYGGKDIVVIGNGHQLNISH-----
                     T N +  TS   +   P A +A  S  P+ A+       +W +DSGA+ H+T + NN+S    Y G D V+I +G  + I+H     
Subjt:  GNGVARNSNHNARTENSKVITSVGNSNPTPHAFIAGQSSIPFVASPETVVDPHWYVDSGASSHVTGNHNNISHPVEYGGKDIVVIGNGHQLNISH-----

Query:  ----------------------VVSVSKLAQDNVVFLEFHVDSCLVKDTHTGKVLLTGVLKDGLYK--LKTSGAVTGSALEFLENSSKSANNIVCSTSVR
                              ++SV +L   N V +EF   S  VKD +TG  LL G  KD LY+  + +S AV+  A               CS +  
Subjt:  ----------------------VVSVSKLAQDNVVFLEFHVDSCLVKDTHTGKVLLTGVLKDGLYK--LKTSGAVTGSALEFLENSSKSANNIVCSTSVR

Query:  SGSNVNVNVVESKAVWHCRLGHPSFKVFDDIVKRCKLPV-RVNDIFLFCEACKFGKLHALPFQNSDSRAEVSFALVHSDLWGPAPVLSSDGFRYYVHFLD
        S              WH RLGHPS  + + ++    LPV   +   L C  C   K H +PF NS   +      ++SD+W  +P+LS D +RYYV F+D
Subjt:  SGSNVNVNVVESKAVWHCRLGHPSFKVFDDIVKRCKLPV-RVNDIFLFCEACKFGKLHALPFQNSDSRAEVSFALVHSDLWGPAPVLSSDGFRYYVHFLD

Query:  DYSRHIWVYPLKQKSETLSAFLHFLNIVKNQFSSTIKIFRSDNRGEFTKVHQVCS
         ++R+ W+YPLKQKS+    F+ F ++V+N+F + I    SDN GEF  +    S
Subjt:  DYSRHIWVYPLKQKSETLSAFLHFLNIVKNQFSSTIKIFRSDNRGEFTKVHQVCS

Arabidopsis top hitse value%identityAlignment
AT5G48050.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162)1.6e-0426.52Show/hide
Query:  AKDLWEAIQELFGVQSRAEEDYLRQIFQQTRKGSSKMADYLRIMKTHADNLGQAGSPVTTRSLISQVLLGLDEEYNPVVATLQGRVGI-SWAELQADLLV
        A+DLW +++ LF     A         + T      + +Y + +K+ +D L    SP++ R L+  +L GL E+Y+ ++  ++ +    S+ E ++ LL+
Subjt:  AKDLWEAIQELFGVQSRAEEDYLRQIFQQTRKGSSKMADYLRIMKTHADNLGQAGSPVTTRSLISQVLLGLDEEYNPVVATLQGRVGI-SWAELQADLLV

Query:  FEKRLEFQNSHKTNLSF-SHNAAVNMVNTRGQNSGNQRQSQQFNGNRSTF----SNYPNQRGGNG-GRNRGRGRWQGNNRP
         E RL   N  K++LS  +H +  N++ T  +    +R  Q+++ N S      S   N+ GG+  GR      W+ N  P
Subjt:  FEKRLEFQNSHKTNLSF-SHNAAVNMVNTRGQNSGNQRQSQQFNGNRSTF----SNYPNQRGGNG-GRNRGRGRWQGNNRP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGTTATGATAATGCTAAAGATTTATGGGAAGCTATTCAAGAATTATTTGGAGTTCAGTCCAGAGCAGAGGAAGATTATCTTCGTCAAATTTTTCAACAAACTCGAAA
AGGGTCATCCAAAATGGCTGACTATCTAAGAATCATGAAGACACATGCGGACAATCTCGGGCAAGCAGGAAGTCCTGTTACAACAAGGTCGTTGATATCCCAAGTACTAT
TGGGATTGGATGAAGAATATAATCCTGTAGTAGCTACTCTTCAAGGACGCGTTGGAATTTCCTGGGCAGAGTTGCAAGCAGATCTCCTTGTCTTTGAAAAACGGTTAGAG
TTTCAAAATTCTCATAAGACTAATTTATCTTTTAGTCACAATGCAGCTGTAAACATGGTAAATACCAGAGGCCAGAATTCTGGAAATCAAAGACAGAGTCAGCAATTTAA
TGGGAACCGGTCTACATTCTCCAACTACCCTAATCAGAGAGGTGGTAATGGAGGGCGAAATCGTGGTAGAGGAAGATGGCAAGGTAATAATCGACCAATTTGTCAGGTAT
GTGGAAAAACTGGGCATTCCGCTCTAATCTGCTATCATCGTTTTGATAAAGAGTATGGAAATGGTGTGGCTAGAAATTCTAATCACAATGCTAGGACTGAAAATAGCAAA
GTGATAACCTCTGTTGGAAATAGCAATCCAACACCTCATGCGTTTATAGCAGGACAGAGTTCTATTCCGTTTGTAGCAAGCCCCGAAACTGTGGTTGATCCTCACTGGTA
TGTTGATAGTGGAGCCTCCAGCCATGTCACCGGAAATCACAACAACATTTCACATCCTGTTGAGTATGGAGGTAAGGATATTGTTGTCATTGGAAATGGGCATCAATTAA
ATATCTCACATGTTGTCAGTGTTTCGAAGCTTGCTCAAGACAATGTCGTTTTTCTTGAATTCCATGTTGATTCTTGTCTTGTAAAGGACACACATACGGGCAAGGTGCTG
CTGACGGGGGTTCTTAAAGATGGACTTTACAAACTCAAAACAAGTGGAGCAGTTACTGGTAGTGCTTTGGAGTTTTTAGAAAATAGTTCGAAGTCGGCTAATAATATTGT
TTGCTCTACTAGTGTTCGATCTGGTTCAAATGTTAATGTTAATGTTGTGGAATCTAAAGCAGTTTGGCATTGTAGGCTTGGACATCCATCTTTTAAAGTGTTTGATGATA
TTGTCAAGAGATGTAAACTACCAGTAAGAGTTAATGATATTTTTCTGTTTTGTGAAGCCTGCAAATTTGGAAAATTACATGCTCTTCCTTTTCAAAATTCTGATTCTCGT
GCAGAAGTGTCGTTTGCCTTGGTTCATTCTGATCTTTGGGGTCCAGCACCAGTGTTGTCTTCTGATGGTTTTCGTTATTATGTGCATTTTTTGGATGACTATAGTAGACA
TATTTGGGTTTATCCGTTAAAACAAAAGAGTGAAACTTTGTCTGCTTTTCTGCACTTTCTTAATATTGTGAAAAATCAGTTTAGCAGCACCATTAAGATATTTCGATCAG
ATAATAGAGGCGAGTTTACAAAAGTTCATCAAGTCTGTTCGTAG
mRNA sequenceShow/hide mRNA sequence
ATGGGTTATGATAATGCTAAAGATTTATGGGAAGCTATTCAAGAATTATTTGGAGTTCAGTCCAGAGCAGAGGAAGATTATCTTCGTCAAATTTTTCAACAAACTCGAAA
AGGGTCATCCAAAATGGCTGACTATCTAAGAATCATGAAGACACATGCGGACAATCTCGGGCAAGCAGGAAGTCCTGTTACAACAAGGTCGTTGATATCCCAAGTACTAT
TGGGATTGGATGAAGAATATAATCCTGTAGTAGCTACTCTTCAAGGACGCGTTGGAATTTCCTGGGCAGAGTTGCAAGCAGATCTCCTTGTCTTTGAAAAACGGTTAGAG
TTTCAAAATTCTCATAAGACTAATTTATCTTTTAGTCACAATGCAGCTGTAAACATGGTAAATACCAGAGGCCAGAATTCTGGAAATCAAAGACAGAGTCAGCAATTTAA
TGGGAACCGGTCTACATTCTCCAACTACCCTAATCAGAGAGGTGGTAATGGAGGGCGAAATCGTGGTAGAGGAAGATGGCAAGGTAATAATCGACCAATTTGTCAGGTAT
GTGGAAAAACTGGGCATTCCGCTCTAATCTGCTATCATCGTTTTGATAAAGAGTATGGAAATGGTGTGGCTAGAAATTCTAATCACAATGCTAGGACTGAAAATAGCAAA
GTGATAACCTCTGTTGGAAATAGCAATCCAACACCTCATGCGTTTATAGCAGGACAGAGTTCTATTCCGTTTGTAGCAAGCCCCGAAACTGTGGTTGATCCTCACTGGTA
TGTTGATAGTGGAGCCTCCAGCCATGTCACCGGAAATCACAACAACATTTCACATCCTGTTGAGTATGGAGGTAAGGATATTGTTGTCATTGGAAATGGGCATCAATTAA
ATATCTCACATGTTGTCAGTGTTTCGAAGCTTGCTCAAGACAATGTCGTTTTTCTTGAATTCCATGTTGATTCTTGTCTTGTAAAGGACACACATACGGGCAAGGTGCTG
CTGACGGGGGTTCTTAAAGATGGACTTTACAAACTCAAAACAAGTGGAGCAGTTACTGGTAGTGCTTTGGAGTTTTTAGAAAATAGTTCGAAGTCGGCTAATAATATTGT
TTGCTCTACTAGTGTTCGATCTGGTTCAAATGTTAATGTTAATGTTGTGGAATCTAAAGCAGTTTGGCATTGTAGGCTTGGACATCCATCTTTTAAAGTGTTTGATGATA
TTGTCAAGAGATGTAAACTACCAGTAAGAGTTAATGATATTTTTCTGTTTTGTGAAGCCTGCAAATTTGGAAAATTACATGCTCTTCCTTTTCAAAATTCTGATTCTCGT
GCAGAAGTGTCGTTTGCCTTGGTTCATTCTGATCTTTGGGGTCCAGCACCAGTGTTGTCTTCTGATGGTTTTCGTTATTATGTGCATTTTTTGGATGACTATAGTAGACA
TATTTGGGTTTATCCGTTAAAACAAAAGAGTGAAACTTTGTCTGCTTTTCTGCACTTTCTTAATATTGTGAAAAATCAGTTTAGCAGCACCATTAAGATATTTCGATCAG
ATAATAGAGGCGAGTTTACAAAAGTTCATCAAGTCTGTTCGTAG
Protein sequenceShow/hide protein sequence
MGYDNAKDLWEAIQELFGVQSRAEEDYLRQIFQQTRKGSSKMADYLRIMKTHADNLGQAGSPVTTRSLISQVLLGLDEEYNPVVATLQGRVGISWAELQADLLVFEKRLE
FQNSHKTNLSFSHNAAVNMVNTRGQNSGNQRQSQQFNGNRSTFSNYPNQRGGNGGRNRGRGRWQGNNRPICQVCGKTGHSALICYHRFDKEYGNGVARNSNHNARTENSK
VITSVGNSNPTPHAFIAGQSSIPFVASPETVVDPHWYVDSGASSHVTGNHNNISHPVEYGGKDIVVIGNGHQLNISHVVSVSKLAQDNVVFLEFHVDSCLVKDTHTGKVL
LTGVLKDGLYKLKTSGAVTGSALEFLENSSKSANNIVCSTSVRSGSNVNVNVVESKAVWHCRLGHPSFKVFDDIVKRCKLPVRVNDIFLFCEACKFGKLHALPFQNSDSR
AEVSFALVHSDLWGPAPVLSSDGFRYYVHFLDDYSRHIWVYPLKQKSETLSAFLHFLNIVKNQFSSTIKIFRSDNRGEFTKVHQVCS