; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0035231 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0035231
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrotran_gag_3 domain-containing protein
Genome locationchr3:17154585..17157129
RNA-Seq ExpressionLag0035231
SyntenyLag0035231
Gene Ontology termsGO:0005488 - binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA8516701.1 hypothetical protein F0562_016793 [Nyssa sinensis]9.7e-7741.71Show/hide
Query:  LFKYLDGTIMTPAEVLRSD--GQPDQVNPEYEKWYEKDQALFTLINATLSPTTLSYVIGCKTSKEAWDKLEKHFSSSSRMHIVGLKTELQSISKKVTESV
        L  Y+DGT   P + ++ +      Q+NPEY+ W  +DQAL TL+NATLS T LS+VIG  TS+EAW  LE+ FS+S+R +I+ LK+ L +ISK   +S+
Subjt:  LFKYLDGTIMTPAEVLRSD--GQPDQVNPEYEKWYEKDQALFTLINATLSPTTLSYVIGCKTSKEAWDKLEKHFSSSSRMHIVGLKTELQSISKKVTESV

Query:  DQYVQRIKEIVNRLLAIFVVIDAEDLVIYTVNGLPSTYNVFKTSLRTRSQNPTFDELHVLMKTEETALDKQMKLEEAA---------------SASRLEM
        D Y+Q+IK+  + L ++ V+I+ ED++IY +NGLP  YN FKTS+RT+S+N T +E++ ++K EE  ++   K   +                S++R   
Subjt:  DQYVQRIKEIVNRLLAIFVVIDAEDLVIYTVNGLPSTYNVFKTSLRTRSQNPTFDELHVLMKTEETALDKQMKLEEAA---------------SASRLEM

Query:  AANFDSQGRGNWRGQGRGAGVED-------SLIQAEL-------EDEAMFSNKVH---CQICQKFGHNALDCYNRMNYSYQGHQPPTKLAAMAAAAPNTS
         +NF  +GRG  R   RG  +         +  Q+ L       +     SN  H   CQIC K GH+ALDCY+RM++SYQG  P  +L AM+A     S
Subjt:  AANFDSQGRGNWRGQGRGAGVED-------SLIQAEL-------EDEAMFSNKVH---CQICQKFGHNALDCYNRMNYSYQGHQPPTKLAAMAAAAPNTS

Query:  TATVTFPQESQVWLADTGCNAHLTNDLSNLSVSTAYNGEENITVGNGQSLSISHLGSIQISLSDSTFTLSKLFRVPNISTNLLSVHQFCIDNNCCFIFYS
          +  +      W  DTG   H+T DL+NL+    Y G++NIT+ NGQ+L ISH G   I  +D TF L+ +  VP ++TNLLSVHQFC DN+C FIF S
Subjt:  TATVTFPQESQVWLADTGCNAHLTNDLSNLSVSTAYNGEENITVGNGQSLSISHLGSIQISLSDSTFTLSKLFRVPNISTNLLSVHQFCIDNNCCFIFYS

Query:  TSFTIQDKTSGKVLFHRPSVNG
          F IQDK + ++LF  PS +G
Subjt:  TSFTIQDKTSGKVLFHRPSVNG

KAA8519786.1 hypothetical protein F0562_014124 [Nyssa sinensis]2.0e-7741.71Show/hide
Query:  LFKYLDGTIMTPAEVLRSD--GQPDQVNPEYEKWYEKDQALFTLINATLSPTTLSYVIGCKTSKEAWDKLEKHFSSSSRMHIVGLKTELQSISKKVTESV
        L  Y+DGT   P + ++ +      Q+NPEY+ W  +DQAL TL+NATLS T LS+VIG  TS+EAW  LE+ FS+S+R +I+ LK+ L +ISK   +S+
Subjt:  LFKYLDGTIMTPAEVLRSD--GQPDQVNPEYEKWYEKDQALFTLINATLSPTTLSYVIGCKTSKEAWDKLEKHFSSSSRMHIVGLKTELQSISKKVTESV

Query:  DQYVQRIKEIVNRLLAIFVVIDAEDLVIYTVNGLPSTYNVFKTSLRTRSQNPTFDELHVLMKTEETALDKQMKLEEAA---------------SASRLEM
        D Y+Q+IK+  + L ++ V+I+ ED++IY +NGLP  YN FKTS+RT+S+N T +E++ ++K EE  ++   K   +                S++R   
Subjt:  DQYVQRIKEIVNRLLAIFVVIDAEDLVIYTVNGLPSTYNVFKTSLRTRSQNPTFDELHVLMKTEETALDKQMKLEEAA---------------SASRLEM

Query:  AANFDSQGRGNWRGQGRGAGVED-------SLIQAEL-------EDEAMFSNKVH---CQICQKFGHNALDCYNRMNYSYQGHQPPTKLAAMAAAAPNTS
         +NF  +GRG  R   RG  +         +  Q+ L       +     SN  H   CQIC K GH+ALDCY+RM++SYQG  P  +L AM+A     S
Subjt:  AANFDSQGRGNWRGQGRGAGVED-------SLIQAEL-------EDEAMFSNKVH---CQICQKFGHNALDCYNRMNYSYQGHQPPTKLAAMAAAAPNTS

Query:  TATVTFPQESQVWLADTGCNAHLTNDLSNLSVSTAYNGEENITVGNGQSLSISHLGSIQISLSDSTFTLSKLFRVPNISTNLLSVHQFCIDNNCCFIFYS
          +  +      W  DTG   H+T DL+NL+    Y G++NIT+ NGQ+L ISH G   I  +D TF L+ +  VP+++TNLLSVHQFC DN+C FIF S
Subjt:  TATVTFPQESQVWLADTGCNAHLTNDLSNLSVSTAYNGEENITVGNGQSLSISHLGSIQISLSDSTFTLSKLFRVPNISTNLLSVHQFCIDNNCCFIFYS

Query:  TSFTIQDKTSGKVLFHRPSVNG
          F IQDK + ++LF  PS +G
Subjt:  TSFTIQDKTSGKVLFHRPSVNG

KAA8521875.1 hypothetical protein F0562_012811 [Nyssa sinensis]1.7e-7641.47Show/hide
Query:  LFKYLDGTIMTPAEVLRSD--GQPDQVNPEYEKWYEKDQALFTLINATLSPTTLSYVIGCKTSKEAWDKLEKHFSSSSRMHIVGLKTELQSISKKVTESV
        L  Y+DGT   P + ++ +      Q+NPEY+ W  +DQAL TL+NATLS T LS+VIG  TS+EAW  LE+ FS+S+R +I+ LK+ L +ISK   +S+
Subjt:  LFKYLDGTIMTPAEVLRSD--GQPDQVNPEYEKWYEKDQALFTLINATLSPTTLSYVIGCKTSKEAWDKLEKHFSSSSRMHIVGLKTELQSISKKVTESV

Query:  DQYVQRIKEIVNRLLAIFVVIDAEDLVIYTVNGLPSTYNVFKTSLRTRSQNPTFDELHVLMKTEETALDKQMKLEEAA---------------SASRLEM
        D Y+Q+IK+  + L ++ V+I+ ED++IY +NGLP  YN FKTS+RT+S+N T +E++ ++K EE  ++   K   +                S++R   
Subjt:  DQYVQRIKEIVNRLLAIFVVIDAEDLVIYTVNGLPSTYNVFKTSLRTRSQNPTFDELHVLMKTEETALDKQMKLEEAA---------------SASRLEM

Query:  AANFDSQGRGNWRGQGRGAGVED-------SLIQAEL-------EDEAMFSNKVH---CQICQKFGHNALDCYNRMNYSYQGHQPPTKLAAMAAAAPNTS
         +NF  +GRG  R   RG  +         +  Q+ L       +     SN  H   CQIC K GH+ALDCY+RM++SYQG  P  +L AM+A     S
Subjt:  AANFDSQGRGNWRGQGRGAGVED-------SLIQAEL-------EDEAMFSNKVH---CQICQKFGHNALDCYNRMNYSYQGHQPPTKLAAMAAAAPNTS

Query:  TATVTFPQESQVWLADTGCNAHLTNDLSNLSVSTAYNGEENITVGNGQSLSISHLGSIQISLSDSTFTLSKLFRVPNISTNLLSVHQFCIDNNCCFIFYS
          +  +      W  DTG   H+T DL+NL+    Y G++NIT+ NGQ+L ISH G   I  +D TF L+ +  VP+++TNLLSVHQFC DN+C FIF S
Subjt:  TATVTFPQESQVWLADTGCNAHLTNDLSNLSVSTAYNGEENITVGNGQSLSISHLGSIQISLSDSTFTLSKLFRVPNISTNLLSVHQFCIDNNCCFIFYS

Query:  TSFTIQDKTSGKVLFHRPSVNG
          F IQDK + ++LF  P  +G
Subjt:  TSFTIQDKTSGKVLFHRPSVNG

KAA8524269.1 hypothetical protein F0562_010692 [Nyssa sinensis]3.3e-7741.71Show/hide
Query:  LFKYLDGTIMTPAEVLRSD--GQPDQVNPEYEKWYEKDQALFTLINATLSPTTLSYVIGCKTSKEAWDKLEKHFSSSSRMHIVGLKTELQSISKKVTESV
        L  Y+DGT   P + ++ +      Q+NPEY+ W  +DQAL TL+NATLS T LS+VIG  TS+EAW  LE+ FS+S+R +I+ LK+ L +ISK   +S+
Subjt:  LFKYLDGTIMTPAEVLRSD--GQPDQVNPEYEKWYEKDQALFTLINATLSPTTLSYVIGCKTSKEAWDKLEKHFSSSSRMHIVGLKTELQSISKKVTESV

Query:  DQYVQRIKEIVNRLLAIFVVIDAEDLVIYTVNGLPSTYNVFKTSLRTRSQNPTFDELHVLMKTEETALDKQMKLEEAA---------------SASRLEM
        D Y+Q+IK+  + L ++ V+I+ ED++IY +NGLP  YN FKTS+RT+S+N T +E++ ++K EE  ++   K   +                S++R   
Subjt:  DQYVQRIKEIVNRLLAIFVVIDAEDLVIYTVNGLPSTYNVFKTSLRTRSQNPTFDELHVLMKTEETALDKQMKLEEAA---------------SASRLEM

Query:  AANFDSQGRGNWRGQGRGAGVED-------SLIQAEL-------EDEAMFSNKVH---CQICQKFGHNALDCYNRMNYSYQGHQPPTKLAAMAAAAPNTS
         +NF  +GRG  R   RG  +         +  Q+ L       +     SN  H   CQIC K GH+ALDCY+RM++SYQG  P  +L AM+A     S
Subjt:  AANFDSQGRGNWRGQGRGAGVED-------SLIQAEL-------EDEAMFSNKVH---CQICQKFGHNALDCYNRMNYSYQGHQPPTKLAAMAAAAPNTS

Query:  TATVTFPQESQVWLADTGCNAHLTNDLSNLSVSTAYNGEENITVGNGQSLSISHLGSIQISLSDSTFTLSKLFRVPNISTNLLSVHQFCIDNNCCFIFYS
          +  +      W  DTG   H+T DL+NL+    Y G++NIT+ NGQ+L ISH G   I  +D TF L+ +  VP+++TNLLSVHQFC DN+C FIF S
Subjt:  TATVTFPQESQVWLADTGCNAHLTNDLSNLSVSTAYNGEENITVGNGQSLSISHLGSIQISLSDSTFTLSKLFRVPNISTNLLSVHQFCIDNNCCFIFYS

Query:  TSFTIQDKTSGKVLFHRPSVNG
          F IQDK + ++LF  PS +G
Subjt:  TSFTIQDKTSGKVLFHRPSVNG

KAA8535282.1 hypothetical protein F0562_030285 [Nyssa sinensis]5.7e-7741.71Show/hide
Query:  LFKYLDGTIMTPAEVLRSD--GQPDQVNPEYEKWYEKDQALFTLINATLSPTTLSYVIGCKTSKEAWDKLEKHFSSSSRMHIVGLKTELQSISKKVTESV
        L  Y+DGT   P + ++ +      Q+NPEY+ W  +DQAL TL+NATLS T LS+VIG  TS+EAW  LE+ FS+S+R +I+ LK+ L +ISK   +S+
Subjt:  LFKYLDGTIMTPAEVLRSD--GQPDQVNPEYEKWYEKDQALFTLINATLSPTTLSYVIGCKTSKEAWDKLEKHFSSSSRMHIVGLKTELQSISKKVTESV

Query:  DQYVQRIKEIVNRLLAIFVVIDAEDLVIYTVNGLPSTYNVFKTSLRTRSQNPTFDELHVLMKTEETALDKQMKLEEAA---------------SASRLEM
        D Y+Q+IK   + L ++ V+I+ ED++IY +NGLP  YN FKTS+RT+S+N T +E++ ++K EE  ++   K   +                S++R   
Subjt:  DQYVQRIKEIVNRLLAIFVVIDAEDLVIYTVNGLPSTYNVFKTSLRTRSQNPTFDELHVLMKTEETALDKQMKLEEAA---------------SASRLEM

Query:  AANFDSQGRGNWRGQGRGAGVED-------SLIQAEL-------EDEAMFSNKVH---CQICQKFGHNALDCYNRMNYSYQGHQPPTKLAAMAAAAPNTS
         +NF  +GRG  R   RG  +         +  Q+ L       +     SN  H   CQIC K GH+ALDCY+RM++SYQG  P  +L AM+A     S
Subjt:  AANFDSQGRGNWRGQGRGAGVED-------SLIQAEL-------EDEAMFSNKVH---CQICQKFGHNALDCYNRMNYSYQGHQPPTKLAAMAAAAPNTS

Query:  TATVTFPQESQVWLADTGCNAHLTNDLSNLSVSTAYNGEENITVGNGQSLSISHLGSIQISLSDSTFTLSKLFRVPNISTNLLSVHQFCIDNNCCFIFYS
          +  +      W  DTG   H+T DL+NL+    Y G++NIT+ NGQ+L ISH G   I  +D TF L+ +  VP+++TNLLSVHQFC DN+C FIF S
Subjt:  TATVTFPQESQVWLADTGCNAHLTNDLSNLSVSTAYNGEENITVGNGQSLSISHLGSIQISLSDSTFTLSKLFRVPNISTNLLSVHQFCIDNNCCFIFYS

Query:  TSFTIQDKTSGKVLFHRPSVNG
          F IQDK + ++LF  PS +G
Subjt:  TSFTIQDKTSGKVLFHRPSVNG

TrEMBL top hitse value%identityAlignment
A0A5J4ZHB6 Uncharacterized protein4.7e-7741.71Show/hide
Query:  LFKYLDGTIMTPAEVLRSD--GQPDQVNPEYEKWYEKDQALFTLINATLSPTTLSYVIGCKTSKEAWDKLEKHFSSSSRMHIVGLKTELQSISKKVTESV
        L  Y+DGT   P + ++ +      Q+NPEY+ W  +DQAL TL+NATLS T LS+VIG  TS+EAW  LE+ FS+S+R +I+ LK+ L +ISK   +S+
Subjt:  LFKYLDGTIMTPAEVLRSD--GQPDQVNPEYEKWYEKDQALFTLINATLSPTTLSYVIGCKTSKEAWDKLEKHFSSSSRMHIVGLKTELQSISKKVTESV

Query:  DQYVQRIKEIVNRLLAIFVVIDAEDLVIYTVNGLPSTYNVFKTSLRTRSQNPTFDELHVLMKTEETALDKQMKLEEAA---------------SASRLEM
        D Y+Q+IK+  + L ++ V+I+ ED++IY +NGLP  YN FKTS+RT+S+N T +E++ ++K EE  ++   K   +                S++R   
Subjt:  DQYVQRIKEIVNRLLAIFVVIDAEDLVIYTVNGLPSTYNVFKTSLRTRSQNPTFDELHVLMKTEETALDKQMKLEEAA---------------SASRLEM

Query:  AANFDSQGRGNWRGQGRGAGVED-------SLIQAEL-------EDEAMFSNKVH---CQICQKFGHNALDCYNRMNYSYQGHQPPTKLAAMAAAAPNTS
         +NF  +GRG  R   RG  +         +  Q+ L       +     SN  H   CQIC K GH+ALDCY+RM++SYQG  P  +L AM+A     S
Subjt:  AANFDSQGRGNWRGQGRGAGVED-------SLIQAEL-------EDEAMFSNKVH---CQICQKFGHNALDCYNRMNYSYQGHQPPTKLAAMAAAAPNTS

Query:  TATVTFPQESQVWLADTGCNAHLTNDLSNLSVSTAYNGEENITVGNGQSLSISHLGSIQISLSDSTFTLSKLFRVPNISTNLLSVHQFCIDNNCCFIFYS
          +  +      W  DTG   H+T DL+NL+    Y G++NIT+ NGQ+L ISH G   I  +D TF L+ +  VP ++TNLLSVHQFC DN+C FIF S
Subjt:  TATVTFPQESQVWLADTGCNAHLTNDLSNLSVSTAYNGEENITVGNGQSLSISHLGSIQISLSDSTFTLSKLFRVPNISTNLLSVHQFCIDNNCCFIFYS

Query:  TSFTIQDKTSGKVLFHRPSVNG
          F IQDK + ++LF  PS +G
Subjt:  TSFTIQDKTSGKVLFHRPSVNG

A0A5J4ZPW7 Retrotran_gag_3 domain-containing protein9.5e-7841.71Show/hide
Query:  LFKYLDGTIMTPAEVLRSD--GQPDQVNPEYEKWYEKDQALFTLINATLSPTTLSYVIGCKTSKEAWDKLEKHFSSSSRMHIVGLKTELQSISKKVTESV
        L  Y+DGT   P + ++ +      Q+NPEY+ W  +DQAL TL+NATLS T LS+VIG  TS+EAW  LE+ FS+S+R +I+ LK+ L +ISK   +S+
Subjt:  LFKYLDGTIMTPAEVLRSD--GQPDQVNPEYEKWYEKDQALFTLINATLSPTTLSYVIGCKTSKEAWDKLEKHFSSSSRMHIVGLKTELQSISKKVTESV

Query:  DQYVQRIKEIVNRLLAIFVVIDAEDLVIYTVNGLPSTYNVFKTSLRTRSQNPTFDELHVLMKTEETALDKQMKLEEAA---------------SASRLEM
        D Y+Q+IK+  + L ++ V+I+ ED++IY +NGLP  YN FKTS+RT+S+N T +E++ ++K EE  ++   K   +                S++R   
Subjt:  DQYVQRIKEIVNRLLAIFVVIDAEDLVIYTVNGLPSTYNVFKTSLRTRSQNPTFDELHVLMKTEETALDKQMKLEEAA---------------SASRLEM

Query:  AANFDSQGRGNWRGQGRGAGVED-------SLIQAEL-------EDEAMFSNKVH---CQICQKFGHNALDCYNRMNYSYQGHQPPTKLAAMAAAAPNTS
         +NF  +GRG  R   RG  +         +  Q+ L       +     SN  H   CQIC K GH+ALDCY+RM++SYQG  P  +L AM+A     S
Subjt:  AANFDSQGRGNWRGQGRGAGVED-------SLIQAEL-------EDEAMFSNKVH---CQICQKFGHNALDCYNRMNYSYQGHQPPTKLAAMAAAAPNTS

Query:  TATVTFPQESQVWLADTGCNAHLTNDLSNLSVSTAYNGEENITVGNGQSLSISHLGSIQISLSDSTFTLSKLFRVPNISTNLLSVHQFCIDNNCCFIFYS
          +  +      W  DTG   H+T DL+NL+    Y G++NIT+ NGQ+L ISH G   I  +D TF L+ +  VP+++TNLLSVHQFC DN+C FIF S
Subjt:  TATVTFPQESQVWLADTGCNAHLTNDLSNLSVSTAYNGEENITVGNGQSLSISHLGSIQISLSDSTFTLSKLFRVPNISTNLLSVHQFCIDNNCCFIFYS

Query:  TSFTIQDKTSGKVLFHRPSVNG
          F IQDK + ++LF  PS +G
Subjt:  TSFTIQDKTSGKVLFHRPSVNG

A0A5J4ZT09 Flavin-containing monooxygenase8.0e-7741.47Show/hide
Query:  LFKYLDGTIMTPAEVLRSD--GQPDQVNPEYEKWYEKDQALFTLINATLSPTTLSYVIGCKTSKEAWDKLEKHFSSSSRMHIVGLKTELQSISKKVTESV
        L  Y+DGT   P + ++ +      Q+NPEY+ W  +DQAL TL+NATLS T LS+VIG  TS+EAW  LE+ FS+S+R +I+ LK+ L +ISK   +S+
Subjt:  LFKYLDGTIMTPAEVLRSD--GQPDQVNPEYEKWYEKDQALFTLINATLSPTTLSYVIGCKTSKEAWDKLEKHFSSSSRMHIVGLKTELQSISKKVTESV

Query:  DQYVQRIKEIVNRLLAIFVVIDAEDLVIYTVNGLPSTYNVFKTSLRTRSQNPTFDELHVLMKTEETALDKQMKLEEAA---------------SASRLEM
        D Y+Q+IK+  + L ++ V+I+ ED++IY +NGLP  YN FKTS+RT+S+N T +E++ ++K EE  ++   K   +                S++R   
Subjt:  DQYVQRIKEIVNRLLAIFVVIDAEDLVIYTVNGLPSTYNVFKTSLRTRSQNPTFDELHVLMKTEETALDKQMKLEEAA---------------SASRLEM

Query:  AANFDSQGRGNWRGQGRGAGVED-------SLIQAEL-------EDEAMFSNKVH---CQICQKFGHNALDCYNRMNYSYQGHQPPTKLAAMAAAAPNTS
         +NF  +GRG  R   RG  +         +  Q+ L       +     SN  H   CQIC K GH+ALDCY+RM++SYQG  P  +L AM+A     S
Subjt:  AANFDSQGRGNWRGQGRGAGVED-------SLIQAEL-------EDEAMFSNKVH---CQICQKFGHNALDCYNRMNYSYQGHQPPTKLAAMAAAAPNTS

Query:  TATVTFPQESQVWLADTGCNAHLTNDLSNLSVSTAYNGEENITVGNGQSLSISHLGSIQISLSDSTFTLSKLFRVPNISTNLLSVHQFCIDNNCCFIFYS
          +  +      W  DTG   H+T DL+NL+    Y G++NIT+ NGQ+L ISH G   I  +D TF L+ +  VP+++TNLLSVHQFC DN+C FIF S
Subjt:  TATVTFPQESQVWLADTGCNAHLTNDLSNLSVSTAYNGEENITVGNGQSLSISHLGSIQISLSDSTFTLSKLFRVPNISTNLLSVHQFCIDNNCCFIFYS

Query:  TSFTIQDKTSGKVLFHRPSVNG
          F IQDK + ++LF  P  +G
Subjt:  TSFTIQDKTSGKVLFHRPSVNG

A0A5J5A1U7 Integrase catalytic domain-containing protein1.6e-7741.71Show/hide
Query:  LFKYLDGTIMTPAEVLRSD--GQPDQVNPEYEKWYEKDQALFTLINATLSPTTLSYVIGCKTSKEAWDKLEKHFSSSSRMHIVGLKTELQSISKKVTESV
        L  Y+DGT   P + ++ +      Q+NPEY+ W  +DQAL TL+NATLS T LS+VIG  TS+EAW  LE+ FS+S+R +I+ LK+ L +ISK   +S+
Subjt:  LFKYLDGTIMTPAEVLRSD--GQPDQVNPEYEKWYEKDQALFTLINATLSPTTLSYVIGCKTSKEAWDKLEKHFSSSSRMHIVGLKTELQSISKKVTESV

Query:  DQYVQRIKEIVNRLLAIFVVIDAEDLVIYTVNGLPSTYNVFKTSLRTRSQNPTFDELHVLMKTEETALDKQMKLEEAA---------------SASRLEM
        D Y+Q+IK+  + L ++ V+I+ ED++IY +NGLP  YN FKTS+RT+S+N T +E++ ++K EE  ++   K   +                S++R   
Subjt:  DQYVQRIKEIVNRLLAIFVVIDAEDLVIYTVNGLPSTYNVFKTSLRTRSQNPTFDELHVLMKTEETALDKQMKLEEAA---------------SASRLEM

Query:  AANFDSQGRGNWRGQGRGAGVED-------SLIQAEL-------EDEAMFSNKVH---CQICQKFGHNALDCYNRMNYSYQGHQPPTKLAAMAAAAPNTS
         +NF  +GRG  R   RG  +         +  Q+ L       +     SN  H   CQIC K GH+ALDCY+RM++SYQG  P  +L AM+A     S
Subjt:  AANFDSQGRGNWRGQGRGAGVED-------SLIQAEL-------EDEAMFSNKVH---CQICQKFGHNALDCYNRMNYSYQGHQPPTKLAAMAAAAPNTS

Query:  TATVTFPQESQVWLADTGCNAHLTNDLSNLSVSTAYNGEENITVGNGQSLSISHLGSIQISLSDSTFTLSKLFRVPNISTNLLSVHQFCIDNNCCFIFYS
          +  +      W  DTG   H+T DL+NL+    Y G++NIT+ NGQ+L ISH G   I  +D TF L+ +  VP+++TNLLSVHQFC DN+C FIF S
Subjt:  TATVTFPQESQVWLADTGCNAHLTNDLSNLSVSTAYNGEENITVGNGQSLSISHLGSIQISLSDSTFTLSKLFRVPNISTNLLSVHQFCIDNNCCFIFYS

Query:  TSFTIQDKTSGKVLFHRPSVNG
          F IQDK + ++LF  PS +G
Subjt:  TSFTIQDKTSGKVLFHRPSVNG

A0A5J5B049 Retrotran_gag_3 domain-containing protein2.8e-7741.71Show/hide
Query:  LFKYLDGTIMTPAEVLRSD--GQPDQVNPEYEKWYEKDQALFTLINATLSPTTLSYVIGCKTSKEAWDKLEKHFSSSSRMHIVGLKTELQSISKKVTESV
        L  Y+DGT   P + ++ +      Q+NPEY+ W  +DQAL TL+NATLS T LS+VIG  TS+EAW  LE+ FS+S+R +I+ LK+ L +ISK   +S+
Subjt:  LFKYLDGTIMTPAEVLRSD--GQPDQVNPEYEKWYEKDQALFTLINATLSPTTLSYVIGCKTSKEAWDKLEKHFSSSSRMHIVGLKTELQSISKKVTESV

Query:  DQYVQRIKEIVNRLLAIFVVIDAEDLVIYTVNGLPSTYNVFKTSLRTRSQNPTFDELHVLMKTEETALDKQMKLEEAA---------------SASRLEM
        D Y+Q+IK   + L ++ V+I+ ED++IY +NGLP  YN FKTS+RT+S+N T +E++ ++K EE  ++   K   +                S++R   
Subjt:  DQYVQRIKEIVNRLLAIFVVIDAEDLVIYTVNGLPSTYNVFKTSLRTRSQNPTFDELHVLMKTEETALDKQMKLEEAA---------------SASRLEM

Query:  AANFDSQGRGNWRGQGRGAGVED-------SLIQAEL-------EDEAMFSNKVH---CQICQKFGHNALDCYNRMNYSYQGHQPPTKLAAMAAAAPNTS
         +NF  +GRG  R   RG  +         +  Q+ L       +     SN  H   CQIC K GH+ALDCY+RM++SYQG  P  +L AM+A     S
Subjt:  AANFDSQGRGNWRGQGRGAGVED-------SLIQAEL-------EDEAMFSNKVH---CQICQKFGHNALDCYNRMNYSYQGHQPPTKLAAMAAAAPNTS

Query:  TATVTFPQESQVWLADTGCNAHLTNDLSNLSVSTAYNGEENITVGNGQSLSISHLGSIQISLSDSTFTLSKLFRVPNISTNLLSVHQFCIDNNCCFIFYS
          +  +      W  DTG   H+T DL+NL+    Y G++NIT+ NGQ+L ISH G   I  +D TF L+ +  VP+++TNLLSVHQFC DN+C FIF S
Subjt:  TATVTFPQESQVWLADTGCNAHLTNDLSNLSVSTAYNGEENITVGNGQSLSISHLGSIQISLSDSTFTLSKLFRVPNISTNLLSVHQFCIDNNCCFIFYS

Query:  TSFTIQDKTSGKVLFHRPSVNG
          F IQDK + ++LF  PS +G
Subjt:  TSFTIQDKTSGKVLFHRPSVNG

SwissProt top hitse value%identityAlignment
Q94HW2 Retrovirus-related Pol polyprotein from transposon RE14.4e-3226.02Show/hide
Query:  YLDGTIMTPAEVLRSDGQPDQVNPEYEKWYEKDQALFTLINATLSPTTLSYVIGCKTSKEAWDKLEKHFSSSSRMHIVGLKTELQSISKKVTESVDQYVQ
        +LDG+   P   + +D  P +VNP+Y +W  +D+ +++ +   +S +    V    T+ + W+ L K +++ S  H+  L+T+L+  +K  T+++D Y+Q
Subjt:  YLDGTIMTPAEVLRSDGQPDQVNPEYEKWYEKDQALFTLINATLSPTTLSYVIGCKTSKEAWDKLEKHFSSSSRMHIVGLKTELQSISKKVTESVDQYVQ

Query:  RIKEIVNRLLAIFVVIDAEDLVIYTVNGLPSTYNVFKTSLRTRSQNPTFDELHVLMKTEETALDKQMKLEEAASASRLEMAANF-----------DSQGR
         +    ++L  +   +D ++ V   +  LP  Y      +  +   PT  E+H      E  L+ + K+   +SA+ + + AN            ++ G 
Subjt:  RIKEIVNRLLAIFVVIDAEDLVIYTVNGLPSTYNVFKTSLRTRSQNPTFDELHVLMKTEETALDKQMKLEEAASASRLEMAANF-----------DSQGR

Query:  GNWRGQGRGAGVEDSLIQAELEDEAMFSNKV-----HCQICQKFGHNALDCYNRMNY--SYQGHQPPTKLAAMAAAAPNTSTATVTFPQESQVWLADTGC
         N R   R         Q    +    +N+       CQIC   GH+A  C    ++  S    QPP+        A       +  P  S  WL D+G 
Subjt:  GNWRGQGRGAGVEDSLIQAELEDEAMFSNKV-----HCQICQKFGHNALDCYNRMNY--SYQGHQPPTKLAAMAAAAPNTSTATVTFPQESQVWLADTGC

Query:  NAHLTNDLSNLSVSTAYNGEENITVGNGQSLSISHLGSIQISLSDSTFTLSKLFRVPNISTNLLSVHQFCIDNNCCFIFYSTSFTIQDKTSG
          H+T+D +NLS+   Y G +++ V +G ++ ISH GS  +S       L  +  VPNI  NL+SV++ C  N     F+  SF ++D  +G
Subjt:  NAHLTNDLSNLSVSTAYNGEENITVGNGQSLSISHLGSIQISLSDSTFTLSKLFRVPNISTNLLSVHQFCIDNNCCFIFYSTSFTIQDKTSG

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE21.5e-2424.17Show/hide
Query:  YLDGTIMTPAEVLRSDGQPDQVNPEYEKWYEKDQALFTLINATLSPTTLSYVIGCKTSKEAWDKLEKHFSSSSRMHIVGLKTELQSISKKVTESVDQYVQ
        +LDG+   P   + +D  P +VNP+Y +W  +D+ +++ I   +S +    V    T+ + W+ L K +++ S  H+    T+L+ I++           
Subjt:  YLDGTIMTPAEVLRSDGQPDQVNPEYEKWYEKDQALFTLINATLSPTTLSYVIGCKTSKEAWDKLEKHFSSSSRMHIVGLKTELQSISKKVTESVDQYVQ

Query:  RIKEIVNRLLAIFVVIDAEDLVIYTVNGLPSTYNVFKTSLRTRSQNPTFDELHVLMKTEETALDKQMKLEEAASASRLEMAANF----------DSQGRG
              ++L  +   +D ++ V   +  LP  Y      +  +   P+  E+H      E  ++++ KL    SA  + + AN           +   RG
Subjt:  RIKEIVNRLLAIFVVIDAEDLVIYTVNGLPSTYNVFKTSLRTRSQNPTFDELHVLMKTEETALDKQMKLEEAASASRLEMAANF----------DSQGRG

Query:  NWR----GQGRGAGVEDSLIQAELEDEAMFSNKVHCQICQKFGHNALDCYNRMNYSYQGHQPPTKLAAMAAAAPNT-----STATVTFPQESQVWLADTG
        + R       R    + S   +  ++         CQIC   GH+A  C        Q HQ  +      + +P T     +   V  P  +  WL D+G
Subjt:  NWR----GQGRGAGVEDSLIQAELEDEAMFSNKVHCQICQKFGHNALDCYNRMNYSYQGHQPPTKLAAMAAAAPNT-----STATVTFPQESQVWLADTG

Query:  CNAHLTNDLSNLSVSTAYNGEENITVGNGQSLSISHLGSIQISLSDSTFTLSKLFRVPNISTNLLSVHQFCIDNNCCFIFYSTSFTIQDKTSG
           H+T+D +NLS    Y G +++ + +G ++ I+H GS  +  S  +  L+K+  VPNI  NL+SV++ C  N     F+  SF ++D  +G
Subjt:  CNAHLTNDLSNLSVSTAYNGEENITVGNGQSLSISHLGSIQISLSDSTFTLSKLFRVPNISTNLLSVHQFCIDNNCCFIFYSTSFTIQDKTSG

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCAGAGGCTCCCACAAACTGGTCTTTGCCAAGAGAATAGCTGGGAAGACGTTTTGGTGGTGTCTTGCCAAGTTCCTGTTCTTGGTGTTCGTGCTGTATACCGATCGTT
GAGGGAGACCTTGCTTGTTGTTCAACTGTTGGTGACCAAAGAGTCTACGAGGCTGTTCAAGTATCTTGATGGCACTATCATGACACCTGCTGAAGTACTTCGTTCTGACG
GACAGCCCGATCAGGTCAATCCTGAGTATGAAAAATGGTATGAAAAAGATCAAGCCCTCTTTACTTTGATAAATGCGACTTTATCGCCAACAACCCTATCCTATGTGATT
GGTTGCAAGACTTCAAAAGAAGCCTGGGACAAGCTCGAGAAACATTTCTCTTCATCTTCAAGGATGCACATTGTTGGTCTCAAGACCGAATTACAAAGTATATCCAAGAA
AGTGACAGAATCCGTCGATCAATACGTTCAGCGTATCAAAGAAATTGTCAATCGACTACTGGCTATATTTGTTGTCATCGATGCCGAAGATCTGGTAATATACACTGTTA
ATGGACTACCTTCAACATACAATGTCTTCAAGACATCCTTGAGGACTAGGTCTCAAAATCCGACATTTGATGAACTTCACGTCCTCATGAAGACTGAGGAAACTGCACTC
GATAAACAGATGAAACTAGAGGAGGCAGCCTCTGCATCACGATTAGAAATGGCCGCCAATTTTGATTCACAAGGACGAGGAAATTGGAGAGGACAAGGCAGAGGAGCCGG
AGTGGAGGACAGTCTGATTCAGGCCGAACTGGAGGACGAGGCCATGTTCTCCAACAAAGTACATTGCCAAATTTGTCAAAAATTTGGGCACAATGCGTTGGATTGCTACA
ATCGCATGAACTACTCCTACCAAGGGCATCAACCTCCAACCAAACTTGCTGCAATGGCTGCTGCTGCTCCAAACACTTCTACTGCAACCGTTACTTTTCCGCAAGAATCT
CAGGTATGGCTAGCTGATACAGGCTGTAACGCTCATCTTACCAATGATCTGTCAAATCTCAGTGTTTCAACTGCATACAATGGGGAGGAGAACATCACAGTAGGCAATGG
TCAGTCCCTCTCCATTTCTCATCTTGGTTCTATCCAAATTTCCCTATCTGACTCTACTTTTACCTTGTCTAAATTGTTCCGTGTACCTAACATTTCCACAAATCTTCTTT
CGGTGCATCAGTTCTGTATTGATAACAATTGCTGCTTTATTTTTTACTCCACTTCCTTCACCATTCAGGACAAAACTTCGGGTAAAGTTCTCTTTCACAGACCTAGCGTT
AATGGCTTTACCCATTCTCTGTCTCGCCAACACAAGCAAATCTTGTAG
mRNA sequenceShow/hide mRNA sequence
ATGCAGAGGCTCCCACAAACTGGTCTTTGCCAAGAGAATAGCTGGGAAGACGTTTTGGTGGTGTCTTGCCAAGTTCCTGTTCTTGGTGTTCGTGCTGTATACCGATCGTT
GAGGGAGACCTTGCTTGTTGTTCAACTGTTGGTGACCAAAGAGTCTACGAGGCTGTTCAAGTATCTTGATGGCACTATCATGACACCTGCTGAAGTACTTCGTTCTGACG
GACAGCCCGATCAGGTCAATCCTGAGTATGAAAAATGGTATGAAAAAGATCAAGCCCTCTTTACTTTGATAAATGCGACTTTATCGCCAACAACCCTATCCTATGTGATT
GGTTGCAAGACTTCAAAAGAAGCCTGGGACAAGCTCGAGAAACATTTCTCTTCATCTTCAAGGATGCACATTGTTGGTCTCAAGACCGAATTACAAAGTATATCCAAGAA
AGTGACAGAATCCGTCGATCAATACGTTCAGCGTATCAAAGAAATTGTCAATCGACTACTGGCTATATTTGTTGTCATCGATGCCGAAGATCTGGTAATATACACTGTTA
ATGGACTACCTTCAACATACAATGTCTTCAAGACATCCTTGAGGACTAGGTCTCAAAATCCGACATTTGATGAACTTCACGTCCTCATGAAGACTGAGGAAACTGCACTC
GATAAACAGATGAAACTAGAGGAGGCAGCCTCTGCATCACGATTAGAAATGGCCGCCAATTTTGATTCACAAGGACGAGGAAATTGGAGAGGACAAGGCAGAGGAGCCGG
AGTGGAGGACAGTCTGATTCAGGCCGAACTGGAGGACGAGGCCATGTTCTCCAACAAAGTACATTGCCAAATTTGTCAAAAATTTGGGCACAATGCGTTGGATTGCTACA
ATCGCATGAACTACTCCTACCAAGGGCATCAACCTCCAACCAAACTTGCTGCAATGGCTGCTGCTGCTCCAAACACTTCTACTGCAACCGTTACTTTTCCGCAAGAATCT
CAGGTATGGCTAGCTGATACAGGCTGTAACGCTCATCTTACCAATGATCTGTCAAATCTCAGTGTTTCAACTGCATACAATGGGGAGGAGAACATCACAGTAGGCAATGG
TCAGTCCCTCTCCATTTCTCATCTTGGTTCTATCCAAATTTCCCTATCTGACTCTACTTTTACCTTGTCTAAATTGTTCCGTGTACCTAACATTTCCACAAATCTTCTTT
CGGTGCATCAGTTCTGTATTGATAACAATTGCTGCTTTATTTTTTACTCCACTTCCTTCACCATTCAGGACAAAACTTCGGGTAAAGTTCTCTTTCACAGACCTAGCGTT
AATGGCTTTACCCATTCTCTGTCTCGCCAACACAAGCAAATCTTGTAG
Protein sequenceShow/hide protein sequence
MQRLPQTGLCQENSWEDVLVVSCQVPVLGVRAVYRSLRETLLVVQLLVTKESTRLFKYLDGTIMTPAEVLRSDGQPDQVNPEYEKWYEKDQALFTLINATLSPTTLSYVI
GCKTSKEAWDKLEKHFSSSSRMHIVGLKTELQSISKKVTESVDQYVQRIKEIVNRLLAIFVVIDAEDLVIYTVNGLPSTYNVFKTSLRTRSQNPTFDELHVLMKTEETAL
DKQMKLEEAASASRLEMAANFDSQGRGNWRGQGRGAGVEDSLIQAELEDEAMFSNKVHCQICQKFGHNALDCYNRMNYSYQGHQPPTKLAAMAAAAPNTSTATVTFPQES
QVWLADTGCNAHLTNDLSNLSVSTAYNGEENITVGNGQSLSISHLGSIQISLSDSTFTLSKLFRVPNISTNLLSVHQFCIDNNCCFIFYSTSFTIQDKTSGKVLFHRPSV
NGFTHSLSRQHKQIL