; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0028569 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0028569
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionIntegrase catalytic domain-containing protein
Genome locationchr8:25206591..25208188
RNA-Seq ExpressionLag0028569
SyntenyLag0028569
Gene Ontology termsNA
InterPro domainsIPR025724 - GAG-pre-integrase domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA8516701.1 hypothetical protein F0562_016793 [Nyssa sinensis]7.9e-5233.91Show/hide
Query:  SSMLLLTNIRNLVFVRVDSTNYVLWRFLISPILKAHKLFRFVDGSYLASNCLLRTE--GQPPQVNLEYEKWHEKDQ----------------------TS
        S + LL+NI NL+  R+DS+NYV W+F IS ILKAH L  ++DG+Y   N  ++ E      Q+N EY+ W+ +DQ                      TS
Subjt:  SSMLLLTNIRNLVFVRVDSTNYVLWRFLISPILKAHKLFRFVDGSYLASNCLLRTE--GQPPQVNLEYEKWHEKDQ----------------------TS

Query:  QEVWEKLEKHFSSSTRSNIVGLKIELQNISKKSGETIDVYIQRINDIVNKLTAVSVVIDTEDLIIYIVNGLPSTLNV----VEIGAVKVVVEEILVDLME
        +E W  LE+ FS+STRSNI+ LK  L NIS K  ++ID YIQ+I    + L +VSV+I+ ED++IY++NGLP   N     +   +  + +EE+   L  
Subjt:  QEVWEKLEKHFSSSTRSNIVGLKIELQNISKKSGETIDVYIQRINDIVNKLTAVSVVIDTEDLIIYIVNGLPSTLNV----VEIGAVKVVVEEILVDLME

Query:  TE-----------------VMVAT----------------------------------------------------------------------------
         E                  M+AT                                                                            
Subjt:  TE-----------------VMVAT----------------------------------------------------------------------------

Query:  -NGHDALECYNRMNYSYQGCHPPTKLAAMATSANFVSTNSLPQDSQVWLTDTGCNAHLTNDFNNLSISSAYNGKENIPIGNDQSLPITHQGCGKTSTIDS
         NGH AL+CY+RM++SYQG  P  +L AM+ + N  S  S       W TDTG   H+T D  NL+    Y G +NI I N Q+L I+H G       D 
Subjt:  -NGHDALECYNRMNYSYQGCHPPTKLAAMATSANFVSTNSLPQDSQVWLTDTGCNAHLTNDFNNLSISSAYNGKENIPIGNDQSLPITHQGCGKTSTIDS

Query:  SFILSNLLRVPNISTNLLSILQFFLDNDCSFTFNATSFTIQDNATGKILYHRLNINGLYPLTTSSL
        +F L+N+L VP ++TNLLS+ QF  DN C F F++  F IQD AT ++L+   + +GLYPL TSS+
Subjt:  SFILSNLLRVPNISTNLLSILQFFLDNDCSFTFNATSFTIQDNATGKILYHRLNINGLYPLTTSSL

KAA8519786.1 hypothetical protein F0562_014124 [Nyssa sinensis]1.6e-5233.91Show/hide
Query:  SSMLLLTNIRNLVFVRVDSTNYVLWRFLISPILKAHKLFRFVDGSYLASNCLLRTE--GQPPQVNLEYEKWHEKDQ----------------------TS
        S + LL+NI NL+  R+DS+NYV W+F IS ILKAH L  ++DG+Y   N  ++ E      Q+N EY+ W+ +DQ                      TS
Subjt:  SSMLLLTNIRNLVFVRVDSTNYVLWRFLISPILKAHKLFRFVDGSYLASNCLLRTE--GQPPQVNLEYEKWHEKDQ----------------------TS

Query:  QEVWEKLEKHFSSSTRSNIVGLKIELQNISKKSGETIDVYIQRINDIVNKLTAVSVVIDTEDLIIYIVNGLPSTLNV----VEIGAVKVVVEEILVDLME
        +E W  LE+ FS+STRSNI+ LK  L NIS K  ++ID YIQ+I    + L +VSV+I+ ED++IY++NGLP   N     +   +  + +EE+   L  
Subjt:  QEVWEKLEKHFSSSTRSNIVGLKIELQNISKKSGETIDVYIQRINDIVNKLTAVSVVIDTEDLIIYIVNGLPSTLNV----VEIGAVKVVVEEILVDLME

Query:  TE-----------------VMVAT----------------------------------------------------------------------------
         E                  M+AT                                                                            
Subjt:  TE-----------------VMVAT----------------------------------------------------------------------------

Query:  -NGHDALECYNRMNYSYQGCHPPTKLAAMATSANFVSTNSLPQDSQVWLTDTGCNAHLTNDFNNLSISSAYNGKENIPIGNDQSLPITHQGCGKTSTIDS
         NGH AL+CY+RM++SYQG  P  +L AM+ + N  S  S       W TDTG   H+T D  NL+    Y G +NI I N Q+L I+H G       D 
Subjt:  -NGHDALECYNRMNYSYQGCHPPTKLAAMATSANFVSTNSLPQDSQVWLTDTGCNAHLTNDFNNLSISSAYNGKENIPIGNDQSLPITHQGCGKTSTIDS

Query:  SFILSNLLRVPNISTNLLSILQFFLDNDCSFTFNATSFTIQDNATGKILYHRLNINGLYPLTTSSL
        +F L+N+L VP+++TNLLS+ QF  DN C F F++  F IQD AT ++L+   + +GLYPL TSS+
Subjt:  SFILSNLLRVPNISTNLLSILQFFLDNDCSFTFNATSFTIQDNATGKILYHRLNINGLYPLTTSSL

KAA8524269.1 hypothetical protein F0562_010692 [Nyssa sinensis]5.5e-6934.72Show/hide
Query:  SSMLLLTNIRNLVFVRVDSTNYVLWRFLISPILKAHKLFRFVDGSYLASNCLLRTE--GQPPQVNLEYEKWHEKDQ----------------------TS
        S + LL+NI NL+  R+DS+NYV W+F IS ILKAH L  ++DG+Y   N  ++ E      Q+N EY+ W+ +DQ                      TS
Subjt:  SSMLLLTNIRNLVFVRVDSTNYVLWRFLISPILKAHKLFRFVDGSYLASNCLLRTE--GQPPQVNLEYEKWHEKDQ----------------------TS

Query:  QEVWEKLEKHFSSSTRSNIVGLKIELQNISKKSGETIDVYIQRINDIVNKLTAVSVVIDTEDLIIYIVNGLPSTLNV----VEIGAVKVVVEEILVDLME
        +E W  LE+ FS+STRSNI+ LK  L NIS K  ++ID YIQ+I    + L +VSV+I+ ED++IY++NGLP   N     +   +  + +EE+   L  
Subjt:  QEVWEKLEKHFSSSTRSNIVGLKIELQNISKKSGETIDVYIQRINDIVNKLTAVSVVIDTEDLIIYIVNGLPSTLNV----VEIGAVKVVVEEILVDLME

Query:  TE-----------------VMVAT----------------------------------------------------------------------------
         E                  M+AT                                                                            
Subjt:  TE-----------------VMVAT----------------------------------------------------------------------------

Query:  -NGHDALECYNRMNYSYQGCHPPTKLAAMATSANFVSTNSLPQDSQVWLTDTGCNAHLTNDFNNLSISSAYNGKENIPIGNDQSLPITHQGCGKTSTIDS
         NGH AL+CY+RM++SYQG  P  +L AM+ + N  S  S       W TDTG   H+T D  NL+    Y G +NI I N Q+L I+H G       D 
Subjt:  -NGHDALECYNRMNYSYQGCHPPTKLAAMATSANFVSTNSLPQDSQVWLTDTGCNAHLTNDFNNLSISSAYNGKENIPIGNDQSLPITHQGCGKTSTIDS

Query:  SFILSNLLRVPNISTNLLSILQFFLDNDCSFTFNATSFTIQDNATGKILYHRLNINGLYPLTTSSL-----PSTQVRL----------------------
        +F L+N+L VP+++TNLLS+ QF  DN C F F++  F IQD AT ++L+   + +GLYPL TSS+     PS Q  L                      
Subjt:  SFILSNLLRVPNISTNLLSILQFFLDNDCSFTFNATSFTIQDNATGKILYHRLNINGLYPLTTSSL-----PSTQVRL----------------------

Query:  ---IAQVGAKVSSTVCHDRLGHPCSSTLQHILHYFAFLVSKNASTNICTHCLNGKMSKLPFSFSST-STTPLELIHSDV
            A +G +VS+ + HDRLGHP ++TLQ IL   A + +   S  +C HCL GKM+KLPF  S+T ST PL+L+HSD+
Subjt:  ---IAQVGAKVSSTVCHDRLGHPCSSTLQHILHYFAFLVSKNASTNICTHCLNGKMSKLPFSFSST-STTPLELIHSDV

KAA8535282.1 hypothetical protein F0562_030285 [Nyssa sinensis]4.6e-5233.91Show/hide
Query:  SSMLLLTNIRNLVFVRVDSTNYVLWRFLISPILKAHKLFRFVDGSYLASNCLLRTE--GQPPQVNLEYEKWHEKDQ----------------------TS
        S + LL+NI NL+  R+DS+NYV W+F IS ILKAH L  ++DG+Y   N  ++ E      Q+N EY+ W+ +DQ                      TS
Subjt:  SSMLLLTNIRNLVFVRVDSTNYVLWRFLISPILKAHKLFRFVDGSYLASNCLLRTE--GQPPQVNLEYEKWHEKDQ----------------------TS

Query:  QEVWEKLEKHFSSSTRSNIVGLKIELQNISKKSGETIDVYIQRINDIVNKLTAVSVVIDTEDLIIYIVNGLPSTLNV----VEIGAVKVVVEEILVDLME
        +E W  LE+ FS+STRSNI+ LK  L NIS K  ++ID YIQ+I    + L +VSV+I+ ED++IY++NGLP   N     +   +  + +EE+   L  
Subjt:  QEVWEKLEKHFSSSTRSNIVGLKIELQNISKKSGETIDVYIQRINDIVNKLTAVSVVIDTEDLIIYIVNGLPSTLNV----VEIGAVKVVVEEILVDLME

Query:  TE-----------------VMVAT----------------------------------------------------------------------------
         E                  M+AT                                                                            
Subjt:  TE-----------------VMVAT----------------------------------------------------------------------------

Query:  -NGHDALECYNRMNYSYQGCHPPTKLAAMATSANFVSTNSLPQDSQVWLTDTGCNAHLTNDFNNLSISSAYNGKENIPIGNDQSLPITHQGCGKTSTIDS
         NGH AL+CY+RM++SYQG  P  +L AM+ + N  S  S       W TDTG   H+T D  NL+    Y G +NI I N Q+L I+H G       D 
Subjt:  -NGHDALECYNRMNYSYQGCHPPTKLAAMATSANFVSTNSLPQDSQVWLTDTGCNAHLTNDFNNLSISSAYNGKENIPIGNDQSLPITHQGCGKTSTIDS

Query:  SFILSNLLRVPNISTNLLSILQFFLDNDCSFTFNATSFTIQDNATGKILYHRLNINGLYPLTTSSL
        +F L+N+L VP+++TNLLS+ QF  DN C F F++  F IQD AT ++L+   + +GLYPL TSS+
Subjt:  SFILSNLLRVPNISTNLLSILQFFLDNDCSFTFNATSFTIQDNATGKILYHRLNINGLYPLTTSSL

RWR76373.1 putative polyprotein [Cinnamomum micranthum f. kanehirae]1.7e-5431.78Show/hide
Query:  MQSSEQNQNSSMLLLTNIRNLVFVRVDSTNYVLWRFLISPILKAHKLFRFVDGSYLASNCLLRTEGQ--PPQVNLEYEKWHEKDQ---------------
        M SS  N ++   +++NI NLV +++D  NY+LWR    P+L +H L  FVDGS L      R   +     +   + +WH +DQ               
Subjt:  MQSSEQNQNSSMLLLTNIRNLVFVRVDSTNYVLWRFLISPILKAHKLFRFVDGSYLASNCLLRTEGQ--PPQVNLEYEKWHEKDQ---------------

Query:  -------TSQEVWEKLEKHFSSSTRSNIVGLKIELQNISKKSGETIDVYIQRINDIVNKLTA--------VSVVIDTEDLIIYIVNGLPSTLNVVEIGAV
               TS+  W  +E+ F+S +R++ + LK +LQN+ +K G  +D   Q +  I++ L          V+    ++ + +  V+GL   LN+    A 
Subjt:  -------TSQEVWEKLEKHFSSSTRSNIVGLKIELQNISKKSGETIDVYIQRINDIVNKLTA--------VSVVIDTEDLIIYIVNGLPSTLNVVEIGAV

Query:  KVVVEEILVDLMETEVMVATN----------------------------------------------------------------GHDALECYNRMNYSY
          +  +   D   T +    N                                                                GH AL+CY+RM+++Y
Subjt:  KVVVEEILVDLMETEVMVATN----------------------------------------------------------------GHDALECYNRMNYSY

Query:  QGCHPPTKLAAMATSANFVSTNSLPQDSQVWLTDTGCNAHLTNDFNNLSISSAYNGKENIPIGNDQSLPITHQGCGKTSTIDSSFILSNLLRVPNISTNL
        QG HPP KLAAMA S  F          Q W TDTG   H+T++  NLS+ S Y+  + + +GN   L I+H G    ST  S+F L+N+L VP+ISTNL
Subjt:  QGCHPPTKLAAMATSANFVSTNSLPQDSQVWLTDTGCNAHLTNDFNNLSISSAYNGKENIPIGNDQSLPITHQGCGKTSTIDSSFILSNLLRVPNISTNL

Query:  LSILQFFLDNDCSFTFNATSFTIQDNATGKILYHRLNINGLYPLTTSSLP--STQVRLIAQVGAKVSSTVCHDRLGHPCSSTLQHILHYFAFLVSKNAS-
        +S+ +F  DN+C F F+++ F I+D A+GK L+   + NGLYP     LP  S      A VG +V++++ H RLGHP S+  QH+   F   V  ++  
Subjt:  LSILQFFLDNDCSFTFNATSFTIQDNATGKILYHRLNINGLYPLTTSSLP--STQVRLIAQVGAKVSSTVCHDRLGHPCSSTLQHILHYFAFLVSKNAS-

Query:  TNICTHCLNGKMSKLPFSFSST-STTPLELIHSDV
        ++ICT C  GK  KLPFS SS+ S+ PL+LIH D+
Subjt:  TNICTHCLNGKMSKLPFSFSST-STTPLELIHSDV

TrEMBL top hitse value%identityAlignment
A0A2N9G021 Integrase catalytic domain-containing protein7.4e-6433.27Show/hide
Query:  SSEQNQNSSMLLLTNIRNLVFVRVDSTNYVLWRFLISPILKAHKLFRFVDGSYLASNCLLRTEGQPPQVNLEYEKWHEKDQ-------------------
        S+  +  S + LL N+ NL+  ++DSTNY++W+  I+ IL A+ +   +DGS ++   +L  E   P VN  +  WH+K++                   
Subjt:  SSEQNQNSSMLLLTNIRNLVFVRVDSTNYVLWRFLISPILKAHKLFRFVDGSYLASNCLLRTEGQPPQVNLEYEKWHEKDQ-------------------

Query:  ---TSQEVWEKLEKHFSSSTRSNIVGLKIELQNISKKSGETIDVYIQRINDIVNKLTAVSVVIDTEDLIIYIVNGLPSTL-----------NVVEIGAVK
           TSQEVW+KLE+ F+ + R+N++ LK+ELQ+I KK  ET+  Y+QRI  + +KL+AV V  D E+L+  I+ GLP               ++ +  + 
Subjt:  ---TSQEVWEKLEKHFSSSTRSNIVGLKIELQNISKKSGETIDVYIQRINDIVNKLTAVSVVIDTEDLIIYIVNGLPSTL-----------NVVEIGAVK

Query:  VVVEEILVDLMETE---------VMVATN-----------------------------------------GHDALECYNRMNYSYQGCHPPTKLAAMATS
        V+++     + ET          + V+ N                                         GH A++CY+RM+++YQG +P TKLAAMA++
Subjt:  VVVEEILVDLMETE---------VMVATN-----------------------------------------GHDALECYNRMNYSYQGCHPPTKLAAMATS

Query:  ANFVSTNSLPQDSQVWLTDTGCNAHLTNDFNNLSISSAYNGKENIPIGNDQSLPITHQGCGKTSTIDSSFILSNLLRVPNISTNLLSILQFFLDNDCSFT
        +N   T    Q+ + WLTD+G + H+T    NL+  + Y G + + +GN Q+LPI H G  +  T    F L N+L VP I++NLLS+ +  L N CS  
Subjt:  ANFVSTNSLPQDSQVWLTDTGCNAHLTNDFNNLSISSAYNGKENIPIGNDQSLPITHQGCGKTSTIDSSFILSNLLRVPNISTNLLSILQFFLDNDCSFT

Query:  FNATSFTIQDNATGKILYHRLNINGLYPLTTSSLPSTQVRLIAQVGAKVSS---TVCHDRLGHPCSSTLQHI---LHYFAFLVSKNASTNICTHCLNGKM
        F++    IQD  TG++LY  L+ NG+YP+ +S+  ++     A     VS+    + H RLGHP +  L ++   L   +++ SK+     CTHCL GKM
Subjt:  FNATSFTIQDNATGKILYHRLNINGLYPLTTSSLPSTQVRLIAQVGAKVSS---TVCHDRLGHPCSSTLQHI---LHYFAFLVSKNASTNICTHCLNGKM

Query:  SKLPFSFSS-TSTTPLELIHSDV
         +LPFS S+  ST P  LIH+D+
Subjt:  SKLPFSFSS-TSTTPLELIHSDV

A0A2N9G7E3 Integrase catalytic domain-containing protein4.7e-6633.16Show/hide
Query:  SEQNQNSSMLLLTNIRNLVFVRVDSTNYVLWRFLISPILKAHKLFRFVDGSYLASNCLLRTEGQPPQVNLEYEKWHEKDQ--------------------
        S     + M+LL+NI NLV V++D TNY+LW+F I+  LKA+KL   VDGSY       R     P +N ++ +W  KDQ                    
Subjt:  SEQNQNSSMLLLTNIRNLVFVRVDSTNYVLWRFLISPILKAHKLFRFVDGSYLASNCLLRTEGQPPQVNLEYEKWHEKDQ--------------------

Query:  --TSQEVWEKLEKHFSSSTRSNIVGLKIELQNISKKSGETIDVYIQRINDIVNKLTAVSVVIDTEDLIIYIVNGLPS--------------TLNVVEIGA
          +++ VW+ LEK F+S +RSN++ LK +L +I KK+ E+I++Y+Q+I +  +KL AV V I+ E+++  +++GLP+              +++  E+  
Subjt:  --TSQEVWEKLEKHFSSSTRSNIVGLKIELQNISKKSGETIDVYIQRINDIVNKLTAVSVVIDTEDLIIYIVNGLPS--------------TLNVVEIGA

Query:  VKVVVEEILVDLMETE------VMVA--------------------------------------------------------------------------
        + +  E+ L    E+        MV                                                                           
Subjt:  VKVVVEEILVDLMETE------VMVA--------------------------------------------------------------------------

Query:  --TNGHDALECYNRMNYSYQGCHPPTKLAAMATSANFVSTNSLPQDSQVWLTDTGCNAHLTNDFNNLSISSAYNGKENIPIGNDQSLPITHQGCGKTSTI
           NGH AL+CY+RM++++QG HPPTKLAAMA S+N  S+N        W++DTG   H T D  NL  +  YNG + + +GN Q LPITH G  +    
Subjt:  --TNGHDALECYNRMNYSYQGCHPPTKLAAMATSANFVSTNSLPQDSQVWLTDTGCNAHLTNDFNNLSISSAYNGKENIPIGNDQSLPITHQGCGKTSTI

Query:  DSSFILSNLLRVPNISTNLLSILQFFLDNDCSFTFNATSFTIQDNATGKILYHRLNINGLYPLTTSSLPSTQVRL---------------IAQVGAKVSS
             L   LRVPN+ TNLLS+ +   DN+C F F+A+ F+IQD  +GK+LY   N  GLYP+      ST+V+                 A    KVSS
Subjt:  DSSFILSNLLRVPNISTNLLSILQFFLDNDCSFTFNATSFTIQDNATGKILYHRLNINGLYPLTTSSLPSTQVRL---------------IAQVGAKVSS

Query:  TVCHDRLGHPCSSTLQHIL-HYFAFLVSKNASTNICTHCLNGKMSKLPFSFSST-STTPLELIHSDV
        +  H RLGHP S  LQ +  H     +  ++S + C HC  GKMS+LPFS S T +T PL+L+HSDV
Subjt:  TVCHDRLGHPCSSTLQHIL-HYFAFLVSKNASTNICTHCLNGKMSKLPFSFSST-STTPLELIHSDV

A0A2N9GZR9 Integrase catalytic domain-containing protein3.0e-6534.67Show/hide
Query:  NQNSSMLLLTNIRNLVFVRVDSTNYVLWRFLISPILKAHKLFRFVDGSYLASNCLLRTEGQPPQVNLEYEKWHEKDQ----------------------T
        N +S +LLL N+ NL+  ++DSTNY++W+  I+ +L A+ +   +DGS    +  L TE     VN ++  W++KD+                      +
Subjt:  NQNSSMLLLTNIRNLVFVRVDSTNYVLWRFLISPILKAHKLFRFVDGSYLASNCLLRTEGQPPQVNLEYEKWHEKDQ----------------------T

Query:  SQEVWEKLEKHFSSSTRSNIVGLKIELQNISKKSGETIDVYIQRINDIVNKLTAVSVVIDTEDLIIYIVNGLPSTL----NVVEIGAVKVVVEEILVDLM
        SQEVW  LE+ F+S+ RSN++ LK+ELQ+I K   ET+  Y+QRI  + +KL+AV V  D E+L   I+ GLP       + +      + +E++ V L+
Subjt:  SQEVWEKLEKHFSSSTRSNIVGLKIELQNISKKSGETIDVYIQRINDIVNKLTAVSVVIDTEDLIIYIVNGLPSTL----NVVEIGAVKVVVEEILVDLM

Query:  ETE-----------------VMVATN----------------------------------------------GHDALECYNRMNYSYQGCHPPTKLAAMA
        +TE                 + V+ N                                              GH A++CY+RMN++YQG +P TKLAAMA
Subjt:  ETE-----------------VMVATN----------------------------------------------GHDALECYNRMNYSYQGCHPPTKLAAMA

Query:  TSANFVSTNSLPQDSQVWLTDTGCNAHLTNDFNNLSISSAYNGKENIPIGNDQSLPITHQGCGKTSTIDSSFILSNLLRVPNISTNLLSILQFFLDNDCS
        +++N   T    Q+ + WLTDTG   H+T + NNLS  + Y G+E + +GN Q+LPI + G  +  T    F L N+L VP I++NLLS+ +  LDN+CS
Subjt:  TSANFVSTNSLPQDSQVWLTDTGCNAHLTNDFNNLSISSAYNGKENIPIGNDQSLPITHQGCGKTSTIDSSFILSNLLRVPNISTNLLSILQFFLDNDCS

Query:  FTFNATSFTIQDNATGKILYHRLNINGLYPLTTSSLPSTQVRLIAQVGAKVSST---VCHDRLGHPCSSTLQHILHYFAFLVSKNASTNICTHCLNGKMS
          F+A  F+IQD  TG+ILY  L+ NG+YP+  S +       IA     +SS    + H RLGHP +  L ++   F+   S       C HCL GKM 
Subjt:  FTFNATSFTIQDNATGKILYHRLNINGLYPLTTSSLPSTQVRLIAQVGAKVSST---VCHDRLGHPCSSTLQHILHYFAFLVSKNASTNICTHCLNGKMS

Query:  KLPFSFSS-TSTTPLELIHSDV
        +LPF  S+ T T+P EL+H+D+
Subjt:  KLPFSFSS-TSTTPLELIHSDV

A0A2N9IEP2 Uncharacterized protein1.1e-6736.51Show/hide
Query:  MLLLTNIRNLVFVRVDSTNYVLWRFLISPILKAHKLFRFVDGSYLASNCLLRTEGQPPQVNLEYEKWHEKDQ----------------------TSQEVW
        M+LL+NI NLV V++D +NYVLW++ I+ ILKA+ +  FVDG+       L+      Q N  Y++W  +DQ                      T+  VW
Subjt:  MLLLTNIRNLVFVRVDSTNYVLWRFLISPILKAHKLFRFVDGSYLASNCLLRTEGQPPQVNLEYEKWHEKDQ----------------------TSQEVW

Query:  EKLEKHFSSSTRSNIVGLKIELQNISKKSGETIDVYIQRINDIVNKLTAVSVVIDTEDLIIYIVNGLP--------------------------------
          LEK ++SS+RSNI+ LK+EL +I K+S ++I+ ++Q+I D  ++L AV V ID E+++  ++ GLP                                
Subjt:  EKLEKHFSSSTRSNIVGLKIELQNISKKSGETIDVYIQRINDIVNKLTAVSVVIDTEDLIIYIVNGLP--------------------------------

Query:  -------STLNVVEIGAVKVVVEEILVDLMETE---VMVATNGHDALECYNRMNYSYQGCHPPTKLAAMATSANFVSTNSLPQDSQVWLTDTGCNAHLTN
               S+ N   +G                     +   NGH AL+CY+RM+YSYQG  PP+KLAAMA      ++NS   D   W++DTG   H T 
Subjt:  -------STLNVVEIGAVKVVVEEILVDLMETE---VMVATNGHDALECYNRMNYSYQGCHPPTKLAAMATSANFVSTNSLPQDSQVWLTDTGCNAHLTN

Query:  DFNNLSISSAYNGKENIPIGNDQSLPITHQGCGKTSTIDSSFILSNLLRVPNISTNLLSILQFFLDNDCSFTFNATSFTIQDNATGKILYHRLNINGLYP
        D + +     Y G +   +GN Q++PITH G  +       F L  +LRVP++++NLLS+ +F  DN+C F F+A  F I+D  TGK+LY   + NGLYP
Subjt:  DFNNLSISSAYNGKENIPIGNDQSLPITHQGCGKTSTIDSSFILSNLLRVPNISTNLLSILQFFLDNDCSFTFNATSFTIQDNATGKILYHRLNINGLYP

Query:  LTTSSLP---STQVRLIAQVGAKVSSTVCHDRLGHPCSSTLQHIL-HYFAFLVSKNASTNICTHCLNGKMSKLPFSFS-STSTTPLELIHSDV
        +   SLP    T      Q    VSS V HDRLGHP S   Q I  +      S N + + CTHC+ GKM+ LPF  S S +  PLE+IHSDV
Subjt:  LTTSSLP---STQVRLIAQVGAKVSSTVCHDRLGHPCSSTLQHIL-HYFAFLVSKNASTNICTHCLNGKMSKLPFSFS-STSTTPLELIHSDV

A0A5J5A1U7 Integrase catalytic domain-containing protein2.6e-6934.72Show/hide
Query:  SSMLLLTNIRNLVFVRVDSTNYVLWRFLISPILKAHKLFRFVDGSYLASNCLLRTE--GQPPQVNLEYEKWHEKDQ----------------------TS
        S + LL+NI NL+  R+DS+NYV W+F IS ILKAH L  ++DG+Y   N  ++ E      Q+N EY+ W+ +DQ                      TS
Subjt:  SSMLLLTNIRNLVFVRVDSTNYVLWRFLISPILKAHKLFRFVDGSYLASNCLLRTE--GQPPQVNLEYEKWHEKDQ----------------------TS

Query:  QEVWEKLEKHFSSSTRSNIVGLKIELQNISKKSGETIDVYIQRINDIVNKLTAVSVVIDTEDLIIYIVNGLPSTLNV----VEIGAVKVVVEEILVDLME
        +E W  LE+ FS+STRSNI+ LK  L NIS K  ++ID YIQ+I    + L +VSV+I+ ED++IY++NGLP   N     +   +  + +EE+   L  
Subjt:  QEVWEKLEKHFSSSTRSNIVGLKIELQNISKKSGETIDVYIQRINDIVNKLTAVSVVIDTEDLIIYIVNGLPSTLNV----VEIGAVKVVVEEILVDLME

Query:  TE-----------------VMVAT----------------------------------------------------------------------------
         E                  M+AT                                                                            
Subjt:  TE-----------------VMVAT----------------------------------------------------------------------------

Query:  -NGHDALECYNRMNYSYQGCHPPTKLAAMATSANFVSTNSLPQDSQVWLTDTGCNAHLTNDFNNLSISSAYNGKENIPIGNDQSLPITHQGCGKTSTIDS
         NGH AL+CY+RM++SYQG  P  +L AM+ + N  S  S       W TDTG   H+T D  NL+    Y G +NI I N Q+L I+H G       D 
Subjt:  -NGHDALECYNRMNYSYQGCHPPTKLAAMATSANFVSTNSLPQDSQVWLTDTGCNAHLTNDFNNLSISSAYNGKENIPIGNDQSLPITHQGCGKTSTIDS

Query:  SFILSNLLRVPNISTNLLSILQFFLDNDCSFTFNATSFTIQDNATGKILYHRLNINGLYPLTTSSL-----PSTQVRL----------------------
        +F L+N+L VP+++TNLLS+ QF  DN C F F++  F IQD AT ++L+   + +GLYPL TSS+     PS Q  L                      
Subjt:  SFILSNLLRVPNISTNLLSILQFFLDNDCSFTFNATSFTIQDNATGKILYHRLNINGLYPLTTSSL-----PSTQVRL----------------------

Query:  ---IAQVGAKVSSTVCHDRLGHPCSSTLQHILHYFAFLVSKNASTNICTHCLNGKMSKLPFSFSST-STTPLELIHSDV
            A +G +VS+ + HDRLGHP ++TLQ IL   A + +   S  +C HCL GKM+KLPF  S+T ST PL+L+HSD+
Subjt:  ---IAQVGAKVSSTVCHDRLGHPCSSTLQHILHYFAFLVSKNASTNICTHCLNGKMSKLPFSFSST-STTPLELIHSDV

SwissProt top hitse value%identityAlignment
Q94HW2 Retrovirus-related Pol polyprotein from transposon RE18.9e-3024.95Show/hide
Query:  NSSMLLLTNIRNLVFVRVDSTNYVLWRFLISPILKAHKLFRFVDGSYLASNCLLRTEGQPPQVNLEYEKWHEKDQ----------------------TSQ
        N++ +L  N+ N+   ++ STNY++W   +  +   ++L  F+DGS       + T+   P+VN +Y +W  +D+                      T+ 
Subjt:  NSSMLLLTNIRNLVFVRVDSTNYVLWRFLISPILKAHKLFRFVDGSYLASNCLLRTEGQPPQVNLEYEKWHEKDQ----------------------TSQ

Query:  EVWEKLEKHFSSSTRSNIVGLKIELQNISKKSGETIDVYIQRINDIVNKLTAVSVVIDTEDLIIYIVNGLPSTLNVV--EIGA--VKVVVEEILVDLMET
        ++WE L K +++ +  ++  L+ +L+  +K + +TID Y+Q +    ++L  +   +D ++ +  ++  LP     V  +I A      + EI   L+  
Subjt:  EVWEKLEKHFSSSTRSNIVGLKIELQNISKKSGETIDVYIQRINDIVNKLTAVSVVIDTEDLIIYIVNGLPSTLNVV--EIGA--VKVVVEEILVDLMET

Query:  EVMV---------------------------------------------------ATN---------------------GHDALECYNRMNY--SYQGCH
        E  +                                                   +TN                     GH A  C    ++  S     
Subjt:  EVMV---------------------------------------------------ATN---------------------GHDALECYNRMNY--SYQGCH

Query:  PPTKLAAMATSANFVSTNSLPQDSQVWLTDTGCNAHLTNDFNNLSISSAYNGKENIPIGNDQSLPITHQGCGKTSTIDSSFILSNLLRVPNISTNLLSIL
        PP+        AN    +  P  S  WL D+G   H+T+DFNNLS+   Y G +++ + +  ++PI+H G    ST      L N+L VPNI  NL+S+ 
Subjt:  PPTKLAAMATSANFVSTNSLPQDSQVWLTDTGCNAHLTNDFNNLSISSAYNGKENIPIGNDQSLPITHQGCGKTSTIDSSFILSNLLRVPNISTNLLSIL

Query:  QFFLDNDCSFTFNATSFTIQDNATGKILYHRLNINGLYPLTTSSLPSTQVRLIAQVGAKVSSTVCHDRLGHPCSSTLQHIL-HYFAFLVSKNASTNICTH
        +    N  S  F   SF ++D  TG  L      + LY    +S  S  V L A   +K + +  H RLGHP  S L  ++ +Y   +++ +     C+ 
Subjt:  QFFLDNDCSFTFNATSFTIQDNATGKILYHRLNINGLYPLTTSSLPSTQVRLIAQVGAKVSSTVCHDRLGHPCSSTLQHIL-HYFAFLVSKNASTNICTH

Query:  CLNGKMSKLPFSFSS-TSTTPLELIHSDV
        CL  K +K+PFS S+  ST PLE I+SDV
Subjt:  CLNGKMSKLPFSFSS-TSTTPLELIHSDV

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE24.7e-2323.14Show/hide
Query:  MLLLTNIRNLVF---VRVDSTNYVLWRFLISPILKAHKLFRFVDGSYLASNCLLRTEGQPPQVNLEYEKWHEKDQ----------------------TSQ
        +L+ TNI N+      ++ STNY++W   +  +   ++L  F+DGS       + T+   P+VN +Y +W  +D+                      T+ 
Subjt:  MLLLTNIRNLVF---VRVDSTNYVLWRFLISPILKAHKLFRFVDGSYLASNCLLRTEGQPPQVNLEYEKWHEKDQ----------------------TSQ

Query:  EVWEKLEKHFSSSTRSNIVGLK-------IELQNISKKSGETIDVYIQRIND----IVNKLTAVSVVIDTEDLIIYIVNGLPSTLNVVEIGAVKVVVEEI
        ++WE L K +++ +  ++  L+       + L        E ++  ++ + D    +++++ A        ++   ++N     L +     V +    +
Subjt:  EVWEKLEKHFSSSTRSNIVGLK-------IELQNISKKSGETIDVYIQRIND----IVNKLTAVSVVIDTEDLIIYIVNGLPSTLNVVEIGAVKVVVEEI

Query:  LVDLMETE-------------------------------------------VMVATNGHDALEC-----YNRMNYSYQGCHPPTKLAAMATSANFVSTNS
              T                                             + +  GH A  C     +       Q   P T     A     ++ NS
Subjt:  LVDLMETE-------------------------------------------VMVATNGHDALEC-----YNRMNYSYQGCHPPTKLAAMATSANFVSTNS

Query:  LPQDSQVWLTDTGCNAHLTNDFNNLSISSAYNGKENIPIGNDQSLPITHQGCGKTSTIDSSFILSNLLRVPNISTNLLSILQFFLDNDCSFTFNATSFTI
         P ++  WL D+G   H+T+DFNNLS    Y G +++ I +  ++PITH G     T   S  L+ +L VPNI  NL+S+ +    N  S  F   SF +
Subjt:  LPQDSQVWLTDTGCNAHLTNDFNNLSISSAYNGKENIPIGNDQSLPITHQGCGKTSTIDSSFILSNLLRVPNISTNLLSILQFFLDNDCSFTFNATSFTI

Query:  QDNATGKILYHRLNINGLYPLTTSSLPSTQVRLIAQVGAKVSSTVCHDRLGHPCSSTLQHILHYFAF-LVSKNASTNICTHCLNGKMSKLPFSFSS-TST
        +D  TG  L      + LY    +S  S  V + A   +K + +  H RLGHP  + L  ++   +  +++ +     C+ C   K  K+PFS S+ TS+
Subjt:  QDNATGKILYHRLNINGLYPLTTSSLPSTQVRLIAQVGAKVSSTVCHDRLGHPCSSTLQHILHYFAF-LVSKNASTNICTHCLNGKMSKLPFSFSS-TST

Query:  TPLELIHSDV
         PLE I+SDV
Subjt:  TPLELIHSDV

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCAATCCAGTGAGCAGAACCAAAATTCTTCCATGCTTCTTCTCACAAACATTCGCAACTTAGTCTTTGTTCGTGTTGATTCCACAAATTACGTTTTGTGGCGATTTCT
GATCTCTCCGATTCTCAAGGCTCACAAACTTTTTCGTTTTGTTGATGGATCTTATCTTGCATCTAATTGTCTTCTTCGTACCGAAGGACAACCTCCACAAGTGAATCTTG
AATATGAGAAGTGGCATGAGAAAGATCAAACCTCTCAAGAAGTTTGGGAAAAACTGGAGAAACACTTTTCTTCATCTACTCGGTCGAATATAGTTGGTTTGAAAATAGAG
TTGCAGAACATTTCAAAGAAATCAGGAGAAACGATTGACGTTTATATTCAGAGGATCAATGATATCGTTAACAAGCTCACTGCTGTTTCAGTTGTAATTGATACTGAAGA
TTTGATTATATACATAGTTAATGGACTGCCTTCAACTTTAAATGTCGTGGAAATTGGTGCGGTCAAGGTCGTGGTAGAAGAAATTCTGGTTGATTTGATGGAAACCGAGG
TCATGGTCGCAACGAATGGTCACGATGCCTTAGAATGCTATAATAGGATGAATTACTCCTATCAAGGATGTCATCCCCCAACCAAACTTGCTGCAATGGCAACCTCTGCA
AATTTTGTCTCAACAAATTCTCTGCCTCAAGATTCTCAGGTATGGTTAACTGATACTGGTTGTAATGCTCATTTAACTAATGATTTCAATAATCTCAGTATATCTTCAGC
TTACAATGGAAAGGAGAATATACCTATTGGAAATGACCAGTCACTTCCAATAACTCACCAAGGTTGTGGTAAAACCTCTACTATAGACTCCTCTTTCATTCTATCCAATC
TACTTCGTGTTCCAAATATTTCCACTAATTTGCTTTCCATTCTTCAATTCTTTCTTGACAATGACTGTTCCTTTACCTTTAATGCTACTTCCTTCACTATTCAGGACAAT
GCTACGGGCAAAATTTTGTACCATAGACTCAACATTAATGGCCTTTATCCTCTAACTACTTCTTCATTACCATCAACACAAGTACGCCTTATAGCTCAAGTTGGTGCTAA
AGTTTCATCTACTGTGTGTCATGATAGGTTAGGCCACCCTTGTTCTTCTACTCTTCAACATATTCTTCATTATTTTGCTTTCCTTGTATCCAAAAATGCCTCAACTAATA
TATGTACACATTGTCTTAATGGGAAGATGTCTAAGCTTCCTTTTTCTTTTTCATCTACTTCTACTACTCCTTTAGAGCTTATCCATAGTGATGTATGA
mRNA sequenceShow/hide mRNA sequence
ATGCAATCCAGTGAGCAGAACCAAAATTCTTCCATGCTTCTTCTCACAAACATTCGCAACTTAGTCTTTGTTCGTGTTGATTCCACAAATTACGTTTTGTGGCGATTTCT
GATCTCTCCGATTCTCAAGGCTCACAAACTTTTTCGTTTTGTTGATGGATCTTATCTTGCATCTAATTGTCTTCTTCGTACCGAAGGACAACCTCCACAAGTGAATCTTG
AATATGAGAAGTGGCATGAGAAAGATCAAACCTCTCAAGAAGTTTGGGAAAAACTGGAGAAACACTTTTCTTCATCTACTCGGTCGAATATAGTTGGTTTGAAAATAGAG
TTGCAGAACATTTCAAAGAAATCAGGAGAAACGATTGACGTTTATATTCAGAGGATCAATGATATCGTTAACAAGCTCACTGCTGTTTCAGTTGTAATTGATACTGAAGA
TTTGATTATATACATAGTTAATGGACTGCCTTCAACTTTAAATGTCGTGGAAATTGGTGCGGTCAAGGTCGTGGTAGAAGAAATTCTGGTTGATTTGATGGAAACCGAGG
TCATGGTCGCAACGAATGGTCACGATGCCTTAGAATGCTATAATAGGATGAATTACTCCTATCAAGGATGTCATCCCCCAACCAAACTTGCTGCAATGGCAACCTCTGCA
AATTTTGTCTCAACAAATTCTCTGCCTCAAGATTCTCAGGTATGGTTAACTGATACTGGTTGTAATGCTCATTTAACTAATGATTTCAATAATCTCAGTATATCTTCAGC
TTACAATGGAAAGGAGAATATACCTATTGGAAATGACCAGTCACTTCCAATAACTCACCAAGGTTGTGGTAAAACCTCTACTATAGACTCCTCTTTCATTCTATCCAATC
TACTTCGTGTTCCAAATATTTCCACTAATTTGCTTTCCATTCTTCAATTCTTTCTTGACAATGACTGTTCCTTTACCTTTAATGCTACTTCCTTCACTATTCAGGACAAT
GCTACGGGCAAAATTTTGTACCATAGACTCAACATTAATGGCCTTTATCCTCTAACTACTTCTTCATTACCATCAACACAAGTACGCCTTATAGCTCAAGTTGGTGCTAA
AGTTTCATCTACTGTGTGTCATGATAGGTTAGGCCACCCTTGTTCTTCTACTCTTCAACATATTCTTCATTATTTTGCTTTCCTTGTATCCAAAAATGCCTCAACTAATA
TATGTACACATTGTCTTAATGGGAAGATGTCTAAGCTTCCTTTTTCTTTTTCATCTACTTCTACTACTCCTTTAGAGCTTATCCATAGTGATGTATGA
Protein sequenceShow/hide protein sequence
MQSSEQNQNSSMLLLTNIRNLVFVRVDSTNYVLWRFLISPILKAHKLFRFVDGSYLASNCLLRTEGQPPQVNLEYEKWHEKDQTSQEVWEKLEKHFSSSTRSNIVGLKIE
LQNISKKSGETIDVYIQRINDIVNKLTAVSVVIDTEDLIIYIVNGLPSTLNVVEIGAVKVVVEEILVDLMETEVMVATNGHDALECYNRMNYSYQGCHPPTKLAAMATSA
NFVSTNSLPQDSQVWLTDTGCNAHLTNDFNNLSISSAYNGKENIPIGNDQSLPITHQGCGKTSTIDSSFILSNLLRVPNISTNLLSILQFFLDNDCSFTFNATSFTIQDN
ATGKILYHRLNINGLYPLTTSSLPSTQVRLIAQVGAKVSSTVCHDRLGHPCSSTLQHILHYFAFLVSKNASTNICTHCLNGKMSKLPFSFSSTSTTPLELIHSDV