; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0018769 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0018769
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionIntegrase catalytic domain-containing protein
Genome locationchr5:34155454..34156772
RNA-Seq ExpressionLag0018769
SyntenyLag0018769
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR025724 - GAG-pre-integrase domain
IPR036397 - Ribonuclease H superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA8524269.1 hypothetical protein F0562_010692 [Nyssa sinensis]5.3e-5837.12Show/hide
Query:  MRTRSQFVTFKELHVLLKTEEAAIEKHMKHDDALTQPAAMFASQSTPN-SSQRSNPSGNFGRGRSSGRGRNQNRGGRGFNPSGRGNFNQGRGSFYFP---
        +RT+S+ +T +E++ +LK EE  IE   K +++   P AM A+   PN SS R     NF  GR  GRGR  NRGGR  +     + N G+ +  +P   
Subjt:  MRTRSQFVTFKELHVLLKTEEAAIEKHMKHDDALTQPAAMFASQSTPN-SSQRSNPSGNFGRGRSSGRGRNQNRGGRGFNPSGRGNFNQGRGSFYFP---

Query:  -----QSSDGQGRVSCQTCQRPGHGAIDCYNRMNYHFQGRHPPTQLATMVASQNVAYCNSAPRSMHNGASSSSSTWLTGSGCNAHLTSDLNNLTIASEYA
             Q S+    V CQ C + GH A+DCY+RM++ +QG+ P  QL  M A+ N             G+  S + W T +G   H+T+DL NL    EY 
Subjt:  -----QSSDGQGRVSCQTCQRPGHGAIDCYNRMNYHFQGRHPPTQLATMVASQNVAYCNSAPRSMHNGASSSSSTWLTGSGCNAHLTSDLNNLTIASEYA

Query:  GDDQVSVGSGQSLSISHNGSGSLHTSSSFSFKLNNLL--------------------------------YDKSTGKVLFQGPSINGLYPL--SFIHSSTA
        GDD +++ +GQ+L ISH+G  S+H ++  +F+LNN+L                                 DK+T ++LFQGPS +GLYPL  S I   +A
Subjt:  GDDQVSVGSGQSLSISHNGSGSLHTSSSFSFKLNNLL--------------------------------YDKSTGKVLFQGPSINGLYPL--SFIHSSTA

Query:  PSC----------------------------YVAHVATNKSYSLWHNRLGHPGHYVLNCVLRSLGLSTFPISSCICKHCASGKMTRLPFSRSFTASAFPL
        PS                             + A++    S  LWH+RLGHP    L  +L S  ++    S+ +C+HC  GKMT+LPF  S T S  PL
Subjt:  PSC----------------------------YVAHVATNKSYSLWHNRLGHPGHYVLNCVLRSLGLSTFPISSCICKHCASGKMTRLPFSRSFTASAFPL

Query:  QLVHSDVWGPAPKTSVDGFNYYVSFIDDHSK
        QLVHSD+WGPAP TS D F YYVSF+DD S+
Subjt:  QLVHSDVWGPAPKTSVDGFNYYVSFIDDHSK

KAF8394586.1 hypothetical protein HHK36_020800 [Tetracentron sinense]5.1e-6137.53Show/hide
Query:  VTFKELHVLLKTEEAAIEKHMKHDDALTQPA-AMFASQSTPNSSQRSNPSGNFGRGRSSGRGRNQNRGGRGFNPSGRGNFNQGRGSFYFPQSSDGQGRVS
        + F++L   L T E  ++ H      L QP+ A F ++ST + +Q     G+ G     GRGR +N GGR  NPS                S+      S
Subjt:  VTFKELHVLLKTEEAAIEKHMKHDDALTQPA-AMFASQSTPNSSQRSNPSGNFGRGRSSGRGRNQNRGGRGFNPSGRGNFNQGRGSFYFPQSSDGQGRVS

Query:  CQTCQRPGHGAIDCYNRMNYHFQGRHPPTQLATMVASQNVAYCNSAPRSMHNGASSSSSTWLTGSGCNAHLTSDLNNLTIASEYAGDDQVSVGSGQSLSI
        CQ C R GH A+DC++R++  FQGR PP +L  M A +                    STW T +G   H+T+DLNNL++ S+Y G D ++VG+G+ L I
Subjt:  CQTCQRPGHGAIDCYNRMNYHFQGRHPPTQLATMVASQNVAYCNSAPRSMHNGASSSSSTWLTGSGCNAHLTSDLNNLTIASEYAGDDQVSVGSGQSLSI

Query:  SHNGSGSLHTSSSFSFKLNNLLY--------------------------------DKSTGKVLFQGPSINGLYPLSF-IHSSTAP-SCYVAHVATNKSYS
         H GS S ++S+  +F + ++L+                                D  +GK+LFQG S  GLYPL F  H    P S   A  ++  S S
Subjt:  SHNGSGSLHTSSSFSFKLNNLLY--------------------------------DKSTGKVLFQGPSINGLYPLSF-IHSSTAP-SCYVAHVATNKSYS

Query:  LWHNRLGHPGHYVLNCVLRSLGLSTFPISSCICKHCASGKMTRLPFSRSFTASAFPLQLVHSDVWGPAPKTSVDGFNYYVSFIDDHSKFTWLYPIARKSD
        +WH+RLGHP H  L  V   + L        IC  C  GK +RLPF  S + S +PL+LVH+DVWGP+  TS++G  +YV+FIDD S++ W++P+A KS 
Subjt:  LWHNRLGHPGHYVLNCVLRSLGLSTFPISSCICKHCASGKMTRLPFSRSFTASAFPLQLVHSDVWGPAPKTSVDGFNYYVSFIDDHSKFTWLYPIARKSD

Query:  VPTVFQRFKPLVENLFSTRIKTLRTDGGG
        V  VF RFK LVE +F  +IK+L+TDGGG
Subjt:  VPTVFQRFKPLVENLFSTRIKTLRTDGGG

PRQ43209.1 putative RNA-directed DNA polymerase [Rosa chinensis]2.6e-5735.04Show/hide
Query:  MRTRSQF--VTFKELHVLLKTEEAAIEKHMKHDDALTQPAAMFASQSTPNSSQRSNPSGNFGRGRSSGRGRNQNRGGRGFNPSGRGNFNQGRGSFYFPQS
        +RTR +   VT  ELH LL  EE  +E   + +  L+ PA+ FA+ ST  S+ R        RG S  RGR +N   RG            RGSF  P S
Subjt:  MRTRSQF--VTFKELHVLLKTEEAAIEKHMKHDDALTQPAAMFASQSTPNSSQRSNPSGNFGRGRSSGRGRNQNRGGRGFNPSGRGNFNQGRGSFYFPQS

Query:  SDGQGRVSCQTCQRPGHGAIDCYNRMNYHFQGRHPPTQLATMVASQNV------------------------------AYCNSAPRSMHNGASSSSSTWL
           +  + CQ C R GH AIDCYNRMN+ + GR PP +L  + A+                                  Y  S+P    + A  ++STWL
Subjt:  SDGQGRVSCQTCQRPGHGAIDCYNRMNYHFQGRHPPTQLATMVASQNV------------------------------AYCNSAPRSMHNGASSSSSTWL

Query:  TGSGCNAHLTSDLNNLTIASEYAGDDQVSVGSGQSLSISHNGSGSLHTSSSFSFKLNNLLY--------------------------------DKSTGKV
          SG + H+T DL++L+  + Y G+ +++VG+G +L I+H GS  +HT S+ SFKL N+L+                                D    + 
Subjt:  TGSGCNAHLTSDLNNLTIASEYAGDDQVSVGSGQSLSISHNGSGSLHTSSSFSFKLNNLLY--------------------------------DKSTGKV

Query:  LFQGPSINGLYPLSF------IHSSTAPSCYVAHVATNKSYSLWHNRLGHPGHYVLNCVLRSLGLSTFPISSCICKHCASGKMTRLPFSRSFTASAFPLQ
        L QGP  +G+YP+ F        S+ + S   AH  +  S +LWH R G     ++N VL++L +     +  +C HC  GK  RLPFS S  +S+ PLQ
Subjt:  LFQGPSINGLYPLSF------IHSSTAPSCYVAHVATNKSYSLWHNRLGHPGHYVLNCVLRSLGLSTFPISSCICKHCASGKMTRLPFSRSFTASAFPLQ

Query:  LVHSDVWGPAPKTSVDGFNYYVSFIDDHSKFTWLYPIARKSDVPTVFQRFKPLVENLFSTRIKTLRTD
        L+H+D+WGPA  +SV G+ +++S +DD S+  W+ P+ +KSDVP  F  FK  +ENL ST I+TLR D
Subjt:  LVHSDVWGPAPKTSVDGFNYYVSFIDDHSKFTWLYPIARKSDVPTVFQRFKPLVENLFSTRIKTLRTD

RWR76373.1 putative polyprotein [Cinnamomum micranthum f. kanehirae]1.8e-6137.47Show/hide
Query:  RSQFVTFKELHVLLKTEEAAIEKHMKHDDALTQPAAMFASQSTPNSSQRSNPSGNFGR-GRSSGRGRNQNRGGRGFNPSGRGNFNQGRGSFYFPQSSDGQ
        +S  V+   +H LL   E  I  H     + +  A    +  TP + Q +N +   GR GR  GRG+N  RG   FNPS   N          P ++ G 
Subjt:  RSQFVTFKELHVLLKTEEAAIEKHMKHDDALTQPAAMFASQSTPNSSQRSNPSGNFGR-GRSSGRGRNQNRGGRGFNPSGRGNFNQGRGSFYFPQSSDGQ

Query:  GRVSCQTCQRPGHGAIDCYNRMNYHFQGRHPPTQLATMVASQNVAYCNSAPRSMHNGASSSSSTWLTGSGCNAHLTSDLNNLTIASEYAGDDQVSVGSGQ
         RV CQ C R GH A+DCY+RM++ +QG HPP +LA M AS                  S    W T +G   H+TS++ NL++ S+Y   D+VSVG+G 
Subjt:  GRVSCQTCQRPGHGAIDCYNRMNYHFQGRHPPTQLATMVASQNVAYCNSAPRSMHNGASSSSSTWLTGSGCNAHLTSDLNNLTIASEYAGDDQVSVGSGQ

Query:  SLSISHNGSGSLHTSSSFSFKLNNLL--------------------------------YDKSTGKVLFQGPSINGLYPLSF--IHSSTAPSCYVAHVATN
         L ISH GS S+ T SS +F+LNN+L                                 DK++GK LF+G S NGLYP     + + +    + A V   
Subjt:  SLSISHNGSGSLHTSSSFSFKLNNLL--------------------------------YDKSTGKVLFQGPSINGLYPLSF--IHSSTAPSCYVAHVATN

Query:  KSYSLWHNRLGHPGHYVLNCVLRSLGLSTFPIS--SCICKHCASGKMTRLPFSRSFTASAFPLQLVHSDVWGPAPKTSVDGFNYYVSFIDDHSKFTWLYP
         + S+WH+RLGHP   V   +  +  L     S  S IC  C  GK  +LPFS S + S+ PL L+H D+WG +P+ S+ G++YYVSFIDD +K+ W YP
Subjt:  KSYSLWHNRLGHPGHYVLNCVLRSLGLSTFPIS--SCICKHCASGKMTRLPFSRSFTASAFPLQLVHSDVWGPAPKTSVDGFNYYVSFIDDHSKFTWLYP

Query:  IARKSDVPTVFQRFKPLVENLFSTRIKTLRTDGGG
        +A KS     F +FK  VEN+ ST IK  ++DGGG
Subjt:  IARKSDVPTVFQRFKPLVENLFSTRIKTLRTDGGG

TQD84801.1 hypothetical protein C1H46_029649 [Malus baccata]3.6e-5436.98Show/hide
Query:  TQPAAMFASQSTPNSSQRSNPSGNFGRGRSSGRGRNQN-RGGRGFNPSGRGN--------FNQGRGSFYFPQSSDGQGRVSCQTCQRPGHGAIDCYNRMN
        T P A  A Q   ++S R N          S RGRN      RG   + RGN        FN+G  S     SS G  R SCQ C  P H AIDC++RMN
Subjt:  TQPAAMFASQSTPNSSQRSNPSGNFGRGRSSGRGRNQN-RGGRGFNPSGRGN--------FNQGRGSFYFPQSSDGQGRVSCQTCQRPGHGAIDCYNRMN

Query:  YHFQGRHPPTQLATMVASQNVAYCNSAPRSMHNGASSSSSTWLTGSGCNAHLTSDLNNLTIASEYAGDDQVSVGSGQSLSISHNGSGSLHTSSSFSFKLN
            G+ PP +LA M A              H  + SSS +WL  SG  +H+T+D++N++  S Y G+D+V +G G+ LSI+H GS  LH +   SFKLN
Subjt:  YHFQGRHPPTQLATMVASQNVAYCNSAPRSMHNGASSSSSTWLTGSGCNAHLTSDLNNLTIASEYAGDDQVSVGSGQSLSISHNGSGSLHTSSSFSFKLN

Query:  NLLY--------------------------------DKSTGKVLFQGPSINGLYPL-SFIHSSTAPSCYVAHVATNKSYSLWHNRLGHPGHYVLNCVLRS
        N+L+                                D+STGK+L +GP  +G YPL SF  S  + +   A V+      +WH+RLGHP   +   V+ S
Subjt:  NLLY--------------------------------DKSTGKVLFQGPSINGLYPL-SFIHSSTAPSCYVAHVATNKSYSLWHNRLGHPGHYVLNCVLRS

Query:  LGLSTFPISSC--ICKHCASGKMTRLPFSRSFTASAFPLQLVHSDVWGPAPKTSVDGFNYYVSFIDDHSKFTWLYPIARKSDVPTVFQRFKPLVENLFST
          L+    SS    C  CA  K  +L FS + +++   L L+H DVWGPAP  SV GF YY+  +DD+++++W +P+ RKS+V + F  FK  VE     
Subjt:  LGLSTFPISSC--ICKHCASGKMTRLPFSRSFTASAFPLQLVHSDVWGPAPKTSVDGFNYYVSFIDDHSKFTWLYPIARKSDVPTVFQRFKPLVENLFST

Query:  RIKTLRTDGGG
        +IKT+R+D GG
Subjt:  RIKTLRTDGGG

TrEMBL top hitse value%identityAlignment
A0A2N9FKJ8 Uncharacterized protein1.6e-6841.16Show/hide
Query:  MRTRSQFVTFKELHVLLKTEEAAIEKHMKHDDALTQPAAMFASQSTPNSSQRSNPSGNFGRGRSSGRGRNQNRGGRG-FNPSGRGNFNQGRGSFYFPQSS
        +RTR++ V F+E+ VLL+TEE +I +       +    AMFAS +  N +  S  S      ++ GRGRN ++ GRG FN + + +        +   S 
Subjt:  MRTRSQFVTFKELHVLLKTEEAAIEKHMKHDDALTQPAAMFASQSTPNSSQRSNPSGNFGRGRSSGRGRNQNRGGRG-FNPSGRGNFNQGRGSFYFPQSS

Query:  DGQ-----GRVSCQTCQRPGHGAIDCYNRMNYHFQGRHPPTQLATMVASQNVAYCNSAPRSMHNGASSSSSTWLTGSGCNAHLTSDLNNLTIASEYAGDD
          Q      R  CQ C + GH A+DCY+RM++ +QGRHPP +LA M ++ N               S    TWLT +G   HLT++LNNLT+ + Y G D
Subjt:  DGQ-----GRVSCQTCQRPGHGAIDCYNRMNYHFQGRHPPTQLATMVASQNVAYCNSAPRSMHNGASSSSSTWLTGSGCNAHLTSDLNNLTIASEYAGDD

Query:  QVSVGSGQSLSISHNGSGSLH-----TSSSFSFKLNN-LLYDKSTGKVLFQGPSINGLYPLS-------FIHSSTAPSCYVAHVATNKSYSLWHNRLGHP
        QV+VG+GQS+ I  N +G++H      + S  F  N  L+ D  +GKVL++G S NGLYP+           S TA S   A +++   + LWH RLGHP
Subjt:  QVSVGSGQSLSISHNGSGSLH-----TSSSFSFKLNN-LLYDKSTGKVLFQGPSINGLYPLS-------FIHSSTAPSCYVAHVATNKSYSLWHNRLGHP

Query:  GHYVLNCVLRSLGLSTFPISSCI----------CKHCASGKMTRLPFSRSFTASAFPLQLVHSDVWGPAPKTSVDGFNYYVSFIDDHSKFTWLYPIARKS
           VL   L SL       SSC+          CKHC  GKM +LPF  S   S  PL+L+HSDVWGPAP  S +G+ YY+ F+DD S+F+WLY +  KS
Subjt:  GHYVLNCVLRSLGLSTFPISSCI----------CKHCASGKMTRLPFSRSFTASAFPLQLVHSDVWGPAPKTSVDGFNYYVSFIDDHSKFTWLYPIARKS

Query:  DVPTVFQRFKPLVENLFSTRIKTLRTDGGG
        DV ++F+ FK  VEN FST+IK LRTD GG
Subjt:  DVPTVFQRFKPLVENLFSTRIKTLRTDGGG

A0A2N9FUJ5 Uncharacterized protein5.2e-6738.35Show/hide
Query:  MRTRSQFVTFKELHVLLKTEEAAIEKHMKHDDALTQPAAMFASQSTPNSSQRSNPSGNFGRGRSSGRGRNQNRGGRGFN-------------PSGRGNFN
        +RTR++ ++F+E+ VLL+TEE ++ +       L    AMFAS +  N +  S  S      ++ GRGRN N+ GRG                SG   F+
Subjt:  MRTRSQFVTFKELHVLLKTEEAAIEKHMKHDDALTQPAAMFASQSTPNSSQRSNPSGNFGRGRSSGRGRNQNRGGRGFN-------------PSGRGNFN

Query:  QGR------GSFYFPQSSDGQ---GRVSCQTCQRPGHGAIDCYNRMNYHFQGRHPPTQLATMVASQNVAYCNSAPRSMHNGASSSSSTWLTGSGCNAHLT
          +      G+  F Q+  G+    R  CQ C + GH A+DCY+RM++ +QGRHPP +LA M ++ N +             + +  TWLT +G   HLT
Subjt:  QGR------GSFYFPQSSDGQ---GRVSCQTCQRPGHGAIDCYNRMNYHFQGRHPPTQLATMVASQNVAYCNSAPRSMHNGASSSSSTWLTGSGCNAHLT

Query:  SDLNNLTIASEYAGDDQVSVGSGQSLSISHNGSGSLHTSSSFSFKLNNLLY--------------------------------DKSTGKVLFQGPSINGL
        S+L NLT  + Y G DQV+VG+GQS+ I++ G+G L T   ++F+L+NLL+                                D  +GKVL++G S NGL
Subjt:  SDLNNLTIASEYAGDDQVSVGSGQSLSISHNGSGSLHTSSSFSFKLNNLLY--------------------------------DKSTGKVLFQGPSINGL

Query:  YPL-------SFIHSSTAPSCYVAHVATNKSYSLWHNRLGHPGHYVLNCVLRSLGLSTFPISSCI----------CKHCASGKMTRLPFSRSFTASAFPL
        YP+       S   SS A S   A +++   + LWH RLGHP   VL   + SL       SSCI          CKHC  GKM +LPF  S   S  PL
Subjt:  YPL-------SFIHSSTAPSCYVAHVATNKSYSLWHNRLGHPGHYVLNCVLRSLGLSTFPISSCI----------CKHCASGKMTRLPFSRSFTASAFPL

Query:  QLVHSDVWGPAPKTSVDGFNYYVSFIDDHSKFTWLYPIARKSDVPTVFQRFKPLVENLFSTRIKTLRTDGGG
        +LVHSDVWGPAP  S +G+ +Y+ F+DD+S+F+WLY + RKSDV   F+ F+  VENL S +IK LRTD GG
Subjt:  QLVHSDVWGPAPKTSVDGFNYYVSFIDDHSKFTWLYPIARKSDVPTVFQRFKPLVENLFSTRIKTLRTDGGG

A0A2N9G2I0 Uncharacterized protein1.0e-6739.22Show/hide
Query:  MRTRSQFVTFKELHVLLKTEEAAIEKHMKHDDALTQPAAMFASQSTPNSSQRSNPSGNFGRGRSSGRGRNQNRGGRGFNPSGRGNFN-------------
        +RTR++ V+F+E+ VLL+TEE ++ +       L Q  A+FAS +  N +  S  S       S GRGRN ++ GRG    GR N N             
Subjt:  MRTRSQFVTFKELHVLLKTEEAAIEKHMKHDDALTQPAAMFASQSTPNSSQRSNPSGNFGRGRSSGRGRNQNRGGRGFNPSGRGNFN-------------

Query:  ---QGRGSF-YFPQSSDGQGRVSCQTCQRPGHGAIDCYNRMNYHFQGRHPPTQLATMVASQNVAYCNSAPRSMHNGASSSSSTWLTGSGCNAHLTSDLNN
           QG+ +F    Q+     R  CQ C + GH A+DCY+RM++ +QGRHPP +LA M              S  NGA +  S WLT +G   HLT+++NN
Subjt:  ---QGRGSF-YFPQSSDGQGRVSCQTCQRPGHGAIDCYNRMNYHFQGRHPPTQLATMVASQNVAYCNSAPRSMHNGASSSSSTWLTGSGCNAHLTSDLNN

Query:  LTIASEYAGDDQVSVGSGQSLSISHNGSGSLHTSSSFSFKLNNLLY--------------------------------DKSTGKVLFQGPSINGLYPLSF
        L + + Y G+DQV+VG+GQS+ I++ G+G L T   ++F+L +LL+                                D  +GKVL++G S NGLYP+  
Subjt:  LTIASEYAGDDQVSVGSGQSLSISHNGSGSLHTSSSFSFKLNNLLY--------------------------------DKSTGKVLFQGPSINGLYPLSF

Query:  IHSS----TAPSCYVAHVATNKSYSLWHNRLGHPGHYVLNCVLRSLGLSTFPISSCI----------CKHCASGKMTRLPFSRSFTASAFPLQLVHSDVW
         H S    TA    +A +++   + LWH+RLGHP   VL   L SL       SSCI          CKHC  GKM +LPF  S   S  PL+L+HSDVW
Subjt:  IHSS----TAPSCYVAHVATNKSYSLWHNRLGHPGHYVLNCVLRSLGLSTFPISSCI----------CKHCASGKMTRLPFSRSFTASAFPLQLVHSDVW

Query:  GPAPKTSVDGFNYYVSFIDDHSKFTWLYPIARKSDVPTVFQRFKPLVENLFSTRIKTLRTDGGG
        GPAP TS +G+ YY+ F+DD+++F+WLY +  KSDV + F+ FK  VEN  S +IK LRTD GG
Subjt:  GPAPKTSVDGFNYYVSFIDDHSKFTWLYPIARKSDVPTVFQRFKPLVENLFSTRIKTLRTDGGG

A0A2N9GCR2 Uncharacterized protein1.8e-6741.1Show/hide
Query:  MRTRSQFVTFKELHVLLKTEEAAIEKHMKHDDALTQPAAMFASQSTPNSSQRSNPSGNFGRGRSSGRGRNQNRGGRGFNPSGRGNFN-------------
        +RTR++ V+F+E+ VLL+TEE ++ +       L Q  A+FAS +  N +  S  S       S GRGRN ++ GRG    GR N N             
Subjt:  MRTRSQFVTFKELHVLLKTEEAAIEKHMKHDDALTQPAAMFASQSTPNSSQRSNPSGNFGRGRSSGRGRNQNRGGRGFNPSGRGNFN-------------

Query:  ---QGRGSF-YFPQSSDGQGRVSCQTCQRPGHGAIDCYNRMNYHFQGRHPPTQLATMVASQNVAYCNSAPRSMHNGASSSSSTWLTGSGCNAHLTSDLNN
           QG+ +F    Q+     R  CQ C + GH A+DCY+RM++ +QGRHPP +LA M              S  NGA +  S WLT +G   HLT+++NN
Subjt:  ---QGRGSF-YFPQSSDGQGRVSCQTCQRPGHGAIDCYNRMNYHFQGRHPPTQLATMVASQNVAYCNSAPRSMHNGASSSSSTWLTGSGCNAHLTSDLNN

Query:  LTIASEYAGDDQVSVGSGQSLSISHNGSGSLH-----TSSSFSFKLNN-LLYDKSTGKVLFQGPSINGLYPLSFIHSS----TAPSCYVAHVATNKSYSL
        L + + Y G+DQV+VG+GQS+ I  N  G++H      +    F  N  L+ D  +GKVL++G S NGLYP+   H S    TA     A +++   + L
Subjt:  LTIASEYAGDDQVSVGSGQSLSISHNGSGSLH-----TSSSFSFKLNN-LLYDKSTGKVLFQGPSINGLYPLSFIHSS----TAPSCYVAHVATNKSYSL

Query:  WHNRLGHPGHYVLNCVLRSLGLSTFPISSCI----------CKHCASGKMTRLPFSRSFTASAFPLQLVHSDVWGPAPKTSVDGFNYYVSFIDDHSKFTW
        WH+RLGHP   VL   L SL       SSCI          CKHC  GKM +LPF  S   S  PL+L+HSDVWGPAP TS +G+ YY+ F+DD+++F+W
Subjt:  WHNRLGHPGHYVLNCVLRSLGLSTFPISSCI----------CKHCASGKMTRLPFSRSFTASAFPLQLVHSDVWGPAPKTSVDGFNYYVSFIDDHSKFTW

Query:  LYPIARKSDVPTVFQRFKPLVENLFSTRIKTLRTDGGG
        LY +  KSDV + F+ FK  VEN  S +IK LRTD GG
Subjt:  LYPIARKSDVPTVFQRFKPLVENLFSTRIKTLRTDGGG

A0A2N9I8F3 Uncharacterized protein2.9e-7041.1Show/hide
Query:  MRTRSQFVTFKELHVLLKTEE-AAIEKHMKHDDALTQPAAMFASQSTPNSSQRSNPSGNFGRG-RSSGRGRNQNRGGRG--FNPSGRGNFNQ-GRGSFYF
        +RTR++ VTF+E+ VLL+TEE +A E      D  + P AMFA  S PN+   ++ S  +G   +  GRGRN ++ GRG  F  S +  F+Q  +G+  F
Subjt:  MRTRSQFVTFKELHVLLKTEE-AAIEKHMKHDDALTQPAAMFASQSTPNSSQRSNPSGNFGRG-RSSGRGRNQNRGGRG--FNPSGRGNFNQ-GRGSFYF

Query:  PQSSDGQGRVSCQTCQRPGHGAIDCYNRMNYHFQGRHPPTQLATMVASQNVAYCNSAPRSMHNGASSSSSTWLTGSGCNAHLTSDLNNLTIASEYAGDDQ
        PQ  +G  R  CQ C + GH A+DCY+RM++ +QGRHPP +LA M ++ N               S    TWLT +G   HLT++L NL  A+ Y G +Q
Subjt:  PQSSDGQGRVSCQTCQRPGHGAIDCYNRMNYHFQGRHPPTQLATMVASQNVAYCNSAPRSMHNGASSSSSTWLTGSGCNAHLTSDLNNLTIASEYAGDDQ

Query:  VSVGSGQSLSISHNGSGSLHTSSSFSFKLNNLLY--------------------------------DKSTGKVLFQGPSINGLYPL-------SFIHSST
        VSVG+GQS+ I+H G+G L T  +++F+L NLL+                                D  +GKVL++G S NGLYP+       S   S+T
Subjt:  VSVGSGQSLSISHNGSGSLHTSSSFSFKLNNLLY--------------------------------DKSTGKVLFQGPSINGLYPL-------SFIHSST

Query:  APSCYVAHVATNKSYSLWHNRLGHPGHYVLNCVLRSLGLSTFPISSCI----------CKHCASGKMTRLPFSRSFTASAFPLQLVHSDVWGPAPKTSVD
        A     A +++   + LWH+RLGHP   VL   + SL       SSC+          CKHC  GKM RLPF  S   S  PL+LVHSDVWGPAP  S +
Subjt:  APSCYVAHVATNKSYSLWHNRLGHPGHYVLNCVLRSLGLSTFPISSCI----------CKHCASGKMTRLPFSRSFTASAFPLQLVHSDVWGPAPKTSVD

Query:  GFNYYVSFIDDHSKFTWLYPIARKSDVPTVFQRFKPLVENLFSTRIKTLRTDGGG
        G+ YY+ F+DD SKF+WL+ +  KS+V   F+ FK  VEN  S  IK+LRTD GG
Subjt:  GFNYYVSFIDDHSKFTWLYPIARKSDVPTVFQRFKPLVENLFSTRIKTLRTDGGG

SwissProt top hitse value%identityAlignment
P04146 Copia protein1.1e-1333.78Show/hide
Query:  YVAHVATNKSYSLWHNRLGHPGHYVL------NCVLRSLGLSTFPISSCICKHCASGKMTRLPFS--RSFTASAFPLQLVHSDVWGPAPKTSVDGFNYYV
        Y  +     ++ LWH R GH     L      N       L+   +S  IC+ C +GK  RLPF   +  T    PL +VHSDV GP    ++D  NY+V
Subjt:  YVAHVATNKSYSLWHNRLGHPGHYVL------NCVLRSLGLSTFPISSCICKHCASGKMTRLPFS--RSFTASAFPLQLVHSDVWGPAPKTSVDGFNYYV

Query:  SFIDDHSKFTWLYPIARKSDVPTVFQRFKPLVENLFSTRIKTLRTDGG
         F+D  + +   Y I  KSDV ++FQ F    E  F+ ++  L  D G
Subjt:  SFIDDHSKFTWLYPIARKSDVPTVFQRFKPLVENLFSTRIKTLRTDGG

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-941.3e-1426.03Show/hide
Query:  NSSQRSNPSGNFGRGRSSGRGRNQNRGGRGFNPSGRGNFNQGRGSFYFPQSSDGQGRV-SCQTCQRPGHGAIDCYN--RMNYHFQGRHPPTQLATMVASQ
        N   R  P        + GRGR+  R    +  SG    ++ R           + RV +C  C +PGH   DC N  +      G+      A MV + 
Subjt:  NSSQRSNPSGNFGRGRSSGRGRNQNRGGRGFNPSGRGNFNQGRGSFYFPQSSDGQGRV-SCQTCQRPGHGAIDCYN--RMNYHFQGRHPPTQLATMVASQ

Query:  N--VAYCNSAPRSMHNGASSSSSTWLTGSGCNAHLT--SDLNNLTIASEYAGDDQVSVGSGQSLSISHNGSGSLHTSSSFSFKLNNLLYDKSTGKVLFQG
        +  V + N     MH   S   S W+  +  + H T   DL    +A ++     V +G+     I+  G   + T+   +  L ++ +       L  G
Subjt:  N--VAYCNSAPRSMHNGASSSSSTWLTGSGCNAHLT--SDLNNLTIASEYAGDDQVSVGSGQSLSISHNGSGSLHTSSSFSFKLNNLLYDKSTGKVLFQG

Query:  PSI----------NGLYPLS-------------FIHSSTAPSCYVAHVATNK--SYSLWHNRLGHPGHYVLNCVLRSLGLSTFPISSCI--CKHCASGKM
         ++          N  + L+              ++ + A  C     A     S  LWH R+GH     L  +L    L ++   + +  C +C  GK 
Subjt:  PSI----------NGLYPLS-------------FIHSSTAPSCYVAHVATNK--SYSLWHNRLGHPGHYVLNCVLRSLGLSTFPISSCI--CKHCASGKM

Query:  TRLPFSRSFTASAFPLQLVHSDVWGPAPKTSVDGFNYYVSFIDDHSKFTWLYPIARKSDVPTVFQRFKPLVENLFSTRIKTLRTDGGG
         R+ F  S       L LV+SDV GP    S+ G  Y+V+FIDD S+  W+Y +  K  V  VFQ+F  LVE     ++K LR+D GG
Subjt:  TRLPFSRSFTASAFPLQLVHSDVWGPAPKTSVDGFNYYVSFIDDHSKFTWLYPIARKSDVPTVFQRFKPLVENLFSTRIKTLRTDGGG

Q12491 Transposon Ty2-B Gag-Pol polyprotein8.1e-0927.03Show/hide
Query:  YSLWHNRLGHPGHYVL------NCV--LRSLGLSTFPISSCICKHCASGKMT--------RLPFSRSFTASAFPLQLVHSDVWGPAPKTSVDGFNYYVSF
        Y L H  LGH     +      N V  L+   +     S+  C  C  GK T        RL +  S+     P Q +H+D++GP         +Y++SF
Subjt:  YSLWHNRLGHPGHYVL------NCV--LRSLGLSTFPISSCICKHCASGKMT--------RLPFSRSFTASAFPLQLVHSDVWGPAPKTSVDGFNYYVSF

Query:  IDDHSKFTWLYPI--ARKSDVPTVFQRFKPLVENLFSTRIKTLRTDGG
         D+ ++F W+YP+   R+  +  VF      ++N F+ R+  ++ D G
Subjt:  IDDHSKFTWLYPI--ARKSDVPTVFQRFKPLVENLFSTRIKTLRTDGG

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE13.4e-3932.28Show/hide
Query:  RSSGRGRNQNRGGRGFNPSGRGNFN-----QGRGSFYFPQSSDGQGRV-SCQTCQRPGHGAIDCYNRMNY--HFQGRHPPTQLATMVASQNVAYCNSAPR
        R++    N N G R      R N N     Q   + + P ++  +  +  CQ C   GH A  C    ++      + PP+         N+A       
Subjt:  RSSGRGRNQNRGGRGFNPSGRGNFN-----QGRGSFYFPQSSDGQGRV-SCQTCQRPGHGAIDCYNRMNY--HFQGRHPPTQLATMVASQNVAYCNSAPR

Query:  SMHNGASSSSSTWLTGSGCNAHLTSDLNNLTIASEYAGDDQVSVGSGQSLSISHNGSGSLHTSSSFSFKLNNLLY-------------------------
            G+  SS+ WL  SG   H+TSD NNL++   Y G D V V  G ++ ISH GS SL T S     L+N+LY                         
Subjt:  SMHNGASSSSSTWLTGSGCNAHLTSDLNNLTIASEYAGDDQVSVGSGQSLSISHNGSGSLHTSSSFSFKLNNLLY-------------------------

Query:  -------DKSTGKVLFQGPSINGLYPLSFIHSSTAPSCYVAHVATNKSYSLWHNRLGHPGHYVLNCVLRSLGLSTFPISS--CICKHCASGKMTRLPFSR
               D +TG  L QG + + LY      +S+ P    A  ++  ++S WH RLGHP   +LN V+ +  LS    S     C  C   K  ++PFS+
Subjt:  -------DKSTGKVLFQGPSINGLYPLSFIHSSTAPSCYVAHVATNKSYSLWHNRLGHPGHYVLNCVLRSLGLSTFPISS--CICKHCASGKMTRLPFSR

Query:  SFTASAFPLQLVHSDVWGPAPKTSVDGFNYYVSFIDDHSKFTWLYPIARKSDVPTVFQRFKPLVENLFSTRIKTLRTDGGG
        S   S  PL+ ++SDVW  +P  S D + YYV F+D  +++TWLYP+ +KS V   F  FK L+EN F TRI T  +D GG
Subjt:  SFTASAFPLQLVHSDVWGPAPKTSVDGFNYYVSFIDDHSKFTWLYPIARKSDVPTVFQRFKPLVENLFSTRIKTLRTDGGG

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE28.3e-3834.17Show/hide
Query:  NSSQRSNPSGNFGRGRSSGRGRNQNRGGRGFNPSGRGNFNQGRGSFYFPQSSDGQ----------GRVSCQTCQRPGHGAIDCYNRMNYHFQGRHPPTQL
        NS++    + N    R++   RNQN  G   N     N N  R + + P SS  +          GR  CQ C   GH A  C     + FQ      Q 
Subjt:  NSSQRSNPSGNFGRGRSSGRGRNQNRGGRGFNPSGRGNFNQGRGSFYFPQSSDGQ----------GRVSCQTCQRPGHGAIDCYNRMNYHFQGRHPPTQL

Query:  ATMVASQNVAYCNSAPRSMHNGASSSSSTWLTGSGCNAHLTSDLNNLTIASEYAGDDQVSVGSGQSLSISHNGSGSLHTSSSFSFKLNNLLY--------
        +T   +      N A  S +N     ++ WL  SG   H+TSD NNL+    Y G D V +  G ++ I+H GS SL TSS  S  LN +LY        
Subjt:  ATMVASQNVAYCNSAPRSMHNGASSSSSTWLTGSGCNAHLTSDLNNLTIASEYAGDDQVSVGSGQSLSISHNGSGSLHTSSSFSFKLNNLLY--------

Query:  ------------------------DKSTGKVLFQGPSINGLYPLSFIHSSTAPSCYVAHVATNKSYSLWHNRLGHPGHYVLNCVLRSLGLSTFPISSCI-
                                D +TG  L QG + + LY    I SS A S + A   +  ++S WH+RLGHP   +LN V+ +  L     S  + 
Subjt:  ------------------------DKSTGKVLFQGPSINGLYPLSFIHSSTAPSCYVAHVATNKSYSLWHNRLGHPGHYVLNCVLRSLGLSTFPISSCI-

Query:  -CKHCASGKMTRLPFSRSFTASAFPLQLVHSDVWGPAPKTSVDGFNYYVSFIDDHSKFTWLYPIARKSDVPTVFQRFKPLVENLFSTRIKTLRTDGGG
         C  C   K  ++PFS S   S+ PL+ ++SDVW  +P  S+D + YYV F+D  +++TWLYP+ +KS V   F  FK LVEN F TRI TL +D GG
Subjt:  -CKHCASGKMTRLPFSRSFTASAFPLQLVHSDVWGPAPKTSVDGFNYYVSFIDDHSKFTWLYPIARKSDVPTVFQRFKPLVENLFSTRIKTLRTDGGG

Arabidopsis top hitse value%identityAlignment
ATMG00300.1 Gag-Pol-related retrotransposon family protein4.2e-0535.82Show/hide
Query:  LWHNRLGHPGHYVLNCVLRSLGLSTFPISSC-ICKHCASGKMTRLPFSRSFTASAFPLQLVHSDVWG
        LWH+RL H     +  +++   L +  +SS   C+ C  GK  R+ FS     +  PL  VHSD+WG
Subjt:  LWHNRLGHPGHYVLNCVLRSLGLSTFPISSC-ICKHCASGKMTRLPFSRSFTASAFPLQLVHSDVWG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGAACCCGTTCACAGTTTGTTACTTTCAAGGAGCTACATGTTTTATTGAAGACCGAAGAAGCTGCCATTGAAAAACATATGAAACATGATGATGCCCTAACTCAACC
AGCAGCTATGTTTGCATCGCAATCAACTCCTAACTCTTCTCAACGTTCAAATCCGTCTGGAAATTTTGGTAGAGGAAGATCATCTGGTCGTGGGCGAAATCAAAATCGTG
GTGGTCGTGGCTTCAATCCATCTGGGCGAGGAAATTTCAATCAAGGTAGAGGATCCTTTTATTTTCCACAATCATCTGATGGACAGGGTCGTGTCTCCTGTCAAACCTGT
CAACGCCCTGGACATGGTGCCATCGATTGCTACAATAGAATGAATTACCATTTCCAAGGGCGTCATCCACCCACACAATTGGCTACCATGGTTGCTTCTCAAAATGTAGC
CTACTGTAACTCAGCGCCTAGAAGTATGCATAATGGTGCTTCTAGTTCATCTTCTACTTGGTTAACCGGCTCAGGTTGTAATGCCCATCTTACTTCAGATCTCAATAATT
TAACTATTGCTTCAGAGTATGCAGGTGATGATCAAGTCTCAGTAGGCAGTGGACAATCCCTCTCAATCTCTCATAATGGTTCAGGTTCCCTTCATACTTCCTCCTCTTTT
TCCTTTAAGCTTAATAATCTCCTTTATGACAAATCTACGGGCAAGGTTTTGTTCCAAGGTCCTAGTATCAATGGACTCTATCCTTTATCATTCATTCATTCATCTACTGC
TCCATCATGTTACGTTGCTCATGTTGCTACAAATAAATCTTATTCTCTATGGCATAATCGTTTAGGACATCCTGGCCACTATGTTCTTAATTGTGTTCTTCGTTCTTTAG
GTTTATCTACTTTTCCTATTTCTTCTTGCATATGTAAGCATTGTGCTAGTGGAAAAATGACTAGGCTCCCTTTTTCTCGTTCTTTTACTGCCTCTGCTTTTCCTTTACAG
CTTGTACACAGTGATGTTTGGGGTCCTGCTCCTAAAACTTCTGTTGATGGCTTTAATTATTATGTCTCTTTTATTGATGACCACTCTAAGTTTACTTGGTTGTATCCCAT
TGCTCGCAAGTCTGATGTCCCTACTGTTTTTCAACGCTTCAAACCTCTTGTTGAGAATTTATTTTCCACTCGAATTAAAACACTTCGAACAGACGGTGGGGGTGTGTTGG
GTTTTATGCCCTAA
mRNA sequenceShow/hide mRNA sequence
ATGAGAACCCGTTCACAGTTTGTTACTTTCAAGGAGCTACATGTTTTATTGAAGACCGAAGAAGCTGCCATTGAAAAACATATGAAACATGATGATGCCCTAACTCAACC
AGCAGCTATGTTTGCATCGCAATCAACTCCTAACTCTTCTCAACGTTCAAATCCGTCTGGAAATTTTGGTAGAGGAAGATCATCTGGTCGTGGGCGAAATCAAAATCGTG
GTGGTCGTGGCTTCAATCCATCTGGGCGAGGAAATTTCAATCAAGGTAGAGGATCCTTTTATTTTCCACAATCATCTGATGGACAGGGTCGTGTCTCCTGTCAAACCTGT
CAACGCCCTGGACATGGTGCCATCGATTGCTACAATAGAATGAATTACCATTTCCAAGGGCGTCATCCACCCACACAATTGGCTACCATGGTTGCTTCTCAAAATGTAGC
CTACTGTAACTCAGCGCCTAGAAGTATGCATAATGGTGCTTCTAGTTCATCTTCTACTTGGTTAACCGGCTCAGGTTGTAATGCCCATCTTACTTCAGATCTCAATAATT
TAACTATTGCTTCAGAGTATGCAGGTGATGATCAAGTCTCAGTAGGCAGTGGACAATCCCTCTCAATCTCTCATAATGGTTCAGGTTCCCTTCATACTTCCTCCTCTTTT
TCCTTTAAGCTTAATAATCTCCTTTATGACAAATCTACGGGCAAGGTTTTGTTCCAAGGTCCTAGTATCAATGGACTCTATCCTTTATCATTCATTCATTCATCTACTGC
TCCATCATGTTACGTTGCTCATGTTGCTACAAATAAATCTTATTCTCTATGGCATAATCGTTTAGGACATCCTGGCCACTATGTTCTTAATTGTGTTCTTCGTTCTTTAG
GTTTATCTACTTTTCCTATTTCTTCTTGCATATGTAAGCATTGTGCTAGTGGAAAAATGACTAGGCTCCCTTTTTCTCGTTCTTTTACTGCCTCTGCTTTTCCTTTACAG
CTTGTACACAGTGATGTTTGGGGTCCTGCTCCTAAAACTTCTGTTGATGGCTTTAATTATTATGTCTCTTTTATTGATGACCACTCTAAGTTTACTTGGTTGTATCCCAT
TGCTCGCAAGTCTGATGTCCCTACTGTTTTTCAACGCTTCAAACCTCTTGTTGAGAATTTATTTTCCACTCGAATTAAAACACTTCGAACAGACGGTGGGGGTGTGTTGG
GTTTTATGCCCTAA
Protein sequenceShow/hide protein sequence
MRTRSQFVTFKELHVLLKTEEAAIEKHMKHDDALTQPAAMFASQSTPNSSQRSNPSGNFGRGRSSGRGRNQNRGGRGFNPSGRGNFNQGRGSFYFPQSSDGQGRVSCQTC
QRPGHGAIDCYNRMNYHFQGRHPPTQLATMVASQNVAYCNSAPRSMHNGASSSSSTWLTGSGCNAHLTSDLNNLTIASEYAGDDQVSVGSGQSLSISHNGSGSLHTSSSF
SFKLNNLLYDKSTGKVLFQGPSINGLYPLSFIHSSTAPSCYVAHVATNKSYSLWHNRLGHPGHYVLNCVLRSLGLSTFPISSCICKHCASGKMTRLPFSRSFTASAFPLQ
LVHSDVWGPAPKTSVDGFNYYVSFIDDHSKFTWLYPIARKSDVPTVFQRFKPLVENLFSTRIKTLRTDGGGVLGFMP