; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0005901 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0005901
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionATP-dependent helicase ATRX
Genome locationchr6:33480161..33491036
RNA-Seq ExpressionLag0005901
SyntenyLag0005901
Gene Ontology termsGO:0000781 - chromosome, telomeric region (cellular component)
GO:0005634 - nucleus (cellular component)
GO:0005524 - ATP binding (molecular function)
GO:0016887 - ATPase activity (molecular function)
GO:0046872 - metal ion binding (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR043502 - DNA/RNA polymerase superfamily
IPR044574 - ATPase ARIP4-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7020999.1 Protein CHROMATIN REMODELING 20 [Cucurbita argyrosperma subsp. argyrosperma]4.2e-9977.13Show/hide
Query:  AAEAQEALEKESLAKVENEVREELTLTLNGDDLETTVSNEMATFIEEWEGVLDELEIESAHLLEQLDGAGIELPSLYKLIESQASNGCYTEAWKKRIHWV
        AAEAQEALEKESL+KVENEVREEL LTLNGDDLET V+NEM   +EEWEGVLDELEIESA LLEQLDGAG+ELPSL+K IESQAS GCYTEAWKKRIHWV
Subjt:  AAEAQEALEKESLAKVENEVREELTLTLNGDDLETTVSNEMATFIEEWEGVLDELEIESAHLLEQLDGAGIELPSLYKLIESQASNGCYTEAWKKRIHWV

Query:  GSQVTGDLLASVSDAEKTLQTQRPVRRRHGKLLEEGASGYLQKKFSTHEIEG---ENLEVDWCSLNKVFSEGSKDNDTLFGSKNWASVYLASTPQQAAEM
        GSQVTGDLLASVSDAEKTLQTQRPVRRRHGKLLEEGASGYLQKKFSTHE+EG   ENLEVDWCSLNKVFSEGS +N+TLFGSKNWAS+YLASTPQQAAEM
Subjt:  GSQVTGDLLASVSDAEKTLQTQRPVRRRHGKLLEEGASGYLQKKFSTHEIEG---ENLEVDWCSLNKVFSEGSKDNDTLFGSKNWASVYLASTPQQAAEM

Query:  GLNFPGVDEALR----SPNAPRPFVDGLVWNPISESDKSFLDSSFSLEEIRKVVFESD
        GL FPGVDE         N+  PFV   + N     +K    S       RKV  E D
Subjt:  GLNFPGVDEALR----SPNAPRPFVDGLVWNPISESDKSFLDSSFSLEEIRKVVFESD

XP_008441784.1 PREDICTED: protein CHROMATIN REMODELING 20 isoform X2 [Cucumis melo]3.2e-9978.74Show/hide
Query:  AAEAQEALEKESLAKVENEVREELTLTLNGDDLETTVSNEMATFIEEWEGVLDELEIESAHLLEQLDGAGIELPSLYKLIESQASNGCYTEAWKKRIHWV
        AAEAQEALEKESLAKVE EVREEL LTLNGDDLET ++NEMATF+EEWE VLDELEIESAHLLEQLDGAGIELPSLYKLIESQASNGC+TEAWKKRIHWV
Subjt:  AAEAQEALEKESLAKVENEVREELTLTLNGDDLETTVSNEMATFIEEWEGVLDELEIESAHLLEQLDGAGIELPSLYKLIESQASNGCYTEAWKKRIHWV

Query:  GSQVTGDLLASVSDAEKTLQTQRPVRRRHGKLLEEGASGYLQKKFSTHEIEG---ENLEVDWCSLNKVFSEGSKDNDTLFGSKNWASVYLASTPQQAAEM
        GSQVTGDLLASVSDAEKTLQ +RPVRRRHGKLLEEGASGYLQKKFST++IEG   E LEVDW SLNKVFSEGSKD+D LFGSKNWASVYLASTPQQAAEM
Subjt:  GSQVTGDLLASVSDAEKTLQTQRPVRRRHGKLLEEGASGYLQKKFSTHEIEG---ENLEVDWCSLNKVFSEGSKDNDTLFGSKNWASVYLASTPQQAAEM

Query:  GLNFPGVDEALRSPNAPRPFVDGLVWNPISESDKSFLDSSFSLEEIRKVVFESD
        GL FPGVDE     +      D  V   I E++K    S    ++ RKV  E D
Subjt:  GLNFPGVDEALRSPNAPRPFVDGLVWNPISESDKSFLDSSFSLEEIRKVVFESD

XP_011649017.1 protein CHROMATIN REMODELING 20 isoform X1 [Cucumis sativus]1.4e-9979.13Show/hide
Query:  AAEAQEALEKESLAKVENEVREELTLTLNGDDLETTVSNEMATFIEEWEGVLDELEIESAHLLEQLDGAGIELPSLYKLIESQASNGCYTEAWKKRIHWV
        AAEAQEALEKESLAKVE EVREEL LTLNGDDLET ++NEMA F+EEWE VLDELEIESAHLLEQLDGAGIELPSLYKLIESQASNGC+TEAWKKRIHWV
Subjt:  AAEAQEALEKESLAKVENEVREELTLTLNGDDLETTVSNEMATFIEEWEGVLDELEIESAHLLEQLDGAGIELPSLYKLIESQASNGCYTEAWKKRIHWV

Query:  GSQVTGDLLASVSDAEKTLQTQRPVRRRHGKLLEEGASGYLQKKFSTHEIEG---ENLEVDWCSLNKVFSEGSKDNDTLFGSKNWASVYLASTPQQAAEM
        GSQVTGDLLASVSDAEKTLQ +RPV RRHGKLLEEGASGYLQKKFSTHEIEG   E LEVDW SLNKVFSEGSKD+DTLFGSKNWASVYLASTPQQAAEM
Subjt:  GSQVTGDLLASVSDAEKTLQTQRPVRRRHGKLLEEGASGYLQKKFSTHEIEG---ENLEVDWCSLNKVFSEGSKDNDTLFGSKNWASVYLASTPQQAAEM

Query:  GLNFPGVDEALRSPNAPRPFVDGLVWNPISESDKSFLDSSFSLEEIRKVVFESD
        GL FPGVDE     +      D  V   I E++K    S    +  RKV  E D
Subjt:  GLNFPGVDEALRSPNAPRPFVDGLVWNPISESDKSFLDSSFSLEEIRKVVFESD

XP_022965425.1 protein CHROMATIN REMODELING 20 isoform X2 [Cucurbita maxima]8.4e-10077.52Show/hide
Query:  AAEAQEALEKESLAKVENEVREELTLTLNGDDLETTVSNEMATFIEEWEGVLDELEIESAHLLEQLDGAGIELPSLYKLIESQASNGCYTEAWKKRIHWV
        AAEAQEALEKESL+KVENEVREEL LTLNGDDLET V+NEMA  +EEWEGVLDELEIESA LLEQLDGAG+ELPSL+K IESQAS GCYTEAWKKRIHWV
Subjt:  AAEAQEALEKESLAKVENEVREELTLTLNGDDLETTVSNEMATFIEEWEGVLDELEIESAHLLEQLDGAGIELPSLYKLIESQASNGCYTEAWKKRIHWV

Query:  GSQVTGDLLASVSDAEKTLQTQRPVRRRHGKLLEEGASGYLQKKFSTHEIEG---ENLEVDWCSLNKVFSEGSKDNDTLFGSKNWASVYLASTPQQAAEM
        GSQVTGDLLASVSDAEKTLQTQRPVRRRHGKLLEEGASGYLQKKFSTHE+EG   ENLEVDWCSLNKVFSEGS +N+TLFGSKNWAS+YLASTPQQAAEM
Subjt:  GSQVTGDLLASVSDAEKTLQTQRPVRRRHGKLLEEGASGYLQKKFSTHEIEG---ENLEVDWCSLNKVFSEGSKDNDTLFGSKNWASVYLASTPQQAAEM

Query:  GLNFPGVDEALR----SPNAPRPFVDGLVWNPISESDKSFLDSSFSLEEIRKVVFESD
        GL FPGVDE         N+  PFV   + N     +K    S       RKV  E D
Subjt:  GLNFPGVDEALR----SPNAPRPFVDGLVWNPISESDKSFLDSSFSLEEIRKVVFESD

XP_038889289.1 protein CHROMATIN REMODELING 20 [Benincasa hispida]3.2e-9977.69Show/hide
Query:  AAEAQEALEKESLAKVENEVREELTLTLNGDDLETTVSNEMATFIEEWEGVLDELEIESAHL--LEQLDGAGIELPSLYKLIESQASNGCYTEAWKKRIH
        AAEAQEALEKESLAKVENEVREEL L LNGDDLET ++NEMATFIEEWE VLDELEIESAH    EQLDGAGIELPSLYKLIESQASNGC+TEAWKKRIH
Subjt:  AAEAQEALEKESLAKVENEVREELTLTLNGDDLETTVSNEMATFIEEWEGVLDELEIESAHL--LEQLDGAGIELPSLYKLIESQASNGCYTEAWKKRIH

Query:  WVGSQVTGDLLASVSDAEKTLQTQRPVRRRHGKLLEEGASGYLQKKFSTHEIEG---ENLEVDWCSLNKVFSEGSKDNDTLFGSKNWASVYLASTPQQAA
        WVGSQ TGDLLASVSDAEKTLQ +RPVRRRHGKLLEEGASGYLQ+K STHEIEG   EN EVDWCSLNKVFSEGSKDNDTLFGSKNWASVYLASTPQQAA
Subjt:  WVGSQVTGDLLASVSDAEKTLQTQRPVRRRHGKLLEEGASGYLQKKFSTHEIEG---ENLEVDWCSLNKVFSEGSKDNDTLFGSKNWASVYLASTPQQAA

Query:  EMGLNFPGVDEALR----SPNAPRPFVDGLVWNPISESDKSFLDSSFSLEEIRKVVFESD
        EMGL FPGVDE         N+  PFV   +     E++K  + S    +  RKV  E D
Subjt:  EMGLNFPGVDEALR----SPNAPRPFVDGLVWNPISESDKSFLDSSFSLEEIRKVVFESD

TrEMBL top hitse value%identityAlignment
A0A0A0LMU1 PHD-type domain-containing protein6.9e-10079.13Show/hide
Query:  AAEAQEALEKESLAKVENEVREELTLTLNGDDLETTVSNEMATFIEEWEGVLDELEIESAHLLEQLDGAGIELPSLYKLIESQASNGCYTEAWKKRIHWV
        AAEAQEALEKESLAKVE EVREEL LTLNGDDLET ++NEMA F+EEWE VLDELEIESAHLLEQLDGAGIELPSLYKLIESQASNGC+TEAWKKRIHWV
Subjt:  AAEAQEALEKESLAKVENEVREELTLTLNGDDLETTVSNEMATFIEEWEGVLDELEIESAHLLEQLDGAGIELPSLYKLIESQASNGCYTEAWKKRIHWV

Query:  GSQVTGDLLASVSDAEKTLQTQRPVRRRHGKLLEEGASGYLQKKFSTHEIEG---ENLEVDWCSLNKVFSEGSKDNDTLFGSKNWASVYLASTPQQAAEM
        GSQVTGDLLASVSDAEKTLQ +RPV RRHGKLLEEGASGYLQKKFSTHEIEG   E LEVDW SLNKVFSEGSKD+DTLFGSKNWASVYLASTPQQAAEM
Subjt:  GSQVTGDLLASVSDAEKTLQTQRPVRRRHGKLLEEGASGYLQKKFSTHEIEG---ENLEVDWCSLNKVFSEGSKDNDTLFGSKNWASVYLASTPQQAAEM

Query:  GLNFPGVDEALRSPNAPRPFVDGLVWNPISESDKSFLDSSFSLEEIRKVVFESD
        GL FPGVDE     +      D  V   I E++K    S    +  RKV  E D
Subjt:  GLNFPGVDEALRSPNAPRPFVDGLVWNPISESDKSFLDSSFSLEEIRKVVFESD

A0A1S3B484 ATP-dependent helicase ATRX1.5e-9978.74Show/hide
Query:  AAEAQEALEKESLAKVENEVREELTLTLNGDDLETTVSNEMATFIEEWEGVLDELEIESAHLLEQLDGAGIELPSLYKLIESQASNGCYTEAWKKRIHWV
        AAEAQEALEKESLAKVE EVREEL LTLNGDDLET ++NEMATF+EEWE VLDELEIESAHLLEQLDGAGIELPSLYKLIESQASNGC+TEAWKKRIHWV
Subjt:  AAEAQEALEKESLAKVENEVREELTLTLNGDDLETTVSNEMATFIEEWEGVLDELEIESAHLLEQLDGAGIELPSLYKLIESQASNGCYTEAWKKRIHWV

Query:  GSQVTGDLLASVSDAEKTLQTQRPVRRRHGKLLEEGASGYLQKKFSTHEIEG---ENLEVDWCSLNKVFSEGSKDNDTLFGSKNWASVYLASTPQQAAEM
        GSQVTGDLLASVSDAEKTLQ +RPVRRRHGKLLEEGASGYLQKKFST++IEG   E LEVDW SLNKVFSEGSKD+D LFGSKNWASVYLASTPQQAAEM
Subjt:  GSQVTGDLLASVSDAEKTLQTQRPVRRRHGKLLEEGASGYLQKKFSTHEIEG---ENLEVDWCSLNKVFSEGSKDNDTLFGSKNWASVYLASTPQQAAEM

Query:  GLNFPGVDEALRSPNAPRPFVDGLVWNPISESDKSFLDSSFSLEEIRKVVFESD
        GL FPGVDE     +      D  V   I E++K    S    ++ RKV  E D
Subjt:  GLNFPGVDEALRSPNAPRPFVDGLVWNPISESDKSFLDSSFSLEEIRKVVFESD

A0A1S3B4Z9 ATP-dependent helicase ATRX7.6e-9976.34Show/hide
Query:  SSLWYKITAAEAQEALEKESLAKVENEVREELTLTLNGDDLETTVSNEMATFIEEWEGVLDELEIESAHLLEQLDGAGIELPSLYKLIESQASNGCYTEA
        S + Y     EAQEALEKESLAKVE EVREEL LTLNGDDLET ++NEMATF+EEWE VLDELEIESAHLLEQLDGAGIELPSLYKLIESQASNGC+TEA
Subjt:  SSLWYKITAAEAQEALEKESLAKVENEVREELTLTLNGDDLETTVSNEMATFIEEWEGVLDELEIESAHLLEQLDGAGIELPSLYKLIESQASNGCYTEA

Query:  WKKRIHWVGSQVTGDLLASVSDAEKTLQTQRPVRRRHGKLLEEGASGYLQKKFSTHEIEG---ENLEVDWCSLNKVFSEGSKDNDTLFGSKNWASVYLAS
        WKKRIHWVGSQVTGDLLASVSDAEKTLQ +RPVRRRHGKLLEEGASGYLQKKFST++IEG   E LEVDW SLNKVFSEGSKD+D LFGSKNWASVYLAS
Subjt:  WKKRIHWVGSQVTGDLLASVSDAEKTLQTQRPVRRRHGKLLEEGASGYLQKKFSTHEIEG---ENLEVDWCSLNKVFSEGSKDNDTLFGSKNWASVYLAS

Query:  TPQQAAEMGLNFPGVDEALRSPNAPRPFVDGLVWNPISESDKSFLDSSFSLEEIRKVVFESD
        TPQQAAEMGL FPGVDE     +      D  V   I E++K    S    ++ RKV  E D
Subjt:  TPQQAAEMGLNFPGVDEALRSPNAPRPFVDGLVWNPISESDKSFLDSSFSLEEIRKVVFESD

A0A6J1HNN1 ATP-dependent helicase ATRX4.1e-10077.52Show/hide
Query:  AAEAQEALEKESLAKVENEVREELTLTLNGDDLETTVSNEMATFIEEWEGVLDELEIESAHLLEQLDGAGIELPSLYKLIESQASNGCYTEAWKKRIHWV
        AAEAQEALEKESL+KVENEVREEL LTLNGDDLET V+NEMA  +EEWEGVLDELEIESA LLEQLDGAG+ELPSL+K IESQAS GCYTEAWKKRIHWV
Subjt:  AAEAQEALEKESLAKVENEVREELTLTLNGDDLETTVSNEMATFIEEWEGVLDELEIESAHLLEQLDGAGIELPSLYKLIESQASNGCYTEAWKKRIHWV

Query:  GSQVTGDLLASVSDAEKTLQTQRPVRRRHGKLLEEGASGYLQKKFSTHEIEG---ENLEVDWCSLNKVFSEGSKDNDTLFGSKNWASVYLASTPQQAAEM
        GSQVTGDLLASVSDAEKTLQTQRPVRRRHGKLLEEGASGYLQKKFSTHE+EG   ENLEVDWCSLNKVFSEGS +N+TLFGSKNWAS+YLASTPQQAAEM
Subjt:  GSQVTGDLLASVSDAEKTLQTQRPVRRRHGKLLEEGASGYLQKKFSTHEIEG---ENLEVDWCSLNKVFSEGSKDNDTLFGSKNWASVYLASTPQQAAEM

Query:  GLNFPGVDEALR----SPNAPRPFVDGLVWNPISESDKSFLDSSFSLEEIRKVVFESD
        GL FPGVDE         N+  PFV   + N     +K    S       RKV  E D
Subjt:  GLNFPGVDEALR----SPNAPRPFVDGLVWNPISESDKSFLDSSFSLEEIRKVVFESD

A0A6J1HQY8 ATP-dependent helicase ATRX1.3e-9876.92Show/hide
Query:  AAEAQEALEKESLAKVENEVREELTLTLNGDDLETTVSNEMATFIEEWEGVLDELEIESAHLLEQLDGAGIELPSLYKLIESQASNGCYTEAWKKRIHWV
        AAEAQEALEKESL+KVENEVREEL LTLNGDDLET V+NEMA  +EEWEGVLDELEIESA LLEQLDGAG+ELPSL+K IESQAS GCYTEAWKKRIHWV
Subjt:  AAEAQEALEKESLAKVENEVREELTLTLNGDDLETTVSNEMATFIEEWEGVLDELEIESAHLLEQLDGAGIELPSLYKLIESQASNGCYTEAWKKRIHWV

Query:  GSQVTGDLLASVSDAEKTLQTQRPVR--RRHGKLLEEGASGYLQKKFSTHEIEG---ENLEVDWCSLNKVFSEGSKDNDTLFGSKNWASVYLASTPQQAA
        GSQVTGDLLASVSDAEKTLQTQRPVR  RRHGKLLEEGASGYLQKKFSTHE+EG   ENLEVDWCSLNKVFSEGS +N+TLFGSKNWAS+YLASTPQQAA
Subjt:  GSQVTGDLLASVSDAEKTLQTQRPVR--RRHGKLLEEGASGYLQKKFSTHEIEG---ENLEVDWCSLNKVFSEGSKDNDTLFGSKNWASVYLASTPQQAA

Query:  EMGLNFPGVDEALR----SPNAPRPFVDGLVWNPISESDKSFLDSSFSLEEIRKVVFESD
        EMGL FPGVDE         N+  PFV   + N     +K    S       RKV  E D
Subjt:  EMGLNFPGVDEALR----SPNAPRPFVDGLVWNPISESDKSFLDSSFSLEEIRKVVFESD

SwissProt top hitse value%identityAlignment
F4HW51 Protein CHROMATIN REMODELING 202.2e-7167.62Show/hide
Query:  AAEAQEALEKESLAKVENEVREELTLTLNGDDLETTVSNEMATFIEEWEGVLDELEIESAHLLEQLDGAGIELPSLYKLIESQASNGCYTEAWKKRIHWV
        AAEAQEALEKESL+KVE+EVREEL   L GD+L+  V+ EM TF +EWE  LDELE ESA LLEQLDGAGIELP LY++IESQA NGCYTEAWK+R HWV
Subjt:  AAEAQEALEKESLAKVENEVREELTLTLNGDDLETTVSNEMATFIEEWEGVLDELEIESAHLLEQLDGAGIELPSLYKLIESQASNGCYTEAWKKRIHWV

Query:  GSQVTGDLLASVSDAEKTLQTQRPVRRRHGKLLEEGASGYLQKKFSTHEIEGENL----EVDWCSLNKVFSEGSKDNDTLFGSKNWASVYLASTPQQAAE
        G+QVT + + S+++AE+ L T RPVR+RHGKLLEEGASG+L+KK +   ++ E+L    E+DW SLNKVFSE  +D    FGSK WASVYLASTP QAA 
Subjt:  GSQVTGDLLASVSDAEKTLQTQRPVRRRHGKLLEEGASGYLQKKFSTHEIEGENL----EVDWCSLNKVFSEGSKDNDTLFGSKNWASVYLASTPQQAAE

Query:  MGLNFPGVDE
        MGL FPGV+E
Subjt:  MGLNFPGVDE

O00370 LINE-1 retrotransposable element ORF2 protein5.2e-2030.7Show/hide
Query:  FVDGLVWNPISESDKSFLDSSFSLEEIRKVVFESDGNKAPGPDGFSMAFFQNNWEVVKDDLVKVFKEFFERGILDSSLNETYICLIPKKER-ANKVNDFR
        F+D      +++ +   L+   +  EI  ++      K+PGPDGF+  F+Q   E +   L+K+F+   + GIL +S  E  I LIPK  R   K  +FR
Subjt:  FVDGLVWNPISESDKSFLDSSFSLEEIRKVVFESDGNKAPGPDGFSMAFFQNNWEVVKDDLVKVFKEFFERGILDSSLNETYICLIPKKER-ANKVNDFR

Query:  PISLITSTYKIISKVLANRLKKVLPSTISETQGAFVKGRQ-ILDQALIANEALEDYRTTGREGIIFKIDFEKAYDHVDWDFLDKVLEKKGFGYKWRSWIW
        PISL+    KI++K+LANR+++ +   I   Q  F+ G Q   +     N      R   +  +I  ID EKA+D +   F+ K L K G    +   I 
Subjt:  PISLITSTYKIISKVLANRLKKVLPSTISETQGAFVKGRQ-ILDQALIANEALEDYRTTGREGIIFKIDFEKAYDHVDWDFLDKVLEKKGFGYKWRSWIW

Query:  SSVRSVQHSILINGK
        +       +I++NG+
Subjt:  SSVRSVQHSILINGK

P08548 LINE-1 reverse transcriptase homolog3.7e-1828.45Show/hide
Query:  NFPGVDEALRSPNAPRPFVDGLVWNPISESDKSFLDSSFSLEEIRKVVFESDGNKAPGPDGFSMAFFQNNWEVVKDDLVKVFKEFFERGILDSSLNETYI
        N   +D+ L + + PR          +S+ +   L+   S  EI   +      K+PGPDGF+  F+Q   E +   L+ +F+   + GIL ++  E  I
Subjt:  NFPGVDEALRSPNAPRPFVDGLVWNPISESDKSFLDSSFSLEEIRKVVFESDGNKAPGPDGFSMAFFQNNWEVVKDDLVKVFKEFFERGILDSSLNETYI

Query:  CLIPK--KERANKVNDFRPISLITSTYKIISKVLANRLKKVLPSTISETQGAFVKGRQ-ILDQALIANEALEDYRTTGREGIIFKIDFEKAYDHVDWDFL
         LIPK  K+   K N +RPISL+    KI++K+L NR+++ +   I   Q  F+ G Q   +     N      +   ++ +I  ID EKA+D++   F+
Subjt:  CLIPK--KERANKVNDFRPISLITSTYKIISKVLANRLKKVLPSTISETQGAFVKGRQ-ILDQALIANEALEDYRTTGREGIIFKIDFEKAYDHVDWDFL

Query:  DKVLEKKGFGYKWRSWIWSSVRSVQHSILING
         + L+K G    +   I +       +I++NG
Subjt:  DKVLEKKGFGYKWRSWIWSSVRSVQHSILING

P11369 LINE-1 retrotransposable element ORF2 protein4.0e-2030.59Show/hide
Query:  FVDGLVWNPISESDKSFLDSSFSLEEIRKVVFESDGNKAPGPDGFSMAFFQNNWEVVKDDLVKVFKEFFER----GILDSSLNETYICLIPKKER-ANKV
        F+D      +++     L+S  S +EI  V+      K+PGPDGFS  F+Q      K+DL+ +  + F +    G L +S  E  I LIPK ++   K+
Subjt:  FVDGLVWNPISESDKSFLDSSFSLEEIRKVVFESDGNKAPGPDGFSMAFFQNNWEVVKDDLVKVFKEFFER----GILDSSLNETYICLIPKKER-ANKV

Query:  NDFRPISLITSTYKIISKVLANRLKKVLPSTISETQGAFVKGRQ-ILDQALIANEALEDYRTTGREGIIFKIDFEKAYDHVDWDFLDKVLEKKGFGYKWR
         +FRPISL+    KI++K+LANR+++ + + I   Q  F+ G Q   +     N      +   +  +I  +D EKA+D +   F+ KVLE+ G    + 
Subjt:  NDFRPISLITSTYKIISKVLANRLKKVLPSTISETQGAFVKGRQ-ILDQALIANEALEDYRTTGREGIIFKIDFEKAYDHVDWDFLDKVLEKKGFGYKWR

Query:  SWIWSSVRSVQHSILINGK
        + I +       +I +NG+
Subjt:  SWIWSSVRSVQHSILINGK

P14381 Transposon TX1 uncharacterized 149 kDa protein9.8e-2732.57Show/hide
Query:  SPNAPRPFVDGLVWNPISESDKSFLDSSFSLEEIRKVVFESDGNKAPGPDGFSMAFFQNNWEVVKDDLVKVFKEFFERGILDSSLNETYICLIPKKERAN
        SP+A     DGL    +SE  K  L++  +L+E+ + +     NK+PG DG ++ FFQ  W+ +  D  +V  E F++G L  S     + L+PKK    
Subjt:  SPNAPRPFVDGLVWNPISESDKSFLDSSFSLEEIRKVVFESDGNKAPGPDGFSMAFFQNNWEVVKDDLVKVFKEFFERGILDSSLNETYICLIPKKERAN

Query:  KVNDFRPISLITSTYKIISKVLANRLKKVLPSTISETQGAFVKGRQILDQALIANEALEDYRTTGREGIIFKIDFEKAYDHVDWDFLDKVLEKKGFGYKW
         + ++RP+SL+++ YKI++K ++ RLK VL   I   Q   V GR I D   +  + L   R TG       +D EKA+D VD  +L   L+   FG ++
Subjt:  KVNDFRPISLITSTYKIISKVLANRLKKVLPSTISETQGAFVKGRQILDQALIANEALEDYRTTGREGIIFKIDFEKAYDHVDWDFLDKVLEKKGFGYKW

Query:  RSWIWSSVRSVQHSILIN
          ++ +   S +  + IN
Subjt:  RSWIWSSVRSVQHSILIN

Arabidopsis top hitse value%identityAlignment
AT1G08600.1 P-loop containing nucleoside triphosphate hydrolases superfamily protein1.6e-7267.62Show/hide
Query:  AAEAQEALEKESLAKVENEVREELTLTLNGDDLETTVSNEMATFIEEWEGVLDELEIESAHLLEQLDGAGIELPSLYKLIESQASNGCYTEAWKKRIHWV
        AAEAQEALEKESL+KVE+EVREEL   L GD+L+  V+ EM TF +EWE  LDELE ESA LLEQLDGAGIELP LY++IESQA NGCYTEAWK+R HWV
Subjt:  AAEAQEALEKESLAKVENEVREELTLTLNGDDLETTVSNEMATFIEEWEGVLDELEIESAHLLEQLDGAGIELPSLYKLIESQASNGCYTEAWKKRIHWV

Query:  GSQVTGDLLASVSDAEKTLQTQRPVRRRHGKLLEEGASGYLQKKFSTHEIEGENL----EVDWCSLNKVFSEGSKDNDTLFGSKNWASVYLASTPQQAAE
        G+QVT + + S+++AE+ L T RPVR+RHGKLLEEGASG+L+KK +   ++ E+L    E+DW SLNKVFSE  +D    FGSK WASVYLASTP QAA 
Subjt:  GSQVTGDLLASVSDAEKTLQTQRPVRRRHGKLLEEGASGYLQKKFSTHEIEGENL----EVDWCSLNKVFSEGSKDNDTLFGSKNWASVYLASTPQQAAE

Query:  MGLNFPGVDE
        MGL FPGV+E
Subjt:  MGLNFPGVDE

AT1G08600.2 P-loop containing nucleoside triphosphate hydrolases superfamily protein1.6e-7267.62Show/hide
Query:  AAEAQEALEKESLAKVENEVREELTLTLNGDDLETTVSNEMATFIEEWEGVLDELEIESAHLLEQLDGAGIELPSLYKLIESQASNGCYTEAWKKRIHWV
        AAEAQEALEKESL+KVE+EVREEL   L GD+L+  V+ EM TF +EWE  LDELE ESA LLEQLDGAGIELP LY++IESQA NGCYTEAWK+R HWV
Subjt:  AAEAQEALEKESLAKVENEVREELTLTLNGDDLETTVSNEMATFIEEWEGVLDELEIESAHLLEQLDGAGIELPSLYKLIESQASNGCYTEAWKKRIHWV

Query:  GSQVTGDLLASVSDAEKTLQTQRPVRRRHGKLLEEGASGYLQKKFSTHEIEGENL----EVDWCSLNKVFSEGSKDNDTLFGSKNWASVYLASTPQQAAE
        G+QVT + + S+++AE+ L T RPVR+RHGKLLEEGASG+L+KK +   ++ E+L    E+DW SLNKVFSE  +D    FGSK WASVYLASTP QAA 
Subjt:  GSQVTGDLLASVSDAEKTLQTQRPVRRRHGKLLEEGASGYLQKKFSTHEIEGENL----EVDWCSLNKVFSEGSKDNDTLFGSKNWASVYLASTPQQAAE

Query:  MGLNFPGVDE
        MGL FPGV+E
Subjt:  MGLNFPGVDE

AT1G08600.3 P-loop containing nucleoside triphosphate hydrolases superfamily protein1.6e-7267.62Show/hide
Query:  AAEAQEALEKESLAKVENEVREELTLTLNGDDLETTVSNEMATFIEEWEGVLDELEIESAHLLEQLDGAGIELPSLYKLIESQASNGCYTEAWKKRIHWV
        AAEAQEALEKESL+KVE+EVREEL   L GD+L+  V+ EM TF +EWE  LDELE ESA LLEQLDGAGIELP LY++IESQA NGCYTEAWK+R HWV
Subjt:  AAEAQEALEKESLAKVENEVREELTLTLNGDDLETTVSNEMATFIEEWEGVLDELEIESAHLLEQLDGAGIELPSLYKLIESQASNGCYTEAWKKRIHWV

Query:  GSQVTGDLLASVSDAEKTLQTQRPVRRRHGKLLEEGASGYLQKKFSTHEIEGENL----EVDWCSLNKVFSEGSKDNDTLFGSKNWASVYLASTPQQAAE
        G+QVT + + S+++AE+ L T RPVR+RHGKLLEEGASG+L+KK +   ++ E+L    E+DW SLNKVFSE  +D    FGSK WASVYLASTP QAA 
Subjt:  GSQVTGDLLASVSDAEKTLQTQRPVRRRHGKLLEEGASGYLQKKFSTHEIEGENL----EVDWCSLNKVFSEGSKDNDTLFGSKNWASVYLASTPQQAAE

Query:  MGLNFPGVDE
        MGL FPGV+E
Subjt:  MGLNFPGVDE

AT1G08600.4 P-loop containing nucleoside triphosphate hydrolases superfamily protein1.6e-7267.62Show/hide
Query:  AAEAQEALEKESLAKVENEVREELTLTLNGDDLETTVSNEMATFIEEWEGVLDELEIESAHLLEQLDGAGIELPSLYKLIESQASNGCYTEAWKKRIHWV
        AAEAQEALEKESL+KVE+EVREEL   L GD+L+  V+ EM TF +EWE  LDELE ESA LLEQLDGAGIELP LY++IESQA NGCYTEAWK+R HWV
Subjt:  AAEAQEALEKESLAKVENEVREELTLTLNGDDLETTVSNEMATFIEEWEGVLDELEIESAHLLEQLDGAGIELPSLYKLIESQASNGCYTEAWKKRIHWV

Query:  GSQVTGDLLASVSDAEKTLQTQRPVRRRHGKLLEEGASGYLQKKFSTHEIEGENL----EVDWCSLNKVFSEGSKDNDTLFGSKNWASVYLASTPQQAAE
        G+QVT + + S+++AE+ L T RPVR+RHGKLLEEGASG+L+KK +   ++ E+L    E+DW SLNKVFSE  +D    FGSK WASVYLASTP QAA 
Subjt:  GSQVTGDLLASVSDAEKTLQTQRPVRRRHGKLLEEGASGYLQKKFSTHEIEGENL----EVDWCSLNKVFSEGSKDNDTLFGSKNWASVYLASTPQQAAE

Query:  MGLNFPGVDE
        MGL FPGV+E
Subjt:  MGLNFPGVDE

AT1G43760.1 DNAse I-like superfamily protein5.2e-1546.59Show/hide
Query:  EEIRKVVFESDGNKAPGPDGFSMAFFQNNWEVVKDDLVKVFKEFFERGILDSSLNETYICLIPKKERANKVNDFRPISLITSTYKIIS
        +EI   VF    NKAPGPD F+  FF  +W VVKD  +   KEFF  G L    N T I LIPK    ++++ FRP+S  T  YKII+
Subjt:  EEIRKVVFESDGNKAPGPDGFSMAFFQNNWEVVKDDLVKVFKEFFERGILDSSLNETYICLIPKKERANKVNDFRPISLITSTYKIIS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTCTTAGGTAAGGTGATCCGTTATCGCCCTCCTTTTTCTTTTGGTGGTAGATGTTCTCATGAGATAGTTTCAAAGGGGTGGAGGGAGGTATCATTGAAGGATTCAAG
AGGAGACCTTTCTCATCCTTACCATATGCTCTTGTTTTTGGAGATGATGTTTGAGCTTAAGATTAATAGGAGCAAAAGTCAAGTGGTGGGGATAAACTGTGAGCGATCTA
AGGTGGATAGGTGGGCTTCTTTGGTGGGTTGTGAAGTAGGAGTGTTGTCTATGTCTTATTTAGGCCTCCCCCAGGGCCATAATCCTAGGAGTTTTTCTTTTTGGTGCTCG
ATGATAGATAAGGGAAGGAGAGTAAAGGGGCGCCACCTAGTGAGTTGGGAGGTGGTGGGAAAACCAATTTACCTTGGAGGTCTGGAGATTGGGAACCTTAAAATTCGTAA
CAAAGCTCTATTAGCAAAGTGGTTGTGGCATTTCCCCATTGAACCCTCTTCCTTGTGGTATAAGATAACTGCTGCAGAGGCTCAAGAAGCACTTGAAAAGGAGTCTCTTG
CCAAAGTGGAGAATGAAGTTCGAGAAGAACTTACTTTGACTCTTAATGGTGATGATTTGGAAACAACCGTTTCAAATGAAATGGCTACTTTTATAGAAGAGTGGGAAGGT
GTGCTCGATGAGCTTGAGATTGAGAGTGCTCATTTATTGGAGCAACTTGATGGTGCTGGTATCGAGCTACCAAGTCTGTACAAGTTAATTGAAAGTCAGGCTTCTAATGG
TTGCTATACTGAAGCTTGGAAAAAAAGGATACACTGGGTTGGGTCTCAGGTAACTGGTGATCTTCTTGCATCGGTATCTGACGCAGAGAAGACCCTCCAAACCCAAAGGC
CTGTAAGGAGACGACACGGTAAACTTTTGGAGGAGGGAGCAAGTGGATATCTGCAGAAGAAATTCTCCACTCACGAGATTGAGGGAGAAAATTTGGAAGTTGATTGGTGC
TCCCTTAATAAAGTATTTTCAGAAGGCTCAAAGGACAATGACACATTATTCGGCAGCAAGAACTGGGCTTCCGTTTACTTGGCCAGCACTCCGCAGCAAGCTGCAGAAAT
GGGACTCAATTTTCCTGGAGTTGATGAGGCTTTACGCTCCCCAAACGCCCCTAGACCATTCGTGGACGGCTTGGTTTGGAATCCCATTTCTGAGAGTGATAAGTCCTTCC
TTGATTCGTCGTTTTCCTTGGAGGAGATTAGAAAGGTTGTTTTTGAGAGTGATGGTAACAAAGCCCCTGGTCCTGATGGTTTTTCAATGGCCTTTTTCCAAAATAACTGG
GAGGTGGTTAAGGACGATTTGGTTAAGGTTTTCAAGGAATTCTTTGAACGAGGCATTTTGGACTCTTCCCTGAACGAGACTTATATTTGTCTTATCCCCAAAAAGGAGAG
GGCAAATAAAGTCAACGACTTTAGACCCATCAGTTTGATTACTAGCACTTACAAAATCATATCTAAGGTTTTAGCAAACAGGCTTAAGAAAGTCCTTCCCTCCACAATTT
CCGAAACCCAGGGGGCTTTTGTTAAGGGGAGGCAAATTCTTGATCAAGCCCTTATTGCCAACGAGGCCTTGGAGGATTATCGTACTACAGGTCGAGAAGGAATCATCTTT
AAAATTGACTTTGAAAAAGCATACGATCATGTGGACTGGGACTTCCTTGATAAGGTTCTTGAAAAGAAAGGCTTCGGTTATAAGTGGAGATCGTGGATCTGGAGCAGTGT
TAGATCGGTTCAACATTCTATCCTCATTAATGGTAAGCCGAGGGCAAGATCAGAGCCACAAGAGGTTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGTCTTAGGTAAGGTGATCCGTTATCGCCCTCCTTTTTCTTTTGGTGGTAGATGTTCTCATGAGATAGTTTCAAAGGGGTGGAGGGAGGTATCATTGAAGGATTCAAG
AGGAGACCTTTCTCATCCTTACCATATGCTCTTGTTTTTGGAGATGATGTTTGAGCTTAAGATTAATAGGAGCAAAAGTCAAGTGGTGGGGATAAACTGTGAGCGATCTA
AGGTGGATAGGTGGGCTTCTTTGGTGGGTTGTGAAGTAGGAGTGTTGTCTATGTCTTATTTAGGCCTCCCCCAGGGCCATAATCCTAGGAGTTTTTCTTTTTGGTGCTCG
ATGATAGATAAGGGAAGGAGAGTAAAGGGGCGCCACCTAGTGAGTTGGGAGGTGGTGGGAAAACCAATTTACCTTGGAGGTCTGGAGATTGGGAACCTTAAAATTCGTAA
CAAAGCTCTATTAGCAAAGTGGTTGTGGCATTTCCCCATTGAACCCTCTTCCTTGTGGTATAAGATAACTGCTGCAGAGGCTCAAGAAGCACTTGAAAAGGAGTCTCTTG
CCAAAGTGGAGAATGAAGTTCGAGAAGAACTTACTTTGACTCTTAATGGTGATGATTTGGAAACAACCGTTTCAAATGAAATGGCTACTTTTATAGAAGAGTGGGAAGGT
GTGCTCGATGAGCTTGAGATTGAGAGTGCTCATTTATTGGAGCAACTTGATGGTGCTGGTATCGAGCTACCAAGTCTGTACAAGTTAATTGAAAGTCAGGCTTCTAATGG
TTGCTATACTGAAGCTTGGAAAAAAAGGATACACTGGGTTGGGTCTCAGGTAACTGGTGATCTTCTTGCATCGGTATCTGACGCAGAGAAGACCCTCCAAACCCAAAGGC
CTGTAAGGAGACGACACGGTAAACTTTTGGAGGAGGGAGCAAGTGGATATCTGCAGAAGAAATTCTCCACTCACGAGATTGAGGGAGAAAATTTGGAAGTTGATTGGTGC
TCCCTTAATAAAGTATTTTCAGAAGGCTCAAAGGACAATGACACATTATTCGGCAGCAAGAACTGGGCTTCCGTTTACTTGGCCAGCACTCCGCAGCAAGCTGCAGAAAT
GGGACTCAATTTTCCTGGAGTTGATGAGGCTTTACGCTCCCCAAACGCCCCTAGACCATTCGTGGACGGCTTGGTTTGGAATCCCATTTCTGAGAGTGATAAGTCCTTCC
TTGATTCGTCGTTTTCCTTGGAGGAGATTAGAAAGGTTGTTTTTGAGAGTGATGGTAACAAAGCCCCTGGTCCTGATGGTTTTTCAATGGCCTTTTTCCAAAATAACTGG
GAGGTGGTTAAGGACGATTTGGTTAAGGTTTTCAAGGAATTCTTTGAACGAGGCATTTTGGACTCTTCCCTGAACGAGACTTATATTTGTCTTATCCCCAAAAAGGAGAG
GGCAAATAAAGTCAACGACTTTAGACCCATCAGTTTGATTACTAGCACTTACAAAATCATATCTAAGGTTTTAGCAAACAGGCTTAAGAAAGTCCTTCCCTCCACAATTT
CCGAAACCCAGGGGGCTTTTGTTAAGGGGAGGCAAATTCTTGATCAAGCCCTTATTGCCAACGAGGCCTTGGAGGATTATCGTACTACAGGTCGAGAAGGAATCATCTTT
AAAATTGACTTTGAAAAAGCATACGATCATGTGGACTGGGACTTCCTTGATAAGGTTCTTGAAAAGAAAGGCTTCGGTTATAAGTGGAGATCGTGGATCTGGAGCAGTGT
TAGATCGGTTCAACATTCTATCCTCATTAATGGTAAGCCGAGGGCAAGATCAGAGCCACAAGAGGTTTGA
Protein sequenceShow/hide protein sequence
MVLGKVIRYRPPFSFGGRCSHEIVSKGWREVSLKDSRGDLSHPYHMLLFLEMMFELKINRSKSQVVGINCERSKVDRWASLVGCEVGVLSMSYLGLPQGHNPRSFSFWCS
MIDKGRRVKGRHLVSWEVVGKPIYLGGLEIGNLKIRNKALLAKWLWHFPIEPSSLWYKITAAEAQEALEKESLAKVENEVREELTLTLNGDDLETTVSNEMATFIEEWEG
VLDELEIESAHLLEQLDGAGIELPSLYKLIESQASNGCYTEAWKKRIHWVGSQVTGDLLASVSDAEKTLQTQRPVRRRHGKLLEEGASGYLQKKFSTHEIEGENLEVDWC
SLNKVFSEGSKDNDTLFGSKNWASVYLASTPQQAAEMGLNFPGVDEALRSPNAPRPFVDGLVWNPISESDKSFLDSSFSLEEIRKVVFESDGNKAPGPDGFSMAFFQNNW
EVVKDDLVKVFKEFFERGILDSSLNETYICLIPKKERANKVNDFRPISLITSTYKIISKVLANRLKKVLPSTISETQGAFVKGRQILDQALIANEALEDYRTTGREGIIF
KIDFEKAYDHVDWDFLDKVLEKKGFGYKWRSWIWSSVRSVQHSILINGKPRARSEPQEV