; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0001270 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0001270
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionDDE Tnp4 domain-containing protein
Genome locationchr4:28308374..28310391
RNA-Seq ExpressionLag0001270
SyntenyLag0001270
Gene Ontology termsGO:0046872 - metal ion binding (molecular function)
InterPro domainsIPR027806 - Harbinger transposase-derived nuclease domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0068176.1 putative nuclease HARBI1 isoform X1 [Cucumis melo var. makuwa]4.0e-6778.62Show/hide
Query:  KKDQYYLVDSGYTNMPGFLEPYRGQRYHLREFRKRRHQPRGREEVFNYRHSSLRNVIERCFGVLKARFPILKQMPPYPVNTQKYIPTPCCTVHDFIRLND
        K+DQYYLVDSGY+NMPGFL P+RGQRYHLR+FR+RRH+PRGREEVFNYRHSSLRNVIERCFGVLKARFPILKQMPPYP+ TQKYI   CCTVH++IRLND
Subjt:  KKDQYYLVDSGYTNMPGFLEPYRGQRYHLREFRKRRHQPRGREEVFNYRHSSLRNVIERCFGVLKARFPILKQMPPYPVNTQKYIPTPCCTVHDFIRLND

Query:  LQDALFNEFSNESMVVEDMQSLPDNLRSNVIELDVSQQQLRRMARVRDNIADQIWAAFE
         QD LFN+FSNE M+VED  +L  NL+S++IELDVS+Q LR MAR RD+IA+QIWA FE
Subjt:  LQDALFNEFSNESMVVEDMQSLPDNLRSNVIELDVSQQQLRRMARVRDNIADQIWAAFE

KAF7123090.1 hypothetical protein RHSIM_Rhsim12G0067000 [Rhododendron simsii]5.6e-5353.05Show/hide
Query:  YVGFHKMRSSRQPCRTSLLKGHDYVIELLNG--NDTRCF-DCF--------RMKKDQYYLVDSGYTNMPGFLEPYRGQRYHLREFRKRRHQPRGREEVFN
        Y G H + +       S      YV+    G  ND+R F +C         +  + +YY+VDSGYTNMPGFL PYRG+RYHL  FR    +P+  EE FN
Subjt:  YVGFHKMRSSRQPCRTSLLKGHDYVIELLNG--NDTRCF-DCF--------RMKKDQYYLVDSGYTNMPGFLEPYRGQRYHLREFRKRRHQPRGREEVFN

Query:  YRHSSLRNVIERCFGVLKARFPILKQMPPYPVNTQKYIPTPCCTVHDFIRLNDLQDALFNEFSNESMVVEDMQSLPDNLRSNVIELDVSQQQLRRMARVR
        +RHSSLRNVIERCFGVLKARFPILKQMPPY V TQKYIPT CCTVH++IR++D  D LF E+S E+MV+   Q    +   N +E+DV++ QLRRM+ VR
Subjt:  YRHSSLRNVIERCFGVLKARFPILKQMPPYPVNTQKYIPTPCCTVHDFIRLNDLQDALFNEFSNESMVVEDMQSLPDNLRSNVIELDVSQQQLRRMARVR

Query:  DNIADQIWAAFER
        D IA+QIW++  R
Subjt:  DNIADQIWAAFER

KAF7123090.1 hypothetical protein RHSIM_Rhsim12G0067000 [Rhododendron simsii]4.3e-1326.37Show/hide
Query:  GWDPLLGTITLEEEQWNDLFKVNRRAKRFKKSGCPHYAKLMRFFGDTTATGASVCPSTKLPSDSE---------DENDVGS-------------------
        GWD  L T+T  ++ W  LFK  +     KK G P+Y +L R FGDT+ATGA   PS K+PS S+         DE +V S                   
Subjt:  GWDPLLGTITLEEEQWNDLFKVNRRAKRFKKSGCPHYAKLMRFFGDTTATGASVCPSTKLPSDSE---------DENDVGS-------------------

Query:  -NTNFCL------------------------CLMRGGKISF----------------------------------------VVSRSILIE-PFQRHI---
         N +  L                        C    G +S                                         V+ R + ++ P +R I   
Subjt:  -NTNFCL------------------------CLMRGGKISF----------------------------------------VVSRSILIE-PFQRHI---

Query:  -EMNLIEDNDFENCDSDDDDDALYIFFNLLYVGFHKMRSSRQPCRTSLLKGHDYVIELLNGNDTRCFDCFRMK
         + N    +   + D D +D+ + +  +L    ++ M    +PCRTS+L+GHDYV+E+LNG++ RC   FRMK
Subjt:  -EMNLIEDNDFENCDSDDDDDALYIFFNLLYVGFHKMRSSRQPCRTSLLKGHDYVIELLNGNDTRCFDCFRMK

KAF7123090.1 hypothetical protein RHSIM_Rhsim12G0067000 [Rhododendron simsii]5.6e-5353.05Show/hide
Query:  YVGFHKMRSSRQPCRTSLLKGHDYVIELLNG--NDTRCF-DCF--------RMKKDQYYLVDSGYTNMPGFLEPYRGQRYHLREFRKRRHQPRGREEVFN
        Y G H + +       S      YV+    G  ND+R F +C         +  + +YY+VDSGYTNMPGFL PYRG+RYHL  FR    +P+  EE FN
Subjt:  YVGFHKMRSSRQPCRTSLLKGHDYVIELLNG--NDTRCF-DCF--------RMKKDQYYLVDSGYTNMPGFLEPYRGQRYHLREFRKRRHQPRGREEVFN

Query:  YRHSSLRNVIERCFGVLKARFPILKQMPPYPVNTQKYIPTPCCTVHDFIRLNDLQDALFNEFSNESMVVEDMQSLPDNLRSNVIELDVSQQQLRRMARVR
        +RHSSLRNVIERCFGVLKARFPILKQMPPY V TQKYIPT CCTVH++IR++D  D LF E+S E+MV+   Q    +   N +E+DV++ QLRRM+ VR
Subjt:  YRHSSLRNVIERCFGVLKARFPILKQMPPYPVNTQKYIPTPCCTVHDFIRLNDLQDALFNEFSNESMVVEDMQSLPDNLRSNVIELDVSQQQLRRMARVR

Query:  DNIADQIWAAFER
        D IA+QIW +  R
Subjt:  DNIADQIWAAFER

TYK06269.1 protein ALP1-like [Cucumis melo var. makuwa]3.0e-6778.62Show/hide
Query:  KKDQYYLVDSGYTNMPGFLEPYRGQRYHLREFRKRRHQPRGREEVFNYRHSSLRNVIERCFGVLKARFPILKQMPPYPVNTQKYIPTPCCTVHDFIRLND
        K+DQYYLV+SGY+NMPGFL P+RGQRYHLR+FR+RRH+PRGREEVFNYRHSSLRNVIERCFGVLKARFPILKQMPPYP+ TQKYIP  CCTVH++IRLND
Subjt:  KKDQYYLVDSGYTNMPGFLEPYRGQRYHLREFRKRRHQPRGREEVFNYRHSSLRNVIERCFGVLKARFPILKQMPPYPVNTQKYIPTPCCTVHDFIRLND

Query:  LQDALFNEFSNESMVVEDMQSLPDNLRSNVIELDVSQQQLRRMARVRDNIADQIWAAFE
         QD LFN FSNE M+VED  +L  NL+S++IELDVS+Q LR MAR RD+IA+QIWA FE
Subjt:  LQDALFNEFSNESMVVEDMQSLPDNLRSNVIELDVSQQQLRRMARVRDNIADQIWAAFE

TYK21096.1 protein ALP1-like [Cucumis melo var. makuwa]4.0e-6778.62Show/hide
Query:  KKDQYYLVDSGYTNMPGFLEPYRGQRYHLREFRKRRHQPRGREEVFNYRHSSLRNVIERCFGVLKARFPILKQMPPYPVNTQKYIPTPCCTVHDFIRLND
        K+DQYYLVDSGY+NMPGFL P+RGQRYHLR+FR+RRH+PRGREEVFNYRHSSLRNVIERCFGVLKARFPILKQMPPYP+ TQKYI   CCTVH++IRLND
Subjt:  KKDQYYLVDSGYTNMPGFLEPYRGQRYHLREFRKRRHQPRGREEVFNYRHSSLRNVIERCFGVLKARFPILKQMPPYPVNTQKYIPTPCCTVHDFIRLND

Query:  LQDALFNEFSNESMVVEDMQSLPDNLRSNVIELDVSQQQLRRMARVRDNIADQIWAAFE
         QD LFN+FSNE M+VED  +L  NL+S++IELDVS+Q LR MAR RD+IA+QIWA FE
Subjt:  LQDALFNEFSNESMVVEDMQSLPDNLRSNVIELDVSQQQLRRMARVRDNIADQIWAAFE

TrEMBL top hitse value%identityAlignment
A0A1R3GLP1 Harbinger transposase-derived nuclease2.2e-4754.86Show/hide
Query:  NDTRCF-DCFRMKKD--------QYYLVDSGYTNMPGFLEPYRGQRYHLREFRKRRHQPRGREEVFNYRHSSLRNVIERCFGVLKARFPILKQMPPYPVN
        NDTR F +C    ++        +YYLVDSGYTNMPGFL PYRG+RYHLR++R R  QP G EE+FN+RHS LRN IERCFGVLKARFPILK MPPY   
Subjt:  NDTRCF-DCFRMKKD--------QYYLVDSGYTNMPGFLEPYRGQRYHLREFRKRRHQPRGREEVFNYRHSSLRNVIERCFGVLKARFPILKQMPPYPVN

Query:  TQKYIPTPCCTVHDFIRLNDLQDALFNEFSNESMVVEDMQSLPDNLRSNVIELDVSQQQLRRMARVRDNIADQIW
         Q+YI   CCTVH+FIR + ++D +F EF  + +++++ ++L      N +E++V+  QL++MARVRD IA Q+W
Subjt:  TQKYIPTPCCTVHDFIRLNDLQDALFNEFSNESMVVEDMQSLPDNLRSNVIELDVSQQQLRRMARVRDNIADQIW

A0A1S4E0U9 uncharacterized protein LOC1034960412.4e-4976.19Show/hide
Query:  KKDQYYLVDSGYTNMPGFLEPYRGQRYHLREFRKRRHQPRGREEVFNYRHSSLRNVIERCFGVLKARFPILKQMPPYPVNTQKYIPTPCCTVHDFIRLND
        K+DQYYLVDSGY+NMP FL P++G+RYHLR+FR+RRH PRGR EVFNYRHSSLRNVIERCFGVLKARFPILKQMPPYP+ T KYI   CCT+H++IRLND
Subjt:  KKDQYYLVDSGYTNMPGFLEPYRGQRYHLREFRKRRHQPRGREEVFNYRHSSLRNVIERCFGVLKARFPILKQMPPYPVNTQKYIPTPCCTVHDFIRLND

Query:  LQDALFNEFSNESMVVEDMQSLPDNL
         QD LFN+FSNE M+VED+ +L  NL
Subjt:  LQDALFNEFSNESMVVEDMQSLPDNL

A0A5A7VNL5 Putative nuclease HARBI1 isoform X11.9e-6778.62Show/hide
Query:  KKDQYYLVDSGYTNMPGFLEPYRGQRYHLREFRKRRHQPRGREEVFNYRHSSLRNVIERCFGVLKARFPILKQMPPYPVNTQKYIPTPCCTVHDFIRLND
        K+DQYYLVDSGY+NMPGFL P+RGQRYHLR+FR+RRH+PRGREEVFNYRHSSLRNVIERCFGVLKARFPILKQMPPYP+ TQKYI   CCTVH++IRLND
Subjt:  KKDQYYLVDSGYTNMPGFLEPYRGQRYHLREFRKRRHQPRGREEVFNYRHSSLRNVIERCFGVLKARFPILKQMPPYPVNTQKYIPTPCCTVHDFIRLND

Query:  LQDALFNEFSNESMVVEDMQSLPDNLRSNVIELDVSQQQLRRMARVRDNIADQIWAAFE
         QD LFN+FSNE M+VED  +L  NL+S++IELDVS+Q LR MAR RD+IA+QIWA FE
Subjt:  LQDALFNEFSNESMVVEDMQSLPDNLRSNVIELDVSQQQLRRMARVRDNIADQIWAAFE

A0A5D3C7F6 Protein ALP1-like1.5e-6778.62Show/hide
Query:  KKDQYYLVDSGYTNMPGFLEPYRGQRYHLREFRKRRHQPRGREEVFNYRHSSLRNVIERCFGVLKARFPILKQMPPYPVNTQKYIPTPCCTVHDFIRLND
        K+DQYYLV+SGY+NMPGFL P+RGQRYHLR+FR+RRH+PRGREEVFNYRHSSLRNVIERCFGVLKARFPILKQMPPYP+ TQKYIP  CCTVH++IRLND
Subjt:  KKDQYYLVDSGYTNMPGFLEPYRGQRYHLREFRKRRHQPRGREEVFNYRHSSLRNVIERCFGVLKARFPILKQMPPYPVNTQKYIPTPCCTVHDFIRLND

Query:  LQDALFNEFSNESMVVEDMQSLPDNLRSNVIELDVSQQQLRRMARVRDNIADQIWAAFE
         QD LFN FSNE M+VED  +L  NL+S++IELDVS+Q LR MAR RD+IA+QIWA FE
Subjt:  LQDALFNEFSNESMVVEDMQSLPDNLRSNVIELDVSQQQLRRMARVRDNIADQIWAAFE

A0A5D3DC11 Protein ALP1-like1.9e-6778.62Show/hide
Query:  KKDQYYLVDSGYTNMPGFLEPYRGQRYHLREFRKRRHQPRGREEVFNYRHSSLRNVIERCFGVLKARFPILKQMPPYPVNTQKYIPTPCCTVHDFIRLND
        K+DQYYLVDSGY+NMPGFL P+RGQRYHLR+FR+RRH+PRGREEVFNYRHSSLRNVIERCFGVLKARFPILKQMPPYP+ TQKYI   CCTVH++IRLND
Subjt:  KKDQYYLVDSGYTNMPGFLEPYRGQRYHLREFRKRRHQPRGREEVFNYRHSSLRNVIERCFGVLKARFPILKQMPPYPVNTQKYIPTPCCTVHDFIRLND

Query:  LQDALFNEFSNESMVVEDMQSLPDNLRSNVIELDVSQQQLRRMARVRDNIADQIWAAFE
         QD LFN+FSNE M+VED  +L  NL+S++IELDVS+Q LR MAR RD+IA+QIWA FE
Subjt:  LQDALFNEFSNESMVVEDMQSLPDNLRSNVIELDVSQQQLRRMARVRDNIADQIWAAFE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G43722.1 unknown protein1.8e-0948.53Show/hide
Query:  DQYYLVDSGYTNMPGFLEPYRGQ-----RYHLREFRKRRHQPRGREEVFNYRHSSLRNVIERCFGVLK
        ++YYLVDSGY N  G L PYR       RYH+ +F     +PR + E+FN  H+SLR+VIER F + K
Subjt:  DQYYLVDSGYTNMPGFLEPYRGQ-----RYHLREFRKRRHQPRGREEVFNYRHSSLRNVIERCFGVLK

AT4G10890.1 unknown protein4.5e-0847.06Show/hide
Query:  QYYLVDSGYTNMPGFLEPYRGQRYHLREFRKRRHQPRGREEVFNYRHSSLRNVIERCFGVLKARFPIL
        +YYLV+S Y    G+L P+R   YHL +F  R   P   +E+FN +H  LR+VI+R FGV KA++ IL
Subjt:  QYYLVDSGYTNMPGFLEPYRGQRYHLREFRKRRHQPRGREEVFNYRHSSLRNVIERCFGVLKARFPIL

AT5G35695.1 CONTAINS InterPro DOMAIN/s: Putative harbinger transposase-derived nuclease (InterPro:IPR006912)3.2e-2238.06Show/hide
Query:  QYYLVDSGYTNMPGFLEPYRGQRYHLREFRKRRHQPRGREEVFNYRHSSLRNVIERCFGVLKARFPILKQMPPYPVNTQKYIPTPCCTVHDFIRLNDLQD
        ++YLVD G+ N   FL P+RG RYHL+EF  +R  P    E+FN RH SLRNVIER FG+ K+RF I K  PP+    Q  +   C  +H+F+R     D
Subjt:  QYYLVDSGYTNMPGFLEPYRGQRYHLREFRKRRHQPRGREEVFNYRHSSLRNVIERCFGVLKARFPILKQMPPYPVNTQKYIPTPCCTVHDFIRLNDLQD

Query:  A--LFNEFSNESMVV-EDMQSLPDNLRSNVIELDVSQQQLRRMARVRDNIADQIW
             +E  NE  VV  +  ++  N   N   L+  +Q        R ++A+ +W
Subjt:  A--LFNEFSNESMVV-EDMQSLPDNLRSNVIELDVSQQQLRRMARVRDNIADQIW

AT5G41980.1 CONTAINS InterPro DOMAIN/s: Putative harbinger transposase-derived nuclease (InterPro:IPR006912)1.2e-2134.16Show/hide
Query:  RMKKDQYYLVDSGYTNMPGFLEPYRGQRYHLREFRKRRHQPRGREEVFNYRHSSLRNVIERCFGVLKARFPILKQMPPYPVNTQKYIPTPCCTVHDFIRL
        ++ + +YY+VD+ Y N+PGF+ PY G   + RE  K         E+FN RH  L   I R FG LK RFPIL   PPYP+ TQ  +    C +H+++RL
Subjt:  RMKKDQYYLVDSGYTNMPGFLEPYRGQRYHLREFRKRRHQPRGREEVFNYRHSSLRNVIERCFGVLKARFPILKQMPPYPVNTQKYIPTPCCTVHDFIRL

Query:  NDLQDALFNEFSNESMVV--EDMQSLPDNLRSNVI--ELDVSQQQLRRMARVRDNIADQIW
            D +F  F  E++    ED +   +  +  ++  E     +++    R+RD IA ++W
Subjt:  NDLQDALFNEFSNESMVV--EDMQSLPDNLRSNVI--ELDVSQQQLRRMARVRDNIADQIW


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCAAGAAATGGTTGGGATCCATTATTGGGCACTATCACCCTAGAAGAGGAGCAATGGAATGATCTTTTTAAGGTCAATAGGAGAGCTAAAAGGTTTAAAAAAAGTGG
TTGCCCGCATTATGCAAAGCTTATGAGATTTTTTGGAGATACTACAGCTACAGGTGCTAGTGTTTGTCCATCAACAAAACTTCCTTCAGATTCTGAAGATGAAAATGATG
TTGGAAGCAATACAAATTTTTGCTTATGCCTGATGCGAGGAGGAAAAATTTCATTCGTAGTATCTAGGAGTATTCTAATTGAACCCTTTCAACGACATATTGAGATGAAT
TTAATTGAAGACAATGACTTTGAGAATTGTGATTCTGACGACGATGATGATGCTCTATACATCTTTTTCAATTTATTGTATGTTGGTTTTCACAAGATGCGATCTTCTAG
ACAACCATGTAGGACGTCTTTACTAAAAGGTCATGATTATGTGATCGAGTTGTTAAATGGCAACGACACAAGATGTTTTGATTGCTTTAGGATGAAGAAAGACCAGTACT
ATCTTGTTGATTCTGGATATACAAATATGCCTGGATTTTTAGAACCATATCGTGGTCAAAGATACCATTTACGAGAATTTAGAAAAAGGAGACATCAGCCTCGCGGTAGG
GAAGAAGTTTTTAACTATCGGCATTCTTCACTTCGAAATGTTATTGAACGTTGTTTTGGCGTATTGAAGGCTCGATTTCCAATTCTCAAACAAATGCCACCTTACCCAGT
CAACACACAAAAGTATATTCCGACACCATGTTGTACTGTTCACGATTTCATTAGATTGAATGATCTTCAAGATGCTCTATTCAATGAGTTTAGCAATGAATCAATGGTCG
TTGAAGATATGCAGAGTTTGCCAGACAATTTACGAAGTAATGTAATTGAGTTAGATGTGAGTCAACAACAGTTAAGGCGAATGGCTCGAGTGAGAGACAACATTGCCGAT
CAAATTTGGGCAGCATTTGAAAGATGA
mRNA sequenceShow/hide mRNA sequence
ATGTCAAGAAATGGTTGGGATCCATTATTGGGCACTATCACCCTAGAAGAGGAGCAATGGAATGATCTTTTTAAGGTCAATAGGAGAGCTAAAAGGTTTAAAAAAAGTGG
TTGCCCGCATTATGCAAAGCTTATGAGATTTTTTGGAGATACTACAGCTACAGGTGCTAGTGTTTGTCCATCAACAAAACTTCCTTCAGATTCTGAAGATGAAAATGATG
TTGGAAGCAATACAAATTTTTGCTTATGCCTGATGCGAGGAGGAAAAATTTCATTCGTAGTATCTAGGAGTATTCTAATTGAACCCTTTCAACGACATATTGAGATGAAT
TTAATTGAAGACAATGACTTTGAGAATTGTGATTCTGACGACGATGATGATGCTCTATACATCTTTTTCAATTTATTGTATGTTGGTTTTCACAAGATGCGATCTTCTAG
ACAACCATGTAGGACGTCTTTACTAAAAGGTCATGATTATGTGATCGAGTTGTTAAATGGCAACGACACAAGATGTTTTGATTGCTTTAGGATGAAGAAAGACCAGTACT
ATCTTGTTGATTCTGGATATACAAATATGCCTGGATTTTTAGAACCATATCGTGGTCAAAGATACCATTTACGAGAATTTAGAAAAAGGAGACATCAGCCTCGCGGTAGG
GAAGAAGTTTTTAACTATCGGCATTCTTCACTTCGAAATGTTATTGAACGTTGTTTTGGCGTATTGAAGGCTCGATTTCCAATTCTCAAACAAATGCCACCTTACCCAGT
CAACACACAAAAGTATATTCCGACACCATGTTGTACTGTTCACGATTTCATTAGATTGAATGATCTTCAAGATGCTCTATTCAATGAGTTTAGCAATGAATCAATGGTCG
TTGAAGATATGCAGAGTTTGCCAGACAATTTACGAAGTAATGTAATTGAGTTAGATGTGAGTCAACAACAGTTAAGGCGAATGGCTCGAGTGAGAGACAACATTGCCGAT
CAAATTTGGGCAGCATTTGAAAGATGA
Protein sequenceShow/hide protein sequence
MSRNGWDPLLGTITLEEEQWNDLFKVNRRAKRFKKSGCPHYAKLMRFFGDTTATGASVCPSTKLPSDSEDENDVGSNTNFCLCLMRGGKISFVVSRSILIEPFQRHIEMN
LIEDNDFENCDSDDDDDALYIFFNLLYVGFHKMRSSRQPCRTSLLKGHDYVIELLNGNDTRCFDCFRMKKDQYYLVDSGYTNMPGFLEPYRGQRYHLREFRKRRHQPRGR
EEVFNYRHSSLRNVIERCFGVLKARFPILKQMPPYPVNTQKYIPTPCCTVHDFIRLNDLQDALFNEFSNESMVVEDMQSLPDNLRSNVIELDVSQQQLRRMARVRDNIAD
QIWAAFER