CuGenDBv2

Lag0032572 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0032572
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Polynucleotidyl transferase, ribonuclease H-like superfamily protein
Genome location: chr11:34815151..34815915
RNA-Seq Expression: Lag0032572
Synteny: Lag0032572
Gene Ontology terms:
    GO:0003676 - nucleic acid binding (molecular function)
    GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domains:
    IPR002156 - Ribonuclease H domain
    IPR012337 - Ribonuclease H-like superfamily
    IPR036397 - Ribonuclease H superfamily
    IPR044730 - Ribonuclease H-like domain, plant type
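
The genome location above is given in a `chromosome:start..end` form. As a minimal sketch, the snippet below parses such a string and computes the span length. It assumes the coordinates are 1-based and inclusive, which this page does not state explicitly; the function names `parse_location` and `span_length` are hypothetical and are not part of any CuGenDB tool.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    chrom = match.group(1)
    start, end = int(match.group(2)), int(match.group(3))
    if start > end:
        raise ValueError("start must not exceed end")
    return chrom, start, end


def span_length(start: int, end: int) -> int:
    """Length of an interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end = parse_location("chr11:34815151..34815915")
length = span_length(start, end)
print(chrom, start, end, length)
```

Under that assumption the span is 765 bp, which appears consistent with the 254-residue protein sequence listed on this page (254 codons plus a stop codon).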


Homology
GenBank top hits (columns: e value, %identity, alignment)
CCA66050.1 hypothetical protein [Beta vulgaris subsp. vulgaris] (e value: 5.2e-11; %identity: 29.95)
Query:  LDNEGIAKGVIIMWCIWFFKNRIVHSNTKPSAEFIHNLIESKIKEWENTHQNHQQPRRSRSQVSHSTWEKLQRNRWKMNTDAAWNEKDKSRGLGWIVRDS
        +D + + KG  I+W +W  +NR V  +T   A  +   I  +++++ N         RS + +S S W        K+NTDA+  E+    GLG I RDS
Subjt:  LDNEGIAKGVIIMWCIWFFKNRIVHSNTKPSAEFIHNLIESKIKEWENTHQNHQQPRRSRSQVSHSTWEKLQRNRWKMNTDAAWNEKDKSRGLGWIVRDS

Query:  NGSTICLGMKSIRKIWPVKMMEAEAILEGLK----HVIDTCIRRNIHMEIESDALKVLNVLTETSVNLSDLKSITSAISAMASKLPGVVFRHCSRLLNTV
         G       + +R  WP ++ E +AI    +    H     I        ESD+L     LT+ ++  SDL +I   I +M +    V F H  R  NTV
Subjt:  NGSTICLGMKSIRKIWPVKMMEAEAILEGLK----HVIDTCIRRNIHMEIESDALKVLNVLTETSVNLSDLKSITSAISAMASKLPGVVFRHCSRLLNTV

Query:  AHCVANL
        AH +A +
Subjt:  AHCVANL

XP_022143535.1 uncharacterized protein LOC111013412 [Momordica charantia] (e value: 1.8e-11; %identity: 28.7)
Query:  DSLDNEGIAKGVIIMWCIWFFKNRIVHSNTKPSAEFIHNLIESKI---------KEWENTHQNHQQPRRSRSQVSHSTWEKLQRNRWKMNTDAAWNEKDK
        D    E   + +II W IW  +N+ +     P    I   I+  I          + ++T+++    RR       + W+    N WK+NT+AAW     
Subjt:  DSLDNEGIAKGVIIMWCIWFFKNRIVHSNTKPSAEFIHNLIESKI---------KEWENTHQNHQQPRRSRSQVSHSTWEKLQRNRWKMNTDAAWNEKDK

Query:  SRGLGWIVRDSNGSTICLGMKSIRKIWPVKMMEAEAILEGLKHV-IDTCIRRNIHMEIESDALKVLNVLTETSVNLSDLKSITSAISAMASKLPGVVFRH
        + G+GWI+RD  G  I    + IR    +  +E  AI EGL+ +  + C  R IH  +ESD+L+ +++L     + +++  +   I  M   +  V  RH
Subjt:  SRGLGWIVRDSNGSTICLGMKSIRKIWPVKMMEAEAILEGLKHV-IDTCIRRNIHMEIESDALKVLNVLTETSVNLSDLKSITSAISAMASKLPGVVFRH

Query:  CSRLLNTVAHCVANLA
         SR  N VAH +A  A
Subjt:  CSRLLNTVAHCVANLA

XP_022148549.1 uncharacterized protein LOC111017181 [Momordica charantia] (e value: 4.2e-13; %identity: 30.81)
Query:  GVIIMWCIWFFKNRIVHSN-TKPSAEFIHNLIESKIKEWENTHQNHQQPRRSRSQVSHSTWEKLQRNRWKMNTDAAWNEKDKSRGLGWIVRDSNGSTICL
        G++++W IW ++N+IVHS   +P ++ I    ESKI E+  T+         ++      W    ++ WK+N DA W +   + GLGWIVRDS G  I  
Subjt:  GVIIMWCIWFFKNRIVHSN-TKPSAEFIHNLIESKIKEWENTHQNHQQPRRSRSQVSHSTWEKLQRNRWKMNTDAAWNEKDKSRGLGWIVRDSNGSTICL

Query:  GMKSIRKIWPVKMMEAEAILEGLKHVIDTCIRRNIHMEIESDALKVLNVLTETSVNLSDLKSITSAISAMASKLPGVVFRHCSRLLNTVAHCVANLAC
                  +K +     LE             I +E+ESD L+V+N++ ++S+ L+++  I   I      LP   F+H     N VAH +A  AC
Subjt:  GMKSIRKIWPVKMMEAEAILEGLKHVIDTCIRRNIHMEIESDALKVLNVLTETSVNLSDLKSITSAISAMASKLPGVVFRHCSRLLNTVAHCVANLAC

XP_030502823.1 uncharacterized protein LOC115717993 [Cannabis sativa] (e value: 1.8e-11; %identity: 30.18)
Query:  IMWCIWFFKNRIVHSNTKPSAEFIHNLIESKIKEWEN------THQNHQQPRRSRS-------QVSHST----WEKLQRNRWKMNTDAAWNEKDKSRGLG
        IMW IW  +NR+VH N   SA+ + +     +  W+N       H +   P+ S S       Q S +T    W+     + K+N DAA +   K  G+G
Subjt:  IMWCIWFFKNRIVHSNTKPSAEFIHNLIESKIKEWEN------THQNHQQPRRSRS-------QVSHST----WEKLQRNRWKMNTDAAWNEKDKSRGLG

Query:  WIVRDSNGSTICLGMKSIRKIWPVKMMEAEAILEGLKHVIDTCIRRNIHMEIESDALKVLNVLTETSVNLSDLKSITSAISAMASKLPGVVFRHCSRLLN
         +VRDSNG       K +   +    MEA A+   L  V+   +  +   +IE+DAL+V N L + S  +S  + +   IS++ S  P V   H  R  N
Subjt:  WIVRDSNGSTICLGMKSIRKIWPVKMMEAEAILEGLKHVIDTCIRRNIHMEIESDALKVLNVLTETSVNLSDLKSITSAISAMASKLPGVVFRHCSRLLN

Query:  TVAHCVANLAC---NRCFGNNF
          A C+A  A      CF +++
Subjt:  TVAHCVANLAC---NRCFGNNF

XP_031112158.1 uncharacterized protein LOC116016135 [Ipomoea triloba] (e value: 1.5e-10; %identity: 33.49)
Query:  DSLDNEGIAKGVIIMWCIWFFKNRIVHSNTKPSAEF-IHNLIESKIKEWENTHQNHQQPRRSRSQVSHSTWEKLQRNRWKMNTDAAWNEKDKSRGLGWIV
        ++L NE IAK VII W IW  +N  V       A F +  ++   +   EN    +  P  ++   S   WEK QR R KMNTDAA N+     G GW++
Subjt:  DSLDNEGIAKGVIIMWCIWFFKNRIVHSNTKPSAEF-IHNLIESKIKEWENTHQNHQQPRRSRSQVSHSTWEKLQRNRWKMNTDAAWNEKDKSRGLGWIV

Query:  RDSNGSTICLGMKSIR--KIWPVKMMEAEAILEGLKHVIDTCIRRNIHMEIESDALKVLNVLTETSVNLSDLKSITSAISAMASKLPGVVFRHCSRLLNT
        RD +G    LG K++R   I+  K  EA  + E L  + DTC+     +++E+D+  V   ++      S    I   I  +AS++  V F    R  N 
Subjt:  RDSNGSTICLGMKSIR--KIWPVKMMEAEAILEGLKHVIDTCIRRNIHMEIESDALKVLNVLTETSVNLSDLKSITSAISAMASKLPGVVFRHCSRLLNT

Query:  VAHCVANLA
         AH VA  A
Subjt:  VAHCVANLA

TrEMBL top hits (columns: e value, %identity, alignment)
A0A6J1CP26 uncharacterized protein LOC111013412 (e value: 8.6e-12; %identity: 28.7)
Query:  DSLDNEGIAKGVIIMWCIWFFKNRIVHSNTKPSAEFIHNLIESKI---------KEWENTHQNHQQPRRSRSQVSHSTWEKLQRNRWKMNTDAAWNEKDK
        D    E   + +II W IW  +N+ +     P    I   I+  I          + ++T+++    RR       + W+    N WK+NT+AAW     
Subjt:  DSLDNEGIAKGVIIMWCIWFFKNRIVHSNTKPSAEFIHNLIESKI---------KEWENTHQNHQQPRRSRSQVSHSTWEKLQRNRWKMNTDAAWNEKDK

Query:  SRGLGWIVRDSNGSTICLGMKSIRKIWPVKMMEAEAILEGLKHV-IDTCIRRNIHMEIESDALKVLNVLTETSVNLSDLKSITSAISAMASKLPGVVFRH
        + G+GWI+RD  G  I    + IR    +  +E  AI EGL+ +  + C  R IH  +ESD+L+ +++L     + +++  +   I  M   +  V  RH
Subjt:  SRGLGWIVRDSNGSTICLGMKSIRKIWPVKMMEAEAILEGLKHV-IDTCIRRNIHMEIESDALKVLNVLTETSVNLSDLKSITSAISAMASKLPGVVFRH

Query:  CSRLLNTVAHCVANLA
         SR  N VAH +A  A
Subjt:  CSRLLNTVAHCVANLA

A0A6J1D4B6 uncharacterized protein LOC111017181 (e value: 2.0e-13; %identity: 30.81)
Query:  GVIIMWCIWFFKNRIVHSN-TKPSAEFIHNLIESKIKEWENTHQNHQQPRRSRSQVSHSTWEKLQRNRWKMNTDAAWNEKDKSRGLGWIVRDSNGSTICL
        G++++W IW ++N+IVHS   +P ++ I    ESKI E+  T+         ++      W    ++ WK+N DA W +   + GLGWIVRDS G  I  
Subjt:  GVIIMWCIWFFKNRIVHSN-TKPSAEFIHNLIESKIKEWENTHQNHQQPRRSRSQVSHSTWEKLQRNRWKMNTDAAWNEKDKSRGLGWIVRDSNGSTICL

Query:  GMKSIRKIWPVKMMEAEAILEGLKHVIDTCIRRNIHMEIESDALKVLNVLTETSVNLSDLKSITSAISAMASKLPGVVFRHCSRLLNTVAHCVANLAC
                  +K +     LE             I +E+ESD L+V+N++ ++S+ L+++  I   I      LP   F+H     N VAH +A  AC
Subjt:  GMKSIRKIWPVKMMEAEAILEGLKHVIDTCIRRNIHMEIESDALKVLNVLTETSVNLSDLKSITSAISAMASKLPGVVFRHCSRLLNTVAHCVANLAC

A0A803NTZ9 Uncharacterized protein (e value: 4.3e-11; %identity: 27.56)
Query:  LDNEGIAKGVIIMWCIWFFKNRIVHS--NTKPSAEFIH--NLIESKIKEWEN------------THQNHQQPRRSRSQVSHS-----TWEKLQRNRWKMN
        L  E     + IMW IW  +N+I+H   N   +A  +H    + + IK   N             + +H Q     SQ  HS      W+    N+ K+N
Subjt:  LDNEGIAKGVIIMWCIWFFKNRIVHS--NTKPSAEFIH--NLIESKIKEWEN------------THQNHQQPRRSRSQVSHS-----TWEKLQRNRWKMN

Query:  TDAAWNEKDKSRGLGWIVRDSNGSTICLGMKSIRKIWPVKMMEAEAILEGLKHVIDTCIRRNIHMEIESDALKVLNVLTETSVNLSDLKSITSAISAMAS
         DAA N  DK  G+G I+R+ +G  +    K ++  +    MEA+A+   L  +    ++ +    +E+DAL+V + +   S +LS    +   +  + S
Subjt:  TDAAWNEKDKSRGLGWIVRDSNGSTICLGMKSIRKIWPVKMMEAEAILEGLKHVIDTCIRRNIHMEIESDALKVLNVLTETSVNLSDLKSITSAISAMAS

Query:  KLPGVVFRHCSRLLNTVAHCVANLA
          PG+V  H  R  N  AH +A  A
Subjt:  KLPGVVFRHCSRLLNTVAHCVANLA

A0A803PLQ2 Uncharacterized protein (e value: 3.9e-12; %identity: 27.45)
Query:  IMWCIWFFKNRIVHSNTKPSAEFIHNLIESKIKEWENTHQNHQQPRRSRSQVSHST----------WEKLQRNRWKMNTDAAWNEKDKSRGLGWIVRDSN
        I+W IW  +N++VH  T      I N     ++++    ++      + S++  ++          W+  Q N +KMN DAA NE+ K  G+G ++RD N
Subjt:  IMWCIWFFKNRIVHSNTKPSAEFIHNLIESKIKEWENTHQNHQQPRRSRSQVSHST----------WEKLQRNRWKMNTDAAWNEKDKSRGLGWIVRDSN

Query:  GSTICLGMKSIRKIWPVKMMEAEAILEGLKHVIDTCIRRNIHM-EIESDALKVLNVLTETSVNLSDLKSITSAISAMASKLPGVVFRHCSRLLNTVAHCV
        G+ I    K ++  +    MEA+A+     H ++  ++  + +  IE+DAL+V + +   S NLS    + + +  + S  P V+  H  R  N  AH +
Subjt:  GSTICLGMKSIRKIWPVKMMEAEAILEGLKHVIDTCIRRNIHM-EIESDALKVLNVLTETSVNLSDLKSITSAISAMASKLPGVVFRHCSRLLNTVAHCV

Query:  ANLA
        A  A
Subjt:  ANLA

A0A803PM52 Uncharacterized protein (e value: 8.6e-12; %identity: 30.18)
Query:  IMWCIWFFKNRIVHSNTKPSAEFIHNLIESKIKEWEN------THQNHQQPRRSRS-------QVSHST----WEKLQRNRWKMNTDAAWNEKDKSRGLG
        IMW IW  +NR+VH N   SA+ + +     +  W+N       H +   P+ S S       Q S +T    W+     + K+N DAA +   K  G+G
Subjt:  IMWCIWFFKNRIVHSNTKPSAEFIHNLIESKIKEWEN------THQNHQQPRRSRS-------QVSHST----WEKLQRNRWKMNTDAAWNEKDKSRGLG

Query:  WIVRDSNGSTICLGMKSIRKIWPVKMMEAEAILEGLKHVIDTCIRRNIHMEIESDALKVLNVLTETSVNLSDLKSITSAISAMASKLPGVVFRHCSRLLN
         +VRDSNG       K +   +    MEA A+   L  V+   +  +   +IE+DAL+V N L + S  +S  + +   IS++ S  P V   H  R  N
Subjt:  WIVRDSNGSTICLGMKSIRKIWPVKMMEAEAILEGLKHVIDTCIRRNIHMEIESDALKVLNVLTETSVNLSDLKSITSAISAMASKLPGVVFRHCSRLLN

Query:  TVAHCVANLAC---NRCFGNNF
          A C+A  A      CF +++
Subjt:  TVAHCVANLAC---NRCFGNNF

SwissProt top hits (columns: e value, %identity, alignment)
No hits found

Arabidopsis top hits (columns: e value, %identity, alignment)
AT1G27870.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein (e value: 1.7e-04; %identity: 30.49)
Query:  QNHQQPRRSRSQVSHSTWEKLQRNRWKMNTDAAWNEKDKSRGLGWIVRDSNGSTICLGMKSIRKIWPVKMMEAEAILEGLKH
        Q + + R  R + +H  W + +R   K N D ++   D     GW+VRDSNGS +  G    RK+      E +A++  ++H
Subjt:  QNHQQPRRSRSQVSHSTWEKLQRNRWKMNTDAAWNEKDKSRGLGWIVRDSNGSTICLGMKSIRKIWPVKMMEAEAILEGLKH

AT2G34320.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein (e value: 1.8e-06; %identity: 25.68)
Query:  IMWCIWFFKNRIVHSNTKPSAEFIHNLIESKIKEWENTHQNHQQPRRSRSQVSHSTWEKLQRNRW-KMNTDAAWNEKDKSRGLGWIVRDSNGSTICLGMK
        ++W +W  +N ++    +  A  +        +EW    +   +    + + + S   K    +W K NTDA W  ++   G+GWI+R+ +G  + +G +
Subjt:  IMWCIWFFKNRIVHSNTKPSAEFIHNLIESKIKEWENTHQNHQQPRRSRSQVSHSTWEKLQRNRW-KMNTDAAWNEKDKSRGLGWIVRDSNGSTICLGMK

Query:  SIRKIWPVKMMEAEAILEGLKHVIDTCIRRNIHMEI-ESDALKVLNVL
        ++    P      EA LE L+  + T  R N    I ESDA  ++N+L
Subjt:  SIRKIWPVKMMEAEAILEGLKHVIDTCIRRNIHMEI-ESDALKVLNVL

AT5G65005.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein (e value: 2.0e-08; %identity: 35)
Query:  IMWCIWFFKNRIVHSNTKPSAEFIHNLIESKIKEW-ENTHQNHQQPRRSRSQVSHST-WEKLQRNRWKMNTDAAWNEKDKSRGLGWIVRDSNGSTICLGM
        +MW IW   N +V ++T+   +    +  +  KEW +NT  N QQ     +  S +T W    R++ K N DA+ +E++   GLGWI+R+S G+ I  GM
Subjt:  IMWCIWFFKNRIVHSNTKPSAEFIHNLIESKIKEW-ENTHQNHQQPRRSRSQVSHST-WEKLQRNRWKMNTDAAWNEKDKSRGLGWIVRDSNGSTICLGM


Sequences
CDS sequence
ATGACGGACTCACTGGACAATGAGGGGATAGCTAAAGGAGTCATAATTATGTGGTGTATTTGGTTTTTCAAGAATCGCATTGTTCATTCCAACACAAAGCCATCAGCAGA
GTTCATTCATAACCTGATTGAATCAAAGATTAAAGAATGGGAGAATACTCACCAGAATCATCAGCAGCCGAGAAGATCGAGGAGCCAAGTGAGTCACAGTACGTGGGAGA
AGTTGCAGAGGAACCGTTGGAAGATGAACACTGATGCCGCTTGGAACGAGAAGGATAAGAGTAGAGGATTAGGCTGGATTGTTCGTGACTCAAATGGATCCACAATCTGT
CTTGGGATGAAATCAATCAGGAAAATATGGCCAGTCAAAATGATGGAGGCAGAAGCGATTTTAGAAGGGCTAAAGCATGTAATTGATACCTGTATTCGAAGGAATATCCA
TATGGAAATTGAGTCGGATGCTCTCAAAGTGCTTAACGTGCTGACCGAAACTTCCGTCAACCTATCAGACTTGAAATCCATCACAAGTGCCATCTCTGCCATGGCTTCGA
AGCTTCCGGGGGTCGTTTTTCGCCATTGTAGTAGGCTTTTGAACACAGTAGCGCACTGTGTTGCCAATTTGGCTTGTAATCGCTGTTTTGGGAACAATTTTGATCGAAGT
TTTTTGGTTGACAAGGAGAAAGAGTGCGTTTTCTGGGCCCCAAATATCCCCAACTGTTTTCTCCCTCCATTTATTGAGGGTGGTTGTAGTTCTGATTTGCGTTAA
mRNA sequence
ATGACGGACTCACTGGACAATGAGGGGATAGCTAAAGGAGTCATAATTATGTGGTGTATTTGGTTTTTCAAGAATCGCATTGTTCATTCCAACACAAAGCCATCAGCAGA
GTTCATTCATAACCTGATTGAATCAAAGATTAAAGAATGGGAGAATACTCACCAGAATCATCAGCAGCCGAGAAGATCGAGGAGCCAAGTGAGTCACAGTACGTGGGAGA
AGTTGCAGAGGAACCGTTGGAAGATGAACACTGATGCCGCTTGGAACGAGAAGGATAAGAGTAGAGGATTAGGCTGGATTGTTCGTGACTCAAATGGATCCACAATCTGT
CTTGGGATGAAATCAATCAGGAAAATATGGCCAGTCAAAATGATGGAGGCAGAAGCGATTTTAGAAGGGCTAAAGCATGTAATTGATACCTGTATTCGAAGGAATATCCA
TATGGAAATTGAGTCGGATGCTCTCAAAGTGCTTAACGTGCTGACCGAAACTTCCGTCAACCTATCAGACTTGAAATCCATCACAAGTGCCATCTCTGCCATGGCTTCGA
AGCTTCCGGGGGTCGTTTTTCGCCATTGTAGTAGGCTTTTGAACACAGTAGCGCACTGTGTTGCCAATTTGGCTTGTAATCGCTGTTTTGGGAACAATTTTGATCGAAGT
TTTTTGGTTGACAAGGAGAAAGAGTGCGTTTTCTGGGCCCCAAATATCCCCAACTGTTTTCTCCCTCCATTTATTGAGGGTGGTTGTAGTTCTGATTTGCGTTAA
Protein sequence
MTDSLDNEGIAKGVIIMWCIWFFKNRIVHSNTKPSAEFIHNLIESKIKEWENTHQNHQQPRRSRSQVSHSTWEKLQRNRWKMNTDAAWNEKDKSRGLGWIVRDSNGSTIC
LGMKSIRKIWPVKMMEAEAILEGLKHVIDTCIRRNIHMEIESDALKVLNVLTETSVNLSDLKSITSAISAMASKLPGVVFRHCSRLLNTVAHCVANLACNRCFGNNFDRS
FLVDKEKECVFWAPNIPNCFLPPFIEGGCSSDLR