CuGenDBv2

Spg031100 (gene) of Sponge gourd (cylindrica) v1 genome

Gene ID: Spg031100
Organism: Luffa cylindrica (Sponge gourd (cylindrica) v1)
Description: RNase H domain-containing protein
Genome location: scaffold8:29907514..29909937
RNA-Seq Expression: Spg031100
Synteny: Spg031100
Gene Ontology terms:
    GO:0003676 - nucleic acid binding (molecular function)
    GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domains:
    IPR002156 - Ribonuclease H domain
    IPR012337 - Ribonuclease H-like superfamily
    IPR036397 - Ribonuclease H superfamily
    IPR044730 - Ribonuclease H-like domain, plant type
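
The genome location field above follows a `scaffold:start..end` pattern. As a minimal sketch of how such a field might be split into its parts, the Python below parses the string and reports the span length. The `Location` type and `parse_location` helper are hypothetical names introduced here for illustration, and the length calculation assumes the coordinates are 1-based and inclusive, which this page does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """Hypothetical container for a 'scaffold:start..end' location string."""
    scaffold: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive on both ends.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Split a location such as 'scaffold8:29907514..29909937' into fields."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    return Location(match.group(1), int(match.group(2)), int(match.group(3)))


loc = parse_location("scaffold8:29907514..29909937")
print(loc.scaffold, loc.start, loc.end, loc.length)
```

Under the inclusive-coordinate assumption, the gene spans 2424 bases on scaffold8.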


Homology
GenBank top hits | e value | % identity | Alignment
XP_022143317.1 uncharacterized protein LOC111013216 [Momordica charantia] | 2.6e-20 | 31.94
Query:  WSATDFWDEFGRLLKDEELEIAALILWNVWNCMNKISINGGKENMQKLKRAIEQNIKELEKSKDPNLAVVAAKRHQNPDW---------------WSPPP
        W+  D W+    +L DEE+  + +I W +W   N+    G   + Q+L R+I   +  +  + D    +   +R Q  D                WS PP
Subjt:  WSATDFWDEFGRLLKDEELEIAALILWNVWNCMNKISINGGKENMQKLKRAIEQNIKELEKSKDPNLAVVAAKRHQNPDW---------------WSPPP

Query:  SGCWKLNSDASWFPEANSGEIGWIIRDSDGSLITAGGRKIKRQWSVKALEFLAVKEGILASLGLNNRLPIIVESDASDVIKALNSECVELS
        + CWKLN+DASW  E   G IGWI+ D  G ++ AG  KI+ +  + ALE + +  G L  + + +R PI +ESD+ +VI+ +  E V+L+
Subjt:  SGCWKLNSDASWFPEANSGEIGWIIRDSDGSLITAGGRKIKRQWSVKALEFLAVKEGILASLGLNNRLPIIVESDASDVIKALNSECVELS

XP_022143535.1 uncharacterized protein LOC111013412 [Momordica charantia] | 3.2e-18 | 30.1
Query:  DEELEIAALILWNVWNCMNKISINGGKENMQKLKRAIE--------QNIKELEKSKDPNLAVVAAKRHQNPDWWSPPPSGCWKLNSDASWFPEANSGEIG
        +EE   + +I W +W   NK    G     + ++ AI+        +N     KS + +L ++          W PP S  WKLN++A+W  + N+G IG
Subjt:  DEELEIAALILWNVWNCMNKISINGGKENMQKLKRAIE--------QNIKELEKSKDPNLAVVAAKRHQNPDWWSPPPSGCWKLNSDASWFPEANSGEIG

Query:  WIIRDSDGSLITAGGRKIKRQWSVKALEFLAVKEGILASLGLNNRLPIIVESDASDVIKALNSECVELSEAKIVMVEVEEQGKAAGVIVFNKCRRSSNKV
        WI+RD  G +I A  R I+ + ++  LE +A+ EG L ++   +  PI +ESD+ + I  L+ +C + +E   ++ E+ +  K   ++      R +NKV
Subjt:  WIIRDSDGSLITAGGRKIKRQWSVKALEFLAVKEGILASLGLNNRLPIIVESDASDVIKALNSECVELSEAKIVMVEVEEQGKAAGVIVFNKCRRSSNKV

Query:  AHSLAK
        AH LA+
Subjt:  AHSLAK

XP_022155262.1 uncharacterized protein LOC111022403 [Momordica charantia] | 8.5e-19 | 30.81
Query:  DEELEIAALILWNVWNCMNKISINGGKENMQKLKRAIEQNIKELEKSKDPNLAVVAAKRHQNPDWWSPPPSGCWKLNSDASWFPEANSGEIGWIIRDSDG
        DE+L++  +  W +WN  N +   G   +   + + + + + E     + +L+++  K   N   W PPP   W LN+DASW    + G IGWIIR  DG
Subjt:  DEELEIAALILWNVWNCMNKISINGGKENMQKLKRAIEQNIKELEKSKDPNLAVVAAKRHQNPDWWSPPPSGCWKLNSDASWFPEANSGEIGWIIRDSDG

Query:  SLITAGGRKIKRQWSVKALEFLAVKEGILASLGLNNRLPIIVESDASDVIKALNSECVELSEAKIVMVEVEEQGKAAGVIVFNKCRRSSNKVAHSLAK
         ++ AG R ++   +VK LE  A+ EG+     L    P+ +E+D+++V   LN +  +L++   V+ E+     +  ++ F K  R +N  AHSLA+
Subjt:  SLITAGGRKIKRQWSVKALEFLAVKEGILASLGLNNRLPIIVESDASDVIKALNSECVELSEAKIVMVEVEEQGKAAGVIVFNKCRRSSNKVAHSLAK

XP_022156777.1 uncharacterized protein LOC111023608 [Momordica charantia] | 1.8e-16 | 28.84
Query:  DEELEIAALILWNVWNCMNKISINGGKENMQKLKRAIEQ----------NIKELEKSKDPNLAVVAAKRHQNPDWWSPPPSGCWKLNSDASWFPEANSGE
        +EE   + +I W +W   NK    G     + ++  I++          N+K   KS + +L ++          W PP S  WKLN+DA+W  + N+G 
Subjt:  DEELEIAALILWNVWNCMNKISINGGKENMQKLKRAIEQ----------NIKELEKSKDPNLAVVAAKRHQNPDWWSPPPSGCWKLNSDASWFPEANSGE

Query:  IGWIIRDSDGSLITAGGRKIKRQWSVKALEFLAVKEGILA-------SLGLNNRLPIIVESDASDVIKALNSECVELSEAKIVMVEVEEQGKAAGVIVFN
        IGWI+RD  G +I A  R I+ + ++  LE +A+ EG+ A        +   +  PI +ESD+ + I  L+ +C + +E   ++ E+ +  +   ++   
Subjt:  IGWIIRDSDGSLITAGGRKIKRQWSVKALEFLAVKEGILA-------SLGLNNRLPIIVESDASDVIKALNSECVELSEAKIVMVEVEEQGKAAGVIVFN

Query:  KCRRSSNKVAHSLAK
           R +NKVAH LA+
Subjt:  KCRRSSNKVAHSLAK

XP_024041966.1 uncharacterized protein LOC112099096 [Citrus clementina] | 6.3e-14 | 30.14
Query:  FWDEFGRLLKDEELEIAALILWNVWNCMNKISINGGKENMQKLKRAIEQNIKELEKSKDPNLAVVAAKRHQNPDWWSPPPSGCWKLNSDASWFPEANSGE
        FW +  +  K E  E+AAL LW +WN  NK    G +EN  ++    E  ++  +K +   +A    +  +    WSPPP+G  K+N DA+   E     
Subjt:  FWDEFGRLLKDEELEIAALILWNVWNCMNKISINGGKENMQKLKRAIEQNIKELEKSKDPNLAVVAAKRHQNPDWWSPPPSGCWKLNSDASWFPEANSGE

Query:  IGWIIRDSDGSLITAGGRKIKRQWSVKALEFLAVKEGILASLGLNNRLPIIVESDASDVIKALNSECVELSEAKIVMVEVEEQGKAAGVIVFNKCRRSSN
        +G ++RDSDG+   A  + ++   SV   E  A++ G+  +   N    I  ESD+ +VI  +N +   L+E   ++ +++E  +         C R  N
Subjt:  IGWIIRDSDGSLITAGGRKIKRQWSVKALEFLAVKEGILASLGLNNRLPIIVESDASDVIKALNSECVELSEAKIVMVEVEEQGKAAGVIVFNKCRRSSN

Query:  KVAHSLAKI
          AHSLAK+
Subjt:  KVAHSLAKI

TrEMBL top hits | e value | % identity | Alignment
A0A4Y1R838 Ribonuclease H-like superfamily protein | 8.9e-14 | 30.28
Query:  DFWDEFGRLLKDEELEIAALILWNVWNCMNKISINGGKENMQKLKRAIEQNIKELEKSKDPNLAVVAAKRH----------QNPDWWSPPPSGCWKLNSD
        +F+D  GR+  +E L++ A +LW +W C N +   G +  +Q    A+   + ++++    N+A     +            +P W SPPP G  KLN D
Subjt:  DFWDEFGRLLKDEELEIAALILWNVWNCMNKISINGGKENMQKLKRAIEQNIKELEKSKDPNLAVVAAKRH----------QNPDWWSPPPSGCWKLNSD

Query:  ASWFPEANSGEIGWIIRDSDGSLITAGGRKIKRQWSVKALEFLAVKEGILASLGLNNRLPIIVESDASDVIKALNSECVELSEAKIVMVEVEEQGKAAGV
         +W  +   G +GW++RD  G  I+AGG    R  S    E  AV+E +   L L     + VESD+  +IK LN E V   E + ++ +V      A  
Subjt:  ASWFPEANSGEIGWIIRDSDGSLITAGGRKIKRQWSVKALEFLAVKEGILASLGLNNRLPIIVESDASDVIKALNSECVELSEAKIVMVEVEEQGKAAGV

Query:  IVFNKCRRSSNKVAHSLA
        + F    R  N+ AH++A
Subjt:  IVFNKCRRSSNKVAHSLA

A0A6J1CP26 uncharacterized protein LOC111013412 | 1.6e-18 | 30.1
Query:  DEELEIAALILWNVWNCMNKISINGGKENMQKLKRAIE--------QNIKELEKSKDPNLAVVAAKRHQNPDWWSPPPSGCWKLNSDASWFPEANSGEIG
        +EE   + +I W +W   NK    G     + ++ AI+        +N     KS + +L ++          W PP S  WKLN++A+W  + N+G IG
Subjt:  DEELEIAALILWNVWNCMNKISINGGKENMQKLKRAIE--------QNIKELEKSKDPNLAVVAAKRHQNPDWWSPPPSGCWKLNSDASWFPEANSGEIG

Query:  WIIRDSDGSLITAGGRKIKRQWSVKALEFLAVKEGILASLGLNNRLPIIVESDASDVIKALNSECVELSEAKIVMVEVEEQGKAAGVIVFNKCRRSSNKV
        WI+RD  G +I A  R I+ + ++  LE +A+ EG L ++   +  PI +ESD+ + I  L+ +C + +E   ++ E+ +  K   ++      R +NKV
Subjt:  WIIRDSDGSLITAGGRKIKRQWSVKALEFLAVKEGILASLGLNNRLPIIVESDASDVIKALNSECVELSEAKIVMVEVEEQGKAAGVIVFNKCRRSSNKV

Query:  AHSLAK
        AH LA+
Subjt:  AHSLAK

A0A6J1CQG0 uncharacterized protein LOC111013216 | 1.3e-20 | 31.94
Query:  WSATDFWDEFGRLLKDEELEIAALILWNVWNCMNKISINGGKENMQKLKRAIEQNIKELEKSKDPNLAVVAAKRHQNPDW---------------WSPPP
        W+  D W+    +L DEE+  + +I W +W   N+    G   + Q+L R+I   +  +  + D    +   +R Q  D                WS PP
Subjt:  WSATDFWDEFGRLLKDEELEIAALILWNVWNCMNKISINGGKENMQKLKRAIEQNIKELEKSKDPNLAVVAAKRHQNPDW---------------WSPPP

Query:  SGCWKLNSDASWFPEANSGEIGWIIRDSDGSLITAGGRKIKRQWSVKALEFLAVKEGILASLGLNNRLPIIVESDASDVIKALNSECVELS
        + CWKLN+DASW  E   G IGWI+ D  G ++ AG  KI+ +  + ALE + +  G L  + + +R PI +ESD+ +VI+ +  E V+L+
Subjt:  SGCWKLNSDASWFPEANSGEIGWIIRDSDGSLITAGGRKIKRQWSVKALEFLAVKEGILASLGLNNRLPIIVESDASDVIKALNSECVELS

A0A6J1DNV9 uncharacterized protein LOC111022403 | 4.1e-19 | 30.81
Query:  DEELEIAALILWNVWNCMNKISINGGKENMQKLKRAIEQNIKELEKSKDPNLAVVAAKRHQNPDWWSPPPSGCWKLNSDASWFPEANSGEIGWIIRDSDG
        DE+L++  +  W +WN  N +   G   +   + + + + + E     + +L+++  K   N   W PPP   W LN+DASW    + G IGWIIR  DG
Subjt:  DEELEIAALILWNVWNCMNKISINGGKENMQKLKRAIEQNIKELEKSKDPNLAVVAAKRHQNPDWWSPPPSGCWKLNSDASWFPEANSGEIGWIIRDSDG

Query:  SLITAGGRKIKRQWSVKALEFLAVKEGILASLGLNNRLPIIVESDASDVIKALNSECVELSEAKIVMVEVEEQGKAAGVIVFNKCRRSSNKVAHSLAK
         ++ AG R ++   +VK LE  A+ EG+     L    P+ +E+D+++V   LN +  +L++   V+ E+     +  ++ F K  R +N  AHSLA+
Subjt:  SLITAGGRKIKRQWSVKALEFLAVKEGILASLGLNNRLPIIVESDASDVIKALNSECVELSEAKIVMVEVEEQGKAAGVIVFNKCRRSSNKVAHSLAK

A0A6J1DSV1 uncharacterized protein LOC111023608 | 8.6e-17 | 28.84
Query:  DEELEIAALILWNVWNCMNKISINGGKENMQKLKRAIEQ----------NIKELEKSKDPNLAVVAAKRHQNPDWWSPPPSGCWKLNSDASWFPEANSGE
        +EE   + +I W +W   NK    G     + ++  I++          N+K   KS + +L ++          W PP S  WKLN+DA+W  + N+G 
Subjt:  DEELEIAALILWNVWNCMNKISINGGKENMQKLKRAIEQ----------NIKELEKSKDPNLAVVAAKRHQNPDWWSPPPSGCWKLNSDASWFPEANSGE

Query:  IGWIIRDSDGSLITAGGRKIKRQWSVKALEFLAVKEGILA-------SLGLNNRLPIIVESDASDVIKALNSECVELSEAKIVMVEVEEQGKAAGVIVFN
        IGWI+RD  G +I A  R I+ + ++  LE +A+ EG+ A        +   +  PI +ESD+ + I  L+ +C + +E   ++ E+ +  +   ++   
Subjt:  IGWIIRDSDGSLITAGGRKIKRQWSVKALEFLAVKEGILA-------SLGLNNRLPIIVESDASDVIKALNSECVELSEAKIVMVEVEEQGKAAGVIVFN

Query:  KCRRSSNKVAHSLAK
           R +NKVAH LA+
Subjt:  KCRRSSNKVAHSLAK

SwissProt top hits | e value | % identity | Alignment
No hits found
Arabidopsis top hits | e value | % identity | Alignment
AT4G09775.1 BEST Arabidopsis thaliana protein match is: Ribonuclease H-like superfamily protein (TAIR:AT2G02650.1) | 2.8e-04 | 25
Query:  LWNVWNCMN------------KISINGGKENMQKLKRAIEQNIKELEKSKDPNLAVVAAKRHQNPDWWSPPPSGCWKLNSDASWFPEANSGEIGWIIRDS
        +W +W   N            K++  G +E  + ++  I         ++ PN       +      WSPPP G  K N D+ +    +     WIIRDS
Subjt:  LWNVWNCMN------------KISINGGKENMQKLKRAIEQNIKELEKSKDPNLAVVAAKRHQNPDWWSPPPSGCWKLNSDASWFPEANSGEIGWIIRDS

Query:  DGSLITAGGRKIKRQWSVKALEFL
        +G +I +G  K+++ +S    E L
Subjt:  DGSLITAGGRKIKRQWSVKALEFL

AT4G29090.1 Ribonuclease H-like superfamily protein | 5.9e-10 | 26.14
Query:  EIAALILWNVWNCMNKISINGGKENMQKLKRAIEQNIKELE-KSKDPNLAVVAAKRHQNPDWWSPPPSGCWKLNSDASWFPEANSGEIGWIIRDSDGSLI
        ++   +LW +W   N++   G + N Q++ R  E +++E   +++  +          +   W PPP    K N+DA+W  +     IGW++R+  G + 
Subjt:  EIAALILWNVWNCMNKISINGGKENMQKLKRAIEQNIKELE-KSKDPNLAVVAAKRHQNPDWWSPPPSGCWKLNSDASWFPEANSGEIGWIIRDSDGSLI

Query:  TAGGRKIKRQWSVKALEFLAVKEGILASLGLNNRLPIIVESDASDVIKALNSE
          G R + +  SV   E  A++  +L SL       +I ESD+  +I+ LN++
Subjt:  TAGGRKIKRQWSVKALEFLAVKEGILASLGLNNRLPIIVESDASDVIKALNSE

AT5G65005.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein | 7.9e-07 | 23.68
Query:  ILWNVWNCMNKISINGGKENMQKLKRAIEQNIKELEKSKDPNLAVVAAKRHQNPD-----WWSPPPSGCWKLNSDASWFPEANSGEIGWIIRDSDGSLIT
        ++W +W   N +  N       K +  +E  + + ++  D  +       ++N D      WSPP     K N DAS         +GWI+R+S G++I 
Subjt:  ILWNVWNCMNKISINGGKENMQKLKRAIEQNIKELEKSKDPNLAVVAAKRHQNPD-----WWSPPPSGCWKLNSDASWFPEANSGEIGWIIRDSDGSLIT

Query:  AGGRKIKRQWSVKALEFLAVKEGILASLGLNNRLPIIVESDASDVIKALNSE
         G  K + + + +  E   +   I AS G  ++  +I E D   + + +N++
Subjt:  AGGRKIKRQWSVKALEFLAVKEGILASLGLNNRLPIIVESDASDVIKALNSE


Sequences
CDS sequence
ATGGATGGCTGGAGCGCAGCAAATTTTTGGGATGGTTGGAGCGCAACAGATTTTTGGGATGAATTTGGAAGATTGCTGAAGGACGAGGAGCTGGAAATTGCGGCCCTAAT
TCTATGGAACGTGTGGAATTGCATGAACAAAATCTCTATCAATGGAGGAAAAGAAAATATGCAGAAGCTAAAAAGAGCAATTGAACAAAACATCAAAGAACTCGAGAAAA
GTAAGGACCCGAACCTGGCAGTGGTGGCAGCAAAGAGGCATCAGAATCCGGATTGGTGGTCTCCCCCTCCGAGTGGCTGTTGGAAACTCAATTCAGATGCATCGTGGTTT
CCTGAAGCAAACTCGGGCGAAATTGGTTGGATTATTCGAGATTCCGATGGGTCTCTGATTACAGCTGGAGGAAGGAAAATCAAAAGGCAATGGTCAGTTAAAGCTCTCGA
ATTCTTGGCAGTTAAAGAGGGGATCCTTGCCTCGTTAGGTTTGAACAACCGATTACCTATTATTGTGGAGTCGGATGCATCTGATGTGATCAAAGCCTTGAATTCAGAGT
GTGTTGAGCTTTCGGAAGCAAAAATTGTGATGGTGGAAGTTGAAGAGCAAGGCAAAGCTGCTGGAGTGATCGTCTTCAACAAATGTCGAAGGTCAAGTAACAAAGTGGCG
CATTCCCTCGCCAAAATCGTGGCATCTCCTTGGCCGGTTGGCTCGTCTTCGTCTGTTGCAGGCGTTAATGTTTTTGTGCAGTTCCCCTCTTCCACGCTGGAAGAGCCTTT
TTGTTTTTGTGGGGATTCTGTGGTGCCTTTATGGCTATCCTCGATATTAAATGAGGATATGGTTATCCGCATTTATGAAAAGGGGCCTAAGTACATGGAGATTGGAGTGG
ATATAGAGAGTCAATCGGAACCCTACCTGGCCCTGCCGCCATCTTTAAAACGGAAGGAGGAGAGCAAAAGAGCAAGAGGAGAGAGTAGAGAATATAGATTAGAGTTCGGG
ATCCTTTCTTCAGCGATGAAGAAGGGTTTAAATACCTGTTCTTGCCCTAGCGTTACGTTTTTAGGAATTCGAAGGCGTTTTGGGCTGAACCAAGTGGAACCGGAGCGGAC
AGGGGCGGTAGGGACCGAACAGAGGTGGAAGGCGCGGGCTGACCATATGGGTCGAGCCAAGGCCCGGTCCCTCCGACCTTGGCCCGACCTTTTGGCCGGTTTCGCCTGCG
GAGTCCGCTTTCCAGTCTTATTTCTGCCCGACTGTCCTCGTTAG
mRNA sequence
ATGGATGGCTGGAGCGCAGCAAATTTTTGGGATGGTTGGAGCGCAACAGATTTTTGGGATGAATTTGGAAGATTGCTGAAGGACGAGGAGCTGGAAATTGCGGCCCTAAT
TCTATGGAACGTGTGGAATTGCATGAACAAAATCTCTATCAATGGAGGAAAAGAAAATATGCAGAAGCTAAAAAGAGCAATTGAACAAAACATCAAAGAACTCGAGAAAA
GTAAGGACCCGAACCTGGCAGTGGTGGCAGCAAAGAGGCATCAGAATCCGGATTGGTGGTCTCCCCCTCCGAGTGGCTGTTGGAAACTCAATTCAGATGCATCGTGGTTT
CCTGAAGCAAACTCGGGCGAAATTGGTTGGATTATTCGAGATTCCGATGGGTCTCTGATTACAGCTGGAGGAAGGAAAATCAAAAGGCAATGGTCAGTTAAAGCTCTCGA
ATTCTTGGCAGTTAAAGAGGGGATCCTTGCCTCGTTAGGTTTGAACAACCGATTACCTATTATTGTGGAGTCGGATGCATCTGATGTGATCAAAGCCTTGAATTCAGAGT
GTGTTGAGCTTTCGGAAGCAAAAATTGTGATGGTGGAAGTTGAAGAGCAAGGCAAAGCTGCTGGAGTGATCGTCTTCAACAAATGTCGAAGGTCAAGTAACAAAGTGGCG
CATTCCCTCGCCAAAATCGTGGCATCTCCTTGGCCGGTTGGCTCGTCTTCGTCTGTTGCAGGCGTTAATGTTTTTGTGCAGTTCCCCTCTTCCACGCTGGAAGAGCCTTT
TTGTTTTTGTGGGGATTCTGTGGTGCCTTTATGGCTATCCTCGATATTAAATGAGGATATGGTTATCCGCATTTATGAAAAGGGGCCTAAGTACATGGAGATTGGAGTGG
ATATAGAGAGTCAATCGGAACCCTACCTGGCCCTGCCGCCATCTTTAAAACGGAAGGAGGAGAGCAAAAGAGCAAGAGGAGAGAGTAGAGAATATAGATTAGAGTTCGGG
ATCCTTTCTTCAGCGATGAAGAAGGGTTTAAATACCTGTTCTTGCCCTAGCGTTACGTTTTTAGGAATTCGAAGGCGTTTTGGGCTGAACCAAGTGGAACCGGAGCGGAC
AGGGGCGGTAGGGACCGAACAGAGGTGGAAGGCGCGGGCTGACCATATGGGTCGAGCCAAGGCCCGGTCCCTCCGACCTTGGCCCGACCTTTTGGCCGGTTTCGCCTGCG
GAGTCCGCTTTCCAGTCTTATTTCTGCCCGACTGTCCTCGTTAG
Protein sequence
MDGWSAANFWDGWSATDFWDEFGRLLKDEELEIAALILWNVWNCMNKISINGGKENMQKLKRAIEQNIKELEKSKDPNLAVVAAKRHQNPDWWSPPPSGCWKLNSDASWF
PEANSGEIGWIIRDSDGSLITAGGRKIKRQWSVKALEFLAVKEGILASLGLNNRLPIIVESDASDVIKALNSECVELSEAKIVMVEVEEQGKAAGVIVFNKCRRSSNKVA
HSLAKIVASPWPVGSSSSVAGVNVFVQFPSSTLEEPFCFCGDSVVPLWLSSILNEDMVIRIYEKGPKYMEIGVDIESQSEPYLALPPSLKRKEESKRARGESREYRLEFG
ILSSAMKKGLNTCSCPSVTFLGIRRRFGLNQVEPERTGAVGTEQRWKARADHMGRAKARSLRPWPDLLAGFACGVRFPVLFLPDCPR
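
The protein sequence above corresponds to a codon-by-codon translation of the CDS sequence. The Python sketch below illustrates that relationship on the first 18 bases of the CDS. It assumes the standard genetic code (NCBI translation table 1); the page itself does not state which table was used, and the `translate` helper is a hypothetical name introduced for this example.

```python
from itertools import product

# Assumption: standard genetic code (NCBI translation table 1), written in
# the conventional TCAG ordering with the first codon position varying slowest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()  # drop line breaks and whitespace
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X marks an unknown codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First six codons of the CDS sequence listed on this page.
print(translate("ATGGATGGCTGGAGCGCA"))  # prints "MDGWSA"
```

The printed residues match the start of the protein sequence (MDGWSA...). Passing the full CDS, with its line breaks, to the same function would be one way to check the rest of the record under the same assumption.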