; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg038656 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg038656
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationscaffold12:436578..439140
RNA-Seq ExpressionSpg038656
SyntenySpg038656
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR002156 - Ribonuclease H domain
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily
IPR044730 - Ribonuclease H-like domain, plant type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAF4360260.1 hypothetical protein F8388_020551 [Cannabis sativa]5.4e-1430.63Show/hide
Query:  HWRSIPSGMWKINCDAAWNEGQDRGGIGWILRRWDGSPVSAGLRSINCQRNIKWLEALAAVDGLNSV--SINSPRVILEMDSLQVVNLLTGRDEDITKIV
        HW   P+G   +NCDAA N G    G G+I R WDG  + AG+   N   ++   EA A ++ LN++  + +SP  I + D   +V+ +  +D  +T + 
Subjt:  HWRSIPSGMWKINCDAAWNEGQDRGGIGWILRRWDGSPVSAGLRSINCQRNIKWLEALAAVDGLNSV--SINSPRVILEMDSLQVVNLLTGRDEDITKIV

Query:  YLVREAKSRIASLQMDSVVHTPRRYNNMAHILARRACDLNISMSWCNFFPSWLISLNNID
         ++ + + ++  L  +S+VH  R  N  AH+LAR+      +  + + FP WL      D
Subjt:  YLVREAKSRIASLQMDSVVHTPRRYNNMAHILARRACDLNISMSWCNFFPSWLISLNNID

XP_022143535.1 uncharacterized protein LOC111013412 [Momordica charantia]8.1e-1838.97Show/hide
Query:  WRSIPSGMWKINCDAAWNEGQDRGGIGWILRRWDGSPVSAGLRSINCQRNIKWLEALAAVDGLNSVSINSPRVI-LEMDSLQVVNLLTGRDEDITKIVYL
        W+   S  WK+N +AAW    + GGIGWILR   G  + A  R I  +RNI +LE +A  +GL ++     R I LE DSL+ ++LL  + +D T+I++L
Subjt:  WRSIPSGMWKINCDAAWNEGQDRGGIGWILRRWDGSPVSAGLRSINCQRNIKWLEALAAVDGLNSVSINSPRVI-LEMDSLQVVNLLTGRDEDITKIVYL

Query:  VREAKSRIASLQMDSVVHTPRRYNNMAHILARRACD
        + E    +  +++ S+ H  R  N +AH LARRA +
Subjt:  VREAKSRIASLQMDSVVHTPRRYNNMAHILARRACD

XP_022148737.1 uncharacterized protein LOC111017329 [Momordica charantia]1.6e-1334.85Show/hide
Query:  PSGMWKINCDAAWNEGQDRGGIGWILRRWDGSPVSAGLRSINCQRNIKWLEALAAVDGLNS-VSINSPRVILEMDSLQVVNLLTGRDEDITKIVYLVREA
        P   WK+N DAAW+  Q  GG+GWI+R        AG ++I   R+I +LE +A   G+ + VS +S  +I+E +SL+ ++L+ G  +++T+I++LV++ 
Subjt:  PSGMWKINCDAAWNEGQDRGGIGWILRRWDGSPVSAGLRSINCQRNIKWLEALAAVDGLNS-VSINSPRVILEMDSLQVVNLLTGRDEDITKIVYLVREA

Query:  KSRIASLQMDSVVHTPRRYNNMAHILARRACD
        K+     ++    H  R  N++A  +A RA D
Subjt:  KSRIASLQMDSVVHTPRRYNNMAHILARRACD

XP_022155262.1 uncharacterized protein LOC111022403 [Momordica charantia]7.1e-2240.26Show/hide
Query:  LHWRSIPSGMWKINCDAAWNEGQDRGGIGWILRRWDGSPVSAGLRSINCQRNIKWLEALAAVDGL-NSVSINSPRVI-LEMDSLQVVNLLTGRDEDITKI
        L W   P  +W +N DA+W++   RGGIGWI+R WDG  V AG R +    N+K LEA A ++GL N  ++   R + +E DS +V +LL  + ED+TK 
Subjt:  LHWRSIPSGMWKINCDAAWNEGQDRGGIGWILRRWDGSPVSAGLRSINCQRNIKWLEALAAVDGL-NSVSINSPRVI-LEMDSLQVVNLLTGRDEDITKI

Query:  VYLVREAKSRIASLQMDSVVHTPRRYNNMAHILARRACDLNISMSWCNFFPSWL
         ++V E  +   S ++ +     R  N  AH LA+RA  L  SM W + FP+WL
Subjt:  VYLVREAKSRIASLQMDSVVHTPRRYNNMAHILARRACDLNISMSWCNFFPSWL

XP_022156777.1 uncharacterized protein LOC111023608 [Momordica charantia]3.1e-1737.5Show/hide
Query:  WRSIPSGMWKINCDAAWNEGQDRGGIGWILRRWDGSPVSAGLRSINCQRNIKWLEALAAVDGLNSVSINSPRVI---------LEMDSLQVVNLLTGRDE
        W+   S  WK+N DAAW    + GGIGWILR   G  + A  R I  +RNI +LE +A  +GL ++     R I         LE DSL+ ++LL  + +
Subjt:  WRSIPSGMWKINCDAAWNEGQDRGGIGWILRRWDGSPVSAGLRSINCQRNIKWLEALAAVDGLNSVSINSPRVI---------LEMDSLQVVNLLTGRDE

Query:  DITKIVYLVREAKSRIASLQMDSVVHTPRRYNNMAHILARRACD
        D T+I++L+ E    +  +++ S+ H  R  N +AH LARRA +
Subjt:  DITKIVYLVREAKSRIASLQMDSVVHTPRRYNNMAHILARRACD

TrEMBL top hitse value%identityAlignment
A0A6J1CP26 uncharacterized protein LOC1110134123.9e-1838.97Show/hide
Query:  WRSIPSGMWKINCDAAWNEGQDRGGIGWILRRWDGSPVSAGLRSINCQRNIKWLEALAAVDGLNSVSINSPRVI-LEMDSLQVVNLLTGRDEDITKIVYL
        W+   S  WK+N +AAW    + GGIGWILR   G  + A  R I  +RNI +LE +A  +GL ++     R I LE DSL+ ++LL  + +D T+I++L
Subjt:  WRSIPSGMWKINCDAAWNEGQDRGGIGWILRRWDGSPVSAGLRSINCQRNIKWLEALAAVDGLNSVSINSPRVI-LEMDSLQVVNLLTGRDEDITKIVYL

Query:  VREAKSRIASLQMDSVVHTPRRYNNMAHILARRACD
        + E    +  +++ S+ H  R  N +AH LARRA +
Subjt:  VREAKSRIASLQMDSVVHTPRRYNNMAHILARRACD

A0A6J1DNV9 uncharacterized protein LOC1110224033.4e-2240.26Show/hide
Query:  LHWRSIPSGMWKINCDAAWNEGQDRGGIGWILRRWDGSPVSAGLRSINCQRNIKWLEALAAVDGL-NSVSINSPRVI-LEMDSLQVVNLLTGRDEDITKI
        L W   P  +W +N DA+W++   RGGIGWI+R WDG  V AG R +    N+K LEA A ++GL N  ++   R + +E DS +V +LL  + ED+TK 
Subjt:  LHWRSIPSGMWKINCDAAWNEGQDRGGIGWILRRWDGSPVSAGLRSINCQRNIKWLEALAAVDGL-NSVSINSPRVI-LEMDSLQVVNLLTGRDEDITKI

Query:  VYLVREAKSRIASLQMDSVVHTPRRYNNMAHILARRACDLNISMSWCNFFPSWL
         ++V E  +   S ++ +     R  N  AH LA+RA  L  SM W + FP+WL
Subjt:  VYLVREAKSRIASLQMDSVVHTPRRYNNMAHILARRACDLNISMSWCNFFPSWL

A0A6J1DSV1 uncharacterized protein LOC1110236081.5e-1737.5Show/hide
Query:  WRSIPSGMWKINCDAAWNEGQDRGGIGWILRRWDGSPVSAGLRSINCQRNIKWLEALAAVDGLNSVSINSPRVI---------LEMDSLQVVNLLTGRDE
        W+   S  WK+N DAAW    + GGIGWILR   G  + A  R I  +RNI +LE +A  +GL ++     R I         LE DSL+ ++LL  + +
Subjt:  WRSIPSGMWKINCDAAWNEGQDRGGIGWILRRWDGSPVSAGLRSINCQRNIKWLEALAAVDGLNSVSINSPRVI---------LEMDSLQVVNLLTGRDE

Query:  DITKIVYLVREAKSRIASLQMDSVVHTPRRYNNMAHILARRACD
        D T+I++L+ E    +  +++ S+ H  R  N +AH LARRA +
Subjt:  DITKIVYLVREAKSRIASLQMDSVVHTPRRYNNMAHILARRACD

A0A7J6ERF5 Uncharacterized protein2.6e-1430.63Show/hide
Query:  HWRSIPSGMWKINCDAAWNEGQDRGGIGWILRRWDGSPVSAGLRSINCQRNIKWLEALAAVDGLNSV--SINSPRVILEMDSLQVVNLLTGRDEDITKIV
        HW   P+G   +NCDAA N G    G G+I R WDG  + AG+   N   ++   EA A ++ LN++  + +SP  I + D   +V+ +  +D  +T + 
Subjt:  HWRSIPSGMWKINCDAAWNEGQDRGGIGWILRRWDGSPVSAGLRSINCQRNIKWLEALAAVDGLNSV--SINSPRVILEMDSLQVVNLLTGRDEDITKIV

Query:  YLVREAKSRIASLQMDSVVHTPRRYNNMAHILARRACDLNISMSWCNFFPSWLISLNNID
         ++ + + ++  L  +S+VH  R  N  AH+LAR+      +  + + FP WL      D
Subjt:  YLVREAKSRIASLQMDSVVHTPRRYNNMAHILARRACDLNISMSWCNFFPSWLISLNNID

A0A803PLN3 Uncharacterized protein5.8e-1430.86Show/hide
Query:  LLHWRSIPSGMWKINCDAAWNEGQDRGGIGWILRRWDGSPVSAGLRSINCQRNIKWLEALAAVDGLNSV--SINSPRVILEMDSLQVVNLLTGRDEDITK
        L HW   P+G   +NCDAA N G    G G+I R W+G  + AG+   +   +++  EA A ++ L     + N P + ++ D   +V+ +  RD++++ 
Subjt:  LLHWRSIPSGMWKINCDAAWNEGQDRGGIGWILRRWDGSPVSAGLRSINCQRNIKWLEALAAVDGLNSV--SINSPRVILEMDSLQVVNLLTGRDEDITK

Query:  IVYLVREAKSRIASLQMDSVVHTPRRYNNMAHILARRACDLNISMSWCNFFPSWLISLNNID
        +  L+ + K R+ S   +++VH  R  NN AH+LAR+      +  + + FP WL S    D
Subjt:  IVYLVREAKSRIASLQMDSVVHTPRRYNNMAHILARRACDLNISMSWCNFFPSWLISLNNID

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G52990.1 thioredoxin family protein1.1e-0726.85Show/hide
Query:  SIPSGMWKINCDAAWNEGQDRGGIGWILRRWDGSPVSAGLRSINCQRNIKWLEALAAVDGLNSVS-INSPRVILEMDSLQVVNLLTGRDEDITKIVYLVR
        S+PS   K N DA+ +EG    G+GW++R   G+ +  G+     +   +  E  A +  + + S     +VI E D+   VN L     D  ++ + + 
Subjt:  SIPSGMWKINCDAAWNEGQDRGGIGWILRRWDGSPVSAGLRSINCQRNIKWLEALAAVDGLNSVS-INSPRVILEMDSLQVVNLLTGRDEDITKIVYLVR

Query:  EAKSRIASLQMDSVVHTPRRYNNMAHILARRACDLNISMSWCNFFPSWL
          KS I S      + T R  N  A  L ++A   +   S  N  P +L
Subjt:  EAKSRIASLQMDSVVHTPRRYNNMAHILARRACDLNISMSWCNFFPSWL

AT3G09510.1 Ribonuclease H-like superfamily protein1.4e-0726.45Show/hide
Query:  LHWRSIPSGMWKINCDAAWNEGQDRGGIGWILRRWDGSPVSAGLRSINCQRNIKWLEALAAVDGLNSVSINS-PRVILEMDSLQVVNLLTGRDEDITKIV
        + WR+ P+   K N DA ++  +     GWI+R   G+P+S G   +    N    E  A +  L    I    +V +E D   ++NL+ G     +   
Subjt:  LHWRSIPSGMWKINCDAAWNEGQDRGGIGWILRRWDGSPVSAGLRSINCQRNIKWLEALAAVDGLNSVSINS-PRVILEMDSLQVVNLLTGRDEDITKIV

Query:  YL--VREAKSRIASLQMDSVVHTPRRYNNMAHILARRACDLNISMSWCNFFPSWL
        +L  +    ++ AS+Q   +    R+ N +AH+LA+  C  +   S     P WL
Subjt:  YL--VREAKSRIASLQMDSVVHTPRRYNNMAHILARRACDLNISMSWCNFFPSWL

AT3G23320.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein2.4e-0423.87Show/hide
Query:  PSGMW-KINCDAAWNEGQDRGGIGWILRRWDGSPVSAGLRSINCQRNIKWLEALAAVDGLNSV-SINSPRVILEMDSLQVVNLLTGRDEDITKIVYLVRE
        P   W K N D + + G+   G+ WI+R   G+ +  G      ++ IK  E  A +  +     +   RV  E D++  VN L    E   ++ Y +  
Subjt:  PSGMW-KINCDAAWNEGQDRGGIGWILRRWDGSPVSAGLRSINCQRNIKWLEALAAVDGLNSV-SINSPRVILEMDSLQVVNLLTGRDEDITKIVYLVRE

Query:  AKSRIASLQMDSVVHTPRRYNNMAHILARRACDLNISMSWCNFFPSWLISLNNID
         +    +          R  N    +LA++A   +I+ +  +F P +L+S  N D
Subjt:  AKSRIASLQMDSVVHTPRRYNNMAHILARRACDLNISMSWCNFFPSWLISLNNID

AT4G29090.1 Ribonuclease H-like superfamily protein9.5e-0930.07Show/hide
Query:  WRSIPSGMWKINCDAAWNEGQDRGGIGWILRRWDGSPVSAGLRSINCQRNI--KWLEALA-AVDGLNSVSINSPRVILEMDSLQVVNLLTGRDEDITKIV
        WR  P    K N DA WN   +R GIGW+LR   G     G R++   +++    LEA+  AV  L+    N   VI E DS  ++ +L   DE    + 
Subjt:  WRSIPSGMWKINCDAAWNEGQDRGGIGWILRRWDGSPVSAGLRSINCQRNI--KWLEALA-AVDGLNSVSINSPRVILEMDSLQVVNLLTGRDEDITKIV

Query:  YLVREAKSRIASLQMDSVVHTPRRYNNMAHILARRACD-LNISMSWCNFFPSW
          +++ +  ++       V  PR  N +A  +AR +   LN      +  PSW
Subjt:  YLVREAKSRIASLQMDSVVHTPRRYNNMAHILARRACD-LNISMSWCNFFPSW

AT5G65005.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein1.4e-0423.08Show/hide
Query:  PSGMWKINC--DAAWNEGQDRGGIGWILRRWDGSPVSAGLRSINCQRNIKWLEALAAVDGLN-SVSINSPRVILEMDSLQVVNLLTGRDEDITKIVYLVR
        P G  K+ C  DA+ +E     G+GWILR   G+ +  G+     +   +  E    +  +  S      +VI E D+  +  ++  +  +  ++ + + 
Subjt:  PSGMWKINC--DAAWNEGQDRGGIGWILRRWDGSPVSAGLRSINCQRNIKWLEALAAVDGLN-SVSINSPRVILEMDSLQVVNLLTGRDEDITKIVYLVR

Query:  EAKSRIASLQMDSVVHTPRRYNNMAHILARRACDLNISMSWCNFFPSWLISLNNID
          +S I S +        R  N  A  LA++A   N   S  +  P +L    N D
Subjt:  EAKSRIASLQMDSVVHTPRRYNNMAHILARRACDLNISMSWCNFFPSWLISLNNID


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGCTGCTTTTGCATTGGCGGTCAATTCCATCTGGAATGTGGAAAATCAATTGTGATGCCGCCTGGAACGAAGGTCAGGACCGAGGTGGTATTGGCTGGATTCTCCG
ACGGTGGGACGGTTCTCCGGTGTCAGCGGGTCTCAGGAGCATCAACTGCCAAAGGAACATCAAGTGGCTGGAAGCCCTTGCAGCGGTCGACGGTCTCAATTCGGTGTCAA
TCAACTCTCCTAGAGTGATTCTTGAGATGGATTCCCTTCAAGTCGTGAACCTGCTCACGGGAAGAGATGAGGATATCACTAAGATTGTCTATCTGGTGCGGGAAGCCAAG
AGTCGTATAGCTTCCCTCCAGATGGATTCGGTTGTTCACACTCCAAGAAGATACAATAATATGGCCCACATTTTGGCTAGAAGGGCTTGTGATCTTAATATATCTATGAG
TTGGTGTAATTTTTTCCCTTCTTGGTTGATTTCTTTAAACAATATTGACATTGGTGTGGAAAATCACATAAGTGGGGGTGCCTGTCCCATTATGGATAACCCTATGAGGG
GTCTGTTCACCACTGCTCCAATTCTTGTGCCTTCATCACCCATGCTCATCGATGCATCCGTATACAAATTAAACTCTGTCACATGA
mRNA sequenceShow/hide mRNA sequence
ATGGCGCTGCTTTTGCATTGGCGGTCAATTCCATCTGGAATGTGGAAAATCAATTGTGATGCCGCCTGGAACGAAGGTCAGGACCGAGGTGGTATTGGCTGGATTCTCCG
ACGGTGGGACGGTTCTCCGGTGTCAGCGGGTCTCAGGAGCATCAACTGCCAAAGGAACATCAAGTGGCTGGAAGCCCTTGCAGCGGTCGACGGTCTCAATTCGGTGTCAA
TCAACTCTCCTAGAGTGATTCTTGAGATGGATTCCCTTCAAGTCGTGAACCTGCTCACGGGAAGAGATGAGGATATCACTAAGATTGTCTATCTGGTGCGGGAAGCCAAG
AGTCGTATAGCTTCCCTCCAGATGGATTCGGTTGTTCACACTCCAAGAAGATACAATAATATGGCCCACATTTTGGCTAGAAGGGCTTGTGATCTTAATATATCTATGAG
TTGGTGTAATTTTTTCCCTTCTTGGTTGATTTCTTTAAACAATATTGACATTGGTGTGGAAAATCACATAAGTGGGGGTGCCTGTCCCATTATGGATAACCCTATGAGGG
GTCTGTTCACCACTGCTCCAATTCTTGTGCCTTCATCACCCATGCTCATCGATGCATCCGTATACAAATTAAACTCTGTCACATGA
Protein sequenceShow/hide protein sequence
MALLLHWRSIPSGMWKINCDAAWNEGQDRGGIGWILRRWDGSPVSAGLRSINCQRNIKWLEALAAVDGLNSVSINSPRVILEMDSLQVVNLLTGRDEDITKIVYLVREAK
SRIASLQMDSVVHTPRRYNNMAHILARRACDLNISMSWCNFFPSWLISLNNIDIGVENHISGGACPIMDNPMRGLFTTAPILVPSSPMLIDASVYKLNSVT