CuGenDBv2

Spg036203 (gene) of Sponge gourd (cylindrica) v1 genome

Gene ID: Spg036203
Organism: Luffa cylindrica (Sponge gourd (cylindrica) v1)
Description: Reverse transcriptase domain-containing protein
Genome location: scaffold5:41010562..41011353
RNA-Seq Expression: Spg036203
Synteny: Spg036203
Gene Ontology terms:
  GO:0003676 - nucleic acid binding (molecular function)
  GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domains:
  IPR002156 - Ribonuclease H domain
  IPR036397 - Ribonuclease H superfamily
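
The genome location string above can be turned into a span length with a few lines of code. The sketch below is illustrative only: the function names `parse_location` and `span_length` are hypothetical, and it assumes the coordinates are 1-based and inclusive, which this page does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split 'seqid:start..end' into its three parts."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start: int, end: int) -> int:
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


seqid, start, end = parse_location("scaffold5:41010562..41011353")
print(seqid, span_length(start, end))  # scaffold5 792
```

Under that assumption the gene spans 792 bp on scaffold5, which is longer than the 711-nt CDS listed in the Sequences section.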


Homology
GenBank top hits (columns: e value, % identity, alignment)
GAY61101.1 hypothetical protein CUMW_207140, partial [Citrus unshiu] | e value: 7.2e-07 | % identity: 25.47
Query:  EINVAALIMWSIWNFRNQSKLKGNRAVEQQLIRDIEARMEELNS---------------------RVNSNLASFKTENQSCHSRWAVRDSFSSLVGAGCK
        ++++ A ++W+ WN RNQ   KG R   Q ++   EA ME                         ++N++ A+  +E         +RD    +     K
Subjt:  EINVAALIMWSIWNFRNQSKLKGNRAVEQQLIRDIEARMEELNS---------------------RVNSNLASFKTENQSCHSRWAVRDSFSSLVGAGCK

Query:  KIPNEGNIKWLESKAIKEGLKEITKRRGVRDANSCFPMVVESDANEVIKVITGVCEDISEISSSIEEIKALSSVANVVSFNFCHRKANRLAHNLARAAAI
             G++ + E++A++ GL ++ K   V+D      +++ESD+ EV+ ++       SEI   + EI+ L    + VS  + HR  N +AH+LA+ A  
Subjt:  KIPNEGNIKWLESKAIKEGLKEITKRRGVRDANSCFPMVVESDANEVIKVITGVCEDISEISSSIEEIKALSSVANVVSFNFCHRKANRLAHNLARAAAI

Query:  HGDFVFFKGIPP
          + V +KG  P
Subjt:  HGDFVFFKGIPP

XP_015387358.1 uncharacterized protein LOC107177672 [Citrus sinensis] | e value: 2.7e-06 | % identity: 28
Query:  VRDSFSSLVGAGCKKIPNEGNIKWLESKAIKEGLKEITKRRGVRDANSCFPMVVESDANEVIKVITGVCEDISEISSSIEEIKALSSVANVVSFNFCHRK
        VR+S   ++ A  KK+  +G +  +E++A+  G++       V       PM++ESD+ EV++++      ++E+S ++EEIK    ++N  S  +  RK
Subjt:  VRDSFSSLVGAGCKKIPNEGNIKWLESKAIKEGLKEITKRRGVRDANSCFPMVVESDANEVIKVITGVCEDISEISSSIEEIKALSSVANVVSFNFCHRK

Query:  ANRLAHNLARAAA-IHGDFVFFKGIPPPVEDEAHLVREILLSLFGSPQLL
         N +AH +A+ A       ++ +    PV+    L R++ L   G  QLL
Subjt:  ANRLAHNLARAAA-IHGDFVFFKGIPPPVEDEAHLVREILLSLFGSPQLL

XP_022143535.1 uncharacterized protein LOC111013412 [Momordica charantia] | e value: 1.7e-08 | % identity: 29.95
Query:  LIMWSIWNFRNQSKLKG--------NRAVEQ---------------------QLIRDIE----ARMEELNS---RVNSNLASFKTENQSCHSRWAVRDSF
        +I W IW  RN+S  KG          A+++                      LIR IE    A+ +   S   ++N+N A+++ +  +    W +RD  
Subjt:  LIMWSIWNFRNQSKLKG--------NRAVEQ---------------------QLIRDIE----ARMEELNS---RVNSNLASFKTENQSCHSRWAVRDSF

Query:  SSLVGAGCKKIPNEGNIKWLESKAIKEGLKEITKRRGVRDANSCFPMVVESDANEVIKVITGVCEDISEISSSIEEIKALSSVANVVSFNFCHRKANRLA
          ++ A C+ I  E NI +LE  AI EGL+ I +         C P+ +ESD+ E I ++   C+D +EI   +EEI  +     +VS     R+AN++A
Subjt:  SSLVGAGCKKIPNEGNIKWLESKAIKEGLKEITKRRGVRDANSCFPMVVESDANEVIKVITGVCEDISEISSSIEEIKALSSVANVVSFNFCHRKANRLA

Query:  HNLARAA
        H LAR A
Subjt:  HNLARAA

XP_022156777.1 uncharacterized protein LOC111023608 [Momordica charantia] | e value: 6.5e-08 | % identity: 34.11
Query:  ASFKTENQSCHSRWAVRDSFSSLVGAGCKKIPNEGNIKWLESKAIKEGLKEITKR--RGVRDANSCFPMVVESDANEVIKVITGVCEDISEISSSIEEIK
        A+++ +  +    W +RD    ++ A C+ I  E NI +LE  AI EGL+ I +   R ++  + C P+ +ESD+ E I ++   C+D +EI   +EEI 
Subjt:  ASFKTENQSCHSRWAVRDSFSSLVGAGCKKIPNEGNIKWLESKAIKEGLKEITKR--RGVRDANSCFPMVVESDANEVIKVITGVCEDISEISSSIEEIK

Query:  ALSSVANVVSFNFCHRKANRLAHNLARAA
         +     +VS     R+AN++AH+LAR A
Subjt:  ALSSVANVVSFNFCHRKANRLAHNLARAA

XP_024046691.1 uncharacterized protein LOC112101027 [Citrus clementina] | e value: 6.1e-06 | % identity: 24.77
Query:  EINVAALIMWSIWNFRNQSKLKGNRAVEQQLIRDIEARMEELNS----------------------------RVNSNLASFKTENQSCHSRWAVRDSFSS
        E+ +  ++ WSIW+ RN    K  +   Q  +   EA ++  +                             +VN + A+ + E Q       +R+S   
Subjt:  EINVAALIMWSIWNFRNQSKLKGNRAVEQQLIRDIEARMEELNS----------------------------RVNSNLASFKTENQSCHSRWAVRDSFSS

Query:  LVGAGCKKIPNEGNIKWLESKAIKEGLKEITKRRGVRDANSCFPMVVESDANEVIKVITGVCEDISEISSSIEEIKALSSVANVVSFNFCHRKANRLAHN
        ++ A  K     G + + +++AI+ GL+       + +   CFP++VESD+ EV+ +I       +EI     E++      N V      R  N LAH+
Subjt:  LVGAGCKKIPNEGNIKWLESKAIKEGLKEITKRRGVRDANSCFPMVVESDANEVIKVITGVCEDISEISSSIEEIKALSSVANVVSFNFCHRKANRLAHN

Query:  LARAAAIHGDFVFF
        LAR A    DFVF+
Subjt:  LARAAAIHGDFVFF

TrEMBL top hits (columns: e value, % identity, alignment)
A0A2H5Q972 Uncharacterized protein (Fragment) | e value: 3.5e-07 | % identity: 25.47
Query:  EINVAALIMWSIWNFRNQSKLKGNRAVEQQLIRDIEARMEELNS---------------------RVNSNLASFKTENQSCHSRWAVRDSFSSLVGAGCK
        ++++ A ++W+ WN RNQ   KG R   Q ++   EA ME                         ++N++ A+  +E         +RD    +     K
Subjt:  EINVAALIMWSIWNFRNQSKLKGNRAVEQQLIRDIEARMEELNS---------------------RVNSNLASFKTENQSCHSRWAVRDSFSSLVGAGCK

Query:  KIPNEGNIKWLESKAIKEGLKEITKRRGVRDANSCFPMVVESDANEVIKVITGVCEDISEISSSIEEIKALSSVANVVSFNFCHRKANRLAHNLARAAAI
             G++ + E++A++ GL ++ K   V+D      +++ESD+ EV+ ++       SEI   + EI+ L    + VS  + HR  N +AH+LA+ A  
Subjt:  KIPNEGNIKWLESKAIKEGLKEITKRRGVRDANSCFPMVVESDANEVIKVITGVCEDISEISSSIEEIKALSSVANVVSFNFCHRKANRLAHNLARAAAI

Query:  HGDFVFFKGIPP
          + V +KG  P
Subjt:  HGDFVFFKGIPP

A0A2K3PJW0 Glycosyl hydrolase family protein 43 | e value: 8.6e-06 | % identity: 33.33
Query:  WAVRDSFSSLVGAGCKKIPNEGNIKWLESKAIKEGLKEITKRRGVRDANSCFPMVVESDANEVIKVITGVCEDISEISSSIEEIKALSSVANVVSFNFCH
        W VR S  S V AG  K  +       E+  I E ++E   R           +V ESD+  V+  I    + ISE+SS I  IK L          F  
Subjt:  WAVRDSFSSLVGAGCKKIPNEGNIKWLESKAIKEGLKEITKRRGVRDANSCFPMVVESDANEVIKVITGVCEDISEISSSIEEIKALSSVANVVSFNFCH

Query:  RKANRLAHNLARAAAIHGDFVFFKGIPPPVEDEAH
        R+AN  AH LAR A       +F G+P  +E E H
Subjt:  RKANRLAHNLARAAAIHGDFVFFKGIPPPVEDEAH

A0A6J1CP26 uncharacterized protein LOC111013412 | e value: 8.3e-09 | % identity: 29.95
Query:  LIMWSIWNFRNQSKLKG--------NRAVEQ---------------------QLIRDIE----ARMEELNS---RVNSNLASFKTENQSCHSRWAVRDSF
        +I W IW  RN+S  KG          A+++                      LIR IE    A+ +   S   ++N+N A+++ +  +    W +RD  
Subjt:  LIMWSIWNFRNQSKLKG--------NRAVEQ---------------------QLIRDIE----ARMEELNS---RVNSNLASFKTENQSCHSRWAVRDSF

Query:  SSLVGAGCKKIPNEGNIKWLESKAIKEGLKEITKRRGVRDANSCFPMVVESDANEVIKVITGVCEDISEISSSIEEIKALSSVANVVSFNFCHRKANRLA
          ++ A C+ I  E NI +LE  AI EGL+ I +         C P+ +ESD+ E I ++   C+D +EI   +EEI  +     +VS     R+AN++A
Subjt:  SSLVGAGCKKIPNEGNIKWLESKAIKEGLKEITKRRGVRDANSCFPMVVESDANEVIKVITGVCEDISEISSSIEEIKALSSVANVVSFNFCHRKANRLA

Query:  HNLARAA
        H LAR A
Subjt:  HNLARAA

A0A6J1DNV9 uncharacterized protein LOC111022403 | e value: 1.5e-05 | % identity: 24.39
Query:  EINVAALIMWSIWNFRNQSKLKGNRAVEQQLIRDI-----------EARMEELNSRVNSNL---------------ASFKTENQSCHSRWAVRDSFSSLV
        +++V  +  W IWN RN    +G  +    +I+ +           E  +  L+  +N+ L               AS+          W +R     +V
Subjt:  EINVAALIMWSIWNFRNQSKLKGNRAVEQQLIRDI-----------EARMEELNSRVNSNL---------------ASFKTENQSCHSRWAVRDSFSSLV

Query:  GAGCKKIPNEGNIKWLESKAIKEGLKEITKRRGVRDANSCFPMVVESDANEVIKVITGVCEDISEISSSIEEIKALSSVANVVSFNFCHRKANRLAHNLA
         AG + +    N+K LE+ AI EGL+ +T    +R      P+ +E+D+ EV  ++    ED+++    +EEI  L     +++F    R+ N  AH+LA
Subjt:  GAGCKKIPNEGNIKWLESKAIKEGLKEITKRRGVRDANSCFPMVVESDANEVIKVITGVCEDISEISSSIEEIKALSSVANVVSFNFCHRKANRLAHNLA

Query:  RAAAI
        + A++
Subjt:  RAAAI

A0A6J1DSV1 uncharacterized protein LOC111023608 | e value: 3.1e-08 | % identity: 34.11
Query:  ASFKTENQSCHSRWAVRDSFSSLVGAGCKKIPNEGNIKWLESKAIKEGLKEITKR--RGVRDANSCFPMVVESDANEVIKVITGVCEDISEISSSIEEIK
        A+++ +  +    W +RD    ++ A C+ I  E NI +LE  AI EGL+ I +   R ++  + C P+ +ESD+ E I ++   C+D +EI   +EEI 
Subjt:  ASFKTENQSCHSRWAVRDSFSSLVGAGCKKIPNEGNIKWLESKAIKEGLKEITKR--RGVRDANSCFPMVVESDANEVIKVITGVCEDISEISSSIEEIK

Query:  ALSSVANVVSFNFCHRKANRLAHNLARAA
         +     +VS     R+AN++AH+LAR A
Subjt:  ALSSVANVVSFNFCHRKANRLAHNLARAA

SwissProt top hits (columns: e value, % identity, alignment)
No hits found

Arabidopsis top hits (columns: e value, % identity, alignment)
AT3G09510.1 Ribonuclease H-like superfamily protein | e value: 4.2e-05 | % identity: 24.82
Query:  ASFKTENQSCHSRWAVRDSFSSLVGAGCKKIPNEGNIKWLESKAIKEGLKEITKRRGVRDANSCFPMVVESDANEVIKVITGVCEDISEISSSIEEIKAL
        A F  +       W +R+ + + +  G  K+ +  N    E+KA+   L++ T  RG         + +E D   +I +I G+    S +++ +E+I   
Subjt:  ASFKTENQSCHSRWAVRDSFSSLVGAGCKKIPNEGNIKWLESKAIKEGLKEITKRRGVRDANSCFPMVVESDANEVIKVITGVCEDISEISSSIEEIKAL

Query:  SSVANVVSFNFCHRKANRLAHNLARAAAIHGDFVFFKGIPP
        ++    + F F  RK N+LAH LA+    +  F    G  P
Subjt:  SSVANVVSFNFCHRKANRLAHNLARAAAIHGDFVFFKGIPP
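
In the alignment blocks above, the unlabelled middle line follows the usual BLAST convention: a letter marks an identical residue and `+` marks a conservative substitution. A minimal sketch of counting these per segment is shown below. The helper name `midline_counts` is hypothetical, and the per-segment counts are not expected to reproduce the % identity values listed for each hit, which are presumably computed over the full alignment.

```python
def midline_counts(query: str, midline: str) -> dict[str, int]:
    """Count identities and positives in one alignment segment.

    `query` is the text after 'Query:' and `midline` is the line beneath it.
    The midline is padded with spaces in case trailing blanks were trimmed.
    """
    midline = midline.ljust(len(query))[: len(query)]
    identities = sum(ch.isalpha() for ch in midline)
    positives = identities + midline.count("+")
    return {
        "columns": len(query),
        "identities": identities,
        "positives": positives,
    }


# Last segment of the XP_022143535.1 alignment shown above.
print(midline_counts("HNLARAA", "H LAR A"))
# {'columns': 7, 'identities': 5, 'positives': 5}
```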


Sequences
CDS sequence
ATGGAATGGAAGCCAATTGACTACCGGGAGAAATTGCTTAATATCTTAGATGCTTTGGAAATCAATGTAGCAGCTCTAATTATGTGGTCCATTTGGAATTTCAGGAATCA
ATCAAAACTCAAAGGAAACAGAGCGGTAGAGCAGCAGTTGATTAGAGATATAGAAGCGAGAATGGAAGAGCTAAATTCCAGAGTCAATTCGAACCTGGCTTCATTCAAGA
CGGAGAACCAATCGTGTCATAGTCGTTGGGCAGTCCGTGATTCTTTCAGTTCTCTAGTCGGGGCTGGTTGCAAGAAAATTCCCAATGAGGGGAACATTAAATGGCTGGAA
TCAAAAGCCATTAAGGAGGGGCTCAAAGAGATTACCAAAAGAAGGGGCGTGCGAGATGCAAATTCGTGTTTTCCTATGGTGGTCGAATCTGACGCCAATGAAGTAATCAA
AGTGATCACAGGCGTTTGCGAGGATATATCTGAGATTTCTTCATCGATCGAAGAAATTAAAGCCCTCAGTTCTGTTGCGAATGTGGTTTCGTTCAATTTCTGTCATCGAA
AGGCTAATCGTCTGGCGCACAATCTTGCGCGTGCGGCTGCGATTCATGGTGATTTTGTATTTTTTAAGGGTATCCCTCCGCCTGTGGAGGATGAAGCCCATCTTGTAAGG
GAGATTTTGTTATCCCTTTTTGGTTCTCCTCAGTTATTAGGGAGGAGTTAG
mRNA sequence
ATGGAATGGAAGCCAATTGACTACCGGGAGAAATTGCTTAATATCTTAGATGCTTTGGAAATCAATGTAGCAGCTCTAATTATGTGGTCCATTTGGAATTTCAGGAATCA
ATCAAAACTCAAAGGAAACAGAGCGGTAGAGCAGCAGTTGATTAGAGATATAGAAGCGAGAATGGAAGAGCTAAATTCCAGAGTCAATTCGAACCTGGCTTCATTCAAGA
CGGAGAACCAATCGTGTCATAGTCGTTGGGCAGTCCGTGATTCTTTCAGTTCTCTAGTCGGGGCTGGTTGCAAGAAAATTCCCAATGAGGGGAACATTAAATGGCTGGAA
TCAAAAGCCATTAAGGAGGGGCTCAAAGAGATTACCAAAAGAAGGGGCGTGCGAGATGCAAATTCGTGTTTTCCTATGGTGGTCGAATCTGACGCCAATGAAGTAATCAA
AGTGATCACAGGCGTTTGCGAGGATATATCTGAGATTTCTTCATCGATCGAAGAAATTAAAGCCCTCAGTTCTGTTGCGAATGTGGTTTCGTTCAATTTCTGTCATCGAA
AGGCTAATCGTCTGGCGCACAATCTTGCGCGTGCGGCTGCGATTCATGGTGATTTTGTATTTTTTAAGGGTATCCCTCCGCCTGTGGAGGATGAAGCCCATCTTGTAAGG
GAGATTTTGTTATCCCTTTTTGGTTCTCCTCAGTTATTAGGGAGGAGTTAG
Protein sequence
MEWKPIDYREKLLNILDALEINVAALIMWSIWNFRNQSKLKGNRAVEQQLIRDIEARMEELNSRVNSNLASFKTENQSCHSRWAVRDSFSSLVGAGCKKIPNEGNIKWLE
SKAIKEGLKEITKRRGVRDANSCFPMVVESDANEVIKVITGVCEDISEISSSIEEIKALSSVANVVSFNFCHRKANRLAHNLARAAAIHGDFVFFKGIPPPVEDEAHLVR
EILLSLFGSPQLLGRS
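
The protein sequence above is consistent with a standard-genetic-code translation of the CDS. The following is a minimal sketch of that check, assuming the standard nuclear code (NCBI translation table 1); the function name `translate` is hypothetical, and only the first and last codons of the CDS are used here to keep the example short.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First seven codons and last four codons of the CDS listed above.
print(translate("ATGGAATGGAAGCCAATTGAC"))  # MEWKPID
print(translate("GGGAGGAGTTAG"))           # GRS
```

Passing the full 711-nt CDS (with line breaks removed) to the same function should give the 236-residue protein listed above.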