; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr018997 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr018997
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionTransposon TX1 uncharacterized 149 kDa protein
Genome locationtig00153231:996158..996538
RNA-Seq ExpressionSgr018997
SyntenySgr018997
Gene Ontology termsGO:0006281 - DNA repair (biological process)
GO:0004518 - nuclease activity (molecular function)
InterPro domainsIPR004808 - AP endonuclease 1
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0058980.1 uncharacterized protein E6C27_scaffold98G001710 [Cucumis melo var. makuwa]1.0e-1542.37Show/hide
Query:  ISWNS--LRSWQKRALIKGLIFKHNPNFVILQEPKARSVDNLFIKSLWSSRQIGLATLDAQGSSGSILILWNDPSFVILNITKGAFSLSLLVSLADGFSF
        ++WN+  L S  KRALIK  I  ++P+FVIL E   +  +   IKS W S  I     +A GSSG ILILW+  S  +L+  +  FSLS    L +  S+
Subjt:  ISWNS--LRSWQKRALIKGLIFKHNPNFVILQEPKARSVDNLFIKSLWSSRQIGLATLDAQGSSGSILILWNDPSFVILNITKGAFSLSLLVSLADGFSF

Query:  WFIGIYGPSKATHRPQFW
        W  G+YGP K   R  FW
Subjt:  WFIGIYGPSKATHRPQFW

TYK11012.1 uncharacterized protein E5676_scaffold874G00540 [Cucumis melo var. makuwa]1.0e-1542.37Show/hide
Query:  ISWNS--LRSWQKRALIKGLIFKHNPNFVILQEPKARSVDNLFIKSLWSSRQIGLATLDAQGSSGSILILWNDPSFVILNITKGAFSLSLLVSLADGFSF
        ++WN+  L S  KRALIK  I  ++P+FVIL E   +  +   IKS W S  I     +A GSSG ILILW+  S  +L+  +  FSLS    L +  S+
Subjt:  ISWNS--LRSWQKRALIKGLIFKHNPNFVILQEPKARSVDNLFIKSLWSSRQIGLATLDAQGSSGSILILWNDPSFVILNITKGAFSLSLLVSLADGFSF

Query:  WFIGIYGPSKATHRPQFW
        W  G+YGP K   R  FW
Subjt:  WFIGIYGPSKATHRPQFW

XP_022145142.1 uncharacterized protein LOC111014657 [Momordica charantia]3.3e-1945.76Show/hide
Query:  ISWN--SLRSWQKRALIKGLIFKHNPNFVILQEPKARSVDNLFIKSLWSSRQIGLATLDAQGSSGSILILWNDPSFVILNITKGAFSLSLLVSLADGFSF
        ++WN   L S  KRA IK  I    P+ VIL E K+ S++N FIKSLWSS  I  A+LDA G+SG I++LW+  S   + +  G FS+S+   LAD F++
Subjt:  ISWN--SLRSWQKRALIKGLIFKHNPNFVILQEPKARSVDNLFIKSLWSSRQIGLATLDAQGSSGSILILWNDPSFVILNITKGAFSLSLLVSLADGFSF

Query:  WFIGIYGPSKATHRPQFW
        W  G+Y P K   R  FW
Subjt:  WFIGIYGPSKATHRPQFW

XP_022158956.1 uncharacterized protein LOC111025405 [Momordica charantia]1.5e-2249.15Show/hide
Query:  ISWN--SLRSWQKRALIKGLIFKHNPNFVILQEPKARSVDNLFIKSLWSSRQIGLATLDAQGSSGSILILWNDPSFVILNITKGAFSLSLLVSLADGFSF
        ++WN   L SW+K ALIK  I + NPN VILQE K   +D L +KSLWS+  I  + LDA G +  ILILWNDP      + +G FSL++   L+DGF F
Subjt:  ISWN--SLRSWQKRALIKGLIFKHNPNFVILQEPKARSVDNLFIKSLWSSRQIGLATLDAQGSSGSILILWNDPSFVILNITKGAFSLSLLVSLADGFSF

Query:  WFIGIYGPSKATHRPQFW
        W  GIYGPS       FW
Subjt:  WFIGIYGPSKATHRPQFW

XP_031739979.1 uncharacterized protein LOC116403332 [Cucumis sativus]7.4e-1946.67Show/hide
Query:  RALIKGLIFKHNPNFVILQEPKARSVDNLFIKSLWSSRQIGLATLDAQGSSGSILILWNDPSFVILNITKGAFSLSLLVSLADGFSFWFIGIYGPSKATH
        +A++K L+ K NP+ VILQ+ K  +V+   +KS+WSS  +G ATL+A GSSG ILILW + S  +++  +G FS+S+      GFS W  G+YGPS    
Subjt:  RALIKGLIFKHNPNFVILQEPKARSVDNLFIKSLWSSRQIGLATLDAQGSSGSILILWNDPSFVILNITKGAFSLSLLVSLADGFSFWFIGIYGPSKATH

Query:  RPQFW
        R QFW
Subjt:  RPQFW

TrEMBL top hitse value%identityAlignment
A0A5A7UV84 Reverse transcriptase domain-containing protein4.9e-1642.37Show/hide
Query:  ISWNS--LRSWQKRALIKGLIFKHNPNFVILQEPKARSVDNLFIKSLWSSRQIGLATLDAQGSSGSILILWNDPSFVILNITKGAFSLSLLVSLADGFSF
        ++WN+  L S  KRALIK  I  ++P+FVIL E   +  +   IKS W S  I     +A GSSG ILILW+  S  +L+  +  FSLS    L +  S+
Subjt:  ISWNS--LRSWQKRALIKGLIFKHNPNFVILQEPKARSVDNLFIKSLWSSRQIGLATLDAQGSSGSILILWNDPSFVILNITKGAFSLSLLVSLADGFSF

Query:  WFIGIYGPSKATHRPQFW
        W  G+YGP K   R  FW
Subjt:  WFIGIYGPSKATHRPQFW

A0A5D3CI86 Reverse transcriptase domain-containing protein4.9e-1642.37Show/hide
Query:  ISWNS--LRSWQKRALIKGLIFKHNPNFVILQEPKARSVDNLFIKSLWSSRQIGLATLDAQGSSGSILILWNDPSFVILNITKGAFSLSLLVSLADGFSF
        ++WN+  L S  KRALIK  I  ++P+FVIL E   +  +   IKS W S  I     +A GSSG ILILW+  S  +L+  +  FSLS    L +  S+
Subjt:  ISWNS--LRSWQKRALIKGLIFKHNPNFVILQEPKARSVDNLFIKSLWSSRQIGLATLDAQGSSGSILILWNDPSFVILNITKGAFSLSLLVSLADGFSF

Query:  WFIGIYGPSKATHRPQFW
        W  G+YGP K   R  FW
Subjt:  WFIGIYGPSKATHRPQFW

A0A6J1CVN2 uncharacterized protein LOC1110146571.6e-1945.76Show/hide
Query:  ISWN--SLRSWQKRALIKGLIFKHNPNFVILQEPKARSVDNLFIKSLWSSRQIGLATLDAQGSSGSILILWNDPSFVILNITKGAFSLSLLVSLADGFSF
        ++WN   L S  KRA IK  I    P+ VIL E K+ S++N FIKSLWSS  I  A+LDA G+SG I++LW+  S   + +  G FS+S+   LAD F++
Subjt:  ISWN--SLRSWQKRALIKGLIFKHNPNFVILQEPKARSVDNLFIKSLWSSRQIGLATLDAQGSSGSILILWNDPSFVILNITKGAFSLSLLVSLADGFSF

Query:  WFIGIYGPSKATHRPQFW
        W  G+Y P K   R  FW
Subjt:  WFIGIYGPSKATHRPQFW

A0A6J1E2G6 uncharacterized protein LOC1110254057.0e-2349.15Show/hide
Query:  ISWN--SLRSWQKRALIKGLIFKHNPNFVILQEPKARSVDNLFIKSLWSSRQIGLATLDAQGSSGSILILWNDPSFVILNITKGAFSLSLLVSLADGFSF
        ++WN   L SW+K ALIK  I + NPN VILQE K   +D L +KSLWS+  I  + LDA G +  ILILWNDP      + +G FSL++   L+DGF F
Subjt:  ISWN--SLRSWQKRALIKGLIFKHNPNFVILQEPKARSVDNLFIKSLWSSRQIGLATLDAQGSSGSILILWNDPSFVILNITKGAFSLSLLVSLADGFSF

Query:  WFIGIYGPSKATHRPQFW
        W  GIYGPS       FW
Subjt:  WFIGIYGPSKATHRPQFW

A0A803QQM3 Uncharacterized protein4.9e-1642.48Show/hide
Query:  NSLRSWQKRALIKGLIFKHNPNFVILQEPKARSVDNLFIKSLWSSRQIGLATLDAQGSSGSILILWNDPSFVILNITKGAFSLSLLVSLADGFSFWFIGI
        N L    KRA IK  I K NP+ VILQE K  +VD  FI S+W SR      L A G SG  L++W+  +  +L+   G FS+S+L++      +WF G+
Subjt:  NSLRSWQKRALIKGLIFKHNPNFVILQEPKARSVDNLFIKSLWSSRQIGLATLDAQGSSGSILILWNDPSFVILNITKGAFSLSLLVSLADGFSFWFIGI

Query:  YGPSKATHRPQFW
        YGP     RP+FW
Subjt:  YGPSKATHRPQFW

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGCCACAGATTTCCATATGATCATCATATCTCTTGGAATAGTCTCCGATCATGGCAGAAGCGAGCTCTCATTAAAGGTCTCATTTTTAAACACAACCCCAACTTTGT
TATTCTTCAGGAGCCCAAGGCTAGATCGGTGGACAATCTCTTTATTAAGTCCTTATGGAGTTCTAGACAGATTGGATTGGCCACACTTGATGCCCAGGGTTCTTCTGGCA
GCATCCTCATCCTATGGAACGATCCATCCTTTGTGATATTGAACATTACTAAAGGTGCATTCTCTCTATCGCTACTTGTCTCATTGGCTGATGGGTTTAGTTTTTGGTTC
ATAGGCATTTACGGACCATCTAAAGCCACTCATAGGCCTCAATTTTGGTAA
mRNA sequenceShow/hide mRNA sequence
ATGGGCCACAGATTTCCATATGATCATCATATCTCTTGGAATAGTCTCCGATCATGGCAGAAGCGAGCTCTCATTAAAGGTCTCATTTTTAAACACAACCCCAACTTTGT
TATTCTTCAGGAGCCCAAGGCTAGATCGGTGGACAATCTCTTTATTAAGTCCTTATGGAGTTCTAGACAGATTGGATTGGCCACACTTGATGCCCAGGGTTCTTCTGGCA
GCATCCTCATCCTATGGAACGATCCATCCTTTGTGATATTGAACATTACTAAAGGTGCATTCTCTCTATCGCTACTTGTCTCATTGGCTGATGGGTTTAGTTTTTGGTTC
ATAGGCATTTACGGACCATCTAAAGCCACTCATAGGCCTCAATTTTGGTAA
Protein sequenceShow/hide protein sequence
MGHRFPYDHHISWNSLRSWQKRALIKGLIFKHNPNFVILQEPKARSVDNLFIKSLWSSRQIGLATLDAQGSSGSILILWNDPSFVILNITKGAFSLSLLVSLADGFSFWF
IGIYGPSKATHRPQFW