; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr023258 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr023258
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionCSL zinc finger domain-containing protein
Genome locationtig00000892:1597620..1607223
RNA-Seq ExpressionSgr023258
SyntenySgr023258
Gene Ontology termsGO:0002098 - tRNA wobble uridine modification (biological process)
GO:0017183 - peptidyl-diphthamide biosynthetic process from peptidyl-histidine (biological process)
GO:0005634 - nucleus (cellular component)
GO:0005829 - cytosol (cellular component)
GO:0046872 - metal ion binding (molecular function)
InterPro domainsIPR044248 - Diphthamide biosynthesis protein 3/4-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6570461.1 hypothetical protein SDJN03_29376, partial [Cucurbita argyrosperma subsp. sororia]3.4e-6976.6Show/hide
Query:  MATTRVFVTVPMELLLLLTALILHSEAANTNNVYQPCADTTIQRSDGFTFGIAFSSRDSFFFNQSHQLSPCDRRLSLASLNSQLAVFRPRVDEISLLSIN
        MAT RVFVTVPMELLLLL AL+LHSEA NTNNVYQPCADT IQRSDGFTFGIAFSSRDSFFFNQSHQLSPCDRRLSLASLNSQLAVFRPRVDEISLLS+N
Subjt:  MATTRVFVTVPMELLLLLTALILHSEAANTNNVYQPCADTTIQRSDGFTFGIAFSSRDSFFFNQSHQLSPCDRRLSLASLNSQLAVFRPRVDEISLLSIN

Query:  TSDFFPDTFGGYMVAFAGRKYAARSQPAFVANSTFIVTSFTLV--------RYLSFRRAGCRTCTGREMDVLLAQASPDLVMSALTIR
        TSDFFPDTFGGYMVAFAGRKYAARSQPAFVAN TFIVTSFTLV        + L ++R GC +C+G+     +   + D  +   + R
Subjt:  TSDFFPDTFGGYMVAFAGRKYAARSQPAFVANSTFIVTSFTLV--------RYLSFRRAGCRTCTGREMDVLLAQASPDLVMSALTIR

KAG7010325.1 hypothetical protein SDJN02_27118 [Cucurbita argyrosperma subsp. argyrosperma]3.4e-6976.6Show/hide
Query:  MATTRVFVTVPMELLLLLTALILHSEAANTNNVYQPCADTTIQRSDGFTFGIAFSSRDSFFFNQSHQLSPCDRRLSLASLNSQLAVFRPRVDEISLLSIN
        MAT RVFVTVPMELLLLL AL+LHSEA NTNNVYQPCADT IQRSDGFTFGIAFSSRDSFFFNQSHQLSPCDRRLSLASLNSQLAVFRPRVDEISLLS+N
Subjt:  MATTRVFVTVPMELLLLLTALILHSEAANTNNVYQPCADTTIQRSDGFTFGIAFSSRDSFFFNQSHQLSPCDRRLSLASLNSQLAVFRPRVDEISLLSIN

Query:  TSDFFPDTFGGYMVAFAGRKYAARSQPAFVANSTFIVTSFTLV--------RYLSFRRAGCRTCTGREMDVLLAQASPDLVMSALTIR
        TSDFFPDTFGGYMVAFAGRKYAARSQPAFVAN TFIVTSFTLV        + L ++R GC +C+G+     +   + D  +   + R
Subjt:  TSDFFPDTFGGYMVAFAGRKYAARSQPAFVANSTFIVTSFTLV--------RYLSFRRAGCRTCTGREMDVLLAQASPDLVMSALTIR

XP_022152979.1 uncharacterized protein LOC111020583 [Momordica charantia]4.9e-6875.53Show/hide
Query:  MATTRVFVTVPMELLLLLTALILHSEAANTNNVYQPCADTTIQRSDGFTFGIAFSSRDSFFFNQSHQLSPCDRRLSLASLNSQLAVFRPRVDEISLLSIN
        MAT RVFVT PMELLLLL AL+LHSEAANTNNVYQPCADT IQRSDGF+FGIAFSSRDSFFFNQSHQLSPCDRRLSL SLNSQLAVFRPRVDEISLL+IN
Subjt:  MATTRVFVTVPMELLLLLTALILHSEAANTNNVYQPCADTTIQRSDGFTFGIAFSSRDSFFFNQSHQLSPCDRRLSLASLNSQLAVFRPRVDEISLLSIN

Query:  TSDFFPDTFGGYMVAFAGRKYAARSQPAFVANSTFIVTSFTLV--------RYLSFRRAGCRTCTGREMDVLLAQASPDLVMSALTIR
        TSDFFPD+FGGYMVAFAGRKYAARSQPAFVANSTFIVTSFTLV        + L ++R GC +C+G+     +   + D  +   + R
Subjt:  TSDFFPDTFGGYMVAFAGRKYAARSQPAFVANSTFIVTSFTLV--------RYLSFRRAGCRTCTGREMDVLLAQASPDLVMSALTIR

XP_022943561.1 uncharacterized protein LOC111448296 [Cucurbita moschata]3.4e-6976.6Show/hide
Query:  MATTRVFVTVPMELLLLLTALILHSEAANTNNVYQPCADTTIQRSDGFTFGIAFSSRDSFFFNQSHQLSPCDRRLSLASLNSQLAVFRPRVDEISLLSIN
        MAT RVFVTVPMELLLLL AL+LHSEA NTNNVYQPCADT IQRSDGFTFGIAFSSRDSFFFNQSHQLSPCDRRLSLASLNSQLAVFRPRVDEISLLS+N
Subjt:  MATTRVFVTVPMELLLLLTALILHSEAANTNNVYQPCADTTIQRSDGFTFGIAFSSRDSFFFNQSHQLSPCDRRLSLASLNSQLAVFRPRVDEISLLSIN

Query:  TSDFFPDTFGGYMVAFAGRKYAARSQPAFVANSTFIVTSFTLV--------RYLSFRRAGCRTCTGREMDVLLAQASPDLVMSALTIR
        TSDFFPDTFGGYMVAFAGRKYAARSQPAFVAN TFIVTSFTLV        + L ++R GC +C+G+     +   + D  +   + R
Subjt:  TSDFFPDTFGGYMVAFAGRKYAARSQPAFVANSTFIVTSFTLV--------RYLSFRRAGCRTCTGREMDVLLAQASPDLVMSALTIR

XP_023511823.1 uncharacterized protein LOC111776727 [Cucurbita pepo subsp. pepo]9.8e-6984.43Show/hide
Query:  MATTRVFVTVPMELLLLLTALILHSEAANTNNVYQPCADTTIQRSDGFTFGIAFSSRDSFFFNQSHQLSPCDRRLSLASLNSQLAVFRPRVDEISLLSIN
        MAT RVFVTVPMELLLLL AL+LHSEA NTNNVYQPCADT IQRSDGFTFGIAFSSRDSFFFNQSHQLSPCDRRLSLASLNSQLAVFRPRVDEISLLS+N
Subjt:  MATTRVFVTVPMELLLLLTALILHSEAANTNNVYQPCADTTIQRSDGFTFGIAFSSRDSFFFNQSHQLSPCDRRLSLASLNSQLAVFRPRVDEISLLSIN

Query:  TSDFFPDTFGGYMVAFAGRKYAARSQPAFVANSTFIVTSFTLV--------RYLSFRRAGCRTCTGR
        TSDF+PDTFGGYMVAFAGRKYAARSQPAFVAN TFIVTSFTLV        + L ++R GC +C+G+
Subjt:  TSDFFPDTFGGYMVAFAGRKYAARSQPAFVANSTFIVTSFTLV--------RYLSFRRAGCRTCTGR

TrEMBL top hitse value%identityAlignment
A0A1S3BNB9 uncharacterized protein LOC1034914383.2e-6572.77Show/hide
Query:  MATTRVFVTVPMELLLLL---TALILHSEAANTNNVYQPCADTTIQRSDGFTFGIAFSSRDSFFFNQSHQLSPCDRRLSLASLNSQLAVFRPRVDEISLL
        MAT RVFVT+PM+LLLLL     L++HSEAANTNNVYQPCADT IQRSDGFTFGIAFSSRDSFF NQSHQLSPCDRRLSLASLNSQLAVFRPRVD+ISLL
Subjt:  MATTRVFVTVPMELLLLL---TALILHSEAANTNNVYQPCADTTIQRSDGFTFGIAFSSRDSFFFNQSHQLSPCDRRLSLASLNSQLAVFRPRVDEISLL

Query:  SINTSDFFPDTFGGYMVAFAGRKYAARSQPAFVANSTFIVTSFTLV--------RYLSFRRAGCRTCTGREMDVLLAQASPDLVMSALTIR
        SINTSDFFPD FGGYMVAFAGRKYAARSQPAFVAN+TFIVTSFTLV        + L ++R GC +C+G+     +   + D  +   + R
Subjt:  SINTSDFFPDTFGGYMVAFAGRKYAARSQPAFVANSTFIVTSFTLV--------RYLSFRRAGCRTCTGREMDVLLAQASPDLVMSALTIR

A0A5A7V4B1 Uncharacterized protein3.2e-6572.77Show/hide
Query:  MATTRVFVTVPMELLLLL---TALILHSEAANTNNVYQPCADTTIQRSDGFTFGIAFSSRDSFFFNQSHQLSPCDRRLSLASLNSQLAVFRPRVDEISLL
        MAT RVFVT+PM+LLLLL     L++HSEAANTNNVYQPCADT IQRSDGFTFGIAFSSRDSFF NQSHQLSPCDRRLSLASLNSQLAVFRPRVD+ISLL
Subjt:  MATTRVFVTVPMELLLLL---TALILHSEAANTNNVYQPCADTTIQRSDGFTFGIAFSSRDSFFFNQSHQLSPCDRRLSLASLNSQLAVFRPRVDEISLL

Query:  SINTSDFFPDTFGGYMVAFAGRKYAARSQPAFVANSTFIVTSFTLV--------RYLSFRRAGCRTCTGREMDVLLAQASPDLVMSALTIR
        SINTSDFFPD FGGYMVAFAGRKYAARSQPAFVAN+TFIVTSFTLV        + L ++R GC +C+G+     +   + D  +   + R
Subjt:  SINTSDFFPDTFGGYMVAFAGRKYAARSQPAFVANSTFIVTSFTLV--------RYLSFRRAGCRTCTGREMDVLLAQASPDLVMSALTIR

A0A6J1DHN7 uncharacterized protein LOC1110205832.4e-6875.53Show/hide
Query:  MATTRVFVTVPMELLLLLTALILHSEAANTNNVYQPCADTTIQRSDGFTFGIAFSSRDSFFFNQSHQLSPCDRRLSLASLNSQLAVFRPRVDEISLLSIN
        MAT RVFVT PMELLLLL AL+LHSEAANTNNVYQPCADT IQRSDGF+FGIAFSSRDSFFFNQSHQLSPCDRRLSL SLNSQLAVFRPRVDEISLL+IN
Subjt:  MATTRVFVTVPMELLLLLTALILHSEAANTNNVYQPCADTTIQRSDGFTFGIAFSSRDSFFFNQSHQLSPCDRRLSLASLNSQLAVFRPRVDEISLLSIN

Query:  TSDFFPDTFGGYMVAFAGRKYAARSQPAFVANSTFIVTSFTLV--------RYLSFRRAGCRTCTGREMDVLLAQASPDLVMSALTIR
        TSDFFPD+FGGYMVAFAGRKYAARSQPAFVANSTFIVTSFTLV        + L ++R GC +C+G+     +   + D  +   + R
Subjt:  TSDFFPDTFGGYMVAFAGRKYAARSQPAFVANSTFIVTSFTLV--------RYLSFRRAGCRTCTGREMDVLLAQASPDLVMSALTIR

A0A6J1FS19 uncharacterized protein LOC1114482961.6e-6976.6Show/hide
Query:  MATTRVFVTVPMELLLLLTALILHSEAANTNNVYQPCADTTIQRSDGFTFGIAFSSRDSFFFNQSHQLSPCDRRLSLASLNSQLAVFRPRVDEISLLSIN
        MAT RVFVTVPMELLLLL AL+LHSEA NTNNVYQPCADT IQRSDGFTFGIAFSSRDSFFFNQSHQLSPCDRRLSLASLNSQLAVFRPRVDEISLLS+N
Subjt:  MATTRVFVTVPMELLLLLTALILHSEAANTNNVYQPCADTTIQRSDGFTFGIAFSSRDSFFFNQSHQLSPCDRRLSLASLNSQLAVFRPRVDEISLLSIN

Query:  TSDFFPDTFGGYMVAFAGRKYAARSQPAFVANSTFIVTSFTLV--------RYLSFRRAGCRTCTGREMDVLLAQASPDLVMSALTIR
        TSDFFPDTFGGYMVAFAGRKYAARSQPAFVAN TFIVTSFTLV        + L ++R GC +C+G+     +   + D  +   + R
Subjt:  TSDFFPDTFGGYMVAFAGRKYAARSQPAFVANSTFIVTSFTLV--------RYLSFRRAGCRTCTGREMDVLLAQASPDLVMSALTIR

A0A6J1JBU3 uncharacterized protein LOC1114843491.6e-6976.6Show/hide
Query:  MATTRVFVTVPMELLLLLTALILHSEAANTNNVYQPCADTTIQRSDGFTFGIAFSSRDSFFFNQSHQLSPCDRRLSLASLNSQLAVFRPRVDEISLLSIN
        MAT RVFVTVPMELLLLL AL+LHSEA NTNNVYQPCADT IQRSDGFTFGIAFSSRDSFFFNQSHQLSPCDRRLSLASLNSQLAVFRPRVDEISLLS+N
Subjt:  MATTRVFVTVPMELLLLLTALILHSEAANTNNVYQPCADTTIQRSDGFTFGIAFSSRDSFFFNQSHQLSPCDRRLSLASLNSQLAVFRPRVDEISLLSIN

Query:  TSDFFPDTFGGYMVAFAGRKYAARSQPAFVANSTFIVTSFTLV--------RYLSFRRAGCRTCTGREMDVLLAQASPDLVMSALTIR
        TSDFFPDTFGGYMVAFAGRKYAARSQPAFVAN TFIVTSFTLV        + L ++R GC +C+G+     +   + D  +   + R
Subjt:  TSDFFPDTFGGYMVAFAGRKYAARSQPAFVANSTFIVTSFTLV--------RYLSFRRAGCRTCTGREMDVLLAQASPDLVMSALTIR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G15910.1 CSL zinc finger domain-containing protein3.2e-3360.33Show/hide
Query:  AANTNNVYQPCADTTIQRSDGFTFGIAFSSRDSFFFNQSHQLSPCDRRLSLASLNSQLAVFRPRVDEISLLSINTSDFFPDTFGGYMVAFAGRKYAARSQ
        AA+ N VY PC+DT I + DGFT GIA SS+++FF +Q  QLSPCD RL LA+  +QLA+FRP+VDEISLLSI+TS F P   GG+MV FAG KYAARS 
Subjt:  AANTNNVYQPCADTTIQRSDGFTFGIAFSSRDSFFFNQSHQLSPCDRRLSLASLNSQLAVFRPRVDEISLLSINTSDFFPDTFGGYMVAFAGRKYAARSQ

Query:  PAFVANSTFIVTSFTLVRYLS
        P  VA+ +  +T+FTLV  L+
Subjt:  PAFVANSTFIVTSFTLVRYLS

AT3G11800.1 unknown protein8.3e-3451.48Show/hide
Query:  LLLLLTALILHS---EAANTNNVYQPCADTTIQRSDGFTFGIAFSSRDSFFF---NQSHQLSPCDRRLSLASLNSQLAVFRPRVDEISLLSINT---SDF
        LL  L A +L S   EA + N VY PC+D+T+   DGFTFGIAF+++DSFF    ++S Q SPCD R    + NS++AVFRP+VDEI+LL+INT   S F
Subjt:  LLLLLTALILHS---EAANTNNVYQPCADTTIQRSDGFTFGIAFSSRDSFFF---NQSHQLSPCDRRLSLASLNSQLAVFRPRVDEISLLSINT---SDF

Query:  FPDTFGGYMVAFAGRKYAARSQPAFVANSTFIVTSFTLV--------RYLSFRRAGCRTCTGREMDVLL
         PD   GYMVAFAG KYAARS P  VA+S  IVTSFTLV          + +++ GC  C+G    V L
Subjt:  FPDTFGGYMVAFAGRKYAARSQPAFVANSTFIVTSFTLV--------RYLSFRRAGCRTCTGREMDVLL

AT3G44150.1 unknown protein2.6e-5164.46Show/hide
Query:  LLLLTALIL-------HSEAANTNNVYQPCADTTIQRSDGFTFGIAFSSRDSFFFNQSHQLSPCDRRLSLASLNSQLAVFRPRVDEISLLSINTSDFFPD
        LL+ +A+IL          + NTN +Y PC+DT IQRSDGFTFGIAFSSR SFF NQ+  LSPCDRRLSLA++NSQ +VFRP++DEISLLSINTS FFPD
Subjt:  LLLLTALIL-------HSEAANTNNVYQPCADTTIQRSDGFTFGIAFSSRDSFFFNQSHQLSPCDRRLSLASLNSQLAVFRPRVDEISLLSINTSDFFPD

Query:  TFGGYMVAFAGRKYAARSQPAFVANSTFIVTSFTLV--------RYLSFRRAGCRTCTGREMDVLL
         +GGYMVAFAGRKYAARS PAF+ANSTFIVTSFTLV        + L ++R GC +C G +  V L
Subjt:  TFGGYMVAFAGRKYAARSQPAFVANSTFIVTSFTLV--------RYLSFRRAGCRTCTGREMDVLL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGACGACGAGAGTGTTCGTAACGGTGCCGATGGAGCTGCTGCTCTTGCTGACGGCGTTGATACTTCACTCCGAAGCTGCAAATACGAACAATGTGTACCAACCTTG
CGCGGACACCACGATTCAGAGATCCGACGGTTTCACCTTCGGAATCGCATTCTCTTCCAGAGACTCATTCTTTTTTAACCAGTCTCATCAGCTGTCGCCGTGCGACCGTC
GTCTTTCTCTGGCGTCGCTCAACTCACAGCTTGCAGTTTTCAGACCTAGAGTAGACGAGATCTCCCTTCTATCCATCAATACTTCCGATTTTTTTCCGGATACTTTTGGT
GGTTACATGGTGGCATTTGCTGGAAGAAAATATGCTGCAAGGTCTCAGCCTGCCTTTGTTGCCAACAGCACGTTCATCGTGACGAGTTTCACCCTGGTGAGGTACTTGAG
TTTCAGAAGGGCAGGCTGCAGAACTTGTACTGGAAGAGAGATGGATGTGCTTCTTGCTCAGGCAAGTCCAGATCTAGTTATGTCTGCCTTAACAATCAGGATTGTGCCAT
CAAAACATCAAGTTGTAGAAACCGAGGGGGCTCTGTGGATTGTAGTTTGGGAATACAACTTACGTTTTCAGGCACAGATAAGCATCTTGCAGCGCTTAACTCCTGGGAAA
CTGCCGTCGTACTTAACTTCGAAACCATCGATCGGGCAAACCGCCGCTCCAATCTCCGCCGGTGCTATTGTGGCACTTAAACCTATATTCCCGGTACTGCCGGTGAACCC
CGTGCCCAGGTTCTATTATGCGCCGCCAGAACTTAGATCTGACGGCCGAACACACATTATCCGGCTTTCAAGATTGTCTCGTACAGCTCCTACTTCTTTTGAGAGAAATA
AGGCACGAAGAGAAAGTGATTTTTTTTTTTTTTTTCCTTTTTTGCCTAAAGGAAGGATCTCAGCTAGGGTTTTGGGTATTGTTTTTGTTGAGGACTACTTTGGAAAGTCC
GGTATTGGGCTTCTGTGGCGAATCGAAATGCCTGGCTGGAATTGTATTTCTTCGGAATTCTAA
mRNA sequenceShow/hide mRNA sequence
ATGGCGACGACGAGAGTGTTCGTAACGGTGCCGATGGAGCTGCTGCTCTTGCTGACGGCGTTGATACTTCACTCCGAAGCTGCAAATACGAACAATGTGTACCAACCTTG
CGCGGACACCACGATTCAGAGATCCGACGGTTTCACCTTCGGAATCGCATTCTCTTCCAGAGACTCATTCTTTTTTAACCAGTCTCATCAGCTGTCGCCGTGCGACCGTC
GTCTTTCTCTGGCGTCGCTCAACTCACAGCTTGCAGTTTTCAGACCTAGAGTAGACGAGATCTCCCTTCTATCCATCAATACTTCCGATTTTTTTCCGGATACTTTTGGT
GGTTACATGGTGGCATTTGCTGGAAGAAAATATGCTGCAAGGTCTCAGCCTGCCTTTGTTGCCAACAGCACGTTCATCGTGACGAGTTTCACCCTGGTGAGGTACTTGAG
TTTCAGAAGGGCAGGCTGCAGAACTTGTACTGGAAGAGAGATGGATGTGCTTCTTGCTCAGGCAAGTCCAGATCTAGTTATGTCTGCCTTAACAATCAGGATTGTGCCAT
CAAAACATCAAGTTGTAGAAACCGAGGGGGCTCTGTGGATTGTAGTTTGGGAATACAACTTACGTTTTCAGGCACAGATAAGCATCTTGCAGCGCTTAACTCCTGGGAAA
CTGCCGTCGTACTTAACTTCGAAACCATCGATCGGGCAAACCGCCGCTCCAATCTCCGCCGGTGCTATTGTGGCACTTAAACCTATATTCCCGGTACTGCCGGTGAACCC
CGTGCCCAGGTTCTATTATGCGCCGCCAGAACTTAGATCTGACGGCCGAACACACATTATCCGGCTTTCAAGATTGTCTCGTACAGCTCCTACTTCTTTTGAGAGAAATA
AGGCACGAAGAGAAAGTGATTTTTTTTTTTTTTTTCCTTTTTTGCCTAAAGGAAGGATCTCAGCTAGGGTTTTGGGTATTGTTTTTGTTGAGGACTACTTTGGAAAGTCC
GGTATTGGGCTTCTGTGGCGAATCGAAATGCCTGGCTGGAATTGTATTTCTTCGGAATTCTAA
Protein sequenceShow/hide protein sequence
MATTRVFVTVPMELLLLLTALILHSEAANTNNVYQPCADTTIQRSDGFTFGIAFSSRDSFFFNQSHQLSPCDRRLSLASLNSQLAVFRPRVDEISLLSINTSDFFPDTFG
GYMVAFAGRKYAARSQPAFVANSTFIVTSFTLVRYLSFRRAGCRTCTGREMDVLLAQASPDLVMSALTIRIVPSKHQVVETEGALWIVVWEYNLRFQAQISILQRLTPGK
LPSYLTSKPSIGQTAAPISAGAIVALKPIFPVLPVNPVPRFYYAPPELRSDGRTHIIRLSRLSRTAPTSFERNKARRESDFFFFFPFLPKGRISARVLGIVFVEDYFGKS
GIGLLWRIEMPGWNCISSEF