; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr028567 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr028567
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionThioredoxin domain-containing protein
Genome locationtig00153204:2338419..2339744
RNA-Seq ExpressionSgr028567
SyntenySgr028567
Gene Ontology termsGO:0006396 - RNA processing (biological process)
GO:0045454 - cell redox homeostasis (biological process)
GO:0009507 - chloroplast (cellular component)
InterPro domainsIPR013766 - Thioredoxin domain
IPR036249 - Thioredoxin-like superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6585642.1 hypothetical protein SDJN03_18375, partial [Cucurbita argyrosperma subsp. sororia]5.1e-9879.45Show/hide
Query:  GKLKNKMGSTPKQPFFCFKWPWDVDPKNPFDCSFEGPWLFKSLQNVGAFAFNFVNKVSKSSPPWINTFNQLQFDASTGGNKISQPRKLLTPEEQGEAEHR
        G L+N M STPKQP FCFKWPWD++PKNP DCSFEGPWLFKSLQNVG FA NF+N+VSKSSPPW+N F  L FD    GNKIS+ RK L+PEEQGEAE+R
Subjt:  GKLKNKMGSTPKQPFFCFKWPWDVDPKNPFDCSFEGPWLFKSLQNVGAFAFNFVNKVSKSSPPWINTFNQLQFDASTGGNKISQPRKLLTPEEQGEAEHR

Query:  AFASALARGKEATVIEFYSPKCRLCNSLLDFVTEIEVRNSDWLSIVMADAENDKWLPELLHYDIRYVPCFVLLDKHGKALAKTGIPSSRLHVIAGLSHLL
        AFA+ALA GKEATVIEFYSPKC LC+SLL FVT++E RNS WL+IVMADAENDKWLPELLHYDI YVPCFV+LDK GKALAKTGIPSSRLHVIAGLSHLL
Subjt:  AFASALARGKEATVIEFYSPKCRLCNSLLDFVTEIEVRNSDWLSIVMADAENDKWLPELLHYDIRYVPCFVLLDKHGKALAKTGIPSSRLHVIAGLSHLL

Query:  KLKRPSRLPGSDNK---PC
        KLKRP+ LPGSDN+   PC
Subjt:  KLKRPSRLPGSDNK---PC

XP_008444648.1 PREDICTED: uncharacterized protein LOC103487919 [Cucumis melo]3.3e-9785.71Show/hide
Query:  MGSTPKQPFFCFKWPWDVDPKNPFDCSFEGPWLFKSLQNVGAFAFNFVNKVSKSSPPWINTFNQLQFDASTGGNKISQPRKLLTPEEQGEAEHRAFASAL
        MGSTPKQP FCFKWPWDVD KN  DCSFEGPWLFKSLQNVG FAFNFVNKVSKSSPPW+ TF  LQF+  TGGNKISQ RK+LTPEEQGEAE+RA A+AL
Subjt:  MGSTPKQPFFCFKWPWDVDPKNPFDCSFEGPWLFKSLQNVGAFAFNFVNKVSKSSPPWINTFNQLQFDASTGGNKISQPRKLLTPEEQGEAEHRAFASAL

Query:  ARGKEATVIEFYSPKCRLCNSLLDFVTEIEVRNSDWLSIVMADAENDKWLPELLHYDIRYVPCFVLLDKHGKALAKTGIPSSRLHVIAGLSHLLKLKRPS
        A GKEATVIEFYSPKC LCNSLL+ V EIE RNSDWL+IVMADAENDKWLPELLHYDIRYVPCFVLLDKHGKALAKTGIPSSRLHVIAGLSHL+KLK P 
Subjt:  ARGKEATVIEFYSPKCRLCNSLLDFVTEIEVRNSDWLSIVMADAENDKWLPELLHYDIRYVPCFVLLDKHGKALAKTGIPSSRLHVIAGLSHLLKLKRPS

Query:  RLP
          P
Subjt:  RLP

XP_022144394.1 uncharacterized protein LOC111014082 [Momordica charantia]4.6e-10789.95Show/hide
Query:  MGSTPKQPFFCFKWPWDVDPKNPFDCSFEGPWLFKSLQNVGAFAFNFVNKVSKSSPPWINTFNQLQFDASTGGNKISQPRKLLTPEEQGEAEHRAFASAL
        MGSTPKQPFFCFKWPWD DPKNPFDCSFEGPWLFKSLQNVGAFAFNFVNKVSKSSPPWIN F  L F+AS GGNK S PRK LTPEEQGEAEHRAFASAL
Subjt:  MGSTPKQPFFCFKWPWDVDPKNPFDCSFEGPWLFKSLQNVGAFAFNFVNKVSKSSPPWINTFNQLQFDASTGGNKISQPRKLLTPEEQGEAEHRAFASAL

Query:  ARGKEATVIEFYSPKCRLCNSLLDFVTEIEVRNSDWLSIVMADAENDKWLPELLHYDIRYVPCFVLLDKHGKALAKTGIPSSRLHVIAGLSHLLKLKRPS
        A GKEATVIEFYSPKCRLCNSLLD V E+E RNSDWLSIVMADAEN+KWLPELLHYDIRYVPCFVLLDKHGKALAKTGIPSSRLHVIAGLSHLL LKRP 
Subjt:  ARGKEATVIEFYSPKCRLCNSLLDFVTEIEVRNSDWLSIVMADAENDKWLPELLHYDIRYVPCFVLLDKHGKALAKTGIPSSRLHVIAGLSHLLKLKRPS

Query:  RLPGSDNKP
         L GS NKP
Subjt:  RLPGSDNKP

XP_023002017.1 uncharacterized protein LOC111496027 [Cucurbita maxima]3.6e-9680.28Show/hide
Query:  MGSTPKQPFFCFKWPWDVDPKNPFDCSFEGPWLFKSLQNVGAFAFNFVNKVSKSSPPWINTFNQLQFDASTGGNKISQPRKLLTPEEQGEAEHRAFASAL
        M STPKQP FCFKWPWD++PKNP DCSFEGPWLFKSLQNVG FA NF+N+VSKSSPPW+N F  L FD    GNKIS+ RK L+PEEQGEAE+RAFA+AL
Subjt:  MGSTPKQPFFCFKWPWDVDPKNPFDCSFEGPWLFKSLQNVGAFAFNFVNKVSKSSPPWINTFNQLQFDASTGGNKISQPRKLLTPEEQGEAEHRAFASAL

Query:  ARGKEATVIEFYSPKCRLCNSLLDFVTEIEVRNSDWLSIVMADAENDKWLPELLHYDIRYVPCFVLLDKHGKALAKTGIPSSRLHVIAGLSHLLKLKRPS
        A GKEATVIEFYSPKC LC+SLL FVT++E RNS WL+IVMADAENDKWLPE+LHYDI YVPCFV+LDK GKALAKTGIPSSRLHVIAGLSHLLKLKRP+
Subjt:  ARGKEATVIEFYSPKCRLCNSLLDFVTEIEVRNSDWLSIVMADAENDKWLPELLHYDIRYVPCFVLLDKHGKALAKTGIPSSRLHVIAGLSHLLKLKRPS

Query:  RLPGSDNK---PC
         LPGSDNK   PC
Subjt:  RLPGSDNK---PC

XP_038885736.1 uncharacterized protein LOC120076025 [Benincasa hispida]1.4e-10087.19Show/hide
Query:  MGSTPKQPFFCFKWPWDVDPKNPFDCSFEGPWLFKSLQNVGAFAFNFVNKVSKSSPPWINTFNQLQFDASTGGNKISQPRKLLTPEEQGEAEHRAFASAL
        MGSTPKQPFFC KWPWDVDPKNPFDCSFEGPWLFKSLQNVG FAFNFVNKVSKSSPPWINTF  LQ DASTGGN ISQ RK+LTPEEQGEAE+RAFA+AL
Subjt:  MGSTPKQPFFCFKWPWDVDPKNPFDCSFEGPWLFKSLQNVGAFAFNFVNKVSKSSPPWINTFNQLQFDASTGGNKISQPRKLLTPEEQGEAEHRAFASAL

Query:  ARGKEATVIEFYSPKCRLCNSLLDFVTEIEVRNSDWLSIVMADAENDKWLPELLHYDIRYVPCFVLLDKHGKALAKTGIPSSRLHVIAGLSHLLKLKRPS
        A GKEATVIEFYSPKC LCNSLL+ V E+E RNSDWL+IVMADAEN KWLPE+LHYDIRYVPCFVLLDKHGKALAKTGIPSSRL VIAGLSHL+KLK P 
Subjt:  ARGKEATVIEFYSPKCRLCNSLLDFVTEIEVRNSDWLSIVMADAENDKWLPELLHYDIRYVPCFVLLDKHGKALAKTGIPSSRLHVIAGLSHLLKLKRPS

Query:  RLP
          P
Subjt:  RLP

TrEMBL top hitse value%identityAlignment
A0A0A0LRE2 Thioredoxin domain-containing protein6.7e-9682.76Show/hide
Query:  MGSTPKQPFFCFKWPWDVDPKNPFDCSFEGPWLFKSLQNVGAFAFNFVNKVSKSSPPWINTFNQLQFDASTGGNKISQPRKLLTPEEQGEAEHRAFASAL
        MGSTPKQPFFCFKWPWDVDPKN  DCSFE PWLFKSLQNVG FAF+FVNK SKSSPPW+ TF  LQF+  TGGNKISQ RK+LTPEEQGEAE+RA A+AL
Subjt:  MGSTPKQPFFCFKWPWDVDPKNPFDCSFEGPWLFKSLQNVGAFAFNFVNKVSKSSPPWINTFNQLQFDASTGGNKISQPRKLLTPEEQGEAEHRAFASAL

Query:  ARGKEATVIEFYSPKCRLCNSLLDFVTEIEVRNSDWLSIVMADAENDKWLPELLHYDIRYVPCFVLLDKHGKALAKTGIPSSRLHVIAGLSHLLKLKRPS
        A GKEAT+IEFYSPKC LCNSLL+ VTE+E RNSDWL+IVMADAENDKWLPELLHYDI YVPCFVLLDKHGKALAKT +PSSRLHVIAGLSHL+K+K P 
Subjt:  ARGKEATVIEFYSPKCRLCNSLLDFVTEIEVRNSDWLSIVMADAENDKWLPELLHYDIRYVPCFVLLDKHGKALAKTGIPSSRLHVIAGLSHLLKLKRPS

Query:  RLP
          P
Subjt:  RLP

A0A1S3BAS7 uncharacterized protein LOC1034879191.6e-9785.71Show/hide
Query:  MGSTPKQPFFCFKWPWDVDPKNPFDCSFEGPWLFKSLQNVGAFAFNFVNKVSKSSPPWINTFNQLQFDASTGGNKISQPRKLLTPEEQGEAEHRAFASAL
        MGSTPKQP FCFKWPWDVD KN  DCSFEGPWLFKSLQNVG FAFNFVNKVSKSSPPW+ TF  LQF+  TGGNKISQ RK+LTPEEQGEAE+RA A+AL
Subjt:  MGSTPKQPFFCFKWPWDVDPKNPFDCSFEGPWLFKSLQNVGAFAFNFVNKVSKSSPPWINTFNQLQFDASTGGNKISQPRKLLTPEEQGEAEHRAFASAL

Query:  ARGKEATVIEFYSPKCRLCNSLLDFVTEIEVRNSDWLSIVMADAENDKWLPELLHYDIRYVPCFVLLDKHGKALAKTGIPSSRLHVIAGLSHLLKLKRPS
        A GKEATVIEFYSPKC LCNSLL+ V EIE RNSDWL+IVMADAENDKWLPELLHYDIRYVPCFVLLDKHGKALAKTGIPSSRLHVIAGLSHL+KLK P 
Subjt:  ARGKEATVIEFYSPKCRLCNSLLDFVTEIEVRNSDWLSIVMADAENDKWLPELLHYDIRYVPCFVLLDKHGKALAKTGIPSSRLHVIAGLSHLLKLKRPS

Query:  RLP
          P
Subjt:  RLP

A0A5A7TE38 Thioredoxin-like protein3.9e-9684.73Show/hide
Query:  MGSTPKQPFFCFKWPWDVDPKNPFDCSFEGPWLFKSLQNVGAFAFNFVNKVSKSSPPWINTFNQLQFDASTGGNKISQPRKLLTPEEQGEAEHRAFASAL
        MGSTPKQP FCFKWPWDVD +N  DCSFEGPWLFKSLQNVG FAFNFVNKVSKSSP W+ TF  LQF+  TGGNKISQ RK+LTPEEQGEAE+RA A+AL
Subjt:  MGSTPKQPFFCFKWPWDVDPKNPFDCSFEGPWLFKSLQNVGAFAFNFVNKVSKSSPPWINTFNQLQFDASTGGNKISQPRKLLTPEEQGEAEHRAFASAL

Query:  ARGKEATVIEFYSPKCRLCNSLLDFVTEIEVRNSDWLSIVMADAENDKWLPELLHYDIRYVPCFVLLDKHGKALAKTGIPSSRLHVIAGLSHLLKLKRPS
        A GKEATVIEFYSPKC LCNSLL+ V EIE RNSDWL+IVMADAENDKWLPELLHYDIRYVPCFVLLDKHGKALAKTGIPSSRLHVIAGLSHL+KLK P 
Subjt:  ARGKEATVIEFYSPKCRLCNSLLDFVTEIEVRNSDWLSIVMADAENDKWLPELLHYDIRYVPCFVLLDKHGKALAKTGIPSSRLHVIAGLSHLLKLKRPS

Query:  RLP
          P
Subjt:  RLP

A0A6J1CS67 uncharacterized protein LOC1110140822.2e-10789.95Show/hide
Query:  MGSTPKQPFFCFKWPWDVDPKNPFDCSFEGPWLFKSLQNVGAFAFNFVNKVSKSSPPWINTFNQLQFDASTGGNKISQPRKLLTPEEQGEAEHRAFASAL
        MGSTPKQPFFCFKWPWD DPKNPFDCSFEGPWLFKSLQNVGAFAFNFVNKVSKSSPPWIN F  L F+AS GGNK S PRK LTPEEQGEAEHRAFASAL
Subjt:  MGSTPKQPFFCFKWPWDVDPKNPFDCSFEGPWLFKSLQNVGAFAFNFVNKVSKSSPPWINTFNQLQFDASTGGNKISQPRKLLTPEEQGEAEHRAFASAL

Query:  ARGKEATVIEFYSPKCRLCNSLLDFVTEIEVRNSDWLSIVMADAENDKWLPELLHYDIRYVPCFVLLDKHGKALAKTGIPSSRLHVIAGLSHLLKLKRPS
        A GKEATVIEFYSPKCRLCNSLLD V E+E RNSDWLSIVMADAEN+KWLPELLHYDIRYVPCFVLLDKHGKALAKTGIPSSRLHVIAGLSHLL LKRP 
Subjt:  ARGKEATVIEFYSPKCRLCNSLLDFVTEIEVRNSDWLSIVMADAENDKWLPELLHYDIRYVPCFVLLDKHGKALAKTGIPSSRLHVIAGLSHLLKLKRPS

Query:  RLPGSDNKP
         L GS NKP
Subjt:  RLPGSDNKP

A0A6J1KP92 uncharacterized protein LOC1114960271.8e-9680.28Show/hide
Query:  MGSTPKQPFFCFKWPWDVDPKNPFDCSFEGPWLFKSLQNVGAFAFNFVNKVSKSSPPWINTFNQLQFDASTGGNKISQPRKLLTPEEQGEAEHRAFASAL
        M STPKQP FCFKWPWD++PKNP DCSFEGPWLFKSLQNVG FA NF+N+VSKSSPPW+N F  L FD    GNKIS+ RK L+PEEQGEAE+RAFA+AL
Subjt:  MGSTPKQPFFCFKWPWDVDPKNPFDCSFEGPWLFKSLQNVGAFAFNFVNKVSKSSPPWINTFNQLQFDASTGGNKISQPRKLLTPEEQGEAEHRAFASAL

Query:  ARGKEATVIEFYSPKCRLCNSLLDFVTEIEVRNSDWLSIVMADAENDKWLPELLHYDIRYVPCFVLLDKHGKALAKTGIPSSRLHVIAGLSHLLKLKRPS
        A GKEATVIEFYSPKC LC+SLL FVT++E RNS WL+IVMADAENDKWLPE+LHYDI YVPCFV+LDK GKALAKTGIPSSRLHVIAGLSHLLKLKRP+
Subjt:  ARGKEATVIEFYSPKCRLCNSLLDFVTEIEVRNSDWLSIVMADAENDKWLPELLHYDIRYVPCFVLLDKHGKALAKTGIPSSRLHVIAGLSHLLKLKRPS

Query:  RLPGSDNK---PC
         LPGSDNK   PC
Subjt:  RLPGSDNK---PC

SwissProt top hitse value%identityAlignment
P35088 Thiol:disulfide interchange protein TxlA1.9e-0736.92Show/hide
Query:  VIEFYSPKCRLCNSLLDFVTEIEVRNSDWLSIVMADAENDKWLPELLHYDIRYVPCFVLLDKHGK
        ++EFY+  C  C ++   +  ++   SD L  VM + +NDKWLPE+L Y++  +P FV L+  G+
Subjt:  VIEFYSPKCRLCNSLLDFVTEIEVRNSDWLSIVMADAENDKWLPELLHYDIRYVPCFVLLDKHGK

P73920 Thiol:disulfide interchange protein TxlA homolog1.3e-0834.57Show/hide
Query:  ASALARGKEATVIEFYSPKCRLCNSLLDFVTEIEVRNSDWLSIVMADAENDKWLPELLHYDIRYVPCFVLLDKHGKALAKT
        A+AL  G+  T++EFY+  C  C ++   + E++      ++  M + +N+KWLPE+L Y +  +P FV LD  G A+A++
Subjt:  ASALARGKEATVIEFYSPKCRLCNSLLDFVTEIEVRNSDWLSIVMADAENDKWLPELLHYDIRYVPCFVLLDKHGKALAKT

Arabidopsis top hitse value%identityAlignment
AT5G06430.1 Thioredoxin superfamily protein2.1e-6559.13Show/hide
Query:  KLKNKMGSTPKQPFFCFKWPWDVD--PKNPFD-CSFEGPWLFKSLQNVGAFAFNFVNKVSKSSPPWINTFNQLQFDASTGGNKISQPRKL-LTPEEQGEA
        K+     ST K PFFC KWPWD +  PK+    C F+GPWLF+S+Q +G+ A + +                     S G N   +P+K  L+  EQGEA
Subjt:  KLKNKMGSTPKQPFFCFKWPWDVD--PKNPFD-CSFEGPWLFKSLQNVGAFAFNFVNKVSKSSPPWINTFNQLQFDASTGGNKISQPRKL-LTPEEQGEA

Query:  EHRAFASALARGKEATVIEFYSPKCRLCNSLLDFVTEIEVRNSDWLSIVMADAENDKWLPELLHYDIRYVPCFVLLDKHGKALAKTGIPSSRLHVIAGLS
        E RAFA+ALA  KEATV+EFYS KCRLCNSLL FV E+E RNS+WLSI MADAEN+KW PELLHYD++YVPCFVLLDK+G+ALAKTG+PSSR HVIAG+S
Subjt:  EHRAFASALARGKEATVIEFYSPKCRLCNSLLDFVTEIEVRNSDWLSIVMADAENDKWLPELLHYDIRYVPCFVLLDKHGKALAKTGIPSSRLHVIAGLS

Query:  HLLKLKRP
        HLLK+KRP
Subjt:  HLLKLKRP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
TTTCATTTGCAGGGTTTGGGTTGCAATTTTTAGAGGAAGGGGAAATAGGGAAGTTAAAGAATAAAATGGGTTCGACGCCCAAACAGCCTTTCTTTTGCTTCAAATGGCCA
TGGGACGTTGACCCTAAAAATCCTTTTGACTGTTCGTTTGAGGGTCCTTGGCTGTTCAAATCATTGCAAAATGTGGGTGCCTTTGCTTTCAATTTTGTAAATAAAGTTTC
AAAGTCCTCACCTCCGTGGATCAATACTTTTAACCAGTTGCAATTCGATGCCTCGACTGGAGGAAATAAGATATCTCAGCCTAGAAAGTTGTTAACTCCTGAAGAGCAAG
GAGAGGCAGAGCATAGAGCATTTGCATCAGCATTGGCCAGGGGGAAAGAAGCTACCGTGATTGAGTTCTACTCACCGAAATGTCGCCTATGCAATTCCTTGCTTGATTTT
GTTACGGAGATAGAGGTGAGGAATTCAGACTGGCTTAGCATTGTGATGGCAGATGCAGAGAATGATAAATGGCTGCCTGAGCTGCTTCATTATGACATTAGATATGTTCC
TTGCTTTGTGTTGCTGGACAAACACGGCAAGGCGCTAGCGAAGACGGGTATTCCCAGCAGTCGGCTACATGTAATTGCAGGACTTTCTCATCTTCTCAAACTGAAGCGCC
CAAGCAGACTCCCTGGATCAGATAATAAGCCATGTTGA
mRNA sequenceShow/hide mRNA sequence
TTTCATTTGCAGGGTTTGGGTTGCAATTTTTAGAGGAAGGGGAAATAGGGAAGTTAAAGAATAAAATGGGTTCGACGCCCAAACAGCCTTTCTTTTGCTTCAAATGGCCA
TGGGACGTTGACCCTAAAAATCCTTTTGACTGTTCGTTTGAGGGTCCTTGGCTGTTCAAATCATTGCAAAATGTGGGTGCCTTTGCTTTCAATTTTGTAAATAAAGTTTC
AAAGTCCTCACCTCCGTGGATCAATACTTTTAACCAGTTGCAATTCGATGCCTCGACTGGAGGAAATAAGATATCTCAGCCTAGAAAGTTGTTAACTCCTGAAGAGCAAG
GAGAGGCAGAGCATAGAGCATTTGCATCAGCATTGGCCAGGGGGAAAGAAGCTACCGTGATTGAGTTCTACTCACCGAAATGTCGCCTATGCAATTCCTTGCTTGATTTT
GTTACGGAGATAGAGGTGAGGAATTCAGACTGGCTTAGCATTGTGATGGCAGATGCAGAGAATGATAAATGGCTGCCTGAGCTGCTTCATTATGACATTAGATATGTTCC
TTGCTTTGTGTTGCTGGACAAACACGGCAAGGCGCTAGCGAAGACGGGTATTCCCAGCAGTCGGCTACATGTAATTGCAGGACTTTCTCATCTTCTCAAACTGAAGCGCC
CAAGCAGACTCCCTGGATCAGATAATAAGCCATGTTGA
Protein sequenceShow/hide protein sequence
SFAGFGLQFLEEGEIGKLKNKMGSTPKQPFFCFKWPWDVDPKNPFDCSFEGPWLFKSLQNVGAFAFNFVNKVSKSSPPWINTFNQLQFDASTGGNKISQPRKLLTPEEQG
EAEHRAFASALARGKEATVIEFYSPKCRLCNSLLDFVTEIEVRNSDWLSIVMADAENDKWLPELLHYDIRYVPCFVLLDKHGKALAKTGIPSSRLHVIAGLSHLLKLKRP
SRLPGSDNKPC