; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr003096 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr003096
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionThioredoxin domain-containing protein
Genome locationtig00001967:24953..26260
RNA-Seq ExpressionSgr003096
SyntenySgr003096
Gene Ontology termsGO:0006396 - RNA processing (biological process)
GO:0045454 - cell redox homeostasis (biological process)
GO:0009507 - chloroplast (cellular component)
InterPro domainsIPR013766 - Thioredoxin domain
IPR036249 - Thioredoxin-like superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0041672.1 thioredoxin-like protein [Cucumis melo var. makuwa]1.2e-9685.22Show/hide
Query:  MGSTPKQPFFCFKWPWDVDPKNPFDCSFEGPWLFKSLQNVGAFAFNFVNKVSKSSPPWINTFNQLQFDASTGGNKISQPRKLLTPEEQGEAEHRAFASAL
        MGSTPKQP FCFKWPWDVD +N  DCSFEGPWLFKSLQNVG FAFNFVNKVSKSSP W+ TF  LQF+  TGGNKISQ RK+LTPEEQGEAE+RA A+AL
Subjt:  MGSTPKQPFFCFKWPWDVDPKNPFDCSFEGPWLFKSLQNVGAFAFNFVNKVSKSSPPWINTFNQLQFDASTGGNKISQPRKLLTPEEQGEAEHRAFASAL

Query:  ARGKEATMIEFYSPKCRLCNSLLDFVMEIEARNSDWLSIVMADAENDKWLPELLHYDIRYVPCFVLLDKHGKALAKTGIPSSRLHVIAGLSHLLKLKRPS
        A GKEAT+IEFYSPKC LCNSLL+ VMEIEARNSDWL+IVMADAENDKWLPELLHYDIRYVPCFVLLDKHGKALAKTGIPSSRLHVIAGLSHL+KLK P 
Subjt:  ARGKEATMIEFYSPKCRLCNSLLDFVMEIEARNSDWLSIVMADAENDKWLPELLHYDIRYVPCFVLLDKHGKALAKTGIPSSRLHVIAGLSHLLKLKRPS

Query:  RLP
          P
Subjt:  RLP

KAG6585642.1 hypothetical protein SDJN03_18375, partial [Cucurbita argyrosperma subsp. sororia]1.9e-9779Show/hide
Query:  GKLKNKMGSTPKQPFFCFKWPWDVDPKNPFDCSFEGPWLFKSLQNVGAFAFNFVNKVSKSSPPWINTFNQLQFDASTGGNKISQPRKLLTPEEQGEAEHR
        G L+N M STPKQP FCFKWPWD++PKNP DCSFEGPWLFKSLQNVG FA NF+N+VSKSSPPW+N F  L FD    GNKIS+ RK L+PEEQGEAE+R
Subjt:  GKLKNKMGSTPKQPFFCFKWPWDVDPKNPFDCSFEGPWLFKSLQNVGAFAFNFVNKVSKSSPPWINTFNQLQFDASTGGNKISQPRKLLTPEEQGEAEHR

Query:  AFASALARGKEATMIEFYSPKCRLCNSLLDFVMEIEARNSDWLSIVMADAENDKWLPELLHYDIRYVPCFVLLDKHGKALAKTGIPSSRLHVIAGLSHLL
        AFA+ALA GKEAT+IEFYSPKC LC+SLL FV ++EARNS WL+IVMADAENDKWLPELLHYDI YVPCFV+LDK GKALAKTGIPSSRLHVIAGLSHLL
Subjt:  AFASALARGKEATMIEFYSPKCRLCNSLLDFVMEIEARNSDWLSIVMADAENDKWLPELLHYDIRYVPCFVLLDKHGKALAKTGIPSSRLHVIAGLSHLL

Query:  KLKRPSRLPGSDNK---PC
        KLKRP+ LPGSDN+   PC
Subjt:  KLKRPSRLPGSDNK---PC

XP_008444648.1 PREDICTED: uncharacterized protein LOC103487919 [Cucumis melo]5.1e-9886.21Show/hide
Query:  MGSTPKQPFFCFKWPWDVDPKNPFDCSFEGPWLFKSLQNVGAFAFNFVNKVSKSSPPWINTFNQLQFDASTGGNKISQPRKLLTPEEQGEAEHRAFASAL
        MGSTPKQP FCFKWPWDVD KN  DCSFEGPWLFKSLQNVG FAFNFVNKVSKSSPPW+ TF  LQF+  TGGNKISQ RK+LTPEEQGEAE+RA A+AL
Subjt:  MGSTPKQPFFCFKWPWDVDPKNPFDCSFEGPWLFKSLQNVGAFAFNFVNKVSKSSPPWINTFNQLQFDASTGGNKISQPRKLLTPEEQGEAEHRAFASAL

Query:  ARGKEATMIEFYSPKCRLCNSLLDFVMEIEARNSDWLSIVMADAENDKWLPELLHYDIRYVPCFVLLDKHGKALAKTGIPSSRLHVIAGLSHLLKLKRPS
        A GKEAT+IEFYSPKC LCNSLL+ VMEIEARNSDWL+IVMADAENDKWLPELLHYDIRYVPCFVLLDKHGKALAKTGIPSSRLHVIAGLSHL+KLK P 
Subjt:  ARGKEATMIEFYSPKCRLCNSLLDFVMEIEARNSDWLSIVMADAENDKWLPELLHYDIRYVPCFVLLDKHGKALAKTGIPSSRLHVIAGLSHLLKLKRPS

Query:  RLP
          P
Subjt:  RLP

XP_022144394.1 uncharacterized protein LOC111014082 [Momordica charantia]9.2e-10890.43Show/hide
Query:  MGSTPKQPFFCFKWPWDVDPKNPFDCSFEGPWLFKSLQNVGAFAFNFVNKVSKSSPPWINTFNQLQFDASTGGNKISQPRKLLTPEEQGEAEHRAFASAL
        MGSTPKQPFFCFKWPWD DPKNPFDCSFEGPWLFKSLQNVGAFAFNFVNKVSKSSPPWIN F  L F+AS GGNK S PRK LTPEEQGEAEHRAFASAL
Subjt:  MGSTPKQPFFCFKWPWDVDPKNPFDCSFEGPWLFKSLQNVGAFAFNFVNKVSKSSPPWINTFNQLQFDASTGGNKISQPRKLLTPEEQGEAEHRAFASAL

Query:  ARGKEATMIEFYSPKCRLCNSLLDFVMEIEARNSDWLSIVMADAENDKWLPELLHYDIRYVPCFVLLDKHGKALAKTGIPSSRLHVIAGLSHLLKLKRPS
        A GKEAT+IEFYSPKCRLCNSLLD VME+EARNSDWLSIVMADAEN+KWLPELLHYDIRYVPCFVLLDKHGKALAKTGIPSSRLHVIAGLSHLL LKRP 
Subjt:  ARGKEATMIEFYSPKCRLCNSLLDFVMEIEARNSDWLSIVMADAENDKWLPELLHYDIRYVPCFVLLDKHGKALAKTGIPSSRLHVIAGLSHLLKLKRPS

Query:  RLPGSDNKP
         L GS NKP
Subjt:  RLPGSDNKP

XP_038885736.1 uncharacterized protein LOC120076025 [Benincasa hispida]2.2e-10187.68Show/hide
Query:  MGSTPKQPFFCFKWPWDVDPKNPFDCSFEGPWLFKSLQNVGAFAFNFVNKVSKSSPPWINTFNQLQFDASTGGNKISQPRKLLTPEEQGEAEHRAFASAL
        MGSTPKQPFFC KWPWDVDPKNPFDCSFEGPWLFKSLQNVG FAFNFVNKVSKSSPPWINTF  LQ DASTGGN ISQ RK+LTPEEQGEAE+RAFA+AL
Subjt:  MGSTPKQPFFCFKWPWDVDPKNPFDCSFEGPWLFKSLQNVGAFAFNFVNKVSKSSPPWINTFNQLQFDASTGGNKISQPRKLLTPEEQGEAEHRAFASAL

Query:  ARGKEATMIEFYSPKCRLCNSLLDFVMEIEARNSDWLSIVMADAENDKWLPELLHYDIRYVPCFVLLDKHGKALAKTGIPSSRLHVIAGLSHLLKLKRPS
        A GKEAT+IEFYSPKC LCNSLL+ VME+EARNSDWL+IVMADAEN KWLPE+LHYDIRYVPCFVLLDKHGKALAKTGIPSSRL VIAGLSHL+KLK P 
Subjt:  ARGKEATMIEFYSPKCRLCNSLLDFVMEIEARNSDWLSIVMADAENDKWLPELLHYDIRYVPCFVLLDKHGKALAKTGIPSSRLHVIAGLSHLLKLKRPS

Query:  RLP
          P
Subjt:  RLP

TrEMBL top hitse value%identityAlignment
A0A0A0LRE2 Thioredoxin domain-containing protein1.9e-9582.76Show/hide
Query:  MGSTPKQPFFCFKWPWDVDPKNPFDCSFEGPWLFKSLQNVGAFAFNFVNKVSKSSPPWINTFNQLQFDASTGGNKISQPRKLLTPEEQGEAEHRAFASAL
        MGSTPKQPFFCFKWPWDVDPKN  DCSFE PWLFKSLQNVG FAF+FVNK SKSSPPW+ TF  LQF+  TGGNKISQ RK+LTPEEQGEAE+RA A+AL
Subjt:  MGSTPKQPFFCFKWPWDVDPKNPFDCSFEGPWLFKSLQNVGAFAFNFVNKVSKSSPPWINTFNQLQFDASTGGNKISQPRKLLTPEEQGEAEHRAFASAL

Query:  ARGKEATMIEFYSPKCRLCNSLLDFVMEIEARNSDWLSIVMADAENDKWLPELLHYDIRYVPCFVLLDKHGKALAKTGIPSSRLHVIAGLSHLLKLKRPS
        A GKEAT+IEFYSPKC LCNSLL+ V E+EARNSDWL+IVMADAENDKWLPELLHYDI YVPCFVLLDKHGKALAKT +PSSRLHVIAGLSHL+K+K P 
Subjt:  ARGKEATMIEFYSPKCRLCNSLLDFVMEIEARNSDWLSIVMADAENDKWLPELLHYDIRYVPCFVLLDKHGKALAKTGIPSSRLHVIAGLSHLLKLKRPS

Query:  RLP
          P
Subjt:  RLP

A0A1S3BAS7 uncharacterized protein LOC1034879192.5e-9886.21Show/hide
Query:  MGSTPKQPFFCFKWPWDVDPKNPFDCSFEGPWLFKSLQNVGAFAFNFVNKVSKSSPPWINTFNQLQFDASTGGNKISQPRKLLTPEEQGEAEHRAFASAL
        MGSTPKQP FCFKWPWDVD KN  DCSFEGPWLFKSLQNVG FAFNFVNKVSKSSPPW+ TF  LQF+  TGGNKISQ RK+LTPEEQGEAE+RA A+AL
Subjt:  MGSTPKQPFFCFKWPWDVDPKNPFDCSFEGPWLFKSLQNVGAFAFNFVNKVSKSSPPWINTFNQLQFDASTGGNKISQPRKLLTPEEQGEAEHRAFASAL

Query:  ARGKEATMIEFYSPKCRLCNSLLDFVMEIEARNSDWLSIVMADAENDKWLPELLHYDIRYVPCFVLLDKHGKALAKTGIPSSRLHVIAGLSHLLKLKRPS
        A GKEAT+IEFYSPKC LCNSLL+ VMEIEARNSDWL+IVMADAENDKWLPELLHYDIRYVPCFVLLDKHGKALAKTGIPSSRLHVIAGLSHL+KLK P 
Subjt:  ARGKEATMIEFYSPKCRLCNSLLDFVMEIEARNSDWLSIVMADAENDKWLPELLHYDIRYVPCFVLLDKHGKALAKTGIPSSRLHVIAGLSHLLKLKRPS

Query:  RLP
          P
Subjt:  RLP

A0A5A7TE38 Thioredoxin-like protein6.0e-9785.22Show/hide
Query:  MGSTPKQPFFCFKWPWDVDPKNPFDCSFEGPWLFKSLQNVGAFAFNFVNKVSKSSPPWINTFNQLQFDASTGGNKISQPRKLLTPEEQGEAEHRAFASAL
        MGSTPKQP FCFKWPWDVD +N  DCSFEGPWLFKSLQNVG FAFNFVNKVSKSSP W+ TF  LQF+  TGGNKISQ RK+LTPEEQGEAE+RA A+AL
Subjt:  MGSTPKQPFFCFKWPWDVDPKNPFDCSFEGPWLFKSLQNVGAFAFNFVNKVSKSSPPWINTFNQLQFDASTGGNKISQPRKLLTPEEQGEAEHRAFASAL

Query:  ARGKEATMIEFYSPKCRLCNSLLDFVMEIEARNSDWLSIVMADAENDKWLPELLHYDIRYVPCFVLLDKHGKALAKTGIPSSRLHVIAGLSHLLKLKRPS
        A GKEAT+IEFYSPKC LCNSLL+ VMEIEARNSDWL+IVMADAENDKWLPELLHYDIRYVPCFVLLDKHGKALAKTGIPSSRLHVIAGLSHL+KLK P 
Subjt:  ARGKEATMIEFYSPKCRLCNSLLDFVMEIEARNSDWLSIVMADAENDKWLPELLHYDIRYVPCFVLLDKHGKALAKTGIPSSRLHVIAGLSHLLKLKRPS

Query:  RLP
          P
Subjt:  RLP

A0A6J1CS67 uncharacterized protein LOC1110140824.5e-10890.43Show/hide
Query:  MGSTPKQPFFCFKWPWDVDPKNPFDCSFEGPWLFKSLQNVGAFAFNFVNKVSKSSPPWINTFNQLQFDASTGGNKISQPRKLLTPEEQGEAEHRAFASAL
        MGSTPKQPFFCFKWPWD DPKNPFDCSFEGPWLFKSLQNVGAFAFNFVNKVSKSSPPWIN F  L F+AS GGNK S PRK LTPEEQGEAEHRAFASAL
Subjt:  MGSTPKQPFFCFKWPWDVDPKNPFDCSFEGPWLFKSLQNVGAFAFNFVNKVSKSSPPWINTFNQLQFDASTGGNKISQPRKLLTPEEQGEAEHRAFASAL

Query:  ARGKEATMIEFYSPKCRLCNSLLDFVMEIEARNSDWLSIVMADAENDKWLPELLHYDIRYVPCFVLLDKHGKALAKTGIPSSRLHVIAGLSHLLKLKRPS
        A GKEAT+IEFYSPKCRLCNSLLD VME+EARNSDWLSIVMADAEN+KWLPELLHYDIRYVPCFVLLDKHGKALAKTGIPSSRLHVIAGLSHLL LKRP 
Subjt:  ARGKEATMIEFYSPKCRLCNSLLDFVMEIEARNSDWLSIVMADAENDKWLPELLHYDIRYVPCFVLLDKHGKALAKTGIPSSRLHVIAGLSHLLKLKRPS

Query:  RLPGSDNKP
         L GS NKP
Subjt:  RLPGSDNKP

A0A6J1KP92 uncharacterized protein LOC1114960276.7e-9679.81Show/hide
Query:  MGSTPKQPFFCFKWPWDVDPKNPFDCSFEGPWLFKSLQNVGAFAFNFVNKVSKSSPPWINTFNQLQFDASTGGNKISQPRKLLTPEEQGEAEHRAFASAL
        M STPKQP FCFKWPWD++PKNP DCSFEGPWLFKSLQNVG FA NF+N+VSKSSPPW+N F  L FD    GNKIS+ RK L+PEEQGEAE+RAFA+AL
Subjt:  MGSTPKQPFFCFKWPWDVDPKNPFDCSFEGPWLFKSLQNVGAFAFNFVNKVSKSSPPWINTFNQLQFDASTGGNKISQPRKLLTPEEQGEAEHRAFASAL

Query:  ARGKEATMIEFYSPKCRLCNSLLDFVMEIEARNSDWLSIVMADAENDKWLPELLHYDIRYVPCFVLLDKHGKALAKTGIPSSRLHVIAGLSHLLKLKRPS
        A GKEAT+IEFYSPKC LC+SLL FV ++EARNS WL+IVMADAENDKWLPE+LHYDI YVPCFV+LDK GKALAKTGIPSSRLHVIAGLSHLLKLKRP+
Subjt:  ARGKEATMIEFYSPKCRLCNSLLDFVMEIEARNSDWLSIVMADAENDKWLPELLHYDIRYVPCFVLLDKHGKALAKTGIPSSRLHVIAGLSHLLKLKRPS

Query:  RLPGSDNK---PC
         LPGSDNK   PC
Subjt:  RLPGSDNK---PC

SwissProt top hitse value%identityAlignment
P35088 Thiol:disulfide interchange protein TxlA1.5e-0736.92Show/hide
Query:  MIEFYSPKCRLCNSLLDFVMEIEARNSDWLSIVMADAENDKWLPELLHYDIRYVPCFVLLDKHGK
        ++EFY+  C  C ++   +  ++   SD L  VM + +NDKWLPE+L Y++  +P FV L+  G+
Subjt:  MIEFYSPKCRLCNSLLDFVMEIEARNSDWLSIVMADAENDKWLPELLHYDIRYVPCFVLLDKHGK

P73920 Thiol:disulfide interchange protein TxlA homolog1.3e-0834.57Show/hide
Query:  ASALARGKEATMIEFYSPKCRLCNSLLDFVMEIEARNSDWLSIVMADAENDKWLPELLHYDIRYVPCFVLLDKHGKALAKT
        A+AL  G+  T++EFY+  C  C ++   + E++      ++  M + +N+KWLPE+L Y +  +P FV LD  G A+A++
Subjt:  ASALARGKEATMIEFYSPKCRLCNSLLDFVMEIEARNSDWLSIVMADAENDKWLPELLHYDIRYVPCFVLLDKHGKALAKT

Arabidopsis top hitse value%identityAlignment
AT5G06430.1 Thioredoxin superfamily protein1.6e-6558.65Show/hide
Query:  KLKNKMGSTPKQPFFCFKWPWDVD--PKNPFD-CSFEGPWLFKSLQNVGAFAFNFVNKVSKSSPPWINTFNQLQFDASTGGNKISQPRKL-LTPEEQGEA
        K+     ST K PFFC KWPWD +  PK+    C F+GPWLF+S+Q +G+ A + +                     S G N   +P+K  L+  EQGEA
Subjt:  KLKNKMGSTPKQPFFCFKWPWDVD--PKNPFD-CSFEGPWLFKSLQNVGAFAFNFVNKVSKSSPPWINTFNQLQFDASTGGNKISQPRKL-LTPEEQGEA

Query:  EHRAFASALARGKEATMIEFYSPKCRLCNSLLDFVMEIEARNSDWLSIVMADAENDKWLPELLHYDIRYVPCFVLLDKHGKALAKTGIPSSRLHVIAGLS
        E RAFA+ALA  KEAT++EFYS KCRLCNSLL FV+E+E RNS+WLSI MADAEN+KW PELLHYD++YVPCFVLLDK+G+ALAKTG+PSSR HVIAG+S
Subjt:  EHRAFASALARGKEATMIEFYSPKCRLCNSLLDFVMEIEARNSDWLSIVMADAENDKWLPELLHYDIRYVPCFVLLDKHGKALAKTGIPSSRLHVIAGLS

Query:  HLLKLKRP
        HLLK+KRP
Subjt:  HLLKLKRP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
TTTCATTTGCAGGGTTTGGGTTGCAATTTTTAGAGGAAGGGGAAATAGGGAAGTTAAAGAATAAAATGGGTTCGACACCCAAACAGCCTTTCTTTTGCTTCAAATGGCCA
TGGGATGTTGACCCTAAAAATCCTTTTGACTGTTCGTTTGAGGGTCCTTGGCTGTTCAAATCATTGCAAAATGTGGGTGCCTTTGCTTTCAATTTTGTAAATAAAGTTTC
AAAGTCCTCACCTCCGTGGATCAATACTTTTAACCAGTTGCAATTCGATGCCTCGACTGGAGGAAATAAGATATCTCAGCCTAGAAAGTTGTTAACTCCTGAAGAGCAAG
GAGAGGCAGAGCATAGAGCATTTGCATCAGCATTGGCCAGGGGGAAAGAAGCTACCATGATTGAGTTCTACTCGCCGAAATGTCGCCTATGCAATTCCTTGCTTGATTTT
GTTATGGAGATAGAGGCGAGGAATTCAGACTGGCTTAGCATTGTGATGGCAGATGCAGAGAATGATAAATGGCTGCCTGAGCTGCTTCATTATGACATTAGATATGTTCC
TTGCTTTGTGTTGCTGGACAAACACGGCAAGGCGCTAGCGAAGACGGGTATTCCCAGCAGTCGGCTACATGTAATTGCAGGACTTTCTCATCTTCTCAAACTGAAGCGCC
CAAGCAGACTCCCTGGATCAGATAATAAGCCATGTTGA
mRNA sequenceShow/hide mRNA sequence
TTTCATTTGCAGGGTTTGGGTTGCAATTTTTAGAGGAAGGGGAAATAGGGAAGTTAAAGAATAAAATGGGTTCGACACCCAAACAGCCTTTCTTTTGCTTCAAATGGCCA
TGGGATGTTGACCCTAAAAATCCTTTTGACTGTTCGTTTGAGGGTCCTTGGCTGTTCAAATCATTGCAAAATGTGGGTGCCTTTGCTTTCAATTTTGTAAATAAAGTTTC
AAAGTCCTCACCTCCGTGGATCAATACTTTTAACCAGTTGCAATTCGATGCCTCGACTGGAGGAAATAAGATATCTCAGCCTAGAAAGTTGTTAACTCCTGAAGAGCAAG
GAGAGGCAGAGCATAGAGCATTTGCATCAGCATTGGCCAGGGGGAAAGAAGCTACCATGATTGAGTTCTACTCGCCGAAATGTCGCCTATGCAATTCCTTGCTTGATTTT
GTTATGGAGATAGAGGCGAGGAATTCAGACTGGCTTAGCATTGTGATGGCAGATGCAGAGAATGATAAATGGCTGCCTGAGCTGCTTCATTATGACATTAGATATGTTCC
TTGCTTTGTGTTGCTGGACAAACACGGCAAGGCGCTAGCGAAGACGGGTATTCCCAGCAGTCGGCTACATGTAATTGCAGGACTTTCTCATCTTCTCAAACTGAAGCGCC
CAAGCAGACTCCCTGGATCAGATAATAAGCCATGTTGA
Protein sequenceShow/hide protein sequence
SFAGFGLQFLEEGEIGKLKNKMGSTPKQPFFCFKWPWDVDPKNPFDCSFEGPWLFKSLQNVGAFAFNFVNKVSKSSPPWINTFNQLQFDASTGGNKISQPRKLLTPEEQG
EAEHRAFASALARGKEATMIEFYSPKCRLCNSLLDFVMEIEARNSDWLSIVMADAENDKWLPELLHYDIRYVPCFVLLDKHGKALAKTGIPSSRLHVIAGLSHLLKLKRP
SRLPGSDNKPC