; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

ClCG08G015590 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene IDClCG08G015590
OrganismCitrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
DescriptionThioredoxin domain-containing protein
Genome locationCG_Chr08:28096329..28098691
RNA-Seq ExpressionClCG08G015590
SyntenyClCG08G015590
Gene Ontology termsGO:0006396 - RNA processing (biological process)
GO:0045454 - cell redox homeostasis (biological process)
GO:0009507 - chloroplast (cellular component)
InterPro domainsIPR013766 - Thioredoxin domain
IPR036249 - Thioredoxin-like superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0041672.1 thioredoxin-like protein [Cucumis melo var. makuwa]4.4e-9987.68Show/hide
Query:  MGSTPKQPFFCFKWPWDVDPKNPFDCSFEGPWLFKSLQNVGAFAFNFVNKVSKSSPPWINTFKALRLDASTGGNKISQSRKILTPEEQGEAENRAFAAAL
        MGSTPKQP FCFKWPWDVD +N  DCSFEGPWLFKSLQNVG FAFNFVNKVSKSSP W+ TFK L+ +  TGGNKISQSRK+LTPEEQGEAENRA AAAL
Subjt:  MGSTPKQPFFCFKWPWDVDPKNPFDCSFEGPWLFKSLQNVGAFAFNFVNKVSKSSPPWINTFKALRLDASTGGNKISQSRKILTPEEQGEAENRAFAAAL

Query:  ASGKEATVIEFYSPKCRLCNSLLNFVMEMEARNSNWLSIVMADAENDKWLPELLHYDIRYVPCFVLLDKHGKALAKTGIPSSRLHVIAGLSHLLKLKAPK
        ASGKEATVIEFYSPKC LCNSLLN VME+EARNS+WL+IVMADAENDKWLPELLHYDIRYVPCFVLLDKHGKALAKTGIPSSRLHVIAGLSHL+KLK+PK
Subjt:  ASGKEATVIEFYSPKCRLCNSLLNFVMEMEARNSNWLSIVMADAENDKWLPELLHYDIRYVPCFVLLDKHGKALAKTGIPSSRLHVIAGLSHLLKLKAPK

Query:  NTP
        +TP
Subjt:  NTP

XP_004152726.1 uncharacterized protein LOC101203280 [Cucumis sativus]2.2e-9885.71Show/hide
Query:  MGSTPKQPFFCFKWPWDVDPKNPFDCSFEGPWLFKSLQNVGAFAFNFVNKVSKSSPPWINTFKALRLDASTGGNKISQSRKILTPEEQGEAENRAFAAAL
        MGSTPKQPFFCFKWPWDVDPKN  DCSFE PWLFKSLQNVG FAF+FVNK SKSSPPW+ TFK+L+ +  TGGNKISQSRK+LTPEEQGEAENRA AAAL
Subjt:  MGSTPKQPFFCFKWPWDVDPKNPFDCSFEGPWLFKSLQNVGAFAFNFVNKVSKSSPPWINTFKALRLDASTGGNKISQSRKILTPEEQGEAENRAFAAAL

Query:  ASGKEATVIEFYSPKCRLCNSLLNFVMEMEARNSNWLSIVMADAENDKWLPELLHYDIRYVPCFVLLDKHGKALAKTGIPSSRLHVIAGLSHLLKLKAPK
        ASGKEAT+IEFYSPKC LCNSLLN V EMEARNS+WL+IVMADAENDKWLPELLHYDI YVPCFVLLDKHGKALAKT +PSSRLHVIAGLSHL+K+K+PK
Subjt:  ASGKEATVIEFYSPKCRLCNSLLNFVMEMEARNSNWLSIVMADAENDKWLPELLHYDIRYVPCFVLLDKHGKALAKTGIPSSRLHVIAGLSHLLKLKAPK

Query:  NTP
        +TP
Subjt:  NTP

XP_008444648.1 PREDICTED: uncharacterized protein LOC103487919 [Cucumis melo]1.8e-10088.67Show/hide
Query:  MGSTPKQPFFCFKWPWDVDPKNPFDCSFEGPWLFKSLQNVGAFAFNFVNKVSKSSPPWINTFKALRLDASTGGNKISQSRKILTPEEQGEAENRAFAAAL
        MGSTPKQP FCFKWPWDVD KN  DCSFEGPWLFKSLQNVG FAFNFVNKVSKSSPPW+ TFK L+ +  TGGNKISQSRK+LTPEEQGEAENRA AAAL
Subjt:  MGSTPKQPFFCFKWPWDVDPKNPFDCSFEGPWLFKSLQNVGAFAFNFVNKVSKSSPPWINTFKALRLDASTGGNKISQSRKILTPEEQGEAENRAFAAAL

Query:  ASGKEATVIEFYSPKCRLCNSLLNFVMEMEARNSNWLSIVMADAENDKWLPELLHYDIRYVPCFVLLDKHGKALAKTGIPSSRLHVIAGLSHLLKLKAPK
        ASGKEATVIEFYSPKC LCNSLLN VME+EARNS+WL+IVMADAENDKWLPELLHYDIRYVPCFVLLDKHGKALAKTGIPSSRLHVIAGLSHL+KLK+PK
Subjt:  ASGKEATVIEFYSPKCRLCNSLLNFVMEMEARNSNWLSIVMADAENDKWLPELLHYDIRYVPCFVLLDKHGKALAKTGIPSSRLHVIAGLSHLLKLKAPK

Query:  NTP
        +TP
Subjt:  NTP

XP_022144394.1 uncharacterized protein LOC111014082 [Momordica charantia]7.2e-10289.55Show/hide
Query:  MGSTPKQPFFCFKWPWDVDPKNPFDCSFEGPWLFKSLQNVGAFAFNFVNKVSKSSPPWINTFKALRLDASTGGNKISQSRKILTPEEQGEAENRAFAAAL
        MGSTPKQPFFCFKWPWD DPKNPFDCSFEGPWLFKSLQNVGAFAFNFVNKVSKSSPPWIN FK L  +AS GGNK S  RK LTPEEQGEAE+RAFA+AL
Subjt:  MGSTPKQPFFCFKWPWDVDPKNPFDCSFEGPWLFKSLQNVGAFAFNFVNKVSKSSPPWINTFKALRLDASTGGNKISQSRKILTPEEQGEAENRAFAAAL

Query:  ASGKEATVIEFYSPKCRLCNSLLNFVMEMEARNSNWLSIVMADAENDKWLPELLHYDIRYVPCFVLLDKHGKALAKTGIPSSRLHVIAGLSHLLKLKAPK
        ASGKEATVIEFYSPKCRLCNSLL+ VMEMEARNS+WLSIVMADAEN+KWLPELLHYDIRYVPCFVLLDKHGKALAKTGIPSSRLHVIAGLSHLL LK P+
Subjt:  ASGKEATVIEFYSPKCRLCNSLLNFVMEMEARNSNWLSIVMADAENDKWLPELLHYDIRYVPCFVLLDKHGKALAKTGIPSSRLHVIAGLSHLLKLKAPK

Query:  N
        +
Subjt:  N

XP_038885736.1 uncharacterized protein LOC120076025 [Benincasa hispida]1.3e-10693.1Show/hide
Query:  MGSTPKQPFFCFKWPWDVDPKNPFDCSFEGPWLFKSLQNVGAFAFNFVNKVSKSSPPWINTFKALRLDASTGGNKISQSRKILTPEEQGEAENRAFAAAL
        MGSTPKQPFFC KWPWDVDPKNPFDCSFEGPWLFKSLQNVG FAFNFVNKVSKSSPPWINTFK L+LDASTGGN ISQSRK+LTPEEQGEAENRAFAAAL
Subjt:  MGSTPKQPFFCFKWPWDVDPKNPFDCSFEGPWLFKSLQNVGAFAFNFVNKVSKSSPPWINTFKALRLDASTGGNKISQSRKILTPEEQGEAENRAFAAAL

Query:  ASGKEATVIEFYSPKCRLCNSLLNFVMEMEARNSNWLSIVMADAENDKWLPELLHYDIRYVPCFVLLDKHGKALAKTGIPSSRLHVIAGLSHLLKLKAPK
        ASGKEATVIEFYSPKC LCNSLLN VMEMEARNS+WL+IVMADAEN KWLPE+LHYDIRYVPCFVLLDKHGKALAKTGIPSSRL VIAGLSHL+KLKAPK
Subjt:  ASGKEATVIEFYSPKCRLCNSLLNFVMEMEARNSNWLSIVMADAENDKWLPELLHYDIRYVPCFVLLDKHGKALAKTGIPSSRLHVIAGLSHLLKLKAPK

Query:  NTP
        NTP
Subjt:  NTP

TrEMBL top hitse value%identityAlignment
A0A0A0LRE2 Thioredoxin domain-containing protein1.1e-9885.71Show/hide
Query:  MGSTPKQPFFCFKWPWDVDPKNPFDCSFEGPWLFKSLQNVGAFAFNFVNKVSKSSPPWINTFKALRLDASTGGNKISQSRKILTPEEQGEAENRAFAAAL
        MGSTPKQPFFCFKWPWDVDPKN  DCSFE PWLFKSLQNVG FAF+FVNK SKSSPPW+ TFK+L+ +  TGGNKISQSRK+LTPEEQGEAENRA AAAL
Subjt:  MGSTPKQPFFCFKWPWDVDPKNPFDCSFEGPWLFKSLQNVGAFAFNFVNKVSKSSPPWINTFKALRLDASTGGNKISQSRKILTPEEQGEAENRAFAAAL

Query:  ASGKEATVIEFYSPKCRLCNSLLNFVMEMEARNSNWLSIVMADAENDKWLPELLHYDIRYVPCFVLLDKHGKALAKTGIPSSRLHVIAGLSHLLKLKAPK
        ASGKEAT+IEFYSPKC LCNSLLN V EMEARNS+WL+IVMADAENDKWLPELLHYDI YVPCFVLLDKHGKALAKT +PSSRLHVIAGLSHL+K+K+PK
Subjt:  ASGKEATVIEFYSPKCRLCNSLLNFVMEMEARNSNWLSIVMADAENDKWLPELLHYDIRYVPCFVLLDKHGKALAKTGIPSSRLHVIAGLSHLLKLKAPK

Query:  NTP
        +TP
Subjt:  NTP

A0A1S3BAS7 uncharacterized protein LOC1034879198.6e-10188.67Show/hide
Query:  MGSTPKQPFFCFKWPWDVDPKNPFDCSFEGPWLFKSLQNVGAFAFNFVNKVSKSSPPWINTFKALRLDASTGGNKISQSRKILTPEEQGEAENRAFAAAL
        MGSTPKQP FCFKWPWDVD KN  DCSFEGPWLFKSLQNVG FAFNFVNKVSKSSPPW+ TFK L+ +  TGGNKISQSRK+LTPEEQGEAENRA AAAL
Subjt:  MGSTPKQPFFCFKWPWDVDPKNPFDCSFEGPWLFKSLQNVGAFAFNFVNKVSKSSPPWINTFKALRLDASTGGNKISQSRKILTPEEQGEAENRAFAAAL

Query:  ASGKEATVIEFYSPKCRLCNSLLNFVMEMEARNSNWLSIVMADAENDKWLPELLHYDIRYVPCFVLLDKHGKALAKTGIPSSRLHVIAGLSHLLKLKAPK
        ASGKEATVIEFYSPKC LCNSLLN VME+EARNS+WL+IVMADAENDKWLPELLHYDIRYVPCFVLLDKHGKALAKTGIPSSRLHVIAGLSHL+KLK+PK
Subjt:  ASGKEATVIEFYSPKCRLCNSLLNFVMEMEARNSNWLSIVMADAENDKWLPELLHYDIRYVPCFVLLDKHGKALAKTGIPSSRLHVIAGLSHLLKLKAPK

Query:  NTP
        +TP
Subjt:  NTP

A0A5A7TE38 Thioredoxin-like protein2.1e-9987.68Show/hide
Query:  MGSTPKQPFFCFKWPWDVDPKNPFDCSFEGPWLFKSLQNVGAFAFNFVNKVSKSSPPWINTFKALRLDASTGGNKISQSRKILTPEEQGEAENRAFAAAL
        MGSTPKQP FCFKWPWDVD +N  DCSFEGPWLFKSLQNVG FAFNFVNKVSKSSP W+ TFK L+ +  TGGNKISQSRK+LTPEEQGEAENRA AAAL
Subjt:  MGSTPKQPFFCFKWPWDVDPKNPFDCSFEGPWLFKSLQNVGAFAFNFVNKVSKSSPPWINTFKALRLDASTGGNKISQSRKILTPEEQGEAENRAFAAAL

Query:  ASGKEATVIEFYSPKCRLCNSLLNFVMEMEARNSNWLSIVMADAENDKWLPELLHYDIRYVPCFVLLDKHGKALAKTGIPSSRLHVIAGLSHLLKLKAPK
        ASGKEATVIEFYSPKC LCNSLLN VME+EARNS+WL+IVMADAENDKWLPELLHYDIRYVPCFVLLDKHGKALAKTGIPSSRLHVIAGLSHL+KLK+PK
Subjt:  ASGKEATVIEFYSPKCRLCNSLLNFVMEMEARNSNWLSIVMADAENDKWLPELLHYDIRYVPCFVLLDKHGKALAKTGIPSSRLHVIAGLSHLLKLKAPK

Query:  NTP
        +TP
Subjt:  NTP

A0A6J1CS67 uncharacterized protein LOC1110140823.5e-10289.55Show/hide
Query:  MGSTPKQPFFCFKWPWDVDPKNPFDCSFEGPWLFKSLQNVGAFAFNFVNKVSKSSPPWINTFKALRLDASTGGNKISQSRKILTPEEQGEAENRAFAAAL
        MGSTPKQPFFCFKWPWD DPKNPFDCSFEGPWLFKSLQNVGAFAFNFVNKVSKSSPPWIN FK L  +AS GGNK S  RK LTPEEQGEAE+RAFA+AL
Subjt:  MGSTPKQPFFCFKWPWDVDPKNPFDCSFEGPWLFKSLQNVGAFAFNFVNKVSKSSPPWINTFKALRLDASTGGNKISQSRKILTPEEQGEAENRAFAAAL

Query:  ASGKEATVIEFYSPKCRLCNSLLNFVMEMEARNSNWLSIVMADAENDKWLPELLHYDIRYVPCFVLLDKHGKALAKTGIPSSRLHVIAGLSHLLKLKAPK
        ASGKEATVIEFYSPKCRLCNSLL+ VMEMEARNS+WLSIVMADAEN+KWLPELLHYDIRYVPCFVLLDKHGKALAKTGIPSSRLHVIAGLSHLL LK P+
Subjt:  ASGKEATVIEFYSPKCRLCNSLLNFVMEMEARNSNWLSIVMADAENDKWLPELLHYDIRYVPCFVLLDKHGKALAKTGIPSSRLHVIAGLSHLLKLKAPK

Query:  N
        +
Subjt:  N

A0A6J1KP92 uncharacterized protein LOC1114960278.6e-9381.77Show/hide
Query:  MGSTPKQPFFCFKWPWDVDPKNPFDCSFEGPWLFKSLQNVGAFAFNFVNKVSKSSPPWINTFKALRLDASTGGNKISQSRKILTPEEQGEAENRAFAAAL
        M STPKQP FCFKWPWD++PKNP DCSFEGPWLFKSLQNVG FA NF+N+VSKSSPPW+N FK L  D    GNKIS+SRK L+PEEQGEAENRAFAAAL
Subjt:  MGSTPKQPFFCFKWPWDVDPKNPFDCSFEGPWLFKSLQNVGAFAFNFVNKVSKSSPPWINTFKALRLDASTGGNKISQSRKILTPEEQGEAENRAFAAAL

Query:  ASGKEATVIEFYSPKCRLCNSLLNFVMEMEARNSNWLSIVMADAENDKWLPELLHYDIRYVPCFVLLDKHGKALAKTGIPSSRLHVIAGLSHLLKLKAPK
        A GKEATVIEFYSPKC LC+SLL FV +MEARNS WL+IVMADAENDKWLPE+LHYDI YVPCFV+LDK GKALAKTGIPSSRLHVIAGLSHLLKLK P 
Subjt:  ASGKEATVIEFYSPKCRLCNSLLNFVMEMEARNSNWLSIVMADAENDKWLPELLHYDIRYVPCFVLLDKHGKALAKTGIPSSRLHVIAGLSHLLKLKAPK

Query:  NTP
          P
Subjt:  NTP

SwissProt top hitse value%identityAlignment
P35088 Thiol:disulfide interchange protein TxlA1.2e-0635.38Show/hide
Query:  VIEFYSPKCRLCNSLLNFVMEMEARNSNWLSIVMADAENDKWLPELLHYDIRYVPCFVLLDKHGK
        ++EFY+  C  C ++   +  ++   S+ L  VM + +NDKWLPE+L Y++  +P FV L+  G+
Subjt:  VIEFYSPKCRLCNSLLNFVMEMEARNSNWLSIVMADAENDKWLPELLHYDIRYVPCFVLLDKHGK

P73920 Thiol:disulfide interchange protein TxlA homolog4.8e-0834.57Show/hide
Query:  AAALASGKEATVIEFYSPKCRLCNSLLNFVMEMEARNSNWLSIVMADAENDKWLPELLHYDIRYVPCFVLLDKHGKALAKT
        A AL +G+  T++EFY+  C  C ++   + E++      ++  M + +N+KWLPE+L Y +  +P FV LD  G A+A++
Subjt:  AAALASGKEATVIEFYSPKCRLCNSLLNFVMEMEARNSNWLSIVMADAENDKWLPELLHYDIRYVPCFVLLDKHGKALAKT

Arabidopsis top hitse value%identityAlignment
AT5G06430.1 Thioredoxin superfamily protein2.3e-6658.94Show/hide
Query:  KIKTIMGSTPKQPFFCFKWPWDVD--PKNPFD-CSFEGPWLFKSLQNVGAFAFNFVNKVSKSSPPWINTFKALRLDASTGGNKISQSRKILTPEEQGEAE
        K+ T   ST K PFFC KWPWD +  PK+    C F+GPWLF+S+Q +G+ A + +    ++                         +K L+  EQGEAE
Subjt:  KIKTIMGSTPKQPFFCFKWPWDVD--PKNPFD-CSFEGPWLFKSLQNVGAFAFNFVNKVSKSSPPWINTFKALRLDASTGGNKISQSRKILTPEEQGEAE

Query:  NRAFAAALASGKEATVIEFYSPKCRLCNSLLNFVMEMEARNSNWLSIVMADAENDKWLPELLHYDIRYVPCFVLLDKHGKALAKTGIPSSRLHVIAGLSH
         RAFAAALAS KEATV+EFYS KCRLCNSLL FV+E+E RNSNWLSI MADAEN+KW PELLHYD++YVPCFVLLDK+G+ALAKTG+PSSR HVIAG+SH
Subjt:  NRAFAAALASGKEATVIEFYSPKCRLCNSLLNFVMEMEARNSNWLSIVMADAENDKWLPELLHYDIRYVPCFVLLDKHGKALAKTGIPSSRLHVIAGLSH

Query:  LLKLKAP
        LLK+K P
Subjt:  LLKLKAP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAACTTAAAAAGAAAAAGGAAAAAAAGAAAAAAAGAAGAAGAGGAGAATGGAGGCAGAAAAGCCATCTCCAGAGCTCGCCCTGCTTTGTTAGCTGTTACCATTTTCAC
ATCCTTCTCCTTGTCGCCTACGAATTGTCCAAAGGAGTTCCGAGGGGCATCTCCTTTCTCAGTCCTAGTACCCCGCTTCTGGAAGCTCAATAAGGGGCAGGGTGATAGTT
TTATAGGAAGGGAAAGAAGAAAGATAAAGACTATAATGGGTTCAACACCCAAACAACCTTTCTTTTGCTTCAAATGGCCATGGGACGTAGACCCTAAAAATCCTTTCGAC
TGTTCGTTTGAGGGTCCTTGGCTGTTCAAATCGCTGCAAAATGTGGGTGCCTTTGCTTTCAATTTTGTAAATAAAGTTTCGAAGTCGTCGCCTCCATGGATCAATACTTT
TAAGGCGTTGCGATTGGATGCCTCAACTGGTGGAAATAAGATATCTCAGTCTAGAAAGATATTAACTCCTGAAGAGCAAGGGGAGGCAGAAAATAGAGCATTTGCTGCAG
CATTGGCCAGTGGGAAAGAAGCCACCGTGATTGAGTTCTACTCGCCCAAATGTCGCCTTTGCAATTCTTTGCTCAATTTTGTTATGGAGATGGAAGCAAGGAATTCAAAT
TGGCTTAGTATTGTGATGGCAGATGCAGAGAATGATAAATGGCTGCCCGAGCTTCTTCATTATGACATTAGATATGTTCCATGCTTTGTGTTGCTGGACAAACATGGGAA
GGCGCTAGCGAAGACGGGTATTCCTAGCAGTCGGCTTCATGTGATTGCAGGACTCTCTCATCTTCTCAAACTGAAAGCGCCCAAGAACACTCCCTGA
mRNA sequenceShow/hide mRNA sequence
ATGAACTTAAAAAGAAAAAGGAAAAAAAGAAAAAAAGAAGAAGAGGAGAATGGAGGCAGAAAAGCCATCTCCAGAGCTCGCCCTGCTTTGTTAGCTGTTACCATTTTCAC
ATCCTTCTCCTTGTCGCCTACGAATTGTCCAAAGGAGTTCCGAGGGGCATCTCCTTTCTCAGTCCTAGTACCCCGCTTCTGGAAGCTCAATAAGGGGCAGGGTGATAGTT
TTATAGGAAGGGAAAGAAGAAAGATAAAGACTATAATGGGTTCAACACCCAAACAACCTTTCTTTTGCTTCAAATGGCCATGGGACGTAGACCCTAAAAATCCTTTCGAC
TGTTCGTTTGAGGGTCCTTGGCTGTTCAAATCGCTGCAAAATGTGGGTGCCTTTGCTTTCAATTTTGTAAATAAAGTTTCGAAGTCGTCGCCTCCATGGATCAATACTTT
TAAGGCGTTGCGATTGGATGCCTCAACTGGTGGAAATAAGATATCTCAGTCTAGAAAGATATTAACTCCTGAAGAGCAAGGGGAGGCAGAAAATAGAGCATTTGCTGCAG
CATTGGCCAGTGGGAAAGAAGCCACCGTGATTGAGTTCTACTCGCCCAAATGTCGCCTTTGCAATTCTTTGCTCAATTTTGTTATGGAGATGGAAGCAAGGAATTCAAAT
TGGCTTAGTATTGTGATGGCAGATGCAGAGAATGATAAATGGCTGCCCGAGCTTCTTCATTATGACATTAGATATGTTCCATGCTTTGTGTTGCTGGACAAACATGGGAA
GGCGCTAGCGAAGACGGGTATTCCTAGCAGTCGGCTTCATGTGATTGCAGGACTCTCTCATCTTCTCAAACTGAAAGCGCCCAAGAACACTCCCTGA
Protein sequenceShow/hide protein sequence
MNLKRKRKKRKKEEEENGGRKAISRARPALLAVTIFTSFSLSPTNCPKEFRGASPFSVLVPRFWKLNKGQGDSFIGRERRKIKTIMGSTPKQPFFCFKWPWDVDPKNPFD
CSFEGPWLFKSLQNVGAFAFNFVNKVSKSSPPWINTFKALRLDASTGGNKISQSRKILTPEEQGEAENRAFAAALASGKEATVIEFYSPKCRLCNSLLNFVMEMEARNSN
WLSIVMADAENDKWLPELLHYDIRYVPCFVLLDKHGKALAKTGIPSSRLHVIAGLSHLLKLKAPKNTP