CuGenDBv2

HG10004912 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene ID: HG10004912
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: Thioredoxin domain-containing protein
Genome location: Chr08:21354310..21355499
RNA-Seq Expression: HG10004912
Synteny: HG10004912
Gene Ontology terms:
    GO:0006396 - RNA processing (biological process)
    GO:0045454 - cell redox homeostasis (biological process)
    GO:0009507 - chloroplast (cellular component)
InterPro domains:
    IPR013766 - Thioredoxin domain
    IPR036249 - Thioredoxin-like superfamily


Homology
GenBank top hits (e value, % identity, alignment)
KAA0041672.1 thioredoxin-like protein [Cucumis melo var. makuwa]; e value 9.8e-101; identity 89.16%
Query:  MGSTPKQPFFCFKWPWDVDHKNPFDCSFEGPWLFKSLQNVGAFAFNFVNKVSKSSPPWINTFKPLQFGASTGGNKISQSRKMLTPEEQGEAENRAFAAAL
        MGSTPKQP FCFKWPWDVD +N  DCSFEGPWLFKSLQNVG FAFNFVNKVSKSSP W+ TFKPLQF   TGGNKISQSRKMLTPEEQGEAENRA AAAL
Subjt:  MGSTPKQPFFCFKWPWDVDHKNPFDCSFEGPWLFKSLQNVGAFAFNFVNKVSKSSPPWINTFKPLQFGASTGGNKISQSRKMLTPEEQGEAENRAFAAAL

Query:  ASEKEATVIEFYSPKCRLCNSLLNFVMEMEARNSNWLSIVMADAENDKWLPELLHYDIRYVPCFVLLDKHGKALAKTGIPSSRLHVIAGLSHLLKLKAPK
        AS KEATVIEFYSPKC LCNSLLN VME+EARNS+WL+IVMADAENDKWLPELLHYDIRYVPCFVLLDKHGKALAKTGIPSSRLHVIAGLSHL+KLK+PK
Subjt:  ASEKEATVIEFYSPKCRLCNSLLNFVMEMEARNSNWLSIVMADAENDKWLPELLHYDIRYVPCFVLLDKHGKALAKTGIPSSRLHVIAGLSHLLKLKAPK

Query:  NTP
        +TP
Subjt:  NTP

XP_004152726.1 uncharacterized protein LOC101203280 [Cucumis sativus]; e value 7.7e-98; identity 86.21%
Query:  MGSTPKQPFFCFKWPWDVDHKNPFDCSFEGPWLFKSLQNVGAFAFNFVNKVSKSSPPWINTFKPLQFGASTGGNKISQSRKMLTPEEQGEAENRAFAAAL
        MGSTPKQPFFCFKWPWDVD KN  DCSFE PWLFKSLQNVG FAF+FVNK SKSSPPW+ TFK LQF   TGGNKISQSRKMLTPEEQGEAENRA AAAL
Subjt:  MGSTPKQPFFCFKWPWDVDHKNPFDCSFEGPWLFKSLQNVGAFAFNFVNKVSKSSPPWINTFKPLQFGASTGGNKISQSRKMLTPEEQGEAENRAFAAAL

Query:  ASEKEATVIEFYSPKCRLCNSLLNFVMEMEARNSNWLSIVMADAENDKWLPELLHYDIRYVPCFVLLDKHGKALAKTGIPSSRLHVIAGLSHLLKLKAPK
        AS KEAT+IEFYSPKC LCNSLLN V EMEARNS+WL+IVMADAENDKWLPELLHYDI YVPCFVLLDKHGKALAKT +PSSRLHVIAGLSHL+K+K+PK
Subjt:  ASEKEATVIEFYSPKCRLCNSLLNFVMEMEARNSNWLSIVMADAENDKWLPELLHYDIRYVPCFVLLDKHGKALAKTGIPSSRLHVIAGLSHLLKLKAPK

Query:  NTP
        +TP
Subjt:  NTP

XP_008444648.1 PREDICTED: uncharacterized protein LOC103487919 [Cucumis melo]; e value 4.0e-102; identity 90.15%
Query:  MGSTPKQPFFCFKWPWDVDHKNPFDCSFEGPWLFKSLQNVGAFAFNFVNKVSKSSPPWINTFKPLQFGASTGGNKISQSRKMLTPEEQGEAENRAFAAAL
        MGSTPKQP FCFKWPWDVD KN  DCSFEGPWLFKSLQNVG FAFNFVNKVSKSSPPW+ TFKPLQF   TGGNKISQSRKMLTPEEQGEAENRA AAAL
Subjt:  MGSTPKQPFFCFKWPWDVDHKNPFDCSFEGPWLFKSLQNVGAFAFNFVNKVSKSSPPWINTFKPLQFGASTGGNKISQSRKMLTPEEQGEAENRAFAAAL

Query:  ASEKEATVIEFYSPKCRLCNSLLNFVMEMEARNSNWLSIVMADAENDKWLPELLHYDIRYVPCFVLLDKHGKALAKTGIPSSRLHVIAGLSHLLKLKAPK
        AS KEATVIEFYSPKC LCNSLLN VME+EARNS+WL+IVMADAENDKWLPELLHYDIRYVPCFVLLDKHGKALAKTGIPSSRLHVIAGLSHL+KLK+PK
Subjt:  ASEKEATVIEFYSPKCRLCNSLLNFVMEMEARNSNWLSIVMADAENDKWLPELLHYDIRYVPCFVLLDKHGKALAKTGIPSSRLHVIAGLSHLLKLKAPK

Query:  NTP
        +TP
Subjt:  NTP

XP_022144394.1 uncharacterized protein LOC111014082 [Momordica charantia]; e value 2.0e-101; identity 89.55%
Query:  MGSTPKQPFFCFKWPWDVDHKNPFDCSFEGPWLFKSLQNVGAFAFNFVNKVSKSSPPWINTFKPLQFGASTGGNKISQSRKMLTPEEQGEAENRAFAAAL
        MGSTPKQPFFCFKWPWD D KNPFDCSFEGPWLFKSLQNVGAFAFNFVNKVSKSSPPWIN FKPL F AS GGNK S  RK LTPEEQGEAE+RAFA+AL
Subjt:  MGSTPKQPFFCFKWPWDVDHKNPFDCSFEGPWLFKSLQNVGAFAFNFVNKVSKSSPPWINTFKPLQFGASTGGNKISQSRKMLTPEEQGEAENRAFAAAL

Query:  ASEKEATVIEFYSPKCRLCNSLLNFVMEMEARNSNWLSIVMADAENDKWLPELLHYDIRYVPCFVLLDKHGKALAKTGIPSSRLHVIAGLSHLLKLKAPK
        AS KEATVIEFYSPKCRLCNSLL+ VMEMEARNS+WLSIVMADAEN+KWLPELLHYDIRYVPCFVLLDKHGKALAKTGIPSSRLHVIAGLSHLL LK P+
Subjt:  ASEKEATVIEFYSPKCRLCNSLLNFVMEMEARNSNWLSIVMADAENDKWLPELLHYDIRYVPCFVLLDKHGKALAKTGIPSSRLHVIAGLSHLLKLKAPK

Query:  N
        +
Subjt:  N

XP_038885736.1 uncharacterized protein LOC120076025 [Benincasa hispida]; e value 2.2e-105; identity 92.61%
Query:  MGSTPKQPFFCFKWPWDVDHKNPFDCSFEGPWLFKSLQNVGAFAFNFVNKVSKSSPPWINTFKPLQFGASTGGNKISQSRKMLTPEEQGEAENRAFAAAL
        MGSTPKQPFFC KWPWDVD KNPFDCSFEGPWLFKSLQNVG FAFNFVNKVSKSSPPWINTFKPLQ  ASTGGN ISQSRKMLTPEEQGEAENRAFAAAL
Subjt:  MGSTPKQPFFCFKWPWDVDHKNPFDCSFEGPWLFKSLQNVGAFAFNFVNKVSKSSPPWINTFKPLQFGASTGGNKISQSRKMLTPEEQGEAENRAFAAAL

Query:  ASEKEATVIEFYSPKCRLCNSLLNFVMEMEARNSNWLSIVMADAENDKWLPELLHYDIRYVPCFVLLDKHGKALAKTGIPSSRLHVIAGLSHLLKLKAPK
        AS KEATVIEFYSPKC LCNSLLN VMEMEARNS+WL+IVMADAEN KWLPE+LHYDIRYVPCFVLLDKHGKALAKTGIPSSRL VIAGLSHL+KLKAPK
Subjt:  ASEKEATVIEFYSPKCRLCNSLLNFVMEMEARNSNWLSIVMADAENDKWLPELLHYDIRYVPCFVLLDKHGKALAKTGIPSSRLHVIAGLSHLLKLKAPK

Query:  NTP
        NTP
Subjt:  NTP

TrEMBL top hits (e value, % identity, alignment)
A0A0A0LRE2 Thioredoxin domain-containing protein; e value 3.8e-98; identity 86.21%
Query:  MGSTPKQPFFCFKWPWDVDHKNPFDCSFEGPWLFKSLQNVGAFAFNFVNKVSKSSPPWINTFKPLQFGASTGGNKISQSRKMLTPEEQGEAENRAFAAAL
        MGSTPKQPFFCFKWPWDVD KN  DCSFE PWLFKSLQNVG FAF+FVNK SKSSPPW+ TFK LQF   TGGNKISQSRKMLTPEEQGEAENRA AAAL
Subjt:  MGSTPKQPFFCFKWPWDVDHKNPFDCSFEGPWLFKSLQNVGAFAFNFVNKVSKSSPPWINTFKPLQFGASTGGNKISQSRKMLTPEEQGEAENRAFAAAL

Query:  ASEKEATVIEFYSPKCRLCNSLLNFVMEMEARNSNWLSIVMADAENDKWLPELLHYDIRYVPCFVLLDKHGKALAKTGIPSSRLHVIAGLSHLLKLKAPK
        AS KEAT+IEFYSPKC LCNSLLN V EMEARNS+WL+IVMADAENDKWLPELLHYDI YVPCFVLLDKHGKALAKT +PSSRLHVIAGLSHL+K+K+PK
Subjt:  ASEKEATVIEFYSPKCRLCNSLLNFVMEMEARNSNWLSIVMADAENDKWLPELLHYDIRYVPCFVLLDKHGKALAKTGIPSSRLHVIAGLSHLLKLKAPK

Query:  NTP
        +TP
Subjt:  NTP

A0A1S3BAS7 uncharacterized protein LOC103487919; e value 1.9e-102; identity 90.15%
Query:  MGSTPKQPFFCFKWPWDVDHKNPFDCSFEGPWLFKSLQNVGAFAFNFVNKVSKSSPPWINTFKPLQFGASTGGNKISQSRKMLTPEEQGEAENRAFAAAL
        MGSTPKQP FCFKWPWDVD KN  DCSFEGPWLFKSLQNVG FAFNFVNKVSKSSPPW+ TFKPLQF   TGGNKISQSRKMLTPEEQGEAENRA AAAL
Subjt:  MGSTPKQPFFCFKWPWDVDHKNPFDCSFEGPWLFKSLQNVGAFAFNFVNKVSKSSPPWINTFKPLQFGASTGGNKISQSRKMLTPEEQGEAENRAFAAAL

Query:  ASEKEATVIEFYSPKCRLCNSLLNFVMEMEARNSNWLSIVMADAENDKWLPELLHYDIRYVPCFVLLDKHGKALAKTGIPSSRLHVIAGLSHLLKLKAPK
        AS KEATVIEFYSPKC LCNSLLN VME+EARNS+WL+IVMADAENDKWLPELLHYDIRYVPCFVLLDKHGKALAKTGIPSSRLHVIAGLSHL+KLK+PK
Subjt:  ASEKEATVIEFYSPKCRLCNSLLNFVMEMEARNSNWLSIVMADAENDKWLPELLHYDIRYVPCFVLLDKHGKALAKTGIPSSRLHVIAGLSHLLKLKAPK

Query:  NTP
        +TP
Subjt:  NTP

A0A5A7TE38 Thioredoxin-like protein; e value 4.7e-101; identity 89.16%
Query:  MGSTPKQPFFCFKWPWDVDHKNPFDCSFEGPWLFKSLQNVGAFAFNFVNKVSKSSPPWINTFKPLQFGASTGGNKISQSRKMLTPEEQGEAENRAFAAAL
        MGSTPKQP FCFKWPWDVD +N  DCSFEGPWLFKSLQNVG FAFNFVNKVSKSSP W+ TFKPLQF   TGGNKISQSRKMLTPEEQGEAENRA AAAL
Subjt:  MGSTPKQPFFCFKWPWDVDHKNPFDCSFEGPWLFKSLQNVGAFAFNFVNKVSKSSPPWINTFKPLQFGASTGGNKISQSRKMLTPEEQGEAENRAFAAAL

Query:  ASEKEATVIEFYSPKCRLCNSLLNFVMEMEARNSNWLSIVMADAENDKWLPELLHYDIRYVPCFVLLDKHGKALAKTGIPSSRLHVIAGLSHLLKLKAPK
        AS KEATVIEFYSPKC LCNSLLN VME+EARNS+WL+IVMADAENDKWLPELLHYDIRYVPCFVLLDKHGKALAKTGIPSSRLHVIAGLSHL+KLK+PK
Subjt:  ASEKEATVIEFYSPKCRLCNSLLNFVMEMEARNSNWLSIVMADAENDKWLPELLHYDIRYVPCFVLLDKHGKALAKTGIPSSRLHVIAGLSHLLKLKAPK

Query:  NTP
        +TP
Subjt:  NTP

A0A6J1CS67 uncharacterized protein LOC111014082; e value 9.5e-102; identity 89.55%
Query:  MGSTPKQPFFCFKWPWDVDHKNPFDCSFEGPWLFKSLQNVGAFAFNFVNKVSKSSPPWINTFKPLQFGASTGGNKISQSRKMLTPEEQGEAENRAFAAAL
        MGSTPKQPFFCFKWPWD D KNPFDCSFEGPWLFKSLQNVGAFAFNFVNKVSKSSPPWIN FKPL F AS GGNK S  RK LTPEEQGEAE+RAFA+AL
Subjt:  MGSTPKQPFFCFKWPWDVDHKNPFDCSFEGPWLFKSLQNVGAFAFNFVNKVSKSSPPWINTFKPLQFGASTGGNKISQSRKMLTPEEQGEAENRAFAAAL

Query:  ASEKEATVIEFYSPKCRLCNSLLNFVMEMEARNSNWLSIVMADAENDKWLPELLHYDIRYVPCFVLLDKHGKALAKTGIPSSRLHVIAGLSHLLKLKAPK
        AS KEATVIEFYSPKCRLCNSLL+ VMEMEARNS+WLSIVMADAEN+KWLPELLHYDIRYVPCFVLLDKHGKALAKTGIPSSRLHVIAGLSHLL LK P+
Subjt:  ASEKEATVIEFYSPKCRLCNSLLNFVMEMEARNSNWLSIVMADAENDKWLPELLHYDIRYVPCFVLLDKHGKALAKTGIPSSRLHVIAGLSHLLKLKAPK

Query:  N
        +
Subjt:  N

A0A6J1KP92 uncharacterized protein LOC111496027; e value 5.2e-92; identity 81.28%
Query:  MGSTPKQPFFCFKWPWDVDHKNPFDCSFEGPWLFKSLQNVGAFAFNFVNKVSKSSPPWINTFKPLQFGASTGGNKISQSRKMLTPEEQGEAENRAFAAAL
        M STPKQP FCFKWPWD++ KNP DCSFEGPWLFKSLQNVG FA NF+N+VSKSSPPW+N FKPL F     GNKIS+SRK L+PEEQGEAENRAFAAAL
Subjt:  MGSTPKQPFFCFKWPWDVDHKNPFDCSFEGPWLFKSLQNVGAFAFNFVNKVSKSSPPWINTFKPLQFGASTGGNKISQSRKMLTPEEQGEAENRAFAAAL

Query:  ASEKEATVIEFYSPKCRLCNSLLNFVMEMEARNSNWLSIVMADAENDKWLPELLHYDIRYVPCFVLLDKHGKALAKTGIPSSRLHVIAGLSHLLKLKAPK
        A  KEATVIEFYSPKC LC+SLL FV +MEARNS WL+IVMADAENDKWLPE+LHYDI YVPCFV+LDK GKALAKTGIPSSRLHVIAGLSHLLKLK P 
Subjt:  ASEKEATVIEFYSPKCRLCNSLLNFVMEMEARNSNWLSIVMADAENDKWLPELLHYDIRYVPCFVLLDKHGKALAKTGIPSSRLHVIAGLSHLLKLKAPK

Query:  NTP
          P
Subjt:  NTP

SwissProt top hits (e value, % identity, alignment)
P35088 Thiol:disulfide interchange protein TxlA; e value 6.5e-07; identity 32.47%
Query:  FAAALASEKEATVIEFYSPKCRLCNSLLNFVMEMEARNSNWLSIVMADAENDKWLPELLHYDIRYVPCFVLLDKHGK
        +  A+A+++   ++EFY+  C  C ++   +  ++   S+ L  VM + +NDKWLPE+L Y++  +P FV L+  G+
Subjt:  FAAALASEKEATVIEFYSPKCRLCNSLLNFVMEMEARNSNWLSIVMADAENDKWLPELLHYDIRYVPCFVLLDKHGK

P73920 Thiol:disulfide interchange protein TxlA homolog; e value 2.2e-07; identity 28.89%
Query:  QGEAENRAFAAALASEKEATVIEFYSPKCRLCNSLLNFVMEMEARNSNWLSIVMADAENDKWLPELLHYDIRYVPCFVLLDKHGKALAKT
        + +A+     A        T++EFY+  C  C ++   + E++      ++  M + +N+KWLPE+L Y +  +P FV LD  G A+A++
Subjt:  QGEAENRAFAAALASEKEATVIEFYSPKCRLCNSLLNFVMEMEARNSNWLSIVMADAENDKWLPELLHYDIRYVPCFVLLDKHGKALAKT

Arabidopsis top hits (e value, % identity, alignment)
AT5G06430.1 Thioredoxin superfamily protein; e value 4.9e-66; identity 60.7%
Query:  STPKQPFFCFKWPWDVDHKNPFD----CSFEGPWLFKSLQNVGAFAFNFVNKVSKSSPPWINTFKPLQFGASTGGNKISQSRKMLTPEEQGEAENRAFAA
        ST K PFFC KWPWD  +K P      C F+GPWLF+S+Q +G+ A + +    ++       F+P               +K L+  EQGEAE RAFAA
Subjt:  STPKQPFFCFKWPWDVDHKNPFD----CSFEGPWLFKSLQNVGAFAFNFVNKVSKSSPPWINTFKPLQFGASTGGNKISQSRKMLTPEEQGEAENRAFAA

Query:  ALASEKEATVIEFYSPKCRLCNSLLNFVMEMEARNSNWLSIVMADAENDKWLPELLHYDIRYVPCFVLLDKHGKALAKTGIPSSRLHVIAGLSHLLKLKA
        ALAS+KEATV+EFYS KCRLCNSLL FV+E+E RNSNWLSI MADAEN+KW PELLHYD++YVPCFVLLDK+G+ALAKTG+PSSR HVIAG+SHLLK+K 
Subjt:  ALASEKEATVIEFYSPKCRLCNSLLNFVMEMEARNSNWLSIVMADAENDKWLPELLHYDIRYVPCFVLLDKHGKALAKTGIPSSRLHVIAGLSHLLKLKA

Query:  P
        P
Subjt:  P


Sequences
CDS sequence
ATGAAGAATATAATGGGTTCAACACCCAAACAACCTTTCTTTTGCTTCAAATGGCCATGGGACGTAGACCATAAAAATCCTTTCGACTGTTCGTTTGAGGGTCCTTGGCT
GTTCAAATCGCTGCAAAATGTGGGTGCCTTTGCTTTCAATTTTGTAAATAAAGTTTCAAAATCATCGCCTCCATGGATCAATACTTTTAAGCCACTGCAATTCGGTGCCT
CAACTGGTGGAAATAAGATATCTCAATCTAGAAAGATGTTAACTCCTGAAGAGCAAGGGGAGGCGGAAAATAGAGCATTTGCAGCAGCATTAGCCAGTGAGAAAGAAGCC
ACCGTGATCGAGTTCTACTCGCCCAAATGTCGCCTTTGCAATTCCTTGCTCAATTTTGTCATGGAGATGGAAGCAAGGAATTCAAATTGGCTTAGTATTGTGATGGCAGA
TGCAGAGAATGATAAATGGCTGCCCGAGCTTCTTCATTATGACATTAGATATGTTCCTTGCTTTGTGTTGCTGGACAAACATGGCAAGGCGCTAGCGAAGACGGGTATTC
CTAGCAGTCGGCTTCATGTAATTGCAGGACTTTCTCATCTTCTCAAACTGAAAGCGCCCAAGAATACTCCCTGA
mRNA sequence
ATGAAGAATATAATGGGTTCAACACCCAAACAACCTTTCTTTTGCTTCAAATGGCCATGGGACGTAGACCATAAAAATCCTTTCGACTGTTCGTTTGAGGGTCCTTGGCT
GTTCAAATCGCTGCAAAATGTGGGTGCCTTTGCTTTCAATTTTGTAAATAAAGTTTCAAAATCATCGCCTCCATGGATCAATACTTTTAAGCCACTGCAATTCGGTGCCT
CAACTGGTGGAAATAAGATATCTCAATCTAGAAAGATGTTAACTCCTGAAGAGCAAGGGGAGGCGGAAAATAGAGCATTTGCAGCAGCATTAGCCAGTGAGAAAGAAGCC
ACCGTGATCGAGTTCTACTCGCCCAAATGTCGCCTTTGCAATTCCTTGCTCAATTTTGTCATGGAGATGGAAGCAAGGAATTCAAATTGGCTTAGTATTGTGATGGCAGA
TGCAGAGAATGATAAATGGCTGCCCGAGCTTCTTCATTATGACATTAGATATGTTCCTTGCTTTGTGTTGCTGGACAAACATGGCAAGGCGCTAGCGAAGACGGGTATTC
CTAGCAGTCGGCTTCATGTAATTGCAGGACTTTCTCATCTTCTCAAACTGAAAGCGCCCAAGAATACTCCCTGA
Protein sequence
MKNIMGSTPKQPFFCFKWPWDVDHKNPFDCSFEGPWLFKSLQNVGAFAFNFVNKVSKSSPPWINTFKPLQFGASTGGNKISQSRKMLTPEEQGEAENRAFAAALASEKEA
TVIEFYSPKCRLCNSLLNFVMEMEARNSNWLSIVMADAENDKWLPELLHYDIRYVPCFVLLDKHGKALAKTGIPSSRLHVIAGLSHLLKLKAPKNTP
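
The CDS and protein sequences listed above can be cross-checked by translating the CDS. The following is a minimal sketch, assuming the standard nuclear genetic code applies to this gene model; the function name `translate` and the table-building approach are illustrative choices, not part of CuGenDB.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
# Assumption: the gene model uses the standard nuclear code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS into protein, dropping trailing stop codons.

    Whitespace and line breaks are ignored, so the wrapped sequence
    lines can be pasted in directly. Ambiguous bases such as N are
    not handled and raise KeyError.
    """
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = "".join(
        CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3)
    )
    return protein.rstrip("*")


if __name__ == "__main__":
    # First nine codons of the CDS sequence listed on this page.
    print(translate("ATGAAGAATATAATGGGTTCAACACCC"))
```

Passing the full CDS sequence (with or without its line breaks) to `translate` and comparing the result with the protein sequence is a quick consistency check on the record.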