CuGenDBv2

Tan0016870 (gene) of Snake gourd v1 genome

Gene ID: Tan0016870
Organism: Trichosanthes anguina (Snake gourd v1)
Description: DCC family protein At1g52590, chloroplastic
Genome location: LG04:85155184..85156581
RNA-Seq Expression: Tan0016870
Synteny: Tan0016870
Gene Ontology terms: GO:0015035 - protein disulfide oxidoreductase activity (molecular function)
InterPro domains: IPR007263 - DCC1-like thiol-disulfide oxidoreductase family
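
The genome location above uses a `scaffold:start..end` form. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the page does not say so), the string can be parsed and the gene span computed as follows. `Location` and `parse_location` are hypothetical helpers written for illustration, not part of any CuGenDB tool.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A genomic interval; coordinates are ASSUMED 1-based and inclusive."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Inclusive coordinates: both endpoints count toward the span.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a 'seqid:start..end' string into a Location."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end = match.groups()
    return Location(seqid, int(start), int(end))


loc = parse_location("LG04:85155184..85156581")
print(loc.seqid, loc.start, loc.end, loc.length)
```

Under that assumption the gene spans 1398 bp on LG04. This is longer than the CDS listed under Sequences, which would be consistent with introns, although the page does not describe the gene structure.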


Homology
GenBank top hits (columns: e value, % identity, alignment)
XP_022153577.1 DCC family protein At1g52590, chloroplastic [Momordica charantia] (e value: 9.5e-81, % identity: 87.43)
Query:  MALRLSGGVPASLVPPSRAQARTRHSMAFASLSPATKDETIDWVEATSDFFGKDTRPIMLFDGVCNLCNGGVRFVRANDRNRRIRLEALQSEAGKKLLRR
        MA+ + G +PA  +PP +AQAR RHS+AFASLSPATK ETIDW+EATSDFF KD+RPIMLFDGVCNLCNGGVRFVRANDRNRRIRLEALQSEAGKKLLRR
Subjt:  MALRLSGGVPASLVPPSRAQARTRHSMAFASLSPATKDETIDWVEATSDFFGKDTRPIMLFDGVCNLCNGGVRFVRANDRNRRIRLEALQSEAGKKLLRR

Query:  SGRAPDDISSVVLVEKDRSYIKSEAVLRIMEHLELPFPQLALFIQFVPLFVRNFVYDNIADNRYRLFGRSESCEI
        SGRAPDDISSVVLVEKDRSYIKSEAV+RIMEHLELPFPQLA F+QFVPLFVRNFVY+N+ADNRYRLFGRSESCEI
Subjt:  SGRAPDDISSVVLVEKDRSYIKSEAVLRIMEHLELPFPQLALFIQFVPLFVRNFVYDNIADNRYRLFGRSESCEI

XP_022955810.1 DCC family protein At1g52590, chloroplastic [Cucurbita moschata] (e value: 2.4e-76, % identity: 86.86)
Query:  MALRLSGGVPASLVPPSRAQARTRHSMAFASLSPATKDETIDWVEATSDFFGKDTRPIMLFDGVCNLCNGGVRFVRANDRNRRIRLEALQSEAGKKLLRR
        MAL +SG VPA  V PS A  R R  +AF SLS AT D TIDWVEATSDFF KDTRPIMLFDGVCNLCNGGVRFVRANDRNRRIRLEALQSEAGKKLLRR
Subjt:  MALRLSGGVPASLVPPSRAQARTRHSMAFASLSPATKDETIDWVEATSDFFGKDTRPIMLFDGVCNLCNGGVRFVRANDRNRRIRLEALQSEAGKKLLRR

Query:  SGRAPDDISSVVLVEKDRSYIKSEAVLRIMEHLELPFPQLALFIQFVPLFVRNFVYDNIADNRYRLFGRSESCEI
        SGRAPDDISSVVLVEKDRSYIKSEAVLRIMEHLELPFPQLALF++FVPLFVRN  YDNIADNRY LFGRSESCEI
Subjt:  SGRAPDDISSVVLVEKDRSYIKSEAVLRIMEHLELPFPQLALFIQFVPLFVRNFVYDNIADNRYRLFGRSESCEI

XP_022980169.1 DCC family protein At1g52590, chloroplastic [Cucurbita maxima] (e value: 1.4e-76, % identity: 87.43)
Query:  MALRLSGGVPASLVPPSRAQARTRHSMAFASLSPATKDETIDWVEATSDFFGKDTRPIMLFDGVCNLCNGGVRFVRANDRNRRIRLEALQSEAGKKLLRR
        MAL LSG VPA  V PS A  R R  +AF SLS AT D TIDWVEATSDFF KDTRPIMLFDGVCNLCNGGVRFVRANDRNRRIRLEALQSEAGKKLLRR
Subjt:  MALRLSGGVPASLVPPSRAQARTRHSMAFASLSPATKDETIDWVEATSDFFGKDTRPIMLFDGVCNLCNGGVRFVRANDRNRRIRLEALQSEAGKKLLRR

Query:  SGRAPDDISSVVLVEKDRSYIKSEAVLRIMEHLELPFPQLALFIQFVPLFVRNFVYDNIADNRYRLFGRSESCEI
        SGRAPDDISSVVLVEKDRSYIKSEAVLRIMEHLELPFPQLALF++FVPLFVRN  YDNIADNRY LFGRSESCEI
Subjt:  SGRAPDDISSVVLVEKDRSYIKSEAVLRIMEHLELPFPQLALFIQFVPLFVRNFVYDNIADNRYRLFGRSESCEI

XP_023527546.1 DCC family protein At1g52590, chloroplastic, partial [Cucurbita pepo subsp. pepo] (e value: 1.4e-76, % identity: 87.43)
Query:  MALRLSGGVPASLVPPSRAQARTRHSMAFASLSPATKDETIDWVEATSDFFGKDTRPIMLFDGVCNLCNGGVRFVRANDRNRRIRLEALQSEAGKKLLRR
        MAL LSG VPA  V PS A  R R  +AF SLS AT D TIDWVEATSDFF KDTRPIMLFDGVCNLCNGGVRFVRANDRNRRIRLEALQSEAGKKLLRR
Subjt:  MALRLSGGVPASLVPPSRAQARTRHSMAFASLSPATKDETIDWVEATSDFFGKDTRPIMLFDGVCNLCNGGVRFVRANDRNRRIRLEALQSEAGKKLLRR

Query:  SGRAPDDISSVVLVEKDRSYIKSEAVLRIMEHLELPFPQLALFIQFVPLFVRNFVYDNIADNRYRLFGRSESCEI
        SGRAPDDISSVVLVEKDRSYIKSEAVLRIMEHLELPFPQLALF++FVPLFVRN  YDNIADNRY LFGRSESCEI
Subjt:  SGRAPDDISSVVLVEKDRSYIKSEAVLRIMEHLELPFPQLALFIQFVPLFVRNFVYDNIADNRYRLFGRSESCEI

XP_038900615.1 DCC family protein At1g52590, chloroplastic [Benincasa hispida] (e value: 1.2e-80, % identity: 89.71)
Query:  MALRLSGGVPASLVPPSRAQARTRHSMAFASLSPATKDETIDWVEATSDFFGKDTRPIMLFDGVCNLCNGGVRFVRANDRNRRIRLEALQSEAGKKLLRR
        MAL LSGG+PA  VPPS A+AR  H+++ ASLSP TKDETIDWVEATSDFF KDTRPIMLFDGVCNLCNGGVRFVRANDRNRRIRLEALQSEAGKKLLRR
Subjt:  MALRLSGGVPASLVPPSRAQARTRHSMAFASLSPATKDETIDWVEATSDFFGKDTRPIMLFDGVCNLCNGGVRFVRANDRNRRIRLEALQSEAGKKLLRR

Query:  SGRAPDDISSVVLVEKDRSYIKSEAVLRIMEHLELPFPQLALFIQFVPLFVRNFVYDNIADNRYRLFGRSESCEI
        SGRAPDDISSVVLVEKDRSYIKSEAVLRIME+LELPFPQLALF+QFVPLFVRN VYDNIADNRY LFGRSESCEI
Subjt:  SGRAPDDISSVVLVEKDRSYIKSEAVLRIMEHLELPFPQLALFIQFVPLFVRNFVYDNIADNRYRLFGRSESCEI

TrEMBL top hits (columns: e value, % identity, alignment)
A0A0A0KVF0 Uncharacterized protein (e value: 2.0e-76, % identity: 86.29)
Query:  MALRLSGGVPASLVPPSRAQARTRHSMAFASLSPATKDETIDWVEATSDFFGKDTRPIMLFDGVCNLCNGGVRFVRANDRNRRIRLEALQSEAGKKLLRR
        MAL LSG  PA  +PPS  +A   H +A ASLSP  KDETIDWVEATS+FF KDTRPIMLFDGVCNLCNGGVRFVRANDRNRRIRLEALQS+AGKKLLRR
Subjt:  MALRLSGGVPASLVPPSRAQARTRHSMAFASLSPATKDETIDWVEATSDFFGKDTRPIMLFDGVCNLCNGGVRFVRANDRNRRIRLEALQSEAGKKLLRR

Query:  SGRAPDDISSVVLVEKDRSYIKSEAVLRIMEHLELPFPQLALFIQFVPLFVRNFVYDNIADNRYRLFGRSESCEI
        SGRAPDDISSVVLVEKDRSYIKSEAVLRIME+LELPFPQLALF+QFVPLFVRN VYDNIADNRY LFGRSESCEI
Subjt:  SGRAPDDISSVVLVEKDRSYIKSEAVLRIMEHLELPFPQLALFIQFVPLFVRNFVYDNIADNRYRLFGRSESCEI

A0A5D3E0H0 DCC family protein (e value: 7.6e-76, % identity: 86.36)
Query:  MALRLSGGVPASLVPPS-RAQARTRHSMAFASLSPATKDETIDWVEATSDFFGKDTRPIMLFDGVCNLCNGGVRFVRANDRNRRIRLEALQSEAGKKLLR
        MAL LSG  PA  +PPS R      H +A ASLSP TKDETIDWVEATSDFF KDTRPIMLFDGVCNLCNGGVRFVRANDRNRRIRLEALQS+AGKKLLR
Subjt:  MALRLSGGVPASLVPPS-RAQARTRHSMAFASLSPATKDETIDWVEATSDFFGKDTRPIMLFDGVCNLCNGGVRFVRANDRNRRIRLEALQSEAGKKLLR

Query:  RSGRAPDDISSVVLVEKDRSYIKSEAVLRIMEHLELPFPQLALFIQFVPLFVRNFVYDNIADNRYRLFGRSESCEI
        RSGRAPDDISSVVLVEKDRSYIKSEAVLRIME+LELPFPQLA F+QFVPLFVRN VYDNIADNRY LFGRSESCEI
Subjt:  RSGRAPDDISSVVLVEKDRSYIKSEAVLRIMEHLELPFPQLALFIQFVPLFVRNFVYDNIADNRYRLFGRSESCEI

A0A6J1DJB0 DCC family protein At1g52590, chloroplastic (e value: 4.6e-81, % identity: 87.43)
Query:  MALRLSGGVPASLVPPSRAQARTRHSMAFASLSPATKDETIDWVEATSDFFGKDTRPIMLFDGVCNLCNGGVRFVRANDRNRRIRLEALQSEAGKKLLRR
        MA+ + G +PA  +PP +AQAR RHS+AFASLSPATK ETIDW+EATSDFF KD+RPIMLFDGVCNLCNGGVRFVRANDRNRRIRLEALQSEAGKKLLRR
Subjt:  MALRLSGGVPASLVPPSRAQARTRHSMAFASLSPATKDETIDWVEATSDFFGKDTRPIMLFDGVCNLCNGGVRFVRANDRNRRIRLEALQSEAGKKLLRR

Query:  SGRAPDDISSVVLVEKDRSYIKSEAVLRIMEHLELPFPQLALFIQFVPLFVRNFVYDNIADNRYRLFGRSESCEI
        SGRAPDDISSVVLVEKDRSYIKSEAV+RIMEHLELPFPQLA F+QFVPLFVRNFVY+N+ADNRYRLFGRSESCEI
Subjt:  SGRAPDDISSVVLVEKDRSYIKSEAVLRIMEHLELPFPQLALFIQFVPLFVRNFVYDNIADNRYRLFGRSESCEI

A0A6J1GW36 DCC family protein At1g52590, chloroplastic (e value: 1.2e-76, % identity: 86.86)
Query:  MALRLSGGVPASLVPPSRAQARTRHSMAFASLSPATKDETIDWVEATSDFFGKDTRPIMLFDGVCNLCNGGVRFVRANDRNRRIRLEALQSEAGKKLLRR
        MAL +SG VPA  V PS A  R R  +AF SLS AT D TIDWVEATSDFF KDTRPIMLFDGVCNLCNGGVRFVRANDRNRRIRLEALQSEAGKKLLRR
Subjt:  MALRLSGGVPASLVPPSRAQARTRHSMAFASLSPATKDETIDWVEATSDFFGKDTRPIMLFDGVCNLCNGGVRFVRANDRNRRIRLEALQSEAGKKLLRR

Query:  SGRAPDDISSVVLVEKDRSYIKSEAVLRIMEHLELPFPQLALFIQFVPLFVRNFVYDNIADNRYRLFGRSESCEI
        SGRAPDDISSVVLVEKDRSYIKSEAVLRIMEHLELPFPQLALF++FVPLFVRN  YDNIADNRY LFGRSESCEI
Subjt:  SGRAPDDISSVVLVEKDRSYIKSEAVLRIMEHLELPFPQLALFIQFVPLFVRNFVYDNIADNRYRLFGRSESCEI

A0A6J1IYJ5 DCC family protein At1g52590, chloroplastic (e value: 6.9e-77, % identity: 87.43)
Query:  MALRLSGGVPASLVPPSRAQARTRHSMAFASLSPATKDETIDWVEATSDFFGKDTRPIMLFDGVCNLCNGGVRFVRANDRNRRIRLEALQSEAGKKLLRR
        MAL LSG VPA  V PS A  R R  +AF SLS AT D TIDWVEATSDFF KDTRPIMLFDGVCNLCNGGVRFVRANDRNRRIRLEALQSEAGKKLLRR
Subjt:  MALRLSGGVPASLVPPSRAQARTRHSMAFASLSPATKDETIDWVEATSDFFGKDTRPIMLFDGVCNLCNGGVRFVRANDRNRRIRLEALQSEAGKKLLRR

Query:  SGRAPDDISSVVLVEKDRSYIKSEAVLRIMEHLELPFPQLALFIQFVPLFVRNFVYDNIADNRYRLFGRSESCEI
        SGRAPDDISSVVLVEKDRSYIKSEAVLRIMEHLELPFPQLALF++FVPLFVRN  YDNIADNRY LFGRSESCEI
Subjt:  SGRAPDDISSVVLVEKDRSYIKSEAVLRIMEHLELPFPQLALFIQFVPLFVRNFVYDNIADNRYRLFGRSESCEI

SwissProt top hits (columns: e value, % identity, alignment)
P40761 Uncharacterized protein YuxK (e value: 3.3e-20, % identity: 43.97)
Query:  IMLFDGVCNLCNGGVRFVRANDRNRRIRLEALQSEAGKKLLRRSGRAPDDISSVVLVEKDRSYIKSEAVLRIMEHLELPFPQLALFIQFVPLFVRNFVYD
        ++LFDGVCNLCNG V+F+   D +  I   +LQSE G+ LL++SG   D   S V +E  + Y KS A +++  HL  P+    LF   VP  VR+ VY 
Subjt:  IMLFDGVCNLCNGGVRFVRANDRNRRIRLEALQSEAGKKLLRRSGRAPDDISSVVLVEKDRSYIKSEAVLRIMEHLELPFPQLALFIQFVPLFVRNFVYD

Query:  NIADNRYRLFGRSESC
         IA NRY+ FG+   C
Subjt:  NIADNRYRLFGRSESC

Q9SSR1 DCC family protein At1g52590, chloroplastic (e value: 8.1e-59, % identity: 69.01)
Query:  VPAS---LVPPSRAQARTRHSMAFASLSPAT-KDETIDWVEATSDFFGKDTRPIMLFDGVCNLCNGGVRFVRANDRNRRIRLEALQSEAGKKLLRRSGRA
        +PAS   L   SRAQ R R S   AS +  T + +++DWV+ TS FF +D RPIMLFDGVCNLCNGGV+FVR +DRNR IR EALQSEAGKKLL RSGRA
Subjt:  VPAS---LVPPSRAQARTRHSMAFASLSPAT-KDETIDWVEATSDFFGKDTRPIMLFDGVCNLCNGGVRFVRANDRNRRIRLEALQSEAGKKLLRRSGRA

Query:  PDDISSVVLVEKDRSYIKSEAVLRIMEHLELPFPQLALFIQFVPLFVRNFVYDNIADNRYRLFGRSESCEI
        PDDISSVVLVE DRSYIKSEAVL+IM++++LPFPQLA F+QF PLFVR+F+Y+N+A+NRY +FGRS+SCE+
Subjt:  PDDISSVVLVEKDRSYIKSEAVLRIMEHLELPFPQLALFIQFVPLFVRNFVYDNIADNRYRLFGRSESCEI

Arabidopsis top hits (columns: e value, % identity, alignment)
AT1G24095.1 Putative thiol-disulphide oxidoreductase DCC (e value: 6.7e-16, % identity: 37.29)
Query:  IMLFDGVCNLCNGGVRFVRANDRNRRIRLEALQSEAGKKLLRRSGRAPDDISSVVLVEKDRSYI--KSEAVLRIMEHLELPFPQLALFIQFVPLFVRNFV
        ++++DGVC+LC+GGV+++   D+ R+I+   LQS+A +  L  SG   +D+    L  +   +    S A LR++ +L LP+  L  F   VP  +R+ V
Subjt:  IMLFDGVCNLCNGGVRFVRANDRNRRIRLEALQSEAGKKLLRRSGRAPDDISSVVLVEKDRSYI--KSEAVLRIMEHLELPFPQLALFIQFVPLFVRNFV

Query:  YDNIADNRYRLFGRSESC
        YD +A NRY  FG++E C
Subjt:  YDNIADNRYRLFGRSESC

AT1G52590.1 Putative thiol-disulphide oxidoreductase DCC (e value: 5.8e-60, % identity: 69.01)
Query:  VPAS---LVPPSRAQARTRHSMAFASLSPAT-KDETIDWVEATSDFFGKDTRPIMLFDGVCNLCNGGVRFVRANDRNRRIRLEALQSEAGKKLLRRSGRA
        +PAS   L   SRAQ R R S   AS +  T + +++DWV+ TS FF +D RPIMLFDGVCNLCNGGV+FVR +DRNR IR EALQSEAGKKLL RSGRA
Subjt:  VPAS---LVPPSRAQARTRHSMAFASLSPAT-KDETIDWVEATSDFFGKDTRPIMLFDGVCNLCNGGVRFVRANDRNRRIRLEALQSEAGKKLLRRSGRA

Query:  PDDISSVVLVEKDRSYIKSEAVLRIMEHLELPFPQLALFIQFVPLFVRNFVYDNIADNRYRLFGRSESCEI
        PDDISSVVLVE DRSYIKSEAVL+IM++++LPFPQLA F+QF PLFVR+F+Y+N+A+NRY +FGRS+SCE+
Subjt:  PDDISSVVLVEKDRSYIKSEAVLRIMEHLELPFPQLALFIQFVPLFVRNFVYDNIADNRYRLFGRSESCEI


Sequences
CDS sequence
ATGGCACTTCGGCTTTCCGGTGGAGTTCCGGCCAGTCTTGTTCCGCCGTCCAGGGCGCAAGCTCGAACACGGCATTCAATGGCTTTCGCCTCTCTATCTCCGGCAACTAA
AGATGAAACGATAGATTGGGTGGAAGCAACTTCCGACTTCTTCGGAAAAGATACGAGACCGATTATGCTATTTGACGGTGTATGCAATTTGTGTAATGGAGGTGTCAGAT
TTGTTCGTGCTAATGACCGAAACAGGAGAATACGATTAGAGGCTCTCCAAAGTGAAGCTGGCAAGAAGCTGTTGAGAAGGTCAGGAAGAGCACCGGATGATATTTCGAGT
GTTGTACTTGTTGAAAAGGACAGATCTTATATCAAGTCTGAGGCTGTGCTAAGAATAATGGAACATTTAGAGTTACCCTTTCCTCAGCTGGCCCTTTTTATCCAGTTTGT
GCCTCTGTTTGTTCGAAATTTTGTGTATGACAATATTGCAGATAATCGTTATAGGCTATTTGGACGCTCGGAGTCGTGTGAAATATAG
mRNA sequence
ATGGCACTTCGGCTTTCCGGTGGAGTTCCGGCCAGTCTTGTTCCGCCGTCCAGGGCGCAAGCTCGAACACGGCATTCAATGGCTTTCGCCTCTCTATCTCCGGCAACTAA
AGATGAAACGATAGATTGGGTGGAAGCAACTTCCGACTTCTTCGGAAAAGATACGAGACCGATTATGCTATTTGACGGTGTATGCAATTTGTGTAATGGAGGTGTCAGAT
TTGTTCGTGCTAATGACCGAAACAGGAGAATACGATTAGAGGCTCTCCAAAGTGAAGCTGGCAAGAAGCTGTTGAGAAGGTCAGGAAGAGCACCGGATGATATTTCGAGT
GTTGTACTTGTTGAAAAGGACAGATCTTATATCAAGTCTGAGGCTGTGCTAAGAATAATGGAACATTTAGAGTTACCCTTTCCTCAGCTGGCCCTTTTTATCCAGTTTGT
GCCTCTGTTTGTTCGAAATTTTGTGTATGACAATATTGCAGATAATCGTTATAGGCTATTTGGACGCTCGGAGTCGTGTGAAATATAG
Protein sequence
MALRLSGGVPASLVPPSRAQARTRHSMAFASLSPATKDETIDWVEATSDFFGKDTRPIMLFDGVCNLCNGGVRFVRANDRNRRIRLEALQSEAGKKLLRRSGRAPDDISS
VVLVEKDRSYIKSEAVLRIMEHLELPFPQLALFIQFVPLFVRNFVYDNIADNRYRLFGRSESCEI
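
The protein sequence should correspond to a translation of the CDS. A minimal sketch of that check, assuming the standard genetic code applies to this nuclear gene (the page does not state which translation table was used), is shown here on the first and last few codons of the CDS. `translate` is a hypothetical helper written for illustration.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G
# (third position varies fastest). '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE[seq[i:i + 3]]  # KeyError on non-ACGT symbols
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First ten codons of the CDS listed above.
print(translate("ATGGCACTTCGGCTTTCCGGTGGAGTTCCG"))
# Last four codons of the CDS, ending in the stop codon TAG.
print(translate("TGTGAAATATAG"))
```

The two fragments translate to `MALRLSGGVP` and `CEI`, which match the start and end of the protein sequence. Passing the full CDS, with its line breaks, to `translate` would allow the same comparison over the whole sequence.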