; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS015099 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS015099
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionThiol-disulfide oxidoreductase DCC
Genome locationscaffold2:1139555..1141067
RNA-Seq ExpressionMS015099
SyntenyMS015099
Gene Ontology termsGO:0015035 - protein disulfide oxidoreductase activity (molecular function)
InterPro domainsIPR007263 - DCC1-like thiol-disulfide oxidoreductase family


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6601373.1 hypothetical protein SDJN03_06606, partial [Cucurbita argyrosperma subsp. sororia]1.3e-8970.41Show/hide
Query:  MASKLLTNRFRHLLFSSTAKIRASSFTTASSSPPFRCKSSSKSSPASPSSAGIDCATAA--DIADVPEEGFEDLVDAVSPASTASVANTMI-PTLLQPRV
        M +KLL NRFRHL F+ST   RA+SFT  SSS PFRC S SK S  S  S+GIDC+TAA  DIADVPEE  E+L DAVSPA+  S A  ++ P+LLQPRV
Subjt:  MASKLLTNRFRHLLFSSTAKIRASSFTTASSSPPFRCKSSSKSSPASPSSAGIDCATAA--DIADVPEEGFEDLVDAVSPASTASVANTMI-PTLLQPRV

Query:  VVYDGVCHLCHRDFQHQICRGRITAFCFYPHLVVFLFLAGVKWVIKADKYKKIKFCCLQSRAAEPYLRLSGLDREDISRRFLFVEGYGSYYQASAAALKV
        VVYDGVCHLCHR                           GVKWVIKADKYKKIKFCCLQS+AAEPYLRLSGLDRED+ RRF+F+EG+GSYYQAS AALKV
Subjt:  VVYDGVCHLCHRDFQHQICRGRITAFCFYPHLVVFLFLAGVKWVIKADKYKKIKFCCLQSRAAEPYLRLSGLDREDISRRFLFVEGYGSYYQASAAALKV

Query:  LSYLPLPYSALSAFLIIPTPLRDAVYDHVARHRYDWFGKAEDCLVLQEEDLLERFIDREELLDRHHR
        LSYLPLPYSALS FLIIPTPLRDAVYD VA+HRYD FGKA+DCLVLQEE+LLERFIDREELLD+ HR
Subjt:  LSYLPLPYSALSAFLIIPTPLRDAVYDHVARHRYDWFGKAEDCLVLQEEDLLERFIDREELLDRHHR

XP_022151245.1 uncharacterized protein LOC111019215 isoform X1 [Momordica charantia]2.2e-12189.77Show/hide
Query:  MASKLLTNRFRHLLFSSTAKIRASSFTTASSSPPFRCKSSSKSSPASPSSAGIDCATAADIADVPEEGFEDLVDAVSPASTASVANTMIPTLLQPRVVVY
        MASKLLTNRFRHLLFSSTAKIRASSFTTASSSPPFRCKSSSKSSPASPSSAGIDCATAADIADVPEEGFEDLVDAVSPASTASVANTMIPTLLQPRVVVY
Subjt:  MASKLLTNRFRHLLFSSTAKIRASSFTTASSSPPFRCKSSSKSSPASPSSAGIDCATAADIADVPEEGFEDLVDAVSPASTASVANTMIPTLLQPRVVVY

Query:  DGVCHLCHRDFQHQICRGRITAFCFYPHLVVFLFLAGVKWVIKADKYKKIKFCCLQSRAAEPYLRLSGLDREDISRRFLFVEGYGSYYQASAAALKVLSY
        DGVCHLCHR                           GVKWVIKADKYKKIKFCCLQSRAAEPYLRLSGLDREDISRRFLFVEGYGSYYQASAAALKVLSY
Subjt:  DGVCHLCHRDFQHQICRGRITAFCFYPHLVVFLFLAGVKWVIKADKYKKIKFCCLQSRAAEPYLRLSGLDREDISRRFLFVEGYGSYYQASAAALKVLSY

Query:  LPLPYSALSAFLIIPTPLRDAVYDHVARHRYDWFGKAEDCLVLQEEDLLERFIDREELLDRHHR
        LPLPYSALSAFLIIPTPLRDAVYDHVARHRYDWFGKAEDCLVLQEEDLLERFIDREELLDRHHR
Subjt:  LPLPYSALSAFLIIPTPLRDAVYDHVARHRYDWFGKAEDCLVLQEEDLLERFIDREELLDRHHR

XP_022956594.1 uncharacterized protein LOC111458284 [Cucurbita moschata]1.3e-8970.41Show/hide
Query:  MASKLLTNRFRHLLFSSTAKIRASSFTTASSSPPFRCKSSSKSSPASPSSAGIDCATAA--DIADVPEEGFEDLVDAVSPASTASVANTMI-PTLLQPRV
        M +KLL NRFRHL F+ST   RA+SFT  SSS PFRC S SK S  S  S+GIDC+TAA  DIADVPEE  E+L DAVSPA+  S A  ++ P+LLQPRV
Subjt:  MASKLLTNRFRHLLFSSTAKIRASSFTTASSSPPFRCKSSSKSSPASPSSAGIDCATAA--DIADVPEEGFEDLVDAVSPASTASVANTMI-PTLLQPRV

Query:  VVYDGVCHLCHRDFQHQICRGRITAFCFYPHLVVFLFLAGVKWVIKADKYKKIKFCCLQSRAAEPYLRLSGLDREDISRRFLFVEGYGSYYQASAAALKV
        VVYDGVCHLCHR                           GVKWVIKADKYKKIKFCCLQS+AAEPYLRLSGLDRED+ RRF+F+EG+GSYYQAS AALKV
Subjt:  VVYDGVCHLCHRDFQHQICRGRITAFCFYPHLVVFLFLAGVKWVIKADKYKKIKFCCLQSRAAEPYLRLSGLDREDISRRFLFVEGYGSYYQASAAALKV

Query:  LSYLPLPYSALSAFLIIPTPLRDAVYDHVARHRYDWFGKAEDCLVLQEEDLLERFIDREELLDRHHR
        LSYLPLPYSALS FLIIPTPLRDAVYD VA+HRYD FGKA+DCLVLQEE+LLERFIDREELLD+ HR
Subjt:  LSYLPLPYSALSAFLIIPTPLRDAVYDHVARHRYDWFGKAEDCLVLQEEDLLERFIDREELLDRHHR

XP_022997576.1 uncharacterized protein LOC111492437 [Cucurbita maxima]1.5e-9070.04Show/hide
Query:  MASKLLTNRFRHLLFSSTAKIRASSFTTASSSPPFRCKSSSKSSPASPSSAGIDCATAA--DIADVPEEGFEDLVDAVSP-ASTASVANTMIPTLLQPRV
        M +KLL NRFRHL F+ST   RA+SFT  SSS PFRC   SK S +SPSS+GIDC+TAA  DIADVPEE  E+L D VSP A+ +S    + P+LLQPRV
Subjt:  MASKLLTNRFRHLLFSSTAKIRASSFTTASSSPPFRCKSSSKSSPASPSSAGIDCATAA--DIADVPEEGFEDLVDAVSP-ASTASVANTMIPTLLQPRV

Query:  VVYDGVCHLCHRDFQHQICRGRITAFCFYPHLVVFLFLAGVKWVIKADKYKKIKFCCLQSRAAEPYLRLSGLDREDISRRFLFVEGYGSYYQASAAALKV
        VVYDGVCHLCHR                           GVKWVIKADKYKKIKFCC QS+AAEPYLRLSGLDRED+ RRF+F+EG+GSYYQAS AALKV
Subjt:  VVYDGVCHLCHRDFQHQICRGRITAFCFYPHLVVFLFLAGVKWVIKADKYKKIKFCCLQSRAAEPYLRLSGLDREDISRRFLFVEGYGSYYQASAAALKV

Query:  LSYLPLPYSALSAFLIIPTPLRDAVYDHVARHRYDWFGKAEDCLVLQEEDLLERFIDREELLDRHHR
        LSYLPLPYSALSAFLIIPTPLRDA+YD VA+HRYD FGKAEDCLVLQEE+LLERFIDREELLD+ HR
Subjt:  LSYLPLPYSALSAFLIIPTPLRDAVYDHVARHRYDWFGKAEDCLVLQEEDLLERFIDREELLDRHHR

XP_023549628.1 uncharacterized protein LOC111808071 [Cucurbita pepo subsp. pepo]4.0e-9170.41Show/hide
Query:  MASKLLTNRFRHLLFSSTAKIRASSFTTASSSPPFRCKSSSKSSPASPSSAGIDCATAA--DIADVPEEGFEDLVDAVSP-ASTASVANTMIPTLLQPRV
        M +KLL NRFRHL F+ST   RA+SFT  SSS PFRC   SK S +SPSS+GIDC+TAA  DIADVPEE  E+L DAVSP A+ +S    + P+LLQPRV
Subjt:  MASKLLTNRFRHLLFSSTAKIRASSFTTASSSPPFRCKSSSKSSPASPSSAGIDCATAA--DIADVPEEGFEDLVDAVSP-ASTASVANTMIPTLLQPRV

Query:  VVYDGVCHLCHRDFQHQICRGRITAFCFYPHLVVFLFLAGVKWVIKADKYKKIKFCCLQSRAAEPYLRLSGLDREDISRRFLFVEGYGSYYQASAAALKV
        VVYDGVCHLCHR                           GVKWVIKAD+YKKIKFCCLQS+AAEPYLRLSGLDRED+ RRF+F+EG+GSYYQAS AALKV
Subjt:  VVYDGVCHLCHRDFQHQICRGRITAFCFYPHLVVFLFLAGVKWVIKADKYKKIKFCCLQSRAAEPYLRLSGLDREDISRRFLFVEGYGSYYQASAAALKV

Query:  LSYLPLPYSALSAFLIIPTPLRDAVYDHVARHRYDWFGKAEDCLVLQEEDLLERFIDREELLDRHHR
        LSYLPLPYSALSAFLIIPTPLRDA+YD VA+HRYD FGKAEDCLVLQEE+LLERFIDREELLD+ HR
Subjt:  LSYLPLPYSALSAFLIIPTPLRDAVYDHVARHRYDWFGKAEDCLVLQEEDLLERFIDREELLDRHHR

TrEMBL top hitse value%identityAlignment
A0A0A0KVQ2 Uncharacterized protein4.8e-8265.67Show/hide
Query:  MASKLLTNRFRHLLFSSTAKIRASSFTTASSSPPFRCKSSSKSSPASPSSAGIDCATA--ADIADVPEEGFEDLVDAVSPASTAS--VANTMIPTLLQPR
        MASKLL+NRFR L+ +STA IR +SFTT+ S   FR   SSK S    SS GIDC+TA  ADIADVPEE  +D VD VSPA   +   A  + PTLLQPR
Subjt:  MASKLLTNRFRHLLFSSTAKIRASSFTTASSSPPFRCKSSSKSSPASPSSAGIDCATA--ADIADVPEEGFEDLVDAVSPASTAS--VANTMIPTLLQPR

Query:  VVVYDGVCHLCHRDFQHQICRGRITAFCFYPHLVVFLFLAGVKWVIKADKYKKIKFCCLQSRAAEPYLRLSGLDREDISRRFLFVEGYGSYYQASAAALK
        VV+YDGVCHLCHR                           GVKWVIK DKYKKIKFCCLQS+ AEPYLRLSGLDRED+S RF+F+EG+GSY+QAS AAL+
Subjt:  VVVYDGVCHLCHRDFQHQICRGRITAFCFYPHLVVFLFLAGVKWVIKADKYKKIKFCCLQSRAAEPYLRLSGLDREDISRRFLFVEGYGSYYQASAAALK

Query:  VLSYLPLPYSALSAFLIIPTPLRDAVYDHVARHRYDWFGKAEDCLVLQEEDLLERFIDREELLDRHHR
        VLSYLPLPYSALSAFLIIPTPLRD++YD VARHRYD F KAE CLVLQ+E+LLERFIDREELL++ H+
Subjt:  VLSYLPLPYSALSAFLIIPTPLRDAVYDHVARHRYDWFGKAEDCLVLQEEDLLERFIDREELLDRHHR

A0A6J1DAN6 uncharacterized protein LOC111019215 isoform X11.1e-12189.77Show/hide
Query:  MASKLLTNRFRHLLFSSTAKIRASSFTTASSSPPFRCKSSSKSSPASPSSAGIDCATAADIADVPEEGFEDLVDAVSPASTASVANTMIPTLLQPRVVVY
        MASKLLTNRFRHLLFSSTAKIRASSFTTASSSPPFRCKSSSKSSPASPSSAGIDCATAADIADVPEEGFEDLVDAVSPASTASVANTMIPTLLQPRVVVY
Subjt:  MASKLLTNRFRHLLFSSTAKIRASSFTTASSSPPFRCKSSSKSSPASPSSAGIDCATAADIADVPEEGFEDLVDAVSPASTASVANTMIPTLLQPRVVVY

Query:  DGVCHLCHRDFQHQICRGRITAFCFYPHLVVFLFLAGVKWVIKADKYKKIKFCCLQSRAAEPYLRLSGLDREDISRRFLFVEGYGSYYQASAAALKVLSY
        DGVCHLCHR                           GVKWVIKADKYKKIKFCCLQSRAAEPYLRLSGLDREDISRRFLFVEGYGSYYQASAAALKVLSY
Subjt:  DGVCHLCHRDFQHQICRGRITAFCFYPHLVVFLFLAGVKWVIKADKYKKIKFCCLQSRAAEPYLRLSGLDREDISRRFLFVEGYGSYYQASAAALKVLSY

Query:  LPLPYSALSAFLIIPTPLRDAVYDHVARHRYDWFGKAEDCLVLQEEDLLERFIDREELLDRHHR
        LPLPYSALSAFLIIPTPLRDAVYDHVARHRYDWFGKAEDCLVLQEEDLLERFIDREELLDRHHR
Subjt:  LPLPYSALSAFLIIPTPLRDAVYDHVARHRYDWFGKAEDCLVLQEEDLLERFIDREELLDRHHR

A0A6J1DE19 uncharacterized protein LOC111019215 isoform X23.6e-8571.59Show/hide
Query:  MASKLLTNRFRHLLFSSTAKIRASSFTTASSSPPFRCKSSSKSSPASPSSAGIDCATAADIADVPEEGFEDLVDAVSPASTASVANTMIPTLLQPRVVVY
        MASKLLTNRFRHLLFSSTAKIRASSFTTASSSPPFRCKSSSKSSPASPSSAGIDCATAADIADVPEEGFEDLVDAVSPASTASVANTMIPTLLQPRVVVY
Subjt:  MASKLLTNRFRHLLFSSTAKIRASSFTTASSSPPFRCKSSSKSSPASPSSAGIDCATAADIADVPEEGFEDLVDAVSPASTASVANTMIPTLLQPRVVVY

Query:  DGVCHLCHRDFQHQICRGRITAFCFYPHLVVFLFLAGVKWVIKADKYKKIKFCCLQSRAAEPYLRLSGLDREDISRRFLFVEGYGSYYQASAAALKVLSY
        DGVCHLCHR                           GVKWVIKADKYKKIKFCCLQSRAAEPYLRLSGLDREDISRRFLFVEGYGSYYQASAAALK    
Subjt:  DGVCHLCHRDFQHQICRGRITAFCFYPHLVVFLFLAGVKWVIKADKYKKIKFCCLQSRAAEPYLRLSGLDREDISRRFLFVEGYGSYYQASAAALKVLSY

Query:  LPLPYSALSAFLIIPTPLRDAVYDHVARHRYDWFGKAEDCLVLQEEDLLERFIDREELLDRHHR
                                                    EEDLLERFIDREELLDRHHR
Subjt:  LPLPYSALSAFLIIPTPLRDAVYDHVARHRYDWFGKAEDCLVLQEEDLLERFIDREELLDRHHR

A0A6J1GZH8 uncharacterized protein LOC1114582846.3e-9070.41Show/hide
Query:  MASKLLTNRFRHLLFSSTAKIRASSFTTASSSPPFRCKSSSKSSPASPSSAGIDCATAA--DIADVPEEGFEDLVDAVSPASTASVANTMI-PTLLQPRV
        M +KLL NRFRHL F+ST   RA+SFT  SSS PFRC S SK S  S  S+GIDC+TAA  DIADVPEE  E+L DAVSPA+  S A  ++ P+LLQPRV
Subjt:  MASKLLTNRFRHLLFSSTAKIRASSFTTASSSPPFRCKSSSKSSPASPSSAGIDCATAA--DIADVPEEGFEDLVDAVSPASTASVANTMI-PTLLQPRV

Query:  VVYDGVCHLCHRDFQHQICRGRITAFCFYPHLVVFLFLAGVKWVIKADKYKKIKFCCLQSRAAEPYLRLSGLDREDISRRFLFVEGYGSYYQASAAALKV
        VVYDGVCHLCHR                           GVKWVIKADKYKKIKFCCLQS+AAEPYLRLSGLDRED+ RRF+F+EG+GSYYQAS AALKV
Subjt:  VVYDGVCHLCHRDFQHQICRGRITAFCFYPHLVVFLFLAGVKWVIKADKYKKIKFCCLQSRAAEPYLRLSGLDREDISRRFLFVEGYGSYYQASAAALKV

Query:  LSYLPLPYSALSAFLIIPTPLRDAVYDHVARHRYDWFGKAEDCLVLQEEDLLERFIDREELLDRHHR
        LSYLPLPYSALS FLIIPTPLRDAVYD VA+HRYD FGKA+DCLVLQEE+LLERFIDREELLD+ HR
Subjt:  LSYLPLPYSALSAFLIIPTPLRDAVYDHVARHRYDWFGKAEDCLVLQEEDLLERFIDREELLDRHHR

A0A6J1K7U9 uncharacterized protein LOC1114924377.4e-9170.04Show/hide
Query:  MASKLLTNRFRHLLFSSTAKIRASSFTTASSSPPFRCKSSSKSSPASPSSAGIDCATAA--DIADVPEEGFEDLVDAVSP-ASTASVANTMIPTLLQPRV
        M +KLL NRFRHL F+ST   RA+SFT  SSS PFRC   SK S +SPSS+GIDC+TAA  DIADVPEE  E+L D VSP A+ +S    + P+LLQPRV
Subjt:  MASKLLTNRFRHLLFSSTAKIRASSFTTASSSPPFRCKSSSKSSPASPSSAGIDCATAA--DIADVPEEGFEDLVDAVSP-ASTASVANTMIPTLLQPRV

Query:  VVYDGVCHLCHRDFQHQICRGRITAFCFYPHLVVFLFLAGVKWVIKADKYKKIKFCCLQSRAAEPYLRLSGLDREDISRRFLFVEGYGSYYQASAAALKV
        VVYDGVCHLCHR                           GVKWVIKADKYKKIKFCC QS+AAEPYLRLSGLDRED+ RRF+F+EG+GSYYQAS AALKV
Subjt:  VVYDGVCHLCHRDFQHQICRGRITAFCFYPHLVVFLFLAGVKWVIKADKYKKIKFCCLQSRAAEPYLRLSGLDREDISRRFLFVEGYGSYYQASAAALKV

Query:  LSYLPLPYSALSAFLIIPTPLRDAVYDHVARHRYDWFGKAEDCLVLQEEDLLERFIDREELLDRHHR
        LSYLPLPYSALSAFLIIPTPLRDA+YD VA+HRYD FGKAEDCLVLQEE+LLERFIDREELLD+ HR
Subjt:  LSYLPLPYSALSAFLIIPTPLRDAVYDHVARHRYDWFGKAEDCLVLQEEDLLERFIDREELLDRHHR

SwissProt top hitse value%identityAlignment
P40761 Uncharacterized protein YuxK2.2e-1533.75Show/hide
Query:  RVVVYDGVCHLCHRDFQHQICRGRITAFCFYPHLVVFLFLAGVKWVIKADKYKKIKFCCLQSRAAEPYLRLSGL--DREDISRRFLFVEGYGSYYQASAA
        RV+++DGVC+LC+                             V+++IK D    I F  LQS   +  L+ SGL  DR D    F+F+E  G  Y  S A
Subjt:  RVVVYDGVCHLCHRDFQHQICRGRITAFCFYPHLVVFLFLAGVKWVIKADKYKKIKFCCLQSRAAEPYLRLSGL--DREDISRRFLFVEGYGSYYQASAA

Query:  ALKVLSYLPLPYSALSAFLIIPTPLRDAVYDHVARHRYDWFGKAEDCLVLQEEDLLERFI
        A+KV  +L  P+     F  +P P+RD VY  +A++RY WFGK  +C+ L    + +RF+
Subjt:  ALKVLSYLPLPYSALSAFLIIPTPLRDAVYDHVARHRYDWFGKAEDCLVLQEEDLLERFI

Q9SSR1 DCC family protein At1g52590, chloroplastic1.5e-1133.1Show/hide
Query:  VVVYDGVCHLCHRDFQHQICRGRITAFCFYPHLVVFLFLAGVKWVIKADKYKKIKFCCLQSRAAEPYLRLSGLDREDISRRFLFVEGYGSYYQASAAALK
        ++++DGVC+LC+                            GVK+V   D+ + I+F  LQS A +  L  SG   +DIS   + VE   SY + S A LK
Subjt:  VVVYDGVCHLCHRDFQHQICRGRITAFCFYPHLVVFLFLAGVKWVIKADKYKKIKFCCLQSRAAEPYLRLSGLDREDISRRFLFVEGYGSYYQASAAALK

Query:  VLSYLPLPYSALSAFL-IIPTPLRDAVYDHVARHRYDWFGKAEDC
        ++ Y+ LP+  L+ FL   P  +RD +Y++VA +RY  FG+++ C
Subjt:  VLSYLPLPYSALSAFL-IIPTPLRDAVYDHVARHRYDWFGKAEDC

Arabidopsis top hitse value%identityAlignment
AT1G24095.1 Putative thiol-disulphide oxidoreductase DCC1.2e-6154.79Show/hide
Query:  SSPASPSSAGIDCATAADIADVPEEGFEDLVDAVSPASTASVANTMIPTLLQPRVVVYDGVCHLCHRDFQHQICRGRITAFCFYPHLVVFLFLAGVKWVI
        S+P+   S G    T A+I        +++   + PA + +    ++P  LQPRVVVYDGVCHLCH                            GVKW+I
Subjt:  SSPASPSSAGIDCATAADIADVPEEGFEDLVDAVSPASTASVANTMIPTLLQPRVVVYDGVCHLCHRDFQHQICRGRITAFCFYPHLVVFLFLAGVKWVI

Query:  KADKYKKIKFCCLQSRAAEPYLRLSGLDREDISRRFLFVEGYGSYYQASAAALKVLSYLPLPYSALSAFLIIPTPLRDAVYDHVARHRYDWFGKAEDCLV
        KADKY+KIKFCCLQS+AAEPYL +SG+ RED+ +RFLF+EG G Y+QAS AAL+V+SYLPLPYSAL+AF I+PTPLRD+VYD+VA++RYDWFGKAEDCLV
Subjt:  KADKYKKIKFCCLQSRAAEPYLRLSGLDREDISRRFLFVEGYGSYYQASAAALKVLSYLPLPYSALSAFLIIPTPLRDAVYDHVARHRYDWFGKAEDCLV

Query:  LQEEDLLERFIDREELLDR
        L +++LLERFIDR+EL++R
Subjt:  LQEEDLLERFIDREELLDR

AT1G52590.1 Putative thiol-disulphide oxidoreductase DCC1.0e-1233.1Show/hide
Query:  VVVYDGVCHLCHRDFQHQICRGRITAFCFYPHLVVFLFLAGVKWVIKADKYKKIKFCCLQSRAAEPYLRLSGLDREDISRRFLFVEGYGSYYQASAAALK
        ++++DGVC+LC+                            GVK+V   D+ + I+F  LQS A +  L  SG   +DIS   + VE   SY + S A LK
Subjt:  VVVYDGVCHLCHRDFQHQICRGRITAFCFYPHLVVFLFLAGVKWVIKADKYKKIKFCCLQSRAAEPYLRLSGLDREDISRRFLFVEGYGSYYQASAAALK

Query:  VLSYLPLPYSALSAFL-IIPTPLRDAVYDHVARHRYDWFGKAEDC
        ++ Y+ LP+  L+ FL   P  +RD +Y++VA +RY  FG+++ C
Subjt:  VLSYLPLPYSALSAFL-IIPTPLRDAVYDHVARHRYDWFGKAEDC


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGAGCAAACTTCTGACGAATCGGTTTCGGCACCTTTTGTTCTCATCAACAGCGAAAATCAGAGCATCCTCCTTTACTACTGCATCTTCTTCTCCCCCATTCCGGTG
CAAATCTTCTTCTAAATCTTCACCGGCATCACCTTCTTCCGCCGGCATCGATTGCGCTACCGCTGCGGATATCGCCGATGTTCCTGAAGAAGGTTTTGAGGACCTGGTCG
ACGCCGTCTCTCCTGCTTCCACTGCTTCAGTTGCCAACACTATGATTCCTACTCTCCTCCAGCCTCGTGTCGTTGTCTATGACGGTGTCTGCCATCTCTGCCATCGCGAC
TTTCAGCATCAAATTTGTAGAGGCAGAATTACGGCATTCTGTTTTTATCCTCACTTAGTTGTGTTTTTATTTCTTGCAGGTGTGAAGTGGGTGATAAAAGCAGACAAATA
TAAGAAGATCAAATTTTGCTGCCTTCAGTCCAGAGCTGCTGAACCATATTTGAGACTAAGTGGTCTAGACAGAGAGGATATTAGCCGTCGCTTTTTGTTCGTCGAGGGCT
ACGGTTCATACTACCAAGCTTCTGCCGCTGCTCTGAAGGTATTGTCGTATTTGCCTCTCCCGTATTCAGCTTTGAGCGCATTCTTGATAATTCCAACTCCTCTTAGAGAC
GCTGTGTATGACCATGTTGCTAGACATCGTTATGATTGGTTTGGAAAGGCCGAAGATTGCTTGGTTTTGCAGGAGGAAGATCTGCTTGAGCGTTTTATTGATAGGGAGGA
ACTGCTTGATCGACATCATCGG
mRNA sequenceShow/hide mRNA sequence
ATGGCGAGCAAACTTCTGACGAATCGGTTTCGGCACCTTTTGTTCTCATCAACAGCGAAAATCAGAGCATCCTCCTTTACTACTGCATCTTCTTCTCCCCCATTCCGGTG
CAAATCTTCTTCTAAATCTTCACCGGCATCACCTTCTTCCGCCGGCATCGATTGCGCTACCGCTGCGGATATCGCCGATGTTCCTGAAGAAGGTTTTGAGGACCTGGTCG
ACGCCGTCTCTCCTGCTTCCACTGCTTCAGTTGCCAACACTATGATTCCTACTCTCCTCCAGCCTCGTGTCGTTGTCTATGACGGTGTCTGCCATCTCTGCCATCGCGAC
TTTCAGCATCAAATTTGTAGAGGCAGAATTACGGCATTCTGTTTTTATCCTCACTTAGTTGTGTTTTTATTTCTTGCAGGTGTGAAGTGGGTGATAAAAGCAGACAAATA
TAAGAAGATCAAATTTTGCTGCCTTCAGTCCAGAGCTGCTGAACCATATTTGAGACTAAGTGGTCTAGACAGAGAGGATATTAGCCGTCGCTTTTTGTTCGTCGAGGGCT
ACGGTTCATACTACCAAGCTTCTGCCGCTGCTCTGAAGGTATTGTCGTATTTGCCTCTCCCGTATTCAGCTTTGAGCGCATTCTTGATAATTCCAACTCCTCTTAGAGAC
GCTGTGTATGACCATGTTGCTAGACATCGTTATGATTGGTTTGGAAAGGCCGAAGATTGCTTGGTTTTGCAGGAGGAAGATCTGCTTGAGCGTTTTATTGATAGGGAGGA
ACTGCTTGATCGACATCATCGG
Protein sequenceShow/hide protein sequence
MASKLLTNRFRHLLFSSTAKIRASSFTTASSSPPFRCKSSSKSSPASPSSAGIDCATAADIADVPEEGFEDLVDAVSPASTASVANTMIPTLLQPRVVVYDGVCHLCHRD
FQHQICRGRITAFCFYPHLVVFLFLAGVKWVIKADKYKKIKFCCLQSRAAEPYLRLSGLDREDISRRFLFVEGYGSYYQASAAALKVLSYLPLPYSALSAFLIIPTPLRD
AVYDHVARHRYDWFGKAEDCLVLQEEDLLERFIDREELLDRHHR