; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc09g40060 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc09g40060
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionThiol-disulfide oxidoreductase DCC
Genome locationchr9:30631986..30633504
RNA-Seq ExpressionMoc09g40060
SyntenyMoc09g40060
Gene Ontology termsGO:0015035 - protein disulfide oxidoreductase activity (molecular function)
InterPro domainsIPR007263 - DCC1-like thiol-disulfide oxidoreductase family


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6601373.1 hypothetical protein SDJN03_06606, partial [Cucurbita argyrosperma subsp. sororia]3.5e-9478.33Show/hide
Query:  MASKLLTNRFRHLLFSSTAKIRASSFTTASSSPPFRCKSSSKSSPASPSSAGIDCATAA--DIADVPEEGFEDLVDAVSPASTASVANTMI-PTLLQPRV
        M +KLL NRFRHL F+ST   RA+SFT  SSS PFRC S SK S  S  S+GIDC+TAA  DIADVPEE  E+L DAVSPA+  S A  ++ P+LLQPRV
Subjt:  MASKLLTNRFRHLLFSSTAKIRASSFTTASSSPPFRCKSSSKSSPASPSSAGIDCATAA--DIADVPEEGFEDLVDAVSPASTASVANTMI-PTLLQPRV

Query:  VVYDGVCHLCHRGVKWVIKADKYKKIKFCCLQSRAAEPYLRLSGLDREDISRRFLFVEGYGSYYQASAAALKVLSYLPLPYSALSAFLIIPTPLRDAVYD
        VVYDGVCHLCHRGVKWVIKADKYKKIKFCCLQS+AAEPYLRLSGLDRED+ RRF+F+EG+GSYYQAS AALKVLSYLPLPYSALS FLIIPTPLRDAVYD
Subjt:  VVYDGVCHLCHRGVKWVIKADKYKKIKFCCLQSRAAEPYLRLSGLDREDISRRFLFVEGYGSYYQASAAALKVLSYLPLPYSALSAFLIIPTPLRDAVYD

Query:  HVARHRYDWFGKAEDCLVLQEEDLLERFIDREELLDRHHR
         VA+HRYD FGKA+DCLVLQEE+LLERFIDREELLD+ HR
Subjt:  HVARHRYDWFGKAEDCLVLQEEDLLERFIDREELLDRHHR

XP_022151245.1 uncharacterized protein LOC111019215 isoform X1 [Momordica charantia]9.1e-127100Show/hide
Query:  MASKLLTNRFRHLLFSSTAKIRASSFTTASSSPPFRCKSSSKSSPASPSSAGIDCATAADIADVPEEGFEDLVDAVSPASTASVANTMIPTLLQPRVVVY
        MASKLLTNRFRHLLFSSTAKIRASSFTTASSSPPFRCKSSSKSSPASPSSAGIDCATAADIADVPEEGFEDLVDAVSPASTASVANTMIPTLLQPRVVVY
Subjt:  MASKLLTNRFRHLLFSSTAKIRASSFTTASSSPPFRCKSSSKSSPASPSSAGIDCATAADIADVPEEGFEDLVDAVSPASTASVANTMIPTLLQPRVVVY

Query:  DGVCHLCHRGVKWVIKADKYKKIKFCCLQSRAAEPYLRLSGLDREDISRRFLFVEGYGSYYQASAAALKVLSYLPLPYSALSAFLIIPTPLRDAVYDHVA
        DGVCHLCHRGVKWVIKADKYKKIKFCCLQSRAAEPYLRLSGLDREDISRRFLFVEGYGSYYQASAAALKVLSYLPLPYSALSAFLIIPTPLRDAVYDHVA
Subjt:  DGVCHLCHRGVKWVIKADKYKKIKFCCLQSRAAEPYLRLSGLDREDISRRFLFVEGYGSYYQASAAALKVLSYLPLPYSALSAFLIIPTPLRDAVYDHVA

Query:  RHRYDWFGKAEDCLVLQEEDLLERFIDREELLDRHHRY
        RHRYDWFGKAEDCLVLQEEDLLERFIDREELLDRHHRY
Subjt:  RHRYDWFGKAEDCLVLQEEDLLERFIDREELLDRHHRY

XP_022956594.1 uncharacterized protein LOC111458284 [Cucurbita moschata]3.5e-9478.33Show/hide
Query:  MASKLLTNRFRHLLFSSTAKIRASSFTTASSSPPFRCKSSSKSSPASPSSAGIDCATAA--DIADVPEEGFEDLVDAVSPASTASVANTMI-PTLLQPRV
        M +KLL NRFRHL F+ST   RA+SFT  SSS PFRC S SK S  S  S+GIDC+TAA  DIADVPEE  E+L DAVSPA+  S A  ++ P+LLQPRV
Subjt:  MASKLLTNRFRHLLFSSTAKIRASSFTTASSSPPFRCKSSSKSSPASPSSAGIDCATAA--DIADVPEEGFEDLVDAVSPASTASVANTMI-PTLLQPRV

Query:  VVYDGVCHLCHRGVKWVIKADKYKKIKFCCLQSRAAEPYLRLSGLDREDISRRFLFVEGYGSYYQASAAALKVLSYLPLPYSALSAFLIIPTPLRDAVYD
        VVYDGVCHLCHRGVKWVIKADKYKKIKFCCLQS+AAEPYLRLSGLDRED+ RRF+F+EG+GSYYQAS AALKVLSYLPLPYSALS FLIIPTPLRDAVYD
Subjt:  VVYDGVCHLCHRGVKWVIKADKYKKIKFCCLQSRAAEPYLRLSGLDREDISRRFLFVEGYGSYYQASAAALKVLSYLPLPYSALSAFLIIPTPLRDAVYD

Query:  HVARHRYDWFGKAEDCLVLQEEDLLERFIDREELLDRHHR
         VA+HRYD FGKA+DCLVLQEE+LLERFIDREELLD+ HR
Subjt:  HVARHRYDWFGKAEDCLVLQEEDLLERFIDREELLDRHHR

XP_022997576.1 uncharacterized protein LOC111492437 [Cucurbita maxima]4.1e-9577.92Show/hide
Query:  MASKLLTNRFRHLLFSSTAKIRASSFTTASSSPPFRCKSSSKSSPASPSSAGIDCATAA--DIADVPEEGFEDLVDAVSP-ASTASVANTMIPTLLQPRV
        M +KLL NRFRHL F+ST   RA+SFT  SSS PFRC   SK S +SPSS+GIDC+TAA  DIADVPEE  E+L D VSP A+ +S    + P+LLQPRV
Subjt:  MASKLLTNRFRHLLFSSTAKIRASSFTTASSSPPFRCKSSSKSSPASPSSAGIDCATAA--DIADVPEEGFEDLVDAVSP-ASTASVANTMIPTLLQPRV

Query:  VVYDGVCHLCHRGVKWVIKADKYKKIKFCCLQSRAAEPYLRLSGLDREDISRRFLFVEGYGSYYQASAAALKVLSYLPLPYSALSAFLIIPTPLRDAVYD
        VVYDGVCHLCHRGVKWVIKADKYKKIKFCC QS+AAEPYLRLSGLDRED+ RRF+F+EG+GSYYQAS AALKVLSYLPLPYSALSAFLIIPTPLRDA+YD
Subjt:  VVYDGVCHLCHRGVKWVIKADKYKKIKFCCLQSRAAEPYLRLSGLDREDISRRFLFVEGYGSYYQASAAALKVLSYLPLPYSALSAFLIIPTPLRDAVYD

Query:  HVARHRYDWFGKAEDCLVLQEEDLLERFIDREELLDRHHR
         VA+HRYD FGKAEDCLVLQEE+LLERFIDREELLD+ HR
Subjt:  HVARHRYDWFGKAEDCLVLQEEDLLERFIDREELLDRHHR

XP_023549628.1 uncharacterized protein LOC111808071 [Cucurbita pepo subsp. pepo]1.1e-9578.33Show/hide
Query:  MASKLLTNRFRHLLFSSTAKIRASSFTTASSSPPFRCKSSSKSSPASPSSAGIDCATAA--DIADVPEEGFEDLVDAVSP-ASTASVANTMIPTLLQPRV
        M +KLL NRFRHL F+ST   RA+SFT  SSS PFRC   SK S +SPSS+GIDC+TAA  DIADVPEE  E+L DAVSP A+ +S    + P+LLQPRV
Subjt:  MASKLLTNRFRHLLFSSTAKIRASSFTTASSSPPFRCKSSSKSSPASPSSAGIDCATAA--DIADVPEEGFEDLVDAVSP-ASTASVANTMIPTLLQPRV

Query:  VVYDGVCHLCHRGVKWVIKADKYKKIKFCCLQSRAAEPYLRLSGLDREDISRRFLFVEGYGSYYQASAAALKVLSYLPLPYSALSAFLIIPTPLRDAVYD
        VVYDGVCHLCHRGVKWVIKAD+YKKIKFCCLQS+AAEPYLRLSGLDRED+ RRF+F+EG+GSYYQAS AALKVLSYLPLPYSALSAFLIIPTPLRDA+YD
Subjt:  VVYDGVCHLCHRGVKWVIKADKYKKIKFCCLQSRAAEPYLRLSGLDREDISRRFLFVEGYGSYYQASAAALKVLSYLPLPYSALSAFLIIPTPLRDAVYD

Query:  HVARHRYDWFGKAEDCLVLQEEDLLERFIDREELLDRHHR
         VA+HRYD FGKAEDCLVLQEE+LLERFIDREELLD+ HR
Subjt:  HVARHRYDWFGKAEDCLVLQEEDLLERFIDREELLDRHHR

TrEMBL top hitse value%identityAlignment
A0A0A0KVQ2 Uncharacterized protein1.3e-8673.03Show/hide
Query:  MASKLLTNRFRHLLFSSTAKIRASSFTTASSSPPFRCKSSSKSSPASPSSAGIDCATA--ADIADVPEEGFEDLVDAVSPASTAS--VANTMIPTLLQPR
        MASKLL+NRFR L+ +STA IR +SFTT+ S   FR   SSK S    SS GIDC+TA  ADIADVPEE  +D VD VSPA   +   A  + PTLLQPR
Subjt:  MASKLLTNRFRHLLFSSTAKIRASSFTTASSSPPFRCKSSSKSSPASPSSAGIDCATA--ADIADVPEEGFEDLVDAVSPASTAS--VANTMIPTLLQPR

Query:  VVVYDGVCHLCHRGVKWVIKADKYKKIKFCCLQSRAAEPYLRLSGLDREDISRRFLFVEGYGSYYQASAAALKVLSYLPLPYSALSAFLIIPTPLRDAVY
        VV+YDGVCHLCHRGVKWVIK DKYKKIKFCCLQS+ AEPYLRLSGLDRED+S RF+F+EG+GSY+QAS AAL+VLSYLPLPYSALSAFLIIPTPLRD++Y
Subjt:  VVVYDGVCHLCHRGVKWVIKADKYKKIKFCCLQSRAAEPYLRLSGLDREDISRRFLFVEGYGSYYQASAAALKVLSYLPLPYSALSAFLIIPTPLRDAVY

Query:  DHVARHRYDWFGKAEDCLVLQEEDLLERFIDREELLDRHHR
        D VARHRYD F KAE CLVLQ+E+LLERFIDREELL++ H+
Subjt:  DHVARHRYDWFGKAEDCLVLQEEDLLERFIDREELLDRHHR

A0A6J1DAN6 uncharacterized protein LOC111019215 isoform X14.4e-127100Show/hide
Query:  MASKLLTNRFRHLLFSSTAKIRASSFTTASSSPPFRCKSSSKSSPASPSSAGIDCATAADIADVPEEGFEDLVDAVSPASTASVANTMIPTLLQPRVVVY
        MASKLLTNRFRHLLFSSTAKIRASSFTTASSSPPFRCKSSSKSSPASPSSAGIDCATAADIADVPEEGFEDLVDAVSPASTASVANTMIPTLLQPRVVVY
Subjt:  MASKLLTNRFRHLLFSSTAKIRASSFTTASSSPPFRCKSSSKSSPASPSSAGIDCATAADIADVPEEGFEDLVDAVSPASTASVANTMIPTLLQPRVVVY

Query:  DGVCHLCHRGVKWVIKADKYKKIKFCCLQSRAAEPYLRLSGLDREDISRRFLFVEGYGSYYQASAAALKVLSYLPLPYSALSAFLIIPTPLRDAVYDHVA
        DGVCHLCHRGVKWVIKADKYKKIKFCCLQSRAAEPYLRLSGLDREDISRRFLFVEGYGSYYQASAAALKVLSYLPLPYSALSAFLIIPTPLRDAVYDHVA
Subjt:  DGVCHLCHRGVKWVIKADKYKKIKFCCLQSRAAEPYLRLSGLDREDISRRFLFVEGYGSYYQASAAALKVLSYLPLPYSALSAFLIIPTPLRDAVYDHVA

Query:  RHRYDWFGKAEDCLVLQEEDLLERFIDREELLDRHHRY
        RHRYDWFGKAEDCLVLQEEDLLERFIDREELLDRHHRY
Subjt:  RHRYDWFGKAEDCLVLQEEDLLERFIDREELLDRHHRY

A0A6J1DE19 uncharacterized protein LOC111019215 isoform X21.5e-9079.83Show/hide
Query:  MASKLLTNRFRHLLFSSTAKIRASSFTTASSSPPFRCKSSSKSSPASPSSAGIDCATAADIADVPEEGFEDLVDAVSPASTASVANTMIPTLLQPRVVVY
        MASKLLTNRFRHLLFSSTAKIRASSFTTASSSPPFRCKSSSKSSPASPSSAGIDCATAADIADVPEEGFEDLVDAVSPASTASVANTMIPTLLQPRVVVY
Subjt:  MASKLLTNRFRHLLFSSTAKIRASSFTTASSSPPFRCKSSSKSSPASPSSAGIDCATAADIADVPEEGFEDLVDAVSPASTASVANTMIPTLLQPRVVVY

Query:  DGVCHLCHRGVKWVIKADKYKKIKFCCLQSRAAEPYLRLSGLDREDISRRFLFVEGYGSYYQASAAALKVLSYLPLPYSALSAFLIIPTPLRDAVYDHVA
        DGVCHLCHRGVKWVIKADKYKKIKFCCLQSRAAEPYLRLSGLDREDISRRFLFVEGYGSYYQASAAALK                               
Subjt:  DGVCHLCHRGVKWVIKADKYKKIKFCCLQSRAAEPYLRLSGLDREDISRRFLFVEGYGSYYQASAAALKVLSYLPLPYSALSAFLIIPTPLRDAVYDHVA

Query:  RHRYDWFGKAEDCLVLQEEDLLERFIDREELLDRHHRY
                         EEDLLERFIDREELLDRHHRY
Subjt:  RHRYDWFGKAEDCLVLQEEDLLERFIDREELLDRHHRY

A0A6J1GZH8 uncharacterized protein LOC1114582841.7e-9478.33Show/hide
Query:  MASKLLTNRFRHLLFSSTAKIRASSFTTASSSPPFRCKSSSKSSPASPSSAGIDCATAA--DIADVPEEGFEDLVDAVSPASTASVANTMI-PTLLQPRV
        M +KLL NRFRHL F+ST   RA+SFT  SSS PFRC S SK S  S  S+GIDC+TAA  DIADVPEE  E+L DAVSPA+  S A  ++ P+LLQPRV
Subjt:  MASKLLTNRFRHLLFSSTAKIRASSFTTASSSPPFRCKSSSKSSPASPSSAGIDCATAA--DIADVPEEGFEDLVDAVSPASTASVANTMI-PTLLQPRV

Query:  VVYDGVCHLCHRGVKWVIKADKYKKIKFCCLQSRAAEPYLRLSGLDREDISRRFLFVEGYGSYYQASAAALKVLSYLPLPYSALSAFLIIPTPLRDAVYD
        VVYDGVCHLCHRGVKWVIKADKYKKIKFCCLQS+AAEPYLRLSGLDRED+ RRF+F+EG+GSYYQAS AALKVLSYLPLPYSALS FLIIPTPLRDAVYD
Subjt:  VVYDGVCHLCHRGVKWVIKADKYKKIKFCCLQSRAAEPYLRLSGLDREDISRRFLFVEGYGSYYQASAAALKVLSYLPLPYSALSAFLIIPTPLRDAVYD

Query:  HVARHRYDWFGKAEDCLVLQEEDLLERFIDREELLDRHHR
         VA+HRYD FGKA+DCLVLQEE+LLERFIDREELLD+ HR
Subjt:  HVARHRYDWFGKAEDCLVLQEEDLLERFIDREELLDRHHR

A0A6J1K7U9 uncharacterized protein LOC1114924372.0e-9577.92Show/hide
Query:  MASKLLTNRFRHLLFSSTAKIRASSFTTASSSPPFRCKSSSKSSPASPSSAGIDCATAA--DIADVPEEGFEDLVDAVSP-ASTASVANTMIPTLLQPRV
        M +KLL NRFRHL F+ST   RA+SFT  SSS PFRC   SK S +SPSS+GIDC+TAA  DIADVPEE  E+L D VSP A+ +S    + P+LLQPRV
Subjt:  MASKLLTNRFRHLLFSSTAKIRASSFTTASSSPPFRCKSSSKSSPASPSSAGIDCATAA--DIADVPEEGFEDLVDAVSP-ASTASVANTMIPTLLQPRV

Query:  VVYDGVCHLCHRGVKWVIKADKYKKIKFCCLQSRAAEPYLRLSGLDREDISRRFLFVEGYGSYYQASAAALKVLSYLPLPYSALSAFLIIPTPLRDAVYD
        VVYDGVCHLCHRGVKWVIKADKYKKIKFCC QS+AAEPYLRLSGLDRED+ RRF+F+EG+GSYYQAS AALKVLSYLPLPYSALSAFLIIPTPLRDA+YD
Subjt:  VVYDGVCHLCHRGVKWVIKADKYKKIKFCCLQSRAAEPYLRLSGLDREDISRRFLFVEGYGSYYQASAAALKVLSYLPLPYSALSAFLIIPTPLRDAVYD

Query:  HVARHRYDWFGKAEDCLVLQEEDLLERFIDREELLDRHHR
         VA+HRYD FGKAEDCLVLQEE+LLERFIDREELLD+ HR
Subjt:  HVARHRYDWFGKAEDCLVLQEEDLLERFIDREELLDRHHR

SwissProt top hitse value%identityAlignment
P40761 Uncharacterized protein YuxK1.3e-1940.6Show/hide
Query:  RVVVYDGVCHLCHRGVKWVIKADKYKKIKFCCLQSRAAEPYLRLSGL--DREDISRRFLFVEGYGSYYQASAAALKVLSYLPLPYSALSAFLIIPTPLRD
        RV+++DGVC+LC+  V+++IK D    I F  LQS   +  L+ SGL  DR D    F+F+E  G  Y  S AA+KV  +L  P+     F  +P P+RD
Subjt:  RVVVYDGVCHLCHRGVKWVIKADKYKKIKFCCLQSRAAEPYLRLSGL--DREDISRRFLFVEGYGSYYQASAAALKVLSYLPLPYSALSAFLIIPTPLRD

Query:  AVYDHVARHRYDWFGKAEDCLVLQEEDLLERFI
         VY  +A++RY WFGK  +C+ L    + +RF+
Subjt:  AVYDHVARHRYDWFGKAEDCLVLQEEDLLERFI

Q9SSR1 DCC family protein At1g52590, chloroplastic8.9e-1640.68Show/hide
Query:  VVVYDGVCHLCHRGVKWVIKADKYKKIKFCCLQSRAAEPYLRLSGLDREDISRRFLFVEGYGSYYQASAAALKVLSYLPLPYSALSAFL-IIPTPLRDAV
        ++++DGVC+LC+ GVK+V   D+ + I+F  LQS A +  L  SG   +DIS   + VE   SY + S A LK++ Y+ LP+  L+ FL   P  +RD +
Subjt:  VVVYDGVCHLCHRGVKWVIKADKYKKIKFCCLQSRAAEPYLRLSGLDREDISRRFLFVEGYGSYYQASAAALKVLSYLPLPYSALSAFL-IIPTPLRDAV

Query:  YDHVARHRYDWFGKAEDC
        Y++VA +RY  FG+++ C
Subjt:  YDHVARHRYDWFGKAEDC

Arabidopsis top hitse value%identityAlignment
AT1G24095.1 Putative thiol-disulphide oxidoreductase DCC5.6e-6662.5Show/hide
Query:  SSPASPSSAGIDCATAADIADVPEEGFEDLVDAVSPASTASVANTMIPTLLQPRVVVYDGVCHLCHRGVKWVIKADKYKKIKFCCLQSRAAEPYLRLSGL
        S+P+   S G    T A+I        +++   + PA + +    ++P  LQPRVVVYDGVCHLCH GVKW+IKADKY+KIKFCCLQS+AAEPYL +SG+
Subjt:  SSPASPSSAGIDCATAADIADVPEEGFEDLVDAVSPASTASVANTMIPTLLQPRVVVYDGVCHLCHRGVKWVIKADKYKKIKFCCLQSRAAEPYLRLSGL

Query:  DREDISRRFLFVEGYGSYYQASAAALKVLSYLPLPYSALSAFLIIPTPLRDAVYDHVARHRYDWFGKAEDCLVLQEEDLLERFIDREELLDR
         RED+ +RFLF+EG G Y+QAS AAL+V+SYLPLPYSAL+AF I+PTPLRD+VYD+VA++RYDWFGKAEDCLVL +++LLERFIDR+EL++R
Subjt:  DREDISRRFLFVEGYGSYYQASAAALKVLSYLPLPYSALSAFLIIPTPLRDAVYDHVARHRYDWFGKAEDCLVLQEEDLLERFIDREELLDR

AT1G52590.1 Putative thiol-disulphide oxidoreductase DCC6.3e-1740.68Show/hide
Query:  VVVYDGVCHLCHRGVKWVIKADKYKKIKFCCLQSRAAEPYLRLSGLDREDISRRFLFVEGYGSYYQASAAALKVLSYLPLPYSALSAFL-IIPTPLRDAV
        ++++DGVC+LC+ GVK+V   D+ + I+F  LQS A +  L  SG   +DIS   + VE   SY + S A LK++ Y+ LP+  L+ FL   P  +RD +
Subjt:  VVVYDGVCHLCHRGVKWVIKADKYKKIKFCCLQSRAAEPYLRLSGLDREDISRRFLFVEGYGSYYQASAAALKVLSYLPLPYSALSAFL-IIPTPLRDAV

Query:  YDHVARHRYDWFGKAEDC
        Y++VA +RY  FG+++ C
Subjt:  YDHVARHRYDWFGKAEDC


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGAGCAAACTTCTGACGAATCGGTTTCGGCACCTTTTGTTCTCATCAACAGCGAAGATCAGAGCATCCTCCTTTACTACTGCATCTTCTTCTCCCCCATTCCGGTG
CAAATCTTCTTCTAAATCTTCACCGGCATCACCTTCTTCCGCCGGCATCGATTGCGCTACCGCTGCGGATATCGCCGATGTTCCTGAAGAAGGTTTTGAGGACCTGGTCG
ACGCCGTCTCTCCTGCTTCCACTGCTTCAGTTGCCAACACTATGATTCCTACTCTCCTCCAGCCTCGTGTCGTTGTCTATGACGGTGTCTGCCATCTCTGCCATCGCGGT
GTGAAGTGGGTGATAAAAGCAGACAAATATAAGAAGATCAAATTTTGCTGCCTTCAGTCCAGAGCTGCTGAACCATATTTGAGACTAAGTGGTCTAGACAGAGAGGATAT
TAGCCGTCGCTTTTTGTTCGTCGAGGGCTACGGTTCATACTACCAAGCTTCTGCCGCTGCTCTGAAGGTATTGTCGTATTTGCCTCTCCCGTATTCAGCTTTGAGCGCAT
TCTTGATAATTCCAACTCCTCTTAGAGACGCTGTGTATGACCATGTTGCTAGACATCGTTATGATTGGTTTGGAAAGGCTGAAGATTGCTTGGTTTTGCAGGAGGAAGAT
CTGCTTGAGCGTTTTATTGATAGGGAGGAACTGCTTGATCGACATCATCGGTATTGA
mRNA sequenceShow/hide mRNA sequence
ATGGCGAGCAAACTTCTGACGAATCGGTTTCGGCACCTTTTGTTCTCATCAACAGCGAAGATCAGAGCATCCTCCTTTACTACTGCATCTTCTTCTCCCCCATTCCGGTG
CAAATCTTCTTCTAAATCTTCACCGGCATCACCTTCTTCCGCCGGCATCGATTGCGCTACCGCTGCGGATATCGCCGATGTTCCTGAAGAAGGTTTTGAGGACCTGGTCG
ACGCCGTCTCTCCTGCTTCCACTGCTTCAGTTGCCAACACTATGATTCCTACTCTCCTCCAGCCTCGTGTCGTTGTCTATGACGGTGTCTGCCATCTCTGCCATCGCGGT
GTGAAGTGGGTGATAAAAGCAGACAAATATAAGAAGATCAAATTTTGCTGCCTTCAGTCCAGAGCTGCTGAACCATATTTGAGACTAAGTGGTCTAGACAGAGAGGATAT
TAGCCGTCGCTTTTTGTTCGTCGAGGGCTACGGTTCATACTACCAAGCTTCTGCCGCTGCTCTGAAGGTATTGTCGTATTTGCCTCTCCCGTATTCAGCTTTGAGCGCAT
TCTTGATAATTCCAACTCCTCTTAGAGACGCTGTGTATGACCATGTTGCTAGACATCGTTATGATTGGTTTGGAAAGGCTGAAGATTGCTTGGTTTTGCAGGAGGAAGAT
CTGCTTGAGCGTTTTATTGATAGGGAGGAACTGCTTGATCGACATCATCGGTATTGA
Protein sequenceShow/hide protein sequence
MASKLLTNRFRHLLFSSTAKIRASSFTTASSSPPFRCKSSSKSSPASPSSAGIDCATAADIADVPEEGFEDLVDAVSPASTASVANTMIPTLLQPRVVVYDGVCHLCHRG
VKWVIKADKYKKIKFCCLQSRAAEPYLRLSGLDREDISRRFLFVEGYGSYYQASAAALKVLSYLPLPYSALSAFLIIPTPLRDAVYDHVARHRYDWFGKAEDCLVLQEED
LLERFIDREELLDRHHRY