CuGenDBv2

MS001204 (gene) of Bitter gourd (TR) v1 genome

Gene ID: MS001204
Organism: Momordica charantia cv. TR (Bitter gourd (TR) v1)
Description: SWIM-type domain-containing protein
Genome location: scaffold36:2260758..2261444
RNA-Seq Expression: MS001204
Synteny: MS001204
Gene Ontology terms:
GO:0016310 - phosphorylation (biological process)
GO:0016567 - protein ubiquitination (biological process)
GO:0008270 - zinc ion binding (molecular function)
GO:0016301 - kinase activity (molecular function)
GO:0061630 - ubiquitin protein ligase activity (molecular function)
InterPro domains:
IPR001841 - Zinc finger, RING-type
IPR007527 - Zinc finger, SWIM-type
IPR013083 - Zinc finger, RING/FYVE/PHD-type
IPR039903 - E3 ubiquitin-protein ligase Zswim2


Homology
GenBank top hits
XP_016899980.1 PREDICTED: uncharacterized protein LOC103488177 [Cucumis melo] (e value: 1.9e-97; % identity: 77.73)
Query:  MESVASTSSPPRRPRLHLTRFKPSQAVADRIVRALHHHLRLLHRSDSNFFVLGATGNVYIVSLSTTPSCSCPDRITPCKHILFIYIRALGVSLDDACLRR
        MESVASTSSP  RP   + RFKPSQ VADRIVRALHHHLRLLHRS SNFFVLGATGNVYIVSLS+TPSC+CPDRITPCKHILFIY++ALG+SLDD CLRR
Subjt:  MESVASTSSPPRRPRLHLTRFKPSQAVADRIVRALHHHLRLLHRSDSNFFVLGATGNVYIVSLSTTPSCSCPDRITPCKHILFIYIRALGVSLDDACLRR

Query:  RTLRPCHLNRLLTAPIMLESLAAIGVRRVFHQRFFQVKGRASAS------VVDVEDGTTCPVCLDEMKKDSDRVVACMTCRNVVHEDCFLRWKRSKGRRS
        RTLRPC LNRLL APIML+SLA IG+R+VFHQ+FFQV  RA +S      V D+EDGT CPVCLD+MKK +DRVVAC TCRN+VHEDCF RWKRSKGRR+
Subjt:  RTLRPCHLNRLLTAPIMLESLAAIGVRRVFHQRFFQVKGRASAS------VVDVEDGTTCPVCLDEMKKDSDRVVACMTCRNVVHEDCFLRWKRSKGRRS

Query:  VSCVVCRARWRDTSGQEKYLNLSAYINE---INNDLYN
        VSCVVCRARW+DT  ++KYLNLSAYINE   I+++LYN
Subjt:  VSCVVCRARWRDTSGQEKYLNLSAYINE---INNDLYN

XP_022131540.1 uncharacterized protein LOC111004703 [Momordica charantia] (e value: 8.8e-127; % identity: 99.13)
Query:  MESVASTSSPPRRPRLHLTRFKPSQAVADRIVRALHHHLRLLHRSDSNFFVLGATGNVYIVSLSTTPSCSCPDRITPCKHILFIYIRALGVSLDDACLRR
        MESVASTSSPPRRPRLHLTRFKPSQAVADRIVRALHHHLRLLHRSDSNFFVLGATGNVYIVSLSTTPSCSCPDRITPCKHILFIYIRALGVSLDDACLRR
Subjt:  MESVASTSSPPRRPRLHLTRFKPSQAVADRIVRALHHHLRLLHRSDSNFFVLGATGNVYIVSLSTTPSCSCPDRITPCKHILFIYIRALGVSLDDACLRR

Query:  RTLRPCHLNRLLTAPIMLESLAAIGVRRVFHQRFFQVKGRASASVVDVEDGTTCPVCLDEMKK-DSDRVVACMTCRNVVHEDCFLRWKRSKGRRSVSCVV
        RTLRPCHLNRLLTAPIMLES+AAIGVRRVFHQRFFQVKGRASASVVDVEDGTTCPVCLDEMKK DSDRVVACMTCRNVVHEDCFLRWKRSKGRRSVSCVV
Subjt:  RTLRPCHLNRLLTAPIMLESLAAIGVRRVFHQRFFQVKGRASASVVDVEDGTTCPVCLDEMKK-DSDRVVACMTCRNVVHEDCFLRWKRSKGRRSVSCVV

Query:  CRARWRDTSGQEKYLNLSAYINEINNDLYN
        CRARWRDTSGQEKYLNLSAYINEINNDLYN
Subjt:  CRARWRDTSGQEKYLNLSAYINEINNDLYN

XP_023521856.1 uncharacterized protein LOC111785706 [Cucurbita pepo subsp. pepo] (e value: 9.2e-100; % identity: 80.09)
Query:  MESVAST-SSPPRRPRLHLTRFKPSQAVADRIVRALHHHLRLLHRSDSNFFVLGATGNVYIVSLSTTPSCSCPDRITPCKHILFIYIRALGVSLDDACLR
        MESVAST SSPP  PRLHL RFKPSQAVADRIVRALHHHL LL+RS+SNFFVLGATGNVYIVSLS+TPSC+CPDRITPCKHILFIYIRALG+SLDD CLR
Subjt:  MESVAST-SSPPRRPRLHLTRFKPSQAVADRIVRALHHHLRLLHRSDSNFFVLGATGNVYIVSLSTTPSCSCPDRITPCKHILFIYIRALGVSLDDACLR

Query:  RRTLRPCHLNRLLTAPIMLESLAAIGVRRVFHQRFFQVKGRASASVVDVEDGTTCPVCLDEMKKDSDRVVACMTCRNVVHEDCFLRWKRSKGRRSVSCVV
        RRTLRPC LNRLL API+LESLA IG+RR+FHQ+FFQVK R S++ V VE+GT CPVCLD+MKK +DRVVAC TCRN+VHEDCF RWKRSKGRR VSCVV
Subjt:  RRTLRPCHLNRLLTAPIMLESLAAIGVRRVFHQRFFQVKGRASASVVDVEDGTTCPVCLDEMKKDSDRVVACMTCRNVVHEDCFLRWKRSKGRRSVSCVV

Query:  CRARWRDTSGQEKYLNLSAYINE---INNDL
        CRARW++T+ Q+KYLNLSAY+N+   ++N+L
Subjt:  CRARWRDTSGQEKYLNLSAYINE---INNDL

XP_023546870.1 uncharacterized protein LOC111805851 [Cucurbita pepo subsp. pepo] (e value: 3.1e-100; % identity: 80.52)
Query:  MESVAST-SSPPRRPRLHLTRFKPSQAVADRIVRALHHHLRLLHRSDSNFFVLGATGNVYIVSLSTTPSCSCPDRITPCKHILFIYIRALGVSLDDACLR
        MESVAST SSPP  PRLHL RFKPSQAVADRIVRALHHHL LL+RS+SNFFVLGATGNVYIVSLS+TPSC+CPDRITPCKHILFIYIRALG+SLDD CLR
Subjt:  MESVAST-SSPPRRPRLHLTRFKPSQAVADRIVRALHHHLRLLHRSDSNFFVLGATGNVYIVSLSTTPSCSCPDRITPCKHILFIYIRALGVSLDDACLR

Query:  RRTLRPCHLNRLLTAPIMLESLAAIGVRRVFHQRFFQVKGRASASVVDVEDGTTCPVCLDEMKKDSDRVVACMTCRNVVHEDCFLRWKRSKGRRSVSCVV
        RRTLRPC LNRLL API+LESLA IG+RR+FHQ+FFQVK R S++ V VE+GT CPVCLD+MKK +DRVVAC TCRN+VHEDCF RWKRSKGRR VSCVV
Subjt:  RRTLRPCHLNRLLTAPIMLESLAAIGVRRVFHQRFFQVKGRASASVVDVEDGTTCPVCLDEMKKDSDRVVACMTCRNVVHEDCFLRWKRSKGRRSVSCVV

Query:  CRARWRDTSGQEKYLNLSAYINE---INNDL
        CRARW++T+ Q+KYLNLSAY+NE   ++N+L
Subjt:  CRARWRDTSGQEKYLNLSAYINE---INNDL

XP_038885291.1 uncharacterized protein LOC120075728 [Benincasa hispida] (e value: 2.0e-99; % identity: 79.74)
Query:  MESVASTSSPPRRPRLHLTRFKPSQAVADRIVRALHHHLRLLHRSDSNFFVLGATGNVYIVSLSTTPSCSCPDRITPCKHILFIYIRALGVSLDDACLRR
        MESVASTSS       H+ RFKPS AVADRIVRALHHHLRLLHRS SNFFVLGATGNVYIVSLS+TPSC+CPDRITPCKHILFIY++ALG+SLDD CLRR
Subjt:  MESVASTSSPPRRPRLHLTRFKPSQAVADRIVRALHHHLRLLHRSDSNFFVLGATGNVYIVSLSTTPSCSCPDRITPCKHILFIYIRALGVSLDDACLRR

Query:  RTLRPCHLNRLLTAPIMLESLAAIGVRRVFHQRFFQVKGRASASVVDVEDGTTCPVCLDEMKKDSDRVVACMTCRNVVHEDCFLRWKRSKGRRSVSCVVC
        RTLRPC LNRLL APIML+SLA +G+R VFHQ+FFQVK RAS +VVD+EDGTTCPVCLD+MKK  DRVVAC TCRN+VHEDCF RWKRSKGRRSVSCVVC
Subjt:  RTLRPCHLNRLLTAPIMLESLAAIGVRRVFHQRFFQVKGRASASVVDVEDGTTCPVCLDEMKKDSDRVVACMTCRNVVHEDCFLRWKRSKGRRSVSCVVC

Query:  RARWRDTSGQEKYLNLSAYINE---INNDLYN
        RARW+D   Q+KYLNLSAYINE   I+++LY+
Subjt:  RARWRDTSGQEKYLNLSAYINE---INNDLYN

TrEMBL top hits
A0A0A0LS10 SWIM-type domain-containing protein (e value: 3.6e-94; % identity: 75.52)
Query:  MESVASTSSPPRRPRLHLTRFKPSQAVADRIVRALHHHLRLLHRSDSNFFVLGATGNVYIVSLSTTPSCSCPDRITPCKHILFIYIRALGVSLDDACLRR
        MESVASTSSP   P     RFKPSQ VADRIVRALHHHLRLLHRS SNFFVLGATGNVYIVSLS+TPSC+CPDRITPCKHILFIY++ALG+SLDD CLRR
Subjt:  MESVASTSSPPRRPRLHLTRFKPSQAVADRIVRALHHHLRLLHRSDSNFFVLGATGNVYIVSLSTTPSCSCPDRITPCKHILFIYIRALGVSLDDACLRR

Query:  RTLRPCHLNRLLTAPIMLESLAAIGVRRVFHQRFFQVKGRASAS-------VVDVEDGTTCPVCLDEMKKDSDRVVACMTCRNVVHEDCFLRWKRSKGRR
        RTLRPC LNRLL APIML+SLA IG+R+VFHQ+FFQV  RA++S       V D+EDG+ CPVCLD+MKK  DRVVAC TCRN+VHEDCF RWKRSKGRR
Subjt:  RTLRPCHLNRLLTAPIMLESLAAIGVRRVFHQRFFQVKGRASAS-------VVDVEDGTTCPVCLDEMKKDSDRVVACMTCRNVVHEDCFLRWKRSKGRR

Query:  SVSCVVCRARWRDTSG-QEKYLNLSAYI----NEINNDLYN
        +VSCVVCRARW+DT   Q+KYLNLSAYI    + I+  LYN
Subjt:  SVSCVVCRARWRDTSG-QEKYLNLSAYI----NEINNDLYN

A0A1S4DW96 uncharacterized protein LOC103488177 (e value: 9.2e-98; % identity: 77.73)
Query:  MESVASTSSPPRRPRLHLTRFKPSQAVADRIVRALHHHLRLLHRSDSNFFVLGATGNVYIVSLSTTPSCSCPDRITPCKHILFIYIRALGVSLDDACLRR
        MESVASTSSP  RP   + RFKPSQ VADRIVRALHHHLRLLHRS SNFFVLGATGNVYIVSLS+TPSC+CPDRITPCKHILFIY++ALG+SLDD CLRR
Subjt:  MESVASTSSPPRRPRLHLTRFKPSQAVADRIVRALHHHLRLLHRSDSNFFVLGATGNVYIVSLSTTPSCSCPDRITPCKHILFIYIRALGVSLDDACLRR

Query:  RTLRPCHLNRLLTAPIMLESLAAIGVRRVFHQRFFQVKGRASAS------VVDVEDGTTCPVCLDEMKKDSDRVVACMTCRNVVHEDCFLRWKRSKGRRS
        RTLRPC LNRLL APIML+SLA IG+R+VFHQ+FFQV  RA +S      V D+EDGT CPVCLD+MKK +DRVVAC TCRN+VHEDCF RWKRSKGRR+
Subjt:  RTLRPCHLNRLLTAPIMLESLAAIGVRRVFHQRFFQVKGRASAS------VVDVEDGTTCPVCLDEMKKDSDRVVACMTCRNVVHEDCFLRWKRSKGRRS

Query:  VSCVVCRARWRDTSGQEKYLNLSAYINE---INNDLYN
        VSCVVCRARW+DT  ++KYLNLSAYINE   I+++LYN
Subjt:  VSCVVCRARWRDTSGQEKYLNLSAYINE---INNDLYN

A0A5A7VCN6 SWIM-type domain-containing protein (e value: 9.2e-98; % identity: 77.73)
Query:  MESVASTSSPPRRPRLHLTRFKPSQAVADRIVRALHHHLRLLHRSDSNFFVLGATGNVYIVSLSTTPSCSCPDRITPCKHILFIYIRALGVSLDDACLRR
        MESVASTSSP  RP   + RFKPSQ VADRIVRALHHHLRLLHRS SNFFVLGATGNVYIVSLS+TPSC+CPDRITPCKHILFIY++ALG+SLDD CLRR
Subjt:  MESVASTSSPPRRPRLHLTRFKPSQAVADRIVRALHHHLRLLHRSDSNFFVLGATGNVYIVSLSTTPSCSCPDRITPCKHILFIYIRALGVSLDDACLRR

Query:  RTLRPCHLNRLLTAPIMLESLAAIGVRRVFHQRFFQVKGRASAS------VVDVEDGTTCPVCLDEMKKDSDRVVACMTCRNVVHEDCFLRWKRSKGRRS
        RTLRPC LNRLL APIML+SLA IG+R+VFHQ+FFQV  RA +S      V D+EDGT CPVCLD+MKK +DRVVAC TCRN+VHEDCF RWKRSKGRR+
Subjt:  RTLRPCHLNRLLTAPIMLESLAAIGVRRVFHQRFFQVKGRASAS------VVDVEDGTTCPVCLDEMKKDSDRVVACMTCRNVVHEDCFLRWKRSKGRRS

Query:  VSCVVCRARWRDTSGQEKYLNLSAYINE---INNDLYN
        VSCVVCRARW+DT  ++KYLNLSAYINE   I+++LYN
Subjt:  VSCVVCRARWRDTSGQEKYLNLSAYINE---INNDLYN

A0A6J1BPZ4 uncharacterized protein LOC111004703 (e value: 4.3e-127; % identity: 99.13)
Query:  MESVASTSSPPRRPRLHLTRFKPSQAVADRIVRALHHHLRLLHRSDSNFFVLGATGNVYIVSLSTTPSCSCPDRITPCKHILFIYIRALGVSLDDACLRR
        MESVASTSSPPRRPRLHLTRFKPSQAVADRIVRALHHHLRLLHRSDSNFFVLGATGNVYIVSLSTTPSCSCPDRITPCKHILFIYIRALGVSLDDACLRR
Subjt:  MESVASTSSPPRRPRLHLTRFKPSQAVADRIVRALHHHLRLLHRSDSNFFVLGATGNVYIVSLSTTPSCSCPDRITPCKHILFIYIRALGVSLDDACLRR

Query:  RTLRPCHLNRLLTAPIMLESLAAIGVRRVFHQRFFQVKGRASASVVDVEDGTTCPVCLDEMKK-DSDRVVACMTCRNVVHEDCFLRWKRSKGRRSVSCVV
        RTLRPCHLNRLLTAPIMLES+AAIGVRRVFHQRFFQVKGRASASVVDVEDGTTCPVCLDEMKK DSDRVVACMTCRNVVHEDCFLRWKRSKGRRSVSCVV
Subjt:  RTLRPCHLNRLLTAPIMLESLAAIGVRRVFHQRFFQVKGRASASVVDVEDGTTCPVCLDEMKK-DSDRVVACMTCRNVVHEDCFLRWKRSKGRRSVSCVV

Query:  CRARWRDTSGQEKYLNLSAYINEINNDLYN
        CRARWRDTSGQEKYLNLSAYINEINNDLYN
Subjt:  CRARWRDTSGQEKYLNLSAYINEINNDLYN

A0A6J1GJ16 mitogen-activated protein kinase kinase kinase 1 (e value: 1.2e-97; % identity: 75.43)
Query:  MESVASTSSPPRRPRLHLTRFKPSQAVADRIVRALHHHLRLLHRSDSNFFVLGATGNVYIVSLSTTPSCSCPDRITPCKHILFIYIRALGVSLDDACLRR
        MESVASTSS P RPR  L  F+PSQA+  RIVRALHHHLRLLHRSDS+FFVLGATGNVYIVSLS+TPSC+CPDRITPCKH+LF+YIRALGVSLDDACL R
Subjt:  MESVASTSSPPRRPRLHLTRFKPSQAVADRIVRALHHHLRLLHRSDSNFFVLGATGNVYIVSLSTTPSCSCPDRITPCKHILFIYIRALGVSLDDACLRR

Query:  RTLRPCHLNRLLTAPIMLESLAAIGVRRVFHQRFFQVKGRASASVVDVEDGTTCPVCLDEMKKDSDRVVACMTCRNVVHEDCFLRWKRSKGRRSVSCVVC
        RTLRPC LNRLLTAP+M ESLA I +RRVFHQ+FFQVK R    +VD+EDG  CPVCLD++K   DRVVAC TCRN+VH+DCF RWKRSKGRR++SCVVC
Subjt:  RTLRPCHLNRLLTAPIMLESLAAIGVRRVFHQRFFQVKGRASASVVDVEDGTTCPVCLDEMKKDSDRVVACMTCRNVVHEDCFLRWKRSKGRRSVSCVVC

Query:  RARWRDTSGQEKYLNLSAYINE---INNDLYN
        RARW+DT+ ++KYLNLSAY++E   ++N+ YN
Subjt:  RARWRDTSGQEKYLNLSAYINE---INNDLYN

SwissProt top hits
Q13233 Mitogen-activated protein kinase kinase kinase 1 (e value: 7.7e-09; % identity: 26.04)
Query:  RIVRALHHHLRLLHRSDSNFFVLG--ATGNVYIVSLSTTPSCSCPDRITPCKHILFIYIRALGVSLDDACLRRRTLRPCHLNRLLTAPIMLESLAAIGVR
        R+ + +   L LL +   N F++G  +  N Y V +    +CSC  R T C H+LF+ +R   +   D  L R+TL+   +  L        S       
Subjt:  RIVRALHHHLRLLHRSDSNFFVLG--ATGNVYIVSLSTTPSCSCPDRITPCKHILFIYIRALGVSLDDACLRRRTLRPCHLNRLLTAPIMLESLAAIGVR

Query:  RVFHQRFF------------QVKGRASASVVDVEDGTTCPVCLDEMKKDSDRVVACMTCRNVVHEDCFLRWKRS--KGRRSVSCVVCRARWR
        R   Q+F                  +S + +  E+   CP+CL  M  +    V    CRN +H  C   W     + R  + C +CR++WR
Subjt:  RVFHQRFF------------QVKGRASASVVDVEDGTTCPVCLDEMKKDSDRVVACMTCRNVVHEDCFLRWKRS--KGRRSVSCVVCRARWR

Q62925 Mitogen-activated protein kinase kinase kinase 1 (e value: 5.9e-09; % identity: 25.58)
Query:  SPPRRPRL--HLTRFKPSQAVADRIVRALHHHLRLLHRSDSNFFVLG--ATGNVYIVSLSTTPSCSCPDRITPCKHILFIYIRALGVSLDDACLRRRTLR
        +PPRR       + + P +  + R+ + +   L LL +   N F++G  +  N Y V +    +CSC  R T C H+LF+ +R   +   D  L R+TL+
Subjt:  SPPRRPRL--HLTRFKPSQAVADRIVRALHHHLRLLHRSDSNFFVLG--ATGNVYIVSLSTTPSCSCPDRITPCKHILFIYIRALGVSLDDACLRRRTLR

Query:  PCHLNRLLTAPIMLESLAAIGVRRVFHQRFF------------QVKGRASASVVDVEDGTTCPVCLDEMKKDSDRVVACMTCRNVVHEDCFLRWKRS--K
           +  L        S       R   Q+F                  +S + +  E+   CP+CL  M  +    V    CRN +H  C   W     +
Subjt:  PCHLNRLLTAPIMLESLAAIGVRRVFHQRFF------------QVKGRASASVVDVEDGTTCPVCLDEMKKDSDRVVACMTCRNVVHEDCFLRWKRS--K

Query:  GRRSVSCVVCRARWR
         R  + C +CR++WR
Subjt:  GRRSVSCVVCRARWR

Arabidopsis top hits
AT5G11620.1 SWIM zinc finger family protein / mitogen-activated protein kinase kinase kinase (MAPKKK)-related (e value: 1.1e-47; % identity: 41.29)
Query:  MESVASTSSPP---RRPRLHLTRFKPSQAVADRIVRALHHHLRLLHRSDS-NFFVLGATGNVYIVSLSTTPSCSCPDRITPCKHILFIYIRALGVSLDDA
        MESV S    P    R + H      +Q VADRI+RAL H +RLLHR ++  F VLGAT NVY V+L  TP+C+CPDR  PCKHILF+ IR LG+ LDD 
Subjt:  MESVASTSSPP---RRPRLHLTRFKPSQAVADRIVRALHHHLRLLHRSDS-NFFVLGATGNVYIVSLSTTPSCSCPDRITPCKHILFIYIRALGVSLDDA

Query:  CLRRRTLRPCHLNRLLTAPIMLESLAAIGVRRVFHQRF--------FQVKGRASASVVD---VEDGTTCPVCLDEM-------------KKDSDRVVACM
        CLR+R LR C L  L +AP   + LA+  +++ F Q F        +     +S S ++    E+  TCP+CLD++             K+    VV C 
Subjt:  CLRRRTLRPCHLNRLLTAPIMLESLAAIGVRRVFHQRF--------FQVKGRASASVVD---VEDGTTCPVCLDEM-------------KKDSDRVVACM

Query:  TCRNVVHEDCFLRWKRSKGRRSVSCVVCRARWRDTSGQEK--------------YLNLSAYINE
         C+N VH++C L W++S+GRR   CVVCRARW      +               YLNL+ Y++E
Subjt:  TCRNVVHEDCFLRWKRSKGRRSVSCVVCRARWRDTSGQEK--------------YLNLSAYINE


Sequences
CDS sequence
ATGGAGTCTGTTGCCTCTACTTCATCCCCACCCCGCCGTCCCCGTCTTCACCTTACTCGATTCAAGCCCTCCCAAGCCGTGGCTGACCGGATCGTTCGAGCTCTCCACCA
TCACCTTCGTCTTCTTCACCGCTCCGATTCTAACTTCTTTGTGTTGGGAGCCACAGGCAACGTCTACATTGTGTCTCTGTCCACCACCCCTTCCTGCAGTTGCCCCGATC
GGATTACCCCGTGCAAACACATATTGTTCATCTACATTAGAGCTCTTGGTGTGTCACTCGACGACGCATGTCTTCGGCGGAGAACGCTTCGACCATGCCACCTGAATCGC
TTGCTTACTGCTCCCATTATGTTGGAATCTCTTGCTGCAATTGGCGTGCGTAGAGTATTTCATCAACGATTCTTCCAGGTAAAAGGACGAGCTTCTGCATCCGTCGTAGA
CGTAGAAGATGGGACTACATGTCCGGTTTGTCTGGATGAAATGAAGAAGGATAGCGATAGGGTCGTGGCTTGTATGACATGTCGAAATGTTGTCCATGAAGATTGTTTCT
TGAGGTGGAAACGAAGCAAGGGAAGGCGAAGTGTTAGTTGCGTAGTTTGTAGGGCAAGGTGGAGAGATACGAGTGGCCAAGAAAAGTATTTGAATTTGTCTGCATACATC
AATGAAATTAACAATGATCTCTATAAT
mRNA sequence
ATGGAGTCTGTTGCCTCTACTTCATCCCCACCCCGCCGTCCCCGTCTTCACCTTACTCGATTCAAGCCCTCCCAAGCCGTGGCTGACCGGATCGTTCGAGCTCTCCACCA
TCACCTTCGTCTTCTTCACCGCTCCGATTCTAACTTCTTTGTGTTGGGAGCCACAGGCAACGTCTACATTGTGTCTCTGTCCACCACCCCTTCCTGCAGTTGCCCCGATC
GGATTACCCCGTGCAAACACATATTGTTCATCTACATTAGAGCTCTTGGTGTGTCACTCGACGACGCATGTCTTCGGCGGAGAACGCTTCGACCATGCCACCTGAATCGC
TTGCTTACTGCTCCCATTATGTTGGAATCTCTTGCTGCAATTGGCGTGCGTAGAGTATTTCATCAACGATTCTTCCAGGTAAAAGGACGAGCTTCTGCATCCGTCGTAGA
CGTAGAAGATGGGACTACATGTCCGGTTTGTCTGGATGAAATGAAGAAGGATAGCGATAGGGTCGTGGCTTGTATGACATGTCGAAATGTTGTCCATGAAGATTGTTTCT
TGAGGTGGAAACGAAGCAAGGGAAGGCGAAGTGTTAGTTGCGTAGTTTGTAGGGCAAGGTGGAGAGATACGAGTGGCCAAGAAAAGTATTTGAATTTGTCTGCATACATC
AATGAAATTAACAATGATCTCTATAAT
Protein sequence
MESVASTSSPPRRPRLHLTRFKPSQAVADRIVRALHHHLRLLHRSDSNFFVLGATGNVYIVSLSTTPSCSCPDRITPCKHILFIYIRALGVSLDDACLRRRTLRPCHLNR
LLTAPIMLESLAAIGVRRVFHQRFFQVKGRASASVVDVEDGTTCPVCLDEMKKDSDRVVACMTCRNVVHEDCFLRWKRSKGRRSVSCVVCRARWRDTSGQEKYLNLSAYI
NEINNDLYN
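
The genome location, CDS and protein sequence listed in this record can be cross-checked against each other. The following is a minimal Python sketch of one way to do that, not part of the database itself. It assumes the location string uses 1-based inclusive coordinates and that the standard genetic code (NCBI translation table 1) applies; the function names `span_length` and `translate` are hypothetical helpers introduced only for illustration.

```python
import re
from itertools import product

# Standard genetic code (assumed: NCBI translation table 1),
# with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def span_length(location: str) -> int:
    """Return the length in bp of a 'scaffold:start..end' location.

    Assumes 1-based, inclusive coordinates.
    """
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    start, end = int(match.group(2)), int(match.group(3))
    return end - start + 1


def translate(cds: str) -> str:
    """Translate a CDS in frame 0; unknown codons become 'X'."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    return "".join(
        CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, usable, 3)
    )


if __name__ == "__main__":
    # Span of the genome location given in this record.
    print(span_length("scaffold36:2260758..2261444"))
    # First five codons of the CDS given in this record.
    print(translate("ATGGAGTCTGTTGCC"))
```

Under these assumptions the location spans 687 bp, which is divisible by three (229 codons), and the first five codons of the CDS translate to MESVA, matching the start of the protein sequence above.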