; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MC08g0488 (gene) of Bitter gourd (Dali-11) v1 genome

Gene IDMC08g0488
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
DescriptionWRKY domain-containing protein
Genome locationMC08:3987029..3993002
RNA-Seq ExpressionMC08g0488
SyntenyMC08g0488
Gene Ontology termsGO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0006468 - protein phosphorylation (biological process)
GO:0005634 - nucleus (cellular component)
GO:0003700 - DNA-binding transcription factor activity (molecular function)
GO:0004674 - protein serine/threonine kinase activity (molecular function)
GO:0005524 - ATP binding (molecular function)
GO:0043565 - sequence-specific DNA binding (molecular function)
InterPro domainsIPR003657 - WRKY domain
IPR036576 - WRKY domain superfamily
IPR044810 - WRKY transcription factor, plant


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004144271.1 probable WRKY transcription factor 50 [Cucumis sativus]6.60e-9889.68Show/hide
Query:  MSYNSNQHLATSESDLAEQPGFEFPDWMFDGWLNENSSSLAESVMYPVYKAGEEVDEIVGINTIHQGEPSSRDRGREREVRERFAFKTKSEIEILDDGFK
        MSYNSNQHL TSESDL+EQPGFEF DWMFDGWLNENSSSL +SVMYPVY+ GE VDE VG NTI QGEPSSRD GRERE+RERFAFKTKSE+EILDDGFK
Subjt:  MSYNSNQHLATSESDLAEQPGFEFPDWMFDGWLNENSSSLAESVMYPVYKAGEEVDEIVGINTIHQGEPSSRDRGREREVRERFAFKTKSEIEILDDGFK

Query:  WRKYGKKMVKNSPNPRNYYKCSIEGCPVKKRVERDREDPRYVITTYEGVHTHESS
        WRKYGKKMVKNSPNPRNYYKCS+EGCPVKKRVERDREDP+YVITTYEGVHTHESS
Subjt:  WRKYGKKMVKNSPNPRNYYKCSIEGCPVKKRVERDREDPRYVITTYEGVHTHESS

XP_008445519.1 PREDICTED: probable WRKY transcription factor 50 isoform X1 [Cucumis melo]5.43e-9789.03Show/hide
Query:  MSYNSNQHLATSESDLAEQPGFEFPDWMFDGWLNENSSSLAESVMYPVYKAGEEVDEIVGINTIHQGEPSSRDRGREREVRERFAFKTKSEIEILDDGFK
        MSYNSNQHL TSESDL+EQPGFEF DWMFDGWLNENSS+L +SVMYPVY+ GE VDE VG NTI Q EPSSRD GREREVRERFAFKTKSE+EILDDGFK
Subjt:  MSYNSNQHLATSESDLAEQPGFEFPDWMFDGWLNENSSSLAESVMYPVYKAGEEVDEIVGINTIHQGEPSSRDRGREREVRERFAFKTKSEIEILDDGFK

Query:  WRKYGKKMVKNSPNPRNYYKCSIEGCPVKKRVERDREDPRYVITTYEGVHTHESS
        WRKYGKKMVKNSPNPRNYYKCS+EGCPVKKRVERDREDP+YVITTYEGVHTHESS
Subjt:  WRKYGKKMVKNSPNPRNYYKCSIEGCPVKKRVERDREDPRYVITTYEGVHTHESS

XP_022131756.1 probable WRKY transcription factor 50 [Momordica charantia]2.68e-111100Show/hide
Query:  MSYNSNQHLATSESDLAEQPGFEFPDWMFDGWLNENSSSLAESVMYPVYKAGEEVDEIVGINTIHQGEPSSRDRGREREVRERFAFKTKSEIEILDDGFK
        MSYNSNQHLATSESDLAEQPGFEFPDWMFDGWLNENSSSLAESVMYPVYKAGEEVDEIVGINTIHQGEPSSRDRGREREVRERFAFKTKSEIEILDDGFK
Subjt:  MSYNSNQHLATSESDLAEQPGFEFPDWMFDGWLNENSSSLAESVMYPVYKAGEEVDEIVGINTIHQGEPSSRDRGREREVRERFAFKTKSEIEILDDGFK

Query:  WRKYGKKMVKNSPNPRNYYKCSIEGCPVKKRVERDREDPRYVITTYEGVHTHESS
        WRKYGKKMVKNSPNPRNYYKCSIEGCPVKKRVERDREDPRYVITTYEGVHTHESS
Subjt:  WRKYGKKMVKNSPNPRNYYKCSIEGCPVKKRVERDREDPRYVITTYEGVHTHESS

XP_023546261.1 probable WRKY transcription factor 50 [Cucurbita pepo subsp. pepo]4.80e-9183.23Show/hide
Query:  MSYNSNQHLATSESDLAEQPGFEFPDWMFDGWLNENSSSLAESVMYPVYKAGEEVDEIVGINTIHQGEPSSRDRGREREVRERFAFKTKSEIEILDDGFK
        MSYNSNQHL  SE+DL EQPGFEF DWMFD WL+ENS  L +S MYPVY++GE ++E VG +TI QGEPSSRD GREREVRERFAFKTKSE+E+LDDGFK
Subjt:  MSYNSNQHLATSESDLAEQPGFEFPDWMFDGWLNENSSSLAESVMYPVYKAGEEVDEIVGINTIHQGEPSSRDRGREREVRERFAFKTKSEIEILDDGFK

Query:  WRKYGKKMVKNSPNPRNYYKCSIEGCPVKKRVERDREDPRYVITTYEGVHTHESS
        WRKYGKKMVKNSPNPRNYYKCS+EGCPVKKRVERDREDP+YVITTYEGVHTHESS
Subjt:  WRKYGKKMVKNSPNPRNYYKCSIEGCPVKKRVERDREDPRYVITTYEGVHTHESS

XP_038884623.1 probable WRKY transcription factor 50 isoform X2 [Benincasa hispida]6.08e-9487.1Show/hide
Query:  MSYNSNQHLATSESDLAEQPGFEFPDWMFDGWLNENSSSLAESVMYPVYKAGEEVDEIVGINTIHQGEPSSRDRGREREVRERFAFKTKSEIEILDDGFK
        MSYNSNQHL  SESDL E+PGFEF +WMFDGWLNE SSSLA+SVMYPVY+ GE VDE VG NTIHQGEPS+RD GREREV ERFAFKTKSE+EILDDGFK
Subjt:  MSYNSNQHLATSESDLAEQPGFEFPDWMFDGWLNENSSSLAESVMYPVYKAGEEVDEIVGINTIHQGEPSSRDRGREREVRERFAFKTKSEIEILDDGFK

Query:  WRKYGKKMVKNSPNPRNYYKCSIEGCPVKKRVERDREDPRYVITTYEGVHTHESS
        WRKYGKK VKNSPNPRNYYKCS+EGCPVKKRVERDREDP+YVITTYEGVHTHESS
Subjt:  WRKYGKKMVKNSPNPRNYYKCSIEGCPVKKRVERDREDPRYVITTYEGVHTHESS

TrEMBL top hitse value%identityAlignment
A0A0A0KD40 WRKY domain-containing protein3.20e-9889.68Show/hide
Query:  MSYNSNQHLATSESDLAEQPGFEFPDWMFDGWLNENSSSLAESVMYPVYKAGEEVDEIVGINTIHQGEPSSRDRGREREVRERFAFKTKSEIEILDDGFK
        MSYNSNQHL TSESDL+EQPGFEF DWMFDGWLNENSSSL +SVMYPVY+ GE VDE VG NTI QGEPSSRD GRERE+RERFAFKTKSE+EILDDGFK
Subjt:  MSYNSNQHLATSESDLAEQPGFEFPDWMFDGWLNENSSSLAESVMYPVYKAGEEVDEIVGINTIHQGEPSSRDRGREREVRERFAFKTKSEIEILDDGFK

Query:  WRKYGKKMVKNSPNPRNYYKCSIEGCPVKKRVERDREDPRYVITTYEGVHTHESS
        WRKYGKKMVKNSPNPRNYYKCS+EGCPVKKRVERDREDP+YVITTYEGVHTHESS
Subjt:  WRKYGKKMVKNSPNPRNYYKCSIEGCPVKKRVERDREDPRYVITTYEGVHTHESS

A0A1S3BCG1 probable WRKY transcription factor 50 isoform X12.63e-9789.03Show/hide
Query:  MSYNSNQHLATSESDLAEQPGFEFPDWMFDGWLNENSSSLAESVMYPVYKAGEEVDEIVGINTIHQGEPSSRDRGREREVRERFAFKTKSEIEILDDGFK
        MSYNSNQHL TSESDL+EQPGFEF DWMFDGWLNENSS+L +SVMYPVY+ GE VDE VG NTI Q EPSSRD GREREVRERFAFKTKSE+EILDDGFK
Subjt:  MSYNSNQHLATSESDLAEQPGFEFPDWMFDGWLNENSSSLAESVMYPVYKAGEEVDEIVGINTIHQGEPSSRDRGREREVRERFAFKTKSEIEILDDGFK

Query:  WRKYGKKMVKNSPNPRNYYKCSIEGCPVKKRVERDREDPRYVITTYEGVHTHESS
        WRKYGKKMVKNSPNPRNYYKCS+EGCPVKKRVERDREDP+YVITTYEGVHTHESS
Subjt:  WRKYGKKMVKNSPNPRNYYKCSIEGCPVKKRVERDREDPRYVITTYEGVHTHESS

A0A5A7VG90 Putative WRKY transcription factor 50 isoform X12.63e-9789.03Show/hide
Query:  MSYNSNQHLATSESDLAEQPGFEFPDWMFDGWLNENSSSLAESVMYPVYKAGEEVDEIVGINTIHQGEPSSRDRGREREVRERFAFKTKSEIEILDDGFK
        MSYNSNQHL TSESDL+EQPGFEF DWMFDGWLNENSS+L +SVMYPVY+ GE VDE VG NTI Q EPSSRD GREREVRERFAFKTKSE+EILDDGFK
Subjt:  MSYNSNQHLATSESDLAEQPGFEFPDWMFDGWLNENSSSLAESVMYPVYKAGEEVDEIVGINTIHQGEPSSRDRGREREVRERFAFKTKSEIEILDDGFK

Query:  WRKYGKKMVKNSPNPRNYYKCSIEGCPVKKRVERDREDPRYVITTYEGVHTHESS
        WRKYGKKMVKNSPNPRNYYKCS+EGCPVKKRVERDREDP+YVITTYEGVHTHESS
Subjt:  WRKYGKKMVKNSPNPRNYYKCSIEGCPVKKRVERDREDPRYVITTYEGVHTHESS

A0A5D3BLP6 Putative WRKY transcription factor 50 isoform X13.35e-9177.84Show/hide
Query:  MSYNSNQHLATSESDLAEQPGFEFPDWMFDGWLNENSSSLAESVMYPVYKAGEEVDEIVGINTIHQGEPSSR---------------------DRGRERE
        MSYNSNQHL TSESDL+EQPGFEF DWMFDGWLNENSS+L +SVMYPVY+ GE VDE VG NTI Q EPSS                      D GRERE
Subjt:  MSYNSNQHLATSESDLAEQPGFEFPDWMFDGWLNENSSSLAESVMYPVYKAGEEVDEIVGINTIHQGEPSSR---------------------DRGRERE

Query:  VRERFAFKTKSEIEILDDGFKWRKYGKKMVKNSPNPRNYYKCSIEGCPVKKRVERDREDPRYVITTYEGVHTHESS
        VRERFAFKTKSE+EILDDGFKWRKYGKKMVKNSPNPRNYYKCS+EGCPVKKRVERDREDP+YVITTYEGVHTHESS
Subjt:  VRERFAFKTKSEIEILDDGFKWRKYGKKMVKNSPNPRNYYKCSIEGCPVKKRVERDREDPRYVITTYEGVHTHESS

A0A6J1BQD9 probable WRKY transcription factor 501.30e-111100Show/hide
Query:  MSYNSNQHLATSESDLAEQPGFEFPDWMFDGWLNENSSSLAESVMYPVYKAGEEVDEIVGINTIHQGEPSSRDRGREREVRERFAFKTKSEIEILDDGFK
        MSYNSNQHLATSESDLAEQPGFEFPDWMFDGWLNENSSSLAESVMYPVYKAGEEVDEIVGINTIHQGEPSSRDRGREREVRERFAFKTKSEIEILDDGFK
Subjt:  MSYNSNQHLATSESDLAEQPGFEFPDWMFDGWLNENSSSLAESVMYPVYKAGEEVDEIVGINTIHQGEPSSRDRGREREVRERFAFKTKSEIEILDDGFK

Query:  WRKYGKKMVKNSPNPRNYYKCSIEGCPVKKRVERDREDPRYVITTYEGVHTHESS
        WRKYGKKMVKNSPNPRNYYKCSIEGCPVKKRVERDREDPRYVITTYEGVHTHESS
Subjt:  WRKYGKKMVKNSPNPRNYYKCSIEGCPVKKRVERDREDPRYVITTYEGVHTHESS

SwissProt top hitse value%identityAlignment
Q8VWQ5 Probable WRKY transcription factor 503.4e-3270.24Show/hide
Query:  SRDRGREREVRERFAFKTKSEIEILDDGFKWRKYGKKMVKNSPNPRNYYKCSIEGCPVKKRVERDREDPRYVITTYEGVHTHES
        ++++  +++++ R AFKT+SE+E+LDDGFKWRKYGKKMVKNSP+PRNYYKCS++GCPVKKRVERDR+DP +VITTYEG H H S
Subjt:  SRDRGREREVRERFAFKTKSEIEILDDGFKWRKYGKKMVKNSPNPRNYYKCSIEGCPVKKRVERDREDPRYVITTYEGVHTHES

Q93WU9 Probable WRKY transcription factor 512.5e-2766.67Show/hide
Query:  SRDRGREREVRERFAFKTKSEIEILDDGFKWRKYGKKMVKNSPNPRNYYKCSIEGCPVKKRVERDREDPRYVITTYEGVHTHES
        S++  + +E   R AF+T+S+I+++DDGFKWRKYGKK VKN+ N RNYYKCS EGC VKKRVERD +D  YVITTYEGVH HES
Subjt:  SRDRGREREVRERFAFKTKSEIEILDDGFKWRKYGKKMVKNSPNPRNYYKCSIEGCPVKKRVERDREDPRYVITTYEGVHTHES

Q9FGZ4 Probable WRKY transcription factor 487.5e-2447.01Show/hide
Query:  NSSSLAESVMYPVYKAGEEVDEIVGINTIHQGEPSSRDRGREREVRERFAFKTKSEIEILDDGFKWRKYGKKMVKNSPNPRNYYKCSIEGCPVKKRVERD
        N ++  + V     + G++  E  G     + +  ++ + RE     RFAF TKS+I+ LDDG++WRKYG+K VKNSP PR+YY+C+  GC VKKRVER 
Subjt:  NSSSLAESVMYPVYKAGEEVDEIVGINTIHQGEPSSRDRGREREVRERFAFKTKSEIEILDDGFKWRKYGKKMVKNSPNPRNYYKCSIEGCPVKKRVERD

Query:  REDPRYVITTYEGVHTH
         +DP  V+TTYEG HTH
Subjt:  REDPRYVITTYEGVHTH

Q9S763 Probable WRKY transcription factor 454.4e-2453.06Show/hide
Query:  VDEIVGINTIHQGEPSSRDRGREREVRERFAFKTKSEIEILDDGFKWRKYGKKMVKNSPNPRNYYKCSIEGCPVKKRVERDREDPRYVITTYEGVHTH
        VD      T  + +P S+ + +ERE   R+AF+T+S+++ILDDG++WRKYG+K VKN+P PR+YYKC+ EGC VKK+V+R   D   V+TTY+GVHTH
Subjt:  VDEIVGINTIHQGEPSSRDRGREREVRERFAFKTKSEIEILDDGFKWRKYGKKMVKNSPNPRNYYKCSIEGCPVKKRVERDREDPRYVITTYEGVHTH

Q9SVB7 Probable WRKY transcription factor 136.4e-2363.29Show/hide
Query:  REVRE-RFAFKTKSEIEILDDGFKWRKYGKKMVKNSPNPRNYYKCSIEGCPVKKRVERDREDPRYVITTYEGVHTHESS
        R+VRE RF FKT SE+++LDDG++WRKYG+K+VKN+ +PR+YY+C+ + C VKKRVER  +DPR VITTYEG H H  S
Subjt:  REVRE-RFAFKTKSEIEILDDGFKWRKYGKKMVKNSPNPRNYYKCSIEGCPVKKRVERDREDPRYVITTYEGVHTHESS

Arabidopsis top hitse value%identityAlignment
AT3G01970.1 WRKY DNA-binding protein 453.1e-2553.06Show/hide
Query:  VDEIVGINTIHQGEPSSRDRGREREVRERFAFKTKSEIEILDDGFKWRKYGKKMVKNSPNPRNYYKCSIEGCPVKKRVERDREDPRYVITTYEGVHTH
        VD      T  + +P S+ + +ERE   R+AF+T+S+++ILDDG++WRKYG+K VKN+P PR+YYKC+ EGC VKK+V+R   D   V+TTY+GVHTH
Subjt:  VDEIVGINTIHQGEPSSRDRGREREVRERFAFKTKSEIEILDDGFKWRKYGKKMVKNSPNPRNYYKCSIEGCPVKKRVERDREDPRYVITTYEGVHTH

AT4G39410.1 WRKY DNA-binding protein 134.5e-2463.29Show/hide
Query:  REVRE-RFAFKTKSEIEILDDGFKWRKYGKKMVKNSPNPRNYYKCSIEGCPVKKRVERDREDPRYVITTYEGVHTHESS
        R+VRE RF FKT SE+++LDDG++WRKYG+K+VKN+ +PR+YY+C+ + C VKKRVER  +DPR VITTYEG H H  S
Subjt:  REVRE-RFAFKTKSEIEILDDGFKWRKYGKKMVKNSPNPRNYYKCSIEGCPVKKRVERDREDPRYVITTYEGVHTHESS

AT5G26170.1 WRKY DNA-binding protein 502.4e-3370.24Show/hide
Query:  SRDRGREREVRERFAFKTKSEIEILDDGFKWRKYGKKMVKNSPNPRNYYKCSIEGCPVKKRVERDREDPRYVITTYEGVHTHES
        ++++  +++++ R AFKT+SE+E+LDDGFKWRKYGKKMVKNSP+PRNYYKCS++GCPVKKRVERDR+DP +VITTYEG H H S
Subjt:  SRDRGREREVRERFAFKTKSEIEILDDGFKWRKYGKKMVKNSPNPRNYYKCSIEGCPVKKRVERDREDPRYVITTYEGVHTHES

AT5G49520.1 WRKY DNA-binding protein 485.3e-2547.01Show/hide
Query:  NSSSLAESVMYPVYKAGEEVDEIVGINTIHQGEPSSRDRGREREVRERFAFKTKSEIEILDDGFKWRKYGKKMVKNSPNPRNYYKCSIEGCPVKKRVERD
        N ++  + V     + G++  E  G     + +  ++ + RE     RFAF TKS+I+ LDDG++WRKYG+K VKNSP PR+YY+C+  GC VKKRVER 
Subjt:  NSSSLAESVMYPVYKAGEEVDEIVGINTIHQGEPSSRDRGREREVRERFAFKTKSEIEILDDGFKWRKYGKKMVKNSPNPRNYYKCSIEGCPVKKRVERD

Query:  REDPRYVITTYEGVHTH
         +DP  V+TTYEG HTH
Subjt:  REDPRYVITTYEGVHTH

AT5G64810.1 WRKY DNA-binding protein 511.8e-2866.67Show/hide
Query:  SRDRGREREVRERFAFKTKSEIEILDDGFKWRKYGKKMVKNSPNPRNYYKCSIEGCPVKKRVERDREDPRYVITTYEGVHTHES
        S++  + +E   R AF+T+S+I+++DDGFKWRKYGKK VKN+ N RNYYKCS EGC VKKRVERD +D  YVITTYEGVH HES
Subjt:  SRDRGREREVRERFAFKTKSEIEILDDGFKWRKYGKKMVKNSPNPRNYYKCSIEGCPVKKRVERDREDPRYVITTYEGVHTHES


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCTTACAATAGTAACCAGCACCTCGCTACGTCTGAGAGCGATCTTGCCGAGCAGCCCGGGTTCGAGTTTCCGGACTGGATGTTCGATGGGTGGCTCAATGAAAACTC
TTCGTCTCTGGCTGAGTCGGTCATGTACCCGGTTTATAAAGCGGGCGAAGAAGTCGATGAGATTGTTGGGATCAATACCATCCACCAAGGAGAGCCTAGCAGCAGAGACC
GTGGGAGAGAGAGGGAAGTTAGAGAGAGATTTGCATTCAAGACAAAATCAGAAATTGAGATTCTAGATGATGGGTTCAAGTGGAGGAAGTATGGGAAGAAGATGGTGAAG
AACAGCCCAAATCCGAGGAACTATTACAAATGCTCAATCGAAGGCTGCCCGGTGAAGAAGAGAGTCGAGAGAGACCGAGAGGATCCGAGATATGTGATTACGACGTACGA
GGGTGTTCATACCCATGAAAGTTCTTGA
mRNA sequenceShow/hide mRNA sequence
GGAAGATAATGTGGAGGCTCAAATGAGTGCATATATGAATTCAAAAGTCTACTCAGAGTCATCCCTTTGGGATACCCCACAGACCTTGTATTCTAATTTCTACACTTTGC
TGGTCTTCTCCTTTCATTATATTAAGCATTAGCGTTGAAGCTTCCTAAGTTTGGAGGAAAGTCAGTGGGAAGCTGCCTGCCTAAGCTTAGCTAGCTCACTTCATATTTTG
GCTATTTCTATATCTATGCCACAATTCTTTTATCTCTAATTCTGATCTCAGTTTCAAAAAGTATCCTACTTTTGTTTGCTTCTTTATTCTCTCGGTTCCCAAATGTCTTA
CAATAGTAACCAGCACCTCGCTACGTCTGAGAGCGATCTTGCCGAGCAGCCCGGGTTCGAGTTTCCGGACTGGATGTTCGATGGGTGGCTCAATGAAAACTCTTCGTCTC
TGGCTGAGTCGGTCATGTACCCGGTTTATAAAGCGGGCGAAGAAGTCGATGAGATTGTTGGGATCAATACCATCCACCAAGGAGAGCCTAGCAGCAGAGACCGTGGGAGA
GAGAGGGAAGTTAGAGAGAGATTTGCATTCAAGACAAAATCAGAAATTGAGATTCTAGATGATGGGTTCAAGTGGAGGAAGTATGGGAAGAAGATGGTGAAGAACAGCCC
AAATCCGAGGAACTATTACAAATGCTCAATCGAAGGCTGCCCGGTGAAGAAGAGAGTCGAGAGAGACCGAGAGGATCCGAGATATGTGATTACGACGTACGAGGGTGTTC
ATACCCATGAAAGTTCTTGATTTGTTTATTTTTACACCAAATCTCTCCCAATGTTAAATAAGCCTACATTTTTGGGTTTCTTGTGGGGAATTATAACTAGTATATGAACA
TCAACGTCAGGTAATAGTTCTAGCGATCTGTTGTTTGTCTGATATCATATTTGCAGTAGAGGCCCCATACCAAACATTACTATTACATATATTAAAGATACGACAATCCA
AACCATACCGTTACAGATCAAAACACCATGTAGGAATCGTCGAATTTGCATTAGTTTGTCCCATCTCAACTTTCCATTGAAAAGCTCTTCCTTTTTCTGTCATTAAAGAA
CATGGTACCACGAAATATGTTAATCACAAAATTCATACATTTACAGAAAAGGCTCTTGATTCATCCAATATATAACGAAGTGATCTTGTGATGTGAAGATAACTGAGAGC
TTTTGTTTTGACCAATTCCTTTACAAGTGAAAGAAAAGAAAAAAAAAATAATCCATAGTATAAGGCCAAGCGATCCAAGAGATCGATCAACTGAAACATATATGTGAAGC
AATACTTTGGTTCGGGATCACGATATTGGGGTTTCTTTTATGTGAACCCGAAGCTGTCGAAGCGGAAATCATTCCCTCGCCATGTATATCAGAGGAAATGCTCCACAAGA
TATCGCCAATCAGAACTTTGTCACCACGCGACGAAAATCTAACGATCGCTTTCGACTTGGATAATGGAAGCTTGAAGTTCCCACCCATTCCACTTGAAAGCAACCAACGG
GATCCGGATTGCTGCTGTGCGGTGGTTCCCCAGTACTGAGTTTTATTAAGCTCAGAATTTACACCCAGAAGATGAATAGCAGAGATTTTAAATGCGTGAACAATGTTCCC
TTTATCTGCTTTCTCCCGGGTCGTTTTATGACTCTCTTCTTGTGCTTCCAATGGCTCATTCTCCTTGTTGACATCGTTTGCTTCAGAGGCCGTGCCATGCATTGTGTCGA
TAAAAAACCTCTCAACCTCCATTATGCACATCATCGGGCCACCAACAGGTTCATAGTTTCGTAATCGATCTCTAAGTTGCACCATGAGAGCCACCACGAGCTTGTTCCCA
AACAATCCCGGCTCCTTGATGGATATCTCGGAGTCGATATCTTTTGATAGTCTTCCAACTATATCTGCATAGTTAGCACCATGGGCCACAAGAATCTTCATAATGTGCTG
ACCATTTTGATCATCGCCATTAATGAGACCAGCATCAAGCCTTAACCAATCTTCCAATGTTATAGAAAGATCCATCAACCCAACAACATCACTCGCTGTATCAGGCCGCT
CCATAAACTGTAGTTCCTTTAGTACTTCCAAACTACAAGAACCGTCCAAACTGGAACGTCTCTGTCCACAGGCTGGCACACAGTGAAAAGGGCGGGCATTGATTTGTGAT
GGCATCTCATCATCTGTCATGCCGGATTGTATTCTCAAACCTTCTATTAAGAGTGTTTCGCTCTTGTCCAGTGCTAGAAAAGCTAGATCATCTGGCACCATGAGGTCTCG
GTTTGTCTCAAAATCCAGAAAAGATGGCAATCCTTCATCGTCTTTCCTTCCACAACATGAAACCGAACTACACGATAAGTTATTCTCAAGCATAGGCTCCCTGTGACAGA
AGTAGATGAGTCAAGAATTTGGCATAATGAAAACCTTCTGAGTTGCAATTTAAAAGTTTTAGAAATCAAATCAATATTGCAATGGCTGACCTCTCTAGCATTGTTTCACT
GGGAGGACATTCTGATATCATTTGTTGGAGAGTCTTTCCGGTAATATCGTCCAAAGGCATTAACTTTTTTGCCAGTGTCGAAAGGTTTTCGGTTCCAGCCAACGCTAAAT
TTTGCGAAATCTCCATGATATTATGACCCATTTCATCAGGTAGAACAACTGGATCAGAACATTGAACGACTAAGCTCTGTCCAGTGCTAGTGTTTGGAGAGAGCGAACGA
CTCATGGACCGCAAAAATCCGCCACTGTTTATCTTAAGAAATGCACCAAAGCCTTCTCCAAGTGAAGGTAACTTAGGTGGCTCTTCTTCTTGGGGCAACTCAATAGGACT
CCCAAATCCACTTGAACTATAATGTGGAGAATGCTCGAAATCTCTCTCGTCTAAGCCCCATTCTTGCATTAAAACTTCGGTCTCTAAGTCTTCGAGAATTTTGACATTCC
TTCTGTTCCTCAAGGACTGATGTCCTTCCTGTGCTTCTTCAGGAATATAAATAATTGGAGAAAAATCAAAATCCTCATCTTGATTCCCAGAACAAGACTCCATAAAATCC
AATAACGAATTACTATAATCCTGCAACTCTTCAGTATCTGTAAAATCCAATAACGAATTACCAAATACAAGAGATTCCTCCTCAAATTCTCTCAATAAGCGCTCTCTAGG
AGACGATATATCAGGATCTGAAAACCTTGAGGAACCGTGCTCCAACCCCAGCTGCTTTAGAAAATCACTAGCCACAGATTCATAGGAGTCATCATCTAAGCTAAGAGATC
TTCTAGCACAATTCTCCTTGGCATCCATATCGATATCTTGTTTGAGGAGTTCACCAACAGCTAACGAAGAATTAGTCTCCACTGACCGTGAGGTTGACTTGAGATCAAAA
TCTGAACTGAGCTCTTCAGGAGTAACTTCCTCCACTTGTAGGTGATTCTCTTCATATTTGAAGCCATCCAATATAGAGTCATCAACATGAATATCGCAGACTGCATCCTT
CAATATGTTGCTCAGCTTAATTTTGATGGCAACTTTCTCATCTTCGATGATCTCATCCAAAGAAACAGTTTCCATTTTGGAGCTCTCAATAGTCCGAACAGTACTTTTAT
CCATACTAAGCTCTTCTGTCCCAGCTAATTCTATCCCACATTCAATAATGGAAAACTCACCACAGTCATAACCACCTCCACCAATTTCTTCATCAGACTTGTGTTCCTCA
GTAGATTTTAATTCAAGTTGCTTGGCAAACTCGGAACCTGAATGCTCTGGCTTATGCTGCTCTGTCTCATCCATCTTACGATACAAAAGGTTTATCGACTTAGATAGCTC
CAACCTTGGATTTAGTTCATCAAAATTTCTTGTATCAACTAACTGAGTTGAGGTAATAGATCCATATTTTTTACTATGAGAAACATTTCCTTCTGGACTTGGAAGCCCAT
CTAAATTAGTTGAAGTAAGGCGGGCACCATAGGTAGAAAAACTTGACCTATTTTGCAATAACTTCAGGAGTTGGACAACGTTTTCGGGACCGCTCAATTTCATTGGATCA
TCCTTAGTTACCAAAAAACTAAAACTGACGTTTAGGGTAGCACCTCTCGCATTGACCCCAAGTCGAAAGCTAGTCGACCAGTTCCCAGAACACTTCTCCCCCTCTAGCTC
CTCTAAGGTAAGAGGTAAAATCCTTGTGAGGTCAACCCAATGCTCCCCAAAATCGAGCCTAGGTGCCCCAAACATGGAAACATAGATCAAGAAAAGCTTCGGGTCATATT
TTGCCGAGCCGTTAGCCAAACTTTTCCCACCATATATCAAACATTTGTGAATCAAAGTCTCGTCAAATTCAGCCATGCCTTGCAAAACCTTAGACGGCTGAGTCTCCAAA
ATCTCATCCTTCCTTTTCCAATGCACACTTAAACTATAACCATTGAAACTCGGAGACAAACCTTCTATGGAATGAACCTTGAGATAAAACACACAGTTGAACTTGCGCTG
CCGAATATGAGTAAGAGCTTTCAAAGATTTCTTCCAGTTCCATGTAGATGACGACCTTCTTTCATTGACCAATGATTCTTCTTCCCTCGAAGTTCCCTGGTTGAATCTTG
ATTTTGATTCAGTCAAGTGGGATTCTAACCGTCCATCAGGTGGGCAAAAAACCGAGTTCGTATGGCCCTTGTGCAAGTACAGAGCTTTACTTATAGCCTCAATTTCCTCC
AGCAACCGACCCCCATCAGACTCCCCTATGCCATCACAATTTCCAGACTTCATTTTGCACTAAACAAACCCACAATCCGCTACGTATTTCAACAATTTCATCATAGAAAA
CGAAAACCCCCCATAAATAGCTGAATCACAAACAGCAAAGAAAGAGAACACAAAGGAACTCAGCATCACCATCATCAACAATAAGCAACCCCATTTCAAAACAAAATTAT
AAAACATCATTCCTCTTACTAGAAACAACAATCAAATCTCAAAATTCAGTAGAAAGACGAACAGAAAACAACTAGCAAAACAACATAAACGAAAAATCCCAATCCCAACA
GACCCCATCAAAACATCGGATCAGTTAAGGATTAAAGTTAAGCAATCGAGTTACCAAGAAAGAGAATGACACAGTACCGAATTCAATAGTTGGGCAGCTTCTCCGCCACA
AAAGATGTACACTGTAACCCGAAAAAATGATAAGAGTAGAAAATTAATGACCCGGAGGAAGATTAATATTCATCAACTGAGGCCGAAATAATCTGGGAATCGAATATCTT
CTTCTATTAGAAATCTTCCTGGATAGTTTTTTCAGAGGAAAAGCGTGAGAGACATTAATATTTTGTCTTCTGTGCCACCGCGCCTGGAGAAGCCGCGGACTTGGAAAGGT
AATAATTTTAAAAAGAAAAAATGGATCGTCGATTCGGGTTTCATTACTGTGTGAGTGAATTCACCCGAATTTTGTACAATCTAATACCTGGTCCGTTGTATTGGGAGAAG
GGGGTGCCGATCATGGGATCGGATGGCTGAGAGAGAAGAATTC
Protein sequenceShow/hide protein sequence
MSYNSNQHLATSESDLAEQPGFEFPDWMFDGWLNENSSSLAESVMYPVYKAGEEVDEIVGINTIHQGEPSSRDRGREREVRERFAFKTKSEIEILDDGFKWRKYGKKMVK
NSPNPRNYYKCSIEGCPVKKRVERDREDPRYVITTYEGVHTHESS