; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc02g11040 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc02g11040
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionCSL zinc finger domain-containing protein
Genome locationchr2:7878610..7880884
RNA-Seq ExpressionMoc02g11040
SyntenyMoc02g11040
Gene Ontology termsGO:0017183 - peptidyl-diphthamide biosynthetic process from peptidyl-histidine (biological process)
GO:0005634 - nucleus (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0003677 - DNA binding (molecular function)
GO:0046872 - metal ion binding (molecular function)
InterPro domainsIPR044248 - Diphthamide biosynthesis protein 3/4-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008459230.1 PREDICTED: uncharacterized protein LOC103498418 [Cucumis melo]8.7e-10678.63Show/hide
Query:  MALAAMMLILVAEGTDTNDVFSPCLDAKVQRSDGFTFGVAFSSKESFFQDQIQFSPCDRRLGLASKNAQLVVFRPRVDQLSLLTINSTTFNPALYGGYMV
        +AL   +L+LVAEG DTNDV+SPCLD+K+QRSDGFTFGVAFSSKESFFQDQIQFSPCD RL LASKNAQL VFRP+VDQLS LTI+++TFNPAL GGYMV
Subjt:  MALAAMMLILVAEGTDTNDVFSPCLDAKVQRSDGFTFGVAFSSKESFFQDQIQFSPCDRRLGLASKNAQLVVFRPRVDQLSLLTINSTTFNPALYGGYMV

Query:  AFAGQKYAARSLPVLITDNSNTITSFTLVLEFQKGTLQNLFWKKFGCDKCTGDYSVCLDNQDCAVPNTKCKFNGGSIDCNLSIQLAFSGTDRNLEVLNSW
        AFAGQKYAARSLPV++TDNS+TITSFTLV EF++GTLQNLFWKKFGCDKC+GD+S+C+DNQDCA+ ++KCK++GGS+DCNL IQLAFSGTD+NLEVLNSW
Subjt:  AFAGQKYAARSLPVLITDNSNTITSFTLVLEFQKGTLQNLFWKKFGCDKCTGDYSVCLDNQDCAVPNTKCKFNGGSIDCNLSIQLAFSGTDRNLEVLNSW

Query:  FEVDNLRRYSLYKLFSDVRDTISDVHDTITNPFG
        +E+DNLRR+SLY+LFSDVR       DT+TNPFG
Subjt:  FEVDNLRRYSLYKLFSDVRDTISDVHDTITNPFG

XP_011649210.1 uncharacterized protein LOC101222204 [Cucumis sativus]5.0e-10982.05Show/hide
Query:  MALAAMMLILVAEGTDTNDVFSPCLDAKVQRSDGFTFGVAFSSKESFFQDQIQFSPCDRRLGLASKNAQLVVFRPRVDQLSLLTINSTTFNPALYGGYMV
        +AL   +L+LV EG DTNDV+SPCLD+K+QRSDGFTFGVAFSSKESFFQDQIQFSPCD RL LASK AQLVVFRP+VDQLS LTINS+TFNPAL GGYMV
Subjt:  MALAAMMLILVAEGTDTNDVFSPCLDAKVQRSDGFTFGVAFSSKESFFQDQIQFSPCDRRLGLASKNAQLVVFRPRVDQLSLLTINSTTFNPALYGGYMV

Query:  AFAGQKYAARSLPVLITDNSNTITSFTLVLEFQKGTLQNLFWKKFGCDKCTGDYSVCLDNQDCAVPNTKCKFNGGSIDCNLSIQLAFSGTDRNLEVLNSW
        AFAGQKYAARSLPV+ITDNSNTITSFTLVLEFQ+G LQNLFWKKFGCDKC+GD+S+C+DNQDCA+PN+KCK+NGGS+DCNL IQLAFSGTD+NLEVLNSW
Subjt:  AFAGQKYAARSLPVLITDNSNTITSFTLVLEFQKGTLQNLFWKKFGCDKCTGDYSVCLDNQDCAVPNTKCKFNGGSIDCNLSIQLAFSGTDRNLEVLNSW

Query:  FEVDNLRRYSLYKLFSDVRDTISDVHDTITNPFG
        +EVDNLRR+SLY+LFSDVR       DT+TNPFG
Subjt:  FEVDNLRRYSLYKLFSDVRDTISDVHDTITNPFG

XP_022148528.1 uncharacterized protein LOC111017155 [Momordica charantia]7.8e-131100Show/hide
Query:  MALAAMMLILVAEGTDTNDVFSPCLDAKVQRSDGFTFGVAFSSKESFFQDQIQFSPCDRRLGLASKNAQLVVFRPRVDQLSLLTINSTTFNPALYGGYMV
        MALAAMMLILVAEGTDTNDVFSPCLDAKVQRSDGFTFGVAFSSKESFFQDQIQFSPCDRRLGLASKNAQLVVFRPRVDQLSLLTINSTTFNPALYGGYMV
Subjt:  MALAAMMLILVAEGTDTNDVFSPCLDAKVQRSDGFTFGVAFSSKESFFQDQIQFSPCDRRLGLASKNAQLVVFRPRVDQLSLLTINSTTFNPALYGGYMV

Query:  AFAGQKYAARSLPVLITDNSNTITSFTLVLEFQKGTLQNLFWKKFGCDKCTGDYSVCLDNQDCAVPNTKCKFNGGSIDCNLSIQLAFSGTDRNLEVLNSW
        AFAGQKYAARSLPVLITDNSNTITSFTLVLEFQKGTLQNLFWKKFGCDKCTGDYSVCLDNQDCAVPNTKCKFNGGSIDCNLSIQLAFSGTDRNLEVLNSW
Subjt:  AFAGQKYAARSLPVLITDNSNTITSFTLVLEFQKGTLQNLFWKKFGCDKCTGDYSVCLDNQDCAVPNTKCKFNGGSIDCNLSIQLAFSGTDRNLEVLNSW

Query:  FEVDNLRRYSLYKLFSDVRDTISDVHDTITNPFG
        FEVDNLRRYSLYKLFSDVRDTISDVHDTITNPFG
Subjt:  FEVDNLRRYSLYKLFSDVRDTISDVHDTITNPFG

XP_023513242.1 uncharacterized protein LOC111777756 [Cucurbita pepo subsp. pepo]5.7e-10578.63Show/hide
Query:  MALAAMMLILVAEGTDTNDVFSPCLDAKVQRSDGFTFGVAFSSKESFFQDQIQFSPCDRRLGLASKNAQLVVFRPRVDQLSLLTINSTTFNPALYGGYMV
        + L   +L+LVAEG DTNDV+SPCLDAK+Q+SDGFTFG+AFSSKE+FFQDQIQFSPCD RL L  KN QL +FRP+VDQLSLLTINSTTFNPA+ GGYMV
Subjt:  MALAAMMLILVAEGTDTNDVFSPCLDAKVQRSDGFTFGVAFSSKESFFQDQIQFSPCDRRLGLASKNAQLVVFRPRVDQLSLLTINSTTFNPALYGGYMV

Query:  AFAGQKYAARSLPVLITDNSNTITSFTLVLEFQKGTLQNLFWKKFGCDKCTGDYSVCLDNQDCAVPNTKCKFNGGSIDCNLSIQLAFSGTDRNLEVLNSW
        AFAG KYAARSLPV+ITDNS+TITSFTLV EFQKGTLQNLFWKK+GC+KCTGD+SVCLDNQDCAV ++KCK++GGS+DCN+SIQLAFSGTDRNLEVLNSW
Subjt:  AFAGQKYAARSLPVLITDNSNTITSFTLVLEFQKGTLQNLFWKKFGCDKCTGDYSVCLDNQDCAVPNTKCKFNGGSIDCNLSIQLAFSGTDRNLEVLNSW

Query:  FEVDNLRRYSLYKLFSDVRDTISDVHDTITNPFG
        FE+DNL R+SL+KLF+DVR       DT+TNPFG
Subjt:  FEVDNLRRYSLYKLFSDVRDTISDVHDTITNPFG

XP_038877911.1 uncharacterized protein LOC120070122 [Benincasa hispida]5.1e-10680.85Show/hide
Query:  MALAAMMLILVA-EGTDTNDVFSPCLDAKVQRSDGFTFGVAFSSKESFFQDQIQFSPCDRRLGLASKNAQLVVFRPRVDQLSLLTINSTTFNPALYGGYM
        +AL  M+L+L+A EG +TND++SPCLD+K+QRSDGFTFGVAFSSKESFFQDQIQ SPCD RL LASKNAQL VFRP VDQLS LTINS+TFNPA+ GGYM
Subjt:  MALAAMMLILVA-EGTDTNDVFSPCLDAKVQRSDGFTFGVAFSSKESFFQDQIQFSPCDRRLGLASKNAQLVVFRPRVDQLSLLTINSTTFNPALYGGYM

Query:  VAFAGQKYAARSLPVLITDNSNTITSFTLVLEFQKGTLQNLFWKKFGCDKCTGDYSVCLDNQDCAVPNTKCKFNGGSIDCNLSIQLAFSGTDRNLEVLNS
        VAFAG+KYAARSLPV+ITDNS+TITSFTLV EFQKGTLQNL WKKFGCDKC+GD+SVCLDNQDCAV ++KCK+NGGS+DCNLSIQLAFSGTD+NLEVLNS
Subjt:  VAFAGQKYAARSLPVLITDNSNTITSFTLVLEFQKGTLQNLFWKKFGCDKCTGDYSVCLDNQDCAVPNTKCKFNGGSIDCNLSIQLAFSGTDRNLEVLNS

Query:  WFEVDNLRRYSLYKLFSDVRDTISDVHDTITNPFG
        W+EVDNLRR+SLYKLFSDVR       DT+TNPFG
Subjt:  WFEVDNLRRYSLYKLFSDVRDTISDVHDTITNPFG

TrEMBL top hitse value%identityAlignment
A0A1S3CA87 uncharacterized protein LOC1034984184.2e-10678.63Show/hide
Query:  MALAAMMLILVAEGTDTNDVFSPCLDAKVQRSDGFTFGVAFSSKESFFQDQIQFSPCDRRLGLASKNAQLVVFRPRVDQLSLLTINSTTFNPALYGGYMV
        +AL   +L+LVAEG DTNDV+SPCLD+K+QRSDGFTFGVAFSSKESFFQDQIQFSPCD RL LASKNAQL VFRP+VDQLS LTI+++TFNPAL GGYMV
Subjt:  MALAAMMLILVAEGTDTNDVFSPCLDAKVQRSDGFTFGVAFSSKESFFQDQIQFSPCDRRLGLASKNAQLVVFRPRVDQLSLLTINSTTFNPALYGGYMV

Query:  AFAGQKYAARSLPVLITDNSNTITSFTLVLEFQKGTLQNLFWKKFGCDKCTGDYSVCLDNQDCAVPNTKCKFNGGSIDCNLSIQLAFSGTDRNLEVLNSW
        AFAGQKYAARSLPV++TDNS+TITSFTLV EF++GTLQNLFWKKFGCDKC+GD+S+C+DNQDCA+ ++KCK++GGS+DCNL IQLAFSGTD+NLEVLNSW
Subjt:  AFAGQKYAARSLPVLITDNSNTITSFTLVLEFQKGTLQNLFWKKFGCDKCTGDYSVCLDNQDCAVPNTKCKFNGGSIDCNLSIQLAFSGTDRNLEVLNSW

Query:  FEVDNLRRYSLYKLFSDVRDTISDVHDTITNPFG
        +E+DNLRR+SLY+LFSDVR       DT+TNPFG
Subjt:  FEVDNLRRYSLYKLFSDVRDTISDVHDTITNPFG

A0A5D3CQC8 Uncharacterized protein4.2e-10678.63Show/hide
Query:  MALAAMMLILVAEGTDTNDVFSPCLDAKVQRSDGFTFGVAFSSKESFFQDQIQFSPCDRRLGLASKNAQLVVFRPRVDQLSLLTINSTTFNPALYGGYMV
        +AL   +L+LVAEG DTNDV+SPCLD+K+QRSDGFTFGVAFSSKESFFQDQIQFSPCD RL LASKNAQL VFRP+VDQLS LTI+++TFNPAL GGYMV
Subjt:  MALAAMMLILVAEGTDTNDVFSPCLDAKVQRSDGFTFGVAFSSKESFFQDQIQFSPCDRRLGLASKNAQLVVFRPRVDQLSLLTINSTTFNPALYGGYMV

Query:  AFAGQKYAARSLPVLITDNSNTITSFTLVLEFQKGTLQNLFWKKFGCDKCTGDYSVCLDNQDCAVPNTKCKFNGGSIDCNLSIQLAFSGTDRNLEVLNSW
        AFAGQKYAARSLPV++TDNS+TITSFTLV EF++GTLQNLFWKKFGCDKC+GD+S+C+DNQDCA+ ++KCK++GGS+DCNL IQLAFSGTD+NLEVLNSW
Subjt:  AFAGQKYAARSLPVLITDNSNTITSFTLVLEFQKGTLQNLFWKKFGCDKCTGDYSVCLDNQDCAVPNTKCKFNGGSIDCNLSIQLAFSGTDRNLEVLNSW

Query:  FEVDNLRRYSLYKLFSDVRDTISDVHDTITNPFG
        +E+DNLRR+SLY+LFSDVR       DT+TNPFG
Subjt:  FEVDNLRRYSLYKLFSDVRDTISDVHDTITNPFG

A0A6J1D4C1 uncharacterized protein LOC1110171553.8e-131100Show/hide
Query:  MALAAMMLILVAEGTDTNDVFSPCLDAKVQRSDGFTFGVAFSSKESFFQDQIQFSPCDRRLGLASKNAQLVVFRPRVDQLSLLTINSTTFNPALYGGYMV
        MALAAMMLILVAEGTDTNDVFSPCLDAKVQRSDGFTFGVAFSSKESFFQDQIQFSPCDRRLGLASKNAQLVVFRPRVDQLSLLTINSTTFNPALYGGYMV
Subjt:  MALAAMMLILVAEGTDTNDVFSPCLDAKVQRSDGFTFGVAFSSKESFFQDQIQFSPCDRRLGLASKNAQLVVFRPRVDQLSLLTINSTTFNPALYGGYMV

Query:  AFAGQKYAARSLPVLITDNSNTITSFTLVLEFQKGTLQNLFWKKFGCDKCTGDYSVCLDNQDCAVPNTKCKFNGGSIDCNLSIQLAFSGTDRNLEVLNSW
        AFAGQKYAARSLPVLITDNSNTITSFTLVLEFQKGTLQNLFWKKFGCDKCTGDYSVCLDNQDCAVPNTKCKFNGGSIDCNLSIQLAFSGTDRNLEVLNSW
Subjt:  AFAGQKYAARSLPVLITDNSNTITSFTLVLEFQKGTLQNLFWKKFGCDKCTGDYSVCLDNQDCAVPNTKCKFNGGSIDCNLSIQLAFSGTDRNLEVLNSW

Query:  FEVDNLRRYSLYKLFSDVRDTISDVHDTITNPFG
        FEVDNLRRYSLYKLFSDVRDTISDVHDTITNPFG
Subjt:  FEVDNLRRYSLYKLFSDVRDTISDVHDTITNPFG

A0A6J1FVK1 uncharacterized protein LOC1114488554.7e-10578.63Show/hide
Query:  MALAAMMLILVAEGTDTNDVFSPCLDAKVQRSDGFTFGVAFSSKESFFQDQIQFSPCDRRLGLASKNAQLVVFRPRVDQLSLLTINSTTFNPALYGGYMV
        + L   +L+LVAEG DTNDV+SPCLDAK+Q+SDGFTFG+AFSSKE+FFQDQIQFSPCD RL L  KN QL +FRP+VDQLSLLTINSTTFNPA+ GGYMV
Subjt:  MALAAMMLILVAEGTDTNDVFSPCLDAKVQRSDGFTFGVAFSSKESFFQDQIQFSPCDRRLGLASKNAQLVVFRPRVDQLSLLTINSTTFNPALYGGYMV

Query:  AFAGQKYAARSLPVLITDNSNTITSFTLVLEFQKGTLQNLFWKKFGCDKCTGDYSVCLDNQDCAVPNTKCKFNGGSIDCNLSIQLAFSGTDRNLEVLNSW
        AFAG KYAARSLPV+ITDNS+TITSFTLV EFQ+GTLQNLFWKK+GC+KCTGD+SVCLDNQDCAV ++KCK++GGS+DCN+SIQLAFSGTDRNLEVLNSW
Subjt:  AFAGQKYAARSLPVLITDNSNTITSFTLVLEFQKGTLQNLFWKKFGCDKCTGDYSVCLDNQDCAVPNTKCKFNGGSIDCNLSIQLAFSGTDRNLEVLNSW

Query:  FEVDNLRRYSLYKLFSDVRDTISDVHDTITNPFG
        FEVDNL R+SL+KLF+DVR       DT+TNPFG
Subjt:  FEVDNLRRYSLYKLFSDVRDTISDVHDTITNPFG

A0A6J1JGG7 uncharacterized protein LOC1114842896.1e-10578.21Show/hide
Query:  MALAAMMLILVAEGTDTNDVFSPCLDAKVQRSDGFTFGVAFSSKESFFQDQIQFSPCDRRLGLASKNAQLVVFRPRVDQLSLLTINSTTFNPALYGGYMV
        + L   +L+LVAEGTDTN+++SPCLDAK+Q+SDGFTFG+AFSSKE+FFQDQIQFSPCD RL L  KN QL +FRP+VDQLSLLTINSTTFNPA+ GGYMV
Subjt:  MALAAMMLILVAEGTDTNDVFSPCLDAKVQRSDGFTFGVAFSSKESFFQDQIQFSPCDRRLGLASKNAQLVVFRPRVDQLSLLTINSTTFNPALYGGYMV

Query:  AFAGQKYAARSLPVLITDNSNTITSFTLVLEFQKGTLQNLFWKKFGCDKCTGDYSVCLDNQDCAVPNTKCKFNGGSIDCNLSIQLAFSGTDRNLEVLNSW
        AFAG KYAARSLPV+ITDNS+TITSFTLV EFQ+GTLQNLFWKK+GC+KCTGD+SVCLDNQDC V ++KCK++GGS+DCN+SIQLAFSGTDRNLEVLNSW
Subjt:  AFAGQKYAARSLPVLITDNSNTITSFTLVLEFQKGTLQNLFWKKFGCDKCTGDYSVCLDNQDCAVPNTKCKFNGGSIDCNLSIQLAFSGTDRNLEVLNSW

Query:  FEVDNLRRYSLYKLFSDVRDTISDVHDTITNPFG
        FEVDNL R+SL+KLFSDVR       DT+TNPFG
Subjt:  FEVDNLRRYSLYKLFSDVRDTISDVHDTITNPFG

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G15910.1 CSL zinc finger domain-containing protein1.2e-7658.12Show/hide
Query:  MMLILVAE----GTDTNDVFSPCLDAKVQRSDGFTFGVAFSSKESFFQDQIQFSPCDRRLGLASKNAQLVVFRPRVDQLSLLTINSTTFNPALYGGYMVA
        MM++++ +      D N V+SPC D ++ + DGFT G+A SSKE+FF DQ+Q SPCD RLGLA+K AQL +FRP+VD++SLL+I+++ FNP+  GG+MV 
Subjt:  MMLILVAE----GTDTNDVFSPCLDAKVQRSDGFTFGVAFSSKESFFQDQIQFSPCDRRLGLASKNAQLVVFRPRVDQLSLLTINSTTFNPALYGGYMVA

Query:  FAGQKYAARSLPVLITDNSNTITSFT---------LVLEFQKGTLQNLFWKKFGCDKCTG---DYSVCLDNQDCAVPNTKCKFNGGSIDCNLSIQLAFSG
        FAG KYAARS PV + D SNTIT+FT         LVLEFQKG LQNLFWK FGCD C G     SVCL+  DCAVP +KCK NGG  +CN+ IQ+AFSG
Subjt:  FAGQKYAARSLPVLITDNSNTITSFT---------LVLEFQKGTLQNLFWKKFGCDKCTG---DYSVCLDNQDCAVPNTKCKFNGGSIDCNLSIQLAFSG

Query:  TDRNLEVLNSWFEVDNLRRYSLYKLFSDVRDTIS
        TDRNLE LN+W+EV+NLR+YSL  L+++  D++S
Subjt:  TDRNLEVLNSWFEVDNLRRYSLYKLFSDVRDTIS

AT3G11800.1 unknown protein3.1e-6952.52Show/hide
Query:  AAMMLILVAEGTDTNDVFSPCLDAKVQRSDGFTFGVAFSSKESFF----QDQIQFSPCDRRLGLASKNAQLVVFRPRVDQLSLLTIN---STTFNPALYG
        AA++   + E  D N V+SPC D+ V   DGFTFG+AF++K+SFF       +Q+SPCD R    + N+++ VFRP+VD+++LLTIN   S++F P    
Subjt:  AAMMLILVAEGTDTNDVFSPCLDAKVQRSDGFTFGVAFSSKESFF----QDQIQFSPCDRRLGLASKNAQLVVFRPRVDQLSLLTIN---STTFNPALYG

Query:  GYMVAFAGQKYAARSLPVLITDNSNTITSFTLVLEFQKGTLQNLFWKKFGCDKCTGDYS-VCLDNQDCAVPNTKCKFNGGSIDCNLSIQLAFSGTDRNLE
        GYMVAFAG KYAARSLP+++ D+++ +TSFTLVLEFQKG L+N+FWKK GC KC+GD   VCL+ ++CA+    CK  GG +DC+L IQLAFSGTD++  
Subjt:  GYMVAFAGQKYAARSLPVLITDNSNTITSFTLVLEFQKGTLQNLFWKKFGCDKCTGDYS-VCLDNQDCAVPNTKCKFNGGSIDCNLSIQLAFSGTDRNLE

Query:  VLNSWFEVDNLRRYSLYKLFSDVRDTISDVHDTITNPF
         LNSW+EV NL++YSLY L+S+++       D++TNPF
Subjt:  VLNSWFEVDNLRRYSLYKLFSDVRDTISDVHDTITNPF

AT3G44150.1 unknown protein1.2e-7356.64Show/hide
Query:  MLILVAEGTD------TNDVFSPCLDAKVQRSDGFTFGVAFSSKESFFQDQ-IQFSPCDRRLGLASKNAQLVVFRPRVDQLSLLTINSTTFNPALYGGYM
        +++ VA G D      TN ++SPC D ++QRSDGFTFG+AFSS+ SFF +Q +  SPCDRRL LA+ N+Q  VFRP++D++SLL+IN++ F P  YGGYM
Subjt:  MLILVAEGTD------TNDVFSPCLDAKVQRSDGFTFGVAFSSKESFFQDQ-IQFSPCDRRLGLASKNAQLVVFRPRVDQLSLLTINSTTFNPALYGGYM

Query:  VAFAGQKYAARSLPVLITDNSNTITSFTLVLEFQKGTLQNLFWKKFGCDKCTGDYS-VCLDNQDCAVPNTKCKFNGGSIDCNLSIQLAFSGTDRNLEVLN
        VAFAG+KYAARS+P  I +++  +TSFTLV+EFQKG LQNL+WK+ GC  C G+ + VCL+ QDCA+    CK  GG++DC+L IQLAFSGTD++L VLN
Subjt:  VAFAGQKYAARSLPVLITDNSNTITSFTLVLEFQKGTLQNLFWKKFGCDKCTGDYS-VCLDNQDCAVPNTKCKFNGGSIDCNLSIQLAFSGTDRNLEVLN

Query:  SWFEVDNLRRYSLYKLFSDVRDTISD
        SW+EV+NL++YSLY L+S+++ ++++
Subjt:  SWFEVDNLRRYSLYKLFSDVRDTISD

AT3G48630.1 unknown protein6.4e-0644.23Show/hide
Query:  YMVAFAGQKYAARSLPVLITDNSNTITSFTLVLEFQKGTLQNLFWKKFGCDK
        Y V   G +  +   P  I +++  +TSFT V+EFQKG LQNL+WK+  C K
Subjt:  YMVAFAGQKYAARSLPVLITDNSNTITSFTLVLEFQKGTLQNLFWKKFGCDK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTTTGGCAGCAATGATGCTGATTTTGGTTGCGGAAGGGACGGATACGAACGATGTATTCAGTCCATGTTTGGATGCGAAAGTTCAGAGATCAGATGGATTCACTTT
TGGTGTAGCGTTTTCGTCGAAGGAATCGTTCTTTCAGGATCAGATTCAGTTTTCGCCATGTGATCGACGTCTCGGTTTGGCATCCAAAAACGCTCAGCTTGTTGTTTTCA
GGCCTAGGGTCGACCAGCTCTCGCTCCTTACCATCAATAGCACCACCTTCAATCCGGCTCTGTATGGTGGGTATATGGTAGCATTTGCTGGGCAGAAGTATGCAGCAAGA
TCTCTCCCAGTATTGATTACTGATAATTCTAACACCATAACTAGTTTCACTTTGGTTCTTGAATTTCAGAAGGGCACTCTTCAAAATCTGTTCTGGAAGAAATTTGGGTG
TGATAAATGCACTGGGGACTACTCAGTTTGCCTGGACAACCAAGACTGTGCAGTTCCAAACACCAAATGTAAATTCAATGGTGGTTCCATTGACTGCAATCTGAGCATAC
AACTAGCATTTTCAGGGACAGACAGGAACCTGGAAGTCCTCAACTCCTGGTTTGAGGTCGACAATCTCAGGCGCTACTCCCTCTATAAACTTTTCTCTGATGTTCGAGAT
ACTATCTCTGATGTTCATGATACCATCACCAATCCATTCGGGTAA
mRNA sequenceShow/hide mRNA sequence
ATGGCTTTGGCAGCAATGATGCTGATTTTGGTTGCGGAAGGGACGGATACGAACGATGTATTCAGTCCATGTTTGGATGCGAAAGTTCAGAGATCAGATGGATTCACTTT
TGGTGTAGCGTTTTCGTCGAAGGAATCGTTCTTTCAGGATCAGATTCAGTTTTCGCCATGTGATCGACGTCTCGGTTTGGCATCCAAAAACGCTCAGCTTGTTGTTTTCA
GGCCTAGGGTCGACCAGCTCTCGCTCCTTACCATCAATAGCACCACCTTCAATCCGGCTCTGTATGGTGGGTATATGGTAGCATTTGCTGGGCAGAAGTATGCAGCAAGA
TCTCTCCCAGTATTGATTACTGATAATTCTAACACCATAACTAGTTTCACTTTGGTTCTTGAATTTCAGAAGGGCACTCTTCAAAATCTGTTCTGGAAGAAATTTGGGTG
TGATAAATGCACTGGGGACTACTCAGTTTGCCTGGACAACCAAGACTGTGCAGTTCCAAACACCAAATGTAAATTCAATGGTGGTTCCATTGACTGCAATCTGAGCATAC
AACTAGCATTTTCAGGGACAGACAGGAACCTGGAAGTCCTCAACTCCTGGTTTGAGGTCGACAATCTCAGGCGCTACTCCCTCTATAAACTTTTCTCTGATGTTCGAGAT
ACTATCTCTGATGTTCATGATACCATCACCAATCCATTCGGGTAA
Protein sequenceShow/hide protein sequence
MALAAMMLILVAEGTDTNDVFSPCLDAKVQRSDGFTFGVAFSSKESFFQDQIQFSPCDRRLGLASKNAQLVVFRPRVDQLSLLTINSTTFNPALYGGYMVAFAGQKYAAR
SLPVLITDNSNTITSFTLVLEFQKGTLQNLFWKKFGCDKCTGDYSVCLDNQDCAVPNTKCKFNGGSIDCNLSIQLAFSGTDRNLEVLNSWFEVDNLRRYSLYKLFSDVRD
TISDVHDTITNPFG