CuGenDBv2

MC06g2202 (gene) of Bitter gourd (Dali-11) v1 genome

Gene ID: MC06g2202
Organism: Momordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
Description: GATA transcription factor 17-like isoform X1
Genome location: MC06:29399077..29401754
RNA-Seq Expression: MC06g2202
Synteny: MC06g2202
Gene Ontology terms:
    GO:0006355 - regulation of transcription, DNA-templated (biological process)
    GO:0008270 - zinc ion binding (molecular function)
    GO:0043565 - sequence-specific DNA binding (molecular function)
InterPro domains:
    IPR000679 - Zinc finger, GATA-type
    IPR013088 - Zinc finger, NHR/GATA-type
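
The genome location in this record is written as sequence:start..end. As a minimal sketch, the Python below parses that string and reports the genomic span. It assumes the coordinates are 1-based and inclusive, which the page does not state; `Location` and `parse_location` are hypothetical names used for illustration only.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    seqid: str
    start: int
    end: int

    @property
    def span(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a 'seqid:start..end' string into a Location."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognized location: {text!r}")
    seqid, start, end = match.groups()
    return Location(seqid, int(start), int(end))


loc = parse_location("MC06:29399077..29401754")
print(loc.seqid, loc.start, loc.end, loc.span)
```

Under the inclusive-coordinate assumption the span is 2,678 bp, which is longer than the mRNA sequence listed in this record; that would be consistent with the gene containing intronic sequence.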


Homology
GenBank top hits (e value, % identity, alignment)
XP_022135612.1 GATA transcription factor 17-like isoform X1 [Momordica charantia] | e value: 2.77e-150 | % identity: 100
Query:  MAALAAAPPRVFPFYPYPLSSLSHRFRSVSIYLLSFTHDSAFSGFNFNFLNKAEMGMMDVLRRKNKVERVEDDTKKYCVDCKTTKTPLWRGGPAGPKSLC
        MAALAAAPPRVFPFYPYPLSSLSHRFRSVSIYLLSFTHDSAFSGFNFNFLNKAEMGMMDVLRRKNKVERVEDDTKKYCVDCKTTKTPLWRGGPAGPKSLC
Subjt:  MAALAAAPPRVFPFYPYPLSSLSHRFRSVSIYLLSFTHDSAFSGFNFNFLNKAEMGMMDVLRRKNKVERVEDDTKKYCVDCKTTKTPLWRGGPAGPKSLC

Query:  NACGIRFRKRRVSTIGTNRGCDRKREKAHSHGGSTTAAMSATTSSSATAADAKSNNGGADGEEEEEDLGECGSLRMRLMMALGEEVVVQQNISKQRPPRK
        NACGIRFRKRRVSTIGTNRGCDRKREKAHSHGGSTTAAMSATTSSSATAADAKSNNGGADGEEEEEDLGECGSLRMRLMMALGEEVVVQQNISKQRPPRK
Subjt:  NACGIRFRKRRVSTIGTNRGCDRKREKAHSHGGSTTAAMSATTSSSATAADAKSNNGGADGEEEEEDLGECGSLRMRLMMALGEEVVVQQNISKQRPPRK

Query:  LGEEEQAAVSLMALSCGSVFA
        LGEEEQAAVSLMALSCGSVFA
Subjt:  LGEEEQAAVSLMALSCGSVFA

XP_022135613.1 GATA transcription factor 17-like isoform X2 [Momordica charantia] | e value: 7.34e-148 | % identity: 99.55
Query:  MAALAAAPPRVFPFYPYPLSSLSHRFRSVSIYLLSFTHDSAFSGFNFNFLNKAEMGMMDVLRRKNKVERVEDDTKKYCVDCKTTKTPLWRGGPAGPKSLC
        MAALAAAPPRVFPFYPYPLSSLSHRFRSVSIYLLSFTHDSAFSGFNFNFLNKAEMGMMDVLRRKNK ERVEDDTKKYCVDCKTTKTPLWRGGPAGPKSLC
Subjt:  MAALAAAPPRVFPFYPYPLSSLSHRFRSVSIYLLSFTHDSAFSGFNFNFLNKAEMGMMDVLRRKNKVERVEDDTKKYCVDCKTTKTPLWRGGPAGPKSLC

Query:  NACGIRFRKRRVSTIGTNRGCDRKREKAHSHGGSTTAAMSATTSSSATAADAKSNNGGADGEEEEEDLGECGSLRMRLMMALGEEVVVQQNISKQRPPRK
        NACGIRFRKRRVSTIGTNRGCDRKREKAHSHGGSTTAAMSATTSSSATAADAKSNNGGADGEEEEEDLGECGSLRMRLMMALGEEVVVQQNISKQRPPRK
Subjt:  NACGIRFRKRRVSTIGTNRGCDRKREKAHSHGGSTTAAMSATTSSSATAADAKSNNGGADGEEEEEDLGECGSLRMRLMMALGEEVVVQQNISKQRPPRK

Query:  LGEEEQAAVSLMALSCGSVFA
        LGEEEQAAVSLMALSCGSVFA
Subjt:  LGEEEQAAVSLMALSCGSVFA

XP_022135614.1 GATA transcription factor 16-like isoform X3 [Momordica charantia] | e value: 4.08e-110 | % identity: 100
Query:  MGMMDVLRRKNKVERVEDDTKKYCVDCKTTKTPLWRGGPAGPKSLCNACGIRFRKRRVSTIGTNRGCDRKREKAHSHGGSTTAAMSATTSSSATAADAKS
        MGMMDVLRRKNKVERVEDDTKKYCVDCKTTKTPLWRGGPAGPKSLCNACGIRFRKRRVSTIGTNRGCDRKREKAHSHGGSTTAAMSATTSSSATAADAKS
Subjt:  MGMMDVLRRKNKVERVEDDTKKYCVDCKTTKTPLWRGGPAGPKSLCNACGIRFRKRRVSTIGTNRGCDRKREKAHSHGGSTTAAMSATTSSSATAADAKS

Query:  NNGGADGEEEEEDLGECGSLRMRLMMALGEEVVVQQNISKQRPPRKLGEEEQAAVSLMALSCGSVFA
        NNGGADGEEEEEDLGECGSLRMRLMMALGEEVVVQQNISKQRPPRKLGEEEQAAVSLMALSCGSVFA
Subjt:  NNGGADGEEEEEDLGECGSLRMRLMMALGEEVVVQQNISKQRPPRKLGEEEQAAVSLMALSCGSVFA

XP_022135615.1 GATA transcription factor 16-like isoform X4 [Momordica charantia] | e value: 1.08e-107 | % identity: 99.4
Query:  MGMMDVLRRKNKVERVEDDTKKYCVDCKTTKTPLWRGGPAGPKSLCNACGIRFRKRRVSTIGTNRGCDRKREKAHSHGGSTTAAMSATTSSSATAADAKS
        MGMMDVLRRKNK ERVEDDTKKYCVDCKTTKTPLWRGGPAGPKSLCNACGIRFRKRRVSTIGTNRGCDRKREKAHSHGGSTTAAMSATTSSSATAADAKS
Subjt:  MGMMDVLRRKNKVERVEDDTKKYCVDCKTTKTPLWRGGPAGPKSLCNACGIRFRKRRVSTIGTNRGCDRKREKAHSHGGSTTAAMSATTSSSATAADAKS

Query:  NNGGADGEEEEEDLGECGSLRMRLMMALGEEVVVQQNISKQRPPRKLGEEEQAAVSLMALSCGSVFA
        NNGGADGEEEEEDLGECGSLRMRLMMALGEEVVVQQNISKQRPPRKLGEEEQAAVSLMALSCGSVFA
Subjt:  NNGGADGEEEEEDLGECGSLRMRLMMALGEEVVVQQNISKQRPPRKLGEEEQAAVSLMALSCGSVFA

XP_038880207.1 GATA transcription factor 17-like [Benincasa hispida] | e value: 2.72e-58 | % identity: 67.42
Query:  FLNKAEMGMMDVLRRKNKVERVEDDTKKYCVDCKTTKTPLWRGGPAGPKSLCNACGIRFRKRRVSTIGTNRGCDRKREKAHSHGGSTTAAMSATTSSSAT
        FLN+ EMGMMD LR+K  +     DTK  CVDCKTTKTPLWRGGP GPKSLCNACGIRFRKRR+STIGTNRG DRKRE+ H++G + T  +SATTSS+ T
Subjt:  FLNKAEMGMMDVLRRKNKVERVEDDTKKYCVDCKTTKTPLWRGGPAGPKSLCNACGIRFRKRRVSTIGTNRGCDRKREKAHSHGGSTTAAMSATTSSSAT

Query:  AADAKSNNGGADGEEEEEDLGECGSLRMRLMMALGEEVVVQQN----ISKQR--PPRKLGEEE-QAAVSLMALSCGSV
             S +G  DG+E   +LGECGSL MRLMMAL EEV+V QN    + KQR    RKLGEEE QAAVSLMALSCGSV
Subjt:  AADAKSNNGGADGEEEEEDLGECGSLRMRLMMALGEEVVVQQN----ISKQR--PPRKLGEEE-QAAVSLMALSCGSV
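
In the alignments above, the middle line repeats a residue where query and subject agree, shows + for a conservative substitution, and is blank at a mismatch or gap. The sketch below recomputes percent identity from a query line and its middle line. It assumes identity is the number of identical columns divided by the alignment length, gap columns included; `percent_identity` is a hypothetical helper, not part of any database tooling. The example strings are the first 16 columns of the isoform X4 alignment.

```python
def percent_identity(query: str, midline: str) -> float:
    """Percent identity from one aligned query line and its midline."""
    # Scraped midlines can lose trailing spaces, so pad to the query width.
    padded = midline.ljust(len(query))
    if len(padded) != len(query):
        raise ValueError("midline is longer than the aligned query")
    # Letters mark identical columns; '+' and ' ' are not identities.
    matches = sum(ch.isalpha() for ch in padded)
    return round(100.0 * matches / len(query), 2)


query = "MGMMDVLRRKNKVERV"
midline = "MGMMDVLRRKNK ERV"
print(percent_identity(query, midline))
```

Applied to a whole alignment, one blank column in 221 gives 220/221, which rounds to the 99.55 reported for isoform X2.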

TrEMBL top hits (e value, % identity, alignment)
A0A6J1C1I3 GATA transcription factor 16-like isoform X4 | e value: 5.23e-108 | % identity: 99.4
Query:  MGMMDVLRRKNKVERVEDDTKKYCVDCKTTKTPLWRGGPAGPKSLCNACGIRFRKRRVSTIGTNRGCDRKREKAHSHGGSTTAAMSATTSSSATAADAKS
        MGMMDVLRRKNK ERVEDDTKKYCVDCKTTKTPLWRGGPAGPKSLCNACGIRFRKRRVSTIGTNRGCDRKREKAHSHGGSTTAAMSATTSSSATAADAKS
Subjt:  MGMMDVLRRKNKVERVEDDTKKYCVDCKTTKTPLWRGGPAGPKSLCNACGIRFRKRRVSTIGTNRGCDRKREKAHSHGGSTTAAMSATTSSSATAADAKS

Query:  NNGGADGEEEEEDLGECGSLRMRLMMALGEEVVVQQNISKQRPPRKLGEEEQAAVSLMALSCGSVFA
        NNGGADGEEEEEDLGECGSLRMRLMMALGEEVVVQQNISKQRPPRKLGEEEQAAVSLMALSCGSVFA
Subjt:  NNGGADGEEEEEDLGECGSLRMRLMMALGEEVVVQQNISKQRPPRKLGEEEQAAVSLMALSCGSVFA

A0A6J1C1Y1 GATA transcription factor 16-like isoform X3 | e value: 1.97e-110 | % identity: 100
Query:  MGMMDVLRRKNKVERVEDDTKKYCVDCKTTKTPLWRGGPAGPKSLCNACGIRFRKRRVSTIGTNRGCDRKREKAHSHGGSTTAAMSATTSSSATAADAKS
        MGMMDVLRRKNKVERVEDDTKKYCVDCKTTKTPLWRGGPAGPKSLCNACGIRFRKRRVSTIGTNRGCDRKREKAHSHGGSTTAAMSATTSSSATAADAKS
Subjt:  MGMMDVLRRKNKVERVEDDTKKYCVDCKTTKTPLWRGGPAGPKSLCNACGIRFRKRRVSTIGTNRGCDRKREKAHSHGGSTTAAMSATTSSSATAADAKS

Query:  NNGGADGEEEEEDLGECGSLRMRLMMALGEEVVVQQNISKQRPPRKLGEEEQAAVSLMALSCGSVFA
        NNGGADGEEEEEDLGECGSLRMRLMMALGEEVVVQQNISKQRPPRKLGEEEQAAVSLMALSCGSVFA
Subjt:  NNGGADGEEEEEDLGECGSLRMRLMMALGEEVVVQQNISKQRPPRKLGEEEQAAVSLMALSCGSVFA

A0A6J1C373 GATA transcription factor 17-like isoform X1 | e value: 1.34e-150 | % identity: 100
Query:  MAALAAAPPRVFPFYPYPLSSLSHRFRSVSIYLLSFTHDSAFSGFNFNFLNKAEMGMMDVLRRKNKVERVEDDTKKYCVDCKTTKTPLWRGGPAGPKSLC
        MAALAAAPPRVFPFYPYPLSSLSHRFRSVSIYLLSFTHDSAFSGFNFNFLNKAEMGMMDVLRRKNKVERVEDDTKKYCVDCKTTKTPLWRGGPAGPKSLC
Subjt:  MAALAAAPPRVFPFYPYPLSSLSHRFRSVSIYLLSFTHDSAFSGFNFNFLNKAEMGMMDVLRRKNKVERVEDDTKKYCVDCKTTKTPLWRGGPAGPKSLC

Query:  NACGIRFRKRRVSTIGTNRGCDRKREKAHSHGGSTTAAMSATTSSSATAADAKSNNGGADGEEEEEDLGECGSLRMRLMMALGEEVVVQQNISKQRPPRK
        NACGIRFRKRRVSTIGTNRGCDRKREKAHSHGGSTTAAMSATTSSSATAADAKSNNGGADGEEEEEDLGECGSLRMRLMMALGEEVVVQQNISKQRPPRK
Subjt:  NACGIRFRKRRVSTIGTNRGCDRKREKAHSHGGSTTAAMSATTSSSATAADAKSNNGGADGEEEEEDLGECGSLRMRLMMALGEEVVVQQNISKQRPPRK

Query:  LGEEEQAAVSLMALSCGSVFA
        LGEEEQAAVSLMALSCGSVFA
Subjt:  LGEEEQAAVSLMALSCGSVFA

A0A6J1C5A4 GATA transcription factor 17-like isoform X2 | e value: 3.56e-148 | % identity: 99.55
Query:  MAALAAAPPRVFPFYPYPLSSLSHRFRSVSIYLLSFTHDSAFSGFNFNFLNKAEMGMMDVLRRKNKVERVEDDTKKYCVDCKTTKTPLWRGGPAGPKSLC
        MAALAAAPPRVFPFYPYPLSSLSHRFRSVSIYLLSFTHDSAFSGFNFNFLNKAEMGMMDVLRRKNK ERVEDDTKKYCVDCKTTKTPLWRGGPAGPKSLC
Subjt:  MAALAAAPPRVFPFYPYPLSSLSHRFRSVSIYLLSFTHDSAFSGFNFNFLNKAEMGMMDVLRRKNKVERVEDDTKKYCVDCKTTKTPLWRGGPAGPKSLC

Query:  NACGIRFRKRRVSTIGTNRGCDRKREKAHSHGGSTTAAMSATTSSSATAADAKSNNGGADGEEEEEDLGECGSLRMRLMMALGEEVVVQQNISKQRPPRK
        NACGIRFRKRRVSTIGTNRGCDRKREKAHSHGGSTTAAMSATTSSSATAADAKSNNGGADGEEEEEDLGECGSLRMRLMMALGEEVVVQQNISKQRPPRK
Subjt:  NACGIRFRKRRVSTIGTNRGCDRKREKAHSHGGSTTAAMSATTSSSATAADAKSNNGGADGEEEEEDLGECGSLRMRLMMALGEEVVVQQNISKQRPPRK

Query:  LGEEEQAAVSLMALSCGSVFA
        LGEEEQAAVSLMALSCGSVFA
Subjt:  LGEEEQAAVSLMALSCGSVFA

A0A6J1JPG4 GATA transcription factor 16-like | e value: 2.99e-53 | % identity: 64.29
Query:  MGMMDVLRRK-NKVERVEDDTKKYCVDCKTTKTPLWRGGPAGPKSLCNACGIRFRKRRVSTIGTNRGCDRKREKAHSHGGSTTAAMSATTSSSATAADAK
        MGMMDV ++  +   +++D TKK CVDC TTKTPLWRGGPAGPKSLCNACGIRFRKRR+ST   NRG  RKRE+ HS   STT++ S    SS T A   
Subjt:  MGMMDVLRRK-NKVERVEDDTKKYCVDCKTTKTPLWRGGPAGPKSLCNACGIRFRKRRVSTIGTNRGCDRKREKAHSHGGSTTAAMSATTSSSATAADAK

Query:  SNNGGADGEEEEEDLGECGSLRMRLMMALGEEVVVQQNISKQRPPRKLGEEEQAAVSLMALSCGSVFA
        + +GG  G+ EE DLGEC SLRMR+MM   EEVVV QN+S  +   KLGEEEQAAV LMALSCGSVFA
Subjt:  SNNGGADGEEEEEDLGECGSLRMRLMMALGEEVVVQQNISKQRPPRKLGEEEQAAVSLMALSCGSVFA

SwissProt top hits (e value, % identity, alignment)
Q8LC59 GATA transcription factor 23 | e value: 3.2e-12 | % identity: 68.89
Query:  EDDTKKYCVDCKTTKTPLWRGGPAGPKSLCNACGIRFRKRRVSTI
        E  T + C +CKTTKTP+WRGGP GPKSLCNACGIR RK+R S +
Subjt:  EDDTKKYCVDCKTTKTPLWRGGPAGPKSLCNACGIRFRKRRVSTI

Q8LG10 GATA transcription factor 15 | e value: 3.0e-18 | % identity: 45.27
Query:  KKYCVDCKTTKTPLWRGGPAGPKSLCNACGIRFRKRRVSTIGTNRGCDRKREKAHSHGGSTTAAMSATTSSSATAADAKSNNGGADGEEEEEDLGECGSL
        KK C  C T+KTPLWRGGPAGPKSLCNACGIR RK+R  T+ +NR  D+K+                           KS+N            G+  SL
Subjt:  KKYCVDCKTTKTPLWRGGPAGPKSLCNACGIRFRKRRVSTIGTNRGCDRKREKAHSHGGSTTAAMSATTSSSATAADAKSNNGGADGEEEEEDLGECGSL

Query:  RMRLMMALGEEVVVQQNISKQRPPRKLGEEEQAAVSLMALS-CGSVFA
        + RL M LG EV++Q++ ++ +   KLGEEEQAAV LMALS   SV+A
Subjt:  RMRLMMALGEEVVVQQNISKQRPPRKLGEEEQAAVSLMALS-CGSVFA

Q9FJ10 GATA transcription factor 16 | e value: 3.3e-17 | % identity: 44.67
Query:  DDTKKYCVDCKTTKTPLWRGGPAGPKSLCNACGIRFRKRRVSTIGTNRGCDRKREKAHSHGGSTTAAMSATTSSSATAADAKSNNGGADGEEEEEDLGEC
        +D KK C DC T+KTPLWRGGP GPKSLCNACGIR RK+R    GT    D K+ K  S GG                           GE  ++ L + 
Subjt:  DDTKKYCVDCKTTKTPLWRGGPAGPKSLCNACGIRFRKRRVSTIGTNRGCDRKREKAHSHGGSTTAAMSATTSSSATAADAKSNNGGADGEEEEEDLGEC

Query:  GSLRMRLMMALGEEVVVQQNISKQRPPRKLGEEEQAAVSLMALSCGSVFA
        G +R R              + KQR  +KLGEEEQAAV LMALS GSV+A
Subjt:  GSLRMRLMMALGEEVVVQQNISKQRPPRKLGEEEQAAVSLMALSCGSVFA

Q9LIB5 GATA transcription factor 17 | e value: 3.9e-18 | % identity: 39.24
Query:  DTKKYCVDCKTTKTPLWRGGPAGPKSLCNACGIRFRKRRVSTIGTNRGCDRKREKAHSHGGSTTAAMSATTSSSATAADAK---------SNNGGADGEE
        DTK+ CVDC T +TPLWRGGPAGPKSLCNACGI+ RK+R + +G      +K  K++ +        +A         D K          NN  +    
Subjt:  DTKKYCVDCKTTKTPLWRGGPAGPKSLCNACGIRFRKRRVSTIGTNRGCDRKREKAHSHGGSTTAAMSATTSSSATAADAK---------SNNGGADGEE

Query:  EEEDLGECGSLRMRLMMALGEEVVVQQNISKQRPPRKLGEEEQAAVSLMALSCGSVFA
          + + +   L  ++       V+ +  + K+R  RKLGEEE+AAV LMALSC SV+A
Subjt:  EEEDLGECGSLRMRLMMALGEEVVVQQNISKQRPPRKLGEEEQAAVSLMALSCGSVFA

Q9SZI6 Putative GATA transcription factor 22 | e value: 8.0e-11 | % identity: 69.05
Query:  KYCVDCKTTKTPLWRGGPAGPKSLCNACGIRFRKRRVSTIGT
        + C DC TTKTPLWR GP GPKSLCNACGIR RK R + + T
Subjt:  KYCVDCKTTKTPLWRGGPAGPKSLCNACGIRFRKRRVSTIGT

Arabidopsis top hits (e value, % identity, alignment)
AT3G06740.1 GATA transcription factor 15 | e value: 2.1e-19 | % identity: 45.27
Query:  KKYCVDCKTTKTPLWRGGPAGPKSLCNACGIRFRKRRVSTIGTNRGCDRKREKAHSHGGSTTAAMSATTSSSATAADAKSNNGGADGEEEEEDLGECGSL
        KK C  C T+KTPLWRGGPAGPKSLCNACGIR RK+R  T+ +NR  D+K+                           KS+N            G+  SL
Subjt:  KKYCVDCKTTKTPLWRGGPAGPKSLCNACGIRFRKRRVSTIGTNRGCDRKREKAHSHGGSTTAAMSATTSSSATAADAKSNNGGADGEEEEEDLGECGSL

Query:  RMRLMMALGEEVVVQQNISKQRPPRKLGEEEQAAVSLMALS-CGSVFA
        + RL M LG EV++Q++ ++ +   KLGEEEQAAV LMALS   SV+A
Subjt:  RMRLMMALGEEVVVQQNISKQRPPRKLGEEEQAAVSLMALS-CGSVFA

AT3G16870.1 GATA transcription factor 17 | e value: 2.8e-19 | % identity: 39.24
Query:  DTKKYCVDCKTTKTPLWRGGPAGPKSLCNACGIRFRKRRVSTIGTNRGCDRKREKAHSHGGSTTAAMSATTSSSATAADAK---------SNNGGADGEE
        DTK+ CVDC T +TPLWRGGPAGPKSLCNACGI+ RK+R + +G      +K  K++ +        +A         D K          NN  +    
Subjt:  DTKKYCVDCKTTKTPLWRGGPAGPKSLCNACGIRFRKRRVSTIGTNRGCDRKREKAHSHGGSTTAAMSATTSSSATAADAK---------SNNGGADGEE

Query:  EEEDLGECGSLRMRLMMALGEEVVVQQNISKQRPPRKLGEEEQAAVSLMALSCGSVFA
          + + +   L  ++       V+ +  + K+R  RKLGEEE+AAV LMALSC SV+A
Subjt:  EEEDLGECGSLRMRLMMALGEEVVVQQNISKQRPPRKLGEEEQAAVSLMALSCGSVFA

AT4G16141.1 GATA type zinc finger transcription factor family protein | e value: 6.9e-18 | % identity: 38.41
Query:  DTKKYCVDCKTTKTPLWRGGPAGPKSLCNACGIRFRKRRVSTIGTNRGCDRKREKAHSHGGSTTAAMSATTSSSATAADAKSNNG------GADG-----
        DTKK CVDC T++TPLWRGGPAGPKSLCNACGI+ RK+R + +G  +   + + K++++ G  +  +            AK   G      G  G     
Subjt:  DTKKYCVDCKTTKTPLWRGGPAGPKSLCNACGIRFRKRRVSTIGTNRGCDRKREKAHSHGGSTTAAMSATTSSSATAADAKSNNG------GADG-----

Query:  -EEEEEDLGECGS-----LRMRLMMALGEEVVVQQN--ISKQRPPRKLGEEEQAAVSLMALSCG
         + + E+     +      R+   +  G +V   +   + K+R  RKLGEEE+AAV LMALSCG
Subjt:  -EEEEEDLGECGS-----LRMRLMMALGEEVVVQQN--ISKQRPPRKLGEEEQAAVSLMALSCG

AT5G26930.1 GATA transcription factor 23 | e value: 2.3e-13 | % identity: 68.89
Query:  EDDTKKYCVDCKTTKTPLWRGGPAGPKSLCNACGIRFRKRRVSTI
        E  T + C +CKTTKTP+WRGGP GPKSLCNACGIR RK+R S +
Subjt:  EDDTKKYCVDCKTTKTPLWRGGPAGPKSLCNACGIRFRKRRVSTI

AT5G49300.1 GATA transcription factor 16 | e value: 2.4e-18 | % identity: 44.67
Query:  DDTKKYCVDCKTTKTPLWRGGPAGPKSLCNACGIRFRKRRVSTIGTNRGCDRKREKAHSHGGSTTAAMSATTSSSATAADAKSNNGGADGEEEEEDLGEC
        +D KK C DC T+KTPLWRGGP GPKSLCNACGIR RK+R    GT    D K+ K  S GG                           GE  ++ L + 
Subjt:  DDTKKYCVDCKTTKTPLWRGGPAGPKSLCNACGIRFRKRRVSTIGTNRGCDRKREKAHSHGGSTTAAMSATTSSSATAADAKSNNGGADGEEEEEDLGEC

Query:  GSLRMRLMMALGEEVVVQQNISKQRPPRKLGEEEQAAVSLMALSCGSVFA
        G +R R              + KQR  +KLGEEEQAAV LMALS GSV+A
Subjt:  GSLRMRLMMALGEEVVVQQNISKQRPPRKLGEEEQAAVSLMALSCGSVFA


Sequences
CDS sequence
ATGGCGGCTCTGGCTGCGGCTCCCCCTAGGGTTTTTCCATTTTATCCCTATCCTCTCTCCTCTCTCTCTCACCGCTTTCGATCGGTTTCCATATATTTACTCTCTTTCAC
TCATGATTCTGCTTTCTCTGGTTTCAATTTCAATTTCCTGAATAAAGCTGAAATGGGCATGATGGATGTGTTGAGACGAAAGAATAAGGTGGAACGTGTAGAGGATGATA
CCAAGAAATACTGTGTTGATTGTAAGACTACCAAGACCCCTTTATGGCGTGGAGGCCCCGCTGGACCTAAGTCACTGTGTAACGCATGTGGGATCAGGTTTAGAAAGAGA
AGAGTCTCCACCATTGGAACCAACAGAGGGTGTGACAGAAAGAGAGAAAAGGCTCATAGCCATGGCGGCTCCACCACTGCCGCCATGTCAGCCACCACTTCCTCTAGTGC
CACCGCTGCCGATGCAAAATCCAACAACGGTGGCGCAGATGGGGAGGAGGAAGAAGAGGATTTAGGGGAATGTGGGTCATTGAGGATGAGGCTGATGATGGCGTTGGGGG
AGGAGGTGGTGGTGCAGCAGAATATTTCGAAACAGCGGCCCCCGAGGAAGCTCGGGGAGGAGGAGCAGGCAGCGGTGTCGTTAATGGCACTGTCCTGTGGCTCTGTGTTT
GCCTGA
mRNA sequence
CGGGTTCCAAAATATTGTTTTAACATACAATTTAATTAGAATATAATTTACCATCTATAATATCATATTTAAAGATATATGAATTATTTGTAACATAATTAAAAAATATA
TGAAATATGATAGAAAGTTGGTAATAAGTGGTACTAAAAAAATATATAAATATATACAAGTGGAGGTGGGGATTTTTCAAAAAAAAAAAAAAAGAAAGAAAGAAAGTGGA
GGTGGGAAAAGCGAGTAAAATGGGAAATTGAGAAGTGAGCGCGTGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAATGATCGGATCGGAACAGTTAGAG
AGAAACGTGGAGCGGTGTGTGCAGTGAAAAATGGCTGCTATCGTGTCTCTCAGTGGAAGGCCTTCCTCTGGGCTCTGCAAAATCAATGGCATTTCCGTCATTCCTCCGTT
TAGGGTTTAATGGCGGCTCTGGCTGCGGCTCCCCCTAGGGTTTTTCCATTTTATCCCTATCCTCTCTCCTCTCTCTCTCACCGCTTTCGATCGGTTTCCATATATTTACT
CTCTTTCACTCATGATTCTGCTTTCTCTGGTTTCAATTTCAATTTCCTGAATAAAGCTGAAATGGGCATGATGGATGTGTTGAGACGAAAGAATAAGGTGGAACGTGTAG
AGGATGATACCAAGAAATACTGTGTTGATTGTAAGACTACCAAGACCCCTTTATGGCGTGGAGGCCCCGCTGGACCTAAGTCACTGTGTAACGCATGTGGGATCAGGTTT
AGAAAGAGAAGAGTCTCCACCATTGGAACCAACAGAGGGTGTGACAGAAAGAGAGAAAAGGCTCATAGCCATGGCGGCTCCACCACTGCCGCCATGTCAGCCACCACTTC
CTCTAGTGCCACCGCTGCCGATGCAAAATCCAACAACGGTGGCGCAGATGGGGAGGAGGAAGAAGAGGATTTAGGGGAATGTGGGTCATTGAGGATGAGGCTGATGATGG
CGTTGGGGGAGGAGGTGGTGGTGCAGCAGAATATTTCGAAACAGCGGCCCCCGAGGAAGCTCGGGGAGGAGGAGCAGGCAGCGGTGTCGTTAATGGCACTGTCCTGTGGC
TCTGTGTTTGCCTGAACCAGAAGTAGGAAGAAGAAGAAGGAGGTACAGAAGCAGTAGTAGTATTTAGTATTTTACACATCAAAGTTCTAACCAATACCAATTACCAACCA
AAGAATCACCATCACCTCCTTTTCTGCATATTTTGTCACTTTTTTTTTTTTTTTAACCTCTAAATTAATGAAATGGAAGAGACCCAGAATAGTAGGAATTGGGTTTCTTC
CAAAAATGGTAGTGTTAGGGAAATCAAAAAGAAAGGAGATTTAGGGAGATTCAAAATGATGGTTTTGTGCAGTTGTTGTAGGGCTGTAGTTATTTTTCTTTTCTTTAATT
CATGTGTTCATATGCCACCAAAATGACACAAATCTACATAAGCCTGTTCTCTCACTCTCTTCCCCAAAGAAAAGAAAATTACTATGGTTATAACCCAATTTGGGTTTAAA
TTTTAACCATCTTTTATACGCTGGAGACTTAAATTGTTATAAGTATGACATATCCTTCTTCCACCCACTAATTTCAGTCTTGAATCTCAGTCTCGTCCCTAAAATTTTGG
TACCCTCCCTTGGTTTGCCGTGTCCCTAATAGGAATGTTTTAATTTGGTATTTATAGCTTAGTCAAACTTTATAATTCATTTCTGTCGTCTTATATTATCCAAAAAATTT
TGCATGACAAGGGTTGATTTTTCTATTATTTGGTTTAGTTGATTGTATTATGGAG
Protein sequence
MAALAAAPPRVFPFYPYPLSSLSHRFRSVSIYLLSFTHDSAFSGFNFNFLNKAEMGMMDVLRRKNKVERVEDDTKKYCVDCKTTKTPLWRGGPAGPKSLCNACGIRFRKR
RVSTIGTNRGCDRKREKAHSHGGSTTAAMSATTSSSATAADAKSNNGGADGEEEEEDLGECGSLRMRLMMALGEEVVVQQNISKQRPPRKLGEEEQAAVSLMALSCGSVF
A
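
The CDS and protein sequences in this record can be cross-checked by translating the CDS. The sketch below is one way to do that, assuming the standard genetic code (NCBI translation table 1); `translate` and `CODON_TABLE` are hypothetical names, and only the first 30 nucleotides of the CDS are used here to keep the example short.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDDDEEEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence; '*' marks a stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks from wrapped input
    protein = []
    # Step through complete codons only; a trailing partial codon is ignored.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        protein.append(CODON_TABLE.get(cds[i:i + 3], "X"))  # 'X' = unknown
    return "".join(protein)


cds_start = "ATGGCGGCTCTGGCTGCGGCTCCCCCTAGG"  # first 30 nt of the CDS
print(translate(cds_start))
```

The translated prefix matches the first ten residues of the protein sequence (MAALAAAPPR), and the final CDS codon, TGA, is a stop codon in this table.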