CuGenDBv2

MC05g0458 (gene) of Bitter gourd (Dali-11) v1 genome

Gene ID: MC05g0458
Organism: Momordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
Description: DNA-directed RNA polymerase II subunit 1-like
Genome location: MC05:3367373..3369236
RNA-Seq Expression: MC05g0458
Synteny: MC05g0458
Gene Ontology terms: NA
InterPro domains: NA
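
The genome location field packs the chromosome and both coordinates into one string. A minimal sketch of splitting it apart follows; the function name `parse_location` is hypothetical, and the span calculation assumes the coordinates are 1-based and inclusive, which this record does not state explicitly.

```python
import re
from typing import Tuple


def parse_location(location: str) -> Tuple[str, int, int]:
    """Split a 'CHROM:START..END' string into (chrom, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)


chrom, start, end = parse_location("MC05:3367373..3369236")
# Assumption: 1-based inclusive coordinates, so the span is end - start + 1.
span = end - start + 1
print(chrom, start, end, span)
```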


Homology
GenBank top hits (e value, % identity, alignment)
KAG6597060.1 hypothetical protein SDJN03_10240, partial [Cucurbita argyrosperma subsp. sororia] | e value: 3.21e-69 | % identity: 53.74
Query:  MANLPRFGRTWQRQAVVAPPAPAAVPPSVPTTQTLQPFEPNLPPSASPASSPL-KASPGRR--GPSASPKYRAAAKRVGSSP-AKSARSPPASPATKYSD
        M+N PRFG + QR +++APP    +P S+P    L P EP  P   S  S P    SP  +   P ASP+Y ++  RV S P AK   SPP SPA KY D
Subjt:  MANLPRFGRTWQRQAVVAPPAPAAVPPSVPTTQTLQPFEPNLPPSASPASSPL-KASPGRR--GPSASPKYRAAAKRVGSSP-AKSARSPPASPATKYSD

Query:  RGHAQTTPPLSPAKSRRTATPPLSPLALPRTANGALHSS-----VLPEVEKKSVLYNKTVEKPAKSGRPSEHGSGKPPQ----AEVINLTGHNVGAVMEI
        R  AQT+P  SP++S RT+ PP  P ALP T   A++ +     + PEVEKKS +YNKTVEKPAKS   SE+GSGKP +     E INL GHNVGAVMEI
Subjt:  RGHAQTTPPLSPAKSRRTATPPLSPLALPRTANGALHSS-----VLPEVEKKSVLYNKTVEKPAKSGRPSEHGSGKPPQ----AEVINLTGHNVGAVMEI

Query:  NQSSAKHSAGEIIKKKESETELGGAHHGNDERKTGAKKELPGTAFMNSNFQSVNNSVLYDSSCSHRDPGLHLAFSDTADGGGGAIVDGHNKNYK
        ++S+  H  G        ET  GG   GN+E+K   KKE+P TAFMNSNFQSVNNSVLY SSC+HRDPGLHL F+D ADG G A VDG  KNYK
Subjt:  NQSSAKHSAGEIIKKKESETELGGAHHGNDERKTGAKKELPGTAFMNSNFQSVNNSVLYDSSCSHRDPGLHLAFSDTADGGGGAIVDGHNKNYK

XP_004133797.1 uncharacterized protein LOC101205942 [Cucumis sativus] | e value: 1.04e-73 | % identity: 53
Query:  MANLPRFGRTWQRQAVVAPPAPAAVPPSVPTTQTLQPFEPNLPPSASPASSPLKASPGRRGPS--ASPKYRAA---AKRVGSSPA-KSARSPPASPATKY
        MANLPR GRT QR + V PP PAA  P+V       P+  ++       ++P  ASP R  P   +SP  +A    A RV SSPA K+ RSPP S   KY
Subjt:  MANLPRFGRTWQRQAVVAPPAPAAVPPSVPTTQTLQPFEPNLPPSASPASSPLKASPGRRGPS--ASPKYRAA---AKRVGSSPA-KSARSPPASPATKY

Query:  SDRGHAQTTPPLSPAKSRRTATPPLSPLALPR----TANGAL-HSSVLPEVEKKSVLYNKTV-EKPAKSGRPSEHGSGKPPQAE-----VINLTGHNVGA
         +R + +TTPPL+PAKSR   T PLSPLALPR    T NG      V PEVE K ++YNK   EKP+KS R SEHGSGK    +     V+ L GHNVGA
Subjt:  SDRGHAQTTPPLSPAKSRRTATPPLSPLALPR----TANGAL-HSSVLPEVEKKSVLYNKTV-EKPAKSGRPSEHGSGKPPQAE-----VINLTGHNVGA

Query:  VMEINQSSAKHS-AGEIIKKKESETELGGAHHGNDERKTGAKKELPGTAFMNSNFQSVNNSVLYDSSCSHRDPGLHLAFSDTADGGGGAIVDGHNKNYKP
        VME+N+SSA +   GE +KKKE+E +     +G++++KTG KK+ P +AFMNSNFQSVNNS+L+DSSC HRDPGLHL+F + ADGGG A+VDG +K+YKP
Subjt:  VMEINQSSAKHS-AGEIIKKKESETELGGAHHGNDERKTGAKKELPGTAFMNSNFQSVNNSVLYDSSCSHRDPGLHLAFSDTADGGGGAIVDGHNKNYKP

XP_008437851.1 PREDICTED: zyxin-like [Cucumis melo] | e value: 4.00e-75 | % identity: 55.96
Query:  MANLPRFGRTWQRQAVVAPPAPAAVPPSVPTTQTLQPFEPNL--PPSASP---ASSPLKASPGRR-GPSASPKYRAAAKRVGSSPA-KSARSPPASPATK
        MANLPRFGR  QR   V PP PAA  P+V     + PF   +  P  ASP   +  PL + P +   P ASPKY  +  R+  SPA K+  SPP S   K
Subjt:  MANLPRFGRTWQRQAVVAPPAPAAVPPSVPTTQTLQPFEPNL--PPSASP---ASSPLKASPGRR-GPSASPKYRAAAKRVGSSPA-KSARSPPASPATK

Query:  YSDRGHAQTTPPLSPAKSRRTATPPLSPLALPR----TANGAL-HSSVLPEVEKKSVLYNKTVEKPAKSGRPS-EHGSGKPPQA----EVINLTGHNVGA
        Y +R + +TTPPLSPAKSRR  TPPLSPLALPR    T NG      V PEVE K ++YNKTVEKP+KS R S E+GS K  Q     EVI L GHNVGA
Subjt:  YSDRGHAQTTPPLSPAKSRRTATPPLSPLALPR----TANGAL-HSSVLPEVEKKSVLYNKTVEKPAKSGRPS-EHGSGKPPQA----EVINLTGHNVGA

Query:  VMEINQSSAKHS-AGEIIKKKESETELGGAH-HGNDERKTGAKK-ELPGTAFMNSNFQSVNNSVLYDSSCSHRDPGLHLAFSDTADGGGGAIVDGHNKNY
        VMEIN+SS  +   GE +KK E+E ++G  H +G++++KT  KK E P TAFMNSNFQSVNNS+L+DSSC+HRDPGLHLAF D  DG G AIVDG  K+Y
Subjt:  VMEINQSSAKHS-AGEIIKKKESETELGGAH-HGNDERKTGAKK-ELPGTAFMNSNFQSVNNSVLYDSSCSHRDPGLHLAFSDTADGGGGAIVDGHNKNY

Query:  KP
        KP
Subjt:  KP

XP_022147458.1 vegetative cell wall protein gp1-like [Momordica charantia] | e value: 2.28e-193 | % identity: 100
Query:  MANLPRFGRTWQRQAVVAPPAPAAVPPSVPTTQTLQPFEPNLPPSASPASSPLKASPGRRGPSASPKYRAAAKRVGSSPAKSARSPPASPATKYSDRGHA
        MANLPRFGRTWQRQAVVAPPAPAAVPPSVPTTQTLQPFEPNLPPSASPASSPLKASPGRRGPSASPKYRAAAKRVGSSPAKSARSPPASPATKYSDRGHA
Subjt:  MANLPRFGRTWQRQAVVAPPAPAAVPPSVPTTQTLQPFEPNLPPSASPASSPLKASPGRRGPSASPKYRAAAKRVGSSPAKSARSPPASPATKYSDRGHA

Query:  QTTPPLSPAKSRRTATPPLSPLALPRTANGALHSSVLPEVEKKSVLYNKTVEKPAKSGRPSEHGSGKPPQAEVINLTGHNVGAVMEINQSSAKHSAGEII
        QTTPPLSPAKSRRTATPPLSPLALPRTANGALHSSVLPEVEKKSVLYNKTVEKPAKSGRPSEHGSGKPPQAEVINLTGHNVGAVMEINQSSAKHSAGEII
Subjt:  QTTPPLSPAKSRRTATPPLSPLALPRTANGALHSSVLPEVEKKSVLYNKTVEKPAKSGRPSEHGSGKPPQAEVINLTGHNVGAVMEINQSSAKHSAGEII

Query:  KKKESETELGGAHHGNDERKTGAKKELPGTAFMNSNFQSVNNSVLYDSSCSHRDPGLHLAFSDTADGGGGAIVDGHNKNYKP
        KKKESETELGGAHHGNDERKTGAKKELPGTAFMNSNFQSVNNSVLYDSSCSHRDPGLHLAFSDTADGGGGAIVDGHNKNYKP
Subjt:  KKKESETELGGAHHGNDERKTGAKKELPGTAFMNSNFQSVNNSVLYDSSCSHRDPGLHLAFSDTADGGGGAIVDGHNKNYKP

XP_038879417.1 proline-rich receptor-like protein kinase PERK8 [Benincasa hispida] | e value: 1.21e-80 | % identity: 55.19
Query:  MANLPRFGRTWQRQAVVAPPAPAAVPPSV----------PTTQTLQPFEPNLPPSASPA--SSPLKASPGRRG--PSASPKYRAAAKRVGSSPA-KSARS
        MANLPR GRT QR + VAPP  AAV P+           PT+   QP EP  P  ASP   S  L  SP ++   P ASPKY  +  RV S PA K+ RS
Subjt:  MANLPRFGRTWQRQAVVAPPAPAAVPPSV----------PTTQTLQPFEPNLPPSASPA--SSPLKASPGRRG--PSASPKYRAAAKRVGSSPA-KSARS

Query:  PPASPATKYSDRGHAQTTPPLSPAKSRRTATPPLSPLALPRTANGALHSSVL-----PEVEKKSVLYNKTVEKPAKSGRPSEHGSGKPPQ-----AEVIN
        PP SP  KY +  + +TTPPLSPAKSRRT TPPLSPL LPRT   +   +       P VE K ++YNK VEKP K+ RPSE+GSGKP Q     AEVIN
Subjt:  PPASPATKYSDRGHAQTTPPLSPAKSRRTATPPLSPLALPRTANGALHSSVL-----PEVEKKSVLYNKTVEKPAKSGRPSEHGSGKPPQ-----AEVIN

Query:  LTGHNVGAVMEINQSSAKHS-AGEIIKKKESETELGGAHHGNDERKTGAKKELPGTAFMNSNFQSVNNSVLYDSSCSHRDPGLHLAFSDTADGGGGAIVD
        L GHNVGAVMEIN+SS  +   GE +K KE  T+ GG HHG+ E+  GAK   P TAFMN+NFQS+NNS+LYDSSC+H DPGLHL+  ++ DG G A V 
Subjt:  LTGHNVGAVMEINQSSAKHS-AGEIIKKKESETELGGAHHGNDERKTGAKKELPGTAFMNSNFQSVNNSVLYDSSCSHRDPGLHLAFSDTADGGGGAIVD

Query:  GHNKNYKP
        GH K+YKP
Subjt:  GHNKNYKP

TrEMBL top hits (e value, % identity, alignment)
A0A0A0L320 Uncharacterized protein | e value: 5.04e-74 | % identity: 53
Query:  MANLPRFGRTWQRQAVVAPPAPAAVPPSVPTTQTLQPFEPNLPPSASPASSPLKASPGRRGPS--ASPKYRAA---AKRVGSSPA-KSARSPPASPATKY
        MANLPR GRT QR + V PP PAA  P+V       P+  ++       ++P  ASP R  P   +SP  +A    A RV SSPA K+ RSPP S   KY
Subjt:  MANLPRFGRTWQRQAVVAPPAPAAVPPSVPTTQTLQPFEPNLPPSASPASSPLKASPGRRGPS--ASPKYRAA---AKRVGSSPA-KSARSPPASPATKY

Query:  SDRGHAQTTPPLSPAKSRRTATPPLSPLALPR----TANGAL-HSSVLPEVEKKSVLYNKTV-EKPAKSGRPSEHGSGKPPQAE-----VINLTGHNVGA
         +R + +TTPPL+PAKSR   T PLSPLALPR    T NG      V PEVE K ++YNK   EKP+KS R SEHGSGK    +     V+ L GHNVGA
Subjt:  SDRGHAQTTPPLSPAKSRRTATPPLSPLALPR----TANGAL-HSSVLPEVEKKSVLYNKTV-EKPAKSGRPSEHGSGKPPQAE-----VINLTGHNVGA

Query:  VMEINQSSAKHS-AGEIIKKKESETELGGAHHGNDERKTGAKKELPGTAFMNSNFQSVNNSVLYDSSCSHRDPGLHLAFSDTADGGGGAIVDGHNKNYKP
        VME+N+SSA +   GE +KKKE+E +     +G++++KTG KK+ P +AFMNSNFQSVNNS+L+DSSC HRDPGLHL+F + ADGGG A+VDG +K+YKP
Subjt:  VMEINQSSAKHS-AGEIIKKKESETELGGAHHGNDERKTGAKKELPGTAFMNSNFQSVNNSVLYDSSCSHRDPGLHLAFSDTADGGGGAIVDGHNKNYKP

A0A1S3AV38 zyxin-like | e value: 1.94e-75 | % identity: 55.96
Query:  MANLPRFGRTWQRQAVVAPPAPAAVPPSVPTTQTLQPFEPNL--PPSASP---ASSPLKASPGRR-GPSASPKYRAAAKRVGSSPA-KSARSPPASPATK
        MANLPRFGR  QR   V PP PAA  P+V     + PF   +  P  ASP   +  PL + P +   P ASPKY  +  R+  SPA K+  SPP S   K
Subjt:  MANLPRFGRTWQRQAVVAPPAPAAVPPSVPTTQTLQPFEPNL--PPSASP---ASSPLKASPGRR-GPSASPKYRAAAKRVGSSPA-KSARSPPASPATK

Query:  YSDRGHAQTTPPLSPAKSRRTATPPLSPLALPR----TANGAL-HSSVLPEVEKKSVLYNKTVEKPAKSGRPS-EHGSGKPPQA----EVINLTGHNVGA
        Y +R + +TTPPLSPAKSRR  TPPLSPLALPR    T NG      V PEVE K ++YNKTVEKP+KS R S E+GS K  Q     EVI L GHNVGA
Subjt:  YSDRGHAQTTPPLSPAKSRRTATPPLSPLALPR----TANGAL-HSSVLPEVEKKSVLYNKTVEKPAKSGRPS-EHGSGKPPQA----EVINLTGHNVGA

Query:  VMEINQSSAKHS-AGEIIKKKESETELGGAH-HGNDERKTGAKK-ELPGTAFMNSNFQSVNNSVLYDSSCSHRDPGLHLAFSDTADGGGGAIVDGHNKNY
        VMEIN+SS  +   GE +KK E+E ++G  H +G++++KT  KK E P TAFMNSNFQSVNNS+L+DSSC+HRDPGLHLAF D  DG G AIVDG  K+Y
Subjt:  VMEINQSSAKHS-AGEIIKKKESETELGGAH-HGNDERKTGAKK-ELPGTAFMNSNFQSVNNSVLYDSSCSHRDPGLHLAFSDTADGGGGAIVDGHNKNY

Query:  KP
        KP
Subjt:  KP

A0A5A7TZ24 Zyxin-like | e value: 1.94e-75 | % identity: 55.96
Query:  MANLPRFGRTWQRQAVVAPPAPAAVPPSVPTTQTLQPFEPNL--PPSASP---ASSPLKASPGRR-GPSASPKYRAAAKRVGSSPA-KSARSPPASPATK
        MANLPRFGR  QR   V PP PAA  P+V     + PF   +  P  ASP   +  PL + P +   P ASPKY  +  R+  SPA K+  SPP S   K
Subjt:  MANLPRFGRTWQRQAVVAPPAPAAVPPSVPTTQTLQPFEPNL--PPSASP---ASSPLKASPGRR-GPSASPKYRAAAKRVGSSPA-KSARSPPASPATK

Query:  YSDRGHAQTTPPLSPAKSRRTATPPLSPLALPR----TANGAL-HSSVLPEVEKKSVLYNKTVEKPAKSGRPS-EHGSGKPPQA----EVINLTGHNVGA
        Y +R + +TTPPLSPAKSRR  TPPLSPLALPR    T NG      V PEVE K ++YNKTVEKP+KS R S E+GS K  Q     EVI L GHNVGA
Subjt:  YSDRGHAQTTPPLSPAKSRRTATPPLSPLALPR----TANGAL-HSSVLPEVEKKSVLYNKTVEKPAKSGRPS-EHGSGKPPQA----EVINLTGHNVGA

Query:  VMEINQSSAKHS-AGEIIKKKESETELGGAH-HGNDERKTGAKK-ELPGTAFMNSNFQSVNNSVLYDSSCSHRDPGLHLAFSDTADGGGGAIVDGHNKNY
        VMEIN+SS  +   GE +KK E+E ++G  H +G++++KT  KK E P TAFMNSNFQSVNNS+L+DSSC+HRDPGLHLAF D  DG G AIVDG  K+Y
Subjt:  VMEINQSSAKHS-AGEIIKKKESETELGGAH-HGNDERKTGAKK-ELPGTAFMNSNFQSVNNSVLYDSSCSHRDPGLHLAFSDTADGGGGAIVDGHNKNY

Query:  KP
        KP
Subjt:  KP

A0A6J1D1D2 vegetative cell wall protein gp1-like | e value: 1.10e-193 | % identity: 100
Query:  MANLPRFGRTWQRQAVVAPPAPAAVPPSVPTTQTLQPFEPNLPPSASPASSPLKASPGRRGPSASPKYRAAAKRVGSSPAKSARSPPASPATKYSDRGHA
        MANLPRFGRTWQRQAVVAPPAPAAVPPSVPTTQTLQPFEPNLPPSASPASSPLKASPGRRGPSASPKYRAAAKRVGSSPAKSARSPPASPATKYSDRGHA
Subjt:  MANLPRFGRTWQRQAVVAPPAPAAVPPSVPTTQTLQPFEPNLPPSASPASSPLKASPGRRGPSASPKYRAAAKRVGSSPAKSARSPPASPATKYSDRGHA

Query:  QTTPPLSPAKSRRTATPPLSPLALPRTANGALHSSVLPEVEKKSVLYNKTVEKPAKSGRPSEHGSGKPPQAEVINLTGHNVGAVMEINQSSAKHSAGEII
        QTTPPLSPAKSRRTATPPLSPLALPRTANGALHSSVLPEVEKKSVLYNKTVEKPAKSGRPSEHGSGKPPQAEVINLTGHNVGAVMEINQSSAKHSAGEII
Subjt:  QTTPPLSPAKSRRTATPPLSPLALPRTANGALHSSVLPEVEKKSVLYNKTVEKPAKSGRPSEHGSGKPPQAEVINLTGHNVGAVMEINQSSAKHSAGEII

Query:  KKKESETELGGAHHGNDERKTGAKKELPGTAFMNSNFQSVNNSVLYDSSCSHRDPGLHLAFSDTADGGGGAIVDGHNKNYKP
        KKKESETELGGAHHGNDERKTGAKKELPGTAFMNSNFQSVNNSVLYDSSCSHRDPGLHLAFSDTADGGGGAIVDGHNKNYKP
Subjt:  KKKESETELGGAHHGNDERKTGAKKELPGTAFMNSNFQSVNNSVLYDSSCSHRDPGLHLAFSDTADGGGGAIVDGHNKNYKP

A0A6J1GHE5 sulfated surface glycoprotein 185-like | e value: 1.37e-66 | % identity: 51.49
Query:  MANLPRFGRTWQRQAVVAPPAPAAVPPSVPTTQTLQPF---EPNLPPSASPASSPLKASPGRRGPSASPKYRAAAKRVGSSPA-KSARSPPASPATKYSD
        M+N PRFG + QR ++ APP      P+       QPF     +L P+  P S           P ASPKY  +  RV S P  K   SPP SPA KY D
Subjt:  MANLPRFGRTWQRQAVVAPPAPAAVPPSVPTTQTLQPF---EPNLPPSASPASSPLKASPGRRGPSASPKYRAAAKRVGSSPA-KSARSPPASPATKYSD

Query:  RGHAQTTPPLSPAKSRRTATPPLSP-LALPRTANGALHSS-----VLPEVEKKSVLYNKTVEKPAKSGRPSEHGSGKPPQ----AEVINLTGHNVGAVME
        R  +QT+P  SP++S RT+ PP  P LALP T   A++ +     + PEVEKKS++YNKTVEK  KS RPSE+GSGKP +     E INL GHNVGAVME
Subjt:  RGHAQTTPPLSPAKSRRTATPPLSP-LALPRTANGALHSS-----VLPEVEKKSVLYNKTVEKPAKSGRPSEHGSGKPPQ----AEVINLTGHNVGAVME

Query:  INQSSAKHS-AGEIIKKKESETELGGAHHGNDERKTGAKKE-------LPGTAFMNSNFQSVNNSVLYDSSCSHRDPGLHLAFSDTADGGGGAIVDGHNK
        I++SS  H   GE ++K E+E   GG   GN+E+K   KK+       +P TAFMNSNFQSVNNSVLYDSSCSHRDPGLHL F+D ADG G A VDG  K
Subjt:  INQSSAKHS-AGEIIKKKESETELGGAHHGNDERKTGAKKE-------LPGTAFMNSNFQSVNNSVLYDSSCSHRDPGLHLAFSDTADGGGGAIVDGHNK

Query:  NYK
        +YK
Subjt:  NYK

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT2G46630.1 unknown protein | e value: 2.8e-08 | % identity: 28.4
Query:  QRQAVVAPPAPAAVPPSVPTTQTLQPFEPNLPPS--ASPASSPLKASPGRRGPSASPKYRAAAKRVGSS-PAKSARSPPASPATKYSDRGHAQTTPPLSP
        Q+Q  + PP   A P S P  Q   P+  + PPS   SP + P  A+P    P +S     + K V  + P +   SPP+   +  S    +  T   S 
Subjt:  QRQAVVAPPAPAAVPPSVPTTQTLQPFEPNLPPS--ASPASSPLKASPGRRGPSASPKYRAAAKRVGSS-PAKSARSPPASPATKYSDRGHAQTTPPLSP

Query:  AKSRRTATPP--LSPLALPRTANGALHSSVLPEVEKKSVL---------------------------------YNKTV----EKPAKSGR-PSEHGSGKP
        +++ R A  P  LSP +LP +    LHS    E  +K++L                                 YN+        P K  R PS   S   
Subjt:  AKSRRTATPP--LSPLALPRTANGALHSSVLPEVEKKSVL---------------------------------YNKTV----EKPAKSGR-PSEHGSGKP

Query:  PQAEVINLTGHNVGAVMEI--------------NQSSAKHSAGEIIKKKESETELGGAHHGNDERKT------GAKKELPGTAFMNSNFQSVNNSVLYDS
            VI + G N GAVMEI              + S   H  GE  ++ +S +    +  G  ++KT           LP  AFMNSN Q +NNS++Y+S
Subjt:  PQAEVINLTGHNVGAVMEI--------------NQSSAKHSAGEIIKKKESETELGGAHHGNDERKT------GAKKELPGTAFMNSNFQSVNNSVLYDS

Query:  SCSHRDPGLHLAFSDTADGGGGAIVDGHNKN
        + SH DPG+HL  S       G  V  +  N
Subjt:  SCSHRDPGLHLAFSDTADGGGGAIVDGHNKN
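
The % identity values above relate to the unlabelled middle line of each alignment block. As a rough sketch, assuming the usual BLAST-style convention (a letter on the middle line marks an identical residue, `+` a conservative substitution, a blank a mismatch or gap), identity for a single block can be estimated as below. The function name `percent_identity` is hypothetical, and the reported figures are computed over the whole alignment, so a single block will generally not reproduce them.

```python
def percent_identity(query: str, midline: str) -> float:
    """Estimate % identity for one alignment block.

    Assumption: letters on the midline mark identical residues, while
    '+' and blanks mark non-identical columns. The block length is taken
    from the query line, because trailing blanks on the midline may have
    been stripped.
    """
    if not query:
        raise ValueError("empty alignment block")
    identities = sum(1 for ch in midline[:len(query)] if ch.isalpha())
    return 100.0 * identities / len(query)


# Final block of the A0A6J1GHE5 alignment: query 'NYK', midline '+YK'.
print(round(percent_identity("NYK", "+YK"), 2))
```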


Sequences
CDS sequence
ATGGCAAATCTTCCTCGATTCGGCCGTACATGGCAACGTCAGGCTGTGGTTGCGCCGCCAGCTCCCGCCGCCGTGCCGCCATCAGTTCCGACCACCCAAACTCTGCAACC
TTTCGAACCCAACCTGCCGCCATCGGCTTCTCCTGCGTCTTCCCCTTTGAAAGCATCTCCCGGCCGGCGAGGTCCTTCCGCGTCGCCGAAATACAGAGCTGCCGCCAAAC
GTGTGGGTAGTTCGCCGGCGAAGTCCGCACGATCGCCGCCGGCGTCTCCTGCAACCAAATATTCCGACCGGGGACATGCCCAAACAACCCCACCTCTCTCGCCGGCCAAG
TCCCGGCGAACAGCAACGCCGCCACTTTCTCCTCTTGCTCTTCCACGTACCGCAAATGGCGCGCTCCATTCTAGCGTGTTGCCGGAGGTGGAGAAGAAAAGTGTTCTGTA
CAACAAGACCGTCGAGAAGCCGGCGAAGTCTGGCCGGCCGTCGGAGCACGGCTCCGGTAAGCCGCCGCAGGCGGAGGTTATAAACCTCACCGGACACAACGTAGGCGCCG
TCATGGAAATAAATCAGTCCTCCGCCAAACATTCCGCCGGTGAAATCATAAAAAAGAAAGAATCTGAAACGGAATTAGGTGGGGCCCACCACGGAAATGATGAGAGGAAG
ACAGGGGCAAAAAAGGAATTGCCGGGGACTGCGTTCATGAACAGCAACTTTCAGAGTGTGAACAATTCCGTGCTGTACGACTCGTCGTGCAGCCACCGTGATCCCGGCCT
GCATCTCGCCTTCTCCGACACGGCGGACGGCGGCGGCGGAGCCATTGTTGACGGTCATAACAAGAATTACAAGCCCTAG
mRNA sequence
TAATTTTTTAAAATTTTTGATTGTGTGTGGAGAATCACTTGATCTTCTCACAGAATTTTCAATTATTTTCTCCCTCAAGCTTCATCTCATATTGCAACTTCCAGCAATAT
AAATAATGCAAAGACAAAGTTTCTCAAATATCTGCCTTTAATTTTTCTATTCTTGGTGGAATTTCATTTTCCTTGTTGCCAAACACTCCCAATTCGTTCATTATGGCAAA
TCTTCCTCGATTCGGCCGTACATGGCAACGTCAGGCTGTGGTTGCGCCGCCAGCTCCCGCCGCCGTGCCGCCATCAGTTCCGACCACCCAAACTCTGCAACCTTTCGAAC
CCAACCTGCCGCCATCGGCTTCTCCTGCGTCTTCCCCTTTGAAAGCATCTCCCGGCCGGCGAGGTCCTTCCGCGTCGCCGAAATACAGAGCTGCCGCCAAACGTGTGGGT
AGTTCGCCGGCGAAGTCCGCACGATCGCCGCCGGCGTCTCCTGCAACCAAATATTCCGACCGGGGACATGCCCAAACAACCCCACCTCTCTCGCCGGCCAAGTCCCGGCG
AACAGCAACGCCGCCACTTTCTCCTCTTGCTCTTCCACGTACCGCAAATGGCGCGCTCCATTCTAGCGTGTTGCCGGAGGTGGAGAAGAAAAGTGTTCTGTACAACAAGA
CCGTCGAGAAGCCGGCGAAGTCTGGCCGGCCGTCGGAGCACGGCTCCGGTAAGCCGCCGCAGGCGGAGGTTATAAACCTCACCGGACACAACGTAGGCGCCGTCATGGAA
ATAAATCAGTCCTCCGCCAAACATTCCGCCGGTGAAATCATAAAAAAGAAAGAATCTGAAACGGAATTAGGTGGGGCCCACCACGGAAATGATGAGAGGAAGACAGGGGC
AAAAAAGGAATTGCCGGGGACTGCGTTCATGAACAGCAACTTTCAGAGTGTGAACAATTCCGTGCTGTACGACTCGTCGTGCAGCCACCGTGATCCCGGCCTGCATCTCG
CCTTCTCCGACACGGCGGACGGCGGCGGCGGAGCCATTGTTGACGGTCATAACAAGAATTACAAGCCCTAGAGGACGATGGTCGAAGCGAAAGAGTACTGTTTTATAAGA
AATAATAAGCCAATTAAATGGGTTAACGTAATATTTTAAAAAATATACATAATTAATAATTATTATATGTGAAAAAGGGCTGTGATCTTCATAGTCTTCCCCTTGGCTGT
TAATTATTTAGCTTCCTGAAAATTATGCCACAAAAATTGAAAATAAT
Protein sequence
MANLPRFGRTWQRQAVVAPPAPAAVPPSVPTTQTLQPFEPNLPPSASPASSPLKASPGRRGPSASPKYRAAAKRVGSSPAKSARSPPASPATKYSDRGHAQTTPPLSPAK
SRRTATPPLSPLALPRTANGALHSSVLPEVEKKSVLYNKTVEKPAKSGRPSEHGSGKPPQAEVINLTGHNVGAVMEINQSSAKHSAGEIIKKKESETELGGAHHGNDERK
TGAKKELPGTAFMNSNFQSVNNSVLYDSSCSHRDPGLHLAFSDTADGGGGAIVDGHNKNYKP
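
The protein sequence is the conceptual translation of the CDS sequence. The sketch below shows how that relationship could be checked, assuming the standard genetic code (NCBI translation table 1) and a CDS that starts in frame. The names `CODON_TABLE` and `translate` are illustrative and are not part of any CuGenDB tooling.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame CDS, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First nine codons of the CDS above.
print(translate("ATGGCAAATCTTCCTCGATTCGGCCGT"))
```

Joining the CDS lines into one string and passing it to `translate` should, under these assumptions, give the protein sequence listed above.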