CuGenDBv2

MS005917 (gene) of Bitter gourd (TR) v1 genome

Gene ID: MS005917
Organism: Momordica charantia cv. TR (Bitter gourd (TR) v1)
Description: DNA-directed RNA polymerase II subunit 1-like
Genome location: scaffold254:2409504..2411029
RNA-Seq Expression: MS005917
Synteny: MS005917
Gene Ontology terms: NA
InterPro domains: NA


Homology
GenBank top hits (e-value, % identity, alignment)
KAG6597060.1 hypothetical protein SDJN03_10240, partial [Cucurbita argyrosperma subsp. sororia]; e-value: 1.7e-55; % identity: 54.08
Query:  MANLPRFGRTWQRQAVVAPPAPAAVPPSVPTTQTLQPFEPNLPPSASPASSP-LKASPGRR--GPSASPKYRAAATRVGS-SPAKSARSPPASPATKYSD
        M+N PRFG + QR +++APP    +P S+P    L P EP  P   S  S P    SP  +   P ASP+Y ++ TRV S  PAK   SPP SPA KY D
Subjt:  MANLPRFGRTWQRQAVVAPPAPAAVPPSVPTTQTLQPFEPNLPPSASPASSP-LKASPGRR--GPSASPKYRAAATRVGS-SPAKSARSPPASPATKYSD

Query:  RGHAQTTPPLSPAKSRRTATPPLSPLALPRTANGALHSS-----VLPEVEKKSVLYNKTVEKPAKSGRPSEHGSGKPPQ----AEVINLTGHNVGAVMEI
        R  AQT+P  SP++S RT+ PP  P ALP T   A++ +     + PEVEKKS +YNKTVEKPAKS   SE+GSGKP +     E INL GHNVGAVMEI
Subjt:  RGHAQTTPPLSPAKSRRTATPPLSPLALPRTANGALHSS-----VLPEVEKKSVLYNKTVEKPAKSGRPSEHGSGKPPQ----AEVINLTGHNVGAVMEI

Query:  NQSSAKHSAGEIIKKKESETELGGAHHGNDERKTGAKKGLPGTAFMNSNFQSVNNSVLYDSSCSHRDPGLHLAFSDTADGGDGAIVDGHNKNYK
        ++S+  H  G        ET  GG   GN+E+K   KK +P TAFMNSNFQSVNNSVLY SSC+HRDPGLHL F+D AD GDGA VDG  KNYK
Subjt:  NQSSAKHSAGEIIKKKESETELGGAHHGNDERKTGAKKGLPGTAFMNSNFQSVNNSVLYDSSCSHRDPGLHLAFSDTADGGDGAIVDGHNKNYK

XP_004133797.1 uncharacterized protein LOC101205942 [Cucumis sativus]; e-value: 3.7e-58; % identity: 53.82
Query:  MANLPRFGRTWQRQAVVAPPAPAAVPPSV-PTTQTLQPFEPNLPPSASPASSPLKASPGRRGPS--ASPKYRAA---ATRVGSSP-AKSARSPPASPATK
        MANLPR GRT QR + V PP PAA  P+V P  +T+        P A+  ++P  ASP R  P   +SP  +A    A+RV SSP AK+ RSPP S   K
Subjt:  MANLPRFGRTWQRQAVVAPPAPAAVPPSV-PTTQTLQPFEPNLPPSASPASSPLKASPGRRGPS--ASPKYRAA---ATRVGSSP-AKSARSPPASPATK

Query:  YSDRGHAQTTPPLSPAKSRRTATPPLSPLALPR----TANG-ALHSSVLPEVEKKSVLYNKTV-EKPAKSGRPSEHGSGKPPQAE-----VINLTGHNVG
        Y +R + +TTPPL+PAKSR   T PLSPLALPR    T NG      V PEVE K ++YNK   EKP+KS R SEHGSGK    +     V+ L GHNVG
Subjt:  YSDRGHAQTTPPLSPAKSRRTATPPLSPLALPR----TANG-ALHSSVLPEVEKKSVLYNKTV-EKPAKSGRPSEHGSGKPPQAE-----VINLTGHNVG

Query:  AVMEINQSSAKHS-AGEIIKKKESETELGGAHHGNDERKTGAKKGLPGTAFMNSNFQSVNNSVLYDSSCSHRDPGLHLAFSDTADGGDGAIVDGHNKNYK
        AVME+N+SSA +   GE +KKKE+E +     +G++++KTG KK  P +AFMNSNFQSVNNS+L+DSSC HRDPGLHL+F + ADGG GA+VDG +K+YK
Subjt:  AVMEINQSSAKHS-AGEIIKKKESETELGGAHHGNDERKTGAKKGLPGTAFMNSNFQSVNNSVLYDSSCSHRDPGLHLAFSDTADGGDGAIVDGHNKNYK

Query:  P
        P
Subjt:  P

XP_008437851.1 PREDICTED: zyxin-like [Cucumis melo]; e-value: 1.3e-60; % identity: 56.29
Query:  MANLPRFGRTWQRQAVVAPPAPAAVPPSVPTTQTLQPFEPNL--PPSASP---ASSPLKASPGR-RGPSASPKYRAAATRVGSSP-AKSARSPPASPATK
        MANLPRFGR  QR   V PP PAA  P+V     + PF   +  P  ASP   +  PL + P +   P ASPKY  + TR+  SP AK+  SPP S   K
Subjt:  MANLPRFGRTWQRQAVVAPPAPAAVPPSVPTTQTLQPFEPNL--PPSASP---ASSPLKASPGR-RGPSASPKYRAAATRVGSSP-AKSARSPPASPATK

Query:  YSDRGHAQTTPPLSPAKSRRTATPPLSPLALPR----TANG-ALHSSVLPEVEKKSVLYNKTVEKPAKSGRPS-EHGSGKPPQ----AEVINLTGHNVGA
        Y +R + +TTPPLSPAKSRR  TPPLSPLALPR    T NG      V PEVE K ++YNKTVEKP+KS R S E+GS K  Q     EVI L GHNVGA
Subjt:  YSDRGHAQTTPPLSPAKSRRTATPPLSPLALPR----TANG-ALHSSVLPEVEKKSVLYNKTVEKPAKSGRPS-EHGSGKPPQ----AEVINLTGHNVGA

Query:  VMEINQSSAKHS-AGEIIKKKESETELGGAH-HGNDERKTGAKKGLPG-TAFMNSNFQSVNNSVLYDSSCSHRDPGLHLAFSDTADGGDGAIVDGHNKNY
        VMEIN+SS  +   GE +KK E+E ++G  H +G++++KT  KK  P  TAFMNSNFQSVNNS+L+DSSC+HRDPGLHLAF D  D GDGAIVDG  K+Y
Subjt:  VMEINQSSAKHS-AGEIIKKKESETELGGAH-HGNDERKTGAKKGLPG-TAFMNSNFQSVNNSVLYDSSCSHRDPGLHLAFSDTADGGDGAIVDGHNKNY

Query:  KP
        KP
Subjt:  KP

XP_022147458.1 vegetative cell wall protein gp1-like [Momordica charantia]; e-value: 2.5e-147; % identity: 98.94
Query:  MANLPRFGRTWQRQAVVAPPAPAAVPPSVPTTQTLQPFEPNLPPSASPASSPLKASPGRRGPSASPKYRAAATRVGSSPAKSARSPPASPATKYSDRGHA
        MANLPRFGRTWQRQAVVAPPAPAAVPPSVPTTQTLQPFEPNLPPSASPASSPLKASPGRRGPSASPKYRAAA RVGSSPAKSARSPPASPATKYSDRGHA
Subjt:  MANLPRFGRTWQRQAVVAPPAPAAVPPSVPTTQTLQPFEPNLPPSASPASSPLKASPGRRGPSASPKYRAAATRVGSSPAKSARSPPASPATKYSDRGHA

Query:  QTTPPLSPAKSRRTATPPLSPLALPRTANGALHSSVLPEVEKKSVLYNKTVEKPAKSGRPSEHGSGKPPQAEVINLTGHNVGAVMEINQSSAKHSAGEII
        QTTPPLSPAKSRRTATPPLSPLALPRTANGALHSSVLPEVEKKSVLYNKTVEKPAKSGRPSEHGSGKPPQAEVINLTGHNVGAVMEINQSSAKHSAGEII
Subjt:  QTTPPLSPAKSRRTATPPLSPLALPRTANGALHSSVLPEVEKKSVLYNKTVEKPAKSGRPSEHGSGKPPQAEVINLTGHNVGAVMEINQSSAKHSAGEII

Query:  KKKESETELGGAHHGNDERKTGAKKGLPGTAFMNSNFQSVNNSVLYDSSCSHRDPGLHLAFSDTADGGDGAIVDGHNKNYKP
        KKKESETELGGAHHGNDERKTGAKK LPGTAFMNSNFQSVNNSVLYDSSCSHRDPGLHLAFSDTADGG GAIVDGHNKNYKP
Subjt:  KKKESETELGGAHHGNDERKTGAKKGLPGTAFMNSNFQSVNNSVLYDSSCSHRDPGLHLAFSDTADGGDGAIVDGHNKNYKP

XP_038879417.1 proline-rich receptor-like protein kinase PERK8 [Benincasa hispida]; e-value: 2.4e-65; % identity: 55.84
Query:  MANLPRFGRTWQRQAVVAPPAPAAVPPS----------VPTTQTLQPFEPNLPPSASP--ASSPLKASPGRR--GPSASPKYRAAATRVGSSP-AKSARS
        MANLPR GRT QR + VAPP  AAV P+           PT+   QP EP  P  ASP   S  L  SP ++   P ASPKY  + TRV S P AK+ RS
Subjt:  MANLPRFGRTWQRQAVVAPPAPAAVPPS----------VPTTQTLQPFEPNLPPSASP--ASSPLKASPGRR--GPSASPKYRAAATRVGSSP-AKSARS

Query:  PPASPATKYSDRGHAQTTPPLSPAKSRRTATPPLSPLALPRTANGALHSSVL-----PEVEKKSVLYNKTVEKPAKSGRPSEHGSGKPPQ-----AEVIN
        PP SP  KY +  + +TTPPLSPAKSRRT TPPLSPL LPRT   +   +       P VE K ++YNK VEKP K+ RPSE+GSGKP Q     AEVIN
Subjt:  PPASPATKYSDRGHAQTTPPLSPAKSRRTATPPLSPLALPRTANGALHSSVL-----PEVEKKSVLYNKTVEKPAKSGRPSEHGSGKPPQ-----AEVIN

Query:  LTGHNVGAVMEINQSSAKHS-AGEIIKKKESETELGGAHHGNDERKTGAKKGLPGTAFMNSNFQSVNNSVLYDSSCSHRDPGLHLAFSDTADGGDGAIVD
        L GHNVGAVMEIN+SS  +   GE +K K  ET+ GG HHG+ E+  GAK   P TAFMN+NFQS+NNS+LYDSSC+H DPGLHL+  ++ D GDGA V 
Subjt:  LTGHNVGAVMEINQSSAKHS-AGEIIKKKESETELGGAHHGNDERKTGAKKGLPGTAFMNSNFQSVNNSVLYDSSCSHRDPGLHLAFSDTADGGDGAIVD

Query:  GHNKNYKP
        GH K+YKP
Subjt:  GHNKNYKP

TrEMBL top hits (e-value, % identity, alignment)
A0A0A0L320 Uncharacterized protein; e-value: 1.8e-58; % identity: 53.82
Query:  MANLPRFGRTWQRQAVVAPPAPAAVPPSV-PTTQTLQPFEPNLPPSASPASSPLKASPGRRGPS--ASPKYRAA---ATRVGSSP-AKSARSPPASPATK
        MANLPR GRT QR + V PP PAA  P+V P  +T+        P A+  ++P  ASP R  P   +SP  +A    A+RV SSP AK+ RSPP S   K
Subjt:  MANLPRFGRTWQRQAVVAPPAPAAVPPSV-PTTQTLQPFEPNLPPSASPASSPLKASPGRRGPS--ASPKYRAA---ATRVGSSP-AKSARSPPASPATK

Query:  YSDRGHAQTTPPLSPAKSRRTATPPLSPLALPR----TANG-ALHSSVLPEVEKKSVLYNKTV-EKPAKSGRPSEHGSGKPPQAE-----VINLTGHNVG
        Y +R + +TTPPL+PAKSR   T PLSPLALPR    T NG      V PEVE K ++YNK   EKP+KS R SEHGSGK    +     V+ L GHNVG
Subjt:  YSDRGHAQTTPPLSPAKSRRTATPPLSPLALPR----TANG-ALHSSVLPEVEKKSVLYNKTV-EKPAKSGRPSEHGSGKPPQAE-----VINLTGHNVG

Query:  AVMEINQSSAKHS-AGEIIKKKESETELGGAHHGNDERKTGAKKGLPGTAFMNSNFQSVNNSVLYDSSCSHRDPGLHLAFSDTADGGDGAIVDGHNKNYK
        AVME+N+SSA +   GE +KKKE+E +     +G++++KTG KK  P +AFMNSNFQSVNNS+L+DSSC HRDPGLHL+F + ADGG GA+VDG +K+YK
Subjt:  AVMEINQSSAKHS-AGEIIKKKESETELGGAHHGNDERKTGAKKGLPGTAFMNSNFQSVNNSVLYDSSCSHRDPGLHLAFSDTADGGDGAIVDGHNKNYK

Query:  P
        P
Subjt:  P

A0A1S3AV38 zyxin-like; e-value: 6.5e-61; % identity: 56.29
Query:  MANLPRFGRTWQRQAVVAPPAPAAVPPSVPTTQTLQPFEPNL--PPSASP---ASSPLKASPGR-RGPSASPKYRAAATRVGSSP-AKSARSPPASPATK
        MANLPRFGR  QR   V PP PAA  P+V     + PF   +  P  ASP   +  PL + P +   P ASPKY  + TR+  SP AK+  SPP S   K
Subjt:  MANLPRFGRTWQRQAVVAPPAPAAVPPSVPTTQTLQPFEPNL--PPSASP---ASSPLKASPGR-RGPSASPKYRAAATRVGSSP-AKSARSPPASPATK

Query:  YSDRGHAQTTPPLSPAKSRRTATPPLSPLALPR----TANG-ALHSSVLPEVEKKSVLYNKTVEKPAKSGRPS-EHGSGKPPQ----AEVINLTGHNVGA
        Y +R + +TTPPLSPAKSRR  TPPLSPLALPR    T NG      V PEVE K ++YNKTVEKP+KS R S E+GS K  Q     EVI L GHNVGA
Subjt:  YSDRGHAQTTPPLSPAKSRRTATPPLSPLALPR----TANG-ALHSSVLPEVEKKSVLYNKTVEKPAKSGRPS-EHGSGKPPQ----AEVINLTGHNVGA

Query:  VMEINQSSAKHS-AGEIIKKKESETELGGAH-HGNDERKTGAKKGLPG-TAFMNSNFQSVNNSVLYDSSCSHRDPGLHLAFSDTADGGDGAIVDGHNKNY
        VMEIN+SS  +   GE +KK E+E ++G  H +G++++KT  KK  P  TAFMNSNFQSVNNS+L+DSSC+HRDPGLHLAF D  D GDGAIVDG  K+Y
Subjt:  VMEINQSSAKHS-AGEIIKKKESETELGGAH-HGNDERKTGAKKGLPG-TAFMNSNFQSVNNSVLYDSSCSHRDPGLHLAFSDTADGGDGAIVDGHNKNY

Query:  KP
        KP
Subjt:  KP

A0A5A7TZ24 Zyxin-like; e-value: 6.5e-61; % identity: 56.29
Query:  MANLPRFGRTWQRQAVVAPPAPAAVPPSVPTTQTLQPFEPNL--PPSASP---ASSPLKASPGR-RGPSASPKYRAAATRVGSSP-AKSARSPPASPATK
        MANLPRFGR  QR   V PP PAA  P+V     + PF   +  P  ASP   +  PL + P +   P ASPKY  + TR+  SP AK+  SPP S   K
Subjt:  MANLPRFGRTWQRQAVVAPPAPAAVPPSVPTTQTLQPFEPNL--PPSASP---ASSPLKASPGR-RGPSASPKYRAAATRVGSSP-AKSARSPPASPATK

Query:  YSDRGHAQTTPPLSPAKSRRTATPPLSPLALPR----TANG-ALHSSVLPEVEKKSVLYNKTVEKPAKSGRPS-EHGSGKPPQ----AEVINLTGHNVGA
        Y +R + +TTPPLSPAKSRR  TPPLSPLALPR    T NG      V PEVE K ++YNKTVEKP+KS R S E+GS K  Q     EVI L GHNVGA
Subjt:  YSDRGHAQTTPPLSPAKSRRTATPPLSPLALPR----TANG-ALHSSVLPEVEKKSVLYNKTVEKPAKSGRPS-EHGSGKPPQ----AEVINLTGHNVGA

Query:  VMEINQSSAKHS-AGEIIKKKESETELGGAH-HGNDERKTGAKKGLPG-TAFMNSNFQSVNNSVLYDSSCSHRDPGLHLAFSDTADGGDGAIVDGHNKNY
        VMEIN+SS  +   GE +KK E+E ++G  H +G++++KT  KK  P  TAFMNSNFQSVNNS+L+DSSC+HRDPGLHLAF D  D GDGAIVDG  K+Y
Subjt:  VMEINQSSAKHS-AGEIIKKKESETELGGAH-HGNDERKTGAKKGLPG-TAFMNSNFQSVNNSVLYDSSCSHRDPGLHLAFSDTADGGDGAIVDGHNKNY

Query:  KP
        KP
Subjt:  KP

A0A6J1D1D2 vegetative cell wall protein gp1-like; e-value: 1.2e-147; % identity: 98.94
Query:  MANLPRFGRTWQRQAVVAPPAPAAVPPSVPTTQTLQPFEPNLPPSASPASSPLKASPGRRGPSASPKYRAAATRVGSSPAKSARSPPASPATKYSDRGHA
        MANLPRFGRTWQRQAVVAPPAPAAVPPSVPTTQTLQPFEPNLPPSASPASSPLKASPGRRGPSASPKYRAAA RVGSSPAKSARSPPASPATKYSDRGHA
Subjt:  MANLPRFGRTWQRQAVVAPPAPAAVPPSVPTTQTLQPFEPNLPPSASPASSPLKASPGRRGPSASPKYRAAATRVGSSPAKSARSPPASPATKYSDRGHA

Query:  QTTPPLSPAKSRRTATPPLSPLALPRTANGALHSSVLPEVEKKSVLYNKTVEKPAKSGRPSEHGSGKPPQAEVINLTGHNVGAVMEINQSSAKHSAGEII
        QTTPPLSPAKSRRTATPPLSPLALPRTANGALHSSVLPEVEKKSVLYNKTVEKPAKSGRPSEHGSGKPPQAEVINLTGHNVGAVMEINQSSAKHSAGEII
Subjt:  QTTPPLSPAKSRRTATPPLSPLALPRTANGALHSSVLPEVEKKSVLYNKTVEKPAKSGRPSEHGSGKPPQAEVINLTGHNVGAVMEINQSSAKHSAGEII

Query:  KKKESETELGGAHHGNDERKTGAKKGLPGTAFMNSNFQSVNNSVLYDSSCSHRDPGLHLAFSDTADGGDGAIVDGHNKNYKP
        KKKESETELGGAHHGNDERKTGAKK LPGTAFMNSNFQSVNNSVLYDSSCSHRDPGLHLAFSDTADGG GAIVDGHNKNYKP
Subjt:  KKKESETELGGAHHGNDERKTGAKKGLPGTAFMNSNFQSVNNSVLYDSSCSHRDPGLHLAFSDTADGGDGAIVDGHNKNYKP

A0A6J1GHE5 sulfated surface glycoprotein 185-like; e-value: 1.0e-53; % identity: 52.67
Query:  MANLPRFGRTWQRQAVVAPPAPAAVPPSVPTTQTLQPFEPNLPPSASPASSPLKASPGRRGPSASPKYRAAATRVGS-SPAKSARSPPASPATKYSDRGH
        M+N PRFG + QR ++ APP      P+       QPF      S  PA  P         P ASPKY  + TRV S  P K   SPP SPA KY DR  
Subjt:  MANLPRFGRTWQRQAVVAPPAPAAVPPSVPTTQTLQPFEPNLPPSASPASSPLKASPGRRGPSASPKYRAAATRVGS-SPAKSARSPPASPATKYSDRGH

Query:  AQTTPPLSPAKSRRTA-TPPLSPLALPRTANGALHSS-----VLPEVEKKSVLYNKTVEKPAKSGRPSEHGSGKPPQ----AEVINLTGHNVGAVMEINQ
        +QT+P  SP++S RT+  PP  PLALP T   A++ +     + PEVEKKS++YNKTVEK  KS RPSE+GSGKP +     E INL GHNVGAVMEI++
Subjt:  AQTTPPLSPAKSRRTA-TPPLSPLALPRTANGALHSS-----VLPEVEKKSVLYNKTVEKPAKSGRPSEHGSGKPPQ----AEVINLTGHNVGAVMEINQ

Query:  SSAKHS-AGEIIKKKESETELGGAHHGNDE-------RKTGAKKGLPGTAFMNSNFQSVNNSVLYDSSCSHRDPGLHLAFSDTADGGDGAIVDGHNKNYK
        SS  H   GE ++K E+E   GG   GN+E       +K   KK +P TAFMNSNFQSVNNSVLYDSSCSHRDPGLHL F+D AD GDGA VDG  K+YK
Subjt:  SSAKHS-AGEIIKKKESETELGGAHHGNDE-------RKTGAKKGLPGTAFMNSNFQSVNNSVLYDSSCSHRDPGLHLAFSDTADGGDGAIVDGHNKNYK

SwissProt top hits (e-value, % identity, alignment)
No hits found
Arabidopsis top hits (e-value, % identity, alignment)
AT2G46630.1 unknown protein; e-value: 8.3e-08; % identity: 28.1
Query:  QRQAVVAPPAPAAVPPSVPTTQTLQPFEPNLPPS--ASPASSPLKASPGRRGPSASPKYRAAATRVGSS-PAKSARSPPASPATKYSDRGHAQTTPPLSP
        Q+Q  + PP   A P S P  Q   P+  + PPS   SP + P  A+P    P +S     +   V  + P +   SPP+   +  S    +  T   S 
Subjt:  QRQAVVAPPAPAAVPPSVPTTQTLQPFEPNLPPS--ASPASSPLKASPGRRGPSASPKYRAAATRVGSS-PAKSARSPPASPATKYSDRGHAQTTPPLSP

Query:  AKSRRTATPP--LSPLALPRTANGALHSSVLPEVEKKSVL---------------------------------YNKTV----EKPAKSGR-PSEHGSGKP
        +++ R A  P  LSP +LP +    LHS    E  +K++L                                 YN+        P K  R PS   S   
Subjt:  AKSRRTATPP--LSPLALPRTANGALHSSVLPEVEKKSVL---------------------------------YNKTV----EKPAKSGR-PSEHGSGKP

Query:  PQAEVINLTGHNVGAVMEI--------------NQSSAKHSAGEIIKKKESETELGGAHHGNDERKT------GAKKGLPGTAFMNSNFQSVNNSVLYDS
            VI + G N GAVMEI              + S   H  GE  ++ +S +    +  G  ++KT           LP  AFMNSN Q +NNS++Y+S
Subjt:  PQAEVINLTGHNVGAVMEI--------------NQSSAKHSAGEIIKKKESETELGGAHHGNDERKT------GAKKGLPGTAFMNSNFQSVNNSVLYDS

Query:  SCSHRDPGLHLAFSDTADGGDGAIVDGHNKN
        + SH DPG+HL  S      +G  V  +  N
Subjt:  SCSHRDPGLHLAFSDTADGGDGAIVDGHNKN


Sequences
CDS sequence
ATGGCAAATCTTCCTCGATTCGGCCGTACATGGCAACGTCAGGCTGTGGTTGCGCCGCCAGCTCCCGCCGCCGTGCCGCCATCAGTTCCGACCACCCAAACTCTGCAACC
TTTCGAACCCAACCTGCCGCCATCGGCTTCTCCTGCGTCTTCCCCTTTGAAAGCATCTCCCGGCCGGCGAGGTCCTTCCGCGTCGCCGAAATACAGAGCTGCCGCCACAC
GTGTGGGTAGTTCGCCGGCGAAGTCCGCACGATCGCCGCCGGCGTCTCCTGCAACCAAATATTCCGACCGGGGACATGCCCAAACAACCCCACCTCTCTCGCCGGCCAAG
TCCCGGCGAACAGCAACGCCGCCACTTTCTCCTCTTGCTCTTCCACGTACCGCAAATGGCGCGCTCCATTCTAGCGTGTTGCCGGAGGTGGAGAAGAAAAGTGTTCTGTA
CAACAAGACCGTCGAGAAGCCGGCGAAGTCTGGCCGGCCGTCGGAGCACGGCTCCGGTAAGCCGCCGCAGGCGGAGGTTATAAACCTCACCGGACACAACGTAGGCGCCG
TCATGGAAATAAATCAGTCCTCCGCCAAACATTCCGCCGGAGAAATCATAAAAAAGAAAGAATCTGAAACGGAATTAGGTGGGGCCCACCACGGAAATGATGAGAGGAAG
ACAGGGGCAAAAAAGGGATTGCCGGGGACTGCGTTCATGAACAGCAACTTTCAGAGTGTGAACAATTCCGTTCTGTACGACTCGTCGTGCAGCCACCGTGATCCCGGCCT
GCATCTCGCCTTCTCCGACACGGCGGACGGCGGCGACGGAGCCATTGTTGACGGTCATAACAAGAATTACAAGCCC
mRNA sequence
ATGGCAAATCTTCCTCGATTCGGCCGTACATGGCAACGTCAGGCTGTGGTTGCGCCGCCAGCTCCCGCCGCCGTGCCGCCATCAGTTCCGACCACCCAAACTCTGCAACC
TTTCGAACCCAACCTGCCGCCATCGGCTTCTCCTGCGTCTTCCCCTTTGAAAGCATCTCCCGGCCGGCGAGGTCCTTCCGCGTCGCCGAAATACAGAGCTGCCGCCACAC
GTGTGGGTAGTTCGCCGGCGAAGTCCGCACGATCGCCGCCGGCGTCTCCTGCAACCAAATATTCCGACCGGGGACATGCCCAAACAACCCCACCTCTCTCGCCGGCCAAG
TCCCGGCGAACAGCAACGCCGCCACTTTCTCCTCTTGCTCTTCCACGTACCGCAAATGGCGCGCTCCATTCTAGCGTGTTGCCGGAGGTGGAGAAGAAAAGTGTTCTGTA
CAACAAGACCGTCGAGAAGCCGGCGAAGTCTGGCCGGCCGTCGGAGCACGGCTCCGGTAAGCCGCCGCAGGCGGAGGTTATAAACCTCACCGGACACAACGTAGGCGCCG
TCATGGAAATAAATCAGTCCTCCGCCAAACATTCCGCCGGAGAAATCATAAAAAAGAAAGAATCTGAAACGGAATTAGGTGGGGCCCACCACGGAAATGATGAGAGGAAG
ACAGGGGCAAAAAAGGGATTGCCGGGGACTGCGTTCATGAACAGCAACTTTCAGAGTGTGAACAATTCCGTTCTGTACGACTCGTCGTGCAGCCACCGTGATCCCGGCCT
GCATCTCGCCTTCTCCGACACGGCGGACGGCGGCGACGGAGCCATTGTTGACGGTCATAACAAGAATTACAAGCCC
Protein sequence
MANLPRFGRTWQRQAVVAPPAPAAVPPSVPTTQTLQPFEPNLPPSASPASSPLKASPGRRGPSASPKYRAAATRVGSSPAKSARSPPASPATKYSDRGHAQTTPPLSPAK
SRRTATPPLSPLALPRTANGALHSSVLPEVEKKSVLYNKTVEKPAKSGRPSEHGSGKPPQAEVINLTGHNVGAVMEINQSSAKHSAGEIIKKKESETELGGAHHGNDERK
TGAKKGLPGTAFMNSNFQSVNNSVLYDSSCSHRDPGLHLAFSDTADGGDGAIVDGHNKNYKP