CuGenDBv2

MS010840 (gene) of Bitter gourd (TR) v1 genome

Gene ID: MS010840
Organism: Momordica charantia cv. TR (Bitter gourd (TR) v1)
Description: sp110 nuclear body protein-like
Genome location: scaffold35:2353550..2354407
RNA-Seq Expression: MS010840
Synteny: MS010840
Gene Ontology terms: NA
InterPro domains: NA
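The genome location field can be split into a scaffold name and a coordinate range. The following is a minimal sketch, assuming the `scaffold:start..end` notation uses 1-based, inclusive coordinates (the page does not state the convention); `parse_location` and `span_length` are hypothetical helper names, not part of any CuGenDB tool.

```python
import re


def parse_location(location: str):
    """Split a 'scaffold:start..end' string into (scaffold, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start: int, end: int) -> int:
    # Assumption: coordinates are 1-based and inclusive on both ends.
    return end - start + 1


scaffold, start, end = parse_location("scaffold35:2353550..2354407")
print(scaffold, span_length(start, end))
```

Under that assumption the gene spans 858 bp, which would correspond to 286 codons if the whole span is coding.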


Homology
GenBank top hits (e value, % identity, alignment)
KAG7026590.1 hypothetical protein SDJN02_10592, partial [Cucurbita argyrosperma subsp. argyrosperma]; e value: 3.7e-74; % identity: 62.38
Query:  MGCCVSSG---NSAHKFDRNSAAADEKIY------ESREPPSSMEEETVKEVLSETPALKP---PPPPKNHPPDEDEAPKPVADKVKEEENEIEKKIRVI
        MGCCVSSG   +SAHKFD   AAA  KI+       SREPPSSMEEETVKEVLSET ALKP    PP K+ PP+EDEA KPV DKV     EIEKK+  I
Subjt:  MGCCVSSG---NSAHKFDRNSAAADEKIY------ESREPPSSMEEETVKEVLSETPALKP---PPPPKNHPPDEDEAPKPVADKVKEEENEIEKKIRVI

Query:  PTNCVAEHACEFSEISSPSECLSAATFTDKMDDGEEVHQRVFRASPVKLPKNQSFSGDV--KREMLPNRALNRRSDQSPVRRNGVVGSARLAQNRDMNGP
        P N + + A EF EIS+P +  + A FTD MD GEEVHQ V +A    LP NQS  G+V  KR++ PN+ LNRRSDQSPVRRN  VGSARL Q RD + P
Subjt:  PTNCVAEHACEFSEISSPSECLSAATFTDKMDDGEEVHQRVFRASPVKLPKNQSFSGDV--KREMLPNRALNRRSDQSPVRRNGVVGSARLAQNRDMNGP

Query:  AMVRRSLRAEPPRRDPGEKSGRRSRSPATAHPHGGGSRSALGRTPSVRKSGKSSPVRAATAPAASP---QRVEEESNIIDGKFTTHSESLENPLVSLECF
        AM  R LR EP ++DP E  GRRSRSPATA    GGSRSALGRTPSVRKSGKSSP+R  TA   +P   ++V EE+NI +G   T  ESLENPLVSLECF
Subjt:  AMVRRSLRAEPPRRDPGEKSGRRSRSPATAHPHGGGSRSALGRTPSVRKSGKSSPVRAATAPAASP---QRVEEESNIIDGKFTTHSESLENPLVSLECF

Query:  IFL
        IFL
Subjt:  IFL

XP_008441084.1 PREDICTED: uncharacterized protein LOC103485312 [Cucumis melo]; e value: 4.8e-74; % identity: 62.46
Query:  MGCCVSSG---NSAHKFDRNSAAADEKIYESREPPSSMEEETVKEVLSETPALKPPPPPKNHPPDEDEAPKPVADKVKEEENEIEKKIRVIPTNCVAEHA
        MGCC+SS    NS +KF  +S  A      +R+PPSSMEEETVKEVLSETPALK PPP KN PP+EDE  KP+ D       E EKK+  IP N + E  
Subjt:  MGCCVSSG---NSAHKFDRNSAAADEKIYESREPPSSMEEETVKEVLSETPALKPPPPPKNHPPDEDEAPKPVADKVKEEENEIEKKIRVIPTNCVAEHA

Query:  CEFSEISSPSECL--SAATFTDKMDDGEEVHQRVFRASPVKLPKNQSFSGDV--KREMLPNRALNRRSDQSPVRRNGVVGSARLAQNRDMNGPAMVRRSL
         EF EIS  ++C+  SAATFTD+ D G EVHQ   ++SPVKL KNQS S DV  KRE+  +R L RRSDQSPVRRNG VGS R+  NRDM+ PAM RR L
Subjt:  CEFSEISSPSECL--SAATFTDKMDDGEEVHQRVFRASPVKLPKNQSFSGDV--KREMLPNRALNRRSDQSPVRRNGVVGSARLAQNRDMNGPAMVRRSL

Query:  RAEPPRRDPGEKSGRRSRSPATAHPHGGGSRSALGRTPSVRKSGKSSPVRAATAPAASPQRVEEESNIIDGKFTTHSESLENPLVSLECFIFL
        RAEPPRRDP E S RRS+SP+TA     G RSAL RTPS RKSGKSSP+RA T   A+ Q+V EE+NI+DGKF T  ESLENPLVSLECFIFL
Subjt:  RAEPPRRDPGEKSGRRSRSPATAHPHGGGSRSALGRTPSVRKSGKSSPVRAATAPAASPQRVEEESNIIDGKFTTHSESLENPLVSLECFIFL

XP_022142272.1 uncharacterized protein LOC111012433, partial [Momordica charantia]; e value: 7.2e-94; % identity: 99.46
Query:  MGCCVSSGNSAHKFDRNSAAADEKIYESREPPSSMEEETVKEVLSETPALKPPPPPKNHPPDEDEAPKPVADKVKEEENEIEKKIRVIPTNCVAEHACEF
        MGCCVSSGNSAHKFDRNSAAADEKIYESREPPSSMEEETVKEVL+ETPALKPPPPPKNHPPDEDEAPKPVADKVKEEENEIEKKIRVIPTNCVAEHACEF
Subjt:  MGCCVSSGNSAHKFDRNSAAADEKIYESREPPSSMEEETVKEVLSETPALKPPPPPKNHPPDEDEAPKPVADKVKEEENEIEKKIRVIPTNCVAEHACEF

Query:  SEISSPSECLSAATFTDKMDDGEEVHQRVFRASPVKLPKNQSFSGDVKREMLPNRALNRRSDQSPVRRNGVVGSARLAQNRDMN
        SEISSPSECLSAATFTDKMDDGEEVHQRVFRASPVKLPKNQSFSGDVKREMLPNRALNRRSDQSPVRRNGVVGSARLAQNRDMN
Subjt:  SEISSPSECLSAATFTDKMDDGEEVHQRVFRASPVKLPKNQSFSGDVKREMLPNRALNRRSDQSPVRRNGVVGSARLAQNRDMN
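The % identity column can be related to the alignment rows. The sketch below assumes the usual BLAST-style convention, in which the middle line repeats the residue at identical positions, shows `+` for conservative substitutions, and is blank otherwise, and in which identity is the number of identical columns divided by the alignment length. `percent_identity` is a hypothetical helper, and the ten-residue excerpt is taken from the alignment above purely for illustration.

```python
def percent_identity(query: str, midline: str) -> float:
    """Percent of alignment columns whose midline shows an identical residue."""
    if len(query) != len(midline):
        raise ValueError("query and midline must have the same length")
    if not query:
        raise ValueError("empty alignment")
    # A letter on the midline marks an identical column; '+' and ' ' do not.
    identical = sum(1 for m in midline if m.isalpha())
    return 100.0 * identical / len(query)


# Excerpt around the single '+' column in the alignment above.
print(percent_identity("KEVLSETPAL", "KEVL+ETPAL"))

# The full alignment above appears to cover 184 columns with one substitution.
print(round(100.0 * 183 / 184, 2))
```

If the 184-column count is right, 183 identical columns give about 99.46 %, in line with the value reported for this hit.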

XP_022926404.1 uncharacterized protein LOC111433567 [Cucurbita moschata]; e value: 4.4e-75; % identity: 62.71
Query:  MGCCVSSG---NSAHKFDRNSAAADEKIY------ESREPPSSMEEETVKEVLSETPALKP---PPPPKNHPPDEDEAPKPVADKVKEEENEIEKKIRVI
        MGCCVSSG   +SAHKFD   AAA +KI+       SREPPSSMEEETVKEVLSET ALKP    PP KN PP+EDEA KPV DKV     EIEKK+  I
Subjt:  MGCCVSSG---NSAHKFDRNSAAADEKIY------ESREPPSSMEEETVKEVLSETPALKP---PPPPKNHPPDEDEAPKPVADKVKEEENEIEKKIRVI

Query:  PTNCVAEHACEFSEISSPSECLSAATFTDKMDDGEEVHQRVFRASPVKLPKNQSFSGDV--KREMLPNRALNRRSDQSPVRRNGVVGSARLAQNRDMNGP
        P N + +   EF EIS+P +  + A FTD MD GEEVHQ V +A    LP NQS  G+V  KR++ PN+ LNRRSDQSPVRRN  VGSARL Q RD + P
Subjt:  PTNCVAEHACEFSEISSPSECLSAATFTDKMDDGEEVHQRVFRASPVKLPKNQSFSGDV--KREMLPNRALNRRSDQSPVRRNGVVGSARLAQNRDMNGP

Query:  AMVRRSLRAEPPRRDPGEKSGRRSRSPATAHPHGGGSRSALGRTPSVRKSGKSSPVRAATAPAASP---QRVEEESNIIDGKFTTHSESLENPLVSLECF
        AM  R LR EP ++DP E  GRRSRSPATA    GGSRSALGRTPSVRKSGKSSP+R  TA   +P   ++V EE+NI DG   T  ESLENPLVSLECF
Subjt:  AMVRRSLRAEPPRRDPGEKSGRRSRSPATAHPHGGGSRSALGRTPSVRKSGKSSPVRAATAPAASP---QRVEEESNIIDGKFTTHSESLENPLVSLECF

Query:  IFL
        IFL
Subjt:  IFL

XP_038882208.1 uncharacterized protein LOC120073430 [Benincasa hispida]; e value: 1.6e-80; % identity: 65.76
Query:  MGCCVSSG---NSAHKFDRNSAAADEKIYESREPPSSMEEETVKEVLSETPALKPP--PPPKNHPPDEDEAPKPVADKVKEEENEIEKKIRVIPTNCVAE
        MGCC+SSG   NS +KF RNS         SR+PPSSMEEETVKEVLSETP+LKPP  PP KN PP+ED+  KPV        NEIEKK+  I  N +AE
Subjt:  MGCCVSSG---NSAHKFDRNSAAADEKIYESREPPSSMEEETVKEVLSETPALKPP--PPPKNHPPDEDEAPKPVADKVKEEENEIEKKIRVIPTNCVAE

Query:  HACEFSEISSPSECLSAAT--FTDKMDDGEEVHQRVFRASPVKLPKNQSFSGD--VKREMLPNRALNRRSDQSPVRRNGVVGSARLAQNRDMNGPAMVRR
           EF EIS P+EC+S +T   T++MD G E+HQ V ++SPVKLPK+QS SGD  VKRE+  NR L RRSDQSPVRRNG +GS R+  NRDMN PAM RR
Subjt:  HACEFSEISSPSECLSAAT--FTDKMDDGEEVHQRVFRASPVKLPKNQSFSGD--VKREMLPNRALNRRSDQSPVRRNGVVGSARLAQNRDMNGPAMVRR

Query:  SLRAEPPRRDPGEKSGRRSRSPATAHPHGGGSRSALGRTPSVRKSGKSSPVRAATAPAASPQRVEEESNIIDGKFTTHSESLENPLVSLECFIFL
         LRAEPPRRDP E S RRSRSPATA   G GSRSAL RTPSVRKSGKSSP RAAT   A+ Q+V EE+NIIDGKF +  ESLENPLVSLECFIFL
Subjt:  SLRAEPPRRDPGEKSGRRSRSPATAHPHGGGSRSALGRTPSVRKSGKSSPVRAATAPAASPQRVEEESNIIDGKFTTHSESLENPLVSLECFIFL

TrEMBL top hits (e value, % identity, alignment)
A0A0A0KLE9 Uncharacterized protein; e value: 7.5e-73; % identity: 61.43
Query:  MGCCVSSG---NSAHKFDRNSAAADEKIYESREPPSSMEEETVKEVLSETPALKPPPPPKNHPPDEDEAPKPVADKVKEEENEIEKKIRVIPTNCVAEHA
        MGCC+SS    +S +KF  NS      +  SR+PPSSMEEETVKEVLSETPALK PP   N  P++DE  KP+ D       EIEKK+  IP N + E  
Subjt:  MGCCVSSG---NSAHKFDRNSAAADEKIYESREPPSSMEEETVKEVLSETPALKPPPPPKNHPPDEDEAPKPVADKVKEEENEIEKKIRVIPTNCVAEHA

Query:  CEFSEISSPSECL--SAATFTDKMDDGEEVHQRVFRASPVKLPKNQSFSGDV--KREMLPNRALNRRSDQSPVRRNGVVGSARLAQNRDMNGPAMVRRSL
         EF EIS  ++C+  SAATFTD+ D G EVHQ V ++SPVKL KNQS S DV  KRE+  +R L RRSDQSPVRRNG VGS R+  NRDM+ PAM RR L
Subjt:  CEFSEISSPSECL--SAATFTDKMDDGEEVHQRVFRASPVKLPKNQSFSGDV--KREMLPNRALNRRSDQSPVRRNGVVGSARLAQNRDMNGPAMVRRSL

Query:  RAEPPRRDPGEKSGRRSRSPATAHPHGGGSRSALGRTPSVRKSGKSSPVRAATAPAASPQRVEEESNIIDGKFTTHSESLENPLVSLECFIFL
        RAEPPRRDP E S RRS SP+TA     G RSAL RTPS RKSGKSSP+   TA  A+ Q+V EE+NI+DGKF T  ESLENPLVSLECFIFL
Subjt:  RAEPPRRDPGEKSGRRSRSPATAHPHGGGSRSALGRTPSVRKSGKSSPVRAATAPAASPQRVEEESNIIDGKFTTHSESLENPLVSLECFIFL

A0A1S3B2L5 uncharacterized protein LOC103485312; e value: 2.3e-74; % identity: 62.46
Query:  MGCCVSSG---NSAHKFDRNSAAADEKIYESREPPSSMEEETVKEVLSETPALKPPPPPKNHPPDEDEAPKPVADKVKEEENEIEKKIRVIPTNCVAEHA
        MGCC+SS    NS +KF  +S  A      +R+PPSSMEEETVKEVLSETPALK PPP KN PP+EDE  KP+ D       E EKK+  IP N + E  
Subjt:  MGCCVSSG---NSAHKFDRNSAAADEKIYESREPPSSMEEETVKEVLSETPALKPPPPPKNHPPDEDEAPKPVADKVKEEENEIEKKIRVIPTNCVAEHA

Query:  CEFSEISSPSECL--SAATFTDKMDDGEEVHQRVFRASPVKLPKNQSFSGDV--KREMLPNRALNRRSDQSPVRRNGVVGSARLAQNRDMNGPAMVRRSL
         EF EIS  ++C+  SAATFTD+ D G EVHQ   ++SPVKL KNQS S DV  KRE+  +R L RRSDQSPVRRNG VGS R+  NRDM+ PAM RR L
Subjt:  CEFSEISSPSECL--SAATFTDKMDDGEEVHQRVFRASPVKLPKNQSFSGDV--KREMLPNRALNRRSDQSPVRRNGVVGSARLAQNRDMNGPAMVRRSL

Query:  RAEPPRRDPGEKSGRRSRSPATAHPHGGGSRSALGRTPSVRKSGKSSPVRAATAPAASPQRVEEESNIIDGKFTTHSESLENPLVSLECFIFL
        RAEPPRRDP E S RRS+SP+TA     G RSAL RTPS RKSGKSSP+RA T   A+ Q+V EE+NI+DGKF T  ESLENPLVSLECFIFL
Subjt:  RAEPPRRDPGEKSGRRSRSPATAHPHGGGSRSALGRTPSVRKSGKSSPVRAATAPAASPQRVEEESNIIDGKFTTHSESLENPLVSLECFIFL

A0A5D3CNI1 Putative BEST plant protein match is: (TAIR:plant.1) protein; e value: 2.3e-74; % identity: 62.46
Query:  MGCCVSSG---NSAHKFDRNSAAADEKIYESREPPSSMEEETVKEVLSETPALKPPPPPKNHPPDEDEAPKPVADKVKEEENEIEKKIRVIPTNCVAEHA
        MGCC+SS    NS +KF  +S  A      +R+PPSSMEEETVKEVLSETPALK PPP KN PP+EDE  KP+ D       E EKK+  IP N + E  
Subjt:  MGCCVSSG---NSAHKFDRNSAAADEKIYESREPPSSMEEETVKEVLSETPALKPPPPPKNHPPDEDEAPKPVADKVKEEENEIEKKIRVIPTNCVAEHA

Query:  CEFSEISSPSECL--SAATFTDKMDDGEEVHQRVFRASPVKLPKNQSFSGDV--KREMLPNRALNRRSDQSPVRRNGVVGSARLAQNRDMNGPAMVRRSL
         EF EIS  ++C+  SAATFTD+ D G EVHQ   ++SPVKL KNQS S DV  KRE+  +R L RRSDQSPVRRNG VGS R+  NRDM+ PAM RR L
Subjt:  CEFSEISSPSECL--SAATFTDKMDDGEEVHQRVFRASPVKLPKNQSFSGDV--KREMLPNRALNRRSDQSPVRRNGVVGSARLAQNRDMNGPAMVRRSL

Query:  RAEPPRRDPGEKSGRRSRSPATAHPHGGGSRSALGRTPSVRKSGKSSPVRAATAPAASPQRVEEESNIIDGKFTTHSESLENPLVSLECFIFL
        RAEPPRRDP E S RRS+SP+TA     G RSAL RTPS RKSGKSSP+RA T   A+ Q+V EE+NI+DGKF T  ESLENPLVSLECFIFL
Subjt:  RAEPPRRDPGEKSGRRSRSPATAHPHGGGSRSALGRTPSVRKSGKSSPVRAATAPAASPQRVEEESNIIDGKFTTHSESLENPLVSLECFIFL

A0A6J1CMA7 uncharacterized protein LOC111012433; e value: 3.5e-94; % identity: 99.46
Query:  MGCCVSSGNSAHKFDRNSAAADEKIYESREPPSSMEEETVKEVLSETPALKPPPPPKNHPPDEDEAPKPVADKVKEEENEIEKKIRVIPTNCVAEHACEF
        MGCCVSSGNSAHKFDRNSAAADEKIYESREPPSSMEEETVKEVL+ETPALKPPPPPKNHPPDEDEAPKPVADKVKEEENEIEKKIRVIPTNCVAEHACEF
Subjt:  MGCCVSSGNSAHKFDRNSAAADEKIYESREPPSSMEEETVKEVLSETPALKPPPPPKNHPPDEDEAPKPVADKVKEEENEIEKKIRVIPTNCVAEHACEF

Query:  SEISSPSECLSAATFTDKMDDGEEVHQRVFRASPVKLPKNQSFSGDVKREMLPNRALNRRSDQSPVRRNGVVGSARLAQNRDMN
        SEISSPSECLSAATFTDKMDDGEEVHQRVFRASPVKLPKNQSFSGDVKREMLPNRALNRRSDQSPVRRNGVVGSARLAQNRDMN
Subjt:  SEISSPSECLSAATFTDKMDDGEEVHQRVFRASPVKLPKNQSFSGDVKREMLPNRALNRRSDQSPVRRNGVVGSARLAQNRDMN

A0A6J1EF08 uncharacterized protein LOC111433567; e value: 2.1e-75; % identity: 62.71
Query:  MGCCVSSG---NSAHKFDRNSAAADEKIY------ESREPPSSMEEETVKEVLSETPALKP---PPPPKNHPPDEDEAPKPVADKVKEEENEIEKKIRVI
        MGCCVSSG   +SAHKFD   AAA +KI+       SREPPSSMEEETVKEVLSET ALKP    PP KN PP+EDEA KPV DKV     EIEKK+  I
Subjt:  MGCCVSSG---NSAHKFDRNSAAADEKIY------ESREPPSSMEEETVKEVLSETPALKP---PPPPKNHPPDEDEAPKPVADKVKEEENEIEKKIRVI

Query:  PTNCVAEHACEFSEISSPSECLSAATFTDKMDDGEEVHQRVFRASPVKLPKNQSFSGDV--KREMLPNRALNRRSDQSPVRRNGVVGSARLAQNRDMNGP
        P N + +   EF EIS+P +  + A FTD MD GEEVHQ V +A    LP NQS  G+V  KR++ PN+ LNRRSDQSPVRRN  VGSARL Q RD + P
Subjt:  PTNCVAEHACEFSEISSPSECLSAATFTDKMDDGEEVHQRVFRASPVKLPKNQSFSGDV--KREMLPNRALNRRSDQSPVRRNGVVGSARLAQNRDMNGP

Query:  AMVRRSLRAEPPRRDPGEKSGRRSRSPATAHPHGGGSRSALGRTPSVRKSGKSSPVRAATAPAASP---QRVEEESNIIDGKFTTHSESLENPLVSLECF
        AM  R LR EP ++DP E  GRRSRSPATA    GGSRSALGRTPSVRKSGKSSP+R  TA   +P   ++V EE+NI DG   T  ESLENPLVSLECF
Subjt:  AMVRRSLRAEPPRRDPGEKSGRRSRSPATAHPHGGGSRSALGRTPSVRKSGKSSPVRAATAPAASP---QRVEEESNIIDGKFTTHSESLENPLVSLECF

Query:  IFL
        IFL
Subjt:  IFL

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
AT1G11125.1 unknown protein; e value: 7.1e-07; % identity: 31.27
Query:  MGCCVSSGNSAHKFDRNSAAADEKIYESREPPSSMEEET-VKEVLSETPALKPPPPPKNHPPDEDEAPKPVADKVKEEENE--------IEKKIRVIPTN
        MGCC+SS  +  K D  S         +  PPS ++EET VKEVLSET  L           D     K    K++EEE +         ++ +   P+ 
Subjt:  MGCCVSSGNSAHKFDRNSAAADEKIYESREPPSSMEEET-VKEVLSETPALKPPPPPKNHPPDEDEAPKPVADKVKEEENE--------IEKKIRVIPTN

Query:  CVAEHACEFSEISSPSECLSAATFTDKMDDGEEVHQR----VFRASPVKLPKNQSFSGDVKREMLPNRALNRRSDQSPVRRNG--VVGSARLAQNRDMNG
           E   E SE  S SE  S  +  +K D+ EEV Q     V + SP K            R  +     NRR+D SP +RN     GS RL        
Subjt:  CVAEHACEFSEISSPSECLSAATFTDKMDDGEEVHQR----VFRASPVKLPKNQSFSGDVKREMLPNRALNRRSDQSPVRRNG--VVGSARLAQNRDMNG

Query:  PAMVRRSLRAEPPRRDPGEKSGRRSRSPATAHPHGGGSRSALGRTPS--VRKSGKSSPVRAATAPAASPQRVEEESNIIDGKFTTH----SESLENPLVS
               + +    RD  E+S RRSRSPA       G   +   T S    +    SP R    P  +    E   +   G    +    ++S ENPLVS
Subjt:  PAMVRRSLRAEPPRRDPGEKSGRRSRSPATAHPHGGGSRSALGRTPS--VRKSGKSSPVRAATAPAASPQRVEEESNIIDGKFTTH----SESLENPLVS

Query:  LECFIFL
        LECFIFL
Subjt:  LECFIFL

AT1G61170.1 unknown protein; e value: 2.4e-10; % identity: 33.22
Query:  CCVSSGNSAHKFDRNSAAADEKIYESREPPSSMEEET-VKEVLSETPALKPPPPPKNHPPDEDEAPKPVADKVKEEENE-------IEKKIRVIPTNCVA
        CCVSSG +    DR +    +K        + +EEET VKEVLSET    P    +           PV  K++E+E +           + + P +   
Subjt:  CCVSSGNSAHKFDRNSAAADEKIYESREPPSSMEEET-VKEVLSETPALKPPPPPKNHPPDEDEAPKPVADKVKEEENE-------IEKKIRVIPTNCVA

Query:  EHACEFSEISSPSECLSAATFTDKMDDGEEVHQRVFRASPVKLPKNQSFSGDVKREMLPNRALNRRSDQSPVRRNGVVGSARLAQNRDMNGPAMVRRSLR
        E   E SEI S S   S ++ T  M+  +E H        +K  K+Q      + ++  N    RR+DQSP +RN          N   NG        R
Subjt:  EHACEFSEISSPSECLSAATFTDKMDDGEEVHQRVFRASPVKLPKNQSFSGDVKREMLPNRALNRRSDQSPVRRNGVVGSARLAQNRDMNGPAMVRRSLR

Query:  AEPPRRDPGEKSGRRSRSPATAHPHGGGSRSALGRTPSVRKSGKSSPVRAATAPAASPQRVEEESNIIDGKFTTHSESLENPLVSLECFIFL
             RDPGE+SGRRSRSPAT       ++S+       RK+ + SP R    PA +    ++  N     +TT  E LENPLVSLECFIFL
Subjt:  AEPPRRDPGEKSGRRSRSPATAHPHGGGSRSALGRTPSVRKSGKSSPVRAATAPAASPQRVEEESNIIDGKFTTHSESLENPLVSLECFIFL


Sequences
CDS sequence
ATGGGTTGCTGTGTTAGTTCCGGAAATTCAGCTCACAAATTCGATCGCAATTCTGCGGCGGCAGACGAGAAGATTTATGAAAGCAGAGAACCGCCGTCTTCCATGGAGGA
AGAGACCGTGAAAGAAGTGCTCTCTGAAACTCCTGCTCTGAAACCACCGCCGCCACCCAAGAATCACCCGCCGGACGAAGATGAAGCCCCAAAACCAGTCGCCGATAAGG
TCAAGGAGGAAGAAAACGAGATCGAGAAGAAAATTCGAGTAATTCCCACTAATTGTGTTGCAGAACACGCTTGTGAATTCTCTGAAATTTCCAGTCCGAGCGAGTGTCTT
TCCGCAGCCACTTTCACCGATAAAATGGACGACGGCGAGGAGGTTCATCAGAGGGTTTTCAGAGCATCGCCGGTGAAATTGCCGAAGAATCAATCATTTTCCGGGGATGT
AAAAAGAGAAATGTTGCCGAACAGAGCACTTAACCGGAGATCCGACCAGTCGCCGGTTCGACGAAACGGCGTCGTGGGGTCGGCGAGATTGGCTCAGAACAGAGACATGA
ATGGCCCGGCAATGGTGCGGCGGAGCTTGAGGGCGGAGCCTCCCCGGAGAGACCCAGGTGAAAAATCTGGCCGGAGATCCAGGTCGCCCGCCACCGCACATCCCCACGGC
GGAGGGTCTAGATCCGCCTTGGGGCGGACCCCGTCGGTGAGGAAGTCCGGGAAATCGTCGCCGGTTCGGGCGGCGACGGCACCGGCGGCCAGTCCTCAAAGAGTAGAAGA
AGAATCCAACATTATCGATGGGAAATTCACCACTCACAGCGAGTCATTGGAGAACCCTCTGGTTTCCTTAGAGTGCTTTATTTTCCTC
mRNA sequence
ATGGGTTGCTGTGTTAGTTCCGGAAATTCAGCTCACAAATTCGATCGCAATTCTGCGGCGGCAGACGAGAAGATTTATGAAAGCAGAGAACCGCCGTCTTCCATGGAGGA
AGAGACCGTGAAAGAAGTGCTCTCTGAAACTCCTGCTCTGAAACCACCGCCGCCACCCAAGAATCACCCGCCGGACGAAGATGAAGCCCCAAAACCAGTCGCCGATAAGG
TCAAGGAGGAAGAAAACGAGATCGAGAAGAAAATTCGAGTAATTCCCACTAATTGTGTTGCAGAACACGCTTGTGAATTCTCTGAAATTTCCAGTCCGAGCGAGTGTCTT
TCCGCAGCCACTTTCACCGATAAAATGGACGACGGCGAGGAGGTTCATCAGAGGGTTTTCAGAGCATCGCCGGTGAAATTGCCGAAGAATCAATCATTTTCCGGGGATGT
AAAAAGAGAAATGTTGCCGAACAGAGCACTTAACCGGAGATCCGACCAGTCGCCGGTTCGACGAAACGGCGTCGTGGGGTCGGCGAGATTGGCTCAGAACAGAGACATGA
ATGGCCCGGCAATGGTGCGGCGGAGCTTGAGGGCGGAGCCTCCCCGGAGAGACCCAGGTGAAAAATCTGGCCGGAGATCCAGGTCGCCCGCCACCGCACATCCCCACGGC
GGAGGGTCTAGATCCGCCTTGGGGCGGACCCCGTCGGTGAGGAAGTCCGGGAAATCGTCGCCGGTTCGGGCGGCGACGGCACCGGCGGCCAGTCCTCAAAGAGTAGAAGA
AGAATCCAACATTATCGATGGGAAATTCACCACTCACAGCGAGTCATTGGAGAACCCTCTGGTTTCCTTAGAGTGCTTTATTTTCCTC
Protein sequence
MGCCVSSGNSAHKFDRNSAAADEKIYESREPPSSMEEETVKEVLSETPALKPPPPPKNHPPDEDEAPKPVADKVKEEENEIEKKIRVIPTNCVAEHACEFSEISSPSECL
SAATFTDKMDDGEEVHQRVFRASPVKLPKNQSFSGDVKREMLPNRALNRRSDQSPVRRNGVVGSARLAQNRDMNGPAMVRRSLRAEPPRRDPGEKSGRRSRSPATAHPHG
GGSRSALGRTPSVRKSGKSSPVRAATAPAASPQRVEEESNIIDGKFTTHSESLENPLVSLECFIFL
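The CDS and protein sequences above can be cross-checked by translating the CDS. This is a minimal sketch, assuming the standard nuclear genetic code applies; `translate` is a hypothetical helper written for illustration, and only the first 51 nucleotides of the CDS are used here.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence; trailing partial codons are ignored."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3
    # Codons containing characters other than A, C, G, T become 'X'.
    return "".join(CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, usable, 3))


# First 51 nt of the CDS sequence listed above.
cds_start = "ATGGGTTGCTGTGTTAGTTCCGGAAATTCAGCTCACAAATTCGATCGCAAT"
print(translate(cds_start))
```

The translated prefix can then be compared with the start of the protein sequence (`MGCCVSSGNSAHKFDRN...`); running the same function over the full CDS is the natural way to check the whole record.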