; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc01g31470 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc01g31470
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Descriptionsp110 nuclear body protein-like
Genome locationchr1:22087824..22088684
RNA-Seq ExpressionMoc01g31470
SyntenyMoc01g31470
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7026590.1 hypothetical protein SDJN02_10592, partial [Cucurbita argyrosperma subsp. argyrosperma]8.3e-7462.05Show/hide
Query:  MGCCVSSG---NSAHKFDRNSAAADEKIY------ESREPPSSMEEETVKEVLTETPALKP---PPPPKNHPPDEDEAPKPVADKVKEEENEIEKKIRVI
        MGCCVSSG   +SAHKFD   AAA  KI+       SREPPSSMEEETVKEVL+ET ALKP    PP K+ PP+EDEA KPV DKV     EIEKK+  I
Subjt:  MGCCVSSG---NSAHKFDRNSAAADEKIY------ESREPPSSMEEETVKEVLTETPALKP---PPPPKNHPPDEDEAPKPVADKVKEEENEIEKKIRVI

Query:  PTNCVAEHACEFSEISSPSECLSAATFTDKMDDGEEVHQRVFRASPVKLPKNQSFSGDV--KREMLPNRALNRRSDQSPVRRNGVVGSARLAQNRDMNGP
        P N + + A EF EIS+P +  + A FTD MD GEEVHQ V +A    LP NQS  G+V  KR++ PN+ LNRRSDQSPVRRN  VGSARL Q RD + P
Subjt:  PTNCVAEHACEFSEISSPSECLSAATFTDKMDDGEEVHQRVFRASPVKLPKNQSFSGDV--KREMLPNRALNRRSDQSPVRRNGVVGSARLAQNRDMNGP

Query:  AMVRRSLRAEPPRRDPGEKSGRRSRSPATAHPHGGGSRSALGRTPSVRKSGKSSPVRAATAPAASP---QRVEEESNIIDGKFTTHSESLENPLVSLECF
        AM  R LR EP ++DP E  GRRSRSPATA    GGSRSALGRTPSVRKSGKSSP+R  TA   +P   ++V EE+NI +G   T  ESLENPLVSLECF
Subjt:  AMVRRSLRAEPPRRDPGEKSGRRSRSPATAHPHGGGSRSALGRTPSVRKSGKSSPVRAATAPAASP---QRVEEESNIIDGKFTTHSESLENPLVSLECF

Query:  IFL
        IFL
Subjt:  IFL

XP_008441084.1 PREDICTED: uncharacterized protein LOC103485312 [Cucumis melo]1.1e-7362.12Show/hide
Query:  MGCCVSSG---NSAHKFDRNSAAADEKIYESREPPSSMEEETVKEVLTETPALKPPPPPKNHPPDEDEAPKPVADKVKEEENEIEKKIRVIPTNCVAEHA
        MGCC+SS    NS +KF  +S  A      +R+PPSSMEEETVKEVL+ETPALK PPP KN PP+EDE  KP+ D       E EKK+  IP N + E  
Subjt:  MGCCVSSG---NSAHKFDRNSAAADEKIYESREPPSSMEEETVKEVLTETPALKPPPPPKNHPPDEDEAPKPVADKVKEEENEIEKKIRVIPTNCVAEHA

Query:  CEFSEISSPSECL--SAATFTDKMDDGEEVHQRVFRASPVKLPKNQSFSGDV--KREMLPNRALNRRSDQSPVRRNGVVGSARLAQNRDMNGPAMVRRSL
         EF EIS  ++C+  SAATFTD+ D G EVHQ   ++SPVKL KNQS S DV  KRE+  +R L RRSDQSPVRRNG VGS R+  NRDM+ PAM RR L
Subjt:  CEFSEISSPSECL--SAATFTDKMDDGEEVHQRVFRASPVKLPKNQSFSGDV--KREMLPNRALNRRSDQSPVRRNGVVGSARLAQNRDMNGPAMVRRSL

Query:  RAEPPRRDPGEKSGRRSRSPATAHPHGGGSRSALGRTPSVRKSGKSSPVRAATAPAASPQRVEEESNIIDGKFTTHSESLENPLVSLECFIFL
        RAEPPRRDP E S RRS+SP+TA     G RSAL RTPS RKSGKSSP+RA T   A+ Q+V EE+NI+DGKF T  ESLENPLVSLECFIFL
Subjt:  RAEPPRRDPGEKSGRRSRSPATAHPHGGGSRSALGRTPSVRKSGKSSPVRAATAPAASPQRVEEESNIIDGKFTTHSESLENPLVSLECFIFL

XP_022142272.1 uncharacterized protein LOC111012433, partial [Momordica charantia]1.9e-94100Show/hide
Query:  MGCCVSSGNSAHKFDRNSAAADEKIYESREPPSSMEEETVKEVLTETPALKPPPPPKNHPPDEDEAPKPVADKVKEEENEIEKKIRVIPTNCVAEHACEF
        MGCCVSSGNSAHKFDRNSAAADEKIYESREPPSSMEEETVKEVLTETPALKPPPPPKNHPPDEDEAPKPVADKVKEEENEIEKKIRVIPTNCVAEHACEF
Subjt:  MGCCVSSGNSAHKFDRNSAAADEKIYESREPPSSMEEETVKEVLTETPALKPPPPPKNHPPDEDEAPKPVADKVKEEENEIEKKIRVIPTNCVAEHACEF

Query:  SEISSPSECLSAATFTDKMDDGEEVHQRVFRASPVKLPKNQSFSGDVKREMLPNRALNRRSDQSPVRRNGVVGSARLAQNRDMN
        SEISSPSECLSAATFTDKMDDGEEVHQRVFRASPVKLPKNQSFSGDVKREMLPNRALNRRSDQSPVRRNGVVGSARLAQNRDMN
Subjt:  SEISSPSECLSAATFTDKMDDGEEVHQRVFRASPVKLPKNQSFSGDVKREMLPNRALNRRSDQSPVRRNGVVGSARLAQNRDMN

XP_022926404.1 uncharacterized protein LOC111433567 [Cucurbita moschata]9.8e-7562.38Show/hide
Query:  MGCCVSSG---NSAHKFDRNSAAADEKIY------ESREPPSSMEEETVKEVLTETPALKP---PPPPKNHPPDEDEAPKPVADKVKEEENEIEKKIRVI
        MGCCVSSG   +SAHKFD   AAA +KI+       SREPPSSMEEETVKEVL+ET ALKP    PP KN PP+EDEA KPV DKV     EIEKK+  I
Subjt:  MGCCVSSG---NSAHKFDRNSAAADEKIY------ESREPPSSMEEETVKEVLTETPALKP---PPPPKNHPPDEDEAPKPVADKVKEEENEIEKKIRVI

Query:  PTNCVAEHACEFSEISSPSECLSAATFTDKMDDGEEVHQRVFRASPVKLPKNQSFSGDV--KREMLPNRALNRRSDQSPVRRNGVVGSARLAQNRDMNGP
        P N + +   EF EIS+P +  + A FTD MD GEEVHQ V +A    LP NQS  G+V  KR++ PN+ LNRRSDQSPVRRN  VGSARL Q RD + P
Subjt:  PTNCVAEHACEFSEISSPSECLSAATFTDKMDDGEEVHQRVFRASPVKLPKNQSFSGDV--KREMLPNRALNRRSDQSPVRRNGVVGSARLAQNRDMNGP

Query:  AMVRRSLRAEPPRRDPGEKSGRRSRSPATAHPHGGGSRSALGRTPSVRKSGKSSPVRAATAPAASP---QRVEEESNIIDGKFTTHSESLENPLVSLECF
        AM  R LR EP ++DP E  GRRSRSPATA    GGSRSALGRTPSVRKSGKSSP+R  TA   +P   ++V EE+NI DG   T  ESLENPLVSLECF
Subjt:  AMVRRSLRAEPPRRDPGEKSGRRSRSPATAHPHGGGSRSALGRTPSVRKSGKSSPVRAATAPAASP---QRVEEESNIIDGKFTTHSESLENPLVSLECF

Query:  IFL
        IFL
Subjt:  IFL

XP_038882208.1 uncharacterized protein LOC120073430 [Benincasa hispida]3.5e-8065.42Show/hide
Query:  MGCCVSSG---NSAHKFDRNSAAADEKIYESREPPSSMEEETVKEVLTETPALKPP--PPPKNHPPDEDEAPKPVADKVKEEENEIEKKIRVIPTNCVAE
        MGCC+SSG   NS +KF RNS         SR+PPSSMEEETVKEVL+ETP+LKPP  PP KN PP+ED+  KPV        NEIEKK+  I  N +AE
Subjt:  MGCCVSSG---NSAHKFDRNSAAADEKIYESREPPSSMEEETVKEVLTETPALKPP--PPPKNHPPDEDEAPKPVADKVKEEENEIEKKIRVIPTNCVAE

Query:  HACEFSEISSPSECLSAAT--FTDKMDDGEEVHQRVFRASPVKLPKNQSFSGD--VKREMLPNRALNRRSDQSPVRRNGVVGSARLAQNRDMNGPAMVRR
           EF EIS P+EC+S +T   T++MD G E+HQ V ++SPVKLPK+QS SGD  VKRE+  NR L RRSDQSPVRRNG +GS R+  NRDMN PAM RR
Subjt:  HACEFSEISSPSECLSAAT--FTDKMDDGEEVHQRVFRASPVKLPKNQSFSGD--VKREMLPNRALNRRSDQSPVRRNGVVGSARLAQNRDMNGPAMVRR

Query:  SLRAEPPRRDPGEKSGRRSRSPATAHPHGGGSRSALGRTPSVRKSGKSSPVRAATAPAASPQRVEEESNIIDGKFTTHSESLENPLVSLECFIFL
         LRAEPPRRDP E S RRSRSPATA   G GSRSAL RTPSVRKSGKSSP RAAT   A+ Q+V EE+NIIDGKF +  ESLENPLVSLECFIFL
Subjt:  SLRAEPPRRDPGEKSGRRSRSPATAHPHGGGSRSALGRTPSVRKSGKSSPVRAATAPAASPQRVEEESNIIDGKFTTHSESLENPLVSLECFIFL

TrEMBL top hitse value%identityAlignment
A0A0A0KLE9 Uncharacterized protein1.7e-7261.09Show/hide
Query:  MGCCVSSG---NSAHKFDRNSAAADEKIYESREPPSSMEEETVKEVLTETPALKPPPPPKNHPPDEDEAPKPVADKVKEEENEIEKKIRVIPTNCVAEHA
        MGCC+SS    +S +KF  NS      +  SR+PPSSMEEETVKEVL+ETPALK PP   N  P++DE  KP+ D       EIEKK+  IP N + E  
Subjt:  MGCCVSSG---NSAHKFDRNSAAADEKIYESREPPSSMEEETVKEVLTETPALKPPPPPKNHPPDEDEAPKPVADKVKEEENEIEKKIRVIPTNCVAEHA

Query:  CEFSEISSPSECL--SAATFTDKMDDGEEVHQRVFRASPVKLPKNQSFSGDV--KREMLPNRALNRRSDQSPVRRNGVVGSARLAQNRDMNGPAMVRRSL
         EF EIS  ++C+  SAATFTD+ D G EVHQ V ++SPVKL KNQS S DV  KRE+  +R L RRSDQSPVRRNG VGS R+  NRDM+ PAM RR L
Subjt:  CEFSEISSPSECL--SAATFTDKMDDGEEVHQRVFRASPVKLPKNQSFSGDV--KREMLPNRALNRRSDQSPVRRNGVVGSARLAQNRDMNGPAMVRRSL

Query:  RAEPPRRDPGEKSGRRSRSPATAHPHGGGSRSALGRTPSVRKSGKSSPVRAATAPAASPQRVEEESNIIDGKFTTHSESLENPLVSLECFIFL
        RAEPPRRDP E S RRS SP+TA     G RSAL RTPS RKSGKSSP+   TA  A+ Q+V EE+NI+DGKF T  ESLENPLVSLECFIFL
Subjt:  RAEPPRRDPGEKSGRRSRSPATAHPHGGGSRSALGRTPSVRKSGKSSPVRAATAPAASPQRVEEESNIIDGKFTTHSESLENPLVSLECFIFL

A0A1S3B2L5 uncharacterized protein LOC1034853125.2e-7462.12Show/hide
Query:  MGCCVSSG---NSAHKFDRNSAAADEKIYESREPPSSMEEETVKEVLTETPALKPPPPPKNHPPDEDEAPKPVADKVKEEENEIEKKIRVIPTNCVAEHA
        MGCC+SS    NS +KF  +S  A      +R+PPSSMEEETVKEVL+ETPALK PPP KN PP+EDE  KP+ D       E EKK+  IP N + E  
Subjt:  MGCCVSSG---NSAHKFDRNSAAADEKIYESREPPSSMEEETVKEVLTETPALKPPPPPKNHPPDEDEAPKPVADKVKEEENEIEKKIRVIPTNCVAEHA

Query:  CEFSEISSPSECL--SAATFTDKMDDGEEVHQRVFRASPVKLPKNQSFSGDV--KREMLPNRALNRRSDQSPVRRNGVVGSARLAQNRDMNGPAMVRRSL
         EF EIS  ++C+  SAATFTD+ D G EVHQ   ++SPVKL KNQS S DV  KRE+  +R L RRSDQSPVRRNG VGS R+  NRDM+ PAM RR L
Subjt:  CEFSEISSPSECL--SAATFTDKMDDGEEVHQRVFRASPVKLPKNQSFSGDV--KREMLPNRALNRRSDQSPVRRNGVVGSARLAQNRDMNGPAMVRRSL

Query:  RAEPPRRDPGEKSGRRSRSPATAHPHGGGSRSALGRTPSVRKSGKSSPVRAATAPAASPQRVEEESNIIDGKFTTHSESLENPLVSLECFIFL
        RAEPPRRDP E S RRS+SP+TA     G RSAL RTPS RKSGKSSP+RA T   A+ Q+V EE+NI+DGKF T  ESLENPLVSLECFIFL
Subjt:  RAEPPRRDPGEKSGRRSRSPATAHPHGGGSRSALGRTPSVRKSGKSSPVRAATAPAASPQRVEEESNIIDGKFTTHSESLENPLVSLECFIFL

A0A5D3CNI1 Putative BEST plant protein match is: (TAIR:plant.1) protein5.2e-7462.12Show/hide
Query:  MGCCVSSG---NSAHKFDRNSAAADEKIYESREPPSSMEEETVKEVLTETPALKPPPPPKNHPPDEDEAPKPVADKVKEEENEIEKKIRVIPTNCVAEHA
        MGCC+SS    NS +KF  +S  A      +R+PPSSMEEETVKEVL+ETPALK PPP KN PP+EDE  KP+ D       E EKK+  IP N + E  
Subjt:  MGCCVSSG---NSAHKFDRNSAAADEKIYESREPPSSMEEETVKEVLTETPALKPPPPPKNHPPDEDEAPKPVADKVKEEENEIEKKIRVIPTNCVAEHA

Query:  CEFSEISSPSECL--SAATFTDKMDDGEEVHQRVFRASPVKLPKNQSFSGDV--KREMLPNRALNRRSDQSPVRRNGVVGSARLAQNRDMNGPAMVRRSL
         EF EIS  ++C+  SAATFTD+ D G EVHQ   ++SPVKL KNQS S DV  KRE+  +R L RRSDQSPVRRNG VGS R+  NRDM+ PAM RR L
Subjt:  CEFSEISSPSECL--SAATFTDKMDDGEEVHQRVFRASPVKLPKNQSFSGDV--KREMLPNRALNRRSDQSPVRRNGVVGSARLAQNRDMNGPAMVRRSL

Query:  RAEPPRRDPGEKSGRRSRSPATAHPHGGGSRSALGRTPSVRKSGKSSPVRAATAPAASPQRVEEESNIIDGKFTTHSESLENPLVSLECFIFL
        RAEPPRRDP E S RRS+SP+TA     G RSAL RTPS RKSGKSSP+RA T   A+ Q+V EE+NI+DGKF T  ESLENPLVSLECFIFL
Subjt:  RAEPPRRDPGEKSGRRSRSPATAHPHGGGSRSALGRTPSVRKSGKSSPVRAATAPAASPQRVEEESNIIDGKFTTHSESLENPLVSLECFIFL

A0A6J1CMA7 uncharacterized protein LOC1110124339.2e-95100Show/hide
Query:  MGCCVSSGNSAHKFDRNSAAADEKIYESREPPSSMEEETVKEVLTETPALKPPPPPKNHPPDEDEAPKPVADKVKEEENEIEKKIRVIPTNCVAEHACEF
        MGCCVSSGNSAHKFDRNSAAADEKIYESREPPSSMEEETVKEVLTETPALKPPPPPKNHPPDEDEAPKPVADKVKEEENEIEKKIRVIPTNCVAEHACEF
Subjt:  MGCCVSSGNSAHKFDRNSAAADEKIYESREPPSSMEEETVKEVLTETPALKPPPPPKNHPPDEDEAPKPVADKVKEEENEIEKKIRVIPTNCVAEHACEF

Query:  SEISSPSECLSAATFTDKMDDGEEVHQRVFRASPVKLPKNQSFSGDVKREMLPNRALNRRSDQSPVRRNGVVGSARLAQNRDMN
        SEISSPSECLSAATFTDKMDDGEEVHQRVFRASPVKLPKNQSFSGDVKREMLPNRALNRRSDQSPVRRNGVVGSARLAQNRDMN
Subjt:  SEISSPSECLSAATFTDKMDDGEEVHQRVFRASPVKLPKNQSFSGDVKREMLPNRALNRRSDQSPVRRNGVVGSARLAQNRDMN

A0A6J1EF08 uncharacterized protein LOC1114335674.7e-7562.38Show/hide
Query:  MGCCVSSG---NSAHKFDRNSAAADEKIY------ESREPPSSMEEETVKEVLTETPALKP---PPPPKNHPPDEDEAPKPVADKVKEEENEIEKKIRVI
        MGCCVSSG   +SAHKFD   AAA +KI+       SREPPSSMEEETVKEVL+ET ALKP    PP KN PP+EDEA KPV DKV     EIEKK+  I
Subjt:  MGCCVSSG---NSAHKFDRNSAAADEKIY------ESREPPSSMEEETVKEVLTETPALKP---PPPPKNHPPDEDEAPKPVADKVKEEENEIEKKIRVI

Query:  PTNCVAEHACEFSEISSPSECLSAATFTDKMDDGEEVHQRVFRASPVKLPKNQSFSGDV--KREMLPNRALNRRSDQSPVRRNGVVGSARLAQNRDMNGP
        P N + +   EF EIS+P +  + A FTD MD GEEVHQ V +A    LP NQS  G+V  KR++ PN+ LNRRSDQSPVRRN  VGSARL Q RD + P
Subjt:  PTNCVAEHACEFSEISSPSECLSAATFTDKMDDGEEVHQRVFRASPVKLPKNQSFSGDV--KREMLPNRALNRRSDQSPVRRNGVVGSARLAQNRDMNGP

Query:  AMVRRSLRAEPPRRDPGEKSGRRSRSPATAHPHGGGSRSALGRTPSVRKSGKSSPVRAATAPAASP---QRVEEESNIIDGKFTTHSESLENPLVSLECF
        AM  R LR EP ++DP E  GRRSRSPATA    GGSRSALGRTPSVRKSGKSSP+R  TA   +P   ++V EE+NI DG   T  ESLENPLVSLECF
Subjt:  AMVRRSLRAEPPRRDPGEKSGRRSRSPATAHPHGGGSRSALGRTPSVRKSGKSSPVRAATAPAASP---QRVEEESNIIDGKFTTHSESLENPLVSLECF

Query:  IFL
        IFL
Subjt:  IFL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G11125.1 unknown protein1.6e-0630.94Show/hide
Query:  MGCCVSSGNSAHKFDRNSAAADEKIYESREPPSSMEEET-VKEVLTETPALKPPPPPKNHPPDEDEAPKPVADKVKEEENE--------IEKKIRVIPTN
        MGCC+SS  +  K D  S         +  PPS ++EET VKEVL+ET  L           D     K    K++EEE +         ++ +   P+ 
Subjt:  MGCCVSSGNSAHKFDRNSAAADEKIYESREPPSSMEEET-VKEVLTETPALKPPPPPKNHPPDEDEAPKPVADKVKEEENE--------IEKKIRVIPTN

Query:  CVAEHACEFSEISSPSECLSAATFTDKMDDGEEVHQR----VFRASPVKLPKNQSFSGDVKREMLPNRALNRRSDQSPVRRNG--VVGSARLAQNRDMNG
           E   E SE  S SE  S  +  +K D+ EEV Q     V + SP K            R  +     NRR+D SP +RN     GS RL        
Subjt:  CVAEHACEFSEISSPSECLSAATFTDKMDDGEEVHQR----VFRASPVKLPKNQSFSGDVKREMLPNRALNRRSDQSPVRRNG--VVGSARLAQNRDMNG

Query:  PAMVRRSLRAEPPRRDPGEKSGRRSRSPATAHPHGGGSRSALGRTPS--VRKSGKSSPVRAATAPAASPQRVEEESNIIDGKFTTH----SESLENPLVS
               + +    RD  E+S RRSRSPA       G   +   T S    +    SP R    P  +    E   +   G    +    ++S ENPLVS
Subjt:  PAMVRRSLRAEPPRRDPGEKSGRRSRSPATAHPHGGGSRSALGRTPS--VRKSGKSSPVRAATAPAASPQRVEEESNIIDGKFTTH----SESLENPLVS

Query:  LECFIFL
        LECFIFL
Subjt:  LECFIFL

AT1G61170.1 unknown protein5.3e-1032.88Show/hide
Query:  CCVSSGNSAHKFDRNSAAADEKIYESREPPSSMEEET-VKEVLTETPALKPPPPPKNHPPDEDEAPKPVADKVKEEENE-------IEKKIRVIPTNCVA
        CCVSSG +    DR +    +K        + +EEET VKEVL+ET    P    +           PV  K++E+E +           + + P +   
Subjt:  CCVSSGNSAHKFDRNSAAADEKIYESREPPSSMEEET-VKEVLTETPALKPPPPPKNHPPDEDEAPKPVADKVKEEENE-------IEKKIRVIPTNCVA

Query:  EHACEFSEISSPSECLSAATFTDKMDDGEEVHQRVFRASPVKLPKNQSFSGDVKREMLPNRALNRRSDQSPVRRNGVVGSARLAQNRDMNGPAMVRRSLR
        E   E SEI S S   S ++ T  M+  +E H        +K  K+Q      + ++  N    RR+DQSP +RN          N   NG        R
Subjt:  EHACEFSEISSPSECLSAATFTDKMDDGEEVHQRVFRASPVKLPKNQSFSGDVKREMLPNRALNRRSDQSPVRRNGVVGSARLAQNRDMNGPAMVRRSLR

Query:  AEPPRRDPGEKSGRRSRSPATAHPHGGGSRSALGRTPSVRKSGKSSPVRAATAPAASPQRVEEESNIIDGKFTTHSESLENPLVSLECFIFL
             RDPGE+SGRRSRSPAT       ++S+       RK+ + SP R    PA +    ++  N     +TT  E LENPLVSLECFIFL
Subjt:  AEPPRRDPGEKSGRRSRSPATAHPHGGGSRSALGRTPSVRKSGKSSPVRAATAPAASPQRVEEESNIIDGKFTTHSESLENPLVSLECFIFL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGTTGCTGTGTTAGTTCCGGAAATTCAGCTCACAAATTCGATCGCAATTCTGCGGCGGCAGACGAGAAGATTTATGAAAGCAGAGAACCGCCGTCTTCCATGGAGGA
AGAGACCGTGAAAGAAGTGCTCACTGAAACTCCTGCTCTGAAACCACCGCCGCCACCCAAGAATCACCCGCCGGACGAAGATGAAGCCCCAAAACCAGTCGCCGATAAGG
TCAAGGAGGAAGAAAACGAGATCGAGAAGAAAATTCGAGTAATTCCCACTAATTGTGTTGCAGAACACGCTTGTGAATTCTCTGAAATTTCCAGTCCGAGCGAGTGTCTC
TCCGCAGCCACTTTCACCGATAAAATGGACGACGGCGAGGAGGTTCATCAGAGGGTTTTCAGAGCATCGCCGGTGAAATTGCCGAAGAATCAATCATTTTCCGGGGATGT
AAAAAGAGAAATGTTGCCGAACAGAGCACTTAACCGGAGATCCGACCAGTCGCCGGTTCGACGAAACGGCGTCGTGGGGTCGGCGAGATTGGCTCAGAACAGAGACATGA
ATGGCCCGGCAATGGTGCGGCGGAGCTTGAGGGCGGAGCCTCCCCGGAGAGACCCAGGTGAAAAATCTGGCCGGAGATCCAGGTCGCCCGCCACCGCACATCCCCACGGC
GGAGGGTCTAGATCCGCCTTGGGGCGGACCCCGTCGGTGAGGAAGTCCGGGAAATCGTCGCCGGTTCGGGCGGCGACGGCACCGGCGGCCAGTCCTCAAAGAGTAGAAGA
AGAATCCAACATTATCGATGGGAAATTCACCACTCACAGCGAGTCATTGGAGAACCCTCTGGTTTCCTTAGAGTGCTTTATTTTCCTCTGA
mRNA sequenceShow/hide mRNA sequence
ATGGGTTGCTGTGTTAGTTCCGGAAATTCAGCTCACAAATTCGATCGCAATTCTGCGGCGGCAGACGAGAAGATTTATGAAAGCAGAGAACCGCCGTCTTCCATGGAGGA
AGAGACCGTGAAAGAAGTGCTCACTGAAACTCCTGCTCTGAAACCACCGCCGCCACCCAAGAATCACCCGCCGGACGAAGATGAAGCCCCAAAACCAGTCGCCGATAAGG
TCAAGGAGGAAGAAAACGAGATCGAGAAGAAAATTCGAGTAATTCCCACTAATTGTGTTGCAGAACACGCTTGTGAATTCTCTGAAATTTCCAGTCCGAGCGAGTGTCTC
TCCGCAGCCACTTTCACCGATAAAATGGACGACGGCGAGGAGGTTCATCAGAGGGTTTTCAGAGCATCGCCGGTGAAATTGCCGAAGAATCAATCATTTTCCGGGGATGT
AAAAAGAGAAATGTTGCCGAACAGAGCACTTAACCGGAGATCCGACCAGTCGCCGGTTCGACGAAACGGCGTCGTGGGGTCGGCGAGATTGGCTCAGAACAGAGACATGA
ATGGCCCGGCAATGGTGCGGCGGAGCTTGAGGGCGGAGCCTCCCCGGAGAGACCCAGGTGAAAAATCTGGCCGGAGATCCAGGTCGCCCGCCACCGCACATCCCCACGGC
GGAGGGTCTAGATCCGCCTTGGGGCGGACCCCGTCGGTGAGGAAGTCCGGGAAATCGTCGCCGGTTCGGGCGGCGACGGCACCGGCGGCCAGTCCTCAAAGAGTAGAAGA
AGAATCCAACATTATCGATGGGAAATTCACCACTCACAGCGAGTCATTGGAGAACCCTCTGGTTTCCTTAGAGTGCTTTATTTTCCTCTGA
Protein sequenceShow/hide protein sequence
MGCCVSSGNSAHKFDRNSAAADEKIYESREPPSSMEEETVKEVLTETPALKPPPPPKNHPPDEDEAPKPVADKVKEEENEIEKKIRVIPTNCVAEHACEFSEISSPSECL
SAATFTDKMDDGEEVHQRVFRASPVKLPKNQSFSGDVKREMLPNRALNRRSDQSPVRRNGVVGSARLAQNRDMNGPAMVRRSLRAEPPRRDPGEKSGRRSRSPATAHPHG
GGSRSALGRTPSVRKSGKSSPVRAATAPAASPQRVEEESNIIDGKFTTHSESLENPLVSLECFIFL