; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS019732 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS019732
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionUnknown protein
Genome locationscaffold729:1837614..1842875
RNA-Seq ExpressionMS019732
SyntenyMS019732
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6580546.1 hypothetical protein SDJN03_20548, partial [Cucurbita argyrosperma subsp. sororia]3.9e-10877.14Show/hide
Query:  MASPITYSAIDDKDFDDAALWAVIDSAAAAAAASSSSSSKSRKSLAVNYPSKSNPSPPPRFPKSPRTPYQAQRNSRAFPEGEVVHEPWVFQPPRKIARTC
        MA PITYSAIDDKDFDDAALWAVIDS AAAAA SSSSSS SRKSLA+N  +KSNPSPPP+FPKSPRTP+QAQ+NSR F EGEVVHEPWVFQPPRKIARTC
Subjt:  MASPITYSAIDDKDFDDAALWAVIDSAAAAAAASSSSSSKSRKSLAVNYPSKSNPSPPPRFPKSPRTPYQAQRNSRAFPEGEVVHEPWVFQPPRKIARTC

Query:  VSEVSESSPLALVGNNTLRTPPAPVYLSPEAYLSPQIASCSEGSPAVSGSGLNEEKEITRHSLSGRFPSVSLFKEYQNAAMA------------------
         SE+SE SPLA+V NN LR PPAPVYLSPEAYLSPQIAS SEGSPA SGSGL EE+E+ RHSLSG+FPSVSLFKEYQNAAMA                  
Subjt:  VSEVSESSPLALVGNNTLRTPPAPVYLSPEAYLSPQIASCSEGSPAVSGSGLNEEKEITRHSLSGRFPSVSLFKEYQNAAMA------------------

Query:  -----------------DKTIEFDENRNVQRAEFVVRAYMQGGRFCDGWGSCERREKRFSKPNHDIPSTAETRAKNKACQ
                         DKTIEFD+NRNVQRAEFVVRAYMQGGRFCDGWGSCERREKRF KPNHDIPSTAETRAKNKACQ
Subjt:  -----------------DKTIEFDENRNVQRAEFVVRAYMQGGRFCDGWGSCERREKRFSKPNHDIPSTAETRAKNKACQ

XP_022145573.1 uncharacterized protein LOC111014994 [Momordica charantia]2.5e-12386.52Show/hide
Query:  MASPITYSAIDDKDFDDAALWAVIDS--AAAAAAASSSSSSKSRKSLAVNYPSKSNPSPPPRFPKSPRTPYQAQRNSRAFPEGEVVHEPWVFQPPRKIAR
        MASPITYSAIDDKDFDDAALWAVIDS  AAAAAAASSSSSSKSRKSLAVNYPSKSNPSPPPRFPKSPRTPYQAQRNSRAFPEGEVVHEPWVFQPPRKIAR
Subjt:  MASPITYSAIDDKDFDDAALWAVIDS--AAAAAAASSSSSSKSRKSLAVNYPSKSNPSPPPRFPKSPRTPYQAQRNSRAFPEGEVVHEPWVFQPPRKIAR

Query:  TCVSEVSESSPLALVGNNTLRTPPAPVYLSPEAYLSPQIASCSEGSPAVSGSGLNEEKEITRHSLSGRFPSVSLFKEYQNAAMA----------------
        TCVSEVSESSPLALVGNNTLRTPPAPVYLSPEAYLSPQIAS SEGSPAVSGSGLNEEKEITRHSLSGRFPSVSLFKEYQNAAMA                
Subjt:  TCVSEVSESSPLALVGNNTLRTPPAPVYLSPEAYLSPQIASCSEGSPAVSGSGLNEEKEITRHSLSGRFPSVSLFKEYQNAAMA----------------

Query:  -------------------DKTIEFDENRNVQRAEFVVRAYMQGGRFCDGWGSCERREKRFSKPNHDIPSTAETRAKNKACQ
                           DKTIEFDENRNVQRAEFVVRAYMQGGRFCDGWGSCERREKRFSKPNHDIPSTAETRAKNKACQ
Subjt:  -------------------DKTIEFDENRNVQRAEFVVRAYMQGGRFCDGWGSCERREKRFSKPNHDIPSTAETRAKNKACQ

XP_022934850.1 uncharacterized protein LOC111441889 [Cucurbita moschata]1.5e-10776.79Show/hide
Query:  MASPITYSAIDDKDFDDAALWAVIDSAAAAAAASSSSSSKSRKSLAVNYPSKSNPSPPPRFPKSPRTPYQAQRNSRAFPEGEVVHEPWVFQPPRKIARTC
        MA PI YSAIDDKDFDDAALWAVIDS AAAAA SSSSSS SRKSLA+N  +KSNPSPPP+FPKSPRTP+QAQ+NSR F EGEVVHEPWVFQPPRKIARTC
Subjt:  MASPITYSAIDDKDFDDAALWAVIDSAAAAAAASSSSSSKSRKSLAVNYPSKSNPSPPPRFPKSPRTPYQAQRNSRAFPEGEVVHEPWVFQPPRKIARTC

Query:  VSEVSESSPLALVGNNTLRTPPAPVYLSPEAYLSPQIASCSEGSPAVSGSGLNEEKEITRHSLSGRFPSVSLFKEYQNAAMA------------------
         SE+SE SPLA+V NN LR PPAPVYLSPEAYLSPQIAS SEGSPA SGSGL EE+E+ RHSLSG+FPSVSLFKEYQNAAMA                  
Subjt:  VSEVSESSPLALVGNNTLRTPPAPVYLSPEAYLSPQIASCSEGSPAVSGSGLNEEKEITRHSLSGRFPSVSLFKEYQNAAMA------------------

Query:  -----------------DKTIEFDENRNVQRAEFVVRAYMQGGRFCDGWGSCERREKRFSKPNHDIPSTAETRAKNKACQ
                         DKTIEFD+NRNVQRAEFVVRAYMQGGRFCDGWGSCERREKRF KPNHDIPSTAETRAKNKACQ
Subjt:  -----------------DKTIEFDENRNVQRAEFVVRAYMQGGRFCDGWGSCERREKRFSKPNHDIPSTAETRAKNKACQ

XP_022983918.1 uncharacterized protein LOC111482396 [Cucurbita maxima]8.7e-10876.79Show/hide
Query:  MASPITYSAIDDKDFDDAALWAVIDSAAAAAAASSSSSSKSRKSLAVNYPSKSNPSPPPRFPKSPRTPYQAQRNSRAFPEGEVVHEPWVFQPPRKIARTC
        MA PITYSAIDDKDFDDAALWAVIDS AAAAA SSSSSS SRKSLA+N  +KSNPSPPP+FPKSPRTP+QAQ+NSR F EG+VVHEPWVFQPPRKIARTC
Subjt:  MASPITYSAIDDKDFDDAALWAVIDSAAAAAAASSSSSSKSRKSLAVNYPSKSNPSPPPRFPKSPRTPYQAQRNSRAFPEGEVVHEPWVFQPPRKIARTC

Query:  VSEVSESSPLALVGNNTLRTPPAPVYLSPEAYLSPQIASCSEGSPAVSGSGLNEEKEITRHSLSGRFPSVSLFKEYQNAAMA------------------
         SE+SE SPLA+V NN LR PPAPVYLSPEAYLSPQIAS SEGSPA SGSGL EE+E+ RHSLSG+FPSVSLFKEYQNAAMA                  
Subjt:  VSEVSESSPLALVGNNTLRTPPAPVYLSPEAYLSPQIASCSEGSPAVSGSGLNEEKEITRHSLSGRFPSVSLFKEYQNAAMA------------------

Query:  -----------------DKTIEFDENRNVQRAEFVVRAYMQGGRFCDGWGSCERREKRFSKPNHDIPSTAETRAKNKACQ
                         DKTIEFD+NRNVQRAEFVVRAYMQGGRFCDGWGSCERREKRF KPNHDIPSTAETRAKNKACQ
Subjt:  -----------------DKTIEFDENRNVQRAEFVVRAYMQGGRFCDGWGSCERREKRFSKPNHDIPSTAETRAKNKACQ

XP_023527602.1 uncharacterized protein LOC111790777 [Cucurbita pepo subsp. pepo]6.0e-10976.79Show/hide
Query:  MASPITYSAIDDKDFDDAALWAVIDSAAAAAAASSSSSSKSRKSLAVNYPSKSNPSPPPRFPKSPRTPYQAQRNSRAFPEGEVVHEPWVFQPPRKIARTC
        MA PI YSAIDDKDFDDAALWAVIDSAAAAAA SSSSSS SRKS+A+N  +KSNPSPPP+FPKSPRTP+QAQ+NSR F EGEVVHEPWVFQPPRKIARTC
Subjt:  MASPITYSAIDDKDFDDAALWAVIDSAAAAAAASSSSSSKSRKSLAVNYPSKSNPSPPPRFPKSPRTPYQAQRNSRAFPEGEVVHEPWVFQPPRKIARTC

Query:  VSEVSESSPLALVGNNTLRTPPAPVYLSPEAYLSPQIASCSEGSPAVSGSGLNEEKEITRHSLSGRFPSVSLFKEYQNAAMA------------------
         SE+SE SPLA+V NN LR PPAPVYLSPEAYLSPQIAS SEGSPA SGSGL EE+E+ RHSLSG+FPSVSLFKEYQNAAMA                  
Subjt:  VSEVSESSPLALVGNNTLRTPPAPVYLSPEAYLSPQIASCSEGSPAVSGSGLNEEKEITRHSLSGRFPSVSLFKEYQNAAMA------------------

Query:  -----------------DKTIEFDENRNVQRAEFVVRAYMQGGRFCDGWGSCERREKRFSKPNHDIPSTAETRAKNKACQ
                         DKTIEFD+NRNVQRAEFVVRAYMQGGRFCDGWGSCERREKRF KPNHDIPSTAETRAKNKACQ
Subjt:  -----------------DKTIEFDENRNVQRAEFVVRAYMQGGRFCDGWGSCERREKRFSKPNHDIPSTAETRAKNKACQ

TrEMBL top hitse value%identityAlignment
A0A0A0LD66 Uncharacterized protein1.8e-10676.07Show/hide
Query:  MASPITYSAIDDKDFDDAALWAVIDSAAAAAAASSSSSSKSRKSLAVNYPSKSNPSPPPRFPKSPRTPYQAQRNSRAFPEGEVVHEPWVFQPPRKIARTC
        MASP+TYSAIDDKDFDDAALWAVIDS AAAAAASSSSSSK RKSLA+N  +KSNPSPPP+FPKSP+TPYQAQRNSR F EGEVVHEPWVFQPPRKIA+T 
Subjt:  MASPITYSAIDDKDFDDAALWAVIDSAAAAAAASSSSSSKSRKSLAVNYPSKSNPSPPPRFPKSPRTPYQAQRNSRAFPEGEVVHEPWVFQPPRKIARTC

Query:  VSEVSESSPLALVGNNTLRTPPAPVYLSPEAYLSPQIASCSEGSPAVSGSGLNEEKEITRHSLSGRFPSVSLFKEYQNAAMA------------------
         SEVS+SSPLA+V NN LRTPPAPVYLSPEAYLSPQI S SEGSP  S SG+N E+E++RH LSG+FPSVSLFKEYQNAAMA                  
Subjt:  VSEVSESSPLALVGNNTLRTPPAPVYLSPEAYLSPQIASCSEGSPAVSGSGLNEEKEITRHSLSGRFPSVSLFKEYQNAAMA------------------

Query:  -----------------DKTIEFDENRNVQRAEFVVRAYMQGGRFCDGWGSCERREKRFSKPNHDIPSTAETRAKNKACQ
                         DKTIEFDENRNVQRAEFVVRAYMQGGRFCDGWGSCERREKRF KPNHD+PSTAETRAKNKACQ
Subjt:  -----------------DKTIEFDENRNVQRAEFVVRAYMQGGRFCDGWGSCERREKRFSKPNHDIPSTAETRAKNKACQ

A0A1S3B6Q7 uncharacterized protein LOC1034863903.0e-10676.07Show/hide
Query:  MASPITYSAIDDKDFDDAALWAVIDSAAAAAAASSSSSSKSRKSLAVNYPSKSNPSPPPRFPKSPRTPYQAQRNSRAFPEGEVVHEPWVFQPPRKIARTC
        MASPITYS IDDKDFDDAALWAVIDS AAAAA+SSSSSSKSRKSLA+N  +KSNPSPPP+FPKSPRTPYQAQRNSR F EGEVV EPWVFQPPRKIA+T 
Subjt:  MASPITYSAIDDKDFDDAALWAVIDSAAAAAAASSSSSSKSRKSLAVNYPSKSNPSPPPRFPKSPRTPYQAQRNSRAFPEGEVVHEPWVFQPPRKIARTC

Query:  VSEVSESSPLALVGNNTLRTPPAPVYLSPEAYLSPQIASCSEGSPAVSGSGLNEEKEITRHSLSGRFPSVSLFKEYQNAAMA------------------
         +EVS+SSPLA+V NN LRTPPAPVYLSPEAYLSPQI S SEGSP  S SG+NEE+E+++HSLSG+FPSVSLFKEYQNAAMA                  
Subjt:  VSEVSESSPLALVGNNTLRTPPAPVYLSPEAYLSPQIASCSEGSPAVSGSGLNEEKEITRHSLSGRFPSVSLFKEYQNAAMA------------------

Query:  -----------------DKTIEFDENRNVQRAEFVVRAYMQGGRFCDGWGSCERREKRFSKPNHDIPSTAETRAKNKACQ
                         DKTIEFDENRNVQRAEFVVRAYMQGGRFCDGWGSCERREKRF KPNHD+PSTAETRAKNKACQ
Subjt:  -----------------DKTIEFDENRNVQRAEFVVRAYMQGGRFCDGWGSCERREKRFSKPNHDIPSTAETRAKNKACQ

A0A6J1CWB4 uncharacterized protein LOC1110149941.2e-12386.52Show/hide
Query:  MASPITYSAIDDKDFDDAALWAVIDS--AAAAAAASSSSSSKSRKSLAVNYPSKSNPSPPPRFPKSPRTPYQAQRNSRAFPEGEVVHEPWVFQPPRKIAR
        MASPITYSAIDDKDFDDAALWAVIDS  AAAAAAASSSSSSKSRKSLAVNYPSKSNPSPPPRFPKSPRTPYQAQRNSRAFPEGEVVHEPWVFQPPRKIAR
Subjt:  MASPITYSAIDDKDFDDAALWAVIDS--AAAAAAASSSSSSKSRKSLAVNYPSKSNPSPPPRFPKSPRTPYQAQRNSRAFPEGEVVHEPWVFQPPRKIAR

Query:  TCVSEVSESSPLALVGNNTLRTPPAPVYLSPEAYLSPQIASCSEGSPAVSGSGLNEEKEITRHSLSGRFPSVSLFKEYQNAAMA----------------
        TCVSEVSESSPLALVGNNTLRTPPAPVYLSPEAYLSPQIAS SEGSPAVSGSGLNEEKEITRHSLSGRFPSVSLFKEYQNAAMA                
Subjt:  TCVSEVSESSPLALVGNNTLRTPPAPVYLSPEAYLSPQIASCSEGSPAVSGSGLNEEKEITRHSLSGRFPSVSLFKEYQNAAMA----------------

Query:  -------------------DKTIEFDENRNVQRAEFVVRAYMQGGRFCDGWGSCERREKRFSKPNHDIPSTAETRAKNKACQ
                           DKTIEFDENRNVQRAEFVVRAYMQGGRFCDGWGSCERREKRFSKPNHDIPSTAETRAKNKACQ
Subjt:  -------------------DKTIEFDENRNVQRAEFVVRAYMQGGRFCDGWGSCERREKRFSKPNHDIPSTAETRAKNKACQ

A0A6J1F8X3 uncharacterized protein LOC1114418897.2e-10876.79Show/hide
Query:  MASPITYSAIDDKDFDDAALWAVIDSAAAAAAASSSSSSKSRKSLAVNYPSKSNPSPPPRFPKSPRTPYQAQRNSRAFPEGEVVHEPWVFQPPRKIARTC
        MA PI YSAIDDKDFDDAALWAVIDS AAAAA SSSSSS SRKSLA+N  +KSNPSPPP+FPKSPRTP+QAQ+NSR F EGEVVHEPWVFQPPRKIARTC
Subjt:  MASPITYSAIDDKDFDDAALWAVIDSAAAAAAASSSSSSKSRKSLAVNYPSKSNPSPPPRFPKSPRTPYQAQRNSRAFPEGEVVHEPWVFQPPRKIARTC

Query:  VSEVSESSPLALVGNNTLRTPPAPVYLSPEAYLSPQIASCSEGSPAVSGSGLNEEKEITRHSLSGRFPSVSLFKEYQNAAMA------------------
         SE+SE SPLA+V NN LR PPAPVYLSPEAYLSPQIAS SEGSPA SGSGL EE+E+ RHSLSG+FPSVSLFKEYQNAAMA                  
Subjt:  VSEVSESSPLALVGNNTLRTPPAPVYLSPEAYLSPQIASCSEGSPAVSGSGLNEEKEITRHSLSGRFPSVSLFKEYQNAAMA------------------

Query:  -----------------DKTIEFDENRNVQRAEFVVRAYMQGGRFCDGWGSCERREKRFSKPNHDIPSTAETRAKNKACQ
                         DKTIEFD+NRNVQRAEFVVRAYMQGGRFCDGWGSCERREKRF KPNHDIPSTAETRAKNKACQ
Subjt:  -----------------DKTIEFDENRNVQRAEFVVRAYMQGGRFCDGWGSCERREKRFSKPNHDIPSTAETRAKNKACQ

A0A6J1J926 uncharacterized protein LOC1114823964.2e-10876.79Show/hide
Query:  MASPITYSAIDDKDFDDAALWAVIDSAAAAAAASSSSSSKSRKSLAVNYPSKSNPSPPPRFPKSPRTPYQAQRNSRAFPEGEVVHEPWVFQPPRKIARTC
        MA PITYSAIDDKDFDDAALWAVIDS AAAAA SSSSSS SRKSLA+N  +KSNPSPPP+FPKSPRTP+QAQ+NSR F EG+VVHEPWVFQPPRKIARTC
Subjt:  MASPITYSAIDDKDFDDAALWAVIDSAAAAAAASSSSSSKSRKSLAVNYPSKSNPSPPPRFPKSPRTPYQAQRNSRAFPEGEVVHEPWVFQPPRKIARTC

Query:  VSEVSESSPLALVGNNTLRTPPAPVYLSPEAYLSPQIASCSEGSPAVSGSGLNEEKEITRHSLSGRFPSVSLFKEYQNAAMA------------------
         SE+SE SPLA+V NN LR PPAPVYLSPEAYLSPQIAS SEGSPA SGSGL EE+E+ RHSLSG+FPSVSLFKEYQNAAMA                  
Subjt:  VSEVSESSPLALVGNNTLRTPPAPVYLSPEAYLSPQIASCSEGSPAVSGSGLNEEKEITRHSLSGRFPSVSLFKEYQNAAMA------------------

Query:  -----------------DKTIEFDENRNVQRAEFVVRAYMQGGRFCDGWGSCERREKRFSKPNHDIPSTAETRAKNKACQ
                         DKTIEFD+NRNVQRAEFVVRAYMQGGRFCDGWGSCERREKRF KPNHDIPSTAETRAKNKACQ
Subjt:  -----------------DKTIEFDENRNVQRAEFVVRAYMQGGRFCDGWGSCERREKRFSKPNHDIPSTAETRAKNKACQ

SwissProt top hitse value%identityAlignment
Q6DR24 Uncharacterized protein At3g179501.9e-0471.43Show/hide
Query:  SPSSFTDSSSDLDTESTGSFYHDKSITLGSLIGVS
        SP+  + SSSDLDTESTGSF+HD+SITLG+L+G S
Subjt:  SPSSFTDSSSDLDTESTGSFYHDKSITLGSLIGVS

Arabidopsis top hitse value%identityAlignment
AT3G09430.1 unknown protein1.5e-4945.93Show/hide
Query:  SPITYSAIDDKDFDDAALWAVIDSAAAAAAASSSSSSKSRKSLAVNYPSKSNPSPPPRFPKSPRTPYQAQRNSRAFPEGEVVHEPWVFQPPRKIART-CV
        S ++    D+KD DDA LWAVIDSAAAAA    + + KS K LA+ YP+ ++P  P  +P SP++    Q  +R    G  ++E      P K+AR+  +
Subjt:  SPITYSAIDDKDFDDAALWAVIDSAAAAAAASSSSSSKSRKSLAVNYPSKSNPSPPPRFPKSPRTPYQAQRNSRAFPEGEVVHEPWVFQPPRKIART-CV

Query:  SEVSESSPLALV----GNNTLRTPPAPVYLSPEAYLSPQI------ASCSEGSPAVSGSGLNEEKEITRHSLSGRFPSVSLFKEYQNAAMA---------
        SEV   +P+ALV     N+T     +  + SPE+YLSP I      A  S  +  V    +NE     RHSLSG FPS +LFKEYQN AMA         
Subjt:  SEVSESSPLALV----GNNTLRTPPAPVYLSPEAYLSPQI------ASCSEGSPAVSGSGLNEEKEITRHSLSGRFPSVSLFKEYQNAAMA---------

Query:  --------------------------DKTIEFDENRNVQRAEFVVRAYMQGGRFCDGWGSCERREKRFSKPNHDIPSTAETRAKNKACQYENILSLSRPF
                                  DKTIEFDENRNVQRAEF+VRA M GGRF DGWGSCERREK+F KPNHDIPSTAETRAKN+ACQ  ++L +    
Subjt:  --------------------------DKTIEFDENRNVQRAEFVVRAYMQGGRFCDGWGSCERREKRFSKPNHDIPSTAETRAKNKACQYENILSLSRPF

Query:  LPPAPLP
          PA LP
Subjt:  LPPAPLP

AT3G17950.1 unknown protein1.3e-0571.43Show/hide
Query:  SPSSFTDSSSDLDTESTGSFYHDKSITLGSLIGVS
        SP+  + SSSDLDTESTGSF+HD+SITLG+L+G S
Subjt:  SPSSFTDSSSDLDTESTGSFYHDKSITLGSLIGVS

AT5G02440.1 unknown protein7.5e-1740.7Show/hide
Query:  QCHGWPLGLRLMNARVG--------LAGNRDLSA-SVSFNTLRTHSPSSFTDSSSDLDTESTGSFYHDKSITLGSLIGVSSILELSRRSAKGNKVETLED
        Q  GWPLGLR +NAR+G           +  +SA S+SF++L + SPSS   SSSDLD++S GSF+ D+S TLG+LIG+SS LELSRRS +    +T   
Subjt:  QCHGWPLGLRLMNARVG--------LAGNRDLSA-SVSFNTLRTHSPSSFTDSSSDLDTESTGSFYHDKSITLGSLIGVSSILELSRRSAKGNKVETLED

Query:  KK-------KDKSKPWLFSLSLCIKLRPDAVSL-------------RSSPSLEHSLAAERRATRNHRIQNPT
        +        K   KPW+F  S+C KL  +A  +              +  SL H L  ERRA  +     PT
Subjt:  KK-------KDKSKPWLFSLSLCIKLRPDAVSL-------------RSSPSLEHSLAAERRATRNHRIQNPT


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGTCCCCAATCACGTATTCCGCCATCGACGACAAAGATTTCGACGACGCTGCTTTATGGGCGGTAATAGACTCTGCTGCGGCTGCGGCTGCGGCTTCTTCCTCCTC
TTCCTCCAAATCTCGGAAATCTCTAGCAGTTAATTACCCCAGTAAATCAAATCCTTCCCCGCCGCCTAGGTTTCCGAAAAGCCCTAGAACTCCGTACCAGGCGCAGAGGA
ATTCTAGGGCTTTTCCAGAGGGTGAGGTGGTGCACGAGCCTTGGGTGTTTCAACCTCCTCGGAAGATTGCAAGGACGTGTGTATCGGAAGTGAGTGAGAGCAGTCCTCTT
GCACTCGTCGGTAACAACACGCTACGGACACCGCCAGCGCCGGTATATTTGTCTCCTGAAGCGTACTTGTCGCCGCAGATTGCTTCTTGTTCTGAGGGCTCACCGGCTGT
TAGTGGAAGTGGACTGAACGAGGAGAAGGAAATTACAAGGCATAGCCTCTCTGGGCGTTTCCCTTCAGTCTCTCTCTTTAAGGAGTATCAGAATGCGGCGATGGCGGACA
AGACAATTGAATTTGACGAAAACCGCAACGTCCAGCGTGCTGAGTTTGTTGTTCGAGCATATATGCAAGGTGGTAGATTTTGTGATGGATGGGGCTCGTGTGAACGGCGT
GAGAAGAGATTTTCCAAACCAAATCATGATATTCCTAGCACAGCAGAAACCAGGGCCAAGAATAAGGCATGCCAATACGAAAATATTCTCTCGCTCTCACGTCCTTTCCT
TCCACCAGCTCCCCTCCCATGTCTGCTCCACAGAAACTGCATTCTTAGCATGAACTTGCAGGGCCAATGCCATGGCTGGCCATTAGGGTTGCGGCTTATGAACGCCAGAG
TTGGATTGGCGGGAAATCGCGATCTCTCTGCATCGGTTTCCTTCAATACTCTGCGCACACATTCTCCCAGCTCATTCACCGACTCTTCCTCCGATCTTGATACCGAGTCA
ACTGGGTCGTTCTACCATGACAAGAGCATCACGCTTGGGAGTCTAATTGGTGTGTCCAGCATTCTGGAGCTATCACGAAGATCAGCAAAAGGAAACAAGGTGGAAACACT
TGAAGACAAGAAGAAGGACAAGTCCAAGCCATGGCTGTTTTCTTTATCTTTGTGCATTAAGCTCCGCCCTGATGCTGTGAGTTTGAGGAGCTCTCCTTCACTTGAGCACT
CGCTTGCTGCAGAGAGGAGAGCTACAAGAAATCACAGGATTCAGAACCCTACCATGTATGGACCCAACGATTTCTCACCGATTCGTCCTGTTTCAGGAACAAACTCTCTA
TTTTCCGGTGACCAAGTTGCTCCCATGTCATTTGCACCGGTGGTTGAAGAGGGAGCGAGAAAATCAAATGGAGAACTTGGTAACAGTCAAGGGGTTCCTCTTCTCTTTTC
ATGTCTATATTGCCAACTGATTAAA
mRNA sequenceShow/hide mRNA sequence
ATGGCGTCCCCAATCACGTATTCCGCCATCGACGACAAAGATTTCGACGACGCTGCTTTATGGGCGGTAATAGACTCTGCTGCGGCTGCGGCTGCGGCTTCTTCCTCCTC
TTCCTCCAAATCTCGGAAATCTCTAGCAGTTAATTACCCCAGTAAATCAAATCCTTCCCCGCCGCCTAGGTTTCCGAAAAGCCCTAGAACTCCGTACCAGGCGCAGAGGA
ATTCTAGGGCTTTTCCAGAGGGTGAGGTGGTGCACGAGCCTTGGGTGTTTCAACCTCCTCGGAAGATTGCAAGGACGTGTGTATCGGAAGTGAGTGAGAGCAGTCCTCTT
GCACTCGTCGGTAACAACACGCTACGGACACCGCCAGCGCCGGTATATTTGTCTCCTGAAGCGTACTTGTCGCCGCAGATTGCTTCTTGTTCTGAGGGCTCACCGGCTGT
TAGTGGAAGTGGACTGAACGAGGAGAAGGAAATTACAAGGCATAGCCTCTCTGGGCGTTTCCCTTCAGTCTCTCTCTTTAAGGAGTATCAGAATGCGGCGATGGCGGACA
AGACAATTGAATTTGACGAAAACCGCAACGTCCAGCGTGCTGAGTTTGTTGTTCGAGCATATATGCAAGGTGGTAGATTTTGTGATGGATGGGGCTCGTGTGAACGGCGT
GAGAAGAGATTTTCCAAACCAAATCATGATATTCCTAGCACAGCAGAAACCAGGGCCAAGAATAAGGCATGCCAATACGAAAATATTCTCTCGCTCTCACGTCCTTTCCT
TCCACCAGCTCCCCTCCCATGTCTGCTCCACAGAAACTGCATTCTTAGCATGAACTTGCAGGGCCAATGCCATGGCTGGCCATTAGGGTTGCGGCTTATGAACGCCAGAG
TTGGATTGGCGGGAAATCGCGATCTCTCTGCATCGGTTTCCTTCAATACTCTGCGCACACATTCTCCCAGCTCATTCACCGACTCTTCCTCCGATCTTGATACCGAGTCA
ACTGGGTCGTTCTACCATGACAAGAGCATCACGCTTGGGAGTCTAATTGGTGTGTCCAGCATTCTGGAGCTATCACGAAGATCAGCAAAAGGAAACAAGGTGGAAACACT
TGAAGACAAGAAGAAGGACAAGTCCAAGCCATGGCTGTTTTCTTTATCTTTGTGCATTAAGCTCCGCCCTGATGCTGTGAGTTTGAGGAGCTCTCCTTCACTTGAGCACT
CGCTTGCTGCAGAGAGGAGAGCTACAAGAAATCACAGGATTCAGAACCCTACCATGTATGGACCCAACGATTTCTCACCGATTCGTCCTGTTTCAGGAACAAACTCTCTA
TTTTCCGGTGACCAAGTTGCTCCCATGTCATTTGCACCGGTGGTTGAAGAGGGAGCGAGAAAATCAAATGGAGAACTTGGTAACAGTCAAGGGGTTCCTCTTCTCTTTTC
ATGTCTATATTGCCAACTGATTAAA
Protein sequenceShow/hide protein sequence
MASPITYSAIDDKDFDDAALWAVIDSAAAAAAASSSSSSKSRKSLAVNYPSKSNPSPPPRFPKSPRTPYQAQRNSRAFPEGEVVHEPWVFQPPRKIARTCVSEVSESSPL
ALVGNNTLRTPPAPVYLSPEAYLSPQIASCSEGSPAVSGSGLNEEKEITRHSLSGRFPSVSLFKEYQNAAMADKTIEFDENRNVQRAEFVVRAYMQGGRFCDGWGSCERR
EKRFSKPNHDIPSTAETRAKNKACQYENILSLSRPFLPPAPLPCLLHRNCILSMNLQGQCHGWPLGLRLMNARVGLAGNRDLSASVSFNTLRTHSPSSFTDSSSDLDTES
TGSFYHDKSITLGSLIGVSSILELSRRSAKGNKVETLEDKKKDKSKPWLFSLSLCIKLRPDAVSLRSSPSLEHSLAAERRATRNHRIQNPTMYGPNDFSPIRPVSGTNSL
FSGDQVAPMSFAPVVEEGARKSNGELGNSQGVPLLFSCLYCQLIK