; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MC01g1480 (gene) of Bitter gourd (Dali-11) v1 genome

Gene IDMC01g1480
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
Description50S ribosomal protein L34, chloroplastic
Genome locationMC01:19228125..19233676
RNA-Seq ExpressionMC01g1480
SyntenyMC01g1480
Gene Ontology termsGO:0006412 - translation (biological process)
GO:0005762 - mitochondrial large ribosomal subunit (cellular component)
GO:0003735 - structural constituent of ribosome (molecular function)
InterPro domainsIPR000271 - Ribosomal protein L34


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0052628.1 50S ribosomal protein L34 [Cucumis melo var. makuwa]3.08e-6880Show/hide
Query:  MAAISVLSTPWVLSGAVAVRVPSASLVISTGSRSCCSVSLNTANNTSARSGLLRCSFLPSSSLSCSSSFSGFSLGLDWNSRIWVGQGKGRSLVVRAGKAA
        MAAISVLS+PWV +GA+AVRVPSASLVISTGSR  CSVSLNTANNTSARSGLL C                 SLGLDW+S++ VGQGKGRSLVVRAGKAA
Subjt:  MAAISVLSTPWVLSGAVAVRVPSASLVISTGSRSCCSVSLNTANNTSARSGLLRCSFLPSSSLSCSSSFSGFSLGLDWNSRIWVGQGKGRSLVVRAGKAA

Query:  LCQTKRNRSRKSLARTHGFRRRMRTTSGRAVLKRRRAKGRKVLCTKSYPSSGKGA
        LCQTKRNRSRKSLARTHGFRRRMRTT+GRAVLKRRRAKGRKVLCTKS PSSGK A
Subjt:  LCQTKRNRSRKSLARTHGFRRRMRTTSGRAVLKRRRAKGRKVLCTKSYPSSGKGA

XP_004134668.1 50S ribosomal protein L34, chloroplastic [Cucumis sativus]1.69e-8088.39Show/hide
Query:  MAAISVLSTPWVLSGAVAVRVPSASLVISTGSRSCCSVSLNTANNTSARSGLLRCSFLPSSSLSCSSSFSGFSLGLDWNSRIWVGQGKGRSLVVRAGKAA
        MAAISVLSTPWV +GA+AVRVPSASLV STGSR  CSVSLNT NN SARSGLL CSFLPSSSLSCSSSFSG SLGLDW+S++ VGQGKGR LVVRAGKAA
Subjt:  MAAISVLSTPWVLSGAVAVRVPSASLVISTGSRSCCSVSLNTANNTSARSGLLRCSFLPSSSLSCSSSFSGFSLGLDWNSRIWVGQGKGRSLVVRAGKAA

Query:  LCQTKRNRSRKSLARTHGFRRRMRTTSGRAVLKRRRAKGRKVLCTKSYPSSGKGA
        LCQTKRNRSRKSLARTHGFRRRMRTT+GRAVLKRRRAKGRKVLCTKS PSSGK A
Subjt:  LCQTKRNRSRKSLARTHGFRRRMRTTSGRAVLKRRRAKGRKVLCTKSYPSSGKGA

XP_008439748.1 PREDICTED: 50S ribosomal protein L34, chloroplastic [Cucumis melo]1.24e-8290.32Show/hide
Query:  MAAISVLSTPWVLSGAVAVRVPSASLVISTGSRSCCSVSLNTANNTSARSGLLRCSFLPSSSLSCSSSFSGFSLGLDWNSRIWVGQGKGRSLVVRAGKAA
        MAAISVLS+PWV +GA+AVRVPSASLVISTGSR  CSVSLNTANNTSARSGLL CSFLPSSSLSCSSSFSG SLGLDW+S++ VGQGKGRSLVVRAGKAA
Subjt:  MAAISVLSTPWVLSGAVAVRVPSASLVISTGSRSCCSVSLNTANNTSARSGLLRCSFLPSSSLSCSSSFSGFSLGLDWNSRIWVGQGKGRSLVVRAGKAA

Query:  LCQTKRNRSRKSLARTHGFRRRMRTTSGRAVLKRRRAKGRKVLCTKSYPSSGKGA
        LCQTKRNRSRKSLARTHGFRRRMRTT+GRAVLKRRRAKGRKVLCTKS PSSGK A
Subjt:  LCQTKRNRSRKSLARTHGFRRRMRTTSGRAVLKRRRAKGRKVLCTKSYPSSGKGA

XP_022142342.1 50S ribosomal protein L34, chloroplastic [Momordica charantia]5.58e-95100Show/hide
Query:  MAAISVLSTPWVLSGAVAVRVPSASLVISTGSRSCCSVSLNTANNTSARSGLLRCSFLPSSSLSCSSSFSGFSLGLDWNSRIWVGQGKGRSLVVRAGKAA
        MAAISVLSTPWVLSGAVAVRVPSASLVISTGSRSCCSVSLNTANNTSARSGLLRCSFLPSSSLSCSSSFSGFSLGLDWNSRIWVGQGKGRSLVVRAGKAA
Subjt:  MAAISVLSTPWVLSGAVAVRVPSASLVISTGSRSCCSVSLNTANNTSARSGLLRCSFLPSSSLSCSSSFSGFSLGLDWNSRIWVGQGKGRSLVVRAGKAA

Query:  LCQTKRNRSRKSLARTHGFRRRMRTTSGRAVLKRRRAKGRKVLCTKSYPSSGKGA
        LCQTKRNRSRKSLARTHGFRRRMRTTSGRAVLKRRRAKGRKVLCTKSYPSSGKGA
Subjt:  LCQTKRNRSRKSLARTHGFRRRMRTTSGRAVLKRRRAKGRKVLCTKSYPSSGKGA

XP_038883793.1 50S ribosomal protein L34, chloroplastic [Benincasa hispida]1.02e-8190.32Show/hide
Query:  MAAISVLSTPWVLSGAVAVRVPSASLVISTGSRSCCSVSLNTANNTSARSGLLRCSFLPSSSLSCSSSFSGFSLGLDWNSRIWVGQGKGRSLVVRAGKAA
        MAAISVLS+PWV +GA AVRVPSASLVISTG R CCSVSLNTANNTSARSGLL CSFL SSSLSCSSSFSG SLGLDW+S+I VGQGKGRSLVVRAGKAA
Subjt:  MAAISVLSTPWVLSGAVAVRVPSASLVISTGSRSCCSVSLNTANNTSARSGLLRCSFLPSSSLSCSSSFSGFSLGLDWNSRIWVGQGKGRSLVVRAGKAA

Query:  LCQTKRNRSRKSLARTHGFRRRMRTTSGRAVLKRRRAKGRKVLCTKSYPSSGKGA
        LCQTKRNRSRKSLARTHGFRRRMRTT+GRAVLKRRRAKGRKVLCTKS PSSGK A
Subjt:  LCQTKRNRSRKSLARTHGFRRRMRTTSGRAVLKRRRAKGRKVLCTKSYPSSGKGA

TrEMBL top hitse value%identityAlignment
A0A0A0KI92 Uncharacterized protein8.17e-8188.39Show/hide
Query:  MAAISVLSTPWVLSGAVAVRVPSASLVISTGSRSCCSVSLNTANNTSARSGLLRCSFLPSSSLSCSSSFSGFSLGLDWNSRIWVGQGKGRSLVVRAGKAA
        MAAISVLSTPWV +GA+AVRVPSASLV STGSR  CSVSLNT NN SARSGLL CSFLPSSSLSCSSSFSG SLGLDW+S++ VGQGKGR LVVRAGKAA
Subjt:  MAAISVLSTPWVLSGAVAVRVPSASLVISTGSRSCCSVSLNTANNTSARSGLLRCSFLPSSSLSCSSSFSGFSLGLDWNSRIWVGQGKGRSLVVRAGKAA

Query:  LCQTKRNRSRKSLARTHGFRRRMRTTSGRAVLKRRRAKGRKVLCTKSYPSSGKGA
        LCQTKRNRSRKSLARTHGFRRRMRTT+GRAVLKRRRAKGRKVLCTKS PSSGK A
Subjt:  LCQTKRNRSRKSLARTHGFRRRMRTTSGRAVLKRRRAKGRKVLCTKSYPSSGKGA

A0A1S3AZG6 50S ribosomal protein L34, chloroplastic6.01e-8390.32Show/hide
Query:  MAAISVLSTPWVLSGAVAVRVPSASLVISTGSRSCCSVSLNTANNTSARSGLLRCSFLPSSSLSCSSSFSGFSLGLDWNSRIWVGQGKGRSLVVRAGKAA
        MAAISVLS+PWV +GA+AVRVPSASLVISTGSR  CSVSLNTANNTSARSGLL CSFLPSSSLSCSSSFSG SLGLDW+S++ VGQGKGRSLVVRAGKAA
Subjt:  MAAISVLSTPWVLSGAVAVRVPSASLVISTGSRSCCSVSLNTANNTSARSGLLRCSFLPSSSLSCSSSFSGFSLGLDWNSRIWVGQGKGRSLVVRAGKAA

Query:  LCQTKRNRSRKSLARTHGFRRRMRTTSGRAVLKRRRAKGRKVLCTKSYPSSGKGA
        LCQTKRNRSRKSLARTHGFRRRMRTT+GRAVLKRRRAKGRKVLCTKS PSSGK A
Subjt:  LCQTKRNRSRKSLARTHGFRRRMRTTSGRAVLKRRRAKGRKVLCTKSYPSSGKGA

A0A5A7UBQ8 50S ribosomal protein L341.49e-6880Show/hide
Query:  MAAISVLSTPWVLSGAVAVRVPSASLVISTGSRSCCSVSLNTANNTSARSGLLRCSFLPSSSLSCSSSFSGFSLGLDWNSRIWVGQGKGRSLVVRAGKAA
        MAAISVLS+PWV +GA+AVRVPSASLVISTGSR  CSVSLNTANNTSARSGLL C                 SLGLDW+S++ VGQGKGRSLVVRAGKAA
Subjt:  MAAISVLSTPWVLSGAVAVRVPSASLVISTGSRSCCSVSLNTANNTSARSGLLRCSFLPSSSLSCSSSFSGFSLGLDWNSRIWVGQGKGRSLVVRAGKAA

Query:  LCQTKRNRSRKSLARTHGFRRRMRTTSGRAVLKRRRAKGRKVLCTKSYPSSGKGA
        LCQTKRNRSRKSLARTHGFRRRMRTT+GRAVLKRRRAKGRKVLCTKS PSSGK A
Subjt:  LCQTKRNRSRKSLARTHGFRRRMRTTSGRAVLKRRRAKGRKVLCTKSYPSSGKGA

A0A5D3CSJ7 50S ribosomal protein L346.01e-8390.32Show/hide
Query:  MAAISVLSTPWVLSGAVAVRVPSASLVISTGSRSCCSVSLNTANNTSARSGLLRCSFLPSSSLSCSSSFSGFSLGLDWNSRIWVGQGKGRSLVVRAGKAA
        MAAISVLS+PWV +GA+AVRVPSASLVISTGSR  CSVSLNTANNTSARSGLL CSFLPSSSLSCSSSFSG SLGLDW+S++ VGQGKGRSLVVRAGKAA
Subjt:  MAAISVLSTPWVLSGAVAVRVPSASLVISTGSRSCCSVSLNTANNTSARSGLLRCSFLPSSSLSCSSSFSGFSLGLDWNSRIWVGQGKGRSLVVRAGKAA

Query:  LCQTKRNRSRKSLARTHGFRRRMRTTSGRAVLKRRRAKGRKVLCTKSYPSSGKGA
        LCQTKRNRSRKSLARTHGFRRRMRTT+GRAVLKRRRAKGRKVLCTKS PSSGK A
Subjt:  LCQTKRNRSRKSLARTHGFRRRMRTTSGRAVLKRRRAKGRKVLCTKSYPSSGKGA

A0A6J1CMH5 50S ribosomal protein L34, chloroplastic2.70e-95100Show/hide
Query:  MAAISVLSTPWVLSGAVAVRVPSASLVISTGSRSCCSVSLNTANNTSARSGLLRCSFLPSSSLSCSSSFSGFSLGLDWNSRIWVGQGKGRSLVVRAGKAA
        MAAISVLSTPWVLSGAVAVRVPSASLVISTGSRSCCSVSLNTANNTSARSGLLRCSFLPSSSLSCSSSFSGFSLGLDWNSRIWVGQGKGRSLVVRAGKAA
Subjt:  MAAISVLSTPWVLSGAVAVRVPSASLVISTGSRSCCSVSLNTANNTSARSGLLRCSFLPSSSLSCSSSFSGFSLGLDWNSRIWVGQGKGRSLVVRAGKAA

Query:  LCQTKRNRSRKSLARTHGFRRRMRTTSGRAVLKRRRAKGRKVLCTKSYPSSGKGA
        LCQTKRNRSRKSLARTHGFRRRMRTTSGRAVLKRRRAKGRKVLCTKSYPSSGKGA
Subjt:  LCQTKRNRSRKSLARTHGFRRRMRTTSGRAVLKRRRAKGRKVLCTKSYPSSGKGA

SwissProt top hitse value%identityAlignment
B5EJJ2 50S ribosomal protein L348.6e-0480Show/hide
Query:  RTHGFRRRMRTTSGRAVLKRRRAKGRKVLC
        RTHGFR RM T SGR VLKRRRAKGR+ LC
Subjt:  RTHGFRRRMRTTSGRAVLKRRRAKGRKVLC

B7J3D6 50S ribosomal protein L348.6e-0480Show/hide
Query:  RTHGFRRRMRTTSGRAVLKRRRAKGRKVLC
        RTHGFR RM T SGR VLKRRRAKGR+ LC
Subjt:  RTHGFRRRMRTTSGRAVLKRRRAKGRKVLC

P82244 50S ribosomal protein L34, chloroplastic1.0e-3364.1Show/hide
Query:  MAAISVLSTPWVLSGAVAVRVPSASLVISTGSRSCCSVSLNTANNTSARSGLLRCSFL-PSSSLSCSSSFSGFSLGLDWNSRIWVGQGKGRSLVVRAGKA
        MA +S+LST   +  A+  R PSASL   TGSR+  +    + N  SARSG L CSFL PSSSL  SS+FSG SLGLD  S   V   + R  VVRAGKA
Subjt:  MAAISVLSTPWVLSGAVAVRVPSASLVISTGSRSCCSVSLNTANNTSARSGLLRCSFL-PSSSLSCSSSFSGFSLGLDWNSRIWVGQGKGRSLVVRAGKA

Query:  ALCQTKRNRSRKSLARTHGFRRRMRTTSGRAVLKRRRAKGRKVLCTKSYPSSGKGA
        ALC TKR+RSRKSLARTHGFR RM TTSGRA+LKRRRAKGRK+LCTK+ PSSGK A
Subjt:  ALCQTKRNRSRKSLARTHGFRRRMRTTSGRAVLKRRRAKGRKVLCTKSYPSSGKGA

Q9LP37 50S ribosomal protein L34, chloroplastic2.5e-2759.35Show/hide
Query:  AAISVLSTPWVLSGAVAVRVPSASLVISTGSRSCCSVSLNTANNTSARSGLLRCSFLPSSSLSCSSSFSGFSLGLDWNSRIWVGQG-KGRSLVVRAGKAA
        +A S L  P   +G +   VPSASL + TG R   S     +  +SA S LL CSFL SS +S +S FSG S+  D +S+     G + R LVVRAGKAA
Subjt:  AAISVLSTPWVLSGAVAVRVPSASLVISTGSRSCCSVSLNTANNTSARSGLLRCSFLPSSSLSCSSSFSGFSLGLDWNSRIWVGQG-KGRSLVVRAGKAA

Query:  LCQTKRNRSRKSLARTHGFRRRMRTTSGRAVLKRRRAKGRKVLCTKSYPSSGKGA
        LCQTKR+RSRKSLARTHGFRRRMRTTSGRA +KRRRAKGR  LC KS PSSGK A
Subjt:  LCQTKRNRSRKSLARTHGFRRRMRTTSGRAVLKRRRAKGRKVLCTKSYPSSGKGA

Arabidopsis top hitse value%identityAlignment
AT1G29070.1 Ribosomal protein L341.8e-2859.35Show/hide
Query:  AAISVLSTPWVLSGAVAVRVPSASLVISTGSRSCCSVSLNTANNTSARSGLLRCSFLPSSSLSCSSSFSGFSLGLDWNSRIWVGQG-KGRSLVVRAGKAA
        +A S L  P   +G +   VPSASL + TG R   S     +  +SA S LL CSFL SS +S +S FSG S+  D +S+     G + R LVVRAGKAA
Subjt:  AAISVLSTPWVLSGAVAVRVPSASLVISTGSRSCCSVSLNTANNTSARSGLLRCSFLPSSSLSCSSSFSGFSLGLDWNSRIWVGQG-KGRSLVVRAGKAA

Query:  LCQTKRNRSRKSLARTHGFRRRMRTTSGRAVLKRRRAKGRKVLCTKSYPSSGKGA
        LCQTKR+RSRKSLARTHGFRRRMRTTSGRA +KRRRAKGR  LC KS PSSGK A
Subjt:  LCQTKRNRSRKSLARTHGFRRRMRTTSGRAVLKRRRAKGRKVLCTKSYPSSGKGA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCAGCGATATCAGTGTTATCGACCCCATGGGTTTTAAGCGGAGCTGTGGCTGTCCGCGTTCCCTCAGCTTCTCTTGTAATCTCCACTGGCTCTAGAAGTTGCTGCTC
CGTATCTCTCAACACTGCAAACAACACTTCCGCTCGTTCTGGATTGCTCCGTTGTTCGTTTCTTCCTTCTTCCTCCCTTTCATGCTCTTCATCTTTTTCAGGTTTTTCTT
TGGGTTTGGACTGGAACTCCAGAATTTGGGTTGGACAAGGAAAGGGTCGGAGCTTAGTGGTTAGGGCTGGCAAGGCTGCACTATGTCAGACTAAAAGAAACAGGTCGCGT
AAATCCTTAGCTCGGACTCATGGTTTTCGCCGACGAATGCGGACCACCAGTGGTAGAGCTGTTCTTAAGCGCAGGCGTGCCAAGGGGAGGAAGGTCCTCTGCACGAAGTC
TTACCCTAGTAGCGGAAAGGGTGCTTAA
mRNA sequenceShow/hide mRNA sequence
TGGGCTTCCAAGAGTCCATATCTTGAATTGGGCTTTTCCGTATGCTGGACCTTATAAGTGGAGGCCTGGATCCGACCCGATTAAGTTCATCCATATGAGGATAAGGCTCC
TTCGTCATCCGCTAGCAAATTAGCTGAAAGGGGGAAACAGTCTTAAGAATGGCAGCGATATCAGTGTTATCGACCCCATGGGTTTTAAGCGGAGCTGTGGCTGTCCGCGT
TCCCTCAGCTTCTCTTGTAATCTCCACTGGCTCTAGAAGTTGCTGCTCCGTATCTCTCAACACTGCAAACAACACTTCCGCTCGTTCTGGATTGCTCCGTTGTTCGTTTC
TTCCTTCTTCCTCCCTTTCATGCTCTTCATCTTTTTCAGGTTTTTCTTTGGGTTTGGACTGGAACTCCAGAATTTGGGTTGGACAAGGAAAGGGTCGGAGCTTAGTGGTT
AGGGCTGGCAAGGCTGCACTATGTCAGACTAAAAGAAACAGGTCGCGTAAATCCTTAGCTCGGACTCATGGTTTTCGCCGACGAATGCGGACCACCAGTGGTAGAGCTGT
TCTTAAGCGCAGGCGTGCCAAGGGGAGGAAGGTCCTCTGCACGAAGTCTTACCCTAGTAGCGGAAAGGGTGCTTAAATTTTTCTCGCTTTAACCATCTCTAGGTAAATTA
GGCAATGTTGTCTAATTCACTCTTTGAATGACTGGTCGAGATAGCTTATATTATATGATGCTTTCTTATTTAGTCATTTAAAATTTTCTTAAATTATCAGTGTTAAATTC
AAATACTGGACAAGACTAGCTTATGTGATTAATCCTTTCGTATTGTTCATTTGAAAATTTTCTCTGAATTGGAATATGTACATTTTCTGAAACCTGAGGATTGCTGTTAC
TCATATAATCTTGGGCCTTCCGGTCATTAATTTTTAGGTCTGAATCATCTCAGTTTCTGTCAAATATTGTTTTTGATTGAGCTACTCAGTAGCTTCTTACATAAAATGTA
ATTGTTATTGACATTTTAACTTTATAAACATTAAGCATTGTGCTGCCTGACCAAGCTGAAGACACTGTAGAGAAAAAGGAAAGGTTTCAGTGGAATCTGTTAGGTGCAAG
GGAGACCCAGTGGTGTGGTTCTGATAGAAAAGACTTAGAAGTTAGAAGTTGCTTTTAAAGTTGGGTTGTACCTTTTAATTATTTTCCTTCTGATAATAGAGAAGAAAACA
AAATTCACTACAATCTTGGTGATTACTTGTAAGCAGGGAAAATTGAAAGAATGGAGAAAGCAACCAAATGACTAGCATAGCATTGCATAGGATAAGTCTTTATAGGATAA
TATTTGGAGGATCATCATGAATGTTACTGCCTGTTCACGACCTCTGCTTTTAGAAGGAAAGCAAAATGCTAAAATTAGCCGACTGATATCATCTCTACCTACCATGAATC
TAAATAACCTCCCAGTCAGGGAATTCATCATTTTGTGTGGTGGATGGAGAAAACTTTACTATTTCGTTGGTAAGTGCATTTGCTTTTTGGCTCTGCTTCCTTGATGAACT
CGGTATATTTGCTTTAGCATTCATTTTTTCAATCTGAAAGAGAAGGTCAAAGAGGCCAAGAAAAGATATGAGTTGGTATCATGGGTTAGTAGAGAGCATAACTTTAAGCA
TTCTGAATGTTTATAAGCAATCTGCTTGCATGCATATCTAGAAAGTAACGATAAAACCTATAGAGCTCCAAGTACTTTTTGGTCAGCTGAGAAATGGTTTGGTTGAATAA
TATGAAGATTCTGTTTATCAGGTGTTTGAATTGTTCTTCGTTCTGTCTGGATATGATGCTCCCCTGCAATAGCAAGATTCAACAAAATGAATGATATAACAAAAGCACAT
ATGTTATACTGCTCTTTTTCTTATTTCTCCAAAGCTGATGGGTGTCCGTTAACTTGTTTTCATATATGTGCCTGTCTCCCCTTTTCTTTACAGGAAATGACACCCAAGAA
TTAAGAATATTATCATAGAAGCTTCTGACAAGCTGACTATTTCAAGAAAATGCTGGTATGTAAAATATTCATCACTTAAGAATATAGTTAAATATAATTGCATCTTATAT
TGAACCCACATGGCTGTCCACAATTGTACGTGACAGGAGAGAGAACTCCAATTAACAAAAGTTGCATTTTATCACTGAAAATGTAACATCAAAAATTGCACAAGAGGTAA
AGGGTGGAACTGTTTGTGGGATAAGAGTTTTTTGAACCATAGAAAAAAGCCCAAATTAGGAAACCATCACCAAATTGGTGGTGCTGAAAACCAAGACTAAGACCCATTTC
CAAAATTTCTGTAATTATTTTCAAGCACATTGTAGTAGTTTTACCTTCTCCAAGACATCACACATCTCCAAGCACCTGCCCTCCATTTCCTTGAAGACGTTCGTAAACCA
GTCCAAGGCATTACAGTTCTGTCCCATGTTCAGATCAGATGGAAACCTGAAACGAAAATAAACGTGATTGAAAGGTTAAATTACGGCAGATAAGACCAAAATAAAAGCAA
AGGAATAATAGCCTTTGAAAACCAAGAGATGATCCACCAAAATTAAAGAAGCAGTCAGTGCAACTGAGAGCCAAAATTCAAATAAAGAAGTCGTGAAACCATTTCGTCTT
CACTGTAAACTGATAAGCTAATTCAGAAGAAAGCAAGTAGAGGAAGGAGAAACCCATGAACCTGGTGGCAAAGAAGTTAAAAGAAAAACTGAAGACAAATAAAATCCTGA
AGTTACTTGAAAACCCGATTCAACTGAACTTTATAAAAACGGACCACTTTCAATATCATATTCTTAAAAGGAAGAAAGAAAATAATGGGAGAGAGAGAGCTTTGGCAGTG
TCCAAAATGGAGGAGGAAAGGAAACGACGGGGACCAAAAGGAGTTCAGAGAAGAGAGGGACAGGGACCCATGAGAAGAGACAACAGAAGTACTGAGTAAAAAATTCTGAG
AAATTTACAGTATATTCTCTTTAATTACATTTACATTTACTTTTCAATATTTCTAAATTATTTCATCATTGTTTGTAATAAATTTAGGTTTATAAAGTTTGACTCTGGAT
GTAAAATGATTTAGTTTTATAAACTTAAATTTGTGTAGTCGGCATTTGGTTTATAGATTTATAATTTGTATTCATTTGCAATATTCCTAGCTTAACATAAGCAAACCCAC
ATGCCAAACATTTGTCTTCTTTTACTCCTCTAACTCGGATCCAATGTCTAACCCAATAATCAAACACCTTGTAAAAGTCTGTTTAGAAATTTGGCTTAAAATTTTTTCTA
TTTTTTAAAGTAAACAAAAATGCCGTGTCAAAATATATATATCTTTTTAAATAGTTTTTAAAATTTTTAAAATTATATAATAAGCACTTAACTAAACGATTAGTTTGCTA
TATATTTTTCAAAACTAATTTTAAGGAATACACCTAAATAAATGTTTAAAGTGCACAAACTTCATGACATCAAGAAGAGCAATATATTCAAACTCGTGTGGAGTTTATGA
AAGCTCAATTCTACCAAAGAATAATAACGGAACAAAAGAATTCTTAGCTCTTCTTCTTGCTTTATTTTTCCACAATTTCAAAGCTGTACAACAAAAATGAATTTCCTTTC
TGCAAATTCACTTCATGCTTCTCCAGAAGATGTCTGCTTTCTTGTAAGTCTGACCATCTTCTTTTATCTCGAGGTCGTTGCTGAAGACTGTTTTTGTACTTCTGATGAAG
AAGGTTCATGGAGTAATAAAATTAACATCAGAAAGGCGACCCAATTACTACTCTTCCTGTCTGTGGCCCAGAATGTGGATCCATTTTTGCTTGCTTCTCATTCTTAAACC
ATTGTAATAACTTTGATGCCCTCTTTTGAGCCAGTGGACTACCCAATAATGCCACTTCCAGAAGCACTGCAACAATCCCAGACTCGGCCATCTTCTCTCTCTGTTCTGAG
CTCTGATGAGCCAACATCATTAAGATATAAGCAGATAACTCGACAGATTTTGGCTTGTCCTCCCATGTCATAATCTCAATCAGGCTAGACGGGACCATTGGGCTCCCCTC
CATGGCTTTCTTTCCCTGTGAAGTCACCACCAAGTTCCCCAATGATGCGAGGGCTCGATCCGAAAGGCCTTTCGACGATGAGGACAATTTCAAGAGGGTGTGCACCACAC
CATTTGAGACCAGAGGCCCTACGTTTTCCAGCACTGTGGAGATGTTGTAAAGAGTTTCAAGGCAGCATTTTTGGGTCTCGGAATTTGAGGTTGAATCAAGAATCTTTACA
ATAAATGGGATGGCATTTTCATTCGTCTGTAAGGCAATTGTGAAGTGGGAATTGATCAGAGATGAGAGTGACAACAGCAATCTTGCAAAATCATGTTTTGCTGATTCATC
CATGGCTTGGATGTTACATGGAAGTTTGTGTAAGATTCCTGCCTCCACCATGAGGGCCTTGTTCCTAAAGAATAGCACAAAAAGACAAAGAACAATAAGTCCAACTTATG
GTTCAGATTCTTTAAAAACACAAGTCAGGCCAAATGGCAAATGCTGACTTTGGACAGAATGGTTCAGACTTCAGAGGCAGACGAAAGACCAACCCACAAATTCAATATGA
ATAATCTAAAAGTAAACATCTTTTTATTAAGAGTTGACCCATATGCCCAATATTTTTCCTCTCTCTCTTAATTAATGTCTGGGAGCTTTTAGTTATTTATTTATACATTT
TTTTCCTCAAATTTGGAATATCGAGGAATCCAAACCTTCAGCACGTCTTTCCGTAAAGTTCTGTTTTCAATTTGAATATGGGTTCAACATGCTTAAACTTGTTCACTTTT
TCTCTTATTTTTATACCATCTGAGGCTAATATGGTTACCAGAAACTCTCGAATAAGATGAAATTGCCATCGTTAACTACGATTTTCCTTGAAACTCAAGTTTCT
Protein sequenceShow/hide protein sequence
MAAISVLSTPWVLSGAVAVRVPSASLVISTGSRSCCSVSLNTANNTSARSGLLRCSFLPSSSLSCSSSFSGFSLGLDWNSRIWVGQGKGRSLVVRAGKAALCQTKRNRSR
KSLARTHGFRRRMRTTSGRAVLKRRRAKGRKVLCTKSYPSSGKGA