CuGenDBv2

Moc06g06760 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc06g06760
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Ribosomal protein
Genome location: chr6:4930706..4936214
RNA-Seq Expression: Moc06g06760
Synteny: Moc06g06760
Gene Ontology terms:
GO:0006412 - translation (biological process)
GO:0005840 - ribosome (cellular component)
GO:0003735 - structural constituent of ribosome (molecular function)
InterPro domains:
IPR000473 - Ribosomal protein L36
IPR025124 - Domain of unknown function DUF4050
IPR035977 - Ribosomal protein L36 superfamily
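
The genome location field can be split into its parts programmatically. The following is a minimal sketch, not part of the database record: the helper name `parse_location` is hypothetical, and the span calculation assumes both coordinates are 1-based and inclusive, which the page does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


chrom, start, end = parse_location("chr6:4930706..4936214")
# Assumption: both coordinates are 1-based and inclusive.
span = end - start + 1
print(chrom, start, end, span)
```

Under that assumption the gene spans 5509 bp on chr6.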


Homology
GenBank top hits
KAG6587898.1 hypothetical protein SDJN03_16463, partial [Cucurbita argyrosperma subsp. sororia] | e value: 7.1e-124 | %identity: 82.69
Query:  MKVRSSVKKMCEFCRTVKRRGRVYVYCSSNPKHKQRQGLSTFASEGPSPPLFSRSEVKQEILPSYSPRTGLASLIPQKHEPTTMVLGFHLFSQNRCPAFR
        MKVR+SVKKMCEFCRTV+RRGRVY+YCS+NPKHKQRQGLSTFASE PSP LFSRSEVKQE LPS+S RTGLASLIPQKHEP TM L    FS NRCP+ R
Subjt:  MKVRSSVKKMCEFCRTVKRRGRVYVYCSSNPKHKQRQGLSTFASEGPSPPLFSRSEVKQEILPSYSPRTGLASLIPQKHEPTTMVLGFHLFSQNRCPAFR

Query:  FWLNINMVMLNSSFAAWISRLFACMGGCFGCCTKPTPIIAVDEPSKGLRIQGRIVKKPSISDGFWSTSTCDLDNSTIQSQRSISSISTSNLTLNPSNVGG
        FWLNINMVMLNSSFAAWISRLFACMGGCFGCCTKPTPIIAVDEPSKGLRIQGR+VKK SISDGFWSTSTCDLDNSTIQSQ SISSISTSNLTL  SNVGG
Subjt:  FWLNINMVMLNSSFAAWISRLFACMGGCFGCCTKPTPIIAVDEPSKGLRIQGRIVKKPSISDGFWSTSTCDLDNSTIQSQRSISSISTSNLTLNPSNVGG

Query:  S-----------LLLWNQNRLQWTG--SSSKTTDQTQQRRKAKISWRATYDSLLGTRQPFPHPIPLSEMVNFLVEVWEQEGLY
        S           LLLWNQNRLQW G  SSS TTDQTQQ+RKAKISWRATYDSLL TRQ FPHPIPLSEMV FLVE  + +G++
Subjt:  S-----------LLLWNQNRLQWTG--SSSKTTDQTQQRRKAKISWRATYDSLLGTRQPFPHPIPLSEMVNFLVEVWEQEGLY

KAG6589849.1 hypothetical protein SDJN03_15272, partial [Cucurbita argyrosperma subsp. sororia] | e value: 3.4e-118 | %identity: 80.92
Query:  MKVRSSVKKMCEFCRTVKRRGRVYVYCSSNPKHKQRQGLSTFASEGPSPPLFSRSEVKQEILPSYSPRTGLASLIPQKHEPTTMVLGFHLFSQNRCPAFR
        MKVR+SVKKMCEFCRTVKRRGRVYVYCSSNPKHKQRQGLST ASEGPSP LF  SEVKQEILPS+S RT     IP +     + +GF+L   NRCP+ R
Subjt:  MKVRSSVKKMCEFCRTVKRRGRVYVYCSSNPKHKQRQGLSTFASEGPSPPLFSRSEVKQEILPSYSPRTGLASLIPQKHEPTTMVLGFHLFSQNRCPAFR

Query:  FWLNINMVMLNSSFAAWISRLFACMGGCFGCCTKPTPIIAVDEPSKGLRIQGRIVKKPSISDGFWSTSTCDLDNSTIQSQRSISSISTSNLTLNPSNVGG
        F L INMVMLNSSFAAWISRLFACMGGCFGCCTKPTPIIAVDEPSKGLRIQGR+VKKPSISDGFWSTSTCDLDNSTIQSQRSISSISTSNLT + SNVGG
Subjt:  FWLNINMVMLNSSFAAWISRLFACMGGCFGCCTKPTPIIAVDEPSKGLRIQGRIVKKPSISDGFWSTSTCDLDNSTIQSQRSISSISTSNLTLNPSNVGG

Query:  S-----------LLLWNQNRLQWTGSS-SKTTDQTQQRRKAKISWRATYDSLLGTRQPFPHPIPLSEMVNFLVEVWEQEGLYD
        S           LLLWNQ RLQW GSS + TTD+TQ+R+KAKISWRATYDSLLGTRQPFPH IPLSEMVNFLVEVWEQEGLYD
Subjt:  S-----------LLLWNQNRLQWTGSS-SKTTDQTQQRRKAKISWRATYDSLLGTRQPFPHPIPLSEMVNFLVEVWEQEGLYD

KAG7023519.1 hypothetical protein SDJN02_14545, partial [Cucurbita argyrosperma subsp. argyrosperma] | e value: 5.3e-87 | %identity: 82.84
Query:  ASLIPQKHEPTTMVLGFHL---FSQNRCPAFRFWLNINMVMLNSSFAAWISRLFACMGGCFGCCTKPTPIIAVDEPSKGLRIQGRIVKKPSISDGFWSTS
        +SL P+ H       GF +   F  NRCP+ RF L INMVMLNSSFAAWISRLFACMGGCFGCCTKPTPIIAVDEPSKGLRIQGR+VKKPSISDGFWSTS
Subjt:  ASLIPQKHEPTTMVLGFHL---FSQNRCPAFRFWLNINMVMLNSSFAAWISRLFACMGGCFGCCTKPTPIIAVDEPSKGLRIQGRIVKKPSISDGFWSTS

Query:  TCDLDNSTIQSQRSISSISTSNLTLNPSNVGGSLLLWNQNRLQWTGSS-SKTTDQTQQRRKAKISWRATYDSLLGTRQPFPHPIPLSEMVNFLVEVWEQE
        TCDLDNSTIQSQRSISSISTSNLT + SNVGGSLLLWNQ RLQW GSS + TTD+TQ+R+KAKISWRATYDSLLGTRQPFPH IPLSEMVNFLVEVWEQE
Subjt:  TCDLDNSTIQSQRSISSISTSNLTLNPSNVGGSLLLWNQNRLQWTGSS-SKTTDQTQQRRKAKISWRATYDSLLGTRQPFPHPIPLSEMVNFLVEVWEQE

Query:  GLYD
        GLYD
Subjt:  GLYD

XP_022144929.1 uncharacterized protein LOC111014486 [Momordica charantia] | e value: 5.0e-101 | %identity: 86.88
Query:  LPSYSPRTGLASLIPQKHEPTTMVLGFHLFSQNRCPAFRFWLNINMVMLNSSFAAWISRLFACMGGCFGCCTKPTPIIAVDEPSKGLRIQGRIVKKPSIS
        LP+ SP   L++           VLGFHLFSQNRCPAFRFWLNINMVMLNSSFAAWISRLFACMGGCFGCCTKPTPIIAVDEPSKGLRIQGRIVKKPSIS
Subjt:  LPSYSPRTGLASLIPQKHEPTTMVLGFHLFSQNRCPAFRFWLNINMVMLNSSFAAWISRLFACMGGCFGCCTKPTPIIAVDEPSKGLRIQGRIVKKPSIS

Query:  DGFWSTSTCDLDNSTIQSQRSISSISTSNLTLNPSNVGGS-----------LLLWNQNRLQWTGSSSKTTDQTQQRRKAKISWRATYDSLLGTRQPFPHP
        DGFWSTSTCDLDNSTIQSQRSISSISTSNLTLNPSNVGGS           LLLWNQNRLQWTGSSSKTTDQTQQRRKAKISWRATYDSLLGTRQPFPHP
Subjt:  DGFWSTSTCDLDNSTIQSQRSISSISTSNLTLNPSNVGGS-----------LLLWNQNRLQWTGSSSKTTDQTQQRRKAKISWRATYDSLLGTRQPFPHP

Query:  IPLSEMVNFLVEVWEQEGLYD
        IPLSEMVNFLVEVWEQEGLYD
Subjt:  IPLSEMVNFLVEVWEQEGLYD

XP_022971935.1 uncharacterized protein LOC111470604 [Cucurbita maxima] | e value: 1.9e-84 | %identity: 85.05
Query:  FSQNRCPAFRFWLNINMVMLNSSFAAWISRLFACMGGCFGCCTKPTPIIAVDEPSKGLRIQGRIVKKPSISDGFWSTSTCDLDNSTIQSQRSISSISTSN
        FS NRCP+ RFWLNINMVMLNSSFAAWISRLFACMGGCFGCCTKPTPIIAVDEPSKGLRIQGR+VKK SISDGFWSTSTCDLDNSTIQSQ SISSISTSN
Subjt:  FSQNRCPAFRFWLNINMVMLNSSFAAWISRLFACMGGCFGCCTKPTPIIAVDEPSKGLRIQGRIVKKPSISDGFWSTSTCDLDNSTIQSQRSISSISTSN

Query:  LTLNPSNVGGS-----------LLLWNQNRLQWTG--SSSKTTDQTQQRRKAKISWRATYDSLLGTRQPFPHPIPLSEMVNFLVEVWEQEGLYD
        LTL  SNVG S           LLLWNQNRLQW G  SSSKTTDQTQ +RKAKISWRATYDSLL TRQ FPHPIPL+EMV FLVEVWEQEGLYD
Subjt:  LTLNPSNVGGS-----------LLLWNQNRLQWTG--SSSKTTDQTQQRRKAKISWRATYDSLLGTRQPFPHPIPLSEMVNFLVEVWEQEGLYD

TrEMBL top hits
A0A1S3B9E5 uncharacterized protein LOC103487445 | e value: 4.9e-78 | %identity: 85.88
Query:  MVMLNSSFAAWISRLFACMGGCFGCCTKPTPIIAVDEPSKGLRIQGRIVKKPSISDGFWSTSTCDLDNSTIQSQRSISSISTSNLTLNPSNVGGS-----
        MVMLNSSFAAWISRLFACMGGCFGCCTKPTPIIAVDEPSKGLRIQGR+VKKPSISDGFWSTSTCDLDNSTIQSQRSISSIST NLTL+ SNV GS     
Subjt:  MVMLNSSFAAWISRLFACMGGCFGCCTKPTPIIAVDEPSKGLRIQGRIVKKPSISDGFWSTSTCDLDNSTIQSQRSISSISTSNLTLNPSNVGGS-----

Query:  ------LLLWNQNRLQWTGS-SSKTTDQTQQRRKAKISWRATYDSLLGTRQPFPHPIPLSEMVNFLVEVWEQEGLYD
              LLLWNQ R+QW GS ++K TD+TQQR+KAKISWRATYDSLLGTRQPFPHPIPLSEMVNFLVEVWEQEGLYD
Subjt:  ------LLLWNQNRLQWTGS-SSKTTDQTQQRRKAKISWRATYDSLLGTRQPFPHPIPLSEMVNFLVEVWEQEGLYD

A0A5A7U2N6 DUF4050 family protein | e value: 1.1e-77 | %identity: 82.26
Query:  MVMLNSSFAAWISRLFACMGGCFGCCTKPTPIIAVDEPSKGLRIQGRIVKKPSISDGFWSTSTCDLDNSTIQSQRSISSISTSNLTLNPSNVGGS-----
        MVMLNSSFAAWISRLFACMGGCFGCCTKPTPIIAVDEPSKGLRIQGR+VKKPSISDGFWSTSTCDLDNSTIQSQRSISSISTSNLTL+ SNV GS     
Subjt:  MVMLNSSFAAWISRLFACMGGCFGCCTKPTPIIAVDEPSKGLRIQGRIVKKPSISDGFWSTSTCDLDNSTIQSQRSISSISTSNLTLNPSNVGGS-----

Query:  ---------------LLLWNQNRLQWTGS-SSKTTDQTQQRRKAKISWRATYDSLLGTRQPFPHPIPLSEMVNFLVEVWEQEGLYD
                       LLLWNQ R+QW GS ++K TD+TQQR+KAKISWRATYDSLLGTRQPFPHPIPLSEMVNFLVEVWEQEGLYD
Subjt:  ---------------LLLWNQNRLQWTGS-SSKTTDQTQQRRKAKISWRATYDSLLGTRQPFPHPIPLSEMVNFLVEVWEQEGLYD

A0A5D3C7W3 DUF4050 family protein | e value: 4.9e-78 | %identity: 85.88
Query:  MVMLNSSFAAWISRLFACMGGCFGCCTKPTPIIAVDEPSKGLRIQGRIVKKPSISDGFWSTSTCDLDNSTIQSQRSISSISTSNLTLNPSNVGGS-----
        MVMLNSSFAAWISRLFACMGGCFGCCTKPTPIIAVDEPSKGLRIQGR+VKKPSISDGFWSTSTCDLDNSTIQSQRSISSIST NLTL+ SNV GS     
Subjt:  MVMLNSSFAAWISRLFACMGGCFGCCTKPTPIIAVDEPSKGLRIQGRIVKKPSISDGFWSTSTCDLDNSTIQSQRSISSISTSNLTLNPSNVGGS-----

Query:  ------LLLWNQNRLQWTGS-SSKTTDQTQQRRKAKISWRATYDSLLGTRQPFPHPIPLSEMVNFLVEVWEQEGLYD
              LLLWNQ R+QW GS ++K TD+TQQR+KAKISWRATYDSLLGTRQPFPHPIPLSEMVNFLVEVWEQEGLYD
Subjt:  ------LLLWNQNRLQWTGS-SSKTTDQTQQRRKAKISWRATYDSLLGTRQPFPHPIPLSEMVNFLVEVWEQEGLYD

A0A6J1CTQ5 uncharacterized protein LOC111014486 | e value: 2.4e-101 | %identity: 86.88
Query:  LPSYSPRTGLASLIPQKHEPTTMVLGFHLFSQNRCPAFRFWLNINMVMLNSSFAAWISRLFACMGGCFGCCTKPTPIIAVDEPSKGLRIQGRIVKKPSIS
        LP+ SP   L++           VLGFHLFSQNRCPAFRFWLNINMVMLNSSFAAWISRLFACMGGCFGCCTKPTPIIAVDEPSKGLRIQGRIVKKPSIS
Subjt:  LPSYSPRTGLASLIPQKHEPTTMVLGFHLFSQNRCPAFRFWLNINMVMLNSSFAAWISRLFACMGGCFGCCTKPTPIIAVDEPSKGLRIQGRIVKKPSIS

Query:  DGFWSTSTCDLDNSTIQSQRSISSISTSNLTLNPSNVGGS-----------LLLWNQNRLQWTGSSSKTTDQTQQRRKAKISWRATYDSLLGTRQPFPHP
        DGFWSTSTCDLDNSTIQSQRSISSISTSNLTLNPSNVGGS           LLLWNQNRLQWTGSSSKTTDQTQQRRKAKISWRATYDSLLGTRQPFPHP
Subjt:  DGFWSTSTCDLDNSTIQSQRSISSISTSNLTLNPSNVGGS-----------LLLWNQNRLQWTGSSSKTTDQTQQRRKAKISWRATYDSLLGTRQPFPHP

Query:  IPLSEMVNFLVEVWEQEGLYD
        IPLSEMVNFLVEVWEQEGLYD
Subjt:  IPLSEMVNFLVEVWEQEGLYD

A0A6J1I744 uncharacterized protein LOC111470604 | e value: 9.2e-85 | %identity: 85.05
Query:  FSQNRCPAFRFWLNINMVMLNSSFAAWISRLFACMGGCFGCCTKPTPIIAVDEPSKGLRIQGRIVKKPSISDGFWSTSTCDLDNSTIQSQRSISSISTSN
        FS NRCP+ RFWLNINMVMLNSSFAAWISRLFACMGGCFGCCTKPTPIIAVDEPSKGLRIQGR+VKK SISDGFWSTSTCDLDNSTIQSQ SISSISTSN
Subjt:  FSQNRCPAFRFWLNINMVMLNSSFAAWISRLFACMGGCFGCCTKPTPIIAVDEPSKGLRIQGRIVKKPSISDGFWSTSTCDLDNSTIQSQRSISSISTSN

Query:  LTLNPSNVGGS-----------LLLWNQNRLQWTG--SSSKTTDQTQQRRKAKISWRATYDSLLGTRQPFPHPIPLSEMVNFLVEVWEQEGLYD
        LTL  SNVG S           LLLWNQNRLQW G  SSSKTTDQTQ +RKAKISWRATYDSLL TRQ FPHPIPL+EMV FLVEVWEQEGLYD
Subjt:  LTLNPSNVGGS-----------LLLWNQNRLQWTG--SSSKTTDQTQQRRKAKISWRATYDSLLGTRQPFPHPIPLSEMVNFLVEVWEQEGLYD

SwissProt top hits
A5VLI2 50S ribosomal protein L36 | e value: 2.1e-09 | %identity: 78.95
Query:  MKVRSSVKKMCEFCRTVKRRGRVYVYCSSNPKHKQRQG
        MKVR SVKKMCE C+ VKR GRV V CS+NPKHKQRQG
Subjt:  MKVRSSVKKMCEFCRTVKRRGRVYVYCSSNPKHKQRQG

B2G8V5 50S ribosomal protein L36 | e value: 2.1e-09 | %identity: 78.95
Query:  MKVRSSVKKMCEFCRTVKRRGRVYVYCSSNPKHKQRQG
        MKVR SVKKMCE C+ VKR GRV V CS+NPKHKQRQG
Subjt:  MKVRSSVKKMCEFCRTVKRRGRVYVYCSSNPKHKQRQG

B3WAJ5 50S ribosomal protein L36 | e value: 2.7e-09 | %identity: 73.68
Query:  MKVRSSVKKMCEFCRTVKRRGRVYVYCSSNPKHKQRQG
        MKVR SVKKMCE C+ V+R+GRV + CS+NPKHKQRQG
Subjt:  MKVRSSVKKMCEFCRTVKRRGRVYVYCSSNPKHKQRQG

P73300 50S ribosomal protein L36 | e value: 1.2e-09 | %identity: 76.32
Query:  MKVRSSVKKMCEFCRTVKRRGRVYVYCSSNPKHKQRQG
        MKVR+SVKKMC+ CR ++RRGRV V CS+NPKHKQRQG
Subjt:  MKVRSSVKKMCEFCRTVKRRGRVYVYCSSNPKHKQRQG

Q035A6 50S ribosomal protein L36 | e value: 2.7e-09 | %identity: 73.68
Query:  MKVRSSVKKMCEFCRTVKRRGRVYVYCSSNPKHKQRQG
        MKVR SVKKMCE C+ V+R+GRV + CS+NPKHKQRQG
Subjt:  MKVRSSVKKMCEFCRTVKRRGRVYVYCSSNPKHKQRQG
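
The %identity values reported for the short SwissProt hits can be cross-checked from the alignment midline alone. The sketch below assumes the usual BLAST midline convention (a letter for an identical residue, `+` for a conservative substitution, a space for a mismatch or gap) and that identity is taken over the full aligned length; the function name `percent_identity` is hypothetical.

```python
def percent_identity(midline: str) -> float:
    """Percent identity over the aligned length, from a BLAST-style midline.

    Letters mark identical residues, '+' marks conservative substitutions,
    and spaces mark mismatches or gaps.
    """
    identical = sum(1 for ch in midline if ch.isalpha())
    return round(100.0 * identical / len(midline), 2)


# Midline of the A5VLI2 alignment, copied from the record above
# (38 aligned positions, no leading or trailing padding).
midline = "MKVR SVKKMCE C+ VKR GRV V CS+NPKHKQRQG"
print(percent_identity(midline))
```

The midline must keep its internal spaces and cover the whole aligned region, since trimming it changes the denominator.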

Arabidopsis top hits
AT1G15350.1 unknown protein | e value: 3.7e-30 | %identity: 47.47
Query:  MGGCFGCCT--KPTPIIAVDEPSKGLRIQGRIVKKPSISDGFWSTSTCDLDNSTIQSQRSISSISTS--------NLTLNPSNVGGSLLLWNQNRLQWTG
        MGGC GC    + T     D PS  +    R  KKPS+S+ FWSTST D+DN T  SQ S+SS + +        N    P  V   LLLWNQ R +W G
Subjt:  MGGCFGCCT--KPTPIIAVDEPSKGLRIQGRIVKKPSISDGFWSTSTCDLDNSTIQSQRSISSISTS--------NLTLNPSNVGGSLLLWNQNRLQWTG

Query:  SSSKTTDQTQQRRKAKISWR-ATYDSLLGTRQPFPHPIPLSEMVNFLVEVWEQEGLYD
           K  +     + AK++W  ATYDSLLG+ + FP PIPL+EMV+FLV++WEQEGLYD
Subjt:  SSSKTTDQTQQRRKAKISWR-ATYDSLLGTRQPFPHPIPLSEMVNFLVEVWEQEGLYD

AT1G15350.2 unknown protein | e value: 3.7e-30 | %identity: 47.47
Query:  MGGCFGCCT--KPTPIIAVDEPSKGLRIQGRIVKKPSISDGFWSTSTCDLDNSTIQSQRSISSISTS--------NLTLNPSNVGGSLLLWNQNRLQWTG
        MGGC GC    + T     D PS  +    R  KKPS+S+ FWSTST D+DN T  SQ S+SS + +        N    P  V   LLLWNQ R +W G
Subjt:  MGGCFGCCT--KPTPIIAVDEPSKGLRIQGRIVKKPSISDGFWSTSTCDLDNSTIQSQRSISSISTS--------NLTLNPSNVGGSLLLWNQNRLQWTG

Query:  SSSKTTDQTQQRRKAKISWR-ATYDSLLGTRQPFPHPIPLSEMVNFLVEVWEQEGLYD
           K  +     + AK++W  ATYDSLLG+ + FP PIPL+EMV+FLV++WEQEGLYD
Subjt:  SSSKTTDQTQQRRKAKISWR-ATYDSLLGTRQPFPHPIPLSEMVNFLVEVWEQEGLYD

AT4G32342.1 unknown protein | e value: 1.1e-32 | %identity: 54.42
Query:  CFGCCTKPTP-IIAVDEPSKGLRIQGRIVKKPSI-SDGFWSTSTCDLD-NSTIQSQRSISSISTSNLTLNPSN-VGGSLLLWNQNRLQWTGSSSKTTDQT
        CFGCC +    ++ VDEPSKGL+IQG+IVKK S  SD FWSTSTCD+D N TIQSQ S         T N +  V   L+LWN  R QW       T Q 
Subjt:  CFGCCTKPTP-IIAVDEPSKGLRIQGRIVKKPSI-SDGFWSTSTCDLD-NSTIQSQRSISSISTSNLTLNPSN-VGGSLLLWNQNRLQWTGSSSKTTDQT

Query:  QQRRKAKISWRATYDSLLGTRQPFPHPIPLSEMVNFLVEVWEQEGLY
            +  ISW +TYDSLL T + FP PIPL EMV+FLV+VWE+EGLY
Subjt:  QQRRKAKISWRATYDSLLGTRQPFPHPIPLSEMVNFLVEVWEQEGLY

AT5G25360.1 unknown protein | e value: 1.7e-51 | %identity: 60.95
Query:  LNSSFAAWISRLFACMGGCFGCCTKPTPIIAVDEPSKGLRIQGRIVKKPSISDGFWSTSTCDLDNSTIQSQRSISSI------STSNLTLNPSN-VGGSL
        L     +WI +LF CMGGCFGCC KP  I+AVDEPSKGLRIQGR+VKKPS+S+ FWSTSTC++DNST+QSQRS+SSI      STS  T NP+  V   L
Subjt:  LNSSFAAWISRLFACMGGCFGCCTKPTPIIAVDEPSKGLRIQGRIVKKPSISDGFWSTSTCDLDNSTIQSQRSISSI------STSNLTLNPSN-VGGSL

Query:  LLWNQNRLQWTGSSSKTTDQTQQRRKAKISWRATYDSLLGTRQPFPHPIPLSEMVNFLVEVWEQEGLYD
         LWNQ R QW  +   T+ +  + R+  ISW ATY+SLLG  + F  PIPL EMV+FLV+VWEQEGLYD
Subjt:  LLWNQNRLQWTGSSSKTTDQTQQRRKAKISWRATYDSLLGTRQPFPHPIPLSEMVNFLVEVWEQEGLYD

AT5G25360.2 unknown protein | e value: 1.7e-51 | %identity: 60.95
Query:  LNSSFAAWISRLFACMGGCFGCCTKPTPIIAVDEPSKGLRIQGRIVKKPSISDGFWSTSTCDLDNSTIQSQRSISSI------STSNLTLNPSN-VGGSL
        L     +WI +LF CMGGCFGCC KP  I+AVDEPSKGLRIQGR+VKKPS+S+ FWSTSTC++DNST+QSQRS+SSI      STS  T NP+  V   L
Subjt:  LNSSFAAWISRLFACMGGCFGCCTKPTPIIAVDEPSKGLRIQGRIVKKPSISDGFWSTSTCDLDNSTIQSQRSISSI------STSNLTLNPSN-VGGSL

Query:  LLWNQNRLQWTGSSSKTTDQTQQRRKAKISWRATYDSLLGTRQPFPHPIPLSEMVNFLVEVWEQEGLYD
         LWNQ R QW  +   T+ +  + R+  ISW ATY+SLLG  + F  PIPL EMV+FLV+VWEQEGLYD
Subjt:  LLWNQNRLQWTGSSSKTTDQTQQRRKAKISWRATYDSLLGTRQPFPHPIPLSEMVNFLVEVWEQEGLYD


Sequences
CDS sequence
ATGGGGGATCGTTGGCTGGGCCGAGCCCCGGGTCGGGACCGAGCCCTCTCGGCCCGGTTAGTTCTCCGCCAGCCGCTTTCCTCGCTCTTTGCCCCGGCATTAGCCATGAA
AGTCCGTTCTTCAGTGAAAAAGATGTGTGAATTCTGTAGGACAGTAAAGCGCCGGGGTCGAGTATATGTATACTGCTCATCCAATCCCAAACACAAGCAACGCCAAGGCC
TATCAACATTTGCAAGTGAAGGCCCTTCTCCTCCCTTGTTCTCAAGAAGTGAGGTGAAGCAAGAGATTCTTCCCAGTTATAGCCCGCGAACAGGGCTGGCTTCTCTCATC
CCCCAAAAGCACGAGCCAACTACAATGGTTTTGGGTTTTCATTTATTCTCCCAGAATCGATGCCCTGCTTTCCGCTTTTGGCTTAACATCAACATGGTGATGCTGAATAG
TTCCTTCGCTGCGTGGATCAGCCGCTTGTTCGCTTGCATGGGGGGTTGTTTTGGATGCTGCACTAAGCCCACACCTATTATTGCTGTGGATGAGCCATCTAAGGGATTAA
GAATTCAAGGACGAATCGTTAAGAAACCTAGCATATCTGACGGTTTCTGGAGCACAAGCACGTGTGATTTGGATAATAGCACCATTCAATCTCAACGAAGCATCTCGTCT
ATCAGTACATCAAATCTCACACTCAATCCGAGCAATGTTGGTGGCAGTCTCCTTCTCTGGAATCAGAATAGGCTGCAGTGGACTGGAAGTAGTAGCAAGACAACGGATCA
AACTCAACAAAGACGGAAGGCAAAAATCAGTTGGCGTGCAACATATGATAGTTTACTGGGTACAAGACAGCCTTTTCCCCATCCAATTCCTTTATCCGAAATGGTGAACT
TTCTTGTGGAAGTATGGGAGCAGGAGGGCCTCTATGATTGA
mRNA sequence
ATGGGGGATCGTTGGCTGGGCCGAGCCCCGGGTCGGGACCGAGCCCTCTCGGCCCGGTTAGTTCTCCGCCAGCCGCTTTCCTCGCTCTTTGCCCCGGCATTAGCCATGAA
AGTCCGTTCTTCAGTGAAAAAGATGTGTGAATTCTGTAGGACAGTAAAGCGCCGGGGTCGAGTATATGTATACTGCTCATCCAATCCCAAACACAAGCAACGCCAAGGCC
TATCAACATTTGCAAGTGAAGGCCCTTCTCCTCCCTTGTTCTCAAGAAGTGAGGTGAAGCAAGAGATTCTTCCCAGTTATAGCCCGCGAACAGGGCTGGCTTCTCTCATC
CCCCAAAAGCACGAGCCAACTACAATGGTTTTGGGTTTTCATTTATTCTCCCAGAATCGATGCCCTGCTTTCCGCTTTTGGCTTAACATCAACATGGTGATGCTGAATAG
TTCCTTCGCTGCGTGGATCAGCCGCTTGTTCGCTTGCATGGGGGGTTGTTTTGGATGCTGCACTAAGCCCACACCTATTATTGCTGTGGATGAGCCATCTAAGGGATTAA
GAATTCAAGGACGAATCGTTAAGAAACCTAGCATATCTGACGGTTTCTGGAGCACAAGCACGTGTGATTTGGATAATAGCACCATTCAATCTCAACGAAGCATCTCGTCT
ATCAGTACATCAAATCTCACACTCAATCCGAGCAATGTTGGTGGCAGTCTCCTTCTCTGGAATCAGAATAGGCTGCAGTGGACTGGAAGTAGTAGCAAGACAACGGATCA
AACTCAACAAAGACGGAAGGCAAAAATCAGTTGGCGTGCAACATATGATAGTTTACTGGGTACAAGACAGCCTTTTCCCCATCCAATTCCTTTATCCGAAATGGTGAACT
TTCTTGTGGAAGTATGGGAGCAGGAGGGCCTCTATGATTGA
Protein sequence
MGDRWLGRAPGRDRALSARLVLRQPLSSLFAPALAMKVRSSVKKMCEFCRTVKRRGRVYVYCSSNPKHKQRQGLSTFASEGPSPPLFSRSEVKQEILPSYSPRTGLASLI
PQKHEPTTMVLGFHLFSQNRCPAFRFWLNINMVMLNSSFAAWISRLFACMGGCFGCCTKPTPIIAVDEPSKGLRIQGRIVKKPSISDGFWSTSTCDLDNSTIQSQRSISS
ISTSNLTLNPSNVGGSLLLWNQNRLQWTGSSSKTTDQTQQRRKAKISWRATYDSLLGTRQPFPHPIPLSEMVNFLVEVWEQEGLYD
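
The relationship between the CDS and protein sequences above can be checked by translating the CDS with the standard genetic code. This is a minimal sketch assuming the standard nuclear code applies and that translation stops at the first stop codon; `translate` and `CODON_TABLE` are hypothetical names, and only the first 21 and last 12 nucleotides of the CDS are used here for brevity.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 21 nt and last 12 nt of the CDS listed above.
print(translate("ATGGGGGATCGTTGGCTGGGC"))  # start of the protein
print(translate("CTCTATGATTGA"))           # end of the protein, before TGA
```

Passing the full CDS (with line breaks removed) should reproduce the listed protein sequence if the assumptions hold.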