; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS021841 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS021841
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionUPF0307 protein plu4061 isoform X2
Genome locationscaffold1:724851..728431
RNA-Seq ExpressionMS021841
SyntenyMS021841
Gene Ontology termsNA
InterPro domainsIPR006839 - Ribosome-associated, YjgA
IPR023153 - PSPTO4464-like domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6591265.1 hypothetical protein SDJN03_13611, partial [Cucurbita argyrosperma subsp. sororia]1.3e-9768.94Show/hide
Query:  MGHMVRALRQWPMLQKHWCGCAVHQFLTSSPTCVTNRIDSRRLSLATVHSARREVQYGSKGLRLSNAPAPTKFQEDESVDDDSDAKKSRNQLKREARRAV
        MGHMVRALR WPMLQ H  GC VH FL S P CV  RIDSRRL+LATVHSARREVQYGSKGLRLS A AP +FQEDESVD+D D +KSRNQLKREARRAV
Subjt:  MGHMVRALRQWPMLQKHWCGCAVHQFLTSSPTCVTNRIDSRRLSLATVHSARREVQYGSKGLRLSNAPAPTKFQEDESVDDDSDAKKSRNQLKREARRAV

Query:  QWGMELAAFSTPQIKLILR------------------GLDVMSEKESEGS---------------SVILATKDGDQGMLKTLSGSIAFDDDDDAESEY--
        QWGM+LAAFSTPQIK ILR                  G DV   K  + S               S+I ATKDGD  ML+TLSGS+A DDD+D +SEY  
Subjt:  QWGMELAAFSTPQIKLILR------------------GLDVMSEKESEGS---------------SVILATKDGDQGMLKTLSGSIAFDDDDDAESEY--

Query:  EEEEEGPHVDTATRWLDGLINKDNNITNEVYSLQSVEFDRQELRRLVRKVHLVEERKAATEENEDEVNMEITNARKSLTRFLCRMAKQLPSYE
        EEEEEGPHVD  TRWLDGL++KD N+TNE+YSLQ+VEFDRQELRRLVRKVH+VEERKAATEENEDEVN+ IT A+K L RFLCRMAKQLP YE
Subjt:  EEEEEGPHVDTATRWLDGLINKDNNITNEVYSLQSVEFDRQELRRLVRKVHLVEERKAATEENEDEVNMEITNARKSLTRFLCRMAKQLPSYE

KAG7024147.1 hypothetical protein SDJN02_12960, partial [Cucurbita argyrosperma subsp. argyrosperma]9.1e-9668.49Show/hide
Query:  MGHMVRALRQWPMLQKHWCGCAVHQFLTSSPTCVTNRIDSRRLSLATVHSARREVQYGSKGLRLSNAPAPTKFQEDESVDDDSDAKKSRNQLKREARRAV
        M HMVRALR WPMLQ H  GC VH FL SSP  V  RIDSRRL+LATVHSARREVQYGSKGLRLS A AP +FQEDESVD+D D +KSRNQLKREARRAV
Subjt:  MGHMVRALRQWPMLQKHWCGCAVHQFLTSSPTCVTNRIDSRRLSLATVHSARREVQYGSKGLRLSNAPAPTKFQEDESVDDDSDAKKSRNQLKREARRAV

Query:  QWGMELAAFSTPQIKLILR------------------GLDVMSEKESEGS---------------SVILATKDGDQGMLKTLSGSIAFDDDDDAESEY-E
        QWGM+LAAFSTPQIK ILR                  G DV   K  + S               S+I ATKDGD  ML+TLSGS+A D+D+D +SEY E
Subjt:  QWGMELAAFSTPQIKLILR------------------GLDVMSEKESEGS---------------SVILATKDGDQGMLKTLSGSIAFDDDDDAESEY-E

Query:  EEEEGPHVDTATRWLDGLINKDNNITNEVYSLQSVEFDRQELRRLVRKVHLVEERKAATEENEDEVNMEITNARKSLTRFLCRMAKQLPSYE
        EEEEGPHVD  TRWLDGL++KD N+TNE+YSLQ+VEFDRQELRRLVRKVH+VEERKAATEENEDEVN+ IT A+K L RFLCRMAKQLP YE
Subjt:  EEEEGPHVDTATRWLDGLINKDNNITNEVYSLQSVEFDRQELRRLVRKVHLVEERKAATEENEDEVNMEITNARKSLTRFLCRMAKQLPSYE

XP_022140476.1 uncharacterized protein LOC111011113 [Momordica charantia]6.4e-12684.93Show/hide
Query:  MGHMVRALRQWPMLQKHWCGCAVHQFLTSSPTCVTNRIDSRRLSLATVHSARREVQYGSKGLRLSNAPAPTKFQEDESVDDDSDAKKSRNQLKREARRAV
        MGHMVRALRQWPMLQKHWCGCAVHQFLTSSP CVTNRIDSRRLSLATVHSARREVQYGSKGLRLSNAPAPTKFQEDESVDDDSDAKKSRNQLKREARRAV
Subjt:  MGHMVRALRQWPMLQKHWCGCAVHQFLTSSPTCVTNRIDSRRLSLATVHSARREVQYGSKGLRLSNAPAPTKFQEDESVDDDSDAKKSRNQLKREARRAV

Query:  QWGMELAAFSTPQIKLILR------------------GLDVMSEKESEGS---------------SVILATKDGDQGMLKTLSGSIAFDDDDDAESEYEE
        QWGMELAAFSTPQIKLILR                  G DV   K  + S               S+I ATKDGDQGMLKTLSGSIAFDDDDDAESEYEE
Subjt:  QWGMELAAFSTPQIKLILR------------------GLDVMSEKESEGS---------------SVILATKDGDQGMLKTLSGSIAFDDDDDAESEYEE

Query:  EEEGPHVDTATRWLDGLINKDNNITNEVYSLQSVEFDRQELRRLVRKVHLVEERKAATEENEDEVNMEITNARKSLTRFLCRMAKQLPSYEF
        EEEGPHVDTATRWLDGLINKDNNITNEVYSLQSVEFDRQELRRLVRKVHLVEERKAATEENEDEVNMEITNARKSLTRFLCRMAKQLPSYEF
Subjt:  EEEGPHVDTATRWLDGLINKDNNITNEVYSLQSVEFDRQELRRLVRKVHLVEERKAATEENEDEVNMEITNARKSLTRFLCRMAKQLPSYEF

XP_022937103.1 uncharacterized protein LOC111443507 [Cucurbita moschata]4.8e-9769.18Show/hide
Query:  MGHMVRALRQWPMLQKHWCGCAVHQFLTSSPTCVTNRIDSRRLSLATVHSARREVQYGSKGLRLSNAPAPTKFQEDESVDDDSDAKKSRNQLKREARRAV
        MGHMVRALR WPMLQ H  GC VH FL SSP  V  RIDSRRL+LATVHSARREVQYGSKGLRLS A AP +FQEDESVD+D D +KSRNQLKREARRAV
Subjt:  MGHMVRALRQWPMLQKHWCGCAVHQFLTSSPTCVTNRIDSRRLSLATVHSARREVQYGSKGLRLSNAPAPTKFQEDESVDDDSDAKKSRNQLKREARRAV

Query:  QWGMELAAFSTPQIKLILR------------------GLDVMSEKESEGS---------------SVILATKDGDQGMLKTLSGSIAFDDDDDAESEY-E
        QWGM+LAAFSTPQIK ILR                  G DV   K  + S               S+I ATKDGD  ML+TLSGS+A DDD+D +SEY E
Subjt:  QWGMELAAFSTPQIKLILR------------------GLDVMSEKESEGS---------------SVILATKDGDQGMLKTLSGSIAFDDDDDAESEY-E

Query:  EEEEGPHVDTATRWLDGLINKDNNITNEVYSLQSVEFDRQELRRLVRKVHLVEERKAATEENEDEVNMEITNARKSLTRFLCRMAKQLPSYE
        EEEEGPHVD  TRWLDGL++KD N+TNE+YSLQ+VEFDRQELRRLVRKVH+VEERKAATEENEDEVN+ IT A+K L RFLCRMAKQLP YE
Subjt:  EEEEGPHVDTATRWLDGLINKDNNITNEVYSLQSVEFDRQELRRLVRKVHLVEERKAATEENEDEVNMEITNARKSLTRFLCRMAKQLPSYE

XP_023535041.1 uncharacterized protein LOC111796585 [Cucurbita pepo subsp. pepo]1.3e-9769.86Show/hide
Query:  MGHMVRALRQWPMLQKHWCGCAVHQFLTSSPTCVTNRIDSRRLSLATVHSARREVQYGSKGLRLSNAPAPTKFQEDESVDDDSDAKKSRNQLKREARRAV
        MGHMVRALR WPMLQ H  GC VH FL SSP  V  RIDSRRL+LATVHSARREVQYGSKGLRLS A AP +FQEDESVD+D D +KSRNQLKREARRAV
Subjt:  MGHMVRALRQWPMLQKHWCGCAVHQFLTSSPTCVTNRIDSRRLSLATVHSARREVQYGSKGLRLSNAPAPTKFQEDESVDDDSDAKKSRNQLKREARRAV

Query:  QWGMELAAFSTPQIKLILR------------------GLDVMSEKESEGS---------------SVILATKDGDQGMLKTLSGSIAFDDDDDAESEYEE
        QWGM+LAAFSTPQIK ILR                  G DV   K  + S               S+I ATKDGD  ML+TLSGS+A DDD+D +SEYEE
Subjt:  QWGMELAAFSTPQIKLILR------------------GLDVMSEKESEGS---------------SVILATKDGDQGMLKTLSGSIAFDDDDDAESEYEE

Query:  EEE-GPHVDTATRWLDGLINKDNNITNEVYSLQSVEFDRQELRRLVRKVHLVEERKAATEENEDEVNMEITNARKSLTRFLCRMAKQLPSYE
        EEE GPHVD ATRWLDGL++KD N+TNE+YSLQ+VEFDRQELRRLVRKVH+VEERKAATEENEDEVN+ IT ARK L RFLCRMAKQLP YE
Subjt:  EEE-GPHVDTATRWLDGLINKDNNITNEVYSLQSVEFDRQELRRLVRKVHLVEERKAATEENEDEVNMEITNARKSLTRFLCRMAKQLPSYE

TrEMBL top hitse value%identityAlignment
A0A0A0LH61 Uncharacterized protein3.4e-8062.37Show/hide
Query:  MGHMVRALRQWP-MLQKHWCGCAVHQFLTSSPTCVTNRIDSRRLSLATVHSARREVQYGSKGLRLSNAPAPTKFQEDESV-DDDSDAKKSRNQLKREARR
        M HMVRALRQWP M+QKH CGCAVH FL SSP  V  RI SRRLSLATVHSARREVQY SKGLRLS APA  K QE ES+ DDD D +KSRNQLKREARR
Subjt:  MGHMVRALRQWP-MLQKHWCGCAVHQFLTSSPTCVTNRIDSRRLSLATVHSARREVQYGSKGLRLSNAPAPTKFQEDESV-DDDSDAKKSRNQLKREARR

Query:  AVQWGMELAAFSTPQIKLILR------------------GLDVMSEKESEGSSV-----------------ILATKDGDQGMLKTLSGSIAFDDDDDAES
        AVQWGM+LA FST QIK IL                   G DV   K  + + +                 I +TK GD  +L+ L  S+  +       
Subjt:  AVQWGMELAAFSTPQIKLILR------------------GLDVMSEKESEGSSV-----------------ILATKDGDQGMLKTLSGSIAFDDDDDAES

Query:  EYEEEEEGPHVDTATRWLDGLINKDNNITNEVYSLQSVEFDRQELRRLVRKVHLVEERKAATEENEDEVNMEITNARKSLTRFLCRMAKQLPSYE
        E EEEEEGPHVD ATRWLDGLI+K+N IT E+YSLQ+VEFDRQELRRLVRKVH+VEERKAA EEN DEVN  +TNARK L RFLCRMAKQLPS E
Subjt:  EYEEEEEGPHVDTATRWLDGLINKDNNITNEVYSLQSVEFDRQELRRLVRKVHLVEERKAATEENEDEVNMEITNARKSLTRFLCRMAKQLPSYE

A0A1S4E5J2 UPF0307 protein plu4061 isoform X21.7e-7962.71Show/hide
Query:  MGHMVRALRQW-PMLQKHWCGCAVHQFLTSSPTCVTNRIDSRRLSLATVHSARREVQYGSKGLRLSNAPAPTKFQEDESV-DDDSDAKKSRNQLKREARR
        M HMVRALRQW PMLQKH CGCAVH FL+ SP  V  RI SRRLSLATVHSARREVQY SKGLRLS APA  K QEDES+ DDDSD +KSRNQLKREARR
Subjt:  MGHMVRALRQW-PMLQKHWCGCAVHQFLTSSPTCVTNRIDSRRLSLATVHSARREVQYGSKGLRLSNAPAPTKFQEDESV-DDDSDAKKSRNQLKREARR

Query:  AVQWGMELAAFSTPQIKLILR------------------GLDVMSEKESEGSSV-----------------ILATKDGDQGMLKTLSGSIAFDDDDDAES
        AVQWGM+LA FST QIK IL                   G DV   +  + + +                 I ATK GD  +L+ L  S+  +       
Subjt:  AVQWGMELAAFSTPQIKLILR------------------GLDVMSEKESEGSSV-----------------ILATKDGDQGMLKTLSGSIAFDDDDDAES

Query:  EYEEEEEGPHVDTATRWLDGLINKDNNITNEVYSLQSVEFDRQELRRLVRKVHLVEERKAATEENEDEVNMEITNARKSLTRFLCRMAKQLPSYE
        E EEEEEGPHVD ATRW DGLI+KDN IT E+YS Q+VEFDRQELRRLVRKVH+VEERKAA EEN DEVN  ITNARK L RFL RMAKQLPS E
Subjt:  EYEEEEEGPHVDTATRWLDGLINKDNNITNEVYSLQSVEFDRQELRRLVRKVHLVEERKAATEENEDEVNMEITNARKSLTRFLCRMAKQLPSYE

A0A6J1CG71 uncharacterized protein LOC1110111133.1e-12684.93Show/hide
Query:  MGHMVRALRQWPMLQKHWCGCAVHQFLTSSPTCVTNRIDSRRLSLATVHSARREVQYGSKGLRLSNAPAPTKFQEDESVDDDSDAKKSRNQLKREARRAV
        MGHMVRALRQWPMLQKHWCGCAVHQFLTSSP CVTNRIDSRRLSLATVHSARREVQYGSKGLRLSNAPAPTKFQEDESVDDDSDAKKSRNQLKREARRAV
Subjt:  MGHMVRALRQWPMLQKHWCGCAVHQFLTSSPTCVTNRIDSRRLSLATVHSARREVQYGSKGLRLSNAPAPTKFQEDESVDDDSDAKKSRNQLKREARRAV

Query:  QWGMELAAFSTPQIKLILR------------------GLDVMSEKESEGS---------------SVILATKDGDQGMLKTLSGSIAFDDDDDAESEYEE
        QWGMELAAFSTPQIKLILR                  G DV   K  + S               S+I ATKDGDQGMLKTLSGSIAFDDDDDAESEYEE
Subjt:  QWGMELAAFSTPQIKLILR------------------GLDVMSEKESEGS---------------SVILATKDGDQGMLKTLSGSIAFDDDDDAESEYEE

Query:  EEEGPHVDTATRWLDGLINKDNNITNEVYSLQSVEFDRQELRRLVRKVHLVEERKAATEENEDEVNMEITNARKSLTRFLCRMAKQLPSYEF
        EEEGPHVDTATRWLDGLINKDNNITNEVYSLQSVEFDRQELRRLVRKVHLVEERKAATEENEDEVNMEITNARKSLTRFLCRMAKQLPSYEF
Subjt:  EEEGPHVDTATRWLDGLINKDNNITNEVYSLQSVEFDRQELRRLVRKVHLVEERKAATEENEDEVNMEITNARKSLTRFLCRMAKQLPSYEF

A0A6J1FF35 uncharacterized protein LOC1114435072.3e-9769.18Show/hide
Query:  MGHMVRALRQWPMLQKHWCGCAVHQFLTSSPTCVTNRIDSRRLSLATVHSARREVQYGSKGLRLSNAPAPTKFQEDESVDDDSDAKKSRNQLKREARRAV
        MGHMVRALR WPMLQ H  GC VH FL SSP  V  RIDSRRL+LATVHSARREVQYGSKGLRLS A AP +FQEDESVD+D D +KSRNQLKREARRAV
Subjt:  MGHMVRALRQWPMLQKHWCGCAVHQFLTSSPTCVTNRIDSRRLSLATVHSARREVQYGSKGLRLSNAPAPTKFQEDESVDDDSDAKKSRNQLKREARRAV

Query:  QWGMELAAFSTPQIKLILR------------------GLDVMSEKESEGS---------------SVILATKDGDQGMLKTLSGSIAFDDDDDAESEY-E
        QWGM+LAAFSTPQIK ILR                  G DV   K  + S               S+I ATKDGD  ML+TLSGS+A DDD+D +SEY E
Subjt:  QWGMELAAFSTPQIKLILR------------------GLDVMSEKESEGS---------------SVILATKDGDQGMLKTLSGSIAFDDDDDAESEY-E

Query:  EEEEGPHVDTATRWLDGLINKDNNITNEVYSLQSVEFDRQELRRLVRKVHLVEERKAATEENEDEVNMEITNARKSLTRFLCRMAKQLPSYE
        EEEEGPHVD  TRWLDGL++KD N+TNE+YSLQ+VEFDRQELRRLVRKVH+VEERKAATEENEDEVN+ IT A+K L RFLCRMAKQLP YE
Subjt:  EEEEGPHVDTATRWLDGLINKDNNITNEVYSLQSVEFDRQELRRLVRKVHLVEERKAATEENEDEVNMEITNARKSLTRFLCRMAKQLPSYE

A0A6J1IM05 uncharacterized protein LOC1114768129.8e-9668.49Show/hide
Query:  MGHMVRALRQWPMLQKHWCGCAVHQFLTSSPTCVTNRIDSRRLSLATVHSARREVQYGSKGLRLSNAPAPTKFQEDESVDDDSDAKKSRNQLKREARRAV
        MGHMVRALR WPMLQ H  GC VH FL SSP  V  RIDS RL+LATVHSARREVQ+GSKGLRLS A AP +FQEDESVD+D D +KSRNQLKREARRAV
Subjt:  MGHMVRALRQWPMLQKHWCGCAVHQFLTSSPTCVTNRIDSRRLSLATVHSARREVQYGSKGLRLSNAPAPTKFQEDESVDDDSDAKKSRNQLKREARRAV

Query:  QWGMELAAFSTPQIKLILR------------------GLDVMSEKESEGS---------------SVILATKDGDQGMLKTLSGSIAFDDDDDAESEY-E
        QWGM+LAAFSTPQIK ILR                  G DV   K  + S               S+I ATKDGD   L+TLSGS+A DDD+D +SEY E
Subjt:  QWGMELAAFSTPQIKLILR------------------GLDVMSEKESEGS---------------SVILATKDGDQGMLKTLSGSIAFDDDDDAESEY-E

Query:  EEEEGPHVDTATRWLDGLINKDNNITNEVYSLQSVEFDRQELRRLVRKVHLVEERKAATEENEDEVNMEITNARKSLTRFLCRMAKQLPSYE
        EEEEGPHVD ATRWLDGL++KD N+TNE+YSLQ+VEFDRQELRRLVRKVH+VEERKAATEENEDEVN+ IT A+K L RFLCRMAKQLP YE
Subjt:  EEEEGPHVDTATRWLDGLINKDNNITNEVYSLQSVEFDRQELRRLVRKVHLVEERKAATEENEDEVNMEITNARKSLTRFLCRMAKQLPSYE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT4G24175.1 unknown protein1.5e-3544.21Show/hide
Query:  PTKFQEDESVDDD---SDAKKSRNQLKREARRAVQWGMELAAFSTPQIKLILR------------------GLDVMSEKESEGS---------------S
        PT    +E  D+D   SD+ +SRNQ KR+ARRAV+WGMELA+FS  Q+K IL+                  G DV   K    +               +
Subjt:  PTKFQEDESVDDD---SDAKKSRNQLKREARRAVQWGMELAAFSTPQIKLILR------------------GLDVMSEKESEGS---------------S

Query:  VILATKDGDQGMLKTLSGSI---------AFDDDDDAESEYEEEEEGPHVDTATRWLDGLINKDNNITNEVYSLQSVEFDRQELRRLVRKVHLV-EERKA
        +I ATK GD   L+TL  S          ++DDD + ESE EEE    +   A RW DGLI+++  +T EVYSLQSV+FDRQELR+LVRKV LV E+RK 
Subjt:  VILATKDGDQGMLKTLSGSI---------AFDDDDDAESEYEEEEEGPHVDTATRWLDGLINKDNNITNEVYSLQSVEFDRQELRRLVRKVHLV-EERKA

Query:  ATEENEDEVNMEITNARKSLTRFLCRMAKQLPS
         TEE + EV   +  A KSL +FLC MAKQ+ S
Subjt:  ATEENEDEVNMEITNARKSLTRFLCRMAKQLPS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGTCATATGGTTCGGGCTCTCCGGCAATGGCCGATGTTGCAGAAACATTGGTGCGGTTGCGCCGTGCATCAATTTCTCACCTCATCTCCGACGTGTGTGACCAACAG
AATTGACTCTCGTCGACTCTCTTTAGCTACCGTCCATTCCGCTCGTCGCGAAGTCCAATATGGTTCAAAAGGACTCAGATTATCCAATGCTCCAGCACCGACTAAATTCC
AAGAAGATGAGAGCGTTGATGACGATTCTGATGCTAAAAAGAGCCGCAACCAGCTGAAACGTGAAGCTCGACGCGCCGTCCAATGGGGCATGGAGCTCGCCGCCTTCTCC
ACTCCTCAAATTAAACTCATCCTTAGAGGCTTGGACGTGATGTCAGAGAAGGAAAGCGAAGGCAGTTCAGTTATATTGGCCACAAAAGACGGTGACCAGGGCATGCTAAA
GACACTATCTGGTTCAATAGCTTTTGATGATGATGATGATGCAGAATCCGAATATGAGGAGGAAGAAGAGGGTCCGCATGTGGATACTGCAACAAGATGGCTTGATGGGT
TAATCAATAAGGACAACAACATTACAAATGAAGTTTACTCGCTACAAAGTGTTGAGTTTGACCGTCAGGAGCTGCGTAGACTTGTTCGAAAAGTGCATTTGGTTGAAGAA
CGCAAGGCGGCAACTGAAGAGAATGAGGATGAAGTCAATATGGAAATAACAAACGCTAGAAAGTCCCTTACTCGTTTTCTTTGTAGAATGGCAAAACAATTGCCCTCCTA
TGAATTC
mRNA sequenceShow/hide mRNA sequence
ATGGGTCATATGGTTCGGGCTCTCCGGCAATGGCCGATGTTGCAGAAACATTGGTGCGGTTGCGCCGTGCATCAATTTCTCACCTCATCTCCGACGTGTGTGACCAACAG
AATTGACTCTCGTCGACTCTCTTTAGCTACCGTCCATTCCGCTCGTCGCGAAGTCCAATATGGTTCAAAAGGACTCAGATTATCCAATGCTCCAGCACCGACTAAATTCC
AAGAAGATGAGAGCGTTGATGACGATTCTGATGCTAAAAAGAGCCGCAACCAGCTGAAACGTGAAGCTCGACGCGCCGTCCAATGGGGCATGGAGCTCGCCGCCTTCTCC
ACTCCTCAAATTAAACTCATCCTTAGAGGCTTGGACGTGATGTCAGAGAAGGAAAGCGAAGGCAGTTCAGTTATATTGGCCACAAAAGACGGTGACCAGGGCATGCTAAA
GACACTATCTGGTTCAATAGCTTTTGATGATGATGATGATGCAGAATCCGAATATGAGGAGGAAGAAGAGGGTCCGCATGTGGATACTGCAACAAGATGGCTTGATGGGT
TAATCAATAAGGACAACAACATTACAAATGAAGTTTACTCGCTACAAAGTGTTGAGTTTGACCGTCAGGAGCTGCGTAGACTTGTTCGAAAAGTGCATTTGGTTGAAGAA
CGCAAGGCGGCAACTGAAGAGAATGAGGATGAAGTCAATATGGAAATAACAAACGCTAGAAAGTCCCTTACTCGTTTTCTTTGTAGAATGGCAAAACAATTGCCCTCCTA
TGAATTC
Protein sequenceShow/hide protein sequence
MGHMVRALRQWPMLQKHWCGCAVHQFLTSSPTCVTNRIDSRRLSLATVHSARREVQYGSKGLRLSNAPAPTKFQEDESVDDDSDAKKSRNQLKREARRAVQWGMELAAFS
TPQIKLILRGLDVMSEKESEGSSVILATKDGDQGMLKTLSGSIAFDDDDDAESEYEEEEEGPHVDTATRWLDGLINKDNNITNEVYSLQSVEFDRQELRRLVRKVHLVEE
RKAATEENEDEVNMEITNARKSLTRFLCRMAKQLPSYEF