; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS016792 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS016792
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionUPF0114 domain-containing protein
Genome locationscaffold9_1:59428..60603
RNA-Seq ExpressionMS016792
SyntenyMS016792
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR005134 - Uncharacterised protein family UPF0114


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022144307.1 uncharacterized protein LOC111014021 isoform X1 [Momordica charantia]7.9e-14698.57Show/hide
Query:  MAATRFFRPIRPSASVVSSSASPSPATTVRCMGRTAFNNGERGLTSGDGERRQMVTVKAAVAAPETVDTKTRELDLGSLLANLLVKLKTAVGKTKTQDFI
        MAATRFFRPIRPSASVVSSSASPSPATTVRCMGRTAFNNGERGLTSGDGERR+MVTVKAAVAAPETVDTKTRELDLGSLLANLLVKLKTAVGKTK QDFI
Subjt:  MAATRFFRPIRPSASVVSSSASPSPATTVRCMGRTAFNNGERGLTSGDGERRQMVTVKAAVAAPETVDTKTRELDLGSLLANLLVKLKTAVGKTKTQDFI

Query:  EKSIIDCRFFTLFAVAGSLLGSILCYLEGSFIVAESYLQYFHGLSQKSDQNHTVELLIQAIDMFLVGTALFVFGVGLFAMFVGPEKMKEENRHWNSGSNL
        EKSIIDCRFFTLFAVAGSLLGSILCYLEGSFIVAESYLQYFHGLSQKSDQNHTVELLIQAIDMFLVGTALFVFGVGLFAMFVGPEKMKEENRHWNSGSNL
Subjt:  EKSIIDCRFFTLFAVAGSLLGSILCYLEGSFIVAESYLQYFHGLSQKSDQNHTVELLIQAIDMFLVGTALFVFGVGLFAMFVGPEKMKEENRHWNSGSNL

Query:  FGLFYMKKLPTWVGMESVSAAKSKIGHAVVMILQVGVLEKFKSIPLNSAADLACFAAAVLISSASIFFLSKLNMGGGGGG
        FGLFYMKKLPTWVGMESVSA KSKIGHAVVMILQVGVLEKFKSIPLNSAADLACFAAAVLISSASIFFLSKLN GGGGGG
Subjt:  FGLFYMKKLPTWVGMESVSAAKSKIGHAVVMILQVGVLEKFKSIPLNSAADLACFAAAVLISSASIFFLSKLNMGGGGGG

XP_022144308.1 uncharacterized protein LOC111014021 isoform X2 [Momordica charantia]1.1e-12689.29Show/hide
Query:  MAATRFFRPIRPSASVVSSSASPSPATTVRCMGRTAFNNGERGLTSGDGERRQMVTVKAAVAAPETVDTKTRELDLGSLLANLLVKLKTAVGKTKTQDFI
        MAATRFFRPIRPSASVVSSSASPSPATTVRCMGRTAFNNGERGLTSGDGERR+MVTVKAAVAAPETVDTKTRELDLGSLLANLLVKLKTAVGKTK QDFI
Subjt:  MAATRFFRPIRPSASVVSSSASPSPATTVRCMGRTAFNNGERGLTSGDGERRQMVTVKAAVAAPETVDTKTRELDLGSLLANLLVKLKTAVGKTKTQDFI

Query:  EKSIIDCRFFTLFAVAGSLLGSILCYLEGSFIVAESYLQYFHGLSQKSDQNHTVELLIQAIDMFLVGTALFVFGVGLFAMFVGPEKMKEENRHWNSGSNL
        EK                          GSFIVAESYLQYFHGLSQKSDQNHTVELLIQAIDMFLVGTALFVFGVGLFAMFVGPEKMKEENRHWNSGSNL
Subjt:  EKSIIDCRFFTLFAVAGSLLGSILCYLEGSFIVAESYLQYFHGLSQKSDQNHTVELLIQAIDMFLVGTALFVFGVGLFAMFVGPEKMKEENRHWNSGSNL

Query:  FGLFYMKKLPTWVGMESVSAAKSKIGHAVVMILQVGVLEKFKSIPLNSAADLACFAAAVLISSASIFFLSKLNMGGGGGG
        FGLFYMKKLPTWVGMESVSA KSKIGHAVVMILQVGVLEKFKSIPLNSAADLACFAAAVLISSASIFFLSKLN GGGGGG
Subjt:  FGLFYMKKLPTWVGMESVSAAKSKIGHAVVMILQVGVLEKFKSIPLNSAADLACFAAAVLISSASIFFLSKLNMGGGGGG

XP_022951147.1 uncharacterized protein LOC111454079 [Cucurbita moschata]9.2e-11076.37Show/hide
Query:  MAATRFFRPIRPSASVVSSSASPSPAT--TVRCMGRTAFN--NGERGLTSGDGERRQMVTVK---AAVAAPETVDTKTRELDLGSLLANLLVKLKTAVGK
        MAATR  R +RPSA+VVSSS+S SP+T  TVRC+G+T  N  NGER +TSGDGERRQ+V +K   AA AAPETV+T+TRELDLGSLLANLLV+LK    K
Subjt:  MAATRFFRPIRPSASVVSSSASPSPAT--TVRCMGRTAFN--NGERGLTSGDGERRQMVTVK---AAVAAPETVDTKTRELDLGSLLANLLVKLKTAVGK

Query:  TK-----TQDFIEKSIIDCRFFTLFAVAGSLLGSILCYLEGSFIVAESYLQYFHGLSQKSDQNHTVELLIQAIDMFLVGTALFVFGVGLFAMFVGPEKMK
        TK      Q FIEK IIDCRFFTLFAVAGSLLGSILC+LEGSFIVAESYLQYF+G+S++SD++H VELLI+++DMFLVGTAL VFGVGLFAMFVG EKM 
Subjt:  TK-----TQDFIEKSIIDCRFFTLFAVAGSLLGSILCYLEGSFIVAESYLQYFHGLSQKSDQNHTVELLIQAIDMFLVGTALFVFGVGLFAMFVGPEKMK

Query:  EENRHWNSGSNLFGLFYMKKLPTWVGMESVSAAKSKIGHAVVMILQVGVLEKFKSIPLNSAADLACFAAAVLISSASIFFLSKLNMGGGGGG
        E+N  W SGSNLFGLFYMK +PTWV MESVS AKSKIGHAV+MILQVGVLEKFKSIPL+SA DLACFAAA+LISSASIFFLS+LN+GGGG G
Subjt:  EENRHWNSGSNLFGLFYMKKLPTWVGMESVSAAKSKIGHAVVMILQVGVLEKFKSIPLNSAADLACFAAAVLISSASIFFLSKLNMGGGGGG

XP_023538418.1 uncharacterized protein LOC111799204 [Cucurbita pepo subsp. pepo]1.2e-10976.37Show/hide
Query:  MAATRFFRPIRPSASVVSSSASPSPAT--TVRCMGRTAFN--NGERGLTSGDGERRQMVTVK---AAVAAPETVDTKTRELDLGSLLANLLVKLKTAVGK
        MAATR FR +RPSA+VVSSS+S SP+T  TVRC+G+T  N  NGER +TSGDGER+  V +K   AA AAPETV+TKTRELDLGSLLANLLV+LK  V K
Subjt:  MAATRFFRPIRPSASVVSSSASPSPAT--TVRCMGRTAFN--NGERGLTSGDGERRQMVTVK---AAVAAPETVDTKTRELDLGSLLANLLVKLKTAVGK

Query:  TK-----TQDFIEKSIIDCRFFTLFAVAGSLLGSILCYLEGSFIVAESYLQYFHGLSQKSDQNHTVELLIQAIDMFLVGTALFVFGVGLFAMFVGPEKMK
        TK      Q FIEK IIDCRFFTLFAVAGSLLGSILC+LEGSFIVAESYLQYF+G+S++SD++H VELLI+++DMFLVGTAL VFGVGLFAMFVG EKM 
Subjt:  TK-----TQDFIEKSIIDCRFFTLFAVAGSLLGSILCYLEGSFIVAESYLQYFHGLSQKSDQNHTVELLIQAIDMFLVGTALFVFGVGLFAMFVGPEKMK

Query:  EENRHWNSGSNLFGLFYMKKLPTWVGMESVSAAKSKIGHAVVMILQVGVLEKFKSIPLNSAADLACFAAAVLISSASIFFLSKLNMGGGGGG
        E+N+ W SGSNLFGLFYMK +PTWV MESVS AKSKIGHAV+MILQVGVLEKFKSIPL+SAADLACFA A+LISSASIFFLS+LN+GGG  G
Subjt:  EENRHWNSGSNLFGLFYMKKLPTWVGMESVSAAKSKIGHAVVMILQVGVLEKFKSIPLNSAADLACFAAAVLISSASIFFLSKLNMGGGGGG

XP_038885641.1 uncharacterized protein LOC120075956 [Benincasa hispida]2.3e-10876.12Show/hide
Query:  MAATRFFRPIRPSASV--VSSSASPSPATTVRCMGRTA--FNNGERGLTSGDGERRQMVTVKAAVAAPETVDTKTRELDLGSLLANLLVKLKTAVGKTK-
        M ATRF + +RP+++V   SSS+SPS A TVRC+G+T    NNGER +TSGDGER+Q+V VKAA AAP+TV+T+T EL+LGSLLANLLV+LKT VGKTK 
Subjt:  MAATRFFRPIRPSASV--VSSSASPSPATTVRCMGRTA--FNNGERGLTSGDGERRQMVTVKAAVAAPETVDTKTRELDLGSLLANLLVKLKTAVGKTK-

Query:  ----TQDFIEKSIIDCRFFTLFAVAGSLLGSILCYLEGSFIVAESYLQYFHGLSQKSDQNHTVELLIQAIDMFLVGTALFVFGVGLFAMFVGPEKMKEEN
             Q FIEK IIDCRFFTL AVAGSLLGSILCY+EGSFIVAESYLQYFHGLSQ S+QNHTVELLI+A+DMFLVGTAL VFGVGLFAMF+G  KMKE+N
Subjt:  ----TQDFIEKSIIDCRFFTLFAVAGSLLGSILCYLEGSFIVAESYLQYFHGLSQKSDQNHTVELLIQAIDMFLVGTALFVFGVGLFAMFVGPEKMKEEN

Query:  RHWNSGSNLFGLFYMKKLPTWVGMESVSAAKSKIGHAVVMILQVGVLEKFKSIPLNSAADLACFAAAVLISSASIFFLSKLNMGGGGGG
        R   SGSN FGLF MKK+PTWV MES+S AKSKIGHAV+MILQVGVLEKFK+IPL+SA DLACFAAAV++SSASIFFLSKLN+GGGG G
Subjt:  RHWNSGSNLFGLFYMKKLPTWVGMESVSAAKSKIGHAVVMILQVGVLEKFKSIPLNSAADLACFAAAVLISSASIFFLSKLNMGGGGGG

TrEMBL top hitse value%identityAlignment
A0A0A0LLC9 Uncharacterized protein2.4e-10875.09Show/hide
Query:  MAATRFFRPIRPSASV--VSSSASPSPATTVRCMGRTA--FNNGERGLTSGDGERRQMVTVK--AAVAAPETVDTKTRELDLGSLLANLLVKLKTAVGKT
        MAATRF + +RP+A+V   SSS+SPS  T VR +G+T    NNGER +TSG  ERRQ+VTVK  AA AAP+TV+TKT ELDLGSL+ANLL++LK  +GKT
Subjt:  MAATRFFRPIRPSASV--VSSSASPSPATTVRCMGRTA--FNNGERGLTSGDGERRQMVTVK--AAVAAPETVDTKTRELDLGSLLANLLVKLKTAVGKT

Query:  K-----TQDFIEKSIIDCRFFTLFAVAGSLLGSILCYLEGSFIVAESYLQYFHGLSQKSDQNHTVELLIQAIDMFLVGTALFVFGVGLFAMFVGPEKMKE
        K      Q FIEK IIDCRFFTL AV+GSL+GSILCY+EGSFIV ESYLQYFHGLSQ++DQ HTVELLI+A+DMFLVGTAL VFG+GLFAMFVG EKMK+
Subjt:  K-----TQDFIEKSIIDCRFFTLFAVAGSLLGSILCYLEGSFIVAESYLQYFHGLSQKSDQNHTVELLIQAIDMFLVGTALFVFGVGLFAMFVGPEKMKE

Query:  ENRHWNSGSNLFGLFYMKKLPTWVGMESVSAAKSKIGHAVVMILQVGVLEKFKSIPLNSAADLACFAAAVLISSASIFFLSKLNMGGGG
        +N+ W+S SNLFGLFYMKK+PTWV MES+SAAKSKIGHAV+MILQVGVLEKFK+IPL+SA DLACFAAAVLISSASIFFLSKLN+GGGG
Subjt:  ENRHWNSGSNLFGLFYMKKLPTWVGMESVSAAKSKIGHAVVMILQVGVLEKFKSIPLNSAADLACFAAAVLISSASIFFLSKLNMGGGG

A0A5A7VG09 UPF0114 domain-containing protein7.1e-10874.74Show/hide
Query:  MAATRFFRPIRPSASVVSSSASPSPA--TTVRCMGRTA--FNNGERGLTSGDGERRQMVTVKAAVAAPETVDTKTRELDLGSLLANLLVKLKTAVGKTK-
        MAATRF + +RP+A+V SSS+S SP+  T VR +G+T    NNGER +TSG GE RQ+V VKAA  AP+TV+TKT ELDLGSL+++LLV+LKT +GKTK 
Subjt:  MAATRFFRPIRPSASVVSSSASPSPA--TTVRCMGRTA--FNNGERGLTSGDGERRQMVTVKAAVAAPETVDTKTRELDLGSLLANLLVKLKTAVGKTK-

Query:  ----TQDFIEKSIIDCRFFTLFAVAGSLLGSILCYLEGSFIVAESYLQYFHGLSQKSDQNHTVELLIQAIDMFLVGTALFVFGVGLFAMFVGPEKMKEEN
             Q FIEK IIDCRFFTL AV+GSL+GSILCY+EGSFIVAESYLQYFH LSQ+++Q HTVELLI+A+DMFLVGTAL VFG+GLFAMFVG EKMKE+N
Subjt:  ----TQDFIEKSIIDCRFFTLFAVAGSLLGSILCYLEGSFIVAESYLQYFHGLSQKSDQNHTVELLIQAIDMFLVGTALFVFGVGLFAMFVGPEKMKEEN

Query:  RHWNSGSNLFGLFYMKKLPTWVGMESVSAAKSKIGHAVVMILQVGVLEKFKSIPLNSAADLACFAAAVLISSASIFFLSKLNMGGGGGG
        R W S SNLFGLFYMKK+PTWV MES+SAAKSKIGHAV+MILQVGVLEKFK+IPL+SA DLACFAAAVLISSASIFFLSKLN+G GG G
Subjt:  RHWNSGSNLFGLFYMKKLPTWVGMESVSAAKSKIGHAVVMILQVGVLEKFKSIPLNSAADLACFAAAVLISSASIFFLSKLNMGGGGGG

A0A6J1CSY6 uncharacterized protein LOC111014021 isoform X25.2e-12789.29Show/hide
Query:  MAATRFFRPIRPSASVVSSSASPSPATTVRCMGRTAFNNGERGLTSGDGERRQMVTVKAAVAAPETVDTKTRELDLGSLLANLLVKLKTAVGKTKTQDFI
        MAATRFFRPIRPSASVVSSSASPSPATTVRCMGRTAFNNGERGLTSGDGERR+MVTVKAAVAAPETVDTKTRELDLGSLLANLLVKLKTAVGKTK QDFI
Subjt:  MAATRFFRPIRPSASVVSSSASPSPATTVRCMGRTAFNNGERGLTSGDGERRQMVTVKAAVAAPETVDTKTRELDLGSLLANLLVKLKTAVGKTKTQDFI

Query:  EKSIIDCRFFTLFAVAGSLLGSILCYLEGSFIVAESYLQYFHGLSQKSDQNHTVELLIQAIDMFLVGTALFVFGVGLFAMFVGPEKMKEENRHWNSGSNL
        EK                          GSFIVAESYLQYFHGLSQKSDQNHTVELLIQAIDMFLVGTALFVFGVGLFAMFVGPEKMKEENRHWNSGSNL
Subjt:  EKSIIDCRFFTLFAVAGSLLGSILCYLEGSFIVAESYLQYFHGLSQKSDQNHTVELLIQAIDMFLVGTALFVFGVGLFAMFVGPEKMKEENRHWNSGSNL

Query:  FGLFYMKKLPTWVGMESVSAAKSKIGHAVVMILQVGVLEKFKSIPLNSAADLACFAAAVLISSASIFFLSKLNMGGGGGG
        FGLFYMKKLPTWVGMESVSA KSKIGHAVVMILQVGVLEKFKSIPLNSAADLACFAAAVLISSASIFFLSKLN GGGGGG
Subjt:  FGLFYMKKLPTWVGMESVSAAKSKIGHAVVMILQVGVLEKFKSIPLNSAADLACFAAAVLISSASIFFLSKLNMGGGGGG

A0A6J1CTB3 uncharacterized protein LOC111014021 isoform X13.8e-14698.57Show/hide
Query:  MAATRFFRPIRPSASVVSSSASPSPATTVRCMGRTAFNNGERGLTSGDGERRQMVTVKAAVAAPETVDTKTRELDLGSLLANLLVKLKTAVGKTKTQDFI
        MAATRFFRPIRPSASVVSSSASPSPATTVRCMGRTAFNNGERGLTSGDGERR+MVTVKAAVAAPETVDTKTRELDLGSLLANLLVKLKTAVGKTK QDFI
Subjt:  MAATRFFRPIRPSASVVSSSASPSPATTVRCMGRTAFNNGERGLTSGDGERRQMVTVKAAVAAPETVDTKTRELDLGSLLANLLVKLKTAVGKTKTQDFI

Query:  EKSIIDCRFFTLFAVAGSLLGSILCYLEGSFIVAESYLQYFHGLSQKSDQNHTVELLIQAIDMFLVGTALFVFGVGLFAMFVGPEKMKEENRHWNSGSNL
        EKSIIDCRFFTLFAVAGSLLGSILCYLEGSFIVAESYLQYFHGLSQKSDQNHTVELLIQAIDMFLVGTALFVFGVGLFAMFVGPEKMKEENRHWNSGSNL
Subjt:  EKSIIDCRFFTLFAVAGSLLGSILCYLEGSFIVAESYLQYFHGLSQKSDQNHTVELLIQAIDMFLVGTALFVFGVGLFAMFVGPEKMKEENRHWNSGSNL

Query:  FGLFYMKKLPTWVGMESVSAAKSKIGHAVVMILQVGVLEKFKSIPLNSAADLACFAAAVLISSASIFFLSKLNMGGGGGG
        FGLFYMKKLPTWVGMESVSA KSKIGHAVVMILQVGVLEKFKSIPLNSAADLACFAAAVLISSASIFFLSKLN GGGGGG
Subjt:  FGLFYMKKLPTWVGMESVSAAKSKIGHAVVMILQVGVLEKFKSIPLNSAADLACFAAAVLISSASIFFLSKLNMGGGGGG

A0A6J1GGV4 uncharacterized protein LOC1114540794.4e-11076.37Show/hide
Query:  MAATRFFRPIRPSASVVSSSASPSPAT--TVRCMGRTAFN--NGERGLTSGDGERRQMVTVK---AAVAAPETVDTKTRELDLGSLLANLLVKLKTAVGK
        MAATR  R +RPSA+VVSSS+S SP+T  TVRC+G+T  N  NGER +TSGDGERRQ+V +K   AA AAPETV+T+TRELDLGSLLANLLV+LK    K
Subjt:  MAATRFFRPIRPSASVVSSSASPSPAT--TVRCMGRTAFN--NGERGLTSGDGERRQMVTVK---AAVAAPETVDTKTRELDLGSLLANLLVKLKTAVGK

Query:  TK-----TQDFIEKSIIDCRFFTLFAVAGSLLGSILCYLEGSFIVAESYLQYFHGLSQKSDQNHTVELLIQAIDMFLVGTALFVFGVGLFAMFVGPEKMK
        TK      Q FIEK IIDCRFFTLFAVAGSLLGSILC+LEGSFIVAESYLQYF+G+S++SD++H VELLI+++DMFLVGTAL VFGVGLFAMFVG EKM 
Subjt:  TK-----TQDFIEKSIIDCRFFTLFAVAGSLLGSILCYLEGSFIVAESYLQYFHGLSQKSDQNHTVELLIQAIDMFLVGTALFVFGVGLFAMFVGPEKMK

Query:  EENRHWNSGSNLFGLFYMKKLPTWVGMESVSAAKSKIGHAVVMILQVGVLEKFKSIPLNSAADLACFAAAVLISSASIFFLSKLNMGGGGGG
        E+N  W SGSNLFGLFYMK +PTWV MESVS AKSKIGHAV+MILQVGVLEKFKSIPL+SA DLACFAAA+LISSASIFFLS+LN+GGGG G
Subjt:  EENRHWNSGSNLFGLFYMKKLPTWVGMESVSAAKSKIGHAVVMILQVGVLEKFKSIPLNSAADLACFAAAVLISSASIFFLSKLNMGGGGGG

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT4G19390.1 Uncharacterised protein family (UPF0114)3.6e-3539.66Show/hide
Query:  QDFIEKSIIDCRFFTLFAVAGSLLGSILCYLEGSFIVAESYLQYFHGLSQKSDQNHTVELLIQAIDMFLVGTALFVFGVGLFAMFVGPEKMKEENRH--W
        ++ IEK I  CRF T     GSLLGS+LC+++G   V +S+LQY        ++   + LL++AID++L+GT + VFG+GL+ +F+      E   H   
Subjt:  QDFIEKSIIDCRFFTLFAVAGSLLGSILCYLEGSFIVAESYLQYFHGLSQKSDQNHTVELLIQAIDMFLVGTALFVFGVGLFAMFVGPEKMKEENRH--W

Query:  NSGSNLFGLFYMKKLPTWVGMESVSAAKSKIGHAVVMILQVGVLEKFKSIPLNSAADLACFAAAVLISSASIFFLSKLN
        ++ S+LFG+F +K+ P W+ ++SVS  K+K+GH +VM+L +G+ +K K + + S  DL C + ++  SSA +F LS+LN
Subjt:  NSGSNLFGLFYMKKLPTWVGMESVSAAKSKIGHAVVMILQVGVLEKFKSIPLNSAADLACFAAAVLISSASIFFLSKLN

AT5G13720.1 Uncharacterised protein family (UPF0114)6.9e-3136.26Show/hide
Query:  TQDFIEKSIIDCRFFTLFAVAGSLLGSILCYLEGSFIVAESYLQYFHGLSQKSDQNHTVELLIQAIDMFLVGTALFVFGVGLFAMFV--GPEKMKEENRH
        T+  +E+ I D RF  L AV GSL GS+LC+L G   + E+Y  Y+   S+       V  L++AID++L GT + +F +GL+ +F+   P  +  E+  
Subjt:  TQDFIEKSIIDCRFFTLFAVAGSLLGSILCYLEGSFIVAESYLQYFHGLSQKSDQNHTVELLIQAIDMFLVGTALFVFGVGLFAMFV--GPEKMKEENRH

Query:  WNSGSNLFGLFYMKKLPTWVGMESVSAAKSKIGHAVVMILQVGVLEKFKSIPLNSAADLACFAAAVLISSASIFFLSKLNMG
            S+LFG+F MK+ P W+ + S+   K+K+GH +VMIL V + E+ K + + +  DL  ++  + +SSAS++ L  L+ G
Subjt:  WNSGSNLFGLFYMKKLPTWVGMESVSAAKSKIGHAVVMILQVGVLEKFKSIPLNSAADLACFAAAVLISSASIFFLSKLNMG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCAGCCACTAGATTTTTCCGGCCGATTCGGCCTTCAGCTTCTGTCGTCTCTTCCTCTGCCTCTCCATCGCCGGCGACGACGGTGAGGTGTATGGGCAGAACGGCGTT
CAACAACGGCGAACGGGGGTTAACTTCCGGCGACGGCGAGAGAAGGCAGATGGTCACCGTCAAGGCGGCGGTGGCGGCTCCCGAGACCGTGGACACCAAAACCAGAGAAC
TGGATTTGGGTTCGTTGCTGGCGAATCTTCTCGTTAAATTGAAGACCGCTGTGGGGAAGACGAAGACTCAGGACTTCATCGAAAAGAGCATAATCGACTGCCGATTCTTC
ACGTTATTCGCCGTCGCCGGATCTTTATTGGGTTCGATACTCTGCTACCTGGAGGGGAGCTTTATTGTTGCAGAGTCTTATCTGCAGTATTTCCATGGTCTCTCGCAGAA
GTCGGACCAAAATCATACGGTGGAGCTTCTAATTCAAGCCATAGATATGTTCCTCGTCGGAACTGCTCTGTTTGTTTTTGGGGTGGGATTGTTTGCAATGTTCGTTGGAC
CCGAGAAGATGAAGGAAGAAAACCGCCATTGGAATTCTGGATCCAACTTGTTTGGTCTCTTCTACATGAAGAAACTTCCGACGTGGGTGGGAATGGAATCGGTGTCGGCG
GCGAAGTCGAAGATCGGGCATGCGGTGGTGATGATTCTACAAGTGGGTGTGTTGGAGAAGTTCAAGAGTATACCTTTGAACTCTGCCGCCGATCTCGCTTGTTTCGCCGC
CGCCGTTCTGATCTCTTCCGCCTCCATCTTTTTCCTCTCTAAACTCAATATGGGCGGCGGCGGCGGCGGCGAG
mRNA sequenceShow/hide mRNA sequence
ATGGCAGCCACTAGATTTTTCCGGCCGATTCGGCCTTCAGCTTCTGTCGTCTCTTCCTCTGCCTCTCCATCGCCGGCGACGACGGTGAGGTGTATGGGCAGAACGGCGTT
CAACAACGGCGAACGGGGGTTAACTTCCGGCGACGGCGAGAGAAGGCAGATGGTCACCGTCAAGGCGGCGGTGGCGGCTCCCGAGACCGTGGACACCAAAACCAGAGAAC
TGGATTTGGGTTCGTTGCTGGCGAATCTTCTCGTTAAATTGAAGACCGCTGTGGGGAAGACGAAGACTCAGGACTTCATCGAAAAGAGCATAATCGACTGCCGATTCTTC
ACGTTATTCGCCGTCGCCGGATCTTTATTGGGTTCGATACTCTGCTACCTGGAGGGGAGCTTTATTGTTGCAGAGTCTTATCTGCAGTATTTCCATGGTCTCTCGCAGAA
GTCGGACCAAAATCATACGGTGGAGCTTCTAATTCAAGCCATAGATATGTTCCTCGTCGGAACTGCTCTGTTTGTTTTTGGGGTGGGATTGTTTGCAATGTTCGTTGGAC
CCGAGAAGATGAAGGAAGAAAACCGCCATTGGAATTCTGGATCCAACTTGTTTGGTCTCTTCTACATGAAGAAACTTCCGACGTGGGTGGGAATGGAATCGGTGTCGGCG
GCGAAGTCGAAGATCGGGCATGCGGTGGTGATGATTCTACAAGTGGGTGTGTTGGAGAAGTTCAAGAGTATACCTTTGAACTCTGCCGCCGATCTCGCTTGTTTCGCCGC
CGCCGTTCTGATCTCTTCCGCCTCCATCTTTTTCCTCTCTAAACTCAATATGGGCGGCGGCGGCGGCGGCGAG
Protein sequenceShow/hide protein sequence
MAATRFFRPIRPSASVVSSSASPSPATTVRCMGRTAFNNGERGLTSGDGERRQMVTVKAAVAAPETVDTKTRELDLGSLLANLLVKLKTAVGKTKTQDFIEKSIIDCRFF
TLFAVAGSLLGSILCYLEGSFIVAESYLQYFHGLSQKSDQNHTVELLIQAIDMFLVGTALFVFGVGLFAMFVGPEKMKEENRHWNSGSNLFGLFYMKKLPTWVGMESVSA
AKSKIGHAVVMILQVGVLEKFKSIPLNSAADLACFAAAVLISSASIFFLSKLNMGGGGGGE