; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MC08g0645 (gene) of Bitter gourd (Dali-11) v1 genome

Gene IDMC08g0645
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
DescriptionUPF0114 domain-containing protein
Genome locationMC08:5237132..5239125
RNA-Seq ExpressionMC08g0645
SyntenyMC08g0645
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR005134 - Uncharacterised protein family UPF0114


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022144307.1 uncharacterized protein LOC111014021 isoform X1 [Momordica charantia]1.54e-191100Show/hide
Query:  MAATRFFRPIRPSASVVSSSASPSPATTVRCMGRTAFNNGERGLTSGDGERRKMVTVKAAVAAPETVDTKTRELDLGSLLANLLVKLKTAVGKTKIQDFI
        MAATRFFRPIRPSASVVSSSASPSPATTVRCMGRTAFNNGERGLTSGDGERRKMVTVKAAVAAPETVDTKTRELDLGSLLANLLVKLKTAVGKTKIQDFI
Subjt:  MAATRFFRPIRPSASVVSSSASPSPATTVRCMGRTAFNNGERGLTSGDGERRKMVTVKAAVAAPETVDTKTRELDLGSLLANLLVKLKTAVGKTKIQDFI

Query:  EKSIIDCRFFTLFAVAGSLLGSILCYLEGSFIVAESYLQYFHGLSQKSDQNHTVELLIQAIDMFLVGTALFVFGVGLFAMFVGPEKMKEENRHWNSGSNL
        EKSIIDCRFFTLFAVAGSLLGSILCYLEGSFIVAESYLQYFHGLSQKSDQNHTVELLIQAIDMFLVGTALFVFGVGLFAMFVGPEKMKEENRHWNSGSNL
Subjt:  EKSIIDCRFFTLFAVAGSLLGSILCYLEGSFIVAESYLQYFHGLSQKSDQNHTVELLIQAIDMFLVGTALFVFGVGLFAMFVGPEKMKEENRHWNSGSNL

Query:  FGLFYMKKLPTWVGMESVSAVKSKIGHAVVMILQVGVLEKFKSIPLNSAADLACFAAAVLISSASIFFLSKLNTGGGGGGGE
        FGLFYMKKLPTWVGMESVSAVKSKIGHAVVMILQVGVLEKFKSIPLNSAADLACFAAAVLISSASIFFLSKLNTGGGGGGGE
Subjt:  FGLFYMKKLPTWVGMESVSAVKSKIGHAVVMILQVGVLEKFKSIPLNSAADLACFAAAVLISSASIFFLSKLNTGGGGGGGE

XP_022144308.1 uncharacterized protein LOC111014021 isoform X2 [Momordica charantia]5.86e-16790.78Show/hide
Query:  MAATRFFRPIRPSASVVSSSASPSPATTVRCMGRTAFNNGERGLTSGDGERRKMVTVKAAVAAPETVDTKTRELDLGSLLANLLVKLKTAVGKTKIQDFI
        MAATRFFRPIRPSASVVSSSASPSPATTVRCMGRTAFNNGERGLTSGDGERRKMVTVKAAVAAPETVDTKTRELDLGSLLANLLVKLKTAVGKTKIQDFI
Subjt:  MAATRFFRPIRPSASVVSSSASPSPATTVRCMGRTAFNNGERGLTSGDGERRKMVTVKAAVAAPETVDTKTRELDLGSLLANLLVKLKTAVGKTKIQDFI

Query:  EKSIIDCRFFTLFAVAGSLLGSILCYLEGSFIVAESYLQYFHGLSQKSDQNHTVELLIQAIDMFLVGTALFVFGVGLFAMFVGPEKMKEENRHWNSGSNL
        EK                          GSFIVAESYLQYFHGLSQKSDQNHTVELLIQAIDMFLVGTALFVFGVGLFAMFVGPEKMKEENRHWNSGSNL
Subjt:  EKSIIDCRFFTLFAVAGSLLGSILCYLEGSFIVAESYLQYFHGLSQKSDQNHTVELLIQAIDMFLVGTALFVFGVGLFAMFVGPEKMKEENRHWNSGSNL

Query:  FGLFYMKKLPTWVGMESVSAVKSKIGHAVVMILQVGVLEKFKSIPLNSAADLACFAAAVLISSASIFFLSKLNTGGGGGGGE
        FGLFYMKKLPTWVGMESVSAVKSKIGHAVVMILQVGVLEKFKSIPLNSAADLACFAAAVLISSASIFFLSKLNTGGGGGGGE
Subjt:  FGLFYMKKLPTWVGMESVSAVKSKIGHAVVMILQVGVLEKFKSIPLNSAADLACFAAAVLISSASIFFLSKLNTGGGGGGGE

XP_022951147.1 uncharacterized protein LOC111454079 [Cucurbita moschata]2.13e-13976.11Show/hide
Query:  MAATRFFRPIRPSASVVSSSASPSPAT--TVRCMGRTAFN--NGERGLTSGDGERRKMVTVKAAVAA---PETVDTKTRELDLGSLLANLLVKLKTAVGK
        MAATR  R +RPSA+VVSSS+S SP+T  TVRC+G+T  N  NGER +TSGDGERR++V +KAA AA   PETV+T+TRELDLGSLLANLLV+LK    K
Subjt:  MAATRFFRPIRPSASVVSSSASPSPAT--TVRCMGRTAFN--NGERGLTSGDGERRKMVTVKAAVAA---PETVDTKTRELDLGSLLANLLVKLKTAVGK

Query:  TKI-----QDFIEKSIIDCRFFTLFAVAGSLLGSILCYLEGSFIVAESYLQYFHGLSQKSDQNHTVELLIQAIDMFLVGTALFVFGVGLFAMFVGPEKMK
        TKI     Q FIEK IIDCRFFTLFAVAGSLLGSILC+LEGSFIVAESYLQYF+G+S++SD++H VELLI+++DMFLVGTAL VFGVGLFAMFVG EKM 
Subjt:  TKI-----QDFIEKSIIDCRFFTLFAVAGSLLGSILCYLEGSFIVAESYLQYFHGLSQKSDQNHTVELLIQAIDMFLVGTALFVFGVGLFAMFVGPEKMK

Query:  EENRHWNSGSNLFGLFYMKKLPTWVGMESVSAVKSKIGHAVVMILQVGVLEKFKSIPLNSAADLACFAAAVLISSASIFFLSKLNTGGGGGGG
        E+N  W SGSNLFGLFYMK +PTWV MESVS  KSKIGHAV+MILQVGVLEKFKSIPL+SA DLACFAAA+LISSASIFFLS+LN GGGG GG
Subjt:  EENRHWNSGSNLFGLFYMKKLPTWVGMESVSAVKSKIGHAVVMILQVGVLEKFKSIPLNSAADLACFAAAVLISSASIFFLSKLNTGGGGGGG

XP_023538418.1 uncharacterized protein LOC111799204 [Cucurbita pepo subsp. pepo]7.45e-14076.82Show/hide
Query:  MAATRFFRPIRPSASVVSSSASPSPAT--TVRCMGRTAFN--NGERGLTSGDGERRKMVTVKAAVAA---PETVDTKTRELDLGSLLANLLVKLKTAVGK
        MAATR FR +RPSA+VVSSS+S SP+T  TVRC+G+T  N  NGER +TSGDGER+  V +KAA AA   PETV+TKTRELDLGSLLANLLV+LK  V K
Subjt:  MAATRFFRPIRPSASVVSSSASPSPAT--TVRCMGRTAFN--NGERGLTSGDGERRKMVTVKAAVAA---PETVDTKTRELDLGSLLANLLVKLKTAVGK

Query:  TKI-----QDFIEKSIIDCRFFTLFAVAGSLLGSILCYLEGSFIVAESYLQYFHGLSQKSDQNHTVELLIQAIDMFLVGTALFVFGVGLFAMFVGPEKMK
        TKI     Q FIEK IIDCRFFTLFAVAGSLLGSILC+LEGSFIVAESYLQYF+G+S++SD++H VELLI+++DMFLVGTAL VFGVGLFAMFVG EKM 
Subjt:  TKI-----QDFIEKSIIDCRFFTLFAVAGSLLGSILCYLEGSFIVAESYLQYFHGLSQKSDQNHTVELLIQAIDMFLVGTALFVFGVGLFAMFVGPEKMK

Query:  EENRHWNSGSNLFGLFYMKKLPTWVGMESVSAVKSKIGHAVVMILQVGVLEKFKSIPLNSAADLACFAAAVLISSASIFFLSKLNTGGG
        E+N+ W SGSNLFGLFYMK +PTWV MESVS  KSKIGHAV+MILQVGVLEKFKSIPL+SAADLACFA A+LISSASIFFLS+LN GGG
Subjt:  EENRHWNSGSNLFGLFYMKKLPTWVGMESVSAVKSKIGHAVVMILQVGVLEKFKSIPLNSAADLACFAAAVLISSASIFFLSKLNTGGG

XP_038885641.1 uncharacterized protein LOC120075956 [Benincasa hispida]4.39e-13775.86Show/hide
Query:  MAATRFFRPIRPSASVVSSSASPSP--ATTVRCMGRTAFN--NGERGLTSGDGERRKMVTVKAAVAAPETVDTKTRELDLGSLLANLLVKLKTAVGKTKI
        M ATRF + +RP+++V SSS+S SP  A TVRC+G+T  N  NGER +TSGDGER+++V VKAA AAP+TV+T+T EL+LGSLLANLLV+LKT VGKTKI
Subjt:  MAATRFFRPIRPSASVVSSSASPSP--ATTVRCMGRTAFN--NGERGLTSGDGERRKMVTVKAAVAAPETVDTKTRELDLGSLLANLLVKLKTAVGKTKI

Query:  Q-----DFIEKSIIDCRFFTLFAVAGSLLGSILCYLEGSFIVAESYLQYFHGLSQKSDQNHTVELLIQAIDMFLVGTALFVFGVGLFAMFVGPEKMKEEN
        Q      FIEK IIDCRFFTL AVAGSLLGSILCY+EGSFIVAESYLQYFHGLSQ S+QNHTVELLI+A+DMFLVGTAL VFGVGLFAMF+G  KMKE+N
Subjt:  Q-----DFIEKSIIDCRFFTLFAVAGSLLGSILCYLEGSFIVAESYLQYFHGLSQKSDQNHTVELLIQAIDMFLVGTALFVFGVGLFAMFVGPEKMKEEN

Query:  RHWNSGSNLFGLFYMKKLPTWVGMESVSAVKSKIGHAVVMILQVGVLEKFKSIPLNSAADLACFAAAVLISSASIFFLSKLNTGGGGGGG
        R   SGSN FGLF MKK+PTWV MES+S  KSKIGHAV+MILQVGVLEKFK+IPL+SA DLACFAAAV++SSASIFFLSKLN GGGG GG
Subjt:  RHWNSGSNLFGLFYMKKLPTWVGMESVSAVKSKIGHAVVMILQVGVLEKFKSIPLNSAADLACFAAAVLISSASIFFLSKLNTGGGGGGG

TrEMBL top hitse value%identityAlignment
A0A0A0LLC9 Uncharacterized protein2.71e-13774.74Show/hide
Query:  MAATRFFRPIRPSASVV--SSSASPSPATTVRCMGRTAFN--NGERGLTSGDGERRKMVTVKAAVA--APETVDTKTRELDLGSLLANLLVKLKTAVGKT
        MAATRF + +RP+A+V   SSS+SPS  T VR +G+T  N  NGER +TSG  ERR++VTVKAA A  AP+TV+TKT ELDLGSL+ANLL++LK  +GKT
Subjt:  MAATRFFRPIRPSASVV--SSSASPSPATTVRCMGRTAFN--NGERGLTSGDGERRKMVTVKAAVA--APETVDTKTRELDLGSLLANLLVKLKTAVGKT

Query:  KI-----QDFIEKSIIDCRFFTLFAVAGSLLGSILCYLEGSFIVAESYLQYFHGLSQKSDQNHTVELLIQAIDMFLVGTALFVFGVGLFAMFVGPEKMKE
        KI     Q FIEK IIDCRFFTL AV+GSL+GSILCY+EGSFIV ESYLQYFHGLSQ++DQ HTVELLI+A+DMFLVGTAL VFG+GLFAMFVG EKMK+
Subjt:  KI-----QDFIEKSIIDCRFFTLFAVAGSLLGSILCYLEGSFIVAESYLQYFHGLSQKSDQNHTVELLIQAIDMFLVGTALFVFGVGLFAMFVGPEKMKE

Query:  ENRHWNSGSNLFGLFYMKKLPTWVGMESVSAVKSKIGHAVVMILQVGVLEKFKSIPLNSAADLACFAAAVLISSASIFFLSKLNTGGGG
        +N+ W+S SNLFGLFYMKK+PTWV MES+SA KSKIGHAV+MILQVGVLEKFK+IPL+SA DLACFAAAVLISSASIFFLSKLN GGGG
Subjt:  ENRHWNSGSNLFGLFYMKKLPTWVGMESVSAVKSKIGHAVVMILQVGVLEKFKSIPLNSAADLACFAAAVLISSASIFFLSKLNTGGGG

A0A5A7VG09 UPF0114 domain-containing protein5.09e-13774.48Show/hide
Query:  MAATRFFRPIRPSASVVSSSASPSPA--TTVRCMGRTAFN--NGERGLTSGDGERRKMVTVKAAVAAPETVDTKTRELDLGSLLANLLVKLKTAVGKTKI
        MAATRF + +RP+A+V SSS+S SP+  T VR +G+T  N  NGER +TSG GE R++V VKAA  AP+TV+TKT ELDLGSL+++LLV+LKT +GKTKI
Subjt:  MAATRFFRPIRPSASVVSSSASPSPA--TTVRCMGRTAFN--NGERGLTSGDGERRKMVTVKAAVAAPETVDTKTRELDLGSLLANLLVKLKTAVGKTKI

Query:  -----QDFIEKSIIDCRFFTLFAVAGSLLGSILCYLEGSFIVAESYLQYFHGLSQKSDQNHTVELLIQAIDMFLVGTALFVFGVGLFAMFVGPEKMKEEN
             Q FIEK IIDCRFFTL AV+GSL+GSILCY+EGSFIVAESYLQYFH LSQ+++Q HTVELLI+A+DMFLVGTAL VFG+GLFAMFVG EKMKE+N
Subjt:  -----QDFIEKSIIDCRFFTLFAVAGSLLGSILCYLEGSFIVAESYLQYFHGLSQKSDQNHTVELLIQAIDMFLVGTALFVFGVGLFAMFVGPEKMKEEN

Query:  RHWNSGSNLFGLFYMKKLPTWVGMESVSAVKSKIGHAVVMILQVGVLEKFKSIPLNSAADLACFAAAVLISSASIFFLSKLNTGGGGGGG
        R W S SNLFGLFYMKK+PTWV MES+SA KSKIGHAV+MILQVGVLEKFK+IPL+SA DLACFAAAVLISSASIFFLSKLN G GG GG
Subjt:  RHWNSGSNLFGLFYMKKLPTWVGMESVSAVKSKIGHAVVMILQVGVLEKFKSIPLNSAADLACFAAAVLISSASIFFLSKLNTGGGGGGG

A0A6J1CSY6 uncharacterized protein LOC111014021 isoform X22.84e-16790.78Show/hide
Query:  MAATRFFRPIRPSASVVSSSASPSPATTVRCMGRTAFNNGERGLTSGDGERRKMVTVKAAVAAPETVDTKTRELDLGSLLANLLVKLKTAVGKTKIQDFI
        MAATRFFRPIRPSASVVSSSASPSPATTVRCMGRTAFNNGERGLTSGDGERRKMVTVKAAVAAPETVDTKTRELDLGSLLANLLVKLKTAVGKTKIQDFI
Subjt:  MAATRFFRPIRPSASVVSSSASPSPATTVRCMGRTAFNNGERGLTSGDGERRKMVTVKAAVAAPETVDTKTRELDLGSLLANLLVKLKTAVGKTKIQDFI

Query:  EKSIIDCRFFTLFAVAGSLLGSILCYLEGSFIVAESYLQYFHGLSQKSDQNHTVELLIQAIDMFLVGTALFVFGVGLFAMFVGPEKMKEENRHWNSGSNL
        EK                          GSFIVAESYLQYFHGLSQKSDQNHTVELLIQAIDMFLVGTALFVFGVGLFAMFVGPEKMKEENRHWNSGSNL
Subjt:  EKSIIDCRFFTLFAVAGSLLGSILCYLEGSFIVAESYLQYFHGLSQKSDQNHTVELLIQAIDMFLVGTALFVFGVGLFAMFVGPEKMKEENRHWNSGSNL

Query:  FGLFYMKKLPTWVGMESVSAVKSKIGHAVVMILQVGVLEKFKSIPLNSAADLACFAAAVLISSASIFFLSKLNTGGGGGGGE
        FGLFYMKKLPTWVGMESVSAVKSKIGHAVVMILQVGVLEKFKSIPLNSAADLACFAAAVLISSASIFFLSKLNTGGGGGGGE
Subjt:  FGLFYMKKLPTWVGMESVSAVKSKIGHAVVMILQVGVLEKFKSIPLNSAADLACFAAAVLISSASIFFLSKLNTGGGGGGGE

A0A6J1CTB3 uncharacterized protein LOC111014021 isoform X17.46e-192100Show/hide
Query:  MAATRFFRPIRPSASVVSSSASPSPATTVRCMGRTAFNNGERGLTSGDGERRKMVTVKAAVAAPETVDTKTRELDLGSLLANLLVKLKTAVGKTKIQDFI
        MAATRFFRPIRPSASVVSSSASPSPATTVRCMGRTAFNNGERGLTSGDGERRKMVTVKAAVAAPETVDTKTRELDLGSLLANLLVKLKTAVGKTKIQDFI
Subjt:  MAATRFFRPIRPSASVVSSSASPSPATTVRCMGRTAFNNGERGLTSGDGERRKMVTVKAAVAAPETVDTKTRELDLGSLLANLLVKLKTAVGKTKIQDFI

Query:  EKSIIDCRFFTLFAVAGSLLGSILCYLEGSFIVAESYLQYFHGLSQKSDQNHTVELLIQAIDMFLVGTALFVFGVGLFAMFVGPEKMKEENRHWNSGSNL
        EKSIIDCRFFTLFAVAGSLLGSILCYLEGSFIVAESYLQYFHGLSQKSDQNHTVELLIQAIDMFLVGTALFVFGVGLFAMFVGPEKMKEENRHWNSGSNL
Subjt:  EKSIIDCRFFTLFAVAGSLLGSILCYLEGSFIVAESYLQYFHGLSQKSDQNHTVELLIQAIDMFLVGTALFVFGVGLFAMFVGPEKMKEENRHWNSGSNL

Query:  FGLFYMKKLPTWVGMESVSAVKSKIGHAVVMILQVGVLEKFKSIPLNSAADLACFAAAVLISSASIFFLSKLNTGGGGGGGE
        FGLFYMKKLPTWVGMESVSAVKSKIGHAVVMILQVGVLEKFKSIPLNSAADLACFAAAVLISSASIFFLSKLNTGGGGGGGE
Subjt:  FGLFYMKKLPTWVGMESVSAVKSKIGHAVVMILQVGVLEKFKSIPLNSAADLACFAAAVLISSASIFFLSKLNTGGGGGGGE

A0A6J1GGV4 uncharacterized protein LOC1114540791.03e-13976.11Show/hide
Query:  MAATRFFRPIRPSASVVSSSASPSPAT--TVRCMGRTAFN--NGERGLTSGDGERRKMVTVKAAVAA---PETVDTKTRELDLGSLLANLLVKLKTAVGK
        MAATR  R +RPSA+VVSSS+S SP+T  TVRC+G+T  N  NGER +TSGDGERR++V +KAA AA   PETV+T+TRELDLGSLLANLLV+LK    K
Subjt:  MAATRFFRPIRPSASVVSSSASPSPAT--TVRCMGRTAFN--NGERGLTSGDGERRKMVTVKAAVAA---PETVDTKTRELDLGSLLANLLVKLKTAVGK

Query:  TKI-----QDFIEKSIIDCRFFTLFAVAGSLLGSILCYLEGSFIVAESYLQYFHGLSQKSDQNHTVELLIQAIDMFLVGTALFVFGVGLFAMFVGPEKMK
        TKI     Q FIEK IIDCRFFTLFAVAGSLLGSILC+LEGSFIVAESYLQYF+G+S++SD++H VELLI+++DMFLVGTAL VFGVGLFAMFVG EKM 
Subjt:  TKI-----QDFIEKSIIDCRFFTLFAVAGSLLGSILCYLEGSFIVAESYLQYFHGLSQKSDQNHTVELLIQAIDMFLVGTALFVFGVGLFAMFVGPEKMK

Query:  EENRHWNSGSNLFGLFYMKKLPTWVGMESVSAVKSKIGHAVVMILQVGVLEKFKSIPLNSAADLACFAAAVLISSASIFFLSKLNTGGGGGGG
        E+N  W SGSNLFGLFYMK +PTWV MESVS  KSKIGHAV+MILQVGVLEKFKSIPL+SA DLACFAAA+LISSASIFFLS+LN GGGG GG
Subjt:  EENRHWNSGSNLFGLFYMKKLPTWVGMESVSAVKSKIGHAVVMILQVGVLEKFKSIPLNSAADLACFAAAVLISSASIFFLSKLNTGGGGGGG

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT4G19390.1 Uncharacterised protein family (UPF0114)2.1e-3539.44Show/hide
Query:  IQDFIEKSIIDCRFFTLFAVAGSLLGSILCYLEGSFIVAESYLQYFHGLSQKSDQNHTVELLIQAIDMFLVGTALFVFGVGLFAMFVGPEKMKEENRH--
        +++ IEK I  CRF T     GSLLGS+LC+++G   V +S+LQY        ++   + LL++AID++L+GT + VFG+GL+ +F+      E   H  
Subjt:  IQDFIEKSIIDCRFFTLFAVAGSLLGSILCYLEGSFIVAESYLQYFHGLSQKSDQNHTVELLIQAIDMFLVGTALFVFGVGLFAMFVGPEKMKEENRH--

Query:  WNSGSNLFGLFYMKKLPTWVGMESVSAVKSKIGHAVVMILQVGVLEKFKSIPLNSAADLACFAAAVLISSASIFFLSKLN
         ++ S+LFG+F +K+ P W+ ++SVS +K+K+GH +VM+L +G+ +K K + + S  DL C + ++  SSA +F LS+LN
Subjt:  WNSGSNLFGLFYMKKLPTWVGMESVSAVKSKIGHAVVMILQVGVLEKFKSIPLNSAADLACFAAAVLISSASIFFLSKLN

AT5G13720.1 Uncharacterised protein family (UPF0114)2.0e-3036.52Show/hide
Query:  IEKSIIDCRFFTLFAVAGSLLGSILCYLEGSFIVAESYLQYFHGLSQKSDQNHTVELLIQAIDMFLVGTALFVFGVGLFAMFV--GPEKMKEENRHWNSG
        +E+ I D RF  L AV GSL GS+LC+L G   + E+Y  Y+   S+       V  L++AID++L GT + +F +GL+ +F+   P  +  E+      
Subjt:  IEKSIIDCRFFTLFAVAGSLLGSILCYLEGSFIVAESYLQYFHGLSQKSDQNHTVELLIQAIDMFLVGTALFVFGVGLFAMFV--GPEKMKEENRHWNSG

Query:  SNLFGLFYMKKLPTWVGMESVSAVKSKIGHAVVMILQVGVLEKFKSIPLNSAADLACFAAAVLISSASIFFLSKLNTG
        S+LFG+F MK+ P W+ + S+  +K+K+GH +VMIL V + E+ K + + +  DL  ++  + +SSAS++ L  L+ G
Subjt:  SNLFGLFYMKKLPTWVGMESVSAVKSKIGHAVVMILQVGVLEKFKSIPLNSAADLACFAAAVLISSASIFFLSKLNTG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCAGCCACTAGATTTTTCCGGCCGATTCGGCCTTCAGCTTCTGTCGTCTCTTCCTCTGCCTCTCCATCGCCGGCGACGACGGTGAGGTGTATGGGCAGAACGGCGTT
CAACAACGGCGAACGGGGGTTAACTTCCGGCGACGGCGAGAGAAGGAAGATGGTCACCGTCAAGGCGGCGGTGGCGGCTCCGGAGACCGTGGACACCAAAACCAGAGAAC
TGGATTTGGGTTCGTTGCTGGCGAATCTTCTCGTTAAATTGAAGACCGCTGTGGGGAAGACGAAGATTCAGGACTTCATCGAAAAGAGCATAATCGACTGCCGATTCTTC
ACGTTATTCGCCGTCGCCGGATCTTTATTGGGTTCGATACTCTGCTACCTGGAGGGGAGCTTTATTGTTGCAGAGTCTTATCTGCAGTATTTCCATGGCCTCTCGCAGAA
GTCGGACCAAAATCATACGGTGGAGCTTCTAATTCAAGCCATAGATATGTTCCTCGTCGGAACTGCTCTGTTTGTTTTTGGGGTGGGATTGTTTGCAATGTTCGTTGGAC
CCGAGAAGATGAAGGAAGAAAACCGCCATTGGAATTCTGGATCCAACTTGTTTGGTCTCTTCTACATGAAGAAACTTCCGACGTGGGTGGGAATGGAATCGGTGTCGGCG
GTGAAGTCGAAGATCGGGCATGCGGTGGTGATGATTCTGCAAGTGGGTGTGTTGGAGAAGTTCAAGAGTATACCTTTGAACTCTGCCGCCGATCTCGCTTGTTTCGCCGC
CGCCGTTCTGATCTCTTCCGCCTCCATCTTTTTCCTCTCTAAACTCAATACGGGCGGCGGCGGCGGCGGCGGCGAGTGA
mRNA sequenceShow/hide mRNA sequence
ATGAATAATAATTTTGTCTTTATAAATAATTTTTTAACCTTGATATATTATACGTGATTTCAAATACTGACATAGCTCGGCTTCTCTTGCTGGCAAAATTACAAAGAAAA
GAAAAAAAAAACTAGAATTCCGGCGGGGTTGTATCATTCACCGTCCAGTCCAGCACGGCGGCAGTGCGTATTCGGTTATTCCGGCCGGTGACAAAGTTCCGAGTCTGAAC
CTTCCTGTAGTAGCCCCCACCGCCGACGCATGCCATCAAAAGTCCCCAAAAAGACTATGACCTCCACGTGTGCGGTCCCGGACACAAGCGGGCCCACAAACCGTTGAATT
CCAGACGCCACGTCATCTAACGTGGAAACTAAAAGCCCACGTCTCCAAGCAAAAGGGAAATCGTTCTGTTAATCTTCACTTCCGGACAAATTCCGGCCAAAATATGCTGC
GAATCTTCCTTCTCCATTGGACGGTATTAAAAAAATGCACACCAAATTTCATCGATAAACTAAAAAAGAACAATGGCAGCCACTAGATTTTTCCGGCCGATTCGGCCTTC
AGCTTCTGTCGTCTCTTCCTCTGCCTCTCCATCGCCGGCGACGACGGTGAGGTGTATGGGCAGAACGGCGTTCAACAACGGCGAACGGGGGTTAACTTCCGGCGACGGCG
AGAGAAGGAAGATGGTCACCGTCAAGGCGGCGGTGGCGGCTCCGGAGACCGTGGACACCAAAACCAGAGAACTGGATTTGGGTTCGTTGCTGGCGAATCTTCTCGTTAAA
TTGAAGACCGCTGTGGGGAAGACGAAGATTCAGGACTTCATCGAAAAGAGCATAATCGACTGCCGATTCTTCACGTTATTCGCCGTCGCCGGATCTTTATTGGGTTCGAT
ACTCTGCTACCTGGAGGGGAGCTTTATTGTTGCAGAGTCTTATCTGCAGTATTTCCATGGCCTCTCGCAGAAGTCGGACCAAAATCATACGGTGGAGCTTCTAATTCAAG
CCATAGATATGTTCCTCGTCGGAACTGCTCTGTTTGTTTTTGGGGTGGGATTGTTTGCAATGTTCGTTGGACCCGAGAAGATGAAGGAAGAAAACCGCCATTGGAATTCT
GGATCCAACTTGTTTGGTCTCTTCTACATGAAGAAACTTCCGACGTGGGTGGGAATGGAATCGGTGTCGGCGGTGAAGTCGAAGATCGGGCATGCGGTGGTGATGATTCT
GCAAGTGGGTGTGTTGGAGAAGTTCAAGAGTATACCTTTGAACTCTGCCGCCGATCTCGCTTGTTTCGCCGCCGCCGTTCTGATCTCTTCCGCCTCCATCTTTTTCCTCT
CTAAACTCAATACGGGCGGCGGCGGCGGCGGCGGCGAGTGAACCTGCACCAATGGCGGCGCGTCGTTCTTTGTTGGCTTGTACAAATATATATAATCTAGGGGAAGTCTA
GGCCGAGGGAATTAGAGATTACCAATTCTTTTATTATTATTAATTAAAAAAATGCTTACACATGATAGTATAAGAGTCTCTAGCTCTGGCAATACGCCAATACTATTCTC
AATTTTTTTTTCAGTCTAGTAAATAAATTGTTAAATATTCAATTATTAGCCTTAATATCTACATTATAAATTGGGGAATAAATGACATGCTTTGCTCATACTATAAAAAT
GACAATATGGG
Protein sequenceShow/hide protein sequence
MAATRFFRPIRPSASVVSSSASPSPATTVRCMGRTAFNNGERGLTSGDGERRKMVTVKAAVAAPETVDTKTRELDLGSLLANLLVKLKTAVGKTKIQDFIEKSIIDCRFF
TLFAVAGSLLGSILCYLEGSFIVAESYLQYFHGLSQKSDQNHTVELLIQAIDMFLVGTALFVFGVGLFAMFVGPEKMKEENRHWNSGSNLFGLFYMKKLPTWVGMESVSA
VKSKIGHAVVMILQVGVLEKFKSIPLNSAADLACFAAAVLISSASIFFLSKLNTGGGGGGGE