CuGenDBv2

Moc03g03130 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc03g03130
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: INVOLVED IN: biological_process unknown; LOCATED IN: chloroplast; BEST Arabidopsis thaliana protein match is: my s in 277 species: Archae - 0; Bacteria - 0; Metazoa - 736; Fungi - 347; Plants - 385; Viruses - 0; Other Eukaryotes - 339 (source: NCBI BLink).
Genome location: chr3:2378986..2380631
RNA-Seq Expression: Moc03g03130
Synteny: Moc03g03130
Gene Ontology terms: GO:0016020 - membrane (cellular component)
InterPro domains: NA
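
The genome location above uses a `chrom:start..end` form. A minimal sketch for parsing it follows. It assumes the coordinates are 1-based and inclusive, which this page does not state; the function name `parse_location` is hypothetical.

```python
import re


def parse_location(location):
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


chrom, start, end = parse_location("chr3:2378986..2380631")
# Span length under the assumed 1-based, inclusive convention.
span = end - start + 1
print(chrom, start, end, span)
```

Under that assumption the locus covers 1646 positions on chr3.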


Homology
GenBank top hits: e value, % identity, alignment
XP_022138041.1 uncharacterized protein LOC111009298 [Momordica charantia]; e value 3.3e-72; identity 71.09%
Query:  VSIKPIPELTQASFDTLKYYKDQFPRGRKIGTLVIDKLLLESGLLDYNPLVRPIEASRLNSELAMVCGFTGSVKRKSKGRAHALKTVQSTEPVTPAVTQP
        V+I+P+PELTQASFDTLKYYK+ FPRGRK+GTLV DKLLLESGLLDYNP VRPIE+SR NSELAMVCGF  +VKRKSKG+AHAL+  QS++PVTPAV   
Subjt:  VSIKPIPELTQASFDTLKYYKDQFPRGRKIGTLVIDKLLLESGLLDYNPLVRPIEASRLNSELAMVCGFTGSVKRKSKGRAHALKTVQSTEPVTPAVTQP

Query:  AVQDKAGPSSEVPTPVIELDSAGEHSREKRSKNEFEALDVSPL-REVREDSPLKRRRKKKKTTTSSEVGPRGPLPTSHVDLVDDPEARMGGTSNVKIRFR
              GP+SE P PVIEL+S+   SREKR +++ EA+DVSPL  EVRE+ PLKRRRKKKKTT+  EVG RG LP S  D VDDPEARMGGT +V  RFR
Subjt:  AVQDKAGPSSEVPTPVIELDSAGEHSREKRSKNEFEALDVSPL-REVREDSPLKRRRKKKKTTTSSEVGPRGPLPTSHVDLVDDPEARMGGTSNVKIRFR

Query:  MEPSSSGVKDQ
        +EPSSSGV+DQ
Subjt:  MEPSSSGVKDQ

XP_022144034.1 uncharacterized protein LOC111013826 [Momordica charantia]; e value 1.6e-90; identity 67.16%
Query:  MFEYGLRLPLHPFAQEFLNRIGLTPAQVALNGWGVIFALSILFWLRARDEDEAELLSVEQLLGCFEAKRIAKKPGRYYMCARKGA---------------
        MFEYGLRLPLHPF QEFL R GL PAQVA NGWGVIFAL+ILFWLRARD +EAELL V+QLL CFEAKRIAKKPGR+YMCARKGA               
Subjt:  MFEYGLRLPLHPFAQEFLNRIGLTPAQVALNGWGVIFALSILFWLRARDEDEAELLSVEQLLGCFEAKRIAKKPGRYYMCARKGA---------------

Query:  ---------------GRPFFDVHVRFGNLVSIKPIPELTQASFDTLKYYKDQFPRGRKIGTLVIDKLLLESGLLDYNPLVRPIEASRLNSELAMVCGFTG
                       GR FFDV  RFGNLVSI+P+PELTQASFDTLKYYK++FPRGRK+GTLV D+LLLESGLLDYNP VRPIE SR NS LAMVC F  
Subjt:  ---------------GRPFFDVHVRFGNLVSIKPIPELTQASFDTLKYYKDQFPRGRKIGTLVIDKLLLESGLLDYNPLVRPIEASRLNSELAMVCGFTG

Query:  SVKRKSKGRAHALKTVQSTEPVTPAVTQPAVQDKAGPSSEVPTPVIELDSAGEHSREKRSKNEFEALD
         VKRKSKGRAHAL+  QS++P TPAV         GP+SE P PVIEL+S+G  SREKR +++ EA+D
Subjt:  SVKRKSKGRAHALKTVQSTEPVTPAVTQPAVQDKAGPSSEVPTPVIELDSAGEHSREKRSKNEFEALD

XP_022158650.1 uncharacterized protein LOC111025108 [Momordica charantia]; e value 4.0e-70; identity 71.65%
Query:  MFEYGLRLPLHPFAQEFLNRIGLTPAQVALNGWGVIFALSILFWLRARDEDEAELLSVEQLLGCFEAKRIAKKPGRYYMCARKGA---------------
        MFEYGLRLPLHPF QEFL R GL PAQVA NGWGVIFAL+ILFWLRARD +EAELL V+QLL CFEAKRIAKKPGR+YMCARKGA               
Subjt:  MFEYGLRLPLHPFAQEFLNRIGLTPAQVALNGWGVIFALSILFWLRARDEDEAELLSVEQLLGCFEAKRIAKKPGRYYMCARKGA---------------

Query:  ---------------GRPFFDVHVRFGNLVSIKPIPELTQASFDTLKYYKDQFPRGRKIGTLVIDKLLLESGLLDYNPLVRPIEASRLNSELAM
                       GR FFDV  RFGNLVSI+P+PELTQASFDTLKYYK+ FPRGRK+GTLV DKLLLESGLLDYNP VRPIE+SR NSEL M
Subjt:  ---------------GRPFFDVHVRFGNLVSIKPIPELTQASFDTLKYYKDQFPRGRKIGTLVIDKLLLESGLLDYNPLVRPIEASRLNSELAM

XP_022159063.1 uncharacterized protein LOC111025502, partial [Momordica charantia]; e value 9.7e-133; identity 69.86%
Query:  SDSGEDLALRLVSELEEIENFRFSDDGEDSDTSTSGLGLEYPSKMPEHYLGPLRRGFSIPNDILLRIPEEGERADNPPEGWVTLYLKMFEYGLRLPLHPF
        S+   DLA RL S+LEEIEN R SDDGEDSD STSG GLEYPS++PEHYLG LRRGF+IP +ILLR+PEEGERADNPPEGWVTLY KMFEYGLRLPLHPF
Subjt:  SDSGEDLALRLVSELEEIENFRFSDDGEDSDTSTSGLGLEYPSKMPEHYLGPLRRGFSIPNDILLRIPEEGERADNPPEGWVTLYLKMFEYGLRLPLHPF

Query:  AQEFLNRIGLTPAQVALNGWGVIFALSILFWLRARDEDEAELLSVEQLLGCFEAKRIAKKPGRYYMCARKGA----------------------------
         QEFL R GL PAQVA NGWGVIFAL+ILFWLRARD +EAEL  V+QLL CFEAKRIAKKPGR+YMCARKGA                            
Subjt:  AQEFLNRIGLTPAQVALNGWGVIFALSILFWLRARDEDEAELLSVEQLLGCFEAKRIAKKPGRYYMCARKGA----------------------------

Query:  --GRPFFDVHVRFGNLVSIKPIPELTQASFDTLKYYKDQFPRGRKIGTLVIDKLLLESGLLDYNPLVRPIEASRLNSELAMVCGFTGSVKRKSKGRAHAL
          GR FFDV  RFGNLVSI+P+PELTQASFDTLKYYK++FPRGRK+GTLV D+LLLESGLLDYNP VRPIE+SR NSELAMVCGF   VKRKSKGRAHAL
Subjt:  --GRPFFDVHVRFGNLVSIKPIPELTQASFDTLKYYKDQFPRGRKIGTLVIDKLLLESGLLDYNPLVRPIEASRLNSELAMVCGFTGSVKRKSKGRAHAL

Query:  KTVQSTEPVTPAVTQPAVQDKAGPSSEVPTPVIELDSAGEHSREKRSKNEFEALD
        +  QS++P TPAV         GP+SE P  VIEL+S+G  SREKR +++ EA+D
Subjt:  KTVQSTEPVTPAVTQPAVQDKAGPSSEVPTPVIELDSAGEHSREKRSKNEFEALD

XP_022159252.1 uncharacterized protein LOC111025665 [Momordica charantia]; e value 3.6e-95; identity 77.96%
Query:  ARKGAGRPFFDVHVRFGNLVSIKPIPELTQASFDTLKYYKDQFPRGRKIGTLVIDKLLLESGLLDYNPLVRPIEASRLNSELAMVCGFTGSVKRKSKGRA
        A+  +GR FFDV  RFGNLVSIK IPEL QA+FDTLK+YKD FPR RKI TLV DKLLLESGLLDYNPLVR IEASR NSELAMVCGFTGSVKRKSKGRA
Subjt:  ARKGAGRPFFDVHVRFGNLVSIKPIPELTQASFDTLKYYKDQFPRGRKIGTLVIDKLLLESGLLDYNPLVRPIEASRLNSELAMVCGFTGSVKRKSKGRA

Query:  HALKTVQSTEPVTPAVTQPAVQDKAGPSSEVPTPVIELDSAGEHSREKRSKNEFEALDVSPLREVREDSPLKRRRKKKKTTTSSEVGPRGPLPTSHVDLV
        HALKTV  TEPVTP V +   Q  +GPSS VPTPVIELD +G  S EKRS+ E EALDVSPL EVR +SPL+RRRKKKKT++SSE G RG LPTSH DLV
Subjt:  HALKTVQSTEPVTPAVTQPAVQDKAGPSSEVPTPVIELDSAGEHSREKRSKNEFEALDVSPLREVREDSPLKRRRKKKKTTTSSEVGPRGPLPTSHVDLV

Query:  DDPEARMGGTSNVKIRFRMEPSSSGVKDQVSRISAACLDRCFRRA
        DDPEARM GTSNV++RF MEPSSSGVKDQVSRISA CLDR  RRA
Subjt:  DDPEARMGGTSNVKIRFRMEPSSSGVKDQVSRISAACLDRCFRRA

TrEMBL top hits: e value, % identity, alignment
A0A6J1C8K9 uncharacterized protein LOC111009298; e value 1.6e-72; identity 71.09%
Query:  VSIKPIPELTQASFDTLKYYKDQFPRGRKIGTLVIDKLLLESGLLDYNPLVRPIEASRLNSELAMVCGFTGSVKRKSKGRAHALKTVQSTEPVTPAVTQP
        V+I+P+PELTQASFDTLKYYK+ FPRGRK+GTLV DKLLLESGLLDYNP VRPIE+SR NSELAMVCGF  +VKRKSKG+AHAL+  QS++PVTPAV   
Subjt:  VSIKPIPELTQASFDTLKYYKDQFPRGRKIGTLVIDKLLLESGLLDYNPLVRPIEASRLNSELAMVCGFTGSVKRKSKGRAHALKTVQSTEPVTPAVTQP

Query:  AVQDKAGPSSEVPTPVIELDSAGEHSREKRSKNEFEALDVSPL-REVREDSPLKRRRKKKKTTTSSEVGPRGPLPTSHVDLVDDPEARMGGTSNVKIRFR
              GP+SE P PVIEL+S+   SREKR +++ EA+DVSPL  EVRE+ PLKRRRKKKKTT+  EVG RG LP S  D VDDPEARMGGT +V  RFR
Subjt:  AVQDKAGPSSEVPTPVIELDSAGEHSREKRSKNEFEALDVSPL-REVREDSPLKRRRKKKKTTTSSEVGPRGPLPTSHVDLVDDPEARMGGTSNVKIRFR

Query:  MEPSSSGVKDQ
        +EPSSSGV+DQ
Subjt:  MEPSSSGVKDQ

A0A6J1CR42 uncharacterized protein LOC111013826; e value 7.6e-91; identity 67.16%
Query:  MFEYGLRLPLHPFAQEFLNRIGLTPAQVALNGWGVIFALSILFWLRARDEDEAELLSVEQLLGCFEAKRIAKKPGRYYMCARKGA---------------
        MFEYGLRLPLHPF QEFL R GL PAQVA NGWGVIFAL+ILFWLRARD +EAELL V+QLL CFEAKRIAKKPGR+YMCARKGA               
Subjt:  MFEYGLRLPLHPFAQEFLNRIGLTPAQVALNGWGVIFALSILFWLRARDEDEAELLSVEQLLGCFEAKRIAKKPGRYYMCARKGA---------------

Query:  ---------------GRPFFDVHVRFGNLVSIKPIPELTQASFDTLKYYKDQFPRGRKIGTLVIDKLLLESGLLDYNPLVRPIEASRLNSELAMVCGFTG
                       GR FFDV  RFGNLVSI+P+PELTQASFDTLKYYK++FPRGRK+GTLV D+LLLESGLLDYNP VRPIE SR NS LAMVC F  
Subjt:  ---------------GRPFFDVHVRFGNLVSIKPIPELTQASFDTLKYYKDQFPRGRKIGTLVIDKLLLESGLLDYNPLVRPIEASRLNSELAMVCGFTG

Query:  SVKRKSKGRAHALKTVQSTEPVTPAVTQPAVQDKAGPSSEVPTPVIELDSAGEHSREKRSKNEFEALD
         VKRKSKGRAHAL+  QS++P TPAV         GP+SE P PVIEL+S+G  SREKR +++ EA+D
Subjt:  SVKRKSKGRAHALKTVQSTEPVTPAVTQPAVQDKAGPSSEVPTPVIELDSAGEHSREKRSKNEFEALD

A0A6J1DWF1 uncharacterized protein LOC111025108; e value 1.9e-70; identity 71.65%
Query:  MFEYGLRLPLHPFAQEFLNRIGLTPAQVALNGWGVIFALSILFWLRARDEDEAELLSVEQLLGCFEAKRIAKKPGRYYMCARKGA---------------
        MFEYGLRLPLHPF QEFL R GL PAQVA NGWGVIFAL+ILFWLRARD +EAELL V+QLL CFEAKRIAKKPGR+YMCARKGA               
Subjt:  MFEYGLRLPLHPFAQEFLNRIGLTPAQVALNGWGVIFALSILFWLRARDEDEAELLSVEQLLGCFEAKRIAKKPGRYYMCARKGA---------------

Query:  ---------------GRPFFDVHVRFGNLVSIKPIPELTQASFDTLKYYKDQFPRGRKIGTLVIDKLLLESGLLDYNPLVRPIEASRLNSELAM
                       GR FFDV  RFGNLVSI+P+PELTQASFDTLKYYK+ FPRGRK+GTLV DKLLLESGLLDYNP VRPIE+SR NSEL M
Subjt:  ---------------GRPFFDVHVRFGNLVSIKPIPELTQASFDTLKYYKDQFPRGRKIGTLVIDKLLLESGLLDYNPLVRPIEASRLNSELAM

A0A6J1DXS5 uncharacterized protein LOC111025502; e value 4.7e-133; identity 69.86%
Query:  SDSGEDLALRLVSELEEIENFRFSDDGEDSDTSTSGLGLEYPSKMPEHYLGPLRRGFSIPNDILLRIPEEGERADNPPEGWVTLYLKMFEYGLRLPLHPF
        S+   DLA RL S+LEEIEN R SDDGEDSD STSG GLEYPS++PEHYLG LRRGF+IP +ILLR+PEEGERADNPPEGWVTLY KMFEYGLRLPLHPF
Subjt:  SDSGEDLALRLVSELEEIENFRFSDDGEDSDTSTSGLGLEYPSKMPEHYLGPLRRGFSIPNDILLRIPEEGERADNPPEGWVTLYLKMFEYGLRLPLHPF

Query:  AQEFLNRIGLTPAQVALNGWGVIFALSILFWLRARDEDEAELLSVEQLLGCFEAKRIAKKPGRYYMCARKGA----------------------------
         QEFL R GL PAQVA NGWGVIFAL+ILFWLRARD +EAEL  V+QLL CFEAKRIAKKPGR+YMCARKGA                            
Subjt:  AQEFLNRIGLTPAQVALNGWGVIFALSILFWLRARDEDEAELLSVEQLLGCFEAKRIAKKPGRYYMCARKGA----------------------------

Query:  --GRPFFDVHVRFGNLVSIKPIPELTQASFDTLKYYKDQFPRGRKIGTLVIDKLLLESGLLDYNPLVRPIEASRLNSELAMVCGFTGSVKRKSKGRAHAL
          GR FFDV  RFGNLVSI+P+PELTQASFDTLKYYK++FPRGRK+GTLV D+LLLESGLLDYNP VRPIE+SR NSELAMVCGF   VKRKSKGRAHAL
Subjt:  --GRPFFDVHVRFGNLVSIKPIPELTQASFDTLKYYKDQFPRGRKIGTLVIDKLLLESGLLDYNPLVRPIEASRLNSELAMVCGFTGSVKRKSKGRAHAL

Query:  KTVQSTEPVTPAVTQPAVQDKAGPSSEVPTPVIELDSAGEHSREKRSKNEFEALD
        +  QS++P TPAV         GP+SE P  VIEL+S+G  SREKR +++ EA+D
Subjt:  KTVQSTEPVTPAVTQPAVQDKAGPSSEVPTPVIELDSAGEHSREKRSKNEFEALD

A0A6J1DZB3 uncharacterized protein LOC111025665; e value 1.7e-95; identity 77.96%
Query:  ARKGAGRPFFDVHVRFGNLVSIKPIPELTQASFDTLKYYKDQFPRGRKIGTLVIDKLLLESGLLDYNPLVRPIEASRLNSELAMVCGFTGSVKRKSKGRA
        A+  +GR FFDV  RFGNLVSIK IPEL QA+FDTLK+YKD FPR RKI TLV DKLLLESGLLDYNPLVR IEASR NSELAMVCGFTGSVKRKSKGRA
Subjt:  ARKGAGRPFFDVHVRFGNLVSIKPIPELTQASFDTLKYYKDQFPRGRKIGTLVIDKLLLESGLLDYNPLVRPIEASRLNSELAMVCGFTGSVKRKSKGRA

Query:  HALKTVQSTEPVTPAVTQPAVQDKAGPSSEVPTPVIELDSAGEHSREKRSKNEFEALDVSPLREVREDSPLKRRRKKKKTTTSSEVGPRGPLPTSHVDLV
        HALKTV  TEPVTP V +   Q  +GPSS VPTPVIELD +G  S EKRS+ E EALDVSPL EVR +SPL+RRRKKKKT++SSE G RG LPTSH DLV
Subjt:  HALKTVQSTEPVTPAVTQPAVQDKAGPSSEVPTPVIELDSAGEHSREKRSKNEFEALDVSPLREVREDSPLKRRRKKKKTTTSSEVGPRGPLPTSHVDLV

Query:  DDPEARMGGTSNVKIRFRMEPSSSGVKDQVSRISAACLDRCFRRA
        DDPEARM GTSNV++RF MEPSSSGVKDQVSRISA CLDR  RRA
Subjt:  DDPEARMGGTSNVKIRFRMEPSSSGVKDQVSRISAACLDRCFRRA

SwissProt top hits: e value, % identity, alignment
Q9LEX8 Uncharacterized protein At3g60930, chloroplastic; e value 5.2e-04; identity 25.96%
Query:  VVFMSSPSSSDSLGSAGRTISSSPPKPSDSGEDLALRLVSELEEI----ENFRFSDDGEDSDTSTSGLGLEYP---SKMPEHYLGPLRRGFSIPNDILLR
        +V  SS    D  G++    +S P  P+   ED      +E E+I    E   F    +          +  P   S   E  L  L+  F +   + LR
Subjt:  VVFMSSPSSSDSLGSAGRTISSSPPKPSDSGEDLALRLVSELEEI----ENFRFSDDGEDSDTSTSGLGLEYP---SKMPEHYLGPLRRGFSIPNDILLR

Query:  IPEEGERADNPPEGWVTLYLKMFEYG--LRLPLHPFAQEFLNRIGLTPAQVALNGWGVIFALSILFWLRARDEDEAELLSVEQLLGCFEAKRIAK-KPGR
        +P   ERAD+PP G+ TLY + F YG  L LP+     E++    +  +Q+ +       +L  L  +  R  +    +++  L    E +R+ K +  R
Subjt:  IPEEGERADNPPEGWVTLYLKMFEYG--LRLPLHPFAQEFLNRIGLTPAQVALNGWGVIFALSILFWLRARDEDEAELLSVEQLLGCFEAKRIAK-KPGR

Query:  YYMCARKG
        YY+   KG
Subjt:  YYMCARKG

Arabidopsis top hits: e value, % identity, alignment
AT1G32010.1 myosin heavy chain-related; e value 1.8e-04; identity 24.12%
Query:  LRLVSELEEIENFRFSDDGEDSDTSTSGLGLEY------PSKMPEHYLGPLRRGFSIPNDILLRIPEEGERADNPPEGWVTLYLKMF-EYGLRLPLHPFA
        LR+ ++ +   N    D+ E +D + SG   +       P+      +G       +P  + +RIP + +R  + PEG++ L+   F E GLR P+  F 
Subjt:  LRLVSELEEIENFRFSDDGEDSDTSTSGLGLEY------PSKMPEHYLGPLRRGFSIPNDILLRIPEEGERADNPPEGWVTLYLKMF-EYGLRLPLHPFA

Query:  QEFLNRIGLTPAQVALNGWGVIFALSILFWLRARDEDEAELLSVEQLLGCFEAKRIAKKPGRYYMCARKG
          F     +  +Q+ +     I   + L  L AR       LSVE +       ++  K G++Y+ + +G
Subjt:  QEFLNRIGLTPAQVALNGWGVIFALSILFWLRARDEDEAELLSVEQLLGCFEAKRIAKKPGRYYMCARKG

AT5G38190.1 INVOLVED IN: biological_process unknown; e value 5.3e-04; identity 25.32%
Query:  RFSDD-GEDSDTSTSGLGLEY------PSKMPEHYLGPLRRGFSIPNDILLRIPEEGERADNPPEGWVTLYLKMF-EYGLRLPLHPFAQEFLNRIGLTPA
        R++DD  E +D + SG   +       P+      +G       +P  + +RIP + +R  + PEG++ L+   F E GLR P+  F   F     +  +
Subjt:  RFSDD-GEDSDTSTSGLGLEY------PSKMPEHYLGPLRRGFSIPNDILLRIPEEGERADNPPEGWVTLYLKMF-EYGLRLPLHPFAQEFLNRIGLTPA

Query:  QVALNGWGVIFALSILFWLRARDEDEAELLSVEQLLGCFEAKRIAKKPGRYYMCARKG
        Q+ +     I   + L  L AR       LSVE +       ++  K G++Y+ + +G
Subjt:  QVALNGWGVIFALSILFWLRARDEDEAELLSVEQLLGCFEAKRIAKKPGRYYMCARKG
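
The alignments above appear to follow the usual BLAST-style layout: the middle line repeats the residue at identical positions, shows `+` for a conservative substitution, and is blank at mismatches or gaps. A rough sketch of counting identities from one such middle line, assuming that convention holds here. `midline_identity` is a hypothetical helper, and the % identity values listed per hit come from the database and may be defined over a different alignment length.

```python
def midline_identity(query, midline):
    """Return (identities, aligned_length, percent) for one alignment block."""
    # Pad the midline: trailing blanks are often stripped from copied text.
    midline = midline.ljust(len(query))
    # A position is an identity when the midline repeats the query residue.
    identities = sum(
        1 for q, m in zip(query, midline) if m.isalpha() and m == q
    )
    length = len(query)
    return identities, length, 100.0 * identities / length


# Last block of the first GenBank hit (XP_022138041.1).
ident, length, pct = midline_identity("MEPSSSGVKDQ", "+EPSSSGV+DQ")
print(ident, length, round(pct, 2))
```

For a full hit, the per-block counts would be summed before dividing, since a single block is only a slice of the alignment.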


Sequences
CDS sequence
ATGGTAGTTTTCATGTCTTCCCCTTCCAGTAGTGATAGTTTAGGTAGTGCAGGTCGGACTATAAGCAGTTCGCCCCCCAAACCTAGTGACTCTGGGGAGGACTTA
GCTCTTAGGTTAGTGTCCGAACTGGAAGAGATAGAAAATTTTAGGTTTTCTGATGATGGGGAGGATAGTGACACTTCCACCTCGGGCCTGGGTTTGGAATACCCT
TCGAAGATGCCTGAACATTACCTCGGACCCCTCCGTAGGGGGTTTAGTATTCCCAATGACATCCTCCTTAGGATCCCAGAGGAAGGGGAAAGAGCTGACAATCCT
CCAGAGGGATGGGTCACTCTTTATTTAAAAATGTTTGAGTACGGCCTCAGACTTCCCCTTCATCCTTTTGCCCAGGAGTTTTTAAACCGAATTGGACTGACACCT
GCTCAAGTGGCCCTCAATGGATGGGGTGTCATTTTTGCTTTGTCAATCCTTTTTTGGTTACGAGCTCGAGACGAGGACGAGGCCGAGCTGTTAAGTGTCGAGCAG
CTTCTTGGGTGCTTCGAAGCCAAGAGAATAGCTAAGAAGCCAGGTCGGTATTATATGTGCGCAAGGAAAGGCGCAGGTCGTCCCTTCTTTGACGTGCATGTTAGG
TTTGGGAACCTAGTGTCGATCAAACCGATTCCCGAGCTCACTCAAGCCTCCTTCGACACCCTCAAGTATTACAAGGATCAATTCCCAAGGGGCCGGAAGATCGGA
ACCTTGGTGATTGACAAGCTGCTCCTGGAATCTGGGTTGTTAGACTACAACCCCTTGGTACGTCCGATCGAAGCTTCAAGGCTAAACTCCGAACTTGCAATGGTG
TGCGGGTTCACTGGCAGTGTGAAGCGCAAGTCTAAGGGCCGTGCTCACGCCCTTAAGACTGTTCAAAGCACGGAGCCAGTGACTCCTGCTGTGACTCAGCCTGCG
GTTCAGGACAAGGCTGGGCCATCTTCTGAAGTTCCAACTCCGGTGATCGAGTTGGACTCTGCTGGGGAGCACTCTAGAGAAAAGCGCTCGAAGAACGAGTTCGAG
GCGCTGGACGTGTCGCCTCTGCGCGAAGTGAGGGAAGACTCTCCTTTGAAGAGGAGAAGGAAGAAGAAGAAAACCACCACCTCCTCAGAGGTTGGACCTCGTGGT
CCCCTGCCCACGAGCCACGTTGACTTGGTGGATGACCCCGAAGCTCGGATGGGGGGGACGTCCAACGTGAAGATACGGTTCAGAATGGAACCGTCGAGCTCCGGG
GTGAAGGACCAGGTGTCCCGCATCTCGGCTGCGTGCTTGGACCGCTGCTTTAGAAGAGCGTTCCGAGGTGGATATCTTGAGGGCCGAGGTGGAAGTCAAGGCCGA
GCTGCTGAAGAGGGAGGATGA
mRNA sequence
ATGGTAGTTTTCATGTCTTCCCCTTCCAGTAGTGATAGTTTAGGTAGTGCAGGTCGGACTATAAGCAGTTCGCCCCCCAAACCTAGTGACTCTGGGGAGGACTTA
GCTCTTAGGTTAGTGTCCGAACTGGAAGAGATAGAAAATTTTAGGTTTTCTGATGATGGGGAGGATAGTGACACTTCCACCTCGGGCCTGGGTTTGGAATACCCT
TCGAAGATGCCTGAACATTACCTCGGACCCCTCCGTAGGGGGTTTAGTATTCCCAATGACATCCTCCTTAGGATCCCAGAGGAAGGGGAAAGAGCTGACAATCCT
CCAGAGGGATGGGTCACTCTTTATTTAAAAATGTTTGAGTACGGCCTCAGACTTCCCCTTCATCCTTTTGCCCAGGAGTTTTTAAACCGAATTGGACTGACACCT
GCTCAAGTGGCCCTCAATGGATGGGGTGTCATTTTTGCTTTGTCAATCCTTTTTTGGTTACGAGCTCGAGACGAGGACGAGGCCGAGCTGTTAAGTGTCGAGCAG
CTTCTTGGGTGCTTCGAAGCCAAGAGAATAGCTAAGAAGCCAGGTCGGTATTATATGTGCGCAAGGAAAGGCGCAGGTCGTCCCTTCTTTGACGTGCATGTTAGG
TTTGGGAACCTAGTGTCGATCAAACCGATTCCCGAGCTCACTCAAGCCTCCTTCGACACCCTCAAGTATTACAAGGATCAATTCCCAAGGGGCCGGAAGATCGGA
ACCTTGGTGATTGACAAGCTGCTCCTGGAATCTGGGTTGTTAGACTACAACCCCTTGGTACGTCCGATCGAAGCTTCAAGGCTAAACTCCGAACTTGCAATGGTG
TGCGGGTTCACTGGCAGTGTGAAGCGCAAGTCTAAGGGCCGTGCTCACGCCCTTAAGACTGTTCAAAGCACGGAGCCAGTGACTCCTGCTGTGACTCAGCCTGCG
GTTCAGGACAAGGCTGGGCCATCTTCTGAAGTTCCAACTCCGGTGATCGAGTTGGACTCTGCTGGGGAGCACTCTAGAGAAAAGCGCTCGAAGAACGAGTTCGAG
GCGCTGGACGTGTCGCCTCTGCGCGAAGTGAGGGAAGACTCTCCTTTGAAGAGGAGAAGGAAGAAGAAGAAAACCACCACCTCCTCAGAGGTTGGACCTCGTGGT
CCCCTGCCCACGAGCCACGTTGACTTGGTGGATGACCCCGAAGCTCGGATGGGGGGGACGTCCAACGTGAAGATACGGTTCAGAATGGAACCGTCGAGCTCCGGG
GTGAAGGACCAGGTGTCCCGCATCTCGGCTGCGTGCTTGGACCGCTGCTTTAGAAGAGCGTTCCGAGGTGGATATCTTGAGGGCCGAGGTGGAAGTCAAGGCCGA
GCTGCTGAAGAGGGAGGATGA
Protein sequence
MVVFMSSPSSSDSLGSAGRTISSSPPKPSDSGEDLALRLVSELEEIENFRFSDDGEDSDTSTSGLGLEYPSKMPEHYLGPLRRGFSIPNDILLRIPEEGERADNP
PEGWVTLYLKMFEYGLRLPLHPFAQEFLNRIGLTPAQVALNGWGVIFALSILFWLRARDEDEAELLSVEQLLGCFEAKRIAKKPGRYYMCARKGAGRPFFDVHVR
FGNLVSIKPIPELTQASFDTLKYYKDQFPRGRKIGTLVIDKLLLESGLLDYNPLVRPIEASRLNSELAMVCGFTGSVKRKSKGRAHALKTVQSTEPVTPAVTQPA
VQDKAGPSSEVPTPVIELDSAGEHSREKRSKNEFEALDVSPLREVREDSPLKRRRKKKKTTTSSEVGPRGPLPTSHVDLVDDPEARMGGTSNVKIRFRMEPSSSG
VKDQVSRISAACLDRCFRRAFRGGYLEGRGGSQGRAAEEGG
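
The protein sequence should be the conceptual translation of the CDS. A minimal sketch that checks this for the first and last codons listed above, assuming the standard genetic code (the page does not name a translation table). `translate` is a hypothetical helper.

```python
from itertools import product

# Standard genetic code in the conventional TCAG ordering; '*' marks a stop.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate an in-frame DNA string codon by codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First seven codons of the CDS, then its final line (ends in a stop codon).
print(translate("ATGGTAGTTTTCATGTCTTCC"))
print(translate("GCTGCTGAAGAGGGAGGATGA"))
```

The two fragments translate to `MVVFMSS` and `AAEEGG*`, which match the start and end of the protein sequence above. Joining all CDS lines and passing them to `translate` would check the full sequence the same way.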