CuGenDBv2

Moc08g14940 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc08g14940
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: INVOLVED IN: biological_process unknown; LOCATED IN: chloroplast; BEST Arabidopsis thaliana protein match is: my s in 277 species: Archae - 0; Bacteria - 0; Metazoa - 736; Fungi - 347; Plants - 385; Viruses - 0; Other Eukaryotes - 339 (source: NCBI BLink).
Genome location: chr8:11551318..11554623
RNA-Seq Expression: Moc08g14940
Synteny: Moc08g14940
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
InterPro domains: NA
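The genome location field above uses the form chromosome:start..end. The following is a minimal Python sketch for parsing such a string and computing the span it covers. It assumes the coordinates are 1-based and inclusive, which this page does not state, and `parse_location` is a hypothetical helper name, not part of any CuGenDB tool.

```python
import re

def parse_location(loc):
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    m = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", loc)
    if m is None:
        raise ValueError(f"unrecognized location: {loc!r}")
    return m.group(1), int(m.group(2)), int(m.group(3))

chrom, start, end = parse_location("chr8:11551318..11554623")
# Assumption: 1-based inclusive coordinates, so length = end - start + 1.
span = end - start + 1
print(chrom, start, end, span)
```

Under the inclusive-coordinate assumption the gene region spans 3306 bp.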


Homology
GenBank top hits    e value    %identity    Alignment
XP_022144034.1 uncharacterized protein LOC111013826 [Momordica charantia]    2.4e-98    69.78
Query:  MFEYGLKLSLHPFAQEFLNRTGPAPAQVAPNGWGVIFALAILFWLRARGEDEAELLSVNQLLGCFEAKRIAKKPGRYYMCARKGACGIVKGPTSIKGWVG
        MFEYGL+L LHPF QEFL RTG APAQVAPNGWGVIFALAILFWLRAR  +EAELL V+QLL CFEAKRIAKKPGR+YMCARKGA GIVKGPTSIKGWV 
Subjt:  MFEYGLKLSLHPFAQEFLNRTGPAPAQVAPNGWGVIFALAILFWLRARGEDEAELLSVNQLLGCFEAKRIAKKPGRYYMCARKGACGIVKGPTSIKGWVG

Query:  KWFFALGEWLANDESGRPFFDGSARFGNLVSIKPIPELDQATFDTLKFYKENFPKGRKIETLVTDKLLLESGLLDYNRLVQPIEAS------SNVCGFTG
        KWF+A GEWLA DESGR FFD   RFGNLVSI+P+PEL QA+FDTLK+YKE FP+GRK+ TLVTD+LLLESGLLDYN  V+PIE S      + VC F  
Subjt:  KWFFALGEWLANDESGRPFFDGSARFGNLVSIKPIPELDQATFDTLKFYKENFPKGRKIETLVTDKLLLESGLLDYNRLVQPIEAS------SNVCGFTG

Query:  SVKRKFKGRANALKTVVGTEPVTPTAPRTEAQGNSGPSSAVSTPVIELDLSGGRSEEKRPREESEALD
         VKRK KGRA+AL+    ++P TP           GP+S    PVIEL+ SGG S EKRPR+++EA+D
Subjt:  SVKRKFKGRANALKTVVGTEPVTPTAPRTEAQGNSGPSSAVSTPVIELDLSGGRSEEKRPREESEALD

XP_022158122.1 uncharacterized protein LOC111024680 [Momordica charantia]    3.7e-83    81.18
Query:  MFEYGLKLSLHPFAQEFLNRTGPAPAQVAPNGWGVIFALAILFWLRARGEDEAELLSVNQLLGCFEAKRIAKKPGRYYMCARKGACGIVKGPTSIKGWVG
        MFEYGL+L LHPF QEFL RTG APAQVAPNGWGVIFALAILFWLRAR  +EAELL V+QLL CFEAKRIAKKPGR+YMCARKGA GIVKGPTSIKGWV 
Subjt:  MFEYGLKLSLHPFAQEFLNRTGPAPAQVAPNGWGVIFALAILFWLRARGEDEAELLSVNQLLGCFEAKRIAKKPGRYYMCARKGACGIVKGPTSIKGWVG

Query:  KWFFALGEWLANDESGRPFFDGSARFGNLVSIKPIPELDQATFDTLKFYKENFPKGRKIETLVTDKLLLESGLLDYNRLVQPIEAS
        KWF+A GEWLA DESGR FFD   RFGNLVSI+P+PEL QA+FDTLK+YKE FP+GRK+ TLVTD+LLLESGLLDYN  V+PIE+S
Subjt:  KWFFALGEWLANDESGRPFFDGSARFGNLVSIKPIPELDQATFDTLKFYKENFPKGRKIETLVTDKLLLESGLLDYNRLVQPIEAS

XP_022158650.1 uncharacterized protein LOC111025108 [Momordica charantia]    9.7e-84    81.72
Query:  MFEYGLKLSLHPFAQEFLNRTGPAPAQVAPNGWGVIFALAILFWLRARGEDEAELLSVNQLLGCFEAKRIAKKPGRYYMCARKGACGIVKGPTSIKGWVG
        MFEYGL+L LHPF QEFL RTG APAQVAPNGWGVIFALAILFWLRAR  +EAELL V+QLL CFEAKRIAKKPGR+YMCARKGA GIVKGPTSIKGWV 
Subjt:  MFEYGLKLSLHPFAQEFLNRTGPAPAQVAPNGWGVIFALAILFWLRARGEDEAELLSVNQLLGCFEAKRIAKKPGRYYMCARKGACGIVKGPTSIKGWVG

Query:  KWFFALGEWLANDESGRPFFDGSARFGNLVSIKPIPELDQATFDTLKFYKENFPKGRKIETLVTDKLLLESGLLDYNRLVQPIEAS
        KWF+A GEWLA DESGR FFD   RFGNLVSI+P+PEL QA+FDTLK+YKE+FP+GRK+ TLVTDKLLLESGLLDYN  V+PIE+S
Subjt:  KWFFALGEWLANDESGRPFFDGSARFGNLVSIKPIPELDQATFDTLKFYKENFPKGRKIETLVTDKLLLESGLLDYNRLVQPIEAS

XP_022159063.1 uncharacterized protein LOC111025502, partial [Momordica charantia]    9.0e-138    71.15
Query:  LSSPSSSDSLGCLESELEEIENFRFSDDIEDSDTSTSGQGLEYPSRMPEHYLGPLRRRFNIPNDILLRIPEEGERADNPPEGWVTLYLKMFEYGLKLSLH
        +SS   SD    LES+LEEIEN R SDD EDSD STSGQGLEYPSR+PEHYLG LRR F IP +ILLR+PEEGERADNPPEGWVTLY KMFEYGL+L LH
Subjt:  LSSPSSSDSLGCLESELEEIENFRFSDDIEDSDTSTSGQGLEYPSRMPEHYLGPLRRRFNIPNDILLRIPEEGERADNPPEGWVTLYLKMFEYGLKLSLH

Query:  PFAQEFLNRTGPAPAQVAPNGWGVIFALAILFWLRARGEDEAELLSVNQLLGCFEAKRIAKKPGRYYMCARKGACGIVKGPTSIKGWVGKWFFALGEWLA
        PF QEFL RTG APAQVAPNGWGVIFALAILFWLRAR  +EAEL  V+QLL CFEAKRIAKKPGR+YMCARKGA GIVKGPTSIKGWV KWF+A GEWLA
Subjt:  PFAQEFLNRTGPAPAQVAPNGWGVIFALAILFWLRARGEDEAELLSVNQLLGCFEAKRIAKKPGRYYMCARKGACGIVKGPTSIKGWVGKWFFALGEWLA

Query:  NDESGRPFFDGSARFGNLVSIKPIPELDQATFDTLKFYKENFPKGRKIETLVTDKLLLESGLLDYNRLVQPIEAS------SNVCGFTGSVKRKFKGRAN
         DESGR FFD   RFGNLVSI+P+PEL QA+FDTLK+YKE FP+GRK+ TLVTD+LLLESGLLDYN  V+PIE+S      + VCGF   VKRK KGRA+
Subjt:  NDESGRPFFDGSARFGNLVSIKPIPELDQATFDTLKFYKENFPKGRKIETLVTDKLLLESGLLDYNRLVQPIEAS------SNVCGFTGSVKRKFKGRAN

Query:  ALKTVVGTEPVTPTAPRTEAQGNSGPSSAVSTPVIELDLSGGRSEEKRPREESEALD
        AL+    ++P TP           GP+S     VIEL+ SGG S EKRPR+++EA+D
Subjt:  ALKTVVGTEPVTPTAPRTEAQGNSGPSSAVSTPVIELDLSGGRSEEKRPREESEALD

XP_022159252.1 uncharacterized protein LOC111025665 [Momordica charantia]    8.0e-78    82.38
Query:  MCARKGACGIVKGPTSIKGWVGKWFFALGEWLANDESGRPFFDGSARFGNLVSIKPIPELDQATFDTLKFYKENFPKGRKIETLVTDKLLLESGLLDYNR
        MCARKG  GIVKGPTSIKGWVGKWFFA GEWLA DESGR FFD   RFGNLVSIK IPEL QATFDTLK YK++FP+ RKI TLVTDKLLLESGLLDYN 
Subjt:  MCARKGACGIVKGPTSIKGWVGKWFFALGEWLANDESGRPFFDGSARFGNLVSIKPIPELDQATFDTLKFYKENFPKGRKIETLVTDKLLLESGLLDYNR

Query:  LVQPIEAS------SNVCGFTGSVKRKFKGRANALKTVVGTEPVTPTAPRTEAQGNSGPSSAVSTPVIELDLSGGRSEEKRPREESEALDVSP
        LV+ IEAS      + VCGFTGSVKRK KGRA+ALKTVVGTEPVTPT PRT AQGNSGPSSAV TPVIELDLSGGRS EKR REESEALDVSP
Subjt:  LVQPIEAS------SNVCGFTGSVKRKFKGRANALKTVVGTEPVTPTAPRTEAQGNSGPSSAVSTPVIELDLSGGRSEEKRPREESEALDVSP

TrEMBL top hits    e value    %identity    Alignment
A0A6J1CR42 uncharacterized protein LOC111013826    1.2e-98    69.78
Query:  MFEYGLKLSLHPFAQEFLNRTGPAPAQVAPNGWGVIFALAILFWLRARGEDEAELLSVNQLLGCFEAKRIAKKPGRYYMCARKGACGIVKGPTSIKGWVG
        MFEYGL+L LHPF QEFL RTG APAQVAPNGWGVIFALAILFWLRAR  +EAELL V+QLL CFEAKRIAKKPGR+YMCARKGA GIVKGPTSIKGWV 
Subjt:  MFEYGLKLSLHPFAQEFLNRTGPAPAQVAPNGWGVIFALAILFWLRARGEDEAELLSVNQLLGCFEAKRIAKKPGRYYMCARKGACGIVKGPTSIKGWVG

Query:  KWFFALGEWLANDESGRPFFDGSARFGNLVSIKPIPELDQATFDTLKFYKENFPKGRKIETLVTDKLLLESGLLDYNRLVQPIEAS------SNVCGFTG
        KWF+A GEWLA DESGR FFD   RFGNLVSI+P+PEL QA+FDTLK+YKE FP+GRK+ TLVTD+LLLESGLLDYN  V+PIE S      + VC F  
Subjt:  KWFFALGEWLANDESGRPFFDGSARFGNLVSIKPIPELDQATFDTLKFYKENFPKGRKIETLVTDKLLLESGLLDYNRLVQPIEAS------SNVCGFTG

Query:  SVKRKFKGRANALKTVVGTEPVTPTAPRTEAQGNSGPSSAVSTPVIELDLSGGRSEEKRPREESEALD
         VKRK KGRA+AL+    ++P TP           GP+S    PVIEL+ SGG S EKRPR+++EA+D
Subjt:  SVKRKFKGRANALKTVVGTEPVTPTAPRTEAQGNSGPSSAVSTPVIELDLSGGRSEEKRPREESEALD

A0A6J1DWD2 uncharacterized protein LOC111024680    1.8e-83    81.18
Query:  MFEYGLKLSLHPFAQEFLNRTGPAPAQVAPNGWGVIFALAILFWLRARGEDEAELLSVNQLLGCFEAKRIAKKPGRYYMCARKGACGIVKGPTSIKGWVG
        MFEYGL+L LHPF QEFL RTG APAQVAPNGWGVIFALAILFWLRAR  +EAELL V+QLL CFEAKRIAKKPGR+YMCARKGA GIVKGPTSIKGWV 
Subjt:  MFEYGLKLSLHPFAQEFLNRTGPAPAQVAPNGWGVIFALAILFWLRARGEDEAELLSVNQLLGCFEAKRIAKKPGRYYMCARKGACGIVKGPTSIKGWVG

Query:  KWFFALGEWLANDESGRPFFDGSARFGNLVSIKPIPELDQATFDTLKFYKENFPKGRKIETLVTDKLLLESGLLDYNRLVQPIEAS
        KWF+A GEWLA DESGR FFD   RFGNLVSI+P+PEL QA+FDTLK+YKE FP+GRK+ TLVTD+LLLESGLLDYN  V+PIE+S
Subjt:  KWFFALGEWLANDESGRPFFDGSARFGNLVSIKPIPELDQATFDTLKFYKENFPKGRKIETLVTDKLLLESGLLDYNRLVQPIEAS

A0A6J1DWF1 uncharacterized protein LOC111025108    4.7e-84    81.72
Query:  MFEYGLKLSLHPFAQEFLNRTGPAPAQVAPNGWGVIFALAILFWLRARGEDEAELLSVNQLLGCFEAKRIAKKPGRYYMCARKGACGIVKGPTSIKGWVG
        MFEYGL+L LHPF QEFL RTG APAQVAPNGWGVIFALAILFWLRAR  +EAELL V+QLL CFEAKRIAKKPGR+YMCARKGA GIVKGPTSIKGWV 
Subjt:  MFEYGLKLSLHPFAQEFLNRTGPAPAQVAPNGWGVIFALAILFWLRARGEDEAELLSVNQLLGCFEAKRIAKKPGRYYMCARKGACGIVKGPTSIKGWVG

Query:  KWFFALGEWLANDESGRPFFDGSARFGNLVSIKPIPELDQATFDTLKFYKENFPKGRKIETLVTDKLLLESGLLDYNRLVQPIEAS
        KWF+A GEWLA DESGR FFD   RFGNLVSI+P+PEL QA+FDTLK+YKE+FP+GRK+ TLVTDKLLLESGLLDYN  V+PIE+S
Subjt:  KWFFALGEWLANDESGRPFFDGSARFGNLVSIKPIPELDQATFDTLKFYKENFPKGRKIETLVTDKLLLESGLLDYNRLVQPIEAS

A0A6J1DXS5 uncharacterized protein LOC111025502    4.3e-138    71.15
Query:  LSSPSSSDSLGCLESELEEIENFRFSDDIEDSDTSTSGQGLEYPSRMPEHYLGPLRRRFNIPNDILLRIPEEGERADNPPEGWVTLYLKMFEYGLKLSLH
        +SS   SD    LES+LEEIEN R SDD EDSD STSGQGLEYPSR+PEHYLG LRR F IP +ILLR+PEEGERADNPPEGWVTLY KMFEYGL+L LH
Subjt:  LSSPSSSDSLGCLESELEEIENFRFSDDIEDSDTSTSGQGLEYPSRMPEHYLGPLRRRFNIPNDILLRIPEEGERADNPPEGWVTLYLKMFEYGLKLSLH

Query:  PFAQEFLNRTGPAPAQVAPNGWGVIFALAILFWLRARGEDEAELLSVNQLLGCFEAKRIAKKPGRYYMCARKGACGIVKGPTSIKGWVGKWFFALGEWLA
        PF QEFL RTG APAQVAPNGWGVIFALAILFWLRAR  +EAEL  V+QLL CFEAKRIAKKPGR+YMCARKGA GIVKGPTSIKGWV KWF+A GEWLA
Subjt:  PFAQEFLNRTGPAPAQVAPNGWGVIFALAILFWLRARGEDEAELLSVNQLLGCFEAKRIAKKPGRYYMCARKGACGIVKGPTSIKGWVGKWFFALGEWLA

Query:  NDESGRPFFDGSARFGNLVSIKPIPELDQATFDTLKFYKENFPKGRKIETLVTDKLLLESGLLDYNRLVQPIEAS------SNVCGFTGSVKRKFKGRAN
         DESGR FFD   RFGNLVSI+P+PEL QA+FDTLK+YKE FP+GRK+ TLVTD+LLLESGLLDYN  V+PIE+S      + VCGF   VKRK KGRA+
Subjt:  NDESGRPFFDGSARFGNLVSIKPIPELDQATFDTLKFYKENFPKGRKIETLVTDKLLLESGLLDYNRLVQPIEAS------SNVCGFTGSVKRKFKGRAN

Query:  ALKTVVGTEPVTPTAPRTEAQGNSGPSSAVSTPVIELDLSGGRSEEKRPREESEALD
        AL+    ++P TP           GP+S     VIEL+ SGG S EKRPR+++EA+D
Subjt:  ALKTVVGTEPVTPTAPRTEAQGNSGPSSAVSTPVIELDLSGGRSEEKRPREESEALD

A0A6J1DZB3 uncharacterized protein LOC111025665    3.9e-78    82.38
Query:  MCARKGACGIVKGPTSIKGWVGKWFFALGEWLANDESGRPFFDGSARFGNLVSIKPIPELDQATFDTLKFYKENFPKGRKIETLVTDKLLLESGLLDYNR
        MCARKG  GIVKGPTSIKGWVGKWFFA GEWLA DESGR FFD   RFGNLVSIK IPEL QATFDTLK YK++FP+ RKI TLVTDKLLLESGLLDYN 
Subjt:  MCARKGACGIVKGPTSIKGWVGKWFFALGEWLANDESGRPFFDGSARFGNLVSIKPIPELDQATFDTLKFYKENFPKGRKIETLVTDKLLLESGLLDYNR

Query:  LVQPIEAS------SNVCGFTGSVKRKFKGRANALKTVVGTEPVTPTAPRTEAQGNSGPSSAVSTPVIELDLSGGRSEEKRPREESEALDVSP
        LV+ IEAS      + VCGFTGSVKRK KGRA+ALKTVVGTEPVTPT PRT AQGNSGPSSAV TPVIELDLSGGRS EKR REESEALDVSP
Subjt:  LVQPIEAS------SNVCGFTGSVKRKFKGRANALKTVVGTEPVTPTAPRTEAQGNSGPSSAVSTPVIELDLSGGRSEEKRPREESEALDVSP

SwissProt top hits    e value    %identity    Alignment
No hits found

Arabidopsis top hits    e value    %identity    Alignment
AT1G32010.1 myosin heavy chain-related    1.3e-06    23.53
Query:  HRCVSPEDPSRSLITRVRRGGQSFLLALTLLSNMVVFLSSPSSSDSLGCLESELEEIENFRFSDDIEDSDTSTSGQGLEY------PSRMPEHYLGPLRR
        HR  S E PS  +  R     +  + + + +  +V  L   + SD  G          N    D+ E +D + SG+  +       P+      +G    
Subjt:  HRCVSPEDPSRSLITRVRRGGQSFLLALTLLSNMVVFLSSPSSSDSLGCLESELEEIENFRFSDDIEDSDTSTSGQGLEY------PSRMPEHYLGPLRR

Query:  RFNIPNDILLRIPEEGERADNPPEGWVTLYLKMF-EYGLKLSLHPFAQEFLNRTGPAPAQVAPNGWGVIFALAILFWLRARGEDEAELLSVNQLLGCFEA
           +P  + +RIP + +R  + PEG++ L+   F E GL+  +  F   F      A +Q+       I   A L  L AR       LSV  +      
Subjt:  RFNIPNDILLRIPEEGERADNPPEGWVTLYLKMF-EYGLKLSLHPFAQEFLNRTGPAPAQVAPNGWGVIFALAILFWLRARGEDEAELLSVNQLLGCFEA

Query:  KRIAKKPGRYYMCARKGACGIVKGPTSIKGWVGKWFFA
         ++  K G++Y+ + +G   +  GP+  + W+G +F+A
Subjt:  KRIAKKPGRYYMCARKGACGIVKGPTSIKGWVGKWFFA

AT2G15420.1 myosin heavy chain-related    5.1e-06    30.88
Query:  NIPNDILLRIPEEGERADNPPEGWVTLYLKMF-EYGLKLSLHPFAQEFLNRTGPAPAQVAPNGWGVIFALAILFWLRARGEDEAELLSVNQLLGCFEAKR
        N P +I L  P+  +R   PPEG++ LY   F   GL   L  F  E+  R   A +Q+          LAIL      G +    +  +         R
Subjt:  NIPNDILLRIPEEGERADNPPEGWVTLYLKMF-EYGLKLSLHPFAQEFLNRTGPAPAQVAPNGWGVIFALAILFWLRARGEDEAELLSVNQLLGCFEAKR

Query:  IAKKPGRYYMCARKGACGIVKGPTS-IKGWVGKWFF
        + + PG YY  A K    IV G  S I GW  ++FF
Subjt:  IAKKPGRYYMCARKGACGIVKGPTS-IKGWVGKWFF

AT5G38190.1 INVOLVED IN: biological_process unknown    8.7e-06    23.11
Query:  HRCVSPEDPSRSLITRVRRGGQSFLLALTLLSNMVVFLSSPSSSDSLGCLESELEEIENFRFSDDIEDSDTSTSGQGLEY------PSRMPEHYLGPLRR
        HR  S E PS  +  R     +  +   + +  +V  L   + SD  G          N    D+ E +D + SG+  +       P+      +G    
Subjt:  HRCVSPEDPSRSLITRVRRGGQSFLLALTLLSNMVVFLSSPSSSDSLGCLESELEEIENFRFSDDIEDSDTSTSGQGLEY------PSRMPEHYLGPLRR

Query:  RFNIPNDILLRIPEEGERADNPPEGWVTLYLKMF-EYGLKLSLHPFAQEFLNRTGPAPAQVAPNGWGVIFALAILFWLRARGEDEAELLSVNQLLGCFEA
           +P  + +RIP + +R  + PEG++ L+   F E GL+  +  F   F      A +Q+       I   A L  L AR       LSV  +      
Subjt:  RFNIPNDILLRIPEEGERADNPPEGWVTLYLKMF-EYGLKLSLHPFAQEFLNRTGPAPAQVAPNGWGVIFALAILFWLRARGEDEAELLSVNQLLGCFEA

Query:  KRIAKKPGRYYMCARKGACGIVKGPTSIKGWVGKWFFA
         ++  K G++Y+ + +G   +   P+  + W+G +F+A
Subjt:  KRIAKKPGRYYMCARKGACGIVKGPTSIKGWVGKWFFA


Sequences
CDS sequence
ATGAGCACTCTTTCTTCTGCACCTGGCCTTAAGCGCCTCGACCTGCTCATCGAACTTGTCAGAGTTGGAGAACTCTCGGGTGTAGGACGGAGAGTCTTCCGGCTCCCGAG
AGGCTTTCTTCTTCTTATCAGTTGGATAGATGGAGCTCCCTTTTCACCGGGTGCGCCTGAGACCCTAGAGGGAGACGCAGTTCAGTTAGCACGCGTTGCCTCGGCGTACA
TTTCTTCCATCGTGCGCAACCGATGTCGCATATCATCCAACTCGCGCTGGGGAGTAGACAAAGCCTCAGGGTCTGCCTCCGGGTTGGCCCTTCGAGAGACCTTTCTGGAG
GTTCCACCTCGACCTCGATTGGCTTTTGAGGGTTTCGGGTGGGCAAGAGATCGGGATGACTTAAGATGTTCTTCCCCACAAACGGCGCCAAATGTTTATGCAGGAATTTG
CACAACGGTTCTTCACGAATCGAGCTCGAACCCGGTCTCCGGTTCCGACCTGAACACTAGAGTGGACCTGCACAAAAGGGTGATGGATCTGACAGTACACACGACCGGCG
CTTATGTGTCTTTTCTCATATCAGACATGTCGGGTTCCGAGCAGGTTGGACCCCAGTCAGGAATTTGCACAACGGTTCTCCACGAATCGAGCTCGAACCCGGTCTCCGGT
TCTGACCTGAACACTAGAGTGGACCTACACAAGAGGGTGATGGATCCGACAGCACACACGACCGGCGGTTACATGTCTTTTCTTATATCGGACCTGTCGGGTTCCGAGCA
GGTCGGACCCCAGTCAGGTCGAACTTTGGTGCCCATACTTCATCTTTTAAGGGGCAAACCCGGTCACCTCGGCGGAGCTGAGGTGGACCTAAGCAATCCTTTTTATCTAA
TTTCTTCAAACACGAATAAGGGTCCTCCACGTGTCCCGGGTTGTCGGAGCAATCAAGCGTTTCACCGTTGCGTATCCCCAGAAGATCCCAGCCGCTCGTTGATTACACGT
GTACGACGCGGAGGTCAGTCATTTCTTCTGGCTCTTACTCTTCTTTCAAACATGGTAGTTTTCTTGTCTTCCCCCTCCAGTAGCGATAGCTTGGGTTGTTTAGAGTCCGA
GCTTGAAGAAATAGAGAACTTTAGGTTCTCAGATGACATAGAGGATAGTGATACCTCCACCTCGGGCCAGGGTCTGGAGTACCCTTCTAGGATGCCCGAGCATTATCTTG
GACCCCTTCGTAGGAGGTTTAACATTCCGAATGACATCCTCCTTAGGATTCCGGAGGAAGGGGAAAGAGCTGACAATCCCCCAGAGGGATGGGTCACTCTTTATCTCAAG
ATGTTTGAGTACGGCCTCAAGCTTTCCCTTCATCCTTTCGCCCAGGAGTTCTTAAACCGAACTGGACCGGCTCCTGCTCAAGTGGCCCCCAATGGGTGGGGTGTCATTTT
TGCTTTAGCCATTCTTTTTTGGTTGCGAGCTCGAGGCGAGGATGAGGCCGAGCTGCTAAGTGTTAACCAGCTTCTTGGGTGTTTTGAGGCCAAGAGGATAGCCAAAAAAC
CTGGTCGGTACTATATGTGCGCAAGGAAGGGCGCATGTGGCATAGTCAAGGGGCCGACCTCCATCAAGGGATGGGTAGGAAAGTGGTTCTTTGCCTTGGGTGAGTGGCTG
GCAAATGACGAGTCAGGTCGTCCATTCTTTGACGGGTCTGCTAGGTTTGGGAACCTAGTATCGATCAAGCCGATTCCCGAGCTCGATCAAGCCACTTTCGACACCCTCAA
GTTCTACAAGGAGAACTTCCCCAAGGGCAGGAAGATCGAAACCTTGGTCACCGACAAGCTTCTCTTGGAGTCGGGGCTTCTTGACTACAACCGTCTAGTTCAGCCAATCG
AAGCTTCAAGCAATGTGTGCGGATTTACTGGGAGTGTGAAGCGCAAGTTCAAGGGCCGTGCTAACGCCCTGAAGACTGTGGTGGGGACTGAACCGGTGACGCCTACGGCG
CCACGGACTGAGGCTCAGGGTAACTCTGGGCCTTCTTCTGCAGTCTCCACCCCTGTGATCGAACTAGACTTGTCTGGGGGTCGATCTGAAGAGAAGCGTCCGAGGGAAGA
GTCCGAGGCGCTTGACGTATCTCCCTGA
mRNA sequence
ATGAGCACTCTTTCTTCTGCACCTGGCCTTAAGCGCCTCGACCTGCTCATCGAACTTGTCAGAGTTGGAGAACTCTCGGGTGTAGGACGGAGAGTCTTCCGGCTCCCGAG
AGGCTTTCTTCTTCTTATCAGTTGGATAGATGGAGCTCCCTTTTCACCGGGTGCGCCTGAGACCCTAGAGGGAGACGCAGTTCAGTTAGCACGCGTTGCCTCGGCGTACA
TTTCTTCCATCGTGCGCAACCGATGTCGCATATCATCCAACTCGCGCTGGGGAGTAGACAAAGCCTCAGGGTCTGCCTCCGGGTTGGCCCTTCGAGAGACCTTTCTGGAG
GTTCCACCTCGACCTCGATTGGCTTTTGAGGGTTTCGGGTGGGCAAGAGATCGGGATGACTTAAGATGTTCTTCCCCACAAACGGCGCCAAATGTTTATGCAGGAATTTG
CACAACGGTTCTTCACGAATCGAGCTCGAACCCGGTCTCCGGTTCCGACCTGAACACTAGAGTGGACCTGCACAAAAGGGTGATGGATCTGACAGTACACACGACCGGCG
CTTATGTGTCTTTTCTCATATCAGACATGTCGGGTTCCGAGCAGGTTGGACCCCAGTCAGGAATTTGCACAACGGTTCTCCACGAATCGAGCTCGAACCCGGTCTCCGGT
TCTGACCTGAACACTAGAGTGGACCTACACAAGAGGGTGATGGATCCGACAGCACACACGACCGGCGGTTACATGTCTTTTCTTATATCGGACCTGTCGGGTTCCGAGCA
GGTCGGACCCCAGTCAGGTCGAACTTTGGTGCCCATACTTCATCTTTTAAGGGGCAAACCCGGTCACCTCGGCGGAGCTGAGGTGGACCTAAGCAATCCTTTTTATCTAA
TTTCTTCAAACACGAATAAGGGTCCTCCACGTGTCCCGGGTTGTCGGAGCAATCAAGCGTTTCACCGTTGCGTATCCCCAGAAGATCCCAGCCGCTCGTTGATTACACGT
GTACGACGCGGAGGTCAGTCATTTCTTCTGGCTCTTACTCTTCTTTCAAACATGGTAGTTTTCTTGTCTTCCCCCTCCAGTAGCGATAGCTTGGGTTGTTTAGAGTCCGA
GCTTGAAGAAATAGAGAACTTTAGGTTCTCAGATGACATAGAGGATAGTGATACCTCCACCTCGGGCCAGGGTCTGGAGTACCCTTCTAGGATGCCCGAGCATTATCTTG
GACCCCTTCGTAGGAGGTTTAACATTCCGAATGACATCCTCCTTAGGATTCCGGAGGAAGGGGAAAGAGCTGACAATCCCCCAGAGGGATGGGTCACTCTTTATCTCAAG
ATGTTTGAGTACGGCCTCAAGCTTTCCCTTCATCCTTTCGCCCAGGAGTTCTTAAACCGAACTGGACCGGCTCCTGCTCAAGTGGCCCCCAATGGGTGGGGTGTCATTTT
TGCTTTAGCCATTCTTTTTTGGTTGCGAGCTCGAGGCGAGGATGAGGCCGAGCTGCTAAGTGTTAACCAGCTTCTTGGGTGTTTTGAGGCCAAGAGGATAGCCAAAAAAC
CTGGTCGGTACTATATGTGCGCAAGGAAGGGCGCATGTGGCATAGTCAAGGGGCCGACCTCCATCAAGGGATGGGTAGGAAAGTGGTTCTTTGCCTTGGGTGAGTGGCTG
GCAAATGACGAGTCAGGTCGTCCATTCTTTGACGGGTCTGCTAGGTTTGGGAACCTAGTATCGATCAAGCCGATTCCCGAGCTCGATCAAGCCACTTTCGACACCCTCAA
GTTCTACAAGGAGAACTTCCCCAAGGGCAGGAAGATCGAAACCTTGGTCACCGACAAGCTTCTCTTGGAGTCGGGGCTTCTTGACTACAACCGTCTAGTTCAGCCAATCG
AAGCTTCAAGCAATGTGTGCGGATTTACTGGGAGTGTGAAGCGCAAGTTCAAGGGCCGTGCTAACGCCCTGAAGACTGTGGTGGGGACTGAACCGGTGACGCCTACGGCG
CCACGGACTGAGGCTCAGGGTAACTCTGGGCCTTCTTCTGCAGTCTCCACCCCTGTGATCGAACTAGACTTGTCTGGGGGTCGATCTGAAGAGAAGCGTCCGAGGGAAGA
GTCCGAGGCGCTTGACGTATCTCCCTGA
Protein sequence
MSTLSSAPGLKRLDLLIELVRVGELSGVGRRVFRLPRGFLLLISWIDGAPFSPGAPETLEGDAVQLARVASAYISSIVRNRCRISSNSRWGVDKASGSASGLALRETFLE
VPPRPRLAFEGFGWARDRDDLRCSSPQTAPNVYAGICTTVLHESSSNPVSGSDLNTRVDLHKRVMDLTVHTTGAYVSFLISDMSGSEQVGPQSGICTTVLHESSSNPVSG
SDLNTRVDLHKRVMDPTAHTTGGYMSFLISDLSGSEQVGPQSGRTLVPILHLLRGKPGHLGGAEVDLSNPFYLISSNTNKGPPRVPGCRSNQAFHRCVSPEDPSRSLITR
VRRGGQSFLLALTLLSNMVVFLSSPSSSDSLGCLESELEEIENFRFSDDIEDSDTSTSGQGLEYPSRMPEHYLGPLRRRFNIPNDILLRIPEEGERADNPPEGWVTLYLK
MFEYGLKLSLHPFAQEFLNRTGPAPAQVAPNGWGVIFALAILFWLRARGEDEAELLSVNQLLGCFEAKRIAKKPGRYYMCARKGACGIVKGPTSIKGWVGKWFFALGEWL
ANDESGRPFFDGSARFGNLVSIKPIPELDQATFDTLKFYKENFPKGRKIETLVTDKLLLESGLLDYNRLVQPIEASSNVCGFTGSVKRKFKGRANALKTVVGTEPVTPTA
PRTEAQGNSGPSSAVSTPVIELDLSGGRSEEKRPREESEALDVSP
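
The relationship between the CDS and protein sequences listed above can be checked by translating the CDS codon by codon. The following is a minimal Python sketch, assuming the standard nuclear genetic code (NCBI translation table 1) applies to this gene; `translate` is a hypothetical helper written for illustration, and only the first 30 and last 12 nucleotides of the CDS are used to keep the example short.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)

# First 30 nt of the CDS sequence listed above.
print(translate("ATGAGCACTCTTTCTTCTGCACCTGGCCTT"))
# Last 12 nt of the CDS sequence, ending in the TGA stop codon.
print(translate("GTATCTCCCTGA"))
```

The first call yields MSTLSSAPGL and the second yields VSP, matching the start and end of the protein sequence shown on this page.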