CuGenDBv2

Moc04g14840 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc04g14840
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: INVOLVED IN: biological_process unknown; LOCATED IN: chloroplast; BEST Arabidopsis thaliana protein match is: my s in 277 species: Archae - 0; Bacteria - 0; Metazoa - 736; Fungi - 347; Plants - 385; Viruses - 0; Other Eukaryotes - 339 (source: NCBI BLink).
Genome location: chr4:11324021..11329780
RNA-Seq Expression: Moc04g14840
Synteny: Moc04g14840
Gene Ontology terms: GO:0016020 - membrane (cellular component)
InterPro domains: NA
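
The genome location string above can be split into chromosome, start, and end programmatically. The following is a minimal sketch only: the function name `parse_location` is hypothetical, and the span length assumes 1-based inclusive coordinates, which this page does not state.

```python
import re

def parse_location(location):
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)

chrom, start, end = parse_location("chr4:11324021..11329780")
# Span length under the ASSUMED 1-based inclusive convention.
span = end - start + 1
print(chrom, start, end, span)
```

The span covers the whole gene model on the chromosome, so it can be longer than the CDS listed further down.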


Homology
GenBank top hits    e value    %identity    Alignment
XP_022144034.1 uncharacterized protein LOC111013826 [Momordica charantia]    4.2e-89    64.06
Query:  MFEYGLRLPLHPFAQEFLNRTGLAPAQVAPNGWGVIFALAILFWLRARDEDEAELLSVDELLGCFEAKRIAKKP--------------------------
        MFEYGLRLPLHPF QEFL RTGLAPAQVAPNGWGVIFALAILFWLRARD +EAELL VD+LL CFEAKRIAKKP                          
Subjt:  MFEYGLRLPLHPFAQEFLNRTGLAPAQVAPNGWGVIFALAILFWLRARDEDEAELLSVDELLGCFEAKRIAKKP--------------------------

Query:  -------EWLAKDESGRAFFDVPARFGNLVSIKPIPELDQATFDTLKFYKENFPKGRKIGTLVTDKLLLESGLLDYNPLVRPIEASRPNSELAMVYGFTG
               EWLAKDESGR+FFDVP RFGNLVSI+P+PEL QA+FDTLK+YKE FP+GRK+GTLVTD+LLLESGLLDYNP VRPIE SRPNS LAMV  F  
Subjt:  -------EWLAKDESGRAFFDVPARFGNLVSIKPIPELDQATFDTLKFYKENFPKGRKIGTLVTDKLLLESGLLDYNPLVRPIEASRPNSELAMVYGFTG

Query:  SVKRKSKGRAHALKTVVSPEPMTPSVPRTEAQGNSDPSSAVPTPVIKLDLSGGRSEDKRPREESVAL-------DVSPLNE
         VKRKSKGRAHAL+   S +P TP+V          P+S  P PVI+L+ SGG S +KRPR+++ A+       DV PL E
Subjt:  SVKRKSKGRAHALKTVVSPEPMTPSVPRTEAQGNSDPSSAVPTPVIKLDLSGGRSEDKRPREESVAL-------DVSPLNE

XP_022158122.1 uncharacterized protein LOC111024680 [Momordica charantia]    1.8e-71    72.4
Query:  MFEYGLRLPLHPFAQEFLNRTGLAPAQVAPNGWGVIFALAILFWLRARDEDEAELLSVDELLGCFEAKRIAKKP--------------------------
        MFEYGLRLPLHPF QEFL RTGLAPAQVAPNGWGVIFALAILFWLRARD +EAELL VD+LL CFEAKRIAKKP                          
Subjt:  MFEYGLRLPLHPFAQEFLNRTGLAPAQVAPNGWGVIFALAILFWLRARDEDEAELLSVDELLGCFEAKRIAKKP--------------------------

Query:  -------EWLAKDESGRAFFDVPARFGNLVSIKPIPELDQATFDTLKFYKENFPKGRKIGTLVTDKLLLESGLLDYNPLVRPIEASRPNSEL
               EWLAKDESGR+FFDVP RFGNLVSI+P+PEL QA+FDTLK+YKE FP+GRK+GTLVTD+LLLESGLLDYNP VRPIE+SRPNSEL
Subjt:  -------EWLAKDESGRAFFDVPARFGNLVSIKPIPELDQATFDTLKFYKENFPKGRKIGTLVTDKLLLESGLLDYNPLVRPIEASRPNSEL

XP_022158650.1 uncharacterized protein LOC111025108 [Momordica charantia]    1.2e-72    72.68
Query:  MFEYGLRLPLHPFAQEFLNRTGLAPAQVAPNGWGVIFALAILFWLRARDEDEAELLSVDELLGCFEAKRIAKKP--------------------------
        MFEYGLRLPLHPF QEFL RTGLAPAQVAPNGWGVIFALAILFWLRARD +EAELL VD+LL CFEAKRIAKKP                          
Subjt:  MFEYGLRLPLHPFAQEFLNRTGLAPAQVAPNGWGVIFALAILFWLRARDEDEAELLSVDELLGCFEAKRIAKKP--------------------------

Query:  -------EWLAKDESGRAFFDVPARFGNLVSIKPIPELDQATFDTLKFYKENFPKGRKIGTLVTDKLLLESGLLDYNPLVRPIEASRPNSELAM
               EWLAKDESGR+FFDVP RFGNLVSI+P+PEL QA+FDTLK+YKE+FP+GRK+GTLVTDKLLLESGLLDYNP VRPIE+SRPNSEL M
Subjt:  -------EWLAKDESGRAFFDVPARFGNLVSIKPIPELDQATFDTLKFYKENFPKGRKIGTLVTDKLLLESGLLDYNPLVRPIEASRPNSELAM

XP_022159063.1 uncharacterized protein LOC111025502, partial [Momordica charantia]    2.2e-130    69.91
Query:  LARRSESELEEIENFRFSDDEEDSDTSTSGQGLEYPSRMPEHYLGPLRRGFNIPNNILLRIPQEGERADNPPEGWVTLYLKMFEYGLRLPLHPFAQEFLN
        LARR ES+LEEIEN R SDD EDSD STSGQGLEYPSR+PEHYLG LRRGF IP NILLR+P+EGERADNPPEGWVTLY KMFEYGLRLPLHPF QEFL 
Subjt:  LARRSESELEEIENFRFSDDEEDSDTSTSGQGLEYPSRMPEHYLGPLRRGFNIPNNILLRIPQEGERADNPPEGWVTLYLKMFEYGLRLPLHPFAQEFLN

Query:  RTGLAPAQVAPNGWGVIFALAILFWLRARDEDEAELLSVDELLGCFEAKRIAKKP---------------------------------EWLAKDESGRAF
        RTGLAPAQVAPNGWGVIFALAILFWLRARD +EAEL  VD+LL CFEAKRIAKKP                                 EWLAKDESGR+F
Subjt:  RTGLAPAQVAPNGWGVIFALAILFWLRARDEDEAELLSVDELLGCFEAKRIAKKP---------------------------------EWLAKDESGRAF

Query:  FDVPARFGNLVSIKPIPELDQATFDTLKFYKENFPKGRKIGTLVTDKLLLESGLLDYNPLVRPIEASRPNSELAMVYGFTGSVKRKSKGRAHALKTVVSP
        FDVP RFGNLVSI+P+PEL QA+FDTLK+YKE FP+GRK+GTLVTD+LLLESGLLDYNP VRPIE+SRPNSELAMV GF   VKRKSKGRAHAL+   S 
Subjt:  FDVPARFGNLVSIKPIPELDQATFDTLKFYKENFPKGRKIGTLVTDKLLLESGLLDYNPLVRPIEASRPNSELAMVYGFTGSVKRKSKGRAHALKTVVSP

Query:  EPMTPSVPRTEAQGNSDPSSAVPTPVIKLDLSGGRSEDKRPREESVALD
        +P TP+V          P+S  P  VI+L+ SGG S +KRPR+++ A+D
Subjt:  EPMTPSVPRTEAQGNSDPSSAVPTPVIKLDLSGGRSEDKRPREESVALD

XP_022159252.1 uncharacterized protein LOC111025665 [Momordica charantia]    4.8e-133    84.97
Query:  EWLAKDESGRAFFDVPARFGNLVSIKPIPELDQATFDTLKFYKENFPKGRKIGTLVTDKLLLESGLLDYNPLVRPIEASRPNSELAMVYGFTGSVKRKSK
        EWLAKDESGRAFFDVP RFGNLVSIK IPEL QATFDTLK YK++FP+ RKI TLVTDKLLLESGLLDYNPLVR IEASRPNSELAMV GFTGSVKRKSK
Subjt:  EWLAKDESGRAFFDVPARFGNLVSIKPIPELDQATFDTLKFYKENFPKGRKIGTLVTDKLLLESGLLDYNPLVRPIEASRPNSELAMVYGFTGSVKRKSK

Query:  GRAHALKTVVSPEPMTPSVPRTEAQGNSDPSSAVPTPVIKLDLSGGRSEDKRPREESVALDVSPLNEVRGESPLRRKRKKKKTSSPSEAGACGTLPASHA
        GRAHALKTVV  EP+TP+VPRT AQGNS PSSAVPTPVI+LDLSGGRS +KR REES ALDVSPLNEVRGESPLRR+RKKKKTSS SEAGA GTLP SHA
Subjt:  GRAHALKTVVSPEPMTPSVPRTEAQGNSDPSSAVPTPVIKLDLSGGRSEDKRPREESVALDVSPLNEVRGESPLRRKRKKKKTSSPSEAGACGTLPASHA

Query:  DLVDDPVARMGGTFDVRTRFRMEPSSSGVKDQVSCISATCLDRCLRRASKFMSDPGSVLQRTIDYAANAFVASIHSAIMVKVEPDGREDPAAKERENSSA
        DLVDDP ARM GT +VR RF MEPSSSGVKDQVS ISATCLDR LRRASKF+SDPGSVLQRTID  A AF+ASIH A+MVK E DGRE  AAKERENS A
Subjt:  DLVDDPVARMGGTFDVRTRFRMEPSSSGVKDQVSCISATCLDRCLRRASKFMSDPGSVLQRTIDYAANAFVASIHSAIMVKVEPDGREDPAAKERENSSA

Query:  ALEAAT
        ALEAAT
Subjt:  ALEAAT

TrEMBL top hits    e value    %identity    Alignment
A0A6J1CR42 uncharacterized protein LOC111013826    2.0e-89    64.06
Query:  MFEYGLRLPLHPFAQEFLNRTGLAPAQVAPNGWGVIFALAILFWLRARDEDEAELLSVDELLGCFEAKRIAKKP--------------------------
        MFEYGLRLPLHPF QEFL RTGLAPAQVAPNGWGVIFALAILFWLRARD +EAELL VD+LL CFEAKRIAKKP                          
Subjt:  MFEYGLRLPLHPFAQEFLNRTGLAPAQVAPNGWGVIFALAILFWLRARDEDEAELLSVDELLGCFEAKRIAKKP--------------------------

Query:  -------EWLAKDESGRAFFDVPARFGNLVSIKPIPELDQATFDTLKFYKENFPKGRKIGTLVTDKLLLESGLLDYNPLVRPIEASRPNSELAMVYGFTG
               EWLAKDESGR+FFDVP RFGNLVSI+P+PEL QA+FDTLK+YKE FP+GRK+GTLVTD+LLLESGLLDYNP VRPIE SRPNS LAMV  F  
Subjt:  -------EWLAKDESGRAFFDVPARFGNLVSIKPIPELDQATFDTLKFYKENFPKGRKIGTLVTDKLLLESGLLDYNPLVRPIEASRPNSELAMVYGFTG

Query:  SVKRKSKGRAHALKTVVSPEPMTPSVPRTEAQGNSDPSSAVPTPVIKLDLSGGRSEDKRPREESVAL-------DVSPLNE
         VKRKSKGRAHAL+   S +P TP+V          P+S  P PVI+L+ SGG S +KRPR+++ A+       DV PL E
Subjt:  SVKRKSKGRAHALKTVVSPEPMTPSVPRTEAQGNSDPSSAVPTPVIKLDLSGGRSEDKRPREESVAL-------DVSPLNE

A0A6J1DWD2 uncharacterized protein LOC111024680    8.6e-72    72.4
Query:  MFEYGLRLPLHPFAQEFLNRTGLAPAQVAPNGWGVIFALAILFWLRARDEDEAELLSVDELLGCFEAKRIAKKP--------------------------
        MFEYGLRLPLHPF QEFL RTGLAPAQVAPNGWGVIFALAILFWLRARD +EAELL VD+LL CFEAKRIAKKP                          
Subjt:  MFEYGLRLPLHPFAQEFLNRTGLAPAQVAPNGWGVIFALAILFWLRARDEDEAELLSVDELLGCFEAKRIAKKP--------------------------

Query:  -------EWLAKDESGRAFFDVPARFGNLVSIKPIPELDQATFDTLKFYKENFPKGRKIGTLVTDKLLLESGLLDYNPLVRPIEASRPNSEL
               EWLAKDESGR+FFDVP RFGNLVSI+P+PEL QA+FDTLK+YKE FP+GRK+GTLVTD+LLLESGLLDYNP VRPIE+SRPNSEL
Subjt:  -------EWLAKDESGRAFFDVPARFGNLVSIKPIPELDQATFDTLKFYKENFPKGRKIGTLVTDKLLLESGLLDYNPLVRPIEASRPNSEL

A0A6J1DWF1 uncharacterized protein LOC111025108    6.0e-73    72.68
Query:  MFEYGLRLPLHPFAQEFLNRTGLAPAQVAPNGWGVIFALAILFWLRARDEDEAELLSVDELLGCFEAKRIAKKP--------------------------
        MFEYGLRLPLHPF QEFL RTGLAPAQVAPNGWGVIFALAILFWLRARD +EAELL VD+LL CFEAKRIAKKP                          
Subjt:  MFEYGLRLPLHPFAQEFLNRTGLAPAQVAPNGWGVIFALAILFWLRARDEDEAELLSVDELLGCFEAKRIAKKP--------------------------

Query:  -------EWLAKDESGRAFFDVPARFGNLVSIKPIPELDQATFDTLKFYKENFPKGRKIGTLVTDKLLLESGLLDYNPLVRPIEASRPNSELAM
               EWLAKDESGR+FFDVP RFGNLVSI+P+PEL QA+FDTLK+YKE+FP+GRK+GTLVTDKLLLESGLLDYNP VRPIE+SRPNSEL M
Subjt:  -------EWLAKDESGRAFFDVPARFGNLVSIKPIPELDQATFDTLKFYKENFPKGRKIGTLVTDKLLLESGLLDYNPLVRPIEASRPNSELAM

A0A6J1DXS5 uncharacterized protein LOC111025502    1.1e-130    69.91
Query:  LARRSESELEEIENFRFSDDEEDSDTSTSGQGLEYPSRMPEHYLGPLRRGFNIPNNILLRIPQEGERADNPPEGWVTLYLKMFEYGLRLPLHPFAQEFLN
        LARR ES+LEEIEN R SDD EDSD STSGQGLEYPSR+PEHYLG LRRGF IP NILLR+P+EGERADNPPEGWVTLY KMFEYGLRLPLHPF QEFL 
Subjt:  LARRSESELEEIENFRFSDDEEDSDTSTSGQGLEYPSRMPEHYLGPLRRGFNIPNNILLRIPQEGERADNPPEGWVTLYLKMFEYGLRLPLHPFAQEFLN

Query:  RTGLAPAQVAPNGWGVIFALAILFWLRARDEDEAELLSVDELLGCFEAKRIAKKP---------------------------------EWLAKDESGRAF
        RTGLAPAQVAPNGWGVIFALAILFWLRARD +EAEL  VD+LL CFEAKRIAKKP                                 EWLAKDESGR+F
Subjt:  RTGLAPAQVAPNGWGVIFALAILFWLRARDEDEAELLSVDELLGCFEAKRIAKKP---------------------------------EWLAKDESGRAF

Query:  FDVPARFGNLVSIKPIPELDQATFDTLKFYKENFPKGRKIGTLVTDKLLLESGLLDYNPLVRPIEASRPNSELAMVYGFTGSVKRKSKGRAHALKTVVSP
        FDVP RFGNLVSI+P+PEL QA+FDTLK+YKE FP+GRK+GTLVTD+LLLESGLLDYNP VRPIE+SRPNSELAMV GF   VKRKSKGRAHAL+   S 
Subjt:  FDVPARFGNLVSIKPIPELDQATFDTLKFYKENFPKGRKIGTLVTDKLLLESGLLDYNPLVRPIEASRPNSELAMVYGFTGSVKRKSKGRAHALKTVVSP

Query:  EPMTPSVPRTEAQGNSDPSSAVPTPVIKLDLSGGRSEDKRPREESVALD
        +P TP+V          P+S  P  VI+L+ SGG S +KRPR+++ A+D
Subjt:  EPMTPSVPRTEAQGNSDPSSAVPTPVIKLDLSGGRSEDKRPREESVALD

A0A6J1DZB3 uncharacterized protein LOC111025665    2.3e-133    84.97
Query:  EWLAKDESGRAFFDVPARFGNLVSIKPIPELDQATFDTLKFYKENFPKGRKIGTLVTDKLLLESGLLDYNPLVRPIEASRPNSELAMVYGFTGSVKRKSK
        EWLAKDESGRAFFDVP RFGNLVSIK IPEL QATFDTLK YK++FP+ RKI TLVTDKLLLESGLLDYNPLVR IEASRPNSELAMV GFTGSVKRKSK
Subjt:  EWLAKDESGRAFFDVPARFGNLVSIKPIPELDQATFDTLKFYKENFPKGRKIGTLVTDKLLLESGLLDYNPLVRPIEASRPNSELAMVYGFTGSVKRKSK

Query:  GRAHALKTVVSPEPMTPSVPRTEAQGNSDPSSAVPTPVIKLDLSGGRSEDKRPREESVALDVSPLNEVRGESPLRRKRKKKKTSSPSEAGACGTLPASHA
        GRAHALKTVV  EP+TP+VPRT AQGNS PSSAVPTPVI+LDLSGGRS +KR REES ALDVSPLNEVRGESPLRR+RKKKKTSS SEAGA GTLP SHA
Subjt:  GRAHALKTVVSPEPMTPSVPRTEAQGNSDPSSAVPTPVIKLDLSGGRSEDKRPREESVALDVSPLNEVRGESPLRRKRKKKKTSSPSEAGACGTLPASHA

Query:  DLVDDPVARMGGTFDVRTRFRMEPSSSGVKDQVSCISATCLDRCLRRASKFMSDPGSVLQRTIDYAANAFVASIHSAIMVKVEPDGREDPAAKERENSSA
        DLVDDP ARM GT +VR RF MEPSSSGVKDQVS ISATCLDR LRRASKF+SDPGSVLQRTID  A AF+ASIH A+MVK E DGRE  AAKERENS A
Subjt:  DLVDDPVARMGGTFDVRTRFRMEPSSSGVKDQVSCISATCLDRCLRRASKFMSDPGSVLQRTIDYAANAFVASIHSAIMVKVEPDGREDPAAKERENSSA

Query:  ALEAAT
        ALEAAT
Subjt:  ALEAAT

SwissProt top hits    e value    %identity    Alignment
No hits found

Arabidopsis top hits    e value    %identity    Alignment
AT1G32010.1 myosin heavy chain-related    1.5e-04    28.43
Query:  RFSDDE-EDSDTSTSGQGLEY------PSRMPEHYLGPLRRGFNIPNNILLRIPQEGERADNPPEGWVTLYLKMF-EYGLRLPLHPFAQEFLNRTGLAPA
        R++DDE E +D + SG+  +       P+      +G       +P  + +RIP++ +R  + PEG++ L+   F E GLR P+  F   F     +A +
Subjt:  RFSDDE-EDSDTSTSGQGLEY------PSRMPEHYLGPLRRGFNIPNNILLRIPQEGERADNPPEGWVTLYLKMF-EYGLRLPLHPFAQEFLNRTGLAPA

Query:  QV
        Q+
Subjt:  QV

AT5G38190.1 INVOLVED IN: biological_process unknown    2.0e-04    28.43
Query:  RFSDDE-EDSDTSTSGQGLEY------PSRMPEHYLGPLRRGFNIPNNILLRIPQEGERADNPPEGWVTLYLKMF-EYGLRLPLHPFAQEFLNRTGLAPA
        R++DDE E +D + SG+  +       P+      +G       +P  + +RIP++ +R  + PEG++ L+   F E GLR P+  F   F     +A +
Subjt:  RFSDDE-EDSDTSTSGQGLEY------PSRMPEHYLGPLRRGFNIPNNILLRIPQEGERADNPPEGWVTLYLKMF-EYGLRLPLHPFAQEFLNRTGLAPA

Query:  QV
        Q+
Subjt:  QV


Sequences
CDS sequence
ATGCTAGATGAGGTTGTAGGGCTTGAGCGGTACCCGCTGCAGTCGAATTGTCAATTAACGTCTGTAGTGCTTCTACCAGTAAAGCCACCTAAGGATTCACTTGATGAGCT
ACCGGGGGAACTAGAGGGACTACCCCTTCCTGAGGAGCTGGTTGAGGAACTGGAAGGACTACCCCTCCCTGAGGAGCTGCCAGAGGAACGTGGGGGTCTACTACATTCTC
AATTCCAGGATTCGGGTGTTTATGCAGAAATCTGCACAACGGTTCTTCACGAATCGAGCTCGAACCCAGTCTCCGGTTTCGACCTGAACACTAGAGTGGACCTGCACAAG
AGGGCGAACACTCCGACGCTCAAGTCAAATACATTTCGACCTGCCAGGTTGTCGGAGTACTCAAGCGTTTCGTCGTTGTGTATTCCGAGGATATCTCAGCCGCTCGTTGA
TTACACGTGTACGGCGCAGAGGTTTTTCCGATCAGCTATAAATAGTGCCGAAACTTCAGTTTTCTTATCTTCCCTCTCCAGTAGTGATAGCCTGGGTAGTATAGGTCGGA
CAATAAGTAGTTTGCCCCTCAAGCCAAGTGACTCCGGGGAGGTCTTAGCTCGTAGGTCAGAGTCTGAGCTGGAGGAAATAGAGAACTTTAGGTTTTCAGATGATGAAGAG
GATAGCGATACCTCCACCTCGGGCCAGGGTCTGGAGTACCCTTCTAGGATGCCCGAACACTACCTTGGACCCCTTCGTAGGGGGTTTAACATTCCGAATAACATCCTCCT
TAGGATTCCGCAGGAAGGGGAAAGAGCTGACAATCCTCCAGAGGGATGGGTCACTCTTTATTTGAAAATGTTTGAGTACGGCCTCAGACTTCCCCTTCATCCCTTTGCTC
AAGAGTTCTTAAACCGAACCGGACTGGCTCCTGCTCAAGTGGCCCCCAATGGGTGGGGTGTCATTTTTGCTTTAGCCATTCTTTTTTGGTTGCGAGCTCGGGATGAGGAT
GAGGCCGAGTTGCTAAGTGTCGACGAGCTTCTCGGGTGTTTTGAGGCTAAGAGGATAGCCAAGAAACCAGAGTGGCTGGCAAAGGACGAATCAGGTCGTGCCTTTTTTGA
CGTGCCTGCTAGGTTTGGGAACCTAGTGTCGATCAAGCCGATCCCCGAGCTCGATCAAGCCACTTTCGACACACTCAAGTTCTACAAAGAGAACTTCCCCAAGGGCAGGA
AGATCGGAACCTTGGTCACCGACAAGCTTCTCTTGGAGTCGGGGCTTCTTGACTACAACCCTCTAGTTCGGCCAATTGAAGCTTCAAGGCCAAACTCTGAACTCGCAATG
GTGTACGGATTCACTGGAAGTGTGAAGCGCAAGTCCAAGGGCCGTGCTCACGCCCTTAAGACCGTGGTGAGCCCTGAACCAATGACGCCTTCGGTGCCACGGACTGAGGC
TCAGGGTAACTCCGACCCATCCTCTGCAGTTCCCACCCCCGTGATCAAACTGGACCTGTCCGGGGGTCGATCTGAAGATAAGCGTCCAAGGGAGGAGTCCGTGGCACTTG
ATGTATCTCCCCTGAACGAGGTGAGGGGAGAGTCTCCTTTGAGGAGAAAGAGAAAGAAGAAGAAGACCTCCTCCCCCTCGGAGGCTGGGGCTTGTGGGACCCTGCCCGCG
AGCCATGCTGACCTGGTGGACGACCCCGTAGCTCGGATGGGGGGAACATTCGACGTGCGAACGCGGTTCAGGATGGAACCATCAAGCTCTGGGGTGAAGGACCAGGTATC
CTGCATCTCGGCCACGTGCTTGGACCGCTGTCTGAGGAGAGCATCCAAGTTCATGAGTGATCCTGGGTCCGTACTGCAGAGGACCATCGATTACGCTGCCAATGCGTTTG
TCGCTTCCATTCATTCAGCTATTATGGTCAAGGTTGAGCCGGATGGAAGGGAGGATCCGGCAGCCAAGGAGAGGGAGAACTCTTCTGCTGCCTTAGAGGCTGCCACCTGA
mRNA sequence
ATGCTAGATGAGGTTGTAGGGCTTGAGCGGTACCCGCTGCAGTCGAATTGTCAATTAACGTCTGTAGTGCTTCTACCAGTAAAGCCACCTAAGGATTCACTTGATGAGCT
ACCGGGGGAACTAGAGGGACTACCCCTTCCTGAGGAGCTGGTTGAGGAACTGGAAGGACTACCCCTCCCTGAGGAGCTGCCAGAGGAACGTGGGGGTCTACTACATTCTC
AATTCCAGGATTCGGGTGTTTATGCAGAAATCTGCACAACGGTTCTTCACGAATCGAGCTCGAACCCAGTCTCCGGTTTCGACCTGAACACTAGAGTGGACCTGCACAAG
AGGGCGAACACTCCGACGCTCAAGTCAAATACATTTCGACCTGCCAGGTTGTCGGAGTACTCAAGCGTTTCGTCGTTGTGTATTCCGAGGATATCTCAGCCGCTCGTTGA
TTACACGTGTACGGCGCAGAGGTTTTTCCGATCAGCTATAAATAGTGCCGAAACTTCAGTTTTCTTATCTTCCCTCTCCAGTAGTGATAGCCTGGGTAGTATAGGTCGGA
CAATAAGTAGTTTGCCCCTCAAGCCAAGTGACTCCGGGGAGGTCTTAGCTCGTAGGTCAGAGTCTGAGCTGGAGGAAATAGAGAACTTTAGGTTTTCAGATGATGAAGAG
GATAGCGATACCTCCACCTCGGGCCAGGGTCTGGAGTACCCTTCTAGGATGCCCGAACACTACCTTGGACCCCTTCGTAGGGGGTTTAACATTCCGAATAACATCCTCCT
TAGGATTCCGCAGGAAGGGGAAAGAGCTGACAATCCTCCAGAGGGATGGGTCACTCTTTATTTGAAAATGTTTGAGTACGGCCTCAGACTTCCCCTTCATCCCTTTGCTC
AAGAGTTCTTAAACCGAACCGGACTGGCTCCTGCTCAAGTGGCCCCCAATGGGTGGGGTGTCATTTTTGCTTTAGCCATTCTTTTTTGGTTGCGAGCTCGGGATGAGGAT
GAGGCCGAGTTGCTAAGTGTCGACGAGCTTCTCGGGTGTTTTGAGGCTAAGAGGATAGCCAAGAAACCAGAGTGGCTGGCAAAGGACGAATCAGGTCGTGCCTTTTTTGA
CGTGCCTGCTAGGTTTGGGAACCTAGTGTCGATCAAGCCGATCCCCGAGCTCGATCAAGCCACTTTCGACACACTCAAGTTCTACAAAGAGAACTTCCCCAAGGGCAGGA
AGATCGGAACCTTGGTCACCGACAAGCTTCTCTTGGAGTCGGGGCTTCTTGACTACAACCCTCTAGTTCGGCCAATTGAAGCTTCAAGGCCAAACTCTGAACTCGCAATG
GTGTACGGATTCACTGGAAGTGTGAAGCGCAAGTCCAAGGGCCGTGCTCACGCCCTTAAGACCGTGGTGAGCCCTGAACCAATGACGCCTTCGGTGCCACGGACTGAGGC
TCAGGGTAACTCCGACCCATCCTCTGCAGTTCCCACCCCCGTGATCAAACTGGACCTGTCCGGGGGTCGATCTGAAGATAAGCGTCCAAGGGAGGAGTCCGTGGCACTTG
ATGTATCTCCCCTGAACGAGGTGAGGGGAGAGTCTCCTTTGAGGAGAAAGAGAAAGAAGAAGAAGACCTCCTCCCCCTCGGAGGCTGGGGCTTGTGGGACCCTGCCCGCG
AGCCATGCTGACCTGGTGGACGACCCCGTAGCTCGGATGGGGGGAACATTCGACGTGCGAACGCGGTTCAGGATGGAACCATCAAGCTCTGGGGTGAAGGACCAGGTATC
CTGCATCTCGGCCACGTGCTTGGACCGCTGTCTGAGGAGAGCATCCAAGTTCATGAGTGATCCTGGGTCCGTACTGCAGAGGACCATCGATTACGCTGCCAATGCGTTTG
TCGCTTCCATTCATTCAGCTATTATGGTCAAGGTTGAGCCGGATGGAAGGGAGGATCCGGCAGCCAAGGAGAGGGAGAACTCTTCTGCTGCCTTAGAGGCTGCCACCTGA
Protein sequence
MLDEVVGLERYPLQSNCQLTSVVLLPVKPPKDSLDELPGELEGLPLPEELVEELEGLPLPEELPEERGGLLHSQFQDSGVYAEICTTVLHESSSNPVSGFDLNTRVDLHK
RANTPTLKSNTFRPARLSEYSSVSSLCIPRISQPLVDYTCTAQRFFRSAINSAETSVFLSSLSSSDSLGSIGRTISSLPLKPSDSGEVLARRSESELEEIENFRFSDDEE
DSDTSTSGQGLEYPSRMPEHYLGPLRRGFNIPNNILLRIPQEGERADNPPEGWVTLYLKMFEYGLRLPLHPFAQEFLNRTGLAPAQVAPNGWGVIFALAILFWLRARDED
EAELLSVDELLGCFEAKRIAKKPEWLAKDESGRAFFDVPARFGNLVSIKPIPELDQATFDTLKFYKENFPKGRKIGTLVTDKLLLESGLLDYNPLVRPIEASRPNSELAM
VYGFTGSVKRKSKGRAHALKTVVSPEPMTPSVPRTEAQGNSDPSSAVPTPVIKLDLSGGRSEDKRPREESVALDVSPLNEVRGESPLRRKRKKKKTSSPSEAGACGTLPA
SHADLVDDPVARMGGTFDVRTRFRMEPSSSGVKDQVSCISATCLDRCLRRASKFMSDPGSVLQRTIDYAANAFVASIHSAIMVKVEPDGREDPAAKERENSSAALEAAT
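
The protein sequence can be checked against the CDS by translating it codon by codon. The sketch below assumes the standard nuclear genetic code and uses a hypothetical helper named `translate`; it is an illustration, not the method CuGenDB uses. The two inputs are the first 30 and last 12 nucleotides of the CDS listed above.

```python
from itertools import product

# Standard genetic code, with codons enumerated in TCAG order
# (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
               "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

print(translate("ATGCTAGATGAGGTTGTAGGGCTTGAGCGG"))  # start of the CDS
print(translate("GCTGCCACCTGA"))                    # end of the CDS
```

Joining the CDS lines into one string and passing it to `translate` should give the full protein for comparison with the sequence listed above.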