; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc07g06400 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc07g06400
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionINVOLVED IN: biological_process unknown; LOCATED IN: chloroplast; BEST Arabidopsis thaliana protein match is: my s in 277 species: Archae - 0; Bacteria - 0; Metazoa - 736; Fungi - 347; Plants - 385; Viruses - 0; Other Eukaryotes - 339 (source: NCBI BLink).
Genome locationchr7:5262062..5268531
RNA-Seq ExpressionMoc07g06400
SyntenyMoc07g06400
Gene Ontology termsGO:0005524 - ATP binding (molecular function)
GO:0016887 - ATPase activity (molecular function)
GO:0046872 - metal ion binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022138041.1 uncharacterized protein LOC111009298 [Momordica charantia]3.5e-6361.67Show/hide
Query:  MCARKGAGGIVKGPTSIKGVGR-----------FGNLVLIKPIPELGQATFDTHKFYKDNFPRGRKIGTLVTDKLLLESGLLDFNPLFRPIEASRPNSEL
        MCARKGA GIVKGPTSIKG  R               V I+P+PEL QA+FDT K+YK++FPRGRK+GTLVTDKLLLESGLLD+NP  RPIE+SRPNSEL
Subjt:  MCARKGAGGIVKGPTSIKGVGR-----------FGNLVLIKPIPELGQATFDTHKFYKDNFPRGRKIGTLVTDKLLLESGLLDFNPLFRPIEASRPNSEL

Query:  AM-------------------NAAQDQE-------GPSSAAPTPVIELDSTRERSREKRSRSESEALDVSPL-REVREGFPLKRRKKKKKTTSSLEVGPR
        AM                    AAQ  +       GP+S  P PVIEL+S+R  SREKR R ++EA+DVSPL  EVRE  PLKRR+KKKKTTS LEVG R
Subjt:  AM-------------------NAAQDQE-------GPSSAAPTPVIELDSTRERSREKRSRSESEALDVSPL-REVREGFPLKRRKKKKKTTSSLEVGPR

Query:  GPLPSSHADLVDDPEAQMGGTSDVKMRFRMEPSSSGVKDQ
        G LP+S AD VDDPEA+MGGT DV  RFR+EPSSSGV+DQ
Subjt:  GPLPSSHADLVDDPEAQMGGTSDVKMRFRMEPSSSGVKDQ

XP_022144034.1 uncharacterized protein LOC111013826 [Momordica charantia]9.5e-6155.38Show/hide
Query:  MFEYGLRLPLHPFAQEFLNRTRLAPAQ---------------------DEDEAELLSVDQLLGCFEAKRIAKKPGRYYMCARKGAGGIVKGPTSIKG---
        MFEYGLRLPLHPF QEFL RT LAPAQ                     D +EAELL VDQLL CFEAKRIAKKPGR+YMCARKGAGGIVKGPTSIKG   
Subjt:  MFEYGLRLPLHPFAQEFLNRTRLAPAQ---------------------DEDEAELLSVDQLLGCFEAKRIAKKPGRYYMCARKGAGGIVKGPTSIKG---

Query:  ----------------------VGRFGNLVLIKPIPELGQATFDTHKFYKDNFPRGRKIGTLVTDKLLLESGLLDFNPLFRPIEASRPNSELAM------
                                RFGNLV I+P+PEL QA+FDT K+YK+ FPRGRK+GTLVTD+LLLESGLLD+NP  RPIE SRPNS LAM      
Subjt:  ----------------------VGRFGNLVLIKPIPELGQATFDTHKFYKDNFPRGRKIGTLVTDKLLLESGLLDFNPLFRPIEASRPNSELAM------

Query:  -------------NAAQDQE-------GPSSAAPTPVIELDSTRERSREKRSRSESEALD
                      AAQ  +       GP+S  P PVIEL+S+   SREKR R ++EA+D
Subjt:  -------------NAAQDQE-------GPSSAAPTPVIELDSTRERSREKRSRSESEALD

XP_022158650.1 uncharacterized protein LOC111025108 [Momordica charantia]2.7e-5562.89Show/hide
Query:  MFEYGLRLPLHPFAQEFLNRTRLAPAQ---------------------DEDEAELLSVDQLLGCFEAKRIAKKPGRYYMCARKGAGGIVKGPTSIKG---
        MFEYGLRLPLHPF QEFL RT LAPAQ                     D +EAELL VDQLL CFEAKRIAKKPGR+YMCARKGA GIVKGPTSIKG   
Subjt:  MFEYGLRLPLHPFAQEFLNRTRLAPAQ---------------------DEDEAELLSVDQLLGCFEAKRIAKKPGRYYMCARKGAGGIVKGPTSIKG---

Query:  ----------------------VGRFGNLVLIKPIPELGQATFDTHKFYKDNFPRGRKIGTLVTDKLLLESGLLDFNPLFRPIEASRPNSELAM
                                RFGNLV I+P+PEL QA+FDT K+YK++FPRGRK+GTLVTDKLLLESGLLD+NP  RPIE+SRPNSEL M
Subjt:  ----------------------VGRFGNLVLIKPIPELGQATFDTHKFYKDNFPRGRKIGTLVTDKLLLESGLLDFNPLFRPIEASRPNSELAM

XP_022159063.1 uncharacterized protein LOC111025502, partial [Momordica charantia]3.8e-10262.17Show/hide
Query:  LARRLESELEEIENFRFSDDGEDSDTSTSGQGLEYPSRMPEHYLGPLRRGFNIPNDILLRIPEEGERADNRPEGWVTLYLKMFEYGLRLPLHPFAQEFLN
        LARRLES+LEEIEN R SDDGEDSD STSGQGLEYPSR+PEHYLG LRRGF IP +ILLR+PEEGERADN PEGWVTLY KMFEYGLRLPLHPF QEFL 
Subjt:  LARRLESELEEIENFRFSDDGEDSDTSTSGQGLEYPSRMPEHYLGPLRRGFNIPNDILLRIPEEGERADNRPEGWVTLYLKMFEYGLRLPLHPFAQEFLN

Query:  RTRLAPAQ---------------------DEDEAELLSVDQLLGCFEAKRIAKKPGRYYMCARKGAGGIVKGPTSIKG----------------------
        RT LAPAQ                     D +EAEL  VDQLL CFEAKRIAKKPGR+YMCARKGAGGIVKGPTSIKG                      
Subjt:  RTRLAPAQ---------------------DEDEAELLSVDQLLGCFEAKRIAKKPGRYYMCARKGAGGIVKGPTSIKG----------------------

Query:  ---VGRFGNLVLIKPIPELGQATFDTHKFYKDNFPRGRKIGTLVTDKLLLESGLLDFNPLFRPIEASRPNSELAM-------------------NAAQDQ
             RFGNLV I+P+PEL QA+FDT K+YK+ FPRGRK+GTLVTD+LLLESGLLD+NP  RPIE+SRPNSELAM                    AAQ  
Subjt:  ---VGRFGNLVLIKPIPELGQATFDTHKFYKDNFPRGRKIGTLVTDKLLLESGLLDFNPLFRPIEASRPNSELAM-------------------NAAQDQ

Query:  E-------GPSSAAPTPVIELDSTRERSREKRSRSESEALD
        +       GP+S  P  VIEL+S+   SREKR R ++EA+D
Subjt:  E-------GPSSAAPTPVIELDSTRERSREKRSRSESEALD

XP_022159252.1 uncharacterized protein LOC111025665 [Momordica charantia]4.2e-10167.36Show/hide
Query:  MCARKGAGGIVKGPTSIKG-VG------------------------RFGNLVLIKPIPELGQATFDTHKFYKDNFPRGRKIGTLVTDKLLLESGLLDFNP
        MCARKG GGIVKGPTSIKG VG                        RFGNLV IK IPEL QATFDT K YKD+FPR RKI TLVTDKLLLESGLLD+NP
Subjt:  MCARKGAGGIVKGPTSIKG-VG------------------------RFGNLVLIKPIPELGQATFDTHKFYKDNFPRGRKIGTLVTDKLLLESGLLDFNP

Query:  LFRPIEASRPNSELAM----------------------------------NAAQDQEGPSSAAPTPVIELDSTRERSREKRSRSESEALDVSPLREVREG
        L R IEASRPNSELAM                                    AQ   GPSSA PTPVIELD +  RS EKRSR ESEALDVSPL EVR  
Subjt:  LFRPIEASRPNSELAM----------------------------------NAAQDQEGPSSAAPTPVIELDSTRERSREKRSRSESEALDVSPLREVREG

Query:  FPLKRRKKKKKTTSSLEVGPRGPLPSSHADLVDDPEAQMGGTSDVKMRFRMEPSSSGVKDQVSRISAACLDRCLRRASKFVSDPRSVLQRTIDHTVEAFT
         PL+RR+KKKKT+SS E G RG LP+SHADLVDDPEA+M GTS+V+MRF MEPSSSGVKDQVSRISA CLDR LRRASKFVSDP SVLQRTID+  EAF 
Subjt:  FPLKRRKKKKKTTSSLEVGPRGPLPSSHADLVDDPEAQMGGTSDVKMRFRMEPSSSGVKDQVSRISAACLDRCLRRASKFVSDPRSVLQRTIDHTVEAFT

Query:  ASIHSAVMIKAELDGREALAAKERENSSAALEAATTL
        ASIH AVM+KAELDGREALAAKERENS AALEAATTL
Subjt:  ASIHSAVMIKAELDGREALAAKERENSSAALEAATTL

TrEMBL top hitse value%identityAlignment
A0A6J1C8K9 uncharacterized protein LOC1110092981.7e-6361.67Show/hide
Query:  MCARKGAGGIVKGPTSIKGVGR-----------FGNLVLIKPIPELGQATFDTHKFYKDNFPRGRKIGTLVTDKLLLESGLLDFNPLFRPIEASRPNSEL
        MCARKGA GIVKGPTSIKG  R               V I+P+PEL QA+FDT K+YK++FPRGRK+GTLVTDKLLLESGLLD+NP  RPIE+SRPNSEL
Subjt:  MCARKGAGGIVKGPTSIKGVGR-----------FGNLVLIKPIPELGQATFDTHKFYKDNFPRGRKIGTLVTDKLLLESGLLDFNPLFRPIEASRPNSEL

Query:  AM-------------------NAAQDQE-------GPSSAAPTPVIELDSTRERSREKRSRSESEALDVSPL-REVREGFPLKRRKKKKKTTSSLEVGPR
        AM                    AAQ  +       GP+S  P PVIEL+S+R  SREKR R ++EA+DVSPL  EVRE  PLKRR+KKKKTTS LEVG R
Subjt:  AM-------------------NAAQDQE-------GPSSAAPTPVIELDSTRERSREKRSRSESEALDVSPL-REVREGFPLKRRKKKKKTTSSLEVGPR

Query:  GPLPSSHADLVDDPEAQMGGTSDVKMRFRMEPSSSGVKDQ
        G LP+S AD VDDPEA+MGGT DV  RFR+EPSSSGV+DQ
Subjt:  GPLPSSHADLVDDPEAQMGGTSDVKMRFRMEPSSSGVKDQ

A0A6J1CR42 uncharacterized protein LOC1110138264.6e-6155.38Show/hide
Query:  MFEYGLRLPLHPFAQEFLNRTRLAPAQ---------------------DEDEAELLSVDQLLGCFEAKRIAKKPGRYYMCARKGAGGIVKGPTSIKG---
        MFEYGLRLPLHPF QEFL RT LAPAQ                     D +EAELL VDQLL CFEAKRIAKKPGR+YMCARKGAGGIVKGPTSIKG   
Subjt:  MFEYGLRLPLHPFAQEFLNRTRLAPAQ---------------------DEDEAELLSVDQLLGCFEAKRIAKKPGRYYMCARKGAGGIVKGPTSIKG---

Query:  ----------------------VGRFGNLVLIKPIPELGQATFDTHKFYKDNFPRGRKIGTLVTDKLLLESGLLDFNPLFRPIEASRPNSELAM------
                                RFGNLV I+P+PEL QA+FDT K+YK+ FPRGRK+GTLVTD+LLLESGLLD+NP  RPIE SRPNS LAM      
Subjt:  ----------------------VGRFGNLVLIKPIPELGQATFDTHKFYKDNFPRGRKIGTLVTDKLLLESGLLDFNPLFRPIEASRPNSELAM------

Query:  -------------NAAQDQE-------GPSSAAPTPVIELDSTRERSREKRSRSESEALD
                      AAQ  +       GP+S  P PVIEL+S+   SREKR R ++EA+D
Subjt:  -------------NAAQDQE-------GPSSAAPTPVIELDSTRERSREKRSRSESEALD

A0A6J1DWF1 uncharacterized protein LOC1110251081.3e-5562.89Show/hide
Query:  MFEYGLRLPLHPFAQEFLNRTRLAPAQ---------------------DEDEAELLSVDQLLGCFEAKRIAKKPGRYYMCARKGAGGIVKGPTSIKG---
        MFEYGLRLPLHPF QEFL RT LAPAQ                     D +EAELL VDQLL CFEAKRIAKKPGR+YMCARKGA GIVKGPTSIKG   
Subjt:  MFEYGLRLPLHPFAQEFLNRTRLAPAQ---------------------DEDEAELLSVDQLLGCFEAKRIAKKPGRYYMCARKGAGGIVKGPTSIKG---

Query:  ----------------------VGRFGNLVLIKPIPELGQATFDTHKFYKDNFPRGRKIGTLVTDKLLLESGLLDFNPLFRPIEASRPNSELAM
                                RFGNLV I+P+PEL QA+FDT K+YK++FPRGRK+GTLVTDKLLLESGLLD+NP  RPIE+SRPNSEL M
Subjt:  ----------------------VGRFGNLVLIKPIPELGQATFDTHKFYKDNFPRGRKIGTLVTDKLLLESGLLDFNPLFRPIEASRPNSELAM

A0A6J1DXS5 uncharacterized protein LOC1110255021.8e-10262.17Show/hide
Query:  LARRLESELEEIENFRFSDDGEDSDTSTSGQGLEYPSRMPEHYLGPLRRGFNIPNDILLRIPEEGERADNRPEGWVTLYLKMFEYGLRLPLHPFAQEFLN
        LARRLES+LEEIEN R SDDGEDSD STSGQGLEYPSR+PEHYLG LRRGF IP +ILLR+PEEGERADN PEGWVTLY KMFEYGLRLPLHPF QEFL 
Subjt:  LARRLESELEEIENFRFSDDGEDSDTSTSGQGLEYPSRMPEHYLGPLRRGFNIPNDILLRIPEEGERADNRPEGWVTLYLKMFEYGLRLPLHPFAQEFLN

Query:  RTRLAPAQ---------------------DEDEAELLSVDQLLGCFEAKRIAKKPGRYYMCARKGAGGIVKGPTSIKG----------------------
        RT LAPAQ                     D +EAEL  VDQLL CFEAKRIAKKPGR+YMCARKGAGGIVKGPTSIKG                      
Subjt:  RTRLAPAQ---------------------DEDEAELLSVDQLLGCFEAKRIAKKPGRYYMCARKGAGGIVKGPTSIKG----------------------

Query:  ---VGRFGNLVLIKPIPELGQATFDTHKFYKDNFPRGRKIGTLVTDKLLLESGLLDFNPLFRPIEASRPNSELAM-------------------NAAQDQ
             RFGNLV I+P+PEL QA+FDT K+YK+ FPRGRK+GTLVTD+LLLESGLLD+NP  RPIE+SRPNSELAM                    AAQ  
Subjt:  ---VGRFGNLVLIKPIPELGQATFDTHKFYKDNFPRGRKIGTLVTDKLLLESGLLDFNPLFRPIEASRPNSELAM-------------------NAAQDQ

Query:  E-------GPSSAAPTPVIELDSTRERSREKRSRSESEALD
        +       GP+S  P  VIEL+S+   SREKR R ++EA+D
Subjt:  E-------GPSSAAPTPVIELDSTRERSREKRSRSESEALD

A0A6J1DZB3 uncharacterized protein LOC1110256652.0e-10167.36Show/hide
Query:  MCARKGAGGIVKGPTSIKG-VG------------------------RFGNLVLIKPIPELGQATFDTHKFYKDNFPRGRKIGTLVTDKLLLESGLLDFNP
        MCARKG GGIVKGPTSIKG VG                        RFGNLV IK IPEL QATFDT K YKD+FPR RKI TLVTDKLLLESGLLD+NP
Subjt:  MCARKGAGGIVKGPTSIKG-VG------------------------RFGNLVLIKPIPELGQATFDTHKFYKDNFPRGRKIGTLVTDKLLLESGLLDFNP

Query:  LFRPIEASRPNSELAM----------------------------------NAAQDQEGPSSAAPTPVIELDSTRERSREKRSRSESEALDVSPLREVREG
        L R IEASRPNSELAM                                    AQ   GPSSA PTPVIELD +  RS EKRSR ESEALDVSPL EVR  
Subjt:  LFRPIEASRPNSELAM----------------------------------NAAQDQEGPSSAAPTPVIELDSTRERSREKRSRSESEALDVSPLREVREG

Query:  FPLKRRKKKKKTTSSLEVGPRGPLPSSHADLVDDPEAQMGGTSDVKMRFRMEPSSSGVKDQVSRISAACLDRCLRRASKFVSDPRSVLQRTIDHTVEAFT
         PL+RR+KKKKT+SS E G RG LP+SHADLVDDPEA+M GTS+V+MRF MEPSSSGVKDQVSRISA CLDR LRRASKFVSDP SVLQRTID+  EAF 
Subjt:  FPLKRRKKKKKTTSSLEVGPRGPLPSSHADLVDDPEAQMGGTSDVKMRFRMEPSSSGVKDQVSRISAACLDRCLRRASKFVSDPRSVLQRTIDHTVEAFT

Query:  ASIHSAVMIKAELDGREALAAKERENSSAALEAATTL
        ASIH AVM+KAELDGREALAAKERENS AALEAATTL
Subjt:  ASIHSAVMIKAELDGREALAAKERENSSAALEAATTL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G32010.1 myosin heavy chain-related6.7e-0425Show/hide
Query:  RLESELEEIENFRFSDDGEDSDTSTSGQGLEY------PSRMPEHYLGPLRRGFNIPNDILLRIPEEGERADNRPEGWVTLYLKMF-EYGLRLPLHPFAQ
        R+ ++ +   N    D+ E +D + SG+  +       P+      +G       +P  + +RIP + +R  + PEG++ L+   F E GLR P+  F  
Subjt:  RLESELEEIENFRFSDDGEDSDTSTSGQGLEY------PSRMPEHYLGPLRRGFNIPNDILLRIPEEGERADNRPEGWVTLYLKMF-EYGLRLPLHPFAQ

Query:  EFLNRTRLAPAQ
         F    ++A +Q
Subjt:  EFLNRTRLAPAQ

AT5G38190.1 INVOLVED IN: biological_process unknown6.7e-0427.72Show/hide
Query:  RFSDD-GEDSDTSTSGQGLEY------PSRMPEHYLGPLRRGFNIPNDILLRIPEEGERADNRPEGWVTLYLKMF-EYGLRLPLHPFAQEFLNRTRLAPA
        R++DD  E +D + SG+  +       P+      +G       +P  + +RIP + +R  + PEG++ L+   F E GLR P+  F   F    ++A +
Subjt:  RFSDD-GEDSDTSTSGQGLEY------PSRMPEHYLGPLRRGFNIPNDILLRIPEEGERADNRPEGWVTLYLKMF-EYGLRLPLHPFAQEFLNRTRLAPA

Query:  Q
        Q
Subjt:  Q


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTACTTTTGTGCTCATATTAACGCCTCTGAGAAGGTAGCTGAGGTTTTCTTCCCAAAAGAGAATAACAACTTCTCATCTTTTACATCAAACATGAATGCGAGAAGTGA
TACTCCATCGGTCGAACCCTTTGTCTTTTTCGTTTTCCATTGGGATTTTATAGCTTCTCAGCCTGTTCATGCCAATTTTTGTAGCTGTGTAGTTGTTGGACCTAGTCCAC
TGAAGACTTCATCTCGCAGTTTGTCATTTCCTCTCATCATTAACTTGCTTTGGTTTGGTGGGTTATTTGTCGACTCCACGACATCTTCAATTTCTGTGGCCTTGTCTTTT
CACGGATCTCGGACCAGTGTTGGGGATCCGACTGACGGTTATTTGTATCATCCCCTGAAGGTCGGACGTGTCGAGTGCCTCGTGTCGAACCCATTTATCAACCGTACTGC
AAGAAGTAATTATGCAGGAATTTGCACAACGGTTCTTCAGGAATCAAGCTCGAACCCGGTCTCCGGTTCCGACCTGAACACTAGAGTGGACCTGCACAAGAGGGTGACGG
ATCCGACAGCACACACGACCGGCGGTTACATGTCTTTTCTCATGTCGGACCTGTCGGGTTCCGAGCAGGTCGGTCCTCAGTTCGATCTCGACCTGGCAGAGAAGTTTATT
CGAATCTATTTTGGACACGTGGCGACTTTTCATTTGCAGAGGGAATATGACCGTTGCGGAAGACGTTTCGACCTGCCAGGTTGTTGGAGCACTCAAGTGTTTCGCCGTTG
CGTATCTCGAGAAGATCCTAGCCGCTCGTTGATTACACGTGTACGGTGCAGAGGTCAGTCACTTTTCCTTGCTCTTACTCTTCTTTCAAACATGGTAGTTTTCTTGTCTT
CCCCCTCCAGTAGTGATAGCCTGGGTCGTGTAGGTCGGACAATAAGTAGTTCGCCCCCCAAGCCAAGTGACTCTGGGGAGGTCTTAGCTCGTAGGTTAGAGTCCGAGCTG
GAAGAAATAGAGAACTTTAGGTTCTCAGATGACGGAGAGGATAGCGATACCTCCACCTCGGGCCAGGGTCTGGAGTACCCTTCTAGGATGCCCGAGCACTATCTTGGACC
CCTTCGTAGAGGGTTTAACATTCCGAATGACATCCTCCTTAGGATTCCGGAGGAAGGGGAAAGAGCTGACAATCGTCCAGAGGGATGGGTCACTCTTTATTTGAAGATGT
TTGAGTATGGCCTCAGACTTCCCCTTCATCCCTTTGCTCAGGAGTTCTTAAACCGAACTAGACTGGCTCCTGCTCAAGACGAGGATGAGGCCGAGTTGCTAAGTGTTGAC
CAGCTTCTTGGGTGTTTTGAGGCCAAGAGGATAGCCAAAAAACCAGGTCGATACTATATGTGCGCAAGGAAGGGCGCGGGTGGTATAGTCAAGGGGCCGACCTCCATCAA
AGGAGTGGGTAGGTTTGGGAACCTAGTATTGATCAAGCCGATTCCCGAGCTCGGTCAAGCCACTTTTGACACCCACAAATTCTACAAGGACAACTTCCCAAGGGGCCGGA
AGATCGGGACCTTGGTCACCGACAAACTGCTGCTAGAATCAGGGCTATTGGACTTCAATCCTTTATTTCGCCCGATTGAAGCTTCGAGGCCAAACTCCGAGCTTGCCATG
AATGCAGCTCAGGACCAGGAGGGTCCATCTTCTGCAGCTCCAACTCCGGTGATTGAGTTGGATTCTACTAGGGAGCGCTCTAGGGAGAAGCGCTCAAGGAGTGAGTCCGA
AGCCTTGGACGTGTCACCACTTCGTGAGGTGAGAGAGGGCTTTCCTCTGAAGAGGAGGAAGAAAAAGAAGAAGACCACCTCCTCCTTGGAGGTTGGACCTCGTGGTCCCC
TGCCCTCAAGCCACGCCGATCTGGTAGATGACCCGGAAGCTCAGATGGGGGGGACATCCGACGTGAAGATGCGGTTCAGAATGGAACCGTCGAGCTCCGGGGTGAAAGAC
CAGGTGTCACGCATCTCGGCTGCCTGCTTGGATCGCTGTCTCAGGAGAGCATCCAAGTTTGTGAGCGACCCAAGGTCGGTGCTGCAACGGACTATCGACCACACCGTCGA
GGCGTTCACTGCCTCCATACATTCAGCAGTCATGATCAAGGCCGAGCTGGATGGAAGGGAGGCCTTGGCAGCGAAGGAGAGGGAGAACTCCTCTGCTGCCTTGGAGGCTG
CCACTACGCTAGGGCGAGCTGCTGAAGGCTCGGAGCGAGGTGGATATACTAAGGGCCGAGGTTGA
mRNA sequenceShow/hide mRNA sequence
ATGTACTTTTGTGCTCATATTAACGCCTCTGAGAAGGTAGCTGAGGTTTTCTTCCCAAAAGAGAATAACAACTTCTCATCTTTTACATCAAACATGAATGCGAGAAGTGA
TACTCCATCGGTCGAACCCTTTGTCTTTTTCGTTTTCCATTGGGATTTTATAGCTTCTCAGCCTGTTCATGCCAATTTTTGTAGCTGTGTAGTTGTTGGACCTAGTCCAC
TGAAGACTTCATCTCGCAGTTTGTCATTTCCTCTCATCATTAACTTGCTTTGGTTTGGTGGGTTATTTGTCGACTCCACGACATCTTCAATTTCTGTGGCCTTGTCTTTT
CACGGATCTCGGACCAGTGTTGGGGATCCGACTGACGGTTATTTGTATCATCCCCTGAAGGTCGGACGTGTCGAGTGCCTCGTGTCGAACCCATTTATCAACCGTACTGC
AAGAAGTAATTATGCAGGAATTTGCACAACGGTTCTTCAGGAATCAAGCTCGAACCCGGTCTCCGGTTCCGACCTGAACACTAGAGTGGACCTGCACAAGAGGGTGACGG
ATCCGACAGCACACACGACCGGCGGTTACATGTCTTTTCTCATGTCGGACCTGTCGGGTTCCGAGCAGGTCGGTCCTCAGTTCGATCTCGACCTGGCAGAGAAGTTTATT
CGAATCTATTTTGGACACGTGGCGACTTTTCATTTGCAGAGGGAATATGACCGTTGCGGAAGACGTTTCGACCTGCCAGGTTGTTGGAGCACTCAAGTGTTTCGCCGTTG
CGTATCTCGAGAAGATCCTAGCCGCTCGTTGATTACACGTGTACGGTGCAGAGGTCAGTCACTTTTCCTTGCTCTTACTCTTCTTTCAAACATGGTAGTTTTCTTGTCTT
CCCCCTCCAGTAGTGATAGCCTGGGTCGTGTAGGTCGGACAATAAGTAGTTCGCCCCCCAAGCCAAGTGACTCTGGGGAGGTCTTAGCTCGTAGGTTAGAGTCCGAGCTG
GAAGAAATAGAGAACTTTAGGTTCTCAGATGACGGAGAGGATAGCGATACCTCCACCTCGGGCCAGGGTCTGGAGTACCCTTCTAGGATGCCCGAGCACTATCTTGGACC
CCTTCGTAGAGGGTTTAACATTCCGAATGACATCCTCCTTAGGATTCCGGAGGAAGGGGAAAGAGCTGACAATCGTCCAGAGGGATGGGTCACTCTTTATTTGAAGATGT
TTGAGTATGGCCTCAGACTTCCCCTTCATCCCTTTGCTCAGGAGTTCTTAAACCGAACTAGACTGGCTCCTGCTCAAGACGAGGATGAGGCCGAGTTGCTAAGTGTTGAC
CAGCTTCTTGGGTGTTTTGAGGCCAAGAGGATAGCCAAAAAACCAGGTCGATACTATATGTGCGCAAGGAAGGGCGCGGGTGGTATAGTCAAGGGGCCGACCTCCATCAA
AGGAGTGGGTAGGTTTGGGAACCTAGTATTGATCAAGCCGATTCCCGAGCTCGGTCAAGCCACTTTTGACACCCACAAATTCTACAAGGACAACTTCCCAAGGGGCCGGA
AGATCGGGACCTTGGTCACCGACAAACTGCTGCTAGAATCAGGGCTATTGGACTTCAATCCTTTATTTCGCCCGATTGAAGCTTCGAGGCCAAACTCCGAGCTTGCCATG
AATGCAGCTCAGGACCAGGAGGGTCCATCTTCTGCAGCTCCAACTCCGGTGATTGAGTTGGATTCTACTAGGGAGCGCTCTAGGGAGAAGCGCTCAAGGAGTGAGTCCGA
AGCCTTGGACGTGTCACCACTTCGTGAGGTGAGAGAGGGCTTTCCTCTGAAGAGGAGGAAGAAAAAGAAGAAGACCACCTCCTCCTTGGAGGTTGGACCTCGTGGTCCCC
TGCCCTCAAGCCACGCCGATCTGGTAGATGACCCGGAAGCTCAGATGGGGGGGACATCCGACGTGAAGATGCGGTTCAGAATGGAACCGTCGAGCTCCGGGGTGAAAGAC
CAGGTGTCACGCATCTCGGCTGCCTGCTTGGATCGCTGTCTCAGGAGAGCATCCAAGTTTGTGAGCGACCCAAGGTCGGTGCTGCAACGGACTATCGACCACACCGTCGA
GGCGTTCACTGCCTCCATACATTCAGCAGTCATGATCAAGGCCGAGCTGGATGGAAGGGAGGCCTTGGCAGCGAAGGAGAGGGAGAACTCCTCTGCTGCCTTGGAGGCTG
CCACTACGCTAGGGCGAGCTGCTGAAGGCTCGGAGCGAGGTGGATATACTAAGGGCCGAGGTTGA
Protein sequenceShow/hide protein sequence
MYFCAHINASEKVAEVFFPKENNNFSSFTSNMNARSDTPSVEPFVFFVFHWDFIASQPVHANFCSCVVVGPSPLKTSSRSLSFPLIINLLWFGGLFVDSTTSSISVALSF
HGSRTSVGDPTDGYLYHPLKVGRVECLVSNPFINRTARSNYAGICTTVLQESSSNPVSGSDLNTRVDLHKRVTDPTAHTTGGYMSFLMSDLSGSEQVGPQFDLDLAEKFI
RIYFGHVATFHLQREYDRCGRRFDLPGCWSTQVFRRCVSREDPSRSLITRVRCRGQSLFLALTLLSNMVVFLSSPSSSDSLGRVGRTISSSPPKPSDSGEVLARRLESEL
EEIENFRFSDDGEDSDTSTSGQGLEYPSRMPEHYLGPLRRGFNIPNDILLRIPEEGERADNRPEGWVTLYLKMFEYGLRLPLHPFAQEFLNRTRLAPAQDEDEAELLSVD
QLLGCFEAKRIAKKPGRYYMCARKGAGGIVKGPTSIKGVGRFGNLVLIKPIPELGQATFDTHKFYKDNFPRGRKIGTLVTDKLLLESGLLDFNPLFRPIEASRPNSELAM
NAAQDQEGPSSAAPTPVIELDSTRERSREKRSRSESEALDVSPLREVREGFPLKRRKKKKKTTSSLEVGPRGPLPSSHADLVDDPEAQMGGTSDVKMRFRMEPSSSGVKD
QVSRISAACLDRCLRRASKFVSDPRSVLQRTIDHTVEAFTASIHSAVMIKAELDGREALAAKERENSSAALEAATTLGRAAEGSERGGYTKGRG