; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc08g13210 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc08g13210
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionINVOLVED IN: biological_process unknown; LOCATED IN: chloroplast; BEST Arabidopsis thaliana protein match is: my s in 277 species: Archae - 0; Bacteria - 0; Metazoa - 736; Fungi - 347; Plants - 385; Viruses - 0; Other Eukaryotes - 339 (source: NCBI BLink).
Genome locationchr8:10020403..10022532
RNA-Seq ExpressionMoc08g13210
SyntenyMoc08g13210
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
GO:0043167 - ion binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022144034.1 uncharacterized protein LOC111013826 [Momordica charantia]3.0e-10073.12Show/hide
Query:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKSGRFYMCARKGANGIVKGPTSIKGWVR
        MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKK GRFYMCARKGA GIVKGPTSIKGWVR
Subjt:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKSGRFYMCARKGANGIVKGPTSIKGWVR

Query:  KLFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQAYFDTLKYYKERFPRGRKVGTLVTDKLLLESGLLDYNPAVRPIESS--------------
        K FYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQA FDTLKYYKERFPRGRKVGTLVTD+LLLESGLLDYNPAVRPIE S              
Subjt:  KLFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQAYFDTLKYYKERFPRGRKVGTLVTDKLLLESGLLDYNPAVRPIESS--------------

Query:  -----SHGLRICKQREAQVQGPSPCSRGHPEFETYHPCCGRASHGRSSPSDRAGVFWGSFEGEAPRDQTEAVDVSPLGE
             S G     +     + P+P   G P  E   P     S G  S   R        + EA   QTEA DV PLGE
Subjt:  -----SHGLRICKQREAQVQGPSPCSRGHPEFETYHPCCGRASHGRSSPSDRAGVFWGSFEGEAPRDQTEAVDVSPLGE

XP_022158122.1 uncharacterized protein LOC111024680 [Momordica charantia]2.5e-9997.31Show/hide
Query:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKSGRFYMCARKGANGIVKGPTSIKGWVR
        MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKK GRFYMCARKGA GIVKGPTSIKGWVR
Subjt:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKSGRFYMCARKGANGIVKGPTSIKGWVR

Query:  KLFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQAYFDTLKYYKERFPRGRKVGTLVTDKLLLESGLLDYNPAVRPIESS
        K FYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQA FDTLKYYKERFPRGRKVGTLVTD+LLLESGLLDYNPAVRPIESS
Subjt:  KLFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQAYFDTLKYYKERFPRGRKVGTLVTDKLLLESGLLDYNPAVRPIESS

XP_022158650.1 uncharacterized protein LOC111025108 [Momordica charantia]2.5e-9997.31Show/hide
Query:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKSGRFYMCARKGANGIVKGPTSIKGWVR
        MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKK GRFYMCARKGA+GIVKGPTSIKGWVR
Subjt:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKSGRFYMCARKGANGIVKGPTSIKGWVR

Query:  KLFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQAYFDTLKYYKERFPRGRKVGTLVTDKLLLESGLLDYNPAVRPIESS
        K FYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQA FDTLKYYKE FPRGRKVGTLVTDKLLLESGLLDYNPAVRPIESS
Subjt:  KLFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQAYFDTLKYYKERFPRGRKVGTLVTDKLLLESGLLDYNPAVRPIESS

XP_022159063.1 uncharacterized protein LOC111025502, partial [Momordica charantia]2.9e-14879.67Show/hide
Query:  MSSSFSSNLGSDEDLARRLESELEEIENFRFSDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRGFAIPENILLRIPEEGERADNPPEGWVTLYFKMFEYG
        MSSS SSNL  + DLARRLES+LEEIEN R SDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRGFAIPENILLR+PEEGERADNPPEGWVTLYFKMFEYG
Subjt:  MSSSFSSNLGSDEDLARRLESELEEIENFRFSDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRGFAIPENILLRIPEEGERADNPPEGWVTLYFKMFEYG

Query:  LRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKSGRFYMCARKGANGIVKGPTSIKGWVRKLFYA
        LRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAEL DVDQLLACFEAKRIAKK GRFYMCARKGA GIVKGPTSIKGWVRK FYA
Subjt:  LRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKSGRFYMCARKGANGIVKGPTSIKGWVRKLFYA

Query:  SGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQAYFDTLKYYKERFPRGRKVGTLVTDKLLLESGLLDYNPAVRPIESS---SHGLRICKQREAQVQG
        SGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQA FDTLKYYKERFPRGRKVGTLVTD+LLLESGLLDYNPAVRPIESS   S    +C        G
Subjt:  SGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQAYFDTLKYYKERFPRGRKVGTLVTDKLLLESGLLDYNPAVRPIESS---SHGLRICKQREAQVQG

Query:  PSPCSRGHPEFETYHPCCGRASHGRSSPS--DRAGVF-----WGSFEGEAPRDQTEAVD
            S+G             A+     P+  D A V       G    + PRDQTEAVD
Subjt:  PSPCSRGHPEFETYHPCCGRASHGRSSPS--DRAGVF-----WGSFEGEAPRDQTEAVD

XP_022159252.1 uncharacterized protein LOC111025665 [Momordica charantia]3.1e-14264.86Show/hide
Query:  MCARKGANGIVKGPTSIKGWVRKLFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQAYFDTLKYYKERFPRGRKVGTLVTDKLLLESGLLDYNP
        MCARKG  GIVKGPTSIKGWV K F+ASGEWLAKDESGR+FFDVPTRFGNLVSI+ +PEL QA FDTLK+YK+ FPR RK+ TLVTDKLLLESGLLDYNP
Subjt:  MCARKGANGIVKGPTSIKGWVRKLFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQAYFDTLKYYKERFPRGRKVGTLVTDKLLLESGLLDYNP

Query:  AVRPIESS---SHGLRIC---KQREAQVQGPSPCSRGHPEFETYHPCCGRA-SHGRSSPSDRAGV------FWGSFEGE-APRDQTEAVDVSPLGEEVRE
         VR IE+S   S    +C      + + +G +   +     E   P   R  + G S PS             G   GE   R+++EA+DVSPL  EVR 
Subjt:  AVRPIESS---SHGLRIC---KQREAQVQGPSPCSRGHPEFETYHPCCGRA-SHGRSSPSDRAGV------FWGSFEGE-APRDQTEAVDVSPLGEEVRE

Query:  EVPLKRRRKKKKTTPPLEVGARGVLPASFADRVDDPEARIGGTSDVTARYRVQPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQGTIDYAAEAF
        E PL+RRRKKKKT+   E GARG LP S AD VDDPEAR+ GTS+V  R+ ++PSSSGV+DQVSRISA  LDR LRRASKFVSDPGSVLQ TID  AEAF
Subjt:  EVPLKRRRKKKKTTPPLEVGARGVLPASFADRVDDPEARIGGTSDVTARYRVQPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQGTIDYAAEAF

Query:  VASIQSALAVKAELDGREVLAEREKEEFSAALEAASSTMKDELLKAHSEVEILKAEVETKAELLKKEEDKRKAQLRAAHAITKGLKKEKFQLLKEKDDML
        +ASI  A+ VKAELDGRE LA +E+E   AALEAA +T+K ELLKA  EV+IL+AEV+ K +LLKKE +K KA LRAAHAITKGL+KEKFQLLKEKDD+ 
Subjt:  VASIQSALAVKAELDGREVLAEREKEEFSAALEAASSTMKDELLKAHSEVEILKAEVETKAELLKKEEDKRKAQLRAAHAITKGLKKEKFQLLKEKDDML

Query:  QALEAKEEDLKHATAELETVKERLSNGALLEESFRQHPDFDGFAKDFSDAGFKFLMKGIAS
        Q LE K+  +   T EL+ +KERL+NG LLEESFRQHPDFDGFAKDFSDAGFKFLMKGIA+
Subjt:  QALEAKEEDLKHATAELETVKERLSNGALLEESFRQHPDFDGFAKDFSDAGFKFLMKGIAS

TrEMBL top hitse value%identityAlignment
A0A6J1CR42 uncharacterized protein LOC1110138261.4e-10073.12Show/hide
Query:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKSGRFYMCARKGANGIVKGPTSIKGWVR
        MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKK GRFYMCARKGA GIVKGPTSIKGWVR
Subjt:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKSGRFYMCARKGANGIVKGPTSIKGWVR

Query:  KLFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQAYFDTLKYYKERFPRGRKVGTLVTDKLLLESGLLDYNPAVRPIESS--------------
        K FYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQA FDTLKYYKERFPRGRKVGTLVTD+LLLESGLLDYNPAVRPIE S              
Subjt:  KLFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQAYFDTLKYYKERFPRGRKVGTLVTDKLLLESGLLDYNPAVRPIESS--------------

Query:  -----SHGLRICKQREAQVQGPSPCSRGHPEFETYHPCCGRASHGRSSPSDRAGVFWGSFEGEAPRDQTEAVDVSPLGE
             S G     +     + P+P   G P  E   P     S G  S   R        + EA   QTEA DV PLGE
Subjt:  -----SHGLRICKQREAQVQGPSPCSRGHPEFETYHPCCGRASHGRSSPSDRAGVFWGSFEGEAPRDQTEAVDVSPLGE

A0A6J1DWD2 uncharacterized protein LOC1110246801.2e-9997.31Show/hide
Query:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKSGRFYMCARKGANGIVKGPTSIKGWVR
        MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKK GRFYMCARKGA GIVKGPTSIKGWVR
Subjt:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKSGRFYMCARKGANGIVKGPTSIKGWVR

Query:  KLFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQAYFDTLKYYKERFPRGRKVGTLVTDKLLLESGLLDYNPAVRPIESS
        K FYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQA FDTLKYYKERFPRGRKVGTLVTD+LLLESGLLDYNPAVRPIESS
Subjt:  KLFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQAYFDTLKYYKERFPRGRKVGTLVTDKLLLESGLLDYNPAVRPIESS

A0A6J1DWF1 uncharacterized protein LOC1110251081.2e-9997.31Show/hide
Query:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKSGRFYMCARKGANGIVKGPTSIKGWVR
        MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKK GRFYMCARKGA+GIVKGPTSIKGWVR
Subjt:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKSGRFYMCARKGANGIVKGPTSIKGWVR

Query:  KLFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQAYFDTLKYYKERFPRGRKVGTLVTDKLLLESGLLDYNPAVRPIESS
        K FYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQA FDTLKYYKE FPRGRKVGTLVTDKLLLESGLLDYNPAVRPIESS
Subjt:  KLFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQAYFDTLKYYKERFPRGRKVGTLVTDKLLLESGLLDYNPAVRPIESS

A0A6J1DXS5 uncharacterized protein LOC1110255021.4e-14879.67Show/hide
Query:  MSSSFSSNLGSDEDLARRLESELEEIENFRFSDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRGFAIPENILLRIPEEGERADNPPEGWVTLYFKMFEYG
        MSSS SSNL  + DLARRLES+LEEIEN R SDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRGFAIPENILLR+PEEGERADNPPEGWVTLYFKMFEYG
Subjt:  MSSSFSSNLGSDEDLARRLESELEEIENFRFSDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRGFAIPENILLRIPEEGERADNPPEGWVTLYFKMFEYG

Query:  LRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKSGRFYMCARKGANGIVKGPTSIKGWVRKLFYA
        LRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAEL DVDQLLACFEAKRIAKK GRFYMCARKGA GIVKGPTSIKGWVRK FYA
Subjt:  LRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKSGRFYMCARKGANGIVKGPTSIKGWVRKLFYA

Query:  SGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQAYFDTLKYYKERFPRGRKVGTLVTDKLLLESGLLDYNPAVRPIESS---SHGLRICKQREAQVQG
        SGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQA FDTLKYYKERFPRGRKVGTLVTD+LLLESGLLDYNPAVRPIESS   S    +C        G
Subjt:  SGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQAYFDTLKYYKERFPRGRKVGTLVTDKLLLESGLLDYNPAVRPIESS---SHGLRICKQREAQVQG

Query:  PSPCSRGHPEFETYHPCCGRASHGRSSPS--DRAGVF-----WGSFEGEAPRDQTEAVD
            S+G             A+     P+  D A V       G    + PRDQTEAVD
Subjt:  PSPCSRGHPEFETYHPCCGRASHGRSSPS--DRAGVF-----WGSFEGEAPRDQTEAVD

A0A6J1DZB3 uncharacterized protein LOC1110256651.5e-14264.86Show/hide
Query:  MCARKGANGIVKGPTSIKGWVRKLFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQAYFDTLKYYKERFPRGRKVGTLVTDKLLLESGLLDYNP
        MCARKG  GIVKGPTSIKGWV K F+ASGEWLAKDESGR+FFDVPTRFGNLVSI+ +PEL QA FDTLK+YK+ FPR RK+ TLVTDKLLLESGLLDYNP
Subjt:  MCARKGANGIVKGPTSIKGWVRKLFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQAYFDTLKYYKERFPRGRKVGTLVTDKLLLESGLLDYNP

Query:  AVRPIESS---SHGLRIC---KQREAQVQGPSPCSRGHPEFETYHPCCGRA-SHGRSSPSDRAGV------FWGSFEGE-APRDQTEAVDVSPLGEEVRE
         VR IE+S   S    +C      + + +G +   +     E   P   R  + G S PS             G   GE   R+++EA+DVSPL  EVR 
Subjt:  AVRPIESS---SHGLRIC---KQREAQVQGPSPCSRGHPEFETYHPCCGRA-SHGRSSPSDRAGV------FWGSFEGE-APRDQTEAVDVSPLGEEVRE

Query:  EVPLKRRRKKKKTTPPLEVGARGVLPASFADRVDDPEARIGGTSDVTARYRVQPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQGTIDYAAEAF
        E PL+RRRKKKKT+   E GARG LP S AD VDDPEAR+ GTS+V  R+ ++PSSSGV+DQVSRISA  LDR LRRASKFVSDPGSVLQ TID  AEAF
Subjt:  EVPLKRRRKKKKTTPPLEVGARGVLPASFADRVDDPEARIGGTSDVTARYRVQPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQGTIDYAAEAF

Query:  VASIQSALAVKAELDGREVLAEREKEEFSAALEAASSTMKDELLKAHSEVEILKAEVETKAELLKKEEDKRKAQLRAAHAITKGLKKEKFQLLKEKDDML
        +ASI  A+ VKAELDGRE LA +E+E   AALEAA +T+K ELLKA  EV+IL+AEV+ K +LLKKE +K KA LRAAHAITKGL+KEKFQLLKEKDD+ 
Subjt:  VASIQSALAVKAELDGREVLAEREKEEFSAALEAASSTMKDELLKAHSEVEILKAEVETKAELLKKEEDKRKAQLRAAHAITKGLKKEKFQLLKEKDDML

Query:  QALEAKEEDLKHATAELETVKERLSNGALLEESFRQHPDFDGFAKDFSDAGFKFLMKGIAS
        Q LE K+  +   T EL+ +KERL+NG LLEESFRQHPDFDGFAKDFSDAGFKFLMKGIA+
Subjt:  QALEAKEEDLKHATAELETVKERLSNGALLEESFRQHPDFDGFAKDFSDAGFKFLMKGIAS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G42060.1 myosin heavy chain-related3.9e-0527.49Show/hide
Query:  SRIPEHYLGSLRRGFAIPENILLRIPEEGERADNPPEGWVTLYFKMF-EYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAE
        SR    + G        PE +   IPE  +R  + PEG++ L+   F E GL  PL  F+  +  R  +A +Q++         L IL       +EE  
Subjt:  SRIPEHYLGSLRRGFAIPENILLRIPEEGERADNPPEGWVTLYFKMF-EYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAE

Query:  LLDVDQLLACFEAKRIAKKSGRFYMCA--RKGANGIVKGPTS-IKGWVRKLFYASGEWLAKDESGRSFFDV
        ++D+D L     +  I  K+ R  +CA  R+G   I  G TS ++ W +  F+A    ++ D++  S  ++
Subjt:  LLDVDQLLACFEAKRIAKKSGRFYMCA--RKGANGIVKGPTS-IKGWVRKLFYASGEWLAKDESGRSFFDV

AT5G38190.1 INVOLVED IN: biological_process unknown1.6e-0625.14Show/hide
Query:  RFSDD-GEDSDASTSGQGLEY------PSRIPEHYLGSLRRGFAIPENILLRIPEEGERADNPPEGWVTLYFKMF-EYGLRLPLHPFVQEFLFRTGLAPA
        R++DD  E +D + SG+  +       P+      +G       +P  + +RIP + +R  + PEG++ L+   F E GLR P+  F+  F     +A +
Subjt:  RFSDD-GEDSDASTSGQGLEY------PSRIPEHYLGSLRRGFAIPENILLRIPEEGERADNPPEGWVTLYFKMF-EYGLRLPLHPFVQEFLFRTGLAPA

Query:  QVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKSGRFYMCARKGANGIVKGPTSIKGWVRKLFYA
        Q+       I   A L  L AR       L V+ +       ++  K G+ Y+ + +G   +   P+  + W+   FYA
Subjt:  QVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKSGRFYMCARKGANGIVKGPTSIKGWVRKLFYA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCGTCCTCTTTTAGCAGCAACTTAGGATCCGATGAGGATTTAGCTCGTAGGTTAGAGTCCGAGCTCGAGGAGATAGAAAACTTTAGGTTCTCCGATGACGGGGAGGA
TAGTGATGCCTCCACCTCGGGTCAGGGTTTGGAATACCCTTCTAGGATACCTGAGCACTACCTCGGATCCCTTCGTAGGGGGTTCGCTATCCCTGAGAACATCCTCCTTA
GGATTCCGGAGGAGGGGGAGAGAGCTGACAATCCTCCAGAGGGATGGGTCACTCTCTACTTCAAAATGTTTGAGTACGGCCTCAGACTTCCCCTTCACCCTTTCGTCCAA
GAGTTTCTTTTCCGAACTGGGCTGGCTCCGGCTCAAGTGGCCCCCAATGGGTGGGGTGTCATTTTCGCTTTGGCCATCCTTTTTTGGCTACGAGCTCGGGACAGTGAAGA
GGCCGAGTTGTTAGACGTAGACCAACTTCTCGCGTGCTTCGAAGCGAAAAGGATAGCTAAGAAGTCTGGTCGGTTCTATATGTGCGCAAGGAAAGGCGCAAACGGTATAG
TTAAGGGGCCGACCTCCATCAAGGGATGGGTGAGGAAGTTGTTCTACGCTTCTGGGGAATGGCTTGCAAAGGACGAGTCAGGTCGTTCCTTCTTTGACGTTCCCACTAGG
TTTGGGAACCTAGTTTCAATCCGACCAGTCCCCGAGCTTACGCAAGCCTACTTCGACACGCTGAAATATTACAAGGAGCGTTTTCCGAGGGGTAGGAAGGTCGGAACCTT
GGTGACCGATAAGCTGCTGCTTGAGTCCGGGCTGCTAGATTACAACCCCGCAGTTCGTCCCATTGAATCCTCAAGCCATGGTTTGCGGATTTGCAAGCAACGTGAAGCGC
AAGTCCAAGGGCCGAGCCCATGCTCTCGAGGCCACCCAGAGTTCGAAACCTACCACCCCTGCTGTGGTAGGGCCAGCCACGGAAGATCCAGCCCTAGTGATCGAGCGGGA
GTCTTCTGGGGGTCCTTCGAGGGAGAAGCGCCCAGGGATCAGACCGAGGCGGTGGACGTCTCGCCCTTGGGCGAGGAGGTGAGAGAGGAAGTCCCTCTGAAGCGAAGGAG
GAAGAAGAAGAAGACCACCCCCCCCTTGGAGGTCGGAGCTCGTGGGGTCTTGCCTGCGAGCTTCGCAGATCGGGTGGACGATCCTGAGGCCAGGATAGGTGGGACGTCCG
ATGTGACGGCACGGTACAGAGTTCAGCCGTCAAGTTCTGGGGTGAGGGACCAGGTGTCCCGCATCTCGGCCGCAAGTTTGGACCGCTGCCTAAGGAGGGCGTCCAAATTT
GTAAGTGACCCAGGGTCCGTTCTGCAGGGGACCATCGATTACGCCGCTGAGGCGTTCGTTGCTTCCATTCAATCGGCTCTGGCCGTGAAGGCCGAGCTGGATGGGAGGGA
AGTTCTGGCAGAGAGGGAGAAAGAGGAGTTCTCTGCTGCCTTGGAGGCTGCTTCCTCCACCATGAAGGATGAGCTGCTGAAGGCTCACTCTGAGGTGGAAATTTTGAAGG
CCGAGGTGGAGACCAAGGCCGAGCTGCTGAAGAAGGAAGAAGACAAACGCAAGGCCCAGCTCCGAGCTGCCCACGCTATCACCAAGGGCTTGAAGAAGGAGAAGTTCCAA
CTCCTCAAGGAGAAGGACGACATGCTCCAGGCGCTTGAAGCGAAGGAGGAAGATCTGAAGCATGCGACTGCCGAGCTGGAGACGGTGAAGGAGCGTCTCAGCAATGGAGC
CCTATTGGAGGAATCGTTTAGGCAACACCCTGACTTCGATGGATTTGCCAAAGACTTCTCTGACGCGGGCTTCAAGTTTCTCATGAAGGGCATTGCTTCGACATGCCTGA
CCTTCAGATCGATCTCGGTGGTCTGA
mRNA sequenceShow/hide mRNA sequence
ATGTCGTCCTCTTTTAGCAGCAACTTAGGATCCGATGAGGATTTAGCTCGTAGGTTAGAGTCCGAGCTCGAGGAGATAGAAAACTTTAGGTTCTCCGATGACGGGGAGGA
TAGTGATGCCTCCACCTCGGGTCAGGGTTTGGAATACCCTTCTAGGATACCTGAGCACTACCTCGGATCCCTTCGTAGGGGGTTCGCTATCCCTGAGAACATCCTCCTTA
GGATTCCGGAGGAGGGGGAGAGAGCTGACAATCCTCCAGAGGGATGGGTCACTCTCTACTTCAAAATGTTTGAGTACGGCCTCAGACTTCCCCTTCACCCTTTCGTCCAA
GAGTTTCTTTTCCGAACTGGGCTGGCTCCGGCTCAAGTGGCCCCCAATGGGTGGGGTGTCATTTTCGCTTTGGCCATCCTTTTTTGGCTACGAGCTCGGGACAGTGAAGA
GGCCGAGTTGTTAGACGTAGACCAACTTCTCGCGTGCTTCGAAGCGAAAAGGATAGCTAAGAAGTCTGGTCGGTTCTATATGTGCGCAAGGAAAGGCGCAAACGGTATAG
TTAAGGGGCCGACCTCCATCAAGGGATGGGTGAGGAAGTTGTTCTACGCTTCTGGGGAATGGCTTGCAAAGGACGAGTCAGGTCGTTCCTTCTTTGACGTTCCCACTAGG
TTTGGGAACCTAGTTTCAATCCGACCAGTCCCCGAGCTTACGCAAGCCTACTTCGACACGCTGAAATATTACAAGGAGCGTTTTCCGAGGGGTAGGAAGGTCGGAACCTT
GGTGACCGATAAGCTGCTGCTTGAGTCCGGGCTGCTAGATTACAACCCCGCAGTTCGTCCCATTGAATCCTCAAGCCATGGTTTGCGGATTTGCAAGCAACGTGAAGCGC
AAGTCCAAGGGCCGAGCCCATGCTCTCGAGGCCACCCAGAGTTCGAAACCTACCACCCCTGCTGTGGTAGGGCCAGCCACGGAAGATCCAGCCCTAGTGATCGAGCGGGA
GTCTTCTGGGGGTCCTTCGAGGGAGAAGCGCCCAGGGATCAGACCGAGGCGGTGGACGTCTCGCCCTTGGGCGAGGAGGTGAGAGAGGAAGTCCCTCTGAAGCGAAGGAG
GAAGAAGAAGAAGACCACCCCCCCCTTGGAGGTCGGAGCTCGTGGGGTCTTGCCTGCGAGCTTCGCAGATCGGGTGGACGATCCTGAGGCCAGGATAGGTGGGACGTCCG
ATGTGACGGCACGGTACAGAGTTCAGCCGTCAAGTTCTGGGGTGAGGGACCAGGTGTCCCGCATCTCGGCCGCAAGTTTGGACCGCTGCCTAAGGAGGGCGTCCAAATTT
GTAAGTGACCCAGGGTCCGTTCTGCAGGGGACCATCGATTACGCCGCTGAGGCGTTCGTTGCTTCCATTCAATCGGCTCTGGCCGTGAAGGCCGAGCTGGATGGGAGGGA
AGTTCTGGCAGAGAGGGAGAAAGAGGAGTTCTCTGCTGCCTTGGAGGCTGCTTCCTCCACCATGAAGGATGAGCTGCTGAAGGCTCACTCTGAGGTGGAAATTTTGAAGG
CCGAGGTGGAGACCAAGGCCGAGCTGCTGAAGAAGGAAGAAGACAAACGCAAGGCCCAGCTCCGAGCTGCCCACGCTATCACCAAGGGCTTGAAGAAGGAGAAGTTCCAA
CTCCTCAAGGAGAAGGACGACATGCTCCAGGCGCTTGAAGCGAAGGAGGAAGATCTGAAGCATGCGACTGCCGAGCTGGAGACGGTGAAGGAGCGTCTCAGCAATGGAGC
CCTATTGGAGGAATCGTTTAGGCAACACCCTGACTTCGATGGATTTGCCAAAGACTTCTCTGACGCGGGCTTCAAGTTTCTCATGAAGGGCATTGCTTCGACATGCCTGA
CCTTCAGATCGATCTCGGTGGTCTGA
Protein sequenceShow/hide protein sequence
MSSSFSSNLGSDEDLARRLESELEEIENFRFSDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRGFAIPENILLRIPEEGERADNPPEGWVTLYFKMFEYGLRLPLHPFVQ
EFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKSGRFYMCARKGANGIVKGPTSIKGWVRKLFYASGEWLAKDESGRSFFDVPTR
FGNLVSIRPVPELTQAYFDTLKYYKERFPRGRKVGTLVTDKLLLESGLLDYNPAVRPIESSSHGLRICKQREAQVQGPSPCSRGHPEFETYHPCCGRASHGRSSPSDRAG
VFWGSFEGEAPRDQTEAVDVSPLGEEVREEVPLKRRRKKKKTTPPLEVGARGVLPASFADRVDDPEARIGGTSDVTARYRVQPSSSGVRDQVSRISAASLDRCLRRASKF
VSDPGSVLQGTIDYAAEAFVASIQSALAVKAELDGREVLAEREKEEFSAALEAASSTMKDELLKAHSEVEILKAEVETKAELLKKEEDKRKAQLRAAHAITKGLKKEKFQ
LLKEKDDMLQALEAKEEDLKHATAELETVKERLSNGALLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASTCLTFRSISVV