CuGenDBv2

Moc03g01410 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc03g01410
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Unknown protein
Genome location: chr3:1070713..1075054
RNA-Seq Expression: Moc03g01410
Synteny: Moc03g01410
Gene Ontology terms: GO:0016020 - membrane (cellular component)
InterPro domains: NA
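The genome location string in the record can be split into its parts programmatically. The following is a minimal sketch, not part of the database itself: the function names `parse_location` and `span_length` are hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which the page does not state.

```python
import re


def parse_location(location):
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)


def span_length(start, end):
    """Length of the span, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end = parse_location("chr3:1070713..1075054")
print(chrom, start, end, span_length(start, end))
# prints: chr3 1070713 1075054 4342
```

Under a 0-based, half-open convention the span would be one base shorter, so the convention should be confirmed before relying on the number.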


Homology
GenBank top hits (e-value, % identity, alignment)
XP_022138041.1 uncharacterized protein LOC111009298 [Momordica charantia]; e-value: 7.3e-76; identity: 64.75%
Query:  MCARKGACDIVKRQTSIKEWVKKWFFASGEWLAKDESGHPFFDVSARFENLVSIRPIPELTQASFNTLKYYKDRFPKGRKIGTLVTDKLLLDSGLLDYNP
        MCARKGAC IVK  TSIK WV+KWF+ASGEWLAKDES              V+IRP+PELTQASF+TLKYYK+ FP+GRK+GTLVTDKLLL+SGLLDYNP
Subjt:  MCARKGACDIVKRQTSIKEWVKKWFFASGEWLAKDESGHPFFDVSARFENLVSIRPIPELTQASFNTLKYYKDRFPKGRKIGTLVTDKLLLDSGLLDYNP

Query:  LARPIEASRPNSEFTMVCGFTGSVKCKSKGSAHAFKTVQSTEPATFAAARPAAQDNVGPSSEVPTPVIELDSAGEHSKGKRSMSEPEVLDVSPL-NEMRG
          RPIE+SRPNSE  MVCGF  +VK KSKG AHA +  QS++P T     PA    VGP+SE P PVIEL+S+   S+ KR   + E +DVSPL  E+R 
Subjt:  LARPIEASRPNSEFTMVCGFTGSVKCKSKGSAHAFKTVQSTEPATFAAARPAAQDNVGPSSEVPTPVIELDSAGEHSKGKRSMSEPEVLDVSPL-NEMRG

Query:  ESPLKRRKKKKKTTSFSEVGSRGALPTSHVDLVDDPEARMGGRP
        E PLKRR+KKKKTTS  EVG+RG LP S  D VDDPEARMGG P
Subjt:  ESPLKRRKKKKKTTSFSEVGSRGALPTSHVDLVDDPEARMGGRP

XP_022142326.1 uncharacterized protein LOC111012467 [Momordica charantia]; e-value: 5.9e-94; identity: 54.21%
Query:  DVSARFENLVSIRPIPELTQASFNTLKYYKDRFPKGRKIGTLVTDKLLLDSGLLDYNPLARPIEASRPNSEFTMVCGFTGSVKCKSKGSAHAFKTVQSTE
        + S    + +SI+PIPEL QA+F+TLK+YKD FP+GRKIGTLVTDKLLL+SGLLDYNPL RPIEASRPNSE  MVCGFT SVK KSKG AHA K VQS++
Subjt:  DVSARFENLVSIRPIPELTQASFNTLKYYKDRFPKGRKIGTLVTDKLLLDSGLLDYNPLARPIEASRPNSEFTMVCGFTGSVKCKSKGSAHAFKTVQSTE

Query:  PATFAAARPAAQDNVGPSSEVPTPVIELDSAGEHSKGKRSMSEPEVLDVSPLNEMRGESPLKRRKKKKKTTSFSEVGSRGALPTSHVDLVDDPEARMGGR
        P T A  + AAQD  GPSS  PTPVIELDS GE S+ KRS SE E LDVSPL E+R                                            
Subjt:  PATFAAARPAAQDNVGPSSEVPTPVIELDSAGEHSKGKRSMSEPEVLDVSPLNEMRGESPLKRRKKKKKTTSFSEVGSRGALPTSHVDLVDDPEARMGGR

Query:  PTYPCSSELNRQAPGEALAAKDNENSSAALEAATALKGELLKARSEVDILKAEVEAKAELLKKEDERHKSHLRAAHAITKGLEKEKFQLLKQKDDLAQVL
                                                              EAKAELLK+EDERHK+HLRAAHAITKGLEKEKFQLLK+KDD+ Q L
Subjt:  PTYPCSSELNRQAPGEALAAKDNENSSAALEAATALKGELLKARSEVDILKAEVEAKAELLKKEDERHKSHLRAAHAITKGLEKEKFQLLKQKDDLAQVL

Query:  EEKDASLGRLTAELKEVKE----------AFRQHPDFDGFAKDFSDAGFKFLMKGIIADMPHLQIDLSDLKKKYVEKCAS
        E KDA++GRL AELK  KE          AFRQHPDFDGFAKDFSDAGFKFLMKGI AD+PHL++DL DLKK+Y EK AS
Subjt:  EEKDASLGRLTAELKEVKE----------AFRQHPDFDGFAKDFSDAGFKFLMKGIIADMPHLQIDLSDLKKKYVEKCAS

XP_022144034.1 uncharacterized protein LOC111013826 [Momordica charantia]; e-value: 5.7e-89; identity: 64.41%
Query:  MFEYGLRLPLHPFVQEFLNRTGL---------------------LRPRDEDEAELLNVDQLLGCFEAKRIAKKPGRYYMCARKGACDIVKRQTSIKEWVK
        MFEYGLRLPLHPFVQEFL RTGL                     LR RD +EAELL+VDQLL CFEAKRIAKKPGR+YMCARKGA  IVK  TSIK WV+
Subjt:  MFEYGLRLPLHPFVQEFLNRTGL---------------------LRPRDEDEAELLNVDQLLGCFEAKRIAKKPGRYYMCARKGACDIVKRQTSIKEWVK

Query:  KWFFASGEWLAKDESGHPFFDVSARFENLVSIRPIPELTQASFNTLKYYKDRFPKGRKIGTLVTDKLLLDSGLLDYNPLARPIEASRPNSEFTMVCGFTG
        KWF+ASGEWLAKDESG  FFDV  RF NLVSIRP+PELTQASF+TLKYYK+RFP+GRK+GTLVTD+LLL+SGLLDYNP  RPIE SRPNS   MVC F  
Subjt:  KWFFASGEWLAKDESGHPFFDVSARFENLVSIRPIPELTQASFNTLKYYKDRFPKGRKIGTLVTDKLLLDSGLLDYNPLARPIEASRPNSEFTMVCGFTG

Query:  SVKCKSKGSAHAFKTVQSTEPATFAAARPAAQDNVGPSSEVPTPVIELDSAGEHSKGKRSMSEPEVL-------DVSPLNE
         VK KSKG AHA +  QS++P T     PA    VGP+SE P PVIEL+S+G  S+ KR   + E +       DV PL E
Subjt:  SVKCKSKGSAHAFKTVQSTEPATFAAARPAAQDNVGPSSEVPTPVIELDSAGEHSKGKRSMSEPEVL-------DVSPLNE

XP_022159063.1 uncharacterized protein LOC111025502, partial [Momordica charantia]; e-value: 4.3e-129; identity: 68.82%
Query:  SNDSGEDLSRRLESELEEVENFRFSDDGEDSDASTSGQGLEDPSKIPEHYLEPLRRGFKIPNDILLRIPEEEKRADNPPEGWVTLYLKMFEYGLRLPLHP
        S++   DL+RRLES+LEE+EN R SDDGEDSDASTSGQGLE PS+IPEHYL  LRRGF IP +ILLR+PEE +RADNPPEGWVTLY KMFEYGLRLPLHP
Subjt:  SNDSGEDLSRRLESELEEVENFRFSDDGEDSDASTSGQGLEDPSKIPEHYLEPLRRGFKIPNDILLRIPEEEKRADNPPEGWVTLYLKMFEYGLRLPLHP

Query:  FVQEFLNRTGL---------------------LRPRDEDEAELLNVDQLLGCFEAKRIAKKPGRYYMCARKGACDIVKRQTSIKEWVKKWFFASGEWLAK
        FVQEFL RTGL                     LR RD +EAEL +VDQLL CFEAKRIAKKPGR+YMCARKGA  IVK  TSIK WV+KWF+ASGEWLAK
Subjt:  FVQEFLNRTGL---------------------LRPRDEDEAELLNVDQLLGCFEAKRIAKKPGRYYMCARKGACDIVKRQTSIKEWVKKWFFASGEWLAK

Query:  DESGHPFFDVSARFENLVSIRPIPELTQASFNTLKYYKDRFPKGRKIGTLVTDKLLLDSGLLDYNPLARPIEASRPNSEFTMVCGFTGSVKCKSKGSAHA
        DESG  FFDV  RF NLVSIRP+PELTQASF+TLKYYK+RFP+GRK+GTLVTD+LLL+SGLLDYNP  RPIE+SRPNSE  MVCGF   VK KSKG AHA
Subjt:  DESGHPFFDVSARFENLVSIRPIPELTQASFNTLKYYKDRFPKGRKIGTLVTDKLLLDSGLLDYNPLARPIEASRPNSEFTMVCGFTGSVKCKSKGSAHA

Query:  FKTVQSTEPATFAAARPAAQDNVGPSSEVPTPVIELDSAGEHSKGKRSMSEPEVLD
         +  QS++PAT     PA    VGP+SE P  VIEL+S+G  S+ KR   + E +D
Subjt:  FKTVQSTEPATFAAARPAAQDNVGPSSEVPTPVIELDSAGEHSKGKRSMSEPEVLD

XP_022159252.1 uncharacterized protein LOC111025665 [Momordica charantia]; e-value: 8.7e-154; identity: 65.7%
Query:  MCARKGACDIVKRQTSIKEWVKKWFFASGEWLAKDESGHPFFDVSARFENLVSIRPIPELTQASFNTLKYYKDRFPKGRKIGTLVTDKLLLDSGLLDYNP
        MCARKG   IVK  TSIK WV KWFFASGEWLAKDESG  FFDV  RF NLVSI+ IPEL QA+F+TLK+YKD FP+ RKI TLVTDKLLL+SGLLDYNP
Subjt:  MCARKGACDIVKRQTSIKEWVKKWFFASGEWLAKDESGHPFFDVSARFENLVSIRPIPELTQASFNTLKYYKDRFPKGRKIGTLVTDKLLLDSGLLDYNP

Query:  LARPIEASRPNSEFTMVCGFTGSVKCKSKGSAHAFKTVQSTEPATFAAARPAAQDNVGPSSEVPTPVIELDSAGEHSKGKRSMSEPEVLDVSPLNEMRGE
        L R IEASRPNSE  MVCGFTGSVK KSKG AHA KTV  TEP T    R  AQ N GPSS VPTPVIELD +G  S  KRS  E E LDVSPLNE+RGE
Subjt:  LARPIEASRPNSEFTMVCGFTGSVKCKSKGSAHAFKTVQSTEPATFAAARPAAQDNVGPSSEVPTPVIELDSAGEHSKGKRSMSEPEVLDVSPLNEMRGE

Query:  SPLKRRKKKKKTTSFSEVGSRGALPTSHVDLVDDPEARMGG----------------------RPTYPCSSELNRQA------PG---------------
        SPL+RR+KKKKT+S SE G+RG LPTSH DLVDDPEARM G                      R +  C     R+A      PG               
Subjt:  SPLKRRKKKKKTTSFSEVGSRGALPTSHVDLVDDPEARMGG----------------------RPTYPCSSELNRQA------PG---------------

Query:  ----------------EALAAKDNENSSAALEAATALKGELLKARSEVDILKAEVEAKAELLKKEDERHKSHLRAAHAITKGLEKEKFQLLKQKDDLAQV
                        EALAAK+ ENS AALEAAT LKGELLKA+ EVDIL+AEV+AK +LLKKE E+HK+HLRAAHAITKGLEKEKFQLLK+KDDLAQV
Subjt:  ----------------EALAAKDNENSSAALEAATALKGELLKARSEVDILKAEVEAKAELLKKEDERHKSHLRAAHAITKGLEKEKFQLLKQKDDLAQV

Query:  LEEKDASLGRLTAELKEVK----------EAFRQHPDFDGFAKDFSDAGFKFLMKGIIADMPHLQIDLSDLKKKYVEKCAS
        LEEKDAS+GRLT ELK++K          E+FRQHPDFDGFAKDFSDAGFKFLMKGI ADMPHLQIDL+ LKKKY EK AS
Subjt:  LEEKDASLGRLTAELKEVK----------EAFRQHPDFDGFAKDFSDAGFKFLMKGIIADMPHLQIDLSDLKKKYVEKCAS

TrEMBL top hits (e-value, % identity, alignment)
A0A6J1C8K9 uncharacterized protein LOC111009298; e-value: 3.5e-76; identity: 64.75%
Query:  MCARKGACDIVKRQTSIKEWVKKWFFASGEWLAKDESGHPFFDVSARFENLVSIRPIPELTQASFNTLKYYKDRFPKGRKIGTLVTDKLLLDSGLLDYNP
        MCARKGAC IVK  TSIK WV+KWF+ASGEWLAKDES              V+IRP+PELTQASF+TLKYYK+ FP+GRK+GTLVTDKLLL+SGLLDYNP
Subjt:  MCARKGACDIVKRQTSIKEWVKKWFFASGEWLAKDESGHPFFDVSARFENLVSIRPIPELTQASFNTLKYYKDRFPKGRKIGTLVTDKLLLDSGLLDYNP

Query:  LARPIEASRPNSEFTMVCGFTGSVKCKSKGSAHAFKTVQSTEPATFAAARPAAQDNVGPSSEVPTPVIELDSAGEHSKGKRSMSEPEVLDVSPL-NEMRG
          RPIE+SRPNSE  MVCGF  +VK KSKG AHA +  QS++P T     PA    VGP+SE P PVIEL+S+   S+ KR   + E +DVSPL  E+R 
Subjt:  LARPIEASRPNSEFTMVCGFTGSVKCKSKGSAHAFKTVQSTEPATFAAARPAAQDNVGPSSEVPTPVIELDSAGEHSKGKRSMSEPEVLDVSPL-NEMRG

Query:  ESPLKRRKKKKKTTSFSEVGSRGALPTSHVDLVDDPEARMGGRP
        E PLKRR+KKKKTTS  EVG+RG LP S  D VDDPEARMGG P
Subjt:  ESPLKRRKKKKKTTSFSEVGSRGALPTSHVDLVDDPEARMGGRP

A0A6J1CLV1 uncharacterized protein LOC111012467; e-value: 2.9e-94; identity: 54.21%
Query:  DVSARFENLVSIRPIPELTQASFNTLKYYKDRFPKGRKIGTLVTDKLLLDSGLLDYNPLARPIEASRPNSEFTMVCGFTGSVKCKSKGSAHAFKTVQSTE
        + S    + +SI+PIPEL QA+F+TLK+YKD FP+GRKIGTLVTDKLLL+SGLLDYNPL RPIEASRPNSE  MVCGFT SVK KSKG AHA K VQS++
Subjt:  DVSARFENLVSIRPIPELTQASFNTLKYYKDRFPKGRKIGTLVTDKLLLDSGLLDYNPLARPIEASRPNSEFTMVCGFTGSVKCKSKGSAHAFKTVQSTE

Query:  PATFAAARPAAQDNVGPSSEVPTPVIELDSAGEHSKGKRSMSEPEVLDVSPLNEMRGESPLKRRKKKKKTTSFSEVGSRGALPTSHVDLVDDPEARMGGR
        P T A  + AAQD  GPSS  PTPVIELDS GE S+ KRS SE E LDVSPL E+R                                            
Subjt:  PATFAAARPAAQDNVGPSSEVPTPVIELDSAGEHSKGKRSMSEPEVLDVSPLNEMRGESPLKRRKKKKKTTSFSEVGSRGALPTSHVDLVDDPEARMGGR

Query:  PTYPCSSELNRQAPGEALAAKDNENSSAALEAATALKGELLKARSEVDILKAEVEAKAELLKKEDERHKSHLRAAHAITKGLEKEKFQLLKQKDDLAQVL
                                                              EAKAELLK+EDERHK+HLRAAHAITKGLEKEKFQLLK+KDD+ Q L
Subjt:  PTYPCSSELNRQAPGEALAAKDNENSSAALEAATALKGELLKARSEVDILKAEVEAKAELLKKEDERHKSHLRAAHAITKGLEKEKFQLLKQKDDLAQVL

Query:  EEKDASLGRLTAELKEVKE----------AFRQHPDFDGFAKDFSDAGFKFLMKGIIADMPHLQIDLSDLKKKYVEKCAS
        E KDA++GRL AELK  KE          AFRQHPDFDGFAKDFSDAGFKFLMKGI AD+PHL++DL DLKK+Y EK AS
Subjt:  EEKDASLGRLTAELKEVKE----------AFRQHPDFDGFAKDFSDAGFKFLMKGIIADMPHLQIDLSDLKKKYVEKCAS

A0A6J1CR42 uncharacterized protein LOC111013826; e-value: 2.8e-89; identity: 64.41%
Query:  MFEYGLRLPLHPFVQEFLNRTGL---------------------LRPRDEDEAELLNVDQLLGCFEAKRIAKKPGRYYMCARKGACDIVKRQTSIKEWVK
        MFEYGLRLPLHPFVQEFL RTGL                     LR RD +EAELL+VDQLL CFEAKRIAKKPGR+YMCARKGA  IVK  TSIK WV+
Subjt:  MFEYGLRLPLHPFVQEFLNRTGL---------------------LRPRDEDEAELLNVDQLLGCFEAKRIAKKPGRYYMCARKGACDIVKRQTSIKEWVK

Query:  KWFFASGEWLAKDESGHPFFDVSARFENLVSIRPIPELTQASFNTLKYYKDRFPKGRKIGTLVTDKLLLDSGLLDYNPLARPIEASRPNSEFTMVCGFTG
        KWF+ASGEWLAKDESG  FFDV  RF NLVSIRP+PELTQASF+TLKYYK+RFP+GRK+GTLVTD+LLL+SGLLDYNP  RPIE SRPNS   MVC F  
Subjt:  KWFFASGEWLAKDESGHPFFDVSARFENLVSIRPIPELTQASFNTLKYYKDRFPKGRKIGTLVTDKLLLDSGLLDYNPLARPIEASRPNSEFTMVCGFTG

Query:  SVKCKSKGSAHAFKTVQSTEPATFAAARPAAQDNVGPSSEVPTPVIELDSAGEHSKGKRSMSEPEVL-------DVSPLNE
         VK KSKG AHA +  QS++P T     PA    VGP+SE P PVIEL+S+G  S+ KR   + E +       DV PL E
Subjt:  SVKCKSKGSAHAFKTVQSTEPATFAAARPAAQDNVGPSSEVPTPVIELDSAGEHSKGKRSMSEPEVL-------DVSPLNE

A0A6J1DXS5 uncharacterized protein LOC111025502; e-value: 2.1e-129; identity: 68.82%
Query:  SNDSGEDLSRRLESELEEVENFRFSDDGEDSDASTSGQGLEDPSKIPEHYLEPLRRGFKIPNDILLRIPEEEKRADNPPEGWVTLYLKMFEYGLRLPLHP
        S++   DL+RRLES+LEE+EN R SDDGEDSDASTSGQGLE PS+IPEHYL  LRRGF IP +ILLR+PEE +RADNPPEGWVTLY KMFEYGLRLPLHP
Subjt:  SNDSGEDLSRRLESELEEVENFRFSDDGEDSDASTSGQGLEDPSKIPEHYLEPLRRGFKIPNDILLRIPEEEKRADNPPEGWVTLYLKMFEYGLRLPLHP

Query:  FVQEFLNRTGL---------------------LRPRDEDEAELLNVDQLLGCFEAKRIAKKPGRYYMCARKGACDIVKRQTSIKEWVKKWFFASGEWLAK
        FVQEFL RTGL                     LR RD +EAEL +VDQLL CFEAKRIAKKPGR+YMCARKGA  IVK  TSIK WV+KWF+ASGEWLAK
Subjt:  FVQEFLNRTGL---------------------LRPRDEDEAELLNVDQLLGCFEAKRIAKKPGRYYMCARKGACDIVKRQTSIKEWVKKWFFASGEWLAK

Query:  DESGHPFFDVSARFENLVSIRPIPELTQASFNTLKYYKDRFPKGRKIGTLVTDKLLLDSGLLDYNPLARPIEASRPNSEFTMVCGFTGSVKCKSKGSAHA
        DESG  FFDV  RF NLVSIRP+PELTQASF+TLKYYK+RFP+GRK+GTLVTD+LLL+SGLLDYNP  RPIE+SRPNSE  MVCGF   VK KSKG AHA
Subjt:  DESGHPFFDVSARFENLVSIRPIPELTQASFNTLKYYKDRFPKGRKIGTLVTDKLLLDSGLLDYNPLARPIEASRPNSEFTMVCGFTGSVKCKSKGSAHA

Query:  FKTVQSTEPATFAAARPAAQDNVGPSSEVPTPVIELDSAGEHSKGKRSMSEPEVLD
         +  QS++PAT     PA    VGP+SE P  VIEL+S+G  S+ KR   + E +D
Subjt:  FKTVQSTEPATFAAARPAAQDNVGPSSEVPTPVIELDSAGEHSKGKRSMSEPEVLD

A0A6J1DZB3 uncharacterized protein LOC111025665; e-value: 4.2e-154; identity: 65.7%
Query:  MCARKGACDIVKRQTSIKEWVKKWFFASGEWLAKDESGHPFFDVSARFENLVSIRPIPELTQASFNTLKYYKDRFPKGRKIGTLVTDKLLLDSGLLDYNP
        MCARKG   IVK  TSIK WV KWFFASGEWLAKDESG  FFDV  RF NLVSI+ IPEL QA+F+TLK+YKD FP+ RKI TLVTDKLLL+SGLLDYNP
Subjt:  MCARKGACDIVKRQTSIKEWVKKWFFASGEWLAKDESGHPFFDVSARFENLVSIRPIPELTQASFNTLKYYKDRFPKGRKIGTLVTDKLLLDSGLLDYNP

Query:  LARPIEASRPNSEFTMVCGFTGSVKCKSKGSAHAFKTVQSTEPATFAAARPAAQDNVGPSSEVPTPVIELDSAGEHSKGKRSMSEPEVLDVSPLNEMRGE
        L R IEASRPNSE  MVCGFTGSVK KSKG AHA KTV  TEP T    R  AQ N GPSS VPTPVIELD +G  S  KRS  E E LDVSPLNE+RGE
Subjt:  LARPIEASRPNSEFTMVCGFTGSVKCKSKGSAHAFKTVQSTEPATFAAARPAAQDNVGPSSEVPTPVIELDSAGEHSKGKRSMSEPEVLDVSPLNEMRGE

Query:  SPLKRRKKKKKTTSFSEVGSRGALPTSHVDLVDDPEARMGG----------------------RPTYPCSSELNRQA------PG---------------
        SPL+RR+KKKKT+S SE G+RG LPTSH DLVDDPEARM G                      R +  C     R+A      PG               
Subjt:  SPLKRRKKKKKTTSFSEVGSRGALPTSHVDLVDDPEARMGG----------------------RPTYPCSSELNRQA------PG---------------

Query:  ----------------EALAAKDNENSSAALEAATALKGELLKARSEVDILKAEVEAKAELLKKEDERHKSHLRAAHAITKGLEKEKFQLLKQKDDLAQV
                        EALAAK+ ENS AALEAAT LKGELLKA+ EVDIL+AEV+AK +LLKKE E+HK+HLRAAHAITKGLEKEKFQLLK+KDDLAQV
Subjt:  ----------------EALAAKDNENSSAALEAATALKGELLKARSEVDILKAEVEAKAELLKKEDERHKSHLRAAHAITKGLEKEKFQLLKQKDDLAQV

Query:  LEEKDASLGRLTAELKEVK----------EAFRQHPDFDGFAKDFSDAGFKFLMKGIIADMPHLQIDLSDLKKKYVEKCAS
        LEEKDAS+GRLT ELK++K          E+FRQHPDFDGFAKDFSDAGFKFLMKGI ADMPHLQIDL+ LKKKY EK AS
Subjt:  LEEKDASLGRLTAELKEVK----------EAFRQHPDFDGFAKDFSDAGFKFLMKGIIADMPHLQIDLSDLKKKYVEKCAS

SwissProt top hits (e-value, % identity, alignment)
No hits found
Arabidopsis top hits (e-value, % identity, alignment)
No hits found

Sequences
CDS sequence
ATGATCGAGAACACTCATTTGGTTCTCCTCGATTCATGTGCAGCCCGATCCGAACCCCCGCCCGAACTCGACCTTCGATCCAACGTTAGGCCTATAACGAAACACATGGG
TCCAGAACGGAAATATGACTGTTGTCGGAAGCGGAATACGTTTCGACCTGACAAGCTGTCGGAGCACTTGATTACCCCGGCGTTCGGAATTCCAAAGTCAAACCTTACGC
TTCTTGAATTTTTGGAGTTCGATCTGGAATCAGCTCGAACCCTCCATAGGTCAGTCATTCTTGTTTTAATCTATATTTCGAATATGGTAGTTTTCGCATCATCCCCTTCC
AATAGTGGTAGCATGGGTAGCGCAGGTCGGACCATAAGTAGTTCCCCTCCTAAATCTAATGACTCTGGGGAGGACTTATCTCGTAGGTTAGAATCCGAGCTGGAAGAGGT
AGAGAATTTTAGATTTTCTGATGATGGGGAAGATAGTGATGCTTCCACCTCGGGCCAGGGTTTGGAAGATCCGTCAAAAATACCCGAACACTATCTCGAACCCCTCCGTA
GGGGGTTTAAAATTCCAAATGACATCCTCCTTAGGATTCCGGAGGAAGAGAAAAGAGCTGACAACCCTCCAGAGGGGTGGGTCACTCTTTACCTAAAAATGTTTGAGTAC
GGCCTTAGACTCCCTCTTCACCCTTTTGTCCAGGAGTTCTTAAATCGAACTGGACTGTTGCGACCTCGGGACGAGGACGAGGCCGAGCTGCTGAATGTTGACCAACTCCT
TGGGTGCTTCGAGGCCAAGAGGATAGCTAAGAAGCCGGGTCGGTACTACATGTGCGCAAGGAAAGGCGCATGCGACATAGTTAAAAGGCAGACCTCCATAAAGGAATGGG
TGAAGAAGTGGTTCTTTGCCTCTGGGGAATGGCTGGCAAAGGACGAGTCAGGTCATCCCTTCTTTGATGTTTCCGCTAGGTTTGAAAACTTAGTGTCAATCAGGCCAATC
CCCGAACTCACTCAAGCATCCTTTAATACACTTAAGTATTACAAGGATCGCTTTCCGAAGGGCAGGAAGATCGGAACTCTAGTGACCGACAAGCTTCTCCTCGACTCTGG
GTTGTTGGATTACAACCCTTTAGCTCGTCCAATCGAAGCCTCAAGGCCAAACTCCGAGTTCACAATGGTGTGCGGTTTCACAGGTAGTGTGAAGTGTAAGTCCAAGGGTA
GTGCTCACGCCTTTAAGACCGTTCAAAGCACGGAGCCAGCAACTTTTGCTGCGGCTCGACCTGCGGCCCAAGACAACGTTGGGCCGTCTTCTGAAGTTCCCACTCCAGTG
ATCGAGCTGGATTCTGCCGGGGAACACTCCAAGGGAAAACGCTCAATGAGTGAGCCTGAGGTGCTAGACGTGTCTCCCCTGAACGAGATGAGGGGAGAATCTCCTCTGAA
AAGGAGAAAAAAGAAGAAGAAGACCACCTCCTTCTCGGAGGTGGGGTCTCGTGGTGCCCTGCCTACCAGCCACGTTGACCTGGTGGACGACCCCGAGGCGAGGATGGGGG
GACGTCCGACGTACCCATGCAGTTCCGAGTTGAACCGTCAAGCTCCAGGGGAGGCTTTGGCTGCGAAGGATAATGAGAACTCCTCTGCTGCCTTAGAGGCTGCCACCGCG
CTGAAGGGCGAGCTACTGAAGGCCCGGAGCGAGGTGGATATCTTGAAGGCCGAAGTGGAGGCCAAAGCTGAGCTGCTGAAGAAGGAGGATGAGAGACATAAGAGTCACCT
CCGAGCTGCCCACGCCATCACTAAAGGGCTGGAAAAGGAGAAGTTCCAGCTCTTGAAGCAGAAGGACGACCTGGCCCAAGTCCTTGAGGAGAAGGACGCTTCGCTAGGGC
GCCTTACCGCCGAGCTCAAGGAGGTGAAGGAAGCTTTCAGGCAACACCCAGACTTTGATGGGTTTGCCAAAGACTTTAGTGATGCGGGCTTCAAGTTCTTGATGAAGGGC
ATCATTGCCGACATGCCTCATCTTCAGATCGATCTCAGCGATCTGAAGAAGAAGTACGTTGAAAAATGCGCCTCTAGGGCCTAA
mRNA sequence
ATGATCGAGAACACTCATTTGGTTCTCCTCGATTCATGTGCAGCCCGATCCGAACCCCCGCCCGAACTCGACCTTCGATCCAACGTTAGGCCTATAACGAAACACATGGG
TCCAGAACGGAAATATGACTGTTGTCGGAAGCGGAATACGTTTCGACCTGACAAGCTGTCGGAGCACTTGATTACCCCGGCGTTCGGAATTCCAAAGTCAAACCTTACGC
TTCTTGAATTTTTGGAGTTCGATCTGGAATCAGCTCGAACCCTCCATAGGTCAGTCATTCTTGTTTTAATCTATATTTCGAATATGGTAGTTTTCGCATCATCCCCTTCC
AATAGTGGTAGCATGGGTAGCGCAGGTCGGACCATAAGTAGTTCCCCTCCTAAATCTAATGACTCTGGGGAGGACTTATCTCGTAGGTTAGAATCCGAGCTGGAAGAGGT
AGAGAATTTTAGATTTTCTGATGATGGGGAAGATAGTGATGCTTCCACCTCGGGCCAGGGTTTGGAAGATCCGTCAAAAATACCCGAACACTATCTCGAACCCCTCCGTA
GGGGGTTTAAAATTCCAAATGACATCCTCCTTAGGATTCCGGAGGAAGAGAAAAGAGCTGACAACCCTCCAGAGGGGTGGGTCACTCTTTACCTAAAAATGTTTGAGTAC
GGCCTTAGACTCCCTCTTCACCCTTTTGTCCAGGAGTTCTTAAATCGAACTGGACTGTTGCGACCTCGGGACGAGGACGAGGCCGAGCTGCTGAATGTTGACCAACTCCT
TGGGTGCTTCGAGGCCAAGAGGATAGCTAAGAAGCCGGGTCGGTACTACATGTGCGCAAGGAAAGGCGCATGCGACATAGTTAAAAGGCAGACCTCCATAAAGGAATGGG
TGAAGAAGTGGTTCTTTGCCTCTGGGGAATGGCTGGCAAAGGACGAGTCAGGTCATCCCTTCTTTGATGTTTCCGCTAGGTTTGAAAACTTAGTGTCAATCAGGCCAATC
CCCGAACTCACTCAAGCATCCTTTAATACACTTAAGTATTACAAGGATCGCTTTCCGAAGGGCAGGAAGATCGGAACTCTAGTGACCGACAAGCTTCTCCTCGACTCTGG
GTTGTTGGATTACAACCCTTTAGCTCGTCCAATCGAAGCCTCAAGGCCAAACTCCGAGTTCACAATGGTGTGCGGTTTCACAGGTAGTGTGAAGTGTAAGTCCAAGGGTA
GTGCTCACGCCTTTAAGACCGTTCAAAGCACGGAGCCAGCAACTTTTGCTGCGGCTCGACCTGCGGCCCAAGACAACGTTGGGCCGTCTTCTGAAGTTCCCACTCCAGTG
ATCGAGCTGGATTCTGCCGGGGAACACTCCAAGGGAAAACGCTCAATGAGTGAGCCTGAGGTGCTAGACGTGTCTCCCCTGAACGAGATGAGGGGAGAATCTCCTCTGAA
AAGGAGAAAAAAGAAGAAGAAGACCACCTCCTTCTCGGAGGTGGGGTCTCGTGGTGCCCTGCCTACCAGCCACGTTGACCTGGTGGACGACCCCGAGGCGAGGATGGGGG
GACGTCCGACGTACCCATGCAGTTCCGAGTTGAACCGTCAAGCTCCAGGGGAGGCTTTGGCTGCGAAGGATAATGAGAACTCCTCTGCTGCCTTAGAGGCTGCCACCGCG
CTGAAGGGCGAGCTACTGAAGGCCCGGAGCGAGGTGGATATCTTGAAGGCCGAAGTGGAGGCCAAAGCTGAGCTGCTGAAGAAGGAGGATGAGAGACATAAGAGTCACCT
CCGAGCTGCCCACGCCATCACTAAAGGGCTGGAAAAGGAGAAGTTCCAGCTCTTGAAGCAGAAGGACGACCTGGCCCAAGTCCTTGAGGAGAAGGACGCTTCGCTAGGGC
GCCTTACCGCCGAGCTCAAGGAGGTGAAGGAAGCTTTCAGGCAACACCCAGACTTTGATGGGTTTGCCAAAGACTTTAGTGATGCGGGCTTCAAGTTCTTGATGAAGGGC
ATCATTGCCGACATGCCTCATCTTCAGATCGATCTCAGCGATCTGAAGAAGAAGTACGTTGAAAAATGCGCCTCTAGGGCCTAA
Protein sequence
MIENTHLVLLDSCAARSEPPPELDLRSNVRPITKHMGPERKYDCCRKRNTFRPDKLSEHLITPAFGIPKSNLTLLEFLEFDLESARTLHRSVILVLIYISNMVVFASSPS
NSGSMGSAGRTISSSPPKSNDSGEDLSRRLESELEEVENFRFSDDGEDSDASTSGQGLEDPSKIPEHYLEPLRRGFKIPNDILLRIPEEEKRADNPPEGWVTLYLKMFEY
GLRLPLHPFVQEFLNRTGLLRPRDEDEAELLNVDQLLGCFEAKRIAKKPGRYYMCARKGACDIVKRQTSIKEWVKKWFFASGEWLAKDESGHPFFDVSARFENLVSIRPI
PELTQASFNTLKYYKDRFPKGRKIGTLVTDKLLLDSGLLDYNPLARPIEASRPNSEFTMVCGFTGSVKCKSKGSAHAFKTVQSTEPATFAAARPAAQDNVGPSSEVPTPV
IELDSAGEHSKGKRSMSEPEVLDVSPLNEMRGESPLKRRKKKKKTTSFSEVGSRGALPTSHVDLVDDPEARMGGRPTYPCSSELNRQAPGEALAAKDNENSSAALEAATA
LKGELLKARSEVDILKAEVEAKAELLKKEDERHKSHLRAAHAITKGLEKEKFQLLKQKDDLAQVLEEKDASLGRLTAELKEVKEAFRQHPDFDGFAKDFSDAGFKFLMKG
IIADMPHLQIDLSDLKKKYVEKCASRA
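
One way to sanity-check that the CDS and protein sequences above correspond is to translate the CDS and compare the result with the listed protein. The sketch below is illustrative only: the function name `translate` is hypothetical, it assumes the standard nuclear genetic code, and for brevity it translates just the first 30 and last 21 nucleotides of the CDS as copied from this record.

```python
# Standard genetic code in the compact TCAG ordering.
# ASSUMPTION: the gene uses the standard nuclear code.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna):
    """Translate DNA in frame 1; '*' marks a stop codon.

    Trailing bases that do not fill a whole codon are ignored.
    """
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


cds_start = "ATGATCGAGAACACTCATTTGGTTCTCCTC"  # first 30 nt of the CDS
cds_end = "AAATGCGCCTCTAGGGCCTAA"             # last 21 nt of the CDS

print(translate(cds_start))  # prints: MIENTHLVLL
print(translate(cds_end))    # prints: KCASRA*
```

Both fragments agree with the start (`MIENTHLVLL`) and end (`KCASRA`) of the protein sequence, with the final `TAA` read as the stop codon. Translating the full CDS the same way would give a complete check.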