; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc01g14280 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc01g14280
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionUnknown protein
Genome locationchr1:8894383..8900142
RNA-Seq ExpressionMoc01g14280
SyntenyMoc01g14280
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022142326.1 uncharacterized protein LOC111012467 [Momordica charantia]9.2e-14974.11Show/hide
Query:  VSIKPIPELTQATFDILKYYKDNFPRGRKIGTLVTDTLLLESGLLDYNPLVRPIEASRPNSELAMVCGFTSSLKRKSKGRAHTLKTVQSSDPATPVVDQH
        +SIKPIPEL QATFD LK+YKDNFPRGRKIGTLVTD LLLESGLLDYNPLVRPIEASRPNSELAMVCGFTSS+KRKSKGRAH LK VQSSDP TP VDQ+
Subjt:  VSIKPIPELTQATFDILKYYKDNFPRGRKIGTLVTDTLLLESGLLDYNPLVRPIEASRPNSELAMVCGFTSSLKRKSKGRAHTLKTVQSSDPATPVVDQH

Query:  AAQDQAGPSSEVPTPVIELDSTRERSKKKRSRSESEALDVSPLREAFTASIHSTIMIKAELDGREALTAKERANSSATLEAATTLKGELLKARGEVDVLR
        AAQDQAGPSS  PTPVIELDST ERS++KRSRSESEALDVSPLRE                                                       
Subjt:  AAQDQAGPSSEVPTPVIELDSTRERSKKKRSRSESEALDVSPLREAFTASIHSTIMIKAELDGREALTAKERANSSATLEAATTLKGELLKARGEVDVLR

Query:  AEVEAKAELLKREDEKHKAHLQAAHAITKGLEKEKFQLLKEKDDMLQAFEGKDATIDRLTAELKAEKERLSNGTLLEAAFRQHPDFDGFAKDFSDASFKF
           EAKAELLKREDE+HKAHL+AAHAITKGLEKEKFQLLKEKDDMLQA E KDA I RL AELKAEKERL+NG LLEAAFRQHPDFDGFAKDFSDA FKF
Subjt:  AEVEAKAELLKREDEKHKAHLQAAHAITKGLEKEKFQLLKEKDDMLQAFEGKDATIDRLTAELKAEKERLSNGTLLEAAFRQHPDFDGFAKDFSDASFKF

Query:  LMKGIAADMPHLQLDLDNLKKRYAEKWASGPNDTLGPQSLVDKYVRELDSDYSNVEEEDAPSQEPTEVGTTQEEAPSQQGGSQEVNLLGSQGEL
        LMKGIAAD+PHL++DL +LKKRYAEKWASGPN T GP SLVDKYVR+LDSDYS+++E++ PSQEPTEVGTTQE  PSQQ GSQEVNLLGSQGEL
Subjt:  LMKGIAADMPHLQLDLDNLKKRYAEKWASGPNDTLGPQSLVDKYVRELDSDYSNVEEEDAPSQEPTEVGTTQEEAPSQQGGSQEVNLLGSQGEL

XP_022147182.1 uncharacterized protein LOC111016193 [Momordica charantia]5.1e-9981.86Show/hide
Query:  EAFTASIHSTIMIKAELDGREALTAKERANSSATLEAATTLKGELLKARGEVDVLRAEVEAKAELLKREDEKHKAHLQAAHAITKGLEKEKFQLLKEKDD
        EAF ASIHS +M+KAELDGREALTAKER N S TLEAATTLKGELLKA+GEVD+LRAEV+AK +LLK+E EKHKAHL+AAHAITKGLEKEKFQLLKEKDD
Subjt:  EAFTASIHSTIMIKAELDGREALTAKERANSSATLEAATTLKGELLKARGEVDVLRAEVEAKAELLKREDEKHKAHLQAAHAITKGLEKEKFQLLKEKDD

Query:  MLQAFEGKDATIDRLTAELKAEKERLSNGTLLEAAFRQHPDFDGFAKDFSDASFKFLMKGIAADMPHLQLDLDNLKKRYAEKWASGPNDTLGPQSLVDKY
        + Q  E KDA+I RLT ELK  KERL++G LLE +FRQHP+FDGFAKDFSDA FKFLMKGIAADMPHLQ+DL +LKKRY+E WASGPN T GPQSLVDKY
Subjt:  MLQAFEGKDATIDRLTAELKAEKERLSNGTLLEAAFRQHPDFDGFAKDFSDASFKFLMKGIAADMPHLQLDLDNLKKRYAEKWASGPNDTLGPQSLVDKY

Query:  VRELDSDYSNVEEEDAPSQEPTEVGTTQEEAPSQQGG
        VRELDSDYS++EEEDAPSQEPT+VGTTQEEAPSQ GG
Subjt:  VRELDSDYSNVEEEDAPSQEPTEVGTTQEEAPSQQGG

XP_022152119.1 uncharacterized protein LOC111019909 [Momordica charantia]7.4e-10683.2Show/hide
Query:  EAFTASIHSTIMIKAELDGREALTAKERANSSATLEAATTLKGELLKARGEVDVLRAEVEAKAELLKREDEKHKAHLQAAHAITKGLEKEKFQLLKEKDD
        EAF ASIHS IM+KAELDGREAL AKER NSSA LEAATTLKGELLKA+GEV +LRAEV+AKAELLK+E EKHKAHL+AAHAITKGLEKEKFQLLKEKDD
Subjt:  EAFTASIHSTIMIKAELDGREALTAKERANSSATLEAATTLKGELLKARGEVDVLRAEVEAKAELLKREDEKHKAHLQAAHAITKGLEKEKFQLLKEKDD

Query:  MLQAFEGKDATIDRLTAELKAEKERLSNGTLLEAAFRQHPDFDGFAKDFSDASFKFLMKGIAADMPHLQLDLDNLKKRYAEKWASGPNDTLGPQSLVDKY
        + Q  EGKD +I RLTAELK  KERL+NG+LLE +FRQH DFDGFAKDFSDA FKFLMKGIAADMPHLQ+DL NLKK+Y+EKWASGPN T GPQSLV KY
Subjt:  MLQAFEGKDATIDRLTAELKAEKERLSNGTLLEAAFRQHPDFDGFAKDFSDASFKFLMKGIAADMPHLQLDLDNLKKRYAEKWASGPNDTLGPQSLVDKY

Query:  VRELDSDYSNVEEEDAPSQEPTEVGTTQEEAPSQQGGSQEVNLLGSQGEL
        VRELDSDYS++EEEDAPSQEP E+GTTQEE PSQQ GSQEVNLLGS+GEL
Subjt:  VRELDSDYSNVEEEDAPSQEPTEVGTTQEEAPSQQGGSQEVNLLGSQGEL

XP_022158409.1 uncharacterized protein LOC111024898 [Momordica charantia]2.2e-9478.66Show/hide
Query:  MIKAELDGREALTAKERANSSATLEAATTLKGELLKARGEVDVLRAEVEAKAELLKREDEKHKAHLQAAHAITKGLEKEKFQLLKEKDDMLQAFEGKDAT
        MIKAELDGREAL AKE+ NS A LEAATT+K ELLKAR EV +L+A+V+ KAE+LK+E EKHKAHL AAHAITK +EKEKFQLLKEKDD+ QA E  DA 
Subjt:  MIKAELDGREALTAKERANSSATLEAATTLKGELLKARGEVDVLRAEVEAKAELLKREDEKHKAHLQAAHAITKGLEKEKFQLLKEKDDMLQAFEGKDAT

Query:  IDRLTAELKAEKERLSNGTLLEAAFRQHPDFDGFAKDFSDASFKFLMKGIAADMPHLQLDLDNLKKRYAEKWASGPNDTLGPQSLVDKYVRELDSDYSNV
        I RL+ ELK  KERL+NG LLE AF+QHPDFDGFAKDFSDA FKFLMKGIA DM HLQ+DL ++KK+Y+EKWASGPN T GPQSLVDKYVRELDSDYS+V
Subjt:  IDRLTAELKAEKERLSNGTLLEAAFRQHPDFDGFAKDFSDASFKFLMKGIAADMPHLQLDLDNLKKRYAEKWASGPNDTLGPQSLVDKYVRELDSDYSNV

Query:  EEEDAPSQEPTEVGTTQEEAPSQQGGSQEVNLLGSQGEL
        EE DAPSQEP EVGTTQEE PSQ GGSQEVNLLGSQGEL
Subjt:  EEEDAPSQEPTEVGTTQEEAPSQQGGSQEVNLLGSQGEL

XP_022159252.1 uncharacterized protein LOC111025665 [Momordica charantia]7.2e-17865.92Show/hide
Query:  MCARKGAGDIVKGPTSIKGWVGKWFFASGEWLAKDESGRPFFDVPARFGNLVSIKPIPELTQATFDILKYYKDNFPRGRKIGTLVTDTLLLESGLLDYNP
        MCARKG G IVKGPTSIKGWVGKWFFASGEWLAKDESGR FFDVP RFGNLVSIK IPEL QATFD LK+YKD+FPR RKI TLVTD LLLESGLLDYNP
Subjt:  MCARKGAGDIVKGPTSIKGWVGKWFFASGEWLAKDESGRPFFDVPARFGNLVSIKPIPELTQATFDILKYYKDNFPRGRKIGTLVTDTLLLESGLLDYNP

Query:  LVRPIEASRPNSELAMVCGFTSSLKRKSKGRAHTLKTVQSSDPATPVVDQHAAQDQAGPSSEVPTPVIELDSTRERSKKKRSRSESEALDVSPLR-----
        LVR IEASRPNSELAMVCGFT S+KRKSKGRAH LKTV  ++P TP V +  AQ  +GPSS VPTPVIELD +  RS +KRSR ESEALDVSPL      
Subjt:  LVRPIEASRPNSELAMVCGFTSSLKRKSKGRAHTLKTVQSSDPATPVVDQHAAQDQAGPSSEVPTPVIELDSTRERSKKKRSRSESEALDVSPLR-----

Query:  ------------------------------------------------------------------------------------------------EAFT
                                                                                                        EAF 
Subjt:  ------------------------------------------------------------------------------------------------EAFT

Query:  ASIHSTIMIKAELDGREALTAKERANSSATLEAATTLKGELLKARGEVDVLRAEVEAKAELLKREDEKHKAHLQAAHAITKGLEKEKFQLLKEKDDMLQA
        ASIH  +M+KAELDGREAL AKER NS A LEAATTLKGELLKA+GEVD+LRAEV+AK +LLK+E EKHKAHL+AAHAITKGLEKEKFQLLKEKDD+ Q 
Subjt:  ASIHSTIMIKAELDGREALTAKERANSSATLEAATTLKGELLKARGEVDVLRAEVEAKAELLKREDEKHKAHLQAAHAITKGLEKEKFQLLKEKDDMLQA

Query:  FEGKDATIDRLTAELKAEKERLSNGTLLEAAFRQHPDFDGFAKDFSDASFKFLMKGIAADMPHLQLDLDNLKKRYAEKWASGPNDTLGPQSLVDKYVREL
         E KDA+I RLT ELK  KERL+NGTLLE +FRQHPDFDGFAKDFSDA FKFLMKGIAADMPHLQ+DL+ LKK+Y+EKWASGPN T  PQSLVDKYVREL
Subjt:  FEGKDATIDRLTAELKAEKERLSNGTLLEAAFRQHPDFDGFAKDFSDASFKFLMKGIAADMPHLQLDLDNLKKRYAEKWASGPNDTLGPQSLVDKYVREL

Query:  DSDYSNVEEEDAPSQEPTEVGTTQEEAPSQQGGS
        DSDYS++EEEDAPSQEP EVGTTQEE PSQQGGS
Subjt:  DSDYSNVEEEDAPSQEPTEVGTTQEEAPSQQGGS

TrEMBL top hitse value%identityAlignment
A0A6J1CLV1 uncharacterized protein LOC1110124674.5e-14974.11Show/hide
Query:  VSIKPIPELTQATFDILKYYKDNFPRGRKIGTLVTDTLLLESGLLDYNPLVRPIEASRPNSELAMVCGFTSSLKRKSKGRAHTLKTVQSSDPATPVVDQH
        +SIKPIPEL QATFD LK+YKDNFPRGRKIGTLVTD LLLESGLLDYNPLVRPIEASRPNSELAMVCGFTSS+KRKSKGRAH LK VQSSDP TP VDQ+
Subjt:  VSIKPIPELTQATFDILKYYKDNFPRGRKIGTLVTDTLLLESGLLDYNPLVRPIEASRPNSELAMVCGFTSSLKRKSKGRAHTLKTVQSSDPATPVVDQH

Query:  AAQDQAGPSSEVPTPVIELDSTRERSKKKRSRSESEALDVSPLREAFTASIHSTIMIKAELDGREALTAKERANSSATLEAATTLKGELLKARGEVDVLR
        AAQDQAGPSS  PTPVIELDST ERS++KRSRSESEALDVSPLRE                                                       
Subjt:  AAQDQAGPSSEVPTPVIELDSTRERSKKKRSRSESEALDVSPLREAFTASIHSTIMIKAELDGREALTAKERANSSATLEAATTLKGELLKARGEVDVLR

Query:  AEVEAKAELLKREDEKHKAHLQAAHAITKGLEKEKFQLLKEKDDMLQAFEGKDATIDRLTAELKAEKERLSNGTLLEAAFRQHPDFDGFAKDFSDASFKF
           EAKAELLKREDE+HKAHL+AAHAITKGLEKEKFQLLKEKDDMLQA E KDA I RL AELKAEKERL+NG LLEAAFRQHPDFDGFAKDFSDA FKF
Subjt:  AEVEAKAELLKREDEKHKAHLQAAHAITKGLEKEKFQLLKEKDDMLQAFEGKDATIDRLTAELKAEKERLSNGTLLEAAFRQHPDFDGFAKDFSDASFKF

Query:  LMKGIAADMPHLQLDLDNLKKRYAEKWASGPNDTLGPQSLVDKYVRELDSDYSNVEEEDAPSQEPTEVGTTQEEAPSQQGGSQEVNLLGSQGEL
        LMKGIAAD+PHL++DL +LKKRYAEKWASGPN T GP SLVDKYVR+LDSDYS+++E++ PSQEPTEVGTTQE  PSQQ GSQEVNLLGSQGEL
Subjt:  LMKGIAADMPHLQLDLDNLKKRYAEKWASGPNDTLGPQSLVDKYVRELDSDYSNVEEEDAPSQEPTEVGTTQEEAPSQQGGSQEVNLLGSQGEL

A0A6J1D1N9 uncharacterized protein LOC1110161932.5e-9981.86Show/hide
Query:  EAFTASIHSTIMIKAELDGREALTAKERANSSATLEAATTLKGELLKARGEVDVLRAEVEAKAELLKREDEKHKAHLQAAHAITKGLEKEKFQLLKEKDD
        EAF ASIHS +M+KAELDGREALTAKER N S TLEAATTLKGELLKA+GEVD+LRAEV+AK +LLK+E EKHKAHL+AAHAITKGLEKEKFQLLKEKDD
Subjt:  EAFTASIHSTIMIKAELDGREALTAKERANSSATLEAATTLKGELLKARGEVDVLRAEVEAKAELLKREDEKHKAHLQAAHAITKGLEKEKFQLLKEKDD

Query:  MLQAFEGKDATIDRLTAELKAEKERLSNGTLLEAAFRQHPDFDGFAKDFSDASFKFLMKGIAADMPHLQLDLDNLKKRYAEKWASGPNDTLGPQSLVDKY
        + Q  E KDA+I RLT ELK  KERL++G LLE +FRQHP+FDGFAKDFSDA FKFLMKGIAADMPHLQ+DL +LKKRY+E WASGPN T GPQSLVDKY
Subjt:  MLQAFEGKDATIDRLTAELKAEKERLSNGTLLEAAFRQHPDFDGFAKDFSDASFKFLMKGIAADMPHLQLDLDNLKKRYAEKWASGPNDTLGPQSLVDKY

Query:  VRELDSDYSNVEEEDAPSQEPTEVGTTQEEAPSQQGG
        VRELDSDYS++EEEDAPSQEPT+VGTTQEEAPSQ GG
Subjt:  VRELDSDYSNVEEEDAPSQEPTEVGTTQEEAPSQQGG

A0A6J1DF31 uncharacterized protein LOC1110199093.6e-10683.2Show/hide
Query:  EAFTASIHSTIMIKAELDGREALTAKERANSSATLEAATTLKGELLKARGEVDVLRAEVEAKAELLKREDEKHKAHLQAAHAITKGLEKEKFQLLKEKDD
        EAF ASIHS IM+KAELDGREAL AKER NSSA LEAATTLKGELLKA+GEV +LRAEV+AKAELLK+E EKHKAHL+AAHAITKGLEKEKFQLLKEKDD
Subjt:  EAFTASIHSTIMIKAELDGREALTAKERANSSATLEAATTLKGELLKARGEVDVLRAEVEAKAELLKREDEKHKAHLQAAHAITKGLEKEKFQLLKEKDD

Query:  MLQAFEGKDATIDRLTAELKAEKERLSNGTLLEAAFRQHPDFDGFAKDFSDASFKFLMKGIAADMPHLQLDLDNLKKRYAEKWASGPNDTLGPQSLVDKY
        + Q  EGKD +I RLTAELK  KERL+NG+LLE +FRQH DFDGFAKDFSDA FKFLMKGIAADMPHLQ+DL NLKK+Y+EKWASGPN T GPQSLV KY
Subjt:  MLQAFEGKDATIDRLTAELKAEKERLSNGTLLEAAFRQHPDFDGFAKDFSDASFKFLMKGIAADMPHLQLDLDNLKKRYAEKWASGPNDTLGPQSLVDKY

Query:  VRELDSDYSNVEEEDAPSQEPTEVGTTQEEAPSQQGGSQEVNLLGSQGEL
        VRELDSDYS++EEEDAPSQEP E+GTTQEE PSQQ GSQEVNLLGS+GEL
Subjt:  VRELDSDYSNVEEEDAPSQEPTEVGTTQEEAPSQQGGSQEVNLLGSQGEL

A0A6J1DZB3 uncharacterized protein LOC1110256653.5e-17865.92Show/hide
Query:  MCARKGAGDIVKGPTSIKGWVGKWFFASGEWLAKDESGRPFFDVPARFGNLVSIKPIPELTQATFDILKYYKDNFPRGRKIGTLVTDTLLLESGLLDYNP
        MCARKG G IVKGPTSIKGWVGKWFFASGEWLAKDESGR FFDVP RFGNLVSIK IPEL QATFD LK+YKD+FPR RKI TLVTD LLLESGLLDYNP
Subjt:  MCARKGAGDIVKGPTSIKGWVGKWFFASGEWLAKDESGRPFFDVPARFGNLVSIKPIPELTQATFDILKYYKDNFPRGRKIGTLVTDTLLLESGLLDYNP

Query:  LVRPIEASRPNSELAMVCGFTSSLKRKSKGRAHTLKTVQSSDPATPVVDQHAAQDQAGPSSEVPTPVIELDSTRERSKKKRSRSESEALDVSPLR-----
        LVR IEASRPNSELAMVCGFT S+KRKSKGRAH LKTV  ++P TP V +  AQ  +GPSS VPTPVIELD +  RS +KRSR ESEALDVSPL      
Subjt:  LVRPIEASRPNSELAMVCGFTSSLKRKSKGRAHTLKTVQSSDPATPVVDQHAAQDQAGPSSEVPTPVIELDSTRERSKKKRSRSESEALDVSPLR-----

Query:  ------------------------------------------------------------------------------------------------EAFT
                                                                                                        EAF 
Subjt:  ------------------------------------------------------------------------------------------------EAFT

Query:  ASIHSTIMIKAELDGREALTAKERANSSATLEAATTLKGELLKARGEVDVLRAEVEAKAELLKREDEKHKAHLQAAHAITKGLEKEKFQLLKEKDDMLQA
        ASIH  +M+KAELDGREAL AKER NS A LEAATTLKGELLKA+GEVD+LRAEV+AK +LLK+E EKHKAHL+AAHAITKGLEKEKFQLLKEKDD+ Q 
Subjt:  ASIHSTIMIKAELDGREALTAKERANSSATLEAATTLKGELLKARGEVDVLRAEVEAKAELLKREDEKHKAHLQAAHAITKGLEKEKFQLLKEKDDMLQA

Query:  FEGKDATIDRLTAELKAEKERLSNGTLLEAAFRQHPDFDGFAKDFSDASFKFLMKGIAADMPHLQLDLDNLKKRYAEKWASGPNDTLGPQSLVDKYVREL
         E KDA+I RLT ELK  KERL+NGTLLE +FRQHPDFDGFAKDFSDA FKFLMKGIAADMPHLQ+DL+ LKK+Y+EKWASGPN T  PQSLVDKYVREL
Subjt:  FEGKDATIDRLTAELKAEKERLSNGTLLEAAFRQHPDFDGFAKDFSDASFKFLMKGIAADMPHLQLDLDNLKKRYAEKWASGPNDTLGPQSLVDKYVREL

Query:  DSDYSNVEEEDAPSQEPTEVGTTQEEAPSQQGGS
        DSDYS++EEEDAPSQEP EVGTTQEE PSQQGGS
Subjt:  DSDYSNVEEEDAPSQEPTEVGTTQEEAPSQQGGS

A0A6J1DZB5 uncharacterized protein LOC1110248981.1e-9478.66Show/hide
Query:  MIKAELDGREALTAKERANSSATLEAATTLKGELLKARGEVDVLRAEVEAKAELLKREDEKHKAHLQAAHAITKGLEKEKFQLLKEKDDMLQAFEGKDAT
        MIKAELDGREAL AKE+ NS A LEAATT+K ELLKAR EV +L+A+V+ KAE+LK+E EKHKAHL AAHAITK +EKEKFQLLKEKDD+ QA E  DA 
Subjt:  MIKAELDGREALTAKERANSSATLEAATTLKGELLKARGEVDVLRAEVEAKAELLKREDEKHKAHLQAAHAITKGLEKEKFQLLKEKDDMLQAFEGKDAT

Query:  IDRLTAELKAEKERLSNGTLLEAAFRQHPDFDGFAKDFSDASFKFLMKGIAADMPHLQLDLDNLKKRYAEKWASGPNDTLGPQSLVDKYVRELDSDYSNV
        I RL+ ELK  KERL+NG LLE AF+QHPDFDGFAKDFSDA FKFLMKGIA DM HLQ+DL ++KK+Y+EKWASGPN T GPQSLVDKYVRELDSDYS+V
Subjt:  IDRLTAELKAEKERLSNGTLLEAAFRQHPDFDGFAKDFSDASFKFLMKGIAADMPHLQLDLDNLKKRYAEKWASGPNDTLGPQSLVDKYVRELDSDYSNV

Query:  EEEDAPSQEPTEVGTTQEEAPSQQGGSQEVNLLGSQGEL
        EE DAPSQEP EVGTTQEE PSQ GGSQEVNLLGSQGEL
Subjt:  EEEDAPSQEPTEVGTTQEEAPSQQGGSQEVNLLGSQGEL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTGTGCGAGGAAGGGCGCAGGTGATATAGTCAAGGGGCCGACCTCCATCAAAGGATGGGTAGGTAAGTGGTTCTTTGCCTCAGGTGAATGGCTGGCAAAGGACGAGTC
AGGTCGTCCCTTCTTTGACGTGCCTGCTAGGTTTGGGAACCTAGTATCAATTAAGCCGATTCCCGAGCTCACTCAAGCCACCTTCGACATCCTCAAATACTACAAGGACA
ACTTCCCAAGGGGCCGGAAGATCGGGACCTTGGTCACAGACACGCTGCTGCTAGAATCAGGGCTATTGGACTACAATCCTTTAGTTCGTCCGATTGAAGCTTCAAGGCCA
AACTCTGAGCTCGCCATGGTGTGTGGATTCACGAGCAGCCTGAAACGCAAGTCTAAGGGCCGTGCTCACACCCTTAAGACAGTTCAGAGCTCTGATCCCGCTACCCCTGT
TGTGGATCAGCATGCAGCTCAGGACCAGGCGGGTCCATCTTCTGAAGTTCCAACTCCAGTGATCGAGTTGGATTCTACTAGGGAGCGCTCCAAGAAGAAGCGCTCGAGGA
GCGAGTCCGAAGCACTGGACGTGTCACCGCTTCGTGAGGCGTTCACTGCCTCCATCCACTCAACAATCATGATCAAGGCCGAGCTGGATGGAAGGGAGGCCTTGACAGCG
AAAGAGAGGGCGAACTCCTCTGCTACCTTGGAGGCTGCCACCACGCTCAAGGGCGAGCTGCTGAAGGCTCGGGGTGAGGTGGACGTACTGAGGGCCGAGGTAGAAGCCAA
GGCCGAACTGCTGAAAAGGGAGGATGAAAAGCATAAGGCCCACCTCCAAGCTGCCCACGCCATCACCAAAGGGCTGGAGAAGGAAAAGTTCCAACTCCTTAAGGAGAAGG
ACGACATGCTCCAGGCCTTCGAAGGGAAGGACGCTACGATCGACCGTCTCACTGCCGAGCTGAAGGCGGAAAAGGAGCGCCTTTCCAATGGAACTCTTCTGGAGGCAGCC
TTCAGGCAACACCCAGATTTTGATGGGTTTGCCAAGGACTTCAGCGATGCAAGCTTCAAGTTTCTGATGAAGGGCATTGCTGCCGATATGCCGCACCTCCAGCTCGACCT
CGACAATCTGAAGAAGAGGTATGCTGAGAAATGGGCTTCTGGGCCTAACGACACTCTTGGCCCCCAATCCCTGGTGGACAAGTACGTCAGGGAGCTTGACTCTGACTACT
CCAACGTGGAAGAAGAGGATGCCCCAAGCCAAGAGCCTACCGAGGTCGGCACAACTCAAGAGGAGGCTCCATCACAGCAGGGTGGATCCCAGGAGGTCAACCTTCTAGGC
TCCCAGGGCGAGCTTCCTCCCACCTCAGAAGTAGCTGAGGCTCTCTTTCCTTCTATCTTTATATTTCTTTTTGTAAGTCTTTGGGCAGAGCTGCAAGGAGTCACACACTT
GAGCGGAGGCAAGGAGAAGCCACGTCGGTACAACATTCCTTCTCGGAGTATGAACCGAGCTGCTCTTCGAGCCATCTTCTTTTGCTCCTTCGGATCTTGCGGTGGACTTC
CTTTGATGAACTCCACGATTGGGTCCATCCAAGTGGGTGATGGAGTATCAACCTCCATCACATCTGGCTCCAAGATTGAAGGAAGAAAACTGGCTTTTGCTTTTCCTCGG
GAGGTCGTCTTCACTGGTTTGCTCTTCCAGGGCACATACCGACGATCCTTTGAGCGCGGACGCGTAGCACTCCCTTGA
mRNA sequenceShow/hide mRNA sequence
ATGTGTGCGAGGAAGGGCGCAGGTGATATAGTCAAGGGGCCGACCTCCATCAAAGGATGGGTAGGTAAGTGGTTCTTTGCCTCAGGTGAATGGCTGGCAAAGGACGAGTC
AGGTCGTCCCTTCTTTGACGTGCCTGCTAGGTTTGGGAACCTAGTATCAATTAAGCCGATTCCCGAGCTCACTCAAGCCACCTTCGACATCCTCAAATACTACAAGGACA
ACTTCCCAAGGGGCCGGAAGATCGGGACCTTGGTCACAGACACGCTGCTGCTAGAATCAGGGCTATTGGACTACAATCCTTTAGTTCGTCCGATTGAAGCTTCAAGGCCA
AACTCTGAGCTCGCCATGGTGTGTGGATTCACGAGCAGCCTGAAACGCAAGTCTAAGGGCCGTGCTCACACCCTTAAGACAGTTCAGAGCTCTGATCCCGCTACCCCTGT
TGTGGATCAGCATGCAGCTCAGGACCAGGCGGGTCCATCTTCTGAAGTTCCAACTCCAGTGATCGAGTTGGATTCTACTAGGGAGCGCTCCAAGAAGAAGCGCTCGAGGA
GCGAGTCCGAAGCACTGGACGTGTCACCGCTTCGTGAGGCGTTCACTGCCTCCATCCACTCAACAATCATGATCAAGGCCGAGCTGGATGGAAGGGAGGCCTTGACAGCG
AAAGAGAGGGCGAACTCCTCTGCTACCTTGGAGGCTGCCACCACGCTCAAGGGCGAGCTGCTGAAGGCTCGGGGTGAGGTGGACGTACTGAGGGCCGAGGTAGAAGCCAA
GGCCGAACTGCTGAAAAGGGAGGATGAAAAGCATAAGGCCCACCTCCAAGCTGCCCACGCCATCACCAAAGGGCTGGAGAAGGAAAAGTTCCAACTCCTTAAGGAGAAGG
ACGACATGCTCCAGGCCTTCGAAGGGAAGGACGCTACGATCGACCGTCTCACTGCCGAGCTGAAGGCGGAAAAGGAGCGCCTTTCCAATGGAACTCTTCTGGAGGCAGCC
TTCAGGCAACACCCAGATTTTGATGGGTTTGCCAAGGACTTCAGCGATGCAAGCTTCAAGTTTCTGATGAAGGGCATTGCTGCCGATATGCCGCACCTCCAGCTCGACCT
CGACAATCTGAAGAAGAGGTATGCTGAGAAATGGGCTTCTGGGCCTAACGACACTCTTGGCCCCCAATCCCTGGTGGACAAGTACGTCAGGGAGCTTGACTCTGACTACT
CCAACGTGGAAGAAGAGGATGCCCCAAGCCAAGAGCCTACCGAGGTCGGCACAACTCAAGAGGAGGCTCCATCACAGCAGGGTGGATCCCAGGAGGTCAACCTTCTAGGC
TCCCAGGGCGAGCTTCCTCCCACCTCAGAAGTAGCTGAGGCTCTCTTTCCTTCTATCTTTATATTTCTTTTTGTAAGTCTTTGGGCAGAGCTGCAAGGAGTCACACACTT
GAGCGGAGGCAAGGAGAAGCCACGTCGGTACAACATTCCTTCTCGGAGTATGAACCGAGCTGCTCTTCGAGCCATCTTCTTTTGCTCCTTCGGATCTTGCGGTGGACTTC
CTTTGATGAACTCCACGATTGGGTCCATCCAAGTGGGTGATGGAGTATCAACCTCCATCACATCTGGCTCCAAGATTGAAGGAAGAAAACTGGCTTTTGCTTTTCCTCGG
GAGGTCGTCTTCACTGGTTTGCTCTTCCAGGGCACATACCGACGATCCTTTGAGCGCGGACGCGTAGCACTCCCTTGA
Protein sequenceShow/hide protein sequence
MCARKGAGDIVKGPTSIKGWVGKWFFASGEWLAKDESGRPFFDVPARFGNLVSIKPIPELTQATFDILKYYKDNFPRGRKIGTLVTDTLLLESGLLDYNPLVRPIEASRP
NSELAMVCGFTSSLKRKSKGRAHTLKTVQSSDPATPVVDQHAAQDQAGPSSEVPTPVIELDSTRERSKKKRSRSESEALDVSPLREAFTASIHSTIMIKAELDGREALTA
KERANSSATLEAATTLKGELLKARGEVDVLRAEVEAKAELLKREDEKHKAHLQAAHAITKGLEKEKFQLLKEKDDMLQAFEGKDATIDRLTAELKAEKERLSNGTLLEAA
FRQHPDFDGFAKDFSDASFKFLMKGIAADMPHLQLDLDNLKKRYAEKWASGPNDTLGPQSLVDKYVRELDSDYSNVEEEDAPSQEPTEVGTTQEEAPSQQGGSQEVNLLG
SQGELPPTSEVAEALFPSIFIFLFVSLWAELQGVTHLSGGKEKPRRYNIPSRSMNRAALRAIFFCSFGSCGGLPLMNSTIGSIQVGDGVSTSITSGSKIEGRKLAFAFPR
EVVFTGLLFQGTYRRSFERGRVALP