CuGenDBv2

Moc07g18440 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc07g18440
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Unknown protein
Genome location: chr7:13165564..13168179
RNA-Seq Expression: Moc07g18440
Synteny: Moc07g18440
Gene Ontology terms: NA
InterPro domains: NA
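
The genome location field can be split into chromosome, start, and end for downstream use. The following is a minimal sketch, not part of the database itself: the helper name `parse_location` is hypothetical, and the span calculation assumes the coordinates are 1-based and inclusive, which this page does not state.

```python
import re


def parse_location(location):
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location format: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


chrom, start, end = parse_location("chr7:13165564..13168179")

# Assumption: 1-based inclusive coordinates, so the span is end - start + 1.
span = end - start + 1
print(chrom, start, end, span)
```

Under that assumption the gene spans 2616 bp on chr7.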


Homology
GenBank top hits | e value | % identity | Alignment
XP_022142326.1 uncharacterized protein LOC111012467 [Momordica charantia] | e value: 1.9e-105 | % identity: 48.57
Query:  PLNDVRGESPLKRRRKKKKTSSLSEVGARETLPASHADLVDDPAARMGGTFDVRTRFRMEPSSSGVKDQVSRISATCLDRCLKRASKFVSDLGSVLQRTI
        P + +  E+P KRR+KKK  SS SEVGA   LPA  AD VDDPAARMGGT DV  RFR+EPSSSGV+DQVSRISA  LDRCL+RASKFVS  GSVL R I
Subjt:  PLNDVRGESPLKRRRKKKKTSSLSEVGARETLPASHADLVDDPAARMGGTFDVRTRFRMEPSSSGVKDQVSRISATCLDRCLKRASKFVSDLGSVLQRTI

Query:  DNPAEAFVASIHSAIMVKAELDGREALAAKERENSSTALEAA-TTLKGELLKAQGEVGILRAEV------------------------------------
        D  AEAFVASI SA+ VKAELDGRE LAA+E+E  S ALEAA +T+K ELLKA  EV  L+AEV                                    
Subjt:  DNPAEAFVASIHSAIMVKAELDGREALAAKERENSSTALEAA-TTLKGELLKAQGEVGILRAEV------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ------------------------------------------------DAKAELLKKEGEKHKAHLRAAHAITKGLEKEKFQLLKEKDDLAQVLEEKDAS
                                                        +AKAELLK+E E+HKAHLRAAHAITKGLEKEKFQLLKEKDD+ Q LE KDA+
Subjt:  ------------------------------------------------DAKAELLKKEGEKHKAHLRAAHAITKGLEKEKFQLLKEKDDLAQVLEEKDAS

Query:  IGRLTAELKDLKERLTSGPLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIAADMPHLQIDLSDLKKKYSEKWASGPNGTPGPQPLVDKYVRELDSNYSDM
        IGRL AELK  KERLT+G LLE +FRQHPDFDGFAKDFSDAGFKFLMKGIAAD+PHL++DL DLKK+Y+EKWASGPNGT GP  LVDKYVR+LDS+YSD+
Subjt:  IGRLTAELKDLKERLTSGPLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIAADMPHLQIDLSDLKKKYSEKWASGPNGTPGPQPLVDKYVRELDSNYSDM

Query:  EEEDAPSQEPNEIGTTQEEVPSQ
        +E++ PSQEP E+GTTQE VPSQ
Subjt:  EEEDAPSQEPNEIGTTQEEVPSQ

XP_022147182.1 uncharacterized protein LOC111016193 [Momordica charantia] | e value: 3.0e-135 | % identity: 90.81
Query:  RFRMEPSSSGVKDQVSRISATCLDRCLKRASKFVSDLGSVLQRTIDNPAEAFVASIHSAIMVKAELDGREALAAKERENSSTALEAATTLKGELLKAQGE
        RFRME SSSGVKDQVSRISATCLDRCL+RAS+FVSD GSVLQRTIDN AEAF+ASIHSA+MVKAELDGREAL AKEREN ST LEAATTLKGELLKAQGE
Subjt:  RFRMEPSSSGVKDQVSRISATCLDRCLKRASKFVSDLGSVLQRTIDNPAEAFVASIHSAIMVKAELDGREALAAKERENSSTALEAATTLKGELLKAQGE

Query:  VGILRAEVDAKAELLKKEGEKHKAHLRAAHAITKGLEKEKFQLLKEKDDLAQVLEEKDASIGRLTAELKDLKERLTSGPLLEESFRQHPDFDGFAKDFSD
        V ILRAEVDAK +LLKKEGEKHKAHLRAAHAITKGLEKEKFQLLKEKDDLAQVLE+KDASIGRLT ELKDLKERLT G LLEESFRQHP+FDGFAKDFSD
Subjt:  VGILRAEVDAKAELLKKEGEKHKAHLRAAHAITKGLEKEKFQLLKEKDDLAQVLEEKDASIGRLTAELKDLKERLTSGPLLEESFRQHPDFDGFAKDFSD

Query:  AGFKFLMKGIAADMPHLQIDLSDLKKKYSEKWASGPNGTPGPQPLVDKYVRELDSNYSDMEEEDAPSQEPNEIGTTQEEVPSQ
        AGFKFLMKGIAADMPHLQIDLSDLKK+YSE WASGPNGTPGPQ LVDKYVRELDS+YSDMEEEDAPSQEP ++GTTQEE PSQ
Subjt:  AGFKFLMKGIAADMPHLQIDLSDLKKKYSEKWASGPNGTPGPQPLVDKYVRELDSNYSDMEEEDAPSQEPNEIGTTQEEVPSQ

XP_022152119.1 uncharacterized protein LOC111019909 [Momordica charantia] | e value: 1.4e-145 | % identity: 95.89
Query:  MGGTFDVRTRFRMEPSSSGVKDQVSRISATCLDRCLKRASKFVSDLGSVLQRTIDNPAEAFVASIHSAIMVKAELDGREALAAKERENSSTALEAATTLK
        MGGTFDVRTRFRMEPSSSGVKDQVSRISATCLDRCLKRASKFVSD GSVLQRTIDN AEAFVASIHSAIMVKAELDGREALAAKERENSS ALEAATTLK
Subjt:  MGGTFDVRTRFRMEPSSSGVKDQVSRISATCLDRCLKRASKFVSDLGSVLQRTIDNPAEAFVASIHSAIMVKAELDGREALAAKERENSSTALEAATTLK

Query:  GELLKAQGEVGILRAEVDAKAELLKKEGEKHKAHLRAAHAITKGLEKEKFQLLKEKDDLAQVLEEKDASIGRLTAELKDLKERLTSGPLLEESFRQHPDF
        GELLKAQGEVGILRAEVDAKAELLKKEGEKHKAHLRAAHAITKGLEKEKFQLLKEKDDLAQVLE KD SIGRLTAELKDLKERLT+G LLEESFRQH DF
Subjt:  GELLKAQGEVGILRAEVDAKAELLKKEGEKHKAHLRAAHAITKGLEKEKFQLLKEKDDLAQVLEEKDASIGRLTAELKDLKERLTSGPLLEESFRQHPDF

Query:  DGFAKDFSDAGFKFLMKGIAADMPHLQIDLSDLKKKYSEKWASGPNGTPGPQPLVDKYVRELDSNYSDMEEEDAPSQEPNEIGTTQEEVPSQ
        DGFAKDFSDAGFKFLMKGIAADMPHLQIDLS+LKKKYSEKWASGPNGTPGPQ LV KYVRELDS+YSDMEEEDAPSQEPNEIGTTQEEVPSQ
Subjt:  DGFAKDFSDAGFKFLMKGIAADMPHLQIDLSDLKKKYSEKWASGPNGTPGPQPLVDKYVRELDSNYSDMEEEDAPSQEPNEIGTTQEEVPSQ

XP_022158409.1 uncharacterized protein LOC111024898 [Momordica charantia] | e value: 8.4e-98 | % identity: 85.2
Query:  MVKAELDGREALAAKERENSSTALEAATTLKGELLKAQGEVGILRAEVDAKAELLKKEGEKHKAHLRAAHAITKGLEKEKFQLLKEKDDLAQVLEEKDAS
        M+KAELDGREALAAKE+ENS  ALEAATT+K ELLKA+ EVGIL+A+VD KAE+LKKEGEKHKAHL AAHAITK +EKEKFQLLKEKDDLAQ LEE DA 
Subjt:  MVKAELDGREALAAKERENSSTALEAATTLKGELLKAQGEVGILRAEVDAKAELLKKEGEKHKAHLRAAHAITKGLEKEKFQLLKEKDDLAQVLEEKDAS

Query:  IGRLTAELKDLKERLTSGPLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIAADMPHLQIDLSDLKKKYSEKWASGPNGTPGPQPLVDKYVRELDSNYSDM
        IGRL+ ELKD KERLT+G LLEE+F+QHPDFDGFAKDFSDAGFKFLMKGIA DM HLQIDLSD+KKKYSEKWASGPNGTPGPQ LVDKYVRELDS+YSD+
Subjt:  IGRLTAELKDLKERLTSGPLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIAADMPHLQIDLSDLKKKYSEKWASGPNGTPGPQPLVDKYVRELDSNYSDM

Query:  EEEDAPSQEPNEIGTTQEEVPSQ
        EE DAPSQEPNE+GTTQEEVPSQ
Subjt:  EEEDAPSQEPNEIGTTQEEVPSQ

XP_022159252.1 uncharacterized protein LOC111025665 [Momordica charantia] | e value: 3.2e-182 | % identity: 89.84
Query:  SVPRTEAQGNSGPSSAVPTPVIKLDLSGGRSEEKRPREESEALDVSPLNDVRGESPLKRRRKKKKTSSLSEVGARETLPASHADLVDDPAARMGGTFDVR
        +VPRT AQGNSGPSSAVPTPVI+LDLSGGRS EKR REESEALDVSPLN+VRGESPL+RRRKKKKTSS SE GAR TLP SHADLVDDP ARM GT +VR
Subjt:  SVPRTEAQGNSGPSSAVPTPVIKLDLSGGRSEEKRPREESEALDVSPLNDVRGESPLKRRRKKKKTSSLSEVGARETLPASHADLVDDPAARMGGTFDVR

Query:  TRFRMEPSSSGVKDQVSRISATCLDRCLKRASKFVSDLGSVLQRTIDNPAEAFVASIHSAIMVKAELDGREALAAKERENSSTALEAATTLKGELLKAQG
         RF MEPSSSGVKDQVSRISATCLDR L+RASKFVSD GSVLQRTIDN AEAF+ASIH A+MVKAELDGREALAAKERENS  ALEAATTLKGELLKAQG
Subjt:  TRFRMEPSSSGVKDQVSRISATCLDRCLKRASKFVSDLGSVLQRTIDNPAEAFVASIHSAIMVKAELDGREALAAKERENSSTALEAATTLKGELLKAQG

Query:  EVGILRAEVDAKAELLKKEGEKHKAHLRAAHAITKGLEKEKFQLLKEKDDLAQVLEEKDASIGRLTAELKDLKERLTSGPLLEESFRQHPDFDGFAKDFS
        EV ILRAEVDAK +LLKKEGEKHKAHLRAAHAITKGLEKEKFQLLKEKDDLAQVLEEKDASIGRLT ELKDLKERLT+G LLEESFRQHPDFDGFAKDFS
Subjt:  EVGILRAEVDAKAELLKKEGEKHKAHLRAAHAITKGLEKEKFQLLKEKDDLAQVLEEKDASIGRLTAELKDLKERLTSGPLLEESFRQHPDFDGFAKDFS

Query:  DAGFKFLMKGIAADMPHLQIDLSDLKKKYSEKWASGPNGTPGPQPLVDKYVRELDSNYSDMEEEDAPSQEPNEIGTTQEEVPSQ
        DAGFKFLMKGIAADMPHLQIDL+ LKKKYSEKWASGPNGTP PQ LVDKYVRELDS+YSDMEEEDAPSQEP E+GTTQEEVPSQ
Subjt:  DAGFKFLMKGIAADMPHLQIDLSDLKKKYSEKWASGPNGTPGPQPLVDKYVRELDSNYSDMEEEDAPSQEPNEIGTTQEEVPSQ

TrEMBL top hits | e value | % identity | Alignment
A0A6J1CLV1 uncharacterized protein LOC111012467 | e value: 9.1e-106 | % identity: 48.57
Query:  PLNDVRGESPLKRRRKKKKTSSLSEVGARETLPASHADLVDDPAARMGGTFDVRTRFRMEPSSSGVKDQVSRISATCLDRCLKRASKFVSDLGSVLQRTI
        P + +  E+P KRR+KKK  SS SEVGA   LPA  AD VDDPAARMGGT DV  RFR+EPSSSGV+DQVSRISA  LDRCL+RASKFVS  GSVL R I
Subjt:  PLNDVRGESPLKRRRKKKKTSSLSEVGARETLPASHADLVDDPAARMGGTFDVRTRFRMEPSSSGVKDQVSRISATCLDRCLKRASKFVSDLGSVLQRTI

Query:  DNPAEAFVASIHSAIMVKAELDGREALAAKERENSSTALEAA-TTLKGELLKAQGEVGILRAEV------------------------------------
        D  AEAFVASI SA+ VKAELDGRE LAA+E+E  S ALEAA +T+K ELLKA  EV  L+AEV                                    
Subjt:  DNPAEAFVASIHSAIMVKAELDGREALAAKERENSSTALEAA-TTLKGELLKAQGEVGILRAEV------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ------------------------------------------------DAKAELLKKEGEKHKAHLRAAHAITKGLEKEKFQLLKEKDDLAQVLEEKDAS
                                                        +AKAELLK+E E+HKAHLRAAHAITKGLEKEKFQLLKEKDD+ Q LE KDA+
Subjt:  ------------------------------------------------DAKAELLKKEGEKHKAHLRAAHAITKGLEKEKFQLLKEKDDLAQVLEEKDAS

Query:  IGRLTAELKDLKERLTSGPLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIAADMPHLQIDLSDLKKKYSEKWASGPNGTPGPQPLVDKYVRELDSNYSDM
        IGRL AELK  KERLT+G LLE +FRQHPDFDGFAKDFSDAGFKFLMKGIAAD+PHL++DL DLKK+Y+EKWASGPNGT GP  LVDKYVR+LDS+YSD+
Subjt:  IGRLTAELKDLKERLTSGPLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIAADMPHLQIDLSDLKKKYSEKWASGPNGTPGPQPLVDKYVRELDSNYSDM

Query:  EEEDAPSQEPNEIGTTQEEVPSQ
        +E++ PSQEP E+GTTQE VPSQ
Subjt:  EEEDAPSQEPNEIGTTQEEVPSQ

A0A6J1D1N9 uncharacterized protein LOC111016193 | e value: 1.4e-135 | % identity: 90.81
Query:  RFRMEPSSSGVKDQVSRISATCLDRCLKRASKFVSDLGSVLQRTIDNPAEAFVASIHSAIMVKAELDGREALAAKERENSSTALEAATTLKGELLKAQGE
        RFRME SSSGVKDQVSRISATCLDRCL+RAS+FVSD GSVLQRTIDN AEAF+ASIHSA+MVKAELDGREAL AKEREN ST LEAATTLKGELLKAQGE
Subjt:  RFRMEPSSSGVKDQVSRISATCLDRCLKRASKFVSDLGSVLQRTIDNPAEAFVASIHSAIMVKAELDGREALAAKERENSSTALEAATTLKGELLKAQGE

Query:  VGILRAEVDAKAELLKKEGEKHKAHLRAAHAITKGLEKEKFQLLKEKDDLAQVLEEKDASIGRLTAELKDLKERLTSGPLLEESFRQHPDFDGFAKDFSD
        V ILRAEVDAK +LLKKEGEKHKAHLRAAHAITKGLEKEKFQLLKEKDDLAQVLE+KDASIGRLT ELKDLKERLT G LLEESFRQHP+FDGFAKDFSD
Subjt:  VGILRAEVDAKAELLKKEGEKHKAHLRAAHAITKGLEKEKFQLLKEKDDLAQVLEEKDASIGRLTAELKDLKERLTSGPLLEESFRQHPDFDGFAKDFSD

Query:  AGFKFLMKGIAADMPHLQIDLSDLKKKYSEKWASGPNGTPGPQPLVDKYVRELDSNYSDMEEEDAPSQEPNEIGTTQEEVPSQ
        AGFKFLMKGIAADMPHLQIDLSDLKK+YSE WASGPNGTPGPQ LVDKYVRELDS+YSDMEEEDAPSQEP ++GTTQEE PSQ
Subjt:  AGFKFLMKGIAADMPHLQIDLSDLKKKYSEKWASGPNGTPGPQPLVDKYVRELDSNYSDMEEEDAPSQEPNEIGTTQEEVPSQ

A0A6J1DF31 uncharacterized protein LOC111019909 | e value: 6.9e-146 | % identity: 95.89
Query:  MGGTFDVRTRFRMEPSSSGVKDQVSRISATCLDRCLKRASKFVSDLGSVLQRTIDNPAEAFVASIHSAIMVKAELDGREALAAKERENSSTALEAATTLK
        MGGTFDVRTRFRMEPSSSGVKDQVSRISATCLDRCLKRASKFVSD GSVLQRTIDN AEAFVASIHSAIMVKAELDGREALAAKERENSS ALEAATTLK
Subjt:  MGGTFDVRTRFRMEPSSSGVKDQVSRISATCLDRCLKRASKFVSDLGSVLQRTIDNPAEAFVASIHSAIMVKAELDGREALAAKERENSSTALEAATTLK

Query:  GELLKAQGEVGILRAEVDAKAELLKKEGEKHKAHLRAAHAITKGLEKEKFQLLKEKDDLAQVLEEKDASIGRLTAELKDLKERLTSGPLLEESFRQHPDF
        GELLKAQGEVGILRAEVDAKAELLKKEGEKHKAHLRAAHAITKGLEKEKFQLLKEKDDLAQVLE KD SIGRLTAELKDLKERLT+G LLEESFRQH DF
Subjt:  GELLKAQGEVGILRAEVDAKAELLKKEGEKHKAHLRAAHAITKGLEKEKFQLLKEKDDLAQVLEEKDASIGRLTAELKDLKERLTSGPLLEESFRQHPDF

Query:  DGFAKDFSDAGFKFLMKGIAADMPHLQIDLSDLKKKYSEKWASGPNGTPGPQPLVDKYVRELDSNYSDMEEEDAPSQEPNEIGTTQEEVPSQ
        DGFAKDFSDAGFKFLMKGIAADMPHLQIDLS+LKKKYSEKWASGPNGTPGPQ LV KYVRELDS+YSDMEEEDAPSQEPNEIGTTQEEVPSQ
Subjt:  DGFAKDFSDAGFKFLMKGIAADMPHLQIDLSDLKKKYSEKWASGPNGTPGPQPLVDKYVRELDSNYSDMEEEDAPSQEPNEIGTTQEEVPSQ

A0A6J1DZB3 uncharacterized protein LOC111025665 | e value: 1.6e-182 | % identity: 89.84
Query:  SVPRTEAQGNSGPSSAVPTPVIKLDLSGGRSEEKRPREESEALDVSPLNDVRGESPLKRRRKKKKTSSLSEVGARETLPASHADLVDDPAARMGGTFDVR
        +VPRT AQGNSGPSSAVPTPVI+LDLSGGRS EKR REESEALDVSPLN+VRGESPL+RRRKKKKTSS SE GAR TLP SHADLVDDP ARM GT +VR
Subjt:  SVPRTEAQGNSGPSSAVPTPVIKLDLSGGRSEEKRPREESEALDVSPLNDVRGESPLKRRRKKKKTSSLSEVGARETLPASHADLVDDPAARMGGTFDVR

Query:  TRFRMEPSSSGVKDQVSRISATCLDRCLKRASKFVSDLGSVLQRTIDNPAEAFVASIHSAIMVKAELDGREALAAKERENSSTALEAATTLKGELLKAQG
         RF MEPSSSGVKDQVSRISATCLDR L+RASKFVSD GSVLQRTIDN AEAF+ASIH A+MVKAELDGREALAAKERENS  ALEAATTLKGELLKAQG
Subjt:  TRFRMEPSSSGVKDQVSRISATCLDRCLKRASKFVSDLGSVLQRTIDNPAEAFVASIHSAIMVKAELDGREALAAKERENSSTALEAATTLKGELLKAQG

Query:  EVGILRAEVDAKAELLKKEGEKHKAHLRAAHAITKGLEKEKFQLLKEKDDLAQVLEEKDASIGRLTAELKDLKERLTSGPLLEESFRQHPDFDGFAKDFS
        EV ILRAEVDAK +LLKKEGEKHKAHLRAAHAITKGLEKEKFQLLKEKDDLAQVLEEKDASIGRLT ELKDLKERLT+G LLEESFRQHPDFDGFAKDFS
Subjt:  EVGILRAEVDAKAELLKKEGEKHKAHLRAAHAITKGLEKEKFQLLKEKDDLAQVLEEKDASIGRLTAELKDLKERLTSGPLLEESFRQHPDFDGFAKDFS

Query:  DAGFKFLMKGIAADMPHLQIDLSDLKKKYSEKWASGPNGTPGPQPLVDKYVRELDSNYSDMEEEDAPSQEPNEIGTTQEEVPSQ
        DAGFKFLMKGIAADMPHLQIDL+ LKKKYSEKWASGPNGTP PQ LVDKYVRELDS+YSDMEEEDAPSQEP E+GTTQEEVPSQ
Subjt:  DAGFKFLMKGIAADMPHLQIDLSDLKKKYSEKWASGPNGTPGPQPLVDKYVRELDSNYSDMEEEDAPSQEPNEIGTTQEEVPSQ

A0A6J1DZB5 uncharacterized protein LOC111024898 | e value: 4.1e-98 | % identity: 85.2
Query:  MVKAELDGREALAAKERENSSTALEAATTLKGELLKAQGEVGILRAEVDAKAELLKKEGEKHKAHLRAAHAITKGLEKEKFQLLKEKDDLAQVLEEKDAS
        M+KAELDGREALAAKE+ENS  ALEAATT+K ELLKA+ EVGIL+A+VD KAE+LKKEGEKHKAHL AAHAITK +EKEKFQLLKEKDDLAQ LEE DA 
Subjt:  MVKAELDGREALAAKERENSSTALEAATTLKGELLKAQGEVGILRAEVDAKAELLKKEGEKHKAHLRAAHAITKGLEKEKFQLLKEKDDLAQVLEEKDAS

Query:  IGRLTAELKDLKERLTSGPLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIAADMPHLQIDLSDLKKKYSEKWASGPNGTPGPQPLVDKYVRELDSNYSDM
        IGRL+ ELKD KERLT+G LLEE+F+QHPDFDGFAKDFSDAGFKFLMKGIA DM HLQIDLSD+KKKYSEKWASGPNGTPGPQ LVDKYVRELDS+YSD+
Subjt:  IGRLTAELKDLKERLTSGPLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIAADMPHLQIDLSDLKKKYSEKWASGPNGTPGPQPLVDKYVRELDSNYSDM

Query:  EEEDAPSQEPNEIGTTQEEVPSQ
        EE DAPSQEPNE+GTTQEEVPSQ
Subjt:  EEEDAPSQEPNEIGTTQEEVPSQ

SwissProt top hits | e value | % identity | Alignment
No hits found
Arabidopsis top hits | e value | % identity | Alignment
No hits found

Sequences
CDS sequence
ATGAGGTTGAGATGTGTTTATGCAGGAATCTGCACAACGGTTCTTCACGAATCGAGCTCGAACCCGGTTTCCGATTCCGACCTGAACACTAGAGTGGACCTGCAC
AAGAGGGTGAACACTCCGACGCTCAAGTCAGTAATGGGCCCGACAGCACGCACGACCGGCGGTTACATGTCTTTTCTCATATCGGACCTATCGGGTTCCGAGCAG
GTCGGACCACAGTTATCAGAGTACTCAAGCGTTTCGTCGTTGCGTATCCCGAGGAGATCTCAGCCGCTCGTTGATTACACGTGTACGGCGCAGAGATTTTTCCGA
TCAGCTATAAATAGTGCCGAAACTTCAGTTTTCTTATCTTCCCCCTCCAGTAGCGATAGCCTGGGTAGTGTAGGTCGGACAATAAGTAGTTCGCCCCCCAAGCCA
AGTGACTCTGGGGAGGTCTTAGCTCGTAGGTTAGAGTCTGAGCTGGAAGAAATAGAGAACTTTAGGTTCTCAGATGATGAAGAGGATAGCGATACCTCCACCTCG
GGCCAGGGTCTGGAGTACCCTTCTAGGATGCCCGAGCACTACCTTGGACCCCTTCGTAGGGGGTTTAACATTCCGAATAACATCCTCAGGATTCCGGAGGAAGGG
GAAAGAGCTGACAATCCTCCAGAGGGATGGGTCACTCTTTATTTGAAGATGTTTGAGTACGGCCTCAGACTTCCCCTTCATCCCTTTGCTCAGGAGTTCTTAAAC
CGAACCGGACTGGCTCCTGCTCAAGTGGCCCCCAATGGGTGGGGTGTCATTTTTGCTTTAGCCATTCTTTTTTGGTCGGTGCCACGGACTGAGGCTCAGGGTAAC
TCCGGGCCATCCTCTGCAGTTCCCACCCCCGTGATCAAACTGGACCTGTCTGGGGGTCGATCTGAAGAGAAGCGTCCAAGGGAGGAGTCCGAGGCGCTTGATGTA
TCTCCCCTGAACGACGTGAGGGGAGAGTCTCCTTTGAAGAGAAGAAGAAAGAAGAAGAAGACCTCTTCCCTTTCGGAGGTTGGGGCTCGTGAGACCCTGCCCGCG
AGCCATGCTGACCTGGTGGACGACCCCGCAGCTCGGATGGGGGGAACATTCGACGTGCGAACGCGGTTCAGGATGGAACCGTCAAGCTCTGGGGTGAAGGACCAG
GTGTCCCGCATCTCGGCCACGTGCTTGGACCGCTGTCTGAAGAGAGCATCCAAGTTCGTGAGTGATCTTGGGTCCGTACTGCAGAGGACCATCGATAACCCTGCC
GAGGCGTTTGTCGCTTCCATTCATTCAGCTATTATGGTCAAGGCTGAGTTGGATGGAAGGGAGGCTCTGGCAGCAAAGGAGAGGGAGAACTCTTCTACTGCCTTA
GAGGCTGCCACCACGCTGAAGGGCGAGCTGCTAAAGGCCCAAGGCGAGGTGGGTATCTTAAGGGCCGAGGTGGATGCCAAGGCCGAACTTTTGAAGAAGGAGGGT
GAGAAGCACAAGGCCCACCTCCGAGCAGCCCATGCGATCACCAAGGGGCTGGAGAAGGAGAAATTCCAACTCCTAAAGGAGAAGGACGATCTCGCCCAAGTTCTT
GAGGAGAAGGATGCCTCTATTGGGCGTCTCACTGCCGAGCTCAAAGACCTGAAGGAGCGCCTCACCAGCGGACCTCTGCTGGAGGAGTCGTTCCGGCAACACCCA
GACTTCGATGGGTTCGCCAAGGACTTTAGCGACGCCGGCTTCAAGTTCTTGATGAAGGGTATTGCTGCCGACATGCCTCACCTTCAGATCGATCTCAGCGACCTC
AAGAAGAAATACTCTGAGAAGTGGGCTTCTGGGCCTAACGGGACTCCTGGCCCCCAACCGCTGGTGGACAAGTATGTCAGGGAGCTGGACTCTAACTACTCCGAC
ATGGAAGAAGAGGACGCTCCTAGCCAAGAGCCCAATGAGATCGGCACAACGCAAGAGGAGGTTCCTTCTCAGTAG
mRNA sequence
ATGAGGTTGAGATGTGTTTATGCAGGAATCTGCACAACGGTTCTTCACGAATCGAGCTCGAACCCGGTTTCCGATTCCGACCTGAACACTAGAGTGGACCTGCAC
AAGAGGGTGAACACTCCGACGCTCAAGTCAGTAATGGGCCCGACAGCACGCACGACCGGCGGTTACATGTCTTTTCTCATATCGGACCTATCGGGTTCCGAGCAG
GTCGGACCACAGTTATCAGAGTACTCAAGCGTTTCGTCGTTGCGTATCCCGAGGAGATCTCAGCCGCTCGTTGATTACACGTGTACGGCGCAGAGATTTTTCCGA
TCAGCTATAAATAGTGCCGAAACTTCAGTTTTCTTATCTTCCCCCTCCAGTAGCGATAGCCTGGGTAGTGTAGGTCGGACAATAAGTAGTTCGCCCCCCAAGCCA
AGTGACTCTGGGGAGGTCTTAGCTCGTAGGTTAGAGTCTGAGCTGGAAGAAATAGAGAACTTTAGGTTCTCAGATGATGAAGAGGATAGCGATACCTCCACCTCG
GGCCAGGGTCTGGAGTACCCTTCTAGGATGCCCGAGCACTACCTTGGACCCCTTCGTAGGGGGTTTAACATTCCGAATAACATCCTCAGGATTCCGGAGGAAGGG
GAAAGAGCTGACAATCCTCCAGAGGGATGGGTCACTCTTTATTTGAAGATGTTTGAGTACGGCCTCAGACTTCCCCTTCATCCCTTTGCTCAGGAGTTCTTAAAC
CGAACCGGACTGGCTCCTGCTCAAGTGGCCCCCAATGGGTGGGGTGTCATTTTTGCTTTAGCCATTCTTTTTTGGTCGGTGCCACGGACTGAGGCTCAGGGTAAC
TCCGGGCCATCCTCTGCAGTTCCCACCCCCGTGATCAAACTGGACCTGTCTGGGGGTCGATCTGAAGAGAAGCGTCCAAGGGAGGAGTCCGAGGCGCTTGATGTA
TCTCCCCTGAACGACGTGAGGGGAGAGTCTCCTTTGAAGAGAAGAAGAAAGAAGAAGAAGACCTCTTCCCTTTCGGAGGTTGGGGCTCGTGAGACCCTGCCCGCG
AGCCATGCTGACCTGGTGGACGACCCCGCAGCTCGGATGGGGGGAACATTCGACGTGCGAACGCGGTTCAGGATGGAACCGTCAAGCTCTGGGGTGAAGGACCAG
GTGTCCCGCATCTCGGCCACGTGCTTGGACCGCTGTCTGAAGAGAGCATCCAAGTTCGTGAGTGATCTTGGGTCCGTACTGCAGAGGACCATCGATAACCCTGCC
GAGGCGTTTGTCGCTTCCATTCATTCAGCTATTATGGTCAAGGCTGAGTTGGATGGAAGGGAGGCTCTGGCAGCAAAGGAGAGGGAGAACTCTTCTACTGCCTTA
GAGGCTGCCACCACGCTGAAGGGCGAGCTGCTAAAGGCCCAAGGCGAGGTGGGTATCTTAAGGGCCGAGGTGGATGCCAAGGCCGAACTTTTGAAGAAGGAGGGT
GAGAAGCACAAGGCCCACCTCCGAGCAGCCCATGCGATCACCAAGGGGCTGGAGAAGGAGAAATTCCAACTCCTAAAGGAGAAGGACGATCTCGCCCAAGTTCTT
GAGGAGAAGGATGCCTCTATTGGGCGTCTCACTGCCGAGCTCAAAGACCTGAAGGAGCGCCTCACCAGCGGACCTCTGCTGGAGGAGTCGTTCCGGCAACACCCA
GACTTCGATGGGTTCGCCAAGGACTTTAGCGACGCCGGCTTCAAGTTCTTGATGAAGGGTATTGCTGCCGACATGCCTCACCTTCAGATCGATCTCAGCGACCTC
AAGAAGAAATACTCTGAGAAGTGGGCTTCTGGGCCTAACGGGACTCCTGGCCCCCAACCGCTGGTGGACAAGTATGTCAGGGAGCTGGACTCTAACTACTCCGAC
ATGGAAGAAGAGGACGCTCCTAGCCAAGAGCCCAATGAGATCGGCACAACGCAAGAGGAGGTTCCTTCTCAGTAG
Protein sequence
MRLRCVYAGICTTVLHESSSNPVSDSDLNTRVDLHKRVNTPTLKSVMGPTARTTGGYMSFLISDLSGSEQVGPQLSEYSSVSSLRIPRRSQPLVDYTCTAQRFFR
SAINSAETSVFLSSPSSSDSLGSVGRTISSSPPKPSDSGEVLARRLESELEEIENFRFSDDEEDSDTSTSGQGLEYPSRMPEHYLGPLRRGFNIPNNILRIPEEG
ERADNPPEGWVTLYLKMFEYGLRLPLHPFAQEFLNRTGLAPAQVAPNGWGVIFALAILFWSVPRTEAQGNSGPSSAVPTPVIKLDLSGGRSEEKRPREESEALDV
SPLNDVRGESPLKRRRKKKKTSSLSEVGARETLPASHADLVDDPAARMGGTFDVRTRFRMEPSSSGVKDQVSRISATCLDRCLKRASKFVSDLGSVLQRTIDNPA
EAFVASIHSAIMVKAELDGREALAAKERENSSTALEAATTLKGELLKAQGEVGILRAEVDAKAELLKKEGEKHKAHLRAAHAITKGLEKEKFQLLKEKDDLAQVL
EEKDASIGRLTAELKDLKERLTSGPLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIAADMPHLQIDLSDLKKKYSEKWASGPNGTPGPQPLVDKYVRELDSNYSD
MEEEDAPSQEPNEIGTTQEEVPSQ
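
The CDS and protein sequences above can be cross-checked by translation. The following is a minimal sketch, assuming the standard nuclear genetic code and using only the final line of each sequence as printed on this page (the preceding CDS lines are each 105 nt long, so the final line starts on a codon boundary). The function name `translate` and the variable names are illustrative, not part of any database tool.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order
# with the third position varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# Final line of the CDS sequence and of the protein sequence, copied from this page.
last_cds_line = (
    "ATGGAAGAAGAGGACGCTCCTAGCCAAGAGCCCAATGAGATCGGCACAACGCAAGAGGAGGTTCCTTCTCAGTAG"
)
last_protein_line = "MEEEDAPSQEPNEIGTTQEEVPSQ"

print(translate(last_cds_line) == last_protein_line)
```

The same function could be applied to the full CDS after joining its lines. Codons containing characters other than A, C, G, and T are not handled in this sketch.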