; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc06g30210 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc06g30210
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionGag/pol protein
Genome locationchr6:22719536..22725590
RNA-Seq ExpressionMoc06g30210
SyntenyMoc06g30210
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022142326.1 uncharacterized protein LOC111012467 [Momordica charantia]3.3e-9045.23Show/hide
Query:  PLNEVRGESPLKRRRKKKKTSSSSEAGARGTLPTSHADLVDDPAARIGGTSDVRMRFRMEPSSSGVKDQVSRISATCLDRCLRRASKFVSDPGS------
        P + +  E+P K RRKKKK  SSSE GA   LP   AD VDDPAAR+GGTSDV  RFR+EPSSSGV+DQVSRISA  LDRCLRRASKFVS PGS      
Subjt:  PLNEVRGESPLKRRRKKKKTSSSSEAGARGTLPTSHADLVDDPAARIGGTSDVRMRFRMEPSSSGVKDQVSRISATCLDRCLRRASKFVSDPGS------

Query:  -----AFVASIHSAIMVKAELDGREALAAKERENSSAALEAA-TTLKGELLKAQGEVDILRAEV------------------------------------
             AFVASI SA+ VKAELDGRE LAA+E+E  SAALEAA +T+K ELLKA  EV+ L+AEV                                    
Subjt:  -----AFVASIHSAIMVKAELDGREALAAKERENSSAALEAA-TTLKGELLKAQGEVDILRAEV------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ------------------------------------------------DAKANLLKKEGEKHKAHLRAAHAITKGLEKEKFQLLKEKDDLAQVLEEKDAS
                                                        +AKA LLK+E E+HKAHLRAAHAITKGLEKEKFQLLKEKDD+ Q LE KDA+
Subjt:  ------------------------------------------------DAKANLLKKEGEKHKAHLRAAHAITKGLEKEKFQLLKEKDDLAQVLEEKDAS

Query:  IGRLTAELKDLKERLTNRPLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIAADMPHLQIDLSDLKKKYSE--------------------RELDSEYSDA
        IGRL AELK  KERLTN  LLE +FRQHPDFDGFAKDFSDAGFKFLMKGIAAD+PHL++DL DLKK+Y+E                    R+LDS+YSD 
Subjt:  IGRLTAELKDLKERLTNRPLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIAADMPHLQIDLSDLKKKYSE--------------------RELDSEYSDA

Query:  EEEDAPSQEPNEIGTMQEDVPSQQDGSQEVNLLGS
        +E++ PSQEP E+GT QE VPSQQDGSQEVNLLGS
Subjt:  EEEDAPSQEPNEIGTMQEDVPSQQDGSQEVNLLGS

XP_022147182.1 uncharacterized protein LOC111016193 [Momordica charantia]1.0e-11079.79Show/hide
Query:  MRFRMEPSSSGVKDQVSRISATCLDRCLRRASKFVSDPGS-----------AFVASIHSAIMVKAELDGREALAAKERENSSAALEAATTLKGELLKAQG
        MRFRME SSSGVKDQVSRISATCLDRCLRRAS+FVSDPGS           AF+ASIHSA+MVKAELDGREAL AKEREN S  LEAATTLKGELLKAQG
Subjt:  MRFRMEPSSSGVKDQVSRISATCLDRCLRRASKFVSDPGS-----------AFVASIHSAIMVKAELDGREALAAKERENSSAALEAATTLKGELLKAQG

Query:  EVDILRAEVDAKANLLKKEGEKHKAHLRAAHAITKGLEKEKFQLLKEKDDLAQVLEEKDASIGRLTAELKDLKERLTNRPLLEESFRQHPDFDGFAKDFS
        EVDILRAEVDAK +LLKKEGEKHKAHLRAAHAITKGLEKEKFQLLKEKDDLAQVLE+KDASIGRLT ELKDLKERLT+  LLEESFRQHP+FDGFAKDFS
Subjt:  EVDILRAEVDAKANLLKKEGEKHKAHLRAAHAITKGLEKEKFQLLKEKDDLAQVLEEKDASIGRLTAELKDLKERLTNRPLLEESFRQHPDFDGFAKDFS

Query:  DAGFKFLMKGIAADMPHLQIDLSDLKKKYSE--------------------RELDSEYSDAEEEDAPSQEPNEIGTMQEDVPSQQDG
        DAGFKFLMKGIAADMPHLQIDLSDLKK+YSE                    RELDS+YSD EEEDAPSQEP ++GT QE+ PSQ  G
Subjt:  DAGFKFLMKGIAADMPHLQIDLSDLKKKYSE--------------------RELDSEYSDAEEEDAPSQEPNEIGTMQEDVPSQQDG

XP_022152119.1 uncharacterized protein LOC111019909 [Momordica charantia]1.0e-12384.54Show/hide
Query:  IGGTSDVRMRFRMEPSSSGVKDQVSRISATCLDRCLRRASKFVSDPGS-----------AFVASIHSAIMVKAELDGREALAAKERENSSAALEAATTLK
        +GGT DVR RFRMEPSSSGVKDQVSRISATCLDRCL+RASKFVSDPGS           AFVASIHSAIMVKAELDGREALAAKERENSSAALEAATTLK
Subjt:  IGGTSDVRMRFRMEPSSSGVKDQVSRISATCLDRCLRRASKFVSDPGS-----------AFVASIHSAIMVKAELDGREALAAKERENSSAALEAATTLK

Query:  GELLKAQGEVDILRAEVDAKANLLKKEGEKHKAHLRAAHAITKGLEKEKFQLLKEKDDLAQVLEEKDASIGRLTAELKDLKERLTNRPLLEESFRQHPDF
        GELLKAQGEV ILRAEVDAKA LLKKEGEKHKAHLRAAHAITKGLEKEKFQLLKEKDDLAQVLE KD SIGRLTAELKDLKERLTN  LLEESFRQH DF
Subjt:  GELLKAQGEVDILRAEVDAKANLLKKEGEKHKAHLRAAHAITKGLEKEKFQLLKEKDDLAQVLEEKDASIGRLTAELKDLKERLTNRPLLEESFRQHPDF

Query:  DGFAKDFSDAGFKFLMKGIAADMPHLQIDLSDLKKKYSE--------------------RELDSEYSDAEEEDAPSQEPNEIGTMQEDVPSQQDGSQEVN
        DGFAKDFSDAGFKFLMKGIAADMPHLQIDLS+LKKKYSE                    RELDS+YSD EEEDAPSQEPNEIGT QE+VPSQQDGSQEVN
Subjt:  DGFAKDFSDAGFKFLMKGIAADMPHLQIDLSDLKKKYSE--------------------RELDSEYSDAEEEDAPSQEPNEIGTMQEDVPSQQDGSQEVN

Query:  LLGS
        LLGS
Subjt:  LLGS

XP_022158409.1 uncharacterized protein LOC111024898 [Momordica charantia]4.0e-8375.74Show/hide
Query:  MVKAELDGREALAAKERENSSAALEAATTLKGELLKAQGEVDILRAEVDAKANLLKKEGEKHKAHLRAAHAITKGLEKEKFQLLKEKDDLAQVLEEKDAS
        M+KAELDGREALAAKE+ENS AALEAATT+K ELLKA+ EV IL+A+VD KA +LKKEGEKHKAHL AAHAITK +EKEKFQLLKEKDDLAQ LEE DA 
Subjt:  MVKAELDGREALAAKERENSSAALEAATTLKGELLKAQGEVDILRAEVDAKANLLKKEGEKHKAHLRAAHAITKGLEKEKFQLLKEKDDLAQVLEEKDAS

Query:  IGRLTAELKDLKERLTNRPLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIAADMPHLQIDLSDLKKKYSE--------------------RELDSEYSDA
        IGRL+ ELKD KERLTN  LLEE+F+QHPDFDGFAKDFSDAGFKFLMKGIA DM HLQIDLSD+KKKYSE                    RELDS+YSD 
Subjt:  IGRLTAELKDLKERLTNRPLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIAADMPHLQIDLSDLKKKYSE--------------------RELDSEYSDA

Query:  EEEDAPSQEPNEIGTMQEDVPSQQDGSQEVNLLGS
        EE DAPSQEPNE+GT QE+VPSQ  GSQEVNLLGS
Subjt:  EEEDAPSQEPNEIGTMQEDVPSQQDGSQEVNLLGS

XP_022159252.1 uncharacterized protein LOC111025665 [Momordica charantia]1.7e-15583.11Show/hide
Query:  NKPPISRSPTPVIELDLSGGRSEVKRPREESEALDVSPLNEVRGESPLKRRRKKKKTSSSSEAGARGTLPTSHADLVDDPAARIGGTSDVRMRFRMEPSS
        N  P S  PTPVIELDLSGGRS  KR REESEALDVSPLNEVRGESPL+RRRKKKKTSSSSEAGARGTLPTSHADLVDDP AR+ GTS+VRMRF MEPSS
Subjt:  NKPPISRSPTPVIELDLSGGRSEVKRPREESEALDVSPLNEVRGESPLKRRRKKKKTSSSSEAGARGTLPTSHADLVDDPAARIGGTSDVRMRFRMEPSS

Query:  SGVKDQVSRISATCLDRCLRRASKFVSDPGS-----------AFVASIHSAIMVKAELDGREALAAKERENSSAALEAATTLKGELLKAQGEVDILRAEV
        SGVKDQVSRISATCLDR LRRASKFVSDPGS           AF+ASIH A+MVKAELDGREALAAKERENS AALEAATTLKGELLKAQGEVDILRAEV
Subjt:  SGVKDQVSRISATCLDRCLRRASKFVSDPGS-----------AFVASIHSAIMVKAELDGREALAAKERENSSAALEAATTLKGELLKAQGEVDILRAEV

Query:  DAKANLLKKEGEKHKAHLRAAHAITKGLEKEKFQLLKEKDDLAQVLEEKDASIGRLTAELKDLKERLTNRPLLEESFRQHPDFDGFAKDFSDAGFKFLMK
        DAK +LLKKEGEKHKAHLRAAHAITKGLEKEKFQLLKEKDDLAQVLEEKDASIGRLT ELKDLKERLTN  LLEESFRQHPDFDGFAKDFSDAGFKFLMK
Subjt:  DAKANLLKKEGEKHKAHLRAAHAITKGLEKEKFQLLKEKDDLAQVLEEKDASIGRLTAELKDLKERLTNRPLLEESFRQHPDFDGFAKDFSDAGFKFLMK

Query:  GIAADMPHLQIDLSDLKKKYSE--------------------RELDSEYSDAEEEDAPSQEPNEIGTMQEDVPSQQDGS
        GIAADMPHLQIDL+ LKKKYSE                    RELDS+YSD EEEDAPSQEP E+GT QE+VPSQQ GS
Subjt:  GIAADMPHLQIDLSDLKKKYSE--------------------RELDSEYSDAEEEDAPSQEPNEIGTMQEDVPSQQDGS

TrEMBL top hitse value%identityAlignment
A0A6J1CLV1 uncharacterized protein LOC1110124671.6e-9045.23Show/hide
Query:  PLNEVRGESPLKRRRKKKKTSSSSEAGARGTLPTSHADLVDDPAARIGGTSDVRMRFRMEPSSSGVKDQVSRISATCLDRCLRRASKFVSDPGS------
        P + +  E+P K RRKKKK  SSSE GA   LP   AD VDDPAAR+GGTSDV  RFR+EPSSSGV+DQVSRISA  LDRCLRRASKFVS PGS      
Subjt:  PLNEVRGESPLKRRRKKKKTSSSSEAGARGTLPTSHADLVDDPAARIGGTSDVRMRFRMEPSSSGVKDQVSRISATCLDRCLRRASKFVSDPGS------

Query:  -----AFVASIHSAIMVKAELDGREALAAKERENSSAALEAA-TTLKGELLKAQGEVDILRAEV------------------------------------
             AFVASI SA+ VKAELDGRE LAA+E+E  SAALEAA +T+K ELLKA  EV+ L+AEV                                    
Subjt:  -----AFVASIHSAIMVKAELDGREALAAKERENSSAALEAA-TTLKGELLKAQGEVDILRAEV------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ------------------------------------------------DAKANLLKKEGEKHKAHLRAAHAITKGLEKEKFQLLKEKDDLAQVLEEKDAS
                                                        +AKA LLK+E E+HKAHLRAAHAITKGLEKEKFQLLKEKDD+ Q LE KDA+
Subjt:  ------------------------------------------------DAKANLLKKEGEKHKAHLRAAHAITKGLEKEKFQLLKEKDDLAQVLEEKDAS

Query:  IGRLTAELKDLKERLTNRPLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIAADMPHLQIDLSDLKKKYSE--------------------RELDSEYSDA
        IGRL AELK  KERLTN  LLE +FRQHPDFDGFAKDFSDAGFKFLMKGIAAD+PHL++DL DLKK+Y+E                    R+LDS+YSD 
Subjt:  IGRLTAELKDLKERLTNRPLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIAADMPHLQIDLSDLKKKYSE--------------------RELDSEYSDA

Query:  EEEDAPSQEPNEIGTMQEDVPSQQDGSQEVNLLGS
        +E++ PSQEP E+GT QE VPSQQDGSQEVNLLGS
Subjt:  EEEDAPSQEPNEIGTMQEDVPSQQDGSQEVNLLGS

A0A6J1D1N9 uncharacterized protein LOC1110161934.8e-11179.79Show/hide
Query:  MRFRMEPSSSGVKDQVSRISATCLDRCLRRASKFVSDPGS-----------AFVASIHSAIMVKAELDGREALAAKERENSSAALEAATTLKGELLKAQG
        MRFRME SSSGVKDQVSRISATCLDRCLRRAS+FVSDPGS           AF+ASIHSA+MVKAELDGREAL AKEREN S  LEAATTLKGELLKAQG
Subjt:  MRFRMEPSSSGVKDQVSRISATCLDRCLRRASKFVSDPGS-----------AFVASIHSAIMVKAELDGREALAAKERENSSAALEAATTLKGELLKAQG

Query:  EVDILRAEVDAKANLLKKEGEKHKAHLRAAHAITKGLEKEKFQLLKEKDDLAQVLEEKDASIGRLTAELKDLKERLTNRPLLEESFRQHPDFDGFAKDFS
        EVDILRAEVDAK +LLKKEGEKHKAHLRAAHAITKGLEKEKFQLLKEKDDLAQVLE+KDASIGRLT ELKDLKERLT+  LLEESFRQHP+FDGFAKDFS
Subjt:  EVDILRAEVDAKANLLKKEGEKHKAHLRAAHAITKGLEKEKFQLLKEKDDLAQVLEEKDASIGRLTAELKDLKERLTNRPLLEESFRQHPDFDGFAKDFS

Query:  DAGFKFLMKGIAADMPHLQIDLSDLKKKYSE--------------------RELDSEYSDAEEEDAPSQEPNEIGTMQEDVPSQQDG
        DAGFKFLMKGIAADMPHLQIDLSDLKK+YSE                    RELDS+YSD EEEDAPSQEP ++GT QE+ PSQ  G
Subjt:  DAGFKFLMKGIAADMPHLQIDLSDLKKKYSE--------------------RELDSEYSDAEEEDAPSQEPNEIGTMQEDVPSQQDG

A0A6J1DF31 uncharacterized protein LOC1110199095.0e-12484.54Show/hide
Query:  IGGTSDVRMRFRMEPSSSGVKDQVSRISATCLDRCLRRASKFVSDPGS-----------AFVASIHSAIMVKAELDGREALAAKERENSSAALEAATTLK
        +GGT DVR RFRMEPSSSGVKDQVSRISATCLDRCL+RASKFVSDPGS           AFVASIHSAIMVKAELDGREALAAKERENSSAALEAATTLK
Subjt:  IGGTSDVRMRFRMEPSSSGVKDQVSRISATCLDRCLRRASKFVSDPGS-----------AFVASIHSAIMVKAELDGREALAAKERENSSAALEAATTLK

Query:  GELLKAQGEVDILRAEVDAKANLLKKEGEKHKAHLRAAHAITKGLEKEKFQLLKEKDDLAQVLEEKDASIGRLTAELKDLKERLTNRPLLEESFRQHPDF
        GELLKAQGEV ILRAEVDAKA LLKKEGEKHKAHLRAAHAITKGLEKEKFQLLKEKDDLAQVLE KD SIGRLTAELKDLKERLTN  LLEESFRQH DF
Subjt:  GELLKAQGEVDILRAEVDAKANLLKKEGEKHKAHLRAAHAITKGLEKEKFQLLKEKDDLAQVLEEKDASIGRLTAELKDLKERLTNRPLLEESFRQHPDF

Query:  DGFAKDFSDAGFKFLMKGIAADMPHLQIDLSDLKKKYSE--------------------RELDSEYSDAEEEDAPSQEPNEIGTMQEDVPSQQDGSQEVN
        DGFAKDFSDAGFKFLMKGIAADMPHLQIDLS+LKKKYSE                    RELDS+YSD EEEDAPSQEPNEIGT QE+VPSQQDGSQEVN
Subjt:  DGFAKDFSDAGFKFLMKGIAADMPHLQIDLSDLKKKYSE--------------------RELDSEYSDAEEEDAPSQEPNEIGTMQEDVPSQQDGSQEVN

Query:  LLGS
        LLGS
Subjt:  LLGS

A0A6J1DZB3 uncharacterized protein LOC1110256658.4e-15683.11Show/hide
Query:  NKPPISRSPTPVIELDLSGGRSEVKRPREESEALDVSPLNEVRGESPLKRRRKKKKTSSSSEAGARGTLPTSHADLVDDPAARIGGTSDVRMRFRMEPSS
        N  P S  PTPVIELDLSGGRS  KR REESEALDVSPLNEVRGESPL+RRRKKKKTSSSSEAGARGTLPTSHADLVDDP AR+ GTS+VRMRF MEPSS
Subjt:  NKPPISRSPTPVIELDLSGGRSEVKRPREESEALDVSPLNEVRGESPLKRRRKKKKTSSSSEAGARGTLPTSHADLVDDPAARIGGTSDVRMRFRMEPSS

Query:  SGVKDQVSRISATCLDRCLRRASKFVSDPGS-----------AFVASIHSAIMVKAELDGREALAAKERENSSAALEAATTLKGELLKAQGEVDILRAEV
        SGVKDQVSRISATCLDR LRRASKFVSDPGS           AF+ASIH A+MVKAELDGREALAAKERENS AALEAATTLKGELLKAQGEVDILRAEV
Subjt:  SGVKDQVSRISATCLDRCLRRASKFVSDPGS-----------AFVASIHSAIMVKAELDGREALAAKERENSSAALEAATTLKGELLKAQGEVDILRAEV

Query:  DAKANLLKKEGEKHKAHLRAAHAITKGLEKEKFQLLKEKDDLAQVLEEKDASIGRLTAELKDLKERLTNRPLLEESFRQHPDFDGFAKDFSDAGFKFLMK
        DAK +LLKKEGEKHKAHLRAAHAITKGLEKEKFQLLKEKDDLAQVLEEKDASIGRLT ELKDLKERLTN  LLEESFRQHPDFDGFAKDFSDAGFKFLMK
Subjt:  DAKANLLKKEGEKHKAHLRAAHAITKGLEKEKFQLLKEKDDLAQVLEEKDASIGRLTAELKDLKERLTNRPLLEESFRQHPDFDGFAKDFSDAGFKFLMK

Query:  GIAADMPHLQIDLSDLKKKYSE--------------------RELDSEYSDAEEEDAPSQEPNEIGTMQEDVPSQQDGS
        GIAADMPHLQIDL+ LKKKYSE                    RELDS+YSD EEEDAPSQEP E+GT QE+VPSQQ GS
Subjt:  GIAADMPHLQIDLSDLKKKYSE--------------------RELDSEYSDAEEEDAPSQEPNEIGTMQEDVPSQQDGS

A0A6J1DZB5 uncharacterized protein LOC1110248981.9e-8375.74Show/hide
Query:  MVKAELDGREALAAKERENSSAALEAATTLKGELLKAQGEVDILRAEVDAKANLLKKEGEKHKAHLRAAHAITKGLEKEKFQLLKEKDDLAQVLEEKDAS
        M+KAELDGREALAAKE+ENS AALEAATT+K ELLKA+ EV IL+A+VD KA +LKKEGEKHKAHL AAHAITK +EKEKFQLLKEKDDLAQ LEE DA 
Subjt:  MVKAELDGREALAAKERENSSAALEAATTLKGELLKAQGEVDILRAEVDAKANLLKKEGEKHKAHLRAAHAITKGLEKEKFQLLKEKDDLAQVLEEKDAS

Query:  IGRLTAELKDLKERLTNRPLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIAADMPHLQIDLSDLKKKYSE--------------------RELDSEYSDA
        IGRL+ ELKD KERLTN  LLEE+F+QHPDFDGFAKDFSDAGFKFLMKGIA DM HLQIDLSD+KKKYSE                    RELDS+YSD 
Subjt:  IGRLTAELKDLKERLTNRPLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIAADMPHLQIDLSDLKKKYSE--------------------RELDSEYSDA

Query:  EEEDAPSQEPNEIGTMQEDVPSQQDGSQEVNLLGS
        EE DAPSQEPNE+GT QE+VPSQ  GSQEVNLLGS
Subjt:  EEEDAPSQEPNEIGTMQEDVPSQQDGSQEVNLLGS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCCACAAACCCCTGCCGCAAATGCCAACCGAATATTCGGGAAGCATTTGATCGATGGGTCAAGGCCAATGATAAGGCCCAGGTCTACATTCTTGCCAGCATGACTGA
TGTATTGGCAAAGAAACATGAACCCTTGATGACTGCAAAGGAAATCGTGGATTCATTAAAGGCGATGTTTGGGGAACCTTCATCGACCTTGAGGCACGAGGCACTAAATA
TATCTTCATGGGTGAGAGCAGCTCAACAACGCTGGCTCAATAAGCCTCCCATTTCAAGGTCCCCCACCCCCGTGATCGAACTAGACTTGTCTGGGGGTCGATCTGAAGTG
AAGCGTCCAAGGGAGGAATCCGAGGCGCTTGATGTATCTCCCCTGAATGAGGTGAGGGGAGAGTCTCCTTTGAAGAGAAGAAGAAAGAAGAAGAAGACTTCCTCCTCCTC
GGAGGCTGGGGCTCGTGGGACTCTGCCTACGAGCCATGCTGATTTGGTGGATGACCCCGCAGCTCGGATTGGGGGAACATCCGATGTGCGAATGCGGTTCAGAATGGAAC
CGTCAAGTTCCGGGGTGAAAGACCAGGTGTCCCGCATCTCGGCCACGTGCTTGGATCGCTGCCTGAGGAGAGCATCCAAGTTCGTGAGTGATCCTGGGTCAGCGTTTGTC
GCTTCCATTCATTCAGCTATTATGGTCAAGGCCGAACTGGATGGGAGGGAGGCTTTGGCAGCCAAGGAGAGGGAGAACTCCTCTGCTGCCTTAGAAGCTGCCACCACGCT
GAAGGGCGAGCTGCTAAAGGCCCAAGGCGAGGTGGATATTTTAAGGGCCGAGGTGGATGCCAAGGCCAACCTCTTGAAGAAGGAGGGTGAGAAACACAAGGCCCACCTCC
GAGCAGCCCATGCTATCACCAAGGGGCTGGAGAAAGAGAAATTCCAACTCCTAAAGGAGAAGGACGATCTCGCTCAAGTCCTTGAGGAGAAGGATGCCTCAATTGGGCGC
CTTACGGCCGAGCTTAAAGACCTGAAGGAGCGCCTTACCAATAGACCTCTGCTGGAGGAGTCGTTCAGGCAACACCCAGACTTCGATGGGTTTGCCAAGGACTTCAGCGA
CGCCGGCTTCAAGTTCCTGATGAAGGGCATTGCTGCCGACATGCCTCACCTTCAGATCGATCTCAGCGATCTCAAGAAGAAATACTCTGAGAGGGAGCTGGATTCTGAGT
ACTCCGACGCAGAGGAAGAGGATGCTCCTAGCCAAGAGCCTAACGAGATCGGCACGATGCAAGAAGATGTTCCTTCACAGCAGGACGGATCCCAAGAGGTCAACCTTCTG
GGGTCCTAG
mRNA sequenceShow/hide mRNA sequence
ATGTCCACAAACCCCTGCCGCAAATGCCAACCGAATATTCGGGAAGCATTTGATCGATGGGTCAAGGCCAATGATAAGGCCCAGGTCTACATTCTTGCCAGCATGACTGA
TGTATTGGCAAAGAAACATGAACCCTTGATGACTGCAAAGGAAATCGTGGATTCATTAAAGGCGATGTTTGGGGAACCTTCATCGACCTTGAGGCACGAGGCACTAAATA
TATCTTCATGGGTGAGAGCAGCTCAACAACGCTGGCTCAATAAGCCTCCCATTTCAAGGTCCCCCACCCCCGTGATCGAACTAGACTTGTCTGGGGGTCGATCTGAAGTG
AAGCGTCCAAGGGAGGAATCCGAGGCGCTTGATGTATCTCCCCTGAATGAGGTGAGGGGAGAGTCTCCTTTGAAGAGAAGAAGAAAGAAGAAGAAGACTTCCTCCTCCTC
GGAGGCTGGGGCTCGTGGGACTCTGCCTACGAGCCATGCTGATTTGGTGGATGACCCCGCAGCTCGGATTGGGGGAACATCCGATGTGCGAATGCGGTTCAGAATGGAAC
CGTCAAGTTCCGGGGTGAAAGACCAGGTGTCCCGCATCTCGGCCACGTGCTTGGATCGCTGCCTGAGGAGAGCATCCAAGTTCGTGAGTGATCCTGGGTCAGCGTTTGTC
GCTTCCATTCATTCAGCTATTATGGTCAAGGCCGAACTGGATGGGAGGGAGGCTTTGGCAGCCAAGGAGAGGGAGAACTCCTCTGCTGCCTTAGAAGCTGCCACCACGCT
GAAGGGCGAGCTGCTAAAGGCCCAAGGCGAGGTGGATATTTTAAGGGCCGAGGTGGATGCCAAGGCCAACCTCTTGAAGAAGGAGGGTGAGAAACACAAGGCCCACCTCC
GAGCAGCCCATGCTATCACCAAGGGGCTGGAGAAAGAGAAATTCCAACTCCTAAAGGAGAAGGACGATCTCGCTCAAGTCCTTGAGGAGAAGGATGCCTCAATTGGGCGC
CTTACGGCCGAGCTTAAAGACCTGAAGGAGCGCCTTACCAATAGACCTCTGCTGGAGGAGTCGTTCAGGCAACACCCAGACTTCGATGGGTTTGCCAAGGACTTCAGCGA
CGCCGGCTTCAAGTTCCTGATGAAGGGCATTGCTGCCGACATGCCTCACCTTCAGATCGATCTCAGCGATCTCAAGAAGAAATACTCTGAGAGGGAGCTGGATTCTGAGT
ACTCCGACGCAGAGGAAGAGGATGCTCCTAGCCAAGAGCCTAACGAGATCGGCACGATGCAAGAAGATGTTCCTTCACAGCAGGACGGATCCCAAGAGGTCAACCTTCTG
GGGTCCTAG
Protein sequenceShow/hide protein sequence
MSTNPCRKCQPNIREAFDRWVKANDKAQVYILASMTDVLAKKHEPLMTAKEIVDSLKAMFGEPSSTLRHEALNISSWVRAAQQRWLNKPPISRSPTPVIELDLSGGRSEV
KRPREESEALDVSPLNEVRGESPLKRRRKKKKTSSSSEAGARGTLPTSHADLVDDPAARIGGTSDVRMRFRMEPSSSGVKDQVSRISATCLDRCLRRASKFVSDPGSAFV
ASIHSAIMVKAELDGREALAAKERENSSAALEAATTLKGELLKAQGEVDILRAEVDAKANLLKKEGEKHKAHLRAAHAITKGLEKEKFQLLKEKDDLAQVLEEKDASIGR
LTAELKDLKERLTNRPLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIAADMPHLQIDLSDLKKKYSERELDSEYSDAEEEDAPSQEPNEIGTMQEDVPSQQDGSQEVNLL
GS