; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc01g14210 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc01g14210
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionUnknown protein
Genome locationchr1:8862356..8863869
RNA-Seq ExpressionMoc01g14210
SyntenyMoc01g14210
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022144033.1 uncharacterized protein LOC111013825 [Momordica charantia]3.6e-5738.42Show/hide
Query:  MSSYDGSGDPVSYVEMFEGKMDFLAASDVIKCRAFQIALAASVRLWQLLKLPPSHLGTVKQLDNESLTEYIARFMDEHVKVVSCTDDIAMMYFTTGLNDR
        M+ YDGS DP  YVE+ E  M+F AA DVIKC AFQIAL  S       +   +HL  +KQ    +L EY+ RF ++ +KV  C+DD AM YF TGL D 
Subjt:  MSSYDGSGDPVSYVEMFEGKMDFLAASDVIKCRAFQIALAASVRLWQLLKLPPSHLGTVKQLDNESLTEYIARFMDEHVKVVSCTDDIAMMYFTTGLNDR

Query:  NLTIESGSRPPASLNEMFARARQYIDGLDLW-------------KANEAKRSSSGRDREHKSPPSKKWRRERAEPEGSTREEKRERSQPSRSKEDRPAVI
         LT++     P + +E+  +A++ +   + +             K    ++  +G   + K       R E        ++E+R+RS+    ++DRPAVI
Subjt:  NLTIESGSRPPASLNEMFARARQYIDGLDLW-------------KANEAKRSSSGRDREHKSPPSKKWRRERAEPEGSTREEKRERSQPSRSKEDRPAVI

Query:  NTIHGGPSGGQSGQKRKPLARETAHEVCTLYPKEPVMPILFDEQDGEGVHMPHNDALVIAP---------------------------------------
        NTI GGPSGGQ G KRK LARE   EVC +  +     I FD+ D EGVH+PHNDALVIAP                                       
Subjt:  NTIHGGPSGGQSGQKRKPLARETAHEVCTLYPKEPVMPILFDEQDGEGVHMPHNDALVIAP---------------------------------------

Query:  --PDRSCEGESVSAEGCISLPVTINEGEHQVTKVTEFVIIDRTSAYNAILGRPLIHDLKAVPSTYHQVLKYPTPTEIATI
          P     GES    GCI L V I + + Q T++ EFV+I   SAY AI GRP+IH  + VPST +QVLKY TP  + T+
Subjt:  --PDRSCEGESVSAEGCISLPVTINEGEHQVTKVTEFVIIDRTSAYNAILGRPLIHDLKAVPSTYHQVLKYPTPTEIATI

XP_022149029.1 uncharacterized protein LOC111017548 [Momordica charantia]8.0e-5748.11Show/hide
Query:  MDFLAASDVIKCRAFQIALAASVRLW--------------------------QLLKLPPSHLGTVKQLDNESLTEYIARFMDEHVKVVSCTDDIAMMYFT
        MDFLAASD IKCRAFQIAL  SVRLW                          QLLKLPPSHL TVKQ DNESLTEYIAR MDEHVKVVSCTDDIAMMYFT
Subjt:  MDFLAASDVIKCRAFQIALAASVRLW--------------------------QLLKLPPSHLGTVKQLDNESLTEYIARFMDEHVKVVSCTDDIAMMYFT

Query:  TGLNDRNLTIESGSRPPASLNEMFARARQYIDGLDLWKANEAKRSSSGRDREHKSPPSKK-----------------------------W----------
        TGLNDRNLTIE GSRPPASLN+M ARARQYIDGL+LWKA  A+RSS G+DR+ +S P KK                             W          
Subjt:  TGLNDRNLTIESGSRPPASLNEMFARARQYIDGLDLWKANEAKRSSSGRDREHKSPPSKK-----------------------------W----------

Query:  -------------------------------------------------------------------------RRERAEPEGSTREEKRERSQPSRSKED
                                                                                  RERA+PEGSTREEKRERSQP   KED
Subjt:  -------------------------------------------------------------------------RRERAEPEGSTREEKRERSQPSRSKED

Query:  RPAVINTIHGGPSGGQSG
        RPAVINTIHGGPSG +SG
Subjt:  RPAVINTIHGGPSGGQSG

XP_022150760.1 uncharacterized protein LOC111018823 [Momordica charantia]1.5e-5534.82Show/hide
Query:  MSSYDGSGDPVSYVEMFEGKMDFLAASDVIKCRAFQIALAASVRLWQLLKLPPSHLGTVKQLDNESLTEYIARFMDEHVKVVSCTDDIAMMYFTTGLNDR
        + SYDGS DP  YVE+FEG MDF AASD IKCRAFQIAL  S RLW                           F ++ +KV   +DD AM YF TGL D 
Subjt:  MSSYDGSGDPVSYVEMFEGKMDFLAASDVIKCRAFQIALAASVRLWQLLKLPPSHLGTVKQLDNESLTEYIARFMDEHVKVVSCTDDIAMMYFTTGLNDR

Query:  NLTIESGSRPPASLNEMFARARQYIDGLDLWKANEAK------RSSSGRD--------------------------------------------------
         LT++ G   PA+  E+  +A++ IDG +L +    +      R  SG+D                                                  
Subjt:  NLTIESGSRPPASLNEMFARARQYIDGLDLWKANEAK------RSSSGRD--------------------------------------------------

Query:  -------------------------------REHKSPPSKKWRRER---------------AEPEGSTREEKRER--SQPSRSKEDRPAVINTIHGGPSG
                                       REH    S +W  +R                +P  S+ E+K ER  S+    + DRPAVINTI GGPSG
Subjt:  -------------------------------REHKSPPSKKWRRER---------------AEPEGSTREEKRER--SQPSRSKEDRPAVINTIHGGPSG

Query:  GQSGQKRKPLARETAHEVCTLYPKEPVMPILFDEQDGEGVHMPHNDALVIAP-----------------------------------------PDRSCEG
        GQSG KRK LAR    EVC +  + P  PI FD  D E VH+PHNDALVIAP                                         P      
Subjt:  GQSGQKRKPLARETAHEVCTLYPKEPVMPILFDEQDGEGVHMPHNDALVIAP-----------------------------------------PDRSCEG

Query:  ESVSAEGCISLPVTINEGEHQVTKVTEFVIIDRTSAYNAILGRPLIHDLKAVPSTYHQVLKYPTPTEIATI
        ESV  EGCI LPVT+   + QVT++ EFV+ID  SAYNAI GRP+IH  +A+PST HQVLKY TP  +  +
Subjt:  ESVSAEGCISLPVTINEGEHQVTKVTEFVIIDRTSAYNAILGRPLIHDLKAVPSTYHQVLKYPTPTEIATI

XP_022158414.1 uncharacterized protein LOC111024904 [Momordica charantia]1.5e-5535.76Show/hide
Query:  MDFLAASDVIKCRAFQIALAASVRLW-------------QLLK-------------LPPSHLGTVKQLDNESLTEYIARFMDEHVKVVSCTDDIAMMYFT
        MDF AA+D IKCRAFQIAL  S RLW             QL K                +HL T++Q + E+L EY+ RF +E +KV  C+DD AM YF 
Subjt:  MDFLAASDVIKCRAFQIALAASVRLW-------------QLLK-------------LPPSHLGTVKQLDNESLTEYIARFMDEHVKVVSCTDDIAMMYFT

Query:  TGLNDRNLTIESGSRPPASLNEMFARARQYIDG--------------LDLWKANEAKRSSSGRDREHKSPPS---KKWRRERAEPEGS------------
        T L D  LT++ G   P +  E+  +A++ IDG              +D  K ++ KR +  + R+  S  S    ++RR  + P  S            
Subjt:  TGLNDRNLTIESGSRPPASLNEMFARARQYIDG--------------LDLWKANEAKRSSSGRDREHKSPPS---KKWRRERAEPEGS------------

Query:  -----------------------------------------------------------------------------TREEKRERSQPSRSKEDRPAVIN
                                                                                      ++E+R+RS+    +EDRPAVIN
Subjt:  -----------------------------------------------------------------------------TREEKRERSQPSRSKEDRPAVIN

Query:  TIHGGPSGGQSGQKRKPLARETAHEVCTLYPKEPVMPILFDEQDGEGVHMPHNDALVIAP-PDRSCEGESVSAEGCISLPVTINEGEHQVTKVTEFVIID
        TI GGP+GGQSG KRK LARE   EVC +   +P   I F + D EGVH+PHNDALVIA   D       +   GCI LPVTI +   QVT++ EFV+ID
Subjt:  TIHGGPSGGQSGQKRKPLARETAHEVCTLYPKEPVMPILFDEQDGEGVHMPHNDALVIAP-PDRSCEGESVSAEGCISLPVTINEGEHQVTKVTEFVIID

Query:  RTSAYNAILGRPLIHDLKAVPSTYHQVLKYPTPTEIATI
          SAYNAI GRP+IH  +AVPST HQVLKY TP E+  +
Subjt:  RTSAYNAILGRPLIHDLKAVPSTYHQVLKYPTPTEIATI

XP_022158844.1 uncharacterized protein LOC111025310 [Momordica charantia]8.5e-7550.13Show/hide
Query:  MDEHVKVVSCTDDIAMMYFTTGLNDRNLTIESGSRPPASLNEMFARARQYIDGLDLWKANEAKRSSSGRDREHKSPPSKK--------------------
        MDEHVKVVSCTDDIAMMYFTTGLNDRNLTIE  SRPPASLNEMFARARQYIDGL+LWKAN A+RSS GRDR+HKSPPSKK                    
Subjt:  MDEHVKVVSCTDDIAMMYFTTGLNDRNLTIESGSRPPASLNEMFARARQYIDGLDLWKANEAKRSSSGRDREHKSPPSKK--------------------

Query:  --------------------------------------------------------------------------------------------WRRERAEP
                                                                                                      RE+AE 
Subjt:  --------------------------------------------------------------------------------------------WRRERAEP

Query:  EGSTREEKRERSQPSRSKEDRPAVINTIHGGPSGGQSGQKRKPLARETAHEVCTLYPKEPVMPILFDEQDGEGVHMPHNDALVIAP--------------
        EGS REEKRERSQP R KEDRPAVINTIHGGPSG +SGQKRK LARE AHEVCT YPK PVMPILFDEQDGE VHMPHNDALVIAP              
Subjt:  EGSTREEKRERSQPSRSKEDRPAVINTIHGGPSGGQSGQKRKPLARETAHEVCTLYPKEPVMPILFDEQDGEGVHMPHNDALVIAP--------------

Query:  ------------------------PDRSCEG---ESVSAEGCISLPVTINEGEHQVTKVTEFVIIDRTSAY
                                   S  G   ESVS EGCISLPVTI+EGEHQVT+V EFV+IDR+SAY
Subjt:  ------------------------PDRSCEG---ESVSAEGCISLPVTINEGEHQVTKVTEFVIIDRTSAY

TrEMBL top hitse value%identityAlignment
A0A6J1CS66 uncharacterized protein LOC1110138251.7e-5738.42Show/hide
Query:  MSSYDGSGDPVSYVEMFEGKMDFLAASDVIKCRAFQIALAASVRLWQLLKLPPSHLGTVKQLDNESLTEYIARFMDEHVKVVSCTDDIAMMYFTTGLNDR
        M+ YDGS DP  YVE+ E  M+F AA DVIKC AFQIAL  S       +   +HL  +KQ    +L EY+ RF ++ +KV  C+DD AM YF TGL D 
Subjt:  MSSYDGSGDPVSYVEMFEGKMDFLAASDVIKCRAFQIALAASVRLWQLLKLPPSHLGTVKQLDNESLTEYIARFMDEHVKVVSCTDDIAMMYFTTGLNDR

Query:  NLTIESGSRPPASLNEMFARARQYIDGLDLW-------------KANEAKRSSSGRDREHKSPPSKKWRRERAEPEGSTREEKRERSQPSRSKEDRPAVI
         LT++     P + +E+  +A++ +   + +             K    ++  +G   + K       R E        ++E+R+RS+    ++DRPAVI
Subjt:  NLTIESGSRPPASLNEMFARARQYIDGLDLW-------------KANEAKRSSSGRDREHKSPPSKKWRRERAEPEGSTREEKRERSQPSRSKEDRPAVI

Query:  NTIHGGPSGGQSGQKRKPLARETAHEVCTLYPKEPVMPILFDEQDGEGVHMPHNDALVIAP---------------------------------------
        NTI GGPSGGQ G KRK LARE   EVC +  +     I FD+ D EGVH+PHNDALVIAP                                       
Subjt:  NTIHGGPSGGQSGQKRKPLARETAHEVCTLYPKEPVMPILFDEQDGEGVHMPHNDALVIAP---------------------------------------

Query:  --PDRSCEGESVSAEGCISLPVTINEGEHQVTKVTEFVIIDRTSAYNAILGRPLIHDLKAVPSTYHQVLKYPTPTEIATI
          P     GES    GCI L V I + + Q T++ EFV+I   SAY AI GRP+IH  + VPST +QVLKY TP  + T+
Subjt:  --PDRSCEGESVSAEGCISLPVTINEGEHQVTKVTEFVIIDRTSAYNAILGRPLIHDLKAVPSTYHQVLKYPTPTEIATI

A0A6J1D5T3 uncharacterized protein LOC1110175483.9e-5748.11Show/hide
Query:  MDFLAASDVIKCRAFQIALAASVRLW--------------------------QLLKLPPSHLGTVKQLDNESLTEYIARFMDEHVKVVSCTDDIAMMYFT
        MDFLAASD IKCRAFQIAL  SVRLW                          QLLKLPPSHL TVKQ DNESLTEYIAR MDEHVKVVSCTDDIAMMYFT
Subjt:  MDFLAASDVIKCRAFQIALAASVRLW--------------------------QLLKLPPSHLGTVKQLDNESLTEYIARFMDEHVKVVSCTDDIAMMYFT

Query:  TGLNDRNLTIESGSRPPASLNEMFARARQYIDGLDLWKANEAKRSSSGRDREHKSPPSKK-----------------------------W----------
        TGLNDRNLTIE GSRPPASLN+M ARARQYIDGL+LWKA  A+RSS G+DR+ +S P KK                             W          
Subjt:  TGLNDRNLTIESGSRPPASLNEMFARARQYIDGLDLWKANEAKRSSSGRDREHKSPPSKK-----------------------------W----------

Query:  -------------------------------------------------------------------------RRERAEPEGSTREEKRERSQPSRSKED
                                                                                  RERA+PEGSTREEKRERSQP   KED
Subjt:  -------------------------------------------------------------------------RRERAEPEGSTREEKRERSQPSRSKED

Query:  RPAVINTIHGGPSGGQSG
        RPAVINTIHGGPSG +SG
Subjt:  RPAVINTIHGGPSGGQSG

A0A6J1D9E1 uncharacterized protein LOC1110188237.3e-5634.82Show/hide
Query:  MSSYDGSGDPVSYVEMFEGKMDFLAASDVIKCRAFQIALAASVRLWQLLKLPPSHLGTVKQLDNESLTEYIARFMDEHVKVVSCTDDIAMMYFTTGLNDR
        + SYDGS DP  YVE+FEG MDF AASD IKCRAFQIAL  S RLW                           F ++ +KV   +DD AM YF TGL D 
Subjt:  MSSYDGSGDPVSYVEMFEGKMDFLAASDVIKCRAFQIALAASVRLWQLLKLPPSHLGTVKQLDNESLTEYIARFMDEHVKVVSCTDDIAMMYFTTGLNDR

Query:  NLTIESGSRPPASLNEMFARARQYIDGLDLWKANEAK------RSSSGRD--------------------------------------------------
         LT++ G   PA+  E+  +A++ IDG +L +    +      R  SG+D                                                  
Subjt:  NLTIESGSRPPASLNEMFARARQYIDGLDLWKANEAK------RSSSGRD--------------------------------------------------

Query:  -------------------------------REHKSPPSKKWRRER---------------AEPEGSTREEKRER--SQPSRSKEDRPAVINTIHGGPSG
                                       REH    S +W  +R                +P  S+ E+K ER  S+    + DRPAVINTI GGPSG
Subjt:  -------------------------------REHKSPPSKKWRRER---------------AEPEGSTREEKRER--SQPSRSKEDRPAVINTIHGGPSG

Query:  GQSGQKRKPLARETAHEVCTLYPKEPVMPILFDEQDGEGVHMPHNDALVIAP-----------------------------------------PDRSCEG
        GQSG KRK LAR    EVC +  + P  PI FD  D E VH+PHNDALVIAP                                         P      
Subjt:  GQSGQKRKPLARETAHEVCTLYPKEPVMPILFDEQDGEGVHMPHNDALVIAP-----------------------------------------PDRSCEG

Query:  ESVSAEGCISLPVTINEGEHQVTKVTEFVIIDRTSAYNAILGRPLIHDLKAVPSTYHQVLKYPTPTEIATI
        ESV  EGCI LPVT+   + QVT++ EFV+ID  SAYNAI GRP+IH  +A+PST HQVLKY TP  +  +
Subjt:  ESVSAEGCISLPVTINEGEHQVTKVTEFVIIDRTSAYNAILGRPLIHDLKAVPSTYHQVLKYPTPTEIATI

A0A6J1DZB9 uncharacterized protein LOC1110249047.3e-5635.76Show/hide
Query:  MDFLAASDVIKCRAFQIALAASVRLW-------------QLLK-------------LPPSHLGTVKQLDNESLTEYIARFMDEHVKVVSCTDDIAMMYFT
        MDF AA+D IKCRAFQIAL  S RLW             QL K                +HL T++Q + E+L EY+ RF +E +KV  C+DD AM YF 
Subjt:  MDFLAASDVIKCRAFQIALAASVRLW-------------QLLK-------------LPPSHLGTVKQLDNESLTEYIARFMDEHVKVVSCTDDIAMMYFT

Query:  TGLNDRNLTIESGSRPPASLNEMFARARQYIDG--------------LDLWKANEAKRSSSGRDREHKSPPS---KKWRRERAEPEGS------------
        T L D  LT++ G   P +  E+  +A++ IDG              +D  K ++ KR +  + R+  S  S    ++RR  + P  S            
Subjt:  TGLNDRNLTIESGSRPPASLNEMFARARQYIDG--------------LDLWKANEAKRSSSGRDREHKSPPS---KKWRRERAEPEGS------------

Query:  -----------------------------------------------------------------------------TREEKRERSQPSRSKEDRPAVIN
                                                                                      ++E+R+RS+    +EDRPAVIN
Subjt:  -----------------------------------------------------------------------------TREEKRERSQPSRSKEDRPAVIN

Query:  TIHGGPSGGQSGQKRKPLARETAHEVCTLYPKEPVMPILFDEQDGEGVHMPHNDALVIAP-PDRSCEGESVSAEGCISLPVTINEGEHQVTKVTEFVIID
        TI GGP+GGQSG KRK LARE   EVC +   +P   I F + D EGVH+PHNDALVIA   D       +   GCI LPVTI +   QVT++ EFV+ID
Subjt:  TIHGGPSGGQSGQKRKPLARETAHEVCTLYPKEPVMPILFDEQDGEGVHMPHNDALVIAP-PDRSCEGESVSAEGCISLPVTINEGEHQVTKVTEFVIID

Query:  RTSAYNAILGRPLIHDLKAVPSTYHQVLKYPTPTEIATI
          SAYNAI GRP+IH  +AVPST HQVLKY TP E+  +
Subjt:  RTSAYNAILGRPLIHDLKAVPSTYHQVLKYPTPTEIATI

A0A6J1E0L8 uncharacterized protein LOC1110253104.1e-7550.13Show/hide
Query:  MDEHVKVVSCTDDIAMMYFTTGLNDRNLTIESGSRPPASLNEMFARARQYIDGLDLWKANEAKRSSSGRDREHKSPPSKK--------------------
        MDEHVKVVSCTDDIAMMYFTTGLNDRNLTIE  SRPPASLNEMFARARQYIDGL+LWKAN A+RSS GRDR+HKSPPSKK                    
Subjt:  MDEHVKVVSCTDDIAMMYFTTGLNDRNLTIESGSRPPASLNEMFARARQYIDGLDLWKANEAKRSSSGRDREHKSPPSKK--------------------

Query:  --------------------------------------------------------------------------------------------WRRERAEP
                                                                                                      RE+AE 
Subjt:  --------------------------------------------------------------------------------------------WRRERAEP

Query:  EGSTREEKRERSQPSRSKEDRPAVINTIHGGPSGGQSGQKRKPLARETAHEVCTLYPKEPVMPILFDEQDGEGVHMPHNDALVIAP--------------
        EGS REEKRERSQP R KEDRPAVINTIHGGPSG +SGQKRK LARE AHEVCT YPK PVMPILFDEQDGE VHMPHNDALVIAP              
Subjt:  EGSTREEKRERSQPSRSKEDRPAVINTIHGGPSGGQSGQKRKPLARETAHEVCTLYPKEPVMPILFDEQDGEGVHMPHNDALVIAP--------------

Query:  ------------------------PDRSCEG---ESVSAEGCISLPVTINEGEHQVTKVTEFVIIDRTSAY
                                   S  G   ESVS EGCISLPVTI+EGEHQVT+V EFV+IDR+SAY
Subjt:  ------------------------PDRSCEG---ESVSAEGCISLPVTINEGEHQVTKVTEFVIIDRTSAY

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGCTCGTATGATGGGTCTGGGGATCCAGTCTCGTATGTAGAGATGTTCGAAGGAAAGATGGATTTTTTGGCCGCGAGCGACGTCATAAAATGCCGAGCATTTCAGAT
AGCACTGGCAGCCTCGGTGAGACTATGGCAATTATTGAAGTTACCGCCTTCTCACCTCGGAACAGTGAAGCAACTGGACAATGAGTCCCTTACGGAGTACATTGCTCGGT
TCATGGATGAGCATGTCAAGGTGGTGAGCTGCACCGACGACATCGCCATGATGTACTTCACGACAGGGTTGAATGACAGAAACTTGACAATCGAGTCCGGAAGCCGCCCA
CCGGCCTCTTTAAACGAAATGTTTGCCCGAGCTCGTCAGTATATTGATGGCCTGGATCTGTGGAAGGCCAATGAAGCCAAGCGAAGCAGCAGCGGTAGAGATCGAGAACA
CAAGTCTCCACCCTCCAAGAAGTGGCGCAGAGAACGAGCTGAGCCAGAGGGATCAACTCGAGAAGAGAAGCGAGAAAGGTCGCAGCCGTCCAGAAGCAAAGAAGATCGCC
CTGCAGTTATAAATACCATTCACGGAGGTCCGAGCGGGGGGCAGTCAGGGCAGAAGAGAAAACCTCTAGCCCGGGAAACAGCGCATGAGGTATGTACCTTGTACCCCAAG
GAGCCCGTGATGCCGATCTTATTTGACGAGCAGGACGGTGAAGGGGTGCATATGCCCCATAACGACGCTTTGGTAATCGCCCCCCCTGATAGATCATGTGAAGGGGAGTC
AGTTAGTGCGGAAGGATGCATCTCGCTCCCTGTGACCATTAACGAAGGAGAGCATCAAGTAACCAAAGTGACTGAGTTCGTCATAATAGATCGAACCTCGGCATACAACG
CCATTCTTGGCCGACCTCTTATTCACGACCTTAAGGCGGTCCCCTCTACTTATCATCAGGTTTTGAAGTATCCTACTCCGACTGAAATTGCAACGATCTGA
mRNA sequenceShow/hide mRNA sequence
ATGAGCTCGTATGATGGGTCTGGGGATCCAGTCTCGTATGTAGAGATGTTCGAAGGAAAGATGGATTTTTTGGCCGCGAGCGACGTCATAAAATGCCGAGCATTTCAGAT
AGCACTGGCAGCCTCGGTGAGACTATGGCAATTATTGAAGTTACCGCCTTCTCACCTCGGAACAGTGAAGCAACTGGACAATGAGTCCCTTACGGAGTACATTGCTCGGT
TCATGGATGAGCATGTCAAGGTGGTGAGCTGCACCGACGACATCGCCATGATGTACTTCACGACAGGGTTGAATGACAGAAACTTGACAATCGAGTCCGGAAGCCGCCCA
CCGGCCTCTTTAAACGAAATGTTTGCCCGAGCTCGTCAGTATATTGATGGCCTGGATCTGTGGAAGGCCAATGAAGCCAAGCGAAGCAGCAGCGGTAGAGATCGAGAACA
CAAGTCTCCACCCTCCAAGAAGTGGCGCAGAGAACGAGCTGAGCCAGAGGGATCAACTCGAGAAGAGAAGCGAGAAAGGTCGCAGCCGTCCAGAAGCAAAGAAGATCGCC
CTGCAGTTATAAATACCATTCACGGAGGTCCGAGCGGGGGGCAGTCAGGGCAGAAGAGAAAACCTCTAGCCCGGGAAACAGCGCATGAGGTATGTACCTTGTACCCCAAG
GAGCCCGTGATGCCGATCTTATTTGACGAGCAGGACGGTGAAGGGGTGCATATGCCCCATAACGACGCTTTGGTAATCGCCCCCCCTGATAGATCATGTGAAGGGGAGTC
AGTTAGTGCGGAAGGATGCATCTCGCTCCCTGTGACCATTAACGAAGGAGAGCATCAAGTAACCAAAGTGACTGAGTTCGTCATAATAGATCGAACCTCGGCATACAACG
CCATTCTTGGCCGACCTCTTATTCACGACCTTAAGGCGGTCCCCTCTACTTATCATCAGGTTTTGAAGTATCCTACTCCGACTGAAATTGCAACGATCTGA
Protein sequenceShow/hide protein sequence
MSSYDGSGDPVSYVEMFEGKMDFLAASDVIKCRAFQIALAASVRLWQLLKLPPSHLGTVKQLDNESLTEYIARFMDEHVKVVSCTDDIAMMYFTTGLNDRNLTIESGSRP
PASLNEMFARARQYIDGLDLWKANEAKRSSSGRDREHKSPPSKKWRRERAEPEGSTREEKRERSQPSRSKEDRPAVINTIHGGPSGGQSGQKRKPLARETAHEVCTLYPK
EPVMPILFDEQDGEGVHMPHNDALVIAPPDRSCEGESVSAEGCISLPVTINEGEHQVTKVTEFVIIDRTSAYNAILGRPLIHDLKAVPSTYHQVLKYPTPTEIATI