; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MC08g0949 (gene) of Bitter gourd (Dali-11) v1 genome

Gene IDMC08g0949
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
DescriptionStress response NST1-like protein
Genome locationMC08:7623226..7627484
RNA-Seq ExpressionMC08g0949
SyntenyMC08g0949
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004152172.1 uncharacterized protein LOC101207869 [Cucumis sativus]4.29e-10974.31Show/hide
Query:  MFGARVSWGRFSKRFKPFQTRSFCSKSHTPTNN-------NGNNKVESDLSSYGEAYKQLDNLDFMTASKILFTDPPKKKKFGIDFHLVQLFFVCMPSLA
        MF AR SW  FSKR KP +TRSFCSKSH  TN        NG+NKVE DLSSY EAYKQLDNLD MTASKILFT P KKKKFG+DFHLVQLFFVCMPSLA
Subjt:  MFGARVSWGRFSKRFKPFQTRSFCSKSHTPTNN-------NGNNKVESDLSSYGEAYKQLDNLDFMTASKILFTDPPKKKKFGIDFHLVQLFFVCMPSLA

Query:  VYLVAQYARYEMRKMEADLELKRKKEEEEKAKQMELEETEEIQEKNPELQEVKIRLDKLEETIKEIAVESRKPSGSGT-TKNSEKDREGGKVKHGENNMG
        VYLVAQYARYEMRKMEADLELK+KKEEEEKAKQ+ELEETE+I E NPELQEVK RLDKLE TIKEIAVESRK SG+G  TKNSEK  +  K KHG N   
Subjt:  VYLVAQYARYEMRKMEADLELKRKKEEEEKAKQMELEETEEIQEKNPELQEVKIRLDKLEETIKEIAVESRKPSGSGT-TKNSEKDREGGKVKHGENNMG

Query:  NASESSKSVEDHLGRQKIELAPVLPKGRESESTSQENGQHPNDGGGSSPDAKR
           +S+KS++DHLG QKI  APVLPKGR SEST++++ +H N GGGSSPDA+R
Subjt:  NASESSKSVEDHLGRQKIELAPVLPKGRESESTSQENGQHPNDGGGSSPDAKR

XP_008454169.1 PREDICTED: uncharacterized protein LOC103494654 [Cucumis melo]1.48e-10271.94Show/hide
Query:  MFGARVSWGRFSKRFKPFQTRSFCSKSHTPTNN-------NGNNKVESDLSSYGEAYKQLDNLDFMTASKILFTDPPKKKKFGIDFHLVQLFFVCMPSLA
        MF AR SW  FSKR KP +TRSFCSK H  TN        NG+NKV+ DLSSY EAYKQLDNLDFMTASKILFT P KKKKFG+DFHLVQLFFVCMPSLA
Subjt:  MFGARVSWGRFSKRFKPFQTRSFCSKSHTPTNN-------NGNNKVESDLSSYGEAYKQLDNLDFMTASKILFTDPPKKKKFGIDFHLVQLFFVCMPSLA

Query:  VYLVAQYARYEMRKMEADLELKRKKEEEEKAKQMELEETEEIQEKNPELQEVKIRLDKLEETIKEIAVESRKPSGSGT-TKNSEKDREGGKVKHGENNMG
        VYLVAQYARYEMRKMEADLELK+KKEEEEKAKQ+ELEE E+I E NPELQEVK RLDKLE+TIKEIAVESRK SG+G  TKNSEK  +  K KHG N   
Subjt:  VYLVAQYARYEMRKMEADLELKRKKEEEEKAKQMELEETEEIQEKNPELQEVKIRLDKLEETIKEIAVESRKPSGSGT-TKNSEKDREGGKVKHGENNMG

Query:  NASESSKSVEDHLGRQKIELAPVLPKGRESESTSQENGQHPNDGGGSSPDAKR
           + +KS++DHLG QKI  APVLPK   SEST++E+ +H N G GSS D KR
Subjt:  NASESSKSVEDHLGRQKIELAPVLPKGRESESTSQENGQHPNDGGGSSPDAKR

XP_022153425.1 uncharacterized protein LOC111020934 [Momordica charantia]3.19e-16199.59Show/hide
Query:  MFGARVSWGRFSKRFKPFQTRSFCSKSHTPTNNNGNNKVESDLSSYGEAYKQLDNLDFMTASKILFTDPPKKKKFGIDFHLVQLFFVCMPSLAVYLVAQY
        MFGARVSWGRFSKRFKPFQTRSFCSKSHTPTNNNGNNKVESDLSSYGEAYKQLDNLDFMTASKILFTDPPKKKKFGIDFHLVQLFFVCMPSLAVYLVAQY
Subjt:  MFGARVSWGRFSKRFKPFQTRSFCSKSHTPTNNNGNNKVESDLSSYGEAYKQLDNLDFMTASKILFTDPPKKKKFGIDFHLVQLFFVCMPSLAVYLVAQY

Query:  ARYEMRKMEADLELKRKKEEEEKAKQMELEETEEIQEKNPELQEVKIRLDKLEETIKEIAVESRKPSGSGTTKNSEKDREGGKVKHGENNMGNASESSKS
        ARYEMRKMEADLELKRKKEEEEKAKQMELEETEEIQEKNPELQEVKIRLDKLEETIKEIAVESRKPSGSGT KNSEKDREGGKVKHGENNMGNASESSKS
Subjt:  ARYEMRKMEADLELKRKKEEEEKAKQMELEETEEIQEKNPELQEVKIRLDKLEETIKEIAVESRKPSGSGTTKNSEKDREGGKVKHGENNMGNASESSKS

Query:  VEDHLGRQKIELAPVLPKGRESESTSQENGQHPNDGGGSSPDAKR
        VEDHLGRQKIELAPVLPKGRESESTSQENGQHPNDGGGSSPDAKR
Subjt:  VEDHLGRQKIELAPVLPKGRESESTSQENGQHPNDGGGSSPDAKR

XP_022956225.1 uncharacterized protein LOC111457985 [Cucurbita moschata]1.27e-10073.91Show/hide
Query:  MFGARVSWGRFSKRFKPFQTRSFCSKSHTPTNNN-------GNNKVESDLSSYGEAYKQLDNLDFMTASKILFTDPPKKKKFGIDFHLVQLFFVCMPSLA
        MF AR S  RFSKR KPFQT  FCSKS   TN N       G+NKVESDLSSY EAYKQLDNLDFMTASKILFTDPPKKKKFGIDFHLVQLFFVCMPSLA
Subjt:  MFGARVSWGRFSKRFKPFQTRSFCSKSHTPTNNN-------GNNKVESDLSSYGEAYKQLDNLDFMTASKILFTDPPKKKKFGIDFHLVQLFFVCMPSLA

Query:  VYLVAQYARYEMRKMEADLELKRKKEEEEKAKQMELEETEEIQEKNPELQEVKIRLDKLEETIKEIAVESRKPSGSG-TTKNSEKDREGGKVKHGENNMG
        VYLVAQYARYEMRKMEADLELK+KKEEE  AKQ++LEE EEI +KN ELQEVK RLDKLEETIKEIAVESRK SGSG  TKNSEK +   K KHG N   
Subjt:  VYLVAQYARYEMRKMEADLELKRKKEEEEKAKQMELEETEEIQEKNPELQEVKIRLDKLEETIKEIAVESRKPSGSG-TTKNSEKDREGGKVKHGENNMG

Query:  NASESSKSVEDHLGRQKIELAPVLPKGRESESTSQENGQHPNDGGGSSPDAKR
           + SKS++DHLG QKI  APVLPK R S  T+ E+ +H N GG SSPD+KR
Subjt:  NASESSKSVEDHLGRQKIELAPVLPKGRESESTSQENGQHPNDGGGSSPDAKR

XP_038901255.1 uncharacterized protein LOC120088201 [Benincasa hispida]6.37e-10875.9Show/hide
Query:  MFGARVSWGRFSKRFKPFQTRSFCSKSHTPTNNN----GNNKVESDLSSYGEAYKQLDNLDFMTASKILFTDPPKKKKFGIDFHLVQLFFVCMPSLAVYL
        M  AR SW RFSKR KPF+T SFCSKSH   N N    G+NKVESDLSSY EAYKQLDNLDFMTASKILFT P  KKKFGIDFHLVQLFF CMPSLAVYL
Subjt:  MFGARVSWGRFSKRFKPFQTRSFCSKSHTPTNNN----GNNKVESDLSSYGEAYKQLDNLDFMTASKILFTDPPKKKKFGIDFHLVQLFFVCMPSLAVYL

Query:  VAQYARYEMRKMEADLELKRKKEEEEKAKQMELEETEEIQEKNPELQEVKIRLDKLEETIKEIAVESRKPSGSGT-TKNSEKDREGGKVKHGENNMGNAS
        VAQYARYEMRKMEADLELK+KKEEEEKAKQ+ELEETEEI EKN ELQEVKIRLDKLEETIKEIAVE RK SG+G  TKNSEK ++  K KHG N      
Subjt:  VAQYARYEMRKMEADLELKRKKEEEEKAKQMELEETEEIQEKNPELQEVKIRLDKLEETIKEIAVESRKPSGSGT-TKNSEKDREGGKVKHGENNMGNAS

Query:  ESSKSVEDHLGRQKIELAPVLPKGRESESTSQENGQHPNDGGGSSPDAK
        + SKS++D LG QKI  APVLPKGR SEST++E+G+H N  GGSSP AK
Subjt:  ESSKSVEDHLGRQKIELAPVLPKGRESESTSQENGQHPNDGGGSSPDAK

TrEMBL top hitse value%identityAlignment
A0A0A0KTR7 Uncharacterized protein2.08e-10974.31Show/hide
Query:  MFGARVSWGRFSKRFKPFQTRSFCSKSHTPTNN-------NGNNKVESDLSSYGEAYKQLDNLDFMTASKILFTDPPKKKKFGIDFHLVQLFFVCMPSLA
        MF AR SW  FSKR KP +TRSFCSKSH  TN        NG+NKVE DLSSY EAYKQLDNLD MTASKILFT P KKKKFG+DFHLVQLFFVCMPSLA
Subjt:  MFGARVSWGRFSKRFKPFQTRSFCSKSHTPTNN-------NGNNKVESDLSSYGEAYKQLDNLDFMTASKILFTDPPKKKKFGIDFHLVQLFFVCMPSLA

Query:  VYLVAQYARYEMRKMEADLELKRKKEEEEKAKQMELEETEEIQEKNPELQEVKIRLDKLEETIKEIAVESRKPSGSGT-TKNSEKDREGGKVKHGENNMG
        VYLVAQYARYEMRKMEADLELK+KKEEEEKAKQ+ELEETE+I E NPELQEVK RLDKLE TIKEIAVESRK SG+G  TKNSEK  +  K KHG N   
Subjt:  VYLVAQYARYEMRKMEADLELKRKKEEEEKAKQMELEETEEIQEKNPELQEVKIRLDKLEETIKEIAVESRKPSGSGT-TKNSEKDREGGKVKHGENNMG

Query:  NASESSKSVEDHLGRQKIELAPVLPKGRESESTSQENGQHPNDGGGSSPDAKR
           +S+KS++DHLG QKI  APVLPKGR SEST++++ +H N GGGSSPDA+R
Subjt:  NASESSKSVEDHLGRQKIELAPVLPKGRESESTSQENGQHPNDGGGSSPDAKR

A0A1S3BZ75 uncharacterized protein LOC1034946547.18e-10371.94Show/hide
Query:  MFGARVSWGRFSKRFKPFQTRSFCSKSHTPTNN-------NGNNKVESDLSSYGEAYKQLDNLDFMTASKILFTDPPKKKKFGIDFHLVQLFFVCMPSLA
        MF AR SW  FSKR KP +TRSFCSK H  TN        NG+NKV+ DLSSY EAYKQLDNLDFMTASKILFT P KKKKFG+DFHLVQLFFVCMPSLA
Subjt:  MFGARVSWGRFSKRFKPFQTRSFCSKSHTPTNN-------NGNNKVESDLSSYGEAYKQLDNLDFMTASKILFTDPPKKKKFGIDFHLVQLFFVCMPSLA

Query:  VYLVAQYARYEMRKMEADLELKRKKEEEEKAKQMELEETEEIQEKNPELQEVKIRLDKLEETIKEIAVESRKPSGSGT-TKNSEKDREGGKVKHGENNMG
        VYLVAQYARYEMRKMEADLELK+KKEEEEKAKQ+ELEE E+I E NPELQEVK RLDKLE+TIKEIAVESRK SG+G  TKNSEK  +  K KHG N   
Subjt:  VYLVAQYARYEMRKMEADLELKRKKEEEEKAKQMELEETEEIQEKNPELQEVKIRLDKLEETIKEIAVESRKPSGSGT-TKNSEKDREGGKVKHGENNMG

Query:  NASESSKSVEDHLGRQKIELAPVLPKGRESESTSQENGQHPNDGGGSSPDAKR
           + +KS++DHLG QKI  APVLPK   SEST++E+ +H N G GSS D KR
Subjt:  NASESSKSVEDHLGRQKIELAPVLPKGRESESTSQENGQHPNDGGGSSPDAKR

A0A6J1DGT5 uncharacterized protein LOC1110209341.54e-16199.59Show/hide
Query:  MFGARVSWGRFSKRFKPFQTRSFCSKSHTPTNNNGNNKVESDLSSYGEAYKQLDNLDFMTASKILFTDPPKKKKFGIDFHLVQLFFVCMPSLAVYLVAQY
        MFGARVSWGRFSKRFKPFQTRSFCSKSHTPTNNNGNNKVESDLSSYGEAYKQLDNLDFMTASKILFTDPPKKKKFGIDFHLVQLFFVCMPSLAVYLVAQY
Subjt:  MFGARVSWGRFSKRFKPFQTRSFCSKSHTPTNNNGNNKVESDLSSYGEAYKQLDNLDFMTASKILFTDPPKKKKFGIDFHLVQLFFVCMPSLAVYLVAQY

Query:  ARYEMRKMEADLELKRKKEEEEKAKQMELEETEEIQEKNPELQEVKIRLDKLEETIKEIAVESRKPSGSGTTKNSEKDREGGKVKHGENNMGNASESSKS
        ARYEMRKMEADLELKRKKEEEEKAKQMELEETEEIQEKNPELQEVKIRLDKLEETIKEIAVESRKPSGSGT KNSEKDREGGKVKHGENNMGNASESSKS
Subjt:  ARYEMRKMEADLELKRKKEEEEKAKQMELEETEEIQEKNPELQEVKIRLDKLEETIKEIAVESRKPSGSGTTKNSEKDREGGKVKHGENNMGNASESSKS

Query:  VEDHLGRQKIELAPVLPKGRESESTSQENGQHPNDGGGSSPDAKR
        VEDHLGRQKIELAPVLPKGRESESTSQENGQHPNDGGGSSPDAKR
Subjt:  VEDHLGRQKIELAPVLPKGRESESTSQENGQHPNDGGGSSPDAKR

A0A6J1GVZ5 uncharacterized protein LOC1114579856.13e-10173.91Show/hide
Query:  MFGARVSWGRFSKRFKPFQTRSFCSKSHTPTNNN-------GNNKVESDLSSYGEAYKQLDNLDFMTASKILFTDPPKKKKFGIDFHLVQLFFVCMPSLA
        MF AR S  RFSKR KPFQT  FCSKS   TN N       G+NKVESDLSSY EAYKQLDNLDFMTASKILFTDPPKKKKFGIDFHLVQLFFVCMPSLA
Subjt:  MFGARVSWGRFSKRFKPFQTRSFCSKSHTPTNNN-------GNNKVESDLSSYGEAYKQLDNLDFMTASKILFTDPPKKKKFGIDFHLVQLFFVCMPSLA

Query:  VYLVAQYARYEMRKMEADLELKRKKEEEEKAKQMELEETEEIQEKNPELQEVKIRLDKLEETIKEIAVESRKPSGSG-TTKNSEKDREGGKVKHGENNMG
        VYLVAQYARYEMRKMEADLELK+KKEEE  AKQ++LEE EEI +KN ELQEVK RLDKLEETIKEIAVESRK SGSG  TKNSEK +   K KHG N   
Subjt:  VYLVAQYARYEMRKMEADLELKRKKEEEEKAKQMELEETEEIQEKNPELQEVKIRLDKLEETIKEIAVESRKPSGSG-TTKNSEKDREGGKVKHGENNMG

Query:  NASESSKSVEDHLGRQKIELAPVLPKGRESESTSQENGQHPNDGGGSSPDAKR
           + SKS++DHLG QKI  APVLPK R S  T+ E+ +H N GG SSPD+KR
Subjt:  NASESSKSVEDHLGRQKIELAPVLPKGRESESTSQENGQHPNDGGGSSPDAKR

A0A6J1IQM5 uncharacterized protein LOC1114796312.88e-9972.73Show/hide
Query:  MFGARVSWGRFSKRFKPFQTRSFCSKSHTPTNNNGNN-------KVESDLSSYGEAYKQLDNLDFMTASKILFTDPPKKKKFGIDFHLVQLFFVCMPSLA
        MF AR S  RFSKR KPFQT  FCSKS   TN N NN       KVESDLSSY EAYKQLDNLDFMTA KILFT+PPKKKKFGIDFHLVQLFFVCMPSLA
Subjt:  MFGARVSWGRFSKRFKPFQTRSFCSKSHTPTNNNGNN-------KVESDLSSYGEAYKQLDNLDFMTASKILFTDPPKKKKFGIDFHLVQLFFVCMPSLA

Query:  VYLVAQYARYEMRKMEADLELKRKKEEEEKAKQMELEETEEIQEKNPELQEVKIRLDKLEETIKEIAVESRKPSGSG-TTKNSEKDREGGKVKHGENNMG
        VYLVAQYARYEMRKMEADLELK+KKEEE  AKQ++L+E EEI +KN ELQEVK RLDKLEETIKEIAVESRK SGSG  TKNSEK +   K KHG N   
Subjt:  VYLVAQYARYEMRKMEADLELKRKKEEEEKAKQMELEETEEIQEKNPELQEVKIRLDKLEETIKEIAVESRKPSGSG-TTKNSEKDREGGKVKHGENNMG

Query:  NASESSKSVEDHLGRQKIELAPVLPKGRESESTSQENGQHPNDGGGSSPDAKR
           + SKS++DHLG QKI  APVLPK R S  T+ E+ +H N GG SSPD+KR
Subjt:  NASESSKSVEDHLGRQKIELAPVLPKGRESESTSQENGQHPNDGGGSSPDAKR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G80700.1 unknown protein7.4e-3748.91Show/hide
Query:  MFGARVSWGRFSKRFKPFQTRSFCSKSH-----TPTNNNGNNKVESDLSSYGEAYKQLDNLDFMTASKILFTDPPKKKKFGIDFHLVQLFFVCMPSLAVY
        M   R SW   S R K ++TR FC+K       T ++   ++  ES +S Y E YK+LD LDF+TA+KILFT+PPKK KFG D+H+VQ   VC+PS+AVY
Subjt:  MFGARVSWGRFSKRFKPFQTRSFCSKSH-----TPTNNNGNNKVESDLSSYGEAYKQLDNLDFMTASKILFTDPPKKKKFGIDFHLVQLFFVCMPSLAVY

Query:  LVAQYARYEMRKMEADLELKRKKEEEEKAK---QMELEETEEIQEKNPELQEVKIRLDKLEETIKEIAVESRKPSGSGTTKNSE
        LVAQYAR +M+ M+A+L  K++KEEE+K K   + +  + E   + + EL E++ RL K+EETIKEI +E++KPSG+  TK  E
Subjt:  LVAQYARYEMRKMEADLELKRKKEEEEKAK---QMELEETEEIQEKNPELQEVKIRLDKLEETIKEIAVESRKPSGSGTTKNSE

AT1G80980.1 unknown protein4.3e-3748.91Show/hide
Query:  MFGARVSWGRFSKRFKPFQTRSFCSKSH-----TPTNNNGNNKVESDLSSYGEAYKQLDNLDFMTASKILFTDPPKKKKFGIDFHLVQLFFVCMPSLAVY
        M   R SW   S R K ++TR FC+K       T ++   +++ ES +S Y E YK+LD LDF+TA+KILFT+PPKK KFG D+H+VQ   VC+PS+AVY
Subjt:  MFGARVSWGRFSKRFKPFQTRSFCSKSH-----TPTNNNGNNKVESDLSSYGEAYKQLDNLDFMTASKILFTDPPKKKKFGIDFHLVQLFFVCMPSLAVY

Query:  LVAQYARYEMRKMEADLELKRKKEEEEKAK---QMELEETEEIQEKNPELQEVKIRLDKLEETIKEIAVESRKPSGSGTTKNSE
        LVAQYAR +M+ M+A+L  K++KEEE+K K   + +  + E   + + EL E++ RL K+EETIKEI +E++KPSG+  TK  E
Subjt:  LVAQYARYEMRKMEADLELKRKKEEEEKAK---QMELEETEEIQEKNPELQEVKIRLDKLEETIKEIAVESRKPSGSGTTKNSE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTTGGCGCCAGAGTCAGTTGGGGTCGATTTTCAAAGCGATTCAAGCCTTTCCAAACCAGATCATTCTGCTCCAAATCCCACACTCCCACCAATAACAATGGCAACAA
CAAGGTTGAGTCGGATCTGAGCAGCTACGGTGAGGCTTATAAGCAGCTGGATAACCTGGACTTCATGACCGCCTCCAAGATCCTCTTCACTGATCCTCCCAAGAAGAAGA
AATTTGGGATTGATTTCCATCTGGTGCAACTCTTCTTTGTTTGCATGCCTTCTTTGGCTGTTTATTTGGTGGCCCAATATGCTCGTTATGAAATGAGGAAAATGGAAGCG
GACCTGGAGCTGAAAAGGAAGAAAGAAGAAGAAGAGAAAGCTAAACAAATGGAGTTAGAAGAGACCGAAGAAATTCAGGAAAAGAATCCAGAGCTACAGGAGGTAAAAAT
AAGACTGGATAAACTTGAGGAGACCATAAAGGAAATTGCTGTTGAATCCAGAAAACCATCAGGAAGTGGTACGACAAAGAACTCTGAAAAAGATCGAGAAGGTGGTAAAG
TCAAACATGGGGAAAACAACATGGGGAATGCGTCAGAATCAAGTAAATCTGTGGAAGACCATCTTGGCAGACAAAAAATAGAACTTGCTCCAGTTTTGCCCAAAGGGCGC
GAGAGCGAGTCTACATCACAAGAAAATGGTCAGCATCCAAACGACGGTGGAGGATCCTCTCCAGATGCCAAGAGATGA
mRNA sequenceShow/hide mRNA sequence
AAGGAGGGTGGGCCACAGCCCAAACCTTGGCACTTTAATGGGCCTGGCAAATGAAGAGTAGTCAGGCCCATCAAAATCAGCGAGCCACAGCCCAAACCTTGGTAGTTTAA
TGGGCCTGGCAAATGAAGAGTAGTCAGGCCCATCAAAATAAGCGACCTGTATATATTCATTGCACTTCCCCCAAATTCCATCGCGTTCAAGAACAATGTTTGGCGCCAGA
GTCAGTTGGGGTCGATTTTCAAAGCGATTCAAGCCTTTCCAAACCAGATCATTCTGCTCCAAATCCCACACTCCCACCAATAACAATGGCAACAACAAGGTTGAGTCGGA
TCTGAGCAGCTACGGTGAGGCTTATAAGCAGCTGGATAACCTGGACTTCATGACCGCCTCCAAGATCCTCTTCACTGATCCTCCCAAGAAGAAGAAATTTGGGATTGATT
TCCATCTGGTGCAACTCTTCTTTGTTTGCATGCCTTCTTTGGCTGTTTATTTGGTGGCCCAATATGCTCGTTATGAAATGAGGAAAATGGAAGCGGACCTGGAGCTGAAA
AGGAAGAAAGAAGAAGAAGAGAAAGCTAAACAAATGGAGTTAGAAGAGACCGAAGAAATTCAGGAAAAGAATCCAGAGCTACAGGAGGTAAAAATAAGACTGGATAAACT
TGAGGAGACCATAAAGGAAATTGCTGTTGAATCCAGAAAACCATCAGGAAGTGGTACGACAAAGAACTCTGAAAAAGATCGAGAAGGTGGTAAAGTCAAACATGGGGAAA
ACAACATGGGGAATGCGTCAGAATCAAGTAAATCTGTGGAAGACCATCTTGGCAGACAAAAAATAGAACTTGCTCCAGTTTTGCCCAAAGGGCGCGAGAGCGAGTCTACA
TCACAAGAAAATGGTCAGCATCCAAACGACGGTGGAGGATCCTCTCCAGATGCCAAGAGATGAAGAAAATGCCTCTCTTAACATGACCGAAATATCCTGCACATGGTACT
TTTCCTTGATGAAGCTAGAGAAATGGGAGAATGTTTGCCAAGAATCCAACTGGTTTTAGGTTTTGTTTAATAATATATTGCTGGAGGCAGTAGCCTACATCATAAACAAT
GTAAATGGTTAATAGGTAAAAGGTTCATGCCCGTTTTGTTGTGACAATTACCTCATGATGAACAATTGATTTCTGTTTGTGTTATGTAAATGTACCCNCCCCCCCACCAG
AGTTATACTTGTAGTACTGAATTTTGATGAAGTCATTGGCAGTGGAATGGGACTGAAATAGATTTCCAGTCATTGTCATCAAAGTGGGCAGCTAAGTGGTTGTCTTCCTG
CTTATCAATTGCAGACCCAAAGATGAATCACTTATGGGTTTTGTTGAG
Protein sequenceShow/hide protein sequence
MFGARVSWGRFSKRFKPFQTRSFCSKSHTPTNNNGNNKVESDLSSYGEAYKQLDNLDFMTASKILFTDPPKKKKFGIDFHLVQLFFVCMPSLAVYLVAQYARYEMRKMEA
DLELKRKKEEEEKAKQMELEETEEIQEKNPELQEVKIRLDKLEETIKEIAVESRKPSGSGTTKNSEKDREGGKVKHGENNMGNASESSKSVEDHLGRQKIELAPVLPKGR
ESESTSQENGQHPNDGGGSSPDAKR