CuGenDBv2

MC08g1667 (gene) of Bitter gourd (Dali-11) v1 genome

Gene ID: MC08g1667
Organism: Momordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
Description: BEST Arabidopsis thaliana protein match is: 3'-5' exonuclease domain-containing protein / K homology domain-containing protein / KH domain-containing protein.
Genome location: MC08:24645844..24649733
RNA-Seq Expression: MC08g1667
Synteny: MC08g1667
Gene Ontology terms: NA
InterPro domains: NA


Homology
GenBank top hits (e value, % identity, alignment)
KAG6581290.1 hypothetical protein SDJN03_21292, partial [Cucurbita argyrosperma subsp. sororia] (e value: 2.80e-158; % identity: 87.82)
Query:  MGVESNSAAPPPP----SSSSTPSPSAKRARDPEDEVYLDNFHSHKRYLSEIMASSLNGLTVGDPLPENLMDSPARSESMLYLRDEMSWQYSPMSEDSDD
        MGVESNS  PPPP    SSSSTPSPS KRARDP+DEVYLDNFHSHKRYLSEIMASSLNGLTVGDPL ENLMDSPARSESMLYLRDEMS QYSPMSEDSDD
Subjt:  MGVESNSAAPPPP----SSSSTPSPSAKRARDPEDEVYLDNFHSHKRYLSEIMASSLNGLTVGDPLPENLMDSPARSESMLYLRDEMSWQYSPMSEDSDD

Query:  CRFCETSTNLFPLQSDS-VPTSPVSPYRYQRPFSTVTPSTSTNNNSLGCSTSPVPGLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQPPG
        CRFCETSTNLFP QSDS VPTSPVSPYRYQRPFS +TPST TN  SLGC+T PV  LQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQP G
Subjt:  CRFCETSTNLFPLQSDS-VPTSPVSPYRYQRPFSTVTPSTSTNNNSLGCSTSPVPGLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQPPG

Query:  PSSMELPYCSMTEPGPNIEAEERPCAFTKSLVNERVYQLEECSTMRVSEPEYNQQKPCKDLNRNMKDSESG
        PSSMELPYCSM EPGPNIEAEER C+F KSLV+ERVYQL ECS+M VSEPEYN+QK CKDLNR MKDSESG
Subjt:  PSSMELPYCSMTEPGPNIEAEERPCAFTKSLVNERVYQLEECSTMRVSEPEYNQQKPCKDLNRNMKDSESG

XP_022155254.1 uncharacterized protein LOC111022394 isoform X1 [Momordica charantia] (e value: 7.10e-190; % identity: 100)
Query:  MGVESNSAAPPPPSSSSTPSPSAKRARDPEDEVYLDNFHSHKRYLSEIMASSLNGLTVGDPLPENLMDSPARSESMLYLRDEMSWQYSPMSEDSDDCRFC
        MGVESNSAAPPPPSSSSTPSPSAKRARDPEDEVYLDNFHSHKRYLSEIMASSLNGLTVGDPLPENLMDSPARSESMLYLRDEMSWQYSPMSEDSDDCRFC
Subjt:  MGVESNSAAPPPPSSSSTPSPSAKRARDPEDEVYLDNFHSHKRYLSEIMASSLNGLTVGDPLPENLMDSPARSESMLYLRDEMSWQYSPMSEDSDDCRFC

Query:  ETSTNLFPLQSDSVPTSPVSPYRYQRPFSTVTPSTSTNNNSLGCSTSPVPGLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQPPGPSSME
        ETSTNLFPLQSDSVPTSPVSPYRYQRPFSTVTPSTSTNNNSLGCSTSPVPGLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQPPGPSSME
Subjt:  ETSTNLFPLQSDSVPTSPVSPYRYQRPFSTVTPSTSTNNNSLGCSTSPVPGLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQPPGPSSME

Query:  LPYCSMTEPGPNIEAEERPCAFTKSLVNERVYQLEECSTMRVSEPEYNQQKPCKDLNRNMKDSESGES
        LPYCSMTEPGPNIEAEERPCAFTKSLVNERVYQLEECSTMRVSEPEYNQQKPCKDLNRNMKDSESGES
Subjt:  LPYCSMTEPGPNIEAEERPCAFTKSLVNERVYQLEECSTMRVSEPEYNQQKPCKDLNRNMKDSESGES

XP_022934215.1 uncharacterized protein LOC111441454 isoform X1 [Cucurbita moschata] (e value: 1.90e-158; % identity: 88.15)
Query:  MGVESNSAAPPPP---SSSSTPSPSAKRARDPEDEVYLDNFHSHKRYLSEIMASSLNGLTVGDPLPENLMDSPARSESMLYLRDEMSWQYSPMSEDSDDC
        MGVESNS  PPPP   SSSSTPSPS KRARDP+DEVYLDNFHSHKRYLSEIMASSLNGLTVGDPL ENLMDSPARSESMLYLRDEMS QYSPMSEDSDDC
Subjt:  MGVESNSAAPPPP---SSSSTPSPSAKRARDPEDEVYLDNFHSHKRYLSEIMASSLNGLTVGDPLPENLMDSPARSESMLYLRDEMSWQYSPMSEDSDDC

Query:  RFCETSTNLFPLQSDS-VPTSPVSPYRYQRPFSTVTPSTSTNNNSLGCSTSPVPGLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQPPGP
        RFCETSTNLFP QSDS VPTSPVSPYRYQRPFS +TPST TN  SLGC+T PV  LQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQP GP
Subjt:  RFCETSTNLFPLQSDS-VPTSPVSPYRYQRPFSTVTPSTSTNNNSLGCSTSPVPGLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQPPGP

Query:  SSMELPYCSMTEPGPNIEAEERPCAFTKSLVNERVYQLEECSTMRVSEPEYNQQKPCKDLNRNMKDSESG
        SSMELPYCSM EPGPNIEAEER C+F KSLV+ERVYQL ECS+M VSEPEYN+QK CKDLNR MKDSESG
Subjt:  SSMELPYCSMTEPGPNIEAEERPCAFTKSLVNERVYQLEECSTMRVSEPEYNQQKPCKDLNRNMKDSESG

XP_022984066.1 uncharacterized protein LOC111482488 isoform X1 [Cucurbita maxima] (e value: 5.11e-160; % identity: 89.14)
Query:  MGVESNSAAPPPPSSSSTPSPSAKRARDPEDEVYLDNFHSHKRYLSEIMASSLNGLTVGDPLPENLMDSPARSESMLYLRDEMSWQYSPMSEDSDDCRFC
        MGVESNSA PPPPSSSSTPSPS KRARDP+DEVYLDNFHSHKRYLSEIMASSLNGLTVGD L ENLMDSPARSESMLYLRDEMS QYSPMSEDSDDCRFC
Subjt:  MGVESNSAAPPPPSSSSTPSPSAKRARDPEDEVYLDNFHSHKRYLSEIMASSLNGLTVGDPLPENLMDSPARSESMLYLRDEMSWQYSPMSEDSDDCRFC

Query:  ETSTNLFPLQSD-SVPTSPVSPYRYQRPFSTVTPSTSTNNNSLGCSTSPVPGLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQPPGPSSM
        ETSTNLFP QSD SVPTSPVSPYRYQRPFS +TPST+TN  SLGC+TSPV  LQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQP GPSSM
Subjt:  ETSTNLFPLQSD-SVPTSPVSPYRYQRPFSTVTPSTSTNNNSLGCSTSPVPGLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQPPGPSSM

Query:  ELPYCSMTEPGPNIEAEERPCAFTKSLVNERVYQLEECSTMRVSEPEYNQQKPCKDLNRNMKDSESG
        ELPYCSM EPGPNIEAEER C+F KSLV+ERVYQL ECS M VSEPEYN+QK CKDLNR MKD ESG
Subjt:  ELPYCSMTEPGPNIEAEERPCAFTKSLVNERVYQLEECSTMRVSEPEYNQQKPCKDLNRNMKDSESG

XP_038904570.1 uncharacterized protein LOC120090942 [Benincasa hispida] (e value: 6.74e-160; % identity: 89.14)
Query:  MGVESNSAAPPPPSSSSTPSPSAKRARDPEDEVYLDNFHSHKRYLSEIMASSLNGLTVGDPLPENLMDSPARSESMLYLRDEMSWQYSPMSEDSDDCRFC
        MGVESNSA  PPPSSSSTPSPS KRARDP+DEVYLDNFHSHKRYLSEIMASSLNGLTVGDPL ENLMDSPARSESMLY R+EMSWQYSPMSEDSDDCRFC
Subjt:  MGVESNSAAPPPPSSSSTPSPSAKRARDPEDEVYLDNFHSHKRYLSEIMASSLNGLTVGDPLPENLMDSPARSESMLYLRDEMSWQYSPMSEDSDDCRFC

Query:  ETSTNLFPLQSDS-VPTSPVSPYRYQRPFSTVTPSTSTNNNSLGCSTSPVPGLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQPPGPSSM
        ETSTNLFP QSDS VPTSPVSPYRYQRPFS VTPST TN  SLGCSTSPV  LQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQP GPSSM
Subjt:  ETSTNLFPLQSDS-VPTSPVSPYRYQRPFSTVTPSTSTNNNSLGCSTSPVPGLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQPPGPSSM

Query:  ELPYCSMTEPGPNIEAEERPCAFTKSLVNERVYQLEECSTMRVSEPEYNQQKPCKDLNRNMKDSESG
        ELPYCSM EPGPNIEAEERPC+  KSLV+ER +QLEECS+M VSEPEYN++K CKDLNR+MKDSESG
Subjt:  ELPYCSMTEPGPNIEAEERPCAFTKSLVNERVYQLEECSTMRVSEPEYNQQKPCKDLNRNMKDSESG

TrEMBL top hits (e value, % identity, alignment)
A0A0A0KWN0 Uncharacterized protein (e value: 1.85e-158; % identity: 88.52)
Query:  MGVESNSAAPPPPSSSSTPSPSAKRARDPEDEVYLDNFHSHKRYLSEIMASSLNGLTVGDPLPENLMDSPARSESMLYLRDEMSWQYSPMSEDSDDCRFC
        MGVESNSA PPP SSSSTPSPS KRARDP+DEVYLDNFHSHKRYLSEIMASSLNGLTVGDPL ENLMDSPARSESMLY RDEMSWQYSPMSEDSDDCRFC
Subjt:  MGVESNSAAPPPPSSSSTPSPSAKRARDPEDEVYLDNFHSHKRYLSEIMASSLNGLTVGDPLPENLMDSPARSESMLYLRDEMSWQYSPMSEDSDDCRFC

Query:  ETSTNLFPLQSDS-VPTSPVSPYRYQRPFSTVTPSTSTNNNSLGCST-SPVPGLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQPPGPSS
        ETSTNLFP QSDS VPTSPVSPYRYQRPFS V PST TN  SLGCST SPV  LQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQPPGPSS
Subjt:  ETSTNLFPLQSDS-VPTSPVSPYRYQRPFSTVTPSTSTNNNSLGCST-SPVPGLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQPPGPSS

Query:  MELPYCSMTEPGPNIEAEERPCAFTKSLVNERVYQLEECSTMR--VSEPEYNQQKPCKDLNRNMKDSESG
        MELPYCSM EPGPNIEAE+RPC+  KSLV+ERVYQLEECS+M   VSE EYN+QK CKDLNR+MKDS SG
Subjt:  MELPYCSMTEPGPNIEAEERPCAFTKSLVNERVYQLEECSTMR--VSEPEYNQQKPCKDLNRNMKDSESG

A0A5A7TRC2 Uncharacterized protein (e value: 2.16e-157; % identity: 87.78)
Query:  MGVESNSAAPPPP-SSSSTPSPSAKRARDPEDEVYLDNFHSHKRYLSEIMASSLNGLTVGDPLPENLMDSPARSESMLYLRDEMSWQYSPMSEDSDDCRF
        MGVESNSA PPPP SSSSTPSPS KRARDPEDEVYLDNFHSHKRYLSEIMASSLNGLTVG+PL ENLMDSPARSESMLY RDEMSWQYSPMSEDSDDCRF
Subjt:  MGVESNSAAPPPP-SSSSTPSPSAKRARDPEDEVYLDNFHSHKRYLSEIMASSLNGLTVGDPLPENLMDSPARSESMLYLRDEMSWQYSPMSEDSDDCRF

Query:  CETSTNLFPLQSDS-VPTSPVSPYRYQRPFSTVTPSTSTNNNSLGCSTSPVPGLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQPPGPSS
        CETSTNLFP QSDS VPTSPVSPYRYQRPFS + PS  TN  SLGCSTSPV  LQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQP GPSS
Subjt:  CETSTNLFPLQSDS-VPTSPVSPYRYQRPFSTVTPSTSTNNNSLGCSTSPVPGLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQPPGPSS

Query:  MELPYCSMTEPGPNIEAEERPCAFTKSLVNERVYQLEECSTMR--VSEPEYNQQKPCKDLNRNMKDSESG
        MELPYCSM EPGPNIEAE+RPC+  KSLV+ERVYQLEECS+M   VSE EYN+QK CKDLNR+MKDS+SG
Subjt:  MELPYCSMTEPGPNIEAEERPCAFTKSLVNERVYQLEECSTMR--VSEPEYNQQKPCKDLNRNMKDSESG

A0A6J1DPQ3 uncharacterized protein LOC111022394 isoform X1 (e value: 3.44e-190; % identity: 100)
Query:  MGVESNSAAPPPPSSSSTPSPSAKRARDPEDEVYLDNFHSHKRYLSEIMASSLNGLTVGDPLPENLMDSPARSESMLYLRDEMSWQYSPMSEDSDDCRFC
        MGVESNSAAPPPPSSSSTPSPSAKRARDPEDEVYLDNFHSHKRYLSEIMASSLNGLTVGDPLPENLMDSPARSESMLYLRDEMSWQYSPMSEDSDDCRFC
Subjt:  MGVESNSAAPPPPSSSSTPSPSAKRARDPEDEVYLDNFHSHKRYLSEIMASSLNGLTVGDPLPENLMDSPARSESMLYLRDEMSWQYSPMSEDSDDCRFC

Query:  ETSTNLFPLQSDSVPTSPVSPYRYQRPFSTVTPSTSTNNNSLGCSTSPVPGLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQPPGPSSME
        ETSTNLFPLQSDSVPTSPVSPYRYQRPFSTVTPSTSTNNNSLGCSTSPVPGLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQPPGPSSME
Subjt:  ETSTNLFPLQSDSVPTSPVSPYRYQRPFSTVTPSTSTNNNSLGCSTSPVPGLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQPPGPSSME

Query:  LPYCSMTEPGPNIEAEERPCAFTKSLVNERVYQLEECSTMRVSEPEYNQQKPCKDLNRNMKDSESGES
        LPYCSMTEPGPNIEAEERPCAFTKSLVNERVYQLEECSTMRVSEPEYNQQKPCKDLNRNMKDSESGES
Subjt:  LPYCSMTEPGPNIEAEERPCAFTKSLVNERVYQLEECSTMRVSEPEYNQQKPCKDLNRNMKDSESGES

A0A6J1F722 uncharacterized protein LOC111441454 isoform X1 (e value: 9.20e-159; % identity: 88.15)
Query:  MGVESNSAAPPPP---SSSSTPSPSAKRARDPEDEVYLDNFHSHKRYLSEIMASSLNGLTVGDPLPENLMDSPARSESMLYLRDEMSWQYSPMSEDSDDC
        MGVESNS  PPPP   SSSSTPSPS KRARDP+DEVYLDNFHSHKRYLSEIMASSLNGLTVGDPL ENLMDSPARSESMLYLRDEMS QYSPMSEDSDDC
Subjt:  MGVESNSAAPPPP---SSSSTPSPSAKRARDPEDEVYLDNFHSHKRYLSEIMASSLNGLTVGDPLPENLMDSPARSESMLYLRDEMSWQYSPMSEDSDDC

Query:  RFCETSTNLFPLQSDS-VPTSPVSPYRYQRPFSTVTPSTSTNNNSLGCSTSPVPGLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQPPGP
        RFCETSTNLFP QSDS VPTSPVSPYRYQRPFS +TPST TN  SLGC+T PV  LQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQP GP
Subjt:  RFCETSTNLFPLQSDS-VPTSPVSPYRYQRPFSTVTPSTSTNNNSLGCSTSPVPGLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQPPGP

Query:  SSMELPYCSMTEPGPNIEAEERPCAFTKSLVNERVYQLEECSTMRVSEPEYNQQKPCKDLNRNMKDSESG
        SSMELPYCSM EPGPNIEAEER C+F KSLV+ERVYQL ECS+M VSEPEYN+QK CKDLNR MKDSESG
Subjt:  SSMELPYCSMTEPGPNIEAEERPCAFTKSLVNERVYQLEECSTMRVSEPEYNQQKPCKDLNRNMKDSESG

A0A6J1J464 uncharacterized protein LOC111482488 isoform X1 (e value: 2.47e-160; % identity: 89.14)
Query:  MGVESNSAAPPPPSSSSTPSPSAKRARDPEDEVYLDNFHSHKRYLSEIMASSLNGLTVGDPLPENLMDSPARSESMLYLRDEMSWQYSPMSEDSDDCRFC
        MGVESNSA PPPPSSSSTPSPS KRARDP+DEVYLDNFHSHKRYLSEIMASSLNGLTVGD L ENLMDSPARSESMLYLRDEMS QYSPMSEDSDDCRFC
Subjt:  MGVESNSAAPPPPSSSSTPSPSAKRARDPEDEVYLDNFHSHKRYLSEIMASSLNGLTVGDPLPENLMDSPARSESMLYLRDEMSWQYSPMSEDSDDCRFC

Query:  ETSTNLFPLQSD-SVPTSPVSPYRYQRPFSTVTPSTSTNNNSLGCSTSPVPGLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQPPGPSSM
        ETSTNLFP QSD SVPTSPVSPYRYQRPFS +TPST+TN  SLGC+TSPV  LQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQP GPSSM
Subjt:  ETSTNLFPLQSD-SVPTSPVSPYRYQRPFSTVTPSTSTNNNSLGCSTSPVPGLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQPPGPSSM

Query:  ELPYCSMTEPGPNIEAEERPCAFTKSLVNERVYQLEECSTMRVSEPEYNQQKPCKDLNRNMKDSESG
        ELPYCSM EPGPNIEAEER C+F KSLV+ERVYQL ECS M VSEPEYN+QK CKDLNR MKD ESG
Subjt:  ELPYCSMTEPGPNIEAEERPCAFTKSLVNERVYQLEECSTMRVSEPEYNQQKPCKDLNRNMKDSESG

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
AT2G25920.1 BEST Arabidopsis thaliana protein match is: 3'-5' exonuclease domain-containing protein / K homology domain-containing protein / KH domain-containing protein (TAIR:AT2G25910.2) (e value: 7.5e-59; % identity: 53.31)
Query:  GVESNSAAPPPPSSSSTPSPSAKRARDPEDEVYLDNFHSHKRYLSEIMASSLNGLTVGDPLPENLMDSPARSESMLYLRDEMSWQYSPMSEDSDDCRFCE
        G      A  PP   S  SP  KR RDPEDEVYLDN  S KRYLSEIMA SLNGLTVGD LP N+++SPARSES LY RD++S QYSPMSEDSD+ RFCE
Subjt:  GVESNSAAPPPPSSSSTPSPSAKRARDPEDEVYLDNFHSHKRYLSEIMASSLNGLTVGDPLPENLMDSPARSESMLYLRDEMSWQYSPMSEDSDDCRFCE

Query:  TST---NLFPLQSDSVPTSPVSPYRYQRPFSTVT---PSTSTNNNSLGCSTSPVPGL------QPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQM
          T   +    Q +S PTSPVSPYRYQRP ++     PS +  ++S  C  S +         Q  QRGSD+EGRFPSSPSDICHS DLRR ALLRSVQM
Subjt:  TST---NLFPLQSDSVPTSPVSPYRYQRPFSTVT---PSTSTNNNSLGCSTSPVPGL------QPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQM

Query:  RAQPPGPSSMELPYCSMTEPGPNIEAEERPCAFTKSLVNERVYQL-EECSTMRVSEPEYNQQKPCKDLNRNM
        R QP G SS   P         NI+ EER C  +KS+  +R Y   E+     VS    ++ K CK L+  +
Subjt:  RAQPPGPSSMELPYCSMTEPGPNIEAEERPCAFTKSLVNERVYQL-EECSTMRVSEPEYNQQKPCKDLNRNM


Sequences
CDS sequence
ATGGGCGTCGAATCGAACTCTGCGGCACCGCCACCACCATCGTCGTCTTCTACACCTTCTCCGAGCGCGAAGCGAGCCAGAGATCCCGAAGATGAAGTTTATCTCGACAA
TTTCCACTCTCATAAGCGCTACCTCAGTGAGATAATGGCTTCTAGTTTGAATGGATTGACGGTTGGGGACCCCCTCCCTGAGAATCTCATGGATTCTCCTGCAAGGTCGG
AATCCATGCTTTATCTAAGGGATGAAATGTCCTGGCAATATTCTCCTATGTCTGAAGATTCAGATGACTGCCGTTTTTGTGAGACATCCACAAACTTATTTCCTTTGCAG
TCTGACAGTGTACCTACCAGTCCAGTCTCGCCATATCGATATCAAAGACCGTTCAGCACGGTGACTCCTTCAACAAGTACTAATAATAATTCACTTGGATGTTCTACTAG
TCCCGTCCCTGGCTTGCAACCACATCAACGTGGATCAGATTCTGAGGGCCGTTTCCCGTCATCTCCCAGCGACATATGCCACTCGGCAGACTTGAGAAGGGCTGCGCTCT
TGCGTTCTGTTCAAATGAGAGCACAACCTCCTGGTCCATCATCTATGGAGTTGCCATATTGCTCAATGACTGAGCCTGGACCTAATATAGAAGCTGAGGAGCGGCCATGT
GCTTTCACAAAATCGTTAGTCAATGAAAGAGTATATCAACTTGAGGAATGCTCCACAATGAGAGTGTCCGAACCCGAATATAACCAACAGAAACCGTGCAAGGACTTGAA
CAGGAATATGAAGGATAGTGAATCCGGAGAGTCGTAA
mRNA sequence
AATAAAAACAAAAAGGCCGTTTGTGAGAAAACTCAAACAAACAGAGCCCCCAACATCTCTTCTTCTCCAGCCCAGAAAAGGGAAAAACCAGACTTCTTCTTCTACATCCA
AAACTGGTGGCGGTGAACGATGATTCCGATTTCCAAATAATTTGCCTACATTATTGGCTCTCCAAACCCTAATGATCGTAGAATCCGTAATTTTGTGCTTCAATTTCTTG
CTCAAAAACTGTTGAAGCTGCCTTGATGGGCGTCGAATCGAACTCTGCGGCACCGCCACCACCATCGTCGTCTTCTACACCTTCTCCGAGCGCGAAGCGAGCCAGAGATC
CCGAAGATGAAGTTTATCTCGACAATTTCCACTCTCATAAGCGCTACCTCAGTGAGATAATGGCTTCTAGTTTGAATGGATTGACGGTTGGGGACCCCCTCCCTGAGAAT
CTCATGGATTCTCCTGCAAGGTCGGAATCCATGCTTTATCTAAGGGATGAAATGTCCTGGCAATATTCTCCTATGTCTGAAGATTCAGATGACTGCCGTTTTTGTGAGAC
ATCCACAAACTTATTTCCTTTGCAGTCTGACAGTGTACCTACCAGTCCAGTCTCGCCATATCGATATCAAAGACCGTTCAGCACGGTGACTCCTTCAACAAGTACTAATA
ATAATTCACTTGGATGTTCTACTAGTCCCGTCCCTGGCTTGCAACCACATCAACGTGGATCAGATTCTGAGGGCCGTTTCCCGTCATCTCCCAGCGACATATGCCACTCG
GCAGACTTGAGAAGGGCTGCGCTCTTGCGTTCTGTTCAAATGAGAGCACAACCTCCTGGTCCATCATCTATGGAGTTGCCATATTGCTCAATGACTGAGCCTGGACCTAA
TATAGAAGCTGAGGAGCGGCCATGTGCTTTCACAAAATCGTTAGTCAATGAAAGAGTATATCAACTTGAGGAATGCTCCACAATGAGAGTGTCCGAACCCGAATATAACC
AACAGAAACCGTGCAAGGACTTGAACAGGAATATGAAGGATAGTGAATCCGGAGAGTCGTAAATGCTCGAAAATGTACATAGGAAAATCTCTTCTAACATCACAAATGGG
ACACATTGCTGCTGTGGGATTGAAAGTGTTTTGGTCAGTCCCATGGAAGTTCTCTTTTTTACATTTGCTGCCAAATTTACTCGCTTTTGCTCGTTCAGGGATGGTTTGGT
TTTCTGCTAGGTTCTCTCTCGGGTGTGCGGTGGCTTGCACCATCTTGTTGAGCATGCATAGAAGCTACAGTTTATCTATTTGGTACGATCAAAGTATGAAGCCGAGAGAA
GACGTTTTGCTGAGCTAATTTTGTAGATAAAATGGCCGCAAATTTTCGGGTCAAGAAGTCAGAAGTTGCAAGTGTTCCATTTCGTTTCATATTTGTAATGCTGTTATTAA
CTTATCGTTGAGTTCTTGTTGTGATAGCCACAAAGATTTTACTGGCAAATTTGATGGGAAAACGATGAGATGAATAGAGATGAAATTACATATTGAGTTTCTATCAGGCA
TCAGCAAAGAGTACCATGGTTTTAAATTCTTGGGTTTTTTTCCTCCTACTCATTTAAAGCCCATTAGGAACAGAATGGGGCTTAAATTTTACAGAGCAAATATTT
Protein sequence
MGVESNSAAPPPPSSSSTPSPSAKRARDPEDEVYLDNFHSHKRYLSEIMASSLNGLTVGDPLPENLMDSPARSESMLYLRDEMSWQYSPMSEDSDDCRFCETSTNLFPLQ
SDSVPTSPVSPYRYQRPFSTVTPSTSTNNNSLGCSTSPVPGLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQPPGPSSMELPYCSMTEPGPNIEAEERPC
AFTKSLVNERVYQLEECSTMRVSEPEYNQQKPCKDLNRNMKDSESGES