CuGenDBv2

MS019005 (gene) of Bitter gourd (TR) v1 genome

Gene ID: MS019005
Organism: Momordica charantia cv. TR (Bitter gourd (TR) v1)
Description: BEST Arabidopsis thaliana protein match is: 3'-5' exonuclease domain-containing protein / K homology domain-containing protein / KH domain-containing protein.
Genome location: scaffold20:643356..646402
RNA-Seq Expression: MS019005
Synteny: MS019005
Gene Ontology terms: NA
InterPro domains: NA
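
The genome location field can be split into a sequence ID and a coordinate pair. The following is a minimal Python sketch, assuming the `seqid:start..end` form shown on this page and 1-based inclusive coordinates (the page does not state the convention). `parse_location` and `span_length` are hypothetical helper names, not part of CuGenDB.

```python
import re

# Assumed pattern: "<seqid>:<start>..<end>", as in "scaffold20:643356..646402".
LOCATION_RE = re.compile(r"(\S+):(\d+)\.\.(\d+)")


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a location string into (seqid, start, end)."""
    match = LOCATION_RE.fullmatch(location.strip())
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    seqid, start, end = match.groups()
    return seqid, int(start), int(end)


def span_length(start: int, end: int) -> int:
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return abs(end - start) + 1


seqid, start, end = parse_location("scaffold20:643356..646402")
print(seqid, start, end, span_length(start, end))
```

Under the inclusive-coordinate assumption the gene spans 3047 bp on scaffold20.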


Homology
GenBank top hits
KAG6581290.1 hypothetical protein SDJN03_21292, partial [Cucurbita argyrosperma subsp. sororia]; e value 5.6e-125; identity 88.56%
Query:  MGVESNSAAPPPP----SSSSTPSPSAKRARDPEDEVYLDNFHSHKRYLSEIMASSLNGLTVGDPLPENLMDSPARSESMLYLRDEMSWQYSPMSEDSDD
        MGVESNS  PPPP    SSSSTPSPS KRARDP+DEVYLDNFHSHKRYLSEIMASSLNGLTVGDPL ENLMDSPARSESMLYLRDEMS QYSPMSEDSDD
Subjt:  MGVESNSAAPPPP----SSSSTPSPSAKRARDPEDEVYLDNFHSHKRYLSEIMASSLNGLTVGDPLPENLMDSPARSESMLYLRDEMSWQYSPMSEDSDD

Query:  CRFCETSTNLFPLQSD-SVPTSPVSPYRYQRPFSTVTPSTSTNNNSLGCSTSPVPGLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQPPG
        CRFCETSTNLFP QSD SVPTSPVSPYRYQRPFS +TPST T N SLGC+T PV  LQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQP G
Subjt:  CRFCETSTNLFPLQSD-SVPTSPVSPYRYQRPFSTVTPSTSTNNNSLGCSTSPVPGLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQPPG

Query:  PSSMELPYCSMPEPGPNIEAEERPCAFIKSLVNERVYQLEECSTMRVSEPEYNQQKPCKDLNRNMKDSESG
        PSSMELPYCSMPEPGPNIEAEER C+FIKSLV+ERVYQL ECS+M VSEPEYN+QK CKDLNR MKDSESG
Subjt:  PSSMELPYCSMPEPGPNIEAEERPCAFIKSLVNERVYQLEECSTMRVSEPEYNQQKPCKDLNRNMKDSESG

XP_022155254.1 uncharacterized protein LOC111022394 isoform X1 [Momordica charantia]; e value 7.5e-146; identity 99.25%
Query:  MGVESNSAAPPPPSSSSTPSPSAKRARDPEDEVYLDNFHSHKRYLSEIMASSLNGLTVGDPLPENLMDSPARSESMLYLRDEMSWQYSPMSEDSDDCRFC
        MGVESNSAAPPPPSSSSTPSPSAKRARDPEDEVYLDNFHSHKRYLSEIMASSLNGLTVGDPLPENLMDSPARSESMLYLRDEMSWQYSPMSEDSDDCRFC
Subjt:  MGVESNSAAPPPPSSSSTPSPSAKRARDPEDEVYLDNFHSHKRYLSEIMASSLNGLTVGDPLPENLMDSPARSESMLYLRDEMSWQYSPMSEDSDDCRFC

Query:  ETSTNLFPLQSDSVPTSPVSPYRYQRPFSTVTPSTSTNNNSLGCSTSPVPGLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQPPGPSSME
        ETSTNLFPLQSDSVPTSPVSPYRYQRPFSTVTPSTSTNNNSLGCSTSPVPGLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQPPGPSSME
Subjt:  ETSTNLFPLQSDSVPTSPVSPYRYQRPFSTVTPSTSTNNNSLGCSTSPVPGLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQPPGPSSME

Query:  LPYCSMPEPGPNIEAEERPCAFIKSLVNERVYQLEECSTMRVSEPEYNQQKPCKDLNRNMKDSESGE
        LPYCSM EPGPNIEAEERPCAF KSLVNERVYQLEECSTMRVSEPEYNQQKPCKDLNRNMKDSESGE
Subjt:  LPYCSMPEPGPNIEAEERPCAFIKSLVNERVYQLEECSTMRVSEPEYNQQKPCKDLNRNMKDSESGE

XP_022934215.1 uncharacterized protein LOC111441454 isoform X1 [Cucurbita moschata]; e value 4.3e-125; identity 88.89%
Query:  MGVESNSAAPPPP---SSSSTPSPSAKRARDPEDEVYLDNFHSHKRYLSEIMASSLNGLTVGDPLPENLMDSPARSESMLYLRDEMSWQYSPMSEDSDDC
        MGVESNS  PPPP   SSSSTPSPS KRARDP+DEVYLDNFHSHKRYLSEIMASSLNGLTVGDPL ENLMDSPARSESMLYLRDEMS QYSPMSEDSDDC
Subjt:  MGVESNSAAPPPP---SSSSTPSPSAKRARDPEDEVYLDNFHSHKRYLSEIMASSLNGLTVGDPLPENLMDSPARSESMLYLRDEMSWQYSPMSEDSDDC

Query:  RFCETSTNLFPLQSD-SVPTSPVSPYRYQRPFSTVTPSTSTNNNSLGCSTSPVPGLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQPPGP
        RFCETSTNLFP QSD SVPTSPVSPYRYQRPFS +TPST T N SLGC+T PV  LQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQP GP
Subjt:  RFCETSTNLFPLQSD-SVPTSPVSPYRYQRPFSTVTPSTSTNNNSLGCSTSPVPGLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQPPGP

Query:  SSMELPYCSMPEPGPNIEAEERPCAFIKSLVNERVYQLEECSTMRVSEPEYNQQKPCKDLNRNMKDSESG
        SSMELPYCSMPEPGPNIEAEER C+FIKSLV+ERVYQL ECS+M VSEPEYN+QK CKDLNR MKDSESG
Subjt:  SSMELPYCSMPEPGPNIEAEERPCAFIKSLVNERVYQLEECSTMRVSEPEYNQQKPCKDLNRNMKDSESG

XP_022984066.1 uncharacterized protein LOC111482488 isoform X1 [Cucurbita maxima]; e value 3.0e-126; identity 89.89%
Query:  MGVESNSAAPPPPSSSSTPSPSAKRARDPEDEVYLDNFHSHKRYLSEIMASSLNGLTVGDPLPENLMDSPARSESMLYLRDEMSWQYSPMSEDSDDCRFC
        MGVESNSA PPPPSSSSTPSPS KRARDP+DEVYLDNFHSHKRYLSEIMASSLNGLTVGD L ENLMDSPARSESMLYLRDEMS QYSPMSEDSDDCRFC
Subjt:  MGVESNSAAPPPPSSSSTPSPSAKRARDPEDEVYLDNFHSHKRYLSEIMASSLNGLTVGDPLPENLMDSPARSESMLYLRDEMSWQYSPMSEDSDDCRFC

Query:  ETSTNLFPLQSD-SVPTSPVSPYRYQRPFSTVTPSTSTNNNSLGCSTSPVPGLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQPPGPSSM
        ETSTNLFP QSD SVPTSPVSPYRYQRPFS +TPST+T N SLGC+TSPV  LQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQP GPSSM
Subjt:  ETSTNLFPLQSD-SVPTSPVSPYRYQRPFSTVTPSTSTNNNSLGCSTSPVPGLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQPPGPSSM

Query:  ELPYCSMPEPGPNIEAEERPCAFIKSLVNERVYQLEECSTMRVSEPEYNQQKPCKDLNRNMKDSESG
        ELPYCSMPEPGPNIEAEER C+FIKSLV+ERVYQL ECS M VSEPEYN+QK CKDLNR MKD ESG
Subjt:  ELPYCSMPEPGPNIEAEERPCAFIKSLVNERVYQLEECSTMRVSEPEYNQQKPCKDLNRNMKDSESG

XP_038904570.1 uncharacterized protein LOC120090942 [Benincasa hispida]; e value 5.1e-126; identity 89.89%
Query:  MGVESNSAAPPPPSSSSTPSPSAKRARDPEDEVYLDNFHSHKRYLSEIMASSLNGLTVGDPLPENLMDSPARSESMLYLRDEMSWQYSPMSEDSDDCRFC
        MGVESNSA  PPPSSSSTPSPS KRARDP+DEVYLDNFHSHKRYLSEIMASSLNGLTVGDPL ENLMDSPARSESMLY R+EMSWQYSPMSEDSDDCRFC
Subjt:  MGVESNSAAPPPPSSSSTPSPSAKRARDPEDEVYLDNFHSHKRYLSEIMASSLNGLTVGDPLPENLMDSPARSESMLYLRDEMSWQYSPMSEDSDDCRFC

Query:  ETSTNLFPLQSD-SVPTSPVSPYRYQRPFSTVTPSTSTNNNSLGCSTSPVPGLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQPPGPSSM
        ETSTNLFP QSD SVPTSPVSPYRYQRPFS VTPST T N SLGCSTSPV  LQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQP GPSSM
Subjt:  ETSTNLFPLQSD-SVPTSPVSPYRYQRPFSTVTPSTSTNNNSLGCSTSPVPGLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQPPGPSSM

Query:  ELPYCSMPEPGPNIEAEERPCAFIKSLVNERVYQLEECSTMRVSEPEYNQQKPCKDLNRNMKDSESG
        ELPYCSMPEPGPNIEAEERPC+ IKSLV+ER +QLEECS+M VSEPEYN++K CKDLNR+MKDSESG
Subjt:  ELPYCSMPEPGPNIEAEERPCAFIKSLVNERVYQLEECSTMRVSEPEYNQQKPCKDLNRNMKDSESG

TrEMBL top hits
A0A0A0KWN0 Uncharacterized protein; e value 3.6e-125; identity 89.26%
Query:  MGVESNSAAPPPPSSSSTPSPSAKRARDPEDEVYLDNFHSHKRYLSEIMASSLNGLTVGDPLPENLMDSPARSESMLYLRDEMSWQYSPMSEDSDDCRFC
        MGVESNSA PPP SSSSTPSPS KRARDP+DEVYLDNFHSHKRYLSEIMASSLNGLTVGDPL ENLMDSPARSESMLY RDEMSWQYSPMSEDSDDCRFC
Subjt:  MGVESNSAAPPPPSSSSTPSPSAKRARDPEDEVYLDNFHSHKRYLSEIMASSLNGLTVGDPLPENLMDSPARSESMLYLRDEMSWQYSPMSEDSDDCRFC

Query:  ETSTNLFPLQSD-SVPTSPVSPYRYQRPFSTVTPSTSTNNNSLGCS-TSPVPGLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQPPGPSS
        ETSTNLFP QSD SVPTSPVSPYRYQRPFS V PST T N SLGCS TSPV  LQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQPPGPSS
Subjt:  ETSTNLFPLQSD-SVPTSPVSPYRYQRPFSTVTPSTSTNNNSLGCS-TSPVPGLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQPPGPSS

Query:  MELPYCSMPEPGPNIEAEERPCAFIKSLVNERVYQLEECSTM--RVSEPEYNQQKPCKDLNRNMKDSESG
        MELPYCSMPEPGPNIEAE+RPC+ IKSLV+ERVYQLEECS+M   VSE EYN+QK CKDLNR+MKDS SG
Subjt:  MELPYCSMPEPGPNIEAEERPCAFIKSLVNERVYQLEECSTM--RVSEPEYNQQKPCKDLNRNMKDSESG

A0A5A7TRC2 Uncharacterized protein; e value 2.3e-124; identity 88.52%
Query:  MGVESNSAAPPPP-SSSSTPSPSAKRARDPEDEVYLDNFHSHKRYLSEIMASSLNGLTVGDPLPENLMDSPARSESMLYLRDEMSWQYSPMSEDSDDCRF
        MGVESNSA PPPP SSSSTPSPS KRARDPEDEVYLDNFHSHKRYLSEIMASSLNGLTVG+PL ENLMDSPARSESMLY RDEMSWQYSPMSEDSDDCRF
Subjt:  MGVESNSAAPPPP-SSSSTPSPSAKRARDPEDEVYLDNFHSHKRYLSEIMASSLNGLTVGDPLPENLMDSPARSESMLYLRDEMSWQYSPMSEDSDDCRF

Query:  CETSTNLFPLQSD-SVPTSPVSPYRYQRPFSTVTPSTSTNNNSLGCSTSPVPGLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQPPGPSS
        CETSTNLFP QSD SVPTSPVSPYRYQRPFS + PS  T N SLGCSTSPV  LQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQP GPSS
Subjt:  CETSTNLFPLQSD-SVPTSPVSPYRYQRPFSTVTPSTSTNNNSLGCSTSPVPGLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQPPGPSS

Query:  MELPYCSMPEPGPNIEAEERPCAFIKSLVNERVYQLEECSTM--RVSEPEYNQQKPCKDLNRNMKDSESG
        MELPYCSMPEPGPNIEAE+RPC+ IKSLV+ERVYQLEECS+M   VSE EYN+QK CKDLNR+MKDS+SG
Subjt:  MELPYCSMPEPGPNIEAEERPCAFIKSLVNERVYQLEECSTM--RVSEPEYNQQKPCKDLNRNMKDSESG

A0A6J1DPQ3 uncharacterized protein LOC111022394 isoform X1; e value 3.7e-146; identity 99.25%
Query:  MGVESNSAAPPPPSSSSTPSPSAKRARDPEDEVYLDNFHSHKRYLSEIMASSLNGLTVGDPLPENLMDSPARSESMLYLRDEMSWQYSPMSEDSDDCRFC
        MGVESNSAAPPPPSSSSTPSPSAKRARDPEDEVYLDNFHSHKRYLSEIMASSLNGLTVGDPLPENLMDSPARSESMLYLRDEMSWQYSPMSEDSDDCRFC
Subjt:  MGVESNSAAPPPPSSSSTPSPSAKRARDPEDEVYLDNFHSHKRYLSEIMASSLNGLTVGDPLPENLMDSPARSESMLYLRDEMSWQYSPMSEDSDDCRFC

Query:  ETSTNLFPLQSDSVPTSPVSPYRYQRPFSTVTPSTSTNNNSLGCSTSPVPGLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQPPGPSSME
        ETSTNLFPLQSDSVPTSPVSPYRYQRPFSTVTPSTSTNNNSLGCSTSPVPGLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQPPGPSSME
Subjt:  ETSTNLFPLQSDSVPTSPVSPYRYQRPFSTVTPSTSTNNNSLGCSTSPVPGLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQPPGPSSME

Query:  LPYCSMPEPGPNIEAEERPCAFIKSLVNERVYQLEECSTMRVSEPEYNQQKPCKDLNRNMKDSESGE
        LPYCSM EPGPNIEAEERPCAF KSLVNERVYQLEECSTMRVSEPEYNQQKPCKDLNRNMKDSESGE
Subjt:  LPYCSMPEPGPNIEAEERPCAFIKSLVNERVYQLEECSTMRVSEPEYNQQKPCKDLNRNMKDSESGE

A0A6J1F722 uncharacterized protein LOC111441454 isoform X1; e value 2.1e-125; identity 88.89%
Query:  MGVESNSAAPPPP---SSSSTPSPSAKRARDPEDEVYLDNFHSHKRYLSEIMASSLNGLTVGDPLPENLMDSPARSESMLYLRDEMSWQYSPMSEDSDDC
        MGVESNS  PPPP   SSSSTPSPS KRARDP+DEVYLDNFHSHKRYLSEIMASSLNGLTVGDPL ENLMDSPARSESMLYLRDEMS QYSPMSEDSDDC
Subjt:  MGVESNSAAPPPP---SSSSTPSPSAKRARDPEDEVYLDNFHSHKRYLSEIMASSLNGLTVGDPLPENLMDSPARSESMLYLRDEMSWQYSPMSEDSDDC

Query:  RFCETSTNLFPLQSD-SVPTSPVSPYRYQRPFSTVTPSTSTNNNSLGCSTSPVPGLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQPPGP
        RFCETSTNLFP QSD SVPTSPVSPYRYQRPFS +TPST T N SLGC+T PV  LQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQP GP
Subjt:  RFCETSTNLFPLQSD-SVPTSPVSPYRYQRPFSTVTPSTSTNNNSLGCSTSPVPGLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQPPGP

Query:  SSMELPYCSMPEPGPNIEAEERPCAFIKSLVNERVYQLEECSTMRVSEPEYNQQKPCKDLNRNMKDSESG
        SSMELPYCSMPEPGPNIEAEER C+FIKSLV+ERVYQL ECS+M VSEPEYN+QK CKDLNR MKDSESG
Subjt:  SSMELPYCSMPEPGPNIEAEERPCAFIKSLVNERVYQLEECSTMRVSEPEYNQQKPCKDLNRNMKDSESG

A0A6J1J464 uncharacterized protein LOC111482488 isoform X1; e value 1.4e-126; identity 89.89%
Query:  MGVESNSAAPPPPSSSSTPSPSAKRARDPEDEVYLDNFHSHKRYLSEIMASSLNGLTVGDPLPENLMDSPARSESMLYLRDEMSWQYSPMSEDSDDCRFC
        MGVESNSA PPPPSSSSTPSPS KRARDP+DEVYLDNFHSHKRYLSEIMASSLNGLTVGD L ENLMDSPARSESMLYLRDEMS QYSPMSEDSDDCRFC
Subjt:  MGVESNSAAPPPPSSSSTPSPSAKRARDPEDEVYLDNFHSHKRYLSEIMASSLNGLTVGDPLPENLMDSPARSESMLYLRDEMSWQYSPMSEDSDDCRFC

Query:  ETSTNLFPLQSD-SVPTSPVSPYRYQRPFSTVTPSTSTNNNSLGCSTSPVPGLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQPPGPSSM
        ETSTNLFP QSD SVPTSPVSPYRYQRPFS +TPST+T N SLGC+TSPV  LQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQP GPSSM
Subjt:  ETSTNLFPLQSD-SVPTSPVSPYRYQRPFSTVTPSTSTNNNSLGCSTSPVPGLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQPPGPSSM

Query:  ELPYCSMPEPGPNIEAEERPCAFIKSLVNERVYQLEECSTMRVSEPEYNQQKPCKDLNRNMKDSESG
        ELPYCSMPEPGPNIEAEER C+FIKSLV+ERVYQL ECS M VSEPEYN+QK CKDLNR MKD ESG
Subjt:  ELPYCSMPEPGPNIEAEERPCAFIKSLVNERVYQLEECSTMRVSEPEYNQQKPCKDLNRNMKDSESG

SwissProt top hits
No hits found

Arabidopsis top hits
AT2G25920.1 BEST Arabidopsis thaliana protein match is: 3'-5' exonuclease domain-containing protein / K homology domain-containing protein / KH domain-containing protein (TAIR:AT2G25910.2); e value 5.7e-59; identity 53.31%
Query:  GVESNSAAPPPPSSSSTPSPSAKRARDPEDEVYLDNFHSHKRYLSEIMASSLNGLTVGDPLPENLMDSPARSESMLYLRDEMSWQYSPMSEDSDDCRFCE
        G      A  PP   S  SP  KR RDPEDEVYLDN  S KRYLSEIMA SLNGLTVGD LP N+++SPARSES LY RD++S QYSPMSEDSD+ RFCE
Subjt:  GVESNSAAPPPPSSSSTPSPSAKRARDPEDEVYLDNFHSHKRYLSEIMASSLNGLTVGDPLPENLMDSPARSESMLYLRDEMSWQYSPMSEDSDDCRFCE

Query:  TST---NLFPLQSDSVPTSPVSPYRYQRPFSTVT---PSTSTNNNSLGCSTSPVPGL------QPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQM
          T   +    Q +S PTSPVSPYRYQRP ++     PS +  ++S  C  S +         Q  QRGSD+EGRFPSSPSDICHS DLRR ALLRSVQM
Subjt:  TST---NLFPLQSDSVPTSPVSPYRYQRPFSTVT---PSTSTNNNSLGCSTSPVPGL------QPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQM

Query:  RAQPPGPSSMELPYCSMPEPGPNIEAEERPCAFIKSLVNERVYQL-EECSTMRVSEPEYNQQKPCKDLNRNM
        R QP G SS   P         NI+ EER C+  KS+  +R Y   E+     VS    ++ K CK L+  +
Subjt:  RAQPPGPSSMELPYCSMPEPGPNIEAEERPCAFIKSLVNERVYQL-EECSTMRVSEPEYNQQKPCKDLNRNM
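
In the alignments above, the unlabeled middle row follows the usual BLAST convention: a letter marks an identical residue, `+` marks a conservative substitution, and a blank marks a mismatch or gap. As a rough illustration of how a percent identity relates to that row, here is a minimal Python sketch. `midline_identity` is a hypothetical helper, and it assumes the caller has already stripped the `Query:` label and the matching indentation from each row. The %identity values reported on this page come from the search tool itself and may be computed over a different alignment length, so this sketch is not expected to reproduce them exactly.

```python
def midline_identity(query_rows, midline_rows):
    """Count identical positions from BLAST-style midline rows.

    query_rows   -- aligned query segments (gaps written as '-')
    midline_rows -- the matching midline segments, same column order
    Returns (identical, aligned_length, percent_identity).
    """
    identical = 0
    aligned = 0
    for query, midline in zip(query_rows, midline_rows):
        # Trailing blanks are often trimmed from the midline; pad them back.
        midline = midline.ljust(len(query))[: len(query)]
        aligned += len(query)
        # Letters in the midline are identities; '+' and ' ' are not.
        identical += sum(1 for ch in midline if ch.isalpha())
    if aligned == 0:
        raise ValueError("no aligned columns")
    return identical, aligned, 100.0 * identical / aligned


# First 13 columns of the Cucurbita maxima alignment, used only as a toy input.
identical, aligned, pct = midline_identity(["MGVESNSAAPPPP"], ["MGVESNSA PPPP"])
print(identical, aligned, round(pct, 2))
```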


Sequences
CDS sequence
ATGGGCGTCGAATCGAACTCTGCGGCACCGCCACCACCATCGTCGTCTTCTACACCTTCTCCGAGCGCGAAGCGAGCCAGAGATCCCGAAGATGAAGTTTATCTCGACAA
CTTCCACTCTCATAAGCGCTACCTCAGTGAGATAATGGCTTCTAGTTTGAATGGATTGACGGTTGGGGACCCCCTCCCTGAGAATCTCATGGATTCTCCTGCAAGGTCGG
AATCCATGCTTTATCTAAGGGATGAAATGTCCTGGCAATATTCTCCTATGTCTGAAGATTCAGATGACTGCCGTTTTTGTGAGACATCCACAAACTTATTTCCTTTGCAG
TCTGACAGTGTACCTACCAGTCCAGTCTCGCCATATCGATATCAAAGACCGTTCAGCACGGTGACTCCTTCAACAAGTACTAATAATAATTCACTTGGATGTTCTACTAG
TCCCGTCCCTGGCTTGCAACCACATCAACGTGGATCAGATTCTGAGGGCCGTTTCCCGTCATCTCCCAGCGACATATGCCACTCGGCAGACTTGAGAAGGGCTGCGCTCT
TGCGTTCTGTTCAAATGAGAGCACAACCTCCTGGTCCATCATCTATGGAGTTGCCATATTGCTCAATGCCTGAGCCTGGACCTAATATAGAAGCTGAGGAGCGGCCATGT
GCTTTCATAAAATCGTTAGTCAATGAAAGAGTATATCAACTTGAGGAATGCTCCACAATGAGAGTGTCCGAACCCGAATATAACCAACAGAAACCGTGCAAGGACTTGAA
CAGGAATATGAAGGATAGTGAATCTGGAGAG
mRNA sequence
ATGGGCGTCGAATCGAACTCTGCGGCACCGCCACCACCATCGTCGTCTTCTACACCTTCTCCGAGCGCGAAGCGAGCCAGAGATCCCGAAGATGAAGTTTATCTCGACAA
CTTCCACTCTCATAAGCGCTACCTCAGTGAGATAATGGCTTCTAGTTTGAATGGATTGACGGTTGGGGACCCCCTCCCTGAGAATCTCATGGATTCTCCTGCAAGGTCGG
AATCCATGCTTTATCTAAGGGATGAAATGTCCTGGCAATATTCTCCTATGTCTGAAGATTCAGATGACTGCCGTTTTTGTGAGACATCCACAAACTTATTTCCTTTGCAG
TCTGACAGTGTACCTACCAGTCCAGTCTCGCCATATCGATATCAAAGACCGTTCAGCACGGTGACTCCTTCAACAAGTACTAATAATAATTCACTTGGATGTTCTACTAG
TCCCGTCCCTGGCTTGCAACCACATCAACGTGGATCAGATTCTGAGGGCCGTTTCCCGTCATCTCCCAGCGACATATGCCACTCGGCAGACTTGAGAAGGGCTGCGCTCT
TGCGTTCTGTTCAAATGAGAGCACAACCTCCTGGTCCATCATCTATGGAGTTGCCATATTGCTCAATGCCTGAGCCTGGACCTAATATAGAAGCTGAGGAGCGGCCATGT
GCTTTCATAAAATCGTTAGTCAATGAAAGAGTATATCAACTTGAGGAATGCTCCACAATGAGAGTGTCCGAACCCGAATATAACCAACAGAAACCGTGCAAGGACTTGAA
CAGGAATATGAAGGATAGTGAATCTGGAGAG
Protein sequence
MGVESNSAAPPPPSSSSTPSPSAKRARDPEDEVYLDNFHSHKRYLSEIMASSLNGLTVGDPLPENLMDSPARSESMLYLRDEMSWQYSPMSEDSDDCRFCETSTNLFPLQ
SDSVPTSPVSPYRYQRPFSTVTPSTSTNNNSLGCSTSPVPGLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQPPGPSSMELPYCSMPEPGPNIEAEERPC
AFIKSLVNERVYQLEECSTMRVSEPEYNQQKPCKDLNRNMKDSESGE
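
The CDS listed above is 801 nt and the protein is 267 residues, which is consistent with a 3:1 codon-to-residue ratio and no stop codon in the listed CDS. A minimal Python sketch for checking a CDS against its protein is given below, assuming the standard genetic code (the page does not name a translation table). `translate` is a hypothetical helper written for this illustration.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order
# (first base varies slowest, third base fastest).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1; line breaks and spaces are ignored."""
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# The first six codons and the last four codons of the CDS above.
print(translate("ATGGGCGTCGAATCGAAC"))
print(translate("GAATCTGGAGAG"))
```

Pasting the full CDS (with or without line breaks) into `translate` and comparing the result with the protein sequence gives a quick consistency check between the two records.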