; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS002723 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS002723
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
Descriptionvicilin-like seed storage protein At2g18540
Genome locationscaffold66:28460..30899
RNA-Seq ExpressionMS002723
SyntenyMS002723
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6600733.1 hypothetical protein SDJN03_05966, partial [Cucurbita argyrosperma subsp. sororia]2.3e-9282.21Show/hide
Query:  EENGAPSKRTSGRTSEKGHGISGSEVDKGRKRNKLRRKSRDSSDSD---DKKRGRSSRRKGKSRRRY-SSEEDSDSEDAESESSEYDSDSENSDAESASS
        E+N APSKR SGR+         S+V++GRKR+K RRKSRDSSDSD   D+KRG+SS RKGKSRRRY SSEEDSDSEDAESESSE DSDSE+SDAESAS+
Subjt:  EENGAPSKRTSGRTSEKGHGISGSEVDKGRKRNKLRRKSRDSSDSD---DKKRGRSSRRKGKSRRRY-SSEEDSDSEDAESESSEYDSDSENSDAESASS

Query:  SSGSEGDSESEEERRRRKRRERRRRREKEERERKRRKKEKEKKKRKKEREEEKRRKE-KKKEKKERGKKGAVTNSWGKYGIIKETDMWNKRPEFTAWLAE
        SSGSEG  ESEEERRRRKR+ERRRRREKEERERKRR+KEKEKK+R+KE+EEEKRRKE KKKEKKERGKKGAVTNSWGKYGIIKETDMWNKRPEFTAWLAE
Subjt:  SSGSEGDSESEEERRRRKRRERRRRREKEERERKRRKKEKEKKKRKKEREEEKRRKE-KKKEKKERGKKGAVTNSWGKYGIIKETDMWNKRPEFTAWLAE

Query:  IKKVNLESLANWEEKQMFKE--------------YYNLDAYYQHKMLKDMKKGHKKLSAGERTVFDDEEQRRQELLREREKHKEEQVEALKRSMQTGM
        IKKVNLESLANWEEKQMFKE              YYNLDAYYQHKMLKDMKKGHKK+ AGERTVFDDEEQRRQELL EREKHKEEQVEALKRSMQTGM
Subjt:  IKKVNLESLANWEEKQMFKE--------------YYNLDAYYQHKMLKDMKKGHKKLSAGERTVFDDEEQRRQELLREREKHKEEQVEALKRSMQTGM

KAG7031373.1 hypothetical protein SDJN02_05413, partial [Cucurbita argyrosperma subsp. argyrosperma]2.3e-9282.21Show/hide
Query:  EENGAPSKRTSGRTSEKGHGISGSEVDKGRKRNKLRRKSRDSSDSD---DKKRGRSSRRKGKSRRRY-SSEEDSDSEDAESESSEYDSDSENSDAESASS
        E+N APSKR SGR+         S+V++GRKR+K RRKSRDSSDSD   D+KRG+SS RKGKSRRRY SSEEDSDSEDAESESSE DSDSE+SDAESAS+
Subjt:  EENGAPSKRTSGRTSEKGHGISGSEVDKGRKRNKLRRKSRDSSDSD---DKKRGRSSRRKGKSRRRY-SSEEDSDSEDAESESSEYDSDSENSDAESASS

Query:  SSGSEGDSESEEERRRRKRRERRRRREKEERERKRRKKEKEKKKRKKEREEEKRRKE-KKKEKKERGKKGAVTNSWGKYGIIKETDMWNKRPEFTAWLAE
        SSGSEG  ESEEERRRRKR+ERRRRREKEERERKRR+KEKEKK+R+KE+EEEKRRKE KKKEKKERGKKGAVTNSWGKYGIIKETDMWNKRPEFTAWLAE
Subjt:  SSGSEGDSESEEERRRRKRRERRRRREKEERERKRRKKEKEKKKRKKEREEEKRRKE-KKKEKKERGKKGAVTNSWGKYGIIKETDMWNKRPEFTAWLAE

Query:  IKKVNLESLANWEEKQMFKE--------------YYNLDAYYQHKMLKDMKKGHKKLSAGERTVFDDEEQRRQELLREREKHKEEQVEALKRSMQTGM
        IKKVNLESLANWEEKQMFKE              YYNLDAYYQHKMLKDMKKGHKK+ AGERTVFDDEEQRRQELL EREKHKEEQVEALKRSMQTGM
Subjt:  IKKVNLESLANWEEKQMFKE--------------YYNLDAYYQHKMLKDMKKGHKKLSAGERTVFDDEEQRRQELLREREKHKEEQVEALKRSMQTGM

XP_022142930.1 troponin I [Momordica charantia]2.1e-11794.54Show/hide
Query:  EENGAPSKRTSGRTSEKGHGISGSEVDKGRKRNKLRRKSRDSSDSDDKKRGRSSRRKGKSRRRYSSEEDSDSEDAESESSEYDSDSENSDAESASSSSGS
        EENGAPSKRTSGRTSEKGHGISGSEVD+GRKRNKLRRKSRDSSDSDDKKRGRSSRRKGKSRRRYSSEEDSDSEDAE ESSEYDSDSENSDAESASSSSGS
Subjt:  EENGAPSKRTSGRTSEKGHGISGSEVDKGRKRNKLRRKSRDSSDSDDKKRGRSSRRKGKSRRRYSSEEDSDSEDAESESSEYDSDSENSDAESASSSSGS

Query:  EGDSESEEERRRRKRRERRRRREKEERERKRRKKEKEKKKRKKEREEEKRRKEKKKEKKERGKKGAVTNSWGKYGIIKETDMWNKRPEFTAWLAEIKKVN
        EGDSESEEERRRRKRRERRRRREKEERERKRRKKEKEKKKRKKEREEEKRRKEKKKEKKERGKKGAVTNSWGKYGIIKETDMWNKRPEFTAWLAEIKKVN
Subjt:  EGDSESEEERRRRKRRERRRRREKEERERKRRKKEKEKKKRKKEREEEKRRKEKKKEKKERGKKGAVTNSWGKYGIIKETDMWNKRPEFTAWLAEIKKVN

Query:  LESLANWEEKQMFKE--------------YYNLDAYYQHKMLKDMKKGHKKLSAGERTVFDDEEQRRQELLREREKHKEEQVEALKRSMQTGM
        LESLANWEEKQMFKE              YYNLDAYYQHKMLKDMKKGHKKLSAGERTVFDDEEQRRQELLREREKHKEEQVEALKRSMQTGM
Subjt:  LESLANWEEKQMFKE--------------YYNLDAYYQHKMLKDMKKGHKKLSAGERTVFDDEEQRRQELLREREKHKEEQVEALKRSMQTGM

XP_023515919.1 vicilin-like seed storage protein At2g18540 [Cucurbita pepo subsp. pepo]2.3e-9282.21Show/hide
Query:  EENGAPSKRTSGRTSEKGHGISGSEVDKGRKRNKLRRKSRDSSDSD---DKKRGRSSRRKGKSRRRY-SSEEDSDSEDAESESSEYDSDSENSDAESASS
        E+N APSKR SGR+         S+V++GRKR+K RRKSRDSSDSD   D+KRG+SS RKGKSRRRY SSEEDSDSEDAESESSE DSDSE+SDAESAS+
Subjt:  EENGAPSKRTSGRTSEKGHGISGSEVDKGRKRNKLRRKSRDSSDSD---DKKRGRSSRRKGKSRRRY-SSEEDSDSEDAESESSEYDSDSENSDAESASS

Query:  SSGSEGDSESEEERRRRKRRERRRRREKEERERKRRKKEKEKKKRKKEREEEKRRKE-KKKEKKERGKKGAVTNSWGKYGIIKETDMWNKRPEFTAWLAE
        SSGSEG  ESEEERRRRKR+ERRRRREKEERERKRR+KEKEKK+R+KE+EEEKRRKE KKKEKKERGKKGAVTNSWGKYGIIKETDMWNKRPEFTAWLAE
Subjt:  SSGSEGDSESEEERRRRKRRERRRRREKEERERKRRKKEKEKKKRKKEREEEKRRKE-KKKEKKERGKKGAVTNSWGKYGIIKETDMWNKRPEFTAWLAE

Query:  IKKVNLESLANWEEKQMFKE--------------YYNLDAYYQHKMLKDMKKGHKKLSAGERTVFDDEEQRRQELLREREKHKEEQVEALKRSMQTGM
        IKKVNLESLANWEEKQMFKE              YYNLDAYYQHKMLKDMKKGHKK+ AGERTVFDDEEQRRQELL EREKHKEEQVEALKRSMQTGM
Subjt:  IKKVNLESLANWEEKQMFKE--------------YYNLDAYYQHKMLKDMKKGHKKLSAGERTVFDDEEQRRQELLREREKHKEEQVEALKRSMQTGM

XP_038906271.1 vicilin-like seed storage protein At2g18540 [Benincasa hispida]1.1e-9181.21Show/hide
Query:  EENGAPSKRTSGRTSEKGHGISGSEVDKGRKRNKLRRKSRDSSDSD---DKKRGRSSRRKGKSRRRY-SSEEDSDSEDAESESSEYDSDSENSDAESASS
        E+NGAPSK +SGR S+K    SGS+ D+GRKR K R KSRDS DSD   DKKRG+SSRRKGKSRRRY SSEEDSDSED ES+SS YDSDS +SDAESAS+
Subjt:  EENGAPSKRTSGRTSEKGHGISGSEVDKGRKRNKLRRKSRDSSDSD---DKKRGRSSRRKGKSRRRY-SSEEDSDSEDAESESSEYDSDSENSDAESASS

Query:  SSGSEGDSESEEERRRRKRRERRRRREKEERERKRRKKEKEKKKRKKEREEEKRRKE-KKKEKKERGKKGAVTNSWGKYGIIKETDMWNKRPEFTAWLAE
        SSGSEGDS+SEEERRRRKR+ERRRRR+K ERERKRR+KEKEKK+R+KEREEEKRRKE KKKEKKERGKKGAVTNSWGKYGIIKETDMWNKRPEFTAWLAE
Subjt:  SSGSEGDSESEEERRRRKRRERRRRREKEERERKRRKKEKEKKKRKKEREEEKRRKE-KKKEKKERGKKGAVTNSWGKYGIIKETDMWNKRPEFTAWLAE

Query:  IKKVNLESLANWEEKQMFKE--------------YYNLDAYYQHKMLKDMKKGHKKLSAGERTVFDDEEQRRQELLREREKHKEEQVEALKRSMQTGM
        IKKVNLESLANWEEKQMFKE              YYNLDAYYQ K+ KDMKKGHKK+  GERTVFDDEEQRRQELL EREKHKEEQVEALKRSMQTGM
Subjt:  IKKVNLESLANWEEKQMFKE--------------YYNLDAYYQHKMLKDMKKGHKKLSAGERTVFDDEEQRRQELLREREKHKEEQVEALKRSMQTGM

TrEMBL top hitse value%identityAlignment
A0A0A0L2M5 Uncharacterized protein1.6e-7573.42Show/hide
Query:  EENGAPSKRTSGRTSEKGHGISGSEVDKGRKRNKLRRKSRDSSDSDD---KKRGRSSRRKGKSRRRYS-SEEDSDSEDAESESSEYDSDSENSDAESASS
        E+N APSKR SG+ SEK    SG + D+GRKR K RRKSR++ DSD+   K+RGRSSRRKGKSRRRYS SEEDSDSE+ ES+SS YDSDS +SD+ESASS
Subjt:  EENGAPSKRTSGRTSEKGHGISGSEVDKGRKRNKLRRKSRDSSDSDD---KKRGRSSRRKGKSRRRYS-SEEDSDSEDAESESSEYDSDSENSDAESASS

Query:  SSGSEGDSESEEERRRRKRRERRRRREKEERERKRRKKEKEKKKRKKEREEEKRRKE-KKKEKKERGKKGAVTNSWGKYGIIKETDMWNKRPEFTAWLAE
        SSGSE DSESE ERR+RKR+ERRRRREK ERERKRR+KEKEKK+ KKE+EEEKRRKE KKKEKKERGKKGAVTNSWGK+GIIKETDMW    EF      
Subjt:  SSGSEGDSESEEERRRRKRRERRRRREKEERERKRRKKEKEKKKRKKEREEEKRRKE-KKKEKKERGKKGAVTNSWGKYGIIKETDMWNKRPEFTAWLAE

Query:  IKK---VNLESLANWEEKQMFKE--------------YYNLDAYYQHKMLKDMKKGHKKLSAGERTVFDDEEQRRQELLREREKHKEEQVEALKRSMQTG
        +     VNLESLANWEEKQMFKE              YYNLDAYYQ K+ KDMKKG KK+ AGERTVFDDEEQRRQELL EREKHKEEQVE LKRSMQTG
Subjt:  IKK---VNLESLANWEEKQMFKE--------------YYNLDAYYQHKMLKDMKKGHKKLSAGERTVFDDEEQRRQELLREREKHKEEQVEALKRSMQTG

Query:  M
        M
Subjt:  M

A0A1S3BU92 LOW QUALITY PROTEIN: vicilin-like seed storage protein At2g185404.0e-9079.46Show/hide
Query:  EENGAPSKRTSGRTSEKGHGISGSEVDKGRKRNKLRRKSRDSSDSDD---KKRGRSSRRKGKSRRRY-SSEEDSDSEDAESESSEYDSDSENSDAESASS
        E+N APSKR SG+ SEK    SGS+ D+GRKR K RRKSR++SDSD+   K+RGRSSRRKGKSRRRY SSEEDSDSED ES+SS YDS+S  SD+ESASS
Subjt:  EENGAPSKRTSGRTSEKGHGISGSEVDKGRKRNKLRRKSRDSSDSDD---KKRGRSSRRKGKSRRRY-SSEEDSDSEDAESESSEYDSDSENSDAESASS

Query:  SSGSEGDSESEEERRRRKRRERRRRREKEERERKRRKKEKEKKKRKKEREEEKRRKEKKKEKKERGKKGAVTNSWGKYGIIKETDMWNKRPEFTAWLAEI
        SSGSE DSESEEERRRRKR+ERRRRREK ERERKRR+KEKEKK+RKK  EE++R+++KKKEKKERGKKGAVTNSWGKYGIIKETDMWNKRPEFTAWLAEI
Subjt:  SSGSEGDSESEEERRRRKRRERRRRREKEERERKRRKKEKEKKKRKKEREEEKRRKEKKKEKKERGKKGAVTNSWGKYGIIKETDMWNKRPEFTAWLAEI

Query:  KKVNLESLANWEEKQMFKE--------------YYNLDAYYQHKMLKDMKKGHKKLSAGERTVFDDEEQRRQELLREREKHKEEQVEALKRSMQTGM
        KKVNLESLANWEEKQMFKE              YYNLDAYYQ K+ KDMKKGHKK+ AGERTVFDDEEQRRQELL EREKHKEEQVEALKRSMQTGM
Subjt:  KKVNLESLANWEEKQMFKE--------------YYNLDAYYQHKMLKDMKKGHKKLSAGERTVFDDEEQRRQELLREREKHKEEQVEALKRSMQTGM

A0A6J1CMB2 troponin I1.0e-11794.54Show/hide
Query:  EENGAPSKRTSGRTSEKGHGISGSEVDKGRKRNKLRRKSRDSSDSDDKKRGRSSRRKGKSRRRYSSEEDSDSEDAESESSEYDSDSENSDAESASSSSGS
        EENGAPSKRTSGRTSEKGHGISGSEVD+GRKRNKLRRKSRDSSDSDDKKRGRSSRRKGKSRRRYSSEEDSDSEDAE ESSEYDSDSENSDAESASSSSGS
Subjt:  EENGAPSKRTSGRTSEKGHGISGSEVDKGRKRNKLRRKSRDSSDSDDKKRGRSSRRKGKSRRRYSSEEDSDSEDAESESSEYDSDSENSDAESASSSSGS

Query:  EGDSESEEERRRRKRRERRRRREKEERERKRRKKEKEKKKRKKEREEEKRRKEKKKEKKERGKKGAVTNSWGKYGIIKETDMWNKRPEFTAWLAEIKKVN
        EGDSESEEERRRRKRRERRRRREKEERERKRRKKEKEKKKRKKEREEEKRRKEKKKEKKERGKKGAVTNSWGKYGIIKETDMWNKRPEFTAWLAEIKKVN
Subjt:  EGDSESEEERRRRKRRERRRRREKEERERKRRKKEKEKKKRKKEREEEKRRKEKKKEKKERGKKGAVTNSWGKYGIIKETDMWNKRPEFTAWLAEIKKVN

Query:  LESLANWEEKQMFKE--------------YYNLDAYYQHKMLKDMKKGHKKLSAGERTVFDDEEQRRQELLREREKHKEEQVEALKRSMQTGM
        LESLANWEEKQMFKE              YYNLDAYYQHKMLKDMKKGHKKLSAGERTVFDDEEQRRQELLREREKHKEEQVEALKRSMQTGM
Subjt:  LESLANWEEKQMFKE--------------YYNLDAYYQHKMLKDMKKGHKKLSAGERTVFDDEEQRRQELLREREKHKEEQVEALKRSMQTGM

A0A6J1FXC4 stress response protein NST11.6e-9181.54Show/hide
Query:  EENGAPSKRTSGRTSEKGHGISGSEVDKGRKRNKLRRKSRDSSDSD---DKKRGRSSRRKGKSRRRY-SSEEDSDSEDAESESSEYDSDSENSDAESASS
        E+N APSKR S R+         S+V++GRKR+K RRKSRDSSDSD   D+KRG+SS RKGKSRRRY SSEEDSDSEDAESESSE DSDSE+SDAESAS+
Subjt:  EENGAPSKRTSGRTSEKGHGISGSEVDKGRKRNKLRRKSRDSSDSD---DKKRGRSSRRKGKSRRRY-SSEEDSDSEDAESESSEYDSDSENSDAESASS

Query:  SSGSEGDSESEEERRRRKRRERRRRREKEERERKRRKKEKEKKKRKKEREEEKRRKE-KKKEKKERGKKGAVTNSWGKYGIIKETDMWNKRPEFTAWLAE
        SSGSEG  ESEEERRRRKR+ERRR+REKEERERKRR+KEKEKK+R+KE+EEEKRRKE KKKEKKERGKKGAVTNSWGKYGIIKETDMWNKRPEFTAWLAE
Subjt:  SSGSEGDSESEEERRRRKRRERRRRREKEERERKRRKKEKEKKKRKKEREEEKRRKE-KKKEKKERGKKGAVTNSWGKYGIIKETDMWNKRPEFTAWLAE

Query:  IKKVNLESLANWEEKQMFKE--------------YYNLDAYYQHKMLKDMKKGHKKLSAGERTVFDDEEQRRQELLREREKHKEEQVEALKRSMQTGM
        IKKVNLESLANWEEKQMFKE              YYNLDAYYQHKMLKDMKKGHKK+ AGERTVFDDEEQRRQELL EREKHKEEQVEALKRSMQTGM
Subjt:  IKKVNLESLANWEEKQMFKE--------------YYNLDAYYQHKMLKDMKKGHKKLSAGERTVFDDEEQRRQELLREREKHKEEQVEALKRSMQTGM

A0A6J1KTR2 troponin I1.2e-9181.54Show/hide
Query:  EENGAPSKRTSGRTSEKGHGISGSEVDKGRKRNKLRRKSRDSSDSD---DKKRGRSSRRKGKSRRRY-SSEEDSDSEDAESESSEYDSDSENSDAESASS
        E+N APSKR SGR+         S+V++GRKR+K RRKSRDSSDSD   D+KRG+S  RKGKSRRRY SSEEDSDSEDAESESSE DSDSE+SDAESAS+
Subjt:  EENGAPSKRTSGRTSEKGHGISGSEVDKGRKRNKLRRKSRDSSDSD---DKKRGRSSRRKGKSRRRY-SSEEDSDSEDAESESSEYDSDSENSDAESASS

Query:  SSGSEGDSESEEERRRRKRRERRRRREKEERERKRRKKEKEKKKRKKEREEEKRRKE-KKKEKKERGKKGAVTNSWGKYGIIKETDMWNKRPEFTAWLAE
        SSGSEG  ESEEERRRRKR+E+RRRREKEERERKRR+KEKEKK+R+KE+EEEKRRKE KKKEKKERGKKGAVTNSWGKYGIIKETDMWNKRPEFTAWLAE
Subjt:  SSGSEGDSESEEERRRRKRRERRRRREKEERERKRRKKEKEKKKRKKEREEEKRRKE-KKKEKKERGKKGAVTNSWGKYGIIKETDMWNKRPEFTAWLAE

Query:  IKKVNLESLANWEEKQMFKE--------------YYNLDAYYQHKMLKDMKKGHKKLSAGERTVFDDEEQRRQELLREREKHKEEQVEALKRSMQTGM
        IKKVNLESLANWEEKQMFKE              YYNLDAYYQHKMLKDMKKGHKK+ AGERTVFDDEEQRRQELL EREKHKEEQVEALKRSMQTGM
Subjt:  IKKVNLESLANWEEKQMFKE--------------YYNLDAYYQHKMLKDMKKGHKKLSAGERTVFDDEEQRRQELLREREKHKEEQVEALKRSMQTGM

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT5G53800.1 unknown protein8.8e-4252.04Show/hide
Query:  KRTSGRTSEKGHG-ISGSEVDKGRKRNKLRRKSRDSSDSDDKKRGRSSRRKGKSRRRYSSEE------DSDSEDAESESSEYDSDSENSDAESASSSSGS
        +R S R    G G  SG E D  R+R   +R+++D +DS  +    S     K  RR S ++      D  S  +     +Y S S +S++ES S  S S
Subjt:  KRTSGRTSEKGHG-ISGSEVDKGRKRNKLRRKSRDSSDSDDKKRGRSSRRKGKSRRRYSSEE------DSDSEDAESESSEYDSDSENSDAESASSSSGS

Query:  EGDSESEEERRRRKRRERRRRREKEERERKRRKKEKEKKKR-KKEREEEKRRKEKKKEKKERGKKGAVTNSWGKYGIIKETDMWNKRPEFTAWLAEIKKV
        E +SESE+ERRRRKR  +R+ RE+EE+ERKRR++EK+KKKR K +++ +K+RKEKKK+K E+ KKGAVT SWGKYGII+ETDMWNKRPEFTAWL E+KKV
Subjt:  EGDSESEEERRRRKRRERRRRREKEERERKRRKKEKEKKKR-KKEREEEKRRKEKKKEKKERGKKGAVTNSWGKYGIIKETDMWNKRPEFTAWLAEIKKV

Query:  NLESLANWEEKQMFKE--------------YYNLDAYYQHKMLKDMKKGHKKLSAGERTVFDDEEQRRQELLREREKHKEEQVEALKRSMQTGM
        NLESL  WEEK+MFK+              YY++D YY+ K+ K+MKKG KK    ERTVF+DEEQRR E+   RE+ KEE+V ALKRSM+ GM
Subjt:  NLESLANWEEKQMFKE--------------YYNLDAYYQHKMLKDMKKGHKKLSAGERTVFDDEEQRRQELLREREKHKEEQVEALKRSMQTGM


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
GAAGAGAACGGAGCCCCGTCGAAGAGAACTTCTGGTAGGACTTCCGAAAAGGGTCATGGAATTTCGGGCTCCGAAGTGGACAAGGGACGAAAACGCAACAAATTAAGAAG
AAAATCGCGAGACAGTTCGGATTCCGATGACAAGAAACGCGGGAGGAGCTCGAGAAGAAAGGGGAAATCCCGTAGACGATACAGTAGTGAGGAGGATTCTGATTCCGAGG
ATGCTGAATCTGAAAGCTCGGAGTATGACTCGGATTCGGAGAACTCCGACGCGGAATCTGCGTCGAGTAGTTCGGGTTCGGAAGGCGATAGCGAGAGCGAGGAGGAGAGG
AGGAGGAGGAAGAGGAGGGAGAGGAGGAGGAGAAGAGAAAAGGAGGAAAGAGAGCGGAAGAGGAGGAAAAAGGAGAAGGAGAAGAAGAAGAGGAAAAAGGAGAGGGAGGA
AGAGAAACGAAGGAAGGAGAAGAAAAAGGAGAAGAAGGAACGGGGAAAGAAGGGGGCAGTCACGAATTCGTGGGGAAAGTATGGGATTATCAAAGAAACTGATATGTGGA
ACAAGCGACCAGAGTTCACTGCATGGTTGGCTGAAATCAAAAAGGTGAACTTGGAAAGCCTAGCTAATTGGGAAGAGAAGCAAATGTTTAAGGAGTACTACAACCTTGAT
GCTTATTACCAGCATAAAATGCTAAAAGATATGAAGAAGGGCCACAAAAAGCTTTCGGCAGGGGAACGCACTGTGTTTGATGATGAAGAACAGAGGAGGCAAGAACTACT
TCGAGAACGTGAAAAACACAAGGAAGAACAAGTGGAAGCCTTAAAGCGCTCTATGCAGACTGGAATGGTCAGTGATTTTACTCTC
mRNA sequenceShow/hide mRNA sequence
GAAGAGAACGGAGCCCCGTCGAAGAGAACTTCTGGTAGGACTTCCGAAAAGGGTCATGGAATTTCGGGCTCCGAAGTGGACAAGGGACGAAAACGCAACAAATTAAGAAG
AAAATCGCGAGACAGTTCGGATTCCGATGACAAGAAACGCGGGAGGAGCTCGAGAAGAAAGGGGAAATCCCGTAGACGATACAGTAGTGAGGAGGATTCTGATTCCGAGG
ATGCTGAATCTGAAAGCTCGGAGTATGACTCGGATTCGGAGAACTCCGACGCGGAATCTGCGTCGAGTAGTTCGGGTTCGGAAGGCGATAGCGAGAGCGAGGAGGAGAGG
AGGAGGAGGAAGAGGAGGGAGAGGAGGAGGAGAAGAGAAAAGGAGGAAAGAGAGCGGAAGAGGAGGAAAAAGGAGAAGGAGAAGAAGAAGAGGAAAAAGGAGAGGGAGGA
AGAGAAACGAAGGAAGGAGAAGAAAAAGGAGAAGAAGGAACGGGGAAAGAAGGGGGCAGTCACGAATTCGTGGGGAAAGTATGGGATTATCAAAGAAACTGATATGTGGA
ACAAGCGACCAGAGTTCACTGCATGGTTGGCTGAAATCAAAAAGGTGAACTTGGAAAGCCTAGCTAATTGGGAAGAGAAGCAAATGTTTAAGGAGTACTACAACCTTGAT
GCTTATTACCAGCATAAAATGCTAAAAGATATGAAGAAGGGCCACAAAAAGCTTTCGGCAGGGGAACGCACTGTGTTTGATGATGAAGAACAGAGGAGGCAAGAACTACT
TCGAGAACGTGAAAAACACAAGGAAGAACAAGTGGAAGCCTTAAAGCGCTCTATGCAGACTGGAATGGTCAGTGATTTTACTCTC
Protein sequenceShow/hide protein sequence
EENGAPSKRTSGRTSEKGHGISGSEVDKGRKRNKLRRKSRDSSDSDDKKRGRSSRRKGKSRRRYSSEEDSDSEDAESESSEYDSDSENSDAESASSSSGSEGDSESEEER
RRRKRRERRRRREKEERERKRRKKEKEKKKRKKEREEEKRRKEKKKEKKERGKKGAVTNSWGKYGIIKETDMWNKRPEFTAWLAEIKKVNLESLANWEEKQMFKEYYNLD
AYYQHKMLKDMKKGHKKLSAGERTVFDDEEQRRQELLREREKHKEEQVEALKRSMQTGMVSDFTL