; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS024052 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS024052
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionAgglutinin domain-containing protein
Genome locationscaffold93:1394388..1397494
RNA-Seq ExpressionMS024052
SyntenyMS024052
Gene Ontology termsNA
InterPro domainsIPR008998 - Agglutinin domain
IPR036242 - Agglutinin domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022155409.1 uncharacterized protein LOC111022557 [Momordica charantia]8.4e-6942.34Show/hide
Query:  EAELRE--LENKYKAITRKTTDTSDEGKSVQQLPKYFSLQRFNPSSSDPKTGAYLRCVQDHEILEDGFLKVSGKSVLSPYSKMESEASESSPKHVHIRCC
        E  LRE  LEN+YK I+ K  + S        +PK+F+LQ + P    PKT  YLR VQDHE   DGFL+ SGK + SP SK  SEASES P+ +HIRC 
Subjt:  EAELRE--LENKYKAITRKTTDTSDEGKSVQQLPKYFSLQRFNPSSSDPKTGAYLRCVQDHEILEDGFLKVSGKSVLSPYSKMESEASESSPKHVHIRCC

Query:  YNNKYWVRQWPDSFYIVTAAAEKEEDRSKWSCTLFSAFYMHHGSHEVFGLNHVQLG-----------------------------------------LAV
        YNNKYWVRQ PDS YIV    ++E D+SKWSCTLF   Y     H+ +   HVQLG                                         L +
Subjt:  YNNKYWVRQWPDSFYIVTAAAEKEEDRSKWSCTLFSAFYMHHGSHEVFGLNHVQLG-----------------------------------------LAV

Query:  L-------------------YRSYDSNDFLN----------------------------------CLSAEDNMDDPNTLFKPVKVDHNIVVLRNMGNNQF
        L                   Y  +   D  N                                   ++ + + DD N+LF+PVK+ +NIV LR++GNN F
Subjt:  L-------------------YRSYDSNDFLN----------------------------------CLSAEDNMDDPNTLFKPVKVDHNIVVLRNMGNNQF

Query:  CISLTIDGKRNCLNATNRNVSADAHMEVLQAVVSSKIENIEYCINDAKIYGERVWSMAKGDATNKTNAADTVQFTFTFEDKRKNS
        C SL+IDGK NCLNA   N   +  ME  +AV+SS+IENIEY + DAKIYGERVWSM KGDA NKT AADTVQFTF+FEDK K +
Subjt:  CISLTIDGKRNCLNATNRNVSADAHMEVLQAVVSSKIENIEYCINDAKIYGERVWSMAKGDATNKTNAADTVQFTFTFEDKRKNS

XP_022157622.1 uncharacterized protein LOC111024282 [Momordica charantia]8.7e-5032.26Show/hide
Query:  TLLMRAAEEAELRELENKYKAITRKTTDTSDEGKSVQQLPKYFSLQRFNPSSSDPKTGAYLRCVQDHEILEDGFLKVSGKSVLSPYSKMESEASESSPKH
        T LMRA +    R+LE+KY+ IT+K    S++G+ +  +P+YF L+  + ++       YLR + + +    GFL  SG+ ++SPY+K E E S     +
Subjt:  TLLMRAAEEAELRELENKYKAITRKTTDTSDEGKSVQQLPKYFSLQRFNPSSSDPKTGAYLRCVQDHEILEDGFLKVSGKSVLSPYSKMESEASESSPKH

Query:  VHIRCCYNNKYWVRQWPDSFYIVTAAAEKEEDRSKWSCTLFSAFYMHHGSHEVFGLNHVQLGLAVLYRSYDSNDFLNCLSAE------------------
        VH+RCC+NNKYWVR+   S YIV AA E  +D SKWSCTL    Y  H   + +   HVQL  A++Y S+ ++ + +CL A+                  
Subjt:  VHIRCCYNNKYWVRQWPDSFYIVTAAAEKEEDRSKWSCTLFSAFYMHHGSHEVFGLNHVQLGLAVLYRSYDSNDFLNCLSAE------------------

Query:  ---------------------------DN-------------------------------------------------------------MDDPNTLFKP
                                   DN                                                              ++PN LF P
Subjt:  ---------------------------DN-------------------------------------------------------------MDDPNTLFKP

Query:  VKVDHNIVVLRNMGNNQFCISLTIDGKRNCLNATNRNVSADAHMEVLQAVVSSKIENIEYCINDAKIYGERVWSMAKGDATNKTNAADTVQFTFTFEDKR
         K+D   + LRN+GNN FC+ L+++GK +CLN+    ++ +A ++  +AV+S  I+N+EY +NDA+IYG++V SMAKGDA NKT   DTV F FT+E+K+
Subjt:  VKVDHNIVVLRNMGNNQFCISLTIDGKRNCLNATNRNVSADAHMEVLQAVVSSKIENIEYCINDAKIYGERVWSMAKGDATNKTNAADTVQFTFTFEDKR

Query:  KNS
        K +
Subjt:  KNS

XP_022157630.1 uncharacterized protein LOC111024291 [Momordica charantia]2.1e-12871.51Show/hide
Query:  MDPTLLMRAAEEAELRELENKYKAITRKTTDTSDEGKSVQQLPKYFSLQRFNPSSSDPKTGAYLRCVQDHEILEDGFLKVSGKSVLSPYSKMESEASESS
        MDPTLLMRAAEEAELRELENKYKAITRKTTDTSDEGKSVQQLPKYF+LQRFNPSSSDPKTGAYLRCVQDHEILE GFLKVSGKSVLSPYSKMESEASESS
Subjt:  MDPTLLMRAAEEAELRELENKYKAITRKTTDTSDEGKSVQQLPKYFSLQRFNPSSSDPKTGAYLRCVQDHEILEDGFLKVSGKSVLSPYSKMESEASESS

Query:  PKHVHIRCCYNNKYWVRQWPDSFYIVTAAAEKEEDRSKWSCTLFSAFYMHHGSHEVFGLNHVQLGLAVLYRSYDSNDFLNCLSAED--------------
        PKHVHIR C NNKYWVRQ PDSFYIVTAAAEKEEDRSKW+CTLFSAFYMHHGSHEVFGLNHVQLGLAVLYRSYDSNDFLNCLSAED              
Subjt:  PKHVHIRCCYNNKYWVRQWPDSFYIVTAAAEKEEDRSKWSCTLFSAFYMHHGSHEVFGLNHVQLGLAVLYRSYDSNDFLNCLSAED--------------

Query:  -----------------------NMDDPNTLFKPVKVDHNIVVLRNMGNNQFCI-------SLTIDGKRNCLNATNRNVSAD-------AHMEVLQAVVS
                               +++D + + +    +   + +RN+G+ +F I       +L   G ++  N   + V  D       AHMEVLQAVVS
Subjt:  -----------------------NMDDPNTLFKPVKVDHNIVVLRNMGNNQFCI-------SLTIDGKRNCLNATNRNVSAD-------AHMEVLQAVVS

Query:  SKIENIEYCINDAKIYGERVWSMAKGDATNKTNAADTVQFTFTFEDKRKNS
         KIENIEYCINDAKIYGERVWSMAKGDATNKTNAAD VQFTFTFEDKRKNS
Subjt:  SKIENIEYCINDAKIYGERVWSMAKGDATNKTNAADTVQFTFTFEDKRKNS

XP_038906851.1 uncharacterized protein LOC120092742 [Benincasa hispida]4.6e-6740.26Show/hide
Query:  EEAELRELENKYKAITRKTTDTS-DEGKSVQQLPKYFSLQRFNPSSSDPKTGAYLRCVQDHEILEDGFLKVSGKSVLSPYSKMESEASESSPKHVHIRCC
        EE   +++E+ Y  +  K  D S ++ KS+  +P++F+LQ  NP S  PKT  YLR V + +  E G L  SGK+VLSP+SK ESE SE+ PK  HI+CC
Subjt:  EEAELRELENKYKAITRKTTDTS-DEGKSVQQLPKYFSLQRFNPSSSDPKTGAYLRCVQDHEILEDGFLKVSGKSVLSPYSKMESEASESSPKHVHIRCC

Query:  YNNKYWVRQWPDSFYIVTAAAEKEEDRSKWSCTLFSAFYMHHGSHEVFGLNHVQLGL-------------------------------------------
        YNNKYWVR+  +S YI+  A +KEED+SKW+CTLF   Y     ++ F   HVQ  L                                           
Subjt:  YNNKYWVRQWPDSFYIVTAAAEKEEDRSKWSCTLFSAFYMHHGSHEVFGLNHVQLGL-------------------------------------------

Query:  --------------------------------AVLYRSYDSND-------------------FLNCLSAEDNMDDPNTLFKPVKVDHNIVVLRNMGNNQF
                                        ++++  +  ND                   ++   + E + +DPNT F+PVK+  NIV LRN+GNN F
Subjt:  --------------------------------AVLYRSYDSND-------------------FLNCLSAEDNMDDPNTLFKPVKVDHNIVVLRNMGNNQF

Query:  CISLTIDGKRNCLNATNRNVSADAHMEVLQAVVSSKIENIEYCINDAKIYGERVWSMAKGDATNKTNAADTVQFTFTFEDKRKNS
        C SL++D K +CLNA + N + +A MEV +AV+SSKIENIEY + DAKIYGERVWSMAKGDA NKT AADT+QFTF+FEDKRK +
Subjt:  CISLTIDGKRNCLNATNRNVSADAHMEVLQAVVSSKIENIEYCINDAKIYGERVWSMAKGDATNKTNAADTVQFTFTFEDKRKNS

XP_038906982.1 uncharacterized protein LOC120092830 [Benincasa hispida]3.2e-6840.78Show/hide
Query:  EEAELRELENKYKAITRKTTDTS-DEGKSVQQLPKYFSLQRFNPSSSDPKTGAYLRCVQDHEILEDGFLKVSGKSVLSPYSKMESEASESSPKHVHIRCC
        EE   R++E+KY  +  K  D S ++ KS+  +P++F+LQ  NP S  PKT  YLR V + +  E G L  SGK+VLSP+SK ESE SE+ PK  HI+CC
Subjt:  EEAELRELENKYKAITRKTTDTS-DEGKSVQQLPKYFSLQRFNPSSSDPKTGAYLRCVQDHEILEDGFLKVSGKSVLSPYSKMESEASESSPKHVHIRCC

Query:  YNNKYWVRQWPDSFYIVTAAAEKEEDRSKWSCTLFSAFYMHHGSHEVFGLNHVQLGL-------------------------------------------
        YNNKYWVR+  +S YI+  A +KEED+SKW+CTLF   Y     ++ F   HVQ  L                                           
Subjt:  YNNKYWVRQWPDSFYIVTAAAEKEEDRSKWSCTLFSAFYMHHGSHEVFGLNHVQLGL-------------------------------------------

Query:  --------------------------------AVLYRSYDSND-------------------FLNCLSAEDNMDDPNTLFKPVKVDHNIVVLRNMGNNQF
                                        ++++  +  ND                   ++   + E + +DPNT F+PVK+  NIV LRN+GNN F
Subjt:  --------------------------------AVLYRSYDSND-------------------FLNCLSAEDNMDDPNTLFKPVKVDHNIVVLRNMGNNQF

Query:  CISLTIDGKRNCLNATNRNVSADAHMEVLQAVVSSKIENIEYCINDAKIYGERVWSMAKGDATNKTNAADTVQFTFTFEDKRKNS
        C SL++D K +CLNA + N + +A MEV +AV+SSKIENIEY + DAKIYGERVWSMAKGDA NKT AADT+QFTF+FEDKRK +
Subjt:  CISLTIDGKRNCLNATNRNVSADAHMEVLQAVVSSKIENIEYCINDAKIYGERVWSMAKGDATNKTNAADTVQFTFTFEDKRKNS

TrEMBL top hitse value%identityAlignment
A0A6J1DQ71 uncharacterized protein LOC1110225574.1e-6942.34Show/hide
Query:  EAELRE--LENKYKAITRKTTDTSDEGKSVQQLPKYFSLQRFNPSSSDPKTGAYLRCVQDHEILEDGFLKVSGKSVLSPYSKMESEASESSPKHVHIRCC
        E  LRE  LEN+YK I+ K  + S        +PK+F+LQ + P    PKT  YLR VQDHE   DGFL+ SGK + SP SK  SEASES P+ +HIRC 
Subjt:  EAELRE--LENKYKAITRKTTDTSDEGKSVQQLPKYFSLQRFNPSSSDPKTGAYLRCVQDHEILEDGFLKVSGKSVLSPYSKMESEASESSPKHVHIRCC

Query:  YNNKYWVRQWPDSFYIVTAAAEKEEDRSKWSCTLFSAFYMHHGSHEVFGLNHVQLG-----------------------------------------LAV
        YNNKYWVRQ PDS YIV    ++E D+SKWSCTLF   Y     H+ +   HVQLG                                         L +
Subjt:  YNNKYWVRQWPDSFYIVTAAAEKEEDRSKWSCTLFSAFYMHHGSHEVFGLNHVQLG-----------------------------------------LAV

Query:  L-------------------YRSYDSNDFLN----------------------------------CLSAEDNMDDPNTLFKPVKVDHNIVVLRNMGNNQF
        L                   Y  +   D  N                                   ++ + + DD N+LF+PVK+ +NIV LR++GNN F
Subjt:  L-------------------YRSYDSNDFLN----------------------------------CLSAEDNMDDPNTLFKPVKVDHNIVVLRNMGNNQF

Query:  CISLTIDGKRNCLNATNRNVSADAHMEVLQAVVSSKIENIEYCINDAKIYGERVWSMAKGDATNKTNAADTVQFTFTFEDKRKNS
        C SL+IDGK NCLNA   N   +  ME  +AV+SS+IENIEY + DAKIYGERVWSM KGDA NKT AADTVQFTF+FEDK K +
Subjt:  CISLTIDGKRNCLNATNRNVSADAHMEVLQAVVSSKIENIEYCINDAKIYGERVWSMAKGDATNKTNAADTVQFTFTFEDKRKNS

A0A6J1DTM1 uncharacterized protein LOC1110242911.0e-12871.51Show/hide
Query:  MDPTLLMRAAEEAELRELENKYKAITRKTTDTSDEGKSVQQLPKYFSLQRFNPSSSDPKTGAYLRCVQDHEILEDGFLKVSGKSVLSPYSKMESEASESS
        MDPTLLMRAAEEAELRELENKYKAITRKTTDTSDEGKSVQQLPKYF+LQRFNPSSSDPKTGAYLRCVQDHEILE GFLKVSGKSVLSPYSKMESEASESS
Subjt:  MDPTLLMRAAEEAELRELENKYKAITRKTTDTSDEGKSVQQLPKYFSLQRFNPSSSDPKTGAYLRCVQDHEILEDGFLKVSGKSVLSPYSKMESEASESS

Query:  PKHVHIRCCYNNKYWVRQWPDSFYIVTAAAEKEEDRSKWSCTLFSAFYMHHGSHEVFGLNHVQLGLAVLYRSYDSNDFLNCLSAED--------------
        PKHVHIR C NNKYWVRQ PDSFYIVTAAAEKEEDRSKW+CTLFSAFYMHHGSHEVFGLNHVQLGLAVLYRSYDSNDFLNCLSAED              
Subjt:  PKHVHIRCCYNNKYWVRQWPDSFYIVTAAAEKEEDRSKWSCTLFSAFYMHHGSHEVFGLNHVQLGLAVLYRSYDSNDFLNCLSAED--------------

Query:  -----------------------NMDDPNTLFKPVKVDHNIVVLRNMGNNQFCI-------SLTIDGKRNCLNATNRNVSAD-------AHMEVLQAVVS
                               +++D + + +    +   + +RN+G+ +F I       +L   G ++  N   + V  D       AHMEVLQAVVS
Subjt:  -----------------------NMDDPNTLFKPVKVDHNIVVLRNMGNNQFCI-------SLTIDGKRNCLNATNRNVSAD-------AHMEVLQAVVS

Query:  SKIENIEYCINDAKIYGERVWSMAKGDATNKTNAADTVQFTFTFEDKRKNS
         KIENIEYCINDAKIYGERVWSMAKGDATNKTNAAD VQFTFTFEDKRKNS
Subjt:  SKIENIEYCINDAKIYGERVWSMAKGDATNKTNAADTVQFTFTFEDKRKNS

A0A6J1DUY9 uncharacterized protein LOC1110242824.2e-5032.26Show/hide
Query:  TLLMRAAEEAELRELENKYKAITRKTTDTSDEGKSVQQLPKYFSLQRFNPSSSDPKTGAYLRCVQDHEILEDGFLKVSGKSVLSPYSKMESEASESSPKH
        T LMRA +    R+LE+KY+ IT+K    S++G+ +  +P+YF L+  + ++       YLR + + +    GFL  SG+ ++SPY+K E E S     +
Subjt:  TLLMRAAEEAELRELENKYKAITRKTTDTSDEGKSVQQLPKYFSLQRFNPSSSDPKTGAYLRCVQDHEILEDGFLKVSGKSVLSPYSKMESEASESSPKH

Query:  VHIRCCYNNKYWVRQWPDSFYIVTAAAEKEEDRSKWSCTLFSAFYMHHGSHEVFGLNHVQLGLAVLYRSYDSNDFLNCLSAE------------------
        VH+RCC+NNKYWVR+   S YIV AA E  +D SKWSCTL    Y  H   + +   HVQL  A++Y S+ ++ + +CL A+                  
Subjt:  VHIRCCYNNKYWVRQWPDSFYIVTAAAEKEEDRSKWSCTLFSAFYMHHGSHEVFGLNHVQLGLAVLYRSYDSNDFLNCLSAE------------------

Query:  ---------------------------DN-------------------------------------------------------------MDDPNTLFKP
                                   DN                                                              ++PN LF P
Subjt:  ---------------------------DN-------------------------------------------------------------MDDPNTLFKP

Query:  VKVDHNIVVLRNMGNNQFCISLTIDGKRNCLNATNRNVSADAHMEVLQAVVSSKIENIEYCINDAKIYGERVWSMAKGDATNKTNAADTVQFTFTFEDKR
         K+D   + LRN+GNN FC+ L+++GK +CLN+    ++ +A ++  +AV+S  I+N+EY +NDA+IYG++V SMAKGDA NKT   DTV F FT+E+K+
Subjt:  VKVDHNIVVLRNMGNNQFCISLTIDGKRNCLNATNRNVSADAHMEVLQAVVSSKIENIEYCINDAKIYGERVWSMAKGDATNKTNAADTVQFTFTFEDKR

Query:  KNS
        K +
Subjt:  KNS

A0A6J1GPP7 uncharacterized protein LOC1114563411.1e-4531.95Show/hide
Query:  EEAELRELENKYKAITRKTTDTSDEGKSV-QQLPKYFSLQRFNPSSSDPKTGAYLRCVQDHEILEDGFLKVSGKSVLSPYSKMESEASESSPKHVHIRCC
        +EAE + + ++ + +   + D+ D+ +++ + +PK FSL+         +   YLR + + E   DG L+ SGK+++ PYSK    AS++ P  VHIRCC
Subjt:  EEAELRELENKYKAITRKTTDTSDEGKSV-QQLPKYFSLQRFNPSSSDPKTGAYLRCVQDHEILEDGFLKVSGKSVLSPYSKMESEASESSPKHVHIRCC

Query:  YNNKYWVRQWPDSFYIVTAAAEKEEDRSKWSCTLFSAFYMHHGSHEVFGLNHVQLGLAVLYRSYDSNDFLNCLSA--------EDNM-------------
        YNNK+WVR   DS YI   A E+EED+SKWSCTLF   ++         + HVQL   +     D + + +CL+A        +DN+             
Subjt:  YNNKYWVRQWPDSFYIVTAAAEKEEDRSKWSCTLFSAFYMHHGSHEVFGLNHVQLGLAVLYRSYDSNDFLNCLSA--------EDNM-------------

Query:  -------------------------------------------------------------------------DDPNTLFKPVKVDHNIVVLRNMGNNQF
                                                                                 D+PN LF PVKVD NIV LRN GNN F
Subjt:  -------------------------------------------------------------------------DDPNTLFKPVKVDHNIVVLRNMGNNQF

Query:  CISLTIDGKRNCLNATNRNVSADAHMEVLQAVVSSKIENIEYCINDAKIYGERVWSMAKGDATNKTNAADTVQFTFTFEDKRKNS
        C  LT +GK NCLNA    ++  A +EV++ VV+  IE++EY +NDA++YG+++ +++KG A N T  AD V   F +E K + S
Subjt:  CISLTIDGKRNCLNATNRNVSADAHMEVLQAVVSSKIENIEYCINDAKIYGERVWSMAKGDATNKTNAADTVQFTFTFEDKRKNS

A0A6J1JVU2 uncharacterized protein LOC1114883387.0e-4531.51Show/hide
Query:  EEAELRELENKYKAITRKTTDTSDEGKSVQQLPKYFSLQRFNPSSSDPKTGAYLRCVQDHEILEDGFLKVSGKSVLSPYSKMESEASESSPKHVHIRCCY
        +EAE + + ++ +A+   +    D+    + +P+ FSL+         +   YLR + + E   DG L+ SGK+++ PYSK    AS++ P  VHIRCCY
Subjt:  EEAELRELENKYKAITRKTTDTSDEGKSVQQLPKYFSLQRFNPSSSDPKTGAYLRCVQDHEILEDGFLKVSGKSVLSPYSKMESEASESSPKHVHIRCCY

Query:  NNKYWVRQWPDSFYIVTAAAEKEEDRSKWSCTLFSAFYMHHGSHEVFGLNHVQLGLAVLYRSYDSNDFLNCLSA--------EDNM--------------
        NNK+WVR   DS YI   A E+EED+SKWSCTLF   ++         + HVQL   +     D + + +C++A        +DN+              
Subjt:  NNKYWVRQWPDSFYIVTAAAEKEEDRSKWSCTLFSAFYMHHGSHEVFGLNHVQLGLAVLYRSYDSNDFLNCLSA--------EDNM--------------

Query:  ------------------------------------------------------------------------DDPNTLFKPVKVDHNIVVLRNMGNNQFC
                                                                                D+PN LF PVKVD NIV LRN GNN FC
Subjt:  ------------------------------------------------------------------------DDPNTLFKPVKVDHNIVVLRNMGNNQFC

Query:  ISLTIDGKRNCLNATNRNVSADAHMEVLQAVVSSKIENIEYCINDAKIYGERVWSMAKGDATNKTNAADTVQFTFTFEDKRKNS
          LT +GK NCLNA    ++  A +EV++ VV+  IE++EY +NDA++YG+++ +++KG A N T  AD V   F +E K + S
Subjt:  ISLTIDGKRNCLNATNRNVSADAHMEVLQAVVSSKIENIEYCINDAKIYGERVWSMAKGDATNKTNAADTVQFTFTFEDKRKNS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATCCTACCTTATTAATGAGAGCTGCTGAGGAGGCTGAGTTACGGGAGCTGGAAAACAAGTACAAAGCAATAACAAGAAAGACAACCGACACATCGGATGAAGGTAA
ATCAGTACAGCAACTCCCAAAATATTTTTCTTTGCAACGATTCAACCCAAGTTCTTCAGACCCCAAAACTGGAGCATATCTGCGGTGTGTACAAGATCATGAGATTTTAG
AAGATGGATTCCTCAAAGTCTCTGGGAAAAGTGTGCTGAGTCCTTATTCGAAGATGGAATCCGAGGCGTCCGAGTCCAGCCCCAAGCACGTGCACATAAGATGCTGTTAC
AACAATAAATACTGGGTTCGACAGTGGCCTGACTCTTTCTACATTGTCACTGCTGCAGCAGAGAAAGAAGAGGACCGATCCAAATGGAGCTGCACCTTGTTCTCTGCCTT
CTACATGCATCACGGCTCCCACGAGGTGTTTGGCTTAAACCATGTGCAGCTCGGCTTGGCCGTGCTGTATCGATCCTATGATTCCAATGACTTTTTGAATTGCCTCTCGG
CCGAGGACAATATGGATGATCCCAACACATTGTTTAAGCCAGTGAAAGTTGATCACAACATCGTGGTTCTTCGTAACATGGGCAACAACCAGTTCTGCATATCACTCACC
ATCGACGGAAAGAGAAATTGTTTGAATGCCACCAATCGAAATGTTAGTGCAGATGCCCACATGGAAGTCTTACAAGCTGTAGTATCCAGCAAAATAGAAAACATTGAGTA
TTGCATAAATGATGCCAAAATCTATGGCGAGAGGGTTTGGTCAATGGCCAAAGGAGATGCCACAAACAAAACCAACGCAGCCGACACTGTCCAATTCACATTCACTTTTG
AGGACAAAAGGAAGAACAGCTAG
mRNA sequenceShow/hide mRNA sequence
ATGGATCCTACCTTATTAATGAGAGCTGCTGAGGAGGCTGAGTTACGGGAGCTGGAAAACAAGTACAAAGCAATAACAAGAAAGACAACCGACACATCGGATGAAGGTAA
ATCAGTACAGCAACTCCCAAAATATTTTTCTTTGCAACGATTCAACCCAAGTTCTTCAGACCCCAAAACTGGAGCATATCTGCGGTGTGTACAAGATCATGAGATTTTAG
AAGATGGATTCCTCAAAGTCTCTGGGAAAAGTGTGCTGAGTCCTTATTCGAAGATGGAATCCGAGGCGTCCGAGTCCAGCCCCAAGCACGTGCACATAAGATGCTGTTAC
AACAATAAATACTGGGTTCGACAGTGGCCTGACTCTTTCTACATTGTCACTGCTGCAGCAGAGAAAGAAGAGGACCGATCCAAATGGAGCTGCACCTTGTTCTCTGCCTT
CTACATGCATCACGGCTCCCACGAGGTGTTTGGCTTAAACCATGTGCAGCTCGGCTTGGCCGTGCTGTATCGATCCTATGATTCCAATGACTTTTTGAATTGCCTCTCGG
CCGAGGACAATATGGATGATCCCAACACATTGTTTAAGCCAGTGAAAGTTGATCACAACATCGTGGTTCTTCGTAACATGGGCAACAACCAGTTCTGCATATCACTCACC
ATCGACGGAAAGAGAAATTGTTTGAATGCCACCAATCGAAATGTTAGTGCAGATGCCCACATGGAAGTCTTACAAGCTGTAGTATCCAGCAAAATAGAAAACATTGAGTA
TTGCATAAATGATGCCAAAATCTATGGCGAGAGGGTTTGGTCAATGGCCAAAGGAGATGCCACAAACAAAACCAACGCAGCCGACACTGTCCAATTCACATTCACTTTTG
AGGACAAAAGGAAGAACAGCTAG
Protein sequenceShow/hide protein sequence
MDPTLLMRAAEEAELRELENKYKAITRKTTDTSDEGKSVQQLPKYFSLQRFNPSSSDPKTGAYLRCVQDHEILEDGFLKVSGKSVLSPYSKMESEASESSPKHVHIRCCY
NNKYWVRQWPDSFYIVTAAAEKEEDRSKWSCTLFSAFYMHHGSHEVFGLNHVQLGLAVLYRSYDSNDFLNCLSAEDNMDDPNTLFKPVKVDHNIVVLRNMGNNQFCISLT
IDGKRNCLNATNRNVSADAHMEVLQAVVSSKIENIEYCINDAKIYGERVWSMAKGDATNKTNAADTVQFTFTFEDKRKNS