; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS015848 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS015848
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionConserved peptide upstream open reading frame 46
Genome locationscaffold943_2:421586..422311
RNA-Seq ExpressionMS015848
SyntenyMS015848
Gene Ontology termsGO:0032259 - methylation (biological process)
GO:0016021 - integral component of membrane (cellular component)
GO:0008168 - methyltransferase activity (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0047269.1 Methyltransferase type 11 [Cucumis melo var. makuwa]5.3e-9071.54Show/hide
Query:  MDLKLSKSQILNDPAVARRVFFRVFLFASAVSIIPIVHILTTYDFRNFHLPQSQGCYAAGGQTLKNSDQPPRGSYLFQGHFLNPVWDSFDSVHCQENVNL
        MDLK  KS IL DPA A+RVFFRVFLFASA+S+IPI+HILT+YDF++FHLP+S  C+A+   T    D  PRGSYLFQGHFLNPVWDSF+S+HC+E VNL
Subjt:  MDLKLSKSQILNDPAVARRVFFRVFLFASAVSIIPIVHILTTYDFRNFHLPQSQGCYAAGGQTLKNSDQPPRGSYLFQGHFLNPVWDSFDSVHCQENVNL

Query:  TISAIKLLVDEKHLFNHSAKALFVGGSSSSAVSAMRDLGFSGAVGVDKGRVLSLKRKEFGYRLDYANGSFDFVLFRGK-FKVSVPDLVVGEIERVLNSGG
        TIS IKLLVD KHLFNHSA+ALFVGGSSSSA S ++DLGFS A+GVDKGR +SLKR E GY+LDYAN SFDFVLF GK  KVSVPDLVVGEIER+L+ GG
Subjt:  TISAIKLLVDEKHLFNHSAKALFVGGSSSSAVSAMRDLGFSGAVGVDKGRVLSLKRKEFGYRLDYANGSFDFVLFRGK-FKVSVPDLVVGEIERVLNSGG

Query:  IGAVVSGISAP-----AARVGSLLKSSCVVHSGRVNNFYMTVFKKE
        IGAVV+GIS+P       RV  LLKSSCVV+SG V   Y++VFKK+
Subjt:  IGAVVSGISAP-----AARVGSLLKSSCVVHSGRVNNFYMTVFKKE

KAE8647389.1 hypothetical protein Csa_003425 [Cucumis sativus]4.8e-9171.43Show/hide
Query:  MDLKLSKSQILNDPAVARRVFFRVFLFASAVSIIPIVHILTTYDFRNFHLPQSQGCYAAGGQTLKNSDQPPRGSYLFQGHFLNPVWDSFDSVHCQENVNL
        MDLK  +S IL+DPA A+RVFFRVFLFASA+S+IPI+HILT+YDF++FHLP+S  C+A+   T    D  PRGSYLFQGHFLNPVWDSFDS+HCQ  VNL
Subjt:  MDLKLSKSQILNDPAVARRVFFRVFLFASAVSIIPIVHILTTYDFRNFHLPQSQGCYAAGGQTLKNSDQPPRGSYLFQGHFLNPVWDSFDSVHCQENVNL

Query:  TISAIKLLVDEKHLFNHSAKALFVGGSSSSAVSAMRDLGFSGAVGVDKGRVLSLKRKEFGYRLDYANGSFDFVLFRGKFKVSVPDLVVGEIERVLNSGGI
        TIS IKLLV EKHLFNHSA+ALFVGGSSSSA S + DLGFS AVGVDKGR +SLKR E GY+LDY N SFDFVLF+GK KVSVPDLVVGE+ER+L+ GGI
Subjt:  TISAIKLLVDEKHLFNHSAKALFVGGSSSSAVSAMRDLGFSGAVGVDKGRVLSLKRKEFGYRLDYANGSFDFVLFRGKFKVSVPDLVVGEIERVLNSGGI

Query:  GAVVSGISAP-----AARVGSLLKSSCVVHSGRVNNFYMTVFKKE
        GAVV+GIS+P       RV  LLKSSCVV+SG V   Y++VFKK+
Subjt:  GAVVSGISAP-----AARVGSLLKSSCVVHSGRVNNFYMTVFKKE

XP_011657647.1 uncharacterized protein LOC105435880 [Cucumis sativus]4.8e-9171.43Show/hide
Query:  MDLKLSKSQILNDPAVARRVFFRVFLFASAVSIIPIVHILTTYDFRNFHLPQSQGCYAAGGQTLKNSDQPPRGSYLFQGHFLNPVWDSFDSVHCQENVNL
        MDLK  +S IL+DPA A+RVFFRVFLFASA+S+IPI+HILT+YDF++FHLP+S  C+A+   T    D  PRGSYLFQGHFLNPVWDSFDS+HCQ  VNL
Subjt:  MDLKLSKSQILNDPAVARRVFFRVFLFASAVSIIPIVHILTTYDFRNFHLPQSQGCYAAGGQTLKNSDQPPRGSYLFQGHFLNPVWDSFDSVHCQENVNL

Query:  TISAIKLLVDEKHLFNHSAKALFVGGSSSSAVSAMRDLGFSGAVGVDKGRVLSLKRKEFGYRLDYANGSFDFVLFRGKFKVSVPDLVVGEIERVLNSGGI
        TIS IKLLV EKHLFNHSA+ALFVGGSSSSA S + DLGFS AVGVDKGR +SLKR E GY+LDY N SFDFVLF+GK KVSVPDLVVGE+ER+L+ GGI
Subjt:  TISAIKLLVDEKHLFNHSAKALFVGGSSSSAVSAMRDLGFSGAVGVDKGRVLSLKRKEFGYRLDYANGSFDFVLFRGKFKVSVPDLVVGEIERVLNSGGI

Query:  GAVVSGISAP-----AARVGSLLKSSCVVHSGRVNNFYMTVFKKE
        GAVV+GIS+P       RV  LLKSSCVV+SG V   Y++VFKK+
Subjt:  GAVVSGISAP-----AARVGSLLKSSCVVHSGRVNNFYMTVFKKE

XP_022143860.1 uncharacterized protein LOC111013673 [Momordica charantia]3.6e-13199.59Show/hide
Query:  MDLKLSKSQILNDPAVARRVFFRVFLFASAVSIIPIVHILTTYDFRNFHLPQSQGCYAAGGQTLKNSDQPPRGSYLFQGHFLNPVWDSFDSVHCQENVNL
        MDLKLSKSQILNDPAVARRVFFRVFLFASAVSIIPIVHILTTYDFRNFHLPQSQGCYAAGGQTLKNSDQPPRGSYLFQGHFLNPVWDSFDSVHCQENVNL
Subjt:  MDLKLSKSQILNDPAVARRVFFRVFLFASAVSIIPIVHILTTYDFRNFHLPQSQGCYAAGGQTLKNSDQPPRGSYLFQGHFLNPVWDSFDSVHCQENVNL

Query:  TISAIKLLVDEKHLFNHSAKALFVGGSSSSAVSAMRDLGFSGAVGVDKGRVLSLKRKEFGYRLDYANGSFDFVLFRGKFKVSVPDLVVGEIERVLNSGGI
        TISAIKLLVDEKHLFNHSAKALFVGGSSSSAVSAMRDLGFSGAVGVDKGRVLSLKRKEFGYRLDYANGSFDFV+FRGKFKVSVPDLVVGEIERVLNSGGI
Subjt:  TISAIKLLVDEKHLFNHSAKALFVGGSSSSAVSAMRDLGFSGAVGVDKGRVLSLKRKEFGYRLDYANGSFDFVLFRGKFKVSVPDLVVGEIERVLNSGGI

Query:  GAVVSGISAPAARVGSLLKSSCVVHSGRVNNFYMTVFKKEFQ
        GAVVSGISAPAARVGSLLKSSCVVHSGRVNNFYMTVFKKEFQ
Subjt:  GAVVSGISAPAARVGSLLKSSCVVHSGRVNNFYMTVFKKEFQ

XP_038883488.1 uncharacterized protein LOC120074441 [Benincasa hispida]2.0e-9271.43Show/hide
Query:  MDLKLSKSQILNDPAVARRVFFRVFLFASAVSIIPIVHILTTYDFRNFHLPQSQGCYAAGGQTLKNSDQPPRGSYLFQGHFLNPVWDSFDSVHCQENVNL
        MDLK  KS IL+D A ARR+FFR+FLF S +S+IPI+HILT+YDF++FHLP+S  C+A         DQ PRGSYLFQGHFLNPVWDSFDSVHCQE VNL
Subjt:  MDLKLSKSQILNDPAVARRVFFRVFLFASAVSIIPIVHILTTYDFRNFHLPQSQGCYAAGGQTLKNSDQPPRGSYLFQGHFLNPVWDSFDSVHCQENVNL

Query:  TISAIKLLVDEKHLFNHSAKALFVGGSSSSAVSAMRDLGFSGAVGVDKGRVLSLKRKEFGYRLDYANGSFDFVLFRGKFKVSVPDLVVGEIERVLNSGGI
        T+S IKLLV+EKHLFNHSA+ALFVGGSSSSA S ++DLGFSGAVGVDKGR +SL+++  GY+LDY+N SFDFVLF+GK KVSVPDLVVGEIER+L  GGI
Subjt:  TISAIKLLVDEKHLFNHSAKALFVGGSSSSAVSAMRDLGFSGAVGVDKGRVLSLKRKEFGYRLDYANGSFDFVLFRGKFKVSVPDLVVGEIERVLNSGGI

Query:  GAVVSGISAP-----AARVGSLLKSSCVVHSGRVNNFYMTVFKKE
        GAVV+GIS+P     A RVG LLKSSCVV+SG VN  Y++VFKK+
Subjt:  GAVVSGISAP-----AARVGSLLKSSCVVHSGRVNNFYMTVFKKE

TrEMBL top hitse value%identityAlignment
A0A0A0KEM7 Uncharacterized protein2.3e-9171.43Show/hide
Query:  MDLKLSKSQILNDPAVARRVFFRVFLFASAVSIIPIVHILTTYDFRNFHLPQSQGCYAAGGQTLKNSDQPPRGSYLFQGHFLNPVWDSFDSVHCQENVNL
        MDLK  +S IL+DPA A+RVFFRVFLFASA+S+IPI+HILT+YDF++FHLP+S  C+A+   T    D  PRGSYLFQGHFLNPVWDSFDS+HCQ  VNL
Subjt:  MDLKLSKSQILNDPAVARRVFFRVFLFASAVSIIPIVHILTTYDFRNFHLPQSQGCYAAGGQTLKNSDQPPRGSYLFQGHFLNPVWDSFDSVHCQENVNL

Query:  TISAIKLLVDEKHLFNHSAKALFVGGSSSSAVSAMRDLGFSGAVGVDKGRVLSLKRKEFGYRLDYANGSFDFVLFRGKFKVSVPDLVVGEIERVLNSGGI
        TIS IKLLV EKHLFNHSA+ALFVGGSSSSA S + DLGFS AVGVDKGR +SLKR E GY+LDY N SFDFVLF+GK KVSVPDLVVGE+ER+L+ GGI
Subjt:  TISAIKLLVDEKHLFNHSAKALFVGGSSSSAVSAMRDLGFSGAVGVDKGRVLSLKRKEFGYRLDYANGSFDFVLFRGKFKVSVPDLVVGEIERVLNSGGI

Query:  GAVVSGISAP-----AARVGSLLKSSCVVHSGRVNNFYMTVFKKE
        GAVV+GIS+P       RV  LLKSSCVV+SG V   Y++VFKK+
Subjt:  GAVVSGISAP-----AARVGSLLKSSCVVHSGRVNNFYMTVFKKE

A0A1S3BMI8 uncharacterized protein LOC1034916502.6e-9071.54Show/hide
Query:  MDLKLSKSQILNDPAVARRVFFRVFLFASAVSIIPIVHILTTYDFRNFHLPQSQGCYAAGGQTLKNSDQPPRGSYLFQGHFLNPVWDSFDSVHCQENVNL
        MDLK  KS IL DPA A+RVFFRVFLFASA+S+IPI+HILT+YDF++FHLP+S  C+A+   T    D  PRGSYLFQGHFLNPVWDSF+S+HC+E VNL
Subjt:  MDLKLSKSQILNDPAVARRVFFRVFLFASAVSIIPIVHILTTYDFRNFHLPQSQGCYAAGGQTLKNSDQPPRGSYLFQGHFLNPVWDSFDSVHCQENVNL

Query:  TISAIKLLVDEKHLFNHSAKALFVGGSSSSAVSAMRDLGFSGAVGVDKGRVLSLKRKEFGYRLDYANGSFDFVLFRGK-FKVSVPDLVVGEIERVLNSGG
        TIS IKLLVD KHLFNHSA+ALFVGGSSSSA S ++DLGFS A+GVDKGR +SLKR E GY+LDYAN SFDFVLF GK  KVSVPDLVVGEIER+L+ GG
Subjt:  TISAIKLLVDEKHLFNHSAKALFVGGSSSSAVSAMRDLGFSGAVGVDKGRVLSLKRKEFGYRLDYANGSFDFVLFRGK-FKVSVPDLVVGEIERVLNSGG

Query:  IGAVVSGISAP-----AARVGSLLKSSCVVHSGRVNNFYMTVFKKE
        IGAVV+GIS+P       RV  LLKSSCVV+SG V   Y++VFKK+
Subjt:  IGAVVSGISAP-----AARVGSLLKSSCVVHSGRVNNFYMTVFKKE

A0A5A7U1B0 Methyltransferase type 112.6e-9071.54Show/hide
Query:  MDLKLSKSQILNDPAVARRVFFRVFLFASAVSIIPIVHILTTYDFRNFHLPQSQGCYAAGGQTLKNSDQPPRGSYLFQGHFLNPVWDSFDSVHCQENVNL
        MDLK  KS IL DPA A+RVFFRVFLFASA+S+IPI+HILT+YDF++FHLP+S  C+A+   T    D  PRGSYLFQGHFLNPVWDSF+S+HC+E VNL
Subjt:  MDLKLSKSQILNDPAVARRVFFRVFLFASAVSIIPIVHILTTYDFRNFHLPQSQGCYAAGGQTLKNSDQPPRGSYLFQGHFLNPVWDSFDSVHCQENVNL

Query:  TISAIKLLVDEKHLFNHSAKALFVGGSSSSAVSAMRDLGFSGAVGVDKGRVLSLKRKEFGYRLDYANGSFDFVLFRGK-FKVSVPDLVVGEIERVLNSGG
        TIS IKLLVD KHLFNHSA+ALFVGGSSSSA S ++DLGFS A+GVDKGR +SLKR E GY+LDYAN SFDFVLF GK  KVSVPDLVVGEIER+L+ GG
Subjt:  TISAIKLLVDEKHLFNHSAKALFVGGSSSSAVSAMRDLGFSGAVGVDKGRVLSLKRKEFGYRLDYANGSFDFVLFRGK-FKVSVPDLVVGEIERVLNSGG

Query:  IGAVVSGISAP-----AARVGSLLKSSCVVHSGRVNNFYMTVFKKE
        IGAVV+GIS+P       RV  LLKSSCVV+SG V   Y++VFKK+
Subjt:  IGAVVSGISAP-----AARVGSLLKSSCVVHSGRVNNFYMTVFKKE

A0A6J1CPZ9 uncharacterized protein LOC1110136731.8e-13199.59Show/hide
Query:  MDLKLSKSQILNDPAVARRVFFRVFLFASAVSIIPIVHILTTYDFRNFHLPQSQGCYAAGGQTLKNSDQPPRGSYLFQGHFLNPVWDSFDSVHCQENVNL
        MDLKLSKSQILNDPAVARRVFFRVFLFASAVSIIPIVHILTTYDFRNFHLPQSQGCYAAGGQTLKNSDQPPRGSYLFQGHFLNPVWDSFDSVHCQENVNL
Subjt:  MDLKLSKSQILNDPAVARRVFFRVFLFASAVSIIPIVHILTTYDFRNFHLPQSQGCYAAGGQTLKNSDQPPRGSYLFQGHFLNPVWDSFDSVHCQENVNL

Query:  TISAIKLLVDEKHLFNHSAKALFVGGSSSSAVSAMRDLGFSGAVGVDKGRVLSLKRKEFGYRLDYANGSFDFVLFRGKFKVSVPDLVVGEIERVLNSGGI
        TISAIKLLVDEKHLFNHSAKALFVGGSSSSAVSAMRDLGFSGAVGVDKGRVLSLKRKEFGYRLDYANGSFDFV+FRGKFKVSVPDLVVGEIERVLNSGGI
Subjt:  TISAIKLLVDEKHLFNHSAKALFVGGSSSSAVSAMRDLGFSGAVGVDKGRVLSLKRKEFGYRLDYANGSFDFVLFRGKFKVSVPDLVVGEIERVLNSGGI

Query:  GAVVSGISAPAARVGSLLKSSCVVHSGRVNNFYMTVFKKEFQ
        GAVVSGISAPAARVGSLLKSSCVVHSGRVNNFYMTVFKKEFQ
Subjt:  GAVVSGISAPAARVGSLLKSSCVVHSGRVNNFYMTVFKKEFQ

A0A6J1EDK4 uncharacterized protein LOC1114332234.5e-8769.8Show/hide
Query:  MDLKLSKSQILNDPAVARRVFFRVFLFASAVSIIPIVHILTTYDFRNFHLPQSQGCYAAGGQTLKNSDQPPRGSYLFQGHFLNPVWDSFDSVHCQENVNL
        MDLKL KS IL+D A ARR+ FR+FLFA AVSIIP VHI T+YDF++FHLP+S  C+AAG      +DQ PRGSYLFQGHFLNP+WDS +S HCQE VNL
Subjt:  MDLKLSKSQILNDPAVARRVFFRVFLFASAVSIIPIVHILTTYDFRNFHLPQSQGCYAAGGQTLKNSDQPPRGSYLFQGHFLNPVWDSFDSVHCQENVNL

Query:  TISAIKLLVDEKHLFNHSAKALFVGGSSSSAVSAMRDLGFSGAVGVDKGRVLSLKRKEFGYRLDYANGSFDFVLFRGKFKVSVPDLVVGEIERVLNSGGI
        TIS I+ LVDEKHLFNHSA+ALFVG SSS+A S ++DLGF GAVG+DKGR +S+K++E GY+LDY N SFDFVLFRGKFK+SVPDLVVGEIERVL  GG 
Subjt:  TISAIKLLVDEKHLFNHSAKALFVGGSSSSAVSAMRDLGFSGAVGVDKGRVLSLKRKEFGYRLDYANGSFDFVLFRGKFKVSVPDLVVGEIERVLNSGGI

Query:  GAVVSGISAP-----AARVGSLLKSSCVVHSGRVNNFYMTVFKKE
        GAVV GI++P     A R+ SLLKSSCVV S  VNN  +TVFKK+
Subjt:  GAVVSGISAP-----AARVGSLLKSSCVVHSGRVNNFYMTVFKKE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G53400.1 BEST Arabidopsis thaliana protein match is: conserved peptide upstream open reading frame 47 (TAIR:AT5G03190.1)8.1e-2030.8Show/hide
Query:  RRVFFRVFLFASAVSIIPIVHILTTYDFRNFHLPQSQGCYAAGGQTLKNSDQPPR-----------GSYLFQGH-------FLNPVWDSFDSVHCQENVN
        RRV  R  +   A S++ ++  L              G Y   G+T+ N+ QP +           G +LF G+       FL PVW+  +S  C++N+ 
Subjt:  RRVFFRVFLFASAVSIIPIVHILTTYDFRNFHLPQSQGCYAAGGQTLKNSDQPPR-----------GSYLFQGH-------FLNPVWDSFDSVHCQENVN

Query:  LTISAIKLLVDEKHLFNHSAKALFVGGSSSSAVSAMRDLGFSGAVGVDKGRVLSLKRKEFGYRLDYANGSFDFVLFRGKFKVSVPDLVVGEIERVLNSGG
        LT   ++ L    +L ++ +KAL +G  S SAV AM   G S         V + K ++F   L Y + SF FV       V+VP  +V EIER+L  GG
Subjt:  LTISAIKLLVDEKHLFNHSAKALFVGGSSSSAVSAMRDLGFSGAVGVDKGRVLSLKRKEFGYRLDYANGSFDFVLFRGKFKVSVPDLVVGEIERVLNSGG

Query:  IGAVVSGISA---------PAARVGSLLKSSCVVHSGRVNNFYMTVFKKE
         GA++ G ++           + V SLLK+S VVH   +    + VFK++
Subjt:  IGAVVSGISA---------PAARVGSLLKSSCVVHSGRVNNFYMTVFKKE

AT5G03190.1 conserved peptide upstream open reading frame 475.1e-1430.49Show/hide
Query:  MDLKLSKSQILNDPAVARRVFFRVFLFASAVSIIPIVHILTTYDFRNFHLPQSQGCYAAGGQTLKNSDQ-PPRGSYLFQGHFLNPVWDSFDSVHCQENVN
        M +K+ K  I    +  R   FR  + ASA+S++P++ +         H          G   L +  +    G  LF    + P W   ++    + V 
Subjt:  MDLKLSKSQILNDPAVARRVFFRVFLFASAVSIIPIVHILTTYDFRNFHLPQSQGCYAAGGQTLKNSDQ-PPRGSYLFQGHFLNPVWDSFDSVHCQENVN

Query:  LTISAIKLLVDE---KHLFNHSAKALFVGGSSSSAVSAMRDLGFSGAVGVDKGRVLSLKRKEFGYRLDYA-NGSFDFVLFRGKFKVSVPDLVVGEIERVL
             I  LVDE     L ++ AK L +G  S SAVS  +++GFS   GV K  + S   ++    L+ + + SFDFVL      V+ P L+V E+ERVL
Subjt:  LTISAIKLLVDE---KHLFNHSAKALFVGGSSSSAVSAMRDLGFSGAVGVDKGRVLSLKRKEFGYRLDYA-NGSFDFVLFRGKFKVSVPDLVVGEIERVL

Query:  NSGGIGAVVSGISAP--AARVGSLLKSSCVVHSGRVNNFYMTVFKK
          GG GAV+   +A      V S LK S +V    ++ F + VFK+
Subjt:  NSGGIGAVVSGISAP--AARVGSLLKSSCVVHSGRVNNFYMTVFKK

AT5G03190.2 conserved peptide upstream open reading frame 471.5e-1330.87Show/hide
Query:  ARRVFFRVFLFASAVSIIPIVHILTTYDFRNFHLPQSQGCYAAGGQTLKNSDQ-PPRGSYLFQGHFLNPVWDSFDSVHCQENVNLTISAIKLLVDE---K
        +R   FR  + ASA+S++P++ +         H          G   L +  +    G  LF    + P W   ++    + V      I  LVDE    
Subjt:  ARRVFFRVFLFASAVSIIPIVHILTTYDFRNFHLPQSQGCYAAGGQTLKNSDQ-PPRGSYLFQGHFLNPVWDSFDSVHCQENVNLTISAIKLLVDE---K

Query:  HLFNHSAKALFVGGSSSSAVSAMRDLGFSGAVGVDKGRVLSLKRKEFGYRLDYA-NGSFDFVLFRGKFKVSVPDLVVGEIERVLNSGGIGAVVSGISAP-
         L ++ AK L +G  S SAVS  +++GFS   GV K  + S   ++    L+ + + SFDFVL      V+ P L+V E+ERVL  GG GAV+   +A  
Subjt:  HLFNHSAKALFVGGSSSSAVSAMRDLGFSGAVGVDKGRVLSLKRKEFGYRLDYA-NGSFDFVLFRGKFKVSVPDLVVGEIERVLNSGGIGAVVSGISAP-

Query:  -AARVGSLLKSSCVVHSGRVNNFYMTVFKK
            V S LK S +V    ++ F + VFK+
Subjt:  -AARVGSLLKSSCVVHSGRVNNFYMTVFKK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATTTAAAGCTCTCGAAATCGCAGATCTTGAACGACCCTGCAGTTGCGCGGCGCGTGTTCTTCCGCGTCTTTCTATTCGCCTCCGCCGTCTCCATCATTCCGATCGT
CCACATTCTCACCACTTACGATTTCAGAAACTTCCATTTGCCCCAATCCCAAGGCTGCTACGCCGCCGGCGGCCAAACCCTAAAAAACTCCGATCAGCCCCCCCGGGGGT
CCTATCTGTTCCAGGGCCATTTCCTGAACCCCGTCTGGGATTCTTTCGATTCCGTCCATTGCCAAGAAAATGTGAATCTCACGATCTCGGCGATCAAGTTGCTGGTGGAT
GAGAAGCATTTGTTCAACCACAGCGCGAAGGCTCTGTTCGTTGGGGGGAGTTCGTCCTCCGCCGTGTCGGCGATGAGGGATCTGGGGTTTTCCGGTGCCGTCGGCGTCGA
TAAGGGGCGGGTTCTCTCGCTGAAGCGGAAGGAGTTTGGGTATAGACTCGATTACGCCAATGGGTCCTTCGATTTCGTTCTGTTCAGGGGAAAATTTAAGGTCTCTGTTC
CTGATTTGGTGGTGGGCGAGATTGAGCGCGTTCTTAACTCCGGCGGAATTGGGGCGGTTGTTTCCGGCATCAGTGCTCCGGCCGCCCGAGTGGGGAGCTTACTGAAATCT
TCCTGTGTTGTACATTCGGGCCGTGTAAATAACTTCTATATGACTGTGTTCAAGAAGGAATTTCAA
mRNA sequenceShow/hide mRNA sequence
ATGGATTTAAAGCTCTCGAAATCGCAGATCTTGAACGACCCTGCAGTTGCGCGGCGCGTGTTCTTCCGCGTCTTTCTATTCGCCTCCGCCGTCTCCATCATTCCGATCGT
CCACATTCTCACCACTTACGATTTCAGAAACTTCCATTTGCCCCAATCCCAAGGCTGCTACGCCGCCGGCGGCCAAACCCTAAAAAACTCCGATCAGCCCCCCCGGGGGT
CCTATCTGTTCCAGGGCCATTTCCTGAACCCCGTCTGGGATTCTTTCGATTCCGTCCATTGCCAAGAAAATGTGAATCTCACGATCTCGGCGATCAAGTTGCTGGTGGAT
GAGAAGCATTTGTTCAACCACAGCGCGAAGGCTCTGTTCGTTGGGGGGAGTTCGTCCTCCGCCGTGTCGGCGATGAGGGATCTGGGGTTTTCCGGTGCCGTCGGCGTCGA
TAAGGGGCGGGTTCTCTCGCTGAAGCGGAAGGAGTTTGGGTATAGACTCGATTACGCCAATGGGTCCTTCGATTTCGTTCTGTTCAGGGGAAAATTTAAGGTCTCTGTTC
CTGATTTGGTGGTGGGCGAGATTGAGCGCGTTCTTAACTCCGGCGGAATTGGGGCGGTTGTTTCCGGCATCAGTGCTCCGGCCGCCCGAGTGGGGAGCTTACTGAAATCT
TCCTGTGTTGTACATTCGGGCCGTGTAAATAACTTCTATATGACTGTGTTCAAGAAGGAATTTCAA
Protein sequenceShow/hide protein sequence
MDLKLSKSQILNDPAVARRVFFRVFLFASAVSIIPIVHILTTYDFRNFHLPQSQGCYAAGGQTLKNSDQPPRGSYLFQGHFLNPVWDSFDSVHCQENVNLTISAIKLLVD
EKHLFNHSAKALFVGGSSSSAVSAMRDLGFSGAVGVDKGRVLSLKRKEFGYRLDYANGSFDFVLFRGKFKVSVPDLVVGEIERVLNSGGIGAVVSGISAPAARVGSLLKS
SCVVHSGRVNNFYMTVFKKEFQ