; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS024285 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS024285
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionProtein of unknown function (DUF962)
Genome locationscaffold30:1595372..1598490
RNA-Seq ExpressionMS024285
SyntenyMS024285
Gene Ontology termsGO:0046521 - sphingoid catabolic process (biological process)
GO:0005783 - endoplasmic reticulum (cellular component)
GO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR009305 - 2-hydroxy-palmitic acid dioxygenase Mpo1-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004138174.1 uncharacterized endoplasmic reticulum membrane protein YGL010W [Cucumis sativus]1.4e-9285.07Show/hide
Query:  MGKTGLFDLETQFAFYGAYHSNPINIFIHVLFVWPIFFTALMYLYFTPSLYSVPKTLCGFEHGLVLNFGFLFTLIYAVSYVAFDKRAGSMAALLCLVCWG
        MGKTGLFDLE QFAFYGAYHSNP+NIFIHVLFVWPIFFT+LMYLYFTPS Y++PK+ CGF+HGLVLNFGFLFTL+YA  YV FDKRAGSMAALLC VCW 
Subjt:  MGKTGLFDLETQFAFYGAYHSNPINIFIHVLFVWPIFFTALMYLYFTPSLYSVPKTLCGFEHGLVLNFGFLFTLIYAVSYVAFDKRAGSMAALLCLVCWG

Query:  GASFLANSLGYSLTWKVVLAAQLFCWTNQFIGHGVFEVRQKRAPALLDNLAQAFLMAPFFVFLEVLQNLFRYEPYPGFNASVQAKIKEEIKEWKEKNEKL
        GASF+AN LGYS TWKVVLAAQLFCWTNQFIGHGVFE   KRAPALLDNLAQAFLMAPFFV LEVLQ+LF+YEPYPGF+ASVQAKIK +I+EWKE  EKL
Subjt:  GASFLANSLGYSLTWKVVLAAQLFCWTNQFIGHGVFEVRQKRAPALLDNLAQAFLMAPFFVFLEVLQNLFRYEPYPGFNASVQAKIKEEIKEWKEKNEKL

Query:  S
        S
Subjt:  S

XP_008453216.1 PREDICTED: uncharacterized endoplasmic reticulum membrane protein YGL010W [Cucumis melo]1.8e-9284.58Show/hide
Query:  MGKTGLFDLETQFAFYGAYHSNPINIFIHVLFVWPIFFTALMYLYFTPSLYSVPKTLCGFEHGLVLNFGFLFTLIYAVSYVAFDKRAGSMAALLCLVCWG
        MGKTGLFDLE QFAFYGAYHSNP+NIFIHVLFVWPIFFT+LMYLYFTPS Y++PK+ CGF+HGLVLNFGF FTL+YA  YV FDKRAGSMAALLC VCW 
Subjt:  MGKTGLFDLETQFAFYGAYHSNPINIFIHVLFVWPIFFTALMYLYFTPSLYSVPKTLCGFEHGLVLNFGFLFTLIYAVSYVAFDKRAGSMAALLCLVCWG

Query:  GASFLANSLGYSLTWKVVLAAQLFCWTNQFIGHGVFEVRQKRAPALLDNLAQAFLMAPFFVFLEVLQNLFRYEPYPGFNASVQAKIKEEIKEWKEKNEKL
        GAS++AN LGYS TWKVVLAAQLFCWTNQFIGHGVFE   KRAPALLDNLAQAFLMAPFFV LEVLQ+LF+YEPYPGF+ASVQAKIK +I+EWKEK EKL
Subjt:  GASFLANSLGYSLTWKVVLAAQLFCWTNQFIGHGVFEVRQKRAPALLDNLAQAFLMAPFFVFLEVLQNLFRYEPYPGFNASVQAKIKEEIKEWKEKNEKL

Query:  S
        S
Subjt:  S

XP_022135210.1 uncharacterized endoplasmic reticulum membrane protein C16E8.02 isoform X1 [Momordica charantia]3.0e-15697.6Show/hide
Query:  MSFSATAKIPQFQPLPLNSRSNGNGNPLFFRSSSRFLGSTLGIRFPSLSSSTRRRRSSTVVAVADDVKDMDNNLKPSSNPNPGLSIFPEILMGKTGLFDL
        MSFSATAKIPQFQPLPLNSRSNGNGNPLFFRSSSRFLGSTLGIRFPSLSSSTRRRRSSTVVAVADDVKDMDNNLKPSSNPNPGLSIFPEILMGKTGLFDL
Subjt:  MSFSATAKIPQFQPLPLNSRSNGNGNPLFFRSSSRFLGSTLGIRFPSLSSSTRRRRSSTVVAVADDVKDMDNNLKPSSNPNPGLSIFPEILMGKTGLFDL

Query:  ETQFAFYGAYHSNPINIFIHVLFVWPIFFTALMYLYFTPSLYSVPKTLCGFEHGLVLNFGFLFTLIYAVSYVAFDKRAGSMAALLCLVCWGGASFLANSL
        ETQFAFYGAYHSNPINIFIHVLFVWPIFFTALMYLYFTPSLYS+PKTLCGFEHGLVLNFGFLFTLI AVSYVAFDKRAGSMAALLCLVCWGGA FLANSL
Subjt:  ETQFAFYGAYHSNPINIFIHVLFVWPIFFTALMYLYFTPSLYSVPKTLCGFEHGLVLNFGFLFTLIYAVSYVAFDKRAGSMAALLCLVCWGGASFLANSL

Query:  GYSLTWKVVLAAQLFCWTNQFIGHGVFEVRQKRAPALLDNLAQAFLMAPFFVFLEVLQNLFRYEPYPGFNASVQAKIKEEIKEWKEKNEKLS
        GYSLTWKVVLAAQLFCWTNQFIGHGVFE   KRAPALLDNLAQAFLMAPFFVFLEVLQNLFRYEPYPGFNASVQAKIKEEIKEWKEK EKLS
Subjt:  GYSLTWKVVLAAQLFCWTNQFIGHGVFEVRQKRAPALLDNLAQAFLMAPFFVFLEVLQNLFRYEPYPGFNASVQAKIKEEIKEWKEKNEKLS

XP_022135211.1 uncharacterized endoplasmic reticulum membrane protein C16E8.02 isoform X2 [Momordica charantia]3.5e-10496.52Show/hide
Query:  MGKTGLFDLETQFAFYGAYHSNPINIFIHVLFVWPIFFTALMYLYFTPSLYSVPKTLCGFEHGLVLNFGFLFTLIYAVSYVAFDKRAGSMAALLCLVCWG
        MGKTGLFDLETQFAFYGAYHSNPINIFIHVLFVWPIFFTALMYLYFTPSLYS+PKTLCGFEHGLVLNFGFLFTLI AVSYVAFDKRAGSMAALLCLVCWG
Subjt:  MGKTGLFDLETQFAFYGAYHSNPINIFIHVLFVWPIFFTALMYLYFTPSLYSVPKTLCGFEHGLVLNFGFLFTLIYAVSYVAFDKRAGSMAALLCLVCWG

Query:  GASFLANSLGYSLTWKVVLAAQLFCWTNQFIGHGVFEVRQKRAPALLDNLAQAFLMAPFFVFLEVLQNLFRYEPYPGFNASVQAKIKEEIKEWKEKNEKL
        GA FLANSLGYSLTWKVVLAAQLFCWTNQFIGHGVFE   KRAPALLDNLAQAFLMAPFFVFLEVLQNLFRYEPYPGFNASVQAKIKEEIKEWKEK EKL
Subjt:  GASFLANSLGYSLTWKVVLAAQLFCWTNQFIGHGVFEVRQKRAPALLDNLAQAFLMAPFFVFLEVLQNLFRYEPYPGFNASVQAKIKEEIKEWKEKNEKL

Query:  S
        S
Subjt:  S

XP_022933403.1 uncharacterized endoplasmic reticulum membrane protein C16E8.02 [Cucurbita moschata]1.1e-9285.07Show/hide
Query:  MGKTGLFDLETQFAFYGAYHSNPINIFIHVLFVWPIFFTALMYLYFTPSLYSVPKTLCGFEHGLVLNFGFLFTLIYAVSYVAFDKRAGSMAALLCLVCWG
        MGKT  FDLE  FAFYGAYHSNP+NIFIHVLFVWPIFFT LMYLYFTPS Y++PK+ CGF+HGLVLNFGFLFTLIYA SYV FDKRAGSMAALLC VCW 
Subjt:  MGKTGLFDLETQFAFYGAYHSNPINIFIHVLFVWPIFFTALMYLYFTPSLYSVPKTLCGFEHGLVLNFGFLFTLIYAVSYVAFDKRAGSMAALLCLVCWG

Query:  GASFLANSLGYSLTWKVVLAAQLFCWTNQFIGHGVFEVRQKRAPALLDNLAQAFLMAPFFVFLEVLQNLFRYEPYPGFNASVQAKIKEEIKEWKEKNEKL
        GASF++N LGYS TWKVVLAAQLFCWTNQFIGHGVFE   KRAPALLDNLAQAFLMAPFFVFLEVLQ LF+YEPYPGF+ASVQAKI+ +IKEWKEK EKL
Subjt:  GASFLANSLGYSLTWKVVLAAQLFCWTNQFIGHGVFEVRQKRAPALLDNLAQAFLMAPFFVFLEVLQNLFRYEPYPGFNASVQAKIKEEIKEWKEKNEKL

Query:  S
        S
Subjt:  S

TrEMBL top hitse value%identityAlignment
A0A0A0LP54 Uncharacterized protein6.7e-9385.07Show/hide
Query:  MGKTGLFDLETQFAFYGAYHSNPINIFIHVLFVWPIFFTALMYLYFTPSLYSVPKTLCGFEHGLVLNFGFLFTLIYAVSYVAFDKRAGSMAALLCLVCWG
        MGKTGLFDLE QFAFYGAYHSNP+NIFIHVLFVWPIFFT+LMYLYFTPS Y++PK+ CGF+HGLVLNFGFLFTL+YA  YV FDKRAGSMAALLC VCW 
Subjt:  MGKTGLFDLETQFAFYGAYHSNPINIFIHVLFVWPIFFTALMYLYFTPSLYSVPKTLCGFEHGLVLNFGFLFTLIYAVSYVAFDKRAGSMAALLCLVCWG

Query:  GASFLANSLGYSLTWKVVLAAQLFCWTNQFIGHGVFEVRQKRAPALLDNLAQAFLMAPFFVFLEVLQNLFRYEPYPGFNASVQAKIKEEIKEWKEKNEKL
        GASF+AN LGYS TWKVVLAAQLFCWTNQFIGHGVFE   KRAPALLDNLAQAFLMAPFFV LEVLQ+LF+YEPYPGF+ASVQAKIK +I+EWKE  EKL
Subjt:  GASFLANSLGYSLTWKVVLAAQLFCWTNQFIGHGVFEVRQKRAPALLDNLAQAFLMAPFFVFLEVLQNLFRYEPYPGFNASVQAKIKEEIKEWKEKNEKL

Query:  S
        S
Subjt:  S

A0A5A7UQ13 Putative endoplasmic reticulum membrane protein8.7e-9384.58Show/hide
Query:  MGKTGLFDLETQFAFYGAYHSNPINIFIHVLFVWPIFFTALMYLYFTPSLYSVPKTLCGFEHGLVLNFGFLFTLIYAVSYVAFDKRAGSMAALLCLVCWG
        MGKTGLFDLE QFAFYGAYHSNP+NIFIHVLFVWPIFFT+LMYLYFTPS Y++PK+ CGF+HGLVLNFGF FTL+YA  YV FDKRAGSMAALLC VCW 
Subjt:  MGKTGLFDLETQFAFYGAYHSNPINIFIHVLFVWPIFFTALMYLYFTPSLYSVPKTLCGFEHGLVLNFGFLFTLIYAVSYVAFDKRAGSMAALLCLVCWG

Query:  GASFLANSLGYSLTWKVVLAAQLFCWTNQFIGHGVFEVRQKRAPALLDNLAQAFLMAPFFVFLEVLQNLFRYEPYPGFNASVQAKIKEEIKEWKEKNEKL
        GAS++AN LGYS TWKVVLAAQLFCWTNQFIGHGVFE   KRAPALLDNLAQAFLMAPFFV LEVLQ+LF+YEPYPGF+ASVQAKIK +I+EWKEK EKL
Subjt:  GASFLANSLGYSLTWKVVLAAQLFCWTNQFIGHGVFEVRQKRAPALLDNLAQAFLMAPFFVFLEVLQNLFRYEPYPGFNASVQAKIKEEIKEWKEKNEKL

Query:  S
        S
Subjt:  S

A0A6J1C006 uncharacterized endoplasmic reticulum membrane protein C16E8.02 isoform X21.7e-10496.52Show/hide
Query:  MGKTGLFDLETQFAFYGAYHSNPINIFIHVLFVWPIFFTALMYLYFTPSLYSVPKTLCGFEHGLVLNFGFLFTLIYAVSYVAFDKRAGSMAALLCLVCWG
        MGKTGLFDLETQFAFYGAYHSNPINIFIHVLFVWPIFFTALMYLYFTPSLYS+PKTLCGFEHGLVLNFGFLFTLI AVSYVAFDKRAGSMAALLCLVCWG
Subjt:  MGKTGLFDLETQFAFYGAYHSNPINIFIHVLFVWPIFFTALMYLYFTPSLYSVPKTLCGFEHGLVLNFGFLFTLIYAVSYVAFDKRAGSMAALLCLVCWG

Query:  GASFLANSLGYSLTWKVVLAAQLFCWTNQFIGHGVFEVRQKRAPALLDNLAQAFLMAPFFVFLEVLQNLFRYEPYPGFNASVQAKIKEEIKEWKEKNEKL
        GA FLANSLGYSLTWKVVLAAQLFCWTNQFIGHGVFE   KRAPALLDNLAQAFLMAPFFVFLEVLQNLFRYEPYPGFNASVQAKIKEEIKEWKEK EKL
Subjt:  GASFLANSLGYSLTWKVVLAAQLFCWTNQFIGHGVFEVRQKRAPALLDNLAQAFLMAPFFVFLEVLQNLFRYEPYPGFNASVQAKIKEEIKEWKEKNEKL

Query:  S
        S
Subjt:  S

A0A6J1C0I5 uncharacterized endoplasmic reticulum membrane protein C16E8.02 isoform X11.5e-15697.6Show/hide
Query:  MSFSATAKIPQFQPLPLNSRSNGNGNPLFFRSSSRFLGSTLGIRFPSLSSSTRRRRSSTVVAVADDVKDMDNNLKPSSNPNPGLSIFPEILMGKTGLFDL
        MSFSATAKIPQFQPLPLNSRSNGNGNPLFFRSSSRFLGSTLGIRFPSLSSSTRRRRSSTVVAVADDVKDMDNNLKPSSNPNPGLSIFPEILMGKTGLFDL
Subjt:  MSFSATAKIPQFQPLPLNSRSNGNGNPLFFRSSSRFLGSTLGIRFPSLSSSTRRRRSSTVVAVADDVKDMDNNLKPSSNPNPGLSIFPEILMGKTGLFDL

Query:  ETQFAFYGAYHSNPINIFIHVLFVWPIFFTALMYLYFTPSLYSVPKTLCGFEHGLVLNFGFLFTLIYAVSYVAFDKRAGSMAALLCLVCWGGASFLANSL
        ETQFAFYGAYHSNPINIFIHVLFVWPIFFTALMYLYFTPSLYS+PKTLCGFEHGLVLNFGFLFTLI AVSYVAFDKRAGSMAALLCLVCWGGA FLANSL
Subjt:  ETQFAFYGAYHSNPINIFIHVLFVWPIFFTALMYLYFTPSLYSVPKTLCGFEHGLVLNFGFLFTLIYAVSYVAFDKRAGSMAALLCLVCWGGASFLANSL

Query:  GYSLTWKVVLAAQLFCWTNQFIGHGVFEVRQKRAPALLDNLAQAFLMAPFFVFLEVLQNLFRYEPYPGFNASVQAKIKEEIKEWKEKNEKLS
        GYSLTWKVVLAAQLFCWTNQFIGHGVFE   KRAPALLDNLAQAFLMAPFFVFLEVLQNLFRYEPYPGFNASVQAKIKEEIKEWKEK EKLS
Subjt:  GYSLTWKVVLAAQLFCWTNQFIGHGVFEVRQKRAPALLDNLAQAFLMAPFFVFLEVLQNLFRYEPYPGFNASVQAKIKEEIKEWKEKNEKLS

A0A6J1F4N0 uncharacterized endoplasmic reticulum membrane protein C16E8.025.1e-9385.07Show/hide
Query:  MGKTGLFDLETQFAFYGAYHSNPINIFIHVLFVWPIFFTALMYLYFTPSLYSVPKTLCGFEHGLVLNFGFLFTLIYAVSYVAFDKRAGSMAALLCLVCWG
        MGKT  FDLE  FAFYGAYHSNP+NIFIHVLFVWPIFFT LMYLYFTPS Y++PK+ CGF+HGLVLNFGFLFTLIYA SYV FDKRAGSMAALLC VCW 
Subjt:  MGKTGLFDLETQFAFYGAYHSNPINIFIHVLFVWPIFFTALMYLYFTPSLYSVPKTLCGFEHGLVLNFGFLFTLIYAVSYVAFDKRAGSMAALLCLVCWG

Query:  GASFLANSLGYSLTWKVVLAAQLFCWTNQFIGHGVFEVRQKRAPALLDNLAQAFLMAPFFVFLEVLQNLFRYEPYPGFNASVQAKIKEEIKEWKEKNEKL
        GASF++N LGYS TWKVVLAAQLFCWTNQFIGHGVFE   KRAPALLDNLAQAFLMAPFFVFLEVLQ LF+YEPYPGF+ASVQAKI+ +IKEWKEK EKL
Subjt:  GASFLANSLGYSLTWKVVLAAQLFCWTNQFIGHGVFEVRQKRAPALLDNLAQAFLMAPFFVFLEVLQNLFRYEPYPGFNASVQAKIKEEIKEWKEKNEKL

Query:  S
        S
Subjt:  S

SwissProt top hitse value%identityAlignment
O13737 2-hydroxy-palmitic acid dioxygenase mpo11.2e-1737.3Show/hide
Query:  LETQFAFYGAYHSNPINIFIHVLFVWPIFFTALMYLY-FTPSLYSVPKTLCGFEHGLVLNFGFLFTLIYAVSYVAFDKRAGSM-AALLCLVCWGGASFLA
        L   ++FY AYHSNP+NI IH + +  +  TAL+ L+ F  +L             L +N   L  L Y + YV  D   G + + +L L  +   S L 
Subjt:  LETQFAFYGAYHSNPINIFIHVLFVWPIFFTALMYLY-FTPSLYSVPKTLCGFEHGLVLNFGFLFTLIYAVSYVAFDKRAGSM-AALLCLVCWGGASFLA

Query:  NSLGYSLTWKVVLAAQLFCWTNQFIGHGVFEVRQKRAPALLDNLAQAFLMAPFFVFLEVLQNLFRYEPYPGFNASVQAKIKEEIK
             SL  +      + CW  QFIGHGVFE   KR PALLDNL Q+  +AP F FLE         P+ G+  SV +KI+  IK
Subjt:  NSLGYSLTWKVVLAAQLFCWTNQFIGHGVFEVRQKRAPALLDNLAQAFLMAPFFVFLEVLQNLFRYEPYPGFNASVQAKIKEEIK

P25338 2-hydroxy-palmitic acid dioxygenase MPO11.2e-1432.65Show/hide
Query:  GLFDLETQFAFYGAYHSNPINIFIHVLFVWPIFFTALMYLYFTPSLYSVPKTLCGFEHGLVLNFGFLFTLIYAVSYVAFDKRAGSMAALLCLVCWGGASF
        GL DL +Q  FY  YH NP N+ IH +FV  I F+    L+      S+          L      LF++ Y + Y+     AG +  LL L        
Subjt:  GLFDLETQFAFYGAYHSNPINIFIHVLFVWPIFFTALMYLYFTPSLYSVPKTLCGFEHGLVLNFGFLFTLIYAVSYVAFDKRAGSMAALLCLVCWGGASF

Query:  LANSLGYSLTWKVVLAAQLFCWTNQFIGHGVFEVRQKRAPALLDNLAQAFLMAPFFVFLEVLQNLFRYEPYPGFNASVQAKIKEEIKEWKEKNEKL
                LT+K  L      W  QF+GHGVFE   KR PAL+DNL Q+ ++AP+F+  E L  L       GF   ++A ++ ++ E K++N ++
Subjt:  LANSLGYSLTWKVVLAAQLFCWTNQFIGHGVFEVRQKRAPALLDNLAQAFLMAPFFVFLEVLQNLFRYEPYPGFNASVQAKIKEEIKEWKEKNEKL

Arabidopsis top hitse value%identityAlignment
AT1G18720.1 Protein of unknown function (DUF962)3.5e-6261.19Show/hide
Query:  GLFDLETQFAFYGAYHSNPINIFIHVLFVWPIFFTALMYLYFTPSLYSVPKTLCGFEHGLVL------NFGFLFTLIYAVSYVAFDKRAGSMAALLCLVC
        GLFDLE  FAFYGAYHSNPINI IH++FVWPIFF+ L+ L+ +  ++  P  L GF   L L      N GF+F LIYA+ Y+  DK++G +AAL+C  C
Subjt:  GLFDLETQFAFYGAYHSNPINIFIHVLFVWPIFFTALMYLYFTPSLYSVPKTLCGFEHGLVL------NFGFLFTLIYAVSYVAFDKRAGSMAALLCLVC

Query:  WGGASFLANSLGYSLTWKVVLAAQLFCWTNQFIGHGVFEVRQKRAPALLDNLAQAFLMAPFFVFLEVLQNLFRYEPYPGFNASVQAKIKEEIKEWKEKNE
        W G+SFLA  LG SL  KV LA+QL CWT QF+GHGVFE   KRAPALLDNL QAFLMAPFFV LEVLQ++F YEPYPGF A V AK++ +IKE++ K +
Subjt:  WGGASFLANSLGYSLTWKVVLAAQLFCWTNQFIGHGVFEVRQKRAPALLDNLAQAFLMAPFFVFLEVLQNLFRYEPYPGFNASVQAKIKEEIKEWKEKNE

Query:  K
        K
Subjt:  K

AT1G74440.1 Protein of unknown function (DUF962)1.5e-6059.51Show/hide
Query:  LMGKTGLFDLETQFAFYGAYHSNPINIFIHVLFVWPIFFTALMYLYFTP-----SLYSVPKTLCGFEHGLVLNFGFLFTLIYAVSYVAFDKRAGSMAALL
        +  + GL DLE  FAFYGAYHSNPINI IH LFVWP  F  L++LY TP     S     K+L  F+  L L+ GF  T+ YAV Y+  DK++G +AALL
Subjt:  LMGKTGLFDLETQFAFYGAYHSNPINIFIHVLFVWPIFFTALMYLYFTP-----SLYSVPKTLCGFEHGLVLNFGFLFTLIYAVSYVAFDKRAGSMAALL

Query:  CLVCWGGASFLANSLGYSLTWKVVLAAQLFCWTNQFIGHGVFEVRQKRAPALLDNLAQAFLMAPFFVFLEVLQNLFRYEPYPGFNASVQAKIKEEIKEWK
        C  CW G+SFLA  LG+SLT KV +A+QL CWT QF+GHG+FE   KRAPALLDNL QAFLM PFFV LEVLQ++F YEPYPGF A V +KI+  IKEW+
Subjt:  CLVCWGGASFLANSLGYSLTWKVVLAAQLFCWTNQFIGHGVFEVRQKRAPALLDNLAQAFLMAPFFVFLEVLQNLFRYEPYPGFNASVQAKIKEEIKEWK

Query:  EKNEK
        EK ++
Subjt:  EKNEK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCATTCTCCGCCACAGCCAAAATCCCTCAATTTCAGCCCCTTCCTCTCAACTCCAGATCCAATGGCAATGGGAACCCCCTCTTCTTCAGATCATCCTCCAGATTTCT
CGGATCTACGCTGGGGATTCGCTTCCCCTCCCTCTCCTCCTCCACTCGCCGCCGTCGATCCTCCACTGTTGTCGCCGTCGCCGACGATGTCAAGGACATGGACAACAACC
TCAAACCCTCTTCCAATCCCAATCCGGGATTGTCAATTTTCCCGGAAATTTTGATGGGAAAAACTGGATTGTTTGATTTGGAGACGCAATTCGCCTTCTATGGCGCATAT
CACAGCAACCCAATCAACATTTTCATCCACGTTCTGTTTGTGTGGCCGATTTTCTTCACCGCTCTCATGTATTTGTATTTCACCCCTTCTCTCTACAGCGTCCCCAAAAC
CCTTTGTGGGTTTGAGCATGGCCTGGTTTTGAACTTCGGATTTCTATTCACTTTAATCTACGCTGTATCTTATGTGGCTTTCGATAAGAGAGCCGGGTCCATGGCCGCCT
TGCTTTGTCTCGTCTGCTGGGGTGGAGCAAGCTTCCTCGCCAACAGCCTTGGCTATTCTCTCACTTGGAAGGTAGTACTGGCTGCTCAGTTGTTTTGTTGGACCAATCAG
TTTATAGGCCATGGAGTATTTGAGGTTAGACAGAAACGAGCACCGGCTTTGTTAGACAATCTTGCTCAAGCCTTTCTTATGGCTCCTTTCTTTGTATTTCTGGAGGTTCT
TCAAAATCTATTCAGATATGAACCATATCCAGGGTTTAATGCAAGCGTGCAAGCAAAGATCAAAGAAGAGATAAAAGAGTGGAAAGAAAAGAACGAAAAGCTATCATAG
mRNA sequenceShow/hide mRNA sequence
ATGTCATTCTCCGCCACAGCCAAAATCCCTCAATTTCAGCCCCTTCCTCTCAACTCCAGATCCAATGGCAATGGGAACCCCCTCTTCTTCAGATCATCCTCCAGATTTCT
CGGATCTACGCTGGGGATTCGCTTCCCCTCCCTCTCCTCCTCCACTCGCCGCCGTCGATCCTCCACTGTTGTCGCCGTCGCCGACGATGTCAAGGACATGGACAACAACC
TCAAACCCTCTTCCAATCCCAATCCGGGATTGTCAATTTTCCCGGAAATTTTGATGGGAAAAACTGGATTGTTTGATTTGGAGACGCAATTCGCCTTCTATGGCGCATAT
CACAGCAACCCAATCAACATTTTCATCCACGTTCTGTTTGTGTGGCCGATTTTCTTCACCGCTCTCATGTATTTGTATTTCACCCCTTCTCTCTACAGCGTCCCCAAAAC
CCTTTGTGGGTTTGAGCATGGCCTGGTTTTGAACTTCGGATTTCTATTCACTTTAATCTACGCTGTATCTTATGTGGCTTTCGATAAGAGAGCCGGGTCCATGGCCGCCT
TGCTTTGTCTCGTCTGCTGGGGTGGAGCAAGCTTCCTCGCCAACAGCCTTGGCTATTCTCTCACTTGGAAGGTAGTACTGGCTGCTCAGTTGTTTTGTTGGACCAATCAG
TTTATAGGCCATGGAGTATTTGAGGTTAGACAGAAACGAGCACCGGCTTTGTTAGACAATCTTGCTCAAGCCTTTCTTATGGCTCCTTTCTTTGTATTTCTGGAGGTTCT
TCAAAATCTATTCAGATATGAACCATATCCAGGGTTTAATGCAAGCGTGCAAGCAAAGATCAAAGAAGAGATAAAAGAGTGGAAAGAAAAGAACGAAAAGCTATCATAG
Protein sequenceShow/hide protein sequence
MSFSATAKIPQFQPLPLNSRSNGNGNPLFFRSSSRFLGSTLGIRFPSLSSSTRRRRSSTVVAVADDVKDMDNNLKPSSNPNPGLSIFPEILMGKTGLFDLETQFAFYGAY
HSNPINIFIHVLFVWPIFFTALMYLYFTPSLYSVPKTLCGFEHGLVLNFGFLFTLIYAVSYVAFDKRAGSMAALLCLVCWGGASFLANSLGYSLTWKVVLAAQLFCWTNQ
FIGHGVFEVRQKRAPALLDNLAQAFLMAPFFVFLEVLQNLFRYEPYPGFNASVQAKIKEEIKEWKEKNEKLS