; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmoCh01G004150 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene IDCmoCh01G004150
OrganismCucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
Descriptiontransmembrane protein 33 homolog
Genome locationCmo_Chr01:2021705..2023628
RNA-Seq ExpressionCmoCh01G004150
SyntenyCmoCh01G004150
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR005344 - TMEM33/Pom33 family


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6607043.1 hypothetical protein SDJN03_00385, partial [Cucurbita argyrosperma subsp. sororia]2.0e-15298.23Show/hide
Query:  DPDLVVETMSSGSSSFQPGRQSAPSPPTNDQTQSQSSGSTTRNPGTSATTGSNWMYLRWDRHRVLFLVNAWVFLLSSLAVMPMVPRNISHRAYQISLMGT
        DPDLVVETMSSGSSSFQPGRQSAPSPPT DQTQSQS GSTTRNPGTSATTGSNWMYLRWDRHRVLFLVNAWVFLLSSLAV+PMVPRNISHRAYQISLMGT
Subjt:  DPDLVVETMSSGSSSFQPGRQSAPSPPTNDQTQSQSSGSTTRNPGTSATTGSNWMYLRWDRHRVLFLVNAWVFLLSSLAVMPMVPRNISHRAYQISLMGT

Query:  ACSSLFTLFITCGIPRSFDMQALEVYFRPVVATKALVYLIYSITFVASNLFLKLALIPIICVAVEQISKYVRHNFPRSAFYRKCLERPCAWVESNTITLC
        ACSSLFTLFITCG PRSFDMQALEVYFRPVVATKALVYLIYSITFVASNLFLKLALIPIICVAVEQISKYVRHNFPRSAFYRKCLERPCAWVESNTITLC
Subjt:  ACSSLFTLFITCGIPRSFDMQALEVYFRPVVATKALVYLIYSITFVASNLFLKLALIPIICVAVEQISKYVRHNFPRSAFYRKCLERPCAWVESNTITLC

Query:  LLSSNVEIALGFLLIISLFTWQRNFVQTFMYWQLLKLMYHFPLTAGYHQSAWAKIGRKVNPFVSRFLPFLKPLLSAAQRWWLR
        LLSSNVEIALGFLLIISLFTWQRNFVQTFMYWQLLKLMYHFPLTAGYHQSAWAKIGRKVNPFV+RFLPFLKPLLSAAQRWWLR
Subjt:  LLSSNVEIALGFLLIISLFTWQRNFVQTFMYWQLLKLMYHFPLTAGYHQSAWAKIGRKVNPFVSRFLPFLKPLLSAAQRWWLR

KAG7036743.1 hypothetical protein SDJN02_00363 [Cucurbita argyrosperma subsp. argyrosperma]8.6e-14889.1Show/hide
Query:  DPDLVVETMSSGSSSFQPGRQSAPSPPTNDQTQSQSSGSTTRNPGTSATTGSNWMYLRWDRHRVLFLVNAWVFLLSSLAVMPMVPRNISHRAYQISLMGT
        DPDLVVETMSSGSSSFQPGRQSAPSPPT DQTQSQSSGSTTRNPGTSATTGSNWMYLRWDRHRVLFLVNAWVFLLSSLAV+PMVPR ISHRAYQISLMGT
Subjt:  DPDLVVETMSSGSSSFQPGRQSAPSPPTNDQTQSQSSGSTTRNPGTSATTGSNWMYLRWDRHRVLFLVNAWVFLLSSLAVMPMVPRNISHRAYQISLMGT

Query:  ACSSLFTLFITCGIPRSFDMQALEVYFRPVVATKALVYLIYSITFVASNLFLKLALIPIICVAVEQISKYVRHNFPRSAFYRKCLERPCAWVESNTITLC
        ACSSLFTLFITCG PRSFDMQALEVYFRPVVATKALVYLIYSITFVASNLFLKLALIPIICVAVEQISKYVRHNFPRSAFYRKCLERPCAWVESNTITLC
Subjt:  ACSSLFTLFITCGIPRSFDMQALEVYFRPVVATKALVYLIYSITFVASNLFLKLALIPIICVAVEQISKYVRHNFPRSAFYRKCLERPCAWVESNTITLC

Query:  LLSSNVEIALGFLLIISLFT-----------------------------WQRNFVQTFMYWQLLKLMYHFPLTAGYHQSAWAKIGRKVNPFVSRFLPFLK
        LLSSNVEIALGFLLIISLFT                             WQRNFVQTFMYWQLLKLMYHFPLTAGYHQSAWAKIGRKVNPFV+RFLPFLK
Subjt:  LLSSNVEIALGFLLIISLFT-----------------------------WQRNFVQTFMYWQLLKLMYHFPLTAGYHQSAWAKIGRKVNPFVSRFLPFLK

Query:  PLLSAAQRWWLR
        PLLSAAQRWWLR
Subjt:  PLLSAAQRWWLR

XP_022949013.1 uncharacterized protein LOC111452483 [Cucurbita moschata]1.9e-155100Show/hide
Query:  DPDLVVETMSSGSSSFQPGRQSAPSPPTNDQTQSQSSGSTTRNPGTSATTGSNWMYLRWDRHRVLFLVNAWVFLLSSLAVMPMVPRNISHRAYQISLMGT
        DPDLVVETMSSGSSSFQPGRQSAPSPPTNDQTQSQSSGSTTRNPGTSATTGSNWMYLRWDRHRVLFLVNAWVFLLSSLAVMPMVPRNISHRAYQISLMGT
Subjt:  DPDLVVETMSSGSSSFQPGRQSAPSPPTNDQTQSQSSGSTTRNPGTSATTGSNWMYLRWDRHRVLFLVNAWVFLLSSLAVMPMVPRNISHRAYQISLMGT

Query:  ACSSLFTLFITCGIPRSFDMQALEVYFRPVVATKALVYLIYSITFVASNLFLKLALIPIICVAVEQISKYVRHNFPRSAFYRKCLERPCAWVESNTITLC
        ACSSLFTLFITCGIPRSFDMQALEVYFRPVVATKALVYLIYSITFVASNLFLKLALIPIICVAVEQISKYVRHNFPRSAFYRKCLERPCAWVESNTITLC
Subjt:  ACSSLFTLFITCGIPRSFDMQALEVYFRPVVATKALVYLIYSITFVASNLFLKLALIPIICVAVEQISKYVRHNFPRSAFYRKCLERPCAWVESNTITLC

Query:  LLSSNVEIALGFLLIISLFTWQRNFVQTFMYWQLLKLMYHFPLTAGYHQSAWAKIGRKVNPFVSRFLPFLKPLLSAAQRWWLR
        LLSSNVEIALGFLLIISLFTWQRNFVQTFMYWQLLKLMYHFPLTAGYHQSAWAKIGRKVNPFVSRFLPFLKPLLSAAQRWWLR
Subjt:  LLSSNVEIALGFLLIISLFTWQRNFVQTFMYWQLLKLMYHFPLTAGYHQSAWAKIGRKVNPFVSRFLPFLKPLLSAAQRWWLR

XP_022998724.1 transmembrane protein 33 homolog [Cucurbita maxima]1.2e-14693.64Show/hide
Query:  DPDLVVETMSSGSSSFQPGRQSAPSPPTNDQTQSQSSGSTTRNPGTSATTGSNWMYLRWDRHRVLFLVNAWVFLLSSLAVMPMVPRNISHRAYQISLMGT
        DP+LVVETMSS SSSFQPGRQSAPSPPTNDQTQSQSSGSTTRN GTSATT SNW++LRWDRHRVLFLVNAWVFLL+S A++PMVPRNISHRAYQISLMGT
Subjt:  DPDLVVETMSSGSSSFQPGRQSAPSPPTNDQTQSQSSGSTTRNPGTSATTGSNWMYLRWDRHRVLFLVNAWVFLLSSLAVMPMVPRNISHRAYQISLMGT

Query:  ACSSLFTLFITCGIPRSFDMQALEVYFRPVVATKALVYLIYSITFVASNLFLKLALIPIICVAVEQISKYVRHNFPRSAFYRKCLERPCAWVESNTITLC
        ACSSLFTLFITCG PRSFDMQALEVYF+PVVATKALVYLIYSITFVASNLFLKLALIPIICVAVEQI+K+VRHNFPRSAFYRKCLERPCAW ESNT TLC
Subjt:  ACSSLFTLFITCGIPRSFDMQALEVYFRPVVATKALVYLIYSITFVASNLFLKLALIPIICVAVEQISKYVRHNFPRSAFYRKCLERPCAWVESNTITLC

Query:  LLSSNVEIALGFLLIISLFTWQRNFVQTFMYWQLLKLMYHFPLTAGYHQSAWAKIGRKVNPFVSRFLPFLKPLLSAAQRWWLR
        LLSSNVEIALGFLLIISLFTWQRNFVQTFMYWQLLKLMYHFP+TAGYHQSAWAKIGRKVNPFVSRFLPFLKPLLSAAQRWW+R
Subjt:  LLSSNVEIALGFLLIISLFTWQRNFVQTFMYWQLLKLMYHFPLTAGYHQSAWAKIGRKVNPFVSRFLPFLKPLLSAAQRWWLR

XP_023525188.1 uncharacterized protein LOC111788863 [Cucurbita pepo subsp. pepo]1.7e-15197.53Show/hide
Query:  DPDLVVETMSSGSSSFQPGRQSAPSPPTNDQTQSQSSGSTTRNPGTSATTGSNWMYLRWDRHRVLFLVNAWVFLLSSLAVMPMVPRNISHRAYQISLMGT
        DPDLVVETMSSGSSSFQPGRQSA SPPTNDQTQSQSSGSTTRN GTSATTGSNWMYLRWDRHRVLFLVNAWVFLLSSLAV+PMVPRNISHRAYQISLMGT
Subjt:  DPDLVVETMSSGSSSFQPGRQSAPSPPTNDQTQSQSSGSTTRNPGTSATTGSNWMYLRWDRHRVLFLVNAWVFLLSSLAVMPMVPRNISHRAYQISLMGT

Query:  ACSSLFTLFITCGIPRSFDMQALEVYFRPVVATKALVYLIYSITFVASNLFLKLALIPIICVAVEQISKYVRHNFPRSAFYRKCLERPCAWVESNTITLC
        ACSSLFTLFITCG PRSFDMQALEVYFRPVVATKALVYLIYSITFVASNLFLKLALIPIICVAVEQI+KYV+HNFPRSAFYRKCLERPCAWVESNTITLC
Subjt:  ACSSLFTLFITCGIPRSFDMQALEVYFRPVVATKALVYLIYSITFVASNLFLKLALIPIICVAVEQISKYVRHNFPRSAFYRKCLERPCAWVESNTITLC

Query:  LLSSNVEIALGFLLIISLFTWQRNFVQTFMYWQLLKLMYHFPLTAGYHQSAWAKIGRKVNPFVSRFLPFLKPLLSAAQRWWLR
        LLSSNVEIALGFLLIISLFTWQRNFVQTFMYWQLLKLMYHFP+TAGYHQSAWAKIGRKVNPFVSRFLPFLKPLLSAAQRWWLR
Subjt:  LLSSNVEIALGFLLIISLFTWQRNFVQTFMYWQLLKLMYHFPLTAGYHQSAWAKIGRKVNPFVSRFLPFLKPLLSAAQRWWLR

TrEMBL top hitse value%identityAlignment
A0A2N9J2X3 Uncharacterized protein1.0e-9362.37Show/hide
Query:  DPDLVVETMSSGSSSFQPGRQSAP----SPPTNDQTQSQSSGSTTRNPGTSATTGSNWMYLRWDRHRVLFLVNAWVFLLSSLAVMPMVPRNISHRAYQIS
        DPDLVVE MS+ SSS QP R SAP    S  +NDQ + ++SGSTTR  GTSATTG++   +RWDR  + F VNAWVF+++ LA+ P+VPR++SHRAY++S
Subjt:  DPDLVVETMSSGSSSFQPGRQSAP----SPPTNDQTQSQSSGSTTRNPGTSATTGSNWMYLRWDRHRVLFLVNAWVFLLSSLAVMPMVPRNISHRAYQIS

Query:  LMGTACSSLFTLFITCGIPRSFDMQALEVYFRPVVATKALVYLIYSITFVASNLFLKLALIPIICVAVEQISKYVRHNFPRSAFYRKCLERPCAWVESNT
         MGTACSSLF+L+   G PR +++QAL+VYF+ ++ATK  +Y IY +TFV SNL LK ALIPI+C A+E ++K++R NF RS+ YRK LE PC WV+SNT
Subjt:  LMGTACSSLFTLFITCGIPRSFDMQALEVYFRPVVATKALVYLIYSITFVASNLFLKLALIPIICVAVEQISKYVRHNFPRSAFYRKCLERPCAWVESNT

Query:  ITLCLLSSNVEIALGFLLIISLFTWQRNFVQTFMYWQLLKLMYHFPLTAGYHQSAWAKIGRKVNPFVSRFLPFLKPLLSAAQRWWLR
         TL +LSS+ EI LGFLLI+SLF+WQRN +QTFMYWQLLKLMY  P+TAGYH S WAKIGR VNP V R+ PFL+  +SAAQRWW R
Subjt:  ITLCLLSSNVEIALGFLLIISLFTWQRNFVQTFMYWQLLKLMYHFPLTAGYHQSAWAKIGRKVNPFVSRFLPFLKPLLSAAQRWWLR

A0A6J1CQJ0 uncharacterized protein LOC111013250 isoform X25.3e-11172.79Show/hide
Query:  DPDLVVETMSSGSSSFQPGRQSAPSPPTNDQTQSQSSGSTTRNPGTSATTGSNWMYLRWDRHRVLFLVNAWVFLLSSLAVMPMVPRNISHRAYQISLMGT
        DPDLVV+T+S  S   Q  RQSAPSPPTNDQTQ+QSSGSTT N GTS TT SN        H +LF +NAWV +++  A++PMVP+N+SHRA+++S MGT
Subjt:  DPDLVVETMSSGSSSFQPGRQSAPSPPTNDQTQSQSSGSTTRNPGTSATTGSNWMYLRWDRHRVLFLVNAWVFLLSSLAVMPMVPRNISHRAYQISLMGT

Query:  ACSSLFTLFITCGIPRSFDMQALEVYFRPVVATKALVYLIYSITFVASNLFLKLALIPIICVAVEQISKYVRHNFPRSAFYRKCLERPCAWVESNTITLC
          SSLF+L I+CG P++ DMQALEVYF+ VVATKA VY IY +TFVASNL LK ALIPIIC  +EQI+K++R +FPRS FYRKCLERPCAWVESNT TL 
Subjt:  ACSSLFTLFITCGIPRSFDMQALEVYFRPVVATKALVYLIYSITFVASNLFLKLALIPIICVAVEQISKYVRHNFPRSAFYRKCLERPCAWVESNTITLC

Query:  LLSSNVEIALGFLLIISLFTWQRNFVQTFMYWQLLKLMYHFPLTAGYHQSAWAKIGRKVNPFVSRFLPFLKPLLSAAQRWWLR
        LLSSNVEIALGFLLIISLF+WQRNFV TFMYWQLL+LMYHFP+TAGYHQS WAK+GRKVNPFVSRFLPFLKP LSAA+RWW+R
Subjt:  LLSSNVEIALGFLLIISLFTWQRNFVQTFMYWQLLKLMYHFPLTAGYHQSAWAKIGRKVNPFVSRFLPFLKPLLSAAQRWWLR

A0A6J1CQK8 uncharacterized protein LOC111013250 isoform X15.3e-11172.79Show/hide
Query:  DPDLVVETMSSGSSSFQPGRQSAPSPPTNDQTQSQSSGSTTRNPGTSATTGSNWMYLRWDRHRVLFLVNAWVFLLSSLAVMPMVPRNISHRAYQISLMGT
        DPDLVV+T+S  S   Q  RQSAPSPPTNDQTQ+QSSGSTT N GTS TT SN        H +LF +NAWV +++  A++PMVP+N+SHRA+++S MGT
Subjt:  DPDLVVETMSSGSSSFQPGRQSAPSPPTNDQTQSQSSGSTTRNPGTSATTGSNWMYLRWDRHRVLFLVNAWVFLLSSLAVMPMVPRNISHRAYQISLMGT

Query:  ACSSLFTLFITCGIPRSFDMQALEVYFRPVVATKALVYLIYSITFVASNLFLKLALIPIICVAVEQISKYVRHNFPRSAFYRKCLERPCAWVESNTITLC
          SSLF+L I+CG P++ DMQALEVYF+ VVATKA VY IY +TFVASNL LK ALIPIIC  +EQI+K++R +FPRS FYRKCLERPCAWVESNT TL 
Subjt:  ACSSLFTLFITCGIPRSFDMQALEVYFRPVVATKALVYLIYSITFVASNLFLKLALIPIICVAVEQISKYVRHNFPRSAFYRKCLERPCAWVESNTITLC

Query:  LLSSNVEIALGFLLIISLFTWQRNFVQTFMYWQLLKLMYHFPLTAGYHQSAWAKIGRKVNPFVSRFLPFLKPLLSAAQRWWLR
        LLSSNVEIALGFLLIISLF+WQRNFV TFMYWQLL+LMYHFP+TAGYHQS WAK+GRKVNPFVSRFLPFLKP LSAA+RWW+R
Subjt:  LLSSNVEIALGFLLIISLFTWQRNFVQTFMYWQLLKLMYHFPLTAGYHQSAWAKIGRKVNPFVSRFLPFLKPLLSAAQRWWLR

A0A6J1GAW8 uncharacterized protein LOC1114524839.3e-156100Show/hide
Query:  DPDLVVETMSSGSSSFQPGRQSAPSPPTNDQTQSQSSGSTTRNPGTSATTGSNWMYLRWDRHRVLFLVNAWVFLLSSLAVMPMVPRNISHRAYQISLMGT
        DPDLVVETMSSGSSSFQPGRQSAPSPPTNDQTQSQSSGSTTRNPGTSATTGSNWMYLRWDRHRVLFLVNAWVFLLSSLAVMPMVPRNISHRAYQISLMGT
Subjt:  DPDLVVETMSSGSSSFQPGRQSAPSPPTNDQTQSQSSGSTTRNPGTSATTGSNWMYLRWDRHRVLFLVNAWVFLLSSLAVMPMVPRNISHRAYQISLMGT

Query:  ACSSLFTLFITCGIPRSFDMQALEVYFRPVVATKALVYLIYSITFVASNLFLKLALIPIICVAVEQISKYVRHNFPRSAFYRKCLERPCAWVESNTITLC
        ACSSLFTLFITCGIPRSFDMQALEVYFRPVVATKALVYLIYSITFVASNLFLKLALIPIICVAVEQISKYVRHNFPRSAFYRKCLERPCAWVESNTITLC
Subjt:  ACSSLFTLFITCGIPRSFDMQALEVYFRPVVATKALVYLIYSITFVASNLFLKLALIPIICVAVEQISKYVRHNFPRSAFYRKCLERPCAWVESNTITLC

Query:  LLSSNVEIALGFLLIISLFTWQRNFVQTFMYWQLLKLMYHFPLTAGYHQSAWAKIGRKVNPFVSRFLPFLKPLLSAAQRWWLR
        LLSSNVEIALGFLLIISLFTWQRNFVQTFMYWQLLKLMYHFPLTAGYHQSAWAKIGRKVNPFVSRFLPFLKPLLSAAQRWWLR
Subjt:  LLSSNVEIALGFLLIISLFTWQRNFVQTFMYWQLLKLMYHFPLTAGYHQSAWAKIGRKVNPFVSRFLPFLKPLLSAAQRWWLR

A0A6J1KF32 transmembrane protein 33 homolog6.0e-14793.64Show/hide
Query:  DPDLVVETMSSGSSSFQPGRQSAPSPPTNDQTQSQSSGSTTRNPGTSATTGSNWMYLRWDRHRVLFLVNAWVFLLSSLAVMPMVPRNISHRAYQISLMGT
        DP+LVVETMSS SSSFQPGRQSAPSPPTNDQTQSQSSGSTTRN GTSATT SNW++LRWDRHRVLFLVNAWVFLL+S A++PMVPRNISHRAYQISLMGT
Subjt:  DPDLVVETMSSGSSSFQPGRQSAPSPPTNDQTQSQSSGSTTRNPGTSATTGSNWMYLRWDRHRVLFLVNAWVFLLSSLAVMPMVPRNISHRAYQISLMGT

Query:  ACSSLFTLFITCGIPRSFDMQALEVYFRPVVATKALVYLIYSITFVASNLFLKLALIPIICVAVEQISKYVRHNFPRSAFYRKCLERPCAWVESNTITLC
        ACSSLFTLFITCG PRSFDMQALEVYF+PVVATKALVYLIYSITFVASNLFLKLALIPIICVAVEQI+K+VRHNFPRSAFYRKCLERPCAW ESNT TLC
Subjt:  ACSSLFTLFITCGIPRSFDMQALEVYFRPVVATKALVYLIYSITFVASNLFLKLALIPIICVAVEQISKYVRHNFPRSAFYRKCLERPCAWVESNTITLC

Query:  LLSSNVEIALGFLLIISLFTWQRNFVQTFMYWQLLKLMYHFPLTAGYHQSAWAKIGRKVNPFVSRFLPFLKPLLSAAQRWWLR
        LLSSNVEIALGFLLIISLFTWQRNFVQTFMYWQLLKLMYHFP+TAGYHQSAWAKIGRKVNPFVSRFLPFLKPLLSAAQRWW+R
Subjt:  LLSSNVEIALGFLLIISLFTWQRNFVQTFMYWQLLKLMYHFPLTAGYHQSAWAKIGRKVNPFVSRFLPFLKPLLSAAQRWWLR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G02420.1 unknown protein4.6e-9157.79Show/hide
Query:  DPDLVVETMSSGSSSFQPGRQSAPSPPT------NDQTQSQSSGSTTRNPGTSATTGSNWMYLRWDRHRVLFLVNAWVFLLSSLAVMPMVPRNISHRAYQ
        DPDLVVE MS+ SSS Q  R +A S  +      N+Q +S++SGS  R  G SATTG+    +RWD   + F VNAWVF+++ LAV+P++P+N+S+RAY+
Subjt:  DPDLVVETMSSGSSSFQPGRQSAPSPPT------NDQTQSQSSGSTTRNPGTSATTGSNWMYLRWDRHRVLFLVNAWVFLLSSLAVMPMVPRNISHRAYQ

Query:  ISLMGTACSSLFTLFITCGIPRSFDMQALEVYFRPVVATKALVYLIYSITFVASNLFLKLALIPIICVAVEQISKYVRHNFPRSAFYRKCLERPCAWVES
        +S MGTACSSL++L+   G PR+++MQ L+VYF+ +VA K  +Y IY +TFV S+L LK ALIPI+C A+EQ++K++R NF RS  YRK LE PC WVES
Subjt:  ISLMGTACSSLFTLFITCGIPRSFDMQALEVYFRPVVATKALVYLIYSITFVASNLFLKLALIPIICVAVEQISKYVRHNFPRSAFYRKCLERPCAWVES

Query:  NTITLCLLSSNVEIALGFLLIISLFTWQRNFVQTFMYWQLLKLMYHFPLTAGYHQSAWAKIGRKVNPFVSRFLPFLKPLLSAAQRWWLR
        NT TL +LSS  EIA+GFLLIISL +WQRN +QTFMYWQLLKLMY  P+TAGYHQS W++IGR V P + R+ PFL   +SA QRWW R
Subjt:  NTITLCLLSSNVEIALGFLLIISLFTWQRNFVQTFMYWQLLKLMYHFPLTAGYHQSAWAKIGRKVNPFVSRFLPFLKPLLSAAQRWWLR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCAGGATCCTGATCTTGTAGTTGAGACAATGTCTTCCGGCAGTTCATCATTTCAGCCAGGGAGGCAGTCAGCACCTTCACCTCCTACAAATGATCAAACTCAGTCACA
GAGCTCCGGGTCAACTACAAGAAATCCAGGAACATCGGCAACTACAGGTTCAAATTGGATGTATTTACGCTGGGATCGACATAGGGTTCTTTTTTTGGTCAATGCTTGGG
TGTTTCTCTTGTCATCGTTGGCAGTGATGCCAATGGTTCCCAGAAACATTTCACATAGGGCGTATCAGATTTCTCTTATGGGCACTGCATGCTCCTCTCTATTCACCTTG
TTCATTACTTGTGGGATACCCAGGTCGTTCGATATGCAGGCTTTGGAAGTTTACTTCCGACCTGTCGTTGCAACAAAAGCTCTCGTCTACCTTATTTACTCCATTACCTT
TGTAGCTTCAAATCTTTTCCTTAAACTTGCTTTAATTCCGATTATATGCGTAGCTGTTGAGCAGATTTCCAAGTACGTTCGGCATAATTTCCCCCGATCTGCCTTCTACA
GGAAATGCTTGGAGCGGCCTTGTGCTTGGGTGGAATCAAATACAATCACACTTTGTCTTCTGTCTTCAAATGTTGAGATTGCACTGGGTTTCCTTCTGATCATCTCCTTA
TTCACATGGCAGCGAAACTTCGTGCAAACATTCATGTACTGGCAGCTACTGAAGCTGATGTACCACTTCCCCTTGACTGCTGGGTATCATCAGAGTGCGTGGGCTAAGAT
TGGGAGAAAAGTTAATCCATTTGTGAGCAGATTTCTCCCATTTTTGAAGCCTTTACTCTCTGCAGCTCAAAGATGGTGGCTCAGGTAG
mRNA sequenceShow/hide mRNA sequence
ATGCAGGATCCTGATCTTGTAGTTGAGACAATGTCTTCCGGCAGTTCATCATTTCAGCCAGGGAGGCAGTCAGCACCTTCACCTCCTACAAATGATCAAACTCAGTCACA
GAGCTCCGGGTCAACTACAAGAAATCCAGGAACATCGGCAACTACAGGTTCAAATTGGATGTATTTACGCTGGGATCGACATAGGGTTCTTTTTTTGGTCAATGCTTGGG
TGTTTCTCTTGTCATCGTTGGCAGTGATGCCAATGGTTCCCAGAAACATTTCACATAGGGCGTATCAGATTTCTCTTATGGGCACTGCATGCTCCTCTCTATTCACCTTG
TTCATTACTTGTGGGATACCCAGGTCGTTCGATATGCAGGCTTTGGAAGTTTACTTCCGACCTGTCGTTGCAACAAAAGCTCTCGTCTACCTTATTTACTCCATTACCTT
TGTAGCTTCAAATCTTTTCCTTAAACTTGCTTTAATTCCGATTATATGCGTAGCTGTTGAGCAGATTTCCAAGTACGTTCGGCATAATTTCCCCCGATCTGCCTTCTACA
GGAAATGCTTGGAGCGGCCTTGTGCTTGGGTGGAATCAAATACAATCACACTTTGTCTTCTGTCTTCAAATGTTGAGATTGCACTGGGTTTCCTTCTGATCATCTCCTTA
TTCACATGGCAGCGAAACTTCGTGCAAACATTCATGTACTGGCAGCTACTGAAGCTGATGTACCACTTCCCCTTGACTGCTGGGTATCATCAGAGTGCGTGGGCTAAGAT
TGGGAGAAAAGTTAATCCATTTGTGAGCAGATTTCTCCCATTTTTGAAGCCTTTACTCTCTGCAGCTCAAAGATGGTGGCTCAGGTAGAGAAACCAAGAATAAGAAAGTG
GTTTCTGTTAGTTTGCATTTAGTTTACTGAATATATTCATTGTTTCTTAGAAGATAGTCACATATTCATGACAGAAGCTGCCCTCATGAACGAACCCTGTTGATTTGGGG
TTTTTGTATGCATATTATGAAGCAGGGCAGGACTACTTGTTATGAATTATGATATTACTTTAGCAACTGAGATTTGTGATGATAATATGATATATATATACAACCTTTCA
AGAATTA
Protein sequenceShow/hide protein sequence
MQDPDLVVETMSSGSSSFQPGRQSAPSPPTNDQTQSQSSGSTTRNPGTSATTGSNWMYLRWDRHRVLFLVNAWVFLLSSLAVMPMVPRNISHRAYQISLMGTACSSLFTL
FITCGIPRSFDMQALEVYFRPVVATKALVYLIYSITFVASNLFLKLALIPIICVAVEQISKYVRHNFPRSAFYRKCLERPCAWVESNTITLCLLSSNVEIALGFLLIISL
FTWQRNFVQTFMYWQLLKLMYHFPLTAGYHQSAWAKIGRKVNPFVSRFLPFLKPLLSAAQRWWLR