; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0029772 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0029772
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionUnknown protein
Genome locationchr8:41938692..41943520
RNA-Seq ExpressionLag0029772
SyntenyLag0029772
Gene Ontology termsGO:0005765 - lysosomal membrane (cellular component)
InterPro domainsIPR019320 - BLOC-1-related complex subunit 8


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6578805.1 hypothetical protein SDJN03_23253, partial [Cucurbita argyrosperma subsp. sororia]7.1e-12486.36Show/hide
Query:  MYGFSTVDGFVEIAESSAEMIKYIANEPSTGLFYIQQHTKNAVPNVINLKNSVVDKSHESTLHTQDSEDSITMLRSMKECGFPIADEMIRDINKSLAIMA
        M+GFSTVDGFVEIAESSAEMIKYIANEPSTGLFY+QQHTKNAVPNVINLKNSVVDKSHE+TLH +DSEDSITML+SMK+CGFPIADEMIRDI KSLA+M+
Subjt:  MYGFSTVDGFVEIAESSAEMIKYIANEPSTGLFYIQQHTKNAVPNVINLKNSVVDKSHESTLHTQDSEDSITMLRSMKECGFPIADEMIRDINKSLAIMA

Query:  TKQPRRGLIRNTSGMQQPGRISTWRSATWGRSTIVAPRVDDSSGGYISTVFKSAREKASNFRWPQLDIKEDLAQVEVDKVQPQPNQPLVASASSSSSQPD
        TKQPRRGLIRNTSG QQPGR+STWRSATWGRS IVAPR DD SGGYISTVFKSAREKASNF+WPQLDIKEDLAQVEVD++ PQPN+P VASASS+SSQPD
Subjt:  TKQPRRGLIRNTSGMQQPGRISTWRSATWGRSTIVAPRVDDSSGGYISTVFKSAREKASNFRWPQLDIKEDLAQVEVDKVQPQPNQPLVASASSSSSQPD

Query:  MDTEELPLSSQVNDELQRNDQADVNLDSDLLSVSDTFDDFRADKEAKLEEWLGGSGGLNDIRDL
        +DTEELPL+ QVNDELQR+DQ DV +++DLLSVSD FDDFRADKEAKLEEWLGGSGGLND++++
Subjt:  MDTEELPLSSQVNDELQRNDQADVNLDSDLLSVSDTFDDFRADKEAKLEEWLGGSGGLNDIRDL

KAG6602294.1 hypothetical protein SDJN03_07527, partial [Cucurbita argyrosperma subsp. sororia]9.3e-12482.8Show/hide
Query:  DELPFTFEKMYGFSTVDGFVEIAESSAEMIKYIANEPSTGLFYIQQHTKNAVPNVINLKNSVVDKSHESTLHTQDSEDSITMLRSMKECGFPIADEMIRD
        DELPFTFEKMYGFSTVDGFVEI ESSAEMIKYIANEPSTGLFYIQQHTKNAVPNV+N+KNSV + S ESTLHT+DSEDSITMLRSMKECGFPIADEMIRD
Subjt:  DELPFTFEKMYGFSTVDGFVEIAESSAEMIKYIANEPSTGLFYIQQHTKNAVPNVINLKNSVVDKSHESTLHTQDSEDSITMLRSMKECGFPIADEMIRD

Query:  INKSLAIMATKQPRRGLIRNTSGMQQPGRISTWRSATWGRSTIVAPRVDDSSGGYISTVFKSAREKASNFRWPQLDIKEDLAQVEVDKVQPQPNQPLVAS
        I KSLA+M+ KQPRRGLIR+T GMQ PGR+STWRSATWGRS  +APR DD  GGYISTVFKSARE ASNF+WPQLDI EDLA+VEV K QP+PNQP V S
Subjt:  INKSLAIMATKQPRRGLIRNTSGMQQPGRISTWRSATWGRSTIVAPRVDDSSGGYISTVFKSAREKASNFRWPQLDIKEDLAQVEVDKVQPQPNQPLVAS

Query:  ASSSSSQPDMDTEELPLSSQVNDELQRNDQADVNLDSDLLSVSDTFDDFRADKEAKLEEWLGGSGGLNDIRDLSAGKGH
        ASSSSSQPDMD++ELPLS QVND LQ +DQ DV LD+D++SVS+ FDDFRADKEAKL++WL GSG LNDIRDLSAGKGH
Subjt:  ASSSSSQPDMDTEELPLSSQVNDELQRNDQADVNLDSDLLSVSDTFDDFRADKEAKLEEWLGGSGGLNDIRDLSAGKGH

XP_022134183.1 uncharacterized protein LOC111006508 isoform X1 [Momordica charantia]1.4e-12483.8Show/hide
Query:  KIRSDELPFTFEKMYGFSTVDGFVEIAESSAEMIKYIANEPSTGLFYIQQHTKNAVPNVINLKNSVVDKSHESTLHTQDSEDSITMLRSMKECGFPIADE
        +IRSDELPFTFEKMYGFSTVDGFVEI ES AEMIKYIANEPSTGLFYIQQHT+NAVPNV+ L+N VVDKSHE+TLHT+DSEDSITMLRSMKE GFPIADE
Subjt:  KIRSDELPFTFEKMYGFSTVDGFVEIAESSAEMIKYIANEPSTGLFYIQQHTKNAVPNVINLKNSVVDKSHESTLHTQDSEDSITMLRSMKECGFPIADE

Query:  MIRDINKSLAIMATKQPRRGLIRNTSGMQQPGRISTWRSATWGRSTIVAPRVDDSSGGYISTVFKSAREKASNFRWPQLDIKEDLAQVEVDKVQPQPNQP
        MIRDI KSLAIM+TKQPRRGLI NTSG+Q+ GRISTWRSATWGRS IVAP  ++ SGGYISTVFKSAREKASNF+WPQLDIK+DLAQVEVDK+ PQ NQP
Subjt:  MIRDINKSLAIMATKQPRRGLIRNTSGMQQPGRISTWRSATWGRSTIVAPRVDDSSGGYISTVFKSAREKASNFRWPQLDIKEDLAQVEVDKVQPQPNQP

Query:  LVASASSSSSQPDM-DTEELPLSSQVNDELQRNDQADVNLDSDLLSVSDTFDDFRADKEAKLEEWLGGSGGLNDIRDLSAGKGH
         VASASSSSSQPD+  T ELPLSSQVNDELQR+DQ D +LD DLLS SD FDDFRADKEAKLEEWLGG+GGL+ + DLSAGK H
Subjt:  LVASASSSSSQPDM-DTEELPLSSQVNDELQRNDQADVNLDSDLLSVSDTFDDFRADKEAKLEEWLGGSGGLNDIRDLSAGKGH

XP_022957415.1 uncharacterized protein LOC111458823 isoform X1 [Cucurbita moschata]1.4e-12782.29Show/hide
Query:  MDVKEKIRSDELPFTFEKMYGFSTVDGFVEIAESSAEMIKYIANEPSTGLFYIQQHTKNAVPNVINLKNSVVDKSHESTLHTQDSEDSITMLRSMKECGF
        MD KE+IRSDELPFTFEKMYGFSTVDGFVEI ESSAEMIKYIANEPSTGLFYIQQHTKNAVPN++N+KNSV + S ESTLHT+DSEDSITMLRSMKECGF
Subjt:  MDVKEKIRSDELPFTFEKMYGFSTVDGFVEIAESSAEMIKYIANEPSTGLFYIQQHTKNAVPNVINLKNSVVDKSHESTLHTQDSEDSITMLRSMKECGF

Query:  PIADEMIRDINKSLAIMATKQPRRGLIRNTSGMQQPGRISTWRSATWGRSTIVAPRVDDSSGGYISTVFKSAREKASNFRWPQLDIKEDLAQVEVDKVQP
        PIADEMIRDI KSLA+M+ KQPRRGLIR+T GMQ PGR+STWRSATWGRS  +APR DD  GGYISTVFKSARE ASNF+WPQLDI EDLA+VEV K QP
Subjt:  PIADEMIRDINKSLAIMATKQPRRGLIRNTSGMQQPGRISTWRSATWGRSTIVAPRVDDSSGGYISTVFKSAREKASNFRWPQLDIKEDLAQVEVDKVQP

Query:  QPNQPLVASASSSSSQPDMDTEELPLSSQVNDELQRNDQADVNLDSDLLSVSDTFDDFRADKEAKLEEWLGGSGGLNDIRDLSAGKGH
        +PNQP V SASSSSSQPDMD++ELPLS QVND LQ +D+ DV LD+D++SVSD FDDFRADKEAKL++WL GSG LNDIRDLSAGKGH
Subjt:  QPNQPLVASASSSSSQPDMDTEELPLSSQVNDELQRNDQADVNLDSDLLSVSDTFDDFRADKEAKLEEWLGGSGGLNDIRDLSAGKGH

XP_038884987.1 uncharacterized protein LOC120075565 isoform X1 [Benincasa hispida]7.3e-12986.97Show/hide
Query:  SDELPFTFEKMYGFSTVDGFVEIAESSAEMIKYIANEPSTGLFYIQQHTKNAVPNVINLKNSVVDKSHESTLHTQDSEDSITMLRSMKECGFPIADEMIR
        SDELPFTFE+MYGFSTVDGFVEIAESSAEMIKYIANEPSTGLFYIQQHTKNAVPNVINLKNSVVDKSHE+TLHT+DSEDSITMLRSMKECGFPIADEMIR
Subjt:  SDELPFTFEKMYGFSTVDGFVEIAESSAEMIKYIANEPSTGLFYIQQHTKNAVPNVINLKNSVVDKSHESTLHTQDSEDSITMLRSMKECGFPIADEMIR

Query:  DINKSLAIMATKQPRRGLIRNTSGM----QQPGRISTWRSATWGRSTIVAPRVDDSSGGYISTVFKSAREKASNFRWPQLDIKEDLAQVEVDKVQPQPNQ
        DI KSLAIM+TKQPRRGLIRNTSGM    QQPGR+STWRSATWGR  IVAP +DD SGGYISTVFKSAREKASNF+WPQL+I+EDLAQVEVDK+QPQP Q
Subjt:  DINKSLAIMATKQPRRGLIRNTSGM----QQPGRISTWRSATWGRSTIVAPRVDDSSGGYISTVFKSAREKASNFRWPQLDIKEDLAQVEVDKVQPQPNQ

Query:  PLVASASSSSSQPDMDTEELPLSSQVNDELQRNDQADVNLDSDLLSVSDTFDDFRADKEAKLEEWLGGSGGLNDIRDLSAGKGH
        P VA A+SSSSQPDMDTEELPLSSQVNDE QR DQ + NL++DLL VSD FDDFRADKEAKLEEWLG SGGLN+ RDL  GKGH
Subjt:  PLVASASSSSSQPDMDTEELPLSSQVNDELQRNDQADVNLDSDLLSVSDTFDDFRADKEAKLEEWLGGSGGLNDIRDLSAGKGH

TrEMBL top hitse value%identityAlignment
A0A5A7T575 Uncharacterized protein1.1e-11985.24Show/hide
Query:  MYGFSTVDGFVEIAESSAEMIKYIANEPSTGLFYIQQHTKNAVPNVINLKNSVVDKSHESTLHTQDSEDSITMLRSMKECGFPIADEMIRDINKSLAIMA
        M+GFSTVDGFVEIAESSAEMIKYIANEPSTGLFYIQQHTKNAVPNVINLKNSVVDKSHE+TLHT+DSEDSI MLRSMKECGFPIADEMIRDI KSLAIM+
Subjt:  MYGFSTVDGFVEIAESSAEMIKYIANEPSTGLFYIQQHTKNAVPNVINLKNSVVDKSHESTLHTQDSEDSITMLRSMKECGFPIADEMIRDINKSLAIMA

Query:  TKQPRRGLIRNTSGM--QQPGRISTWRSATWGRSTIVAPRVDDSSGGYISTVFKSAREKASNFRWPQLDIKEDLAQVEVDKVQPQPNQPLVASASSSSSQ
        TKQPRRGLI NTSGM  QQPGR+STWRSATWGR  I AP  +D SGGYISTVFKSAREKASNF+WPQLDIKEDLA VEVDK+QPQ  QP VAS +SSSSQ
Subjt:  TKQPRRGLIRNTSGM--QQPGRISTWRSATWGRSTIVAPRVDDSSGGYISTVFKSAREKASNFRWPQLDIKEDLAQVEVDKVQPQPNQPLVASASSSSSQ

Query:  PDMDTEELPLSSQVNDELQRNDQADVNLDSDLLSVSDTFDDFRADKEAKLEEWLGGSGGLNDIRDLSAGKG
        PDMD EELPLSSQVNDE Q++D+ D  L++DLLSVSD FDDFRADKEAKLEEWLGGS GLND+RDL AG G
Subjt:  PDMDTEELPLSSQVNDELQRNDQADVNLDSDLLSVSDTFDDFRADKEAKLEEWLGGSGGLNDIRDLSAGKG

A0A6J1C195 uncharacterized protein LOC111006508 isoform X16.9e-12583.8Show/hide
Query:  KIRSDELPFTFEKMYGFSTVDGFVEIAESSAEMIKYIANEPSTGLFYIQQHTKNAVPNVINLKNSVVDKSHESTLHTQDSEDSITMLRSMKECGFPIADE
        +IRSDELPFTFEKMYGFSTVDGFVEI ES AEMIKYIANEPSTGLFYIQQHT+NAVPNV+ L+N VVDKSHE+TLHT+DSEDSITMLRSMKE GFPIADE
Subjt:  KIRSDELPFTFEKMYGFSTVDGFVEIAESSAEMIKYIANEPSTGLFYIQQHTKNAVPNVINLKNSVVDKSHESTLHTQDSEDSITMLRSMKECGFPIADE

Query:  MIRDINKSLAIMATKQPRRGLIRNTSGMQQPGRISTWRSATWGRSTIVAPRVDDSSGGYISTVFKSAREKASNFRWPQLDIKEDLAQVEVDKVQPQPNQP
        MIRDI KSLAIM+TKQPRRGLI NTSG+Q+ GRISTWRSATWGRS IVAP  ++ SGGYISTVFKSAREKASNF+WPQLDIK+DLAQVEVDK+ PQ NQP
Subjt:  MIRDINKSLAIMATKQPRRGLIRNTSGMQQPGRISTWRSATWGRSTIVAPRVDDSSGGYISTVFKSAREKASNFRWPQLDIKEDLAQVEVDKVQPQPNQP

Query:  LVASASSSSSQPDM-DTEELPLSSQVNDELQRNDQADVNLDSDLLSVSDTFDDFRADKEAKLEEWLGGSGGLNDIRDLSAGKGH
         VASASSSSSQPD+  T ELPLSSQVNDELQR+DQ D +LD DLLS SD FDDFRADKEAKLEEWLGG+GGL+ + DLSAGK H
Subjt:  LVASASSSSSQPDM-DTEELPLSSQVNDELQRNDQADVNLDSDLLSVSDTFDDFRADKEAKLEEWLGGSGGLNDIRDLSAGKGH

A0A6J1FML8 uncharacterized protein LOC111445573 isoform X12.2e-12385.98Show/hide
Query:  MYGFSTVDGFVEIAESSAEMIKYIANEPSTGLFYIQQHTKNAVPNVINLKNSVVDKSHESTLHTQDSEDSITMLRSMKECGFPIADEMIRDINKSLAIMA
        M+GFSTVDGFVEIAESSAEMIKYIANEPSTGLFYIQQHTKNAVPNVINLKNSVVDKSHE+TLH +DSEDSITML+SMK+CGFPIADEMIRDI KSLA+M+
Subjt:  MYGFSTVDGFVEIAESSAEMIKYIANEPSTGLFYIQQHTKNAVPNVINLKNSVVDKSHESTLHTQDSEDSITMLRSMKECGFPIADEMIRDINKSLAIMA

Query:  TKQPRRGLIRNTSGMQQPGRISTWRSATWGRSTIVAPRVDDSSGGYISTVFKSAREKASNFRWPQLDIKEDLAQVEVDKVQPQPNQPLVASASSSSSQPD
        TKQPRRGLIRNTSG QQPGR+STWRSATWGRS IVAPR DD SGGYISTVFKSAREKASNF+WPQLDIKEDLA+VEVD++ PQPN+P VASASS+SSQPD
Subjt:  TKQPRRGLIRNTSGMQQPGRISTWRSATWGRSTIVAPRVDDSSGGYISTVFKSAREKASNFRWPQLDIKEDLAQVEVDKVQPQPNQPLVASASSSSSQPD

Query:  MDTEELPLSSQVNDELQRNDQADVNLDSDLLSVSDTFDDFRADKEAKLEEWLGGSGGLNDIRDL
        +DTEELPL+ QVNDELQR++Q DV +++DLLSVSD FDDFRADKEAKLEEWLGGSGGLND++++
Subjt:  MDTEELPLSSQVNDELQRNDQADVNLDSDLLSVSDTFDDFRADKEAKLEEWLGGSGGLNDIRDL

A0A6J1H0H0 uncharacterized protein LOC111458823 isoform X16.7e-12882.29Show/hide
Query:  MDVKEKIRSDELPFTFEKMYGFSTVDGFVEIAESSAEMIKYIANEPSTGLFYIQQHTKNAVPNVINLKNSVVDKSHESTLHTQDSEDSITMLRSMKECGF
        MD KE+IRSDELPFTFEKMYGFSTVDGFVEI ESSAEMIKYIANEPSTGLFYIQQHTKNAVPN++N+KNSV + S ESTLHT+DSEDSITMLRSMKECGF
Subjt:  MDVKEKIRSDELPFTFEKMYGFSTVDGFVEIAESSAEMIKYIANEPSTGLFYIQQHTKNAVPNVINLKNSVVDKSHESTLHTQDSEDSITMLRSMKECGF

Query:  PIADEMIRDINKSLAIMATKQPRRGLIRNTSGMQQPGRISTWRSATWGRSTIVAPRVDDSSGGYISTVFKSAREKASNFRWPQLDIKEDLAQVEVDKVQP
        PIADEMIRDI KSLA+M+ KQPRRGLIR+T GMQ PGR+STWRSATWGRS  +APR DD  GGYISTVFKSARE ASNF+WPQLDI EDLA+VEV K QP
Subjt:  PIADEMIRDINKSLAIMATKQPRRGLIRNTSGMQQPGRISTWRSATWGRSTIVAPRVDDSSGGYISTVFKSAREKASNFRWPQLDIKEDLAQVEVDKVQP

Query:  QPNQPLVASASSSSSQPDMDTEELPLSSQVNDELQRNDQADVNLDSDLLSVSDTFDDFRADKEAKLEEWLGGSGGLNDIRDLSAGKGH
        +PNQP V SASSSSSQPDMD++ELPLS QVND LQ +D+ DV LD+D++SVSD FDDFRADKEAKL++WL GSG LNDIRDLSAGKGH
Subjt:  QPNQPLVASASSSSSQPDMDTEELPLSSQVNDELQRNDQADVNLDSDLLSVSDTFDDFRADKEAKLEEWLGGSGGLNDIRDLSAGKGH

A0A6J1JWC1 uncharacterized protein LOC111489457 isoform X11.1e-12285.98Show/hide
Query:  MYGFSTVDGFVEIAESSAEMIKYIANEPSTGLFYIQQHTKNAVPNVINLKNSVVDKSHESTLHTQDSEDSITMLRSMKECGFPIADEMIRDINKSLAIMA
        M+GFSTVDGFVEIAESSAEMIKYIANEPSTGLFYIQ HTKNAVPNVINLKNSVVDKSHE+TLH +DSEDSITML+SMK+CGFPIADEMIRDI KSLA+M+
Subjt:  MYGFSTVDGFVEIAESSAEMIKYIANEPSTGLFYIQQHTKNAVPNVINLKNSVVDKSHESTLHTQDSEDSITMLRSMKECGFPIADEMIRDINKSLAIMA

Query:  TKQPRRGLIRNTSGMQQPGRISTWRSATWGRSTIVAPRVDDSSGGYISTVFKSAREKASNFRWPQLDIKEDLAQVEVDKVQPQPNQPLVASASSSSSQPD
        TKQPRRGLIRNTSG QQPGR+STWRSATWGRS IVAPR DD SGGYISTVFKSAREKASNF+WPQLDIKEDLA VEVD++ PQPN+P VASASS SSQPD
Subjt:  TKQPRRGLIRNTSGMQQPGRISTWRSATWGRSTIVAPRVDDSSGGYISTVFKSAREKASNFRWPQLDIKEDLAQVEVDKVQPQPNQPLVASASSSSSQPD

Query:  MDTEELPLSSQVNDELQRNDQADVNLDSDLLSVSDTFDDFRADKEAKLEEWLGGSGGLNDIRDL
        ++TEELPLS QVNDELQR+DQ DV++++DLLSVSD FDDFRADKEAKLEEWLGGSGGLND++++
Subjt:  MDTEELPLSSQVNDELQRNDQADVNLDSDLLSVSDTFDDFRADKEAKLEEWLGGSGGLNDIRDL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G39170.1 unknown protein1.3e-5143.87Show/hide
Query:  MYGFSTVDGFVEIAESSAEMIKYIANEPSTGLFYIQQHTKNAVPNVINLKNSVVDKSHESTLHTQDSEDSITMLRSMKECGFPIADEMIRDINKSLAIMA
        M+ FSTVDGF EI ES AEMIKYIANEPS GL+YIQQH +NA PNVINL N+V++KS E+ LHT+D EDSI M++SMK+CG PIADEMI DI  SLAIM+
Subjt:  MYGFSTVDGFVEIAESSAEMIKYIANEPSTGLFYIQQHTKNAVPNVINLKNSVVDKSHESTLHTQDSEDSITMLRSMKECGFPIADEMIRDINKSLAIMA

Query:  TKQPRRGLIRNTSGMQQPGRISTWRSATWGRSTIVAPRV--------DDSSGGYISTVFKSAREKASNFRWPQLDIKEDLAQVEVDKVQPQPNQPLVASA
        +KQPRRG+I N+             ++ W RS+ +  R         +  S  Y ++VF +A+EKASN +WPQLD KE                      
Subjt:  TKQPRRGLIRNTSGMQQPGRISTWRSATWGRSTIVAPRV--------DDSSGGYISTVFKSAREKASNFRWPQLDIKEDLAQVEVDKVQPQPNQPLVASA

Query:  SSSSSQPDMDTEELPLSSQVNDELQRNDQADVNLDSDLLSVSDTFDDFRADKEAKLEEWLGGSGGLNDI
          S + P++           ++EL+  ++ D      ++  +  F++F+A KEA L+ WLG   G  D+
Subjt:  SSSSSQPDMDTEELPLSSQVNDELQRNDQADVNLDSDLLSVSDTFDDFRADKEAKLEEWLGGSGGLNDI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTTGAGACGAACGGTTCATCGGAGAAAAAAATTAGTGTTGGCGTTGGATCTGAAACATCTAGGGCAAAATCGGCAGATCAACTCAATTTCCTTTTTCATCATCATTT
CTCACATTCTGAGCAAAAGCAGAGCACTAATTCAGCTTTCTCTCTCTTTCTCATCACCATTCCGTTAGAGCTCTGGACTCAGTCCATCTCCGTCGTCGGTCGCCGAATCA
ACCCATCAAACCGTGGCGGCTTTCAATCTGTAATCTCTCGCCTTCTTTCTTCATTGGGCTGTGCAGTGTCGATTGTTACCCCTCTTGGGTTCATTAAGGCCGTCGGTTTC
CAATCAAGCGACGTTCCGGTCTTGCTCTCCCTCCCTTCCTGGTTGTCCTCATCTACTGTAGTCTGCTGTAAAGGCTTGAAATGGGAATTTTTGATGGCTTGCCAGTCCCT
CCCGAGAAAGCTACTTAAGAAAGTCCTTCCCTCCACGGTCTCGGAGGCCCCGGGGGCGTTTGTCAATGGAAGGCAAATTCTTGATCAAGCCTTGATTGCAAACGAGGCCT
CCGAGACCGTGGAGGGAAGGACTTTCTTAGTAAGTTTAGCTTCAGCTTTAGATAAAGTCACCGAAAGTGAAGAGCTGATGAACAACGGTTCACCTCTTCACAGTTTCATC
TACTTCTTGCGAGGCATATTATCTCTTAGGCTTTTCAAGTCAATTCTTGTTGGATCCTTTCATCCAAATGTATTAACTTTCAAAGCAAGCTTCCATGCTTTTGGAAAATA
TTACAAGCACGGAGCACCCCCAAGTTTTGATGGTAATGGCTCATACTTTTGGACATCGTATTTGGAAGTCAAAGTTTCCGCAACAATTGGTATACTGTGGCATTTTAGAG
ATGCATATGCATTCCCTTCCCTAAGGATATCTTGTAGGTTCTTAACATTAGCAACACAATTTAGCCGAGTTATGTGGCAAATAACCCAAAGCCAATCCCAAGGAACAGTG
AAGCTTCATAGGTTCCTTTTGTCTAGTGCAAGTAGTTTGCATCTTCACAAGCATGTTGTCTATTTCACAGAGCCTTTTTTAGAGATTGTGCCCATATCGTTATGCCTTTC
GTTTGAGTTATGTTGGCTAATCACGTCTGTAATGTACCCCTGCCAATGCCAAAAGTTAGCAAGACAGGCAAATATGGACGTCAAGGAGAAGATAAGAAGTGATGAACTGC
CATTCACCTTTGAAAAGATGTATGGATTCTCCACAGTTGATGGCTTTGTGGAGATAGCTGAAAGCTCGGCCGAGATGATCAAGTACATTGCAAATGAACCTTCAACTGGG
CTTTTCTACATTCAGCAGCATACCAAAAATGCTGTTCCCAATGTTATCAATCTGAAGAATAGTGTGGTGGACAAATCTCATGAATCAACTTTGCACACTCAAGATTCTGA
GGATTCAATCACCATGTTGAGGTCGATGAAAGAATGTGGGTTTCCTATTGCTGATGAGATGATTAGAGACATAAATAAGTCCCTTGCTATAATGGCAACCAAACAACCAA
GAAGGGGCTTGATCCGTAATACTTCTGGTATGCAGCAGCCAGGGAGAATAAGCACCTGGAGATCGGCCACTTGGGGGCGAAGCACAATTGTTGCCCCACGCGTCGACGAC
AGTAGTGGTGGTTATATTTCAACGGTATTCAAGTCAGCTAGAGAAAAGGCGAGCAACTTTAGGTGGCCACAGCTTGACATCAAGGAAGATCTTGCACAAGTTGAAGTCGA
CAAGGTACAGCCACAACCTAACCAACCATTAGTTGCATCTGCTAGTTCTAGTTCATCACAGCCAGATATGGACACGGAGGAGTTGCCTCTGTCTAGTCAAGTTAATGATG
AGTTGCAACGAAATGACCAGGCTGATGTCAATTTGGATAGCGATTTACTTTCGGTGTCCGATACATTTGACGATTTCAGGGCCGATAAAGAAGCAAAGTTGGAGGAGTGG
TTGGGAGGGTCTGGCGGCTTGAATGATATAAGAGATTTGAGCGCAGGGAAAGGTCATTAA
mRNA sequenceShow/hide mRNA sequence
ATGTTTGAGACGAACGGTTCATCGGAGAAAAAAATTAGTGTTGGCGTTGGATCTGAAACATCTAGGGCAAAATCGGCAGATCAACTCAATTTCCTTTTTCATCATCATTT
CTCACATTCTGAGCAAAAGCAGAGCACTAATTCAGCTTTCTCTCTCTTTCTCATCACCATTCCGTTAGAGCTCTGGACTCAGTCCATCTCCGTCGTCGGTCGCCGAATCA
ACCCATCAAACCGTGGCGGCTTTCAATCTGTAATCTCTCGCCTTCTTTCTTCATTGGGCTGTGCAGTGTCGATTGTTACCCCTCTTGGGTTCATTAAGGCCGTCGGTTTC
CAATCAAGCGACGTTCCGGTCTTGCTCTCCCTCCCTTCCTGGTTGTCCTCATCTACTGTAGTCTGCTGTAAAGGCTTGAAATGGGAATTTTTGATGGCTTGCCAGTCCCT
CCCGAGAAAGCTACTTAAGAAAGTCCTTCCCTCCACGGTCTCGGAGGCCCCGGGGGCGTTTGTCAATGGAAGGCAAATTCTTGATCAAGCCTTGATTGCAAACGAGGCCT
CCGAGACCGTGGAGGGAAGGACTTTCTTAGTAAGTTTAGCTTCAGCTTTAGATAAAGTCACCGAAAGTGAAGAGCTGATGAACAACGGTTCACCTCTTCACAGTTTCATC
TACTTCTTGCGAGGCATATTATCTCTTAGGCTTTTCAAGTCAATTCTTGTTGGATCCTTTCATCCAAATGTATTAACTTTCAAAGCAAGCTTCCATGCTTTTGGAAAATA
TTACAAGCACGGAGCACCCCCAAGTTTTGATGGTAATGGCTCATACTTTTGGACATCGTATTTGGAAGTCAAAGTTTCCGCAACAATTGGTATACTGTGGCATTTTAGAG
ATGCATATGCATTCCCTTCCCTAAGGATATCTTGTAGGTTCTTAACATTAGCAACACAATTTAGCCGAGTTATGTGGCAAATAACCCAAAGCCAATCCCAAGGAACAGTG
AAGCTTCATAGGTTCCTTTTGTCTAGTGCAAGTAGTTTGCATCTTCACAAGCATGTTGTCTATTTCACAGAGCCTTTTTTAGAGATTGTGCCCATATCGTTATGCCTTTC
GTTTGAGTTATGTTGGCTAATCACGTCTGTAATGTACCCCTGCCAATGCCAAAAGTTAGCAAGACAGGCAAATATGGACGTCAAGGAGAAGATAAGAAGTGATGAACTGC
CATTCACCTTTGAAAAGATGTATGGATTCTCCACAGTTGATGGCTTTGTGGAGATAGCTGAAAGCTCGGCCGAGATGATCAAGTACATTGCAAATGAACCTTCAACTGGG
CTTTTCTACATTCAGCAGCATACCAAAAATGCTGTTCCCAATGTTATCAATCTGAAGAATAGTGTGGTGGACAAATCTCATGAATCAACTTTGCACACTCAAGATTCTGA
GGATTCAATCACCATGTTGAGGTCGATGAAAGAATGTGGGTTTCCTATTGCTGATGAGATGATTAGAGACATAAATAAGTCCCTTGCTATAATGGCAACCAAACAACCAA
GAAGGGGCTTGATCCGTAATACTTCTGGTATGCAGCAGCCAGGGAGAATAAGCACCTGGAGATCGGCCACTTGGGGGCGAAGCACAATTGTTGCCCCACGCGTCGACGAC
AGTAGTGGTGGTTATATTTCAACGGTATTCAAGTCAGCTAGAGAAAAGGCGAGCAACTTTAGGTGGCCACAGCTTGACATCAAGGAAGATCTTGCACAAGTTGAAGTCGA
CAAGGTACAGCCACAACCTAACCAACCATTAGTTGCATCTGCTAGTTCTAGTTCATCACAGCCAGATATGGACACGGAGGAGTTGCCTCTGTCTAGTCAAGTTAATGATG
AGTTGCAACGAAATGACCAGGCTGATGTCAATTTGGATAGCGATTTACTTTCGGTGTCCGATACATTTGACGATTTCAGGGCCGATAAAGAAGCAAAGTTGGAGGAGTGG
TTGGGAGGGTCTGGCGGCTTGAATGATATAAGAGATTTGAGCGCAGGGAAAGGTCATTAA
Protein sequenceShow/hide protein sequence
MFETNGSSEKKISVGVGSETSRAKSADQLNFLFHHHFSHSEQKQSTNSAFSLFLITIPLELWTQSISVVGRRINPSNRGGFQSVISRLLSSLGCAVSIVTPLGFIKAVGF
QSSDVPVLLSLPSWLSSSTVVCCKGLKWEFLMACQSLPRKLLKKVLPSTVSEAPGAFVNGRQILDQALIANEASETVEGRTFLVSLASALDKVTESEELMNNGSPLHSFI
YFLRGILSLRLFKSILVGSFHPNVLTFKASFHAFGKYYKHGAPPSFDGNGSYFWTSYLEVKVSATIGILWHFRDAYAFPSLRISCRFLTLATQFSRVMWQITQSQSQGTV
KLHRFLLSSASSLHLHKHVVYFTEPFLEIVPISLCLSFELCWLITSVMYPCQCQKLARQANMDVKEKIRSDELPFTFEKMYGFSTVDGFVEIAESSAEMIKYIANEPSTG
LFYIQQHTKNAVPNVINLKNSVVDKSHESTLHTQDSEDSITMLRSMKECGFPIADEMIRDINKSLAIMATKQPRRGLIRNTSGMQQPGRISTWRSATWGRSTIVAPRVDD
SSGGYISTVFKSAREKASNFRWPQLDIKEDLAQVEVDKVQPQPNQPLVASASSSSSQPDMDTEELPLSSQVNDELQRNDQADVNLDSDLLSVSDTFDDFRADKEAKLEEW
LGGSGGLNDIRDLSAGKGH