CuGenDBv2

Spg039456 (gene) of Sponge gourd (cylindrica) v1 genome

Gene ID: Spg039456
Organism: Luffa cylindrica (Sponge gourd (cylindrica) v1)
Description: Unknown protein
Genome location: scaffold10:41705283..41707914
RNA-Seq Expression: Spg039456
Synteny: Spg039456
Gene Ontology terms: GO:0005765 - lysosomal membrane (cellular component)
InterPro domains: IPR019320 - BLOC-1-related complex subunit 8
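
The genome location in the record above is written as `scaffold:start..end`. As a minimal sketch, the snippet below splits such a string into its parts and computes the length of the span. It assumes the coordinates are 1-based and inclusive, which the page itself does not state; the names `Location` and `parse_location` are hypothetical helpers and not part of any CuGenDB tool.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A parsed 'seqid:start..end' genome location (hypothetical helper type)."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive, so add 1.
        return self.end - self.start + 1


# Sequence name, a colon, then two integers separated by '..'
_LOCATION_RE = re.compile(r"^(?P<seqid>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)$")


def parse_location(text: str) -> Location:
    """Parse a location string such as 'scaffold10:41705283..41707914'."""
    match = _LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"not a 'seqid:start..end' location: {text!r}")
    start, end = int(match["start"]), int(match["end"])
    if start > end:
        raise ValueError(f"start is greater than end in {text!r}")
    return Location(match["seqid"], start, end)


loc = parse_location("scaffold10:41705283..41707914")
print(loc.seqid, loc.start, loc.end, loc.length)
```

Under the inclusive-coordinate assumption, the span for this gene comes out at 2632 bases.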


Homology
GenBank top hits | e value | % identity
KAG6578805.1 hypothetical protein SDJN03_23253, partial [Cucurbita argyrosperma subsp. sororia] | 4.8e-125 | 87.12
Query:  MYGFSTVDGFVEIAESSAEMIKYIANEPSTGLFYIQQHTKNAVPNVINLKNSVVDKSHESTLHTQDSEDSITMLRSMKECGFPIADEMIRDINKSLAIMA
        M+GFSTVDGFVEIAESSAEMIKYIANEPSTGLFY+QQHTKNAVPNVINLKNSVVDKSHE+TLH +DSEDSITML+SMK+CGFPIADEMIRDI KSLA+M+
Subjt:  MYGFSTVDGFVEIAESSAEMIKYIANEPSTGLFYIQQHTKNAVPNVINLKNSVVDKSHESTLHTQDSEDSITMLRSMKECGFPIADEMIRDINKSLAIMA

Query:  TKQPRRGLIRNTSGMQQPGRISTWRSATWGRSTIVAPRVDDSSGGYISTVFKSAREKASNFRWPQLDIKEDLAQVEVDKVQPQPNQPLVASASSSSSQPD
        TKQPRRGLIRNTSG QQPGR+STWRSATWGRS IVAPR DD SGGYISTVFKSAREKASNF+WPQLDIKEDLAQVEVD++ PQPN+P VASASS+SSQPD
Subjt:  TKQPRRGLIRNTSGMQQPGRISTWRSATWGRSTIVAPRVDDSSGGYISTVFKSAREKASNFRWPQLDIKEDLAQVEVDKVQPQPNQPLVASASSSSSQPD

Query:  MDTEELPLSSQVNDELQRDDQADVNLDSDLLSVSDNFDDFRADKEAKLEEWLGGSGGLNDIRDL
        +DTEELPL+ QVNDELQRDDQ DV +++DLLSVSDNFDDFRADKEAKLEEWLGGSGGLND++++
Subjt:  MDTEELPLSSQVNDELQRDDQADVNLDSDLLSVSDNFDDFRADKEAKLEEWLGGSGGLNDIRDL

XP_022939806.1 uncharacterized protein LOC111445573 isoform X1 [Cucurbita moschata] | 3.1e-124 | 86.74
Query:  MYGFSTVDGFVEIAESSAEMIKYIANEPSTGLFYIQQHTKNAVPNVINLKNSVVDKSHESTLHTQDSEDSITMLRSMKECGFPIADEMIRDINKSLAIMA
        M+GFSTVDGFVEIAESSAEMIKYIANEPSTGLFYIQQHTKNAVPNVINLKNSVVDKSHE+TLH +DSEDSITML+SMK+CGFPIADEMIRDI KSLA+M+
Subjt:  MYGFSTVDGFVEIAESSAEMIKYIANEPSTGLFYIQQHTKNAVPNVINLKNSVVDKSHESTLHTQDSEDSITMLRSMKECGFPIADEMIRDINKSLAIMA

Query:  TKQPRRGLIRNTSGMQQPGRISTWRSATWGRSTIVAPRVDDSSGGYISTVFKSAREKASNFRWPQLDIKEDLAQVEVDKVQPQPNQPLVASASSSSSQPD
        TKQPRRGLIRNTSG QQPGR+STWRSATWGRS IVAPR DD SGGYISTVFKSAREKASNF+WPQLDIKEDLA+VEVD++ PQPN+P VASASS+SSQPD
Subjt:  TKQPRRGLIRNTSGMQQPGRISTWRSATWGRSTIVAPRVDDSSGGYISTVFKSAREKASNFRWPQLDIKEDLAQVEVDKVQPQPNQPLVASASSSSSQPD

Query:  MDTEELPLSSQVNDELQRDDQADVNLDSDLLSVSDNFDDFRADKEAKLEEWLGGSGGLNDIRDL
        +DTEELPL+ QVNDELQRD+Q DV +++DLLSVSDNFDDFRADKEAKLEEWLGGSGGLND++++
Subjt:  MDTEELPLSSQVNDELQRDDQADVNLDSDLLSVSDNFDDFRADKEAKLEEWLGGSGGLNDIRDL

XP_022993444.1 uncharacterized protein LOC111489457 isoform X1 [Cucurbita maxima] | 1.6e-123 | 86.74
Query:  MYGFSTVDGFVEIAESSAEMIKYIANEPSTGLFYIQQHTKNAVPNVINLKNSVVDKSHESTLHTQDSEDSITMLRSMKECGFPIADEMIRDINKSLAIMA
        M+GFSTVDGFVEIAESSAEMIKYIANEPSTGLFYIQ HTKNAVPNVINLKNSVVDKSHE+TLH +DSEDSITML+SMK+CGFPIADEMIRDI KSLA+M+
Subjt:  MYGFSTVDGFVEIAESSAEMIKYIANEPSTGLFYIQQHTKNAVPNVINLKNSVVDKSHESTLHTQDSEDSITMLRSMKECGFPIADEMIRDINKSLAIMA

Query:  TKQPRRGLIRNTSGMQQPGRISTWRSATWGRSTIVAPRVDDSSGGYISTVFKSAREKASNFRWPQLDIKEDLAQVEVDKVQPQPNQPLVASASSSSSQPD
        TKQPRRGLIRNTSG QQPGR+STWRSATWGRS IVAPR DD SGGYISTVFKSAREKASNF+WPQLDIKEDLA VEVD++ PQPN+P VASASS SSQPD
Subjt:  TKQPRRGLIRNTSGMQQPGRISTWRSATWGRSTIVAPRVDDSSGGYISTVFKSAREKASNFRWPQLDIKEDLAQVEVDKVQPQPNQPLVASASSSSSQPD

Query:  MDTEELPLSSQVNDELQRDDQADVNLDSDLLSVSDNFDDFRADKEAKLEEWLGGSGGLNDIRDL
        ++TEELPLS QVNDELQRDDQ DV++++DLLSVSDNFDDFRADKEAKLEEWLGGSGGLND++++
Subjt:  MDTEELPLSSQVNDELQRDDQADVNLDSDLLSVSDNFDDFRADKEAKLEEWLGGSGGLNDIRDL

XP_023551457.1 uncharacterized protein LOC111809261 isoform X1 [Cucurbita pepo subsp. pepo] | 5.3e-124 | 86.36
Query:  MYGFSTVDGFVEIAESSAEMIKYIANEPSTGLFYIQQHTKNAVPNVINLKNSVVDKSHESTLHTQDSEDSITMLRSMKECGFPIADEMIRDINKSLAIMA
        M+GFSTVDGFVEIAESSAEMIKYIANEPSTGLFY+QQHTKNAVPNVINLKNSVVDKSHE+TLH +DSEDSITML+SMK+CGFPIADEMIRDI KSLA+M+
Subjt:  MYGFSTVDGFVEIAESSAEMIKYIANEPSTGLFYIQQHTKNAVPNVINLKNSVVDKSHESTLHTQDSEDSITMLRSMKECGFPIADEMIRDINKSLAIMA

Query:  TKQPRRGLIRNTSGMQQPGRISTWRSATWGRSTIVAPRVDDSSGGYISTVFKSAREKASNFRWPQLDIKEDLAQVEVDKVQPQPNQPLVASASSSSSQPD
        TK PRRGLIRNTSG QQPGR+STWRSATWGRS IVAPR DD SGGYISTVFKSAREKASNF+WPQLDIKEDLA+VEVD++ PQPN+P VASASS+SSQPD
Subjt:  TKQPRRGLIRNTSGMQQPGRISTWRSATWGRSTIVAPRVDDSSGGYISTVFKSAREKASNFRWPQLDIKEDLAQVEVDKVQPQPNQPLVASASSSSSQPD

Query:  MDTEELPLSSQVNDELQRDDQADVNLDSDLLSVSDNFDDFRADKEAKLEEWLGGSGGLNDIRDL
        +DTEELPL+ QVNDELQRDDQ DV +++DLLSVSDNFDDFRADKEAKLEEWLGGSGGLND++++
Subjt:  MDTEELPLSSQVNDELQRDDQADVNLDSDLLSVSDNFDDFRADKEAKLEEWLGGSGGLNDIRDL

XP_038884987.1 uncharacterized protein LOC120075565 isoform X1 [Benincasa hispida] | 4.5e-123 | 87.18
Query:  MYGFSTVDGFVEIAESSAEMIKYIANEPSTGLFYIQQHTKNAVPNVINLKNSVVDKSHESTLHTQDSEDSITMLRSMKECGFPIADEMIRDINKSLAIMA
        MYGFSTVDGFVEIAESSAEMIKYIANEPSTGLFYIQQHTKNAVPNVINLKNSVVDKSHE+TLHT+DSEDSITMLRSMKECGFPIADEMIRDI KSLAIM+
Subjt:  MYGFSTVDGFVEIAESSAEMIKYIANEPSTGLFYIQQHTKNAVPNVINLKNSVVDKSHESTLHTQDSEDSITMLRSMKECGFPIADEMIRDINKSLAIMA

Query:  TKQPRRGLIRNTSGM----QQPGRISTWRSATWGRSTIVAPRVDDSSGGYISTVFKSAREKASNFRWPQLDIKEDLAQVEVDKVQPQPNQPLVASASSSS
        TKQPRRGLIRNTSGM    QQPGR+STWRSATWGR  IVAP +DD SGGYISTVFKSAREKASNF+WPQL+I+EDLAQVEVDK+QPQP QP VA A+SSS
Subjt:  TKQPRRGLIRNTSGM----QQPGRISTWRSATWGRSTIVAPRVDDSSGGYISTVFKSAREKASNFRWPQLDIKEDLAQVEVDKVQPQPNQPLVASASSSS

Query:  SQPDMDTEELPLSSQVNDELQRDDQADVNLDSDLLSVSDNFDDFRADKEAKLEEWLGGSGGLNDIRDLSAGKG
        SQPDMDTEELPLSSQVNDE QR+DQ + NL++DLL VSDNFDDFRADKEAKLEEWLG SGGLN+ RDL  GKG
Subjt:  SQPDMDTEELPLSSQVNDELQRDDQADVNLDSDLLSVSDNFDDFRADKEAKLEEWLGGSGGLNDIRDLSAGKG

TrEMBL top hits | e value | % identity
A0A1S3C3G0 uncharacterized protein LOC103496452 isoform X1 | 1.7e-120 | 86.25
Query:  MYGFSTVDGFVEIAESSAEMIKYIANEPSTGLFYIQQHTKNAVPNVINLKNSVVDKSHESTLHTQDSEDSITMLRSMKECGFPIADEMIRDINKSLAIMA
        M+GFSTVDGFVEIAESSAEMIKYIANEPSTGLFYIQQHTKNAVPNVINLKNSVVDKSHE+TLHT+DSEDSI MLRSMKECGFPIADEMIRDI KSLAIM+
Subjt:  MYGFSTVDGFVEIAESSAEMIKYIANEPSTGLFYIQQHTKNAVPNVINLKNSVVDKSHESTLHTQDSEDSITMLRSMKECGFPIADEMIRDINKSLAIMA

Query:  TKQPRRGLIRNTSGM--QQPGRISTWRSATWGRSTIVAPRVDDSSGGYISTVFKSAREKASNFRWPQLDIKEDLAQVEVDKVQPQPNQPLVASASSSSSQ
        TKQPRRGLI NTSGM  QQPGR+STWRSATWGR  I AP  +D SGGYISTVFKSAREKASNF+WPQLDIKEDLA VEVDK+QPQ  QP VAS +SSSSQ
Subjt:  TKQPRRGLIRNTSGM--QQPGRISTWRSATWGRSTIVAPRVDDSSGGYISTVFKSAREKASNFRWPQLDIKEDLAQVEVDKVQPQPNQPLVASASSSSSQ

Query:  PDMDTEELPLSSQVNDELQRDDQADVNLDSDLLSVSDNFDDFRADKEAKLEEWLGGSGGLNDIRDLSAG
        PDMD EELPLSSQVNDE Q+DD+ D  L++DLLSVSDNFDDFRADKEAKLEEWLGGS GLND+RDL AG
Subjt:  PDMDTEELPLSSQVNDELQRDDQADVNLDSDLLSVSDNFDDFRADKEAKLEEWLGGSGGLNDIRDLSAG

A0A5A7T575 Uncharacterized protein | 6.0e-121 | 85.98
Query:  MYGFSTVDGFVEIAESSAEMIKYIANEPSTGLFYIQQHTKNAVPNVINLKNSVVDKSHESTLHTQDSEDSITMLRSMKECGFPIADEMIRDINKSLAIMA
        M+GFSTVDGFVEIAESSAEMIKYIANEPSTGLFYIQQHTKNAVPNVINLKNSVVDKSHE+TLHT+DSEDSI MLRSMKECGFPIADEMIRDI KSLAIM+
Subjt:  MYGFSTVDGFVEIAESSAEMIKYIANEPSTGLFYIQQHTKNAVPNVINLKNSVVDKSHESTLHTQDSEDSITMLRSMKECGFPIADEMIRDINKSLAIMA

Query:  TKQPRRGLIRNTSGM--QQPGRISTWRSATWGRSTIVAPRVDDSSGGYISTVFKSAREKASNFRWPQLDIKEDLAQVEVDKVQPQPNQPLVASASSSSSQ
        TKQPRRGLI NTSGM  QQPGR+STWRSATWGR  I AP  +D SGGYISTVFKSAREKASNF+WPQLDIKEDLA VEVDK+QPQ  QP VAS +SSSSQ
Subjt:  TKQPRRGLIRNTSGM--QQPGRISTWRSATWGRSTIVAPRVDDSSGGYISTVFKSAREKASNFRWPQLDIKEDLAQVEVDKVQPQPNQPLVASASSSSSQ

Query:  PDMDTEELPLSSQVNDELQRDDQADVNLDSDLLSVSDNFDDFRADKEAKLEEWLGGSGGLNDIRDLSAGKG
        PDMD EELPLSSQVNDE Q+DD+ D  L++DLLSVSDNFDDFRADKEAKLEEWLGGS GLND+RDL AG G
Subjt:  PDMDTEELPLSSQVNDELQRDDQADVNLDSDLLSVSDNFDDFRADKEAKLEEWLGGSGGLNDIRDLSAGKG

A0A6J1FML8 uncharacterized protein LOC111445573 isoform X1 | 1.5e-124 | 86.74
Query:  MYGFSTVDGFVEIAESSAEMIKYIANEPSTGLFYIQQHTKNAVPNVINLKNSVVDKSHESTLHTQDSEDSITMLRSMKECGFPIADEMIRDINKSLAIMA
        M+GFSTVDGFVEIAESSAEMIKYIANEPSTGLFYIQQHTKNAVPNVINLKNSVVDKSHE+TLH +DSEDSITML+SMK+CGFPIADEMIRDI KSLA+M+
Subjt:  MYGFSTVDGFVEIAESSAEMIKYIANEPSTGLFYIQQHTKNAVPNVINLKNSVVDKSHESTLHTQDSEDSITMLRSMKECGFPIADEMIRDINKSLAIMA

Query:  TKQPRRGLIRNTSGMQQPGRISTWRSATWGRSTIVAPRVDDSSGGYISTVFKSAREKASNFRWPQLDIKEDLAQVEVDKVQPQPNQPLVASASSSSSQPD
        TKQPRRGLIRNTSG QQPGR+STWRSATWGRS IVAPR DD SGGYISTVFKSAREKASNF+WPQLDIKEDLA+VEVD++ PQPN+P VASASS+SSQPD
Subjt:  TKQPRRGLIRNTSGMQQPGRISTWRSATWGRSTIVAPRVDDSSGGYISTVFKSAREKASNFRWPQLDIKEDLAQVEVDKVQPQPNQPLVASASSSSSQPD

Query:  MDTEELPLSSQVNDELQRDDQADVNLDSDLLSVSDNFDDFRADKEAKLEEWLGGSGGLNDIRDL
        +DTEELPL+ QVNDELQRD+Q DV +++DLLSVSDNFDDFRADKEAKLEEWLGGSGGLND++++
Subjt:  MDTEELPLSSQVNDELQRDDQADVNLDSDLLSVSDNFDDFRADKEAKLEEWLGGSGGLNDIRDL

A0A6J1H0H0 uncharacterized protein LOC111458823 isoform X1 | 2.8e-118 | 82.16
Query:  MYGFSTVDGFVEIAESSAEMIKYIANEPSTGLFYIQQHTKNAVPNVINLKNSVVDKSHESTLHTQDSEDSITMLRSMKECGFPIADEMIRDINKSLAIMA
        MYGFSTVDGFVEI ESSAEMIKYIANEPSTGLFYIQQHTKNAVPN++N+KNSV + S ESTLHT+DSEDSITMLRSMKECGFPIADEMIRDI KSLA+M+
Subjt:  MYGFSTVDGFVEIAESSAEMIKYIANEPSTGLFYIQQHTKNAVPNVINLKNSVVDKSHESTLHTQDSEDSITMLRSMKECGFPIADEMIRDINKSLAIMA

Query:  TKQPRRGLIRNTSGMQQPGRISTWRSATWGRSTIVAPRVDDSSGGYISTVFKSAREKASNFRWPQLDIKEDLAQVEVDKVQPQPNQPLVASASSSSSQPD
         KQPRRGLIR+T GMQ PGR+STWRSATWGRS  +APR DD  GGYISTVFKSARE ASNF+WPQLDI EDLA+VEV K QP+PNQP V SASSSSSQPD
Subjt:  TKQPRRGLIRNTSGMQQPGRISTWRSATWGRSTIVAPRVDDSSGGYISTVFKSAREKASNFRWPQLDIKEDLAQVEVDKVQPQPNQPLVASASSSSSQPD

Query:  MDTEELPLSSQVNDELQRDDQADVNLDSDLLSVSDNFDDFRADKEAKLEEWLGGSGGLNDIRDLSAGKG
        MD++ELPLS QVND LQ DD+ DV LD+D++SVSD FDDFRADKEAKL++WL GSG LNDIRDLSAGKG
Subjt:  MDTEELPLSSQVNDELQRDDQADVNLDSDLLSVSDNFDDFRADKEAKLEEWLGGSGGLNDIRDLSAGKG

A0A6J1JWC1 uncharacterized protein LOC111489457 isoform X1 | 7.5e-124 | 86.74
Query:  MYGFSTVDGFVEIAESSAEMIKYIANEPSTGLFYIQQHTKNAVPNVINLKNSVVDKSHESTLHTQDSEDSITMLRSMKECGFPIADEMIRDINKSLAIMA
        M+GFSTVDGFVEIAESSAEMIKYIANEPSTGLFYIQ HTKNAVPNVINLKNSVVDKSHE+TLH +DSEDSITML+SMK+CGFPIADEMIRDI KSLA+M+
Subjt:  MYGFSTVDGFVEIAESSAEMIKYIANEPSTGLFYIQQHTKNAVPNVINLKNSVVDKSHESTLHTQDSEDSITMLRSMKECGFPIADEMIRDINKSLAIMA

Query:  TKQPRRGLIRNTSGMQQPGRISTWRSATWGRSTIVAPRVDDSSGGYISTVFKSAREKASNFRWPQLDIKEDLAQVEVDKVQPQPNQPLVASASSSSSQPD
        TKQPRRGLIRNTSG QQPGR+STWRSATWGRS IVAPR DD SGGYISTVFKSAREKASNF+WPQLDIKEDLA VEVD++ PQPN+P VASASS SSQPD
Subjt:  TKQPRRGLIRNTSGMQQPGRISTWRSATWGRSTIVAPRVDDSSGGYISTVFKSAREKASNFRWPQLDIKEDLAQVEVDKVQPQPNQPLVASASSSSSQPD

Query:  MDTEELPLSSQVNDELQRDDQADVNLDSDLLSVSDNFDDFRADKEAKLEEWLGGSGGLNDIRDL
        ++TEELPLS QVNDELQRDDQ DV++++DLLSVSDNFDDFRADKEAKLEEWLGGSGGLND++++
Subjt:  MDTEELPLSSQVNDELQRDDQADVNLDSDLLSVSDNFDDFRADKEAKLEEWLGGSGGLNDIRDL

SwissProt top hits | e value | % identity
No hits found

Arabidopsis top hits | e value | % identity
AT2G39170.1 unknown protein | 7.6e-52 | 43.87
Query:  MYGFSTVDGFVEIAESSAEMIKYIANEPSTGLFYIQQHTKNAVPNVINLKNSVVDKSHESTLHTQDSEDSITMLRSMKECGFPIADEMIRDINKSLAIMA
        M+ FSTVDGF EI ES AEMIKYIANEPS GL+YIQQH +NA PNVINL N+V++KS E+ LHT+D EDSI M++SMK+CG PIADEMI DI  SLAIM+
Subjt:  MYGFSTVDGFVEIAESSAEMIKYIANEPSTGLFYIQQHTKNAVPNVINLKNSVVDKSHESTLHTQDSEDSITMLRSMKECGFPIADEMIRDINKSLAIMA

Query:  TKQPRRGLIRNTSGMQQPGRISTWRSATWGRSTIVAPRV--------DDSSGGYISTVFKSAREKASNFRWPQLDIKEDLAQVEVDKVQPQPNQPLVASA
        +KQPRRG+I N+             ++ W RS+ +  R         +  S  Y ++VF +A+EKASN +WPQLD KE                      
Subjt:  TKQPRRGLIRNTSGMQQPGRISTWRSATWGRSTIVAPRV--------DDSSGGYISTVFKSAREKASNFRWPQLDIKEDLAQVEVDKVQPQPNQPLVASA

Query:  SSSSSQPDMDTEELPLSSQVNDELQRDDQADVNLDSDLLSVSDNFDDFRADKEAKLEEWLGGSGGLNDI
          S + P++           ++EL+ +++ D      ++  +  F++F+A KEA L+ WLG   G  D+
Subjt:  SSSSSQPDMDTEELPLSSQVNDELQRDDQADVNLDSDLLSVSDNFDDFRADKEAKLEEWLGGSGGLNDI


Sequences
CDS sequence
ATGTATGGATTCTCCACAGTTGATGGCTTTGTGGAGATAGCTGAAAGCTCGGCCGAGATGATCAAGTACATTGCAAATGAACCTTCAACTGGGCTTTTCTACATTCAGCA
GCATACCAAAAATGCTGTTCCCAATGTTATCAATCTGAAGAATAGTGTGGTGGACAAATCTCATGAATCAACTTTGCACACTCAAGATTCCGAGGATTCAATCACCATGT
TGAGGTCGATGAAAGAATGTGGGTTTCCTATTGCTGATGAGATGATTAGAGACATAAACAAGTCCCTTGCTATAATGGCAACCAAACAACCAAGAAGGGGCTTGATCCGT
AATACTTCTGGTATGCAGCAGCCAGGGAGAATAAGCACCTGGAGATCGGCCACTTGGGGGCGAAGCACAATTGTTGCCCCACGCGTCGACGACAGTAGTGGTGGTTATAT
TTCAACAGTGTTCAAGTCAGCTAGAGAAAAGGCGAGCAACTTTAGGTGGCCACAGCTTGACATCAAGGAAGATCTTGCACAAGTTGAAGTCGACAAGGTACAGCCACAAC
CTAACCAACCATTAGTTGCATCTGCTAGTTCTAGTTCATCACAGCCAGATATGGACACGGAGGAGTTGCCTCTGTCTAGTCAAGTTAATGATGAGTTGCAACGAGACGAC
CAGGCTGATGTCAATTTGGATAGCGATTTACTTTCAGTATCCGATAACTTTGACGATTTCAGGGCCGATAAAGAAGCAAAGTTGGAGGAGTGGTTGGGAGGGTCTGGCGG
CTTGAATGATATAAGAGATTTGAGCGCAGGGAAAGGCGCTGAAGCTTTTTCTCAGCAAAACAAGCTTATTAGCATCTACAGACTACAAGTCATGGCTAGTGGGCAATACT
ATTTCCAGGTAGGCTACGGATAG
mRNA sequence
ATGTATGGATTCTCCACAGTTGATGGCTTTGTGGAGATAGCTGAAAGCTCGGCCGAGATGATCAAGTACATTGCAAATGAACCTTCAACTGGGCTTTTCTACATTCAGCA
GCATACCAAAAATGCTGTTCCCAATGTTATCAATCTGAAGAATAGTGTGGTGGACAAATCTCATGAATCAACTTTGCACACTCAAGATTCCGAGGATTCAATCACCATGT
TGAGGTCGATGAAAGAATGTGGGTTTCCTATTGCTGATGAGATGATTAGAGACATAAACAAGTCCCTTGCTATAATGGCAACCAAACAACCAAGAAGGGGCTTGATCCGT
AATACTTCTGGTATGCAGCAGCCAGGGAGAATAAGCACCTGGAGATCGGCCACTTGGGGGCGAAGCACAATTGTTGCCCCACGCGTCGACGACAGTAGTGGTGGTTATAT
TTCAACAGTGTTCAAGTCAGCTAGAGAAAAGGCGAGCAACTTTAGGTGGCCACAGCTTGACATCAAGGAAGATCTTGCACAAGTTGAAGTCGACAAGGTACAGCCACAAC
CTAACCAACCATTAGTTGCATCTGCTAGTTCTAGTTCATCACAGCCAGATATGGACACGGAGGAGTTGCCTCTGTCTAGTCAAGTTAATGATGAGTTGCAACGAGACGAC
CAGGCTGATGTCAATTTGGATAGCGATTTACTTTCAGTATCCGATAACTTTGACGATTTCAGGGCCGATAAAGAAGCAAAGTTGGAGGAGTGGTTGGGAGGGTCTGGCGG
CTTGAATGATATAAGAGATTTGAGCGCAGGGAAAGGCGCTGAAGCTTTTTCTCAGCAAAACAAGCTTATTAGCATCTACAGACTACAAGTCATGGCTAGTGGGCAATACT
ATTTCCAGGTAGGCTACGGATAG
Protein sequence
MYGFSTVDGFVEIAESSAEMIKYIANEPSTGLFYIQQHTKNAVPNVINLKNSVVDKSHESTLHTQDSEDSITMLRSMKECGFPIADEMIRDINKSLAIMATKQPRRGLIR
NTSGMQQPGRISTWRSATWGRSTIVAPRVDDSSGGYISTVFKSAREKASNFRWPQLDIKEDLAQVEVDKVQPQPNQPLVASASSSSSQPDMDTEELPLSSQVNDELQRDD
QADVNLDSDLLSVSDNFDDFRADKEAKLEEWLGGSGGLNDIRDLSAGKGAEAFSQQNKLISIYRLQVMASGQYYFQVGYG