; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CsGy6G020930 (gene) of Cucumber (Gy14) v2.1 genome

Gene IDCsGy6G020930
OrganismCucumis sativus L. var. sativus cv. Gy14 (Cucumber (Gy14) v2.1)
DescriptionUnknown protein
Genome locationGy14Chr6:21589803..21593896
RNA-Seq ExpressionCsGy6G020930
SyntenyCsGy6G020930
Gene Ontology termsGO:0005765 - lysosomal membrane (cellular component)
InterPro domainsIPR019320 - BLOC-1-related complex subunit 8


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0036499.1 uncharacterized protein E6C27_scaffold147G00680 [Cucumis melo var. makuwa]2.88e-18096.27Show/hide
Query:  MHGFSTVDGFVEIAESSAEMIKYIANEPSTGLFYIQQHTKNAVPNVINLKNSVVHKSHETTLHTEDSEDSITMLRSMKECGFPIADEMIRDIRKSLAIMS
        MHGFSTVDGFVEIAESSAEMIKYIANEPSTGLFYIQQHTKNAVPNVINLKNSVV KSHETTLHTEDSEDSI MLRSMKECGFPIADEMIRDIRKSLAIMS
Subjt:  MHGFSTVDGFVEIAESSAEMIKYIANEPSTGLFYIQQHTKNAVPNVINLKNSVVHKSHETTLHTEDSEDSITMLRSMKECGFPIADEMIRDIRKSLAIMS

Query:  TKQPRRGLIHNTSGMQQQQPGRMSTWRSTSWGRRAIGAPFNDDSGGYISTVFKSAREKASNFKWPQLDIKEDLAEVEVDKLQPQAIQPPVASTTSSSSQP
        TKQPRRGLIHNTSGMQQQQPGRMSTWRS +WGRRAIGAPFNDDSGGYISTVFKSAREKASNFKWPQLDIKEDLA+VEVDKLQPQ+IQPPVASTTSSSSQP
Subjt:  TKQPRRGLIHNTSGMQQQQPGRMSTWRSTSWGRRAIGAPFNDDSGGYISTVFKSAREKASNFKWPQLDIKEDLAEVEVDKLQPQAIQPPVASTTSSSSQP

Query:  DVDNEELPLSSQVNDQSQQDDMVDECLNTDLLSVSDNFDDFRADKEAKLEEWLGGSSGLNNLRDLRAG
        D+DNEELPLSSQVND+SQQDD VDECLNTDLLSVSDNFDDFRADKEAKLEEWLGGSSGLN+LRDLRAG
Subjt:  DVDNEELPLSSQVNDQSQQDDMVDECLNTDLLSVSDNFDDFRADKEAKLEEWLGGSSGLNNLRDLRAG

XP_008456524.1 PREDICTED: uncharacterized protein LOC103496452 isoform X1 [Cucumis melo]1.34e-18096.27Show/hide
Query:  MHGFSTVDGFVEIAESSAEMIKYIANEPSTGLFYIQQHTKNAVPNVINLKNSVVHKSHETTLHTEDSEDSITMLRSMKECGFPIADEMIRDIRKSLAIMS
        MHGFSTVDGFVEIAESSAEMIKYIANEPSTGLFYIQQHTKNAVPNVINLKNSVV KSHETTLHTEDSEDSI MLRSMKECGFPIADEMIRDIRKSLAIMS
Subjt:  MHGFSTVDGFVEIAESSAEMIKYIANEPSTGLFYIQQHTKNAVPNVINLKNSVVHKSHETTLHTEDSEDSITMLRSMKECGFPIADEMIRDIRKSLAIMS

Query:  TKQPRRGLIHNTSGMQQQQPGRMSTWRSTSWGRRAIGAPFNDDSGGYISTVFKSAREKASNFKWPQLDIKEDLAEVEVDKLQPQAIQPPVASTTSSSSQP
        TKQPRRGLIHNTSGMQQQQPGRMSTWRS +WGRRAIGAPFNDDSGGYISTVFKSAREKASNFKWPQLDIKEDLA+VEVDKLQPQ+IQPPVASTTSSSSQP
Subjt:  TKQPRRGLIHNTSGMQQQQPGRMSTWRSTSWGRRAIGAPFNDDSGGYISTVFKSAREKASNFKWPQLDIKEDLAEVEVDKLQPQAIQPPVASTTSSSSQP

Query:  DVDNEELPLSSQVNDQSQQDDMVDECLNTDLLSVSDNFDDFRADKEAKLEEWLGGSSGLNNLRDLRAG
        D+DNEELPLSSQVND+SQQDD VDECLNTDLLSVSDNFDDFRADKEAKLEEWLGGSSGLN+LRDLRAG
Subjt:  DVDNEELPLSSQVNDQSQQDDMVDECLNTDLLSVSDNFDDFRADKEAKLEEWLGGSSGLNNLRDLRAG

XP_011657506.1 uncharacterized protein LOC101218921 isoform X1 [Cucumis sativus]1.85e-187100Show/hide
Query:  MHGFSTVDGFVEIAESSAEMIKYIANEPSTGLFYIQQHTKNAVPNVINLKNSVVHKSHETTLHTEDSEDSITMLRSMKECGFPIADEMIRDIRKSLAIMS
        MHGFSTVDGFVEIAESSAEMIKYIANEPSTGLFYIQQHTKNAVPNVINLKNSVVHKSHETTLHTEDSEDSITMLRSMKECGFPIADEMIRDIRKSLAIMS
Subjt:  MHGFSTVDGFVEIAESSAEMIKYIANEPSTGLFYIQQHTKNAVPNVINLKNSVVHKSHETTLHTEDSEDSITMLRSMKECGFPIADEMIRDIRKSLAIMS

Query:  TKQPRRGLIHNTSGMQQQQPGRMSTWRSTSWGRRAIGAPFNDDSGGYISTVFKSAREKASNFKWPQLDIKEDLAEVEVDKLQPQAIQPPVASTTSSSSQP
        TKQPRRGLIHNTSGMQQQQPGRMSTWRSTSWGRRAIGAPFNDDSGGYISTVFKSAREKASNFKWPQLDIKEDLAEVEVDKLQPQAIQPPVASTTSSSSQP
Subjt:  TKQPRRGLIHNTSGMQQQQPGRMSTWRSTSWGRRAIGAPFNDDSGGYISTVFKSAREKASNFKWPQLDIKEDLAEVEVDKLQPQAIQPPVASTTSSSSQP

Query:  DVDNEELPLSSQVNDQSQQDDMVDECLNTDLLSVSDNFDDFRADKEAKLEEWLGGSSGLNNLRDLRAG
        DVDNEELPLSSQVNDQSQQDDMVDECLNTDLLSVSDNFDDFRADKEAKLEEWLGGSSGLNNLRDLRAG
Subjt:  DVDNEELPLSSQVNDQSQQDDMVDECLNTDLLSVSDNFDDFRADKEAKLEEWLGGSSGLNNLRDLRAG

XP_031743602.1 uncharacterized protein LOC101218921 isoform X2 [Cucumis sativus]1.96e-173100Show/hide
Query:  MIKYIANEPSTGLFYIQQHTKNAVPNVINLKNSVVHKSHETTLHTEDSEDSITMLRSMKECGFPIADEMIRDIRKSLAIMSTKQPRRGLIHNTSGMQQQQ
        MIKYIANEPSTGLFYIQQHTKNAVPNVINLKNSVVHKSHETTLHTEDSEDSITMLRSMKECGFPIADEMIRDIRKSLAIMSTKQPRRGLIHNTSGMQQQQ
Subjt:  MIKYIANEPSTGLFYIQQHTKNAVPNVINLKNSVVHKSHETTLHTEDSEDSITMLRSMKECGFPIADEMIRDIRKSLAIMSTKQPRRGLIHNTSGMQQQQ

Query:  PGRMSTWRSTSWGRRAIGAPFNDDSGGYISTVFKSAREKASNFKWPQLDIKEDLAEVEVDKLQPQAIQPPVASTTSSSSQPDVDNEELPLSSQVNDQSQQ
        PGRMSTWRSTSWGRRAIGAPFNDDSGGYISTVFKSAREKASNFKWPQLDIKEDLAEVEVDKLQPQAIQPPVASTTSSSSQPDVDNEELPLSSQVNDQSQQ
Subjt:  PGRMSTWRSTSWGRRAIGAPFNDDSGGYISTVFKSAREKASNFKWPQLDIKEDLAEVEVDKLQPQAIQPPVASTTSSSSQPDVDNEELPLSSQVNDQSQQ

Query:  DDMVDECLNTDLLSVSDNFDDFRADKEAKLEEWLGGSSGLNNLRDLRAG
        DDMVDECLNTDLLSVSDNFDDFRADKEAKLEEWLGGSSGLNNLRDLRAG
Subjt:  DDMVDECLNTDLLSVSDNFDDFRADKEAKLEEWLGGSSGLNNLRDLRAG

XP_038884987.1 uncharacterized protein LOC120075565 isoform X1 [Benincasa hispida]1.30e-17188.77Show/hide
Query:  MFNKPSDELPFTFERMHGFSTVDGFVEIAESSAEMIKYIANEPSTGLFYIQQHTKNAVPNVINLKNSVVHKSHETTLHTEDSEDSITMLRSMKECGFPIA
        MF+KPSDELPFTFERM+GFSTVDGFVEIAESSAEMIKYIANEPSTGLFYIQQHTKNAVPNVINLKNSVV KSHETTLHTEDSEDSITMLRSMKECGFPIA
Subjt:  MFNKPSDELPFTFERMHGFSTVDGFVEIAESSAEMIKYIANEPSTGLFYIQQHTKNAVPNVINLKNSVVHKSHETTLHTEDSEDSITMLRSMKECGFPIA

Query:  DEMIRDIRKSLAIMSTKQPRRGLIHNTSGM--QQQQPGRMSTWRSTSWGRRAIGAPFNDDSGGYISTVFKSAREKASNFKWPQLDIKEDLAEVEVDKLQP
        DEMIRDIRKSLAIMSTKQPRRGLI NTSGM  QQQQPGRMSTWRS +WGRRAI AP +DDSGGYISTVFKSAREKASNFKWPQL+I+EDLA+VEVDKLQP
Subjt:  DEMIRDIRKSLAIMSTKQPRRGLIHNTSGM--QQQQPGRMSTWRSTSWGRRAIGAPFNDDSGGYISTVFKSAREKASNFKWPQLDIKEDLAEVEVDKLQP

Query:  QAIQPPVASTTSSSSQPDVDNEELPLSSQVNDQSQQDDMVDECLNTDLLSVSDNFDDFRADKEAKLEEWLGGSSGLNNLRDLRAG
        Q IQPPVA  TSSSSQPD+D EELPLSSQVND+SQ++D V++ LNTDLL VSDNFDDFRADKEAKLEEWLG S GLN  RDLR G
Subjt:  QAIQPPVASTTSSSSQPDVDNEELPLSSQVNDQSQQDDMVDECLNTDLLSVSDNFDDFRADKEAKLEEWLGGSSGLNNLRDLRAG

TrEMBL top hitse value%identityAlignment
A0A0A0KE70 Uncharacterized protein8.95e-188100Show/hide
Query:  MHGFSTVDGFVEIAESSAEMIKYIANEPSTGLFYIQQHTKNAVPNVINLKNSVVHKSHETTLHTEDSEDSITMLRSMKECGFPIADEMIRDIRKSLAIMS
        MHGFSTVDGFVEIAESSAEMIKYIANEPSTGLFYIQQHTKNAVPNVINLKNSVVHKSHETTLHTEDSEDSITMLRSMKECGFPIADEMIRDIRKSLAIMS
Subjt:  MHGFSTVDGFVEIAESSAEMIKYIANEPSTGLFYIQQHTKNAVPNVINLKNSVVHKSHETTLHTEDSEDSITMLRSMKECGFPIADEMIRDIRKSLAIMS

Query:  TKQPRRGLIHNTSGMQQQQPGRMSTWRSTSWGRRAIGAPFNDDSGGYISTVFKSAREKASNFKWPQLDIKEDLAEVEVDKLQPQAIQPPVASTTSSSSQP
        TKQPRRGLIHNTSGMQQQQPGRMSTWRSTSWGRRAIGAPFNDDSGGYISTVFKSAREKASNFKWPQLDIKEDLAEVEVDKLQPQAIQPPVASTTSSSSQP
Subjt:  TKQPRRGLIHNTSGMQQQQPGRMSTWRSTSWGRRAIGAPFNDDSGGYISTVFKSAREKASNFKWPQLDIKEDLAEVEVDKLQPQAIQPPVASTTSSSSQP

Query:  DVDNEELPLSSQVNDQSQQDDMVDECLNTDLLSVSDNFDDFRADKEAKLEEWLGGSSGLNNLRDLRAG
        DVDNEELPLSSQVNDQSQQDDMVDECLNTDLLSVSDNFDDFRADKEAKLEEWLGGSSGLNNLRDLRAG
Subjt:  DVDNEELPLSSQVNDQSQQDDMVDECLNTDLLSVSDNFDDFRADKEAKLEEWLGGSSGLNNLRDLRAG

A0A1S3C3G0 uncharacterized protein LOC103496452 isoform X16.46e-18196.27Show/hide
Query:  MHGFSTVDGFVEIAESSAEMIKYIANEPSTGLFYIQQHTKNAVPNVINLKNSVVHKSHETTLHTEDSEDSITMLRSMKECGFPIADEMIRDIRKSLAIMS
        MHGFSTVDGFVEIAESSAEMIKYIANEPSTGLFYIQQHTKNAVPNVINLKNSVV KSHETTLHTEDSEDSI MLRSMKECGFPIADEMIRDIRKSLAIMS
Subjt:  MHGFSTVDGFVEIAESSAEMIKYIANEPSTGLFYIQQHTKNAVPNVINLKNSVVHKSHETTLHTEDSEDSITMLRSMKECGFPIADEMIRDIRKSLAIMS

Query:  TKQPRRGLIHNTSGMQQQQPGRMSTWRSTSWGRRAIGAPFNDDSGGYISTVFKSAREKASNFKWPQLDIKEDLAEVEVDKLQPQAIQPPVASTTSSSSQP
        TKQPRRGLIHNTSGMQQQQPGRMSTWRS +WGRRAIGAPFNDDSGGYISTVFKSAREKASNFKWPQLDIKEDLA+VEVDKLQPQ+IQPPVASTTSSSSQP
Subjt:  TKQPRRGLIHNTSGMQQQQPGRMSTWRSTSWGRRAIGAPFNDDSGGYISTVFKSAREKASNFKWPQLDIKEDLAEVEVDKLQPQAIQPPVASTTSSSSQP

Query:  DVDNEELPLSSQVNDQSQQDDMVDECLNTDLLSVSDNFDDFRADKEAKLEEWLGGSSGLNNLRDLRAG
        D+DNEELPLSSQVND+SQQDD VDECLNTDLLSVSDNFDDFRADKEAKLEEWLGGSSGLN+LRDLRAG
Subjt:  DVDNEELPLSSQVNDQSQQDDMVDECLNTDLLSVSDNFDDFRADKEAKLEEWLGGSSGLNNLRDLRAG

A0A1S3C453 uncharacterized protein LOC103496452 isoform X26.83e-16795.98Show/hide
Query:  MIKYIANEPSTGLFYIQQHTKNAVPNVINLKNSVVHKSHETTLHTEDSEDSITMLRSMKECGFPIADEMIRDIRKSLAIMSTKQPRRGLIHNTSGMQQQQ
        MIKYIANEPSTGLFYIQQHTKNAVPNVINLKNSVV KSHETTLHTEDSEDSI MLRSMKECGFPIADEMIRDIRKSLAIMSTKQPRRGLIHNTSGMQQQQ
Subjt:  MIKYIANEPSTGLFYIQQHTKNAVPNVINLKNSVVHKSHETTLHTEDSEDSITMLRSMKECGFPIADEMIRDIRKSLAIMSTKQPRRGLIHNTSGMQQQQ

Query:  PGRMSTWRSTSWGRRAIGAPFNDDSGGYISTVFKSAREKASNFKWPQLDIKEDLAEVEVDKLQPQAIQPPVASTTSSSSQPDVDNEELPLSSQVNDQSQQ
        PGRMSTWRS +WGRRAIGAPFNDDSGGYISTVFKSAREKASNFKWPQLDIKEDLA+VEVDKLQPQ+IQPPVASTTSSSSQPD+DNEELPLSSQVND+SQQ
Subjt:  PGRMSTWRSTSWGRRAIGAPFNDDSGGYISTVFKSAREKASNFKWPQLDIKEDLAEVEVDKLQPQAIQPPVASTTSSSSQPDVDNEELPLSSQVNDQSQQ

Query:  DDMVDECLNTDLLSVSDNFDDFRADKEAKLEEWLGGSSGLNNLRDLRAG
        DD VDECLNTDLLSVSDNFDDFRADKEAKLEEWLGGSSGLN+LRDLRAG
Subjt:  DDMVDECLNTDLLSVSDNFDDFRADKEAKLEEWLGGSSGLNNLRDLRAG

A0A5A7T575 Uncharacterized protein1.40e-18096.27Show/hide
Query:  MHGFSTVDGFVEIAESSAEMIKYIANEPSTGLFYIQQHTKNAVPNVINLKNSVVHKSHETTLHTEDSEDSITMLRSMKECGFPIADEMIRDIRKSLAIMS
        MHGFSTVDGFVEIAESSAEMIKYIANEPSTGLFYIQQHTKNAVPNVINLKNSVV KSHETTLHTEDSEDSI MLRSMKECGFPIADEMIRDIRKSLAIMS
Subjt:  MHGFSTVDGFVEIAESSAEMIKYIANEPSTGLFYIQQHTKNAVPNVINLKNSVVHKSHETTLHTEDSEDSITMLRSMKECGFPIADEMIRDIRKSLAIMS

Query:  TKQPRRGLIHNTSGMQQQQPGRMSTWRSTSWGRRAIGAPFNDDSGGYISTVFKSAREKASNFKWPQLDIKEDLAEVEVDKLQPQAIQPPVASTTSSSSQP
        TKQPRRGLIHNTSGMQQQQPGRMSTWRS +WGRRAIGAPFNDDSGGYISTVFKSAREKASNFKWPQLDIKEDLA+VEVDKLQPQ+IQPPVASTTSSSSQP
Subjt:  TKQPRRGLIHNTSGMQQQQPGRMSTWRSTSWGRRAIGAPFNDDSGGYISTVFKSAREKASNFKWPQLDIKEDLAEVEVDKLQPQAIQPPVASTTSSSSQP

Query:  DVDNEELPLSSQVNDQSQQDDMVDECLNTDLLSVSDNFDDFRADKEAKLEEWLGGSSGLNNLRDLRAG
        D+DNEELPLSSQVND+SQQDD VDECLNTDLLSVSDNFDDFRADKEAKLEEWLGGSSGLN+LRDLRAG
Subjt:  DVDNEELPLSSQVNDQSQQDDMVDECLNTDLLSVSDNFDDFRADKEAKLEEWLGGSSGLNNLRDLRAG

A0A6J1FML8 uncharacterized protein LOC111445573 isoform X13.36e-15283.77Show/hide
Query:  MHGFSTVDGFVEIAESSAEMIKYIANEPSTGLFYIQQHTKNAVPNVINLKNSVVHKSHETTLHTEDSEDSITMLRSMKECGFPIADEMIRDIRKSLAIMS
        MHGFSTVDGFVEIAESSAEMIKYIANEPSTGLFYIQQHTKNAVPNVINLKNSVV KSHETTLH EDSEDSITML+SMK+CGFPIADEMIRDI+KSLA+MS
Subjt:  MHGFSTVDGFVEIAESSAEMIKYIANEPSTGLFYIQQHTKNAVPNVINLKNSVVHKSHETTLHTEDSEDSITMLRSMKECGFPIADEMIRDIRKSLAIMS

Query:  TKQPRRGLIHNTSGMQQQQPGRMSTWRSTSWGRRAIGAPFNDDSGGYISTVFKSAREKASNFKWPQLDIKEDLAEVEVDKLQPQAIQPPVASTTSSSSQP
        TKQPRRGLI NTSG QQ  PGRMSTWRS +WGR AI AP +DDSGGYISTVFKSAREKASNFKWPQLDIKEDLA VEVD+L PQ  +PPVAS +S+SSQP
Subjt:  TKQPRRGLIHNTSGMQQQQPGRMSTWRSTSWGRRAIGAPFNDDSGGYISTVFKSAREKASNFKWPQLDIKEDLAEVEVDKLQPQAIQPPVASTTSSSSQP

Query:  DVDNEELPLSSQVNDQSQQDDMVDECLNTDLLSVSDNFDDFRADKEAKLEEWLGGSSGLNNLRDL
        D+D EELPL+ QVND+ Q+D+ VD  +NTDLLSVSDNFDDFRADKEAKLEEWLGGS GLN+++++
Subjt:  DVDNEELPLSSQVNDQSQQDDMVDECLNTDLLSVSDNFDDFRADKEAKLEEWLGGSSGLNNLRDL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G39170.1 unknown protein1.2e-5448.85Show/hide
Query:  MHGFSTVDGFVEIAESSAEMIKYIANEPSTGLFYIQQHTKNAVPNVINLKNSVVHKSHETTLHTEDSEDSITMLRSMKECGFPIADEMIRDIRKSLAIMS
        MH FSTVDGF EI ES AEMIKYIANEPS GL+YIQQH +NA PNVINL N+V+ KS ET LHTED EDSI M++SMK+CG PIADEMI DI+ SLAIMS
Subjt:  MHGFSTVDGFVEIAESSAEMIKYIANEPSTGLFYIQQHTKNAVPNVINLKNSVVHKSHETTLHTEDSEDSITMLRSMKECGFPIADEMIRDIRKSLAIMS

Query:  TKQPRRGLIHNTSGMQQQQPGRMSTW-RSTSWGRRAIG---APFNDDSGGYISTVFKSAREKASNFKWPQLDIKEDLAEVEVDKLQPQAIQPPVASTTSS
        +KQPRRG+I N++          S W RS+S   R  G   +  N +S  Y ++VF +A+EKASN KWPQLD KE  +E +                   
Subjt:  TKQPRRGLIHNTSGMQQQQPGRMSTW-RSTSWGRRAIG---APFNDDSGGYISTVFKSAREKASNFKWPQLDIKEDLAEVEVDKLQPQAIQPPVASTTSS

Query:  SSQPDVDNEELPLSSQVNDQSQQDDMVDECLNTDLLSVSDNFDDFRADKEAKLEEWLGGSSG
           P+V + EL    +  D  + + +V+          +  F++F+A KEA L+ WLG   G
Subjt:  SSQPDVDNEELPLSSQVNDQSQQDDMVDECLNTDLLSVSDNFDDFRADKEAKLEEWLGGSSG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTTAATAAACCCAGTGATGAACTGCCATTCACCTTTGAAAGGATGCATGGATTCTCTACAGTTGACGGCTTTGTGGAGATAGCTGAAAGCTCGGCAGAGATG
ATCAAGTATATAGCAAATGAACCTTCAACTGGGCTTTTCTACATTCAACAGCATACAAAAAATGCTGTTCCCAATGTTATCAATCTGAAGAATAGTGTGGTACAC
AAATCTCATGAAACAACTCTCCACACTGAAGATTCGGAGGATTCGATCACGATGTTAAGATCGATGAAAGAATGTGGTTTTCCTATTGCTGATGAGATGATCAGA
GACATAAGGAAGTCTCTTGCTATAATGTCAACCAAACAGCCAAGAAGGGGCTTGATCCATAATACTTCTGGCATGCAGCAACAACAGCCAGGGAGAATGAGCACC
TGGAGATCGACCAGTTGGGGGCGAAGAGCAATTGGAGCCCCATTTAATGATGACAGTGGAGGTTATATTTCAACAGTATTCAAGTCAGCAAGAGAAAAGGCGAGC
AACTTTAAGTGGCCACAGCTTGACATCAAGGAAGATCTTGCAGAAGTTGAAGTTGATAAACTACAGCCACAAGCTATCCAACCGCCAGTTGCTTCTACTACTTCT
AGCTCGTCACAGCCAGATGTGGACAATGAGGAGTTGCCTCTGTCCAGTCAAGTTAATGATCAGTCGCAACAGGATGACATGGTTGATGAATGCTTGAATACTGAT
TTACTTTCGGTGTCCGATAACTTCGATGATTTCAGGGCTGATAAAGAAGCGAAACTGGAGGAGTGGTTGGGAGGATCTAGCGGCTTGAATAATTTAAGAGATTTG
AGGGCAGGGTAA
mRNA sequenceShow/hide mRNA sequence
CGATAATATGCCTGAATTTATATATATTATAAACAATGCAAGAGGGAGAAAAATGGGGTGGTAGTCCTGCAAGGCAAAGCCCCACCAGATTCAATGGCAGACCTC
TGCTGACGAATATAAATTCGAATTTGTCTCTACCTGCCATCATCGCCTAACTTTTTCCATAAAAGCTTCAAGAACAAGAAGAAGATAAAACCCATAGCTGTTATC
TCTCTCAAGAAGCACTGACCGCCTTGGTTTTCTAGGGTTTCCTGTTAGTTTGGACACAAGCATCACTTCAATTTCATTTCAAACCAAGGATCAGATGAGAATTCT
GCGATGTTAAGGGTTTTTGTAATGTTAATAAATAATAAATCTTCTTCATAGGTTTATCTACTTGTGCTATTACACCTTAGGCTTTTCAAGTCAATTCTTGTTGGA
TCCTTTCATCCAAAGGTATAATTCTTTTCTTTTTTTTGTCCGACAAGAAACATTCTGTTGATAGAATGAAATAAGAATCTAAATTGTTTATAAAAGAGATGTCCA
ATTGGAAACTAAGTAAGAAAGACTGAAGTGTTTAAAAGGGTGCTTATCTTTACACCAATTGAAAGCTCTAGACAGAACCAAGTCCATAAAACTATAAAACCAAAA
AAGAGTTCCTAAAATGACGTTTTGTGCATTCTCTCTATAAACTTCAAAAGTAAGCACGAATATTCTCCAAAAAAATTGTATTCTGAGATCCTTTGAATAGGATGC
CCCACCAAATGTATAATTCTTAGTCATACTGGCTTAGAGGTCCCAAGTTTGAACCTTCGAGGTGGGGTGCAGGTGCCCTCAAGTATAGACCAGCAAAGCTCCAAG
TTCTGGTTATCGAAGAGAAAAAAAGTATAAACTCTCCAATCAAGCTGTAACAAATCACTACTTTGTCATCAATATCATAGTTTGGGGCCGAGTTAAAGACACAAG
ACATAACAACTCTTGTTGGTGTTGGGAATTTAGGAGGTGCAAGCTGGTGTGGGTAGGCTGAGTTTGCTTTTTTCCCCTTTTGTTCAAAGTCTGATGCTGAGCTGA
TATTTCAGTTGAGCAAAAATATAAAACAAACTCAAAGAAAGAATGTTGTTTTCATTTTGTTAAAAAATCATATCAACTGCCTCCCAAAAAATAAAAATGATTGAT
TCTCAGTACTGTTGGTTCCGTGCATGGTTGACCGGAGGCAGTCTTCTGAAGCAGTGTCAATGCTACTCTTAAAATACTTCCTGGTCATACGCTAAAATGAGAAGA
ATGAGAAGAATGGTTTCTGAAGGGTTATTGCCACTACATAATATATGAAGCTTCCAATTAGGAATGGACACCTCTACTGAAAACCAATAGCAATAACTGATTCAA
GGGCCATGGATCCAAATTATAAAATGCCGCAAAAGAGCCTAGAAATTGTTCAAATTTGAGTTGGAAGAAGGGATAAGGTCCTTCTCTGTAAGAACAAAGGGCTTC
TAAATACTATGGTCAAATTGGTAATCCCCAGGCTTTTAATCACATCTGTAATTTGTGTACTGGCCACAGTTAAGCAGATAACTCATGCTGTTTTGGAACCTTGTG
TATGTATGCACGGGTACACAGTGCTCTTGGACTTAATATTGAAAGTTGAAAGACAAAATTACTAGAGTCCATTTGATAACAGTCACAACCATGTTTTTCATATTT
CGTAACTAAACCCTTGGATTTTAAGTTAAATTTCAAAAACAAATACGGGTTTCTAAAAGTATGGCTTTTGAAACACTCTTCAAGAGTAGATTGCAAATTAAAGAA
TTTAATAGGGGAAAGTATTGTTTATAGCTTAATTTTTAAATACAAAAAGCCAAATGGTTATCGTACAGGCCCTTGGGTTTTTTATTTTTGTGAAAATTAAGCTTA
TAAACACTACCCAAGAGTAAATTCAAGGAAAGTTTTGAAAACTAAGAATTTAAGATTTTAAAAACATGATTTTGTTCTTGAAATTTGGCTAAGAATTCAAATGTT
TACTAGAAAAAATGAAAATCATGGTAAAGTAGTCGTGGAAAAACAAGTATGATATTTTAAAAAAAAACAATGGGACCTAGGTGTTTGATATGACTCATGGGTTGG
AGTTGGACAACATCCTCAATGAGATTCATCTTCATCTTTATATGCCTTGGCATATGAAATTTGAATGTTGGTACTGCGTATAGAAATCGAGGTGACATGGATTCA
TTTTCTTTGCTTCTAGTAATCAGAAGAAAGTGAGGTGTAAAACTATGGGCATGACACTAGATTAGAGAAATGCTCTTTTGGATTAATATTTACAAAAGACTTCCC
CAGTTAGATGTTTTACATAGTCTATTAGATTTTTTTTATTTTTCTTTTATGTGAACATGGTTATTGGTTGTAGACCCCTGCCAATGCCAAAACCAAAAGTAAGCA
AGAAGGGCAAATATGGGCGACAGAGAAGATAAGAAGGCTGCCTTTTTCTATATAAATTTTGGACTTCACAAAATTTCCTGTTACATTGCGTTTCTTATTCTACTT
TTAATTTTTATTTCCTAGTGTTGTTATGGCATTGCATTGCTGTTAATTTGTTTTGTTAACTGCATGAACAAATAACACTGTCAGTTATTCGGCAATTTGTTGTCT
GTTTTCCATTTTTTTACCCTTTCATTTCCATTAGCAATTACAGTTGTTATGTTTAATAAACCCAGTGATGAACTGCCATTCACCTTTGAAAGGATGCATGGATTC
TCTACAGTTGACGGCTTTGTGGAGATAGCTGAAAGCTCGGCAGAGATGATCAAGTATATAGCAAATGAACCTTCAACTGGGCTTTTCTACATTCAACAGCATACA
AAAAATGCTGTTCCCAATGTTATCAATCTGAAGAATAGTGTGGTACACAAATCTCATGAAACAACTCTCCACACTGAAGATTCGGAGGATTCGATCACGATGTTA
AGATCGATGAAAGAATGTGGTTTTCCTATTGCTGATGAGATGATCAGAGACATAAGGAAGTCTCTTGCTATAATGTCAACCAAACAGCCAAGAAGGGGCTTGATC
CATAATACTTCTGGCATGCAGCAACAACAGCCAGGGAGAATGAGCACCTGGAGATCGACCAGTTGGGGGCGAAGAGCAATTGGAGCCCCATTTAATGATGACAGT
GGAGGTTATATTTCAACAGTATTCAAGTCAGCAAGAGAAAAGGCGAGCAACTTTAAGTGGCCACAGCTTGACATCAAGGAAGATCTTGCAGAAGTTGAAGTTGAT
AAACTACAGCCACAAGCTATCCAACCGCCAGTTGCTTCTACTACTTCTAGCTCGTCACAGCCAGATGTGGACAATGAGGAGTTGCCTCTGTCCAGTCAAGTTAAT
GATCAGTCGCAACAGGATGACATGGTTGATGAATGCTTGAATACTGATTTACTTTCGGTGTCCGATAACTTCGATGATTTCAGGGCTGATAAAGAAGCGAAACTG
GAGGAGTGGTTGGGAGGATCTAGCGGCTTGAATAATTTAAGAGATTTGAGGGCAGGGTAAGGCCATTGACATTGATTTTGAAGTAGTGTTGATGCCTTTTACAGT
GCCTTTTTTTACTGTAAAAATGATAATAACAAAAATCACTCTAGACACACGTTCTTTTCTTCATTTTGTCCGACCCCGAATTGCACTAGTGAAGAGTTGTATTTC
CATAGCTTTGTAACATGATGTTATTCTTACTGGAATAAGTATTTACTGAGTGCCATATGTTCAATTTTACCTTTCACATTTGAAAACTGATAAAGGATTTCAATA
CAATTGTACCTAGAAGGTACTGGACTGTCTTGGAAATTAAACTGCTTGGTTTAAGGATTGGC
Protein sequenceShow/hide protein sequence
MFNKPSDELPFTFERMHGFSTVDGFVEIAESSAEMIKYIANEPSTGLFYIQQHTKNAVPNVINLKNSVVHKSHETTLHTEDSEDSITMLRSMKECGFPIADEMIR
DIRKSLAIMSTKQPRRGLIHNTSGMQQQQPGRMSTWRSTSWGRRAIGAPFNDDSGGYISTVFKSAREKASNFKWPQLDIKEDLAEVEVDKLQPQAIQPPVASTTS
SSSQPDVDNEELPLSSQVNDQSQQDDMVDECLNTDLLSVSDNFDDFRADKEAKLEEWLGGSSGLNNLRDLRAG