; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0000478 (gene) of Chayote v1 genome

Gene IDSed0000478
OrganismSechium edule (Chayote v1)
DescriptionUnknown protein
Genome locationLG06:17061939..17076502
RNA-Seq ExpressionSed0000478
SyntenySed0000478
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7024718.1 hypothetical protein SDJN02_13536, partial [Cucurbita argyrosperma subsp. argyrosperma]2.3e-10465.62Show/hide
Query:  MALLSEFQNTTIFVVTRPFLFFMRACSFILKTD--------RTPKGSLSRQYPDL--------------------------LRQNLQFLEIEFENLLWER
        MALL EF N TI  +T PF FF   CS ILKT            K S+S     L                          L++NLQFL IEF+N+LWER
Subjt:  MALLSEFQNTTIFVVTRPFLFFMRACSFILKTD--------RTPKGSLSRQYPDL--------------------------LRQNLQFLEIEFENLLWER

Query:  KELRKYFQVAMKEQKVMELMLDKLEMIHEKATNKIALLESELQNLRNENLRLEEIKGKTYWSLKGLDHKSEAQNVGRVDSSITFGI---SSSFSGSSIVQ
        KEL+K FQ AMKEQK+MELMLD+LEMIHEKATNKIALLESE+Q LRNENLRL+EIKGK YWSLKGLD KSEAQ  GRV S IT+GI   SSS+S SS+VQ
Subjt:  KELRKYFQVAMKEQKVMELMLDKLEMIHEKATNKIALLESELQNLRNENLRLEEIKGKTYWSLKGLDHKSEAQNVGRVDSSITFGI---SSSFSGSSIVQ

Query:  DLFQSDAWKDGSISKENLIKILESGLKSGLLVCSRTSEIPSKDEDATEILAKQREVAISKSLFSTVLSLLVGVIIWTAEEPHLCLVMALLFVVCISLKSV
        DL +SDA KDG++SKE LI ILESG +SG+L+ + TS+I S+DED TEIL +QREVA+ +SLFST+LSLLVGVIIW AEEPHLCLV+AL+FVV ISLKSV
Subjt:  DLFQSDAWKDGSISKENLIKILESGLKSGLLVCSRTSEIPSKDEDATEILAKQREVAISKSLFSTVLSLLVGVIIWTAEEPHLCLVMALLFVVCISLKSV

Query:  VEFFTIIKNKPALDVVALLSFNWFVFGVLAYPTLSSIARLLAPLASRFV
        VEFFT IKNKPALD VALLSFNWFV G+LAYPTL ++AR+LAPLASR V
Subjt:  VEFFTIIKNKPALDVVALLSFNWFVFGVLAYPTLSSIARLLAPLASRFV

XP_031740628.1 uncharacterized protein LOC101204571 isoform X1 [Cucumis sativus]3.2e-10665.83Show/hide
Query:  MALLSEFQNTTIFVVTRPFLFFMRACSFILKTD--------RTPKGSLS--------------------------RQYPDLLRQNLQFLEIEFENLLWER
        MA LSEF NTTI +VTRPF FFMR CSFILKT            K S+S                           +    L+Q LQFLEI+F+N+LWER
Subjt:  MALLSEFQNTTIFVVTRPFLFFMRACSFILKTD--------RTPKGSLS--------------------------RQYPDLLRQNLQFLEIEFENLLWER

Query:  KELRKYFQVAMKEQKVMELMLDKLEMIHEKATNKIALLESELQNLRNENLRLEEIKGKTYWSLKGLDHKSEAQNVGRVDSSITFGI---SSSFSGSSIVQ
        KEL+K FQ AMKE K+MELMLD+LEMIHEKATNKIALLESE+Q LRN+NLRL+EIKGK YWSLKGLD KSEAQ  GRVD  IT+GI   SS  S SSIVQ
Subjt:  KELRKYFQVAMKEQKVMELMLDKLEMIHEKATNKIALLESELQNLRNENLRLEEIKGKTYWSLKGLDHKSEAQNVGRVDSSITFGI---SSSFSGSSIVQ

Query:  DLFQSDAWKDGSISKENLIKILESGLKSGLLVCSRTSEIPSKDEDATEILAKQREVAISKSLFSTVLSLLVGVIIWTAEEPHLCLVMALLFVVCISLKSV
        DL Q DA KD SISKE LIKILESGLKSG+L+ S T EI SKDE  T++L +QREVA+S+SLFST+LSLLVGVIIW AEEPHLCLV+AL+FVV ISLKSV
Subjt:  DLFQSDAWKDGSISKENLIKILESGLKSGLLVCSRTSEIPSKDEDATEILAKQREVAISKSLFSTVLSLLVGVIIWTAEEPHLCLVMALLFVVCISLKSV

Query:  VEFFTIIKNKPALDVVALLSFNWFVFGVLAYPTLSSIARLLAPLASRFVGQTVEGFGFSI
        VEFFT IKNKPALD VALLSFNWFV G+LAYPTL +I+R LA   +    + VE FGFSI
Subjt:  VEFFTIIKNKPALDVVALLSFNWFVFGVLAYPTLSSIARLLAPLASRFVGQTVEGFGFSI

XP_038898361.1 uncharacterized protein LOC120086031 isoform X1 [Benincasa hispida]9.5e-11168.61Show/hide
Query:  MALLSEFQNTTIFVVTRPFLFFMRACSFILKTD--------RTPKGSLS--------------------------RQYPDLLRQNLQFLEIEFENLLWER
        MA LSEF NTTIF+VTRPF FFMR CSFILKT            K S+S                           Q    LRQ LQFLEIEF N+LWER
Subjt:  MALLSEFQNTTIFVVTRPFLFFMRACSFILKTD--------RTPKGSLS--------------------------RQYPDLLRQNLQFLEIEFENLLWER

Query:  KELRKYFQVAMKEQKVMELMLDKLEMIHEKATNKIALLESELQNLRNENLRLEEIKGKTYWSLKGLDHKSEAQNVGRVDSSITFGI---SSSFSGSSIVQ
        KEL+K FQ AM+E K+MELMLD+LEMIHEKATNKI+LLESE+Q LRNENLRL+EIKGK YWSLKGLD KSEAQ  GRVDS IT GI   SSS+  SSI+Q
Subjt:  KELRKYFQVAMKEQKVMELMLDKLEMIHEKATNKIALLESELQNLRNENLRLEEIKGKTYWSLKGLDHKSEAQNVGRVDSSITFGI---SSSFSGSSIVQ

Query:  DLFQSDAWKDGSISKENLIKILESGLKSGLLVCSRTSEIPSKDEDATEILAKQREVAISKSLFSTVLSLLVGVIIWTAEEPHLCLVMALLFVVCISLKSV
        DLFQSDA KDGSISKE LIKIL+SGLKSG+ + S T EI SKDED TEIL +QREVAIS+SLFST+LSLLVGVIIW AEEPHLCLV+AL+FVV ISLKSV
Subjt:  DLFQSDAWKDGSISKENLIKILESGLKSGLLVCSRTSEIPSKDEDATEILAKQREVAISKSLFSTVLSLLVGVIIWTAEEPHLCLVMALLFVVCISLKSV

Query:  VEFFTIIKNKPALDVVALLSFNWFVFGVLAYPTLSSIARLLAPLASRFVGQTVEGFGFSI
        VEFFT IKNKPALD V+LLSFNWFV G+LAYPTL  IARLLAP   R     VE F FSI
Subjt:  VEFFTIIKNKPALDVVALLSFNWFVFGVLAYPTLSSIARLLAPLASRFVGQTVEGFGFSI

XP_038898363.1 uncharacterized protein LOC120086031 isoform X2 [Benincasa hispida]1.8e-10177.42Show/hide
Query:  LRQNLQFLEIEFENLLWERKELRKYFQVAMKEQKVMELMLDKLEMIHEKATNKIALLESELQNLRNENLRLEEIKGKTYWSLKGLDHKSEAQNVGRVDSS
        LRQ LQFLEIEF N+LWERKEL+K FQ AM+E K+MELMLD+LEMIHEKATNKI+LLESE+Q LRNENLRL+EIKGK YWSLKGLD KSEAQ  GRVDS 
Subjt:  LRQNLQFLEIEFENLLWERKELRKYFQVAMKEQKVMELMLDKLEMIHEKATNKIALLESELQNLRNENLRLEEIKGKTYWSLKGLDHKSEAQNVGRVDSS

Query:  ITFGI---SSSFSGSSIVQDLFQSDAWKDGSISKENLIKILESGLKSGLLVCSRTSEIPSKDEDATEILAKQREVAISKSLFSTVLSLLVGVIIWTAEEP
        IT GI   SSS+  SSI+QDLFQSDA KDGSISKE LIKIL+SGLKSG+ + S T EI SKDED TEIL +QREVAIS+SLFST+LSLLVGVIIW AEEP
Subjt:  ITFGI---SSSFSGSSIVQDLFQSDAWKDGSISKENLIKILESGLKSGLLVCSRTSEIPSKDEDATEILAKQREVAISKSLFSTVLSLLVGVIIWTAEEP

Query:  HLCLVMALLFVVCISLKSVVEFFTIIKNKPALDVVALLSFNWFVFGVLAYPTLSSIARLLAPLASRFVGQTVEGFGFSI
        HLCLV+AL+FVV ISLKSVVEFFT IKNKPALD V+LLSFNWFV G+LAYPTL  IARLLAP   R     VE F FSI
Subjt:  HLCLVMALLFVVCISLKSVVEFFTIIKNKPALDVVALLSFNWFVFGVLAYPTLSSIARLLAPLASRFVGQTVEGFGFSI

XP_038898364.1 uncharacterized protein LOC120086031 isoform X3 [Benincasa hispida]1.8e-10177.42Show/hide
Query:  LRQNLQFLEIEFENLLWERKELRKYFQVAMKEQKVMELMLDKLEMIHEKATNKIALLESELQNLRNENLRLEEIKGKTYWSLKGLDHKSEAQNVGRVDSS
        LRQ LQFLEIEF N+LWERKEL+K FQ AM+E K+MELMLD+LEMIHEKATNKI+LLESE+Q LRNENLRL+EIKGK YWSLKGLD KSEAQ  GRVDS 
Subjt:  LRQNLQFLEIEFENLLWERKELRKYFQVAMKEQKVMELMLDKLEMIHEKATNKIALLESELQNLRNENLRLEEIKGKTYWSLKGLDHKSEAQNVGRVDSS

Query:  ITFGI---SSSFSGSSIVQDLFQSDAWKDGSISKENLIKILESGLKSGLLVCSRTSEIPSKDEDATEILAKQREVAISKSLFSTVLSLLVGVIIWTAEEP
        IT GI   SSS+  SSI+QDLFQSDA KDGSISKE LIKIL+SGLKSG+ + S T EI SKDED TEIL +QREVAIS+SLFST+LSLLVGVIIW AEEP
Subjt:  ITFGI---SSSFSGSSIVQDLFQSDAWKDGSISKENLIKILESGLKSGLLVCSRTSEIPSKDEDATEILAKQREVAISKSLFSTVLSLLVGVIIWTAEEP

Query:  HLCLVMALLFVVCISLKSVVEFFTIIKNKPALDVVALLSFNWFVFGVLAYPTLSSIARLLAPLASRFVGQTVEGFGFSI
        HLCLV+AL+FVV ISLKSVVEFFT IKNKPALD V+LLSFNWFV G+LAYPTL  IARLLAP   R     VE F FSI
Subjt:  HLCLVMALLFVVCISLKSVVEFFTIIKNKPALDVVALLSFNWFVFGVLAYPTLSSIARLLAPLASRFVGQTVEGFGFSI

TrEMBL top hitse value%identityAlignment
A0A0A0KWK5 Uncharacterized protein2.6e-9874.55Show/hide
Query:  LRQNLQFLEIEFENLLWERKELRKYFQVAMKEQKVMELMLDKLEMIHEKATNKIALLESELQNLRNENLRLEEIKGKTYWSLKGLDHKSEAQNVGRVDSS
        L+Q LQFLEI+F+N+LWERKEL+K FQ AMKE K+MELMLD+LEMIHEKATNKIALLESE+Q LRN+NLRL+EIKGK YWSLKGLD KSEAQ  GRVD  
Subjt:  LRQNLQFLEIEFENLLWERKELRKYFQVAMKEQKVMELMLDKLEMIHEKATNKIALLESELQNLRNENLRLEEIKGKTYWSLKGLDHKSEAQNVGRVDSS

Query:  ITFGI---SSSFSGSSIVQDLFQSDAWKDGSISKENLIKILESGLKSGLLVCSRTSEIPSKDEDATEILAKQREVAISKSLFSTVLSLLVGVIIWTAEEP
        IT+GI   SS  S SSIVQDL Q DA KD SISKE LIKILESGLKSG+L+ S T EI SKDE  T++L +QREVA+S+SLFST+LSLLVGVIIW AEEP
Subjt:  ITFGI---SSSFSGSSIVQDLFQSDAWKDGSISKENLIKILESGLKSGLLVCSRTSEIPSKDEDATEILAKQREVAISKSLFSTVLSLLVGVIIWTAEEP

Query:  HLCLVMALLFVVCISLKSVVEFFTIIKNKPALDVVALLSFNWFVFGVLAYPTLSSIARLLAPLASRFVGQTVEGFGFSI
        HLCLV+AL+FVV ISLKSVVEFFT IKNKPALD VALLSFNWFV G+LAYPTL +I+R LA   +    + VE FGFSI
Subjt:  HLCLVMALLFVVCISLKSVVEFFTIIKNKPALDVVALLSFNWFVFGVLAYPTLSSIARLLAPLASRFVGQTVEGFGFSI

A0A1S3B9G5 uncharacterized protein LOC1034876339.0e-9976.26Show/hide
Query:  LRQNLQFLEIEFENLLWERKELRKYFQVAMKEQKVMELMLDKLEMIHEKATNKIALLESELQNLRNENLRLEEIKGKTYWSLKGLDHKSEAQNVGRVDSS
        L+Q LQFLEIEF+N+L ERKEL+K FQ A+KE K+MELMLD+LEMIHEKATNKIALLESE+Q LRNENLRL+EIKGK YWSLKGLD KSE Q  GRVD  
Subjt:  LRQNLQFLEIEFENLLWERKELRKYFQVAMKEQKVMELMLDKLEMIHEKATNKIALLESELQNLRNENLRLEEIKGKTYWSLKGLDHKSEAQNVGRVDSS

Query:  ITFGI---SSSFSGSSIVQDLFQSDAWKDGSISKENLIKILESGLKSGLLVCSRTSEIPSKDEDATEILAKQREVAISKSLFSTVLSLLVGVIIWTAEEP
        IT+GI   SSS+S SS+VQDL Q DA KDGSISKE L+KILESGLKSG+L+ S T EI SKDE  TE+L +QREVAIS+SLFS +LSLLVGVIIW AEEP
Subjt:  ITFGI---SSSFSGSSIVQDLFQSDAWKDGSISKENLIKILESGLKSGLLVCSRTSEIPSKDEDATEILAKQREVAISKSLFSTVLSLLVGVIIWTAEEP

Query:  HLCLVMALLFVVCISLKSVVEFFTIIKNKPALDVVALLSFNWFVFGVLAYPTLSSIARLLAPLASRFVGQTVEGFGFS
        HLCLV+AL+FVV ISLKSVVEFFT IKNKPALD VALLSFNWFV G+LAYPTL +IAR LAPLASR     VE  GFS
Subjt:  HLCLVMALLFVVCISLKSVVEFFTIIKNKPALDVVALLSFNWFVFGVLAYPTLSSIARLLAPLASRFVGQTVEGFGFS

A0A6J1C7X7 uncharacterized protein LOC111009200 isoform X14.8e-10064.67Show/hide
Query:  MALLSEFQNTTIFVVTRPFLFFMRACSFILKTD--------RTPKGSL--------------------------SRQYPDLLRQNLQFLEIEFENLLWER
        MALLSEF N TI V TRPF +FM ACS ILK             K S+                          + Q    LRQ+LQFLEIE +N+LWE 
Subjt:  MALLSEFQNTTIFVVTRPFLFFMRACSFILKTD--------RTPKGSL--------------------------SRQYPDLLRQNLQFLEIEFENLLWER

Query:  KELRKYFQVAMKEQKVMELMLDKLEMIHEKATNKIALLESELQNLRNENLRLEEIKGKTYWSLKGLDHKSEAQNVGRVDSS-ITFGI---SSSFSGSSIV
        KEL+K+FQ AMKEQK+MELMLD+LEMIHEKATNKIALLESE+QNLRNE LR +EIKGK YWSLKG      AQ  GRVD++ I+ GI   SSS+SGSS++
Subjt:  KELRKYFQVAMKEQKVMELMLDKLEMIHEKATNKIALLESELQNLRNENLRLEEIKGKTYWSLKGLDHKSEAQNVGRVDSS-ITFGI---SSSFSGSSIV

Query:  QDLFQSDAWKDGSISKENLIKILESGLKSGLLVCSRTSEIPSKDEDATEILAKQREVAISKSLFSTVLSLLVGVIIWTAEEPHLCLVMALLFVVCISLKS
        QDL QSDAWKDG+IS   LIKILESGLKS +++   TSEI SKDED  EIL KQREVA+S+SLFST+LSLLVGV+IW AEE HLCL++ALL VV ISLKS
Subjt:  QDLFQSDAWKDGSISKENLIKILESGLKSGLLVCSRTSEIPSKDEDATEILAKQREVAISKSLFSTVLSLLVGVIIWTAEEPHLCLVMALLFVVCISLKS

Query:  VVEFFTIIKNKPALDVVALLSFNWFVFGVLAYPTLSSIARLLAPLASRFVG
        VVEFFT IKNKPALD VALLS N FV G+LAYPTL +IA LLAPLASRFVG
Subjt:  VVEFFTIIKNKPALDVVALLSFNWFVFGVLAYPTLSSIARLLAPLASRFVG

A0A6J1F6D4 uncharacterized protein LOC111442593 isoform X22.6e-9060.46Show/hide
Query:  MALLSEFQNTTIFVVTRPFLFFMRACSFILKTD--------RTPKGSLSRQYPDL--------------------------LRQNLQFLEIEFENLLWER
        MALL EF N TI  +T PF FF+  CS ILKT            K S+S     L                          L++NLQFL IEF+N+LWER
Subjt:  MALLSEFQNTTIFVVTRPFLFFMRACSFILKTD--------RTPKGSLSRQYPDL--------------------------LRQNLQFLEIEFENLLWER

Query:  KELRKYFQVAMKEQKVMELMLDKLEMIHEKATNKIALLESELQNLRNENLRLEEIKGKTYWSLKGLDHKSEAQNVGRVDSSITFGI---SSSFSGSSIVQ
        KEL+K FQ AMKEQK+MELMLD+LEMIHEKATNKIALLESE+Q LRNENLRL+EIKGK YWSLKGLD KSEAQ  GRV S IT+GI   SSS+S SS+VQ
Subjt:  KELRKYFQVAMKEQKVMELMLDKLEMIHEKATNKIALLESELQNLRNENLRLEEIKGKTYWSLKGLDHKSEAQNVGRVDSSITFGI---SSSFSGSSIVQ

Query:  DLFQSDAWKDGSISKENLIKILESGLKSGLLVCSRTSEIPSKDEDATEILAKQREVAISKSLFSTVLSLLVGVIIWTAEEPHLCLVMALLFVVCISLKSV
        DL +SDA KD                               +DED TEIL +QREVA+ +SLFST+LSLLVGVIIW AEEPHLCLV+AL+FVV ISLKSV
Subjt:  DLFQSDAWKDGSISKENLIKILESGLKSGLLVCSRTSEIPSKDEDATEILAKQREVAISKSLFSTVLSLLVGVIIWTAEEPHLCLVMALLFVVCISLKSV

Query:  VEFFTIIKNKPALDVVALLSFNWFVFGVLAYPTLSSIARLLAPLASRFV
        VEFFT IKNKPALD VALLSFNWFV G+LAYPTL ++AR+LAPLASR V
Subjt:  VEFFTIIKNKPALDVVALLSFNWFVFGVLAYPTLSSIARLLAPLASRFV

A0A6J1FLE1 uncharacterized protein LOC1114468148.2e-10075.37Show/hide
Query:  LRQNLQFLEIEFENLLWERKELRKYFQVAMKEQKVMELMLDKLEMIHEKATNKIALLESELQNLRNENLRLEEIKGKTYWSLKGLDHKSEAQNVGRVDSS
        L+QNLQFLEIEFEN+LWERKE +K+FQ AMKEQKV+ELMLD+LEMIHEKAT+KI+ LESEL  LRNENLRL+EIKGKTYWSLKGLD+K EAQN GR+DS 
Subjt:  LRQNLQFLEIEFENLLWERKELRKYFQVAMKEQKVMELMLDKLEMIHEKATNKIALLESELQNLRNENLRLEEIKGKTYWSLKGLDHKSEAQNVGRVDSS

Query:  ITFGISSS---FSGSSIVQDLFQSDAWKDGSISKENLIKILESGLKSGLLVCSRTSEIPSKDEDATEILAKQREVAISKSLFSTVLSLLVGVIIWTAEEP
        ITFGISS    +SGSSIVQ LFQSD WKD +I K  LI++LESGLKSGL   S        D+D  E L +QRE+AIS+SLFST+LSLLVG+IIW AEE 
Subjt:  ITFGISSS---FSGSSIVQDLFQSDAWKDGSISKENLIKILESGLKSGLLVCSRTSEIPSKDEDATEILAKQREVAISKSLFSTVLSLLVGVIIWTAEEP

Query:  HLCLVMALLFVVCISLKSVVEFFTIIKNKPALDVVALLSFNWFVFGVLAYPTLSSIARLLAPLASRFVGQTV
        HLCLVMALLFVV ISLKSVVEFFT IKNKPALD VALLSFNWF+ G+LAYP L +IARLL PL SRFVGQTV
Subjt:  HLCLVMALLFVVCISLKSVVEFFTIIKNKPALDVVALLSFNWFVFGVLAYPTLSSIARLLAPLASRFVGQTV

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT5G45310.1 unknown protein5.0e-4135.33Show/hide
Query:  LLSEFQNTTIFVVTRPFLFFMRACSFILKTD-----------------------RTPKGSL---------------SRQYPDLLRQNLQFLEIEFENLLW
        L+S   +++++++TRPF F + AC F L+T                        R  +GS+               S     LL Q++  L  E E+L W
Subjt:  LLSEFQNTTIFVVTRPFLFFMRACSFILKTD-----------------------RTPKGSL---------------SRQYPDLLRQNLQFLEIEFENLLW

Query:  ERKELRKYFQVAMKEQKVMELMLDKLEMIHEKATNKIALLESELQNLRNENLRLEEIKGKTYWSLKGLDHKSEAQNVGRVDSSITFGISSSFSGSSIVQD
         RKE+ K  + A+KE ++ME  LD+LE  H++A +KI  LE+ELQ L+ ENL+L E+ GK Y S KG    SE  +  R                     
Subjt:  ERKELRKYFQVAMKEQKVMELMLDKLEMIHEKATNKIALLESELQNLRNENLRLEEIKGKTYWSLKGLDHKSEAQNVGRVDSSITFGISSSFSGSSIVQD

Query:  LFQSDAWKDGSISKENLIKILESGLKSGLLVCSRTSEIPSKDEDATEILAKQREVAISKSLFSTVLSLLVGVIIWTAEEPHLC--LVMALLFVVCISLKS
               K  +I   +  K   + +KS L   ++ S IP  +E    +L  ++ +A+S+S+FS +L+L+VG++++ A+E  LC  L+ AL  VV ISLKS
Subjt:  LFQSDAWKDGSISKENLIKILESGLKSGLLVCSRTSEIPSKDEDATEILAKQREVAISKSLFSTVLSLLVGVIIWTAEEPHLC--LVMALLFVVCISLKS

Query:  VVEFFTIIKNKPALDVVALLSFNWFVFGVLAYPTLSSIARLLAPLASRFVG
        VV+FF+ +KNKPALD VAL+S NWF+ G L YPTL  +AR++ P     VG
Subjt:  VVEFFTIIKNKPALDVVALLSFNWFVFGVLAYPTLSSIARLLAPLASRFVG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCCTTGCTTTCTGAGTTTCAGAATACCACAATATTCGTGGTCACGAGGCCTTTCTTGTTTTTCATGCGTGCGTGTTCATTTATCTTGAAAACTGATAGAACCCCGAA
AGGAAGTCTATCGCGTCAGTATCCTGATCTTTTACGACAAAATTTGCAATTTCTGGAAATTGAATTCGAAAATCTTTTGTGGGAAAGAAAGGAGCTTCGAAAATATTTCC
AGGTTGCTATGAAAGAGCAGAAGGTGATGGAATTGATGTTGGACAAACTTGAAATGATACATGAAAAGGCGACCAACAAGATTGCACTCTTAGAAAGTGAGCTGCAGAAT
TTGAGAAATGAAAATCTTCGACTGGAAGAAATCAAGGGTAAGACATATTGGAGCTTAAAAGGTCTTGATCACAAAAGTGAAGCACAAAATGTTGGCAGAGTTGACAGCAG
CATTACCTTTGGTATCTCATCCAGCTTTAGTGGCAGCAGCATTGTTCAAGACCTCTTTCAAAGCGATGCTTGGAAAGACGGTAGTATATCTAAAGAAAATTTGATCAAAA
TTTTAGAATCCGGGTTAAAATCGGGTCTGCTCGTATGCTCTCGTACTTCTGAAATCCCATCAAAAGATGAAGATGCCACTGAAATTCTTGCTAAACAAAGAGAGGTTGCA
ATTTCAAAAAGTCTATTTAGTACTGTATTGTCACTTTTGGTTGGAGTGATTATATGGACAGCTGAAGAGCCACATTTGTGCCTCGTAATGGCTCTCTTGTTTGTGGTTTG
CATCTCATTGAAGAGCGTCGTTGAGTTTTTCACGATTATTAAGAACAAACCTGCTTTGGATGTTGTGGCTCTCTTGAGCTTCAACTGGTTTGTATTCGGAGTTCTTGCTT
ACCCAACGCTGTCAAGCATTGCTCGTTTGCTTGCTCCTCTGGCCTCGAGGTTTGTCGGACAAACAGTAGAAGGGTTTGGTTTTTCCATCTGA
mRNA sequenceShow/hide mRNA sequence
ATGGCCTTGCTTTCTGAGTTTCAGAATACCACAATATTCGTGGTCACGAGGCCTTTCTTGTTTTTCATGCGTGCGTGTTCATTTATCTTGAAAACTGATAGAACCCCGAA
AGGAAGTCTATCGCGTCAGTATCCTGATCTTTTACGACAAAATTTGCAATTTCTGGAAATTGAATTCGAAAATCTTTTGTGGGAAAGAAAGGAGCTTCGAAAATATTTCC
AGGTTGCTATGAAAGAGCAGAAGGTGATGGAATTGATGTTGGACAAACTTGAAATGATACATGAAAAGGCGACCAACAAGATTGCACTCTTAGAAAGTGAGCTGCAGAAT
TTGAGAAATGAAAATCTTCGACTGGAAGAAATCAAGGGTAAGACATATTGGAGCTTAAAAGGTCTTGATCACAAAAGTGAAGCACAAAATGTTGGCAGAGTTGACAGCAG
CATTACCTTTGGTATCTCATCCAGCTTTAGTGGCAGCAGCATTGTTCAAGACCTCTTTCAAAGCGATGCTTGGAAAGACGGTAGTATATCTAAAGAAAATTTGATCAAAA
TTTTAGAATCCGGGTTAAAATCGGGTCTGCTCGTATGCTCTCGTACTTCTGAAATCCCATCAAAAGATGAAGATGCCACTGAAATTCTTGCTAAACAAAGAGAGGTTGCA
ATTTCAAAAAGTCTATTTAGTACTGTATTGTCACTTTTGGTTGGAGTGATTATATGGACAGCTGAAGAGCCACATTTGTGCCTCGTAATGGCTCTCTTGTTTGTGGTTTG
CATCTCATTGAAGAGCGTCGTTGAGTTTTTCACGATTATTAAGAACAAACCTGCTTTGGATGTTGTGGCTCTCTTGAGCTTCAACTGGTTTGTATTCGGAGTTCTTGCTT
ACCCAACGCTGTCAAGCATTGCTCGTTTGCTTGCTCCTCTGGCCTCGAGGTTTGTCGGACAAACAGTAGAAGGGTTTGGTTTTTCCATCTGA
Protein sequenceShow/hide protein sequence
MALLSEFQNTTIFVVTRPFLFFMRACSFILKTDRTPKGSLSRQYPDLLRQNLQFLEIEFENLLWERKELRKYFQVAMKEQKVMELMLDKLEMIHEKATNKIALLESELQN
LRNENLRLEEIKGKTYWSLKGLDHKSEAQNVGRVDSSITFGISSSFSGSSIVQDLFQSDAWKDGSISKENLIKILESGLKSGLLVCSRTSEIPSKDEDATEILAKQREVA
ISKSLFSTVLSLLVGVIIWTAEEPHLCLVMALLFVVCISLKSVVEFFTIIKNKPALDVVALLSFNWFVFGVLAYPTLSSIARLLAPLASRFVGQTVEGFGFSI