; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0008145 (gene) of Chayote v1 genome

Gene IDSed0008145
OrganismSechium edule (Chayote v1)
DescriptionUnknown protein
Genome locationLG07:5528726..5532191
RNA-Seq ExpressionSed0008145
SyntenySed0008145
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6593372.1 hypothetical protein SDJN03_12848, partial [Cucurbita argyrosperma subsp. sororia]6.2e-7669.72Show/hide
Query:  ADDHWVHVALSDDSLVVDLLLCLNRPP---------PLHWSLRQPRSRPTLPRH------HDSDAAARSSPTTPLSWSTAAAA---------AASSKNAG
        +DD WV VALSDDSLVVDLLL LNR P          L WS+RQPRS+P LPR        +SD AAR+SPTTPL+WS+   A         A  SKNAG
Subjt:  ADDHWVHVALSDDSLVVDLLLCLNRPP---------PLHWSLRQPRSRPTLPRH------HDSDAAARSSPTTPLSWSTAAAA---------AASSKNAG

Query:  KSEAVATMKRPRKKKTLGELKEEEVLLLKERRSLKDALATLQLNVEKQRSINGSLKKMKLDFQSQQATEMAVASAVLKEASSDQPQPQALLTSICNTTPV
        KSE VATMKRPRKKKTLGELKEEEVLLLKERRSLKDALA L+L VEKQRSINGSLKKMK+DF SQQA EM V SA+ KEASSD+PQ Q L  SICNT PV
Subjt:  KSEAVATMKRPRKKKTLGELKEEEVLLLKERRSLKDALATLQLNVEKQRSINGSLKKMKLDFQSQQATEMAVASAVLKEASSDQPQPQALLTSICNTTPV

Query:  GVDA-SYQLPLPNVSCKLEEMGTIGSVCCLLPDLNLPFQEDSGAEVLYQMS
        GVDA SYQLPLPNVSCKL+EMGT+ +V   +PDLN+PFQEDSGA++LY+MS
Subjt:  GVDA-SYQLPLPNVSCKLEEMGTIGSVCCLLPDLNLPFQEDSGAEVLYQMS

XP_008461167.1 PREDICTED: uncharacterized protein LOC103499830 [Cucumis melo]6.9e-7570.95Show/hide
Query:  ADDHWVHVALSDDSLVVDLLLCLNRPP----PLHWSLRQPRSRPTLPRHHDSDAAARSSPTTPLSWSTA----------AAAAASSKNAGKSEAVATMKR
        +D  WV VALSDDSLVVDLLL LNRPP    PL WS+RQPRS+P LPR H SD+AAR+SPTTPL+WS++          A+ AA SK AGKSE VATMKR
Subjt:  ADDHWVHVALSDDSLVVDLLLCLNRPP----PLHWSLRQPRSRPTLPRHHDSDAAARSSPTTPLSWSTA----------AAAAASSKNAGKSEAVATMKR

Query:  PRKKKTLGELKEEEVLLLKERRSLKDALATLQLNVEKQRSINGSLKKMKLDFQSQQATEMAVASAVLKEASSDQPQP-QALLTSICNTTPVGVDASYQLP
        PRKKKTLGELKEEEVLLLKERRSLKDALATL+L+VEKQR++NGSLKK+KLD +SQQA EM V SAV  EA+S+QPQ  Q    SIC+TTP+G DASYQL 
Subjt:  PRKKKTLGELKEEEVLLLKERRSLKDALATLQLNVEKQRSINGSLKKMKLDFQSQQATEMAVASAVLKEASSDQPQP-QALLTSICNTTPVGVDASYQLP

Query:  LPNVSCKLEEMGTIGSVCCLLPDLNLPFQEDSGAEVLYQMS
        +PNVSCKL+E+GT+G+V  LLPDLNLPFQEDS  E LY+MS
Subjt:  LPNVSCKLEEMGTIGSVCCLLPDLNLPFQEDSGAEVLYQMS

XP_022959761.1 uncharacterized protein LOC111460736 isoform X1 [Cucurbita moschata]8.4e-8166.67Show/hide
Query:  MNVSIHF-LNWPYVLSSLHSLYLYLLLPTFSLLAAMSD-----ADDHWVHVALSDDSLVVDLLLCLNRPP---------PLHWSLRQPRSRPTLPRHHDS
        MNV+IH  + WP++     S+   L  P FSL  ++S      +DD WV VALSDDSLVVDLLL LNR P          L WS+RQPRS+P LPR   S
Subjt:  MNVSIHF-LNWPYVLSSLHSLYLYLLLPTFSLLAAMSD-----ADDHWVHVALSDDSLVVDLLLCLNRPP---------PLHWSLRQPRSRPTLPRHHDS

Query:  DAAARSSPTTPLSWSTAAAA---------AASSKNAGKSEAVATMKRPRKKKTLGELKEEEVLLLKERRSLKDALATLQLNVEKQRSINGSLKKMKLDFQ
        D AAR+SPTTPL+WS+   A         A  SKNAGKSE VATMKRPRKKKTLGELKEEEVLLLKERRSLKDALA L+L VEKQRSINGSLKKMK+DF 
Subjt:  DAAARSSPTTPLSWSTAAAA---------AASSKNAGKSEAVATMKRPRKKKTLGELKEEEVLLLKERRSLKDALATLQLNVEKQRSINGSLKKMKLDFQ

Query:  SQQATEMAVASAVLKEASSDQPQPQALLTSICNTTPVGVDA-SYQLPLPNVSCKLEEMGTIGSVCCLLPDLNLPFQEDSGAEVLYQMS
        SQQA EMAV SA+ KEASSD+PQPQ L  SICNT PVGVDA SYQLPLPNVSCKL+EMGT+G+V   +PDLN+PFQEDSGA++LY+MS
Subjt:  SQQATEMAVASAVLKEASSDQPQPQALLTSICNTTPVGVDA-SYQLPLPNVSCKLEEMGTIGSVCCLLPDLNLPFQEDSGAEVLYQMS

XP_023004780.1 uncharacterized protein LOC111497981 isoform X1 [Cucurbita maxima]3.6e-7666.06Show/hide
Query:  SLHSLYLY---LLLPTFSLLAAMSD-ADDHWVHVALSDDSLVVDLLLCLNRPP---------PLHWSLRQPRSRPTLPRH------HDSDAAARSSPTTP
        S+++ +++    LL + SL A++S  +DD WV VALSDDSLVVDLLL LNR P          L WS+RQPRS+P LPR        +SD AAR+SPTTP
Subjt:  SLHSLYLY---LLLPTFSLLAAMSD-ADDHWVHVALSDDSLVVDLLLCLNRPP---------PLHWSLRQPRSRPTLPRH------HDSDAAARSSPTTP

Query:  LSWSTAAAA---------AASSKNAGKSEAVATMKRPRKKKTLGELKEEEVLLLKERRSLKDALATLQLNVEKQRSINGSLKKMKLDFQSQQATEMAVAS
        L+WS+   A         A  SKNAGKSEAVATMKRPRKKKTLGELKEEEVLLLKERRSLKDALA L+L VE+QRSINGSLKKMK+DF SQQA EM V S
Subjt:  LSWSTAAAA---------AASSKNAGKSEAVATMKRPRKKKTLGELKEEEVLLLKERRSLKDALATLQLNVEKQRSINGSLKKMKLDFQSQQATEMAVAS

Query:  AVLKEASSDQPQPQALLTSICNTTPVGVDA-SYQLPLPNVSCKLEEMGTIGSVCCLLPDLNLPFQEDSGAEVLYQMS
        A+ KE SSD+PQ Q LL SICNT PVGVDA SYQLPLPNVSCKL+EMGT+ +V   +PDLN+PFQEDS A+VLYQMS
Subjt:  AVLKEASSDQPQPQALLTSICNTTPVGVDA-SYQLPLPNVSCKLEEMGTIGSVCCLLPDLNLPFQEDSGAEVLYQMS

XP_038896159.1 uncharacterized protein LOC120084450 isoform X1 [Benincasa hispida]1.5e-7763.76Show/hide
Query:  MNVSIHF-LNWPYVLSSLHSLYLYLLLP-----------TFSLLAAMSD-ADDHWVHVALSDDSLVVDLLLCLNRPP----PLHWSLRQPRSRPTLPRH-
        MNV+IHF  + P  ++ L S +    LP           + SL+A  S  +D+ WV VALSDDSLVVDLLL LNRPP    PL WS+RQPRS+P LPRH 
Subjt:  MNVSIHF-LNWPYVLSSLHSLYLYLLLP-----------TFSLLAAMSD-ADDHWVHVALSDDSLVVDLLLCLNRPP----PLHWSLRQPRSRPTLPRH-

Query:  -----HDSDAAARSSPTTPLSWSTA----AAAAASSKNAGKSEAVATMKRPRKKKTLGELKEEEVLLLKERRSLKDALATLQLNVEKQRSINGSLKKMKL
              +SD+AAR+SPTTPL+WS+     A+ AA SK AGKSE VAT+KRPRKKKTLGELKEEEVLLLKERRSLKDALATL+L+VEKQR++NGSLKKMKL
Subjt:  -----HDSDAAARSSPTTPLSWSTA----AAAAASSKNAGKSEAVATMKRPRKKKTLGELKEEEVLLLKERRSLKDALATLQLNVEKQRSINGSLKKMKL

Query:  DFQSQQATEMAVASAVLKEASSDQPQPQALLTSICNTTPVG--------VDASYQLPLPNVSCKLEEMGTIGSVCCLLPDLNLPFQEDSGAEVLYQMS
        D +SQQA E  V SAVL+EA+SDQPQ Q    SICNTT +G        VDASYQL LPNVSCKL+E+GT+G+V  LLPDLNLPFQEDSG E LY+MS
Subjt:  DFQSQQATEMAVASAVLKEASSDQPQPQALLTSICNTTPVG--------VDASYQLPLPNVSCKLEEMGTIGSVCCLLPDLNLPFQEDSGAEVLYQMS

TrEMBL top hitse value%identityAlignment
A0A0A0K8M7 Uncharacterized protein1.8e-7369.83Show/hide
Query:  ADDHWVHVALSDDSLVVDLLLCLNRPP----PLHWSLRQPRSRPTLPRHHDSDAAARSSPTTPLSWSTA-----------AAAAASSKNAGKSEAVATMK
        +D  WV VALSDDSLVVDLLL LNRPP    PL WS+RQPRS+  LPR H SD+AAR+SPTTPL+WS++           A+ AA SK AGKSE VATMK
Subjt:  ADDHWVHVALSDDSLVVDLLLCLNRPP----PLHWSLRQPRSRPTLPRHHDSDAAARSSPTTPLSWSTA-----------AAAAASSKNAGKSEAVATMK

Query:  RPRKKKTLGELKEEEVLLLKERRSLKDALATLQLNVEKQRSINGSLKKMKLDFQSQQATEMAVASAVLKEASSDQPQP-QALLTSICNTTPVGVDASYQL
        RPRKKKTLGELKEEEVLLLKERRSLKDALATL+L+VEKQR++NGSLKKMKLD +SQQAT+M V SAVL EA+S+Q Q  Q    S+C+TTP+G DASYQL
Subjt:  RPRKKKTLGELKEEEVLLLKERRSLKDALATLQLNVEKQRSINGSLKKMKLDFQSQQATEMAVASAVLKEASSDQPQP-QALLTSICNTTPVGVDASYQL

Query:  PLPNVSCKLEEMGTIGSVCCLLPDLNLPFQEDSGAEVLYQMS
         +PNVSCKL+E+ T+G+V  LLPDLNLPFQEDS  E LY+MS
Subjt:  PLPNVSCKLEEMGTIGSVCCLLPDLNLPFQEDSGAEVLYQMS

A0A1S3CDM0 uncharacterized protein LOC1034998303.3e-7570.95Show/hide
Query:  ADDHWVHVALSDDSLVVDLLLCLNRPP----PLHWSLRQPRSRPTLPRHHDSDAAARSSPTTPLSWSTA----------AAAAASSKNAGKSEAVATMKR
        +D  WV VALSDDSLVVDLLL LNRPP    PL WS+RQPRS+P LPR H SD+AAR+SPTTPL+WS++          A+ AA SK AGKSE VATMKR
Subjt:  ADDHWVHVALSDDSLVVDLLLCLNRPP----PLHWSLRQPRSRPTLPRHHDSDAAARSSPTTPLSWSTA----------AAAAASSKNAGKSEAVATMKR

Query:  PRKKKTLGELKEEEVLLLKERRSLKDALATLQLNVEKQRSINGSLKKMKLDFQSQQATEMAVASAVLKEASSDQPQP-QALLTSICNTTPVGVDASYQLP
        PRKKKTLGELKEEEVLLLKERRSLKDALATL+L+VEKQR++NGSLKK+KLD +SQQA EM V SAV  EA+S+QPQ  Q    SIC+TTP+G DASYQL 
Subjt:  PRKKKTLGELKEEEVLLLKERRSLKDALATLQLNVEKQRSINGSLKKMKLDFQSQQATEMAVASAVLKEASSDQPQP-QALLTSICNTTPVGVDASYQLP

Query:  LPNVSCKLEEMGTIGSVCCLLPDLNLPFQEDSGAEVLYQMS
        +PNVSCKL+E+GT+G+V  LLPDLNLPFQEDS  E LY+MS
Subjt:  LPNVSCKLEEMGTIGSVCCLLPDLNLPFQEDSGAEVLYQMS

A0A5D3CK05 Uncharacterized protein4.8e-7471.01Show/hide
Query:  ADDHWVHVALSDDSLVVDLLLCLNRPP----PLHWSLRQPRSRPTLPRHHDSDAAARSSPTTPLSWSTA----------AAAAASSKNAGKSEAVATMKR
        +D  WV VALSDDSLVVDLLL LNRPP    PL WS+RQPRS+P LPR H SD+AAR+SPTTPL+WS++          A+ AA SK AGKSE VATMKR
Subjt:  ADDHWVHVALSDDSLVVDLLLCLNRPP----PLHWSLRQPRSRPTLPRHHDSDAAARSSPTTPLSWSTA----------AAAAASSKNAGKSEAVATMKR

Query:  PRKKKTLGELKEEEVLLLKERRSLKDALATLQLNVEKQRSINGSLKKMKLDFQSQQATEMAVASAVLKEASSDQPQP-QALLTSICNTTPVGVDASYQLP
        PRKKKTLGELKEEEVLLLKERRSLKDALATL+L+VEKQR++NGSLKK+KLD +SQQA EM V SAV  EA+S+QPQ  Q    SIC+TTP+G DASYQL 
Subjt:  PRKKKTLGELKEEEVLLLKERRSLKDALATLQLNVEKQRSINGSLKKMKLDFQSQQATEMAVASAVLKEASSDQPQP-QALLTSICNTTPVGVDASYQLP

Query:  LPNVSCKLEEMGTIGSVCCLLPDLNLPFQEDSGAEVLY
        +PNVSCKL+E+GT+G+V  LLPDLNLPFQEDS  E LY
Subjt:  LPNVSCKLEEMGTIGSVCCLLPDLNLPFQEDSGAEVLY

A0A6J1H5R9 uncharacterized protein LOC111460736 isoform X14.1e-8166.67Show/hide
Query:  MNVSIHF-LNWPYVLSSLHSLYLYLLLPTFSLLAAMSD-----ADDHWVHVALSDDSLVVDLLLCLNRPP---------PLHWSLRQPRSRPTLPRHHDS
        MNV+IH  + WP++     S+   L  P FSL  ++S      +DD WV VALSDDSLVVDLLL LNR P          L WS+RQPRS+P LPR   S
Subjt:  MNVSIHF-LNWPYVLSSLHSLYLYLLLPTFSLLAAMSD-----ADDHWVHVALSDDSLVVDLLLCLNRPP---------PLHWSLRQPRSRPTLPRHHDS

Query:  DAAARSSPTTPLSWSTAAAA---------AASSKNAGKSEAVATMKRPRKKKTLGELKEEEVLLLKERRSLKDALATLQLNVEKQRSINGSLKKMKLDFQ
        D AAR+SPTTPL+WS+   A         A  SKNAGKSE VATMKRPRKKKTLGELKEEEVLLLKERRSLKDALA L+L VEKQRSINGSLKKMK+DF 
Subjt:  DAAARSSPTTPLSWSTAAAA---------AASSKNAGKSEAVATMKRPRKKKTLGELKEEEVLLLKERRSLKDALATLQLNVEKQRSINGSLKKMKLDFQ

Query:  SQQATEMAVASAVLKEASSDQPQPQALLTSICNTTPVGVDA-SYQLPLPNVSCKLEEMGTIGSVCCLLPDLNLPFQEDSGAEVLYQMS
        SQQA EMAV SA+ KEASSD+PQPQ L  SICNT PVGVDA SYQLPLPNVSCKL+EMGT+G+V   +PDLN+PFQEDSGA++LY+MS
Subjt:  SQQATEMAVASAVLKEASSDQPQPQALLTSICNTTPVGVDA-SYQLPLPNVSCKLEEMGTIGSVCCLLPDLNLPFQEDSGAEVLYQMS

A0A6J1KVJ7 uncharacterized protein LOC111497981 isoform X11.8e-7666.06Show/hide
Query:  SLHSLYLY---LLLPTFSLLAAMSD-ADDHWVHVALSDDSLVVDLLLCLNRPP---------PLHWSLRQPRSRPTLPRH------HDSDAAARSSPTTP
        S+++ +++    LL + SL A++S  +DD WV VALSDDSLVVDLLL LNR P          L WS+RQPRS+P LPR        +SD AAR+SPTTP
Subjt:  SLHSLYLY---LLLPTFSLLAAMSD-ADDHWVHVALSDDSLVVDLLLCLNRPP---------PLHWSLRQPRSRPTLPRH------HDSDAAARSSPTTP

Query:  LSWSTAAAA---------AASSKNAGKSEAVATMKRPRKKKTLGELKEEEVLLLKERRSLKDALATLQLNVEKQRSINGSLKKMKLDFQSQQATEMAVAS
        L+WS+   A         A  SKNAGKSEAVATMKRPRKKKTLGELKEEEVLLLKERRSLKDALA L+L VE+QRSINGSLKKMK+DF SQQA EM V S
Subjt:  LSWSTAAAA---------AASSKNAGKSEAVATMKRPRKKKTLGELKEEEVLLLKERRSLKDALATLQLNVEKQRSINGSLKKMKLDFQSQQATEMAVAS

Query:  AVLKEASSDQPQPQALLTSICNTTPVGVDA-SYQLPLPNVSCKLEEMGTIGSVCCLLPDLNLPFQEDSGAEVLYQMS
        A+ KE SSD+PQ Q LL SICNT PVGVDA SYQLPLPNVSCKL+EMGT+ +V   +PDLN+PFQEDS A+VLYQMS
Subjt:  AVLKEASSDQPQPQALLTSICNTTPVGVDA-SYQLPLPNVSCKLEEMGTIGSVCCLLPDLNLPFQEDSGAEVLYQMS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G15800.1 unknown protein4.1e-1738.61Show/hide
Query:  WVHVALSDDSLVVDLLLCLNRPPP--------------LHWSLRQPRSR-PTLPRHHDSDAAARSSPTTPLSWSTA---------AAAAAS---------
        W+H A+ DDSLV + L+CL    P              L WS+RQPR++  TL +  D D   R+SPTTPLSWS A         AAAA           
Subjt:  WVHVALSDDSLVVDLLLCLNRPPP--------------LHWSLRQPRSR-PTLPRHHDSDAAARSSPTTPLSWSTA---------AAAAAS---------

Query:  -------SKNAGKSEAVATMKRPRKKKTLGELKEEEVLLLKERRSLKDALATLQLNVEKQRSINGSLKKMKLDFQSQQATEMAVASAVLKEASSDQPQPQ
               SK    S   +  KR RKKKTL +LKEEE +LLKER  L++ LAT+Q  +++QR+ N SLK  KL  +SQ+  + +     L  A  +   P+
Subjt:  -------SKNAGKSEAVATMKRPRKKKTLGELKEEEVLLLKERRSLKDALATLQLNVEKQRSINGSLKKMKLDFQSQQATEMAVASAVLKEASSDQPQPQ

Query:  AL
        +L
Subjt:  AL

AT1G80610.1 unknown protein5.9e-1636.07Show/hide
Query:  DHWVHVALSDDSLVVDLLLCLNRPPP----------LHWSLRQPRSRPTLPRHHDSDAAARSSPTTPLSWSTAAAAAASSKNAGKSEAVATM--------
        ++W+ VA+SDDS+V + LL L    P          L WS+RQ RS+             R+SPTTPLSWS A + +    + G      TM        
Subjt:  DHWVHVALSDDSLVVDLLLCLNRPPP----------LHWSLRQPRSRPTLPRHHDSDAAARSSPTTPLSWSTAAAAAASSKNAGKSEAVATM--------

Query:  ---------------------------KRPRKKKTLGELKEEEVLLLKERRSLKDALATLQLNVEKQRSINGSLKKMKLDFQS
                                   KR RKKKTL ELKEEE++LLKE   LK+ LA ++  +E+QR+ N +LKKMK + QS
Subjt:  ---------------------------KRPRKKKTLGELKEEEVLLLKERRSLKDALATLQLNVEKQRSINGSLKKMKLDFQS

AT4G32030.1 unknown protein3.0e-1230.91Show/hide
Query:  ADDHWVHVALSDDSLVVDLLLCLNRP------------PPLHWSLRQPRSRPT---------LPRHHDSDAAARSSPTTPLSWSTAAAA-----------
        A D WV VA++DD LVV+LLL L               PPL W +RQ RSR +         +    D D + R+SP TPLSWS  + +           
Subjt:  ADDHWVHVALSDDSLVVDLLLCLNRP------------PPLHWSLRQPRSRPT---------LPRHHDSDAAARSSPTTPLSWSTAAAA-----------

Query:  -------AASSKNAGKSEAV--------ATMKRPRKKKTLGELKEEEVLLLKERRSLKDALATLQLNVEKQRSINGSLKKMKLDFQSQQATEMAVASAVL
               A+ S + G    V           KR +K+K+  ELK EE L LKER  L+  +A+L+   ++Q   N  LK++KLD  S + T         
Subjt:  -------AASSKNAGKSEAV--------ATMKRPRKKKTLGELKEEEVLLLKERRSLKDALATLQLNVEKQRSINGSLKKMKLDFQSQQATEMAVASAVL

Query:  KEASSDQPQPQALLTSICNTTPVGVDASYQLP--LPNVSCKLEEMGTIGSVCCLLPDLNLPFQEDSGAEVLYQMS
                          N  PV +    QL     + SCK  +    GS   +LPDLN+   E+   E+LY  S
Subjt:  KEASSDQPQPQALLTSICNTTPVGVDASYQLP--LPNVSCKLEEMGTIGSVCCLLPDLNLPFQEDSGAEVLYQMS

AT4G32030.2 unknown protein4.2e-0634.18Show/hide
Query:  ADDHWVHVALSDDSLVVDLLLCLNRP------------PPLHWSLRQPRSRPT---------LPRHHDSDAAARSSPTTPLSWSTAAAA-----------
        A D WV VA++DD LVV+LLL L               PPL W +RQ RSR +         +    D D + R+SP TPLSWS  + +           
Subjt:  ADDHWVHVALSDDSLVVDLLLCLNRP------------PPLHWSLRQPRSRPT---------LPRHHDSDAAARSSPTTPLSWSTAAAA-----------

Query:  -------AASSKNAGKSEAV--------ATMKRPRKKKTLGELKEEEVLLLKERRSLK
               A+ S + G    V           KR +K+K+  ELK EE L LKER  L+
Subjt:  -------AASSKNAGKSEAV--------ATMKRPRKKKTLGELKEEEVLLLKERRSLK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAATGTGAGCATCCACTTCCTAAATTGGCCCTACGTTCTCTCTTCTCTTCATTCTCTCTATTTATACCTTCTTCTTCCCACTTTCTCTCTCCTCGCCGCCATGTCCGA
CGCCGACGACCACTGGGTCCACGTGGCCCTCTCCGACGACTCCCTCGTCGTCGACTTACTCCTCTGCCTCAACCGCCCTCCCCCCCTCCACTGGTCCCTCCGCCAGCCCC
GCTCCCGCCCCACCCTTCCCCGCCACCACGACTCCGACGCCGCCGCCAGATCCAGTCCCACCACTCCTCTCTCCTGGTCTACCGCCGCCGCCGCCGCCGCATCATCTAAG
AATGCTGGTAAGAGTGAAGCAGTTGCAACAATGAAGAGACCAAGAAAGAAAAAGACACTAGGAGAACTTAAAGAAGAGGAGGTTTTGCTACTGAAGGAAAGAAGAAGCTT
GAAAGATGCCTTGGCTACCTTGCAGCTCAATGTAGAAAAACAGAGGTCTATAAATGGAAGCTTGAAGAAAATGAAGCTTGATTTCCAATCACAACAAGCGACCGAGATGG
CTGTAGCATCTGCTGTTCTGAAAGAGGCAAGCTCCGACCAACCTCAACCGCAGGCACTGCTGACATCTATATGCAACACAACACCCGTCGGGGTCGATGCTTCATACCAA
TTACCTCTGCCAAACGTTTCTTGCAAATTAGAGGAGATGGGAACTATAGGGAGTGTTTGTTGTTTGTTGCCTGATCTTAATTTGCCTTTCCAAGAGGACTCCGGTGCTGA
GGTCCTGTACCAAATGAGCTAA
mRNA sequenceShow/hide mRNA sequence
CCCATTTTGCTACTTATAATTTTTTAATACATATATATATAAGAATTAAAAGAGAGAGAGAGAGAAAAAGTAGGGAGGATTAAGAGAGTGGATGAGGAAGTAATGAATGT
GAGCATCCACTTCCTAAATTGGCCCTACGTTCTCTCTTCTCTTCATTCTCTCTATTTATACCTTCTTCTTCCCACTTTCTCTCTCCTCGCCGCCATGTCCGACGCCGACG
ACCACTGGGTCCACGTGGCCCTCTCCGACGACTCCCTCGTCGTCGACTTACTCCTCTGCCTCAACCGCCCTCCCCCCCTCCACTGGTCCCTCCGCCAGCCCCGCTCCCGC
CCCACCCTTCCCCGCCACCACGACTCCGACGCCGCCGCCAGATCCAGTCCCACCACTCCTCTCTCCTGGTCTACCGCCGCCGCCGCCGCCGCATCATCTAAGAATGCTGG
TAAGAGTGAAGCAGTTGCAACAATGAAGAGACCAAGAAAGAAAAAGACACTAGGAGAACTTAAAGAAGAGGAGGTTTTGCTACTGAAGGAAAGAAGAAGCTTGAAAGATG
CCTTGGCTACCTTGCAGCTCAATGTAGAAAAACAGAGGTCTATAAATGGAAGCTTGAAGAAAATGAAGCTTGATTTCCAATCACAACAAGCGACCGAGATGGCTGTAGCA
TCTGCTGTTCTGAAAGAGGCAAGCTCCGACCAACCTCAACCGCAGGCACTGCTGACATCTATATGCAACACAACACCCGTCGGGGTCGATGCTTCATACCAATTACCTCT
GCCAAACGTTTCTTGCAAATTAGAGGAGATGGGAACTATAGGGAGTGTTTGTTGTTTGTTGCCTGATCTTAATTTGCCTTTCCAAGAGGACTCCGGTGCTGAGGTCCTGT
ACCAAATGAGCTAA
Protein sequenceShow/hide protein sequence
MNVSIHFLNWPYVLSSLHSLYLYLLLPTFSLLAAMSDADDHWVHVALSDDSLVVDLLLCLNRPPPLHWSLRQPRSRPTLPRHHDSDAAARSSPTTPLSWSTAAAAAASSK
NAGKSEAVATMKRPRKKKTLGELKEEEVLLLKERRSLKDALATLQLNVEKQRSINGSLKKMKLDFQSQQATEMAVASAVLKEASSDQPQPQALLTSICNTTPVGVDASYQ
LPLPNVSCKLEEMGTIGSVCCLLPDLNLPFQEDSGAEVLYQMS