; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0025259 (gene) of Chayote v1 genome

Gene IDSed0025259
OrganismSechium edule (Chayote v1)
Descriptionorgan-specific protein S2-like isoform X2
Genome locationLG05:35745752..35751057
RNA-Seq ExpressionSed0025259
SyntenySed0025259
Gene Ontology termsNA
InterPro domainsIPR024489 - Organ specific protein


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAE8646137.1 hypothetical protein Csa_016549 [Cucumis sativus]8.0e-4343.66Show/hide
Query:  MKIIPTLGITLVLLHLLVNNIESRYEPGGHWRKVMEEDCL-------------EDMKFEN-------------------------VKDIEPRPSFMSYP-
        MKI PT GITL+LL L  N IESR+EPGG W+ V+E+D L             + +K EN                          KDIEPRPS   YP 
Subjt:  MKIIPTLGITLVLLHLLVNNIESRYEPGGHWRKVMEEDCL-------------EDMKFEN-------------------------VKDIEPRPSFMSYP-

Query:  DDIKDQHFAKDIEPRPSLTFYPNNNVKAKLFIKDIEPRPTGSFYPNDVNV-KFFIEDIEPRPSFTSYPNDDIKDELLTKDIEPRTSATFN----------
        D+ KD+ F KDIEPRPSLTFYPN++ K KLF KDIEPRP+ +FYPND +  KFFI+DIEPRPS T YPNDD K++L TKDIEPR SATF           
Subjt:  DDIKDQHFAKDIEPRPSLTFYPNNNVKAKLFIKDIEPRPTGSFYPNDVNV-KFFIEDIEPRPSFTSYPNDDIKDELLTKDIEPRTSATFN----------

Query:  TRDMDRKVSSTDDRHGDADIQMITEESRLEPRRIETKLEDSSTEETYSKSDQNPSNGEIIKPQQNILN
        T+D++ + S+T   + D   ++ T++  +EPR   T   +    + +   D  P       P  +  N
Subjt:  TRDMDRKVSSTDDRHGDADIQMITEESRLEPRRIETKLEDSSTEETYSKSDQNPSNGEIIKPQQNILN

XP_023530023.1 uncharacterized protein LOC111792698 [Cucurbita pepo subsp. pepo]3.4e-4134.38Show/hide
Query:  MKIIPTLGITLVLLHLLVNNIESRYEPGGHWRKVM-EEDCLEDMKFEN----VKDIEPRPSFMSYPDDIKDQHFAKDIEPRPSLTFYPNNNVKAKLFIKD
        MK+    GITL+LL + VNNIESR+EPG     V  +ED +E  + EN    VKDIEPRPS   YP + + + F KDIEPRPS TFYPN NV   LF KD
Subjt:  MKIIPTLGITLVLLHLLVNNIESRYEPGGHWRKVM-EEDCLEDMKFEN----VKDIEPRPSFMSYPDDIKDQHFAKDIEPRPSLTFYPNNNVKAKLFIKD

Query:  IEPRPTGSFYPND-VNVKFFIEDIEPRPSFTSYPNDDIKDELLTKDIEPRTSATFNTRDMDRKVSSTDDRHGDADIQMITEESRLEPRRIETKLEDSSTE
        IEPRP+ +FYPND +    F +DI PRPS T YPND++K  L  KD+EPR SATF   D               +++ +  +  +EPR   T   + + +
Subjt:  IEPRPTGSFYPND-VNVKFFIEDIEPRPSFTSYPNDDIKDELLTKDIEPRTSATFNTRDMDRKVSSTDDRHGDADIQMITEESRLEPRRIETKLEDSSTE

Query:  ETYSKSDQNPSNGEIIKPQQNILNLDSNAKDEKPHKPSI-------IKIYPRDKDAKPRPSILDLYN-------YVKDVE-------HQQDILDLYNYAK
              D  P       P  N+  L  + KD +P +PS        +K    DKD +PRPS     N       + KD+E       +  D +    + K
Subjt:  ETYSKSDQNPSNGEIIKPQQNILNLDSNAKDEKPHKPSI-------IKIYPRDKDAKPRPSILDLYN-------YVKDVE-------HQQDILDLYNYAK

Query:  DVLKSQPSIIDLYHYAKNFKPQPNILDLYNAKDVKPQSNIVDFYH--------YVMDVKLQPSI-------IDHYHYAKDFKSHPSIVDHYNYVEDVKPQ
        D+ + +PS          F P  N+  L   KD++P+ +   FY         +  D++L+PS        +    Y KD +  P I  +          
Subjt:  DVLKSQPSIIDLYHYAKNFKPQPNILDLYNAKDVKPQSNIVDFYH--------YVMDVKLQPSI-------IDHYHYAKDFKSHPSIVDHYNYVEDVKPQ

Query:  PNTVDLYHYVKDVKPQPSII---DLYNYANDAKPQPSIVDHNNYA
        PN   +  +VKD++P+PSI        Y +D  P+ S  D ++ A
Subjt:  PNTVDLYHYVKDVKPQPSII---DLYNYANDAKPQPSIVDHNNYA

XP_031745283.1 uncharacterized protein LOC105436132 isoform X1 [Cucumis sativus]9.5e-4435.55Show/hide
Query:  MKIIPTLGITLVLLHLLVNNIESRYEPGGHWRKVMEEDCL-------------EDMKFEN-------------------------VKDIEPRPSFMSYP-
        MKI PT GITL+LL L  N IESR+EPGG W+ V+E+D L             + +K EN                          KDIEPRPS   YP 
Subjt:  MKIIPTLGITLVLLHLLVNNIESRYEPGGHWRKVMEEDCL-------------EDMKFEN-------------------------VKDIEPRPSFMSYP-

Query:  DDIKDQHFAKDIEPRPSLTFYPNNNVKAKLFIKDIEPRPTGSFYPNDVNV-KFFIEDIEPRPSFTSYPNDDIKDELLTKDIEPRTSATFNTRDMDRKVSS
        D+ KD+ F KDIEPRPSLTFYPN++ K KLF KDIEPRP+ +FYPND +  KFFI+DIEPRPS T YPND  KD+L TKDIEPR S TF   D       
Subjt:  DDIKDQHFAKDIEPRPSLTFYPNNNVKAKLFIKDIEPRPTGSFYPNDVNV-KFFIEDIEPRPSFTSYPNDDIKDELLTKDIEPRTSATFNTRDMDRKVSS

Query:  TDDRHGDADIQMITEESRLEPRRIETKLEDSSTEETYSKSDQNPSNGEIIKPQQNILNLDSNAKDEKPHKPSIIKIYPRD--------KDAKPRPSILDL
              D   ++ T++  +EPR   T   +  +++ +   D  P       P  +  N     KD +P +PS    YP D        KD +PRPS    
Subjt:  TDDRHGDADIQMITEESRLEPRRIETKLEDSSTEETYSKSDQNPSNGEIIKPQQNILNLDSNAKDEKPHKPSIIKIYPRD--------KDAKPRPSILDL

Query:  YNYVKDVEHQQDILDLYNYAKDVLKSQPSIIDLYHYAKNFKPQPNILDLYNAKDVKPQSNIVDFYHYVMDVKLQPSIIDHYHYAKDFKSHPSIVDHYNYV
         N   D +++        + KD+ + +PS          F P     D +  KD++P+ +   + +     KL         + KD +  PS   + N  
Subjt:  YNYVKDVEHQQDILDLYNYAKDVLKSQPSIIDLYHYAKNFKPQPNILDLYNAKDVKPQSNIVDFYHYVMDVKLQPSIIDHYHYAKDFKSHPSIVDHYNYV

Query:  EDVKPQPNTVDLYHYVKDVKPQPSIIDLYNYANDAK
               +  +   + KD++P+PS+    N  ND+K
Subjt:  EDVKPQPNTVDLYHYVKDVKPQPSIIDLYNYANDAK

XP_031745285.1 proteoglycan 4 isoform X2 [Cucumis sativus]3.3e-4441.54Show/hide
Query:  MKIIPTLGITLVLLHLLVNNIESRYEPGGHWRKVMEEDCL-------------EDMKFEN-------------------------VKDIEPRPSFMSYP-
        MKI PT GITL+LL L  N IESR+EPGG W+ V+E+D L             + +K EN                          KDIEPRPS   YP 
Subjt:  MKIIPTLGITLVLLHLLVNNIESRYEPGGHWRKVMEEDCL-------------EDMKFEN-------------------------VKDIEPRPSFMSYP-

Query:  DDIKDQHFAKDIEPRPSLTFYPNNNVKAKLFIKDIEPRPTGSFYPNDVNV-KFFIEDIEPRPSFTSYPNDDIKDELLTKDIEPRTSATFN----------
        D+ KD+ F KDIEPRPSLTFYPN++ K KLF KDIEPRP+ +FYPND +  KFFI+DIEPRPS T YPNDD K++L TKDIEPR SATF           
Subjt:  DDIKDQHFAKDIEPRPSLTFYPNNNVKAKLFIKDIEPRPTGSFYPNDVNV-KFFIEDIEPRPSFTSYPNDDIKDELLTKDIEPRTSATFN----------

Query:  TRDMDRKVSSTDDRHGDADIQMITEESRLEPRRIETKLEDSSTEETYSKSDQNPSNGEIIKPQQNILNLDSNAKDEKPHKPSIIKIYPRD--------KD
        T+D++ + S T   + D   ++ T++  +EPR   T   +  +++ +   D  P       P  +  N     KD +P +PS    YP D        KD
Subjt:  TRDMDRKVSSTDDRHGDADIQMITEESRLEPRRIETKLEDSSTEETYSKSDQNPSNGEIIKPQQNILNLDSNAKDEKPHKPSIIKIYPRD--------KD

Query:  AKPRPSILDLYN-------YVKDVE
         +PRPS     N       + KD+E
Subjt:  AKPRPSILDLYN-------YVKDVE

XP_038887162.1 uncharacterized protein LOC120077350 [Benincasa hispida]1.2e-4639.18Show/hide
Query:  MKIIPTLGITLVLLHLLVNNIESRYEPGGHWRKVME-----------EDCLEDMKFEN----VKDIEPRPSFMSYPDDIKDQHFAKDIEPRPSLTFYPNN
        MKI PT GITL+L  LLV++IESRYEPGG WR V+E           EDCLED K  N    +KDIEPRPS   YPDD K + F KDIEPRPS TFYP  
Subjt:  MKIIPTLGITLVLLHLLVNNIESRYEPGGHWRKVME-----------EDCLEDMKFEN----VKDIEPRPSFMSYPDDIKDQHFAKDIEPRPSLTFYPNN

Query:  NVKAKLFIKDIEPRPTGSFYPNDVNVKFFIEDIEPRPSFTSYPNDDIKDELLTKDIEPRTSATFN---------TRDMDRKVSSTDDRHGDADIQMITEE
        +V AKLF KDIEPRP+ +FYPNDV + FF ++IEPRPS T YPND+ K    TKDIEPR S TF          T+D++ + S+T      AD+++    
Subjt:  NVKAKLFIKDIEPRPTGSFYPNDVNVKFFIEDIEPRPSFTSYPNDDIKDELLTKDIEPRTSATFN---------TRDMDRKVSSTDDRHGDADIQMITEE

Query:  SRLEPRRIETKLEDSSTEETYSKSDQNPSNGEIIKPQQNILNLDSNAKDEKPHK---PSIIKIYPRDKDAKPRPSILDLYNYVKDVEHQQDILDLYNYAK
          +EPR   T   D    + ++K+ + P  G    P  N +   +   + +P     P+ +K  P  KD +PRPS          V    D + +  +A 
Subjt:  SRLEPRRIETKLEDSSTEETYSKSDQNPSNGEIIKPQQNILNLDSNAKDEKPHK---PSIIKIYPRDKDAKPRPSILDLYNYVKDVEHQQDILDLYNYAK

Query:  DVLKSQPSIIDLYHYAKNFKPQPNILDLYNAKDVKPQSNIVDFY------HYVMDVKLQPSIIDH
        D+ + +PS+         F P  N +  +  KD +P+ +   +       H+  D+++Q +   +
Subjt:  DVLKSQPSIIDLYHYAKNFKPQPNILDLYNAKDVKPQSNIVDFY------HYVMDVKLQPSIIDH

TrEMBL top hitse value%identityAlignment
A0A0A0K3R4 Uncharacterized protein1.2e-4435.94Show/hide
Query:  MKIIPTLGITLVLLHLLVNNIESRYEPGGHWRKVMEEDCL-------------EDMKFENV--KDIEPRPSFMSYP-DDIKDQHFAKDIEPRPSLTFYPN
        MKI PT GITL+LL L  N IESRYEPGG W+ V+E+D L             + +K EN    D +PRPS   YP D+ KD+ F KDIEPRPS TFYPN
Subjt:  MKIIPTLGITLVLLHLLVNNIESRYEPGGHWRKVMEEDCL-------------EDMKFENV--KDIEPRPSFMSYP-DDIKDQHFAKDIEPRPSLTFYPN

Query:  NNVKAKLFIKDIEPRPTGSFYPN-DVNVKFFIEDIEPRPSFTSYPNDDIKDELLTKDIEPRTSATFNTRDMDRKVSSTDDRHGDADIQMITEESRLEPRR
        +  K + F KDIEPRP+ +FYPN D   K F +DIEPRPS T YPNDD K++L TKDIEPR SATF   D             D   ++ T++  +EPR 
Subjt:  NNVKAKLFIKDIEPRPTGSFYPN-DVNVKFFIEDIEPRPSFTSYPNDDIKDELLTKDIEPRTSATFNTRDMDRKVSSTDDRHGDADIQMITEESRLEPRR

Query:  IETKLEDSSTEETYSKSDQNPSNGEIIKPQQNILNLDSNAKDEKPHKPSIIKIYPRD--------KDAKPRPSILDLYNYVKDVEHQQDILDLYNYAKDV
          T   +  T+      D  P       P  +  N     KD +P +PS    YP D        KD +PRPS     N   D +++        + KD+
Subjt:  IETKLEDSSTEETYSKSDQNPSNGEIIKPQQNILNLDSNAKDEKPHKPSIIKIYPRD--------KDAKPRPSILDLYNYVKDVEHQQDILDLYNYAKDV

Query:  LKSQPSIIDLYHYAKNFKPQPNILDLYNAKDVKPQSNIVDFYHYVMDVKLQPSIIDHYHYAKDFKSHPSIVDHYNYVEDVKPQPNTVDLYHYVKDVKPQP
         + +PS          F P  +  +    KD++P+ +   + +     KL         + KD +  PS   + N  +D K +        + KD++P+P
Subjt:  LKSQPSIIDLYHYAKNFKPQPNILDLYNAKDVKPQSNIVDFYHYVMDVKLQPSIIDHYHYAKDFKSHPSIVDHYNYVEDVKPQPNTVDLYHYVKDVKPQP

Query:  SIIDLYNYANDAKPQPSIVDHNNYAKDVEPQPSI
        S      Y ND        +   + KD+EP+PS+
Subjt:  SIIDLYNYANDAKPQPSIVDHNNYAKDVEPQPSI

A0A0A0K958 Uncharacterized protein7.3e-4244.74Show/hide
Query:  MKIIPTLGITLVLLHLLVNNIESRYEPGGHWRKVMEEDCL-------------EDMKFENV--KDIEPRPSFMSYPDD-IKDQHFAKDIEPRPSLTFYPN
        MKI PT GITL LL L  N IESRYEPGG W+ V+E+D L             + +K EN    DI+PRPS   YP+D  KD+ F KDIEPRPSLTFYPN
Subjt:  MKIIPTLGITLVLLHLLVNNIESRYEPGGHWRKVMEEDCL-------------EDMKFENV--KDIEPRPSFMSYPDD-IKDQHFAKDIEPRPSLTFYPN

Query:  NNVKAKLFIKDIEPRPTGSFYPN-DVNVKFFIEDIEPRPSFTSYPNDDIKDELLTKDIEPRTSATFNTRDMDRKVSSTDDRHGDADIQMITEESRLEPRR
        ++ K KLF KDIEPRP+ +FYPN D   K F +DIEPRPS T YPNDD K++L TKDIEPR S TF   D             D   ++ T++  +EPR 
Subjt:  NNVKAKLFIKDIEPRPTGSFYPN-DVNVKFFIEDIEPRPSFTSYPNDDIKDELLTKDIEPRTSATFNTRDMDRKVSSTDDRHGDADIQMITEESRLEPRR

Query:  IETKLEDSSTEETYSKSDQNPSNGEIIKPQQNILNLDSNAKDEKPHKPSIIKIYPRDKDAKPRPSI
          T   +  +++ +   D  P       P            DE   K  I       KD +PRPS+
Subjt:  IETKLEDSSTEETYSKSDQNPSNGEIIKPQQNILNLDSNAKDEKPHKPSIIKIYPRDKDAKPRPSI

A0A6J1EI09 uncharacterized protein LOC111433582 isoform X11.9e-3741.92Show/hide
Query:  MKIIPTLGITLVLLHLLVNNIESRYEPGGHWRKVM-EEDCLEDMKFEN----VKDIEPRPSFMSYPDDIKDQHFAKDIEPRPSLTFYPNNNVKAKLFIKD
        MK+    GITL+LL + VNNIESR+EPG     V  +ED +E  + EN    VKDIEPRPS   YP + + + F KDIEPRPS TFYPN NV   LF KD
Subjt:  MKIIPTLGITLVLLHLLVNNIESRYEPGGHWRKVM-EEDCLEDMKFEN----VKDIEPRPSFMSYPDDIKDQHFAKDIEPRPSLTFYPNNNVKAKLFIKD

Query:  IEPRPTGSFYPND-VNVKFFIEDIEPRPSFTSYPNDDIKDELLTKDIEPRTSATFNTRDMDRKVSSTDDRHGDADIQMITEESRLEPRRIETKLEDSSTE
        IEPRP+ +FYPN+ V    F +DIEPRPS T YPND++K  +  KDIEPR SATF   D               +++ +  +  +EPR   T   + + +
Subjt:  IEPRPTGSFYPND-VNVKFFIEDIEPRPSFTSYPNDDIKDELLTKDIEPRTSATFNTRDMDRKVSSTDDRHGDADIQMITEESRLEPRRIETKLEDSSTE

Query:  ETYSKSDQNPSNGEIIKPQQNILNL----DSNAKDEKPHKPS-IIKIYPRDKDAKPRPSI
              D  P       P+ N+  +    D   +      P+  +K    DKD +PRPSI
Subjt:  ETYSKSDQNPSNGEIIKPQQNILNL----DSNAKDEKPHKPS-IIKIYPRDKDAKPRPSI

A0A6J1EL39 organ-specific protein S2-like isoform X26.0e-3642.19Show/hide
Query:  MKIIPTLGITLVLLHLLVNNIESRYEPGGHWRKVM-EEDCLEDMKFEN----VKDIEPRPSFMSYPDDIKDQHFAKDIEPRPSLTFYPNNNVKAKLFIKD
        MK+    GITL+LL + VNNIESR+EPG     V  +ED +E  + EN    VKDIEPRPS   YP + + + F KDIEPRPS TFYPN NV   LF KD
Subjt:  MKIIPTLGITLVLLHLLVNNIESRYEPGGHWRKVM-EEDCLEDMKFEN----VKDIEPRPSFMSYPDDIKDQHFAKDIEPRPSLTFYPNNNVKAKLFIKD

Query:  IEPRPTGSFYPND-VNVKFFIEDIEPRPSFTSYPNDDIKDELLTKDIEPRTSATFNTRDMDRKVSSTDDRHGDADIQMITEE--SRLEPRRIETKLEDSS
        IEPRP+ +FYPN+ V    F +DIEPRPS T YPN+++K  L  KDIEPR   TF   +  + +   +D      +     +  ++L  + IE +L  S 
Subjt:  IEPRPTGSFYPND-VNVKFFIEDIEPRPSFTSYPNDDIKDELLTKDIEPRTSATFNTRDMDRKVSSTDDRHGDADIQMITEE--SRLEPRRIETKLEDSS

Query:  TEETYSKSDQNPSNGEIIKPQQNILNLDSNAKDEKPHKPSIIKIYPRDKDAKPRPS
            YS +++     + I+P+ N+ +  S  KDE         +Y    D KP+PS
Subjt:  TEETYSKSDQNPSNGEIIKPQQNILNLDSNAKDEKPHKPSIIKIYPRDKDAKPRPS

A0A6J1I207 organ-specific protein P4-like7.6e-3947.14Show/hide
Query:  MKIIPTLGITLVLLHLLVNNIESRYEPGGHWRKVME-----------EDCLEDMKFEN----VKDIEPRPSFMSYPDDIKDQHFAKDIEPRPSLTFYPNN
        MK+ P    TLVLL LLV+NIESR+EPG  W  V+E           ED +ED +  N    VKDIEPRPS   YP+D++ + F+KDIEPRPS TFYPN 
Subjt:  MKIIPTLGITLVLLHLLVNNIESRYEPGGHWRKVME-----------EDCLEDMKFEN----VKDIEPRPSFMSYPDDIKDQHFAKDIEPRPSLTFYPNN

Query:  NVKAKLFIKDIEPRPTGSFYPN-----------------------DVNVKFFIEDIEPRPSFTSYPNDDIKDELLTKDIEPRTSATFNTRDMDRKVSSTD
        NVK  LF KDIEPRP+ +FYPN                       + N + F +DIEPRP+ +SYP+++ KDEL T D++P+ S T    D++ K SSTD
Subjt:  NVKAKLFIKDIEPRPTGSFYPN-----------------------DVNVKFFIEDIEPRPSFTSYPNDDIKDELLTKDIEPRTSATFNTRDMDRKVSSTD

Query:  DRHGDADIQM
          H + DIQ+
Subjt:  DRHGDADIQM

SwissProt top hitse value%identityAlignment
P17771 Organ-specific protein P42.5e-0730.53Show/hide
Query:  MKIIPTLGITLVLLHLLVNNIESRYEPGGHWRKVMEEDCLEDMKFENVKDIEPRPSFMSYPDDIKDQHFA-KDIEPRPSLTFYPNNNVKA---KLFIKDI
        M +     +  + L L+V N+ESR + G +W+ VM++   +DM  E ++ +    +  +     K+   A  + EP P+ + Y +N + A   K  I + 
Subjt:  MKIIPTLGITLVLLHLLVNNIESRYEPGGHWRKVMEEDCLEDMKFENVKDIEPRPSFMSYPDDIKDQHFA-KDIEPRPSLTFYPNNNVKA---KLFIKDI

Query:  EPRPTGSFYPNDVNVKFFIEDIEPRPSFTSY
        E RP GS Y ++     F +D EPRPS T Y
Subjt:  EPRPTGSFYPNDVNVKFFIEDIEPRPSFTSY

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAGATCATTCCAACCCTTGGAATCACTCTTGTTTTGCTTCATTTGCTTGTCAACAACATAGAATCAAGGTATGAACCTGGAGGACATTGGAGAAAGGTTATGGAAGA
AGATTGCCTTGAAGACATGAAGTTCGAAAATGTCAAGGATATCGAACCACGGCCAAGCTTCATGTCGTACCCGGATGATATCAAAGACCAACATTTTGCAAAAGATATAG
AACCACGACCGAGTCTCACATTTTATCCCAATAATAATGTCAAAGCCAAACTTTTTATTAAAGATATTGAACCACGACCAACTGGATCGTTCTATCCAAATGATGTAAAC
GTCAAGTTTTTTATTGAAGATATTGAACCACGGCCAAGCTTCACGTCTTACCCAAATGATGATATCAAAGACGAACTACTTACGAAAGATATAGAGCCACGAACAAGTGC
CACATTTAATACTCGCGATATGGACCGAAAAGTTTCCTCTACTGATGATCGCCATGGCGACGCTGACATACAGATGATCACAGAAGAGTCAAGACTTGAGCCAAGGAGAA
TTGAGACTAAATTAGAAGATTCATCCACGGAAGAAACTTATTCTAAAAGTGATCAAAACCCTTCCAATGGTGAAATTATCAAACCTCAACAAAATATTCTCAATCTTGAC
TCCAATGCTAAGGATGAAAAACCTCATAAGCCTAGCATCATCAAAATTTACCCGCGGGATAAAGATGCAAAGCCTCGACCTAGCATTCTCGATCTTTACAACTATGTAAA
GGATGTTGAGCACCAACAAGACATTCTCGATCTTTACAACTATGCAAAGGATGTTCTCAAGTCGCAACCAAGCATCATTGACCTTTATCACTATGCAAAGAATTTCAAGC
CACAACCAAACATTTTAGACCTTTACAACGCCAAAGATGTCAAGCCGCAATCAAACATCGTCGACTTTTACCACTATGTCATGGATGTCAAGCTCCAACCAAGCATCATC
GACCATTACCACTATGCAAAGGATTTCAAGTCCCATCCAAGCATCGTCGACCATTACAACTATGTTGAGGATGTCAAGCCACAACCAAACACTGTCGACCTTTACCACTA
TGTAAAGGATGTCAAGCCCCAACCAAGTATCATCGACCTTTACAACTATGCAAACGATGCCAAGCCACAACCAAGCATCGTCGACCACAACAACTATGCCAAGGATGTCG
AGCCACAACCAAGCATCGTCGATCTTTACCACTATGCAAAGGATGTCAATCCCCAACCTAGCATCATCGGCCTTTACAACTATGCAAAGGATGCCAAGCCACAACCAAGT
ATCGTCGATCTTTACCACTATGCAAAGGATGTCAAGCCTCAACCAAGCACCATCGACCTTTATAACTATGCAAATGATGCTAAGCCGCAACCAAACACCGTCAACCTTTA
TCCTTATGCCAAGGATGTTGAGCTGCAATCAAGCATAATCGAGCTTTTCAACTATGTCAAGGATGCAAACAAGGAAAAAGGTGAAAACCAAGAAGAGCTAGTTGAGGCTG
TTGAAAAGGGTAACATATAG
mRNA sequenceShow/hide mRNA sequence
ATGAAGATCATTCCAACCCTTGGAATCACTCTTGTTTTGCTTCATTTGCTTGTCAACAACATAGAATCAAGGTATGAACCTGGAGGACATTGGAGAAAGGTTATGGAAGA
AGATTGCCTTGAAGACATGAAGTTCGAAAATGTCAAGGATATCGAACCACGGCCAAGCTTCATGTCGTACCCGGATGATATCAAAGACCAACATTTTGCAAAAGATATAG
AACCACGACCGAGTCTCACATTTTATCCCAATAATAATGTCAAAGCCAAACTTTTTATTAAAGATATTGAACCACGACCAACTGGATCGTTCTATCCAAATGATGTAAAC
GTCAAGTTTTTTATTGAAGATATTGAACCACGGCCAAGCTTCACGTCTTACCCAAATGATGATATCAAAGACGAACTACTTACGAAAGATATAGAGCCACGAACAAGTGC
CACATTTAATACTCGCGATATGGACCGAAAAGTTTCCTCTACTGATGATCGCCATGGCGACGCTGACATACAGATGATCACAGAAGAGTCAAGACTTGAGCCAAGGAGAA
TTGAGACTAAATTAGAAGATTCATCCACGGAAGAAACTTATTCTAAAAGTGATCAAAACCCTTCCAATGGTGAAATTATCAAACCTCAACAAAATATTCTCAATCTTGAC
TCCAATGCTAAGGATGAAAAACCTCATAAGCCTAGCATCATCAAAATTTACCCGCGGGATAAAGATGCAAAGCCTCGACCTAGCATTCTCGATCTTTACAACTATGTAAA
GGATGTTGAGCACCAACAAGACATTCTCGATCTTTACAACTATGCAAAGGATGTTCTCAAGTCGCAACCAAGCATCATTGACCTTTATCACTATGCAAAGAATTTCAAGC
CACAACCAAACATTTTAGACCTTTACAACGCCAAAGATGTCAAGCCGCAATCAAACATCGTCGACTTTTACCACTATGTCATGGATGTCAAGCTCCAACCAAGCATCATC
GACCATTACCACTATGCAAAGGATTTCAAGTCCCATCCAAGCATCGTCGACCATTACAACTATGTTGAGGATGTCAAGCCACAACCAAACACTGTCGACCTTTACCACTA
TGTAAAGGATGTCAAGCCCCAACCAAGTATCATCGACCTTTACAACTATGCAAACGATGCCAAGCCACAACCAAGCATCGTCGACCACAACAACTATGCCAAGGATGTCG
AGCCACAACCAAGCATCGTCGATCTTTACCACTATGCAAAGGATGTCAATCCCCAACCTAGCATCATCGGCCTTTACAACTATGCAAAGGATGCCAAGCCACAACCAAGT
ATCGTCGATCTTTACCACTATGCAAAGGATGTCAAGCCTCAACCAAGCACCATCGACCTTTATAACTATGCAAATGATGCTAAGCCGCAACCAAACACCGTCAACCTTTA
TCCTTATGCCAAGGATGTTGAGCTGCAATCAAGCATAATCGAGCTTTTCAACTATGTCAAGGATGCAAACAAGGAAAAAGGTGAAAACCAAGAAGAGCTAGTTGAGGCTG
TTGAAAAGGGTAACATATAG
Protein sequenceShow/hide protein sequence
MKIIPTLGITLVLLHLLVNNIESRYEPGGHWRKVMEEDCLEDMKFENVKDIEPRPSFMSYPDDIKDQHFAKDIEPRPSLTFYPNNNVKAKLFIKDIEPRPTGSFYPNDVN
VKFFIEDIEPRPSFTSYPNDDIKDELLTKDIEPRTSATFNTRDMDRKVSSTDDRHGDADIQMITEESRLEPRRIETKLEDSSTEETYSKSDQNPSNGEIIKPQQNILNLD
SNAKDEKPHKPSIIKIYPRDKDAKPRPSILDLYNYVKDVEHQQDILDLYNYAKDVLKSQPSIIDLYHYAKNFKPQPNILDLYNAKDVKPQSNIVDFYHYVMDVKLQPSII
DHYHYAKDFKSHPSIVDHYNYVEDVKPQPNTVDLYHYVKDVKPQPSIIDLYNYANDAKPQPSIVDHNNYAKDVEPQPSIVDLYHYAKDVNPQPSIIGLYNYAKDAKPQPS
IVDLYHYAKDVKPQPSTIDLYNYANDAKPQPNTVNLYPYAKDVELQSSIIELFNYVKDANKEKGENQEELVEAVEKGNI