; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0015983 (gene) of Chayote v1 genome

Gene IDSed0015983
OrganismSechium edule (Chayote v1)
DescriptionProtein Ycf2-like
Genome locationLG09:9411673..9418981
RNA-Seq ExpressionSed0015983
SyntenySed0015983
Gene Ontology termsNA
InterPro domainsIPR015410 - Domain of unknown function DUF1985


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0047596.1 protein Ycf2-like [Cucumis melo var. makuwa]9.3e-5738.85Show/hide
Query:  MKASRRSRAIKINMCCQSGAMATIEKNLGDQHIAKFRKTCFGHLLDFSAKRVSSQLLLHIIQRQCNPKTYPELTFKIGGKLLTFGLREFAIITGLNCGQR
        M + RR+  +KIN+  +S  +  I++NLGD+ I +FR+  FGH L+ S    SSQLLLH+IQR C PK+  +L F IGG++L FGLREFA+ITGL C + 
Subjt:  MKASRRSRAIKINMCCQSGAMATIEKNLGDQHIAKFRKTCFGHLLDFSAKRVSSQLLLHIIQRQCNPKTYPELTFKIGGKLLTFGLREFAIITGLNCGQR

Query:  VKINTTSIKDGGRLRKDYFEEDKVIKRFVLNLAFRVNRRAPEDDMVKMSLLYWPESFLLAKQDKVVIEDDHLLMIDDLNLFNNYPWGQIAYQLLVSNVHN
          IN   I  GGRL+  YFE  K + R  LN+ F ++    +DD +KM+ LY+ ESFL+ KQ+   ++ DH++M+DD  +F+ YPWG++A++LLV  ++ 
Subjt:  VKINTTSIKDGGRLRKDYFEEDKVIKRFVLNLAFRVNRRAPEDDMVKMSLLYWPESFLLAKQDKVVIEDDHLLMIDDLNLFNNYPWGQIAYQLLVSNVHN

Query:  AGVEQGNCIVGVAGFVYALLVWAYEL-------------------------------EWRELLTNVFEAPS--INDIVATDEEIAMPYFAPFLNEE
            +G   + + GF++ +L WAYE+                               +W++L   VF++P+  ++ ++AT +E+ MP+FAPF+  E
Subjt:  AGVEQGNCIVGVAGFVYALLVWAYEL-------------------------------EWRELLTNVFEAPS--INDIVATDEEIAMPYFAPFLNEE

KGN48800.2 hypothetical protein Csa_003918 [Cucumis sativus]2.0e-4334.14Show/hide
Query:  KINMCCQSGAMATIEKNLGDQHIAKFRKTCFGHLLDFSAKRVSSQLLLHIIQRQCNPKTYPELTFKIGGKLLTFGLREFAIITGLNCGQRVKINTTSIKD
        +IN+  +   ++ I+  L ++ + KF+K+CFG+ LD    + SSQL  H+I+RQC  K   EL F + G++  FG+++FA+ITGLNCG+   I+ + I+ 
Subjt:  KINMCCQSGAMATIEKNLGDQHIAKFRKTCFGHLLDFSAKRVSSQLLLHIIQRQCNPKTYPELTFKIGGKLLTFGLREFAIITGLNCGQRVKINTTSIKD

Query:  GGRLRKDYFEEDKVIKRFVLNLAFRVNRRAPEDDMVKMSLLYWPESFLLAKQDKVVIEDDHLLMIDDLNLFNNYPWGQIAYQLLVSNVHNAGVEQGNCIV
         G+  K YF  +K I+R  L+  F    +    D+VKM+ LY  E F+L KQ +  I  ++ L+IDD   F++YPWG+I+Y++ V  V  +        +
Subjt:  GGRLRKDYFEEDKVIKRFVLNLAFRVNRRAPEDDMVKMSLLYWPESFLLAKQDKVVIEDDHLLMIDDLNLFNNYPWGQIAYQLLVSNVHNAGVEQGNCIV

Query:  GVAGFVYALLVWAYEL-------------------------------EWRELLTNVF--EAPSINDIVATDEEIAMPYFAPF-----LNEELCS--GKLH
        GV GF YALLVWAYE                                EW++L   VF  EA  +  ++AT  E+ MPY  PF      NE+  S   + H
Subjt:  GVAGFVYALLVWAYEL-------------------------------EWRELLTNVF--EAPSINDIVATDEEIAMPYFAPF-----LNEELCS--GKLH

Query:  NNAPHASVDQKYETPKEGNKVDK-GTTNVPF
        N+    S ++ +   K    V K G  N  F
Subjt:  NNAPHASVDQKYETPKEGNKVDK-GTTNVPF

XP_031743197.1 uncharacterized protein LOC101221625 isoform X9 [Cucumis sativus]2.0e-4334.14Show/hide
Query:  KINMCCQSGAMATIEKNLGDQHIAKFRKTCFGHLLDFSAKRVSSQLLLHIIQRQCNPKTYPELTFKIGGKLLTFGLREFAIITGLNCGQRVKINTTSIKD
        +IN+  +   ++ I+  L ++ + KF+K+CFG+ LD    + SSQL  H+I+RQC  K   EL F + G++  FG+++FA+ITGLNCG+   I+ + I+ 
Subjt:  KINMCCQSGAMATIEKNLGDQHIAKFRKTCFGHLLDFSAKRVSSQLLLHIIQRQCNPKTYPELTFKIGGKLLTFGLREFAIITGLNCGQRVKINTTSIKD

Query:  GGRLRKDYFEEDKVIKRFVLNLAFRVNRRAPEDDMVKMSLLYWPESFLLAKQDKVVIEDDHLLMIDDLNLFNNYPWGQIAYQLLVSNVHNAGVEQGNCIV
         G+  K YF  +K I+R  L+  F    +    D+VKM+ LY  E F+L KQ +  I  ++ L+IDD   F++YPWG+I+Y++ V  V  +        +
Subjt:  GGRLRKDYFEEDKVIKRFVLNLAFRVNRRAPEDDMVKMSLLYWPESFLLAKQDKVVIEDDHLLMIDDLNLFNNYPWGQIAYQLLVSNVHNAGVEQGNCIV

Query:  GVAGFVYALLVWAYEL-------------------------------EWRELLTNVF--EAPSINDIVATDEEIAMPYFAPF-----LNEELCS--GKLH
        GV GF YALLVWAYE                                EW++L   VF  EA  +  ++AT  E+ MPY  PF      NE+  S   + H
Subjt:  GVAGFVYALLVWAYEL-------------------------------EWRELLTNVF--EAPSINDIVATDEEIAMPYFAPF-----LNEELCS--GKLH

Query:  NNAPHASVDQKYETPKEGNKVDK-GTTNVPF
        N+    S ++ +   K    V K G  N  F
Subjt:  NNAPHASVDQKYETPKEGNKVDK-GTTNVPF

XP_031743205.1 uncharacterized protein LOC101221625 isoform X17 [Cucumis sativus]2.0e-4334.14Show/hide
Query:  KINMCCQSGAMATIEKNLGDQHIAKFRKTCFGHLLDFSAKRVSSQLLLHIIQRQCNPKTYPELTFKIGGKLLTFGLREFAIITGLNCGQRVKINTTSIKD
        +IN+  +   ++ I+  L ++ + KF+K+CFG+ LD    + SSQL  H+I+RQC  K   EL F + G++  FG+++FA+ITGLNCG+   I+ + I+ 
Subjt:  KINMCCQSGAMATIEKNLGDQHIAKFRKTCFGHLLDFSAKRVSSQLLLHIIQRQCNPKTYPELTFKIGGKLLTFGLREFAIITGLNCGQRVKINTTSIKD

Query:  GGRLRKDYFEEDKVIKRFVLNLAFRVNRRAPEDDMVKMSLLYWPESFLLAKQDKVVIEDDHLLMIDDLNLFNNYPWGQIAYQLLVSNVHNAGVEQGNCIV
         G+  K YF  +K I+R  L+  F    +    D+VKM+ LY  E F+L KQ +  I  ++ L+IDD   F++YPWG+I+Y++ V  V  +        +
Subjt:  GGRLRKDYFEEDKVIKRFVLNLAFRVNRRAPEDDMVKMSLLYWPESFLLAKQDKVVIEDDHLLMIDDLNLFNNYPWGQIAYQLLVSNVHNAGVEQGNCIV

Query:  GVAGFVYALLVWAYEL-------------------------------EWRELLTNVF--EAPSINDIVATDEEIAMPYFAPF-----LNEELCS--GKLH
        GV GF YALLVWAYE                                EW++L   VF  EA  +  ++AT  E+ MPY  PF      NE+  S   + H
Subjt:  GVAGFVYALLVWAYEL-------------------------------EWRELLTNVF--EAPSINDIVATDEEIAMPYFAPF-----LNEELCS--GKLH

Query:  NNAPHASVDQKYETPKEGNKVDK-GTTNVPF
        N+    S ++ +   K    V K G  N  F
Subjt:  NNAPHASVDQKYETPKEGNKVDK-GTTNVPF

XP_031743208.1 uncharacterized protein LOC101221625 isoform X20 [Cucumis sativus]2.0e-4334.14Show/hide
Query:  KINMCCQSGAMATIEKNLGDQHIAKFRKTCFGHLLDFSAKRVSSQLLLHIIQRQCNPKTYPELTFKIGGKLLTFGLREFAIITGLNCGQRVKINTTSIKD
        +IN+  +   ++ I+  L ++ + KF+K+CFG+ LD    + SSQL  H+I+RQC  K   EL F + G++  FG+++FA+ITGLNCG+   I+ + I+ 
Subjt:  KINMCCQSGAMATIEKNLGDQHIAKFRKTCFGHLLDFSAKRVSSQLLLHIIQRQCNPKTYPELTFKIGGKLLTFGLREFAIITGLNCGQRVKINTTSIKD

Query:  GGRLRKDYFEEDKVIKRFVLNLAFRVNRRAPEDDMVKMSLLYWPESFLLAKQDKVVIEDDHLLMIDDLNLFNNYPWGQIAYQLLVSNVHNAGVEQGNCIV
         G+  K YF  +K I+R  L+  F    +    D+VKM+ LY  E F+L KQ +  I  ++ L+IDD   F++YPWG+I+Y++ V  V  +        +
Subjt:  GGRLRKDYFEEDKVIKRFVLNLAFRVNRRAPEDDMVKMSLLYWPESFLLAKQDKVVIEDDHLLMIDDLNLFNNYPWGQIAYQLLVSNVHNAGVEQGNCIV

Query:  GVAGFVYALLVWAYEL-------------------------------EWRELLTNVF--EAPSINDIVATDEEIAMPYFAPF-----LNEELCS--GKLH
        GV GF YALLVWAYE                                EW++L   VF  EA  +  ++AT  E+ MPY  PF      NE+  S   + H
Subjt:  GVAGFVYALLVWAYEL-------------------------------EWRELLTNVF--EAPSINDIVATDEEIAMPYFAPF-----LNEELCS--GKLH

Query:  NNAPHASVDQKYETPKEGNKVDK-GTTNVPF
        N+    S ++ +   K    V K G  N  F
Subjt:  NNAPHASVDQKYETPKEGNKVDK-GTTNVPF

TrEMBL top hitse value%identityAlignment
A0A0A0KI50 TF-B3 domain-containing protein9.7e-4434.14Show/hide
Query:  KINMCCQSGAMATIEKNLGDQHIAKFRKTCFGHLLDFSAKRVSSQLLLHIIQRQCNPKTYPELTFKIGGKLLTFGLREFAIITGLNCGQRVKINTTSIKD
        +IN+  +   ++ I+  L ++ + KF+K+CFG+ LD    + SSQL  H+I+RQC  K   EL F + G++  FG+++FA+ITGLNCG+   I+ + I+ 
Subjt:  KINMCCQSGAMATIEKNLGDQHIAKFRKTCFGHLLDFSAKRVSSQLLLHIIQRQCNPKTYPELTFKIGGKLLTFGLREFAIITGLNCGQRVKINTTSIKD

Query:  GGRLRKDYFEEDKVIKRFVLNLAFRVNRRAPEDDMVKMSLLYWPESFLLAKQDKVVIEDDHLLMIDDLNLFNNYPWGQIAYQLLVSNVHNAGVEQGNCIV
         G+  K YF  +K I+R  L+  F    +    D+VKM+ LY  E F+L KQ +  I  ++ L+IDD   F++YPWG+I+Y++ V  V  +        +
Subjt:  GGRLRKDYFEEDKVIKRFVLNLAFRVNRRAPEDDMVKMSLLYWPESFLLAKQDKVVIEDDHLLMIDDLNLFNNYPWGQIAYQLLVSNVHNAGVEQGNCIV

Query:  GVAGFVYALLVWAYEL-------------------------------EWRELLTNVF--EAPSINDIVATDEEIAMPYFAPF-----LNEELCS--GKLH
        GV GF YALLVWAYE                                EW++L   VF  EA  +  ++AT  E+ MPY  PF      NE+  S   + H
Subjt:  GVAGFVYALLVWAYEL-------------------------------EWRELLTNVF--EAPSINDIVATDEEIAMPYFAPF-----LNEELCS--GKLH

Query:  NNAPHASVDQKYETPKEGNKVDK-GTTNVPF
        N+    S ++ +   K    V K G  N  F
Subjt:  NNAPHASVDQKYETPKEGNKVDK-GTTNVPF

A0A1S3B065 uncharacterized protein LOC103484737 isoform X41.1e-4233.55Show/hide
Query:  KINMCCQSGAMATIEKNLGDQHIAKFRKTCFGHLLDFSAKRVSSQLLLHIIQRQCNPKTYPELTFKIGGKLLTFGLREFAIITGLNCGQRVKINTTSIKD
        +IN+  +   ++ I+  L ++ + KF+K+CFG+ LD    + SSQL  H+I+RQC  K   EL F + G++  FG+++FA+ITGLNCG+   I+ + I+ 
Subjt:  KINMCCQSGAMATIEKNLGDQHIAKFRKTCFGHLLDFSAKRVSSQLLLHIIQRQCNPKTYPELTFKIGGKLLTFGLREFAIITGLNCGQRVKINTTSIKD

Query:  GGRLRKDYFEEDKVIKRFVLNLAFRVNRRAPEDDMVKMSLLYWPESFLLAKQDKVVIEDDHLLMIDDLNLFNNYPWGQIAYQLLVSNVHNAGVEQGNCIV
         G+  K YF  +K I+R  L+  F    +    D+VKM+ LY  E F+L KQ +  I  ++ L+IDD   F++YPWG+I+Y++ +  V  A        +
Subjt:  GGRLRKDYFEEDKVIKRFVLNLAFRVNRRAPEDDMVKMSLLYWPESFLLAKQDKVVIEDDHLLMIDDLNLFNNYPWGQIAYQLLVSNVHNAGVEQGNCIV

Query:  GVAGFVYALLVWAYEL-------------------------------EWRELLTNVF--EAPSINDIVATDEEIAMPYFAPFLNEELCSGKLHNNAPHAS
        GV GF +AL VWAYE                                EW++L   VF  EA  +  ++AT+ E+ M Y  PF       GK  N      
Subjt:  GVAGFVYALLVWAYEL-------------------------------EWRELLTNVF--EAPSINDIVATDEEIAMPYFAPFLNEELCSGKLHNNAPHAS

Query:  VDQKYET
        +DQ++ +
Subjt:  VDQKYET

A0A1S3B0L9 uncharacterized protein LOC103484737 isoform X51.1e-4233.55Show/hide
Query:  KINMCCQSGAMATIEKNLGDQHIAKFRKTCFGHLLDFSAKRVSSQLLLHIIQRQCNPKTYPELTFKIGGKLLTFGLREFAIITGLNCGQRVKINTTSIKD
        +IN+  +   ++ I+  L ++ + KF+K+CFG+ LD    + SSQL  H+I+RQC  K   EL F + G++  FG+++FA+ITGLNCG+   I+ + I+ 
Subjt:  KINMCCQSGAMATIEKNLGDQHIAKFRKTCFGHLLDFSAKRVSSQLLLHIIQRQCNPKTYPELTFKIGGKLLTFGLREFAIITGLNCGQRVKINTTSIKD

Query:  GGRLRKDYFEEDKVIKRFVLNLAFRVNRRAPEDDMVKMSLLYWPESFLLAKQDKVVIEDDHLLMIDDLNLFNNYPWGQIAYQLLVSNVHNAGVEQGNCIV
         G+  K YF  +K I+R  L+  F    +    D+VKM+ LY  E F+L KQ +  I  ++ L+IDD   F++YPWG+I+Y++ +  V  A        +
Subjt:  GGRLRKDYFEEDKVIKRFVLNLAFRVNRRAPEDDMVKMSLLYWPESFLLAKQDKVVIEDDHLLMIDDLNLFNNYPWGQIAYQLLVSNVHNAGVEQGNCIV

Query:  GVAGFVYALLVWAYEL-------------------------------EWRELLTNVF--EAPSINDIVATDEEIAMPYFAPFLNEELCSGKLHNNAPHAS
        GV GF +AL VWAYE                                EW++L   VF  EA  +  ++AT+ E+ M Y  PF       GK  N      
Subjt:  GVAGFVYALLVWAYEL-------------------------------EWRELLTNVF--EAPSINDIVATDEEIAMPYFAPFLNEELCSGKLHNNAPHAS

Query:  VDQKYET
        +DQ++ +
Subjt:  VDQKYET

A0A1S3B181 uncharacterized protein LOC103484737 isoform X71.1e-4233.55Show/hide
Query:  KINMCCQSGAMATIEKNLGDQHIAKFRKTCFGHLLDFSAKRVSSQLLLHIIQRQCNPKTYPELTFKIGGKLLTFGLREFAIITGLNCGQRVKINTTSIKD
        +IN+  +   ++ I+  L ++ + KF+K+CFG+ LD    + SSQL  H+I+RQC  K   EL F + G++  FG+++FA+ITGLNCG+   I+ + I+ 
Subjt:  KINMCCQSGAMATIEKNLGDQHIAKFRKTCFGHLLDFSAKRVSSQLLLHIIQRQCNPKTYPELTFKIGGKLLTFGLREFAIITGLNCGQRVKINTTSIKD

Query:  GGRLRKDYFEEDKVIKRFVLNLAFRVNRRAPEDDMVKMSLLYWPESFLLAKQDKVVIEDDHLLMIDDLNLFNNYPWGQIAYQLLVSNVHNAGVEQGNCIV
         G+  K YF  +K I+R  L+  F    +    D+VKM+ LY  E F+L KQ +  I  ++ L+IDD   F++YPWG+I+Y++ +  V  A        +
Subjt:  GGRLRKDYFEEDKVIKRFVLNLAFRVNRRAPEDDMVKMSLLYWPESFLLAKQDKVVIEDDHLLMIDDLNLFNNYPWGQIAYQLLVSNVHNAGVEQGNCIV

Query:  GVAGFVYALLVWAYEL-------------------------------EWRELLTNVF--EAPSINDIVATDEEIAMPYFAPFLNEELCSGKLHNNAPHAS
        GV GF +AL VWAYE                                EW++L   VF  EA  +  ++AT+ E+ M Y  PF       GK  N      
Subjt:  GVAGFVYALLVWAYEL-------------------------------EWRELLTNVF--EAPSINDIVATDEEIAMPYFAPFLNEELCSGKLHNNAPHAS

Query:  VDQKYET
        +DQ++ +
Subjt:  VDQKYET

A0A5A7U047 Protein Ycf2-like4.5e-5738.85Show/hide
Query:  MKASRRSRAIKINMCCQSGAMATIEKNLGDQHIAKFRKTCFGHLLDFSAKRVSSQLLLHIIQRQCNPKTYPELTFKIGGKLLTFGLREFAIITGLNCGQR
        M + RR+  +KIN+  +S  +  I++NLGD+ I +FR+  FGH L+ S    SSQLLLH+IQR C PK+  +L F IGG++L FGLREFA+ITGL C + 
Subjt:  MKASRRSRAIKINMCCQSGAMATIEKNLGDQHIAKFRKTCFGHLLDFSAKRVSSQLLLHIIQRQCNPKTYPELTFKIGGKLLTFGLREFAIITGLNCGQR

Query:  VKINTTSIKDGGRLRKDYFEEDKVIKRFVLNLAFRVNRRAPEDDMVKMSLLYWPESFLLAKQDKVVIEDDHLLMIDDLNLFNNYPWGQIAYQLLVSNVHN
          IN   I  GGRL+  YFE  K + R  LN+ F ++    +DD +KM+ LY+ ESFL+ KQ+   ++ DH++M+DD  +F+ YPWG++A++LLV  ++ 
Subjt:  VKINTTSIKDGGRLRKDYFEEDKVIKRFVLNLAFRVNRRAPEDDMVKMSLLYWPESFLLAKQDKVVIEDDHLLMIDDLNLFNNYPWGQIAYQLLVSNVHN

Query:  AGVEQGNCIVGVAGFVYALLVWAYEL-------------------------------EWRELLTNVFEAPS--INDIVATDEEIAMPYFAPFLNEE
            +G   + + GF++ +L WAYE+                               +W++L   VF++P+  ++ ++AT +E+ MP+FAPF+  E
Subjt:  AGVEQGNCIVGVAGFVYALLVWAYEL-------------------------------EWRELLTNVFEAPS--INDIVATDEEIAMPYFAPFLNEE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G31150.1 Domain of unknown function (DUF1985)6.9e-1026.9Show/hide
Query:  KINMCCQSGAMATIEKNL-GDQHIAKFRKTCFGHLLDFSAKRVS-SQLLLH-IIQRQCNPKTYPELTFKIGGKLLTFGLREFAIITGLNCGQRVKINTTS
        ++N+  +   + TI   L G +   + + + FG L +F   R S S  L+H ++ RQ   K   EL F  GG  + F +REF I+TGL CG+        
Subjt:  KINMCCQSGAMATIEKNL-GDQHIAKFRKTCFGHLLDFSAKRVS-SQLLLH-IIQRQCNPKTYPELTFKIGGKLLTFGLREFAIITGLNCGQRVKINTTS

Query:  IKDGGRLRKDYFEEDKVIKRFVLNLAFRVNRRAPEDDMVKM----SLLYWPESFL---------LAKQDKVVIEDDHLLMIDDLNLFNNYPWGQIAY
        +     ++K   ++ K +   V N  F   R     D+++M     L  W +  L         +   D+  +  D + M++D++ F  YPWG+ A+
Subjt:  IKDGGRLRKDYFEEDKVIKRFVLNLAFRVNRRAPEDDMVKM----SLLYWPESFL---------LAKQDKVVIEDDHLLMIDDLNLFNNYPWGQIAY

AT2G07240.1 cysteine-type peptidases;cysteine-type peptidases2.5e-0730.93Show/hide
Query:  ESFLLAKQDKVVIEDDHLLMIDDLNLFNNYPWGQIAYQLLVSNVHNAGVEQ-GNCIVGVAGFVYALLVWAYELEWRELLTNVFEAPSINDIVATDEE
        + FLL       I  DH  M +DL  F +YPWG+++++++++++    VEQ     V V G +YAL     +L   E +  + E P I+++V +D +
Subjt:  ESFLLAKQDKVVIEDDHLLMIDDLNLFNNYPWGQIAYQLLVSNVHNAGVEQ-GNCIVGVAGFVYALLVWAYELEWRELLTNVFEAPSINDIVATDEE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAGGCCTCCCGTAGGTCTAGAGCCATCAAGATTAACATGTGCTGCCAGAGTGGTGCAATGGCAACTATAGAAAAGAACTTGGGTGATCAACACATAGCGAAGTTTAG
AAAGACCTGCTTTGGGCACCTACTAGATTTCTCAGCTAAACGAGTATCATCCCAACTGTTGTTGCACATAATCCAACGACAATGTAACCCCAAGACGTACCCAGAGCTAA
CCTTTAAAATTGGTGGAAAATTGTTGACTTTCGGACTGCGAGAGTTTGCTATTATTACAGGCCTGAATTGTGGGCAACGGGTGAAAATAAACACAACGTCAATCAAGGAT
GGGGGGCGCCTCCGAAAAGACTACTTCGAGGAGGATAAAGTGATTAAGAGGTTTGTGTTGAACTTGGCCTTCAGGGTGAATAGGAGAGCTCCTGAAGATGACATGGTGAA
AATGTCATTGTTGTATTGGCCTGAGAGTTTTTTATTAGCAAAACAGGACAAGGTTGTAATTGAGGATGATCACTTGCTAATGATTGACGACCTCAACCTGTTCAATAATT
ATCCATGGGGGCAGATCGCATATCAGTTGCTAGTATCAAATGTACACAATGCTGGAGTCGAACAAGGGAACTGTATTGTCGGAGTGGCTGGATTTGTCTATGCACTATTG
GTTTGGGCGTATGAGCTAGAATGGCGAGAGCTACTAACAAATGTCTTTGAGGCCCCTTCTATAAATGATATAGTGGCAACAGATGAGGAGATCGCGATGCCATATTTTGC
CCCATTCTTGAATGAGGAGTTATGCAGTGGTAAGCTGCACAACAATGCCCCCCACGCATCAGTTGATCAGAAGTACGAGACCCCGAAAGAGGGCAACAAGGTGGATAAAG
GGACAACGAATGTCCCTTTCATCCCACATCCCGTCAAAGATAATGGTGATTGGTGTAAGTTCCATGAGAGAAGAATTGTTTGCAATTGGAACCGAATGCACAGTAATCTG
GATGGTACTCCACCCTTTTGCTTCTACGTGTTTGGCTCGTTAATCGGAGATATGAAGCTCGAAAAATCTTGGCACGGTAGAGGTGATGGAGATTTTTTTATCGACTACAA
TTCGAATGGAATCTGGAATGCGGATAGAAGATAG
mRNA sequenceShow/hide mRNA sequence
GAAAGGTCAAACCTGTTGAAGCATTGGAGGGCCCCGATCTGGACGATGACTATTTTATGAAGGCCTCCCGTAGGTCTAGAGCCATCAAGATTAACATGTGCTGCCAGAGT
GGTGCAATGGCAACTATAGAAAAGAACTTGGGTGATCAACACATAGCGAAGTTTAGAAAGACCTGCTTTGGGCACCTACTAGATTTCTCAGCTAAACGAGTATCATCCCA
ACTGTTGTTGCACATAATCCAACGACAATGTAACCCCAAGACGTACCCAGAGCTAACCTTTAAAATTGGTGGAAAATTGTTGACTTTCGGACTGCGAGAGTTTGCTATTA
TTACAGGCCTGAATTGTGGGCAACGGGTGAAAATAAACACAACGTCAATCAAGGATGGGGGGCGCCTCCGAAAAGACTACTTCGAGGAGGATAAAGTGATTAAGAGGTTT
GTGTTGAACTTGGCCTTCAGGGTGAATAGGAGAGCTCCTGAAGATGACATGGTGAAAATGTCATTGTTGTATTGGCCTGAGAGTTTTTTATTAGCAAAACAGGACAAGGT
TGTAATTGAGGATGATCACTTGCTAATGATTGACGACCTCAACCTGTTCAATAATTATCCATGGGGGCAGATCGCATATCAGTTGCTAGTATCAAATGTACACAATGCTG
GAGTCGAACAAGGGAACTGTATTGTCGGAGTGGCTGGATTTGTCTATGCACTATTGGTTTGGGCGTATGAGCTAGAATGGCGAGAGCTACTAACAAATGTCTTTGAGGCC
CCTTCTATAAATGATATAGTGGCAACAGATGAGGAGATCGCGATGCCATATTTTGCCCCATTCTTGAATGAGGAGTTATGCAGTGGTAAGCTGCACAACAATGCCCCCCA
CGCATCAGTTGATCAGAAGTACGAGACCCCGAAAGAGGGCAACAAGGTGGATAAAGGGACAACGAATGTCCCTTTCATCCCACATCCCGTCAAAGATAATGGTGATTGGT
GTAAGTTCCATGAGAGAAGAATTGTTTGCAATTGGAACCGAATGCACAGTAATCTGGATGGTACTCCACCCTTTTGCTTCTACGTGTTTGGCTCGTTAATCGGAGATATG
AAGCTCGAAAAATCTTGGCACGGTAGAGGTGATGGAGATTTTTTTATCGACTACAATTCGAATGGAATCTGGAATGCGGATAGAAGATAG
Protein sequenceShow/hide protein sequence
MKASRRSRAIKINMCCQSGAMATIEKNLGDQHIAKFRKTCFGHLLDFSAKRVSSQLLLHIIQRQCNPKTYPELTFKIGGKLLTFGLREFAIITGLNCGQRVKINTTSIKD
GGRLRKDYFEEDKVIKRFVLNLAFRVNRRAPEDDMVKMSLLYWPESFLLAKQDKVVIEDDHLLMIDDLNLFNNYPWGQIAYQLLVSNVHNAGVEQGNCIVGVAGFVYALL
VWAYELEWRELLTNVFEAPSINDIVATDEEIAMPYFAPFLNEELCSGKLHNNAPHASVDQKYETPKEGNKVDKGTTNVPFIPHPVKDNGDWCKFHERRIVCNWNRMHSNL
DGTPPFCFYVFGSLIGDMKLEKSWHGRGDGDFFIDYNSNGIWNADRR