; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI06G37680 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI06G37680
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionYcf3-interacting protein 1
Genome locationChr6:30200052..30200780
RNA-Seq ExpressionCSPI06G37680
SyntenyCSPI06G37680
Gene Ontology termsGO:0048564 - photosystem I assembly (biological process)
GO:0080183 - response to photooxidative stress (biological process)
GO:0009535 - chloroplast thylakoid membrane (cellular component)
InterPro domainsIPR040340 - Chloroplast enhancing stress tolerance protein


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0052411.1 ycf3-interacting protein 1 [Cucumis melo var. makuwa]3.7e-11589.5Show/hide
Query:  ASAIVPLSVGTSSRRYEFVEDVVIEVSRQLSAPNSAYSSPRLAGKKKDRDGLNRSQSCGEGRGKAAPHGLIENKVMVWEKGDKHKTEEGKGRRFRCCGAL
        +  IVPLSV  SSRRYEFVEDVV+ VSRQLSAPNS YSSPRL GKKK RDGLNRS+SCG+GRGKAAPHGLIENK+M WE GDKHKTEEGKGRRF+ CGAL
Subjt:  ASAIVPLSVGTSSRRYEFVEDVVIEVSRQLSAPNSAYSSPRLAGKKKDRDGLNRSQSCGEGRGKAAPHGLIENKVMVWEKGDKHKTEEGKGRRFRCCGAL

Query:  CLLLPVLGFKVGKGRMKGKEEKREEAEEGECISISISRRVSLEKFECGSWASSGMVVHEDGESGSLYFDLPMELIRNSVSAQTQSPVGAAFVFNGRGVWN
        CLLLPVLGFKVGKGRMKGKEEK+EEAEEGECISISISRRVSLEKFECGSWASSGMVVHE+GESGSLYFDLPMELIRNSVSAQ+QSPVGAAFVF+G+GV N
Subjt:  CLLLPVLGFKVGKGRMKGKEEKREEAEEGECISISISRRVSLEKFECGSWASSGMVVHEDGESGSLYFDLPMELIRNSVSAQTQSPVGAAFVFNGRGVWN

Query:  KPKLAEESGAASPCIITPRLRKARQEFNALLEAHTHVL
        KPKLAEESGAASPCIITPRLRKARQEFNALLEAHT +L
Subjt:  KPKLAEESGAASPCIITPRLRKARQEFNALLEAHTHVL

XP_008439418.1 PREDICTED: uncharacterized protein LOC103484232 [Cucumis melo]1.1e-11488.7Show/hide
Query:  ASAIVPLSVGTSSRRYEFVEDVVIEVSRQLSAPNSAYSSPRLAGKKKDRDGLNRSQSCGEGRGKAAPHGLIENKVMVWEKGDKHKTEEGKGRRFRCCGAL
        +  IVPLSV  SSRRYEFVEDVV+ VSRQLSAPNS YSSPRL GKKK RDGLNRS+SCG+GRGKAAPHGLIENK+M WE GDKHKTEEGKGRRF+ CGAL
Subjt:  ASAIVPLSVGTSSRRYEFVEDVVIEVSRQLSAPNSAYSSPRLAGKKKDRDGLNRSQSCGEGRGKAAPHGLIENKVMVWEKGDKHKTEEGKGRRFRCCGAL

Query:  CLLLPVLGFKVGKGRMKGKEEKREEAEEGECISISISRRVSLEKFECGSWASSGMVVHEDGESGSLYFDLPMELIRNSVSAQTQSPVGAAFVFNGR-GVW
        CLLLPVLGFKVGKGRMKGKEEK+EEAEEGECISISISRRVSL+KFECGSWASSGMVVHE+GESGSLYFDLPMELIRNSVSAQ+QSPVGAAFVF+G+ GVW
Subjt:  CLLLPVLGFKVGKGRMKGKEEKREEAEEGECISISISRRVSLEKFECGSWASSGMVVHEDGESGSLYFDLPMELIRNSVSAQTQSPVGAAFVFNGR-GVW

Query:  NKPKLAEESGAASPCIITPRLRKARQEFNALLEAHTHVL
        NKPKLA+ESGAASPCIITPRLRKARQEFNALLEAHT +L
Subjt:  NKPKLAEESGAASPCIITPRLRKARQEFNALLEAHTHVL

XP_011658372.1 uncharacterized protein LOC105435976 [Cucumis sativus]1.0e-133100Show/hide
Query:  MPAMASAIVPLSVGTSSRRYEFVEDVVIEVSRQLSAPNSAYSSPRLAGKKKDRDGLNRSQSCGEGRGKAAPHGLIENKVMVWEKGDKHKTEEGKGRRFRC
        MPAMASAIVPLSVGTSSRRYEFVEDVVIEVSRQLSAPNSAYSSPRLAGKKKDRDGLNRSQSCGEGRGKAAPHGLIENKVMVWEKGDKHKTEEGKGRRFRC
Subjt:  MPAMASAIVPLSVGTSSRRYEFVEDVVIEVSRQLSAPNSAYSSPRLAGKKKDRDGLNRSQSCGEGRGKAAPHGLIENKVMVWEKGDKHKTEEGKGRRFRC

Query:  CGALCLLLPVLGFKVGKGRMKGKEEKREEAEEGECISISISRRVSLEKFECGSWASSGMVVHEDGESGSLYFDLPMELIRNSVSAQTQSPVGAAFVFNGR
        CGALCLLLPVLGFKVGKGRMKGKEEKREEAEEGECISISISRRVSLEKFECGSWASSGMVVHEDGESGSLYFDLPMELIRNSVSAQTQSPVGAAFVFNGR
Subjt:  CGALCLLLPVLGFKVGKGRMKGKEEKREEAEEGECISISISRRVSLEKFECGSWASSGMVVHEDGESGSLYFDLPMELIRNSVSAQTQSPVGAAFVFNGR

Query:  GVWNKPKLAEESGAASPCIITPRLRKARQEFNALLEAHTHVL
        GVWNKPKLAEESGAASPCIITPRLRKARQEFNALLEAHTHVL
Subjt:  GVWNKPKLAEESGAASPCIITPRLRKARQEFNALLEAHTHVL

XP_022925963.1 uncharacterized protein LOC111433224 [Cucurbita moschata]1.4e-9072.83Show/hide
Query:  MPAMASAIVPLSVGTSSRRYEFVEDVVIEVSRQL-----SAPNSAYSSPRL----AGKKKDRD---GLNRSQSCGEGRGKAAPHGLIENKVMVWEKGDKH
        MPA+A  + PLSVGT+ RRYE VEDVVIEVS Q      SAPNSAYSSP L    A KKK  D   GLNRS+SCGEGRGKA PHGLIEN+VM+WEKG KH
Subjt:  MPAMASAIVPLSVGTSSRRYEFVEDVVIEVSRQL-----SAPNSAYSSPRL----AGKKKDRD---GLNRSQSCGEGRGKAAPHGLIENKVMVWEKGDKH

Query:  KTEEGKGRRFRCCGALCLLLPV---LGFKVGKGRMKGKEEKREEAEEGECISISISR-RVSLEKFECGSWASSGMVVHEDGES----GSLYFDLPMELIR
        KTEEGK RRFR CGALCLLLPV   LGFKVGKG+ + KEE  E  E G CISISIS  RVSLEKFECGSWASSGMV HEDGES    GSLYFDLPMELIR
Subjt:  KTEEGKGRRFRCCGALCLLLPV---LGFKVGKGRMKGKEEKREEAEEGECISISISR-RVSLEKFECGSWASSGMVVHEDGES----GSLYFDLPMELIR

Query:  NSVSAQTQSPVGAAFVFNGRG-------VWNKPKLAEESGAASPCIITPRLRKARQEFNALLEAH
        NSV A+TQSP   AFVFN  G       VW K KLAEESGAASPC+ITPRLR+AR+EFNALLEAH
Subjt:  NSVSAQTQSPVGAAFVFNGRG-------VWNKPKLAEESGAASPCIITPRLRKARQEFNALLEAH

XP_038877520.1 uncharacterized protein LOC120069777 [Benincasa hispida]1.2e-10281.35Show/hide
Query:  MPAMASAIVPLSVGTSSRRYEFVEDVVIEVSRQL-----SAPNSAYSSPRLAGKKKDRD---GLNRSQSCGEGRGKAAPHGLIENKVMVWEKGDKHKTEE
        MPAMA  IVPLSVGT+SR YEFV+DVVIEVS QL     S PNSAYSSPRLA KKK  D   GLNRS+SCGEGRGKA PH LIENKVMVWEKG KHKT E
Subjt:  MPAMASAIVPLSVGTSSRRYEFVEDVVIEVSRQL-----SAPNSAYSSPRLAGKKKDRD---GLNRSQSCGEGRGKAAPHGLIENKVMVWEKGDKHKTEE

Query:  GKGRRFRCCGALCLLLPV---LGFKVGKGRMKGKEEKREEAEEGECISISISRRVSLEKFECGSWASSGMVVHEDGESGSLYFDLPMELIRNSVSAQTQS
        GK +RFR CGALCLLLPV   LGFKVGKG+ KGKEE++EEAEEGECISISISRRVSLEKFECGSWASSGMVVHEDGE GS YFDLPMELIRNSV  QTQS
Subjt:  GKGRRFRCCGALCLLLPV---LGFKVGKGRMKGKEEKREEAEEGECISISISRRVSLEKFECGSWASSGMVVHEDGESGSLYFDLPMELIRNSVSAQTQS

Query:  PVGAAFVFNGR--GVWNKPKLAEESGAASPCIITPRLRKARQEFNALLEAHT
        PVGAAFVF+     +W KP LAEESGAASPCIITPRLRKAR+EFNALLEAHT
Subjt:  PVGAAFVFNGR--GVWNKPKLAEESGAASPCIITPRLRKARQEFNALLEAHT

TrEMBL top hitse value%identityAlignment
A0A0A0KME4 Uncharacterized protein5.0e-134100Show/hide
Query:  MPAMASAIVPLSVGTSSRRYEFVEDVVIEVSRQLSAPNSAYSSPRLAGKKKDRDGLNRSQSCGEGRGKAAPHGLIENKVMVWEKGDKHKTEEGKGRRFRC
        MPAMASAIVPLSVGTSSRRYEFVEDVVIEVSRQLSAPNSAYSSPRLAGKKKDRDGLNRSQSCGEGRGKAAPHGLIENKVMVWEKGDKHKTEEGKGRRFRC
Subjt:  MPAMASAIVPLSVGTSSRRYEFVEDVVIEVSRQLSAPNSAYSSPRLAGKKKDRDGLNRSQSCGEGRGKAAPHGLIENKVMVWEKGDKHKTEEGKGRRFRC

Query:  CGALCLLLPVLGFKVGKGRMKGKEEKREEAEEGECISISISRRVSLEKFECGSWASSGMVVHEDGESGSLYFDLPMELIRNSVSAQTQSPVGAAFVFNGR
        CGALCLLLPVLGFKVGKGRMKGKEEKREEAEEGECISISISRRVSLEKFECGSWASSGMVVHEDGESGSLYFDLPMELIRNSVSAQTQSPVGAAFVFNGR
Subjt:  CGALCLLLPVLGFKVGKGRMKGKEEKREEAEEGECISISISRRVSLEKFECGSWASSGMVVHEDGESGSLYFDLPMELIRNSVSAQTQSPVGAAFVFNGR

Query:  GVWNKPKLAEESGAASPCIITPRLRKARQEFNALLEAHTHVL
        GVWNKPKLAEESGAASPCIITPRLRKARQEFNALLEAHTHVL
Subjt:  GVWNKPKLAEESGAASPCIITPRLRKARQEFNALLEAHTHVL

A0A1S3AZD3 uncharacterized protein LOC1034842325.2e-11588.7Show/hide
Query:  ASAIVPLSVGTSSRRYEFVEDVVIEVSRQLSAPNSAYSSPRLAGKKKDRDGLNRSQSCGEGRGKAAPHGLIENKVMVWEKGDKHKTEEGKGRRFRCCGAL
        +  IVPLSV  SSRRYEFVEDVV+ VSRQLSAPNS YSSPRL GKKK RDGLNRS+SCG+GRGKAAPHGLIENK+M WE GDKHKTEEGKGRRF+ CGAL
Subjt:  ASAIVPLSVGTSSRRYEFVEDVVIEVSRQLSAPNSAYSSPRLAGKKKDRDGLNRSQSCGEGRGKAAPHGLIENKVMVWEKGDKHKTEEGKGRRFRCCGAL

Query:  CLLLPVLGFKVGKGRMKGKEEKREEAEEGECISISISRRVSLEKFECGSWASSGMVVHEDGESGSLYFDLPMELIRNSVSAQTQSPVGAAFVFNGR-GVW
        CLLLPVLGFKVGKGRMKGKEEK+EEAEEGECISISISRRVSL+KFECGSWASSGMVVHE+GESGSLYFDLPMELIRNSVSAQ+QSPVGAAFVF+G+ GVW
Subjt:  CLLLPVLGFKVGKGRMKGKEEKREEAEEGECISISISRRVSLEKFECGSWASSGMVVHEDGESGSLYFDLPMELIRNSVSAQTQSPVGAAFVFNGR-GVW

Query:  NKPKLAEESGAASPCIITPRLRKARQEFNALLEAHTHVL
        NKPKLA+ESGAASPCIITPRLRKARQEFNALLEAHT +L
Subjt:  NKPKLAEESGAASPCIITPRLRKARQEFNALLEAHTHVL

A0A5A7UFW0 Ycf3-interacting protein 11.8e-11589.5Show/hide
Query:  ASAIVPLSVGTSSRRYEFVEDVVIEVSRQLSAPNSAYSSPRLAGKKKDRDGLNRSQSCGEGRGKAAPHGLIENKVMVWEKGDKHKTEEGKGRRFRCCGAL
        +  IVPLSV  SSRRYEFVEDVV+ VSRQLSAPNS YSSPRL GKKK RDGLNRS+SCG+GRGKAAPHGLIENK+M WE GDKHKTEEGKGRRF+ CGAL
Subjt:  ASAIVPLSVGTSSRRYEFVEDVVIEVSRQLSAPNSAYSSPRLAGKKKDRDGLNRSQSCGEGRGKAAPHGLIENKVMVWEKGDKHKTEEGKGRRFRCCGAL

Query:  CLLLPVLGFKVGKGRMKGKEEKREEAEEGECISISISRRVSLEKFECGSWASSGMVVHEDGESGSLYFDLPMELIRNSVSAQTQSPVGAAFVFNGRGVWN
        CLLLPVLGFKVGKGRMKGKEEK+EEAEEGECISISISRRVSLEKFECGSWASSGMVVHE+GESGSLYFDLPMELIRNSVSAQ+QSPVGAAFVF+G+GV N
Subjt:  CLLLPVLGFKVGKGRMKGKEEKREEAEEGECISISISRRVSLEKFECGSWASSGMVVHEDGESGSLYFDLPMELIRNSVSAQTQSPVGAAFVFNGRGVWN

Query:  KPKLAEESGAASPCIITPRLRKARQEFNALLEAHTHVL
        KPKLAEESGAASPCIITPRLRKARQEFNALLEAHT +L
Subjt:  KPKLAEESGAASPCIITPRLRKARQEFNALLEAHTHVL

A0A6J1EGQ7 uncharacterized protein LOC1114332246.8e-9172.83Show/hide
Query:  MPAMASAIVPLSVGTSSRRYEFVEDVVIEVSRQL-----SAPNSAYSSPRL----AGKKKDRD---GLNRSQSCGEGRGKAAPHGLIENKVMVWEKGDKH
        MPA+A  + PLSVGT+ RRYE VEDVVIEVS Q      SAPNSAYSSP L    A KKK  D   GLNRS+SCGEGRGKA PHGLIEN+VM+WEKG KH
Subjt:  MPAMASAIVPLSVGTSSRRYEFVEDVVIEVSRQL-----SAPNSAYSSPRL----AGKKKDRD---GLNRSQSCGEGRGKAAPHGLIENKVMVWEKGDKH

Query:  KTEEGKGRRFRCCGALCLLLPV---LGFKVGKGRMKGKEEKREEAEEGECISISISR-RVSLEKFECGSWASSGMVVHEDGES----GSLYFDLPMELIR
        KTEEGK RRFR CGALCLLLPV   LGFKVGKG+ + KEE  E  E G CISISIS  RVSLEKFECGSWASSGMV HEDGES    GSLYFDLPMELIR
Subjt:  KTEEGKGRRFRCCGALCLLLPV---LGFKVGKGRMKGKEEKREEAEEGECISISISR-RVSLEKFECGSWASSGMVVHEDGES----GSLYFDLPMELIR

Query:  NSVSAQTQSPVGAAFVFNGRG-------VWNKPKLAEESGAASPCIITPRLRKARQEFNALLEAH
        NSV A+TQSP   AFVFN  G       VW K KLAEESGAASPC+ITPRLR+AR+EFNALLEAH
Subjt:  NSVSAQTQSPVGAAFVFNGRG-------VWNKPKLAEESGAASPCIITPRLRKARQEFNALLEAH

A0A6J1ILL2 uncharacterized protein LOC1114785486.4e-8971.32Show/hide
Query:  MPAMASAIVPLSVGTSSRRYEFVEDVVIEVSRQL-----SAPNSAYSSPRL----AGKKKDRD---GLNRSQSCGEGRGKAAPHGLIENKVMVWEKGDKH
        MPA+A  + PLSVGT+ R YE VEDVVI+VS Q      SAPNSAYSSP L    A KKK  D   GLNRS+SCGEGRGKA PHGLI+N+VM+WEKG KH
Subjt:  MPAMASAIVPLSVGTSSRRYEFVEDVVIEVSRQL-----SAPNSAYSSPRL----AGKKKDRD---GLNRSQSCGEGRGKAAPHGLIENKVMVWEKGDKH

Query:  KTEEGKGRRFRCCGALCLLLPV---LGFKVGKGRMKGKEEKREEAEEGECISISISR-RVSLEKFECGSWASSGMVVHEDGES----GSLYFDLPMELIR
        KTEEGK RRFR CGALCLLLP+   LGFKVGKG+ + KEE  E  E G CISISIS  RVSLEKFECGSWASSGMV HEDGES    GSLYFDLPMELIR
Subjt:  KTEEGKGRRFRCCGALCLLLPV---LGFKVGKGRMKGKEEKREEAEEGECISISISR-RVSLEKFECGSWASSGMVVHEDGES----GSLYFDLPMELIR

Query:  NSVSAQTQSPVGAAFVFNGRG-------VWNKPKLAEESGAASPCIITPRLRKARQEFNALLEAH
        NSV A+TQSP  AAFVF+  G       VW K KLAEESGAASPC+ITPRLR+AR+EFNALLEAH
Subjt:  NSVSAQTQSPVGAAFVFNGRG-------VWNKPKLAEESGAASPCIITPRLRKARQEFNALLEAH

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G30850.1 root hair specific 42.5e-1332.4Show/hide
Query:  CGALCLLLPVLGFKVGKGRMKGKEEKREEAEEGECISIS------ISRRVSLEKFECGSWASSGMVVHEDGESGSLYFDLPMELIR-----NSVSAQTQS
        C A CL LP  GF  GK ++     KR+ + E + I  S      +S R SLEKFECGSWAS+  ++ ++G    L+FD P+E+ +      +     Q 
Subjt:  CGALCLLLPVLGFKVGKGRMKGKEEKREEAEEGECISIS------ISRRVSLEKFECGSWASSGMVVHEDGESGSLYFDLPMELIR-----NSVSAQTQS

Query:  PVGAAFVF--------------------NGRGVWNKPK-----LAEESGAASPC------IITPRLRKARQEFNALLEA
        PV + F+F                    + R   + P+         S A+  C       ITPRLRKAR +FN  L A
Subjt:  PVGAAFVF--------------------NGRGVWNKPK-----LAEESGAASPC------IITPRLRKARQEFNALLEA

AT2G34910.1 BEST Arabidopsis thaliana protein match is: root hair specific 4 (TAIR:AT1G30850.1)7.4e-1333.93Show/hide
Query:  CGALCLLLPVLGFKVGKGRMKGKEEKREEAEEGECISISISRRVSLEKFECGSWASSGMVVHEDGESGSLYFDLPMELIRNSVSAQTQSPVGAAFVFNG-
        C A CL LP  G +  +        K++  +     + ++S   SLEKFECGSWAS+  +  E+G    LY DLP+E+I+       Q PV + F F+  
Subjt:  CGALCLLLPVLGFKVGKGRMKGKEEKREEAEEGECISISISRRVSLEKFECGSWASSGMVVHEDGESGSLYFDLPMELIRNSVSAQTQSPVGAAFVFNG-

Query:  ------RGVWNKPK---------LAE--------------ESGAASP-CIITPRLRKARQEFNALLEA
              R V  K           LAE              +S  ASP   ITPRL KAR +FN  L A
Subjt:  ------RGVWNKPK---------LAE--------------ESGAASP-CIITPRLRKARQEFNALLEA

AT4G20190.1 unknown protein1.5e-1334.46Show/hide
Query:  CGALCLLLPVLGFKVGK---GRMKGKEEKREEAEEGECISIS----------ISRRVSLEKFECGSWASSGMVVHEDGESGSLYFDLPMELIRNSVSAQT
        C ALCL LP  GF  GK      KG              S++          +S R SLE+FECGSW SS M+  ++ + G  +FDLP ELI+       
Subjt:  CGALCLLLPVLGFKVGK---GRMKGKEEKREEAEEGECISIS----------ISRRVSLEKFECGSWASSGMVVHEDGESGSLYFDLPMELIRNSVSAQT

Query:  Q-SPVGAAFVFNG--------RGV----WNKPKLAEES------GAASPC--------IITPRLRKARQEFNALLEA
        Q  PV AAFVF+         +GV     +K + + ES        +SP          ITPRL +A ++F++ LEA
Subjt:  Q-SPVGAAFVFNG--------RGV----WNKPKLAEES------GAASPC--------IITPRLRKARQEFNALLEA

AT5G44660.1 unknown protein3.7e-1230.83Show/hide
Query:  QLSAPNSAYSSPR--------LAGKKKDRDGLN--RSQSCGEGRGKAAPHGLIENKVMVWEKGDKHKTEEGKGR---RFRCCGALCLLLPVLGFKVGKG-
        Q S PNS   SP+        L  K++D    +  RS+SCG    K   H     +   + K D +K+         RF+ C ALCL LP  GF  GK  
Subjt:  QLSAPNSAYSSPR--------LAGKKKDRDGLN--RSQSCGEGRGKAAPHGLIENKVMVWEKGDKHKTEEGKGR---RFRCCGALCLLLPVLGFKVGKG-

Query:  RMKGKEE---------------------KREEAEEGECISISISRRVSLEKFECGSWASSGMVVHEDGESGSLYFDLPMELIRN-SVSAQTQSPVGAAFV
        R   K++                     +     E    +  IS R S+EKF+CGS+ S         E G+ +FDLP ELI++ S       PV AAFV
Subjt:  RMKGKEE---------------------KREEAEEGECISISISRRVSLEKFECGSWASSGMVVHEDGESGSLYFDLPMELIRN-SVSAQTQSPVGAAFV

Query:  FNGRGV-----------WNKPKLAEES--------GAASPC------IITPRLRKARQEFNALLEA
        F+   V            +K + A ES          +SP        I+PRL +A + FNA LEA
Subjt:  FNGRGV-----------WNKPKLAEES--------GAASPC------IITPRLRKARQEFNALLEA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCAGCGATGGCCAGTGCAATAGTGCCTTTGTCTGTTGGCACCTCTAGCAGAAGGTACGAATTTGTTGAGGATGTGGTTATTGAGGTGTCAAGGCAATTGAGTGCCCC
GAACTCGGCCTATTCATCCCCTCGGTTGGCAGGAAAAAAGAAAGACCGTGATGGGCTGAATCGGAGCCAGTCCTGTGGTGAAGGAAGAGGGAAGGCAGCGCCGCATGGGC
TTATTGAGAATAAAGTAATGGTATGGGAGAAAGGGGATAAGCACAAAACAGAGGAGGGGAAAGGGAGGCGCTTCAGATGTTGTGGGGCGCTATGCTTGTTGTTACCAGTG
TTGGGGTTTAAGGTTGGGAAAGGGAGAATGAAAGGGAAGGAAGAGAAAAGGGAAGAGGCAGAGGAAGGCGAGTGTATATCCATATCCATATCGAGGAGAGTTTCTTTGGA
AAAATTCGAATGTGGATCGTGGGCTTCATCGGGGATGGTGGTTCATGAAGACGGGGAGTCGGGTAGCCTTTATTTTGATCTGCCAATGGAATTGATAAGGAACAGCGTGA
GTGCACAGACTCAATCACCAGTAGGAGCAGCTTTTGTATTTAATGGGAGGGGAGTTTGGAACAAACCAAAATTGGCAGAGGAATCAGGAGCTGCATCCCCATGCATCATT
ACCCCACGCTTGCGCAAAGCTAGACAAGAGTTCAATGCTCTTTTGGAAGCTCACACTCACGTTCTGTGA
mRNA sequenceShow/hide mRNA sequence
ATGCCAGCGATGGCCAGTGCAATAGTGCCTTTGTCTGTTGGCACCTCTAGCAGAAGGTACGAATTTGTTGAGGATGTGGTTATTGAGGTGTCAAGGCAATTGAGTGCCCC
GAACTCGGCCTATTCATCCCCTCGGTTGGCAGGAAAAAAGAAAGACCGTGATGGGCTGAATCGGAGCCAGTCCTGTGGTGAAGGAAGAGGGAAGGCAGCGCCGCATGGGC
TTATTGAGAATAAAGTAATGGTATGGGAGAAAGGGGATAAGCACAAAACAGAGGAGGGGAAAGGGAGGCGCTTCAGATGTTGTGGGGCGCTATGCTTGTTGTTACCAGTG
TTGGGGTTTAAGGTTGGGAAAGGGAGAATGAAAGGGAAGGAAGAGAAAAGGGAAGAGGCAGAGGAAGGCGAGTGTATATCCATATCCATATCGAGGAGAGTTTCTTTGGA
AAAATTCGAATGTGGATCGTGGGCTTCATCGGGGATGGTGGTTCATGAAGACGGGGAGTCGGGTAGCCTTTATTTTGATCTGCCAATGGAATTGATAAGGAACAGCGTGA
GTGCACAGACTCAATCACCAGTAGGAGCAGCTTTTGTATTTAATGGGAGGGGAGTTTGGAACAAACCAAAATTGGCAGAGGAATCAGGAGCTGCATCCCCATGCATCATT
ACCCCACGCTTGCGCAAAGCTAGACAAGAGTTCAATGCTCTTTTGGAAGCTCACACTCACGTTCTGTGA
Protein sequenceShow/hide protein sequence
MPAMASAIVPLSVGTSSRRYEFVEDVVIEVSRQLSAPNSAYSSPRLAGKKKDRDGLNRSQSCGEGRGKAAPHGLIENKVMVWEKGDKHKTEEGKGRRFRCCGALCLLLPV
LGFKVGKGRMKGKEEKREEAEEGECISISISRRVSLEKFECGSWASSGMVVHEDGESGSLYFDLPMELIRNSVSAQTQSPVGAAFVFNGRGVWNKPKLAEESGAASPCII
TPRLRKARQEFNALLEAHTHVL