CuGenDBv2

Chy2G022710 (gene) of Cucumber (hystrix) v1 genome

Gene ID: Chy2G022710
Organism: Cucumis hystrix (Cucumber (hystrix) v1)
Description: Ycf3-interacting protein 1
Genome location: chrH02:54465..55193
RNA-Seq Expression: Chy2G022710
Synteny: Chy2G022710
Gene Ontology terms:
  GO:0048564 - photosystem I assembly (biological process)
  GO:0080183 - response to photooxidative stress (biological process)
  GO:0009535 - chloroplast thylakoid membrane (cellular component)
InterPro domains: IPR040340 - Chloroplast enhancing stress tolerance protein
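
The genome location above uses a chromosome:start..end notation. As a minimal sketch, the following Python snippet splits such a string and computes the span length. It assumes the coordinates are 1-based and inclusive, which this record does not state. The helper names parse_location and span_length are hypothetical and not part of CuGenDB.

```python
import re


def parse_location(location):
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)


def span_length(location):
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    _, start, end = parse_location(location)
    return end - start + 1


print(parse_location("chrH02:54465..55193"))
print(span_length("chrH02:54465..55193"))
```

The resulting length can be compared with the length of the CDS sequence listed in the Sequences section of this record.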


Homology
GenBank top hits (e value, % identity, alignment)
KAA0052411.1 ycf3-interacting protein 1 [Cucumis melo var. makuwa] (e value: 1.69e-145; % identity: 91.91)
Query:  IVPLSVGTSSRRYEFVEDVVIEVSRQLSAPNSVYSSPRLAGKKEVRDGLNRSKSCGEGRGKAAPHGLIENKVMVWEKGDKHKTEEGKGRRFRCCGALCLL
        IVPLSV  SSRRYEFVEDVV+ VSRQLSAPNS YSSPRL GKK+VRDGLNRSKSCG+GRGKAAPHGLIENK+M WE GDKHKTEEGKGRRF+C GALCLL
Subjt:  IVPLSVGTSSRRYEFVEDVVIEVSRQLSAPNSVYSSPRLAGKKEVRDGLNRSKSCGEGRGKAAPHGLIENKVMVWEKGDKHKTEEGKGRRFRCCGALCLL

Query:  LPVLGFKVGKGRMKGKEEKKEEAEEGECISISISRRVSLEKFECGSWASSGMVVHEDGESGSLYFDLPMELIRNSVSAQTQSPVGAAFVFNGRGVWNKPK
        LPVLGFKVGKGRMKGKEEKKEEAEEGECISISISRRVSLEKFECGSWASSGMVVHE+GESGSLYFDLPMELIRNSVSAQ+QSPVGAAFVF+G+GV NKPK
Subjt:  LPVLGFKVGKGRMKGKEEKKEEAEEGECISISISRRVSLEKFECGSWASSGMVVHEDGESGSLYFDLPMELIRNSVSAQTQSPVGAAFVFNGRGVWNKPK

Query:  LAEESGAASPCIITPRLRKARQEFNALLEAHTHIL
        LAEESGAASPCIITPRLRKARQEFNALLEAHT IL
Subjt:  LAEESGAASPCIITPRLRKARQEFNALLEAHTHIL

XP_008439418.1 PREDICTED: uncharacterized protein LOC103484232 [Cucumis melo] (e value: 1.80e-148; % identity: 91.1)
Query:  IVPLSVGTSSRRYEFVEDVVIEVSRQLSAPNSVYSSPRLAGKKEVRDGLNRSKSCGEGRGKAAPHGLIENKVMVWEKGDKHKTEEGKGRRFRCCGALCLL
        IVPLSV  SSRRYEFVEDVV+ VSRQLSAPNS YSSPRL GKK+VRDGLNRSKSCG+GRGKAAPHGLIENK+M WE GDKHKTEEGKGRRF+C GALCLL
Subjt:  IVPLSVGTSSRRYEFVEDVVIEVSRQLSAPNSVYSSPRLAGKKEVRDGLNRSKSCGEGRGKAAPHGLIENKVMVWEKGDKHKTEEGKGRRFRCCGALCLL

Query:  LPVLGFKVGKGRMKGKEEKKEEAEEGECISISISRRVSLEKFECGSWASSGMVVHEDGESGSLYFDLPMELIRNSVSAQTQSPVGAAFVFNGR-GVWNKP
        LPVLGFKVGKGRMKGKEEKKEEAEEGECISISISRRVSL+KFECGSWASSGMVVHE+GESGSLYFDLPMELIRNSVSAQ+QSPVGAAFVF+G+ GVWNKP
Subjt:  LPVLGFKVGKGRMKGKEEKKEEAEEGECISISISRRVSLEKFECGSWASSGMVVHEDGESGSLYFDLPMELIRNSVSAQTQSPVGAAFVFNGR-GVWNKP

Query:  KLAEESGAASPCIITPRLRKARQEFNALLEAHTHIL
        KLA+ESGAASPCIITPRLRKARQEFNALLEAHT IL
Subjt:  KLAEESGAASPCIITPRLRKARQEFNALLEAHTHIL

XP_011658372.1 uncharacterized protein LOC105435976 [Cucumis sativus] (e value: 1.04e-167; % identity: 96.69)
Query:  MPAMANTIVPLSVGTSSRRYEFVEDVVIEVSRQLSAPNSVYSSPRLAGKKEVRDGLNRSKSCGEGRGKAAPHGLIENKVMVWEKGDKHKTEEGKGRRFRC
        MPAMA+ IVPLSVGTSSRRYEFVEDVVIEVSRQLSAPNS YSSPRLAGKK+ RDGLNRS+SCGEGRGKAAPHGLIENKVMVWEKGDKHKTEEGKGRRFRC
Subjt:  MPAMANTIVPLSVGTSSRRYEFVEDVVIEVSRQLSAPNSVYSSPRLAGKKEVRDGLNRSKSCGEGRGKAAPHGLIENKVMVWEKGDKHKTEEGKGRRFRC

Query:  CGALCLLLPVLGFKVGKGRMKGKEEKKEEAEEGECISISISRRVSLEKFECGSWASSGMVVHEDGESGSLYFDLPMELIRNSVSAQTQSPVGAAFVFNGR
        CGALCLLLPVLGFKVGKGRMKGKEEK+EEAEEGECISISISRRVSLEKFECGSWASSGMVVHEDGESGSLYFDLPMELIRNSVSAQTQSPVGAAFVFNGR
Subjt:  CGALCLLLPVLGFKVGKGRMKGKEEKKEEAEEGECISISISRRVSLEKFECGSWASSGMVVHEDGESGSLYFDLPMELIRNSVSAQTQSPVGAAFVFNGR

Query:  GVWNKPKLAEESGAASPCIITPRLRKARQEFNALLEAHTHIL
        GVWNKPKLAEESGAASPCIITPRLRKARQEFNALLEAHTH+L
Subjt:  GVWNKPKLAEESGAASPCIITPRLRKARQEFNALLEAHTHIL

XP_022925963.1 uncharacterized protein LOC111433224 [Cucurbita moschata] (e value: 1.71e-116; % identity: 73.23)
Query:  MPAMANTIVPLSVGTSSRRYEFVEDVVIEVSRQL-----SAPNSVYSSPRL----AGKKEVRDG---LNRSKSCGEGRGKAAPHGLIENKVMVWEKGDKH
        MPA+A+ + PLSVGT+ RRYE VEDVVIEVS Q      SAPNS YSSP L    A KK+V DG   LNRSKSCGEGRGKA PHGLIEN+VM+WEKG KH
Subjt:  MPAMANTIVPLSVGTSSRRYEFVEDVVIEVSRQL-----SAPNSVYSSPRL----AGKKEVRDG---LNRSKSCGEGRGKAAPHGLIENKVMVWEKGDKH

Query:  KTEEGKGRRFRCCGALCLLLPVLG---FKVGKGRMKGKEEKKEEAEEGE----CISISISR-RVSLEKFECGSWASSGMVVHEDGES----GSLYFDLPM
        KTEEGK RRFRC GALCLLLPVLG   FKVGKG    KEE+KEE EEGE    CISISIS  RVSLEKFECGSWASSGMV HEDGES    GSLYFDLPM
Subjt:  KTEEGKGRRFRCCGALCLLLPVLG---FKVGKGRMKGKEEKKEEAEEGE----CISISISR-RVSLEKFECGSWASSGMVVHEDGES----GSLYFDLPM

Query:  ELIRNSVSAQTQSPVGAAFVFNGRGV-------WNKPKLAEESGAASPCIITPRLRKARQEFNALLEAH
        ELIRNSV A+TQSP   AFVFN  GV       W K KLAEESGAASPC+ITPRLR+AR+EFNALLEAH
Subjt:  ELIRNSVSAQTQSPVGAAFVFNGRGV-------WNKPKLAEESGAASPCIITPRLRKARQEFNALLEAH

XP_038877520.1 uncharacterized protein LOC120069777 [Benincasa hispida] (e value: 1.96e-133; % identity: 82.14)
Query:  MPAMANTIVPLSVGTSSRRYEFVEDVVIEVSRQL-----SAPNSVYSSPRLAGKKEVRD---GLNRSKSCGEGRGKAAPHGLIENKVMVWEKGDKHKTEE
        MPAMA+TIVPLSVGT+SR YEFV+DVVIEVS QL     S PNS YSSPRLA KK+V D   GLNRSKSCGEGRGKA PH LIENKVMVWEKG KHKTE 
Subjt:  MPAMANTIVPLSVGTSSRRYEFVEDVVIEVSRQL-----SAPNSVYSSPRLAGKKEVRD---GLNRSKSCGEGRGKAAPHGLIENKVMVWEKGDKHKTEE

Query:  GKGRRFRCCGALCLLLPVLG---FKVGKGRMKGKEEKKEEAEEGECISISISRRVSLEKFECGSWASSGMVVHEDGESGSLYFDLPMELIRNSVSAQTQS
        GK +RFRC GALCLLLPVLG   FKVGKG+ KGKEE+KEEAEEGECISISISRRVSLEKFECGSWASSGMVVHEDGE GS YFDLPMELIRNSV  QTQS
Subjt:  GKGRRFRCCGALCLLLPVLG---FKVGKGRMKGKEEKKEEAEEGECISISISRRVSLEKFECGSWASSGMVVHEDGESGSLYFDLPMELIRNSVSAQTQS

Query:  PVGAAFVFNGRG--VWNKPKLAEESGAASPCIITPRLRKARQEFNALLEAHT
        PVGAAFVF+     +W KP LAEESGAASPCIITPRLRKAR+EFNALLEAHT
Subjt:  PVGAAFVFNGRG--VWNKPKLAEESGAASPCIITPRLRKARQEFNALLEAHT

TrEMBL top hits (e value, % identity, alignment)
A0A0A0KME4 Uncharacterized protein (e value: 1.1e-130; % identity: 96.69)
Query:  MPAMANTIVPLSVGTSSRRYEFVEDVVIEVSRQLSAPNSVYSSPRLAGKKEVRDGLNRSKSCGEGRGKAAPHGLIENKVMVWEKGDKHKTEEGKGRRFRC
        MPAMA+ IVPLSVGTSSRRYEFVEDVVIEVSRQLSAPNS YSSPRLAGKK+ RDGLNRS+SCGEGRGKAAPHGLIENKVMVWEKGDKHKTEEGKGRRFRC
Subjt:  MPAMANTIVPLSVGTSSRRYEFVEDVVIEVSRQLSAPNSVYSSPRLAGKKEVRDGLNRSKSCGEGRGKAAPHGLIENKVMVWEKGDKHKTEEGKGRRFRC

Query:  CGALCLLLPVLGFKVGKGRMKGKEEKKEEAEEGECISISISRRVSLEKFECGSWASSGMVVHEDGESGSLYFDLPMELIRNSVSAQTQSPVGAAFVFNGR
        CGALCLLLPVLGFKVGKGRMKGKEEK+EEAEEGECISISISRRVSLEKFECGSWASSGMVVHEDGESGSLYFDLPMELIRNSVSAQTQSPVGAAFVFNGR
Subjt:  CGALCLLLPVLGFKVGKGRMKGKEEKKEEAEEGECISISISRRVSLEKFECGSWASSGMVVHEDGESGSLYFDLPMELIRNSVSAQTQSPVGAAFVFNGR

Query:  GVWNKPKLAEESGAASPCIITPRLRKARQEFNALLEAHTHIL
        GVWNKPKLAEESGAASPCIITPRLRKARQEFNALLEAHTH+L
Subjt:  GVWNKPKLAEESGAASPCIITPRLRKARQEFNALLEAHTHIL

A0A1S3AZD3 uncharacterized protein LOC103484232 (e value: 4.7e-116; % identity: 91.1)
Query:  IVPLSVGTSSRRYEFVEDVVIEVSRQLSAPNSVYSSPRLAGKKEVRDGLNRSKSCGEGRGKAAPHGLIENKVMVWEKGDKHKTEEGKGRRFRCCGALCLL
        IVPLSV  SSRRYEFVEDVV+ VSRQLSAPNS YSSPRL GKK+VRDGLNRSKSCG+GRGKAAPHGLIENK+M WE GDKHKTEEGKGRRF+ CGALCLL
Subjt:  IVPLSVGTSSRRYEFVEDVVIEVSRQLSAPNSVYSSPRLAGKKEVRDGLNRSKSCGEGRGKAAPHGLIENKVMVWEKGDKHKTEEGKGRRFRCCGALCLL

Query:  LPVLGFKVGKGRMKGKEEKKEEAEEGECISISISRRVSLEKFECGSWASSGMVVHEDGESGSLYFDLPMELIRNSVSAQTQSPVGAAFVFNGR-GVWNKP
        LPVLGFKVGKGRMKGKEEKKEEAEEGECISISISRRVSL+KFECGSWASSGMVVHE+GESGSLYFDLPMELIRNSVSAQ+QSPVGAAFVF+G+ GVWNKP
Subjt:  LPVLGFKVGKGRMKGKEEKKEEAEEGECISISISRRVSLEKFECGSWASSGMVVHEDGESGSLYFDLPMELIRNSVSAQTQSPVGAAFVFNGR-GVWNKP

Query:  KLAEESGAASPCIITPRLRKARQEFNALLEAHTHIL
        KLA+ESGAASPCIITPRLRKARQEFNALLEAHT IL
Subjt:  KLAEESGAASPCIITPRLRKARQEFNALLEAHTHIL

A0A5A7UFW0 Ycf3-interacting protein 1 (e value: 1.6e-116; % identity: 91.91)
Query:  IVPLSVGTSSRRYEFVEDVVIEVSRQLSAPNSVYSSPRLAGKKEVRDGLNRSKSCGEGRGKAAPHGLIENKVMVWEKGDKHKTEEGKGRRFRCCGALCLL
        IVPLSV  SSRRYEFVEDVV+ VSRQLSAPNS YSSPRL GKK+VRDGLNRSKSCG+GRGKAAPHGLIENK+M WE GDKHKTEEGKGRRF+ CGALCLL
Subjt:  IVPLSVGTSSRRYEFVEDVVIEVSRQLSAPNSVYSSPRLAGKKEVRDGLNRSKSCGEGRGKAAPHGLIENKVMVWEKGDKHKTEEGKGRRFRCCGALCLL

Query:  LPVLGFKVGKGRMKGKEEKKEEAEEGECISISISRRVSLEKFECGSWASSGMVVHEDGESGSLYFDLPMELIRNSVSAQTQSPVGAAFVFNGRGVWNKPK
        LPVLGFKVGKGRMKGKEEKKEEAEEGECISISISRRVSLEKFECGSWASSGMVVHE+GESGSLYFDLPMELIRNSVSAQ+QSPVGAAFVF+G+GV NKPK
Subjt:  LPVLGFKVGKGRMKGKEEKKEEAEEGECISISISRRVSLEKFECGSWASSGMVVHEDGESGSLYFDLPMELIRNSVSAQTQSPVGAAFVFNGRGVWNKPK

Query:  LAEESGAASPCIITPRLRKARQEFNALLEAHTHIL
        LAEESGAASPCIITPRLRKARQEFNALLEAHT IL
Subjt:  LAEESGAASPCIITPRLRKARQEFNALLEAHTHIL

A0A6J1EGQ7 uncharacterized protein LOC111433224 (e value: 6.2e-92; % identity: 73.23)
Query:  MPAMANTIVPLSVGTSSRRYEFVEDVVIEVSRQL-----SAPNSVYSSPRL----AGKKEVRD---GLNRSKSCGEGRGKAAPHGLIENKVMVWEKGDKH
        MPA+A+ + PLSVGT+ RRYE VEDVVIEVS Q      SAPNS YSSP L    A KK+V D   GLNRSKSCGEGRGKA PHGLIEN+VM+WEKG KH
Subjt:  MPAMANTIVPLSVGTSSRRYEFVEDVVIEVSRQL-----SAPNSVYSSPRL----AGKKEVRD---GLNRSKSCGEGRGKAAPHGLIENKVMVWEKGDKH

Query:  KTEEGKGRRFRCCGALCLLLPV---LGFKVGKGRMKGKEEKKEEAEEGE----CISISISR-RVSLEKFECGSWASSGMVVHEDGES----GSLYFDLPM
        KTEEGK RRFR CGALCLLLPV   LGFKVG    KGKEE+KEE EEGE    CISISIS  RVSLEKFECGSWASSGMV HEDGES    GSLYFDLPM
Subjt:  KTEEGKGRRFRCCGALCLLLPV---LGFKVGKGRMKGKEEKKEEAEEGE----CISISISR-RVSLEKFECGSWASSGMVVHEDGES----GSLYFDLPM

Query:  ELIRNSVSAQTQSPVGAAFVFNGRG-------VWNKPKLAEESGAASPCIITPRLRKARQEFNALLEAH
        ELIRNSV A+TQSP   AFVFN  G       VW K KLAEESGAASPC+ITPRLR+AR+EFNALLEAH
Subjt:  ELIRNSVSAQTQSPVGAAFVFNGRG-------VWNKPKLAEESGAASPCIITPRLRKARQEFNALLEAH

A0A6J1ILL2 uncharacterized protein LOC111478548 (e value: 5.8e-90; % identity: 71.75)
Query:  MPAMANTIVPLSVGTSSRRYEFVEDVVIEVSRQL-----SAPNSVYSSPRL----AGKKEVRD---GLNRSKSCGEGRGKAAPHGLIENKVMVWEKGDKH
        MPA+A+ + PLSVGT+ R YE VEDVVI+VS Q      SAPNS YSSP L    A KK+V D   GLNRSKSCGEGRGKA PHGLI+N+VM+WEKG KH
Subjt:  MPAMANTIVPLSVGTSSRRYEFVEDVVIEVSRQL-----SAPNSVYSSPRL----AGKKEVRD---GLNRSKSCGEGRGKAAPHGLIENKVMVWEKGDKH

Query:  KTEEGKGRRFRCCGALCLLLPV---LGFKVGKGRMKGKEEKKEEAEEGE----CISISISR-RVSLEKFECGSWASSGMVVHEDGES----GSLYFDLPM
        KTEEGK RRFR CGALCLLLP+   LGFKVG    KGKEE+KEE EEGE    CISISIS  RVSLEKFECGSWASSGMV HEDGES    GSLYFDLPM
Subjt:  KTEEGKGRRFRCCGALCLLLPV---LGFKVGKGRMKGKEEKKEEAEEGE----CISISISR-RVSLEKFECGSWASSGMVVHEDGES----GSLYFDLPM

Query:  ELIRNSVSAQTQSPVGAAFVFNGRG-------VWNKPKLAEESGAASPCIITPRLRKARQEFNALLEAH
        ELIRNSV A+TQSP  AAFVF+  G       VW K KLAEESGAASPC+ITPRLR+AR+EFNALLEAH
Subjt:  ELIRNSVSAQTQSPVGAAFVFNGRG-------VWNKPKLAEESGAASPCIITPRLRKARQEFNALLEAH

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT1G30850.1 root hair specific 4 (e value: 7.4e-13; % identity: 31.84)
Query:  CGALCLLLPVLGFKVGKGRMKGKEEKKEEAEEGECISIS------ISRRVSLEKFECGSWASSGMVVHEDGESGSLYFDLPMELIR-----NSVSAQTQS
        C A CL LP  GF  GK ++     K++ + E + I  S      +S R SLEKFECGSWAS+  ++ ++G    L+FD P+E+ +      +     Q 
Subjt:  CGALCLLLPVLGFKVGKGRMKGKEEKKEEAEEGECISIS------ISRRVSLEKFECGSWASSGMVVHEDGESGSLYFDLPMELIR-----NSVSAQTQS

Query:  PVGAAFVF--------------------NGRGVWNKPK-----LAEESGAASPC------IITPRLRKARQEFNALLEA
        PV + F+F                    + R   + P+         S A+  C       ITPRLRKAR +FN  L A
Subjt:  PVGAAFVF--------------------NGRGVWNKPK-----LAEESGAASPC------IITPRLRKARQEFNALLEA

AT2G34910.1 BEST Arabidopsis thaliana protein match is: root hair specific 4 (TAIR:AT1G30850.1) (e value: 3.3e-13; % identity: 34.52)
Query:  CGALCLLLPVLGFKVGKGRMKGKEEKKEEAEEGECISISISRRVSLEKFECGSWASSGMVVHEDGESGSLYFDLPMELIRNSVSAQTQSPVGAAFVFNG-
        C A CL LP  G +  +        KK+  +     + ++S   SLEKFECGSWAS+  +  E+G    LY DLP+E+I+       Q PV + F F+  
Subjt:  CGALCLLLPVLGFKVGKGRMKGKEEKKEEAEEGECISISISRRVSLEKFECGSWASSGMVVHEDGESGSLYFDLPMELIRNSVSAQTQSPVGAAFVFNG-

Query:  ------RGVWNKPK---------LAE--------------ESGAASP-CIITPRLRKARQEFNALLEA
              R V  K           LAE              +S  ASP   ITPRL KAR +FN  L A
Subjt:  ------RGVWNKPK---------LAE--------------ESGAASP-CIITPRLRKARQEFNALLEA

AT4G20190.1 unknown protein (e value: 1.9e-13; % identity: 34.46)
Query:  CGALCLLLPVLGFKVGK---GRMKGKEEKKEEAEEGECISIS----------ISRRVSLEKFECGSWASSGMVVHEDGESGSLYFDLPMELIRNSVSAQT
        C ALCL LP  GF  GK      KG              S++          +S R SLE+FECGSW SS M+  ++ + G  +FDLP ELI+       
Subjt:  CGALCLLLPVLGFKVGK---GRMKGKEEKKEEAEEGECISIS----------ISRRVSLEKFECGSWASSGMVVHEDGESGSLYFDLPMELIRNSVSAQT

Query:  Q-SPVGAAFVFNG--------RGV----WNKPKLAEES------GAASPC--------IITPRLRKARQEFNALLEA
        Q  PV AAFVF+         +GV     +K + + ES        +SP          ITPRL +A ++F++ LEA
Subjt:  Q-SPVGAAFVFNG--------RGV----WNKPKLAEES------GAASPC--------IITPRLRKARQEFNALLEA

AT5G44660.1 unknown protein (e value: 6.2e-12; % identity: 30.83)
Query:  QLSAPNSVYSSPR--------LAGKKEVRDGLN--RSKSCGEGRGKAAPHGLIENKVMVWEKGDKHKTEEGKGR---RFRCCGALCLLLPVLGFKVGKG-
        Q S PNS   SP+        L  K++     +  RSKSCG    K   H     +   + K D +K+         RF+ C ALCL LP  GF  GK  
Subjt:  QLSAPNSVYSSPR--------LAGKKEVRDGLN--RSKSCGEGRGKAAPHGLIENKVMVWEKGDKHKTEEGKGR---RFRCCGALCLLLPVLGFKVGKG-

Query:  RMKGKEEKKE---------------------EAEEGECISISISRRVSLEKFECGSWASSGMVVHEDGESGSLYFDLPMELIRN-SVSAQTQSPVGAAFV
        R   K++                           E    +  IS R S+EKF+CGS+ S         E G+ +FDLP ELI++ S       PV AAFV
Subjt:  RMKGKEEKKE---------------------EAEEGECISISISRRVSLEKFECGSWASSGMVVHEDGESGSLYFDLPMELIRN-SVSAQTQSPVGAAFV

Query:  FNGRGV-----------WNKPKLAEES--------GAASPC------IITPRLRKARQEFNALLEA
        F+   V            +K + A ES          +SP        I+PRL +A + FNA LEA
Subjt:  FNGRGV-----------WNKPKLAEES--------GAASPC------IITPRLRKARQEFNALLEA


Sequences
CDS sequence
ATGCCAGCGATGGCCAATACAATAGTGCCTTTGTCTGTTGGCACCTCTAGCAGAAGGTACGAATTTGTCGAGGATGTGGTAATTGAGGTGTCAAGGCAATTGAGTGCCCC
GAACTCGGTCTATTCATCCCCTCGGTTGGCAGGAAAAAAGGAAGTCCGTGATGGGCTGAATCGGAGCAAATCCTGTGGTGAAGGAAGAGGGAAGGCAGCGCCGCATGGGC
TTATTGAGAATAAAGTAATGGTATGGGAGAAAGGGGATAAGCACAAAACAGAGGAGGGTAAAGGGAGGCGCTTCAGATGTTGTGGGGCGCTATGCCTGTTGTTACCAGTG
TTGGGGTTTAAGGTTGGGAAAGGGAGAATGAAAGGGAAGGAAGAGAAAAAGGAAGAGGCAGAGGAAGGCGAGTGTATATCCATATCCATATCGAGGAGAGTTTCTTTGGA
AAAATTCGAATGTGGATCGTGGGCTTCGTCGGGGATGGTGGTTCATGAAGACGGGGAGTCGGGTAGCCTTTATTTTGATCTGCCAATGGAATTGATAAGGAACAGCGTGA
GTGCACAGACTCAATCACCAGTAGGAGCAGCTTTTGTATTTAATGGGAGGGGAGTTTGGAACAAACCAAAATTGGCAGAGGAATCAGGAGCTGCATCCCCATGCATCATT
ACCCCACGCTTGCGCAAAGCTAGACAAGAGTTCAATGCTCTTTTGGAAGCTCACACTCACATTCTATGA
mRNA sequence
ATGCCAGCGATGGCCAATACAATAGTGCCTTTGTCTGTTGGCACCTCTAGCAGAAGGTACGAATTTGTCGAGGATGTGGTAATTGAGGTGTCAAGGCAATTGAGTGCCCC
GAACTCGGTCTATTCATCCCCTCGGTTGGCAGGAAAAAAGGAAGTCCGTGATGGGCTGAATCGGAGCAAATCCTGTGGTGAAGGAAGAGGGAAGGCAGCGCCGCATGGGC
TTATTGAGAATAAAGTAATGGTATGGGAGAAAGGGGATAAGCACAAAACAGAGGAGGGTAAAGGGAGGCGCTTCAGATGTTGTGGGGCGCTATGCCTGTTGTTACCAGTG
TTGGGGTTTAAGGTTGGGAAAGGGAGAATGAAAGGGAAGGAAGAGAAAAAGGAAGAGGCAGAGGAAGGCGAGTGTATATCCATATCCATATCGAGGAGAGTTTCTTTGGA
AAAATTCGAATGTGGATCGTGGGCTTCGTCGGGGATGGTGGTTCATGAAGACGGGGAGTCGGGTAGCCTTTATTTTGATCTGCCAATGGAATTGATAAGGAACAGCGTGA
GTGCACAGACTCAATCACCAGTAGGAGCAGCTTTTGTATTTAATGGGAGGGGAGTTTGGAACAAACCAAAATTGGCAGAGGAATCAGGAGCTGCATCCCCATGCATCATT
ACCCCACGCTTGCGCAAAGCTAGACAAGAGTTCAATGCTCTTTTGGAAGCTCACACTCACATTCTATGA
Protein sequence
MPAMANTIVPLSVGTSSRRYEFVEDVVIEVSRQLSAPNSVYSSPRLAGKKEVRDGLNRSKSCGEGRGKAAPHGLIENKVMVWEKGDKHKTEEGKGRRFRCCGALCLLLPV
LGFKVGKGRMKGKEEKKEEAEEGECISISISRRVSLEKFECGSWASSGMVVHEDGESGSLYFDLPMELIRNSVSAQTQSPVGAAFVFNGRGVWNKPKLAEESGAASPCII
TPRLRKARQEFNALLEAHTHIL
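
The CDS and protein sequences above can be cross-checked by translating the CDS. The following is a minimal Python sketch of that check, assuming the standard genetic code (NCBI translation table 1). The function name translate and the table construction are illustrative and not taken from CuGenDB. The example translates only the first seven and last four codons of the CDS shown above.

```python
# Standard genetic code (NCBI translation table 1), with codons ordered
# by base in the order T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds):
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for pos in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE[cds[pos:pos + 3]]
        if amino_acid == "*":  # stop codon ends the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


# First seven codons of the CDS listed above.
print(translate("ATGCCAGCGATGGCCAATACA"))  # MPAMANT
# Last four codons of the CDS, ending in the TGA stop codon.
print(translate("CACATTCTATGA"))  # HIL
```

To check the whole record, paste the full CDS into translate and compare the result with the protein sequence. Line breaks in the pasted text are ignored.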