; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

IVF0015685 (gene) of Melon (IVF77) v1 genome

Gene IDIVF0015685
OrganismCucumis melo ssp. agrestis cv. IVF77 (Melon (IVF77) v1)
DescriptionYcf3-interacting protein 1
Genome locationchr08:52423..53153
RNA-Seq ExpressionIVF0015685
SyntenyIVF0015685
Gene Ontology termsGO:0048564 - photosystem I assembly (biological process)
GO:0080183 - response to photooxidative stress (biological process)
GO:0009535 - chloroplast thylakoid membrane (cellular component)
InterPro domainsIPR040340 - Chloroplast enhancing stress tolerance protein


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0052411.1 ycf3-interacting protein 1 [Cucumis melo var. makuwa]2.02e-14892.53Show/hide
Query:  LQYASDGIVPLSVVPSSRRYEFVEDVVVGVSRQLSAPNSGYSSPRLIGKKKVRDGLNRSKSCGDGRGKAAPHGLIENKIMAWEGGDKHKTEEGKGRRFKC
        + YASDGIVPLSVVPSSRRYEFVEDVVVGVSRQLSAPNSGYSSPRLIGKKKVRDGLNRSKSCGDGRGKAAPHGLIENKIMAWEGGDKHKTEEGKGRRFKC
Subjt:  LQYASDGIVPLSVVPSSRRYEFVEDVVVGVSRQLSAPNSGYSSPRLIGKKKVRDGLNRSKSCGDGRGKAAPHGLIENKIMAWEGGDKHKTEEGKGRRFKC

Query:  G-----------RYACWRMKGKEEKKEEAEEGECISISISRRVSLEKFECGSWASSGMVVHEEGESGSLYFDLPMELIRNSVSAQSQSPVGAAFVFDGKG
        G           +    RMKGKEEKKEEAEEGECISISISRRVSLEKFECGSWASSGMVVHEEGESGSLYFDLPMELIRNSVSAQSQSPVGAAFVFDGKG
Subjt:  G-----------RYACWRMKGKEEKKEEAEEGECISISISRRVSLEKFECGSWASSGMVVHEEGESGSLYFDLPMELIRNSVSAQSQSPVGAAFVFDGKG

Query:  VRNKPKLAEESGAASPCIITPRLRKARQEFNALLEAHTIIL
        VRNKPKLAEESGAASPCIITPRLRKARQEFNALLEAHTIIL
Subjt:  VRNKPKLAEESGAASPCIITPRLRKARQEFNALLEAHTIIL

XP_008439418.1 PREDICTED: uncharacterized protein LOC103484232 [Cucumis melo]1.21e-15091.8Show/hide
Query:  MKLQYASDGIVPLSVVPSSRRYEFVEDVVVGVSRQLSAPNSGYSSPRLIGKKKVRDGLNRSKSCGDGRGKAAPHGLIENKIMAWEGGDKHKTEEGKGRRF
        MKLQYASDGIVPLSVVPSSRRYEFVEDVVVGVSRQLSAPNSGYSSPRLIGKKKVRDGLNRSKSCGDGRGKAAPHGLIENKIMAWEGGDKHKTEEGKGRRF
Subjt:  MKLQYASDGIVPLSVVPSSRRYEFVEDVVVGVSRQLSAPNSGYSSPRLIGKKKVRDGLNRSKSCGDGRGKAAPHGLIENKIMAWEGGDKHKTEEGKGRRF

Query:  KCG-----------RYACWRMKGKEEKKEEAEEGECISISISRRVSLEKFECGSWASSGMVVHEEGESGSLYFDLPMELIRNSVSAQSQSPVGAAFVFDG
        KCG           +    RMKGKEEKKEEAEEGECISISISRRVSL+KFECGSWASSGMVVHEEGESGSLYFDLPMELIRNSVSAQSQSPVGAAFVFDG
Subjt:  KCG-----------RYACWRMKGKEEKKEEAEEGECISISISRRVSLEKFECGSWASSGMVVHEEGESGSLYFDLPMELIRNSVSAQSQSPVGAAFVFDG

Query:  K-GVRNKPKLAEESGAASPCIITPRLRKARQEFNALLEAHTIIL
        K GV NKPKLA+ESGAASPCIITPRLRKARQEFNALLEAHTIIL
Subjt:  K-GVRNKPKLAEESGAASPCIITPRLRKARQEFNALLEAHTIIL

XP_011658372.1 uncharacterized protein LOC105435976 [Cucumis sativus]4.36e-13282.77Show/hide
Query:  SDGIVPLSVVPSSRRYEFVEDVVVGVSRQLSAPNSGYSSPRLIGKKKVRDGLNRSKSCGDGRGKAAPHGLIENKIMAWEGGDKHKTEEGKGRRFKCGRYA
        +  IVPLSV  SSRRYEFVEDVV+ VSRQLSAPNS YSSPRL GKKK RDGLNRS+SCG+GRGKAAPHGLIENK+M WE GDKHKTEEGKGRRF+C    
Subjt:  SDGIVPLSVVPSSRRYEFVEDVVVGVSRQLSAPNSGYSSPRLIGKKKVRDGLNRSKSCGDGRGKAAPHGLIENKIMAWEGGDKHKTEEGKGRRFKCGRYA

Query:  CW------------RMKGKEEKKEEAEEGECISISISRRVSLEKFECGSWASSGMVVHEEGESGSLYFDLPMELIRNSVSAQSQSPVGAAFVFDGKGVRN
        C             RMKGKEEK+EEAEEGECISISISRRVSLEKFECGSWASSGMVVHE+GESGSLYFDLPMELIRNSVSAQ+QSPVGAAFVF+G+GV N
Subjt:  CW------------RMKGKEEKKEEAEEGECISISISRRVSLEKFECGSWASSGMVVHEEGESGSLYFDLPMELIRNSVSAQSQSPVGAAFVFDGKGVRN

Query:  KPKLAEESGAASPCIITPRLRKARQEFNALLEAHTIIL
        KPKLAEESGAASPCIITPRLRKARQEFNALLEAHT +L
Subjt:  KPKLAEESGAASPCIITPRLRKARQEFNALLEAHTIIL

XP_022925963.1 uncharacterized protein LOC111433224 [Cucurbita moschata]6.13e-9565.52Show/hide
Query:  SDGIVPLSVVPSSRRYEFVEDVVVGVSRQL-----SAPNSGYSSPRL----IGKKKVRDG---LNRSKSCGDGRGKAAPHGLIENKIMAWEGGDKHKTEE
        +D + PLSV  + RRYE VEDVV+ VS Q      SAPNS YSSP L      KKKV DG   LNRSKSCG+GRGKA PHGLIEN++M WE G KHKTEE
Subjt:  SDGIVPLSVVPSSRRYEFVEDVVVGVSRQL-----SAPNSGYSSPRL----IGKKKVRDG---LNRSKSCGDGRGKAAPHGLIENKIMAWEGGDKHKTEE

Query:  GKGRRFKCGRYACWRM-----------KGKEEKKEEAEEGE----CISISISR-RVSLEKFECGSWASSGMVVHEEGES----GSLYFDLPMELIRNSVS
        GK RRF+CG   C  +           KGKEE+KEE EEGE    CISISIS  RVSLEKFECGSWASSGMV HE+GES    GSLYFDLPMELIRNSV 
Subjt:  GKGRRFKCGRYACWRM-----------KGKEEKKEEAEEGE----CISISISR-RVSLEKFECGSWASSGMVVHEEGES----GSLYFDLPMELIRNSVS

Query:  AQSQSPVGAAFVF--DGKGVRNKP-----KLAEESGAASPCIITPRLRKARQEFNALLEAH
        A++QSP   AFVF  DG GV + P     KLAEESGAASPC+ITPRLR+AR+EFNALLEAH
Subjt:  AQSQSPVGAAFVF--DGKGVRNKP-----KLAEESGAASPCIITPRLRKARQEFNALLEAH

XP_038877520.1 uncharacterized protein LOC120069777 [Benincasa hispida]5.50e-10770.56Show/hide
Query:  SDGIVPLSVVPSSRRYEFVEDVVVGVSRQL-----SAPNSGYSSPRLIGKKKVRD---GLNRSKSCGDGRGKAAPHGLIENKIMAWEGGDKHKTEEGKGR
        +D IVPLSV  +SR YEFV+DVV+ VS QL     S PNS YSSPRL  KKKV D   GLNRSKSCG+GRGKA PH LIENK+M WE G KHKTE GK +
Subjt:  SDGIVPLSVVPSSRRYEFVEDVVVGVSRQL-----SAPNSGYSSPRLIGKKKVRD---GLNRSKSCGDGRGKAAPHGLIENKIMAWEGGDKHKTEEGKGR

Query:  RFKCG--------------RYACWRMKGKEEKKEEAEEGECISISISRRVSLEKFECGSWASSGMVVHEEGESGSLYFDLPMELIRNSVSAQSQSPVGAA
        RF+CG              +    + KGKEE+KEEAEEGECISISISRRVSLEKFECGSWASSGMVVHE+GE GS YFDLPMELIRNSV  Q+QSPVGAA
Subjt:  RFKCG--------------RYACWRMKGKEEKKEEAEEGECISISISRRVSLEKFECGSWASSGMVVHEEGESGSLYFDLPMELIRNSVSAQSQSPVGAA

Query:  FVFDGKG--VRNKPKLAEESGAASPCIITPRLRKARQEFNALLEAHTI
        FVFD     +  KP LAEESGAASPCIITPRLRKAR+EFNALLEAHT+
Subjt:  FVFDGKG--VRNKPKLAEESGAASPCIITPRLRKARQEFNALLEAHTI

TrEMBL top hitse value%identityAlignment
A0A0A0KME4 Uncharacterized protein1.3e-10282.77Show/hide
Query:  SDGIVPLSVVPSSRRYEFVEDVVVGVSRQLSAPNSGYSSPRLIGKKKVRDGLNRSKSCGDGRGKAAPHGLIENKIMAWEGGDKHKTEEGKGRRFKCGRYA
        +  IVPLSV  SSRRYEFVEDVV+ VSRQLSAPNS YSSPRL GKKK RDGLNRS+SCG+GRGKAAPHGLIENK+M WE GDKHKTEEGKGRRF+C    
Subjt:  SDGIVPLSVVPSSRRYEFVEDVVVGVSRQLSAPNSGYSSPRLIGKKKVRDGLNRSKSCGDGRGKAAPHGLIENKIMAWEGGDKHKTEEGKGRRFKCGRYA

Query:  C------------WRMKGKEEKKEEAEEGECISISISRRVSLEKFECGSWASSGMVVHEEGESGSLYFDLPMELIRNSVSAQSQSPVGAAFVFDGKGVRN
        C             RMKGKEEK+EEAEEGECISISISRRVSLEKFECGSWASSGMVVHE+GESGSLYFDLPMELIRNSVSAQ+QSPVGAAFVF+G+GV N
Subjt:  C------------WRMKGKEEKKEEAEEGECISISISRRVSLEKFECGSWASSGMVVHEEGESGSLYFDLPMELIRNSVSAQSQSPVGAAFVFDGKGVRN

Query:  KPKLAEESGAASPCIITPRLRKARQEFNALLEAHTIIL
        KPKLAEESGAASPCIITPRLRKARQEFNALLEAHT +L
Subjt:  KPKLAEESGAASPCIITPRLRKARQEFNALLEAHTIIL

A0A1S3AZD3 uncharacterized protein LOC1034842326.9e-11791.8Show/hide
Query:  MKLQYASDGIVPLSVVPSSRRYEFVEDVVVGVSRQLSAPNSGYSSPRLIGKKKVRDGLNRSKSCGDGRGKAAPHGLIENKIMAWEGGDKHKTEEGKGRRF
        MKLQYASDGIVPLSVVPSSRRYEFVEDVVVGVSRQLSAPNSGYSSPRLIGKKKVRDGLNRSKSCGDGRGKAAPHGLIENKIMAWEGGDKHKTEEGKGRRF
Subjt:  MKLQYASDGIVPLSVVPSSRRYEFVEDVVVGVSRQLSAPNSGYSSPRLIGKKKVRDGLNRSKSCGDGRGKAAPHGLIENKIMAWEGGDKHKTEEGKGRRF

Query:  KCG-----------RYACWRMKGKEEKKEEAEEGECISISISRRVSLEKFECGSWASSGMVVHEEGESGSLYFDLPMELIRNSVSAQSQSPVGAAFVFDG
        KCG           +    RMKGKEEKKEEAEEGECISISISRRVSL+KFECGSWASSGMVVHEEGESGSLYFDLPMELIRNSVSAQSQSPVGAAFVFDG
Subjt:  KCG-----------RYACWRMKGKEEKKEEAEEGECISISISRRVSLEKFECGSWASSGMVVHEEGESGSLYFDLPMELIRNSVSAQSQSPVGAAFVFDG

Query:  K-GVRNKPKLAEESGAASPCIITPRLRKARQEFNALLEAHTIIL
        K GV NKPKLA+ESGAASPCIITPRLRKARQEFNALLEAHTIIL
Subjt:  K-GVRNKPKLAEESGAASPCIITPRLRKARQEFNALLEAHTIIL

A0A5A7UFW0 Ycf3-interacting protein 16.2e-11892.53Show/hide
Query:  LQYASDGIVPLSVVPSSRRYEFVEDVVVGVSRQLSAPNSGYSSPRLIGKKKVRDGLNRSKSCGDGRGKAAPHGLIENKIMAWEGGDKHKTEEGKGRRFKC
        + YASDGIVPLSVVPSSRRYEFVEDVVVGVSRQLSAPNSGYSSPRLIGKKKVRDGLNRSKSCGDGRGKAAPHGLIENKIMAWEGGDKHKTEEGKGRRFKC
Subjt:  LQYASDGIVPLSVVPSSRRYEFVEDVVVGVSRQLSAPNSGYSSPRLIGKKKVRDGLNRSKSCGDGRGKAAPHGLIENKIMAWEGGDKHKTEEGKGRRFKC

Query:  G-----------RYACWRMKGKEEKKEEAEEGECISISISRRVSLEKFECGSWASSGMVVHEEGESGSLYFDLPMELIRNSVSAQSQSPVGAAFVFDGKG
        G           +    RMKGKEEKKEEAEEGECISISISRRVSLEKFECGSWASSGMVVHEEGESGSLYFDLPMELIRNSVSAQSQSPVGAAFVFDGKG
Subjt:  G-----------RYACWRMKGKEEKKEEAEEGECISISISRRVSLEKFECGSWASSGMVVHEEGESGSLYFDLPMELIRNSVSAQSQSPVGAAFVFDGKG

Query:  VRNKPKLAEESGAASPCIITPRLRKARQEFNALLEAHTIIL
        VRNKPKLAEESGAASPCIITPRLRKARQEFNALLEAHTIIL
Subjt:  VRNKPKLAEESGAASPCIITPRLRKARQEFNALLEAHTIIL

A0A6J1EGQ7 uncharacterized protein LOC1114332241.1e-7465.52Show/hide
Query:  SDGIVPLSVVPSSRRYEFVEDVVVGVSRQL-----SAPNSGYSSPRL----IGKKKVRD---GLNRSKSCGDGRGKAAPHGLIENKIMAWEGGDKHKTEE
        +D + PLSV  + RRYE VEDVV+ VS Q      SAPNS YSSP L      KKKV D   GLNRSKSCG+GRGKA PHGLIEN++M WE G KHKTEE
Subjt:  SDGIVPLSVVPSSRRYEFVEDVVVGVSRQL-----SAPNSGYSSPRL----IGKKKVRD---GLNRSKSCGDGRGKAAPHGLIENKIMAWEGGDKHKTEE

Query:  GKGRRFKCGRYACWRM-----------KGKEEKKEEAEEGE----CISISISR-RVSLEKFECGSWASSGMVVHEEGES----GSLYFDLPMELIRNSVS
        GK RRF+CG   C  +           KGKEE+KEE EEGE    CISISIS  RVSLEKFECGSWASSGMV HE+GES    GSLYFDLPMELIRNSV 
Subjt:  GKGRRFKCGRYACWRM-----------KGKEEKKEEAEEGE----CISISISR-RVSLEKFECGSWASSGMVVHEEGES----GSLYFDLPMELIRNSVS

Query:  AQSQSPVGAAFVF--DGKGVRNKP-----KLAEESGAASPCIITPRLRKARQEFNALLEAH
        A++QSP   AFVF  DG GV + P     KLAEESGAASPC+ITPRLR+AR+EFNALLEAH
Subjt:  AQSQSPVGAAFVF--DGKGVRNKP-----KLAEESGAASPCIITPRLRKARQEFNALLEAH

A0A6J1ILL2 uncharacterized protein LOC1114785484.2e-7464.75Show/hide
Query:  SDGIVPLSVVPSSRRYEFVEDVVVGVSRQL-----SAPNSGYSSPRL----IGKKKVRD---GLNRSKSCGDGRGKAAPHGLIENKIMAWEGGDKHKTEE
        +D + PLSV  + R YE VEDVV+ VS Q      SAPNS YSSP L      KKKV D   GLNRSKSCG+GRGKA PHGLI+N++M WE G KHKTEE
Subjt:  SDGIVPLSVVPSSRRYEFVEDVVVGVSRQL-----SAPNSGYSSPRL----IGKKKVRD---GLNRSKSCGDGRGKAAPHGLIENKIMAWEGGDKHKTEE

Query:  GKGRRFKCGRYACWRM-----------KGKEEKKEEAEEGE----CISISISR-RVSLEKFECGSWASSGMVVHEEGES----GSLYFDLPMELIRNSVS
        GK RRF+CG   C  +           KGKEE+KEE EEGE    CISISIS  RVSLEKFECGSWASSGMV HE+GES    GSLYFDLPMELIRNSV 
Subjt:  GKGRRFKCGRYACWRM-----------KGKEEKKEEAEEGE----CISISISR-RVSLEKFECGSWASSGMVVHEEGES----GSLYFDLPMELIRNSVS

Query:  AQSQSPVGAAFVFDGKGVR-------NKPKLAEESGAASPCIITPRLRKARQEFNALLEAH
        A++QSP  AAFVFD  GV         K KLAEESGAASPC+ITPRLR+AR+EFNALLEAH
Subjt:  AQSQSPVGAAFVFDGKGVR-------NKPKLAEESGAASPCIITPRLRKARQEFNALLEAH

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G30850.1 root hair specific 41.6e-0931.32Show/hide
Query:  KGRRFKCGRYACWRMKG--------KEEKKEEAEEGECISIS------ISRRVSLEKFECGSWASSGMVVHEEGESGSLYFDLPMELIR-NSVSAQS---
        K   FKC  + C  + G           K++ + E + I  S      +S R SLEKFECGSWAS+  ++    ++G L+FD P+E+ + NS        
Subjt:  KGRRFKCGRYACWRMKG--------KEEKKEEAEEGECISIS------ISRRVSLEKFECGSWASSGMVVHEEGESGSLYFDLPMELIR-NSVSAQS---

Query:  -QSPVGAAFVFD-------------GKGVRNKPKLAE------------ESGAASPC------IITPRLRKARQEFNALLEA
         Q PV + F+FD              +  R+  + AE             S A+  C       ITPRLRKAR +FN  L A
Subjt:  -QSPVGAAFVFD-------------GKGVRNKPKLAE------------ESGAASPC------IITPRLRKARQEFNALLEA

AT2G34910.1 BEST Arabidopsis thaliana protein match is: root hair specific 4 (TAIR:AT1G30850.1)9.5e-1032.16Show/hide
Query:  FKCGRYACWRMKG------KEEKKEEAEEGECISIS------ISRRVSLEKFECGSWASSGMVVHEEGESGSLYFDLPMELIRNSVSAQSQSPVGAAFVF
        FKC  + C  + G      +  K E++ + + I  S      +S   SLEKFECGSWAS+  +     E+G LY DLP+E+I+       Q PV + F F
Subjt:  FKCGRYACWRMKG------KEEKKEEAEEGECISIS------ISRRVSLEKFECGSWASSGMVVHEEGESGSLYFDLPMELIRNSVSAQSQSPVGAAFVF

Query:  D-------------------GKGVRNKPKLA-----------EESGAASP-CIITPRLRKARQEFNALLEA
        D                   G+ +R+  + +            +S  ASP   ITPRL KAR +FN  L A
Subjt:  D-------------------GKGVRNKPKLA-----------EESGAASP-CIITPRLRKARQEFNALLEA

AT4G20190.1 unknown protein4.1e-1338.28Show/hide
Query:  ISRRVSLEKFECGSWASSGMVVHEEGESGSLYFDLPMELIRNSVSAQSQ-SPVGAAFVFDG--------KGV----RNKPKLAEES------GAASPC--
        +S R SLE+FECGSW SS M+  +  + G  +FDLP ELI+       Q  PV AAFVFD         KGV     +K + + ES        +SP   
Subjt:  ISRRVSLEKFECGSWASSGMVVHEEGESGSLYFDLPMELIRNSVSAQSQ-SPVGAAFVFDG--------KGV----RNKPKLAEES------GAASPC--

Query:  ------IITPRLRKARQEFNALLEAHTI
               ITPRL +A ++F++ LEA  +
Subjt:  ------IITPRLRKARQEFNALLEAHTI

AT5G44660.1 unknown protein1.6e-0938.58Show/hide
Query:  ISRRVSLEKFECGSWASSGMVVHEEGESGSLYFDLPMELIRN-SVSAQSQSPVGAAFVFDGKGVR-----------NKPKLAEES--------GAASPC-
        IS R S+EKF+CGS+ S      EEG  G+ +FDLP ELI++ S       PV AAFVFD + V            +K + A ES          +SP  
Subjt:  ISRRVSLEKFECGSWASSGMVVHEEGESGSLYFDLPMELIRN-SVSAQSQSPVGAAFVFDGKGVR-----------NKPKLAEES--------GAASPC-

Query:  -----IITPRLRKARQEFNALLEAHTI
              I+PRL +A + FNA LEA  +
Subjt:  -----IITPRLRKARQEFNALLEAHTI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAACTGCAGTATGCCAGCGATGGAATAGTGCCTTTATCTGTTGTGCCCTCTAGCAGAAGGTACGAATTTGTTGAGGATGTGGTTGTTGGGGTGTCAAGGCAATTGAG
TGCCCCGAACTCGGGCTATTCATCCCCTCGGCTGATAGGAAAAAAGAAAGTCCGTGATGGGCTGAATCGGAGCAAATCCTGTGGTGACGGAAGAGGGAAGGCAGCGCCGC
ATGGGCTTATTGAGAACAAAATAATGGCATGGGAGGGAGGGGATAAGCACAAAACAGAGGAGGGGAAAGGGAGGCGCTTCAAATGTGGGCGCTATGCTTGTTGGAGAATG
AAAGGGAAGGAAGAGAAAAAGGAAGAGGCAGAGGAAGGTGAGTGTATATCCATATCCATATCGAGGAGAGTTTCTTTGGAAAAATTCGAATGTGGGTCGTGGGCTTCGTC
AGGGATGGTGGTTCATGAAGAGGGGGAGTCGGGTAGCCTTTATTTTGATCTGCCAATGGAATTGATAAGGAACAGCGTGAGTGCACAGAGTCAATCACCAGTAGGGGCAG
CTTTTGTATTTGATGGAAAGGGAGTTCGGAACAAACCGAAATTGGCGGAGGAATCAGGAGCTGCATCCCCATGCATCATTACCCCACGCTTGCGCAAAGCTAGACAAGAG
TTCAATGCTCTTTTGGAAGCTCACACTATCATTCTGTGA
mRNA sequenceShow/hide mRNA sequence
ATGAAACTGCAGTATGCCAGCGATGGAATAGTGCCTTTATCTGTTGTGCCCTCTAGCAGAAGGTACGAATTTGTTGAGGATGTGGTTGTTGGGGTGTCAAGGCAATTGAG
TGCCCCGAACTCGGGCTATTCATCCCCTCGGCTGATAGGAAAAAAGAAAGTCCGTGATGGGCTGAATCGGAGCAAATCCTGTGGTGACGGAAGAGGGAAGGCAGCGCCGC
ATGGGCTTATTGAGAACAAAATAATGGCATGGGAGGGAGGGGATAAGCACAAAACAGAGGAGGGGAAAGGGAGGCGCTTCAAATGTGGGCGCTATGCTTGTTGGAGAATG
AAAGGGAAGGAAGAGAAAAAGGAAGAGGCAGAGGAAGGTGAGTGTATATCCATATCCATATCGAGGAGAGTTTCTTTGGAAAAATTCGAATGTGGGTCGTGGGCTTCGTC
AGGGATGGTGGTTCATGAAGAGGGGGAGTCGGGTAGCCTTTATTTTGATCTGCCAATGGAATTGATAAGGAACAGCGTGAGTGCACAGAGTCAATCACCAGTAGGGGCAG
CTTTTGTATTTGATGGAAAGGGAGTTCGGAACAAACCGAAATTGGCGGAGGAATCAGGAGCTGCATCCCCATGCATCATTACCCCACGCTTGCGCAAAGCTAGACAAGAG
TTCAATGCTCTTTTGGAAGCTCACACTATCATTCTGTGA
Protein sequenceShow/hide protein sequence
MKLQYASDGIVPLSVVPSSRRYEFVEDVVVGVSRQLSAPNSGYSSPRLIGKKKVRDGLNRSKSCGDGRGKAAPHGLIENKIMAWEGGDKHKTEEGKGRRFKCGRYACWRM
KGKEEKKEEAEEGECISISISRRVSLEKFECGSWASSGMVVHEEGESGSLYFDLPMELIRNSVSAQSQSPVGAAFVFDGKGVRNKPKLAEESGAASPCIITPRLRKARQE
FNALLEAHTIIL