; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10015768 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10015768
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionYcf3-interacting protein 1
Genome locationChr03:110877..111641
RNA-Seq ExpressionHG10015768
SyntenyHG10015768
Gene Ontology termsGO:0048564 - photosystem I assembly (biological process)
GO:0080183 - response to photooxidative stress (biological process)
GO:0009535 - chloroplast thylakoid membrane (cellular component)
InterPro domainsIPR040340 - Chloroplast enhancing stress tolerance protein


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_011658372.1 uncharacterized protein LOC105435976 [Cucumis sativus]9.5e-9879.69Show/hide
Query:  MADTLVPLSVGTASRSFEFVEDVVIEVSRQLKLESYSVPNSAYSSPRLAAKKKVRDGGRGLNRSKSCGEGRGKAPPHGLIENKVMVWEKGEKHKTEEGKA
        MA  +VPLSVGT+SR +EFVEDVVIEVSRQL     S PNSAYSSPRLA KKK RD   GLNRS+SCGEGRGKA PHGLIENKVMVWEKG+KHKTEEGK 
Subjt:  MADTLVPLSVGTASRSFEFVEDVVIEVSRQLKLESYSVPNSAYSSPRLAAKKKVRDGGRGLNRSKSCGEGRGKAPPHGLIENKVMVWEKGEKHKTEEGKA

Query:  RRFRCGAVLC--LGGLGFRVGKGKGKGKGKGKEEAETEEGECISISMSRRVSLEKFECGSWASSGMVVHEDGESTSGTGSLYFDLPMELIRNSVGAQTQS
        RRFRC   LC  L  LGF+VGKG+ KGK + +EEA  EEGECISIS+SRRVSLEKFECGSWASSGMVVHEDGES    GSLYFDLPMELIRNSV AQT  
Subjt:  RRFRCGAVLC--LGGLGFRVGKGKGKGKGKGKEEAETEEGECISISMSRRVSLEKFECGSWASSGMVVHEDGESTSGTGSLYFDLPMELIRNSVGAQTQS

Query:  QSPVGAAFVFDHHLPVWTKPKLAEESGAASPCIITPRLRKARQEFNALLEAHTHTL
        QSPVGAAFVF+    VW KPKLAEESGAASPCIITPRLRKARQEFNALLEAHTH L
Subjt:  QSPVGAAFVFDHHLPVWTKPKLAEESGAASPCIITPRLRKARQEFNALLEAHTHTL

XP_022925963.1 uncharacterized protein LOC111433224 [Cucurbita moschata]2.8e-9776.69Show/hide
Query:  MADTLVPLSVGTASRSFEFVEDVVIEVSRQLKLESYSVPNSAYSSPRL----AAKKKVRDGGRGLNRSKSCGEGRGKAPPHGLIENKVMVWEKGEKHKTE
        +AD L PLSVGT  R +E VEDVVIEVS Q KLESYS PNSAYSSP L    AAKKKV DGGRGLNRSKSCGEGRGKA PHGLIEN+VM+WEKG KHKTE
Subjt:  MADTLVPLSVGTASRSFEFVEDVVIEVSRQLKLESYSVPNSAYSSPRL----AAKKKVRDGGRGLNRSKSCGEGRGKAPPHGLIENKVMVWEKGEKHKTE

Query:  EGKARRFRCGAVLC-----LGGLGFRVGKGKGKGKGKGKEEAETEEGECISISMSR-RVSLEKFECGSWASSGMVVHEDGESTSGTGSLYFDLPMELIRN
        EGKARRFRCGA LC     LGGLGF+VGKGK + K +  EE E E G CISIS+S  RVSLEKFECGSWASSGMV HEDGES SG GSLYFDLPMELIRN
Subjt:  EGKARRFRCGAVLC-----LGGLGFRVGKGKGKGKGKGKEEAETEEGECISISMSR-RVSLEKFECGSWASSGMVVHEDGESTSGTGSLYFDLPMELIRN

Query:  SVGAQTQSQSPVGAAFVFD------HHLPVWTKPKLAEESGAASPCIITPRLRKARQEFNALLEAH
        SVGA+T  QSP   AFVF+      HHLPVWTK KLAEESGAASPC+ITPRLR+AR+EFNALLEAH
Subjt:  SVGAQTQSQSPVGAAFVFD------HHLPVWTKPKLAEESGAASPCIITPRLRKARQEFNALLEAH

XP_022978627.1 uncharacterized protein LOC111478548 [Cucurbita maxima]9.5e-9876.69Show/hide
Query:  MADTLVPLSVGTASRSFEFVEDVVIEVSRQLKLESYSVPNSAYSSPRL----AAKKKVRDGGRGLNRSKSCGEGRGKAPPHGLIENKVMVWEKGEKHKTE
        +AD L PLSVGT  RS+E VEDVVI+VS Q KLESYS PNSAYSSP L    AAKKKV DGGRGLNRSKSCGEGRGKA PHGLI+N+VM+WEKG KHKTE
Subjt:  MADTLVPLSVGTASRSFEFVEDVVIEVSRQLKLESYSVPNSAYSSPRL----AAKKKVRDGGRGLNRSKSCGEGRGKAPPHGLIENKVMVWEKGEKHKTE

Query:  EGKARRFRCGAVLC-----LGGLGFRVGKGKGKGKGKGKEEAETEEGECISISMSR-RVSLEKFECGSWASSGMVVHEDGESTSGTGSLYFDLPMELIRN
        EGKARRFRCGA LC     LGGLGF+VGKGK + K +  EE E E G CISIS+S  RVSLEKFECGSWASSGMV HEDGES +G GSLYFDLPMELIRN
Subjt:  EGKARRFRCGAVLC-----LGGLGFRVGKGKGKGKGKGKEEAETEEGECISISMSR-RVSLEKFECGSWASSGMVVHEDGESTSGTGSLYFDLPMELIRN

Query:  SVGAQTQSQSPVGAAFVFD------HHLPVWTKPKLAEESGAASPCIITPRLRKARQEFNALLEAH
        SVGA+T  QSP  AAFVFD      HHLPVWTK KLAEESGAASPC+ITPRLR+AR+EFNALLEAH
Subjt:  SVGAQTQSQSPVGAAFVFD------HHLPVWTKPKLAEESGAASPCIITPRLRKARQEFNALLEAH

XP_023543164.1 uncharacterized protein LOC111803119 [Cucurbita pepo subsp. pepo]1.6e-9776.69Show/hide
Query:  MADTLVPLSVGTASRSFEFVEDVVIEVSRQLKLESYSVPNSAYSSPRL----AAKKKVRDGGRGLNRSKSCGEGRGKAPPHGLIENKVMVWEKGEKHKTE
        +AD L PLSVGT  R +E VEDVVIEVS Q KLESYS PNSAYSSP L    AAKKKV DGGRGLNRSKSCGEGRG+A PHGLIEN+VM+WEKG KHKTE
Subjt:  MADTLVPLSVGTASRSFEFVEDVVIEVSRQLKLESYSVPNSAYSSPRL----AAKKKVRDGGRGLNRSKSCGEGRGKAPPHGLIENKVMVWEKGEKHKTE

Query:  EGKARRFRCGAVLC-----LGGLGFRVGKGKGKGKGKGKEEAETEEGECISISMSR-RVSLEKFECGSWASSGMVVHEDGESTSGTGSLYFDLPMELIRN
        EGKARRFRCGA LC     LGGLGF+VGKGK + K +  EE E E G CISIS+S  RVSLEKFECGSWASSGMV HEDGES SG GSLYFDLPMELIRN
Subjt:  EGKARRFRCGAVLC-----LGGLGFRVGKGKGKGKGKGKEEAETEEGECISISMSR-RVSLEKFECGSWASSGMVVHEDGESTSGTGSLYFDLPMELIRN

Query:  SVGAQTQSQSPVGAAFVFD------HHLPVWTKPKLAEESGAASPCIITPRLRKARQEFNALLEAH
        SVGA+T  QSP   AFVFD      HHLPVWTK KLAEESGAASPC+ITPRLR+AR+EFNALLEAH
Subjt:  SVGAQTQSQSPVGAAFVFD------HHLPVWTKPKLAEESGAASPCIITPRLRKARQEFNALLEAH

XP_038877520.1 uncharacterized protein LOC120069777 [Benincasa hispida]1.7e-10784.44Show/hide
Query:  MADTLVPLSVGTASRSFEFVEDVVIEVSRQLKLESYSVPNSAYSSPRLAAKKKVRDGGRGLNRSKSCGEGRGKAPPHGLIENKVMVWEKGEKHKTEEGKA
        MADT+VPLSVGT SRS+EFV+DVVIEVS QLKL SYSVPNSAYSSPRLAAKKKV D GRGLNRSKSCGEGRGKA PH LIENKVMVWEKG KHKT EGKA
Subjt:  MADTLVPLSVGTASRSFEFVEDVVIEVSRQLKLESYSVPNSAYSSPRLAAKKKVRDGGRGLNRSKSCGEGRGKAPPHGLIENKVMVWEKGEKHKTEEGKA

Query:  RRFRCGAVLC-----LGGLGFRVGKGKGKGKGKGKEEAETEEGECISISMSRRVSLEKFECGSWASSGMVVHEDGESTSGTGSLYFDLPMELIRNSVGAQ
        +RFRCGA LC     LGGLGF+VGKGK KGK + KEEA  EEGECISIS+SRRVSLEKFECGSWASSGMVVHEDGE     GS YFDLPMELIRNSVG Q
Subjt:  RRFRCGAVLC-----LGGLGFRVGKGKGKGKGKGKEEAETEEGECISISMSRRVSLEKFECGSWASSGMVVHEDGESTSGTGSLYFDLPMELIRNSVGAQ

Query:  TQSQSPVGAAFVFD-HHLPVWTKPKLAEESGAASPCIITPRLRKARQEFNALLEAHT
        T  QSPVGAAFVFD HHLP+WTKP LAEESGAASPCIITPRLRKAR+EFNALLEAHT
Subjt:  TQSQSPVGAAFVFD-HHLPVWTKPKLAEESGAASPCIITPRLRKARQEFNALLEAHT

TrEMBL top hitse value%identityAlignment
A0A0A0KME4 Uncharacterized protein4.6e-9879.69Show/hide
Query:  MADTLVPLSVGTASRSFEFVEDVVIEVSRQLKLESYSVPNSAYSSPRLAAKKKVRDGGRGLNRSKSCGEGRGKAPPHGLIENKVMVWEKGEKHKTEEGKA
        MA  +VPLSVGT+SR +EFVEDVVIEVSRQL     S PNSAYSSPRLA KKK RD   GLNRS+SCGEGRGKA PHGLIENKVMVWEKG+KHKTEEGK 
Subjt:  MADTLVPLSVGTASRSFEFVEDVVIEVSRQLKLESYSVPNSAYSSPRLAAKKKVRDGGRGLNRSKSCGEGRGKAPPHGLIENKVMVWEKGEKHKTEEGKA

Query:  RRFRCGAVLC--LGGLGFRVGKGKGKGKGKGKEEAETEEGECISISMSRRVSLEKFECGSWASSGMVVHEDGESTSGTGSLYFDLPMELIRNSVGAQTQS
        RRFRC   LC  L  LGF+VGKG+ KGK + +EEA  EEGECISIS+SRRVSLEKFECGSWASSGMVVHEDGES    GSLYFDLPMELIRNSV AQT  
Subjt:  RRFRCGAVLC--LGGLGFRVGKGKGKGKGKGKEEAETEEGECISISMSRRVSLEKFECGSWASSGMVVHEDGESTSGTGSLYFDLPMELIRNSVGAQTQS

Query:  QSPVGAAFVFDHHLPVWTKPKLAEESGAASPCIITPRLRKARQEFNALLEAHTHTL
        QSPVGAAFVF+    VW KPKLAEESGAASPCIITPRLRKARQEFNALLEAHTH L
Subjt:  QSPVGAAFVFDHHLPVWTKPKLAEESGAASPCIITPRLRKARQEFNALLEAHTHTL

A0A1S3AZD3 uncharacterized protein LOC1034842327.6e-9376.49Show/hide
Query:  ADTLVPLSVGTASRSFEFVEDVVIEVSRQLKLESYSVPNSAYSSPRLAAKKKVRDGGRGLNRSKSCGEGRGKAPPHGLIENKVMVWEKGEKHKTEEGKAR
        +D +VPLSV  +SR +EFVEDVV+ VSRQL     S PNS YSSPRL  KKKVRD   GLNRSKSCG+GRGKA PHGLIENK+M WE G+KHKTEEGK R
Subjt:  ADTLVPLSVGTASRSFEFVEDVVIEVSRQLKLESYSVPNSAYSSPRLAAKKKVRDGGRGLNRSKSCGEGRGKAPPHGLIENKVMVWEKGEKHKTEEGKAR

Query:  RFRCGAV-LCLGGLGFRVGKGKGKGKGKGKEEAETEEGECISISMSRRVSLEKFECGSWASSGMVVHEDGESTSGTGSLYFDLPMELIRNSVGAQTQSQS
        RF+CGA+ L L  LGF+VGKG+ KGK + KEEA  EEGECISIS+SRRVSL+KFECGSWASSGMVVHE+GES    GSLYFDLPMELIRNSV A  QSQS
Subjt:  RFRCGAV-LCLGGLGFRVGKGKGKGKGKGKEEAETEEGECISISMSRRVSLEKFECGSWASSGMVVHEDGESTSGTGSLYFDLPMELIRNSVGAQTQSQS

Query:  PVGAAFVFDHHLPVWTKPKLAEESGAASPCIITPRLRKARQEFNALLEAHT
        PVGAAFVFD    VW KPKLA+ESGAASPCIITPRLRKARQEFNALLEAHT
Subjt:  PVGAAFVFDHHLPVWTKPKLAEESGAASPCIITPRLRKARQEFNALLEAHT

A0A5A7UFW0 Ycf3-interacting protein 17.1e-9176.89Show/hide
Query:  ADTLVPLSVGTASRSFEFVEDVVIEVSRQLKLESYSVPNSAYSSPRLAAKKKVRDGGRGLNRSKSCGEGRGKAPPHGLIENKVMVWEKGEKHKTEEGKAR
        +D +VPLSV  +SR +EFVEDVV+ VSRQL     S PNS YSSPRL  KKKVRD   GLNRSKSCG+GRGKA PHGLIENK+M WE G+KHKTEEGK R
Subjt:  ADTLVPLSVGTASRSFEFVEDVVIEVSRQLKLESYSVPNSAYSSPRLAAKKKVRDGGRGLNRSKSCGEGRGKAPPHGLIENKVMVWEKGEKHKTEEGKAR

Query:  RFRCGAV-LCLGGLGFRVGKGKGKGKGKGKEEAETEEGECISISMSRRVSLEKFECGSWASSGMVVHEDGESTSGTGSLYFDLPMELIRNSVGAQTQSQS
        RF+CGA+ L L  LGF+VGKG+ KGK + KEEA  EEGECISIS+SRRVSLEKFECGSWASSGMVVHE+GES    GSLYFDLPMELIRNSV A  QSQS
Subjt:  RFRCGAV-LCLGGLGFRVGKGKGKGKGKGKEEAETEEGECISISMSRRVSLEKFECGSWASSGMVVHEDGESTSGTGSLYFDLPMELIRNSVGAQTQSQS

Query:  PVGAAFVFDHHLPVWTKPKLAEESGAASPCIITPRLRKARQEFNALLEAHT
        PVGAAFVFD    V  KPKLAEESGAASPCIITPRLRKARQEFNALLEAHT
Subjt:  PVGAAFVFDHHLPVWTKPKLAEESGAASPCIITPRLRKARQEFNALLEAHT

A0A6J1EGQ7 uncharacterized protein LOC1114332241.3e-9776.69Show/hide
Query:  MADTLVPLSVGTASRSFEFVEDVVIEVSRQLKLESYSVPNSAYSSPRL----AAKKKVRDGGRGLNRSKSCGEGRGKAPPHGLIENKVMVWEKGEKHKTE
        +AD L PLSVGT  R +E VEDVVIEVS Q KLESYS PNSAYSSP L    AAKKKV DGGRGLNRSKSCGEGRGKA PHGLIEN+VM+WEKG KHKTE
Subjt:  MADTLVPLSVGTASRSFEFVEDVVIEVSRQLKLESYSVPNSAYSSPRL----AAKKKVRDGGRGLNRSKSCGEGRGKAPPHGLIENKVMVWEKGEKHKTE

Query:  EGKARRFRCGAVLC-----LGGLGFRVGKGKGKGKGKGKEEAETEEGECISISMSR-RVSLEKFECGSWASSGMVVHEDGESTSGTGSLYFDLPMELIRN
        EGKARRFRCGA LC     LGGLGF+VGKGK + K +  EE E E G CISIS+S  RVSLEKFECGSWASSGMV HEDGES SG GSLYFDLPMELIRN
Subjt:  EGKARRFRCGAVLC-----LGGLGFRVGKGKGKGKGKGKEEAETEEGECISISMSR-RVSLEKFECGSWASSGMVVHEDGESTSGTGSLYFDLPMELIRN

Query:  SVGAQTQSQSPVGAAFVFD------HHLPVWTKPKLAEESGAASPCIITPRLRKARQEFNALLEAH
        SVGA+T  QSP   AFVF+      HHLPVWTK KLAEESGAASPC+ITPRLR+AR+EFNALLEAH
Subjt:  SVGAQTQSQSPVGAAFVFD------HHLPVWTKPKLAEESGAASPCIITPRLRKARQEFNALLEAH

A0A6J1ILL2 uncharacterized protein LOC1114785484.6e-9876.69Show/hide
Query:  MADTLVPLSVGTASRSFEFVEDVVIEVSRQLKLESYSVPNSAYSSPRL----AAKKKVRDGGRGLNRSKSCGEGRGKAPPHGLIENKVMVWEKGEKHKTE
        +AD L PLSVGT  RS+E VEDVVI+VS Q KLESYS PNSAYSSP L    AAKKKV DGGRGLNRSKSCGEGRGKA PHGLI+N+VM+WEKG KHKTE
Subjt:  MADTLVPLSVGTASRSFEFVEDVVIEVSRQLKLESYSVPNSAYSSPRL----AAKKKVRDGGRGLNRSKSCGEGRGKAPPHGLIENKVMVWEKGEKHKTE

Query:  EGKARRFRCGAVLC-----LGGLGFRVGKGKGKGKGKGKEEAETEEGECISISMSR-RVSLEKFECGSWASSGMVVHEDGESTSGTGSLYFDLPMELIRN
        EGKARRFRCGA LC     LGGLGF+VGKGK + K +  EE E E G CISIS+S  RVSLEKFECGSWASSGMV HEDGES +G GSLYFDLPMELIRN
Subjt:  EGKARRFRCGAVLC-----LGGLGFRVGKGKGKGKGKGKEEAETEEGECISISMSR-RVSLEKFECGSWASSGMVVHEDGESTSGTGSLYFDLPMELIRN

Query:  SVGAQTQSQSPVGAAFVFD------HHLPVWTKPKLAEESGAASPCIITPRLRKARQEFNALLEAH
        SVGA+T  QSP  AAFVFD      HHLPVWTK KLAEESGAASPC+ITPRLR+AR+EFNALLEAH
Subjt:  SVGAQTQSQSPVGAAFVFD------HHLPVWTKPKLAEESGAASPCIITPRLRKARQEFNALLEAH

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G30850.1 root hair specific 42.1e-1032.11Show/hide
Query:  KARRFRCGAVLCLGGLGFRVGKGKGKGKGKGKEEAETEEGECISIS------MSRRVSLEKFECGSWASSGMVVHEDGESTSGTGSLYFDLPMELIR-NS
        K   F+C A  CL   GF    GK K      +   + E + I  S      +S R SLEKFECGSWAS+  ++ ++G        L+FD P+E+ + NS
Subjt:  KARRFRCGAVLCLGGLGFRVGKGKGKGKGKGKEEAETEEGECISIS------MSRRVSLEKFECGSWASSGMVVHEDGESTSGTGSLYFDLPMELIR-NS

Query:  VGAQ--TQSQSPVGAAFVF-------------------DHHLPVWTKPK-----LAEESGAASPC------IITPRLRKARQEFNALLEA
         G       Q PV + F+F                   DH     + P+         S A+  C       ITPRLRKAR +FN  L A
Subjt:  VGAQ--TQSQSPVGAAFVF-------------------DHHLPVWTKPK-----LAEESGAASPC------IITPRLRKARQEFNALLEA

AT2G34910.1 BEST Arabidopsis thaliana protein match is: root hair specific 4 (TAIR:AT1G30850.1)6.7e-0932.02Show/hide
Query:  FRCGA-VLCLGGLGFRVGKGKGKGKGKGKEEAETEEGECISISMSRRVSLEKFECGSWASSGMVVHEDGESTSGTGSLYFDLPMELIRNSVGAQTQSQSP
        F+C A  L L G G R  +         K+  +       ++S+S   SLEKFECGSWAS+  +  E+G        LY DLP+E+I+   G     Q P
Subjt:  FRCGA-VLCLGGLGFRVGKGKGKGKGKGKEEAETEEGECISISMSRRVSLEKFECGSWASSGMVVHEDGESTSGTGSLYFDLPMELIRNSVGAQTQSQSP

Query:  VGAAFVFDHHLP----------------------VWTKPK-------LAEESGAASP-CIITPRLRKARQEFNALLEA
        V + F FD                            T P+          +S  ASP   ITPRL KAR +FN  L A
Subjt:  VGAAFVFDHHLP----------------------VWTKPK-------LAEESGAASP-CIITPRLRKARQEFNALLEA

AT4G20190.1 unknown protein1.1e-1428.17Show/hide
Query:  VEDVVIEVSRQLKLESYSVPNSAYSSPRLAAKKK--------------VRDGGRGLNRSKSCGEGRGKAP--PHGLIENKVMVWEKGEKH----------
        ++D+ ++   + K  S S+PNSA +SPR ++                 V+D      RSKSCGEGR   P     ++ +K       + H          
Subjt:  VEDVVIEVSRQLKLESYSVPNSAYSSPRLAAKKK--------------VRDGGRGLNRSKSCGEGRGKAP--PHGLIENKVMVWEKGEKH----------

Query:  --------------KTEEGKARR-----------------FRCGAVLCLGGLGFRVGKG-KGKGKGKGKEEAETEEGECISIS----------MSRRVSL
                      KTE  K+ R                 F+C A LCL   GF  GK  +   KG       T      S++          +S R SL
Subjt:  --------------KTEEGKARR-----------------FRCGAVLCLGGLGFRVGKG-KGKGKGKGKEEAETEEGECISIS----------MSRRVSL

Query:  EKFECGSWASSGMVVHEDGESTSGTGSLYFDLPMELIRNSVGAQTQSQSPVGAAFVFDHHLPV-----------WTKPKLAEES------GAASPC----
        E+FECGSW SS M+  ++ +     G  +FDLP ELI+   G   Q   PV AAFVFD    +            +K + + ES        +SP     
Subjt:  EKFECGSWASSGMVVHEDGESTSGTGSLYFDLPMELIRNSVGAQTQSQSPVGAAFVFDHHLPV-----------WTKPKLAEES------GAASPC----

Query:  ----IITPRLRKARQEFNALLEA
             ITPRL +A ++F++ LEA
Subjt:  ----IITPRLRKARQEFNALLEA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTGATACATTAGTACCTTTATCCGTTGGCACCGCTAGCAGGAGTTTCGAATTTGTTGAGGATGTGGTTATTGAGGTATCGAGGCAATTGAAGTTGGAAAGCTACAG
CGTCCCGAACTCGGCCTATTCATCCCCTCGGTTGGCAGCAAAAAAGAAAGTCCGTGATGGCGGGCGGGGGCTGAATCGGAGCAAATCCTGTGGTGAAGGAAGAGGGAAGG
CACCGCCGCATGGCCTTATTGAGAATAAAGTAATGGTATGGGAGAAAGGGGAGAAGCACAAAACAGAGGAGGGGAAAGCGAGGCGTTTCAGATGTGGAGCAGTACTATGC
TTAGGAGGGTTAGGGTTTAGGGTTGGGAAAGGGAAAGGGAAAGGGAAAGGGAAAGGGAAGGAAGAGGCAGAGACAGAGGAAGGTGAGTGTATATCCATATCCATGTCGAG
GAGAGTTTCTTTGGAAAAATTCGAATGTGGGTCGTGGGCTTCGTCGGGCATGGTGGTTCATGAGGACGGGGAGTCGACGTCAGGGACGGGGAGCCTTTATTTTGATCTGC
CAATGGAATTGATAAGGAACAGCGTGGGTGCACAAACACAATCACAATCACCAGTAGGGGCCGCTTTTGTATTCGATCATCATCTTCCTGTTTGGACCAAACCAAAATTG
GCGGAGGAATCAGGAGCTGCATCTCCATGCATCATTACCCCACGCTTGCGCAAAGCCAGACAGGAGTTCAATGCTCTTTTGGAAGCTCACACTCACACTCTCTAA
mRNA sequenceShow/hide mRNA sequence
ATGGCTGATACATTAGTACCTTTATCCGTTGGCACCGCTAGCAGGAGTTTCGAATTTGTTGAGGATGTGGTTATTGAGGTATCGAGGCAATTGAAGTTGGAAAGCTACAG
CGTCCCGAACTCGGCCTATTCATCCCCTCGGTTGGCAGCAAAAAAGAAAGTCCGTGATGGCGGGCGGGGGCTGAATCGGAGCAAATCCTGTGGTGAAGGAAGAGGGAAGG
CACCGCCGCATGGCCTTATTGAGAATAAAGTAATGGTATGGGAGAAAGGGGAGAAGCACAAAACAGAGGAGGGGAAAGCGAGGCGTTTCAGATGTGGAGCAGTACTATGC
TTAGGAGGGTTAGGGTTTAGGGTTGGGAAAGGGAAAGGGAAAGGGAAAGGGAAAGGGAAGGAAGAGGCAGAGACAGAGGAAGGTGAGTGTATATCCATATCCATGTCGAG
GAGAGTTTCTTTGGAAAAATTCGAATGTGGGTCGTGGGCTTCGTCGGGCATGGTGGTTCATGAGGACGGGGAGTCGACGTCAGGGACGGGGAGCCTTTATTTTGATCTGC
CAATGGAATTGATAAGGAACAGCGTGGGTGCACAAACACAATCACAATCACCAGTAGGGGCCGCTTTTGTATTCGATCATCATCTTCCTGTTTGGACCAAACCAAAATTG
GCGGAGGAATCAGGAGCTGCATCTCCATGCATCATTACCCCACGCTTGCGCAAAGCCAGACAGGAGTTCAATGCTCTTTTGGAAGCTCACACTCACACTCTCTAA
Protein sequenceShow/hide protein sequence
MADTLVPLSVGTASRSFEFVEDVVIEVSRQLKLESYSVPNSAYSSPRLAAKKKVRDGGRGLNRSKSCGEGRGKAPPHGLIENKVMVWEKGEKHKTEEGKARRFRCGAVLC
LGGLGFRVGKGKGKGKGKGKEEAETEEGECISISMSRRVSLEKFECGSWASSGMVVHEDGESTSGTGSLYFDLPMELIRNSVGAQTQSQSPVGAAFVFDHHLPVWTKPKL
AEESGAASPCIITPRLRKARQEFNALLEAHTHTL