; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MC02g0065 (gene) of Bitter gourd (Dali-11) v1 genome

Gene IDMC02g0065
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
Descriptionprotein EMBRYO SAC DEVELOPMENT ARREST 3, chloroplastic
Genome locationMC02:580100..584753
RNA-Seq ExpressionMC02g0065
SyntenyMC02g0065
Gene Ontology termsGO:0048564 - photosystem I assembly (biological process)
GO:0009507 - chloroplast (cellular component)
GO:0047134 - protein-disulfide reductase activity (molecular function)
InterPro domainsIPR036410 - Heat shock protein DnaJ, cysteine-rich domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_011653566.1 protein PHOTOSYSTEM I ASSEMBLY 2, chloroplastic isoform X2 [Cucumis sativus]1.39e-10985.33Show/hide
Query:  MASSSHLSAIPLRLSSTSTRTNLSHAKSKPIVLNGTSNLDNESCSSGDSNTPSKPLKGTQFLISRRWCLTCLCSSMTLIKDFGGT-TGASANTMDGKPVC
        MASSSHLSAIPLR  S+S+  +LSH+  KP+VL+ TSN D+ESCS+GDS TPSKPLKGTQ LISRRWCLTCLCSS+TL+K +GGT T A ANTMDGKP C
Subjt:  MASSSHLSAIPLRLSSTSTRTNLSHAKSKPIVLNGTSNLDNESCSSGDSNTPSKPLKGTQFLISRRWCLTCLCSSMTLIKDFGGT-TGASANTMDGKPVC

Query:  RNCGGSGAVLCDMCGGTGKWKALNRKRAKDVYEFTECPNCYGRGKLVCPVCLGTGLPNNKGLLRRPDARKLLDKMYNGRLLPNS
        RNCGGSGAVLCDMCGGTGKWKALNRKRAKDVYEFTECPNCYGRGKLVCPVCLGTGLPNNKGLLRRPDAR+LLDKMYNGRLLPNS
Subjt:  RNCGGSGAVLCDMCGGTGKWKALNRKRAKDVYEFTECPNCYGRGKLVCPVCLGTGLPNNKGLLRRPDARKLLDKMYNGRLLPNS

XP_016900745.1 PREDICTED: protein EMBRYO SAC DEVELOPMENT ARREST 3, chloroplastic isoform X1 [Cucumis melo]3.05e-11386.89Show/hide
Query:  MMASSSHLSAIPLRLSSTSTRTNLSHAKSKPIVLNGTSNLDNESCSSGDSNTPSKPLKGTQFLISRRWCLTCLCSSMTLIKDFGGTTGASANTMDGKPVC
        MMASSSHLSAIPLR  S+S+  +LSH+ SKP+VL+ TSN D+ESCS+GDS TPSKPLKGTQ LISRRWCLTCLCSS+TL+K++GGTT A ANTMDGKPVC
Subjt:  MMASSSHLSAIPLRLSSTSTRTNLSHAKSKPIVLNGTSNLDNESCSSGDSNTPSKPLKGTQFLISRRWCLTCLCSSMTLIKDFGGTTGASANTMDGKPVC

Query:  RNCGGSGAVLCDMCGGTGKWKALNRKRAKDVYEFTECPNCYGRGKLVCPVCLGTGLPNNKGLLRRPDARKLLDKMYNGRLLPN
        RNCGGSGAVLCDMCGGTGKWKALNRKRAKDVYEFTECPNCYGRGKLVCPVCLGTGLPNNKGLLRRPDAR+LLDKMYNGRLLPN
Subjt:  RNCGGSGAVLCDMCGGTGKWKALNRKRAKDVYEFTECPNCYGRGKLVCPVCLGTGLPNNKGLLRRPDARKLLDKMYNGRLLPN

XP_022153852.1 protein EMBRYO SAC DEVELOPMENT ARREST 3, chloroplastic [Momordica charantia]1.87e-132100Show/hide
Query:  MMASSSHLSAIPLRLSSTSTRTNLSHAKSKPIVLNGTSNLDNESCSSGDSNTPSKPLKGTQFLISRRWCLTCLCSSMTLIKDFGGTTGASANTMDGKPVC
        MMASSSHLSAIPLRLSSTSTRTNLSHAKSKPIVLNGTSNLDNESCSSGDSNTPSKPLKGTQFLISRRWCLTCLCSSMTLIKDFGGTTGASANTMDGKPVC
Subjt:  MMASSSHLSAIPLRLSSTSTRTNLSHAKSKPIVLNGTSNLDNESCSSGDSNTPSKPLKGTQFLISRRWCLTCLCSSMTLIKDFGGTTGASANTMDGKPVC

Query:  RNCGGSGAVLCDMCGGTGKWKALNRKRAKDVYEFTECPNCYGRGKLVCPVCLGTGLPNNKGLLRRPDARKLLDKMYNGRLLPNS
        RNCGGSGAVLCDMCGGTGKWKALNRKRAKDVYEFTECPNCYGRGKLVCPVCLGTGLPNNKGLLRRPDARKLLDKMYNGRLLPNS
Subjt:  RNCGGSGAVLCDMCGGTGKWKALNRKRAKDVYEFTECPNCYGRGKLVCPVCLGTGLPNNKGLLRRPDARKLLDKMYNGRLLPNS

XP_022958131.1 protein PHOTOSYSTEM I ASSEMBLY 2, chloroplastic [Cucurbita moschata]1.19e-11086.49Show/hide
Query:  MMASSSHLSAIPLRLSSTSTRTNLSHAKSKPIVLNGTSNLDNESCSSGDSNTPSKPLKGTQFLISRRWCLTCLCSSMTLIKDFGGTTG-ASANTMDGKPV
        MMASSSHLSAIPLR SS+S+R +LSH+ SKP+VL  TSNLD+ES ++GDS+TPSKPLKGT+ LISRRWCLTCLCSS TLIKD+GGT   A ANTMDGKP 
Subjt:  MMASSSHLSAIPLRLSSTSTRTNLSHAKSKPIVLNGTSNLDNESCSSGDSNTPSKPLKGTQFLISRRWCLTCLCSSMTLIKDFGGTTG-ASANTMDGKPV

Query:  CRNCGGSGAVLCDMCGGTGKWKALNRKRAKDVYEFTECPNCYGRGKLVCPVCLGTGLPNNKGLLRRPDARKLLDKMYNGRLLPNS
        CRNCGGSGAVLCDMCGGTGKWKALNRKRAKDVYEFTECPNCYGRGKLVCPVCLGTGLPNNKGLLRRPDARKLLDKM+NGRLLPNS
Subjt:  CRNCGGSGAVLCDMCGGTGKWKALNRKRAKDVYEFTECPNCYGRGKLVCPVCLGTGLPNNKGLLRRPDARKLLDKMYNGRLLPNS

XP_022996228.1 protein PHOTOSYSTEM I ASSEMBLY 2, chloroplastic [Cucurbita maxima]8.38e-11187.03Show/hide
Query:  MMASSSHLSAIPLRLSSTSTRTNLSHAKSKPIVLNGTSNLDNESCSSGDSNTPSKPLKGTQFLISRRWCLTCLCSSMTLIKDFGGTTG-ASANTMDGKPV
        MMASSSHLSAIPLR SS+S+R  LSH+ SKP+VL  TSNLD+ESCS+GD +TPSKPLKGT  LISRRWCLTCLCSS TLIKD+GGT   A ANTMDGKP 
Subjt:  MMASSSHLSAIPLRLSSTSTRTNLSHAKSKPIVLNGTSNLDNESCSSGDSNTPSKPLKGTQFLISRRWCLTCLCSSMTLIKDFGGTTG-ASANTMDGKPV

Query:  CRNCGGSGAVLCDMCGGTGKWKALNRKRAKDVYEFTECPNCYGRGKLVCPVCLGTGLPNNKGLLRRPDARKLLDKMYNGRLLPNS
        CRNCGGSGAVLCDMCGGTGKWKALNRKRAKDVYEFTECPNCYGRGKLVCPVCLGTGLPNNKGLLRRPDARKLLDKM+NGRLLPNS
Subjt:  CRNCGGSGAVLCDMCGGTGKWKALNRKRAKDVYEFTECPNCYGRGKLVCPVCLGTGLPNNKGLLRRPDARKLLDKMYNGRLLPNS

TrEMBL top hitse value%identityAlignment
A0A0A0KX46 Uncharacterized protein1.64e-10885.41Show/hide
Query:  MASSSHLSAIPLRLSSTSTRTNLSH-AKSKPIVLNGTSNLDNESCSSGDSNTPSKPLKGTQFLISRRWCLTCLCSSMTLIKDFGGT-TGASANTMDGKPV
        MASSSHLSAIPLR  S+S+  +LSH A  KP+VL+ TSN D+ESCS+GDS TPSKPLKGTQ LISRRWCLTCLCSS+TL+K +GGT T A ANTMDGKP 
Subjt:  MASSSHLSAIPLRLSSTSTRTNLSH-AKSKPIVLNGTSNLDNESCSSGDSNTPSKPLKGTQFLISRRWCLTCLCSSMTLIKDFGGT-TGASANTMDGKPV

Query:  CRNCGGSGAVLCDMCGGTGKWKALNRKRAKDVYEFTECPNCYGRGKLVCPVCLGTGLPNNKGLLRRPDARKLLDKMYNGRLLPNS
        CRNCGGSGAVLCDMCGGTGKWKALNRKRAKDVYEFTECPNCYGRGKLVCPVCLGTGLPNNKGLLRRPDAR+LLDKMYNGRLLPNS
Subjt:  CRNCGGSGAVLCDMCGGTGKWKALNRKRAKDVYEFTECPNCYGRGKLVCPVCLGTGLPNNKGLLRRPDARKLLDKMYNGRLLPNS

A0A1S4DYE8 protein EMBRYO SAC DEVELOPMENT ARREST 3, chloroplastic isoform X11.48e-11386.89Show/hide
Query:  MMASSSHLSAIPLRLSSTSTRTNLSHAKSKPIVLNGTSNLDNESCSSGDSNTPSKPLKGTQFLISRRWCLTCLCSSMTLIKDFGGTTGASANTMDGKPVC
        MMASSSHLSAIPLR  S+S+  +LSH+ SKP+VL+ TSN D+ESCS+GDS TPSKPLKGTQ LISRRWCLTCLCSS+TL+K++GGTT A ANTMDGKPVC
Subjt:  MMASSSHLSAIPLRLSSTSTRTNLSHAKSKPIVLNGTSNLDNESCSSGDSNTPSKPLKGTQFLISRRWCLTCLCSSMTLIKDFGGTTGASANTMDGKPVC

Query:  RNCGGSGAVLCDMCGGTGKWKALNRKRAKDVYEFTECPNCYGRGKLVCPVCLGTGLPNNKGLLRRPDARKLLDKMYNGRLLPN
        RNCGGSGAVLCDMCGGTGKWKALNRKRAKDVYEFTECPNCYGRGKLVCPVCLGTGLPNNKGLLRRPDAR+LLDKMYNGRLLPN
Subjt:  RNCGGSGAVLCDMCGGTGKWKALNRKRAKDVYEFTECPNCYGRGKLVCPVCLGTGLPNNKGLLRRPDARKLLDKMYNGRLLPN

A0A6J1DK36 protein EMBRYO SAC DEVELOPMENT ARREST 3, chloroplastic9.06e-133100Show/hide
Query:  MMASSSHLSAIPLRLSSTSTRTNLSHAKSKPIVLNGTSNLDNESCSSGDSNTPSKPLKGTQFLISRRWCLTCLCSSMTLIKDFGGTTGASANTMDGKPVC
        MMASSSHLSAIPLRLSSTSTRTNLSHAKSKPIVLNGTSNLDNESCSSGDSNTPSKPLKGTQFLISRRWCLTCLCSSMTLIKDFGGTTGASANTMDGKPVC
Subjt:  MMASSSHLSAIPLRLSSTSTRTNLSHAKSKPIVLNGTSNLDNESCSSGDSNTPSKPLKGTQFLISRRWCLTCLCSSMTLIKDFGGTTGASANTMDGKPVC

Query:  RNCGGSGAVLCDMCGGTGKWKALNRKRAKDVYEFTECPNCYGRGKLVCPVCLGTGLPNNKGLLRRPDARKLLDKMYNGRLLPNS
        RNCGGSGAVLCDMCGGTGKWKALNRKRAKDVYEFTECPNCYGRGKLVCPVCLGTGLPNNKGLLRRPDARKLLDKMYNGRLLPNS
Subjt:  RNCGGSGAVLCDMCGGTGKWKALNRKRAKDVYEFTECPNCYGRGKLVCPVCLGTGLPNNKGLLRRPDARKLLDKMYNGRLLPNS

A0A6J1H182 protein PHOTOSYSTEM I ASSEMBLY 2, chloroplastic5.76e-11186.49Show/hide
Query:  MMASSSHLSAIPLRLSSTSTRTNLSHAKSKPIVLNGTSNLDNESCSSGDSNTPSKPLKGTQFLISRRWCLTCLCSSMTLIKDFGGTTG-ASANTMDGKPV
        MMASSSHLSAIPLR SS+S+R +LSH+ SKP+VL  TSNLD+ES ++GDS+TPSKPLKGT+ LISRRWCLTCLCSS TLIKD+GGT   A ANTMDGKP 
Subjt:  MMASSSHLSAIPLRLSSTSTRTNLSHAKSKPIVLNGTSNLDNESCSSGDSNTPSKPLKGTQFLISRRWCLTCLCSSMTLIKDFGGTTG-ASANTMDGKPV

Query:  CRNCGGSGAVLCDMCGGTGKWKALNRKRAKDVYEFTECPNCYGRGKLVCPVCLGTGLPNNKGLLRRPDARKLLDKMYNGRLLPNS
        CRNCGGSGAVLCDMCGGTGKWKALNRKRAKDVYEFTECPNCYGRGKLVCPVCLGTGLPNNKGLLRRPDARKLLDKM+NGRLLPNS
Subjt:  CRNCGGSGAVLCDMCGGTGKWKALNRKRAKDVYEFTECPNCYGRGKLVCPVCLGTGLPNNKGLLRRPDARKLLDKMYNGRLLPNS

A0A6J1KA92 protein PHOTOSYSTEM I ASSEMBLY 2, chloroplastic4.06e-11187.03Show/hide
Query:  MMASSSHLSAIPLRLSSTSTRTNLSHAKSKPIVLNGTSNLDNESCSSGDSNTPSKPLKGTQFLISRRWCLTCLCSSMTLIKDFGGTTG-ASANTMDGKPV
        MMASSSHLSAIPLR SS+S+R  LSH+ SKP+VL  TSNLD+ESCS+GD +TPSKPLKGT  LISRRWCLTCLCSS TLIKD+GGT   A ANTMDGKP 
Subjt:  MMASSSHLSAIPLRLSSTSTRTNLSHAKSKPIVLNGTSNLDNESCSSGDSNTPSKPLKGTQFLISRRWCLTCLCSSMTLIKDFGGTTG-ASANTMDGKPV

Query:  CRNCGGSGAVLCDMCGGTGKWKALNRKRAKDVYEFTECPNCYGRGKLVCPVCLGTGLPNNKGLLRRPDARKLLDKMYNGRLLPNS
        CRNCGGSGAVLCDMCGGTGKWKALNRKRAKDVYEFTECPNCYGRGKLVCPVCLGTGLPNNKGLLRRPDARKLLDKM+NGRLLPNS
Subjt:  CRNCGGSGAVLCDMCGGTGKWKALNRKRAKDVYEFTECPNCYGRGKLVCPVCLGTGLPNNKGLLRRPDARKLLDKMYNGRLLPNS

SwissProt top hitse value%identityAlignment
A0A1D6KL43 Protein PHOTOSYSTEM I ASSEMBLY 2, chloroplastic4.7e-4964.05Show/hide
Query:  GTSNLDNESCSSGDSNTPSKPLKGTQFLISRRWCLTCLCSSMTLIKDFGGTTGASANTMD----GKPVCRNCGGSGAVLCDMCGGTGKWKALNRKRAKDV
        G +N+     +     TPS          SRR CL CL  ++TLI   G   G +A+ M+     K VCRNC GSGAV+CDMCGGTGKWKALNRKRAKDV
Subjt:  GTSNLDNESCSSGDSNTPSKPLKGTQFLISRRWCLTCLCSSMTLIKDFGGTTGASANTMD----GKPVCRNCGGSGAVLCDMCGGTGKWKALNRKRAKDV

Query:  YEFTECPNCYGRGKLVCPVCLGTGLPNNKGLLRRPDARKLLDKMYNGRLLPNS
        YEFTECPNCYGRGKLVCPVCLGTGLPNNKGLLRRP+A++LLDKMYNG++LP S
Subjt:  YEFTECPNCYGRGKLVCPVCLGTGLPNNKGLLRRPDARKLLDKMYNGRLLPNS

B5YAR4 Chaperone protein DnaJ1.9e-0530.95Show/hide
Query:  FGGTTGASANTMDGKPVCRNCG---GSGAVLCDMCGGTGKWKALNRKRAKDVYEFTECPNCYGRGKLV---CPVCLGTGLPNNK
        FG         ++  P C+  G   G+  V CDMC GTG+ + + +       + T CP C+G G+++   C  C GTG    K
Subjt:  FGGTTGASANTMDGKPVCRNCG---GSGAVLCDMCGGTGKWKALNRKRAKDVYEFTECPNCYGRGKLV---CPVCLGTGLPNNK

O64750 Protein PHOTOSYSTEM I ASSEMBLY 2, chloroplastic7.8e-5260Show/hide
Query:  MMASSSHLSAIPLRLSSTSTRTNLSH----AKSKPIVLNGTSNLDNESCSSGDSNTPSK---PLKGTQFLISRR-WCLTCLCSSMTLIKD-FGGTTGASA
        M ASSSHL A+P   S   +  N +     AKS P         +N+S  S DS++ S+     +G Q  +SRR W   C+C+S  LI + +   +  SA
Subjt:  MMASSSHLSAIPLRLSSTSTRTNLSH----AKSKPIVLNGTSNLDNESCSSGDSNTPSK---PLKGTQFLISRR-WCLTCLCSSMTLIKD-FGGTTGASA

Query:  NTMDGKP--VCRNCGGSGAVLCDMCGGTGKWKALNRKRAKDVYEFTECPNCYGRGKLVCPVCLGTGLPNNKGLLRRPDARKLLDKMYNGRLLPNS
          +D KP   CRNC GSGAVLCDMCGGTGKWKALNRKRAKDVYEFTECPNCYGRGKLVCPVCLGTGLPNNKGLLRRP AR+LL+KMYNGRLLP+S
Subjt:  NTMDGKP--VCRNCGGSGAVLCDMCGGTGKWKALNRKRAKDVYEFTECPNCYGRGKLVCPVCLGTGLPNNKGLLRRPDARKLLDKMYNGRLLPNS

Q6YUA8 Protein PHOTOSYSTEM I ASSEMBLY 2, chloroplastic8.6e-5168.92Show/hide
Query:  DNESCSSGDSNTPSKPLKGTQFLISRRWCLTCLCSSMTLIKDFGGT----TGASANTMDGKPVCRNCGGSGAVLCDMCGGTGKWKALNRKRAKDVYEFTE
        D E+CS+    T  K  + T    SRR CL CLC ++TLI   G T     G +++ M    VCRNC GSGAVLCDMCGGTGKWKALNRKRAKDVY FTE
Subjt:  DNESCSSGDSNTPSKPLKGTQFLISRRWCLTCLCSSMTLIKDFGGT----TGASANTMDGKPVCRNCGGSGAVLCDMCGGTGKWKALNRKRAKDVYEFTE

Query:  CPNCYGRGKLVCPVCLGTGLPNNKGLLRRPDARKLLDKMYNGRLLPNS
        CPNCYGRGKLVCPVCLGTGLPNNKGLLRRPDA+KLLDKMYNG++LP+S
Subjt:  CPNCYGRGKLVCPVCLGTGLPNNKGLLRRPDARKLLDKMYNGRLLPNS

Q7UA76 Chaperone protein DnaJ1.2e-0435.14Show/hide
Query:  CRNCGGSGA------VLCDMCGGTGKWKALNRKRAKDVYEFTECPNCYGRGKLV---CPVCLGTGLPNNKGLLR
        C  CGGSGA        C  CGG G+ +   R       +  ECPNC G G+++   C  C G G+   +  LR
Subjt:  CRNCGGSGA------VLCDMCGGTGKWKALNRKRAKDVYEFTECPNCYGRGKLV---CPVCLGTGLPNNKGLLR

Arabidopsis top hitse value%identityAlignment
AT2G34860.1 DnaJ/Hsp40 cysteine-rich domain superfamily protein5.5e-5360Show/hide
Query:  MMASSSHLSAIPLRLSSTSTRTNLSH----AKSKPIVLNGTSNLDNESCSSGDSNTPSK---PLKGTQFLISRR-WCLTCLCSSMTLIKD-FGGTTGASA
        M ASSSHL A+P   S   +  N +     AKS P         +N+S  S DS++ S+     +G Q  +SRR W   C+C+S  LI + +   +  SA
Subjt:  MMASSSHLSAIPLRLSSTSTRTNLSH----AKSKPIVLNGTSNLDNESCSSGDSNTPSK---PLKGTQFLISRR-WCLTCLCSSMTLIKD-FGGTTGASA

Query:  NTMDGKP--VCRNCGGSGAVLCDMCGGTGKWKALNRKRAKDVYEFTECPNCYGRGKLVCPVCLGTGLPNNKGLLRRPDARKLLDKMYNGRLLPNS
          +D KP   CRNC GSGAVLCDMCGGTGKWKALNRKRAKDVYEFTECPNCYGRGKLVCPVCLGTGLPNNKGLLRRP AR+LL+KMYNGRLLP+S
Subjt:  NTMDGKP--VCRNCGGSGAVLCDMCGGTGKWKALNRKRAKDVYEFTECPNCYGRGKLVCPVCLGTGLPNNKGLLRRPDARKLLDKMYNGRLLPNS

AT2G34860.2 DnaJ/Hsp40 cysteine-rich domain superfamily protein5.5e-5360Show/hide
Query:  MMASSSHLSAIPLRLSSTSTRTNLSH----AKSKPIVLNGTSNLDNESCSSGDSNTPSK---PLKGTQFLISRR-WCLTCLCSSMTLIKD-FGGTTGASA
        M ASSSHL A+P   S   +  N +     AKS P         +N+S  S DS++ S+     +G Q  +SRR W   C+C+S  LI + +   +  SA
Subjt:  MMASSSHLSAIPLRLSSTSTRTNLSH----AKSKPIVLNGTSNLDNESCSSGDSNTPSK---PLKGTQFLISRR-WCLTCLCSSMTLIKD-FGGTTGASA

Query:  NTMDGKP--VCRNCGGSGAVLCDMCGGTGKWKALNRKRAKDVYEFTECPNCYGRGKLVCPVCLGTGLPNNKGLLRRPDARKLLDKMYNGRLLPNS
          +D KP   CRNC GSGAVLCDMCGGTGKWKALNRKRAKDVYEFTECPNCYGRGKLVCPVCLGTGLPNNKGLLRRP AR+LL+KMYNGRLLP+S
Subjt:  NTMDGKP--VCRNCGGSGAVLCDMCGGTGKWKALNRKRAKDVYEFTECPNCYGRGKLVCPVCLGTGLPNNKGLLRRPDARKLLDKMYNGRLLPNS

AT5G06130.1 chaperone protein dnaJ-related1.2e-0435.82Show/hide
Query:  KPVCRNCGGSGAVLCDMCGGTGKWKALN---RKRAKD----VYEFTECPNCYGRGKLVCPVCLGTGL
        K  C+ C G+G + C  C  +G   +++   R RA +    V     C NC G GK++CP CL TG+
Subjt:  KPVCRNCGGSGAVLCDMCGGTGKWKALN---RKRAKD----VYEFTECPNCYGRGKLVCPVCLGTGL

AT5G06130.2 chaperone protein dnaJ-related1.2e-0435.82Show/hide
Query:  KPVCRNCGGSGAVLCDMCGGTGKWKALN---RKRAKD----VYEFTECPNCYGRGKLVCPVCLGTGL
        K  C+ C G+G + C  C  +G   +++   R RA +    V     C NC G GK++CP CL TG+
Subjt:  KPVCRNCGGSGAVLCDMCGGTGKWKALN---RKRAKD----VYEFTECPNCYGRGKLVCPVCLGTGL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATGGCGTCTTCTTCTCATCTATCAGCCATTCCCCTACGCCTGTCTTCCACCTCCACAAGAACGAACTTATCCCACGCTAAATCAAAGCCAATTGTACTTAATGGGAC
CTCAAATTTGGACAATGAAAGCTGTAGTAGTGGAGATTCTAATACACCATCAAAACCACTTAAGGGAACTCAATTCTTGATCAGTCGCCGATGGTGCCTCACATGTTTGT
GTTCATCCATGACACTGATAAAGGATTTTGGGGGCACGACTGGAGCTAGTGCAAATACCATGGATGGAAAACCTGTTTGCCGAAATTGTGGAGGAAGTGGTGCTGTACTT
TGTGACATGTGCGGTGGTACAGGCAAATGGAAAGCTCTGAACAGAAAACGGGCTAAGGATGTCTACGAGTTTACAGAATGTCCAAATTGTTATGGTAGAGGAAAACTTGT
ATGTCCTGTTTGTCTAGGAACTGGTTTACCAAATAACAAAGGTCTCCTAAGAAGGCCTGACGCACGAAAATTGCTCGATAAGATGTACAATGGTCGCTTGTTACCAAATT
CTTGA
mRNA sequenceShow/hide mRNA sequence
CTTTAAGGAAAAAGTTTTGCTATTAGAAAATGATGAAGTGAAATTGAATTTTAAGATTGTGGAAGAACGGGGGAGGGCAGGCGTAGGAACGAGAGTTGAGAGTGTTGGGT
AATTAATTAGAGTAGCAGAAGGAGTGGTGCAGAGAGAAGAGAAGATGATGGCGTCTTCTTCTCATCTATCAGCCATTCCCCTACGCCTGTCTTCCACCTCCACAAGAACG
AACTTATCCCACGCTAAATCAAAGCCAATTGTACTTAATGGGACCTCAAATTTGGACAATGAAAGCTGTAGTAGTGGAGATTCTAATACACCATCAAAACCACTTAAGGG
AACTCAATTCTTGATCAGTCGCCGATGGTGCCTCACATGTTTGTGTTCATCCATGACACTGATAAAGGATTTTGGGGGCACGACTGGAGCTAGTGCAAATACCATGGATG
GAAAACCTGTTTGCCGAAATTGTGGAGGAAGTGGTGCTGTACTTTGTGACATGTGCGGTGGTACAGGCAAATGGAAAGCTCTGAACAGAAAACGGGCTAAGGATGTCTAC
GAGTTTACAGAATGTCCAAATTGTTATGGTAGAGGAAAACTTGTATGTCCTGTTTGTCTAGGAACTGGTTTACCAAATAACAAAGGTCTCCTAAGAAGGCCTGACGCACG
AAAATTGCTCGATAAGATGTACAATGGTCGCTTGTTACCAAATTCTTGAACTTCATCTCGTTTCTCACAAATTAGCCTGTCTAGGTGATACATGTATAACTTGTTTCAGC
TGGTATAAGTTACATGATTGTAGTTTTTGTCATGGTTGGCTTCATGTCATTCATTGAGTGGAATAACGTAGGTATAACTTAGATGGCCATCCTTTTTGATGCTCAAAACA
CCCTCTAATCGTCAAATTCATTTTGTTCATACCAAAAAGAGTAGAGCATATACGTCAAATCACACTGTTCCATCCTCGTCGCAAAGCATTGTCAAACATGATCTAGTTTT
CATAGATTACAAAGGGGCAACGTGAATTCAAATTTATAAGAAATTCAAAATCACGTTAATGCAAATGAAAGAGAATAAAGAATAGAAACAAAGGAAAAAAGATTAGGAAA
GAATCAGAAAGAGACAGCAACAATCATGCCAACTTTTCGGTGCAGCAAAGTGAAACGGATGTAGAAGCTCAAGCCAGTATCCTTATGAATGAGGCAGTGGATTAAATATC
CTTTGATGAAGTTACTTGTGATTGTGTTGAGCGAGGCAAGTGATCATCTAAAGAGAATTATAGTTTATACATCCTCATAGCTCTCGTTCAAGCCTTGGCCTCACTAAGAT
TCGAACTATTGATTTGTGTCGTTGAAGTTCGAAAGCCTTCAAAATCTTTGTCTCCACACAGTAGGCTTGAAGATGCATGCCTTGTATTCTTCATCATGAATTCCATATTG
ATGAGGATAACTTTCTGTTCATGCTACAAATTTTAAGCAAAATCATTTCATCCAACACAAACTTTAGGCTCACCCTTTTTCCTCTTCTTTTCACTTTGATCAAGTGGTTT
GTTTGCAGCGTGATCAGTCTTGGAGACAATTTTTGATACTGTAAATCCAGCATCTTCAAAGAAGTTCGATTGGTAAGGGATGACAGAACTTCGAATAACAAACTCGACTC
CCTCGTAAGTCCACCATAGTCCTAAGATTGCTTCTCCTTGAAGTCAAATAGCAAACCACCCGAGGCCAGCAACAGCAATGTCTACCCAACTTGAATCCCAACTACTACCA
CAAACACGAAATTCTCTTCTCACCCATTTTCCAAGCTCTGCAACTCGATCTTTACCTATAGGTGGCTGAAGCATGAAACATTGAGGATTCCTTAACAATTACATTTACTA
TAACAGAAACCCTAAACAACTACATAAATGTAAGAGAGTAAGGAAATTCCGAAGCCAACTTTCCACAAATATGAGATCAAGGCTTTCCAAAAAAGAATATGAAAACACTA
TGGCACCCCAAATGTTGCAAAATGAAAAACGAGAAGGGCAGTTAAACCACCGATTAACTATTCATAACATGATCATAAGTATCGAATAATAAAAAACAGGATCATAAGCA
TCAAACAATATCGATATGAGATAAATAATAGAAGCCATATGCACATTCTTAAGACTATGTCATTTAGGAAGCCATTGACAATGAAACTCGGAAGACGCTTACATAAACCA
ATTCTGGGACAGCACTCAAATTTCAGCAACATGATGCAAATGGTTATAGTGAAAATAATACTGTAACTGATTGCCAAAATCATCCTCTTGTATCTTGCTTGCATTTTCTA
TTTTCCCCATGTGTAATAGTAGATAAGGAGAGCCCACACTGTAACATAAATTGAATCTACAAATGTTTCTTCCACATCTAGCCTCATGACACCCGCAACATGAATTGTAT
GACCAGCCTAAAAAACGGACAAAAAGAATGAAAT
Protein sequenceShow/hide protein sequence
MMASSSHLSAIPLRLSSTSTRTNLSHAKSKPIVLNGTSNLDNESCSSGDSNTPSKPLKGTQFLISRRWCLTCLCSSMTLIKDFGGTTGASANTMDGKPVCRNCGGSGAVL
CDMCGGTGKWKALNRKRAKDVYEFTECPNCYGRGKLVCPVCLGTGLPNNKGLLRRPDARKLLDKMYNGRLLPNS