; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0010384 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0010384
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Descriptionprotein EMBRYO SAC DEVELOPMENT ARREST 3, chloroplastic
Genome locationchr9:46843313..46847779
RNA-Seq ExpressionLag0010384
SyntenyLag0010384
Gene Ontology termsGO:0048564 - photosystem I assembly (biological process)
GO:0009507 - chloroplast (cellular component)
GO:0047134 - protein-disulfide reductase activity (molecular function)
InterPro domainsIPR036410 - Heat shock protein DnaJ, cysteine-rich domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_011653566.1 protein PHOTOSYSTEM I ASSEMBLY 2, chloroplastic isoform X2 [Cucumis sativus]3.0e-8788.89Show/hide
Query:  MASSSHLSAIPL-RPSSSSAASLSHPNSKPIVLQVTSNFDDERCSTGDSKTPSKPLKGTQILISRRWCLTCLCSSVTLIKDHGGTMTEAIANTMDGKPVC
        MASSSHLSAIPL RPSSSS  SLSH N KP+VL VTSN DDE CSTGDSKTPSKPLKGTQ LISRRWCLTCLCSSVTL+K +GGT+TEAIANTMDGKP C
Subjt:  MASSSHLSAIPL-RPSSSSAASLSHPNSKPIVLQVTSNFDDERCSTGDSKTPSKPLKGTQILISRRWCLTCLCSSVTLIKDHGGTMTEAIANTMDGKPVC

Query:  RNCGGSGAVLCDMCGGTGKWKALNRKRAKDVYEFTECPNCYGRGKLVCPVCLGTGLPNNKGLLRRPDARKLLDKIFNSRV
        RNCGGSGAVLCDMCGGTGKWKALNRKRAKDVYEFTECPNCYGRGKLVCPVCLGTGLPNNKGLLRRPDAR+LLDK++N R+
Subjt:  RNCGGSGAVLCDMCGGTGKWKALNRKRAKDVYEFTECPNCYGRGKLVCPVCLGTGLPNNKGLLRRPDARKLLDKIFNSRV

XP_016900745.1 PREDICTED: protein EMBRYO SAC DEVELOPMENT ARREST 3, chloroplastic isoform X1 [Cucumis melo]6.7e-8788.95Show/hide
Query:  MMASSSHLSAIPL-RPSSSSAASLSHPNSKPIVLQVTSNFDDERCSTGDSKTPSKPLKGTQILISRRWCLTCLCSSVTLIKDHGGTMTEAIANTMDGKPV
        MMASSSHLSAIPL RPSSSS  SLSH NSKP+VL +TSN DDE CSTGDS+TPSKPLKGTQ LISRRWCLTCLCSSVTL+K++GGT TEAIANTMDGKPV
Subjt:  MMASSSHLSAIPL-RPSSSSAASLSHPNSKPIVLQVTSNFDDERCSTGDSKTPSKPLKGTQILISRRWCLTCLCSSVTLIKDHGGTMTEAIANTMDGKPV

Query:  CRNCGGSGAVLCDMCGGTGKWKALNRKRAKDVYEFTECPNCYGRGKLVCPVCLGTGLPNNKGLLRRPDARKLLDKIFNSRV
        CRNCGGSGAVLCDMCGGTGKWKALNRKRAKDVYEFTECPNCYGRGKLVCPVCLGTGLPNNKGLLRRPDAR+LLDK++N R+
Subjt:  CRNCGGSGAVLCDMCGGTGKWKALNRKRAKDVYEFTECPNCYGRGKLVCPVCLGTGLPNNKGLLRRPDARKLLDKIFNSRV

XP_022958131.1 protein PHOTOSYSTEM I ASSEMBLY 2, chloroplastic [Cucurbita moschata]5.4e-8990Show/hide
Query:  MMASSSHLSAIPLRPSSSSAASLSHPNSKPIVLQVTSNFDDERCSTGDSKTPSKPLKGTQILISRRWCLTCLCSSVTLIKDHGGTMTEAIANTMDGKPVC
        MMASSSHLSAIPLR SSSS  SLSH NSKP+VLQVTSN DDE  +TGDS TPSKPLKGT+ILISRRWCLTCLCSS TLIKD+GGTM EAIANTMDGKP C
Subjt:  MMASSSHLSAIPLRPSSSSAASLSHPNSKPIVLQVTSNFDDERCSTGDSKTPSKPLKGTQILISRRWCLTCLCSSVTLIKDHGGTMTEAIANTMDGKPVC

Query:  RNCGGSGAVLCDMCGGTGKWKALNRKRAKDVYEFTECPNCYGRGKLVCPVCLGTGLPNNKGLLRRPDARKLLDKIFNSRV
        RNCGGSGAVLCDMCGGTGKWKALNRKRAKDVYEFTECPNCYGRGKLVCPVCLGTGLPNNKGLLRRPDARKLLDK+FN R+
Subjt:  RNCGGSGAVLCDMCGGTGKWKALNRKRAKDVYEFTECPNCYGRGKLVCPVCLGTGLPNNKGLLRRPDARKLLDKIFNSRV

XP_022996228.1 protein PHOTOSYSTEM I ASSEMBLY 2, chloroplastic [Cucurbita maxima]7.1e-8990Show/hide
Query:  MMASSSHLSAIPLRPSSSSAASLSHPNSKPIVLQVTSNFDDERCSTGDSKTPSKPLKGTQILISRRWCLTCLCSSVTLIKDHGGTMTEAIANTMDGKPVC
        MMASSSHLSAIPLR SSSS   LSH NSKP+VLQVTSN DDE CSTGD  TPSKPLKGT ILISRRWCLTCLCSS TLIKD+GGTM EAIANTMDGKP C
Subjt:  MMASSSHLSAIPLRPSSSSAASLSHPNSKPIVLQVTSNFDDERCSTGDSKTPSKPLKGTQILISRRWCLTCLCSSVTLIKDHGGTMTEAIANTMDGKPVC

Query:  RNCGGSGAVLCDMCGGTGKWKALNRKRAKDVYEFTECPNCYGRGKLVCPVCLGTGLPNNKGLLRRPDARKLLDKIFNSRV
        RNCGGSGAVLCDMCGGTGKWKALNRKRAKDVYEFTECPNCYGRGKLVCPVCLGTGLPNNKGLLRRPDARKLLDK+FN R+
Subjt:  RNCGGSGAVLCDMCGGTGKWKALNRKRAKDVYEFTECPNCYGRGKLVCPVCLGTGLPNNKGLLRRPDARKLLDKIFNSRV

XP_038901741.1 protein PHOTOSYSTEM I ASSEMBLY 2, chloroplastic [Benincasa hispida]1.1e-8687.78Show/hide
Query:  MMASSSHLSAIPLRPSSSSAASLSHPNSKPIVLQVTSNFDDERCSTGDSKTPSKPLKGTQILISRRWCLTCLCSSVTLIKDHGGTMTEAIANTMDGKPVC
        MMASSSHLSAIPLRPSS+S  SL HPNSKP+VL VTS  D E CSTG S  PSKPLKGTQ LISRRWCLTCLCSSVTL+K++GGT TEAIANTMDGKPVC
Subjt:  MMASSSHLSAIPLRPSSSSAASLSHPNSKPIVLQVTSNFDDERCSTGDSKTPSKPLKGTQILISRRWCLTCLCSSVTLIKDHGGTMTEAIANTMDGKPVC

Query:  RNCGGSGAVLCDMCGGTGKWKALNRKRAKDVYEFTECPNCYGRGKLVCPVCLGTGLPNNKGLLRRPDARKLLDKIFNSRV
        RNCGGSGAVLCDMCGGTGKWKALNRKRAKDVYEFTECPNCYGRGKLVCPVCLGTGLPNNKGLLRRPDARKLLDK++N R+
Subjt:  RNCGGSGAVLCDMCGGTGKWKALNRKRAKDVYEFTECPNCYGRGKLVCPVCLGTGLPNNKGLLRRPDARKLLDKIFNSRV

TrEMBL top hitse value%identityAlignment
A0A0A0KX46 Uncharacterized protein3.6e-8688.4Show/hide
Query:  MASSSHLSAIPL-RPSSSSAASLSH-PNSKPIVLQVTSNFDDERCSTGDSKTPSKPLKGTQILISRRWCLTCLCSSVTLIKDHGGTMTEAIANTMDGKPV
        MASSSHLSAIPL RPSSSS  SLSH  N KP+VL VTSN DDE CSTGDSKTPSKPLKGTQ LISRRWCLTCLCSSVTL+K +GGT+TEAIANTMDGKP 
Subjt:  MASSSHLSAIPL-RPSSSSAASLSH-PNSKPIVLQVTSNFDDERCSTGDSKTPSKPLKGTQILISRRWCLTCLCSSVTLIKDHGGTMTEAIANTMDGKPV

Query:  CRNCGGSGAVLCDMCGGTGKWKALNRKRAKDVYEFTECPNCYGRGKLVCPVCLGTGLPNNKGLLRRPDARKLLDKIFNSRV
        CRNCGGSGAVLCDMCGGTGKWKALNRKRAKDVYEFTECPNCYGRGKLVCPVCLGTGLPNNKGLLRRPDAR+LLDK++N R+
Subjt:  CRNCGGSGAVLCDMCGGTGKWKALNRKRAKDVYEFTECPNCYGRGKLVCPVCLGTGLPNNKGLLRRPDARKLLDKIFNSRV

A0A1S4DYE8 protein EMBRYO SAC DEVELOPMENT ARREST 3, chloroplastic isoform X13.2e-8788.95Show/hide
Query:  MMASSSHLSAIPL-RPSSSSAASLSHPNSKPIVLQVTSNFDDERCSTGDSKTPSKPLKGTQILISRRWCLTCLCSSVTLIKDHGGTMTEAIANTMDGKPV
        MMASSSHLSAIPL RPSSSS  SLSH NSKP+VL +TSN DDE CSTGDS+TPSKPLKGTQ LISRRWCLTCLCSSVTL+K++GGT TEAIANTMDGKPV
Subjt:  MMASSSHLSAIPL-RPSSSSAASLSHPNSKPIVLQVTSNFDDERCSTGDSKTPSKPLKGTQILISRRWCLTCLCSSVTLIKDHGGTMTEAIANTMDGKPV

Query:  CRNCGGSGAVLCDMCGGTGKWKALNRKRAKDVYEFTECPNCYGRGKLVCPVCLGTGLPNNKGLLRRPDARKLLDKIFNSRV
        CRNCGGSGAVLCDMCGGTGKWKALNRKRAKDVYEFTECPNCYGRGKLVCPVCLGTGLPNNKGLLRRPDAR+LLDK++N R+
Subjt:  CRNCGGSGAVLCDMCGGTGKWKALNRKRAKDVYEFTECPNCYGRGKLVCPVCLGTGLPNNKGLLRRPDARKLLDKIFNSRV

A0A6J1DK36 protein EMBRYO SAC DEVELOPMENT ARREST 3, chloroplastic6.3e-8386.19Show/hide
Query:  MMASSSHLSAIPLRPSSSSA-ASLSHPNSKPIVLQVTSNFDDERCSTGDSKTPSKPLKGTQILISRRWCLTCLCSSVTLIKDHGGTMTEAIANTMDGKPV
        MMASSSHLSAIPLR SS+S   +LSH  SKPIVL  TSN D+E CS+GDS TPSKPLKGTQ LISRRWCLTCLCSS+TLIKD GGT T A ANTMDGKPV
Subjt:  MMASSSHLSAIPLRPSSSSA-ASLSHPNSKPIVLQVTSNFDDERCSTGDSKTPSKPLKGTQILISRRWCLTCLCSSVTLIKDHGGTMTEAIANTMDGKPV

Query:  CRNCGGSGAVLCDMCGGTGKWKALNRKRAKDVYEFTECPNCYGRGKLVCPVCLGTGLPNNKGLLRRPDARKLLDKIFNSRV
        CRNCGGSGAVLCDMCGGTGKWKALNRKRAKDVYEFTECPNCYGRGKLVCPVCLGTGLPNNKGLLRRPDARKLLDK++N R+
Subjt:  CRNCGGSGAVLCDMCGGTGKWKALNRKRAKDVYEFTECPNCYGRGKLVCPVCLGTGLPNNKGLLRRPDARKLLDKIFNSRV

A0A6J1H182 protein PHOTOSYSTEM I ASSEMBLY 2, chloroplastic2.6e-8990Show/hide
Query:  MMASSSHLSAIPLRPSSSSAASLSHPNSKPIVLQVTSNFDDERCSTGDSKTPSKPLKGTQILISRRWCLTCLCSSVTLIKDHGGTMTEAIANTMDGKPVC
        MMASSSHLSAIPLR SSSS  SLSH NSKP+VLQVTSN DDE  +TGDS TPSKPLKGT+ILISRRWCLTCLCSS TLIKD+GGTM EAIANTMDGKP C
Subjt:  MMASSSHLSAIPLRPSSSSAASLSHPNSKPIVLQVTSNFDDERCSTGDSKTPSKPLKGTQILISRRWCLTCLCSSVTLIKDHGGTMTEAIANTMDGKPVC

Query:  RNCGGSGAVLCDMCGGTGKWKALNRKRAKDVYEFTECPNCYGRGKLVCPVCLGTGLPNNKGLLRRPDARKLLDKIFNSRV
        RNCGGSGAVLCDMCGGTGKWKALNRKRAKDVYEFTECPNCYGRGKLVCPVCLGTGLPNNKGLLRRPDARKLLDK+FN R+
Subjt:  RNCGGSGAVLCDMCGGTGKWKALNRKRAKDVYEFTECPNCYGRGKLVCPVCLGTGLPNNKGLLRRPDARKLLDKIFNSRV

A0A6J1KA92 protein PHOTOSYSTEM I ASSEMBLY 2, chloroplastic3.4e-8990Show/hide
Query:  MMASSSHLSAIPLRPSSSSAASLSHPNSKPIVLQVTSNFDDERCSTGDSKTPSKPLKGTQILISRRWCLTCLCSSVTLIKDHGGTMTEAIANTMDGKPVC
        MMASSSHLSAIPLR SSSS   LSH NSKP+VLQVTSN DDE CSTGD  TPSKPLKGT ILISRRWCLTCLCSS TLIKD+GGTM EAIANTMDGKP C
Subjt:  MMASSSHLSAIPLRPSSSSAASLSHPNSKPIVLQVTSNFDDERCSTGDSKTPSKPLKGTQILISRRWCLTCLCSSVTLIKDHGGTMTEAIANTMDGKPVC

Query:  RNCGGSGAVLCDMCGGTGKWKALNRKRAKDVYEFTECPNCYGRGKLVCPVCLGTGLPNNKGLLRRPDARKLLDKIFNSRV
        RNCGGSGAVLCDMCGGTGKWKALNRKRAKDVYEFTECPNCYGRGKLVCPVCLGTGLPNNKGLLRRPDARKLLDK+FN R+
Subjt:  RNCGGSGAVLCDMCGGTGKWKALNRKRAKDVYEFTECPNCYGRGKLVCPVCLGTGLPNNKGLLRRPDARKLLDKIFNSRV

SwissProt top hitse value%identityAlignment
A0A1D6KL43 Protein PHOTOSYSTEM I ASSEMBLY 2, chloroplastic1.0e-4572.5Show/hide
Query:  SRRWCLTCLCSSVTLIKDHG---GTMTEAIANTMDGKPVCRNCGGSGAVLCDMCGGTGKWKALNRKRAKDVYEFTECPNCYGRGKLVCPVCLGTGLPNNK
        SRR CL CL  +VTLI   G   G   +A+      K VCRNC GSGAV+CDMCGGTGKWKALNRKRAKDVYEFTECPNCYGRGKLVCPVCLGTGLPNNK
Subjt:  SRRWCLTCLCSSVTLIKDHG---GTMTEAIANTMDGKPVCRNCGGSGAVLCDMCGGTGKWKALNRKRAKDVYEFTECPNCYGRGKLVCPVCLGTGLPNNK

Query:  GLLRRPDARKLLDKIFNSRV
        GLLRRP+A++LLDK++N ++
Subjt:  GLLRRPDARKLLDKIFNSRV

B5YAR4 Chaperone protein DnaJ1.2e-0630.43Show/hide
Query:  SVTLIKDHGGTMTEAIANTMDGKPVCRNCG---GSGAVLCDMCGGTGKWKALNRKRAKDVYEFTECPNCYGRGKLV---CPVCLGTGLPNNK
        ++TL +   G+  E     ++  P C+  G   G+  V CDMC GTG+ + + +       + T CP C+G G+++   C  C GTG    K
Subjt:  SVTLIKDHGGTMTEAIANTMDGKPVCRNCG---GSGAVLCDMCGGTGKWKALNRKRAKDVYEFTECPNCYGRGKLV---CPVCLGTGLPNNK

O64750 Protein PHOTOSYSTEM I ASSEMBLY 2, chloroplastic1.2e-4956.99Show/hide
Query:  MMASSSHLSAIPLRPSSSSAASLSHPNSKPIVLQVTSNFDDERCSTGDSKTPSK---PLKGTQILISRR-WCLTCLCSSVTLIKDHGGTMTEAIANTMDG
        M ASSSHL A+P    S ++  LS PN   + +   S  +++   + DS + S+     +G Q  +SRR W   C+C+S  LI +    ++   A  +D 
Subjt:  MMASSSHLSAIPLRPSSSSAASLSHPNSKPIVLQVTSNFDDERCSTGDSKTPSK---PLKGTQILISRR-WCLTCLCSSVTLIKDHGGTMTEAIANTMDG

Query:  KP--VCRNCGGSGAVLCDMCGGTGKWKALNRKRAKDVYEFTECPNCYGRGKLVCPVCLGTGLPNNKGLLRRPDARKLLDKIFNSRV
        KP   CRNC GSGAVLCDMCGGTGKWKALNRKRAKDVYEFTECPNCYGRGKLVCPVCLGTGLPNNKGLLRRP AR+LL+K++N R+
Subjt:  KP--VCRNCGGSGAVLCDMCGGTGKWKALNRKRAKDVYEFTECPNCYGRGKLVCPVCLGTGLPNNKGLLRRPDARKLLDKIFNSRV

Q6YUA8 Protein PHOTOSYSTEM I ASSEMBLY 2, chloroplastic4.4e-4961.58Show/hide
Query:  LSAIPLRPSSSSAASLSHPNSKPIVLQVTSNFDD-ERCSTGDSKTPSKPLKGTQILISRRWCLTCLCSSVTLIKDHGGTM--TEAIANTMDGKP-VCRNC
        LS  P   S    A+   P      +   S  DD E CST    T  K  + T    SRR CL CLC +VTLI   G TM     +A+ M  KP VCRNC
Subjt:  LSAIPLRPSSSSAASLSHPNSKPIVLQVTSNFDD-ERCSTGDSKTPSKPLKGTQILISRRWCLTCLCSSVTLIKDHGGTM--TEAIANTMDGKP-VCRNC

Query:  GGSGAVLCDMCGGTGKWKALNRKRAKDVYEFTECPNCYGRGKLVCPVCLGTGLPNNKGLLRRPDARKLLDKIFNSRV
         GSGAVLCDMCGGTGKWKALNRKRAKDVY FTECPNCYGRGKLVCPVCLGTGLPNNKGLLRRPDA+KLLDK++N ++
Subjt:  GGSGAVLCDMCGGTGKWKALNRKRAKDVYEFTECPNCYGRGKLVCPVCLGTGLPNNKGLLRRPDARKLLDKIFNSRV

Q7UA76 Chaperone protein DnaJ1.1e-0435.14Show/hide
Query:  CRNCGGSGA------VLCDMCGGTGKWKALNRKRAKDVYEFTECPNCYGRGKLV---CPVCLGTGLPNNKGLLR
        C  CGGSGA        C  CGG G+ +   R       +  ECPNC G G+++   C  C G G+   +  LR
Subjt:  CRNCGGSGA------VLCDMCGGTGKWKALNRKRAKDVYEFTECPNCYGRGKLV---CPVCLGTGLPNNKGLLR

Arabidopsis top hitse value%identityAlignment
AT2G34860.1 DnaJ/Hsp40 cysteine-rich domain superfamily protein8.2e-5156.99Show/hide
Query:  MMASSSHLSAIPLRPSSSSAASLSHPNSKPIVLQVTSNFDDERCSTGDSKTPSK---PLKGTQILISRR-WCLTCLCSSVTLIKDHGGTMTEAIANTMDG
        M ASSSHL A+P    S ++  LS PN   + +   S  +++   + DS + S+     +G Q  +SRR W   C+C+S  LI +    ++   A  +D 
Subjt:  MMASSSHLSAIPLRPSSSSAASLSHPNSKPIVLQVTSNFDDERCSTGDSKTPSK---PLKGTQILISRR-WCLTCLCSSVTLIKDHGGTMTEAIANTMDG

Query:  KP--VCRNCGGSGAVLCDMCGGTGKWKALNRKRAKDVYEFTECPNCYGRGKLVCPVCLGTGLPNNKGLLRRPDARKLLDKIFNSRV
        KP   CRNC GSGAVLCDMCGGTGKWKALNRKRAKDVYEFTECPNCYGRGKLVCPVCLGTGLPNNKGLLRRP AR+LL+K++N R+
Subjt:  KP--VCRNCGGSGAVLCDMCGGTGKWKALNRKRAKDVYEFTECPNCYGRGKLVCPVCLGTGLPNNKGLLRRPDARKLLDKIFNSRV

AT2G34860.2 DnaJ/Hsp40 cysteine-rich domain superfamily protein8.2e-5156.99Show/hide
Query:  MMASSSHLSAIPLRPSSSSAASLSHPNSKPIVLQVTSNFDDERCSTGDSKTPSK---PLKGTQILISRR-WCLTCLCSSVTLIKDHGGTMTEAIANTMDG
        M ASSSHL A+P    S ++  LS PN   + +   S  +++   + DS + S+     +G Q  +SRR W   C+C+S  LI +    ++   A  +D 
Subjt:  MMASSSHLSAIPLRPSSSSAASLSHPNSKPIVLQVTSNFDDERCSTGDSKTPSK---PLKGTQILISRR-WCLTCLCSSVTLIKDHGGTMTEAIANTMDG

Query:  KP--VCRNCGGSGAVLCDMCGGTGKWKALNRKRAKDVYEFTECPNCYGRGKLVCPVCLGTGLPNNKGLLRRPDARKLLDKIFNSRV
        KP   CRNC GSGAVLCDMCGGTGKWKALNRKRAKDVYEFTECPNCYGRGKLVCPVCLGTGLPNNKGLLRRP AR+LL+K++N R+
Subjt:  KP--VCRNCGGSGAVLCDMCGGTGKWKALNRKRAKDVYEFTECPNCYGRGKLVCPVCLGTGLPNNKGLLRRPDARKLLDKIFNSRV

AT5G03870.1 Glutaredoxin family protein5.8e-0429.41Show/hide
Query:  GTMTEAIANTMDGKPVCRNCGGSGAVLCDMCGGTGKWKALNRKRAKDVYEFTECPNCYGRGKLVCPVC
        G + E I     G   CR CGG   ++C +C G+ K +   +K         +C  C   G ++CP+C
Subjt:  GTMTEAIANTMDGKPVCRNCGGSGAVLCDMCGGTGKWKALNRKRAKDVYEFTECPNCYGRGKLVCPVC

AT5G06130.1 chaperone protein dnaJ-related8.9e-0535.82Show/hide
Query:  KPVCRNCGGSGAVLCDMCGGTGKWKALN---RKRAKD----VYEFTECPNCYGRGKLVCPVCLGTGL
        K  C+ C G+G + C  C  +G   +++   R RA +    V     C NC G GK++CP CL TG+
Subjt:  KPVCRNCGGSGAVLCDMCGGTGKWKALN---RKRAKD----VYEFTECPNCYGRGKLVCPVCLGTGL

AT5G06130.2 chaperone protein dnaJ-related8.9e-0535.82Show/hide
Query:  KPVCRNCGGSGAVLCDMCGGTGKWKALN---RKRAKD----VYEFTECPNCYGRGKLVCPVCLGTGL
        K  C+ C G+G + C  C  +G   +++   R RA +    V     C NC G GK++CP CL TG+
Subjt:  KPVCRNCGGSGAVLCDMCGGTGKWKALN---RKRAKD----VYEFTECPNCYGRGKLVCPVCLGTGL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATGGCGTCTTCTTCTCATCTCTCCGCCATTCCCCTGCGCCCGTCTTCCTCCTCAGCAGCTTCCTTATCCCACCCTAACTCAAAGCCCATTGTACTTCAAGTGACATC
AAATTTTGACGATGAAAGATGTAGTACTGGAGATTCGAAAACACCATCAAAACCACTTAAGGGAACTCAAATCTTAATCTCACGTCGATGGTGCCTCACATGTTTGTGTT
CATCTGTGACATTGATAAAGGATCATGGGGGCACAATGACTGAAGCTATTGCAAATACCATGGATGGAAAACCCGTGTGCCGGAATTGTGGAGGAAGTGGCGCTGTACTT
TGTGACATGTGCGGTGGTACAGGCAAATGGAAAGCTTTGAACAGAAAACGGGCTAAAGATGTCTACGAGTTTACAGAATGTCCAAATTGTTATGGTAGAGGAAAACTTGT
ATGTCCTGTTTGTCTAGGAACTGGTTTACCAAACAACAAAGGTCTTCTAAGAAGGCCCGATGCACGAAAATTGCTTGATAAGATATTCAATAGTAGAGTACATAGGGGCA
ACGTGAGCGGTGAGCTCAACTTTTATATGAAGTCAGGAGAAAGTCGCACAAATCCCAGAAGATTATCACCGAATGAAAGTCTTCAAAATCTTCGTCTCCAGGGCAGAAGG
CTTGAAGATGCTTGA
mRNA sequenceShow/hide mRNA sequence
ATGATGGCGTCTTCTTCTCATCTCTCCGCCATTCCCCTGCGCCCGTCTTCCTCCTCAGCAGCTTCCTTATCCCACCCTAACTCAAAGCCCATTGTACTTCAAGTGACATC
AAATTTTGACGATGAAAGATGTAGTACTGGAGATTCGAAAACACCATCAAAACCACTTAAGGGAACTCAAATCTTAATCTCACGTCGATGGTGCCTCACATGTTTGTGTT
CATCTGTGACATTGATAAAGGATCATGGGGGCACAATGACTGAAGCTATTGCAAATACCATGGATGGAAAACCCGTGTGCCGGAATTGTGGAGGAAGTGGCGCTGTACTT
TGTGACATGTGCGGTGGTACAGGCAAATGGAAAGCTTTGAACAGAAAACGGGCTAAAGATGTCTACGAGTTTACAGAATGTCCAAATTGTTATGGTAGAGGAAAACTTGT
ATGTCCTGTTTGTCTAGGAACTGGTTTACCAAACAACAAAGGTCTTCTAAGAAGGCCCGATGCACGAAAATTGCTTGATAAGATATTCAATAGTAGAGTACATAGGGGCA
ACGTGAGCGGTGAGCTCAACTTTTATATGAAGTCAGGAGAAAGTCGCACAAATCCCAGAAGATTATCACCGAATGAAAGTCTTCAAAATCTTCGTCTCCAGGGCAGAAGG
CTTGAAGATGCTTGA
Protein sequenceShow/hide protein sequence
MMASSSHLSAIPLRPSSSSAASLSHPNSKPIVLQVTSNFDDERCSTGDSKTPSKPLKGTQILISRRWCLTCLCSSVTLIKDHGGTMTEAIANTMDGKPVCRNCGGSGAVL
CDMCGGTGKWKALNRKRAKDVYEFTECPNCYGRGKLVCPVCLGTGLPNNKGLLRRPDARKLLDKIFNSRVHRGNVSGELNFYMKSGESRTNPRRLSPNESLQNLRLQGRR
LEDA