; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cp4.1LG15g06830 (gene) of Cucurbita pepo (MU-CU-16) v4.1 genome

Gene IDCp4.1LG15g06830
OrganismCucurbita pepo var. pepo MU-CU-16 (Cucurbita pepo (MU-CU-16) v4.1)
DescriptionRibulose-1,5 bisphosphate carboxylase/oxygenase large subunit N-methyltransferase
Genome locationCp4.1LG15:7389374..7398803
RNA-Seq ExpressionCp4.1LG15g06830
SyntenyCp4.1LG15g06830
Gene Ontology termsGO:0018026 - peptidyl-lysine monomethylation (biological process)
GO:0000162 - tryptophan biosynthetic process (biological process)
GO:0009507 - chloroplast (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0030785 - [ribulose-bisphosphate carboxylase]-lysine N-methyltransferase activity (molecular function)
GO:0016279 - protein-lysine N-methyltransferase activity (molecular function)
GO:0005515 - protein binding (molecular function)
GO:0004425 - indole-3-glycerol-phosphate synthase activity (molecular function)
GO:0003723 - RNA binding (molecular function)
InterPro domainsIPR044431 - RBCMT, SET domain
IPR036464 - Rubisco LSMT, substrate-binding domain superfamily
IPR015353 - Rubisco LSMT, substrate-binding domain
IPR013798 - Indole-3-glycerol phosphate synthase
IPR001214 - SET domain
IPR001468 - Indole-3-glycerol phosphate synthase, conserved site
IPR011060 - Ribulose-phosphate binding barrel
IPR013785 - Aldolase-type TIM barrel


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAF4349155.1 hypothetical protein G4B88_020394 [Cannabis sativa]0.076.01Show/hide
Query:  MAEASRIFYSPLVPNFRPLQKTHSTKPYTASFRRHSINCSVSTSDSARVAATGPIPWGCEIDSLENASALQKWLSQSGLPDQKMAIQRVDVGERGLVALK
        MAEASR+  + LV NF   Q+TH ++    SFRR+ ++CSVSTS++ +   T  IPWGCE+DS ENASALQKWLSQSGLP QKMAI++V+VGERGLVALK
Subjt:  MAEASRIFYSPLVPNFRPLQKTHSTKPYTASFRRHSINCSVSTSDSARVAATGPIPWGCEIDSLENASALQKWLSQSGLPDQKMAIQRVDVGERGLVALK

Query:  NVRKGEKLLFVPPSLVISADSEWSCPEAGEMLKRNSVPDWPLIATYLISEASLMKSSRWNNYISALPRQPYSLLYWTREELDRYLEASEIRERAIERITN
        N+RKGEKLLFVPP+LVISA+SEWS  EAG++LK+NSVPDWP +ATYLISEAS+M+SSRW+NYISALPRQPYSLLYWTREELDRYLEAS+IRERAIERITN
Subjt:  NVRKGEKLLFVPPSLVISADSEWSCPEAGEMLKRNSVPDWPLIATYLISEASLMKSSRWNNYISALPRQPYSLLYWTREELDRYLEASEIRERAIERITN

Query:  VVGTYNDLRDRIFSKYPELFPEEVFNIETFKWSFGILFSRLVRLPSMDGKVALVPWADMLNHNCEVETFLDYDKSSQGVVFTTDRAYQPGEQVFISYGKK
        VVGTYNDLR RIFSKYP+LFPEEVFN++TFKWSFGILFSRLVRLPSM  K ALVPWADM+NH+C+VETFLDYDKSSQGVVFTTDR YQPGEQVFISYGKK
Subjt:  VVGTYNDLRDRIFSKYPELFPEEVFNIETFKWSFGILFSRLVRLPSMDGKVALVPWADMLNHNCEVETFLDYDKSSQGVVFTTDRAYQPGEQVFISYGKK

Query:  SNGELLLSYGFVPKEGSNPSDSVELLLSLKKSDKCYKEKLEALKKHGLRASQCFPIQITGWPLELKAFAYLAVSPPSMSKQFEEMAAAASNKSTATKDLN
        SNGELLLSYGFVPKEG+NP DSVEL +SL KSD+CYK+KLEAL+KHGL+ASQC+PI++TGWPLEL A+AYLAVSPPSMS QFEEMAAAASNK    K+L 
Subjt:  SNGELLLSYGFVPKEGSNPSDSVELLLSLKKSDKCYKEKLEALKKHGLRASQCFPIQITGWPLELKAFAYLAVSPPSMSKQFEEMAAAASNKSTATKDLN

Query:  YPDIEEEALQFILDSCETSISKYNKFLQASGSMDLDVTSPKQLNRRVFLKQLAVDLCTSERRILFRSQYLRPNFLKKNVIAFIDILSTDVSDSEKKIERL
        YPDIEEEALQFILDSCE+SISKYNKFLQ SGS+DLDVTSPKQLNRR+FLKQLAVDLCTSERRILFR+QY  P   +  +  F  + ST  S         
Subjt:  YPDIEEEALQFILDSCETSISKYNKFLQASGSMDLDVTSPKQLNRRVFLKQLAVDLCTSERRILFRSQYLRPNFLKKNVIAFIDILSTDVSDSEKKIERL

Query:  EKWSKFLSRRLNRRPSMDALSRNSISVASSSSPISPAIRAQQTESEAGSAAASPVNE--PKENDLKVKEWEVGMFQDEVAASQGIRIRRRPPTGPPLHYV
                  L+ R   ++ S  S S++SSSS  S  IRAQ+ +S+ GSA  +PV+E  P+ + +K+ EWEVGMF+DEVA SQGIRIRRRPPTGPPLHYV
Subjt:  EKWSKFLSRRLNRRPSMDALSRNSISVASSSSPISPAIRAQQTESEAGSAAASPVNE--PKENDLKVKEWEVGMFQDEVAASQGIRIRRRPPTGPPLHYV

Query:  GPFQFRLQNEGNTPRNILEEIIWYKDKEVSQMKERRPLFSLKKDLDSAPPARDFLGALKAAYVRTNLPGLIAEVKKASPSRGVLREDFDPVEIAQAYEKG
        GPF+FRLQNEGNTPRNILEEIIW KD EV+Q+KER+PL SLKK LD+  P RDF+GALK AY RT LPGLIAEVKKASPSRGV++E+FDPVEIAQAYEKG
Subjt:  GPFQFRLQNEGNTPRNILEEIIWYKDKEVSQMKERRPLFSLKKDLDSAPPARDFLGALKAAYVRTNLPGLIAEVKKASPSRGVLREDFDPVEIAQAYEKG

Query:  GAACLSVLTDEKFFQGSFENLQKIRNAGVECPLLCKEFVVDAWQIYYARSKGADAILLIAAVLPDLDIKYMTKICKMVGLTPLVEVHDEKEMDRMLAIEG
        GAACLSVL D K+FQGS+EN++ IRNAGV+CPLLCKEF+++AWQIYYARSKGADAILLIAAVLPDLDIKYM KICKM+GL  LVEVHDE+EMDR+L IEG
Subjt:  GAACLSVLTDEKFFQGSFENLQKIRNAGVECPLLCKEFVVDAWQIYYARSKGADAILLIAAVLPDLDIKYMTKICKMVGLTPLVEVHDEKEMDRMLAIEG

Query:  IELIGINNRNLETFEVDISNTKKLLEGERGQKIREKNITIVGESGLFTPDDIAYVQEAGVKAVLVGESIVKQSDPTKGIAGLFGKDIS
        IELIGINNRNLETFEVD S TK+LLEG+RG+ IRE++I +VGESGLFTP+DIAYVQ AGVKAVLVGESIVKQ DP KGIA LFG+DIS
Subjt:  IELIGINNRNLETFEVDISNTKKLLEGERGQKIREKNITIVGESGLFTPDDIAYVQEAGVKAVLVGESIVKQSDPTKGIAGLFGKDIS

KAG6572214.1 Ribulose-1,5 bisphosphate carboxylase/oxygenase large subunit N-methyltransferase, chloroplastic, partial [Cucurbita argyrosperma subsp. sororia]0.098.51Show/hide
Query:  MAEASRIFYSPLVPNFRPLQKTHSTKPYTASFRRHSINCSVSTSDSARVAATGPIPWGCEIDSLENASALQKWLSQSGLPDQKMAIQRVDVGERGLVALK
        MAEASRIFYSPLVPNFRPLQKTHS KPYTASFRRHSINCSVSTSDSARV ATGPIPWGCEIDSLENASALQKWLSQSGLPDQKMAIQRVDVGERGLVALK
Subjt:  MAEASRIFYSPLVPNFRPLQKTHSTKPYTASFRRHSINCSVSTSDSARVAATGPIPWGCEIDSLENASALQKWLSQSGLPDQKMAIQRVDVGERGLVALK

Query:  NVRKGEKLLFVPPSLVISADSEWSCPEAGEMLKRNSVPDWPLIATYLISEASLMKSSRWNNYISALPRQPYSLLYWTREELDRYLEASEIRERAIERITN
        NVRKGEKLLFVPPSLVISADSEWSCPEAGE+LKRNSVPDWPLIATYLISEASLMKSSRWNNYISALPRQPYSLLYWTREELDRYLEASEIRERAIERITN
Subjt:  NVRKGEKLLFVPPSLVISADSEWSCPEAGEMLKRNSVPDWPLIATYLISEASLMKSSRWNNYISALPRQPYSLLYWTREELDRYLEASEIRERAIERITN

Query:  VVGTYNDLRDRIFSKYPELFPEEVFNIETFKWSFGILFSRLVRLPSMDGKVALVPWADMLNHNCEVETFLDYDKSSQGVVFTTDRAYQPGEQVFISYGKK
        VVGTYNDLRDRIFSKYPELFPEEVFNIETFKWSFGILFSRLVRLPSMDGKVALVPWADMLNHNCEVETFLDYDK+SQGVVFTTDRAYQPGEQVFISYGKK
Subjt:  VVGTYNDLRDRIFSKYPELFPEEVFNIETFKWSFGILFSRLVRLPSMDGKVALVPWADMLNHNCEVETFLDYDKSSQGVVFTTDRAYQPGEQVFISYGKK

Query:  SNGELLLSYGFVPKEGSNPSDSVELLLSLKKSDKCYKEKLEALKKHGLRASQCFPIQITGWPLELKAFAYLAVSPPSMSKQFEEMAAAASNKSTATKDLN
        SNGELLLSYGFVPKEG+NPSDSVELLLSLKKSDKCYKEKLEALKKHGLRASQCFPIQITGWPLELKAFAYLAVSPPSMSKQFEEMAAAASNKST TKDLN
Subjt:  SNGELLLSYGFVPKEGSNPSDSVELLLSLKKSDKCYKEKLEALKKHGLRASQCFPIQITGWPLELKAFAYLAVSPPSMSKQFEEMAAAASNKSTATKDLN

Query:  YPDIEEEALQFILDSCETSISKYNKFLQASGSMDLDVTSPKQLNRRVFLKQLAVDLCTSERRILFRSQYL
        YPDIEEEALQFILDSCETSISKYNKFLQASGSMDLDVTSPKQLNRRVFLKQLAVDLCTSERRILFRSQY+
Subjt:  YPDIEEEALQFILDSCETSISKYNKFLQASGSMDLDVTSPKQLNRRVFLKQLAVDLCTSERRILFRSQYL

KAG7011853.1 Indole-3-glycerol phosphate synthase, chloroplastic [Cucurbita argyrosperma subsp. argyrosperma]0.089.4Show/hide
Query:  MAEASRIFYSPLVPNFRPLQKTHSTKPYTASFRRHSINCSVSTSDSARVAATGPIPWGCEIDSLENASALQKWLSQSGLPDQKMAIQRVDVGERGLVALK
        MAEASRIFYSPLVPNFRPLQKTHSTKPYTASFRRHSINCSVSTSDSARVAATGPIPWGCEIDSLENASALQKWLSQSGLPDQKMAIQRVDVGERGLVALK
Subjt:  MAEASRIFYSPLVPNFRPLQKTHSTKPYTASFRRHSINCSVSTSDSARVAATGPIPWGCEIDSLENASALQKWLSQSGLPDQKMAIQRVDVGERGLVALK

Query:  NVRKGEKLLFVPPSLVISADSEWSCPEAGEMLKRNSVPDWPLIATYLISEASLMKSSRWNNYISALPRQPYSLLYWTREELDRYLEASEIRERAIERITN
        NVRKGEKLLFVPPSLVISADSEWSCPEAGE+LKRNSVPDWPLIATYLISEASLMKSSRWNNYISALPRQPYSLLYWTREELDRYLEASEIRERAIERITN
Subjt:  NVRKGEKLLFVPPSLVISADSEWSCPEAGEMLKRNSVPDWPLIATYLISEASLMKSSRWNNYISALPRQPYSLLYWTREELDRYLEASEIRERAIERITN

Query:  VVGTYNDLRDRIFSKYPELFPEEVFNIETFKWSFGILFSRLVRLPSMDGKVALVPWADMLNHNCEVETFLDYDKSSQGVVFTTDRAYQPGEQVFISYGKK
        VVGTYNDLRDRIFSKYPELFPEEVFNIETFKWSFGILFSRLVRLPSMDGKVALVPWADMLNHNCEVETFLDYDK+SQGVVFTTDRAYQPGEQVFISYGKK
Subjt:  VVGTYNDLRDRIFSKYPELFPEEVFNIETFKWSFGILFSRLVRLPSMDGKVALVPWADMLNHNCEVETFLDYDKSSQGVVFTTDRAYQPGEQVFISYGKK

Query:  SNGELLLSYGFVPKEGSNPSDSVELLLSLKKSDKCYKEKLEALKKHGLRASQCFPIQITGWPLELKAFAYLAVSPPSMSKQFEEMAAAASNKSTATKDLN
        SNGELLLSYGFVPKEG+NPSDSVELLLSLKKSDKCYKEKLEALKKHGLRASQCFPIQITGWPLELKAFAYLAVSPPSMSKQFEEMAAAASNKST TKDLN
Subjt:  SNGELLLSYGFVPKEGSNPSDSVELLLSLKKSDKCYKEKLEALKKHGLRASQCFPIQITGWPLELKAFAYLAVSPPSMSKQFEEMAAAASNKSTATKDLN

Query:  YPDIEEEALQFILDSCETSISKYNKFLQASGSMDLDVTSPKQLNRRVFLKQLAVDLCTSERRILFRSQYLRPNFLKKNVIAFIDILSTDVSDSEKKIERL
        YPDIEEEALQFILDSCETSISKYNKFLQ                    L++   DL + E R L                                  +L
Subjt:  YPDIEEEALQFILDSCETSISKYNKFLQASGSMDLDVTSPKQLNRRVFLKQLAVDLCTSERRILFRSQYLRPNFLKKNVIAFIDILSTDVSDSEKKIERL

Query:  EKWSKFLSRRLNRRPSMDALSRNSISVASSSSPISPAIRAQQTESEAGSAAASPVNEPKENDLKVKEWEVGMFQDEVAASQGIRIRRRPPTGPPLHYVGP
         + SKFLSRRLNRRPSMDALSRNSISVASSSSPISPAIRAQQTESEAGSAAASPVNEPKENDLKVKEWEVGMFQDEVAASQGIRIRRRPPTGPPLHYVGP
Subjt:  EKWSKFLSRRLNRRPSMDALSRNSISVASSSSPISPAIRAQQTESEAGSAAASPVNEPKENDLKVKEWEVGMFQDEVAASQGIRIRRRPPTGPPLHYVGP

Query:  FQFRLQNEGNTPRNILEEIIWYKDKEVSQMKERRPLFSLKKDLDSAPPARDFLGALKAAYVRTNLPGLIAEVKKASPSRGVLREDFDPVEIAQAYEKGGA
        FQFRLQNEGNTPRNILEEIIWYKDKEVSQMKERRPLFSLKKDL+SAPPARDFLGALKAAY+RTNLPGLIAEVKKASPSRGVLREDFDPVEIA+AYEKGGA
Subjt:  FQFRLQNEGNTPRNILEEIIWYKDKEVSQMKERRPLFSLKKDLDSAPPARDFLGALKAAYVRTNLPGLIAEVKKASPSRGVLREDFDPVEIAQAYEKGGA

Query:  ACLSVLTDEKFFQGSFENLQKIRNAGVECPLLCKEFVVDAWQIYYARSKGADAILLIAAVLPDLDIKYMTKICKMVGLTPLVEVHDEKEMDRMLAIEGIE
        ACLSVLTDEKFFQGSFENLQKIRNAGVECPLLCKEFVVDAWQIYYARSKGADAILLIAAVLPDLDIKYMTKICKMVGLTPLVEVHDEKEMD         
Subjt:  ACLSVLTDEKFFQGSFENLQKIRNAGVECPLLCKEFVVDAWQIYYARSKGADAILLIAAVLPDLDIKYMTKICKMVGLTPLVEVHDEKEMDRMLAIEGIE

Query:  LIGINNRNLETFEVDISNTKKLLEGERGQKIREKNITIVGESGLFTPDDIAYVQEAGVKAVLVGESIVKQSDPTKGIAGLFGKDISA
                 +TFEVDISNTKKLLEGERGQKIREKNITIVGESGLFTPDDIAYVQEAGVKAVLVGESIVKQSDPTKGIAGLFGKDISA
Subjt:  LIGINNRNLETFEVDISNTKKLLEGERGQKIREKNITIVGESGLFTPDDIAYVQEAGVKAVLVGESIVKQSDPTKGIAGLFGKDISA

RXH67867.1 hypothetical protein DVH24_028014 [Malus domestica]0.074.89Show/hide
Query:  RIFYSPLVPNFRPLQKTHSTKPYTASFRRHSINCSVSTSDSARVAATGPIPWGCEIDSLENASALQKWLSQSGLPDQKMAIQRVDVGERGLVALKNVRKG
        ++  + L+P F P  K H+      +FRR+ + CSVST+D A+      IPWGC+IDSLENASALQKWL+ S LP QKMAI++V+VGERGLVALKN+RKG
Subjt:  RIFYSPLVPNFRPLQKTHSTKPYTASFRRHSINCSVSTSDSARVAATGPIPWGCEIDSLENASALQKWLSQSGLPDQKMAIQRVDVGERGLVALKNVRKG

Query:  EKLLFVPPSLVISADSEWSCPEAGEMLKRNSVPDWPLIATYLISEASLMKSSRWNNYISALPRQPYSLLYWTREELDRYLEASEIRERAIERITNVVGTY
        EKLLFVPPSL I+ADSEW+C EAGE+LK+N VPDWP +ATYLISEAS ++SSRW+NYISALPRQPYSLLYWTREELDRYLEAS+IR+RAIER+TNVVGTY
Subjt:  EKLLFVPPSLVISADSEWSCPEAGEMLKRNSVPDWPLIATYLISEASLMKSSRWNNYISALPRQPYSLLYWTREELDRYLEASEIRERAIERITNVVGTY

Query:  NDLRDRIFSKYPELFPEEVFNIETFKWSFGILFSRLVRLPSMDGKVALVPWADMLNHNCEVETFLDYDKSSQGVVFTTDRAYQPGEQVFISYGKKSNGEL
        NDLR RIFSKYP LFPEEVFN ETFKWSFGILFSRLVRLPS+ GKVALVPWADMLNH+C+V+ FLDYDKSSQGVVF+TD AY+PGEQVFISYGKKSNGEL
Subjt:  NDLRDRIFSKYPELFPEEVFNIETFKWSFGILFSRLVRLPSMDGKVALVPWADMLNHNCEVETFLDYDKSSQGVVFTTDRAYQPGEQVFISYGKKSNGEL

Query:  LLSYGFVPKEGSNPSDSVELLLSLKKSDKCYKEKLEALKKHGLRASQCFPIQITGWPLELKAFAYLAVSPPSMSKQFEEMAAAASNKSTATKDLNYPDIE
        LLSYGFVP+EG+NP DSVEL LSL KSDKCYKEKLE L+K+GL ASQC+P+QITGWPLEL A+AYLAVSPPSMS QF+EMAAAASNKSTA KDL Y +IE
Subjt:  LLSYGFVPKEGSNPSDSVELLLSLKKSDKCYKEKLEALKKHGLRASQCFPIQITGWPLELKAFAYLAVSPPSMSKQFEEMAAAASNKSTATKDLNYPDIE

Query:  EEALQFILDSCETSISKYNKFLQASGSMDLDVTSPKQLNRRVFLKQLAVDLCTSERRILFRSQYLRPNFLKKNVIAFIDILSTDVSDSEKKIERLEKWSK
        EEALQFILD+CE SISKYNKFLQASGSMDLDVTSPKQLNRR+FLKQLAVDL TSERRILFR+QY+    L+       DI S +   + + +      + 
Subjt:  EEALQFILDSCETSISKYNKFLQASGSMDLDVTSPKQLNRRVFLKQLAVDLCTSERRILFRSQYLRPNFLKKNVIAFIDILSTDVSDSEKKIERLEKWSK

Query:  FLSRRLNRRP----SMDALSRNSISVASSSSPISPAIRAQQTESEAGSAAASPVN---EPKENDLKVKEWEVGMFQDEVAASQGIRIRRRPPTGPPLHYV
          SR   R+P    + DA   NS+S+ SSS  I   +RAQQ ES+    +A  V    EP++ +L++KEWEVGM+ +EVAASQGIRIRRRPPTGPP HYV
Subjt:  FLSRRLNRRP----SMDALSRNSISVASSSSPISPAIRAQQTESEAGSAAASPVN---EPKENDLKVKEWEVGMFQDEVAASQGIRIRRRPPTGPPLHYV

Query:  GPFQFRLQNEGNTPRNILEEIIWYKDKEVSQMKERRPLFSLKKDLDSAPPARDFLGALKAAYVRTNLPGLIAEVKKASPSRGVLREDFDPVEIAQAYEKG
        GPF+FRLQNEGNTPRNILEEIIW+KD +V+Q+K+R+PL+SLKK LD+APP RDF+GAL+AA+ RT LPGLIAEVKKASPSRGVLRE+FDPVEIAQAYEKG
Subjt:  GPFQFRLQNEGNTPRNILEEIIWYKDKEVSQMKERRPLFSLKKDLDSAPPARDFLGALKAAYVRTNLPGLIAEVKKASPSRGVLREDFDPVEIAQAYEKG

Query:  GAACLSVLTDEKFFQGSFENLQKIRNAGVECPLLCKEFVVDAWQIYYARSKGADAILLIAAVLPDLDIKYMTKICKMVGLTPLVEVHDEKEMDRMLAIEG
        GAACLSVL DEK+FQGS+EN+  IRNAGV+CPLLCKEF+++AWQIYYAR+KGADAILLIAAVLPDLDIKYMTKICKM+G+  LVEVHDE+EMDR+L IEG
Subjt:  GAACLSVLTDEKFFQGSFENLQKIRNAGVECPLLCKEFVVDAWQIYYARSKGADAILLIAAVLPDLDIKYMTKICKMVGLTPLVEVHDEKEMDRMLAIEG

Query:  IELIGINNRNLETFEVDISNTKKLLEGERGQKIREKNITIVGESGLFTPDDIAYVQEAGVKAVLVGESIVKQSDPTKGIAGLFGKDIS
        +E IGINNRNLETF+VD SNTKKLLEGERG+ IR+++I +VGESGLFTPDDIAYVQEAGVKAVLVGESIVKQ DP KGIA LFGKDIS
Subjt:  IELIGINNRNLETFEVDISNTKKLLEGERGQKIREKNITIVGESGLFTPDDIAYVQEAGVKAVLVGESIVKQSDPTKGIAGLFGKDIS

XP_023554432.1 ribulose-1,5 bisphosphate carboxylase/oxygenase large subunit N-methyltransferase, chloroplastic [Cucurbita pepo subsp. pepo]0.099.79Show/hide
Query:  MAEASRIFYSPLVPNFRPLQKTHSTKPYTASFRRHSINCSVSTSDSARVAATGPIPWGCEIDSLENASALQKWLSQSGLPDQKMAIQRVDVGERGLVALK
        MAEASRIFYSPLVPNFRPLQKTHSTKPYTASFRRHSINCSVSTSDSARVAATGPIPWGCEIDSLENASALQKWLSQSGLPDQKMAIQRVDVGERGLVALK
Subjt:  MAEASRIFYSPLVPNFRPLQKTHSTKPYTASFRRHSINCSVSTSDSARVAATGPIPWGCEIDSLENASALQKWLSQSGLPDQKMAIQRVDVGERGLVALK

Query:  NVRKGEKLLFVPPSLVISADSEWSCPEAGEMLKRNSVPDWPLIATYLISEASLMKSSRWNNYISALPRQPYSLLYWTREELDRYLEASEIRERAIERITN
        NVRKGEKLLFVPPSLVISADSEWSCPEAGEMLKRNSVPDWPLIATYLISEASLMKSSRWNNYISALPRQPYSLLYWTREELDRYLEASEIRERAIERITN
Subjt:  NVRKGEKLLFVPPSLVISADSEWSCPEAGEMLKRNSVPDWPLIATYLISEASLMKSSRWNNYISALPRQPYSLLYWTREELDRYLEASEIRERAIERITN

Query:  VVGTYNDLRDRIFSKYPELFPEEVFNIETFKWSFGILFSRLVRLPSMDGKVALVPWADMLNHNCEVETFLDYDKSSQGVVFTTDRAYQPGEQVFISYGKK
        VVGTYNDLRDRIFSKYPELFPEEVFNIETFKWSFGILFSRLVRLPSMDGKVALVPWADMLNHNCEVETFLDYDKSSQGVVFTTDRAYQPGEQVFISYGKK
Subjt:  VVGTYNDLRDRIFSKYPELFPEEVFNIETFKWSFGILFSRLVRLPSMDGKVALVPWADMLNHNCEVETFLDYDKSSQGVVFTTDRAYQPGEQVFISYGKK

Query:  SNGELLLSYGFVPKEGSNPSDSVELLLSLKKSDKCYKEKLEALKKHGLRASQCFPIQITGWPLELKAFAYLAVSPPSMSKQFEEMAAAASNKSTATKDLN
        SNGELLLSYGFVPKEGSNPSDSVELLLSLKKSDKCYKEKLEALKKHGLRASQCFPIQITGWPLELKAFAYLAVSPPSMSKQFEEMAAAASNKSTATKDLN
Subjt:  SNGELLLSYGFVPKEGSNPSDSVELLLSLKKSDKCYKEKLEALKKHGLRASQCFPIQITGWPLELKAFAYLAVSPPSMSKQFEEMAAAASNKSTATKDLN

Query:  YPDIEEEALQFILDSCETSISKYNKFLQASGSMDLDVTSPKQLNRRVFLKQLAVDLCTSERRILFRSQYL
        YPDIEEEALQFILDSCETSISKYNKFLQASGSMDLDVTSPKQLNRRVFLKQLAVDLCTSERRILFRSQY+
Subjt:  YPDIEEEALQFILDSCETSISKYNKFLQASGSMDLDVTSPKQLNRRVFLKQLAVDLCTSERRILFRSQYL

TrEMBL top hitse value%identityAlignment
A0A498HEW2 Indole-3-glycerol-phosphate synthase0.074.89Show/hide
Query:  RIFYSPLVPNFRPLQKTHSTKPYTASFRRHSINCSVSTSDSARVAATGPIPWGCEIDSLENASALQKWLSQSGLPDQKMAIQRVDVGERGLVALKNVRKG
        ++  + L+P F P  K H+      +FRR+ + CSVST+D A+      IPWGC+IDSLENASALQKWL+ S LP QKMAI++V+VGERGLVALKN+RKG
Subjt:  RIFYSPLVPNFRPLQKTHSTKPYTASFRRHSINCSVSTSDSARVAATGPIPWGCEIDSLENASALQKWLSQSGLPDQKMAIQRVDVGERGLVALKNVRKG

Query:  EKLLFVPPSLVISADSEWSCPEAGEMLKRNSVPDWPLIATYLISEASLMKSSRWNNYISALPRQPYSLLYWTREELDRYLEASEIRERAIERITNVVGTY
        EKLLFVPPSL I+ADSEW+C EAGE+LK+N VPDWP +ATYLISEAS ++SSRW+NYISALPRQPYSLLYWTREELDRYLEAS+IR+RAIER+TNVVGTY
Subjt:  EKLLFVPPSLVISADSEWSCPEAGEMLKRNSVPDWPLIATYLISEASLMKSSRWNNYISALPRQPYSLLYWTREELDRYLEASEIRERAIERITNVVGTY

Query:  NDLRDRIFSKYPELFPEEVFNIETFKWSFGILFSRLVRLPSMDGKVALVPWADMLNHNCEVETFLDYDKSSQGVVFTTDRAYQPGEQVFISYGKKSNGEL
        NDLR RIFSKYP LFPEEVFN ETFKWSFGILFSRLVRLPS+ GKVALVPWADMLNH+C+V+ FLDYDKSSQGVVF+TD AY+PGEQVFISYGKKSNGEL
Subjt:  NDLRDRIFSKYPELFPEEVFNIETFKWSFGILFSRLVRLPSMDGKVALVPWADMLNHNCEVETFLDYDKSSQGVVFTTDRAYQPGEQVFISYGKKSNGEL

Query:  LLSYGFVPKEGSNPSDSVELLLSLKKSDKCYKEKLEALKKHGLRASQCFPIQITGWPLELKAFAYLAVSPPSMSKQFEEMAAAASNKSTATKDLNYPDIE
        LLSYGFVP+EG+NP DSVEL LSL KSDKCYKEKLE L+K+GL ASQC+P+QITGWPLEL A+AYLAVSPPSMS QF+EMAAAASNKSTA KDL Y +IE
Subjt:  LLSYGFVPKEGSNPSDSVELLLSLKKSDKCYKEKLEALKKHGLRASQCFPIQITGWPLELKAFAYLAVSPPSMSKQFEEMAAAASNKSTATKDLNYPDIE

Query:  EEALQFILDSCETSISKYNKFLQASGSMDLDVTSPKQLNRRVFLKQLAVDLCTSERRILFRSQYLRPNFLKKNVIAFIDILSTDVSDSEKKIERLEKWSK
        EEALQFILD+CE SISKYNKFLQASGSMDLDVTSPKQLNRR+FLKQLAVDL TSERRILFR+QY+    L+       DI S +   + + +      + 
Subjt:  EEALQFILDSCETSISKYNKFLQASGSMDLDVTSPKQLNRRVFLKQLAVDLCTSERRILFRSQYLRPNFLKKNVIAFIDILSTDVSDSEKKIERLEKWSK

Query:  FLSRRLNRRP----SMDALSRNSISVASSSSPISPAIRAQQTESEAGSAAASPVN---EPKENDLKVKEWEVGMFQDEVAASQGIRIRRRPPTGPPLHYV
          SR   R+P    + DA   NS+S+ SSS  I   +RAQQ ES+    +A  V    EP++ +L++KEWEVGM+ +EVAASQGIRIRRRPPTGPP HYV
Subjt:  FLSRRLNRRP----SMDALSRNSISVASSSSPISPAIRAQQTESEAGSAAASPVN---EPKENDLKVKEWEVGMFQDEVAASQGIRIRRRPPTGPPLHYV

Query:  GPFQFRLQNEGNTPRNILEEIIWYKDKEVSQMKERRPLFSLKKDLDSAPPARDFLGALKAAYVRTNLPGLIAEVKKASPSRGVLREDFDPVEIAQAYEKG
        GPF+FRLQNEGNTPRNILEEIIW+KD +V+Q+K+R+PL+SLKK LD+APP RDF+GAL+AA+ RT LPGLIAEVKKASPSRGVLRE+FDPVEIAQAYEKG
Subjt:  GPFQFRLQNEGNTPRNILEEIIWYKDKEVSQMKERRPLFSLKKDLDSAPPARDFLGALKAAYVRTNLPGLIAEVKKASPSRGVLREDFDPVEIAQAYEKG

Query:  GAACLSVLTDEKFFQGSFENLQKIRNAGVECPLLCKEFVVDAWQIYYARSKGADAILLIAAVLPDLDIKYMTKICKMVGLTPLVEVHDEKEMDRMLAIEG
        GAACLSVL DEK+FQGS+EN+  IRNAGV+CPLLCKEF+++AWQIYYAR+KGADAILLIAAVLPDLDIKYMTKICKM+G+  LVEVHDE+EMDR+L IEG
Subjt:  GAACLSVLTDEKFFQGSFENLQKIRNAGVECPLLCKEFVVDAWQIYYARSKGADAILLIAAVLPDLDIKYMTKICKMVGLTPLVEVHDEKEMDRMLAIEG

Query:  IELIGINNRNLETFEVDISNTKKLLEGERGQKIREKNITIVGESGLFTPDDIAYVQEAGVKAVLVGESIVKQSDPTKGIAGLFGKDIS
        +E IGINNRNLETF+VD SNTKKLLEGERG+ IR+++I +VGESGLFTPDDIAYVQEAGVKAVLVGESIVKQ DP KGIA LFGKDIS
Subjt:  IELIGINNRNLETFEVDISNTKKLLEGERGQKIREKNITIVGESGLFTPDDIAYVQEAGVKAVLVGESIVKQSDPTKGIAGLFGKDIS

A0A6J1GKA5 ribulose-1,5 bisphosphate carboxylase/oxygenase large subunit N-methyltransferase, chloroplastic0.098.51Show/hide
Query:  MAEASRIFYSPLVPNFRPLQKTHSTKPYTASFRRHSINCSVSTSDSARVAATGPIPWGCEIDSLENASALQKWLSQSGLPDQKMAIQRVDVGERGLVALK
        MAEASRIFYSPLVPNFRPLQKTHS KPYTASFRRHSINCSVSTSDSARV ATGPIPWGCEIDSLENASALQKWLSQSGLPDQKMAIQRVDVGERGLVALK
Subjt:  MAEASRIFYSPLVPNFRPLQKTHSTKPYTASFRRHSINCSVSTSDSARVAATGPIPWGCEIDSLENASALQKWLSQSGLPDQKMAIQRVDVGERGLVALK

Query:  NVRKGEKLLFVPPSLVISADSEWSCPEAGEMLKRNSVPDWPLIATYLISEASLMKSSRWNNYISALPRQPYSLLYWTREELDRYLEASEIRERAIERITN
        NVRKGEKLLFVPPSLVISADSEWSCPEAGE+LKRNSVPDWPLIATYLISEASLMKSSRWNNYISALPRQPYSLLYWTREELDRYLEASEIRERAIERITN
Subjt:  NVRKGEKLLFVPPSLVISADSEWSCPEAGEMLKRNSVPDWPLIATYLISEASLMKSSRWNNYISALPRQPYSLLYWTREELDRYLEASEIRERAIERITN

Query:  VVGTYNDLRDRIFSKYPELFPEEVFNIETFKWSFGILFSRLVRLPSMDGKVALVPWADMLNHNCEVETFLDYDKSSQGVVFTTDRAYQPGEQVFISYGKK
        VVGTYNDLRDRIFSKYPELFPEEVFNIETFKWSFGILFSRLVRLPSMDGKVALVPWADMLNHNCEVETFLDYDK+SQGVVFTTDRAYQPGEQVFISYGKK
Subjt:  VVGTYNDLRDRIFSKYPELFPEEVFNIETFKWSFGILFSRLVRLPSMDGKVALVPWADMLNHNCEVETFLDYDKSSQGVVFTTDRAYQPGEQVFISYGKK

Query:  SNGELLLSYGFVPKEGSNPSDSVELLLSLKKSDKCYKEKLEALKKHGLRASQCFPIQITGWPLELKAFAYLAVSPPSMSKQFEEMAAAASNKSTATKDLN
        SNGELLLSYGFVPKEG+NPSDSVELLLSLKKSDKCYKEKLEALKKHGLRASQCFPIQITGWPLELKAFAYLAVSPPSMSKQFEEMAAAASNKST TKDLN
Subjt:  SNGELLLSYGFVPKEGSNPSDSVELLLSLKKSDKCYKEKLEALKKHGLRASQCFPIQITGWPLELKAFAYLAVSPPSMSKQFEEMAAAASNKSTATKDLN

Query:  YPDIEEEALQFILDSCETSISKYNKFLQASGSMDLDVTSPKQLNRRVFLKQLAVDLCTSERRILFRSQYL
        YPDIEEEALQFILDSCETSISKYNKFLQASGSMDLDVTSPKQLNRRVFLKQLAVDLCTSERRILFRSQY+
Subjt:  YPDIEEEALQFILDSCETSISKYNKFLQASGSMDLDVTSPKQLNRRVFLKQLAVDLCTSERRILFRSQYL

A0A6J1HV83 ribulose-1,5 bisphosphate carboxylase/oxygenase large subunit N-methyltransferase, chloroplastic isoform X20.098.09Show/hide
Query:  MAEASRIFYSPLVPNFRPLQKTHSTKPYTASFRRHSINCSVSTSDSARVAATGPIPWGCEIDSLENASALQKWLSQSGLPDQKMAIQRVDVGERGLVALK
        MAEASRIFYSPLVPNFRPLQKTHSTKPYTASFRRHSINCSVSTSDSARVAATGPIPWGCEIDSLEN+SALQKWLSQSGLPDQKMAIQRVDVGERGLVALK
Subjt:  MAEASRIFYSPLVPNFRPLQKTHSTKPYTASFRRHSINCSVSTSDSARVAATGPIPWGCEIDSLENASALQKWLSQSGLPDQKMAIQRVDVGERGLVALK

Query:  NVRKGEKLLFVPPSLVISADSEWSCPEAGEMLKRNSVPDWPLIATYLISEASLMKSSRWNNYISALPRQPYSLLYWTREELDRYLEASEIRERAIERITN
        NVRKGEKLLFVPPSLVISADSEWSCPEAG++LKRNSVPDWPLIATYLISEASLMKSSRWNNYISALPRQPYSLLYWTREELDRYLEASEIRERAIERITN
Subjt:  NVRKGEKLLFVPPSLVISADSEWSCPEAGEMLKRNSVPDWPLIATYLISEASLMKSSRWNNYISALPRQPYSLLYWTREELDRYLEASEIRERAIERITN

Query:  VVGTYNDLRDRIFSKYPELFPEEVFNIETFKWSFGILFSRLVRLPSMDGKVALVPWADMLNHNCEVETFLDYDKSSQGVVFTTDRAYQPGEQVFISYGKK
        VVGTYNDLRDRIFSKYPELFPEEVFNIETFKWSFGILFSRLVRLP+MDGKVALVPWADMLNHNCEVETFLDYDK+SQG+VFTTDRAYQPGEQVFISYGKK
Subjt:  VVGTYNDLRDRIFSKYPELFPEEVFNIETFKWSFGILFSRLVRLPSMDGKVALVPWADMLNHNCEVETFLDYDKSSQGVVFTTDRAYQPGEQVFISYGKK

Query:  SNGELLLSYGFVPKEGSNPSDSVELLLSLKKSDKCYKEKLEALKKHGLRASQCFPIQITGWPLELKAFAYLAVSPPSMSKQFEEMAAAASNKSTATKDLN
        SNGELLLSYGFVPKEG+NPSDSVELLLSLKKSDKCYKEKLEALKKHGLRASQCFPIQITGWPLELKAFAYLAVSPPSMSKQFEEMAAAASNKSTATKDLN
Subjt:  SNGELLLSYGFVPKEGSNPSDSVELLLSLKKSDKCYKEKLEALKKHGLRASQCFPIQITGWPLELKAFAYLAVSPPSMSKQFEEMAAAASNKSTATKDLN

Query:  YPDIEEEALQFILDSCETSISKYNKFLQASGSMDLDVTSPKQLNRRVFLKQLAVDLCTSERRILFRSQYL
        YPDIEEEALQFILDSCE SISKYNKFLQASGSMDLDVTSPKQLNRRVFLKQLAVDLCTSERRILFRSQY+
Subjt:  YPDIEEEALQFILDSCETSISKYNKFLQASGSMDLDVTSPKQLNRRVFLKQLAVDLCTSERRILFRSQYL

A0A6J1HWK3 ribulose-1,5 bisphosphate carboxylase/oxygenase large subunit N-methyltransferase, chloroplastic isoform X10.098.09Show/hide
Query:  MAEASRIFYSPLVPNFRPLQKTHSTKPYTASFRRHSINCSVSTSDSARVAATGPIPWGCEIDSLENASALQKWLSQSGLPDQKMAIQRVDVGERGLVALK
        MAEASRIFYSPLVPNFRPLQKTHSTKPYTASFRRHSINCSVSTSDSARVAATGPIPWGCEIDSLEN+SALQKWLSQSGLPDQKMAIQRVDVGERGLVALK
Subjt:  MAEASRIFYSPLVPNFRPLQKTHSTKPYTASFRRHSINCSVSTSDSARVAATGPIPWGCEIDSLENASALQKWLSQSGLPDQKMAIQRVDVGERGLVALK

Query:  NVRKGEKLLFVPPSLVISADSEWSCPEAGEMLKRNSVPDWPLIATYLISEASLMKSSRWNNYISALPRQPYSLLYWTREELDRYLEASEIRERAIERITN
        NVRKGEKLLFVPPSLVISADSEWSCPEAG++LKRNSVPDWPLIATYLISEASLMKSSRWNNYISALPRQPYSLLYWTREELDRYLEASEIRERAIERITN
Subjt:  NVRKGEKLLFVPPSLVISADSEWSCPEAGEMLKRNSVPDWPLIATYLISEASLMKSSRWNNYISALPRQPYSLLYWTREELDRYLEASEIRERAIERITN

Query:  VVGTYNDLRDRIFSKYPELFPEEVFNIETFKWSFGILFSRLVRLPSMDGKVALVPWADMLNHNCEVETFLDYDKSSQGVVFTTDRAYQPGEQVFISYGKK
        VVGTYNDLRDRIFSKYPELFPEEVFNIETFKWSFGILFSRLVRLP+MDGKVALVPWADMLNHNCEVETFLDYDK+SQG+VFTTDRAYQPGEQVFISYGKK
Subjt:  VVGTYNDLRDRIFSKYPELFPEEVFNIETFKWSFGILFSRLVRLPSMDGKVALVPWADMLNHNCEVETFLDYDKSSQGVVFTTDRAYQPGEQVFISYGKK

Query:  SNGELLLSYGFVPKEGSNPSDSVELLLSLKKSDKCYKEKLEALKKHGLRASQCFPIQITGWPLELKAFAYLAVSPPSMSKQFEEMAAAASNKSTATKDLN
        SNGELLLSYGFVPKEG+NPSDSVELLLSLKKSDKCYKEKLEALKKHGLRASQCFPIQITGWPLELKAFAYLAVSPPSMSKQFEEMAAAASNKSTATKDLN
Subjt:  SNGELLLSYGFVPKEGSNPSDSVELLLSLKKSDKCYKEKLEALKKHGLRASQCFPIQITGWPLELKAFAYLAVSPPSMSKQFEEMAAAASNKSTATKDLN

Query:  YPDIEEEALQFILDSCETSISKYNKFLQASGSMDLDVTSPKQLNRRVFLKQLAVDLCTSERRILFRSQYL
        YPDIEEEALQFILDSCE SISKYNKFLQASGSMDLDVTSPKQLNRRVFLKQLAVDLCTSERRILFRSQY+
Subjt:  YPDIEEEALQFILDSCETSISKYNKFLQASGSMDLDVTSPKQLNRRVFLKQLAVDLCTSERRILFRSQYL

A0A7J6DST9 Indole-3-glycerol-phosphate synthase0.076.01Show/hide
Query:  MAEASRIFYSPLVPNFRPLQKTHSTKPYTASFRRHSINCSVSTSDSARVAATGPIPWGCEIDSLENASALQKWLSQSGLPDQKMAIQRVDVGERGLVALK
        MAEASR+  + LV NF   Q+TH ++    SFRR+ ++CSVSTS++ +   T  IPWGCE+DS ENASALQKWLSQSGLP QKMAI++V+VGERGLVALK
Subjt:  MAEASRIFYSPLVPNFRPLQKTHSTKPYTASFRRHSINCSVSTSDSARVAATGPIPWGCEIDSLENASALQKWLSQSGLPDQKMAIQRVDVGERGLVALK

Query:  NVRKGEKLLFVPPSLVISADSEWSCPEAGEMLKRNSVPDWPLIATYLISEASLMKSSRWNNYISALPRQPYSLLYWTREELDRYLEASEIRERAIERITN
        N+RKGEKLLFVPP+LVISA+SEWS  EAG++LK+NSVPDWP +ATYLISEAS+M+SSRW+NYISALPRQPYSLLYWTREELDRYLEAS+IRERAIERITN
Subjt:  NVRKGEKLLFVPPSLVISADSEWSCPEAGEMLKRNSVPDWPLIATYLISEASLMKSSRWNNYISALPRQPYSLLYWTREELDRYLEASEIRERAIERITN

Query:  VVGTYNDLRDRIFSKYPELFPEEVFNIETFKWSFGILFSRLVRLPSMDGKVALVPWADMLNHNCEVETFLDYDKSSQGVVFTTDRAYQPGEQVFISYGKK
        VVGTYNDLR RIFSKYP+LFPEEVFN++TFKWSFGILFSRLVRLPSM  K ALVPWADM+NH+C+VETFLDYDKSSQGVVFTTDR YQPGEQVFISYGKK
Subjt:  VVGTYNDLRDRIFSKYPELFPEEVFNIETFKWSFGILFSRLVRLPSMDGKVALVPWADMLNHNCEVETFLDYDKSSQGVVFTTDRAYQPGEQVFISYGKK

Query:  SNGELLLSYGFVPKEGSNPSDSVELLLSLKKSDKCYKEKLEALKKHGLRASQCFPIQITGWPLELKAFAYLAVSPPSMSKQFEEMAAAASNKSTATKDLN
        SNGELLLSYGFVPKEG+NP DSVEL +SL KSD+CYK+KLEAL+KHGL+ASQC+PI++TGWPLEL A+AYLAVSPPSMS QFEEMAAAASNK    K+L 
Subjt:  SNGELLLSYGFVPKEGSNPSDSVELLLSLKKSDKCYKEKLEALKKHGLRASQCFPIQITGWPLELKAFAYLAVSPPSMSKQFEEMAAAASNKSTATKDLN

Query:  YPDIEEEALQFILDSCETSISKYNKFLQASGSMDLDVTSPKQLNRRVFLKQLAVDLCTSERRILFRSQYLRPNFLKKNVIAFIDILSTDVSDSEKKIERL
        YPDIEEEALQFILDSCE+SISKYNKFLQ SGS+DLDVTSPKQLNRR+FLKQLAVDLCTSERRILFR+QY  P   +  +  F  + ST  S         
Subjt:  YPDIEEEALQFILDSCETSISKYNKFLQASGSMDLDVTSPKQLNRRVFLKQLAVDLCTSERRILFRSQYLRPNFLKKNVIAFIDILSTDVSDSEKKIERL

Query:  EKWSKFLSRRLNRRPSMDALSRNSISVASSSSPISPAIRAQQTESEAGSAAASPVNE--PKENDLKVKEWEVGMFQDEVAASQGIRIRRRPPTGPPLHYV
                  L+ R   ++ S  S S++SSSS  S  IRAQ+ +S+ GSA  +PV+E  P+ + +K+ EWEVGMF+DEVA SQGIRIRRRPPTGPPLHYV
Subjt:  EKWSKFLSRRLNRRPSMDALSRNSISVASSSSPISPAIRAQQTESEAGSAAASPVNE--PKENDLKVKEWEVGMFQDEVAASQGIRIRRRPPTGPPLHYV

Query:  GPFQFRLQNEGNTPRNILEEIIWYKDKEVSQMKERRPLFSLKKDLDSAPPARDFLGALKAAYVRTNLPGLIAEVKKASPSRGVLREDFDPVEIAQAYEKG
        GPF+FRLQNEGNTPRNILEEIIW KD EV+Q+KER+PL SLKK LD+  P RDF+GALK AY RT LPGLIAEVKKASPSRGV++E+FDPVEIAQAYEKG
Subjt:  GPFQFRLQNEGNTPRNILEEIIWYKDKEVSQMKERRPLFSLKKDLDSAPPARDFLGALKAAYVRTNLPGLIAEVKKASPSRGVLREDFDPVEIAQAYEKG

Query:  GAACLSVLTDEKFFQGSFENLQKIRNAGVECPLLCKEFVVDAWQIYYARSKGADAILLIAAVLPDLDIKYMTKICKMVGLTPLVEVHDEKEMDRMLAIEG
        GAACLSVL D K+FQGS+EN++ IRNAGV+CPLLCKEF+++AWQIYYARSKGADAILLIAAVLPDLDIKYM KICKM+GL  LVEVHDE+EMDR+L IEG
Subjt:  GAACLSVLTDEKFFQGSFENLQKIRNAGVECPLLCKEFVVDAWQIYYARSKGADAILLIAAVLPDLDIKYMTKICKMVGLTPLVEVHDEKEMDRMLAIEG

Query:  IELIGINNRNLETFEVDISNTKKLLEGERGQKIREKNITIVGESGLFTPDDIAYVQEAGVKAVLVGESIVKQSDPTKGIAGLFGKDIS
        IELIGINNRNLETFEVD S TK+LLEG+RG+ IRE++I +VGESGLFTP+DIAYVQ AGVKAVLVGESIVKQ DP KGIA LFG+DIS
Subjt:  IELIGINNRNLETFEVDISNTKKLLEGERGQKIREKNITIVGESGLFTPDDIAYVQEAGVKAVLVGESIVKQSDPTKGIAGLFGKDIS

SwissProt top hitse value%identityAlignment
B0JTM2 Indole-3-glycerol phosphate synthase6.5e-8858.19Show/hide
Query:  IRIRRRPPTGPPLHYVGPFQFRLQNEGNTPRNILEEIIWYKDKEVSQMKERRPLFSLKKDLDSAPPARDFLGALKAAYVRTNLPGLIAEVKKASPSRGVL
        ++IRR+ P   P   V    + ++     P+NILEEI+W+K+ EV +++ER PL  L++ + +  P  DFL ALK    +   P LIAEVKKASPS+GV+
Subjt:  IRIRRRPPTGPPLHYVGPFQFRLQNEGNTPRNILEEIIWYKDKEVSQMKERRPLFSLKKDLDSAPPARDFLGALKAAYVRTNLPGLIAEVKKASPSRGVL

Query:  REDFDPVEIAQAYEKGGAACLSVLTDEKFFQGSFENLQKIRNAGVECPLLCKEFVVDAWQIYYARSKGADAILLIAAVLPDLDIKYMTKICKMVGLTPLV
         EDFDPV IA+ YE+GGA CLSVLTD KFFQGS+ENL  +R A V  PLLCKEF++  +QIYYARSKGADA+LLIAA+L D D+ Y  KI K +G+T LV
Subjt:  REDFDPVEIAQAYEKGGAACLSVLTDEKFFQGSFENLQKIRNAGVECPLLCKEFVVDAWQIYYARSKGADAILLIAAVLPDLDIKYMTKICKMVGLTPLV

Query:  EVHDEKEMDRMLAIEGIELIGINNRNLETFEVDISNTKKLLEGERGQKIREKNITIVGESGLFTPDDIAYVQEAGVKAVLVGESIVKQSDPTKGIAGLF
        EVH   E DR+LAIEGIELIGINNRNLETF VD+ NT++LLE  RG+++REK I IV ESGL T  D+A V++AG  AVL+GES+VK  DP  GI  LF
Subjt:  EVHDEKEMDRMLAIEGIELIGINNRNLETFEVDISNTKKLLEGERGQKIREKNITIVGESGLFTPDDIAYVQEAGVKAVLVGESIVKQSDPTKGIAGLF

B1WQE4 Indole-3-glycerol phosphate synthase4.5e-8957.19Show/hide
Query:  IRIRRRPPTGPPLHYVGPFQFRLQNEGNTPRNILEEIIWYKDKEVSQMKERRPLFSLKKDLDSAPPARDFLGALKAAYVRTNLPGLIAEVKKASPSRGVL
        ++IRRRPP   P   V   ++++ +      +ILEEI+W+K+KEV +M++R  L  L+K   SAPPA+DFLGA+     +   P LIAEVKKASPS+GV+
Subjt:  IRIRRRPPTGPPLHYVGPFQFRLQNEGNTPRNILEEIIWYKDKEVSQMKERRPLFSLKKDLDSAPPARDFLGALKAAYVRTNLPGLIAEVKKASPSRGVL

Query:  REDFDPVEIAQAYEKGGAACLSVLTDEKFFQGSFENLQKIRNAGVECPLLCKEFVVDAWQIYYARSKGADAILLIAAVLPDLDIKYMTKICKMVGLTPLV
        REDF+PV IAQAY +GGA+CLSVLTD KFFQGSF+NL  +R A V+ PLLCKEF++  +QIY AR KGADAILLIAA+L D D++Y+ KI   +G+TPLV
Subjt:  REDFDPVEIAQAYEKGGAACLSVLTDEKFFQGSFENLQKIRNAGVECPLLCKEFVVDAWQIYYARSKGADAILLIAAVLPDLDIKYMTKICKMVGLTPLV

Query:  EVHDEKEMDRMLAIEGIELIGINNRNLETFEVDISNTKKLLEGERGQKIREKNITIVGESGLFTPDDIAYVQEAGVKAVLVGESIVKQSDPTKGIAGLF
        EVH   E+DR+LAIEG+ L+GINNRNLETFEV +  T  L+   R  +I+E+ I IV ESG+ TP  +  V EAG  AVL+GES+VKQ DPT+ IA LF
Subjt:  EVHDEKEMDRMLAIEGIELIGINNRNLETFEVDISNTKKLLEGERGQKIREKNITIVGESGLFTPDDIAYVQEAGVKAVLVGESIVKQSDPTKGIAGLF

B7K0H0 Indole-3-glycerol phosphate synthase2.8e-9158Show/hide
Query:  IRIRRRPPTGPPLHYVGPFQFRLQNEGNTPRNILEEIIWYKDKEVSQMKERRPLFSLKKDLDSAPPARDFLGALKAAYVRTNLPGLIAEVKKASPSRGVL
        ++IRRR P  PP+  V   +++++     PR+ILEEI+W+K+KEV +++E  PL  L+K +   PP +DFLGA+     +   P LIAEVKKASPS+GV+
Subjt:  IRIRRRPPTGPPLHYVGPFQFRLQNEGNTPRNILEEIIWYKDKEVSQMKERRPLFSLKKDLDSAPPARDFLGALKAAYVRTNLPGLIAEVKKASPSRGVL

Query:  REDFDPVEIAQAYEKGGAACLSVLTDEKFFQGSFENLQKIRNAGVECPLLCKEFVVDAWQIYYARSKGADAILLIAAVLPDLDIKYMTKICKMVGLTPLV
        REDFDPV IAQAY KGGA+CLSVLTD KFFQGSFENL  +R + V+ PLLCKEF++  +QIY AR+KGADA+LLIAA+L D D+ Y  KI + +G+T L+
Subjt:  REDFDPVEIAQAYEKGGAACLSVLTDEKFFQGSFENLQKIRNAGVECPLLCKEFVVDAWQIYYARSKGADAILLIAAVLPDLDIKYMTKICKMVGLTPLV

Query:  EVHDEKEMDRMLAIEGIELIGINNRNLETFEVDISNTKKLLEGERGQKIREKNITIVGESGLFTPDDIAYVQEAGVKAVLVGESIVKQSDPTKGIAGLFG
        EVH   E+DR+LAIEG+ LIGINNRNLETFEVD+  T +LL   R  KI+   I I+ ESGL TPDD+ +VQ+AG   VL+GES+VKQ DPT+ IA LFG
Subjt:  EVHDEKEMDRMLAIEGIELIGINNRNLETFEVDISNTKKLLEGERGQKIREKNITIVGESGLFTPDDIAYVQEAGVKAVLVGESIVKQSDPTKGIAGLFG

P49572 Indole-3-glycerol phosphate synthase, chloroplastic3.5e-14271.07Show/hide
Query:  SPISPAIRAQQTESEAGSAAASPVNEPKENDLKVKEWEVGMFQDEVAASQGIRIRRRPPTGPPLHYVGPFQFRL-QNEGNTPRNILEEIIWYKDKEVSQM
        +P   +IRAQQ++ +   A +S   E K N L++KEWEV M+Q+E+A SQGIRIRR+PP+  PL Y GPF+ RL  N+ ++PRNILEEI WYKD EVS+M
Subjt:  SPISPAIRAQQTESEAGSAAASPVNEPKENDLKVKEWEVGMFQDEVAASQGIRIRRRPPTGPPLHYVGPFQFRL-QNEGNTPRNILEEIIWYKDKEVSQM

Query:  KERRPLFSLKKDLDSAPPARDFLGALKAAYVRTNLPGLIAEVKKASPSRGVLREDFDPVEIAQAYEKGGAACLSVLTDEKFFQGSFENLQKIRNAGVECP
        KE  PL  LKK ++ APP RDF+GAL+ A+ RT  PGLIAEVKKASPSRG+L+E+FDPVEIAQAYEKGGAACLSVLTD+K+FQG FENL+ IR+AGV+CP
Subjt:  KERRPLFSLKKDLDSAPPARDFLGALKAAYVRTNLPGLIAEVKKASPSRGVLREDFDPVEIAQAYEKGGAACLSVLTDEKFFQGSFENLQKIRNAGVECP

Query:  LLCKEFVVDAWQIYYARSKGADAILLIAAVLPDLDIKYMTKICKMVGLTPLVEVHDEKEMDRMLAIEGIELIGINNRNLETFEVDISNTKKLLEGERGQK
        LLCKEFVVD WQIYYAR+KGADA+LLIAAVL DL+I ++ KICK + L  LVEVHDE+EM R+L IEGIEL+GINNR+LETFEVDISNTKKLLEGE G++
Subjt:  LLCKEFVVDAWQIYYARSKGADAILLIAAVLPDLDIKYMTKICKMVGLTPLVEVHDEKEMDRMLAIEGIELIGINNRNLETFEVDISNTKKLLEGERGQK

Query:  IREKNITIVGESGLFTPDDIAYVQEAGVKAVLVGESIVKQSDPTKGIAGLFGKDIS
        IRE+++ +VGESGLFTPDDIAYVQ AGVKAVLVGESIVKQ+DP KGIAGLFG++IS
Subjt:  IREKNITIVGESGLFTPDDIAYVQEAGVKAVLVGESIVKQSDPTKGIAGLFGKDIS

Q55508 Indole-3-glycerol phosphate synthase6.1e-8654.49Show/hide
Query:  IRIRRRPPTGPPLHYVGPFQFRLQNEGNTPRNILEEIIWYKDKEVSQMKERRPLFSLKKDLDSAPPARDFLGALKAAYVRTNLPGLIAEVKKASPSRGVL
        + IRRRPP  PP+  V   Q+++++    PR+ILEEI+W+K+KEV+Q +E  PL  L+  +    P  DF+GAL+ +      P LIAEVKKASPS+G++
Subjt:  IRIRRRPPTGPPLHYVGPFQFRLQNEGNTPRNILEEIIWYKDKEVSQMKERRPLFSLKKDLDSAPPARDFLGALKAAYVRTNLPGLIAEVKKASPSRGVL

Query:  REDFDPVEIAQAYEKGGAACLSVLTDEKFFQGSFENLQKIRNAGVECPLLCKEFVVDAWQIYYARSKGADAILLIAAVLPDLDIKYMTKICKMVGLTPLV
        R DFDPV IA+AYE GGA CLSVLTDEKFFQGSFENLQ +R+A V+ PLLCKEF++  +QIY ARS+GADA+LLIAA+L D D++Y  KI + +G+  LV
Subjt:  REDFDPVEIAQAYEKGGAACLSVLTDEKFFQGSFENLQKIRNAGVECPLLCKEFVVDAWQIYYARSKGADAILLIAAVLPDLDIKYMTKICKMVGLTPLV

Query:  EVHDEKEMDRMLAIEGIELIGINNRNLETFEVDISNTKKLLEGERGQKIREKNITIVGESGLFTPDDIAYVQEAGVKAVLVGESIVKQSDPTKGIAGLFG
        EVH  +EMDR+LA++G++LIG+NNRNL+TF VD+  T+ L   +R +++ + +IT+V ESG++   D+  +Q+AG +AVLVGES+VKQ DP + IA L+G
Subjt:  EVHDEKEMDRMLAIEGIELIGINNRNLETFEVDISNTKKLLEGERGQKIREKNITIVGESGLFTPDDIAYVQEAGVKAVLVGESIVKQSDPTKGIAGLFG

Query:  K
        +
Subjt:  K

Arabidopsis top hitse value%identityAlignment
AT2G04400.1 Aldolase-type TIM barrel family protein2.5e-14371.07Show/hide
Query:  SPISPAIRAQQTESEAGSAAASPVNEPKENDLKVKEWEVGMFQDEVAASQGIRIRRRPPTGPPLHYVGPFQFRL-QNEGNTPRNILEEIIWYKDKEVSQM
        +P   +IRAQQ++ +   A +S   E K N L++KEWEV M+Q+E+A SQGIRIRR+PP+  PL Y GPF+ RL  N+ ++PRNILEEI WYKD EVS+M
Subjt:  SPISPAIRAQQTESEAGSAAASPVNEPKENDLKVKEWEVGMFQDEVAASQGIRIRRRPPTGPPLHYVGPFQFRL-QNEGNTPRNILEEIIWYKDKEVSQM

Query:  KERRPLFSLKKDLDSAPPARDFLGALKAAYVRTNLPGLIAEVKKASPSRGVLREDFDPVEIAQAYEKGGAACLSVLTDEKFFQGSFENLQKIRNAGVECP
        KE  PL  LKK ++ APP RDF+GAL+ A+ RT  PGLIAEVKKASPSRG+L+E+FDPVEIAQAYEKGGAACLSVLTD+K+FQG FENL+ IR+AGV+CP
Subjt:  KERRPLFSLKKDLDSAPPARDFLGALKAAYVRTNLPGLIAEVKKASPSRGVLREDFDPVEIAQAYEKGGAACLSVLTDEKFFQGSFENLQKIRNAGVECP

Query:  LLCKEFVVDAWQIYYARSKGADAILLIAAVLPDLDIKYMTKICKMVGLTPLVEVHDEKEMDRMLAIEGIELIGINNRNLETFEVDISNTKKLLEGERGQK
        LLCKEFVVD WQIYYAR+KGADA+LLIAAVL DL+I ++ KICK + L  LVEVHDE+EM R+L IEGIEL+GINNR+LETFEVDISNTKKLLEGE G++
Subjt:  LLCKEFVVDAWQIYYARSKGADAILLIAAVLPDLDIKYMTKICKMVGLTPLVEVHDEKEMDRMLAIEGIELIGINNRNLETFEVDISNTKKLLEGERGQK

Query:  IREKNITIVGESGLFTPDDIAYVQEAGVKAVLVGESIVKQSDPTKGIAGLFGKDIS
        IRE+++ +VGESGLFTPDDIAYVQ AGVKAVLVGESIVKQ+DP KGIAGLFG++IS
Subjt:  IREKNITIVGESGLFTPDDIAYVQEAGVKAVLVGESIVKQSDPTKGIAGLFGKDIS

AT3G07670.1 Rubisco methyltransferase family protein7.1e-20775.31Show/hide
Query:  MAEASRIFYSPLVPNFRPLQK------THSTKPYTASFRRHSINCSVSTSDSA--RVAATGPIPWGCEIDSLENASALQKWLSQSGLPDQKMAIQRVDVG
        MA+A  +  S L+P + PL K      T S  P   S  R  I+CSVS  ++    +     I WGCEIDSLENA++LQ WLS SGLP QKMAI RVD+G
Subjt:  MAEASRIFYSPLVPNFRPLQK------THSTKPYTASFRRHSINCSVSTSDSA--RVAATGPIPWGCEIDSLENASALQKWLSQSGLPDQKMAIQRVDVG

Query:  ERGLVALKNVRKGEKLLFVPPSLVISADSEWSCPEAGEMLKRNSVPDWPLIATYLISEASLMKSSRWNNYISALPRQPYSLLYWTREELDRYLEASEIRE
        ERGLVA +N+RKGEKLLFVPPSLVISADSEW+  EAGE++KR  VPDWPL+ATYLISEASL KSSRW NYISALPRQPYSLLYWTR ELD YLEAS+IRE
Subjt:  ERGLVALKNVRKGEKLLFVPPSLVISADSEWSCPEAGEMLKRNSVPDWPLIATYLISEASLMKSSRWNNYISALPRQPYSLLYWTREELDRYLEASEIRE

Query:  RAIERITNVVGTYNDLRDRIFSKYPELFPEEVFNIETFKWSFGILFSRLVRLPSMDGKVALVPWADMLNHNCEVETFLDYDKSSQGVVFTTDRAYQPGEQ
        RAIERITNVVGTY DLR RIFSK+P+LFP+EVFN ETFKWSFGILFSRLVRLPSMDG+ ALVPWADMLNHNCEVETFLDYDKSS+GVVFTTDR YQPGEQ
Subjt:  RAIERITNVVGTYNDLRDRIFSKYPELFPEEVFNIETFKWSFGILFSRLVRLPSMDGKVALVPWADMLNHNCEVETFLDYDKSSQGVVFTTDRAYQPGEQ

Query:  VFISYGKKSNGELLLSYGFVPKEGSNPSDSVELLLSLKKSDKCYKEKLEALKKHGLRASQCFPIQITGWPLELKAFAYLAVSPPSMSKQFEEMAAAASNK
        VFISYG KSNGELLLSYGFVP+EG+NPSDSVEL LSL+K+DKCY+EKL+ALKKHGL   QCFP++ITGWP+EL A+AYL VSPP M   FEEMA AASNK
Subjt:  VFISYGKKSNGELLLSYGFVPKEGSNPSDSVELLLSLKKSDKCYKEKLEALKKHGLRASQCFPIQITGWPLELKAFAYLAVSPPSMSKQFEEMAAAASNK

Query:  STATKDLNYPDIEEEALQFILDSCETSISKYNKFLQASGSMDLDVTSPKQLNRRVFLKQLAVDLCTSERRILFRSQYL
        ++   DL YP+IEE+ALQFILDSCETSISKY++FL+ SGSMDLD+TSPKQLNR+ FLKQLAVDL TSERRIL+R+QY+
Subjt:  STATKDLNYPDIEEEALQFILDSCETSISKYNKFLQASGSMDLDVTSPKQLNRRVFLKQLAVDLCTSERRILFRSQYL

AT5G48220.1 Aldolase-type TIM barrel family protein4.5e-15373.35Show/hide
Query:  NSISVASSSSPISPAIRAQQTESEAGSAAASPVNEPKENDLKVKEWEVGMFQDEVAASQGIRIRRRPPTGPPLHYVGPFQFRLQNEGNTPRNILEEIIWY
        N+++  SS +P    +RAQ++    GS +A         + KV E EVGM+Q+EV  SQGIRIRRRPPTGPPLHYVGPF+FRLQNEGNTPRNILEEI+W+
Subjt:  NSISVASSSSPISPAIRAQQTESEAGSAAASPVNEPKENDLKVKEWEVGMFQDEVAASQGIRIRRRPPTGPPLHYVGPFQFRLQNEGNTPRNILEEIIWY

Query:  KDKEVSQMKERRPLFSLKKDLDSAPPARDFLGALKAAYVRTNLPGLIAEVKKASPSRGVLREDFDPVEIAQAYEKGGAACLSVLTDEKFFQGSFENLQKI
        KDKEV+QMKER+PL+SLKK LD+ PPA+DF+GAL++A+ RT LPGLIAEVKKASPSRG+LREDF+PVEIAQAYEKGGAACLSVLTD+K+F+GS+ENLQ I
Subjt:  KDKEVSQMKERRPLFSLKKDLDSAPPARDFLGALKAAYVRTNLPGLIAEVKKASPSRGVLREDFDPVEIAQAYEKGGAACLSVLTDEKFFQGSFENLQKI

Query:  RNAGVECPLLCKEFVVDAWQIYYARSKGADAILLIAAVLPDLDIKYMTKICKMVGLTPLVEVHDEKEMDRMLAIEGIELIGINNRNLETFEVDISNTKKL
          AGV+CPLL KEF+V+AWQIYY RSKGADA+LLIA+VLPDLDIKYM KICK++G+  LVEVHDE+EMDR+LAIEG+ELIGINNRNLETFEVD+  TKKL
Subjt:  RNAGVECPLLCKEFVVDAWQIYYARSKGADAILLIAAVLPDLDIKYMTKICKMVGLTPLVEVHDEKEMDRMLAIEGIELIGINNRNLETFEVDISNTKKL

Query:  LEGERGQKIREKNITIVGESGLFTPDDIAYVQEAGVKAVLVGESIVKQSDPTKGIAGLFGKDIS
        LEGERG+ IR+K+I +VGESGLFTP+DIA+VQEAGVKAVLVGES++KQSDP K I+ LFG+D+S
Subjt:  LEGERGQKIREKNITIVGESGLFTPDDIAYVQEAGVKAVLVGESIVKQSDPTKGIAGLFGKDIS

AT5G48220.2 Aldolase-type TIM barrel family protein3.6e-15079.68Show/hide
Query:  MFQDEVAASQGIRIRRRPPTGPPLHYVGPFQFRLQNEGNTPRNILEEIIWYKDKEVSQMKERRPLFSLKKDLDSAPPARDFLGALKAAYVRTNLPGLIAE
        M+Q+EV  SQGIRIRRRPPTGPPLHYVGPF+FRLQNEGNTPRNILEEI+W+KDKEV+QMKER+PL+SLKK LD+ PPA+DF+GAL++A+ RT LPGLIAE
Subjt:  MFQDEVAASQGIRIRRRPPTGPPLHYVGPFQFRLQNEGNTPRNILEEIIWYKDKEVSQMKERRPLFSLKKDLDSAPPARDFLGALKAAYVRTNLPGLIAE

Query:  VKKASPSRGVLREDFDPVEIAQAYEKGGAACLSVLTDEKFFQGSFENLQKIRNAGVECPLLCKEFVVDAWQIYYARSKGADAILLIAAVLPDLDIKYMTK
        VKKASPSRG+LREDF+PVEIAQAYEKGGAACLSVLTD+K+F+GS+ENLQ I  AGV+CPLL KEF+V+AWQIYY RSKGADA+LLIA+VLPDLDIKYM K
Subjt:  VKKASPSRGVLREDFDPVEIAQAYEKGGAACLSVLTDEKFFQGSFENLQKIRNAGVECPLLCKEFVVDAWQIYYARSKGADAILLIAAVLPDLDIKYMTK

Query:  ICKMVGLTPLVEVHDEKEMDRMLAIEGIELIGINNRNLETFEVDISNTKKLLEGERGQKIREKNITIVGESGLFTPDDIAYVQEAGVKAVLVGESIVKQS
        ICK++G+  LVEVHDE+EMDR+LAIEG+ELIGINNRNLETFEVD+  TKKLLEGERG+ IR+K+I +VGESGLFTP+DIA+VQEAGVKAVLVGES++KQS
Subjt:  ICKMVGLTPLVEVHDEKEMDRMLAIEGIELIGINNRNLETFEVDISNTKKLLEGERGQKIREKNITIVGESGLFTPDDIAYVQEAGVKAVLVGESIVKQS

Query:  DPTKGIAGLFGKDIS
        DP K I+ LFG+D+S
Subjt:  DPTKGIAGLFGKDIS

AT5G48220.3 Aldolase-type TIM barrel family protein2.7e-15379.08Show/hide
Query:  DLKVKEWEVGMFQDEVAASQGIRIRRRPPTGPPLHYVGPFQFRLQNEGNTPRNILEEIIWYKDKEVSQMKERRPLFSLKKDLDSAPPARDFLGALKAAYV
        + KV E EVGM+Q+EV  SQGIRIRRRPPTGPPLHYVGPF+FRLQNEGNTPRNILEEI+W+KDKEV+QMKER+PL+SLKK LD+ PPA+DF+GAL++A+ 
Subjt:  DLKVKEWEVGMFQDEVAASQGIRIRRRPPTGPPLHYVGPFQFRLQNEGNTPRNILEEIIWYKDKEVSQMKERRPLFSLKKDLDSAPPARDFLGALKAAYV

Query:  RTNLPGLIAEVKKASPSRGVLREDFDPVEIAQAYEKGGAACLSVLTDEKFFQGSFENLQKIRNAGVECPLLCKEFVVDAWQIYYARSKGADAILLIAAVL
        RT LPGLIAEVKKASPSRG+LREDF+PVEIAQAYEKGGAACLSVLTD+K+F+GS+ENLQ I  AGV+CPLL KEF+V+AWQIYY RSKGADA+LLIA+VL
Subjt:  RTNLPGLIAEVKKASPSRGVLREDFDPVEIAQAYEKGGAACLSVLTDEKFFQGSFENLQKIRNAGVECPLLCKEFVVDAWQIYYARSKGADAILLIAAVL

Query:  PDLDIKYMTKICKMVGLTPLVEVHDEKEMDRMLAIEGIELIGINNRNLETFEVDISNTKKLLEGERGQKIREKNITIVGESGLFTPDDIAYVQEAGVKAV
        PDLDIKYM KICK++G+  LVEVHDE+EMDR+LAIEG+ELIGINNRNLETFEVD+  TKKLLEGERG+ IR+K+I +VGESGLFTP+DIA+VQEAGVKAV
Subjt:  PDLDIKYMTKICKMVGLTPLVEVHDEKEMDRMLAIEGIELIGINNRNLETFEVDISNTKKLLEGERGQKIREKNITIVGESGLFTPDDIAYVQEAGVKAV

Query:  LVGESIVKQSDPTKGIAGLFGKDIS
        LVGES++KQSDP K I+ LFG+D+S
Subjt:  LVGESIVKQSDPTKGIAGLFGKDIS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTGAAGCTTCAAGGATTTTCTATTCTCCGCTTGTTCCCAACTTCCGTCCACTCCAGAAGACGCATTCAACCAAGCCCTACACAGCATCTTTTCGCAGACATTCGAT
CAATTGCTCTGTCTCCACAAGTGACAGTGCTAGAGTGGCAGCGACCGGGCCGATTCCATGGGGTTGCGAGATCGATTCGCTGGAGAATGCTTCTGCTCTACAGAAATGGC
TGTCGCAGTCTGGTCTCCCTGACCAGAAGATGGCGATACAGAGAGTGGATGTTGGAGAGAGAGGACTTGTGGCTTTGAAGAATGTTAGGAAAGGGGAGAAATTACTGTTT
GTGCCGCCCTCTCTCGTTATCTCTGCAGACTCGGAGTGGAGCTGCCCTGAGGCTGGGGAGATGTTAAAACGTAATTCAGTACCAGATTGGCCACTTATTGCAACCTACTT
GATTAGCGAAGCAAGTTTAATGAAATCATCAAGATGGAACAACTATATATCAGCCTTACCTCGGCAACCTTATTCACTTTTGTACTGGACACGAGAAGAGCTAGATCGAT
ATCTGGAAGCATCAGAGATCAGAGAAAGGGCAATAGAAAGGATCACAAATGTTGTTGGAACATACAATGATTTAAGGGACAGAATATTTTCCAAATATCCTGAATTATTC
CCCGAGGAGGTATTCAATATTGAAACTTTCAAGTGGTCATTTGGGATTCTTTTCTCCCGCTTGGTTCGGTTACCTTCAATGGATGGAAAGGTTGCATTGGTTCCCTGGGC
TGATATGCTGAATCATAACTGTGAGGTGGAGACTTTCTTGGATTATGATAAATCATCCCAAGGCGTTGTCTTCACAACAGATCGAGCATACCAACCAGGTGAGCAGGTGT
TCATTTCTTATGGCAAGAAATCTAATGGGGAGTTGTTGTTGTCATATGGTTTTGTTCCAAAGGAGGGAAGCAATCCTAGCGATTCTGTTGAGTTGTTACTCTCACTAAAA
AAATCTGATAAATGTTACAAGGAGAAGCTGGAAGCACTGAAAAAGCATGGATTAAGAGCATCCCAGTGTTTTCCTATTCAAATCACTGGCTGGCCTTTAGAATTAAAGGC
ATTTGCTTATTTAGCTGTCAGTCCACCAAGTATGAGCAAGCAGTTTGAAGAGATGGCTGCTGCAGCATCAAATAAATCAACAGCAACGAAGGACTTAAATTACCCTGATA
TAGAGGAGGAAGCCCTGCAGTTCATATTAGATAGCTGCGAAACTAGCATATCTAAATATAACAAATTCTTGCAGGCAAGTGGATCCATGGATTTGGACGTGACGTCACCA
AAACAACTGAACCGTAGAGTATTTTTAAAACAGCTAGCAGTAGACTTGTGCACGAGCGAACGAAGGATACTTTTTCGAAGCCAATATTTGAGACCTAACTTTCTTAAAAA
AAATGTGATTGCGTTTATTGATATCTTGAGTACGGATGTTTCAGATTCTGAGAAGAAGATTGAGAGACTTGAGAAGTGGTCCAAGTTTCTCTCCAGAAGGCTGAATCGTC
GGCCTTCAATGGATGCTTTGTCGAGGAATTCAATCTCTGTTGCCTCTTCTTCTTCCCCCATTTCCCCTGCCATCCGAGCTCAACAGACAGAATCCGAGGCTGGATCAGCT
GCGGCTTCCCCAGTTAATGAACCAAAAGAAAATGATCTGAAAGTGAAGGAGTGGGAAGTGGGGATGTTCCAAGATGAAGTAGCAGCAAGTCAAGGCATAAGGATTCGCCG
CCGGCCGCCCACCGGGCCGCCATTGCATTACGTTGGACCTTTTCAGTTCAGATTACAGAATGAAGGGAACACTCCTAGGAATATTTTGGAGGAGATCATATGGTACAAGG
ACAAGGAAGTTTCACAGATGAAAGAAAGAAGGCCTCTCTTTTCGTTGAAAAAGGATCTTGACAGCGCCCCTCCTGCTAGAGATTTCCTTGGAGCTCTCAAGGCTGCCTAT
GTTAGAACTAACTTACCTGGTTTGATTGCTGAAGTGAAAAAGGCTTCACCTAGCAGAGGAGTTCTAAGGGAGGATTTTGATCCAGTTGAGATTGCCCAAGCGTACGAGAA
AGGCGGAGCAGCATGCCTTAGTGTTCTCACAGACGAGAAGTTTTTTCAGGGAAGCTTTGAGAATCTGCAGAAGATAAGGAATGCCGGAGTGGAGTGCCCTCTCTTGTGCA
AGGAGTTCGTGGTCGATGCATGGCAGATCTACTATGCTCGATCCAAAGGTGCCGATGCCATTCTTTTGATTGCTGCTGTTTTGCCTGATCTCGACATTAAATATATGACT
AAAATCTGCAAGATGGTCGGTTTGACCCCCCTCGTTGAGGTTCACGACGAGAAGGAAATGGACCGTATGCTTGCGATCGAAGGCATTGAGCTTATTGGCATCAACAATCG
GAATCTCGAAACATTCGAGGTCGATATCAGCAACACAAAAAAGCTTCTCGAAGGAGAGCGCGGACAAAAGATCCGTGAAAAGAACATAACCATAGTGGGAGAATCTGGGC
TGTTCACTCCTGATGATATTGCTTATGTGCAAGAAGCTGGTGTTAAAGCTGTTCTCGTTGGCGAGTCGATTGTGAAACAGAGCGACCCGACGAAGGGAATAGCCGGGCTT
TTTGGCAAAGATATTTCTGCTTAA
mRNA sequenceShow/hide mRNA sequence
TTTATTTTATTTTCTCACTTATAGAATAATATTTATGTTTAATTAATAATAATAATAATAAGTTTTGCTTATTTCAATTGGCGCGCGTCCACAACCGTCAAAAACGACGA
CGACCTTTTTATTGGCTTCTCTGGCGATAAGGTTCCTTTACTCTCTCGTCCTCGAGCTCAGCCATGGCTGAAGCTTCAAGGATTTTCTATTCTCCGCTTGTTCCCAACTT
CCGTCCACTCCAGAAGACGCATTCAACCAAGCCCTACACAGCATCTTTTCGCAGACATTCGATCAATTGCTCTGTCTCCACAAGTGACAGTGCTAGAGTGGCAGCGACCG
GGCCGATTCCATGGGGTTGCGAGATCGATTCGCTGGAGAATGCTTCTGCTCTACAGAAATGGCTGTCGCAGTCTGGTCTCCCTGACCAGAAGATGGCGATACAGAGAGTG
GATGTTGGAGAGAGAGGACTTGTGGCTTTGAAGAATGTTAGGAAAGGGGAGAAATTACTGTTTGTGCCGCCCTCTCTCGTTATCTCTGCAGACTCGGAGTGGAGCTGCCC
TGAGGCTGGGGAGATGTTAAAACGTAATTCAGTACCAGATTGGCCACTTATTGCAACCTACTTGATTAGCGAAGCAAGTTTAATGAAATCATCAAGATGGAACAACTATA
TATCAGCCTTACCTCGGCAACCTTATTCACTTTTGTACTGGACACGAGAAGAGCTAGATCGATATCTGGAAGCATCAGAGATCAGAGAAAGGGCAATAGAAAGGATCACA
AATGTTGTTGGAACATACAATGATTTAAGGGACAGAATATTTTCCAAATATCCTGAATTATTCCCCGAGGAGGTATTCAATATTGAAACTTTCAAGTGGTCATTTGGGAT
TCTTTTCTCCCGCTTGGTTCGGTTACCTTCAATGGATGGAAAGGTTGCATTGGTTCCCTGGGCTGATATGCTGAATCATAACTGTGAGGTGGAGACTTTCTTGGATTATG
ATAAATCATCCCAAGGCGTTGTCTTCACAACAGATCGAGCATACCAACCAGGTGAGCAGGTGTTCATTTCTTATGGCAAGAAATCTAATGGGGAGTTGTTGTTGTCATAT
GGTTTTGTTCCAAAGGAGGGAAGCAATCCTAGCGATTCTGTTGAGTTGTTACTCTCACTAAAAAAATCTGATAAATGTTACAAGGAGAAGCTGGAAGCACTGAAAAAGCA
TGGATTAAGAGCATCCCAGTGTTTTCCTATTCAAATCACTGGCTGGCCTTTAGAATTAAAGGCATTTGCTTATTTAGCTGTCAGTCCACCAAGTATGAGCAAGCAGTTTG
AAGAGATGGCTGCTGCAGCATCAAATAAATCAACAGCAACGAAGGACTTAAATTACCCTGATATAGAGGAGGAAGCCCTGCAGTTCATATTAGATAGCTGCGAAACTAGC
ATATCTAAATATAACAAATTCTTGCAGGCAAGTGGATCCATGGATTTGGACGTGACGTCACCAAAACAACTGAACCGTAGAGTATTTTTAAAACAGCTAGCAGTAGACTT
GTGCACGAGCGAACGAAGGATACTTTTTCGAAGCCAATATTTGAGACCTAACTTTCTTAAAAAAAATGTGATTGCGTTTATTGATATCTTGAGTACGGATGTTTCAGATT
CTGAGAAGAAGATTGAGAGACTTGAGAAGTGGTCCAAGTTTCTCTCCAGAAGGCTGAATCGTCGGCCTTCAATGGATGCTTTGTCGAGGAATTCAATCTCTGTTGCCTCT
TCTTCTTCCCCCATTTCCCCTGCCATCCGAGCTCAACAGACAGAATCCGAGGCTGGATCAGCTGCGGCTTCCCCAGTTAATGAACCAAAAGAAAATGATCTGAAAGTGAA
GGAGTGGGAAGTGGGGATGTTCCAAGATGAAGTAGCAGCAAGTCAAGGCATAAGGATTCGCCGCCGGCCGCCCACCGGGCCGCCATTGCATTACGTTGGACCTTTTCAGT
TCAGATTACAGAATGAAGGGAACACTCCTAGGAATATTTTGGAGGAGATCATATGGTACAAGGACAAGGAAGTTTCACAGATGAAAGAAAGAAGGCCTCTCTTTTCGTTG
AAAAAGGATCTTGACAGCGCCCCTCCTGCTAGAGATTTCCTTGGAGCTCTCAAGGCTGCCTATGTTAGAACTAACTTACCTGGTTTGATTGCTGAAGTGAAAAAGGCTTC
ACCTAGCAGAGGAGTTCTAAGGGAGGATTTTGATCCAGTTGAGATTGCCCAAGCGTACGAGAAAGGCGGAGCAGCATGCCTTAGTGTTCTCACAGACGAGAAGTTTTTTC
AGGGAAGCTTTGAGAATCTGCAGAAGATAAGGAATGCCGGAGTGGAGTGCCCTCTCTTGTGCAAGGAGTTCGTGGTCGATGCATGGCAGATCTACTATGCTCGATCCAAA
GGTGCCGATGCCATTCTTTTGATTGCTGCTGTTTTGCCTGATCTCGACATTAAATATATGACTAAAATCTGCAAGATGGTCGGTTTGACCCCCCTCGTTGAGGTTCACGA
CGAGAAGGAAATGGACCGTATGCTTGCGATCGAAGGCATTGAGCTTATTGGCATCAACAATCGGAATCTCGAAACATTCGAGGTCGATATCAGCAACACAAAAAAGCTTC
TCGAAGGAGAGCGCGGACAAAAGATCCGTGAAAAGAACATAACCATAGTGGGAGAATCTGGGCTGTTCACTCCTGATGATATTGCTTATGTGCAAGAAGCTGGTGTTAAA
GCTGTTCTCGTTGGCGAGTCGATTGTGAAACAGAGCGACCCGACGAAGGGAATAGCCGGGCTTTTTGGCAAAGATATTTCTGCTTAAAGCTTTTAGGAAACTTGAAACTG
CTGTATGTTGAACAGGTTTTGATACCCATTTGGCTCTGATCTTATGAACCTATGGTATTGAGTTGGAACTGATTAGATCCCAAAATGGGAAAATAAAGCTCAAGCTATAT
ATTTATACCATAACTGTTGCGCAC
Protein sequenceShow/hide protein sequence
MAEASRIFYSPLVPNFRPLQKTHSTKPYTASFRRHSINCSVSTSDSARVAATGPIPWGCEIDSLENASALQKWLSQSGLPDQKMAIQRVDVGERGLVALKNVRKGEKLLF
VPPSLVISADSEWSCPEAGEMLKRNSVPDWPLIATYLISEASLMKSSRWNNYISALPRQPYSLLYWTREELDRYLEASEIRERAIERITNVVGTYNDLRDRIFSKYPELF
PEEVFNIETFKWSFGILFSRLVRLPSMDGKVALVPWADMLNHNCEVETFLDYDKSSQGVVFTTDRAYQPGEQVFISYGKKSNGELLLSYGFVPKEGSNPSDSVELLLSLK
KSDKCYKEKLEALKKHGLRASQCFPIQITGWPLELKAFAYLAVSPPSMSKQFEEMAAAASNKSTATKDLNYPDIEEEALQFILDSCETSISKYNKFLQASGSMDLDVTSP
KQLNRRVFLKQLAVDLCTSERRILFRSQYLRPNFLKKNVIAFIDILSTDVSDSEKKIERLEKWSKFLSRRLNRRPSMDALSRNSISVASSSSPISPAIRAQQTESEAGSA
AASPVNEPKENDLKVKEWEVGMFQDEVAASQGIRIRRRPPTGPPLHYVGPFQFRLQNEGNTPRNILEEIIWYKDKEVSQMKERRPLFSLKKDLDSAPPARDFLGALKAAY
VRTNLPGLIAEVKKASPSRGVLREDFDPVEIAQAYEKGGAACLSVLTDEKFFQGSFENLQKIRNAGVECPLLCKEFVVDAWQIYYARSKGADAILLIAAVLPDLDIKYMT
KICKMVGLTPLVEVHDEKEMDRMLAIEGIELIGINNRNLETFEVDISNTKKLLEGERGQKIREKNITIVGESGLFTPDDIAYVQEAGVKAVLVGESIVKQSDPTKGIAGL
FGKDISA