; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc02G03045 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc02G03045
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionIndole-3-glycerol-phosphate synthase
Genome locationClcChr02:2662232..2673385
RNA-Seq ExpressionClc02G03045
SyntenyClc02G03045
Gene Ontology termsGO:0000162 - tryptophan biosynthetic process (biological process)
GO:0003682 - chromatin binding (molecular function)
GO:0004425 - indole-3-glycerol-phosphate synthase activity (molecular function)
GO:0004640 - phosphoribosylanthranilate isomerase activity (molecular function)
InterPro domainsIPR001468 - Indole-3-glycerol phosphate synthase, conserved site
IPR011060 - Ribulose-phosphate binding barrel
IPR013785 - Aldolase-type TIM barrel
IPR013798 - Indole-3-glycerol phosphate synthase
IPR032001 - SAWADEE domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
OVA19762.1 Indole-3-glycerol phosphate synthase [Macleaya cordata]1.0e-19850.06Show/hide
Query:  MEGLASLRTIPKVSFPPISS--STRRSKFLSRRLNFSPSMDTSLRNSISLASSSSSSPISTAIRAQQIESEAGSAAASPVTELEENALKVKEWEVGMFQD
        ME  +   T P+VS P IS+   T+RS  + R +N    + T  R SI+L            IRAQQ E + GS   SP ++  +NALK+KEWEVG FQ+
Subjt:  MEGLASLRTIPKVSFPPISS--STRRSKFLSRRLNFSPSMDTSLRNSISLASSSSSSPISTAIRAQQIESEAGSAAASPVTELEENALKVKEWEVGMFQD

Query:  EVAASQGIRIRRRPPTGPPLHY--------------------------------MKERRPLISLKRDLERAPPARDFLGALKAAYLRTNLPGLIAEVKKA
        E+AA+QGIRIRRRPPTGPPLHY                                +KERRPL SLK+ L+ APP RDF+GAL+ ++LRT LP LIAEVKKA
Subjt:  EVAASQGIRIRRRPPTGPPLHY--------------------------------MKERRPLISLKRDLERAPPARDFLGALKAAYLRTNLPGLIAEVKKA

Query:  SPSRGVLREDFDPVEIAQAYEKGGAACLSVLTDQKFFQGSFENLEKIRNAGVKCPLLCKEFVVDAWQIYYARSKGADAILLIAAVLPDLDIKYMTKICKM
        SPSRGVLREDFDPV+IAQAYEKGGAACLSVLTD+K+FQGSFENLEKIR AG++CPLLCKEF++DAWQIYYAR+KGADAILLIAAVLPDLDI+YMTKICK+
Subjt:  SPSRGVLREDFDPVEIAQAYEKGGAACLSVLTDQKFFQGSFENLEKIRNAGVKCPLLCKEFVVDAWQIYYARSKGADAILLIAAVLPDLDIKYMTKICKM

Query:  VGLTPLVEVHDEKEMDRMLAIDGIELIGINNRNLETFEVDISNTKRLLEGERGQNIRKKNVTIVGESGLFTPDDIAYVQEAGVKAVLVGESIVKQSDPTK
        +GL  LVEVHDE+EMDR+L IDGIELIGINNRNLETFEVDISNTK+LLEGERG+ IR+K++ +VGESGLFTPDDIAYVQEAGVKAVLVGESIVKQ+DP K
Subjt:  VGLTPLVEVHDEKEMDRMLAIDGIELIGINNRNLETFEVDISNTKRLLEGERGQNIRKKNVTIVGESGLFTPDDIAYVQEAGVKAVLVGESIVKQSDPTK

Query:  GITGLFEKLPLELASISPAMFSAGAAASGE-----LEFRWDDDAWYNVTVKLEGDDLRISYCEFDKEHDNVFHANHFRSLSELSDFEARFRPLSRQLQDS
        GI+GLFE       S+S    S G   + +     L+FR  DDAWY + + LE + L + Y  F +E++  ++A  F++L ++  FE +FRP S QLQD+
Subjt:  GITGLFEKLPLELASISPAMFSAGAAASGE-----LEFRWDDDAWYNVTVKLEGDDLRISYCEFDKEHDNVFHANHFRSLSELSDFEARFRPLSRQLQDS

Query:  ECPNVVPGMPVCASHSSRADDVRFYDALVEGVDYLEHSYANGEEECLCNFILLWQHGPNSGNLTFASIANLCQIQFDEIN-DTVLATFFAKVREKIQTRM
        EC  +V GM V AS+     D++FY+A+++ + Y  H +  GEEEC C+F++ W HGP  G+     I ++C++       D  LA+F    REK++   
Subjt:  ECPNVVPGMPVCASHSSRADDVRFYDALVEGVDYLEHSYANGEEECLCNFILLWQHGPNSGNLTFASIANLCQIQFDEIN-DTVLATFFAKVREKIQTRM

Query:  NRGGTCSEDRLLTHNDGGAHQKDECSLKLKRRLSFFERMDQVDTRRAKRSSGAVGLWEDQQTLSSRKSGVIEQDTDIGGVKYQ----------FMILLEN
                  L +HN G    +D+CSL                              E    +  +K+G   QD D+GG                IL++N
Subjt:  NRGGTCSEDRLLTHNDGGAHQKDECSLKLKRRLSFFERMDQVDTRRAKRSSGAVGLWEDQQTLSSRKSGVIEQDTDIGGVKYQ----------FMILLEN

Query:  LDKELSPVKLAKFLHAQTLILPRVYIFPSLTFEAYAKGAVVLNCRKNLERLCDFLDNPDHVILSSQGRPLVVTGRIAGRETFGTL
        L+K+L+P  +  F+H    I  + Y+FPSL  E Y +G +V++ ++ L++L DFL NP H+I+SS GRP V+T       TFG L
Subjt:  LDKELSPVKLAKFLHAQTLILPRVYIFPSLTFEAYAKGAVVLNCRKNLERLCDFLDNPDHVILSSQGRPLVVTGRIAGRETFGTL

TYJ09753.1 hypothetical protein E1A91_A11G161700v1 [Gossypium mustelinum]3.1e-20349.76Show/hide
Query:  MEGLASLRTIPKVSFPPISSSTRRSKFLSRRLNFSPSMDTSLRNSISLASSSSSSPISTAIRAQQIESEAGSAAASPVTELEENALKVKEWEVGMFQDEV
        MEGL SL++ P+ S     S  +     +RR     SMD  LR         SS P   AIRAQQ          SP  E EE+ALKVKEWEVGMFQ+EV
Subjt:  MEGLASLRTIPKVSFPPISSSTRRSKFLSRRLNFSPSMDTSLRNSISLASSSSSSPISTAIRAQQIESEAGSAAASPVTELEENALKVKEWEVGMFQDEV

Query:  AASQGIRIRRRPPTGPPLHY--------------------------------MKERRPLISLKRDLERAPPARDFLGALKAAYLRTNLPGLIAEVKKASP
        AASQGIRIRRRPPT PPLHY                                MKER+PL +LK+ +E AP  RDF+GALKAA+ RT LPGLIAEVKKASP
Subjt:  AASQGIRIRRRPPTGPPLHY--------------------------------MKERRPLISLKRDLERAPPARDFLGALKAAYLRTNLPGLIAEVKKASP

Query:  SRGVLREDFDPVEIAQAYEKGGAACLSVLTDQKFFQGSFENLEKIRNAGVKCPLLCKEFVVDAWQIYYARSKGADAILLIAAVLPDLDIKYMTKICKMVG
        SRG+LREDFDPVEIA+AYEKGGAACLSVLTD+KFF+GSFENLE IRNAGV+CPLLCKEFV+DAWQIYYAR KGADAILLIAAVLPDLDI+YM KICKM+G
Subjt:  SRGVLREDFDPVEIAQAYEKGGAACLSVLTDQKFFQGSFENLEKIRNAGVKCPLLCKEFVVDAWQIYYARSKGADAILLIAAVLPDLDIKYMTKICKMVG

Query:  LTPLVEVHDEKEMDRMLAIDGIELIGINNRNLETFEVDISNTKRLLEGERGQNIRKKNVTIVGESGLFTPDDIAYVQEAGVKAVLVGESIVKQSDPTKGI
        L  LVEVHDE+EMDR+L IDGIELIGINNRNLETFEVDISNTK+LLEGE GQ IR+K++ +VGESGLFTPD + YVQEAGVKA    E  V+ +      
Subjt:  LTPLVEVHDEKEMDRMLAIDGIELIGINNRNLETFEVDISNTKRLLEGERGQNIRKKNVTIVGESGLFTPDDIAYVQEAGVKAVLVGESIVKQSDPTKGI

Query:  TGLFEKLPLELASISPAMFSAGAAASGELEFR-WDDDAWYNVTVKLEG---DDLRISYCEFDKEHDNVFHANHFRSLSELSDFEARFRPLSRQLQDSECP
        +G    L               A    + EFR + DDAWY+V + LEG   + LR+ Y EF    DNVF A++F+S  EL DF  RFR +S QLQD  C 
Subjt:  TGLFEKLPLELASISPAMFSAGAAASGELEFR-WDDDAWYNVTVKLEG---DDLRISYCEFDKEHDNVFHANHFRSLSELSDFEARFRPLSRQLQDSECP

Query:  NVVPGMPVCASHSSRADDVRFYDALVEGVDYLEHSYANGEEECLCNFILLWQHGPNSGNLTFASIANLCQIQFDEINDTVLATFFAKVREKIQTRMN-RG
         VV GM VCAS S  A +V FYDA+V+ V   +HS  NG+EEC C F+L W HGPN GNLT   +A++C +Q  E++  ++      ++  ++   +   
Subjt:  NVVPGMPVCASHSSRADDVRFYDALVEGVDYLEHSYANGEEECLCNFILLWQHGPNSGNLTFASIANLCQIQFDEINDTVLATFFAKVREKIQTRMN-RG

Query:  GTCSEDRLLTHNDGGAHQKDECSLKLKRRLSFFERMDQVDTRRAKRSSGAVGLWEDQQTLSSRKSGVIEQDTDIGGVKYQFMILLENLDKELSPVKLAKF
        GT S+D +             C++  + R +    +     +          +W  Q      +     QDTD+GG K  ++IL++NL+K+LS   ++KF
Subjt:  GTCSEDRLLTHNDGGAHQKDECSLKLKRRLSFFERMDQVDTRRAKRSSGAVGLWEDQQTLSSRKSGVIEQDTDIGGVKYQFMILLENLDKELSPVKLAKF

Query:  LHAQTLILPRVYIFPSLTFEAYAKGAVVLNCRKNLERLCDFLDNPDHVILSSQGRPLVVTGRIAGRETFGTLAAGAMVLDSENKFGNEKDGRVRCELKVV
        +H QT I  +VYIFPSL +E Y  G + ++C+K++E+L  FL +P+   +SS GRPLV T +++  + +       ++L S NK  N ++G    ELKVV
Subjt:  LHAQTLILPRVYIFPSLTFEAYAKGAVVLNCRKNLERLCDFLDNPDHVILSSQGRPLVVTGRIAGRETFGTLAAGAMVLDSENKFGNEKDGRVRCELKVV

Query:  KVGTNEYLTAKHMKELFMEFLNHQRRLHQRLAMEEGKI
          GT EY  AK +++LF++F++HQ+ L+++L  EE  I
Subjt:  KVGTNEYLTAKHMKELFMEFLNHQRRLHQRLAMEEGKI

TYJ09754.1 hypothetical protein E1A91_A11G161700v1 [Gossypium mustelinum]6.5e-20149.23Show/hide
Query:  MEGLASLRTIPKVSFPPISSSTRRSKFLSRRLNFSPSMDTSLRNSISLASSSSSSPISTAIRAQQIESEAGSAAASPVTELEENALKVKEWEVGMFQDEV
        MEGL SL++ P+ S     S  +     +RR     SMD  LR         SS P   AIRAQQ          SP  E EE+ALKVKEWEVGMFQ+EV
Subjt:  MEGLASLRTIPKVSFPPISSSTRRSKFLSRRLNFSPSMDTSLRNSISLASSSSSSPISTAIRAQQIESEAGSAAASPVTELEENALKVKEWEVGMFQDEV

Query:  AASQGIRIRRRPPTGPPLHY--------------------------------MKERRPLISLKRDLERAPPARDFLGALKAAYLRTNLPGLIAEVKKASP
        AASQGIRIRRRPPT PPLHY                                MKER+PL +LK+ +E AP  RDF+GALKAA+ RT LPGLIAEVKKASP
Subjt:  AASQGIRIRRRPPTGPPLHY--------------------------------MKERRPLISLKRDLERAPPARDFLGALKAAYLRTNLPGLIAEVKKASP

Query:  SRGVLREDFDPVEIAQAYEKGGAACLSVLTDQKFFQGSFENLEKIRNAGVKCPLLCKEFVVDAWQIYYARSKGADAILLIAAVLPDLDIKYMTKICKMVG
        SRG+LREDFDPVEIA+AYEKGGAACLSVLTD+KFF+GSFENLE IRNAGV+CPLLCKEFV+DAWQIYYAR KGADAILLIAAVLPDLDI+YM KICKM+G
Subjt:  SRGVLREDFDPVEIAQAYEKGGAACLSVLTDQKFFQGSFENLEKIRNAGVKCPLLCKEFVVDAWQIYYARSKGADAILLIAAVLPDLDIKYMTKICKMVG

Query:  LTPLVEVHDEKEMDRMLAIDGIELIGINNRNLETFEVDISNTKRLLEGERGQNIRKKNVTIVGESGLFTPDDIAYVQEAGVKAVLVGESIVKQSDPTKGI
        L  LVEVHDE+EMDR+L IDGIELIGINNRNLETFEVDISNTK+LLEGE GQ IR+K++ +VGESGLFTPD + YVQEAGVKA    E  V+ +      
Subjt:  LTPLVEVHDEKEMDRMLAIDGIELIGINNRNLETFEVDISNTKRLLEGERGQNIRKKNVTIVGESGLFTPDDIAYVQEAGVKAVLVGESIVKQSDPTKGI

Query:  TGLFEKLPLELASISPAMFSAGAAASGELEFR-WDDDAWYNVTVKLEG---DDLRISYCEFDKEHDNVFHANHFRSLSELSDFEARFRPLSRQLQDSECP
        +G    L               A    + EFR + DDAWY+V + LEG   + LR+ Y EF    DNVF A++F+S  EL DF  RFR +S QLQD  C 
Subjt:  TGLFEKLPLELASISPAMFSAGAAASGELEFR-WDDDAWYNVTVKLEG---DDLRISYCEFDKEHDNVFHANHFRSLSELSDFEARFRPLSRQLQDSECP

Query:  NVVPGMPVCASHSSRADDVRFYDALVEGVDYLEHSYANGEEECLCNFILLWQHGPNSGNLTFASIANLCQIQFDEINDTVLATFFAKVREKIQTRMN-RG
         VV GM VCAS S  A +V FYDA+V+ V   +HS  NG+EEC C F+L W HGPN GNLT   +A++C +Q  E++  ++      ++  ++   +   
Subjt:  NVVPGMPVCASHSSRADDVRFYDALVEGVDYLEHSYANGEEECLCNFILLWQHGPNSGNLTFASIANLCQIQFDEINDTVLATFFAKVREKIQTRMN-RG

Query:  GTCSEDRLLTHNDGGAHQKDECSLKLKRRLSFFERMDQVDTRRAKRSSGAVGLWEDQQTLSSRKSGVIEQDTDIGGVKYQFMILLENLDKELSPVKLAKF
        GT S+D +             C++  + R +    +     +          +W  Q      +     QDTD+GG K  ++IL++NL+K+LS   ++KF
Subjt:  GTCSEDRLLTHNDGGAHQKDECSLKLKRRLSFFERMDQVDTRRAKRSSGAVGLWEDQQTLSSRKSGVIEQDTDIGGVKYQFMILLENLDKELSPVKLAKF

Query:  LHAQTLILPRVYIFPSLTFEAYAKGAVVLNCRKNLERLCDFLDNPDHVILSSQG---------RPLVVTGRIAGRETFGTLAAGAMVLDSENKFGNEKDG
        +H QT I  +VYIFPSL +E Y  G + ++C+K++E+L  FL +P+   +SS G         RPLV T +++  + +       ++L S NK  N ++G
Subjt:  LHAQTLILPRVYIFPSLTFEAYAKGAVVLNCRKNLERLCDFLDNPDHVILSSQG---------RPLVVTGRIAGRETFGTLAAGAMVLDSENKFGNEKDG

Query:  RVRCELKVVKVGTNEYLTAKHMKELFMEFLNHQRRLHQRLAMEEGKI
            ELKVV  GT EY  AK +++LF++F++HQ+ L+++L  EE  I
Subjt:  RVRCELKVVKVGTNEYLTAKHMKELFMEFLNHQRRLHQRLAMEEGKI

XP_008466441.1 PREDICTED: uncharacterized protein LOC103503848 isoform X1 [Cucumis melo]4.2e-20083.41Show/hide
Query:  MFSAGAAASGELEFRWDDDAWYNVTVKLEGDDLRISYCEFDKEHDNVFHANHFRSLSELSDFEARFRPLSRQLQDSECPNVVPGMPVCASHSSRADDVRF
        MFSAG A+SG+LEF  DDDAWYN  VKL+G  LR+SYCEF +EHDNVF A+HF+SLSELS FEARFRP+SRQLQDSECPNV PGMPVCAS+SSRADDVRF
Subjt:  MFSAGAAASGELEFRWDDDAWYNVTVKLEGDDLRISYCEFDKEHDNVFHANHFRSLSELSDFEARFRPLSRQLQDSECPNVVPGMPVCASHSSRADDVRF

Query:  YDALVEGVDYLEHSYANGEEECLCNFILLWQHGPNSGNLTFASIANLCQIQFDEINDTVLATFFAKVREKIQTRMNRGGTCSEDRLLTHNDGGAHQKDEC
        YDALVEGVDYLEHSYANGEEECLCNFILLWQ GPNSGNLT ASIAN+CQIQFD+INDTVLATFF KVREKI+TR NRG  CSED   THN GGA QKD+C
Subjt:  YDALVEGVDYLEHSYANGEEECLCNFILLWQHGPNSGNLTFASIANLCQIQFDEINDTVLATFFAKVREKIQTRMNRGGTCSEDRLLTHNDGGAHQKDEC

Query:  SLKLKRRLSFFERMDQVDTRRAKRSSGAVGLWEDQQTLSSRKSGVIEQDTDIGGVKYQFMILLENLDKELSPVKLAKFLHAQTLILPRVYIFPSLTFEAY
        SLKLK RLSFFERMDQ +TRRAKRSSG V  WEDQ +LSSRK  VIEQDTDIGG+KYQ+MILLENLDK L+P+KLAKFL+ +TLILPRVYIFPSLTFE Y
Subjt:  SLKLKRRLSFFERMDQVDTRRAKRSSGAVGLWEDQQTLSSRKSGVIEQDTDIGGVKYQFMILLENLDKELSPVKLAKFLHAQTLILPRVYIFPSLTFEAY

Query:  AKGAVVLNCRKNLERLCDFLDNPDHVILSSQGRPLVVTGRIAGRETFGTLAAGAMVLDSENKFGNEKDGRVRCELKVVKVGTNEYLTAKHMKELFMEFLN
        A+GAVV+NCRKNL+RL DFLD+PDHVILSSQGRPLVVTG+IA  ETFGTLAAGAMVLDSENKFGNEKDGR  CELKVVKVGT+EYLTAKHMKELF+EFL 
Subjt:  AKGAVVLNCRKNLERLCDFLDNPDHVILSSQGRPLVVTGRIAGRETFGTLAAGAMVLDSENKFGNEKDGRVRCELKVVKVGTNEYLTAKHMKELFMEFLN

Query:  HQRRLHQRLAMEEGKIYCNGAL
        HQR L QRLAMEE KIYCNGAL
Subjt:  HQRRLHQRLAMEEGKIYCNGAL

XP_038888477.1 uncharacterized protein LOC120078315 isoform X1 [Benincasa hispida]4.5e-21890.05Show/hide
Query:  MFSAGAAASGELEFRWDDDAWYNVTVKLEGDDLRISYCEFDKEHDNVFHANHFRSLSELSDFEARFRPLSRQLQDSECPNVVPGMPVCASHSSRADDVRF
        MFSAG AASG++EF+ DDDAWYNVTVKLEGDDLRISYCEF +EHDNVF+ANHFRSLSELS+FEARFRPLSRQLQDSECPNV PGMPVCAS+SS+ADDVRF
Subjt:  MFSAGAAASGELEFRWDDDAWYNVTVKLEGDDLRISYCEFDKEHDNVFHANHFRSLSELSDFEARFRPLSRQLQDSECPNVVPGMPVCASHSSRADDVRF

Query:  YDALVEGVDYLEHSYANGEEECLCNFILLWQHGPNSGNLTFASIANLCQIQFDEINDTVLATFFAKVREKIQTRMNRGGTCSEDRLLTHNDGGAHQKDEC
        YDALVEGVDYLEHSYANGEEECLCNFILLWQHGPNSGNLT ASIAN+CQIQFDEINDTVLATFFAKVREKI TRMNRG TCSEDRLLTHN GG HQKDEC
Subjt:  YDALVEGVDYLEHSYANGEEECLCNFILLWQHGPNSGNLTFASIANLCQIQFDEINDTVLATFFAKVREKIQTRMNRGGTCSEDRLLTHNDGGAHQKDEC

Query:  SLKLKRRLSFFERMDQVDTRRAKRSSGAVGLWEDQQTLSSRKSGVIEQDTDIGGVKYQFMILLENLDKELSPVKLAKFLHAQTLILPRVYIFPSLTFEAY
        SLKLKRRLSFFERMD  DTRRAKRSS  +  WEDQQ LSSRKS VIE DTDIGG+KYQ+MILLENLDK LSPVK+AKFLHAQTLILPRVYIFPSLTFE+Y
Subjt:  SLKLKRRLSFFERMDQVDTRRAKRSSGAVGLWEDQQTLSSRKSGVIEQDTDIGGVKYQFMILLENLDKELSPVKLAKFLHAQTLILPRVYIFPSLTFEAY

Query:  AKGAVVLNCRKNLERLCDFLDNPDHVILSSQGRPLVVTGRIAGRETFGTLAAGAMVLDSENKFGNEKDGRVRCELKVVKVGTNEYLTAKHMKELFMEFLN
        A+GAVVLNCRKNLERLCDFLDNPDHVILSSQGRPLVVTGRIA  ETFGTL AGAMVLDSENKFGNEKDG V CELKVVKVGT+EYLTAKHMKELFMEFL+
Subjt:  AKGAVVLNCRKNLERLCDFLDNPDHVILSSQGRPLVVTGRIAGRETFGTLAAGAMVLDSENKFGNEKDGRVRCELKVVKVGTNEYLTAKHMKELFMEFLN

Query:  HQRRLHQRLAMEEGKIYCNGAL
        HQRRLHQRLAMEE KIYCNGAL
Subjt:  HQRRLHQRLAMEEGKIYCNGAL

TrEMBL top hitse value%identityAlignment
A0A1S3CR98 uncharacterized protein LOC103503848 isoform X12.0e-20083.41Show/hide
Query:  MFSAGAAASGELEFRWDDDAWYNVTVKLEGDDLRISYCEFDKEHDNVFHANHFRSLSELSDFEARFRPLSRQLQDSECPNVVPGMPVCASHSSRADDVRF
        MFSAG A+SG+LEF  DDDAWYN  VKL+G  LR+SYCEF +EHDNVF A+HF+SLSELS FEARFRP+SRQLQDSECPNV PGMPVCAS+SSRADDVRF
Subjt:  MFSAGAAASGELEFRWDDDAWYNVTVKLEGDDLRISYCEFDKEHDNVFHANHFRSLSELSDFEARFRPLSRQLQDSECPNVVPGMPVCASHSSRADDVRF

Query:  YDALVEGVDYLEHSYANGEEECLCNFILLWQHGPNSGNLTFASIANLCQIQFDEINDTVLATFFAKVREKIQTRMNRGGTCSEDRLLTHNDGGAHQKDEC
        YDALVEGVDYLEHSYANGEEECLCNFILLWQ GPNSGNLT ASIAN+CQIQFD+INDTVLATFF KVREKI+TR NRG  CSED   THN GGA QKD+C
Subjt:  YDALVEGVDYLEHSYANGEEECLCNFILLWQHGPNSGNLTFASIANLCQIQFDEINDTVLATFFAKVREKIQTRMNRGGTCSEDRLLTHNDGGAHQKDEC

Query:  SLKLKRRLSFFERMDQVDTRRAKRSSGAVGLWEDQQTLSSRKSGVIEQDTDIGGVKYQFMILLENLDKELSPVKLAKFLHAQTLILPRVYIFPSLTFEAY
        SLKLK RLSFFERMDQ +TRRAKRSSG V  WEDQ +LSSRK  VIEQDTDIGG+KYQ+MILLENLDK L+P+KLAKFL+ +TLILPRVYIFPSLTFE Y
Subjt:  SLKLKRRLSFFERMDQVDTRRAKRSSGAVGLWEDQQTLSSRKSGVIEQDTDIGGVKYQFMILLENLDKELSPVKLAKFLHAQTLILPRVYIFPSLTFEAY

Query:  AKGAVVLNCRKNLERLCDFLDNPDHVILSSQGRPLVVTGRIAGRETFGTLAAGAMVLDSENKFGNEKDGRVRCELKVVKVGTNEYLTAKHMKELFMEFLN
        A+GAVV+NCRKNL+RL DFLD+PDHVILSSQGRPLVVTG+IA  ETFGTLAAGAMVLDSENKFGNEKDGR  CELKVVKVGT+EYLTAKHMKELF+EFL 
Subjt:  AKGAVVLNCRKNLERLCDFLDNPDHVILSSQGRPLVVTGRIAGRETFGTLAAGAMVLDSENKFGNEKDGRVRCELKVVKVGTNEYLTAKHMKELFMEFLN

Query:  HQRRLHQRLAMEEGKIYCNGAL
        HQR L QRLAMEE KIYCNGAL
Subjt:  HQRRLHQRLAMEEGKIYCNGAL

A0A200RAM3 Indole-3-glycerol-phosphate synthase5.0e-19950.06Show/hide
Query:  MEGLASLRTIPKVSFPPISS--STRRSKFLSRRLNFSPSMDTSLRNSISLASSSSSSPISTAIRAQQIESEAGSAAASPVTELEENALKVKEWEVGMFQD
        ME  +   T P+VS P IS+   T+RS  + R +N    + T  R SI+L            IRAQQ E + GS   SP ++  +NALK+KEWEVG FQ+
Subjt:  MEGLASLRTIPKVSFPPISS--STRRSKFLSRRLNFSPSMDTSLRNSISLASSSSSSPISTAIRAQQIESEAGSAAASPVTELEENALKVKEWEVGMFQD

Query:  EVAASQGIRIRRRPPTGPPLHY--------------------------------MKERRPLISLKRDLERAPPARDFLGALKAAYLRTNLPGLIAEVKKA
        E+AA+QGIRIRRRPPTGPPLHY                                +KERRPL SLK+ L+ APP RDF+GAL+ ++LRT LP LIAEVKKA
Subjt:  EVAASQGIRIRRRPPTGPPLHY--------------------------------MKERRPLISLKRDLERAPPARDFLGALKAAYLRTNLPGLIAEVKKA

Query:  SPSRGVLREDFDPVEIAQAYEKGGAACLSVLTDQKFFQGSFENLEKIRNAGVKCPLLCKEFVVDAWQIYYARSKGADAILLIAAVLPDLDIKYMTKICKM
        SPSRGVLREDFDPV+IAQAYEKGGAACLSVLTD+K+FQGSFENLEKIR AG++CPLLCKEF++DAWQIYYAR+KGADAILLIAAVLPDLDI+YMTKICK+
Subjt:  SPSRGVLREDFDPVEIAQAYEKGGAACLSVLTDQKFFQGSFENLEKIRNAGVKCPLLCKEFVVDAWQIYYARSKGADAILLIAAVLPDLDIKYMTKICKM

Query:  VGLTPLVEVHDEKEMDRMLAIDGIELIGINNRNLETFEVDISNTKRLLEGERGQNIRKKNVTIVGESGLFTPDDIAYVQEAGVKAVLVGESIVKQSDPTK
        +GL  LVEVHDE+EMDR+L IDGIELIGINNRNLETFEVDISNTK+LLEGERG+ IR+K++ +VGESGLFTPDDIAYVQEAGVKAVLVGESIVKQ+DP K
Subjt:  VGLTPLVEVHDEKEMDRMLAIDGIELIGINNRNLETFEVDISNTKRLLEGERGQNIRKKNVTIVGESGLFTPDDIAYVQEAGVKAVLVGESIVKQSDPTK

Query:  GITGLFEKLPLELASISPAMFSAGAAASGE-----LEFRWDDDAWYNVTVKLEGDDLRISYCEFDKEHDNVFHANHFRSLSELSDFEARFRPLSRQLQDS
        GI+GLFE       S+S    S G   + +     L+FR  DDAWY + + LE + L + Y  F +E++  ++A  F++L ++  FE +FRP S QLQD+
Subjt:  GITGLFEKLPLELASISPAMFSAGAAASGE-----LEFRWDDDAWYNVTVKLEGDDLRISYCEFDKEHDNVFHANHFRSLSELSDFEARFRPLSRQLQDS

Query:  ECPNVVPGMPVCASHSSRADDVRFYDALVEGVDYLEHSYANGEEECLCNFILLWQHGPNSGNLTFASIANLCQIQFDEIN-DTVLATFFAKVREKIQTRM
        EC  +V GM V AS+     D++FY+A+++ + Y  H +  GEEEC C+F++ W HGP  G+     I ++C++       D  LA+F    REK++   
Subjt:  ECPNVVPGMPVCASHSSRADDVRFYDALVEGVDYLEHSYANGEEECLCNFILLWQHGPNSGNLTFASIANLCQIQFDEIN-DTVLATFFAKVREKIQTRM

Query:  NRGGTCSEDRLLTHNDGGAHQKDECSLKLKRRLSFFERMDQVDTRRAKRSSGAVGLWEDQQTLSSRKSGVIEQDTDIGGVKYQ----------FMILLEN
                  L +HN G    +D+CSL                              E    +  +K+G   QD D+GG                IL++N
Subjt:  NRGGTCSEDRLLTHNDGGAHQKDECSLKLKRRLSFFERMDQVDTRRAKRSSGAVGLWEDQQTLSSRKSGVIEQDTDIGGVKYQ----------FMILLEN

Query:  LDKELSPVKLAKFLHAQTLILPRVYIFPSLTFEAYAKGAVVLNCRKNLERLCDFLDNPDHVILSSQGRPLVVTGRIAGRETFGTL
        L+K+L+P  +  F+H    I  + Y+FPSL  E Y +G +V++ ++ L++L DFL NP H+I+SS GRP V+T       TFG L
Subjt:  LDKELSPVKLAKFLHAQTLILPRVYIFPSLTFEAYAKGAVVLNCRKNLERLCDFLDNPDHVILSSQGRPLVVTGRIAGRETFGTL

A0A5D2X7G9 Indole-3-glycerol-phosphate synthase1.5e-20349.76Show/hide
Query:  MEGLASLRTIPKVSFPPISSSTRRSKFLSRRLNFSPSMDTSLRNSISLASSSSSSPISTAIRAQQIESEAGSAAASPVTELEENALKVKEWEVGMFQDEV
        MEGL SL++ P+ S     S  +     +RR     SMD  LR         SS P   AIRAQQ          SP  E EE+ALKVKEWEVGMFQ+EV
Subjt:  MEGLASLRTIPKVSFPPISSSTRRSKFLSRRLNFSPSMDTSLRNSISLASSSSSSPISTAIRAQQIESEAGSAAASPVTELEENALKVKEWEVGMFQDEV

Query:  AASQGIRIRRRPPTGPPLHY--------------------------------MKERRPLISLKRDLERAPPARDFLGALKAAYLRTNLPGLIAEVKKASP
        AASQGIRIRRRPPT PPLHY                                MKER+PL +LK+ +E AP  RDF+GALKAA+ RT LPGLIAEVKKASP
Subjt:  AASQGIRIRRRPPTGPPLHY--------------------------------MKERRPLISLKRDLERAPPARDFLGALKAAYLRTNLPGLIAEVKKASP

Query:  SRGVLREDFDPVEIAQAYEKGGAACLSVLTDQKFFQGSFENLEKIRNAGVKCPLLCKEFVVDAWQIYYARSKGADAILLIAAVLPDLDIKYMTKICKMVG
        SRG+LREDFDPVEIA+AYEKGGAACLSVLTD+KFF+GSFENLE IRNAGV+CPLLCKEFV+DAWQIYYAR KGADAILLIAAVLPDLDI+YM KICKM+G
Subjt:  SRGVLREDFDPVEIAQAYEKGGAACLSVLTDQKFFQGSFENLEKIRNAGVKCPLLCKEFVVDAWQIYYARSKGADAILLIAAVLPDLDIKYMTKICKMVG

Query:  LTPLVEVHDEKEMDRMLAIDGIELIGINNRNLETFEVDISNTKRLLEGERGQNIRKKNVTIVGESGLFTPDDIAYVQEAGVKAVLVGESIVKQSDPTKGI
        L  LVEVHDE+EMDR+L IDGIELIGINNRNLETFEVDISNTK+LLEGE GQ IR+K++ +VGESGLFTPD + YVQEAGVKA    E  V+ +      
Subjt:  LTPLVEVHDEKEMDRMLAIDGIELIGINNRNLETFEVDISNTKRLLEGERGQNIRKKNVTIVGESGLFTPDDIAYVQEAGVKAVLVGESIVKQSDPTKGI

Query:  TGLFEKLPLELASISPAMFSAGAAASGELEFR-WDDDAWYNVTVKLEG---DDLRISYCEFDKEHDNVFHANHFRSLSELSDFEARFRPLSRQLQDSECP
        +G    L               A    + EFR + DDAWY+V + LEG   + LR+ Y EF    DNVF A++F+S  EL DF  RFR +S QLQD  C 
Subjt:  TGLFEKLPLELASISPAMFSAGAAASGELEFR-WDDDAWYNVTVKLEG---DDLRISYCEFDKEHDNVFHANHFRSLSELSDFEARFRPLSRQLQDSECP

Query:  NVVPGMPVCASHSSRADDVRFYDALVEGVDYLEHSYANGEEECLCNFILLWQHGPNSGNLTFASIANLCQIQFDEINDTVLATFFAKVREKIQTRMN-RG
         VV GM VCAS S  A +V FYDA+V+ V   +HS  NG+EEC C F+L W HGPN GNLT   +A++C +Q  E++  ++      ++  ++   +   
Subjt:  NVVPGMPVCASHSSRADDVRFYDALVEGVDYLEHSYANGEEECLCNFILLWQHGPNSGNLTFASIANLCQIQFDEINDTVLATFFAKVREKIQTRMN-RG

Query:  GTCSEDRLLTHNDGGAHQKDECSLKLKRRLSFFERMDQVDTRRAKRSSGAVGLWEDQQTLSSRKSGVIEQDTDIGGVKYQFMILLENLDKELSPVKLAKF
        GT S+D +             C++  + R +    +     +          +W  Q      +     QDTD+GG K  ++IL++NL+K+LS   ++KF
Subjt:  GTCSEDRLLTHNDGGAHQKDECSLKLKRRLSFFERMDQVDTRRAKRSSGAVGLWEDQQTLSSRKSGVIEQDTDIGGVKYQFMILLENLDKELSPVKLAKF

Query:  LHAQTLILPRVYIFPSLTFEAYAKGAVVLNCRKNLERLCDFLDNPDHVILSSQGRPLVVTGRIAGRETFGTLAAGAMVLDSENKFGNEKDGRVRCELKVV
        +H QT I  +VYIFPSL +E Y  G + ++C+K++E+L  FL +P+   +SS GRPLV T +++  + +       ++L S NK  N ++G    ELKVV
Subjt:  LHAQTLILPRVYIFPSLTFEAYAKGAVVLNCRKNLERLCDFLDNPDHVILSSQGRPLVVTGRIAGRETFGTLAAGAMVLDSENKFGNEKDGRVRCELKVV

Query:  KVGTNEYLTAKHMKELFMEFLNHQRRLHQRLAMEEGKI
          GT EY  AK +++LF++F++HQ+ L+++L  EE  I
Subjt:  KVGTNEYLTAKHMKELFMEFLNHQRRLHQRLAMEEGKI

A0A5D2X7H3 Indole-3-glycerol-phosphate synthase3.2e-20149.23Show/hide
Query:  MEGLASLRTIPKVSFPPISSSTRRSKFLSRRLNFSPSMDTSLRNSISLASSSSSSPISTAIRAQQIESEAGSAAASPVTELEENALKVKEWEVGMFQDEV
        MEGL SL++ P+ S     S  +     +RR     SMD  LR         SS P   AIRAQQ          SP  E EE+ALKVKEWEVGMFQ+EV
Subjt:  MEGLASLRTIPKVSFPPISSSTRRSKFLSRRLNFSPSMDTSLRNSISLASSSSSSPISTAIRAQQIESEAGSAAASPVTELEENALKVKEWEVGMFQDEV

Query:  AASQGIRIRRRPPTGPPLHY--------------------------------MKERRPLISLKRDLERAPPARDFLGALKAAYLRTNLPGLIAEVKKASP
        AASQGIRIRRRPPT PPLHY                                MKER+PL +LK+ +E AP  RDF+GALKAA+ RT LPGLIAEVKKASP
Subjt:  AASQGIRIRRRPPTGPPLHY--------------------------------MKERRPLISLKRDLERAPPARDFLGALKAAYLRTNLPGLIAEVKKASP

Query:  SRGVLREDFDPVEIAQAYEKGGAACLSVLTDQKFFQGSFENLEKIRNAGVKCPLLCKEFVVDAWQIYYARSKGADAILLIAAVLPDLDIKYMTKICKMVG
        SRG+LREDFDPVEIA+AYEKGGAACLSVLTD+KFF+GSFENLE IRNAGV+CPLLCKEFV+DAWQIYYAR KGADAILLIAAVLPDLDI+YM KICKM+G
Subjt:  SRGVLREDFDPVEIAQAYEKGGAACLSVLTDQKFFQGSFENLEKIRNAGVKCPLLCKEFVVDAWQIYYARSKGADAILLIAAVLPDLDIKYMTKICKMVG

Query:  LTPLVEVHDEKEMDRMLAIDGIELIGINNRNLETFEVDISNTKRLLEGERGQNIRKKNVTIVGESGLFTPDDIAYVQEAGVKAVLVGESIVKQSDPTKGI
        L  LVEVHDE+EMDR+L IDGIELIGINNRNLETFEVDISNTK+LLEGE GQ IR+K++ +VGESGLFTPD + YVQEAGVKA    E  V+ +      
Subjt:  LTPLVEVHDEKEMDRMLAIDGIELIGINNRNLETFEVDISNTKRLLEGERGQNIRKKNVTIVGESGLFTPDDIAYVQEAGVKAVLVGESIVKQSDPTKGI

Query:  TGLFEKLPLELASISPAMFSAGAAASGELEFR-WDDDAWYNVTVKLEG---DDLRISYCEFDKEHDNVFHANHFRSLSELSDFEARFRPLSRQLQDSECP
        +G    L               A    + EFR + DDAWY+V + LEG   + LR+ Y EF    DNVF A++F+S  EL DF  RFR +S QLQD  C 
Subjt:  TGLFEKLPLELASISPAMFSAGAAASGELEFR-WDDDAWYNVTVKLEG---DDLRISYCEFDKEHDNVFHANHFRSLSELSDFEARFRPLSRQLQDSECP

Query:  NVVPGMPVCASHSSRADDVRFYDALVEGVDYLEHSYANGEEECLCNFILLWQHGPNSGNLTFASIANLCQIQFDEINDTVLATFFAKVREKIQTRMN-RG
         VV GM VCAS S  A +V FYDA+V+ V   +HS  NG+EEC C F+L W HGPN GNLT   +A++C +Q  E++  ++      ++  ++   +   
Subjt:  NVVPGMPVCASHSSRADDVRFYDALVEGVDYLEHSYANGEEECLCNFILLWQHGPNSGNLTFASIANLCQIQFDEINDTVLATFFAKVREKIQTRMN-RG

Query:  GTCSEDRLLTHNDGGAHQKDECSLKLKRRLSFFERMDQVDTRRAKRSSGAVGLWEDQQTLSSRKSGVIEQDTDIGGVKYQFMILLENLDKELSPVKLAKF
        GT S+D +             C++  + R +    +     +          +W  Q      +     QDTD+GG K  ++IL++NL+K+LS   ++KF
Subjt:  GTCSEDRLLTHNDGGAHQKDECSLKLKRRLSFFERMDQVDTRRAKRSSGAVGLWEDQQTLSSRKSGVIEQDTDIGGVKYQFMILLENLDKELSPVKLAKF

Query:  LHAQTLILPRVYIFPSLTFEAYAKGAVVLNCRKNLERLCDFLDNPDHVILSSQG---------RPLVVTGRIAGRETFGTLAAGAMVLDSENKFGNEKDG
        +H QT I  +VYIFPSL +E Y  G + ++C+K++E+L  FL +P+   +SS G         RPLV T +++  + +       ++L S NK  N ++G
Subjt:  LHAQTLILPRVYIFPSLTFEAYAKGAVVLNCRKNLERLCDFLDNPDHVILSSQG---------RPLVVTGRIAGRETFGTLAAGAMVLDSENKFGNEKDG

Query:  RVRCELKVVKVGTNEYLTAKHMKELFMEFLNHQRRLHQRLAMEEGKI
            ELKVV  GT EY  AK +++LF++F++HQ+ L+++L  EE  I
Subjt:  RVRCELKVVKVGTNEYLTAKHMKELFMEFLNHQRRLHQRLAMEEGKI

A0A5D3E5V0 SAWADEE domain-containing protein2.0e-20083.41Show/hide
Query:  MFSAGAAASGELEFRWDDDAWYNVTVKLEGDDLRISYCEFDKEHDNVFHANHFRSLSELSDFEARFRPLSRQLQDSECPNVVPGMPVCASHSSRADDVRF
        MFSAG A+SG+LEF  DDDAWYN  VKL+G  LR+SYCEF +EHDNVF A+HF+SLSELS FEARFRP+SRQLQDSECPNV PGMPVCAS+SSRADDVRF
Subjt:  MFSAGAAASGELEFRWDDDAWYNVTVKLEGDDLRISYCEFDKEHDNVFHANHFRSLSELSDFEARFRPLSRQLQDSECPNVVPGMPVCASHSSRADDVRF

Query:  YDALVEGVDYLEHSYANGEEECLCNFILLWQHGPNSGNLTFASIANLCQIQFDEINDTVLATFFAKVREKIQTRMNRGGTCSEDRLLTHNDGGAHQKDEC
        YDALVEGVDYLEHSYANGEEECLCNFILLWQ GPNSGNLT ASIAN+CQIQFD+INDTVLATFF KVREKI+TR NRG  CSED   THN GGA QKD+C
Subjt:  YDALVEGVDYLEHSYANGEEECLCNFILLWQHGPNSGNLTFASIANLCQIQFDEINDTVLATFFAKVREKIQTRMNRGGTCSEDRLLTHNDGGAHQKDEC

Query:  SLKLKRRLSFFERMDQVDTRRAKRSSGAVGLWEDQQTLSSRKSGVIEQDTDIGGVKYQFMILLENLDKELSPVKLAKFLHAQTLILPRVYIFPSLTFEAY
        SLKLK RLSFFERMDQ +TRRAKRSSG V  WEDQ +LSSRK  VIEQDTDIGG+KYQ+MILLENLDK L+P+KLAKFL+ +TLILPRVYIFPSLTFE Y
Subjt:  SLKLKRRLSFFERMDQVDTRRAKRSSGAVGLWEDQQTLSSRKSGVIEQDTDIGGVKYQFMILLENLDKELSPVKLAKFLHAQTLILPRVYIFPSLTFEAY

Query:  AKGAVVLNCRKNLERLCDFLDNPDHVILSSQGRPLVVTGRIAGRETFGTLAAGAMVLDSENKFGNEKDGRVRCELKVVKVGTNEYLTAKHMKELFMEFLN
        A+GAVV+NCRKNL+RL DFLD+PDHVILSSQGRPLVVTG+IA  ETFGTLAAGAMVLDSENKFGNEKDGR  CELKVVKVGT+EYLTAKHMKELF+EFL 
Subjt:  AKGAVVLNCRKNLERLCDFLDNPDHVILSSQGRPLVVTGRIAGRETFGTLAAGAMVLDSENKFGNEKDGRVRCELKVVKVGTNEYLTAKHMKELFMEFLN

Query:  HQRRLHQRLAMEEGKIYCNGAL
        HQR L QRLAMEE KIYCNGAL
Subjt:  HQRRLHQRLAMEEGKIYCNGAL

SwissProt top hitse value%identityAlignment
B0JTM2 Indole-3-glycerol phosphate synthase1.0e-7961.18Show/hide
Query:  MKERRPLISLKRDLERAPPARDFLGALKAAYLRTNLPGLIAEVKKASPSRGVLREDFDPVEIAQAYEKGGAACLSVLTDQKFFQGSFENLEKIRNAGVKC
        ++ER PL+ L++ +    P  DFL ALK    +   P LIAEVKKASPS+GV+ EDFDPV IA+ YE+GGA CLSVLTD KFFQGS+ENL  +R A V  
Subjt:  MKERRPLISLKRDLERAPPARDFLGALKAAYLRTNLPGLIAEVKKASPSRGVLREDFDPVEIAQAYEKGGAACLSVLTDQKFFQGSFENLEKIRNAGVKC

Query:  PLLCKEFVVDAWQIYYARSKGADAILLIAAVLPDLDIKYMTKICKMVGLTPLVEVHDEKEMDRMLAIDGIELIGINNRNLETFEVDISNTKRLLEGERGQ
        PLLCKEF++  +QIYYARSKGADA+LLIAA+L D D+ Y  KI K +G+T LVEVH   E DR+LAI+GIELIGINNRNLETF VD+ NT++LLE  RG+
Subjt:  PLLCKEFVVDAWQIYYARSKGADAILLIAAVLPDLDIKYMTKICKMVGLTPLVEVHDEKEMDRMLAIDGIELIGINNRNLETFEVDISNTKRLLEGERGQ

Query:  NIRKKNVTIVGESGLFTPDDIAYVQEAGVKAVLVGESIVKQSDPTKGITGLFEKL
         +R+K + IV ESGL T  D+A V++AG  AVL+GES+VK  DP  GI  LFE L
Subjt:  NIRKKNVTIVGESGLFTPDDIAYVQEAGVKAVLVGESIVKQSDPTKGITGLFEKL

B1WQE4 Indole-3-glycerol phosphate synthase1.8e-7658.33Show/hide
Query:  MKERRPLISLKRDLERAPPARDFLGALKAAYLRTNLPGLIAEVKKASPSRGVLREDFDPVEIAQAYEKGGAACLSVLTDQKFFQGSFENLEKIRNAGVKC
        M++R  L+ L++  + APPA+DFLGA+     +   P LIAEVKKASPS+GV+REDF+PV IAQAY +GGA+CLSVLTD KFFQGSF+NL  +R A V  
Subjt:  MKERRPLISLKRDLERAPPARDFLGALKAAYLRTNLPGLIAEVKKASPSRGVLREDFDPVEIAQAYEKGGAACLSVLTDQKFFQGSFENLEKIRNAGVKC

Query:  PLLCKEFVVDAWQIYYARSKGADAILLIAAVLPDLDIKYMTKICKMVGLTPLVEVHDEKEMDRMLAIDGIELIGINNRNLETFEVDISNTKRLLEGERGQ
        PLLCKEF++  +QIY AR KGADAILLIAA+L D D++Y+ KI   +G+TPLVEVH   E+DR+LAI+G+ L+GINNRNLETFEV +  T  L+   R  
Subjt:  PLLCKEFVVDAWQIYYARSKGADAILLIAAVLPDLDIKYMTKICKMVGLTPLVEVHDEKEMDRMLAIDGIELIGINNRNLETFEVDISNTKRLLEGERGQ

Query:  NIRKKNVTIVGESGLFTPDDIAYVQEAGVKAVLVGESIVKQSDPTKGITGLF
         I+++ + IV ESG+ TP  +  V EAG  AVL+GES+VKQ DPT+ I  LF
Subjt:  NIRKKNVTIVGESGLFTPDDIAYVQEAGVKAVLVGESIVKQSDPTKGITGLF

B7K0H0 Indole-3-glycerol phosphate synthase3.3e-7854.45Show/hide
Query:  IRIRRRPPTGPPLHY--------------MKERRPLISLKRDLERAPPARDFLGALKAAYLRTNLPGLIAEVKKASPSRGVLREDFDPVEIAQAYEKGGA
        +R + + P G P H               ++E  PL+ L++ ++  PP +DFLGA+     +   P LIAEVKKASPS+GV+REDFDPV IAQAY KGGA
Subjt:  IRIRRRPPTGPPLHY--------------MKERRPLISLKRDLERAPPARDFLGALKAAYLRTNLPGLIAEVKKASPSRGVLREDFDPVEIAQAYEKGGA

Query:  ACLSVLTDQKFFQGSFENLEKIRNAGVKCPLLCKEFVVDAWQIYYARSKGADAILLIAAVLPDLDIKYMTKICKMVGLTPLVEVHDEKEMDRMLAIDGIE
        +CLSVLTD KFFQGSFENL  +R + V  PLLCKEF++  +QIY AR+KGADA+LLIAA+L D D+ Y  KI + +G+T L+EVH   E+DR+LAI+G+ 
Subjt:  ACLSVLTDQKFFQGSFENLEKIRNAGVKCPLLCKEFVVDAWQIYYARSKGADAILLIAAVLPDLDIKYMTKICKMVGLTPLVEVHDEKEMDRMLAIDGIE

Query:  LIGINNRNLETFEVDISNTKRLLEGERGQNIRKKNVTIVGESGLFTPDDIAYVQEAGVKAVLVGESIVKQSDPTKGITGLF
        LIGINNRNLETFEVD+  T +LL   R   I+   + I+ ESGL TPDD+ +VQ+AG   VL+GES+VKQ DPT+ I  LF
Subjt:  LIGINNRNLETFEVDISNTKRLLEGERGQNIRKKNVTIVGESGLFTPDDIAYVQEAGVKAVLVGESIVKQSDPTKGITGLF

P49572 Indole-3-glycerol phosphate synthase, chloroplastic3.1e-12159.11Show/hide
Query:  MEGLASLRTIP-KVSFPPISSSTRRSKFLSRRLNFSPSMDTSLRNSISLASSSSSSPISTAIRAQQIESEAGSAAASPVTELEENALKVKEWEVGMFQDE
        MEGL  ++ +P KV+ P +            R N S S+  S+         +  +P   +IRAQQ + +   A +S   E + N L++KEWEV M+Q+E
Subjt:  MEGLASLRTIP-KVSFPPISSSTRRSKFLSRRLNFSPSMDTSLRNSISLASSSSSSPISTAIRAQQIESEAGSAAASPVTELEENALKVKEWEVGMFQDE

Query:  VAASQGIRIRRRPPTGPPLHY---------------------------------MKERRPLISLKRDLERAPPARDFLGALKAAYLRTNLPGLIAEVKKA
        +A SQGIRIRR+PP+  PL Y                                 MKE  PL  LK+ +E APP RDF+GAL+ A+ RT  PGLIAEVKKA
Subjt:  VAASQGIRIRRRPPTGPPLHY---------------------------------MKERRPLISLKRDLERAPPARDFLGALKAAYLRTNLPGLIAEVKKA

Query:  SPSRGVLREDFDPVEIAQAYEKGGAACLSVLTDQKFFQGSFENLEKIRNAGVKCPLLCKEFVVDAWQIYYARSKGADAILLIAAVLPDLDIKYMTKICKM
        SPSRG+L+E+FDPVEIAQAYEKGGAACLSVLTDQK+FQG FENLE IR+AGVKCPLLCKEFVVD WQIYYAR+KGADA+LLIAAVL DL+I ++ KICK 
Subjt:  SPSRGVLREDFDPVEIAQAYEKGGAACLSVLTDQKFFQGSFENLEKIRNAGVKCPLLCKEFVVDAWQIYYARSKGADAILLIAAVLPDLDIKYMTKICKM

Query:  VGLTPLVEVHDEKEMDRMLAIDGIELIGINNRNLETFEVDISNTKRLLEGERGQNIRKKNVTIVGESGLFTPDDIAYVQEAGVKAVLVGESIVKQSDPTK
        + L  LVEVHDE+EM R+L I+GIEL+GINNR+LETFEVDISNTK+LLEGE G+ IR++++ +VGESGLFTPDDIAYVQ AGVKAVLVGESIVKQ+DP K
Subjt:  VGLTPLVEVHDEKEMDRMLAIDGIELIGINNRNLETFEVDISNTKRLLEGERGQNIRKKNVTIVGESGLFTPDDIAYVQEAGVKAVLVGESIVKQSDPTK

Query:  GITGLF
        GI GLF
Subjt:  GITGLF

Q55508 Indole-3-glycerol phosphate synthase4.6e-7249.83Show/hide
Query:  IRIRRRPPTGP----------------PLHYMKE-----------RR---PLISLKRDLERAPPARDFLGALKAAYLRTNLPGLIAEVKKASPSRGVLRE
        + IRRRPP  P                P H ++E           RR   PL+ L+  ++   P  DF+GAL+ +      P LIAEVKKASPS+G++R 
Subjt:  IRIRRRPPTGP----------------PLHYMKE-----------RR---PLISLKRDLERAPPARDFLGALKAAYLRTNLPGLIAEVKKASPSRGVLRE

Query:  DFDPVEIAQAYEKGGAACLSVLTDQKFFQGSFENLEKIRNAGVKCPLLCKEFVVDAWQIYYARSKGADAILLIAAVLPDLDIKYMTKICKMVGLTPLVEV
        DFDPV IA+AYE GGA CLSVLTD+KFFQGSFENL+ +R+A V+ PLLCKEF++  +QIY ARS+GADA+LLIAA+L D D++Y  KI + +G+  LVEV
Subjt:  DFDPVEIAQAYEKGGAACLSVLTDQKFFQGSFENLEKIRNAGVKCPLLCKEFVVDAWQIYYARSKGADAILLIAAVLPDLDIKYMTKICKMVGLTPLVEV

Query:  HDEKEMDRMLAIDGIELIGINNRNLETFEVDISNTKRLLEGERGQNIRKKNVTIVGESGLFTPDDIAYVQEAGVKAVLVGESIVKQSDPTKGITGLF
        H  +EMDR+LA+DG++LIG+NNRNL+TF VD+  T+ L   +R + + + ++T+V ESG++   D+  +Q+AG +AVLVGES+VKQ DP + I  L+
Subjt:  HDEKEMDRMLAIDGIELIGINNRNLETFEVDISNTKRLLEGERGQNIRKKNVTIVGESGLFTPDDIAYVQEAGVKAVLVGESIVKQSDPTKGITGLF

Arabidopsis top hitse value%identityAlignment
AT2G04400.1 Aldolase-type TIM barrel family protein2.2e-12259.11Show/hide
Query:  MEGLASLRTIP-KVSFPPISSSTRRSKFLSRRLNFSPSMDTSLRNSISLASSSSSSPISTAIRAQQIESEAGSAAASPVTELEENALKVKEWEVGMFQDE
        MEGL  ++ +P KV+ P +            R N S S+  S+         +  +P   +IRAQQ + +   A +S   E + N L++KEWEV M+Q+E
Subjt:  MEGLASLRTIP-KVSFPPISSSTRRSKFLSRRLNFSPSMDTSLRNSISLASSSSSSPISTAIRAQQIESEAGSAAASPVTELEENALKVKEWEVGMFQDE

Query:  VAASQGIRIRRRPPTGPPLHY---------------------------------MKERRPLISLKRDLERAPPARDFLGALKAAYLRTNLPGLIAEVKKA
        +A SQGIRIRR+PP+  PL Y                                 MKE  PL  LK+ +E APP RDF+GAL+ A+ RT  PGLIAEVKKA
Subjt:  VAASQGIRIRRRPPTGPPLHY---------------------------------MKERRPLISLKRDLERAPPARDFLGALKAAYLRTNLPGLIAEVKKA

Query:  SPSRGVLREDFDPVEIAQAYEKGGAACLSVLTDQKFFQGSFENLEKIRNAGVKCPLLCKEFVVDAWQIYYARSKGADAILLIAAVLPDLDIKYMTKICKM
        SPSRG+L+E+FDPVEIAQAYEKGGAACLSVLTDQK+FQG FENLE IR+AGVKCPLLCKEFVVD WQIYYAR+KGADA+LLIAAVL DL+I ++ KICK 
Subjt:  SPSRGVLREDFDPVEIAQAYEKGGAACLSVLTDQKFFQGSFENLEKIRNAGVKCPLLCKEFVVDAWQIYYARSKGADAILLIAAVLPDLDIKYMTKICKM

Query:  VGLTPLVEVHDEKEMDRMLAIDGIELIGINNRNLETFEVDISNTKRLLEGERGQNIRKKNVTIVGESGLFTPDDIAYVQEAGVKAVLVGESIVKQSDPTK
        + L  LVEVHDE+EM R+L I+GIEL+GINNR+LETFEVDISNTK+LLEGE G+ IR++++ +VGESGLFTPDDIAYVQ AGVKAVLVGESIVKQ+DP K
Subjt:  VGLTPLVEVHDEKEMDRMLAIDGIELIGINNRNLETFEVDISNTKRLLEGERGQNIRKKNVTIVGESGLFTPDDIAYVQEAGVKAVLVGESIVKQSDPTK

Query:  GITGLF
        GI GLF
Subjt:  GITGLF

AT4G25330.1 unknown protein2.3e-2637.71Show/hide
Query:  ELEFR-WDDDAWYNVTVKLEGDDLRISYCEFDKEHDNVFHANHFRSLSELSDFEARFRPLSRQLQDSECPNVVPGMPVCASHSSRADDVRFYDALVEGVD
        ELEFR  +D+AWY V      D L IS+  F  EHD  + A+ F++  E+ +FE RFR  S Q+QD ECP V  G  VCA+  S  ++V+FYDA+V  V+
Subjt:  ELEFR-WDDDAWYNVTVKLEGDDLRISYCEFDKEHDNVFHANHFRSLSELSDFEARFRPLSRQLQDSECPNVVPGMPVCASHSSRADDVRFYDALVEGVD

Query:  YLEHSY-ANGEEECLCNFILLWQHGPNSGNLTFASIANLCQIQFDEINDTVLATFFAKVREKIQTRMNRGGTCSE
          +H     G E C C+F L W+ GP    +T A + ++C    D   +  + +F  + R K+      G TC++
Subjt:  YLEHSY-ANGEEECLCNFILLWQHGPNSGNLTFASIANLCQIQFDEINDTVLATFFAKVREKIQTRMNRGGTCSE

AT5G48220.1 Aldolase-type TIM barrel family protein2.1e-12567.96Show/hide
Query:  AAASPVTELEENAL--KVKEWEVGMFQDEVAASQGIRIRRRPPTGPPLHY--------------------------------MKERRPLISLKRDLERAP
        A  S +TE  ++AL  KV E EVGM+Q+EV  SQGIRIRRRPPTGPPLHY                                MKER+PL SLK+ L+  P
Subjt:  AAASPVTELEENAL--KVKEWEVGMFQDEVAASQGIRIRRRPPTGPPLHY--------------------------------MKERRPLISLKRDLERAP

Query:  PARDFLGALKAAYLRTNLPGLIAEVKKASPSRGVLREDFDPVEIAQAYEKGGAACLSVLTDQKFFQGSFENLEKIRNAGVKCPLLCKEFVVDAWQIYYAR
        PA+DF+GAL++A+ RT LPGLIAEVKKASPSRG+LREDF+PVEIAQAYEKGGAACLSVLTD K+F+GS+ENL+ I  AGVKCPLL KEF+V+AWQIYY R
Subjt:  PARDFLGALKAAYLRTNLPGLIAEVKKASPSRGVLREDFDPVEIAQAYEKGGAACLSVLTDQKFFQGSFENLEKIRNAGVKCPLLCKEFVVDAWQIYYAR

Query:  SKGADAILLIAAVLPDLDIKYMTKICKMVGLTPLVEVHDEKEMDRMLAIDGIELIGINNRNLETFEVDISNTKRLLEGERGQNIRKKNVTIVGESGLFTP
        SKGADA+LLIA+VLPDLDIKYM KICK++G+  LVEVHDE+EMDR+LAI+G+ELIGINNRNLETFEVD+  TK+LLEGERG+ IR+K++ +VGESGLFTP
Subjt:  SKGADAILLIAAVLPDLDIKYMTKICKMVGLTPLVEVHDEKEMDRMLAIDGIELIGINNRNLETFEVDISNTKRLLEGERGQNIRKKNVTIVGESGLFTP

Query:  DDIAYVQEAGVKAVLVGESIVKQSDPTKGITGLF
        +DIA+VQEAGVKAVLVGES++KQSDP K I+ LF
Subjt:  DDIAYVQEAGVKAVLVGESIVKQSDPTKGITGLF

AT5G48220.2 Aldolase-type TIM barrel family protein1.9e-12169.35Show/hide
Query:  MFQDEVAASQGIRIRRRPPTGPPLHY--------------------------------MKERRPLISLKRDLERAPPARDFLGALKAAYLRTNLPGLIAE
        M+Q+EV  SQGIRIRRRPPTGPPLHY                                MKER+PL SLK+ L+  PPA+DF+GAL++A+ RT LPGLIAE
Subjt:  MFQDEVAASQGIRIRRRPPTGPPLHY--------------------------------MKERRPLISLKRDLERAPPARDFLGALKAAYLRTNLPGLIAE

Query:  VKKASPSRGVLREDFDPVEIAQAYEKGGAACLSVLTDQKFFQGSFENLEKIRNAGVKCPLLCKEFVVDAWQIYYARSKGADAILLIAAVLPDLDIKYMTK
        VKKASPSRG+LREDF+PVEIAQAYEKGGAACLSVLTD K+F+GS+ENL+ I  AGVKCPLL KEF+V+AWQIYY RSKGADA+LLIA+VLPDLDIKYM K
Subjt:  VKKASPSRGVLREDFDPVEIAQAYEKGGAACLSVLTDQKFFQGSFENLEKIRNAGVKCPLLCKEFVVDAWQIYYARSKGADAILLIAAVLPDLDIKYMTK

Query:  ICKMVGLTPLVEVHDEKEMDRMLAIDGIELIGINNRNLETFEVDISNTKRLLEGERGQNIRKKNVTIVGESGLFTPDDIAYVQEAGVKAVLVGESIVKQS
        ICK++G+  LVEVHDE+EMDR+LAI+G+ELIGINNRNLETFEVD+  TK+LLEGERG+ IR+K++ +VGESGLFTP+DIA+VQEAGVKAVLVGES++KQS
Subjt:  ICKMVGLTPLVEVHDEKEMDRMLAIDGIELIGINNRNLETFEVDISNTKRLLEGERGQNIRKKNVTIVGESGLFTPDDIAYVQEAGVKAVLVGESIVKQS

Query:  DPTKGITGLF
        DP K I+ LF
Subjt:  DPTKGITGLF

AT5G48220.3 Aldolase-type TIM barrel family protein1.1e-12465.16Show/hide
Query:  ISTAIRAQQIESEAGSAA---ASPVTELEENAL--KVKEWEVGMFQDEVAASQGIRIRRRPPTGPPLHY-------------------------------
        +S A + Q+   +    A   +S +TE  ++AL  KV E EVGM+Q+EV  SQGIRIRRRPPTGPPLHY                               
Subjt:  ISTAIRAQQIESEAGSAA---ASPVTELEENAL--KVKEWEVGMFQDEVAASQGIRIRRRPPTGPPLHY-------------------------------

Query:  -MKERRPLISLKRDLERAPPARDFLGALKAAYLRTNLPGLIAEVKKASPSRGVLREDFDPVEIAQAYEKGGAACLSVLTDQKFFQGSFENLEKIRNAGVK
         MKER+PL SLK+ L+  PPA+DF+GAL++A+ RT LPGLIAEVKKASPSRG+LREDF+PVEIAQAYEKGGAACLSVLTD K+F+GS+ENL+ I  AGVK
Subjt:  -MKERRPLISLKRDLERAPPARDFLGALKAAYLRTNLPGLIAEVKKASPSRGVLREDFDPVEIAQAYEKGGAACLSVLTDQKFFQGSFENLEKIRNAGVK

Query:  CPLLCKEFVVDAWQIYYARSKGADAILLIAAVLPDLDIKYMTKICKMVGLTPLVEVHDEKEMDRMLAIDGIELIGINNRNLETFEVDISNTKRLLEGERG
        CPLL KEF+V+AWQIYY RSKGADA+LLIA+VLPDLDIKYM KICK++G+  LVEVHDE+EMDR+LAI+G+ELIGINNRNLETFEVD+  TK+LLEGERG
Subjt:  CPLLCKEFVVDAWQIYYARSKGADAILLIAAVLPDLDIKYMTKICKMVGLTPLVEVHDEKEMDRMLAIDGIELIGINNRNLETFEVDISNTKRLLEGERG

Query:  QNIRKKNVTIVGESGLFTPDDIAYVQEAGVKAVLVGESIVKQSDPTKGITGLF
        + IR+K++ +VGESGLFTP+DIA+VQEAGVKAVLVGES++KQSDP K I+ LF
Subjt:  QNIRKKNVTIVGESGLFTPDDIAYVQEAGVKAVLVGESIVKQSDPTKGITGLF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGGGCCTCGCTTCCCTCAGAACAATTCCAAAGGTTTCCTTTCCACCCATTTCCTCTTCTACTCGCAGGTCCAAGTTTCTTTCCAGAAGGTTGAATTTCAGCCCTTC
AATGGACACTTCTTTGAGGAATTCAATCTCTCTTGCCTCTTCTTCTTCCTCCTCTCCCATTTCAACTGCCATCCGAGCTCAACAGATAGAATCCGAGGCTGGCTCAGCTG
CGGCTTCACCAGTTACTGAATTAGAAGAAAATGCTCTGAAAGTGAAGGAATGGGAAGTGGGGATGTTCCAAGATGAAGTAGCTGCAAGTCAAGGCATAAGGATTCGTCGC
AGGCCGCCCACGGGACCGCCATTGCATTATATGAAAGAAAGAAGGCCTCTCATCTCGTTGAAAAGGGACCTTGAGAGGGCCCCTCCTGCTAGAGATTTCCTTGGAGCTCT
CAAGGCTGCCTACCTTAGAACTAATTTACCTGGTTTGATTGCTGAAGTAAAAAAGGCTTCTCCCAGCAGAGGAGTTTTAAGGGAGGATTTTGATCCAGTTGAGATTGCCC
AAGCGTATGAGAAAGGTGGAGCAGCATGCCTTAGTGTTCTCACAGATCAGAAGTTTTTTCAGGGAAGCTTTGAGAATCTGGAGAAGATAAGGAATGCTGGAGTGAAGTGC
CCTCTCTTGTGCAAGGAGTTTGTGGTTGATGCATGGCAGATCTACTATGCCCGATCGAAAGGTGCCGATGCCATTCTTTTGATTGCTGCTGTTTTGCCTGATCTCGACAT
TAAATATATGACTAAAATCTGCAAGATGGTCGGTTTGACCCCCCTTGTTGAGGTTCATGACGAGAAGGAAATGGATCGTATGCTCGCAATTGATGGCATTGAGCTTATTG
GCATCAACAATCGGAATCTTGAAACATTTGAGGTCGATATCAGCAACACAAAAAGGCTTCTTGAAGGAGAGCGTGGACAAAACATCCGCAAGAAGAACGTAACAATAGTG
GGAGAATCTGGGCTGTTCACTCCTGATGATATTGCTTATGTGCAAGAAGCTGGTGTTAAAGCTGTTCTGGTTGGCGAGTCGATTGTGAAACAGAGCGACCCCACGAAGGG
AATAACTGGACTTTTTGAGAAACTTCCACTCGAACTCGCCTCTATTTCTCCGGCGATGTTCTCTGCCGGAGCCGCGGCCTCCGGCGAACTGGAGTTCCGGTGGGACGATG
ACGCGTGGTACAATGTGACCGTGAAACTCGAAGGCGATGATCTTAGGATCAGTTATTGCGAGTTCGATAAGGAACATGACAATGTCTTCCACGCCAACCATTTCCGGAGC
TTATCGGAGTTGAGCGACTTCGAAGCTAGGTTTCGGCCTTTGTCCAGACAGTTGCAGGATTCCGAATGCCCTAACGTCGTCCCTGGAATGCCCGTTTGCGCTTCCCACTC
CTCTCGAGCCGATGATGTTCGCTTCTACGATGCTCTTGTGGAAGGGGTGGATTATCTTGAACACTCTTATGCAAATGGAGAAGAAGAGTGCTTATGCAACTTTATCCTTT
TATGGCAGCATGGTCCAAACTCTGGGAATTTGACATTTGCCAGTATTGCTAACTTGTGTCAAATTCAATTTGATGAAATTAATGACACAGTGTTAGCAACCTTCTTCGCG
AAAGTTAGGGAGAAAATCCAAACCAGAATGAATAGAGGTGGTACCTGTTCTGAAGATCGCCTTCTCACCCATAATGACGGTGGTGCTCATCAGAAGGATGAATGCAGCCT
AAAATTAAAGCGTCGCCTATCCTTTTTTGAACGCATGGACCAGGTGGACACACGGCGTGCCAAGCGTTCTTCTGGGGCAGTAGGACTATGGGAAGACCAACAGACTCTCA
GTTCCAGAAAAAGTGGGGTGATCGAGCAAGATACTGATATTGGTGGAGTGAAGTATCAGTTCATGATTTTACTTGAGAATCTAGATAAAGAATTGTCTCCAGTAAAACTT
GCCAAATTCTTACATGCACAAACATTGATATTACCTCGAGTATACATTTTTCCAAGTTTAACATTTGAGGCGTATGCAAAGGGAGCTGTTGTATTGAATTGCAGAAAGAA
CTTAGAGAGGTTGTGCGATTTTTTGGATAATCCAGACCATGTCATTTTATCCTCCCAAGGAAGGCCCTTGGTAGTAACCGGAAGAATAGCAGGACGCGAAACTTTTGGGA
CATTGGCGGCAGGGGCCATGGTGCTAGACTCGGAAAATAAATTTGGTAATGAAAAAGATGGGAGGGTGCGTTGCGAACTGAAGGTTGTGAAAGTAGGAACAAATGAATAT
TTGACTGCAAAGCACATGAAGGAATTGTTCATGGAGTTTCTTAACCATCAAAGGAGGTTGCACCAAAGATTGGCCATGGAGGAGGGAAAGATCTATTGCAATGGTGCTTT
GTAA
mRNA sequenceShow/hide mRNA sequence
TTAAAAATAGGGAAAACAGAGTTTCAAGTTGCAACCTTCATCGGCTCCAAGTCCCAACAAGAACAACAAAAACGCCCACAAATTTGACCTATTTTTTGTTTTTTGGTTTT
GTTTTGTTTCATATTTTGGTGTAACTCCTTAATTCGCCAATATCTATAAAACAATCTGCTTCACAGACGCAGGGTTTTCATTTTCAAACGAGCTGAAGTTTTGTTCGACA
ATGGAGGGCCTCGCTTCCCTCAGAACAATTCCAAAGGTTTCCTTTCCACCCATTTCCTCTTCTACTCGCAGGTCCAAGTTTCTTTCCAGAAGGTTGAATTTCAGCCCTTC
AATGGACACTTCTTTGAGGAATTCAATCTCTCTTGCCTCTTCTTCTTCCTCCTCTCCCATTTCAACTGCCATCCGAGCTCAACAGATAGAATCCGAGGCTGGCTCAGCTG
CGGCTTCACCAGTTACTGAATTAGAAGAAAATGCTCTGAAAGTGAAGGAATGGGAAGTGGGGATGTTCCAAGATGAAGTAGCTGCAAGTCAAGGCATAAGGATTCGTCGC
AGGCCGCCCACGGGACCGCCATTGCATTATATGAAAGAAAGAAGGCCTCTCATCTCGTTGAAAAGGGACCTTGAGAGGGCCCCTCCTGCTAGAGATTTCCTTGGAGCTCT
CAAGGCTGCCTACCTTAGAACTAATTTACCTGGTTTGATTGCTGAAGTAAAAAAGGCTTCTCCCAGCAGAGGAGTTTTAAGGGAGGATTTTGATCCAGTTGAGATTGCCC
AAGCGTATGAGAAAGGTGGAGCAGCATGCCTTAGTGTTCTCACAGATCAGAAGTTTTTTCAGGGAAGCTTTGAGAATCTGGAGAAGATAAGGAATGCTGGAGTGAAGTGC
CCTCTCTTGTGCAAGGAGTTTGTGGTTGATGCATGGCAGATCTACTATGCCCGATCGAAAGGTGCCGATGCCATTCTTTTGATTGCTGCTGTTTTGCCTGATCTCGACAT
TAAATATATGACTAAAATCTGCAAGATGGTCGGTTTGACCCCCCTTGTTGAGGTTCATGACGAGAAGGAAATGGATCGTATGCTCGCAATTGATGGCATTGAGCTTATTG
GCATCAACAATCGGAATCTTGAAACATTTGAGGTCGATATCAGCAACACAAAAAGGCTTCTTGAAGGAGAGCGTGGACAAAACATCCGCAAGAAGAACGTAACAATAGTG
GGAGAATCTGGGCTGTTCACTCCTGATGATATTGCTTATGTGCAAGAAGCTGGTGTTAAAGCTGTTCTGGTTGGCGAGTCGATTGTGAAACAGAGCGACCCCACGAAGGG
AATAACTGGACTTTTTGAGAAACTTCCACTCGAACTCGCCTCTATTTCTCCGGCGATGTTCTCTGCCGGAGCCGCGGCCTCCGGCGAACTGGAGTTCCGGTGGGACGATG
ACGCGTGGTACAATGTGACCGTGAAACTCGAAGGCGATGATCTTAGGATCAGTTATTGCGAGTTCGATAAGGAACATGACAATGTCTTCCACGCCAACCATTTCCGGAGC
TTATCGGAGTTGAGCGACTTCGAAGCTAGGTTTCGGCCTTTGTCCAGACAGTTGCAGGATTCCGAATGCCCTAACGTCGTCCCTGGAATGCCCGTTTGCGCTTCCCACTC
CTCTCGAGCCGATGATGTTCGCTTCTACGATGCTCTTGTGGAAGGGGTGGATTATCTTGAACACTCTTATGCAAATGGAGAAGAAGAGTGCTTATGCAACTTTATCCTTT
TATGGCAGCATGGTCCAAACTCTGGGAATTTGACATTTGCCAGTATTGCTAACTTGTGTCAAATTCAATTTGATGAAATTAATGACACAGTGTTAGCAACCTTCTTCGCG
AAAGTTAGGGAGAAAATCCAAACCAGAATGAATAGAGGTGGTACCTGTTCTGAAGATCGCCTTCTCACCCATAATGACGGTGGTGCTCATCAGAAGGATGAATGCAGCCT
AAAATTAAAGCGTCGCCTATCCTTTTTTGAACGCATGGACCAGGTGGACACACGGCGTGCCAAGCGTTCTTCTGGGGCAGTAGGACTATGGGAAGACCAACAGACTCTCA
GTTCCAGAAAAAGTGGGGTGATCGAGCAAGATACTGATATTGGTGGAGTGAAGTATCAGTTCATGATTTTACTTGAGAATCTAGATAAAGAATTGTCTCCAGTAAAACTT
GCCAAATTCTTACATGCACAAACATTGATATTACCTCGAGTATACATTTTTCCAAGTTTAACATTTGAGGCGTATGCAAAGGGAGCTGTTGTATTGAATTGCAGAAAGAA
CTTAGAGAGGTTGTGCGATTTTTTGGATAATCCAGACCATGTCATTTTATCCTCCCAAGGAAGGCCCTTGGTAGTAACCGGAAGAATAGCAGGACGCGAAACTTTTGGGA
CATTGGCGGCAGGGGCCATGGTGCTAGACTCGGAAAATAAATTTGGTAATGAAAAAGATGGGAGGGTGCGTTGCGAACTGAAGGTTGTGAAAGTAGGAACAAATGAATAT
TTGACTGCAAAGCACATGAAGGAATTGTTCATGGAGTTTCTTAACCATCAAAGGAGGTTGCACCAAAGATTGGCCATGGAGGAGGGAAAGATCTATTGCAATGGTGCTTT
GTAATTACAAGTTGTAAATCAATCATAGGTTAGTTGGATAAGGTTCTTGTTTTTGTGGAAAAAGAAATTACTTTGAGCTGTACATTTTAGATTTTACACTGGATAATGGT
GTAGGATGTAGAAACCACCAACTTCCATTTTTGTATAGGCCTTCTCCTAGTTGAGTTTATGTGGGATTTTGGCTGTATTAAAGAAATGAACATTTTGATGTAATACTCAT
GCTCTTTTTGTTCCCTCTAACTCCAAGGACTATTAGGCAC
Protein sequenceShow/hide protein sequence
MEGLASLRTIPKVSFPPISSSTRRSKFLSRRLNFSPSMDTSLRNSISLASSSSSSPISTAIRAQQIESEAGSAAASPVTELEENALKVKEWEVGMFQDEVAASQGIRIRR
RPPTGPPLHYMKERRPLISLKRDLERAPPARDFLGALKAAYLRTNLPGLIAEVKKASPSRGVLREDFDPVEIAQAYEKGGAACLSVLTDQKFFQGSFENLEKIRNAGVKC
PLLCKEFVVDAWQIYYARSKGADAILLIAAVLPDLDIKYMTKICKMVGLTPLVEVHDEKEMDRMLAIDGIELIGINNRNLETFEVDISNTKRLLEGERGQNIRKKNVTIV
GESGLFTPDDIAYVQEAGVKAVLVGESIVKQSDPTKGITGLFEKLPLELASISPAMFSAGAAASGELEFRWDDDAWYNVTVKLEGDDLRISYCEFDKEHDNVFHANHFRS
LSELSDFEARFRPLSRQLQDSECPNVVPGMPVCASHSSRADDVRFYDALVEGVDYLEHSYANGEEECLCNFILLWQHGPNSGNLTFASIANLCQIQFDEINDTVLATFFA
KVREKIQTRMNRGGTCSEDRLLTHNDGGAHQKDECSLKLKRRLSFFERMDQVDTRRAKRSSGAVGLWEDQQTLSSRKSGVIEQDTDIGGVKYQFMILLENLDKELSPVKL
AKFLHAQTLILPRVYIFPSLTFEAYAKGAVVLNCRKNLERLCDFLDNPDHVILSSQGRPLVVTGRIAGRETFGTLAAGAMVLDSENKFGNEKDGRVRCELKVVKVGTNEY
LTAKHMKELFMEFLNHQRRLHQRLAMEEGKIYCNGAL