; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS009209 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS009209
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
Description5-formyltetrahydrofolate cyclo-ligase-like protein COG0212
Genome locationscaffold220:781420..784205
RNA-Seq ExpressionMS009209
SyntenyMS009209
Gene Ontology termsGO:0016874 - ligase activity (molecular function)
InterPro domainsIPR002698 - 5-formyltetrahydrofolate cyclo-ligase
IPR024185 - 5-formyltetrahydrofolate cyclo-ligase-like domain superfamily
IPR037171 - NagB/RpiA transferase-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6578879.1 5-formyltetrahydrofolate cyclo-ligase-like protein, partial [Cucurbita argyrosperma subsp. sororia]2.0e-17482.59Show/hide
Query:  MDSSLLQLSHPLSTRSTFLNVIRKSSSASSISIGRNSQFQRYFKLESSRARG-----DVAFDEAAFEADRSRLDAEAEKSMAEASIRATEGAPADDPKAW
        MDSS LQLS PLS +STFLN IRK+SS  SI +GRN QFQR+FKLESS+  G     D AFDEAAFEA+RSRLDA A KSMAEAS+R TE A  DDPKAW
Subjt:  MDSSLLQLSHPLSTRSTFLNVIRKSSSASSISIGRNSQFQRYFKLESSRARG-----DVAFDEAAFEADRSRLDAEAEKSMAEASIRATEGAPADDPKAW

Query:  KWVIRKRIWDLMESQNVAANPRPVHHRIPNFVGAMEAANRLCDLEVFRDAQCVKVNPDSPQKGVRLLTLTGGKKLLTPQPRLRTGFFSIVESGMLTPATI
        KWVIRKRIWD MESQN+AANPRPVHHRIPNFVGAMEAANRLCDLEVFRD+QCVKVNPDSPQKGVRLLTL GGKKLLTPQPRLRTGFFSIVESGMLT ATI
Subjt:  KWVIRKRIWDLMESQNVAANPRPVHHRIPNFVGAMEAANRLCDLEVFRDAQCVKVNPDSPQKGVRLLTLTGGKKLLTPQPRLRTGFFSIVESGMLTPATI

Query:  NEACTSVGVAKYGKPIGLDEKIKVDLIVIGSVAVDPRTGARLGKGEGFAELEYGMLRYMGAIDDSTLIVTSGCVTSLRPSYFLALHDCQLVDDFPVEKLL
         EACTSVGVAKYGKPIGLDEKIKVDLIVIGSVAVDPRTGARLGKGEGFAELEYGMLRYMGAIDDST IVTS             +HDCQLVDD PVEKLL
Subjt:  NEACTSVGVAKYGKPIGLDEKIKVDLIVIGSVAVDPRTGARLGKGEGFAELEYGMLRYMGAIDDSTLIVTSGCVTSLRPSYFLALHDCQLVDDFPVEKLL

Query:  VHDVPVDIVCTPTQVILTNTKIPKPQGLLLHAKFPLENKFDPSGIYWEMLSPEKLSQVRILRELKRRIERETGQPLPCGPSEKLPPTAQRSSKPTRRASS
        VHDVPVDIVCTPTQVILTNTKIPKPQ                 GIYWEMLSPEKLSQVRILRELKRRIERETG+PLPCGPSEKLPPTAQRSSKP RRASS
Subjt:  VHDVPVDIVCTPTQVILTNTKIPKPQGLLLHAKFPLENKFDPSGIYWEMLSPEKLSQVRILRELKRRIERETGQPLPCGPSEKLPPTAQRSSKPTRRASS

Query:  KN
        KN
Subjt:  KN

KAG7016410.1 5-formyltetrahydrofolate cyclo-ligase-like protein [Cucurbita argyrosperma subsp. argyrosperma]7.5e-17482.34Show/hide
Query:  MDSSLLQLSHPLSTRSTFLNVIRKSSSASSISIGRNSQFQRYFKLESSRARG-----DVAFDEAAFEADRSRLDAEAEKSMAEASIRATEGAPADDPKAW
        MDSS LQLS PLS +STFLN IRK+SS  SI +GRN QFQR+FKLESS+  G     D AFDEAAFEA+RSRLDA A KSMAEAS+R TE A  DDPKAW
Subjt:  MDSSLLQLSHPLSTRSTFLNVIRKSSSASSISIGRNSQFQRYFKLESSRARG-----DVAFDEAAFEADRSRLDAEAEKSMAEASIRATEGAPADDPKAW

Query:  KWVIRKRIWDLMESQNVAANPRPVHHRIPNFVGAMEAANRLCDLEVFRDAQCVKVNPDSPQKGVRLLTLTGGKKLLTPQPRLRTGFFSIVESGMLTPATI
        KWVIRKRIWD MESQN+AANPRPVHHRIPNFVGAMEAANRLCDLEVFRD+QCVKVNPDSPQKGVRLLTL GGKKLLTPQPRLRTGFFSIVESGMLT ATI
Subjt:  KWVIRKRIWDLMESQNVAANPRPVHHRIPNFVGAMEAANRLCDLEVFRDAQCVKVNPDSPQKGVRLLTLTGGKKLLTPQPRLRTGFFSIVESGMLTPATI

Query:  NEACTSVGVAKYGKPIGLDEKIKVDLIVIGSVAVDPRTGARLGKGEGFAELEYGMLRYMGAIDDSTLIVTSGCVTSLRPSYFLALHDCQLVDDFPVEKLL
         EACTSVGVAKYGKPIGLDEKIKVDLIVIGSVAVDPRTGARLGKGEGFAELEYGMLR+MGAIDDST IVTS             +HDCQLVDD PVEKLL
Subjt:  NEACTSVGVAKYGKPIGLDEKIKVDLIVIGSVAVDPRTGARLGKGEGFAELEYGMLRYMGAIDDSTLIVTSGCVTSLRPSYFLALHDCQLVDDFPVEKLL

Query:  VHDVPVDIVCTPTQVILTNTKIPKPQGLLLHAKFPLENKFDPSGIYWEMLSPEKLSQVRILRELKRRIERETGQPLPCGPSEKLPPTAQRSSKPTRRASS
        VHDVPVDIVCTPTQVILTNTKIPKPQ                 GIYWEMLSPEKLSQVRILRELKRRIERETG+PLPCGPSEKLPPTAQRSSKP RRASS
Subjt:  VHDVPVDIVCTPTQVILTNTKIPKPQGLLLHAKFPLENKFDPSGIYWEMLSPEKLSQVRILRELKRRIERETGQPLPCGPSEKLPPTAQRSSKPTRRASS

Query:  KN
        KN
Subjt:  KN

XP_022134547.1 5-formyltetrahydrofolate cyclo-ligase-like protein COG0212 [Momordica charantia]6.9e-19691.94Show/hide
Query:  MDSSLLQLSHPLSTRSTFLNVIRKSSSASSISIGRNSQFQRYFKLESSRARGDVAFDEAAFEADRSRLDAEAEKSMAEASIRATEGAPADDPKAWKWVIR
        MDSSLLQLSHPLSTRSTFLNVIRKSSSASSISIGRNSQFQRYFKLESSRARGDVAFDEAAFEADRSRLDAEAEKSMAEASIRATEGAPADDPKAWKWVIR
Subjt:  MDSSLLQLSHPLSTRSTFLNVIRKSSSASSISIGRNSQFQRYFKLESSRARGDVAFDEAAFEADRSRLDAEAEKSMAEASIRATEGAPADDPKAWKWVIR

Query:  KRIWDLMESQNVAANPRPVHHRIPNFVGAMEAANRLCDLEVFRDAQCVKVNPDSPQKGVRLLTLTGGKKLLTPQPRLRTGFFSIVESGMLTPATINEACT
        KRIWDLMESQNVAANPRPVHHRIPNFVGAMEAANRLCDLEVFRDAQCVKVNPDSPQKGVRLLTLTGGKKLLTPQPRLRTGFFSIVESGMLTPATINEACT
Subjt:  KRIWDLMESQNVAANPRPVHHRIPNFVGAMEAANRLCDLEVFRDAQCVKVNPDSPQKGVRLLTLTGGKKLLTPQPRLRTGFFSIVESGMLTPATINEACT

Query:  SVGVAKYGKPIGLDEKIKVDLIVIGSVAVDPRTGARLGKGEGFAELEYGMLRYMGAIDDSTLIVTSGCVTSLRPSYFLALHDCQLVDDFPVEKLLVHDVP
        SVGVAKYGKPIGLDEKIKVDLIVIGSVAVDPRTGARLGKGEGFAELEYGMLRYMGAIDDSTLIVTS             +HDCQLVDDFPVEKLLVHDVP
Subjt:  SVGVAKYGKPIGLDEKIKVDLIVIGSVAVDPRTGARLGKGEGFAELEYGMLRYMGAIDDSTLIVTSGCVTSLRPSYFLALHDCQLVDDFPVEKLLVHDVP

Query:  VDIVCTPTQVILTNTKIPKPQGLLLHAKFPLENKFDPSGIYWEMLSPEKLSQVRILRELKRRIERETGQPLPCGPSEKLPPTAQRSSKPTRRASSKN
        VDIVCTP QVILTNTKIPKPQ                 GIYWEMLSPEKLSQVRILRELKRRIERETGQPLPCGPSEKLPPTAQRSSKPTRRASSKN
Subjt:  VDIVCTPTQVILTNTKIPKPQGLLLHAKFPLENKFDPSGIYWEMLSPEKLSQVRILRELKRRIERETGQPLPCGPSEKLPPTAQRSSKPTRRASSKN

XP_022939928.1 5-formyltetrahydrofolate cyclo-ligase-like protein COG0212 isoform X1 [Cucurbita moschata]1.5e-17482.59Show/hide
Query:  MDSSLLQLSHPLSTRSTFLNVIRKSSSASSISIGRNSQFQRYFKLESSRARG-----DVAFDEAAFEADRSRLDAEAEKSMAEASIRATEGAPADDPKAW
        MDSS LQLS PLS +STFLN IRK+SS  SI +GRN QFQR+FKLESS+  G     D AFDEAAFEA+RSRLDA A KSMAEAS+R TEGA  DDPKAW
Subjt:  MDSSLLQLSHPLSTRSTFLNVIRKSSSASSISIGRNSQFQRYFKLESSRARG-----DVAFDEAAFEADRSRLDAEAEKSMAEASIRATEGAPADDPKAW

Query:  KWVIRKRIWDLMESQNVAANPRPVHHRIPNFVGAMEAANRLCDLEVFRDAQCVKVNPDSPQKGVRLLTLTGGKKLLTPQPRLRTGFFSIVESGMLTPATI
        KWVIRKRIWD MESQN+AANPRPVHHRIPNFVGAMEAANRLCDLEVFRD+QCVKVNPDSPQKGVRLLTL GGKKLLTPQPRLRTGFFSIVESGMLT ATI
Subjt:  KWVIRKRIWDLMESQNVAANPRPVHHRIPNFVGAMEAANRLCDLEVFRDAQCVKVNPDSPQKGVRLLTLTGGKKLLTPQPRLRTGFFSIVESGMLTPATI

Query:  NEACTSVGVAKYGKPIGLDEKIKVDLIVIGSVAVDPRTGARLGKGEGFAELEYGMLRYMGAIDDSTLIVTSGCVTSLRPSYFLALHDCQLVDDFPVEKLL
         EACTSVGVAKYGKPIGLDEKIKVDLIVIGSVAVDPRTGARLGKGEGFAELEYGMLRYMGAIDDST IVTS             +HDCQLVDD PVEKLL
Subjt:  NEACTSVGVAKYGKPIGLDEKIKVDLIVIGSVAVDPRTGARLGKGEGFAELEYGMLRYMGAIDDSTLIVTSGCVTSLRPSYFLALHDCQLVDDFPVEKLL

Query:  VHDVPVDIVCTPTQVILTNTKIPKPQGLLLHAKFPLENKFDPSGIYWEMLSPEKLSQVRILRELKRRIERETGQPLPCGPSEKLPPTAQRSSKPTRRASS
        VHDVPVDIVCTPTQVILTNTKIPKPQ                 GIYW+MLSPEKLSQVRILRELKRRIERETG+PLPCGPSEKLPPTAQRSSKP RRASS
Subjt:  VHDVPVDIVCTPTQVILTNTKIPKPQGLLLHAKFPLENKFDPSGIYWEMLSPEKLSQVRILRELKRRIERETGQPLPCGPSEKLPPTAQRSSKPTRRASS

Query:  KN
        KN
Subjt:  KN

XP_023551592.1 5-formyltetrahydrofolate cyclo-ligase-like protein COG0212 isoform X1 [Cucurbita pepo subsp. pepo]3.3e-17482.59Show/hide
Query:  MDSSLLQLSHPLSTRSTFLNVIRKSSSASSISIGRNSQFQRYFKLESSRARG-----DVAFDEAAFEADRSRLDAEAEKSMAEASIRATEGAPADDPKAW
        MDSS LQLS PLS +STFLN IRK+SS  SI IGRN QF+R+FKLESS+  G     + AFDEAAFEA+RSRLDA A KSMAEAS+R TEGA  DDPKAW
Subjt:  MDSSLLQLSHPLSTRSTFLNVIRKSSSASSISIGRNSQFQRYFKLESSRARG-----DVAFDEAAFEADRSRLDAEAEKSMAEASIRATEGAPADDPKAW

Query:  KWVIRKRIWDLMESQNVAANPRPVHHRIPNFVGAMEAANRLCDLEVFRDAQCVKVNPDSPQKGVRLLTLTGGKKLLTPQPRLRTGFFSIVESGMLTPATI
        KWVIRKRIWD MESQN+AANPRPVHHRIPNFVGAMEAANRLCDLEVFRD+QCVKVNPDSPQKGVRLLTL GGKKLLTPQPRLRTGFFSIVESGMLT ATI
Subjt:  KWVIRKRIWDLMESQNVAANPRPVHHRIPNFVGAMEAANRLCDLEVFRDAQCVKVNPDSPQKGVRLLTLTGGKKLLTPQPRLRTGFFSIVESGMLTPATI

Query:  NEACTSVGVAKYGKPIGLDEKIKVDLIVIGSVAVDPRTGARLGKGEGFAELEYGMLRYMGAIDDSTLIVTSGCVTSLRPSYFLALHDCQLVDDFPVEKLL
         EACTSVGVAKYGKPIGLDEKIKVDLIVIGSVAVDPRTGARLGKGEGFAELEYGMLRYMGAIDDST IVTS             +HDCQLVDD PVEKLL
Subjt:  NEACTSVGVAKYGKPIGLDEKIKVDLIVIGSVAVDPRTGARLGKGEGFAELEYGMLRYMGAIDDSTLIVTSGCVTSLRPSYFLALHDCQLVDDFPVEKLL

Query:  VHDVPVDIVCTPTQVILTNTKIPKPQGLLLHAKFPLENKFDPSGIYWEMLSPEKLSQVRILRELKRRIERETGQPLPCGPSEKLPPTAQRSSKPTRRASS
        VHDVPVDIVCTPTQVILTNTKIPKPQ                 GIYWEMLSPEKLSQVRILRELKRRIERETG+PLPCGPSEKLPPTAQRSSKP RRASS
Subjt:  VHDVPVDIVCTPTQVILTNTKIPKPQGLLLHAKFPLENKFDPSGIYWEMLSPEKLSQVRILRELKRRIERETGQPLPCGPSEKLPPTAQRSSKPTRRASS

Query:  KN
        KN
Subjt:  KN

TrEMBL top hitse value%identityAlignment
A0A6J1BZW8 5-formyltetrahydrofolate cyclo-ligase-like protein COG02123.4e-19691.94Show/hide
Query:  MDSSLLQLSHPLSTRSTFLNVIRKSSSASSISIGRNSQFQRYFKLESSRARGDVAFDEAAFEADRSRLDAEAEKSMAEASIRATEGAPADDPKAWKWVIR
        MDSSLLQLSHPLSTRSTFLNVIRKSSSASSISIGRNSQFQRYFKLESSRARGDVAFDEAAFEADRSRLDAEAEKSMAEASIRATEGAPADDPKAWKWVIR
Subjt:  MDSSLLQLSHPLSTRSTFLNVIRKSSSASSISIGRNSQFQRYFKLESSRARGDVAFDEAAFEADRSRLDAEAEKSMAEASIRATEGAPADDPKAWKWVIR

Query:  KRIWDLMESQNVAANPRPVHHRIPNFVGAMEAANRLCDLEVFRDAQCVKVNPDSPQKGVRLLTLTGGKKLLTPQPRLRTGFFSIVESGMLTPATINEACT
        KRIWDLMESQNVAANPRPVHHRIPNFVGAMEAANRLCDLEVFRDAQCVKVNPDSPQKGVRLLTLTGGKKLLTPQPRLRTGFFSIVESGMLTPATINEACT
Subjt:  KRIWDLMESQNVAANPRPVHHRIPNFVGAMEAANRLCDLEVFRDAQCVKVNPDSPQKGVRLLTLTGGKKLLTPQPRLRTGFFSIVESGMLTPATINEACT

Query:  SVGVAKYGKPIGLDEKIKVDLIVIGSVAVDPRTGARLGKGEGFAELEYGMLRYMGAIDDSTLIVTSGCVTSLRPSYFLALHDCQLVDDFPVEKLLVHDVP
        SVGVAKYGKPIGLDEKIKVDLIVIGSVAVDPRTGARLGKGEGFAELEYGMLRYMGAIDDSTLIVTS             +HDCQLVDDFPVEKLLVHDVP
Subjt:  SVGVAKYGKPIGLDEKIKVDLIVIGSVAVDPRTGARLGKGEGFAELEYGMLRYMGAIDDSTLIVTSGCVTSLRPSYFLALHDCQLVDDFPVEKLLVHDVP

Query:  VDIVCTPTQVILTNTKIPKPQGLLLHAKFPLENKFDPSGIYWEMLSPEKLSQVRILRELKRRIERETGQPLPCGPSEKLPPTAQRSSKPTRRASSKN
        VDIVCTP QVILTNTKIPKPQ                 GIYWEMLSPEKLSQVRILRELKRRIERETGQPLPCGPSEKLPPTAQRSSKPTRRASSKN
Subjt:  VDIVCTPTQVILTNTKIPKPQGLLLHAKFPLENKFDPSGIYWEMLSPEKLSQVRILRELKRRIERETGQPLPCGPSEKLPPTAQRSSKPTRRASSKN

A0A6J1FIK2 5-formyltetrahydrofolate cyclo-ligase-like protein COG0212 isoform X21.8e-17382.09Show/hide
Query:  MDSSLLQLSHPLSTRSTFLNVIRKSSSASSISIGRNSQFQRYFKLESSRARG-----DVAFDEAAFEADRSRLDAEAEKSMAEASIRATEGAPADDPKAW
        MDSS LQLS PLS +STFLN IRK+SS  SI +GRN QFQR+FKLESS+  G     D AFDEAAFEA+RSRLDA A KSMAEAS+R TEGA  DDPKAW
Subjt:  MDSSLLQLSHPLSTRSTFLNVIRKSSSASSISIGRNSQFQRYFKLESSRARG-----DVAFDEAAFEADRSRLDAEAEKSMAEASIRATEGAPADDPKAW

Query:  KWVIRKRIWDLMESQNVAANPRPVHHRIPNFVGAMEAANRLCDLEVFRDAQCVKVNPDSPQKGVRLLTLTGGKKLLTPQPRLRTGFFSIVESGMLTPATI
        KWVIRKRIWD MESQN+AANPRPVHHRIPNFVGAMEAANRLCDLEVFRD+QCVKVNPDSPQKGVRLLTL GGKKLLTPQPRLRTGFFSIVESGMLT ATI
Subjt:  KWVIRKRIWDLMESQNVAANPRPVHHRIPNFVGAMEAANRLCDLEVFRDAQCVKVNPDSPQKGVRLLTLTGGKKLLTPQPRLRTGFFSIVESGMLTPATI

Query:  NEACTSVGVAKYGKPIGLDEKIKVDLIVIGSVAVDPRTGARLGKGEGFAELEYGMLRYMGAIDDSTLIVTSGCVTSLRPSYFLALHDCQLVDDFPVEKLL
         EACTSVGVAKYGKPIGLDEKIKVDLIVIGSVAVDPRTGARLGK +GFAELEYGMLRYMGAIDDST IVTS             +HDCQLVDD PVEKLL
Subjt:  NEACTSVGVAKYGKPIGLDEKIKVDLIVIGSVAVDPRTGARLGKGEGFAELEYGMLRYMGAIDDSTLIVTSGCVTSLRPSYFLALHDCQLVDDFPVEKLL

Query:  VHDVPVDIVCTPTQVILTNTKIPKPQGLLLHAKFPLENKFDPSGIYWEMLSPEKLSQVRILRELKRRIERETGQPLPCGPSEKLPPTAQRSSKPTRRASS
        VHDVPVDIVCTPTQVILTNTKIPKPQ                 GIYW+MLSPEKLSQVRILRELKRRIERETG+PLPCGPSEKLPPTAQRSSKP RRASS
Subjt:  VHDVPVDIVCTPTQVILTNTKIPKPQGLLLHAKFPLENKFDPSGIYWEMLSPEKLSQVRILRELKRRIERETGQPLPCGPSEKLPPTAQRSSKPTRRASS

Query:  KN
        KN
Subjt:  KN

A0A6J1FMW8 5-formyltetrahydrofolate cyclo-ligase-like protein COG0212 isoform X17.3e-17582.59Show/hide
Query:  MDSSLLQLSHPLSTRSTFLNVIRKSSSASSISIGRNSQFQRYFKLESSRARG-----DVAFDEAAFEADRSRLDAEAEKSMAEASIRATEGAPADDPKAW
        MDSS LQLS PLS +STFLN IRK+SS  SI +GRN QFQR+FKLESS+  G     D AFDEAAFEA+RSRLDA A KSMAEAS+R TEGA  DDPKAW
Subjt:  MDSSLLQLSHPLSTRSTFLNVIRKSSSASSISIGRNSQFQRYFKLESSRARG-----DVAFDEAAFEADRSRLDAEAEKSMAEASIRATEGAPADDPKAW

Query:  KWVIRKRIWDLMESQNVAANPRPVHHRIPNFVGAMEAANRLCDLEVFRDAQCVKVNPDSPQKGVRLLTLTGGKKLLTPQPRLRTGFFSIVESGMLTPATI
        KWVIRKRIWD MESQN+AANPRPVHHRIPNFVGAMEAANRLCDLEVFRD+QCVKVNPDSPQKGVRLLTL GGKKLLTPQPRLRTGFFSIVESGMLT ATI
Subjt:  KWVIRKRIWDLMESQNVAANPRPVHHRIPNFVGAMEAANRLCDLEVFRDAQCVKVNPDSPQKGVRLLTLTGGKKLLTPQPRLRTGFFSIVESGMLTPATI

Query:  NEACTSVGVAKYGKPIGLDEKIKVDLIVIGSVAVDPRTGARLGKGEGFAELEYGMLRYMGAIDDSTLIVTSGCVTSLRPSYFLALHDCQLVDDFPVEKLL
         EACTSVGVAKYGKPIGLDEKIKVDLIVIGSVAVDPRTGARLGKGEGFAELEYGMLRYMGAIDDST IVTS             +HDCQLVDD PVEKLL
Subjt:  NEACTSVGVAKYGKPIGLDEKIKVDLIVIGSVAVDPRTGARLGKGEGFAELEYGMLRYMGAIDDSTLIVTSGCVTSLRPSYFLALHDCQLVDDFPVEKLL

Query:  VHDVPVDIVCTPTQVILTNTKIPKPQGLLLHAKFPLENKFDPSGIYWEMLSPEKLSQVRILRELKRRIERETGQPLPCGPSEKLPPTAQRSSKPTRRASS
        VHDVPVDIVCTPTQVILTNTKIPKPQ                 GIYW+MLSPEKLSQVRILRELKRRIERETG+PLPCGPSEKLPPTAQRSSKP RRASS
Subjt:  VHDVPVDIVCTPTQVILTNTKIPKPQGLLLHAKFPLENKFDPSGIYWEMLSPEKLSQVRILRELKRRIERETGQPLPCGPSEKLPPTAQRSSKPTRRASS

Query:  KN
        KN
Subjt:  KN

A0A6J1K0E7 5-formyltetrahydrofolate cyclo-ligase-like protein COG0212 isoform X23.4e-17281.59Show/hide
Query:  MDSSLLQLSHPLSTRSTFLNVIRKSSSASSISIGRNSQFQRYFKLESSRARG-----DVAFDEAAFEADRSRLDAEAEKSMAEASIRATEGAPADDPKAW
        MDSS LQLS PLS +STFLN IRK+SS  SI IGRN QFQR+FKLESS+  G     + AFDEAAFEA+RSRLDA A KSMAEAS+R TEGA  DDPKAW
Subjt:  MDSSLLQLSHPLSTRSTFLNVIRKSSSASSISIGRNSQFQRYFKLESSRARG-----DVAFDEAAFEADRSRLDAEAEKSMAEASIRATEGAPADDPKAW

Query:  KWVIRKRIWDLMESQNVAANPRPVHHRIPNFVGAMEAANRLCDLEVFRDAQCVKVNPDSPQKGVRLLTLTGGKKLLTPQPRLRTGFFSIVESGMLTPATI
        KWVIRKRIWD MES N+AANPRPVHHRIPNFVGAMEAANRLCDLEVFRD+QCVKVNPDSPQKGVRLLTL GGKKLLTPQPRLRTGFFSIVESGMLT ATI
Subjt:  KWVIRKRIWDLMESQNVAANPRPVHHRIPNFVGAMEAANRLCDLEVFRDAQCVKVNPDSPQKGVRLLTLTGGKKLLTPQPRLRTGFFSIVESGMLTPATI

Query:  NEACTSVGVAKYGKPIGLDEKIKVDLIVIGSVAVDPRTGARLGKGEGFAELEYGMLRYMGAIDDSTLIVTSGCVTSLRPSYFLALHDCQLVDDFPVEKLL
         EACTSVGVAKYG+PIGLDEKIKVDLIVIGSVAVDPRTGARLGK +GFAELEYGMLRYMGAIDDST IVTS             +HDCQLVDD PVEKLL
Subjt:  NEACTSVGVAKYGKPIGLDEKIKVDLIVIGSVAVDPRTGARLGKGEGFAELEYGMLRYMGAIDDSTLIVTSGCVTSLRPSYFLALHDCQLVDDFPVEKLL

Query:  VHDVPVDIVCTPTQVILTNTKIPKPQGLLLHAKFPLENKFDPSGIYWEMLSPEKLSQVRILRELKRRIERETGQPLPCGPSEKLPPTAQRSSKPTRRASS
        VHDVPVDIVCTPTQVILTNTKIPKPQ                 GIYWEMLSPEKLSQVR+LRELKRRIERETG+PLPCGPSEKLPPTAQRSSKP RRASS
Subjt:  VHDVPVDIVCTPTQVILTNTKIPKPQGLLLHAKFPLENKFDPSGIYWEMLSPEKLSQVRILRELKRRIERETGQPLPCGPSEKLPPTAQRSSKPTRRASS

Query:  KN
        KN
Subjt:  KN

A0A6J1K2I1 5-formyltetrahydrofolate cyclo-ligase-like protein COG0212 isoform X11.4e-17382.09Show/hide
Query:  MDSSLLQLSHPLSTRSTFLNVIRKSSSASSISIGRNSQFQRYFKLESSRARG-----DVAFDEAAFEADRSRLDAEAEKSMAEASIRATEGAPADDPKAW
        MDSS LQLS PLS +STFLN IRK+SS  SI IGRN QFQR+FKLESS+  G     + AFDEAAFEA+RSRLDA A KSMAEAS+R TEGA  DDPKAW
Subjt:  MDSSLLQLSHPLSTRSTFLNVIRKSSSASSISIGRNSQFQRYFKLESSRARG-----DVAFDEAAFEADRSRLDAEAEKSMAEASIRATEGAPADDPKAW

Query:  KWVIRKRIWDLMESQNVAANPRPVHHRIPNFVGAMEAANRLCDLEVFRDAQCVKVNPDSPQKGVRLLTLTGGKKLLTPQPRLRTGFFSIVESGMLTPATI
        KWVIRKRIWD MES N+AANPRPVHHRIPNFVGAMEAANRLCDLEVFRD+QCVKVNPDSPQKGVRLLTL GGKKLLTPQPRLRTGFFSIVESGMLT ATI
Subjt:  KWVIRKRIWDLMESQNVAANPRPVHHRIPNFVGAMEAANRLCDLEVFRDAQCVKVNPDSPQKGVRLLTLTGGKKLLTPQPRLRTGFFSIVESGMLTPATI

Query:  NEACTSVGVAKYGKPIGLDEKIKVDLIVIGSVAVDPRTGARLGKGEGFAELEYGMLRYMGAIDDSTLIVTSGCVTSLRPSYFLALHDCQLVDDFPVEKLL
         EACTSVGVAKYG+PIGLDEKIKVDLIVIGSVAVDPRTGARLGKGEGFAELEYGMLRYMGAIDDST IVTS             +HDCQLVDD PVEKLL
Subjt:  NEACTSVGVAKYGKPIGLDEKIKVDLIVIGSVAVDPRTGARLGKGEGFAELEYGMLRYMGAIDDSTLIVTSGCVTSLRPSYFLALHDCQLVDDFPVEKLL

Query:  VHDVPVDIVCTPTQVILTNTKIPKPQGLLLHAKFPLENKFDPSGIYWEMLSPEKLSQVRILRELKRRIERETGQPLPCGPSEKLPPTAQRSSKPTRRASS
        VHDVPVDIVCTPTQVILTNTKIPKPQ                 GIYWEMLSPEKLSQVR+LRELKRRIERETG+PLPCGPSEKLPPTAQRSSKP RRASS
Subjt:  VHDVPVDIVCTPTQVILTNTKIPKPQGLLLHAKFPLENKFDPSGIYWEMLSPEKLSQVRILRELKRRIERETGQPLPCGPSEKLPPTAQRSSKPTRRASS

Query:  KN
        KN
Subjt:  KN

SwissProt top hitse value%identityAlignment
Q0P464 Methenyltetrahydrofolate synthase domain-containing protein3.8e-5642.55Show/hide
Query:  KWVIRKRIWDLMESQNVAANPRPVHHRIPNFVGAMEAANRLCDLEVFRDAQCVKVNPDSPQKGVRLLTLTGGKKLLTPQPRLRTGFFS-IVESGMLTPAT
        KW +R ++W+ +E +N+A  PRPVH+RIPNF GA+EA N++  LE+F ++  VKV+PD P +GVRL  L   K LL P PRLR G F+ I      T  T
Subjt:  KWVIRKRIWDLMESQNVAANPRPVHHRIPNFVGAMEAANRLCDLEVFRDAQCVKVNPDSPQKGVRLLTLTGGKKLLTPQPRLRTGFFS-IVESGMLTPAT

Query:  INEACTSVGVAKYGKPIGLDEKIKVDLIVIGSVAVDPRTGARLGKGEGFAELEYGMLRYMGAIDDSTLIVTSGCVTSLRPSYFLALHDCQLVDDFPVEKL
        +    TS G+ ++  P+GLD+K++VDL+V+GSVAV  + G R+GKGEGFA++EY M+  MG++ +ST ++T              +HDCQ++ D P E +
Subjt:  INEACTSVGVAKYGKPIGLDEKIKVDLIVIGSVAVDPRTGARLGKGEGFAELEYGMLRYMGAIDDSTLIVTSGCVTSLRPSYFLALHDCQLVDDFPVEKL

Query:  LVHDVPVDIVCTPTQVILTNTKIPKPQGLLLHAKFPLENKFDPSGIYWEMLSPEKLSQVRILRELKRRIERETGQ
          HD+ VD + T T+VI T  K PKPQG++                 W ML  E+L ++ IL++L R +E+E G+
Subjt:  LVHDVPVDIVCTPTQVILTNTKIPKPQGLLLHAKFPLENKFDPSGIYWEMLSPEKLSQVRILRELKRRIERETGQ

Q2KI24 Methenyltetrahydrofolate synthase domain-containing protein5.6e-5543.64Show/hide
Query:  IRKRIWDLMESQNVAANPRPVHHRIPNFVGAMEAANRLCDLEVFRDAQCVKVNPDSPQKGVRLLTLTGGKKLLTPQPRLRTGFFS-IVESGMLTPATINE
        IR++IWD MESQN+A  PRPVHHRIPNF GA  AA     L+ F+ A+ +KVNPD+PQK  R   L   K LL P PRLRTG F+ I      T   + +
Subjt:  IRKRIWDLMESQNVAANPRPVHHRIPNFVGAMEAANRLCDLEVFRDAQCVKVNPDSPQKGVRLLTLTGGKKLLTPQPRLRTGFFS-IVESGMLTPATINE

Query:  ACTSVGVAKYGKPIGLDEKIKVDLIVIGSVAVDPRTGARLGKGEGFAELEYGMLRYMGAIDDSTLIVTSGCVTSLRPSYFLALHDCQLVDDFPVEKLLVH
          TS GV  Y  P+GLD K+ VDL+V+GSVAV  + G R+GKGEG+A+LEY M+  MGA+   T +VT              +HDCQ+V D P   L  H
Subjt:  ACTSVGVAKYGKPIGLDEKIKVDLIVIGSVAVDPRTGARLGKGEGFAELEYGMLRYMGAIDDSTLIVTSGCVTSLRPSYFLALHDCQLVDDFPVEKLLVH

Query:  DVPVDIVCTPTQVILTNTKIPKPQGLLLHAKFPLENKFDPSGIYWEMLSPEKLSQVRILRELKRRIERETGQPLPCGPSEKLPPTAQRSSK
        D+ VD + TPT+VI T  + PK                 P+GI W  +S E L ++ ILR L+ + E + G+ +      + PP A RS +
Subjt:  DVPVDIVCTPTQVILTNTKIPKPQGLLLHAKFPLENKFDPSGIYWEMLSPEKLSQVRILRELKRRIERETGQPLPCGPSEKLPPTAQRSSK

Q2M296 Methenyltetrahydrofolate synthase domain-containing protein4.0e-5343.69Show/hide
Query:  IRKRIWDLMESQNVAANPRPVHHRIPNFVGAMEAANRLCDLEVFRDAQCVKVNPDSPQKGVRLLTLTGGKKLLTPQPRLRTGFFS-IVESGMLTPATINE
        IR++IW  MESQN+A  PRPVHHRIPNF G+  A   + DL+VF   Q VKV+PD P +GVRLL L   K LL P PRLRTG F+ I      T   + +
Subjt:  IRKRIWDLMESQNVAANPRPVHHRIPNFVGAMEAANRLCDLEVFRDAQCVKVNPDSPQKGVRLLTLTGGKKLLTPQPRLRTGFFS-IVESGMLTPATINE

Query:  ACTSVGVAKYGKPIGLDEKIKVDLIVIGSVAVDPRTGARLGKGEGFAELEYGMLRYMGAIDDSTLIVTSGCVTSLRPSYFLALHDCQLVDDFPVEKLLVH
          TS GV  Y  PIGLD ++ VDL+V+GSVAV  + G R+GKGEG+A+LEY M+  MGA+   T +VT              +HDCQ+V D P E +  H
Subjt:  ACTSVGVAKYGKPIGLDEKIKVDLIVIGSVAVDPRTGARLGKGEGFAELEYGMLRYMGAIDDSTLIVTSGCVTSLRPSYFLALHDCQLVDDFPVEKLLVH

Query:  DVPVDIVCTPTQVILTNTKIPKPQGLLLHAKFPLENKFDPSGIYWEMLSPEKLSQVRILRELKRRIERETGQPLPC-GPSEKLPPTAQRSSKP
        D+ VD + TPT+VI T  K PKP                  GI W  +S E + ++ ILR L+ R E++ G+ +   G  + LP    + + P
Subjt:  DVPVDIVCTPTQVILTNTKIPKPQGLLLHAKFPLENKFDPSGIYWEMLSPEKLSQVRILRELKRRIERETGQPLPC-GPSEKLPPTAQRSSKP

Q52L34 Methenyltetrahydrofolate synthase domain-containing protein2.5e-5543.93Show/hide
Query:  DPKAWKWVIRKRIWDLMESQNVAANPRPVHHRIPNFVGAMEAANRLCDLEVFRDAQCVKVNPDSPQKGVRLLTLTGGKKLLTPQPRLRTGFFSIVESGML
        DP   KW IR+++WD +E+ N+A  PRPVHHRIPNF  + +A   + DLEVFR    VKV+PD P +GVRL  L   K LL P PRLRTG F+ +     
Subjt:  DPKAWKWVIRKRIWDLMESQNVAANPRPVHHRIPNFVGAMEAANRLCDLEVFRDAQCVKVNPDSPQKGVRLLTLTGGKKLLTPQPRLRTGFFSIVESGML

Query:  TPATINEAC-TSVGVAKYGKPIGLDEKIKVDLIVIGSVAVDPRTGARLGKGEGFAELEYGMLRYMGAIDDSTLIVTSGCVTSLRPSYFLALHDCQLVDDF
            +   C TS GV  Y  P+GLD K++VDL+V+GSVAV  + G R+GKGEGFA++EY M+  MGA+ + T++VT              +HDCQ+V D 
Subjt:  TPATINEAC-TSVGVAKYGKPIGLDEKIKVDLIVIGSVAVDPRTGARLGKGEGFAELEYGMLRYMGAIDDSTLIVTSGCVTSLRPSYFLALHDCQLVDDF

Query:  PVEKLLVHDVPVDIVCTPTQVILTNTKIPKPQGLLLHAKFPLENKFDPSGIYWEMLSPEKLSQVRILRELKRRIERETGQ
        P E L  HD+ VD + TPT++I T+ K  KPQG++                 W M++ E + ++ ILR L+ R ER  G+
Subjt:  PVEKLLVHDVPVDIVCTPTQVILTNTKIPKPQGLLLHAKFPLENKFDPSGIYWEMLSPEKLSQVRILRELKRRIERETGQ

Q9SRE0 5-formyltetrahydrofolate cyclo-ligase-like protein COG02126.2e-12361.74Show/hide
Query:  HPLSTRSTFLNVIRKSSSASSISIGRNSQFQRYFKLESSRARGDVAFDEAAFEADRSRLDAEAEKSMAEASIRATEGAPADDPKAWKWVIRKRIWDLMES
        +P     +F+N    S     +S+   SQ +   ++ S    G VAFD  A+EADR  LDA A + MAE + +  E  P  DPKAWKWVIRK++WDLME+
Subjt:  HPLSTRSTFLNVIRKSSSASSISIGRNSQFQRYFKLESSRARGDVAFDEAAFEADRSRLDAEAEKSMAEASIRATEGAPADDPKAWKWVIRKRIWDLMES

Query:  QNVAANPRPVHHRIPNFVGAMEAANRLCDLEVFRDAQCVKVNPDSPQKGVRLLTLTGGKKLLTPQPRLRTGFFSIVESGMLTPATINEACTSVGVAKYGK
        +N A +PRPVHHRIPNFVGA  AA +L +L+ FR A  VKVNPDSPQK +R LTL+G KKLLTPQPRLRTGFFS++ES +L P TI EACTSVGVAKYG+
Subjt:  QNVAANPRPVHHRIPNFVGAMEAANRLCDLEVFRDAQCVKVNPDSPQKGVRLLTLTGGKKLLTPQPRLRTGFFSIVESGMLTPATINEACTSVGVAKYGK

Query:  PIGLDEKIKVDLIVIGSVAVDPRTGARLGKGEGFAELEYGMLRYMGAIDDSTLIVTSGCVTSLRPSYFLALHDCQLVDDFPVEKLLVHDVPVDIVCTPTQ
         IGLDEKIKVDLIVIGSVAV+P+TGARLGKGEGFAELEYGMLRYMGAIDDST +VT+             +HDCQLVDD P+EKL +HDVPVDI+CTPT+
Subjt:  PIGLDEKIKVDLIVIGSVAVDPRTGARLGKGEGFAELEYGMLRYMGAIDDSTLIVTSGCVTSLRPSYFLALHDCQLVDDFPVEKLLVHDVPVDIVCTPTQ

Query:  VILTNTKIPKPQGLLLHAKFPLENKFDPSGIYWEMLSPEKLSQVRILRELKRRIERETGQPLPCGPSEKLPPTAQRSSK
        VI TNT IPKPQ                 GIYW+ LSPEKL Q+RILRELK R+E++TG+ LP GPSEKLPPTA+R  +
Subjt:  VILTNTKIPKPQGLLLHAKFPLENKFDPSGIYWEMLSPEKLSQVRILRELKRRIERETGQPLPCGPSEKLPPTAQRSSK

Arabidopsis top hitse value%identityAlignment
AT1G76730.1 NagB/RpiA/CoA transferase-like superfamily protein4.4e-12461.74Show/hide
Query:  HPLSTRSTFLNVIRKSSSASSISIGRNSQFQRYFKLESSRARGDVAFDEAAFEADRSRLDAEAEKSMAEASIRATEGAPADDPKAWKWVIRKRIWDLMES
        +P     +F+N    S     +S+   SQ +   ++ S    G VAFD  A+EADR  LDA A + MAE + +  E  P  DPKAWKWVIRK++WDLME+
Subjt:  HPLSTRSTFLNVIRKSSSASSISIGRNSQFQRYFKLESSRARGDVAFDEAAFEADRSRLDAEAEKSMAEASIRATEGAPADDPKAWKWVIRKRIWDLMES

Query:  QNVAANPRPVHHRIPNFVGAMEAANRLCDLEVFRDAQCVKVNPDSPQKGVRLLTLTGGKKLLTPQPRLRTGFFSIVESGMLTPATINEACTSVGVAKYGK
        +N A +PRPVHHRIPNFVGA  AA +L +L+ FR A  VKVNPDSPQK +R LTL+G KKLLTPQPRLRTGFFS++ES +L P TI EACTSVGVAKYG+
Subjt:  QNVAANPRPVHHRIPNFVGAMEAANRLCDLEVFRDAQCVKVNPDSPQKGVRLLTLTGGKKLLTPQPRLRTGFFSIVESGMLTPATINEACTSVGVAKYGK

Query:  PIGLDEKIKVDLIVIGSVAVDPRTGARLGKGEGFAELEYGMLRYMGAIDDSTLIVTSGCVTSLRPSYFLALHDCQLVDDFPVEKLLVHDVPVDIVCTPTQ
         IGLDEKIKVDLIVIGSVAV+P+TGARLGKGEGFAELEYGMLRYMGAIDDST +VT+             +HDCQLVDD P+EKL +HDVPVDI+CTPT+
Subjt:  PIGLDEKIKVDLIVIGSVAVDPRTGARLGKGEGFAELEYGMLRYMGAIDDSTLIVTSGCVTSLRPSYFLALHDCQLVDDFPVEKLLVHDVPVDIVCTPTQ

Query:  VILTNTKIPKPQGLLLHAKFPLENKFDPSGIYWEMLSPEKLSQVRILRELKRRIERETGQPLPCGPSEKLPPTAQRSSK
        VI TNT IPKPQ                 GIYW+ LSPEKL Q+RILRELK R+E++TG+ LP GPSEKLPPTA+R  +
Subjt:  VILTNTKIPKPQGLLLHAKFPLENKFDPSGIYWEMLSPEKLSQVRILRELKRRIERETGQPLPCGPSEKLPPTAQRSSK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATTCCAGTCTCCTGCAATTATCCCACCCGCTATCAACTAGATCCACATTTCTCAACGTAATTCGCAAATCTTCTTCTGCATCCTCAATCTCAATCGGAAGAAATTC
GCAGTTCCAAAGATACTTCAAGCTGGAGAGTAGCAGAGCCAGAGGCGACGTCGCTTTCGACGAAGCTGCTTTCGAGGCCGACAGGTCTCGCCTCGACGCCGAAGCTGAGA
AATCCATGGCCGAAGCTTCAATCAGGGCCACGGAAGGCGCTCCGGCCGACGACCCGAAAGCATGGAAGTGGGTGATCCGGAAACGGATTTGGGATTTGATGGAATCGCAA
AACGTTGCGGCGAATCCCCGGCCTGTTCACCACCGGATCCCTAACTTCGTCGGCGCCATGGAAGCTGCCAATAGATTGTGCGATTTGGAAGTATTTAGGGATGCACAGTG
CGTCAAAGTTAATCCGGATTCGCCTCAGAAGGGCGTGAGGCTTCTTACACTTACGGGTGGCAAAAAACTATTAACACCTCAGCCGCGGTTGAGAACAGGGTTTTTCTCCA
TAGTCGAATCCGGAATGTTGACTCCTGCTACCATCAACGAAGCCTGCACTTCTGTTGGGGTAGCGAAGTACGGAAAGCCAATTGGACTGGATGAAAAGATCAAAGTTGAT
CTGATTGTCATCGGCTCCGTTGCTGTTGACCCCCGAACAGGTGCTCGGCTTGGCAAGGGAGAGGGATTTGCAGAACTTGAATACGGAATGCTTCGGTACATGGGAGCCAT
TGACGATTCAACTCTGATTGTTACTTCTGGTTGTGTAACTTCTTTGCGTCCATCTTATTTTCTAGCTCTGCACGATTGTCAGTTGGTTGATGATTTCCCAGTCGAGAAGC
TATTAGTCCACGACGTGCCAGTAGACATTGTATGCACTCCGACGCAGGTCATTCTAACCAACACAAAAATCCCCAAACCCCAAGGTTTGCTTCTCCATGCCAAATTCCCC
CTAGAAAACAAGTTTGATCCTTCTGGTATTTACTGGGAAATGCTGTCTCCTGAGAAGCTGAGTCAAGTTCGAATACTCAGAGAGCTCAAACGGCGGATTGAACGGGAGAC
CGGCCAACCGCTGCCTTGTGGTCCGTCGGAGAAACTACCACCTACAGCTCAACGGAGTTCAAAACCCACAAGACGTGCATCCTCCAAGAAC
mRNA sequenceShow/hide mRNA sequence
ATGGATTCCAGTCTCCTGCAATTATCCCACCCGCTATCAACTAGATCCACATTTCTCAACGTAATTCGCAAATCTTCTTCTGCATCCTCAATCTCAATCGGAAGAAATTC
GCAGTTCCAAAGATACTTCAAGCTGGAGAGTAGCAGAGCCAGAGGCGACGTCGCTTTCGACGAAGCTGCTTTCGAGGCCGACAGGTCTCGCCTCGACGCCGAAGCTGAGA
AATCCATGGCCGAAGCTTCAATCAGGGCCACGGAAGGCGCTCCGGCCGACGACCCGAAAGCATGGAAGTGGGTGATCCGGAAACGGATTTGGGATTTGATGGAATCGCAA
AACGTTGCGGCGAATCCCCGGCCTGTTCACCACCGGATCCCTAACTTCGTCGGCGCCATGGAAGCTGCCAATAGATTGTGCGATTTGGAAGTATTTAGGGATGCACAGTG
CGTCAAAGTTAATCCGGATTCGCCTCAGAAGGGCGTGAGGCTTCTTACACTTACGGGTGGCAAAAAACTATTAACACCTCAGCCGCGGTTGAGAACAGGGTTTTTCTCCA
TAGTCGAATCCGGAATGTTGACTCCTGCTACCATCAACGAAGCCTGCACTTCTGTTGGGGTAGCGAAGTACGGAAAGCCAATTGGACTGGATGAAAAGATCAAAGTTGAT
CTGATTGTCATCGGCTCCGTTGCTGTTGACCCCCGAACAGGTGCTCGGCTTGGCAAGGGAGAGGGATTTGCAGAACTTGAATACGGAATGCTTCGGTACATGGGAGCCAT
TGACGATTCAACTCTGATTGTTACTTCTGGTTGTGTAACTTCTTTGCGTCCATCTTATTTTCTAGCTCTGCACGATTGTCAGTTGGTTGATGATTTCCCAGTCGAGAAGC
TATTAGTCCACGACGTGCCAGTAGACATTGTATGCACTCCGACGCAGGTCATTCTAACCAACACAAAAATCCCCAAACCCCAAGGTTTGCTTCTCCATGCCAAATTCCCC
CTAGAAAACAAGTTTGATCCTTCTGGTATTTACTGGGAAATGCTGTCTCCTGAGAAGCTGAGTCAAGTTCGAATACTCAGAGAGCTCAAACGGCGGATTGAACGGGAGAC
CGGCCAACCGCTGCCTTGTGGTCCGTCGGAGAAACTACCACCTACAGCTCAACGGAGTTCAAAACCCACAAGACGTGCATCCTCCAAGAAC
Protein sequenceShow/hide protein sequence
MDSSLLQLSHPLSTRSTFLNVIRKSSSASSISIGRNSQFQRYFKLESSRARGDVAFDEAAFEADRSRLDAEAEKSMAEASIRATEGAPADDPKAWKWVIRKRIWDLMESQ
NVAANPRPVHHRIPNFVGAMEAANRLCDLEVFRDAQCVKVNPDSPQKGVRLLTLTGGKKLLTPQPRLRTGFFSIVESGMLTPATINEACTSVGVAKYGKPIGLDEKIKVD
LIVIGSVAVDPRTGARLGKGEGFAELEYGMLRYMGAIDDSTLIVTSGCVTSLRPSYFLALHDCQLVDDFPVEKLLVHDVPVDIVCTPTQVILTNTKIPKPQGLLLHAKFP
LENKFDPSGIYWEMLSPEKLSQVRILRELKRRIERETGQPLPCGPSEKLPPTAQRSSKPTRRASSKN