; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc01g20540 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc01g20540
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionTIP41-like protein
Genome locationchr1:14369362..14370999
RNA-Seq ExpressionMoc01g20540
SyntenyMoc01g20540
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7027367.1 hypothetical protein SDJN02_11379 [Cucurbita argyrosperma subsp. argyrosperma]3.4e-16976.6Show/hide
Query:  MAESLDDGEFWLPPKFLNDDDLFLEDKCGGNDVKNGRD--GVSSYPFEFPLGFGPFGVTSDLGSPVESLIGSSETESDEEEYIAGLTHQMARSTLEDGFG
        MAESLDDGEFWLPPKFLNDDDLFLE KC GND K  RD  GV  +PFE+ LGFGPFGV+SDLGSPVESLIGSSETESDE+EYIAGL  Q+ARSTLEDGFG
Subjt:  MAESLDDGEFWLPPKFLNDDDLFLEDKCGGNDVKNGRD--GVSSYPFEFPLGFGPFGVTSDLGSPVESLIGSSETESDEEEYIAGLTHQMARSTLEDGFG

Query:  LDNSHGWGSSGSPQSTLCAVGSGCGCKQGCSRGSPNGHYSQAASQPQLTLDLLYAAAGEVSKMRLNEEAYGLINNRGPLPPPRKPSPVSVPVKNREPDAG
        LD SHGW SSGSPQSTLC VG+GCGCKQ  SRGSPN H     S PQLTLDLLYAAAGEVSKMR+NEEAY  +NN G   PPRKPSPV+VP+KNR+ DAG
Subjt:  LDNSHGWGSSGSPQSTLCAVGSGCGCKQGCSRGSPNGHYSQAASQPQLTLDLLYAAAGEVSKMRLNEEAYGLINNRGPLPPPRKPSPVSVPVKNREPDAG

Query:  VYQQLQASQFLHLRRQQLMEQLNSAAVAAAAARVGQSKGCSVRNLQHEHQHQQGPQMPQNRGRNSDFFSGRNCRPASGLPSPPTWAAPRKHAVNPPPNGS
        VYQQLQASQFLHL+RQQL+EQ+ S      AARVG   G SVR+ Q        PQ+PQNRGRNS+FF GRNCR A GL S P WA PRKH+VNP PNGS
Subjt:  VYQQLQASQFLHLRRQQLMEQLNSAAVAAAAARVGQSKGCSVRNLQHEHQHQQGPQMPQNRGRNSDFFSGRNCRPASGLPSPPTWAAPRKHAVNPPPNGS

Query:  GMRAVFLGVPGGKRECAGTGVFLPRQVGAVSESRKKPACSTVLVPARVMQALNLNLDDMYVQRIQPQPPLQSRSPPVFNAGKNDVCVRVRSECLGSQQKG
        GMRAVFLGVPGGKRECAGTGVFLPRQ+GAVSE+RKKPACSTVLVPARVMQALNLNLDDMYVQR QPQ  LQSRSP VFN GKND+  R RSE L +QQK 
Subjt:  GMRAVFLGVPGGKRECAGTGVFLPRQVGAVSESRKKPACSTVLVPARVMQALNLNLDDMYVQRIQPQPPLQSRSPPVFNAGKNDVCVRVRSECLGSQQKG

Query:  -NLRAAVPVVNHDI-RLPQEWTY
         NLRAAV  VN +I RLPQEW+Y
Subjt:  -NLRAAVPVVNHDI-RLPQEWTY

XP_004147909.1 uncharacterized protein LOC101214270 [Cucumis sativus]2.1e-17979.67Show/hide
Query:  MAESLDDGEFWLPPKFLNDDDLFLEDKCGGNDVKNGRDGVSSYPFEFPLGFGPFGVTSDLGSPVESLIGSSETESDEEEYIAGLTHQMARSTLEDGFGLD
        MAESLDDGEFWLPPKFLNDDDLF+E+KCGGND+K+GR+GV  YP      FG FG TSDLGSPVESL+GSSETESDEEEYIAGLTH+M RSTLEDGFGLD
Subjt:  MAESLDDGEFWLPPKFLNDDDLFLEDKCGGNDVKNGRDGVSSYPFEFPLGFGPFGVTSDLGSPVESLIGSSETESDEEEYIAGLTHQMARSTLEDGFGLD

Query:  NSHGWGSSGSPQSTLCAVGSGCGCKQGCSRGSPNGHYSQAASQPQLTLDLLYAAAGEVSKMRLNEEAYGLINNRGPLPPPRKPSPVSVPVKNREPDAGVY
        NSH WGSSGSPQSTLCA+GSGCGCKQ  SRGSPNGHY   AS PQLTLDLLYAAAGEVSKMR+NEEAYG IN+ GPL PPRKPSPVSVP+KNREPD  VY
Subjt:  NSHGWGSSGSPQSTLCAVGSGCGCKQGCSRGSPNGHYSQAASQPQLTLDLLYAAAGEVSKMRLNEEAYGLINNRGPLPPPRKPSPVSVPVKNREPDAGVY

Query:  QQLQASQFLHLRRQQLMEQLNSAAVAAAAARVGQSKGCSVRNLQHEHQHQQGPQMPQNRGRNSDFFSGRNCRPA-SGLPSPPTWAAP--RKHAVNPPPNG
        QQLQASQFLHLRRQQL+EQ+NS      AARVGQ+KG +VR        Q  PQMPQNRGRN++FF+GRNCR A +GLPS PTW AP  ++H VNPP NG
Subjt:  QQLQASQFLHLRRQQLMEQLNSAAVAAAAARVGQSKGCSVRNLQHEHQHQQGPQMPQNRGRNSDFFSGRNCRPA-SGLPSPPTWAAP--RKHAVNPPPNG

Query:  SGMRAVFLGVPGGKRECAGTGVFLPRQVG-AVSESRKKPACSTVLVPARVMQALNLNLDDMYVQRIQPQPPLQSRSPPVFNAGKNDVCVRVRSECLGSQQ
        SGMRAVFLG PGGKRECAGTGVFLPRQ G A+SE+RKKPACSTVLVPARVMQALNLNLDDMYVQR+ P P LQSRSPPVFNAGKNDV VR RSE L  QQ
Subjt:  SGMRAVFLGVPGGKRECAGTGVFLPRQVG-AVSESRKKPACSTVLVPARVMQALNLNLDDMYVQRIQPQPPLQSRSPPVFNAGKNDVCVRVRSECLGSQQ

Query:  KGNLRAAVPVVNHDIRLPQEWTY
        KGNLRAAVP VNH+I LPQEWTY
Subjt:  KGNLRAAVPVVNHDIRLPQEWTY

XP_008448729.1 PREDICTED: uncharacterized protein LOC103490808 [Cucumis melo]8.1e-17979.86Show/hide
Query:  MAESLDDGEFWLPPKFLNDDDLFLEDKCGGNDVKNGRDGVSSYPFEFPLGFGPFGVTSDLGSPVESLIGSSETESDEEEYIAGLTHQMARSTLEDGFGLD
        MAESLDDGEFWLPPKFLNDDDLF+E+KC GND+KNGR+GV  YP      FG FG TSDLGSPVESL+GSSETESDEEEYIAGLTH++ RSTLEDGFGLD
Subjt:  MAESLDDGEFWLPPKFLNDDDLFLEDKCGGNDVKNGRDGVSSYPFEFPLGFGPFGVTSDLGSPVESLIGSSETESDEEEYIAGLTHQMARSTLEDGFGLD

Query:  NSHGWGSSGSPQSTLCAVGSGCGCKQGCSRGSPNGHYSQAASQPQLTLDLLYAAAGEVSKMRLNEEAYGLINNRGPLPPPRKPSPVSVPVKNREPDAGVY
        NSH WGSSGSPQSTLCA+GSGCGCKQG SRGSPNGHY   AS PQLTLDLLYAAAGEVSKMR+NEE YG IN+ GPL PPRKPSPVSVP+KNREPDA VY
Subjt:  NSHGWGSSGSPQSTLCAVGSGCGCKQGCSRGSPNGHYSQAASQPQLTLDLLYAAAGEVSKMRLNEEAYGLINNRGPLPPPRKPSPVSVPVKNREPDAGVY

Query:  QQLQASQFLHLRRQQLMEQLNSAAVAAAAARVGQSKGCSVRNLQHEHQHQQGPQMPQNRGRNSDFFSGRNCRPA-SGLPSPPTWAA-PRKHAVNPPPNGS
        QQLQASQFLHLRRQQL+EQ+NS      A RVGQ+KG SVR+ Q        PQM QNRGRN++FF+GRNCR A +GLPS PTWAA PRKH VNPPPNGS
Subjt:  QQLQASQFLHLRRQQLMEQLNSAAVAAAAARVGQSKGCSVRNLQHEHQHQQGPQMPQNRGRNSDFFSGRNCRPA-SGLPSPPTWAA-PRKHAVNPPPNGS

Query:  GMRAVFLGVPGGKRECAGTGVFLPRQV-GAVSESRKKPACSTVLVPARVMQALNLNLDDMYVQRIQPQPPLQSRSPPVFNAGKNDVCVRVRSECLGSQQK
        GMRAVFLG PGGKRECAGTGVFLPRQ  G V+E+RKKPACSTVLVPARVMQALNLNLDDMYVQRIQP P LQSRSPPV+ AGKNDV VR +SE L  QQK
Subjt:  GMRAVFLGVPGGKRECAGTGVFLPRQV-GAVSESRKKPACSTVLVPARVMQALNLNLDDMYVQRIQPQPPLQSRSPPVFNAGKNDVCVRVRSECLGSQQK

Query:  GNLRAAVPVVNHDIRLPQEWTY
        GNLR AVP VNH+I LPQEWTY
Subjt:  GNLRAAVPVVNHDIRLPQEWTY

XP_022151566.1 uncharacterized protein LOC111019479 [Momordica charantia]2.7e-243100Show/hide
Query:  MAESLDDGEFWLPPKFLNDDDLFLEDKCGGNDVKNGRDGVSSYPFEFPLGFGPFGVTSDLGSPVESLIGSSETESDEEEYIAGLTHQMARSTLEDGFGLD
        MAESLDDGEFWLPPKFLNDDDLFLEDKCGGNDVKNGRDGVSSYPFEFPLGFGPFGVTSDLGSPVESLIGSSETESDEEEYIAGLTHQMARSTLEDGFGLD
Subjt:  MAESLDDGEFWLPPKFLNDDDLFLEDKCGGNDVKNGRDGVSSYPFEFPLGFGPFGVTSDLGSPVESLIGSSETESDEEEYIAGLTHQMARSTLEDGFGLD

Query:  NSHGWGSSGSPQSTLCAVGSGCGCKQGCSRGSPNGHYSQAASQPQLTLDLLYAAAGEVSKMRLNEEAYGLINNRGPLPPPRKPSPVSVPVKNREPDAGVY
        NSHGWGSSGSPQSTLCAVGSGCGCKQGCSRGSPNGHYSQAASQPQLTLDLLYAAAGEVSKMRLNEEAYGLINNRGPLPPPRKPSPVSVPVKNREPDAGVY
Subjt:  NSHGWGSSGSPQSTLCAVGSGCGCKQGCSRGSPNGHYSQAASQPQLTLDLLYAAAGEVSKMRLNEEAYGLINNRGPLPPPRKPSPVSVPVKNREPDAGVY

Query:  QQLQASQFLHLRRQQLMEQLNSAAVAAAAARVGQSKGCSVRNLQHEHQHQQGPQMPQNRGRNSDFFSGRNCRPASGLPSPPTWAAPRKHAVNPPPNGSGM
        QQLQASQFLHLRRQQLMEQLNSAAVAAAAARVGQSKGCSVRNLQHEHQHQQGPQMPQNRGRNSDFFSGRNCRPASGLPSPPTWAAPRKHAVNPPPNGSGM
Subjt:  QQLQASQFLHLRRQQLMEQLNSAAVAAAAARVGQSKGCSVRNLQHEHQHQQGPQMPQNRGRNSDFFSGRNCRPASGLPSPPTWAAPRKHAVNPPPNGSGM

Query:  RAVFLGVPGGKRECAGTGVFLPRQVGAVSESRKKPACSTVLVPARVMQALNLNLDDMYVQRIQPQPPLQSRSPPVFNAGKNDVCVRVRSECLGSQQKGNL
        RAVFLGVPGGKRECAGTGVFLPRQVGAVSESRKKPACSTVLVPARVMQALNLNLDDMYVQRIQPQPPLQSRSPPVFNAGKNDVCVRVRSECLGSQQKGNL
Subjt:  RAVFLGVPGGKRECAGTGVFLPRQVGAVSESRKKPACSTVLVPARVMQALNLNLDDMYVQRIQPQPPLQSRSPPVFNAGKNDVCVRVRSECLGSQQKGNL

Query:  RAAVPVVNHDIRLPQEWTY
        RAAVPVVNHDIRLPQEWTY
Subjt:  RAAVPVVNHDIRLPQEWTY

XP_038883347.1 uncharacterized protein LOC120074329 [Benincasa hispida]6.6e-18181.28Show/hide
Query:  MAESLDDGEFWLPPKFLNDDDLFLEDKCGGNDVKNGRDGVSSYPFEFPLGFGPFGVTSDLGSPVESLIGSSETESDEEEYIAGLTHQMARSTLEDGFGLD
        MAESLDDGEFWLPPKFLNDDDLFLE+KCGGNDVKNGR GV  YP      FGPFG  SDLGSPVESL+GSSETESDEEEYIAGLTHQM RSTLEDGFGLD
Subjt:  MAESLDDGEFWLPPKFLNDDDLFLEDKCGGNDVKNGRDGVSSYPFEFPLGFGPFGVTSDLGSPVESLIGSSETESDEEEYIAGLTHQMARSTLEDGFGLD

Query:  NSHGWGSSGSPQSTLCAVGSGCGCKQGCSRGSPNGHYSQAASQPQLTLDLLYAAAGEVSKMRLNEEAYGLINNRGPLPPPRKPSPVSVPVKNREPDAGVY
        NSH WGSSGSPQSTLCA+GSGCGCKQG SRGSPNGHY   AS PQLTLDLL+AAAGEVSKMR+NEEAYG IN+RGPL PPRKPSPVSVP+KNREP+A VY
Subjt:  NSHGWGSSGSPQSTLCAVGSGCGCKQGCSRGSPNGHYSQAASQPQLTLDLLYAAAGEVSKMRLNEEAYGLINNRGPLPPPRKPSPVSVPVKNREPDAGVY

Query:  QQLQASQFLHLRRQQLMEQLNSAAVAAAAARVGQSKGCSVRNLQHEHQHQQGPQMPQNRGRNSDFFSGRNCRPA-SGLPSPPTWAA-PRKHAVNPPPNGS
        QQLQASQFLHLRRQQL+EQ+NS       ARV Q+KG SVR+      HQ  PQM QNRGRNS+FF+GRNCR A +GL S PTWAA PRKH VNPPPNGS
Subjt:  QQLQASQFLHLRRQQLMEQLNSAAVAAAAARVGQSKGCSVRNLQHEHQHQQGPQMPQNRGRNSDFFSGRNCRPA-SGLPSPPTWAA-PRKHAVNPPPNGS

Query:  GMRAVFLGVPGGKRECAGTGVFLPRQV-GAVSESRKKPACSTVLVPARVMQALNLNLDDMYVQRIQPQPPLQSRSPPVFNAGKNDVCVRVRSECLGSQQK
        GMRAVFLG PGGKRECAGTGVFLPRQ  G VSE RKKPACSTVLVPARVMQALNLNLDDMYVQRIQPQ  LQ+RSP  FNAGKNDV VR+RSE L SQ K
Subjt:  GMRAVFLGVPGGKRECAGTGVFLPRQV-GAVSESRKKPACSTVLVPARVMQALNLNLDDMYVQRIQPQPPLQSRSPPVFNAGKNDVCVRVRSECLGSQQK

Query:  GNLRAAVPVVNHDIRLPQEWTY
         NLR AVP VNHDI LPQEWTY
Subjt:  GNLRAAVPVVNHDIRLPQEWTY

TrEMBL top hitse value%identityAlignment
A0A0A0L2G7 Uncharacterized protein1.0e-17979.67Show/hide
Query:  MAESLDDGEFWLPPKFLNDDDLFLEDKCGGNDVKNGRDGVSSYPFEFPLGFGPFGVTSDLGSPVESLIGSSETESDEEEYIAGLTHQMARSTLEDGFGLD
        MAESLDDGEFWLPPKFLNDDDLF+E+KCGGND+K+GR+GV  YP      FG FG TSDLGSPVESL+GSSETESDEEEYIAGLTH+M RSTLEDGFGLD
Subjt:  MAESLDDGEFWLPPKFLNDDDLFLEDKCGGNDVKNGRDGVSSYPFEFPLGFGPFGVTSDLGSPVESLIGSSETESDEEEYIAGLTHQMARSTLEDGFGLD

Query:  NSHGWGSSGSPQSTLCAVGSGCGCKQGCSRGSPNGHYSQAASQPQLTLDLLYAAAGEVSKMRLNEEAYGLINNRGPLPPPRKPSPVSVPVKNREPDAGVY
        NSH WGSSGSPQSTLCA+GSGCGCKQ  SRGSPNGHY   AS PQLTLDLLYAAAGEVSKMR+NEEAYG IN+ GPL PPRKPSPVSVP+KNREPD  VY
Subjt:  NSHGWGSSGSPQSTLCAVGSGCGCKQGCSRGSPNGHYSQAASQPQLTLDLLYAAAGEVSKMRLNEEAYGLINNRGPLPPPRKPSPVSVPVKNREPDAGVY

Query:  QQLQASQFLHLRRQQLMEQLNSAAVAAAAARVGQSKGCSVRNLQHEHQHQQGPQMPQNRGRNSDFFSGRNCRPA-SGLPSPPTWAAP--RKHAVNPPPNG
        QQLQASQFLHLRRQQL+EQ+NS      AARVGQ+KG +VR        Q  PQMPQNRGRN++FF+GRNCR A +GLPS PTW AP  ++H VNPP NG
Subjt:  QQLQASQFLHLRRQQLMEQLNSAAVAAAAARVGQSKGCSVRNLQHEHQHQQGPQMPQNRGRNSDFFSGRNCRPA-SGLPSPPTWAAP--RKHAVNPPPNG

Query:  SGMRAVFLGVPGGKRECAGTGVFLPRQVG-AVSESRKKPACSTVLVPARVMQALNLNLDDMYVQRIQPQPPLQSRSPPVFNAGKNDVCVRVRSECLGSQQ
        SGMRAVFLG PGGKRECAGTGVFLPRQ G A+SE+RKKPACSTVLVPARVMQALNLNLDDMYVQR+ P P LQSRSPPVFNAGKNDV VR RSE L  QQ
Subjt:  SGMRAVFLGVPGGKRECAGTGVFLPRQVG-AVSESRKKPACSTVLVPARVMQALNLNLDDMYVQRIQPQPPLQSRSPPVFNAGKNDVCVRVRSECLGSQQ

Query:  KGNLRAAVPVVNHDIRLPQEWTY
        KGNLRAAVP VNH+I LPQEWTY
Subjt:  KGNLRAAVPVVNHDIRLPQEWTY

A0A1S3BKD4 uncharacterized protein LOC1034908083.9e-17979.86Show/hide
Query:  MAESLDDGEFWLPPKFLNDDDLFLEDKCGGNDVKNGRDGVSSYPFEFPLGFGPFGVTSDLGSPVESLIGSSETESDEEEYIAGLTHQMARSTLEDGFGLD
        MAESLDDGEFWLPPKFLNDDDLF+E+KC GND+KNGR+GV  YP      FG FG TSDLGSPVESL+GSSETESDEEEYIAGLTH++ RSTLEDGFGLD
Subjt:  MAESLDDGEFWLPPKFLNDDDLFLEDKCGGNDVKNGRDGVSSYPFEFPLGFGPFGVTSDLGSPVESLIGSSETESDEEEYIAGLTHQMARSTLEDGFGLD

Query:  NSHGWGSSGSPQSTLCAVGSGCGCKQGCSRGSPNGHYSQAASQPQLTLDLLYAAAGEVSKMRLNEEAYGLINNRGPLPPPRKPSPVSVPVKNREPDAGVY
        NSH WGSSGSPQSTLCA+GSGCGCKQG SRGSPNGHY   AS PQLTLDLLYAAAGEVSKMR+NEE YG IN+ GPL PPRKPSPVSVP+KNREPDA VY
Subjt:  NSHGWGSSGSPQSTLCAVGSGCGCKQGCSRGSPNGHYSQAASQPQLTLDLLYAAAGEVSKMRLNEEAYGLINNRGPLPPPRKPSPVSVPVKNREPDAGVY

Query:  QQLQASQFLHLRRQQLMEQLNSAAVAAAAARVGQSKGCSVRNLQHEHQHQQGPQMPQNRGRNSDFFSGRNCRPA-SGLPSPPTWAA-PRKHAVNPPPNGS
        QQLQASQFLHLRRQQL+EQ+NS      A RVGQ+KG SVR+ Q        PQM QNRGRN++FF+GRNCR A +GLPS PTWAA PRKH VNPPPNGS
Subjt:  QQLQASQFLHLRRQQLMEQLNSAAVAAAAARVGQSKGCSVRNLQHEHQHQQGPQMPQNRGRNSDFFSGRNCRPA-SGLPSPPTWAA-PRKHAVNPPPNGS

Query:  GMRAVFLGVPGGKRECAGTGVFLPRQV-GAVSESRKKPACSTVLVPARVMQALNLNLDDMYVQRIQPQPPLQSRSPPVFNAGKNDVCVRVRSECLGSQQK
        GMRAVFLG PGGKRECAGTGVFLPRQ  G V+E+RKKPACSTVLVPARVMQALNLNLDDMYVQRIQP P LQSRSPPV+ AGKNDV VR +SE L  QQK
Subjt:  GMRAVFLGVPGGKRECAGTGVFLPRQV-GAVSESRKKPACSTVLVPARVMQALNLNLDDMYVQRIQPQPPLQSRSPPVFNAGKNDVCVRVRSECLGSQQK

Query:  GNLRAAVPVVNHDIRLPQEWTY
        GNLR AVP VNH+I LPQEWTY
Subjt:  GNLRAAVPVVNHDIRLPQEWTY

A0A5A7TPQ0 Uncharacterized protein3.9e-17979.86Show/hide
Query:  MAESLDDGEFWLPPKFLNDDDLFLEDKCGGNDVKNGRDGVSSYPFEFPLGFGPFGVTSDLGSPVESLIGSSETESDEEEYIAGLTHQMARSTLEDGFGLD
        MAESLDDGEFWLPPKFLNDDDLF+E+KC GND+KNGR+GV  YP      FG FG TSDLGSPVESL+GSSETESDEEEYIAGLTH++ RSTLEDGFGLD
Subjt:  MAESLDDGEFWLPPKFLNDDDLFLEDKCGGNDVKNGRDGVSSYPFEFPLGFGPFGVTSDLGSPVESLIGSSETESDEEEYIAGLTHQMARSTLEDGFGLD

Query:  NSHGWGSSGSPQSTLCAVGSGCGCKQGCSRGSPNGHYSQAASQPQLTLDLLYAAAGEVSKMRLNEEAYGLINNRGPLPPPRKPSPVSVPVKNREPDAGVY
        NSH WGSSGSPQSTLCA+GSGCGCKQG SRGSPNGHY   AS PQLTLDLLYAAAGEVSKMR+NEE YG IN+ GPL PPRKPSPVSVP+KNREPDA VY
Subjt:  NSHGWGSSGSPQSTLCAVGSGCGCKQGCSRGSPNGHYSQAASQPQLTLDLLYAAAGEVSKMRLNEEAYGLINNRGPLPPPRKPSPVSVPVKNREPDAGVY

Query:  QQLQASQFLHLRRQQLMEQLNSAAVAAAAARVGQSKGCSVRNLQHEHQHQQGPQMPQNRGRNSDFFSGRNCRPA-SGLPSPPTWAA-PRKHAVNPPPNGS
        QQLQASQFLHLRRQQL+EQ+NS      A RVGQ+KG SVR+ Q        PQM QNRGRN++FF+GRNCR A +GLPS PTWAA PRKH VNPPPNGS
Subjt:  QQLQASQFLHLRRQQLMEQLNSAAVAAAAARVGQSKGCSVRNLQHEHQHQQGPQMPQNRGRNSDFFSGRNCRPA-SGLPSPPTWAA-PRKHAVNPPPNGS

Query:  GMRAVFLGVPGGKRECAGTGVFLPRQV-GAVSESRKKPACSTVLVPARVMQALNLNLDDMYVQRIQPQPPLQSRSPPVFNAGKNDVCVRVRSECLGSQQK
        GMRAVFLG PGGKRECAGTGVFLPRQ  G V+E+RKKPACSTVLVPARVMQALNLNLDDMYVQRIQP P LQSRSPPV+ AGKNDV VR +SE L  QQK
Subjt:  GMRAVFLGVPGGKRECAGTGVFLPRQV-GAVSESRKKPACSTVLVPARVMQALNLNLDDMYVQRIQPQPPLQSRSPPVFNAGKNDVCVRVRSECLGSQQK

Query:  GNLRAAVPVVNHDIRLPQEWTY
        GNLR AVP VNH+I LPQEWTY
Subjt:  GNLRAAVPVVNHDIRLPQEWTY

A0A6J1DF19 uncharacterized protein LOC1110194791.3e-243100Show/hide
Query:  MAESLDDGEFWLPPKFLNDDDLFLEDKCGGNDVKNGRDGVSSYPFEFPLGFGPFGVTSDLGSPVESLIGSSETESDEEEYIAGLTHQMARSTLEDGFGLD
        MAESLDDGEFWLPPKFLNDDDLFLEDKCGGNDVKNGRDGVSSYPFEFPLGFGPFGVTSDLGSPVESLIGSSETESDEEEYIAGLTHQMARSTLEDGFGLD
Subjt:  MAESLDDGEFWLPPKFLNDDDLFLEDKCGGNDVKNGRDGVSSYPFEFPLGFGPFGVTSDLGSPVESLIGSSETESDEEEYIAGLTHQMARSTLEDGFGLD

Query:  NSHGWGSSGSPQSTLCAVGSGCGCKQGCSRGSPNGHYSQAASQPQLTLDLLYAAAGEVSKMRLNEEAYGLINNRGPLPPPRKPSPVSVPVKNREPDAGVY
        NSHGWGSSGSPQSTLCAVGSGCGCKQGCSRGSPNGHYSQAASQPQLTLDLLYAAAGEVSKMRLNEEAYGLINNRGPLPPPRKPSPVSVPVKNREPDAGVY
Subjt:  NSHGWGSSGSPQSTLCAVGSGCGCKQGCSRGSPNGHYSQAASQPQLTLDLLYAAAGEVSKMRLNEEAYGLINNRGPLPPPRKPSPVSVPVKNREPDAGVY

Query:  QQLQASQFLHLRRQQLMEQLNSAAVAAAAARVGQSKGCSVRNLQHEHQHQQGPQMPQNRGRNSDFFSGRNCRPASGLPSPPTWAAPRKHAVNPPPNGSGM
        QQLQASQFLHLRRQQLMEQLNSAAVAAAAARVGQSKGCSVRNLQHEHQHQQGPQMPQNRGRNSDFFSGRNCRPASGLPSPPTWAAPRKHAVNPPPNGSGM
Subjt:  QQLQASQFLHLRRQQLMEQLNSAAVAAAAARVGQSKGCSVRNLQHEHQHQQGPQMPQNRGRNSDFFSGRNCRPASGLPSPPTWAAPRKHAVNPPPNGSGM

Query:  RAVFLGVPGGKRECAGTGVFLPRQVGAVSESRKKPACSTVLVPARVMQALNLNLDDMYVQRIQPQPPLQSRSPPVFNAGKNDVCVRVRSECLGSQQKGNL
        RAVFLGVPGGKRECAGTGVFLPRQVGAVSESRKKPACSTVLVPARVMQALNLNLDDMYVQRIQPQPPLQSRSPPVFNAGKNDVCVRVRSECLGSQQKGNL
Subjt:  RAVFLGVPGGKRECAGTGVFLPRQVGAVSESRKKPACSTVLVPARVMQALNLNLDDMYVQRIQPQPPLQSRSPPVFNAGKNDVCVRVRSECLGSQQKGNL

Query:  RAAVPVVNHDIRLPQEWTY
        RAAVPVVNHDIRLPQEWTY
Subjt:  RAAVPVVNHDIRLPQEWTY

A0A6J1EV17 uncharacterized protein LOC1114382292.4e-16876Show/hide
Query:  MAESLDDGEFWLPPKFLNDDDLFLEDKCGGNDVKNGRDGVSS--YPFEFPLGFGPFGVTSDLGSPVESLIGSSETESDEEEYIAGLTHQMARSTLEDGFG
        MAESLDDGEFWLPPKFLNDDDLFLE+KC GND K  RDG  +  +PFE+ LGFGPFGV+SDLGSPVESLIGSSETESDE+EYIAGL  QMARSTL+DGFG
Subjt:  MAESLDDGEFWLPPKFLNDDDLFLEDKCGGNDVKNGRDGVSS--YPFEFPLGFGPFGVTSDLGSPVESLIGSSETESDEEEYIAGLTHQMARSTLEDGFG

Query:  LDNSHGWGSSGSPQSTLCAVGSGCGCKQGCSRGSPNGHYSQAASQPQLTLDLLYAAAGEVSKMRLNEEAYGLINNRGPLPPPRKPSPVSVPVKNREPDAG
        L+ SHGW SSGSPQSTLC VG+GCGCKQ  SRGSPN H     S PQLTLDLLYAAAGEVSKMR+NEEAY  +NNRG   PPRKPSPV+VP+KNR+ DAG
Subjt:  LDNSHGWGSSGSPQSTLCAVGSGCGCKQGCSRGSPNGHYSQAASQPQLTLDLLYAAAGEVSKMRLNEEAYGLINNRGPLPPPRKPSPVSVPVKNREPDAG

Query:  VYQQLQASQFLHLRRQQLMEQLNSAAVAAAAARVGQSKGCSVRNLQHEHQHQQGPQMPQNRGRNSDFFSGRNCRPASGLPSPPTWA--APRKHAVNPPPN
        VYQQLQASQFLHL+RQQL+EQ+NS      AARVG   G SVR+ Q         Q+PQNRGRNS+FF GRNCR A GL S P WA   PRKH+VNP PN
Subjt:  VYQQLQASQFLHLRRQQLMEQLNSAAVAAAAARVGQSKGCSVRNLQHEHQHQQGPQMPQNRGRNSDFFSGRNCRPASGLPSPPTWA--APRKHAVNPPPN

Query:  GSGMRAVFLGVPGGKRECAGTGVFLPRQVGAVSESRKKPACSTVLVPARVMQALNLNLDDMYVQRIQPQPPLQSRSPPVFNAGKNDVCVRVRSECLGSQQ
        GSGMRAVFLGVPGGKRECAGTGVFLPRQ+GAVSE+RKKPACSTVLVPARVMQALNLNLDDMYVQR QPQ  LQSRSP VFN GKND+  R RSE L +QQ
Subjt:  GSGMRAVFLGVPGGKRECAGTGVFLPRQVGAVSESRKKPACSTVLVPARVMQALNLNLDDMYVQRIQPQPPLQSRSPPVFNAGKNDVCVRVRSECLGSQQ

Query:  KG-NLRAAVPVVNHDI-RLPQEWTY
        K  NLRAAV  VN +I RLPQEW+Y
Subjt:  KG-NLRAAVPVVNHDI-RLPQEWTY

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G39870.1 unknown protein6.5e-0926.37Show/hide
Query:  YPFEFPLGFGPFGVTSDLGSPVESLIGSSETESDEEEYIAGLTHQMARST--LEDGFGLDNSHGWGSSGSPQSTLCAVGSGCGCKQGCSRGS--PNGHYS
        +P EFP  F     +    SP +S     E+  DEE+++AGLT ++A ST  L             ++ SPQSTL  +GS        SR    P+    
Subjt:  YPFEFPLGFGPFGVTSDLGSPVESLIGSSETESDEEEYIAGLTHQMARST--LEDGFGLDNSHGWGSSGSPQSTLCAVGSGCGCKQGCSRGS--PNGHYS

Query:  QAASQPQLTLDLLYAAAGEVSKMRLNEEAYGLINNRGPLPPPRKPSPVSVPVKNREPDAGVYQQLQASQFLHLRRQQLMEQLNSAAVAAAAARVGQSKGC
         ++ +     D++ AAAGEV++++L           G   P   P      +  R+ +A ++ +LQ         Q+L+EQ+    + +A +R   S+  
Subjt:  QAASQPQLTLDLLYAAAGEVSKMRLNEEAYGLINNRGPLPPPRKPSPVSVPVKNREPDAGVYQQLQASQFLHLRRQQLMEQLNSAAVAAAAARVGQSKGC

Query:  SVRNLQHEHQHQQGPQMPQNRGRNSDFFSGRNCRPASGLPSPPTWAAPRKHAVNPPPNGSGMRAVFLGVPGGKRECAGTGVFLPRQV--GAVSESRKKPA
          R + +E    + P+  +   RN+                 PTW  P++ A                    KR  AGTGVFLPR+    A S+S K P 
Subjt:  SVRNLQHEHQHQQGPQMPQNRGRNSDFFSGRNCRPASGLPSPPTWAAPRKHAVNPPPNGSGMRAVFLGVPGGKRECAGTGVFLPRQV--GAVSESRKKPA

Query:  CSTVLVPARVMQALNLNLDDMYVQRIQPQPPLQSRSPPVFNAGKNDVCVRVRSECLGSQQKGNLRAAVPVVNHDIRLPQEWTY
         +  ++  +V +  NLN D+ +   + P+     RS   +       C+  RS  L  Q  GN RA          LPQ+W Y
Subjt:  CSTVLVPARVMQALNLNLDDMYVQRIQPQPPLQSRSPPVFNAGKNDVCVRVRSECLGSQQKGNLRAAVPVVNHDIRLPQEWTY

AT3G54000.1 CONTAINS InterPro DOMAIN/s: Uncharacterised conserved protein UCP022260 (InterPro:IPR016802); Has 94 Blast hits to 94 proteins in 14 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 94; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink).3.8e-3334.62Show/hide
Query:  LDDGEFWLPPKFLNDDDLFLEDKCGGNDVKNGRDGVSSYPFEFPLGFGPFGVTSDLGSPVESLIGSSETESDEEEYIAGLTHQMARSTLEDGF--GLDNS
        +DD EFWLP +FL DDD  +E +   N V  G D  S +P+E   GFG FG T         +  ++  E DEE ++AGLT QM  S+L+D F  G+  +
Subjt:  LDDGEFWLPPKFLNDDDLFLEDKCGGNDVKNGRDGVSSYPFEFPLGFGPFGVTSDLGSPVESLIGSSETESDEEEYIAGLTHQMARSTLEDGF--GLDNS

Query:  H---------GWGSSGSPQSTLCAVGSGCGCKQGCSRGSPNGHYSQAASQPQLTLDLLYAAAGEVSKMRLNEEAYGLINNRGPLPPPRKPSPVSVPVKNR
        H          W  + SP    C  G+GC C         N  ++Q  +    + DL  AA     +M +N+E Y   + RG L  P K   +S  VKN 
Subjt:  H---------GWGSSGSPQSTLCAVGSGCGCKQGCSRGSPNGHYSQAASQPQLTLDLLYAAAGEVSKMRLNEEAYGLINNRGPLPPPRKPSPVSVPVKNR

Query:  EPDAG---------VYQQLQASQFLHLRRQQLMEQLNSAAVAAAAARVGQSKGCSVRNLQHEHQHQQGPQMPQNRGRNSDFFSGRNCRPASGLPSPPTWA
          +            YQ+LQA QF  L++QQLM                            +H+ Q    + QNRG   +    +N  P     S   W+
Subjt:  EPDAG---------VYQQLQASQFLHLRRQQLMEQLNSAAVAAAAARVGQSKGCSVRNLQHEHQHQQGPQMPQNRGRNSDFFSGRNCRPASGLPSPPTWA

Query:  APRKHAVNPPPNGSGMRAVFLGVPGGKRECAGTGVFLPRQVGAVS--ESRKKPACSTVLVPARVMQALNLNLDDMYVQRIQPQPPLQSRSPPVFNAGKND
               N  P    MRAVF+G   GKR   GTGVFLPR V   S  E+R+KP  STVLVPAR+ Q LNLNL +          P++S       A  ND
Subjt:  APRKHAVNPPPNGSGMRAVFLGVPGGKRECAGTGVFLPRQVGAVS--ESRKKPACSTVLVPARVMQALNLNLDDMYVQRIQPQPPLQSRSPPVFNAGKND

Query:  VCVRVRSECLG--SQQKGNLRAAVPVVNHDIRLPQEWTY
        V  R RS   G  SQ  G +RA   V   + RLP EW Y
Subjt:  VCVRVRSECLG--SQQKGNLRAAVPVVNHDIRLPQEWTY

AT3G54000.2 unknown protein4.4e-2132.77Show/hide
Query:  LDDGEFWLPPKFLNDDDLFLEDKCGGNDVKNGRDGVSSYPFEFPLGFGPFGVTSDLGSPVESLIGSSETESDEEEYIAGLTHQMARSTLEDGF--GLDNS
        +DD EFWLP +FL DDD  +E +   N V  G D  S +P+E   GFG FG T         +  ++  E DEE ++AGLT QM  S+L+D F  G+  +
Subjt:  LDDGEFWLPPKFLNDDDLFLEDKCGGNDVKNGRDGVSSYPFEFPLGFGPFGVTSDLGSPVESLIGSSETESDEEEYIAGLTHQMARSTLEDGF--GLDNS

Query:  H---------GWGSSGSPQSTLCAVGSGCGCKQGCSRGSPNGHYSQAASQPQLTLDLLYAAAGEVSKMRLNEEAYGLINNRGPLPPPRKPSPVSVPVKNR
        H          W  + SP    C  G+GC C         N  ++Q  +    + DL  AA     +M +N+E Y   + RG L  P K   +S  VKN 
Subjt:  H---------GWGSSGSPQSTLCAVGSGCGCKQGCSRGSPNGHYSQAASQPQLTLDLLYAAAGEVSKMRLNEEAYGLINNRGPLPPPRKPSPVSVPVKNR

Query:  EPDAG---------VYQQLQASQFLHLRRQQLMEQLNSAAVAAAAARVGQSKGCSVRNLQHEHQHQQGPQMPQNRGRNSDFFSGRNCRPASGLPSPPTWA
          +            YQ+LQA QF  L++QQLM                            +H+ Q    + QNRG   +    +N  P     S   W+
Subjt:  EPDAG---------VYQQLQASQFLHLRRQQLMEQLNSAAVAAAAARVGQSKGCSVRNLQHEHQHQQGPQMPQNRGRNSDFFSGRNCRPASGLPSPPTWA

Query:  APRKHAVNPPPNGSGMRAVFLGVPGGKRECAGTGVFLPRQVGAVS--ESRKKPA
               N  P    MRAVF+G   GKR   GTGVFLPR V   S  E+R+KP+
Subjt:  APRKHAVNPPPNGSGMRAVFLGVPGGKRECAGTGVFLPRQVGAVS--ESRKKPA

AT3G54000.3 unknown protein4.4e-2132.77Show/hide
Query:  LDDGEFWLPPKFLNDDDLFLEDKCGGNDVKNGRDGVSSYPFEFPLGFGPFGVTSDLGSPVESLIGSSETESDEEEYIAGLTHQMARSTLEDGF--GLDNS
        +DD EFWLP +FL DDD  +E +   N V  G D  S +P+E   GFG FG T         +  ++  E DEE ++AGLT QM  S+L+D F  G+  +
Subjt:  LDDGEFWLPPKFLNDDDLFLEDKCGGNDVKNGRDGVSSYPFEFPLGFGPFGVTSDLGSPVESLIGSSETESDEEEYIAGLTHQMARSTLEDGF--GLDNS

Query:  H---------GWGSSGSPQSTLCAVGSGCGCKQGCSRGSPNGHYSQAASQPQLTLDLLYAAAGEVSKMRLNEEAYGLINNRGPLPPPRKPSPVSVPVKNR
        H          W  + SP    C  G+GC C         N  ++Q  +    + DL  AA     +M +N+E Y   + RG L  P K   +S  VKN 
Subjt:  H---------GWGSSGSPQSTLCAVGSGCGCKQGCSRGSPNGHYSQAASQPQLTLDLLYAAAGEVSKMRLNEEAYGLINNRGPLPPPRKPSPVSVPVKNR

Query:  EPDAG---------VYQQLQASQFLHLRRQQLMEQLNSAAVAAAAARVGQSKGCSVRNLQHEHQHQQGPQMPQNRGRNSDFFSGRNCRPASGLPSPPTWA
          +            YQ+LQA QF  L++QQLM                            +H+ Q    + QNRG   +    +N  P     S   W+
Subjt:  EPDAG---------VYQQLQASQFLHLRRQQLMEQLNSAAVAAAAARVGQSKGCSVRNLQHEHQHQQGPQMPQNRGRNSDFFSGRNCRPASGLPSPPTWA

Query:  APRKHAVNPPPNGSGMRAVFLGVPGGKRECAGTGVFLPRQVGAVS--ESRKKPA
               N  P    MRAVF+G   GKR   GTGVFLPR V   S  E+R+KP+
Subjt:  APRKHAVNPPPNGSGMRAVFLGVPGGKRECAGTGVFLPRQVGAVS--ESRKKPA

AT5G59050.1 unknown protein1.1e-1340.31Show/hide
Query:  SGMRAVFLGVPGGKRECAGTGVFLPRQVGAVSESRKKPACSTVLVPARVMQALNLNLDDMYVQRI--QPQPP-----LQSRSPPVFNAGKNDVCVRVRSE
        SG++AVF+   G +    GTGVFLPR  G V ESRKK  CSTV++PARV++AL ++ D + V        PP     L S +     + KN    RV+S 
Subjt:  SGMRAVFLGVPGGKRECAGTGVFLPRQVGAVSESRKKPACSTVLVPARVMQALNLNLDDMYVQRI--QPQPP-----LQSRSPPVFNAGKNDVCVRVRSE

Query:  CLGSQQKGNLRAAVPVVNHDIRLPQEWTY
          GS  +  + +A         LPQEWTY
Subjt:  CLGSQQKGNLRAAVPVVNHDIRLPQEWTY


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTGAGAGTTTGGACGACGGCGAGTTCTGGCTTCCTCCCAAGTTCCTTAACGACGACGACTTGTTCTTGGAGGACAAGTGTGGGGGAAATGATGTTAAGAAT
GGGAGGGATGGAGTTAGCTCCTACCCGTTTGAGTTCCCTCTCGGGTTTGGGCCTTTTGGGGTTACTTCTGATCTCGGCTCGCCGGTTGAATCTCTCATTGGTTCC
AGCGAGACCGAGAGCGATGAGGAGGAATACATCGCTGGATTGACTCACCAAATGGCGCGTTCCACGCTGGAGGATGGATTTGGCCTCGACAACTCTCACGGTTGG
GGCTCTTCTGGTTCTCCACAGTCAACGCTGTGCGCTGTAGGAAGTGGGTGCGGCTGCAAACAGGGCTGTAGCAGAGGAAGCCCCAACGGTCATTATTCCCAAGCT
GCTTCTCAGCCGCAGCTCACTTTGGATCTGCTCTACGCCGCGGCCGGCGAAGTCTCCAAGATGCGTCTGAATGAAGAAGCCTACGGCTTAATCAACAACCGTGGA
CCCCTCCCTCCGCCGAGAAAGCCCTCTCCCGTCTCTGTTCCTGTAAAAAACCGGGAACCCGACGCCGGAGTTTACCAGCAGCTGCAGGCATCTCAGTTTTTACAT
CTGCGACGGCAACAGCTCATGGAGCAACTGAACTCGGCGGCGGTGGCGGCGGCGGCTGCTCGGGTAGGGCAGTCAAAGGGCTGCTCTGTTAGAAACCTTCAGCAC
GAGCACCAGCACCAGCAGGGTCCCCAAATGCCGCAGAACAGAGGAAGAAACAGCGATTTCTTCAGTGGCAGAAACTGTCGCCCTGCTTCTGGCTTACCTTCCCCT
CCCACTTGGGCTGCTCCACGGAAACACGCCGTGAATCCCCCGCCGAACGGCTCCGGCATGAGGGCGGTGTTTCTCGGCGTTCCGGGTGGCAAGAGGGAATGCGCC
GGCACTGGCGTGTTTTTGCCTCGGCAAGTCGGCGCCGTCTCTGAATCCCGCAAGAAGCCAGCTTGCTCGACTGTGTTAGTTCCTGCAAGAGTGATGCAAGCTCTG
AATCTAAACTTAGACGACATGTACGTTCAACGAATCCAGCCCCAGCCTCCTCTTCAATCCCGTTCCCCTCCAGTTTTCAACGCAGGGAAGAACGATGTTTGTGTG
AGAGTCCGCAGCGAGTGTTTGGGATCTCAGCAAAAGGGGAATCTCCGGGCGGCGGTGCCGGTGGTGAACCATGACATCCGGCTACCACAGGAGTGGACTTACTGA
mRNA sequenceShow/hide mRNA sequence
ATGGCTGAGAGTTTGGACGACGGCGAGTTCTGGCTTCCTCCCAAGTTCCTTAACGACGACGACTTGTTCTTGGAGGACAAGTGTGGGGGAAATGATGTTAAGAAT
GGGAGGGATGGAGTTAGCTCCTACCCGTTTGAGTTCCCTCTCGGGTTTGGGCCTTTTGGGGTTACTTCTGATCTCGGCTCGCCGGTTGAATCTCTCATTGGTTCC
AGCGAGACCGAGAGCGATGAGGAGGAATACATCGCTGGATTGACTCACCAAATGGCGCGTTCCACGCTGGAGGATGGATTTGGCCTCGACAACTCTCACGGTTGG
GGCTCTTCTGGTTCTCCACAGTCAACGCTGTGCGCTGTAGGAAGTGGGTGCGGCTGCAAACAGGGCTGTAGCAGAGGAAGCCCCAACGGTCATTATTCCCAAGCT
GCTTCTCAGCCGCAGCTCACTTTGGATCTGCTCTACGCCGCGGCCGGCGAAGTCTCCAAGATGCGTCTGAATGAAGAAGCCTACGGCTTAATCAACAACCGTGGA
CCCCTCCCTCCGCCGAGAAAGCCCTCTCCCGTCTCTGTTCCTGTAAAAAACCGGGAACCCGACGCCGGAGTTTACCAGCAGCTGCAGGCATCTCAGTTTTTACAT
CTGCGACGGCAACAGCTCATGGAGCAACTGAACTCGGCGGCGGTGGCGGCGGCGGCTGCTCGGGTAGGGCAGTCAAAGGGCTGCTCTGTTAGAAACCTTCAGCAC
GAGCACCAGCACCAGCAGGGTCCCCAAATGCCGCAGAACAGAGGAAGAAACAGCGATTTCTTCAGTGGCAGAAACTGTCGCCCTGCTTCTGGCTTACCTTCCCCT
CCCACTTGGGCTGCTCCACGGAAACACGCCGTGAATCCCCCGCCGAACGGCTCCGGCATGAGGGCGGTGTTTCTCGGCGTTCCGGGTGGCAAGAGGGAATGCGCC
GGCACTGGCGTGTTTTTGCCTCGGCAAGTCGGCGCCGTCTCTGAATCCCGCAAGAAGCCAGCTTGCTCGACTGTGTTAGTTCCTGCAAGAGTGATGCAAGCTCTG
AATCTAAACTTAGACGACATGTACGTTCAACGAATCCAGCCCCAGCCTCCTCTTCAATCCCGTTCCCCTCCAGTTTTCAACGCAGGGAAGAACGATGTTTGTGTG
AGAGTCCGCAGCGAGTGTTTGGGATCTCAGCAAAAGGGGAATCTCCGGGCGGCGGTGCCGGTGGTGAACCATGACATCCGGCTACCACAGGAGTGGACTTACTGA
Protein sequenceShow/hide protein sequence
MAESLDDGEFWLPPKFLNDDDLFLEDKCGGNDVKNGRDGVSSYPFEFPLGFGPFGVTSDLGSPVESLIGSSETESDEEEYIAGLTHQMARSTLEDGFGLDNSHGW
GSSGSPQSTLCAVGSGCGCKQGCSRGSPNGHYSQAASQPQLTLDLLYAAAGEVSKMRLNEEAYGLINNRGPLPPPRKPSPVSVPVKNREPDAGVYQQLQASQFLH
LRRQQLMEQLNSAAVAAAAARVGQSKGCSVRNLQHEHQHQQGPQMPQNRGRNSDFFSGRNCRPASGLPSPPTWAAPRKHAVNPPPNGSGMRAVFLGVPGGKRECA
GTGVFLPRQVGAVSESRKKPACSTVLVPARVMQALNLNLDDMYVQRIQPQPPLQSRSPPVFNAGKNDVCVRVRSECLGSQQKGNLRAAVPVVNHDIRLPQEWTY