; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS013970 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS013970
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionTIP41-like protein
Genome locationscaffold607:1328088..1329722
RNA-Seq ExpressionMS013970
SyntenyMS013970
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7027367.1 hypothetical protein SDJN02_11379 [Cucurbita argyrosperma subsp. argyrosperma]8.1e-17177.07Show/hide
Query:  MAESLDDGEFWLPPKFLNDDDLFLEDKCGGNDVKNGRD--GVSSYPFEFPLGFGPFGVTSDLGSPVESLIGSSETESDEEEYIAGLTHQMARSTLEDGFG
        MAESLDDGEFWLPPKFLNDDDLFLE KC GND K  RD  GV  +PFE+ LGFGPFGV+SDLGSPVESLIGSSETESDE+EYIAGL  Q+ARSTLEDGFG
Subjt:  MAESLDDGEFWLPPKFLNDDDLFLEDKCGGNDVKNGRD--GVSSYPFEFPLGFGPFGVTSDLGSPVESLIGSSETESDEEEYIAGLTHQMARSTLEDGFG

Query:  LDNSHGWGSSGSPQSTLCAVGSGCGCKQGCSRGSPNGHYSQAASQPQLTLDLLYAAAGEVSKMRLNEEAYGLINNRGPLPPPRKPSPVSVPVKNREPDAG
        LD SHGW SSGSPQSTLC VG+GCGCKQ  SRGSPN H     S PQLTLDLLYAAAGEVSKMR+NEEAY  +NN G   PPRKPSPV+VP+KNR+ DAG
Subjt:  LDNSHGWGSSGSPQSTLCAVGSGCGCKQGCSRGSPNGHYSQAASQPQLTLDLLYAAAGEVSKMRLNEEAYGLINNRGPLPPPRKPSPVSVPVKNREPDAG

Query:  VYQQLQASQFLHLRRQQLIEQLNSAAVAAAAARVGQAKGCSVRNPQHQHQHQQGPQMPQNRGRNSDFFSGRNCRPASGLPSPPTWAAPRKHAVNPPPNGS
        VYQQLQASQFLHL+RQQLIEQ+ S      AARVG   G SVR+PQ        PQ+PQNRGRNS+FF GRNCR A GL S P WA PRKH+VNP PNGS
Subjt:  VYQQLQASQFLHLRRQQLIEQLNSAAVAAAAARVGQAKGCSVRNPQHQHQHQQGPQMPQNRGRNSDFFSGRNCRPASGLPSPPTWAAPRKHAVNPPPNGS

Query:  GMRAVFLGVPGGKRECAGTGVFLPRQVGAVSESRKKPACSTVLVPARVMQALNLNLDDMYVQRIQPQPPLQSRSPPVFNAGKNDVCVRVRSECLGSQQKG
        GMRAVFLGVPGGKRECAGTGVFLPRQ+GAVSE+RKKPACSTVLVPARVMQALNLNLDDMYVQR QPQ  LQSRSP VFN GKND+  R RSE L +QQK 
Subjt:  GMRAVFLGVPGGKRECAGTGVFLPRQVGAVSESRKKPACSTVLVPARVMQALNLNLDDMYVQRIQPQPPLQSRSPPVFNAGKNDVCVRVRSECLGSQQKG

Query:  -NLRAAVTVVNHDI-RLPQEWTY
         NLRAAV  VN +I RLPQEW+Y
Subjt:  -NLRAAVTVVNHDI-RLPQEWTY

XP_004147909.1 uncharacterized protein LOC101214270 [Cucumis sativus]7.3e-18079.91Show/hide
Query:  MAESLDDGEFWLPPKFLNDDDLFLEDKCGGNDVKNGRDGVSSYPFEFPLGFGPFGVTSDLGSPVESLIGSSETESDEEEYIAGLTHQMARSTLEDGFGLD
        MAESLDDGEFWLPPKFLNDDDLF+E+KCGGND+K+GR+GV  YP      FG FG TSDLGSPVESL+GSSETESDEEEYIAGLTH+M RSTLEDGFGLD
Subjt:  MAESLDDGEFWLPPKFLNDDDLFLEDKCGGNDVKNGRDGVSSYPFEFPLGFGPFGVTSDLGSPVESLIGSSETESDEEEYIAGLTHQMARSTLEDGFGLD

Query:  NSHGWGSSGSPQSTLCAVGSGCGCKQGCSRGSPNGHYSQAASQPQLTLDLLYAAAGEVSKMRLNEEAYGLINNRGPLPPPRKPSPVSVPVKNREPDAGVY
        NSH WGSSGSPQSTLCA+GSGCGCKQ  SRGSPNGHY   AS PQLTLDLLYAAAGEVSKMR+NEEAYG IN+ GPL PPRKPSPVSVP+KNREPD  VY
Subjt:  NSHGWGSSGSPQSTLCAVGSGCGCKQGCSRGSPNGHYSQAASQPQLTLDLLYAAAGEVSKMRLNEEAYGLINNRGPLPPPRKPSPVSVPVKNREPDAGVY

Query:  QQLQASQFLHLRRQQLIEQLNSAAVAAAAARVGQAKGCSVRNPQHQHQHQQGPQMPQNRGRNSDFFSGRNCRPA-SGLPSPPTWAAP--RKHAVNPPPNG
        QQLQASQFLHLRRQQLIEQ+NS      AARVGQ KG +VR PQ        PQMPQNRGRN++FF+GRNCR A +GLPS PTW AP  ++H VNPP NG
Subjt:  QQLQASQFLHLRRQQLIEQLNSAAVAAAAARVGQAKGCSVRNPQHQHQHQQGPQMPQNRGRNSDFFSGRNCRPA-SGLPSPPTWAAP--RKHAVNPPPNG

Query:  SGMRAVFLGVPGGKRECAGTGVFLPRQVG-AVSESRKKPACSTVLVPARVMQALNLNLDDMYVQRIQPQPPLQSRSPPVFNAGKNDVCVRVRSECLGSQQ
        SGMRAVFLG PGGKRECAGTGVFLPRQ G A+SE+RKKPACSTVLVPARVMQALNLNLDDMYVQR+ P P LQSRSPPVFNAGKNDV VR RSE L  QQ
Subjt:  SGMRAVFLGVPGGKRECAGTGVFLPRQVG-AVSESRKKPACSTVLVPARVMQALNLNLDDMYVQRIQPQPPLQSRSPPVFNAGKNDVCVRVRSECLGSQQ

Query:  KGNLRAAVTVVNHDIRLPQEWTY
        KGNLRAAV  VNH+I LPQEWTY
Subjt:  KGNLRAAVTVVNHDIRLPQEWTY

XP_008448729.1 PREDICTED: uncharacterized protein LOC103490808 [Cucumis melo]4.8e-17980.09Show/hide
Query:  MAESLDDGEFWLPPKFLNDDDLFLEDKCGGNDVKNGRDGVSSYPFEFPLGFGPFGVTSDLGSPVESLIGSSETESDEEEYIAGLTHQMARSTLEDGFGLD
        MAESLDDGEFWLPPKFLNDDDLF+E+KC GND+KNGR+GV  YP      FG FG TSDLGSPVESL+GSSETESDEEEYIAGLTH++ RSTLEDGFGLD
Subjt:  MAESLDDGEFWLPPKFLNDDDLFLEDKCGGNDVKNGRDGVSSYPFEFPLGFGPFGVTSDLGSPVESLIGSSETESDEEEYIAGLTHQMARSTLEDGFGLD

Query:  NSHGWGSSGSPQSTLCAVGSGCGCKQGCSRGSPNGHYSQAASQPQLTLDLLYAAAGEVSKMRLNEEAYGLINNRGPLPPPRKPSPVSVPVKNREPDAGVY
        NSH WGSSGSPQSTLCA+GSGCGCKQG SRGSPNGHY   AS PQLTLDLLYAAAGEVSKMR+NEE YG IN+ GPL PPRKPSPVSVP+KNREPDA VY
Subjt:  NSHGWGSSGSPQSTLCAVGSGCGCKQGCSRGSPNGHYSQAASQPQLTLDLLYAAAGEVSKMRLNEEAYGLINNRGPLPPPRKPSPVSVPVKNREPDAGVY

Query:  QQLQASQFLHLRRQQLIEQLNSAAVAAAAARVGQAKGCSVRNPQHQHQHQQGPQMPQNRGRNSDFFSGRNCRPA-SGLPSPPTWAA-PRKHAVNPPPNGS
        QQLQASQFLHLRRQQLIEQ+NS      A RVGQ KG SVR+PQ        PQM QNRGRN++FF+GRNCR A +GLPS PTWAA PRKH VNPPPNGS
Subjt:  QQLQASQFLHLRRQQLIEQLNSAAVAAAAARVGQAKGCSVRNPQHQHQHQQGPQMPQNRGRNSDFFSGRNCRPA-SGLPSPPTWAA-PRKHAVNPPPNGS

Query:  GMRAVFLGVPGGKRECAGTGVFLPRQV-GAVSESRKKPACSTVLVPARVMQALNLNLDDMYVQRIQPQPPLQSRSPPVFNAGKNDVCVRVRSECLGSQQK
        GMRAVFLG PGGKRECAGTGVFLPRQ  G V+E+RKKPACSTVLVPARVMQALNLNLDDMYVQRIQP P LQSRSPPV+ AGKNDV VR +SE L  QQK
Subjt:  GMRAVFLGVPGGKRECAGTGVFLPRQV-GAVSESRKKPACSTVLVPARVMQALNLNLDDMYVQRIQPQPPLQSRSPPVFNAGKNDVCVRVRSECLGSQQK

Query:  GNLRAAVTVVNHDIRLPQEWTY
        GNLR AV  VNH+I LPQEWTY
Subjt:  GNLRAAVTVVNHDIRLPQEWTY

XP_022151566.1 uncharacterized protein LOC111019479 [Momordica charantia]4.8e-24098.81Show/hide
Query:  MAESLDDGEFWLPPKFLNDDDLFLEDKCGGNDVKNGRDGVSSYPFEFPLGFGPFGVTSDLGSPVESLIGSSETESDEEEYIAGLTHQMARSTLEDGFGLD
        MAESLDDGEFWLPPKFLNDDDLFLEDKCGGNDVKNGRDGVSSYPFEFPLGFGPFGVTSDLGSPVESLIGSSETESDEEEYIAGLTHQMARSTLEDGFGLD
Subjt:  MAESLDDGEFWLPPKFLNDDDLFLEDKCGGNDVKNGRDGVSSYPFEFPLGFGPFGVTSDLGSPVESLIGSSETESDEEEYIAGLTHQMARSTLEDGFGLD

Query:  NSHGWGSSGSPQSTLCAVGSGCGCKQGCSRGSPNGHYSQAASQPQLTLDLLYAAAGEVSKMRLNEEAYGLINNRGPLPPPRKPSPVSVPVKNREPDAGVY
        NSHGWGSSGSPQSTLCAVGSGCGCKQGCSRGSPNGHYSQAASQPQLTLDLLYAAAGEVSKMRLNEEAYGLINNRGPLPPPRKPSPVSVPVKNREPDAGVY
Subjt:  NSHGWGSSGSPQSTLCAVGSGCGCKQGCSRGSPNGHYSQAASQPQLTLDLLYAAAGEVSKMRLNEEAYGLINNRGPLPPPRKPSPVSVPVKNREPDAGVY

Query:  QQLQASQFLHLRRQQLIEQLNSAAVAAAAARVGQAKGCSVRNPQHQHQHQQGPQMPQNRGRNSDFFSGRNCRPASGLPSPPTWAAPRKHAVNPPPNGSGM
        QQLQASQFLHLRRQQL+EQLNSAAVAAAAARVGQ+KGCSVRN QH+HQHQQGPQMPQNRGRNSDFFSGRNCRPASGLPSPPTWAAPRKHAVNPPPNGSGM
Subjt:  QQLQASQFLHLRRQQLIEQLNSAAVAAAAARVGQAKGCSVRNPQHQHQHQQGPQMPQNRGRNSDFFSGRNCRPASGLPSPPTWAAPRKHAVNPPPNGSGM

Query:  RAVFLGVPGGKRECAGTGVFLPRQVGAVSESRKKPACSTVLVPARVMQALNLNLDDMYVQRIQPQPPLQSRSPPVFNAGKNDVCVRVRSECLGSQQKGNL
        RAVFLGVPGGKRECAGTGVFLPRQVGAVSESRKKPACSTVLVPARVMQALNLNLDDMYVQRIQPQPPLQSRSPPVFNAGKNDVCVRVRSECLGSQQKGNL
Subjt:  RAVFLGVPGGKRECAGTGVFLPRQVGAVSESRKKPACSTVLVPARVMQALNLNLDDMYVQRIQPQPPLQSRSPPVFNAGKNDVCVRVRSECLGSQQKGNL

Query:  RAAVTVVNHDIRLPQEWTY
        RAAV VVNHDIRLPQEWTY
Subjt:  RAAVTVVNHDIRLPQEWTY

XP_038883347.1 uncharacterized protein LOC120074329 [Benincasa hispida]3.3e-18081.04Show/hide
Query:  MAESLDDGEFWLPPKFLNDDDLFLEDKCGGNDVKNGRDGVSSYPFEFPLGFGPFGVTSDLGSPVESLIGSSETESDEEEYIAGLTHQMARSTLEDGFGLD
        MAESLDDGEFWLPPKFLNDDDLFLE+KCGGNDVKNGR GV  YP      FGPFG  SDLGSPVESL+GSSETESDEEEYIAGLTHQM RSTLEDGFGLD
Subjt:  MAESLDDGEFWLPPKFLNDDDLFLEDKCGGNDVKNGRDGVSSYPFEFPLGFGPFGVTSDLGSPVESLIGSSETESDEEEYIAGLTHQMARSTLEDGFGLD

Query:  NSHGWGSSGSPQSTLCAVGSGCGCKQGCSRGSPNGHYSQAASQPQLTLDLLYAAAGEVSKMRLNEEAYGLINNRGPLPPPRKPSPVSVPVKNREPDAGVY
        NSH WGSSGSPQSTLCA+GSGCGCKQG SRGSPNGHY   AS PQLTLDLL+AAAGEVSKMR+NEEAYG IN+RGPL PPRKPSPVSVP+KNREP+A VY
Subjt:  NSHGWGSSGSPQSTLCAVGSGCGCKQGCSRGSPNGHYSQAASQPQLTLDLLYAAAGEVSKMRLNEEAYGLINNRGPLPPPRKPSPVSVPVKNREPDAGVY

Query:  QQLQASQFLHLRRQQLIEQLNSAAVAAAAARVGQAKGCSVRNPQHQHQHQQGPQMPQNRGRNSDFFSGRNCRPA-SGLPSPPTWAA-PRKHAVNPPPNGS
        QQLQASQFLHLRRQQLIEQ+NS       ARV Q KG SVR+ Q        PQM QNRGRNS+FF+GRNCR A +GL S PTWAA PRKH VNPPPNGS
Subjt:  QQLQASQFLHLRRQQLIEQLNSAAVAAAAARVGQAKGCSVRNPQHQHQHQQGPQMPQNRGRNSDFFSGRNCRPA-SGLPSPPTWAA-PRKHAVNPPPNGS

Query:  GMRAVFLGVPGGKRECAGTGVFLPRQV-GAVSESRKKPACSTVLVPARVMQALNLNLDDMYVQRIQPQPPLQSRSPPVFNAGKNDVCVRVRSECLGSQQK
        GMRAVFLG PGGKRECAGTGVFLPRQ  G VSE RKKPACSTVLVPARVMQALNLNLDDMYVQRIQPQ  LQ+RSP  FNAGKNDV VR+RSE L SQ K
Subjt:  GMRAVFLGVPGGKRECAGTGVFLPRQV-GAVSESRKKPACSTVLVPARVMQALNLNLDDMYVQRIQPQPPLQSRSPPVFNAGKNDVCVRVRSECLGSQQK

Query:  GNLRAAVTVVNHDIRLPQEWTY
         NLR AV  VNHDI LPQEWTY
Subjt:  GNLRAAVTVVNHDIRLPQEWTY

TrEMBL top hitse value%identityAlignment
A0A0A0L2G7 Uncharacterized protein3.6e-18079.91Show/hide
Query:  MAESLDDGEFWLPPKFLNDDDLFLEDKCGGNDVKNGRDGVSSYPFEFPLGFGPFGVTSDLGSPVESLIGSSETESDEEEYIAGLTHQMARSTLEDGFGLD
        MAESLDDGEFWLPPKFLNDDDLF+E+KCGGND+K+GR+GV  YP      FG FG TSDLGSPVESL+GSSETESDEEEYIAGLTH+M RSTLEDGFGLD
Subjt:  MAESLDDGEFWLPPKFLNDDDLFLEDKCGGNDVKNGRDGVSSYPFEFPLGFGPFGVTSDLGSPVESLIGSSETESDEEEYIAGLTHQMARSTLEDGFGLD

Query:  NSHGWGSSGSPQSTLCAVGSGCGCKQGCSRGSPNGHYSQAASQPQLTLDLLYAAAGEVSKMRLNEEAYGLINNRGPLPPPRKPSPVSVPVKNREPDAGVY
        NSH WGSSGSPQSTLCA+GSGCGCKQ  SRGSPNGHY   AS PQLTLDLLYAAAGEVSKMR+NEEAYG IN+ GPL PPRKPSPVSVP+KNREPD  VY
Subjt:  NSHGWGSSGSPQSTLCAVGSGCGCKQGCSRGSPNGHYSQAASQPQLTLDLLYAAAGEVSKMRLNEEAYGLINNRGPLPPPRKPSPVSVPVKNREPDAGVY

Query:  QQLQASQFLHLRRQQLIEQLNSAAVAAAAARVGQAKGCSVRNPQHQHQHQQGPQMPQNRGRNSDFFSGRNCRPA-SGLPSPPTWAAP--RKHAVNPPPNG
        QQLQASQFLHLRRQQLIEQ+NS      AARVGQ KG +VR PQ        PQMPQNRGRN++FF+GRNCR A +GLPS PTW AP  ++H VNPP NG
Subjt:  QQLQASQFLHLRRQQLIEQLNSAAVAAAAARVGQAKGCSVRNPQHQHQHQQGPQMPQNRGRNSDFFSGRNCRPA-SGLPSPPTWAAP--RKHAVNPPPNG

Query:  SGMRAVFLGVPGGKRECAGTGVFLPRQVG-AVSESRKKPACSTVLVPARVMQALNLNLDDMYVQRIQPQPPLQSRSPPVFNAGKNDVCVRVRSECLGSQQ
        SGMRAVFLG PGGKRECAGTGVFLPRQ G A+SE+RKKPACSTVLVPARVMQALNLNLDDMYVQR+ P P LQSRSPPVFNAGKNDV VR RSE L  QQ
Subjt:  SGMRAVFLGVPGGKRECAGTGVFLPRQVG-AVSESRKKPACSTVLVPARVMQALNLNLDDMYVQRIQPQPPLQSRSPPVFNAGKNDVCVRVRSECLGSQQ

Query:  KGNLRAAVTVVNHDIRLPQEWTY
        KGNLRAAV  VNH+I LPQEWTY
Subjt:  KGNLRAAVTVVNHDIRLPQEWTY

A0A1S3BKD4 uncharacterized protein LOC1034908082.3e-17980.09Show/hide
Query:  MAESLDDGEFWLPPKFLNDDDLFLEDKCGGNDVKNGRDGVSSYPFEFPLGFGPFGVTSDLGSPVESLIGSSETESDEEEYIAGLTHQMARSTLEDGFGLD
        MAESLDDGEFWLPPKFLNDDDLF+E+KC GND+KNGR+GV  YP      FG FG TSDLGSPVESL+GSSETESDEEEYIAGLTH++ RSTLEDGFGLD
Subjt:  MAESLDDGEFWLPPKFLNDDDLFLEDKCGGNDVKNGRDGVSSYPFEFPLGFGPFGVTSDLGSPVESLIGSSETESDEEEYIAGLTHQMARSTLEDGFGLD

Query:  NSHGWGSSGSPQSTLCAVGSGCGCKQGCSRGSPNGHYSQAASQPQLTLDLLYAAAGEVSKMRLNEEAYGLINNRGPLPPPRKPSPVSVPVKNREPDAGVY
        NSH WGSSGSPQSTLCA+GSGCGCKQG SRGSPNGHY   AS PQLTLDLLYAAAGEVSKMR+NEE YG IN+ GPL PPRKPSPVSVP+KNREPDA VY
Subjt:  NSHGWGSSGSPQSTLCAVGSGCGCKQGCSRGSPNGHYSQAASQPQLTLDLLYAAAGEVSKMRLNEEAYGLINNRGPLPPPRKPSPVSVPVKNREPDAGVY

Query:  QQLQASQFLHLRRQQLIEQLNSAAVAAAAARVGQAKGCSVRNPQHQHQHQQGPQMPQNRGRNSDFFSGRNCRPA-SGLPSPPTWAA-PRKHAVNPPPNGS
        QQLQASQFLHLRRQQLIEQ+NS      A RVGQ KG SVR+PQ        PQM QNRGRN++FF+GRNCR A +GLPS PTWAA PRKH VNPPPNGS
Subjt:  QQLQASQFLHLRRQQLIEQLNSAAVAAAAARVGQAKGCSVRNPQHQHQHQQGPQMPQNRGRNSDFFSGRNCRPA-SGLPSPPTWAA-PRKHAVNPPPNGS

Query:  GMRAVFLGVPGGKRECAGTGVFLPRQV-GAVSESRKKPACSTVLVPARVMQALNLNLDDMYVQRIQPQPPLQSRSPPVFNAGKNDVCVRVRSECLGSQQK
        GMRAVFLG PGGKRECAGTGVFLPRQ  G V+E+RKKPACSTVLVPARVMQALNLNLDDMYVQRIQP P LQSRSPPV+ AGKNDV VR +SE L  QQK
Subjt:  GMRAVFLGVPGGKRECAGTGVFLPRQV-GAVSESRKKPACSTVLVPARVMQALNLNLDDMYVQRIQPQPPLQSRSPPVFNAGKNDVCVRVRSECLGSQQK

Query:  GNLRAAVTVVNHDIRLPQEWTY
        GNLR AV  VNH+I LPQEWTY
Subjt:  GNLRAAVTVVNHDIRLPQEWTY

A0A5A7TPQ0 Uncharacterized protein2.3e-17980.09Show/hide
Query:  MAESLDDGEFWLPPKFLNDDDLFLEDKCGGNDVKNGRDGVSSYPFEFPLGFGPFGVTSDLGSPVESLIGSSETESDEEEYIAGLTHQMARSTLEDGFGLD
        MAESLDDGEFWLPPKFLNDDDLF+E+KC GND+KNGR+GV  YP      FG FG TSDLGSPVESL+GSSETESDEEEYIAGLTH++ RSTLEDGFGLD
Subjt:  MAESLDDGEFWLPPKFLNDDDLFLEDKCGGNDVKNGRDGVSSYPFEFPLGFGPFGVTSDLGSPVESLIGSSETESDEEEYIAGLTHQMARSTLEDGFGLD

Query:  NSHGWGSSGSPQSTLCAVGSGCGCKQGCSRGSPNGHYSQAASQPQLTLDLLYAAAGEVSKMRLNEEAYGLINNRGPLPPPRKPSPVSVPVKNREPDAGVY
        NSH WGSSGSPQSTLCA+GSGCGCKQG SRGSPNGHY   AS PQLTLDLLYAAAGEVSKMR+NEE YG IN+ GPL PPRKPSPVSVP+KNREPDA VY
Subjt:  NSHGWGSSGSPQSTLCAVGSGCGCKQGCSRGSPNGHYSQAASQPQLTLDLLYAAAGEVSKMRLNEEAYGLINNRGPLPPPRKPSPVSVPVKNREPDAGVY

Query:  QQLQASQFLHLRRQQLIEQLNSAAVAAAAARVGQAKGCSVRNPQHQHQHQQGPQMPQNRGRNSDFFSGRNCRPA-SGLPSPPTWAA-PRKHAVNPPPNGS
        QQLQASQFLHLRRQQLIEQ+NS      A RVGQ KG SVR+PQ        PQM QNRGRN++FF+GRNCR A +GLPS PTWAA PRKH VNPPPNGS
Subjt:  QQLQASQFLHLRRQQLIEQLNSAAVAAAAARVGQAKGCSVRNPQHQHQHQQGPQMPQNRGRNSDFFSGRNCRPA-SGLPSPPTWAA-PRKHAVNPPPNGS

Query:  GMRAVFLGVPGGKRECAGTGVFLPRQV-GAVSESRKKPACSTVLVPARVMQALNLNLDDMYVQRIQPQPPLQSRSPPVFNAGKNDVCVRVRSECLGSQQK
        GMRAVFLG PGGKRECAGTGVFLPRQ  G V+E+RKKPACSTVLVPARVMQALNLNLDDMYVQRIQP P LQSRSPPV+ AGKNDV VR +SE L  QQK
Subjt:  GMRAVFLGVPGGKRECAGTGVFLPRQV-GAVSESRKKPACSTVLVPARVMQALNLNLDDMYVQRIQPQPPLQSRSPPVFNAGKNDVCVRVRSECLGSQQK

Query:  GNLRAAVTVVNHDIRLPQEWTY
        GNLR AV  VNH+I LPQEWTY
Subjt:  GNLRAAVTVVNHDIRLPQEWTY

A0A6J1DF19 uncharacterized protein LOC1110194792.3e-24098.81Show/hide
Query:  MAESLDDGEFWLPPKFLNDDDLFLEDKCGGNDVKNGRDGVSSYPFEFPLGFGPFGVTSDLGSPVESLIGSSETESDEEEYIAGLTHQMARSTLEDGFGLD
        MAESLDDGEFWLPPKFLNDDDLFLEDKCGGNDVKNGRDGVSSYPFEFPLGFGPFGVTSDLGSPVESLIGSSETESDEEEYIAGLTHQMARSTLEDGFGLD
Subjt:  MAESLDDGEFWLPPKFLNDDDLFLEDKCGGNDVKNGRDGVSSYPFEFPLGFGPFGVTSDLGSPVESLIGSSETESDEEEYIAGLTHQMARSTLEDGFGLD

Query:  NSHGWGSSGSPQSTLCAVGSGCGCKQGCSRGSPNGHYSQAASQPQLTLDLLYAAAGEVSKMRLNEEAYGLINNRGPLPPPRKPSPVSVPVKNREPDAGVY
        NSHGWGSSGSPQSTLCAVGSGCGCKQGCSRGSPNGHYSQAASQPQLTLDLLYAAAGEVSKMRLNEEAYGLINNRGPLPPPRKPSPVSVPVKNREPDAGVY
Subjt:  NSHGWGSSGSPQSTLCAVGSGCGCKQGCSRGSPNGHYSQAASQPQLTLDLLYAAAGEVSKMRLNEEAYGLINNRGPLPPPRKPSPVSVPVKNREPDAGVY

Query:  QQLQASQFLHLRRQQLIEQLNSAAVAAAAARVGQAKGCSVRNPQHQHQHQQGPQMPQNRGRNSDFFSGRNCRPASGLPSPPTWAAPRKHAVNPPPNGSGM
        QQLQASQFLHLRRQQL+EQLNSAAVAAAAARVGQ+KGCSVRN QH+HQHQQGPQMPQNRGRNSDFFSGRNCRPASGLPSPPTWAAPRKHAVNPPPNGSGM
Subjt:  QQLQASQFLHLRRQQLIEQLNSAAVAAAAARVGQAKGCSVRNPQHQHQHQQGPQMPQNRGRNSDFFSGRNCRPASGLPSPPTWAAPRKHAVNPPPNGSGM

Query:  RAVFLGVPGGKRECAGTGVFLPRQVGAVSESRKKPACSTVLVPARVMQALNLNLDDMYVQRIQPQPPLQSRSPPVFNAGKNDVCVRVRSECLGSQQKGNL
        RAVFLGVPGGKRECAGTGVFLPRQVGAVSESRKKPACSTVLVPARVMQALNLNLDDMYVQRIQPQPPLQSRSPPVFNAGKNDVCVRVRSECLGSQQKGNL
Subjt:  RAVFLGVPGGKRECAGTGVFLPRQVGAVSESRKKPACSTVLVPARVMQALNLNLDDMYVQRIQPQPPLQSRSPPVFNAGKNDVCVRVRSECLGSQQKGNL

Query:  RAAVTVVNHDIRLPQEWTY
        RAAV VVNHDIRLPQEWTY
Subjt:  RAAVTVVNHDIRLPQEWTY

A0A6J1EV17 uncharacterized protein LOC1114382295.7e-17076.47Show/hide
Query:  MAESLDDGEFWLPPKFLNDDDLFLEDKCGGNDVKNGRDGVSS--YPFEFPLGFGPFGVTSDLGSPVESLIGSSETESDEEEYIAGLTHQMARSTLEDGFG
        MAESLDDGEFWLPPKFLNDDDLFLE+KC GND K  RDG  +  +PFE+ LGFGPFGV+SDLGSPVESLIGSSETESDE+EYIAGL  QMARSTL+DGFG
Subjt:  MAESLDDGEFWLPPKFLNDDDLFLEDKCGGNDVKNGRDGVSS--YPFEFPLGFGPFGVTSDLGSPVESLIGSSETESDEEEYIAGLTHQMARSTLEDGFG

Query:  LDNSHGWGSSGSPQSTLCAVGSGCGCKQGCSRGSPNGHYSQAASQPQLTLDLLYAAAGEVSKMRLNEEAYGLINNRGPLPPPRKPSPVSVPVKNREPDAG
        L+ SHGW SSGSPQSTLC VG+GCGCKQ  SRGSPN H     S PQLTLDLLYAAAGEVSKMR+NEEAY  +NNRG   PPRKPSPV+VP+KNR+ DAG
Subjt:  LDNSHGWGSSGSPQSTLCAVGSGCGCKQGCSRGSPNGHYSQAASQPQLTLDLLYAAAGEVSKMRLNEEAYGLINNRGPLPPPRKPSPVSVPVKNREPDAG

Query:  VYQQLQASQFLHLRRQQLIEQLNSAAVAAAAARVGQAKGCSVRNPQHQHQHQQGPQMPQNRGRNSDFFSGRNCRPASGLPSPPTWA--APRKHAVNPPPN
        VYQQLQASQFLHL+RQQLIEQ+NS      AARVG   G SVR+PQ         Q+PQNRGRNS+FF GRNCR A GL S P WA   PRKH+VNP PN
Subjt:  VYQQLQASQFLHLRRQQLIEQLNSAAVAAAAARVGQAKGCSVRNPQHQHQHQQGPQMPQNRGRNSDFFSGRNCRPASGLPSPPTWA--APRKHAVNPPPN

Query:  GSGMRAVFLGVPGGKRECAGTGVFLPRQVGAVSESRKKPACSTVLVPARVMQALNLNLDDMYVQRIQPQPPLQSRSPPVFNAGKNDVCVRVRSECLGSQQ
        GSGMRAVFLGVPGGKRECAGTGVFLPRQ+GAVSE+RKKPACSTVLVPARVMQALNLNLDDMYVQR QPQ  LQSRSP VFN GKND+  R RSE L +QQ
Subjt:  GSGMRAVFLGVPGGKRECAGTGVFLPRQVGAVSESRKKPACSTVLVPARVMQALNLNLDDMYVQRIQPQPPLQSRSPPVFNAGKNDVCVRVRSECLGSQQ

Query:  KG-NLRAAVTVVNHDI-RLPQEWTY
        K  NLRAAV  VN +I RLPQEW+Y
Subjt:  KG-NLRAAVTVVNHDI-RLPQEWTY

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G39870.1 unknown protein1.2e-0726.37Show/hide
Query:  YPFEFPLGFGPFGVTSDLGSPVESLIGSSETESDEEEYIAGLTHQMARST--LEDGFGLDNSHGWGSSGSPQSTLCAVGSGCGCKQGCSRGS--PNGHYS
        +P EFP  F     +    SP +S     E+  DEE+++AGLT ++A ST  L             ++ SPQSTL  +GS        SR    P+    
Subjt:  YPFEFPLGFGPFGVTSDLGSPVESLIGSSETESDEEEYIAGLTHQMARST--LEDGFGLDNSHGWGSSGSPQSTLCAVGSGCGCKQGCSRGS--PNGHYS

Query:  QAASQPQLTLDLLYAAAGEVSKMRLNEEAYGLINNRGPLPPPRKPSPVSVPVKNREPDAGVYQQLQASQFLHLRRQQLIEQLNSAAVAAAAARVGQAKGC
         ++ +     D++ AAAGEV++++L           G   P   P      +  R+ +A ++ +LQ         Q+LIEQ+    + +A +R   ++  
Subjt:  QAASQPQLTLDLLYAAAGEVSKMRLNEEAYGLINNRGPLPPPRKPSPVSVPVKNREPDAGVYQQLQASQFLHLRRQQLIEQLNSAAVAAAAARVGQAKGC

Query:  SVRNPQHQHQHQQGPQMPQNRGRNSDFFSGRNCRPASGLPSPPTWAAPRKHAVNPPPNGSGMRAVFLGVPGGKRECAGTGVFLPRQV--GAVSESRKKPA
          R P+     +   + P+   RN+                 PTW  P++ A                    KR  AGTGVFLPR+    A S+S K P 
Subjt:  SVRNPQHQHQHQQGPQMPQNRGRNSDFFSGRNCRPASGLPSPPTWAAPRKHAVNPPPNGSGMRAVFLGVPGGKRECAGTGVFLPRQV--GAVSESRKKPA

Query:  CSTVLVPARVMQALNLNLDDMYVQRIQPQPPLQSRSPPVFNAGKNDVCVRVRSECLGSQQKGNLRAAVTVVNHDIRLPQEWTY
         +  ++  +V +  NLN D+ +   + P+     RS   +       C+  RS  L  Q  GN RA          LPQ+W Y
Subjt:  CSTVLVPARVMQALNLNLDDMYVQRIQPQPPLQSRSPPVFNAGKNDVCVRVRSECLGSQQKGNLRAAVTVVNHDIRLPQEWTY

AT3G54000.1 CONTAINS InterPro DOMAIN/s: Uncharacterised conserved protein UCP022260 (InterPro:IPR016802); Has 94 Blast hits to 94 proteins in 14 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 94; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink).1.1e-3234.4Show/hide
Query:  LDDGEFWLPPKFLNDDDLFLEDKCGGNDVKNGRDGVSSYPFEFPLGFGPFGVTSDLGSPVESLIGSSETESDEEEYIAGLTHQMARSTLEDGF--GLDNS
        +DD EFWLP +FL DDD  +E +   N V  G D  S +P+E   GFG FG T         +  ++  E DEE ++AGLT QM  S+L+D F  G+  +
Subjt:  LDDGEFWLPPKFLNDDDLFLEDKCGGNDVKNGRDGVSSYPFEFPLGFGPFGVTSDLGSPVESLIGSSETESDEEEYIAGLTHQMARSTLEDGF--GLDNS

Query:  H---------GWGSSGSPQSTLCAVGSGCGCKQGCSRGSPNGHYSQAASQPQLTLDLLYAAAGEVSKMRLNEEAYGLINNRGPLPPPRKPSPVSVPVKNR
        H          W  + SP    C  G+GC C         N  ++Q  +    + DL  AA     +M +N+E Y   + RG L  P K   +S  VKN 
Subjt:  H---------GWGSSGSPQSTLCAVGSGCGCKQGCSRGSPNGHYSQAASQPQLTLDLLYAAAGEVSKMRLNEEAYGLINNRGPLPPPRKPSPVSVPVKNR

Query:  EPDAG---------VYQQLQASQFLHLRRQQLIEQLNSAAVAAAAARVGQAKGCSVRNPQHQHQHQQGPQMPQNRGRNSDFFSGRNCRPASGLPSPPTWA
          +            YQ+LQA QF  L++QQL+                            +H+ Q    + QNRG   +    +N  P     S   W+
Subjt:  EPDAG---------VYQQLQASQFLHLRRQQLIEQLNSAAVAAAAARVGQAKGCSVRNPQHQHQHQQGPQMPQNRGRNSDFFSGRNCRPASGLPSPPTWA

Query:  APRKHAVNPPPNGSGMRAVFLGVPGGKRECAGTGVFLPRQVGAVS--ESRKKPACSTVLVPARVMQALNLNLDDMYVQRIQPQPPLQSRSPPVFNAGKND
               N  P    MRAVF+G   GKR   GTGVFLPR V   S  E+R+KP  STVLVPAR+ Q LNLNL +          P++S       A  ND
Subjt:  APRKHAVNPPPNGSGMRAVFLGVPGGKRECAGTGVFLPRQVGAVS--ESRKKPACSTVLVPARVMQALNLNLDDMYVQRIQPQPPLQSRSPPVFNAGKND

Query:  VCVRVRSECLG--SQQKGNLRAAVTVVNHDIRLPQEWTY
        V  R RS   G  SQ  G +RA  +V   + RLP EW Y
Subjt:  VCVRVRSECLG--SQQKGNLRAAVTVVNHDIRLPQEWTY

AT3G54000.2 unknown protein1.3e-2032.49Show/hide
Query:  LDDGEFWLPPKFLNDDDLFLEDKCGGNDVKNGRDGVSSYPFEFPLGFGPFGVTSDLGSPVESLIGSSETESDEEEYIAGLTHQMARSTLEDGF--GLDNS
        +DD EFWLP +FL DDD  +E +   N V  G D  S +P+E   GFG FG T         +  ++  E DEE ++AGLT QM  S+L+D F  G+  +
Subjt:  LDDGEFWLPPKFLNDDDLFLEDKCGGNDVKNGRDGVSSYPFEFPLGFGPFGVTSDLGSPVESLIGSSETESDEEEYIAGLTHQMARSTLEDGF--GLDNS

Query:  H---------GWGSSGSPQSTLCAVGSGCGCKQGCSRGSPNGHYSQAASQPQLTLDLLYAAAGEVSKMRLNEEAYGLINNRGPLPPPRKPSPVSVPVKNR
        H          W  + SP    C  G+GC C         N  ++Q  +    + DL  AA     +M +N+E Y   + RG L  P K   +S  VKN 
Subjt:  H---------GWGSSGSPQSTLCAVGSGCGCKQGCSRGSPNGHYSQAASQPQLTLDLLYAAAGEVSKMRLNEEAYGLINNRGPLPPPRKPSPVSVPVKNR

Query:  EPDAG---------VYQQLQASQFLHLRRQQLIEQLNSAAVAAAAARVGQAKGCSVRNPQHQHQHQQGPQMPQNRGRNSDFFSGRNCRPASGLPSPPTWA
          +            YQ+LQA QF  L++QQL+                            +H+ Q    + QNRG   +    +N  P     S   W+
Subjt:  EPDAG---------VYQQLQASQFLHLRRQQLIEQLNSAAVAAAAARVGQAKGCSVRNPQHQHQHQQGPQMPQNRGRNSDFFSGRNCRPASGLPSPPTWA

Query:  APRKHAVNPPPNGSGMRAVFLGVPGGKRECAGTGVFLPRQVGAVS--ESRKKPA
               N  P    MRAVF+G   GKR   GTGVFLPR V   S  E+R+KP+
Subjt:  APRKHAVNPPPNGSGMRAVFLGVPGGKRECAGTGVFLPRQVGAVS--ESRKKPA

AT3G54000.3 unknown protein1.3e-2032.49Show/hide
Query:  LDDGEFWLPPKFLNDDDLFLEDKCGGNDVKNGRDGVSSYPFEFPLGFGPFGVTSDLGSPVESLIGSSETESDEEEYIAGLTHQMARSTLEDGF--GLDNS
        +DD EFWLP +FL DDD  +E +   N V  G D  S +P+E   GFG FG T         +  ++  E DEE ++AGLT QM  S+L+D F  G+  +
Subjt:  LDDGEFWLPPKFLNDDDLFLEDKCGGNDVKNGRDGVSSYPFEFPLGFGPFGVTSDLGSPVESLIGSSETESDEEEYIAGLTHQMARSTLEDGF--GLDNS

Query:  H---------GWGSSGSPQSTLCAVGSGCGCKQGCSRGSPNGHYSQAASQPQLTLDLLYAAAGEVSKMRLNEEAYGLINNRGPLPPPRKPSPVSVPVKNR
        H          W  + SP    C  G+GC C         N  ++Q  +    + DL  AA     +M +N+E Y   + RG L  P K   +S  VKN 
Subjt:  H---------GWGSSGSPQSTLCAVGSGCGCKQGCSRGSPNGHYSQAASQPQLTLDLLYAAAGEVSKMRLNEEAYGLINNRGPLPPPRKPSPVSVPVKNR

Query:  EPDAG---------VYQQLQASQFLHLRRQQLIEQLNSAAVAAAAARVGQAKGCSVRNPQHQHQHQQGPQMPQNRGRNSDFFSGRNCRPASGLPSPPTWA
          +            YQ+LQA QF  L++QQL+                            +H+ Q    + QNRG   +    +N  P     S   W+
Subjt:  EPDAG---------VYQQLQASQFLHLRRQQLIEQLNSAAVAAAAARVGQAKGCSVRNPQHQHQHQQGPQMPQNRGRNSDFFSGRNCRPASGLPSPPTWA

Query:  APRKHAVNPPPNGSGMRAVFLGVPGGKRECAGTGVFLPRQVGAVS--ESRKKPA
               N  P    MRAVF+G   GKR   GTGVFLPR V   S  E+R+KP+
Subjt:  APRKHAVNPPPNGSGMRAVFLGVPGGKRECAGTGVFLPRQVGAVS--ESRKKPA

AT5G59050.1 unknown protein8.8e-1440.31Show/hide
Query:  SGMRAVFLGVPGGKRECAGTGVFLPRQVGAVSESRKKPACSTVLVPARVMQALNLNLDDMYVQRI--QPQPP-----LQSRSPPVFNAGKNDVCVRVRSE
        SG++AVF+   G +    GTGVFLPR  G V ESRKK  CSTV++PARV++AL ++ D + V        PP     L S +     + KN    RV+S 
Subjt:  SGMRAVFLGVPGGKRECAGTGVFLPRQVGAVSESRKKPACSTVLVPARVMQALNLNLDDMYVQRI--QPQPP-----LQSRSPPVFNAGKNDVCVRVRSE

Query:  CLGSQQKGNLRAAVTVVNHDIRLPQEWTY
          GS  +  + +A +       LPQEWTY
Subjt:  CLGSQQKGNLRAAVTVVNHDIRLPQEWTY


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTGAGAGTTTGGACGACGGCGAGTTCTGGCTTCCTCCCAAGTTCCTTAACGACGACGACTTGTTCTTGGAGGACAAGTGTGGGGGAAATGATGTTAAGAATGGGAG
GGATGGAGTTAGCTCCTACCCGTTTGAGTTCCCTCTCGGATTTGGGCCTTTTGGGGTTACTTCTGATCTCGGCTCGCCGGTTGAATCTCTCATTGGTTCCAGCGAGACCG
AGAGCGATGAGGAGGAATACATCGCTGGATTGACTCACCAAATGGCGCGTTCCACGCTGGAGGATGGATTTGGCCTCGACAACTCTCACGGTTGGGGCTCTTCTGGTTCT
CCACAGTCAACGCTGTGCGCTGTGGGAAGTGGGTGCGGCTGCAAACAGGGCTGTAGCAGAGGAAGCCCCAACGGTCATTATTCCCAAGCTGCTTCTCAGCCGCAGCTCAC
TTTGGATCTGCTCTACGCCGCGGCCGGCGAAGTCTCCAAGATGCGTCTGAATGAAGAAGCCTACGGCTTAATCAACAACCGTGGACCCCTCCCTCCGCCGAGAAAGCCCT
CTCCCGTCTCTGTTCCTGTAAAAAACCGGGAACCCGACGCCGGAGTTTACCAGCAGCTCCAGGCATCTCAATTTTTACATCTGCGACGGCAACAGCTCATCGAGCAACTG
AACTCTGCGGCGGTGGCGGCGGCGGCTGCTCGGGTGGGGCAGGCAAAGGGCTGCTCTGTTAGAAACCCTCAGCACCAGCACCAGCACCAGCAGGGTCCCCAAATGCCGCA
GAACAGAGGAAGAAACAGCGATTTCTTCAGTGGCAGAAACTGTCGCCCTGCTTCTGGCTTACCGTCCCCTCCCACTTGGGCTGCTCCACGGAAACACGCCGTGAATCCCC
CGCCGAACGGCTCCGGCATGAGGGCGGTGTTTCTCGGCGTTCCGGGCGGCAAGAGGGAATGCGCCGGCACTGGCGTGTTTTTGCCTCGGCAAGTCGGCGCCGTCTCTGAA
TCCCGCAAGAAGCCAGCTTGCTCGACTGTGTTAGTTCCTGCGAGAGTGATGCAAGCTCTGAATCTAAACTTAGACGACATGTACGTTCAACGAATCCAGCCCCAGCCTCC
TCTTCAATCCCGTTCCCCTCCAGTTTTCAACGCAGGGAAGAACGATGTTTGTGTGAGAGTCCGCAGCGAGTGTTTGGGATCTCAGCAAAAGGGGAATCTCCGGGCGGCGG
TGACGGTGGTGAACCATGACATCCGGCTACCACAGGAGTGGACTTAC
mRNA sequenceShow/hide mRNA sequence
ATGGCTGAGAGTTTGGACGACGGCGAGTTCTGGCTTCCTCCCAAGTTCCTTAACGACGACGACTTGTTCTTGGAGGACAAGTGTGGGGGAAATGATGTTAAGAATGGGAG
GGATGGAGTTAGCTCCTACCCGTTTGAGTTCCCTCTCGGATTTGGGCCTTTTGGGGTTACTTCTGATCTCGGCTCGCCGGTTGAATCTCTCATTGGTTCCAGCGAGACCG
AGAGCGATGAGGAGGAATACATCGCTGGATTGACTCACCAAATGGCGCGTTCCACGCTGGAGGATGGATTTGGCCTCGACAACTCTCACGGTTGGGGCTCTTCTGGTTCT
CCACAGTCAACGCTGTGCGCTGTGGGAAGTGGGTGCGGCTGCAAACAGGGCTGTAGCAGAGGAAGCCCCAACGGTCATTATTCCCAAGCTGCTTCTCAGCCGCAGCTCAC
TTTGGATCTGCTCTACGCCGCGGCCGGCGAAGTCTCCAAGATGCGTCTGAATGAAGAAGCCTACGGCTTAATCAACAACCGTGGACCCCTCCCTCCGCCGAGAAAGCCCT
CTCCCGTCTCTGTTCCTGTAAAAAACCGGGAACCCGACGCCGGAGTTTACCAGCAGCTCCAGGCATCTCAATTTTTACATCTGCGACGGCAACAGCTCATCGAGCAACTG
AACTCTGCGGCGGTGGCGGCGGCGGCTGCTCGGGTGGGGCAGGCAAAGGGCTGCTCTGTTAGAAACCCTCAGCACCAGCACCAGCACCAGCAGGGTCCCCAAATGCCGCA
GAACAGAGGAAGAAACAGCGATTTCTTCAGTGGCAGAAACTGTCGCCCTGCTTCTGGCTTACCGTCCCCTCCCACTTGGGCTGCTCCACGGAAACACGCCGTGAATCCCC
CGCCGAACGGCTCCGGCATGAGGGCGGTGTTTCTCGGCGTTCCGGGCGGCAAGAGGGAATGCGCCGGCACTGGCGTGTTTTTGCCTCGGCAAGTCGGCGCCGTCTCTGAA
TCCCGCAAGAAGCCAGCTTGCTCGACTGTGTTAGTTCCTGCGAGAGTGATGCAAGCTCTGAATCTAAACTTAGACGACATGTACGTTCAACGAATCCAGCCCCAGCCTCC
TCTTCAATCCCGTTCCCCTCCAGTTTTCAACGCAGGGAAGAACGATGTTTGTGTGAGAGTCCGCAGCGAGTGTTTGGGATCTCAGCAAAAGGGGAATCTCCGGGCGGCGG
TGACGGTGGTGAACCATGACATCCGGCTACCACAGGAGTGGACTTAC
Protein sequenceShow/hide protein sequence
MAESLDDGEFWLPPKFLNDDDLFLEDKCGGNDVKNGRDGVSSYPFEFPLGFGPFGVTSDLGSPVESLIGSSETESDEEEYIAGLTHQMARSTLEDGFGLDNSHGWGSSGS
PQSTLCAVGSGCGCKQGCSRGSPNGHYSQAASQPQLTLDLLYAAAGEVSKMRLNEEAYGLINNRGPLPPPRKPSPVSVPVKNREPDAGVYQQLQASQFLHLRRQQLIEQL
NSAAVAAAAARVGQAKGCSVRNPQHQHQHQQGPQMPQNRGRNSDFFSGRNCRPASGLPSPPTWAAPRKHAVNPPPNGSGMRAVFLGVPGGKRECAGTGVFLPRQVGAVSE
SRKKPACSTVLVPARVMQALNLNLDDMYVQRIQPQPPLQSRSPPVFNAGKNDVCVRVRSECLGSQQKGNLRAAVTVVNHDIRLPQEWTY