CuGenDBv2

Lsi01G014970 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene ID: Lsi01G014970
Organism: Lagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
Description: TIP41-like protein
Genome location: chr01:13039716..13042182
RNA-Seq Expression: Lsi01G014970
Synteny: Lsi01G014970
Gene Ontology terms: NA
InterPro domains: NA
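
The genome location above can be split into its parts as in the following minimal sketch. It assumes the coordinates are 1-based and inclusive, which the page does not state; `parse_location` and `span_length` are hypothetical helper names, not part of CuGenDB.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    chrom = match.group(1)
    start = int(match.group(2))
    end = int(match.group(3))
    if start > end:
        raise ValueError("start coordinate is greater than end coordinate")
    return chrom, start, end


def span_length(start: int, end: int) -> int:
    """Length of the span, assuming 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end = parse_location("chr01:13039716..13042182")
print(chrom, start, end, span_length(start, end))
# prints: chr01 13039716 13042182 2467
```

Under that assumption the gene spans 2467 bp on chr01; the span includes any introns and untranslated regions, so it is longer than the CDS.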


Homology
GenBank top hits (e value, % identity, alignment)
XP_004147909.1 uncharacterized protein LOC101214270 [Cucumis sativus] (e value: 4.4e-206, % identity: 90.12)
Query:  MAESLDDGEFWLPPKFLNDDDLFLEEKCRGNDVKNGRHGVGLYPFEFALGFGPFGCTSDLGSPVESLVGSSETESDEEEYIAGLTHQMTRSTLEDGFGLD
        MAESLDDGEFWLPPKFLNDDDLF+EEKC GND+K+GR+GVGLYP      FG FG TSDLGSPVESLVGSSETESDEEEYIAGLTH+MTRSTLEDGFGLD
Subjt:  MAESLDDGEFWLPPKFLNDDDLFLEEKCRGNDVKNGRHGVGLYPFEFALGFGPFGCTSDLGSPVESLVGSSETESDEEEYIAGLTHQMTRSTLEDGFGLD

Query:  NSHVWGSSGSPQSTLCAMGSGCGCKQGSSRGSPNGHYQASHPQLTLDLLYAAAGEVSKMRMNEEAYGFINSRGPLAPPRKPSPVSVPLKNREPDAEVYQQ
        NSHVWGSSGSPQSTLCAMGSGCGCKQ SSRGSPNGHYQASHPQLTLDLLYAAAGEVSKMRMNEEAYGFINS GPLAPPRKPSPVSVPLKNREPD EVYQQ
Subjt:  NSHVWGSSGSPQSTLCAMGSGCGCKQGSSRGSPNGHYQASHPQLTLDLLYAAAGEVSKMRMNEEAYGFINSRGPLAPPRKPSPVSVPLKNREPDAEVYQQ

Query:  LQASQFLHLRRQQLIEQMNSVGRVGQTKGCVRHPQPQMPQNRGRNNEFFNGRNCRSATAGLASQPTWAAPPRK-HAVNPPPNGSGMRAVFLGAPGGKREC
        LQASQFLHLRRQQLIEQMNS  RVGQTKG VR PQPQMPQNRGRNNEFFNGRNCRSAT GL SQPTW APPRK H VNPP NGSGMRAVFLGAPGGKREC
Subjt:  LQASQFLHLRRQQLIEQMNSVGRVGQTKGCVRHPQPQMPQNRGRNNEFFNGRNCRSATAGLASQPTWAAPPRK-HAVNPPPNGSGMRAVFLGAPGGKREC

Query:  AGTGVFLPRQAGGTISETRKKPACSTVLVPARVMQALNLNLDDMYVQRIQPQQLQSRSPPVFNAGKNDVSVRIRNETLVSQQKANLRAAVPTVNHDIGLP
        AGTGVFLPRQAG  ISETRKKPACSTVLVPARVMQALNLNLDDMYVQR+ P QLQSRSPPVFNAGKNDVSVR R+E+L  QQK NLRAAVP VNH+IGLP
Subjt:  AGTGVFLPRQAGGTISETRKKPACSTVLVPARVMQALNLNLDDMYVQRIQPQQLQSRSPPVFNAGKNDVSVRIRNETLVSQQKANLRAAVPTVNHDIGLP

Query:  QEWTY
        QEWTY
Subjt:  QEWTY

XP_008448729.1 PREDICTED: uncharacterized protein LOC103490808 [Cucumis melo] (e value: 1.2e-208, % identity: 90.59)
Query:  MAESLDDGEFWLPPKFLNDDDLFLEEKCRGNDVKNGRHGVGLYPFEFALGFGPFGCTSDLGSPVESLVGSSETESDEEEYIAGLTHQMTRSTLEDGFGLD
        MAESLDDGEFWLPPKFLNDDDLF+EEKC GND+KNGR+GVGLYP      FG FG TSDLGSPVESLVGSSETESDEEEYIAGLTH++TRSTLEDGFGLD
Subjt:  MAESLDDGEFWLPPKFLNDDDLFLEEKCRGNDVKNGRHGVGLYPFEFALGFGPFGCTSDLGSPVESLVGSSETESDEEEYIAGLTHQMTRSTLEDGFGLD

Query:  NSHVWGSSGSPQSTLCAMGSGCGCKQGSSRGSPNGHYQASHPQLTLDLLYAAAGEVSKMRMNEEAYGFINSRGPLAPPRKPSPVSVPLKNREPDAEVYQQ
        NSHVWGSSGSPQSTLCAMGSGCGCKQGSSRGSPNGHYQASHPQLTLDLLYAAAGEVSKMRMNEE YGFINS GPLAPPRKPSPVSVPLKNREPDAEVYQQ
Subjt:  NSHVWGSSGSPQSTLCAMGSGCGCKQGSSRGSPNGHYQASHPQLTLDLLYAAAGEVSKMRMNEEAYGFINSRGPLAPPRKPSPVSVPLKNREPDAEVYQQ

Query:  LQASQFLHLRRQQLIEQMNSVGRVGQTKGCVRHPQPQMPQNRGRNNEFFNGRNCRSATAGLASQPTWAAPPRKHAVNPPPNGSGMRAVFLGAPGGKRECA
        LQASQFLHLRRQQLIEQMNS  RVGQTKG VRHPQPQM QNRGRNNEFFNGRNCRSAT GL SQPTWAAPPRKH VNPPPNGSGMRAVFLGAPGGKRECA
Subjt:  LQASQFLHLRRQQLIEQMNSVGRVGQTKGCVRHPQPQMPQNRGRNNEFFNGRNCRSATAGLASQPTWAAPPRKHAVNPPPNGSGMRAVFLGAPGGKRECA

Query:  GTGVFLPRQAGGTISETRKKPACSTVLVPARVMQALNLNLDDMYVQRIQPQQLQSRSPPVFNAGKNDVSVRIRNETLVSQQKANLRAAVPTVNHDIGLPQ
        GTGVFLPRQAGGT++ETRKKPACSTVLVPARVMQALNLNLDDMYVQRIQP QLQSRSPPV+ AGKNDVSVR ++E+L  QQK NLR AVP VNH+IGLPQ
Subjt:  GTGVFLPRQAGGTISETRKKPACSTVLVPARVMQALNLNLDDMYVQRIQPQQLQSRSPPVFNAGKNDVSVRIRNETLVSQQKANLRAAVPTVNHDIGLPQ

Query:  EWTY
        EWTY
Subjt:  EWTY

XP_022151566.1 uncharacterized protein LOC111019479 [Momordica charantia] (e value: 1.2e-192, % identity: 83.65)
Query:  MAESLDDGEFWLPPKFLNDDDLFLEEKCRGNDVKNGRHGVGLYPFEFALGFGPFGCTSDLGSPVESLVGSSETESDEEEYIAGLTHQMTRSTLEDGFGLD
        MAESLDDGEFWLPPKFLNDDDLFLE+KC GNDVKNGR GV  YPFEF LGFGPFG TSDLGSPVESL+GSSETESDEEEYIAGLTHQM RSTLEDGFGLD
Subjt:  MAESLDDGEFWLPPKFLNDDDLFLEEKCRGNDVKNGRHGVGLYPFEFALGFGPFGCTSDLGSPVESLVGSSETESDEEEYIAGLTHQMTRSTLEDGFGLD

Query:  NSHVWGSSGSPQSTLCAMGSGCGCKQGSSRGSPNGHYQ--ASHPQLTLDLLYAAAGEVSKMRMNEEAYGFINSRGPLAPPRKPSPVSVPLKNREPDAEVY
        NSH WGSSGSPQSTLCA+GSGCGCKQG SRGSPNGHY   AS PQLTLDLLYAAAGEVSKMR+NEEAYG IN+RGPL PPRKPSPVSVP+KNREPDA VY
Subjt:  NSHVWGSSGSPQSTLCAMGSGCGCKQGSSRGSPNGHYQ--ASHPQLTLDLLYAAAGEVSKMRMNEEAYGFINSRGPLAPPRKPSPVSVPLKNREPDAEVY

Query:  QQLQASQFLHLRRQQLIEQMNS------VGRVGQTKGC-VRHPQ--------PQMPQNRGRNNEFFNGRNCRSATAGLASQPTWAAPPRKHAVNPPPNGS
        QQLQASQFLHLRRQQL+EQ+NS        RVGQ+KGC VR+ Q        PQMPQNRGRN++FF+GRNCR A +GL S PTWAA PRKHAVNPPPNGS
Subjt:  QQLQASQFLHLRRQQLIEQMNS------VGRVGQTKGC-VRHPQ--------PQMPQNRGRNNEFFNGRNCRSATAGLASQPTWAAPPRKHAVNPPPNGS

Query:  GMRAVFLGAPGGKRECAGTGVFLPRQAGGTISETRKKPACSTVLVPARVMQALNLNLDDMYVQRIQPQ-QLQSRSPPVFNAGKNDVSVRIRNETLVSQQK
        GMRAVFLG PGGKRECAGTGVFLPRQ  G +SE+RKKPACSTVLVPARVMQALNLNLDDMYVQRIQPQ  LQSRSPPVFNAGKNDV VR+R+E L SQQK
Subjt:  GMRAVFLGAPGGKRECAGTGVFLPRQAGGTISETRKKPACSTVLVPARVMQALNLNLDDMYVQRIQPQ-QLQSRSPPVFNAGKNDVSVRIRNETLVSQQK

Query:  ANLRAAVPTVNHDIGLPQEWTY
         NLRAAVP VNHDI LPQEWTY
Subjt:  ANLRAAVPTVNHDIGLPQEWTY

XP_022931961.1 uncharacterized protein LOC111438229 [Cucurbita moschata] (e value: 3.5e-179, % identity: 79.95)
Query:  MAESLDDGEFWLPPKFLNDDDLFLEEKCRGNDVKNGRH--GVGLYPFEFALGFGPFGCTSDLGSPVESLVGSSETESDEEEYIAGLTHQMTRSTLEDGFG
        MAESLDDGEFWLPPKFLNDDDLFLEEKC GND K  R   G GL+PFE++LGFGPFG +SDLGSPVESL+GSSETESDE+EYIAGL  QM RSTL+DGFG
Subjt:  MAESLDDGEFWLPPKFLNDDDLFLEEKCRGNDVKNGRH--GVGLYPFEFALGFGPFGCTSDLGSPVESLVGSSETESDEEEYIAGLTHQMTRSTLEDGFG

Query:  LDNSHVWGSSGSPQSTLCAMGSGCGCKQGSSRGSPNGHYQASHPQLTLDLLYAAAGEVSKMRMNEEAYGFINSRGPLAPPRKPSPVSVPLKNREPDAEVY
        L+ SH W SSGSPQSTLC +G+GCGCKQ SSRGSPN H   SHPQLTLDLLYAAAGEVSKMRMNEEAY F+N+RG   PPRKPSPV+VPLKNR+ DA VY
Subjt:  LDNSHVWGSSGSPQSTLCAMGSGCGCKQGSSRGSPNGHYQASHPQLTLDLLYAAAGEVSKMRMNEEAYGFINSRGPLAPPRKPSPVSVPLKNREPDAEVY

Query:  QQLQASQFLHLRRQQLIEQMNSVGRVGQTKGCVRHPQPQMPQNRGRNNEFFNGRNCRSATAGLASQPTWA-APPRKHAVNPPPNGSGMRAVFLGAPGGKR
        QQLQASQFLHL+RQQLIEQMNS  RVG   G VRHPQ Q+PQNRGRN+EFF+GRNCRSA  GL SQP WA  PPRKH+VNP PNGSGMRAVFLG PGGKR
Subjt:  QQLQASQFLHLRRQQLIEQMNSVGRVGQTKGCVRHPQPQMPQNRGRNNEFFNGRNCRSATAGLASQPTWA-APPRKHAVNPPPNGSGMRAVFLGAPGGKR

Query:  ECAGTGVFLPRQAGGTISETRKKPACSTVLVPARVMQALNLNLDDMYVQRIQPQQLQSRSPPVFNAGKNDVSVRIRNETLVSQQKA-NLRAAVPTVNHDI
        ECAGTGVFLPRQ  G +SETRKKPACSTVLVPARVMQALNLNLDDMYVQR QPQQLQSRSP VFN GKND+S R R+E+L++QQKA NLRAAV  VN +I
Subjt:  ECAGTGVFLPRQAGGTISETRKKPACSTVLVPARVMQALNLNLDDMYVQRIQPQQLQSRSPPVFNAGKNDVSVRIRNETLVSQQKA-NLRAAVPTVNHDI

Query:  G-LPQEWTY
        G LPQEW+Y
Subjt:  G-LPQEWTY

XP_038883347.1 uncharacterized protein LOC120074329 [Benincasa hispida] (e value: 2.2e-213, % identity: 92.33)
Query:  MAESLDDGEFWLPPKFLNDDDLFLEEKCRGNDVKNGRHGVGLYPFEFALGFGPFGCTSDLGSPVESLVGSSETESDEEEYIAGLTHQMTRSTLEDGFGLD
        MAESLDDGEFWLPPKFLNDDDLFLEEKC GNDVKNGRHGVGLYP      FGPFG  SDLGSPVESLVGSSETESDEEEYIAGLTHQMTRSTLEDGFGLD
Subjt:  MAESLDDGEFWLPPKFLNDDDLFLEEKCRGNDVKNGRHGVGLYPFEFALGFGPFGCTSDLGSPVESLVGSSETESDEEEYIAGLTHQMTRSTLEDGFGLD

Query:  NSHVWGSSGSPQSTLCAMGSGCGCKQGSSRGSPNGHYQASHPQLTLDLLYAAAGEVSKMRMNEEAYGFINSRGPLAPPRKPSPVSVPLKNREPDAEVYQQ
        NSHVWGSSGSPQSTLCAMGSGCGCKQGSSRGSPNGHYQASHPQLTLDLL+AAAGEVSKMRMNEEAYGFINSRGPLAPPRKPSPVSVPLKNREP+AEVYQQ
Subjt:  NSHVWGSSGSPQSTLCAMGSGCGCKQGSSRGSPNGHYQASHPQLTLDLLYAAAGEVSKMRMNEEAYGFINSRGPLAPPRKPSPVSVPLKNREPDAEVYQQ

Query:  LQASQFLHLRRQQLIEQMNSVGRVGQTKGCVRHPQPQMPQNRGRNNEFFNGRNCRSATAGLASQPTWAAPPRKHAVNPPPNGSGMRAVFLGAPGGKRECA
        LQASQFLHLRRQQLIEQMNS  RV QTKG VRH QPQM QNRGRN+EFFNGRNCRSATAGL SQPTWAAPPRKH VNPPPNGSGMRAVFLGAPGGKRECA
Subjt:  LQASQFLHLRRQQLIEQMNSVGRVGQTKGCVRHPQPQMPQNRGRNNEFFNGRNCRSATAGLASQPTWAAPPRKHAVNPPPNGSGMRAVFLGAPGGKRECA

Query:  GTGVFLPRQAGGTISETRKKPACSTVLVPARVMQALNLNLDDMYVQRIQPQQLQSRSPPVFNAGKNDVSVRIRNETLVSQQKANLRAAVPTVNHDIGLPQ
        GTGVFLPRQAGGT+SE RKKPACSTVLVPARVMQALNLNLDDMYVQRIQPQQLQ+RSP  FNAGKNDVSVR+R+E+LVSQ KANLR AVP VNHDIGLPQ
Subjt:  GTGVFLPRQAGGTISETRKKPACSTVLVPARVMQALNLNLDDMYVQRIQPQQLQSRSPPVFNAGKNDVSVRIRNETLVSQQKANLRAAVPTVNHDIGLPQ

Query:  EWTY
        EWTY
Subjt:  EWTY

TrEMBL top hits (e value, % identity, alignment)
A0A0A0L2G7 Uncharacterized protein (e value: 2.1e-206, % identity: 90.12)
Query:  MAESLDDGEFWLPPKFLNDDDLFLEEKCRGNDVKNGRHGVGLYPFEFALGFGPFGCTSDLGSPVESLVGSSETESDEEEYIAGLTHQMTRSTLEDGFGLD
        MAESLDDGEFWLPPKFLNDDDLF+EEKC GND+K+GR+GVGLYP      FG FG TSDLGSPVESLVGSSETESDEEEYIAGLTH+MTRSTLEDGFGLD
Subjt:  MAESLDDGEFWLPPKFLNDDDLFLEEKCRGNDVKNGRHGVGLYPFEFALGFGPFGCTSDLGSPVESLVGSSETESDEEEYIAGLTHQMTRSTLEDGFGLD

Query:  NSHVWGSSGSPQSTLCAMGSGCGCKQGSSRGSPNGHYQASHPQLTLDLLYAAAGEVSKMRMNEEAYGFINSRGPLAPPRKPSPVSVPLKNREPDAEVYQQ
        NSHVWGSSGSPQSTLCAMGSGCGCKQ SSRGSPNGHYQASHPQLTLDLLYAAAGEVSKMRMNEEAYGFINS GPLAPPRKPSPVSVPLKNREPD EVYQQ
Subjt:  NSHVWGSSGSPQSTLCAMGSGCGCKQGSSRGSPNGHYQASHPQLTLDLLYAAAGEVSKMRMNEEAYGFINSRGPLAPPRKPSPVSVPLKNREPDAEVYQQ

Query:  LQASQFLHLRRQQLIEQMNSVGRVGQTKGCVRHPQPQMPQNRGRNNEFFNGRNCRSATAGLASQPTWAAPPRK-HAVNPPPNGSGMRAVFLGAPGGKREC
        LQASQFLHLRRQQLIEQMNS  RVGQTKG VR PQPQMPQNRGRNNEFFNGRNCRSAT GL SQPTW APPRK H VNPP NGSGMRAVFLGAPGGKREC
Subjt:  LQASQFLHLRRQQLIEQMNSVGRVGQTKGCVRHPQPQMPQNRGRNNEFFNGRNCRSATAGLASQPTWAAPPRK-HAVNPPPNGSGMRAVFLGAPGGKREC

Query:  AGTGVFLPRQAGGTISETRKKPACSTVLVPARVMQALNLNLDDMYVQRIQPQQLQSRSPPVFNAGKNDVSVRIRNETLVSQQKANLRAAVPTVNHDIGLP
        AGTGVFLPRQAG  ISETRKKPACSTVLVPARVMQALNLNLDDMYVQR+ P QLQSRSPPVFNAGKNDVSVR R+E+L  QQK NLRAAVP VNH+IGLP
Subjt:  AGTGVFLPRQAGGTISETRKKPACSTVLVPARVMQALNLNLDDMYVQRIQPQQLQSRSPPVFNAGKNDVSVRIRNETLVSQQKANLRAAVPTVNHDIGLP

Query:  QEWTY
        QEWTY
Subjt:  QEWTY

A0A1S3BKD4 uncharacterized protein LOC103490808 (e value: 6.0e-209, % identity: 90.59)
Query:  MAESLDDGEFWLPPKFLNDDDLFLEEKCRGNDVKNGRHGVGLYPFEFALGFGPFGCTSDLGSPVESLVGSSETESDEEEYIAGLTHQMTRSTLEDGFGLD
        MAESLDDGEFWLPPKFLNDDDLF+EEKC GND+KNGR+GVGLYP      FG FG TSDLGSPVESLVGSSETESDEEEYIAGLTH++TRSTLEDGFGLD
Subjt:  MAESLDDGEFWLPPKFLNDDDLFLEEKCRGNDVKNGRHGVGLYPFEFALGFGPFGCTSDLGSPVESLVGSSETESDEEEYIAGLTHQMTRSTLEDGFGLD

Query:  NSHVWGSSGSPQSTLCAMGSGCGCKQGSSRGSPNGHYQASHPQLTLDLLYAAAGEVSKMRMNEEAYGFINSRGPLAPPRKPSPVSVPLKNREPDAEVYQQ
        NSHVWGSSGSPQSTLCAMGSGCGCKQGSSRGSPNGHYQASHPQLTLDLLYAAAGEVSKMRMNEE YGFINS GPLAPPRKPSPVSVPLKNREPDAEVYQQ
Subjt:  NSHVWGSSGSPQSTLCAMGSGCGCKQGSSRGSPNGHYQASHPQLTLDLLYAAAGEVSKMRMNEEAYGFINSRGPLAPPRKPSPVSVPLKNREPDAEVYQQ

Query:  LQASQFLHLRRQQLIEQMNSVGRVGQTKGCVRHPQPQMPQNRGRNNEFFNGRNCRSATAGLASQPTWAAPPRKHAVNPPPNGSGMRAVFLGAPGGKRECA
        LQASQFLHLRRQQLIEQMNS  RVGQTKG VRHPQPQM QNRGRNNEFFNGRNCRSAT GL SQPTWAAPPRKH VNPPPNGSGMRAVFLGAPGGKRECA
Subjt:  LQASQFLHLRRQQLIEQMNSVGRVGQTKGCVRHPQPQMPQNRGRNNEFFNGRNCRSATAGLASQPTWAAPPRKHAVNPPPNGSGMRAVFLGAPGGKRECA

Query:  GTGVFLPRQAGGTISETRKKPACSTVLVPARVMQALNLNLDDMYVQRIQPQQLQSRSPPVFNAGKNDVSVRIRNETLVSQQKANLRAAVPTVNHDIGLPQ
        GTGVFLPRQAGGT++ETRKKPACSTVLVPARVMQALNLNLDDMYVQRIQP QLQSRSPPV+ AGKNDVSVR ++E+L  QQK NLR AVP VNH+IGLPQ
Subjt:  GTGVFLPRQAGGTISETRKKPACSTVLVPARVMQALNLNLDDMYVQRIQPQQLQSRSPPVFNAGKNDVSVRIRNETLVSQQKANLRAAVPTVNHDIGLPQ

Query:  EWTY
        EWTY
Subjt:  EWTY

A0A5A7TPQ0 Uncharacterized protein (e value: 6.0e-209, % identity: 90.59)
Query:  MAESLDDGEFWLPPKFLNDDDLFLEEKCRGNDVKNGRHGVGLYPFEFALGFGPFGCTSDLGSPVESLVGSSETESDEEEYIAGLTHQMTRSTLEDGFGLD
        MAESLDDGEFWLPPKFLNDDDLF+EEKC GND+KNGR+GVGLYP      FG FG TSDLGSPVESLVGSSETESDEEEYIAGLTH++TRSTLEDGFGLD
Subjt:  MAESLDDGEFWLPPKFLNDDDLFLEEKCRGNDVKNGRHGVGLYPFEFALGFGPFGCTSDLGSPVESLVGSSETESDEEEYIAGLTHQMTRSTLEDGFGLD

Query:  NSHVWGSSGSPQSTLCAMGSGCGCKQGSSRGSPNGHYQASHPQLTLDLLYAAAGEVSKMRMNEEAYGFINSRGPLAPPRKPSPVSVPLKNREPDAEVYQQ
        NSHVWGSSGSPQSTLCAMGSGCGCKQGSSRGSPNGHYQASHPQLTLDLLYAAAGEVSKMRMNEE YGFINS GPLAPPRKPSPVSVPLKNREPDAEVYQQ
Subjt:  NSHVWGSSGSPQSTLCAMGSGCGCKQGSSRGSPNGHYQASHPQLTLDLLYAAAGEVSKMRMNEEAYGFINSRGPLAPPRKPSPVSVPLKNREPDAEVYQQ

Query:  LQASQFLHLRRQQLIEQMNSVGRVGQTKGCVRHPQPQMPQNRGRNNEFFNGRNCRSATAGLASQPTWAAPPRKHAVNPPPNGSGMRAVFLGAPGGKRECA
        LQASQFLHLRRQQLIEQMNS  RVGQTKG VRHPQPQM QNRGRNNEFFNGRNCRSAT GL SQPTWAAPPRKH VNPPPNGSGMRAVFLGAPGGKRECA
Subjt:  LQASQFLHLRRQQLIEQMNSVGRVGQTKGCVRHPQPQMPQNRGRNNEFFNGRNCRSATAGLASQPTWAAPPRKHAVNPPPNGSGMRAVFLGAPGGKRECA

Query:  GTGVFLPRQAGGTISETRKKPACSTVLVPARVMQALNLNLDDMYVQRIQPQQLQSRSPPVFNAGKNDVSVRIRNETLVSQQKANLRAAVPTVNHDIGLPQ
        GTGVFLPRQAGGT++ETRKKPACSTVLVPARVMQALNLNLDDMYVQRIQP QLQSRSPPV+ AGKNDVSVR ++E+L  QQK NLR AVP VNH+IGLPQ
Subjt:  GTGVFLPRQAGGTISETRKKPACSTVLVPARVMQALNLNLDDMYVQRIQPQQLQSRSPPVFNAGKNDVSVRIRNETLVSQQKANLRAAVPTVNHDIGLPQ

Query:  EWTY
        EWTY
Subjt:  EWTY

A0A6J1DF19 uncharacterized protein LOC111019479 (e value: 6.0e-193, % identity: 83.65)
Query:  MAESLDDGEFWLPPKFLNDDDLFLEEKCRGNDVKNGRHGVGLYPFEFALGFGPFGCTSDLGSPVESLVGSSETESDEEEYIAGLTHQMTRSTLEDGFGLD
        MAESLDDGEFWLPPKFLNDDDLFLE+KC GNDVKNGR GV  YPFEF LGFGPFG TSDLGSPVESL+GSSETESDEEEYIAGLTHQM RSTLEDGFGLD
Subjt:  MAESLDDGEFWLPPKFLNDDDLFLEEKCRGNDVKNGRHGVGLYPFEFALGFGPFGCTSDLGSPVESLVGSSETESDEEEYIAGLTHQMTRSTLEDGFGLD

Query:  NSHVWGSSGSPQSTLCAMGSGCGCKQGSSRGSPNGHYQ--ASHPQLTLDLLYAAAGEVSKMRMNEEAYGFINSRGPLAPPRKPSPVSVPLKNREPDAEVY
        NSH WGSSGSPQSTLCA+GSGCGCKQG SRGSPNGHY   AS PQLTLDLLYAAAGEVSKMR+NEEAYG IN+RGPL PPRKPSPVSVP+KNREPDA VY
Subjt:  NSHVWGSSGSPQSTLCAMGSGCGCKQGSSRGSPNGHYQ--ASHPQLTLDLLYAAAGEVSKMRMNEEAYGFINSRGPLAPPRKPSPVSVPLKNREPDAEVY

Query:  QQLQASQFLHLRRQQLIEQMNS------VGRVGQTKGC-VRHPQ--------PQMPQNRGRNNEFFNGRNCRSATAGLASQPTWAAPPRKHAVNPPPNGS
        QQLQASQFLHLRRQQL+EQ+NS        RVGQ+KGC VR+ Q        PQMPQNRGRN++FF+GRNCR A +GL S PTWAA PRKHAVNPPPNGS
Subjt:  QQLQASQFLHLRRQQLIEQMNS------VGRVGQTKGC-VRHPQ--------PQMPQNRGRNNEFFNGRNCRSATAGLASQPTWAAPPRKHAVNPPPNGS

Query:  GMRAVFLGAPGGKRECAGTGVFLPRQAGGTISETRKKPACSTVLVPARVMQALNLNLDDMYVQRIQPQ-QLQSRSPPVFNAGKNDVSVRIRNETLVSQQK
        GMRAVFLG PGGKRECAGTGVFLPRQ  G +SE+RKKPACSTVLVPARVMQALNLNLDDMYVQRIQPQ  LQSRSPPVFNAGKNDV VR+R+E L SQQK
Subjt:  GMRAVFLGAPGGKRECAGTGVFLPRQAGGTISETRKKPACSTVLVPARVMQALNLNLDDMYVQRIQPQ-QLQSRSPPVFNAGKNDVSVRIRNETLVSQQK

Query:  ANLRAAVPTVNHDIGLPQEWTY
         NLRAAVP VNHDI LPQEWTY
Subjt:  ANLRAAVPTVNHDIGLPQEWTY

A0A6J1EV17 uncharacterized protein LOC111438229 (e value: 1.7e-179, % identity: 79.95)
Query:  MAESLDDGEFWLPPKFLNDDDLFLEEKCRGNDVKNGRH--GVGLYPFEFALGFGPFGCTSDLGSPVESLVGSSETESDEEEYIAGLTHQMTRSTLEDGFG
        MAESLDDGEFWLPPKFLNDDDLFLEEKC GND K  R   G GL+PFE++LGFGPFG +SDLGSPVESL+GSSETESDE+EYIAGL  QM RSTL+DGFG
Subjt:  MAESLDDGEFWLPPKFLNDDDLFLEEKCRGNDVKNGRH--GVGLYPFEFALGFGPFGCTSDLGSPVESLVGSSETESDEEEYIAGLTHQMTRSTLEDGFG

Query:  LDNSHVWGSSGSPQSTLCAMGSGCGCKQGSSRGSPNGHYQASHPQLTLDLLYAAAGEVSKMRMNEEAYGFINSRGPLAPPRKPSPVSVPLKNREPDAEVY
        L+ SH W SSGSPQSTLC +G+GCGCKQ SSRGSPN H   SHPQLTLDLLYAAAGEVSKMRMNEEAY F+N+RG   PPRKPSPV+VPLKNR+ DA VY
Subjt:  LDNSHVWGSSGSPQSTLCAMGSGCGCKQGSSRGSPNGHYQASHPQLTLDLLYAAAGEVSKMRMNEEAYGFINSRGPLAPPRKPSPVSVPLKNREPDAEVY

Query:  QQLQASQFLHLRRQQLIEQMNSVGRVGQTKGCVRHPQPQMPQNRGRNNEFFNGRNCRSATAGLASQPTWA-APPRKHAVNPPPNGSGMRAVFLGAPGGKR
        QQLQASQFLHL+RQQLIEQMNS  RVG   G VRHPQ Q+PQNRGRN+EFF+GRNCRSA  GL SQP WA  PPRKH+VNP PNGSGMRAVFLG PGGKR
Subjt:  QQLQASQFLHLRRQQLIEQMNSVGRVGQTKGCVRHPQPQMPQNRGRNNEFFNGRNCRSATAGLASQPTWA-APPRKHAVNPPPNGSGMRAVFLGAPGGKR

Query:  ECAGTGVFLPRQAGGTISETRKKPACSTVLVPARVMQALNLNLDDMYVQRIQPQQLQSRSPPVFNAGKNDVSVRIRNETLVSQQKA-NLRAAVPTVNHDI
        ECAGTGVFLPRQ  G +SETRKKPACSTVLVPARVMQALNLNLDDMYVQR QPQQLQSRSP VFN GKND+S R R+E+L++QQKA NLRAAV  VN +I
Subjt:  ECAGTGVFLPRQAGGTISETRKKPACSTVLVPARVMQALNLNLDDMYVQRIQPQQLQSRSPPVFNAGKNDVSVRIRNETLVSQQKA-NLRAAVPTVNHDI

Query:  G-LPQEWTY
        G LPQEW+Y
Subjt:  G-LPQEWTY

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT2G39870.1 unknown protein (e value: 1.3e-06, % identity: 25.66)
Query:  EKCRGNDVKNGRHGVGLYPFEFALGFGPFGCTSDLGSPVESLVGSSETESDEEEYIAGLTHQMTRST--LEDGFGLDNSHVWGSSGSPQSTLCAMGSGCG
        EK   +++   R G   +P EF   F     +    SP +S     E+  DEE+++AGLT ++  ST  L             ++ SPQSTL  +GS   
Subjt:  EKCRGNDVKNGRHGVGLYPFEFALGFGPFGCTSDLGSPVESLVGSSETESDEEEYIAGLTHQMTRST--LEDGFGLDNSHVWGSSGSPQSTLCAMGSGCG

Query:  CKQGSS-RGSPNGHYQASHPQLTLDLLYAAAGEVSKMRMNEEAYGFINSRGPLAPPRKPSPVSVPLKNREPDAEVYQQLQASQFLHLRRQQLIEQMNSVG
            S    SP     +       D++ AAAGEV+++++           G   P   P      L  R+ +A ++ +LQ         Q+LIEQM    
Subjt:  CKQGSS-RGSPNGHYQASHPQLTLDLLYAAAGEVSKMRMNEEAYGFINSRGPLAPPRKPSPVSVPLKNREPDAEVYQQLQASQFLHLRRQQLIEQMNSVG

Query:  RVGQTKGCVRHPQPQMPQNR------GRNNEFFNGRNCRSATAGLASQPTWAAPPRKHAVNPPPNGSGMRAVFLGAPGGKRECAGTGVFLPRQ-AGGTIS
               C    + ++ +NR           F N R  R       + PTW  P +                   A   KR  AGTGVFLPR+      S
Subjt:  RVGQTKGCVRHPQPQMPQNR------GRNNEFFNGRNCRSATAGLASQPTWAAPPRKHAVNPPPNGSGMRAVFLGAPGGKRECAGTGVFLPRQ-AGGTIS

Query:  ETRKKPACSTVLVPARVMQALNLNLDDMYVQRIQPQQLQ
        ++ K P  +  ++  +V +  NLN D+ +   + P++ Q
Subjt:  ETRKKPACSTVLVPARVMQALNLNLDDMYVQRIQPQQLQ

AT3G54000.1 CONTAINS InterPro DOMAIN/s: Uncharacterised conserved protein UCP022260 (InterPro:IPR016802); Has 94 Blast hits to 94 proteins in 14 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 94; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). (e value: 2.1e-33, % identity: 35.06)
Query:  LDDGEFWLPPKFLNDDDLFLEEKCRGNDVKNGRHGV--GLYPFEFALGFGPFGCTSDLGSPVESLVGSSETESDEEEYIAGLTHQMTRSTLEDGF--GLD
        +DD EFWLP +FL DDD FL EK      +N   G+   L+P+E   GFG FG T         +  ++  E DEE ++AGLT QM  S+L+D F  G+ 
Subjt:  LDDGEFWLPPKFLNDDDLFLEEKCRGNDVKNGRHGV--GLYPFEFALGFGPFGCTSDLGSPVESLVGSSETESDEEEYIAGLTHQMTRSTLEDGF--GLD

Query:  NSH---------VWGSSGSPQSTLCAMGSGCGCKQGSSRGSPNGHYQASHPQLTLDLLYAAAGEVSKMRMNEEAYGFINSRGPLAPPRKPSPVSVPLKNR
         +H          W  + SP    C  G+GC C   + R + N + + S        LY AA    +M +N+E Y   + RG L  P K   +S  +KN 
Subjt:  NSH---------VWGSSGSPQSTLCAMGSGCGCKQGSSRGSPNGHYQASHPQLTLDLLYAAAGEVSKMRMNEEAYGFINSRGPLAPPRKPSPVSVPLKNR

Query:  EPDAE---------VYQQLQASQFLHLRRQQLIEQMNSVGRVGQTKGCVRHPQPQMPQNRGRNNEFFNGRNCRSATAGLASQPTWAAPPRKHAVNPPPNG
          +            YQ+LQA QF  L++QQL                ++H +  + QNRG      NG N       L+S   W+        N  P  
Subjt:  EPDAE---------VYQQLQASQFLHLRRQQLIEQMNSVGRVGQTKGCVRHPQPQMPQNRGRNNEFFNGRNCRSATAGLASQPTWAAPPRKHAVNPPPNG

Query:  SGMRAVFLGAPGGKRECAGTGVFLPRQAGGTI-SETRKKPACSTVLVPARVMQALNLNLDDMYVQRIQPQQLQSRSPPVFNAGKNDVSVRIR--NETLVS
          MRAVF+G   GKR   GTGVFLPR    T  +ETR+KP  STVLVPAR+ Q LNLNL +               P    A  NDVS R R  N    S
Subjt:  SGMRAVFLGAPGGKRECAGTGVFLPRQAGGTI-SETRKKPACSTVLVPARVMQALNLNLDDMYVQRIQPQQLQSRSPPVFNAGKNDVSVRIR--NETLVS

Query:  QQKANLRAAVPTVNHDIGLPQEWTY
        Q    +RA       +  LP EW Y
Subjt:  QQKANLRAAVPTVNHDIGLPQEWTY

AT3G54000.2 unknown protein (e value: 9.0e-24, % identity: 34.31)
Query:  LDDGEFWLPPKFLNDDDLFLEEKCRGNDVKNGRHGV--GLYPFEFALGFGPFGCTSDLGSPVESLVGSSETESDEEEYIAGLTHQMTRSTLEDGF--GLD
        +DD EFWLP +FL DDD FL EK      +N   G+   L+P+E   GFG FG T         +  ++  E DEE ++AGLT QM  S+L+D F  G+ 
Subjt:  LDDGEFWLPPKFLNDDDLFLEEKCRGNDVKNGRHGV--GLYPFEFALGFGPFGCTSDLGSPVESLVGSSETESDEEEYIAGLTHQMTRSTLEDGF--GLD

Query:  NSH---------VWGSSGSPQSTLCAMGSGCGCKQGSSRGSPNGHYQASHPQLTLDLLYAAAGEVSKMRMNEEAYGFINSRGPLAPPRKPSPVSVPLKNR
         +H          W  + SP    C  G+GC C   + R + N + + S        LY AA    +M +N+E Y   + RG L  P K   +S  +KN 
Subjt:  NSH---------VWGSSGSPQSTLCAMGSGCGCKQGSSRGSPNGHYQASHPQLTLDLLYAAAGEVSKMRMNEEAYGFINSRGPLAPPRKPSPVSVPLKNR

Query:  EPDAE---------VYQQLQASQFLHLRRQQLIEQMNSVGRVGQTKGCVRHPQPQMPQNRGRNNEFFNGRNCRSATAGLASQPTWAAPPRKHAVNPPPNG
          +            YQ+LQA QF  L++QQL                ++H +  + QNRG      NG N       L+S   W+        N  P  
Subjt:  EPDAE---------VYQQLQASQFLHLRRQQLIEQMNSVGRVGQTKGCVRHPQPQMPQNRGRNNEFFNGRNCRSATAGLASQPTWAAPPRKHAVNPPPNG

Query:  SGMRAVFLGAPGGKRECAGTGVFLPRQAGGTI-SETRKKPA
          MRAVF+G   GKR   GTGVFLPR    T  +ETR+KP+
Subjt:  SGMRAVFLGAPGGKRECAGTGVFLPRQAGGTI-SETRKKPA

AT3G54000.3 unknown protein (e value: 9.0e-24, % identity: 34.31)
Query:  LDDGEFWLPPKFLNDDDLFLEEKCRGNDVKNGRHGV--GLYPFEFALGFGPFGCTSDLGSPVESLVGSSETESDEEEYIAGLTHQMTRSTLEDGF--GLD
        +DD EFWLP +FL DDD FL EK      +N   G+   L+P+E   GFG FG T         +  ++  E DEE ++AGLT QM  S+L+D F  G+ 
Subjt:  LDDGEFWLPPKFLNDDDLFLEEKCRGNDVKNGRHGV--GLYPFEFALGFGPFGCTSDLGSPVESLVGSSETESDEEEYIAGLTHQMTRSTLEDGF--GLD

Query:  NSH---------VWGSSGSPQSTLCAMGSGCGCKQGSSRGSPNGHYQASHPQLTLDLLYAAAGEVSKMRMNEEAYGFINSRGPLAPPRKPSPVSVPLKNR
         +H          W  + SP    C  G+GC C   + R + N + + S        LY AA    +M +N+E Y   + RG L  P K   +S  +KN 
Subjt:  NSH---------VWGSSGSPQSTLCAMGSGCGCKQGSSRGSPNGHYQASHPQLTLDLLYAAAGEVSKMRMNEEAYGFINSRGPLAPPRKPSPVSVPLKNR

Query:  EPDAE---------VYQQLQASQFLHLRRQQLIEQMNSVGRVGQTKGCVRHPQPQMPQNRGRNNEFFNGRNCRSATAGLASQPTWAAPPRKHAVNPPPNG
          +            YQ+LQA QF  L++QQL                ++H +  + QNRG      NG N       L+S   W+        N  P  
Subjt:  EPDAE---------VYQQLQASQFLHLRRQQLIEQMNSVGRVGQTKGCVRHPQPQMPQNRGRNNEFFNGRNCRSATAGLASQPTWAAPPRKHAVNPPPNG

Query:  SGMRAVFLGAPGGKRECAGTGVFLPRQAGGTI-SETRKKPA
          MRAVF+G   GKR   GTGVFLPR    T  +ETR+KP+
Subjt:  SGMRAVFLGAPGGKRECAGTGVFLPRQAGGTI-SETRKKPA

AT5G59050.1 unknown protein (e value: 5.0e-14, % identity: 37.69)
Query:  SGMRAVFLGAPGGKRECAGTGVFLPRQAGGTISETRKKPACSTVLVPARVMQALNLNLDDMYVQRIQPQQLQSRSPPVFNA---GKNDVSVRIRNETLVS
        SG++AVF+   G +    GTGVFLPR   GT+ E+RKK  CSTV++PARV++AL ++ D + V    P    S  PP  +A     N+  ++    T +S
Subjt:  SGMRAVFLGAPGGKRECAGTGVFLPRQAGGTISETRKKPACSTVLVPARVMQALNLNLDDMYVQRIQPQQLQSRSPPVFNA---GKNDVSVRIRNETLVS

Query:  --QQKANLRAAVPTVNHD---IGLPQEWTY
          Q  +     +   +H      LPQEWTY
Subjt:  --QQKANLRAAVPTVNHD---IGLPQEWTY


Sequences
CDS sequence
ATGGCTGAGAGTTTGGACGATGGTGAGTTTTGGCTTCCTCCTAAGTTCCTTAACGACGACGACTTGTTTCTCGAGGAAAAGTGTAGGGGTAATGATGTTAAGAATGGAAG
ACATGGAGTTGGGTTGTACCCGTTTGAGTTTGCTCTTGGATTTGGGCCTTTTGGGTGTACTTCTGATCTCGGTTCACCGGTTGAATCTCTGGTTGGTTCCAGCGAAACAG
AGAGTGATGAGGAGGAATATATCGCTGGATTGACGCATCAAATGACGCGTTCTACTCTGGAAGATGGTTTTGGACTTGATAACTCTCATGTTTGGGGATCTTCTGGTTCA
CCACAGTCAACTCTCTGCGCTATGGGAAGTGGGTGCGGCTGCAAACAGGGCTCTAGCAGGGGAAGTCCCAATGGCCATTACCAAGCTTCTCATCCACAACTCACTTTGGA
TCTACTCTATGCCGCTGCCGGTGAAGTCTCGAAGATGCGGATGAATGAAGAAGCATACGGTTTCATTAACTCTCGTGGACCTCTGGCTCCACCAAGAAAGCCCTCTCCCG
TCTCTGTTCCACTCAAAAACCGCGAGCCCGACGCTGAAGTTTACCAGCAGCTGCAGGCTTCCCAATTTTTGCATCTGAGACGACAGCAGCTTATCGAGCAAATGAACTCA
GTGGGTCGTGTGGGACAGACAAAGGGCTGTGTGAGACACCCTCAGCCCCAAATGCCACAGAACAGAGGAAGAAATAATGAGTTCTTCAATGGCAGAAACTGTCGCTCGGC
AACTGCTGGTTTAGCGTCCCAACCCACTTGGGCGGCTCCTCCTCGGAAACACGCCGTTAACCCCCCGCCGAACGGTTCCGGCATGAGGGCAGTCTTTCTCGGCGCCCCCG
GCGGCAAGAGGGAATGCGCCGGTACGGGGGTGTTTTTGCCTCGACAAGCCGGCGGCACCATCTCTGAAACCCGCAAGAAGCCAGCTTGTTCGACTGTTCTGGTTCCTGCC
AGAGTGATGCAGGCCCTGAATCTGAACTTAGACGACATGTACGTTCAGCGTATTCAACCCCAACAACTTCAATCTCGTTCCCCTCCAGTTTTCAACGCAGGGAAGAACGA
TGTTTCTGTTAGGATTCGAAATGAAACTTTGGTGTCGCAGCAGAAGGCGAATCTCCGAGCGGCGGTGCCGACAGTAAACCATGACATTGGGCTTCCACAGGAGTGGACTT
ACTGA
mRNA sequence
GCTTCCCCTTCCGACCCTTCTTCTTCTTCTTCCTCTCTGTATTTTTTTCTATTCTGCTTTTCTGGCTTTTTTCTCCTCTCTTCATAATACATACGGTCGCAAGTCGTTCT
CCCTCTCTCTCTCTCTGATTCAATTCCAGTTCGACTTTCCTTTTTTAAGGGCTCCCACCACTGCGTGCCCTTAAACCCTATTTCCTCTGTTTTAATCTTCTTTTTTTTTT
TTTTTTTCTGATTTGACTGGGGCCCTTTTCGCGGGGCGCCACTGCACTTCCTTCAAACCCTACTCATACACTCACTCTTCTTTTCTTCTACCCTTCTTGTTATTATTAAA
CCTTAACCAAGCCGCAAAACACAATCCCAAATATACTCTACTCTCACTCCCATCTCTCTCTCCTTTCTTTTTAGTGAAGTCAGGATAACCAGTCGTTGTTGTTTGCTTTT
AGTTCTTTTGGGGTTGTTTTTTCTTTTTTCAGCTTTCTGTAATGGCTGAGAGTTTGGACGATGGTGAGTTTTGGCTTCCTCCTAAGTTCCTTAACGACGACGACTTGTTT
CTCGAGGAAAAGTGTAGGGGTAATGATGTTAAGAATGGAAGACATGGAGTTGGGTTGTACCCGTTTGAGTTTGCTCTTGGATTTGGGCCTTTTGGGTGTACTTCTGATCT
CGGTTCACCGGTTGAATCTCTGGTTGGTTCCAGCGAAACAGAGAGTGATGAGGAGGAATATATCGCTGGATTGACGCATCAAATGACGCGTTCTACTCTGGAAGATGGTT
TTGGACTTGATAACTCTCATGTTTGGGGATCTTCTGGTTCACCACAGTCAACTCTCTGCGCTATGGGAAGTGGGTGCGGCTGCAAACAGGGCTCTAGCAGGGGAAGTCCC
AATGGCCATTACCAAGCTTCTCATCCACAACTCACTTTGGATCTACTCTATGCCGCTGCCGGTGAAGTCTCGAAGATGCGGATGAATGAAGAAGCATACGGTTTCATTAA
CTCTCGTGGACCTCTGGCTCCACCAAGAAAGCCCTCTCCCGTCTCTGTTCCACTCAAAAACCGCGAGCCCGACGCTGAAGTTTACCAGCAGCTGCAGGCTTCCCAATTTT
TGCATCTGAGACGACAGCAGCTTATCGAGCAAATGAACTCAGTGGGTCGTGTGGGACAGACAAAGGGCTGTGTGAGACACCCTCAGCCCCAAATGCCACAGAACAGAGGA
AGAAATAATGAGTTCTTCAATGGCAGAAACTGTCGCTCGGCAACTGCTGGTTTAGCGTCCCAACCCACTTGGGCGGCTCCTCCTCGGAAACACGCCGTTAACCCCCCGCC
GAACGGTTCCGGCATGAGGGCAGTCTTTCTCGGCGCCCCCGGCGGCAAGAGGGAATGCGCCGGTACGGGGGTGTTTTTGCCTCGACAAGCCGGCGGCACCATCTCTGAAA
CCCGCAAGAAGCCAGCTTGTTCGACTGTTCTGGTTCCTGCCAGAGTGATGCAGGCCCTGAATCTGAACTTAGACGACATGTACGTTCAGCGTATTCAACCCCAACAACTT
CAATCTCGTTCCCCTCCAGTTTTCAACGCAGGGAAGAACGATGTTTCTGTTAGGATTCGAAATGAAACTTTGGTGTCGCAGCAGAAGGCGAATCTCCGAGCGGCGGTGCC
GACAGTAAACCATGACATTGGGCTTCCACAGGAGTGGACTTACTGAAGCGAGAACACCGCCATTAGAATAAAAAAAAGGGGTGTCGATTTACTAACTGCCGTGTTATTTT
AGGGGAATTAAAAGGGTAGATTTGATAAAGAAAAGTTAATGAAGCTTTGTTAGATTAATGGGGTTATTATAGGAAATATAGTATTTTTGTTTAATAGGAAAAGGGTAAGA
TAGGAAGGAGGAAAAGGTGTTATTGGGAGAAAGGAAAGGAAAAGAAGAGAAAAGAAAAAGGGTTTTTGGTTTTTTGGGGATATATTAGGAACTTAGATTAGATGAAATAA
ACTTGTGTTAAGCGGGGGGTTTAATTGTTATTTTGATTGGGCTGAAAAAGCCAACTTAGAATGGAATGTAATGGGAAGGAATGGGAAGAAAAGGAAAAAAGAG
Protein sequence
MAESLDDGEFWLPPKFLNDDDLFLEEKCRGNDVKNGRHGVGLYPFEFALGFGPFGCTSDLGSPVESLVGSSETESDEEEYIAGLTHQMTRSTLEDGFGLDNSHVWGSSGS
PQSTLCAMGSGCGCKQGSSRGSPNGHYQASHPQLTLDLLYAAAGEVSKMRMNEEAYGFINSRGPLAPPRKPSPVSVPLKNREPDAEVYQQLQASQFLHLRRQQLIEQMNS
VGRVGQTKGCVRHPQPQMPQNRGRNNEFFNGRNCRSATAGLASQPTWAAPPRKHAVNPPPNGSGMRAVFLGAPGGKRECAGTGVFLPRQAGGTISETRKKPACSTVLVPA
RVMQALNLNLDDMYVQRIQPQQLQSRSPPVFNAGKNDVSVRIRNETLVSQQKANLRAAVPTVNHDIGLPQEWTY
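
The CDS and protein sequences above can be cross-checked by translating the CDS, as in the following minimal sketch. It assumes the standard nuclear genetic code and an unambiguous A/C/G/T sequence; `translate` is a hypothetical helper written for illustration, not a CuGenDB tool.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]  # KeyError on non-ACGT bases
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 11 codons and last 5 codons of the CDS listed above.
print(translate("ATGGCTGAGAGTTTGGACGATGGTGAGTTTTGG"))  # prints: MAESLDDGEFW
print(translate("GAGTGGACTTACTGA"))                    # prints: EWTY
```

Both fragments agree with the start (MAESLDDGEFW) and end (EWTY) of the listed protein sequence; passing the full CDS with its line breaks should reproduce the whole protein if the record is internally consistent.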