; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Pay0018493 (gene) of Melon (Payzawat) v1 genome

Gene IDPay0018493
OrganismCucumis melo var. inodorus cv. Payzawat (Melon (Payzawat) v1)
DescriptionTIP41-like protein
Genome locationchr08:30198037..30200745
RNA-Seq ExpressionPay0018493
SyntenyPay0018493
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6603614.1 hypothetical protein SDJN03_04223, partial [Cucurbita argyrosperma subsp. sororia]2.1e-16876.43Show/hide
Query:  MAESLDDGEFWLPPKFLNDDDLFIEEKCAGNDLKNGRNGVGLYPF------GSFGFTSDLGSPVESLVGSSETESDEEEYIAGLKHRLTRSTLEDGFGLD
        MA+SLDDGEFWLPPKFL+DDDLF+E K  G+D+KNGR GVGLYPF      G+FG TSDL SPVESLVGSSETESDEEEYIAGL H++ RSTLEDGFGLD
Subjt:  MAESLDDGEFWLPPKFLNDDDLFIEEKCAGNDLKNGRNGVGLYPF------GSFGFTSDLGSPVESLVGSSETESDEEEYIAGLKHRLTRSTLEDGFGLD

Query:  NSHVWGSSGSPQSTLCAMGSGCGCKQGSSRGSPNGHYQASHPQLTLDLLYAAAGEVSKMRMNEETYGFINSCGPLAPPRKPSPVSVPLKNREPDAEVYQQ
        NSH WG SGSPQSTLC++GSGCGCKQGSSRGSPNGH+QASHPQLTLDLL+AAAGEVSKMRMNEE+YGF ++  PLAPPRKP PVSVPLKN E DA VYQQ
Subjt:  NSHVWGSSGSPQSTLCAMGSGCGCKQGSSRGSPNGHYQASHPQLTLDLLYAAAGEVSKMRMNEETYGFINSCGPLAPPRKPSPVSVPLKNREPDAEVYQQ

Query:  LQASQFLHLRRQQLIEQMNSAARVGQTKGSVRHPQPHMLQNRGRNNEFFNGRNCRSATTGLPSQPTWAAPPRKHTVNPPPNGSGMRAVFLGAPGGKRECA
        LQAS+FL+LRRQQ+I+Q+N+ AR+GQTK S    QP + QN+GRN EFFNGRNCRS TTGL  Q TWA PPRKH++NPPP  SGMRAVFLGAPGGKRECA
Subjt:  LQASQFLHLRRQQLIEQMNSAARVGQTKGSVRHPQPHMLQNRGRNNEFFNGRNCRSATTGLPSQPTWAAPPRKHTVNPPPNGSGMRAVFLGAPGGKRECA

Query:  GTGVFLPRQAGGTVTETRKKPACSTVLVPARVMQALNLNLDDMYVQRIQPPQLQSRSPPVYIAGK-NDVSVRNKSESLQQKGNLRTAVPAVNHEIGLPQE
        GTGVFLPRQ  GTV+E RKKPACSTVLVPARVMQALNLNLDDMYVQR+QP QLQSR PPV  A K NDV+ R ++E         T  P VNH+I LPQE
Subjt:  GTGVFLPRQAGGTVTETRKKPACSTVLVPARVMQALNLNLDDMYVQRIQPPQLQSRSPPVYIAGK-NDVSVRNKSESLQQKGNLRTAVPAVNHEIGLPQE

Query:  WTY
        WTY
Subjt:  WTY

XP_004147909.1 uncharacterized protein LOC101214270 [Cucumis sativus]7.9e-21693.95Show/hide
Query:  MAESLDDGEFWLPPKFLNDDDLFIEEKCAGNDLKNGRNGVGLYPFGSFGFTSDLGSPVESLVGSSETESDEEEYIAGLKHRLTRSTLEDGFGLDNSHVWG
        MAESLDDGEFWLPPKFLNDDDLFIEEKC GNDLK+GRNGVGLYPFGSFGFTSDLGSPVESLVGSSETESDEEEYIAGL HR+TRSTLEDGFGLDNSHVWG
Subjt:  MAESLDDGEFWLPPKFLNDDDLFIEEKCAGNDLKNGRNGVGLYPFGSFGFTSDLGSPVESLVGSSETESDEEEYIAGLKHRLTRSTLEDGFGLDNSHVWG

Query:  SSGSPQSTLCAMGSGCGCKQGSSRGSPNGHYQASHPQLTLDLLYAAAGEVSKMRMNEETYGFINSCGPLAPPRKPSPVSVPLKNREPDAEVYQQLQASQF
        SSGSPQSTLCAMGSGCGCKQ SSRGSPNGHYQASHPQLTLDLLYAAAGEVSKMRMNEE YGFINSCGPLAPPRKPSPVSVPLKNREPD EVYQQLQASQF
Subjt:  SSGSPQSTLCAMGSGCGCKQGSSRGSPNGHYQASHPQLTLDLLYAAAGEVSKMRMNEETYGFINSCGPLAPPRKPSPVSVPLKNREPDAEVYQQLQASQF

Query:  LHLRRQQLIEQMNSAARVGQTKGSVRHPQPHMLQNRGRNNEFFNGRNCRSATTGLPSQPTWAAPPRK-HTVNPPPNGSGMRAVFLGAPGGKRECAGTGVF
        LHLRRQQLIEQMNSAARVGQTKG+VR PQP M QNRGRNNEFFNGRNCRSATTGLPSQPTW APPRK HTVNPP NGSGMRAVFLGAPGGKRECAGTGVF
Subjt:  LHLRRQQLIEQMNSAARVGQTKGSVRHPQPHMLQNRGRNNEFFNGRNCRSATTGLPSQPTWAAPPRK-HTVNPPPNGSGMRAVFLGAPGGKRECAGTGVF

Query:  LPRQAGGTVTETRKKPACSTVLVPARVMQALNLNLDDMYVQRIQPPQLQSRSPPVYIAGKNDVSVRNKSESLQQKGNLRTAVPAVNHEIGLPQEWTY
        LPRQAG  ++ETRKKPACSTVLVPARVMQALNLNLDDMYVQR+ PPQLQSRSPPV+ AGKNDVSVRN+SESLQQKGNLR AVPAVNHEIGLPQEWTY
Subjt:  LPRQAGGTVTETRKKPACSTVLVPARVMQALNLNLDDMYVQRIQPPQLQSRSPPVYIAGKNDVSVRNKSESLQQKGNLRTAVPAVNHEIGLPQEWTY

XP_008448729.1 PREDICTED: uncharacterized protein LOC103490808 [Cucumis melo]5.2e-22899.24Show/hide
Query:  MAESLDDGEFWLPPKFLNDDDLFIEEKCAGNDLKNGRNGVGLYPFGSFGFTSDLGSPVESLVGSSETESDEEEYIAGLKHRLTRSTLEDGFGLDNSHVWG
        MAESLDDGEFWLPPKFLNDDDLFIEEKCAGNDLKNGRNGVGLYPFGSFGFTSDLGSPVESLVGSSETESDEEEYIAGL HRLTRSTLEDGFGLDNSHVWG
Subjt:  MAESLDDGEFWLPPKFLNDDDLFIEEKCAGNDLKNGRNGVGLYPFGSFGFTSDLGSPVESLVGSSETESDEEEYIAGLKHRLTRSTLEDGFGLDNSHVWG

Query:  SSGSPQSTLCAMGSGCGCKQGSSRGSPNGHYQASHPQLTLDLLYAAAGEVSKMRMNEETYGFINSCGPLAPPRKPSPVSVPLKNREPDAEVYQQLQASQF
        SSGSPQSTLCAMGSGCGCKQGSSRGSPNGHYQASHPQLTLDLLYAAAGEVSKMRMNEETYGFINSCGPLAPPRKPSPVSVPLKNREPDAEVYQQLQASQF
Subjt:  SSGSPQSTLCAMGSGCGCKQGSSRGSPNGHYQASHPQLTLDLLYAAAGEVSKMRMNEETYGFINSCGPLAPPRKPSPVSVPLKNREPDAEVYQQLQASQF

Query:  LHLRRQQLIEQMNSAARVGQTKGSVRHPQPHMLQNRGRNNEFFNGRNCRSATTGLPSQPTWAAPPRKHTVNPPPNGSGMRAVFLGAPGGKRECAGTGVFL
        LHLRRQQLIEQMNSA RVGQTKGSVRHPQP MLQNRGRNNEFFNGRNCRSATTGLPSQPTWAAPPRKHTVNPPPNGSGMRAVFLGAPGGKRECAGTGVFL
Subjt:  LHLRRQQLIEQMNSAARVGQTKGSVRHPQPHMLQNRGRNNEFFNGRNCRSATTGLPSQPTWAAPPRKHTVNPPPNGSGMRAVFLGAPGGKRECAGTGVFL

Query:  PRQAGGTVTETRKKPACSTVLVPARVMQALNLNLDDMYVQRIQPPQLQSRSPPVYIAGKNDVSVRNKSESLQQKGNLRTAVPAVNHEIGLPQEWTY
        PRQAGGTVTETRKKPACSTVLVPARVMQALNLNLDDMYVQRIQPPQLQSRSPPVYIAGKNDVSVRNKSESLQQKGNLRTAVPAVNHEIGLPQEWTY
Subjt:  PRQAGGTVTETRKKPACSTVLVPARVMQALNLNLDDMYVQRIQPPQLQSRSPPVYIAGKNDVSVRNKSESLQQKGNLRTAVPAVNHEIGLPQEWTY

XP_022151566.1 uncharacterized protein LOC111019479 [Momordica charantia]1.0e-17879.62Show/hide
Query:  MAESLDDGEFWLPPKFLNDDDLFIEEKCAGNDLKNGRNGVGLYP------FGSFGFTSDLGSPVESLVGSSETESDEEEYIAGLKHRLTRSTLEDGFGLD
        MAESLDDGEFWLPPKFLNDDDLF+E+KC GND+KNGR+GV  YP      FG FG TSDLGSPVESL+GSSETESDEEEYIAGL H++ RSTLEDGFGLD
Subjt:  MAESLDDGEFWLPPKFLNDDDLFIEEKCAGNDLKNGRNGVGLYP------FGSFGFTSDLGSPVESLVGSSETESDEEEYIAGLKHRLTRSTLEDGFGLD

Query:  NSHVWGSSGSPQSTLCAMGSGCGCKQGSSRGSPNGHYQ--ASHPQLTLDLLYAAAGEVSKMRMNEETYGFINSCGPLAPPRKPSPVSVPLKNREPDAEVY
        NSH WGSSGSPQSTLCA+GSGCGCKQG SRGSPNGHY   AS PQLTLDLLYAAAGEVSKMR+NEE YG IN+ GPL PPRKPSPVSVP+KNREPDA VY
Subjt:  NSHVWGSSGSPQSTLCAMGSGCGCKQGSSRGSPNGHYQ--ASHPQLTLDLLYAAAGEVSKMRMNEETYGFINSCGPLAPPRKPSPVSVPLKNREPDAEVY

Query:  QQLQASQFLHLRRQQLIEQMNS------AARVGQTKG-SVRHPQ--------PHMLQNRGRNNEFFNGRNCRSATTGLPSQPTWAAPPRKHTVNPPPNGS
        QQLQASQFLHLRRQQL+EQ+NS      AARVGQ+KG SVR+ Q        P M QNRGRN++FF+GRNCR A +GLPS PTWAA PRKH VNPPPNGS
Subjt:  QQLQASQFLHLRRQQLIEQMNS------AARVGQTKG-SVRHPQ--------PHMLQNRGRNNEFFNGRNCRSATTGLPSQPTWAAPPRKHTVNPPPNGS

Query:  GMRAVFLGAPGGKRECAGTGVFLPRQAGGTVTETRKKPACSTVLVPARVMQALNLNLDDMYVQRIQP-PQLQSRSPPVYIAGKNDVSVRNKSESL--QQK
        GMRAVFLG PGGKRECAGTGVFLPRQ  G V+E+RKKPACSTVLVPARVMQALNLNLDDMYVQRIQP P LQSRSPPV+ AGKNDV VR +SE L  QQK
Subjt:  GMRAVFLGAPGGKRECAGTGVFLPRQAGGTVTETRKKPACSTVLVPARVMQALNLNLDDMYVQRIQP-PQLQSRSPPVYIAGKNDVSVRNKSESL--QQK

Query:  GNLRTAVPAVNHEIGLPQEWTY
        GNLR AVP VNH+I LPQEWTY
Subjt:  GNLRTAVPAVNHEIGLPQEWTY

XP_038883347.1 uncharacterized protein LOC120074329 [Benincasa hispida]8.7e-20790.7Show/hide
Query:  MAESLDDGEFWLPPKFLNDDDLFIEEKCAGNDLKNGRNGVGLYPFGSFGFTSDLGSPVESLVGSSETESDEEEYIAGLKHRLTRSTLEDGFGLDNSHVWG
        MAESLDDGEFWLPPKFLNDDDLF+EEKC GND+KNGR+GVGLYPFG FG+ SDLGSPVESLVGSSETESDEEEYIAGL H++TRSTLEDGFGLDNSHVWG
Subjt:  MAESLDDGEFWLPPKFLNDDDLFIEEKCAGNDLKNGRNGVGLYPFGSFGFTSDLGSPVESLVGSSETESDEEEYIAGLKHRLTRSTLEDGFGLDNSHVWG

Query:  SSGSPQSTLCAMGSGCGCKQGSSRGSPNGHYQASHPQLTLDLLYAAAGEVSKMRMNEETYGFINSCGPLAPPRKPSPVSVPLKNREPDAEVYQQLQASQF
        SSGSPQSTLCAMGSGCGCKQGSSRGSPNGHYQASHPQLTLDLL+AAAGEVSKMRMNEE YGFINS GPLAPPRKPSPVSVPLKNREP+AEVYQQLQASQF
Subjt:  SSGSPQSTLCAMGSGCGCKQGSSRGSPNGHYQASHPQLTLDLLYAAAGEVSKMRMNEETYGFINSCGPLAPPRKPSPVSVPLKNREPDAEVYQQLQASQF

Query:  LHLRRQQLIEQMNSAARVGQTKGSVRHPQPHMLQNRGRNNEFFNGRNCRSATTGLPSQPTWAAPPRKHTVNPPPNGSGMRAVFLGAPGGKRECAGTGVFL
        LHLRRQQLIEQMNS ARV QTKGSVRH QP MLQNRGRN+EFFNGRNCRSAT GL SQPTWAAPPRKHTVNPPPNGSGMRAVFLGAPGGKRECAGTGVFL
Subjt:  LHLRRQQLIEQMNSAARVGQTKGSVRHPQPHMLQNRGRNNEFFNGRNCRSATTGLPSQPTWAAPPRKHTVNPPPNGSGMRAVFLGAPGGKRECAGTGVFL

Query:  PRQAGGTVTETRKKPACSTVLVPARVMQALNLNLDDMYVQRIQPPQLQSRSPPVYIAGKNDVSVRNKSESL--QQKGNLRTAVPAVNHEIGLPQEWTY
        PRQAGGTV+E RKKPACSTVLVPARVMQALNLNLDDMYVQRIQP QLQ+RSP  + AGKNDVSVR +SESL  Q K NLR AVPAVNH+IGLPQEWTY
Subjt:  PRQAGGTVTETRKKPACSTVLVPARVMQALNLNLDDMYVQRIQPPQLQSRSPPVYIAGKNDVSVRNKSESL--QQKGNLRTAVPAVNHEIGLPQEWTY

TrEMBL top hitse value%identityAlignment
A0A0A0L2G7 Uncharacterized protein3.8e-21693.95Show/hide
Query:  MAESLDDGEFWLPPKFLNDDDLFIEEKCAGNDLKNGRNGVGLYPFGSFGFTSDLGSPVESLVGSSETESDEEEYIAGLKHRLTRSTLEDGFGLDNSHVWG
        MAESLDDGEFWLPPKFLNDDDLFIEEKC GNDLK+GRNGVGLYPFGSFGFTSDLGSPVESLVGSSETESDEEEYIAGL HR+TRSTLEDGFGLDNSHVWG
Subjt:  MAESLDDGEFWLPPKFLNDDDLFIEEKCAGNDLKNGRNGVGLYPFGSFGFTSDLGSPVESLVGSSETESDEEEYIAGLKHRLTRSTLEDGFGLDNSHVWG

Query:  SSGSPQSTLCAMGSGCGCKQGSSRGSPNGHYQASHPQLTLDLLYAAAGEVSKMRMNEETYGFINSCGPLAPPRKPSPVSVPLKNREPDAEVYQQLQASQF
        SSGSPQSTLCAMGSGCGCKQ SSRGSPNGHYQASHPQLTLDLLYAAAGEVSKMRMNEE YGFINSCGPLAPPRKPSPVSVPLKNREPD EVYQQLQASQF
Subjt:  SSGSPQSTLCAMGSGCGCKQGSSRGSPNGHYQASHPQLTLDLLYAAAGEVSKMRMNEETYGFINSCGPLAPPRKPSPVSVPLKNREPDAEVYQQLQASQF

Query:  LHLRRQQLIEQMNSAARVGQTKGSVRHPQPHMLQNRGRNNEFFNGRNCRSATTGLPSQPTWAAPPRK-HTVNPPPNGSGMRAVFLGAPGGKRECAGTGVF
        LHLRRQQLIEQMNSAARVGQTKG+VR PQP M QNRGRNNEFFNGRNCRSATTGLPSQPTW APPRK HTVNPP NGSGMRAVFLGAPGGKRECAGTGVF
Subjt:  LHLRRQQLIEQMNSAARVGQTKGSVRHPQPHMLQNRGRNNEFFNGRNCRSATTGLPSQPTWAAPPRK-HTVNPPPNGSGMRAVFLGAPGGKRECAGTGVF

Query:  LPRQAGGTVTETRKKPACSTVLVPARVMQALNLNLDDMYVQRIQPPQLQSRSPPVYIAGKNDVSVRNKSESLQQKGNLRTAVPAVNHEIGLPQEWTY
        LPRQAG  ++ETRKKPACSTVLVPARVMQALNLNLDDMYVQR+ PPQLQSRSPPV+ AGKNDVSVRN+SESLQQKGNLR AVPAVNHEIGLPQEWTY
Subjt:  LPRQAGGTVTETRKKPACSTVLVPARVMQALNLNLDDMYVQRIQPPQLQSRSPPVYIAGKNDVSVRNKSESLQQKGNLRTAVPAVNHEIGLPQEWTY

A0A1S3BKD4 uncharacterized protein LOC1034908082.5e-22899.24Show/hide
Query:  MAESLDDGEFWLPPKFLNDDDLFIEEKCAGNDLKNGRNGVGLYPFGSFGFTSDLGSPVESLVGSSETESDEEEYIAGLKHRLTRSTLEDGFGLDNSHVWG
        MAESLDDGEFWLPPKFLNDDDLFIEEKCAGNDLKNGRNGVGLYPFGSFGFTSDLGSPVESLVGSSETESDEEEYIAGL HRLTRSTLEDGFGLDNSHVWG
Subjt:  MAESLDDGEFWLPPKFLNDDDLFIEEKCAGNDLKNGRNGVGLYPFGSFGFTSDLGSPVESLVGSSETESDEEEYIAGLKHRLTRSTLEDGFGLDNSHVWG

Query:  SSGSPQSTLCAMGSGCGCKQGSSRGSPNGHYQASHPQLTLDLLYAAAGEVSKMRMNEETYGFINSCGPLAPPRKPSPVSVPLKNREPDAEVYQQLQASQF
        SSGSPQSTLCAMGSGCGCKQGSSRGSPNGHYQASHPQLTLDLLYAAAGEVSKMRMNEETYGFINSCGPLAPPRKPSPVSVPLKNREPDAEVYQQLQASQF
Subjt:  SSGSPQSTLCAMGSGCGCKQGSSRGSPNGHYQASHPQLTLDLLYAAAGEVSKMRMNEETYGFINSCGPLAPPRKPSPVSVPLKNREPDAEVYQQLQASQF

Query:  LHLRRQQLIEQMNSAARVGQTKGSVRHPQPHMLQNRGRNNEFFNGRNCRSATTGLPSQPTWAAPPRKHTVNPPPNGSGMRAVFLGAPGGKRECAGTGVFL
        LHLRRQQLIEQMNSA RVGQTKGSVRHPQP MLQNRGRNNEFFNGRNCRSATTGLPSQPTWAAPPRKHTVNPPPNGSGMRAVFLGAPGGKRECAGTGVFL
Subjt:  LHLRRQQLIEQMNSAARVGQTKGSVRHPQPHMLQNRGRNNEFFNGRNCRSATTGLPSQPTWAAPPRKHTVNPPPNGSGMRAVFLGAPGGKRECAGTGVFL

Query:  PRQAGGTVTETRKKPACSTVLVPARVMQALNLNLDDMYVQRIQPPQLQSRSPPVYIAGKNDVSVRNKSESLQQKGNLRTAVPAVNHEIGLPQEWTY
        PRQAGGTVTETRKKPACSTVLVPARVMQALNLNLDDMYVQRIQPPQLQSRSPPVYIAGKNDVSVRNKSESLQQKGNLRTAVPAVNHEIGLPQEWTY
Subjt:  PRQAGGTVTETRKKPACSTVLVPARVMQALNLNLDDMYVQRIQPPQLQSRSPPVYIAGKNDVSVRNKSESLQQKGNLRTAVPAVNHEIGLPQEWTY

A0A5A7TPQ0 Uncharacterized protein2.5e-22899.24Show/hide
Query:  MAESLDDGEFWLPPKFLNDDDLFIEEKCAGNDLKNGRNGVGLYPFGSFGFTSDLGSPVESLVGSSETESDEEEYIAGLKHRLTRSTLEDGFGLDNSHVWG
        MAESLDDGEFWLPPKFLNDDDLFIEEKCAGNDLKNGRNGVGLYPFGSFGFTSDLGSPVESLVGSSETESDEEEYIAGL HRLTRSTLEDGFGLDNSHVWG
Subjt:  MAESLDDGEFWLPPKFLNDDDLFIEEKCAGNDLKNGRNGVGLYPFGSFGFTSDLGSPVESLVGSSETESDEEEYIAGLKHRLTRSTLEDGFGLDNSHVWG

Query:  SSGSPQSTLCAMGSGCGCKQGSSRGSPNGHYQASHPQLTLDLLYAAAGEVSKMRMNEETYGFINSCGPLAPPRKPSPVSVPLKNREPDAEVYQQLQASQF
        SSGSPQSTLCAMGSGCGCKQGSSRGSPNGHYQASHPQLTLDLLYAAAGEVSKMRMNEETYGFINSCGPLAPPRKPSPVSVPLKNREPDAEVYQQLQASQF
Subjt:  SSGSPQSTLCAMGSGCGCKQGSSRGSPNGHYQASHPQLTLDLLYAAAGEVSKMRMNEETYGFINSCGPLAPPRKPSPVSVPLKNREPDAEVYQQLQASQF

Query:  LHLRRQQLIEQMNSAARVGQTKGSVRHPQPHMLQNRGRNNEFFNGRNCRSATTGLPSQPTWAAPPRKHTVNPPPNGSGMRAVFLGAPGGKRECAGTGVFL
        LHLRRQQLIEQMNSA RVGQTKGSVRHPQP MLQNRGRNNEFFNGRNCRSATTGLPSQPTWAAPPRKHTVNPPPNGSGMRAVFLGAPGGKRECAGTGVFL
Subjt:  LHLRRQQLIEQMNSAARVGQTKGSVRHPQPHMLQNRGRNNEFFNGRNCRSATTGLPSQPTWAAPPRKHTVNPPPNGSGMRAVFLGAPGGKRECAGTGVFL

Query:  PRQAGGTVTETRKKPACSTVLVPARVMQALNLNLDDMYVQRIQPPQLQSRSPPVYIAGKNDVSVRNKSESLQQKGNLRTAVPAVNHEIGLPQEWTY
        PRQAGGTVTETRKKPACSTVLVPARVMQALNLNLDDMYVQRIQPPQLQSRSPPVYIAGKNDVSVRNKSESLQQKGNLRTAVPAVNHEIGLPQEWTY
Subjt:  PRQAGGTVTETRKKPACSTVLVPARVMQALNLNLDDMYVQRIQPPQLQSRSPPVYIAGKNDVSVRNKSESLQQKGNLRTAVPAVNHEIGLPQEWTY

A0A6J1DF19 uncharacterized protein LOC1110194794.9e-17979.62Show/hide
Query:  MAESLDDGEFWLPPKFLNDDDLFIEEKCAGNDLKNGRNGVGLYP------FGSFGFTSDLGSPVESLVGSSETESDEEEYIAGLKHRLTRSTLEDGFGLD
        MAESLDDGEFWLPPKFLNDDDLF+E+KC GND+KNGR+GV  YP      FG FG TSDLGSPVESL+GSSETESDEEEYIAGL H++ RSTLEDGFGLD
Subjt:  MAESLDDGEFWLPPKFLNDDDLFIEEKCAGNDLKNGRNGVGLYP------FGSFGFTSDLGSPVESLVGSSETESDEEEYIAGLKHRLTRSTLEDGFGLD

Query:  NSHVWGSSGSPQSTLCAMGSGCGCKQGSSRGSPNGHYQ--ASHPQLTLDLLYAAAGEVSKMRMNEETYGFINSCGPLAPPRKPSPVSVPLKNREPDAEVY
        NSH WGSSGSPQSTLCA+GSGCGCKQG SRGSPNGHY   AS PQLTLDLLYAAAGEVSKMR+NEE YG IN+ GPL PPRKPSPVSVP+KNREPDA VY
Subjt:  NSHVWGSSGSPQSTLCAMGSGCGCKQGSSRGSPNGHYQ--ASHPQLTLDLLYAAAGEVSKMRMNEETYGFINSCGPLAPPRKPSPVSVPLKNREPDAEVY

Query:  QQLQASQFLHLRRQQLIEQMNS------AARVGQTKG-SVRHPQ--------PHMLQNRGRNNEFFNGRNCRSATTGLPSQPTWAAPPRKHTVNPPPNGS
        QQLQASQFLHLRRQQL+EQ+NS      AARVGQ+KG SVR+ Q        P M QNRGRN++FF+GRNCR A +GLPS PTWAA PRKH VNPPPNGS
Subjt:  QQLQASQFLHLRRQQLIEQMNS------AARVGQTKG-SVRHPQ--------PHMLQNRGRNNEFFNGRNCRSATTGLPSQPTWAAPPRKHTVNPPPNGS

Query:  GMRAVFLGAPGGKRECAGTGVFLPRQAGGTVTETRKKPACSTVLVPARVMQALNLNLDDMYVQRIQP-PQLQSRSPPVYIAGKNDVSVRNKSESL--QQK
        GMRAVFLG PGGKRECAGTGVFLPRQ  G V+E+RKKPACSTVLVPARVMQALNLNLDDMYVQRIQP P LQSRSPPV+ AGKNDV VR +SE L  QQK
Subjt:  GMRAVFLGAPGGKRECAGTGVFLPRQAGGTVTETRKKPACSTVLVPARVMQALNLNLDDMYVQRIQP-PQLQSRSPPVYIAGKNDVSVRNKSESL--QQK

Query:  GNLRTAVPAVNHEIGLPQEWTY
        GNLR AVP VNH+I LPQEWTY
Subjt:  GNLRTAVPAVNHEIGLPQEWTY

A0A6J1IPE8 uncharacterized protein LOC1114782596.6e-16876.67Show/hide
Query:  MAESLDDGEFWLPPKFLNDDDLFIEEKCAGNDLKNGRNGVGLYPF------GSFGFTSDLGSPVESLVGSSETESDEEEYIAGLKHRLTRSTLEDGFGLD
        MA+SLDDGEFWLPPKFLNDDDLF+E    G+D+KNGR GVGLYPF      G+FG TSDL SPVESLVGSSETESDEEEYIAGL H++ RSTLEDGFGLD
Subjt:  MAESLDDGEFWLPPKFLNDDDLFIEEKCAGNDLKNGRNGVGLYPF------GSFGFTSDLGSPVESLVGSSETESDEEEYIAGLKHRLTRSTLEDGFGLD

Query:  NSHVWGSSGSPQSTLCAMGSGCGCKQGSSRGSPNGHYQASHPQLTLDLLYAAAGEVSKMRMNEETYGFINSCGPLAPPRKPSPVSVPLKNREPDAEVYQQ
         SH WG SGSPQSTLC++GSGCGCKQGSSRGSPNGH+QASHPQLTLDLL+AAAGEV+KMRMNEE+YGFIN+  PLAPPRKP PVSVPLKN E DA VYQQ
Subjt:  NSHVWGSSGSPQSTLCAMGSGCGCKQGSSRGSPNGHYQASHPQLTLDLLYAAAGEVSKMRMNEETYGFINSCGPLAPPRKPSPVSVPLKNREPDAEVYQQ

Query:  LQASQFLHLRRQQLIEQMNSAARVGQTKGSVRHPQPHMLQNRGRNNEFFNGRNCRSATTGLPSQPTWAAPPRKHTVNPPPNGSGMRAVFLGAPGGKRECA
        LQAS+FL+LRRQQLI+Q+N+ AR+GQTK +    QP + QN+GRN EF NGRNCRS ++GL  Q TWA PPRKH+VNPPPNGS MRAVFLGAPGGKRECA
Subjt:  LQASQFLHLRRQQLIEQMNSAARVGQTKGSVRHPQPHMLQNRGRNNEFFNGRNCRSATTGLPSQPTWAAPPRKHTVNPPPNGSGMRAVFLGAPGGKRECA

Query:  GTGVFLPRQAGGTVTETRKKPACSTVLVPARVMQALNLNLDDMYVQRIQPPQLQSRSPPVYIAGK-NDVSVRNKSESLQQKGNLRTAVPAVNHEIGLPQE
        GTGVFLPRQ  GTV+E RKKPACSTVLVPARVMQALNLNLDDMYVQR+QP QLQSR PPV IA K NDV+ R ++ES      LR   P VNH+I LPQE
Subjt:  GTGVFLPRQAGGTVTETRKKPACSTVLVPARVMQALNLNLDDMYVQRIQPPQLQSRSPPVYIAGK-NDVSVRNKSESLQQKGNLRTAVPAVNHEIGLPQE

Query:  WTY
        W+Y
Subjt:  WTY

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G39870.1 unknown protein9.5e-1026.33Show/hide
Query:  YPFGSFGFTSDLGSPVESLVGSSETESDEEEYIAGLKHRLTRST--LEDGFGLDNSHVWGSSGSPQSTLCAMGSGCGCKQGSS-RGSPNGHYQASHPQLT
        Y F S  F+    SP +S     E+  DEE+++AGL  RL  ST  L             ++ SPQSTL  +GS       S    SP     +      
Subjt:  YPFGSFGFTSDLGSPVESLVGSSETESDEEEYIAGLKHRLTRST--LEDGFGLDNSHVWGSSGSPQSTLCAMGSGCGCKQGSS-RGSPNGHYQASHPQLT

Query:  LDLLYAAAGEVSKMRMNEETYGFINSCGPLAPPRKPSPVSVPLKNREPDAEVYQQLQASQFLHLRRQQLIEQMNSAARVGQTKGSVRHPQPHMLQNRGRN
         D++ AAAGEV+++++           G   P   P      L  R+ +A ++ +LQ         Q+LIEQM   +   + K S       ++   G  
Subjt:  LDLLYAAAGEVSKMRMNEETYGFINSCGPLAPPRKPSPVSVPLKNREPDAEVYQQLQASQFLHLRRQQLIEQMNSAARVGQTKGSVRHPQPHMLQNRGRN

Query:  NEFFNGRNCRSATTGLPSQPTWAAPPRKHTVNPPPNGSGMRAVFLGAPGGKRECAGTGVFLPRQAGGTVTETRKKPACSTVLVPARVMQALNLNLDDMYV
          F N R  R       + PTW  P +                   A   KR  AGTGVFLPR+          K   +T  +    ++  NLN D+   
Subjt:  NEFFNGRNCRSATTGLPSQPTWAAPPRKHTVNPPPNGSGMRAVFLGAPGGKRECAGTGVFLPRQAGGTVTETRKKPACSTVLVPARVMQALNLNLDDMYV

Query:  QRIQPPQLQSRSPPVYIAGKNDVSVRNKSESLQQKGNLRTAVPAVNHEIGLPQEWTY
          I  P+         +A         +S  L ++GN R          GLPQ+W Y
Subjt:  QRIQPPQLQSRSPPVYIAGKNDVSVRNKSESLQQKGNLRTAVPAVNHEIGLPQEWTY

AT3G54000.1 CONTAINS InterPro DOMAIN/s: Uncharacterised conserved protein UCP022260 (InterPro:IPR016802); Has 94 Blast hits to 94 proteins in 14 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 94; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink).1.4e-2933.26Show/hide
Query:  LDDGEFWLPPKFLNDDDLFIEEKCAGNDLKNGRNGVG----LYP------FGSFGFTSDLGSPVESLVGSSETESDEEEYIAGLKHRLTRSTLEDGF--G
        +DD EFWLP +FL DDD  +E++          N VG    L+P      FG+FG T         +  ++  E DEE ++AGL  ++  S+L+D F  G
Subjt:  LDDGEFWLPPKFLNDDDLFIEEKCAGNDLKNGRNGVG----LYP------FGSFGFTSDLGSPVESLVGSSETESDEEEYIAGLKHRLTRSTLEDGF--G

Query:  LDNSH---------VWGSSGSPQSTLCAMGSGCGCKQGSSRGSPNGHYQASHPQLTLDLLYAAAGEVSKMRMNEETYGFINSCGPLAPPRKPSPVSVPLK
        +  +H          W  + SP    C  G+GC C   + R + N + + S        LY AA    +M +N+E Y   +  G L  P K   +S  +K
Subjt:  LDNSH---------VWGSSGSPQSTLCAMGSGCGCKQGSSRGSPNGHYQASHPQLTLDLLYAAAGEVSKMRMNEETYGFINSCGPLAPPRKPSPVSVPLK

Query:  NREPDAE---------VYQQLQASQFLHLRRQQLIEQMNSAARVGQTKGSVRHPQPHMLQNRGRNNEFFNGRNCRSATTGLPSQPTWAAPPRKHTVNPPP
        N   +            YQ+LQA QF  L++QQL                ++H +  + QNRG      NG N       L S   W+        N  P
Subjt:  NREPDAE---------VYQQLQASQFLHLRRQQLIEQMNSAARVGQTKGSVRHPQPHMLQNRGRNNEFFNGRNCRSATTGLPSQPTWAAPPRKHTVNPPP

Query:  NGSGMRAVFLGAPGGKRECAGTGVFLPRQAGGTV-TETRKKPACSTVLVPARVMQALNLNLDDMYVQRIQPPQLQSRSPPVYIAGKNDVSVRNKSE----
            MRAVF+G   GKR   GTGVFLPR    T  TETR+KP  STVLVPAR+ Q LNLNL +               P    A  NDVS R +S     
Subjt:  NGSGMRAVFLGAPGGKRECAGTGVFLPRQAGGTV-TETRKKPACSTVLVPARVMQALNLNLDDMYVQRIQPPQLQSRSPPVYIAGKNDVSVRNKSE----

Query:  SLQQKGNLRTAVPAVNHEIGLPQEWTY
        S Q  G +R        E  LP EW Y
Subjt:  SLQQKGNLRTAVPAVNHEIGLPQEWTY

AT3G54000.2 unknown protein1.3e-1932.07Show/hide
Query:  LDDGEFWLPPKFLNDDDLFIEEKCAGNDLKNGRNGVG----LYP------FGSFGFTSDLGSPVESLVGSSETESDEEEYIAGLKHRLTRSTLEDGF--G
        +DD EFWLP +FL DDD  +E++          N VG    L+P      FG+FG T         +  ++  E DEE ++AGL  ++  S+L+D F  G
Subjt:  LDDGEFWLPPKFLNDDDLFIEEKCAGNDLKNGRNGVG----LYP------FGSFGFTSDLGSPVESLVGSSETESDEEEYIAGLKHRLTRSTLEDGF--G

Query:  LDNSH---------VWGSSGSPQSTLCAMGSGCGCKQGSSRGSPNGHYQASHPQLTLDLLYAAAGEVSKMRMNEETYGFINSCGPLAPPRKPSPVSVPLK
        +  +H          W  + SP    C  G+GC C   + R + N + + S        LY AA    +M +N+E Y   +  G L  P K   +S  +K
Subjt:  LDNSH---------VWGSSGSPQSTLCAMGSGCGCKQGSSRGSPNGHYQASHPQLTLDLLYAAAGEVSKMRMNEETYGFINSCGPLAPPRKPSPVSVPLK

Query:  NREPDAE---------VYQQLQASQFLHLRRQQLIEQMNSAARVGQTKGSVRHPQPHMLQNRGRNNEFFNGRNCRSATTGLPSQPTWAAPPRKHTVNPPP
        N   +            YQ+LQA QF  L++QQL                ++H +  + QNRG      NG N       L S   W+        N  P
Subjt:  NREPDAE---------VYQQLQASQFLHLRRQQLIEQMNSAARVGQTKGSVRHPQPHMLQNRGRNNEFFNGRNCRSATTGLPSQPTWAAPPRKHTVNPPP

Query:  NGSGMRAVFLGAPGGKRECAGTGVFLPRQAGGTV-TETRKKPA
            MRAVF+G   GKR   GTGVFLPR    T  TETR+KP+
Subjt:  NGSGMRAVFLGAPGGKRECAGTGVFLPRQAGGTV-TETRKKPA

AT3G54000.3 unknown protein1.3e-1932.07Show/hide
Query:  LDDGEFWLPPKFLNDDDLFIEEKCAGNDLKNGRNGVG----LYP------FGSFGFTSDLGSPVESLVGSSETESDEEEYIAGLKHRLTRSTLEDGF--G
        +DD EFWLP +FL DDD  +E++          N VG    L+P      FG+FG T         +  ++  E DEE ++AGL  ++  S+L+D F  G
Subjt:  LDDGEFWLPPKFLNDDDLFIEEKCAGNDLKNGRNGVG----LYP------FGSFGFTSDLGSPVESLVGSSETESDEEEYIAGLKHRLTRSTLEDGF--G

Query:  LDNSH---------VWGSSGSPQSTLCAMGSGCGCKQGSSRGSPNGHYQASHPQLTLDLLYAAAGEVSKMRMNEETYGFINSCGPLAPPRKPSPVSVPLK
        +  +H          W  + SP    C  G+GC C   + R + N + + S        LY AA    +M +N+E Y   +  G L  P K   +S  +K
Subjt:  LDNSH---------VWGSSGSPQSTLCAMGSGCGCKQGSSRGSPNGHYQASHPQLTLDLLYAAAGEVSKMRMNEETYGFINSCGPLAPPRKPSPVSVPLK

Query:  NREPDAE---------VYQQLQASQFLHLRRQQLIEQMNSAARVGQTKGSVRHPQPHMLQNRGRNNEFFNGRNCRSATTGLPSQPTWAAPPRKHTVNPPP
        N   +            YQ+LQA QF  L++QQL                ++H +  + QNRG      NG N       L S   W+        N  P
Subjt:  NREPDAE---------VYQQLQASQFLHLRRQQLIEQMNSAARVGQTKGSVRHPQPHMLQNRGRNNEFFNGRNCRSATTGLPSQPTWAAPPRKHTVNPPP

Query:  NGSGMRAVFLGAPGGKRECAGTGVFLPRQAGGTV-TETRKKPA
            MRAVF+G   GKR   GTGVFLPR    T  TETR+KP+
Subjt:  NGSGMRAVFLGAPGGKRECAGTGVFLPRQAGGTV-TETRKKPA

AT5G59050.1 unknown protein4.4e-1541.54Show/hide
Query:  SGMRAVFLGAPGGKRECAGTGVFLPRQAGGTVTETRKKPACSTVLVPARVMQALNLNLDDMYVQRIQPPQLQSRSPPVYIA------GKNDVSVRNKSES
        SG++AVF+   G +    GTGVFLPR   GTV E+RKK  CSTV++PARV++AL ++ D + V    P    S  PP + A       K   S +N S S
Subjt:  SGMRAVFLGAPGGKRECAGTGVFLPRQAGGTVTETRKKPACSTVLVPARVMQALNLNLDDMYVQRIQPPQLQSRSPPVYIA------GKNDVSVRNKSES

Query:  LQQKGN-LRTAVPAVNHE---IGLPQEWTY
          Q G+     + A +H+     LPQEWTY
Subjt:  LQQKGN-LRTAVPAVNHE---IGLPQEWTY


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCCGAGAGTTTGGATGATGGGGAGTTTTGGCTTCCTCCTAAGTTTCTTAACGACGATGACTTGTTCATTGAGGAAAAGTGTGCGGGTAATGATCTTAAGAATGGGAG
AAATGGTGTTGGGTTGTACCCATTTGGTTCTTTTGGGTTTACTTCTGATCTCGGTTCGCCGGTTGAATCTCTGGTTGGTTCCAGCGAAACAGAGAGTGATGAGGAGGAAT
ACATCGCTGGATTGAAGCACCGATTGACGCGTTCCACTCTGGAAGATGGTTTTGGTCTTGACAACTCTCACGTTTGGGGATCTTCTGGTTCACCACAGTCAACGTTGTGC
GCTATGGGAAGTGGGTGCGGCTGTAAACAGGGCTCGAGCAGGGGAAGCCCTAATGGACATTACCAAGCTTCTCATCCACAGTTAACTTTGGATCTACTCTATGCTGCTGC
CGGTGAAGTCTCGAAGATGCGGATGAATGAAGAAACGTACGGTTTTATTAACTCTTGTGGACCTCTTGCTCCGCCTAGAAAGCCTTCTCCCGTCTCTGTTCCGCTCAAAA
ACCGCGAACCCGACGCCGAAGTTTACCAGCAGCTGCAGGCTTCTCAATTTCTGCATCTGAGACGACAGCAGCTTATCGAGCAAATGAACTCTGCCGCTCGTGTGGGACAA
ACGAAGGGTTCTGTGAGACACCCTCAACCCCACATGCTGCAAAACAGAGGAAGAAATAACGAGTTCTTCAATGGTAGAAACTGCCGCTCTGCAACTACTGGCTTACCGTC
CCAACCCACTTGGGCGGCTCCTCCACGGAAACACACTGTGAACCCCCCACCCAACGGTTCTGGCATGAGAGCGGTATTTCTAGGCGCTCCTGGCGGGAAGAGGGAATGCG
CCGGTACGGGTGTGTTTTTACCTCGACAAGCCGGCGGTACCGTCACTGAAACACGCAAGAAGCCAGCTTGTTCGACTGTTTTGGTTCCTGCTAGAGTGATGCAAGCCCTG
AATCTGAACTTAGACGACATGTACGTTCAACGTATTCAACCCCCACAACTTCAATCTCGTTCCCCTCCAGTTTACATCGCAGGGAAGAACGATGTTTCTGTAAGGAACAA
AAGTGAAAGTTTGCAGCAGAAGGGAAACCTCCGAACGGCGGTGCCGGCAGTAAACCATGAGATTGGGCTTCCACAAGAGTGGACTTACTGA
mRNA sequenceShow/hide mRNA sequence
TCCGAGGCTTCCTCTTCCGACCCTTCTTCTTCTCTGTATTTTTGTTTCCATTCTTCTTTCTCCATAATACATACATACCCTCTCACCCTCCCTCTCCTTTCTTCCTCCCT
CTCTCTTTTCCACTCTCCTTTTTTAAGGGTTCCCACCACTGTTTGCTTCTTAAACCCTATTTCCCTCTGTTTTTAATCTTTCTTTTTCTTCTGATTTGACTGGGAACCTT
TTCGCCGCTTCAAACCCTACTCATACTCACTCTTCTTCTTCTTCCTCTTCTTCCCTCTTTTTATTATTAAACCTTAACAAAGCCGCAAAACACAATCTCAAATATACTCT
TCCGCCTTTCACTCCACTCCTCTCTCTGTTTTCTTTTTAGTCTACTCCGGATAACTAGTCCTTCTTCTTTGCTTCTCCTTCTCCTTTTCTGTTTTGTTTTTTTTTTTTAA
CCTTTCTGTAATGGCCGAGAGTTTGGATGATGGGGAGTTTTGGCTTCCTCCTAAGTTTCTTAACGACGATGACTTGTTCATTGAGGAAAAGTGTGCGGGTAATGATCTTA
AGAATGGGAGAAATGGTGTTGGGTTGTACCCATTTGGTTCTTTTGGGTTTACTTCTGATCTCGGTTCGCCGGTTGAATCTCTGGTTGGTTCCAGCGAAACAGAGAGTGAT
GAGGAGGAATACATCGCTGGATTGAAGCACCGATTGACGCGTTCCACTCTGGAAGATGGTTTTGGTCTTGACAACTCTCACGTTTGGGGATCTTCTGGTTCACCACAGTC
AACGTTGTGCGCTATGGGAAGTGGGTGCGGCTGTAAACAGGGCTCGAGCAGGGGAAGCCCTAATGGACATTACCAAGCTTCTCATCCACAGTTAACTTTGGATCTACTCT
ATGCTGCTGCCGGTGAAGTCTCGAAGATGCGGATGAATGAAGAAACGTACGGTTTTATTAACTCTTGTGGACCTCTTGCTCCGCCTAGAAAGCCTTCTCCCGTCTCTGTT
CCGCTCAAAAACCGCGAACCCGACGCCGAAGTTTACCAGCAGCTGCAGGCTTCTCAATTTCTGCATCTGAGACGACAGCAGCTTATCGAGCAAATGAACTCTGCCGCTCG
TGTGGGACAAACGAAGGGTTCTGTGAGACACCCTCAACCCCACATGCTGCAAAACAGAGGAAGAAATAACGAGTTCTTCAATGGTAGAAACTGCCGCTCTGCAACTACTG
GCTTACCGTCCCAACCCACTTGGGCGGCTCCTCCACGGAAACACACTGTGAACCCCCCACCCAACGGTTCTGGCATGAGAGCGGTATTTCTAGGCGCTCCTGGCGGGAAG
AGGGAATGCGCCGGTACGGGTGTGTTTTTACCTCGACAAGCCGGCGGTACCGTCACTGAAACACGCAAGAAGCCAGCTTGTTCGACTGTTTTGGTTCCTGCTAGAGTGAT
GCAAGCCCTGAATCTGAACTTAGACGACATGTACGTTCAACGTATTCAACCCCCACAACTTCAATCTCGTTCCCCTCCAGTTTACATCGCAGGGAAGAACGATGTTTCTG
TAAGGAACAAAAGTGAAAGTTTGCAGCAGAAGGGAAACCTCCGAACGGCGGTGCCGGCAGTAAACCATGAGATTGGGCTTCCACAAGAGTGGACTTACTGAAGCGACAAA
CACCGCCATTATAGAATAAAGAAAAAAGAAAGGGGTGGTGATTAGATATTTTACTAACTGCTGTGTTAAATTTTAGGGGGGGGGGTAGAGATTAAGATAAAAGAAAATGA
AGGTTTGTTAGATTTGAATGGGGTTATTATAGGGAACAGAGTATTTGTTTAATAGGAAAAAAGGGTAAGATAAGAAGAGGGGATTAAGGAAGGTGTTCTGATGAGAAAAA
GAAAGGAAAAGAGAAAGTGGTTTTTGGTTCTTTTTGGGATTAGGGATTTTAGATTAGAAGATAGAAATAAAGAGTTGTGTTTAAGTTTTTAGGGGCGGGGAGATGTTTAT
TTGTTTAATTTTGATGTGGGCTGAAAAAGCCAACTTAGAAATGAATGTAATGGGAATGAGAAGAAAAGGAAGAAGAAAAAAGAGGGGGTTTTTCCTTTTCTTTTTCGTTT
TTGGTGTTGAAAACTCTGGAAAAGAGAGAAAGAAAAGAAGGAGGGGTTTAAGTAGTAGTAAGTTGGCTTCTTTTGTTTTTTTTTTCTTTTTTTTTTTTTTTTTTTGTGAA
GGGCAAATTAGGAACTTTGTTTTGGATTCTAATTTGTACCCTCTTCCCCTCTCCTCTCCTCTCCTCTCCTCTCCCCTCCCCTCCCCATGCACTAATAAAATAATATATAT
AACAAATATTTTACTTTACC
Protein sequenceShow/hide protein sequence
MAESLDDGEFWLPPKFLNDDDLFIEEKCAGNDLKNGRNGVGLYPFGSFGFTSDLGSPVESLVGSSETESDEEEYIAGLKHRLTRSTLEDGFGLDNSHVWGSSGSPQSTLC
AMGSGCGCKQGSSRGSPNGHYQASHPQLTLDLLYAAAGEVSKMRMNEETYGFINSCGPLAPPRKPSPVSVPLKNREPDAEVYQQLQASQFLHLRRQQLIEQMNSAARVGQ
TKGSVRHPQPHMLQNRGRNNEFFNGRNCRSATTGLPSQPTWAAPPRKHTVNPPPNGSGMRAVFLGAPGGKRECAGTGVFLPRQAGGTVTETRKKPACSTVLVPARVMQAL
NLNLDDMYVQRIQPPQLQSRSPPVYIAGKNDVSVRNKSESLQQKGNLRTAVPAVNHEIGLPQEWTY