; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0014787 (gene) of Snake gourd v1 genome

Gene IDTan0014787
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionProtein of unknown function (DUF616)
Genome locationLG01:109121894..109132483
RNA-Seq ExpressionTan0014787
SyntenyTan0014787
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR006852 - Protein of unknown function DUF616


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0064330.1 Inner membrane protein oxaA [Cucumis melo var. makuwa]6.7e-30489.24Show/hide
Query:  MTGGSFGLRTGSYASLHQQLQNGALQNSATSLLVHKPSKTSLSSS--RERLIPLFCRYFGRRRVAMLLLFILALLVFVFGSFAVNRDSNDLKAPYEDTTL
        MTGGS GLRTGSYASLH QLQNGALQNSATS LVHKPSKT+LSSS  RER++PLFCRYFGRRRVAMLLLFILALLVFVFGSFAV+RDS+DLKAPYE+TTL
Subjt:  MTGGSFGLRTGSYASLHQQLQNGALQNSATSLLVHKPSKTSLSSS--RERLIPLFCRYFGRRRVAMLLLFILALLVFVFGSFAVNRDSNDLKAPYEDTTL

Query:  RDLINPAVTRTGDFKDNITSTHGNLFGRNEISSLIPPLPHPNRSLISDFFPRHSLPSPHPCGNFALPPPAPANGKRTGARPCPVCYIPVKQAIASMPIAP
        RDLIN AV +T +FKD ITSTHGNL          PP PH NRSLIS  FPRHSLPSPHPC NFALPPPAPANGKRTGARPCPVCYIPV+QAIASMPIAP
Subjt:  RDLINPAVTRTGDFKDNITSTHGNLFGRNEISSLIPPLPHPNRSLISDFFPRHSLPSPHPCGNFALPPPAPANGKRTGARPCPVCYIPVKQAIASMPIAP

Query:  SISPVLRKLTYIHDGNPIKTEPHGGSDFGGYPSLRQRNDSFNIKESMTVHCGFVKGSKPGNQSGFDIDEADLLELEEFHEVIVASAIFGNYDVLQQPTNI
        S+SPVLR LTYIHDGNPIKTEPHGGSDFGGYPSLRQRNDSF+IKESMTVHCGFVKGSKPGNQSGFDIDEADL+ELEEFHEVIVASAIFGNYDVLQQP NI
Subjt:  SISPVLRKLTYIHDGNPIKTEPHGGSDFGGYPSLRQRNDSFNIKESMTVHCGFVKGSKPGNQSGFDIDEADLLELEEFHEVIVASAIFGNYDVLQQPTNI

Query:  SEESKRFVPFYMFIDEETEAYMKNSSILDSRKRIGLWRIIVVRNVPYADSRRNGKIPKLLLHRLFPNIQYSIWIDGKLQLVVDPFQILERFLWRQNATFA
        SEESK+FVPFYMF+DEETEAYMKNSS+LDSRKRIGLWRIIVV NVPYADSRRNGKIPKLLLHRLFPNIQYSIWIDGKLQLVVDPFQILERFLWRQNATFA
Subjt:  SEESKRFVPFYMFIDEETEAYMKNSSILDSRKRIGLWRIIVVRNVPYADSRRNGKIPKLLLHRLFPNIQYSIWIDGKLQLVVDPFQILERFLWRQNATFA

Query:  ISRHYKRFDVFEEADANKAAGKYDNSSIDEQIDFYVSEGLTPYSLAKLPITSDVPEGCVLIREHVPITNLFTCLWFNEVDRFTSRDQLSFSMVRDKIMSK
        ISRHYKRFDVFEEADANKAAGKYDNSSIDEQI FYVSEGLTPYSLAKLPITSDVPEGCVLIREH+PITNLFTCLWFNEVDRFTSRDQLSFSMVRDKIMSK
Subjt:  ISRHYKRFDVFEEADANKAAGKYDNSSIDEQIDFYVSEGLTPYSLAKLPITSDVPEGCVLIREHVPITNLFTCLWFNEVDRFTSRDQLSFSMVRDKIMSK

Query:  VNWSINMFLDCERRNFVIQTYHRELLEHMPPPAVAVIRRPPPVPDGRTVPKSPVHIVLK-PPVKRNPPKRGKGERRSTSKRHRKVIAGHREDNLF
        VNWS+NMFLDCERRNFVIQTYHRELLEHMPPPA A++ RP  VPDG TV K  VHIV K PPVKRN  KRG+ ++RSTSKRHRKVIAGHREDN F
Subjt:  VNWSINMFLDCERRNFVIQTYHRELLEHMPPPAVAVIRRPPPVPDGRTVPKSPVHIVLK-PPVKRNPPKRGKGERRSTSKRHRKVIAGHREDNLF

XP_008452456.1 PREDICTED: uncharacterized protein LOC103493486 [Cucumis melo]6.7e-30489.24Show/hide
Query:  MTGGSFGLRTGSYASLHQQLQNGALQNSATSLLVHKPSKTSLSSS--RERLIPLFCRYFGRRRVAMLLLFILALLVFVFGSFAVNRDSNDLKAPYEDTTL
        MTGGS GLRTGSYASLH QLQNGALQNSATS LVHKPSKT+LSSS  RER++PLFCRYFGRRRVAMLLLFILALLVFVFGSFAV+RDS+DLKAPYE+TTL
Subjt:  MTGGSFGLRTGSYASLHQQLQNGALQNSATSLLVHKPSKTSLSSS--RERLIPLFCRYFGRRRVAMLLLFILALLVFVFGSFAVNRDSNDLKAPYEDTTL

Query:  RDLINPAVTRTGDFKDNITSTHGNLFGRNEISSLIPPLPHPNRSLISDFFPRHSLPSPHPCGNFALPPPAPANGKRTGARPCPVCYIPVKQAIASMPIAP
        RDLIN AV +T +FKD ITSTHGNL          PP PH NRSLIS  FPRHSLPSPHPC NFALPPPAPANGKRTGARPCPVCYIPV+QAIASMPIAP
Subjt:  RDLINPAVTRTGDFKDNITSTHGNLFGRNEISSLIPPLPHPNRSLISDFFPRHSLPSPHPCGNFALPPPAPANGKRTGARPCPVCYIPVKQAIASMPIAP

Query:  SISPVLRKLTYIHDGNPIKTEPHGGSDFGGYPSLRQRNDSFNIKESMTVHCGFVKGSKPGNQSGFDIDEADLLELEEFHEVIVASAIFGNYDVLQQPTNI
        S+SPVLR LTYIHDGNPIKTEPHGGSDFGGYPSLRQRNDSF+IKESMTVHCGFVKGSKPGNQSGFDIDEADL+ELEEFHEVIVASAIFGNYDVLQQP NI
Subjt:  SISPVLRKLTYIHDGNPIKTEPHGGSDFGGYPSLRQRNDSFNIKESMTVHCGFVKGSKPGNQSGFDIDEADLLELEEFHEVIVASAIFGNYDVLQQPTNI

Query:  SEESKRFVPFYMFIDEETEAYMKNSSILDSRKRIGLWRIIVVRNVPYADSRRNGKIPKLLLHRLFPNIQYSIWIDGKLQLVVDPFQILERFLWRQNATFA
        SEESK+FVPFYMF+DEETEAYMKNSS+LDSRKRIGLWRIIVV NVPYADSRRNGKIPKLLLHRLFPNIQYSIWIDGKLQLVVDPFQILERFLWRQNATFA
Subjt:  SEESKRFVPFYMFIDEETEAYMKNSSILDSRKRIGLWRIIVVRNVPYADSRRNGKIPKLLLHRLFPNIQYSIWIDGKLQLVVDPFQILERFLWRQNATFA

Query:  ISRHYKRFDVFEEADANKAAGKYDNSSIDEQIDFYVSEGLTPYSLAKLPITSDVPEGCVLIREHVPITNLFTCLWFNEVDRFTSRDQLSFSMVRDKIMSK
        ISRHYKRFDVFEEADANKAAGKYDNSSIDEQI FYVSEGLTPYSLAKLPITSDVPEGCVLIREH+PITNLFTCLWFNEVDRFTSRDQLSFSMVRDKIMSK
Subjt:  ISRHYKRFDVFEEADANKAAGKYDNSSIDEQIDFYVSEGLTPYSLAKLPITSDVPEGCVLIREHVPITNLFTCLWFNEVDRFTSRDQLSFSMVRDKIMSK

Query:  VNWSINMFLDCERRNFVIQTYHRELLEHMPPPAVAVIRRPPPVPDGRTVPKSPVHIVLK-PPVKRNPPKRGKGERRSTSKRHRKVIAGHREDNLF
        VNWS+NMFLDCERRNFVIQTYHRELLEHMPPPA A++ RP  VPDG TV K  VHIV K PPVKRN  KRG+ ++RSTSKRHRKVIAGHREDN F
Subjt:  VNWSINMFLDCERRNFVIQTYHRELLEHMPPPAVAVIRRPPPVPDGRTVPKSPVHIVLK-PPVKRNPPKRGKGERRSTSKRHRKVIAGHREDNLF

XP_022139453.1 uncharacterized protein LOC111010380 [Momordica charantia]2.9e-30788.91Show/hide
Query:  MTGGSFGLRTGSYASLHQQLQNGALQNSATSLLVHKPSKTSLSSS--RERLIPLFCRYFGRRRVAMLLLFILALLVFVFGSFAVNRDSNDLKAPYEDTTL
        MTGGS GLRTGSY SL Q LQNGALQ+SATSLLVHKPSK+ LSSS  RER+IPLFCRYFGRRRVAMLLLFILALLVFVFGSFAV+RDS DLKAPYE+TT 
Subjt:  MTGGSFGLRTGSYASLHQQLQNGALQNSATSLLVHKPSKTSLSSS--RERLIPLFCRYFGRRRVAMLLLFILALLVFVFGSFAVNRDSNDLKAPYEDTTL

Query:  RDLINPAVTRTGDFKDNITSTHGNLFGRNEISSLIPPLPHPNRSLISDFFPRHSLPSPHPCGNFALPPPAPANGKRTGARPCPVCYIPVKQAIASMPIAP
         DLINP V+RT +FKDN+TSTHGNL G NE SSL+PPLPH NRSLI+  FPR SLPSPHPC NFA+PPPAPANGKRTGARPCPVCYIPV+QAI+SMP AP
Subjt:  RDLINPAVTRTGDFKDNITSTHGNLFGRNEISSLIPPLPHPNRSLISDFFPRHSLPSPHPCGNFALPPPAPANGKRTGARPCPVCYIPVKQAIASMPIAP

Query:  SISPVLRKLTYIHDGNPIKTEPHGGSDFGGYPSLRQRNDSFNIKESMTVHCGFVKGSKPGNQSGFDIDEADLLELEEFHEVIVASAIFGNYDVLQQPTNI
        SISPVLR LTYI D NP+KTEPHGGSDFGGYPSLRQRNDSFNIKESMTVHCGFVKGSKPGNQSGFDIDEADLLELEEFHEVIVASAIFGNYDVLQQP NI
Subjt:  SISPVLRKLTYIHDGNPIKTEPHGGSDFGGYPSLRQRNDSFNIKESMTVHCGFVKGSKPGNQSGFDIDEADLLELEEFHEVIVASAIFGNYDVLQQPTNI

Query:  SEESKRFVPFYMFIDEETEAYMKNSSILDSRKRIGLWRIIVVRNVPYADSRRNGKIPKLLLHRLFPNIQYSIWIDGKLQLVVDPFQILERFLWRQNATFA
        SEESK+ VPFYMFIDEETEAYMKNSS+L+ + R+GLWRIIVVRNVPYADSRRNGKIPKLLLHRLFPNIQYSIWIDGKLQLVVDP+QILERFLWRQNATFA
Subjt:  SEESKRFVPFYMFIDEETEAYMKNSSILDSRKRIGLWRIIVVRNVPYADSRRNGKIPKLLLHRLFPNIQYSIWIDGKLQLVVDPFQILERFLWRQNATFA

Query:  ISRHYKRFDVFEEADANKAAGKYDNSSIDEQIDFYVSEGLTPYSLAKLPITSDVPEGCVLIREHVPITNLFTCLWFNEVDRFTSRDQLSFSMVRDKIMSK
        ISRHYKRFDVFEEADANKAAGKYDNSSIDEQI+FYVSEGLTPYSLAKLPITSDVPEGCVLIREH+PITNLFTCLWFNEVDRFTSRDQLSFSMVRDKIMS+
Subjt:  ISRHYKRFDVFEEADANKAAGKYDNSSIDEQIDFYVSEGLTPYSLAKLPITSDVPEGCVLIREHVPITNLFTCLWFNEVDRFTSRDQLSFSMVRDKIMSK

Query:  VNWSINMFLDCERRNFVIQTYHRELLEHMPPPAVAVIRRPPP-VPDGRTVPKSPVHIVLKPPVKRNPPKRGKGERRSTSKRHRKVIAGHREDNLF
        VNW I+MFLDCERRNFVIQTYHRELLEHMPPP+VAVIR PPP  P GRTVPK PVHIVLK PVKRNP KRG+G++RSTSKRHRKV  GHREDNLF
Subjt:  VNWSINMFLDCERRNFVIQTYHRELLEHMPPPAVAVIRRPPP-VPDGRTVPKSPVHIVLKPPVKRNPPKRGKGERRSTSKRHRKVIAGHREDNLF

XP_038897638.1 uncharacterized protein LOC120085624 isoform X2 [Benincasa hispida]0.0e+0091.43Show/hide
Query:  MTGGSFGLRTGSYASLHQQLQNGALQNSATSLLVHKPSKTSLSSS--RERLIPLFCRYFGRRRVAMLLLFILALLVFVFGSFAVNRDSNDLKAPYEDTTL
        MTGGSFGLRTGSYASLHQQLQN ALQNSATSLLVHK SKT+LSSS  RER+IPL CRYFGRRRVAMLLLFILALLVFVFGSFAV+RDS DLKAPYE+TT+
Subjt:  MTGGSFGLRTGSYASLHQQLQNGALQNSATSLLVHKPSKTSLSSS--RERLIPLFCRYFGRRRVAMLLLFILALLVFVFGSFAVNRDSNDLKAPYEDTTL

Query:  RDLINPAVTRTGDFKDNITSTHGNLFGRNEISSLIPPLPHPNRSLISDFFPRHSLPSPHPCGNFALPPPAPANGKRTGARPCPVCYIPVKQAIASMPIAP
        R LINPAV  T +FKD ITSTHGN+  R E+SSL+PP PHPNRSL+S FFPRHSLPS HPC NFALPPPAPANGKRTGARPCPVCYIPV+QAIASMPIAP
Subjt:  RDLINPAVTRTGDFKDNITSTHGNLFGRNEISSLIPPLPHPNRSLISDFFPRHSLPSPHPCGNFALPPPAPANGKRTGARPCPVCYIPVKQAIASMPIAP

Query:  SISPVLRKLTYIHDGNPIKTEPHGGSDFGGYPSLRQRNDSFNIKESMTVHCGFVKGSKPGNQSGFDIDEADLLELEEFHEVIVASAIFGNYDVLQQPTNI
        SISPVLR LTYIHD NPIKTEPHGGSDFGGYPSLRQRNDSFNIKESMTVHCGFVKGSKPGNQSGFDIDEADLLELEEFHEVIVASAIFGNYDVLQQP NI
Subjt:  SISPVLRKLTYIHDGNPIKTEPHGGSDFGGYPSLRQRNDSFNIKESMTVHCGFVKGSKPGNQSGFDIDEADLLELEEFHEVIVASAIFGNYDVLQQPTNI

Query:  SEESKRFVPFYMFIDEETEAYMKNSSILDSRKRIGLWRIIVVRNVPYADSRRNGKIPKLLLHRLFPNIQYSIWIDGKLQLVVDPFQILERFLWRQNATFA
        SEESKRF+PFYMFIDEETEAYMKNSS+LDSRKRIGLWRIIVV NVPYADSRRNGKIPKLLLHRLFPNIQYSIWIDGKLQLVVDPFQILERFLWRQNATFA
Subjt:  SEESKRFVPFYMFIDEETEAYMKNSSILDSRKRIGLWRIIVVRNVPYADSRRNGKIPKLLLHRLFPNIQYSIWIDGKLQLVVDPFQILERFLWRQNATFA

Query:  ISRHYKRFDVFEEADANKAAGKYDNSSIDEQIDFYVSEGLTPYSLAKLPITSDVPEGCVLIREHVPITNLFTCLWFNEVDRFTSRDQLSFSMVRDKIMSK
        ISRHYKRFDVFEEADANKAAGKYDNSSIDEQIDFYVSEGLTPYSLAKLPITSDVPEGCVLIREHVPITNLFTCLWFNEVDRFTSRDQLSFSMVRDKIMSK
Subjt:  ISRHYKRFDVFEEADANKAAGKYDNSSIDEQIDFYVSEGLTPYSLAKLPITSDVPEGCVLIREHVPITNLFTCLWFNEVDRFTSRDQLSFSMVRDKIMSK

Query:  VNWSINMFLDCERRNFVIQTYHRELLEHMPPPAVAVIRRPPPVPDGRTVPKSPVHIVLK-PPVKRNPPKRGKGERRSTSKRHRKVIAGHREDNLF
        VNWS+NMFLDCERRNFVIQTYHRELLEHMPPPA AVIRRPPPVPDG TV + PVHIV K PPVKRNP +RG+ ++RSTSKRHRKVI+GHREDN F
Subjt:  VNWSINMFLDCERRNFVIQTYHRELLEHMPPPAVAVIRRPPPVPDGRTVPKSPVHIVLK-PPVKRNPPKRGKGERRSTSKRHRKVIAGHREDNLF

XP_038897641.1 uncharacterized protein LOC120085624 isoform X3 [Benincasa hispida]5.1e-30490.14Show/hide
Query:  MTGGSFGLRTGSYASLHQQLQNGALQNSATSLLVHKPSKTSLSSS--RERLIPLFCRYFGRRRVAMLLLFILALLVFVFGSFAVNRDSNDLKAPYEDTTL
        MTGGSFGLRTGSYASLHQQLQN ALQNSATSLLVHK SKT+LSSS  RER+IPL CRYFGRRRVAMLLLFILALLVFVFGSFAV+RDS DLKAPYE+TT+
Subjt:  MTGGSFGLRTGSYASLHQQLQNGALQNSATSLLVHKPSKTSLSSS--RERLIPLFCRYFGRRRVAMLLLFILALLVFVFGSFAVNRDSNDLKAPYEDTTL

Query:  RDLINPAVTRTGDFKDNITSTHGNLFGRNEISSLIPPLPHPNRSLISDFFPRHSLPSPHPCGNFALPPPAPANGKRTGARPCPVCYIPVKQAIASMPIAP
        R LINPAV  T +FKD ITSTHGN+  R E+SSL+PP PHPNRSL+S FFPRHSLPS HPC NFALPPPAPANGKRTGARPCPVCYIPV+QAIASMPIAP
Subjt:  RDLINPAVTRTGDFKDNITSTHGNLFGRNEISSLIPPLPHPNRSLISDFFPRHSLPSPHPCGNFALPPPAPANGKRTGARPCPVCYIPVKQAIASMPIAP

Query:  SISPVLRKLTYIHDGNPIKTEPHGGSDFGGYPSLRQRNDSFNIKESMTVHCGFVKGSKPGNQSGFDIDEADLLELEEFHEVIVASAIFGNYDVLQQPTNI
        SISPVLR LTYIHD NPIKTEPHGGSDFGGYPSLRQRNDSFNIKESMTVHCGFVKGSKPGNQSGFDIDEADLLELEEFHEVIVASAIFGNYDVLQQP NI
Subjt:  SISPVLRKLTYIHDGNPIKTEPHGGSDFGGYPSLRQRNDSFNIKESMTVHCGFVKGSKPGNQSGFDIDEADLLELEEFHEVIVASAIFGNYDVLQQPTNI

Query:  SEESKRFVPFYMFIDEETEAYMKNSSILDSRKRIGLWRIIVVRNVPYADSRRNGKIPKLLLHRLFPNIQYSIWIDGKLQLVVDPFQILERFLWRQNATFA
        SEESKRF+PFYMFIDEETEAYMKNSS+LDSRKRIGLWRIIVV NVPYADSRRNGKIPKLLLHRLFPNIQYSIWIDGKLQLVVDPFQILERFLWRQNATFA
Subjt:  SEESKRFVPFYMFIDEETEAYMKNSSILDSRKRIGLWRIIVVRNVPYADSRRNGKIPKLLLHRLFPNIQYSIWIDGKLQLVVDPFQILERFLWRQNATFA

Query:  ISRHYKRFDVFEEADANKAAGKYDNSSIDEQIDFYVSEGLTPYSLAKLPITSDVPEGCVLIREHVPITNLFTCLWFNEVDRFTSRDQLSFSMVRDKIMSK
        ISRHYKRFDVFEEADANKAAGKYDNSSIDEQIDFYVSEGLTPYSLAKLPITSDVPEGCVLIREHVPITNLFTCLWFNEVDRFTSRDQLSFSMVRDKIMSK
Subjt:  ISRHYKRFDVFEEADANKAAGKYDNSSIDEQIDFYVSEGLTPYSLAKLPITSDVPEGCVLIREHVPITNLFTCLWFNEVDRFTSRDQLSFSMVRDKIMSK

Query:  VNWSINMFLDCERRNFVIQTYHRELLEHMPPPAVAVIRRPPPVPDGRTVPKSPVHIVLK-PPVKRNPPKRGKGERRSTSKRHRKVIAG
        VNWS+NMFLDCERRNFVIQTYHRELLEHMPPPA AVIRRPPPVPDG TV + PVHIV K PPVKRNP +RG+ ++R    R R +  G
Subjt:  VNWSINMFLDCERRNFVIQTYHRELLEHMPPPAVAVIRRPPPVPDGRTVPKSPVHIVLK-PPVKRNPPKRGKGERRSTSKRHRKVIAG

TrEMBL top hitse value%identityAlignment
A0A0A0L266 Uncharacterized protein3.7e-30088.7Show/hide
Query:  MTGGSFGLRTGSYASLHQQLQNGALQNSATSLLVHKPSKTSLSSS--RERLIPLFCRYFGRRRVAMLLLFILALLVFVFGSFAVNRDSNDLKAPYEDTTL
        MTGGS GLRTGSYASLH QLQNGALQNSATS LVHK SKT+LSS+  RER+IPL CRYFGRRRVAMLLLFILALLVFVFGSFAV+RDS+DLKAPYE+TTL
Subjt:  MTGGSFGLRTGSYASLHQQLQNGALQNSATSLLVHKPSKTSLSSS--RERLIPLFCRYFGRRRVAMLLLFILALLVFVFGSFAVNRDSNDLKAPYEDTTL

Query:  RDLINPAVTRTGDFKDNITSTHGNLFGRNEISSLIPPLPHPNRSLISDFFPRHSLPSPHPCGNFALPPPAPANGKRTGARPCPVCYIPVKQAIASMPIAP
        RDL+NPAV +T +FKD ITSTHGNL          PP P+ NRSLIS  FPRHSLPSPHPC NFALPPPAPANGKRTGARPCPVCYIPV+QAIASMPIAP
Subjt:  RDLINPAVTRTGDFKDNITSTHGNLFGRNEISSLIPPLPHPNRSLISDFFPRHSLPSPHPCGNFALPPPAPANGKRTGARPCPVCYIPVKQAIASMPIAP

Query:  SISPVLRKLTYIHDGNPIKTEPHGGSDFGGYPSLRQRNDSFNIKESMTVHCGFVKGSKPGNQSGFDIDEADLLELEEFHEVIVASAIFGNYDVLQQPTNI
        SISPVLR LTYIHDGNPIKTEPHGGSDFGGYPSLRQRNDSF+IKESMTVHCGFVKGSKPGNQSGFDIDEADLLELEEFHEVIVASAIFGNYDVLQQP NI
Subjt:  SISPVLRKLTYIHDGNPIKTEPHGGSDFGGYPSLRQRNDSFNIKESMTVHCGFVKGSKPGNQSGFDIDEADLLELEEFHEVIVASAIFGNYDVLQQPTNI

Query:  SEESKRFVPFYMFIDEETEAYMKNSSILDSRKRIGLWRIIVVRNVPYADSRRNGKIPKLLLHRLFPNIQYSIWIDGKLQLVVDPFQILERFLWRQNATFA
        SEESK+FVPFYMFIDEETEAYMKNSS+LDSRKRIGLWRIIVV NVPYADSRRNGKIPKLLLHRLFPNIQYSIWIDGKLQLVVDPFQILERFLWRQNATFA
Subjt:  SEESKRFVPFYMFIDEETEAYMKNSSILDSRKRIGLWRIIVVRNVPYADSRRNGKIPKLLLHRLFPNIQYSIWIDGKLQLVVDPFQILERFLWRQNATFA

Query:  ISRHYKRFDVFEEADANKAAGKYDNSSIDEQIDFYVSEGLTPYSLAKLPITSDVPEGCVLIREHVPITNLFTCLWFNEVDRFTSRDQLSFSMVRDKIMSK
        ISRHYKRFDVFEEADANKAAGKYDNSSIDEQI FYV+EGLTPYSLAKLPITSDVPEGCVLIREH+PITNLFTCLWFNEVDRFTSRDQLSFSMVRDKIMSK
Subjt:  ISRHYKRFDVFEEADANKAAGKYDNSSIDEQIDFYVSEGLTPYSLAKLPITSDVPEGCVLIREHVPITNLFTCLWFNEVDRFTSRDQLSFSMVRDKIMSK

Query:  VNWSINMFLDCERRNFVIQTYHRELLEHMPPPAVAVIRRPPPVPDGRTVPKSPVHIVLK-PPVKRNPPKRGKGERRSTSKRHRKVIAGHREDN
        VNWS+NMFLDCERRNFVIQTYHRELLEHMPPPA  V+ RP  VPD  TV K  VHIV K PPVKRN  KRG+ +++STSKRHRKVI+GHREDN
Subjt:  VNWSINMFLDCERRNFVIQTYHRELLEHMPPPAVAVIRRPPPVPDGRTVPKSPVHIVLK-PPVKRNPPKRGKGERRSTSKRHRKVIAGHREDN

A0A1S3BTU6 uncharacterized protein LOC1034934863.2e-30489.24Show/hide
Query:  MTGGSFGLRTGSYASLHQQLQNGALQNSATSLLVHKPSKTSLSSS--RERLIPLFCRYFGRRRVAMLLLFILALLVFVFGSFAVNRDSNDLKAPYEDTTL
        MTGGS GLRTGSYASLH QLQNGALQNSATS LVHKPSKT+LSSS  RER++PLFCRYFGRRRVAMLLLFILALLVFVFGSFAV+RDS+DLKAPYE+TTL
Subjt:  MTGGSFGLRTGSYASLHQQLQNGALQNSATSLLVHKPSKTSLSSS--RERLIPLFCRYFGRRRVAMLLLFILALLVFVFGSFAVNRDSNDLKAPYEDTTL

Query:  RDLINPAVTRTGDFKDNITSTHGNLFGRNEISSLIPPLPHPNRSLISDFFPRHSLPSPHPCGNFALPPPAPANGKRTGARPCPVCYIPVKQAIASMPIAP
        RDLIN AV +T +FKD ITSTHGNL          PP PH NRSLIS  FPRHSLPSPHPC NFALPPPAPANGKRTGARPCPVCYIPV+QAIASMPIAP
Subjt:  RDLINPAVTRTGDFKDNITSTHGNLFGRNEISSLIPPLPHPNRSLISDFFPRHSLPSPHPCGNFALPPPAPANGKRTGARPCPVCYIPVKQAIASMPIAP

Query:  SISPVLRKLTYIHDGNPIKTEPHGGSDFGGYPSLRQRNDSFNIKESMTVHCGFVKGSKPGNQSGFDIDEADLLELEEFHEVIVASAIFGNYDVLQQPTNI
        S+SPVLR LTYIHDGNPIKTEPHGGSDFGGYPSLRQRNDSF+IKESMTVHCGFVKGSKPGNQSGFDIDEADL+ELEEFHEVIVASAIFGNYDVLQQP NI
Subjt:  SISPVLRKLTYIHDGNPIKTEPHGGSDFGGYPSLRQRNDSFNIKESMTVHCGFVKGSKPGNQSGFDIDEADLLELEEFHEVIVASAIFGNYDVLQQPTNI

Query:  SEESKRFVPFYMFIDEETEAYMKNSSILDSRKRIGLWRIIVVRNVPYADSRRNGKIPKLLLHRLFPNIQYSIWIDGKLQLVVDPFQILERFLWRQNATFA
        SEESK+FVPFYMF+DEETEAYMKNSS+LDSRKRIGLWRIIVV NVPYADSRRNGKIPKLLLHRLFPNIQYSIWIDGKLQLVVDPFQILERFLWRQNATFA
Subjt:  SEESKRFVPFYMFIDEETEAYMKNSSILDSRKRIGLWRIIVVRNVPYADSRRNGKIPKLLLHRLFPNIQYSIWIDGKLQLVVDPFQILERFLWRQNATFA

Query:  ISRHYKRFDVFEEADANKAAGKYDNSSIDEQIDFYVSEGLTPYSLAKLPITSDVPEGCVLIREHVPITNLFTCLWFNEVDRFTSRDQLSFSMVRDKIMSK
        ISRHYKRFDVFEEADANKAAGKYDNSSIDEQI FYVSEGLTPYSLAKLPITSDVPEGCVLIREH+PITNLFTCLWFNEVDRFTSRDQLSFSMVRDKIMSK
Subjt:  ISRHYKRFDVFEEADANKAAGKYDNSSIDEQIDFYVSEGLTPYSLAKLPITSDVPEGCVLIREHVPITNLFTCLWFNEVDRFTSRDQLSFSMVRDKIMSK

Query:  VNWSINMFLDCERRNFVIQTYHRELLEHMPPPAVAVIRRPPPVPDGRTVPKSPVHIVLK-PPVKRNPPKRGKGERRSTSKRHRKVIAGHREDNLF
        VNWS+NMFLDCERRNFVIQTYHRELLEHMPPPA A++ RP  VPDG TV K  VHIV K PPVKRN  KRG+ ++RSTSKRHRKVIAGHREDN F
Subjt:  VNWSINMFLDCERRNFVIQTYHRELLEHMPPPAVAVIRRPPPVPDGRTVPKSPVHIVLK-PPVKRNPPKRGKGERRSTSKRHRKVIAGHREDNLF

A0A5A7VAS7 Inner membrane protein oxaA3.2e-30489.24Show/hide
Query:  MTGGSFGLRTGSYASLHQQLQNGALQNSATSLLVHKPSKTSLSSS--RERLIPLFCRYFGRRRVAMLLLFILALLVFVFGSFAVNRDSNDLKAPYEDTTL
        MTGGS GLRTGSYASLH QLQNGALQNSATS LVHKPSKT+LSSS  RER++PLFCRYFGRRRVAMLLLFILALLVFVFGSFAV+RDS+DLKAPYE+TTL
Subjt:  MTGGSFGLRTGSYASLHQQLQNGALQNSATSLLVHKPSKTSLSSS--RERLIPLFCRYFGRRRVAMLLLFILALLVFVFGSFAVNRDSNDLKAPYEDTTL

Query:  RDLINPAVTRTGDFKDNITSTHGNLFGRNEISSLIPPLPHPNRSLISDFFPRHSLPSPHPCGNFALPPPAPANGKRTGARPCPVCYIPVKQAIASMPIAP
        RDLIN AV +T +FKD ITSTHGNL          PP PH NRSLIS  FPRHSLPSPHPC NFALPPPAPANGKRTGARPCPVCYIPV+QAIASMPIAP
Subjt:  RDLINPAVTRTGDFKDNITSTHGNLFGRNEISSLIPPLPHPNRSLISDFFPRHSLPSPHPCGNFALPPPAPANGKRTGARPCPVCYIPVKQAIASMPIAP

Query:  SISPVLRKLTYIHDGNPIKTEPHGGSDFGGYPSLRQRNDSFNIKESMTVHCGFVKGSKPGNQSGFDIDEADLLELEEFHEVIVASAIFGNYDVLQQPTNI
        S+SPVLR LTYIHDGNPIKTEPHGGSDFGGYPSLRQRNDSF+IKESMTVHCGFVKGSKPGNQSGFDIDEADL+ELEEFHEVIVASAIFGNYDVLQQP NI
Subjt:  SISPVLRKLTYIHDGNPIKTEPHGGSDFGGYPSLRQRNDSFNIKESMTVHCGFVKGSKPGNQSGFDIDEADLLELEEFHEVIVASAIFGNYDVLQQPTNI

Query:  SEESKRFVPFYMFIDEETEAYMKNSSILDSRKRIGLWRIIVVRNVPYADSRRNGKIPKLLLHRLFPNIQYSIWIDGKLQLVVDPFQILERFLWRQNATFA
        SEESK+FVPFYMF+DEETEAYMKNSS+LDSRKRIGLWRIIVV NVPYADSRRNGKIPKLLLHRLFPNIQYSIWIDGKLQLVVDPFQILERFLWRQNATFA
Subjt:  SEESKRFVPFYMFIDEETEAYMKNSSILDSRKRIGLWRIIVVRNVPYADSRRNGKIPKLLLHRLFPNIQYSIWIDGKLQLVVDPFQILERFLWRQNATFA

Query:  ISRHYKRFDVFEEADANKAAGKYDNSSIDEQIDFYVSEGLTPYSLAKLPITSDVPEGCVLIREHVPITNLFTCLWFNEVDRFTSRDQLSFSMVRDKIMSK
        ISRHYKRFDVFEEADANKAAGKYDNSSIDEQI FYVSEGLTPYSLAKLPITSDVPEGCVLIREH+PITNLFTCLWFNEVDRFTSRDQLSFSMVRDKIMSK
Subjt:  ISRHYKRFDVFEEADANKAAGKYDNSSIDEQIDFYVSEGLTPYSLAKLPITSDVPEGCVLIREHVPITNLFTCLWFNEVDRFTSRDQLSFSMVRDKIMSK

Query:  VNWSINMFLDCERRNFVIQTYHRELLEHMPPPAVAVIRRPPPVPDGRTVPKSPVHIVLK-PPVKRNPPKRGKGERRSTSKRHRKVIAGHREDNLF
        VNWS+NMFLDCERRNFVIQTYHRELLEHMPPPA A++ RP  VPDG TV K  VHIV K PPVKRN  KRG+ ++RSTSKRHRKVIAGHREDN F
Subjt:  VNWSINMFLDCERRNFVIQTYHRELLEHMPPPAVAVIRRPPPVPDGRTVPKSPVHIVLK-PPVKRNPPKRGKGERRSTSKRHRKVIAGHREDNLF

A0A6J1CFM4 uncharacterized protein LOC1110103801.4e-30788.91Show/hide
Query:  MTGGSFGLRTGSYASLHQQLQNGALQNSATSLLVHKPSKTSLSSS--RERLIPLFCRYFGRRRVAMLLLFILALLVFVFGSFAVNRDSNDLKAPYEDTTL
        MTGGS GLRTGSY SL Q LQNGALQ+SATSLLVHKPSK+ LSSS  RER+IPLFCRYFGRRRVAMLLLFILALLVFVFGSFAV+RDS DLKAPYE+TT 
Subjt:  MTGGSFGLRTGSYASLHQQLQNGALQNSATSLLVHKPSKTSLSSS--RERLIPLFCRYFGRRRVAMLLLFILALLVFVFGSFAVNRDSNDLKAPYEDTTL

Query:  RDLINPAVTRTGDFKDNITSTHGNLFGRNEISSLIPPLPHPNRSLISDFFPRHSLPSPHPCGNFALPPPAPANGKRTGARPCPVCYIPVKQAIASMPIAP
         DLINP V+RT +FKDN+TSTHGNL G NE SSL+PPLPH NRSLI+  FPR SLPSPHPC NFA+PPPAPANGKRTGARPCPVCYIPV+QAI+SMP AP
Subjt:  RDLINPAVTRTGDFKDNITSTHGNLFGRNEISSLIPPLPHPNRSLISDFFPRHSLPSPHPCGNFALPPPAPANGKRTGARPCPVCYIPVKQAIASMPIAP

Query:  SISPVLRKLTYIHDGNPIKTEPHGGSDFGGYPSLRQRNDSFNIKESMTVHCGFVKGSKPGNQSGFDIDEADLLELEEFHEVIVASAIFGNYDVLQQPTNI
        SISPVLR LTYI D NP+KTEPHGGSDFGGYPSLRQRNDSFNIKESMTVHCGFVKGSKPGNQSGFDIDEADLLELEEFHEVIVASAIFGNYDVLQQP NI
Subjt:  SISPVLRKLTYIHDGNPIKTEPHGGSDFGGYPSLRQRNDSFNIKESMTVHCGFVKGSKPGNQSGFDIDEADLLELEEFHEVIVASAIFGNYDVLQQPTNI

Query:  SEESKRFVPFYMFIDEETEAYMKNSSILDSRKRIGLWRIIVVRNVPYADSRRNGKIPKLLLHRLFPNIQYSIWIDGKLQLVVDPFQILERFLWRQNATFA
        SEESK+ VPFYMFIDEETEAYMKNSS+L+ + R+GLWRIIVVRNVPYADSRRNGKIPKLLLHRLFPNIQYSIWIDGKLQLVVDP+QILERFLWRQNATFA
Subjt:  SEESKRFVPFYMFIDEETEAYMKNSSILDSRKRIGLWRIIVVRNVPYADSRRNGKIPKLLLHRLFPNIQYSIWIDGKLQLVVDPFQILERFLWRQNATFA

Query:  ISRHYKRFDVFEEADANKAAGKYDNSSIDEQIDFYVSEGLTPYSLAKLPITSDVPEGCVLIREHVPITNLFTCLWFNEVDRFTSRDQLSFSMVRDKIMSK
        ISRHYKRFDVFEEADANKAAGKYDNSSIDEQI+FYVSEGLTPYSLAKLPITSDVPEGCVLIREH+PITNLFTCLWFNEVDRFTSRDQLSFSMVRDKIMS+
Subjt:  ISRHYKRFDVFEEADANKAAGKYDNSSIDEQIDFYVSEGLTPYSLAKLPITSDVPEGCVLIREHVPITNLFTCLWFNEVDRFTSRDQLSFSMVRDKIMSK

Query:  VNWSINMFLDCERRNFVIQTYHRELLEHMPPPAVAVIRRPPP-VPDGRTVPKSPVHIVLKPPVKRNPPKRGKGERRSTSKRHRKVIAGHREDNLF
        VNW I+MFLDCERRNFVIQTYHRELLEHMPPP+VAVIR PPP  P GRTVPK PVHIVLK PVKRNP KRG+G++RSTSKRHRKV  GHREDNLF
Subjt:  VNWSINMFLDCERRNFVIQTYHRELLEHMPPPAVAVIRRPPP-VPDGRTVPKSPVHIVLKPPVKRNPPKRGKGERRSTSKRHRKVIAGHREDNLF

A0A6J1FE28 uncharacterized protein LOC1114431382.5e-29686.6Show/hide
Query:  MTGGSFGLRTGSYASLHQQLQNGALQNSATSLLVHKPSKTSLSSS--RERLIPLFCRYFGRRRVAMLLLFILALLVFVFGSFAVNRDSNDLKAPYEDTTL
        MTGGS GLRTGSYASL QQLQNGALQN A S   HKP K +LSSS  RER++PL CRYFGRRRVAMLLLF+LALLVFVFGSFAV+RD++DLK PYE+TTL
Subjt:  MTGGSFGLRTGSYASLHQQLQNGALQNSATSLLVHKPSKTSLSSS--RERLIPLFCRYFGRRRVAMLLLFILALLVFVFGSFAVNRDSNDLKAPYEDTTL

Query:  RDLINPAVTRTGDFKDNITSTHGNLFGRNEISSLIPPLPHPNRSLISDFFPRHSLPSPHPCGNFALPPPAPANGKRTGARPCPVCYIPVKQAIASMPIAP
        + +INPAV RT DFKDNITST GNL G  E SSL+PP PH NRSLISDFFPRHS+PSPHPC NFALPPPAP NGKRTGARPC VCYIPV+QAIASMPIAP
Subjt:  RDLINPAVTRTGDFKDNITSTHGNLFGRNEISSLIPPLPHPNRSLISDFFPRHSLPSPHPCGNFALPPPAPANGKRTGARPCPVCYIPVKQAIASMPIAP

Query:  SISPVLRKLTYIHDGNPIKTEPHGGSDFGGYPSLRQRNDSFNIKESMTVHCGFVKGSKPGNQSGFDIDEADLLELEEFHEVIVASAIFGNYDVLQQPTNI
        SISP+LR LTYIH+ NP KTEPHGGSDFGGYPSL QRNDSFNIKESMTVHCGFVKGSKPG QSGFDIDEADLLELEEFHEVIVASAIFGNYD+LQQP NI
Subjt:  SISPVLRKLTYIHDGNPIKTEPHGGSDFGGYPSLRQRNDSFNIKESMTVHCGFVKGSKPGNQSGFDIDEADLLELEEFHEVIVASAIFGNYDVLQQPTNI

Query:  SEESKRFVPFYMFIDEETEAYMKNSSILDSRKRIGLWRIIVVRNVPYADSRRNGKIPKLLLHRLFPNIQYSIWIDGKLQLVVDPFQILERFLWRQNATFA
        SEESKRFVPF MFIDEETEA++KNS +LDSRKRIGLWRIIVVRN+PYADSRRNGKIPKLLLHRLFPNIQYSIWIDGKLQL VDPFQILERFLWRQNATFA
Subjt:  SEESKRFVPFYMFIDEETEAYMKNSSILDSRKRIGLWRIIVVRNVPYADSRRNGKIPKLLLHRLFPNIQYSIWIDGKLQLVVDPFQILERFLWRQNATFA

Query:  ISRHYKRFDVFEEADANKAAGKYDNSSIDEQIDFYVSEGLTPYSLAKLPITSDVPEGCVLIREHVPITNLFTCLWFNEVDRFTSRDQLSFSMVRDKIMSK
        ISRHYKRFDVFEEA ANKAAGKYDNSSIDEQIDFYVSEGLTPYSLAKLPITSDVPEGCV+IREH+PITNLFTCLWFNEVDRFTSRDQLSFSMVRDKIMSK
Subjt:  ISRHYKRFDVFEEADANKAAGKYDNSSIDEQIDFYVSEGLTPYSLAKLPITSDVPEGCVLIREHVPITNLFTCLWFNEVDRFTSRDQLSFSMVRDKIMSK

Query:  VNWSINMFLDCERRNFVIQTYHRELLEHMPPPAV-AVIRRPPPVPDGRTVPK-SPVHIVLK-PPVKRNPPKRGKGERRSTSKRHRKVIAGHREDNLF
        VNWS+NMFLDCERRNFVIQTYHRE+LEHMPPP V  VIRRPPPV  GR  P  S  HIVLK PPVKRNP KRG+ ++RS+SKRHRKVIA HRE+NLF
Subjt:  VNWSINMFLDCERRNFVIQTYHRELLEHMPPPAV-AVIRRPPPVPDGRTVPK-SPVHIVLK-PPVKRNPPKRGKGERRSTSKRHRKVIAGHREDNLF

SwissProt top hitse value%identityAlignment
Q9FZ97 Probable hexosyltransferase MUCI704.5e-17054.31Show/hide
Query:  MTGGSFGLRTGSYASLHQQLQNG-ALQNSATSLLVHKPSKTSLSSSRERLIPLFCRYFGRRRVAMLLLFILALLVFVFGSF-AVNRDSNDLKAPYEDTTL
        MTG   G+R+ SY SL +   NG  L    T+    KPSK  +   RE ++   C++ GR++V MLLLF+++ +VF+   +     DS + + P    +L
Subjt:  MTGGSFGLRTGSYASLHQQLQNG-ALQNSATSLLVHKPSKTSLSSSRERLIPLFCRYFGRRRVAMLLLFILALLVFVFGSF-AVNRDSNDLKAPYEDTTL

Query:  RDLINPAVTRTGDFKDN--ITSTHGNL-FGRNEISSLIPPLPHPNRSLISDFFPRHSLPSPHPCGNFALPPPAPANGKRTGARPCPVCYIPVKQAIASMP
            +  V  +   + N  +    GN+ F   E+    PP  H         F  +SLP  HPC +F LPPP PA+ KRTG RPCPVCY+PV++A+A MP
Subjt:  RDLINPAVTRTGDFKDN--ITSTHGNL-FGRNEISSLIPPLPHPNRSLISDFFPRHSLPSPHPCGNFALPPPAPANGKRTGARPCPVCYIPVKQAIASMP

Query:  IAPSISPVLRKLTYIHDGNPIKTEPHGGSDFGGYPSLRQRNDSFNIKESMTVHCGFVKGSKPGNQSGFDIDEADLLELEEFHEVIVASAIFGNYDVLQQP
         APS SPVL+ LTYI++    +    GGSDFGGYP+L+ RNDSF+IKE+M+VHCGFVKG +PG  +GFDIDEADLLE+++   ++VASA+F  +D ++ P
Subjt:  IAPSISPVLRKLTYIHDGNPIKTEPHGGSDFGGYPSLRQRNDSFNIKESMTVHCGFVKGSKPGNQSGFDIDEADLLELEEFHEVIVASAIFGNYDVLQQP

Query:  TNISEESKRFVPFYMFIDEETEAYMKNSSILDSRKRIGLWRIIVVRNVPYADSRRNGKIPKLLLHRLFPNIQYSIWIDGKLQLVVDPFQILERFLWRQNA
         NIS+ ++  V FYMF+DEETE+ +K    LD  K++G+WR++VV N+PY+D RRNGK+PKLL+HR+FPN +YS+WIDGKL+LVVDP+QILERFLWR+NA
Subjt:  TNISEESKRFVPFYMFIDEETEAYMKNSSILDSRKRIGLWRIIVVRNVPYADSRRNGKIPKLLLHRLFPNIQYSIWIDGKLQLVVDPFQILERFLWRQNA

Query:  TFAISRHYKRFDVFEEADANKAAGKYDNSSIDEQIDFYVSEGLTPYSLAKLPITSDVPEGCVLIREHVPITNLFTCLWFNEVDRFTSRDQLSFSMVRDKI
        TFAISRHYKRFDV  EA+ANKAAGKYDN+SID Q+DFY +EGLTPYS+AKLPITSDVPEGCV++REHVPI+NLFTCLWFNEVDRFTSRDQ+SFS VRDKI
Subjt:  TFAISRHYKRFDVFEEADANKAAGKYDNSSIDEQIDFYVSEGLTPYSLAKLPITSDVPEGCVLIREHVPITNLFTCLWFNEVDRFTSRDQLSFSMVRDKI

Query:  MSKVNWSINMFLDCERRNFVIQTYHRELLEHMPPPAVAVIRRPPPVPDGRTVPKSPVHIVLKPPVKRN-----PPKRGKGERRSTSKRHRK
         +K NW+++MFLDCERRNFV+Q YHR   E        V   PPP P     P  PV I    P K +     PP+R   +RRS  + HRK
Subjt:  MSKVNWSINMFLDCERRNFVIQTYHRELLEHMPPPAVAVIRRPPPVPDGRTVPKSPVHIVLKPPVKRN-----PPKRGKGERRSTSKRHRK

Arabidopsis top hitse value%identityAlignment
AT1G28240.1 Protein of unknown function (DUF616)3.2e-17154.31Show/hide
Query:  MTGGSFGLRTGSYASLHQQLQNG-ALQNSATSLLVHKPSKTSLSSSRERLIPLFCRYFGRRRVAMLLLFILALLVFVFGSF-AVNRDSNDLKAPYEDTTL
        MTG   G+R+ SY SL +   NG  L    T+    KPSK  +   RE ++   C++ GR++V MLLLF+++ +VF+   +     DS + + P    +L
Subjt:  MTGGSFGLRTGSYASLHQQLQNG-ALQNSATSLLVHKPSKTSLSSSRERLIPLFCRYFGRRRVAMLLLFILALLVFVFGSF-AVNRDSNDLKAPYEDTTL

Query:  RDLINPAVTRTGDFKDN--ITSTHGNL-FGRNEISSLIPPLPHPNRSLISDFFPRHSLPSPHPCGNFALPPPAPANGKRTGARPCPVCYIPVKQAIASMP
            +  V  +   + N  +    GN+ F   E+    PP  H         F  +SLP  HPC +F LPPP PA+ KRTG RPCPVCY+PV++A+A MP
Subjt:  RDLINPAVTRTGDFKDN--ITSTHGNL-FGRNEISSLIPPLPHPNRSLISDFFPRHSLPSPHPCGNFALPPPAPANGKRTGARPCPVCYIPVKQAIASMP

Query:  IAPSISPVLRKLTYIHDGNPIKTEPHGGSDFGGYPSLRQRNDSFNIKESMTVHCGFVKGSKPGNQSGFDIDEADLLELEEFHEVIVASAIFGNYDVLQQP
         APS SPVL+ LTYI++    +    GGSDFGGYP+L+ RNDSF+IKE+M+VHCGFVKG +PG  +GFDIDEADLLE+++   ++VASA+F  +D ++ P
Subjt:  IAPSISPVLRKLTYIHDGNPIKTEPHGGSDFGGYPSLRQRNDSFNIKESMTVHCGFVKGSKPGNQSGFDIDEADLLELEEFHEVIVASAIFGNYDVLQQP

Query:  TNISEESKRFVPFYMFIDEETEAYMKNSSILDSRKRIGLWRIIVVRNVPYADSRRNGKIPKLLLHRLFPNIQYSIWIDGKLQLVVDPFQILERFLWRQNA
         NIS+ ++  V FYMF+DEETE+ +K    LD  K++G+WR++VV N+PY+D RRNGK+PKLL+HR+FPN +YS+WIDGKL+LVVDP+QILERFLWR+NA
Subjt:  TNISEESKRFVPFYMFIDEETEAYMKNSSILDSRKRIGLWRIIVVRNVPYADSRRNGKIPKLLLHRLFPNIQYSIWIDGKLQLVVDPFQILERFLWRQNA

Query:  TFAISRHYKRFDVFEEADANKAAGKYDNSSIDEQIDFYVSEGLTPYSLAKLPITSDVPEGCVLIREHVPITNLFTCLWFNEVDRFTSRDQLSFSMVRDKI
        TFAISRHYKRFDV  EA+ANKAAGKYDN+SID Q+DFY +EGLTPYS+AKLPITSDVPEGCV++REHVPI+NLFTCLWFNEVDRFTSRDQ+SFS VRDKI
Subjt:  TFAISRHYKRFDVFEEADANKAAGKYDNSSIDEQIDFYVSEGLTPYSLAKLPITSDVPEGCVLIREHVPITNLFTCLWFNEVDRFTSRDQLSFSMVRDKI

Query:  MSKVNWSINMFLDCERRNFVIQTYHRELLEHMPPPAVAVIRRPPPVPDGRTVPKSPVHIVLKPPVKRN-----PPKRGKGERRSTSKRHRK
         +K NW+++MFLDCERRNFV+Q YHR   E        V   PPP P     P  PV I    P K +     PP+R   +RRS  + HRK
Subjt:  MSKVNWSINMFLDCERRNFVIQTYHRELLEHMPPPAVAVIRRPPPVPDGRTVPKSPVHIVLKPPVKRN-----PPKRGKGERRSTSKRHRK

AT1G34550.1 Protein of unknown function (DUF616)2.0e-7244.92Show/hide
Query:  LTYIHDGNPIKTEPHGGSDFGGYPSLRQRNDSFNIKESMTVHCGFVKGSKPGNQSGFDIDEADLLELEEFHEVIVASAIFGNYDVLQQPTN--ISEESKR
        L YI   +    E      F G+ SL++R DSF + +   +HCGFVKG K  + +GFD+ E D   +   H + V+S IFGN D L+ P N  IS  S++
Subjt:  LTYIHDGNPIKTEPHGGSDFGGYPSLRQRNDSFNIKESMTVHCGFVKGSKPGNQSGFDIDEADLLELEEFHEVIVASAIFGNYDVLQQPTN--ISEESKR

Query:  FVPFYMFIDEET-EAYMKNSSILDSRKRIGLWRIIVVRNVPYADSRRNGKIPKLLLHRLFPNIQYSIWIDGKLQLVVDPFQILERFLWRQNATFAISRHY
         V F +F+DE T +         D    IGLW+++VV+N+PYAD RR GKIPK+L HRLFP+ +YSIW+D KL+L +DP  ILE FLWR+   +AIS HY
Subjt:  FVPFYMFIDEET-EAYMKNSSILDSRKRIGLWRIIVVRNVPYADSRRNGKIPKLLLHRLFPNIQYSIWIDGKLQLVVDPFQILERFLWRQNATFAISRHY

Query:  KRFDVFEEADANKAAGKYDNSSIDEQIDFYVSEGLTPYSLA---KLPITSDVPEGCVLIREHVPITNLFTCLWFNEVDRFTSRDQLSFSMVRDKIMSKVN
         R  ++EE   NK   KY+++ I++Q  FY ++GLT ++ +   KL + S+VPEG  ++R H P++NLF+CLWFNEV+RFT RDQLSF+    K + ++N
Subjt:  KRFDVFEEADANKAAGKYDNSSIDEQIDFYVSEGLTPYSLA---KLPITSDVPEGCVLIREHVPITNLFTCLWFNEVDRFTSRDQLSFSMVRDKIMSKVN

Query:  ----WSINMFLDCERRNFVIQTYHR
            ++++MF DCERR       HR
Subjt:  ----WSINMFLDCERRNFVIQTYHR

AT1G53040.1 Protein of unknown function (DUF616)1.5e-17357.81Show/hide
Query:  TSLSSSRERLIP-LFCRYFGRRRVAMLLLFILALLVFVFGSFAVNRDSN--DLKAPYEDTTLRDLINPAVTRTGDF--KDNITSTHGNLFGRNEISSLIP
        TS S  +ER +  L C Y GRRRVAMLLL  LA +VFV GS+ +N++SN  ++    E         P       F  K++      + F  N I     
Subjt:  TSLSSSRERLIP-LFCRYFGRRRVAMLLLFILALLVFVFGSFAVNRDSN--DLKAPYEDTTLRDLINPAVTRTGDF--KDNITSTHGNLFGRNEISSLIP

Query:  PLPHPNRSLISDFFPRHSLPSPHPCGNFALPPPAPANGKRTGARPCPVCYIPVKQAIASMPIAPSISPVLRKLTYIHDGNPIK-TEPHGGSDFGGYPSLR
         + HP       F P       HPC +F+ PPP P   +R G RPCPVCY+P ++A+A MP  P  SP+L+ LTYI + +P+K  E  GGS+FGGYPSL 
Subjt:  PLPHPNRSLISDFFPRHSLPSPHPCGNFALPPPAPANGKRTGARPCPVCYIPVKQAIASMPIAPSISPVLRKLTYIHDGNPIK-TEPHGGSDFGGYPSLR

Query:  QRNDSFNIKESMTVHCGFVKGSKPGNQSGFDIDEADLLELEEFHEVIVASAIFGNYDVLQQPTNISEESKRFVPFYMFIDEETEAYMKN-SSILDSRKRI
         R +SF+IKESMTVHCGF+KG+KPG+Q+GFDIDE  L EL++ H+VIVASAIFG YD++Q+P NISE +++ +PFYMF+DEET  Y+KN SS  D  KR+
Subjt:  QRNDSFNIKESMTVHCGFVKGSKPGNQSGFDIDEADLLELEEFHEVIVASAIFGNYDVLQQPTNISEESKRFVPFYMFIDEETEAYMKN-SSILDSRKRI

Query:  GLWRIIVVRNVPYADSRRNGKIPKLLLHRLFPNIQYSIWIDGKLQLVVDPFQILERFLWRQNATFAISRHYKRFDVFEEADANKAAGKYDNSSIDEQIDF
        GLWRIIVV NVPY D+RRNGK+PKLLLHRLFPN++YSIW+D KLQLVVDP+QILERFLWR N++FAISRHY+RFDVF EA+ANKAA KYDN+SID Q++F
Subjt:  GLWRIIVVRNVPYADSRRNGKIPKLLLHRLFPNIQYSIWIDGKLQLVVDPFQILERFLWRQNATFAISRHYKRFDVFEEADANKAAGKYDNSSIDEQIDF

Query:  YVSEGLTPYSLAKLPITSDVPEGCVLIREHVPITNLFTCLWFNEVDRFTSRDQLSFSMVRDKIMSKVNWSINMFLDCERRNFVIQTYHRELLEHMPPPAV
        Y  EGLTPY+ AKLPITSDVPEGC +IREH+PITNLFTC+WFNEVDRFTSRDQLSF++ RDKI  KV+WSINMFLDCERRNFV Q YHR++L  M PP  
Subjt:  YVSEGLTPYSLAKLPITSDVPEGCVLIREHVPITNLFTCLWFNEVDRFTSRDQLSFSMVRDKIMSKVNWSINMFLDCERRNFVIQTYHRELLEHMPPPAV

Query:  A--VIRRPPPVPDGRTVPKSPVHIVLKPPVKRNPPKRGKGERRSTSKRHRKVIAGHR
        +  V+  P  +P GR             P K+ P +RGK       +RHRKV AG R
Subjt:  A--VIRRPPPVPDGRTVPKSPVHIVLKPPVKRNPPKRGKGERRSTSKRHRKVIAGHR

AT1G53040.2 Protein of unknown function (DUF616)1.5e-17357.81Show/hide
Query:  TSLSSSRERLIP-LFCRYFGRRRVAMLLLFILALLVFVFGSFAVNRDSN--DLKAPYEDTTLRDLINPAVTRTGDF--KDNITSTHGNLFGRNEISSLIP
        TS S  +ER +  L C Y GRRRVAMLLL  LA +VFV GS+ +N++SN  ++    E         P       F  K++      + F  N I     
Subjt:  TSLSSSRERLIP-LFCRYFGRRRVAMLLLFILALLVFVFGSFAVNRDSN--DLKAPYEDTTLRDLINPAVTRTGDF--KDNITSTHGNLFGRNEISSLIP

Query:  PLPHPNRSLISDFFPRHSLPSPHPCGNFALPPPAPANGKRTGARPCPVCYIPVKQAIASMPIAPSISPVLRKLTYIHDGNPIK-TEPHGGSDFGGYPSLR
         + HP       F P       HPC +F+ PPP P   +R G RPCPVCY+P ++A+A MP  P  SP+L+ LTYI + +P+K  E  GGS+FGGYPSL 
Subjt:  PLPHPNRSLISDFFPRHSLPSPHPCGNFALPPPAPANGKRTGARPCPVCYIPVKQAIASMPIAPSISPVLRKLTYIHDGNPIK-TEPHGGSDFGGYPSLR

Query:  QRNDSFNIKESMTVHCGFVKGSKPGNQSGFDIDEADLLELEEFHEVIVASAIFGNYDVLQQPTNISEESKRFVPFYMFIDEETEAYMKN-SSILDSRKRI
         R +SF+IKESMTVHCGF+KG+KPG+Q+GFDIDE  L EL++ H+VIVASAIFG YD++Q+P NISE +++ +PFYMF+DEET  Y+KN SS  D  KR+
Subjt:  QRNDSFNIKESMTVHCGFVKGSKPGNQSGFDIDEADLLELEEFHEVIVASAIFGNYDVLQQPTNISEESKRFVPFYMFIDEETEAYMKN-SSILDSRKRI

Query:  GLWRIIVVRNVPYADSRRNGKIPKLLLHRLFPNIQYSIWIDGKLQLVVDPFQILERFLWRQNATFAISRHYKRFDVFEEADANKAAGKYDNSSIDEQIDF
        GLWRIIVV NVPY D+RRNGK+PKLLLHRLFPN++YSIW+D KLQLVVDP+QILERFLWR N++FAISRHY+RFDVF EA+ANKAA KYDN+SID Q++F
Subjt:  GLWRIIVVRNVPYADSRRNGKIPKLLLHRLFPNIQYSIWIDGKLQLVVDPFQILERFLWRQNATFAISRHYKRFDVFEEADANKAAGKYDNSSIDEQIDF

Query:  YVSEGLTPYSLAKLPITSDVPEGCVLIREHVPITNLFTCLWFNEVDRFTSRDQLSFSMVRDKIMSKVNWSINMFLDCERRNFVIQTYHRELLEHMPPPAV
        Y  EGLTPY+ AKLPITSDVPEGC +IREH+PITNLFTC+WFNEVDRFTSRDQLSF++ RDKI  KV+WSINMFLDCERRNFV Q YHR++L  M PP  
Subjt:  YVSEGLTPYSLAKLPITSDVPEGCVLIREHVPITNLFTCLWFNEVDRFTSRDQLSFSMVRDKIMSKVNWSINMFLDCERRNFVIQTYHRELLEHMPPPAV

Query:  A--VIRRPPPVPDGRTVPKSPVHIVLKPPVKRNPPKRGKGERRSTSKRHRKVIAGHR
        +  V+  P  +P GR             P K+ P +RGK       +RHRKV AG R
Subjt:  A--VIRRPPPVPDGRTVPKSPVHIVLKPPVKRNPPKRGKGERRSTSKRHRKVIAGHR

AT2G02910.1 Protein of unknown function (DUF616)5.3e-7342.36Show/hide
Query:  PANGKRTGARPCPVCYIPVKQAIASMPIAPSISPVLRKLTYIH----DGNPIKTEPHGGSDFGGYPSLRQRNDSFNIKESMTVHCGFVKGSKPGNQSGFD
        P + KRT   PC V            P+A S   +L    Y++        ++TE +    FGG+ +L +R  S++   + T+HCGFVKG      +GFD
Subjt:  PANGKRTGARPCPVCYIPVKQAIASMPIAPSISPVLRKLTYIH----DGNPIKTEPHGGSDFGGYPSLRQRNDSFNIKESMTVHCGFVKGSKPGNQSGFD

Query:  IDEADLLELEEFHEVIVASAIFGNYDVLQQPT--NISEESKRFVPFYMFIDEETEAYMKNSS-ILDSRKRIGLWRIIVVRNVPYADSRRNGKIPKLLLHR
        + E D   ++    V V+S IFG+ D L++P    ISE SKR V F MF+DE+T + + +   + D +  +GLW+ +VV N+PY D R+ GK+PK L HR
Subjt:  IDEADLLELEEFHEVIVASAIFGNYDVLQQPT--NISEESKRFVPFYMFIDEETEAYMKNSS-ILDSRKRIGLWRIIVVRNVPYADSRRNGKIPKLLLHR

Query:  LFPNIQYSIWIDGKLQLVVDPFQILERFLWRQNATFAISRHYKRFDVFEEADANKAAGKYDNSSIDEQIDFYVSEGLTPY--SLAKLPITSDVPEGCVLI
        LFP+ +YSIW+D K++L  DP  I++ FLWR  + FAIS HY R  V++E   NK   KY++S+IDEQ  FY S+GL  +  S    P+ S VPEG  ++
Subjt:  LFPNIQYSIWIDGKLQLVVDPFQILERFLWRQNATFAISRHYKRFDVFEEADANKAAGKYDNSSIDEQIDFYVSEGLTPY--SLAKLPITSDVPEGCVLI

Query:  REHVPITNLFTCLWFNEVDRFTSRDQLSFS---MVRDKIMSKVNWSINMFLDCERRNFVIQTYHRELLEHMPP
        R H P++NLFTCLWFNEVDRFTSRDQLSF+   +   ++ S     +NMF DCERR      +HR  ++  PP
Subjt:  REHVPITNLFTCLWFNEVDRFTSRDQLSFS---MVRDKIMSKVNWSINMFLDCERRNFVIQTYHRELLEHMPP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACTGGAGGGTCATTTGGTCTTCGTACGGGAAGCTATGCATCACTGCATCAACAGCTACAAAATGGTGCTTTGCAAAATTCAGCAACGTCATTGCTTGTGCACAAGCC
TTCGAAAACAAGTCTTTCAAGTTCAAGAGAGAGACTTATTCCCTTGTTTTGTCGATACTTCGGTCGGAGAAGGGTTGCAATGCTCCTCTTATTTATCCTCGCTCTTTTGG
TTTTTGTATTTGGATCCTTTGCAGTCAATAGAGATAGCAATGATTTGAAGGCACCTTATGAAGACACTACACTAAGAGACTTGATAAATCCCGCTGTTACAAGAACAGGA
GACTTCAAGGATAATATAACTTCTACACATGGAAATTTGTTTGGAAGAAATGAAATTTCATCTCTGATTCCTCCTTTGCCTCATCCAAATCGTTCTTTAATCTCTGATTT
TTTTCCTCGGCATTCCCTTCCATCGCCACATCCATGCGGGAACTTTGCGCTTCCTCCTCCAGCTCCAGCTAATGGAAAGCGGACTGGAGCTCGTCCTTGTCCAGTATGTT
ATATCCCTGTGAAGCAGGCTATTGCTAGTATGCCAATTGCACCATCAATATCTCCTGTGCTACGCAAGTTAACGTATATTCATGACGGAAATCCAATAAAAACAGAACCA
CATGGGGGGTCTGATTTTGGTGGATATCCTTCTTTAAGACAGAGGAATGATTCTTTTAATATAAAAGAGTCGATGACAGTTCACTGTGGATTTGTTAAAGGAAGTAAACC
TGGTAATCAGAGTGGTTTCGATATTGATGAAGCCGACCTTTTGGAGTTGGAGGAGTTTCACGAGGTCATTGTTGCATCAGCCATATTTGGAAACTATGATGTATTACAGC
AGCCTACAAACATTAGTGAAGAATCAAAAAGATTTGTTCCTTTCTACATGTTCATTGATGAAGAGACAGAAGCATATATGAAGAATTCTAGCATCTTAGATAGCAGAAAG
AGGATTGGCTTGTGGAGAATCATTGTAGTTCGTAATGTTCCTTATGCTGATTCAAGACGTAATGGCAAGATTCCCAAGCTTCTTCTGCATAGGCTTTTCCCCAACATCCA
ATACTCAATATGGATAGATGGAAAACTTCAGCTTGTTGTGGATCCATTTCAAATTCTTGAGAGGTTCCTGTGGCGTCAAAATGCAACTTTTGCAATTTCAAGACACTATA
AACGATTTGATGTCTTTGAAGAGGCTGATGCTAATAAGGCTGCTGGAAAATATGACAACTCCTCCATTGATGAGCAGATTGATTTTTATGTTAGCGAGGGTTTAACTCCA
TATTCTTTGGCTAAGCTTCCTATAACTAGTGATGTTCCTGAAGGTTGTGTTCTAATTCGTGAACATGTTCCTATAACAAATTTATTCACTTGCCTTTGGTTCAATGAAGT
TGATCGTTTTACTTCAAGAGATCAGTTAAGCTTCTCGATGGTGAGAGATAAGATAATGTCTAAAGTCAATTGGAGCATTAACATGTTCTTGGACTGTGAAAGGCGCAATT
TTGTAATACAGACATATCATAGAGAATTACTGGAACACATGCCACCTCCTGCCGTAGCAGTAATCCGTCGTCCTCCACCAGTGCCTGATGGTCGTACTGTTCCAAAGTCT
CCAGTTCATATTGTTCTAAAACCTCCAGTAAAGAGAAATCCTCCCAAGAGGGGAAAAGGCGAGAGGAGGTCTACTTCAAAGCGGCATCGTAAAGTTATTGCTGGACATAG
AGAGGATAATTTGTTCTAA
mRNA sequenceShow/hide mRNA sequence
ATTAAAAAGAAGCAATTTGCTCAACATAAATTCCATTATTCTACTCTCTGGCTGGGTTCGTAAAACCATGGCATTTTGTGGCCGGAATACCCGTTTTGGAAATGTCCGTC
GTTCGTTGTTCATAGCCTCCGGCAAAGAAAACACAATCGTCGGCTGGTGGAACTTCTGATTGGATTGGCTTTCGACTCCACTTATCTTCCAGGCTTCAACGAGTTACCGC
TGCAATCGGTTTCGGGGAAGGAACACTGGGTTTAAGAGTTGAGTTTCTTTGTTGGGGTGGTTTTGTGATGAGTCCGATGTTATGAGGAGACTTGTATTAGCAGATCCAGT
GTATGATTGTTATTTGTGATTGTAGGTGTAAGTTTTGGCTTTATGGGAATGGTTTCTTCACTTCTTGCATTGTATTTGACCACGGCCGTATAGAATTTAGCATTGAAAAA
GAATAGTAGTTTTGGTGGATTTGAATTTGGGTTTTTTTCAGAAGCGATACTGGGTTCGTGATGATTCTATCATAGAACTTGATCAATTTTTAGATTGGTGTTGGTTGGTG
GTGATAACAGCGGGAGTTGGTGCATCACTTCAGACTTCAGTGTTGGGAAGTAATTCAGTTTCCTGGGAAGTTACGGGAGATGAAAGTCGAAGTTGTGGGGTTGGAGGTAT
AATGACTGGAGGGTCATTTGGTCTTCGTACGGGAAGCTATGCATCACTGCATCAACAGCTACAAAATGGTGCTTTGCAAAATTCAGCAACGTCATTGCTTGTGCACAAGC
CTTCGAAAACAAGTCTTTCAAGTTCAAGAGAGAGACTTATTCCCTTGTTTTGTCGATACTTCGGTCGGAGAAGGGTTGCAATGCTCCTCTTATTTATCCTCGCTCTTTTG
GTTTTTGTATTTGGATCCTTTGCAGTCAATAGAGATAGCAATGATTTGAAGGCACCTTATGAAGACACTACACTAAGAGACTTGATAAATCCCGCTGTTACAAGAACAGG
AGACTTCAAGGATAATATAACTTCTACACATGGAAATTTGTTTGGAAGAAATGAAATTTCATCTCTGATTCCTCCTTTGCCTCATCCAAATCGTTCTTTAATCTCTGATT
TTTTTCCTCGGCATTCCCTTCCATCGCCACATCCATGCGGGAACTTTGCGCTTCCTCCTCCAGCTCCAGCTAATGGAAAGCGGACTGGAGCTCGTCCTTGTCCAGTATGT
TATATCCCTGTGAAGCAGGCTATTGCTAGTATGCCAATTGCACCATCAATATCTCCTGTGCTACGCAAGTTAACGTATATTCATGACGGAAATCCAATAAAAACAGAACC
ACATGGGGGGTCTGATTTTGGTGGATATCCTTCTTTAAGACAGAGGAATGATTCTTTTAATATAAAAGAGTCGATGACAGTTCACTGTGGATTTGTTAAAGGAAGTAAAC
CTGGTAATCAGAGTGGTTTCGATATTGATGAAGCCGACCTTTTGGAGTTGGAGGAGTTTCACGAGGTCATTGTTGCATCAGCCATATTTGGAAACTATGATGTATTACAG
CAGCCTACAAACATTAGTGAAGAATCAAAAAGATTTGTTCCTTTCTACATGTTCATTGATGAAGAGACAGAAGCATATATGAAGAATTCTAGCATCTTAGATAGCAGAAA
GAGGATTGGCTTGTGGAGAATCATTGTAGTTCGTAATGTTCCTTATGCTGATTCAAGACGTAATGGCAAGATTCCCAAGCTTCTTCTGCATAGGCTTTTCCCCAACATCC
AATACTCAATATGGATAGATGGAAAACTTCAGCTTGTTGTGGATCCATTTCAAATTCTTGAGAGGTTCCTGTGGCGTCAAAATGCAACTTTTGCAATTTCAAGACACTAT
AAACGATTTGATGTCTTTGAAGAGGCTGATGCTAATAAGGCTGCTGGAAAATATGACAACTCCTCCATTGATGAGCAGATTGATTTTTATGTTAGCGAGGGTTTAACTCC
ATATTCTTTGGCTAAGCTTCCTATAACTAGTGATGTTCCTGAAGGTTGTGTTCTAATTCGTGAACATGTTCCTATAACAAATTTATTCACTTGCCTTTGGTTCAATGAAG
TTGATCGTTTTACTTCAAGAGATCAGTTAAGCTTCTCGATGGTGAGAGATAAGATAATGTCTAAAGTCAATTGGAGCATTAACATGTTCTTGGACTGTGAAAGGCGCAAT
TTTGTAATACAGACATATCATAGAGAATTACTGGAACACATGCCACCTCCTGCCGTAGCAGTAATCCGTCGTCCTCCACCAGTGCCTGATGGTCGTACTGTTCCAAAGTC
TCCAGTTCATATTGTTCTAAAACCTCCAGTAAAGAGAAATCCTCCCAAGAGGGGAAAAGGCGAGAGGAGGTCTACTTCAAAGCGGCATCGTAAAGTTATTGCTGGACATA
GAGAGGATAATTTGTTCTAATTCGATTCATATAAAAAAAGAAAATGCCTTTTTCTTTTGACATTCTTAGTTTCAATCTCATTCCCTCAAATTTTACATCTCAATACAGGG
AGAGCTGGTTTAGATGTAGAGCATTGTTTGATGGGACAAAATGATACCAATTTTGTACTGGAAATTGTTGATATATATTTATATTTATATATATTGATTTATGTTGTAGA
Protein sequenceShow/hide protein sequence
MTGGSFGLRTGSYASLHQQLQNGALQNSATSLLVHKPSKTSLSSSRERLIPLFCRYFGRRRVAMLLLFILALLVFVFGSFAVNRDSNDLKAPYEDTTLRDLINPAVTRTG
DFKDNITSTHGNLFGRNEISSLIPPLPHPNRSLISDFFPRHSLPSPHPCGNFALPPPAPANGKRTGARPCPVCYIPVKQAIASMPIAPSISPVLRKLTYIHDGNPIKTEP
HGGSDFGGYPSLRQRNDSFNIKESMTVHCGFVKGSKPGNQSGFDIDEADLLELEEFHEVIVASAIFGNYDVLQQPTNISEESKRFVPFYMFIDEETEAYMKNSSILDSRK
RIGLWRIIVVRNVPYADSRRNGKIPKLLLHRLFPNIQYSIWIDGKLQLVVDPFQILERFLWRQNATFAISRHYKRFDVFEEADANKAAGKYDNSSIDEQIDFYVSEGLTP
YSLAKLPITSDVPEGCVLIREHVPITNLFTCLWFNEVDRFTSRDQLSFSMVRDKIMSKVNWSINMFLDCERRNFVIQTYHRELLEHMPPPAVAVIRRPPPVPDGRTVPKS
PVHIVLKPPVKRNPPKRGKGERRSTSKRHRKVIAGHREDNLF