; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cla97C02G040010 (gene) of Watermelon (97103) v2.5 genome

Gene IDCla97C02G040010
OrganismCitrullus lanatus subsp. vulgaris cv. 97103 (Watermelon (97103) v2.5)
DescriptionDUF4220 domain-containing protein
Genome locationCla97Chr02:27917241..27919229
RNA-Seq ExpressionCla97C02G040010
SyntenyCla97C02G040010
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR007658 - Protein of unknown function DUF594
IPR025315 - Domain of unknown function DUF4220


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6599136.1 hypothetical protein SDJN03_08914, partial [Cucurbita argyrosperma subsp. sororia]4.4e-26972.51Show/hide
Query:  MFSDI-TRMVLEHFNLIWNYWGIEFLVLANFIFQVILIFCGSRRRHTPGYKLSLSVWVSYILVARIATVVLGKLTTID----------------------
        MFSD+  R++  H + IWN WGIE LV ANF+FQVIL + GSRRRHTPGYKLSL+VW SY+L A+IATVVLGKLTTID                      
Subjt:  MFSDI-TRMVLEHFNLIWNYWGIEFLVLANFIFQVILIFCGSRRRHTPGYKLSLSVWVSYILVARIATVVLGKLTTID----------------------

Query:  -TGSPDTITAYSIEDNQLGVRHVFGTVIQVGIMFYILIRSWTDSKTSFLYLSMSLAGIIKYGETSWALKSALNANLDITIADLNKYVE-PHLFDRLPQ-R
          G+PDTITAYSIEDNQLGVR VF   +QV IMFYIL+RSWTDSKTSFLYL MSLAGIIKYGETSWALKSAL+ NL  TIAD  KY E   LF++LPQ  
Subjt:  -TGSPDTITAYSIEDNQLGVRHVFGTVIQVGIMFYILIRSWTDSKTSFLYLSMSLAGIIKYGETSWALKSALNANLDITIADLNKYVE-PHLFDRLPQ-R

Query:  LPEAKLILRAYYRFCCLKPHIENWLYYPPTDCDRDKLYIDDCTHEDVFRITDSELGFMYDALYTKAPVIYTHTGLILRFISLLSMIAALWGLSVLFKDAF
        LPEAKLILRAYYRFCCLKPH+ENWLYYPPTDCD  KLYIDDC +EDVFRITDSELGFMYDALYTKAPVIYT  GLILRFISLLS+IA L G SVLFKDAF
Subjt:  LPEAKLILRAYYRFCCLKPHIENWLYYPPTDCDRDKLYIDDCTHEDVFRITDSELGFMYDALYTKAPVIYTHTGLILRFISLLSMIAALWGLSVLFKDAF

Query:  VYNISIGLIYFVLIASLIIEIYQILRLPFTDWAIVQMVRHHQTFPILERFLQSLDPQSATWRRWSTKMGQFNLLDFCLQTKHQKYSKIKILRYWGMDMKL
        VYNIS+G I++VLIASLIIE+YQI+R+PFTDWAIVQM+RHH+TFPIL   L SL PQSATWRRWS  MGQFNLLDFC+QTKH+ YS+IK+LR WGMDMKL
Subjt:  VYNISIGLIYFVLIASLIIEIYQILRLPFTDWAIVQMVRHHQTFPILERFLQSLDPQSATWRRWSTKMGQFNLLDFCLQTKHQKYSKIKILRYWGMDMKL

Query:  RKQLSLKRIEVHPEVKELVVTELGEIEKIKGQEEFDQRGQWTIDRYQTKLKLNEESKLIKALK-TVSERVFDKSIFIWHITTNIFYNIKGFRDTSTTVGS
        RKQ+SL RIEV PEVKELVVTEL EIEKIKGQEEF+QRGQWTIDRY+ KLK N++SKLIKAL+ TV++R FDKSIFIWHITTNIFY+I+ F DTS  +G+
Subjt:  RKQLSLKRIEVHPEVKELVVTELGEIEKIKGQEEFDQRGQWTIDRYQTKLKLNEESKLIKALK-TVSERVFDKSIFIWHITTNIFYNIKGFRDTSTTVGS

Query:  KIKAIMSLSDYMMHLMATCPHVLSTTR-DIIFDHACVRLGKFTRTECRKKEHICNDILKSQEEGNML-----EASKAQKAVVGNWDLLKDVKELVDSLLT
        K++AIMS+S+YMM+L+ T  HVLSTT  +IIFDH+CV+LG+FTRT  R KE  CN +L  + EG +      E S A+K VVGNW LLKDVK+L +SLL 
Subjt:  KIKAIMSLSDYMMHLMATCPHVLSTTR-DIIFDHACVRLGKFTRTECRKKEHICNDILKSQEEGNML-----EASKAQKAVVGNWDLLKDVKELVDSLLT

Query:  LSNDNSWELIGSMWFEMLGYAASKCEMKYHSEHIRQGGELITHVWLLIAHNVTKFSPLPQ--GSRDEENPATS
        LSN++ W+LIGSMWFEMLGYAASKCEM+YHSEHIRQGGELITHVWLLIAHNVTK+S      G +DEE  A S
Subjt:  LSNDNSWELIGSMWFEMLGYAASKCEMKYHSEHIRQGGELITHVWLLIAHNVTKFSPLPQ--GSRDEENPATS

XP_004144790.1 uncharacterized protein LOC101214084 [Cucumis sativus]3.6e-27173.6Show/hide
Query:  MFSDITRMVL-EHFNLIWNYWGIEFLVLANFIFQVILIFCGSRRRHTPGYKLSLSVWVSYILVARIATVVLGKLTTID----------------------
        MFS++  +++ EH + IW YWGIE LVLANF+FQVIL F GSRRRHTPG +LSL VW SY+L A+IATVVLGKLTTID                      
Subjt:  MFSDITRMVL-EHFNLIWNYWGIEFLVLANFIFQVILIFCGSRRRHTPGYKLSLSVWVSYILVARIATVVLGKLTTID----------------------

Query:  -TGSPDTITAYSIEDNQLGVRHVFGTVIQVGIMFYILIRSWTDSKTSFLYLSMSLAGIIKYGETSWALKSALNANLDITIADLNKYVE-PHLFDRLPQ--
          G+PDTITAYSIEDNQLGVR VF  VIQVGIMFYIL+RSWTDSKTSFLYL MSLAGIIKYGETSWALKSALN N   TIAD  KY E   LF++LPQ  
Subjt:  -TGSPDTITAYSIEDNQLGVRHVFGTVIQVGIMFYILIRSWTDSKTSFLYLSMSLAGIIKYGETSWALKSALNANLDITIADLNKYVE-PHLFDRLPQ--

Query:  -RLPEAKLILRAYYRFCCLKPHIENWLYYPPTDCDRDKLYIDDCTHEDVFRITDSELGFMYDALYTKAPVIYTHTGLILRFISLLSMIAALWGLSVLFKD
          LPEA LILRAYYRFCCLKPH+ENWLYYPPTDCD+ KL+I +C +EDVFRITD ELGFMYDALYTKAPV+YT  GLILR ISLLS+IA L G SVLFKD
Subjt:  -RLPEAKLILRAYYRFCCLKPHIENWLYYPPTDCDRDKLYIDDCTHEDVFRITDSELGFMYDALYTKAPVIYTHTGLILRFISLLSMIAALWGLSVLFKD

Query:  AFVYNISIGLIYFVLIASLIIEIYQILRLPFTDWAIVQMVRHHQTFPILERFLQSLDPQSATWRRWSTKMGQFNLLDFCLQTKHQKYSKIKILRYWGMDM
        AFVYNISIG I+FVLIA+LIIEIYQILRLPFTDWAIVQMVRHH+ FPIL  FL+SL PQSATWRRWS  MGQFNLL+FCLQTKH+ YS+IKILRY GMDM
Subjt:  AFVYNISIGLIYFVLIASLIIEIYQILRLPFTDWAIVQMVRHHQTFPILERFLQSLDPQSATWRRWSTKMGQFNLLDFCLQTKHQKYSKIKILRYWGMDM

Query:  KLRKQLSLKRIEVHPEVKELVVTELGEIEKIKGQEEFDQRGQWTIDRYQTKLKLNEESKLIKALK-TVSERVFDKSIFIWHITTNIFYNIKGFRDTSTTV
        KLRKQLSL RI+V PEVKE VVTEL EIE IKG+EEFDQRGQWTI+RY+T L LN E+KLIKA++ TVS+R FDK IFIWHITTNIFYNI+G+RDTS  V
Subjt:  KLRKQLSLKRIEVHPEVKELVVTELGEIEKIKGQEEFDQRGQWTIDRYQTKLKLNEESKLIKALK-TVSERVFDKSIFIWHITTNIFYNIKGFRDTSTTV

Query:  GSKIKAIMSLSDYMMHLMATCPHVLS-TTRDIIFDHACVRLGKFTRTECRKKEHICNDILKSQEEGNMLEA--------SKAQKAVVGNWDLLKDVKELV
        G+K +AIMSLSDYMM+L+ T  HVLS TT DIIFDH+CV+LGKFTRT   KKE ICNDILK ++E ++L A        S+A+K VVGNW L+KDVKEL 
Subjt:  GSKIKAIMSLSDYMMHLMATCPHVLS-TTRDIIFDHACVRLGKFTRTECRKKEHICNDILKSQEEGNMLEA--------SKAQKAVVGNWDLLKDVKELV

Query:  DSLLTLSNDNSWELIGSMWFEMLGYAASKCEMKYHSEHIRQGGELITHVWLLIAHNVTKFSPLP--QGSRDEENPATS
        D LL LSN+N W+LIGSMWFEMLGYAASKCEM+YHSEHIRQGGELITHVWLLIAHNVTK+S      G +DEE PATS
Subjt:  DSLLTLSNDNSWELIGSMWFEMLGYAASKCEMKYHSEHIRQGGELITHVWLLIAHNVTKFSPLP--QGSRDEENPATS

XP_008452495.1 PREDICTED: uncharacterized protein LOC103493508 [Cucumis melo]9.6e-27273.6Show/hide
Query:  MFSDITRMVL-EHFNLIWNYWGIEFLVLANFIFQVILIFCGSRRRHTPGYKLSLSVWVSYILVARIATVVLGKLTTID----------------------
        MFS++  +++ EH + IW YWGIE LVLANF+FQVIL F GSRRRHTPG +LSL VW SY+L A+IATVVLGKLTTID                      
Subjt:  MFSDITRMVL-EHFNLIWNYWGIEFLVLANFIFQVILIFCGSRRRHTPGYKLSLSVWVSYILVARIATVVLGKLTTID----------------------

Query:  -TGSPDTITAYSIEDNQLGVRHVFGTVIQVGIMFYILIRSWTDSKTSFLYLSMSLAGIIKYGETSWALKSALNANLDITIADLNKYVE-PHLFDRLPQ--
          G+PDTITAYSIEDNQLGVR VF  VIQVGIMFYIL+RSWTDSKTSFLYL MSLAGIIKYGETSWALKSALN N   TIAD  KY E  +LF++LPQ  
Subjt:  -TGSPDTITAYSIEDNQLGVRHVFGTVIQVGIMFYILIRSWTDSKTSFLYLSMSLAGIIKYGETSWALKSALNANLDITIADLNKYVE-PHLFDRLPQ--

Query:  -RLPEAKLILRAYYRFCCLKPHIENWLYYPPTDCDRDKLYIDDCTHEDVFRITDSELGFMYDALYTKAPVIYTHTGLILRFISLLSMIAALWGLSVLFKD
          LPEA LILRAYYRFCCLKPH+ENWLYYPPTDCD+DKLYI DC +EDVFRITD ELGFMYDALYTKAPV+YT  GLILRFISLLS+IA L G SVLFKD
Subjt:  -RLPEAKLILRAYYRFCCLKPHIENWLYYPPTDCDRDKLYIDDCTHEDVFRITDSELGFMYDALYTKAPVIYTHTGLILRFISLLSMIAALWGLSVLFKD

Query:  AFVYNISIGLIYFVLIASLIIEIYQILRLPFTDWAIVQMVRHHQTFPILERFLQSLDPQSATWRRWSTKMGQFNLLDFCLQTKHQKYSKIKILRYWGMDM
        AFVYNISIG I+FVLIA+LIIEIYQILRLPFTDWAIVQMVRHH+ FPIL  FL+SL PQSATWRRWS  MGQFNLLDFCLQTKH+ YS+IKILRYWGMDM
Subjt:  AFVYNISIGLIYFVLIASLIIEIYQILRLPFTDWAIVQMVRHHQTFPILERFLQSLDPQSATWRRWSTKMGQFNLLDFCLQTKHQKYSKIKILRYWGMDM

Query:  KLRKQLSLKRIEVHPEVKELVVTELGEIEKIKGQEEFDQRGQWTIDRYQTKLKLNEESKLIKALK-TVSERVFDKSIFIWHITTNIFYNIKGFRDTSTTV
        KLRKQLSL RI+VHPEV+E +VTEL EIE+IKGQEEFD RGQWTIDRY+TK     E+KLIKA++ TV +R FDK IFIWHITTNIFYNI+G+RD S  V
Subjt:  KLRKQLSLKRIEVHPEVKELVVTELGEIEKIKGQEEFDQRGQWTIDRYQTKLKLNEESKLIKALK-TVSERVFDKSIFIWHITTNIFYNIKGFRDTSTTV

Query:  GSKIKAIMSLSDYMMHLMATCPHVLS-TTRDIIFDHACVRLGKFTRTECRKKEHICNDILKSQEEGNMLEA--------SKAQKAVVGNWDLLKDVKELV
        G+K +AIM LSDYMM+L+ T  HVLS TT DIIFDH+CV+LGKFTRT   KKE ICNDIL  ++E ++L A        S+A+K VVGNW L+KDVKEL 
Subjt:  GSKIKAIMSLSDYMMHLMATCPHVLS-TTRDIIFDHACVRLGKFTRTECRKKEHICNDILKSQEEGNMLEA--------SKAQKAVVGNWDLLKDVKELV

Query:  DSLLTLSNDNSWELIGSMWFEMLGYAASKCEMKYHSEHIRQGGELITHVWLLIAHNVTKFSPLP--QGSRDEENPATS
        D LLTLSN++ W+LIGSMWFEMLGYAASKCEM+YHSEHIRQGGELITHVWLLIAHNVTK+S      G +DEE PATS
Subjt:  DSLLTLSNDNSWELIGSMWFEMLGYAASKCEMKYHSEHIRQGGELITHVWLLIAHNVTKFSPLP--QGSRDEENPATS

XP_022999644.1 uncharacterized protein LOC111493941 [Cucurbita maxima]1.9e-27273.55Show/hide
Query:  MFSDI-TRMVLEHFNLIWNYWGIEFLVLANFIFQVILIFCGSRRRHTPGYKLSLSVWVSYILVARIATVVLGKLTTID----------------------
        MFSD+  R++  H + IWN WGIE LV ANF+FQVIL + GSRRRHTPGYKLSL+VW SY+L A+IATVVLGKLTTID                      
Subjt:  MFSDI-TRMVLEHFNLIWNYWGIEFLVLANFIFQVILIFCGSRRRHTPGYKLSLSVWVSYILVARIATVVLGKLTTID----------------------

Query:  -TGSPDTITAYSIEDNQLGVRHVFGTVIQVGIMFYILIRSWTDSKTSFLYLSMSLAGIIKYGETSWALKSALNANLDITIADLNKYVE-PHLFDRLPQ-R
          G+PDTITAYSIEDNQLGVR VF  VIQV IMFYIL+RSWTDSKTSFLYL MSLAGIIKYGETSWALKSAL+ NL  TIAD  KY E  +LFD+LPQ  
Subjt:  -TGSPDTITAYSIEDNQLGVRHVFGTVIQVGIMFYILIRSWTDSKTSFLYLSMSLAGIIKYGETSWALKSALNANLDITIADLNKYVE-PHLFDRLPQ-R

Query:  LPEAKLILRAYYRFCCLKPHIENWLYYPPTDCDRDKLYIDDCTHEDVFRITDSELGFMYDALYTKAPVIYTHTGLILRFISLLSMIAALWGLSVLFKDAF
        LPEAKLILRAYYRFCCLKPH+ENWLYYPPTDCD  KLYIDDC +EDVFRITDSELGFMYDALYTKAPVIYT  GLILRFISLLS+IA L G SVLFKDAF
Subjt:  LPEAKLILRAYYRFCCLKPHIENWLYYPPTDCDRDKLYIDDCTHEDVFRITDSELGFMYDALYTKAPVIYTHTGLILRFISLLSMIAALWGLSVLFKDAF

Query:  VYNISIGLIYFVLIASLIIEIYQILRLPFTDWAIVQMVRHHQTFPILERFLQSLDPQSATWRRWSTKMGQFNLLDFCLQTKHQKYSKIKILRYWGMDMKL
        VYNIS+G I++VLIASLIIEIYQI+R+PFTDWAIVQM+RHH+TFPIL   L SL PQSATWRRWS  MGQFNLLDFCLQTKH+ YS+IK+LR WG+DMKL
Subjt:  VYNISIGLIYFVLIASLIIEIYQILRLPFTDWAIVQMVRHHQTFPILERFLQSLDPQSATWRRWSTKMGQFNLLDFCLQTKHQKYSKIKILRYWGMDMKL

Query:  RKQLSLKRIEVHPEVKELVVTELGEIEKIKGQEEFDQRGQWTIDRYQTKLKLNEESKLIKALK-TVSERVFDKSIFIWHITTNIFYNIKGFRDTSTTVGS
        RKQ+SL RI+VHPEVKELVVTEL EIEKIKGQEEF+QRGQWTIDRY+TKLK N++S LIKAL+ TV++R FDKSIFIWHITTNIFY+I+ F DTS  +G+
Subjt:  RKQLSLKRIEVHPEVKELVVTELGEIEKIKGQEEFDQRGQWTIDRYQTKLKLNEESKLIKALK-TVSERVFDKSIFIWHITTNIFYNIKGFRDTSTTVGS

Query:  KIKAIMSLSDYMMHLMATCPHVLSTTR-DIIFDHACVRLGKFTRTECRKKEHICNDILKSQEEGNML-----EASKAQKAVVGNWDLLKDVKELVDSLLT
        K++AIMS+S+YMM+L+ T  HVLSTT  +IIFDH+CV+LGKFTRT    KE  CN +L  Q+EG +      E S+A+K VVGNW LLKDVK+L DSLL 
Subjt:  KIKAIMSLSDYMMHLMATCPHVLSTTR-DIIFDHACVRLGKFTRTECRKKEHICNDILKSQEEGNML-----EASKAQKAVVGNWDLLKDVKELVDSLLT

Query:  LSNDNSWELIGSMWFEMLGYAASKCEMKYHSEHIRQGGELITHVWLLIAHNVTKFSPLPQ--GSRDEENPATS
        LSN+N W+LIGSMWFEMLGYAASKCEM+YHSEHIRQGGELITHVWLLIAHNVTK+S      G +DEE  A S
Subjt:  LSNDNSWELIGSMWFEMLGYAASKCEMKYHSEHIRQGGELITHVWLLIAHNVTKFSPLPQ--GSRDEENPATS

XP_038889477.1 uncharacterized protein LOC120079385 [Benincasa hispida]1.1e-27273.3Show/hide
Query:  MFSDITRMVLE-HFNLIWNYWGIEFLVLANFIFQVILIFCGSRRRHTPGYKLSLSVWVSYILVARIATVVLGKLTTID----------------------
        MFS++  +++  H + IW+YWGIE LVLANF+FQVIL F GSRRRHTPG KLSL+VW SY+L A+IATVVLGKLTTI+                      
Subjt:  MFSDITRMVLE-HFNLIWNYWGIEFLVLANFIFQVILIFCGSRRRHTPGYKLSLSVWVSYILVARIATVVLGKLTTID----------------------

Query:  -TGSPDTITAYSIEDNQLGVRHVFGTVIQVGIMFYILIRSWTDSKTSFLYLSMSLAGIIKYGETSWALKSALNANLDITIADLNKYVE-PHLFDRLPQ--
          G+PDTITAYSIEDNQLGVR VF  VIQV IMFYILIRSWT+SKTSFLY+ MS+AGIIKYGETSWALKSALN N   TIAD  KY E   LF +LPQ  
Subjt:  -TGSPDTITAYSIEDNQLGVRHVFGTVIQVGIMFYILIRSWTDSKTSFLYLSMSLAGIIKYGETSWALKSALNANLDITIADLNKYVE-PHLFDRLPQ--

Query:  -RLPEAKLILRAYYRFCCLKPHIENWLYYPPTDCDRDKLYIDDCTHEDVFRITDSELGFMYDALYTKAPVIYTHTGLILRFISLLSMIAALWGLSVLFKD
          LPEA LILRAYYRFCCLKPH+ENWLYYPPTDCD++KLYIDDC +EDVFRITDSELGFMYDALYTKAPV+YT  GLILRFISLLS+IA L G SVLFKD
Subjt:  -RLPEAKLILRAYYRFCCLKPHIENWLYYPPTDCDRDKLYIDDCTHEDVFRITDSELGFMYDALYTKAPVIYTHTGLILRFISLLSMIAALWGLSVLFKD

Query:  AFVYNISIGLIYFVLIASLIIEIYQILRLPFTDWAIVQMVRHHQTFPILERFLQSLDPQSATWRRWSTKMGQFNLLDFCLQTKHQKYSKIKILRYWGMDM
        AFVYNISIG I+FVLIA+LIIEIYQILRLPFTDWAIVQMVRHH+ FPIL  FL+SL PQSATWRRWS  +GQFNLLDFCLQTKH+ YS+IKILRYWGMDM
Subjt:  AFVYNISIGLIYFVLIASLIIEIYQILRLPFTDWAIVQMVRHHQTFPILERFLQSLDPQSATWRRWSTKMGQFNLLDFCLQTKHQKYSKIKILRYWGMDM

Query:  KLRKQLSLKRIEVHPEVKELVVTELGEIEKIKGQEEFDQRGQWTIDRYQTKLKLNEESKLIKALK-TVSERVFDKSIFIWHITTNIFYNIKGFRDTSTTV
        KLRKQLSL RI++ PEVKELVVTEL EIEKIKGQEEFDQRGQWTIDRY+ KL  N E+KLIKA++ TVS+R FDK IFIWHITTNIFYNI+ +RD S  V
Subjt:  KLRKQLSLKRIEVHPEVKELVVTELGEIEKIKGQEEFDQRGQWTIDRYQTKLKLNEESKLIKALK-TVSERVFDKSIFIWHITTNIFYNIKGFRDTSTTV

Query:  GSKIKAIMSLSDYMMHLMATCPHVLS-TTRDIIFDHACVRLGKFTRTECRKKEHICNDILKSQEEGNMLEA--------SKAQKAVVGNWDLLKDVKELV
        G+K +AIMSLSDYMM+L+ T  HVLS TT DIIFDH+CV+LGKFTRT   KKE +CNDIL  Q+E ++L A        S+A+K VVGNW L+KDVKEL 
Subjt:  GSKIKAIMSLSDYMMHLMATCPHVLS-TTRDIIFDHACVRLGKFTRTECRKKEHICNDILKSQEEGNMLEA--------SKAQKAVVGNWDLLKDVKELV

Query:  DSLLTLSNDNSWELIGSMWFEMLGYAASKCEMKYHSEHIRQGGELITHVWLLIAHNVTKFSPLPQ--GSRDEENPATS
        DSLLTLSN++ W+L+GSMWFEM+GYAASKCEM+YHSEHIRQGGELITHVWLLIAHNVTK+S      G +DEE P  S
Subjt:  DSLLTLSNDNSWELIGSMWFEMLGYAASKCEMKYHSEHIRQGGELITHVWLLIAHNVTKFSPLPQ--GSRDEENPATS

TrEMBL top hitse value%identityAlignment
A0A0A0LK45 DUF4220 domain-containing protein1.8e-27173.6Show/hide
Query:  MFSDITRMVL-EHFNLIWNYWGIEFLVLANFIFQVILIFCGSRRRHTPGYKLSLSVWVSYILVARIATVVLGKLTTID----------------------
        MFS++  +++ EH + IW YWGIE LVLANF+FQVIL F GSRRRHTPG +LSL VW SY+L A+IATVVLGKLTTID                      
Subjt:  MFSDITRMVL-EHFNLIWNYWGIEFLVLANFIFQVILIFCGSRRRHTPGYKLSLSVWVSYILVARIATVVLGKLTTID----------------------

Query:  -TGSPDTITAYSIEDNQLGVRHVFGTVIQVGIMFYILIRSWTDSKTSFLYLSMSLAGIIKYGETSWALKSALNANLDITIADLNKYVE-PHLFDRLPQ--
          G+PDTITAYSIEDNQLGVR VF  VIQVGIMFYIL+RSWTDSKTSFLYL MSLAGIIKYGETSWALKSALN N   TIAD  KY E   LF++LPQ  
Subjt:  -TGSPDTITAYSIEDNQLGVRHVFGTVIQVGIMFYILIRSWTDSKTSFLYLSMSLAGIIKYGETSWALKSALNANLDITIADLNKYVE-PHLFDRLPQ--

Query:  -RLPEAKLILRAYYRFCCLKPHIENWLYYPPTDCDRDKLYIDDCTHEDVFRITDSELGFMYDALYTKAPVIYTHTGLILRFISLLSMIAALWGLSVLFKD
          LPEA LILRAYYRFCCLKPH+ENWLYYPPTDCD+ KL+I +C +EDVFRITD ELGFMYDALYTKAPV+YT  GLILR ISLLS+IA L G SVLFKD
Subjt:  -RLPEAKLILRAYYRFCCLKPHIENWLYYPPTDCDRDKLYIDDCTHEDVFRITDSELGFMYDALYTKAPVIYTHTGLILRFISLLSMIAALWGLSVLFKD

Query:  AFVYNISIGLIYFVLIASLIIEIYQILRLPFTDWAIVQMVRHHQTFPILERFLQSLDPQSATWRRWSTKMGQFNLLDFCLQTKHQKYSKIKILRYWGMDM
        AFVYNISIG I+FVLIA+LIIEIYQILRLPFTDWAIVQMVRHH+ FPIL  FL+SL PQSATWRRWS  MGQFNLL+FCLQTKH+ YS+IKILRY GMDM
Subjt:  AFVYNISIGLIYFVLIASLIIEIYQILRLPFTDWAIVQMVRHHQTFPILERFLQSLDPQSATWRRWSTKMGQFNLLDFCLQTKHQKYSKIKILRYWGMDM

Query:  KLRKQLSLKRIEVHPEVKELVVTELGEIEKIKGQEEFDQRGQWTIDRYQTKLKLNEESKLIKALK-TVSERVFDKSIFIWHITTNIFYNIKGFRDTSTTV
        KLRKQLSL RI+V PEVKE VVTEL EIE IKG+EEFDQRGQWTI+RY+T L LN E+KLIKA++ TVS+R FDK IFIWHITTNIFYNI+G+RDTS  V
Subjt:  KLRKQLSLKRIEVHPEVKELVVTELGEIEKIKGQEEFDQRGQWTIDRYQTKLKLNEESKLIKALK-TVSERVFDKSIFIWHITTNIFYNIKGFRDTSTTV

Query:  GSKIKAIMSLSDYMMHLMATCPHVLS-TTRDIIFDHACVRLGKFTRTECRKKEHICNDILKSQEEGNMLEA--------SKAQKAVVGNWDLLKDVKELV
        G+K +AIMSLSDYMM+L+ T  HVLS TT DIIFDH+CV+LGKFTRT   KKE ICNDILK ++E ++L A        S+A+K VVGNW L+KDVKEL 
Subjt:  GSKIKAIMSLSDYMMHLMATCPHVLS-TTRDIIFDHACVRLGKFTRTECRKKEHICNDILKSQEEGNMLEA--------SKAQKAVVGNWDLLKDVKELV

Query:  DSLLTLSNDNSWELIGSMWFEMLGYAASKCEMKYHSEHIRQGGELITHVWLLIAHNVTKFSPLP--QGSRDEENPATS
        D LL LSN+N W+LIGSMWFEMLGYAASKCEM+YHSEHIRQGGELITHVWLLIAHNVTK+S      G +DEE PATS
Subjt:  DSLLTLSNDNSWELIGSMWFEMLGYAASKCEMKYHSEHIRQGGELITHVWLLIAHNVTKFSPLP--QGSRDEENPATS

A0A1S4DZ97 uncharacterized protein LOC1034935084.6e-27273.6Show/hide
Query:  MFSDITRMVL-EHFNLIWNYWGIEFLVLANFIFQVILIFCGSRRRHTPGYKLSLSVWVSYILVARIATVVLGKLTTID----------------------
        MFS++  +++ EH + IW YWGIE LVLANF+FQVIL F GSRRRHTPG +LSL VW SY+L A+IATVVLGKLTTID                      
Subjt:  MFSDITRMVL-EHFNLIWNYWGIEFLVLANFIFQVILIFCGSRRRHTPGYKLSLSVWVSYILVARIATVVLGKLTTID----------------------

Query:  -TGSPDTITAYSIEDNQLGVRHVFGTVIQVGIMFYILIRSWTDSKTSFLYLSMSLAGIIKYGETSWALKSALNANLDITIADLNKYVE-PHLFDRLPQ--
          G+PDTITAYSIEDNQLGVR VF  VIQVGIMFYIL+RSWTDSKTSFLYL MSLAGIIKYGETSWALKSALN N   TIAD  KY E  +LF++LPQ  
Subjt:  -TGSPDTITAYSIEDNQLGVRHVFGTVIQVGIMFYILIRSWTDSKTSFLYLSMSLAGIIKYGETSWALKSALNANLDITIADLNKYVE-PHLFDRLPQ--

Query:  -RLPEAKLILRAYYRFCCLKPHIENWLYYPPTDCDRDKLYIDDCTHEDVFRITDSELGFMYDALYTKAPVIYTHTGLILRFISLLSMIAALWGLSVLFKD
          LPEA LILRAYYRFCCLKPH+ENWLYYPPTDCD+DKLYI DC +EDVFRITD ELGFMYDALYTKAPV+YT  GLILRFISLLS+IA L G SVLFKD
Subjt:  -RLPEAKLILRAYYRFCCLKPHIENWLYYPPTDCDRDKLYIDDCTHEDVFRITDSELGFMYDALYTKAPVIYTHTGLILRFISLLSMIAALWGLSVLFKD

Query:  AFVYNISIGLIYFVLIASLIIEIYQILRLPFTDWAIVQMVRHHQTFPILERFLQSLDPQSATWRRWSTKMGQFNLLDFCLQTKHQKYSKIKILRYWGMDM
        AFVYNISIG I+FVLIA+LIIEIYQILRLPFTDWAIVQMVRHH+ FPIL  FL+SL PQSATWRRWS  MGQFNLLDFCLQTKH+ YS+IKILRYWGMDM
Subjt:  AFVYNISIGLIYFVLIASLIIEIYQILRLPFTDWAIVQMVRHHQTFPILERFLQSLDPQSATWRRWSTKMGQFNLLDFCLQTKHQKYSKIKILRYWGMDM

Query:  KLRKQLSLKRIEVHPEVKELVVTELGEIEKIKGQEEFDQRGQWTIDRYQTKLKLNEESKLIKALK-TVSERVFDKSIFIWHITTNIFYNIKGFRDTSTTV
        KLRKQLSL RI+VHPEV+E +VTEL EIE+IKGQEEFD RGQWTIDRY+TK     E+KLIKA++ TV +R FDK IFIWHITTNIFYNI+G+RD S  V
Subjt:  KLRKQLSLKRIEVHPEVKELVVTELGEIEKIKGQEEFDQRGQWTIDRYQTKLKLNEESKLIKALK-TVSERVFDKSIFIWHITTNIFYNIKGFRDTSTTV

Query:  GSKIKAIMSLSDYMMHLMATCPHVLS-TTRDIIFDHACVRLGKFTRTECRKKEHICNDILKSQEEGNMLEA--------SKAQKAVVGNWDLLKDVKELV
        G+K +AIM LSDYMM+L+ T  HVLS TT DIIFDH+CV+LGKFTRT   KKE ICNDIL  ++E ++L A        S+A+K VVGNW L+KDVKEL 
Subjt:  GSKIKAIMSLSDYMMHLMATCPHVLS-TTRDIIFDHACVRLGKFTRTECRKKEHICNDILKSQEEGNMLEA--------SKAQKAVVGNWDLLKDVKELV

Query:  DSLLTLSNDNSWELIGSMWFEMLGYAASKCEMKYHSEHIRQGGELITHVWLLIAHNVTKFSPLP--QGSRDEENPATS
        D LLTLSN++ W+LIGSMWFEMLGYAASKCEM+YHSEHIRQGGELITHVWLLIAHNVTK+S      G +DEE PATS
Subjt:  DSLLTLSNDNSWELIGSMWFEMLGYAASKCEMKYHSEHIRQGGELITHVWLLIAHNVTKFSPLP--QGSRDEENPATS

A0A5A7TID1 DUF4220 domain-containing protein4.6e-27273.6Show/hide
Query:  MFSDITRMVL-EHFNLIWNYWGIEFLVLANFIFQVILIFCGSRRRHTPGYKLSLSVWVSYILVARIATVVLGKLTTID----------------------
        MFS++  +++ EH + IW YWGIE LVLANF+FQVIL F GSRRRHTPG +LSL VW SY+L A+IATVVLGKLTTID                      
Subjt:  MFSDITRMVL-EHFNLIWNYWGIEFLVLANFIFQVILIFCGSRRRHTPGYKLSLSVWVSYILVARIATVVLGKLTTID----------------------

Query:  -TGSPDTITAYSIEDNQLGVRHVFGTVIQVGIMFYILIRSWTDSKTSFLYLSMSLAGIIKYGETSWALKSALNANLDITIADLNKYVE-PHLFDRLPQ--
          G+PDTITAYSIEDNQLGVR VF  VIQVGIMFYIL+RSWTDSKTSFLYL MSLAGIIKYGETSWALKSALN N   TIAD  KY E  +LF++LPQ  
Subjt:  -TGSPDTITAYSIEDNQLGVRHVFGTVIQVGIMFYILIRSWTDSKTSFLYLSMSLAGIIKYGETSWALKSALNANLDITIADLNKYVE-PHLFDRLPQ--

Query:  -RLPEAKLILRAYYRFCCLKPHIENWLYYPPTDCDRDKLYIDDCTHEDVFRITDSELGFMYDALYTKAPVIYTHTGLILRFISLLSMIAALWGLSVLFKD
          LPEA LILRAYYRFCCLKPH+ENWLYYPPTDCD+DKLYI DC +EDVFRITD ELGFMYDALYTKAPV+YT  GLILRFISLLS+IA L G SVLFKD
Subjt:  -RLPEAKLILRAYYRFCCLKPHIENWLYYPPTDCDRDKLYIDDCTHEDVFRITDSELGFMYDALYTKAPVIYTHTGLILRFISLLSMIAALWGLSVLFKD

Query:  AFVYNISIGLIYFVLIASLIIEIYQILRLPFTDWAIVQMVRHHQTFPILERFLQSLDPQSATWRRWSTKMGQFNLLDFCLQTKHQKYSKIKILRYWGMDM
        AFVYNISIG I+FVLIA+LIIEIYQILRLPFTDWAIVQMVRHH+ FPIL  FL+SL PQSATWRRWS  MGQFNLLDFCLQTKH+ YS+IKILRYWGMDM
Subjt:  AFVYNISIGLIYFVLIASLIIEIYQILRLPFTDWAIVQMVRHHQTFPILERFLQSLDPQSATWRRWSTKMGQFNLLDFCLQTKHQKYSKIKILRYWGMDM

Query:  KLRKQLSLKRIEVHPEVKELVVTELGEIEKIKGQEEFDQRGQWTIDRYQTKLKLNEESKLIKALK-TVSERVFDKSIFIWHITTNIFYNIKGFRDTSTTV
        KLRKQLSL RI+VHPEV+E +VTEL EIE+IKGQEEFD RGQWTIDRY+TK     E+KLIKA++ TV +R FDK IFIWHITTNIFYNI+G+RD S  V
Subjt:  KLRKQLSLKRIEVHPEVKELVVTELGEIEKIKGQEEFDQRGQWTIDRYQTKLKLNEESKLIKALK-TVSERVFDKSIFIWHITTNIFYNIKGFRDTSTTV

Query:  GSKIKAIMSLSDYMMHLMATCPHVLS-TTRDIIFDHACVRLGKFTRTECRKKEHICNDILKSQEEGNMLEA--------SKAQKAVVGNWDLLKDVKELV
        G+K +AIM LSDYMM+L+ T  HVLS TT DIIFDH+CV+LGKFTRT   KKE ICNDIL  ++E ++L A        S+A+K VVGNW L+KDVKEL 
Subjt:  GSKIKAIMSLSDYMMHLMATCPHVLS-TTRDIIFDHACVRLGKFTRTECRKKEHICNDILKSQEEGNMLEA--------SKAQKAVVGNWDLLKDVKELV

Query:  DSLLTLSNDNSWELIGSMWFEMLGYAASKCEMKYHSEHIRQGGELITHVWLLIAHNVTKFSPLP--QGSRDEENPATS
        D LLTLSN++ W+LIGSMWFEMLGYAASKCEM+YHSEHIRQGGELITHVWLLIAHNVTK+S      G +DEE PATS
Subjt:  DSLLTLSNDNSWELIGSMWFEMLGYAASKCEMKYHSEHIRQGGELITHVWLLIAHNVTKFSPLP--QGSRDEENPATS

A0A6J1G3A8 uncharacterized protein LOC1114503966.9e-26872.21Show/hide
Query:  MFSDI-TRMVLEHFNLIWNYWGIEFLVLANFIFQVILIFCGSRRRHTPGYKLSLSVWVSYILVARIATVVLGKLTTID----------------------
        MFSD+  R++  H + IWN WGIE LV ANF+FQVIL + GSRRRHTPGYKLSL+VW SY+L A+IATVVLGKLTTID                      
Subjt:  MFSDI-TRMVLEHFNLIWNYWGIEFLVLANFIFQVILIFCGSRRRHTPGYKLSLSVWVSYILVARIATVVLGKLTTID----------------------

Query:  -TGSPDTITAYSIEDNQLGVRHVFGTVIQVGIMFYILIRSWTDSKTSFLYLSMSLAGIIKYGETSWALKSALNANLDITIADLNKYVE-PHLFDRLPQ-R
          G+PDTITAYSIEDNQLGVR VF   +QV IMFYIL+RSWTDSKTSFLYL MSLAGIIKYGETSWALKSAL+ NL  TIAD  KY E   LF++LPQ  
Subjt:  -TGSPDTITAYSIEDNQLGVRHVFGTVIQVGIMFYILIRSWTDSKTSFLYLSMSLAGIIKYGETSWALKSALNANLDITIADLNKYVE-PHLFDRLPQ-R

Query:  LPEAKLILRAYYRFCCLKPHIENWLYYPPTDCDRDKLYIDDCTHEDVFRITDSELGFMYDALYTKAPVIYTHTGLILRFISLLSMIAALWGLSVLFKDAF
        LPEAKLILRAYYRFCCLKPH+ENWLYYPPTDCD  KLYIDDC +EDVFRITDSELGFMYDALYTKAPVIYT  GLILRFISLLS+IA L G SVLFKDAF
Subjt:  LPEAKLILRAYYRFCCLKPHIENWLYYPPTDCDRDKLYIDDCTHEDVFRITDSELGFMYDALYTKAPVIYTHTGLILRFISLLSMIAALWGLSVLFKDAF

Query:  VYNISIGLIYFVLIASLIIEIYQILRLPFTDWAIVQMVRHHQTFPILERFLQSLDPQSATWRRWSTKMGQFNLLDFCLQTKHQKYSKIKILRYWGMDMKL
        VYNIS+G I++VLIASLIIE+YQI+R+PFTDWAIVQM+RHH+TFPIL   L SL PQSATWRRWS  MGQFNLLDFC+QTKH+ YS+IK+LR WGMDMKL
Subjt:  VYNISIGLIYFVLIASLIIEIYQILRLPFTDWAIVQMVRHHQTFPILERFLQSLDPQSATWRRWSTKMGQFNLLDFCLQTKHQKYSKIKILRYWGMDMKL

Query:  RKQLSLKRIEVHPEVKELVVTELGEIEKIKGQEEFDQRGQWTIDRYQTKLKLNEESKLIKALK-TVSERVFDKSIFIWHITTNIFYNIKGFRDTSTTVGS
        RKQ+SL RIEV PEVKELVVTEL EIE IKGQEEF+QRGQWTIDRY+ KLK N++SKLIKAL+ TV++R FDKSIFIWHITTNIFY+I+ F DTS  +G+
Subjt:  RKQLSLKRIEVHPEVKELVVTELGEIEKIKGQEEFDQRGQWTIDRYQTKLKLNEESKLIKALK-TVSERVFDKSIFIWHITTNIFYNIKGFRDTSTTVGS

Query:  KIKAIMSLSDYMMHLMATCPHVLSTTR-DIIFDHACVRLGKFTRTECRKKEHICNDILKSQEEGNML-----EASKAQKAVVGNWDLLKDVKELVDSLLT
        K++AIMS+S+YMM+L+ T  HVLSTT  +IIFDH+CV+LG+FTRT  R  E  CN +L  + EG +      E S A+K VVGNW LLKDVK+L +SLL 
Subjt:  KIKAIMSLSDYMMHLMATCPHVLSTTR-DIIFDHACVRLGKFTRTECRKKEHICNDILKSQEEGNML-----EASKAQKAVVGNWDLLKDVKELVDSLLT

Query:  LSNDNSWELIGSMWFEMLGYAASKCEMKYHSEHIRQGGELITHVWLLIAHNVTKFSPLPQ--GSRDEENPATS
        LSN++ W+LIGSMWFEMLGYAASKCEM+YHSEHIRQGGELITHVWLLIAHNVTK+S      G +DEE  A S
Subjt:  LSNDNSWELIGSMWFEMLGYAASKCEMKYHSEHIRQGGELITHVWLLIAHNVTKFSPLPQ--GSRDEENPATS

A0A6J1KHN9 uncharacterized protein LOC1114939419.3e-27373.55Show/hide
Query:  MFSDI-TRMVLEHFNLIWNYWGIEFLVLANFIFQVILIFCGSRRRHTPGYKLSLSVWVSYILVARIATVVLGKLTTID----------------------
        MFSD+  R++  H + IWN WGIE LV ANF+FQVIL + GSRRRHTPGYKLSL+VW SY+L A+IATVVLGKLTTID                      
Subjt:  MFSDI-TRMVLEHFNLIWNYWGIEFLVLANFIFQVILIFCGSRRRHTPGYKLSLSVWVSYILVARIATVVLGKLTTID----------------------

Query:  -TGSPDTITAYSIEDNQLGVRHVFGTVIQVGIMFYILIRSWTDSKTSFLYLSMSLAGIIKYGETSWALKSALNANLDITIADLNKYVE-PHLFDRLPQ-R
          G+PDTITAYSIEDNQLGVR VF  VIQV IMFYIL+RSWTDSKTSFLYL MSLAGIIKYGETSWALKSAL+ NL  TIAD  KY E  +LFD+LPQ  
Subjt:  -TGSPDTITAYSIEDNQLGVRHVFGTVIQVGIMFYILIRSWTDSKTSFLYLSMSLAGIIKYGETSWALKSALNANLDITIADLNKYVE-PHLFDRLPQ-R

Query:  LPEAKLILRAYYRFCCLKPHIENWLYYPPTDCDRDKLYIDDCTHEDVFRITDSELGFMYDALYTKAPVIYTHTGLILRFISLLSMIAALWGLSVLFKDAF
        LPEAKLILRAYYRFCCLKPH+ENWLYYPPTDCD  KLYIDDC +EDVFRITDSELGFMYDALYTKAPVIYT  GLILRFISLLS+IA L G SVLFKDAF
Subjt:  LPEAKLILRAYYRFCCLKPHIENWLYYPPTDCDRDKLYIDDCTHEDVFRITDSELGFMYDALYTKAPVIYTHTGLILRFISLLSMIAALWGLSVLFKDAF

Query:  VYNISIGLIYFVLIASLIIEIYQILRLPFTDWAIVQMVRHHQTFPILERFLQSLDPQSATWRRWSTKMGQFNLLDFCLQTKHQKYSKIKILRYWGMDMKL
        VYNIS+G I++VLIASLIIEIYQI+R+PFTDWAIVQM+RHH+TFPIL   L SL PQSATWRRWS  MGQFNLLDFCLQTKH+ YS+IK+LR WG+DMKL
Subjt:  VYNISIGLIYFVLIASLIIEIYQILRLPFTDWAIVQMVRHHQTFPILERFLQSLDPQSATWRRWSTKMGQFNLLDFCLQTKHQKYSKIKILRYWGMDMKL

Query:  RKQLSLKRIEVHPEVKELVVTELGEIEKIKGQEEFDQRGQWTIDRYQTKLKLNEESKLIKALK-TVSERVFDKSIFIWHITTNIFYNIKGFRDTSTTVGS
        RKQ+SL RI+VHPEVKELVVTEL EIEKIKGQEEF+QRGQWTIDRY+TKLK N++S LIKAL+ TV++R FDKSIFIWHITTNIFY+I+ F DTS  +G+
Subjt:  RKQLSLKRIEVHPEVKELVVTELGEIEKIKGQEEFDQRGQWTIDRYQTKLKLNEESKLIKALK-TVSERVFDKSIFIWHITTNIFYNIKGFRDTSTTVGS

Query:  KIKAIMSLSDYMMHLMATCPHVLSTTR-DIIFDHACVRLGKFTRTECRKKEHICNDILKSQEEGNML-----EASKAQKAVVGNWDLLKDVKELVDSLLT
        K++AIMS+S+YMM+L+ T  HVLSTT  +IIFDH+CV+LGKFTRT    KE  CN +L  Q+EG +      E S+A+K VVGNW LLKDVK+L DSLL 
Subjt:  KIKAIMSLSDYMMHLMATCPHVLSTTR-DIIFDHACVRLGKFTRTECRKKEHICNDILKSQEEGNML-----EASKAQKAVVGNWDLLKDVKELVDSLLT

Query:  LSNDNSWELIGSMWFEMLGYAASKCEMKYHSEHIRQGGELITHVWLLIAHNVTKFSPLPQ--GSRDEENPATS
        LSN+N W+LIGSMWFEMLGYAASKCEM+YHSEHIRQGGELITHVWLLIAHNVTK+S      G +DEE  A S
Subjt:  LSNDNSWELIGSMWFEMLGYAASKCEMKYHSEHIRQGGELITHVWLLIAHNVTKFSPLPQ--GSRDEENPATS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT5G45460.1 unknown protein4.9e-1623.58Show/hide
Query:  MVLEHFNLIWNYWGIEFLVLANFIFQVILIFCGSRRRHTPGYKLSLSVWVSYILVARIATVVLGKLT--------------------------TIDTGSP
        ++ +H    W+ W I   +  +   Q  LI     R+ TP   L + +W SY+L    A   +G ++                           +  G P
Subjt:  MVLEHFNLIWNYWGIEFLVLANFIFQVILIFCGSRRRHTPGYKLSLSVWVSYILVARIATVVLGKLT--------------------------TIDTGSP

Query:  DTITAYSIEDNQLGVRHVFGTVIQVGIMFYILIRSWTDSKTSFLYLSMSLAGIIKYGETSWALKSA-----------------------------LNANL
        DTITA+++EDN L +R+VFG V Q     Y++++S  +S    + L + ++G IKY E + AL SA                               A L
Subjt:  DTITAYSIEDNQLGVRHVFGTVIQVGIMFYILIRSWTDSKTSFLYLSMSLAGIIKYGETSWALKSA-----------------------------LNANL

Query:  DITIADLNKYVEPHLFDRLP----------QRLPEAKLILRAYYRFCCLKPHIENWLYYPPTDCDRDKLYIDDCTHEDVFRITDSELGFMYDALYTKAPV
           I  +++  + H   +L           + L   ++   AY  F   K  + N ++         +++ +    E+  RI + ELGF+YDAL+TK  V
Subjt:  DITIADLNKYVEPHLFDRLP----------QRLPEAKLILRAYYRFCCLKPHIENWLYYPPTDCDRDKLYIDDCTHEDVFRITDSELGFMYDALYTKAPV

Query:  IYTHTGLILRFISLLSMIAALWGLSVLF-----KDAFVYNISIGLIYFVLIASLIIEIYQILRLPFTDW
        ++T  G + R ++  S++AA     ++F     K    +   + + Y +    L+++   IL   F+DW
Subjt:  IYTHTGLILRFISLLSMIAALWGLSVLF-----KDAFVYNISIGLIYFVLIASLIIEIYQILRLPFTDW

AT5G45470.1 Protein of unknown function (DUF594)1.4e-2320.95Show/hide
Query:  ITRMVLEHFNLIWNYWGIEFLVLANFIFQVILIFCGSRRRHTPGYKLSLSVWVSYILVARIATVVLGKLT--------------------------TIDT
        +  ++ +H   +W+ W I   V+ +   Q ILI     R+ TP   L + VW SY+L    A   +G ++                           +  
Subjt:  ITRMVLEHFNLIWNYWGIEFLVLANFIFQVILIFCGSRRRHTPGYKLSLSVWVSYILVARIATVVLGKLT--------------------------TIDT

Query:  GSPDTITAYSIEDNQLGVRHVFGTVIQVGIMFYILIRSWTDSKTSFLYLSMSLAGIIKYGETSWALKSA-LNANLDITI---------ADLNKYVEPHLF
        G PDTITA+++EDN L +RHVFG V Q     Y+++ S  +S    + L + ++G IKY E + AL SA L+   D  I         A L +  +    
Subjt:  GSPDTITAYSIEDNQLGVRHVFGTVIQVGIMFYILIRSWTDSKTSFLYLSMSLAGIIKYGETSWALKSA-LNANLDITI---------ADLNKYVEPHLF

Query:  DRLPQR-----------------------------LPEAKLILRAYYRFCCLKPHIENWLYYPPTDCDRDKLYIDDCTHEDVFRITDSELGFMYDALYTK
         RLP +                             L + +++  AY  F   K  + N ++      +  +++ +    E+  RI + ELGF+YDAL+TK
Subjt:  DRLPQR-----------------------------LPEAKLILRAYYRFCCLKPHIENWLYYPPTDCDRDKLYIDDCTHEDVFRITDSELGFMYDALYTK

Query:  APVIYTHTGLILRFISLLSMIAALWGLSVLF-----KDAFVYNISIGLIYFVLIASLIIEIYQILRLPFTDWAIVQMVRHHQTFPILERFLQSLDPQSAT
          +++T  G + R  +  +++AA     ++F     K    +   + + Y +    L+++   IL   F+DW        + +       LQS   +   
Subjt:  APVIYTHTGLILRFISLLSMIAALWGLSVLF-----KDAFVYNISIGLIYFVLIASLIIEIYQILRLPFTDWAIVQMVRHHQTFPILERFLQSLDPQSAT

Query:  W-----------------------------------------------------------------------------RRWSTKMGQFNLLDFCLQT---
        W                                                                             RRWS  +  FN + +  +    
Subjt:  W-----------------------------------------------------------------------------RRWSTKMGQFNLLDFCLQT---

Query:  -------KHQKYS-----------KIKILRYWGMDMKL----------------RKQLSLKRIEVHPEVKEL-------------VVTELGEI-------
                 ++YS              I + +G  +KL                RK L  +R  ++P   E              +++E  +I       
Subjt:  -------KHQKYS-----------KIKILRYWGMDMKL----------------RKQLSLKRIEVHPEVKEL-------------VVTELGEI-------

Query:  --------------------EKIKGQEEFDQ-----------RGQWTIDRYQTKLKLNEESKLIKALKTVSERVFDKSIFIWHITTNIFY------NIKG
                            E++K + ++             RG+WT+     +  L  +++  K ++ V++  +D+S+ +WHI T + Y       I  
Subjt:  --------------------EKIKGQEEFDQ-----------RGQWTIDRYQTKLKLNEESKLIKALKTVSERVFDKSIFIWHITTNIFY------NIKG

Query:  FRDTSTTVGSKIKAIMSLSDYMMHLMATCPHVLSTTR---DIIFDHACVRLGKFTRTECRKKEHICNDILKSQEEGNMLEASKAQKAVVGNWD----LLK
          D      S  +    +SDYMM+L+   P ++S       I F        KF      ++ HI ND        N+L+     + +    D    +L 
Subjt:  FRDTSTTVGSKIKAIMSLSDYMMHLMATCPHVLSTTR---DIIFDHACVRLGKFTRTECRKKEHICNDILKSQEEGNMLEASKAQKAVVGNWD----LLK

Query:  DVKELVDSLLTLS---NDNSWELIGSMWFEMLGYAASKCEMKYHSEHIRQGGELITHVWLLIAH
        D   L   L  +    N + WE++  +W E+L YAA  C+   H E + +GGELI  VWLL+AH
Subjt:  DVKELVDSLLTLS---NDNSWELIGSMWFEMLGYAASKCEMKYHSEHIRQGGELITHVWLLIAH

AT5G45480.1 Protein of unknown function (DUF594)4.9e-1624.29Show/hide
Query:  IWNYWGIEFLVLANFIFQVILIFCGSRRRHTPGYKLSLSVWVSYILVARIATVVLGKLT--------------------------TIDTGSPDTITAYSI
        IW+ W I   ++ +   Q  LIF   +R+ +    L   +W +Y+L    A    G+++                           +  G PDTITA ++
Subjt:  IWNYWGIEFLVLANFIFQVILIFCGSRRRHTPGYKLSLSVWVSYILVARIATVVLGKLT--------------------------TIDTGSPDTITAYSI

Query:  EDNQLGVRHVFGTVIQVGIMFYILIRSWTDSKTSFLYLSMSLAGIIKYGETSWALKSA-------------------------LNANLDITI-ADLNKYV
        EDN+L +RH+ G   Q     Y+L++S  ++    + L  +  G+IKY E + AL  A                           A  D+ +   + K  
Subjt:  EDNQLGVRHVFGTVIQVGIMFYILIRSWTDSKTSFLYLSMSLAGIIKYGETSWALKSA-------------------------LNANLDITI-ADLNKYV

Query:  EPHL--FDRLPQRLPEA----KLILRAYYRFCCLKPHIENWLYYPPTDCDRDKLYIDDCTHEDVFRITDSELGFMYDALYTKAPVIYTHTGLILRFISLL
        EP     D  P + P+      ++  AY  F   K  + + L +        K + D    E+  RI + EL F+Y ALYTKA +++   G + RFI+L 
Subjt:  EPHL--FDRLPQRLPEA----KLILRAYYRFCCLKPHIENWLYYPPTDCDRDKLYIDDCTHEDVFRITDSELGFMYDALYTKAPVIYTHTGLILRFISLL

Query:  SMIAALWGLSVLFKDAFVYNISIGLIYFVLIASLIIEIYQILRLPFTDWAIVQM
         + AAL       K  +     +GL Y +L+  + ++   ++    +DW  V++
Subjt:  SMIAALWGLSVLFKDAFVYNISIGLIYFVLIASLIIEIYQILRLPFTDWAIVQM

AT5G45480.1 Protein of unknown function (DUF594)2.4e-1530.13Show/hide
Query:  RGQWTIDRYQTKLKLNEESKLIKALKTVSERVFDKSIFIWHITTNIFYNIKGFRDTSTTVGSKIKAIMSLSDYMMHLMATCPHVLSTTRDIIFDHACVRL
        RG WT+   Q   +   E K  K L+ V E  +D+S+ +WHI T + Y  K     + +     K    LSDYMM+L+   P ++S         A V +
Subjt:  RGQWTIDRYQTKLKLNEESKLIKALKTVSERVFDKSIFIWHITTNIFYNIKGFRDTSTTVGSKIKAIMSLSDYMMHLMATCPHVLSTTRDIIFDHACVRL

Query:  GKFT-RTECRK------KEHICNDILK---SQEEGNMLEASKAQKAVVGNWDLLKD------------VKELVDSLLTLSNDNS-WELIGSMWFEMLGYA
        GK   R  C +      + HI     K     +E ++   S A  A     D+  D             KEL        +D+  W+++  +W E+L YA
Subjt:  GKFT-RTECRK------KEHICNDILK---SQEEGNMLEASKAQKAVVGNWDLLKD------------VKELVDSLLTLSNDNS-WELIGSMWFEMLGYA

Query:  ASKCEMKYHSEHIRQGGELITHVWLLIAH
        A+KC    H+  + +GGELI+ VWLL+AH
Subjt:  ASKCEMKYHSEHIRQGGELITHVWLLIAH

AT5G45530.1 Protein of unknown function (DUF594)1.4e-2621.69Show/hide
Query:  IWNYWGIEFLVLANFIFQVILIFCGSRRRHTPGYKLSLSVWVSYILVARIATVVLGKLT--------------------------TIDTGSPDTITAYSI
        I + W I  LV+ + +FQ  LIF    R+ T    L+  +W +Y+L    A   + ++T                           +  G PDTITA ++
Subjt:  IWNYWGIEFLVLANFIFQVILIFCGSRRRHTPGYKLSLSVWVSYILVARIATVVLGKLT--------------------------TIDTGSPDTITAYSI

Query:  EDNQLGVRHVFGTVIQVGIMFYILIRSWTDSKTSFLYLSMSLAGIIKYGETSWALKSA-------------------------------LNANLDITIAD
        EDN L  RH+FG V Q     Y +++S  +     + L + + G IKY E + AL SA                                N   +I + D
Subjt:  EDNQLGVRHVFGTVIQVGIMFYILIRSWTDSKTSFLYLSMSLAGIIKYGETSWALKSA-------------------------------LNANLDITIAD

Query:  -LNKYVEPHLFDRLPQRLPEAKLILRAYYRFCCLKPHIENWLYYPPTDCDRDKLYIDDCTHEDVFRITDSELGFMYDALYTKAPVIYTHTGLILRFISLL
          +K+  P    +  + L + +++   +  F   K  + + L +   + D  + +  +    +  RI ++ELGF+Y+++YTK  +++T  G + R IS  
Subjt:  -LNKYVEPHLFDRLPQRLPEAKLILRAYYRFCCLKPHIENWLYYPPTDCDRDKLYIDDCTHEDVFRITDSELGFMYDALYTKAPVIYTHTGLILRFISLL

Query:  SMIAALWGLSVL-FKDAFVYNISIGLIYFVLIASLIIEIYQILRLPFTDW--AIVQMVR---HHQTFPILERFLQSLDPQSATW----------------
        S++++ +       K    +   + + Y + I  + +++  ++    +DW  A+++ ++     ++  I   F   L+ +   W                
Subjt:  SMIAALWGLSVL-FKDAFVYNISIGLIYFVLIASLIIEIYQILRLPFTDW--AIVQMVR---HHQTFPILERFLQSLDPQSATW----------------

Query:  ----RRWSTKMGQFNLLDFCLQTK----HQKYS------------------KIKILRYW--GMDMKLR-------KQLSLKRIEVH----------PEVK
            RRWS  +  FN + FCL+ K    HQK +                  +I+++  W   ++  +R       K+  + R  V+          PEV 
Subjt:  ----RRWSTKMGQFNLLDFCLQTK----HQKYS------------------KIKILRYW--GMDMKLR-------KQLSLKRIEVH----------PEVK

Query:  ELVVTELGEIEKI----------------KGQEEFD----------------------QRGQWTIDRYQTKLKLNEESKLIKALKTVSERVFDKSIFIWH
         ++   +  I  +                K Q EF                        RG+W +       KL E   L++ ++ V    +D+S+ +WH
Subjt:  ELVVTELGEIEKI----------------KGQEEFD----------------------QRGQWTIDRYQTKLKLNEESKLIKALKTVSERVFDKSIFIWH

Query:  ITTNIFYNIK--GFRDTSTTVGSKIKAIMS-LSDYMMHLMATCPHVLSTTRDIIFDHACVRLGKFTRTECRK--KEHICNDI--LKSQEEGNMLEASKAQ
        I T + +  +  G  +  +  G   +     +SDYMM+L+   P ++S    I      +R  + T+ E  +  K     D+  +K   E  +L ++  +
Subjt:  ITTNIFYNIK--GFRDTSTTVGSKIKAIMS-LSDYMMHLMATCPHVLSTTRDIIFDHACVRLGKFTRTECRK--KEHICNDI--LKSQEEGNMLEASKAQ

Query:  KAVVGN-------WDLLKDVKELVDSLLTLSNDNSWELIGSMWFEMLGYAASKCEMKYHSEHIRQGGELITHVWLLIAH
          +V         +D     KEL +   + + D  W ++  +W E+L YAAS C+   H   + +GGEL+  VWLL+AH
Subjt:  KAVVGN-------WDLLKDVKELVDSLLTLSNDNSWELIGSMWFEMLGYAASKCEMKYHSEHIRQGGELITHVWLLIAH

AT5G45540.1 Protein of unknown function (DUF594)1.3e-4023.54Show/hide
Query:  ITRMVLEHFNLIWNYWGIEFLVLANFIFQVILIFCGSRRRHTPGYKLSLSVWVSYILVARIATVVLGKLT--------------------------TIDT
        +T M+  H   +W+ W I  +++ +   Q ILIF    RR T      + +W +Y+L    A   +G+++                           +  
Subjt:  ITRMVLEHFNLIWNYWGIEFLVLANFIFQVILIFCGSRRRHTPGYKLSLSVWVSYILVARIATVVLGKLT--------------------------TIDT

Query:  GSPDTITAYSIEDNQLGVRHVFGTVIQVGIMFYILIRSWTDSKTSFLYLSMSLAGIIKYGETSWALKSA-------------------------------
        G PDTITA ++EDN+L  RH+F  V Q     Y+++ S   ++     L M + G+IKY E + AL SA                               
Subjt:  GSPDTITAYSIEDNQLGVRHVFGTVIQVGIMFYILIRSWTDSKTSFLYLSMSLAGIIKYGETSWALKSA-------------------------------

Query:  LNANLD-ITIADLNKYVEPHLFDRLPQRLPEAKLILRAYYRFCCLKPHIENWLYYPPTDCDRDKLYIDDCTHEDVFRITDSELGFMYDALYTKAPVIYTH
        +N   D I + D  K  E +   R    L   ++I  AY  F   K  I + L +   + D  + + D  T E+  RI + ELG +YD L+TKA +++  
Subjt:  LNANLD-ITIADLNKYVEPHLFDRLPQRLPEAKLILRAYYRFCCLKPHIENWLYYPPTDCDRDKLYIDDCTHEDVFRITDSELGFMYDALYTKAPVIYTH

Query:  TGLILRFISLLSMIAALWGLSVLFKDAFVYNISIGLIYFVLIASLIIEIYQILRLPFTDWAIVQMVRHHQTF----PILERFLQ-SLDPQSATW------
        TG + RFI+L  ++A+L    +  KD +     + L Y +LI  + ++   +L    +DW I ++ +  +       + +R L   LD ++  W      
Subjt:  TGLILRFISLLSMIAALWGLSVLFKDAFVYNISIGLIYFVLIASLIIEIYQILRLPFTDWAIVQMVRHHQTF----PILERFLQ-SLDPQSATW------

Query:  -------------RRWSTKMGQFNLLDFCL--QTKHQKYSKIKI--------------------------------------------------------
                     RRWS  +  +NL+ FCL  + K   Y+K KI                                                        
Subjt:  -------------RRWSTKMGQFNLLDFCL--QTKHQKYSKIKI--------------------------------------------------------

Query:  -----------LRYWGMDMKLRKQLSLKRIEVHPEVKELVVTELGEIEKIKGQEE-----FDQRGQWTIDRYQTKLKLNEESKLIKALKTVSERVFDKSI
                   L ++G+   + +        +  E+ E + TE+ +  +    +E        RG WT+    +K K  + +   K L+ V+E+ +D+SI
Subjt:  -----------LRYWGMDMKLRKQLSLKRIEVHPEVKELVVTELGEIEKIKGQEE-----FDQRGQWTIDRYQTKLKLNEESKLIKALKTVSERVFDKSI

Query:  FIWHITTNIFY------NIKGFRDTSTT----VGSKIKAIMSLSDYMMHLMATCPHVLSTTRDII---FDHACVRLGK-FTRTECRKKEHICNDILKSQE
         +WHI T + Y       +    + ST       S  +    LSDYMM+L+   P ++S    I    F   C      F R    K  ++  +++K   
Subjt:  FIWHITTNIFY------NIKGFRDTSTT----VGSKIKAIMSLSDYMMHLMATCPHVLSTTRDII---FDHACVRLGK-FTRTECRKKEHICNDILKSQE

Query:  EGNMLEASKAQKAVV---GNWDLLKDVKELVDSLLTLSNDNSWELIGSMWFEMLGYAASKCEMKYHSEHIRQGGELITHVWLLIAH
           +   ++     V    +  +L D   L   L+    +N WE++  +W E+L YA+  C+ + H+  + +GGELI  VWLL+AH
Subjt:  EGNMLEASKAQKAVV---GNWDLLKDVKELVDSLLTLSNDNSWELIGSMWFEMLGYAASKCEMKYHSEHIRQGGELITHVWLLIAH


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTTTCTGACATAACACGTATGGTACTGGAACATTTCAATTTGATATGGAATTACTGGGGAATTGAGTTTCTAGTCTTGGCAAACTTCATATTCCAAGTTATCCTAAT
TTTCTGTGGAAGTCGTAGAAGGCACACACCAGGATATAAGCTTAGTTTATCTGTTTGGGTTTCCTACATACTTGTTGCTAGAATTGCAACTGTTGTTCTAGGCAAGCTCA
CGACAATCGACACAGGAAGTCCAGATACAATAACAGCCTACTCAATTGAAGACAACCAACTAGGAGTGAGGCACGTTTTCGGTACAGTGATCCAAGTGGGAATTATGTTT
TACATTCTTATAAGGTCATGGACAGATTCCAAAACATCATTTCTTTACTTGTCGATGTCTTTGGCTGGAATAATCAAGTATGGTGAGACTTCATGGGCTCTAAAATCAGC
ACTCAATGCCAACTTGGACATCACAATTGCTGACTTAAACAAATATGTTGAACCTCATTTGTTTGACAGATTGCCACAGCGACTTCCAGAGGCAAAATTGATCTTAAGAG
CTTACTACAGATTTTGCTGCCTAAAACCTCATATTGAGAATTGGCTTTACTATCCTCCAACTGATTGTGATCGTGACAAACTATACATTGATGATTGTACGCATGAAGAT
GTGTTCAGAATTACAGATTCTGAATTGGGTTTCATGTATGATGCACTCTACACTAAGGCACCAGTTATATATACTCATACGGGTTTAATTCTTCGTTTCATAAGTTTGCT
TAGTATGATTGCTGCACTATGGGGATTGTCTGTATTGTTCAAGGATGCATTTGTGTATAATATTAGTATTGGCCTCATCTATTTTGTGTTGATAGCATCTTTGATAATTG
AGATTTATCAGATCTTGAGACTACCATTCACAGATTGGGCTATAGTACAAATGGTAAGACACCATCAAACTTTTCCTATCCTTGAGAGGTTTTTGCAGTCTCTAGACCCT
CAGTCGGCAACTTGGAGAAGGTGGTCTACCAAAATGGGGCAGTTCAATCTTTTAGATTTCTGCTTGCAAACCAAGCATCAGAAATACAGCAAAATCAAAATTCTCCGGTA
TTGGGGGATGGATATGAAACTCCGAAAGCAACTGAGTTTGAAACGAATTGAAGTCCATCCAGAAGTGAAAGAACTTGTGGTCACAGAACTAGGAGAGATAGAGAAGATCA
AAGGACAAGAAGAATTTGATCAAAGAGGCCAATGGACAATCGATAGGTACCAAACAAAACTGAAACTCAATGAAGAAAGCAAATTGATCAAAGCACTTAAAACTGTATCA
GAGAGAGTTTTCGATAAGAGCATCTTCATATGGCACATCACAACAAATATTTTCTATAACATCAAAGGTTTTCGTGATACTAGTACTACGGTAGGTAGTAAAATAAAAGC
TATCATGAGTTTATCTGATTATATGATGCACCTTATGGCGACTTGTCCCCATGTGCTATCAACAACTAGAGATATTATATTTGATCATGCATGCGTGAGGCTTGGGAAGT
TCACAAGAACTGAATGTAGAAAGAAAGAACATATATGCAACGATATTTTGAAATCACAAGAAGAAGGTAATATGCTTGAAGCAAGTAAAGCACAGAAAGCAGTTGTAGGG
AATTGGGATCTGCTGAAGGATGTAAAAGAACTTGTAGATAGCTTATTGACTTTGAGTAATGACAATAGTTGGGAGCTAATTGGCAGCATGTGGTTTGAGATGTTGGGATA
TGCTGCAAGTAAGTGTGAAATGAAATATCATTCAGAACACATCAGGCAAGGTGGTGAATTGATCACTCATGTTTGGCTTTTGATAGCCCATAATGTTACCAAATTCAGTC
CACTTCCTCAGGGCAGTCGAGATGAAGAGAATCCAGCTACTTCAAGTTAG
mRNA sequenceShow/hide mRNA sequence
ATGTTTTCTGACATAACACGTATGGTACTGGAACATTTCAATTTGATATGGAATTACTGGGGAATTGAGTTTCTAGTCTTGGCAAACTTCATATTCCAAGTTATCCTAAT
TTTCTGTGGAAGTCGTAGAAGGCACACACCAGGATATAAGCTTAGTTTATCTGTTTGGGTTTCCTACATACTTGTTGCTAGAATTGCAACTGTTGTTCTAGGCAAGCTCA
CGACAATCGACACAGGAAGTCCAGATACAATAACAGCCTACTCAATTGAAGACAACCAACTAGGAGTGAGGCACGTTTTCGGTACAGTGATCCAAGTGGGAATTATGTTT
TACATTCTTATAAGGTCATGGACAGATTCCAAAACATCATTTCTTTACTTGTCGATGTCTTTGGCTGGAATAATCAAGTATGGTGAGACTTCATGGGCTCTAAAATCAGC
ACTCAATGCCAACTTGGACATCACAATTGCTGACTTAAACAAATATGTTGAACCTCATTTGTTTGACAGATTGCCACAGCGACTTCCAGAGGCAAAATTGATCTTAAGAG
CTTACTACAGATTTTGCTGCCTAAAACCTCATATTGAGAATTGGCTTTACTATCCTCCAACTGATTGTGATCGTGACAAACTATACATTGATGATTGTACGCATGAAGAT
GTGTTCAGAATTACAGATTCTGAATTGGGTTTCATGTATGATGCACTCTACACTAAGGCACCAGTTATATATACTCATACGGGTTTAATTCTTCGTTTCATAAGTTTGCT
TAGTATGATTGCTGCACTATGGGGATTGTCTGTATTGTTCAAGGATGCATTTGTGTATAATATTAGTATTGGCCTCATCTATTTTGTGTTGATAGCATCTTTGATAATTG
AGATTTATCAGATCTTGAGACTACCATTCACAGATTGGGCTATAGTACAAATGGTAAGACACCATCAAACTTTTCCTATCCTTGAGAGGTTTTTGCAGTCTCTAGACCCT
CAGTCGGCAACTTGGAGAAGGTGGTCTACCAAAATGGGGCAGTTCAATCTTTTAGATTTCTGCTTGCAAACCAAGCATCAGAAATACAGCAAAATCAAAATTCTCCGGTA
TTGGGGGATGGATATGAAACTCCGAAAGCAACTGAGTTTGAAACGAATTGAAGTCCATCCAGAAGTGAAAGAACTTGTGGTCACAGAACTAGGAGAGATAGAGAAGATCA
AAGGACAAGAAGAATTTGATCAAAGAGGCCAATGGACAATCGATAGGTACCAAACAAAACTGAAACTCAATGAAGAAAGCAAATTGATCAAAGCACTTAAAACTGTATCA
GAGAGAGTTTTCGATAAGAGCATCTTCATATGGCACATCACAACAAATATTTTCTATAACATCAAAGGTTTTCGTGATACTAGTACTACGGTAGGTAGTAAAATAAAAGC
TATCATGAGTTTATCTGATTATATGATGCACCTTATGGCGACTTGTCCCCATGTGCTATCAACAACTAGAGATATTATATTTGATCATGCATGCGTGAGGCTTGGGAAGT
TCACAAGAACTGAATGTAGAAAGAAAGAACATATATGCAACGATATTTTGAAATCACAAGAAGAAGGTAATATGCTTGAAGCAAGTAAAGCACAGAAAGCAGTTGTAGGG
AATTGGGATCTGCTGAAGGATGTAAAAGAACTTGTAGATAGCTTATTGACTTTGAGTAATGACAATAGTTGGGAGCTAATTGGCAGCATGTGGTTTGAGATGTTGGGATA
TGCTGCAAGTAAGTGTGAAATGAAATATCATTCAGAACACATCAGGCAAGGTGGTGAATTGATCACTCATGTTTGGCTTTTGATAGCCCATAATGTTACCAAATTCAGTC
CACTTCCTCAGGGCAGTCGAGATGAAGAGAATCCAGCTACTTCAAGTTAG
Protein sequenceShow/hide protein sequence
MFSDITRMVLEHFNLIWNYWGIEFLVLANFIFQVILIFCGSRRRHTPGYKLSLSVWVSYILVARIATVVLGKLTTIDTGSPDTITAYSIEDNQLGVRHVFGTVIQVGIMF
YILIRSWTDSKTSFLYLSMSLAGIIKYGETSWALKSALNANLDITIADLNKYVEPHLFDRLPQRLPEAKLILRAYYRFCCLKPHIENWLYYPPTDCDRDKLYIDDCTHED
VFRITDSELGFMYDALYTKAPVIYTHTGLILRFISLLSMIAALWGLSVLFKDAFVYNISIGLIYFVLIASLIIEIYQILRLPFTDWAIVQMVRHHQTFPILERFLQSLDP
QSATWRRWSTKMGQFNLLDFCLQTKHQKYSKIKILRYWGMDMKLRKQLSLKRIEVHPEVKELVVTELGEIEKIKGQEEFDQRGQWTIDRYQTKLKLNEESKLIKALKTVS
ERVFDKSIFIWHITTNIFYNIKGFRDTSTTVGSKIKAIMSLSDYMMHLMATCPHVLSTTRDIIFDHACVRLGKFTRTECRKKEHICNDILKSQEEGNMLEASKAQKAVVG
NWDLLKDVKELVDSLLTLSNDNSWELIGSMWFEMLGYAASKCEMKYHSEHIRQGGELITHVWLLIAHNVTKFSPLPQGSRDEENPATSS