; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg033842 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg033842
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionUPF0481 protein At3g47200-like
Genome locationscaffold13:36625165..36628003
RNA-Seq ExpressionSpg033842
SyntenySpg033842
Gene Ontology termsNA
InterPro domainsIPR004158 - Protein of unknown function DUF247, plant


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022131634.1 UPF0481 protein At3g47200-like [Momordica charantia]1.4e-7947.35Show/hide
Query:  QKHLEAMEGHKLMGLKIYLHRINMRVEVAIRIAQNWVGKARGYYAEPINMYDDEFVKMMLVDGCFLLEFMILAYHNFSPPDQILTLDCSFYGAMRNHLYY
        Q +L   + HKL  L  YLHR+ M VE  ++I QNW  +AR  Y EPI M +D+FV M+L+DGCF++ F+IL Y+N+   +     D SFY AM + +Y 
Subjt:  QKHLEAMEGHKLMGLKIYLHRINMRVEVAIRIAQNWVGKARGYYAEPINMYDDEFVKMMLVDGCFLLEFMILAYHNFSPPDQILTLDCSFYGAMRNHLYY

Query:  DIMMLENQLPLFVLEGLFNQLGAYQDTGMERLSFEVLIQIFLEYGCIKSYDRLSQIVINRNANHLVHLLRDYYSFSIVKDRTKNNDSSEQRGFLLPPTLT
        D+ MLENQLP FVL+GL++ +   +D  ++  S   LI+ F  +    +   +   V   N  HLV LL  Y  F    D  + +D  E   +L+ P +T
Subjt:  DIMMLENQLPLFVLEGLFNQLGAYQDTGMERLSFEVLIQIFLEYGCIKSYDRLSQIVINRNANHLVHLLRDYYSFSIVKDRTKNNDSSEQRGFLLPPTLT

Query:  ELWEAGVTIKKASEEIHFMDISFKDGVLEIPHFEIDDQFETRVRNLMAFEQYQYLRTDQTYVTNYMSFLDGLISTEKDASLLVKAEILTNNIGGSNEEVS
        EL EAGVTIKK  E    MDISFK+GVLEIP  +IDD FET VRNLMAFE Y      + Y   Y  FLD +ISTEKD  LLV+A I+ N+IGGS++EVS
Subjt:  ELWEAGVTIKKASEEIHFMDISFKDGVLEIPHFEIDDQFETRVRNLMAFEQYQYLRTDQTYVTNYMSFLDGLISTEKDASLLVKAEILTNNIGGSNEEVS

Query:  KLFNDLCKGITLPGHDYYYYNMTKALSKYCKTMKHRWMASLRRDYFNTPWAAISFVAATLFILLTFIQTIYSHLSYSK
        +LFNDL K +++PG  +Y  ++TK L  +CK    R  A+L+RDYFN+PWA IS VAAT  I+LT +QTI++ +S  K
Subjt:  KLFNDLCKGITLPGHDYYYYNMTKALSKYCKTMKHRWMASLRRDYFNTPWAAISFVAATLFILLTFIQTIYSHLSYSK

XP_022132066.1 UPF0481 protein At3g47200-like [Momordica charantia]9.2e-8448.45Show/hide
Query:  FTMGQKHLEAMEGHKLMGLKIYLHRINMRVEVAIRIAQNWVGKARGYYAEPINMYDDEFVKMMLVDGCFLLEFMILAYH--NFSPPDQILTLDCSFYGAM
        F  GQK   AME  KL  L  YL R+ M +E A  IAQ W  +AR  YAE I+M  D FVKMMLVDG FL+EF+ + Y     + P+    L+ + + A+
Subjt:  FTMGQKHLEAMEGHKLMGLKIYLHRINMRVEVAIRIAQNWVGKARGYYAEPINMYDDEFVKMMLVDGCFLLEFMILAYH--NFSPPDQILTLDCSFYGAM

Query:  RNHLYYDIMMLENQLPLFVLEGLFNQLGAYQDTGMERLSFEVLIQIFLEYGCIKSYDRLSQIVINRNANHLVHLLRDYYSFSIVKDRTKNNDSSEQRGFL
           +Y D+++LENQLP F+LE L ++  +          F +    F  +    + + +S  ++ +  NHLV  L  YY+   V   T  ND  +     
Subjt:  RNHLYYDIMMLENQLPLFVLEGLFNQLGAYQDTGMERLSFEVLIQIFLEYGCIKSYDRLSQIVINRNANHLVHLLRDYYSFSIVKDRTKNNDSSEQRGFL

Query:  LPPTLTELWEAGVTIKKASEEIH-FMDISFKDGVLEIPHFEIDDQFETRVRNLMAFEQYQYLRTDQTYVTNYMSFLDGLISTEKDASLLVKAEILTNNIG
         PPT TELWEAGV  +KA+E+    MDI FKDGVL IPH EI D FET VRNL+A+E Y ++  D+  +  Y+ FLD LISTE+D SLLVKA I+TNNIG
Subjt:  LPPTLTELWEAGVTIKKASEEIH-FMDISFKDGVLEIPHFEIDDQFETRVRNLMAFEQYQYLRTDQTYVTNYMSFLDGLISTEKDASLLVKAEILTNNIG

Query:  GSNEEVSKLFNDLCKGITLPGHDYYYYNMTKALSKYCKTMKHRWMASLRRDYFNTPWAAISFVAATLFILLTFIQTIYSHLSYSKHPP
        G+NE+VSKLFNDLCK I +    YYY +++  L KYC+T  HR MASLRRDYFNTPWA ISF+AAT  +LLT +Q IYS +SY K  P
Subjt:  GSNEEVSKLFNDLCKGITLPGHDYYYYNMTKALSKYCKTMKHRWMASLRRDYFNTPWAAISFVAATLFILLTFIQTIYSHLSYSKHPP

XP_022158989.1 UPF0481 protein At3g47200-like isoform X1 [Momordica charantia]1.0e-8248.58Show/hide
Query:  FTMGQKHLEAMEGHKLMGLKIYLHRINMRVEVAIRIAQNWVGKARGYYAEPINMYDDEFVKMMLVDGCFLLEFMILAYHNFSPPDQILTLDCSFYGAMRN
        F  G++ L  ME HKL  L  YL R N  +EV + I ++W   AR  YAEPINM  DEFVKMMLVDGCF++E M++     S  +     D   + AM  
Subjt:  FTMGQKHLEAMEGHKLMGLKIYLHRINMRVEVAIRIAQNWVGKARGYYAEPINMYDDEFVKMMLVDGCFLLEFMILAYHNFSPPDQILTLDCSFYGAMRN

Query:  HLYYDIMMLENQLPLFVLEGLFNQLGAYQDTGMERLSFEVLIQIFLEYGCIKSYDRL----SQIVINRNANHLVHLLRDYYSFSIVK-DRTKNNDSSEQR
         LY D++MLENQLP FVL+GLF+Q     + G   LSF  L  +F   G +     L      ++     NHLV  L  YY+ +      T ++ +  ++
Subjt:  HLYYDIMMLENQLPLFVLEGLFNQLGAYQDTGMERLSFEVLIQIFLEYGCIKSYDRL----SQIVINRNANHLVHLLRDYYSFSIVK-DRTKNNDSSEQR

Query:  GFLLPPTLTELWEAGVTIKKASEEIHFMDISFKDGVLEIPHFEIDDQFETRVRNLMAFEQYQYLRTDQTYVTNYMSFLDGLISTEKDASLLVKAEILTNN
            PPT+TELWEAG+  KKA    H MDISFKD VL+IP  EI D FET VRNLMAFEQY     D  Y   Y  FL+GLIS E+D SLLVKA I+TN 
Subjt:  GFLLPPTLTELWEAGVTIKKASEEIHFMDISFKDGVLEIPHFEIDDQFETRVRNLMAFEQYQYLRTDQTYVTNYMSFLDGLISTEKDASLLVKAEILTNN

Query:  IGGSNEEVSKLFNDLCKGITLPGHDYYYYNMTKALSKYCKTMKHRWMASLRRDYFNTPWAAISFVAATLFILLTFIQTIYSHLSYSK
        IGG+N+EVS LFNDLCK + + G    + ++ +AL ++C    ++ MASLRRDYFNTPWA ISFVAA   ILLTF+QT++S +S SK
Subjt:  IGGSNEEVSKLFNDLCKGITLPGHDYYYYNMTKALSKYCKTMKHRWMASLRRDYFNTPWAAISFVAATLFILLTFIQTIYSHLSYSK

XP_022158990.1 UPF0481 protein At3g47200-like isoform X2 [Momordica charantia]1.0e-8248.58Show/hide
Query:  FTMGQKHLEAMEGHKLMGLKIYLHRINMRVEVAIRIAQNWVGKARGYYAEPINMYDDEFVKMMLVDGCFLLEFMILAYHNFSPPDQILTLDCSFYGAMRN
        F  G++ L  ME HKL  L  YL R N  +EV + I ++W   AR  YAEPINM  DEFVKMMLVDGCF++E M++     S  +     D   + AM  
Subjt:  FTMGQKHLEAMEGHKLMGLKIYLHRINMRVEVAIRIAQNWVGKARGYYAEPINMYDDEFVKMMLVDGCFLLEFMILAYHNFSPPDQILTLDCSFYGAMRN

Query:  HLYYDIMMLENQLPLFVLEGLFNQLGAYQDTGMERLSFEVLIQIFLEYGCIKSYDRL----SQIVINRNANHLVHLLRDYYSFSIVK-DRTKNNDSSEQR
         LY D++MLENQLP FVL+GLF+Q     + G   LSF  L  +F   G +     L      ++     NHLV  L  YY+ +      T ++ +  ++
Subjt:  HLYYDIMMLENQLPLFVLEGLFNQLGAYQDTGMERLSFEVLIQIFLEYGCIKSYDRL----SQIVINRNANHLVHLLRDYYSFSIVK-DRTKNNDSSEQR

Query:  GFLLPPTLTELWEAGVTIKKASEEIHFMDISFKDGVLEIPHFEIDDQFETRVRNLMAFEQYQYLRTDQTYVTNYMSFLDGLISTEKDASLLVKAEILTNN
            PPT+TELWEAG+  KKA    H MDISFKD VL+IP  EI D FET VRNLMAFEQY     D  Y   Y  FL+GLIS E+D SLLVKA I+TN 
Subjt:  GFLLPPTLTELWEAGVTIKKASEEIHFMDISFKDGVLEIPHFEIDDQFETRVRNLMAFEQYQYLRTDQTYVTNYMSFLDGLISTEKDASLLVKAEILTNN

Query:  IGGSNEEVSKLFNDLCKGITLPGHDYYYYNMTKALSKYCKTMKHRWMASLRRDYFNTPWAAISFVAATLFILLTFIQTIYSHLSYSK
        IGG+N+EVS LFNDLCK + + G    + ++ +AL ++C    ++ MASLRRDYFNTPWA ISFVAA   ILLTF+QT++S +S SK
Subjt:  IGGSNEEVSKLFNDLCKGITLPGHDYYYYNMTKALSKYCKTMKHRWMASLRRDYFNTPWAAISFVAATLFILLTFIQTIYSHLSYSK

XP_022158992.1 UPF0481 protein At3g47200-like isoform X3 [Momordica charantia]1.0e-8248.58Show/hide
Query:  FTMGQKHLEAMEGHKLMGLKIYLHRINMRVEVAIRIAQNWVGKARGYYAEPINMYDDEFVKMMLVDGCFLLEFMILAYHNFSPPDQILTLDCSFYGAMRN
        F  G++ L  ME HKL  L  YL R N  +EV + I ++W   AR  YAEPINM  DEFVKMMLVDGCF++E M++     S  +     D   + AM  
Subjt:  FTMGQKHLEAMEGHKLMGLKIYLHRINMRVEVAIRIAQNWVGKARGYYAEPINMYDDEFVKMMLVDGCFLLEFMILAYHNFSPPDQILTLDCSFYGAMRN

Query:  HLYYDIMMLENQLPLFVLEGLFNQLGAYQDTGMERLSFEVLIQIFLEYGCIKSYDRL----SQIVINRNANHLVHLLRDYYSFSIVK-DRTKNNDSSEQR
         LY D++MLENQLP FVL+GLF+Q     + G   LSF  L  +F   G +     L      ++     NHLV  L  YY+ +      T ++ +  ++
Subjt:  HLYYDIMMLENQLPLFVLEGLFNQLGAYQDTGMERLSFEVLIQIFLEYGCIKSYDRL----SQIVINRNANHLVHLLRDYYSFSIVK-DRTKNNDSSEQR

Query:  GFLLPPTLTELWEAGVTIKKASEEIHFMDISFKDGVLEIPHFEIDDQFETRVRNLMAFEQYQYLRTDQTYVTNYMSFLDGLISTEKDASLLVKAEILTNN
            PPT+TELWEAG+  KKA    H MDISFKD VL+IP  EI D FET VRNLMAFEQY     D  Y   Y  FL+GLIS E+D SLLVKA I+TN 
Subjt:  GFLLPPTLTELWEAGVTIKKASEEIHFMDISFKDGVLEIPHFEIDDQFETRVRNLMAFEQYQYLRTDQTYVTNYMSFLDGLISTEKDASLLVKAEILTNN

Query:  IGGSNEEVSKLFNDLCKGITLPGHDYYYYNMTKALSKYCKTMKHRWMASLRRDYFNTPWAAISFVAATLFILLTFIQTIYSHLSYSK
        IGG+N+EVS LFNDLCK + + G    + ++ +AL ++C    ++ MASLRRDYFNTPWA ISFVAA   ILLTF+QT++S +S SK
Subjt:  IGGSNEEVSKLFNDLCKGITLPGHDYYYYNMTKALSKYCKTMKHRWMASLRRDYFNTPWAAISFVAATLFILLTFIQTIYSHLSYSK

TrEMBL top hitse value%identityAlignment
A0A6J1BQT6 UPF0481 protein At3g47200-like6.7e-8047.35Show/hide
Query:  QKHLEAMEGHKLMGLKIYLHRINMRVEVAIRIAQNWVGKARGYYAEPINMYDDEFVKMMLVDGCFLLEFMILAYHNFSPPDQILTLDCSFYGAMRNHLYY
        Q +L   + HKL  L  YLHR+ M VE  ++I QNW  +AR  Y EPI M +D+FV M+L+DGCF++ F+IL Y+N+   +     D SFY AM + +Y 
Subjt:  QKHLEAMEGHKLMGLKIYLHRINMRVEVAIRIAQNWVGKARGYYAEPINMYDDEFVKMMLVDGCFLLEFMILAYHNFSPPDQILTLDCSFYGAMRNHLYY

Query:  DIMMLENQLPLFVLEGLFNQLGAYQDTGMERLSFEVLIQIFLEYGCIKSYDRLSQIVINRNANHLVHLLRDYYSFSIVKDRTKNNDSSEQRGFLLPPTLT
        D+ MLENQLP FVL+GL++ +   +D  ++  S   LI+ F  +    +   +   V   N  HLV LL  Y  F    D  + +D  E   +L+ P +T
Subjt:  DIMMLENQLPLFVLEGLFNQLGAYQDTGMERLSFEVLIQIFLEYGCIKSYDRLSQIVINRNANHLVHLLRDYYSFSIVKDRTKNNDSSEQRGFLLPPTLT

Query:  ELWEAGVTIKKASEEIHFMDISFKDGVLEIPHFEIDDQFETRVRNLMAFEQYQYLRTDQTYVTNYMSFLDGLISTEKDASLLVKAEILTNNIGGSNEEVS
        EL EAGVTIKK  E    MDISFK+GVLEIP  +IDD FET VRNLMAFE Y      + Y   Y  FLD +ISTEKD  LLV+A I+ N+IGGS++EVS
Subjt:  ELWEAGVTIKKASEEIHFMDISFKDGVLEIPHFEIDDQFETRVRNLMAFEQYQYLRTDQTYVTNYMSFLDGLISTEKDASLLVKAEILTNNIGGSNEEVS

Query:  KLFNDLCKGITLPGHDYYYYNMTKALSKYCKTMKHRWMASLRRDYFNTPWAAISFVAATLFILLTFIQTIYSHLSYSK
        +LFNDL K +++PG  +Y  ++TK L  +CK    R  A+L+RDYFN+PWA IS VAAT  I+LT +QTI++ +S  K
Subjt:  KLFNDLCKGITLPGHDYYYYNMTKALSKYCKTMKHRWMASLRRDYFNTPWAAISFVAATLFILLTFIQTIYSHLSYSK

A0A6J1BR71 UPF0481 protein At3g47200-like4.5e-8448.45Show/hide
Query:  FTMGQKHLEAMEGHKLMGLKIYLHRINMRVEVAIRIAQNWVGKARGYYAEPINMYDDEFVKMMLVDGCFLLEFMILAYH--NFSPPDQILTLDCSFYGAM
        F  GQK   AME  KL  L  YL R+ M +E A  IAQ W  +AR  YAE I+M  D FVKMMLVDG FL+EF+ + Y     + P+    L+ + + A+
Subjt:  FTMGQKHLEAMEGHKLMGLKIYLHRINMRVEVAIRIAQNWVGKARGYYAEPINMYDDEFVKMMLVDGCFLLEFMILAYH--NFSPPDQILTLDCSFYGAM

Query:  RNHLYYDIMMLENQLPLFVLEGLFNQLGAYQDTGMERLSFEVLIQIFLEYGCIKSYDRLSQIVINRNANHLVHLLRDYYSFSIVKDRTKNNDSSEQRGFL
           +Y D+++LENQLP F+LE L ++  +          F +    F  +    + + +S  ++ +  NHLV  L  YY+   V   T  ND  +     
Subjt:  RNHLYYDIMMLENQLPLFVLEGLFNQLGAYQDTGMERLSFEVLIQIFLEYGCIKSYDRLSQIVINRNANHLVHLLRDYYSFSIVKDRTKNNDSSEQRGFL

Query:  LPPTLTELWEAGVTIKKASEEIH-FMDISFKDGVLEIPHFEIDDQFETRVRNLMAFEQYQYLRTDQTYVTNYMSFLDGLISTEKDASLLVKAEILTNNIG
         PPT TELWEAGV  +KA+E+    MDI FKDGVL IPH EI D FET VRNL+A+E Y ++  D+  +  Y+ FLD LISTE+D SLLVKA I+TNNIG
Subjt:  LPPTLTELWEAGVTIKKASEEIH-FMDISFKDGVLEIPHFEIDDQFETRVRNLMAFEQYQYLRTDQTYVTNYMSFLDGLISTEKDASLLVKAEILTNNIG

Query:  GSNEEVSKLFNDLCKGITLPGHDYYYYNMTKALSKYCKTMKHRWMASLRRDYFNTPWAAISFVAATLFILLTFIQTIYSHLSYSKHPP
        G+NE+VSKLFNDLCK I +    YYY +++  L KYC+T  HR MASLRRDYFNTPWA ISF+AAT  +LLT +Q IYS +SY K  P
Subjt:  GSNEEVSKLFNDLCKGITLPGHDYYYYNMTKALSKYCKTMKHRWMASLRRDYFNTPWAAISFVAATLFILLTFIQTIYSHLSYSKHPP

A0A6J1DXD6 UPF0481 protein At3g47200-like isoform X24.9e-8348.58Show/hide
Query:  FTMGQKHLEAMEGHKLMGLKIYLHRINMRVEVAIRIAQNWVGKARGYYAEPINMYDDEFVKMMLVDGCFLLEFMILAYHNFSPPDQILTLDCSFYGAMRN
        F  G++ L  ME HKL  L  YL R N  +EV + I ++W   AR  YAEPINM  DEFVKMMLVDGCF++E M++     S  +     D   + AM  
Subjt:  FTMGQKHLEAMEGHKLMGLKIYLHRINMRVEVAIRIAQNWVGKARGYYAEPINMYDDEFVKMMLVDGCFLLEFMILAYHNFSPPDQILTLDCSFYGAMRN

Query:  HLYYDIMMLENQLPLFVLEGLFNQLGAYQDTGMERLSFEVLIQIFLEYGCIKSYDRL----SQIVINRNANHLVHLLRDYYSFSIVK-DRTKNNDSSEQR
         LY D++MLENQLP FVL+GLF+Q     + G   LSF  L  +F   G +     L      ++     NHLV  L  YY+ +      T ++ +  ++
Subjt:  HLYYDIMMLENQLPLFVLEGLFNQLGAYQDTGMERLSFEVLIQIFLEYGCIKSYDRL----SQIVINRNANHLVHLLRDYYSFSIVK-DRTKNNDSSEQR

Query:  GFLLPPTLTELWEAGVTIKKASEEIHFMDISFKDGVLEIPHFEIDDQFETRVRNLMAFEQYQYLRTDQTYVTNYMSFLDGLISTEKDASLLVKAEILTNN
            PPT+TELWEAG+  KKA    H MDISFKD VL+IP  EI D FET VRNLMAFEQY     D  Y   Y  FL+GLIS E+D SLLVKA I+TN 
Subjt:  GFLLPPTLTELWEAGVTIKKASEEIHFMDISFKDGVLEIPHFEIDDQFETRVRNLMAFEQYQYLRTDQTYVTNYMSFLDGLISTEKDASLLVKAEILTNN

Query:  IGGSNEEVSKLFNDLCKGITLPGHDYYYYNMTKALSKYCKTMKHRWMASLRRDYFNTPWAAISFVAATLFILLTFIQTIYSHLSYSK
        IGG+N+EVS LFNDLCK + + G    + ++ +AL ++C    ++ MASLRRDYFNTPWA ISFVAA   ILLTF+QT++S +S SK
Subjt:  IGGSNEEVSKLFNDLCKGITLPGHDYYYYNMTKALSKYCKTMKHRWMASLRRDYFNTPWAAISFVAATLFILLTFIQTIYSHLSYSK

A0A6J1DYL4 UPF0481 protein At3g47200-like isoform X34.9e-8348.58Show/hide
Query:  FTMGQKHLEAMEGHKLMGLKIYLHRINMRVEVAIRIAQNWVGKARGYYAEPINMYDDEFVKMMLVDGCFLLEFMILAYHNFSPPDQILTLDCSFYGAMRN
        F  G++ L  ME HKL  L  YL R N  +EV + I ++W   AR  YAEPINM  DEFVKMMLVDGCF++E M++     S  +     D   + AM  
Subjt:  FTMGQKHLEAMEGHKLMGLKIYLHRINMRVEVAIRIAQNWVGKARGYYAEPINMYDDEFVKMMLVDGCFLLEFMILAYHNFSPPDQILTLDCSFYGAMRN

Query:  HLYYDIMMLENQLPLFVLEGLFNQLGAYQDTGMERLSFEVLIQIFLEYGCIKSYDRL----SQIVINRNANHLVHLLRDYYSFSIVK-DRTKNNDSSEQR
         LY D++MLENQLP FVL+GLF+Q     + G   LSF  L  +F   G +     L      ++     NHLV  L  YY+ +      T ++ +  ++
Subjt:  HLYYDIMMLENQLPLFVLEGLFNQLGAYQDTGMERLSFEVLIQIFLEYGCIKSYDRL----SQIVINRNANHLVHLLRDYYSFSIVK-DRTKNNDSSEQR

Query:  GFLLPPTLTELWEAGVTIKKASEEIHFMDISFKDGVLEIPHFEIDDQFETRVRNLMAFEQYQYLRTDQTYVTNYMSFLDGLISTEKDASLLVKAEILTNN
            PPT+TELWEAG+  KKA    H MDISFKD VL+IP  EI D FET VRNLMAFEQY     D  Y   Y  FL+GLIS E+D SLLVKA I+TN 
Subjt:  GFLLPPTLTELWEAGVTIKKASEEIHFMDISFKDGVLEIPHFEIDDQFETRVRNLMAFEQYQYLRTDQTYVTNYMSFLDGLISTEKDASLLVKAEILTNN

Query:  IGGSNEEVSKLFNDLCKGITLPGHDYYYYNMTKALSKYCKTMKHRWMASLRRDYFNTPWAAISFVAATLFILLTFIQTIYSHLSYSK
        IGG+N+EVS LFNDLCK + + G    + ++ +AL ++C    ++ MASLRRDYFNTPWA ISFVAA   ILLTF+QT++S +S SK
Subjt:  IGGSNEEVSKLFNDLCKGITLPGHDYYYYNMTKALSKYCKTMKHRWMASLRRDYFNTPWAAISFVAATLFILLTFIQTIYSHLSYSK

A0A6J1E120 UPF0481 protein At3g47200-like isoform X14.9e-8348.58Show/hide
Query:  FTMGQKHLEAMEGHKLMGLKIYLHRINMRVEVAIRIAQNWVGKARGYYAEPINMYDDEFVKMMLVDGCFLLEFMILAYHNFSPPDQILTLDCSFYGAMRN
        F  G++ L  ME HKL  L  YL R N  +EV + I ++W   AR  YAEPINM  DEFVKMMLVDGCF++E M++     S  +     D   + AM  
Subjt:  FTMGQKHLEAMEGHKLMGLKIYLHRINMRVEVAIRIAQNWVGKARGYYAEPINMYDDEFVKMMLVDGCFLLEFMILAYHNFSPPDQILTLDCSFYGAMRN

Query:  HLYYDIMMLENQLPLFVLEGLFNQLGAYQDTGMERLSFEVLIQIFLEYGCIKSYDRL----SQIVINRNANHLVHLLRDYYSFSIVK-DRTKNNDSSEQR
         LY D++MLENQLP FVL+GLF+Q     + G   LSF  L  +F   G +     L      ++     NHLV  L  YY+ +      T ++ +  ++
Subjt:  HLYYDIMMLENQLPLFVLEGLFNQLGAYQDTGMERLSFEVLIQIFLEYGCIKSYDRL----SQIVINRNANHLVHLLRDYYSFSIVK-DRTKNNDSSEQR

Query:  GFLLPPTLTELWEAGVTIKKASEEIHFMDISFKDGVLEIPHFEIDDQFETRVRNLMAFEQYQYLRTDQTYVTNYMSFLDGLISTEKDASLLVKAEILTNN
            PPT+TELWEAG+  KKA    H MDISFKD VL+IP  EI D FET VRNLMAFEQY     D  Y   Y  FL+GLIS E+D SLLVKA I+TN 
Subjt:  GFLLPPTLTELWEAGVTIKKASEEIHFMDISFKDGVLEIPHFEIDDQFETRVRNLMAFEQYQYLRTDQTYVTNYMSFLDGLISTEKDASLLVKAEILTNN

Query:  IGGSNEEVSKLFNDLCKGITLPGHDYYYYNMTKALSKYCKTMKHRWMASLRRDYFNTPWAAISFVAATLFILLTFIQTIYSHLSYSK
        IGG+N+EVS LFNDLCK + + G    + ++ +AL ++C    ++ MASLRRDYFNTPWA ISFVAA   ILLTF+QT++S +S SK
Subjt:  IGGSNEEVSKLFNDLCKGITLPGHDYYYYNMTKALSKYCKTMKHRWMASLRRDYFNTPWAAISFVAATLFILLTFIQTIYSHLSYSK

SwissProt top hitse value%identityAlignment
Q9SD53 UPF0481 protein At3g472006.3e-2725.5Show/hide
Query:  GQKHLEAMEGHKLMGLKIYL---HRINMRVEVAIRIAQNWVGKARGYYAEPINMYDDEFVKMMLVDGCF-LLEFMILAYHNFSPPDQILTLDCSFYGAMR
        G+KHL+ ++ HK   L+++L    + ++   V ++   +   K R  Y+E +    D  + MM++DGCF L+ F+I++ +     D I ++       + 
Subjt:  GQKHLEAMEGHKLMGLKIYL---HRINMRVEVAIRIAQNWVGKARGYYAEPINMYDDEFVKMMLVDGCF-LLEFMILAYHNFSPPDQILTLDCSFYGAMR

Query:  NHLYYDIMMLENQLPLFVLEGLF--NQLGAYQDTGMERLSFEVLIQIFLEYGCIKSYDRLSQIVINRNANHLVHLLRD---------------YYSFSIV
        + +  D+++LENQ+P FVL+ L+  +++G   D  + R++F        + G      R      N  A HL+ L+R+               +    + 
Subjt:  NHLYYDIMMLENQLPLFVLEGLF--NQLGAYQDTGMERLSFEVLIQIFLEYGCIKSYDRLSQIVINRNANHLVHLLRD---------------YYSFSIV

Query:  KDRTKNNDSSEQRGFLLPPTLTELWEAGVTIK-KASEEIHFMDISFKDGVLEIPHFEIDDQFETRVRNLMAFEQYQYLRTDQT-YVTNYMSFLDGLISTE
        + ++ N  S + +   L  +   L   G+  + + S+E   +++  K   L+IP    D    +   N +AFEQ+    TD +  +T Y+ F+  L++ E
Subjt:  KDRTKNNDSSEQRGFLLPPTLTELWEAGVTIK-KASEEIHFMDISFKDGVLEIPHFEIDDQFETRVRNLMAFEQYQYLRTDQT-YVTNYMSFLDGLISTE

Query:  KDASLLVKAEILTNNIGGSNEEVSKLFNDLCKGITLPGHDYYYYNMTKALSKYCKTMKHRWMASLRRDYFNTPWAAISFVAATLFILLTFIQTIYSHLSY
        +D + L   +++  N  GSN EVS+ F  + K +       Y  N+ K +++Y K   +   A  R  +F +PW  +S  A    ILLT +Q+  + LSY
Subjt:  KDASLLVKAEILTNNIGGSNEEVSKLFNDLCKGITLPGHDYYYYNMTKALSKYCKTMKHRWMASLRRDYFNTPWAAISFVAATLFILLTFIQTIYSHLSY

Arabidopsis top hitse value%identityAlignment
AT3G50120.1 Plant protein of unknown function (DUF247)8.4e-4329.64Show/hide
Query:  GQKHLEAMEGHKLMGLKIYLHRINMRVEVAIRIAQNWVGKARGYYAEPINMYDDEFVKMMLVDGCFLLEFMILAYHNFSP-----PDQILTLDCSFYGAM
        G+K L +M+ HK   +   L R N  +++ I   +    KAR  Y  P+++  +EF++M+++DGCF+LE    A   F+       D +  +  S +   
Subjt:  GQKHLEAMEGHKLMGLKIYLHRINMRVEVAIRIAQNWVGKARGYYAEPINMYDDEFVKMMLVDGCFLLEFMILAYHNFSP-----PDQILTLDCSFYGAM

Query:  RNHLYYDIMMLENQLPLFVLEGLFN-QLGAYQDTGM----------------------------ERLSFEVLIQIFLEYGCIKSYDRLSQIVINRNANHL
        R     D++MLENQLPLFVL  L   QLG    TG+                              L+ +     F + G +   D   + ++  +    
Subjt:  RNHLYYDIMMLENQLPLFVLEGLFN-QLGAYQDTGM----------------------------ERLSFEVLIQIFLEYGCIKSYDRLSQIVINRNANHL

Query:  VHLLRDYYSFSIVKDRTKNNDSSEQRGFLLPPTLTELWEAGVTIKKASEEIHFMDISFKDGVLEIPHFEIDDQFETRVRNLMAFEQYQYLRTDQTYVTNY
          L R  +S        +N   +++R   L   +TEL EAG+  ++   +  F D+ FK+G LEIP   I D  ++   NL+AFEQ     ++   +T+Y
Subjt:  VHLLRDYYSFSIVKDRTKNNDSSEQRGFLLPPTLTELWEAGVTIKKASEEIHFMDISFKDGVLEIPHFEIDDQFETRVRNLMAFEQYQYLRTDQTYVTNY

Query:  MSFLDGLISTEKDASLLVKAEILTNNIGGSNEEVSKLFNDLCKGITLPGHDYYYYNMTKALSKYCKTMKHRWMASLRRDYFNTPWAAISFVAATLFILLT
        + F+D LI + +D S L    I+ + + GS+ EV+ LFN LC+ +     D Y   ++  +++Y     + W A+L+  YFN PWA +SF AA + ++LT
Subjt:  MSFLDGLISTEKDASLLVKAEILTNNIGGSNEEVSKLFNDLCKGITLPGHDYYYYNMTKALSKYCKTMKHRWMASLRRDYFNTPWAAISFVAATLFILLT

Query:  FIQTIYSHLSYSKHP
        F Q+ Y+  +Y K P
Subjt:  FIQTIYSHLSYSKHP

AT3G50160.1 Plant protein of unknown function (DUF247)3.3e-3931.79Show/hide
Query:  GQKHLEAMEGHKLMGLKIYLHRINMRVEVAIRIAQNWVGKARGYYAEPINMYDDEFVKMMLVDGCFLLEFMILAYHNF-----SPPDQILTLDCSFYGAM
        G KHL  ME HK   + + + R    +E+ I   +    KAR  Y  PINM  +EF++M+++DG F++E        F     +P D +        G M
Subjt:  GQKHLEAMEGHKLMGLKIYLHRINMRVEVAIRIAQNWVGKARGYYAEPINMYDDEFVKMMLVDGCFLLEFMILAYHNF-----SPPDQILTLDCSFYGAM

Query:  RNHLYYDIMMLENQLPLFVLEGLFNQLGAYQDTGMERLSFEVLIQIFLEYGCIKSYDRLSQIVINRNANHLVHLLRD---YYSFSIVKDRTKNNDSSEQR
        ++ +  D++MLENQLP  VL+GL   L   +   +++++    +Q+F  +   +      +++      H + +LR      S +  +D +  N   +Q 
Subjt:  RNHLYYDIMMLENQLPLFVLEGLFNQLGAYQDTGMERLSFEVLIQIFLEYGCIKSYDRLSQIVINRNANHLVHLLRD---YYSFSIVKDRTKNNDSSEQR

Query:  GFLLPPTLTELWEAGVTIKKASEEIHFMDISFKDGVLEIPHFEIDDQFETRVRNLMAFEQYQYLRTDQTYVTNYMSFLDGLISTEKDASLLVKAEILTNN
           L   +TEL  AGV   +  E  HF DI FK+G L+IP   I D  ++   NL+AFEQ     + +  +T+Y+ F+D LI++ +D S L    I+ N 
Subjt:  GFLLPPTLTELWEAGVTIKKASEEIHFMDISFKDGVLEIPHFEIDDQFETRVRNLMAFEQYQYLRTDQTYVTNYMSFLDGLISTEKDASLLVKAEILTNN

Query:  IGGSNEEVSKLFNDLCKGITLPGHDYYYYNMTKALSKYCKTMKHRWMASLRRDYFNTPWAAISFVAATLFILLTFIQTIYSHLSYSKHPP
        + GS+ EVS LFN L K +    +D Y   +T  ++ Y +   +   A+LR  YFN PWA  SF+AA   ++ TF Q+ ++  +Y K PP
Subjt:  IGGSNEEVSKLFNDLCKGITLPGHDYYYYNMTKALSKYCKTMKHRWMASLRRDYFNTPWAAISFVAATLFILLTFIQTIYSHLSYSKHPP

AT3G50170.1 Plant protein of unknown function (DUF247)1.6e-4131.84Show/hide
Query:  GQKHLEAMEGHKLMGLKIYLHRINMRVEVAIRIAQNWVGKARGYYAEPINMYDDEFVKMMLVDGCFLLEFMILAYHNFSPPDQILTLDCSFYGAMRNHLY
        G+K L  ME HK   L   L R+  R+E+     +    KAR  Y  PI++  +EF +M+++DGCF+LE        F+               + + + 
Subjt:  GQKHLEAMEGHKLMGLKIYLHRINMRVEVAIRIAQNWVGKARGYYAEPINMYDDEFVKMMLVDGCFLLEFMILAYHNFSPPDQILTLDCSFYGAMRNHLY

Query:  YDIMMLENQLPLFVLEGLFN-QLGAYQDTGMER----LSFEVLI----------QIFLEYGCIKSYDRLSQIVINRNANHLVHLLRDYY--------SFS
         D++MLENQLPLFVL+ L   QLG    TG+        F+ L+          Q  L     KS D L     ++   H + + R           + S
Subjt:  YDIMMLENQLPLFVLEGLFN-QLGAYQDTGMER----LSFEVLI----------QIFLEYGCIKSYDRLSQIVINRNANHLVHLLRDYY--------SFS

Query:  IVKDRTKNNDSSEQRGFLLPPTLTELWEAGVTIKKASEEIHFMDISFKDGVLEIPHFEIDDQFETRVRNLMAFEQYQYLRTDQTYVTNYMSFLDGLISTE
        ++K  T+N    ++R   L   +TEL EAGV  +K   +  F DI FK+G LEIP   I D  ++   NL+AFEQ         ++T+Y+ F+D LI++ 
Subjt:  IVKDRTKNNDSSEQRGFLLPPTLTELWEAGVTIKKASEEIHFMDISFKDGVLEIPHFEIDDQFETRVRNLMAFEQYQYLRTDQTYVTNYMSFLDGLISTE

Query:  KDASLLVKAEILTNNIGGSNEEVSKLFNDLCKGITLPGHDYYYYNMTKALSKYCKTMKHRWMASLRRDYFNTPWAAISFVAATLFILLTFIQTIYSHLSY
        +D S L    I+ + + GS+ EV+ LFN LC+ +     D +   ++  +++Y     +   A+L   YFN PWA  SF AA + +LLT  Q+ Y+  +Y
Subjt:  KDASLLVKAEILTNNIGGSNEEVSKLFNDLCKGITLPGHDYYYYNMTKALSKYCKTMKHRWMASLRRDYFNTPWAAISFVAATLFILLTFIQTIYSHLSY

Query:  SK
         K
Subjt:  SK

AT4G31980.1 unknown protein5.6e-4734.83Show/hide
Query:  GQKHLEAMEGHKLMGLKIYLHRINMRVEVAIRIAQNWVGKARGYYAEPINMYDDEFVKMMLVDGCFLLEFMILAYHNFSPPDQILTLDCSFYGAMR-NHL
        G++ L+AME  K   L  ++ R N  +E  +R+A+ W   AR  YAE + ++ DEFV+M++VDG FL+E ++ +++    P      D  F  +M    +
Subjt:  GQKHLEAMEGHKLMGLKIYLHRINMRVEVAIRIAQNWVGKARGYYAEPINMYDDEFVKMMLVDGCFLLEFMILAYHNFSPPDQILTLDCSFYGAMR-NHL

Query:  YYDIMMLENQLPLFVLEGLFNQLGAYQDTGMERLSFEVLIQIFLEYGCIKSYDRLSQIVINRNANHLVHLLRDYY--SFSIVKDRTKNNDSSEQRGFLLP
          D++++ENQLP FV++ +F  L  Y   G    S   L Q    Y       R+          H V LLR  Y   F I  + T     +        
Subjt:  YYDIMMLENQLPLFVLEGLFNQLGAYQDTGMERLSFEVLIQIFLEYGCIKSYDRLSQIVINRNANHLVHLLRDYY--SFSIVKDRTKNNDSSEQRGFLLP

Query:  PTLTELWEAGVTIKKASEEIHFMDISFKDGVLEIPHFEIDDQFETRVRNLMAFEQYQYLRTDQTYVTNYMSFLDGLISTEKDASLLVKAEILTNNIGGSN
        P  TEL  AGV  K A      +DISF DGVL+IP   +DD  E+  +N++ FEQ    R       +Y+  L   I +  DA LL+ + I+ N +G S 
Subjt:  PTLTELWEAGVTIKKASEEIHFMDISFKDGVLEIPHFEIDDQFETRVRNLMAFEQYQYLRTDQTYVTNYMSFLDGLISTEKDASLLVKAEILTNNIGGSN

Query:  EEVSKLFNDLCKGITLPGHDYYYYNMTKALSKYCKTMKHRWMASLRRDYFNTPWAAISFVAATLFILLTFIQTIYSHLS
         +VS LFN + K + +    +Y+  +++ L  YC T  +RW A LRRDYF+ PWA  S  AA L +LLTFIQ++ S L+
Subjt:  EEVSKLFNDLCKGITLPGHDYYYYNMTKALSKYCKTMKHRWMASLRRDYFNTPWAAISFVAATLFILLTFIQTIYSHLS

AT5G11290.1 Plant protein of unknown function (DUF247)3.1e-4533.69Show/hide
Query:  MEGHKLMGLKIYLHRINMRVEVAIRIAQNWVGKARGYYAEPINMYDDEFVKMMLVDGCFLLEFMILAYHNFSPPDQILTLDCSFYGAMRN--HLYYDIMM
        ME HKL  L+ ++ R  + +E  +R+A+ W  +AR  Y E + +  DE+VKM++VD  FL+E ++      S  D    +    YG  +    + +D+M+
Subjt:  MEGHKLMGLKIYLHRINMRVEVAIRIAQNWVGKARGYYAEPINMYDDEFVKMMLVDGCFLLEFMILAYHNFSPPDQILTLDCSFYGAMRN--HLYYDIMM

Query:  LENQLPLFVLEGLFNQLGAYQDTGMERLSFEVLIQIFLEYGCIKSYDRLSQIVINRNANHLVHLLRDYYSFSIVKDRTKNNDSSEQRGFLLPPTLTELWE
        LENQLP FV+EG+F  L       +  L+  ++   F ++    S    S+ + +    H V LLR     SI      +      R      +  E+  
Subjt:  LENQLPLFVLEGLFNQLGAYQDTGMERLSFEVLIQIFLEYGCIKSYDRLSQIVINRNANHLVHLLRDYYSFSIVKDRTKNNDSSEQRGFLLPPTLTELWE

Query:  AGVTIKKASEEIHFMDISFKDGVLEIPHFEIDDQFETRVRNLMAFEQYQYLRTDQTYVTNYMSFLDGLISTEKDASLLVKAEILTNNIGGSNEEVSKLFN
        AGV ++ A      +DISF +GVL IP  +I+D  E+  RN++ FEQ   L     Y  +YM FL   I +  DA L +   I+ N  G + E+VS+LFN
Subjt:  AGVTIKKASEEIHFMDISFKDGVLEIPHFEIDDQFETRVRNLMAFEQYQYLRTDQTYVTNYMSFLDGLISTEKDASLLVKAEILTNNIGGSNEEVSKLFN

Query:  DLCKGITLPGHDYYYYNMTKALSKYCKTMKHRWMASLRRDYFNTPWAAISFVAATLFILLTFIQTIYSHLS
         + K  +  G  +YY  +   L  +C    ++W A+LRRDYF+ PW+A S VAA + +LLTF+Q I S L+
Subjt:  DLCKGITLPGHDYYYYNMTKALSKYCKTMKHRWMASLRRDYFNTPWAAISFVAATLFILLTFIQTIYSHLS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCACCTCAAGTCATTTCCATTGGCCCTTTTCACCATGGGTCAAAAGCATTTGGAGGCCATGGAAGGACATAAGCTTATGGGTCTCAAAATTTACCTACATCGTATAAA
CATGAGAGTTGAGGTTGCCATCAGAATTGCTCAAAATTGGGTAGGGAAAGCTCGTGGTTACTATGCAGAGCCCATTAACATGTACGACGACGAGTTTGTGAAAATGATGC
TTGTGGATGGTTGCTTTCTACTGGAGTTTATGATACTGGCTTATCACAACTTCTCTCCACCAGATCAAATTCTAACGTTAGATTGCTCGTTCTATGGAGCTATGAGGAAT
CATTTATATTACGATATTATGATGCTTGAGAATCAACTTCCTCTCTTTGTTCTCGAAGGTCTATTTAACCAACTTGGAGCCTACCAAGACACTGGCATGGAAAGACTCTC
CTTTGAGGTTCTTATACAAATTTTTCTTGAATATGGGTGCATTAAATCATATGATCGGCTCTCCCAAATTGTCATCAACAGAAATGCAAATCACTTGGTCCATTTGTTGA
GAGACTACTACAGCTTCTCAATTGTTAAGGATAGGACGAAAAACAATGATTCGTCCGAGCAGAGGGGTTTTTTGCTTCCCCCAACTTTAACTGAGCTGTGGGAGGCTGGT
GTCACCATCAAGAAAGCATCAGAAGAGATTCATTTCATGGACATAAGTTTCAAAGATGGGGTTCTAGAAATCCCACATTTCGAAATTGACGATCAATTTGAAACCCGTGT
AAGAAATCTAATGGCTTTTGAGCAGTACCAGTACTTGAGGACAGATCAAACATATGTAACCAACTACATGTCATTTCTAGATGGCTTGATAAGCACGGAGAAAGACGCAA
GTTTACTTGTGAAGGCAGAAATCTTAACCAACAATATTGGTGGCAGTAATGAAGAAGTTTCAAAACTGTTCAATGATTTATGTAAAGGAATAACACTCCCAGGCCATGAC
TATTACTACTACAATATGACCAAAGCTTTAAGTAAGTATTGCAAGACGATGAAGCATCGATGGATGGCTTCATTGAGACGTGACTATTTTAATACGCCATGGGCTGCTAT
CTCCTTTGTTGCAGCAACTTTATTCATTCTTCTCACTTTCATTCAAACCATATACTCTCATCTATCGTATTCCAAGCACCCTCCAACTGCAACCAAGCTTTGTGATGCTA
GTGTCAAGTTTCAAAATGCAACACAAACCAAACACATTATGGACATAAACTTCCACGACGGAGTTCTAGAAATCCCACCTTTCGAAATCAGTGACATCGTTGAAACCAAT
ATGCGAAACCTTATGGCATTTAAGCATTACCACATAGGGAGTAATAAGAGCGAAGCTTTACTTAAGCATTTTAGTGCACGATGGAACAAGTGGAAAGCTTCACTGAAACG
TGATCATTGCAATACTCCATGGACTCTTATCTCTTCCATTGCTGCTACTGTCCTCATTCTTTTCACTGCTCCACAAGTTCTATTCGCTGTTGTACCACTTTCCAAGTAA
mRNA sequenceShow/hide mRNA sequence
ATGCACCTCAAGTCATTTCCATTGGCCCTTTTCACCATGGGTCAAAAGCATTTGGAGGCCATGGAAGGACATAAGCTTATGGGTCTCAAAATTTACCTACATCGTATAAA
CATGAGAGTTGAGGTTGCCATCAGAATTGCTCAAAATTGGGTAGGGAAAGCTCGTGGTTACTATGCAGAGCCCATTAACATGTACGACGACGAGTTTGTGAAAATGATGC
TTGTGGATGGTTGCTTTCTACTGGAGTTTATGATACTGGCTTATCACAACTTCTCTCCACCAGATCAAATTCTAACGTTAGATTGCTCGTTCTATGGAGCTATGAGGAAT
CATTTATATTACGATATTATGATGCTTGAGAATCAACTTCCTCTCTTTGTTCTCGAAGGTCTATTTAACCAACTTGGAGCCTACCAAGACACTGGCATGGAAAGACTCTC
CTTTGAGGTTCTTATACAAATTTTTCTTGAATATGGGTGCATTAAATCATATGATCGGCTCTCCCAAATTGTCATCAACAGAAATGCAAATCACTTGGTCCATTTGTTGA
GAGACTACTACAGCTTCTCAATTGTTAAGGATAGGACGAAAAACAATGATTCGTCCGAGCAGAGGGGTTTTTTGCTTCCCCCAACTTTAACTGAGCTGTGGGAGGCTGGT
GTCACCATCAAGAAAGCATCAGAAGAGATTCATTTCATGGACATAAGTTTCAAAGATGGGGTTCTAGAAATCCCACATTTCGAAATTGACGATCAATTTGAAACCCGTGT
AAGAAATCTAATGGCTTTTGAGCAGTACCAGTACTTGAGGACAGATCAAACATATGTAACCAACTACATGTCATTTCTAGATGGCTTGATAAGCACGGAGAAAGACGCAA
GTTTACTTGTGAAGGCAGAAATCTTAACCAACAATATTGGTGGCAGTAATGAAGAAGTTTCAAAACTGTTCAATGATTTATGTAAAGGAATAACACTCCCAGGCCATGAC
TATTACTACTACAATATGACCAAAGCTTTAAGTAAGTATTGCAAGACGATGAAGCATCGATGGATGGCTTCATTGAGACGTGACTATTTTAATACGCCATGGGCTGCTAT
CTCCTTTGTTGCAGCAACTTTATTCATTCTTCTCACTTTCATTCAAACCATATACTCTCATCTATCGTATTCCAAGCACCCTCCAACTGCAACCAAGCTTTGTGATGCTA
GTGTCAAGTTTCAAAATGCAACACAAACCAAACACATTATGGACATAAACTTCCACGACGGAGTTCTAGAAATCCCACCTTTCGAAATCAGTGACATCGTTGAAACCAAT
ATGCGAAACCTTATGGCATTTAAGCATTACCACATAGGGAGTAATAAGAGCGAAGCTTTACTTAAGCATTTTAGTGCACGATGGAACAAGTGGAAAGCTTCACTGAAACG
TGATCATTGCAATACTCCATGGACTCTTATCTCTTCCATTGCTGCTACTGTCCTCATTCTTTTCACTGCTCCACAAGTTCTATTCGCTGTTGTACCACTTTCCAAGTAA
Protein sequenceShow/hide protein sequence
MHLKSFPLALFTMGQKHLEAMEGHKLMGLKIYLHRINMRVEVAIRIAQNWVGKARGYYAEPINMYDDEFVKMMLVDGCFLLEFMILAYHNFSPPDQILTLDCSFYGAMRN
HLYYDIMMLENQLPLFVLEGLFNQLGAYQDTGMERLSFEVLIQIFLEYGCIKSYDRLSQIVINRNANHLVHLLRDYYSFSIVKDRTKNNDSSEQRGFLLPPTLTELWEAG
VTIKKASEEIHFMDISFKDGVLEIPHFEIDDQFETRVRNLMAFEQYQYLRTDQTYVTNYMSFLDGLISTEKDASLLVKAEILTNNIGGSNEEVSKLFNDLCKGITLPGHD
YYYYNMTKALSKYCKTMKHRWMASLRRDYFNTPWAAISFVAATLFILLTFIQTIYSHLSYSKHPPTATKLCDASVKFQNATQTKHIMDINFHDGVLEIPPFEISDIVETN
MRNLMAFKHYHIGSNKSEALLKHFSARWNKWKASLKRDHCNTPWTLISSIAATVLILFTAPQVLFAVVPLSK