; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0019053 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0019053
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionUnknown protein
Genome locationchr5:38089038..38097334
RNA-Seq ExpressionLag0019053
SyntenyLag0019053
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7015379.1 hypothetical protein SDJN02_23014, partial [Cucurbita argyrosperma subsp. argyrosperma]4.4e-18175.72Show/hide
Query:  MEELRKLEQVQKLITLMESRGIAKASLSSSNRFLANFVLLLVQPCGELDFDDKFDLVSEYIPKFSEEFLGDASLLLTDGGYRGMEMENTLKPYCDNKLDF
        MEELRKLEQVQKLIT MES GIA  S SSSN+F+ANFVLLLV+PCGELDF+DKF+LVSEY+PKFSEEFLG+ASLLL DGGYRG EME TL+PY DNKLDF
Subjt:  MEELRKLEQVQKLITLMESRGIAKASLSSSNRFLANFVLLLVQPCGELDFDDKFDLVSEYIPKFSEEFLGDASLLLTDGGYRGMEMENTLKPYCDNKLDF

Query:  GSIQNNKGELAMVGLDAMQRANSTLEDFVERTSKGRGFFCKQGSWKTMAGKLEPWFQLLRQKNIAFLAQGVWRFQLEITTYGIWSYRQFRSYFMFHGMDV
        GS  N  GE+AMVGLDAMQRANSTLEDF                    A  + P+ +L             +++ L           QFRSYFMFHGMDV
Subjt:  GSIQNNKGELAMVGLDAMQRANSTLEDFVERTSKGRGFFCKQGSWKTMAGKLEPWFQLLRQKNIAFLAQGVWRFQLEITTYGIWSYRQFRSYFMFHGMDV

Query:  NKPQSVFKYLPVLSFTESYIYQLDTLNEEIVSGDSIFGESQEANERGNKFLNAIKFDPLQPLIDVLKSHGLVTDRLIHELRCGEEYWALERDLCGALVSK
        N PQ VFKY P+LSFTESYIYQLDTLNEEI  G S FGESQEANER  + L+AIK DP QPLI++ KSHGL+T+RLIHELR GEEYWALER LC AL SK
Subjt:  NKPQSVFKYLPVLSFTESYIYQLDTLNEEIVSGDSIFGESQEANERGNKFLNAIKFDPLQPLIDVLKSHGLVTDRLIHELRCGEEYWALERDLCGALVSK

Query:  GKVSVEDVMRAIHLKSFDYRVLNLLLYQLRGEQVKDLHMEFLSISELLVEIADDLFDYEDDVLENNFNILRMFVRIYGSSAPTALAKHISEAEEKYDRLL
        GKVSVEDVMRAIHLKSFDYRVLNLLLYQLRGE+V DLHMEFLSISELLVEIADDLFDYEDDVLENNFNILRMFVRIYG+SAPTALAK +SEAEEKYDRLL
Subjt:  GKVSVEDVMRAIHLKSFDYRVLNLLLYQLRGEQVKDLHMEFLSISELLVEIADDLFDYEDDVLENNFNILRMFVRIYGSSAPTALAKHISEAEEKYDRLL

Query:  KALDPHLSSLYQRRCEEATKEGGKVSAHPFGSWSMPPLILDEESFRSSV
        KALDPHLSSLYQRRCEEATKEGGKVSAHPFGSWSMPPLI DE+SFR+SV
Subjt:  KALDPHLSSLYQRRCEEATKEGGKVSAHPFGSWSMPPLILDEESFRSSV

XP_008453814.1 PREDICTED: uncharacterized protein LOC103494428 [Cucumis melo]7.5e-18173.94Show/hide
Query:  MEELRKLEQVQKLITLMESRGIAKASLSSSNRFLANFVLLLVQPCGELDFDDKFDLVSEYIPKFSEEFLGDASLLLTDGGYRGMEMENTLKPYCDNKLDF
        MEELRKLEQV  LITLMESRGI+  S SSSNRF+ANFVLLLVQPCGELDFDDKFDLVSEY+PKFSEEFLGD SLLL DG YRG EMENTL+PYCD+KLDF
Subjt:  MEELRKLEQVQKLITLMESRGIAKASLSSSNRFLANFVLLLVQPCGELDFDDKFDLVSEYIPKFSEEFLGDASLLLTDGGYRGMEMENTLKPYCDNKLDF

Query:  GSIQNNKGELAMVGLDAMQRANSTLEDFVERTSKGRGFFCKQGSWKTMAGKLEPWFQLLRQKNIAFLAQGVWRFQLEITTYGIWSYRQFRSYFMFHGMDV
        G  QN  GE+AMVGLDAMQRANSTLEDF                                                            FRSYFMFHGMD 
Subjt:  GSIQNNKGELAMVGLDAMQRANSTLEDFVERTSKGRGFFCKQGSWKTMAGKLEPWFQLLRQKNIAFLAQGVWRFQLEITTYGIWSYRQFRSYFMFHGMDV

Query:  NKPQSVFKYLPVLSFTESYIYQLDTLNEEIVSGDSIFGESQEANERGNKFLNAIKFDPLQPLIDVLKSHGLVTDRLIHELRCGEEYWALERDLCGALVSK
        NKPQSVFKY P+LSFTESYIYQLDTLNE+IV G   FGESQE NE+  K L+AIK DPL+PLI++ KSHGL+TDRL+HELR GEEYWALERDLCGAL S 
Subjt:  NKPQSVFKYLPVLSFTESYIYQLDTLNEEIVSGDSIFGESQEANERGNKFLNAIKFDPLQPLIDVLKSHGLVTDRLIHELRCGEEYWALERDLCGALVSK

Query:  GKVSVEDVMRAIHLKSFDYRVLNLLLYQLRGEQVKDLHMEFLSISELLVEIADDLFDYEDDVLENNFNILRMFVRIYGSSAPTALAKHISEAEEKYDRLL
        GKVS+EDVMRAIHLKSFDYRVLNLLLYQLRG++V DLHMEFLS+SELLVEIADDLFDYEDDVLENNFNILRMFVR+YG+SAPTALAK++SEAEEKYD+LL
Subjt:  GKVSVEDVMRAIHLKSFDYRVLNLLLYQLRGEQVKDLHMEFLSISELLVEIADDLFDYEDDVLENNFNILRMFVRIYGSSAPTALAKHISEAEEKYDRLL

Query:  KALDPHLSSLYQRRCEEATKEGGKVSAHPFGSWSMPPLILDEESFRSSV
        KALDPHLSSLYQRRCEEATKEGGKVSAH FGSWS+PPLI DEESFR+SV
Subjt:  KALDPHLSSLYQRRCEEATKEGGKVSAHPFGSWSMPPLILDEESFRSSV

XP_022136920.1 uncharacterized protein LOC111008497 isoform X1 [Momordica charantia]2.0e-18175.5Show/hide
Query:  MEELRKLEQVQKLITLMESRGIAKASLSSSNRFLANFVLLLVQPCGELDFDDKFDLVSEYIPKFSEEFLGDASLLLTDGGYRGMEMENTLKPY--CDNKL
        MEELRKLEQVQKLI L+ESRGIA  S SSSNRFLANFVLLLVQPCGELDFD+K DLVSEY+PKFSEEFLGDASLLL+D  + G EMEN LKPY   DNKL
Subjt:  MEELRKLEQVQKLITLMESRGIAKASLSSSNRFLANFVLLLVQPCGELDFDDKFDLVSEYIPKFSEEFLGDASLLLTDGGYRGMEMENTLKPY--CDNKL

Query:  DFGSIQNNKGELAMVGLDAMQRANSTLEDFVERTSKGRGFFCKQGSWKTMAGKLEPWFQLLRQKNIAFLAQGVWRFQLEITTYGIWSYRQFRSYFMFHGM
        DFGS QN  GE+AMVGLDAMQRANSTLEDF                                                            FRSYFMFHGM
Subjt:  DFGSIQNNKGELAMVGLDAMQRANSTLEDFVERTSKGRGFFCKQGSWKTMAGKLEPWFQLLRQKNIAFLAQGVWRFQLEITTYGIWSYRQFRSYFMFHGM

Query:  DVNKPQSVFKYLPVLSFTESYIYQLDTLNEEIVSGDSIFGESQEANERGNKFLNAIKFDPLQPLIDVLKSHGLVTDRLIHELRCGEEYWALERDLCGALV
        DVNKPQSVFKY PVLSFTESYIYQLDTLNE++V G   FG+SQEANERG KFLNAIK DPLQPLI++L+ HGLVTDRLIHELRCG+EYWALERDLCGAL+
Subjt:  DVNKPQSVFKYLPVLSFTESYIYQLDTLNEEIVSGDSIFGESQEANERGNKFLNAIKFDPLQPLIDVLKSHGLVTDRLIHELRCGEEYWALERDLCGALV

Query:  SKGKVSVEDVMRAIHLKSFDYRVLNLLLYQLRGEQVKDLHMEFLSISELLVEIADDLFDYEDDVLENNFNILRMFVRIYGSSAPTALAKHISEAEEKYDR
        S+GKVSVEDVMRAIHLKSFDYRVLNLLLYQLRGE+V DLHMEFLSISELLVEIADDLFDYEDDVLENNFNILRMFVRIYG+SAPT LAK+ISEAEEKYDR
Subjt:  SKGKVSVEDVMRAIHLKSFDYRVLNLLLYQLRGEQVKDLHMEFLSISELLVEIADDLFDYEDDVLENNFNILRMFVRIYGSSAPTALAKHISEAEEKYDR

Query:  LLKALDPHLSSLYQRRCEEATKEGGKVSAHPFGSWSMPPLILDEESFRSSVSA
        LLKAL  HLSSLYQRRCEEAT+EGGKVSAHPFGSWSMPPLIL EESFR+SV +
Subjt:  LLKALDPHLSSLYQRRCEEATKEGGKVSAHPFGSWSMPPLILDEESFRSSVSA

XP_031745684.1 uncharacterized protein LOC101211687 [Cucumis sativus]1.7e-18074.16Show/hide
Query:  MEELRKLEQVQKLITLMESRGIAKASLSSSNRFLANFVLLLVQPCGELDFDDKFDLVSEYIPKFSEEFLGDASLLLTDGGYRGMEMENTLKPYCDNKLDF
        MEELRKLEQV  LITLM+SRGI   S SSSNRF+ANF+LLLVQPCGELDFDDKFDLVSEY+PKFSEEFLGD SLLL DG YRG EMENTL+PYCDNKLD 
Subjt:  MEELRKLEQVQKLITLMESRGIAKASLSSSNRFLANFVLLLVQPCGELDFDDKFDLVSEYIPKFSEEFLGDASLLLTDGGYRGMEMENTLKPYCDNKLDF

Query:  GSIQNNKGELAMVGLDAMQRANSTLEDFVERTSKGRGFFCKQGSWKTMAGKLEPWFQLLRQKNIAFLAQGVWRFQLEITTYGIWSYRQFRSYFMFHGMDV
        GS QN  GE+AMVGLDAMQRANSTLEDF                                                            FRSYFMFHGMD 
Subjt:  GSIQNNKGELAMVGLDAMQRANSTLEDFVERTSKGRGFFCKQGSWKTMAGKLEPWFQLLRQKNIAFLAQGVWRFQLEITTYGIWSYRQFRSYFMFHGMDV

Query:  NKPQSVFKYLPVLSFTESYIYQLDTLNEEIVSGDSIFGESQEANERGNKFLNAIKFDPLQPLIDVLKSHGLVTDRLIHELRCGEEYWALERDLCGALVSK
        NKPQSVFKY P+LSFTESYIYQLDTLNE+IV G   FGESQE NE+  K L+AI+ DPLQPLI++LKSHGL+TDRL+HELR GEEYWALERDLCGAL S 
Subjt:  NKPQSVFKYLPVLSFTESYIYQLDTLNEEIVSGDSIFGESQEANERGNKFLNAIKFDPLQPLIDVLKSHGLVTDRLIHELRCGEEYWALERDLCGALVSK

Query:  GKVSVEDVMRAIHLKSFDYRVLNLLLYQLRGEQVKDLHMEFLSISELLVEIADDLFDYEDDVLENNFNILRMFVRIYGSSAPTALAKHISEAEEKYDRLL
        GKVS+EDVMRAIHLKSFDYRVLNLLLYQLRG++V DLHMEFLSISELLVEIADDLFDYEDDVLENNFNILRMFVR+YG+SAPTALAK++SEAEEKYD LL
Subjt:  GKVSVEDVMRAIHLKSFDYRVLNLLLYQLRGEQVKDLHMEFLSISELLVEIADDLFDYEDDVLENNFNILRMFVRIYGSSAPTALAKHISEAEEKYDRLL

Query:  KALDPHLSSLYQRRCEEATKEGGKVSAHPFGSWSMPPLILDEESFRSSV
        KALDPHLSSLYQRRCEEATKEGGKVSAH FGSWS+PPLI DEESFR+SV
Subjt:  KALDPHLSSLYQRRCEEATKEGGKVSAHPFGSWSMPPLILDEESFRSSV

XP_038878058.1 uncharacterized protein LOC120070244 isoform X1 [Benincasa hispida]1.8e-18275.5Show/hide
Query:  MEELRKLEQVQKLITLMESRGIAKASLSSSNRFLANFVLLLVQPCGELDFDDKFDLVSEYIPKFSEEFLGDASLLLTDGGYRGMEMENTLKPYCDNKLDF
        MEELRKLEQVQKLITLM+SRGIA  S SSSNRF+ANFVLLLVQPCGELDFDDK DLVSEYIPKFSEEFLGD SLLL+DG YRG EM+ TL+PYCDN LDF
Subjt:  MEELRKLEQVQKLITLMESRGIAKASLSSSNRFLANFVLLLVQPCGELDFDDKFDLVSEYIPKFSEEFLGDASLLLTDGGYRGMEMENTLKPYCDNKLDF

Query:  GSIQNNKGELAMVGLDAMQRANSTLEDFVERTSKGRGFFCKQGSWKTMAGKLEPWFQLLRQKNIAFLAQGVWRFQLEITTYGIWSYRQFRSYFMFHGMDV
         S QN  GE+AMVGLDAMQRANSTLEDF                                                            FRSYFMFHGMDV
Subjt:  GSIQNNKGELAMVGLDAMQRANSTLEDFVERTSKGRGFFCKQGSWKTMAGKLEPWFQLLRQKNIAFLAQGVWRFQLEITTYGIWSYRQFRSYFMFHGMDV

Query:  NKPQSVFKYLPVLSFTESYIYQLDTLNEEIVSGDSIFGESQEANERGNKFLNAIKFDPLQPLIDVLKSHGLVTDRLIHELRCGEEYWALERDLCGALVSK
        NKPQSVFKY P+LSFTESYIYQLDTLNE+IV G   F ESQEANER  K L+AIK DPLQPLI++L+SHGL+TDRL+HELR GEEYWALER LCGAL SK
Subjt:  NKPQSVFKYLPVLSFTESYIYQLDTLNEEIVSGDSIFGESQEANERGNKFLNAIKFDPLQPLIDVLKSHGLVTDRLIHELRCGEEYWALERDLCGALVSK

Query:  GKVSVEDVMRAIHLKSFDYRVLNLLLYQLRGEQVKDLHMEFLSISELLVEIADDLFDYEDDVLENNFNILRMFVRIYGSSAPTALAKHISEAEEKYDRLL
         KVS+EDVMRAIHLKSFDYRVLNLLLYQLRGE+V DLHMEFLSISELLVEIADDLFDYEDDVLENNFNILRMFVRIYG+SAPTALAK++SEAEEKYDRLL
Subjt:  GKVSVEDVMRAIHLKSFDYRVLNLLLYQLRGEQVKDLHMEFLSISELLVEIADDLFDYEDDVLENNFNILRMFVRIYGSSAPTALAKHISEAEEKYDRLL

Query:  KALDPHLSSLYQRRCEEATKEGGKVSAHPFGSWSMPPLILDEESFRSSV
        KALDPHLSSLYQRRCEEATKEGGKVSAHPFGSW MPPLI DEESFR+SV
Subjt:  KALDPHLSSLYQRRCEEATKEGGKVSAHPFGSWSMPPLILDEESFRSSV

TrEMBL top hitse value%identityAlignment
A0A1S3BX98 uncharacterized protein LOC1034944283.6e-18173.94Show/hide
Query:  MEELRKLEQVQKLITLMESRGIAKASLSSSNRFLANFVLLLVQPCGELDFDDKFDLVSEYIPKFSEEFLGDASLLLTDGGYRGMEMENTLKPYCDNKLDF
        MEELRKLEQV  LITLMESRGI+  S SSSNRF+ANFVLLLVQPCGELDFDDKFDLVSEY+PKFSEEFLGD SLLL DG YRG EMENTL+PYCD+KLDF
Subjt:  MEELRKLEQVQKLITLMESRGIAKASLSSSNRFLANFVLLLVQPCGELDFDDKFDLVSEYIPKFSEEFLGDASLLLTDGGYRGMEMENTLKPYCDNKLDF

Query:  GSIQNNKGELAMVGLDAMQRANSTLEDFVERTSKGRGFFCKQGSWKTMAGKLEPWFQLLRQKNIAFLAQGVWRFQLEITTYGIWSYRQFRSYFMFHGMDV
        G  QN  GE+AMVGLDAMQRANSTLEDF                                                            FRSYFMFHGMD 
Subjt:  GSIQNNKGELAMVGLDAMQRANSTLEDFVERTSKGRGFFCKQGSWKTMAGKLEPWFQLLRQKNIAFLAQGVWRFQLEITTYGIWSYRQFRSYFMFHGMDV

Query:  NKPQSVFKYLPVLSFTESYIYQLDTLNEEIVSGDSIFGESQEANERGNKFLNAIKFDPLQPLIDVLKSHGLVTDRLIHELRCGEEYWALERDLCGALVSK
        NKPQSVFKY P+LSFTESYIYQLDTLNE+IV G   FGESQE NE+  K L+AIK DPL+PLI++ KSHGL+TDRL+HELR GEEYWALERDLCGAL S 
Subjt:  NKPQSVFKYLPVLSFTESYIYQLDTLNEEIVSGDSIFGESQEANERGNKFLNAIKFDPLQPLIDVLKSHGLVTDRLIHELRCGEEYWALERDLCGALVSK

Query:  GKVSVEDVMRAIHLKSFDYRVLNLLLYQLRGEQVKDLHMEFLSISELLVEIADDLFDYEDDVLENNFNILRMFVRIYGSSAPTALAKHISEAEEKYDRLL
        GKVS+EDVMRAIHLKSFDYRVLNLLLYQLRG++V DLHMEFLS+SELLVEIADDLFDYEDDVLENNFNILRMFVR+YG+SAPTALAK++SEAEEKYD+LL
Subjt:  GKVSVEDVMRAIHLKSFDYRVLNLLLYQLRGEQVKDLHMEFLSISELLVEIADDLFDYEDDVLENNFNILRMFVRIYGSSAPTALAKHISEAEEKYDRLL

Query:  KALDPHLSSLYQRRCEEATKEGGKVSAHPFGSWSMPPLILDEESFRSSV
        KALDPHLSSLYQRRCEEATKEGGKVSAH FGSWS+PPLI DEESFR+SV
Subjt:  KALDPHLSSLYQRRCEEATKEGGKVSAHPFGSWSMPPLILDEESFRSSV

A0A5A7TTZ5 Uncharacterized protein6.8e-18074Show/hide
Query:  MEELRKLEQVQKLITLMESRGIAKASLSSSNRFLANFVLLLVQPCGELDFDDKFDLVSEYIPKFSEEFLGDASLLLTDGGYRGMEMENTLKPYCDNKLDF
        MEELRKLEQV  LITLMESRGI+  S SSSNRF+ANFVLLLVQPCGELDFDDKFDLVSEY+PKFSEEFLGD SLLL DG YRG EMENTL+PYCD+KLDF
Subjt:  MEELRKLEQVQKLITLMESRGIAKASLSSSNRFLANFVLLLVQPCGELDFDDKFDLVSEYIPKFSEEFLGDASLLLTDGGYRGMEMENTLKPYCDNKLDF

Query:  GSIQNNKGELAMVGLDAMQRANSTLEDFVERTSKGRGFFCKQGSWKTMAGKLEPWFQLLRQKNIAFLAQGVWRFQLEITTYGIWSYRQFRSYFMFHGMDV
        G  QN  GE+AMVGLDAMQRANSTLEDF                                                            FRSYFMFHGMD 
Subjt:  GSIQNNKGELAMVGLDAMQRANSTLEDFVERTSKGRGFFCKQGSWKTMAGKLEPWFQLLRQKNIAFLAQGVWRFQLEITTYGIWSYRQFRSYFMFHGMDV

Query:  NKPQSVFKYLPVLSFTESYIYQLDTLNEEIVSGDSIFGESQEANERGNKFLNAIKFDPLQPLIDVLKSHGLVTDRLIHELRCGEEYWALERDLCGALVSK
        NKPQSVFKY P+LSFTESYIYQLDTLNE+IV G   FGESQE NE+  K L+AIK DPL+PLI++ KSHGL+TDRL+HELR GEEYWALERDLCGAL S 
Subjt:  NKPQSVFKYLPVLSFTESYIYQLDTLNEEIVSGDSIFGESQEANERGNKFLNAIKFDPLQPLIDVLKSHGLVTDRLIHELRCGEEYWALERDLCGALVSK

Query:  GK-VSVEDVMRAIHLKSFDYRVLNLLLYQLRGEQVKDLHMEFLSISELLVEIADDLFDYEDDVLENNFNILRMFVRIYGSSAPTALAKHISEAEEKYDRL
        GK VS+EDVMRAIHLKSFDYRVLNLLLYQLRG++V DLHMEFLSISELLVEIADDLFDYEDDVLENNFNILRMFVR+YG+SAPTALAK++SEAEEKYD+L
Subjt:  GK-VSVEDVMRAIHLKSFDYRVLNLLLYQLRGEQVKDLHMEFLSISELLVEIADDLFDYEDDVLENNFNILRMFVRIYGSSAPTALAKHISEAEEKYDRL

Query:  LKALDPHLSSLYQRRCEEATKEGGKVSAHPFGSWSMPPLILDEESFRSSV
        LKALDPHLSSLYQRRCEEATKEGGKVSAH FGSWS+PPLI DEESFR+SV
Subjt:  LKALDPHLSSLYQRRCEEATKEGGKVSAHPFGSWSMPPLILDEESFRSSV

A0A6J1C599 uncharacterized protein LOC111008497 isoform X19.5e-18275.5Show/hide
Query:  MEELRKLEQVQKLITLMESRGIAKASLSSSNRFLANFVLLLVQPCGELDFDDKFDLVSEYIPKFSEEFLGDASLLLTDGGYRGMEMENTLKPY--CDNKL
        MEELRKLEQVQKLI L+ESRGIA  S SSSNRFLANFVLLLVQPCGELDFD+K DLVSEY+PKFSEEFLGDASLLL+D  + G EMEN LKPY   DNKL
Subjt:  MEELRKLEQVQKLITLMESRGIAKASLSSSNRFLANFVLLLVQPCGELDFDDKFDLVSEYIPKFSEEFLGDASLLLTDGGYRGMEMENTLKPY--CDNKL

Query:  DFGSIQNNKGELAMVGLDAMQRANSTLEDFVERTSKGRGFFCKQGSWKTMAGKLEPWFQLLRQKNIAFLAQGVWRFQLEITTYGIWSYRQFRSYFMFHGM
        DFGS QN  GE+AMVGLDAMQRANSTLEDF                                                            FRSYFMFHGM
Subjt:  DFGSIQNNKGELAMVGLDAMQRANSTLEDFVERTSKGRGFFCKQGSWKTMAGKLEPWFQLLRQKNIAFLAQGVWRFQLEITTYGIWSYRQFRSYFMFHGM

Query:  DVNKPQSVFKYLPVLSFTESYIYQLDTLNEEIVSGDSIFGESQEANERGNKFLNAIKFDPLQPLIDVLKSHGLVTDRLIHELRCGEEYWALERDLCGALV
        DVNKPQSVFKY PVLSFTESYIYQLDTLNE++V G   FG+SQEANERG KFLNAIK DPLQPLI++L+ HGLVTDRLIHELRCG+EYWALERDLCGAL+
Subjt:  DVNKPQSVFKYLPVLSFTESYIYQLDTLNEEIVSGDSIFGESQEANERGNKFLNAIKFDPLQPLIDVLKSHGLVTDRLIHELRCGEEYWALERDLCGALV

Query:  SKGKVSVEDVMRAIHLKSFDYRVLNLLLYQLRGEQVKDLHMEFLSISELLVEIADDLFDYEDDVLENNFNILRMFVRIYGSSAPTALAKHISEAEEKYDR
        S+GKVSVEDVMRAIHLKSFDYRVLNLLLYQLRGE+V DLHMEFLSISELLVEIADDLFDYEDDVLENNFNILRMFVRIYG+SAPT LAK+ISEAEEKYDR
Subjt:  SKGKVSVEDVMRAIHLKSFDYRVLNLLLYQLRGEQVKDLHMEFLSISELLVEIADDLFDYEDDVLENNFNILRMFVRIYGSSAPTALAKHISEAEEKYDR

Query:  LLKALDPHLSSLYQRRCEEATKEGGKVSAHPFGSWSMPPLILDEESFRSSVSA
        LLKAL  HLSSLYQRRCEEAT+EGGKVSAHPFGSWSMPPLIL EESFR+SV +
Subjt:  LLKALDPHLSSLYQRRCEEATKEGGKVSAHPFGSWSMPPLILDEESFRSSVSA

A0A6J1EN90 uncharacterized protein LOC111435894 isoform X21.3e-17874.16Show/hide
Query:  MEELRKLEQVQKLITLMESRGIAKASLSSSNRFLANFVLLLVQPCGELDFDDKFDLVSEYIPKFSEEFLGDASLLLTDGGYRGMEMENTLKPYCDNKLDF
        MEELRKLEQVQKLIT MES GIA  S SSSN+F+ANFVLLLV+PCGELDF+DKF+LVSEY+PKFSEEFLG+ASLLL D GYRG EME TL+PY DNKLDF
Subjt:  MEELRKLEQVQKLITLMESRGIAKASLSSSNRFLANFVLLLVQPCGELDFDDKFDLVSEYIPKFSEEFLGDASLLLTDGGYRGMEMENTLKPYCDNKLDF

Query:  GSIQNNKGELAMVGLDAMQRANSTLEDFVERTSKGRGFFCKQGSWKTMAGKLEPWFQLLRQKNIAFLAQGVWRFQLEITTYGIWSYRQFRSYFMFHGMDV
        GS  N  GE+AMVGLDAMQRANSTLEDF                                                            FRSYFMFHGMDV
Subjt:  GSIQNNKGELAMVGLDAMQRANSTLEDFVERTSKGRGFFCKQGSWKTMAGKLEPWFQLLRQKNIAFLAQGVWRFQLEITTYGIWSYRQFRSYFMFHGMDV

Query:  NKPQSVFKYLPVLSFTESYIYQLDTLNEEIVSGDSIFGESQEANERGNKFLNAIKFDPLQPLIDVLKSHGLVTDRLIHELRCGEEYWALERDLCGALVSK
        N PQ +FKY P+LSFTESYIYQLDTLNEEI  G S FGESQEANER  + L+AIK DP QPLI++ K HGL+T+RLIHELR GEEYWALER LCGAL SK
Subjt:  NKPQSVFKYLPVLSFTESYIYQLDTLNEEIVSGDSIFGESQEANERGNKFLNAIKFDPLQPLIDVLKSHGLVTDRLIHELRCGEEYWALERDLCGALVSK

Query:  GKVSVEDVMRAIHLKSFDYRVLNLLLYQLRGEQVKDLHMEFLSISELLVEIADDLFDYEDDVLENNFNILRMFVRIYGSSAPTALAKHISEAEEKYDRLL
        GKVSVEDVMRAIHLKSFDYRVLNLLLYQLRGE+V DLHMEFLSISELLVEIADDLFDYEDDVLENNFNILRMFVRIYG+SAPTALAK +SEAEEKYDRLL
Subjt:  GKVSVEDVMRAIHLKSFDYRVLNLLLYQLRGEQVKDLHMEFLSISELLVEIADDLFDYEDDVLENNFNILRMFVRIYGSSAPTALAKHISEAEEKYDRLL

Query:  KALDPHLSSLYQRRCEEATKEGGKVSAHPFGSWSMPPLILDEESFRSSV
        KALDPHLSSLYQRRCEEATKEGGKVSAHPFGSWSMPPLI DE+SFR+SV
Subjt:  KALDPHLSSLYQRRCEEATKEGGKVSAHPFGSWSMPPLILDEESFRSSV

A0A6J1JA35 uncharacterized protein LOC111482641 isoform X21.5e-17974.39Show/hide
Query:  MEELRKLEQVQKLITLMESRGIAKASLSSSNRFLANFVLLLVQPCGELDFDDKFDLVSEYIPKFSEEFLGDASLLLTDGGYRGMEMENTLKPYCDNKLDF
        MEELRKLEQVQKLIT MES GIA  S SSSN+F+ANFVLLLV+PCGELDF+DKF+LVSEY+PKFSEEFLG+ASLLL DGGYRG EME TL+PY DNKLDF
Subjt:  MEELRKLEQVQKLITLMESRGIAKASLSSSNRFLANFVLLLVQPCGELDFDDKFDLVSEYIPKFSEEFLGDASLLLTDGGYRGMEMENTLKPYCDNKLDF

Query:  GSIQNNKGELAMVGLDAMQRANSTLEDFVERTSKGRGFFCKQGSWKTMAGKLEPWFQLLRQKNIAFLAQGVWRFQLEITTYGIWSYRQFRSYFMFHGMDV
        GS  N  GE+AMVGLDAMQRANS+LEDF                                                            FRSYFMFHGMDV
Subjt:  GSIQNNKGELAMVGLDAMQRANSTLEDFVERTSKGRGFFCKQGSWKTMAGKLEPWFQLLRQKNIAFLAQGVWRFQLEITTYGIWSYRQFRSYFMFHGMDV

Query:  NKPQSVFKYLPVLSFTESYIYQLDTLNEEIVSGDSIFGESQEANERGNKFLNAIKFDPLQPLIDVLKSHGLVTDRLIHELRCGEEYWALERDLCGALVSK
        N PQ VFKY P+LSFTESYIYQLDTLNE+I  G S FGESQEANER  + L+AIK DP QPLI++ KSHGL+T+RLIHELR GEEYWALER LCGAL SK
Subjt:  NKPQSVFKYLPVLSFTESYIYQLDTLNEEIVSGDSIFGESQEANERGNKFLNAIKFDPLQPLIDVLKSHGLVTDRLIHELRCGEEYWALERDLCGALVSK

Query:  GKVSVEDVMRAIHLKSFDYRVLNLLLYQLRGEQVKDLHMEFLSISELLVEIADDLFDYEDDVLENNFNILRMFVRIYGSSAPTALAKHISEAEEKYDRLL
        GKVSVEDVMRAIHLKSFDYRVLNLLLYQLRGE+V DLHMEFLSISELLVEIADDLFDYEDDVLENNFNILRMFVRIYG+SAPTAL K +SEAEEKYDRLL
Subjt:  GKVSVEDVMRAIHLKSFDYRVLNLLLYQLRGEQVKDLHMEFLSISELLVEIADDLFDYEDDVLENNFNILRMFVRIYGSSAPTALAKHISEAEEKYDRLL

Query:  KALDPHLSSLYQRRCEEATKEGGKVSAHPFGSWSMPPLILDEESFRSSV
        KALDPHLSSLYQRRCEEATKEGGKVSAHPFGSWSMPPLI DEESFR+SV
Subjt:  KALDPHLSSLYQRRCEEATKEGGKVSAHPFGSWSMPPLILDEESFRSSV

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT5G10320.1 unknown protein2.1e-11750.64Show/hide
Query:  MEELRKLEQVQKLITLMESRGIAKASLS---SSNRFLANFVLLLVQPCGELDFDDKFDLVSEYIPKFSEEFLGDASLLLTDGGYRGMEME-------NTL
        MEEL++LE+ Q +++ +   G+  +S S   SS+RFL+N VL LVQPCGELD D K  L+SE++PK S  FL + S        R +E++       NT+
Subjt:  MEELRKLEQVQKLITLMESRGIAKASLS---SSNRFLANFVLLLVQPCGELDFDDKFDLVSEYIPKFSEEFLGDASLLLTDGGYRGMEME-------NTL

Query:  KPYCDNKLDFGSIQNN---------KGELAMVGLDAMQRANSTLEDFVERTSKGRGFFCKQGSWKTMAGKLEPWFQLLRQKNIAFLAQGVWRFQLEITTY
           C N     S+ +N         +  +AMVGLDAM+RANSTLEDF                                                     
Subjt:  KPYCDNKLDFGSIQNN---------KGELAMVGLDAMQRANSTLEDFVERTSKGRGFFCKQGSWKTMAGKLEPWFQLLRQKNIAFLAQGVWRFQLEITTY

Query:  GIWSYRQFRSYFMFHGMDVNKPQSVFKYLPVLSFTESYIYQLDTLNEEIVSGDSIFGESQEANERGNKFLNAIKFD--PLQPLIDVLKSHGLVTDRLIHE
                RSYFMFH +D+N+PQS+F+YLPVLSFTESYIYQ+D LNE+IVS +S  G     +  G    + + F+  PL+PL D+L+   L+T+R+  E
Subjt:  GIWSYRQFRSYFMFHGMDVNKPQSVFKYLPVLSFTESYIYQLDTLNEEIVSGDSIFGESQEANERGNKFLNAIKFD--PLQPLIDVLKSHGLVTDRLIHE

Query:  LRCGEEYWALERDLCGALVSKGKVSVEDVMRAIHLKSFDYRVLNLLLYQLRGEQVKDLHMEFLSISELLVEIADDLFDYEDDVLENNFNILRMFVRIYGS
           GEEYWALER LC AL +K K+ +EDVMRAIHLKSFDYRVLNLLLY+LRGE+V +LHM+FLSISE LVE+ADDLFDYEDDVLENNFN+LRMFV I+GS
Subjt:  LRCGEEYWALERDLCGALVSKGKVSVEDVMRAIHLKSFDYRVLNLLLYQLRGEQVKDLHMEFLSISELLVEIADDLFDYEDDVLENNFNILRMFVRIYGS

Query:  S-APTALAKHISEAEEKYDRLLKALDPHLSSLYQRRCEEATKEGGKVSAHPFGSWSMPPLILDEESFRSS
        S APT LAK ISEAEEKY+ ++K+LDPHLSS YQRRCEEATKEGGK+S H  G+W++P +I DEE++RS+
Subjt:  S-APTALAKHISEAEEKYDRLLKALDPHLSSLYQRRCEEATKEGGKVSAHPFGSWSMPPLILDEESFRSS

AT5G10320.2 unknown protein1.8e-11651.63Show/hide
Query:  MEELRKLEQVQKLITLMESRGIAKASLS---SSNRFLANFVLLLVQPCGELDFDDKFDLVSEYIPKFSEEFLGD--ASLLLTDGGYRGM----EMENT-L
        MEEL++LE+ Q +++ +   G+  +S S   SS+RFL+N VL LVQPCGELD D K  L+SE++PK S  FL +   SL L D     +    E +N+ +
Subjt:  MEELRKLEQVQKLITLMESRGIAKASLS---SSNRFLANFVLLLVQPCGELDFDDKFDLVSEYIPKFSEEFLGD--ASLLLTDGGYRGM----EMENT-L

Query:  KPYCDNKLDFGSIQNNKGELAMVGLDAMQRANSTLEDFVERTSKGRGFFCKQGSWKTMAGKLEPWFQLLRQKNIAFLAQGVWRFQLEITTYGIWSYRQFR
        K    + +D    Q ++  +AMVGLDAM+RANSTLEDF                                                             R
Subjt:  KPYCDNKLDFGSIQNNKGELAMVGLDAMQRANSTLEDFVERTSKGRGFFCKQGSWKTMAGKLEPWFQLLRQKNIAFLAQGVWRFQLEITTYGIWSYRQFR

Query:  SYFMFHGMDVNKPQSVFKYLPVLSFTESYIYQLDTLNEEIVSGDSIFGESQEANERGNKFLNAIKFD--PLQPLIDVLKSHGLVTDRLIHELRCGEEYWA
        SYFMFH +D+N+PQS+F+YLPVLSFTESYIYQ+D LNE+IVS +S  G     +  G    + + F+  PL+PL D+L+   L+T+R+  E   GEEYWA
Subjt:  SYFMFHGMDVNKPQSVFKYLPVLSFTESYIYQLDTLNEEIVSGDSIFGESQEANERGNKFLNAIKFD--PLQPLIDVLKSHGLVTDRLIHELRCGEEYWA

Query:  LERDLCGALVSKGKVSVEDVMRAIHLKSFDYRVLNLLLYQLRGEQVKDLHMEFLSISELLVEIADDLFDYEDDVLENNFNILRMFVRIYGSS-APTALAK
        LER LC AL +K K+ +EDVMRAIHLKSFDYRVLNLLLY+LRGE+V +LHM+FLSISE LVE+ADDLFDYEDDVLENNFN+LRMFV I+GSS APT LAK
Subjt:  LERDLCGALVSKGKVSVEDVMRAIHLKSFDYRVLNLLLYQLRGEQVKDLHMEFLSISELLVEIADDLFDYEDDVLENNFNILRMFVRIYGSS-APTALAK

Query:  HISEAEEKYDRLLKALDPHLSSLYQRRCEEATKEGGKVSAHPFGSWSMPPLILDEESFRSS
         ISEAEEKY+ ++K+LDPHLSS YQRRCEEATKEGGK+S H  G+W++P +I DEE++RS+
Subjt:  HISEAEEKYDRLLKALDPHLSSLYQRRCEEATKEGGKVSAHPFGSWSMPPLILDEESFRSS

AT5G10320.3 unknown protein5.3e-7646.63Show/hide
Query:  MEELRKLEQVQKLITLMESRGIAKASLS---SSNRFLANFVLLLVQPCGELDFDDKFDLVSEYIPKFSEEFLGD--ASLLLTDGGYRGM----EMENT-L
        MEEL++LE+ Q +++ +   G+  +S S   SS+RFL+N VL LVQPCGELD D K  L+SE++PK S  FL +   SL L D     +    E +N+ +
Subjt:  MEELRKLEQVQKLITLMESRGIAKASLS---SSNRFLANFVLLLVQPCGELDFDDKFDLVSEYIPKFSEEFLGD--ASLLLTDGGYRGM----EMENT-L

Query:  KPYCDNKLDFGSIQNNKGELAMVGLDAMQRANSTLEDFVERTSKGRGFFCKQGSWKTMAGKLEPWFQLLRQKNIAFLAQGVWRFQLEITTYGIWSYRQFR
        K    + +D    Q ++  +AMVGLDAM+RANSTLEDF                                                             R
Subjt:  KPYCDNKLDFGSIQNNKGELAMVGLDAMQRANSTLEDFVERTSKGRGFFCKQGSWKTMAGKLEPWFQLLRQKNIAFLAQGVWRFQLEITTYGIWSYRQFR

Query:  SYFMFHGMDVNKPQSVFKYLPVLSFTESYIYQLDTLNEEIVSGDSIFGESQEANERGNKFLNAIKFD--PLQPLIDVLKSHGLVTDRLIHELRCGEEYWA
        SYFMFH +D+N+PQS+F+YLPVLSFTESYIYQ+D LNE+IVS +S  G     +  G    + + F+  PL+PL D+L+   L+T+R+  E   GEEYWA
Subjt:  SYFMFHGMDVNKPQSVFKYLPVLSFTESYIYQLDTLNEEIVSGDSIFGESQEANERGNKFLNAIKFD--PLQPLIDVLKSHGLVTDRLIHELRCGEEYWA

Query:  LERDLCGALVSKGKVSVEDVMRAIHLKSFDYRVLNLLLYQLRGEQVKDLHMEFLSISELLVEIADDLFDYE
        LER LC AL +K K+ +EDVMRAIHLKSFDYRVLNLLLY+LRGE+V +LHM+FLSISE LVE+ADDLFDYE
Subjt:  LERDLCGALVSKGKVSVEDVMRAIHLKSFDYRVLNLLLYQLRGEQVKDLHMEFLSISELLVEIADDLFDYE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAGAACTAAGGAAGCTGGAACAAGTTCAGAAACTCATTACGTTAATGGAGTCTCGCGGAATCGCCAAGGCTTCCCTATCTTCCTCCAATCGATTCCTTGCCAATTT
CGTTCTCCTTTTGGTGCAACCATGTGGAGAGCTTGACTTTGACGATAAATTCGATTTGGTTTCTGAATACATTCCGAAGTTTTCAGAAGAGTTTCTTGGCGATGCATCAT
TGCTGCTTACTGATGGAGGCTATAGAGGAATGGAGATGGAAAACACTTTGAAGCCTTATTGTGACAATAAGCTGGATTTTGGTTCTATCCAGAACAATAAAGGGGAATTG
GCTATGGTTGGCTTGGATGCAATGCAGCGTGCAAATTCTACACTTGAAGATTTTGTTGAAAGGACCTCAAAAGGAAGAGGCTTCTTCTGTAAACAAGGAAGTTGGAAAAC
AATGGCAGGAAAGCTGGAACCTTGGTTCCAATTGCTTAGGCAGAAAAATATTGCTTTCTTAGCTCAAGGGGTTTGGAGATTTCAGCTGGAAATTACCACTTATGGCATAT
GGTCATATCGGCAATTCAGATCTTATTTCATGTTCCACGGAATGGATGTAAACAAGCCTCAATCTGTATTCAAATACTTACCCGTTCTTTCATTTACAGAAAGCTACATA
TATCAGCTAGACACTTTGAACGAGGAGATAGTTTCCGGCGATTCCATTTTTGGAGAGTCTCAGGAAGCAAATGAAAGGGGCAATAAGTTTCTCAATGCCATCAAATTTGA
TCCACTTCAACCTCTTATTGATGTCCTTAAATCGCATGGCTTAGTGACGGACAGATTGATCCATGAACTAAGATGTGGAGAGGAGTATTGGGCATTGGAAAGGGACCTAT
GTGGGGCGTTGGTGAGCAAAGGGAAGGTCTCTGTTGAAGATGTAATGCGGGCTATTCATCTGAAATCATTTGATTATAGAGTGTTGAATCTTCTTTTGTATCAGTTGAGA
GGAGAACAGGTTAAGGACTTGCACATGGAGTTTCTATCAATCTCAGAATTGCTAGTTGAGATAGCTGACGATTTGTTTGATTATGAGGATGATGTACTAGAAAATAATTT
CAATATTCTGCGCATGTTTGTCCGGATATATGGATCTTCAGCTCCAACTGCGCTGGCAAAACATATCAGCGAGGCTGAGGAAAAGTACGACAGATTATTAAAAGCTTTGG
ATCCTCATTTATCCTCACTTTACCAAAGGAGATGTGAAGAGGCCACTAAAGAAGGTGGTAAGGTGTCAGCACATCCATTCGGCTCGTGGAGTATGCCACCTCTGATACTA
GACGAGGAGTCGTTTCGGTCATCTGTTAGTGCTGGTTCTGCTCGTTACTCCAAAACTGATCAGCAGATCGACCAAATCAGTCCCCCGACATATCGTCAGATTTCGAAGGC
CATTTGGTGCCATGGTTGGACTTCTAACCAGCCTGGCAAAAGACGTTCAACAAACAATTCTTGTTCTTTATGTAATTTGATCTATCTGGATGACATTTGGATATTGATTG
CTGAATTATATGGAGTTAATGAGCTTGAGATTGGTTTTATTTTGGGTTCCGGTGGGCGGCAACGGCAATGGATTTCCGCTAGCGGAGGGCGCCGACGAGGAGGAACGTTG
TGGCGTGGTCTGTCTGGGAGACCAGAATCCGTCCACCTCAGGATTACTTCGTCACCTTCCATTTTCGATAGTGTCGCGACTCAACCTCCCTCCTTCGCCGTCTCCTTCGC
CTCCGCCGTTTGTCTCATTCGCCTTCGCCGTCTGTATCCTTCGCCGTCCGTCTCCTCCGTCGGCTGCTGTAATGTCGCGTCTCTTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGAAGAACTAAGGAAGCTGGAACAAGTTCAGAAACTCATTACGTTAATGGAGTCTCGCGGAATCGCCAAGGCTTCCCTATCTTCCTCCAATCGATTCCTTGCCAATTT
CGTTCTCCTTTTGGTGCAACCATGTGGAGAGCTTGACTTTGACGATAAATTCGATTTGGTTTCTGAATACATTCCGAAGTTTTCAGAAGAGTTTCTTGGCGATGCATCAT
TGCTGCTTACTGATGGAGGCTATAGAGGAATGGAGATGGAAAACACTTTGAAGCCTTATTGTGACAATAAGCTGGATTTTGGTTCTATCCAGAACAATAAAGGGGAATTG
GCTATGGTTGGCTTGGATGCAATGCAGCGTGCAAATTCTACACTTGAAGATTTTGTTGAAAGGACCTCAAAAGGAAGAGGCTTCTTCTGTAAACAAGGAAGTTGGAAAAC
AATGGCAGGAAAGCTGGAACCTTGGTTCCAATTGCTTAGGCAGAAAAATATTGCTTTCTTAGCTCAAGGGGTTTGGAGATTTCAGCTGGAAATTACCACTTATGGCATAT
GGTCATATCGGCAATTCAGATCTTATTTCATGTTCCACGGAATGGATGTAAACAAGCCTCAATCTGTATTCAAATACTTACCCGTTCTTTCATTTACAGAAAGCTACATA
TATCAGCTAGACACTTTGAACGAGGAGATAGTTTCCGGCGATTCCATTTTTGGAGAGTCTCAGGAAGCAAATGAAAGGGGCAATAAGTTTCTCAATGCCATCAAATTTGA
TCCACTTCAACCTCTTATTGATGTCCTTAAATCGCATGGCTTAGTGACGGACAGATTGATCCATGAACTAAGATGTGGAGAGGAGTATTGGGCATTGGAAAGGGACCTAT
GTGGGGCGTTGGTGAGCAAAGGGAAGGTCTCTGTTGAAGATGTAATGCGGGCTATTCATCTGAAATCATTTGATTATAGAGTGTTGAATCTTCTTTTGTATCAGTTGAGA
GGAGAACAGGTTAAGGACTTGCACATGGAGTTTCTATCAATCTCAGAATTGCTAGTTGAGATAGCTGACGATTTGTTTGATTATGAGGATGATGTACTAGAAAATAATTT
CAATATTCTGCGCATGTTTGTCCGGATATATGGATCTTCAGCTCCAACTGCGCTGGCAAAACATATCAGCGAGGCTGAGGAAAAGTACGACAGATTATTAAAAGCTTTGG
ATCCTCATTTATCCTCACTTTACCAAAGGAGATGTGAAGAGGCCACTAAAGAAGGTGGTAAGGTGTCAGCACATCCATTCGGCTCGTGGAGTATGCCACCTCTGATACTA
GACGAGGAGTCGTTTCGGTCATCTGTTAGTGCTGGTTCTGCTCGTTACTCCAAAACTGATCAGCAGATCGACCAAATCAGTCCCCCGACATATCGTCAGATTTCGAAGGC
CATTTGGTGCCATGGTTGGACTTCTAACCAGCCTGGCAAAAGACGTTCAACAAACAATTCTTGTTCTTTATGTAATTTGATCTATCTGGATGACATTTGGATATTGATTG
CTGAATTATATGGAGTTAATGAGCTTGAGATTGGTTTTATTTTGGGTTCCGGTGGGCGGCAACGGCAATGGATTTCCGCTAGCGGAGGGCGCCGACGAGGAGGAACGTTG
TGGCGTGGTCTGTCTGGGAGACCAGAATCCGTCCACCTCAGGATTACTTCGTCACCTTCCATTTTCGATAGTGTCGCGACTCAACCTCCCTCCTTCGCCGTCTCCTTCGC
CTCCGCCGTTTGTCTCATTCGCCTTCGCCGTCTGTATCCTTCGCCGTCCGTCTCCTCCGTCGGCTGCTGTAATGTCGCGTCTCTTTGA
Protein sequenceShow/hide protein sequence
MEELRKLEQVQKLITLMESRGIAKASLSSSNRFLANFVLLLVQPCGELDFDDKFDLVSEYIPKFSEEFLGDASLLLTDGGYRGMEMENTLKPYCDNKLDFGSIQNNKGEL
AMVGLDAMQRANSTLEDFVERTSKGRGFFCKQGSWKTMAGKLEPWFQLLRQKNIAFLAQGVWRFQLEITTYGIWSYRQFRSYFMFHGMDVNKPQSVFKYLPVLSFTESYI
YQLDTLNEEIVSGDSIFGESQEANERGNKFLNAIKFDPLQPLIDVLKSHGLVTDRLIHELRCGEEYWALERDLCGALVSKGKVSVEDVMRAIHLKSFDYRVLNLLLYQLR
GEQVKDLHMEFLSISELLVEIADDLFDYEDDVLENNFNILRMFVRIYGSSAPTALAKHISEAEEKYDRLLKALDPHLSSLYQRRCEEATKEGGKVSAHPFGSWSMPPLIL
DEESFRSSVSAGSARYSKTDQQIDQISPPTYRQISKAIWCHGWTSNQPGKRRSTNNSCSLCNLIYLDDIWILIAELYGVNELEIGFILGSGGRQRQWISASGGRRRGGTL
WRGLSGRPESVHLRITSSPSIFDSVATQPPSFAVSFASAVCLIRLRRLYPSPSVSSVGCCNVASL