; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS006871 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS006871
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionMajor facilitator superfamily protein
Genome locationscaffold871:300563..302218
RNA-Seq ExpressionMS006871
SyntenyMS006871
Gene Ontology termsGO:0055085 - transmembrane transport (biological process)
GO:0016021 - integral component of membrane (cellular component)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
GO:0022857 - transmembrane transporter activity (molecular function)
InterPro domainsIPR010658 - Nodulin-like
IPR036259 - MFS transporter superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6583549.1 Protein NUCLEAR FUSION DEFECTIVE 4, partial [Cucurbita argyrosperma subsp. sororia]8.7e-22168.35Show/hide
Query:  EGWRFAKQVADGRWFSVFASFVIMIGAGSPYLFGTYSKVIKTQFDYSQTQVNTLGFSKDLGSNLGVFAGLLAEVAPPWVLFVVGSSLNFFSFIMIWLSVT
        E WRF KQV +GRWF+VFASF+IMIGAGS Y+FGTYSK IK+QFDY+QTQ+NTLGF+KDLGSN GVFAGLL EVAPPWVLF+VGS+LNF+S+ MIWLS+T
Subjt:  EGWRFAKQVADGRWFSVFASFVIMIGAGSPYLFGTYSKVIKTQFDYSQTQVNTLGFSKDLGSNLGVFAGLLAEVAPPWVLFVVGSSLNFFSFIMIWLSVT

Query:  HRIAKPEFWQMFLYVGLAANAQNFANTAVMVISVKNFPDRRGIIIGLLKGFVGLGGAVLTQIYLAIYGHEDPSDVLLLLSWFPSVISIVLLFSIRTIRAR
         RI+KP FW+MF+++ +AAN+QNFANTAV+V SV+NFPDRRGII+GLLKGFVG GGA+LTQ YLA+YGH++P++++LLLSW P+V S++   SIR I+ R
Subjt:  HRIAKPEFWQMFLYVGLAANAQNFANTAVMVISVKNFPDRRGIIIGLLKGFVGLGGAVLTQIYLAIYGHEDPSDVLLLLSWFPSVISIVLLFSIRTIRAR

Query:  KHPEELKVFYHLLYASIILAISILIITLTQKHIIFSHAAYLGSVAVVVASLCLPLLIAIKEEFLLFKLNNQAV--DPSVTLSIPEEKQSLPYSNILLSTV
        K P+EL+V YHLLY SIILA+ +L +T+TQK  +F+  AY+    V++  L +PLLIAI+EEF+LFKLN Q +  +P+V + +PE + S P +   +  +
Subjt:  KHPEELKVFYHLLYASIILAISILIITLTQKHIIFSHAAYLGSVAVVVASLCLPLLIAIKEEFLLFKLNNQAV--DPSVTLSIPEEKQSLPYSNILLSTV

Query:  LEKDSP------SKPERGEDFTILQALLSKDMALVFLATVSACGSSIVAIDNLGQIAESLAYPAQSISVFVSWISIFNFFGRVFSGFISETLMTKYKLPR
         E  +        KP+RGEDFTILQAL SKDMALVF+AT++ACG+SI AIDNLGQ+ ESL YP+Q+IS+FVSW+SIFNFFGRVFSGFISE LMTKYKLPR
Subjt:  LEKDSP------SKPERGEDFTILQALLSKDMALVFLATVSACGSSIVAIDNLGQIAESLAYPAQSISVFVSWISIFNFFGRVFSGFISETLMTKYKLPR

Query:  PLMFALAHLLTCIGILFIAFPYHGSVYVASLIIGSGFGAQVPLLFAIISDLFGLKHYSTLLNCGQLAVPLGSYILNVLLVGRMYDMEATRAGNTKHGKGL
        PLMFA +HLLTCIGILFIAFPY GSVY ASLIIG GFGAQVP+LFAIIS+LFGLK+YST+ NCGQLAVP+GSYILNV ++G++YD+EA + G  K+GKGL
Subjt:  PLMFALAHLLTCIGILFIAFPYHGSVYVASLIIGSGFGAQVPLLFAIISDLFGLKHYSTLLNCGQLAVPLGSYILNVLLVGRMYDMEATRAGNTKHGKGL

Query:  TCNGPHCFGGSFTILAAITLFGAMSSLALACRTRNFYKGDIYKRYRDDMWITQSDM
        TC G HCF GSF ILA + LFGA++SL LA RTR+FYKGD+Y RYR+DMWI QSDM
Subjt:  TCNGPHCFGGSFTILAAITLFGAMSSLALACRTRNFYKGDIYKRYRDDMWITQSDM

XP_022152848.1 uncharacterized membrane protein YMR155W-like [Momordica charantia]3.0e-30699.09Show/hide
Query:  MCAAEGWRFAKQVADGRWFSVFASFVIMIGAGSPYLFGTYSKVIKTQFDYSQTQVNTLGFSKDLGSNLGVFAGLLAEVAPPWVLFVVGSSLNFFSFIMIW
        MCAAEGWRFAKQVADGRWFSVFASFVIMIGAGSPYLFGTYSKVIKTQFDYSQTQVNTLGFSKDLGSNLGVFAGLLAEVAPPWVLFVVGSSLNFFSFIMIW
Subjt:  MCAAEGWRFAKQVADGRWFSVFASFVIMIGAGSPYLFGTYSKVIKTQFDYSQTQVNTLGFSKDLGSNLGVFAGLLAEVAPPWVLFVVGSSLNFFSFIMIW

Query:  LSVTHRIAKPEFWQMFLYVGLAANAQNFANTAVMVISVKNFPDRRGIIIGLLKGFVGLGGAVLTQIYLAIYGHEDPSDVLLLLSWFPSVISIVLLFSIRT
        LSVTHRIAKPEFWQMFLYVGLAANAQNFANTAVMVISVKNFPDRRGIIIGLLKGFVGLGGAVLTQIYLAIYGHEDPSDVLLLLSWFPSVISIVLLFSIRT
Subjt:  LSVTHRIAKPEFWQMFLYVGLAANAQNFANTAVMVISVKNFPDRRGIIIGLLKGFVGLGGAVLTQIYLAIYGHEDPSDVLLLLSWFPSVISIVLLFSIRT

Query:  IRARKHPEELKVFYHLLYASIILAISILIITLTQKHIIFSHAAYLGSVAVVVASLCLPLLIAIKEEFLLFKLNNQAVDPSVTLSIPEEKQSLPYSNILLS
        IRAR HPEELKVFYHLLYASIILAISILIITLTQKHI+FSHAAYLGSVAVVVA LCLPLLIAIKEEFLLFKLNNQAVDPSVTLSIPEEKQSLPYSNILLS
Subjt:  IRARKHPEELKVFYHLLYASIILAISILIITLTQKHIIFSHAAYLGSVAVVVASLCLPLLIAIKEEFLLFKLNNQAVDPSVTLSIPEEKQSLPYSNILLS

Query:  TVLEKDSPSKPERGEDFTILQALLSKDMALVFLATVSACGSSIVAIDNLGQIAESLAYPAQSISVFVSWISIFNFFGRVFSGFISETLMTKYKLPRPLMF
        TVLEKDSPSKPERGEDFTILQALLSKDMALVFLATVSACGSSIVAIDNLGQIAESLAYPAQSISVFVSWISIFNFFGRVFSGFISETLMTKYKLPRPLMF
Subjt:  TVLEKDSPSKPERGEDFTILQALLSKDMALVFLATVSACGSSIVAIDNLGQIAESLAYPAQSISVFVSWISIFNFFGRVFSGFISETLMTKYKLPRPLMF

Query:  ALAHLLTCIGILFIAFPYHGSVYVASLIIGSGFGAQVPLLFAIISDLFGLKHYSTLLNCGQLAVPLGSYILNVLLVGRMYDMEATRAGNTKHGKGLTCNG
        ALAHLLTCIGILFIAFPYHGSVYVASLIIGSGFGAQVPLLFAIISDLFGLKHYSTLLNCGQLAVPLGSYILNVLLVGRMYDMEA RAGNTKHGKGLTCNG
Subjt:  ALAHLLTCIGILFIAFPYHGSVYVASLIIGSGFGAQVPLLFAIISDLFGLKHYSTLLNCGQLAVPLGSYILNVLLVGRMYDMEATRAGNTKHGKGLTCNG

Query:  PHCFGGSFTILAAITLFGAMSSLALACRTRNFYKGDIYKRYRDDMWITQSDM
        PHCFGGSFTILAAITLFGAMSSLALACRTRNFYKGDIYKRYRDDMWIT+SDM
Subjt:  PHCFGGSFTILAAITLFGAMSSLALACRTRNFYKGDIYKRYRDDMWITQSDM

XP_022152849.1 uncharacterized protein LOC111020473 [Momordica charantia]9.5e-22873.82Show/hide
Query:  AAEGWRFAKQVADGRWFSVFASFVIMIGAGSPYLFGTYSKVIKTQFDYSQTQVNTLGFSKDLGSNLGVFAGLLAEVAPPWVLFVVGSSLNFFSFIMIWLS
        AA  W F KQVA+GRWFSVFASF+IMIGAGS YLFGTYSK++KTQFDY QTQVNTLGF+KDLGSN GVFAGLLAEVAP WVLF+VGSSLNFFS+ MIWLS
Subjt:  AAEGWRFAKQVADGRWFSVFASFVIMIGAGSPYLFGTYSKVIKTQFDYSQTQVNTLGFSKDLGSNLGVFAGLLAEVAPPWVLFVVGSSLNFFSFIMIWLS

Query:  VTHRIAKPEFWQMFLYVGLAANAQNFANTAVMVISVKNFPDRRGIIIGLLKGFVGLGGAVLTQIYLAIYGHEDPSDVLLLLSWFPSVISIVLLFSIRTIR
        VT RIA P+FW M LY+ LAAN+QNFANTAV+V SVKNFP+RRGII+GLLKGFVG+GGA+LTQ+YLAIYGHE+ + V+LLL+WFPSVIS V + SIRTI+
Subjt:  VTHRIAKPEFWQMFLYVGLAANAQNFANTAVMVISVKNFPDRRGIIIGLLKGFVGLGGAVLTQIYLAIYGHEDPSDVLLLLSWFPSVISIVLLFSIRTIR

Query:  ARKHPEELKVFYHLLYASIILAISILIITLTQKHIIFSHAAYLGSVAVVVASLCLPLLIAIKEEFLLFKLNNQAVDPSVTLSIPEEKQSLPYSNILLSTV
         RKHPEELKVFYHLLY SIILA+ IL +TLTQK  +FSHA Y+G  +V+V  LCLPLLIAI+EEF+LF LN Q  D  V + I ++K   P    L    
Subjt:  ARKHPEELKVFYHLLYASIILAISILIITLTQKHIIFSHAAYLGSVAVVVASLCLPLLIAIKEEFLLFKLNNQAVDPSVTLSIPEEKQSLPYSNILLSTV

Query:  LEKDSPSKPERGEDFTILQALLSKDMALVFLATVSACGSSIVAIDNLGQIAESLAYPAQSISVFVSWISIFNFFGRVFSGFISETLMTKYKLPRPLMFAL
        ++  S  KPERG+DFTI QALLSKDMAL+F+AT+S CGSSI AIDN+GQIAESL YP  SI++FVSW+SIFNFFGRVFSGFISETLM KYKLPRPLMFA 
Subjt:  LEKDSPSKPERGEDFTILQALLSKDMALVFLATVSACGSSIVAIDNLGQIAESLAYPAQSISVFVSWISIFNFFGRVFSGFISETLMTKYKLPRPLMFAL

Query:  AHLLTCIGILFIAFPYHGSVYVASLIIGSGFGAQVPLLFAIISDLFGLKHYSTLLNCGQLAVPLGSYILNVLLVGRMYDMEATRAGNTKHGKGLTCNGPH
        + LLTCIG L IAFPY GSVY ASLIIG GFGAQVPLLFAIIS+LFGLKHYSTL NCGQLAVP GSYILNV ++G++YDMEA R G  K GKGLTC G +
Subjt:  AHLLTCIGILFIAFPYHGSVYVASLIIGSGFGAQVPLLFAIISDLFGLKHYSTLLNCGQLAVPLGSYILNVLLVGRMYDMEATRAGNTKHGKGLTCNGPH

Query:  CFGGSFTILAAITLFGAMSSLALACRTRNFYKGDIYKRYRDDMWITQSDM
        CF  SFTILAA+T FGA+SSL LA RTR+FYKG+ YK+YR+DMWI ++DM
Subjt:  CFGGSFTILAAITLFGAMSSLALACRTRNFYKGDIYKRYRDDMWITQSDM

XP_022970242.1 uncharacterized protein LOC111469251 [Cucurbita maxima]8.7e-22167.99Show/hide
Query:  EGWRFAKQVADGRWFSVFASFVIMIGAGSPYLFGTYSKVIKTQFDYSQTQVNTLGFSKDLGSNLGVFAGLLAEVAPPWVLFVVGSSLNFFSFIMIWLSVT
        E WRF KQV +GRWF+VFASF+IMIGAGS Y+FGTYSK IK+QFDY+QTQ+NTLGF+KDLGSN GVFAGLL EVAPPWVLF+VGS+LNF+S+ MIWLS+T
Subjt:  EGWRFAKQVADGRWFSVFASFVIMIGAGSPYLFGTYSKVIKTQFDYSQTQVNTLGFSKDLGSNLGVFAGLLAEVAPPWVLFVVGSSLNFFSFIMIWLSVT

Query:  HRIAKPEFWQMFLYVGLAANAQNFANTAVMVISVKNFPDRRGIIIGLLKGFVGLGGAVLTQIYLAIYGHEDPSDVLLLLSWFPSVISIVLLFSIRTIRAR
         RI KP FW+MF+++ +AAN+QNFANTA++V SV+NFPDRRGII+GLLKGFVG GGA+LTQ YLA+YGH++P++++LLLSW P+V+S++   SIR I+ R
Subjt:  HRIAKPEFWQMFLYVGLAANAQNFANTAVMVISVKNFPDRRGIIIGLLKGFVGLGGAVLTQIYLAIYGHEDPSDVLLLLSWFPSVISIVLLFSIRTIRAR

Query:  KHPEELKVFYHLLYASIILAISILIITLTQKHIIFSHAAYLGSVAVVVASLCLPLLIAIKEEFLLFKLNNQAV--DPSVTLSIPEEKQSLPYSNILLSTV
        KHP+EL+V YHLLY SIILA+ +L +T+TQK  +F+  AY+    V++  L +PLLIAI+EEFLLFKLN Q +  +P+V + +PE + S P +   +  +
Subjt:  KHPEELKVFYHLLYASIILAISILIITLTQKHIIFSHAAYLGSVAVVVASLCLPLLIAIKEEFLLFKLNNQAV--DPSVTLSIPEEKQSLPYSNILLSTV

Query:  LEKDSP------SKPERGEDFTILQALLSKDMALVFLATVSACGSSIVAIDNLGQIAESLAYPAQSISVFVSWISIFNFFGRVFSGFISETLMTKYKLPR
         E  +        +P+RGEDFTILQAL SKDMALVF+AT+SACG+SI AIDNLGQ+ ESL YP+Q+IS+FVSW+SIFNFFGRVFSGFIS+ LMTKYKLPR
Subjt:  LEKDSP------SKPERGEDFTILQALLSKDMALVFLATVSACGSSIVAIDNLGQIAESLAYPAQSISVFVSWISIFNFFGRVFSGFISETLMTKYKLPR

Query:  PLMFALAHLLTCIGILFIAFPYHGSVYVASLIIGSGFGAQVPLLFAIISDLFGLKHYSTLLNCGQLAVPLGSYILNVLLVGRMYDMEATRAGNTKHGKGL
        PLMFA +H+LTC+GILFIAFPY GSVY ASLIIG GFGAQVP+LFAIIS+LFGLK+YST+ NCGQLAVP+GSYILNV ++GR+YD+EA + G  K+GKGL
Subjt:  PLMFALAHLLTCIGILFIAFPYHGSVYVASLIIGSGFGAQVPLLFAIISDLFGLKHYSTLLNCGQLAVPLGSYILNVLLVGRMYDMEATRAGNTKHGKGL

Query:  TCNGPHCFGGSFTILAAITLFGAMSSLALACRTRNFYKGDIYKRYRDDMWITQSDM
        TC G  CF GSF ILA + LFGA++SL LA RTR+FYKGD+Y RYR+DMWI QSDM
Subjt:  TCNGPHCFGGSFTILAAITLFGAMSSLALACRTRNFYKGDIYKRYRDDMWITQSDM

XP_038895745.1 uncharacterized protein LOC120083909 [Benincasa hispida]4.6e-22270.68Show/hide
Query:  AEGWRFAKQVADGRWFSVFASFVIMIGAGSPYLFGTYSKVIKTQFDYSQTQVNTLGFSKDLGSNLGVFAGLLAEVAPPWVLFVVGSSLNFFSFIMIWLSV
        AE WRF KQV +GRWFSVFA+F+IMIG GS YLFGTYSKV+KT+FDY+QTQ++ LGF+KDLGSNLGVFAGL AEVAPPW+LF+VG +LNFFS+ MIWLSV
Subjt:  AEGWRFAKQVADGRWFSVFASFVIMIGAGSPYLFGTYSKVIKTQFDYSQTQVNTLGFSKDLGSNLGVFAGLLAEVAPPWVLFVVGSSLNFFSFIMIWLSV

Query:  THRIAKPEFWQMFLYVGLAANAQNFANTAVMVISVKNFPDRRGIIIGLLKGFVGLGGAVLTQIYLAIYGHEDPSDVLLLLSWFPSVISIVLLFSIRTIRA
        T  + KPE W MF+Y+ ++ANAQNFANTAVMV SV+NFPD+RGII+GLLKGFVGLGGA+LTQIY AIYGH+DP +++LLLSW PS+I  +   S RTI+A
Subjt:  THRIAKPEFWQMFLYVGLAANAQNFANTAVMVISVKNFPDRRGIIIGLLKGFVGLGGAVLTQIYLAIYGHEDPSDVLLLLSWFPSVISIVLLFSIRTIRA

Query:  RKHPEELKVFYHLLYASIILAISILIITLTQKHIIFSHAAYLGSVAVVVASLCLPLLIAIKEEFLLFKLNNQAVDPSVTLSIPEEKQSLPYSNILLSTVL
        RKHP+ELKVF+HLLY SI +A+ IL +T+TQKH  F+HA Y+G  +V+V  LCLPLLIAIKEE  LFKLN Q  DPSV +SIP +K         L  + 
Subjt:  RKHPEELKVFYHLLYASIILAISILIITLTQKHIIFSHAAYLGSVAVVVASLCLPLLIAIKEEFLLFKLNNQAVDPSVTLSIPEEKQSLPYSNILLSTVL

Query:  EKDSP-------SKPERGEDFTILQALLSKDMALVFLATVSACGSSIVAIDNLGQIAESLAYPAQSISVFVSWISIFNFFGRVFSGFISETLMTKYKLPR
        E  SP       +KP+RGEDF I+QAL SKDMAL+F+ATVSACGSS+ AIDNLGQIAESL YP+++I+VFVSWISIFNFFGRV SGF+SETLMTKYKLPR
Subjt:  EKDSP-------SKPERGEDFTILQALLSKDMALVFLATVSACGSSIVAIDNLGQIAESLAYPAQSISVFVSWISIFNFFGRVFSGFISETLMTKYKLPR

Query:  PLMFALAHLLTCIGILFIAFPYHGSVYVASLIIGSGFGAQVPLLFAIISDLFGLKHYSTLLNCGQLAVPLGSYILNVLLVGRMYDMEATRAGNTKHGKGL
        PLMF L  ++T IG++ IAFPY  SVY ASLIIG GFGAQ PLLFA+ISDLFGLKHYSTLLNCGQLAVP GSYI+NV +VG+ YD EAT+ GN K+GKGL
Subjt:  PLMFALAHLLTCIGILFIAFPYHGSVYVASLIIGSGFGAQVPLLFAIISDLFGLKHYSTLLNCGQLAVPLGSYILNVLLVGRMYDMEATRAGNTKHGKGL

Query:  TCNGPHCFGGSFTILAAITLFGAMSSLALACRTRNFYKGDIYKRYRDD-MWITQSD
        TC G HCF  SF IL  +TLFGAM+S  LA RTR FYKGDIYKRYRDD MW TQSD
Subjt:  TCNGPHCFGGSFTILAAITLFGAMSSLALACRTRNFYKGDIYKRYRDD-MWITQSD

TrEMBL top hitse value%identityAlignment
A0A6J1DG00 uncharacterized membrane protein YMR155W-like1.5e-30699.09Show/hide
Query:  MCAAEGWRFAKQVADGRWFSVFASFVIMIGAGSPYLFGTYSKVIKTQFDYSQTQVNTLGFSKDLGSNLGVFAGLLAEVAPPWVLFVVGSSLNFFSFIMIW
        MCAAEGWRFAKQVADGRWFSVFASFVIMIGAGSPYLFGTYSKVIKTQFDYSQTQVNTLGFSKDLGSNLGVFAGLLAEVAPPWVLFVVGSSLNFFSFIMIW
Subjt:  MCAAEGWRFAKQVADGRWFSVFASFVIMIGAGSPYLFGTYSKVIKTQFDYSQTQVNTLGFSKDLGSNLGVFAGLLAEVAPPWVLFVVGSSLNFFSFIMIW

Query:  LSVTHRIAKPEFWQMFLYVGLAANAQNFANTAVMVISVKNFPDRRGIIIGLLKGFVGLGGAVLTQIYLAIYGHEDPSDVLLLLSWFPSVISIVLLFSIRT
        LSVTHRIAKPEFWQMFLYVGLAANAQNFANTAVMVISVKNFPDRRGIIIGLLKGFVGLGGAVLTQIYLAIYGHEDPSDVLLLLSWFPSVISIVLLFSIRT
Subjt:  LSVTHRIAKPEFWQMFLYVGLAANAQNFANTAVMVISVKNFPDRRGIIIGLLKGFVGLGGAVLTQIYLAIYGHEDPSDVLLLLSWFPSVISIVLLFSIRT

Query:  IRARKHPEELKVFYHLLYASIILAISILIITLTQKHIIFSHAAYLGSVAVVVASLCLPLLIAIKEEFLLFKLNNQAVDPSVTLSIPEEKQSLPYSNILLS
        IRAR HPEELKVFYHLLYASIILAISILIITLTQKHI+FSHAAYLGSVAVVVA LCLPLLIAIKEEFLLFKLNNQAVDPSVTLSIPEEKQSLPYSNILLS
Subjt:  IRARKHPEELKVFYHLLYASIILAISILIITLTQKHIIFSHAAYLGSVAVVVASLCLPLLIAIKEEFLLFKLNNQAVDPSVTLSIPEEKQSLPYSNILLS

Query:  TVLEKDSPSKPERGEDFTILQALLSKDMALVFLATVSACGSSIVAIDNLGQIAESLAYPAQSISVFVSWISIFNFFGRVFSGFISETLMTKYKLPRPLMF
        TVLEKDSPSKPERGEDFTILQALLSKDMALVFLATVSACGSSIVAIDNLGQIAESLAYPAQSISVFVSWISIFNFFGRVFSGFISETLMTKYKLPRPLMF
Subjt:  TVLEKDSPSKPERGEDFTILQALLSKDMALVFLATVSACGSSIVAIDNLGQIAESLAYPAQSISVFVSWISIFNFFGRVFSGFISETLMTKYKLPRPLMF

Query:  ALAHLLTCIGILFIAFPYHGSVYVASLIIGSGFGAQVPLLFAIISDLFGLKHYSTLLNCGQLAVPLGSYILNVLLVGRMYDMEATRAGNTKHGKGLTCNG
        ALAHLLTCIGILFIAFPYHGSVYVASLIIGSGFGAQVPLLFAIISDLFGLKHYSTLLNCGQLAVPLGSYILNVLLVGRMYDMEA RAGNTKHGKGLTCNG
Subjt:  ALAHLLTCIGILFIAFPYHGSVYVASLIIGSGFGAQVPLLFAIISDLFGLKHYSTLLNCGQLAVPLGSYILNVLLVGRMYDMEATRAGNTKHGKGLTCNG

Query:  PHCFGGSFTILAAITLFGAMSSLALACRTRNFYKGDIYKRYRDDMWITQSDM
        PHCFGGSFTILAAITLFGAMSSLALACRTRNFYKGDIYKRYRDDMWIT+SDM
Subjt:  PHCFGGSFTILAAITLFGAMSSLALACRTRNFYKGDIYKRYRDDMWITQSDM

A0A6J1DHB0 uncharacterized protein LOC1110204734.6e-22873.82Show/hide
Query:  AAEGWRFAKQVADGRWFSVFASFVIMIGAGSPYLFGTYSKVIKTQFDYSQTQVNTLGFSKDLGSNLGVFAGLLAEVAPPWVLFVVGSSLNFFSFIMIWLS
        AA  W F KQVA+GRWFSVFASF+IMIGAGS YLFGTYSK++KTQFDY QTQVNTLGF+KDLGSN GVFAGLLAEVAP WVLF+VGSSLNFFS+ MIWLS
Subjt:  AAEGWRFAKQVADGRWFSVFASFVIMIGAGSPYLFGTYSKVIKTQFDYSQTQVNTLGFSKDLGSNLGVFAGLLAEVAPPWVLFVVGSSLNFFSFIMIWLS

Query:  VTHRIAKPEFWQMFLYVGLAANAQNFANTAVMVISVKNFPDRRGIIIGLLKGFVGLGGAVLTQIYLAIYGHEDPSDVLLLLSWFPSVISIVLLFSIRTIR
        VT RIA P+FW M LY+ LAAN+QNFANTAV+V SVKNFP+RRGII+GLLKGFVG+GGA+LTQ+YLAIYGHE+ + V+LLL+WFPSVIS V + SIRTI+
Subjt:  VTHRIAKPEFWQMFLYVGLAANAQNFANTAVMVISVKNFPDRRGIIIGLLKGFVGLGGAVLTQIYLAIYGHEDPSDVLLLLSWFPSVISIVLLFSIRTIR

Query:  ARKHPEELKVFYHLLYASIILAISILIITLTQKHIIFSHAAYLGSVAVVVASLCLPLLIAIKEEFLLFKLNNQAVDPSVTLSIPEEKQSLPYSNILLSTV
         RKHPEELKVFYHLLY SIILA+ IL +TLTQK  +FSHA Y+G  +V+V  LCLPLLIAI+EEF+LF LN Q  D  V + I ++K   P    L    
Subjt:  ARKHPEELKVFYHLLYASIILAISILIITLTQKHIIFSHAAYLGSVAVVVASLCLPLLIAIKEEFLLFKLNNQAVDPSVTLSIPEEKQSLPYSNILLSTV

Query:  LEKDSPSKPERGEDFTILQALLSKDMALVFLATVSACGSSIVAIDNLGQIAESLAYPAQSISVFVSWISIFNFFGRVFSGFISETLMTKYKLPRPLMFAL
        ++  S  KPERG+DFTI QALLSKDMAL+F+AT+S CGSSI AIDN+GQIAESL YP  SI++FVSW+SIFNFFGRVFSGFISETLM KYKLPRPLMFA 
Subjt:  LEKDSPSKPERGEDFTILQALLSKDMALVFLATVSACGSSIVAIDNLGQIAESLAYPAQSISVFVSWISIFNFFGRVFSGFISETLMTKYKLPRPLMFAL

Query:  AHLLTCIGILFIAFPYHGSVYVASLIIGSGFGAQVPLLFAIISDLFGLKHYSTLLNCGQLAVPLGSYILNVLLVGRMYDMEATRAGNTKHGKGLTCNGPH
        + LLTCIG L IAFPY GSVY ASLIIG GFGAQVPLLFAIIS+LFGLKHYSTL NCGQLAVP GSYILNV ++G++YDMEA R G  K GKGLTC G +
Subjt:  AHLLTCIGILFIAFPYHGSVYVASLIIGSGFGAQVPLLFAIISDLFGLKHYSTLLNCGQLAVPLGSYILNVLLVGRMYDMEATRAGNTKHGKGLTCNGPH

Query:  CFGGSFTILAAITLFGAMSSLALACRTRNFYKGDIYKRYRDDMWITQSDM
        CF  SFTILAA+T FGA+SSL LA RTR+FYKG+ YK+YR+DMWI ++DM
Subjt:  CFGGSFTILAAITLFGAMSSLALACRTRNFYKGDIYKRYRDDMWITQSDM

A0A6J1HIM6 uncharacterized protein LOC1114647771.2e-22068.17Show/hide
Query:  EGWRFAKQVADGRWFSVFASFVIMIGAGSPYLFGTYSKVIKTQFDYSQTQVNTLGFSKDLGSNLGVFAGLLAEVAPPWVLFVVGSSLNFFSFIMIWLSVT
        E WRF KQV +GRWF+VFASF+IMIGAGS Y+FGTYSK IK+QFDY+QTQ+NTLGF+KDLGSN GVFAGLL EVAPPWVLF+VGS+LNF+S+ MIWLS+T
Subjt:  EGWRFAKQVADGRWFSVFASFVIMIGAGSPYLFGTYSKVIKTQFDYSQTQVNTLGFSKDLGSNLGVFAGLLAEVAPPWVLFVVGSSLNFFSFIMIWLSVT

Query:  HRIAKPEFWQMFLYVGLAANAQNFANTAVMVISVKNFPDRRGIIIGLLKGFVGLGGAVLTQIYLAIYGHEDPSDVLLLLSWFPSVISIVLLFSIRTIRAR
         RI+KP FW+MF+++ +AAN+QNFANTA++V SV+NFPDRRGII+GLLKGFVG GGA+LTQ YLA+YGH++P++++LLLSW P+V S++   SIR I+ R
Subjt:  HRIAKPEFWQMFLYVGLAANAQNFANTAVMVISVKNFPDRRGIIIGLLKGFVGLGGAVLTQIYLAIYGHEDPSDVLLLLSWFPSVISIVLLFSIRTIRAR

Query:  KHPEELKVFYHLLYASIILAISILIITLTQKHIIFSHAAYLGSVAVVVASLCLPLLIAIKEEFLLFKLNNQAV--DPSVTLSIPEEKQSLPYSNILLSTV
        K P+EL+V YHLLY SIILA+ +L +T+TQK  +F+  AY+    V++  L +PLLIAI+EEF+LFKLN Q +  +P+V + +PE + S P +   +  +
Subjt:  KHPEELKVFYHLLYASIILAISILIITLTQKHIIFSHAAYLGSVAVVVASLCLPLLIAIKEEFLLFKLNNQAV--DPSVTLSIPEEKQSLPYSNILLSTV

Query:  LEKDSP------SKPERGEDFTILQALLSKDMALVFLATVSACGSSIVAIDNLGQIAESLAYPAQSISVFVSWISIFNFFGRVFSGFISETLMTKYKLPR
         E  +        KP+RGEDFTILQAL SKDMALVF+AT++ACG+SI AIDNLGQ+ ESL YP Q+IS+FVSW+SIFNFFGRVFSGFISE LMTKYKLPR
Subjt:  LEKDSP------SKPERGEDFTILQALLSKDMALVFLATVSACGSSIVAIDNLGQIAESLAYPAQSISVFVSWISIFNFFGRVFSGFISETLMTKYKLPR

Query:  PLMFALAHLLTCIGILFIAFPYHGSVYVASLIIGSGFGAQVPLLFAIISDLFGLKHYSTLLNCGQLAVPLGSYILNVLLVGRMYDMEATRAGNTKHGKGL
        PLMFA +HLLTCIGILFIAFPY GSVY ASLIIG GFGAQVP+LFAIIS+LFGLK+YST+ NCGQLAVP+GSYILNV ++G++YD+EA + G  K+GKGL
Subjt:  PLMFALAHLLTCIGILFIAFPYHGSVYVASLIIGSGFGAQVPLLFAIISDLFGLKHYSTLLNCGQLAVPLGSYILNVLLVGRMYDMEATRAGNTKHGKGL

Query:  TCNGPHCFGGSFTILAAITLFGAMSSLALACRTRNFYKGDIYKRYRDDMWITQSDM
        TC G HCF GSF ILA + LFGA++SL LA RTR+FYKGD+Y RYR+DMWI QSDM
Subjt:  TCNGPHCFGGSFTILAAITLFGAMSSLALACRTRNFYKGDIYKRYRDDMWITQSDM

A0A6J1HYK5 uncharacterized protein LOC1114692514.2e-22167.99Show/hide
Query:  EGWRFAKQVADGRWFSVFASFVIMIGAGSPYLFGTYSKVIKTQFDYSQTQVNTLGFSKDLGSNLGVFAGLLAEVAPPWVLFVVGSSLNFFSFIMIWLSVT
        E WRF KQV +GRWF+VFASF+IMIGAGS Y+FGTYSK IK+QFDY+QTQ+NTLGF+KDLGSN GVFAGLL EVAPPWVLF+VGS+LNF+S+ MIWLS+T
Subjt:  EGWRFAKQVADGRWFSVFASFVIMIGAGSPYLFGTYSKVIKTQFDYSQTQVNTLGFSKDLGSNLGVFAGLLAEVAPPWVLFVVGSSLNFFSFIMIWLSVT

Query:  HRIAKPEFWQMFLYVGLAANAQNFANTAVMVISVKNFPDRRGIIIGLLKGFVGLGGAVLTQIYLAIYGHEDPSDVLLLLSWFPSVISIVLLFSIRTIRAR
         RI KP FW+MF+++ +AAN+QNFANTA++V SV+NFPDRRGII+GLLKGFVG GGA+LTQ YLA+YGH++P++++LLLSW P+V+S++   SIR I+ R
Subjt:  HRIAKPEFWQMFLYVGLAANAQNFANTAVMVISVKNFPDRRGIIIGLLKGFVGLGGAVLTQIYLAIYGHEDPSDVLLLLSWFPSVISIVLLFSIRTIRAR

Query:  KHPEELKVFYHLLYASIILAISILIITLTQKHIIFSHAAYLGSVAVVVASLCLPLLIAIKEEFLLFKLNNQAV--DPSVTLSIPEEKQSLPYSNILLSTV
        KHP+EL+V YHLLY SIILA+ +L +T+TQK  +F+  AY+    V++  L +PLLIAI+EEFLLFKLN Q +  +P+V + +PE + S P +   +  +
Subjt:  KHPEELKVFYHLLYASIILAISILIITLTQKHIIFSHAAYLGSVAVVVASLCLPLLIAIKEEFLLFKLNNQAV--DPSVTLSIPEEKQSLPYSNILLSTV

Query:  LEKDSP------SKPERGEDFTILQALLSKDMALVFLATVSACGSSIVAIDNLGQIAESLAYPAQSISVFVSWISIFNFFGRVFSGFISETLMTKYKLPR
         E  +        +P+RGEDFTILQAL SKDMALVF+AT+SACG+SI AIDNLGQ+ ESL YP+Q+IS+FVSW+SIFNFFGRVFSGFIS+ LMTKYKLPR
Subjt:  LEKDSP------SKPERGEDFTILQALLSKDMALVFLATVSACGSSIVAIDNLGQIAESLAYPAQSISVFVSWISIFNFFGRVFSGFISETLMTKYKLPR

Query:  PLMFALAHLLTCIGILFIAFPYHGSVYVASLIIGSGFGAQVPLLFAIISDLFGLKHYSTLLNCGQLAVPLGSYILNVLLVGRMYDMEATRAGNTKHGKGL
        PLMFA +H+LTC+GILFIAFPY GSVY ASLIIG GFGAQVP+LFAIIS+LFGLK+YST+ NCGQLAVP+GSYILNV ++GR+YD+EA + G  K+GKGL
Subjt:  PLMFALAHLLTCIGILFIAFPYHGSVYVASLIIGSGFGAQVPLLFAIISDLFGLKHYSTLLNCGQLAVPLGSYILNVLLVGRMYDMEATRAGNTKHGKGL

Query:  TCNGPHCFGGSFTILAAITLFGAMSSLALACRTRNFYKGDIYKRYRDDMWITQSDM
        TC G  CF GSF ILA + LFGA++SL LA RTR+FYKGD+Y RYR+DMWI QSDM
Subjt:  TCNGPHCFGGSFTILAAITLFGAMSSLALACRTRNFYKGDIYKRYRDDMWITQSDM

A0A6J1I3B9 uncharacterized protein LOC1114692461.6e-22068.17Show/hide
Query:  EGWRFAKQVADGRWFSVFASFVIMIGAGSPYLFGTYSKVIKTQFDYSQTQVNTLGFSKDLGSNLGVFAGLLAEVAPPWVLFVVGSSLNFFSFIMIWLSVT
        E WRF KQV +GRWF+VFASF+IMIGAGS Y+FGTYSK IK+QF+Y+QTQ+NTLGF+KDLGSN GVFAGLL EVAPPWVLF+VGS+LNF+S+ MIWLS+T
Subjt:  EGWRFAKQVADGRWFSVFASFVIMIGAGSPYLFGTYSKVIKTQFDYSQTQVNTLGFSKDLGSNLGVFAGLLAEVAPPWVLFVVGSSLNFFSFIMIWLSVT

Query:  HRIAKPEFWQMFLYVGLAANAQNFANTAVMVISVKNFPDRRGIIIGLLKGFVGLGGAVLTQIYLAIYGHEDPSDVLLLLSWFPSVISIVLLFSIRTIRAR
         RIAKP FW+MFL++ +AAN+QNFANTA++V SV+NFPDRRGII+GLLKGFVG GGA+LTQ YLA+YGH++P++++LLLSW P+V+S++   SIR I+ R
Subjt:  HRIAKPEFWQMFLYVGLAANAQNFANTAVMVISVKNFPDRRGIIIGLLKGFVGLGGAVLTQIYLAIYGHEDPSDVLLLLSWFPSVISIVLLFSIRTIRAR

Query:  KHPEELKVFYHLLYASIILAISILIITLTQKHIIFSHAAYLGSVAVVVASLCLPLLIAIKEEFLLFKLNNQAV--DPSVTLSIPEEKQSLPYSNILLSTV
        KHP+EL+V YHLLY SIILA+ +L +T+TQK  +F+  AY+    V++  L +PLLIAI+EEFLLFKLN Q +  +P+V + +PE + S P +   +  +
Subjt:  KHPEELKVFYHLLYASIILAISILIITLTQKHIIFSHAAYLGSVAVVVASLCLPLLIAIKEEFLLFKLNNQAV--DPSVTLSIPEEKQSLPYSNILLSTV

Query:  LEKDSP------SKPERGEDFTILQALLSKDMALVFLATVSACGSSIVAIDNLGQIAESLAYPAQSISVFVSWISIFNFFGRVFSGFISETLMTKYKLPR
         E  +        +P+RGEDFTILQAL SKDMALVF+AT+SACG+SI AIDNLGQ+ ESL YP+Q+IS+FVSW+SIFNFFGRVFSGFISE LMTKYKLPR
Subjt:  LEKDSP------SKPERGEDFTILQALLSKDMALVFLATVSACGSSIVAIDNLGQIAESLAYPAQSISVFVSWISIFNFFGRVFSGFISETLMTKYKLPR

Query:  PLMFALAHLLTCIGILFIAFPYHGSVYVASLIIGSGFGAQVPLLFAIISDLFGLKHYSTLLNCGQLAVPLGSYILNVLLVGRMYDMEATRAGNTKHGKGL
        PLMFA +HLLTCIGILFIAFPY GSVY ASLIIG GFGAQVP++FAIIS+LFGLK+YST+ NCGQLAVP+GSYILNV ++G++YD+EA + G  K+GKGL
Subjt:  PLMFALAHLLTCIGILFIAFPYHGSVYVASLIIGSGFGAQVPLLFAIISDLFGLKHYSTLLNCGQLAVPLGSYILNVLLVGRMYDMEATRAGNTKHGKGL

Query:  TCNGPHCFGGSFTILAAITLFGAMSSLALACRTRNFYKGDIYKRYRDDMWITQSDM
        TC G  CF GSF ILA + LFGA+ SL LA RT++FYKGD+Y RYR+DMWI QSDM
Subjt:  TCNGPHCFGGSFTILAAITLFGAMSSLALACRTRNFYKGDIYKRYRDDMWITQSDM

SwissProt top hitse value%identityAlignment
F4I9E1 Protein NUCLEAR FUSION DEFECTIVE 41.6e-2825.37Show/hide
Query:  RWFSVFASFVIMIGAGSPYLFGTYSKVIKTQFDYSQTQVNTLGFSKDLGSNLGVFAGLLAEVAPPWVLFVVGSSLNFFSFIMIWLSVTHRIAKPEFWQMF
        +W  + A+  I    G+ + F  YS  +K+    SQ ++N L  + DLG   G  +G+     P  V+    +++ F  + + WL +T+ I  P +  +F
Subjt:  RWFSVFASFVIMIGAGSPYLFGTYSKVIKTQFDYSQTQVNTLGFSKDLGSNLGVFAGLLAEVAPPWVLFVVGSSLNFFSFIMIWLSVTHRIAKPEFWQMF

Query:  LYVGLAANAQNFANTAVMVISVKNFPDRRGIIIGLLKGFVGLGGAVLTQIYLAIYGHEDPSD---VLLLLSWFPSVISIVLLFSIRTIRARKHPEELKVF
        L   LA  +  + NTA  ++ +++FP+ R + + L   F G+  A+ +  + AI    +PS     LLL S  P V+S   L+ + T   +   +    +
Subjt:  LYVGLAANAQNFANTAVMVISVKNFPDRRGIIIGLLKGFVGLGGAVLTQIYLAIYGHEDPSD---VLLLLSWFPSVISIVLLFSIRTIRARKHPEELKVF

Query:  YHLLYASIILAISILIITLTQKHIIFSHAA-------YLGSVAVVVASLCLPLLIAIKEEFLLF---KLNNQAVDPSVTLSIPEEKQSLPYSNILLSTVL
            + S +  I  ++  +T  H++ S ++       ++G+V ++V  LC PLL+  ++ FL     +LN+++    V L+I E K     +++   T  
Subjt:  YHLLYASIILAISILIITLTQKHIIFSHAA-------YLGSVAVVVASLCLPLLIAIKEEFLLF---KLNNQAVDPSVTLSIPEEKQSLPYSNILLSTVL

Query:  EKDSPSKP----ERGEDFTILQALLSKDMALVFLATVSACGSSIVAIDNLGQIAESLAYPAQSISVFVSWISIFNFFGRVFSGFISETLMTKYKLPRPLM
        E    +K       G++ +    +   +  L ++A        +V  +NLGQIA+SL    Q+ +  V+  S F+FFGR+ S    + +  +++L R   
Subjt:  EKDSPSKP----ERGEDFTILQALLSKDMALVFLATVSACGSSIVAIDNLGQIAESLAYPAQSISVFVSWISIFNFFGRVFSGFISETLMTKYKLPRPLM

Query:  FALAHLLTCIG--ILFIAFPYHGSVYVASLIIGSGFGAQVPLLFAIISDLFGLKHYSTLLNCGQLAVPLGSYILNVLLVGRMYDMEATRAGNTKHGKGLT
        FA+A L T I   +L ++     ++  A+ +IG   G       +I SDLFG        N     +P+GS +L   +   +Y+  A+          + 
Subjt:  FALAHLLTCIG--ILFIAFPYHGSVYVASLIIGSGFGAQVPLLFAIISDLFGLKHYSTLLNCGQLAVPLGSYILNVLLVGRMYDMEATRAGNTKHGKGLT

Query:  CNGPHCFGGSFTILAAITLFGAMSSLALACRTRNFY
        C G  C+  +F     +++ G +SSL+L  RT+  Y
Subjt:  CNGPHCFGGSFTILAAITLFGAMSSLALACRTRNFY

Q03795 Uncharacterized membrane protein YMR155W2.4e-0822.08Show/hide
Query:  VIMIGAGSPYLFGTYSKVIKTQFDYSQTQVNTLGFSKDLGSNL-GVFAGLLAEVAPPWVLFVVGSSLNFFSFIMIWLSVTHRIAKPEFWQMFLYVGLAAN
        V+ +GAG+PYLF  Y+  + ++     +  + L FS  +GS+L G+ AG++ + +P  +  ++GS   F +++++ L   H       W     + L+  
Subjt:  VIMIGAGSPYLFGTYSKVIKTQFDYSQTQVNTLGFSKDLGSNL-GVFAGLLAEVAPPWVLFVVGSSLNFFSFIMIWLSVTHRIAKPEFWQMFLYVGLAAN

Query:  AQNFANTAVMVISVK----NFPDRRGIIIGLLKGFVGLGGAVLTQIYLAIYGHEDPSDVLLLLSWFPSVISIVLLFSIRTIRARKHPEELKVFYHLLYAS
           + + +    SVK    NFP  RG          GL G V + +   ++G E+   V + L      + +V  FS+  I +    ++  +    L  S
Subjt:  AQNFANTAVMVISVK----NFPDRRGIIIGLLKGFVGLGGAVLTQIYLAIYGHEDPSDVLLLLSWFPSVISIVLLFSIRTIRARKHPEELKVFYHLLYAS

Query:  IILAISILIITLTQKHIIFSHAAYLGSVAVVVASLCLPLLIAIKEEFLLFKLNNQAVDPSVTLSIPEEKQSLPYSNILLSTVLEKDSPSKPERGEDFTIL
                I+ L +    +  +    S      +  L        EF   +    +  P ++ S P  K      N   +TV E  +    ++     + 
Subjt:  IILAISILIITLTQKHIIFSHAAYLGSVAVVVASLCLPLLIAIKEEFLLFKLNNQAVDPSVTLSIPEEKQSLPYSNILLSTVLEKDSPSKPERGEDFTIL

Query:  QALLSKDMALVFLATVSACGSSIVAIDNLGQIAES---------LAYPAQSI-SVFVSWISIFNFFGRVFSGFISETLMTKYKLPRPLMFALAHLLTCIG
        Q+L S      ++      G  ++ I ++G + ++         L   A+ I S+ V+ +S+ +F GR+ SG IS+ L+ K+K  R     +A LL  + 
Subjt:  QALLSKDMALVFLATVSACGSSIVAIDNLGQIAES---------LAYPAQSI-SVFVSWISIFNFFGRVFSGFISETLMTKYKLPRPLMFALAHLLTCIG

Query:  ILFIAFPYHG-------------SVYVASLIIGSGFGAQVPLLFAIISDLFGLKHYSTLLNCGQLAVPLGSYILNVL--LVGRMYDMEATRAGNTKHGKG
           I+  +               ++ V S I G  FG       +I++D FG   YSTL     +    G + ++V   ++GR  D +A    NT    G
Subjt:  ILFIAFPYHG-------------SVYVASLIIGSGFGAQVPLLFAIISDLFGLKHYSTLLNCGQLAVPLGSYILNVL--LVGRMYDMEATRAGNTKHGKG

Query:  LTCNGPHCFGGSFTILAAITLFGAMSSLAL
            G  C+  +F +      F  +  L +
Subjt:  LTCNGPHCFGGSFTILAAITLFGAMSSLAL

Arabidopsis top hitse value%identityAlignment
AT2G28120.1 Major facilitator superfamily protein9.4e-15752Show/hide
Query:  RFAKQVADGRWFSVFASFVIMIGAGSPYLFGTYSKVIKTQFDYSQTQVNTLGFSKDLGSNLGVFAGLLAEVAPPWVLFVVGSSLNFFSFIMIWLSVTHRI
        RF     +GRWF VFASF+IM  AG+ YLFGTYSK IK+   Y QT +N LGF KDLG+N+GV +GL+AEV P W +  +GS++NF  + MIWL+VT ++
Subjt:  RFAKQVADGRWFSVFASFVIMIGAGSPYLFGTYSKVIKTQFDYSQTQVNTLGFSKDLGSNLGVFAGLLAEVAPPWVLFVVGSSLNFFSFIMIWLSVTHRI

Query:  AKPEFWQMFLYVGLAANAQNFANTAVMVISVKNFPDRRGIIIGLLKGFVGLGGAVLTQIYLAIYGHEDPSDVLLLLSWFPSVISIVLLFSIRTIRARKHP
        AKP+ WQM LY+ + AN+QNFANT  +V  VKNFP+ RG+++GLLKG+VGL GA+ TQ+Y AIYGH D   ++LL++W P+ +S+V ++ IR  +  +  
Subjt:  AKPEFWQMFLYVGLAANAQNFANTAVMVISVKNFPDRRGIIIGLLKGFVGLGGAVLTQIYLAIYGHEDPSDVLLLLSWFPSVISIVLLFSIRTIRARKHP

Query:  EELKVFYHLLYASIILAISILIITLTQKHIIFSHAAYLGSVAVVVASLCLPLLIAIKEEFLLFKLNNQAVDPSVTLSIPEEKQSLPYSNILLSTVLEKDS
         EL VFY  LY SI LA+ ++ + + +K + FS AAY  S  +  A L +PL +++K+E  ++ +    ++    + + + K+ L       + V  ++ 
Subjt:  EELKVFYHLLYASIILAISILIITLTQKHIIFSHAAYLGSVAVVVASLCLPLLIAIKEEFLLFKLNNQAVDPSVTLSIPEEKQSLPYSNILLSTVLEKDS

Query:  PSK---------PERGEDFTILQALLSKDMALVFLATVSACGSSIVAIDNLGQIAESLAYPAQSISVFVSWISIFNFFGRVFSGFISETLMTKYKLPRPL
         +K         P RGED+TILQALLS DM ++F+AT    GSS+ A+DNLGQI ESL YP  ++S FVS +SI+N+FGRVFSGF+SE L+ KYKLPRPL
Subjt:  PSK---------PERGEDFTILQALLSKDMALVFLATVSACGSSIVAIDNLGQIAESLAYPAQSISVFVSWISIFNFFGRVFSGFISETLMTKYKLPRPL

Query:  MFALAHLLTCIGILFIAFPYHGSVYVASLIIGSGFGAQVPLLFAIISDLFGLKHYSTLLNCGQLAVPLGSYILNVLLVGRMYDMEA-----TRAGNTKHG
        M  L  LL+C G L IAFP  GSVY+AS+++G  FGAQ+PLLFAIIS+LFGLK+YSTL NCGQLA PLGSYILNV + G +YD EA      R    K  
Subjt:  MFALAHLLTCIGILFIAFPYHGSVYVASLIIGSGFGAQVPLLFAIISDLFGLKHYSTLLNCGQLAVPLGSYILNVLLVGRMYDMEA-----TRAGNTKHG

Query:  KGLTCNGPHCFGGSFTILAAITLFGAMSSLALACRTRNFYKGDIYKRYRD
        K LTC G  C+   F ILAA+T FGA+ SL LA RTR FYKGDIYK++R+
Subjt:  KGLTCNGPHCFGGSFTILAAITLFGAMSSLALACRTRNFYKGDIYKRYRD

AT2G39210.1 Major facilitator superfamily protein3.5e-14348.04Show/hide
Query:  QVADGRWFSVFASFVIMIGAGSPYLFGTYSKVIKTQFDYSQTQVNTLGFSKDLGSNLGVFAGLLAEVAPPWVLFVVGSSLNFFSFIMIWLSVTHRIAKPE
        Q+  GRWF  F S +IM  AG+ Y+FG YS  IK    Y QT +N L F KDLG+N+GV AGLL EV PPW + ++G+ LNFF + MIWL+VT RI+KP+
Subjt:  QVADGRWFSVFASFVIMIGAGSPYLFGTYSKVIKTQFDYSQTQVNTLGFSKDLGSNLGVFAGLLAEVAPPWVLFVVGSSLNFFSFIMIWLSVTHRIAKPE

Query:  FWQMFLYVGLAANAQNFANTAVMVISVKNFPDRRGIIIGLLKGFVGLGGAVLTQIYLAIYGHEDPSDVLLLLSWFPSVISIVLLFSIRTIRARKHPEELK
         W M LY+ + AN+Q+FANT  +V  VKNFP+ RG+++G+LKG+VGL GA++TQ+Y A YG ED  +++L++ W P+++S   L +IR ++ ++   ELK
Subjt:  FWQMFLYVGLAANAQNFANTAVMVISVKNFPDRRGIIIGLLKGFVGLGGAVLTQIYLAIYGHEDPSDVLLLLSWFPSVISIVLLFSIRTIRARKHPEELK

Query:  VFYHLLYASIILAISILIITLTQKHIIFSHAAYLGSVAVVVASLCLPLLIAIKEEFLLFKLNNQAVDPSVTLSIPEEKQSLPYSNIL------LSTVLEK
        VFY+ LY S+ LA  ++++ +  K   F+ + + GS AVV+  L LP+++ I EE  L+K    A++    +++  EK  L  S            V+EK
Subjt:  VFYHLLYASIILAISILIITLTQKHIIFSHAAYLGSVAVVVASLCLPLLIAIKEEFLLFKLNNQAVDPSVTLSIPEEKQSLPYSNIL------LSTVLEK

Query:  -DSPS-------KPERGEDFTILQALLSKDMALVFLATVSACGSSIVAIDNLGQIAESLAYPAQSISVFVSWISIFNFFGRVFSGFISETLMTKYKLPRP
          +PS        PERG+D+TILQAL S DM ++FLAT+   G ++ AIDNLGQI  SL YP +S+S FVS +SI+N++GRV SG +SE  + KYK PRP
Subjt:  -DSPS-------KPERGEDFTILQALLSKDMALVFLATVSACGSSIVAIDNLGQIAESLAYPAQSISVFVSWISIFNFFGRVFSGFISETLMTKYKLPRP

Query:  LMFALAHLLTCIGILFIAFPYHGSVYVASLIIGSGFGAQVPLLFAIISDLFGLKHYSTLLNCGQLAVPLGSYILNVLLVGRMYDMEATR----AGNTK-H
        LM  +  LL+C G L IAF   G +YVAS+IIG  FGAQ PLLFAIIS++FGLK+YSTL N G +A P+GSY+LNV + G +YD+EA +     G T+  
Subjt:  LMFALAHLLTCIGILFIAFPYHGSVYVASLIIGSGFGAQVPLLFAIISDLFGLKHYSTLLNCGQLAVPLGSYILNVLLVGRMYDMEATR----AGNTK-H

Query:  GKGLTCNGPHCFGGSFTILAAITLFGAMSSLALACRTRNFYKGDIYKRYRDDMWITQSDM
        G+ L C G  CF  SF I+AA+TLFG + S+ L  RT+ FYK DIYK++R+     + +M
Subjt:  GKGLTCNGPHCFGGSFTILAAITLFGAMSSLALACRTRNFYKGDIYKRYRDDMWITQSDM

AT5G14120.1 Major facilitator superfamily protein1.5e-7732.69Show/hide
Query:  DGRWFSVFASFVIMIGAGSPYLFGTYSKVIKTQFDYSQTQVNTLGFSKDLGSNLGVFAGLLAEVAPPWVLFVVGSSLNFFSFIMIWLSVTHRIAKPEFWQ
        + RW    A+  I   AG  YLFG+ S VIK+  +Y+Q +++ LG +KDLG ++G  AG L+E+ P W   +VG+  N   +  +WL VT R      W 
Subjt:  DGRWFSVFASFVIMIGAGSPYLFGTYSKVIKTQFDYSQTQVNTLGFSKDLGSNLGVFAGLLAEVAPPWVLFVVGSSLNFFSFIMIWLSVTHRIAKPEFWQ

Query:  MFLYVGLAANAQNFANTAVMVISVKNFPDRRGIIIGLLKGFVGLGGAVLTQIYLAIYGHEDPSDVLLLLSWFPSVISIVLLFSIRTIRARK--HPEELKV
        M + + +  N + + NT  +V  V+NFP  RG ++G+LKGF GLGGA+++QIY  I+   +P+ ++L+++  P+V+ + L+F IR +   K   P +   
Subjt:  MFLYVGLAANAQNFANTAVMVISVKNFPDRRGIIIGLLKGFVGLGGAVLTQIYLAIYGHEDPSDVLLLLSWFPSVISIVLLFSIRTIRARK--HPEELKV

Query:  FYHLLYASIILAISILIITLTQKHIIFSHAAYLGSVAVVVASLCLPLLIAIKEEFLLFKLNNQAVDPSVTLSIP---EEKQSLPYSNILLSTVLEKDSPS
        F  +    ++LA  ++ + L Q  ++ SH        V+   L +P+L+ I   F  F   N+  D      +P   +++  L   +++LS V E + P 
Subjt:  FYHLLYASIILAISILIITLTQKHIIFSHAAYLGSVAVVVASLCLPLLIAIKEEFLLFKLNNQAVDPSVTLSIP---EEKQSLPYSNILLSTVLEKDSPS

Query:  K------------------------------------PERGEDFTILQALLSKDMALVFLATVSACGSSIVAIDNLGQIAESLAYPAQSISVFVSWISIF
                                             P RGEDFT+ QAL+  D  L+F + +   GS +  IDNLGQ+++SL Y   +  V VS ISI+
Subjt:  K------------------------------------PERGEDFTILQALLSKDMALVFLATVSACGSSIVAIDNLGQIAESLAYPAQSISVFVSWISIF

Query:  NFFGRVFSGFISETLMTKYKLPRPLMFALAHLLTCIGILFIAFPYHGSVYVASLIIGSGFGAQVPLLFAIISDLFGLKHYSTLLNCGQLAVPLGSYILNV
        NF GR+  G+ SE ++  Y  PRP+  A+A L+  +G +F A+ + G++Y+ +L+IG G+GA   ++ A  S+LFGLK +  L N   LA P GS + + 
Subjt:  NFFGRVFSGFISETLMTKYKLPRPLMFALAHLLTCIGILFIAFPYHGSVYVASLIIGSGFGAQVPLLFAIISDLFGLKHYSTLLNCGQLAVPLGSYILNV

Query:  LLVGRMYDMEATRAGNTKHGK------GLTCNGPHCFGGSFTILAAITLFGAMSSLALACRTRNFY
        ++   +YD EA R     HG        L CNG  CF  +  I++   +   M S+ L  RT++ Y
Subjt:  LLVGRMYDMEATRAGNTKHGK------GLTCNGPHCFGGSFTILAAITLFGAMSSLALACRTRNFY

AT5G50520.1 Major facilitator superfamily protein7.1e-8034.01Show/hide
Query:  WRFAKQ-VADGRWFSVFASFVIMIGAGSPYLF-GTYSKVIKTQFDYSQTQVNTLGFSKDLGSNLGVFAGLLAEVAPPWVLFVVGSSLNFFSFIMIWLSVT
        WR   + + + RW     +  +   AG  YLF G+ S  IKT   Y+Q Q+  LG +K+LG  +G  +G L+EV+P WV+ +VG++ N F + ++WL VT
Subjt:  WRFAKQ-VADGRWFSVFASFVIMIGAGSPYLF-GTYSKVIKTQFDYSQTQVNTLGFSKDLGSNLGVFAGLLAEVAPPWVLFVVGSSLNFFSFIMIWLSVT

Query:  HRIAKPEFWQMFLYVGLAANAQNFANTAVMVISVKNFPDRRGIIIGLLKGFVGLGGAVLTQIYLAIYGHEDPSDVLLLLSWFPSVISIVLLFSIRTI---
         ++     W +F+ + +  N + + NTA +V  + NFP+ RG ++G+LKGF GL GA+LTQ+YL ++     S V+L+++  P V+ + LLF +R +   
Subjt:  HRIAKPEFWQMFLYVGLAANAQNFANTAVMVISVKNFPDRRGIIIGLLKGFVGLGGAVLTQIYLAIYGHEDPSDVLLLLSWFPSVISIVLLFSIRTI---

Query:  -RARKHPEELKVFYHLLYASIILAISILIITLTQKHIIFSHAAYLGSVAVVVASLCLPLLIAIKEEFLLFKLNNQAVDPSVTLSIPEEKQSLPYSNILLS
         R     ++L+ F  +    ++LA+ +L + + Q     +      S A++V  + +P+L+     F+    NN      VT   PEE  S    +   +
Subjt:  -RARKHPEELKVFYHLLYASIILAISILIITLTQKHIIFSHAAYLGSVAVVVASLCLPLLIAIKEEFLLFKLNNQAVDPSVTLSIPEEKQSLPYSNILLS

Query:  TVLEKDSPSKPER----GEDFTILQALLSKDMALVFLATVSACGSSIVAIDNLGQIAESLAYPAQSISVFVSWISIFNFFGRVFSGFISETLMTKYKLPR
         +   D P + +R    GEDFT+LQAL   D  L+F++ V   GS I  IDNLGQI  SL Y   +  +FVS ISI NF GRV  G+ SE ++ K  LPR
Subjt:  TVLEKDSPSKPER----GEDFTILQALLSKDMALVFLATVSACGSSIVAIDNLGQIAESLAYPAQSISVFVSWISIFNFFGRVFSGFISETLMTKYKLPR

Query:  PLMFALAHLLTCIGILFIAFPYHGSVYVASLIIGSGFGAQVPLLFAIISDLFGLKHYSTLLNCGQLAVPLGSYILNVLLVGRMYDMEATR-AGNTKHGKG
         L  ++   +  +G+++ A  + G +YV +++IG G+GA   +  A +SD+FGLK + +L N    A+P+GS++ + ++   +YD  A + AG T   + 
Subjt:  PLMFALAHLLTCIGILFIAFPYHGSVYVASLIIGSGFGAQVPLLFAIISDLFGLKHYSTLLNCGQLAVPLGSYILNVLLVGRMYDMEATR-AGNTKHGKG

Query:  LTCNGPHCFGGSFTILAAITLFGAMSSLALACRTRNFY
        L C G  C+  + ++++ + L   + SL++  RTR FY
Subjt:  LTCNGPHCFGGSFTILAAITLFGAMSSLALACRTRNFY

AT5G50630.1 Major facilitator superfamily protein7.1e-8034.01Show/hide
Query:  WRFAKQ-VADGRWFSVFASFVIMIGAGSPYLF-GTYSKVIKTQFDYSQTQVNTLGFSKDLGSNLGVFAGLLAEVAPPWVLFVVGSSLNFFSFIMIWLSVT
        WR   + + + RW     +  +   AG  YLF G+ S  IKT   Y+Q Q+  LG +K+LG  +G  +G L+EV+P WV+ +VG++ N F + ++WL VT
Subjt:  WRFAKQ-VADGRWFSVFASFVIMIGAGSPYLF-GTYSKVIKTQFDYSQTQVNTLGFSKDLGSNLGVFAGLLAEVAPPWVLFVVGSSLNFFSFIMIWLSVT

Query:  HRIAKPEFWQMFLYVGLAANAQNFANTAVMVISVKNFPDRRGIIIGLLKGFVGLGGAVLTQIYLAIYGHEDPSDVLLLLSWFPSVISIVLLFSIRTI---
         ++     W +F+ + +  N + + NTA +V  + NFP+ RG ++G+LKGF GL GA+LTQ+YL ++     S V+L+++  P V+ + LLF +R +   
Subjt:  HRIAKPEFWQMFLYVGLAANAQNFANTAVMVISVKNFPDRRGIIIGLLKGFVGLGGAVLTQIYLAIYGHEDPSDVLLLLSWFPSVISIVLLFSIRTI---

Query:  -RARKHPEELKVFYHLLYASIILAISILIITLTQKHIIFSHAAYLGSVAVVVASLCLPLLIAIKEEFLLFKLNNQAVDPSVTLSIPEEKQSLPYSNILLS
         R     ++L+ F  +    ++LA+ +L + + Q     +      S A++V  + +P+L+     F+    NN      VT   PEE  S    +   +
Subjt:  -RARKHPEELKVFYHLLYASIILAISILIITLTQKHIIFSHAAYLGSVAVVVASLCLPLLIAIKEEFLLFKLNNQAVDPSVTLSIPEEKQSLPYSNILLS

Query:  TVLEKDSPSKPER----GEDFTILQALLSKDMALVFLATVSACGSSIVAIDNLGQIAESLAYPAQSISVFVSWISIFNFFGRVFSGFISETLMTKYKLPR
         +   D P + +R    GEDFT+LQAL   D  L+F++ V   GS I  IDNLGQI  SL Y   +  +FVS ISI NF GRV  G+ SE ++ K  LPR
Subjt:  TVLEKDSPSKPER----GEDFTILQALLSKDMALVFLATVSACGSSIVAIDNLGQIAESLAYPAQSISVFVSWISIFNFFGRVFSGFISETLMTKYKLPR

Query:  PLMFALAHLLTCIGILFIAFPYHGSVYVASLIIGSGFGAQVPLLFAIISDLFGLKHYSTLLNCGQLAVPLGSYILNVLLVGRMYDMEATR-AGNTKHGKG
         L  ++   +  +G+++ A  + G +YV +++IG G+GA   +  A +SD+FGLK + +L N    A+P+GS++ + ++   +YD  A + AG T   + 
Subjt:  PLMFALAHLLTCIGILFIAFPYHGSVYVASLIIGSGFGAQVPLLFAIISDLFGLKHYSTLLNCGQLAVPLGSYILNVLLVGRMYDMEATR-AGNTKHGKG

Query:  LTCNGPHCFGGSFTILAAITLFGAMSSLALACRTRNFY
        L C G  C+  + ++++ + L   + SL++  RTR FY
Subjt:  LTCNGPHCFGGSFTILAAITLFGAMSSLALACRTRNFY


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTGTGCAGCAGAAGGTTGGCGATTCGCGAAACAAGTTGCCGATGGGCGATGGTTTTCGGTGTTCGCTTCGTTCGTGATCATGATCGGCGCCGGCTCACCCTACCTATT
CGGAACATACTCCAAAGTCATAAAGACCCAATTCGATTACAGCCAAACCCAAGTTAACACCTTGGGATTCTCCAAAGATCTCGGCTCCAACCTCGGAGTCTTCGCCGGAC
TTCTCGCCGAGGTGGCTCCGCCGTGGGTTCTGTTCGTCGTCGGCTCCTCCCTCAACTTCTTCAGCTTCATCATGATTTGGCTCTCCGTCACTCACCGGATCGCCAAGCCC
GAGTTCTGGCAGATGTTCCTCTACGTCGGCTTGGCTGCCAACGCACAAAACTTCGCCAACACTGCGGTTATGGTCATCAGCGTTAAAAACTTCCCGGACCGACGAGGGAT
TATTATTGGCCTTCTCAAGGGCTTCGTCGGCCTCGGAGGGGCCGTCTTGACTCAGATCTATCTAGCCATTTATGGCCACGAAGATCCATCTGATGTGCTTCTTCTGCTCT
CATGGTTTCCGTCCGTTATATCCATTGTTCTTTTGTTTTCAATTAGAACAATCAGAGCTCGCAAACATCCGGAGGAGCTTAAAGTGTTTTACCACCTGCTGTACGCATCC
ATAATTCTAGCCATCTCTATCCTTATTATTACCTTGACCCAAAAACACATTATTTTCTCCCATGCTGCCTATTTGGGAAGCGTCGCCGTGGTTGTGGCCTCACTGTGTCT
GCCCCTTCTGATTGCCATTAAAGAAGAATTCTTGCTTTTCAAGCTCAACAACCAAGCCGTGGATCCATCTGTCACCCTTTCCATTCCAGAGGAGAAGCAATCTCTGCCGT
ACTCCAATATCTTACTATCCACAGTACTTGAAAAAGATTCCCCTTCTAAGCCAGAAAGAGGGGAAGATTTCACCATCTTGCAAGCTCTCCTCAGCAAAGACATGGCTCTT
GTTTTCTTGGCAACTGTTTCTGCATGTGGGTCCTCCATTGTCGCCATCGACAATCTTGGACAAATCGCAGAATCACTTGCCTATCCTGCTCAATCCATAAGCGTCTTCGT
TTCTTGGATTTCCATATTCAATTTCTTCGGTCGGGTCTTCTCTGGCTTCATCTCCGAAACGCTAATGACCAAATACAAATTACCTCGTCCTCTCATGTTCGCCCTTGCCC
ATCTCCTCACCTGCATTGGCATACTCTTCATCGCCTTTCCATATCATGGTTCGGTCTACGTGGCTTCATTGATCATTGGGTCTGGATTTGGAGCCCAAGTTCCTTTGTTA
TTCGCTATTATCTCTGATCTATTTGGCCTCAAGCACTACTCAACGCTGCTCAATTGTGGGCAACTGGCGGTGCCGCTTGGATCTTATATACTAAATGTACTTCTAGTAGG
TAGGATGTATGATATGGAAGCCACTCGAGCTGGTAATACAAAGCACGGCAAAGGCCTAACTTGCAATGGACCTCATTGCTTTGGCGGCTCGTTCACAATTTTGGCAGCAA
TAACATTGTTTGGAGCAATGAGTTCACTTGCACTGGCTTGTAGGACTCGAAACTTTTACAAAGGCGACATATATAAGAGATATAGAGATGACATGTGGATTACCCAATCC
GACATG
mRNA sequenceShow/hide mRNA sequence
ATGTGTGCAGCAGAAGGTTGGCGATTCGCGAAACAAGTTGCCGATGGGCGATGGTTTTCGGTGTTCGCTTCGTTCGTGATCATGATCGGCGCCGGCTCACCCTACCTATT
CGGAACATACTCCAAAGTCATAAAGACCCAATTCGATTACAGCCAAACCCAAGTTAACACCTTGGGATTCTCCAAAGATCTCGGCTCCAACCTCGGAGTCTTCGCCGGAC
TTCTCGCCGAGGTGGCTCCGCCGTGGGTTCTGTTCGTCGTCGGCTCCTCCCTCAACTTCTTCAGCTTCATCATGATTTGGCTCTCCGTCACTCACCGGATCGCCAAGCCC
GAGTTCTGGCAGATGTTCCTCTACGTCGGCTTGGCTGCCAACGCACAAAACTTCGCCAACACTGCGGTTATGGTCATCAGCGTTAAAAACTTCCCGGACCGACGAGGGAT
TATTATTGGCCTTCTCAAGGGCTTCGTCGGCCTCGGAGGGGCCGTCTTGACTCAGATCTATCTAGCCATTTATGGCCACGAAGATCCATCTGATGTGCTTCTTCTGCTCT
CATGGTTTCCGTCCGTTATATCCATTGTTCTTTTGTTTTCAATTAGAACAATCAGAGCTCGCAAACATCCGGAGGAGCTTAAAGTGTTTTACCACCTGCTGTACGCATCC
ATAATTCTAGCCATCTCTATCCTTATTATTACCTTGACCCAAAAACACATTATTTTCTCCCATGCTGCCTATTTGGGAAGCGTCGCCGTGGTTGTGGCCTCACTGTGTCT
GCCCCTTCTGATTGCCATTAAAGAAGAATTCTTGCTTTTCAAGCTCAACAACCAAGCCGTGGATCCATCTGTCACCCTTTCCATTCCAGAGGAGAAGCAATCTCTGCCGT
ACTCCAATATCTTACTATCCACAGTACTTGAAAAAGATTCCCCTTCTAAGCCAGAAAGAGGGGAAGATTTCACCATCTTGCAAGCTCTCCTCAGCAAAGACATGGCTCTT
GTTTTCTTGGCAACTGTTTCTGCATGTGGGTCCTCCATTGTCGCCATCGACAATCTTGGACAAATCGCAGAATCACTTGCCTATCCTGCTCAATCCATAAGCGTCTTCGT
TTCTTGGATTTCCATATTCAATTTCTTCGGTCGGGTCTTCTCTGGCTTCATCTCCGAAACGCTAATGACCAAATACAAATTACCTCGTCCTCTCATGTTCGCCCTTGCCC
ATCTCCTCACCTGCATTGGCATACTCTTCATCGCCTTTCCATATCATGGTTCGGTCTACGTGGCTTCATTGATCATTGGGTCTGGATTTGGAGCCCAAGTTCCTTTGTTA
TTCGCTATTATCTCTGATCTATTTGGCCTCAAGCACTACTCAACGCTGCTCAATTGTGGGCAACTGGCGGTGCCGCTTGGATCTTATATACTAAATGTACTTCTAGTAGG
TAGGATGTATGATATGGAAGCCACTCGAGCTGGTAATACAAAGCACGGCAAAGGCCTAACTTGCAATGGACCTCATTGCTTTGGCGGCTCGTTCACAATTTTGGCAGCAA
TAACATTGTTTGGAGCAATGAGTTCACTTGCACTGGCTTGTAGGACTCGAAACTTTTACAAAGGCGACATATATAAGAGATATAGAGATGACATGTGGATTACCCAATCC
GACATG
Protein sequenceShow/hide protein sequence
MCAAEGWRFAKQVADGRWFSVFASFVIMIGAGSPYLFGTYSKVIKTQFDYSQTQVNTLGFSKDLGSNLGVFAGLLAEVAPPWVLFVVGSSLNFFSFIMIWLSVTHRIAKP
EFWQMFLYVGLAANAQNFANTAVMVISVKNFPDRRGIIIGLLKGFVGLGGAVLTQIYLAIYGHEDPSDVLLLLSWFPSVISIVLLFSIRTIRARKHPEELKVFYHLLYAS
IILAISILIITLTQKHIIFSHAAYLGSVAVVVASLCLPLLIAIKEEFLLFKLNNQAVDPSVTLSIPEEKQSLPYSNILLSTVLEKDSPSKPERGEDFTILQALLSKDMAL
VFLATVSACGSSIVAIDNLGQIAESLAYPAQSISVFVSWISIFNFFGRVFSGFISETLMTKYKLPRPLMFALAHLLTCIGILFIAFPYHGSVYVASLIIGSGFGAQVPLL
FAIISDLFGLKHYSTLLNCGQLAVPLGSYILNVLLVGRMYDMEATRAGNTKHGKGLTCNGPHCFGGSFTILAAITLFGAMSSLALACRTRNFYKGDIYKRYRDDMWITQS
DM