; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MC05g1023 (gene) of Bitter gourd (Dali-11) v1 genome

Gene IDMC05g1023
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
DescriptionFilamentous hemagglutinin
Genome locationMC05:13832754..13839091
RNA-Seq ExpressionMC05g1023
SyntenyMC05g1023
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004136773.3 uncharacterized protein LOC101213172 isoform X1 [Cucumis sativus]1.29e-27982.05Show/hide
Query:  MGKTEEEQPLPVGLSSSELSDGNVGSRCGAG----IRRLIAVRCVFFLLLSAAVFISAFFWLPPFLSDGDWPDRALDSAYRDHEIVASFHAWKPVSFLQN
        MGK+EEEQPLPVG SSSELSD NV +RCG G    IRRLIAVRCVFFLLLSAAVF+SA FWLPPFLS G+WPDR +DSAYRDH+IVASFHA KPV FLQ 
Subjt:  MGKTEEEQPLPVGLSSSELSDGNVGSRCGAG----IRRLIAVRCVFFLLLSAAVFISAFFWLPPFLSDGDWPDRALDSAYRDHEIVASFHAWKPVSFLQN

Query:  HILELEDNIFGEIPVPFVKVVILSLQSLGGLNVTKIVFAVDPDAKYSKIPPTSQSLIKEIFETMFINEPPLTLTAALFGNTSLFEVLKFPGGITIIPPQS
        HI ELEDNIFGEIP+P VKV ILSLQSLGG NVTKIVFAVD DAKYSKIPPTSQSLIKE FET+ INEPPL L  +LFGNTSLFEVLKFPGGITIIPPQS
Subjt:  HILELEDNIFGEIPVPFVKVVILSLQSLGGLNVTKIVFAVDPDAKYSKIPPTSQSLIKEIFETMFINEPPLTLTAALFGNTSLFEVLKFPGGITIIPPQS

Query:  AFLLQTAQIYFNFTLNYSIYQIQVNFDDLTSQLRSGLHLSLYENLYVSLSNARGSTVDAPTIIQSSVLMAIGTN--SSKKRLKQLAQTITDSHSGNLGLN
        AFLLQTAQIYFNFTLNYSIYQIQVNFDDL+SQLRSGL LS YENLYVSLSN RGST+DAPT++QSSVLMAIGTN  SSK+RLKQLA TIT+SHSGNLGLN
Subjt:  AFLLQTAQIYFNFTLNYSIYQIQVNFDDLTSQLRSGLHLSLYENLYVSLSNARGSTVDAPTIIQSSVLMAIGTN--SSKKRLKQLAQTITDSHSGNLGLN

Query:  NTIFGKVKQVRLSSVLNHSLSGGANARSPAPSPLPHSHHHHHHHHHHHHHHHHHHHHHHHH---DASYSPSPGTEEHAHMPHNGVSSAP--GSAPVESPT
        NT+FGKVKQVRLS  LNHSL GG NARSP+P+PLPHSHHH HHHHHHHHHHHHHHHHHHHH   DA+YSPSPGTEEH H P NGVSSAP  GS+P+E PT
Subjt:  NTIFGKVKQVRLSSVLNHSLSGGANARSPAPSPLPHSHHHHHHHHHHHHHHHHHHHHHHHH---DASYSPSPGTEEHAHMPHNGVSSAP--GSAPVESPT

Query:  PKKRNDEATPPACQYGYKRSSTEVRKRHLGPIHSLSSPPSSPYFRVGPPAPVSGSISASSPLSEVVLSNVQPPDKGSRHAENFERRAPSVLPSQFSSSVG
         +KRN EATPPA +YGYKRS T++RK +LGPI S SS PSSPY RVG PAPVS SISASSPLS VVLSNVQPP+ GS HAENFER +PSVLP QFSS+ G
Subjt:  PKKRNDEATPPACQYGYKRSSTEVRKRHLGPIHSLSSPPSSPYFRVGPPAPVSGSISASSPLSEVVLSNVQPPDKGSRHAENFERRAPSVLPSQFSSSVG

Query:  VRVYTIRWTLLLFLLVWH
        VRVYTI+WTL LFLL+WH
Subjt:  VRVYTIRWTLLLFLLVWH

XP_008443610.1 PREDICTED: uncharacterized protein LOC103487165 [Cucumis melo]6.15e-27578.12Show/hide
Query:  MGKTEEEQPLPVGLSSSELSDGNVGSRCGAG----IRRLIAVRCVFFLLLSAAVFISAFFWLPPFLSDGDWPDRALDSAYRDHEIVASFHAWKPVSFLQN
        MGK+EEEQPLPVG+SSSELSD NV +RCG G    IR+LIAVRCVFFLLLSAAVF+SA FWLPPFLS G+WPDR +DSAYRDH+IVASFHAWKPV FLQN
Subjt:  MGKTEEEQPLPVGLSSSELSDGNVGSRCGAG----IRRLIAVRCVFFLLLSAAVFISAFFWLPPFLSDGDWPDRALDSAYRDHEIVASFHAWKPVSFLQN

Query:  HILELEDNIFGEIPVPFVKVVILSLQSLGGLNVTKIVFAVDPDAKYSKIPPTSQSLIKEIFETMFINEPPLTLTAALFGNTSLFEVLKFPGGITIIPPQS
        HI ELEDNIFGEIP+P VKV ILSLQSL G NVTKIVFAVD DAKYSKIPPTSQSLIKE FET+ INEPPL L  +LFGNTSLFEVLKFPGGITIIPPQS
Subjt:  HILELEDNIFGEIPVPFVKVVILSLQSLGGLNVTKIVFAVDPDAKYSKIPPTSQSLIKEIFETMFINEPPLTLTAALFGNTSLFEVLKFPGGITIIPPQS

Query:  AFLLQTAQIYFNFTLNYSIYQIQVNFDDLTSQLRSGLHLSLYENLYVSLSNARGSTVDAPTIIQSSVLMAIGTN--SSKKRLKQLAQTITDSHSGNLGLN
        AFLLQTAQIYFNFTLNYSIYQIQVNFDDL+SQLRSGL LS YENLYVSLSN RGST+DAPT++QSSVLMAIGTN  SSK+RLKQLA TIT+SHSGNLGLN
Subjt:  AFLLQTAQIYFNFTLNYSIYQIQVNFDDLTSQLRSGLHLSLYENLYVSLSNARGSTVDAPTIIQSSVLMAIGTN--SSKKRLKQLAQTITDSHSGNLGLN

Query:  NTIFGKVKQVRLSSVLNHSLSGGANARSPAPSPLPHSHHHHHHHHHHHHHHHHHHHHHHHHD----------------------------ASYSPSPGTE
        NT+FGKVKQVRLS  LNHSL GG NA SP+P+PLPHSHHHHHHHHHHHHHHHHHHHHHHHH                             A+YSPSPGTE
Subjt:  NTIFGKVKQVRLSSVLNHSLSGGANARSPAPSPLPHSHHHHHHHHHHHHHHHHHHHHHHHHD----------------------------ASYSPSPGTE

Query:  EHAHMPHNGVSSAP--GSAPVESPTPKKRNDEATPPACQYGYKRSSTEVRKRH-LGPIHSLSSPPSSPYFRVGPPAPVSGSISASSPLSEVVLSNVQPPD
        EH H P NGVSSAP  GS+P+E PT +KRN EATPPA +YGYKRSST++RK+H LGPI S SS P SPY RVG PAPVS SISASSPLS VVLSNVQPP+
Subjt:  EHAHMPHNGVSSAP--GSAPVESPTPKKRNDEATPPACQYGYKRSSTEVRKRH-LGPIHSLSSPPSSPYFRVGPPAPVSGSISASSPLSEVVLSNVQPPD

Query:  KGSRHAENFERRAPSVLPSQFSSSVGVRVYTIRWTLLLFLLVWH
         GS HAENFER +PSVLP QFSS+  VRVYTI+WTL LFLLVWH
Subjt:  KGSRHAENFERRAPSVLPSQFSSSVGVRVYTIRWTLLLFLLVWH

XP_022147793.1 TSC22 domain family protein 1-like isoform X1 [Momordica charantia]0.0100Show/hide
Query:  MGKTEEEQPLPVGLSSSELSDGNVGSRCGAGIRRLIAVRCVFFLLLSAAVFISAFFWLPPFLSDGDWPDRALDSAYRDHEIVASFHAWKPVSFLQNHILE
        MGKTEEEQPLPVGLSSSELSDGNVGSRCGAGIRRLIAVRCVFFLLLSAAVFISAFFWLPPFLSDGDWPDRALDSAYRDHEIVASFHAWKPVSFLQNHILE
Subjt:  MGKTEEEQPLPVGLSSSELSDGNVGSRCGAGIRRLIAVRCVFFLLLSAAVFISAFFWLPPFLSDGDWPDRALDSAYRDHEIVASFHAWKPVSFLQNHILE

Query:  LEDNIFGEIPVPFVKVVILSLQSLGGLNVTKIVFAVDPDAKYSKIPPTSQSLIKEIFETMFINEPPLTLTAALFGNTSLFEVLKFPGGITIIPPQSAFLL
        LEDNIFGEIPVPFVKVVILSLQSLGGLNVTKIVFAVDPDAKYSKIPPTSQSLIKEIFETMFINEPPLTLTAALFGNTSLFEVLKFPGGITIIPPQSAFLL
Subjt:  LEDNIFGEIPVPFVKVVILSLQSLGGLNVTKIVFAVDPDAKYSKIPPTSQSLIKEIFETMFINEPPLTLTAALFGNTSLFEVLKFPGGITIIPPQSAFLL

Query:  QTAQIYFNFTLNYSIYQIQVNFDDLTSQLRSGLHLSLYENLYVSLSNARGSTVDAPTIIQSSVLMAIGTNSSKKRLKQLAQTITDSHSGNLGLNNTIFGK
        QTAQIYFNFTLNYSIYQIQVNFDDLTSQLRSGLHLSLYENLYVSLSNARGSTVDAPTIIQSSVLMAIGTNSSKKRLKQLAQTITDSHSGNLGLNNTIFGK
Subjt:  QTAQIYFNFTLNYSIYQIQVNFDDLTSQLRSGLHLSLYENLYVSLSNARGSTVDAPTIIQSSVLMAIGTNSSKKRLKQLAQTITDSHSGNLGLNNTIFGK

Query:  VKQVRLSSVLNHSLSGGANARSPAPSPLPHSHHHHHHHHHHHHHHHHHHHHHHHHDASYSPSPGTEEHAHMPHNGVSSAPGSAPVESPTPKKRNDEATPP
        VKQVRLSSVLNHSLSGGANARSPAPSPLPHSHHHHHHHHHHHHHHHHHHHHHHHHDASYSPSPGTEEHAHMPHNGVSSAPGSAPVESPTPKKRNDEATPP
Subjt:  VKQVRLSSVLNHSLSGGANARSPAPSPLPHSHHHHHHHHHHHHHHHHHHHHHHHHDASYSPSPGTEEHAHMPHNGVSSAPGSAPVESPTPKKRNDEATPP

Query:  ACQYGYKRSSTEVRKRHLGPIHSLSSPPSSPYFRVGPPAPVSGSISASSPLSEVVLSNVQPPDKGSRHAENFERRAPSVLPSQFSSSVGVRVYTIRWTLL
        ACQYGYKRSSTEVRKRHLGPIHSLSSPPSSPYFRVGPPAPVSGSISASSPLSEVVLSNVQPPDKGSRHAENFERRAPSVLPSQFSSSVGVRVYTIRWTLL
Subjt:  ACQYGYKRSSTEVRKRHLGPIHSLSSPPSSPYFRVGPPAPVSGSISASSPLSEVVLSNVQPPDKGSRHAENFERRAPSVLPSQFSSSVGVRVYTIRWTLL

Query:  LFLLVWHWHI
        LFLLVWHWHI
Subjt:  LFLLVWHWHI

XP_022147794.1 E3 ubiquitin-protein ligase arkadia-C-like isoform X2 [Momordica charantia]4.73e-26999.5Show/hide
Query:  VKVVILSLQSLGGLNVTKIVFAVDPDAKYSKIPPTSQSLIKEIFETMFINEPPLTLTAALFGNTSLFEVLKFPGGITIIPPQSAFLLQTAQIYFNFTLNY
        ++VVILSLQSLGGLNVTKIVFAVDPDAKYSKIPPTSQSLIKEIFETMFINEPPLTLTAALFGNTSLFEVLKFPGGITIIPPQSAFLLQTAQIYFNFTLNY
Subjt:  VKVVILSLQSLGGLNVTKIVFAVDPDAKYSKIPPTSQSLIKEIFETMFINEPPLTLTAALFGNTSLFEVLKFPGGITIIPPQSAFLLQTAQIYFNFTLNY

Query:  SIYQIQVNFDDLTSQLRSGLHLSLYENLYVSLSNARGSTVDAPTIIQSSVLMAIGTNSSKKRLKQLAQTITDSHSGNLGLNNTIFGKVKQVRLSSVLNHS
        SIYQIQVNFDDLTSQLRSGLHLSLYENLYVSLSNARGSTVDAPTIIQSSVLMAIGTNSSKKRLKQLAQTITDSHSGNLGLNNTIFGKVKQVRLSSVLNHS
Subjt:  SIYQIQVNFDDLTSQLRSGLHLSLYENLYVSLSNARGSTVDAPTIIQSSVLMAIGTNSSKKRLKQLAQTITDSHSGNLGLNNTIFGKVKQVRLSSVLNHS

Query:  LSGGANARSPAPSPLPHSHHHHHHHHHHHHHHHHHHHHHHHHDASYSPSPGTEEHAHMPHNGVSSAPGSAPVESPTPKKRNDEATPPACQYGYKRSSTEV
        LSGGANARSPAPSPLPHSHHHHHHHHHHHHHHHHHHHHHHHHDASYSPSPGTEEHAHMPHNGVSSAPGSAPVESPTPKKRNDEATPPACQYGYKRSSTEV
Subjt:  LSGGANARSPAPSPLPHSHHHHHHHHHHHHHHHHHHHHHHHHDASYSPSPGTEEHAHMPHNGVSSAPGSAPVESPTPKKRNDEATPPACQYGYKRSSTEV

Query:  RKRHLGPIHSLSSPPSSPYFRVGPPAPVSGSISASSPLSEVVLSNVQPPDKGSRHAENFERRAPSVLPSQFSSSVGVRVYTIRWTLLLFLLVWHWHI
        RKRHLGPIHSLSSPPSSPYFRVGPPAPVSGSISASSPLSEVVLSNVQPPDKGSRHAENFERRAPSVLPSQFSSSVGVRVYTIRWTLLLFLLVWHWHI
Subjt:  RKRHLGPIHSLSSPPSSPYFRVGPPAPVSGSISASSPLSEVVLSNVQPPDKGSRHAENFERRAPSVLPSQFSSSVGVRVYTIRWTLLLFLLVWHWHI

XP_022983747.1 uncharacterized protein LOC111482272 isoform X2 [Cucurbita maxima]5.00e-26878.17Show/hide
Query:  MGKTEEEQPLPVGLSSSELSDGNVGSRCGAG----IRRLIAVRCVFFLLLSAAVFISAFFWLPPFLSDGDWPDRALDSAYRDHEIVASFHAWKPVSFLQN
        MGK+EEEQPLPVG+SSSELSD  V SRCG G    IRRLIAVRCVFFLLLSAAVF+SA FWLPPFLS GDWPD+A DS YRDHEIVA F A KPV FL+N
Subjt:  MGKTEEEQPLPVGLSSSELSDGNVGSRCGAG----IRRLIAVRCVFFLLLSAAVFISAFFWLPPFLSDGDWPDRALDSAYRDHEIVASFHAWKPVSFLQN

Query:  HILELEDNIFGEIPVPFVKVVILSLQSLGGLNVTKIVFAVDPDAKYSKIPPTSQSLIKEIFETMFINEPPLTLTAALFGNTSLFEVLKFPGGITIIPPQS
        HI ELEDNIFGEIPVPFVKV +LSLQSLGG NVT I+F+VDPDAKYSKIPPTSQSLIKE FET+ IN+PPL L A+LFGNTSLFEVLKFPGGITIIPPQS
Subjt:  HILELEDNIFGEIPVPFVKVVILSLQSLGGLNVTKIVFAVDPDAKYSKIPPTSQSLIKEIFETMFINEPPLTLTAALFGNTSLFEVLKFPGGITIIPPQS

Query:  AFLLQTAQIYFNFTLNYSIYQIQVNFDDLTSQLRSGLHLSLYENLYVSLSNARGSTVDAPTIIQSSVLMAIGTNSSKKRLKQLAQTITDSHSGNLGLNNT
        AFLLQTAQIYFNFTLNYSIYQIQVNF+DLTSQLRSGL LS YENLYVSLSN RGST+ APTI+QSSVLMAIGTNSS +RLKQLAQTIT+SHSGNLGLNNT
Subjt:  AFLLQTAQIYFNFTLNYSIYQIQVNFDDLTSQLRSGLHLSLYENLYVSLSNARGSTVDAPTIIQSSVLMAIGTNSSKKRLKQLAQTITDSHSGNLGLNNT

Query:  IFGKVKQVRLSSVLNHSLSGGANARSPAPSPLPHSH----------------------HHHHHHHHHHHHHHHHHHHHHHHDASYSPSPGTEEHAHMPHN
        +FGKVKQVRLSSVLNHSLSGG  ARSP+P+PLPHSH                      HHHHHHHHHHHHHHHHHH HHH DA+YSPSPGTEEH H P N
Subjt:  IFGKVKQVRLSSVLNHSLSGGANARSPAPSPLPHSH----------------------HHHHHHHHHHHHHHHHHHHHHHHDASYSPSPGTEEHAHMPHN

Query:  GVSSAP--GSAPVESPTPKKRNDEATPPACQYGYKRSSTEVRKR-HLGPIHSLSSPPSSPYFRVGPPAPVSGSISASSPLSEVVLSNVQPPDKGSRHAEN
        G+SSAP  GS+PVESP  KKRN EATPP  +YGYK  ST+VRKR HLG I S SSPPSSPY RVG PAPV+ SISASSPL  V LSNVQPP+KG      
Subjt:  GVSSAP--GSAPVESPTPKKRNDEATPPACQYGYKRSSTEVRKR-HLGPIHSLSSPPSSPYFRVGPPAPVSGSISASSPLSEVVLSNVQPPDKGSRHAEN

Query:  FERRAPSVLPSQFSSSVGVRVYTIRWTLLLFLLVWH
         +R APSVLP QFS SVGVRV+TIRWTL LFL+VWH
Subjt:  FERRAPSVLPSQFSSSVGVRVYTIRWTLLLFLLVWH

TrEMBL top hitse value%identityAlignment
A0A0A0LHD1 Uncharacterized protein5.15e-27781.36Show/hide
Query:  MGKTEEEQPLPVGLSSSELSDGNVGSRCGAG----IRRLIAVRCVFFLLLSAAVFISAFFWLPPFLSDGDWPDRALDSAYRDHEIVASFHAWKPVSFLQN
        MGK+EEEQPLPVG SSSELSD NV +RCG G    IRRLIAVRCVFFLLLSAAVF+SA FWLPPFLS G+WPDR +DSAYRDH+IVASFHA KPV FLQ 
Subjt:  MGKTEEEQPLPVGLSSSELSDGNVGSRCGAG----IRRLIAVRCVFFLLLSAAVFISAFFWLPPFLSDGDWPDRALDSAYRDHEIVASFHAWKPVSFLQN

Query:  HILELEDNIFGEIPVPFVKVVILSLQSLGGLNVTKIVFAVDPDAKYSKIPPTSQSLIKEIFETMFINEPPLTLTAALFGNTSLFEVLKFPGGITIIPPQS
        HI ELEDNIFGEIP+P VKV ILSLQSLGG NVTKIVFAVD DAKYSKIPPTSQSLIKE FET+ INEPPL L  +LFGNTSLFEVLKFPGGITIIPPQS
Subjt:  HILELEDNIFGEIPVPFVKVVILSLQSLGGLNVTKIVFAVDPDAKYSKIPPTSQSLIKEIFETMFINEPPLTLTAALFGNTSLFEVLKFPGGITIIPPQS

Query:  AFLLQTAQIYFNFTLNYSIYQIQVNFDDLTSQLRSGLHLSLYENLYVSLSNARGSTVDAPTIIQSSVLMAIGTN--SSKKRLKQLAQTITDSHSGNLGLN
        AFLLQTAQIYFNFTLNYSIYQIQVNFDDL+SQLRSGL LS YENLYVSLSN RGST+DAPT++QSSVLMAIGTN  SSK+RLKQLA TIT+SHSGNLGLN
Subjt:  AFLLQTAQIYFNFTLNYSIYQIQVNFDDLTSQLRSGLHLSLYENLYVSLSNARGSTVDAPTIIQSSVLMAIGTN--SSKKRLKQLAQTITDSHSGNLGLN

Query:  NTIFGKVKQVRLSSVLNHSLSGGANARSPAPSPLPHSHHHHHHHHHHHHHHHHHHHHHHHHDASYSPSPGTEEHAHMPHNGVSSAP--GSAPVESPTPKK
        NT+FGKVKQVRLS  LNHSL GG NARSP+P+PLPHSHHH HHHHHHHHHHHHHH      DA+YSPSPGTEEH H P NGVSSAP  GS+P+E PT +K
Subjt:  NTIFGKVKQVRLSSVLNHSLSGGANARSPAPSPLPHSHHHHHHHHHHHHHHHHHHHHHHHHDASYSPSPGTEEHAHMPHNGVSSAP--GSAPVESPTPKK

Query:  RNDEATPPACQYGYKRSSTEVRKRHLGPIHSLSSPPSSPYFRVGPPAPVSGSISASSPLSEVVLSNVQPPDKGSRHAENFERRAPSVLPSQFSSSVGVRV
        RN EATPPA +YGYKRS T++RK +LGPI S SS PSSPY RVG PAPVS SISASSPLS VVLSNVQPP+ GS HAENFER +PSVLP QFSS+ GVRV
Subjt:  RNDEATPPACQYGYKRSSTEVRKRHLGPIHSLSSPPSSPYFRVGPPAPVSGSISASSPLSEVVLSNVQPPDKGSRHAENFERRAPSVLPSQFSSSVGVRV

Query:  YTIRWTLLLFLLVWH
        YTI+WTL LFLL+WH
Subjt:  YTIRWTLLLFLLVWH

A0A1S3B8E9 uncharacterized protein LOC1034871652.98e-27578.12Show/hide
Query:  MGKTEEEQPLPVGLSSSELSDGNVGSRCGAG----IRRLIAVRCVFFLLLSAAVFISAFFWLPPFLSDGDWPDRALDSAYRDHEIVASFHAWKPVSFLQN
        MGK+EEEQPLPVG+SSSELSD NV +RCG G    IR+LIAVRCVFFLLLSAAVF+SA FWLPPFLS G+WPDR +DSAYRDH+IVASFHAWKPV FLQN
Subjt:  MGKTEEEQPLPVGLSSSELSDGNVGSRCGAG----IRRLIAVRCVFFLLLSAAVFISAFFWLPPFLSDGDWPDRALDSAYRDHEIVASFHAWKPVSFLQN

Query:  HILELEDNIFGEIPVPFVKVVILSLQSLGGLNVTKIVFAVDPDAKYSKIPPTSQSLIKEIFETMFINEPPLTLTAALFGNTSLFEVLKFPGGITIIPPQS
        HI ELEDNIFGEIP+P VKV ILSLQSL G NVTKIVFAVD DAKYSKIPPTSQSLIKE FET+ INEPPL L  +LFGNTSLFEVLKFPGGITIIPPQS
Subjt:  HILELEDNIFGEIPVPFVKVVILSLQSLGGLNVTKIVFAVDPDAKYSKIPPTSQSLIKEIFETMFINEPPLTLTAALFGNTSLFEVLKFPGGITIIPPQS

Query:  AFLLQTAQIYFNFTLNYSIYQIQVNFDDLTSQLRSGLHLSLYENLYVSLSNARGSTVDAPTIIQSSVLMAIGTN--SSKKRLKQLAQTITDSHSGNLGLN
        AFLLQTAQIYFNFTLNYSIYQIQVNFDDL+SQLRSGL LS YENLYVSLSN RGST+DAPT++QSSVLMAIGTN  SSK+RLKQLA TIT+SHSGNLGLN
Subjt:  AFLLQTAQIYFNFTLNYSIYQIQVNFDDLTSQLRSGLHLSLYENLYVSLSNARGSTVDAPTIIQSSVLMAIGTN--SSKKRLKQLAQTITDSHSGNLGLN

Query:  NTIFGKVKQVRLSSVLNHSLSGGANARSPAPSPLPHSHHHHHHHHHHHHHHHHHHHHHHHHD----------------------------ASYSPSPGTE
        NT+FGKVKQVRLS  LNHSL GG NA SP+P+PLPHSHHHHHHHHHHHHHHHHHHHHHHHH                             A+YSPSPGTE
Subjt:  NTIFGKVKQVRLSSVLNHSLSGGANARSPAPSPLPHSHHHHHHHHHHHHHHHHHHHHHHHHD----------------------------ASYSPSPGTE

Query:  EHAHMPHNGVSSAP--GSAPVESPTPKKRNDEATPPACQYGYKRSSTEVRKRH-LGPIHSLSSPPSSPYFRVGPPAPVSGSISASSPLSEVVLSNVQPPD
        EH H P NGVSSAP  GS+P+E PT +KRN EATPPA +YGYKRSST++RK+H LGPI S SS P SPY RVG PAPVS SISASSPLS VVLSNVQPP+
Subjt:  EHAHMPHNGVSSAP--GSAPVESPTPKKRNDEATPPACQYGYKRSSTEVRKRH-LGPIHSLSSPPSSPYFRVGPPAPVSGSISASSPLSEVVLSNVQPPD

Query:  KGSRHAENFERRAPSVLPSQFSSSVGVRVYTIRWTLLLFLLVWH
         GS HAENFER +PSVLP QFSS+  VRVYTI+WTL LFLLVWH
Subjt:  KGSRHAENFERRAPSVLPSQFSSSVGVRVYTIRWTLLLFLLVWH

A0A6J1D227 E3 ubiquitin-protein ligase arkadia-C-like isoform X22.29e-26999.5Show/hide
Query:  VKVVILSLQSLGGLNVTKIVFAVDPDAKYSKIPPTSQSLIKEIFETMFINEPPLTLTAALFGNTSLFEVLKFPGGITIIPPQSAFLLQTAQIYFNFTLNY
        ++VVILSLQSLGGLNVTKIVFAVDPDAKYSKIPPTSQSLIKEIFETMFINEPPLTLTAALFGNTSLFEVLKFPGGITIIPPQSAFLLQTAQIYFNFTLNY
Subjt:  VKVVILSLQSLGGLNVTKIVFAVDPDAKYSKIPPTSQSLIKEIFETMFINEPPLTLTAALFGNTSLFEVLKFPGGITIIPPQSAFLLQTAQIYFNFTLNY

Query:  SIYQIQVNFDDLTSQLRSGLHLSLYENLYVSLSNARGSTVDAPTIIQSSVLMAIGTNSSKKRLKQLAQTITDSHSGNLGLNNTIFGKVKQVRLSSVLNHS
        SIYQIQVNFDDLTSQLRSGLHLSLYENLYVSLSNARGSTVDAPTIIQSSVLMAIGTNSSKKRLKQLAQTITDSHSGNLGLNNTIFGKVKQVRLSSVLNHS
Subjt:  SIYQIQVNFDDLTSQLRSGLHLSLYENLYVSLSNARGSTVDAPTIIQSSVLMAIGTNSSKKRLKQLAQTITDSHSGNLGLNNTIFGKVKQVRLSSVLNHS

Query:  LSGGANARSPAPSPLPHSHHHHHHHHHHHHHHHHHHHHHHHHDASYSPSPGTEEHAHMPHNGVSSAPGSAPVESPTPKKRNDEATPPACQYGYKRSSTEV
        LSGGANARSPAPSPLPHSHHHHHHHHHHHHHHHHHHHHHHHHDASYSPSPGTEEHAHMPHNGVSSAPGSAPVESPTPKKRNDEATPPACQYGYKRSSTEV
Subjt:  LSGGANARSPAPSPLPHSHHHHHHHHHHHHHHHHHHHHHHHHDASYSPSPGTEEHAHMPHNGVSSAPGSAPVESPTPKKRNDEATPPACQYGYKRSSTEV

Query:  RKRHLGPIHSLSSPPSSPYFRVGPPAPVSGSISASSPLSEVVLSNVQPPDKGSRHAENFERRAPSVLPSQFSSSVGVRVYTIRWTLLLFLLVWHWHI
        RKRHLGPIHSLSSPPSSPYFRVGPPAPVSGSISASSPLSEVVLSNVQPPDKGSRHAENFERRAPSVLPSQFSSSVGVRVYTIRWTLLLFLLVWHWHI
Subjt:  RKRHLGPIHSLSSPPSSPYFRVGPPAPVSGSISASSPLSEVVLSNVQPPDKGSRHAENFERRAPSVLPSQFSSSVGVRVYTIRWTLLLFLLVWHWHI

A0A6J1D2A3 TSC22 domain family protein 1-like isoform X10.0100Show/hide
Query:  MGKTEEEQPLPVGLSSSELSDGNVGSRCGAGIRRLIAVRCVFFLLLSAAVFISAFFWLPPFLSDGDWPDRALDSAYRDHEIVASFHAWKPVSFLQNHILE
        MGKTEEEQPLPVGLSSSELSDGNVGSRCGAGIRRLIAVRCVFFLLLSAAVFISAFFWLPPFLSDGDWPDRALDSAYRDHEIVASFHAWKPVSFLQNHILE
Subjt:  MGKTEEEQPLPVGLSSSELSDGNVGSRCGAGIRRLIAVRCVFFLLLSAAVFISAFFWLPPFLSDGDWPDRALDSAYRDHEIVASFHAWKPVSFLQNHILE

Query:  LEDNIFGEIPVPFVKVVILSLQSLGGLNVTKIVFAVDPDAKYSKIPPTSQSLIKEIFETMFINEPPLTLTAALFGNTSLFEVLKFPGGITIIPPQSAFLL
        LEDNIFGEIPVPFVKVVILSLQSLGGLNVTKIVFAVDPDAKYSKIPPTSQSLIKEIFETMFINEPPLTLTAALFGNTSLFEVLKFPGGITIIPPQSAFLL
Subjt:  LEDNIFGEIPVPFVKVVILSLQSLGGLNVTKIVFAVDPDAKYSKIPPTSQSLIKEIFETMFINEPPLTLTAALFGNTSLFEVLKFPGGITIIPPQSAFLL

Query:  QTAQIYFNFTLNYSIYQIQVNFDDLTSQLRSGLHLSLYENLYVSLSNARGSTVDAPTIIQSSVLMAIGTNSSKKRLKQLAQTITDSHSGNLGLNNTIFGK
        QTAQIYFNFTLNYSIYQIQVNFDDLTSQLRSGLHLSLYENLYVSLSNARGSTVDAPTIIQSSVLMAIGTNSSKKRLKQLAQTITDSHSGNLGLNNTIFGK
Subjt:  QTAQIYFNFTLNYSIYQIQVNFDDLTSQLRSGLHLSLYENLYVSLSNARGSTVDAPTIIQSSVLMAIGTNSSKKRLKQLAQTITDSHSGNLGLNNTIFGK

Query:  VKQVRLSSVLNHSLSGGANARSPAPSPLPHSHHHHHHHHHHHHHHHHHHHHHHHHDASYSPSPGTEEHAHMPHNGVSSAPGSAPVESPTPKKRNDEATPP
        VKQVRLSSVLNHSLSGGANARSPAPSPLPHSHHHHHHHHHHHHHHHHHHHHHHHHDASYSPSPGTEEHAHMPHNGVSSAPGSAPVESPTPKKRNDEATPP
Subjt:  VKQVRLSSVLNHSLSGGANARSPAPSPLPHSHHHHHHHHHHHHHHHHHHHHHHHHDASYSPSPGTEEHAHMPHNGVSSAPGSAPVESPTPKKRNDEATPP

Query:  ACQYGYKRSSTEVRKRHLGPIHSLSSPPSSPYFRVGPPAPVSGSISASSPLSEVVLSNVQPPDKGSRHAENFERRAPSVLPSQFSSSVGVRVYTIRWTLL
        ACQYGYKRSSTEVRKRHLGPIHSLSSPPSSPYFRVGPPAPVSGSISASSPLSEVVLSNVQPPDKGSRHAENFERRAPSVLPSQFSSSVGVRVYTIRWTLL
Subjt:  ACQYGYKRSSTEVRKRHLGPIHSLSSPPSSPYFRVGPPAPVSGSISASSPLSEVVLSNVQPPDKGSRHAENFERRAPSVLPSQFSSSVGVRVYTIRWTLL

Query:  LFLLVWHWHI
        LFLLVWHWHI
Subjt:  LFLLVWHWHI

A0A6J1J074 uncharacterized protein LOC111482272 isoform X22.42e-26878.17Show/hide
Query:  MGKTEEEQPLPVGLSSSELSDGNVGSRCGAG----IRRLIAVRCVFFLLLSAAVFISAFFWLPPFLSDGDWPDRALDSAYRDHEIVASFHAWKPVSFLQN
        MGK+EEEQPLPVG+SSSELSD  V SRCG G    IRRLIAVRCVFFLLLSAAVF+SA FWLPPFLS GDWPD+A DS YRDHEIVA F A KPV FL+N
Subjt:  MGKTEEEQPLPVGLSSSELSDGNVGSRCGAG----IRRLIAVRCVFFLLLSAAVFISAFFWLPPFLSDGDWPDRALDSAYRDHEIVASFHAWKPVSFLQN

Query:  HILELEDNIFGEIPVPFVKVVILSLQSLGGLNVTKIVFAVDPDAKYSKIPPTSQSLIKEIFETMFINEPPLTLTAALFGNTSLFEVLKFPGGITIIPPQS
        HI ELEDNIFGEIPVPFVKV +LSLQSLGG NVT I+F+VDPDAKYSKIPPTSQSLIKE FET+ IN+PPL L A+LFGNTSLFEVLKFPGGITIIPPQS
Subjt:  HILELEDNIFGEIPVPFVKVVILSLQSLGGLNVTKIVFAVDPDAKYSKIPPTSQSLIKEIFETMFINEPPLTLTAALFGNTSLFEVLKFPGGITIIPPQS

Query:  AFLLQTAQIYFNFTLNYSIYQIQVNFDDLTSQLRSGLHLSLYENLYVSLSNARGSTVDAPTIIQSSVLMAIGTNSSKKRLKQLAQTITDSHSGNLGLNNT
        AFLLQTAQIYFNFTLNYSIYQIQVNF+DLTSQLRSGL LS YENLYVSLSN RGST+ APTI+QSSVLMAIGTNSS +RLKQLAQTIT+SHSGNLGLNNT
Subjt:  AFLLQTAQIYFNFTLNYSIYQIQVNFDDLTSQLRSGLHLSLYENLYVSLSNARGSTVDAPTIIQSSVLMAIGTNSSKKRLKQLAQTITDSHSGNLGLNNT

Query:  IFGKVKQVRLSSVLNHSLSGGANARSPAPSPLPHSH----------------------HHHHHHHHHHHHHHHHHHHHHHHDASYSPSPGTEEHAHMPHN
        +FGKVKQVRLSSVLNHSLSGG  ARSP+P+PLPHSH                      HHHHHHHHHHHHHHHHHH HHH DA+YSPSPGTEEH H P N
Subjt:  IFGKVKQVRLSSVLNHSLSGGANARSPAPSPLPHSH----------------------HHHHHHHHHHHHHHHHHHHHHHHDASYSPSPGTEEHAHMPHN

Query:  GVSSAP--GSAPVESPTPKKRNDEATPPACQYGYKRSSTEVRKR-HLGPIHSLSSPPSSPYFRVGPPAPVSGSISASSPLSEVVLSNVQPPDKGSRHAEN
        G+SSAP  GS+PVESP  KKRN EATPP  +YGYK  ST+VRKR HLG I S SSPPSSPY RVG PAPV+ SISASSPL  V LSNVQPP+KG      
Subjt:  GVSSAP--GSAPVESPTPKKRNDEATPPACQYGYKRSSTEVRKR-HLGPIHSLSSPPSSPYFRVGPPAPVSGSISASSPLSEVVLSNVQPPDKGSRHAEN

Query:  FERRAPSVLPSQFSSSVGVRVYTIRWTLLLFLLVWH
         +R APSVLP QFS SVGVRV+TIRWTL LFL+VWH
Subjt:  FERRAPSVLPSQFSSSVGVRVYTIRWTLLLFLLVWH

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G10790.1 BEST Arabidopsis thaliana protein match is: hydroxyproline-rich glycoprotein family protein (TAIR:AT3G56590.2)3.1e-3733.65Show/hide
Query:  LSSSELSDGNVGSRCGAGIRRLIAVRCVFFLLLSAAVFISAFFWLPPFLSDGDWPDRALDSAYRDHEIVASFHAWKPVSFLQNHILELEDNIFGEIPVP-
        L + E S  + G  C +   RL+ +RC+  L+LS A+ +SA FWL P  S  ++  +A  +   +  + ASF   KPVS +  H  ++E +I   I +  
Subjt:  LSSSELSDGNVGSRCGAGIRRLIAVRCVFFLLLSAAVFISAFFWLPPFLSDGDWPDRALDSAYRDHEIVASFHAWKPVSFLQNHILELEDNIFGEIPVP-

Query:  FVKVVILSLQSLGGLNVTKIVFAVDPDAKYSKIPPTSQSLIKEIFETMFINEPPLTLTAALFGNTSLFEVLKFPGGITIIPPQSAFLLQTAQIYFNFTLN
          KV +LSL   G  N T + FAV P     +I   S SL++  F  +F     L LT + FG  + F+VLKFPGGIT+ P + A +   A + F+ T+ 
Subjt:  FVKVVILSLQSLGGLNVTKIVFAVDPDAKYSKIPPTSQSLIKEIFETMFINEPPLTLTAALFGNTSLFEVLKFPGGITIIPPQSAFLLQTAQIYFNFTLN

Query:  YSIYQIQVNFDDLTSQLRSGLHLSLYENLYVSLSNARGSTVDAPTIIQSSVLMAIGTNSSKKRLKQLAQTITDSHSGNLGLNNTIFGKVKQVRLSSVLNH
         SI  +Q   D L       L L  YE+++  L+N +GST+  P   Q  V   +      +RL    Q I  S + NLGL+  +FG+VK +  S+ L+ 
Subjt:  YSIYQIQVNFDDLTSQLRSGLHLSLYENLYVSLSNARGSTVDAPTIIQSSVLMAIGTNSSKKRLKQLAQTITDSHSGNLGLNNTIFGKVKQVRLSSVLNH

Query:  SLSGGANARSPAPSP
         +       +PAP+P
Subjt:  SLSGGANARSPAPSP

AT3G10810.1 zinc finger (C3HC4-type RING finger) family protein2.8e-9947.6Show/hide
Query:  MGKTEEEQPLPV--GLSSSELSDGNVGSRCGAGIRRLIAVRCVFFLLLSAAVFISAFFWLPPFLSDGDWPDRALDSAYRDHEIVASFHAWKPVSFLQNHI
        MGKTE++  L V  G ++ + +  N    C   I   +  +C+F LLLS A+F+SA F L PF  D +  D  LD  +R H IVASF   +  SFL  + 
Subjt:  MGKTEEEQPLPV--GLSSSELSDGNVGSRCGAGIRRLIAVRCVFFLLLSAAVFISAFFWLPPFLSDGDWPDRALDSAYRDHEIVASFHAWKPVSFLQNHI

Query:  LELEDNIFGEIPVPFVKVVILSLQSLGGLNVTKIVFAVDPDAKYSKIPPTSQSLIKEIFETMFINEPPLTLTAALFGNTSLFEVLKFPGGITIIPPQSAF
        L+L+++IF E+    +KV IL+++    LN+TK+VF +DPD  Y +I P S S IKE+FE++ IN+  L LT +LFG T LFEVLKFPGGIT+IPPQSAF
Subjt:  LELEDNIFGEIPVPFVKVVILSLQSLGGLNVTKIVFAVDPDAKYSKIPPTSQSLIKEIFETMFINEPPLTLTAALFGNTSLFEVLKFPGGITIIPPQSAF

Query:  LLQTAQIYFNFTLNYSIYQIQVNFDDLTSQLRSGLHLSLYENLYVSLSNARGSTVDAPTIIQSSVLMAIGTNSSKKRLKQLAQTITDSHSGNLGLNNTIF
         LQ  +I FNFTLNYSI+QIQ+NF+ L SQL++GL+L+ YENLYVSLSN+ GSTV  PT + SSVL+ +GT++S  RLKQL  TIT S S NLGLNNTIF
Subjt:  LLQTAQIYFNFTLNYSIYQIQVNFDDLTSQLRSGLHLSLYENLYVSLSNARGSTVDAPTIIQSSVLMAIGTNSSKKRLKQLAQTITDSHSGNLGLNNTIF

Query:  GKVKQVRLSSVLNHSLSGGANARSPAPSPLPHSHHHHHHHHHHHHHHHHHHHHHHHHDASYSPSPGTEEHAHMPHNGVSSAPGSAPVESPTPKKRNDEA-
        GKVKQVRLSS L +  S  ++ +SP+PSP PHS HHHHHHHHHHHHHHHHH+HHHHH  + SP                 AP  +PV SP P +    A 
Subjt:  GKVKQVRLSSVLNHSLSGGANARSPAPSPLPHSHHHHHHHHHHHHHHHHHHHHHHHHDASYSPSPGTEEHAHMPHNGVSSAPGSAPVESPTPKKRNDEA-

Query:  -TPPACQYGYKRSSTEVRKRHLGPIHSLSSPPSS---PYFRVGPPAPVSGS----ISASSPLSEVVLSN-VQPPDKGSRHAENFERRAPSVLPSQFSSSV
          PP C  G +    E R +      S  +P  S   P+ ++  PAP+S +    +  S+PL  VV ++  QPP    R     E   P   P   SS++
Subjt:  -TPPACQYGYKRSSTEVRKRHLGPIHSLSSPPSS---PYFRVGPPAPVSGS----ISASSPLSEVVLSN-VQPPDKGSRHAENFERRAPSVLPSQFSSSV

Query:  GVRVYTIRWTLLLFLLVWHWH
         V +  + W +LL L+V   H
Subjt:  GVRVYTIRWTLLLFLLVWHWH

AT3G56590.1 hydroxyproline-rich glycoprotein family protein2.2e-9947.59Show/hide
Query:  MGK-TEEEQPLPV--GLSSSELSDGNVGSRCGA--GIRRLIAVRCVFFLLLSAAVFISAFFWLPPFLSDGDWPDRALDSAYRDHEIVASFHAWKPVSFLQ
        MGK T EEQ LPV  G +S+  + G   S C     I    ++RCV  L  SAAVF+SA FWLPPFL   D  D  LD  ++DH IVASF   KP+SF++
Subjt:  MGK-TEEEQPLPV--GLSSSELSDGNVGSRCGA--GIRRLIAVRCVFFLLLSAAVFISAFFWLPPFLSDGDWPDRALDSAYRDHEIVASFHAWKPVSFLQ

Query:  NHILELEDNIFGEIPVPFVKVVILSLQSLGGLNVTKIVFAVDPDAKYSKIPPTSQSLIKEIFETMFINEPPLTLTAALFGNTSLFEVLKFPGGITIIPPQ
        +++++LE++I  EI  P  KVV+L+L+ LG LN T ++FA+DP+ + SKIP   +SLIK  FET+   +    LT +LFG    FEVLKFPGGIT+IPPQ
Subjt:  NHILELEDNIFGEIPVPFVKVVILSLQSLGGLNVTKIVFAVDPDAKYSKIPPTSQSLIKEIFETMFINEPPLTLTAALFGNTSLFEVLKFPGGITIIPPQ

Query:  SAFLLQTAQIYFNFTLNYSIYQIQVNFDDLTSQLRSGLHLSLYENLYVSLSNARGSTVDAPTIIQSSVLMAIGTNSSKKRLKQLAQTITDSHSGNLGLNN
          F LQ AQ+ FNFTLN+SIYQIQ NF++L SQL+ G++L+ YENLY++LSN+RGSTV  PTI+ SSVL+  G++S   RLKQLAQTIT SHS NLGLN+
Subjt:  SAFLLQTAQIYFNFTLNYSIYQIQVNFDDLTSQLRSGLHLSLYENLYVSLSNARGSTVDAPTIIQSSVLMAIGTNSSKKRLKQLAQTITDSHSGNLGLNN

Query:  TIFGKVKQVRLSSVLNHSLSGGANARSPAPSPLPHSHHHHHHHHHHHHHHHHHHHHHHHHDASYSPSPGTEEHAHMPHNGVSSAPGSAPVE-SPTPKKRN
        T+FGKVKQVRLSS+L HS    A + +P+PSP P +H + HHH HHHHHHH             +P P        P  G   AP SAP + SP P +  
Subjt:  TIFGKVKQVRLSSVLNHSLSGGANARSPAPSPLPHSHHHHHHHHHHHHHHHHHHHHHHHHDASYSPSPGTEEHAHMPHNGVSSAPGSAPVE-SPTPKKRN

Query:  DEATPPACQYGYKRSSTEVRKRHLGPIHSLSSPPSSPYFRVGPPAPVSG-----SISASSPLSEVVLSNVQPPDKGSRHAENFERRAPSVLPSQFSSS
             P C Y  +R        H    H+    P+    +  PPAP        +I  SSPL  VV +++ PP K S  +E    ++PS  P+   SS
Subjt:  DEATPPACQYGYKRSSTEVRKRHLGPIHSLSSPPSSPYFRVGPPAPVSG-----SISASSPLSEVVLSNVQPPDKGSRHAENFERRAPSVLPSQFSSS

AT3G56590.2 hydroxyproline-rich glycoprotein family protein1.2e-10047.6Show/hide
Query:  MGK-TEEEQPLPV--GLSSSELSDGNVGSRCGA--GIRRLIAVRCVFFLLLSAAVFISAFFWLPPFLSDGDWPDRALDSAYRDHEIVASFHAWKPVSFLQ
        MGK T EEQ LPV  G +S+  + G   S C     I    ++RCV  L  SAAVF+SA FWLPPFL   D  D  LD  ++DH IVASF   KP+SF++
Subjt:  MGK-TEEEQPLPV--GLSSSELSDGNVGSRCGA--GIRRLIAVRCVFFLLLSAAVFISAFFWLPPFLSDGDWPDRALDSAYRDHEIVASFHAWKPVSFLQ

Query:  NHILELEDNIFGEIPVPFVKVVILSLQSLGGLNVTKIVFAVDPDAKYSKIPPTSQSLIKEIFETMFINEPPLTLTAALFGNTSLFEVLKFPGGITIIPPQ
        +++++LE++I  EI  P  KVV+L+L+ LG LN T ++FA+DP+ + SKIP   +SLIK  FET+   +    LT +LFG    FEVLKFPGGIT+IPPQ
Subjt:  NHILELEDNIFGEIPVPFVKVVILSLQSLGGLNVTKIVFAVDPDAKYSKIPPTSQSLIKEIFETMFINEPPLTLTAALFGNTSLFEVLKFPGGITIIPPQ

Query:  SAFLLQTAQIYFNFTLNYSIYQIQVNFDDLTSQLRSGLHLSLYENLYVSLSNARGSTVDAPTIIQSSVLMAIGTNSSKKRLKQLAQTITDSHSGNLGLNN
          F LQ AQ+ FNFTLN+SIYQIQ NF++L SQL+ G++L+ YENLY++LSN+RGSTV  PTI+ SSVL+  G++S   RLKQLAQTIT SHS NLGLN+
Subjt:  SAFLLQTAQIYFNFTLNYSIYQIQVNFDDLTSQLRSGLHLSLYENLYVSLSNARGSTVDAPTIIQSSVLMAIGTNSSKKRLKQLAQTITDSHSGNLGLNN

Query:  TIFGKVKQVRLSSVLNHSLSGGANARSPAPSPLPHSHHHHHHHHHHHHHHHHHHHHHHHHDASYSPSPGTEEHAHMPHNGVSSAPGSAPVE-SPTPKKRN
        T+FGKVKQVRLSS+L HS    A + +P+PSP P +H + HHH HHHHHHH             +P P        P  G   AP SAP + SP P +  
Subjt:  TIFGKVKQVRLSSVLNHSLSGGANARSPAPSPLPHSHHHHHHHHHHHHHHHHHHHHHHHHDASYSPSPGTEEHAHMPHNGVSSAPGSAPVE-SPTPKKRN

Query:  DEATPPACQYGYKRSSTEVRKRHLGPIHSLSSPPSSPYFRVGPPAPVSG-----SISASSPLSEVVLSNVQPPDKGSRHAENFERRAPSVLPSQFSSSVG
             P C Y  +R        H    H+    P+    +  PPAP        +I  SSPL  VV +++ PP K S  +E    ++PS  P+  S+S+G
Subjt:  DEATPPACQYGYKRSSTEVRKRHLGPIHSLSSPPSSPYFRVGPPAPVSG-----SISASSPLSEVVLSNVQPPDKGSRHAENFERRAPSVLPSQFSSSVG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGGAAGACTGAGGAAGAGCAGCCGCTGCCGGTTGGGTTGAGCTCCTCCGAGCTCTCTGATGGGAATGTGGGAAGCCGGTGCGGCGCTGGGATTCGTAGACTGATTGC
GGTGAGATGTGTCTTCTTCCTTCTACTGTCGGCGGCTGTGTTCATTTCTGCTTTCTTTTGGCTGCCGCCGTTCCTCTCCGATGGAGATTGGCCCGATCGGGCTCTTGATT
CTGCTTATAGAGATCATGAAATAGTAGCAAGTTTTCATGCTTGGAAGCCAGTTTCGTTTCTGCAAAACCATATTTTGGAGCTTGAGGATAATATTTTTGGAGAAATACCT
GTACCTTTTGTCAAGGTGGTCATCCTCTCGCTGCAATCATTAGGTGGACTGAACGTAACAAAGATTGTTTTTGCAGTAGATCCTGACGCCAAATATTCAAAAATACCCCC
AACTTCTCAAAGTTTAATCAAGGAGATCTTTGAAACAATGTTTATAAATGAACCACCTCTCACATTGACTGCAGCATTATTTGGCAATACATCCTTATTTGAGGTGTTGA
AATTTCCTGGAGGAATAACTATTATTCCTCCTCAGAGTGCATTTCTTCTGCAGACAGCACAGATCTATTTCAACTTTACGTTAAATTATTCTATTTATCAAATTCAAGTG
AATTTCGATGATCTTACAAGCCAGCTGAGGTCAGGATTACATCTATCTCTTTATGAGAATTTATATGTTAGTCTATCGAATGCAAGAGGTTCAACAGTGGATGCCCCCAC
CATTATTCAATCATCTGTTCTAATGGCAATTGGAACTAATTCATCGAAAAAAAGACTAAAACAGCTGGCTCAAACCATCACAGATTCCCATTCAGGAAATCTTGGCCTGA
ATAACACTATATTTGGTAAGGTCAAGCAAGTTCGTCTTTCATCTGTCCTGAACCACTCTCTTAGTGGTGGTGCAAATGCACGGTCACCTGCACCTTCTCCTCTGCCTCAT
TCTCACCACCACCACCATCACCACCATCACCACCATCACCACCATCATCACCATCACCACCATCACCACCATCATGATGCATCATATTCACCGAGTCCTGGAACAGAGGA
GCACGCACATATGCCACATAATGGGGTGTCGTCTGCTCCTGGATCTGCCCCAGTGGAAAGTCCCACTCCGAAGAAGAGAAATGATGAAGCAACTCCGCCTGCTTGTCAAT
ATGGATATAAACGGTCTTCAACAGAAGTCAGAAAACGTCATTTAGGCCCTATTCATTCACTAAGCAGTCCTCCATCATCGCCATACTTTCGAGTAGGTCCACCAGCACCT
GTTTCTGGCTCAATTTCTGCTTCAAGTCCACTGTCAGAGGTAGTTTTATCTAATGTTCAGCCTCCTGATAAAGGAAGCAGACACGCCGAAAATTTTGAAAGAAGAGCCCC
CTCAGTTTTACCTTCACAATTTTCTTCTTCTGTAGGTGTTCGTGTTTATACAATTCGATGGACACTTCTGCTGTTTCTTCTTGTATGGCATTGGCATATATAA
mRNA sequenceShow/hide mRNA sequence
GACCAAACAAAATATATCTTAGAAGAACATGACCCAAAATGAACTGAAGTTTTTAACCATCTTTAACGATATCGAACTAATCTTGTACTCAACTCTGTTAAATATTTGAT
AATTAAAATAGAGAAAGGATTTTTACCAATTATTTTTCTTTTAACATTATTATTATTATTATTATTATTATTAAAGTGGGTGTGACGAGAAGGCCATACCCATCTTCCAC
GCCTCTTCATTAAGTGCGCAATTAGTGCTTTCTTCTTTATTTATTTTCAAATTAATTAATACACAGATCGCTGTTTTTGCCGCCTCTTCTTCCTCCTTCTTCTCACTTTG
CTTCCCTTCTACTGTCTTGTCAACATCTGTTCAATTTCTCTTCCTCTTCAGAGTTTCAAATTTCTATCAGTAAATTTGGAAAACAACAAAAACCCCATTGACAGATCAAA
GTTGAAAGATAAACCCACCACTAAATCCCATTACTCCAATAAAAGACCATGTCCCCTGTCACTTGCAGTTGCTGCGCCAGTTAAATTAAACCCCCTTTTCTCTCTCGAGC
CCTAGAAATTTTCAAATTTGATTTTGGGTTTGTGGTGGATTAACCCCACGAGGTGGGATTCTGAGGTTTTATCAGTAATGGGTGAAGATAATTCGGACCCAGTTGAGGGA
GGCGACGATGGAGCTTGTTGAATCAAACCCAGTTCCCAATTCTCATGCATTGCTTCCATGGGGAAGACTGAGGAAGAGCAGCCGCTGCCGGTTGGGTTGAGCTCCTCCGA
GCTCTCTGATGGGAATGTGGGAAGCCGGTGCGGCGCTGGGATTCGTAGACTGATTGCGGTGAGATGTGTCTTCTTCCTTCTACTGTCGGCGGCTGTGTTCATTTCTGCTT
TCTTTTGGCTGCCGCCGTTCCTCTCCGATGGAGATTGGCCCGATCGGGCTCTTGATTCTGCTTATAGAGATCATGAAATAGTAGCAAGTTTTCATGCTTGGAAGCCAGTT
TCGTTTCTGCAAAACCATATTTTGGAGCTTGAGGATAATATTTTTGGAGAAATACCTGTACCTTTTGTCAAGGTGGTCATCCTCTCGCTGCAATCATTAGGTGGACTGAA
CGTAACAAAGATTGTTTTTGCAGTAGATCCTGACGCCAAATATTCAAAAATACCCCCAACTTCTCAAAGTTTAATCAAGGAGATCTTTGAAACAATGTTTATAAATGAAC
CACCTCTCACATTGACTGCAGCATTATTTGGCAATACATCCTTATTTGAGGTGTTGAAATTTCCTGGAGGAATAACTATTATTCCTCCTCAGAGTGCATTTCTTCTGCAG
ACAGCACAGATCTATTTCAACTTTACGTTAAATTATTCTATTTATCAAATTCAAGTGAATTTCGATGATCTTACAAGCCAGCTGAGGTCAGGATTACATCTATCTCTTTA
TGAGAATTTATATGTTAGTCTATCGAATGCAAGAGGTTCAACAGTGGATGCCCCCACCATTATTCAATCATCTGTTCTAATGGCAATTGGAACTAATTCATCGAAAAAAA
GACTAAAACAGCTGGCTCAAACCATCACAGATTCCCATTCAGGAAATCTTGGCCTGAATAACACTATATTTGGTAAGGTCAAGCAAGTTCGTCTTTCATCTGTCCTGAAC
CACTCTCTTAGTGGTGGTGCAAATGCACGGTCACCTGCACCTTCTCCTCTGCCTCATTCTCACCACCACCACCATCACCACCATCACCACCATCACCACCATCATCACCA
TCACCACCATCACCACCATCATGATGCATCATATTCACCGAGTCCTGGAACAGAGGAGCACGCACATATGCCACATAATGGGGTGTCGTCTGCTCCTGGATCTGCCCCAG
TGGAAAGTCCCACTCCGAAGAAGAGAAATGATGAAGCAACTCCGCCTGCTTGTCAATATGGATATAAACGGTCTTCAACAGAAGTCAGAAAACGTCATTTAGGCCCTATT
CATTCACTAAGCAGTCCTCCATCATCGCCATACTTTCGAGTAGGTCCACCAGCACCTGTTTCTGGCTCAATTTCTGCTTCAAGTCCACTGTCAGAGGTAGTTTTATCTAA
TGTTCAGCCTCCTGATAAAGGAAGCAGACACGCCGAAAATTTTGAAAGAAGAGCCCCCTCAGTTTTACCTTCACAATTTTCTTCTTCTGTAGGTGTTCGTGTTTATACAA
TTCGATGGACACTTCTGCTGTTTCTTCTTGTATGGCATTGGCATATATAACCAAGGAGATAAAACCTACATGCATATGTCAGGATACCAGTAACGACTGGACTCATCATT
CATATAAAAGTTGTGATAGCGAGACGAACGCGAAGGCGTGCTCCTGCTAGAGTCGAGATAATGATGTTTCCATGTGTAAATATCAGTTAAGCAAGTTGGAAGATGCATTT
TTCCAGGTCAAAGTCACAGAGGTGGCAGGCCTTTGGTAAGATGAAGAATTGGTGTTGTTTTTTTTTTTTGTGTATTATTTCTCTTCTTCACAAAATGTGAGGAAAGAAAT
CAGCAAATGTATCTGATTCCATCTTGATCTCAACTTCTATTATAAAGTCAATCGAGCAATTACAATTTTTTCACTGCAGAAAAATCCTCATTTAAAATAAAGATAAAAAT
GTGTCTGATGTGACTAGTGATTAGGTAACTCAAGTTCTTGGGTATGTGTAAGGTTTATACCTGATTTATTGTTTTGGTTTTAAAC
Protein sequenceShow/hide protein sequence
MGKTEEEQPLPVGLSSSELSDGNVGSRCGAGIRRLIAVRCVFFLLLSAAVFISAFFWLPPFLSDGDWPDRALDSAYRDHEIVASFHAWKPVSFLQNHILELEDNIFGEIP
VPFVKVVILSLQSLGGLNVTKIVFAVDPDAKYSKIPPTSQSLIKEIFETMFINEPPLTLTAALFGNTSLFEVLKFPGGITIIPPQSAFLLQTAQIYFNFTLNYSIYQIQV
NFDDLTSQLRSGLHLSLYENLYVSLSNARGSTVDAPTIIQSSVLMAIGTNSSKKRLKQLAQTITDSHSGNLGLNNTIFGKVKQVRLSSVLNHSLSGGANARSPAPSPLPH
SHHHHHHHHHHHHHHHHHHHHHHHHDASYSPSPGTEEHAHMPHNGVSSAPGSAPVESPTPKKRNDEATPPACQYGYKRSSTEVRKRHLGPIHSLSSPPSSPYFRVGPPAP
VSGSISASSPLSEVVLSNVQPPDKGSRHAENFERRAPSVLPSQFSSSVGVRVYTIRWTLLLFLLVWHWHI