CuGenDBv2

MS012047 (gene) of Bitter gourd (TR) v1 genome

Gene ID: MS012047
Organism: Momordica charantia cv. TR (Bitter gourd (TR) v1)
Description: TSC22 domain family protein 1-like isoform X1
Genome location: scaffold54:133543..137485
RNA-Seq Expression: MS012047
Synteny: MS012047
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
InterPro domains: NA
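
The genome location above is written as `scaffold:start..end`. A minimal sketch of splitting that string and computing the span, assuming the coordinates are 1-based and inclusive (the record does not state this); the helper names `parse_location` and `span_length` are hypothetical and not part of CuGenDB:

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split 'scaffold:start..end' into (scaffold, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    scaffold, start, end = match.groups()
    return scaffold, int(start), int(end)


def span_length(start: int, end: int) -> int:
    # Assumption: 1-based inclusive coordinates, so both endpoints are counted.
    return end - start + 1


scaffold, start, end = parse_location("scaffold54:133543..137485")
print(scaffold, start, end, span_length(start, end))
```

If the coordinates turn out to be 0-based or half-open, only `span_length` needs to change.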


Homology
GenBank top hits (e value, %identity, alignment)
KAA0053895.1 Filamentous hemagglutinin [Cucumis melo var. makuwa] (e value: 3.3e-203; %identity: 78.18)
Query:  EEQPLPVGLSSSELSDRNVGSRCGAG----IRRLIAVRCVFFLLLSAAVFISAFFWLPPFLSDGDWPDRALDSAYRDHEIVASFHAWKPVSFLQNHILEL
        EEQPLPVG+SSSELSDRNV +RCG G    IR+LIAVRCVFFLLLSAAVF+SA FWLPPFLS G+WPDR +DSAYRDH+IVASFHAWKPV FLQNHI EL
Subjt:  EEQPLPVGLSSSELSDRNVGSRCGAG----IRRLIAVRCVFFLLLSAAVFISAFFWLPPFLSDGDWPDRALDSAYRDHEIVASFHAWKPVSFLQNHILEL

Query:  EDNIFGEIPVPFVK---------------------------VVILSLQSLGGLNVTKIVFAVDPDAKYSKIPPTSQSLIKEIFETMFINEPPLTLTAALF
        EDNIFGEIP+P VK                           V ILSLQSL G NVTKIVFAVD DAKYSKIPPTSQSLIKE FET+ INEPPL L  +LF
Subjt:  EDNIFGEIPVPFVK---------------------------VVILSLQSLGGLNVTKIVFAVDPDAKYSKIPPTSQSLIKEIFETMFINEPPLTLTAALF

Query:  GNTSLFEVLKFPGGITIIPPQSAFLLQTAQIYFNFTLNYSIYQIQVNFDDLTSQLRSGLHLSLYENLYVSLSNARGSTVDAPTIIQSSVLMAIGTN--SS
        GNTSLFEVLKFPGGITIIPPQSAFLLQTAQIYFNFTLNYSIYQIQVNFDDL+SQLRSGL LS YENLYVSLSN RGST+DAPT++QSSVLMAIGTN  SS
Subjt:  GNTSLFEVLKFPGGITIIPPQSAFLLQTAQIYFNFTLNYSIYQIQVNFDDLTSQLRSGLHLSLYENLYVSLSNARGSTVDAPTIIQSSVLMAIGTN--SS

Query:  KKRLKQLAQTITDSHSGNLGLNNTIFGKVKQVRLSSVLNHSLSGGANARSPAPSPLPHSHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHDASYSPSPG
        K+RLKQLA TIT+SHSGNLGLNNT+FGKVKQVRL S LNHSL GG NA SP+P+PLPHSHHHHHHHHHHHHHHH  HHHHHHHHHHH+HH  A+YSPSPG
Subjt:  KKRLKQLAQTITDSHSGNLGLNNTIFGKVKQVRLSSVLNHSLSGGANARSPAPSPLPHSHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHDASYSPSPG

Query:  TEEHAHMPHNGVSSAP--GSAPVESPTPKKRNDEATPPACQYGYKRSSTEVRKR-HLGPIHSLSSPPSSPYFRVGPPAPVSGSISASSPLSEVVLSNVQP
        TEEH H P NGVSSAP  GS+P+E PT +KRN EATPPA +YGYKRSST++RK+ HLGPI S SS P SPY RVG PAPVS SISASSPLS VVLSNVQP
Subjt:  TEEHAHMPHNGVSSAP--GSAPVESPTPKKRNDEATPPACQYGYKRSSTEVRKR-HLGPIHSLSSPPSSPYFRVGPPAPVSGSISASSPLSEVVLSNVQP

Query:  PDKGSRHAENFERRAPSVLPSQFSCEY
        P+ GS HAENFER +PSVLP QFSCEY
Subjt:  PDKGSRHAENFERRAPSVLPSQFSCEY

XP_004136773.3 uncharacterized protein LOC101213172 isoform X1 [Cucumis sativus] (e value: 1.8e-204; %identity: 82.39)
Query:  EEQPLPVGLSSSELSDRNVGSRCGAG----IRRLIAVRCVFFLLLSAAVFISAFFWLPPFLSDGDWPDRALDSAYRDHEIVASFHAWKPVSFLQNHILEL
        EEQPLPVG SSSELSDRNV +RCG G    IRRLIAVRCVFFLLLSAAVF+SA FWLPPFLS G+WPDR +DSAYRDH+IVASFHA KPV FLQ HI EL
Subjt:  EEQPLPVGLSSSELSDRNVGSRCGAG----IRRLIAVRCVFFLLLSAAVFISAFFWLPPFLSDGDWPDRALDSAYRDHEIVASFHAWKPVSFLQNHILEL

Query:  EDNIFGEIPVPFVKVVILSLQSLGGLNVTKIVFAVDPDAKYSKIPPTSQSLIKEIFETMFINEPPLTLTAALFGNTSLFEVLKFPGGITIIPPQSAFLLQ
        EDNIFGEIP+P VKV ILSLQSLGG NVTKIVFAVD DAKYSKIPPTSQSLIKE FET+ INEPPL L  +LFGNTSLFEVLKFPGGITIIPPQSAFLLQ
Subjt:  EDNIFGEIPVPFVKVVILSLQSLGGLNVTKIVFAVDPDAKYSKIPPTSQSLIKEIFETMFINEPPLTLTAALFGNTSLFEVLKFPGGITIIPPQSAFLLQ

Query:  TAQIYFNFTLNYSIYQIQVNFDDLTSQLRSGLHLSLYENLYVSLSNARGSTVDAPTIIQSSVLMAIGTN--SSKKRLKQLAQTITDSHSGNLGLNNTIFG
        TAQIYFNFTLNYSIYQIQVNFDDL+SQLRSGL LS YENLYVSLSN RGST+DAPT++QSSVLMAIGTN  SSK+RLKQLA TIT+SHSGNLGLNNT+FG
Subjt:  TAQIYFNFTLNYSIYQIQVNFDDLTSQLRSGLHLSLYENLYVSLSNARGSTVDAPTIIQSSVLMAIGTN--SSKKRLKQLAQTITDSHSGNLGLNNTIFG

Query:  KVKQVRLSSVLNHSLSGGANARSPAPSPLPHSHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHDASYSPSPGTEEHAHMPHNGVSSAP--GSAPVESPTPK
        KVKQVRL S LNHSL GG NARSP+P+PLPHS   HHH HHHHHHHHHHHHHHHHHHHHHH DA+YSPSPGTEEH H P NGVSSAP  GS+P+E PT +
Subjt:  KVKQVRLSSVLNHSLSGGANARSPAPSPLPHSHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHDASYSPSPGTEEHAHMPHNGVSSAP--GSAPVESPTPK

Query:  KRNDEATPPACQYGYKRSSTEVRKRHLGPIHSLSSPPSSPYFRVGPPAPVSGSISASSPLSEVVLSNVQPPDKGSRHAENFERRAPSVLPSQFS
        KRN EATPPA +YGYKRS T++RK +LGPI S SS PSSPY RVG PAPVS SISASSPLS VVLSNVQPP+ GS HAENFER +PSVLP QFS
Subjt:  KRNDEATPPACQYGYKRSSTEVRKRHLGPIHSLSSPPSSPYFRVGPPAPVSGSISASSPLSEVVLSNVQPPDKGSRHAENFERRAPSVLPSQFS

XP_008443610.1 PREDICTED: uncharacterized protein LOC103487165 [Cucumis melo] (e value: 1.4e-204; %identity: 79.5)
Query:  EEQPLPVGLSSSELSDRNVGSRCGAG----IRRLIAVRCVFFLLLSAAVFISAFFWLPPFLSDGDWPDRALDSAYRDHEIVASFHAWKPVSFLQNHILEL
        EEQPLPVG+SSSELSDRNV +RCG G    IR+LIAVRCVFFLLLSAAVF+SA FWLPPFLS G+WPDR +DSAYRDH+IVASFHAWKPV FLQNHI EL
Subjt:  EEQPLPVGLSSSELSDRNVGSRCGAG----IRRLIAVRCVFFLLLSAAVFISAFFWLPPFLSDGDWPDRALDSAYRDHEIVASFHAWKPVSFLQNHILEL

Query:  EDNIFGEIPVPFVKVVILSLQSLGGLNVTKIVFAVDPDAKYSKIPPTSQSLIKEIFETMFINEPPLTLTAALFGNTSLFEVLKFPGGITIIPPQSAFLLQ
        EDNIFGEIP+P VKV ILSLQSL G NVTKIVFAVD DAKYSKIPPTSQSLIKE FET+ INEPPL L  +LFGNTSLFEVLKFPGGITIIPPQSAFLLQ
Subjt:  EDNIFGEIPVPFVKVVILSLQSLGGLNVTKIVFAVDPDAKYSKIPPTSQSLIKEIFETMFINEPPLTLTAALFGNTSLFEVLKFPGGITIIPPQSAFLLQ

Query:  TAQIYFNFTLNYSIYQIQVNFDDLTSQLRSGLHLSLYENLYVSLSNARGSTVDAPTIIQSSVLMAIGTN--SSKKRLKQLAQTITDSHSGNLGLNNTIFG
        TAQIYFNFTLNYSIYQIQVNFDDL+SQLRSGL LS YENLYVSLSN RGST+DAPT++QSSVLMAIGTN  SSK+RLKQLA TIT+SHSGNLGLNNT+FG
Subjt:  TAQIYFNFTLNYSIYQIQVNFDDLTSQLRSGLHLSLYENLYVSLSNARGSTVDAPTIIQSSVLMAIGTN--SSKKRLKQLAQTITDSHSGNLGLNNTIFG

Query:  KVKQVRLSSVLNHSLSGGANARSPAPSPLPHSHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH----------------------DASYSPSPGTEEHAHM
        KVKQVRL S LNHSL GG NA SP+P+PLPHSHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH                       A+YSPSPGTEEH H 
Subjt:  KVKQVRLSSVLNHSLSGGANARSPAPSPLPHSHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH----------------------DASYSPSPGTEEHAHM

Query:  PHNGVSSAP--GSAPVESPTPKKRNDEATPPACQYGYKRSSTEVRKR-HLGPIHSLSSPPSSPYFRVGPPAPVSGSISASSPLSEVVLSNVQPPDKGSRH
        P NGVSSAP  GS+P+E PT +KRN EATPPA +YGYKRSST++RK+ HLGPI S SS P SPY RVG PAPVS SISASSPLS VVLSNVQPP+ GS H
Subjt:  PHNGVSSAP--GSAPVESPTPKKRNDEATPPACQYGYKRSSTEVRKR-HLGPIHSLSSPPSSPYFRVGPPAPVSGSISASSPLSEVVLSNVQPPDKGSRH

Query:  AENFERRAPSVLPSQFS
        AENFER +PSVLP QFS
Subjt:  AENFERRAPSVLPSQFS

XP_022147793.1 TSC22 domain family protein 1-like isoform X1 [Momordica charantia] (e value: 9.9e-256; %identity: 98.56)
Query:  EEQPLPVGLSSSELSDRNVGSRCGAGIRRLIAVRCVFFLLLSAAVFISAFFWLPPFLSDGDWPDRALDSAYRDHEIVASFHAWKPVSFLQNHILELEDNI
        EEQPLPVGLSSSELSD NVGSRCGAGIRRLIAVRCVFFLLLSAAVFISAFFWLPPFLSDGDWPDRALDSAYRDHEIVASFHAWKPVSFLQNHILELEDNI
Subjt:  EEQPLPVGLSSSELSDRNVGSRCGAGIRRLIAVRCVFFLLLSAAVFISAFFWLPPFLSDGDWPDRALDSAYRDHEIVASFHAWKPVSFLQNHILELEDNI

Query:  FGEIPVPFVKVVILSLQSLGGLNVTKIVFAVDPDAKYSKIPPTSQSLIKEIFETMFINEPPLTLTAALFGNTSLFEVLKFPGGITIIPPQSAFLLQTAQI
        FGEIPVPFVKVVILSLQSLGGLNVTKIVFAVDPDAKYSKIPPTSQSLIKEIFETMFINEPPLTLTAALFGNTSLFEVLKFPGGITIIPPQSAFLLQTAQI
Subjt:  FGEIPVPFVKVVILSLQSLGGLNVTKIVFAVDPDAKYSKIPPTSQSLIKEIFETMFINEPPLTLTAALFGNTSLFEVLKFPGGITIIPPQSAFLLQTAQI

Query:  YFNFTLNYSIYQIQVNFDDLTSQLRSGLHLSLYENLYVSLSNARGSTVDAPTIIQSSVLMAIGTNSSKKRLKQLAQTITDSHSGNLGLNNTIFGKVKQVR
        YFNFTLNYSIYQIQVNFDDLTSQLRSGLHLSLYENLYVSLSNARGSTVDAPTIIQSSVLMAIGTNSSKKRLKQLAQTITDSHSGNLGLNNTIFGKVKQVR
Subjt:  YFNFTLNYSIYQIQVNFDDLTSQLRSGLHLSLYENLYVSLSNARGSTVDAPTIIQSSVLMAIGTNSSKKRLKQLAQTITDSHSGNLGLNNTIFGKVKQVR

Query:  LSSVLNHSLSGGANARSPAPSPLPHSHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHDASYSPSPGTEEHAHMPHNGVSSAPGSAPVESPTPKKRNDEATP
        LSSVLNHSLSGGANARSPAPSPLPHS      HHHHHHHHHHHHHHHHHHHHHHHHDASYSPSPGTEEHAHMPHNGVSSAPGSAPVESPTPKKRNDEATP
Subjt:  LSSVLNHSLSGGANARSPAPSPLPHSHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHDASYSPSPGTEEHAHMPHNGVSSAPGSAPVESPTPKKRNDEATP

Query:  PACQYGYKRSSTEVRKRHLGPIHSLSSPPSSPYFRVGPPAPVSGSISASSPLSEVVLSNVQPPDKGSRHAENFERRAPSVLPSQFS
        PACQYGYKRSSTEVRKRHLGPIHSLSSPPSSPYFRVGPPAPVSGSISASSPLSEVVLSNVQPPDKGSRHAENFERRAPSVLPSQFS
Subjt:  PACQYGYKRSSTEVRKRHLGPIHSLSSPPSSPYFRVGPPAPVSGSISASSPLSEVVLSNVQPPDKGSRHAENFERRAPSVLPSQFS

XP_022934949.1 uncharacterized protein LOC111441963 [Cucurbita moschata] (e value: 5.5e-198; %identity: 80.24)
Query:  EEQPLPVGLSSSELSDRNVGSRCGAG----IRRLIAVRCVFFLLLSAAVFISAFFWLPPFLSDGDWPDRALDSAYRDHEIVASFHAWKPVSFLQNHILEL
        EEQPLPVG+SSSELSD  V SRCG G    IRRLIAVRCVFFLLLSAAVF+SA FWLPPFLS GDWPD+A DS YRDHEIVA F A KPV FL+NHI EL
Subjt:  EEQPLPVGLSSSELSDRNVGSRCGAG----IRRLIAVRCVFFLLLSAAVFISAFFWLPPFLSDGDWPDRALDSAYRDHEIVASFHAWKPVSFLQNHILEL

Query:  EDNIFGEIPVPFVKVVILSLQSLGGLNVTKIVFAVDPDAKYSKIPPTSQSLIKEIFETMFINEPPLTLTAALFGNTSLFEVLKFPGGITIIPPQSAFLLQ
        EDNIFGEIPVPFVKV +LSLQSLGG NVT I+F+VDPDAKYSKIPPTSQSLIKE FET+ IN+PPL L A+LFGNTSLFEVLKFPGGITIIPPQSAFLLQ
Subjt:  EDNIFGEIPVPFVKVVILSLQSLGGLNVTKIVFAVDPDAKYSKIPPTSQSLIKEIFETMFINEPPLTLTAALFGNTSLFEVLKFPGGITIIPPQSAFLLQ

Query:  TAQIYFNFTLNYSIYQIQVNFDDLTSQLRSGLHLSLYENLYVSLSNARGSTVDAPTIIQSSVLMAIGTNSSKKRLKQLAQTITDSHSGNLGLNNTIFGKV
        TAQIYFNFTLNYSIYQIQVNFDDLTSQLRSGL LS YENLYVSLSN RGST+ APTI+QSSVLMAIGTNSS +RLKQLAQTIT+SHSGNLGLNNT+FGKV
Subjt:  TAQIYFNFTLNYSIYQIQVNFDDLTSQLRSGLHLSLYENLYVSLSNARGSTVDAPTIIQSSVLMAIGTNSSKKRLKQLAQTITDSHSGNLGLNNTIFGKV

Query:  KQVRLSSVLNHSLSGGANARSPAPSPLPHS--------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHDASYSPSPGTEEHAHMPHNGVSSAP--GSAPV
        KQVRLSSVLNHSLSGG  ARSP+P+PLPHS        HHHHHHHH HHHHHHHHHHHHHHHHHHHH DA+YSPSPGTEEH + P NG+SSAP  GS+PV
Subjt:  KQVRLSSVLNHSLSGGANARSPAPSPLPHS--------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHDASYSPSPGTEEHAHMPHNGVSSAP--GSAPV

Query:  ESPTPKKRNDEATPPACQYGYKRSSTEVRKR-HLGPIHSLSSPPSSPYFRVGPPAPVSGSISASSPLSEVVLSNVQPPDKGSRHAENFERRAPSVLPSQF
        ESP  KKRN EATPP  +YGYK  S +VRKR HLG I S SSPPSSPY RVG PAPV+ SISASSPL  V LSNVQPP+KG       +R APSVLP QF
Subjt:  ESPTPKKRNDEATPPACQYGYKRSSTEVRKR-HLGPIHSLSSPPSSPYFRVGPPAPVSGSISASSPLSEVVLSNVQPPDKGSRHAENFERRAPSVLPSQF

Query:  S
        S
Subjt:  S

TrEMBL top hits (e value, %identity, alignment)
A0A0A0LHD1 Uncharacterized protein (e value: 3.0e-202; %identity: 80.77)
Query:  EEQPLPVGLSSSELSDRNVGSRCGAG----IRRLIAVRCVFFLLLSAAVFISAFFWLPPFLSDGDWPDRALDSAYRDHEIVASFHAWKPVSFLQNHILEL
        EEQPLPVG SSSELSDRNV +RCG G    IRRLIAVRCVFFLLLSAAVF+SA FWLPPFLS G+WPDR +DSAYRDH+IVASFHA KPV FLQ HI EL
Subjt:  EEQPLPVGLSSSELSDRNVGSRCGAG----IRRLIAVRCVFFLLLSAAVFISAFFWLPPFLSDGDWPDRALDSAYRDHEIVASFHAWKPVSFLQNHILEL

Query:  EDNIFGEIPVPFVKVVILSLQSLGGLNVTKIVFAVDPDAKYSKIPPTSQSLIKEIFETMFINEPPLTLTAALFGNTSLFEVLKFPGGITIIPPQSAFLLQ
        EDNIFGEIP+P VKV ILSLQSLGG NVTKIVFAVD DAKYSKIPPTSQSLIKE FET+ INEPPL L  +LFGNTSLFEVLKFPGGITIIPPQSAFLLQ
Subjt:  EDNIFGEIPVPFVKVVILSLQSLGGLNVTKIVFAVDPDAKYSKIPPTSQSLIKEIFETMFINEPPLTLTAALFGNTSLFEVLKFPGGITIIPPQSAFLLQ

Query:  TAQIYFNFTLNYSIYQIQVNFDDLTSQLRSGLHLSLYENLYVSLSNARGSTVDAPTIIQSSVLMAIGTN--SSKKRLKQLAQTITDSHSGNLGLNNTIFG
        TAQIYFNFTLNYSIYQIQVNFDDL+SQLRSGL LS YENLYVSLSN RGST+DAPT++QSSVLMAIGTN  SSK+RLKQLA TIT+SHSGNLGLNNT+FG
Subjt:  TAQIYFNFTLNYSIYQIQVNFDDLTSQLRSGLHLSLYENLYVSLSNARGSTVDAPTIIQSSVLMAIGTN--SSKKRLKQLAQTITDSHSGNLGLNNTIFG

Query:  KVKQVRLSSVLNHSLSGGANARSPAPSPLPHSHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHDASYSPSPGTEEHAHMPHNGVSSAP--GSAPVESPTPK
        KVKQVRL S LNHSL GG NARSP+P+PLPHS           HHH HHHHHHHHHHHHHH DA+YSPSPGTEEH H P NGVSSAP  GS+P+E PT +
Subjt:  KVKQVRLSSVLNHSLSGGANARSPAPSPLPHSHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHDASYSPSPGTEEHAHMPHNGVSSAP--GSAPVESPTPK

Query:  KRNDEATPPACQYGYKRSSTEVRKRHLGPIHSLSSPPSSPYFRVGPPAPVSGSISASSPLSEVVLSNVQPPDKGSRHAENFERRAPSVLPSQFS
        KRN EATPPA +YGYKRS T++RK +LGPI S SS PSSPY RVG PAPVS SISASSPLS VVLSNVQPP+ GS HAENFER +PSVLP QFS
Subjt:  KRNDEATPPACQYGYKRSSTEVRKRHLGPIHSLSSPPSSPYFRVGPPAPVSGSISASSPLSEVVLSNVQPPDKGSRHAENFERRAPSVLPSQFS

A0A1S3B8E9 uncharacterized protein LOC103487165 (e value: 6.5e-205; %identity: 79.5)
Query:  EEQPLPVGLSSSELSDRNVGSRCGAG----IRRLIAVRCVFFLLLSAAVFISAFFWLPPFLSDGDWPDRALDSAYRDHEIVASFHAWKPVSFLQNHILEL
        EEQPLPVG+SSSELSDRNV +RCG G    IR+LIAVRCVFFLLLSAAVF+SA FWLPPFLS G+WPDR +DSAYRDH+IVASFHAWKPV FLQNHI EL
Subjt:  EEQPLPVGLSSSELSDRNVGSRCGAG----IRRLIAVRCVFFLLLSAAVFISAFFWLPPFLSDGDWPDRALDSAYRDHEIVASFHAWKPVSFLQNHILEL

Query:  EDNIFGEIPVPFVKVVILSLQSLGGLNVTKIVFAVDPDAKYSKIPPTSQSLIKEIFETMFINEPPLTLTAALFGNTSLFEVLKFPGGITIIPPQSAFLLQ
        EDNIFGEIP+P VKV ILSLQSL G NVTKIVFAVD DAKYSKIPPTSQSLIKE FET+ INEPPL L  +LFGNTSLFEVLKFPGGITIIPPQSAFLLQ
Subjt:  EDNIFGEIPVPFVKVVILSLQSLGGLNVTKIVFAVDPDAKYSKIPPTSQSLIKEIFETMFINEPPLTLTAALFGNTSLFEVLKFPGGITIIPPQSAFLLQ

Query:  TAQIYFNFTLNYSIYQIQVNFDDLTSQLRSGLHLSLYENLYVSLSNARGSTVDAPTIIQSSVLMAIGTN--SSKKRLKQLAQTITDSHSGNLGLNNTIFG
        TAQIYFNFTLNYSIYQIQVNFDDL+SQLRSGL LS YENLYVSLSN RGST+DAPT++QSSVLMAIGTN  SSK+RLKQLA TIT+SHSGNLGLNNT+FG
Subjt:  TAQIYFNFTLNYSIYQIQVNFDDLTSQLRSGLHLSLYENLYVSLSNARGSTVDAPTIIQSSVLMAIGTN--SSKKRLKQLAQTITDSHSGNLGLNNTIFG

Query:  KVKQVRLSSVLNHSLSGGANARSPAPSPLPHSHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH----------------------DASYSPSPGTEEHAHM
        KVKQVRL S LNHSL GG NA SP+P+PLPHSHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH                       A+YSPSPGTEEH H 
Subjt:  KVKQVRLSSVLNHSLSGGANARSPAPSPLPHSHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH----------------------DASYSPSPGTEEHAHM

Query:  PHNGVSSAP--GSAPVESPTPKKRNDEATPPACQYGYKRSSTEVRKR-HLGPIHSLSSPPSSPYFRVGPPAPVSGSISASSPLSEVVLSNVQPPDKGSRH
        P NGVSSAP  GS+P+E PT +KRN EATPPA +YGYKRSST++RK+ HLGPI S SS P SPY RVG PAPVS SISASSPLS VVLSNVQPP+ GS H
Subjt:  PHNGVSSAP--GSAPVESPTPKKRNDEATPPACQYGYKRSSTEVRKR-HLGPIHSLSSPPSSPYFRVGPPAPVSGSISASSPLSEVVLSNVQPPDKGSRH

Query:  AENFERRAPSVLPSQFS
        AENFER +PSVLP QFS
Subjt:  AENFERRAPSVLPSQFS

A0A5A7UJM2 Filamentous hemagglutinin (e value: 1.6e-203; %identity: 78.18)
Query:  EEQPLPVGLSSSELSDRNVGSRCGAG----IRRLIAVRCVFFLLLSAAVFISAFFWLPPFLSDGDWPDRALDSAYRDHEIVASFHAWKPVSFLQNHILEL
        EEQPLPVG+SSSELSDRNV +RCG G    IR+LIAVRCVFFLLLSAAVF+SA FWLPPFLS G+WPDR +DSAYRDH+IVASFHAWKPV FLQNHI EL
Subjt:  EEQPLPVGLSSSELSDRNVGSRCGAG----IRRLIAVRCVFFLLLSAAVFISAFFWLPPFLSDGDWPDRALDSAYRDHEIVASFHAWKPVSFLQNHILEL

Query:  EDNIFGEIPVPFVK---------------------------VVILSLQSLGGLNVTKIVFAVDPDAKYSKIPPTSQSLIKEIFETMFINEPPLTLTAALF
        EDNIFGEIP+P VK                           V ILSLQSL G NVTKIVFAVD DAKYSKIPPTSQSLIKE FET+ INEPPL L  +LF
Subjt:  EDNIFGEIPVPFVK---------------------------VVILSLQSLGGLNVTKIVFAVDPDAKYSKIPPTSQSLIKEIFETMFINEPPLTLTAALF

Query:  GNTSLFEVLKFPGGITIIPPQSAFLLQTAQIYFNFTLNYSIYQIQVNFDDLTSQLRSGLHLSLYENLYVSLSNARGSTVDAPTIIQSSVLMAIGTN--SS
        GNTSLFEVLKFPGGITIIPPQSAFLLQTAQIYFNFTLNYSIYQIQVNFDDL+SQLRSGL LS YENLYVSLSN RGST+DAPT++QSSVLMAIGTN  SS
Subjt:  GNTSLFEVLKFPGGITIIPPQSAFLLQTAQIYFNFTLNYSIYQIQVNFDDLTSQLRSGLHLSLYENLYVSLSNARGSTVDAPTIIQSSVLMAIGTN--SS

Query:  KKRLKQLAQTITDSHSGNLGLNNTIFGKVKQVRLSSVLNHSLSGGANARSPAPSPLPHSHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHDASYSPSPG
        K+RLKQLA TIT+SHSGNLGLNNT+FGKVKQVRL S LNHSL GG NA SP+P+PLPHSHHHHHHHHHHHHHHH  HHHHHHHHHHH+HH  A+YSPSPG
Subjt:  KKRLKQLAQTITDSHSGNLGLNNTIFGKVKQVRLSSVLNHSLSGGANARSPAPSPLPHSHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHDASYSPSPG

Query:  TEEHAHMPHNGVSSAP--GSAPVESPTPKKRNDEATPPACQYGYKRSSTEVRKR-HLGPIHSLSSPPSSPYFRVGPPAPVSGSISASSPLSEVVLSNVQP
        TEEH H P NGVSSAP  GS+P+E PT +KRN EATPPA +YGYKRSST++RK+ HLGPI S SS P SPY RVG PAPVS SISASSPLS VVLSNVQP
Subjt:  TEEHAHMPHNGVSSAP--GSAPVESPTPKKRNDEATPPACQYGYKRSSTEVRKR-HLGPIHSLSSPPSSPYFRVGPPAPVSGSISASSPLSEVVLSNVQP

Query:  PDKGSRHAENFERRAPSVLPSQFSCEY
        P+ GS HAENFER +PSVLP QFSCEY
Subjt:  PDKGSRHAENFERRAPSVLPSQFSCEY

A0A6J1D2A3 TSC22 domain family protein 1-like isoform X1 (e value: 4.8e-256; %identity: 98.56)
Query:  EEQPLPVGLSSSELSDRNVGSRCGAGIRRLIAVRCVFFLLLSAAVFISAFFWLPPFLSDGDWPDRALDSAYRDHEIVASFHAWKPVSFLQNHILELEDNI
        EEQPLPVGLSSSELSD NVGSRCGAGIRRLIAVRCVFFLLLSAAVFISAFFWLPPFLSDGDWPDRALDSAYRDHEIVASFHAWKPVSFLQNHILELEDNI
Subjt:  EEQPLPVGLSSSELSDRNVGSRCGAGIRRLIAVRCVFFLLLSAAVFISAFFWLPPFLSDGDWPDRALDSAYRDHEIVASFHAWKPVSFLQNHILELEDNI

Query:  FGEIPVPFVKVVILSLQSLGGLNVTKIVFAVDPDAKYSKIPPTSQSLIKEIFETMFINEPPLTLTAALFGNTSLFEVLKFPGGITIIPPQSAFLLQTAQI
        FGEIPVPFVKVVILSLQSLGGLNVTKIVFAVDPDAKYSKIPPTSQSLIKEIFETMFINEPPLTLTAALFGNTSLFEVLKFPGGITIIPPQSAFLLQTAQI
Subjt:  FGEIPVPFVKVVILSLQSLGGLNVTKIVFAVDPDAKYSKIPPTSQSLIKEIFETMFINEPPLTLTAALFGNTSLFEVLKFPGGITIIPPQSAFLLQTAQI

Query:  YFNFTLNYSIYQIQVNFDDLTSQLRSGLHLSLYENLYVSLSNARGSTVDAPTIIQSSVLMAIGTNSSKKRLKQLAQTITDSHSGNLGLNNTIFGKVKQVR
        YFNFTLNYSIYQIQVNFDDLTSQLRSGLHLSLYENLYVSLSNARGSTVDAPTIIQSSVLMAIGTNSSKKRLKQLAQTITDSHSGNLGLNNTIFGKVKQVR
Subjt:  YFNFTLNYSIYQIQVNFDDLTSQLRSGLHLSLYENLYVSLSNARGSTVDAPTIIQSSVLMAIGTNSSKKRLKQLAQTITDSHSGNLGLNNTIFGKVKQVR

Query:  LSSVLNHSLSGGANARSPAPSPLPHSHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHDASYSPSPGTEEHAHMPHNGVSSAPGSAPVESPTPKKRNDEATP
        LSSVLNHSLSGGANARSPAPSPLPHS      HHHHHHHHHHHHHHHHHHHHHHHHDASYSPSPGTEEHAHMPHNGVSSAPGSAPVESPTPKKRNDEATP
Subjt:  LSSVLNHSLSGGANARSPAPSPLPHSHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHDASYSPSPGTEEHAHMPHNGVSSAPGSAPVESPTPKKRNDEATP

Query:  PACQYGYKRSSTEVRKRHLGPIHSLSSPPSSPYFRVGPPAPVSGSISASSPLSEVVLSNVQPPDKGSRHAENFERRAPSVLPSQFS
        PACQYGYKRSSTEVRKRHLGPIHSLSSPPSSPYFRVGPPAPVSGSISASSPLSEVVLSNVQPPDKGSRHAENFERRAPSVLPSQFS
Subjt:  PACQYGYKRSSTEVRKRHLGPIHSLSSPPSSPYFRVGPPAPVSGSISASSPLSEVVLSNVQPPDKGSRHAENFERRAPSVLPSQFS

A0A6J1F409 uncharacterized protein LOC111441963 (e value: 2.7e-198; %identity: 80.24)
Query:  EEQPLPVGLSSSELSDRNVGSRCGAG----IRRLIAVRCVFFLLLSAAVFISAFFWLPPFLSDGDWPDRALDSAYRDHEIVASFHAWKPVSFLQNHILEL
        EEQPLPVG+SSSELSD  V SRCG G    IRRLIAVRCVFFLLLSAAVF+SA FWLPPFLS GDWPD+A DS YRDHEIVA F A KPV FL+NHI EL
Subjt:  EEQPLPVGLSSSELSDRNVGSRCGAG----IRRLIAVRCVFFLLLSAAVFISAFFWLPPFLSDGDWPDRALDSAYRDHEIVASFHAWKPVSFLQNHILEL

Query:  EDNIFGEIPVPFVKVVILSLQSLGGLNVTKIVFAVDPDAKYSKIPPTSQSLIKEIFETMFINEPPLTLTAALFGNTSLFEVLKFPGGITIIPPQSAFLLQ
        EDNIFGEIPVPFVKV +LSLQSLGG NVT I+F+VDPDAKYSKIPPTSQSLIKE FET+ IN+PPL L A+LFGNTSLFEVLKFPGGITIIPPQSAFLLQ
Subjt:  EDNIFGEIPVPFVKVVILSLQSLGGLNVTKIVFAVDPDAKYSKIPPTSQSLIKEIFETMFINEPPLTLTAALFGNTSLFEVLKFPGGITIIPPQSAFLLQ

Query:  TAQIYFNFTLNYSIYQIQVNFDDLTSQLRSGLHLSLYENLYVSLSNARGSTVDAPTIIQSSVLMAIGTNSSKKRLKQLAQTITDSHSGNLGLNNTIFGKV
        TAQIYFNFTLNYSIYQIQVNFDDLTSQLRSGL LS YENLYVSLSN RGST+ APTI+QSSVLMAIGTNSS +RLKQLAQTIT+SHSGNLGLNNT+FGKV
Subjt:  TAQIYFNFTLNYSIYQIQVNFDDLTSQLRSGLHLSLYENLYVSLSNARGSTVDAPTIIQSSVLMAIGTNSSKKRLKQLAQTITDSHSGNLGLNNTIFGKV

Query:  KQVRLSSVLNHSLSGGANARSPAPSPLPHS--------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHDASYSPSPGTEEHAHMPHNGVSSAP--GSAPV
        KQVRLSSVLNHSLSGG  ARSP+P+PLPHS        HHHHHHHH HHHHHHHHHHHHHHHHHHHH DA+YSPSPGTEEH + P NG+SSAP  GS+PV
Subjt:  KQVRLSSVLNHSLSGGANARSPAPSPLPHS--------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHDASYSPSPGTEEHAHMPHNGVSSAP--GSAPV

Query:  ESPTPKKRNDEATPPACQYGYKRSSTEVRKR-HLGPIHSLSSPPSSPYFRVGPPAPVSGSISASSPLSEVVLSNVQPPDKGSRHAENFERRAPSVLPSQF
        ESP  KKRN EATPP  +YGYK  S +VRKR HLG I S SSPPSSPY RVG PAPV+ SISASSPL  V LSNVQPP+KG       +R APSVLP QF
Subjt:  ESPTPKKRNDEATPPACQYGYKRSSTEVRKR-HLGPIHSLSSPPSSPYFRVGPPAPVSGSISASSPLSEVVLSNVQPPDKGSRHAENFERRAPSVLPSQF

Query:  S
        S
Subjt:  S

SwissProt top hits (e value, %identity, alignment)
No hits found

Arabidopsis top hits (e value, %identity, alignment)
AT1G10790.1 BEST Arabidopsis thaliana protein match is: hydroxyproline-rich glycoprotein family protein (TAIR:AT3G56590.2) (e value: 3.6e-38; %identity: 33.97)
Query:  LSSSELSDRNVGSRCGAGIRRLIAVRCVFFLLLSAAVFISAFFWLPPFLSDGDWPDRALDSAYRDHEIVASFHAWKPVSFLQNHILELEDNIFGEIPVP-
        L + E S R+ G  C +   RL+ +RC+  L+LS A+ +SA FWL P  S  ++  +A  +   +  + ASF   KPVS +  H  ++E +I   I +  
Subjt:  LSSSELSDRNVGSRCGAGIRRLIAVRCVFFLLLSAAVFISAFFWLPPFLSDGDWPDRALDSAYRDHEIVASFHAWKPVSFLQNHILELEDNIFGEIPVP-

Query:  FVKVVILSLQSLGGLNVTKIVFAVDPDAKYSKIPPTSQSLIKEIFETMFINEPPLTLTAALFGNTSLFEVLKFPGGITIIPPQSAFLLQTAQIYFNFTLN
          KV +LSL   G  N T + FAV P     +I   S SL++  F  +F     L LT + FG  + F+VLKFPGGIT+ P + A +   A + F+ T+ 
Subjt:  FVKVVILSLQSLGGLNVTKIVFAVDPDAKYSKIPPTSQSLIKEIFETMFINEPPLTLTAALFGNTSLFEVLKFPGGITIIPPQSAFLLQTAQIYFNFTLN

Query:  YSIYQIQVNFDDLTSQLRSGLHLSLYENLYVSLSNARGSTVDAPTIIQSSVLMAIGTNSSKKRLKQLAQTITDSHSGNLGLNNTIFGKVKQVRLSSVLNH
         SI  +Q   D L       L L  YE+++  L+N +GST+  P   Q  V   +      +RL    Q I  S + NLGL+  +FG+VK +  S+ L+ 
Subjt:  YSIYQIQVNFDDLTSQLRSGLHLSLYENLYVSLSNARGSTVDAPTIIQSSVLMAIGTNSSKKRLKQLAQTITDSHSGNLGLNNTIFGKVKQVRLSSVLNH

Query:  SLSGGANARSPAPSP
         +       +PAP+P
Subjt:  SLSGGANARSPAPSP

AT3G10810.1 zinc finger (C3HC4-type RING finger) family protein (e value: 3.0e-93; %identity: 47.58)
Query:  GLSSSELSDRNVGSRCGAGIRRLIAVRCVFFLLLSAAVFISAFFWLPPFLSDGDWPDRALDSAYRDHEIVASFHAWKPVSFLQNHILELEDNIFGEIPVP
        G ++ + + RN    C   I   +  +C+F LLLS A+F+SA F L PF  D +  D  LD  +R H IVASF   +  SFL  + L+L+++IF E+   
Subjt:  GLSSSELSDRNVGSRCGAGIRRLIAVRCVFFLLLSAAVFISAFFWLPPFLSDGDWPDRALDSAYRDHEIVASFHAWKPVSFLQNHILELEDNIFGEIPVP

Query:  FVKVVILSLQSLGGLNVTKIVFAVDPDAKYSKIPPTSQSLIKEIFETMFINEPPLTLTAALFGNTSLFEVLKFPGGITIIPPQSAFLLQTAQIYFNFTLN
         +KV IL+++    LN+TK+VF +DPD  Y +I P S S IKE+FE++ IN+  L LT +LFG T LFEVLKFPGGIT+IPPQSAF LQ  +I FNFTLN
Subjt:  FVKVVILSLQSLGGLNVTKIVFAVDPDAKYSKIPPTSQSLIKEIFETMFINEPPLTLTAALFGNTSLFEVLKFPGGITIIPPQSAFLLQTAQIYFNFTLN

Query:  YSIYQIQVNFDDLTSQLRSGLHLSLYENLYVSLSNARGSTVDAPTIIQSSVLMAIGTNSSKKRLKQLAQTITDSHSGNLGLNNTIFGKVKQVRLSSVLNH
        YSI+QIQ+NF+ L SQL++GL+L+ YENLYVSLSN+ GSTV  PT + SSVL+ +GT++S  RLKQL  TIT S S NLGLNNTIFGKVKQVRLSS L +
Subjt:  YSIYQIQVNFDDLTSQLRSGLHLSLYENLYVSLSNARGSTVDAPTIIQSSVLMAIGTNSSKKRLKQLAQTITDSHSGNLGLNNTIFGKVKQVRLSSVLNH

Query:  SLSGGANARSPAPSPLPHSHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHDASYSPSPGTEEHAHMPHNGVSSAPGSAPVESPTPKKRNDEA--TPPACQY
          S  ++ +SP+PSP PHS HHHHHHHHHHHHHHHHH+HHHHHHH      + SP                 AP  +PV SP P +    A   PP C  
Subjt:  SLSGGANARSPAPSPLPHSHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHDASYSPSPGTEEHAHMPHNGVSSAPGSAPVESPTPKKRNDEA--TPPACQY

Query:  GYKRSSTEVRKRHLGPIHSLSSPPSS---PYFRVGPPAPVSGS----ISASSPLSEVVLSN-VQPPDKGSRHAENFERRAPSVLPSQFSCEYITSL
        G +    E R +      S  +P  S   P+ ++  PAP+S +    +  S+PL  VV ++  QPP    R     E   P    S  + E + ++
Subjt:  GYKRSSTEVRKRHLGPIHSLSSPPSS---PYFRVGPPAPVSGS----ISASSPLSEVVLSN-VQPPDKGSRHAENFERRAPSVLPSQFSCEYITSL

AT3G56590.1 hydroxyproline-rich glycoprotein family protein (e value: 4.9e-96; %identity: 46.32)
Query:  EEQPLPVGLSSSELSDRNVGSR------CGAGIRRLIAVRCVFFLLLSAAVFISAFFWLPPFLSDGDWPDRALDSAYRDHEIVASFHAWKPVSFLQNHIL
        EEQ LPV  S    S RN G        C   I    ++RCV  L  SAAVF+SA FWLPPFL   D  D  LD  ++DH IVASF   KP+SF++++++
Subjt:  EEQPLPVGLSSSELSDRNVGSR------CGAGIRRLIAVRCVFFLLLSAAVFISAFFWLPPFLSDGDWPDRALDSAYRDHEIVASFHAWKPVSFLQNHIL

Query:  ELEDNIFGEIPVPFVKVVILSLQSLGGLNVTKIVFAVDPDAKYSKIPPTSQSLIKEIFETMFINEPPLTLTAALFGNTSLFEVLKFPGGITIIPPQSAFL
        +LE++I  EI  P  KVV+L+L+ LG LN T ++FA+DP+ + SKIP   +SLIK  FET+   +    LT +LFG    FEVLKFPGGIT+IPPQ  F 
Subjt:  ELEDNIFGEIPVPFVKVVILSLQSLGGLNVTKIVFAVDPDAKYSKIPPTSQSLIKEIFETMFINEPPLTLTAALFGNTSLFEVLKFPGGITIIPPQSAFL

Query:  LQTAQIYFNFTLNYSIYQIQVNFDDLTSQLRSGLHLSLYENLYVSLSNARGSTVDAPTIIQSSVLMAIGTNSSKKRLKQLAQTITDSHSGNLGLNNTIFG
        LQ AQ+ FNFTLN+SIYQIQ NF++L SQL+ G++L+ YENLY++LSN+RGSTV  PTI+ SSVL+  G++S   RLKQLAQTIT SHS NLGLN+T+FG
Subjt:  LQTAQIYFNFTLNYSIYQIQVNFDDLTSQLRSGLHLSLYENLYVSLSNARGSTVDAPTIIQSSVLMAIGTNSSKKRLKQLAQTITDSHSGNLGLNNTIFG

Query:  KVKQVRLSSVLNHSLSGGANARSPAPSPLPHSHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHDASYSPSPGTEEHAHMPHNGVSSAPGSAPVE-SPTPKK
        KVKQVRLSS+L HS    A + +P+PSP P +H + HHH HHHHHHH                   +P P        P  G   AP SAP + SP P +
Subjt:  KVKQVRLSSVLNHSLSGGANARSPAPSPLPHSHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHDASYSPSPGTEEHAHMPHNGVSSAPGSAPVE-SPTPKK

Query:  RNDEATPPACQYGYKRSSTEVRKRHLGPIHSLSSPPSSPYFRVGPPAPVSG-----SISASSPLSEVVLSNVQPPDKGSRHAENFERRAPSVLPSQFSCE
               P C Y  +R        H    H+    P+    +  PPAP        +I  SSPL  VV +++ PP K S  +E    ++PS  P+     
Subjt:  RNDEATPPACQYGYKRSSTEVRKRHLGPIHSLSSPPSSPYFRVGPPAPVSG-----SISASSPLSEVVLSNVQPPDKGSRHAENFERRAPSVLPSQFSCE

Query:  YIT
         +T
Subjt:  YIT

AT3G56590.2 hydroxyproline-rich glycoprotein family protein (e value: 3.8e-96; %identity: 46.79)
Query:  EEQPLPVGLSSSELSDRNVGSR------CGAGIRRLIAVRCVFFLLLSAAVFISAFFWLPPFLSDGDWPDRALDSAYRDHEIVASFHAWKPVSFLQNHIL
        EEQ LPV  S    S RN G        C   I    ++RCV  L  SAAVF+SA FWLPPFL   D  D  LD  ++DH IVASF   KP+SF++++++
Subjt:  EEQPLPVGLSSSELSDRNVGSR------CGAGIRRLIAVRCVFFLLLSAAVFISAFFWLPPFLSDGDWPDRALDSAYRDHEIVASFHAWKPVSFLQNHIL

Query:  ELEDNIFGEIPVPFVKVVILSLQSLGGLNVTKIVFAVDPDAKYSKIPPTSQSLIKEIFETMFINEPPLTLTAALFGNTSLFEVLKFPGGITIIPPQSAFL
        +LE++I  EI  P  KVV+L+L+ LG LN T ++FA+DP+ + SKIP   +SLIK  FET+   +    LT +LFG    FEVLKFPGGIT+IPPQ  F 
Subjt:  ELEDNIFGEIPVPFVKVVILSLQSLGGLNVTKIVFAVDPDAKYSKIPPTSQSLIKEIFETMFINEPPLTLTAALFGNTSLFEVLKFPGGITIIPPQSAFL

Query:  LQTAQIYFNFTLNYSIYQIQVNFDDLTSQLRSGLHLSLYENLYVSLSNARGSTVDAPTIIQSSVLMAIGTNSSKKRLKQLAQTITDSHSGNLGLNNTIFG
        LQ AQ+ FNFTLN+SIYQIQ NF++L SQL+ G++L+ YENLY++LSN+RGSTV  PTI+ SSVL+  G++S   RLKQLAQTIT SHS NLGLN+T+FG
Subjt:  LQTAQIYFNFTLNYSIYQIQVNFDDLTSQLRSGLHLSLYENLYVSLSNARGSTVDAPTIIQSSVLMAIGTNSSKKRLKQLAQTITDSHSGNLGLNNTIFG

Query:  KVKQVRLSSVLNHSLSGGANARSPAPSPLPHSHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHDASYSPSPGTEEHAHMPHNGVSSAPGSAPVE-SPTPKK
        KVKQVRLSS+L HS    A + +P+PSP P +H + HHH HHHHHHH                   +P P        P  G   AP SAP + SP P +
Subjt:  KVKQVRLSSVLNHSLSGGANARSPAPSPLPHSHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHDASYSPSPGTEEHAHMPHNGVSSAPGSAPVE-SPTPKK

Query:  RNDEATPPACQYGYKRSSTEVRKRHLGPIHSLSSPPSSPYFRVGPPAPVSG-----SISASSPLSEVVLSNVQPPDKGSRHAENFERRAPSVLPSQFS
               P C Y  +R        H    H+    P+    +  PPAP        +I  SSPL  VV +++ PP K S  +E    ++PS  P+  S
Subjt:  RNDEATPPACQYGYKRSSTEVRKRHLGPIHSLSSPPSSPYFRVGPPAPVSG-----SISASSPLSEVVLSNVQPPDKGSRHAENFERRAPSVLPSQFS
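
The alignments above appear to follow the BLAST pairwise layout, where the unlabeled middle line marks each column: a letter for an identical residue, `+` for a conservative substitution, and a blank for a mismatch or gap. A minimal sketch of tallying those marks for one block follows; the function name `midline_stats` is hypothetical, and the example uses only the last block of the first GenBank hit, so its percentage is not expected to match the whole-alignment %identity listed for that hit.

```python
def midline_stats(query: str, midline: str) -> dict[str, int]:
    """Count identities and positives in one alignment block.

    The block length is taken from the query line, because trailing
    blanks on the midline are easily lost in plain text.
    """
    identities = sum(1 for ch in midline if ch.isalpha())
    positives = identities + midline.count("+")
    return {"identities": identities, "positives": positives, "length": len(query)}


# Last block of the first GenBank hit, with the label and indentation removed.
query = "PDKGSRHAENFERRAPSVLPSQFSCEY"
midline = "P+ GS HAENFER +PSVLP QFSCEY"

stats = midline_stats(query, midline)
percent_identity = 100 * stats["identities"] / stats["length"]
print(stats, round(percent_identity, 2))
```

Summing the counts over every block of a hit, rather than a single block, would be needed before comparing against the reported %identity.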


Sequences
CDS sequence
GAAGAGCAGCCGCTGCCGGTTGGGTTGAGCTCCTCCGAGCTCTCTGATCGGAATGTGGGAAGCCGGTGCGGCGCTGGGATTCGTAGACTGATTGCGGTGAGATGTGTCTT
CTTCCTTCTACTGTCGGCGGCTGTGTTCATTTCTGCTTTCTTTTGGCTGCCGCCGTTCCTCTCCGATGGAGATTGGCCCGATCGGGCTCTTGATTCTGCTTATAGAGATC
ATGAAATAGTAGCAAGTTTTCATGCTTGGAAGCCAGTTTCGTTTCTGCAAAACCATATTTTGGAGCTTGAGGATAATATTTTTGGAGAAATACCTGTACCTTTTGTCAAG
GTGGTCATCCTCTCGCTGCAATCATTAGGTGGACTGAACGTAACAAAGATTGTTTTTGCAGTAGATCCTGACGCCAAATATTCAAAAATACCCCCAACTTCTCAAAGTTT
AATCAAGGAGATCTTTGAAACAATGTTTATAAATGAACCACCTCTCACATTGACTGCAGCATTATTTGGCAATACATCCTTATTTGAGGTGTTGAAATTTCCTGGAGGAA
TAACTATTATTCCTCCTCAGAGTGCATTTCTTCTGCAGACAGCACAGATCTATTTCAACTTTACGTTAAATTATTCTATTTATCAAATTCAAGTGAATTTCGATGATCTT
ACAAGCCAGCTGAGGTCAGGATTACATCTATCTCTTTATGAGAATTTATATGTTAGTCTATCGAATGCAAGAGGTTCAACAGTGGATGCCCCCACCATTATTCAATCATC
TGTTCTAATGGCAATTGGAACTAATTCATCGAAAAAAAGACTAAAACAGCTGGCTCAAACCATCACAGATTCCCATTCAGGAAATCTTGGCCTGAATAACACTATATTTG
GTAAGGTCAAGCAAGTTCGTCTTTCATCTGTCCTGAACCACTCTCTTAGTGGTGGTGCAAATGCACGGTCACCTGCACCTTCTCCTCTGCCTCATTCTCACCACCACCAC
CATCACCACCATCACCACCATCATCACCATCACCACCACCACCACCATCACCACCACCACCACCATCACCACCATCATGATGCATCATATTCACCGAGTCCTGGAACAGA
GGAGCACGCACATATGCCACATAATGGGGTGTCGTCTGCTCCTGGATCTGCCCCAGTGGAAAGTCCCACTCCGAAGAAGAGAAATGATGAAGCAACTCCGCCTGCTTGTC
AATATGGATATAAACGGTCTTCAACAGAAGTCAGAAAACGTCATTTAGGCCCTATTCATTCACTAAGCAGTCCTCCATCATCGCCATACTTTCGAGTAGGTCCACCAGCA
CCTGTTTCTGGCTCAATTTCTGCTTCAAGTCCACTGTCAGAGGTAGTTTTATCTAATGTTCAGCCTCCTGATAAAGGAAGCAGACACGCCGAAAATTTTGAAAGAAGAGC
CCCCTCAGTTTTACCTTCACAATTTTCTTGTGAGTATATTACGTCCCTCATTCTAACATTTTCTCAGTCGTTTACTTGT
mRNA sequence
GAAGAGCAGCCGCTGCCGGTTGGGTTGAGCTCCTCCGAGCTCTCTGATCGGAATGTGGGAAGCCGGTGCGGCGCTGGGATTCGTAGACTGATTGCGGTGAGATGTGTCTT
CTTCCTTCTACTGTCGGCGGCTGTGTTCATTTCTGCTTTCTTTTGGCTGCCGCCGTTCCTCTCCGATGGAGATTGGCCCGATCGGGCTCTTGATTCTGCTTATAGAGATC
ATGAAATAGTAGCAAGTTTTCATGCTTGGAAGCCAGTTTCGTTTCTGCAAAACCATATTTTGGAGCTTGAGGATAATATTTTTGGAGAAATACCTGTACCTTTTGTCAAG
GTGGTCATCCTCTCGCTGCAATCATTAGGTGGACTGAACGTAACAAAGATTGTTTTTGCAGTAGATCCTGACGCCAAATATTCAAAAATACCCCCAACTTCTCAAAGTTT
AATCAAGGAGATCTTTGAAACAATGTTTATAAATGAACCACCTCTCACATTGACTGCAGCATTATTTGGCAATACATCCTTATTTGAGGTGTTGAAATTTCCTGGAGGAA
TAACTATTATTCCTCCTCAGAGTGCATTTCTTCTGCAGACAGCACAGATCTATTTCAACTTTACGTTAAATTATTCTATTTATCAAATTCAAGTGAATTTCGATGATCTT
ACAAGCCAGCTGAGGTCAGGATTACATCTATCTCTTTATGAGAATTTATATGTTAGTCTATCGAATGCAAGAGGTTCAACAGTGGATGCCCCCACCATTATTCAATCATC
TGTTCTAATGGCAATTGGAACTAATTCATCGAAAAAAAGACTAAAACAGCTGGCTCAAACCATCACAGATTCCCATTCAGGAAATCTTGGCCTGAATAACACTATATTTG
GTAAGGTCAAGCAAGTTCGTCTTTCATCTGTCCTGAACCACTCTCTTAGTGGTGGTGCAAATGCACGGTCACCTGCACCTTCTCCTCTGCCTCATTCTCACCACCACCAC
CATCACCACCATCACCACCATCATCACCATCACCACCACCACCACCATCACCACCACCACCACCATCACCACCATCATGATGCATCATATTCACCGAGTCCTGGAACAGA
GGAGCACGCACATATGCCACATAATGGGGTGTCGTCTGCTCCTGGATCTGCCCCAGTGGAAAGTCCCACTCCGAAGAAGAGAAATGATGAAGCAACTCCGCCTGCTTGTC
AATATGGATATAAACGGTCTTCAACAGAAGTCAGAAAACGTCATTTAGGCCCTATTCATTCACTAAGCAGTCCTCCATCATCGCCATACTTTCGAGTAGGTCCACCAGCA
CCTGTTTCTGGCTCAATTTCTGCTTCAAGTCCACTGTCAGAGGTAGTTTTATCTAATGTTCAGCCTCCTGATAAAGGAAGCAGACACGCCGAAAATTTTGAAAGAAGAGC
CCCCTCAGTTTTACCTTCACAATTTTCTTGTGAGTATATTACGTCCCTCATTCTAACATTTTCTCAGTCGTTTACTTGT
Protein sequence
EEQPLPVGLSSSELSDRNVGSRCGAGIRRLIAVRCVFFLLLSAAVFISAFFWLPPFLSDGDWPDRALDSAYRDHEIVASFHAWKPVSFLQNHILELEDNIFGEIPVPFVK
VVILSLQSLGGLNVTKIVFAVDPDAKYSKIPPTSQSLIKEIFETMFINEPPLTLTAALFGNTSLFEVLKFPGGITIIPPQSAFLLQTAQIYFNFTLNYSIYQIQVNFDDL
TSQLRSGLHLSLYENLYVSLSNARGSTVDAPTIIQSSVLMAIGTNSSKKRLKQLAQTITDSHSGNLGLNNTIFGKVKQVRLSSVLNHSLSGGANARSPAPSPLPHSHHHH
HHHHHHHHHHHHHHHHHHHHHHHHHHDASYSPSPGTEEHAHMPHNGVSSAPGSAPVESPTPKKRNDEATPPACQYGYKRSSTEVRKRHLGPIHSLSSPPSSPYFRVGPPA
PVSGSISASSPLSEVVLSNVQPPDKGSRHAENFERRAPSVLPSQFSCEYITSLILTFSQSFTC
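
The protein sequence above begins with the same residues obtained by translating the start of the CDS. A minimal sketch of that check follows, assuming the standard genetic code and a reading frame that starts at the first base of the CDS; the `translate` helper is hypothetical and written out here to avoid depending on any external library.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence in frame 0; unknown codons become 'X'."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, usable, 3))


# First 42 bases of the CDS sequence listed above.
cds_start = "GAAGAGCAGCCGCTGCCGGTTGGGTTGAGCTCCTCCGAGCTC"
print(translate(cds_start))
```

The same function can be applied to the full CDS once its lines are joined into a single string.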