; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

PI0022971 (gene) of Melon (PI 482460) v1 genome

Gene IDPI0022971
OrganismCucumis metuliferus PI 482460 (Melon (PI 482460) v1)
DescriptionZinc finger family protein, putative isoform 1
Genome locationchr08:6885900..6891204
RNA-Seq ExpressionPI0022971
SyntenyPI0022971
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0016021 - integral component of membrane (cellular component)
GO:0004175 - endopeptidase activity (molecular function)
GO:0008236 - serine-type peptidase activity (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KGN54878.2 hypothetical protein Csa_012907 [Cucumis sativus]1.4e-27096.44Show/hide
Query:  MGKNDGEQPLPSAIDSRPSGLVADGRCCCGCVSIRRLIGFRCIFILLLSVALFVSAVFWLPPFLHYADQKDLGLNPSYRGHDIVATFNVERSVSLLEDNF
        MGKNDGEQPLPSAIDSRPSGLVADGRCCCGCVSIRRLIGFRCIFILLLSVALFVSAVFWLPPFLHYADQKDL LNPSYRGHDIVATFNVERSVSLLEDNF
Subjt:  MGKNDGEQPLPSAIDSRPSGLVADGRCCCGCVSIRRLIGFRCIFILLLSVALFVSAVFWLPPFLHYADQKDLGLNPSYRGHDIVATFNVERSVSLLEDNF

Query:  EQLRTDIFEEFPIPSIKVDILSLEPLSGSNRTKVVFSLDPDTDDSEISSTYLSLIRSIITSLVTNQFLSITKSTFGEAYSFEVLKFPGGITIIPPQSAFL
        +QLRTDIFEEFPIPSIKV+ILSLEPLSGSNRTKVVFSLDPDTDDSEISSTYLSLIRSIITSLVTNQFLSITKSTFGEAYSFEVLKFPGGITIIPPQSAFL
Subjt:  EQLRTDIFEEFPIPSIKVDILSLEPLSGSNRTKVVFSLDPDTDDSEISSTYLSLIRSIITSLVTNQFLSITKSTFGEAYSFEVLKFPGGITIIPPQSAFL

Query:  LQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNAEGSTVTAPTIVQTSVLLEVGNTPSMRRLKQLAQTISGSNSSNLGLNNTEFG
        LQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNAEGSTVT PTIVQTSVLLEVGNTPSMRRLKQLAQTISGSNSSNLGLNNTEFG
Subjt:  LQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNAEGSTVTAPTIVQTSVLLEVGNTPSMRRLKQLAQTISGSNSSNLGLNNTEFG

Query:  KVKQVRLSSILKHSLNGSDGNGPVRSPSPAPTHQPHNHHHPPTHHHHHHHTSLTPAISPAPATEKGAPEYGSPAPERSAASPKRSYTAKPPGCQYRYKRK
        KVKQVRLSSILKHSLNGSDGNGPVRSPSPAPT QPHN HHPPTHHHHHHHT LTPAISPAPATEKGAPEYGSPAPER+AASPKRSYTAKPPGCQYRYKRK
Subjt:  KVKQVRLSSILKHSLNGSDGNGPVRSPSPAPTHQPHNHHHPPTHHHHHHHTSLTPAISPAPATEKGAPEYGSPAPERSAASPKRSYTAKPPGCQYRYKRK

Query:  SGRKEGKQSHLTPFASPNISPDHSAASPSPQHQINPPAAPVSPAPALTPLPNVIYAHVQPPSKSDSNHPEKSMTNPSVAPSPSGAVRCHMITQWGFTLFL
        SGRKEGKQSHLTP ASPNISPDHSAASPSPQHQINPPAAPVSPAPALTPLPNVIYAHVQPPSKSDSNHP     NPS+APSPSGA RCHMITQWGFTLFL
Subjt:  SGRKEGKQSHLTPFASPNISPDHSAASPSPQHQINPPAAPVSPAPALTPLPNVIYAHVQPPSKSDSNHPEKSMTNPSVAPSPSGAVRCHMITQWGFTLFL

Query:  FLARHM
         LA HM
Subjt:  FLARHM

XP_004144318.1 uncharacterized protein LOC101216010 isoform X1 [Cucumis sativus]1.4e-27096.44Show/hide
Query:  MGKNDGEQPLPSAIDSRPSGLVADGRCCCGCVSIRRLIGFRCIFILLLSVALFVSAVFWLPPFLHYADQKDLGLNPSYRGHDIVATFNVERSVSLLEDNF
        MGKNDGEQPLPSAIDSRPSGLVADGRCCCGCVSIRRLIGFRCIFILLLSVALFVSAVFWLPPFLHYADQKDL LNPSYRGHDIVATFNVERSVSLLEDNF
Subjt:  MGKNDGEQPLPSAIDSRPSGLVADGRCCCGCVSIRRLIGFRCIFILLLSVALFVSAVFWLPPFLHYADQKDLGLNPSYRGHDIVATFNVERSVSLLEDNF

Query:  EQLRTDIFEEFPIPSIKVDILSLEPLSGSNRTKVVFSLDPDTDDSEISSTYLSLIRSIITSLVTNQFLSITKSTFGEAYSFEVLKFPGGITIIPPQSAFL
        +QLRTDIFEEFPIPSIKV+ILSLEPLSGSNRTKVVFSLDPDTDDSEISSTYLSLIRSIITSLVTNQFLSITKSTFGEAYSFEVLKFPGGITIIPPQSAFL
Subjt:  EQLRTDIFEEFPIPSIKVDILSLEPLSGSNRTKVVFSLDPDTDDSEISSTYLSLIRSIITSLVTNQFLSITKSTFGEAYSFEVLKFPGGITIIPPQSAFL

Query:  LQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNAEGSTVTAPTIVQTSVLLEVGNTPSMRRLKQLAQTISGSNSSNLGLNNTEFG
        LQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNAEGSTVT PTIVQTSVLLEVGNTPSMRRLKQLAQTISGSNSSNLGLNNTEFG
Subjt:  LQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNAEGSTVTAPTIVQTSVLLEVGNTPSMRRLKQLAQTISGSNSSNLGLNNTEFG

Query:  KVKQVRLSSILKHSLNGSDGNGPVRSPSPAPTHQPHNHHHPPTHHHHHHHTSLTPAISPAPATEKGAPEYGSPAPERSAASPKRSYTAKPPGCQYRYKRK
        KVKQVRLSSILKHSLNGSDGNGPVRSPSPAPT QPHN HHPPTHHHHHHHT LTPAISPAPATEKGAPEYGSPAPER+AASPKRSYTAKPPGCQYRYKRK
Subjt:  KVKQVRLSSILKHSLNGSDGNGPVRSPSPAPTHQPHNHHHPPTHHHHHHHTSLTPAISPAPATEKGAPEYGSPAPERSAASPKRSYTAKPPGCQYRYKRK

Query:  SGRKEGKQSHLTPFASPNISPDHSAASPSPQHQINPPAAPVSPAPALTPLPNVIYAHVQPPSKSDSNHPEKSMTNPSVAPSPSGAVRCHMITQWGFTLFL
        SGRKEGKQSHLTP ASPNISPDHSAASPSPQHQINPPAAPVSPAPALTPLPNVIYAHVQPPSKSDSNHP     NPS+APSPSGA RCHMITQWGFTLFL
Subjt:  SGRKEGKQSHLTPFASPNISPDHSAASPSPQHQINPPAAPVSPAPALTPLPNVIYAHVQPPSKSDSNHPEKSMTNPSVAPSPSGAVRCHMITQWGFTLFL

Query:  FLARHM
         LA HM
Subjt:  FLARHM

XP_008455751.1 PREDICTED: uncharacterized protein LOC103495852 [Cucumis melo]1.2e-26695.26Show/hide
Query:  MGKNDGEQPLPSAIDSRPSGLVADGRCCCGCVSIRRLIGFRCIFILLLSVALFVSAVFWLPPFLHYADQKDLGLNPSYRGHDIVATFNVERSVSLLEDNF
        MGKNDGEQPLPSAIDSRPSGLVADGRCC GCVSIRRLIGFRCIFILLLSVALFVSAV WLPPF+HYADQKDLGLNPSYRGHDIVATFNVERSVSLLEDNF
Subjt:  MGKNDGEQPLPSAIDSRPSGLVADGRCCCGCVSIRRLIGFRCIFILLLSVALFVSAVFWLPPFLHYADQKDLGLNPSYRGHDIVATFNVERSVSLLEDNF

Query:  EQLRTDIFEEFPIPSIKVDILSLEPLSGSNRTKVVFSLDPDTDDSEISSTYLSLIRSIITSLVTNQFLSITKSTFGEAYSFEVLKFPGGITIIPPQSAFL
        +QLRTDIFEEFPIPSIKV+ILSLEPLSGSNRTKVVFS+DPDTDDSEISSTYLSLIRSIITSLVTNQFLSITKSTFGEAYSFEVLKFPGGITIIPPQSAFL
Subjt:  EQLRTDIFEEFPIPSIKVDILSLEPLSGSNRTKVVFSLDPDTDDSEISSTYLSLIRSIITSLVTNQFLSITKSTFGEAYSFEVLKFPGGITIIPPQSAFL

Query:  LQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNAEGSTVTAPTIVQTSVLLEVGNTPSMRRLKQLAQTISGSNSSNLGLNNTEFG
        LQKVQILFNFTLNFSIHQIQVHFSELTSQL+AGLRLAPYEILYIKLWNAEGSTVTAPTIVQTSVLLEVGNTPSMRRLKQLAQTISGSNSSNLGLNN EFG
Subjt:  LQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNAEGSTVTAPTIVQTSVLLEVGNTPSMRRLKQLAQTISGSNSSNLGLNNTEFG

Query:  KVKQVRLSSILKHSLNGSDGNGPVRSPSPAPTHQPHNHHHPPTHHHHHHHTSLTPAISPAPATEKGAPEYGSPAPERSAASPKRSYTAKPPGCQYRYKRK
        KVKQVRLSSILKHSLNGS+GNGPVRSPSPAPT QPHNHHHPPTHHHHHHHT L  AISPAPATEKGAPEYGSPAPERSAASP+RSYTA+PPGCQYRYKRK
Subjt:  KVKQVRLSSILKHSLNGSDGNGPVRSPSPAPTHQPHNHHHPPTHHHHHHHTSLTPAISPAPATEKGAPEYGSPAPERSAASPKRSYTAKPPGCQYRYKRK

Query:  SGRKEGKQSHLTPFASPNISPDHSAASPSPQHQINPPAAPVSPAPALTPLPNVIYAHVQPPSKSDSNHPEKSMTNPSVAPSPSGAVRCHMITQWGFTLFL
        SGRKEGKQSHLTP ASPNISPDHSAASPSPQHQINPPAAPVSPAPALTPLPNVIYAHVQPPSKSDSN P     NPSVAPSPSGA RCHMITQWGFTLFL
Subjt:  SGRKEGKQSHLTPFASPNISPDHSAASPSPQHQINPPAAPVSPAPALTPLPNVIYAHVQPPSKSDSNHPEKSMTNPSVAPSPSGAVRCHMITQWGFTLFL

Query:  FLARHM
         LARHM
Subjt:  FLARHM

XP_031740216.1 uncharacterized protein LOC101216010 isoform X2 [Cucumis sativus]1.4e-25796.89Show/hide
Query:  MGKNDGEQPLPSAIDSRPSGLVADGRCCCGCVSIRRLIGFRCIFILLLSVALFVSAVFWLPPFLHYADQKDLGLNPSYRGHDIVATFNVERSVSLLEDNF
        MGKNDGEQPLPSAIDSRPSGLVADGRCCCGCVSIRRLIGFRCIFILLLSVALFVSAVFWLPPFLHYADQKDL LNPSYRGHDIVATFNVERSVSLLEDNF
Subjt:  MGKNDGEQPLPSAIDSRPSGLVADGRCCCGCVSIRRLIGFRCIFILLLSVALFVSAVFWLPPFLHYADQKDLGLNPSYRGHDIVATFNVERSVSLLEDNF

Query:  EQLRTDIFEEFPIPSIKVDILSLEPLSGSNRTKVVFSLDPDTDDSEISSTYLSLIRSIITSLVTNQFLSITKSTFGEAYSFEVLKFPGGITIIPPQSAFL
        +QLRTDIFEEFPIPSIKV+ILSLEPLSGSNRTKVVFSLDPDTDDSEISSTYLSLIRSIITSLVTNQFLSITKSTFGEAYSFEVLKFPGGITIIPPQSAFL
Subjt:  EQLRTDIFEEFPIPSIKVDILSLEPLSGSNRTKVVFSLDPDTDDSEISSTYLSLIRSIITSLVTNQFLSITKSTFGEAYSFEVLKFPGGITIIPPQSAFL

Query:  LQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNAEGSTVTAPTIVQTSVLLEVGNTPSMRRLKQLAQTISGSNSSNLGLNNTEFG
        LQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNAEGSTVT PTIVQTSVLLEVGNTPSMRRLKQLAQTISGSNSSNLGLNNTEFG
Subjt:  LQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNAEGSTVTAPTIVQTSVLLEVGNTPSMRRLKQLAQTISGSNSSNLGLNNTEFG

Query:  KVKQVRLSSILKHSLNGSDGNGPVRSPSPAPTHQPHNHHHPPTHHHHHHHTSLTPAISPAPATEKGAPEYGSPAPERSAASPKRSYTAKPPGCQYRYKRK
        KVKQVRLSSILKHSLNGSDGNGPVRSPSPAPT QPHN HHPPTHHHHHHHT LTPAISPAPATEKGAPEYGSPAPER+AASPKRSYTAKPPGCQYRYKRK
Subjt:  KVKQVRLSSILKHSLNGSDGNGPVRSPSPAPTHQPHNHHHPPTHHHHHHHTSLTPAISPAPATEKGAPEYGSPAPERSAASPKRSYTAKPPGCQYRYKRK

Query:  SGRKEGKQSHLTPFASPNISPDHSAASPSPQHQINPPAAPVSPAPALTPLPNVIYAHVQPPSKSDSNHPEKSMTNPSVAPSP
        SGRKEGKQSHLTP ASPNISPDHSAASPSPQHQINPPAAPVSPAPALTPLPNVIYAHVQPPSKSDSNHP     NPS+APSP
Subjt:  SGRKEGKQSHLTPFASPNISPDHSAASPSPQHQINPPAAPVSPAPALTPLPNVIYAHVQPPSKSDSNHPEKSMTNPSVAPSP

XP_038882638.1 uncharacterized protein LOC120073837 [Benincasa hispida]6.7e-25291.57Show/hide
Query:  MGKNDGEQPLPSAIDSRPSGLVADGRCCCGCVSIRRLIGFRCIFILLLSVALFVSAVFWLPPFLHYADQKDLGLNPSYRGHDIVATFNVERSVSLLEDNF
        MGKNDGEQPLPSAIDSRPSG VADGRCCCGCVSIRRLIGFRCIFILLLSVALFVSAVFWLPPFLHYADQKDLGLNPSYRGHDIVATFNVER VSLLEDN 
Subjt:  MGKNDGEQPLPSAIDSRPSGLVADGRCCCGCVSIRRLIGFRCIFILLLSVALFVSAVFWLPPFLHYADQKDLGLNPSYRGHDIVATFNVERSVSLLEDNF

Query:  EQLRTDIFEEFPIPSIKVDILSLEPLSGSNRTKVVFSLDPDTDDSEISSTYLSLIRSIITSLVTNQFLSITKSTFGEAYSFEVLKFPGGITIIPPQSAFL
        EQLRTDIFEEF IPSIKVDILSLE L GSNRTKVVFSLDPDTD+SEISSTYLSLIRS I SLVTNQFL ITKS FGEA+SFEVLKFPGGITIIPPQSAFL
Subjt:  EQLRTDIFEEFPIPSIKVDILSLEPLSGSNRTKVVFSLDPDTDDSEISSTYLSLIRSIITSLVTNQFLSITKSTFGEAYSFEVLKFPGGITIIPPQSAFL

Query:  LQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNAEGSTVTAPTIVQTSVLLEVGNTPSMRRLKQLAQTISGSNSSNLGLNNTEFG
        LQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILY+KLWNAEGSTVTAPTIVQ+SVLLEVGNTPSMRRLKQLAQTISGSNSSNLGLNNTEFG
Subjt:  LQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNAEGSTVTAPTIVQTSVLLEVGNTPSMRRLKQLAQTISGSNSSNLGLNNTEFG

Query:  KVKQVRLSSILKHSLNGSDGNGPVRSPSPAPTHQPHNHHHPPTHHHHHHHTSLTPAISPAPATEKGAPEYGSPAPERSAASPKRSYTAKPPGCQYRYKRK
        KVKQVRLSSILKHSLNGS+GNGP RSPSPAP  QPHN  +PPT HHHHHHT LTPAISPAPATEKGAPEYGSPAPERS ASPKRSYTAKPPGCQY  KRK
Subjt:  KVKQVRLSSILKHSLNGSDGNGPVRSPSPAPTHQPHNHHHPPTHHHHHHHTSLTPAISPAPATEKGAPEYGSPAPERSAASPKRSYTAKPPGCQYRYKRK

Query:  SGRKEGKQSHLTPFASPNISPDHSAASPS--PQHQINPPAAPVSPAPALTPLPNVIYAHVQPPSKSDSNHPEKSMTNPSVA--PSPSGAVRCHMITQWGF
        SGRKEGKQSHLTP ASPN+SPDHSAASPS  PQH++NPPAAP+ PAPALTPLPNVIYAHVQPPSKS+SNHPEKS TNPS A  PSPSGA RC MITQWGF
Subjt:  SGRKEGKQSHLTPFASPNISPDHSAASPS--PQHQINPPAAPVSPAPALTPLPNVIYAHVQPPSKSDSNHPEKSMTNPSVA--PSPSGAVRCHMITQWGF

Query:  TLFLFLARHM
        TLFL LA HM
Subjt:  TLFLFLARHM

TrEMBL top hitse value%identityAlignment
A0A0A0KYS3 Uncharacterized protein6.9e-27196.44Show/hide
Query:  MGKNDGEQPLPSAIDSRPSGLVADGRCCCGCVSIRRLIGFRCIFILLLSVALFVSAVFWLPPFLHYADQKDLGLNPSYRGHDIVATFNVERSVSLLEDNF
        MGKNDGEQPLPSAIDSRPSGLVADGRCCCGCVSIRRLIGFRCIFILLLSVALFVSAVFWLPPFLHYADQKDL LNPSYRGHDIVATFNVERSVSLLEDNF
Subjt:  MGKNDGEQPLPSAIDSRPSGLVADGRCCCGCVSIRRLIGFRCIFILLLSVALFVSAVFWLPPFLHYADQKDLGLNPSYRGHDIVATFNVERSVSLLEDNF

Query:  EQLRTDIFEEFPIPSIKVDILSLEPLSGSNRTKVVFSLDPDTDDSEISSTYLSLIRSIITSLVTNQFLSITKSTFGEAYSFEVLKFPGGITIIPPQSAFL
        +QLRTDIFEEFPIPSIKV+ILSLEPLSGSNRTKVVFSLDPDTDDSEISSTYLSLIRSIITSLVTNQFLSITKSTFGEAYSFEVLKFPGGITIIPPQSAFL
Subjt:  EQLRTDIFEEFPIPSIKVDILSLEPLSGSNRTKVVFSLDPDTDDSEISSTYLSLIRSIITSLVTNQFLSITKSTFGEAYSFEVLKFPGGITIIPPQSAFL

Query:  LQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNAEGSTVTAPTIVQTSVLLEVGNTPSMRRLKQLAQTISGSNSSNLGLNNTEFG
        LQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNAEGSTVT PTIVQTSVLLEVGNTPSMRRLKQLAQTISGSNSSNLGLNNTEFG
Subjt:  LQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNAEGSTVTAPTIVQTSVLLEVGNTPSMRRLKQLAQTISGSNSSNLGLNNTEFG

Query:  KVKQVRLSSILKHSLNGSDGNGPVRSPSPAPTHQPHNHHHPPTHHHHHHHTSLTPAISPAPATEKGAPEYGSPAPERSAASPKRSYTAKPPGCQYRYKRK
        KVKQVRLSSILKHSLNGSDGNGPVRSPSPAPT QPHN HHPPTHHHHHHHT LTPAISPAPATEKGAPEYGSPAPER+AASPKRSYTAKPPGCQYRYKRK
Subjt:  KVKQVRLSSILKHSLNGSDGNGPVRSPSPAPTHQPHNHHHPPTHHHHHHHTSLTPAISPAPATEKGAPEYGSPAPERSAASPKRSYTAKPPGCQYRYKRK

Query:  SGRKEGKQSHLTPFASPNISPDHSAASPSPQHQINPPAAPVSPAPALTPLPNVIYAHVQPPSKSDSNHPEKSMTNPSVAPSPSGAVRCHMITQWGFTLFL
        SGRKEGKQSHLTP ASPNISPDHSAASPSPQHQINPPAAPVSPAPALTPLPNVIYAHVQPPSKSDSNHP     NPS+APSPSGA RCHMITQWGFTLFL
Subjt:  SGRKEGKQSHLTPFASPNISPDHSAASPSPQHQINPPAAPVSPAPALTPLPNVIYAHVQPPSKSDSNHPEKSMTNPSVAPSPSGAVRCHMITQWGFTLFL

Query:  FLARHM
         LA HM
Subjt:  FLARHM

A0A1S3C173 uncharacterized protein LOC1034958526.0e-26795.26Show/hide
Query:  MGKNDGEQPLPSAIDSRPSGLVADGRCCCGCVSIRRLIGFRCIFILLLSVALFVSAVFWLPPFLHYADQKDLGLNPSYRGHDIVATFNVERSVSLLEDNF
        MGKNDGEQPLPSAIDSRPSGLVADGRCC GCVSIRRLIGFRCIFILLLSVALFVSAV WLPPF+HYADQKDLGLNPSYRGHDIVATFNVERSVSLLEDNF
Subjt:  MGKNDGEQPLPSAIDSRPSGLVADGRCCCGCVSIRRLIGFRCIFILLLSVALFVSAVFWLPPFLHYADQKDLGLNPSYRGHDIVATFNVERSVSLLEDNF

Query:  EQLRTDIFEEFPIPSIKVDILSLEPLSGSNRTKVVFSLDPDTDDSEISSTYLSLIRSIITSLVTNQFLSITKSTFGEAYSFEVLKFPGGITIIPPQSAFL
        +QLRTDIFEEFPIPSIKV+ILSLEPLSGSNRTKVVFS+DPDTDDSEISSTYLSLIRSIITSLVTNQFLSITKSTFGEAYSFEVLKFPGGITIIPPQSAFL
Subjt:  EQLRTDIFEEFPIPSIKVDILSLEPLSGSNRTKVVFSLDPDTDDSEISSTYLSLIRSIITSLVTNQFLSITKSTFGEAYSFEVLKFPGGITIIPPQSAFL

Query:  LQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNAEGSTVTAPTIVQTSVLLEVGNTPSMRRLKQLAQTISGSNSSNLGLNNTEFG
        LQKVQILFNFTLNFSIHQIQVHFSELTSQL+AGLRLAPYEILYIKLWNAEGSTVTAPTIVQTSVLLEVGNTPSMRRLKQLAQTISGSNSSNLGLNN EFG
Subjt:  LQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNAEGSTVTAPTIVQTSVLLEVGNTPSMRRLKQLAQTISGSNSSNLGLNNTEFG

Query:  KVKQVRLSSILKHSLNGSDGNGPVRSPSPAPTHQPHNHHHPPTHHHHHHHTSLTPAISPAPATEKGAPEYGSPAPERSAASPKRSYTAKPPGCQYRYKRK
        KVKQVRLSSILKHSLNGS+GNGPVRSPSPAPT QPHNHHHPPTHHHHHHHT L  AISPAPATEKGAPEYGSPAPERSAASP+RSYTA+PPGCQYRYKRK
Subjt:  KVKQVRLSSILKHSLNGSDGNGPVRSPSPAPTHQPHNHHHPPTHHHHHHHTSLTPAISPAPATEKGAPEYGSPAPERSAASPKRSYTAKPPGCQYRYKRK

Query:  SGRKEGKQSHLTPFASPNISPDHSAASPSPQHQINPPAAPVSPAPALTPLPNVIYAHVQPPSKSDSNHPEKSMTNPSVAPSPSGAVRCHMITQWGFTLFL
        SGRKEGKQSHLTP ASPNISPDHSAASPSPQHQINPPAAPVSPAPALTPLPNVIYAHVQPPSKSDSN P     NPSVAPSPSGA RCHMITQWGFTLFL
Subjt:  SGRKEGKQSHLTPFASPNISPDHSAASPSPQHQINPPAAPVSPAPALTPLPNVIYAHVQPPSKSDSNHPEKSMTNPSVAPSPSGAVRCHMITQWGFTLFL

Query:  FLARHM
         LARHM
Subjt:  FLARHM

A0A5A7SNH7 Zinc finger family protein, putative isoform 15.0e-22194.86Show/hide
Query:  RGHDIVATFNVERSVSLLEDNFEQLRTDIFEEFPIPSIKVDILSLEPLSGSNRTKVVFSLDPDTDDSEISSTYLSLIRSIITSLVTNQFLSITKSTFGEA
        +GHDIVATFNVERSVSLLEDNF+QLRTDIFEEFPIPSIKV+ILSLEPLSGSNRTKVVFS+DPDTDDSEISSTYLSLIRSIITSLVTNQFLSITKSTFGEA
Subjt:  RGHDIVATFNVERSVSLLEDNFEQLRTDIFEEFPIPSIKVDILSLEPLSGSNRTKVVFSLDPDTDDSEISSTYLSLIRSIITSLVTNQFLSITKSTFGEA

Query:  YSFEVLKFPGGITIIPPQSAFLLQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNAEGSTVTAPTIVQTSVLLEVGNTPSMRRLK
        YSFEVLKFPGGITIIPPQSAFLLQKVQILFNFTLNFSIHQIQVHFSELTSQL+AGLRLAPYEILYIKLWNAEGSTVTAPTIVQTSVLLEVGNTPSMRRLK
Subjt:  YSFEVLKFPGGITIIPPQSAFLLQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNAEGSTVTAPTIVQTSVLLEVGNTPSMRRLK

Query:  QLAQTISGSNSSNLGLNNTEFGKVKQVRLSSILKHSLNGSDGNGPVRSPSPAPTHQPHNHHHPPTHHHHHHHTSLTPAISPAPATEKGAPEYGSPAPERS
        QLAQTISGSNSSNLGLNN EFGKVKQVRLSSILKHSLNGS+GNGPVRSPSPAPT QPHNHHHPPTHHHHHHHT L  AISPAPATEKGAPEYGSPAPERS
Subjt:  QLAQTISGSNSSNLGLNNTEFGKVKQVRLSSILKHSLNGSDGNGPVRSPSPAPTHQPHNHHHPPTHHHHHHHTSLTPAISPAPATEKGAPEYGSPAPERS

Query:  AASPKRSYTAKPPGCQYRYKRKSGRKEGKQSHLTPFASPNISPDHSAASPSPQHQINPPAAPVSPAPALTPLPNVIYAHVQPPSKSDSNHPEKSMTNPSV
        AASP+RSYTA+PPGCQYRYKRKSGRKEGKQSHLTP ASPNISPDHSAASPSPQHQINPPAAPVSPAPALTPLPNVIYAHVQPPSKSDSN P     NPSV
Subjt:  AASPKRSYTAKPPGCQYRYKRKSGRKEGKQSHLTPFASPNISPDHSAASPSPQHQINPPAAPVSPAPALTPLPNVIYAHVQPPSKSDSNHPEKSMTNPSV

Query:  APSPSGAVRCHMITQWGFTLFLFLARHM
        APSPSGA RCHMITQWGFTLFL LARHM
Subjt:  APSPSGAVRCHMITQWGFTLFLFLARHM

A0A6J1EB56 uncharacterized protein LOC111432513 isoform X38.6e-22182.39Show/hide
Query:  MGKNDGEQPLPSAIDSRPSGLVADGRCCCGCVSIRRLIGFRCIFILLLSVALFVSAVFWLPPFLHYADQKDLGLNPSYRGHDIVATFNVERSVSLLEDNF
        MGKNDGE P PSA+ S PS     GRCC GCVSIRRLIGFRCIFILLLSVALFVSAVFWLPPFLHY+DQKDLGLNPSYRGHDIVATF VER VSLL+DN 
Subjt:  MGKNDGEQPLPSAIDSRPSGLVADGRCCCGCVSIRRLIGFRCIFILLLSVALFVSAVFWLPPFLHYADQKDLGLNPSYRGHDIVATFNVERSVSLLEDNF

Query:  EQLRTDIFEEFPIPSIKVDILSLEPLSGSNRTKVVFSLDPDTDDSEISSTYLSLIRSIITSLVTNQ-FLSITKSTFGEAYSFEVLKFPGGITIIPPQSAF
        E+LRTDIFEEFPIPSIKVDILSL  LSGSNRTKVVF +DPDTDD EI STYLSLIRS   S+VTNQ FL ITKS FGEA+SFEVLKFPGGITIIPPQSAF
Subjt:  EQLRTDIFEEFPIPSIKVDILSLEPLSGSNRTKVVFSLDPDTDDSEISSTYLSLIRSIITSLVTNQ-FLSITKSTFGEAYSFEVLKFPGGITIIPPQSAF

Query:  LLQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNAEGSTVTAPTIVQTSVLLEVGNTPSMRRLKQLAQTISGSNSSNLGLNNTEF
        LLQKVQILFNFTLNFSIHQIQVHFSELTSQL+AGLRLAPYEILYIKLWNAEGSTVTAPTIVQ+SVLLEVGNTPSM+RLKQLAQTIS SNSSNLGLNNTEF
Subjt:  LLQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNAEGSTVTAPTIVQTSVLLEVGNTPSMRRLKQLAQTISGSNSSNLGLNNTEF

Query:  GKVKQVRLSSILKHSLNGSDGNGPVRSPSPAPTHQPHNHHHPPTHHHHHHHTSLTPAISPAPATEKGAPEYGSPAPERSAASPKRSYTAKPPGCQYRYKR
        GKVKQVRLSSILKHSLNG DG GP+RSPSPAPT QPHN HHPP+HHHHHHH  LTP ISPAPA E GAPEYG  AP +SAASPKRSY AKPPGCQ  YKR
Subjt:  GKVKQVRLSSILKHSLNGSDGNGPVRSPSPAPTHQPHNHHHPPTHHHHHHHTSLTPAISPAPATEKGAPEYGSPAPERSAASPKRSYTAKPPGCQYRYKR

Query:  KSGRKEGKQSHLTPFASPNISPDHSAASPSPQHQINPPAAPVSPAPALTPLPNVIYAHVQPPSKSDSNHPEKSMTNPSV----APSPSGAVRCHMITQWG
        KSGRKEGKQ HL+P ASP+ISP HSAASPS QH        VSP  A TPLP+VIYAHVQPPSKSDSNHPEKS T+PS+    +PSPS A    MIT+WG
Subjt:  KSGRKEGKQSHLTPFASPNISPDHSAASPSPQHQINPPAAPVSPAPALTPLPNVIYAHVQPPSKSDSNHPEKSMTNPSV----APSPSGAVRCHMITQWG

Query:  FTLFLFLARHM
        FTL L +A +M
Subjt:  FTLFLFLARHM

A0A6J1EEJ8 uncharacterized protein LOC111432513 isoform X41.7e-22182.71Show/hide
Query:  MGKNDGEQPLPSAIDSRPSGLVADGRCCCGCVSIRRLIGFRCIFILLLSVALFVSAVFWLPPFLHYADQKDLGLNPSYRGHDIVATFNVERSVSLLEDNF
        MGKNDGE P PSA+ S PS     GRCC GCVSIRRLIGFRCIFILLLSVALFVSAVFWLPPFLHY+DQKDLGLNPSYRGHDIVATF VER VSLL+DN 
Subjt:  MGKNDGEQPLPSAIDSRPSGLVADGRCCCGCVSIRRLIGFRCIFILLLSVALFVSAVFWLPPFLHYADQKDLGLNPSYRGHDIVATFNVERSVSLLEDNF

Query:  EQLRTDIFEEFPIPSIKVDILSLEPLSGSNRTKVVFSLDPDTDDSEISSTYLSLIRSIITSLVTNQ-FLSITKSTFGEAYSFEVLKFPGGITIIPPQSAF
        E+LRTDIFEEFPIPSIKVDILSL  LSGSNRTKVVF +DPDTDD EI STYLSLIRS   S+VTNQ FL ITKS FGEA+SFEVLKFPGGITIIPPQSAF
Subjt:  EQLRTDIFEEFPIPSIKVDILSLEPLSGSNRTKVVFSLDPDTDDSEISSTYLSLIRSIITSLVTNQ-FLSITKSTFGEAYSFEVLKFPGGITIIPPQSAF

Query:  LLQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNAEGSTVTAPTIVQTSVLLEVGNTPSMRRLKQLAQTISGSNSSNLGLNNTEF
        LLQKVQILFNFTLNFSIHQIQVHFSELTSQL+AGLRLAPYEILYIKLWNAEGSTVTAPTIVQ+SVLLEVGNTPSM+RLKQLAQTIS SNSSNLGLNNTEF
Subjt:  LLQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNAEGSTVTAPTIVQTSVLLEVGNTPSMRRLKQLAQTISGSNSSNLGLNNTEF

Query:  GKVKQVRLSSILKHSLNGSDGNGPVRSPSPAPTHQPHNHHHPPTHHHHHHHTSLTPAISPAPATEKGAPEYGSPAPERSAASPKRSYTAKPPGCQYRYKR
        GKVKQVRLSSILKHSLNG DG GP+RSPSPAPT QPHN HHPP+HHHHHHH  LTP ISPAPA E GAPEYG  AP +SAASPKRSY AKPPGCQ  YKR
Subjt:  GKVKQVRLSSILKHSLNGSDGNGPVRSPSPAPTHQPHNHHHPPTHHHHHHHTSLTPAISPAPATEKGAPEYGSPAPERSAASPKRSYTAKPPGCQYRYKR

Query:  KSGRKEGKQSHLTPFASPNISPDHSAASPSPQHQINPPAAPVSPAPALTPLPNVIYAHVQPPSKSDSNHPEKSMTNPSVAPSPSGAVRCH--MITQWGFT
        KSGRKEGKQ HL+P ASP+ISP HSAASPS QH        VSP  A TPLP+VIYAHVQPPSKSDSNHPEKS T+PS+ PSPS +   H  MIT+WGFT
Subjt:  KSGRKEGKQSHLTPFASPNISPDHSAASPSPQHQINPPAAPVSPAPALTPLPNVIYAHVQPPSKSDSNHPEKSMTNPSVAPSPSGAVRCH--MITQWGFT

Query:  LFLFLARHM
        L L +A +M
Subjt:  LFLFLARHM

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G10790.1 BEST Arabidopsis thaliana protein match is: hydroxyproline-rich glycoprotein family protein (TAIR:AT3G56590.2)4.1e-3435.24Show/hide
Query:  ADGRCCCGCVSIRRLIGFRCIFILLLSVALFVSAVFWLPPFLHYADQKDLG---LNPSYRGHDIVATFNVERSVSLLEDNFEQLRTDIFEEFPIP-SIKV
        + GR C    S  RL+G RC+ +L+LS A+ +SA+FWL P    ++ K  G   LN S     + A+F +++ VS +  +  ++  DI     +  + KV
Subjt:  ADGRCCCGCVSIRRLIGFRCIFILLLSVALFVSAVFWLPPFLHYADQKDLG---LNPSYRGHDIVATFNVERSVSLLEDNFEQLRTDIFEEFPIP-SIKV

Query:  DILSLEPLSGSNRTKVVFSLDPDTDDSEISSTYLSLIRSIITSLVTNQF-LSITKSTFGEAYSFEVLKFPGGITIIPPQSAFLLQKVQILFNFTLNFSIH
         +LSL     SN T V F++ P   D EIS   LSL+RS    L   +  L +T S FG+  SF+VLKFPGGIT+ P + A +     +LF+ T+  SI 
Subjt:  DILSLEPLSGSNRTKVVFSLDPDTDDSEISSTYLSLIRSIITSLVTNQF-LSITKSTFGEAYSFEVLKFPGGITIIPPQSAFLLQKVQILFNFTLNFSIH

Query:  QIQVHFSELTSQLEAGLRLAPYEILYIKLWNAEGSTVTAPTIVQTSVLLEVGNTPSMRRLKQLAQTISGSNSSNLGLNNTEFGKVKQVRLSSILKHSLNG
         +Q     L    E  L L PYE ++ +L N +GST++ P   Q  V   +      +RL    Q I  S + NLGL+   FG+VK +  S+ L   +  
Subjt:  QIQVHFSELTSQLEAGLRLAPYEILYIKLWNAEGSTVTAPTIVQTSVLLEVGNTPSMRRLKQLAQTISGSNSSNLGLNNTEFGKVKQVRLSSILKHSLNG

Query:  SDGNGPVRSPSPAPT
        SD        +PAPT
Subjt:  SDGNGPVRSPSPAPT

AT3G10810.1 zinc finger (C3HC4-type RING finger) family protein5.3e-9846.63Show/hide
Query:  MGKNDGEQPLPSA-IDSRPSGLVADGRCCCGCVSIRRLIGFRCIFILLLSVALFVSAVFWLPPFLHYADQKDLGLNPSYRGHDIVATFNVERSVSLLEDN
        MGK + +  L  A  ++     V + RC C C  I   +GF+C+F+LLLSVALF+SA+F L PF    D++D  L+P +RGH IVA+F++ RS S L +N
Subjt:  MGKNDGEQPLPSA-IDSRPSGLVADGRCCCGCVSIRRLIGFRCIFILLLSVALFVSAVFWLPPFLHYADQKDLGLNPSYRGHDIVATFNVERSVSLLEDN

Query:  FEQLRTDIFEEFPIPSIKVDILSLEPLSGSNRTKVVFSLDPDTDDSEISSTYLSLIRSIITSLVTNQ-FLSITKSTFGEAYSFEVLKFPGGITIIPPQSA
          QL+ DIF+E    SIKV IL++EP    N TKVVF +DPDT   EI    LS I+ +  S++ NQ  L +TKS FGE + FEVLKFPGGIT+IPPQSA
Subjt:  FEQLRTDIFEEFPIPSIKVDILSLEPLSGSNRTKVVFSLDPDTDDSEISSTYLSLIRSIITSLVTNQ-FLSITKSTFGEAYSFEVLKFPGGITIIPPQSA

Query:  FLLQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNAEGSTVTAPTIVQTSVLLEVGNTPSMRRLKQLAQTISGSNSSNLGLNNTE
        F LQK +I+FNFTLN+SIHQIQ++F+ L SQL+ GL LAPYE LY+ L N+EGSTV+ PT V +SVLL VG + S  RLKQL  TI+GS S NLGLNNT 
Subjt:  FLLQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNAEGSTVTAPTIVQTSVLLEVGNTPSMRRLKQLAQTISGSNSSNLGLNNTE

Query:  FGKVKQVRLSSILKHSLNGSDGNGPVRSPSPAPTHQPHNHHHPPTH------HHHHHHTSLTPAISPAPATEKGAPEYGSPAPERSAASPKRSYTAKP--
        FGKVKQVRLSS L +S + S    P  SPSP   H  H+HHH   H      HHHHHH +L+P ++P            SPAP RS    KR+ +A P  
Subjt:  FGKVKQVRLSSILKHSLNGSDGNGPVRSPSPAPTHQPHNHHHPPTH------HHHHHHTSLTPAISPAPATEKGAPEYGSPAPERSAASPKRSYTAKP--

Query:  -PGCQYRYKRKSGRKEGKQSHLTPFASPNISPDHSAASPSPQHQINPPAAPVSPA-----PALTPLPNVIYAHVQPPSKSDSNHPEKSMTNPSVAPSPSG
         PG +  +K K       Q   TP  +P        ++ +P HQ++ P AP+S A     P   PLP+V++AH   P  ++   P  +        S S 
Subjt:  -PGCQYRYKRKSGRKEGKQSHLTPFASPNISPDHSAASPSPQHQINPPAAPVSPA-----PALTPLPNVIYAHVQPPSKSDSNHPEKSMTNPSVAPSPSG

Query:  AVRCHMITQWGFTLFLFLA
        A+       W   L L +A
Subjt:  AVRCHMITQWGFTLFLFLA

AT3G56590.1 hydroxyproline-rich glycoprotein family protein1.2e-9747.34Show/hide
Query:  MGKND-GEQPLP---SAIDSRPSGLVADGRCCCGCVSIRRLIGFRCIFILLLSVALFVSAVFWLPPFLHYADQKDLGLNPSYRGHDIVATFNVERSVSLL
        MGKN   EQ LP    A  +R +G      CCC C  I      RC+ IL  S A+F+SA+FWLPPFL +AD  DL L+P ++ H IVA+F+V + +S +
Subjt:  MGKND-GEQPLP---SAIDSRPSGLVADGRCCCGCVSIRRLIGFRCIFILLLSVALFVSAVFWLPPFLHYADQKDLGLNPSYRGHDIVATFNVERSVSLL

Query:  EDNFEQLRTDIFEEFPIPSIKVDILSLEPLSGSNRTKVVFSLDPDTDDSEISSTYLSLIRSIITSLVTNQF-LSITKSTFGEAYSFEVLKFPGGITIIPP
        EDN  QL  DI +E   P  KV +L+LE L   NRT V+F++DP+ ++S+I +   SLI++   +LV  Q    +T+S FGE + FEVLKFPGGIT+IPP
Subjt:  EDNFEQLRTDIFEEFPIPSIKVDILSLEPLSGSNRTKVVFSLDPDTDDSEISSTYLSLIRSIITSLVTNQF-LSITKSTFGEAYSFEVLKFPGGITIIPP

Query:  QSAFLLQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNAEGSTVTAPTIVQTSVLLEVGNTPSMRRLKQLAQTISGSNSSNLGLN
        Q  F LQK Q+LFNFTLNFSI+QIQ +F EL SQL+ G+ LA YE LYI L N+ GSTV  PTIV +SVLL  G   S  RLKQLAQTI+ S+S NLGLN
Subjt:  QSAFLLQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNAEGSTVTAPTIVQTSVLLEVGNTPSMRRLKQLAQTISGSNSSNLGLN

Query:  NTEFGKVKQVRLSSILKHSLNGSDGNGPVRSPSPAPTHQPHNHHHPPTH-HHHHHHTSLTPAISPAPATEKGAPEYGSPAPERSAASPKRSYTAKPPGCQ
        +T FGKVKQVRLSSIL HS        P  S +P+P+ QP  H +P  H HHHHHH  L P  S +P T+  AP   + AP + +  P R+     P C 
Subjt:  NTEFGKVKQVRLSSILKHSLNGSDGNGPVRSPSPAPTHQPHNHHHPPTH-HHHHHHTSLTPAISPAPATEKGAPEYGSPAPERSAASPKRSYTAKPPGCQ

Query:  YRYKRKSGRKEGKQSHLTPFASPNISPDHSAASPSPQHQINPPAAPVSPAPALTPLPNVIYAHVQPPSKSDSNHPEKSMTNPSVAPSP
        Y  +R  G          P  +P+ S  H    P+P    NP        P  +PLP+V++AH+ PPSKS          +PS AP+P
Subjt:  YRYKRKSGRKEGKQSHLTPFASPNISPDHSAASPSPQHQINPPAAPVSPAPALTPLPNVIYAHVQPPSKSDSNHPEKSMTNPSVAPSP

AT3G56590.2 hydroxyproline-rich glycoprotein family protein1.4e-9847.45Show/hide
Query:  MGKND-GEQPLP---SAIDSRPSGLVADGRCCCGCVSIRRLIGFRCIFILLLSVALFVSAVFWLPPFLHYADQKDLGLNPSYRGHDIVATFNVERSVSLL
        MGKN   EQ LP    A  +R +G      CCC C  I      RC+ IL  S A+F+SA+FWLPPFL +AD  DL L+P ++ H IVA+F+V + +S +
Subjt:  MGKND-GEQPLP---SAIDSRPSGLVADGRCCCGCVSIRRLIGFRCIFILLLSVALFVSAVFWLPPFLHYADQKDLGLNPSYRGHDIVATFNVERSVSLL

Query:  EDNFEQLRTDIFEEFPIPSIKVDILSLEPLSGSNRTKVVFSLDPDTDDSEISSTYLSLIRSIITSLVTNQF-LSITKSTFGEAYSFEVLKFPGGITIIPP
        EDN  QL  DI +E   P  KV +L+LE L   NRT V+F++DP+ ++S+I +   SLI++   +LV  Q    +T+S FGE + FEVLKFPGGIT+IPP
Subjt:  EDNFEQLRTDIFEEFPIPSIKVDILSLEPLSGSNRTKVVFSLDPDTDDSEISSTYLSLIRSIITSLVTNQF-LSITKSTFGEAYSFEVLKFPGGITIIPP

Query:  QSAFLLQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNAEGSTVTAPTIVQTSVLLEVGNTPSMRRLKQLAQTISGSNSSNLGLN
        Q  F LQK Q+LFNFTLNFSI+QIQ +F EL SQL+ G+ LA YE LYI L N+ GSTV  PTIV +SVLL  G   S  RLKQLAQTI+ S+S NLGLN
Subjt:  QSAFLLQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNAEGSTVTAPTIVQTSVLLEVGNTPSMRRLKQLAQTISGSNSSNLGLN

Query:  NTEFGKVKQVRLSSILKHSLNGSDGNGPVRSPSPAPTHQPHNHHHPPTH-HHHHHHTSLTPAISPAPATEKGAPEYGSPAPERSAASPKRSYTAKPPGCQ
        +T FGKVKQVRLSSIL HS        P  S +P+P+ QP  H +P  H HHHHHH  L P  S +P T+  AP   + AP + +  P R+     P C 
Subjt:  NTEFGKVKQVRLSSILKHSLNGSDGNGPVRSPSPAPTHQPHNHHHPPTH-HHHHHHTSLTPAISPAPATEKGAPEYGSPAPERSAASPKRSYTAKPPGCQ

Query:  YRYKRKSGRKEGKQSHLTPFASPNISPDHSAASPSPQHQINPPAAPVSPAPALTPLPNVIYAHVQPPSKSDSNHPEKSMTNPSVAPSPSGA
        Y  +R  G          P  +P+ S  H    P+P    NP        P  +PLP+V++AH+ PPSKS          +PS AP+PS A
Subjt:  YRYKRKSGRKEGKQSHLTPFASPNISPDHSAASPSPQHQINPPAAPVSPAPALTPLPNVIYAHVQPPSKSDSNHPEKSMTNPSVAPSPSGA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGGAAAAACGACGGAGAACAGCCACTGCCGTCCGCCATCGACTCCAGGCCGTCCGGTCTGGTTGCCGATGGTCGATGCTGTTGTGGGTGTGTTTCGATTCGAAGACT
CATTGGCTTCAGATGCATCTTCATTCTCCTATTATCCGTTGCCTTGTTCGTTTCTGCTGTTTTTTGGCTGCCCCCTTTCCTCCATTATGCAGATCAAAAGGATCTGGGTC
TCAATCCGTCGTATCGAGGTCATGATATTGTAGCAACATTTAATGTTGAGAGGTCGGTGTCTTTGCTGGAAGACAATTTTGAGCAACTCCGAACCGACATTTTTGAAGAG
TTCCCCATACCTTCTATCAAAGTGGATATACTGTCTCTAGAACCATTGTCTGGATCAAACCGTACCAAAGTTGTGTTCAGCCTCGATCCAGATACTGATGATTCGGAAAT
CTCGTCAACGTATCTAAGTTTAATCAGGTCAATCATTACAAGTCTAGTAACAAATCAGTTCCTCAGCATTACCAAATCCACATTCGGGGAGGCCTATTCGTTTGAAGTGC
TGAAATTCCCCGGAGGAATAACGATAATCCCGCCACAGAGTGCATTTCTTTTGCAAAAAGTGCAAATTCTTTTCAACTTTACGTTGAACTTCTCTATTCATCAGATTCAA
GTACATTTCAGCGAACTGACGAGCCAACTGGAGGCGGGATTACGACTAGCTCCATATGAGATTTTATATATTAAACTTTGGAATGCGGAAGGTTCGACAGTGACAGCCCC
TACAATTGTCCAGACGTCTGTACTTCTTGAAGTTGGAAATACTCCATCGATGCGACGGCTGAAGCAGCTAGCTCAAACAATCTCAGGTTCTAATTCTAGCAACCTCGGCC
TGAATAATACGGAGTTTGGTAAGGTGAAACAAGTTCGCCTTTCTTCGATTCTTAAACACTCCCTCAATGGCAGTGACGGGAACGGTCCCGTAAGGTCACCTTCTCCTGCT
CCTACACACCAGCCCCATAACCACCATCATCCTCCGACTCACCACCACCACCACCATCACACCTCTCTAACCCCTGCAATTTCACCTGCCCCTGCAACCGAGAAGGGTGC
ACCAGAATACGGTTCGCCTGCCCCTGAAAGAAGTGCAGCATCACCTAAGAGAAGTTATACGGCTAAGCCACCCGGTTGTCAATATAGGTACAAGAGGAAGTCTGGTAGGA
AAGAAGGAAAGCAATCTCATTTAACCCCATTTGCGTCACCCAATATATCTCCTGATCATTCTGCTGCATCGCCATCACCACAACATCAAATAAACCCACCAGCAGCACCC
GTCTCACCAGCTCCGGCATTAACTCCATTGCCAAACGTCATTTACGCTCATGTTCAACCACCTTCCAAAAGCGACTCCAATCACCCCGAAAAATCCATGACAAATCCATC
AGTTGCACCATCTCCATCTGGCGCAGTTCGTTGTCATATGATCACTCAATGGGGATTCACACTGTTTCTTTTTCTCGCACGCCATATGTAA
mRNA sequenceShow/hide mRNA sequence
CAATGACCCACTTCAAGAAATCAATCAAAGACAAAAACCCACAAAGTAAAGTAGAAAATTTCAATCTCCAAGCAATTCTGTGACCATTTTTCACAATTAACATCTTTCAT
ATTCATCGCACTCTTATTATTACACACCCACATTCAAATTTCACTCCCTACTTCCTCTTCTCTTCTCACTCCACCGCAATAATGGCCGCACACATCTCGGACTACAGCCG
GAATGGCCCTTGACCCACTTCTCCGGCATTGTTCCCGATGGGGAAAAACGACGGAGAACAGCCACTGCCGTCCGCCATCGACTCCAGGCCGTCCGGTCTGGTTGCCGATG
GTCGATGCTGTTGTGGGTGTGTTTCGATTCGAAGACTCATTGGCTTCAGATGCATCTTCATTCTCCTATTATCCGTTGCCTTGTTCGTTTCTGCTGTTTTTTGGCTGCCC
CCTTTCCTCCATTATGCAGATCAAAAGGATCTGGGTCTCAATCCGTCGTATCGAGGTCATGATATTGTAGCAACATTTAATGTTGAGAGGTCGGTGTCTTTGCTGGAAGA
CAATTTTGAGCAACTCCGAACCGACATTTTTGAAGAGTTCCCCATACCTTCTATCAAAGTGGATATACTGTCTCTAGAACCATTGTCTGGATCAAACCGTACCAAAGTTG
TGTTCAGCCTCGATCCAGATACTGATGATTCGGAAATCTCGTCAACGTATCTAAGTTTAATCAGGTCAATCATTACAAGTCTAGTAACAAATCAGTTCCTCAGCATTACC
AAATCCACATTCGGGGAGGCCTATTCGTTTGAAGTGCTGAAATTCCCCGGAGGAATAACGATAATCCCGCCACAGAGTGCATTTCTTTTGCAAAAAGTGCAAATTCTTTT
CAACTTTACGTTGAACTTCTCTATTCATCAGATTCAAGTACATTTCAGCGAACTGACGAGCCAACTGGAGGCGGGATTACGACTAGCTCCATATGAGATTTTATATATTA
AACTTTGGAATGCGGAAGGTTCGACAGTGACAGCCCCTACAATTGTCCAGACGTCTGTACTTCTTGAAGTTGGAAATACTCCATCGATGCGACGGCTGAAGCAGCTAGCT
CAAACAATCTCAGGTTCTAATTCTAGCAACCTCGGCCTGAATAATACGGAGTTTGGTAAGGTGAAACAAGTTCGCCTTTCTTCGATTCTTAAACACTCCCTCAATGGCAG
TGACGGGAACGGTCCCGTAAGGTCACCTTCTCCTGCTCCTACACACCAGCCCCATAACCACCATCATCCTCCGACTCACCACCACCACCACCATCACACCTCTCTAACCC
CTGCAATTTCACCTGCCCCTGCAACCGAGAAGGGTGCACCAGAATACGGTTCGCCTGCCCCTGAAAGAAGTGCAGCATCACCTAAGAGAAGTTATACGGCTAAGCCACCC
GGTTGTCAATATAGGTACAAGAGGAAGTCTGGTAGGAAAGAAGGAAAGCAATCTCATTTAACCCCATTTGCGTCACCCAATATATCTCCTGATCATTCTGCTGCATCGCC
ATCACCACAACATCAAATAAACCCACCAGCAGCACCCGTCTCACCAGCTCCGGCATTAACTCCATTGCCAAACGTCATTTACGCTCATGTTCAACCACCTTCCAAAAGCG
ACTCCAATCACCCCGAAAAATCCATGACAAATCCATCAGTTGCACCATCTCCATCTGGCGCAGTTCGTTGTCATATGATCACTCAATGGGGATTCACACTGTTTCTTTTT
CTCGCACGCCATATGTAACATCAAAAAGAAGACTACCGGTTTTCTGATGAACATGTGATACGAGAAATGCGAAGGTTATTATAAAATGATAAAAAGTGCAGGAGTTTCTT
TAAAAGTGTAAGCAGAGTAGAGGAAAGCAAAGCAAAAGAGGCATTGCTTGATGAAAGATGGTTTCTTTTCTAAGTGTGTAAATGTAAATATCATCTGATTAAGAAACTTG
TTGCTGATGCAGTTTCAGGTCAAAGTCCACAGAGGTGGCAGGCCTTTAGAAACTTGCATATTTTCCCACTGTTTTTGTGTATTATTATGATCTTCTTCTCCATAAAATGT
AAGGAGATAGAGAAGGAAGAAAAAGTGAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAGAAGGAAAACAGAGCAAATGCATAACTTTTTTCTTTTACCATTGTGTCTT
TTCCCATAACTTTTTGTTCTGAACTTTGTTTTAGGCTTTTTGTACTCAAAATGGCCTTACTCTTGTATTGCTCAAAATTGAAGCTTGTATGTATATATGAACACATAACA
CAGCTTAATCCCTCCACTTGAATGCAACAACTACTGTAGTAATTTTTGGATTGAGATTATTGTATTTACA
Protein sequenceShow/hide protein sequence
MGKNDGEQPLPSAIDSRPSGLVADGRCCCGCVSIRRLIGFRCIFILLLSVALFVSAVFWLPPFLHYADQKDLGLNPSYRGHDIVATFNVERSVSLLEDNFEQLRTDIFEE
FPIPSIKVDILSLEPLSGSNRTKVVFSLDPDTDDSEISSTYLSLIRSIITSLVTNQFLSITKSTFGEAYSFEVLKFPGGITIIPPQSAFLLQKVQILFNFTLNFSIHQIQ
VHFSELTSQLEAGLRLAPYEILYIKLWNAEGSTVTAPTIVQTSVLLEVGNTPSMRRLKQLAQTISGSNSSNLGLNNTEFGKVKQVRLSSILKHSLNGSDGNGPVRSPSPA
PTHQPHNHHHPPTHHHHHHHTSLTPAISPAPATEKGAPEYGSPAPERSAASPKRSYTAKPPGCQYRYKRKSGRKEGKQSHLTPFASPNISPDHSAASPSPQHQINPPAAP
VSPAPALTPLPNVIYAHVQPPSKSDSNHPEKSMTNPSVAPSPSGAVRCHMITQWGFTLFLFLARHM