; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cucsat.G7968 (gene) of Cucumber (B10) v3 genome

Gene IDCucsat.G7968
OrganismCucumis sativus L. var. sativus cv. B10 (Cucumber (B10) v3)
Descriptionhydroxyproline-rich glycoprotein family protein
Genome locationctg1556:1201401..1206672
RNA-Seq ExpressionCucsat.G7968
SyntenyCucsat.G7968
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0016021 - integral component of membrane (cellular component)
GO:0004175 - endopeptidase activity (molecular function)
GO:0008236 - serine-type peptidase activity (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KGN54878.2 hypothetical protein Csa_012907 [Cucumis sativus]0.0100Show/hide
Query:  MGKNDGEQPLPSAIDSRPSGLVADGRCCCGCVSIRRLIGFRCIFILLLSVALFVSAVFWLPPFLHYADQKDLDLNPSYRGHDIVATFNVERSVSLLEDNF
        MGKNDGEQPLPSAIDSRPSGLVADGRCCCGCVSIRRLIGFRCIFILLLSVALFVSAVFWLPPFLHYADQKDLDLNPSYRGHDIVATFNVERSVSLLEDNF
Subjt:  MGKNDGEQPLPSAIDSRPSGLVADGRCCCGCVSIRRLIGFRCIFILLLSVALFVSAVFWLPPFLHYADQKDLDLNPSYRGHDIVATFNVERSVSLLEDNF

Query:  DQLRTDIFEEFPIPSIKVNILSLEPLSGSNRTKVVFSLDPDTDDSEISSTYLSLIRSIITSLVTNQFLSITKSTFGEAYSFEVLKFPGGITIIPPQSAFL
        DQLRTDIFEEFPIPSIKVNILSLEPLSGSNRTKVVFSLDPDTDDSEISSTYLSLIRSIITSLVTNQFLSITKSTFGEAYSFEVLKFPGGITIIPPQSAFL
Subjt:  DQLRTDIFEEFPIPSIKVNILSLEPLSGSNRTKVVFSLDPDTDDSEISSTYLSLIRSIITSLVTNQFLSITKSTFGEAYSFEVLKFPGGITIIPPQSAFL

Query:  LQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNAEGSTVTDPTIVQTSVLLEVGNTPSMRRLKQLAQTISGSNSSNLGLNNTEFG
        LQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNAEGSTVTDPTIVQTSVLLEVGNTPSMRRLKQLAQTISGSNSSNLGLNNTEFG
Subjt:  LQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNAEGSTVTDPTIVQTSVLLEVGNTPSMRRLKQLAQTISGSNSSNLGLNNTEFG

Query:  KVKQVRLSSILKHSLNGSDGNGPVRSPSPAPTPQPHNQHHPPTHHHHHHHTPLTPAISPAPATEKGAPEYGSPAPERNAASPKRSYTAKPPGCQYRYKRK
        KVKQVRLSSILKHSLNGSDGNGPVRSPSPAPTPQPHNQHHPPTHHHHHHHTPLTPAISPAPATEKGAPEYGSPAPERNAASPKRSYTAKPPGCQYRYKRK
Subjt:  KVKQVRLSSILKHSLNGSDGNGPVRSPSPAPTPQPHNQHHPPTHHHHHHHTPLTPAISPAPATEKGAPEYGSPAPERNAASPKRSYTAKPPGCQYRYKRK

Query:  SGRKEGKQSHLTPLASPNISPDHSAASPSPQHQINPPAAPVSPAPALTPLPNVIYAHVQPPSKSDSNHPANPSIAPSPSGADRCHMITQWGFTLFLILAC
        SGRKEGKQSHLTPLASPNISPDHSAASPSPQHQINPPAAPVSPAPALTPLPNVIYAHVQPPSKSDSNHPANPSIAPSPSGADRCHMITQWGFTLFLILAC
Subjt:  SGRKEGKQSHLTPLASPNISPDHSAASPSPQHQINPPAAPVSPAPALTPLPNVIYAHVQPPSKSDSNHPANPSIAPSPSGADRCHMITQWGFTLFLILAC

Query:  HM
        HM
Subjt:  HM

XP_004144318.1 uncharacterized protein LOC101216010 isoform X1 [Cucumis sativus]0.0100Show/hide
Query:  MGKNDGEQPLPSAIDSRPSGLVADGRCCCGCVSIRRLIGFRCIFILLLSVALFVSAVFWLPPFLHYADQKDLDLNPSYRGHDIVATFNVERSVSLLEDNF
        MGKNDGEQPLPSAIDSRPSGLVADGRCCCGCVSIRRLIGFRCIFILLLSVALFVSAVFWLPPFLHYADQKDLDLNPSYRGHDIVATFNVERSVSLLEDNF
Subjt:  MGKNDGEQPLPSAIDSRPSGLVADGRCCCGCVSIRRLIGFRCIFILLLSVALFVSAVFWLPPFLHYADQKDLDLNPSYRGHDIVATFNVERSVSLLEDNF

Query:  DQLRTDIFEEFPIPSIKVNILSLEPLSGSNRTKVVFSLDPDTDDSEISSTYLSLIRSIITSLVTNQFLSITKSTFGEAYSFEVLKFPGGITIIPPQSAFL
        DQLRTDIFEEFPIPSIKVNILSLEPLSGSNRTKVVFSLDPDTDDSEISSTYLSLIRSIITSLVTNQFLSITKSTFGEAYSFEVLKFPGGITIIPPQSAFL
Subjt:  DQLRTDIFEEFPIPSIKVNILSLEPLSGSNRTKVVFSLDPDTDDSEISSTYLSLIRSIITSLVTNQFLSITKSTFGEAYSFEVLKFPGGITIIPPQSAFL

Query:  LQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNAEGSTVTDPTIVQTSVLLEVGNTPSMRRLKQLAQTISGSNSSNLGLNNTEFG
        LQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNAEGSTVTDPTIVQTSVLLEVGNTPSMRRLKQLAQTISGSNSSNLGLNNTEFG
Subjt:  LQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNAEGSTVTDPTIVQTSVLLEVGNTPSMRRLKQLAQTISGSNSSNLGLNNTEFG

Query:  KVKQVRLSSILKHSLNGSDGNGPVRSPSPAPTPQPHNQHHPPTHHHHHHHTPLTPAISPAPATEKGAPEYGSPAPERNAASPKRSYTAKPPGCQYRYKRK
        KVKQVRLSSILKHSLNGSDGNGPVRSPSPAPTPQPHNQHHPPTHHHHHHHTPLTPAISPAPATEKGAPEYGSPAPERNAASPKRSYTAKPPGCQYRYKRK
Subjt:  KVKQVRLSSILKHSLNGSDGNGPVRSPSPAPTPQPHNQHHPPTHHHHHHHTPLTPAISPAPATEKGAPEYGSPAPERNAASPKRSYTAKPPGCQYRYKRK

Query:  SGRKEGKQSHLTPLASPNISPDHSAASPSPQHQINPPAAPVSPAPALTPLPNVIYAHVQPPSKSDSNHPANPSIAPSPSGADRCHMITQWGFTLFLILAC
        SGRKEGKQSHLTPLASPNISPDHSAASPSPQHQINPPAAPVSPAPALTPLPNVIYAHVQPPSKSDSNHPANPSIAPSPSGADRCHMITQWGFTLFLILAC
Subjt:  SGRKEGKQSHLTPLASPNISPDHSAASPSPQHQINPPAAPVSPAPALTPLPNVIYAHVQPPSKSDSNHPANPSIAPSPSGADRCHMITQWGFTLFLILAC

Query:  HM
        HM
Subjt:  HM

XP_008455751.1 PREDICTED: uncharacterized protein LOC103495852 [Cucumis melo]0.096.41Show/hide
Query:  MGKNDGEQPLPSAIDSRPSGLVADGRCCCGCVSIRRLIGFRCIFILLLSVALFVSAVFWLPPFLHYADQKDLDLNPSYRGHDIVATFNVERSVSLLEDNF
        MGKNDGEQPLPSAIDSRPSGLVADGRCC GCVSIRRLIGFRCIFILLLSVALFVSAV WLPPF+HYADQKDL LNPSYRGHDIVATFNVERSVSLLEDNF
Subjt:  MGKNDGEQPLPSAIDSRPSGLVADGRCCCGCVSIRRLIGFRCIFILLLSVALFVSAVFWLPPFLHYADQKDLDLNPSYRGHDIVATFNVERSVSLLEDNF

Query:  DQLRTDIFEEFPIPSIKVNILSLEPLSGSNRTKVVFSLDPDTDDSEISSTYLSLIRSIITSLVTNQFLSITKSTFGEAYSFEVLKFPGGITIIPPQSAFL
        DQLRTDIFEEFPIPSIKVNILSLEPLSGSNRTKVVFS+DPDTDDSEISSTYLSLIRSIITSLVTNQFLSITKSTFGEAYSFEVLKFPGGITIIPPQSAFL
Subjt:  DQLRTDIFEEFPIPSIKVNILSLEPLSGSNRTKVVFSLDPDTDDSEISSTYLSLIRSIITSLVTNQFLSITKSTFGEAYSFEVLKFPGGITIIPPQSAFL

Query:  LQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNAEGSTVTDPTIVQTSVLLEVGNTPSMRRLKQLAQTISGSNSSNLGLNNTEFG
        LQKVQILFNFTLNFSIHQIQVHFSELTSQL+AGLRLAPYEILYIKLWNAEGSTVT PTIVQTSVLLEVGNTPSMRRLKQLAQTISGSNSSNLGLNN EFG
Subjt:  LQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNAEGSTVTDPTIVQTSVLLEVGNTPSMRRLKQLAQTISGSNSSNLGLNNTEFG

Query:  KVKQVRLSSILKHSLNGSDGNGPVRSPSPAPTPQPHNQHHPPTHHHHHHHTPLTPAISPAPATEKGAPEYGSPAPERNAASPKRSYTAKPPGCQYRYKRK
        KVKQVRLSSILKHSLNGS+GNGPVRSPSPAPTPQPHN HHPPTHHHHHHHTPL  AISPAPATEKGAPEYGSPAPER+AASP+RSYTA+PPGCQYRYKRK
Subjt:  KVKQVRLSSILKHSLNGSDGNGPVRSPSPAPTPQPHNQHHPPTHHHHHHHTPLTPAISPAPATEKGAPEYGSPAPERNAASPKRSYTAKPPGCQYRYKRK

Query:  SGRKEGKQSHLTPLASPNISPDHSAASPSPQHQINPPAAPVSPAPALTPLPNVIYAHVQPPSKSDSNHPANPSIAPSPSGADRCHMITQWGFTLFLILAC
        SGRKEGKQSHLTPLASPNISPDHSAASPSPQHQINPPAAPVSPAPALTPLPNVIYAHVQPPSKSDSN PANPS+APSPSGADRCHMITQWGFTLFLILA 
Subjt:  SGRKEGKQSHLTPLASPNISPDHSAASPSPQHQINPPAAPVSPAPALTPLPNVIYAHVQPPSKSDSNHPANPSIAPSPSGADRCHMITQWGFTLFLILAC

Query:  HM
        HM
Subjt:  HM

XP_031740216.1 uncharacterized protein LOC101216010 isoform X2 [Cucumis sativus]0.0100Show/hide
Query:  MGKNDGEQPLPSAIDSRPSGLVADGRCCCGCVSIRRLIGFRCIFILLLSVALFVSAVFWLPPFLHYADQKDLDLNPSYRGHDIVATFNVERSVSLLEDNF
        MGKNDGEQPLPSAIDSRPSGLVADGRCCCGCVSIRRLIGFRCIFILLLSVALFVSAVFWLPPFLHYADQKDLDLNPSYRGHDIVATFNVERSVSLLEDNF
Subjt:  MGKNDGEQPLPSAIDSRPSGLVADGRCCCGCVSIRRLIGFRCIFILLLSVALFVSAVFWLPPFLHYADQKDLDLNPSYRGHDIVATFNVERSVSLLEDNF

Query:  DQLRTDIFEEFPIPSIKVNILSLEPLSGSNRTKVVFSLDPDTDDSEISSTYLSLIRSIITSLVTNQFLSITKSTFGEAYSFEVLKFPGGITIIPPQSAFL
        DQLRTDIFEEFPIPSIKVNILSLEPLSGSNRTKVVFSLDPDTDDSEISSTYLSLIRSIITSLVTNQFLSITKSTFGEAYSFEVLKFPGGITIIPPQSAFL
Subjt:  DQLRTDIFEEFPIPSIKVNILSLEPLSGSNRTKVVFSLDPDTDDSEISSTYLSLIRSIITSLVTNQFLSITKSTFGEAYSFEVLKFPGGITIIPPQSAFL

Query:  LQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNAEGSTVTDPTIVQTSVLLEVGNTPSMRRLKQLAQTISGSNSSNLGLNNTEFG
        LQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNAEGSTVTDPTIVQTSVLLEVGNTPSMRRLKQLAQTISGSNSSNLGLNNTEFG
Subjt:  LQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNAEGSTVTDPTIVQTSVLLEVGNTPSMRRLKQLAQTISGSNSSNLGLNNTEFG

Query:  KVKQVRLSSILKHSLNGSDGNGPVRSPSPAPTPQPHNQHHPPTHHHHHHHTPLTPAISPAPATEKGAPEYGSPAPERNAASPKRSYTAKPPGCQYRYKRK
        KVKQVRLSSILKHSLNGSDGNGPVRSPSPAPTPQPHNQHHPPTHHHHHHHTPLTPAISPAPATEKGAPEYGSPAPERNAASPKRSYTAKPPGCQYRYKRK
Subjt:  KVKQVRLSSILKHSLNGSDGNGPVRSPSPAPTPQPHNQHHPPTHHHHHHHTPLTPAISPAPATEKGAPEYGSPAPERNAASPKRSYTAKPPGCQYRYKRK

Query:  SGRKEGKQSHLTPLASPNISPDHSAASPSPQHQINPPAAPVSPAPALTPLPNVIYAHVQPPSKSDSNHPANPSIAPSP
        SGRKEGKQSHLTPLASPNISPDHSAASPSPQHQINPPAAPVSPAPALTPLPNVIYAHVQPPSKSDSNHPANPSIAPSP
Subjt:  SGRKEGKQSHLTPLASPNISPDHSAASPSPQHQINPPAAPVSPAPALTPLPNVIYAHVQPPSKSDSNHPANPSIAPSP

XP_038882638.1 uncharacterized protein LOC120073837 [Benincasa hispida]0.090.78Show/hide
Query:  MGKNDGEQPLPSAIDSRPSGLVADGRCCCGCVSIRRLIGFRCIFILLLSVALFVSAVFWLPPFLHYADQKDLDLNPSYRGHDIVATFNVERSVSLLEDNF
        MGKNDGEQPLPSAIDSRPSG VADGRCCCGCVSIRRLIGFRCIFILLLSVALFVSAVFWLPPFLHYADQKDL LNPSYRGHDIVATFNVER VSLLEDN 
Subjt:  MGKNDGEQPLPSAIDSRPSGLVADGRCCCGCVSIRRLIGFRCIFILLLSVALFVSAVFWLPPFLHYADQKDLDLNPSYRGHDIVATFNVERSVSLLEDNF

Query:  DQLRTDIFEEFPIPSIKVNILSLEPLSGSNRTKVVFSLDPDTDDSEISSTYLSLIRSIITSLVTNQFLSITKSTFGEAYSFEVLKFPGGITIIPPQSAFL
        +QLRTDIFEEF IPSIKV+ILSLE L GSNRTKVVFSLDPDTD+SEISSTYLSLIRS I SLVTNQFL ITKS FGEA+SFEVLKFPGGITIIPPQSAFL
Subjt:  DQLRTDIFEEFPIPSIKVNILSLEPLSGSNRTKVVFSLDPDTDDSEISSTYLSLIRSIITSLVTNQFLSITKSTFGEAYSFEVLKFPGGITIIPPQSAFL

Query:  LQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNAEGSTVTDPTIVQTSVLLEVGNTPSMRRLKQLAQTISGSNSSNLGLNNTEFG
        LQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILY+KLWNAEGSTVT PTIVQ+SVLLEVGNTPSMRRLKQLAQTISGSNSSNLGLNNTEFG
Subjt:  LQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNAEGSTVTDPTIVQTSVLLEVGNTPSMRRLKQLAQTISGSNSSNLGLNNTEFG

Query:  KVKQVRLSSILKHSLNGSDGNGPVRSPSPAPTPQPHNQHHPPTHHHHHHHTPLTPAISPAPATEKGAPEYGSPAPERNAASPKRSYTAKPPGCQYRYKRK
        KVKQVRLSSILKHSLNGS+GNGP RSPSPAP PQPHN   PPTHHHHHH T LTPAISPAPATEKGAPEYGSPAPER+ ASPKRSYTAKPPGCQY  KRK
Subjt:  KVKQVRLSSILKHSLNGSDGNGPVRSPSPAPTPQPHNQHHPPTHHHHHHHTPLTPAISPAPATEKGAPEYGSPAPERNAASPKRSYTAKPPGCQYRYKRK

Query:  SGRKEGKQSHLTPLASPNISPDHSAASPSP--QHQINPPAAPVSPAPALTPLPNVIYAHVQPPSKSDSNHP----ANPSIAPSPS--GADRCHMITQWGF
        SGRKEGKQSHLTPLASPN+SPDHSAASPSP  QH++NPPAAP+ PAPALTPLPNVIYAHVQPPSKS+SNHP     NPS APSPS  GADRC MITQWGF
Subjt:  SGRKEGKQSHLTPLASPNISPDHSAASPSP--QHQINPPAAPVSPAPALTPLPNVIYAHVQPPSKSDSNHP----ANPSIAPSPS--GADRCHMITQWGF

Query:  TLFLILACHM
        TLFLILACHM
Subjt:  TLFLILACHM

TrEMBL top hitse value%identityAlignment
A0A0A0KYS3 Uncharacterized protein0.0100Show/hide
Query:  MGKNDGEQPLPSAIDSRPSGLVADGRCCCGCVSIRRLIGFRCIFILLLSVALFVSAVFWLPPFLHYADQKDLDLNPSYRGHDIVATFNVERSVSLLEDNF
        MGKNDGEQPLPSAIDSRPSGLVADGRCCCGCVSIRRLIGFRCIFILLLSVALFVSAVFWLPPFLHYADQKDLDLNPSYRGHDIVATFNVERSVSLLEDNF
Subjt:  MGKNDGEQPLPSAIDSRPSGLVADGRCCCGCVSIRRLIGFRCIFILLLSVALFVSAVFWLPPFLHYADQKDLDLNPSYRGHDIVATFNVERSVSLLEDNF

Query:  DQLRTDIFEEFPIPSIKVNILSLEPLSGSNRTKVVFSLDPDTDDSEISSTYLSLIRSIITSLVTNQFLSITKSTFGEAYSFEVLKFPGGITIIPPQSAFL
        DQLRTDIFEEFPIPSIKVNILSLEPLSGSNRTKVVFSLDPDTDDSEISSTYLSLIRSIITSLVTNQFLSITKSTFGEAYSFEVLKFPGGITIIPPQSAFL
Subjt:  DQLRTDIFEEFPIPSIKVNILSLEPLSGSNRTKVVFSLDPDTDDSEISSTYLSLIRSIITSLVTNQFLSITKSTFGEAYSFEVLKFPGGITIIPPQSAFL

Query:  LQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNAEGSTVTDPTIVQTSVLLEVGNTPSMRRLKQLAQTISGSNSSNLGLNNTEFG
        LQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNAEGSTVTDPTIVQTSVLLEVGNTPSMRRLKQLAQTISGSNSSNLGLNNTEFG
Subjt:  LQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNAEGSTVTDPTIVQTSVLLEVGNTPSMRRLKQLAQTISGSNSSNLGLNNTEFG

Query:  KVKQVRLSSILKHSLNGSDGNGPVRSPSPAPTPQPHNQHHPPTHHHHHHHTPLTPAISPAPATEKGAPEYGSPAPERNAASPKRSYTAKPPGCQYRYKRK
        KVKQVRLSSILKHSLNGSDGNGPVRSPSPAPTPQPHNQHHPPTHHHHHHHTPLTPAISPAPATEKGAPEYGSPAPERNAASPKRSYTAKPPGCQYRYKRK
Subjt:  KVKQVRLSSILKHSLNGSDGNGPVRSPSPAPTPQPHNQHHPPTHHHHHHHTPLTPAISPAPATEKGAPEYGSPAPERNAASPKRSYTAKPPGCQYRYKRK

Query:  SGRKEGKQSHLTPLASPNISPDHSAASPSPQHQINPPAAPVSPAPALTPLPNVIYAHVQPPSKSDSNHPANPSIAPSPSGADRCHMITQWGFTLFLILAC
        SGRKEGKQSHLTPLASPNISPDHSAASPSPQHQINPPAAPVSPAPALTPLPNVIYAHVQPPSKSDSNHPANPSIAPSPSGADRCHMITQWGFTLFLILAC
Subjt:  SGRKEGKQSHLTPLASPNISPDHSAASPSPQHQINPPAAPVSPAPALTPLPNVIYAHVQPPSKSDSNHPANPSIAPSPSGADRCHMITQWGFTLFLILAC

Query:  HM
        HM
Subjt:  HM

A0A1S3C173 uncharacterized protein LOC1034958520.096.41Show/hide
Query:  MGKNDGEQPLPSAIDSRPSGLVADGRCCCGCVSIRRLIGFRCIFILLLSVALFVSAVFWLPPFLHYADQKDLDLNPSYRGHDIVATFNVERSVSLLEDNF
        MGKNDGEQPLPSAIDSRPSGLVADGRCC GCVSIRRLIGFRCIFILLLSVALFVSAV WLPPF+HYADQKDL LNPSYRGHDIVATFNVERSVSLLEDNF
Subjt:  MGKNDGEQPLPSAIDSRPSGLVADGRCCCGCVSIRRLIGFRCIFILLLSVALFVSAVFWLPPFLHYADQKDLDLNPSYRGHDIVATFNVERSVSLLEDNF

Query:  DQLRTDIFEEFPIPSIKVNILSLEPLSGSNRTKVVFSLDPDTDDSEISSTYLSLIRSIITSLVTNQFLSITKSTFGEAYSFEVLKFPGGITIIPPQSAFL
        DQLRTDIFEEFPIPSIKVNILSLEPLSGSNRTKVVFS+DPDTDDSEISSTYLSLIRSIITSLVTNQFLSITKSTFGEAYSFEVLKFPGGITIIPPQSAFL
Subjt:  DQLRTDIFEEFPIPSIKVNILSLEPLSGSNRTKVVFSLDPDTDDSEISSTYLSLIRSIITSLVTNQFLSITKSTFGEAYSFEVLKFPGGITIIPPQSAFL

Query:  LQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNAEGSTVTDPTIVQTSVLLEVGNTPSMRRLKQLAQTISGSNSSNLGLNNTEFG
        LQKVQILFNFTLNFSIHQIQVHFSELTSQL+AGLRLAPYEILYIKLWNAEGSTVT PTIVQTSVLLEVGNTPSMRRLKQLAQTISGSNSSNLGLNN EFG
Subjt:  LQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNAEGSTVTDPTIVQTSVLLEVGNTPSMRRLKQLAQTISGSNSSNLGLNNTEFG

Query:  KVKQVRLSSILKHSLNGSDGNGPVRSPSPAPTPQPHNQHHPPTHHHHHHHTPLTPAISPAPATEKGAPEYGSPAPERNAASPKRSYTAKPPGCQYRYKRK
        KVKQVRLSSILKHSLNGS+GNGPVRSPSPAPTPQPHN HHPPTHHHHHHHTPL  AISPAPATEKGAPEYGSPAPER+AASP+RSYTA+PPGCQYRYKRK
Subjt:  KVKQVRLSSILKHSLNGSDGNGPVRSPSPAPTPQPHNQHHPPTHHHHHHHTPLTPAISPAPATEKGAPEYGSPAPERNAASPKRSYTAKPPGCQYRYKRK

Query:  SGRKEGKQSHLTPLASPNISPDHSAASPSPQHQINPPAAPVSPAPALTPLPNVIYAHVQPPSKSDSNHPANPSIAPSPSGADRCHMITQWGFTLFLILAC
        SGRKEGKQSHLTPLASPNISPDHSAASPSPQHQINPPAAPVSPAPALTPLPNVIYAHVQPPSKSDSN PANPS+APSPSGADRCHMITQWGFTLFLILA 
Subjt:  SGRKEGKQSHLTPLASPNISPDHSAASPSPQHQINPPAAPVSPAPALTPLPNVIYAHVQPPSKSDSNHPANPSIAPSPSGADRCHMITQWGFTLFLILAC

Query:  HM
        HM
Subjt:  HM

A0A5A7SNH7 Zinc finger family protein, putative isoform 13.82e-28896.46Show/hide
Query:  RGHDIVATFNVERSVSLLEDNFDQLRTDIFEEFPIPSIKVNILSLEPLSGSNRTKVVFSLDPDTDDSEISSTYLSLIRSIITSLVTNQFLSITKSTFGEA
        +GHDIVATFNVERSVSLLEDNFDQLRTDIFEEFPIPSIKVNILSLEPLSGSNRTKVVFS+DPDTDDSEISSTYLSLIRSIITSLVTNQFLSITKSTFGEA
Subjt:  RGHDIVATFNVERSVSLLEDNFDQLRTDIFEEFPIPSIKVNILSLEPLSGSNRTKVVFSLDPDTDDSEISSTYLSLIRSIITSLVTNQFLSITKSTFGEA

Query:  YSFEVLKFPGGITIIPPQSAFLLQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNAEGSTVTDPTIVQTSVLLEVGNTPSMRRLK
        YSFEVLKFPGGITIIPPQSAFLLQKVQILFNFTLNFSIHQIQVHFSELTSQL+AGLRLAPYEILYIKLWNAEGSTVT PTIVQTSVLLEVGNTPSMRRLK
Subjt:  YSFEVLKFPGGITIIPPQSAFLLQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNAEGSTVTDPTIVQTSVLLEVGNTPSMRRLK

Query:  QLAQTISGSNSSNLGLNNTEFGKVKQVRLSSILKHSLNGSDGNGPVRSPSPAPTPQPHNQHHPPTHHHHHHHTPLTPAISPAPATEKGAPEYGSPAPERN
        QLAQTISGSNSSNLGLNN EFGKVKQVRLSSILKHSLNGS+GNGPVRSPSPAPTPQPHN HHPPTHHHHHHHTPL  AISPAPATEKGAPEYGSPAPER+
Subjt:  QLAQTISGSNSSNLGLNNTEFGKVKQVRLSSILKHSLNGSDGNGPVRSPSPAPTPQPHNQHHPPTHHHHHHHTPLTPAISPAPATEKGAPEYGSPAPERN

Query:  AASPKRSYTAKPPGCQYRYKRKSGRKEGKQSHLTPLASPNISPDHSAASPSPQHQINPPAAPVSPAPALTPLPNVIYAHVQPPSKSDSNHPANPSIAPSP
        AASP+RSYTA+PPGCQYRYKRKSGRKEGKQSHLTPLASPNISPDHSAASPSPQHQINPPAAPVSPAPALTPLPNVIYAHVQPPSKSDSN PANPS+APSP
Subjt:  AASPKRSYTAKPPGCQYRYKRKSGRKEGKQSHLTPLASPNISPDHSAASPSPQHQINPPAAPVSPAPALTPLPNVIYAHVQPPSKSDSNHPANPSIAPSP

Query:  SGADRCHMITQWGFTLFLILACHM
        SGADRCHMITQWGFTLFLILA HM
Subjt:  SGADRCHMITQWGFTLFLILACHM

A0A6J1EB56 uncharacterized protein LOC111432513 isoform X31.91e-27681.41Show/hide
Query:  MGKNDGEQPLPSAIDSRPSGLVADGRCCCGCVSIRRLIGFRCIFILLLSVALFVSAVFWLPPFLHYADQKDLDLNPSYRGHDIVATFNVERSVSLLEDNF
        MGKNDGE P PSA+ S PS     GRCC GCVSIRRLIGFRCIFILLLSVALFVSAVFWLPPFLHY+DQKDL LNPSYRGHDIVATF VER VSLL+DN 
Subjt:  MGKNDGEQPLPSAIDSRPSGLVADGRCCCGCVSIRRLIGFRCIFILLLSVALFVSAVFWLPPFLHYADQKDLDLNPSYRGHDIVATFNVERSVSLLEDNF

Query:  DQLRTDIFEEFPIPSIKVNILSLEPLSGSNRTKVVFSLDPDTDDSEISSTYLSLIRSIITSLVTNQ-FLSITKSTFGEAYSFEVLKFPGGITIIPPQSAF
        ++LRTDIFEEFPIPSIKV+ILSL  LSGSNRTKVVF +DPDTDD EI STYLSLIRS   S+VTNQ FL ITKS FGEA+SFEVLKFPGGITIIPPQSAF
Subjt:  DQLRTDIFEEFPIPSIKVNILSLEPLSGSNRTKVVFSLDPDTDDSEISSTYLSLIRSIITSLVTNQ-FLSITKSTFGEAYSFEVLKFPGGITIIPPQSAF

Query:  LLQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNAEGSTVTDPTIVQTSVLLEVGNTPSMRRLKQLAQTISGSNSSNLGLNNTEF
        LLQKVQILFNFTLNFSIHQIQVHFSELTSQL+AGLRLAPYEILYIKLWNAEGSTVT PTIVQ+SVLLEVGNTPSM+RLKQLAQTIS SNSSNLGLNNTEF
Subjt:  LLQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNAEGSTVTDPTIVQTSVLLEVGNTPSMRRLKQLAQTISGSNSSNLGLNNTEF

Query:  GKVKQVRLSSILKHSLNGSDGNGPVRSPSPAPTPQPHNQHHPPTHHHHHHHTPLTPAISPAPATEKGAPEYGSPAPERNAASPKRSYTAKPPGCQYRYKR
        GKVKQVRLSSILKHSLNG DG GP+RSPSPAPTPQPHN HHPP+HHHHHHH PLTP ISPAPA E GAPEYG  AP ++AASPKRSY AKPPGCQY  KR
Subjt:  GKVKQVRLSSILKHSLNGSDGNGPVRSPSPAPTPQPHNQHHPPTHHHHHHHTPLTPAISPAPATEKGAPEYGSPAPERNAASPKRSYTAKPPGCQYRYKR

Query:  KSGRKEGKQSHLTPLASPNISPDHSAASPSPQHQINPPAAPVSPAPALTPLPNVIYAHVQPPSKSDSNHPAN--------PSIAPSPSGADRCHMITQWG
        KSGRKEGKQ HL+PLASP+ISP HSAASPS QH        VSP  A TPLP+VIYAHVQPPSKSDSNHP          PS +PSPS A    MIT+WG
Subjt:  KSGRKEGKQSHLTPLASPNISPDHSAASPSPQHQINPPAAPVSPAPALTPLPNVIYAHVQPPSKSDSNHPAN--------PSIAPSPSGADRCHMITQWG

Query:  FTLFLILACHM
        FTL LI+A +M
Subjt:  FTLFLILACHM

A0A6J1EEJ8 uncharacterized protein LOC111432513 isoform X44.36e-27781.93Show/hide
Query:  MGKNDGEQPLPSAIDSRPSGLVADGRCCCGCVSIRRLIGFRCIFILLLSVALFVSAVFWLPPFLHYADQKDLDLNPSYRGHDIVATFNVERSVSLLEDNF
        MGKNDGE P PSA+ S PS     GRCC GCVSIRRLIGFRCIFILLLSVALFVSAVFWLPPFLHY+DQKDL LNPSYRGHDIVATF VER VSLL+DN 
Subjt:  MGKNDGEQPLPSAIDSRPSGLVADGRCCCGCVSIRRLIGFRCIFILLLSVALFVSAVFWLPPFLHYADQKDLDLNPSYRGHDIVATFNVERSVSLLEDNF

Query:  DQLRTDIFEEFPIPSIKVNILSLEPLSGSNRTKVVFSLDPDTDDSEISSTYLSLIRSIITSLVTNQ-FLSITKSTFGEAYSFEVLKFPGGITIIPPQSAF
        ++LRTDIFEEFPIPSIKV+ILSL  LSGSNRTKVVF +DPDTDD EI STYLSLIRS   S+VTNQ FL ITKS FGEA+SFEVLKFPGGITIIPPQSAF
Subjt:  DQLRTDIFEEFPIPSIKVNILSLEPLSGSNRTKVVFSLDPDTDDSEISSTYLSLIRSIITSLVTNQ-FLSITKSTFGEAYSFEVLKFPGGITIIPPQSAF

Query:  LLQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNAEGSTVTDPTIVQTSVLLEVGNTPSMRRLKQLAQTISGSNSSNLGLNNTEF
        LLQKVQILFNFTLNFSIHQIQVHFSELTSQL+AGLRLAPYEILYIKLWNAEGSTVT PTIVQ+SVLLEVGNTPSM+RLKQLAQTIS SNSSNLGLNNTEF
Subjt:  LLQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNAEGSTVTDPTIVQTSVLLEVGNTPSMRRLKQLAQTISGSNSSNLGLNNTEF

Query:  GKVKQVRLSSILKHSLNGSDGNGPVRSPSPAPTPQPHNQHHPPTHHHHHHHTPLTPAISPAPATEKGAPEYGSPAPERNAASPKRSYTAKPPGCQYRYKR
        GKVKQVRLSSILKHSLNG DG GP+RSPSPAPTPQPHN HHPP+HHHHHHH PLTP ISPAPA E GAPEYG  AP ++AASPKRSY AKPPGCQY  KR
Subjt:  GKVKQVRLSSILKHSLNGSDGNGPVRSPSPAPTPQPHNQHHPPTHHHHHHHTPLTPAISPAPATEKGAPEYGSPAPERNAASPKRSYTAKPPGCQYRYKR

Query:  KSGRKEGKQSHLTPLASPNISPDHSAASPSPQHQINPPAAPVSPAPALTPLPNVIYAHVQPPSKSDSNHP----ANPSIAPSPSGADRCH--MITQWGFT
        KSGRKEGKQ HL+PLASP+ISP HSAASPS QH        VSP  A TPLP+VIYAHVQPPSKSDSNHP     +PSI PSPS +   H  MIT+WGFT
Subjt:  KSGRKEGKQSHLTPLASPNISPDHSAASPSPQHQINPPAAPVSPAPALTPLPNVIYAHVQPPSKSDSNHP----ANPSIAPSPSGADRCH--MITQWGFT

Query:  LFLILACHM
        L LI+A +M
Subjt:  LFLILACHM

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G10790.1 BEST Arabidopsis thaliana protein match is: hydroxyproline-rich glycoprotein family protein (TAIR:AT3G56590.2)6.3e-3535.13Show/hide
Query:  ADGRCCCGCVSIRRLIGFRCIFILLLSVALFVSAVFWLPPFLHYADQK---DLDLNPSYRGHDIVATFNVERSVSLLEDNFDQLRTDIFEEFPIP-SIKV
        + GR C    S  RL+G RC+ +L+LS A+ +SA+FWL P    ++ K    + LN S     + A+F +++ VS +  +  ++  DI     +  + KV
Subjt:  ADGRCCCGCVSIRRLIGFRCIFILLLSVALFVSAVFWLPPFLHYADQK---DLDLNPSYRGHDIVATFNVERSVSLLEDNFDQLRTDIFEEFPIP-SIKV

Query:  NILSLEPLSGSNRTKVVFSLDPDTDDSEISSTYLSLIRSIITSLVTNQF-LSITKSTFGEAYSFEVLKFPGGITIIPPQSAFLLQKVQILFNFTLNFSIH
         +LSL     SN T V F++ P   D EIS   LSL+RS    L   +  L +T S FG+  SF+VLKFPGGIT+ P + A +     +LF+ T+  SI 
Subjt:  NILSLEPLSGSNRTKVVFSLDPDTDDSEISSTYLSLIRSIITSLVTNQF-LSITKSTFGEAYSFEVLKFPGGITIIPPQSAFLLQKVQILFNFTLNFSIH

Query:  QIQVHFSELTSQLEAGLRLAPYEILYIKLWNAEGSTVTDPTIVQTSVLLEVGNTPSMRRLKQLAQTISGSNSSNLGLNNTEFGKVKQVRLSSILKHSLNG
         +Q     L    E  L L PYE ++ +L N +GST++ P   Q  V   +      +RL    Q I  S + NLGL+   FG+VK +  S+ L   +  
Subjt:  QIQVHFSELTSQLEAGLRLAPYEILYIKLWNAEGSTVTDPTIVQTSVLLEVGNTPSMRRLKQLAQTISGSNSSNLGLNNTEFGKVKQVRLSSILKHSLNG

Query:  SDGNGPVRSPSPAPTP
        SD        +PAPTP
Subjt:  SDGNGPVRSPSPAPTP

AT3G10810.1 zinc finger (C3HC4-type RING finger) family protein1.5e-9746.85Show/hide
Query:  MGKNDGEQPLPSA-IDSRPSGLVADGRCCCGCVSIRRLIGFRCIFILLLSVALFVSAVFWLPPFLHYADQKDLDLNPSYRGHDIVATFNVERSVSLLEDN
        MGK + +  L  A  ++     V + RC C C  I   +GF+C+F+LLLSVALF+SA+F L PF    D++D +L+P +RGH IVA+F++ RS S L +N
Subjt:  MGKNDGEQPLPSA-IDSRPSGLVADGRCCCGCVSIRRLIGFRCIFILLLSVALFVSAVFWLPPFLHYADQKDLDLNPSYRGHDIVATFNVERSVSLLEDN

Query:  FDQLRTDIFEEFPIPSIKVNILSLEPLSGSNRTKVVFSLDPDTDDSEISSTYLSLIRSIITSLVTNQ-FLSITKSTFGEAYSFEVLKFPGGITIIPPQSA
          QL+ DIF+E    SIKV IL++EP    N TKVVF +DPDT   EI    LS I+ +  S++ NQ  L +TKS FGE + FEVLKFPGGIT+IPPQSA
Subjt:  FDQLRTDIFEEFPIPSIKVNILSLEPLSGSNRTKVVFSLDPDTDDSEISSTYLSLIRSIITSLVTNQ-FLSITKSTFGEAYSFEVLKFPGGITIIPPQSA

Query:  FLLQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNAEGSTVTDPTIVQTSVLLEVGNTPSMRRLKQLAQTISGSNSSNLGLNNTE
        F LQK +I+FNFTLN+SIHQIQ++F+ L SQL+ GL LAPYE LY+ L N+EGSTV+ PT V +SVLL VG + S  RLKQL  TI+GS S NLGLNNT 
Subjt:  FLLQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNAEGSTVTDPTIVQTSVLLEVGNTPSMRRLKQLAQTISGSNSSNLGLNNTE

Query:  FGKVKQVRLSSILKHSLNGSDGNGPVRSPSPAPTPQP----------HNQHHPPTHHHHHHHTPLTPAISPAPATEKGAPEYGSPAPERNAASPKRSYTA
        FGKVKQVRLSS L +S + S      +SPSP+P+P            H+ HH   +HHHHHH  L+P ++P            SPAP R   S KR+ +A
Subjt:  FGKVKQVRLSSILKHSLNGSDGNGPVRSPSPAPTPQP----------HNQHHPPTHHHHHHHTPLTPAISPAPATEKGAPEYGSPAPERNAASPKRSYTA

Query:  KP---PGCQYRYKRKSGRKEGKQSHLTPLASPNISPDHSAASPSPQHQINPPAAPVSPA-----PALTPLPNVIYAH-VQPPSKSDSNHPANPSIAPSP-
         P   PG +  +K K       Q   TP  +P        ++ +P HQ++ P AP+S A     P   PLP+V++AH  QPP        AN    P P 
Subjt:  KP---PGCQYRYKRKSGRKEGKQSHLTPLASPNISPDHSAASPSPQHQINPPAAPVSPA-----PALTPLPNVIYAH-VQPPSKSDSNHPANPSIAPSP-

Query:  --SGADRCHMITQWGFTLFLILA
          S A        W   L LI+A
Subjt:  --SGADRCHMITQWGFTLFLILA

AT3G56590.1 hydroxyproline-rich glycoprotein family protein2.4e-9847.42Show/hide
Query:  MGKND-GEQPLP---SAIDSRPSGLVADGRCCCGCVSIRRLIGFRCIFILLLSVALFVSAVFWLPPFLHYADQKDLDLNPSYRGHDIVATFNVERSVSLL
        MGKN   EQ LP    A  +R +G      CCC C  I      RC+ IL  S A+F+SA+FWLPPFL +AD  DLDL+P ++ H IVA+F+V + +S +
Subjt:  MGKND-GEQPLP---SAIDSRPSGLVADGRCCCGCVSIRRLIGFRCIFILLLSVALFVSAVFWLPPFLHYADQKDLDLNPSYRGHDIVATFNVERSVSLL

Query:  EDNFDQLRTDIFEEFPIPSIKVNILSLEPLSGSNRTKVVFSLDPDTDDSEISSTYLSLIRSIITSLVTNQF-LSITKSTFGEAYSFEVLKFPGGITIIPP
        EDN  QL  DI +E   P  KV +L+LE L   NRT V+F++DP+ ++S+I +   SLI++   +LV  Q    +T+S FGE + FEVLKFPGGIT+IPP
Subjt:  EDNFDQLRTDIFEEFPIPSIKVNILSLEPLSGSNRTKVVFSLDPDTDDSEISSTYLSLIRSIITSLVTNQF-LSITKSTFGEAYSFEVLKFPGGITIIPP

Query:  QSAFLLQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNAEGSTVTDPTIVQTSVLLEVGNTPSMRRLKQLAQTISGSNSSNLGLN
        Q  F LQK Q+LFNFTLNFSI+QIQ +F EL SQL+ G+ LA YE LYI L N+ GSTV  PTIV +SVLL  G   S  RLKQLAQTI+ S+S NLGLN
Subjt:  QSAFLLQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNAEGSTVTDPTIVQTSVLLEVGNTPSMRRLKQLAQTISGSNSSNLGLN

Query:  NTEFGKVKQVRLSSILKHSLNGSDGNGPVRSPSPAPTPQPHNQHHPPTH-HHHHHHTPLTPAISPAPATEKGAPEYGSPAPERNAASPKRSYTAKPPGCQ
        +T FGKVKQVRLSSIL HS        P  S +P+P+PQP    +P  H HHHHHH  L P  S +P T+  AP   + AP +++  P R+     P C 
Subjt:  NTEFGKVKQVRLSSILKHSLNGSDGNGPVRSPSPAPTPQPHNQHHPPTH-HHHHHHTPLTPAISPAPATEKGAPEYGSPAPERNAASPKRSYTAKPPGCQ

Query:  YRYKRKSGRKEGKQSHLTPLASPNISPDHSAASPSPQHQINPPAAPVSPAPALTPLPNVIYAHVQPPSKSDSNHPANPSIAPSPS
        Y  +R  G          P  +P+ S  H    P+P    NP        P  +PLP+V++AH+ PPSKS          +PSP+
Subjt:  YRYKRKSGRKEGKQSHLTPLASPNISPDHSAASPSPQHQINPPAAPVSPAPALTPLPNVIYAHVQPPSKSDSNHPANPSIAPSPS

AT3G56590.2 hydroxyproline-rich glycoprotein family protein3.7e-9947.66Show/hide
Query:  MGKND-GEQPLP---SAIDSRPSGLVADGRCCCGCVSIRRLIGFRCIFILLLSVALFVSAVFWLPPFLHYADQKDLDLNPSYRGHDIVATFNVERSVSLL
        MGKN   EQ LP    A  +R +G      CCC C  I      RC+ IL  S A+F+SA+FWLPPFL +AD  DLDL+P ++ H IVA+F+V + +S +
Subjt:  MGKND-GEQPLP---SAIDSRPSGLVADGRCCCGCVSIRRLIGFRCIFILLLSVALFVSAVFWLPPFLHYADQKDLDLNPSYRGHDIVATFNVERSVSLL

Query:  EDNFDQLRTDIFEEFPIPSIKVNILSLEPLSGSNRTKVVFSLDPDTDDSEISSTYLSLIRSIITSLVTNQF-LSITKSTFGEAYSFEVLKFPGGITIIPP
        EDN  QL  DI +E   P  KV +L+LE L   NRT V+F++DP+ ++S+I +   SLI++   +LV  Q    +T+S FGE + FEVLKFPGGIT+IPP
Subjt:  EDNFDQLRTDIFEEFPIPSIKVNILSLEPLSGSNRTKVVFSLDPDTDDSEISSTYLSLIRSIITSLVTNQF-LSITKSTFGEAYSFEVLKFPGGITIIPP

Query:  QSAFLLQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNAEGSTVTDPTIVQTSVLLEVGNTPSMRRLKQLAQTISGSNSSNLGLN
        Q  F LQK Q+LFNFTLNFSI+QIQ +F EL SQL+ G+ LA YE LYI L N+ GSTV  PTIV +SVLL  G   S  RLKQLAQTI+ S+S NLGLN
Subjt:  QSAFLLQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNAEGSTVTDPTIVQTSVLLEVGNTPSMRRLKQLAQTISGSNSSNLGLN

Query:  NTEFGKVKQVRLSSILKHSLNGSDGNGPVRSPSPAPTPQPHNQHHPPTH-HHHHHHTPLTPAISPAPATEKGAPEYGSPAPERNAASPKRSYTAKPPGCQ
        +T FGKVKQVRLSSIL HS        P  S +P+P+PQP    +P  H HHHHHH  L P  S +P T+  AP   + AP +++  P R+     P C 
Subjt:  NTEFGKVKQVRLSSILKHSLNGSDGNGPVRSPSPAPTPQPHNQHHPPTH-HHHHHHTPLTPAISPAPATEKGAPEYGSPAPERNAASPKRSYTAKPPGCQ

Query:  YRYKRKSGRKEGKQSHLTPLASPNISPDHSAASPSPQHQINPPAAPVSPAPALTPLPNVIYAHVQPPSK----SDSNHPANPSIAPSPSGA
        Y  +R  G          P  +P+ S  H    P+P    NP        P  +PLP+V++AH+ PPSK    S+     +PS AP+PS A
Subjt:  YRYKRKSGRKEGKQSHLTPLASPNISPDHSAASPSPQHQINPPAAPVSPAPALTPLPNVIYAHVQPPSK----SDSNHPANPSIAPSPSGA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGGAAAAACGACGGAGAACAGCCACTGCCCTCCGCCATCGACTCCAGGCCCTCCGGTCTGGTTGCCGATGGTCGATGCTGTTGTGGGTGTGTTTCGATTCGAAGACT
CATTGGCTTCAGATGCATCTTCATTCTCCTATTATCCGTTGCCTTGTTCGTTTCTGCTGTTTTTTGGCTGCCCCCTTTTCTCCATTATGCAGATCAAAAGGATCTGGATC
TCAATCCGTCGTATCGAGGTCATGATATTGTTGCAACATTTAATGTTGAGAGATCGGTGTCTTTGCTGGAAGACAATTTTGATCAACTCCGAACCGACATTTTTGAAGAG
TTCCCTATACCTTCTATCAAAGTGAATATACTGTCTCTAGAACCGTTGTCTGGATCAAACCGTACAAAAGTTGTTTTCAGCCTCGATCCAGATACTGATGATTCGGAAAT
CTCGTCAACGTATCTAAGTTTAATCAGGTCAATCATTACAAGTCTAGTAACAAATCAGTTCCTCAGCATTACCAAATCCACATTCGGGGAGGCCTATTCGTTTGAAGTGC
TGAAATTCCCCGGAGGAATAACGATAATCCCGCCACAGAGTGCATTTCTTTTGCAAAAAGTGCAAATTCTTTTCAACTTTACATTGAACTTCTCTATTCATCAGATTCAA
GTACATTTCAGCGAACTGACCAGCCAACTGGAGGCGGGATTAAGACTAGCTCCATATGAGATTTTATATATTAAACTTTGGAATGCGGAAGGTTCAACCGTGACAGATCC
TACAATTGTCCAGACGTCTGTACTTCTTGAAGTTGGAAATACTCCGTCGATGCGACGGCTGAAGCAGCTAGCTCAAACAATCTCAGGTTCTAATTCTAGCAACCTCGGCC
TGAATAATACGGAGTTTGGTAAGGTGAAACAAGTTCGCCTTTCTTCTATTCTTAAACACTCCCTCAATGGCAGTGATGGGAATGGCCCCGTAAGGTCACCTTCTCCTGCT
CCTACACCCCAGCCCCATAACCAACATCATCCTCCAACTCACCACCACCACCACCATCACACCCCTCTAACCCCTGCAATTTCACCTGCTCCTGCAACCGAGAAGGGTGC
ACCAGAATATGGTTCGCCTGCCCCTGAAAGAAATGCAGCATCACCTAAGAGAAGTTATACGGCTAAGCCGCCTGGTTGTCAATATAGATACAAGAGGAAGTCTGGTAGGA
AAGAAGGAAAGCAATCTCATTTAACCCCGCTTGCGTCACCCAATATATCTCCTGATCATTCTGCTGCATCGCCATCACCACAACATCAAATCAACCCACCAGCAGCACCC
GTCTCACCAGCTCCGGCATTAACTCCATTGCCAAACGTCATTTACGCTCATGTTCAACCACCTTCCAAAAGCGACTCCAATCACCCCGCAAATCCATCAATTGCACCATC
TCCATCTGGCGCAGATCGTTGTCATATGATCACTCAATGGGGATTCACACTGTTTCTTATTCTCGCATGCCATATGTAA
mRNA sequenceShow/hide mRNA sequence
ATGGGGAAAAACGACGGAGAACAGCCACTGCCCTCCGCCATCGACTCCAGGCCCTCCGGTCTGGTTGCCGATGGTCGATGCTGTTGTGGGTGTGTTTCGATTCGAAGACT
CATTGGCTTCAGATGCATCTTCATTCTCCTATTATCCGTTGCCTTGTTCGTTTCTGCTGTTTTTTGGCTGCCCCCTTTTCTCCATTATGCAGATCAAAAGGATCTGGATC
TCAATCCGTCGTATCGAGGTCATGATATTGTTGCAACATTTAATGTTGAGAGATCGGTGTCTTTGCTGGAAGACAATTTTGATCAACTCCGAACCGACATTTTTGAAGAG
TTCCCTATACCTTCTATCAAAGTGAATATACTGTCTCTAGAACCGTTGTCTGGATCAAACCGTACAAAAGTTGTTTTCAGCCTCGATCCAGATACTGATGATTCGGAAAT
CTCGTCAACGTATCTAAGTTTAATCAGGTCAATCATTACAAGTCTAGTAACAAATCAGTTCCTCAGCATTACCAAATCCACATTCGGGGAGGCCTATTCGTTTGAAGTGC
TGAAATTCCCCGGAGGAATAACGATAATCCCGCCACAGAGTGCATTTCTTTTGCAAAAAGTGCAAATTCTTTTCAACTTTACATTGAACTTCTCTATTCATCAGATTCAA
GTACATTTCAGCGAACTGACCAGCCAACTGGAGGCGGGATTAAGACTAGCTCCATATGAGATTTTATATATTAAACTTTGGAATGCGGAAGGTTCAACCGTGACAGATCC
TACAATTGTCCAGACGTCTGTACTTCTTGAAGTTGGAAATACTCCGTCGATGCGACGGCTGAAGCAGCTAGCTCAAACAATCTCAGGTTCTAATTCTAGCAACCTCGGCC
TGAATAATACGGAGTTTGGTAAGGTGAAACAAGTTCGCCTTTCTTCTATTCTTAAACACTCCCTCAATGGCAGTGATGGGAATGGCCCCGTAAGGTCACCTTCTCCTGCT
CCTACACCCCAGCCCCATAACCAACATCATCCTCCAACTCACCACCACCACCACCATCACACCCCTCTAACCCCTGCAATTTCACCTGCTCCTGCAACCGAGAAGGGTGC
ACCAGAATATGGTTCGCCTGCCCCTGAAAGAAATGCAGCATCACCTAAGAGAAGTTATACGGCTAAGCCGCCTGGTTGTCAATATAGATACAAGAGGAAGTCTGGTAGGA
AAGAAGGAAAGCAATCTCATTTAACCCCGCTTGCGTCACCCAATATATCTCCTGATCATTCTGCTGCATCGCCATCACCACAACATCAAATCAACCCACCAGCAGCACCC
GTCTCACCAGCTCCGGCATTAACTCCATTGCCAAACGTCATTTACGCTCATGTTCAACCACCTTCCAAAAGCGACTCCAATCACCCCGCAAATCCATCAATTGCACCATC
TCCATCTGGCGCAGATCGTTGTCATATGATCACTCAATGGGGATTCACACTGTTTCTTATTCTCGCATGCCATATGTAA
Protein sequenceShow/hide protein sequence
MGKNDGEQPLPSAIDSRPSGLVADGRCCCGCVSIRRLIGFRCIFILLLSVALFVSAVFWLPPFLHYADQKDLDLNPSYRGHDIVATFNVERSVSLLEDNFDQLRTDIFEE
FPIPSIKVNILSLEPLSGSNRTKVVFSLDPDTDDSEISSTYLSLIRSIITSLVTNQFLSITKSTFGEAYSFEVLKFPGGITIIPPQSAFLLQKVQILFNFTLNFSIHQIQ
VHFSELTSQLEAGLRLAPYEILYIKLWNAEGSTVTDPTIVQTSVLLEVGNTPSMRRLKQLAQTISGSNSSNLGLNNTEFGKVKQVRLSSILKHSLNGSDGNGPVRSPSPA
PTPQPHNQHHPPTHHHHHHHTPLTPAISPAPATEKGAPEYGSPAPERNAASPKRSYTAKPPGCQYRYKRKSGRKEGKQSHLTPLASPNISPDHSAASPSPQHQINPPAAP
VSPAPALTPLPNVIYAHVQPPSKSDSNHPANPSIAPSPSGADRCHMITQWGFTLFLILACHM