CuGenDBv2

Clc04G06220 (gene) of Watermelon (cordophanus) v2 genome

Gene ID: Clc04G06220
Organism: Citrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
Description: Zinc finger family protein, putative isoform 1
Genome location: ClcChr04:19396464..19401357
RNA-Seq Expression: Clc04G06220
Synteny: Clc04G06220
Gene Ontology terms:
GO:0006508 - proteolysis (biological process)
GO:0016021 - integral component of membrane (cellular component)
GO:0004175 - endopeptidase activity (molecular function)
GO:0008236 - serine-type peptidase activity (molecular function)
InterPro domains: NA
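
The genome location string in the record can be split into a sequence name and a coordinate pair. The following is a minimal sketch of doing so; the helper names `parse_location` and `span_length` are hypothetical, and the length calculation assumes 1-based inclusive coordinates, which the record itself does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    seqid, start, end = match.groups()
    return seqid, int(start), int(end)


def span_length(start: int, end: int) -> int:
    """Length of the span, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


seqid, start, end = parse_location("ClcChr04:19396464..19401357")
print(seqid, start, end, span_length(start, end))
```

Under that assumption the gene spans 4894 bp on ClcChr04. This is longer than the mRNA listed under Sequences, which is what one would expect if the locus contains introns.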


Homology
GenBank top hits | e value | % identity | Alignment
KAA0025811.1 Zinc finger family protein, putative isoform 1 [Cucumis melo var. makuwa] | 1.4e-205 | 89.35
Query:  RGHDIVATFNVERPVSLLEDNIEQLRTDIFEEFPIPSIKVDILSLELLSGSNRTKVVFSLDPDGDDLEISSTYLSLIRSTIASLVTNQFLRITKSMFGEA
        +GHDIVATFNVER VSLLEDN +QLRTDIFEEFPIPSIKV+ILSLE LSGSNRTKVVFS+DPD DD EISSTYLSLIRS I SLVTNQFL ITKS FGEA
Subjt:  RGHDIVATFNVERPVSLLEDNIEQLRTDIFEEFPIPSIKVDILSLELLSGSNRTKVVFSLDPDGDDLEISSTYLSLIRSTIASLVTNQFLRITKSMFGEA

Query:  LSFEVLKFPGGITIIPPQSAFLLQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNGEGSTVTAPTIVQSSVLLEVGNTPSMRRLK
         SFEVLKFPGGITIIPPQSAFLLQKVQILFNFTLNFSIHQIQVHFSELTSQL+AGLRLAPYEILYIKLWN EGSTVTAPTIVQ+SVLLEVGNTPSMRRLK
Subjt:  LSFEVLKFPGGITIIPPQSAFLLQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNGEGSTVTAPTIVQSSVLLEVGNTPSMRRLK

Query:  QLAQTISGSNSSNLGLNNTEFGKVKQVRLSSILKHSLNGSDGNGPSRSPSPAPTPQPHNYHHPPTHHHHHRHTPLTPAISPAPATEKGASEYGSPAPERS
        QLAQTISGSNSSNLGLNN EFGKVKQVRLSSILKHSLNGS+GNGP RSPSPAPTPQPHN+HHPPTHHHHH HTPL  AISPAPATEKGA EYGSPAPERS
Subjt:  QLAQTISGSNSSNLGLNNTEFGKVKQVRLSSILKHSLNGSDGNGPSRSPSPAPTPQPHNYHHPPTHHHHHRHTPLTPAISPAPATEKGASEYGSPAPERS

Query:  AASPKRSYTARPPGCQYRYKRKSGRKEGKQSHLTPLASPNISPEHSAASPSSLPRHQVNPPAAPVAPAPALTPLPNVIYAHVQPPSKSDSSHQEKSTTNP
        AASP+RSYTA PPGCQYRYKRKSGRKEGKQSHLTPLASPNISP+HSAASPS  P+HQ+NPPAAPV+PAPALTPLPNVIYAHVQPPSKSDS+       NP
Subjt:  AASPKRSYTARPPGCQYRYKRKSGRKEGKQSHLTPLASPNISPEHSAASPSSLPRHQVNPPAAPVAPAPALTPLPNVIYAHVQPPSKSDSSHQEKSTTNP

Query:  SVASSPSPSGADRRRMITQWGFTLFLILARYM
        SVA  PSPSGADR  MITQWGFTLFLILAR+M
Subjt:  SVASSPSPSGADRRRMITQWGFTLFLILARYM

KGN54878.2 hypothetical protein Csa_012907 [Cucumis sativus] | 1.6e-214 | 85.56
Query:  CRWPMLLWVC-FDSKLHWLQMHLHSAIVRCLVHFCYQKDLGLNSSYRGHDIVATFNVERPVSLLEDNIEQLRTDIFEEFPIPSIKVDILSLELLSGSNRT
        C + +LL V  F S + WL   L         H+  QKDL LN SYRGHDIVATFNVER VSLLEDN +QLRTDIFEEFPIPSIKV+ILSLE LSGSNRT
Subjt:  CRWPMLLWVC-FDSKLHWLQMHLHSAIVRCLVHFCYQKDLGLNSSYRGHDIVATFNVERPVSLLEDNIEQLRTDIFEEFPIPSIKVDILSLELLSGSNRT

Query:  KVVFSLDPDGDDLEISSTYLSLIRSTIASLVTNQFLRITKSMFGEALSFEVLKFPGGITIIPPQSAFLLQKVQILFNFTLNFSIHQIQVHFSELTSQLEA
        KVVFSLDPD DD EISSTYLSLIRS I SLVTNQFL ITKS FGEA SFEVLKFPGGITIIPPQSAFLLQKVQILFNFTLNFSIHQIQVHFSELTSQLEA
Subjt:  KVVFSLDPDGDDLEISSTYLSLIRSTIASLVTNQFLRITKSMFGEALSFEVLKFPGGITIIPPQSAFLLQKVQILFNFTLNFSIHQIQVHFSELTSQLEA

Query:  GLRLAPYEILYIKLWNGEGSTVTAPTIVQSSVLLEVGNTPSMRRLKQLAQTISGSNSSNLGLNNTEFGKVKQVRLSSILKHSLNGSDGNGPSRSPSPAPT
        GLRLAPYEILYIKLWN EGSTVT PTIVQ+SVLLEVGNTPSMRRLKQLAQTISGSNSSNLGLNNTEFGKVKQVRLSSILKHSLNGSDGNGP RSPSPAPT
Subjt:  GLRLAPYEILYIKLWNGEGSTVTAPTIVQSSVLLEVGNTPSMRRLKQLAQTISGSNSSNLGLNNTEFGKVKQVRLSSILKHSLNGSDGNGPSRSPSPAPT

Query:  PQPHNYHHPPTHHHHHRHTPLTPAISPAPATEKGASEYGSPAPERSAASPKRSYTARPPGCQYRYKRKSGRKEGKQSHLTPLASPNISPEHSAASPSSLP
        PQPHN HHPPTHHHHH HTPLTPAISPAPATEKGA EYGSPAPER+AASPKRSYTA+PPGCQYRYKRKSGRKEGKQSHLTPLASPNISP+HSAASPS  P
Subjt:  PQPHNYHHPPTHHHHHRHTPLTPAISPAPATEKGASEYGSPAPERSAASPKRSYTARPPGCQYRYKRKSGRKEGKQSHLTPLASPNISPEHSAASPSSLP

Query:  RHQVNPPAAPVAPAPALTPLPNVIYAHVQPPSKSDSSHQEKSTTNPSVASSPSPSGADRRRMITQWGFTLFLILARYM
        +HQ+NPPAAPV+PAPALTPLPNVIYAHVQPPSKSDS+H      NPS+A  PSPSGADR  MITQWGFTLFLILA +M
Subjt:  RHQVNPPAAPVAPAPALTPLPNVIYAHVQPPSKSDSSHQEKSTTNPSVASSPSPSGADRRRMITQWGFTLFLILARYM

XP_004144318.1 uncharacterized protein LOC101216010 isoform X1 [Cucumis sativus] | 1.6e-214 | 85.56
Query:  CRWPMLLWVC-FDSKLHWLQMHLHSAIVRCLVHFCYQKDLGLNSSYRGHDIVATFNVERPVSLLEDNIEQLRTDIFEEFPIPSIKVDILSLELLSGSNRT
        C + +LL V  F S + WL   L         H+  QKDL LN SYRGHDIVATFNVER VSLLEDN +QLRTDIFEEFPIPSIKV+ILSLE LSGSNRT
Subjt:  CRWPMLLWVC-FDSKLHWLQMHLHSAIVRCLVHFCYQKDLGLNSSYRGHDIVATFNVERPVSLLEDNIEQLRTDIFEEFPIPSIKVDILSLELLSGSNRT

Query:  KVVFSLDPDGDDLEISSTYLSLIRSTIASLVTNQFLRITKSMFGEALSFEVLKFPGGITIIPPQSAFLLQKVQILFNFTLNFSIHQIQVHFSELTSQLEA
        KVVFSLDPD DD EISSTYLSLIRS I SLVTNQFL ITKS FGEA SFEVLKFPGGITIIPPQSAFLLQKVQILFNFTLNFSIHQIQVHFSELTSQLEA
Subjt:  KVVFSLDPDGDDLEISSTYLSLIRSTIASLVTNQFLRITKSMFGEALSFEVLKFPGGITIIPPQSAFLLQKVQILFNFTLNFSIHQIQVHFSELTSQLEA

Query:  GLRLAPYEILYIKLWNGEGSTVTAPTIVQSSVLLEVGNTPSMRRLKQLAQTISGSNSSNLGLNNTEFGKVKQVRLSSILKHSLNGSDGNGPSRSPSPAPT
        GLRLAPYEILYIKLWN EGSTVT PTIVQ+SVLLEVGNTPSMRRLKQLAQTISGSNSSNLGLNNTEFGKVKQVRLSSILKHSLNGSDGNGP RSPSPAPT
Subjt:  GLRLAPYEILYIKLWNGEGSTVTAPTIVQSSVLLEVGNTPSMRRLKQLAQTISGSNSSNLGLNNTEFGKVKQVRLSSILKHSLNGSDGNGPSRSPSPAPT

Query:  PQPHNYHHPPTHHHHHRHTPLTPAISPAPATEKGASEYGSPAPERSAASPKRSYTARPPGCQYRYKRKSGRKEGKQSHLTPLASPNISPEHSAASPSSLP
        PQPHN HHPPTHHHHH HTPLTPAISPAPATEKGA EYGSPAPER+AASPKRSYTA+PPGCQYRYKRKSGRKEGKQSHLTPLASPNISP+HSAASPS  P
Subjt:  PQPHNYHHPPTHHHHHRHTPLTPAISPAPATEKGASEYGSPAPERSAASPKRSYTARPPGCQYRYKRKSGRKEGKQSHLTPLASPNISPEHSAASPSSLP

Query:  RHQVNPPAAPVAPAPALTPLPNVIYAHVQPPSKSDSSHQEKSTTNPSVASSPSPSGADRRRMITQWGFTLFLILARYM
        +HQ+NPPAAPV+PAPALTPLPNVIYAHVQPPSKSDS+H      NPS+A  PSPSGADR  MITQWGFTLFLILA +M
Subjt:  RHQVNPPAAPVAPAPALTPLPNVIYAHVQPPSKSDSSHQEKSTTNPSVASSPSPSGADRRRMITQWGFTLFLILARYM

XP_008455751.1 PREDICTED: uncharacterized protein LOC103495852 [Cucumis melo] | 1.4e-213 | 81.98
Query:  AGIVPDGEKRRRTVTAVRHRLQAVRPGCRWPMLLWVC-FDSKLHWLQMHLHSAIVRCLVHFCYQKDLGLNSSYRGHDIVATFNVERPVSLLEDNIEQLRT
        +G+V DG   R  V+    RL   R  C + +LL V  F S + WL            +H+  QKDLGLN SYRGHDIVATFNVER VSLLEDN +QLRT
Subjt:  AGIVPDGEKRRRTVTAVRHRLQAVRPGCRWPMLLWVC-FDSKLHWLQMHLHSAIVRCLVHFCYQKDLGLNSSYRGHDIVATFNVERPVSLLEDNIEQLRT

Query:  DIFEEFPIPSIKVDILSLELLSGSNRTKVVFSLDPDGDDLEISSTYLSLIRSTIASLVTNQFLRITKSMFGEALSFEVLKFPGGITIIPPQSAFLLQKVQ
        DIFEEFPIPSIKV+ILSLE LSGSNRTKVVFS+DPD DD EISSTYLSLIRS I SLVTNQFL ITKS FGEA SFEVLKFPGGITIIPPQSAFLLQKVQ
Subjt:  DIFEEFPIPSIKVDILSLELLSGSNRTKVVFSLDPDGDDLEISSTYLSLIRSTIASLVTNQFLRITKSMFGEALSFEVLKFPGGITIIPPQSAFLLQKVQ

Query:  ILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNGEGSTVTAPTIVQSSVLLEVGNTPSMRRLKQLAQTISGSNSSNLGLNNTEFGKVKQV
        ILFNFTLNFSIHQIQVHFSELTSQL+AGLRLAPYEILYIKLWN EGSTVTAPTIVQ+SVLLEVGNTPSMRRLKQLAQTISGSNSSNLGLNN EFGKVKQV
Subjt:  ILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNGEGSTVTAPTIVQSSVLLEVGNTPSMRRLKQLAQTISGSNSSNLGLNNTEFGKVKQV

Query:  RLSSILKHSLNGSDGNGPSRSPSPAPTPQPHNYHHPPTHHHHHRHTPLTPAISPAPATEKGASEYGSPAPERSAASPKRSYTARPPGCQYRYKRKSGRKE
        RLSSILKHSLNGS+GNGP RSPSPAPTPQPHN+HHPPTHHHHH HTPL  AISPAPATEKGA EYGSPAPERSAASP+RSYTA PPGCQYRYKRKSGRKE
Subjt:  RLSSILKHSLNGSDGNGPSRSPSPAPTPQPHNYHHPPTHHHHHRHTPLTPAISPAPATEKGASEYGSPAPERSAASPKRSYTARPPGCQYRYKRKSGRKE

Query:  GKQSHLTPLASPNISPEHSAASPSSLPRHQVNPPAAPVAPAPALTPLPNVIYAHVQPPSKSDSSHQEKSTTNPSVASSPSPSGADRRRMITQWGFTLFLI
        GKQSHLTPLASPNISP+HSAASPS  P+HQ+NPPAAPV+PAPALTPLPNVIYAHVQPPSKSDS+       NPSVA  PSPSGADR  MITQWGFTLFLI
Subjt:  GKQSHLTPLASPNISPEHSAASPSSLPRHQVNPPAAPVAPAPALTPLPNVIYAHVQPPSKSDSSHQEKSTTNPSVASSPSPSGADRRRMITQWGFTLFLI

Query:  LARYM
        LAR+M
Subjt:  LARYM

XP_038882638.1 uncharacterized protein LOC120073837 [Benincasa hispida] | 3.3e-215 | 86.19
Query:  CRWPMLLWVC-FDSKLHWLQMHLHSAIVRCLVHFCYQKDLGLNSSYRGHDIVATFNVERPVSLLEDNIEQLRTDIFEEFPIPSIKVDILSLELLSGSNRT
        C + +LL V  F S + WL   L         H+  QKDLGLN SYRGHDIVATFNVERPVSLLEDNIEQLRTDIFEEF IPSIKVDILSLE L GSNRT
Subjt:  CRWPMLLWVC-FDSKLHWLQMHLHSAIVRCLVHFCYQKDLGLNSSYRGHDIVATFNVERPVSLLEDNIEQLRTDIFEEFPIPSIKVDILSLELLSGSNRT

Query:  KVVFSLDPDGDDLEISSTYLSLIRSTIASLVTNQFLRITKSMFGEALSFEVLKFPGGITIIPPQSAFLLQKVQILFNFTLNFSIHQIQVHFSELTSQLEA
        KVVFSLDPD D+ EISSTYLSLIRSTI SLVTNQFLRITKSMFGEA SFEVLKFPGGITIIPPQSAFLLQKVQILFNFTLNFSIHQIQVHFSELTSQLEA
Subjt:  KVVFSLDPDGDDLEISSTYLSLIRSTIASLVTNQFLRITKSMFGEALSFEVLKFPGGITIIPPQSAFLLQKVQILFNFTLNFSIHQIQVHFSELTSQLEA

Query:  GLRLAPYEILYIKLWNGEGSTVTAPTIVQSSVLLEVGNTPSMRRLKQLAQTISGSNSSNLGLNNTEFGKVKQVRLSSILKHSLNGSDGNGPSRSPSPAPT
        GLRLAPYEILY+KLWN EGSTVTAPTIVQSSVLLEVGNTPSMRRLKQLAQTISGSNSSNLGLNNTEFGKVKQVRLSSILKHSLNGS+GNGP+RSPSPAP 
Subjt:  GLRLAPYEILYIKLWNGEGSTVTAPTIVQSSVLLEVGNTPSMRRLKQLAQTISGSNSSNLGLNNTEFGKVKQVRLSSILKHSLNGSDGNGPSRSPSPAPT

Query:  PQPHNYHHPPTHHHHHRHTPLTPAISPAPATEKGASEYGSPAPERSAASPKRSYTARPPGCQYRYKRKSGRKEGKQSHLTPLASPNISPEHSAASPSSLP
        PQPHN  +PPTHHHHH HT LTPAISPAPATEKGA EYGSPAPERS ASPKRSYTA+PPGCQY  KRKSGRKEGKQSHLTPLASPN+SP+HSAASPS LP
Subjt:  PQPHNYHHPPTHHHHHRHTPLTPAISPAPATEKGASEYGSPAPERSAASPKRSYTARPPGCQYRYKRKSGRKEGKQSHLTPLASPNISPEHSAASPSSLP

Query:  RHQVNPPAAPVAPAPALTPLPNVIYAHVQPPSKSDSSHQEKSTTNPSVASSPSPSGADRRRMITQWGFTLFLILARYM
        +H+VNPPAAP+ PAPALTPLPNVIYAHVQPPSKS+S+H EKSTTNPS A SPSPSGADR  MITQWGFTLFLILA +M
Subjt:  RHQVNPPAAPVAPAPALTPLPNVIYAHVQPPSKSDSSHQEKSTTNPSVASSPSPSGADRRRMITQWGFTLFLILARYM

TrEMBL top hits | e value | % identity | Alignment
A0A0A0KYS3 Uncharacterized protein | 7.9e-215 | 85.56
Query:  CRWPMLLWVC-FDSKLHWLQMHLHSAIVRCLVHFCYQKDLGLNSSYRGHDIVATFNVERPVSLLEDNIEQLRTDIFEEFPIPSIKVDILSLELLSGSNRT
        C + +LL V  F S + WL   L         H+  QKDL LN SYRGHDIVATFNVER VSLLEDN +QLRTDIFEEFPIPSIKV+ILSLE LSGSNRT
Subjt:  CRWPMLLWVC-FDSKLHWLQMHLHSAIVRCLVHFCYQKDLGLNSSYRGHDIVATFNVERPVSLLEDNIEQLRTDIFEEFPIPSIKVDILSLELLSGSNRT

Query:  KVVFSLDPDGDDLEISSTYLSLIRSTIASLVTNQFLRITKSMFGEALSFEVLKFPGGITIIPPQSAFLLQKVQILFNFTLNFSIHQIQVHFSELTSQLEA
        KVVFSLDPD DD EISSTYLSLIRS I SLVTNQFL ITKS FGEA SFEVLKFPGGITIIPPQSAFLLQKVQILFNFTLNFSIHQIQVHFSELTSQLEA
Subjt:  KVVFSLDPDGDDLEISSTYLSLIRSTIASLVTNQFLRITKSMFGEALSFEVLKFPGGITIIPPQSAFLLQKVQILFNFTLNFSIHQIQVHFSELTSQLEA

Query:  GLRLAPYEILYIKLWNGEGSTVTAPTIVQSSVLLEVGNTPSMRRLKQLAQTISGSNSSNLGLNNTEFGKVKQVRLSSILKHSLNGSDGNGPSRSPSPAPT
        GLRLAPYEILYIKLWN EGSTVT PTIVQ+SVLLEVGNTPSMRRLKQLAQTISGSNSSNLGLNNTEFGKVKQVRLSSILKHSLNGSDGNGP RSPSPAPT
Subjt:  GLRLAPYEILYIKLWNGEGSTVTAPTIVQSSVLLEVGNTPSMRRLKQLAQTISGSNSSNLGLNNTEFGKVKQVRLSSILKHSLNGSDGNGPSRSPSPAPT

Query:  PQPHNYHHPPTHHHHHRHTPLTPAISPAPATEKGASEYGSPAPERSAASPKRSYTARPPGCQYRYKRKSGRKEGKQSHLTPLASPNISPEHSAASPSSLP
        PQPHN HHPPTHHHHH HTPLTPAISPAPATEKGA EYGSPAPER+AASPKRSYTA+PPGCQYRYKRKSGRKEGKQSHLTPLASPNISP+HSAASPS  P
Subjt:  PQPHNYHHPPTHHHHHRHTPLTPAISPAPATEKGASEYGSPAPERSAASPKRSYTARPPGCQYRYKRKSGRKEGKQSHLTPLASPNISPEHSAASPSSLP

Query:  RHQVNPPAAPVAPAPALTPLPNVIYAHVQPPSKSDSSHQEKSTTNPSVASSPSPSGADRRRMITQWGFTLFLILARYM
        +HQ+NPPAAPV+PAPALTPLPNVIYAHVQPPSKSDS+H      NPS+A  PSPSGADR  MITQWGFTLFLILA +M
Subjt:  RHQVNPPAAPVAPAPALTPLPNVIYAHVQPPSKSDSSHQEKSTTNPSVASSPSPSGADRRRMITQWGFTLFLILARYM

A0A1S3C173 uncharacterized protein LOC103495852 | 6.7e-214 | 81.98
Query:  AGIVPDGEKRRRTVTAVRHRLQAVRPGCRWPMLLWVC-FDSKLHWLQMHLHSAIVRCLVHFCYQKDLGLNSSYRGHDIVATFNVERPVSLLEDNIEQLRT
        +G+V DG   R  V+    RL   R  C + +LL V  F S + WL            +H+  QKDLGLN SYRGHDIVATFNVER VSLLEDN +QLRT
Subjt:  AGIVPDGEKRRRTVTAVRHRLQAVRPGCRWPMLLWVC-FDSKLHWLQMHLHSAIVRCLVHFCYQKDLGLNSSYRGHDIVATFNVERPVSLLEDNIEQLRT

Query:  DIFEEFPIPSIKVDILSLELLSGSNRTKVVFSLDPDGDDLEISSTYLSLIRSTIASLVTNQFLRITKSMFGEALSFEVLKFPGGITIIPPQSAFLLQKVQ
        DIFEEFPIPSIKV+ILSLE LSGSNRTKVVFS+DPD DD EISSTYLSLIRS I SLVTNQFL ITKS FGEA SFEVLKFPGGITIIPPQSAFLLQKVQ
Subjt:  DIFEEFPIPSIKVDILSLELLSGSNRTKVVFSLDPDGDDLEISSTYLSLIRSTIASLVTNQFLRITKSMFGEALSFEVLKFPGGITIIPPQSAFLLQKVQ

Query:  ILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNGEGSTVTAPTIVQSSVLLEVGNTPSMRRLKQLAQTISGSNSSNLGLNNTEFGKVKQV
        ILFNFTLNFSIHQIQVHFSELTSQL+AGLRLAPYEILYIKLWN EGSTVTAPTIVQ+SVLLEVGNTPSMRRLKQLAQTISGSNSSNLGLNN EFGKVKQV
Subjt:  ILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNGEGSTVTAPTIVQSSVLLEVGNTPSMRRLKQLAQTISGSNSSNLGLNNTEFGKVKQV

Query:  RLSSILKHSLNGSDGNGPSRSPSPAPTPQPHNYHHPPTHHHHHRHTPLTPAISPAPATEKGASEYGSPAPERSAASPKRSYTARPPGCQYRYKRKSGRKE
        RLSSILKHSLNGS+GNGP RSPSPAPTPQPHN+HHPPTHHHHH HTPL  AISPAPATEKGA EYGSPAPERSAASP+RSYTA PPGCQYRYKRKSGRKE
Subjt:  RLSSILKHSLNGSDGNGPSRSPSPAPTPQPHNYHHPPTHHHHHRHTPLTPAISPAPATEKGASEYGSPAPERSAASPKRSYTARPPGCQYRYKRKSGRKE

Query:  GKQSHLTPLASPNISPEHSAASPSSLPRHQVNPPAAPVAPAPALTPLPNVIYAHVQPPSKSDSSHQEKSTTNPSVASSPSPSGADRRRMITQWGFTLFLI
        GKQSHLTPLASPNISP+HSAASPS  P+HQ+NPPAAPV+PAPALTPLPNVIYAHVQPPSKSDS+       NPSVA  PSPSGADR  MITQWGFTLFLI
Subjt:  GKQSHLTPLASPNISPEHSAASPSSLPRHQVNPPAAPVAPAPALTPLPNVIYAHVQPPSKSDSSHQEKSTTNPSVASSPSPSGADRRRMITQWGFTLFLI

Query:  LARYM
        LAR+M
Subjt:  LARYM

A0A5A7SNH7 Zinc finger family protein, putative isoform 1 | 6.7e-206 | 89.35
Query:  RGHDIVATFNVERPVSLLEDNIEQLRTDIFEEFPIPSIKVDILSLELLSGSNRTKVVFSLDPDGDDLEISSTYLSLIRSTIASLVTNQFLRITKSMFGEA
        +GHDIVATFNVER VSLLEDN +QLRTDIFEEFPIPSIKV+ILSLE LSGSNRTKVVFS+DPD DD EISSTYLSLIRS I SLVTNQFL ITKS FGEA
Subjt:  RGHDIVATFNVERPVSLLEDNIEQLRTDIFEEFPIPSIKVDILSLELLSGSNRTKVVFSLDPDGDDLEISSTYLSLIRSTIASLVTNQFLRITKSMFGEA

Query:  LSFEVLKFPGGITIIPPQSAFLLQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNGEGSTVTAPTIVQSSVLLEVGNTPSMRRLK
         SFEVLKFPGGITIIPPQSAFLLQKVQILFNFTLNFSIHQIQVHFSELTSQL+AGLRLAPYEILYIKLWN EGSTVTAPTIVQ+SVLLEVGNTPSMRRLK
Subjt:  LSFEVLKFPGGITIIPPQSAFLLQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNGEGSTVTAPTIVQSSVLLEVGNTPSMRRLK

Query:  QLAQTISGSNSSNLGLNNTEFGKVKQVRLSSILKHSLNGSDGNGPSRSPSPAPTPQPHNYHHPPTHHHHHRHTPLTPAISPAPATEKGASEYGSPAPERS
        QLAQTISGSNSSNLGLNN EFGKVKQVRLSSILKHSLNGS+GNGP RSPSPAPTPQPHN+HHPPTHHHHH HTPL  AISPAPATEKGA EYGSPAPERS
Subjt:  QLAQTISGSNSSNLGLNNTEFGKVKQVRLSSILKHSLNGSDGNGPSRSPSPAPTPQPHNYHHPPTHHHHHRHTPLTPAISPAPATEKGASEYGSPAPERS

Query:  AASPKRSYTARPPGCQYRYKRKSGRKEGKQSHLTPLASPNISPEHSAASPSSLPRHQVNPPAAPVAPAPALTPLPNVIYAHVQPPSKSDSSHQEKSTTNP
        AASP+RSYTA PPGCQYRYKRKSGRKEGKQSHLTPLASPNISP+HSAASPS  P+HQ+NPPAAPV+PAPALTPLPNVIYAHVQPPSKSDS+       NP
Subjt:  AASPKRSYTARPPGCQYRYKRKSGRKEGKQSHLTPLASPNISPEHSAASPSSLPRHQVNPPAAPVAPAPALTPLPNVIYAHVQPPSKSDSSHQEKSTTNP

Query:  SVASSPSPSGADRRRMITQWGFTLFLILARYM
        SVA  PSPSGADR  MITQWGFTLFLILAR+M
Subjt:  SVASSPSPSGADRRRMITQWGFTLFLILARYM

A0A6J1EEJ8 uncharacterized protein LOC111432513 isoform X4 | 1.9e-192 | 79.12
Query:  CRWPMLLWVC-FDSKLHWLQMHLHSAIVRCLVHFCYQKDLGLNSSYRGHDIVATFNVERPVSLLEDNIEQLRTDIFEEFPIPSIKVDILSLELLSGSNRT
        C + +LL V  F S + WL   L         H+  QKDLGLN SYRGHDIVATF VERPVSLL+DNIE+LRTDIFEEFPIPSIKVDILSL  LSGSNRT
Subjt:  CRWPMLLWVC-FDSKLHWLQMHLHSAIVRCLVHFCYQKDLGLNSSYRGHDIVATFNVERPVSLLEDNIEQLRTDIFEEFPIPSIKVDILSLELLSGSNRT

Query:  KVVFSLDPDGDDLEISSTYLSLIRSTIASLVTNQ-FLRITKSMFGEALSFEVLKFPGGITIIPPQSAFLLQKVQILFNFTLNFSIHQIQVHFSELTSQLE
        KVVF +DPD DD EI STYLSLIRST AS+VTNQ FLRITKSMFGEA SFEVLKFPGGITIIPPQSAFLLQKVQILFNFTLNFSIHQIQVHFSELTSQL+
Subjt:  KVVFSLDPDGDDLEISSTYLSLIRSTIASLVTNQ-FLRITKSMFGEALSFEVLKFPGGITIIPPQSAFLLQKVQILFNFTLNFSIHQIQVHFSELTSQLE

Query:  AGLRLAPYEILYIKLWNGEGSTVTAPTIVQSSVLLEVGNTPSMRRLKQLAQTISGSNSSNLGLNNTEFGKVKQVRLSSILKHSLNGSDGNGPSRSPSPAP
        AGLRLAPYEILYIKLWN EGSTVTAPTIVQSSVLLEVGNTPSM+RLKQLAQTIS SNSSNLGLNNTEFGKVKQVRLSSILKHSLNG DG GP RSPSPAP
Subjt:  AGLRLAPYEILYIKLWNGEGSTVTAPTIVQSSVLLEVGNTPSMRRLKQLAQTISGSNSSNLGLNNTEFGKVKQVRLSSILKHSLNGSDGNGPSRSPSPAP

Query:  TPQPHNYHHPPTHHHHHRHTPLTPAISPAPATEKGASEYGSPAPERSAASPKRSYTARPPGCQYRYKRKSGRKEGKQSHLTPLASPNISPEHSAASPSSL
        TPQPHN+HHPP+HHHHH H PLTP ISPAPA E GA EYG  AP +SAASPKRSY A+PPGCQ  YKRKSGRKEGKQ HL+PLASP+ISP HSAASPS  
Subjt:  TPQPHNYHHPPTHHHHHRHTPLTPAISPAPATEKGASEYGSPAPERSAASPKRSYTARPPGCQYRYKRKSGRKEGKQSHLTPLASPNISPEHSAASPSSL

Query:  PRHQVNPPAAPVAPAPALTPLPNVIYAHVQPPSKSDSSHQEKSTTNPSVASSPSPSGADRRRMITQWGFTLFLILARYM
         +H        V+P  A TPLP+VIYAHVQPPSKSDS+H EKSTT+PS+  SPSPS A    MIT+WGFTL LI+A YM
Subjt:  PRHQVNPPAAPVAPAPALTPLPNVIYAHVQPPSKSDSSHQEKSTTNPSVASSPSPSGADRRRMITQWGFTLFLILARYM

A0A6J1HSR1 uncharacterized protein LOC111466276 isoform X3 | 4.7e-191 | 82.55
Query:  HFCYQKDLGLNSSYRGHDIVATFNVERPVSLLEDNIEQLRTDIFEEFPIPSIKVDILSLELLSGSNRTKVVFSLDPDGDDLEISSTYLSLIRSTIASLVT
        H+  QKDLGLN SYRGHDIVATF VERPVSLLEDNIE+LRTDIFEEFPIPSIKVDILSL  LSGSNRTKVVF +DPD DD EI STYLSLIRST ASLVT
Subjt:  HFCYQKDLGLNSSYRGHDIVATFNVERPVSLLEDNIEQLRTDIFEEFPIPSIKVDILSLELLSGSNRTKVVFSLDPDGDDLEISSTYLSLIRSTIASLVT

Query:  NQ-FLRITKSMFGEALSFEVLKFPGGITIIPPQSAFLLQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNGEGSTVTAPTIVQSS
        NQ FLRITKSMFGEA SFEVLKFPGGITIIPPQSAFLLQKVQILFNFTLNFSIHQIQVHFSELTSQL+AGLRLAPYEILYIKLWN EGSTVTAPTIVQSS
Subjt:  NQ-FLRITKSMFGEALSFEVLKFPGGITIIPPQSAFLLQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNGEGSTVTAPTIVQSS

Query:  VLLEVGNTPSMRRLKQLAQTISGSNSSNLGLNNTEFGKVKQVRLSSILKHSLNGSDGNGPSRSPSPAPTPQPHNYHHPPTHHHHHRHTPLTPAISPAPAT
        VLLEVGNTPSM+RLKQLAQTIS SNSSNLGLNNTEFGKVKQVRLSSILKHSLNG DG GP RSPSPAPTPQ HN+HHPP+HHHHH H+PLTP ISPAPA 
Subjt:  VLLEVGNTPSMRRLKQLAQTISGSNSSNLGLNNTEFGKVKQVRLSSILKHSLNGSDGNGPSRSPSPAPTPQPHNYHHPPTHHHHHRHTPLTPAISPAPAT

Query:  EKGASEYGSPAPERSAASPKRSYTARPPGCQYRYKRKSGRKEGKQSHLTPLASPNISPEHSAASPSSLPRHQVNPPAAPVAPAPALTPLPNVIYAHVQPP
        E GA EYG PAP +SAASPKRSY A+PPGCQ  YKRKSGRKEGKQ +L+PLASP+ISP HSAASPS   +H        V+P  A TPLP+VIYAHVQPP
Subjt:  EKGASEYGSPAPERSAASPKRSYTARPPGCQYRYKRKSGRKEGKQSHLTPLASPNISPEHSAASPSSLPRHQVNPPAAPVAPAPALTPLPNVIYAHVQPP

Query:  SKSDSSHQEKSTTNPSVASSPSPSGADRRRMITQWGFTLFLILARYM
        SKS+S+H EKSTT+PS+  SPSPS A    MIT+W FTL LI+A YM
Subjt:  SKSDSSHQEKSTTNPSVASSPSPSGADRRRMITQWGFTLFLILARYM

SwissProt top hits | e value | % identity | Alignment
No hits found

Arabidopsis top hits | e value | % identity | Alignment
AT1G10790.1 BEST Arabidopsis thaliana protein match is: hydroxyproline-rich glycoprotein family protein (TAIR:AT3G56590.2) | 1.1e-27 | 35.18
Query:  IVATFNVERPVSLLEDNIEQLRTDIFEEFPIP-SIKVDILSLELLSGSNRTKVVFSLDPDGDDLEISSTYLSLIRSTIASLVTNQF-LRITKSMFGEALS
        + A+F +++PVS +  +  ++  DI     +  + KV +LSL     SN T V F++ P   D EIS   LSL+RS+   L   +  L++T S FG+  S
Subjt:  IVATFNVERPVSLLEDNIEQLRTDIFEEFPIP-SIKVDILSLELLSGSNRTKVVFSLDPDGDDLEISSTYLSLIRSTIASLVTNQF-LRITKSMFGEALS

Query:  FEVLKFPGGITIIPPQSAFLLQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNGEGSTVTAPTIVQSSVLLEVGNTPSMRRLKQL
        F+VLKFPGGIT+ P + A +     +LF+ T+  SI  +Q     L    E  L L PYE ++ +L N +GST++ P   Q  V   +      +RL   
Subjt:  FEVLKFPGGITIIPPQSAFLLQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNGEGSTVTAPTIVQSSVLLEVGNTPSMRRLKQL

Query:  AQTISGSNSSNLGLNNTEFGKVKQVRLSSILKHSLNGSDGNGPSRSPSPAPTP
         Q I  S + NLGL+   FG+VK +  S+ L   +  SD        +PAPTP
Subjt:  AQTISGSNSSNLGLNNTEFGKVKQVRLSSILKHSLNGSDGNGPSRSPSPAPTP

AT3G10810.1 zinc finger (C3HC4-type RING finger) family protein | 2.7e-82 | 47.36
Query:  QKDLGLNSSYRGHDIVATFNVERPVSLLEDNIEQLRTDIFEEFPIPSIKVDILSLELLSGSNRTKVVFSLDPDGDDLEISSTYLSLIRSTIASLVTNQ-F
        ++D  L+  +RGH IVA+F++ R  S L +N  QL+ DIF+E    SIKV IL++E     N TKVVF +DPD    EI    LS I+    S++ NQ  
Subjt:  QKDLGLNSSYRGHDIVATFNVERPVSLLEDNIEQLRTDIFEEFPIPSIKVDILSLELLSGSNRTKVVFSLDPDGDDLEISSTYLSLIRSTIASLVTNQ-F

Query:  LRITKSMFGEALSFEVLKFPGGITIIPPQSAFLLQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNGEGSTVTAPTIVQSSVLLE
        L++TKS+FGE   FEVLKFPGGIT+IPPQSAF LQK +I+FNFTLN+SIHQIQ++F+ L SQL+ GL LAPYE LY+ L N EGSTV+ PT V SSVLL 
Subjt:  LRITKSMFGEALSFEVLKFPGGITIIPPQSAFLLQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNGEGSTVTAPTIVQSSVLLE

Query:  VGNTPSMRRLKQLAQTISGSNSSNLGLNNTEFGKVKQVRLSSILKHSLNGSDGNGPSRSPSPAPTPQPHNYHHPPTH--------HHHHRHTPLTPAISP
        VG + S  RLKQL  TI+GS S NLGLNNT FGKVKQVRLSS L    N SD +  S SPSP+P  + H++HH   H        HHHH H  L+P ++P
Subjt:  VGNTPSMRRLKQLAQTISGSNSSNLGLNNTEFGKVKQVRLSSILKHSLNGSDGNGPSRSPSPAPTPQPHNYHHPPTH--------HHHHRHTPLTPAISP

Query:  APATEKGASEYGSPAPERSAASPKRSYTARP---PGCQYRYKRKSGRKEGKQSHLTPLASPNISPEHSAASPSSLPRHQVNPPAAPVAPA-----PALTP
                S   SPAP RS    KR+ +A P   PG +  +K K       Q   TP  +P+          +  P HQ++ P AP++ A     P   P
Subjt:  APATEKGASEYGSPAPERSAASPKRSYTARP---PGCQYRYKRKSGRKEGKQSHLTPLASPNISPEHSAASPSSLPRHQVNPPAAPVAPA-----PALTP

Query:  LPNVIYAH-VQPPSKSDSSHQEKSTTNPSVASSPS
        LP+V++AH  QPP             +P   SS S
Subjt:  LPNVIYAH-VQPPSKSDSSHQEKSTTNPSVASSPS

AT3G56590.1 hydroxyproline-rich glycoprotein family protein | 1.2e-85 | 45.54
Query:  FDSKLHWLQMHLHSAIVRCLVHFCYQKDLGLNSSYRGHDIVATFNVERPVSLLEDNIEQLRTDIFEEFPIPSIKVDILSLELLSGSNRTKVVFSLDPDGD
        F S L WL   L          F    DL L+  ++ H IVA+F+V +P+S +EDN+ QL  DI +E   P  KV +L+LE L   NRT V+F++DP+ +
Subjt:  FDSKLHWLQMHLHSAIVRCLVHFCYQKDLGLNSSYRGHDIVATFNVERPVSLLEDNIEQLRTDIFEEFPIPSIKVDILSLELLSGSNRTKVVFSLDPDGD

Query:  DLEISSTYLSLIRSTIASLVTNQF-LRITKSMFGEALSFEVLKFPGGITIIPPQSAFLLQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEIL
        + +I +   SLI++   +LV  Q   R+T+S+FGE   FEVLKFPGGIT+IPPQ  F LQK Q+LFNFTLNFSI+QIQ +F EL SQL+ G+ LA YE L
Subjt:  DLEISSTYLSLIRSTIASLVTNQF-LRITKSMFGEALSFEVLKFPGGITIIPPQSAFLLQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEIL

Query:  YIKLWNGEGSTVTAPTIVQSSVLLEVGNTPSMRRLKQLAQTISGSNSSNLGLNNTEFGKVKQVRLSSILKHSLNGSDGNGPSRSPSPAPTPQPHNY-HHP
        YI L N  GSTV  PTIV SSVLL  G   S  RLKQLAQTI+ S+S NLGLN+T FGKVKQVRLSSIL HS         S +PSP+P P+ H Y HH 
Subjt:  YIKLWNGEGSTVTAPTIVQSSVLLEVGNTPSMRRLKQLAQTISGSNSSNLGLNNTEFGKVKQVRLSSILKHSLNGSDGNGPSRSPSPAPTPQPHNY-HHP

Query:  PTHHHHHRHTPLTPAISPAPATEKGASEYGSPAPERSAASPKRSYTARPPGCQYRYKRKSGRKEGKQSHLTPLASPNISPEHSAASPSSLPRHQVNPPAA
        P HHHHH      P++SP       AS     AP + +  P R+     P C Y  +R  G          P  +P+ S  H  A   + PRH       
Subjt:  PTHHHHHRHTPLTPAISPAPATEKGASEYGSPAPERSAASPKRSYTARPPGCQYRYKRKSGRKEGKQSHLTPLASPNISPEHSAASPSSLPRHQVNPPAA

Query:  PVAPAPALTPLPNVIYAHVQPPSKSDSSHQEKSTTNPSVASSPSPSGA
             P  +PLP+V++AH+ PPSKS    +     +PS A +P  S +
Subjt:  PVAPAPALTPLPNVIYAHVQPPSKSDSSHQEKSTTNPSVASSPSPSGA

AT3G56590.2 hydroxyproline-rich glycoprotein family protein | 9.0e-86 | 45.95
Query:  FDSKLHWLQMHLHSAIVRCLVHFCYQKDLGLNSSYRGHDIVATFNVERPVSLLEDNIEQLRTDIFEEFPIPSIKVDILSLELLSGSNRTKVVFSLDPDGD
        F S L WL   L          F    DL L+  ++ H IVA+F+V +P+S +EDN+ QL  DI +E   P  KV +L+LE L   NRT V+F++DP+ +
Subjt:  FDSKLHWLQMHLHSAIVRCLVHFCYQKDLGLNSSYRGHDIVATFNVERPVSLLEDNIEQLRTDIFEEFPIPSIKVDILSLELLSGSNRTKVVFSLDPDGD

Query:  DLEISSTYLSLIRSTIASLVTNQF-LRITKSMFGEALSFEVLKFPGGITIIPPQSAFLLQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEIL
        + +I +   SLI++   +LV  Q   R+T+S+FGE   FEVLKFPGGIT+IPPQ  F LQK Q+LFNFTLNFSI+QIQ +F EL SQL+ G+ LA YE L
Subjt:  DLEISSTYLSLIRSTIASLVTNQF-LRITKSMFGEALSFEVLKFPGGITIIPPQSAFLLQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEIL

Query:  YIKLWNGEGSTVTAPTIVQSSVLLEVGNTPSMRRLKQLAQTISGSNSSNLGLNNTEFGKVKQVRLSSILKHSLNGSDGNGPSRSPSPAPTPQPHNY-HHP
        YI L N  GSTV  PTIV SSVLL  G   S  RLKQLAQTI+ S+S NLGLN+T FGKVKQVRLSSIL HS         S +PSP+P P+ H Y HH 
Subjt:  YIKLWNGEGSTVTAPTIVQSSVLLEVGNTPSMRRLKQLAQTISGSNSSNLGLNNTEFGKVKQVRLSSILKHSLNGSDGNGPSRSPSPAPTPQPHNY-HHP

Query:  PTHHHHHRHTPLTPAISPAPATEKGASEYGSPAPERSAASPKRSYTARPPGCQYRYKRKSGRKEGKQSHLTPLASPNISPEHSAASPSSLPRHQVNPPAA
        P HHHHH      P++SP       AS     AP + +  P R+     P C Y  +R  G          P  +P+ S  H  A   + PRH       
Subjt:  PTHHHHHRHTPLTPAISPAPATEKGASEYGSPAPERSAASPKRSYTARPPGCQYRYKRKSGRKEGKQSHLTPLASPNISPEHSAASPSSLPRHQVNPPAA

Query:  PVAPAPALTPLPNVIYAHVQPPSKSDSSHQEKSTTNPSVASSPS
             P  +PLP+V++AH+ PPSKS    +     +PS A +PS
Subjt:  PVAPAPALTPLPNVIYAHVQPPSKSDSSHQEKSTTNPSVASSPS
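
In the alignment blocks above, the unlabelled middle line follows the usual BLAST convention: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. The sketch below counts these for a single block, taking the last block of the KAA0025811.1 hit as input. The function name `midline_stats` is hypothetical. A per-block figure will generally differ from the % identity reported for the hit, which presumably covers the whole alignment.

```python
def midline_stats(query: str, midline: str) -> tuple[int, int, int]:
    """Return (identities, positives, aligned_length) for one alignment block.

    The midline is padded to the query length in case trailing spaces
    were stripped when the page was copied.
    """
    midline = midline.ljust(len(query))
    identities = sum(ch.isalpha() for ch in midline)
    positives = identities + midline.count("+")
    return identities, positives, len(query)


# Last block of the first GenBank hit, with the "Query:" prefix removed.
query = "SVASSPSPSGADRRRMITQWGFTLFLILARYM"
midline = "SVA  PSPSGADR  MITQWGFTLFLILAR+M"

identities, positives, length = midline_stats(query, midline)
print(f"identities {identities}/{length}, positives {positives}/{length}")
print(f"block identity: {100 * identities / length:.2f}%")
```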


Sequences
CDS sequence
ATGGGTGTCCACCATCCAAAAGGCCCCAAAAAGAGTAGAGTTGTCATTTGCAGTTCACTTACTGTTCAATCTTATTATTACACACACCCACATTCAAATTTCACTCCCAC
TTCTGTTCTCTTCTCACTCCACCGCAATAATGGCCGCACCCAACTCACCCCCAGTGGAGAAGAGCCGGAATGGCCCTTAACCCACTTTGCCGGCATTGTTCCCGATGGGG
AAAAACGACGGAGAACAGTCACTGCCGTCCGCCATCGACTCCAGGCCGTCCGGCCAGGCTGCCGATGGCCGATGCTCTTGTGGGTGTGTTTCGATTCGAAGCTTCATTGG
CTTCAGATGCATCTTCATTCTGCTATTGTCCGTTGCCTTGTTCATTTCTGCTATCAGAAGGATCTGGGTCTTAATTCATCGTATCGAGGTCATGATATAGTAGCAACATT
CAATGTTGAGAGACCGGTTTCTTTGCTGGAAGACAATATCGAGCAACTTCGAACCGACATTTTTGAAGAGTTCCCTATACCTTCTATCAAAGTGGATATACTATCTCTAG
AACTATTATCTGGATCAAACCGTACAAAAGTTGTGTTCAGCCTCGATCCAGATGGCGATGACTTGGAAATCTCATCAACTTATCTAAGTTTAATCAGGTCAACCATTGCA
AGTCTAGTAACGAATCAGTTCCTCCGCATTACTAAGTCCATGTTTGGGGAGGCCTTATCGTTTGAAGTACTGAAATTCCCCGGAGGAATAACGATAATCCCGCCACAGAG
TGCATTTCTTTTGCAGAAAGTGCAGATTCTTTTCAACTTTACATTGAACTTCTCTATTCATCAGATTCAAGTACATTTCAGTGAACTTACCAGCCAACTGGAGGCGGGAT
TACGACTAGCTCCATATGAGATTTTATATATTAAGTTGTGGAATGGGGAAGGTTCGACTGTGACTGCCCCTACGATTGTCCAGTCGTCTGTACTTCTTGAAGTTGGAAAT
ACTCCATCAATGCGACGGCTGAAGCAGCTAGCTCAAACAATCTCAGGTTCTAATTCTAGCAACCTTGGCCTGAATAATACGGAGTTTGGAAAGGTGAAGCAAGTTCGCCT
TTCGTCAATTCTTAAACACTCCCTCAATGGCAGTGATGGGAACGGCCCCTCAAGGTCACCTTCTCCTGCTCCTACACCCCAGCCCCATAACTACCATCATCCCCCGACTC
ACCACCACCACCACCGTCACACTCCTCTAACCCCTGCAATTTCACCTGCCCCTGCAACCGAGAAGGGTGCATCAGAATACGGTTCACCTGCCCCTGAAAGAAGCGCAGCA
TCACCTAAGCGAAGTTACACGGCAAGGCCGCCTGGTTGTCAATATAGATACAAGAGGAAGTCTGGTAGGAAAGAAGGAAAGCAATCTCACTTAACCCCGCTTGCTTCACC
CAATATATCTCCTGAGCATTCTGCTGCATCGCCATCATCACTGCCACGACATCAAGTAAACCCGCCAGCAGCACCCGTCGCTCCAGCTCCGGCATTAACTCCATTGCCAA
ACGTCATTTACGCTCATGTTCAACCACCTTCCAAAAGTGACTCCAGCCACCAGGAAAAATCCACGACAAATCCATCAGTTGCATCATCTCCATCTCCATCTGGTGCTGAT
CGTCGTCGTATGATCACTCAATGGGGATTCACACTGTTTCTTATTCTCGCACGCTATATGTAA
mRNA sequence
ATGGGTGTCCACCATCCAAAAGGCCCCAAAAAGAGTAGAGTTGTCATTTGCAGTTCACTTACTGTTCAATCTTATTATTACACACACCCACATTCAAATTTCACTCCCAC
TTCTGTTCTCTTCTCACTCCACCGCAATAATGGCCGCACCCAACTCACCCCCAGTGGAGAAGAGCCGGAATGGCCCTTAACCCACTTTGCCGGCATTGTTCCCGATGGGG
AAAAACGACGGAGAACAGTCACTGCCGTCCGCCATCGACTCCAGGCCGTCCGGCCAGGCTGCCGATGGCCGATGCTCTTGTGGGTGTGTTTCGATTCGAAGCTTCATTGG
CTTCAGATGCATCTTCATTCTGCTATTGTCCGTTGCCTTGTTCATTTCTGCTATCAGAAGGATCTGGGTCTTAATTCATCGTATCGAGGTCATGATATAGTAGCAACATT
CAATGTTGAGAGACCGGTTTCTTTGCTGGAAGACAATATCGAGCAACTTCGAACCGACATTTTTGAAGAGTTCCCTATACCTTCTATCAAAGTGGATATACTATCTCTAG
AACTATTATCTGGATCAAACCGTACAAAAGTTGTGTTCAGCCTCGATCCAGATGGCGATGACTTGGAAATCTCATCAACTTATCTAAGTTTAATCAGGTCAACCATTGCA
AGTCTAGTAACGAATCAGTTCCTCCGCATTACTAAGTCCATGTTTGGGGAGGCCTTATCGTTTGAAGTACTGAAATTCCCCGGAGGAATAACGATAATCCCGCCACAGAG
TGCATTTCTTTTGCAGAAAGTGCAGATTCTTTTCAACTTTACATTGAACTTCTCTATTCATCAGATTCAAGTACATTTCAGTGAACTTACCAGCCAACTGGAGGCGGGAT
TACGACTAGCTCCATATGAGATTTTATATATTAAGTTGTGGAATGGGGAAGGTTCGACTGTGACTGCCCCTACGATTGTCCAGTCGTCTGTACTTCTTGAAGTTGGAAAT
ACTCCATCAATGCGACGGCTGAAGCAGCTAGCTCAAACAATCTCAGGTTCTAATTCTAGCAACCTTGGCCTGAATAATACGGAGTTTGGAAAGGTGAAGCAAGTTCGCCT
TTCGTCAATTCTTAAACACTCCCTCAATGGCAGTGATGGGAACGGCCCCTCAAGGTCACCTTCTCCTGCTCCTACACCCCAGCCCCATAACTACCATCATCCCCCGACTC
ACCACCACCACCACCGTCACACTCCTCTAACCCCTGCAATTTCACCTGCCCCTGCAACCGAGAAGGGTGCATCAGAATACGGTTCACCTGCCCCTGAAAGAAGCGCAGCA
TCACCTAAGCGAAGTTACACGGCAAGGCCGCCTGGTTGTCAATATAGATACAAGAGGAAGTCTGGTAGGAAAGAAGGAAAGCAATCTCACTTAACCCCGCTTGCTTCACC
CAATATATCTCCTGAGCATTCTGCTGCATCGCCATCATCACTGCCACGACATCAAGTAAACCCGCCAGCAGCACCCGTCGCTCCAGCTCCGGCATTAACTCCATTGCCAA
ACGTCATTTACGCTCATGTTCAACCACCTTCCAAAAGTGACTCCAGCCACCAGGAAAAATCCACGACAAATCCATCAGTTGCATCATCTCCATCTCCATCTGGTGCTGAT
CGTCGTCGTATGATCACTCAATGGGGATTCACACTGTTTCTTATTCTCGCACGCTATATGTAACATTCAAAAAGAAGACTACTGGTTTTCTGGTAAACATGTGTCGACGG
CAACAAGAGATGCCAAGGTTTTTAAAAAATGATAAAGTGCAGGAGTTTCTTTAAAAGTGTGAGCACAGAGTAGAGAAGCAAAGCCAAAGAGGCATTGCTTGTTGTAAGAT
GGTTTCCAAGTGTGTAAATATCATCTGATTAAGAAACTTGTTGCAAATGCAGTTTCAGGTCAAAGTCCACAGAGGTGGCAGGCCTTCAGAAACTTGCATATTTTCCCACT
GTTTTGTGTATTATGATCTTCTTCTCCATAAAATGTAAGGAGATAGAGAAGGAAAAAAAGGAAAAAA
Protein sequence
MGVHHPKGPKKSRVVICSSLTVQSYYYTHPHSNFTPTSVLFSLHRNNGRTQLTPSGEEPEWPLTHFAGIVPDGEKRRRTVTAVRHRLQAVRPGCRWPMLLWVCFDSKLHW
LQMHLHSAIVRCLVHFCYQKDLGLNSSYRGHDIVATFNVERPVSLLEDNIEQLRTDIFEEFPIPSIKVDILSLELLSGSNRTKVVFSLDPDGDDLEISSTYLSLIRSTIA
SLVTNQFLRITKSMFGEALSFEVLKFPGGITIIPPQSAFLLQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNGEGSTVTAPTIVQSSVLLEVGN
TPSMRRLKQLAQTISGSNSSNLGLNNTEFGKVKQVRLSSILKHSLNGSDGNGPSRSPSPAPTPQPHNYHHPPTHHHHHRHTPLTPAISPAPATEKGASEYGSPAPERSAA
SPKRSYTARPPGCQYRYKRKSGRKEGKQSHLTPLASPNISPEHSAASPSSLPRHQVNPPAAPVAPAPALTPLPNVIYAHVQPPSKSDSSHQEKSTTNPSVASSPSPSGAD
RRRMITQWGFTLFLILARYM
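
The CDS and protein sequences above can be cross-checked by translating the CDS. The sketch below does this for the final line of the CDS only, assuming the standard genetic code (NCBI translation table 1); the record does not state which table was used, and the `translate` helper is illustrative rather than part of any database tooling.

```python
from itertools import product

# Standard genetic code (an assumption here), with codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# Final line of the CDS sequence from the record (63 nt, ending in the stop codon TAA).
cds_tail = "CGTCGTCGTATGATCACTCAATGGGGATTCACACTGTTTCTTATTCTCGCACGCTATATGTAA"
print(translate(cds_tail))
```

The translated tail can be compared with the last line of the protein sequence. Applying the same function to the full CDS, joined without line breaks, gives a check of the whole record.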