; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MC09g0203 (gene) of Bitter gourd (Dali-11) v1 genome

Gene IDMC09g0203
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
Descriptionserine/arginine repetitive matrix protein 1-like
Genome locationMC09:1797743..1799395
RNA-Seq ExpressionMC09g0203
SyntenyMC09g0203
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR008480 - Protein of unknown function DUF761, plant


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6575459.1 hypothetical protein SDJN03_26098, partial [Cucurbita argyrosperma subsp. sororia]1.31e-28877.18Show/hide
Query:  MEGDGDAPPPFWLQSSNSLHHLDYNRRPRRRLSRASSFLLNSSAFLIVLLVIVLCFILIVIPKFVHFGSQLLRPQSVKKSWDSLNLVLVLFAIVCGFLSR
        ME DG+APPPFWLQ SNSLH LD +RR R RLSRASSFLLNSSAFL+VLLVIVLCFI IVIPKFV FGSQL+RPQS+KKSWDSLNLVLVLFAIVCGFLSR
Subjt:  MEGDGDAPPPFWLQSSNSLHHLDYNRRPRRRLSRASSFLLNSSAFLIVLLVIVLCFILIVIPKFVHFGSQLLRPQSVKKSWDSLNLVLVLFAIVCGFLSR

Query:  NTGDDSKASFEDRSVSSKRTGKSNPTTPRRWYGYSDDRPTHYTLNRMRSSSSYPDLRLQESLLDAGDERWRFYDDTHVHNHRLPASDQLHRRREARPELE
        N GDDS+ SFEDRSVSS+R  KSNP  PR+W GY+D RP HYT+NRMRSSSSYPDLRLQES LDAGD+RWR YDDTHV N+R P+SDQLHRRREARPELE
Subjt:  NTGDDSKASFEDRSVSSKRTGKSNPTTPRRWYGYSDDRPTHYTLNRMRSSSSYPDLRLQESLLDAGDERWRFYDDTHVHNHRLPASDQLHRRREARPELE

Query:  HEDFGARSTGFDRSGIREEVYSKPRMLSPPRSPPP---------PPPTLPPPPKTTPTVVKRRPMRTHKVHSHTPDGEIDRQHKNGDSDVADCQRIQLPP
         ED   +S GFDRS IRE+VYS+  + SPPRSPPP         PPPT PPP  TTP VVKRRP RTHKVHSHTP GEID+ +KNGDSDVA+ QRI LPP
Subjt:  HEDFGARSTGFDRSGIREEVYSKPRMLSPPRSPPP---------PPPTLPPPPKTTPTVVKRRPMRTHKVHSHTPDGEIDRQHKNGDSDVADCQRIQLPP

Query:  LSPPAFYQESEQKIGKNEKKRGGPTKEFWTTALRRRKKKQRQKSIESFETILSSQRPSTSSLPPASPPPPPPLPPPSVFQNLFSSKKGKGRKGQSLTLPE
        LSPP FY+ESEQK  KNEKKRGG  KE W+ ALRRR+KKQRQKSIESFE I++SQRPSTSSLPP SPPPPPPL  PSV Q LF+SKKG+G+K QS   PE
Subjt:  LSPPAFYQESEQKIGKNEKKRGGPTKEFWTTALRRRKKKQRQKSIESFETILSSQRPSTSSLPPASPPPPPPLPPPSVFQNLFSSKKGKGRKGQSLTLPE

Query:  PPPPSIASSEPKPEIGDQNQLSNLHEPPMELERLSSINDEEYNTRIGGESPFHPIPPPPLPFK-FARHGNFDSSGSNSSTPRAVSPDMEESEADGPPAAG
         PP SIASSEPKP I DQN L   HEPP+EL RL+S+NDEEY+TRIGGES FHPIPPPP P   F  HG+FDS GSNSSTPRAVSPDM+ESEADG PAAG
Subjt:  PPPPSIASSEPKPEIGDQNQLSNLHEPPMELERLSSINDEEYNTRIGGESPFHPIPPPPLPFK-FARHGNFDSSGSNSSTPRAVSPDMEESEADGPPAAG

Query:  QMRLLKDSATPMFCSSPDVNSKADKFIERFRADLKLQKMNSIKEKKARKRSNLGREPGPGP
        + + +K+S  PMFCSSPDVNSKADKFI RFRADLKLQKMNSIKEK ARKRSNLGR PGPGP
Subjt:  QMRLLKDSATPMFCSSPDVNSKADKFIERFRADLKLQKMNSIKEKKARKRSNLGREPGPGP

XP_022143734.1 uncharacterized protein C6orf132 homolog [Momordica charantia]0.0100Show/hide
Query:  MEGDGDAPPPFWLQSSNSLHHLDYNRRPRRRLSRASSFLLNSSAFLIVLLVIVLCFILIVIPKFVHFGSQLLRPQSVKKSWDSLNLVLVLFAIVCGFLSR
        MEGDGDAPPPFWLQSSNSLHHLDYNRRPRRRLSRASSFLLNSSAFLIVLLVIVLCFILIVIPKFVHFGSQLLRPQSVKKSWDSLNLVLVLFAIVCGFLSR
Subjt:  MEGDGDAPPPFWLQSSNSLHHLDYNRRPRRRLSRASSFLLNSSAFLIVLLVIVLCFILIVIPKFVHFGSQLLRPQSVKKSWDSLNLVLVLFAIVCGFLSR

Query:  NTGDDSKASFEDRSVSSKRTGKSNPTTPRRWYGYSDDRPTHYTLNRMRSSSSYPDLRLQESLLDAGDERWRFYDDTHVHNHRLPASDQLHRRREARPELE
        NTGDDSKASFEDRSVSSKRTGKSNPTTPRRWYGYSDDRPTHYTLNRMRSSSSYPDLRLQESLLDAGDERWRFYDDTHVHNHRLPASDQLHRRREARPELE
Subjt:  NTGDDSKASFEDRSVSSKRTGKSNPTTPRRWYGYSDDRPTHYTLNRMRSSSSYPDLRLQESLLDAGDERWRFYDDTHVHNHRLPASDQLHRRREARPELE

Query:  HEDFGARSTGFDRSGIREEVYSKPRMLSPPRSPPPPPPTLPPPPKTTPTVVKRRPMRTHKVHSHTPDGEIDRQHKNGDSDVADCQRIQLPPLSPPAFYQE
        HEDFGARSTGFDRSGIREEVYSKPRMLSPPRSPPPPPPTLPPPPKTTPTVVKRRPMRTHKVHSHTPDGEIDRQHKNGDSDVADCQRIQLPPLSPPAFYQE
Subjt:  HEDFGARSTGFDRSGIREEVYSKPRMLSPPRSPPPPPPTLPPPPKTTPTVVKRRPMRTHKVHSHTPDGEIDRQHKNGDSDVADCQRIQLPPLSPPAFYQE

Query:  SEQKIGKNEKKRGGPTKEFWTTALRRRKKKQRQKSIESFETILSSQRPSTSSLPPASPPPPPPLPPPSVFQNLFSSKKGKGRKGQSLTLPEPPPPSIASS
        SEQKIGKNEKKRGGPTKEFWTTALRRRKKKQRQKSIESFETILSSQRPSTSSLPPASPPPPPPLPPPSVFQNLFSSKKGKGRKGQSLTLPEPPPPSIASS
Subjt:  SEQKIGKNEKKRGGPTKEFWTTALRRRKKKQRQKSIESFETILSSQRPSTSSLPPASPPPPPPLPPPSVFQNLFSSKKGKGRKGQSLTLPEPPPPSIASS

Query:  EPKPEIGDQNQLSNLHEPPMELERLSSINDEEYNTRIGGESPFHPIPPPPLPFKFARHGNFDSSGSNSSTPRAVSPDMEESEADGPPAAGQMRLLKDSAT
        EPKPEIGDQNQLSNLHEPPMELERLSSINDEEYNTRIGGESPFHPIPPPPLPFKFARHGNFDSSGSNSSTPRAVSPDMEESEADGPPAAGQMRLLKDSAT
Subjt:  EPKPEIGDQNQLSNLHEPPMELERLSSINDEEYNTRIGGESPFHPIPPPPLPFKFARHGNFDSSGSNSSTPRAVSPDMEESEADGPPAAGQMRLLKDSAT

Query:  PMFCSSPDVNSKADKFIERFRADLKLQKMNSIKEKKARKRSNLGREPGPGP
        PMFCSSPDVNSKADKFIERFRADLKLQKMNSIKEKKARKRSNLGREPGPGP
Subjt:  PMFCSSPDVNSKADKFIERFRADLKLQKMNSIKEKKARKRSNLGREPGPGP

XP_022953834.1 protein enabled homolog [Cucurbita moschata]1.31e-28877.36Show/hide
Query:  MEGDGDAPPPFWLQSSNSLHHLDYNRRPRRRLSRASSFLLNSSAFLIVLLVIVLCFILIVIPKFVHFGSQLLRPQSVKKSWDSLNLVLVLFAIVCGFLSR
        ME DG+APPPFWLQ SNSLH LD +RR R RLSRASSFLLNSSAFL+VLLVIVLCFI IVIPKFV FGSQL+RPQS+KKSWDSLNLVLVLFAIVCGFLSR
Subjt:  MEGDGDAPPPFWLQSSNSLHHLDYNRRPRRRLSRASSFLLNSSAFLIVLLVIVLCFILIVIPKFVHFGSQLLRPQSVKKSWDSLNLVLVLFAIVCGFLSR

Query:  NTGDDSKASFEDRSVSSKRTGKSNPTTPRRWYGYSDDRPTHYTLNRMRSSSSYPDLRLQESLLDAGDERWRFYDDTHVHNHRLPASDQLHRRREARPELE
        N GDDS+ SFEDRSVSS+RT K+NP  PR+W GY+D RP HYT+NRMRSSSSYPDLRLQES L AGD+R R YDDTHV N+R P SDQL+RRREARPELE
Subjt:  NTGDDSKASFEDRSVSSKRTGKSNPTTPRRWYGYSDDRPTHYTLNRMRSSSSYPDLRLQESLLDAGDERWRFYDDTHVHNHRLPASDQLHRRREARPELE

Query:  HEDFGARSTGFDRSGIREEVYSKPRMLSPPRSPPP---------PPPTLPPPPKTTPTVVKRRPMRTHKVHSHTPDGEIDRQHKNGDSDVADCQRIQLPP
         ED   +S GFDRS IRE+VYS+  + SPPRSPPP         PPPT PPP  TTP VVKRRP RTHKVHSHTP GEID+ +KNGDSDVA+ QRI LPP
Subjt:  HEDFGARSTGFDRSGIREEVYSKPRMLSPPRSPPP---------PPPTLPPPPKTTPTVVKRRPMRTHKVHSHTPDGEIDRQHKNGDSDVADCQRIQLPP

Query:  LSPPAFYQESEQKIGKNEKKRGGPTKEFWTTALRRRKKKQRQKSIESFETILSSQRPSTSSLPPASPPPPPPLPPPSVFQNLFSSKKGKGRKGQSLTLPE
        LSPP FY+ESEQK  KNEKKRGG  KE W+ ALRRR+KKQRQKSIESFE I++SQRPSTSSLPP SPPPPPPLP PSV Q LF+SKKG+G+K QS   PE
Subjt:  LSPPAFYQESEQKIGKNEKKRGGPTKEFWTTALRRRKKKQRQKSIESFETILSSQRPSTSSLPPASPPPPPPLPPPSVFQNLFSSKKGKGRKGQSLTLPE

Query:  PPPPSIASSEPKPEIGDQNQLSNLHEPPMELERLSSINDEEYNTRIGGESPFHPIPPPPLPFK-FARHGNFDSSGSNSSTPRAVSPDMEESEADGPPAAG
         PP SIASSEPKP I DQN L   HEPP+EL RL+S+NDEEY+TRIGGESPFHPIPPPP P   F  HG+FDS GSNSSTPRAVSPDM+ESEADG PAAG
Subjt:  PPPPSIASSEPKPEIGDQNQLSNLHEPPMELERLSSINDEEYNTRIGGESPFHPIPPPPLPFK-FARHGNFDSSGSNSSTPRAVSPDMEESEADGPPAAG

Query:  QMRLLKDSATPMFCSSPDVNSKADKFIERFRADLKLQKMNSIKEKKARKRSNLGREPGPGP
        + +L+KDS  PMFCSSPDVNSKADKFI RFRADLKLQKMNSIKEK ARKRSNLGR PGPGP
Subjt:  QMRLLKDSATPMFCSSPDVNSKADKFIERFRADLKLQKMNSIKEKKARKRSNLGREPGPGP

XP_023548433.1 protein enabled homolog [Cucurbita pepo subsp. pepo]8.36e-29277.72Show/hide
Query:  MEGDGDAPPPFWLQSSNSLHHLDYNRRPRRRLSRASSFLLNSSAFLIVLLVIVLCFILIVIPKFVHFGSQLLRPQSVKKSWDSLNLVLVLFAIVCGFLSR
        ME DG+APPPFWLQ SNSLH LD +RR R RLSRASSFLLNSSAFL+VLLVIVLCFI IVIPKFV FGSQL+RPQSVKKSWDSLNLVLVLFAIVCGFLSR
Subjt:  MEGDGDAPPPFWLQSSNSLHHLDYNRRPRRRLSRASSFLLNSSAFLIVLLVIVLCFILIVIPKFVHFGSQLLRPQSVKKSWDSLNLVLVLFAIVCGFLSR

Query:  NTGDDSKASFEDRSVSSKRTGKSNPTTPRRWYGYSDDRPTHYTLNRMRSSSSYPDLRLQESLLDAGDERWRFYDDTHVHNHRLPASDQLHRRREARPELE
        N G+DS+ SFEDRSVSS+RT KSNP  PR+W GY+D RP HYT+NRMRSSSSYPDLRLQES LDAGD++WR YDDTHV N+R P+SDQLHRRREARPELE
Subjt:  NTGDDSKASFEDRSVSSKRTGKSNPTTPRRWYGYSDDRPTHYTLNRMRSSSSYPDLRLQESLLDAGDERWRFYDDTHVHNHRLPASDQLHRRREARPELE

Query:  HEDFGARSTGFDRSGIREEVYSKPRMLSPPRSPPP---------PPPTLPPPPKTTPTVVKRRPMRTHKVHSHTPDGEIDRQHKNGDSDVADCQRIQLPP
         ED   +S GFDRS +RE+VYS+  + SPPRSPPP         PPPT PPP  TTP VVKRRP RTHKVHSHTP GEID+ +KNGDSDVA+ QRI LPP
Subjt:  HEDFGARSTGFDRSGIREEVYSKPRMLSPPRSPPP---------PPPTLPPPPKTTPTVVKRRPMRTHKVHSHTPDGEIDRQHKNGDSDVADCQRIQLPP

Query:  LSPPAFYQESEQKIGKNEKKRGGPTKEFWTTALRRRKKKQRQKSIESFETILSSQRPSTSSLPPASPPPPPPLPPPSVFQNLFSSKKGKGRKGQSLTLPE
        LSPP FY+ESEQK  KN+KKRGG  KE W+ ALRRR+KKQRQKSIESFE I++SQRPSTSSLPP SPPPPPPLP PSV Q LF+SKKGKG+K QS   PE
Subjt:  LSPPAFYQESEQKIGKNEKKRGGPTKEFWTTALRRRKKKQRQKSIESFETILSSQRPSTSSLPPASPPPPPPLPPPSVFQNLFSSKKGKGRKGQSLTLPE

Query:  PPPPSIASSEPKPEIGDQNQLSNLHEPPMELERLSSINDEEYNTRIGGESPFHPIPPPPLPFK-FARHGNFDSSGSNSSTPRAVSPDMEESEADGPPAAG
         PP SIAS EPKP I DQN L   HEPP+EL RLSS+NDEEY+TRIGGESPFHPIPPPP P   F  HG+FDS GSNSSTPRAVSPDM ESEADG PAAG
Subjt:  PPPPSIASSEPKPEIGDQNQLSNLHEPPMELERLSSINDEEYNTRIGGESPFHPIPPPPLPFK-FARHGNFDSSGSNSSTPRAVSPDMEESEADGPPAAG

Query:  QMRLLKDSATPMFCSSPDVNSKADKFIERFRADLKLQKMNSIKEKKARKRSNLGREPGPGP
        + +L+KDS  PMFCSSPDVNSKADKFI RFRADLKLQKMNSIKEK ARKRSNLGR PGPGP
Subjt:  QMRLLKDSATPMFCSSPDVNSKADKFIERFRADLKLQKMNSIKEKKARKRSNLGREPGPGP

XP_038896222.1 serine/arginine repetitive matrix protein 1-like [Benincasa hispida]4.37e-29377.88Show/hide
Query:  MEGDGDAPPPFWLQSSNSLHHLDYNRRPRRRLSRASSFLLNSSAFLIVLLVIVLCFILIVIPKFVHFGSQLLRPQSVKKSWDSLNLVLVLFAIVCGFLSR
        ME DG+APPPFWLQSSNSLH LDYNRR  RRLSRASSFLLNSSAFLIVLLVIVLCFILIVIPKFV F SQL+RPQSVKKSWDSLNL+LVLFAIVCGFLSR
Subjt:  MEGDGDAPPPFWLQSSNSLHHLDYNRRPRRRLSRASSFLLNSSAFLIVLLVIVLCFILIVIPKFVHFGSQLLRPQSVKKSWDSLNLVLVLFAIVCGFLSR

Query:  NTGDDSKASFEDRSVSSKRTGKSNPTTPRRWYGYSDDRPTHYTLNRMRSSSSYPDLRLQESLLDAGDERWRFYDDTHVHNHRLPASDQLHRRREARPELE
        NTGDDS+ASFED SVSS+RT KSNPTTPRRW GY+D RP HYTLNRMRSSSSYPDLRLQES  DAGD RWRFYDDTHV NHR  +SDQLHRRRE RPELE
Subjt:  NTGDDSKASFEDRSVSSKRTGKSNPTTPRRWYGYSDDRPTHYTLNRMRSSSSYPDLRLQESLLDAGDERWRFYDDTHVHNHRLPASDQLHRRREARPELE

Query:  HEDFGARSTGFDRSGIREEVYSKPRMLSPPR--SPPP---------PPPTLPPPPKTTPT--VVKRRPMRTHKVHSHTPDGEIDRQHKNGDSDVADCQRI
          D  A+S GFDRS IRE+VYS+P + SPPR  SPPP         PPPT PPP  TTP   VVKRRP RTHKVHSHTPD EID+Q++NGDSDVA+ QRI
Subjt:  HEDFGARSTGFDRSGIREEVYSKPRMLSPPR--SPPP---------PPPTLPPPPKTTPT--VVKRRPMRTHKVHSHTPDGEIDRQHKNGDSDVADCQRI

Query:  QLPPLSPPAFYQESEQKIGKNEKKRGGPTKEFWTTALRRRKKKQRQKSIESFETILSSQRPSTSSLPPASPPPPPPLPPPSVFQNLFSSKKGKGRKGQSL
        QLPPLSPP+FY+ESEQK  +NEKKRGG +KE W+ ALRRRKKKQRQKSIESFE I++SQR ST    P+SPPPPPPLP PSV QNLFSSKKGKG+K QS 
Subjt:  QLPPLSPPAFYQESEQKIGKNEKKRGGPTKEFWTTALRRRKKKQRQKSIESFETILSSQRPSTSSLPPASPPPPPPLPPPSVFQNLFSSKKGKGRKGQSL

Query:  TLPEPPPPSIASSEPKPEIGDQNQLSNLHEPPMELERLSSINDEEYNTRIGGESPFHPIPPPPLPFK-FARHGNFDSSGSNSSTPRAVSPDMEESEADGP
          PEPP    ASSEPKP+  D+NQ+   HEPPMEL+RLSS+NDEEYNTRIGGESP+HPIPPPP P   F  HG+FDS GSNSSTPRA+SP+M+ESEADGP
Subjt:  TLPEPPPPSIASSEPKPEIGDQNQLSNLHEPPMELERLSSINDEEYNTRIGGESPFHPIPPPPLPFK-FARHGNFDSSGSNSSTPRAVSPDMEESEADGP

Query:  PAAGQMRLLKDSATPMFCSSPDVNSKADKFIERFRADLKLQKMNSIKEKKARKRSNLGREPGPGP
        PA G+ +L+KDS  P+FCSSPDVNSKADKFI RFRADLKLQKMNSIKEK ARKRSNLGR  GPGP
Subjt:  PAAGQMRLLKDSATPMFCSSPDVNSKADKFIERFRADLKLQKMNSIKEKKARKRSNLGREPGPGP

TrEMBL top hitse value%identityAlignment
A0A1S3CII2 LOW QUALITY PROTEIN: serine/arginine repetitive matrix protein 1-like2.00e-28275.44Show/hide
Query:  MEGDGDA-PPPFWLQSSNS-LHHLDYNRRPRRRLSRASSFLLNSSAFLIVLLVIVLCFILIVIPKFVHFGSQLLRPQSVKKSWDSLNLVLVLFAIVCGFL
        ME DG+A  PPFWLQSSNS LH L Y+RR  RRLSRASSFLLNSSAFLIVLLVIVLCFILIVIPKFV F SQL+RPQSVKKSWDSLNL+LVLFAIVCGFL
Subjt:  MEGDGDA-PPPFWLQSSNS-LHHLDYNRRPRRRLSRASSFLLNSSAFLIVLLVIVLCFILIVIPKFVHFGSQLLRPQSVKKSWDSLNLVLVLFAIVCGFL

Query:  SRNTG-DDSKASFEDRSVSSKRTGKSNPTTPRRWYGYSDDRPTHYTLNRMRSSSSYPDLRLQESLLDAGDERWRFYDDTHVHNHRLPASDQLHRRREARP
         RN G DDS+ SFEDRSVSS+R+ KSNPTTPRRW GY+D RP H+TLNRMRSSSSYPDLRLQES  DAGD RWRFYDDTHV NHR  +SDQLHRRRE +P
Subjt:  SRNTG-DDSKASFEDRSVSSKRTGKSNPTTPRRWYGYSDDRPTHYTLNRMRSSSSYPDLRLQESLLDAGDERWRFYDDTHVHNHRLPASDQLHRRREARP

Query:  ELEHEDFGARSTGFDRSGIREEVYSKPRMLSPPRSPPP---------PPPTLPPPPKTTPTVVKRRPMRTHKVHSHTPDGEIDRQHKNGDSDVADCQRIQ
        ELE +D  A+S  FDRS IR+ VYS+P + SPPRSPPP         PPPT PPP  T P +VKRRP RTHKVHSHTP+ EI++QH+NGDSDVA+ QRIQ
Subjt:  ELEHEDFGARSTGFDRSGIREEVYSKPRMLSPPRSPPP---------PPPTLPPPPKTTPTVVKRRPMRTHKVHSHTPDGEIDRQHKNGDSDVADCQRIQ

Query:  LPPLSPPAFYQESEQKIGKNEKKRGGPTKEFWTTALRRRKKKQRQKSIESFETILSSQRPSTSSLPPASPPPPPP--LPPPSVFQNLFSSKKGKGRKGQS
        LPPLSPP FY+ESEQK  KNEKKR G +KE W+ ALRRRKKKQRQKS+ESFE I++SQR STSSLPP SPPPPPP  LP PSV QNLFSS+KGK +K QS
Subjt:  LPPLSPPAFYQESEQKIGKNEKKRGGPTKEFWTTALRRRKKKQRQKSIESFETILSSQRPSTSSLPPASPPPPPP--LPPPSVFQNLFSSKKGKGRKGQS

Query:  LTLPEPPPPSIASSEPKPEIGDQNQLSNLHEPPMELERLSSINDEEYNTRIGGESPFHPIPPPPLPFK-FARHGNFDSSGSNSSTPRAVSPDMEESEADG
         +LP+PPPPSIASSEPKP+  DQNQ+    +PPMEL+RLSS+NDEEY+TRIGGESP+HPIPPPP P   F  HG+FDS GSNSSTPRA+SP+M+ESEAD 
Subjt:  LTLPEPPPPSIASSEPKPEIGDQNQLSNLHEPPMELERLSSINDEEYNTRIGGESPFHPIPPPPLPFK-FARHGNFDSSGSNSSTPRAVSPDMEESEADG

Query:  PPAAGQMRLLKDSATPMFCSSPDVNSKADKFIERFRADLKLQKMNSIKEKKARKRSNLGREPGPGP
        PPA  + +L+KD   PMFCSSPDVNSKADKFI RFRADLKLQKMNSIKEK  RKRSNLGR  GPGP
Subjt:  PPAAGQMRLLKDSATPMFCSSPDVNSKADKFIERFRADLKLQKMNSIKEKKARKRSNLGREPGPGP

A0A5A7V0Q3 Serine/arginine repetitive matrix protein 1-like1.15e-28175.27Show/hide
Query:  MEGDGDA-PPPFWLQSSNS-LHHLDYNRRPRRRLSRASSFLLNSSAFLIVLLVIVLCFILIVIPKFVHFGSQLLRPQSVKKSWDSLNLVLVLFAIVCGFL
        ME DG+A  PPFWLQSSNS LH L Y+RR  RRLSRASSFLLNSSAFLIVLLVIVLCFILIVIPKFV F SQL+RPQSVKKSWDSLNL+LVLFAIVCGFL
Subjt:  MEGDGDA-PPPFWLQSSNS-LHHLDYNRRPRRRLSRASSFLLNSSAFLIVLLVIVLCFILIVIPKFVHFGSQLLRPQSVKKSWDSLNLVLVLFAIVCGFL

Query:  SRNTG-DDSKASFEDRSVSSKRTGKSNPTTPRRWYGYSDDRPTHYTLNRMRSSSSYPDLRLQESLLDAGDERWRFYDDTHVHNHRLPASDQLHRRREARP
         RN G DDS+ SFEDRSVSS+R+ KSNPTTPRRW GY+D RP H+TLNRMRSSSSYPDLRLQES  DAGD +WRFYDDTHV NHR  +SDQLHRRRE +P
Subjt:  SRNTG-DDSKASFEDRSVSSKRTGKSNPTTPRRWYGYSDDRPTHYTLNRMRSSSSYPDLRLQESLLDAGDERWRFYDDTHVHNHRLPASDQLHRRREARP

Query:  ELEHEDFGARSTGFDRSGIREEVYSKPRMLSPPRSPPP---------PPPTLPPPPKTTPTVVKRRPMRTHKVHSHTPDGEIDRQHKNGDSDVADCQRIQ
        ELE +D  A+S  FDRS IR+ VYS+P + SPPRSPPP         PPPT PPP  T P +VKRRP RTHKVHSHTP+ EI++QH+NGDSDVA+ QRIQ
Subjt:  ELEHEDFGARSTGFDRSGIREEVYSKPRMLSPPRSPPP---------PPPTLPPPPKTTPTVVKRRPMRTHKVHSHTPDGEIDRQHKNGDSDVADCQRIQ

Query:  LPPLSPPAFYQESEQKIGKNEKKRGGPTKEFWTTALRRRKKKQRQKSIESFETILSSQRPSTSSLPPASPPPPPP--LPPPSVFQNLFSSKKGKGRKGQS
        LPPLSPP FY+ESEQK  KNEKKR G +KE W+ ALRRRKKKQRQKS+ESFE I++SQR STSSLPP SPPPPPP  LP PSV QNLFSS+KGK +K QS
Subjt:  LPPLSPPAFYQESEQKIGKNEKKRGGPTKEFWTTALRRRKKKQRQKSIESFETILSSQRPSTSSLPPASPPPPPP--LPPPSVFQNLFSSKKGKGRKGQS

Query:  LTLPEPPPPSIASSEPKPEIGDQNQLSNLHEPPMELERLSSINDEEYNTRIGGESPFHPIPPPPLPFK-FARHGNFDSSGSNSSTPRAVSPDMEESEADG
         +LP+PPPPSIASSEPKP+  DQNQ+    +PPMEL+RLSS+NDEEY+TRIGGESP+HPIPPPP P   F  HG+FDS GSNSSTPRA+SP+M+ESEAD 
Subjt:  LTLPEPPPPSIASSEPKPEIGDQNQLSNLHEPPMELERLSSINDEEYNTRIGGESPFHPIPPPPLPFK-FARHGNFDSSGSNSSTPRAVSPDMEESEADG

Query:  PPAAGQMRLLKDSATPMFCSSPDVNSKADKFIERFRADLKLQKMNSIKEKKARKRSNLGREPGPGP
        PPA  + +L+KD   PMFCSSPDVNSKADKFI RFRADLKLQKMNSIKEK  RKRSNLGR  GPGP
Subjt:  PPAAGQMRLLKDSATPMFCSSPDVNSKADKFIERFRADLKLQKMNSIKEKKARKRSNLGREPGPGP

A0A6J1CQ76 uncharacterized protein C6orf132 homolog0.0100Show/hide
Query:  MEGDGDAPPPFWLQSSNSLHHLDYNRRPRRRLSRASSFLLNSSAFLIVLLVIVLCFILIVIPKFVHFGSQLLRPQSVKKSWDSLNLVLVLFAIVCGFLSR
        MEGDGDAPPPFWLQSSNSLHHLDYNRRPRRRLSRASSFLLNSSAFLIVLLVIVLCFILIVIPKFVHFGSQLLRPQSVKKSWDSLNLVLVLFAIVCGFLSR
Subjt:  MEGDGDAPPPFWLQSSNSLHHLDYNRRPRRRLSRASSFLLNSSAFLIVLLVIVLCFILIVIPKFVHFGSQLLRPQSVKKSWDSLNLVLVLFAIVCGFLSR

Query:  NTGDDSKASFEDRSVSSKRTGKSNPTTPRRWYGYSDDRPTHYTLNRMRSSSSYPDLRLQESLLDAGDERWRFYDDTHVHNHRLPASDQLHRRREARPELE
        NTGDDSKASFEDRSVSSKRTGKSNPTTPRRWYGYSDDRPTHYTLNRMRSSSSYPDLRLQESLLDAGDERWRFYDDTHVHNHRLPASDQLHRRREARPELE
Subjt:  NTGDDSKASFEDRSVSSKRTGKSNPTTPRRWYGYSDDRPTHYTLNRMRSSSSYPDLRLQESLLDAGDERWRFYDDTHVHNHRLPASDQLHRRREARPELE

Query:  HEDFGARSTGFDRSGIREEVYSKPRMLSPPRSPPPPPPTLPPPPKTTPTVVKRRPMRTHKVHSHTPDGEIDRQHKNGDSDVADCQRIQLPPLSPPAFYQE
        HEDFGARSTGFDRSGIREEVYSKPRMLSPPRSPPPPPPTLPPPPKTTPTVVKRRPMRTHKVHSHTPDGEIDRQHKNGDSDVADCQRIQLPPLSPPAFYQE
Subjt:  HEDFGARSTGFDRSGIREEVYSKPRMLSPPRSPPPPPPTLPPPPKTTPTVVKRRPMRTHKVHSHTPDGEIDRQHKNGDSDVADCQRIQLPPLSPPAFYQE

Query:  SEQKIGKNEKKRGGPTKEFWTTALRRRKKKQRQKSIESFETILSSQRPSTSSLPPASPPPPPPLPPPSVFQNLFSSKKGKGRKGQSLTLPEPPPPSIASS
        SEQKIGKNEKKRGGPTKEFWTTALRRRKKKQRQKSIESFETILSSQRPSTSSLPPASPPPPPPLPPPSVFQNLFSSKKGKGRKGQSLTLPEPPPPSIASS
Subjt:  SEQKIGKNEKKRGGPTKEFWTTALRRRKKKQRQKSIESFETILSSQRPSTSSLPPASPPPPPPLPPPSVFQNLFSSKKGKGRKGQSLTLPEPPPPSIASS

Query:  EPKPEIGDQNQLSNLHEPPMELERLSSINDEEYNTRIGGESPFHPIPPPPLPFKFARHGNFDSSGSNSSTPRAVSPDMEESEADGPPAAGQMRLLKDSAT
        EPKPEIGDQNQLSNLHEPPMELERLSSINDEEYNTRIGGESPFHPIPPPPLPFKFARHGNFDSSGSNSSTPRAVSPDMEESEADGPPAAGQMRLLKDSAT
Subjt:  EPKPEIGDQNQLSNLHEPPMELERLSSINDEEYNTRIGGESPFHPIPPPPLPFKFARHGNFDSSGSNSSTPRAVSPDMEESEADGPPAAGQMRLLKDSAT

Query:  PMFCSSPDVNSKADKFIERFRADLKLQKMNSIKEKKARKRSNLGREPGPGP
        PMFCSSPDVNSKADKFIERFRADLKLQKMNSIKEKKARKRSNLGREPGPGP
Subjt:  PMFCSSPDVNSKADKFIERFRADLKLQKMNSIKEKKARKRSNLGREPGPGP

A0A6J1GQY1 protein enabled homolog6.33e-28977.36Show/hide
Query:  MEGDGDAPPPFWLQSSNSLHHLDYNRRPRRRLSRASSFLLNSSAFLIVLLVIVLCFILIVIPKFVHFGSQLLRPQSVKKSWDSLNLVLVLFAIVCGFLSR
        ME DG+APPPFWLQ SNSLH LD +RR R RLSRASSFLLNSSAFL+VLLVIVLCFI IVIPKFV FGSQL+RPQS+KKSWDSLNLVLVLFAIVCGFLSR
Subjt:  MEGDGDAPPPFWLQSSNSLHHLDYNRRPRRRLSRASSFLLNSSAFLIVLLVIVLCFILIVIPKFVHFGSQLLRPQSVKKSWDSLNLVLVLFAIVCGFLSR

Query:  NTGDDSKASFEDRSVSSKRTGKSNPTTPRRWYGYSDDRPTHYTLNRMRSSSSYPDLRLQESLLDAGDERWRFYDDTHVHNHRLPASDQLHRRREARPELE
        N GDDS+ SFEDRSVSS+RT K+NP  PR+W GY+D RP HYT+NRMRSSSSYPDLRLQES L AGD+R R YDDTHV N+R P SDQL+RRREARPELE
Subjt:  NTGDDSKASFEDRSVSSKRTGKSNPTTPRRWYGYSDDRPTHYTLNRMRSSSSYPDLRLQESLLDAGDERWRFYDDTHVHNHRLPASDQLHRRREARPELE

Query:  HEDFGARSTGFDRSGIREEVYSKPRMLSPPRSPPP---------PPPTLPPPPKTTPTVVKRRPMRTHKVHSHTPDGEIDRQHKNGDSDVADCQRIQLPP
         ED   +S GFDRS IRE+VYS+  + SPPRSPPP         PPPT PPP  TTP VVKRRP RTHKVHSHTP GEID+ +KNGDSDVA+ QRI LPP
Subjt:  HEDFGARSTGFDRSGIREEVYSKPRMLSPPRSPPP---------PPPTLPPPPKTTPTVVKRRPMRTHKVHSHTPDGEIDRQHKNGDSDVADCQRIQLPP

Query:  LSPPAFYQESEQKIGKNEKKRGGPTKEFWTTALRRRKKKQRQKSIESFETILSSQRPSTSSLPPASPPPPPPLPPPSVFQNLFSSKKGKGRKGQSLTLPE
        LSPP FY+ESEQK  KNEKKRGG  KE W+ ALRRR+KKQRQKSIESFE I++SQRPSTSSLPP SPPPPPPLP PSV Q LF+SKKG+G+K QS   PE
Subjt:  LSPPAFYQESEQKIGKNEKKRGGPTKEFWTTALRRRKKKQRQKSIESFETILSSQRPSTSSLPPASPPPPPPLPPPSVFQNLFSSKKGKGRKGQSLTLPE

Query:  PPPPSIASSEPKPEIGDQNQLSNLHEPPMELERLSSINDEEYNTRIGGESPFHPIPPPPLPFK-FARHGNFDSSGSNSSTPRAVSPDMEESEADGPPAAG
         PP SIASSEPKP I DQN L   HEPP+EL RL+S+NDEEY+TRIGGESPFHPIPPPP P   F  HG+FDS GSNSSTPRAVSPDM+ESEADG PAAG
Subjt:  PPPPSIASSEPKPEIGDQNQLSNLHEPPMELERLSSINDEEYNTRIGGESPFHPIPPPPLPFK-FARHGNFDSSGSNSSTPRAVSPDMEESEADGPPAAG

Query:  QMRLLKDSATPMFCSSPDVNSKADKFIERFRADLKLQKMNSIKEKKARKRSNLGREPGPGP
        + +L+KDS  PMFCSSPDVNSKADKFI RFRADLKLQKMNSIKEK ARKRSNLGR PGPGP
Subjt:  QMRLLKDSATPMFCSSPDVNSKADKFIERFRADLKLQKMNSIKEKKARKRSNLGREPGPGP

A0A6J1KKJ2 serine/arginine repetitive matrix protein 1-like6.16e-28375.99Show/hide
Query:  MEGDGDAPPPFWLQSSNSLHHLDYNRRPRRRLSRASSFLLNSSAFLIVLLVIVLCFILIVIPKFVHFGSQLLRPQSVKKSWDSLNLVLVLFAIVCGFLSR
        MEGDG+A PPFWLQSSNS   + YNRR  RRLSRASSFLLNSSAFL VLLVIVLCF+LIVIPK V F SQL+RPQSVKKSWDSLNLVLVLFAIVCGFLSR
Subjt:  MEGDGDAPPPFWLQSSNSLHHLDYNRRPRRRLSRASSFLLNSSAFLIVLLVIVLCFILIVIPKFVHFGSQLLRPQSVKKSWDSLNLVLVLFAIVCGFLSR

Query:  NTGDDSKASFEDRSVSSKRTGKSNPTTPRRWYGYSDDRPTHYTLNRMRSSSSYPDLRLQESLLDAGDERWRFYDDTHVHNHRLPASDQLHRRREARPELE
        NTGDD++  FEDRSVSS+R  KSNPTTPR+W GY D RP HYT+NRMRSSSSYPDLRLQES LDAGDERWRFYDDTHV NHR  +SDQLHRR +ARPELE
Subjt:  NTGDDSKASFEDRSVSSKRTGKSNPTTPRRWYGYSDDRPTHYTLNRMRSSSSYPDLRLQESLLDAGDERWRFYDDTHVHNHRLPASDQLHRRREARPELE

Query:  HEDFGARSTGFDRSGIREEVYSKPRMLSPPRSPPP----PPPTLPPPPKTTPTVVKRRPMRTHKVHSHTPDGEIDRQHKNGDSDVADCQRIQLPPLSPPA
         ED GA+STGFDRS + E+VYS+P + SPPR PPP    PPPTL  P  TTP VVKRRP RTH VHSHTPDG ID+Q KN DSDVAD QRI LPPLSPP+
Subjt:  HEDFGARSTGFDRSGIREEVYSKPRMLSPPRSPPP----PPPTLPPPPKTTPTVVKRRPMRTHKVHSHTPDGEIDRQHKNGDSDVADCQRIQLPPLSPPA

Query:  FYQESEQKIGKNEKKRGGPTKEFWTTALRRRKKKQRQKSIESFETILSSQRPSTSSLPPASPPPPPPLPPPSVFQNLFSSKKGKGRKGQSLTLPEPPPPS
        FYQESEQK GKNEKKRGG  KE W+ ALRRRKKKQRQKS+ESFE I ++ R STSSLP ASPPPPPPLPPP V QNLF SKKGK +K QS   PE PPP+
Subjt:  FYQESEQKIGKNEKKRGGPTKEFWTTALRRRKKKQRQKSIESFETILSSQRPSTSSLPPASPPPPPPLPPPSVFQNLFSSKKGKGRKGQSLTLPEPPPPS

Query:  IASSEPKPEIGDQNQLSNLHEPPMELERLSSINDEEYNTRIGGESPFH---PIPPPPLPFKFARHGNFDSSGSNSSTPRAVSPDMEESEADGPPAAGQMR
        I +SEPKPEI  QN     ++PPMELERLSS+NDEEYNTRIG +SPFH   P PPPP P  F  HG+FDS+GSNSSTPRA+SP+++ESE DGPPAAG+M+
Subjt:  IASSEPKPEIGDQNQLSNLHEPPMELERLSSINDEEYNTRIGGESPFH---PIPPPPLPFKFARHGNFDSSGSNSSTPRAVSPDMEESEADGPPAAGQMR

Query:  LLKDSATPMFCSSPDVNSKADKFIERFRADLKLQKMNSIKEKKARKRSNLGREPGPGP
        + + S TP+FCSSPDVNSKADKFI RF+ADLKLQKMNSIKE+ ARKRSNLGR  GPGP
Subjt:  LLKDSATPMFCSSPDVNSKADKFIERFRADLKLQKMNSIKEKKARKRSNLGREPGPGP

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G72790.1 hydroxyproline-rich glycoprotein family protein8.7e-7038.36Show/hide
Query:  EGDGDAPPPFWLQS--SNSLHHLDYNRRPRRRLSRASSFLLNSSAFLIVLLVIVLCFILIVIPKFVHFGSQLLRPQSVKKSWDSLNLVLVLFAIVCGFLS
        E DGDA  PFWLQS  +N+      +   R        F   ++A LIV         + +IP F    SQ+ RP  V+KSWD LN VLVLFA++CGFLS
Subjt:  EGDGDAPPPFWLQS--SNSLHHLDYNRRPRRRLSRASSFLLNSSAFLIVLLVIVLCFILIVIPKFVHFGSQLLRPQSVKKSWDSLNLVLVLFAIVCGFLS

Query:  RNTGDDSKASFEDRSVSSK----------RTGKSNP-TTPRRWY----GYSDDRPTHYTLNRMRSSSSYPDLRLQESLLDAGDERWRFYDDTHVHNHRLP
        RNT +D     ++  + +K          R+  SN  TTPR W     G   D+  +   +R+RS SSYPDLRL+E      DERWRFYDDT V   R  
Subjt:  RNTGDDSKASFEDRSVSSK----------RTGKSNP-TTPRRWY----GYSDDRPTHYTLNRMRSSSSYPDLRLQESLLDAGDERWRFYDDTHVHNHRLP

Query:  ASDQLHRRR-------EARPELEHEDFGARSTGFDRSGIRE----------------EVYSKPRMLSPPRSPPPPPPTLPPPPKTTPTVVKRRPMRTHKV
          D ++  +       E +P  E  D        + S +R                 EV  + ++ S P   P PPP+ P PP       K+   +T++V
Subjt:  ASDQLHRRR-------EARPELEHEDFGARSTGFDRSGIRE----------------EVYSKPRMLSPPRSPPPPPPTLPPPPKTTPTVVKRRPMRTHKV

Query:  HSHTPDGEIDRQHKNGDSDVADCQRIQLPPLSPPAFYQESEQKIGKNEKKRGGPTKEFWTTALRRRKKKQRQKSIESFETILSSQRPSTSSLPPASPPPP
        +    D     + K  D  VA        P+ PPA      QK  K EKK+GG TK+F   ALRR+KKKQRQ+SI+  + +  S  P   S      PPP
Subjt:  HSHTPDGEIDRQHKNGDSDVADCQRIQLPPLSPPAFYQESEQKIGKNEKKRGGPTKEFWTTALRRRKKKQRQKSIESFETILSSQRPSTSSLPPASPPPP

Query:  PPLPPPSVFQNLFSSKKGKGRKGQSLTLPEPPPPSIASSEPKPEIGDQNQLSNLHEPPMELERLSSINDEEYNTR---IGGESPFHPIPPPPLP------
        PP PPP  FQ LFSSKKGK +K  S   P PPPP      P+     +   S L + P+E  R S  N     T+    G ESP  PIPPPP P      
Subjt:  PPLPPPSVFQNLFSSKKGKGRKGQSLTLPEPPPPSIASSEPKPEIGDQNQLSNLHEPPMELERLSSINDEEYNTR---IGGESPFHPIPPPPLP------

Query:  -FKFARHGNFDSSGSNSSTPRAVSPDMEESEADGPPAAGQMRLLKDSATPMFCSSPDVNSKADKFIERFRADLKLQKMNSIKEKKARKRSNLGREPG
         +KF + G++    S+ S    +S D E  + D   +AG     K++A  MFC SPDV++KAD FI RFRA LKL+KMNS+K    R RSNLG EPG
Subjt:  -FKFARHGNFDSSGSNSSTPRAVSPDMEESEADGPPAAGQMRLLKDSATPMFCSSPDVNSKADKFIERFRADLKLQKMNSIKEKKARKRSNLGREPG

AT5G57070.1 hydroxyproline-rich glycoprotein family protein1.6e-3930.08Show/hide
Query:  DAPPPFWLQSSNSLHHLDYNRRPRRRLSRASSFLLNSSAFLIVLLVIVLCFILIVIPKFVHFGSQLLRPQSVKKSWDSLNLVLVLFAIVCGFLSRNTGD-
        D PP  W Q        D     RRR S  +  +L  +   +    I L F+  V+P F+   SQ+L+P SVK+ WDS+N+VLV+FAI+CG L+R   D 
Subjt:  DAPPPFWLQSSNSLHHLDYNRRPRRRLSRASSFLLNSSAFLIVLLVIVLCFILIVIPKFVHFGSQLLRPQSVKKSWDSLNLVLVLFAIVCGFLSRNTGD-

Query:  ---DSKASFEDRSVSS-------------KRTGKSNPTTPRRWYG--YSDDR----------------PTHYTLNRMRSSSSYPDLRLQESLLDAGDERW
           +S    E+  V                +   S+ T   +W+   Y  DR                P    +   RSSSSYPDLR Q    + GD R+
Subjt:  ---DSKASFEDRSVSS-------------KRTGKSNPTTPRRWYG--YSDDR----------------PTHYTLNRMRSSSSYPDLRLQESLLDAGDERW

Query:  RFYDDTHVHNHRLPASDQLHR-RREARPELEHEDFGARSTGFDRSGIREEVYSKPRMLSPPRSPPPPPPTLPPPPKTTPTVVKRRPMRTHKVHSHTPDGE
        RFYDD  +  +R   S    + +  ++ E+E E+   +    D   ++          SPP+ PP  PP  PPPP   P  V ++P RTH+   +     
Subjt:  RFYDDTHVHNHRLPASDQLHR-RREARPELEHEDFGARSTGFDRSGIREEVYSKPRMLSPPRSPPPPPPTLPPPPKTTPTVVKRRPMRTHKVHSHTPDGE

Query:  IDRQHKNGDSDVADCQRIQLPPLSP---------PAFYQESEQKIGKNEKKRGGPTKE----FWTTALRRRKKKQRQKS-----IESFETILSSQRPS--
         D Q     S+    +  Q PP  P         P       +K G  ++++    KE    F +   + +KKK+ QKS     IES   +     P   
Subjt:  IDRQHKNGDSDVADCQRIQLPPLSP---------PAFYQESEQKIGKNEKKRGGPTKE----FWTTALRRRKKKQRQKS-----IESFETILSSQRPS--

Query:  TSSLPPASPPPPPPLPPP------SVFQNLFSSKKGKGRKGQSLTLPEPPPPSIASSEPKPEIGDQNQLSNLHEPPMELERLSSINDEEYNTRIGGESPF
         S +PP SPPPPPP PPP      SVF  LF       +K  S+  P PPPP    ++  P+   +   S     P + +  +  N+ + +  I    P 
Subjt:  TSSLPPASPPPPPPLPPP------SVFQNLFSSKKGKGRKGQSLTLPEPPPPSIASSEPKPEIGDQNQLSNLHEPPMELERLSSINDEEYNTRIGGESPF

Query:  HPIPPPPL---PFKFARHGNFDSSGSNSSTPRAVSPD---------MEESEADGPPAAGQMRLLKDSATPMFCSSPDVNSKADKFIERFRADLKLQKMNS
         P PPPP    P K+   G+F    SN S+ R  SP+         +E +++DG         +     P FC SPDV++KAD FI R R + +L K+NS
Subjt:  HPIPPPPL---PFKFARHGNFDSSGSNSSTPRAVSPD---------MEESEADGPPAAGQMRLLKDSATPMFCSSPDVNSKADKFIERFRADLKLQKMNS

Query:  IKEKK
        +  K+
Subjt:  IKEKK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGGGAGACGGCGACGCGCCGCCGCCATTCTGGCTCCAATCCTCCAACTCTCTTCACCATCTCGACTACAATCGCCGCCCCCGCCGCCGCCTCAGCCGCGCATCGTC
TTTCCTTCTCAACTCCAGCGCCTTTCTCATTGTTTTGTTAGTAATAGTTCTGTGTTTCATCTTGATTGTGATCCCTAAATTTGTGCACTTCGGTTCTCAATTGCTTCGAC
CTCAATCGGTCAAGAAGAGTTGGGATTCTCTCAATTTGGTTCTCGTTCTGTTCGCCATTGTCTGCGGATTTCTCAGCAGAAACACGGGCGATGATAGTAAAGCCTCTTTT
GAAGATCGGAGTGTTTCGTCGAAGCGAACCGGGAAGTCAAACCCTACGACTCCGCGCCGATGGTATGGATATTCCGATGATCGGCCGACTCATTACACTCTCAATCGGAT
GAGGAGTAGTAGTTCGTATCCGGATCTACGTCTGCAGGAGTCTTTGTTGGATGCCGGCGATGAACGGTGGCGATTTTACGACGACACTCATGTGCATAATCATCGGCTTC
CGGCCTCCGATCAGCTTCATCGTCGTCGTGAAGCTCGGCCGGAGCTTGAGCACGAAGATTTCGGTGCCAGAAGTACAGGTTTCGACAGATCTGGGATTCGTGAGGAAGTC
TATTCAAAACCGCGAATGCTTTCACCGCCGCGCTCGCCGCCGCCGCCGCCCCCAACCCTTCCCCCTCCCCCTAAGACGACCCCTACAGTGGTTAAACGGAGGCCAATGAG
AACGCATAAGGTCCATAGCCATACGCCCGACGGAGAAATCGATCGACAGCACAAGAACGGCGATTCAGACGTCGCGGATTGTCAACGGATTCAGCTTCCACCACTCTCAC
CGCCGGCATTTTATCAGGAATCGGAGCAGAAGATCGGCAAAAATGAGAAGAAGAGAGGCGGACCTACAAAAGAATTTTGGACAACCGCACTGAGGAGGAGGAAGAAGAAG
CAAAGACAAAAGAGCATCGAAAGCTTCGAGACTATCCTCAGCTCCCAGCGCCCTTCTACATCGTCATTACCACCAGCGTCACCTCCGCCTCCTCCGCCGCTTCCTCCGCC
GTCAGTTTTTCAAAACCTATTTTCTTCCAAGAAAGGGAAAGGCAGAAAAGGACAATCATTAACTTTACCAGAGCCGCCTCCACCATCAATAGCCTCCTCAGAACCTAAAC
CAGAGATCGGAGATCAAAATCAGCTCTCCAACCTTCACGAGCCTCCAATGGAGCTCGAGAGACTGAGCAGTATAAACGACGAAGAATACAATACACGCATTGGCGGTGAG
TCGCCGTTCCATCCGATTCCTCCACCGCCGCTGCCTTTCAAATTCGCGAGACACGGAAACTTTGACAGTTCTGGAAGCAATAGCAGTACGCCGAGAGCCGTTTCGCCGGA
CATGGAGGAGAGTGAAGCCGATGGCCCACCGGCGGCCGGCCAAATGAGGCTGTTGAAAGATTCCGCAACTCCGATGTTTTGCTCAAGCCCGGATGTTAACAGTAAAGCCG
ATAAGTTCATCGAAAGATTCAGAGCGGATTTGAAGTTGCAGAAGATGAATTCCATCAAGGAGAAGAAGGCGAGGAAGAGATCTAACCTAGGCCGAGAACCAGGCCCAGGC
CCA
mRNA sequenceShow/hide mRNA sequence
ATGGAGGGAGACGGCGACGCGCCGCCGCCATTCTGGCTCCAATCCTCCAACTCTCTTCACCATCTCGACTACAATCGCCGCCCCCGCCGCCGCCTCAGCCGCGCATCGTC
TTTCCTTCTCAACTCCAGCGCCTTTCTCATTGTTTTGTTAGTAATAGTTCTGTGTTTCATCTTGATTGTGATCCCTAAATTTGTGCACTTCGGTTCTCAATTGCTTCGAC
CTCAATCGGTCAAGAAGAGTTGGGATTCTCTCAATTTGGTTCTCGTTCTGTTCGCCATTGTCTGCGGATTTCTCAGCAGAAACACGGGCGATGATAGTAAAGCCTCTTTT
GAAGATCGGAGTGTTTCGTCGAAGCGAACCGGGAAGTCAAACCCTACGACTCCGCGCCGATGGTATGGATATTCCGATGATCGGCCGACTCATTACACTCTCAATCGGAT
GAGGAGTAGTAGTTCGTATCCGGATCTACGTCTGCAGGAGTCTTTGTTGGATGCCGGCGATGAACGGTGGCGATTTTACGACGACACTCATGTGCATAATCATCGGCTTC
CGGCCTCCGATCAGCTTCATCGTCGTCGTGAAGCTCGGCCGGAGCTTGAGCACGAAGATTTCGGTGCCAGAAGTACAGGTTTCGACAGATCTGGGATTCGTGAGGAAGTC
TATTCAAAACCGCGAATGCTTTCACCGCCGCGCTCGCCGCCGCCGCCGCCCCCAACCCTTCCCCCTCCCCCTAAGACGACCCCTACAGTGGTTAAACGGAGGCCAATGAG
AACGCATAAGGTCCATAGCCATACGCCCGACGGAGAAATCGATCGACAGCACAAGAACGGCGATTCAGACGTCGCGGATTGTCAACGGATTCAGCTTCCACCACTCTCAC
CGCCGGCATTTTATCAGGAATCGGAGCAGAAGATCGGCAAAAATGAGAAGAAGAGAGGCGGACCTACAAAAGAATTTTGGACAACCGCACTGAGGAGGAGGAAGAAGAAG
CAAAGACAAAAGAGCATCGAAAGCTTCGAGACTATCCTCAGCTCCCAGCGCCCTTCTACATCGTCATTACCACCAGCGTCACCTCCGCCTCCTCCGCCGCTTCCTCCGCC
GTCAGTTTTTCAAAACCTATTTTCTTCCAAGAAAGGGAAAGGCAGAAAAGGACAATCATTAACTTTACCAGAGCCGCCTCCACCATCAATAGCCTCCTCAGAACCTAAAC
CAGAGATCGGAGATCAAAATCAGCTCTCCAACCTTCACGAGCCTCCAATGGAGCTCGAGAGACTGAGCAGTATAAACGACGAAGAATACAATACACGCATTGGCGGTGAG
TCGCCGTTCCATCCGATTCCTCCACCGCCGCTGCCTTTCAAATTCGCGAGACACGGAAACTTTGACAGTTCTGGAAGCAATAGCAGTACGCCGAGAGCCGTTTCGCCGGA
CATGGAGGAGAGTGAAGCCGATGGCCCACCGGCGGCCGGCCAAATGAGGCTGTTGAAAGATTCCGCAACTCCGATGTTTTGCTCAAGCCCGGATGTTAACAGTAAAGCCG
ATAAGTTCATCGAAAGATTCAGAGCGGATTTGAAGTTGCAGAAGATGAATTCCATCAAGGAGAAGAAGGCGAGGAAGAGATCTAACCTAGGCCGAGAACCAGGCCCAGGC
CCA
Protein sequenceShow/hide protein sequence
MEGDGDAPPPFWLQSSNSLHHLDYNRRPRRRLSRASSFLLNSSAFLIVLLVIVLCFILIVIPKFVHFGSQLLRPQSVKKSWDSLNLVLVLFAIVCGFLSRNTGDDSKASF
EDRSVSSKRTGKSNPTTPRRWYGYSDDRPTHYTLNRMRSSSSYPDLRLQESLLDAGDERWRFYDDTHVHNHRLPASDQLHRRREARPELEHEDFGARSTGFDRSGIREEV
YSKPRMLSPPRSPPPPPPTLPPPPKTTPTVVKRRPMRTHKVHSHTPDGEIDRQHKNGDSDVADCQRIQLPPLSPPAFYQESEQKIGKNEKKRGGPTKEFWTTALRRRKKK
QRQKSIESFETILSSQRPSTSSLPPASPPPPPPLPPPSVFQNLFSSKKGKGRKGQSLTLPEPPPPSIASSEPKPEIGDQNQLSNLHEPPMELERLSSINDEEYNTRIGGE
SPFHPIPPPPLPFKFARHGNFDSSGSNSSTPRAVSPDMEESEADGPPAAGQMRLLKDSATPMFCSSPDVNSKADKFIERFRADLKLQKMNSIKEKKARKRSNLGREPGPG
P