; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc09g02150 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc09g02150
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Descriptionserine/arginine repetitive matrix protein 1-like
Genome locationchr9:1785457..1787115
RNA-Seq ExpressionMoc09g02150
SyntenyMoc09g02150
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR008480 - Protein of unknown function DUF761, plant


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6575459.1 hypothetical protein SDJN03_26098, partial [Cucurbita argyrosperma subsp. sororia]3.1e-22677.05Show/hide
Query:  MEGDGDAPPPFWLQSSNSLHHLDYNRRPRRRLSRASSFLLNSSAFLIVLLVIVLCFILIVIPKFVHFGSQLLRPQSVKKSWDSLNLVLVLFAIVCGFLSR
        ME DG+APPPFWLQ SNSLH LD +RR R RLSRASSFLLNSSAFL+VLLVIVLCFI IVIPKFV FGSQL+RPQS+KKSWDSLNLVLVLFAIVCGFLSR
Subjt:  MEGDGDAPPPFWLQSSNSLHHLDYNRRPRRRLSRASSFLLNSSAFLIVLLVIVLCFILIVIPKFVHFGSQLLRPQSVKKSWDSLNLVLVLFAIVCGFLSR

Query:  NTGDDSKASFEDRSVSSKRTGKSNPTTPRRWYGYSDDRPTHYTLNRMRSSSSYPDLRLQESLLDAGDERWRFYDDTHVHNHRLPASDQLHRRREARPELE
        N GDDS+ SFEDRSVSS+R  KSNP  PR+W GY+D RP HYT+NRMRSSSSYPDLRLQES LDAGD+RWR YDDTHV N+R P+SDQLHRRREARPELE
Subjt:  NTGDDSKASFEDRSVSSKRTGKSNPTTPRRWYGYSDDRPTHYTLNRMRSSSSYPDLRLQESLLDAGDERWRFYDDTHVHNHRLPASDQLHRRREARPELE

Query:  HEDFGARSTGFDRSGIREEVYSKPRMLSPPRSPPP---------PPPTLPPPPKTTPTVVKRRPMRTHKVHSHTPDGEIDRQHKNGDSDVADCQRIQLPP
         ED   +S GFDRS IRE+VYS+  + SPPRSPPP         PPPT PPP  TTP VVKRRP RTHKVHSHTP GEID+ +KNGDSDVA+ QRI LPP
Subjt:  HEDFGARSTGFDRSGIREEVYSKPRMLSPPRSPPP---------PPPTLPPPPKTTPTVVKRRPMRTHKVHSHTPDGEIDRQHKNGDSDVADCQRIQLPP

Query:  LSPPAFYQESEQKIGKNEKKRGGPTKEFWTTALRRRKKKQRQKSIESFETILSSQRPSTSSLPPASPPPPPPLPPPSVFQNLFSSKKGKGRKGQSLTLPE
        LSPP FY+ESEQK  KNEKKRGG  KE W +ALRRR+KKQRQKSIESFE I++SQRPSTSSLPP SPPPPPPL  PSV Q LF+SKKG+G+K QS   PE
Subjt:  LSPPAFYQESEQKIGKNEKKRGGPTKEFWTTALRRRKKKQRQKSIESFETILSSQRPSTSSLPPASPPPPPPLPPPSVFQNLFSSKKGKGRKGQSLTLPE

Query:  PPPPSIASSEPKPEIGDQNQLSNLHEPPMELERLSSINDEEYNTRIGGESPFHPI-PPPPLPFKFARHGNFDSSGSNSSTPRAVSPDMEESEADGPPAAG
          PPSIASSEPKP I DQN L   HEPP+EL RL+S+NDEEY+TRIGGES FHPI PPPP P  F  HG+FDS GSNSSTPRAVSPDM+ESEADG PAAG
Subjt:  PPPPSIASSEPKPEIGDQNQLSNLHEPPMELERLSSINDEEYNTRIGGESPFHPI-PPPPLPFKFARHGNFDSSGSNSSTPRAVSPDMEESEADGPPAAG

Query:  QMRLLKDSATPMFCSSPDVNSKADKFIERFRADLKLQKMNSIKEKKARKRSNLGREPGPGPK
        + + +K+S  PMFCSSPDVNSKADKFI RFRADLKLQKMNSIKEK ARKRSNLGR PGPGP+
Subjt:  QMRLLKDSATPMFCSSPDVNSKADKFIERFRADLKLQKMNSIKEKKARKRSNLGREPGPGPK

XP_022143734.1 uncharacterized protein C6orf132 homolog [Momordica charantia]0.0e+00100Show/hide
Query:  MEGDGDAPPPFWLQSSNSLHHLDYNRRPRRRLSRASSFLLNSSAFLIVLLVIVLCFILIVIPKFVHFGSQLLRPQSVKKSWDSLNLVLVLFAIVCGFLSR
        MEGDGDAPPPFWLQSSNSLHHLDYNRRPRRRLSRASSFLLNSSAFLIVLLVIVLCFILIVIPKFVHFGSQLLRPQSVKKSWDSLNLVLVLFAIVCGFLSR
Subjt:  MEGDGDAPPPFWLQSSNSLHHLDYNRRPRRRLSRASSFLLNSSAFLIVLLVIVLCFILIVIPKFVHFGSQLLRPQSVKKSWDSLNLVLVLFAIVCGFLSR

Query:  NTGDDSKASFEDRSVSSKRTGKSNPTTPRRWYGYSDDRPTHYTLNRMRSSSSYPDLRLQESLLDAGDERWRFYDDTHVHNHRLPASDQLHRRREARPELE
        NTGDDSKASFEDRSVSSKRTGKSNPTTPRRWYGYSDDRPTHYTLNRMRSSSSYPDLRLQESLLDAGDERWRFYDDTHVHNHRLPASDQLHRRREARPELE
Subjt:  NTGDDSKASFEDRSVSSKRTGKSNPTTPRRWYGYSDDRPTHYTLNRMRSSSSYPDLRLQESLLDAGDERWRFYDDTHVHNHRLPASDQLHRRREARPELE

Query:  HEDFGARSTGFDRSGIREEVYSKPRMLSPPRSPPPPPPTLPPPPKTTPTVVKRRPMRTHKVHSHTPDGEIDRQHKNGDSDVADCQRIQLPPLSPPAFYQE
        HEDFGARSTGFDRSGIREEVYSKPRMLSPPRSPPPPPPTLPPPPKTTPTVVKRRPMRTHKVHSHTPDGEIDRQHKNGDSDVADCQRIQLPPLSPPAFYQE
Subjt:  HEDFGARSTGFDRSGIREEVYSKPRMLSPPRSPPPPPPTLPPPPKTTPTVVKRRPMRTHKVHSHTPDGEIDRQHKNGDSDVADCQRIQLPPLSPPAFYQE

Query:  SEQKIGKNEKKRGGPTKEFWTTALRRRKKKQRQKSIESFETILSSQRPSTSSLPPASPPPPPPLPPPSVFQNLFSSKKGKGRKGQSLTLPEPPPPSIASS
        SEQKIGKNEKKRGGPTKEFWTTALRRRKKKQRQKSIESFETILSSQRPSTSSLPPASPPPPPPLPPPSVFQNLFSSKKGKGRKGQSLTLPEPPPPSIASS
Subjt:  SEQKIGKNEKKRGGPTKEFWTTALRRRKKKQRQKSIESFETILSSQRPSTSSLPPASPPPPPPLPPPSVFQNLFSSKKGKGRKGQSLTLPEPPPPSIASS

Query:  EPKPEIGDQNQLSNLHEPPMELERLSSINDEEYNTRIGGESPFHPIPPPPLPFKFARHGNFDSSGSNSSTPRAVSPDMEESEADGPPAAGQMRLLKDSAT
        EPKPEIGDQNQLSNLHEPPMELERLSSINDEEYNTRIGGESPFHPIPPPPLPFKFARHGNFDSSGSNSSTPRAVSPDMEESEADGPPAAGQMRLLKDSAT
Subjt:  EPKPEIGDQNQLSNLHEPPMELERLSSINDEEYNTRIGGESPFHPIPPPPLPFKFARHGNFDSSGSNSSTPRAVSPDMEESEADGPPAAGQMRLLKDSAT

Query:  PMFCSSPDVNSKADKFIERFRADLKLQKMNSIKEKKARKRSNLGREPGPGPK
        PMFCSSPDVNSKADKFIERFRADLKLQKMNSIKEKKARKRSNLGREPGPGPK
Subjt:  PMFCSSPDVNSKADKFIERFRADLKLQKMNSIKEKKARKRSNLGREPGPGPK

XP_022953834.1 protein enabled homolog [Cucurbita moschata]3.1e-22677.22Show/hide
Query:  MEGDGDAPPPFWLQSSNSLHHLDYNRRPRRRLSRASSFLLNSSAFLIVLLVIVLCFILIVIPKFVHFGSQLLRPQSVKKSWDSLNLVLVLFAIVCGFLSR
        ME DG+APPPFWLQ SNSLH LD N R R RLSRASSFLLNSSAFL+VLLVIVLCFI IVIPKFV FGSQL+RPQS+KKSWDSLNLVLVLFAIVCGFLSR
Subjt:  MEGDGDAPPPFWLQSSNSLHHLDYNRRPRRRLSRASSFLLNSSAFLIVLLVIVLCFILIVIPKFVHFGSQLLRPQSVKKSWDSLNLVLVLFAIVCGFLSR

Query:  NTGDDSKASFEDRSVSSKRTGKSNPTTPRRWYGYSDDRPTHYTLNRMRSSSSYPDLRLQESLLDAGDERWRFYDDTHVHNHRLPASDQLHRRREARPELE
        N GDDS+ SFEDRSVSS+RT K+NP  PR+W GY+D RP HYT+NRMRSSSSYPDLRLQES L AGD+R R YDDTHV N+R P SDQL+RRREARPELE
Subjt:  NTGDDSKASFEDRSVSSKRTGKSNPTTPRRWYGYSDDRPTHYTLNRMRSSSSYPDLRLQESLLDAGDERWRFYDDTHVHNHRLPASDQLHRRREARPELE

Query:  HEDFGARSTGFDRSGIREEVYSKPRMLSPPRSPPP---------PPPTLPPPPKTTPTVVKRRPMRTHKVHSHTPDGEIDRQHKNGDSDVADCQRIQLPP
         ED   +S GFDRS IRE+VYS+  + SPPRSPPP         PPPT PPP  TTP VVKRRP RTHKVHSHTP GEID+ +KNGDSDVA+ QRI LPP
Subjt:  HEDFGARSTGFDRSGIREEVYSKPRMLSPPRSPPP---------PPPTLPPPPKTTPTVVKRRPMRTHKVHSHTPDGEIDRQHKNGDSDVADCQRIQLPP

Query:  LSPPAFYQESEQKIGKNEKKRGGPTKEFWTTALRRRKKKQRQKSIESFETILSSQRPSTSSLPPASPPPPPPLPPPSVFQNLFSSKKGKGRKGQSLTLPE
        LSPP FY+ESEQK  KNEKKRGG  KE W +ALRRR+KKQRQKSIESFE I++SQRPSTSSLPP SPPPPPPLP PSV Q LF+SKKG+G+K QS   PE
Subjt:  LSPPAFYQESEQKIGKNEKKRGGPTKEFWTTALRRRKKKQRQKSIESFETILSSQRPSTSSLPPASPPPPPPLPPPSVFQNLFSSKKGKGRKGQSLTLPE

Query:  PPPPSIASSEPKPEIGDQNQLSNLHEPPMELERLSSINDEEYNTRIGGESPFHPI-PPPPLPFKFARHGNFDSSGSNSSTPRAVSPDMEESEADGPPAAG
          PPSIASSEPKP I DQN L   HEPP+EL RL+S+NDEEY+TRIGGESPFHPI PPPP P  F  HG+FDS GSNSSTPRAVSPDM+ESEADG PAAG
Subjt:  PPPPSIASSEPKPEIGDQNQLSNLHEPPMELERLSSINDEEYNTRIGGESPFHPI-PPPPLPFKFARHGNFDSSGSNSSTPRAVSPDMEESEADGPPAAG

Query:  QMRLLKDSATPMFCSSPDVNSKADKFIERFRADLKLQKMNSIKEKKARKRSNLGREPGPGPK
        + +L+KDS  PMFCSSPDVNSKADKFI RFRADLKLQKMNSIKEK ARKRSNLGR PGPGP+
Subjt:  QMRLLKDSATPMFCSSPDVNSKADKFIERFRADLKLQKMNSIKEKKARKRSNLGREPGPGPK

XP_023548433.1 protein enabled homolog [Cucurbita pepo subsp. pepo]1.1e-22877.58Show/hide
Query:  MEGDGDAPPPFWLQSSNSLHHLDYNRRPRRRLSRASSFLLNSSAFLIVLLVIVLCFILIVIPKFVHFGSQLLRPQSVKKSWDSLNLVLVLFAIVCGFLSR
        ME DG+APPPFWLQ SNSLH LD N R R RLSRASSFLLNSSAFL+VLLVIVLCFI IVIPKFV FGSQL+RPQSVKKSWDSLNLVLVLFAIVCGFLSR
Subjt:  MEGDGDAPPPFWLQSSNSLHHLDYNRRPRRRLSRASSFLLNSSAFLIVLLVIVLCFILIVIPKFVHFGSQLLRPQSVKKSWDSLNLVLVLFAIVCGFLSR

Query:  NTGDDSKASFEDRSVSSKRTGKSNPTTPRRWYGYSDDRPTHYTLNRMRSSSSYPDLRLQESLLDAGDERWRFYDDTHVHNHRLPASDQLHRRREARPELE
        N G+DS+ SFEDRSVSS+RT KSNP  PR+W GY+D RP HYT+NRMRSSSSYPDLRLQES LDAGD++WR YDDTHV N+R P+SDQLHRRREARPELE
Subjt:  NTGDDSKASFEDRSVSSKRTGKSNPTTPRRWYGYSDDRPTHYTLNRMRSSSSYPDLRLQESLLDAGDERWRFYDDTHVHNHRLPASDQLHRRREARPELE

Query:  HEDFGARSTGFDRSGIREEVYSKPRMLSPPRSPPP---------PPPTLPPPPKTTPTVVKRRPMRTHKVHSHTPDGEIDRQHKNGDSDVADCQRIQLPP
         ED   +S GFDRS +RE+VYS+  + SPPRSPPP         PPPT PPP  TTP VVKRRP RTHKVHSHTP GEID+ +KNGDSDVA+ QRI LPP
Subjt:  HEDFGARSTGFDRSGIREEVYSKPRMLSPPRSPPP---------PPPTLPPPPKTTPTVVKRRPMRTHKVHSHTPDGEIDRQHKNGDSDVADCQRIQLPP

Query:  LSPPAFYQESEQKIGKNEKKRGGPTKEFWTTALRRRKKKQRQKSIESFETILSSQRPSTSSLPPASPPPPPPLPPPSVFQNLFSSKKGKGRKGQSLTLPE
        LSPP FY+ESEQK  KN+KKRGG  KE W +ALRRR+KKQRQKSIESFE I++SQRPSTSSLPP SPPPPPPLP PSV Q LF+SKKGKG+K QS   PE
Subjt:  LSPPAFYQESEQKIGKNEKKRGGPTKEFWTTALRRRKKKQRQKSIESFETILSSQRPSTSSLPPASPPPPPPLPPPSVFQNLFSSKKGKGRKGQSLTLPE

Query:  PPPPSIASSEPKPEIGDQNQLSNLHEPPMELERLSSINDEEYNTRIGGESPFHPI-PPPPLPFKFARHGNFDSSGSNSSTPRAVSPDMEESEADGPPAAG
          PPSIAS EPKP I DQN L   HEPP+EL RLSS+NDEEY+TRIGGESPFHPI PPPP P  F  HG+FDS GSNSSTPRAVSPDM ESEADG PAAG
Subjt:  PPPPSIASSEPKPEIGDQNQLSNLHEPPMELERLSSINDEEYNTRIGGESPFHPI-PPPPLPFKFARHGNFDSSGSNSSTPRAVSPDMEESEADGPPAAG

Query:  QMRLLKDSATPMFCSSPDVNSKADKFIERFRADLKLQKMNSIKEKKARKRSNLGREPGPGPK
        + +L+KDS  PMFCSSPDVNSKADKFI RFRADLKLQKMNSIKEK ARKRSNLGR PGPGP+
Subjt:  QMRLLKDSATPMFCSSPDVNSKADKFIERFRADLKLQKMNSIKEKKARKRSNLGREPGPGPK

XP_038896222.1 serine/arginine repetitive matrix protein 1-like [Benincasa hispida]4.6e-23077.92Show/hide
Query:  MEGDGDAPPPFWLQSSNSLHHLDYNRRPRRRLSRASSFLLNSSAFLIVLLVIVLCFILIVIPKFVHFGSQLLRPQSVKKSWDSLNLVLVLFAIVCGFLSR
        ME DG+APPPFWLQSSNSLH LDYNR  RRRLSRASSFLLNSSAFLIVLLVIVLCFILIVIPKFV F SQL+RPQSVKKSWDSLNL+LVLFAIVCGFLSR
Subjt:  MEGDGDAPPPFWLQSSNSLHHLDYNRRPRRRLSRASSFLLNSSAFLIVLLVIVLCFILIVIPKFVHFGSQLLRPQSVKKSWDSLNLVLVLFAIVCGFLSR

Query:  NTGDDSKASFEDRSVSSKRTGKSNPTTPRRWYGYSDDRPTHYTLNRMRSSSSYPDLRLQESLLDAGDERWRFYDDTHVHNHRLPASDQLHRRREARPELE
        NTGDDS+ASFED SVSS+RT KSNPTTPRRW GY+D RP HYTLNRMRSSSSYPDLRLQES  DAGD RWRFYDDTHV NHR  +SDQLHRRRE RPELE
Subjt:  NTGDDSKASFEDRSVSSKRTGKSNPTTPRRWYGYSDDRPTHYTLNRMRSSSSYPDLRLQESLLDAGDERWRFYDDTHVHNHRLPASDQLHRRREARPELE

Query:  HEDFGARSTGFDRSGIREEVYSKPRMLSP--PRSPPP---------PPPTLPPPPKTT--PTVVKRRPMRTHKVHSHTPDGEIDRQHKNGDSDVADCQRI
          D  A+S GFDRS IRE+VYS+P + SP  PRSPPP         PPPT PPP  TT  P VVKRRP RTHKVHSHTPD EID+Q++NGDSDVA+ QRI
Subjt:  HEDFGARSTGFDRSGIREEVYSKPRMLSP--PRSPPP---------PPPTLPPPPKTT--PTVVKRRPMRTHKVHSHTPDGEIDRQHKNGDSDVADCQRI

Query:  QLPPLSPPAFYQESEQKIGKNEKKRGGPTKEFWTTALRRRKKKQRQKSIESFETILSSQRPSTSSLPPASPPPPPPLPPPSVFQNLFSSKKGKGRKGQSL
        QLPPLSPP+FY+ESEQK  +NEKKRGG +KE W +ALRRRKKKQRQKSIESFE I++SQR ST    P+SPPPPPPLP PSV QNLFSSKKGKG+K QS 
Subjt:  QLPPLSPPAFYQESEQKIGKNEKKRGGPTKEFWTTALRRRKKKQRQKSIESFETILSSQRPSTSSLPPASPPPPPPLPPPSVFQNLFSSKKGKGRKGQSL

Query:  TLPEPPPPSIASSEPKPEIGDQNQLSNLHEPPMELERLSSINDEEYNTRIGGESPFHPI-PPPPLPFKFARHGNFDSSGSNSSTPRAVSPDMEESEADGP
          PEPP    ASSEPKP+  D+NQ+   HEPPMEL+RLSS+NDEEYNTRIGGESP+HPI PPPP P  F  HG+FDS GSNSSTPRA+SP+M+ESEADGP
Subjt:  TLPEPPPPSIASSEPKPEIGDQNQLSNLHEPPMELERLSSINDEEYNTRIGGESPFHPI-PPPPLPFKFARHGNFDSSGSNSSTPRAVSPDMEESEADGP

Query:  PAAGQMRLLKDSATPMFCSSPDVNSKADKFIERFRADLKLQKMNSIKEKKARKRSNLGREPGPGPK
        PA G+ +L+KDS  P+FCSSPDVNSKADKFI RFRADLKLQKMNSIKEK ARKRSNLGR  GPGPK
Subjt:  PAAGQMRLLKDSATPMFCSSPDVNSKADKFIERFRADLKLQKMNSIKEKKARKRSNLGREPGPGPK

TrEMBL top hitse value%identityAlignment
A0A1S3CII2 LOW QUALITY PROTEIN: serine/arginine repetitive matrix protein 1-like1.1e-22175.44Show/hide
Query:  MEGDGDA-PPPFWLQSSN-SLHHLDYNRRPRRRLSRASSFLLNSSAFLIVLLVIVLCFILIVIPKFVHFGSQLLRPQSVKKSWDSLNLVLVLFAIVCGFL
        ME DG+A  PPFWLQSSN SLH L Y+R  RRRLSRASSFLLNSSAFLIVLLVIVLCFILIVIPKFV F SQL+RPQSVKKSWDSLNL+LVLFAIVCGFL
Subjt:  MEGDGDA-PPPFWLQSSN-SLHHLDYNRRPRRRLSRASSFLLNSSAFLIVLLVIVLCFILIVIPKFVHFGSQLLRPQSVKKSWDSLNLVLVLFAIVCGFL

Query:  SRNT-GDDSKASFEDRSVSSKRTGKSNPTTPRRWYGYSDDRPTHYTLNRMRSSSSYPDLRLQESLLDAGDERWRFYDDTHVHNHRLPASDQLHRRREARP
         RN  GDDS+ SFEDRSVSS+R+ KSNPTTPRRW GY+D RP H+TLNRMRSSSSYPDLRLQES  DAGD RWRFYDDTHV NHR  +SDQLHRRRE +P
Subjt:  SRNT-GDDSKASFEDRSVSSKRTGKSNPTTPRRWYGYSDDRPTHYTLNRMRSSSSYPDLRLQESLLDAGDERWRFYDDTHVHNHRLPASDQLHRRREARP

Query:  ELEHEDFGARSTGFDRSGIREEVYSKPRMLSPPRSPPP---------PPPTLPPPPKTTPTVVKRRPMRTHKVHSHTPDGEIDRQHKNGDSDVADCQRIQ
        ELE +D  A+S  FDRS IR +VYS+P + SPPRSPPP         PPPT PPP  T P +VKRRP RTHKVHSHTP+ EI++QH+NGDSDVA+ QRIQ
Subjt:  ELEHEDFGARSTGFDRSGIREEVYSKPRMLSPPRSPPP---------PPPTLPPPPKTTPTVVKRRPMRTHKVHSHTPDGEIDRQHKNGDSDVADCQRIQ

Query:  LPPLSPPAFYQESEQKIGKNEKKRGGPTKEFWTTALRRRKKKQRQKSIESFETILSSQRPSTSSLPPAS--PPPPPPLPPPSVFQNLFSSKKGKGRKGQS
        LPPLSPP FY+ESEQK  KNEKKR G +KE W +ALRRRKKKQRQKS+ESFE I++SQR STSSLPP S  PPPPPPLP PSV QNLFSS+KGK +K QS
Subjt:  LPPLSPPAFYQESEQKIGKNEKKRGGPTKEFWTTALRRRKKKQRQKSIESFETILSSQRPSTSSLPPAS--PPPPPPLPPPSVFQNLFSSKKGKGRKGQS

Query:  LTLPEPPPPSIASSEPKPEIGDQNQLSNLHEPPMELERLSSINDEEYNTRIGGESPFHPI-PPPPLPFKFARHGNFDSSGSNSSTPRAVSPDMEESEADG
         +LP+PPPPSIASSEPKP+  DQNQ+    +PPMEL+RLSS+NDEEY+TRIGGESP+HPI PPPP P  F  HG+FDS GSNSSTPRA+SP+M+ESEAD 
Subjt:  LTLPEPPPPSIASSEPKPEIGDQNQLSNLHEPPMELERLSSINDEEYNTRIGGESPFHPI-PPPPLPFKFARHGNFDSSGSNSSTPRAVSPDMEESEADG

Query:  PPAAGQMRLLKDSATPMFCSSPDVNSKADKFIERFRADLKLQKMNSIKEKKARKRSNLGREPGPGP
        PPA  + +L+KD   PMFCSSPDVNSKADKFI RFRADLKLQKMNSIKEK  RKRSNLGR  GPGP
Subjt:  PPAAGQMRLLKDSATPMFCSSPDVNSKADKFIERFRADLKLQKMNSIKEKKARKRSNLGREPGPGP

A0A5A7V0Q3 Serine/arginine repetitive matrix protein 1-like4.2e-22175.27Show/hide
Query:  MEGDGDA-PPPFWLQSSN-SLHHLDYNRRPRRRLSRASSFLLNSSAFLIVLLVIVLCFILIVIPKFVHFGSQLLRPQSVKKSWDSLNLVLVLFAIVCGFL
        ME DG+A  PPFWLQSSN SLH L Y+R  RRRLSRASSFLLNSSAFLIVLLVIVLCFILIVIPKFV F SQL+RPQSVKKSWDSLNL+LVLFAIVCGFL
Subjt:  MEGDGDA-PPPFWLQSSN-SLHHLDYNRRPRRRLSRASSFLLNSSAFLIVLLVIVLCFILIVIPKFVHFGSQLLRPQSVKKSWDSLNLVLVLFAIVCGFL

Query:  SRNT-GDDSKASFEDRSVSSKRTGKSNPTTPRRWYGYSDDRPTHYTLNRMRSSSSYPDLRLQESLLDAGDERWRFYDDTHVHNHRLPASDQLHRRREARP
         RN  GDDS+ SFEDRSVSS+R+ KSNPTTPRRW GY+D RP H+TLNRMRSSSSYPDLRLQES  DAGD +WRFYDDTHV NHR  +SDQLHRRRE +P
Subjt:  SRNT-GDDSKASFEDRSVSSKRTGKSNPTTPRRWYGYSDDRPTHYTLNRMRSSSSYPDLRLQESLLDAGDERWRFYDDTHVHNHRLPASDQLHRRREARP

Query:  ELEHEDFGARSTGFDRSGIREEVYSKPRMLSPPRSPPP---------PPPTLPPPPKTTPTVVKRRPMRTHKVHSHTPDGEIDRQHKNGDSDVADCQRIQ
        ELE +D  A+S  FDRS IR +VYS+P + SPPRSPPP         PPPT PPP  T P +VKRRP RTHKVHSHTP+ EI++QH+NGDSDVA+ QRIQ
Subjt:  ELEHEDFGARSTGFDRSGIREEVYSKPRMLSPPRSPPP---------PPPTLPPPPKTTPTVVKRRPMRTHKVHSHTPDGEIDRQHKNGDSDVADCQRIQ

Query:  LPPLSPPAFYQESEQKIGKNEKKRGGPTKEFWTTALRRRKKKQRQKSIESFETILSSQRPSTSSLPPAS--PPPPPPLPPPSVFQNLFSSKKGKGRKGQS
        LPPLSPP FY+ESEQK  KNEKKR G +KE W +ALRRRKKKQRQKS+ESFE I++SQR STSSLPP S  PPPPPPLP PSV QNLFSS+KGK +K QS
Subjt:  LPPLSPPAFYQESEQKIGKNEKKRGGPTKEFWTTALRRRKKKQRQKSIESFETILSSQRPSTSSLPPAS--PPPPPPLPPPSVFQNLFSSKKGKGRKGQS

Query:  LTLPEPPPPSIASSEPKPEIGDQNQLSNLHEPPMELERLSSINDEEYNTRIGGESPFHPI-PPPPLPFKFARHGNFDSSGSNSSTPRAVSPDMEESEADG
         +LP+PPPPSIASSEPKP+  DQNQ+    +PPMEL+RLSS+NDEEY+TRIGGESP+HPI PPPP P  F  HG+FDS GSNSSTPRA+SP+M+ESEAD 
Subjt:  LTLPEPPPPSIASSEPKPEIGDQNQLSNLHEPPMELERLSSINDEEYNTRIGGESPFHPI-PPPPLPFKFARHGNFDSSGSNSSTPRAVSPDMEESEADG

Query:  PPAAGQMRLLKDSATPMFCSSPDVNSKADKFIERFRADLKLQKMNSIKEKKARKRSNLGREPGPGP
        PPA  + +L+KD   PMFCSSPDVNSKADKFI RFRADLKLQKMNSIKEK  RKRSNLGR  GPGP
Subjt:  PPAAGQMRLLKDSATPMFCSSPDVNSKADKFIERFRADLKLQKMNSIKEKKARKRSNLGREPGPGP

A0A6J1CQ76 uncharacterized protein C6orf132 homolog0.0e+00100Show/hide
Query:  MEGDGDAPPPFWLQSSNSLHHLDYNRRPRRRLSRASSFLLNSSAFLIVLLVIVLCFILIVIPKFVHFGSQLLRPQSVKKSWDSLNLVLVLFAIVCGFLSR
        MEGDGDAPPPFWLQSSNSLHHLDYNRRPRRRLSRASSFLLNSSAFLIVLLVIVLCFILIVIPKFVHFGSQLLRPQSVKKSWDSLNLVLVLFAIVCGFLSR
Subjt:  MEGDGDAPPPFWLQSSNSLHHLDYNRRPRRRLSRASSFLLNSSAFLIVLLVIVLCFILIVIPKFVHFGSQLLRPQSVKKSWDSLNLVLVLFAIVCGFLSR

Query:  NTGDDSKASFEDRSVSSKRTGKSNPTTPRRWYGYSDDRPTHYTLNRMRSSSSYPDLRLQESLLDAGDERWRFYDDTHVHNHRLPASDQLHRRREARPELE
        NTGDDSKASFEDRSVSSKRTGKSNPTTPRRWYGYSDDRPTHYTLNRMRSSSSYPDLRLQESLLDAGDERWRFYDDTHVHNHRLPASDQLHRRREARPELE
Subjt:  NTGDDSKASFEDRSVSSKRTGKSNPTTPRRWYGYSDDRPTHYTLNRMRSSSSYPDLRLQESLLDAGDERWRFYDDTHVHNHRLPASDQLHRRREARPELE

Query:  HEDFGARSTGFDRSGIREEVYSKPRMLSPPRSPPPPPPTLPPPPKTTPTVVKRRPMRTHKVHSHTPDGEIDRQHKNGDSDVADCQRIQLPPLSPPAFYQE
        HEDFGARSTGFDRSGIREEVYSKPRMLSPPRSPPPPPPTLPPPPKTTPTVVKRRPMRTHKVHSHTPDGEIDRQHKNGDSDVADCQRIQLPPLSPPAFYQE
Subjt:  HEDFGARSTGFDRSGIREEVYSKPRMLSPPRSPPPPPPTLPPPPKTTPTVVKRRPMRTHKVHSHTPDGEIDRQHKNGDSDVADCQRIQLPPLSPPAFYQE

Query:  SEQKIGKNEKKRGGPTKEFWTTALRRRKKKQRQKSIESFETILSSQRPSTSSLPPASPPPPPPLPPPSVFQNLFSSKKGKGRKGQSLTLPEPPPPSIASS
        SEQKIGKNEKKRGGPTKEFWTTALRRRKKKQRQKSIESFETILSSQRPSTSSLPPASPPPPPPLPPPSVFQNLFSSKKGKGRKGQSLTLPEPPPPSIASS
Subjt:  SEQKIGKNEKKRGGPTKEFWTTALRRRKKKQRQKSIESFETILSSQRPSTSSLPPASPPPPPPLPPPSVFQNLFSSKKGKGRKGQSLTLPEPPPPSIASS

Query:  EPKPEIGDQNQLSNLHEPPMELERLSSINDEEYNTRIGGESPFHPIPPPPLPFKFARHGNFDSSGSNSSTPRAVSPDMEESEADGPPAAGQMRLLKDSAT
        EPKPEIGDQNQLSNLHEPPMELERLSSINDEEYNTRIGGESPFHPIPPPPLPFKFARHGNFDSSGSNSSTPRAVSPDMEESEADGPPAAGQMRLLKDSAT
Subjt:  EPKPEIGDQNQLSNLHEPPMELERLSSINDEEYNTRIGGESPFHPIPPPPLPFKFARHGNFDSSGSNSSTPRAVSPDMEESEADGPPAAGQMRLLKDSAT

Query:  PMFCSSPDVNSKADKFIERFRADLKLQKMNSIKEKKARKRSNLGREPGPGPK
        PMFCSSPDVNSKADKFIERFRADLKLQKMNSIKEKKARKRSNLGREPGPGPK
Subjt:  PMFCSSPDVNSKADKFIERFRADLKLQKMNSIKEKKARKRSNLGREPGPGPK

A0A6J1GQY1 protein enabled homolog1.5e-22677.22Show/hide
Query:  MEGDGDAPPPFWLQSSNSLHHLDYNRRPRRRLSRASSFLLNSSAFLIVLLVIVLCFILIVIPKFVHFGSQLLRPQSVKKSWDSLNLVLVLFAIVCGFLSR
        ME DG+APPPFWLQ SNSLH LD N R R RLSRASSFLLNSSAFL+VLLVIVLCFI IVIPKFV FGSQL+RPQS+KKSWDSLNLVLVLFAIVCGFLSR
Subjt:  MEGDGDAPPPFWLQSSNSLHHLDYNRRPRRRLSRASSFLLNSSAFLIVLLVIVLCFILIVIPKFVHFGSQLLRPQSVKKSWDSLNLVLVLFAIVCGFLSR

Query:  NTGDDSKASFEDRSVSSKRTGKSNPTTPRRWYGYSDDRPTHYTLNRMRSSSSYPDLRLQESLLDAGDERWRFYDDTHVHNHRLPASDQLHRRREARPELE
        N GDDS+ SFEDRSVSS+RT K+NP  PR+W GY+D RP HYT+NRMRSSSSYPDLRLQES L AGD+R R YDDTHV N+R P SDQL+RRREARPELE
Subjt:  NTGDDSKASFEDRSVSSKRTGKSNPTTPRRWYGYSDDRPTHYTLNRMRSSSSYPDLRLQESLLDAGDERWRFYDDTHVHNHRLPASDQLHRRREARPELE

Query:  HEDFGARSTGFDRSGIREEVYSKPRMLSPPRSPPP---------PPPTLPPPPKTTPTVVKRRPMRTHKVHSHTPDGEIDRQHKNGDSDVADCQRIQLPP
         ED   +S GFDRS IRE+VYS+  + SPPRSPPP         PPPT PPP  TTP VVKRRP RTHKVHSHTP GEID+ +KNGDSDVA+ QRI LPP
Subjt:  HEDFGARSTGFDRSGIREEVYSKPRMLSPPRSPPP---------PPPTLPPPPKTTPTVVKRRPMRTHKVHSHTPDGEIDRQHKNGDSDVADCQRIQLPP

Query:  LSPPAFYQESEQKIGKNEKKRGGPTKEFWTTALRRRKKKQRQKSIESFETILSSQRPSTSSLPPASPPPPPPLPPPSVFQNLFSSKKGKGRKGQSLTLPE
        LSPP FY+ESEQK  KNEKKRGG  KE W +ALRRR+KKQRQKSIESFE I++SQRPSTSSLPP SPPPPPPLP PSV Q LF+SKKG+G+K QS   PE
Subjt:  LSPPAFYQESEQKIGKNEKKRGGPTKEFWTTALRRRKKKQRQKSIESFETILSSQRPSTSSLPPASPPPPPPLPPPSVFQNLFSSKKGKGRKGQSLTLPE

Query:  PPPPSIASSEPKPEIGDQNQLSNLHEPPMELERLSSINDEEYNTRIGGESPFHPI-PPPPLPFKFARHGNFDSSGSNSSTPRAVSPDMEESEADGPPAAG
          PPSIASSEPKP I DQN L   HEPP+EL RL+S+NDEEY+TRIGGESPFHPI PPPP P  F  HG+FDS GSNSSTPRAVSPDM+ESEADG PAAG
Subjt:  PPPPSIASSEPKPEIGDQNQLSNLHEPPMELERLSSINDEEYNTRIGGESPFHPI-PPPPLPFKFARHGNFDSSGSNSSTPRAVSPDMEESEADGPPAAG

Query:  QMRLLKDSATPMFCSSPDVNSKADKFIERFRADLKLQKMNSIKEKKARKRSNLGREPGPGPK
        + +L+KDS  PMFCSSPDVNSKADKFI RFRADLKLQKMNSIKEK ARKRSNLGR PGPGP+
Subjt:  QMRLLKDSATPMFCSSPDVNSKADKFIERFRADLKLQKMNSIKEKKARKRSNLGREPGPGPK

A0A6J1KKJ2 serine/arginine repetitive matrix protein 1-like2.2e-22276.03Show/hide
Query:  MEGDGDAPPPFWLQSSNSLHHLDYNRRPRRRLSRASSFLLNSSAFLIVLLVIVLCFILIVIPKFVHFGSQLLRPQSVKKSWDSLNLVLVLFAIVCGFLSR
        MEGDG+A PPFWLQSSNS   + YNR  RRRLSRASSFLLNSSAFL VLLVIVLCF+LIVIPK V F SQL+RPQSVKKSWDSLNLVLVLFAIVCGFLSR
Subjt:  MEGDGDAPPPFWLQSSNSLHHLDYNRRPRRRLSRASSFLLNSSAFLIVLLVIVLCFILIVIPKFVHFGSQLLRPQSVKKSWDSLNLVLVLFAIVCGFLSR

Query:  NTGDDSKASFEDRSVSSKRTGKSNPTTPRRWYGYSDDRPTHYTLNRMRSSSSYPDLRLQESLLDAGDERWRFYDDTHVHNHRLPASDQLHRRREARPELE
        NTGDD++  FEDRSVSS+R  KSNPTTPR+W GY D RP HYT+NRMRSSSSYPDLRLQES LDAGDERWRFYDDTHV NHR  +SDQLHRR +ARPELE
Subjt:  NTGDDSKASFEDRSVSSKRTGKSNPTTPRRWYGYSDDRPTHYTLNRMRSSSSYPDLRLQESLLDAGDERWRFYDDTHVHNHRLPASDQLHRRREARPELE

Query:  HEDFGARSTGFDRSGIREEVYSKPRMLSPPRSPPP----PPPTLPPPPKTTPTVVKRRPMRTHKVHSHTPDGEIDRQHKNGDSDVADCQRIQLPPLSPPA
         ED GA+STGFDRS + E+VYS+P + SPPR PPP    PPPTL  P  TTP VVKRRP RTH VHSHTPDG ID+Q KN DSDVAD QRI LPPLSPP+
Subjt:  HEDFGARSTGFDRSGIREEVYSKPRMLSPPRSPPP----PPPTLPPPPKTTPTVVKRRPMRTHKVHSHTPDGEIDRQHKNGDSDVADCQRIQLPPLSPPA

Query:  FYQESEQKIGKNEKKRGGPTKEFWTTALRRRKKKQRQKSIESFETILSSQRPSTSSLPPASPPPPPPLPPPSVFQNLFSSKKGKGRKGQSLTLPEPPPPS
        FYQESEQK GKNEKKRGG  KE W +ALRRRKKKQRQKS+ESFE I ++ R STSSLP ASPPPPPPLPPP V QNLF SKKGK +K QS   PE PPP+
Subjt:  FYQESEQKIGKNEKKRGGPTKEFWTTALRRRKKKQRQKSIESFETILSSQRPSTSSLPPASPPPPPPLPPPSVFQNLFSSKKGKGRKGQSLTLPEPPPPS

Query:  IASSEPKPEIGDQNQLSNLHEPPMELERLSSINDEEYNTRIGGESPFH---PIPPPPLPFKFARHGNFDSSGSNSSTPRAVSPDMEESEADGPPAAGQMR
        I +SEPKPEI  QN     ++PPMELERLSS+NDEEYNTRIG +SPFH   P PPPP P  F  HG+FDS+GSNSSTPRA+SP+++ESE DGPPAAG+M+
Subjt:  IASSEPKPEIGDQNQLSNLHEPPMELERLSSINDEEYNTRIGGESPFH---PIPPPPLPFKFARHGNFDSSGSNSSTPRAVSPDMEESEADGPPAAGQMR

Query:  LLKDSATPMFCSSPDVNSKADKFIERFRADLKLQKMNSIKEKKARKRSNLGREPGPGPK
        + + S TP+FCSSPDVNSKADKFI RF+ADLKLQKMNSIKE+ ARKRSNLGR  GPGPK
Subjt:  LLKDSATPMFCSSPDVNSKADKFIERFRADLKLQKMNSIKEKKARKRSNLGREPGPGPK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G72790.1 hydroxyproline-rich glycoprotein family protein1.1e-6938.36Show/hide
Query:  EGDGDAPPPFWLQS--SNSLHHLDYNRRPRRRLSRASSFLLNSSAFLIVLLVIVLCFILIVIPKFVHFGSQLLRPQSVKKSWDSLNLVLVLFAIVCGFLS
        E DGDA  PFWLQS  +N+      +   R        F   ++A LIV         + +IP F    SQ+ RP  V+KSWD LN VLVLFA++CGFLS
Subjt:  EGDGDAPPPFWLQS--SNSLHHLDYNRRPRRRLSRASSFLLNSSAFLIVLLVIVLCFILIVIPKFVHFGSQLLRPQSVKKSWDSLNLVLVLFAIVCGFLS

Query:  RNTGDDSKASFEDRSVSSK----------RTGKSNP-TTPRRWY----GYSDDRPTHYTLNRMRSSSSYPDLRLQESLLDAGDERWRFYDDTHVHNHRLP
        RNT +D     ++  + +K          R+  SN  TTPR W     G   D+  +   +R+RS SSYPDLRL+E      DERWRFYDDT V   R  
Subjt:  RNTGDDSKASFEDRSVSSK----------RTGKSNP-TTPRRWY----GYSDDRPTHYTLNRMRSSSSYPDLRLQESLLDAGDERWRFYDDTHVHNHRLP

Query:  ASDQLHRRR-------EARPELEHEDFGARSTGFDRSGIRE----------------EVYSKPRMLSPPRSPPPPPPTLPPPPKTTPTVVKRRPMRTHKV
          D ++  +       E +P  E  D        + S +R                 EV  + ++ S P   P PPP+ P PP       K+   +T++V
Subjt:  ASDQLHRRR-------EARPELEHEDFGARSTGFDRSGIRE----------------EVYSKPRMLSPPRSPPPPPPTLPPPPKTTPTVVKRRPMRTHKV

Query:  HSHTPDGEIDRQHKNGDSDVADCQRIQLPPLSPPAFYQESEQKIGKNEKKRGGPTKEFWTTALRRRKKKQRQKSIESFETILSSQRPSTSSLPPASPPPP
        +    D     + K  D  VA        P+ PPA      QK  K EKK+GG TK+F   ALRR+KKKQRQ+SI+  + +  S  P   S      PPP
Subjt:  HSHTPDGEIDRQHKNGDSDVADCQRIQLPPLSPPAFYQESEQKIGKNEKKRGGPTKEFWTTALRRRKKKQRQKSIESFETILSSQRPSTSSLPPASPPPP

Query:  PPLPPPSVFQNLFSSKKGKGRKGQSLTLPEPPPPSIASSEPKPEIGDQNQLSNLHEPPMELERLSSINDEEYNTR---IGGESPFHPIPPPPLP------
        PP PPP  FQ LFSSKKGK +K  S   P PPPP      P+     +   S L + P+E  R S  N     T+    G ESP  PIPPPP P      
Subjt:  PPLPPPSVFQNLFSSKKGKGRKGQSLTLPEPPPPSIASSEPKPEIGDQNQLSNLHEPPMELERLSSINDEEYNTR---IGGESPFHPIPPPPLP------

Query:  -FKFARHGNFDSSGSNSSTPRAVSPDMEESEADGPPAAGQMRLLKDSATPMFCSSPDVNSKADKFIERFRADLKLQKMNSIKEKKARKRSNLGREPG
         +KF + G++    S+ S    +S D E  + D   +AG     K++A  MFC SPDV++KAD FI RFRA LKL+KMNS+K    R RSNLG EPG
Subjt:  -FKFARHGNFDSSGSNSSTPRAVSPDMEESEADGPPAAGQMRLLKDSATPMFCSSPDVNSKADKFIERFRADLKLQKMNSIKEKKARKRSNLGREPG

AT5G57070.1 hydroxyproline-rich glycoprotein family protein1.6e-3930.08Show/hide
Query:  DAPPPFWLQSSNSLHHLDYNRRPRRRLSRASSFLLNSSAFLIVLLVIVLCFILIVIPKFVHFGSQLLRPQSVKKSWDSLNLVLVLFAIVCGFLSRNTGD-
        D PP  W Q        D     RRR S  +  +L  +   +    I L F+  V+P F+   SQ+L+P SVK+ WDS+N+VLV+FAI+CG L+R   D 
Subjt:  DAPPPFWLQSSNSLHHLDYNRRPRRRLSRASSFLLNSSAFLIVLLVIVLCFILIVIPKFVHFGSQLLRPQSVKKSWDSLNLVLVLFAIVCGFLSRNTGD-

Query:  ---DSKASFEDRSVSS-------------KRTGKSNPTTPRRWYG--YSDDR----------------PTHYTLNRMRSSSSYPDLRLQESLLDAGDERW
           +S    E+  V                +   S+ T   +W+   Y  DR                P    +   RSSSSYPDLR Q    + GD R+
Subjt:  ---DSKASFEDRSVSS-------------KRTGKSNPTTPRRWYG--YSDDR----------------PTHYTLNRMRSSSSYPDLRLQESLLDAGDERW

Query:  RFYDDTHVHNHRLPASDQLHR-RREARPELEHEDFGARSTGFDRSGIREEVYSKPRMLSPPRSPPPPPPTLPPPPKTTPTVVKRRPMRTHKVHSHTPDGE
        RFYDD  +  +R   S    + +  ++ E+E E+   +    D   ++          SPP+ PP  PP  PPPP   P  V ++P RTH+   +     
Subjt:  RFYDDTHVHNHRLPASDQLHR-RREARPELEHEDFGARSTGFDRSGIREEVYSKPRMLSPPRSPPPPPPTLPPPPKTTPTVVKRRPMRTHKVHSHTPDGE

Query:  IDRQHKNGDSDVADCQRIQLPPLSP---------PAFYQESEQKIGKNEKKRGGPTKE----FWTTALRRRKKKQRQKS-----IESFETILSSQRPS--
         D Q     S+    +  Q PP  P         P       +K G  ++++    KE    F +   + +KKK+ QKS     IES   +     P   
Subjt:  IDRQHKNGDSDVADCQRIQLPPLSP---------PAFYQESEQKIGKNEKKRGGPTKE----FWTTALRRRKKKQRQKS-----IESFETILSSQRPS--

Query:  TSSLPPASPPPPPPLPPP------SVFQNLFSSKKGKGRKGQSLTLPEPPPPSIASSEPKPEIGDQNQLSNLHEPPMELERLSSINDEEYNTRIGGESPF
         S +PP SPPPPPP PPP      SVF  LF       +K  S+  P PPPP    ++  P+   +   S     P + +  +  N+ + +  I    P 
Subjt:  TSSLPPASPPPPPPLPPP------SVFQNLFSSKKGKGRKGQSLTLPEPPPPSIASSEPKPEIGDQNQLSNLHEPPMELERLSSINDEEYNTRIGGESPF

Query:  HPIPPPPL---PFKFARHGNFDSSGSNSSTPRAVSPD---------MEESEADGPPAAGQMRLLKDSATPMFCSSPDVNSKADKFIERFRADLKLQKMNS
         P PPPP    P K+   G+F    SN S+ R  SP+         +E +++DG         +     P FC SPDV++KAD FI R R + +L K+NS
Subjt:  HPIPPPPL---PFKFARHGNFDSSGSNSSTPRAVSPD---------MEESEADGPPAAGQMRLLKDSATPMFCSSPDVNSKADKFIERFRADLKLQKMNS

Query:  IKEKK
        +  K+
Subjt:  IKEKK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGGGAGACGGCGACGCGCCGCCGCCATTCTGGCTCCAATCCTCCAACTCTCTTCACCATCTCGACTACAATCGCCGCCCCCGCCGCCGCCTCAGCCGCGCA
TCGTCTTTCCTTCTCAACTCCAGCGCCTTTCTCATTGTTTTGTTAGTAATAGTTCTGTGTTTCATCTTGATTGTGATCCCTAAATTTGTGCACTTCGGTTCTCAA
TTGCTTCGACCTCAATCGGTCAAGAAGAGTTGGGATTCTCTCAATTTGGTTCTCGTTCTGTTCGCCATTGTCTGCGGATTTCTCAGCAGAAACACGGGCGATGAT
AGTAAAGCCTCTTTTGAAGATCGGAGTGTTTCGTCGAAGCGAACCGGGAAGTCAAACCCTACGACTCCGCGCCGATGGTATGGATATTCCGATGATCGGCCGACT
CATTACACTCTCAATCGGATGAGGAGTAGTAGTTCGTATCCGGATCTACGTCTGCAGGAGTCTTTGTTGGATGCCGGCGATGAACGGTGGCGATTTTACGACGAC
ACTCATGTGCATAATCATCGGCTTCCGGCCTCCGATCAGCTTCATCGTCGTCGTGAAGCTCGGCCGGAGCTTGAGCACGAAGATTTCGGTGCCAGAAGTACAGGT
TTCGACAGATCTGGGATTCGTGAGGAAGTCTATTCAAAACCGCGAATGCTTTCACCGCCGCGCTCGCCGCCGCCGCCGCCCCCAACCCTTCCCCCTCCCCCTAAG
ACGACCCCTACAGTGGTTAAACGGAGGCCAATGAGAACGCATAAGGTCCATAGCCATACGCCCGACGGAGAAATCGATCGACAGCACAAGAACGGCGATTCAGAC
GTCGCGGATTGTCAACGGATTCAGCTTCCACCACTCTCACCGCCGGCATTTTATCAGGAATCGGAGCAGAAGATCGGCAAAAATGAGAAGAAGAGAGGCGGACCT
ACAAAAGAATTTTGGACAACCGCACTGAGGAGGAGGAAGAAGAAGCAAAGACAAAAGAGCATCGAAAGCTTCGAGACTATCCTCAGCTCCCAGCGCCCTTCTACA
TCGTCATTACCACCAGCGTCACCTCCGCCTCCTCCGCCGCTTCCTCCGCCGTCAGTTTTTCAAAACCTATTTTCTTCCAAGAAAGGGAAAGGCAGAAAAGGACAA
TCATTAACTTTACCAGAGCCGCCTCCACCATCAATAGCCTCCTCAGAACCTAAACCAGAGATCGGAGATCAAAATCAGCTCTCCAACCTTCACGAGCCTCCAATG
GAGCTCGAGAGACTGAGCAGTATAAACGACGAAGAATACAATACACGCATTGGCGGTGAGTCGCCGTTCCATCCGATTCCTCCACCGCCGCTGCCTTTCAAATTC
GCGAGACACGGAAACTTTGACAGTTCTGGAAGCAATAGCAGTACGCCGAGAGCCGTTTCGCCGGACATGGAGGAGAGTGAAGCCGATGGCCCACCGGCGGCCGGC
CAAATGAGGCTGTTGAAAGATTCCGCAACTCCGATGTTTTGCTCAAGCCCGGATGTTAACAGTAAAGCCGATAAGTTCATCGAAAGATTCAGAGCGGATTTGAAG
TTGCAGAAGATGAATTCCATCAAGGAGAAGAAGGCGAGGAAGAGATCTAACCTAGGCCGAGAACCAGGCCCAGGCCCAAAGTAG
mRNA sequenceShow/hide mRNA sequence
ATGGAGGGAGACGGCGACGCGCCGCCGCCATTCTGGCTCCAATCCTCCAACTCTCTTCACCATCTCGACTACAATCGCCGCCCCCGCCGCCGCCTCAGCCGCGCA
TCGTCTTTCCTTCTCAACTCCAGCGCCTTTCTCATTGTTTTGTTAGTAATAGTTCTGTGTTTCATCTTGATTGTGATCCCTAAATTTGTGCACTTCGGTTCTCAA
TTGCTTCGACCTCAATCGGTCAAGAAGAGTTGGGATTCTCTCAATTTGGTTCTCGTTCTGTTCGCCATTGTCTGCGGATTTCTCAGCAGAAACACGGGCGATGAT
AGTAAAGCCTCTTTTGAAGATCGGAGTGTTTCGTCGAAGCGAACCGGGAAGTCAAACCCTACGACTCCGCGCCGATGGTATGGATATTCCGATGATCGGCCGACT
CATTACACTCTCAATCGGATGAGGAGTAGTAGTTCGTATCCGGATCTACGTCTGCAGGAGTCTTTGTTGGATGCCGGCGATGAACGGTGGCGATTTTACGACGAC
ACTCATGTGCATAATCATCGGCTTCCGGCCTCCGATCAGCTTCATCGTCGTCGTGAAGCTCGGCCGGAGCTTGAGCACGAAGATTTCGGTGCCAGAAGTACAGGT
TTCGACAGATCTGGGATTCGTGAGGAAGTCTATTCAAAACCGCGAATGCTTTCACCGCCGCGCTCGCCGCCGCCGCCGCCCCCAACCCTTCCCCCTCCCCCTAAG
ACGACCCCTACAGTGGTTAAACGGAGGCCAATGAGAACGCATAAGGTCCATAGCCATACGCCCGACGGAGAAATCGATCGACAGCACAAGAACGGCGATTCAGAC
GTCGCGGATTGTCAACGGATTCAGCTTCCACCACTCTCACCGCCGGCATTTTATCAGGAATCGGAGCAGAAGATCGGCAAAAATGAGAAGAAGAGAGGCGGACCT
ACAAAAGAATTTTGGACAACCGCACTGAGGAGGAGGAAGAAGAAGCAAAGACAAAAGAGCATCGAAAGCTTCGAGACTATCCTCAGCTCCCAGCGCCCTTCTACA
TCGTCATTACCACCAGCGTCACCTCCGCCTCCTCCGCCGCTTCCTCCGCCGTCAGTTTTTCAAAACCTATTTTCTTCCAAGAAAGGGAAAGGCAGAAAAGGACAA
TCATTAACTTTACCAGAGCCGCCTCCACCATCAATAGCCTCCTCAGAACCTAAACCAGAGATCGGAGATCAAAATCAGCTCTCCAACCTTCACGAGCCTCCAATG
GAGCTCGAGAGACTGAGCAGTATAAACGACGAAGAATACAATACACGCATTGGCGGTGAGTCGCCGTTCCATCCGATTCCTCCACCGCCGCTGCCTTTCAAATTC
GCGAGACACGGAAACTTTGACAGTTCTGGAAGCAATAGCAGTACGCCGAGAGCCGTTTCGCCGGACATGGAGGAGAGTGAAGCCGATGGCCCACCGGCGGCCGGC
CAAATGAGGCTGTTGAAAGATTCCGCAACTCCGATGTTTTGCTCAAGCCCGGATGTTAACAGTAAAGCCGATAAGTTCATCGAAAGATTCAGAGCGGATTTGAAG
TTGCAGAAGATGAATTCCATCAAGGAGAAGAAGGCGAGGAAGAGATCTAACCTAGGCCGAGAACCAGGCCCAGGCCCAAAGTAG
Protein sequenceShow/hide protein sequence
MEGDGDAPPPFWLQSSNSLHHLDYNRRPRRRLSRASSFLLNSSAFLIVLLVIVLCFILIVIPKFVHFGSQLLRPQSVKKSWDSLNLVLVLFAIVCGFLSRNTGDD
SKASFEDRSVSSKRTGKSNPTTPRRWYGYSDDRPTHYTLNRMRSSSSYPDLRLQESLLDAGDERWRFYDDTHVHNHRLPASDQLHRRREARPELEHEDFGARSTG
FDRSGIREEVYSKPRMLSPPRSPPPPPPTLPPPPKTTPTVVKRRPMRTHKVHSHTPDGEIDRQHKNGDSDVADCQRIQLPPLSPPAFYQESEQKIGKNEKKRGGP
TKEFWTTALRRRKKKQRQKSIESFETILSSQRPSTSSLPPASPPPPPPLPPPSVFQNLFSSKKGKGRKGQSLTLPEPPPPSIASSEPKPEIGDQNQLSNLHEPPM
ELERLSSINDEEYNTRIGGESPFHPIPPPPLPFKFARHGNFDSSGSNSSTPRAVSPDMEESEADGPPAAGQMRLLKDSATPMFCSSPDVNSKADKFIERFRADLK
LQKMNSIKEKKARKRSNLGREPGPGPK