CuGenDBv2

Tan0008072 (gene) of Snake gourd v1 genome

Gene ID: Tan0008072
Organism: Trichosanthes anguina (Snake gourd v1)
Description: serine/arginine repetitive matrix protein 1-like
Genome location: LG06:1415824..1417924
RNA-Seq Expression: Tan0008072
Synteny: Tan0008072
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
InterPro domains: IPR008480 - Protein of unknown function DUF761, plant
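The genome location in this record follows a `sequence:start..end` pattern. The following is a minimal sketch of how such a string could be split and its span length computed. It assumes the coordinates are 1-based and inclusive, which the page does not state; `parse_location` and `span_length` are hypothetical helpers, not part of CuGenDB.

```python
import re

def parse_location(location):
    """Split a 'sequence:start..end' string into (sequence, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    seq_id, start, end = match.groups()
    return seq_id, int(start), int(end)

def span_length(start, end):
    # Assumption: coordinates are 1-based and inclusive.
    return end - start + 1

seq_id, start, end = parse_location("LG06:1415824..1417924")
print(seq_id, span_length(start, end))
```

Under the inclusive-coordinate assumption the gene spans 2101 bp on LG06.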


Homology
GenBank top hits
KAG6575459.1 hypothetical protein SDJN03_26098, partial [Cucurbita argyrosperma subsp. sororia] | e value: 8.6e-261 | % identity: 86.23
Query:  MEEDGNAPPPFWLQSSTSMDQVDYNRRR-RLSRASSFLLNSSAFLIVLLVIVLCFILIVIPKFVQFASQLIRPQSVKKSWDSLNLVLVLFAIVCGFLSRN
        MEEDGNAPPPFWLQ S S+ ++D +RRR RLSRASSFLLNSSAFL+VLLVIVLCFI IVIPKFVQF SQLIRPQS+KKSWDSLNLVLVLFAIVCGFLSRN
Subjt:  MEEDGNAPPPFWLQSSTSMDQVDYNRRR-RLSRASSFLLNSSAFLIVLLVIVLCFILIVIPKFVQFASQLIRPQSVKKSWDSLNLVLVLFAIVCGFLSRN

Query:  TGDDNRGSFEDRSVSSRRRMKSNPTTPRRWDGYSDHRPNHFTVNRMRSSSSYPDLRLQESSLDAGDERWRFYDDTHVHNHRFASSDQLHHRREARPELER
         GDD+R SFEDRSVSSRR +KSNP  PR+WDGY+DHRP H+TVNRMRSSSSYPDLRLQESSLDAGD+RWR YDDTHV N+RF SSDQLH RREARPELER
Subjt:  TGDDNRGSFEDRSVSSRRRMKSNPTTPRRWDGYSDHRPNHFTVNRMRSSSSYPDLRLQESSLDAGDERWRFYDDTHVHNHRFASSDQLHHRREARPELER

Query:  EDSGAKSIGFDRSEIREDVYSQPAIPSPPRSPPPRVSPPRSPSPPPTPPPPANTTPKVVKRRPKRTHKVHSHTPDGAIDQQQKNDDSDVADFRRIQLPPL
        EDS  KSIGFDRSEIREDVYSQ  IPSPPRSPPP+VSPPRSPSPPPTPPPPANTTPKVVKRRPKRTHKVHSHTP G IDQ  KN DSDVA+F+RI LPPL
Subjt:  EDSGAKSIGFDRSEIREDVYSQPAIPSPPRSPPPRVSPPRSPSPPPTPPPPANTTPKVVKRRPKRTHKVHSHTPDGAIDQQQKNDDSDVADFRRIQLPPL

Query:  SPPSFYRESEQKSGKNEKKRGGAPKEIWSALRRRKKKQRQKSVESFEAIIASQNASTSSLPPPSPPPPPPLPPPSVLQNLFSSKKGKAKKVQSTP----P
        SPP FYRESEQKS KNEKKRGGAPKEIWSALRRR+KKQRQKS+ESFEAI+ASQ  STSSLPPPSPPPPPPL  PSVLQ LF+SKKG+ KKVQSTP    P
Subjt:  SPPSFYRESEQKSGKNEKKRGGAPKEIWSALRRRKKKQRQKSVESFEAIIASQNASTSSLPPPSPPPPPPLPPPSVLQNLFSSKKGKAKKVQSTP----P

Query:  PSIVSSEPKPEIEDQNHLLKPHDPPMELERLSSLNDEEYNTRIGGESPFHPIPPPPPPPPPFRMHGDFDSVGSNSSTPRAISPDIDESEADGPPAAGEMK
        PSI SSEPKP IEDQNHLLKPH+PP+EL RL+SLNDEEY+TRIGGES FHPIPPPPPPPPPFRMHGDFDSVGSNSSTPRA+SPD+DESEADG PAAGE K
Subjt:  PSIVSSEPKPEIEDQNHLLKPHDPPMELERLSSLNDEEYNTRIGGESPFHPIPPPPPPPPPFRMHGDFDSVGSNSSTPRAISPDIDESEADGPPAAGEMK

Query:  LMKDSTIPMFCSSPDVNSKADNFIARFRADLKLQKMNSIKEKTARKRSNLGRTPGPGPK
         +K+STIPMFCSSPDVNSKAD FIARFRADLKLQKMNSIKEKTARKRSNLGRTPGPGP+
Subjt:  LMKDSTIPMFCSSPDVNSKADNFIARFRADLKLQKMNSIKEKTARKRSNLGRTPGPGPK

KAG6593534.1 hypothetical protein SDJN03_13010, partial [Cucurbita argyrosperma subsp. sororia] | e value: 6.2e-259 | % identity: 86.87
Query:  MEEDGNAPPPFWLQSSTSMDQVDYNRRRRLSRASSFLLNSSAFLIVLLVIVLCFILIVIPKFVQFASQLIRPQSVKKSWDSLNLVLVLFAIVCGFLSRNT
        ME DGNA PPFWLQSS+S  QV YNRRRRLSRASSFLLNSSAFLIVLLVIVLCF+LIVIPK VQFASQLIRPQSVKKSWDSLNLVLVLFAIVCGFLSRNT
Subjt:  MEEDGNAPPPFWLQSSTSMDQVDYNRRRRLSRASSFLLNSSAFLIVLLVIVLCFILIVIPKFVQFASQLIRPQSVKKSWDSLNLVLVLFAIVCGFLSRNT

Query:  GDDNRGSFEDRSVSSRRRMKSNPTTPRRWDGYSDHRPNHFTVNRMRSSSSYPDLRLQESSLDAGDERWRFYDDTHVHNHRFASSDQLHHRREARPELERE
        GDDNRG FEDRSVSSRRR+KSNPTTPR+WDGYSDHRPN +TVNRMRSSSSYPDLRLQESSLDAGDERWRFYDDTHV NHRFASSDQLH R +ARPELERE
Subjt:  GDDNRGSFEDRSVSSRRRMKSNPTTPRRWDGYSDHRPNHFTVNRMRSSSSYPDLRLQESSLDAGDERWRFYDDTHVHNHRFASSDQLHHRREARPELERE

Query:  DSGAKSIGFDRSEIREDVYSQPAIPSPPRSPPPRVSPPRSPSPPPTPPPPANTTPKVVKRRPKRTHKVHSHTPDGAIDQQQKNDDSDVADFRRIQLPPLS
        DS AKS GFDRSE+REDVYSQPAIPSPPR PPPR      PSPPPT   PA+TTPKV KRRPKRTH VHSHTPDGAIDQQQKNDDSDVADF+RI LPPLS
Subjt:  DSGAKSIGFDRSEIREDVYSQPAIPSPPRSPPPRVSPPRSPSPPPTPPPPANTTPKVVKRRPKRTHKVHSHTPDGAIDQQQKNDDSDVADFRRIQLPPLS

Query:  PPSFYRESEQKSGKNEKKRGGAPKEIWSALRRRKKKQRQKSVESFEAIIASQNASTSSLPPPSPPPPPPLPPPSVLQNLFSSKKGKAKKVQSTPPPSIVS
        PPSFY+ESEQKSGKNEKKRGGAPKEIWSALRRRKKKQRQKSVESFEAI   + +++SSLP PSPPPPPPLPPP VLQNLF SKKGKAKKVQS PPP+IV+
Subjt:  PPSFYRESEQKSGKNEKKRGGAPKEIWSALRRRKKKQRQKSVESFEAIIASQNASTSSLPPPSPPPPPPLPPPSVLQNLFSSKKGKAKKVQSTPPPSIVS

Query:  SEPKPEIEDQNHLLKPHDPPMELERLSSLNDEEYNTRIGGESPFHPIPPPPPPPPP--FRMHGDFDSVGSNSSTPRAISPDIDESEADGPPAAGEMKLMK
        SEPKPEIE QNHLLKP+DPPMELERLSSLNDEEYNTRIG +SPFH IPPPPPPPPP  FRMHGDFDS GSNSSTPRAISP+I ESE DGPPAAG+MK+ +
Subjt:  SEPKPEIEDQNHLLKPHDPPMELERLSSLNDEEYNTRIGGESPFHPIPPPPPPPPP--FRMHGDFDSVGSNSSTPRAISPDIDESEADGPPAAGEMKLMK

Query:  DSTIPMFCSSPDVNSKADNFIARFRADLKLQKMNSIKEKTARKRSNLGRTPGPGPK
         ST P+FCSSPDVNSKADNFIARF+ADLKLQKMNSIKE++ARKRSNLGR  GPGPK
Subjt:  DSTIPMFCSSPDVNSKADNFIARFRADLKLQKMNSIKEKTARKRSNLGRTPGPGPK

XP_022953834.1 protein enabled homolog [Cucurbita moschata] | e value: 5.6e-260 | % identity: 86.05
Query:  MEEDGNAPPPFWLQSSTSMDQVDYNRRR-RLSRASSFLLNSSAFLIVLLVIVLCFILIVIPKFVQFASQLIRPQSVKKSWDSLNLVLVLFAIVCGFLSRN
        MEEDGNAPPPFWLQ S S+ ++D +RRR RLSRASSFLLNSSAFL+VLLVIVLCFI IVIPKFVQF SQLIRPQS+KKSWDSLNLVLVLFAIVCGFLSRN
Subjt:  MEEDGNAPPPFWLQSSTSMDQVDYNRRR-RLSRASSFLLNSSAFLIVLLVIVLCFILIVIPKFVQFASQLIRPQSVKKSWDSLNLVLVLFAIVCGFLSRN

Query:  TGDDNRGSFEDRSVSSRRRMKSNPTTPRRWDGYSDHRPNHFTVNRMRSSSSYPDLRLQESSLDAGDERWRFYDDTHVHNHRFASSDQLHHRREARPELER
         GDD+R SFEDRSVSSRR +K+NP  PR+WDGY+DHRP H+TVNRMRSSSSYPDLRLQESSL AGD+R R YDDTHV N+RF  SDQL+ RREARPELER
Subjt:  TGDDNRGSFEDRSVSSRRRMKSNPTTPRRWDGYSDHRPNHFTVNRMRSSSSYPDLRLQESSLDAGDERWRFYDDTHVHNHRFASSDQLHHRREARPELER

Query:  EDSGAKSIGFDRSEIREDVYSQPAIPSPPRSPPPRVSPPRSPSPPPTPPPPANTTPKVVKRRPKRTHKVHSHTPDGAIDQQQKNDDSDVADFRRIQLPPL
        EDS  KSIGFDRSEIREDVYSQ  IPSPPRSPPP+VSPPRSPSPPPTPPPPANTTPKVVKRRPKRTHKVHSHTP G IDQ  KN DSDVA+F+RI LPPL
Subjt:  EDSGAKSIGFDRSEIREDVYSQPAIPSPPRSPPPRVSPPRSPSPPPTPPPPANTTPKVVKRRPKRTHKVHSHTPDGAIDQQQKNDDSDVADFRRIQLPPL

Query:  SPPSFYRESEQKSGKNEKKRGGAPKEIWSALRRRKKKQRQKSVESFEAIIASQNASTSSLPPPSPPPPPPLPPPSVLQNLFSSKKGKAKKVQSTP----P
        SPP FYRESEQKS KNEKKRGGAPKEIWSALRRR+KKQRQKS+ESFEAI+ASQ  STSSLPPPSPPPPPPLP PSVLQ LF+SKKG+ KKVQSTP    P
Subjt:  SPPSFYRESEQKSGKNEKKRGGAPKEIWSALRRRKKKQRQKSVESFEAIIASQNASTSSLPPPSPPPPPPLPPPSVLQNLFSSKKGKAKKVQSTP----P

Query:  PSIVSSEPKPEIEDQNHLLKPHDPPMELERLSSLNDEEYNTRIGGESPFHPIPPPPPPPPPFRMHGDFDSVGSNSSTPRAISPDIDESEADGPPAAGEMK
        PSI SSEPKP IEDQNHLLKPH+PP+EL RL+SLNDEEY+TRIGGESPFHPIPPPPPPPPPFRMHGDFDSVGSNSSTPRA+SPD+DESEADG PAAGE K
Subjt:  PSIVSSEPKPEIEDQNHLLKPHDPPMELERLSSLNDEEYNTRIGGESPFHPIPPPPPPPPPFRMHGDFDSVGSNSSTPRAISPDIDESEADGPPAAGEMK

Query:  LMKDSTIPMFCSSPDVNSKADNFIARFRADLKLQKMNSIKEKTARKRSNLGRTPGPGPK
        L+KDSTIPMFCSSPDVNSKAD FIARFRADLKLQKMNSIKEKTARKRSNLGRTPGPGP+
Subjt:  LMKDSTIPMFCSSPDVNSKADNFIARFRADLKLQKMNSIKEKTARKRSNLGRTPGPGPK

XP_023548433.1 protein enabled homolog [Cucurbita pepo subsp. pepo] | e value: 7.8e-262 | % identity: 86.23
Query:  MEEDGNAPPPFWLQSSTSMDQVDYNRRR-RLSRASSFLLNSSAFLIVLLVIVLCFILIVIPKFVQFASQLIRPQSVKKSWDSLNLVLVLFAIVCGFLSRN
        MEEDGNAPPPFWLQ S S+ ++D +RRR RLSRASSFLLNSSAFL+VLLVIVLCFI IVIPKFVQF SQLIRPQSVKKSWDSLNLVLVLFAIVCGFLSRN
Subjt:  MEEDGNAPPPFWLQSSTSMDQVDYNRRR-RLSRASSFLLNSSAFLIVLLVIVLCFILIVIPKFVQFASQLIRPQSVKKSWDSLNLVLVLFAIVCGFLSRN

Query:  TGDDNRGSFEDRSVSSRRRMKSNPTTPRRWDGYSDHRPNHFTVNRMRSSSSYPDLRLQESSLDAGDERWRFYDDTHVHNHRFASSDQLHHRREARPELER
         G+D+R SFEDRSVSSRR +KSNP  PR+WDGY+DHRP H+TVNRMRSSSSYPDLRLQESSLDAGD++WR YDDTHV N+RF SSDQLH RREARPELER
Subjt:  TGDDNRGSFEDRSVSSRRRMKSNPTTPRRWDGYSDHRPNHFTVNRMRSSSSYPDLRLQESSLDAGDERWRFYDDTHVHNHRFASSDQLHHRREARPELER

Query:  EDSGAKSIGFDRSEIREDVYSQPAIPSPPRSPPPRVSPPRSPSPPPTPPPPANTTPKVVKRRPKRTHKVHSHTPDGAIDQQQKNDDSDVADFRRIQLPPL
        EDS  KSIGFDRSE+REDVYSQ  IPSPPRSPPP+VSPPRSPSPPPTPPPPANTTPKVVKRRPKRTHKVHSHTP G IDQ  KN DSDVA+F+RI LPPL
Subjt:  EDSGAKSIGFDRSEIREDVYSQPAIPSPPRSPPPRVSPPRSPSPPPTPPPPANTTPKVVKRRPKRTHKVHSHTPDGAIDQQQKNDDSDVADFRRIQLPPL

Query:  SPPSFYRESEQKSGKNEKKRGGAPKEIWSALRRRKKKQRQKSVESFEAIIASQNASTSSLPPPSPPPPPPLPPPSVLQNLFSSKKGKAKKVQSTP----P
        SPP FYRESEQKS KN+KKRGGAPKEIWSALRRR+KKQRQKS+ESFE I+ASQ  STSSLPPPSPPPPPPLP PSVLQ LF+SKKGK KKVQSTP    P
Subjt:  SPPSFYRESEQKSGKNEKKRGGAPKEIWSALRRRKKKQRQKSVESFEAIIASQNASTSSLPPPSPPPPPPLPPPSVLQNLFSSKKGKAKKVQSTP----P

Query:  PSIVSSEPKPEIEDQNHLLKPHDPPMELERLSSLNDEEYNTRIGGESPFHPIPPPPPPPPPFRMHGDFDSVGSNSSTPRAISPDIDESEADGPPAAGEMK
        PSI S EPKP IEDQNHLLKPH+PP+EL RLSSLNDEEY+TRIGGESPFHPIPPPPPPPPPFRMHGDFDSVGSNSSTPRA+SPD+ ESEADG PAAGE K
Subjt:  PSIVSSEPKPEIEDQNHLLKPHDPPMELERLSSLNDEEYNTRIGGESPFHPIPPPPPPPPPFRMHGDFDSVGSNSSTPRAISPDIDESEADGPPAAGEMK

Query:  LMKDSTIPMFCSSPDVNSKADNFIARFRADLKLQKMNSIKEKTARKRSNLGRTPGPGPK
        L+KDSTIPMFCSSPDVNSKAD FIARFRADLKLQKMNSIKEKTARKRSNLGRTPGPGP+
Subjt:  LMKDSTIPMFCSSPDVNSKADNFIARFRADLKLQKMNSIKEKTARKRSNLGRTPGPGPK

XP_038896222.1 serine/arginine repetitive matrix protein 1-like [Benincasa hispida] | e value: 4.0e-266 | % identity: 87.84
Query:  MEEDGNAPPPFWLQSSTSMDQVDYNRRRRLSRASSFLLNSSAFLIVLLVIVLCFILIVIPKFVQFASQLIRPQSVKKSWDSLNLVLVLFAIVCGFLSRNT
        MEEDGNAPPPFWLQSS S+ ++DYNRRRRLSRASSFLLNSSAFLIVLLVIVLCFILIVIPKFVQF SQLIRPQSVKKSWDSLNL+LVLFAIVCGFLSRNT
Subjt:  MEEDGNAPPPFWLQSSTSMDQVDYNRRRRLSRASSFLLNSSAFLIVLLVIVLCFILIVIPKFVQFASQLIRPQSVKKSWDSLNLVLVLFAIVCGFLSRNT

Query:  GDDNRGSFEDRSVSSRRRMKSNPTTPRRWDGYSDHRPNHFTVNRMRSSSSYPDLRLQESSLDAGDERWRFYDDTHVHNHRFASSDQLHHRREARPELERE
        GDD+R SFED SVSSRR MKSNPTTPRRWDGY+DHRPNH+T+NRMRSSSSYPDLRLQES+ DAGD RWRFYDDTHV NHR+ SSDQLH RRE RPELER 
Subjt:  GDDNRGSFEDRSVSSRRRMKSNPTTPRRWDGYSDHRPNHFTVNRMRSSSSYPDLRLQESSLDAGDERWRFYDDTHVHNHRFASSDQLHHRREARPELERE

Query:  DSGAKSIGFDRSEIREDVYSQPAIPSP--PRSPPPRVSPPRSPSPPPTPPPPANTT--PKVVKRRPKRTHKVHSHTPDGAIDQQQKNDDSDVADFRRIQL
        DS AKSIGFDRSEIREDVYSQPAIPSP  PRSPPPRVSPPR PSPPPTPPPPANTT  PKVVKRRPKRTHKVHSHTPD  IDQQ +N DSDVA+F+RIQL
Subjt:  DSGAKSIGFDRSEIREDVYSQPAIPSP--PRSPPPRVSPPRSPSPPPTPPPPANTT--PKVVKRRPKRTHKVHSHTPDGAIDQQQKNDDSDVADFRRIQL

Query:  PPLSPPSFYRESEQKSGKNEKKRGGAPKEIWSALRRRKKKQRQKSVESFEAIIASQNASTSSLPPPSPPPPPPLPPPSVLQNLFSSKKGKAKKVQSTPPP
        PPLSPPSFYRESEQKS +NEKKRGGA KEIWSALRRRKKKQRQKS+ESFEAIIASQ AST    P SPPPPPPLP PSVLQNLFSSKKGK KKVQSTPPP
Subjt:  PPLSPPSFYRESEQKSGKNEKKRGGAPKEIWSALRRRKKKQRQKSVESFEAIIASQNASTSSLPPPSPPPPPPLPPPSVLQNLFSSKKGKAKKVQSTPPP

Query:  S-IVSSEPKPEIEDQNHLLKPHDPPMELERLSSLNDEEYNTRIGGESPFHPIPPPPPPPPPFRMHGDFDSVGSNSSTPRAISPDIDESEADGPPAAGEMK
            SSEPKP+ ED+N +LKPH+PPMEL+RLSSLNDEEYNTRIGGESP+HPIPPPPPPPPPFRMHGDFDSVGSNSSTPRAISP++DESEADGPPA GE K
Subjt:  S-IVSSEPKPEIEDQNHLLKPHDPPMELERLSSLNDEEYNTRIGGESPFHPIPPPPPPPPPFRMHGDFDSVGSNSSTPRAISPDIDESEADGPPAAGEMK

Query:  LMKDSTIPMFCSSPDVNSKADNFIARFRADLKLQKMNSIKEKTARKRSNLGRTPGPGPK
        L+KDSTIP+FCSSPDVNSKAD FIARFRADLKLQKMNSIKEKTARKRSNLGRT GPGPK
Subjt:  LMKDSTIPMFCSSPDVNSKADNFIARFRADLKLQKMNSIKEKTARKRSNLGRTPGPGPK

TrEMBL top hits
A0A1S3CII2 LOW QUALITY PROTEIN: serine/arginine repetitive matrix protein 1-like | e value: 2.8e-257 | % identity: 85.61
Query:  MEEDGNA-PPPFWLQSS-TSMDQVDYNRRRRLSRASSFLLNSSAFLIVLLVIVLCFILIVIPKFVQFASQLIRPQSVKKSWDSLNLVLVLFAIVCGFLSR
        MEEDGNA  PPFWLQSS +S+ ++ Y+RRRRLSRASSFLLNSSAFLIVLLVIVLCFILIVIPKFVQF SQLIRPQSVKKSWDSLNL+LVLFAIVCGFL R
Subjt:  MEEDGNA-PPPFWLQSS-TSMDQVDYNRRRRLSRASSFLLNSSAFLIVLLVIVLCFILIVIPKFVQFASQLIRPQSVKKSWDSLNLVLVLFAIVCGFLSR

Query:  NT-GDDNRGSFEDRSVSSRRRMKSNPTTPRRWDGYSDHRPNHFTVNRMRSSSSYPDLRLQESSLDAGDERWRFYDDTHVHNHRFASSDQLHHRREARPEL
        N  GDD+RGSFEDRSVSSRR MKSNPTTPRRWDGY+DHRPNHFT+NRMRSSSSYPDLRLQESS DAGD RWRFYDDTHV NHR++SSDQLH RRE +PEL
Subjt:  NT-GDDNRGSFEDRSVSSRRRMKSNPTTPRRWDGYSDHRPNHFTVNRMRSSSSYPDLRLQESSLDAGDERWRFYDDTHVHNHRFASSDQLHHRREARPEL

Query:  EREDSGAKSIGFDRSEIREDVYSQPAIPSPPRSPPPRVSPPRSPSPPPTPPPPANTTPKVVKRRPKRTHKVHSHTPDGAIDQQQKNDDSDVADFRRIQLP
        ER+DS AKSI FDRSEIR DVYS+P IPSPPRSPPP+VSPPR PSPPPTPPPPANT PK+VKRRPKRTHKVHSHTP+  I+QQ +N DSDVA+F+RIQLP
Subjt:  EREDSGAKSIGFDRSEIREDVYSQPAIPSPPRSPPPRVSPPRSPSPPPTPPPPANTTPKVVKRRPKRTHKVHSHTPDGAIDQQQKNDDSDVADFRRIQLP

Query:  PLSPPSFYRESEQKSGKNEKKRGGAPKEIWSALRRRKKKQRQKSVESFEAIIASQNASTSSLPPPS--PPPPPPLPPPSVLQNLFSSKKGKAKKVQST--
        PLSPP FYRESEQKS KNEKKR GA KEIWSALRRRKKKQRQKSVESFEAIIASQ ASTSSLPPPS  PPPPPPLP PSVLQNLFSS+KGK KKVQST  
Subjt:  PLSPPSFYRESEQKSGKNEKKRGGAPKEIWSALRRRKKKQRQKSVESFEAIIASQNASTSSLPPPS--PPPPPPLPPPSVLQNLFSSKKGKAKKVQST--

Query:  ---PPPSIVSSEPKPEIEDQNHLLKPHDPPMELERLSSLNDEEYNTRIGGESPFHPIPPPPPPPPPFRMHGDFDSVGSNSSTPRAISPDIDESEADGPPA
           PPPSI SSEPKP+ EDQN +LKP DPPMEL+RLSSLNDEEY+TRIGGESP+HPIPPPPPPPPPFRMHGDFDSVGSNSSTPRAISP++DESEAD PPA
Subjt:  ---PPPSIVSSEPKPEIEDQNHLLKPHDPPMELERLSSLNDEEYNTRIGGESPFHPIPPPPPPPPPFRMHGDFDSVGSNSSTPRAISPDIDESEADGPPA

Query:  AGEMKLMKDSTIPMFCSSPDVNSKADNFIARFRADLKLQKMNSIKEKTARKRSNLGRTPGPGP
          E KL+KD TIPMFCSSPDVNSKAD FIARFRADLKLQKMNSIKEKT RKRSNLGRT GPGP
Subjt:  AGEMKLMKDSTIPMFCSSPDVNSKADNFIARFRADLKLQKMNSIKEKTARKRSNLGRTPGPGP

A0A5A7V0Q3 Serine/arginine repetitive matrix protein 1-like | e value: 1.1e-256 | % identity: 85.44
Query:  MEEDGNA-PPPFWLQSS-TSMDQVDYNRRRRLSRASSFLLNSSAFLIVLLVIVLCFILIVIPKFVQFASQLIRPQSVKKSWDSLNLVLVLFAIVCGFLSR
        MEEDGNA  PPFWLQSS +S+ ++ Y+RRRRLSRASSFLLNSSAFLIVLLVIVLCFILIVIPKFVQF SQLIRPQSVKKSWDSLNL+LVLFAIVCGFL R
Subjt:  MEEDGNA-PPPFWLQSS-TSMDQVDYNRRRRLSRASSFLLNSSAFLIVLLVIVLCFILIVIPKFVQFASQLIRPQSVKKSWDSLNLVLVLFAIVCGFLSR

Query:  NT-GDDNRGSFEDRSVSSRRRMKSNPTTPRRWDGYSDHRPNHFTVNRMRSSSSYPDLRLQESSLDAGDERWRFYDDTHVHNHRFASSDQLHHRREARPEL
        N  GDD+RGSFEDRSVSSRR MKSNPTTPRRWDGY+DHRPNHFT+NRMRSSSSYPDLRLQESS DAGD +WRFYDDTHV NHR++SSDQLH RRE +PEL
Subjt:  NT-GDDNRGSFEDRSVSSRRRMKSNPTTPRRWDGYSDHRPNHFTVNRMRSSSSYPDLRLQESSLDAGDERWRFYDDTHVHNHRFASSDQLHHRREARPEL

Query:  EREDSGAKSIGFDRSEIREDVYSQPAIPSPPRSPPPRVSPPRSPSPPPTPPPPANTTPKVVKRRPKRTHKVHSHTPDGAIDQQQKNDDSDVADFRRIQLP
        ER+DS AKSI FDRSEIR DVYS+P IPSPPRSPPP+VSPPR PSPPPTPPPPANT PK+VKRRPKRTHKVHSHTP+  I+QQ +N DSDVA+F+RIQLP
Subjt:  EREDSGAKSIGFDRSEIREDVYSQPAIPSPPRSPPPRVSPPRSPSPPPTPPPPANTTPKVVKRRPKRTHKVHSHTPDGAIDQQQKNDDSDVADFRRIQLP

Query:  PLSPPSFYRESEQKSGKNEKKRGGAPKEIWSALRRRKKKQRQKSVESFEAIIASQNASTSSLPPPS--PPPPPPLPPPSVLQNLFSSKKGKAKKVQST--
        PLSPP FYRESEQKS KNEKKR GA KEIWSALRRRKKKQRQKSVESFEAIIASQ ASTSSLPPPS  PPPPPPLP PSVLQNLFSS+KGK KKVQST  
Subjt:  PLSPPSFYRESEQKSGKNEKKRGGAPKEIWSALRRRKKKQRQKSVESFEAIIASQNASTSSLPPPS--PPPPPPLPPPSVLQNLFSSKKGKAKKVQST--

Query:  ---PPPSIVSSEPKPEIEDQNHLLKPHDPPMELERLSSLNDEEYNTRIGGESPFHPIPPPPPPPPPFRMHGDFDSVGSNSSTPRAISPDIDESEADGPPA
           PPPSI SSEPKP+ EDQN +LKP DPPMEL+RLSSLNDEEY+TRIGGESP+HPIPPPPPPPPPFRMHGDFDSVGSNSSTPRAISP++DESEAD PPA
Subjt:  ---PPPSIVSSEPKPEIEDQNHLLKPHDPPMELERLSSLNDEEYNTRIGGESPFHPIPPPPPPPPPFRMHGDFDSVGSNSSTPRAISPDIDESEADGPPA

Query:  AGEMKLMKDSTIPMFCSSPDVNSKADNFIARFRADLKLQKMNSIKEKTARKRSNLGRTPGPGP
          E KL+KD TIPMFCSSPDVNSKAD FIARFRADLKLQKMNSIKEKT RKRSNLGRT GPGP
Subjt:  AGEMKLMKDSTIPMFCSSPDVNSKADNFIARFRADLKLQKMNSIKEKTARKRSNLGRTPGPGP

A0A6J1GQY1 protein enabled homolog | e value: 2.7e-260 | % identity: 86.05
Query:  MEEDGNAPPPFWLQSSTSMDQVDYNRRR-RLSRASSFLLNSSAFLIVLLVIVLCFILIVIPKFVQFASQLIRPQSVKKSWDSLNLVLVLFAIVCGFLSRN
        MEEDGNAPPPFWLQ S S+ ++D +RRR RLSRASSFLLNSSAFL+VLLVIVLCFI IVIPKFVQF SQLIRPQS+KKSWDSLNLVLVLFAIVCGFLSRN
Subjt:  MEEDGNAPPPFWLQSSTSMDQVDYNRRR-RLSRASSFLLNSSAFLIVLLVIVLCFILIVIPKFVQFASQLIRPQSVKKSWDSLNLVLVLFAIVCGFLSRN

Query:  TGDDNRGSFEDRSVSSRRRMKSNPTTPRRWDGYSDHRPNHFTVNRMRSSSSYPDLRLQESSLDAGDERWRFYDDTHVHNHRFASSDQLHHRREARPELER
         GDD+R SFEDRSVSSRR +K+NP  PR+WDGY+DHRP H+TVNRMRSSSSYPDLRLQESSL AGD+R R YDDTHV N+RF  SDQL+ RREARPELER
Subjt:  TGDDNRGSFEDRSVSSRRRMKSNPTTPRRWDGYSDHRPNHFTVNRMRSSSSYPDLRLQESSLDAGDERWRFYDDTHVHNHRFASSDQLHHRREARPELER

Query:  EDSGAKSIGFDRSEIREDVYSQPAIPSPPRSPPPRVSPPRSPSPPPTPPPPANTTPKVVKRRPKRTHKVHSHTPDGAIDQQQKNDDSDVADFRRIQLPPL
        EDS  KSIGFDRSEIREDVYSQ  IPSPPRSPPP+VSPPRSPSPPPTPPPPANTTPKVVKRRPKRTHKVHSHTP G IDQ  KN DSDVA+F+RI LPPL
Subjt:  EDSGAKSIGFDRSEIREDVYSQPAIPSPPRSPPPRVSPPRSPSPPPTPPPPANTTPKVVKRRPKRTHKVHSHTPDGAIDQQQKNDDSDVADFRRIQLPPL

Query:  SPPSFYRESEQKSGKNEKKRGGAPKEIWSALRRRKKKQRQKSVESFEAIIASQNASTSSLPPPSPPPPPPLPPPSVLQNLFSSKKGKAKKVQSTP----P
        SPP FYRESEQKS KNEKKRGGAPKEIWSALRRR+KKQRQKS+ESFEAI+ASQ  STSSLPPPSPPPPPPLP PSVLQ LF+SKKG+ KKVQSTP    P
Subjt:  SPPSFYRESEQKSGKNEKKRGGAPKEIWSALRRRKKKQRQKSVESFEAIIASQNASTSSLPPPSPPPPPPLPPPSVLQNLFSSKKGKAKKVQSTP----P

Query:  PSIVSSEPKPEIEDQNHLLKPHDPPMELERLSSLNDEEYNTRIGGESPFHPIPPPPPPPPPFRMHGDFDSVGSNSSTPRAISPDIDESEADGPPAAGEMK
        PSI SSEPKP IEDQNHLLKPH+PP+EL RL+SLNDEEY+TRIGGESPFHPIPPPPPPPPPFRMHGDFDSVGSNSSTPRA+SPD+DESEADG PAAGE K
Subjt:  PSIVSSEPKPEIEDQNHLLKPHDPPMELERLSSLNDEEYNTRIGGESPFHPIPPPPPPPPPFRMHGDFDSVGSNSSTPRAISPDIDESEADGPPAAGEMK

Query:  LMKDSTIPMFCSSPDVNSKADNFIARFRADLKLQKMNSIKEKTARKRSNLGRTPGPGPK
        L+KDSTIPMFCSSPDVNSKAD FIARFRADLKLQKMNSIKEKTARKRSNLGRTPGPGP+
Subjt:  LMKDSTIPMFCSSPDVNSKADNFIARFRADLKLQKMNSIKEKTARKRSNLGRTPGPGPK

A0A6J1HGU6 serine/arginine repetitive matrix protein 1-like | e value: 3.9e-259 | % identity: 86.87
Query:  MEEDGNAPPPFWLQSSTSMDQVDYNRRRRLSRASSFLLNSSAFLIVLLVIVLCFILIVIPKFVQFASQLIRPQSVKKSWDSLNLVLVLFAIVCGFLSRNT
        ME DGNA PPFWLQSS+S  QV YNRRRRLSRASSFLLNSSAFLIVLLVIVLCF+LIVIPK VQFASQLIRPQSVKKSWDSLNLVLVLFAIVCGFLSRNT
Subjt:  MEEDGNAPPPFWLQSSTSMDQVDYNRRRRLSRASSFLLNSSAFLIVLLVIVLCFILIVIPKFVQFASQLIRPQSVKKSWDSLNLVLVLFAIVCGFLSRNT

Query:  GDDNRGSFEDRSVSSRRRMKSNPTTPRRWDGYSDHRPNHFTVNRMRSSSSYPDLRLQESSLDAGDERWRFYDDTHVHNHRFASSDQLHHRREARPELERE
        GDDNRG FEDRSVSSRRR+KSNPTTPR+WDGYSDHRPN +TVNRMRSSSSYPDLRLQESSLDAGDERWRFYDDTHV NHRFASSDQLH R +ARPELERE
Subjt:  GDDNRGSFEDRSVSSRRRMKSNPTTPRRWDGYSDHRPNHFTVNRMRSSSSYPDLRLQESSLDAGDERWRFYDDTHVHNHRFASSDQLHHRREARPELERE

Query:  DSGAKSIGFDRSEIREDVYSQPAIPSPPRSPPPRVSPPRSPSPPPTPPPPANTTPKVVKRRPKRTHKVHSHTPDGAIDQQQKNDDSDVADFRRIQLPPLS
        DS AKS GFDRSE+REDVYSQPAIPSPPR P     PPRSPSPPPT   PA+TTPKVVKRRPKRTH VHSHTPDGAIDQQQKNDDSDVADF+RI LPPLS
Subjt:  DSGAKSIGFDRSEIREDVYSQPAIPSPPRSPPPRVSPPRSPSPPPTPPPPANTTPKVVKRRPKRTHKVHSHTPDGAIDQQQKNDDSDVADFRRIQLPPLS

Query:  PPSFYRESEQKSGKNEKKRGGAPKEIWSALRRRKKKQRQKSVESFEAIIASQNASTSSLPPPSPPPPPPLPPPSVLQNLFSSKKGKAKKVQSTPPPSIVS
        PPSFY+ESEQKSGKNEKKRGGAPKEIWS LRRRKKKQRQKSVESFEAI   + +++SSLP PSPPPPPPLPPP VLQNLF SKKGKAKKVQS PPP+IV+
Subjt:  PPSFYRESEQKSGKNEKKRGGAPKEIWSALRRRKKKQRQKSVESFEAIIASQNASTSSLPPPSPPPPPPLPPPSVLQNLFSSKKGKAKKVQSTPPPSIVS

Query:  SEPKPEIEDQNHLLKPHDPPMELERLSSLNDEEYNTRIGGESPFHPIPPPPPPPPP--FRMHGDFDSVGSNSSTPRAISPDIDESEADGPPAAGEMKLMK
        SEPKPEIE QNHLLKP+DPPMELERLSSLNDEEYNTRIG +SPFH IPPPPPPPPP  FRMHGDFDS GSNS TPRAISP+I ESE DGPPAAG+MK+ +
Subjt:  SEPKPEIEDQNHLLKPHDPPMELERLSSLNDEEYNTRIGGESPFHPIPPPPPPPPP--FRMHGDFDSVGSNSSTPRAISPDIDESEADGPPAAGEMKLMK

Query:  DSTIPMFCSSPDVNSKADNFIARFRADLKLQKMNSIKEKTARKRSNLGRTPGPGPK
         ST P+FCSSPDVNSKADNFIARF+ADLKLQKMNSIKE++ARKRSNLGR  GPGPK
Subjt:  DSTIPMFCSSPDVNSKADNFIARFRADLKLQKMNSIKEKTARKRSNLGRTPGPGPK

A0A6J1KKJ2 serine/arginine repetitive matrix protein 1-like | e value: 9.6e-258 | % identity: 86.81
Query:  MEEDGNAPPPFWLQSSTSMDQVDYNRRRRLSRASSFLLNSSAFLIVLLVIVLCFILIVIPKFVQFASQLIRPQSVKKSWDSLNLVLVLFAIVCGFLSRNT
        ME DGNA PPFWLQSS S  QV YNRRRRLSRASSFLLNSSAFL VLLVIVLCF+LIVIPK VQFASQLIRPQSVKKSWDSLNLVLVLFAIVCGFLSRNT
Subjt:  MEEDGNAPPPFWLQSSTSMDQVDYNRRRRLSRASSFLLNSSAFLIVLLVIVLCFILIVIPKFVQFASQLIRPQSVKKSWDSLNLVLVLFAIVCGFLSRNT

Query:  GDDNRGSFEDRSVSSRRRMKSNPTTPRRWDGYSDHRPNHFTVNRMRSSSSYPDLRLQESSLDAGDERWRFYDDTHVHNHRFASSDQLHHRREARPELERE
        GDDNRG FEDRSVSSRRR+KSNPTTPR+WDGY DHRPNH+TVNRMRSSSSYPDLRLQESSLDAGDERWRFYDDTHV NHRFASSDQLH R +ARPELERE
Subjt:  GDDNRGSFEDRSVSSRRRMKSNPTTPRRWDGYSDHRPNHFTVNRMRSSSSYPDLRLQESSLDAGDERWRFYDDTHVHNHRFASSDQLHHRREARPELERE

Query:  DSGAKSIGFDRSEIREDVYSQPAIPSPPRSPPPRVSPPRSPSPPPTPPPPANTTPKVVKRRPKRTHKVHSHTPDGAIDQQQKNDDSDVADFRRIQLPPLS
        DSGAKS GFDRSE+ EDVYSQPAIPSPPR P     PPRSPSPPPT   PA+TTPKVVKRRPKRTH VHSHTPDGAIDQQQKNDDSDVADF+RI LPPLS
Subjt:  DSGAKSIGFDRSEIREDVYSQPAIPSPPRSPPPRVSPPRSPSPPPTPPPPANTTPKVVKRRPKRTHKVHSHTPDGAIDQQQKNDDSDVADFRRIQLPPLS

Query:  PPSFYRESEQKSGKNEKKRGGAPKEIWSALRRRKKKQRQKSVESFEAIIASQNASTSSLPPPSPPPPPPLPPPSVLQNLFSSKKGKAKKVQS-----TPP
        PPSFY+ESEQKSGKNEKKRGGAPKEIWSALRRRKKKQRQKSVESFEA IA+  ASTSSLP  SPPPPPPLPPP VLQNLF SKKGKAKKVQS     +PP
Subjt:  PPSFYRESEQKSGKNEKKRGGAPKEIWSALRRRKKKQRQKSVESFEAIIASQNASTSSLPPPSPPPPPPLPPPSVLQNLFSSKKGKAKKVQS-----TPP

Query:  PSIVSSEPKPEIEDQNHLLKPHDPPMELERLSSLNDEEYNTRIGGESPFHPIPPPPPPPPP--FRMHGDFDSVGSNSSTPRAISPDIDESEADGPPAAGE
        P+IV+SEPKPEIE QNH LKP+DPPMELERLSSLNDEEYNTRIG +SPFH IPPPPPPPPP  FRMHGDFDS GSNSSTPRAISP+IDESE DGPPAAG+
Subjt:  PSIVSSEPKPEIEDQNHLLKPHDPPMELERLSSLNDEEYNTRIGGESPFHPIPPPPPPPPP--FRMHGDFDSVGSNSSTPRAISPDIDESEADGPPAAGE

Query:  MKLMKDSTIPMFCSSPDVNSKADNFIARFRADLKLQKMNSIKEKTARKRSNLGRTPGPGPK
        MK+ + ST P+FCSSPDVNSKAD FIARF+ADLKLQKMNSIKE++ARKRSNLGRT GPGPK
Subjt:  MKLMKDSTIPMFCSSPDVNSKADNFIARFRADLKLQKMNSIKEKTARKRSNLGRTPGPGPK

SwissProt top hits
No hits found

Arabidopsis top hits
AT1G72790.1 hydroxyproline-rich glycoprotein family protein | e value: 8.8e-70 | % identity: 37.73
Query:  EEDGNAPPPFWLQSSTSMDQVDYNRRRRLSRASSFLLNSSAFLIVLLVIVLCFILIVIPKFVQFASQLIRPQSVKKSWDSLNLVLVLFAIVCGFLSRNTG
        E+DG+A  PFWLQS    +   + R   L   ++ +     F     ++++ FI   IP F    SQ+ RP  V+KSWD LN VLVLFA++CGFLSRNT 
Subjt:  EEDGNAPPPFWLQSSTSMDQVDYNRRRRLSRASSFLLNSSAFLIVLLVIVLCFILIVIPKFVQFASQLIRPQSVKKSWDSLNLVLVLFAIVCGFLSRNTG

Query:  DD-----------NRGSFEDRSVSSRRRMKSNPTTPRRWD----GYSDHRPNHFTVNRMRSSSSYPDLRLQESSLDAGDERWRFYDDTHVHNHRFASSDQ
        +D           N+ S     +  R R+ ++ TTPR W+    G    +  +   +R+RS SSYPDLRL+E      DERWRFYDDT V   R+   D 
Subjt:  DD-----------NRGSFEDRSVSSRRRMKSNPTTPRRWD----GYSDHRPNHFTVNRMRSSSSYPDLRLQESSLDAGDERWRFYDDTHVHNHRFASSDQ

Query:  LHHRR-------EARPELEREDSGAKSIGFDRSEIRE----------------DVYSQPAIPSPPRSPPPRVSPPRSPSPPPTPPPPANTTPKVVKRRPK
        ++  +       E +P  E  D        + S++R                 +V  +  +PS         +PP  PSPPP+PP      P   K+  +
Subjt:  LHHRR-------EARPELEREDSGAKSIGFDRSEIRE----------------DVYSQPAIPSPPRSPPPRVSPPRSPSPPPTPPPPANTTPKVVKRRPK

Query:  RTHKVHSHTPDGAIDQQQKNDDSDVADFRRIQLPPLSPPSFYRESEQKSGKNEKKRGGAPKEIWSALRRRKKKQRQKSVESFEAIIASQNASTSSLPPP-
        +T++V+    D +  +++K  D  VA        P+ PP+      QKS K EKK+GGA K+   ALRR+KKKQRQ+S++  + +  S         PP 
Subjt:  RTHKVHSHTPDGAIDQQQKNDDSDVADFRRIQLPPLSPPSFYRESEQKSGKNEKKRGGAPKEIWSALRRRKKKQRQKSVESFEAIIASQNASTSSLPPP-

Query:  --SPPPPPPLPPPSVLQNLFSSKKGKAKKVQSTPPPSIVSSEPKPEIEDQNHLLKPHDPPMELERLSSLNDEEYNTR---IGGESPFHPIPPPPPPPP--
          SPPPPPP PPP   Q LFSSKKGK+KK  S PPP      P+   E +    K    P+E  R S  N     T+    G ESP  PIPPPPPPPP  
Subjt:  --SPPPPPPLPPPSVLQNLFSSKKGKAKKVQSTPPPSIVSSEPKPEIEDQNHLLKPHDPPMELERLSSLNDEEYNTR---IGGESPFHPIPPPPPPPP--

Query:  ----PFRMHGDFDSVGSNSSTPRAISPDIDESEADGPPAAGEMKLMKDSTIPMFCSSPDVNSKADNFIARFRADLKLQKMNSIKEKTARKRSNLGRTPG
             F   GD+  + S+ S        I   E D P  A +    K++   MFC SPDV++KAD+FIARFRA LKL+KMNS+K    R RSNLG  PG
Subjt:  ----PFRMHGDFDSVGSNSSTPRAISPDIDESEADGPPAAGEMKLMKDSTIPMFCSSPDVNSKADNFIARFRADLKLQKMNSIKEKTARKRSNLGRTPG

AT5G57070.1 hydroxyproline-rich glycoprotein family protein | e value: 5.3e-43 | % identity: 31.27
Query:  PPPFWLQSSTSMDQVDYNRRRRLSRASSFLLNSSAFLIVLLVIVLCFILIVIPKFVQFASQLIRPQSVKKSWDSLNLVLVLFAIVCGFLSRNTGD----D
        PP  W Q     D   Y RRR    A   +L  +   +    I L F+  V+P F+   SQ+++P SVK+ WDS+N+VLV+FAI+CG L+R   D    +
Subjt:  PPPFWLQSSTSMDQVDYNRRRRLSRASSFLLNSSAFLIVLLVIVLCFILIVIPKFVQFASQLIRPQSVKKSWDSLNLVLVLFAIVCGFLSRNTGD----D

Query:  NRGSFEDRSVSS-------------RRRMKSNPTTPRRW--DGYSDHR----------------PNHFTVNRMRSSSSYPDLRLQESSLDAGDERWRFYD
        +    E+  V                +   S+ T   +W  D Y   R                P    V   RSSSSYPDLR Q    + GD R+RFYD
Subjt:  NRGSFEDRSVSS-------------RRRMKSNPTTPRRW--DGYSDHR----------------PNHFTVNRMRSSSSYPDLRLQESSLDAGDERWRFYD

Query:  DTHVHNHRFA-SSDQLHHRREARPELEREDSGAKSIGFDRSEIREDVYSQPAIPSPPRSPPPRVSPPRSPSPPPTPPPPANTTPKVVKRRPKRTHKVHSH
        D  +  +R   SS     +  ++ E+E E+S  K I  D   ++          SPP+ PP         +PPP PPPP    P  V ++P+RTH+   +
Subjt:  DTHVHNHRFA-SSDQLHHRREARPELEREDSGAKSIGFDRSEIREDVYSQPAIPSPPRSPPPRVSPPRSPSPPPTPPPPANTTPKVVKRRPKRTHKVHSH

Query:  TPDGAIDQQQKNDDSDVADFRRIQLPPLSPPS----------FYRESEQKSGKNEKKRGGAPKEI-------WSALRRRKKKQRQKSVESFEAIIASQNA
                 Q+N       F+R   PP SPP                 +K G  ++++  A KEI       ++  +++KK Q+ K  E  E+    ++ 
Subjt:  TPDGAIDQQQKNDDSDVADFRRIQLPPLSPPS----------FYRESEQKSGKNEKKRGGAPKEI-------WSALRRRKKKQRQKSVESFEAIIASQNA

Query:  S-----TSSLPPPSPPPPPPLPPP------SVLQNLFSSKKGKAKKVQSTPPPSIVSSEPKPEIEDQNHLLKPHDPPMELER---LSSLNDEEYNTRIGG
        +      S +PPPSPPPPPP PPP      SV   LF       KK+ S P P      P P    Q     P  PP  ++          + +N    G
Subjt:  S-----TSSLPPPSPPPPPPLPPP------SVLQNLFSSKKGKAKKVQSTPPPSIVSSEPKPEIEDQNHLLKPHDPPMELER---LSSLNDEEYNTRIGG

Query:  E-SPFHPIPPPPPPPPPFR-------MHGDFDSVGSNSSTPRAISPD---------IDESEADGPPAAGEMKLMKDSTIPMFCSSPDVNSKADNFIARFR
        + SP   I PPPPPPPPFR       + GDF  + SN S+ R  SP+         ++ +++DG         +    +P FC SPDV++KADNFIAR R
Subjt:  E-SPFHPIPPPPPPPPPFR-------MHGDFDSVGSNSSTPRAISPD---------IDESEADGPPAAGEMKLMKDSTIPMFCSSPDVNSKADNFIARFR

Query:  ADLKLQKMNSIKEK
         + +L K+NS+  K
Subjt:  ADLKLQKMNSIKEK
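The % identity values listed for each hit can in principle be approximated from the unlabelled middle line of an alignment block. The sketch below assumes the usual BLAST-style pairwise convention (a letter marks an identical residue, `+` a conservative substitution, a space a mismatch or gap) and that the middle line starts at the same column as the query residues. `identity_from_alignment` is a hypothetical helper, and the input is a toy example, not data from this record.

```python
def identity_from_alignment(query, midline):
    """Return (identities, positives, percent identity) for one alignment block.

    query   -- the residues of the Query line (gaps '-' included)
    midline -- the unlabelled middle line, starting at the same column
    """
    # Trailing spaces are often lost when text is copied, so pad them back.
    midline = midline.ljust(len(query))
    identities = sum(1 for ch in midline if ch.isalpha())
    positives = identities + midline.count("+")
    percent = round(100.0 * identities / len(query), 2) if query else 0.0
    return identities, positives, percent

# Toy block for illustration only.
print(identity_from_alignment("MEEDG", "ME+D"))
```

To approximate the figure for a whole hit, the identity counts and block widths would need to be summed over all blocks of that hit before dividing.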


Sequences
CDS sequence
ATGGAGGAAGACGGCAACGCGCCGCCGCCGTTCTGGCTTCAATCCTCTACCTCCATGGACCAAGTCGACTACAATCGCCGCCGTCGCCTCAGCCGCGCATCGTCGTTCCT
CCTCAACTCCAGCGCCTTTCTCATTGTTTTGTTAGTAATAGTTCTCTGTTTCATCTTGATTGTGATTCCTAAATTTGTACAGTTCGCTTCTCAATTGATTCGGCCTCAAT
CGGTCAAGAAGAGCTGGGATTCCCTCAATTTGGTTCTTGTTCTCTTCGCCATTGTTTGTGGATTTCTCAGTAGAAACACTGGTGATGATAATAGAGGCTCTTTTGAAGAT
CGGAGCGTTTCTTCGAGGCGGAGAATGAAGTCAAACCCTACGACTCCGCGCCGATGGGATGGATATTCCGATCATCGGCCGAATCATTTCACCGTCAATCGGATGAGGAG
TAGTAGTTCGTATCCCGATCTACGTCTTCAGGAGTCTTCATTGGATGCCGGGGATGAACGGTGGCGATTTTACGATGATACTCATGTGCATAATCATCGGTTTGCGTCCT
CCGATCAGCTTCATCACCGTCGTGAAGCTCGGCCGGAGCTTGAACGCGAAGATTCTGGTGCCAAAAGTATAGGTTTCGACAGATCTGAGATTCGTGAAGATGTATATTCA
CAACCGGCGATACCTTCTCCCCCGCGATCGCCGCCGCCGCGGGTGTCTCCTCCGCGATCTCCATCACCGCCTCCTACGCCTCCGCCTCCTGCTAATACGACTCCTAAAGT
GGTTAAACGAAGGCCAAAGAGAACCCATAAGGTCCATAGCCATACGCCCGATGGAGCAATCGATCAACAGCAGAAGAATGACGATTCGGACGTAGCCGATTTTCGACGGA
TTCAGCTTCCACCACTCTCGCCGCCGTCATTTTATCGGGAATCGGAGCAGAAGAGCGGCAAAAACGAGAAGAAGAGAGGTGGCGCTCCAAAAGAAATTTGGTCCGCACTG
AGGAGGAGGAAGAAGAAGCAAAGACAAAAGAGCGTCGAAAGCTTCGAGGCTATCATCGCCTCCCAAAACGCTTCAACATCGTCATTACCACCGCCGTCACCACCGCCGCC
TCCGCCGCTCCCGCCGCCGTCAGTTCTGCAAAATCTATTTTCATCCAAGAAAGGAAAAGCAAAAAAGGTACAGTCCACACCTCCACCATCAATAGTCTCCTCAGAACCTA
AACCAGAGATCGAAGATCAAAATCACCTCCTCAAACCTCACGATCCTCCAATGGAGCTTGAGAGACTGAGCAGTTTAAACGACGAAGAGTACAATACGCGCATTGGCGGT
GAGTCGCCATTTCATCCGATTCCTCCGCCACCACCGCCGCCGCCGCCGTTCAGAATGCATGGAGACTTCGACAGTGTAGGAAGCAACAGCAGTACACCAAGAGCCATCTC
GCCGGACATTGACGAGAGTGAAGCCGATGGACCGCCCGCGGCCGGCGAAATGAAACTCATGAAAGATTCAACAATTCCGATGTTCTGTTCAAGCCCAGATGTTAACAGTA
AAGCCGATAATTTCATTGCAAGATTCAGAGCCGATTTGAAGTTGCAGAAGATGAATTCCATCAAAGAGAAGACGGCGAGGAAGAGATCTAACCTAGGCCGAACACCAGGC
CCAGGCCCAAAGTAA
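As a consistency check, the CDS can be translated and compared with the protein sequence given in this record. The sketch below assumes the standard genetic code and builds the codon table from the conventional TCAG ordering; `translate` is a hypothetical helper written for illustration, and only two short fragments of the CDS are used here.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = dict(
    zip(("".join(codon) for codon in product(BASES, repeat=3)), AMINO_ACIDS)
)

def translate(cds):
    """Translate an in-frame DNA coding sequence; '*' marks a stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))

print(translate("ATGGAGGAAGACGGCAACGCG"))  # first seven codons of the CDS
print(translate("CCAGGCCCAAAGTAA"))        # last five codons, including the stop
```

The two fragments translate to `MEEDGNA` and `PGPK*`, which agree with the start and end of the protein sequence in this record.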
mRNA sequence
CAAGCAAGAAAAAAAAAAAAAGAAACGCAAAAGCGATCTCATAATACGGCACAACAAAAGCAAACTCAAAGGCAAAGCTAAGAAAGAGAGAAAAAAGACCCATCATCTCG
CTTTCTGTAAAGGAAAAAAGCCCAACCAAACCGAGTTCACCGGCGAAAATGGAGGAAGACGGCAACGCGCCGCCGCCGTTCTGGCTTCAATCCTCTACCTCCATGGACCA
AGTCGACTACAATCGCCGCCGTCGCCTCAGCCGCGCATCGTCGTTCCTCCTCAACTCCAGCGCCTTTCTCATTGTTTTGTTAGTAATAGTTCTCTGTTTCATCTTGATTG
TGATTCCTAAATTTGTACAGTTCGCTTCTCAATTGATTCGGCCTCAATCGGTCAAGAAGAGCTGGGATTCCCTCAATTTGGTTCTTGTTCTCTTCGCCATTGTTTGTGGA
TTTCTCAGTAGAAACACTGGTGATGATAATAGAGGCTCTTTTGAAGATCGGAGCGTTTCTTCGAGGCGGAGAATGAAGTCAAACCCTACGACTCCGCGCCGATGGGATGG
ATATTCCGATCATCGGCCGAATCATTTCACCGTCAATCGGATGAGGAGTAGTAGTTCGTATCCCGATCTACGTCTTCAGGAGTCTTCATTGGATGCCGGGGATGAACGGT
GGCGATTTTACGATGATACTCATGTGCATAATCATCGGTTTGCGTCCTCCGATCAGCTTCATCACCGTCGTGAAGCTCGGCCGGAGCTTGAACGCGAAGATTCTGGTGCC
AAAAGTATAGGTTTCGACAGATCTGAGATTCGTGAAGATGTATATTCACAACCGGCGATACCTTCTCCCCCGCGATCGCCGCCGCCGCGGGTGTCTCCTCCGCGATCTCC
ATCACCGCCTCCTACGCCTCCGCCTCCTGCTAATACGACTCCTAAAGTGGTTAAACGAAGGCCAAAGAGAACCCATAAGGTCCATAGCCATACGCCCGATGGAGCAATCG
ATCAACAGCAGAAGAATGACGATTCGGACGTAGCCGATTTTCGACGGATTCAGCTTCCACCACTCTCGCCGCCGTCATTTTATCGGGAATCGGAGCAGAAGAGCGGCAAA
AACGAGAAGAAGAGAGGTGGCGCTCCAAAAGAAATTTGGTCCGCACTGAGGAGGAGGAAGAAGAAGCAAAGACAAAAGAGCGTCGAAAGCTTCGAGGCTATCATCGCCTC
CCAAAACGCTTCAACATCGTCATTACCACCGCCGTCACCACCGCCGCCTCCGCCGCTCCCGCCGCCGTCAGTTCTGCAAAATCTATTTTCATCCAAGAAAGGAAAAGCAA
AAAAGGTACAGTCCACACCTCCACCATCAATAGTCTCCTCAGAACCTAAACCAGAGATCGAAGATCAAAATCACCTCCTCAAACCTCACGATCCTCCAATGGAGCTTGAG
AGACTGAGCAGTTTAAACGACGAAGAGTACAATACGCGCATTGGCGGTGAGTCGCCATTTCATCCGATTCCTCCGCCACCACCGCCGCCGCCGCCGTTCAGAATGCATGG
AGACTTCGACAGTGTAGGAAGCAACAGCAGTACACCAAGAGCCATCTCGCCGGACATTGACGAGAGTGAAGCCGATGGACCGCCCGCGGCCGGCGAAATGAAACTCATGA
AAGATTCAACAATTCCGATGTTCTGTTCAAGCCCAGATGTTAACAGTAAAGCCGATAATTTCATTGCAAGATTCAGAGCCGATTTGAAGTTGCAGAAGATGAATTCCATC
AAAGAGAAGACGGCGAGGAAGAGATCTAACCTAGGCCGAACACCAGGCCCAGGCCCAAAGTAAATCAAGATAAGGCTCAGCCCATATACACTTTTTTTTCAATAATGAAA
ATTAAAAAAAAAAAGAAAAAAAAAATCTCAATTCTTTTGTTTGTTGTTATTTGTTGTTTTTTAAGAGCATGTTTTGTTTGAAGCTTTTTCGAATACCATGACATGATAGG
ACAAGTAGAAACATAGGCTGATATATATTTTAAGGTATTTTGGATATGAATTTTTTTTTGTTTTTTTTTTCTCTCAAATTATTTGCTTGGTGAGAAATGAAAGGATGCTA
TAATTTGAATA
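Because the mRNA contains the CDS as a contiguous stretch, the untranslated regions can be measured by locating the CDS inside the mRNA. This is a minimal sketch under that assumption; `utr_lengths` is a hypothetical helper, and the sequences in the example are toy strings rather than the full sequences from this record.

```python
def utr_lengths(mrna, cds):
    """Return (5' UTR length, 3' UTR length) by locating the CDS in the mRNA."""
    mrna = "".join(mrna.split()).upper()  # drop line breaks and spaces
    cds = "".join(cds.split()).upper()
    start = mrna.find(cds)
    if start < 0:
        raise ValueError("CDS not found in mRNA")
    return start, len(mrna) - start - len(cds)

# Toy sequences for illustration only.
print(utr_lengths("CAAGATGGAGTAAATC", "ATGGAGTAA"))
```

Passing the full mRNA and CDS text from this record (line breaks are ignored) would give the UTR lengths for Tan0008072.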
Protein sequence
MEEDGNAPPPFWLQSSTSMDQVDYNRRRRLSRASSFLLNSSAFLIVLLVIVLCFILIVIPKFVQFASQLIRPQSVKKSWDSLNLVLVLFAIVCGFLSRNTGDDNRGSFED
RSVSSRRRMKSNPTTPRRWDGYSDHRPNHFTVNRMRSSSSYPDLRLQESSLDAGDERWRFYDDTHVHNHRFASSDQLHHRREARPELEREDSGAKSIGFDRSEIREDVYS
QPAIPSPPRSPPPRVSPPRSPSPPPTPPPPANTTPKVVKRRPKRTHKVHSHTPDGAIDQQQKNDDSDVADFRRIQLPPLSPPSFYRESEQKSGKNEKKRGGAPKEIWSAL
RRRKKKQRQKSVESFEAIIASQNASTSSLPPPSPPPPPPLPPPSVLQNLFSSKKGKAKKVQSTPPPSIVSSEPKPEIEDQNHLLKPHDPPMELERLSSLNDEEYNTRIGG
ESPFHPIPPPPPPPPPFRMHGDFDSVGSNSSTPRAISPDIDESEADGPPAAGEMKLMKDSTIPMFCSSPDVNSKADNFIARFRADLKLQKMNSIKEKTARKRSNLGRTPG
PGPK