; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmaCh04G007380 (gene) of Cucurbita maxima (Rimu) v1.1 genome

Gene IDCmaCh04G007380
OrganismCucurbita maxima Rimu (Cucurbita maxima (Rimu) v1.1)
DescriptionMucin-2
Genome locationCma_Chr04:3751321..3753868
RNA-Seq ExpressionCmaCh04G007380
SyntenyCmaCh04G007380
Gene Ontology termsNA
InterPro domainsIPR040420 - Uncharacterized protein At1g76660-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6600562.1 hypothetical protein SDJN03_05795, partial [Cucurbita argyrosperma subsp. sororia]1.2e-24496.07Show/hide
Query:  MREMRRRADADADAD-ADLRPMNNTFQTITTAADVIATVDHRFPRDTAVQKRRWGSCWSIYWCFGSLRQRKRIGHAVLVPEPSPSPEAHQNSLQSPDIVL
        MR MRRRADADADAD ADLRPMNNTFQTIT AAD IATVDHRFPR TAVQKRRWGSCWSIYWCFGSL+QRKRIGHAVLVPEPSPSPEAHQNSLQSPD VL
Subjt:  MREMRRRADADADAD-ADLRPMNNTFQTITTAADVIATVDHRFPRDTAVQKRRWGSCWSIYWCFGSLRQRKRIGHAVLVPEPSPSPEAHQNSLQSPDIVL

Query:  PFAAPPSSPASFLQSEPPSATQSPSNILSFTSLTANMYSPDGPSSIFAIGPFAHETQLVSPPMNFSTLTTEPSTPSFTPPESIHLTTPSSPEVPFAQFLQ
        PFAAPPSSP SFLQSEPPS TQSPSNILSFTSLTANMYSPDGPSSIFAIGPFAHETQLVSPP+NFSTLTTEPSTPSFTPPESIHLTTPSSPEVPFAQFLQ
Subjt:  PFAAPPSSPASFLQSEPPSATQSPSNILSFTSLTANMYSPDGPSSIFAIGPFAHETQLVSPPMNFSTLTTEPSTPSFTPPESIHLTTPSSPEVPFAQFLQ

Query:  PNLPKAESDDQYSCPNDDFQSYQFYPGSPVSNLISPRSAISLSGASSPLPDLDFASSASQFSNFSLDVPPALLNLDRQGQSSDSCTQNSVGFKSNDDDFD
        P LPKAESDDQYSCPNDDFQSYQFYPGSPVSNLISPRSAISLSGASSPLPDLDFASSASQFS FSLDVPPALLNLDRQGQSSDSCTQNSVGFKSNDDD D
Subjt:  PNLPKAESDDQYSCPNDDFQSYQFYPGSPVSNLISPRSAISLSGASSPLPDLDFASSASQFSNFSLDVPPALLNLDRQGQSSDSCTQNSVGFKSNDDDFD

Query:  LNPRTSDSMNESQNIQILIDGSQMEEPDVTNHRFSFELSDEDSLLRNVESKPLESNVAVASSPMHETFETAKETSSGGGHSSNSIEEKAADGEEANQHQE
        LNPRTSDSMNESQNIQILIDGSQMEEPDV NHRFSFELSDEDSLLRN+ESKPLESNVAVASSPMHETFETAKETSSGGGHSSN IEEKAADGEEANQHQE
Subjt:  LNPRTSDSMNESQNIQILIDGSQMEEPDVTNHRFSFELSDEDSLLRNVESKPLESNVAVASSPMHETFETAKETSSGGGHSSNSIEEKAADGEEANQHQE

Query:  HHHSTTLGSVNEFNFDNGNGSNALKPNINSDWWANAKDAETKGTTTGAWSFFPMAQQR
        HHHSTTLGSVNEFNFDNGNGSNALKPNINSDWWANAKD ETKGTT GAWSFFPMAQQR
Subjt:  HHHSTTLGSVNEFNFDNGNGSNALKPNINSDWWANAKDAETKGTTTGAWSFFPMAQQR

XP_022941648.1 uncharacterized protein At1g76660-like isoform X1 [Cucurbita moschata]1.2e-24495.04Show/hide
Query:  MREMRRRADADADAD-------ADLRPMNNTFQTITTAADVIATVDHRFPRDTAVQKRRWGSCWSIYWCFGSLRQRKRIGHAVLVPEPSPSPEAHQNSLQ
        MR MRRRADADADAD       ADLRPMNNTFQTIT AAD IATVDHRFPR TAVQKRRWGSCWSIYWCFGSL+QRKRIGHAVLVPEPSPSPEAHQNSLQ
Subjt:  MREMRRRADADADAD-------ADLRPMNNTFQTITTAADVIATVDHRFPRDTAVQKRRWGSCWSIYWCFGSLRQRKRIGHAVLVPEPSPSPEAHQNSLQ

Query:  SPDIVLPFAAPPSSPASFLQSEPPSATQSPSNILSFTSLTANMYSPDGPSSIFAIGPFAHETQLVSPPMNFSTLTTEPSTPSFTPPESIHLTTPSSPEVP
        SPDIVLPFAAPPSSP SFLQSEPPSATQSPSNILSFTSLTANMYSPDGPSSIFAIGPFAHETQLVSPP+NFSTLTTEPSTPSFTPPESIHLTTPSSPEVP
Subjt:  SPDIVLPFAAPPSSPASFLQSEPPSATQSPSNILSFTSLTANMYSPDGPSSIFAIGPFAHETQLVSPPMNFSTLTTEPSTPSFTPPESIHLTTPSSPEVP

Query:  FAQFLQPNLPKAESDDQYSCPNDDFQSYQFYPGSPVSNLISPRSAISLSGASSPLPDLDFASSASQFSNFSLDVPPALLNLDRQGQSSDSCTQNSVGFKS
        FAQFLQP LPKAESDDQYSCPNDDFQSYQFYPGSPVSNLISPRSAISLSGASSP  DLDFASSASQFSNFSLDVPPALLNLDRQGQSSDSCTQNSVGFKS
Subjt:  FAQFLQPNLPKAESDDQYSCPNDDFQSYQFYPGSPVSNLISPRSAISLSGASSPLPDLDFASSASQFSNFSLDVPPALLNLDRQGQSSDSCTQNSVGFKS

Query:  NDDDFDLNPRTSDSMNESQNIQILIDGSQMEEPDVTNHRFSFELSDEDSLLRNVESKPLESNVAVASSPMHETFETAKETSSGGGHSSNSIEEKAADGEE
        NDDDFDL+PRTSDSMNESQNIQILIDGSQMEEPDV NHRFSFELSDEDSLLRN+ESKPLESNVAVASSPMHETFETAKETSSGGGHSSN IEEKAADGEE
Subjt:  NDDDFDLNPRTSDSMNESQNIQILIDGSQMEEPDVTNHRFSFELSDEDSLLRNVESKPLESNVAVASSPMHETFETAKETSSGGGHSSNSIEEKAADGEE

Query:  ANQHQEHHHSTTLGSVNEFNFDNGNGSNALKPNINSDWWANAKDAETKGTTTGAWSFFPMAQQR
        ANQHQEHHHSTTLGSVNEFNFDNGNGSNALKPNI+SDWWANAKD ETKGTTTGAWSFFPMAQQR
Subjt:  ANQHQEHHHSTTLGSVNEFNFDNGNGSNALKPNINSDWWANAKDAETKGTTTGAWSFFPMAQQR

XP_022980796.1 uncharacterized protein At1g76660-like [Cucurbita maxima]1.2e-255100Show/hide
Query:  MREMRRRADADADADADLRPMNNTFQTITTAADVIATVDHRFPRDTAVQKRRWGSCWSIYWCFGSLRQRKRIGHAVLVPEPSPSPEAHQNSLQSPDIVLP
        MREMRRRADADADADADLRPMNNTFQTITTAADVIATVDHRFPRDTAVQKRRWGSCWSIYWCFGSLRQRKRIGHAVLVPEPSPSPEAHQNSLQSPDIVLP
Subjt:  MREMRRRADADADADADLRPMNNTFQTITTAADVIATVDHRFPRDTAVQKRRWGSCWSIYWCFGSLRQRKRIGHAVLVPEPSPSPEAHQNSLQSPDIVLP

Query:  FAAPPSSPASFLQSEPPSATQSPSNILSFTSLTANMYSPDGPSSIFAIGPFAHETQLVSPPMNFSTLTTEPSTPSFTPPESIHLTTPSSPEVPFAQFLQP
        FAAPPSSPASFLQSEPPSATQSPSNILSFTSLTANMYSPDGPSSIFAIGPFAHETQLVSPPMNFSTLTTEPSTPSFTPPESIHLTTPSSPEVPFAQFLQP
Subjt:  FAAPPSSPASFLQSEPPSATQSPSNILSFTSLTANMYSPDGPSSIFAIGPFAHETQLVSPPMNFSTLTTEPSTPSFTPPESIHLTTPSSPEVPFAQFLQP

Query:  NLPKAESDDQYSCPNDDFQSYQFYPGSPVSNLISPRSAISLSGASSPLPDLDFASSASQFSNFSLDVPPALLNLDRQGQSSDSCTQNSVGFKSNDDDFDL
        NLPKAESDDQYSCPNDDFQSYQFYPGSPVSNLISPRSAISLSGASSPLPDLDFASSASQFSNFSLDVPPALLNLDRQGQSSDSCTQNSVGFKSNDDDFDL
Subjt:  NLPKAESDDQYSCPNDDFQSYQFYPGSPVSNLISPRSAISLSGASSPLPDLDFASSASQFSNFSLDVPPALLNLDRQGQSSDSCTQNSVGFKSNDDDFDL

Query:  NPRTSDSMNESQNIQILIDGSQMEEPDVTNHRFSFELSDEDSLLRNVESKPLESNVAVASSPMHETFETAKETSSGGGHSSNSIEEKAADGEEANQHQEH
        NPRTSDSMNESQNIQILIDGSQMEEPDVTNHRFSFELSDEDSLLRNVESKPLESNVAVASSPMHETFETAKETSSGGGHSSNSIEEKAADGEEANQHQEH
Subjt:  NPRTSDSMNESQNIQILIDGSQMEEPDVTNHRFSFELSDEDSLLRNVESKPLESNVAVASSPMHETFETAKETSSGGGHSSNSIEEKAADGEEANQHQEH

Query:  HHSTTLGSVNEFNFDNGNGSNALKPNINSDWWANAKDAETKGTTTGAWSFFPMAQQR
        HHSTTLGSVNEFNFDNGNGSNALKPNINSDWWANAKDAETKGTTTGAWSFFPMAQQR
Subjt:  HHSTTLGSVNEFNFDNGNGSNALKPNINSDWWANAKDAETKGTTTGAWSFFPMAQQR

XP_023522163.1 uncharacterized protein At1g76660-like [Cucurbita pepo subsp. pepo]4.9e-24696.94Show/hide
Query:  MREMRRRADADADADADLRPMNNTFQTITTAADVIATVDHRFPRDTAVQKRRWGSCWSIYWCFGSLRQRKRIGHAVLVPEPSPSPEAHQNSLQSPDIVLP
        MR MRRRADADA   ADLRPMNNTFQTIT AAD IATVDHRFPR TAVQKRRWGSCWSIYWCFGSL+QRKRIGHAVLVPEPSPSPEAHQNSLQSPDIVLP
Subjt:  MREMRRRADADADADADLRPMNNTFQTITTAADVIATVDHRFPRDTAVQKRRWGSCWSIYWCFGSLRQRKRIGHAVLVPEPSPSPEAHQNSLQSPDIVLP

Query:  FAAPPSSPASFLQSEPPSATQSPSNILSFTSLTANMYSPDGPSSIFAIGPFAHETQLVSPPMNFSTLTTEPSTPSFTPPESIHLTTPSSPEVPFAQFLQP
        FAAPPSSP SFLQSEPPSATQSPSNILSFTSLTANMYSPDGPSSIFAIGPFAHETQLVSPP+NFSTLTTEPSTPSFTPPESIHLTTPSSPEVPFAQFLQP
Subjt:  FAAPPSSPASFLQSEPPSATQSPSNILSFTSLTANMYSPDGPSSIFAIGPFAHETQLVSPPMNFSTLTTEPSTPSFTPPESIHLTTPSSPEVPFAQFLQP

Query:  NLPKAESDDQYSCPNDDFQSYQFYPGSPVSNLISPRSAISLSGASSPLPDLDFASSASQFSNFSLDVPPALLNLDRQGQSSDSCTQNSVGFKSNDDDFDL
         L KAESDDQYSCPNDDFQSYQFYPGSPVSNLISPRSAISLSGASSPLPDLDFASSASQFSNFSLDVPPALLNLDRQGQSSDSCTQNSVGFKSNDDDFDL
Subjt:  NLPKAESDDQYSCPNDDFQSYQFYPGSPVSNLISPRSAISLSGASSPLPDLDFASSASQFSNFSLDVPPALLNLDRQGQSSDSCTQNSVGFKSNDDDFDL

Query:  NPRTSDSMNESQNIQILIDGSQMEEPDVTNHRFSFELSDEDSLLRNVESKPLESNVAVASSPMHETFETAKETSSGGGHSSNSIEEKAADGEEANQHQEH
        NPRTSDSMNESQNIQILIDGSQMEEPDVTNHRFSFELSDEDSLLRNVESKPLESNVAVASSPMHETFETAKETSSGGGHSSN IEEKAADGEEANQHQEH
Subjt:  NPRTSDSMNESQNIQILIDGSQMEEPDVTNHRFSFELSDEDSLLRNVESKPLESNVAVASSPMHETFETAKETSSGGGHSSNSIEEKAADGEEANQHQEH

Query:  HHSTTLGSVNEFNFDNGNGSNALKPNINSDWWANAKDAETKGTTTGAWSFFPMAQQR
        HHSTTLGSVNEFNFDNGNGSNALKPNINSDWWANAKD ETKGTTTGAWSFFPMAQQR
Subjt:  HHSTTLGSVNEFNFDNGNGSNALKPNINSDWWANAKDAETKGTTTGAWSFFPMAQQR

XP_023529207.1 uncharacterized protein At1g76660-like isoform X1 [Cucurbita pepo subsp. pepo]2.6e-24797.37Show/hide
Query:  MREMRRRADADADADADLRPMNNTFQTITTAADVIATVDHRFPRDTAVQKRRWGSCWSIYWCFGSLRQRKRIGHAVLVPEPSPSPEAHQNSLQSPDIVLP
        MR MRRRADADADA ADLRPMNNTFQTIT AAD IATVDHRFPR TAVQKRRWGSCWSIYWCFGSL+QRKRIGHAVLVPEPSPSPEAHQNSLQSPDIVLP
Subjt:  MREMRRRADADADADADLRPMNNTFQTITTAADVIATVDHRFPRDTAVQKRRWGSCWSIYWCFGSLRQRKRIGHAVLVPEPSPSPEAHQNSLQSPDIVLP

Query:  FAAPPSSPASFLQSEPPSATQSPSNILSFTSLTANMYSPDGPSSIFAIGPFAHETQLVSPPMNFSTLTTEPSTPSFTPPESIHLTTPSSPEVPFAQFLQP
        FAAPPSSP SFLQSEPPSATQSPSNILSFTSLTANMYSPDGPSSIFAIGPFAHETQLVSPP+NFSTLTTEPSTPSFTPPESIHLTTPSSPEVPFAQFLQP
Subjt:  FAAPPSSPASFLQSEPPSATQSPSNILSFTSLTANMYSPDGPSSIFAIGPFAHETQLVSPPMNFSTLTTEPSTPSFTPPESIHLTTPSSPEVPFAQFLQP

Query:  NLPKAESDDQYSCPNDDFQSYQFYPGSPVSNLISPRSAISLSGASSPLPDLDFASSASQFSNFSLDVPPALLNLDRQGQSSDSCTQNSVGFKSNDDDFDL
         L KAESDDQYSCPNDDFQSYQFYPGSPVSNLISPRSAISLSGASSPLPDLDFASSASQFSNFSLDVPPALLNLDRQGQSSDSCTQNSVGFKSNDDDFDL
Subjt:  NLPKAESDDQYSCPNDDFQSYQFYPGSPVSNLISPRSAISLSGASSPLPDLDFASSASQFSNFSLDVPPALLNLDRQGQSSDSCTQNSVGFKSNDDDFDL

Query:  NPRTSDSMNESQNIQILIDGSQMEEPDVTNHRFSFELSDEDSLLRNVESKPLESNVAVASSPMHETFETAKETSSGGGHSSNSIEEKAADGEEANQHQEH
        NPRTSDSMNESQNIQILIDGSQMEEPDVTNHRFSFELSDEDSLLRNVESKPLESNVAVASSPMHETFETAKETSSGGGHSSN IEEKAADGEEANQHQEH
Subjt:  NPRTSDSMNESQNIQILIDGSQMEEPDVTNHRFSFELSDEDSLLRNVESKPLESNVAVASSPMHETFETAKETSSGGGHSSNSIEEKAADGEEANQHQEH

Query:  HHSTTLGSVNEFNFDNGNGSNALKPNINSDWWANAKDAETKGTTTGAWSFFPMAQQR
        HHSTTLGSVNEFNFDNGNGSNALKPNINSDWWANAKD ETKGTTTGAWSFFPMAQQR
Subjt:  HHSTTLGSVNEFNFDNGNGSNALKPNINSDWWANAKDAETKGTTTGAWSFFPMAQQR

TrEMBL top hitse value%identityAlignment
A0A5D3CYQ2 Mucin-22.1e-18174.95Show/hide
Query:  MRRRADADADADADLRPMNNTFQTITTAADVIATVDHRFPRDTAVQKRRWGSCWSIYWCFGSLRQRKRIGHAVLVPEPSPSPEAHQNSLQSPDIVLPFAA
        MRRR D D     D RP+NNTFQTIT AAD IATVDHRFPR TAVQKRRWGSC SIYWCFGSL+QRKRIGHAVLVPEPSPS E H+N+LQSPDIVLPFAA
Subjt:  MRRRADADADADADLRPMNNTFQTITTAADVIATVDHRFPRDTAVQKRRWGSCWSIYWCFGSLRQRKRIGHAVLVPEPSPSPEAHQNSLQSPDIVLPFAA

Query:  PPSSPASFLQSEPPSATQSPSNILSFTSLTANMYSPDGPSSIFAIGPFAHETQLVSPPMNFSTLTTEPSTPSFTPPESIHLTTPSSPEVPFAQFLQPNLP
        PPSSP S LQSEPPSA QSP+ ++SFTSLTANMYSPDGPSSIFAIGPFAHE QLVSPP+NFSTLTTEPSTP FTPPESIHLTTPSSPEVPFAQF+ P+L 
Subjt:  PPSSPASFLQSEPPSATQSPSNILSFTSLTANMYSPDGPSSIFAIGPFAHETQLVSPPMNFSTLTTEPSTPSFTPPESIHLTTPSSPEVPFAQFLQPNLP

Query:  KAESDDQYSCPNDDFQSYQFYPGSPVSNLISPRSAISLSGASSPLPDLDFASSASQFSNFSLDVPPALLNLD-------RQGQSSDSCTQNSVGFKSNDD
        K ESD+QY+ PNDDFQSYQFYPGSPVS+LISPRS IS SGASSPLPD DFAS  SQF NF L+VPP L NLD       RQ QS+DSCTQ+S+ FKS+ +
Subjt:  KAESDDQYSCPNDDFQSYQFYPGSPVSNLISPRSAISLSGASSPLPDLDFASSASQFSNFSLDVPPALLNLD-------RQGQSSDSCTQNSVGFKSNDD

Query:  DFDLNPRTSDSM------NESQNIQILID--GSQMEEPDVTNHRFSFELSDEDSLLRNVESKPLESN-VAVASSPMHETFETAKETSSGGGHSSNSIEEK
        DF LNP TS+SM      NESQNIQILID    + EEP  TNHRFSFELSD D L ++V SKPLESN + V SSP+HE FET KE S  G H+SN IEEK
Subjt:  DFDLNPRTSDSM------NESQNIQILID--GSQMEEPDVTNHRFSFELSDEDSLLRNVESKPLESN-VAVASSPMHETFETAKETSSGGGHSSNSIEEK

Query:  A-ADGEEANQHQEHHHSTTLGSVNEFNFDNGNGSNALKPNINSDWWANAKDAETKGTTTGAWSFFPMAQQR
          ADG+EA+QHQE HHS  LGSV EFNFDN NGS+   P INSDWW NAKD  T+GTTTGAWSFFP  QQR
Subjt:  A-ADGEEANQHQEHHHSTTLGSVNEFNFDNGNGSNALKPNINSDWWANAKDAETKGTTTGAWSFFPMAQQR

A0A6J1C828 uncharacterized protein At1g76660-like8.6e-18876.81Show/hide
Query:  MRRRADADADADADLRPMNNTFQTITTAADVIATVDHRFPRDTAVQKRRWGSCWSIYWCFGSLRQRKRIGHAVLVPEPSPSPEAHQNSLQSPDIVLPFAA
        MRRR   DADADADL P+NNTFQTIT AAD IATVDHRFPR TAVQKRRWGSCWSIYWCFGSL+QRKRIGHAVLVPEPSPS E  +N+LQSPDIVLPFAA
Subjt:  MRRRADADADADADLRPMNNTFQTITTAADVIATVDHRFPRDTAVQKRRWGSCWSIYWCFGSLRQRKRIGHAVLVPEPSPSPEAHQNSLQSPDIVLPFAA

Query:  PPSSPASFLQSEPPSATQSPSNILSFTSLTANMYSPDGPSSIFAIGPFAHETQLVSPPMNFSTLTTEPSTPSFTPPESIHLTTPSSPEVPFAQFLQPNLP
        PPSSP SFLQSEPPSATQSP+ ILSFTSLTANMYSPDGPSSIFA+GPFAHETQLVSPP+NFST+TT+PST  FTPPESIHLTTPSSPEVPFAQ+LQP+  
Subjt:  PPSSPASFLQSEPPSATQSPSNILSFTSLTANMYSPDGPSSIFAIGPFAHETQLVSPPMNFSTLTTEPSTPSFTPPESIHLTTPSSPEVPFAQFLQPNLP

Query:  KAESDDQY-SCPNDDFQSYQFYPGSPVSNLISPRSAISLSGASSPLPDLDFASSASQFSNFSLDVPPALLNLD-------RQGQSSDSCTQNSVGFKSND
        K ESD QY   PNDDFQSYQFYPGSPVS+LISPRS IS SGASSPLPD DF  S S FSNF ++VPP LLNLD       R  QSSDSCTQNSVG+KS+ 
Subjt:  KAESDDQY-SCPNDDFQSYQFYPGSPVSNLISPRSAISLSGASSPLPDLDFASSASQFSNFSLDVPPALLNLD-------RQGQSSDSCTQNSVGFKSND

Query:  DDFDLNPRTSDSM------NESQNIQILIDGSQMEEPDVTNHRFSFELSDEDSLLRNVESKPLESN-VAVASSPMHETFETAKETSSGGGHSSNSIEE-K
        +DF LNP+TS+S+      NE  NIQIL DGSQ +E    NHRFSFELSDED+LL++VE+KPLESN +AVASSP+HE  ETAKETS  GGH+SN  EE +
Subjt:  DDFDLNPRTSDSM------NESQNIQILIDGSQMEEPDVTNHRFSFELSDEDSLLRNVESKPLESN-VAVASSPMHETFETAKETSSGGGHSSNSIEE-K

Query:  AADGEEANQHQE-HHHSTTLGSVNEFNFDNGNGSNALKPNINSDWWANAKDAETKGTTTGAWSFFPMAQQ
         ADGEE + HQE  HHS TLG+V EFNFDNGNG + LKPNINS WWAN KDAET+GTTTGAWSFFP+ QQ
Subjt:  AADGEEANQHQE-HHHSTTLGSVNEFNFDNGNGSNALKPNINSDWWANAKDAETKGTTTGAWSFFPMAQQ

A0A6J1FP20 uncharacterized protein At1g76660-like isoform X28.5e-24495.84Show/hide
Query:  MREMRRRADADADADADLRPMNNTFQTITTAADVIATVDHRFPRDTAVQKRRWGSCWSIYWCFGSLRQRKRIGHAVLVPEPSPSPEAHQNSLQSPDIVLP
        MR MRRRADADA   ADLRPMNNTFQTIT AAD IATVDHRFPR TAVQKRRWGSCWSIYWCFGSL+QRKRIGHAVLVPEPSPSPEAHQNSLQSPDIVLP
Subjt:  MREMRRRADADADADADLRPMNNTFQTITTAADVIATVDHRFPRDTAVQKRRWGSCWSIYWCFGSLRQRKRIGHAVLVPEPSPSPEAHQNSLQSPDIVLP

Query:  FAAPPSSPASFLQSEPPSATQSPSNILSFTSLTANMYSPDGPSSIFAIGPFAHETQLVSPPMNFSTLTTEPSTPSFTPPESIHLTTPSSPEVPFAQFLQP
        FAAPPSSP SFLQSEPPSATQSPSNILSFTSLTANMYSPDGPSSIFAIGPFAHETQLVSPP+NFSTLTTEPSTPSFTPPESIHLTTPSSPEVPFAQFLQP
Subjt:  FAAPPSSPASFLQSEPPSATQSPSNILSFTSLTANMYSPDGPSSIFAIGPFAHETQLVSPPMNFSTLTTEPSTPSFTPPESIHLTTPSSPEVPFAQFLQP

Query:  NLPKAESDDQYSCPNDDFQSYQFYPGSPVSNLISPRSAISLSGASSPLPDLDFASSASQFSNFSLDVPPALLNLDRQGQSSDSCTQNSVGFKSNDDDFDL
         LPKAESDDQYSCPNDDFQSYQFYPGSPVSNLISPRSAISLSGASSP  DLDFASSASQFSNFSLDVPPALLNLDRQGQSSDSCTQNSVGFKSNDDDFDL
Subjt:  NLPKAESDDQYSCPNDDFQSYQFYPGSPVSNLISPRSAISLSGASSPLPDLDFASSASQFSNFSLDVPPALLNLDRQGQSSDSCTQNSVGFKSNDDDFDL

Query:  NPRTSDSMNESQNIQILIDGSQMEEPDVTNHRFSFELSDEDSLLRNVESKPLESNVAVASSPMHETFETAKETSSGGGHSSNSIEEKAADGEEANQHQEH
        +PRTSDSMNESQNIQILIDGSQMEEPDV NHRFSFELSDEDSLLRN+ESKPLESNVAVASSPMHETFETAKETSSGGGHSSN IEEKAADGEEANQHQEH
Subjt:  NPRTSDSMNESQNIQILIDGSQMEEPDVTNHRFSFELSDEDSLLRNVESKPLESNVAVASSPMHETFETAKETSSGGGHSSNSIEEKAADGEEANQHQEH

Query:  HHSTTLGSVNEFNFDNGNGSNALKPNINSDWWANAKDAETKGTTTGAWSFFPMAQQR
        HHSTTLGSVNEFNFDNGNGSNALKPNI+SDWWANAKD ETKGTTTGAWSFFPMAQQR
Subjt:  HHSTTLGSVNEFNFDNGNGSNALKPNINSDWWANAKDAETKGTTTGAWSFFPMAQQR

A0A6J1FSP7 uncharacterized protein At1g76660-like isoform X15.9e-24595.04Show/hide
Query:  MREMRRRADADADAD-------ADLRPMNNTFQTITTAADVIATVDHRFPRDTAVQKRRWGSCWSIYWCFGSLRQRKRIGHAVLVPEPSPSPEAHQNSLQ
        MR MRRRADADADAD       ADLRPMNNTFQTIT AAD IATVDHRFPR TAVQKRRWGSCWSIYWCFGSL+QRKRIGHAVLVPEPSPSPEAHQNSLQ
Subjt:  MREMRRRADADADAD-------ADLRPMNNTFQTITTAADVIATVDHRFPRDTAVQKRRWGSCWSIYWCFGSLRQRKRIGHAVLVPEPSPSPEAHQNSLQ

Query:  SPDIVLPFAAPPSSPASFLQSEPPSATQSPSNILSFTSLTANMYSPDGPSSIFAIGPFAHETQLVSPPMNFSTLTTEPSTPSFTPPESIHLTTPSSPEVP
        SPDIVLPFAAPPSSP SFLQSEPPSATQSPSNILSFTSLTANMYSPDGPSSIFAIGPFAHETQLVSPP+NFSTLTTEPSTPSFTPPESIHLTTPSSPEVP
Subjt:  SPDIVLPFAAPPSSPASFLQSEPPSATQSPSNILSFTSLTANMYSPDGPSSIFAIGPFAHETQLVSPPMNFSTLTTEPSTPSFTPPESIHLTTPSSPEVP

Query:  FAQFLQPNLPKAESDDQYSCPNDDFQSYQFYPGSPVSNLISPRSAISLSGASSPLPDLDFASSASQFSNFSLDVPPALLNLDRQGQSSDSCTQNSVGFKS
        FAQFLQP LPKAESDDQYSCPNDDFQSYQFYPGSPVSNLISPRSAISLSGASSP  DLDFASSASQFSNFSLDVPPALLNLDRQGQSSDSCTQNSVGFKS
Subjt:  FAQFLQPNLPKAESDDQYSCPNDDFQSYQFYPGSPVSNLISPRSAISLSGASSPLPDLDFASSASQFSNFSLDVPPALLNLDRQGQSSDSCTQNSVGFKS

Query:  NDDDFDLNPRTSDSMNESQNIQILIDGSQMEEPDVTNHRFSFELSDEDSLLRNVESKPLESNVAVASSPMHETFETAKETSSGGGHSSNSIEEKAADGEE
        NDDDFDL+PRTSDSMNESQNIQILIDGSQMEEPDV NHRFSFELSDEDSLLRN+ESKPLESNVAVASSPMHETFETAKETSSGGGHSSN IEEKAADGEE
Subjt:  NDDDFDLNPRTSDSMNESQNIQILIDGSQMEEPDVTNHRFSFELSDEDSLLRNVESKPLESNVAVASSPMHETFETAKETSSGGGHSSNSIEEKAADGEE

Query:  ANQHQEHHHSTTLGSVNEFNFDNGNGSNALKPNINSDWWANAKDAETKGTTTGAWSFFPMAQQR
        ANQHQEHHHSTTLGSVNEFNFDNGNGSNALKPNI+SDWWANAKD ETKGTTTGAWSFFPMAQQR
Subjt:  ANQHQEHHHSTTLGSVNEFNFDNGNGSNALKPNINSDWWANAKDAETKGTTTGAWSFFPMAQQR

A0A6J1IUL0 uncharacterized protein At1g76660-like5.7e-256100Show/hide
Query:  MREMRRRADADADADADLRPMNNTFQTITTAADVIATVDHRFPRDTAVQKRRWGSCWSIYWCFGSLRQRKRIGHAVLVPEPSPSPEAHQNSLQSPDIVLP
        MREMRRRADADADADADLRPMNNTFQTITTAADVIATVDHRFPRDTAVQKRRWGSCWSIYWCFGSLRQRKRIGHAVLVPEPSPSPEAHQNSLQSPDIVLP
Subjt:  MREMRRRADADADADADLRPMNNTFQTITTAADVIATVDHRFPRDTAVQKRRWGSCWSIYWCFGSLRQRKRIGHAVLVPEPSPSPEAHQNSLQSPDIVLP

Query:  FAAPPSSPASFLQSEPPSATQSPSNILSFTSLTANMYSPDGPSSIFAIGPFAHETQLVSPPMNFSTLTTEPSTPSFTPPESIHLTTPSSPEVPFAQFLQP
        FAAPPSSPASFLQSEPPSATQSPSNILSFTSLTANMYSPDGPSSIFAIGPFAHETQLVSPPMNFSTLTTEPSTPSFTPPESIHLTTPSSPEVPFAQFLQP
Subjt:  FAAPPSSPASFLQSEPPSATQSPSNILSFTSLTANMYSPDGPSSIFAIGPFAHETQLVSPPMNFSTLTTEPSTPSFTPPESIHLTTPSSPEVPFAQFLQP

Query:  NLPKAESDDQYSCPNDDFQSYQFYPGSPVSNLISPRSAISLSGASSPLPDLDFASSASQFSNFSLDVPPALLNLDRQGQSSDSCTQNSVGFKSNDDDFDL
        NLPKAESDDQYSCPNDDFQSYQFYPGSPVSNLISPRSAISLSGASSPLPDLDFASSASQFSNFSLDVPPALLNLDRQGQSSDSCTQNSVGFKSNDDDFDL
Subjt:  NLPKAESDDQYSCPNDDFQSYQFYPGSPVSNLISPRSAISLSGASSPLPDLDFASSASQFSNFSLDVPPALLNLDRQGQSSDSCTQNSVGFKSNDDDFDL

Query:  NPRTSDSMNESQNIQILIDGSQMEEPDVTNHRFSFELSDEDSLLRNVESKPLESNVAVASSPMHETFETAKETSSGGGHSSNSIEEKAADGEEANQHQEH
        NPRTSDSMNESQNIQILIDGSQMEEPDVTNHRFSFELSDEDSLLRNVESKPLESNVAVASSPMHETFETAKETSSGGGHSSNSIEEKAADGEEANQHQEH
Subjt:  NPRTSDSMNESQNIQILIDGSQMEEPDVTNHRFSFELSDEDSLLRNVESKPLESNVAVASSPMHETFETAKETSSGGGHSSNSIEEKAADGEEANQHQEH

Query:  HHSTTLGSVNEFNFDNGNGSNALKPNINSDWWANAKDAETKGTTTGAWSFFPMAQQR
        HHSTTLGSVNEFNFDNGNGSNALKPNINSDWWANAKDAETKGTTTGAWSFFPMAQQR
Subjt:  HHSTTLGSVNEFNFDNGNGSNALKPNINSDWWANAKDAETKGTTTGAWSFFPMAQQR

SwissProt top hitse value%identityAlignment
Q9SRE5 Uncharacterized protein At1g766609.9e-3240.51Show/hide
Query:  QKRRWGSCWSIYWCFGSLRQRKRIGHAVLVPE-----PSPSPEAHQ----NSLQSPDIVLPFAAPPSSPASFLQSEPPSATQSPSNILSFTSLTANMYSP
        Q++RWG C  ++ CF S +  KRI  A  +PE      S    AHQ    N+  +  I L   APPSSPASF  S  PS TQSP+    + SL AN  SP
Subjt:  QKRRWGSCWSIYWCFGSLRQRKRIGHAVLVPE-----PSPSPEAHQ----NSLQSPDIVLPFAAPPSSPASFLQSEPPSATQSPSNILSFTSLTANMYSP

Query:  DGP-SSIFAIGPFAHETQLVSPPMNFSTLTTEPSTPSFT-PPESIHLTTPSSPEVPFAQFLQPNLPKAESDDQYSCPNDDFQSYQFYPGSPVSNLISPRS
         GP SS++A GP+AHETQLVSPP+ FST TTEPST  FT PPE   LT PSSP+VP+A+FL  ++    S   +   ND   +Y  YPGSP S L SP S
Subjt:  DGP-SSIFAIGPFAHETQLVSPPMNFSTLTTEPSTPSFT-PPESIHLTTPSSPEVPFAQFLQPNLPKAESDDQYSCPNDDFQSYQFYPGSPVSNLISPRS

Query:  AISLSGASSPLP--------------DLDFASSASQFSNFSLDVPPALLNLDRQGQSSDSCTQNSVGFKSNDDDFDLNPRTSDSMNESQNIQILIDGSQM
          S  G  SP                D +  S+  Q SNF      A   LD      D     + G  S   D D+ P T+   N +QN Q       M
Subjt:  AISLSGASSPLP--------------DLDFASSASQFSNFSLDVPPALLNLDRQGQSSDSCTQNSVGFKSNDDDFDLNPRTSDSMNESQNIQILIDGSQM

Query:  EEPDVTNHRFSFELSD
        EE +     F F   +
Subjt:  EEPDVTNHRFSFELSD

Arabidopsis top hitse value%identityAlignment
AT1G63720.1 BEST Arabidopsis thaliana protein match is: hydroxyproline-rich glycoprotein family protein (TAIR:AT5G52430.1)9.8e-5147.42Show/hide
Query:  NNTFQTITTAADVIATVDHRFPRDTAV-QKRRWGSCWSIYWCFGSLRQRKRIGHAVLVPEP----SPSPEAHQNSLQSPDIVLPFAAPPSSPASFLQSEP
        NN F TI  AA  IA+ D R  + + + +KR+W + WS+  CFGS RQRKRIG++VLVPEP    S +     +  +S    LPF APPSSPASF QSEP
Subjt:  NNTFQTITTAADVIATVDHRFPRDTAV-QKRRWGSCWSIYWCFGSLRQRKRIGHAVLVPEP----SPSPEAHQNSLQSPDIVLPFAAPPSSPASFLQSEP

Query:  PSATQSPSNILSFTSLTANMYSPDGPSSIFAIGPFAHETQLVSPPMNFSTLTTEPSTPSFTPP---ESIHL--TTPSSPEVPFAQFLQPNLPKAESDDQY
        PSATQSP  ILSF+ L  N        SIFAIGP+AHETQLVSPP+ FST TTEPS+   TPP    SI+L  TTPSSPEVPFAQ    N        ++
Subjt:  PSATQSPSNILSFTSLTANMYSPDGPSSIFAIGPFAHETQLVSPPMNFSTLTTEPSTPSFTPP---ESIHL--TTPSSPEVPFAQFLQPNLPKAESDDQY

Query:  SCPND-DFQSYQFYPGSPVSNLISPRSAISLSGASSPLPDLDFASSASQFSNFSLDVPPALLNLDRQGQSSDSCTQNSVGFKSNDDDFDLN
           +  +FQ YQ  PGSP+  LISP      SG +SP PD       S F +F +  PP LL+    G ++  C +  +        FDL+
Subjt:  SCPND-DFQSYQFYPGSPVSNLISPRSAISLSGASSPLPDLDFASSASQFSNFSLDVPPALLNLDRQGQSSDSCTQNSVGFKSNDDDFDLN

AT1G76660.1 FUNCTIONS IN: molecular_function unknown7.1e-3340.51Show/hide
Query:  QKRRWGSCWSIYWCFGSLRQRKRIGHAVLVPE-----PSPSPEAHQ----NSLQSPDIVLPFAAPPSSPASFLQSEPPSATQSPSNILSFTSLTANMYSP
        Q++RWG C  ++ CF S +  KRI  A  +PE      S    AHQ    N+  +  I L   APPSSPASF  S  PS TQSP+    + SL AN  SP
Subjt:  QKRRWGSCWSIYWCFGSLRQRKRIGHAVLVPE-----PSPSPEAHQ----NSLQSPDIVLPFAAPPSSPASFLQSEPPSATQSPSNILSFTSLTANMYSP

Query:  DGP-SSIFAIGPFAHETQLVSPPMNFSTLTTEPSTPSFT-PPESIHLTTPSSPEVPFAQFLQPNLPKAESDDQYSCPNDDFQSYQFYPGSPVSNLISPRS
         GP SS++A GP+AHETQLVSPP+ FST TTEPST  FT PPE   LT PSSP+VP+A+FL  ++    S   +   ND   +Y  YPGSP S L SP S
Subjt:  DGP-SSIFAIGPFAHETQLVSPPMNFSTLTTEPSTPSFT-PPESIHLTTPSSPEVPFAQFLQPNLPKAESDDQYSCPNDDFQSYQFYPGSPVSNLISPRS

Query:  AISLSGASSPLP--------------DLDFASSASQFSNFSLDVPPALLNLDRQGQSSDSCTQNSVGFKSNDDDFDLNPRTSDSMNESQNIQILIDGSQM
          S  G  SP                D +  S+  Q SNF      A   LD      D     + G  S   D D+ P T+   N +QN Q       M
Subjt:  AISLSGASSPLP--------------DLDFASSASQFSNFSLDVPPALLNLDRQGQSSDSCTQNSVGFKSNDDDFDLNPRTSDSMNESQNIQILIDGSQM

Query:  EEPDVTNHRFSFELSD
        EE +     F F   +
Subjt:  EEPDVTNHRFSFELSD

AT4G25620.1 hydroxyproline-rich glycoprotein family protein9.5e-4635.48Show/hide
Query:  NNTFQTITTAADVIATVDHRFPRDTAVQKRRWGSCWSIYWCFGSLRQRKRIGHAVLVPEPSPSPEA----HQNSLQSPDIVLPFAAPPSSPASFLQSEPP
        N++  T+  AA  I + + R  + ++VQK+R GS WS+YWCFGS +  KRIGHAVLVPEP+ S  A      +S  S  I +PF APPSSPASFL S PP
Subjt:  NNTFQTITTAADVIATVDHRFPRDTAVQKRRWGSCWSIYWCFGSLRQRKRIGHAVLVPEPSPSPEA----HQNSLQSPDIVLPFAAPPSSPASFLQSEPP

Query:  SATQSPSNILSFTSLTANMYSPDGPSSIFAIGPFAHETQLVSPPMNFSTLTTEPSTPSFTPPESIHLTTPSSPEVPFAQFLQPNLPKAE------SDDQY
        SA+ +P   L   SLT N      P S F IGP+AHETQ V+PP+ FS  TTEPST  FTPP      +PSSPEVPFAQ L  +L +A        + ++
Subjt:  SATQSPSNILSFTSLTANMYSPDGPSSIFAIGPFAHETQLVSPPMNFSTLTTEPSTPSFTPPESIHLTTPSSPEVPFAQFLQPNLPKAE------SDDQY

Query:  SCPNDDFQSYQFYPGSPVSNLISPRSAISLSGASSPLPDLDFASSASQFSNFSLDVPPALLNLD-----RQGQSSDSCTQNSVGFKSNDDDFDLNP---R
        S  + +F+S Q YPGSP  NLISP      SG SSP P             F +  PP  L  +     + G    S +    G  S      L P   +
Subjt:  SCPNDDFQSYQFYPGSPVSNLISPRSAISLSGASSPLPDLDFASSASQFSNFSLDVPPALLNLD-----RQGQSSDSCTQNSVGFKSNDDDFDLNP---R

Query:  TSDSMNESQNIQILIDGSQMEEPDVTNHRFSFELSDEDSLLRNVESKPLESNVAVASSPMHETFE---------TAKETSSGGGHSSNSIEEK-----AA
         +  +      + +I  S      +       ++S+  SL  +       ++ A+   P   +FE          A + +  G H   S E         
Subjt:  TSDSMNESQNIQILIDGSQMEEPDVTNHRFSFELSDEDSLLRNVESKPLESNVAVASSPMHETFE---------TAKETSSGGGHSSNSIEEK-----AA

Query:  DGEEANQHQEHHHSTTLGSVNEFNFDNGNGSNALKPNINSDWWANAKDA-ETKGTTTGAWSFFPM
         GE  ++  +   S + GS  EF FD+ N    +   I S+WWAN K A +   +   +W+FFP+
Subjt:  DGEEANQHQEHHHSTTLGSVNEFNFDNGNGSNALKPNINSDWWANAKDA-ETKGTTTGAWSFFPM

AT5G52430.1 hydroxyproline-rich glycoprotein family protein9.2e-5737.8Show/hide
Query:  MNNTFQTITTAADVIATVDHRFPRDTAVQKRRWGSCWSIYWCFGSLRQRKRIGHAVLVPEPSPS---PEAHQNSLQSPDIVLPFAAPPSSPASFLQSEPP
        +NN+ +T+  AA  I T + R  + ++ QK RWG CWS+Y CFG+ +  KRIG+AVLVPEP  S       QNS  S  +VLPF APPSSPASFLQS+P 
Subjt:  MNNTFQTITTAADVIATVDHRFPRDTAVQKRRWGSCWSIYWCFGSLRQRKRIGHAVLVPEPSPS---PEAHQNSLQSPDIVLPFAAPPSSPASFLQSEPP

Query:  SATQSPSNILSFTSLTANMYSPDGPSSIFAIGPFAHETQLVSPPMNFSTLTTEPSTPSFTPP--ESIHLTTPSSPEVPFAQFLQPNLPKAESD------D
        S + SP   L   SLT+N +SP  P S+F +GP+A+ETQ V+PP+ FS   TEPST  +TPP   S+H+TTPSSPEVPFAQ L  +L     D       
Subjt:  SATQSPSNILSFTSLTANMYSPDGPSSIFAIGPFAHETQLVSPPMNFSTLTTEPSTPSFTPP--ESIHLTTPSSPEVPFAQFLQPNLPKAESD------D

Query:  QYSCPNDDFQSYQFYPGSP-VSNLISPRSAISLSGASSPLPDLDFASSASQFSNFSLDVPPALLNLD---------RQGQSSDSCTQNSVGFKSN-----
        ++S  + +F+S Q  PGSP   NLISP S IS SG SSP P        S    F +  PP  L  +         R G  S +   +  G  S      
Subjt:  QYSCPNDDFQSYQFYPGSP-VSNLISPRSAISLSGASSPLPDLDFASSASQFSNFSLDVPPALLNLD---------RQGQSSDSCTQNSVGFKSN-----

Query:  -----DDDFDLNPRTSDSMNESQNIQILIDGSQMEEPDVTNHRFSFELSDEDSLLRNVESKPLESNVAVASSPMHETFETAKETSSGGGHSSNSIEEKAA
               +   N  T    N+   +  L +     E  V +HR SFEL+ ED + R + SK   S+  + ++   ET E      S       +IE+++ 
Subjt:  -----DDDFDLNPRTSDSMNESQNIQILIDGSQMEEPDVTNHRFSFELSDEDSLLRNVESKPLESNVAVASSPMHETFETAKETSSGGGHSSNSIEEKAA

Query:  DGEEANQHQEHHHSTTLGSVNEFNFDNGNGSNALKPNINSDWWANAKDAETKGTTTGAWSFFP
        D E      +   S+++GS  EF FD                  N KD   +     +WSFFP
Subjt:  DGEEANQHQEHHHSTTLGSVNEFNFDNGNGSNALKPNINSDWWANAKDAETKGTTTGAWSFFP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGAGAGATGAGGCGGCGTGCGGATGCGGATGCGGATGCTGATGCTGATCTGAGGCCTATGAATAACACTTTTCAGACCATTACTACGGCCGCCGATGTGATCGCCAC
CGTTGATCATCGGTTTCCTCGGGATACTGCCGTCCAGAAAAGAAGATGGGGTAGCTGTTGGAGTATTTATTGGTGCTTTGGATCTCTCAGACAGAGGAAACGAATTGGGC
ATGCCGTCCTTGTCCCAGAACCAAGTCCTTCGCCTGAGGCTCATCAAAATTCATTGCAATCCCCAGACATTGTGCTTCCTTTTGCTGCACCTCCCTCTTCCCCTGCATCC
TTTCTTCAATCAGAGCCACCTTCTGCTACACAATCACCTTCAAATATACTCTCCTTCACTTCTCTCACTGCTAACATGTATTCTCCTGATGGGCCTTCCTCGATTTTTGC
CATTGGCCCATTTGCTCATGAGACACAGCTTGTATCTCCACCTATGAATTTTTCTACTCTCACCACTGAACCATCGACTCCTTCCTTCACTCCTCCTGAGTCTATCCACT
TGACTACACCTTCTTCCCCTGAAGTTCCATTTGCTCAGTTTCTTCAACCGAACCTTCCTAAAGCTGAGTCTGATGACCAATATTCATGTCCTAATGATGACTTTCAATCT
TATCAATTCTATCCTGGTAGCCCAGTTAGCAACCTCATATCGCCACGCTCTGCCATTTCTCTTTCTGGGGCATCTTCGCCTTTGCCAGATTTGGATTTTGCTTCCTCAGC
TTCTCAATTTTCTAATTTCTCATTGGATGTTCCACCTGCGCTGTTGAACCTTGACAGACAAGGGCAAAGTTCTGATTCTTGCACTCAAAATTCTGTAGGATTCAAATCGA
ATGATGATGATTTTGATTTGAATCCTCGAACTTCAGATTCAATGAATGAATCCCAAAATATTCAAATTCTCATTGATGGAAGCCAAATGGAGGAACCTGATGTTACTAAT
CATAGATTCTCATTTGAGTTATCTGACGAAGATTCTTTATTAAGAAACGTAGAAAGTAAGCCACTGGAGTCAAATGTTGCAGTTGCATCATCTCCAATGCATGAAACATT
TGAAACGGCTAAAGAAACTTCTTCTGGTGGTGGTCATAGCTCAAATAGTATAGAAGAAAAGGCAGCAGACGGTGAAGAAGCAAATCAGCATCAAGAACATCATCATTCTA
CTACTCTTGGGTCTGTGAATGAATTTAATTTTGATAATGGCAATGGAAGTAATGCACTTAAGCCTAATATCAACTCGGACTGGTGGGCTAATGCGAAAGATGCAGAGACA
AAAGGCACGACCACGGGGGCCTGGTCATTCTTTCCAATGGCGCAGCAAAGATGA
mRNA sequenceShow/hide mRNA sequence
TCTCTTCACTTTCTGACTGCAAATTCTCCTTATTTGTTCTGTGTTTTCTCCCGAAAAATTTCGTGTAAGAGGAACCACAACTTTCTTCTATGAACACTATCAGCGAATCC
CTGGCGTCGATCAATGAGAGAGATGAGGCGGCGTGCGGATGCGGATGCGGATGCTGATGCTGATCTGAGGCCTATGAATAACACTTTTCAGACCATTACTACGGCCGCCG
ATGTGATCGCCACCGTTGATCATCGGTTTCCTCGGGATACTGCCGTCCAGAAAAGAAGATGGGGTAGCTGTTGGAGTATTTATTGGTGCTTTGGATCTCTCAGACAGAGG
AAACGAATTGGGCATGCCGTCCTTGTCCCAGAACCAAGTCCTTCGCCTGAGGCTCATCAAAATTCATTGCAATCCCCAGACATTGTGCTTCCTTTTGCTGCACCTCCCTC
TTCCCCTGCATCCTTTCTTCAATCAGAGCCACCTTCTGCTACACAATCACCTTCAAATATACTCTCCTTCACTTCTCTCACTGCTAACATGTATTCTCCTGATGGGCCTT
CCTCGATTTTTGCCATTGGCCCATTTGCTCATGAGACACAGCTTGTATCTCCACCTATGAATTTTTCTACTCTCACCACTGAACCATCGACTCCTTCCTTCACTCCTCCT
GAGTCTATCCACTTGACTACACCTTCTTCCCCTGAAGTTCCATTTGCTCAGTTTCTTCAACCGAACCTTCCTAAAGCTGAGTCTGATGACCAATATTCATGTCCTAATGA
TGACTTTCAATCTTATCAATTCTATCCTGGTAGCCCAGTTAGCAACCTCATATCGCCACGCTCTGCCATTTCTCTTTCTGGGGCATCTTCGCCTTTGCCAGATTTGGATT
TTGCTTCCTCAGCTTCTCAATTTTCTAATTTCTCATTGGATGTTCCACCTGCGCTGTTGAACCTTGACAGACAAGGGCAAAGTTCTGATTCTTGCACTCAAAATTCTGTA
GGATTCAAATCGAATGATGATGATTTTGATTTGAATCCTCGAACTTCAGATTCAATGAATGAATCCCAAAATATTCAAATTCTCATTGATGGAAGCCAAATGGAGGAACC
TGATGTTACTAATCATAGATTCTCATTTGAGTTATCTGACGAAGATTCTTTATTAAGAAACGTAGAAAGTAAGCCACTGGAGTCAAATGTTGCAGTTGCATCATCTCCAA
TGCATGAAACATTTGAAACGGCTAAAGAAACTTCTTCTGGTGGTGGTCATAGCTCAAATAGTATAGAAGAAAAGGCAGCAGACGGTGAAGAAGCAAATCAGCATCAAGAA
CATCATCATTCTACTACTCTTGGGTCTGTGAATGAATTTAATTTTGATAATGGCAATGGAAGTAATGCACTTAAGCCTAATATCAACTCGGACTGGTGGGCTAATGCGAA
AGATGCAGAGACAAAAGGCACGACCACGGGGGCCTGGTCATTCTTTCCAATGGCGCAGCAAAGATGAGCAAACTGGTGATTATCCTCTGGAATTTCCTCATGCCCATCAT
GTTTTGCAGTTGCAATTTAGTAGGTAATAGGTAAGACAAATTGCTAGAGGACTGGTGAGCTTTGAAGGTAAAAAAGAGGACAAATCATGAAAAGAGAAAAACCAGAAGCC
ATATTATTTTCAACAATCTGACCTCCTAAACACAGGCAGGTCTGAATAGTATGATAATTAGAAATCTGTAGTCGACAATGGGCCCTATTAACAAACAGTAGTGGCTCCTC
ACTTGAATTGTAACAGCTATTAGTATTCTGTAGAAATTGAAAGTGTGTAAATATGGTAATAAAAATTGTTTTTATCTTTTGACAGC
Protein sequenceShow/hide protein sequence
MREMRRRADADADADADLRPMNNTFQTITTAADVIATVDHRFPRDTAVQKRRWGSCWSIYWCFGSLRQRKRIGHAVLVPEPSPSPEAHQNSLQSPDIVLPFAAPPSSPAS
FLQSEPPSATQSPSNILSFTSLTANMYSPDGPSSIFAIGPFAHETQLVSPPMNFSTLTTEPSTPSFTPPESIHLTTPSSPEVPFAQFLQPNLPKAESDDQYSCPNDDFQS
YQFYPGSPVSNLISPRSAISLSGASSPLPDLDFASSASQFSNFSLDVPPALLNLDRQGQSSDSCTQNSVGFKSNDDDFDLNPRTSDSMNESQNIQILIDGSQMEEPDVTN
HRFSFELSDEDSLLRNVESKPLESNVAVASSPMHETFETAKETSSGGGHSSNSIEEKAADGEEANQHQEHHHSTTLGSVNEFNFDNGNGSNALKPNINSDWWANAKDAET
KGTTTGAWSFFPMAQQR