; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Carg14135 (gene) of Silver-seed gourd (SMH-JMG-627) v2 genome

Gene IDCarg14135
OrganismCucurbita argyrosperma subsp. argyrosperma cv. SMH-JMG-627 (Silver-seed gourd (SMH-JMG-627) v2)
DescriptionMucin-2
Genome locationCarg_Chr04:3961016..3963090
RNA-Seq ExpressionCarg14135
SyntenyCarg14135
Gene Ontology termsNA
InterPro domainsIPR040420 - Uncharacterized protein At1g76660-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6600562.1 hypothetical protein SDJN03_05795, partial [Cucurbita argyrosperma subsp. sororia]2.2e-254100Show/hide
Query:  MRRRADADADADAADLRPMNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCWSIYWCFGSLKQRKRIGHAVLVPEPSPSPEAHQNSLQSPDTVLPFA
        MRRRADADADADAADLRPMNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCWSIYWCFGSLKQRKRIGHAVLVPEPSPSPEAHQNSLQSPDTVLPFA
Subjt:  MRRRADADADADAADLRPMNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCWSIYWCFGSLKQRKRIGHAVLVPEPSPSPEAHQNSLQSPDTVLPFA

Query:  APPSSPVSFLQSEPPSVTQSPSNILSFTSLTANMYSPDGPSSIFAIGPFAHETQLVSPPLNFSTLTTEPSTPSFTPPESIHLTTPSSPEVPFAQFLQPTL
        APPSSPVSFLQSEPPSVTQSPSNILSFTSLTANMYSPDGPSSIFAIGPFAHETQLVSPPLNFSTLTTEPSTPSFTPPESIHLTTPSSPEVPFAQFLQPTL
Subjt:  APPSSPVSFLQSEPPSVTQSPSNILSFTSLTANMYSPDGPSSIFAIGPFAHETQLVSPPLNFSTLTTEPSTPSFTPPESIHLTTPSSPEVPFAQFLQPTL

Query:  PKAESDDQYSCPNDDFQSYQFYPGSPVSNLISPRSAISLSGASSPLPDLDFASSASQFSIFSLDVPPALLNLDRQGQSSDSCTQNSVGFKSNDDDLDLNP
        PKAESDDQYSCPNDDFQSYQFYPGSPVSNLISPRSAISLSGASSPLPDLDFASSASQFSIFSLDVPPALLNLDRQGQSSDSCTQNSVGFKSNDDDLDLNP
Subjt:  PKAESDDQYSCPNDDFQSYQFYPGSPVSNLISPRSAISLSGASSPLPDLDFASSASQFSIFSLDVPPALLNLDRQGQSSDSCTQNSVGFKSNDDDLDLNP

Query:  RTSDSMNESQNIQILIDGSQMEEPDVANHRFSFELSDEDSLLRNIESKPLESNVAVASSPMHETFETAKETSSGGGHSSNGIEEKAADGEEANQHQEHHH
        RTSDSMNESQNIQILIDGSQMEEPDVANHRFSFELSDEDSLLRNIESKPLESNVAVASSPMHETFETAKETSSGGGHSSNGIEEKAADGEEANQHQEHHH
Subjt:  RTSDSMNESQNIQILIDGSQMEEPDVANHRFSFELSDEDSLLRNIESKPLESNVAVASSPMHETFETAKETSSGGGHSSNGIEEKAADGEEANQHQEHHH

Query:  STTLGSVNEFNFDNGNGSNALKPNINSDWWANAKDVETKGTTPGAWSFFPMAQQR
        STTLGSVNEFNFDNGNGSNALKPNINSDWWANAKDVETKGTTPGAWSFFPMAQQR
Subjt:  STTLGSVNEFNFDNGNGSNALKPNINSDWWANAKDVETKGTTPGAWSFFPMAQQR

KAG7031203.1 hypothetical protein SDJN02_05243, partial [Cucurbita argyrosperma subsp. argyrosperma]2.2e-254100Show/hide
Query:  MRRRADADADADAADLRPMNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCWSIYWCFGSLKQRKRIGHAVLVPEPSPSPEAHQNSLQSPDTVLPFA
        MRRRADADADADAADLRPMNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCWSIYWCFGSLKQRKRIGHAVLVPEPSPSPEAHQNSLQSPDTVLPFA
Subjt:  MRRRADADADADAADLRPMNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCWSIYWCFGSLKQRKRIGHAVLVPEPSPSPEAHQNSLQSPDTVLPFA

Query:  APPSSPVSFLQSEPPSVTQSPSNILSFTSLTANMYSPDGPSSIFAIGPFAHETQLVSPPLNFSTLTTEPSTPSFTPPESIHLTTPSSPEVPFAQFLQPTL
        APPSSPVSFLQSEPPSVTQSPSNILSFTSLTANMYSPDGPSSIFAIGPFAHETQLVSPPLNFSTLTTEPSTPSFTPPESIHLTTPSSPEVPFAQFLQPTL
Subjt:  APPSSPVSFLQSEPPSVTQSPSNILSFTSLTANMYSPDGPSSIFAIGPFAHETQLVSPPLNFSTLTTEPSTPSFTPPESIHLTTPSSPEVPFAQFLQPTL

Query:  PKAESDDQYSCPNDDFQSYQFYPGSPVSNLISPRSAISLSGASSPLPDLDFASSASQFSIFSLDVPPALLNLDRQGQSSDSCTQNSVGFKSNDDDLDLNP
        PKAESDDQYSCPNDDFQSYQFYPGSPVSNLISPRSAISLSGASSPLPDLDFASSASQFSIFSLDVPPALLNLDRQGQSSDSCTQNSVGFKSNDDDLDLNP
Subjt:  PKAESDDQYSCPNDDFQSYQFYPGSPVSNLISPRSAISLSGASSPLPDLDFASSASQFSIFSLDVPPALLNLDRQGQSSDSCTQNSVGFKSNDDDLDLNP

Query:  RTSDSMNESQNIQILIDGSQMEEPDVANHRFSFELSDEDSLLRNIESKPLESNVAVASSPMHETFETAKETSSGGGHSSNGIEEKAADGEEANQHQEHHH
        RTSDSMNESQNIQILIDGSQMEEPDVANHRFSFELSDEDSLLRNIESKPLESNVAVASSPMHETFETAKETSSGGGHSSNGIEEKAADGEEANQHQEHHH
Subjt:  RTSDSMNESQNIQILIDGSQMEEPDVANHRFSFELSDEDSLLRNIESKPLESNVAVASSPMHETFETAKETSSGGGHSSNGIEEKAADGEEANQHQEHHH

Query:  STTLGSVNEFNFDNGNGSNALKPNINSDWWANAKDVETKGTTPGAWSFFPMAQQR
        STTLGSVNEFNFDNGNGSNALKPNINSDWWANAKDVETKGTTPGAWSFFPMAQQR
Subjt:  STTLGSVNEFNFDNGNGSNALKPNINSDWWANAKDVETKGTTPGAWSFFPMAQQR

XP_022941648.1 uncharacterized protein At1g76660-like isoform X1 [Cucurbita moschata]2.9e-24696.75Show/hide
Query:  MRRR------ADADADADAADLRPMNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCWSIYWCFGSLKQRKRIGHAVLVPEPSPSPEAHQNSLQSPD
        MRRR      ADADADADAADLRPMNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCWSIYWCFGSLKQRKRIGHAVLVPEPSPSPEAHQNSLQSPD
Subjt:  MRRR------ADADADADAADLRPMNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCWSIYWCFGSLKQRKRIGHAVLVPEPSPSPEAHQNSLQSPD

Query:  TVLPFAAPPSSPVSFLQSEPPSVTQSPSNILSFTSLTANMYSPDGPSSIFAIGPFAHETQLVSPPLNFSTLTTEPSTPSFTPPESIHLTTPSSPEVPFAQ
         VLPFAAPPSSPVSFLQSEPPS TQSPSNILSFTSLTANMYSPDGPSSIFAIGPFAHETQLVSPPLNFSTLTTEPSTPSFTPPESIHLTTPSSPEVPFAQ
Subjt:  TVLPFAAPPSSPVSFLQSEPPSVTQSPSNILSFTSLTANMYSPDGPSSIFAIGPFAHETQLVSPPLNFSTLTTEPSTPSFTPPESIHLTTPSSPEVPFAQ

Query:  FLQPTLPKAESDDQYSCPNDDFQSYQFYPGSPVSNLISPRSAISLSGASSPLPDLDFASSASQFSIFSLDVPPALLNLDRQGQSSDSCTQNSVGFKSNDD
        FLQPTLPKAESDDQYSCPNDDFQSYQFYPGSPVSNLISPRSAISLSGASSP  DLDFASSASQFS FSLDVPPALLNLDRQGQSSDSCTQNSVGFKSNDD
Subjt:  FLQPTLPKAESDDQYSCPNDDFQSYQFYPGSPVSNLISPRSAISLSGASSPLPDLDFASSASQFSIFSLDVPPALLNLDRQGQSSDSCTQNSVGFKSNDD

Query:  DLDLNPRTSDSMNESQNIQILIDGSQMEEPDVANHRFSFELSDEDSLLRNIESKPLESNVAVASSPMHETFETAKETSSGGGHSSNGIEEKAADGEEANQ
        D DL+PRTSDSMNESQNIQILIDGSQMEEPDVANHRFSFELSDEDSLLRNIESKPLESNVAVASSPMHETFETAKETSSGGGHSSNGIEEKAADGEEANQ
Subjt:  DLDLNPRTSDSMNESQNIQILIDGSQMEEPDVANHRFSFELSDEDSLLRNIESKPLESNVAVASSPMHETFETAKETSSGGGHSSNGIEEKAADGEEANQ

Query:  HQEHHHSTTLGSVNEFNFDNGNGSNALKPNINSDWWANAKDVETKGTTPGAWSFFPMAQQR
        HQEHHHSTTLGSVNEFNFDNGNGSNALKPNI+SDWWANAKDVETKGTT GAWSFFPMAQQR
Subjt:  HQEHHHSTTLGSVNEFNFDNGNGSNALKPNINSDWWANAKDVETKGTTPGAWSFFPMAQQR

XP_023522163.1 uncharacterized protein At1g76660-like [Cucurbita pepo subsp. pepo]1.1e-24597.36Show/hide
Query:  MRRRADADADADAADLRPMNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCWSIYWCFGSLKQRKRIGHAVLVPEPSPSPEAHQNSLQSPDTVLPFA
        MRRR    ADADAADLRPMNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCWSIYWCFGSLKQRKRIGHAVLVPEPSPSPEAHQNSLQSPD VLPFA
Subjt:  MRRRADADADADAADLRPMNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCWSIYWCFGSLKQRKRIGHAVLVPEPSPSPEAHQNSLQSPDTVLPFA

Query:  APPSSPVSFLQSEPPSVTQSPSNILSFTSLTANMYSPDGPSSIFAIGPFAHETQLVSPPLNFSTLTTEPSTPSFTPPESIHLTTPSSPEVPFAQFLQPTL
        APPSSPVSFLQSEPPS TQSPSNILSFTSLTANMYSPDGPSSIFAIGPFAHETQLVSPPLNFSTLTTEPSTPSFTPPESIHLTTPSSPEVPFAQFLQPTL
Subjt:  APPSSPVSFLQSEPPSVTQSPSNILSFTSLTANMYSPDGPSSIFAIGPFAHETQLVSPPLNFSTLTTEPSTPSFTPPESIHLTTPSSPEVPFAQFLQPTL

Query:  PKAESDDQYSCPNDDFQSYQFYPGSPVSNLISPRSAISLSGASSPLPDLDFASSASQFSIFSLDVPPALLNLDRQGQSSDSCTQNSVGFKSNDDDLDLNP
         KAESDDQYSCPNDDFQSYQFYPGSPVSNLISPRSAISLSGASSPLPDLDFASSASQFS FSLDVPPALLNLDRQGQSSDSCTQNSVGFKSNDDD DLNP
Subjt:  PKAESDDQYSCPNDDFQSYQFYPGSPVSNLISPRSAISLSGASSPLPDLDFASSASQFSIFSLDVPPALLNLDRQGQSSDSCTQNSVGFKSNDDDLDLNP

Query:  RTSDSMNESQNIQILIDGSQMEEPDVANHRFSFELSDEDSLLRNIESKPLESNVAVASSPMHETFETAKETSSGGGHSSNGIEEKAADGEEANQHQEHHH
        RTSDSMNESQNIQILIDGSQMEEPDV NHRFSFELSDEDSLLRN+ESKPLESNVAVASSPMHETFETAKETSSGGGHSSNGIEEKAADGEEANQHQEHHH
Subjt:  RTSDSMNESQNIQILIDGSQMEEPDVANHRFSFELSDEDSLLRNIESKPLESNVAVASSPMHETFETAKETSSGGGHSSNGIEEKAADGEEANQHQEHHH

Query:  STTLGSVNEFNFDNGNGSNALKPNINSDWWANAKDVETKGTTPGAWSFFPMAQQR
        STTLGSVNEFNFDNGNGSNALKPNINSDWWANAKDVETKGTT GAWSFFPMAQQR
Subjt:  STTLGSVNEFNFDNGNGSNALKPNINSDWWANAKDVETKGTTPGAWSFFPMAQQR

XP_023529207.1 uncharacterized protein At1g76660-like isoform X1 [Cucurbita pepo subsp. pepo]5.8e-24797.8Show/hide
Query:  MRRRADADADADAADLRPMNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCWSIYWCFGSLKQRKRIGHAVLVPEPSPSPEAHQNSLQSPDTVLPFA
        MRRR  ADADADAADLRPMNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCWSIYWCFGSLKQRKRIGHAVLVPEPSPSPEAHQNSLQSPD VLPFA
Subjt:  MRRRADADADADAADLRPMNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCWSIYWCFGSLKQRKRIGHAVLVPEPSPSPEAHQNSLQSPDTVLPFA

Query:  APPSSPVSFLQSEPPSVTQSPSNILSFTSLTANMYSPDGPSSIFAIGPFAHETQLVSPPLNFSTLTTEPSTPSFTPPESIHLTTPSSPEVPFAQFLQPTL
        APPSSPVSFLQSEPPS TQSPSNILSFTSLTANMYSPDGPSSIFAIGPFAHETQLVSPPLNFSTLTTEPSTPSFTPPESIHLTTPSSPEVPFAQFLQPTL
Subjt:  APPSSPVSFLQSEPPSVTQSPSNILSFTSLTANMYSPDGPSSIFAIGPFAHETQLVSPPLNFSTLTTEPSTPSFTPPESIHLTTPSSPEVPFAQFLQPTL

Query:  PKAESDDQYSCPNDDFQSYQFYPGSPVSNLISPRSAISLSGASSPLPDLDFASSASQFSIFSLDVPPALLNLDRQGQSSDSCTQNSVGFKSNDDDLDLNP
         KAESDDQYSCPNDDFQSYQFYPGSPVSNLISPRSAISLSGASSPLPDLDFASSASQFS FSLDVPPALLNLDRQGQSSDSCTQNSVGFKSNDDD DLNP
Subjt:  PKAESDDQYSCPNDDFQSYQFYPGSPVSNLISPRSAISLSGASSPLPDLDFASSASQFSIFSLDVPPALLNLDRQGQSSDSCTQNSVGFKSNDDDLDLNP

Query:  RTSDSMNESQNIQILIDGSQMEEPDVANHRFSFELSDEDSLLRNIESKPLESNVAVASSPMHETFETAKETSSGGGHSSNGIEEKAADGEEANQHQEHHH
        RTSDSMNESQNIQILIDGSQMEEPDV NHRFSFELSDEDSLLRN+ESKPLESNVAVASSPMHETFETAKETSSGGGHSSNGIEEKAADGEEANQHQEHHH
Subjt:  RTSDSMNESQNIQILIDGSQMEEPDVANHRFSFELSDEDSLLRNIESKPLESNVAVASSPMHETFETAKETSSGGGHSSNGIEEKAADGEEANQHQEHHH

Query:  STTLGSVNEFNFDNGNGSNALKPNINSDWWANAKDVETKGTTPGAWSFFPMAQQR
        STTLGSVNEFNFDNGNGSNALKPNINSDWWANAKDVETKGTT GAWSFFPMAQQR
Subjt:  STTLGSVNEFNFDNGNGSNALKPNINSDWWANAKDVETKGTTPGAWSFFPMAQQR

TrEMBL top hitse value%identityAlignment
A0A5D3CYQ2 Mucin-25.0e-18074.58Show/hide
Query:  MRRRADADADADAADLRPMNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCWSIYWCFGSLKQRKRIGHAVLVPEPSPSPEAHQNSLQSPDTVLPFA
        MRRR D D      D RP+NNTFQTITAAADAIATVDHRFPRATAVQKRRWGSC SIYWCFGSLKQRKRIGHAVLVPEPSPS E H+N+LQSPD VLPFA
Subjt:  MRRRADADADADAADLRPMNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCWSIYWCFGSLKQRKRIGHAVLVPEPSPSPEAHQNSLQSPDTVLPFA

Query:  APPSSPVSFLQSEPPSVTQSPSNILSFTSLTANMYSPDGPSSIFAIGPFAHETQLVSPPLNFSTLTTEPSTPSFTPPESIHLTTPSSPEVPFAQFLQPTL
        APPSSPVS LQSEPPS  QSP+ ++SFTSLTANMYSPDGPSSIFAIGPFAHE QLVSPPLNFSTLTTEPSTP FTPPESIHLTTPSSPEVPFAQF+ P+L
Subjt:  APPSSPVSFLQSEPPSVTQSPSNILSFTSLTANMYSPDGPSSIFAIGPFAHETQLVSPPLNFSTLTTEPSTPSFTPPESIHLTTPSSPEVPFAQFLQPTL

Query:  PKAESDDQYSCPNDDFQSYQFYPGSPVSNLISPRSAISLSGASSPLPDLDFASSASQFSIFSLDVPPALLNLD-------RQGQSSDSCTQNSVGFKSND
         K ESD+QY+ PNDDFQSYQFYPGSPVS+LISPRS IS SGASSPLPD DFAS  SQF  F L+VPP L NLD       RQ QS+DSCTQ+S+ FKS++
Subjt:  PKAESDDQYSCPNDDFQSYQFYPGSPVSNLISPRSAISLSGASSPLPDLDFASSASQFSIFSLDVPPALLNLD-------RQGQSSDSCTQNSVGFKSND

Query:  DDLDLNPRTSDSM------NESQNIQILID--GSQMEEPDVANHRFSFELSDEDSLLRNIESKPLESN-VAVASSPMHETFETAKETSSGGGHSSNGIEE
        D + LNP TS+SM      NESQNIQILID    + EEP   NHRFSFELSD D L +++ SKPLESN + V SSP+HE FET KE S  G H+SN IEE
Subjt:  DDLDLNPRTSDSM------NESQNIQILID--GSQMEEPDVANHRFSFELSDEDSLLRNIESKPLESN-VAVASSPMHETFETAKETSSGGGHSSNGIEE

Query:  KA-ADGEEANQHQEHHHSTTLGSVNEFNFDNGNGSNALKPNINSDWWANAKDVETKGTTPGAWSFFPMAQQR
        K  ADG+EA+QHQE HHS  LGSV EFNFDN NGS+   P INSDWW NAKD  T+GTT GAWSFFP  QQR
Subjt:  KA-ADGEEANQHQEHHHSTTLGSVNEFNFDNGNGSNALKPNINSDWWANAKDVETKGTTPGAWSFFPMAQQR

A0A6J1C828 uncharacterized protein At1g76660-like2.5e-18776.65Show/hide
Query:  MRRRADADADADAADLRPMNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCWSIYWCFGSLKQRKRIGHAVLVPEPSPSPEAHQNSLQSPDTVLPFA
        MRRR DADAD   ADL P+NNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCWSIYWCFGSLKQRKRIGHAVLVPEPSPS E  +N+LQSPD VLPFA
Subjt:  MRRRADADADADAADLRPMNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCWSIYWCFGSLKQRKRIGHAVLVPEPSPSPEAHQNSLQSPDTVLPFA

Query:  APPSSPVSFLQSEPPSVTQSPSNILSFTSLTANMYSPDGPSSIFAIGPFAHETQLVSPPLNFSTLTTEPSTPSFTPPESIHLTTPSSPEVPFAQFLQPTL
        APPSSPVSFLQSEPPS TQSP+ ILSFTSLTANMYSPDGPSSIFA+GPFAHETQLVSPPLNFST+TT+PST  FTPPESIHLTTPSSPEVPFAQ+LQP+ 
Subjt:  APPSSPVSFLQSEPPSVTQSPSNILSFTSLTANMYSPDGPSSIFAIGPFAHETQLVSPPLNFSTLTTEPSTPSFTPPESIHLTTPSSPEVPFAQFLQPTL

Query:  PKAESDDQY-SCPNDDFQSYQFYPGSPVSNLISPRSAISLSGASSPLPDLDFASSASQFSIFSLDVPPALLNLD-------RQGQSSDSCTQNSVGFKSN
         K ESD QY   PNDDFQSYQFYPGSPVS+LISPRS IS SGASSPLPD DF  S S FS F ++VPP LLNLD       R  QSSDSCTQNSVG+KS+
Subjt:  PKAESDDQY-SCPNDDFQSYQFYPGSPVSNLISPRSAISLSGASSPLPDLDFASSASQFSIFSLDVPPALLNLD-------RQGQSSDSCTQNSVGFKSN

Query:  DDDLDLNPRTSDSM------NESQNIQILIDGSQMEEPDVANHRFSFELSDEDSLLRNIESKPLESN-VAVASSPMHETFETAKETSSGGGHSSNGIEE-
        +D + LNP+TS+S+      NE  NIQIL DGSQ +E   ANHRFSFELSDED+LL+++E+KPLESN +AVASSP+HE  ETAKETS  GGH+SN  EE 
Subjt:  DDDLDLNPRTSDSM------NESQNIQILIDGSQMEEPDVANHRFSFELSDEDSLLRNIESKPLESN-VAVASSPMHETFETAKETSSGGGHSSNGIEE-

Query:  KAADGEEANQHQE-HHHSTTLGSVNEFNFDNGNGSNALKPNINSDWWANAKDVETKGTTPGAWSFFPMAQQ
        + ADGEE + HQE  HHS TLG+V EFNFDNGNG + LKPNINS WWAN KD ET+GTT GAWSFFP+ QQ
Subjt:  KAADGEEANQHQE-HHHSTTLGSVNEFNFDNGNGSNALKPNINSDWWANAKDVETKGTTPGAWSFFPMAQQ

A0A6J1FP20 uncharacterized protein At1g76660-like isoform X21.0e-24497.14Show/hide
Query:  MRRRADADADADAADLRPMNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCWSIYWCFGSLKQRKRIGHAVLVPEPSPSPEAHQNSLQSPDTVLPFA
        MRRR    ADADAADLRPMNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCWSIYWCFGSLKQRKRIGHAVLVPEPSPSPEAHQNSLQSPD VLPFA
Subjt:  MRRRADADADADAADLRPMNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCWSIYWCFGSLKQRKRIGHAVLVPEPSPSPEAHQNSLQSPDTVLPFA

Query:  APPSSPVSFLQSEPPSVTQSPSNILSFTSLTANMYSPDGPSSIFAIGPFAHETQLVSPPLNFSTLTTEPSTPSFTPPESIHLTTPSSPEVPFAQFLQPTL
        APPSSPVSFLQSEPPS TQSPSNILSFTSLTANMYSPDGPSSIFAIGPFAHETQLVSPPLNFSTLTTEPSTPSFTPPESIHLTTPSSPEVPFAQFLQPTL
Subjt:  APPSSPVSFLQSEPPSVTQSPSNILSFTSLTANMYSPDGPSSIFAIGPFAHETQLVSPPLNFSTLTTEPSTPSFTPPESIHLTTPSSPEVPFAQFLQPTL

Query:  PKAESDDQYSCPNDDFQSYQFYPGSPVSNLISPRSAISLSGASSPLPDLDFASSASQFSIFSLDVPPALLNLDRQGQSSDSCTQNSVGFKSNDDDLDLNP
        PKAESDDQYSCPNDDFQSYQFYPGSPVSNLISPRSAISLSGASSP  DLDFASSASQFS FSLDVPPALLNLDRQGQSSDSCTQNSVGFKSNDDD DL+P
Subjt:  PKAESDDQYSCPNDDFQSYQFYPGSPVSNLISPRSAISLSGASSPLPDLDFASSASQFSIFSLDVPPALLNLDRQGQSSDSCTQNSVGFKSNDDDLDLNP

Query:  RTSDSMNESQNIQILIDGSQMEEPDVANHRFSFELSDEDSLLRNIESKPLESNVAVASSPMHETFETAKETSSGGGHSSNGIEEKAADGEEANQHQEHHH
        RTSDSMNESQNIQILIDGSQMEEPDVANHRFSFELSDEDSLLRNIESKPLESNVAVASSPMHETFETAKETSSGGGHSSNGIEEKAADGEEANQHQEHHH
Subjt:  RTSDSMNESQNIQILIDGSQMEEPDVANHRFSFELSDEDSLLRNIESKPLESNVAVASSPMHETFETAKETSSGGGHSSNGIEEKAADGEEANQHQEHHH

Query:  STTLGSVNEFNFDNGNGSNALKPNINSDWWANAKDVETKGTTPGAWSFFPMAQQR
        STTLGSVNEFNFDNGNGSNALKPNI+SDWWANAKDVETKGTT GAWSFFPMAQQR
Subjt:  STTLGSVNEFNFDNGNGSNALKPNINSDWWANAKDVETKGTTPGAWSFFPMAQQR

A0A6J1FSP7 uncharacterized protein At1g76660-like isoform X11.4e-24696.75Show/hide
Query:  MRRR------ADADADADAADLRPMNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCWSIYWCFGSLKQRKRIGHAVLVPEPSPSPEAHQNSLQSPD
        MRRR      ADADADADAADLRPMNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCWSIYWCFGSLKQRKRIGHAVLVPEPSPSPEAHQNSLQSPD
Subjt:  MRRR------ADADADADAADLRPMNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCWSIYWCFGSLKQRKRIGHAVLVPEPSPSPEAHQNSLQSPD

Query:  TVLPFAAPPSSPVSFLQSEPPSVTQSPSNILSFTSLTANMYSPDGPSSIFAIGPFAHETQLVSPPLNFSTLTTEPSTPSFTPPESIHLTTPSSPEVPFAQ
         VLPFAAPPSSPVSFLQSEPPS TQSPSNILSFTSLTANMYSPDGPSSIFAIGPFAHETQLVSPPLNFSTLTTEPSTPSFTPPESIHLTTPSSPEVPFAQ
Subjt:  TVLPFAAPPSSPVSFLQSEPPSVTQSPSNILSFTSLTANMYSPDGPSSIFAIGPFAHETQLVSPPLNFSTLTTEPSTPSFTPPESIHLTTPSSPEVPFAQ

Query:  FLQPTLPKAESDDQYSCPNDDFQSYQFYPGSPVSNLISPRSAISLSGASSPLPDLDFASSASQFSIFSLDVPPALLNLDRQGQSSDSCTQNSVGFKSNDD
        FLQPTLPKAESDDQYSCPNDDFQSYQFYPGSPVSNLISPRSAISLSGASSP  DLDFASSASQFS FSLDVPPALLNLDRQGQSSDSCTQNSVGFKSNDD
Subjt:  FLQPTLPKAESDDQYSCPNDDFQSYQFYPGSPVSNLISPRSAISLSGASSPLPDLDFASSASQFSIFSLDVPPALLNLDRQGQSSDSCTQNSVGFKSNDD

Query:  DLDLNPRTSDSMNESQNIQILIDGSQMEEPDVANHRFSFELSDEDSLLRNIESKPLESNVAVASSPMHETFETAKETSSGGGHSSNGIEEKAADGEEANQ
        D DL+PRTSDSMNESQNIQILIDGSQMEEPDVANHRFSFELSDEDSLLRNIESKPLESNVAVASSPMHETFETAKETSSGGGHSSNGIEEKAADGEEANQ
Subjt:  DLDLNPRTSDSMNESQNIQILIDGSQMEEPDVANHRFSFELSDEDSLLRNIESKPLESNVAVASSPMHETFETAKETSSGGGHSSNGIEEKAADGEEANQ

Query:  HQEHHHSTTLGSVNEFNFDNGNGSNALKPNINSDWWANAKDVETKGTTPGAWSFFPMAQQR
        HQEHHHSTTLGSVNEFNFDNGNGSNALKPNI+SDWWANAKDVETKGTT GAWSFFPMAQQR
Subjt:  HQEHHHSTTLGSVNEFNFDNGNGSNALKPNINSDWWANAKDVETKGTTPGAWSFFPMAQQR

A0A6J1IUL0 uncharacterized protein At1g76660-like1.3e-24496.26Show/hide
Query:  MRRRADADADADAADLRPMNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCWSIYWCFGSLKQRKRIGHAVLVPEPSPSPEAHQNSLQSPDTVLPFA
        MRRRADADADAD ADLRPMNNTFQTIT AAD IATVDHRFPR TAVQKRRWGSCWSIYWCFGSL+QRKRIGHAVLVPEPSPSPEAHQNSLQSPD VLPFA
Subjt:  MRRRADADADADAADLRPMNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCWSIYWCFGSLKQRKRIGHAVLVPEPSPSPEAHQNSLQSPDTVLPFA

Query:  APPSSPVSFLQSEPPSVTQSPSNILSFTSLTANMYSPDGPSSIFAIGPFAHETQLVSPPLNFSTLTTEPSTPSFTPPESIHLTTPSSPEVPFAQFLQPTL
        APPSSP SFLQSEPPS TQSPSNILSFTSLTANMYSPDGPSSIFAIGPFAHETQLVSPP+NFSTLTTEPSTPSFTPPESIHLTTPSSPEVPFAQFLQP L
Subjt:  APPSSPVSFLQSEPPSVTQSPSNILSFTSLTANMYSPDGPSSIFAIGPFAHETQLVSPPLNFSTLTTEPSTPSFTPPESIHLTTPSSPEVPFAQFLQPTL

Query:  PKAESDDQYSCPNDDFQSYQFYPGSPVSNLISPRSAISLSGASSPLPDLDFASSASQFSIFSLDVPPALLNLDRQGQSSDSCTQNSVGFKSNDDDLDLNP
        PKAESDDQYSCPNDDFQSYQFYPGSPVSNLISPRSAISLSGASSPLPDLDFASSASQFS FSLDVPPALLNLDRQGQSSDSCTQNSVGFKSNDDD DLNP
Subjt:  PKAESDDQYSCPNDDFQSYQFYPGSPVSNLISPRSAISLSGASSPLPDLDFASSASQFSIFSLDVPPALLNLDRQGQSSDSCTQNSVGFKSNDDDLDLNP

Query:  RTSDSMNESQNIQILIDGSQMEEPDVANHRFSFELSDEDSLLRNIESKPLESNVAVASSPMHETFETAKETSSGGGHSSNGIEEKAADGEEANQHQEHHH
        RTSDSMNESQNIQILIDGSQMEEPDV NHRFSFELSDEDSLLRN+ESKPLESNVAVASSPMHETFETAKETSSGGGHSSN IEEKAADGEEANQHQEHHH
Subjt:  RTSDSMNESQNIQILIDGSQMEEPDVANHRFSFELSDEDSLLRNIESKPLESNVAVASSPMHETFETAKETSSGGGHSSNGIEEKAADGEEANQHQEHHH

Query:  STTLGSVNEFNFDNGNGSNALKPNINSDWWANAKDVETKGTTPGAWSFFPMAQQR
        STTLGSVNEFNFDNGNGSNALKPNINSDWWANAKD ETKGTT GAWSFFPMAQQR
Subjt:  STTLGSVNEFNFDNGNGSNALKPNINSDWWANAKDVETKGTTPGAWSFFPMAQQR

SwissProt top hitse value%identityAlignment
Q9SRE5 Uncharacterized protein At1g766607.8e-2935.32Show/hide
Query:  QKRRWGSCWSIYWCFGSLKQRKRIGHAVLVPE-----PSPSPEAHQ----NSLQSPDTVLPFAAPPSSPVSFLQSEPPSVTQSPSNILSFTSLTANMYSP
        Q++RWG C  ++ CF S K  KRI  A  +PE      S    AHQ    N+  +    L   APPSSP SF  S  PS TQSP+    + SL AN  SP
Subjt:  QKRRWGSCWSIYWCFGSLKQRKRIGHAVLVPE-----PSPSPEAHQ----NSLQSPDTVLPFAAPPSSPVSFLQSEPPSVTQSPSNILSFTSLTANMYSP

Query:  DGP-SSIFAIGPFAHETQLVSPPLNFSTLTTEPSTPSFT-PPESIHLTTPSSPEVPFAQFLQPTLPKAESDDQYSCPNDDFQSYQFYPGSPVSNLISPRS
         GP SS++A GP+AHETQLVSPP+ FST TTEPST  FT PPE   LT PSSP+VP+A+FL  ++    S   +   ND   +Y  YPGSP S L SP S
Subjt:  DGP-SSIFAIGPFAHETQLVSPPLNFSTLTTEPSTPSFT-PPESIHLTTPSSPEVPFAQFLQPTLPKAESDDQYSCPNDDFQSYQFYPGSPVSNLISPRS

Query:  AIS------------------------LSGASSPLPDLDFASSASQFSIFSLDVPPAL-LNLDRQGQSSDSCTQNSVGF-KSNDDDLDLNPRTSDSMNES
          S                         +G S+PL + +F    + F+ F LD  P++  N  R   S DS    + G+   N +  + +P+      E+
Subjt:  AIS------------------------LSGASSPLPDLDFASSASQFSIFSLDVPPAL-LNLDRQGQSSDSCTQNSVGF-KSNDDDLDLNPRTSDSMNES

Query:  QNI-------QILIDGSQMEEPDVANHRF---SFELSDEDSLLRNIESKPLESNVAVASSPMHETFETAKETSSGGGHSSNGIEE
                  +I+     +E  DV +  F   ++  SD   LLR       E+N+   +SP  E    ++        SSN  ++
Subjt:  QNI-------QILIDGSQMEEPDVANHRF---SFELSDEDSLLRNIESKPLESNVAVASSPMHETFETAKETSSGGGHSSNGIEE

Arabidopsis top hitse value%identityAlignment
AT1G63720.1 BEST Arabidopsis thaliana protein match is: hydroxyproline-rich glycoprotein family protein (TAIR:AT5G52430.1)3.7e-5048.39Show/hide
Query:  NNTFQTITAAADAIATVDHRFPRATAV-QKRRWGSCWSIYWCFGSLKQRKRIGHAVLVPEP----SPSPEAHQNSLQSPDTVLPFAAPPSSPVSFLQSEP
        NN F TI AAA AIA+ D R  +++ + +KR+W + WS+  CFGS +QRKRIG++VLVPEP    S +     +  +S  T LPF APPSSP SF QSEP
Subjt:  NNTFQTITAAADAIATVDHRFPRATAV-QKRRWGSCWSIYWCFGSLKQRKRIGHAVLVPEP----SPSPEAHQNSLQSPDTVLPFAAPPSSPVSFLQSEP

Query:  PSVTQSPSNILSFTSLTANMYSPDGPSSIFAIGPFAHETQLVSPPLNFSTLTTEPSTPSFTPP---ESIHL--TTPSSPEVPFAQFLQPTLPKAESDDQY
        PS TQSP  ILSF+ L  N        SIFAIGP+AHETQLVSPP+ FST TTEPS+   TPP    SI+L  TTPSSPEVPFAQ             ++
Subjt:  PSVTQSPSNILSFTSLTANMYSPDGPSSIFAIGPFAHETQLVSPPLNFSTLTTEPSTPSFTPP---ESIHL--TTPSSPEVPFAQFLQPTLPKAESDDQY

Query:  SCPND-DFQSYQFYPGSPVSNLISPRSAISLSGASSPLPDLDFASSASQFSIFSLDVPPALLNLDRQGQSSDSCTQNSV
           +  +FQ YQ  PGSP+  LISP      SG +SP PD       S F  F +  PP LL+    G ++    Q  V
Subjt:  SCPND-DFQSYQFYPGSPVSNLISPRSAISLSGASSPLPDLDFASSASQFSIFSLDVPPALLNLDRQGQSSDSCTQNSV

AT1G76660.1 FUNCTIONS IN: molecular_function unknown5.6e-3035.32Show/hide
Query:  QKRRWGSCWSIYWCFGSLKQRKRIGHAVLVPE-----PSPSPEAHQ----NSLQSPDTVLPFAAPPSSPVSFLQSEPPSVTQSPSNILSFTSLTANMYSP
        Q++RWG C  ++ CF S K  KRI  A  +PE      S    AHQ    N+  +    L   APPSSP SF  S  PS TQSP+    + SL AN  SP
Subjt:  QKRRWGSCWSIYWCFGSLKQRKRIGHAVLVPE-----PSPSPEAHQ----NSLQSPDTVLPFAAPPSSPVSFLQSEPPSVTQSPSNILSFTSLTANMYSP

Query:  DGP-SSIFAIGPFAHETQLVSPPLNFSTLTTEPSTPSFT-PPESIHLTTPSSPEVPFAQFLQPTLPKAESDDQYSCPNDDFQSYQFYPGSPVSNLISPRS
         GP SS++A GP+AHETQLVSPP+ FST TTEPST  FT PPE   LT PSSP+VP+A+FL  ++    S   +   ND   +Y  YPGSP S L SP S
Subjt:  DGP-SSIFAIGPFAHETQLVSPPLNFSTLTTEPSTPSFT-PPESIHLTTPSSPEVPFAQFLQPTLPKAESDDQYSCPNDDFQSYQFYPGSPVSNLISPRS

Query:  AIS------------------------LSGASSPLPDLDFASSASQFSIFSLDVPPAL-LNLDRQGQSSDSCTQNSVGF-KSNDDDLDLNPRTSDSMNES
          S                         +G S+PL + +F    + F+ F LD  P++  N  R   S DS    + G+   N +  + +P+      E+
Subjt:  AIS------------------------LSGASSPLPDLDFASSASQFSIFSLDVPPAL-LNLDRQGQSSDSCTQNSVGF-KSNDDDLDLNPRTSDSMNES

Query:  QNI-------QILIDGSQMEEPDVANHRF---SFELSDEDSLLRNIESKPLESNVAVASSPMHETFETAKETSSGGGHSSNGIEE
                  +I+     +E  DV +  F   ++  SD   LLR       E+N+   +SP  E    ++        SSN  ++
Subjt:  QNI-------QILIDGSQMEEPDVANHRF---SFELSDEDSLLRNIESKPLESNVAVASSPMHETFETAKETSSGGGHSSNGIEE

AT4G25620.1 hydroxyproline-rich glycoprotein family protein2.1e-4535.84Show/hide
Query:  NNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCWSIYWCFGSLKQRKRIGHAVLVPEPSPSPEA----HQNSLQSPDTVLPFAAPPSSPVSFLQSEPP
        N++  T+ AAA AI + + R  + ++VQK+R GS WS+YWCFGS K  KRIGHAVLVPEP+ S  A      +S  S    +PF APPSSP SFL S PP
Subjt:  NNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCWSIYWCFGSLKQRKRIGHAVLVPEPSPSPEA----HQNSLQSPDTVLPFAAPPSSPVSFLQSEPP

Query:  SVTQSPSNILSFTSLTANMYSPDGPSSIFAIGPFAHETQLVSPPLNFSTLTTEPSTPSFTPPESIHLTTPSSPEVPFAQFLQPTLPKAE------SDDQY
        S + +P   L   SLT N      P S F IGP+AHETQ V+PP+ FS  TTEPST  FTPP      +PSSPEVPFAQ L  +L +A        + ++
Subjt:  SVTQSPSNILSFTSLTANMYSPDGPSSIFAIGPFAHETQLVSPPLNFSTLTTEPSTPSFTPPESIHLTTPSSPEVPFAQFLQPTLPKAE------SDDQY

Query:  SCPNDDFQSYQFYPGSPVSNLISPRSAISLSGASSPLPDLDFASSASQFSIFSLDVPPALLNLD-----RQGQSSDSCTQNSVGFKSNDDDLDLNP---R
        S  + +F+S Q YPGSP  NLISP      SG SSP P             F +  PP  L  +     + G    S +    G  S      L P   +
Subjt:  SCPNDDFQSYQFYPGSPVSNLISPRSAISLSGASSPLPDLDFASSASQFSIFSLDVPPALLNLD-----RQGQSSDSCTQNSVGFKSNDDDLDLNP---R

Query:  TSDSMNESQNIQILIDGSQMEEPDVANHRFSFELSDEDSLLRNIESKPLESNVAVASSPMHETFETAKETSS---GGGHSSNGIEEKAA-----------
         +  +      + +I  S      +       ++S+  SL  +       ++ A+   P   +FE   E  +       + +G  EKA+           
Subjt:  TSDSMNESQNIQILIDGSQMEEPDVANHRFSFELSDEDSLLRNIESKPLESNVAVASSPMHETFETAKETSS---GGGHSSNGIEEKAA-----------

Query:  DGEEANQHQEHHHSTTLGSVNEFNFDNGNGSNALKPNINSDWWANAKDVETKG--TTPGAWSFFPM
         GE  ++  +   S + GS  EF FD+ N    +   I S+WWAN K V  KG  +   +W+FFP+
Subjt:  DGEEANQHQEHHHSTTLGSVNEFNFDNGNGSNALKPNINSDWWANAKDVETKG--TTPGAWSFFPM

AT5G52430.1 hydroxyproline-rich glycoprotein family protein1.7e-5838.88Show/hide
Query:  MNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCWSIYWCFGSLKQRKRIGHAVLVPEPSPS---PEAHQNSLQSPDTVLPFAAPPSSPVSFLQSEPP
        +NN+ +T+ AAA AI T + R  + ++ QK RWG CWS+Y CFG+ K  KRIG+AVLVPEP  S       QNS  S   VLPF APPSSP SFLQS+P 
Subjt:  MNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCWSIYWCFGSLKQRKRIGHAVLVPEPSPS---PEAHQNSLQSPDTVLPFAAPPSSPVSFLQSEPP

Query:  SVTQSPSNILSFTSLTANMYSPDGPSSIFAIGPFAHETQLVSPPLNFSTLTTEPSTPSFTPP--ESIHLTTPSSPEVPFAQFLQPTLPKAESD------D
        SV+ SP   L   SLT+N +SP  P S+F +GP+A+ETQ V+PP+ FS   TEPST  +TPP   S+H+TTPSSPEVPFAQ L  +L     D       
Subjt:  SVTQSPSNILSFTSLTANMYSPDGPSSIFAIGPFAHETQLVSPPLNFSTLTTEPSTPSFTPP--ESIHLTTPSSPEVPFAQFLQPTLPKAESD------D

Query:  QYSCPNDDFQSYQFYPGSP-VSNLISPRSAISLSGASSPLPDLDFASSASQFSIFSLDVPPALLNLD---------RQGQSSDSCTQNSVGFKSN-----
        ++S  + +F+S Q  PGSP   NLISP S IS SG SSP P        S    F +  PP  L  +         R G  S +   +  G  S      
Subjt:  QYSCPNDDFQSYQFYPGSP-VSNLISPRSAISLSGASSPLPDLDFASSASQFSIFSLDVPPALLNLD---------RQGQSSDSCTQNSVGFKSN-----

Query:  -----DDDLDLNPRTSDSMNESQNIQILIDGSQMEEPDVANHRFSFELSDEDSLLRNIESKPLESNVAVASSPMHETFETAKETSSGGGHSSNGIEEKAA
               +L  N  T    N+   +  L +     E  VA+HR SFEL+ ED + R + SK   S+  + ++   ET E      S        IE+++ 
Subjt:  -----DDDLDLNPRTSDSMNESQNIQILIDGSQMEEPDVANHRFSFELSDEDSLLRNIESKPLESNVAVASSPMHETFETAKETSSGGGHSSNGIEEKAA

Query:  DGEEANQHQEHHHSTTLGSVNEFNFDNGNGSNALKPNINSDWWANAKDVETKGTTPGAWSFFP
        D E      +   S+++GS  EF FD                  N KD   +     +WSFFP
Subjt:  DGEEANQHQEHHHSTTLGSVNEFNFDNGNGSNALKPNINSDWWANAKDVETKGTTPGAWSFFP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGGCGGCGTGCGGATGCGGATGCTGATGCTGATGCTGCTGATCTGAGGCCTATGAATAACACTTTTCAGACCATTACTGCGGCGGCCGATGCGATTGCGACCGTTGA
TCATCGTTTTCCTCGGGCTACTGCCGTCCAGAAAAGAAGATGGGGTAGCTGTTGGAGTATTTATTGGTGCTTTGGATCTCTCAAACAGAGGAAACGAATTGGGCATGCTG
TCCTTGTCCCAGAACCAAGTCCTTCGCCTGAGGCTCATCAAAATTCATTGCAATCCCCAGACACTGTGCTTCCTTTTGCTGCACCTCCCTCTTCCCCTGTATCCTTTCTT
CAATCAGAGCCACCTTCTGTGACACAATCACCTTCAAATATACTCTCCTTCACTTCTCTCACTGCTAACATGTATTCTCCTGATGGGCCTTCCTCGATTTTTGCCATTGG
CCCATTTGCTCATGAGACACAGCTTGTATCTCCACCTCTGAATTTTTCTACTCTCACCACTGAACCATCGACTCCTTCCTTCACTCCTCCTGAGTCTATCCACTTGACTA
CACCTTCTTCCCCTGAAGTTCCATTTGCTCAGTTTCTTCAACCTACCCTTCCGAAAGCTGAGTCTGATGACCAATATTCATGTCCTAATGATGACTTTCAATCTTATCAA
TTCTATCCTGGTAGCCCAGTTAGCAACCTCATATCGCCACGCTCTGCCATTTCTCTTTCTGGGGCATCTTCGCCTTTGCCAGATTTAGATTTTGCTTCCTCTGCTTCTCA
ATTTTCTATTTTCTCATTGGATGTTCCACCTGCGCTGTTGAACCTTGACAGACAAGGCCAAAGTTCTGATTCTTGCACTCAAAATTCTGTAGGATTCAAATCGAATGATG
ATGATCTTGATTTGAATCCTCGAACTTCAGACTCAATGAATGAATCCCAAAATATTCAAATTCTCATTGATGGAAGCCAAATGGAGGAACCTGATGTTGCTAATCATAGA
TTCTCATTTGAGTTATCTGATGAAGATTCTTTATTAAGAAACATAGAAAGTAAGCCACTGGAGTCAAATGTTGCAGTTGCATCATCTCCAATGCATGAAACATTTGAAAC
GGCTAAAGAAACTTCTTCTGGTGGTGGTCATAGCTCAAATGGTATAGAAGAAAAGGCAGCAGACGGTGAAGAAGCAAATCAGCATCAAGAACATCATCATTCTACTACTC
TTGGGTCTGTGAATGAATTTAATTTTGATAATGGCAATGGAAGTAATGCACTTAAGCCTAATATCAACTCAGACTGGTGGGCTAATGCGAAAGATGTAGAGACAAAAGGC
ACGACCCCGGGGGCCTGGTCATTCTTTCCAATGGCGCAGCAAAGATGA
mRNA sequenceShow/hide mRNA sequence
ATGAGGCGGCGTGCGGATGCGGATGCTGATGCTGATGCTGCTGATCTGAGGCCTATGAATAACACTTTTCAGACCATTACTGCGGCGGCCGATGCGATTGCGACCGTTGA
TCATCGTTTTCCTCGGGCTACTGCCGTCCAGAAAAGAAGATGGGGTAGCTGTTGGAGTATTTATTGGTGCTTTGGATCTCTCAAACAGAGGAAACGAATTGGGCATGCTG
TCCTTGTCCCAGAACCAAGTCCTTCGCCTGAGGCTCATCAAAATTCATTGCAATCCCCAGACACTGTGCTTCCTTTTGCTGCACCTCCCTCTTCCCCTGTATCCTTTCTT
CAATCAGAGCCACCTTCTGTGACACAATCACCTTCAAATATACTCTCCTTCACTTCTCTCACTGCTAACATGTATTCTCCTGATGGGCCTTCCTCGATTTTTGCCATTGG
CCCATTTGCTCATGAGACACAGCTTGTATCTCCACCTCTGAATTTTTCTACTCTCACCACTGAACCATCGACTCCTTCCTTCACTCCTCCTGAGTCTATCCACTTGACTA
CACCTTCTTCCCCTGAAGTTCCATTTGCTCAGTTTCTTCAACCTACCCTTCCGAAAGCTGAGTCTGATGACCAATATTCATGTCCTAATGATGACTTTCAATCTTATCAA
TTCTATCCTGGTAGCCCAGTTAGCAACCTCATATCGCCACGCTCTGCCATTTCTCTTTCTGGGGCATCTTCGCCTTTGCCAGATTTAGATTTTGCTTCCTCTGCTTCTCA
ATTTTCTATTTTCTCATTGGATGTTCCACCTGCGCTGTTGAACCTTGACAGACAAGGCCAAAGTTCTGATTCTTGCACTCAAAATTCTGTAGGATTCAAATCGAATGATG
ATGATCTTGATTTGAATCCTCGAACTTCAGACTCAATGAATGAATCCCAAAATATTCAAATTCTCATTGATGGAAGCCAAATGGAGGAACCTGATGTTGCTAATCATAGA
TTCTCATTTGAGTTATCTGATGAAGATTCTTTATTAAGAAACATAGAAAGTAAGCCACTGGAGTCAAATGTTGCAGTTGCATCATCTCCAATGCATGAAACATTTGAAAC
GGCTAAAGAAACTTCTTCTGGTGGTGGTCATAGCTCAAATGGTATAGAAGAAAAGGCAGCAGACGGTGAAGAAGCAAATCAGCATCAAGAACATCATCATTCTACTACTC
TTGGGTCTGTGAATGAATTTAATTTTGATAATGGCAATGGAAGTAATGCACTTAAGCCTAATATCAACTCAGACTGGTGGGCTAATGCGAAAGATGTAGAGACAAAAGGC
ACGACCCCGGGGGCCTGGTCATTCTTTCCAATGGCGCAGCAAAGATGA
Protein sequenceShow/hide protein sequence
MRRRADADADADAADLRPMNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCWSIYWCFGSLKQRKRIGHAVLVPEPSPSPEAHQNSLQSPDTVLPFAAPPSSPVSFL
QSEPPSVTQSPSNILSFTSLTANMYSPDGPSSIFAIGPFAHETQLVSPPLNFSTLTTEPSTPSFTPPESIHLTTPSSPEVPFAQFLQPTLPKAESDDQYSCPNDDFQSYQ
FYPGSPVSNLISPRSAISLSGASSPLPDLDFASSASQFSIFSLDVPPALLNLDRQGQSSDSCTQNSVGFKSNDDDLDLNPRTSDSMNESQNIQILIDGSQMEEPDVANHR
FSFELSDEDSLLRNIESKPLESNVAVASSPMHETFETAKETSSGGGHSSNGIEEKAADGEEANQHQEHHHSTTLGSVNEFNFDNGNGSNALKPNINSDWWANAKDVETKG
TTPGAWSFFPMAQQR