; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmoCh04G007840 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene IDCmoCh04G007840
OrganismCucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
DescriptionMucin-2
Genome locationCmo_Chr04:3929395..3931985
RNA-Seq ExpressionCmoCh04G007840
SyntenyCmoCh04G007840
Gene Ontology termsNA
InterPro domainsIPR040420 - Uncharacterized protein At1g76660-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6600562.1 hypothetical protein SDJN03_05795, partial [Cucurbita argyrosperma subsp. sororia]7.0e-24896.77Show/hide
Query:  MRAMRRRADADADADADADADAADLRPMNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCWSIYWCFGSLKQRKRIGHAVLVPEPSPSPEAHQNSLQ
        MRAMRRR      ADADADADAADLRPMNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCWSIYWCFGSLKQRKRIGHAVLVPEPSPSPEAHQNSLQ
Subjt:  MRAMRRRADADADADADADADAADLRPMNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCWSIYWCFGSLKQRKRIGHAVLVPEPSPSPEAHQNSLQ

Query:  SPDIVLPFAAPPSSPVSFLQSEPPSATQSPSNILSFTSLTANMYSPDGPSSIFAIGPFAHETQLVSPPLNFSTLTTEPSTPSFTPPESIHLTTPSSPEVP
        SPD VLPFAAPPSSPVSFLQSEPPS TQSPSNILSFTSLTANMYSPDGPSSIFAIGPFAHETQLVSPPLNFSTLTTEPSTPSFTPPESIHLTTPSSPEVP
Subjt:  SPDIVLPFAAPPSSPVSFLQSEPPSATQSPSNILSFTSLTANMYSPDGPSSIFAIGPFAHETQLVSPPLNFSTLTTEPSTPSFTPPESIHLTTPSSPEVP

Query:  FAQFLQPTLPKAESDDQYSCPNDDFQSYQFYPGSPVSNLISPRSAISLSGASSPWTDLDFASSASQFSNFSLDVPPALLNLDRQGQSSDSCTQNSVGFKS
        FAQFLQPTLPKAESDDQYSCPNDDFQSYQFYPGSPVSNLISPRSAISLSGASSP  DLDFASSASQFS FSLDVPPALLNLDRQGQSSDSCTQNSVGFKS
Subjt:  FAQFLQPTLPKAESDDQYSCPNDDFQSYQFYPGSPVSNLISPRSAISLSGASSPWTDLDFASSASQFSNFSLDVPPALLNLDRQGQSSDSCTQNSVGFKS

Query:  NDDDFDLDPRTSDSMNESQNIQILIDGSQMEEPDVANHRFSFELSDEDSLLRNIESKPLESNVAVASSPMHETFETAKETSSGGGHSSNGIEEKAADGEE
        NDDD DL+PRTSDSMNESQNIQILIDGSQMEEPDVANHRFSFELSDEDSLLRNIESKPLESNVAVASSPMHETFETAKETSSGGGHSSNGIEEKAADGEE
Subjt:  NDDDFDLDPRTSDSMNESQNIQILIDGSQMEEPDVANHRFSFELSDEDSLLRNIESKPLESNVAVASSPMHETFETAKETSSGGGHSSNGIEEKAADGEE

Query:  ANQHQEHHHSTTLGSVNEFNFDNGNGSNALKPNIHSDWWANAKDVETKGTTTGAWSFFPMAQQR
        ANQHQEHHHSTTLGSVNEFNFDNGNGSNALKPNI+SDWWANAKDVETKGTT GAWSFFPMAQQR
Subjt:  ANQHQEHHHSTTLGSVNEFNFDNGNGSNALKPNIHSDWWANAKDVETKGTTTGAWSFFPMAQQR

XP_022941648.1 uncharacterized protein At1g76660-like isoform X1 [Cucurbita moschata]1.0e-259100Show/hide
Query:  MRAMRRRADADADADADADADAADLRPMNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCWSIYWCFGSLKQRKRIGHAVLVPEPSPSPEAHQNSLQ
        MRAMRRRADADADADADADADAADLRPMNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCWSIYWCFGSLKQRKRIGHAVLVPEPSPSPEAHQNSLQ
Subjt:  MRAMRRRADADADADADADADAADLRPMNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCWSIYWCFGSLKQRKRIGHAVLVPEPSPSPEAHQNSLQ

Query:  SPDIVLPFAAPPSSPVSFLQSEPPSATQSPSNILSFTSLTANMYSPDGPSSIFAIGPFAHETQLVSPPLNFSTLTTEPSTPSFTPPESIHLTTPSSPEVP
        SPDIVLPFAAPPSSPVSFLQSEPPSATQSPSNILSFTSLTANMYSPDGPSSIFAIGPFAHETQLVSPPLNFSTLTTEPSTPSFTPPESIHLTTPSSPEVP
Subjt:  SPDIVLPFAAPPSSPVSFLQSEPPSATQSPSNILSFTSLTANMYSPDGPSSIFAIGPFAHETQLVSPPLNFSTLTTEPSTPSFTPPESIHLTTPSSPEVP

Query:  FAQFLQPTLPKAESDDQYSCPNDDFQSYQFYPGSPVSNLISPRSAISLSGASSPWTDLDFASSASQFSNFSLDVPPALLNLDRQGQSSDSCTQNSVGFKS
        FAQFLQPTLPKAESDDQYSCPNDDFQSYQFYPGSPVSNLISPRSAISLSGASSPWTDLDFASSASQFSNFSLDVPPALLNLDRQGQSSDSCTQNSVGFKS
Subjt:  FAQFLQPTLPKAESDDQYSCPNDDFQSYQFYPGSPVSNLISPRSAISLSGASSPWTDLDFASSASQFSNFSLDVPPALLNLDRQGQSSDSCTQNSVGFKS

Query:  NDDDFDLDPRTSDSMNESQNIQILIDGSQMEEPDVANHRFSFELSDEDSLLRNIESKPLESNVAVASSPMHETFETAKETSSGGGHSSNGIEEKAADGEE
        NDDDFDLDPRTSDSMNESQNIQILIDGSQMEEPDVANHRFSFELSDEDSLLRNIESKPLESNVAVASSPMHETFETAKETSSGGGHSSNGIEEKAADGEE
Subjt:  NDDDFDLDPRTSDSMNESQNIQILIDGSQMEEPDVANHRFSFELSDEDSLLRNIESKPLESNVAVASSPMHETFETAKETSSGGGHSSNGIEEKAADGEE

Query:  ANQHQEHHHSTTLGSVNEFNFDNGNGSNALKPNIHSDWWANAKDVETKGTTTGAWSFFPMAQQR
        ANQHQEHHHSTTLGSVNEFNFDNGNGSNALKPNIHSDWWANAKDVETKGTTTGAWSFFPMAQQR
Subjt:  ANQHQEHHHSTTLGSVNEFNFDNGNGSNALKPNIHSDWWANAKDVETKGTTTGAWSFFPMAQQR

XP_022941649.1 uncharacterized protein At1g76660-like isoform X2 [Cucurbita moschata]2.1e-25297.84Show/hide
Query:  MRAMRRRADADADADADADADAADLRPMNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCWSIYWCFGSLKQRKRIGHAVLVPEPSPSPEAHQNSLQ
        MRAMRRR          ADADAADLRPMNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCWSIYWCFGSLKQRKRIGHAVLVPEPSPSPEAHQNSLQ
Subjt:  MRAMRRRADADADADADADADAADLRPMNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCWSIYWCFGSLKQRKRIGHAVLVPEPSPSPEAHQNSLQ

Query:  SPDIVLPFAAPPSSPVSFLQSEPPSATQSPSNILSFTSLTANMYSPDGPSSIFAIGPFAHETQLVSPPLNFSTLTTEPSTPSFTPPESIHLTTPSSPEVP
        SPDIVLPFAAPPSSPVSFLQSEPPSATQSPSNILSFTSLTANMYSPDGPSSIFAIGPFAHETQLVSPPLNFSTLTTEPSTPSFTPPESIHLTTPSSPEVP
Subjt:  SPDIVLPFAAPPSSPVSFLQSEPPSATQSPSNILSFTSLTANMYSPDGPSSIFAIGPFAHETQLVSPPLNFSTLTTEPSTPSFTPPESIHLTTPSSPEVP

Query:  FAQFLQPTLPKAESDDQYSCPNDDFQSYQFYPGSPVSNLISPRSAISLSGASSPWTDLDFASSASQFSNFSLDVPPALLNLDRQGQSSDSCTQNSVGFKS
        FAQFLQPTLPKAESDDQYSCPNDDFQSYQFYPGSPVSNLISPRSAISLSGASSPWTDLDFASSASQFSNFSLDVPPALLNLDRQGQSSDSCTQNSVGFKS
Subjt:  FAQFLQPTLPKAESDDQYSCPNDDFQSYQFYPGSPVSNLISPRSAISLSGASSPWTDLDFASSASQFSNFSLDVPPALLNLDRQGQSSDSCTQNSVGFKS

Query:  NDDDFDLDPRTSDSMNESQNIQILIDGSQMEEPDVANHRFSFELSDEDSLLRNIESKPLESNVAVASSPMHETFETAKETSSGGGHSSNGIEEKAADGEE
        NDDDFDLDPRTSDSMNESQNIQILIDGSQMEEPDVANHRFSFELSDEDSLLRNIESKPLESNVAVASSPMHETFETAKETSSGGGHSSNGIEEKAADGEE
Subjt:  NDDDFDLDPRTSDSMNESQNIQILIDGSQMEEPDVANHRFSFELSDEDSLLRNIESKPLESNVAVASSPMHETFETAKETSSGGGHSSNGIEEKAADGEE

Query:  ANQHQEHHHSTTLGSVNEFNFDNGNGSNALKPNIHSDWWANAKDVETKGTTTGAWSFFPMAQQR
        ANQHQEHHHSTTLGSVNEFNFDNGNGSNALKPNIHSDWWANAKDVETKGTTTGAWSFFPMAQQR
Subjt:  ANQHQEHHHSTTLGSVNEFNFDNGNGSNALKPNIHSDWWANAKDVETKGTTTGAWSFFPMAQQR

XP_023522163.1 uncharacterized protein At1g76660-like [Cucurbita pepo subsp. pepo]2.7e-24796.34Show/hide
Query:  MRAMRRRADADADADADADADAADLRPMNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCWSIYWCFGSLKQRKRIGHAVLVPEPSPSPEAHQNSLQ
        MRAMRRR          ADADAADLRPMNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCWSIYWCFGSLKQRKRIGHAVLVPEPSPSPEAHQNSLQ
Subjt:  MRAMRRRADADADADADADADAADLRPMNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCWSIYWCFGSLKQRKRIGHAVLVPEPSPSPEAHQNSLQ

Query:  SPDIVLPFAAPPSSPVSFLQSEPPSATQSPSNILSFTSLTANMYSPDGPSSIFAIGPFAHETQLVSPPLNFSTLTTEPSTPSFTPPESIHLTTPSSPEVP
        SPDIVLPFAAPPSSPVSFLQSEPPSATQSPSNILSFTSLTANMYSPDGPSSIFAIGPFAHETQLVSPPLNFSTLTTEPSTPSFTPPESIHLTTPSSPEVP
Subjt:  SPDIVLPFAAPPSSPVSFLQSEPPSATQSPSNILSFTSLTANMYSPDGPSSIFAIGPFAHETQLVSPPLNFSTLTTEPSTPSFTPPESIHLTTPSSPEVP

Query:  FAQFLQPTLPKAESDDQYSCPNDDFQSYQFYPGSPVSNLISPRSAISLSGASSPWTDLDFASSASQFSNFSLDVPPALLNLDRQGQSSDSCTQNSVGFKS
        FAQFLQPTL KAESDDQYSCPNDDFQSYQFYPGSPVSNLISPRSAISLSGASSP  DLDFASSASQFSNFSLDVPPALLNLDRQGQSSDSCTQNSVGFKS
Subjt:  FAQFLQPTLPKAESDDQYSCPNDDFQSYQFYPGSPVSNLISPRSAISLSGASSPWTDLDFASSASQFSNFSLDVPPALLNLDRQGQSSDSCTQNSVGFKS

Query:  NDDDFDLDPRTSDSMNESQNIQILIDGSQMEEPDVANHRFSFELSDEDSLLRNIESKPLESNVAVASSPMHETFETAKETSSGGGHSSNGIEEKAADGEE
        NDDDFDL+PRTSDSMNESQNIQILIDGSQMEEPDV NHRFSFELSDEDSLLRN+ESKPLESNVAVASSPMHETFETAKETSSGGGHSSNGIEEKAADGEE
Subjt:  NDDDFDLDPRTSDSMNESQNIQILIDGSQMEEPDVANHRFSFELSDEDSLLRNIESKPLESNVAVASSPMHETFETAKETSSGGGHSSNGIEEKAADGEE

Query:  ANQHQEHHHSTTLGSVNEFNFDNGNGSNALKPNIHSDWWANAKDVETKGTTTGAWSFFPMAQQR
        ANQHQEHHHSTTLGSVNEFNFDNGNGSNALKPNI+SDWWANAKDVETKGTTTGAWSFFPMAQQR
Subjt:  ANQHQEHHHSTTLGSVNEFNFDNGNGSNALKPNIHSDWWANAKDVETKGTTTGAWSFFPMAQQR

XP_023529207.1 uncharacterized protein At1g76660-like isoform X1 [Cucurbita pepo subsp. pepo]1.4e-24896.77Show/hide
Query:  MRAMRRRADADADADADADADAADLRPMNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCWSIYWCFGSLKQRKRIGHAVLVPEPSPSPEAHQNSLQ
        MRAMRRR        ADADADAADLRPMNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCWSIYWCFGSLKQRKRIGHAVLVPEPSPSPEAHQNSLQ
Subjt:  MRAMRRRADADADADADADADAADLRPMNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCWSIYWCFGSLKQRKRIGHAVLVPEPSPSPEAHQNSLQ

Query:  SPDIVLPFAAPPSSPVSFLQSEPPSATQSPSNILSFTSLTANMYSPDGPSSIFAIGPFAHETQLVSPPLNFSTLTTEPSTPSFTPPESIHLTTPSSPEVP
        SPDIVLPFAAPPSSPVSFLQSEPPSATQSPSNILSFTSLTANMYSPDGPSSIFAIGPFAHETQLVSPPLNFSTLTTEPSTPSFTPPESIHLTTPSSPEVP
Subjt:  SPDIVLPFAAPPSSPVSFLQSEPPSATQSPSNILSFTSLTANMYSPDGPSSIFAIGPFAHETQLVSPPLNFSTLTTEPSTPSFTPPESIHLTTPSSPEVP

Query:  FAQFLQPTLPKAESDDQYSCPNDDFQSYQFYPGSPVSNLISPRSAISLSGASSPWTDLDFASSASQFSNFSLDVPPALLNLDRQGQSSDSCTQNSVGFKS
        FAQFLQPTL KAESDDQYSCPNDDFQSYQFYPGSPVSNLISPRSAISLSGASSP  DLDFASSASQFSNFSLDVPPALLNLDRQGQSSDSCTQNSVGFKS
Subjt:  FAQFLQPTLPKAESDDQYSCPNDDFQSYQFYPGSPVSNLISPRSAISLSGASSPWTDLDFASSASQFSNFSLDVPPALLNLDRQGQSSDSCTQNSVGFKS

Query:  NDDDFDLDPRTSDSMNESQNIQILIDGSQMEEPDVANHRFSFELSDEDSLLRNIESKPLESNVAVASSPMHETFETAKETSSGGGHSSNGIEEKAADGEE
        NDDDFDL+PRTSDSMNESQNIQILIDGSQMEEPDV NHRFSFELSDEDSLLRN+ESKPLESNVAVASSPMHETFETAKETSSGGGHSSNGIEEKAADGEE
Subjt:  NDDDFDLDPRTSDSMNESQNIQILIDGSQMEEPDVANHRFSFELSDEDSLLRNIESKPLESNVAVASSPMHETFETAKETSSGGGHSSNGIEEKAADGEE

Query:  ANQHQEHHHSTTLGSVNEFNFDNGNGSNALKPNIHSDWWANAKDVETKGTTTGAWSFFPMAQQR
        ANQHQEHHHSTTLGSVNEFNFDNGNGSNALKPNI+SDWWANAKDVETKGTTTGAWSFFPMAQQR
Subjt:  ANQHQEHHHSTTLGSVNEFNFDNGNGSNALKPNIHSDWWANAKDVETKGTTTGAWSFFPMAQQR

TrEMBL top hitse value%identityAlignment
A0A5D3CYQ2 Mucin-25.1e-18073.85Show/hide
Query:  MRRRADADADADADADADAADLRPMNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCWSIYWCFGSLKQRKRIGHAVLVPEPSPSPEAHQNSLQSPD
        MRRR D D            D RP+NNTFQTITAAADAIATVDHRFPRATAVQKRRWGSC SIYWCFGSLKQRKRIGHAVLVPEPSPS E H+N+LQSPD
Subjt:  MRRRADADADADADADADAADLRPMNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCWSIYWCFGSLKQRKRIGHAVLVPEPSPSPEAHQNSLQSPD

Query:  IVLPFAAPPSSPVSFLQSEPPSATQSPSNILSFTSLTANMYSPDGPSSIFAIGPFAHETQLVSPPLNFSTLTTEPSTPSFTPPESIHLTTPSSPEVPFAQ
        IVLPFAAPPSSPVS LQSEPPSA QSP+ ++SFTSLTANMYSPDGPSSIFAIGPFAHE QLVSPPLNFSTLTTEPSTP FTPPESIHLTTPSSPEVPFAQ
Subjt:  IVLPFAAPPSSPVSFLQSEPPSATQSPSNILSFTSLTANMYSPDGPSSIFAIGPFAHETQLVSPPLNFSTLTTEPSTPSFTPPESIHLTTPSSPEVPFAQ

Query:  FLQPTLPKAESDDQYSCPNDDFQSYQFYPGSPVSNLISPRSAISLSGASSPWTDLDFASSASQFSNFSLDVPPALLNLD-------RQGQSSDSCTQNSV
        F+ P+L K ESD+QY+ PNDDFQSYQFYPGSPVS+LISPRS IS SGASSP  D DFAS  SQF NF L+VPP L NLD       RQ QS+DSCTQ+S+
Subjt:  FLQPTLPKAESDDQYSCPNDDFQSYQFYPGSPVSNLISPRSAISLSGASSPWTDLDFASSASQFSNFSLDVPPALLNLD-------RQGQSSDSCTQNSV

Query:  GFKSNDDDFDLDPRTSDSM------NESQNIQILID--GSQMEEPDVANHRFSFELSDEDSLLRNIESKPLESN-VAVASSPMHETFETAKETSSGGGHS
         FKS+ +DF L+P TS+SM      NESQNIQILID    + EEP   NHRFSFELSD D L +++ SKPLESN + V SSP+HE FET KE S  G H+
Subjt:  GFKSNDDDFDLDPRTSDSM------NESQNIQILID--GSQMEEPDVANHRFSFELSDEDSLLRNIESKPLESN-VAVASSPMHETFETAKETSSGGGHS

Query:  SNGIEEKA-ADGEEANQHQEHHHSTTLGSVNEFNFDNGNGSNALKPNIHSDWWANAKDVETKGTTTGAWSFFPMAQQR
        SN IEEK  ADG+EA+QHQE HHS  LGSV EFNFDN NGS+   P I+SDWW NAKD  T+GTTTGAWSFFP  QQR
Subjt:  SNGIEEKA-ADGEEANQHQEHHHSTTLGSVNEFNFDNGNGSNALKPNIHSDWWANAKDVETKGTTTGAWSFFPMAQQR

A0A6J1C828 uncharacterized protein At1g76660-like3.3e-18775.89Show/hide
Query:  MRRRADADADADADADADAADLRPMNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCWSIYWCFGSLKQRKRIGHAVLVPEPSPSPEAHQNSLQSPD
        MRRR DADAD         ADL P+NNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCWSIYWCFGSLKQRKRIGHAVLVPEPSPS E  +N+LQSPD
Subjt:  MRRRADADADADADADADAADLRPMNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCWSIYWCFGSLKQRKRIGHAVLVPEPSPSPEAHQNSLQSPD

Query:  IVLPFAAPPSSPVSFLQSEPPSATQSPSNILSFTSLTANMYSPDGPSSIFAIGPFAHETQLVSPPLNFSTLTTEPSTPSFTPPESIHLTTPSSPEVPFAQ
        IVLPFAAPPSSPVSFLQSEPPSATQSP+ ILSFTSLTANMYSPDGPSSIFA+GPFAHETQLVSPPLNFST+TT+PST  FTPPESIHLTTPSSPEVPFAQ
Subjt:  IVLPFAAPPSSPVSFLQSEPPSATQSPSNILSFTSLTANMYSPDGPSSIFAIGPFAHETQLVSPPLNFSTLTTEPSTPSFTPPESIHLTTPSSPEVPFAQ

Query:  FLQPTLPKAESDDQY-SCPNDDFQSYQFYPGSPVSNLISPRSAISLSGASSPWTDLDFASSASQFSNFSLDVPPALLNLD-------RQGQSSDSCTQNS
        +LQP+  K ESD QY   PNDDFQSYQFYPGSPVS+LISPRS IS SGASSP  D DF  S S FSNF ++VPP LLNLD       R  QSSDSCTQNS
Subjt:  FLQPTLPKAESDDQY-SCPNDDFQSYQFYPGSPVSNLISPRSAISLSGASSPWTDLDFASSASQFSNFSLDVPPALLNLD-------RQGQSSDSCTQNS

Query:  VGFKSNDDDFDLDPRTSDSM------NESQNIQILIDGSQMEEPDVANHRFSFELSDEDSLLRNIESKPLESN-VAVASSPMHETFETAKETSSGGGHSS
        VG+KS+ +DF L+P+TS+S+      NE  NIQIL DGSQ +E   ANHRFSFELSDED+LL+++E+KPLESN +AVASSP+HE  ETAKETS  GGH+S
Subjt:  VGFKSNDDDFDLDPRTSDSM------NESQNIQILIDGSQMEEPDVANHRFSFELSDEDSLLRNIESKPLESN-VAVASSPMHETFETAKETSSGGGHSS

Query:  NGIEE-KAADGEEANQHQE-HHHSTTLGSVNEFNFDNGNGSNALKPNIHSDWWANAKDVETKGTTTGAWSFFPMAQQ
        N  EE + ADGEE + HQE  HHS TLG+V EFNFDNGNG + LKPNI+S WWAN KD ET+GTTTGAWSFFP+ QQ
Subjt:  NGIEE-KAADGEEANQHQE-HHHSTTLGSVNEFNFDNGNGSNALKPNIHSDWWANAKDVETKGTTTGAWSFFPMAQQ

A0A6J1FP20 uncharacterized protein At1g76660-like isoform X21.0e-25297.84Show/hide
Query:  MRAMRRRADADADADADADADAADLRPMNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCWSIYWCFGSLKQRKRIGHAVLVPEPSPSPEAHQNSLQ
        MRAMRRR          ADADAADLRPMNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCWSIYWCFGSLKQRKRIGHAVLVPEPSPSPEAHQNSLQ
Subjt:  MRAMRRRADADADADADADADAADLRPMNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCWSIYWCFGSLKQRKRIGHAVLVPEPSPSPEAHQNSLQ

Query:  SPDIVLPFAAPPSSPVSFLQSEPPSATQSPSNILSFTSLTANMYSPDGPSSIFAIGPFAHETQLVSPPLNFSTLTTEPSTPSFTPPESIHLTTPSSPEVP
        SPDIVLPFAAPPSSPVSFLQSEPPSATQSPSNILSFTSLTANMYSPDGPSSIFAIGPFAHETQLVSPPLNFSTLTTEPSTPSFTPPESIHLTTPSSPEVP
Subjt:  SPDIVLPFAAPPSSPVSFLQSEPPSATQSPSNILSFTSLTANMYSPDGPSSIFAIGPFAHETQLVSPPLNFSTLTTEPSTPSFTPPESIHLTTPSSPEVP

Query:  FAQFLQPTLPKAESDDQYSCPNDDFQSYQFYPGSPVSNLISPRSAISLSGASSPWTDLDFASSASQFSNFSLDVPPALLNLDRQGQSSDSCTQNSVGFKS
        FAQFLQPTLPKAESDDQYSCPNDDFQSYQFYPGSPVSNLISPRSAISLSGASSPWTDLDFASSASQFSNFSLDVPPALLNLDRQGQSSDSCTQNSVGFKS
Subjt:  FAQFLQPTLPKAESDDQYSCPNDDFQSYQFYPGSPVSNLISPRSAISLSGASSPWTDLDFASSASQFSNFSLDVPPALLNLDRQGQSSDSCTQNSVGFKS

Query:  NDDDFDLDPRTSDSMNESQNIQILIDGSQMEEPDVANHRFSFELSDEDSLLRNIESKPLESNVAVASSPMHETFETAKETSSGGGHSSNGIEEKAADGEE
        NDDDFDLDPRTSDSMNESQNIQILIDGSQMEEPDVANHRFSFELSDEDSLLRNIESKPLESNVAVASSPMHETFETAKETSSGGGHSSNGIEEKAADGEE
Subjt:  NDDDFDLDPRTSDSMNESQNIQILIDGSQMEEPDVANHRFSFELSDEDSLLRNIESKPLESNVAVASSPMHETFETAKETSSGGGHSSNGIEEKAADGEE

Query:  ANQHQEHHHSTTLGSVNEFNFDNGNGSNALKPNIHSDWWANAKDVETKGTTTGAWSFFPMAQQR
        ANQHQEHHHSTTLGSVNEFNFDNGNGSNALKPNIHSDWWANAKDVETKGTTTGAWSFFPMAQQR
Subjt:  ANQHQEHHHSTTLGSVNEFNFDNGNGSNALKPNIHSDWWANAKDVETKGTTTGAWSFFPMAQQR

A0A6J1FSP7 uncharacterized protein At1g76660-like isoform X15.0e-260100Show/hide
Query:  MRAMRRRADADADADADADADAADLRPMNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCWSIYWCFGSLKQRKRIGHAVLVPEPSPSPEAHQNSLQ
        MRAMRRRADADADADADADADAADLRPMNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCWSIYWCFGSLKQRKRIGHAVLVPEPSPSPEAHQNSLQ
Subjt:  MRAMRRRADADADADADADADAADLRPMNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCWSIYWCFGSLKQRKRIGHAVLVPEPSPSPEAHQNSLQ

Query:  SPDIVLPFAAPPSSPVSFLQSEPPSATQSPSNILSFTSLTANMYSPDGPSSIFAIGPFAHETQLVSPPLNFSTLTTEPSTPSFTPPESIHLTTPSSPEVP
        SPDIVLPFAAPPSSPVSFLQSEPPSATQSPSNILSFTSLTANMYSPDGPSSIFAIGPFAHETQLVSPPLNFSTLTTEPSTPSFTPPESIHLTTPSSPEVP
Subjt:  SPDIVLPFAAPPSSPVSFLQSEPPSATQSPSNILSFTSLTANMYSPDGPSSIFAIGPFAHETQLVSPPLNFSTLTTEPSTPSFTPPESIHLTTPSSPEVP

Query:  FAQFLQPTLPKAESDDQYSCPNDDFQSYQFYPGSPVSNLISPRSAISLSGASSPWTDLDFASSASQFSNFSLDVPPALLNLDRQGQSSDSCTQNSVGFKS
        FAQFLQPTLPKAESDDQYSCPNDDFQSYQFYPGSPVSNLISPRSAISLSGASSPWTDLDFASSASQFSNFSLDVPPALLNLDRQGQSSDSCTQNSVGFKS
Subjt:  FAQFLQPTLPKAESDDQYSCPNDDFQSYQFYPGSPVSNLISPRSAISLSGASSPWTDLDFASSASQFSNFSLDVPPALLNLDRQGQSSDSCTQNSVGFKS

Query:  NDDDFDLDPRTSDSMNESQNIQILIDGSQMEEPDVANHRFSFELSDEDSLLRNIESKPLESNVAVASSPMHETFETAKETSSGGGHSSNGIEEKAADGEE
        NDDDFDLDPRTSDSMNESQNIQILIDGSQMEEPDVANHRFSFELSDEDSLLRNIESKPLESNVAVASSPMHETFETAKETSSGGGHSSNGIEEKAADGEE
Subjt:  NDDDFDLDPRTSDSMNESQNIQILIDGSQMEEPDVANHRFSFELSDEDSLLRNIESKPLESNVAVASSPMHETFETAKETSSGGGHSSNGIEEKAADGEE

Query:  ANQHQEHHHSTTLGSVNEFNFDNGNGSNALKPNIHSDWWANAKDVETKGTTTGAWSFFPMAQQR
        ANQHQEHHHSTTLGSVNEFNFDNGNGSNALKPNIHSDWWANAKDVETKGTTTGAWSFFPMAQQR
Subjt:  ANQHQEHHHSTTLGSVNEFNFDNGNGSNALKPNIHSDWWANAKDVETKGTTTGAWSFFPMAQQR

A0A6J1IUL0 uncharacterized protein At1g76660-like1.6e-24595.04Show/hide
Query:  MRAMRRRADADADADADADADAADLRPMNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCWSIYWCFGSLKQRKRIGHAVLVPEPSPSPEAHQNSLQ
        MR MRRRADADADAD       ADLRPMNNTFQTIT AAD IATVDHRFPR TAVQKRRWGSCWSIYWCFGSL+QRKRIGHAVLVPEPSPSPEAHQNSLQ
Subjt:  MRAMRRRADADADADADADADAADLRPMNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCWSIYWCFGSLKQRKRIGHAVLVPEPSPSPEAHQNSLQ

Query:  SPDIVLPFAAPPSSPVSFLQSEPPSATQSPSNILSFTSLTANMYSPDGPSSIFAIGPFAHETQLVSPPLNFSTLTTEPSTPSFTPPESIHLTTPSSPEVP
        SPDIVLPFAAPPSSP SFLQSEPPSATQSPSNILSFTSLTANMYSPDGPSSIFAIGPFAHETQLVSPP+NFSTLTTEPSTPSFTPPESIHLTTPSSPEVP
Subjt:  SPDIVLPFAAPPSSPVSFLQSEPPSATQSPSNILSFTSLTANMYSPDGPSSIFAIGPFAHETQLVSPPLNFSTLTTEPSTPSFTPPESIHLTTPSSPEVP

Query:  FAQFLQPTLPKAESDDQYSCPNDDFQSYQFYPGSPVSNLISPRSAISLSGASSPWTDLDFASSASQFSNFSLDVPPALLNLDRQGQSSDSCTQNSVGFKS
        FAQFLQP LPKAESDDQYSCPNDDFQSYQFYPGSPVSNLISPRSAISLSGASSP  DLDFASSASQFSNFSLDVPPALLNLDRQGQSSDSCTQNSVGFKS
Subjt:  FAQFLQPTLPKAESDDQYSCPNDDFQSYQFYPGSPVSNLISPRSAISLSGASSPWTDLDFASSASQFSNFSLDVPPALLNLDRQGQSSDSCTQNSVGFKS

Query:  NDDDFDLDPRTSDSMNESQNIQILIDGSQMEEPDVANHRFSFELSDEDSLLRNIESKPLESNVAVASSPMHETFETAKETSSGGGHSSNGIEEKAADGEE
        NDDDFDL+PRTSDSMNESQNIQILIDGSQMEEPDV NHRFSFELSDEDSLLRN+ESKPLESNVAVASSPMHETFETAKETSSGGGHSSN IEEKAADGEE
Subjt:  NDDDFDLDPRTSDSMNESQNIQILIDGSQMEEPDVANHRFSFELSDEDSLLRNIESKPLESNVAVASSPMHETFETAKETSSGGGHSSNGIEEKAADGEE

Query:  ANQHQEHHHSTTLGSVNEFNFDNGNGSNALKPNIHSDWWANAKDVETKGTTTGAWSFFPMAQQR
        ANQHQEHHHSTTLGSVNEFNFDNGNGSNALKPNI+SDWWANAKD ETKGTTTGAWSFFPMAQQR
Subjt:  ANQHQEHHHSTTLGSVNEFNFDNGNGSNALKPNIHSDWWANAKDVETKGTTTGAWSFFPMAQQR

SwissProt top hitse value%identityAlignment
Q9SRE5 Uncharacterized protein At1g766604.5e-3240.51Show/hide
Query:  QKRRWGSCWSIYWCFGSLKQRKRIGHAVLVPE-----PSPSPEAHQ----NSLQSPDIVLPFAAPPSSPVSFLQSEPPSATQSPSNILSFTSLTANMYSP
        Q++RWG C  ++ CF S K  KRI  A  +PE      S    AHQ    N+  +  I L   APPSSP SF  S  PS TQSP+    + SL AN  SP
Subjt:  QKRRWGSCWSIYWCFGSLKQRKRIGHAVLVPE-----PSPSPEAHQ----NSLQSPDIVLPFAAPPSSPVSFLQSEPPSATQSPSNILSFTSLTANMYSP

Query:  DGP-SSIFAIGPFAHETQLVSPPLNFSTLTTEPSTPSFT-PPESIHLTTPSSPEVPFAQFLQPTLPKAESDDQYSCPNDDFQSYQFYPGSPVSNLISPRS
         GP SS++A GP+AHETQLVSPP+ FST TTEPST  FT PPE   LT PSSP+VP+A+FL  ++    S   +   ND   +Y  YPGSP S L SP S
Subjt:  DGP-SSIFAIGPFAHETQLVSPPLNFSTLTTEPSTPSFT-PPESIHLTTPSSPEVPFAQFLQPTLPKAESDDQYSCPNDDFQSYQFYPGSPVSNLISPRS

Query:  AISLSGASSPWT--------------DLDFASSASQFSNFSLDVPPALLNLDRQGQSSDSCTQNSVGFKSNDDDFDLDPRTSDSMNESQNIQILIDGSQM
          S  G  SP                D +  S+  Q SNF      A   LD      D     + G  S   D D+ P T+   N +QN Q       M
Subjt:  AISLSGASSPWT--------------DLDFASSASQFSNFSLDVPPALLNLDRQGQSSDSCTQNSVGFKSNDDDFDLDPRTSDSMNESQNIQILIDGSQM

Query:  EEPDVANHRFSFELSD
        EE +     F F   +
Subjt:  EEPDVANHRFSFELSD

Arabidopsis top hitse value%identityAlignment
AT1G63720.1 BEST Arabidopsis thaliana protein match is: hydroxyproline-rich glycoprotein family protein (TAIR:AT5G52430.1)4.9e-5047.08Show/hide
Query:  NNTFQTITAAADAIATVDHRFPRATAV-QKRRWGSCWSIYWCFGSLKQRKRIGHAVLVPEP----SPSPEAHQNSLQSPDIVLPFAAPPSSPVSFLQSEP
        NN F TI AAA AIA+ D R  +++ + +KR+W + WS+  CFGS +QRKRIG++VLVPEP    S +     +  +S    LPF APPSSP SF QSEP
Subjt:  NNTFQTITAAADAIATVDHRFPRATAV-QKRRWGSCWSIYWCFGSLKQRKRIGHAVLVPEP----SPSPEAHQNSLQSPDIVLPFAAPPSSPVSFLQSEP

Query:  PSATQSPSNILSFTSLTANMYSPDGPSSIFAIGPFAHETQLVSPPLNFSTLTTEPSTPSFTPP---ESIHL--TTPSSPEVPFAQFLQPTLPKAESDDQY
        PSATQSP  ILSF+ L  N        SIFAIGP+AHETQLVSPP+ FST TTEPS+   TPP    SI+L  TTPSSPEVPFAQ             ++
Subjt:  PSATQSPSNILSFTSLTANMYSPDGPSSIFAIGPFAHETQLVSPPLNFSTLTTEPSTPSFTPP---ESIHL--TTPSSPEVPFAQFLQPTLPKAESDDQY

Query:  SCPND-DFQSYQFYPGSPVSNLISPRSAISLSGASSPWTDLDFASSASQFSNFSLDVPPALLNLDRQGQSSDSCTQNSVGFKSNDDDFDLD
           +  +FQ YQ  PGSP+  LISP      SG +SP+ D       S F +F +  PP LL+    G ++  C +  +        FDLD
Subjt:  SCPND-DFQSYQFYPGSPVSNLISPRSAISLSGASSPWTDLDFASSASQFSNFSLDVPPALLNLDRQGQSSDSCTQNSVGFKSNDDDFDLD

AT1G76660.1 FUNCTIONS IN: molecular_function unknown3.2e-3340.51Show/hide
Query:  QKRRWGSCWSIYWCFGSLKQRKRIGHAVLVPE-----PSPSPEAHQ----NSLQSPDIVLPFAAPPSSPVSFLQSEPPSATQSPSNILSFTSLTANMYSP
        Q++RWG C  ++ CF S K  KRI  A  +PE      S    AHQ    N+  +  I L   APPSSP SF  S  PS TQSP+    + SL AN  SP
Subjt:  QKRRWGSCWSIYWCFGSLKQRKRIGHAVLVPE-----PSPSPEAHQ----NSLQSPDIVLPFAAPPSSPVSFLQSEPPSATQSPSNILSFTSLTANMYSP

Query:  DGP-SSIFAIGPFAHETQLVSPPLNFSTLTTEPSTPSFT-PPESIHLTTPSSPEVPFAQFLQPTLPKAESDDQYSCPNDDFQSYQFYPGSPVSNLISPRS
         GP SS++A GP+AHETQLVSPP+ FST TTEPST  FT PPE   LT PSSP+VP+A+FL  ++    S   +   ND   +Y  YPGSP S L SP S
Subjt:  DGP-SSIFAIGPFAHETQLVSPPLNFSTLTTEPSTPSFT-PPESIHLTTPSSPEVPFAQFLQPTLPKAESDDQYSCPNDDFQSYQFYPGSPVSNLISPRS

Query:  AISLSGASSPWT--------------DLDFASSASQFSNFSLDVPPALLNLDRQGQSSDSCTQNSVGFKSNDDDFDLDPRTSDSMNESQNIQILIDGSQM
          S  G  SP                D +  S+  Q SNF      A   LD      D     + G  S   D D+ P T+   N +QN Q       M
Subjt:  AISLSGASSPWT--------------DLDFASSASQFSNFSLDVPPALLNLDRQGQSSDSCTQNSVGFKSNDDDFDLDPRTSDSMNESQNIQILIDGSQM

Query:  EEPDVANHRFSFELSD
        EE +     F F   +
Subjt:  EEPDVANHRFSFELSD

AT4G25620.1 hydroxyproline-rich glycoprotein family protein1.9e-4636.05Show/hide
Query:  NNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCWSIYWCFGSLKQRKRIGHAVLVPEPSPSPEA----HQNSLQSPDIVLPFAAPPSSPVSFLQSEPP
        N++  T+ AAA AI + + R  + ++VQK+R GS WS+YWCFGS K  KRIGHAVLVPEP+ S  A      +S  S  I +PF APPSSP SFL S PP
Subjt:  NNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCWSIYWCFGSLKQRKRIGHAVLVPEPSPSPEA----HQNSLQSPDIVLPFAAPPSSPVSFLQSEPP

Query:  SATQSPSNILSFTSLTANMYSPDGPSSIFAIGPFAHETQLVSPPLNFSTLTTEPSTPSFTPPESIHLTTPSSPEVPFAQFLQPTLPKAE------SDDQY
        SA+ +P   L   SLT N      P S F IGP+AHETQ V+PP+ FS  TTEPST  FTPP      +PSSPEVPFAQ L  +L +A        + ++
Subjt:  SATQSPSNILSFTSLTANMYSPDGPSSIFAIGPFAHETQLVSPPLNFSTLTTEPSTPSFTPPESIHLTTPSSPEVPFAQFLQPTLPKAE------SDDQY

Query:  SCPNDDFQSYQFYPGSPVSNLISPRSAISLSGASSPWTDLDFASSASQFSNFSLDVPPALLNLD-----RQGQSSDSCTQNSVGFKSNDDDFDLDP---R
        S  + +F+S Q YPGSP  NLISP      SG SSP+              F +  PP  L  +     + G    S +    G  S      L P   +
Subjt:  SCPNDDFQSYQFYPGSPVSNLISPRSAISLSGASSPWTDLDFASSASQFSNFSLDVPPALLNLD-----RQGQSSDSCTQNSVGFKSNDDDFDLDP---R

Query:  TSDSMNESQNIQILIDGSQMEEPDVANHRFSFELSDEDSLLRNIESKPLESNVAVASSPMHETFETAKETSS---GGGHSSNGIEEKAA-----------
         +  +      + +I  S      +       ++S+  SL  +       ++ A+   P   +FE   E  +       + +G  EKA+           
Subjt:  TSDSMNESQNIQILIDGSQMEEPDVANHRFSFELSDEDSLLRNIESKPLESNVAVASSPMHETFETAKETSS---GGGHSSNGIEEKAA-----------

Query:  DGEEANQHQEHHHSTTLGSVNEFNFDNGNGSNALKPNIHSDWWANAKDVETKG--TTTGAWSFFPM
         GE  ++  +   S + GS  EF FD+ N    +   I S+WWAN K V  KG  +   +W+FFP+
Subjt:  DGEEANQHQEHHHSTTLGSVNEFNFDNGNGSNALKPNIHSDWWANAKDVETKG--TTTGAWSFFPM

AT5G52430.1 hydroxyproline-rich glycoprotein family protein1.4e-5738.44Show/hide
Query:  MNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCWSIYWCFGSLKQRKRIGHAVLVPEPSPS---PEAHQNSLQSPDIVLPFAAPPSSPVSFLQSEPP
        +NN+ +T+ AAA AI T + R  + ++ QK RWG CWS+Y CFG+ K  KRIG+AVLVPEP  S       QNS  S  +VLPF APPSSP SFLQS+P 
Subjt:  MNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCWSIYWCFGSLKQRKRIGHAVLVPEPSPS---PEAHQNSLQSPDIVLPFAAPPSSPVSFLQSEPP

Query:  SATQSPSNILSFTSLTANMYSPDGPSSIFAIGPFAHETQLVSPPLNFSTLTTEPSTPSFTPP--ESIHLTTPSSPEVPFAQFLQPTLPKAESD------D
        S + SP   L   SLT+N +SP  P S+F +GP+A+ETQ V+PP+ FS   TEPST  +TPP   S+H+TTPSSPEVPFAQ L  +L     D       
Subjt:  SATQSPSNILSFTSLTANMYSPDGPSSIFAIGPFAHETQLVSPPLNFSTLTTEPSTPSFTPP--ESIHLTTPSSPEVPFAQFLQPTLPKAESD------D

Query:  QYSCPNDDFQSYQFYPGSP-VSNLISPRSAISLSGASSPWTDLDFASSASQFSNFSLDVPPALLNLD-----RQGQSSDSCTQNSVGFKSNDDDFDLDPR
        ++S  + +F+S Q  PGSP   NLISP S IS SG SSP+         S    F +  PP  L  +     + G    S +   VG  S      L P 
Subjt:  QYSCPNDDFQSYQFYPGSP-VSNLISPRSAISLSGASSPWTDLDFASSASQFSNFSLDVPPALLNLD-----RQGQSSDSCTQNSVGFKSNDDDFDLDPR

Query:  --------------TSDSMNESQNIQILIDGSQMEEPDVANHRFSFELSDEDSLLRNIESKPLESNVAVASSPMHETFETAKETSSGGGHSSNGIEEKAA
                      T    N+   +  L +     E  VA+HR SFEL+ ED + R + SK   S+  + ++   ET E      S        IE+++ 
Subjt:  --------------TSDSMNESQNIQILIDGSQMEEPDVANHRFSFELSDEDSLLRNIESKPLESNVAVASSPMHETFETAKETSSGGGHSSNGIEEKAA

Query:  DGEEANQHQEHHHSTTLGSVNEFNFDNGNGSNALKPNIHSDWWANAKDVETKGTTTGAWSFFP
        D E      +   S+++GS  EF FD                  N KD   +     +WSFFP
Subjt:  DGEEANQHQEHHHSTTLGSVNEFNFDNGNGSNALKPNIHSDWWANAKDVETKGTTTGAWSFFP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGAGCGATGAGGCGGCGTGCGGATGCGGATGCGGATGCTGATGCTGATGCTGATGCTGATGCTGCTGATCTGAGGCCTATGAATAACACTTTTCAGACCATTACTGC
GGCGGCCGATGCGATCGCGACCGTTGATCATCGTTTTCCTCGGGCTACTGCCGTCCAGAAAAGAAGATGGGGTAGCTGTTGGAGTATTTATTGGTGCTTTGGATCTCTCA
AACAGAGGAAACGAATTGGGCATGCTGTCCTTGTCCCAGAACCAAGTCCTTCGCCTGAGGCTCATCAAAATTCATTGCAATCCCCAGACATTGTGCTTCCTTTTGCTGCA
CCTCCCTCTTCCCCTGTATCCTTTCTTCAATCAGAGCCACCTTCTGCTACACAATCACCTTCAAATATACTCTCCTTCACTTCTCTCACTGCTAACATGTATTCTCCTGA
TGGGCCTTCCTCGATTTTTGCCATTGGCCCATTTGCTCATGAGACACAGCTTGTATCTCCACCTCTGAATTTTTCTACTCTCACCACTGAACCATCGACTCCTTCCTTCA
CTCCTCCTGAGTCTATCCACTTGACTACACCTTCTTCCCCTGAAGTTCCATTTGCTCAGTTTCTTCAACCTACCCTTCCGAAAGCTGAGTCTGATGACCAATATTCATGT
CCTAATGATGACTTTCAATCTTATCAATTCTATCCTGGTAGCCCAGTTAGCAACCTCATATCGCCACGCTCTGCCATTTCTCTTTCTGGGGCATCTTCGCCTTGGACAGA
TTTAGATTTTGCTTCCTCTGCTTCTCAATTTTCTAATTTCTCATTGGATGTTCCACCTGCGCTGTTGAACCTTGACAGACAAGGCCAAAGTTCTGATTCTTGCACTCAAA
ATTCTGTAGGATTCAAATCGAATGATGATGATTTTGATTTGGATCCTCGAACTTCAGACTCAATGAATGAATCCCAAAATATTCAAATTCTCATTGATGGAAGCCAAATG
GAGGAACCTGATGTTGCTAATCATAGATTCTCATTTGAGTTATCTGATGAAGATTCTTTATTAAGAAACATAGAAAGTAAGCCACTGGAGTCAAATGTTGCAGTTGCATC
ATCTCCAATGCATGAAACATTTGAAACGGCTAAAGAAACTTCTTCTGGTGGTGGTCATAGCTCAAATGGTATAGAAGAAAAGGCAGCAGACGGTGAAGAAGCAAATCAGC
ATCAAGAACATCATCATTCTACTACTCTTGGGTCTGTGAATGAATTTAATTTTGATAATGGCAATGGAAGTAATGCACTTAAGCCTAATATCCACTCAGACTGGTGGGCT
AATGCGAAAGATGTAGAGACAAAAGGCACGACCACGGGGGCCTGGTCATTCTTTCCAATGGCGCAGCAAAGATGA
mRNA sequenceShow/hide mRNA sequence
TTCTTCTCTCTTCTAACGAACTTTCTCTTCACTTTCTGACTGCAAATTCTCCTTATTTGTTCTGTGTTTTCCCCCGAAAAATTTCGTGTAAGAGGAACCACAACTTTCTT
CTATGAACACGATCAGCGATTCCCTGGCTTCGATCAATGAGAGCGATGAGGCGGCGTGCGGATGCGGATGCGGATGCTGATGCTGATGCTGATGCTGATGCTGCTGATCT
GAGGCCTATGAATAACACTTTTCAGACCATTACTGCGGCGGCCGATGCGATCGCGACCGTTGATCATCGTTTTCCTCGGGCTACTGCCGTCCAGAAAAGAAGATGGGGTA
GCTGTTGGAGTATTTATTGGTGCTTTGGATCTCTCAAACAGAGGAAACGAATTGGGCATGCTGTCCTTGTCCCAGAACCAAGTCCTTCGCCTGAGGCTCATCAAAATTCA
TTGCAATCCCCAGACATTGTGCTTCCTTTTGCTGCACCTCCCTCTTCCCCTGTATCCTTTCTTCAATCAGAGCCACCTTCTGCTACACAATCACCTTCAAATATACTCTC
CTTCACTTCTCTCACTGCTAACATGTATTCTCCTGATGGGCCTTCCTCGATTTTTGCCATTGGCCCATTTGCTCATGAGACACAGCTTGTATCTCCACCTCTGAATTTTT
CTACTCTCACCACTGAACCATCGACTCCTTCCTTCACTCCTCCTGAGTCTATCCACTTGACTACACCTTCTTCCCCTGAAGTTCCATTTGCTCAGTTTCTTCAACCTACC
CTTCCGAAAGCTGAGTCTGATGACCAATATTCATGTCCTAATGATGACTTTCAATCTTATCAATTCTATCCTGGTAGCCCAGTTAGCAACCTCATATCGCCACGCTCTGC
CATTTCTCTTTCTGGGGCATCTTCGCCTTGGACAGATTTAGATTTTGCTTCCTCTGCTTCTCAATTTTCTAATTTCTCATTGGATGTTCCACCTGCGCTGTTGAACCTTG
ACAGACAAGGCCAAAGTTCTGATTCTTGCACTCAAAATTCTGTAGGATTCAAATCGAATGATGATGATTTTGATTTGGATCCTCGAACTTCAGACTCAATGAATGAATCC
CAAAATATTCAAATTCTCATTGATGGAAGCCAAATGGAGGAACCTGATGTTGCTAATCATAGATTCTCATTTGAGTTATCTGATGAAGATTCTTTATTAAGAAACATAGA
AAGTAAGCCACTGGAGTCAAATGTTGCAGTTGCATCATCTCCAATGCATGAAACATTTGAAACGGCTAAAGAAACTTCTTCTGGTGGTGGTCATAGCTCAAATGGTATAG
AAGAAAAGGCAGCAGACGGTGAAGAAGCAAATCAGCATCAAGAACATCATCATTCTACTACTCTTGGGTCTGTGAATGAATTTAATTTTGATAATGGCAATGGAAGTAAT
GCACTTAAGCCTAATATCCACTCAGACTGGTGGGCTAATGCGAAAGATGTAGAGACAAAAGGCACGACCACGGGGGCCTGGTCATTCTTTCCAATGGCGCAGCAAAGATG
AGCAAACTGGTGATTATCCTCTGGAATTTCCTCATGCCCATCATGTTTTGCAGTTGCAATTTAGTAGGTAATAGGTAAGACAAATAGCTAAAGGACTGGTGAGCTTTGAA
GGTAAAAAAGAGGACAAATCATGAAAAGAGTAAAACCAGAAGCCATATTCTTTTCAACAATCTGACCTCCTAAACACAGGCAGGTCTGAATAGTATGATAATTAGAAACC
TGTAGGCGACAATGGGCCCTATTAACATACAGTAGTGGCTCCTCACTTGAATTGTAACAGCTATTAGTATTCTGTAGAAATTGGAAGTGTGAAAATATGGTAATAAAAAT
TGTTTTTTATCTTTTGACAGC
Protein sequenceShow/hide protein sequence
MRAMRRRADADADADADADADAADLRPMNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCWSIYWCFGSLKQRKRIGHAVLVPEPSPSPEAHQNSLQSPDIVLPFAA
PPSSPVSFLQSEPPSATQSPSNILSFTSLTANMYSPDGPSSIFAIGPFAHETQLVSPPLNFSTLTTEPSTPSFTPPESIHLTTPSSPEVPFAQFLQPTLPKAESDDQYSC
PNDDFQSYQFYPGSPVSNLISPRSAISLSGASSPWTDLDFASSASQFSNFSLDVPPALLNLDRQGQSSDSCTQNSVGFKSNDDDFDLDPRTSDSMNESQNIQILIDGSQM
EEPDVANHRFSFELSDEDSLLRNIESKPLESNVAVASSPMHETFETAKETSSGGGHSSNGIEEKAADGEEANQHQEHHHSTTLGSVNEFNFDNGNGSNALKPNIHSDWWA
NAKDVETKGTTTGAWSFFPMAQQR