; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0000221 (gene) of Chayote v1 genome

Gene IDSed0000221
OrganismSechium edule (Chayote v1)
DescriptionATP-dependent DNA helicase
Genome locationLG12:32956385..32960220
RNA-Seq ExpressionSed0000221
SyntenySed0000221
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004146718.1 uncharacterized protein LOC101219584 isoform X2 [Cucumis sativus]6.3e-19469Show/hide
Query:  MVEFMDSKTTLYPEKLAVKRECDRSFHYELGPLDKRFKSDLH-QSIMGIGTLPATSLQSNLLDEPSPLGLTLRKSPSLLDLIQMKLSQGITST-AGYSNS
        MV+F+DS  TL+P+K  VKRECD SFH+E    DKRFK D H QS++   TL +TS  +N LDEPSPLGL LRKSPSLLDLIQMKLSQG ++T AG SN+
Subjt:  MVEFMDSKTTLYPEKLAVKRECDRSFHYELGPLDKRFKSDLH-QSIMGIGTLPATSLQSNLLDEPSPLGLTLRKSPSLLDLIQMKLSQGITST-AGYSNS

Query:  GNFDFVVKKESMDVTMSGTAEKLKASNFPASHLKIGRWEYKSRYEGDLVAKCYYAKHKLVWEVLEGGLKSKIEIQWSDIMALKANFPDDGPATLNIVLAR
          FDFVVK ES D T+ GT EKLKASNFPASHLKIGRWEYKSR+EGDLVAKCYYAKHKLVWE+LEGGLKSKIEIQWSDIM LKAN PDDGPA LN+VLAR
Subjt:  GNFDFVVKKESMDVTMSGTAEKLKASNFPASHLKIGRWEYKSRYEGDLVAKCYYAKHKLVWEVLEGGLKSKIEIQWSDIMALKANFPDDGPATLNIVLAR

Query:  RPLFFRETNPQPRKHTLWQATADFTDGEASLHRQHFMQCPQGLLNKHFEKLIQCDSRLNFLSRQPEIVSGSPYFEQRASGFTT--------------SRP
        +PLFFRETNPQPRKHTLWQATADFTDGEAS+ R+H +QCP GLLNKHFEKLIQCDSRLNFLSRQPEIV GSPYFE RAS FT               ++ 
Subjt:  RPLFFRETNPQPRKHTLWQATADFTDGEASLHRQHFMQCPQGLLNKHFEKLIQCDSRLNFLSRQPEIVSGSPYFEQRASGFTT--------------SRP

Query:  ILSTFQDVASSSAASLSKIEESSPQMVFEPCYLAAPSPSSVMDAQEIEENMITTKVTSMPRNWEQIKFPRLHPSMSMSDLVNHIEQHITEQMTSTKSPFV
        +LS FQDV SS+AA+   IE++SPQMVFEP  + APSPSSVMDA+EIEEN  +TKVT  P NWEQIK P LHPSMSMSDLVNHI  HITEQM STK+PFV
Subjt:  ILSTFQDVASSSAASLSKIEESSPQMVFEPCYLAAPSPSSVMDAQEIEENMITTKVTSMPRNWEQIKFPRLHPSMSMSDLVNHIEQHITEQMTSTKSPFV

Query:  DDGS-QCQEMLNELSLYLLSDNQLSAAATSDEASLMSRVNSLCCLLQKEPAAAAVQSFQPSGEICVEGFDSK----------------------IPAGET
        DDGS + Q ML++++ YLLSDNQLS  ATSDE SLMSRVNSLCCLLQKEP    VQS Q +GE   +G ++K                      I   ++
Subjt:  DDGS-QCQEMLNELSLYLLSDNQLSAAATSDEASLMSRVNSLCCLLQKEPAAAAVQSFQPSGEICVEGFDSK----------------------IPAGET

Query:  KDVQPLAMLRKDSFADLLLHLPRIASLPKFLFNISDGDEDQD
          +Q  AM RKDS+ +LLLHLPRIASLPKFLF+ISDGDE QD
Subjt:  KDVQPLAMLRKDSFADLLLHLPRIASLPKFLFNISDGDEDQD

XP_008443832.1 PREDICTED: uncharacterized protein LOC103487331 isoform X3 [Cucumis melo]1.1e-19569.12Show/hide
Query:  MVEFMDSKTTLYPEKLAVKRECDRSFHYELGPLDKRFKSDLH--QSIMGIGTLPATSLQSNLLDEPSPLGLTLRKSPSLLDLIQMKLSQGITST-AGYSN
        MV+F DS  TL+ +K  VKRECD SFH+E    DKRFK D H  QS++G  TL +TS  +N LDEPSPLGL LRKSPSLLDLIQMKLSQG +ST AG SN
Subjt:  MVEFMDSKTTLYPEKLAVKRECDRSFHYELGPLDKRFKSDLH--QSIMGIGTLPATSLQSNLLDEPSPLGLTLRKSPSLLDLIQMKLSQGITST-AGYSN

Query:  SGNFDFVVKKESMDVTMSGTAEKLKASNFPASHLKIGRWEYKSRYEGDLVAKCYYAKHKLVWEVLEGGLKSKIEIQWSDIMALKANFPDDGPATLNIVLA
        +  FDFVVK ES D T+ GT EKLKASNFPAS LKIGRWEYKSR+EGDLVAKCYYAKHKLVWE+LEGGLKSKIEIQWSDIM LKAN PDDGPA LN+VLA
Subjt:  SGNFDFVVKKESMDVTMSGTAEKLKASNFPASHLKIGRWEYKSRYEGDLVAKCYYAKHKLVWEVLEGGLKSKIEIQWSDIMALKANFPDDGPATLNIVLA

Query:  RRPLFFRETNPQPRKHTLWQATADFTDGEASLHRQHFMQCPQGLLNKHFEKLIQCDSRLNFLSRQPEIVSGSPYFEQRASGFTT---------------S
        RRPLFFRETNPQPRKHTLWQATADFTDGEAS+HRQHF+QCP GLLNKHFEKL+QCDSRLNFLSRQP IV GSPYFE RAS FTT               +
Subjt:  RRPLFFRETNPQPRKHTLWQATADFTDGEASLHRQHFMQCPQGLLNKHFEKLIQCDSRLNFLSRQPEIVSGSPYFEQRASGFTT---------------S

Query:  RPILSTFQDVASSSAASLSKIEESSPQMVFEPCYLAAPSPSSVMDAQEIEENMITTKVTSMPRNWEQIKFPRLHPSMSMSDLVNHIEQHITEQMTSTKSP
        + +LS FQDV SS+ A+   IE++SPQMVFEP  + APSPSSVMDA EIEEN  ++KVTS PRNWE IK P LHPSMSMSDLVNHI  HITEQM STK+P
Subjt:  RPILSTFQDVASSSAASLSKIEESSPQMVFEPCYLAAPSPSSVMDAQEIEENMITTKVTSMPRNWEQIKFPRLHPSMSMSDLVNHIEQHITEQMTSTKSP

Query:  FVDDGS-QCQEMLNELSLYLLSDNQLSAAATSDEASLMSRVNSLCCLLQKEPAAAAVQSFQPSGEICVEGFDSK----------------------IPAG
        FVDDGS + Q ML++++ YLLSDNQLS  ATSDE SLMSRVNSLCCLLQKEP    VQS Q +GE   EG ++K                      I   
Subjt:  FVDDGS-QCQEMLNELSLYLLSDNQLSAAATSDEASLMSRVNSLCCLLQKEPAAAAVQSFQPSGEICVEGFDSK----------------------IPAG

Query:  ETKDVQPLAMLRKDSFADLLLHLPRIASLPKFLFNISDGDEDQD
        ++  +Q  +M RKDS+ +LLLHLPRIASLPKFLF+ISDGDE QD
Subjt:  ETKDVQPLAMLRKDSFADLLLHLPRIASLPKFLFNISDGDEDQD

XP_031739284.1 uncharacterized protein LOC101219584 isoform X3 [Cucumis sativus]7.4e-19568.95Show/hide
Query:  MVEFMDSKTTLYPEKLAVKRECDRSFHYELGPLDKRFKSDLHQSIMGIGTLPATSLQSNLLDEPSPLGLTLRKSPSLLDLIQMKLSQGITST-AGYSNSG
        MV+F+DS  TL+P+K  VKRECD SFH+E    DKRFK D H+S++   TL +TS  +N LDEPSPLGL LRKSPSLLDLIQMKLSQG ++T AG SN+ 
Subjt:  MVEFMDSKTTLYPEKLAVKRECDRSFHYELGPLDKRFKSDLHQSIMGIGTLPATSLQSNLLDEPSPLGLTLRKSPSLLDLIQMKLSQGITST-AGYSNSG

Query:  NFDFVVKKESMDVTMSGTAEKLKASNFPASHLKIGRWEYKSRYEGDLVAKCYYAKHKLVWEVLEGGLKSKIEIQWSDIMALKANFPDDGPATLNIVLARR
         FDFVVK ES D T+ GT EKLKASNFPASHLKIGRWEYKSR+EGDLVAKCYYAKHKLVWE+LEGGLKSKIEIQWSDIM LKAN PDDGPA LN+VLAR+
Subjt:  NFDFVVKKESMDVTMSGTAEKLKASNFPASHLKIGRWEYKSRYEGDLVAKCYYAKHKLVWEVLEGGLKSKIEIQWSDIMALKANFPDDGPATLNIVLARR

Query:  PLFFRETNPQPRKHTLWQATADFTDGEASLHRQHFMQCPQGLLNKHFEKLIQCDSRLNFLSRQPEIVSGSPYFEQRASGFTT--------------SRPI
        PLFFRETNPQPRKHTLWQATADFTDGEAS+ R+H +QCP GLLNKHFEKLIQCDSRLNFLSRQPEIV GSPYFE RAS FT               ++ +
Subjt:  PLFFRETNPQPRKHTLWQATADFTDGEASLHRQHFMQCPQGLLNKHFEKLIQCDSRLNFLSRQPEIVSGSPYFEQRASGFTT--------------SRPI

Query:  LSTFQDVASSSAASLSKIEESSPQMVFEPCYLAAPSPSSVMDAQEIEENMITTKVTSMPRNWEQIKFPRLHPSMSMSDLVNHIEQHITEQMTSTKSPFVD
        LS FQDV SS+AA+   IE++SPQMVFEP  + APSPSSVMDA+EIEEN  +TKVT  P NWEQIK P LHPSMSMSDLVNHI  HITEQM STK+PFVD
Subjt:  LSTFQDVASSSAASLSKIEESSPQMVFEPCYLAAPSPSSVMDAQEIEENMITTKVTSMPRNWEQIKFPRLHPSMSMSDLVNHIEQHITEQMTSTKSPFVD

Query:  DGS-QCQEMLNELSLYLLSDNQLSAAATSDEASLMSRVNSLCCLLQKEPAAAAVQSFQPSGEICVEGFDSK----------------------IPAGETK
        DGS + Q ML++++ YLLSDNQLS  ATSDE SLMSRVNSLCCLLQKEP    VQS Q +GE   +G ++K                      I   ++ 
Subjt:  DGS-QCQEMLNELSLYLLSDNQLSAAATSDEASLMSRVNSLCCLLQKEPAAAAVQSFQPSGEICVEGFDSK----------------------IPAGETK

Query:  DVQPLAMLRKDSFADLLLHLPRIASLPKFLFNISDGDEDQD
         +Q  AM RKDS+ +LLLHLPRIASLPKFLF+ISDGDE QD
Subjt:  DVQPLAMLRKDSFADLLLHLPRIASLPKFLFNISDGDEDQD

XP_038879515.1 uncharacterized protein LOC120071357 isoform X1 [Benincasa hispida]8.8e-19668.65Show/hide
Query:  MVEFMDSKTTLYPEKLAVKRECDRSFHYELGPLDKRFKSDLHQSIMGIGTLPATSLQSNLLDEPSPLGLTLRKSPSLLDLIQMKLSQGITST--AGYSNS
        MV+F+DS  TLYPEKL VKRE D SFH+E   LDKRFK D H+S+ G  TL ATS  +N LDEPSPLGL LRKSPSLLDLIQMKLSQG  ST  AG SN+
Subjt:  MVEFMDSKTTLYPEKLAVKRECDRSFHYELGPLDKRFKSDLHQSIMGIGTLPATSLQSNLLDEPSPLGLTLRKSPSLLDLIQMKLSQGITST--AGYSNS

Query:  GNFDFVVKKESMDVTMSGTAEKLKASNFPASHLKIGRWEYKSRYEGDLVAKCYYAKHKLVWEVLEGGLKSKIEIQWSDIMALKANFPDDGPATLNIVLAR
          F+FVVK ES D T+ GT EKLKASNFPAS L+IGRWEYKSR+EGDLVAKCYYAKHKLVWE+LEGGLKSKIEIQWSDIMAL AN PDDGPA LN+VLAR
Subjt:  GNFDFVVKKESMDVTMSGTAEKLKASNFPASHLKIGRWEYKSRYEGDLVAKCYYAKHKLVWEVLEGGLKSKIEIQWSDIMALKANFPDDGPATLNIVLAR

Query:  RPLFFRETNPQPRKHTLWQATADFTDGEASLHRQHFMQCPQGLLNKHFEKLIQCDSRLNFLSRQPEIVSGSPYFEQRASGFTT---------------SR
        RPLFFRETNPQPRKHTLWQATADFTDGEAS+ RQHF+QCP GLLNKHFEKLIQCDSRLNFLSRQPEIV GSPYFE  AS FTT               ++
Subjt:  RPLFFRETNPQPRKHTLWQATADFTDGEASLHRQHFMQCPQGLLNKHFEKLIQCDSRLNFLSRQPEIVSGSPYFEQRASGFTT---------------SR

Query:  PILSTFQDVASSSAASLSKIEESSPQMVFEPCYLAAPSPSS-------------VMDAQEIEENMITTKVTSMPRNWEQIKFPRLHPSMSMSDLVNHIEQ
         +LSTFQDV SSSA S  KIE++SPQMVFEP  + APSPSS             VMDA EIEEN  +TKVT  PRNWEQIK P +HPS+SMSDLVNHI  
Subjt:  PILSTFQDVASSSAASLSKIEESSPQMVFEPCYLAAPSPSS-------------VMDAQEIEENMITTKVTSMPRNWEQIKFPRLHPSMSMSDLVNHIEQ

Query:  HITEQMTSTKSPFVDDGS-QCQEMLNELSLYLLSDNQLSAAATSDEASLMSRVNSLCCLLQKEPAAAAVQSFQPSGEICVEGFDSKIPAGETKD------
        HITEQM STK+PFVD+GS + Q ML++++ YLLSDNQLSAA  SDE SLMSRV+SLCCLLQKEP    VQS Q +GE CVEG + +      +D      
Subjt:  HITEQMTSTKSPFVDDGS-QCQEMLNELSLYLLSDNQLSAAATSDEASLMSRVNSLCCLLQKEPAAAAVQSFQPSGEICVEGFDSKIPAGETKD------

Query:  ---------------VQPLAMLRKDSFADLLLHLPRIASLPKFLFNISDGDEDQD
                       +Q  AM RKDS+ DLLLHLPRIASLPK LF+ISDGDE QD
Subjt:  ---------------VQPLAMLRKDSFADLLLHLPRIASLPKFLFNISDGDEDQD

XP_038879516.1 uncharacterized protein LOC120071357 isoform X2 [Benincasa hispida]1.4e-19870.3Show/hide
Query:  MVEFMDSKTTLYPEKLAVKRECDRSFHYELGPLDKRFKSDLHQSIMGIGTLPATSLQSNLLDEPSPLGLTLRKSPSLLDLIQMKLSQGITST--AGYSNS
        MV+F+DS  TLYPEKL VKRE D SFH+E   LDKRFK D H+S+ G  TL ATS  +N LDEPSPLGL LRKSPSLLDLIQMKLSQG  ST  AG SN+
Subjt:  MVEFMDSKTTLYPEKLAVKRECDRSFHYELGPLDKRFKSDLHQSIMGIGTLPATSLQSNLLDEPSPLGLTLRKSPSLLDLIQMKLSQGITST--AGYSNS

Query:  GNFDFVVKKESMDVTMSGTAEKLKASNFPASHLKIGRWEYKSRYEGDLVAKCYYAKHKLVWEVLEGGLKSKIEIQWSDIMALKANFPDDGPATLNIVLAR
          F+FVVK ES D T+ GT EKLKASNFPAS L+IGRWEYKSR+EGDLVAKCYYAKHKLVWE+LEGGLKSKIEIQWSDIMAL AN PDDGPA LN+VLAR
Subjt:  GNFDFVVKKESMDVTMSGTAEKLKASNFPASHLKIGRWEYKSRYEGDLVAKCYYAKHKLVWEVLEGGLKSKIEIQWSDIMALKANFPDDGPATLNIVLAR

Query:  RPLFFRETNPQPRKHTLWQATADFTDGEASLHRQHFMQCPQGLLNKHFEKLIQCDSRLNFLSRQPEIVSGSPYFEQRASGFTT---------------SR
        RPLFFRETNPQPRKHTLWQATADFTDGEAS+ RQHF+QCP GLLNKHFEKLIQCDSRLNFLSRQPEIV GSPYFE  AS FTT               ++
Subjt:  RPLFFRETNPQPRKHTLWQATADFTDGEASLHRQHFMQCPQGLLNKHFEKLIQCDSRLNFLSRQPEIVSGSPYFEQRASGFTT---------------SR

Query:  PILSTFQDVASSSAASLSKIEESSPQMVFEPCYLAAPSPSSVMDAQEIEENMITTKVTSMPRNWEQIKFPRLHPSMSMSDLVNHIEQHITEQMTSTKSPF
         +LSTFQDV SSSA S  KIE++SPQMVFEP  + APSPSSVMDA EIEEN  +TKVT  PRNWEQIK P +HPS+SMSDLVNHI  HITEQM STK+PF
Subjt:  PILSTFQDVASSSAASLSKIEESSPQMVFEPCYLAAPSPSSVMDAQEIEENMITTKVTSMPRNWEQIKFPRLHPSMSMSDLVNHIEQHITEQMTSTKSPF

Query:  VDDGS-QCQEMLNELSLYLLSDNQLSAAATSDEASLMSRVNSLCCLLQKEPAAAAVQSFQPSGEICVEGFDSKIPAGETKD-------------------
        VD+GS + Q ML++++ YLLSDNQLSAA  SDE SLMSRV+SLCCLLQKEP    VQS Q +GE CVEG + +      +D                   
Subjt:  VDDGS-QCQEMLNELSLYLLSDNQLSAAATSDEASLMSRVNSLCCLLQKEPAAAAVQSFQPSGEICVEGFDSKIPAGETKD-------------------

Query:  --VQPLAMLRKDSFADLLLHLPRIASLPKFLFNISDGDEDQD
          +Q  AM RKDS+ DLLLHLPRIASLPK LF+ISDGDE QD
Subjt:  --VQPLAMLRKDSFADLLLHLPRIASLPKFLFNISDGDEDQD

TrEMBL top hitse value%identityAlignment
A0A0A0LTD1 Uncharacterized protein5.7e-19369.02Show/hide
Query:  FMDSKTTLYPEKLAVKRECDRSFHYELGPLDKRFKSDLH-QSIMGIGTLPATSLQSNLLDEPSPLGLTLRKSPSLLDLIQMKLSQGITST-AGYSNSGNF
        F+DS  TL+P+K  VKRECD SFH+E    DKRFK D H QS++   TL +TS  +N LDEPSPLGL LRKSPSLLDLIQMKLSQG ++T AG SN+  F
Subjt:  FMDSKTTLYPEKLAVKRECDRSFHYELGPLDKRFKSDLH-QSIMGIGTLPATSLQSNLLDEPSPLGLTLRKSPSLLDLIQMKLSQGITST-AGYSNSGNF

Query:  DFVVKKESMDVTMSGTAEKLKASNFPASHLKIGRWEYKSRYEGDLVAKCYYAKHKLVWEVLEGGLKSKIEIQWSDIMALKANFPDDGPATLNIVLARRPL
        DFVVK ES D T+ GT EKLKASNFPASHLKIGRWEYKSR+EGDLVAKCYYAKHKLVWE+LEGGLKSKIEIQWSDIM LKAN PDDGPA LN+VLAR+PL
Subjt:  DFVVKKESMDVTMSGTAEKLKASNFPASHLKIGRWEYKSRYEGDLVAKCYYAKHKLVWEVLEGGLKSKIEIQWSDIMALKANFPDDGPATLNIVLARRPL

Query:  FFRETNPQPRKHTLWQATADFTDGEASLHRQHFMQCPQGLLNKHFEKLIQCDSRLNFLSRQPEIVSGSPYFEQRASGFTT--------------SRPILS
        FFRETNPQPRKHTLWQATADFTDGEAS+ R+H +QCP GLLNKHFEKLIQCDSRLNFLSRQPEIV GSPYFE RAS FT               ++ +LS
Subjt:  FFRETNPQPRKHTLWQATADFTDGEASLHRQHFMQCPQGLLNKHFEKLIQCDSRLNFLSRQPEIVSGSPYFEQRASGFTT--------------SRPILS

Query:  TFQDVASSSAASLSKIEESSPQMVFEPCYLAAPSPSSVMDAQEIEENMITTKVTSMPRNWEQIKFPRLHPSMSMSDLVNHIEQHITEQMTSTKSPFVDDG
         FQDV SS+AA+   IE++SPQMVFEP  + APSPSSVMDA+EIEEN  +TKVT  P NWEQIK P LHPSMSMSDLVNHI  HITEQM STK+PFVDDG
Subjt:  TFQDVASSSAASLSKIEESSPQMVFEPCYLAAPSPSSVMDAQEIEENMITTKVTSMPRNWEQIKFPRLHPSMSMSDLVNHIEQHITEQMTSTKSPFVDDG

Query:  S-QCQEMLNELSLYLLSDNQLSAAATSDEASLMSRVNSLCCLLQKEPAAAAVQSFQPSGEICVEGFDSK----------------------IPAGETKDV
        S + Q ML++++ YLLSDNQLS  ATSDE SLMSRVNSLCCLLQKEP    VQS Q +GE   +G ++K                      I   ++  +
Subjt:  S-QCQEMLNELSLYLLSDNQLSAAATSDEASLMSRVNSLCCLLQKEPAAAAVQSFQPSGEICVEGFDSK----------------------IPAGETKDV

Query:  QPLAMLRKDSFADLLLHLPRIASLPKFLFNISDGDEDQD
        Q  AM RKDS+ +LLLHLPRIASLPKFLF+ISDGDE QD
Subjt:  QPLAMLRKDSFADLLLHLPRIASLPKFLFNISDGDEDQD

A0A1S3B8H0 uncharacterized protein LOC103487331 isoform X13.4e-19367.5Show/hide
Query:  MVEFMDSKTTLYPEKLAVKRECDRSFHYELGPLDKRFKSDLH--QSIMGIGTLPATSLQSNLLDEPSPLGLTLRKSPSLLDLIQMKLSQGITST-AGYSN
        MV+F DS  TL+ +K  VKRECD SFH+E    DKRFK D H  QS++G  TL +TS  +N LDEPSPLGL LRKSPSLLDLIQMKLSQG +ST AG SN
Subjt:  MVEFMDSKTTLYPEKLAVKRECDRSFHYELGPLDKRFKSDLH--QSIMGIGTLPATSLQSNLLDEPSPLGLTLRKSPSLLDLIQMKLSQGITST-AGYSN

Query:  SGNFDFVVKKESMDVTMSGTAEKLKASNFPASHLKIGRWEYKSRYEGDLVAKCYYAKHKLVWEVLEGGLKSKIEIQWSDIMALKANFPDDGPATLNIVLA
        +  FDFVVK ES D T+ GT EKLKASNFPAS LKIGRWEYKSR+EGDLVAKCYYAKHKLVWE+LEGGLKSKIEIQWSDIM LKAN PDDGPA LN+VLA
Subjt:  SGNFDFVVKKESMDVTMSGTAEKLKASNFPASHLKIGRWEYKSRYEGDLVAKCYYAKHKLVWEVLEGGLKSKIEIQWSDIMALKANFPDDGPATLNIVLA

Query:  RRPLFFRETNPQPRKHTLWQATADFTDGEASLHRQHFMQCPQGLLNKHFEKLIQCDSRLNFLSRQPEIVSGSPYFEQRASGFTT---------------S
        RRPLFFRETNPQPRKHTLWQATADFTDGEAS+HRQHF+QCP GLLNKHFEKL+QCDSRLNFLSRQP IV GSPYFE RAS FTT               +
Subjt:  RRPLFFRETNPQPRKHTLWQATADFTDGEASLHRQHFMQCPQGLLNKHFEKLIQCDSRLNFLSRQPEIVSGSPYFEQRASGFTT---------------S

Query:  RPILSTFQDVASSSAASLSKIEESSPQMVFEPCYLAAPSPSS-------------VMDAQEIEENMITTKVTSMPRNWEQIKFPRLHPSMSMSDLVNHIE
        + +LS FQDV SS+ A+   IE++SPQMVFEP  + APSPSS             VMDA EIEEN  ++KVTS PRNWE IK P LHPSMSMSDLVNHI 
Subjt:  RPILSTFQDVASSSAASLSKIEESSPQMVFEPCYLAAPSPSS-------------VMDAQEIEENMITTKVTSMPRNWEQIKFPRLHPSMSMSDLVNHIE

Query:  QHITEQMTSTKSPFVDDGS-QCQEMLNELSLYLLSDNQLSAAATSDEASLMSRVNSLCCLLQKEPAAAAVQSFQPSGEICVEGFDSK-------------
         HITEQM STK+PFVDDGS + Q ML++++ YLLSDNQLS  ATSDE SLMSRVNSLCCLLQKEP    VQS Q +GE   EG ++K             
Subjt:  QHITEQMTSTKSPFVDDGS-QCQEMLNELSLYLLSDNQLSAAATSDEASLMSRVNSLCCLLQKEPAAAAVQSFQPSGEICVEGFDSK-------------

Query:  ---------IPAGETKDVQPLAMLRKDSFADLLLHLPRIASLPKFLFNISDGDEDQD
                 I   ++  +Q  +M RKDS+ +LLLHLPRIASLPKFLF+ISDGDE QD
Subjt:  ---------IPAGETKDVQPLAMLRKDSFADLLLHLPRIASLPKFLFNISDGDEDQD

A0A1S3B8X8 uncharacterized protein LOC103487331 isoform X22.6e-19367.63Show/hide
Query:  MVEFMDSKTTLYPEKLAVKRECDRSFHYELGPLDKRFKSDLH-QSIMGIGTLPATSLQSNLLDEPSPLGLTLRKSPSLLDLIQMKLSQGITST-AGYSNS
        MV+F DS  TL+ +K  VKRECD SFH+E    DKRFK D H QS++G  TL +TS  +N LDEPSPLGL LRKSPSLLDLIQMKLSQG +ST AG SN+
Subjt:  MVEFMDSKTTLYPEKLAVKRECDRSFHYELGPLDKRFKSDLH-QSIMGIGTLPATSLQSNLLDEPSPLGLTLRKSPSLLDLIQMKLSQGITST-AGYSNS

Query:  GNFDFVVKKESMDVTMSGTAEKLKASNFPASHLKIGRWEYKSRYEGDLVAKCYYAKHKLVWEVLEGGLKSKIEIQWSDIMALKANFPDDGPATLNIVLAR
          FDFVVK ES D T+ GT EKLKASNFPAS LKIGRWEYKSR+EGDLVAKCYYAKHKLVWE+LEGGLKSKIEIQWSDIM LKAN PDDGPA LN+VLAR
Subjt:  GNFDFVVKKESMDVTMSGTAEKLKASNFPASHLKIGRWEYKSRYEGDLVAKCYYAKHKLVWEVLEGGLKSKIEIQWSDIMALKANFPDDGPATLNIVLAR

Query:  RPLFFRETNPQPRKHTLWQATADFTDGEASLHRQHFMQCPQGLLNKHFEKLIQCDSRLNFLSRQPEIVSGSPYFEQRASGFTT---------------SR
        RPLFFRETNPQPRKHTLWQATADFTDGEAS+HRQHF+QCP GLLNKHFEKL+QCDSRLNFLSRQP IV GSPYFE RAS FTT               ++
Subjt:  RPLFFRETNPQPRKHTLWQATADFTDGEASLHRQHFMQCPQGLLNKHFEKLIQCDSRLNFLSRQPEIVSGSPYFEQRASGFTT---------------SR

Query:  PILSTFQDVASSSAASLSKIEESSPQMVFEPCYLAAPSPSS-------------VMDAQEIEENMITTKVTSMPRNWEQIKFPRLHPSMSMSDLVNHIEQ
         +LS FQDV SS+ A+   IE++SPQMVFEP  + APSPSS             VMDA EIEEN  ++KVTS PRNWE IK P LHPSMSMSDLVNHI  
Subjt:  PILSTFQDVASSSAASLSKIEESSPQMVFEPCYLAAPSPSS-------------VMDAQEIEENMITTKVTSMPRNWEQIKFPRLHPSMSMSDLVNHIEQ

Query:  HITEQMTSTKSPFVDDGS-QCQEMLNELSLYLLSDNQLSAAATSDEASLMSRVNSLCCLLQKEPAAAAVQSFQPSGEICVEGFDSK--------------
        HITEQM STK+PFVDDGS + Q ML++++ YLLSDNQLS  ATSDE SLMSRVNSLCCLLQKEP    VQS Q +GE   EG ++K              
Subjt:  HITEQMTSTKSPFVDDGS-QCQEMLNELSLYLLSDNQLSAAATSDEASLMSRVNSLCCLLQKEPAAAAVQSFQPSGEICVEGFDSK--------------

Query:  --------IPAGETKDVQPLAMLRKDSFADLLLHLPRIASLPKFLFNISDGDEDQD
                I   ++  +Q  +M RKDS+ +LLLHLPRIASLPKFLF+ISDGDE QD
Subjt:  --------IPAGETKDVQPLAMLRKDSFADLLLHLPRIASLPKFLFNISDGDEDQD

A0A1S3B908 uncharacterized protein LOC103487331 isoform X35.6e-19669.12Show/hide
Query:  MVEFMDSKTTLYPEKLAVKRECDRSFHYELGPLDKRFKSDLH--QSIMGIGTLPATSLQSNLLDEPSPLGLTLRKSPSLLDLIQMKLSQGITST-AGYSN
        MV+F DS  TL+ +K  VKRECD SFH+E    DKRFK D H  QS++G  TL +TS  +N LDEPSPLGL LRKSPSLLDLIQMKLSQG +ST AG SN
Subjt:  MVEFMDSKTTLYPEKLAVKRECDRSFHYELGPLDKRFKSDLH--QSIMGIGTLPATSLQSNLLDEPSPLGLTLRKSPSLLDLIQMKLSQGITST-AGYSN

Query:  SGNFDFVVKKESMDVTMSGTAEKLKASNFPASHLKIGRWEYKSRYEGDLVAKCYYAKHKLVWEVLEGGLKSKIEIQWSDIMALKANFPDDGPATLNIVLA
        +  FDFVVK ES D T+ GT EKLKASNFPAS LKIGRWEYKSR+EGDLVAKCYYAKHKLVWE+LEGGLKSKIEIQWSDIM LKAN PDDGPA LN+VLA
Subjt:  SGNFDFVVKKESMDVTMSGTAEKLKASNFPASHLKIGRWEYKSRYEGDLVAKCYYAKHKLVWEVLEGGLKSKIEIQWSDIMALKANFPDDGPATLNIVLA

Query:  RRPLFFRETNPQPRKHTLWQATADFTDGEASLHRQHFMQCPQGLLNKHFEKLIQCDSRLNFLSRQPEIVSGSPYFEQRASGFTT---------------S
        RRPLFFRETNPQPRKHTLWQATADFTDGEAS+HRQHF+QCP GLLNKHFEKL+QCDSRLNFLSRQP IV GSPYFE RAS FTT               +
Subjt:  RRPLFFRETNPQPRKHTLWQATADFTDGEASLHRQHFMQCPQGLLNKHFEKLIQCDSRLNFLSRQPEIVSGSPYFEQRASGFTT---------------S

Query:  RPILSTFQDVASSSAASLSKIEESSPQMVFEPCYLAAPSPSSVMDAQEIEENMITTKVTSMPRNWEQIKFPRLHPSMSMSDLVNHIEQHITEQMTSTKSP
        + +LS FQDV SS+ A+   IE++SPQMVFEP  + APSPSSVMDA EIEEN  ++KVTS PRNWE IK P LHPSMSMSDLVNHI  HITEQM STK+P
Subjt:  RPILSTFQDVASSSAASLSKIEESSPQMVFEPCYLAAPSPSSVMDAQEIEENMITTKVTSMPRNWEQIKFPRLHPSMSMSDLVNHIEQHITEQMTSTKSP

Query:  FVDDGS-QCQEMLNELSLYLLSDNQLSAAATSDEASLMSRVNSLCCLLQKEPAAAAVQSFQPSGEICVEGFDSK----------------------IPAG
        FVDDGS + Q ML++++ YLLSDNQLS  ATSDE SLMSRVNSLCCLLQKEP    VQS Q +GE   EG ++K                      I   
Subjt:  FVDDGS-QCQEMLNELSLYLLSDNQLSAAATSDEASLMSRVNSLCCLLQKEPAAAAVQSFQPSGEICVEGFDSK----------------------IPAG

Query:  ETKDVQPLAMLRKDSFADLLLHLPRIASLPKFLFNISDGDEDQD
        ++  +Q  +M RKDS+ +LLLHLPRIASLPKFLF+ISDGDE QD
Subjt:  ETKDVQPLAMLRKDSFADLLLHLPRIASLPKFLFNISDGDEDQD

A0A6J1JMN7 uncharacterized protein LOC111485770 isoform X27.0e-19168.47Show/hide
Query:  MVEFMDSKTTLYPEKLAVKRECDRSFHYELGPLDKRFKSDLHQSIMGIGTLPATSLQSNLLDEPSPLGLTLRKSPSLLDLIQMKLSQGITST-AGYSNSG
        MV+F++SK TLYPEK A KREC   F  E   LDKR K D H+S+ G  TL A+S  +N LDEPSPLGL LRKSPSLLDLIQMKLSQ   ST  G SNS 
Subjt:  MVEFMDSKTTLYPEKLAVKRECDRSFHYELGPLDKRFKSDLHQSIMGIGTLPATSLQSNLLDEPSPLGLTLRKSPSLLDLIQMKLSQGITST-AGYSNSG

Query:  NFDFVVKKESMDVTMSGTAEKLKASNFPASHLKIGRWEYKSRYEGDLVAKCYYAKHKLVWEVLEGGLKSKIEIQWSDIMALKANFPDDGPATLNIVLARR
         F+FVVKK++ + T+ G  EKLKASNFPAS LKIGRWEYKSR+EGDLVAKCYYAKHK+VWE+LEGGLKSKIEIQWSDIMALKANFPDDGPA LN+VLARR
Subjt:  NFDFVVKKESMDVTMSGTAEKLKASNFPASHLKIGRWEYKSRYEGDLVAKCYYAKHKLVWEVLEGGLKSKIEIQWSDIMALKANFPDDGPATLNIVLARR

Query:  PLFFRETNPQPRKHTLWQATADFTDGEASLHRQHFMQCPQGLLNKHFEKLIQCDSRLNFLSRQPEIVSGSPYFEQRASGFTT-------SRPILSTFQDV
        PLFFRETNPQPRKHTLWQA ADFTDGEAS+ RQHF+QCP G+LNKHFEKLIQCDS LNFLSRQPEIV GSPYFE  AS FTT        + +LSTFQDV
Subjt:  PLFFRETNPQPRKHTLWQATADFTDGEASLHRQHFMQCPQGLLNKHFEKLIQCDSRLNFLSRQPEIVSGSPYFEQRASGFTT-------SRPILSTFQDV

Query:  ASSSAASLSKIEESSPQMVFEPCYLAAPSPSSVMDAQEIEENMITTKVTSMPRNWEQIKFPRLHPSMSMSDLVNHIEQHITEQMTSTKSPFVDDGS-QCQ
         SSS  S  +IE+SSPQMVF+P  +  PSPSSVMDA EIEEN  +T+V+  PRNWEQIK   LHPSMSM DLV+HI  HITEQM STK+PFVDDGS + Q
Subjt:  ASSSAASLSKIEESSPQMVFEPCYLAAPSPSSVMDAQEIEENMITTKVTSMPRNWEQIKFPRLHPSMSMSDLVNHIEQHITEQMTSTKSPFVDDGS-QCQ

Query:  EMLNELSLYLLSDNQLSAAATSDEASLMSRVNSLCCLLQKEPAAAAVQSFQPSGEICVEGFDSKI---PAGETKD---------------------VQPL
         MLNE++ YLLSD+QLS+AA + E S+MSRVNSLCCLLQKEP  AAVQS Q SGE CV+  + ++    A E +D                     +Q  
Subjt:  EMLNELSLYLLSDNQLSAAATSDEASLMSRVNSLCCLLQKEPAAAAVQSFQPSGEICVEGFDSKI---PAGETKD---------------------VQPL

Query:  AMLRKDSFADLLLHLPRIASLPKFLFNISDGDEDQD
        AM RKDSF DL LHLPRIASLPKF F+IS+GDE QD
Subjt:  AMLRKDSFADLLLHLPRIASLPKFLFNISDGDEDQD

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G54300.1 unknown protein1.9e-2643.9Show/hide
Query:  NFPASHLKIGRWEYKSRYEGDLVAKCYYAKHKLVWEVLEG-------GLKSKIEIQWSDIMALKANFPD-DGPATLNIVLARRPLFFRETNPQPRKHTLW
        NFP S ++IG W   ++   D+VAK Y+AK KL+WE L G        LK KIEIQW+D+ + + +    D    L I L +RP FF ETNPQ  KHT W
Subjt:  NFPASHLKIGRWEYKSRYEGDLVAKCYYAKHKLVWEVLEG-------GLKSKIEIQWSDIMALKANFPD-DGPATLNIVLARRPLFFRETNPQPRKHTLW

Query:  -QATADFTDGEASLHRQHFMQCPQGLLNKHFEKLIQCDSRLNFLSRQPEIVSGSPYFEQRASGF
         Q   DFT   AS +R+H +  P G+L K+ EKL+  DS  + L   P  V  S YF+   SGF
Subjt:  -QATADFTDGEASLHRQHFMQCPQGLLNKHFEKLIQCDSRLNFLSRQPEIVSGSPYFEQRASGF

AT2G24100.1 unknown protein7.3e-11650.48Show/hide
Query:  MVEFMDSKTTL----------YPEKLAVKRECDRSFHYELGPLDKRFKSDLHQSIMGIGTLPATSLQSNLLDEPSPLGLTLRKSPSLLDLIQMKLSQGIT
        MVE M S+  L          +P KL +    + S   E  PL+KR K      +   GT   +  + +LL+EPSPLGL+L+KSPS  +LI+MKLSQ   
Subjt:  MVEFMDSKTTL----------YPEKLAVKRECDRSFHYELGPLDKRFKSDLHQSIMGIGTLPATSLQSNLLDEPSPLGLTLRKSPSLLDLIQMKLSQGIT

Query:  STAGYSNSGNFDFVVKKESMDVTMSGTAEKLKASNFPASHLKIGRWEYKSRYEGDLVAKCYYAKHKLVWEVLEGGLKSKIEIQWSDIMALKANFPDDGPA
               SG+    VKKES      GT EKLKASNFPA+ L+IG+WEYKSRYEGDLVAKCY+AKHKLVWEVLE GLKSKIEIQWSDIMALKAN P+D P 
Subjt:  STAGYSNSGNFDFVVKKESMDVTMSGTAEKLKASNFPASHLKIGRWEYKSRYEGDLVAKCYYAKHKLVWEVLEGGLKSKIEIQWSDIMALKANFPDDGPA

Query:  TLNIVLARRPLFFRETNPQPRKHTLWQATADFTDGEASLHRQHFMQCPQGLLNKHFEKLIQCDSRLNFLSRQPEIVSGSPYFEQRASGFTTSRPILSTFQ
        TL IVLARRPLFFRETNPQPRKHTLWQAT+DFTDG+AS++RQHF+QCP G++NKHFEKL+QCD RL  LSRQPEI   +P+F+ R S F    P +S   
Subjt:  TLNIVLARRPLFFRETNPQPRKHTLWQATADFTDGEASLHRQHFMQCPQGLLNKHFEKLIQCDSRLNFLSRQPEIVSGSPYFEQRASGFTTSRPILSTFQ

Query:  DVASSSAASLSKIEESSPQMVFEPCYLAAPSPSSVMDAQEIEENMITTKVTSMPRN-WEQIKFPRLHPSMSMSDLVNHIEQHITEQMTSTKSPFVDDGSQ
        ++AS   A  S    S            A SPSSVMDA+ IE   +   + S   N W QIK P LH S+SM+D +  +                D   +
Subjt:  DVASSSAASLSKIEESSPQMVFEPCYLAAPSPSSVMDAQEIEENMITTKVTSMPRN-WEQIKFPRLHPSMSMSDLVNHIEQHITEQMTSTKSPFVDDGSQ

Query:  CQEMLNELSLYLLSDNQLSAAATSDEASLMSRVNSLCCLLQKEPAAAAVQSFQPSGEICVEGFDSK--IPAGETKDVQPLA-------MLRKDSFADLLL
          +   E+   LLSDN  +    SDE S+MS+VNS C LLQ   AA +  + + +    V G D+   +P G  + V P +       M RKDSF+DLL+
Subjt:  CQEMLNELSLYLLSDNQLSAAATSDEASLMSRVNSLCCLLQKEPAAAAVQSFQPSGEICVEGFDSK--IPAGETKDVQPLA-------MLRKDSFADLLL

Query:  HLPRIASLPKFLFNISDGD
        HLPRI SLPKFLFNIS+ D
Subjt:  HLPRIASLPKFLFNISDGD

AT3G05770.1 unknown protein1.9e-3139.56Show/hide
Query:  SNLLDEPSPLGLTLRKSPSLLDLIQMKLSQGITSTAGYSNSGNFDFVVKKESMDVTMSGTAEKLKASNFPASHLKIGRWEYKSRYEGDLVAKCYYAKHKL
        +  +DE   L L L K+P L++ I+  L   +  T  +  + N        S   T+  + EKLKA NFP S +KIG   + ++   D+VAK Y+AK KL
Subjt:  SNLLDEPSPLGLTLRKSPSLLDLIQMKLSQGITSTAGYSNSGNFDFVVKKESMDVTMSGTAEKLKASNFPASHLKIGRWEYKSRYEGDLVAKCYYAKHKL

Query:  VWEVLEG-------GLKSKIEIQWSDIMALKANFPD-DGPATLNIVLARRPLFFRETNPQPRKHTLW-QATADFTDGEASLHRQHFMQCPQGLLNKHFEK
        +WE L G        LKSKIEIQW+D+ + + +    D    L I L +RP FF ETNPQ  KHT W Q   DFT  +AS +R+H +  P G+L K+ EK
Subjt:  VWEVLEG-------GLKSKIEIQWSDIMALKANFPD-DGPATLNIVLARRPLFFRETNPQPRKHTLW-QATADFTDGEASLHRQHFMQCPQGLLNKHFEK

Query:  LIQCDSRLNFLSRQPEIVSGSPYFE
        L+  DS  + L + P  V  S YF+
Subjt:  LIQCDSRLNFLSRQPEIVSGSPYFE

AT4G30780.1 unknown protein3.3e-10846.88Show/hide
Query:  EKLAVKRE-CDRSFHYELGPLDKRFKSDLHQSIMGIGTLPATSLQS--NLLDEPSPLGLTLRKSPSLLDLIQMKLSQ-GITSTAGYSNSGNFDFVVKKES
        ++L VK E  +     E GPL+KR +      +   GT  +T   +  N LDEPSPLGL+L+KSPSLL+LIQMK++  G    A    +G     +K+ES
Subjt:  EKLAVKRE-CDRSFHYELGPLDKRFKSDLHQSIMGIGTLPATSLQS--NLLDEPSPLGLTLRKSPSLLDLIQMKLSQ-GITSTAGYSNSGNFDFVVKKES

Query:  MDVTMS---------GTAEKLKASNFPASHLKIGRWEYKSRYEGDLVAKCYYAKHKLVWEVLEGGLKSKIEIQWSDIMALKANFPDDGPATLNIVLARRP
          +  +         G+ EKLKASNFPAS LKIG+WEYKSRYEGDLVAKCY+AKHKLVWEVLE GLKSKIEIQWSDIMALKAN P+DGP TL +VLAR+P
Subjt:  MDVTMS---------GTAEKLKASNFPASHLKIGRWEYKSRYEGDLVAKCYYAKHKLVWEVLEGGLKSKIEIQWSDIMALKANFPDDGPATLNIVLARRP

Query:  LFFRETNPQPRKHTLWQATADFTDGEASLHRQHFMQCPQGLLNKHFEKLIQCDSRLNFLSRQPEIVSGSPYFEQRASGF-TTSRPILSTFQDVASSSAAS
        LFFRETNPQPRKHTLWQAT+DFTDG+AS++RQHF+QC QG++NKHFEKL+QCD RL  LSRQPEI   SPYF+ R S F   S      F ++  S+  S
Subjt:  LFFRETNPQPRKHTLWQATADFTDGEASLHRQHFMQCPQGLLNKHFEKLIQCDSRLNFLSRQPEIVSGSPYFEQRASGF-TTSRPILSTFQDVASSSAAS

Query:  LSKIEE-SSP---QMVFEPCYLA--APSPSSVMDAQEIEENMITTKVTSMPR-NWEQIKFPRLHPSMSMSDLV-------NHIEQHITEQMTS-------
        +S  +  +SP   Q   E  YL+  APSPSSV+DA+  E    +  V S  R +  QI+ P +H SMS+SD +       N  + ++ + +         
Subjt:  LSKIEE-SSP---QMVFEPCYLA--APSPSSVMDAQEIEENMITTKVTSMPR-NWEQIKFPRLHPSMSMSDLV-------NHIEQHITEQMTS-------

Query:  -------TKSPFVDDGSQCQ-----------------------------EMLNELSLYLLSDNQLSAAATSDEASLMSRVNSLCCLLQKEPAAAAVQSFQ
               + S  + D  Q +                             E    +   LLSDN    A   DE SLM RVNSL  LL K+P  AA  + Q
Subjt:  -------TKSPFVDDGSQCQ-----------------------------EMLNELSLYLLSDNQLSAAATSDEASLMSRVNSLCCLLQKEPAAAAVQSFQ

Query:  PSGEICVEGFDSKIPAGETKD-----------------VQPLAMLRKDSFADLLLHLPRIASLPKFLFNISDGDED
         + E+ V G  S+ P G   D                  +P  MLRKDSF+DLLLHLPRI SLPKFL NIS+ D D
Subjt:  PSGEICVEGFDSKIPAGETKD-----------------VQPLAMLRKDSFADLLLHLPRIASLPKFLFNISDGDED


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTGGAATTCATGGATTCCAAGACGACGTTGTATCCGGAGAAATTGGCGGTAAAGCGGGAGTGTGATCGTTCGTTTCACTACGAGCTTGGTCCCTTGGATAAGCGGTT
CAAGTCCGATCTTCATCAGTCAATTATGGGAATTGGCACATTACCTGCAACTTCTTTACAGAGTAATCTGCTGGATGAGCCCAGTCCTCTTGGTTTGACTCTGAGGAAGA
GCCCGTCCCTGTTGGATCTAATTCAAATGAAACTCTCACAAGGGATTACGTCTACAGCTGGATATTCAAACAGTGGAAATTTCGACTTCGTTGTTAAAAAAGAAAGCATG
GATGTTACCATGTCGGGTACCGCTGAGAAACTAAAGGCTTCGAACTTTCCAGCTTCACATTTAAAGATTGGACGTTGGGAGTATAAATCTAGATATGAAGGTGATCTTGT
GGCCAAGTGTTACTATGCTAAGCATAAGCTTGTCTGGGAAGTCCTCGAGGGTGGACTCAAAAGTAAAATAGAGATACAATGGTCTGATATTATGGCTTTAAAAGCGAATT
TCCCTGATGACGGACCTGCTACACTAAATATTGTGCTGGCTAGACGGCCTCTTTTCTTCCGGGAGACCAATCCGCAACCAAGGAAACACACTTTGTGGCAGGCAACAGCT
GACTTTACGGATGGTGAGGCCAGCTTGCATAGACAGCATTTTATGCAATGCCCTCAAGGTCTTCTAAACAAGCATTTTGAAAAGCTCATCCAGTGTGACTCGCGCCTAAA
CTTCTTAAGCCGACAACCAGAGATTGTATCAGGCTCTCCATATTTTGAACAAAGAGCCTCTGGTTTTACAACCAGTCGACCAATACTTTCTACTTTCCAGGATGTAGCTT
CATCATCTGCTGCATCATTATCAAAGATAGAAGAGTCTTCTCCTCAAATGGTGTTTGAACCTTGCTACTTGGCGGCACCTTCACCCAGTTCAGTGATGGATGCTCAAGAG
ATTGAAGAAAACATGATCACAACTAAAGTAACCAGCATGCCAAGAAATTGGGAGCAGATAAAGTTTCCAAGGCTACATCCATCCATGTCGATGAGCGACCTCGTAAACCA
CATTGAGCAGCATATCACCGAACAAATGACTTCGACAAAGTCGCCTTTTGTCGATGATGGTTCGCAATGCCAAGAAATGCTGAATGAACTTTCACTATACTTGCTGAGTG
ACAATCAATTGTCAGCAGCAGCAACTTCCGACGAGGCATCTCTCATGTCGAGAGTCAATTCTCTATGTTGCCTATTGCAGAAAGAACCAGCTGCTGCTGCAGTTCAAAGT
TTTCAACCCAGCGGTGAAATCTGTGTTGAAGGATTCGATTCTAAGATTCCTGCAGGGGAAACGAAGGATGTTCAACCCTTGGCGATGTTGAGAAAAGATTCATTTGCTGA
TTTGCTGTTGCATCTTCCTCGAATTGCATCGCTGCCGAAGTTCTTGTTTAACATTTCGGATGGCGATGAAGATCAAGATTAA
mRNA sequenceShow/hide mRNA sequence
TTTAAATCTAGGGTTCGTGAAATTTTCCTCTCTTACATTTCTTCTTCGTTTTCATTTCTGAAAGGTTTTCTTGCGATTATCAATGGTGGAATTCATGGATTCCAAGACGA
CGTTGTATCCGGAGAAATTGGCGGTAAAGCGGGAGTGTGATCGTTCGTTTCACTACGAGCTTGGTCCCTTGGATAAGCGGTTCAAGTCCGATCTTCATCAGTCAATTATG
GGAATTGGCACATTACCTGCAACTTCTTTACAGAGTAATCTGCTGGATGAGCCCAGTCCTCTTGGTTTGACTCTGAGGAAGAGCCCGTCCCTGTTGGATCTAATTCAAAT
GAAACTCTCACAAGGGATTACGTCTACAGCTGGATATTCAAACAGTGGAAATTTCGACTTCGTTGTTAAAAAAGAAAGCATGGATGTTACCATGTCGGGTACCGCTGAGA
AACTAAAGGCTTCGAACTTTCCAGCTTCACATTTAAAGATTGGACGTTGGGAGTATAAATCTAGATATGAAGGTGATCTTGTGGCCAAGTGTTACTATGCTAAGCATAAG
CTTGTCTGGGAAGTCCTCGAGGGTGGACTCAAAAGTAAAATAGAGATACAATGGTCTGATATTATGGCTTTAAAAGCGAATTTCCCTGATGACGGACCTGCTACACTAAA
TATTGTGCTGGCTAGACGGCCTCTTTTCTTCCGGGAGACCAATCCGCAACCAAGGAAACACACTTTGTGGCAGGCAACAGCTGACTTTACGGATGGTGAGGCCAGCTTGC
ATAGACAGCATTTTATGCAATGCCCTCAAGGTCTTCTAAACAAGCATTTTGAAAAGCTCATCCAGTGTGACTCGCGCCTAAACTTCTTAAGCCGACAACCAGAGATTGTA
TCAGGCTCTCCATATTTTGAACAAAGAGCCTCTGGTTTTACAACCAGTCGACCAATACTTTCTACTTTCCAGGATGTAGCTTCATCATCTGCTGCATCATTATCAAAGAT
AGAAGAGTCTTCTCCTCAAATGGTGTTTGAACCTTGCTACTTGGCGGCACCTTCACCCAGTTCAGTGATGGATGCTCAAGAGATTGAAGAAAACATGATCACAACTAAAG
TAACCAGCATGCCAAGAAATTGGGAGCAGATAAAGTTTCCAAGGCTACATCCATCCATGTCGATGAGCGACCTCGTAAACCACATTGAGCAGCATATCACCGAACAAATG
ACTTCGACAAAGTCGCCTTTTGTCGATGATGGTTCGCAATGCCAAGAAATGCTGAATGAACTTTCACTATACTTGCTGAGTGACAATCAATTGTCAGCAGCAGCAACTTC
CGACGAGGCATCTCTCATGTCGAGAGTCAATTCTCTATGTTGCCTATTGCAGAAAGAACCAGCTGCTGCTGCAGTTCAAAGTTTTCAACCCAGCGGTGAAATCTGTGTTG
AAGGATTCGATTCTAAGATTCCTGCAGGGGAAACGAAGGATGTTCAACCCTTGGCGATGTTGAGAAAAGATTCATTTGCTGATTTGCTGTTGCATCTTCCTCGAATTGCA
TCGCTGCCGAAGTTCTTGTTTAACATTTCGGATGGCGATGAAGATCAAGATTAATTAGTTGCAAGCTAGTTTAGGCGTTTGGTTAGTATATAAATAACTATATGTATAGC
GATTGGAGATTCTGGAATTTCCCTTGCATGTAAAAATTCTTTTATTAGCAGACTAGACTCACTGTCCATCGACCACAGAACGCTTACATTGTACACATAATAGGTTTGAA
TTAATTATTATTTAAGGATGAAATTCAATTT
Protein sequenceShow/hide protein sequence
MVEFMDSKTTLYPEKLAVKRECDRSFHYELGPLDKRFKSDLHQSIMGIGTLPATSLQSNLLDEPSPLGLTLRKSPSLLDLIQMKLSQGITSTAGYSNSGNFDFVVKKESM
DVTMSGTAEKLKASNFPASHLKIGRWEYKSRYEGDLVAKCYYAKHKLVWEVLEGGLKSKIEIQWSDIMALKANFPDDGPATLNIVLARRPLFFRETNPQPRKHTLWQATA
DFTDGEASLHRQHFMQCPQGLLNKHFEKLIQCDSRLNFLSRQPEIVSGSPYFEQRASGFTTSRPILSTFQDVASSSAASLSKIEESSPQMVFEPCYLAAPSPSSVMDAQE
IEENMITTKVTSMPRNWEQIKFPRLHPSMSMSDLVNHIEQHITEQMTSTKSPFVDDGSQCQEMLNELSLYLLSDNQLSAAATSDEASLMSRVNSLCCLLQKEPAAAAVQS
FQPSGEICVEGFDSKIPAGETKDVQPLAMLRKDSFADLLLHLPRIASLPKFLFNISDGDEDQD