; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

ClCG10G002960 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene IDClCG10G002960
OrganismCitrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
DescriptionGlycine-rich domain-containing protein 1-like
Genome locationCG_Chr10:3691298..3700567
RNA-Seq ExpressionClCG10G002960
SyntenyClCG10G002960
Gene Ontology termsNA
InterPro domainsIPR009836 - Glycine-rich domain-containing protein-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
TYK19501.1 glycine-rich domain-containing protein 1-like [Cucumis melo var. makuwa]1.1e-12474.25Show/hide
Query:  MEKNKELEWAKAQKIEIGVDLVAAAKRQLQFLSAVDSD--------------QYNAYWLPLLAKHSESPLFEGPLVVPLDCEWIWHCHRLNPVQYISDCE
        MEKN+ELEW +AQ+IEIGVDLVAAAKRQLQFLSAV+ +              +YNAYWLPLLAKHSESPLF+GPLVVP DCEWIWHCHRLNPV+Y SDCE
Subjt:  MEKNKELEWAKAQKIEIGVDLVAAAKRQLQFLSAVDSD--------------QYNAYWLPLLAKHSESPLFEGPLVVPLDCEWIWHCHRLNPVQYISDCE

Query:  ELYGKILDNSNVISTTIIESCCLKETEKVWNELYPQEPFSFNYFNLQEDDVP--KDQLSQLEKYTKYDLVSAVKRQTPFFYQVSQPHMKNEDFLQEAVAR
        ELYGKILDNSNVIST  I S C +ETEKVWNELYP+EPF+FN FN   D +    + LS L+KYTKYDLVSAVKRQ+PFFYQVS+PHM NE FLQEAVAR
Subjt:  ELYGKILDNSNVISTTIIESCCLKETEKVWNELYPQEPFSFNYFNLQEDDVP--KDQLSQLEKYTKYDLVSAVKRQTPFFYQVSQPHMKNEDFLQEAVAR

Query:  YRGFLYLIKSAMDESVARFCVPTYDIDLIWHTHQLHPLSYCKDMKKLLGLVLEHDDTVTDRSVGKQLDIGFTGTTKQWDDTFGTRYWKIEPMYRGPAPA
        Y+GFLYLIKS  ++S+ RFCVPTYDIDLIWH+HQLHPLSYCKD+KK+LG VLEHDDT +DR+ GK+LD GF+GTTKQW+DTFGTRYWK   MYRG  P+
Subjt:  YRGFLYLIKSAMDESVARFCVPTYDIDLIWHTHQLHPLSYCKDMKKLLGLVLEHDDTVTDRSVGKQLDIGFTGTTKQWDDTFGTRYWKIEPMYRGPAPA

XP_008448876.1 PREDICTED: glycine-rich domain-containing protein 1-like [Cucumis melo]4.2e-12473.51Show/hide
Query:  MEKNKELEWAKAQKIEIGVDLVAAAKRQLQFLSAVDSD--------------QYNAYWLPLLAKHSESPLFEGPLVVPLDCEWIWHCHRLNPVQYISDCE
        MEKN+ELEW +AQ+IEIGVDLVAAAKRQLQFLSAV+ +              +YNAYWLPLLAKHSESPLF+GPLVVP DCEWIWHCHRLNPV+Y SDCE
Subjt:  MEKNKELEWAKAQKIEIGVDLVAAAKRQLQFLSAVDSD--------------QYNAYWLPLLAKHSESPLFEGPLVVPLDCEWIWHCHRLNPVQYISDCE

Query:  ELYGKILDNSNVISTTIIESCCLKETEKVWNELYPQEPFSFNYFNL-----QEDDVPKDQLSQLEKYTKYDLVSAVKRQTPFFYQVSQPHMKNEDFLQEA
        ELYGKILDNSNVIST  I S C +ETEKVWNELYP+EPF+FN FN       ++D+  + LS L+KYTKYDLVSAVKRQ+PFFYQVS+PHM NE FLQEA
Subjt:  ELYGKILDNSNVISTTIIESCCLKETEKVWNELYPQEPFSFNYFNL-----QEDDVPKDQLSQLEKYTKYDLVSAVKRQTPFFYQVSQPHMKNEDFLQEA

Query:  VARYRGFLYLIKSAMDESVARFCVPTYDIDLIWHTHQLHPLSYCKDMKKLLGLVLEHDDTVTDRSVGKQLDIGFTGTTKQWDDTFGTRYWKIEPMYRGPA
        VARY+GFLYLIKS  ++S+ RFCVPTYDIDLIWH+HQLHPLSYCKD+KK+LG VLEHDDT +DR+ GK+LD GF+GTTKQW+DTFGTRYWK   MYRG  
Subjt:  VARYRGFLYLIKSAMDESVARFCVPTYDIDLIWHTHQLHPLSYCKDMKKLLGLVLEHDDTVTDRSVGKQLDIGFTGTTKQWDDTFGTRYWKIEPMYRGPA

Query:  PA
        P+
Subjt:  PA

XP_023007682.1 glycine-rich domain-containing protein 1-like isoform X1 [Cucurbita maxima]9.4e-12473.33Show/hide
Query:  MEKNKELEWAKAQKIEIGVDLVAAAKRQLQFLSAVDSD--------------QYNAYWLPLLAKHSESPLFEGPLVVPLDCEWIWHCHRLNPVQYISDCE
        MEKN+ELEWA+AQ+IEIGVDLVAAAKRQLQFLSAVD +              +YNAYWLPLLAKHSESPLFEGPLVVP DCEWIWHCHRLNPV+Y SDCE
Subjt:  MEKNKELEWAKAQKIEIGVDLVAAAKRQLQFLSAVDSD--------------QYNAYWLPLLAKHSESPLFEGPLVVPLDCEWIWHCHRLNPVQYISDCE

Query:  ELYGKILDNSNVISTTIIESCCLKETEKVWNELYPQEPFSFNY---FNLQEDDVPKDQLSQLEKYTKYDLVSAVKRQTPFFYQVSQPHMKNEDFLQEAVA
        ELYGKILDNSNV+ST  + S C +ETE+VWNELYP+E F+FN+      QED +    LS LEKYTKYDLVSAVKRQ+PFFYQVS+PHM NE FLQEAVA
Subjt:  ELYGKILDNSNVISTTIIESCCLKETEKVWNELYPQEPFSFNY---FNLQEDDVPKDQLSQLEKYTKYDLVSAVKRQTPFFYQVSQPHMKNEDFLQEAVA

Query:  RYRGFLYLIKSAMDESVARFCVPTYDIDLIWHTHQLHPLSYCKDMKKLLGLVLEHDDTVTDRSVGKQLDIGFTGTTKQWDDTFGTRYWKIEPMYRGPAPA
        RY+GFLYLIKS  ++S+ RFCVPTYDIDLIWHTHQLHP+SYCKD+K LLG++LEHDD  +DR+ GK+LD GF+GTTKQW+DTFGTRYWK   MYRG +P+
Subjt:  RYRGFLYLIKSAMDESVARFCVPTYDIDLIWHTHQLHPLSYCKDMKKLLGLVLEHDDTVTDRSVGKQLDIGFTGTTKQWDDTFGTRYWKIEPMYRGPAPA

XP_023552041.1 glycine-rich domain-containing protein 1-like [Cucurbita pepo subsp. pepo]7.2e-12473Show/hide
Query:  MEKNKELEWAKAQKIEIGVDLVAAAKRQLQFLSAVDSD--------------QYNAYWLPLLAKHSESPLFEGPLVVPLDCEWIWHCHRLNPVQYISDCE
        MEKN+ELEWA+AQ+IEIGVDLVAAAKRQLQFLSAVD +              +YNAYWLPLLAKHSESPLFEGPL VP DCEWIWHCHRLNPV+Y S+CE
Subjt:  MEKNKELEWAKAQKIEIGVDLVAAAKRQLQFLSAVDSD--------------QYNAYWLPLLAKHSESPLFEGPLVVPLDCEWIWHCHRLNPVQYISDCE

Query:  ELYGKILDNSNVISTTIIESCCLKETEKVWNELYPQEPFSFNY---FNLQEDDVPKDQLSQLEKYTKYDLVSAVKRQTPFFYQVSQPHMKNEDFLQEAVA
        ELYGKILDNSNV+ST  + S CL+ETE+VWNELYP+EPF+FN+      QED +    LS L+KYTKYDLVSAVKRQ+PFFYQVS+PHM NE FL+EAVA
Subjt:  ELYGKILDNSNVISTTIIESCCLKETEKVWNELYPQEPFSFNY---FNLQEDDVPKDQLSQLEKYTKYDLVSAVKRQTPFFYQVSQPHMKNEDFLQEAVA

Query:  RYRGFLYLIKSAMDESVARFCVPTYDIDLIWHTHQLHPLSYCKDMKKLLGLVLEHDDTVTDRSVGKQLDIGFTGTTKQWDDTFGTRYWKIEPMYRGPAPA
        RY+GFLYLIKS  + S+ RFCVPTYDIDLIWHTHQLHP+SYCKD+K LLG+VLEHDD  +DR+ GK+LD GF+GTTKQW+DTFGTRYWK   MYRG +P+
Subjt:  RYRGFLYLIKSAMDESVARFCVPTYDIDLIWHTHQLHPLSYCKDMKKLLGLVLEHDDTVTDRSVGKQLDIGFTGTTKQWDDTFGTRYWKIEPMYRGPAPA

XP_038876850.1 uncharacterized protein LOC120069217 [Benincasa hispida]1.0e-15469.8Show/hide
Query:  RKMEKNKELEWAKAQKIEIGVDLVAAAKRQLQFLSAVDS--------------DQYNAYWLPLLAKHSESPLFEGPLVVPLDCEWIWHCHRLNPVQYISD
        RK+EKNKELEW KAQKIEIGVDLVAAAK QLQFLS VDS               +YNAYWLPLLAKHSESPLFEGPLVVPLDCEWIWH HRLNPVQYI+D
Subjt:  RKMEKNKELEWAKAQKIEIGVDLVAAAKRQLQFLSAVDS--------------DQYNAYWLPLLAKHSESPLFEGPLVVPLDCEWIWHCHRLNPVQYISD

Query:  CEELYGKILDNSNVISTTIIES-CCLKETEKVWNELYPQEPFSFNYFNLQEDDVPKD-QLSQLEKYTKYDLVSAVKRQTPFFYQVSQPHMKNEDFLQEAV
        CEELYGKILDNSNVISTT++ES C +KETE +WNELYP+EPFSFN FN QE DVPKD QLSQLEKYTKYDLV AVKRQTPFFYQVSQPHMKNE+FLQEA+
Subjt:  CEELYGKILDNSNVISTTIIES-CCLKETEKVWNELYPQEPFSFNYFNLQEDDVPKD-QLSQLEKYTKYDLVSAVKRQTPFFYQVSQPHMKNEDFLQEAV

Query:  ARYRGFLYLIKSAMDESVARFCVPTYDIDLIWHTHQLHPLSYCKDMKKLLGLVLEHDDTVTDRSVGKQLDIGFTGTTKQWDDTFGTRYWKIEPMYRGPAP
        ARYRGFLYLIKS+MDESV  FCVPTYDIDLIWHTHQLHPL YCKDMKKLLGLVLEHDDTVTDR+VG++LDIGFTGTTKQWDDTFGT Y KI  MYRGPAP
Subjt:  ARYRGFLYLIKSAMDESVARFCVPTYDIDLIWHTHQLHPLSYCKDMKKLLGLVLEHDDTVTDRSVGKQLDIGFTGTTKQWDDTFGTRYWKIEPMYRGPAP

Query:  AGSCGSKVQGGGCTAAAGSCGSKVQGGGCTAAAGSCGSKVEGGGCTAAAGSCGSKVEGGGCTAAAGSCGSKVEGGGCTAAAGSCGSKVEGGGCTAAAGSC
              ++QGG     +G C  +  GG     A  CGSKVE  G TA AG CGSKVE  G    AG CGSKV   G T AA   GSKV G      A  C
Subjt:  AGSCGSKVQGGGCTAAAGSCGSKVQGGGCTAAAGSCGSKVEGGGCTAAAGSCGSKVEGGGCTAAAGSCGSKVEGGGCTAAAGSCGSKVEGGGCTAAAGSC

Query:  GSKVEGGGCTAAAGSCGSKVEGGGCTAAAGSCGSKVEGGGCTAAAGSCGSKVEGGGCTAAAGSCGSKVEGGGCTAAAGSCGSKVEGGGCTAAAGSCGSKV
        GS ++    T A G CGSKV  GG T A+ +CGSKVE  G T AAG CGSKV  GG  AAAG CGSK+E        G CGSK+E  G T  AG C  +V
Subjt:  GSKVEGGGCTAAAGSCGSKVEGGGCTAAAGSCGSKVEGGGCTAAAGSCGSKVEGGGCTAAAGSCGSKVEGGGCTAAAGSCGSKVEGGGCTAAAGSCGSKV

TrEMBL top hitse value%identityAlignment
A0A1S3BLM1 glycine-rich domain-containing protein 1-like2.0e-12473.51Show/hide
Query:  MEKNKELEWAKAQKIEIGVDLVAAAKRQLQFLSAVDSD--------------QYNAYWLPLLAKHSESPLFEGPLVVPLDCEWIWHCHRLNPVQYISDCE
        MEKN+ELEW +AQ+IEIGVDLVAAAKRQLQFLSAV+ +              +YNAYWLPLLAKHSESPLF+GPLVVP DCEWIWHCHRLNPV+Y SDCE
Subjt:  MEKNKELEWAKAQKIEIGVDLVAAAKRQLQFLSAVDSD--------------QYNAYWLPLLAKHSESPLFEGPLVVPLDCEWIWHCHRLNPVQYISDCE

Query:  ELYGKILDNSNVISTTIIESCCLKETEKVWNELYPQEPFSFNYFNL-----QEDDVPKDQLSQLEKYTKYDLVSAVKRQTPFFYQVSQPHMKNEDFLQEA
        ELYGKILDNSNVIST  I S C +ETEKVWNELYP+EPF+FN FN       ++D+  + LS L+KYTKYDLVSAVKRQ+PFFYQVS+PHM NE FLQEA
Subjt:  ELYGKILDNSNVISTTIIESCCLKETEKVWNELYPQEPFSFNYFNL-----QEDDVPKDQLSQLEKYTKYDLVSAVKRQTPFFYQVSQPHMKNEDFLQEA

Query:  VARYRGFLYLIKSAMDESVARFCVPTYDIDLIWHTHQLHPLSYCKDMKKLLGLVLEHDDTVTDRSVGKQLDIGFTGTTKQWDDTFGTRYWKIEPMYRGPA
        VARY+GFLYLIKS  ++S+ RFCVPTYDIDLIWH+HQLHPLSYCKD+KK+LG VLEHDDT +DR+ GK+LD GF+GTTKQW+DTFGTRYWK   MYRG  
Subjt:  VARYRGFLYLIKSAMDESVARFCVPTYDIDLIWHTHQLHPLSYCKDMKKLLGLVLEHDDTVTDRSVGKQLDIGFTGTTKQWDDTFGTRYWKIEPMYRGPA

Query:  PA
        P+
Subjt:  PA

A0A5D3D7F4 Glycine-rich domain-containing protein 1-like5.4e-12574.25Show/hide
Query:  MEKNKELEWAKAQKIEIGVDLVAAAKRQLQFLSAVDSD--------------QYNAYWLPLLAKHSESPLFEGPLVVPLDCEWIWHCHRLNPVQYISDCE
        MEKN+ELEW +AQ+IEIGVDLVAAAKRQLQFLSAV+ +              +YNAYWLPLLAKHSESPLF+GPLVVP DCEWIWHCHRLNPV+Y SDCE
Subjt:  MEKNKELEWAKAQKIEIGVDLVAAAKRQLQFLSAVDSD--------------QYNAYWLPLLAKHSESPLFEGPLVVPLDCEWIWHCHRLNPVQYISDCE

Query:  ELYGKILDNSNVISTTIIESCCLKETEKVWNELYPQEPFSFNYFNLQEDDVP--KDQLSQLEKYTKYDLVSAVKRQTPFFYQVSQPHMKNEDFLQEAVAR
        ELYGKILDNSNVIST  I S C +ETEKVWNELYP+EPF+FN FN   D +    + LS L+KYTKYDLVSAVKRQ+PFFYQVS+PHM NE FLQEAVAR
Subjt:  ELYGKILDNSNVISTTIIESCCLKETEKVWNELYPQEPFSFNYFNLQEDDVP--KDQLSQLEKYTKYDLVSAVKRQTPFFYQVSQPHMKNEDFLQEAVAR

Query:  YRGFLYLIKSAMDESVARFCVPTYDIDLIWHTHQLHPLSYCKDMKKLLGLVLEHDDTVTDRSVGKQLDIGFTGTTKQWDDTFGTRYWKIEPMYRGPAPA
        Y+GFLYLIKS  ++S+ RFCVPTYDIDLIWH+HQLHPLSYCKD+KK+LG VLEHDDT +DR+ GK+LD GF+GTTKQW+DTFGTRYWK   MYRG  P+
Subjt:  YRGFLYLIKSAMDESVARFCVPTYDIDLIWHTHQLHPLSYCKDMKKLLGLVLEHDDTVTDRSVGKQLDIGFTGTTKQWDDTFGTRYWKIEPMYRGPAPA

A0A6J1KZD5 glycine-rich domain-containing protein 1-like isoform X14.6e-12473.33Show/hide
Query:  MEKNKELEWAKAQKIEIGVDLVAAAKRQLQFLSAVDSD--------------QYNAYWLPLLAKHSESPLFEGPLVVPLDCEWIWHCHRLNPVQYISDCE
        MEKN+ELEWA+AQ+IEIGVDLVAAAKRQLQFLSAVD +              +YNAYWLPLLAKHSESPLFEGPLVVP DCEWIWHCHRLNPV+Y SDCE
Subjt:  MEKNKELEWAKAQKIEIGVDLVAAAKRQLQFLSAVDSD--------------QYNAYWLPLLAKHSESPLFEGPLVVPLDCEWIWHCHRLNPVQYISDCE

Query:  ELYGKILDNSNVISTTIIESCCLKETEKVWNELYPQEPFSFNY---FNLQEDDVPKDQLSQLEKYTKYDLVSAVKRQTPFFYQVSQPHMKNEDFLQEAVA
        ELYGKILDNSNV+ST  + S C +ETE+VWNELYP+E F+FN+      QED +    LS LEKYTKYDLVSAVKRQ+PFFYQVS+PHM NE FLQEAVA
Subjt:  ELYGKILDNSNVISTTIIESCCLKETEKVWNELYPQEPFSFNY---FNLQEDDVPKDQLSQLEKYTKYDLVSAVKRQTPFFYQVSQPHMKNEDFLQEAVA

Query:  RYRGFLYLIKSAMDESVARFCVPTYDIDLIWHTHQLHPLSYCKDMKKLLGLVLEHDDTVTDRSVGKQLDIGFTGTTKQWDDTFGTRYWKIEPMYRGPAPA
        RY+GFLYLIKS  ++S+ RFCVPTYDIDLIWHTHQLHP+SYCKD+K LLG++LEHDD  +DR+ GK+LD GF+GTTKQW+DTFGTRYWK   MYRG +P+
Subjt:  RYRGFLYLIKSAMDESVARFCVPTYDIDLIWHTHQLHPLSYCKDMKKLLGLVLEHDDTVTDRSVGKQLDIGFTGTTKQWDDTFGTRYWKIEPMYRGPAPA

A0A6J1L3N7 glycine-rich domain-containing protein 1-like isoform X34.6e-12473.33Show/hide
Query:  MEKNKELEWAKAQKIEIGVDLVAAAKRQLQFLSAVDSD--------------QYNAYWLPLLAKHSESPLFEGPLVVPLDCEWIWHCHRLNPVQYISDCE
        MEKN+ELEWA+AQ+IEIGVDLVAAAKRQLQFLSAVD +              +YNAYWLPLLAKHSESPLFEGPLVVP DCEWIWHCHRLNPV+Y SDCE
Subjt:  MEKNKELEWAKAQKIEIGVDLVAAAKRQLQFLSAVDSD--------------QYNAYWLPLLAKHSESPLFEGPLVVPLDCEWIWHCHRLNPVQYISDCE

Query:  ELYGKILDNSNVISTTIIESCCLKETEKVWNELYPQEPFSFNY---FNLQEDDVPKDQLSQLEKYTKYDLVSAVKRQTPFFYQVSQPHMKNEDFLQEAVA
        ELYGKILDNSNV+ST  + S C +ETE+VWNELYP+E F+FN+      QED +    LS LEKYTKYDLVSAVKRQ+PFFYQVS+PHM NE FLQEAVA
Subjt:  ELYGKILDNSNVISTTIIESCCLKETEKVWNELYPQEPFSFNY---FNLQEDDVPKDQLSQLEKYTKYDLVSAVKRQTPFFYQVSQPHMKNEDFLQEAVA

Query:  RYRGFLYLIKSAMDESVARFCVPTYDIDLIWHTHQLHPLSYCKDMKKLLGLVLEHDDTVTDRSVGKQLDIGFTGTTKQWDDTFGTRYWKIEPMYRGPAPA
        RY+GFLYLIKS  ++S+ RFCVPTYDIDLIWHTHQLHP+SYCKD+K LLG++LEHDD  +DR+ GK+LD GF+GTTKQW+DTFGTRYWK   MYRG +P+
Subjt:  RYRGFLYLIKSAMDESVARFCVPTYDIDLIWHTHQLHPLSYCKDMKKLLGLVLEHDDTVTDRSVGKQLDIGFTGTTKQWDDTFGTRYWKIEPMYRGPAPA

A0A6J1L8C3 glycine-rich domain-containing protein 1-like isoform X24.6e-12473.33Show/hide
Query:  MEKNKELEWAKAQKIEIGVDLVAAAKRQLQFLSAVDSD--------------QYNAYWLPLLAKHSESPLFEGPLVVPLDCEWIWHCHRLNPVQYISDCE
        MEKN+ELEWA+AQ+IEIGVDLVAAAKRQLQFLSAVD +              +YNAYWLPLLAKHSESPLFEGPLVVP DCEWIWHCHRLNPV+Y SDCE
Subjt:  MEKNKELEWAKAQKIEIGVDLVAAAKRQLQFLSAVDSD--------------QYNAYWLPLLAKHSESPLFEGPLVVPLDCEWIWHCHRLNPVQYISDCE

Query:  ELYGKILDNSNVISTTIIESCCLKETEKVWNELYPQEPFSFNY---FNLQEDDVPKDQLSQLEKYTKYDLVSAVKRQTPFFYQVSQPHMKNEDFLQEAVA
        ELYGKILDNSNV+ST  + S C +ETE+VWNELYP+E F+FN+      QED +    LS LEKYTKYDLVSAVKRQ+PFFYQVS+PHM NE FLQEAVA
Subjt:  ELYGKILDNSNVISTTIIESCCLKETEKVWNELYPQEPFSFNY---FNLQEDDVPKDQLSQLEKYTKYDLVSAVKRQTPFFYQVSQPHMKNEDFLQEAVA

Query:  RYRGFLYLIKSAMDESVARFCVPTYDIDLIWHTHQLHPLSYCKDMKKLLGLVLEHDDTVTDRSVGKQLDIGFTGTTKQWDDTFGTRYWKIEPMYRGPAPA
        RY+GFLYLIKS  ++S+ RFCVPTYDIDLIWHTHQLHP+SYCKD+K LLG++LEHDD  +DR+ GK+LD GF+GTTKQW+DTFGTRYWK   MYRG +P+
Subjt:  RYRGFLYLIKSAMDESVARFCVPTYDIDLIWHTHQLHPLSYCKDMKKLLGLVLEHDDTVTDRSVGKQLDIGFTGTTKQWDDTFGTRYWKIEPMYRGPAPA

SwissProt top hitse value%identityAlignment
Q9SZJ2 Glycine-rich domain-containing protein 21.9e-9858.45Show/hide
Query:  EKNKELEWAKAQKIEIGVDLVAAAKRQLQFLSAVDSD--------------QYNAYWLPLLAKHSE-SPLFEGPLVVPLDCEWIWHCHRLNPVQYISDCE
        EK + LEW +AQKI+I VDL+AAAK+ L FL AVD +              +YNAYWLPLLA+++E S + +GPLV PLDCEW+WHCHRLNPV+Y +DCE
Subjt:  EKNKELEWAKAQKIEIGVDLVAAAKRQLQFLSAVDSD--------------QYNAYWLPLLAKHSE-SPLFEGPLVVPLDCEWIWHCHRLNPVQYISDCE

Query:  ELYGKILDNSNVISTTIIESCCLKETEKVWNELYPQEPFSFNYFNLQEDDVPKDQLSQLEKYTKYDLVSAVKRQTPFFYQVSQPHMKNEDFLQEAVARYR
        + YG++LDNS V+S+  +   C  +TE +W  LYP EP+  ++ N   +  P D +S LEK T YDLV AVKRQ+PFFYQVS+ H+ N+ FLQEAVARY+
Subjt:  ELYGKILDNSNVISTTIIESCCLKETEKVWNELYPQEPFSFNYFNLQEDDVPKDQLSQLEKYTKYDLVSAVKRQTPFFYQVSQPHMKNEDFLQEAVARYR

Query:  GFLYLIKSAMDESVARFCVPTYDIDLIWHTHQLHPLSYCKDMKKLLGLVLEHDDTVTDRSVGKQLDIGFTGTTKQWDDTFGTRYWKIEPMYRGPAP
         FLYLIK   + S+  FCVPTYDIDLIWHTHQLH +SYC D+ K++G VLEHDDT +DRS GK+LD GF+GTT QW++TFG RYWK   M RG  P
Subjt:  GFLYLIKSAMDESVARFCVPTYDIDLIWHTHQLHPLSYCKDMKKLLGLVLEHDDTVTDRSVGKQLDIGFTGTTKQWDDTFGTRYWKIEPMYRGPAP

Q9ZQ47 Glycine-rich domain-containing protein 15.2e-10159.8Show/hide
Query:  EKNKELEWAKAQKIEIGVDLVAAAKRQLQFLSAVDSD--------------QYNAYWLPLLAKHSE-SPLFEGPLVVPLDCEWIWHCHRLNPVQYISDCE
        EK+ E+EW +AQKIEI VDL+AAAK+ L FL  VD +              +YNA WLPLL K+SE S + EG LV PLDCEWIWHCHRLNPV+Y SDCE
Subjt:  EKNKELEWAKAQKIEIGVDLVAAAKRQLQFLSAVDSD--------------QYNAYWLPLLAKHSE-SPLFEGPLVVPLDCEWIWHCHRLNPVQYISDCE

Query:  ELYGKILDNSNVISTTIIESCCLKETEKVWNELYPQEPFSFNYFNLQEDDVPKDQLSQLEKYTKYDLVSAVKRQTPFFYQVSQPHMKNEDFLQEAVARYR
        + YG++LDNS V+S+  ++  C  +TE +W  LYP EP+  +  N+  +D+  ++ S LEK TKYDLVSAVKRQ+PF+YQVS+ H+ ++ FLQEAVARY+
Subjt:  ELYGKILDNSNVISTTIIESCCLKETEKVWNELYPQEPFSFNYFNLQEDDVPKDQLSQLEKYTKYDLVSAVKRQTPFFYQVSQPHMKNEDFLQEAVARYR

Query:  GFLYLIKSAMDESVARFCVPTYDIDLIWHTHQLHPLSYCKDMKKLLGLVLEHDDTVTDRSVGKQLDIGFTGTTKQWDDTFGTRYWKIEPMYRGPAP
        GFLYLIK   + S+ RFCVPTYD+DLIWHTHQLHP+SYC DM KL+G VLEHDDT +DR  GK+LD GF+ TT QW++TFGTRYWK   M+RG  P
Subjt:  GFLYLIKSAMDESVARFCVPTYDIDLIWHTHQLHPLSYCKDMKKLLGLVLEHDDTVTDRSVGKQLDIGFTGTTKQWDDTFGTRYWKIEPMYRGPAP

Arabidopsis top hitse value%identityAlignment
AT1G56230.1 Protein of unknown function (DUF1399)1.6e-2528.83Show/hide
Query:  EWAKAQKIEIGVDLVAAAKRQLQFLSAVDSDQ--------------YNAYWLPLLAKHSESPLFEGPLVV-PLDCEWIWHCHRLNPVQYISDCEELYGKI
        E ++   + IG D++++A+R +  L +V   Q              Y+  W+PL++  +     + P+++ PLD EW+W CH LNPV Y   CE  + K+
Subjt:  EWAKAQKIEIGVDLVAAAKRQLQFLSAVDSDQ--------------YNAYWLPLLAKHSESPLFEGPLVV-PLDCEWIWHCHRLNPVQYISDCEELYGKI

Query:  LDNSNVISTTIIESCCLKETEKVWNELYPQEPFSFNYFNLQEDDVPKDQLSQLEKYTKYDLVSAVKRQTPFFYQVSQPHMKNEDFLQEAVARYRGFLYLI
        +    +      E   + + EK+W+  YP E F        E+    D L  +    + D+ S VK+Q   + + S P+M    +L  A  RY+GFL ++
Subjt:  LDNSNVISTTIIESCCLKETEKVWNELYPQEPFSFNYFNLQEDDVPKDQLSQLEKYTKYDLVSAVKRQTPFFYQVSQPHMKNEDFLQEAVARYRGFLYLI

Query:  KSAMDESVARFCVPTYDIDLIWHTHQLHPLSYCKDMKKLLGLVLEHDDTVTDRSVGKQLDIGFTGTTKQ-WDDTFGTRYWK
            DE      +P  DI L+W THQ +P  Y  D+ ++L      + T     VG++++     TTK+ WD  F   Y K
Subjt:  KSAMDESVARFCVPTYDIDLIWHTHQLHPLSYCKDMKKLLGLVLEHDDTVTDRSVGKQLDIGFTGTTKQ-WDDTFGTRYWK

AT1G56230.2 Protein of unknown function (DUF1399)1.6e-2528.83Show/hide
Query:  EWAKAQKIEIGVDLVAAAKRQLQFLSAVDSDQ--------------YNAYWLPLLAKHSESPLFEGPLVV-PLDCEWIWHCHRLNPVQYISDCEELYGKI
        E ++   + IG D++++A+R +  L +V   Q              Y+  W+PL++  +     + P+++ PLD EW+W CH LNPV Y   CE  + K+
Subjt:  EWAKAQKIEIGVDLVAAAKRQLQFLSAVDSDQ--------------YNAYWLPLLAKHSESPLFEGPLVV-PLDCEWIWHCHRLNPVQYISDCEELYGKI

Query:  LDNSNVISTTIIESCCLKETEKVWNELYPQEPFSFNYFNLQEDDVPKDQLSQLEKYTKYDLVSAVKRQTPFFYQVSQPHMKNEDFLQEAVARYRGFLYLI
        +    +      E   + + EK+W+  YP E F        E+    D L  +    + D+ S VK+Q   + + S P+M    +L  A  RY+GFL ++
Subjt:  LDNSNVISTTIIESCCLKETEKVWNELYPQEPFSFNYFNLQEDDVPKDQLSQLEKYTKYDLVSAVKRQTPFFYQVSQPHMKNEDFLQEAVARYRGFLYLI

Query:  KSAMDESVARFCVPTYDIDLIWHTHQLHPLSYCKDMKKLLGLVLEHDDTVTDRSVGKQLDIGFTGTTKQ-WDDTFGTRYWK
            DE      +P  DI L+W THQ +P  Y  D+ ++L      + T     VG++++     TTK+ WD  F   Y K
Subjt:  KSAMDESVARFCVPTYDIDLIWHTHQLHPLSYCKDMKKLLGLVLEHDDTVTDRSVGKQLDIGFTGTTKQ-WDDTFGTRYWK

AT2G22660.1 Protein of unknown function (duplicated DUF1399)3.7e-10259.8Show/hide
Query:  EKNKELEWAKAQKIEIGVDLVAAAKRQLQFLSAVDSD--------------QYNAYWLPLLAKHSE-SPLFEGPLVVPLDCEWIWHCHRLNPVQYISDCE
        EK+ E+EW +AQKIEI VDL+AAAK+ L FL  VD +              +YNA WLPLL K+SE S + EG LV PLDCEWIWHCHRLNPV+Y SDCE
Subjt:  EKNKELEWAKAQKIEIGVDLVAAAKRQLQFLSAVDSD--------------QYNAYWLPLLAKHSE-SPLFEGPLVVPLDCEWIWHCHRLNPVQYISDCE

Query:  ELYGKILDNSNVISTTIIESCCLKETEKVWNELYPQEPFSFNYFNLQEDDVPKDQLSQLEKYTKYDLVSAVKRQTPFFYQVSQPHMKNEDFLQEAVARYR
        + YG++LDNS V+S+  ++  C  +TE +W  LYP EP+  +  N+  +D+  ++ S LEK TKYDLVSAVKRQ+PF+YQVS+ H+ ++ FLQEAVARY+
Subjt:  ELYGKILDNSNVISTTIIESCCLKETEKVWNELYPQEPFSFNYFNLQEDDVPKDQLSQLEKYTKYDLVSAVKRQTPFFYQVSQPHMKNEDFLQEAVARYR

Query:  GFLYLIKSAMDESVARFCVPTYDIDLIWHTHQLHPLSYCKDMKKLLGLVLEHDDTVTDRSVGKQLDIGFTGTTKQWDDTFGTRYWKIEPMYRGPAP
        GFLYLIK   + S+ RFCVPTYD+DLIWHTHQLHP+SYC DM KL+G VLEHDDT +DR  GK+LD GF+ TT QW++TFGTRYWK   M+RG  P
Subjt:  GFLYLIKSAMDESVARFCVPTYDIDLIWHTHQLHPLSYCKDMKKLLGLVLEHDDTVTDRSVGKQLDIGFTGTTKQWDDTFGTRYWKIEPMYRGPAP

AT2G22660.2 Protein of unknown function (duplicated DUF1399)3.7e-10259.8Show/hide
Query:  EKNKELEWAKAQKIEIGVDLVAAAKRQLQFLSAVDSD--------------QYNAYWLPLLAKHSE-SPLFEGPLVVPLDCEWIWHCHRLNPVQYISDCE
        EK+ E+EW +AQKIEI VDL+AAAK+ L FL  VD +              +YNA WLPLL K+SE S + EG LV PLDCEWIWHCHRLNPV+Y SDCE
Subjt:  EKNKELEWAKAQKIEIGVDLVAAAKRQLQFLSAVDSD--------------QYNAYWLPLLAKHSE-SPLFEGPLVVPLDCEWIWHCHRLNPVQYISDCE

Query:  ELYGKILDNSNVISTTIIESCCLKETEKVWNELYPQEPFSFNYFNLQEDDVPKDQLSQLEKYTKYDLVSAVKRQTPFFYQVSQPHMKNEDFLQEAVARYR
        + YG++LDNS V+S+  ++  C  +TE +W  LYP EP+  +  N+  +D+  ++ S LEK TKYDLVSAVKRQ+PF+YQVS+ H+ ++ FLQEAVARY+
Subjt:  ELYGKILDNSNVISTTIIESCCLKETEKVWNELYPQEPFSFNYFNLQEDDVPKDQLSQLEKYTKYDLVSAVKRQTPFFYQVSQPHMKNEDFLQEAVARYR

Query:  GFLYLIKSAMDESVARFCVPTYDIDLIWHTHQLHPLSYCKDMKKLLGLVLEHDDTVTDRSVGKQLDIGFTGTTKQWDDTFGTRYWKIEPMYRGPAP
        GFLYLIK   + S+ RFCVPTYD+DLIWHTHQLHP+SYC DM KL+G VLEHDDT +DR  GK+LD GF+ TT QW++TFGTRYWK   M+RG  P
Subjt:  GFLYLIKSAMDESVARFCVPTYDIDLIWHTHQLHPLSYCKDMKKLLGLVLEHDDTVTDRSVGKQLDIGFTGTTKQWDDTFGTRYWKIEPMYRGPAP

AT4G37900.1 Protein of unknown function (duplicated DUF1399)1.3e-9958.45Show/hide
Query:  EKNKELEWAKAQKIEIGVDLVAAAKRQLQFLSAVDSD--------------QYNAYWLPLLAKHSE-SPLFEGPLVVPLDCEWIWHCHRLNPVQYISDCE
        EK + LEW +AQKI+I VDL+AAAK+ L FL AVD +              +YNAYWLPLLA+++E S + +GPLV PLDCEW+WHCHRLNPV+Y +DCE
Subjt:  EKNKELEWAKAQKIEIGVDLVAAAKRQLQFLSAVDSD--------------QYNAYWLPLLAKHSE-SPLFEGPLVVPLDCEWIWHCHRLNPVQYISDCE

Query:  ELYGKILDNSNVISTTIIESCCLKETEKVWNELYPQEPFSFNYFNLQEDDVPKDQLSQLEKYTKYDLVSAVKRQTPFFYQVSQPHMKNEDFLQEAVARYR
        + YG++LDNS V+S+  +   C  +TE +W  LYP EP+  ++ N   +  P D +S LEK T YDLV AVKRQ+PFFYQVS+ H+ N+ FLQEAVARY+
Subjt:  ELYGKILDNSNVISTTIIESCCLKETEKVWNELYPQEPFSFNYFNLQEDDVPKDQLSQLEKYTKYDLVSAVKRQTPFFYQVSQPHMKNEDFLQEAVARYR

Query:  GFLYLIKSAMDESVARFCVPTYDIDLIWHTHQLHPLSYCKDMKKLLGLVLEHDDTVTDRSVGKQLDIGFTGTTKQWDDTFGTRYWKIEPMYRGPAP
         FLYLIK   + S+  FCVPTYDIDLIWHTHQLH +SYC D+ K++G VLEHDDT +DRS GK+LD GF+GTT QW++TFG RYWK   M RG  P
Subjt:  GFLYLIKSAMDESVARFCVPTYDIDLIWHTHQLHPLSYCKDMKKLLGLVLEHDDTVTDRSVGKQLDIGFTGTTKQWDDTFGTRYWKIEPMYRGPAP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGAAGAACAAAGAGCTTGAGTGGGCAAAAGCACAGAAGATTGAAATAGGTGTTGATCTTGTAGCTGCTGCTAAACGTCAGCTTCAGTTCCTCTCTGCAGTC
GATAGCGATCAGTACAATGCTTATTGGCTTCCATTGCTTGCTAAACATTCTGAATCCCCACTATTTGAAGGACCTTTGGTTGTCCCTTTGGACTGTGAATGGATT
TGGCATTGCCACAGATTGAATCCTGAACTTTATGGAAAGATACTTGACAATTCCAATGTTATATCAACTACCATTATTGAAAGTTGCTGTCTTAAAGAAACTGAA
AAAGTTTGGAATGAATTATACCCTCAAGAGCCCTTCAGCTTCAATTACTTCAACTTACAAGAGGATGATGTTTTAATTATTTCATGTCACAGGAAAATGGAGAAG
AACAAAGAGCTTGAGTGGGCAAAAGCACAGAAGATTGAAATAGGTGTTGATCTTGTAGCTGCTGCTAAACGTCAGCTTCAGTTCCTCTCTGCAGTCGATAGCGAT
CAGTACAATGCTTATTGGCTTCCATTGCTTGCTAAACATTCTGAATCCCCACTATTTGAAGGACCTTTGGTTGTCCCTTTGGACTGTGAATGGATTTGGCATTGC
CACAGATTGAATCCTGTACAATACATATCTGATTGTGAGGAACTTTATGGAAAGATACTTGACAATTCCAATGTTATATCAACTACCATTATTGAAAGTTGCTGT
CTTAAAGAAACTGAAAAAGTTTGGAATGAATTATACCCTCAAGAGCCCTTCAGCTTCAATTACTTCAACTTACAAGAGGATGATGTCCCAAAAGATCAGCTCTCA
CAACTTGAAAAATACACCAAATATGATCTTGTCTCAGCTGTTAAAAGACAGACCCCCTTCTTCTATCAGGTTTCTCAACCTCACATGAAGAATGAAGATTTCCTT
CAAGAAGCTGTGGCTAGATACAGAGGATTTCTATATCTAATCAAAAGTGCAATGGATGAGTCTGTTGCTAGGTTTTGTGTTCCAACATATGATATTGATCTAATT
TGGCATACCCATCAATTGCACCCTCTTTCCTATTGCAAAGACATGAAAAAATTACTTGGTTTGGTATTGGAACATGATGATACGGTTACGGATAGATCAGTAGGA
AAACAATTGGATATTGGGTTCACTGGAACTACAAAACAATGGGATGATACATTTGGTACAAGGTATTGGAAGATCGAACCAATGTATAGAGGCCCAGCTCCTGCT
GGTAGTTGCGGAAGCAAGGTACAAGGTGGTGGATGCACTGCTGCAGCTGGTAGTTGCGGAAGCAAGGTACAGGGTGGTGGATGCACTGCTGCTGCTGGTAGTTGC
GGAAGCAAGGTAGAGGGCGGTGGATGCACTGCTGCTGCTGGGAGTTGCGGAAGCAAGGTAGAGGGCGGTGGATGCACTGCTGCTGCTGGGAGTTGCGGAAGCAAG
GTAGAGGGCGGTGGATGCACTGCTGCTGCTGGGAGTTGCGGAAGCAAGGTAGAGGGCGGTGGATGCACTGCTGCTGCTGGGAGTTGCGGAAGCAAGGTAGAGGGC
GGTGGATGCACTGCTGCTGCTGGGAGTTGCGGAAGCAAGGTAGAGGGCGGTGGATGCACTGCTGCTGCTGGGAGTTGCGGAAGCAAGGTAGAGGGCGGTGGATGC
ACTGCTGCTGCTGGGAGTTGCGGAAGCAAGGTAGAGGGCGGTGGATGCACTGCTGCTGCTGGGAGTTGCGGAAGCAAGGTAGAGGGCGGTGGATGCACTGCTGCT
GCTGGGAGTTGCGGAAGCAAGGTAGAGGGCGGTGGATGCACTGCTGCTGCTGGGAGTTGCGGAAGCAAGGTAGAGGGCGGTGGATGCACTGCTGCTGCTGGGAGT
TGCGGAAGCAAGGTAGAGGGCGGTGGATGCACTGCTGCTGCTGGGAGTTGCGGAAGCAAGGTAGAGGGCGGTGGATGCACTGCTGCTGCTGGGAGTTGCGGAAGC
AAGGTAGAGGGCGGTGGATGCACTGCTGCTGCTGGGAGTTGCGGAAGCAAGGTAGAGGGCGGTGGATGCACTGCTGCTGCTGGGAGTTGCGGAAGCAAGGTAGAG
GGCGGTGGATGCACTGCTGCTGCTGGGAGTTGCGGAAGCAAGGTAGAGGGCGGTGGATGCACTGCTGCTGCTGGGAGTTGCGGAAGCAAGGTAGAGGGCGGTGGA
TGCACTGCTGCTGCTGGGAGTTGCGGAAGCAAGGTAGAGGGCGGTGGATGCACTGCTGCTGCTGGTAGTTGCGGAAGCAAGGTAGAGGGCGGTGGATCCACTCCT
GCTGGTAGTTGCGGAAGCAAGGTACAGGGCGGTGGATGCACTGCTGCTGCTGGTAGTTGCGGAAGCAAGGTTCAGAGCGGTGGATCCACTGCTGCTGCTGGGTGC
GGAAGCAAGGTAAAGGGCGGTGGATCCACTCCTGCTGGTAGTTGCGGAAGCAAGGTACAGGGTGGTGGATGCACTGCTGCTGCTGGTAGTTGCGGAAGCAAGGTA
CAGAGCGGTGGATGCAATGTTGCTGCTGGGTGCGAAAGCAAGATACAGAGCGGTGGATGCACTGCTGCTGCTGGGTGCGGAAGCAAGGTACCAAGCGGTGGATGC
ACTGCTGCTGCTGGATGCAAAAACATCGTAAAGAGCAATGGATGCATTGCTTGTGCTGGGTGTGGAATCATGGTGTAG
mRNA sequenceShow/hide mRNA sequence
ATGGAGAAGAACAAAGAGCTTGAGTGGGCAAAAGCACAGAAGATTGAAATAGGTGTTGATCTTGTAGCTGCTGCTAAACGTCAGCTTCAGTTCCTCTCTGCAGTC
GATAGCGATCAGTACAATGCTTATTGGCTTCCATTGCTTGCTAAACATTCTGAATCCCCACTATTTGAAGGACCTTTGGTTGTCCCTTTGGACTGTGAATGGATT
TGGCATTGCCACAGATTGAATCCTGAACTTTATGGAAAGATACTTGACAATTCCAATGTTATATCAACTACCATTATTGAAAGTTGCTGTCTTAAAGAAACTGAA
AAAGTTTGGAATGAATTATACCCTCAAGAGCCCTTCAGCTTCAATTACTTCAACTTACAAGAGGATGATGTTTTAATTATTTCATGTCACAGGAAAATGGAGAAG
AACAAAGAGCTTGAGTGGGCAAAAGCACAGAAGATTGAAATAGGTGTTGATCTTGTAGCTGCTGCTAAACGTCAGCTTCAGTTCCTCTCTGCAGTCGATAGCGAT
CAGTACAATGCTTATTGGCTTCCATTGCTTGCTAAACATTCTGAATCCCCACTATTTGAAGGACCTTTGGTTGTCCCTTTGGACTGTGAATGGATTTGGCATTGC
CACAGATTGAATCCTGTACAATACATATCTGATTGTGAGGAACTTTATGGAAAGATACTTGACAATTCCAATGTTATATCAACTACCATTATTGAAAGTTGCTGT
CTTAAAGAAACTGAAAAAGTTTGGAATGAATTATACCCTCAAGAGCCCTTCAGCTTCAATTACTTCAACTTACAAGAGGATGATGTCCCAAAAGATCAGCTCTCA
CAACTTGAAAAATACACCAAATATGATCTTGTCTCAGCTGTTAAAAGACAGACCCCCTTCTTCTATCAGGTTTCTCAACCTCACATGAAGAATGAAGATTTCCTT
CAAGAAGCTGTGGCTAGATACAGAGGATTTCTATATCTAATCAAAAGTGCAATGGATGAGTCTGTTGCTAGGTTTTGTGTTCCAACATATGATATTGATCTAATT
TGGCATACCCATCAATTGCACCCTCTTTCCTATTGCAAAGACATGAAAAAATTACTTGGTTTGGTATTGGAACATGATGATACGGTTACGGATAGATCAGTAGGA
AAACAATTGGATATTGGGTTCACTGGAACTACAAAACAATGGGATGATACATTTGGTACAAGGTATTGGAAGATCGAACCAATGTATAGAGGCCCAGCTCCTGCT
GGTAGTTGCGGAAGCAAGGTACAAGGTGGTGGATGCACTGCTGCAGCTGGTAGTTGCGGAAGCAAGGTACAGGGTGGTGGATGCACTGCTGCTGCTGGTAGTTGC
GGAAGCAAGGTAGAGGGCGGTGGATGCACTGCTGCTGCTGGGAGTTGCGGAAGCAAGGTAGAGGGCGGTGGATGCACTGCTGCTGCTGGGAGTTGCGGAAGCAAG
GTAGAGGGCGGTGGATGCACTGCTGCTGCTGGGAGTTGCGGAAGCAAGGTAGAGGGCGGTGGATGCACTGCTGCTGCTGGGAGTTGCGGAAGCAAGGTAGAGGGC
GGTGGATGCACTGCTGCTGCTGGGAGTTGCGGAAGCAAGGTAGAGGGCGGTGGATGCACTGCTGCTGCTGGGAGTTGCGGAAGCAAGGTAGAGGGCGGTGGATGC
ACTGCTGCTGCTGGGAGTTGCGGAAGCAAGGTAGAGGGCGGTGGATGCACTGCTGCTGCTGGGAGTTGCGGAAGCAAGGTAGAGGGCGGTGGATGCACTGCTGCT
GCTGGGAGTTGCGGAAGCAAGGTAGAGGGCGGTGGATGCACTGCTGCTGCTGGGAGTTGCGGAAGCAAGGTAGAGGGCGGTGGATGCACTGCTGCTGCTGGGAGT
TGCGGAAGCAAGGTAGAGGGCGGTGGATGCACTGCTGCTGCTGGGAGTTGCGGAAGCAAGGTAGAGGGCGGTGGATGCACTGCTGCTGCTGGGAGTTGCGGAAGC
AAGGTAGAGGGCGGTGGATGCACTGCTGCTGCTGGGAGTTGCGGAAGCAAGGTAGAGGGCGGTGGATGCACTGCTGCTGCTGGGAGTTGCGGAAGCAAGGTAGAG
GGCGGTGGATGCACTGCTGCTGCTGGGAGTTGCGGAAGCAAGGTAGAGGGCGGTGGATGCACTGCTGCTGCTGGGAGTTGCGGAAGCAAGGTAGAGGGCGGTGGA
TGCACTGCTGCTGCTGGGAGTTGCGGAAGCAAGGTAGAGGGCGGTGGATGCACTGCTGCTGCTGGTAGTTGCGGAAGCAAGGTAGAGGGCGGTGGATCCACTCCT
GCTGGTAGTTGCGGAAGCAAGGTACAGGGCGGTGGATGCACTGCTGCTGCTGGTAGTTGCGGAAGCAAGGTTCAGAGCGGTGGATCCACTGCTGCTGCTGGGTGC
GGAAGCAAGGTAAAGGGCGGTGGATCCACTCCTGCTGGTAGTTGCGGAAGCAAGGTACAGGGTGGTGGATGCACTGCTGCTGCTGGTAGTTGCGGAAGCAAGGTA
CAGAGCGGTGGATGCAATGTTGCTGCTGGGTGCGAAAGCAAGATACAGAGCGGTGGATGCACTGCTGCTGCTGGGTGCGGAAGCAAGGTACCAAGCGGTGGATGC
ACTGCTGCTGCTGGATGCAAAAACATCGTAAAGAGCAATGGATGCATTGCTTGTGCTGGGTGTGGAATCATGGTGTAG
Protein sequenceShow/hide protein sequence
MEKNKELEWAKAQKIEIGVDLVAAAKRQLQFLSAVDSDQYNAYWLPLLAKHSESPLFEGPLVVPLDCEWIWHCHRLNPELYGKILDNSNVISTTIIESCCLKETE
KVWNELYPQEPFSFNYFNLQEDDVLIISCHRKMEKNKELEWAKAQKIEIGVDLVAAAKRQLQFLSAVDSDQYNAYWLPLLAKHSESPLFEGPLVVPLDCEWIWHC
HRLNPVQYISDCEELYGKILDNSNVISTTIIESCCLKETEKVWNELYPQEPFSFNYFNLQEDDVPKDQLSQLEKYTKYDLVSAVKRQTPFFYQVSQPHMKNEDFL
QEAVARYRGFLYLIKSAMDESVARFCVPTYDIDLIWHTHQLHPLSYCKDMKKLLGLVLEHDDTVTDRSVGKQLDIGFTGTTKQWDDTFGTRYWKIEPMYRGPAPA
GSCGSKVQGGGCTAAAGSCGSKVQGGGCTAAAGSCGSKVEGGGCTAAAGSCGSKVEGGGCTAAAGSCGSKVEGGGCTAAAGSCGSKVEGGGCTAAAGSCGSKVEG
GGCTAAAGSCGSKVEGGGCTAAAGSCGSKVEGGGCTAAAGSCGSKVEGGGCTAAAGSCGSKVEGGGCTAAAGSCGSKVEGGGCTAAAGSCGSKVEGGGCTAAAGS
CGSKVEGGGCTAAAGSCGSKVEGGGCTAAAGSCGSKVEGGGCTAAAGSCGSKVEGGGCTAAAGSCGSKVEGGGCTAAAGSCGSKVEGGGCTAAAGSCGSKVEGGG
CTAAAGSCGSKVEGGGCTAAAGSCGSKVEGGGSTPAGSCGSKVQGGGCTAAAGSCGSKVQSGGSTAAAGCGSKVKGGGSTPAGSCGSKVQGGGCTAAAGSCGSKV
QSGGCNVAAGCESKIQSGGCTAAAGCGSKVPSGGCTAAAGCKNIVKSNGCIACAGCGIMV