; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc10G03140 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc10G03140
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionGlycine-rich domain-containing protein 1-like
Genome locationClcChr10:3446828..3452738
RNA-Seq ExpressionClc10G03140
SyntenyClc10G03140
Gene Ontology termsNA
InterPro domainsIPR008519 - Tandem-repeating region of mucin, epiglycanin-like
IPR009836 - Glycine-rich domain-containing protein-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_023007682.1 glycine-rich domain-containing protein 1-like isoform X1 [Cucurbita maxima]6.5e-13677.93Show/hide
Query:  MEKNKELEWAKAQKIEIGVDLVAAAKRQLQFLSAVDSDQFLHEGPILDRAIYRYNAYWLPLLAKHSESPLFEGPLVVPLDCEWIWHCHRLNPVQYISDCE
        MEKN+ELEWA+AQ+IEIGVDLVAAAKRQLQFLSAVD ++FL+EGP L+RAIYRYNAYWLPLLAKHSESPLFEGPLVVP DCEWIWHCHRLNPV+Y SDCE
Subjt:  MEKNKELEWAKAQKIEIGVDLVAAAKRQLQFLSAVDSDQFLHEGPILDRAIYRYNAYWLPLLAKHSESPLFEGPLVVPLDCEWIWHCHRLNPVQYISDCE

Query:  ELYGKILDNSNVISTTIIESCCLKETEKVWNELYPQEPFSFNY---FNLQEDDVPKDQLSQLEKYTKYDLVSAVKRQTPFYYQVSQPHMKNEDFLEEAVA
        ELYGKILDNSNV+ST  + S C +ETE+VWNELYP+E F+FN+      QED +    LS LEKYTKYDLVSAVKRQ+PF+YQVS+PHM NE FL+EAVA
Subjt:  ELYGKILDNSNVISTTIIESCCLKETEKVWNELYPQEPFSFNY---FNLQEDDVPKDQLSQLEKYTKYDLVSAVKRQTPFYYQVSQPHMKNEDFLEEAVA

Query:  RYRGFLYLIKSNMDESVQKFCVPTYDIDLIWHTHQLHPLPYYKDMKKLLGLVLEHDDMDSDRSEGKKLDIGFTGTTKQWDDTFGTRYWKVGAMYRGPAP
        RY+GFLYLIKSN ++S+++FCVPTYDIDLIWHTHQLHP+ Y KD+K LLG++LEHDDMDSDR++GKKLD GF+GTTKQW+DTFGTRYWK GAMYRG +P
Subjt:  RYRGFLYLIKSNMDESVQKFCVPTYDIDLIWHTHQLHPLPYYKDMKKLLGLVLEHDDMDSDRSEGKKLDIGFTGTTKQWDDTFGTRYWKVGAMYRGPAP

XP_023007683.1 glycine-rich domain-containing protein 1-like isoform X2 [Cucurbita maxima]6.5e-13677.93Show/hide
Query:  MEKNKELEWAKAQKIEIGVDLVAAAKRQLQFLSAVDSDQFLHEGPILDRAIYRYNAYWLPLLAKHSESPLFEGPLVVPLDCEWIWHCHRLNPVQYISDCE
        MEKN+ELEWA+AQ+IEIGVDLVAAAKRQLQFLSAVD ++FL+EGP L+RAIYRYNAYWLPLLAKHSESPLFEGPLVVP DCEWIWHCHRLNPV+Y SDCE
Subjt:  MEKNKELEWAKAQKIEIGVDLVAAAKRQLQFLSAVDSDQFLHEGPILDRAIYRYNAYWLPLLAKHSESPLFEGPLVVPLDCEWIWHCHRLNPVQYISDCE

Query:  ELYGKILDNSNVISTTIIESCCLKETEKVWNELYPQEPFSFNY---FNLQEDDVPKDQLSQLEKYTKYDLVSAVKRQTPFYYQVSQPHMKNEDFLEEAVA
        ELYGKILDNSNV+ST  + S C +ETE+VWNELYP+E F+FN+      QED +    LS LEKYTKYDLVSAVKRQ+PF+YQVS+PHM NE FL+EAVA
Subjt:  ELYGKILDNSNVISTTIIESCCLKETEKVWNELYPQEPFSFNY---FNLQEDDVPKDQLSQLEKYTKYDLVSAVKRQTPFYYQVSQPHMKNEDFLEEAVA

Query:  RYRGFLYLIKSNMDESVQKFCVPTYDIDLIWHTHQLHPLPYYKDMKKLLGLVLEHDDMDSDRSEGKKLDIGFTGTTKQWDDTFGTRYWKVGAMYRGPAP
        RY+GFLYLIKSN ++S+++FCVPTYDIDLIWHTHQLHP+ Y KD+K LLG++LEHDDMDSDR++GKKLD GF+GTTKQW+DTFGTRYWK GAMYRG +P
Subjt:  RYRGFLYLIKSNMDESVQKFCVPTYDIDLIWHTHQLHPLPYYKDMKKLLGLVLEHDDMDSDRSEGKKLDIGFTGTTKQWDDTFGTRYWKVGAMYRGPAP

XP_023007685.1 glycine-rich domain-containing protein 1-like isoform X3 [Cucurbita maxima]6.5e-13677.93Show/hide
Query:  MEKNKELEWAKAQKIEIGVDLVAAAKRQLQFLSAVDSDQFLHEGPILDRAIYRYNAYWLPLLAKHSESPLFEGPLVVPLDCEWIWHCHRLNPVQYISDCE
        MEKN+ELEWA+AQ+IEIGVDLVAAAKRQLQFLSAVD ++FL+EGP L+RAIYRYNAYWLPLLAKHSESPLFEGPLVVP DCEWIWHCHRLNPV+Y SDCE
Subjt:  MEKNKELEWAKAQKIEIGVDLVAAAKRQLQFLSAVDSDQFLHEGPILDRAIYRYNAYWLPLLAKHSESPLFEGPLVVPLDCEWIWHCHRLNPVQYISDCE

Query:  ELYGKILDNSNVISTTIIESCCLKETEKVWNELYPQEPFSFNY---FNLQEDDVPKDQLSQLEKYTKYDLVSAVKRQTPFYYQVSQPHMKNEDFLEEAVA
        ELYGKILDNSNV+ST  + S C +ETE+VWNELYP+E F+FN+      QED +    LS LEKYTKYDLVSAVKRQ+PF+YQVS+PHM NE FL+EAVA
Subjt:  ELYGKILDNSNVISTTIIESCCLKETEKVWNELYPQEPFSFNY---FNLQEDDVPKDQLSQLEKYTKYDLVSAVKRQTPFYYQVSQPHMKNEDFLEEAVA

Query:  RYRGFLYLIKSNMDESVQKFCVPTYDIDLIWHTHQLHPLPYYKDMKKLLGLVLEHDDMDSDRSEGKKLDIGFTGTTKQWDDTFGTRYWKVGAMYRGPAP
        RY+GFLYLIKSN ++S+++FCVPTYDIDLIWHTHQLHP+ Y KD+K LLG++LEHDDMDSDR++GKKLD GF+GTTKQW+DTFGTRYWK GAMYRG +P
Subjt:  RYRGFLYLIKSNMDESVQKFCVPTYDIDLIWHTHQLHPLPYYKDMKKLLGLVLEHDDMDSDRSEGKKLDIGFTGTTKQWDDTFGTRYWKVGAMYRGPAP

XP_023552041.1 glycine-rich domain-containing protein 1-like [Cucurbita pepo subsp. pepo]1.0e-13678.26Show/hide
Query:  MEKNKELEWAKAQKIEIGVDLVAAAKRQLQFLSAVDSDQFLHEGPILDRAIYRYNAYWLPLLAKHSESPLFEGPLVVPLDCEWIWHCHRLNPVQYISDCE
        MEKN+ELEWA+AQ+IEIGVDLVAAAKRQLQFLSAVD ++FL+EGP L+RAIYRYNAYWLPLLAKHSESPLFEGPL VP DCEWIWHCHRLNPV+Y S+CE
Subjt:  MEKNKELEWAKAQKIEIGVDLVAAAKRQLQFLSAVDSDQFLHEGPILDRAIYRYNAYWLPLLAKHSESPLFEGPLVVPLDCEWIWHCHRLNPVQYISDCE

Query:  ELYGKILDNSNVISTTIIESCCLKETEKVWNELYPQEPFSFNY---FNLQEDDVPKDQLSQLEKYTKYDLVSAVKRQTPFYYQVSQPHMKNEDFLEEAVA
        ELYGKILDNSNV+ST  + S CL+ETE+VWNELYP+EPF+FN+      QED +    LS L+KYTKYDLVSAVKRQ+PF+YQVS+PHM NE FLEEAVA
Subjt:  ELYGKILDNSNVISTTIIESCCLKETEKVWNELYPQEPFSFNY---FNLQEDDVPKDQLSQLEKYTKYDLVSAVKRQTPFYYQVSQPHMKNEDFLEEAVA

Query:  RYRGFLYLIKSNMDESVQKFCVPTYDIDLIWHTHQLHPLPYYKDMKKLLGLVLEHDDMDSDRSEGKKLDIGFTGTTKQWDDTFGTRYWKVGAMYRGPAP
        RY+GFLYLIKSN + S+++FCVPTYDIDLIWHTHQLHP+ Y KD+K LLG+VLEHDDMDSDR++GKKLD GF+GTTKQW+DTFGTRYWK GAMYRG +P
Subjt:  RYRGFLYLIKSNMDESVQKFCVPTYDIDLIWHTHQLHPLPYYKDMKKLLGLVLEHDDMDSDRSEGKKLDIGFTGTTKQWDDTFGTRYWKVGAMYRGPAP

XP_038876850.1 uncharacterized protein LOC120069217 [Benincasa hispida]3.2e-16770.34Show/hide
Query:  RKMEKNKELEWAKAQKIEIGVDLVAAAKRQLQFLSAVDSDQFLHEGPILDRAIYRYNAYWLPLLAKHSESPLFEGPLVVPLDCEWIWHCHRLNPVQYISD
        RK+EKNKELEW KAQKIEIGVDLVAAAK QLQFLS VDS  FLHEGP LDRAIYRYNAYWLPLLAKHSESPLFEGPLVVPLDCEWIWH HRLNPVQYI+D
Subjt:  RKMEKNKELEWAKAQKIEIGVDLVAAAKRQLQFLSAVDSDQFLHEGPILDRAIYRYNAYWLPLLAKHSESPLFEGPLVVPLDCEWIWHCHRLNPVQYISD

Query:  CEELYGKILDNSNVISTTIIES-CCLKETEKVWNELYPQEPFSFNYFNLQEDDVPKD-QLSQLEKYTKYDLVSAVKRQTPFYYQVSQPHMKNEDFLEEAV
        CEELYGKILDNSNVISTT++ES C +KETE +WNELYP+EPFSFN FN QE DVPKD QLSQLEKYTKYDLV AVKRQTPF+YQVSQPHMKNE+FL+EA+
Subjt:  CEELYGKILDNSNVISTTIIES-CCLKETEKVWNELYPQEPFSFNYFNLQEDDVPKD-QLSQLEKYTKYDLVSAVKRQTPFYYQVSQPHMKNEDFLEEAV

Query:  ARYRGFLYLIKSNMDESVQKFCVPTYDIDLIWHTHQLHPLPYYKDMKKLLGLVLEHDDMDSDRSEGKKLDIGFTGTTKQWDDTFGTRYWKVGAMYRGPAP
        ARYRGFLYLIKS+MDESVQ FCVPTYDIDLIWHTHQLHPL Y KDMKKLLGLVLEHDD  +DR+ G+KLDIGFTGTTKQWDDTFGT Y K+GAMYRGPAP
Subjt:  ARYRGFLYLIKSNMDESVQKFCVPTYDIDLIWHTHQLHPLPYYKDMKKLLGLVLEHDDMDSDRSEGKKLDIGFTGTTKQWDDTFGTRYWKVGAMYRGPAP

Query:  RMQGMVKSGGCTTAAGCSSKVESGGCSAAAAGCSSKVKSGGCSAAAACLSKVKSGVESGGCSAAAGCSSKVKSGGCSAAAGCSSKVKSGGCSAAAGCSNK
        RMQG VKSG C        K+ES G    AAGC SKV+S G +A A C SK    VE+ G    AGC SKV   G + AA   SKV S   + AA C + 
Subjt:  RMQGMVKSGGCTTAAGCSSKVESGGCSAAAAGCSSKVKSGGCSAAAACLSKVKSGVESGGCSAAAGCSSKVKSGGCSAAAGCSSKVKSGGCSAAAGCSNK

Query:  VKSGVESGGCSAAAGCSSKVKSGGCSAAAACLSKVKSGVESGGCSAAAGCSSKVKSGGCSAAAGCSSKVKSGGCSAAAGCSNKVKSGGCSAAAACLSKV
        +K    S   + A GC SKV SGG + A+ C SK    VES G + AAGC SKV SGG  AAAGC SK+++       GC +K++S G   A  C  +V
Subjt:  VKSGVESGGCSAAAGCSSKVKSGGCSAAAACLSKVKSGVESGGCSAAAGCSSKVKSGGCSAAAGCSSKVKSGGCSAAAGCSNKVKSGGCSAAAACLSKV

TrEMBL top hitse value%identityAlignment
A0A6J1E6D0 glycine-rich domain-containing protein 2-like isoform X21.2e-13576.92Show/hide
Query:  MEKNKELEWAKAQKIEIGVDLVAAAKRQLQFLSAVDSDQFLHEGPILDRAIYRYNAYWLPLLAKHSESPLFEGPLVVPLDCEWIWHCHRLNPVQYISDCE
        ME N+ELEWA+AQ+IE+GVDLVA AKRQLQFLSAVD ++FL+EGP L+RAIYRYNAYWLPLLAKHSESPLFEGPL VP DCEWIWHCHRLNPV+Y SDCE
Subjt:  MEKNKELEWAKAQKIEIGVDLVAAAKRQLQFLSAVDSDQFLHEGPILDRAIYRYNAYWLPLLAKHSESPLFEGPLVVPLDCEWIWHCHRLNPVQYISDCE

Query:  ELYGKILDNSNVISTTIIESCCLKETEKVWNELYPQEPFSFNY---FNLQEDDVPKDQLSQLEKYTKYDLVSAVKRQTPFYYQVSQPHMKNEDFLEEAVA
        ELYGKILDNSNV+ST  + S CL+ETE+VWNELYP+EPF+FN+      QED +    LS LEKYTKYDLVSAVKRQ+PF+YQVS+PHM NE FL+EAVA
Subjt:  ELYGKILDNSNVISTTIIESCCLKETEKVWNELYPQEPFSFNY---FNLQEDDVPKDQLSQLEKYTKYDLVSAVKRQTPFYYQVSQPHMKNEDFLEEAVA

Query:  RYRGFLYLIKSNMDESVQKFCVPTYDIDLIWHTHQLHPLPYYKDMKKLLGLVLEHDDMDSDRSEGKKLDIGFTGTTKQWDDTFGTRYWKVGAMYRGPAP
        RY+GFLYLIKSN ++S+++FCVPTYDIDLIWH+HQLHP+ Y KD+K L+G+VLEHDDMDSDR++GKKLD GF+GTTKQW+DTFGTRYWK GAMYRG +P
Subjt:  RYRGFLYLIKSNMDESVQKFCVPTYDIDLIWHTHQLHPLPYYKDMKKLLGLVLEHDDMDSDRSEGKKLDIGFTGTTKQWDDTFGTRYWKVGAMYRGPAP

A0A6J1EBB2 glycine-rich domain-containing protein 1-like isoform X11.2e-13576.92Show/hide
Query:  MEKNKELEWAKAQKIEIGVDLVAAAKRQLQFLSAVDSDQFLHEGPILDRAIYRYNAYWLPLLAKHSESPLFEGPLVVPLDCEWIWHCHRLNPVQYISDCE
        ME N+ELEWA+AQ+IE+GVDLVA AKRQLQFLSAVD ++FL+EGP L+RAIYRYNAYWLPLLAKHSESPLFEGPL VP DCEWIWHCHRLNPV+Y SDCE
Subjt:  MEKNKELEWAKAQKIEIGVDLVAAAKRQLQFLSAVDSDQFLHEGPILDRAIYRYNAYWLPLLAKHSESPLFEGPLVVPLDCEWIWHCHRLNPVQYISDCE

Query:  ELYGKILDNSNVISTTIIESCCLKETEKVWNELYPQEPFSFNY---FNLQEDDVPKDQLSQLEKYTKYDLVSAVKRQTPFYYQVSQPHMKNEDFLEEAVA
        ELYGKILDNSNV+ST  + S CL+ETE+VWNELYP+EPF+FN+      QED +    LS LEKYTKYDLVSAVKRQ+PF+YQVS+PHM NE FL+EAVA
Subjt:  ELYGKILDNSNVISTTIIESCCLKETEKVWNELYPQEPFSFNY---FNLQEDDVPKDQLSQLEKYTKYDLVSAVKRQTPFYYQVSQPHMKNEDFLEEAVA

Query:  RYRGFLYLIKSNMDESVQKFCVPTYDIDLIWHTHQLHPLPYYKDMKKLLGLVLEHDDMDSDRSEGKKLDIGFTGTTKQWDDTFGTRYWKVGAMYRGPAP
        RY+GFLYLIKSN ++S+++FCVPTYDIDLIWH+HQLHP+ Y KD+K L+G+VLEHDDMDSDR++GKKLD GF+GTTKQW+DTFGTRYWK GAMYRG +P
Subjt:  RYRGFLYLIKSNMDESVQKFCVPTYDIDLIWHTHQLHPLPYYKDMKKLLGLVLEHDDMDSDRSEGKKLDIGFTGTTKQWDDTFGTRYWKVGAMYRGPAP

A0A6J1KZD5 glycine-rich domain-containing protein 1-like isoform X13.2e-13677.93Show/hide
Query:  MEKNKELEWAKAQKIEIGVDLVAAAKRQLQFLSAVDSDQFLHEGPILDRAIYRYNAYWLPLLAKHSESPLFEGPLVVPLDCEWIWHCHRLNPVQYISDCE
        MEKN+ELEWA+AQ+IEIGVDLVAAAKRQLQFLSAVD ++FL+EGP L+RAIYRYNAYWLPLLAKHSESPLFEGPLVVP DCEWIWHCHRLNPV+Y SDCE
Subjt:  MEKNKELEWAKAQKIEIGVDLVAAAKRQLQFLSAVDSDQFLHEGPILDRAIYRYNAYWLPLLAKHSESPLFEGPLVVPLDCEWIWHCHRLNPVQYISDCE

Query:  ELYGKILDNSNVISTTIIESCCLKETEKVWNELYPQEPFSFNY---FNLQEDDVPKDQLSQLEKYTKYDLVSAVKRQTPFYYQVSQPHMKNEDFLEEAVA
        ELYGKILDNSNV+ST  + S C +ETE+VWNELYP+E F+FN+      QED +    LS LEKYTKYDLVSAVKRQ+PF+YQVS+PHM NE FL+EAVA
Subjt:  ELYGKILDNSNVISTTIIESCCLKETEKVWNELYPQEPFSFNY---FNLQEDDVPKDQLSQLEKYTKYDLVSAVKRQTPFYYQVSQPHMKNEDFLEEAVA

Query:  RYRGFLYLIKSNMDESVQKFCVPTYDIDLIWHTHQLHPLPYYKDMKKLLGLVLEHDDMDSDRSEGKKLDIGFTGTTKQWDDTFGTRYWKVGAMYRGPAP
        RY+GFLYLIKSN ++S+++FCVPTYDIDLIWHTHQLHP+ Y KD+K LLG++LEHDDMDSDR++GKKLD GF+GTTKQW+DTFGTRYWK GAMYRG +P
Subjt:  RYRGFLYLIKSNMDESVQKFCVPTYDIDLIWHTHQLHPLPYYKDMKKLLGLVLEHDDMDSDRSEGKKLDIGFTGTTKQWDDTFGTRYWKVGAMYRGPAP

A0A6J1L3N7 glycine-rich domain-containing protein 1-like isoform X33.2e-13677.93Show/hide
Query:  MEKNKELEWAKAQKIEIGVDLVAAAKRQLQFLSAVDSDQFLHEGPILDRAIYRYNAYWLPLLAKHSESPLFEGPLVVPLDCEWIWHCHRLNPVQYISDCE
        MEKN+ELEWA+AQ+IEIGVDLVAAAKRQLQFLSAVD ++FL+EGP L+RAIYRYNAYWLPLLAKHSESPLFEGPLVVP DCEWIWHCHRLNPV+Y SDCE
Subjt:  MEKNKELEWAKAQKIEIGVDLVAAAKRQLQFLSAVDSDQFLHEGPILDRAIYRYNAYWLPLLAKHSESPLFEGPLVVPLDCEWIWHCHRLNPVQYISDCE

Query:  ELYGKILDNSNVISTTIIESCCLKETEKVWNELYPQEPFSFNY---FNLQEDDVPKDQLSQLEKYTKYDLVSAVKRQTPFYYQVSQPHMKNEDFLEEAVA
        ELYGKILDNSNV+ST  + S C +ETE+VWNELYP+E F+FN+      QED +    LS LEKYTKYDLVSAVKRQ+PF+YQVS+PHM NE FL+EAVA
Subjt:  ELYGKILDNSNVISTTIIESCCLKETEKVWNELYPQEPFSFNY---FNLQEDDVPKDQLSQLEKYTKYDLVSAVKRQTPFYYQVSQPHMKNEDFLEEAVA

Query:  RYRGFLYLIKSNMDESVQKFCVPTYDIDLIWHTHQLHPLPYYKDMKKLLGLVLEHDDMDSDRSEGKKLDIGFTGTTKQWDDTFGTRYWKVGAMYRGPAP
        RY+GFLYLIKSN ++S+++FCVPTYDIDLIWHTHQLHP+ Y KD+K LLG++LEHDDMDSDR++GKKLD GF+GTTKQW+DTFGTRYWK GAMYRG +P
Subjt:  RYRGFLYLIKSNMDESVQKFCVPTYDIDLIWHTHQLHPLPYYKDMKKLLGLVLEHDDMDSDRSEGKKLDIGFTGTTKQWDDTFGTRYWKVGAMYRGPAP

A0A6J1L8C3 glycine-rich domain-containing protein 1-like isoform X23.2e-13677.93Show/hide
Query:  MEKNKELEWAKAQKIEIGVDLVAAAKRQLQFLSAVDSDQFLHEGPILDRAIYRYNAYWLPLLAKHSESPLFEGPLVVPLDCEWIWHCHRLNPVQYISDCE
        MEKN+ELEWA+AQ+IEIGVDLVAAAKRQLQFLSAVD ++FL+EGP L+RAIYRYNAYWLPLLAKHSESPLFEGPLVVP DCEWIWHCHRLNPV+Y SDCE
Subjt:  MEKNKELEWAKAQKIEIGVDLVAAAKRQLQFLSAVDSDQFLHEGPILDRAIYRYNAYWLPLLAKHSESPLFEGPLVVPLDCEWIWHCHRLNPVQYISDCE

Query:  ELYGKILDNSNVISTTIIESCCLKETEKVWNELYPQEPFSFNY---FNLQEDDVPKDQLSQLEKYTKYDLVSAVKRQTPFYYQVSQPHMKNEDFLEEAVA
        ELYGKILDNSNV+ST  + S C +ETE+VWNELYP+E F+FN+      QED +    LS LEKYTKYDLVSAVKRQ+PF+YQVS+PHM NE FL+EAVA
Subjt:  ELYGKILDNSNVISTTIIESCCLKETEKVWNELYPQEPFSFNY---FNLQEDDVPKDQLSQLEKYTKYDLVSAVKRQTPFYYQVSQPHMKNEDFLEEAVA

Query:  RYRGFLYLIKSNMDESVQKFCVPTYDIDLIWHTHQLHPLPYYKDMKKLLGLVLEHDDMDSDRSEGKKLDIGFTGTTKQWDDTFGTRYWKVGAMYRGPAP
        RY+GFLYLIKSN ++S+++FCVPTYDIDLIWHTHQLHP+ Y KD+K LLG++LEHDDMDSDR++GKKLD GF+GTTKQW+DTFGTRYWK GAMYRG +P
Subjt:  RYRGFLYLIKSNMDESVQKFCVPTYDIDLIWHTHQLHPLPYYKDMKKLLGLVLEHDDMDSDRSEGKKLDIGFTGTTKQWDDTFGTRYWKVGAMYRGPAP

SwissProt top hitse value%identityAlignment
Q9SZJ2 Glycine-rich domain-containing protein 21.6e-10861.62Show/hide
Query:  EKNKELEWAKAQKIEIGVDLVAAAKRQLQFLSAVDSDQFLHEGPILDRAIYRYNAYWLPLLAKHSE-SPLFEGPLVVPLDCEWIWHCHRLNPVQYISDCE
        EK + LEW +AQKI+I VDL+AAAK+ L FL AVD ++ L++GP L RAIYRYNAYWLPLLA+++E S + +GPLV PLDCEW+WHCHRLNPV+Y +DCE
Subjt:  EKNKELEWAKAQKIEIGVDLVAAAKRQLQFLSAVDSDQFLHEGPILDRAIYRYNAYWLPLLAKHSE-SPLFEGPLVVPLDCEWIWHCHRLNPVQYISDCE

Query:  ELYGKILDNSNVISTTIIESCCLKETEKVWNELYPQEPFSFNYFNLQEDDVPKDQLSQLEKYTKYDLVSAVKRQTPFYYQVSQPHMKNEDFLEEAVARYR
        + YG++LDNS V+S+  +   C  +TE +W  LYP EP+  ++ N   +  P D +S LEK T YDLV AVKRQ+PF+YQVS+ H+ N+ FL+EAVARY+
Subjt:  ELYGKILDNSNVISTTIIESCCLKETEKVWNELYPQEPFSFNYFNLQEDDVPKDQLSQLEKYTKYDLVSAVKRQTPFYYQVSQPHMKNEDFLEEAVARYR

Query:  GFLYLIKSNMDESVQKFCVPTYDIDLIWHTHQLHPLPYYKDMKKLLGLVLEHDDMDSDRSEGKKLDIGFTGTTKQWDDTFGTRYWKVGAMYRGPAPR
         FLYLIK N + S++ FCVPTYDIDLIWHTHQLH + Y  D+ K++G VLEHDD DSDRS+GKKLD GF+GTT QW++TFG RYWK GAM RG  P+
Subjt:  GFLYLIKSNMDESVQKFCVPTYDIDLIWHTHQLHPLPYYKDMKKLLGLVLEHDDMDSDRSEGKKLDIGFTGTTKQWDDTFGTRYWKVGAMYRGPAPR

Q9ZQ47 Glycine-rich domain-containing protein 11.5e-11163.18Show/hide
Query:  EKNKELEWAKAQKIEIGVDLVAAAKRQLQFLSAVDSDQFLHEGPILDRAIYRYNAYWLPLLAKHSE-SPLFEGPLVVPLDCEWIWHCHRLNPVQYISDCE
        EK+ E+EW +AQKIEI VDL+AAAK+ L FL  VD +++L++GP L++AIYRYNA WLPLL K+SE S + EG LV PLDCEWIWHCHRLNPV+Y SDCE
Subjt:  EKNKELEWAKAQKIEIGVDLVAAAKRQLQFLSAVDSDQFLHEGPILDRAIYRYNAYWLPLLAKHSE-SPLFEGPLVVPLDCEWIWHCHRLNPVQYISDCE

Query:  ELYGKILDNSNVISTTIIESCCLKETEKVWNELYPQEPFSFNYFNLQEDDVPKDQLSQLEKYTKYDLVSAVKRQTPFYYQVSQPHMKNEDFLEEAVARYR
        + YG++LDNS V+S+  ++  C  +TE +W  LYP EP+  +  N+  +D+  ++ S LEK TKYDLVSAVKRQ+PFYYQVS+ H+ ++ FL+EAVARY+
Subjt:  ELYGKILDNSNVISTTIIESCCLKETEKVWNELYPQEPFSFNYFNLQEDDVPKDQLSQLEKYTKYDLVSAVKRQTPFYYQVSQPHMKNEDFLEEAVARYR

Query:  GFLYLIKSNMDESVQKFCVPTYDIDLIWHTHQLHPLPYYKDMKKLLGLVLEHDDMDSDRSEGKKLDIGFTGTTKQWDDTFGTRYWKVGAMYRGPAP
        GFLYLIK N + S+++FCVPTYD+DLIWHTHQLHP+ Y  DM KL+G VLEHDD DSDR +GKKLD GF+ TT QW++TFGTRYWK GAM+RG  P
Subjt:  GFLYLIKSNMDESVQKFCVPTYDIDLIWHTHQLHPLPYYKDMKKLLGLVLEHDDMDSDRSEGKKLDIGFTGTTKQWDDTFGTRYWKVGAMYRGPAP

Arabidopsis top hitse value%identityAlignment
AT1G56230.1 Protein of unknown function (DUF1399)3.2e-3230.99Show/hide
Query:  EWAKAQKIEIGVDLVAAAKRQLQFLSAVDSDQFLHEGPILDRAIYRYNAYWLPLLAKHSESPLFEGPLVV-PLDCEWIWHCHRLNPVQYISDCEELYGKI
        E ++   + IG D++++A+R +  L +V   Q+LH  P++  AI RY+  W+PL++  +     + P+++ PLD EW+W CH LNPV Y   CE  + K+
Subjt:  EWAKAQKIEIGVDLVAAAKRQLQFLSAVDSDQFLHEGPILDRAIYRYNAYWLPLLAKHSESPLFEGPLVV-PLDCEWIWHCHRLNPVQYISDCEELYGKI

Query:  LDNSNVISTTIIESCCLKETEKVWNELYPQEPFSFNYFNLQEDDVPKDQLSQLEKYTKYDLVSAVKRQTPFYYQVSQPHMKNEDFLEEAVARYRGFLYLI
        +    +      E   + + EK+W+  YP E F        E+    D L  +    + D+ S VK+Q   + + S P+M    +L  A  RY+GFL ++
Subjt:  LDNSNVISTTIIESCCLKETEKVWNELYPQEPFSFNYFNLQEDDVPKDQLSQLEKYTKYDLVSAVKRQTPFYYQVSQPHMKNEDFLEEAVARYRGFLYLI

Query:  KSNMDESVQKFCVPTYDIDLIWHTHQLHPLPYYKDMKKLLGLVLEHDDMDSDRSE-GKKLDIGFTGTTKQ-WDDTFGTRYWKVG
            DE      +P  DI L+W THQ +P  Y  D+ ++L      ++M     + G+K++     TTK+ WD  F   Y K G
Subjt:  KSNMDESVQKFCVPTYDIDLIWHTHQLHPLPYYKDMKKLLGLVLEHDDMDSDRSE-GKKLDIGFTGTTKQ-WDDTFGTRYWKVG

AT1G56230.2 Protein of unknown function (DUF1399)3.2e-3230.99Show/hide
Query:  EWAKAQKIEIGVDLVAAAKRQLQFLSAVDSDQFLHEGPILDRAIYRYNAYWLPLLAKHSESPLFEGPLVV-PLDCEWIWHCHRLNPVQYISDCEELYGKI
        E ++   + IG D++++A+R +  L +V   Q+LH  P++  AI RY+  W+PL++  +     + P+++ PLD EW+W CH LNPV Y   CE  + K+
Subjt:  EWAKAQKIEIGVDLVAAAKRQLQFLSAVDSDQFLHEGPILDRAIYRYNAYWLPLLAKHSESPLFEGPLVV-PLDCEWIWHCHRLNPVQYISDCEELYGKI

Query:  LDNSNVISTTIIESCCLKETEKVWNELYPQEPFSFNYFNLQEDDVPKDQLSQLEKYTKYDLVSAVKRQTPFYYQVSQPHMKNEDFLEEAVARYRGFLYLI
        +    +      E   + + EK+W+  YP E F        E+    D L  +    + D+ S VK+Q   + + S P+M    +L  A  RY+GFL ++
Subjt:  LDNSNVISTTIIESCCLKETEKVWNELYPQEPFSFNYFNLQEDDVPKDQLSQLEKYTKYDLVSAVKRQTPFYYQVSQPHMKNEDFLEEAVARYRGFLYLI

Query:  KSNMDESVQKFCVPTYDIDLIWHTHQLHPLPYYKDMKKLLGLVLEHDDMDSDRSE-GKKLDIGFTGTTKQ-WDDTFGTRYWKVG
            DE      +P  DI L+W THQ +P  Y  D+ ++L      ++M     + G+K++     TTK+ WD  F   Y K G
Subjt:  KSNMDESVQKFCVPTYDIDLIWHTHQLHPLPYYKDMKKLLGLVLEHDDMDSDRSE-GKKLDIGFTGTTKQ-WDDTFGTRYWKVG

AT2G22660.1 Protein of unknown function (duplicated DUF1399)1.1e-11263.18Show/hide
Query:  EKNKELEWAKAQKIEIGVDLVAAAKRQLQFLSAVDSDQFLHEGPILDRAIYRYNAYWLPLLAKHSE-SPLFEGPLVVPLDCEWIWHCHRLNPVQYISDCE
        EK+ E+EW +AQKIEI VDL+AAAK+ L FL  VD +++L++GP L++AIYRYNA WLPLL K+SE S + EG LV PLDCEWIWHCHRLNPV+Y SDCE
Subjt:  EKNKELEWAKAQKIEIGVDLVAAAKRQLQFLSAVDSDQFLHEGPILDRAIYRYNAYWLPLLAKHSE-SPLFEGPLVVPLDCEWIWHCHRLNPVQYISDCE

Query:  ELYGKILDNSNVISTTIIESCCLKETEKVWNELYPQEPFSFNYFNLQEDDVPKDQLSQLEKYTKYDLVSAVKRQTPFYYQVSQPHMKNEDFLEEAVARYR
        + YG++LDNS V+S+  ++  C  +TE +W  LYP EP+  +  N+  +D+  ++ S LEK TKYDLVSAVKRQ+PFYYQVS+ H+ ++ FL+EAVARY+
Subjt:  ELYGKILDNSNVISTTIIESCCLKETEKVWNELYPQEPFSFNYFNLQEDDVPKDQLSQLEKYTKYDLVSAVKRQTPFYYQVSQPHMKNEDFLEEAVARYR

Query:  GFLYLIKSNMDESVQKFCVPTYDIDLIWHTHQLHPLPYYKDMKKLLGLVLEHDDMDSDRSEGKKLDIGFTGTTKQWDDTFGTRYWKVGAMYRGPAP
        GFLYLIK N + S+++FCVPTYD+DLIWHTHQLHP+ Y  DM KL+G VLEHDD DSDR +GKKLD GF+ TT QW++TFGTRYWK GAM+RG  P
Subjt:  GFLYLIKSNMDESVQKFCVPTYDIDLIWHTHQLHPLPYYKDMKKLLGLVLEHDDMDSDRSEGKKLDIGFTGTTKQWDDTFGTRYWKVGAMYRGPAP

AT2G22660.2 Protein of unknown function (duplicated DUF1399)1.1e-11263.18Show/hide
Query:  EKNKELEWAKAQKIEIGVDLVAAAKRQLQFLSAVDSDQFLHEGPILDRAIYRYNAYWLPLLAKHSE-SPLFEGPLVVPLDCEWIWHCHRLNPVQYISDCE
        EK+ E+EW +AQKIEI VDL+AAAK+ L FL  VD +++L++GP L++AIYRYNA WLPLL K+SE S + EG LV PLDCEWIWHCHRLNPV+Y SDCE
Subjt:  EKNKELEWAKAQKIEIGVDLVAAAKRQLQFLSAVDSDQFLHEGPILDRAIYRYNAYWLPLLAKHSE-SPLFEGPLVVPLDCEWIWHCHRLNPVQYISDCE

Query:  ELYGKILDNSNVISTTIIESCCLKETEKVWNELYPQEPFSFNYFNLQEDDVPKDQLSQLEKYTKYDLVSAVKRQTPFYYQVSQPHMKNEDFLEEAVARYR
        + YG++LDNS V+S+  ++  C  +TE +W  LYP EP+  +  N+  +D+  ++ S LEK TKYDLVSAVKRQ+PFYYQVS+ H+ ++ FL+EAVARY+
Subjt:  ELYGKILDNSNVISTTIIESCCLKETEKVWNELYPQEPFSFNYFNLQEDDVPKDQLSQLEKYTKYDLVSAVKRQTPFYYQVSQPHMKNEDFLEEAVARYR

Query:  GFLYLIKSNMDESVQKFCVPTYDIDLIWHTHQLHPLPYYKDMKKLLGLVLEHDDMDSDRSEGKKLDIGFTGTTKQWDDTFGTRYWKVGAMYRGPAP
        GFLYLIK N + S+++FCVPTYD+DLIWHTHQLHP+ Y  DM KL+G VLEHDD DSDR +GKKLD GF+ TT QW++TFGTRYWK GAM+RG  P
Subjt:  GFLYLIKSNMDESVQKFCVPTYDIDLIWHTHQLHPLPYYKDMKKLLGLVLEHDDMDSDRSEGKKLDIGFTGTTKQWDDTFGTRYWKVGAMYRGPAP

AT4G37900.1 Protein of unknown function (duplicated DUF1399)1.1e-10961.62Show/hide
Query:  EKNKELEWAKAQKIEIGVDLVAAAKRQLQFLSAVDSDQFLHEGPILDRAIYRYNAYWLPLLAKHSE-SPLFEGPLVVPLDCEWIWHCHRLNPVQYISDCE
        EK + LEW +AQKI+I VDL+AAAK+ L FL AVD ++ L++GP L RAIYRYNAYWLPLLA+++E S + +GPLV PLDCEW+WHCHRLNPV+Y +DCE
Subjt:  EKNKELEWAKAQKIEIGVDLVAAAKRQLQFLSAVDSDQFLHEGPILDRAIYRYNAYWLPLLAKHSE-SPLFEGPLVVPLDCEWIWHCHRLNPVQYISDCE

Query:  ELYGKILDNSNVISTTIIESCCLKETEKVWNELYPQEPFSFNYFNLQEDDVPKDQLSQLEKYTKYDLVSAVKRQTPFYYQVSQPHMKNEDFLEEAVARYR
        + YG++LDNS V+S+  +   C  +TE +W  LYP EP+  ++ N   +  P D +S LEK T YDLV AVKRQ+PF+YQVS+ H+ N+ FL+EAVARY+
Subjt:  ELYGKILDNSNVISTTIIESCCLKETEKVWNELYPQEPFSFNYFNLQEDDVPKDQLSQLEKYTKYDLVSAVKRQTPFYYQVSQPHMKNEDFLEEAVARYR

Query:  GFLYLIKSNMDESVQKFCVPTYDIDLIWHTHQLHPLPYYKDMKKLLGLVLEHDDMDSDRSEGKKLDIGFTGTTKQWDDTFGTRYWKVGAMYRGPAPR
         FLYLIK N + S++ FCVPTYDIDLIWHTHQLH + Y  D+ K++G VLEHDD DSDRS+GKKLD GF+GTT QW++TFG RYWK GAM RG  P+
Subjt:  GFLYLIKSNMDESVQKFCVPTYDIDLIWHTHQLHPLPYYKDMKKLLGLVLEHDDMDSDRSEGKKLDIGFTGTTKQWDDTFGTRYWKVGAMYRGPAPR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCTTCACTCCCACTTCAAATCTCTCATGCTCATGGAATTTCAAGGAAAATGGAGAAGAACAAAGAGCTTGAGTGGGCAAAAGCACAGAAGATTGAAATAGGTGTTGA
TCTTGTAGCTGCTGCTAAACGTCAGCTTCAGTTCCTCTCTGCAGTCGATAGCGATCAGTTCCTTCATGAAGGCCCTATCCTTGATCGGGCTATTTACAGGTACAATGCTT
ATTGGCTTCCATTGCTTGCTAAACATTCTGAATCCCCACTATTTGAAGGACCTTTGGTTGTCCCTTTGGACTGTGAATGGATTTGGCATTGCCACAGATTGAATCCTGTA
CAATACATATCTGATTGTGAGGAACTTTATGGAAAGATACTTGACAATTCCAATGTTATATCAACTACCATTATTGAAAGTTGCTGTCTTAAAGAAACTGAAAAAGTTTG
GAATGAATTATACCCTCAAGAGCCCTTCAGCTTCAATTACTTCAACTTACAAGAGGATGATGTCCCAAAAGATCAGCTCTCACAACTTGAAAAATACACCAAATATGATC
TTGTCTCAGCTGTTAAAAGACAGACCCCCTTCTACTATCAGGTGTCTCAACCTCACATGAAAAATGAAGATTTCCTTGAAGAAGCAGTGGCTAGATACAGAGGATTCCTA
TATCTAATCAAAAGTAACATGGATGAGTCTGTACAAAAGTTTTGTGTTCCAACATATGATATCGATCTAATTTGGCATACTCATCAATTGCACCCTCTTCCCTATTACAA
AGACATGAAAAAATTACTTGGTTTGGTATTGGAGCATGATGATATGGATTCAGATCGATCTGAAGGAAAGAAATTGGATATTGGCTTCACTGGAACTACAAAACAATGGG
ATGATACATTTGGTACAAGGTATTGGAAGGTTGGAGCAATGTATAGAGGCCCAGCTCCTCGAATGCAAGGTATGGTGAAAAGTGGTGGATGCACTACTGCTGCTGGGTGC
TCAAGCAAGGTAGAGAGCGGTGGATGCAGTGCTGCTGCTGCTGGGTGCTCAAGCAAGGTAAAAAGCGGTGGATGCAGTGCTGCTGCTGCGTGCTTAAGCAAGGTAAAGAG
CGGTGTAGAGAGCGGTGGATGCAGTGCTGCTGCTGGGTGCTCAAGCAAGGTAAAAAGCGGTGGATGCAGTGCTGCTGCTGGGTGCTCAAGCAAGGTAAAGAGCGGTGGAT
GCAGTGCTGCTGCTGGGTGCTCAAACAAGGTAAAGAGCGGTGTAGAGAGCGGTGGATGCAGTGCTGCTGCTGGGTGCTCAAGCAAGGTAAAAAGCGGTGGATGCAGTGCT
GCTGCTGCGTGCTTAAGCAAGGTAAAGAGCGGTGTAGAGAGCGGTGGATGCAGTGCTGCTGCTGGGTGCTCAAGCAAGGTAAAAAGCGGTGGATGCAGTGCTGCTGCTGG
GTGCTCAAGCAAGGTAAAGAGCGGTGGATGCAGTGCTGCTGCTGGGTGCTCAAACAAGGTAAAGAGCGGTGGATGCAGTGCTGCTGCTGCGTGCTTAAGCAAGGTAAAGA
GCGGTGTAGAGAGCGGTGGATGCAGTGCTGCTGCTGGGTGCTCAAGCAAGGTAAAAAGCGGTGGATGCAGTGCTGCTGCTGCGTGCTTAAGCAAGGTAAAAAGCGGTGGA
TGCAGTGCTGCTGCTGGGTGCTCAAGCAAGGTAAAGAGCGGTGGATGCAGTGCTGCTGCTGGGTGCTCAAACAAGGTAAAGAGCGGTGTAGAGAGCGGTGGATGCAGTGC
TGCTGTTGGGTGCGGAAGCAAGGTAAAGAGTGGTGGATGCAGTGCTGCTGCTGGGTGGGGAAGCCAGGTAAAGAGTGGTGGATGCAGTGCTGCTGCTGGGTCCGAAAGCC
AGGTAAAGAGTGGTGGATGCAATGCTGCTGTTGGGTGGGGAAGCCAGGTAAAGAGTGGTGGATGCAGTGCTGCTGCTGGGTGCGGAAGCAAGGTGAAGAGTGGTGGATGC
AGTGCTATTGCTGGGTCCGGAAGCCAAGTAAAGAGTGGTGGATGCAGTGCTGCTGCTGGGTGCGGAAGCCAGGTAAAGAGTGGTGGATGCAGTGCTGCTGCTGGGTGCAA
AGACATCGTAAAGAGCGATGGATGCAATGTTTGTGCTGGGTGTGGAATCATGGTGTAG
mRNA sequenceShow/hide mRNA sequence
ATGTCTTCACTCCCACTTCAAATCTCTCATGCTCATGGAATTTCAAGGAAAATGGAGAAGAACAAAGAGCTTGAGTGGGCAAAAGCACAGAAGATTGAAATAGGTGTTGA
TCTTGTAGCTGCTGCTAAACGTCAGCTTCAGTTCCTCTCTGCAGTCGATAGCGATCAGTTCCTTCATGAAGGCCCTATCCTTGATCGGGCTATTTACAGGTACAATGCTT
ATTGGCTTCCATTGCTTGCTAAACATTCTGAATCCCCACTATTTGAAGGACCTTTGGTTGTCCCTTTGGACTGTGAATGGATTTGGCATTGCCACAGATTGAATCCTGTA
CAATACATATCTGATTGTGAGGAACTTTATGGAAAGATACTTGACAATTCCAATGTTATATCAACTACCATTATTGAAAGTTGCTGTCTTAAAGAAACTGAAAAAGTTTG
GAATGAATTATACCCTCAAGAGCCCTTCAGCTTCAATTACTTCAACTTACAAGAGGATGATGTCCCAAAAGATCAGCTCTCACAACTTGAAAAATACACCAAATATGATC
TTGTCTCAGCTGTTAAAAGACAGACCCCCTTCTACTATCAGGTGTCTCAACCTCACATGAAAAATGAAGATTTCCTTGAAGAAGCAGTGGCTAGATACAGAGGATTCCTA
TATCTAATCAAAAGTAACATGGATGAGTCTGTACAAAAGTTTTGTGTTCCAACATATGATATCGATCTAATTTGGCATACTCATCAATTGCACCCTCTTCCCTATTACAA
AGACATGAAAAAATTACTTGGTTTGGTATTGGAGCATGATGATATGGATTCAGATCGATCTGAAGGAAAGAAATTGGATATTGGCTTCACTGGAACTACAAAACAATGGG
ATGATACATTTGGTACAAGGTATTGGAAGGTTGGAGCAATGTATAGAGGCCCAGCTCCTCGAATGCAAGGTATGGTGAAAAGTGGTGGATGCACTACTGCTGCTGGGTGC
TCAAGCAAGGTAGAGAGCGGTGGATGCAGTGCTGCTGCTGCTGGGTGCTCAAGCAAGGTAAAAAGCGGTGGATGCAGTGCTGCTGCTGCGTGCTTAAGCAAGGTAAAGAG
CGGTGTAGAGAGCGGTGGATGCAGTGCTGCTGCTGGGTGCTCAAGCAAGGTAAAAAGCGGTGGATGCAGTGCTGCTGCTGGGTGCTCAAGCAAGGTAAAGAGCGGTGGAT
GCAGTGCTGCTGCTGGGTGCTCAAACAAGGTAAAGAGCGGTGTAGAGAGCGGTGGATGCAGTGCTGCTGCTGGGTGCTCAAGCAAGGTAAAAAGCGGTGGATGCAGTGCT
GCTGCTGCGTGCTTAAGCAAGGTAAAGAGCGGTGTAGAGAGCGGTGGATGCAGTGCTGCTGCTGGGTGCTCAAGCAAGGTAAAAAGCGGTGGATGCAGTGCTGCTGCTGG
GTGCTCAAGCAAGGTAAAGAGCGGTGGATGCAGTGCTGCTGCTGGGTGCTCAAACAAGGTAAAGAGCGGTGGATGCAGTGCTGCTGCTGCGTGCTTAAGCAAGGTAAAGA
GCGGTGTAGAGAGCGGTGGATGCAGTGCTGCTGCTGGGTGCTCAAGCAAGGTAAAAAGCGGTGGATGCAGTGCTGCTGCTGCGTGCTTAAGCAAGGTAAAAAGCGGTGGA
TGCAGTGCTGCTGCTGGGTGCTCAAGCAAGGTAAAGAGCGGTGGATGCAGTGCTGCTGCTGGGTGCTCAAACAAGGTAAAGAGCGGTGTAGAGAGCGGTGGATGCAGTGC
TGCTGTTGGGTGCGGAAGCAAGGTAAAGAGTGGTGGATGCAGTGCTGCTGCTGGGTGGGGAAGCCAGGTAAAGAGTGGTGGATGCAGTGCTGCTGCTGGGTCCGAAAGCC
AGGTAAAGAGTGGTGGATGCAATGCTGCTGTTGGGTGGGGAAGCCAGGTAAAGAGTGGTGGATGCAGTGCTGCTGCTGGGTGCGGAAGCAAGGTGAAGAGTGGTGGATGC
AGTGCTATTGCTGGGTCCGGAAGCCAAGTAAAGAGTGGTGGATGCAGTGCTGCTGCTGGGTGCGGAAGCCAGGTAAAGAGTGGTGGATGCAGTGCTGCTGCTGGGTGCAA
AGACATCGTAAAGAGCGATGGATGCAATGTTTGTGCTGGGTGTGGAATCATGGTGTAG
Protein sequenceShow/hide protein sequence
MSSLPLQISHAHGISRKMEKNKELEWAKAQKIEIGVDLVAAAKRQLQFLSAVDSDQFLHEGPILDRAIYRYNAYWLPLLAKHSESPLFEGPLVVPLDCEWIWHCHRLNPV
QYISDCEELYGKILDNSNVISTTIIESCCLKETEKVWNELYPQEPFSFNYFNLQEDDVPKDQLSQLEKYTKYDLVSAVKRQTPFYYQVSQPHMKNEDFLEEAVARYRGFL
YLIKSNMDESVQKFCVPTYDIDLIWHTHQLHPLPYYKDMKKLLGLVLEHDDMDSDRSEGKKLDIGFTGTTKQWDDTFGTRYWKVGAMYRGPAPRMQGMVKSGGCTTAAGC
SSKVESGGCSAAAAGCSSKVKSGGCSAAAACLSKVKSGVESGGCSAAAGCSSKVKSGGCSAAAGCSSKVKSGGCSAAAGCSNKVKSGVESGGCSAAAGCSSKVKSGGCSA
AAACLSKVKSGVESGGCSAAAGCSSKVKSGGCSAAAGCSSKVKSGGCSAAAGCSNKVKSGGCSAAAACLSKVKSGVESGGCSAAAGCSSKVKSGGCSAAAACLSKVKSGG
CSAAAGCSSKVKSGGCSAAAGCSNKVKSGVESGGCSAAVGCGSKVKSGGCSAAAGWGSQVKSGGCSAAAGSESQVKSGGCNAAVGWGSQVKSGGCSAAAGCGSKVKSGGC
SAIAGSGSQVKSGGCSAAAGCGSQVKSGGCSAAAGCKDIVKSDGCNVCAGCGIMV