; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc10G03150 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc10G03150
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionGlycine-rich domain-containing protein 1-like
Genome locationClcChr10:3472610..3480658
RNA-Seq ExpressionClc10G03150
SyntenyClc10G03150
Gene Ontology termsNA
InterPro domainsIPR009836 - Glycine-rich domain-containing protein-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
TYK19501.1 glycine-rich domain-containing protein 1-like [Cucumis melo var. makuwa]2.4e-13477.59Show/hide
Query:  MEKNKELEWAKAQKIEIGVDLVAAAKRQLQFLSAVDSDQFLHEGPTLDRAIYRYNAYWLPLLAKHSESPLFEGPLVVPLDCEWIWHCHRLNPVQYISDCE
        MEKN+ELEW +AQ+IEIGVDLVAAAKRQLQFLSAV+ ++FL+E P+L+RAIYRYNAYWLPLLAKHSESPLF+GPLVVP DCEWIWHCHRLNPV+Y SDCE
Subjt:  MEKNKELEWAKAQKIEIGVDLVAAAKRQLQFLSAVDSDQFLHEGPTLDRAIYRYNAYWLPLLAKHSESPLFEGPLVVPLDCEWIWHCHRLNPVQYISDCE

Query:  ELYGKILDNSNVISTTIIESCCLKETEKVWNELYPQEPFSFNYFNLQEDDVP--KDQLSQLEKYTKYDLVSAVKRQTPFFYQVSQPHMKNEDFLQEAVAR
        ELYGKILDNSNVIST  I S C +ETEKVWNELYP+EPF+FN FN   D +    + LS L+KYTKYDLVSAVKRQ+PFFYQVS+PHM NE FLQEAVAR
Subjt:  ELYGKILDNSNVISTTIIESCCLKETEKVWNELYPQEPFSFNYFNLQEDDVP--KDQLSQLEKYTKYDLVSAVKRQTPFFYQVSQPHMKNEDFLQEAVAR

Query:  YRGFLYLIKSAMDESVARFCVPTYDIDLIWHTHQLHPLSYCKDMKKLLGLVLEHDDTVTDRSVGKQLDIGFTGTTKQWDDTFGTRYWKIEPMYRGPAPA
        Y+GFLYLIKS  ++S+ RFCVPTYDIDLIWH+HQLHPLSYCKD+KK+LG VLEHDDT +DR+ GK+LD GF+GTTKQW+DTFGTRYWK   MYRG  P+
Subjt:  YRGFLYLIKSAMDESVARFCVPTYDIDLIWHTHQLHPLSYCKDMKKLLGLVLEHDDTVTDRSVGKQLDIGFTGTTKQWDDTFGTRYWKIEPMYRGPAPA

XP_023007682.1 glycine-rich domain-containing protein 1-like isoform X1 [Cucurbita maxima]4.1e-13477Show/hide
Query:  MEKNKELEWAKAQKIEIGVDLVAAAKRQLQFLSAVDSDQFLHEGPTLDRAIYRYNAYWLPLLAKHSESPLFEGPLVVPLDCEWIWHCHRLNPVQYISDCE
        MEKN+ELEWA+AQ+IEIGVDLVAAAKRQLQFLSAVD ++FL+EGP+L+RAIYRYNAYWLPLLAKHSESPLFEGPLVVP DCEWIWHCHRLNPV+Y SDCE
Subjt:  MEKNKELEWAKAQKIEIGVDLVAAAKRQLQFLSAVDSDQFLHEGPTLDRAIYRYNAYWLPLLAKHSESPLFEGPLVVPLDCEWIWHCHRLNPVQYISDCE

Query:  ELYGKILDNSNVISTTIIESCCLKETEKVWNELYPQEPFSFNY---FNLQEDDVPKDQLSQLEKYTKYDLVSAVKRQTPFFYQVSQPHMKNEDFLQEAVA
        ELYGKILDNSNV+ST  + S C +ETE+VWNELYP+E F+FN+      QED +    LS LEKYTKYDLVSAVKRQ+PFFYQVS+PHM NE FLQEAVA
Subjt:  ELYGKILDNSNVISTTIIESCCLKETEKVWNELYPQEPFSFNY---FNLQEDDVPKDQLSQLEKYTKYDLVSAVKRQTPFFYQVSQPHMKNEDFLQEAVA

Query:  RYRGFLYLIKSAMDESVARFCVPTYDIDLIWHTHQLHPLSYCKDMKKLLGLVLEHDDTVTDRSVGKQLDIGFTGTTKQWDDTFGTRYWKIEPMYRGPAPA
        RY+GFLYLIKS  ++S+ RFCVPTYDIDLIWHTHQLHP+SYCKD+K LLG++LEHDD  +DR+ GK+LD GF+GTTKQW+DTFGTRYWK   MYRG +P+
Subjt:  RYRGFLYLIKSAMDESVARFCVPTYDIDLIWHTHQLHPLSYCKDMKKLLGLVLEHDDTVTDRSVGKQLDIGFTGTTKQWDDTFGTRYWKIEPMYRGPAPA

XP_023007683.1 glycine-rich domain-containing protein 1-like isoform X2 [Cucurbita maxima]4.1e-13477Show/hide
Query:  MEKNKELEWAKAQKIEIGVDLVAAAKRQLQFLSAVDSDQFLHEGPTLDRAIYRYNAYWLPLLAKHSESPLFEGPLVVPLDCEWIWHCHRLNPVQYISDCE
        MEKN+ELEWA+AQ+IEIGVDLVAAAKRQLQFLSAVD ++FL+EGP+L+RAIYRYNAYWLPLLAKHSESPLFEGPLVVP DCEWIWHCHRLNPV+Y SDCE
Subjt:  MEKNKELEWAKAQKIEIGVDLVAAAKRQLQFLSAVDSDQFLHEGPTLDRAIYRYNAYWLPLLAKHSESPLFEGPLVVPLDCEWIWHCHRLNPVQYISDCE

Query:  ELYGKILDNSNVISTTIIESCCLKETEKVWNELYPQEPFSFNY---FNLQEDDVPKDQLSQLEKYTKYDLVSAVKRQTPFFYQVSQPHMKNEDFLQEAVA
        ELYGKILDNSNV+ST  + S C +ETE+VWNELYP+E F+FN+      QED +    LS LEKYTKYDLVSAVKRQ+PFFYQVS+PHM NE FLQEAVA
Subjt:  ELYGKILDNSNVISTTIIESCCLKETEKVWNELYPQEPFSFNY---FNLQEDDVPKDQLSQLEKYTKYDLVSAVKRQTPFFYQVSQPHMKNEDFLQEAVA

Query:  RYRGFLYLIKSAMDESVARFCVPTYDIDLIWHTHQLHPLSYCKDMKKLLGLVLEHDDTVTDRSVGKQLDIGFTGTTKQWDDTFGTRYWKIEPMYRGPAPA
        RY+GFLYLIKS  ++S+ RFCVPTYDIDLIWHTHQLHP+SYCKD+K LLG++LEHDD  +DR+ GK+LD GF+GTTKQW+DTFGTRYWK   MYRG +P+
Subjt:  RYRGFLYLIKSAMDESVARFCVPTYDIDLIWHTHQLHPLSYCKDMKKLLGLVLEHDDTVTDRSVGKQLDIGFTGTTKQWDDTFGTRYWKIEPMYRGPAPA

XP_023552041.1 glycine-rich domain-containing protein 1-like [Cucurbita pepo subsp. pepo]3.1e-13476.67Show/hide
Query:  MEKNKELEWAKAQKIEIGVDLVAAAKRQLQFLSAVDSDQFLHEGPTLDRAIYRYNAYWLPLLAKHSESPLFEGPLVVPLDCEWIWHCHRLNPVQYISDCE
        MEKN+ELEWA+AQ+IEIGVDLVAAAKRQLQFLSAVD ++FL+EGP+L+RAIYRYNAYWLPLLAKHSESPLFEGPL VP DCEWIWHCHRLNPV+Y S+CE
Subjt:  MEKNKELEWAKAQKIEIGVDLVAAAKRQLQFLSAVDSDQFLHEGPTLDRAIYRYNAYWLPLLAKHSESPLFEGPLVVPLDCEWIWHCHRLNPVQYISDCE

Query:  ELYGKILDNSNVISTTIIESCCLKETEKVWNELYPQEPFSFNY---FNLQEDDVPKDQLSQLEKYTKYDLVSAVKRQTPFFYQVSQPHMKNEDFLQEAVA
        ELYGKILDNSNV+ST  + S CL+ETE+VWNELYP+EPF+FN+      QED +    LS L+KYTKYDLVSAVKRQ+PFFYQVS+PHM NE FL+EAVA
Subjt:  ELYGKILDNSNVISTTIIESCCLKETEKVWNELYPQEPFSFNY---FNLQEDDVPKDQLSQLEKYTKYDLVSAVKRQTPFFYQVSQPHMKNEDFLQEAVA

Query:  RYRGFLYLIKSAMDESVARFCVPTYDIDLIWHTHQLHPLSYCKDMKKLLGLVLEHDDTVTDRSVGKQLDIGFTGTTKQWDDTFGTRYWKIEPMYRGPAPA
        RY+GFLYLIKS  + S+ RFCVPTYDIDLIWHTHQLHP+SYCKD+K LLG+VLEHDD  +DR+ GK+LD GF+GTTKQW+DTFGTRYWK   MYRG +P+
Subjt:  RYRGFLYLIKSAMDESVARFCVPTYDIDLIWHTHQLHPLSYCKDMKKLLGLVLEHDDTVTDRSVGKQLDIGFTGTTKQWDDTFGTRYWKIEPMYRGPAPA

XP_038876850.1 uncharacterized protein LOC120069217 [Benincasa hispida]1.6e-16767.28Show/hide
Query:  IIVFPHLCVYTKTVFITTE-IREIINLYQV---PSMKMEKNKELEWAKAQKIEIGVDLVAAAKRQLQFLSAVDSDQFLHEGPTLDRAIYRYNAYWLPLLA
        +I+F  +C       +T E IR  + LY +      K+EKNKELEW KAQKIEIGVDLVAAAK QLQFLS VDS  FLHEGP+LDRAIYRYNAYWLPLLA
Subjt:  IIVFPHLCVYTKTVFITTE-IREIINLYQV---PSMKMEKNKELEWAKAQKIEIGVDLVAAAKRQLQFLSAVDSDQFLHEGPTLDRAIYRYNAYWLPLLA

Query:  KHSESPLFEGPLVVPLDCEWIWHCHRLNPVQYISDCEELYGKILDNSNVISTTIIES-CCLKETEKVWNELYPQEPFSFNYFNLQEDDVPKD-QLSQLEK
        KHSESPLFEGPLVVPLDCEWIWH HRLNPVQYI+DCEELYGKILDNSNVISTT++ES C +KETE +WNELYP+EPFSFN FN QE DVPKD QLSQLEK
Subjt:  KHSESPLFEGPLVVPLDCEWIWHCHRLNPVQYISDCEELYGKILDNSNVISTTIIES-CCLKETEKVWNELYPQEPFSFNYFNLQEDDVPKD-QLSQLEK

Query:  YTKYDLVSAVKRQTPFFYQVSQPHMKNEDFLQEAVARYRGFLYLIKSAMDESVARFCVPTYDIDLIWHTHQLHPLSYCKDMKKLLGLVLEHDDTVTDRSV
        YTKYDLV AVKRQTPFFYQVSQPHMKNE+FLQEA+ARYRGFLYLIKS+MDESV  FCVPTYDIDLIWHTHQLHPL YCKDMKKLLGLVLEHDDTVTDR+V
Subjt:  YTKYDLVSAVKRQTPFFYQVSQPHMKNEDFLQEAVARYRGFLYLIKSAMDESVARFCVPTYDIDLIWHTHQLHPLSYCKDMKKLLGLVLEHDDTVTDRSV

Query:  GKQLDIGFTGTTKQWDDTFGTRYWKIEPMYRGPAPAGSCGSKVQGGGCTAAAGSCGSKVQGGGCTAAAGSCGSKVEGGGSTPAGSCGSKVQGGGCTAAAG
        G++LDIGFTGTTKQWDDTFGT Y KI  MYRGPAP      ++QGG            V+ G C         K+E  G   AG CGSKV+  G TA AG
Subjt:  GKQLDIGFTGTTKQWDDTFGTRYWKIEPMYRGPAPAGSCGSKVQGGGCTAAAGSCGSKVQGGGCTAAAGSCGSKVEGGGSTPAGSCGSKVQGGGCTAAAG

Query:  SCGSKVHGGGCTAAAGSCGSKVQGGGSTPAGSCGSKVQGGGCTAAAGSCGSKVLGGGCTAAAGSCGSKVQGGGSTPAGSCGSKVHGGGCTAAAGSCGSKV
         CGSKV   G    AG CGSKV   G T A   GSKV G      A  CGS +     T A G CGSKV  GG T A +CGSKV   G T AAG CGSKV
Subjt:  SCGSKVHGGGCTAAAGSCGSKVQGGGSTPAGSCGSKVQGGGCTAAAGSCGSKVLGGGCTAAAGSCGSKVQGGGSTPAGSCGSKVHGGGCTAAAGSCGSKV

Query:  HGGGCTAAAGSCGSKVEGGGCTAAAGSCGSKVQGGGSTPAGSCGSKV
          GG  AAAG CGSK+E        G CGSK++  G T AG C  +V
Subjt:  HGGGCTAAAGSCGSKVEGGGCTAAAGSCGSKVQGGGSTPAGSCGSKV

TrEMBL top hitse value%identityAlignment
A0A1S3BLM1 glycine-rich domain-containing protein 1-like4.4e-13476.82Show/hide
Query:  MEKNKELEWAKAQKIEIGVDLVAAAKRQLQFLSAVDSDQFLHEGPTLDRAIYRYNAYWLPLLAKHSESPLFEGPLVVPLDCEWIWHCHRLNPVQYISDCE
        MEKN+ELEW +AQ+IEIGVDLVAAAKRQLQFLSAV+ ++FL+E P+L+RAIYRYNAYWLPLLAKHSESPLF+GPLVVP DCEWIWHCHRLNPV+Y SDCE
Subjt:  MEKNKELEWAKAQKIEIGVDLVAAAKRQLQFLSAVDSDQFLHEGPTLDRAIYRYNAYWLPLLAKHSESPLFEGPLVVPLDCEWIWHCHRLNPVQYISDCE

Query:  ELYGKILDNSNVISTTIIESCCLKETEKVWNELYPQEPFSFNYFNL-----QEDDVPKDQLSQLEKYTKYDLVSAVKRQTPFFYQVSQPHMKNEDFLQEA
        ELYGKILDNSNVIST  I S C +ETEKVWNELYP+EPF+FN FN       ++D+  + LS L+KYTKYDLVSAVKRQ+PFFYQVS+PHM NE FLQEA
Subjt:  ELYGKILDNSNVISTTIIESCCLKETEKVWNELYPQEPFSFNYFNL-----QEDDVPKDQLSQLEKYTKYDLVSAVKRQTPFFYQVSQPHMKNEDFLQEA

Query:  VARYRGFLYLIKSAMDESVARFCVPTYDIDLIWHTHQLHPLSYCKDMKKLLGLVLEHDDTVTDRSVGKQLDIGFTGTTKQWDDTFGTRYWKIEPMYRGPA
        VARY+GFLYLIKS  ++S+ RFCVPTYDIDLIWH+HQLHPLSYCKD+KK+LG VLEHDDT +DR+ GK+LD GF+GTTKQW+DTFGTRYWK   MYRG  
Subjt:  VARYRGFLYLIKSAMDESVARFCVPTYDIDLIWHTHQLHPLSYCKDMKKLLGLVLEHDDTVTDRSVGKQLDIGFTGTTKQWDDTFGTRYWKIEPMYRGPA

Query:  PA
        P+
Subjt:  PA

A0A5D3D7F4 Glycine-rich domain-containing protein 1-like1.2e-13477.59Show/hide
Query:  MEKNKELEWAKAQKIEIGVDLVAAAKRQLQFLSAVDSDQFLHEGPTLDRAIYRYNAYWLPLLAKHSESPLFEGPLVVPLDCEWIWHCHRLNPVQYISDCE
        MEKN+ELEW +AQ+IEIGVDLVAAAKRQLQFLSAV+ ++FL+E P+L+RAIYRYNAYWLPLLAKHSESPLF+GPLVVP DCEWIWHCHRLNPV+Y SDCE
Subjt:  MEKNKELEWAKAQKIEIGVDLVAAAKRQLQFLSAVDSDQFLHEGPTLDRAIYRYNAYWLPLLAKHSESPLFEGPLVVPLDCEWIWHCHRLNPVQYISDCE

Query:  ELYGKILDNSNVISTTIIESCCLKETEKVWNELYPQEPFSFNYFNLQEDDVP--KDQLSQLEKYTKYDLVSAVKRQTPFFYQVSQPHMKNEDFLQEAVAR
        ELYGKILDNSNVIST  I S C +ETEKVWNELYP+EPF+FN FN   D +    + LS L+KYTKYDLVSAVKRQ+PFFYQVS+PHM NE FLQEAVAR
Subjt:  ELYGKILDNSNVISTTIIESCCLKETEKVWNELYPQEPFSFNYFNLQEDDVP--KDQLSQLEKYTKYDLVSAVKRQTPFFYQVSQPHMKNEDFLQEAVAR

Query:  YRGFLYLIKSAMDESVARFCVPTYDIDLIWHTHQLHPLSYCKDMKKLLGLVLEHDDTVTDRSVGKQLDIGFTGTTKQWDDTFGTRYWKIEPMYRGPAPA
        Y+GFLYLIKS  ++S+ RFCVPTYDIDLIWH+HQLHPLSYCKD+KK+LG VLEHDDT +DR+ GK+LD GF+GTTKQW+DTFGTRYWK   MYRG  P+
Subjt:  YRGFLYLIKSAMDESVARFCVPTYDIDLIWHTHQLHPLSYCKDMKKLLGLVLEHDDTVTDRSVGKQLDIGFTGTTKQWDDTFGTRYWKIEPMYRGPAPA

A0A6J1KZD5 glycine-rich domain-containing protein 1-like isoform X12.0e-13477Show/hide
Query:  MEKNKELEWAKAQKIEIGVDLVAAAKRQLQFLSAVDSDQFLHEGPTLDRAIYRYNAYWLPLLAKHSESPLFEGPLVVPLDCEWIWHCHRLNPVQYISDCE
        MEKN+ELEWA+AQ+IEIGVDLVAAAKRQLQFLSAVD ++FL+EGP+L+RAIYRYNAYWLPLLAKHSESPLFEGPLVVP DCEWIWHCHRLNPV+Y SDCE
Subjt:  MEKNKELEWAKAQKIEIGVDLVAAAKRQLQFLSAVDSDQFLHEGPTLDRAIYRYNAYWLPLLAKHSESPLFEGPLVVPLDCEWIWHCHRLNPVQYISDCE

Query:  ELYGKILDNSNVISTTIIESCCLKETEKVWNELYPQEPFSFNY---FNLQEDDVPKDQLSQLEKYTKYDLVSAVKRQTPFFYQVSQPHMKNEDFLQEAVA
        ELYGKILDNSNV+ST  + S C +ETE+VWNELYP+E F+FN+      QED +    LS LEKYTKYDLVSAVKRQ+PFFYQVS+PHM NE FLQEAVA
Subjt:  ELYGKILDNSNVISTTIIESCCLKETEKVWNELYPQEPFSFNY---FNLQEDDVPKDQLSQLEKYTKYDLVSAVKRQTPFFYQVSQPHMKNEDFLQEAVA

Query:  RYRGFLYLIKSAMDESVARFCVPTYDIDLIWHTHQLHPLSYCKDMKKLLGLVLEHDDTVTDRSVGKQLDIGFTGTTKQWDDTFGTRYWKIEPMYRGPAPA
        RY+GFLYLIKS  ++S+ RFCVPTYDIDLIWHTHQLHP+SYCKD+K LLG++LEHDD  +DR+ GK+LD GF+GTTKQW+DTFGTRYWK   MYRG +P+
Subjt:  RYRGFLYLIKSAMDESVARFCVPTYDIDLIWHTHQLHPLSYCKDMKKLLGLVLEHDDTVTDRSVGKQLDIGFTGTTKQWDDTFGTRYWKIEPMYRGPAPA

A0A6J1L3N7 glycine-rich domain-containing protein 1-like isoform X32.0e-13477Show/hide
Query:  MEKNKELEWAKAQKIEIGVDLVAAAKRQLQFLSAVDSDQFLHEGPTLDRAIYRYNAYWLPLLAKHSESPLFEGPLVVPLDCEWIWHCHRLNPVQYISDCE
        MEKN+ELEWA+AQ+IEIGVDLVAAAKRQLQFLSAVD ++FL+EGP+L+RAIYRYNAYWLPLLAKHSESPLFEGPLVVP DCEWIWHCHRLNPV+Y SDCE
Subjt:  MEKNKELEWAKAQKIEIGVDLVAAAKRQLQFLSAVDSDQFLHEGPTLDRAIYRYNAYWLPLLAKHSESPLFEGPLVVPLDCEWIWHCHRLNPVQYISDCE

Query:  ELYGKILDNSNVISTTIIESCCLKETEKVWNELYPQEPFSFNY---FNLQEDDVPKDQLSQLEKYTKYDLVSAVKRQTPFFYQVSQPHMKNEDFLQEAVA
        ELYGKILDNSNV+ST  + S C +ETE+VWNELYP+E F+FN+      QED +    LS LEKYTKYDLVSAVKRQ+PFFYQVS+PHM NE FLQEAVA
Subjt:  ELYGKILDNSNVISTTIIESCCLKETEKVWNELYPQEPFSFNY---FNLQEDDVPKDQLSQLEKYTKYDLVSAVKRQTPFFYQVSQPHMKNEDFLQEAVA

Query:  RYRGFLYLIKSAMDESVARFCVPTYDIDLIWHTHQLHPLSYCKDMKKLLGLVLEHDDTVTDRSVGKQLDIGFTGTTKQWDDTFGTRYWKIEPMYRGPAPA
        RY+GFLYLIKS  ++S+ RFCVPTYDIDLIWHTHQLHP+SYCKD+K LLG++LEHDD  +DR+ GK+LD GF+GTTKQW+DTFGTRYWK   MYRG +P+
Subjt:  RYRGFLYLIKSAMDESVARFCVPTYDIDLIWHTHQLHPLSYCKDMKKLLGLVLEHDDTVTDRSVGKQLDIGFTGTTKQWDDTFGTRYWKIEPMYRGPAPA

A0A6J1L8C3 glycine-rich domain-containing protein 1-like isoform X22.0e-13477Show/hide
Query:  MEKNKELEWAKAQKIEIGVDLVAAAKRQLQFLSAVDSDQFLHEGPTLDRAIYRYNAYWLPLLAKHSESPLFEGPLVVPLDCEWIWHCHRLNPVQYISDCE
        MEKN+ELEWA+AQ+IEIGVDLVAAAKRQLQFLSAVD ++FL+EGP+L+RAIYRYNAYWLPLLAKHSESPLFEGPLVVP DCEWIWHCHRLNPV+Y SDCE
Subjt:  MEKNKELEWAKAQKIEIGVDLVAAAKRQLQFLSAVDSDQFLHEGPTLDRAIYRYNAYWLPLLAKHSESPLFEGPLVVPLDCEWIWHCHRLNPVQYISDCE

Query:  ELYGKILDNSNVISTTIIESCCLKETEKVWNELYPQEPFSFNY---FNLQEDDVPKDQLSQLEKYTKYDLVSAVKRQTPFFYQVSQPHMKNEDFLQEAVA
        ELYGKILDNSNV+ST  + S C +ETE+VWNELYP+E F+FN+      QED +    LS LEKYTKYDLVSAVKRQ+PFFYQVS+PHM NE FLQEAVA
Subjt:  ELYGKILDNSNVISTTIIESCCLKETEKVWNELYPQEPFSFNY---FNLQEDDVPKDQLSQLEKYTKYDLVSAVKRQTPFFYQVSQPHMKNEDFLQEAVA

Query:  RYRGFLYLIKSAMDESVARFCVPTYDIDLIWHTHQLHPLSYCKDMKKLLGLVLEHDDTVTDRSVGKQLDIGFTGTTKQWDDTFGTRYWKIEPMYRGPAPA
        RY+GFLYLIKS  ++S+ RFCVPTYDIDLIWHTHQLHP+SYCKD+K LLG++LEHDD  +DR+ GK+LD GF+GTTKQW+DTFGTRYWK   MYRG +P+
Subjt:  RYRGFLYLIKSAMDESVARFCVPTYDIDLIWHTHQLHPLSYCKDMKKLLGLVLEHDDTVTDRSVGKQLDIGFTGTTKQWDDTFGTRYWKIEPMYRGPAPA

SwissProt top hitse value%identityAlignment
Q9SZJ2 Glycine-rich domain-containing protein 22.0e-10761.2Show/hide
Query:  MKMEKNKELEWAKAQKIEIGVDLVAAAKRQLQFLSAVDSDQFLHEGPTLDRAIYRYNAYWLPLLAKHSE-SPLFEGPLVVPLDCEWIWHCHRLNPVQYIS
        M  EK + LEW +AQKI+I VDL+AAAK+ L FL AVD ++ L++GP L RAIYRYNAYWLPLLA+++E S + +GPLV PLDCEW+WHCHRLNPV+Y +
Subjt:  MKMEKNKELEWAKAQKIEIGVDLVAAAKRQLQFLSAVDSDQFLHEGPTLDRAIYRYNAYWLPLLAKHSE-SPLFEGPLVVPLDCEWIWHCHRLNPVQYIS

Query:  DCEELYGKILDNSNVISTTIIESCCLKETEKVWNELYPQEPFSFNYFNLQEDDVPKDQLSQLEKYTKYDLVSAVKRQTPFFYQVSQPHMKNEDFLQEAVA
        DCE+ YG++LDNS V+S+  +   C  +TE +W  LYP EP+  ++ N   +  P D +S LEK T YDLV AVKRQ+PFFYQVS+ H+ N+ FLQEAVA
Subjt:  DCEELYGKILDNSNVISTTIIESCCLKETEKVWNELYPQEPFSFNYFNLQEDDVPKDQLSQLEKYTKYDLVSAVKRQTPFFYQVSQPHMKNEDFLQEAVA

Query:  RYRGFLYLIKSAMDESVARFCVPTYDIDLIWHTHQLHPLSYCKDMKKLLGLVLEHDDTVTDRSVGKQLDIGFTGTTKQWDDTFGTRYWKIEPMYRGPAP
        RY+ FLYLIK   + S+  FCVPTYDIDLIWHTHQLH +SYC D+ K++G VLEHDDT +DRS GK+LD GF+GTT QW++TFG RYWK   M RG  P
Subjt:  RYRGFLYLIKSAMDESVARFCVPTYDIDLIWHTHQLHPLSYCKDMKKLLGLVLEHDDTVTDRSVGKQLDIGFTGTTKQWDDTFGTRYWKIEPMYRGPAP

Q9ZQ47 Glycine-rich domain-containing protein 12.5e-11062.21Show/hide
Query:  MKMEKNKELEWAKAQKIEIGVDLVAAAKRQLQFLSAVDSDQFLHEGPTLDRAIYRYNAYWLPLLAKHSE-SPLFEGPLVVPLDCEWIWHCHRLNPVQYIS
        M  EK+ E+EW +AQKIEI VDL+AAAK+ L FL  VD +++L++GP L++AIYRYNA WLPLL K+SE S + EG LV PLDCEWIWHCHRLNPV+Y S
Subjt:  MKMEKNKELEWAKAQKIEIGVDLVAAAKRQLQFLSAVDSDQFLHEGPTLDRAIYRYNAYWLPLLAKHSE-SPLFEGPLVVPLDCEWIWHCHRLNPVQYIS

Query:  DCEELYGKILDNSNVISTTIIESCCLKETEKVWNELYPQEPFSFNYFNLQEDDVPKDQLSQLEKYTKYDLVSAVKRQTPFFYQVSQPHMKNEDFLQEAVA
        DCE+ YG++LDNS V+S+  ++  C  +TE +W  LYP EP+  +  N+  +D+  ++ S LEK TKYDLVSAVKRQ+PF+YQVS+ H+ ++ FLQEAVA
Subjt:  DCEELYGKILDNSNVISTTIIESCCLKETEKVWNELYPQEPFSFNYFNLQEDDVPKDQLSQLEKYTKYDLVSAVKRQTPFFYQVSQPHMKNEDFLQEAVA

Query:  RYRGFLYLIKSAMDESVARFCVPTYDIDLIWHTHQLHPLSYCKDMKKLLGLVLEHDDTVTDRSVGKQLDIGFTGTTKQWDDTFGTRYWKIEPMYRGPAP
        RY+GFLYLIK   + S+ RFCVPTYD+DLIWHTHQLHP+SYC DM KL+G VLEHDDT +DR  GK+LD GF+ TT QW++TFGTRYWK   M+RG  P
Subjt:  RYRGFLYLIKSAMDESVARFCVPTYDIDLIWHTHQLHPLSYCKDMKKLLGLVLEHDDTVTDRSVGKQLDIGFTGTTKQWDDTFGTRYWKIEPMYRGPAP

Arabidopsis top hitse value%identityAlignment
AT1G56230.1 Protein of unknown function (DUF1399)3.7e-3230.96Show/hide
Query:  EWAKAQKIEIGVDLVAAAKRQLQFLSAVDSDQFLHEGPTLDRAIYRYNAYWLPLLAKHSESPLFEGPLVV-PLDCEWIWHCHRLNPVQYISDCEELYGKI
        E ++   + IG D++++A+R +  L +V   Q+LH  P +  AI RY+  W+PL++  +     + P+++ PLD EW+W CH LNPV Y   CE  + K+
Subjt:  EWAKAQKIEIGVDLVAAAKRQLQFLSAVDSDQFLHEGPTLDRAIYRYNAYWLPLLAKHSESPLFEGPLVV-PLDCEWIWHCHRLNPVQYISDCEELYGKI

Query:  LDNSNVISTTIIESCCLKETEKVWNELYPQEPFSFNYFNLQEDDVPKDQLSQLEKYTKYDLVSAVKRQTPFFYQVSQPHMKNEDFLQEAVARYRGFLYLI
        +    +      E   + + EK+W+  YP E F        E+    D L  +    + D+ S VK+Q   + + S P+M    +L  A  RY+GFL ++
Subjt:  LDNSNVISTTIIESCCLKETEKVWNELYPQEPFSFNYFNLQEDDVPKDQLSQLEKYTKYDLVSAVKRQTPFFYQVSQPHMKNEDFLQEAVARYRGFLYLI

Query:  KSAMDESVARFCVPTYDIDLIWHTHQLHPLSYCKDMKKLLGLVLEHDDTVTDRSVGKQLDIGFTGTTKQ-WDDTFGTRYWK
            DE      +P  DI L+W THQ +P  Y  D+ ++L      + T     VG++++     TTK+ WD  F   Y K
Subjt:  KSAMDESVARFCVPTYDIDLIWHTHQLHPLSYCKDMKKLLGLVLEHDDTVTDRSVGKQLDIGFTGTTKQ-WDDTFGTRYWK

AT1G56230.2 Protein of unknown function (DUF1399)3.7e-3230.96Show/hide
Query:  EWAKAQKIEIGVDLVAAAKRQLQFLSAVDSDQFLHEGPTLDRAIYRYNAYWLPLLAKHSESPLFEGPLVV-PLDCEWIWHCHRLNPVQYISDCEELYGKI
        E ++   + IG D++++A+R +  L +V   Q+LH  P +  AI RY+  W+PL++  +     + P+++ PLD EW+W CH LNPV Y   CE  + K+
Subjt:  EWAKAQKIEIGVDLVAAAKRQLQFLSAVDSDQFLHEGPTLDRAIYRYNAYWLPLLAKHSESPLFEGPLVV-PLDCEWIWHCHRLNPVQYISDCEELYGKI

Query:  LDNSNVISTTIIESCCLKETEKVWNELYPQEPFSFNYFNLQEDDVPKDQLSQLEKYTKYDLVSAVKRQTPFFYQVSQPHMKNEDFLQEAVARYRGFLYLI
        +    +      E   + + EK+W+  YP E F        E+    D L  +    + D+ S VK+Q   + + S P+M    +L  A  RY+GFL ++
Subjt:  LDNSNVISTTIIESCCLKETEKVWNELYPQEPFSFNYFNLQEDDVPKDQLSQLEKYTKYDLVSAVKRQTPFFYQVSQPHMKNEDFLQEAVARYRGFLYLI

Query:  KSAMDESVARFCVPTYDIDLIWHTHQLHPLSYCKDMKKLLGLVLEHDDTVTDRSVGKQLDIGFTGTTKQ-WDDTFGTRYWK
            DE      +P  DI L+W THQ +P  Y  D+ ++L      + T     VG++++     TTK+ WD  F   Y K
Subjt:  KSAMDESVARFCVPTYDIDLIWHTHQLHPLSYCKDMKKLLGLVLEHDDTVTDRSVGKQLDIGFTGTTKQ-WDDTFGTRYWK

AT2G22660.1 Protein of unknown function (duplicated DUF1399)1.8e-11162.21Show/hide
Query:  MKMEKNKELEWAKAQKIEIGVDLVAAAKRQLQFLSAVDSDQFLHEGPTLDRAIYRYNAYWLPLLAKHSE-SPLFEGPLVVPLDCEWIWHCHRLNPVQYIS
        M  EK+ E+EW +AQKIEI VDL+AAAK+ L FL  VD +++L++GP L++AIYRYNA WLPLL K+SE S + EG LV PLDCEWIWHCHRLNPV+Y S
Subjt:  MKMEKNKELEWAKAQKIEIGVDLVAAAKRQLQFLSAVDSDQFLHEGPTLDRAIYRYNAYWLPLLAKHSE-SPLFEGPLVVPLDCEWIWHCHRLNPVQYIS

Query:  DCEELYGKILDNSNVISTTIIESCCLKETEKVWNELYPQEPFSFNYFNLQEDDVPKDQLSQLEKYTKYDLVSAVKRQTPFFYQVSQPHMKNEDFLQEAVA
        DCE+ YG++LDNS V+S+  ++  C  +TE +W  LYP EP+  +  N+  +D+  ++ S LEK TKYDLVSAVKRQ+PF+YQVS+ H+ ++ FLQEAVA
Subjt:  DCEELYGKILDNSNVISTTIIESCCLKETEKVWNELYPQEPFSFNYFNLQEDDVPKDQLSQLEKYTKYDLVSAVKRQTPFFYQVSQPHMKNEDFLQEAVA

Query:  RYRGFLYLIKSAMDESVARFCVPTYDIDLIWHTHQLHPLSYCKDMKKLLGLVLEHDDTVTDRSVGKQLDIGFTGTTKQWDDTFGTRYWKIEPMYRGPAP
        RY+GFLYLIK   + S+ RFCVPTYD+DLIWHTHQLHP+SYC DM KL+G VLEHDDT +DR  GK+LD GF+ TT QW++TFGTRYWK   M+RG  P
Subjt:  RYRGFLYLIKSAMDESVARFCVPTYDIDLIWHTHQLHPLSYCKDMKKLLGLVLEHDDTVTDRSVGKQLDIGFTGTTKQWDDTFGTRYWKIEPMYRGPAP

AT2G22660.2 Protein of unknown function (duplicated DUF1399)1.8e-11162.21Show/hide
Query:  MKMEKNKELEWAKAQKIEIGVDLVAAAKRQLQFLSAVDSDQFLHEGPTLDRAIYRYNAYWLPLLAKHSE-SPLFEGPLVVPLDCEWIWHCHRLNPVQYIS
        M  EK+ E+EW +AQKIEI VDL+AAAK+ L FL  VD +++L++GP L++AIYRYNA WLPLL K+SE S + EG LV PLDCEWIWHCHRLNPV+Y S
Subjt:  MKMEKNKELEWAKAQKIEIGVDLVAAAKRQLQFLSAVDSDQFLHEGPTLDRAIYRYNAYWLPLLAKHSE-SPLFEGPLVVPLDCEWIWHCHRLNPVQYIS

Query:  DCEELYGKILDNSNVISTTIIESCCLKETEKVWNELYPQEPFSFNYFNLQEDDVPKDQLSQLEKYTKYDLVSAVKRQTPFFYQVSQPHMKNEDFLQEAVA
        DCE+ YG++LDNS V+S+  ++  C  +TE +W  LYP EP+  +  N+  +D+  ++ S LEK TKYDLVSAVKRQ+PF+YQVS+ H+ ++ FLQEAVA
Subjt:  DCEELYGKILDNSNVISTTIIESCCLKETEKVWNELYPQEPFSFNYFNLQEDDVPKDQLSQLEKYTKYDLVSAVKRQTPFFYQVSQPHMKNEDFLQEAVA

Query:  RYRGFLYLIKSAMDESVARFCVPTYDIDLIWHTHQLHPLSYCKDMKKLLGLVLEHDDTVTDRSVGKQLDIGFTGTTKQWDDTFGTRYWKIEPMYRGPAP
        RY+GFLYLIK   + S+ RFCVPTYD+DLIWHTHQLHP+SYC DM KL+G VLEHDDT +DR  GK+LD GF+ TT QW++TFGTRYWK   M+RG  P
Subjt:  RYRGFLYLIKSAMDESVARFCVPTYDIDLIWHTHQLHPLSYCKDMKKLLGLVLEHDDTVTDRSVGKQLDIGFTGTTKQWDDTFGTRYWKIEPMYRGPAP

AT4G37900.1 Protein of unknown function (duplicated DUF1399)1.4e-10861.2Show/hide
Query:  MKMEKNKELEWAKAQKIEIGVDLVAAAKRQLQFLSAVDSDQFLHEGPTLDRAIYRYNAYWLPLLAKHSE-SPLFEGPLVVPLDCEWIWHCHRLNPVQYIS
        M  EK + LEW +AQKI+I VDL+AAAK+ L FL AVD ++ L++GP L RAIYRYNAYWLPLLA+++E S + +GPLV PLDCEW+WHCHRLNPV+Y +
Subjt:  MKMEKNKELEWAKAQKIEIGVDLVAAAKRQLQFLSAVDSDQFLHEGPTLDRAIYRYNAYWLPLLAKHSE-SPLFEGPLVVPLDCEWIWHCHRLNPVQYIS

Query:  DCEELYGKILDNSNVISTTIIESCCLKETEKVWNELYPQEPFSFNYFNLQEDDVPKDQLSQLEKYTKYDLVSAVKRQTPFFYQVSQPHMKNEDFLQEAVA
        DCE+ YG++LDNS V+S+  +   C  +TE +W  LYP EP+  ++ N   +  P D +S LEK T YDLV AVKRQ+PFFYQVS+ H+ N+ FLQEAVA
Subjt:  DCEELYGKILDNSNVISTTIIESCCLKETEKVWNELYPQEPFSFNYFNLQEDDVPKDQLSQLEKYTKYDLVSAVKRQTPFFYQVSQPHMKNEDFLQEAVA

Query:  RYRGFLYLIKSAMDESVARFCVPTYDIDLIWHTHQLHPLSYCKDMKKLLGLVLEHDDTVTDRSVGKQLDIGFTGTTKQWDDTFGTRYWKIEPMYRGPAP
        RY+ FLYLIK   + S+  FCVPTYDIDLIWHTHQLH +SYC D+ K++G VLEHDDT +DRS GK+LD GF+GTT QW++TFG RYWK   M RG  P
Subjt:  RYRGFLYLIKSAMDESVARFCVPTYDIDLIWHTHQLHPLSYCKDMKKLLGLVLEHDDTVTDRSVGKQLDIGFTGTTKQWDDTFGTRYWKIEPMYRGPAP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCGTTTGCCGCATAATAATCCTCGGTACAAAGTGATTCCCACACCTTATGTTCTCAACGCGTCGCGGCTTCAGCGCATAATAGTCTTCCCTCACCTTTGCGTTTACAC
CAAAACTGTTTTCATCACAACAGAAATTAGAGAAATTATCAATTTATATCAAGTTCCATCAATGAAAATGGAGAAGAACAAAGAGCTTGAGTGGGCAAAAGCACAGAAGA
TTGAAATAGGTGTTGATCTTGTAGCTGCTGCTAAACGTCAGCTTCAGTTCCTCTCTGCAGTCGATAGCGATCAGTTCCTTCATGAAGGCCCTACCCTTGATCGGGCTATT
TACAGGTACAATGCTTATTGGCTTCCATTGCTTGCTAAACATTCTGAATCCCCACTATTTGAAGGACCTTTGGTTGTCCCTTTGGACTGTGAATGGATTTGGCATTGCCA
CAGATTGAATCCTGTACAATACATATCTGATTGTGAGGAACTTTATGGAAAGATACTTGACAATTCCAATGTTATATCAACTACCATTATTGAAAGTTGCTGTCTTAAAG
AAACTGAAAAAGTTTGGAATGAATTATACCCTCAAGAGCCCTTCAGCTTCAATTACTTCAACTTACAAGAGGATGATGTCCCAAAAGATCAGCTCTCACAACTTGAAAAA
TACACCAAATATGATCTTGTCTCAGCTGTTAAAAGACAGACCCCCTTCTTCTATCAGGTTTCTCAACCTCACATGAAGAATGAAGATTTCCTTCAAGAAGCTGTGGCTAG
ATACAGAGGATTTCTATATCTAATCAAAAGTGCAATGGATGAGTCTGTTGCTAGGTTTTGTGTTCCAACATATGATATTGATCTAATTTGGCATACCCATCAATTGCACC
CTCTTTCCTATTGCAAAGACATGAAAAAATTACTTGGTTTGGTATTGGAACATGATGATACGGTTACGGATAGATCAGTAGGAAAACAATTGGATATTGGGTTCACTGGA
ACTACAAAACAATGGGATGATACATTTGGTACAAGGTATTGGAAGATCGAACCAATGTATAGAGGCCCAGCTCCTGCTGGTAGTTGCGGAAGCAAGGTACAAGGTGGTGG
ATGCACTGCTGCAGCTGGTAGTTGCGGAAGCAAGGTACAGGGTGGTGGATGCACTGCTGCTGCTGGTAGTTGCGGAAGCAAGGTAGAGGGCGGTGGATCCACTCCTGCTG
GTAGTTGCGGAAGCAAGGTACAGGGTGGTGGATGCACTGCTGCTGCTGGTAGTTGCGGAAGCAAGGTACATGGTGGTGGATGCACTGCTGCTGCTGGTAGTTGCGGAAGC
AAGGTACAGGGCGGTGGATCCACTCCTGCTGGTAGTTGCGGAAGCAAGGTACAGGGTGGTGGATGCACTGCTGCTGCTGGTAGTTGCGGAAGCAAGGTACTTGGTGGTGG
ATGCACTGCTGCTGCTGGTAGTTGCGGAAGCAAGGTACAGGGCGGTGGATCCACTCCTGCTGGTAGTTGCGGAAGCAAGGTACATGGTGGTGGATGCACTGCTGCTGCTG
GTAGTTGCGGAAGCAAGGTACATGGTGGTGGATGCACTGCTGCTGCTGGTAGTTGCGGAAGCAAGGTAGAGGGCGGTGGATGCACTGCTGCTGCTGGTAGTTGCGGAAGC
AAGGTACAGGGCGGTGGATCCACTCCTGCTGGTAGTTGCGGAAGCAAGGTACAGGGTGGTGGATGCACTGCTGCTGCTGGTAGTTGCGGAAGCAAGGTACATGGTGGTGG
ATGCACTGCTACTGCTGGTAGTTGCGGAAGCAAGGTAGAGGGCGGTGGATCCACTCCTGCTGGTAGTTGCGGAAGCAAGGTACAGGGTGGTGGATGCACTGCTGCTGCTG
GTAGTTGCGGAAGCAAGGTAGAGGGCGGTGGATCCACTCCTGCTGGTAGTTGCGGAAGCAAGGTACAGGGTGGTGGATGCACTGCTGCTGCTGGTAGTTGTGGAAGCAAG
GTACATGGTGGTGGATGCACTGCTGCTGCTGGTAGTTGCGGAAGCAAGGTAGAGGGCGGTGGATCCACTCCTGCTGGTAGTTGCGGAAGCAAGGTACAGGGCGGTGGATG
CACTGCTGCTGCTGGTAGTTGCGGAAGCAAGGTTCAGAGCGGTGGATCCACTGCTGCTGCTGGGTGCGGAAGCAAGGTAAAGGGCGGTGGATCCACTCCTGCTGGTAGTT
GCGGAAGCAAGGTACAGGGTGGTGGATGCACTGCTGCTGCTGGTAGTTGCGGAAGCAAGGTACAGAGCGGTGGATGCAATGTTGCTGCTGGGTGCGAAAGCAAGATACAG
AGCGGTGGATGCACTGCTGCTGCTGGGTGCGGAAGCAAGGTACCAAGCGGTGGATGCACTGCTGCTGCTGGATGCAAAAACATCGTAAAGAGCAATGGATGCATTGCTTG
TGCTGGGTGTGGAATCATGGTGTAG
mRNA sequenceShow/hide mRNA sequence
ATGCGTTTGCCGCATAATAATCCTCGGTACAAAGTGATTCCCACACCTTATGTTCTCAACGCGTCGCGGCTTCAGCGCATAATAGTCTTCCCTCACCTTTGCGTTTACAC
CAAAACTGTTTTCATCACAACAGAAATTAGAGAAATTATCAATTTATATCAAGTTCCATCAATGAAAATGGAGAAGAACAAAGAGCTTGAGTGGGCAAAAGCACAGAAGA
TTGAAATAGGTGTTGATCTTGTAGCTGCTGCTAAACGTCAGCTTCAGTTCCTCTCTGCAGTCGATAGCGATCAGTTCCTTCATGAAGGCCCTACCCTTGATCGGGCTATT
TACAGGTACAATGCTTATTGGCTTCCATTGCTTGCTAAACATTCTGAATCCCCACTATTTGAAGGACCTTTGGTTGTCCCTTTGGACTGTGAATGGATTTGGCATTGCCA
CAGATTGAATCCTGTACAATACATATCTGATTGTGAGGAACTTTATGGAAAGATACTTGACAATTCCAATGTTATATCAACTACCATTATTGAAAGTTGCTGTCTTAAAG
AAACTGAAAAAGTTTGGAATGAATTATACCCTCAAGAGCCCTTCAGCTTCAATTACTTCAACTTACAAGAGGATGATGTCCCAAAAGATCAGCTCTCACAACTTGAAAAA
TACACCAAATATGATCTTGTCTCAGCTGTTAAAAGACAGACCCCCTTCTTCTATCAGGTTTCTCAACCTCACATGAAGAATGAAGATTTCCTTCAAGAAGCTGTGGCTAG
ATACAGAGGATTTCTATATCTAATCAAAAGTGCAATGGATGAGTCTGTTGCTAGGTTTTGTGTTCCAACATATGATATTGATCTAATTTGGCATACCCATCAATTGCACC
CTCTTTCCTATTGCAAAGACATGAAAAAATTACTTGGTTTGGTATTGGAACATGATGATACGGTTACGGATAGATCAGTAGGAAAACAATTGGATATTGGGTTCACTGGA
ACTACAAAACAATGGGATGATACATTTGGTACAAGGTATTGGAAGATCGAACCAATGTATAGAGGCCCAGCTCCTGCTGGTAGTTGCGGAAGCAAGGTACAAGGTGGTGG
ATGCACTGCTGCAGCTGGTAGTTGCGGAAGCAAGGTACAGGGTGGTGGATGCACTGCTGCTGCTGGTAGTTGCGGAAGCAAGGTAGAGGGCGGTGGATCCACTCCTGCTG
GTAGTTGCGGAAGCAAGGTACAGGGTGGTGGATGCACTGCTGCTGCTGGTAGTTGCGGAAGCAAGGTACATGGTGGTGGATGCACTGCTGCTGCTGGTAGTTGCGGAAGC
AAGGTACAGGGCGGTGGATCCACTCCTGCTGGTAGTTGCGGAAGCAAGGTACAGGGTGGTGGATGCACTGCTGCTGCTGGTAGTTGCGGAAGCAAGGTACTTGGTGGTGG
ATGCACTGCTGCTGCTGGTAGTTGCGGAAGCAAGGTACAGGGCGGTGGATCCACTCCTGCTGGTAGTTGCGGAAGCAAGGTACATGGTGGTGGATGCACTGCTGCTGCTG
GTAGTTGCGGAAGCAAGGTACATGGTGGTGGATGCACTGCTGCTGCTGGTAGTTGCGGAAGCAAGGTAGAGGGCGGTGGATGCACTGCTGCTGCTGGTAGTTGCGGAAGC
AAGGTACAGGGCGGTGGATCCACTCCTGCTGGTAGTTGCGGAAGCAAGGTACAGGGTGGTGGATGCACTGCTGCTGCTGGTAGTTGCGGAAGCAAGGTACATGGTGGTGG
ATGCACTGCTACTGCTGGTAGTTGCGGAAGCAAGGTAGAGGGCGGTGGATCCACTCCTGCTGGTAGTTGCGGAAGCAAGGTACAGGGTGGTGGATGCACTGCTGCTGCTG
GTAGTTGCGGAAGCAAGGTAGAGGGCGGTGGATCCACTCCTGCTGGTAGTTGCGGAAGCAAGGTACAGGGTGGTGGATGCACTGCTGCTGCTGGTAGTTGTGGAAGCAAG
GTACATGGTGGTGGATGCACTGCTGCTGCTGGTAGTTGCGGAAGCAAGGTAGAGGGCGGTGGATCCACTCCTGCTGGTAGTTGCGGAAGCAAGGTACAGGGCGGTGGATG
CACTGCTGCTGCTGGTAGTTGCGGAAGCAAGGTTCAGAGCGGTGGATCCACTGCTGCTGCTGGGTGCGGAAGCAAGGTAAAGGGCGGTGGATCCACTCCTGCTGGTAGTT
GCGGAAGCAAGGTACAGGGTGGTGGATGCACTGCTGCTGCTGGTAGTTGCGGAAGCAAGGTACAGAGCGGTGGATGCAATGTTGCTGCTGGGTGCGAAAGCAAGATACAG
AGCGGTGGATGCACTGCTGCTGCTGGGTGCGGAAGCAAGGTACCAAGCGGTGGATGCACTGCTGCTGCTGGATGCAAAAACATCGTAAAGAGCAATGGATGCATTGCTTG
TGCTGGGTGTGGAATCATGGTGTAG
Protein sequenceShow/hide protein sequence
MRLPHNNPRYKVIPTPYVLNASRLQRIIVFPHLCVYTKTVFITTEIREIINLYQVPSMKMEKNKELEWAKAQKIEIGVDLVAAAKRQLQFLSAVDSDQFLHEGPTLDRAI
YRYNAYWLPLLAKHSESPLFEGPLVVPLDCEWIWHCHRLNPVQYISDCEELYGKILDNSNVISTTIIESCCLKETEKVWNELYPQEPFSFNYFNLQEDDVPKDQLSQLEK
YTKYDLVSAVKRQTPFFYQVSQPHMKNEDFLQEAVARYRGFLYLIKSAMDESVARFCVPTYDIDLIWHTHQLHPLSYCKDMKKLLGLVLEHDDTVTDRSVGKQLDIGFTG
TTKQWDDTFGTRYWKIEPMYRGPAPAGSCGSKVQGGGCTAAAGSCGSKVQGGGCTAAAGSCGSKVEGGGSTPAGSCGSKVQGGGCTAAAGSCGSKVHGGGCTAAAGSCGS
KVQGGGSTPAGSCGSKVQGGGCTAAAGSCGSKVLGGGCTAAAGSCGSKVQGGGSTPAGSCGSKVHGGGCTAAAGSCGSKVHGGGCTAAAGSCGSKVEGGGCTAAAGSCGS
KVQGGGSTPAGSCGSKVQGGGCTAAAGSCGSKVHGGGCTATAGSCGSKVEGGGSTPAGSCGSKVQGGGCTAAAGSCGSKVEGGGSTPAGSCGSKVQGGGCTAAAGSCGSK
VHGGGCTAAAGSCGSKVEGGGSTPAGSCGSKVQGGGCTAAAGSCGSKVQSGGSTAAAGCGSKVKGGGSTPAGSCGSKVQGGGCTAAAGSCGSKVQSGGCNVAAGCESKIQ
SGGCTAAAGCGSKVPSGGCTAAAGCKNIVKSNGCIACAGCGIMV