; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI03G47180 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI03G47180
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
Descriptionglycine-rich protein
Genome locationChr3:40204969..40209139
RNA-Seq ExpressionCSPI03G47180
SyntenyCSPI03G47180
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAE8651460.1 hypothetical protein Csa_001417 [Cucumis sativus]7.1e-22096.76Show/hide
Query:  MCSSLSHWHIGQYIVWGCLYMSVISLNSLQYESGNVFSNNLQHEFRPVTGNGSRNISPILFSSSSHFVSCEDLGGVGSFNTTCLLNTNLSLYSDFYISGT
        MCSSLSHWHIGQYIVWGCLYMSVISLNSLQYESGNVFSN+LQHEFRPVTGNGSRNISPILFSSSSHFVSCEDLGGVGSFNTTCLLNTNLSLYSDFYISGT
Subjt:  MCSSLSHWHIGQYIVWGCLYMSVISLNSLQYESGNVFSNNLQHEFRPVTGNGSRNISPILFSSSSHFVSCEDLGGVGSFNTTCLLNTNLSLYSDFYISGT

Query:  GNLEILPHVAICCPIEGCTITLNMSGNIKVSHHAGVVAGSVVFSAANLTMEYNSYINTTSLGGAPPSQTSGTPFGYDGSGGGHGGRGASCFKSNQTSNWG
        GNLEILPHVAICCPIEGCTITLNMSGNIKVSHHAGVVAGSVVFSAANLTMEYNSYINTTSLGGAPPSQTSGTPFGYDGSGGGHGGRGASCFKSNQTSNWG
Subjt:  GNLEILPHVAICCPIEGCTITLNMSGNIKVSHHAGVVAGSVVFSAANLTMEYNSYINTTSLGGAPPSQTSGTPFGYDGSGGGHGGRGASCFKSNQTSNWG

Query:  GDVYAWSTLSEPWSYGSKGGGISDEKPYGGLGGGRVKLIIVGVLYLNGSILAEGGDGGSRGGGGSGGSIFVHAVKLRGNGTISAAGGKGWGGGGGGRISL
        GDVYAWSTLSEPWSYGSKGGGISDEKPYGGLGGGRVKLIIVGVLYLNGSILAEGGDGGSRGGGGSGGSIFVHAVKLRGNGTISAAGGKGWGGGGGGRISL
Subjt:  GDVYAWSTLSEPWSYGSKGGGISDEKPYGGLGGGRVKLIIVGVLYLNGSILAEGGDGGSRGGGGSGGSIFVHAVKLRGNGTISAAGGKGWGGGGGGRISL

Query:  DCYSIQEDIKVTVHGGISIGCSGNAGAAGTYFNADLLSLRVGNDNLTTETETPLLDFSTSPLWSNVFVENNAKALVPLLWTRVQVLSDVFFHMLPLLYRG
        DCYSIQEDIKVTVHGGISIGCSGNAGAAGTYFNADLLSLRVGNDNLTTETETPLLDFSTSPLWSNVFVENNAKALVPLLWTRVQV   +       LY G
Subjt:  DCYSIQEDIKVTVHGGISIGCSGNAGAAGTYFNADLLSLRVGNDNLTTETETPLLDFSTSPLWSNVFVENNAKALVPLLWTRVQVLSDVFFHMLPLLYRG

Query:  G
        G
Subjt:  G

XP_004148522.1 uncharacterized protein LOC101208985 isoform X1 [Cucumis sativus]7.1e-22096.76Show/hide
Query:  MCSSLSHWHIGQYIVWGCLYMSVISLNSLQYESGNVFSNNLQHEFRPVTGNGSRNISPILFSSSSHFVSCEDLGGVGSFNTTCLLNTNLSLYSDFYISGT
        MCSSLSHWHIGQYIVWGCLYMSVISLNSLQYESGNVFSN+LQHEFRPVTGNGSRNISPILFSSSSHFVSCEDLGGVGSFNTTCLLNTNLSLYSDFYISGT
Subjt:  MCSSLSHWHIGQYIVWGCLYMSVISLNSLQYESGNVFSNNLQHEFRPVTGNGSRNISPILFSSSSHFVSCEDLGGVGSFNTTCLLNTNLSLYSDFYISGT

Query:  GNLEILPHVAICCPIEGCTITLNMSGNIKVSHHAGVVAGSVVFSAANLTMEYNSYINTTSLGGAPPSQTSGTPFGYDGSGGGHGGRGASCFKSNQTSNWG
        GNLEILPHVAICCPIEGCTITLNMSGNIKVSHHAGVVAGSVVFSAANLTMEYNSYINTTSLGGAPPSQTSGTPFGYDGSGGGHGGRGASCFKSNQTSNWG
Subjt:  GNLEILPHVAICCPIEGCTITLNMSGNIKVSHHAGVVAGSVVFSAANLTMEYNSYINTTSLGGAPPSQTSGTPFGYDGSGGGHGGRGASCFKSNQTSNWG

Query:  GDVYAWSTLSEPWSYGSKGGGISDEKPYGGLGGGRVKLIIVGVLYLNGSILAEGGDGGSRGGGGSGGSIFVHAVKLRGNGTISAAGGKGWGGGGGGRISL
        GDVYAWSTLSEPWSYGSKGGGISDEKPYGGLGGGRVKLIIVGVLYLNGSILAEGGDGGSRGGGGSGGSIFVHAVKLRGNGTISAAGGKGWGGGGGGRISL
Subjt:  GDVYAWSTLSEPWSYGSKGGGISDEKPYGGLGGGRVKLIIVGVLYLNGSILAEGGDGGSRGGGGSGGSIFVHAVKLRGNGTISAAGGKGWGGGGGGRISL

Query:  DCYSIQEDIKVTVHGGISIGCSGNAGAAGTYFNADLLSLRVGNDNLTTETETPLLDFSTSPLWSNVFVENNAKALVPLLWTRVQVLSDVFFHMLPLLYRG
        DCYSIQEDIKVTVHGGISIGCSGNAGAAGTYFNADLLSLRVGNDNLTTETETPLLDFSTSPLWSNVFVENNAKALVPLLWTRVQV   +       LY G
Subjt:  DCYSIQEDIKVTVHGGISIGCSGNAGAAGTYFNADLLSLRVGNDNLTTETETPLLDFSTSPLWSNVFVENNAKALVPLLWTRVQVLSDVFFHMLPLLYRG

Query:  G
        G
Subjt:  G

XP_016903549.1 PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein LOC103503496 [Cucumis melo]1.5e-21494.26Show/hide
Query:  MCSSLSHWHIGQYIVWGCLYMSVISLNSLQYESGNVFSNNLQHEFRPVTGNGSRNISPILFSSSSHFVSCEDLGGVGSFNTTCLLNTNLSLYSDFYISGT
        MCSSL HWHI QYIVWGCLYMSVI+LNSLQYESGNVF N+L+HEFRPVTGNGS+NISPILFSSS+HFVSCEDLGGVGSFNTTCLLNTNLSLYSDFYISGT
Subjt:  MCSSLSHWHIGQYIVWGCLYMSVISLNSLQYESGNVFSNNLQHEFRPVTGNGSRNISPILFSSSSHFVSCEDLGGVGSFNTTCLLNTNLSLYSDFYISGT

Query:  GNLEILPHVAICCPIEGCTITLNMSGNIKVSHHAGVVAGSVVFSAANLTMEYNSYINTTSLGGAPPSQTSGTPFGYDGSGGGHGGRGASCFKSNQTSNWG
        GNLEILPHV ICCPIEGCTITLNMSGNIKVSHHAGVVAGSVVF AANLTMEYNSYINTTSLGGAPPSQTSGTPFGYDGSGGGHGGRGASCFKSNQTSNWG
Subjt:  GNLEILPHVAICCPIEGCTITLNMSGNIKVSHHAGVVAGSVVFSAANLTMEYNSYINTTSLGGAPPSQTSGTPFGYDGSGGGHGGRGASCFKSNQTSNWG

Query:  GDVYAWSTLSEPWSYGSKGGGISDEKPYGGLGGGRVKLIIVGVLYLNGSILAEGGDGGSRGGGGSGGSIFVHAVKLRGNGTISAAGGKGWGGGGGGRISL
        GDVYAWSTLSEPWSYGSKGGGISDEKPYGGLGGGRVKLIIVGVLYLNGSILAEGGDGGSRGGGGSGGSIFVHAVKLRG+GTISAAGGKGWGGGGGGRISL
Subjt:  GDVYAWSTLSEPWSYGSKGGGISDEKPYGGLGGGRVKLIIVGVLYLNGSILAEGGDGGSRGGGGSGGSIFVHAVKLRGNGTISAAGGKGWGGGGGGRISL

Query:  DCYSIQEDIKVTVHGGISIGCSGNAGAAGTYFNADLLSLRVGNDNLTTETETPLLDFSTSPLWSNVFVENNAKALVPLLWTRVQVLSDVFFHMLPLLYRG
        DCYSIQEDIKVTVHGGISIGCSGNAGAAGTYFNADLLSLRVGNDNLTTETETPLLDFSTSPLWSNVFVENNAKALVPLLWTRVQV   +       LY G
Subjt:  DCYSIQEDIKVTVHGGISIGCSGNAGAAGTYFNADLLSLRVGNDNLTTETETPLLDFSTSPLWSNVFVENNAKALVPLLWTRVQVLSDVFFHMLPLLYRG

Query:  G
        G
Subjt:  G

XP_031737959.1 uncharacterized protein LOC101208985 isoform X2 [Cucumis sativus]6.6e-21896.51Show/hide
Query:  MCSSLSHWHIGQYIVWGCLYMSVISLNSLQYESGNVFSNNLQHEFRPVTGNGSRNISPILFSSSSHFVSCEDLGGVGSFNTTCLLNTNLSLYSDFYISGT
        MCSSLSHWHIGQYIVWGCLYMSVISLNSLQYESGNVFSN+LQHEFRPVTGNGSRNISPILFSSSSHFVSCEDLGGVGSFNTTCLLNTNLSLYSDFYISGT
Subjt:  MCSSLSHWHIGQYIVWGCLYMSVISLNSLQYESGNVFSNNLQHEFRPVTGNGSRNISPILFSSSSHFVSCEDLGGVGSFNTTCLLNTNLSLYSDFYISGT

Query:  GNLEILPHVAICCPIEGCTITLNMSGNIKVSHHAGVVAGSVVFSAANLTMEYNSYINTTSLGGAPPSQTSGTPFGYDGSGGGHGGRGASCFKSNQTSNWG
        GNLEILPHVAICCPIEGCTITLNMSGNIKVSHHAGVVAGSVVFSAANLTMEYNSYINTTSLGGAPPSQTSGTPFGYDGSGGGHGGRGASCFKSNQTSNWG
Subjt:  GNLEILPHVAICCPIEGCTITLNMSGNIKVSHHAGVVAGSVVFSAANLTMEYNSYINTTSLGGAPPSQTSGTPFGYDGSGGGHGGRGASCFKSNQTSNWG

Query:  GDVYAWSTLSEPWSYGSKGGGISDEKPYGGLGGGRVKLIIVGVLYLNGSILAEGGDGGSRGGGGSGGSIFVHAVKLRGNGTISAAGGKGWGGGGGGRISL
        GDVYAWSTLSEPWSYGSKGGGISDEKPYGGLGGGRVKLIIVGVLYLNGSILAEGGDGGSRGGGGSGGSIFVHAVKL GNGTISAAGGKGWGGGGGGRISL
Subjt:  GDVYAWSTLSEPWSYGSKGGGISDEKPYGGLGGGRVKLIIVGVLYLNGSILAEGGDGGSRGGGGSGGSIFVHAVKLRGNGTISAAGGKGWGGGGGGRISL

Query:  DCYSIQEDIKVTVHGGISIGCSGNAGAAGTYFNADLLSLRVGNDNLTTETETPLLDFSTSPLWSNVFVENNAKALVPLLWTRVQVLSDVFFHMLPLLYRG
        DCYSIQEDIKVTVHGGISIGCSGNAGAAGTYFNADLLSLRVGNDNLTTETETPLLDFSTSPLWSNVFVENNAKALVPLLWTRVQV   +       LY G
Subjt:  DCYSIQEDIKVTVHGGISIGCSGNAGAAGTYFNADLLSLRVGNDNLTTETETPLLDFSTSPLWSNVFVENNAKALVPLLWTRVQVLSDVFFHMLPLLYRG

Query:  G
        G
Subjt:  G

XP_031737960.1 uncharacterized protein LOC101208985 isoform X3 [Cucumis sativus]7.1e-22096.76Show/hide
Query:  MCSSLSHWHIGQYIVWGCLYMSVISLNSLQYESGNVFSNNLQHEFRPVTGNGSRNISPILFSSSSHFVSCEDLGGVGSFNTTCLLNTNLSLYSDFYISGT
        MCSSLSHWHIGQYIVWGCLYMSVISLNSLQYESGNVFSN+LQHEFRPVTGNGSRNISPILFSSSSHFVSCEDLGGVGSFNTTCLLNTNLSLYSDFYISGT
Subjt:  MCSSLSHWHIGQYIVWGCLYMSVISLNSLQYESGNVFSNNLQHEFRPVTGNGSRNISPILFSSSSHFVSCEDLGGVGSFNTTCLLNTNLSLYSDFYISGT

Query:  GNLEILPHVAICCPIEGCTITLNMSGNIKVSHHAGVVAGSVVFSAANLTMEYNSYINTTSLGGAPPSQTSGTPFGYDGSGGGHGGRGASCFKSNQTSNWG
        GNLEILPHVAICCPIEGCTITLNMSGNIKVSHHAGVVAGSVVFSAANLTMEYNSYINTTSLGGAPPSQTSGTPFGYDGSGGGHGGRGASCFKSNQTSNWG
Subjt:  GNLEILPHVAICCPIEGCTITLNMSGNIKVSHHAGVVAGSVVFSAANLTMEYNSYINTTSLGGAPPSQTSGTPFGYDGSGGGHGGRGASCFKSNQTSNWG

Query:  GDVYAWSTLSEPWSYGSKGGGISDEKPYGGLGGGRVKLIIVGVLYLNGSILAEGGDGGSRGGGGSGGSIFVHAVKLRGNGTISAAGGKGWGGGGGGRISL
        GDVYAWSTLSEPWSYGSKGGGISDEKPYGGLGGGRVKLIIVGVLYLNGSILAEGGDGGSRGGGGSGGSIFVHAVKLRGNGTISAAGGKGWGGGGGGRISL
Subjt:  GDVYAWSTLSEPWSYGSKGGGISDEKPYGGLGGGRVKLIIVGVLYLNGSILAEGGDGGSRGGGGSGGSIFVHAVKLRGNGTISAAGGKGWGGGGGGRISL

Query:  DCYSIQEDIKVTVHGGISIGCSGNAGAAGTYFNADLLSLRVGNDNLTTETETPLLDFSTSPLWSNVFVENNAKALVPLLWTRVQVLSDVFFHMLPLLYRG
        DCYSIQEDIKVTVHGGISIGCSGNAGAAGTYFNADLLSLRVGNDNLTTETETPLLDFSTSPLWSNVFVENNAKALVPLLWTRVQV   +       LY G
Subjt:  DCYSIQEDIKVTVHGGISIGCSGNAGAAGTYFNADLLSLRVGNDNLTTETETPLLDFSTSPLWSNVFVENNAKALVPLLWTRVQVLSDVFFHMLPLLYRG

Query:  G
        G
Subjt:  G

TrEMBL top hitse value%identityAlignment
A0A0A0LKA9 Uncharacterized protein1.8e-23299.51Show/hide
Query:  MCSSLSHWHIGQYIVWGCLYMSVISLNSLQYESGNVFSNNLQHEFRPVTGNGSRNISPILFSSSSHFVSCEDLGGVGSFNTTCLLNTNLSLYSDFYISGT
        MCSSLSHWHIGQYIVWGCLYMSVISLNSLQYESGNVFSN+LQHEFRPVTGNGSRNISPILFSSSSHFVSCEDLGGVGSFNTTCLLNTNLSLYSDFYISGT
Subjt:  MCSSLSHWHIGQYIVWGCLYMSVISLNSLQYESGNVFSNNLQHEFRPVTGNGSRNISPILFSSSSHFVSCEDLGGVGSFNTTCLLNTNLSLYSDFYISGT

Query:  GNLEILPHVAICCPIEGCTITLNMSGNIKVSHHAGVVAGSVVFSAANLTMEYNSYINTTSLGGAPPSQTSGTPFGYDGSGGGHGGRGASCFKSNQTSNWG
        GNLEILPHVAICCPIEGCTITLNMSGNIKVSHHAGVVAGSVVFSAANLTMEYNSYINTTSLGGAPPSQTSGTPFGYDGSGGGHGGRGASCFKSNQTSNWG
Subjt:  GNLEILPHVAICCPIEGCTITLNMSGNIKVSHHAGVVAGSVVFSAANLTMEYNSYINTTSLGGAPPSQTSGTPFGYDGSGGGHGGRGASCFKSNQTSNWG

Query:  GDVYAWSTLSEPWSYGSKGGGISDEKPYGGLGGGRVKLIIVGVLYLNGSILAEGGDGGSRGGGGSGGSIFVHAVKLRGNGTISAAGGKGWGGGGGGRISL
        GDVYAWSTLSEPWSYGSKGGGISDEKPYGGLGGGRVKLIIVGVLYLNGSILAEGGDGGSRGGGGSGGSIFVHAVKLRGNGTISAAGGKGWGGGGGGRISL
Subjt:  GDVYAWSTLSEPWSYGSKGGGISDEKPYGGLGGGRVKLIIVGVLYLNGSILAEGGDGGSRGGGGSGGSIFVHAVKLRGNGTISAAGGKGWGGGGGGRISL

Query:  DCYSIQEDIKVTVHGGISIGCSGNAGAAGTYFNADLLSLRVGNDNLTTETETPLLDFSTSPLWSNVFVENNAKALVPLLWTRVQVLSDVFFHMLPLLYRG
        DCYSIQEDIKVTVHGGISIGCSGNAGAAGTYFNADLLSLRVGNDNLTTETETPLLDFSTSPLWSNVFVENNAKALVPLLWTRVQVLSDVFF MLPLLYRG
Subjt:  DCYSIQEDIKVTVHGGISIGCSGNAGAAGTYFNADLLSLRVGNDNLTTETETPLLDFSTSPLWSNVFVENNAKALVPLLWTRVQVLSDVFFHMLPLLYRG

Query:  GKCLAPL
        GKCLAPL
Subjt:  GKCLAPL

A0A1S4E5Q0 LOW QUALITY PROTEIN: uncharacterized protein LOC1035034967.4e-21594.26Show/hide
Query:  MCSSLSHWHIGQYIVWGCLYMSVISLNSLQYESGNVFSNNLQHEFRPVTGNGSRNISPILFSSSSHFVSCEDLGGVGSFNTTCLLNTNLSLYSDFYISGT
        MCSSL HWHI QYIVWGCLYMSVI+LNSLQYESGNVF N+L+HEFRPVTGNGS+NISPILFSSS+HFVSCEDLGGVGSFNTTCLLNTNLSLYSDFYISGT
Subjt:  MCSSLSHWHIGQYIVWGCLYMSVISLNSLQYESGNVFSNNLQHEFRPVTGNGSRNISPILFSSSSHFVSCEDLGGVGSFNTTCLLNTNLSLYSDFYISGT

Query:  GNLEILPHVAICCPIEGCTITLNMSGNIKVSHHAGVVAGSVVFSAANLTMEYNSYINTTSLGGAPPSQTSGTPFGYDGSGGGHGGRGASCFKSNQTSNWG
        GNLEILPHV ICCPIEGCTITLNMSGNIKVSHHAGVVAGSVVF AANLTMEYNSYINTTSLGGAPPSQTSGTPFGYDGSGGGHGGRGASCFKSNQTSNWG
Subjt:  GNLEILPHVAICCPIEGCTITLNMSGNIKVSHHAGVVAGSVVFSAANLTMEYNSYINTTSLGGAPPSQTSGTPFGYDGSGGGHGGRGASCFKSNQTSNWG

Query:  GDVYAWSTLSEPWSYGSKGGGISDEKPYGGLGGGRVKLIIVGVLYLNGSILAEGGDGGSRGGGGSGGSIFVHAVKLRGNGTISAAGGKGWGGGGGGRISL
        GDVYAWSTLSEPWSYGSKGGGISDEKPYGGLGGGRVKLIIVGVLYLNGSILAEGGDGGSRGGGGSGGSIFVHAVKLRG+GTISAAGGKGWGGGGGGRISL
Subjt:  GDVYAWSTLSEPWSYGSKGGGISDEKPYGGLGGGRVKLIIVGVLYLNGSILAEGGDGGSRGGGGSGGSIFVHAVKLRGNGTISAAGGKGWGGGGGGRISL

Query:  DCYSIQEDIKVTVHGGISIGCSGNAGAAGTYFNADLLSLRVGNDNLTTETETPLLDFSTSPLWSNVFVENNAKALVPLLWTRVQVLSDVFFHMLPLLYRG
        DCYSIQEDIKVTVHGGISIGCSGNAGAAGTYFNADLLSLRVGNDNLTTETETPLLDFSTSPLWSNVFVENNAKALVPLLWTRVQV   +       LY G
Subjt:  DCYSIQEDIKVTVHGGISIGCSGNAGAAGTYFNADLLSLRVGNDNLTTETETPLLDFSTSPLWSNVFVENNAKALVPLLWTRVQVLSDVFFHMLPLLYRG

Query:  G
        G
Subjt:  G

A0A6J1EYB2 uncharacterized protein LOC111439602 isoform X21.3e-19586.78Show/hide
Query:  MCSSLSHWHIGQYIVWGCLYMSVISLNSLQYESGNVFSNNLQHEFRPVTGNGSRNISPILFSSSSHFVSCEDLGGVGSFNTTCLLNTNLSLYSDFYISGT
        MCSSL H HIG YI+ GCL MS + LNSLQYESGN FSN+ QHEFRPVTGNGS+N SP  FSSS+HFVSCEDL GVGSFNTTCLLNTNLSL SDFY+SGT
Subjt:  MCSSLSHWHIGQYIVWGCLYMSVISLNSLQYESGNVFSNNLQHEFRPVTGNGSRNISPILFSSSSHFVSCEDLGGVGSFNTTCLLNTNLSLYSDFYISGT

Query:  GNLEILPHVAICCPIEGCTITLNMSGNIKVSHHAGVVAGSVVFSAANLTMEYNSYINTTSLGGAPPSQTSGTPFGYDGSGGGHGGRGASCFKSNQTSNWG
        GNLEILPHV ICCPIEGC+ITLNMSGNIKVS HA VVAGSVVFSAAN+T+EYNSYINTT+LGGAPP+QTSGTP G+DGSGGGHGGRGASC KSNQTSNWG
Subjt:  GNLEILPHVAICCPIEGCTITLNMSGNIKVSHHAGVVAGSVVFSAANLTMEYNSYINTTSLGGAPPSQTSGTPFGYDGSGGGHGGRGASCFKSNQTSNWG

Query:  GDVYAWSTLSEPWSYGSKGGGISDEKPYGGLGGGRVKLIIVGVLYLNGSILAEGGDGGSRGGGGSGGSIFVHAVKLRGNGTISAAGGKGWGGGGGGRISL
        GDVYAWSTLS+PWSYGSKGGGIS+EKPYGGLGGGRVKL+IV VLYLNGSILAEGGDGGS GGGGSGGSIFVHAVKL+G+GTISAAGGKG GGGGGGRISL
Subjt:  GDVYAWSTLSEPWSYGSKGGGISDEKPYGGLGGGRVKLIIVGVLYLNGSILAEGGDGGSRGGGGSGGSIFVHAVKLRGNGTISAAGGKGWGGGGGGRISL

Query:  DCYSIQEDIKVTVHGGISIGCSGNAGAAGTYFNADLLSLRVGNDNLTTETETPLLDFSTSPLWSNVFVENNAKALVPLLWTRVQVLSDVFFHMLPLLYRG
        DCYSIQEDIKVTVHGG+SIGC GNAGAAGTYFNADLLSLRVGNDN+TTETETPLLDFSTSPLWSNVFVENNAKALVPLLWTRVQV   +       LYRG
Subjt:  DCYSIQEDIKVTVHGGISIGCSGNAGAAGTYFNADLLSLRVGNDNLTTETETPLLDFSTSPLWSNVFVENNAKALVPLLWTRVQVLSDVFFHMLPLLYRG

Query:  G
        G
Subjt:  G

A0A6J1F3N5 uncharacterized protein LOC111439602 isoform X11.3e-19586.78Show/hide
Query:  MCSSLSHWHIGQYIVWGCLYMSVISLNSLQYESGNVFSNNLQHEFRPVTGNGSRNISPILFSSSSHFVSCEDLGGVGSFNTTCLLNTNLSLYSDFYISGT
        MCSSL H HIG YI+ GCL MS + LNSLQYESGN FSN+ QHEFRPVTGNGS+N SP  FSSS+HFVSCEDL GVGSFNTTCLLNTNLSL SDFY+SGT
Subjt:  MCSSLSHWHIGQYIVWGCLYMSVISLNSLQYESGNVFSNNLQHEFRPVTGNGSRNISPILFSSSSHFVSCEDLGGVGSFNTTCLLNTNLSLYSDFYISGT

Query:  GNLEILPHVAICCPIEGCTITLNMSGNIKVSHHAGVVAGSVVFSAANLTMEYNSYINTTSLGGAPPSQTSGTPFGYDGSGGGHGGRGASCFKSNQTSNWG
        GNLEILPHV ICCPIEGC+ITLNMSGNIKVS HA VVAGSVVFSAAN+T+EYNSYINTT+LGGAPP+QTSGTP G+DGSGGGHGGRGASC KSNQTSNWG
Subjt:  GNLEILPHVAICCPIEGCTITLNMSGNIKVSHHAGVVAGSVVFSAANLTMEYNSYINTTSLGGAPPSQTSGTPFGYDGSGGGHGGRGASCFKSNQTSNWG

Query:  GDVYAWSTLSEPWSYGSKGGGISDEKPYGGLGGGRVKLIIVGVLYLNGSILAEGGDGGSRGGGGSGGSIFVHAVKLRGNGTISAAGGKGWGGGGGGRISL
        GDVYAWSTLS+PWSYGSKGGGIS+EKPYGGLGGGRVKL+IV VLYLNGSILAEGGDGGS GGGGSGGSIFVHAVKL+G+GTISAAGGKG GGGGGGRISL
Subjt:  GDVYAWSTLSEPWSYGSKGGGISDEKPYGGLGGGRVKLIIVGVLYLNGSILAEGGDGGSRGGGGSGGSIFVHAVKLRGNGTISAAGGKGWGGGGGGRISL

Query:  DCYSIQEDIKVTVHGGISIGCSGNAGAAGTYFNADLLSLRVGNDNLTTETETPLLDFSTSPLWSNVFVENNAKALVPLLWTRVQVLSDVFFHMLPLLYRG
        DCYSIQEDIKVTVHGG+SIGC GNAGAAGTYFNADLLSLRVGNDN+TTETETPLLDFSTSPLWSNVFVENNAKALVPLLWTRVQV   +       LYRG
Subjt:  DCYSIQEDIKVTVHGGISIGCSGNAGAAGTYFNADLLSLRVGNDNLTTETETPLLDFSTSPLWSNVFVENNAKALVPLLWTRVQVLSDVFFHMLPLLYRG

Query:  G
        G
Subjt:  G

A0A6J1IDZ5 uncharacterized protein LOC111471805 isoform X16.1e-19385.79Show/hide
Query:  MCSSLSHWHIGQYIVWGCLYMSVISLNSLQYESGNVFSNNLQHEFRPVTGNGSRNISPILFSSSSHFVSCEDLGGVGSFNTTCLLNTNLSLYSDFYISGT
        MCSSL H H+G YI+ GCL MS + LNSLQYESGN FSN+ Q+EF PV GNGS+N SP  F SS+HFVSCEDLGGVGSFNTTCLLNTNLSL SDFY+SGT
Subjt:  MCSSLSHWHIGQYIVWGCLYMSVISLNSLQYESGNVFSNNLQHEFRPVTGNGSRNISPILFSSSSHFVSCEDLGGVGSFNTTCLLNTNLSLYSDFYISGT

Query:  GNLEILPHVAICCPIEGCTITLNMSGNIKVSHHAGVVAGSVVFSAANLTMEYNSYINTTSLGGAPPSQTSGTPFGYDGSGGGHGGRGASCFKSNQTSNWG
        GNLEILPHV ICCPIEGC+ITLNMSGNIKVS HA VVAGSVVFSAAN+T+EYNSYINTT+LGGAPP+QTSGTP G+DGSGGGHGGRGASC KSNQTSNWG
Subjt:  GNLEILPHVAICCPIEGCTITLNMSGNIKVSHHAGVVAGSVVFSAANLTMEYNSYINTTSLGGAPPSQTSGTPFGYDGSGGGHGGRGASCFKSNQTSNWG

Query:  GDVYAWSTLSEPWSYGSKGGGISDEKPYGGLGGGRVKLIIVGVLYLNGSILAEGGDGGSRGGGGSGGSIFVHAVKLRGNGTISAAGGKGWGGGGGGRISL
        GDVYAWSTLS+PWSYGSKGGGIS+EKPYGGLGGGRVKL+IV VLYLNGSILAEGGDGGS GGGGSGGSIFVHAVKL+G+GTISAAGGKG GGGGGGRISL
Subjt:  GDVYAWSTLSEPWSYGSKGGGISDEKPYGGLGGGRVKLIIVGVLYLNGSILAEGGDGGSRGGGGSGGSIFVHAVKLRGNGTISAAGGKGWGGGGGGRISL

Query:  DCYSIQEDIKVTVHGGISIGCSGNAGAAGTYFNADLLSLRVGNDNLTTETETPLLDFSTSPLWSNVFVENNAKALVPLLWTRVQVLSDVFFHMLPLLYRG
        DCYSIQEDIKVTVHGG+SIGC GNAGAAGTYFNADLLSLRVGNDN+TTETETPLLDFSTSPLWSNVFVENNAKALVPLLWTRVQV   +       LYRG
Subjt:  DCYSIQEDIKVTVHGGISIGCSGNAGAAGTYFNADLLSLRVGNDNLTTETETPLLDFSTSPLWSNVFVENNAKALVPLLWTRVQVLSDVFFHMLPLLYRG

Query:  G
        G
Subjt:  G

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT4G32920.1 glycine-rich protein5.1e-8349.13Show/hide
Query:  FRPVTGNGSRNISPILFSSSSHFVSC-EDLGGVGSFNTTCLLNTNLSLYSDFYISGTGNLEILPHVAICCPIEGCTITLNMSGNIKVSHHAGVVAGSVVF
        F P++ + +   SP    SS   VSC +DLGGVGS ++TC L  +L+L  D  I+G GNL +LP V + C   GC+I++N+SGN  ++ ++ V+AG+   
Subjt:  FRPVTGNGSRNISPILFSSSSHFVSC-EDLGGVGSFNTTCLLNTNLSLYSDFYISGTGNLEILPHVAICCPIEGCTITLNMSGNIKVSHHAGVVAGSVVF

Query:  SAANLTMEYNSYINTTSLGGAPPSQTSGTPFGYDGSGGGHGGRGASCFKSNQT----SNWGGDVYAWSTLSEPWSYGSKGGGISDEKPYGGLGGGRVKLI
        +A N     +S ++TT L G PP  TSGTP G +G+GGG+GGRGA C     T      +GGDVY WS+L +P  YGS+GG  S+E  YGG GGG V + 
Subjt:  SAANLTMEYNSYINTTSLGGAPPSQTSGTPFGYDGSGGGHGGRGASCFKSNQT----SNWGGDVYAWSTLSEPWSYGSKGGGISDEKPYGGLGGGRVKLI

Query:  IVGVLYLNGSILAEGGDGGSRGGGGSGGSIFVHAVKLRGNGTISAAGGKGWGGGGGGRISLDCYSIQEDIKVTVHGGISIGCSGNAGAAGTYFNADLLSL
        I+G + LNGS+LA+G  GG +GGGGSGGSIFV A K+ GNG +SA+GG G+ GGGGGR+S+D YS   D K+  +GG S GC  NAGAAGT ++    SL
Subjt:  IVGVLYLNGSILAEGGDGGSRGGGGSGGSIFVHAVKLRGNGTISAAGGKGWGGGGGGRISLDCYSIQEDIKVTVHGGISIGCSGNAGAAGTYFNADLLSL

Query:  RVGNDNLTTETETPLLDFSTSPLWSNVFVENNAKALVPLLWTRVQV
         + N N TT T+T LL+F    L++N+++ N AK  VPL W+RVQV
Subjt:  RVGNDNLTTETETPLLDFSTSPLWSNVFVENNAKALVPLLWTRVQV

AT4G32920.2 glycine-rich protein5.1e-8349.13Show/hide
Query:  FRPVTGNGSRNISPILFSSSSHFVSC-EDLGGVGSFNTTCLLNTNLSLYSDFYISGTGNLEILPHVAICCPIEGCTITLNMSGNIKVSHHAGVVAGSVVF
        F P++ + +   SP    SS   VSC +DLGGVGS ++TC L  +L+L  D  I+G GNL +LP V + C   GC+I++N+SGN  ++ ++ V+AG+   
Subjt:  FRPVTGNGSRNISPILFSSSSHFVSC-EDLGGVGSFNTTCLLNTNLSLYSDFYISGTGNLEILPHVAICCPIEGCTITLNMSGNIKVSHHAGVVAGSVVF

Query:  SAANLTMEYNSYINTTSLGGAPPSQTSGTPFGYDGSGGGHGGRGASCFKSNQT----SNWGGDVYAWSTLSEPWSYGSKGGGISDEKPYGGLGGGRVKLI
        +A N     +S ++TT L G PP  TSGTP G +G+GGG+GGRGA C     T      +GGDVY WS+L +P  YGS+GG  S+E  YGG GGG V + 
Subjt:  SAANLTMEYNSYINTTSLGGAPPSQTSGTPFGYDGSGGGHGGRGASCFKSNQT----SNWGGDVYAWSTLSEPWSYGSKGGGISDEKPYGGLGGGRVKLI

Query:  IVGVLYLNGSILAEGGDGGSRGGGGSGGSIFVHAVKLRGNGTISAAGGKGWGGGGGGRISLDCYSIQEDIKVTVHGGISIGCSGNAGAAGTYFNADLLSL
        I+G + LNGS+LA+G  GG +GGGGSGGSIFV A K+ GNG +SA+GG G+ GGGGGR+S+D YS   D K+  +GG S GC  NAGAAGT ++    SL
Subjt:  IVGVLYLNGSILAEGGDGGSRGGGGSGGSIFVHAVKLRGNGTISAAGGKGWGGGGGGRISLDCYSIQEDIKVTVHGGISIGCSGNAGAAGTYFNADLLSL

Query:  RVGNDNLTTETETPLLDFSTSPLWSNVFVENNAKALVPLLWTRVQV
         + N N TT T+T LL+F    L++N+++ N AK  VPL W+RVQV
Subjt:  RVGNDNLTTETETPLLDFSTSPLWSNVFVENNAKALVPLLWTRVQV

AT5G11700.1 LOCATED IN: vacuole2.9e-9455.21Show/hide
Query:  VSC-EDLGGVGSFNTTCLLNTNLSLYSDFYISGTGNLEILPHVAICCPIEGCTITLNMSGNIKVSHHAGVVAGSVVFSAANLTMEYNSYINTTSLGGAPP
        VSC EDLGGVG  +TTC +  +L+L  D YI+G GN  ILP V   CPI GC+I +N+SGN  +   + +VAG++  +A N +    S +NTT L G+PP
Subjt:  VSC-EDLGGVGSFNTTCLLNTNLSLYSDFYISGTGNLEILPHVAICCPIEGCTITLNMSGNIKVSHHAGVVAGSVVFSAANLTMEYNSYINTTSLGGAPP

Query:  SQTSGTPFGYDGSGGGHGGRGASCF---KSNQTSNWGGDVYAWSTLSEPWSYGSKGGGISDEKPYGGLGGGRVKLIIVGVLYLNGSILAEGGDGGSRGGG
         QTSGTP G DG+GGGHGGRGA C    K      WGGD Y+WSTL +PWSYGSKGG  S E  YGG GGG+VK+ I+ +L +NGS+LA GG GG++GGG
Subjt:  SQTSGTPFGYDGSGGGHGGRGASCF---KSNQTSNWGGDVYAWSTLSEPWSYGSKGGGISDEKPYGGLGGGRVKLIIVGVLYLNGSILAEGGDGGSRGGG

Query:  GSGGSIFVHAVKLRGNGTISAAGGKGWGGGGGGRISLDCYSIQEDIKVTVHGGISIGCSGNAGAAGTYFNADLLSLRVGNDNLTTETETPLLDFSTSPLW
        GSGGSI++ A K+ G G ISA GG G+GGGGGGR+S+D +S  +D K+ VHGG SIGC  N+GAAGT ++A   SL V N N TT+T T LL+F   PLW
Subjt:  GSGGSIFVHAVKLRGNGTISAAGGKGWGGGGGGRISLDCYSIQEDIKVTVHGGISIGCSGNAGAAGTYFNADLLSLRVGNDNLTTETETPLLDFSTSPLW

Query:  SNVFVENNAKALVPLLWTRVQVLSDV
        +NV++++ A+A  PLLW+RVQV   +
Subjt:  SNVFVENNAKALVPLLWTRVQVLSDV

AT5G11700.2 BEST Arabidopsis thaliana protein match is: glycine-rich protein (TAIR:AT4G32920.3)2.9e-9455.21Show/hide
Query:  VSC-EDLGGVGSFNTTCLLNTNLSLYSDFYISGTGNLEILPHVAICCPIEGCTITLNMSGNIKVSHHAGVVAGSVVFSAANLTMEYNSYINTTSLGGAPP
        VSC EDLGGVG  +TTC +  +L+L  D YI+G GN  ILP V   CPI GC+I +N+SGN  +   + +VAG++  +A N +    S +NTT L G+PP
Subjt:  VSC-EDLGGVGSFNTTCLLNTNLSLYSDFYISGTGNLEILPHVAICCPIEGCTITLNMSGNIKVSHHAGVVAGSVVFSAANLTMEYNSYINTTSLGGAPP

Query:  SQTSGTPFGYDGSGGGHGGRGASCF---KSNQTSNWGGDVYAWSTLSEPWSYGSKGGGISDEKPYGGLGGGRVKLIIVGVLYLNGSILAEGGDGGSRGGG
         QTSGTP G DG+GGGHGGRGA C    K      WGGD Y+WSTL +PWSYGSKGG  S E  YGG GGG+VK+ I+ +L +NGS+LA GG GG++GGG
Subjt:  SQTSGTPFGYDGSGGGHGGRGASCF---KSNQTSNWGGDVYAWSTLSEPWSYGSKGGGISDEKPYGGLGGGRVKLIIVGVLYLNGSILAEGGDGGSRGGG

Query:  GSGGSIFVHAVKLRGNGTISAAGGKGWGGGGGGRISLDCYSIQEDIKVTVHGGISIGCSGNAGAAGTYFNADLLSLRVGNDNLTTETETPLLDFSTSPLW
        GSGGSI++ A K+ G G ISA GG G+GGGGGGR+S+D +S  +D K+ VHGG SIGC  N+GAAGT ++A   SL V N N TT+T T LL+F   PLW
Subjt:  GSGGSIFVHAVKLRGNGTISAAGGKGWGGGGGGRISLDCYSIQEDIKVTVHGGISIGCSGNAGAAGTYFNADLLSLRVGNDNLTTETETPLLDFSTSPLW

Query:  SNVFVENNAKALVPLLWTRVQVLSDV
        +NV++++ A+A  PLLW+RVQV   +
Subjt:  SNVFVENNAKALVPLLWTRVQVLSDV

AT5G47020.1 unknown protein4.0e-12867.85Show/hide
Query:  SSSSHFVSCEDLGGVGSFNTTCLLNTNLSLYSDFYISGTGNLEILPHVAICCPIEGCTITLNMSGNIKVSHHAGVVAGSVVFSAANLTMEYNSYINTTSL
        +SS   V+C+DL GVGS NTTC LN+NL   SD Y+ GTGNL IL HV + CP+EGC IT N+SG I +   A +VAGSVVFSA NLTM+ NS I TT+L
Subjt:  SSSSHFVSCEDLGGVGSFNTTCLLNTNLSLYSDFYISGTGNLEILPHVAICCPIEGCTITLNMSGNIKVSHHAGVVAGSVVFSAANLTMEYNSYINTTSL

Query:  GGAPPSQTSGTPFGYDGSGGGHGGRGASCFKSNQTSNWGGDVYAWSTLSEPWSYGSKGGGISDEKPYGGLGGGRVKLIIVGVLYLNGSILAEGGDGGSRG
         G PPSQTSGTP+G DG+GGGHGGRGASC KSN+T+ WGGDVYAWS+L +PWSYGS+GG     K   G GGGRVKLI+   +++NG++ A+GGD G  G
Subjt:  GGAPPSQTSGTPFGYDGSGGGHGGRGASCFKSNQTSNWGGDVYAWSTLSEPWSYGSKGGGISDEKPYGGLGGGRVKLIIVGVLYLNGSILAEGGDGGSRG

Query:  GGGSGGSIFVHAVKLRGNGTISAAGGKGWGGGGGGRISLDCYSIQEDIKVTVHGGISIGCSGNAGAAGTYFNADLLSLRVGNDNLTTETETPLLDFSTSP
        GGGSGGSI + AVKL+G G ISA+GG+GWGGGGGGRISLDCYSIQED+KV VHGG SIGC  NAGAAGTYFNA+L+SLRVGNDN+TTETETPLLDF T P
Subjt:  GGGSGGSIFVHAVKLRGNGTISAAGGKGWGGGGGGRISLDCYSIQEDIKVTVHGGISIGCSGNAGAAGTYFNADLLSLRVGNDNLTTETETPLLDFSTSP

Query:  LWSNVFVENNAKALVPLLWTRVQVLSDVFFHMLPLLYRG
        LWSN++V+NNAK LVPLLWTR+QV   +       LYRG
Subjt:  LWSNVFVENNAKALVPLLWTRVQVLSDVFFHMLPLLYRG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTGTTCTTCCCTGTCGCACTGGCACATAGGACAGTATATTGTTTGGGGATGTTTATACATGTCTGTCATCTCCCTCAATTCACTTCAGTATGAAAGTGGAAATGTTTT
TTCGAACAATTTGCAGCATGAATTCCGGCCAGTGACAGGCAATGGCTCTCGGAACATTAGTCCAATACTTTTTTCTTCTTCAAGTCATTTTGTATCATGCGAGGACCTGG
GAGGTGTTGGGTCATTTAACACAACCTGCCTGTTAAACACAAACTTGTCTTTATATTCTGATTTCTACATATCTGGAACGGGGAATCTGGAAATACTTCCACATGTTGCG
ATTTGCTGCCCCATAGAAGGTTGTACGATTACTCTTAATATGTCTGGCAATATCAAAGTCAGTCACCACGCTGGCGTTGTTGCTGGTTCTGTGGTTTTTTCTGCTGCTAA
TCTGACAATGGAATACAATTCGTACATAAATACTACTTCCCTCGGTGGAGCACCTCCTTCTCAAACTAGTGGCACCCCATTTGGTTATGATGGATCTGGTGGAGGTCATG
GTGGCCGAGGGGCCTCCTGTTTTAAAAGTAATCAGACAAGCAATTGGGGTGGTGATGTGTATGCTTGGTCTACATTGTCTGAACCATGGAGCTATGGGAGCAAGGGTGGT
GGCATATCAGATGAAAAGCCGTATGGAGGGCTTGGTGGGGGACGTGTGAAACTTATAATTGTAGGTGTATTATATCTAAATGGTTCTATCCTAGCAGAAGGTGGAGACGG
AGGATCAAGAGGTGGAGGTGGATCTGGTGGAAGTATCTTTGTGCATGCTGTAAAGTTAAGGGGAAATGGAACTATATCTGCTGCAGGCGGGAAGGGATGGGGTGGAGGTG
GTGGGGGAAGAATTTCTCTGGATTGCTATAGCATTCAAGAAGATATTAAAGTTACTGTACATGGTGGTATAAGTATTGGGTGTTCTGGTAATGCTGGGGCAGCTGGCACT
TACTTCAATGCTGATTTGCTTAGTTTAAGAGTTGGAAATGACAATCTCACGACTGAAACTGAAACCCCTTTGCTAGACTTTTCAACTAGTCCCCTGTGGTCAAACGTGTT
TGTGGAGAATAATGCAAAAGCATTGGTCCCATTGCTGTGGACAAGAGTCCAGGTATTGAGTGACGTTTTCTTCCATATGCTTCCCTTGTTGTATAGAGGTGGAAAATGCC
TTGCTCCCTTATAA
mRNA sequenceShow/hide mRNA sequence
GAGTATAATTTTTTTTAATTTATGTGAAAGCATTGGAATAGTAATTTACAAAATAAGATCGATCCAAATTCTTCTTTCAAAGACGCCGGAGGGATGAAACTGAAATTTCT
GGTCTACATCAGACGGATGATGATTGTTGTAAGATTGTTTCGGCCGCTTTGACTCTCGATGAGGTTATATAGATATTAATGTTTCTGATGCACCAGAAAGAGCAAATGCC
ACAAAGACTATTCTACTTCTGAGTTAATTTTGCTCGATAAATTCATCTGGAGTAATATGTGCGAGTAGCACTCTGCAAAATAACTTGCAGTGATGTGTTCTTCCCTGTCG
CACTGGCACATAGGACAGTATATTGTTTGGGGATGTTTATACATGTCTGTCATCTCCCTCAATTCACTTCAGTATGAAAGTGGAAATGTTTTTTCGAACAATTTGCAGCA
TGAATTCCGGCCAGTGACAGGCAATGGCTCTCGGAACATTAGTCCAATACTTTTTTCTTCTTCAAGTCATTTTGTATCATGCGAGGACCTGGGAGGTGTTGGGTCATTTA
ACACAACCTGCCTGTTAAACACAAACTTGTCTTTATATTCTGATTTCTACATATCTGGAACGGGGAATCTGGAAATACTTCCACATGTTGCGATTTGCTGCCCCATAGAA
GGTTGTACGATTACTCTTAATATGTCTGGCAATATCAAAGTCAGTCACCACGCTGGCGTTGTTGCTGGTTCTGTGGTTTTTTCTGCTGCTAATCTGACAATGGAATACAA
TTCGTACATAAATACTACTTCCCTCGGTGGAGCACCTCCTTCTCAAACTAGTGGCACCCCATTTGGTTATGATGGATCTGGTGGAGGTCATGGTGGCCGAGGGGCCTCCT
GTTTTAAAAGTAATCAGACAAGCAATTGGGGTGGTGATGTGTATGCTTGGTCTACATTGTCTGAACCATGGAGCTATGGGAGCAAGGGTGGTGGCATATCAGATGAAAAG
CCGTATGGAGGGCTTGGTGGGGGACGTGTGAAACTTATAATTGTAGGTGTATTATATCTAAATGGTTCTATCCTAGCAGAAGGTGGAGACGGAGGATCAAGAGGTGGAGG
TGGATCTGGTGGAAGTATCTTTGTGCATGCTGTAAAGTTAAGGGGAAATGGAACTATATCTGCTGCAGGCGGGAAGGGATGGGGTGGAGGTGGTGGGGGAAGAATTTCTC
TGGATTGCTATAGCATTCAAGAAGATATTAAAGTTACTGTACATGGTGGTATAAGTATTGGGTGTTCTGGTAATGCTGGGGCAGCTGGCACTTACTTCAATGCTGATTTG
CTTAGTTTAAGAGTTGGAAATGACAATCTCACGACTGAAACTGAAACCCCTTTGCTAGACTTTTCAACTAGTCCCCTGTGGTCAAACGTGTTTGTGGAGAATAATGCAAA
AGCATTGGTCCCATTGCTGTGGACAAGAGTCCAGGTATTGAGTGACGTTTTCTTCCATATGCTTCCCTTGTTGTATAGAGGTGGAAAATGCCTTGCTCCCTTATAACTTC
AAATATTTGTTGATTAATTGATGAAATGCATATATTACATGTTTATCAATCAATTGTTGAAAAAAACATTTGAAATGCATATAATGATTTCATATTTATAAGAGATGTTC
TTTTCTAATTCAGAAGGTGCCGAGACAAGAAAACAGAACCTCTACTAAAAAGGGAC
Protein sequenceShow/hide protein sequence
MCSSLSHWHIGQYIVWGCLYMSVISLNSLQYESGNVFSNNLQHEFRPVTGNGSRNISPILFSSSSHFVSCEDLGGVGSFNTTCLLNTNLSLYSDFYISGTGNLEILPHVA
ICCPIEGCTITLNMSGNIKVSHHAGVVAGSVVFSAANLTMEYNSYINTTSLGGAPPSQTSGTPFGYDGSGGGHGGRGASCFKSNQTSNWGGDVYAWSTLSEPWSYGSKGG
GISDEKPYGGLGGGRVKLIIVGVLYLNGSILAEGGDGGSRGGGGSGGSIFVHAVKLRGNGTISAAGGKGWGGGGGGRISLDCYSIQEDIKVTVHGGISIGCSGNAGAAGT
YFNADLLSLRVGNDNLTTETETPLLDFSTSPLWSNVFVENNAKALVPLLWTRVQVLSDVFFHMLPLLYRGGKCLAPL