; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr027312 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr027312
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionBEST Arabidopsis thaliana protein match is: FRIGIDA interacting protein 1 .
Genome locationtig00153048:2983018..2997843
RNA-Seq ExpressionSgr027312
SyntenySgr027312
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0005765 - lysosomal membrane (cellular component)
GO:0005770 - late endosome (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0008234 - cysteine-type peptidase activity (molecular function)
InterPro domainsIPR000668 - Peptidase C1A, papain C-terminal
IPR029399 - TMEM192 family
IPR038765 - Papain-like cysteine peptidase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0055774.1 uncharacterized protein E6C27_scaffold181G001600 [Cucumis melo var. makuwa]1.8e-16595.41Show/hide
Query:  MAAERHASSRATSSEDNAMFLDILHEAPLFGHRKPARTVGSIIYCFVLASYAALAIGAPWIFHPIKHLVQPFLCSCDVVLLILTGIFQQYLVYQVQKIRL
        MAAERHASSRATSSEDNAMFLDILHEAPLFGHRKPARTVGSIIYCFVLASYAALAIGAPWIFHPIKHLV+P LCSC VVLL+LTGIFQQYLVYQV KIRL
Subjt:  MAAERHASSRATSSEDNAMFLDILHEAPLFGHRKPARTVGSIIYCFVLASYAALAIGAPWIFHPIKHLVQPFLCSCDVVLLILTGIFQQYLVYQVQKIRL

Query:  QGYYSFSQKLKHIVRLPFAVTAYGTAALLLVMVWEPQIRALSIPIILRMIMIIEAVCAGSFMIMYIGYVQKYNSLNSQPDVLKSLYSPLQQSSSLEDLRY
        QGYYSFSQKLKHIVRLPFAVTAYGTAALLLVMVWEPQI ALSIPIILR+IM+IEAVCAGSFMI+YI YVQKYNSLNSQPDVLKSLYSPLQQSSSLEDLRY
Subjt:  QGYYSFSQKLKHIVRLPFAVTAYGTAALLLVMVWEPQIRALSIPIILRMIMIIEAVCAGSFMIMYIGYVQKYNSLNSQPDVLKSLYSPLQQSSSLEDLRY

Query:  HDVGRLSDQQMALLQYQRENLHFLNEEILRLQECLSKYERSSDGSTPQVDLAHMLAARDQELRTLSAEMNQVTSELRLARSVIAERDTEIQKLLTTNKQY
        HDVGRLSDQQMALLQYQRENLHFLNEEILRLQECLSKYERSSDGSTPQVDLAHMLAARDQELRTLSAEMNQVTSELRLARSVIAERDTEIQKLLTTNKQY
Subjt:  HDVGRLSDQQMALLQYQRENLHFLNEEILRLQECLSKYERSSDGSTPQVDLAHMLAARDQELRTLSAEMNQVTSELRLARSVIAERDTEIQKLLTTNKQY

Query:  VEENERLRAILGEWSTRAAKGSRGVDA
        VEENERLRAILGEWSTRAAK  R ++A
Subjt:  VEENERLRAILGEWSTRAAKGSRGVDA

KAE8653456.1 hypothetical protein Csa_007539 [Cucumis sativus]5.3e-16589.77Show/hide
Query:  MAAERHASSRATSSEDNAMFLDILHEAPLFGHRKPARTVGSIIYCFVLASYAALAIGAPWIFHPIKHLVQPFLCSCDVVLLILTGIFQQYLVYQVQKIRL
        MAAERHASSRATSSEDNAMFLDILHEAPLFGHRKPARTVGSIIYCFVLASYAALAIGAPWIFHPIKH V+P LCSC VVLL+LTGIFQQYLVYQV KIRL
Subjt:  MAAERHASSRATSSEDNAMFLDILHEAPLFGHRKPARTVGSIIYCFVLASYAALAIGAPWIFHPIKHLVQPFLCSCDVVLLILTGIFQQYLVYQVQKIRL

Query:  QGYYSFSQKLKHIVRLPFAVTAYGTAALLLVMVWEPQIRALSIPIILRMIMIIEAVCAGSFMIMYIGYVQKYNSLNSQPDVLKSLYSPLQQSSSLEDLRY
        QGYYSFSQKLKHIVRLPFAVTAYGTAALLLVM WEPQI ALSIPIILR+IM+IEAVCAGSFMI+YI YVQKYNSLNSQPDVLKSLYSPLQQSSSLEDLRY
Subjt:  QGYYSFSQKLKHIVRLPFAVTAYGTAALLLVMVWEPQIRALSIPIILRMIMIIEAVCAGSFMIMYIGYVQKYNSLNSQPDVLKSLYSPLQQSSSLEDLRY

Query:  HDVGRLSDQQMALLQYQRENLHFLNEEILRLQECLSKYERSSDGSTPQVDLAHMLAARDQELRTLSAEMNQVTSELRLARSVIAERDTEIQKLLTTNKQY
        HDVGRLSDQQMALLQYQRENLHFLNEEILRLQECLSKYERSSDGSTPQVDLAHMLAARDQELRTLSAEMNQVTSELRLARSVIAERDTEIQKLLTTNKQY
Subjt:  HDVGRLSDQQMALLQYQRENLHFLNEEILRLQECLSKYERSSDGSTPQVDLAHMLAARDQELRTLSAEMNQVTSELRLARSVIAERDTEIQKLLTTNKQY

Query:  VEENERLRAILGEWSTRAAKGSRGVDACTKPCMEYQFILKVGT-ETPPHGPQ
        VEENERLRAILGEWSTRAAK  R ++A     +E Q   K+ T +  PH  +
Subjt:  VEENERLRAILGEWSTRAAKGSRGVDACTKPCMEYQFILKVGT-ETPPHGPQ

KAG6590036.1 Protein FIP1, partial [Cucurbita argyrosperma subsp. sororia]1.4e-16595.72Show/hide
Query:  MAAERHASSRATSSEDNAMFLDILHEAPLFGHRKPARTVGSIIYCFVLASYAALAIGAPWIFHPIKHLVQPFLCSCDVVLLILTGIFQQYLVYQVQKIRL
        MAAERHASSRATSSEDNAMFLDILHEAPLFGHRKPARTVGSIIYCFVLASYAALAIGAPWIF PI HLV+P LCSCDVVLL+LTGIFQQYLVYQVQKIRL
Subjt:  MAAERHASSRATSSEDNAMFLDILHEAPLFGHRKPARTVGSIIYCFVLASYAALAIGAPWIFHPIKHLVQPFLCSCDVVLLILTGIFQQYLVYQVQKIRL

Query:  QGYYSFSQKLKHIVRLPFAVTAYGTAALLLVMVWEPQIRALSIPIILRMIMIIEAVCAGSFMIMYIGYVQKYNSLNSQPDVLKSLYSPLQQSSSLEDLRY
        QGYYSFSQKLKHIVRLPFAVTAYGTAALLLVMVWEPQI ALSIPIILRMIM+IEAVCAGSFMI+YI YVQKYNSLNSQPDVLKSLYSPLQQSSSLEDLRY
Subjt:  QGYYSFSQKLKHIVRLPFAVTAYGTAALLLVMVWEPQIRALSIPIILRMIMIIEAVCAGSFMIMYIGYVQKYNSLNSQPDVLKSLYSPLQQSSSLEDLRY

Query:  HDVGRLSDQQMALLQYQRENLHFLNEEILRLQECLSKYERSSDGSTPQVDLAHMLAARDQELRTLSAEMNQVTSELRLARSVIAERDTEIQKLLTTNKQY
        HDVGRLSDQQMALLQYQRENLHFLNEEILRLQECLSKYERSSDGSTPQVDLAHMLAARDQELRTLSAEMNQVTSELRLARSVIAERDTEIQKLLTTNKQY
Subjt:  HDVGRLSDQQMALLQYQRENLHFLNEEILRLQECLSKYERSSDGSTPQVDLAHMLAARDQELRTLSAEMNQVTSELRLARSVIAERDTEIQKLLTTNKQY

Query:  VEENERLRAILGEWSTRAAKGSRGVDA
        VEENERLRAILGEWSTRAAK  R ++A
Subjt:  VEENERLRAILGEWSTRAAKGSRGVDA

XP_022147717.1 uncharacterized protein LOC111016583 [Momordica charantia]4.0e-16592.56Show/hide
Query:  MAAERHASSRATSSEDNAMFLDILHEAPLFGHRKPARTVGSIIYCFVLASYAALAIGAPWIFHPIKHLVQPFLCSCDVVLLILTGIFQQYLVYQVQKIRL
        MAAERHASSRAT+SEDNAMFLDILHEAPLFGHRKPARTVGSIIYCFVL  YAALAIGAPWIFHPIKH V+P LCSCDVVLL+LTGIFQQYLVYQVQKIRL
Subjt:  MAAERHASSRATSSEDNAMFLDILHEAPLFGHRKPARTVGSIIYCFVLASYAALAIGAPWIFHPIKHLVQPFLCSCDVVLLILTGIFQQYLVYQVQKIRL

Query:  QGYYSFSQKLKHIVRLPFAVTAYGTAALLLVMVWEPQIRALSIPIILRMIMIIEAVCAGSFMIMYIGYVQKYNSLNSQPDVLKSLYSPLQQSSSLEDLRY
        QGYYSFSQKLKHIVRLPFAVTAYGTAALLLV+VWEPQI ALSIPIILRMIM+IEAVCAGSFMI+YIGYVQKYNSLNSQPDVLKSLYSPLQQSSSLEDLRY
Subjt:  QGYYSFSQKLKHIVRLPFAVTAYGTAALLLVMVWEPQIRALSIPIILRMIMIIEAVCAGSFMIMYIGYVQKYNSLNSQPDVLKSLYSPLQQSSSLEDLRY

Query:  HDVGRLSDQQMALLQYQRENLHFLNEEILRLQECLSKYERSSDGSTPQVDLAHMLAARDQELRTLSAEMNQVTSELRLARSVIAERDTEIQKLLTTNKQY
        HDVGRLSDQQMALLQYQRENLHFLNEEILRLQECLSKYERSSDGSTPQVDLAHMLAARDQELRTLSAEMNQVTSELRLARSVIAERDTEI+K LTTNKQY
Subjt:  HDVGRLSDQQMALLQYQRENLHFLNEEILRLQECLSKYERSSDGSTPQVDLAHMLAARDQELRTLSAEMNQVTSELRLARSVIAERDTEIQKLLTTNKQY

Query:  VEENERLRAILGEWSTRAAKGSRGVDACTKPCMEYQ
        VEENERLRAILGEWSTRAAK  R ++   K  +E Q
Subjt:  VEENERLRAILGEWSTRAAKGSRGVDACTKPCMEYQ

XP_023516893.1 uncharacterized protein LOC111780659 isoform X1 [Cucurbita pepo subsp. pepo]1.2e-16495.11Show/hide
Query:  MAAERHASSRATSSEDNAMFLDILHEAPLFGHRKPARTVGSIIYCFVLASYAALAIGAPWIFHPIKHLVQPFLCSCDVVLLILTGIFQQYLVYQVQKIRL
        MAAERHASSRATSSEDNAMFLDILHEAPLFGHRKPARTVGSIIYCFVL  YAALAIGAPWIF PI HLV+P LCSCDVVLL+LTGIFQQYLVYQVQKIRL
Subjt:  MAAERHASSRATSSEDNAMFLDILHEAPLFGHRKPARTVGSIIYCFVLASYAALAIGAPWIFHPIKHLVQPFLCSCDVVLLILTGIFQQYLVYQVQKIRL

Query:  QGYYSFSQKLKHIVRLPFAVTAYGTAALLLVMVWEPQIRALSIPIILRMIMIIEAVCAGSFMIMYIGYVQKYNSLNSQPDVLKSLYSPLQQSSSLEDLRY
        QGYYSFSQKLKHIVRLPFAVTAYGTAALLLVMVWEPQI ALSIPIILRMIM+IEAVCAGSFMI+YI YVQKYNSLNSQPDVLKSLYSPLQQSSSLEDLRY
Subjt:  QGYYSFSQKLKHIVRLPFAVTAYGTAALLLVMVWEPQIRALSIPIILRMIMIIEAVCAGSFMIMYIGYVQKYNSLNSQPDVLKSLYSPLQQSSSLEDLRY

Query:  HDVGRLSDQQMALLQYQRENLHFLNEEILRLQECLSKYERSSDGSTPQVDLAHMLAARDQELRTLSAEMNQVTSELRLARSVIAERDTEIQKLLTTNKQY
        HDVGRLSDQQMALLQYQRENLHFLNEEILRLQECLSKYERSSDGSTPQVDLAHMLAARDQELRTLSAEMNQVTSELRLARSVIAERDTEIQKLLTTNKQY
Subjt:  HDVGRLSDQQMALLQYQRENLHFLNEEILRLQECLSKYERSSDGSTPQVDLAHMLAARDQELRTLSAEMNQVTSELRLARSVIAERDTEIQKLLTTNKQY

Query:  VEENERLRAILGEWSTRAAKGSRGVDA
        VEENERLRAILGEWSTRAAK  R ++A
Subjt:  VEENERLRAILGEWSTRAAKGSRGVDA

TrEMBL top hitse value%identityAlignment
A0A0A0M261 Uncharacterized protein2.2e-16489.2Show/hide
Query:  MAAERHASSRATSSEDNAMFLDILHEAPLFGHRKPARTVGSIIYCFVLASYAALAIGAPWIFHPIKHLVQPFLCSCDVVLLILTGIFQQYLVYQVQKIRL
        MAAERHASSRATSSEDNAMFLDILHEAPLFGHRKPARTVGSIIYCFVL  YAALAIGAPWIFHPIKH V+P LCSC VVLL+LTGIFQQYLVYQV KIRL
Subjt:  MAAERHASSRATSSEDNAMFLDILHEAPLFGHRKPARTVGSIIYCFVLASYAALAIGAPWIFHPIKHLVQPFLCSCDVVLLILTGIFQQYLVYQVQKIRL

Query:  QGYYSFSQKLKHIVRLPFAVTAYGTAALLLVMVWEPQIRALSIPIILRMIMIIEAVCAGSFMIMYIGYVQKYNSLNSQPDVLKSLYSPLQQSSSLEDLRY
        QGYYSFSQKLKHIVRLPFAVTAYGTAALLLVM WEPQI ALSIPIILR+IM+IEAVCAGSFMI+YI YVQKYNSLNSQPDVLKSLYSPLQQSSSLEDLRY
Subjt:  QGYYSFSQKLKHIVRLPFAVTAYGTAALLLVMVWEPQIRALSIPIILRMIMIIEAVCAGSFMIMYIGYVQKYNSLNSQPDVLKSLYSPLQQSSSLEDLRY

Query:  HDVGRLSDQQMALLQYQRENLHFLNEEILRLQECLSKYERSSDGSTPQVDLAHMLAARDQELRTLSAEMNQVTSELRLARSVIAERDTEIQKLLTTNKQY
        HDVGRLSDQQMALLQYQRENLHFLNEEILRLQECLSKYERSSDGSTPQVDLAHMLAARDQELRTLSAEMNQVTSELRLARSVIAERDTEIQKLLTTNKQY
Subjt:  HDVGRLSDQQMALLQYQRENLHFLNEEILRLQECLSKYERSSDGSTPQVDLAHMLAARDQELRTLSAEMNQVTSELRLARSVIAERDTEIQKLLTTNKQY

Query:  VEENERLRAILGEWSTRAAKGSRGVDACTKPCMEYQFILKVGT-ETPPHGPQ
        VEENERLRAILGEWSTRAAK  R ++A     +E Q   K+ T +  PH  +
Subjt:  VEENERLRAILGEWSTRAAKGSRGVDACTKPCMEYQFILKVGT-ETPPHGPQ

A0A1S3BRE8 uncharacterized protein LOC1034923697.4e-16594.8Show/hide
Query:  MAAERHASSRATSSEDNAMFLDILHEAPLFGHRKPARTVGSIIYCFVLASYAALAIGAPWIFHPIKHLVQPFLCSCDVVLLILTGIFQQYLVYQVQKIRL
        MAAERHASSRATSSEDNAMFLDILHEAPLFGHRKPARTVGSIIYCFVL  YAALAIGAPWIFHPIKHLV+P LCSC VVLL+LTGIFQQYLVYQV KIRL
Subjt:  MAAERHASSRATSSEDNAMFLDILHEAPLFGHRKPARTVGSIIYCFVLASYAALAIGAPWIFHPIKHLVQPFLCSCDVVLLILTGIFQQYLVYQVQKIRL

Query:  QGYYSFSQKLKHIVRLPFAVTAYGTAALLLVMVWEPQIRALSIPIILRMIMIIEAVCAGSFMIMYIGYVQKYNSLNSQPDVLKSLYSPLQQSSSLEDLRY
        QGYYSFSQKLKHIVRLPFAVTAYGTAALLLVMVWEPQI ALSIPIILR+IM+IEAVCAGSFMI+YI YVQKYNSLNSQPDVLKSLYSPLQQSSSLEDLRY
Subjt:  QGYYSFSQKLKHIVRLPFAVTAYGTAALLLVMVWEPQIRALSIPIILRMIMIIEAVCAGSFMIMYIGYVQKYNSLNSQPDVLKSLYSPLQQSSSLEDLRY

Query:  HDVGRLSDQQMALLQYQRENLHFLNEEILRLQECLSKYERSSDGSTPQVDLAHMLAARDQELRTLSAEMNQVTSELRLARSVIAERDTEIQKLLTTNKQY
        HDVGRLSDQQMALLQYQRENLHFLNEEILRLQECLSKYERSSDGSTPQVDLAHMLAARDQELRTLSAEMNQVTSELRLARSVIAERDTEIQKLLTTNKQY
Subjt:  HDVGRLSDQQMALLQYQRENLHFLNEEILRLQECLSKYERSSDGSTPQVDLAHMLAARDQELRTLSAEMNQVTSELRLARSVIAERDTEIQKLLTTNKQY

Query:  VEENERLRAILGEWSTRAAKGSRGVDA
        VEENERLRAILGEWSTRAAK  R ++A
Subjt:  VEENERLRAILGEWSTRAAKGSRGVDA

A0A5A7UKR5 Uncharacterized protein8.8e-16695.41Show/hide
Query:  MAAERHASSRATSSEDNAMFLDILHEAPLFGHRKPARTVGSIIYCFVLASYAALAIGAPWIFHPIKHLVQPFLCSCDVVLLILTGIFQQYLVYQVQKIRL
        MAAERHASSRATSSEDNAMFLDILHEAPLFGHRKPARTVGSIIYCFVLASYAALAIGAPWIFHPIKHLV+P LCSC VVLL+LTGIFQQYLVYQV KIRL
Subjt:  MAAERHASSRATSSEDNAMFLDILHEAPLFGHRKPARTVGSIIYCFVLASYAALAIGAPWIFHPIKHLVQPFLCSCDVVLLILTGIFQQYLVYQVQKIRL

Query:  QGYYSFSQKLKHIVRLPFAVTAYGTAALLLVMVWEPQIRALSIPIILRMIMIIEAVCAGSFMIMYIGYVQKYNSLNSQPDVLKSLYSPLQQSSSLEDLRY
        QGYYSFSQKLKHIVRLPFAVTAYGTAALLLVMVWEPQI ALSIPIILR+IM+IEAVCAGSFMI+YI YVQKYNSLNSQPDVLKSLYSPLQQSSSLEDLRY
Subjt:  QGYYSFSQKLKHIVRLPFAVTAYGTAALLLVMVWEPQIRALSIPIILRMIMIIEAVCAGSFMIMYIGYVQKYNSLNSQPDVLKSLYSPLQQSSSLEDLRY

Query:  HDVGRLSDQQMALLQYQRENLHFLNEEILRLQECLSKYERSSDGSTPQVDLAHMLAARDQELRTLSAEMNQVTSELRLARSVIAERDTEIQKLLTTNKQY
        HDVGRLSDQQMALLQYQRENLHFLNEEILRLQECLSKYERSSDGSTPQVDLAHMLAARDQELRTLSAEMNQVTSELRLARSVIAERDTEIQKLLTTNKQY
Subjt:  HDVGRLSDQQMALLQYQRENLHFLNEEILRLQECLSKYERSSDGSTPQVDLAHMLAARDQELRTLSAEMNQVTSELRLARSVIAERDTEIQKLLTTNKQY

Query:  VEENERLRAILGEWSTRAAKGSRGVDA
        VEENERLRAILGEWSTRAAK  R ++A
Subjt:  VEENERLRAILGEWSTRAAKGSRGVDA

A0A6J1D357 uncharacterized protein LOC1110165832.0e-16592.56Show/hide
Query:  MAAERHASSRATSSEDNAMFLDILHEAPLFGHRKPARTVGSIIYCFVLASYAALAIGAPWIFHPIKHLVQPFLCSCDVVLLILTGIFQQYLVYQVQKIRL
        MAAERHASSRAT+SEDNAMFLDILHEAPLFGHRKPARTVGSIIYCFVL  YAALAIGAPWIFHPIKH V+P LCSCDVVLL+LTGIFQQYLVYQVQKIRL
Subjt:  MAAERHASSRATSSEDNAMFLDILHEAPLFGHRKPARTVGSIIYCFVLASYAALAIGAPWIFHPIKHLVQPFLCSCDVVLLILTGIFQQYLVYQVQKIRL

Query:  QGYYSFSQKLKHIVRLPFAVTAYGTAALLLVMVWEPQIRALSIPIILRMIMIIEAVCAGSFMIMYIGYVQKYNSLNSQPDVLKSLYSPLQQSSSLEDLRY
        QGYYSFSQKLKHIVRLPFAVTAYGTAALLLV+VWEPQI ALSIPIILRMIM+IEAVCAGSFMI+YIGYVQKYNSLNSQPDVLKSLYSPLQQSSSLEDLRY
Subjt:  QGYYSFSQKLKHIVRLPFAVTAYGTAALLLVMVWEPQIRALSIPIILRMIMIIEAVCAGSFMIMYIGYVQKYNSLNSQPDVLKSLYSPLQQSSSLEDLRY

Query:  HDVGRLSDQQMALLQYQRENLHFLNEEILRLQECLSKYERSSDGSTPQVDLAHMLAARDQELRTLSAEMNQVTSELRLARSVIAERDTEIQKLLTTNKQY
        HDVGRLSDQQMALLQYQRENLHFLNEEILRLQECLSKYERSSDGSTPQVDLAHMLAARDQELRTLSAEMNQVTSELRLARSVIAERDTEI+K LTTNKQY
Subjt:  HDVGRLSDQQMALLQYQRENLHFLNEEILRLQECLSKYERSSDGSTPQVDLAHMLAARDQELRTLSAEMNQVTSELRLARSVIAERDTEIQKLLTTNKQY

Query:  VEENERLRAILGEWSTRAAKGSRGVDACTKPCMEYQ
        VEENERLRAILGEWSTRAAK  R ++   K  +E Q
Subjt:  VEENERLRAILGEWSTRAAKGSRGVDACTKPCMEYQ

A0A6J1HBP9 uncharacterized protein LOC111461818 isoform X11.3e-16495.11Show/hide
Query:  MAAERHASSRATSSEDNAMFLDILHEAPLFGHRKPARTVGSIIYCFVLASYAALAIGAPWIFHPIKHLVQPFLCSCDVVLLILTGIFQQYLVYQVQKIRL
        MAAERHASSRATSSEDNAMFLDILHEAPLFGHRKPARTVGSIIYCFVL  YAALAIGAPWIF PI HLV+P LCSCDVVLL+LTGIFQQYLVYQVQKIRL
Subjt:  MAAERHASSRATSSEDNAMFLDILHEAPLFGHRKPARTVGSIIYCFVLASYAALAIGAPWIFHPIKHLVQPFLCSCDVVLLILTGIFQQYLVYQVQKIRL

Query:  QGYYSFSQKLKHIVRLPFAVTAYGTAALLLVMVWEPQIRALSIPIILRMIMIIEAVCAGSFMIMYIGYVQKYNSLNSQPDVLKSLYSPLQQSSSLEDLRY
        QGYYSFSQKLKHIVRLPFAVTAYGTAALLLVMVWEPQI ALSIPIILRMIM+IEAVCAGSFMI+YI YVQKYNSLNSQPDVLKSLYSPLQQSSSLEDLRY
Subjt:  QGYYSFSQKLKHIVRLPFAVTAYGTAALLLVMVWEPQIRALSIPIILRMIMIIEAVCAGSFMIMYIGYVQKYNSLNSQPDVLKSLYSPLQQSSSLEDLRY

Query:  HDVGRLSDQQMALLQYQRENLHFLNEEILRLQECLSKYERSSDGSTPQVDLAHMLAARDQELRTLSAEMNQVTSELRLARSVIAERDTEIQKLLTTNKQY
        HDVGRLSDQQMALLQYQRENLHFLNEEILRLQECLSKYERSSDGSTPQVDLAHMLAARDQELRTLSAEMNQVTSELRLARSVIAERDTEIQKLLTTNKQY
Subjt:  HDVGRLSDQQMALLQYQRENLHFLNEEILRLQECLSKYERSSDGSTPQVDLAHMLAARDQELRTLSAEMNQVTSELRLARSVIAERDTEIQKLLTTNKQY

Query:  VEENERLRAILGEWSTRAAKGSRGVDA
        VEENERLRAILGEWSTRAAK  R ++A
Subjt:  VEENERLRAILGEWSTRAAKGSRGVDA

SwissProt top hitse value%identityAlignment
A0A072UTP9 Pro-cathepsin H2.6e-0570Show/hide
Query:  KTYGKNISLSEQQLLDCAGDFDNFGCESGL
        + +GKNISLSEQQL+DCAG ++NFGC  GL
Subjt:  KTYGKNISLSEQQLLDCAGDFDNFGCESGL

P05167 Thiol protease aleurain8.9e-0674.19Show/hide
Query:  TKTYGKNISLSEQQLLDCAGDFDNFGCESGL
        T+  GKNISLSEQQL+DCAG F+NFGC  GL
Subjt:  TKTYGKNISLSEQQLLDCAGDFDNFGCESGL

Q40143 Cysteine proteinase 35.8e-0570Show/hide
Query:  KTYGKNISLSEQQLLDCAGDFDNFGCESGL
        + +GK ISLSEQQL+DCAG F+NFGC  GL
Subjt:  KTYGKNISLSEQQLLDCAGDFDNFGCESGL

Q8RWQ9 Thiol protease aleurain-like4.4e-0570Show/hide
Query:  KTYGKNISLSEQQLLDCAGDFDNFGCESGL
        + +GK ISLSEQQL+DCAG F+NFGC  GL
Subjt:  KTYGKNISLSEQQLLDCAGDFDNFGCESGL

Q8S8K9 Protein FIP19.5e-12572.09Show/hide
Query:  MAAERHASSRATSSEDNAMFLDILHEAPLFGHRKPARTVGSIIYCFVLASYAALAIGAPWIFHPIKHLVQPFLCSCDVVLLILTGIFQQYLVYQVQKIRL
        M+ ER ASS   ++E+NAMFLDILHEAPLFGHRK    VGS +Y  +LA YA LA GAPW+FH ++ L    LC CDV LL++TG+FQQY VYQVQKIRL
Subjt:  MAAERHASSRATSSEDNAMFLDILHEAPLFGHRKPARTVGSIIYCFVLASYAALAIGAPWIFHPIKHLVQPFLCSCDVVLLILTGIFQQYLVYQVQKIRL

Query:  QGYYSFSQKLKHIVRLPFAVTAYGTAALLLVMVWEPQIRALSIPIILRMIMIIEAVCAGSFMIMYIGYVQKYNSLNSQPDVLKSLYSPLQQSSSLEDLRY
        QGYYSFSQKLKH+VRLPFA+ AYGTAA+LLV+VW PQI  LSI  + R+IM++EAV AG FM +YIGYV +YNS+NS+PDVLKSLYSPLQ SSS+E LRY
Subjt:  QGYYSFSQKLKHIVRLPFAVTAYGTAALLLVMVWEPQIRALSIPIILRMIMIIEAVCAGSFMIMYIGYVQKYNSLNSQPDVLKSLYSPLQQSSSLEDLRY

Query:  HDVGRLSDQQMALLQYQRENLHFLNEEILRLQECLSKYERSSDGSTPQVDLAHMLAARDQELRTLSAEMNQVTSELRLARSVIAERDTEIQKLLTTNKQY
        ++ GRLSDQQ ALLQYQRENLHFL+EEIL LQE LSKYE+S DGSTPQVDLAH+LAARDQELRTLSAEMNQ+ SELRLARS+IAERD E+Q++ +TN QY
Subjt:  HDVGRLSDQQMALLQYQRENLHFLNEEILRLQECLSKYERSSDGSTPQVDLAHMLAARDQELRTLSAEMNQVTSELRLARSVIAERDTEIQKLLTTNKQY

Query:  VEENERLRAILGEWSTRAAKGSRGVD
        +EENERLRAIL EWS RAA   R ++
Subjt:  VEENERLRAILGEWSTRAAKGSRGVD

Arabidopsis top hitse value%identityAlignment
AT2G06005.1 FRIGIDA interacting protein 16.7e-12672.09Show/hide
Query:  MAAERHASSRATSSEDNAMFLDILHEAPLFGHRKPARTVGSIIYCFVLASYAALAIGAPWIFHPIKHLVQPFLCSCDVVLLILTGIFQQYLVYQVQKIRL
        M+ ER ASS   ++E+NAMFLDILHEAPLFGHRK    VGS +Y  +LA YA LA GAPW+FH ++ L    LC CDV LL++TG+FQQY VYQVQKIRL
Subjt:  MAAERHASSRATSSEDNAMFLDILHEAPLFGHRKPARTVGSIIYCFVLASYAALAIGAPWIFHPIKHLVQPFLCSCDVVLLILTGIFQQYLVYQVQKIRL

Query:  QGYYSFSQKLKHIVRLPFAVTAYGTAALLLVMVWEPQIRALSIPIILRMIMIIEAVCAGSFMIMYIGYVQKYNSLNSQPDVLKSLYSPLQQSSSLEDLRY
        QGYYSFSQKLKH+VRLPFA+ AYGTAA+LLV+VW PQI  LSI  + R+IM++EAV AG FM +YIGYV +YNS+NS+PDVLKSLYSPLQ SSS+E LRY
Subjt:  QGYYSFSQKLKHIVRLPFAVTAYGTAALLLVMVWEPQIRALSIPIILRMIMIIEAVCAGSFMIMYIGYVQKYNSLNSQPDVLKSLYSPLQQSSSLEDLRY

Query:  HDVGRLSDQQMALLQYQRENLHFLNEEILRLQECLSKYERSSDGSTPQVDLAHMLAARDQELRTLSAEMNQVTSELRLARSVIAERDTEIQKLLTTNKQY
        ++ GRLSDQQ ALLQYQRENLHFL+EEIL LQE LSKYE+S DGSTPQVDLAH+LAARDQELRTLSAEMNQ+ SELRLARS+IAERD E+Q++ +TN QY
Subjt:  HDVGRLSDQQMALLQYQRENLHFLNEEILRLQECLSKYERSSDGSTPQVDLAHMLAARDQELRTLSAEMNQVTSELRLARSVIAERDTEIQKLLTTNKQY

Query:  VEENERLRAILGEWSTRAAKGSRGVD
        +EENERLRAIL EWS RAA   R ++
Subjt:  VEENERLRAILGEWSTRAAKGSRGVD

AT2G06005.2 FRIGIDA interacting protein 18.9e-9470.3Show/hide
Query:  IFHPIKHLVQPFLCSCDVVLLILTGIFQQYLVYQVQKIRLQGYYSFSQKLKHIVRLPFAVTAYGTAALLLVMVWEPQIRALSIPIILRMIMIIEAVCAGS
        +FH ++ L    LC CDV LL++TG+FQQY VYQVQKIRLQGYYSFSQKLKH+VRLPFA+ AYGTAA+LLV+VW PQI  LSI  + R+IM++EAV AG 
Subjt:  IFHPIKHLVQPFLCSCDVVLLILTGIFQQYLVYQVQKIRLQGYYSFSQKLKHIVRLPFAVTAYGTAALLLVMVWEPQIRALSIPIILRMIMIIEAVCAGS

Query:  FMIMYIGYVQKYNSLNSQPDVLKSLYSPLQQSSSLEDLRYHDVGRLSDQQMALLQYQRENLHFLNEEILRLQECLSKYERSSDGSTPQVDLAHMLAARDQ
        FM +YI             DVLKSLYSPLQ SSS+E LRY++ GRLSDQQ ALLQYQRENLHFL+EEIL LQE LSKYE+S DGSTPQVDLAH+LAARDQ
Subjt:  FMIMYIGYVQKYNSLNSQPDVLKSLYSPLQQSSSLEDLRYHDVGRLSDQQMALLQYQRENLHFLNEEILRLQECLSKYERSSDGSTPQVDLAHMLAARDQ

Query:  ELRTLSAEMNQVTSELRLARSVIAERDTEIQKLLTTNKQYVEENERLRAILGEWSTRAAKGSRGVD
        ELRTLSAEMNQ+ SELRLARS+IAERD E+Q++ +TN QY+EENERLRAIL EWS RAA   R ++
Subjt:  ELRTLSAEMNQVTSELRLARSVIAERDTEIQKLLTTNKQYVEENERLRAILGEWSTRAAKGSRGVD

AT3G45310.1 Cysteine proteinases superfamily protein3.1e-0670Show/hide
Query:  KTYGKNISLSEQQLLDCAGDFDNFGCESGL
        + +GK ISLSEQQL+DCAG F+NFGC  GL
Subjt:  KTYGKNISLSEQQLLDCAGDFDNFGCESGL

AT5G20580.1 BEST Arabidopsis thaliana protein match is: FRIGIDA interacting protein 1 (TAIR:AT2G06005.1)3.1e-11568.2Show/hide
Query:  MAAERHASSRATSSE-DNAMFLDILHEAPLFGHRKPARTVGSIIYCFVLASYAALAIGAPWIFHPIKHLVQPFLCSCDVVLLILTGIFQQYLVYQVQKIR
        MA +R ASS   S+E DNAMFLDILHEAPLFGHR+    VGS IY  +LA YA LA GAPWI   + +L+   LCSC+V LL+LTG+FQQY V QVQKIR
Subjt:  MAAERHASSRATSSE-DNAMFLDILHEAPLFGHRKPARTVGSIIYCFVLASYAALAIGAPWIFHPIKHLVQPFLCSCDVVLLILTGIFQQYLVYQVQKIR

Query:  LQGYYSFSQKLKHIVRLPFAVTAYGTAALLLVMVWEPQIRALSIPIILRMIMIIEAVCAGSFMIMYIGYVQKYNSLNSQPDVLKSLYSPLQQSSSLEDLR
        LQGYYSFSQKLKH+VRLPFA+ AYGTA++LL M W P +  L I  + R IM +EA+ A SFMI+++GYV++YNS+NSQPDVL SLYSPL Q ++LE LR
Subjt:  LQGYYSFSQKLKHIVRLPFAVTAYGTAALLLVMVWEPQIRALSIPIILRMIMIIEAVCAGSFMIMYIGYVQKYNSLNSQPDVLKSLYSPLQQSSSLEDLR

Query:  YHDVGRLSDQQMALLQYQRENLHFLNEEILRLQECLSKYERSSDGSTPQVDLAHMLAARDQELRTLSAEMNQVTSELRLARSVIAERDTEIQKLLTTNKQ
        YH+ GRLSDQQMALLQYQRENLH+L+EEILRLQE LSKYE ++  STPQVDLAH++A RDQELRTLSAE++Q+ SEL LARS+I+ERD EIQ +  TN Q
Subjt:  YHDVGRLSDQQMALLQYQRENLHFLNEEILRLQECLSKYERSSDGSTPQVDLAHMLAARDQELRTLSAEMNQVTSELRLARSVIAERDTEIQKLLTTNKQ

Query:  YVEENERLRAILGEWSTRAAKGSRGVD
        YV ENERLRAILGEWS RAAK  R ++
Subjt:  YVEENERLRAILGEWSTRAAKGSRGVD

AT5G20580.2 BEST Arabidopsis thaliana protein match is: FRIGIDA interacting protein 1 (TAIR:AT2G06005.1)2.4e-11568.2Show/hide
Query:  MAAERHASSRATSSE-DNAMFLDILHEAPLFGHRKPARTVGSIIYCFVLASYAALAIGAPWIFHPIKHLVQPFLCSCDVVLLILTGIFQQYLVYQVQKIR
        MA +R ASS   S+E DNAMFLDILHEAPLFGHR+    VGS IY  +LA YA LA GAPWI   + +L+   LCSC+V LL+LTG+FQQY V QVQKIR
Subjt:  MAAERHASSRATSSE-DNAMFLDILHEAPLFGHRKPARTVGSIIYCFVLASYAALAIGAPWIFHPIKHLVQPFLCSCDVVLLILTGIFQQYLVYQVQKIR

Query:  LQGYYSFSQKLKHIVRLPFAVTAYGTAALLLVMVWEPQIRALSIPIILRMIMIIEAVCAGSFMIMYIGYVQKYNSLNSQPDVLKSLYSPLQQSSSLEDLR
        LQGYYSFSQKLKH+VRLPFA+ AYGTA++LL M W P +  L I  + R IM +EA+ A SFMI+++GYV++YNS+NSQPDVL SLYSPL Q ++LE LR
Subjt:  LQGYYSFSQKLKHIVRLPFAVTAYGTAALLLVMVWEPQIRALSIPIILRMIMIIEAVCAGSFMIMYIGYVQKYNSLNSQPDVLKSLYSPLQQSSSLEDLR

Query:  YHDVGRLSDQQMALLQYQRENLHFLNEEILRLQECLSKYERSSDGSTPQVDLAHMLAARDQELRTLSAEMNQVTSELRLARSVIAERDTEIQKLLTTNKQ
        YH+ GRLSDQQMALLQYQRENLH+L+EEILRLQE LSKYE ++  STPQVDLAH++A RDQELRTLSAE++Q+ SEL LARS+I+ERD EIQ +  TN Q
Subjt:  YHDVGRLSDQQMALLQYQRENLHFLNEEILRLQECLSKYERSSDGSTPQVDLAHMLAARDQELRTLSAEMNQVTSELRLARSVIAERDTEIQKLLTTNKQ

Query:  YVEENERLRAILGEWSTRAAKGSRGVD
        YV ENERLRAILGEWS RAAK  R ++
Subjt:  YVEENERLRAILGEWSTRAAKGSRGVD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCATTGTCAGCCCTGTTAAAGATCAAGGCCACTGTGGTTCCTGGACATTCAGATTATAGCATTATTTTCCCAACTTGCCTCCGCCTACGCACAAAAACATATGGGAA
GAACATCTCCCTGTCGGAGCAGCAGCTGTTGGACTGTGCCGGAGATTTTGACAACTTTGGCTGCGAAAGTGGACTGTGCCTTCCCAAGCTTTTGAGTACATCAAGTACAA
TGGCGGCCTTGGAAACTGAAGGAATTTACTCTTGTGACACATGCCGCAGTGATCCTGAGGTGTTGCAACTGGTGCTTCATTTCCCCCGTTTGCTTAAGAGTTCTGTTGAA
GAGGACAATGGGCAAAGCTTCTTTCTTGACAGGGAAAGAAGTTATAACGTGTCAATTCCACCCGTTGTTGTACATGAAAAAAAAAAAAAAAAAACCTTTTGTCATTTCTA
CACGGCGAAAGAAGCCTATGCTGAAGCTGAAATAAGTTGCAGAATCATACTCTCTCTCTCTCTCTCTCTCAAAATCACTGAGAAACCGAATTCGCGGTTCGGTAGGCTTC
CACGGCGCAATTCAGCACCCAATTCTTCTTCCATCTCCGTAAAAGCCTCTACAATCTCAAGACCAAACATGGCAGCGGAGAGGCACGCCTCTTCGCGCGCAACATCATCT
GAAGACAACGCGATGTTTCTCGATATACTGCATGAGGCCCCGTTATTTGGTCACCGGAAGCCTGCGAGAACAGTTGGGAGCATAATATATTGTTTTGTTTTGGCAAGCTA
TGCTGCCCTGGCTATTGGAGCACCATGGATTTTTCATCCTATAAAGCACTTGGTTCAACCATTTCTCTGCAGTTGTGATGTTGTTCTTTTAATACTCACAGGTATCTTTC
AGCAATATCTAGTATATCAAGTCCAGAAAATTCGCTTGCAGGGTTATTATAGTTTTAGCCAGAAGTTAAAGCATATTGTTCGTCTACCTTTTGCAGTTACTGCATATGGA
ACTGCTGCCCTTTTACTTGTCATGGTATGGGAACCTCAAATCCGCGCACTTTCGATCCCTATAATTCTACGGATGATTATGATAATTGAGGCCGTATGTGCTGGATCATT
TATGATTATGTATATTGGTTATGTACAAAAGTACAATTCATTGAATTCTCAACCTGATGTTTTGAAGTCATTGTATTCTCCACTTCAGCAATCAAGTTCTTTGGAAGATC
TAAGGTATCATGATGTTGGTCGACTTTCTGATCAGCAAATGGCTCTGTTGCAATATCAGCGAGAGAACCTTCATTTTCTTAATGAAGAGATTCTTCGGTTGCAAGAGTGC
CTAAGTAAATATGAACGGTCTAGTGATGGAAGCACCCCTCAGGTTGATCTTGCCCATATGCTAGCTGCTCGAGATCAGGAATTGCGGACACTTTCAGCTGAGATGAATCA
GGTGACATCAGAACTTAGGCTTGCTCGATCTGTGATAGCTGAGAGGGATACTGAGATTCAGAAATTACTCACCACCAACAAGCAGTATGTAGAAGAAAATGAGAGACTGA
GAGCTATTCTAGGAGAATGGAGTACACGGGCTGCAAAGGGCAGCAGGGGAGTTGATGCTTGTACTAAACCATGCATGGAGTATCAGTTTATTCTCAAGGTTGGAACGGAA
ACGCCGCCCCATGGTCCTCAAGCTCCAACCATCTCACAGTGTCATTTCCTTTATCTTTCGTACCACGTATTTCACTCTCTACAAGTTCCCAAACACTCGGCCGAACTTCA
TTATCATTTGAATGGGTTTCCTCTTCCTCATCCAAATTCAGATCAATATCCACACACATCAAGGACGCCATTCTCTCGGCCAATCCCGGTACTTCCTGCAACTGTGCGCA
CCCAAGTATTCCACAAGTTTACCAGAGTAATGAACGTCAGAGAGCACGAGCCGTCTGCAACGTCCATCCTCGCACGGTTCGACTTGGAAGTGAGCGCCGCCATCGAAGGT
CTCCGACGTCAGCTTCTGGAACAGTTGACGGTGCAGAGATTGGGGAAGTCCGGATGCGGTAAGGAGCAAACCGTGGACTTCAACGAAATCCTCGTAGGTTTCGATTCTTC
TGGCCGCGGACATTTTCGGACGTCAATTACGCTGTCCTTGCCGCTCGATGGCTGCTTTGTGCAACTCGAAAGCGCAGAGCTGTGCTCGGCCGCCGCACCTGAGGGAGACG
CTCGATCGTCGTGGGCTTAG
mRNA sequenceShow/hide mRNA sequence
ATGGCATTGTCAGCCCTGTTAAAGATCAAGGCCACTGTGGTTCCTGGACATTCAGATTATAGCATTATTTTCCCAACTTGCCTCCGCCTACGCACAAAAACATATGGGAA
GAACATCTCCCTGTCGGAGCAGCAGCTGTTGGACTGTGCCGGAGATTTTGACAACTTTGGCTGCGAAAGTGGACTGTGCCTTCCCAAGCTTTTGAGTACATCAAGTACAA
TGGCGGCCTTGGAAACTGAAGGAATTTACTCTTGTGACACATGCCGCAGTGATCCTGAGGTGTTGCAACTGGTGCTTCATTTCCCCCGTTTGCTTAAGAGTTCTGTTGAA
GAGGACAATGGGCAAAGCTTCTTTCTTGACAGGGAAAGAAGTTATAACGTGTCAATTCCACCCGTTGTTGTACATGAAAAAAAAAAAAAAAAAACCTTTTGTCATTTCTA
CACGGCGAAAGAAGCCTATGCTGAAGCTGAAATAAGTTGCAGAATCATACTCTCTCTCTCTCTCTCTCTCAAAATCACTGAGAAACCGAATTCGCGGTTCGGTAGGCTTC
CACGGCGCAATTCAGCACCCAATTCTTCTTCCATCTCCGTAAAAGCCTCTACAATCTCAAGACCAAACATGGCAGCGGAGAGGCACGCCTCTTCGCGCGCAACATCATCT
GAAGACAACGCGATGTTTCTCGATATACTGCATGAGGCCCCGTTATTTGGTCACCGGAAGCCTGCGAGAACAGTTGGGAGCATAATATATTGTTTTGTTTTGGCAAGCTA
TGCTGCCCTGGCTATTGGAGCACCATGGATTTTTCATCCTATAAAGCACTTGGTTCAACCATTTCTCTGCAGTTGTGATGTTGTTCTTTTAATACTCACAGGTATCTTTC
AGCAATATCTAGTATATCAAGTCCAGAAAATTCGCTTGCAGGGTTATTATAGTTTTAGCCAGAAGTTAAAGCATATTGTTCGTCTACCTTTTGCAGTTACTGCATATGGA
ACTGCTGCCCTTTTACTTGTCATGGTATGGGAACCTCAAATCCGCGCACTTTCGATCCCTATAATTCTACGGATGATTATGATAATTGAGGCCGTATGTGCTGGATCATT
TATGATTATGTATATTGGTTATGTACAAAAGTACAATTCATTGAATTCTCAACCTGATGTTTTGAAGTCATTGTATTCTCCACTTCAGCAATCAAGTTCTTTGGAAGATC
TAAGGTATCATGATGTTGGTCGACTTTCTGATCAGCAAATGGCTCTGTTGCAATATCAGCGAGAGAACCTTCATTTTCTTAATGAAGAGATTCTTCGGTTGCAAGAGTGC
CTAAGTAAATATGAACGGTCTAGTGATGGAAGCACCCCTCAGGTTGATCTTGCCCATATGCTAGCTGCTCGAGATCAGGAATTGCGGACACTTTCAGCTGAGATGAATCA
GGTGACATCAGAACTTAGGCTTGCTCGATCTGTGATAGCTGAGAGGGATACTGAGATTCAGAAATTACTCACCACCAACAAGCAGTATGTAGAAGAAAATGAGAGACTGA
GAGCTATTCTAGGAGAATGGAGTACACGGGCTGCAAAGGGCAGCAGGGGAGTTGATGCTTGTACTAAACCATGCATGGAGTATCAGTTTATTCTCAAGGTTGGAACGGAA
ACGCCGCCCCATGGTCCTCAAGCTCCAACCATCTCACAGTGTCATTTCCTTTATCTTTCGTACCACGTATTTCACTCTCTACAAGTTCCCAAACACTCGGCCGAACTTCA
TTATCATTTGAATGGGTTTCCTCTTCCTCATCCAAATTCAGATCAATATCCACACACATCAAGGACGCCATTCTCTCGGCCAATCCCGGTACTTCCTGCAACTGTGCGCA
CCCAAGTATTCCACAAGTTTACCAGAGTAATGAACGTCAGAGAGCACGAGCCGTCTGCAACGTCCATCCTCGCACGGTTCGACTTGGAAGTGAGCGCCGCCATCGAAGGT
CTCCGACGTCAGCTTCTGGAACAGTTGACGGTGCAGAGATTGGGGAAGTCCGGATGCGGTAAGGAGCAAACCGTGGACTTCAACGAAATCCTCGTAGGTTTCGATTCTTC
TGGCCGCGGACATTTTCGGACGTCAATTACGCTGTCCTTGCCGCTCGATGGCTGCTTTGTGCAACTCGAAAGCGCAGAGCTGTGCTCGGCCGCCGCACCTGAGGGAGACG
CTCGATCGTCGTGGGCTTAG
Protein sequenceShow/hide protein sequence
MALSALLKIKATVVPGHSDYSIIFPTCLRLRTKTYGKNISLSEQQLLDCAGDFDNFGCESGLCLPKLLSTSSTMAALETEGIYSCDTCRSDPEVLQLVLHFPRLLKSSVE
EDNGQSFFLDRERSYNVSIPPVVVHEKKKKKTFCHFYTAKEAYAEAEISCRIILSLSLSLKITEKPNSRFGRLPRRNSAPNSSSISVKASTISRPNMAAERHASSRATSS
EDNAMFLDILHEAPLFGHRKPARTVGSIIYCFVLASYAALAIGAPWIFHPIKHLVQPFLCSCDVVLLILTGIFQQYLVYQVQKIRLQGYYSFSQKLKHIVRLPFAVTAYG
TAALLLVMVWEPQIRALSIPIILRMIMIIEAVCAGSFMIMYIGYVQKYNSLNSQPDVLKSLYSPLQQSSSLEDLRYHDVGRLSDQQMALLQYQRENLHFLNEEILRLQEC
LSKYERSSDGSTPQVDLAHMLAARDQELRTLSAEMNQVTSELRLARSVIAERDTEIQKLLTTNKQYVEENERLRAILGEWSTRAAKGSRGVDACTKPCMEYQFILKVGTE
TPPHGPQAPTISQCHFLYLSYHVFHSLQVPKHSAELHYHLNGFPLPHPNSDQYPHTSRTPFSRPIPVLPATVRTQVFHKFTRVMNVREHEPSATSILARFDLEVSAAIEG
LRRQLLEQLTVQRLGKSGCGKEQTVDFNEILVGFDSSGRGHFRTSITLSLPLDGCFVQLESAELCSAAAPEGDARSSWA