CuGenDBv2

Cp4.1LG18g03230 (gene) of Cucurbita pepo (MU-CU-16) v4.1 genome

Gene ID: Cp4.1LG18g03230
Organism: Cucurbita pepo var. pepo MU-CU-16 (Cucurbita pepo (MU-CU-16) v4.1)
Description: BEST Arabidopsis thaliana protein match is: FRIGIDA interacting protein 1.
Genome location: Cp4.1LG18:4711589..4720024
RNA-Seq Expression: Cp4.1LG18g03230
Synteny: Cp4.1LG18g03230
Gene Ontology terms: GO:0005765 - lysosomal membrane (cellular component)
GO:0005770 - late endosome (cellular component)
GO:0016021 - integral component of membrane (cellular component)
InterPro domains: IPR029399 - TMEM192 family
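The genome location field above follows a `sequence:start..end` pattern. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the record does not state the convention), the span could be parsed as follows. `parse_location` is a hypothetical helper introduced here for illustration, not part of CuGenDB.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    match = re.fullmatch(r"(.+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


seqid, start, end = parse_location("Cp4.1LG18:4711589..4720024")
# Length of the gene span under the assumed 1-based inclusive convention.
span_length = end - start + 1
print(seqid, start, end, span_length)
```

If the coordinates turned out to be 0-based or half-open, only the `+ 1` in the length calculation would need to change.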


Homology
GenBank top hits (e value, % identity, alignment)
KAG6590036.1 Protein FIP1, partial [Cucurbita argyrosperma subsp. sororia]; e value: 4.92e-221; % identity: 81.9
Query:  MAAERHASSRATSSEDNAMFLDILHEAPLFGHRKPARTVGSIIYCFV--GYAALAIGAPWIFRPIMHLVEPLLCSCDVVLLMLTGILITSRVMLITNYFL
        MAAERHASSRATSSEDNAMFLDILHEAPLFGHRKPARTVGSIIYCFV   YAALAIGAPWIFRPIMHLVEPLLCSCDVVLLMLTGI              
Subjt:  MAAERHASSRATSSEDNAMFLDILHEAPLFGHRKPARTVGSIIYCFV--GYAALAIGAPWIFRPIMHLVEPLLCSCDVVLLMLTGILITSRVMLITNYFL

Query:  TSKVLENFISAIRVASVLFVIAGLEISIQFILRNQAIEIDHLSTSLIVLTYQKKLNHCLLFQQYLVYQVQKIRLQGYYSFSQKLKHIVRLPFAVTAYGTA
                                                                    FQQYLVYQVQKIRLQGYYSFSQKLKHIVRLPFAVTAYGTA
Subjt:  TSKVLENFISAIRVASVLFVIAGLEISIQFILRNQAIEIDHLSTSLIVLTYQKKLNHCLLFQQYLVYQVQKIRLQGYYSFSQKLKHIVRLPFAVTAYGTA

Query:  ALLLVMVWEPQISALSIPIILRMIMLIEAVCAGSFMIIYISYVQKYNSLNSQPDVLKSLYSPLQQSSSLEDLRYHDVGRLSDQQMALLQYQRENLHFLNE
        ALLLVMVWEPQISALSIPIILRMIMLIEAVCAGSFMIIYISYVQKYNSLNSQPDVLKSLYSPLQQSSSLEDLRYHDVGRLSDQQMALLQYQRENLHFLNE
Subjt:  ALLLVMVWEPQISALSIPIILRMIMLIEAVCAGSFMIIYISYVQKYNSLNSQPDVLKSLYSPLQQSSSLEDLRYHDVGRLSDQQMALLQYQRENLHFLNE

Query:  EILRLQECLSKYERSSDGSTPQVDLAHMLAARDQELRTLSAEMNQVTSELRLARSVIAERDTEIQKLLTTNKQYVEENERLRAILGEWSTRAAKLERALE
        EILRLQECLSKYERSSDGSTPQVDLAHMLAARDQELRTLSAEMNQVTSELRLARSVIAERDTEIQKLLTTNKQYVEENERLRAILGEWSTRAAKLERALE
Subjt:  EILRLQECLSKYERSSDGSTPQVDLAHMLAARDQELRTLSAEMNQVTSELRLARSVIAERDTEIQKLLTTNKQYVEENERLRAILGEWSTRAAKLERALE

Query:  AERISNLELQKRIATLKKLSHASETSEQQGS
        AERISNLELQKRI+TLKKLSHASETSEQQGS
Subjt:  AERISNLELQKRIATLKKLSHASETSEQQGS

XP_022961268.1 uncharacterized protein LOC111461818 isoform X1 [Cucurbita moschata]; e value: 9.92e-221; % identity: 81.9
Query:  MAAERHASSRATSSEDNAMFLDILHEAPLFGHRKPARTVGSIIYCFV--GYAALAIGAPWIFRPIMHLVEPLLCSCDVVLLMLTGILITSRVMLITNYFL
        MAAERHASSRATSSEDNAMFLDILHEAPLFGHRKPARTVGSIIYCFV  GYAALAIGAPWIF PIMHLVEPLLCSCDVVLLMLTGI              
Subjt:  MAAERHASSRATSSEDNAMFLDILHEAPLFGHRKPARTVGSIIYCFV--GYAALAIGAPWIFRPIMHLVEPLLCSCDVVLLMLTGILITSRVMLITNYFL

Query:  TSKVLENFISAIRVASVLFVIAGLEISIQFILRNQAIEIDHLSTSLIVLTYQKKLNHCLLFQQYLVYQVQKIRLQGYYSFSQKLKHIVRLPFAVTAYGTA
                                                                    FQQYLVYQVQKIRLQGYYSFSQKLKHIVRLPFAVTAYGTA
Subjt:  TSKVLENFISAIRVASVLFVIAGLEISIQFILRNQAIEIDHLSTSLIVLTYQKKLNHCLLFQQYLVYQVQKIRLQGYYSFSQKLKHIVRLPFAVTAYGTA

Query:  ALLLVMVWEPQISALSIPIILRMIMLIEAVCAGSFMIIYISYVQKYNSLNSQPDVLKSLYSPLQQSSSLEDLRYHDVGRLSDQQMALLQYQRENLHFLNE
        ALLLVMVWEPQISALSIPIILRMIMLIEAVCAGSFMIIYISYVQKYNSLNSQPDVLKSLYSPLQQSSSLEDLRYHDVGRLSDQQMALLQYQRENLHFLNE
Subjt:  ALLLVMVWEPQISALSIPIILRMIMLIEAVCAGSFMIIYISYVQKYNSLNSQPDVLKSLYSPLQQSSSLEDLRYHDVGRLSDQQMALLQYQRENLHFLNE

Query:  EILRLQECLSKYERSSDGSTPQVDLAHMLAARDQELRTLSAEMNQVTSELRLARSVIAERDTEIQKLLTTNKQYVEENERLRAILGEWSTRAAKLERALE
        EILRLQECLSKYERSSDGSTPQVDLAHMLAARDQELRTLSAEMNQVTSELRLARSVIAERDTEIQKLLTTNKQYVEENERLRAILGEWSTRAAKLERALE
Subjt:  EILRLQECLSKYERSSDGSTPQVDLAHMLAARDQELRTLSAEMNQVTSELRLARSVIAERDTEIQKLLTTNKQYVEENERLRAILGEWSTRAAKLERALE

Query:  AERISNLELQKRIATLKKLSHASETSEQQGS
        AERISNLELQKRI+TLKKLSHASETSEQQGS
Subjt:  AERISNLELQKRIATLKKLSHASETSEQQGS

XP_022988290.1 uncharacterized protein LOC111485584 isoform X1 [Cucurbita maxima]; e value: 6.99e-221; % identity: 81.9
Query:  MAAERHASSRATSSEDNAMFLDILHEAPLFGHRKPARTVGSIIYCFV--GYAALAIGAPWIFRPIMHLVEPLLCSCDVVLLMLTGILITSRVMLITNYFL
        MAAERHASSRATSSEDNAMFLDILHEAPLFGHRKPARTVGSIIYCFV  GYAALAIGAPWIFRP MHLVEPLLCSCDVVLLMLTGI              
Subjt:  MAAERHASSRATSSEDNAMFLDILHEAPLFGHRKPARTVGSIIYCFV--GYAALAIGAPWIFRPIMHLVEPLLCSCDVVLLMLTGILITSRVMLITNYFL

Query:  TSKVLENFISAIRVASVLFVIAGLEISIQFILRNQAIEIDHLSTSLIVLTYQKKLNHCLLFQQYLVYQVQKIRLQGYYSFSQKLKHIVRLPFAVTAYGTA
                                                                    FQQYLVYQVQKIRLQGYYSFSQKLKHIVRLPFAVTAYGTA
Subjt:  TSKVLENFISAIRVASVLFVIAGLEISIQFILRNQAIEIDHLSTSLIVLTYQKKLNHCLLFQQYLVYQVQKIRLQGYYSFSQKLKHIVRLPFAVTAYGTA

Query:  ALLLVMVWEPQISALSIPIILRMIMLIEAVCAGSFMIIYISYVQKYNSLNSQPDVLKSLYSPLQQSSSLEDLRYHDVGRLSDQQMALLQYQRENLHFLNE
        ALLLVMVWEPQISALSIPIILRMIMLIEAVCAGSFMIIYISYVQKYNSLNSQPDVLKSLYSPLQQSSSLEDLRYHDVGRLSDQQMALLQYQRENLHFLNE
Subjt:  ALLLVMVWEPQISALSIPIILRMIMLIEAVCAGSFMIIYISYVQKYNSLNSQPDVLKSLYSPLQQSSSLEDLRYHDVGRLSDQQMALLQYQRENLHFLNE

Query:  EILRLQECLSKYERSSDGSTPQVDLAHMLAARDQELRTLSAEMNQVTSELRLARSVIAERDTEIQKLLTTNKQYVEENERLRAILGEWSTRAAKLERALE
        EILRLQECLSKYERSSDGSTPQVDLAHMLAARDQELRTLSAEMNQVTSELRLARSVIAERDTEIQKLLTTNKQYVEENERLRAILGEWSTRAAKLERALE
Subjt:  EILRLQECLSKYERSSDGSTPQVDLAHMLAARDQELRTLSAEMNQVTSELRLARSVIAERDTEIQKLLTTNKQYVEENERLRAILGEWSTRAAKLERALE

Query:  AERISNLELQKRIATLKKLSHASETSEQQGS
        AERISNLELQKRI+TLKKLSHASETSEQQGS
Subjt:  AERISNLELQKRIATLKKLSHASETSEQQGS

XP_023516893.1 uncharacterized protein LOC111780659 isoform X1 [Cucurbita pepo subsp. pepo]; e value: 2.10e-222; % identity: 82.37
Query:  MAAERHASSRATSSEDNAMFLDILHEAPLFGHRKPARTVGSIIYCFV--GYAALAIGAPWIFRPIMHLVEPLLCSCDVVLLMLTGILITSRVMLITNYFL
        MAAERHASSRATSSEDNAMFLDILHEAPLFGHRKPARTVGSIIYCFV  GYAALAIGAPWIFRPIMHLVEPLLCSCDVVLLMLTGI              
Subjt:  MAAERHASSRATSSEDNAMFLDILHEAPLFGHRKPARTVGSIIYCFV--GYAALAIGAPWIFRPIMHLVEPLLCSCDVVLLMLTGILITSRVMLITNYFL

Query:  TSKVLENFISAIRVASVLFVIAGLEISIQFILRNQAIEIDHLSTSLIVLTYQKKLNHCLLFQQYLVYQVQKIRLQGYYSFSQKLKHIVRLPFAVTAYGTA
                                                                    FQQYLVYQVQKIRLQGYYSFSQKLKHIVRLPFAVTAYGTA
Subjt:  TSKVLENFISAIRVASVLFVIAGLEISIQFILRNQAIEIDHLSTSLIVLTYQKKLNHCLLFQQYLVYQVQKIRLQGYYSFSQKLKHIVRLPFAVTAYGTA

Query:  ALLLVMVWEPQISALSIPIILRMIMLIEAVCAGSFMIIYISYVQKYNSLNSQPDVLKSLYSPLQQSSSLEDLRYHDVGRLSDQQMALLQYQRENLHFLNE
        ALLLVMVWEPQISALSIPIILRMIMLIEAVCAGSFMIIYISYVQKYNSLNSQPDVLKSLYSPLQQSSSLEDLRYHDVGRLSDQQMALLQYQRENLHFLNE
Subjt:  ALLLVMVWEPQISALSIPIILRMIMLIEAVCAGSFMIIYISYVQKYNSLNSQPDVLKSLYSPLQQSSSLEDLRYHDVGRLSDQQMALLQYQRENLHFLNE

Query:  EILRLQECLSKYERSSDGSTPQVDLAHMLAARDQELRTLSAEMNQVTSELRLARSVIAERDTEIQKLLTTNKQYVEENERLRAILGEWSTRAAKLERALE
        EILRLQECLSKYERSSDGSTPQVDLAHMLAARDQELRTLSAEMNQVTSELRLARSVIAERDTEIQKLLTTNKQYVEENERLRAILGEWSTRAAKLERALE
Subjt:  EILRLQECLSKYERSSDGSTPQVDLAHMLAARDQELRTLSAEMNQVTSELRLARSVIAERDTEIQKLLTTNKQYVEENERLRAILGEWSTRAAKLERALE

Query:  AERISNLELQKRIATLKKLSHASETSEQQGS
        AERISNLELQKRIATLKKLSHASETSEQQGS
Subjt:  AERISNLELQKRIATLKKLSHASETSEQQGS

XP_038879093.1 protein FIP1-like isoform X1 [Benincasa hispida]; e value: 2.43e-214; % identity: 79.81
Query:  MAAERHASSRATSSEDNAMFLDILHEAPLFGHRKPARTVGSIIYCF--VGYAALAIGAPWIFRPIMHLVEPLLCSCDVVLLMLTGILITSRVMLITNYFL
        MAAERHASSRATSSEDNAMFLDILHEAPLFGHRKPARTVGSIIYCF  VGYAALAIGAPWIF PI HLVEPLLCSCDVVLLMLTGI              
Subjt:  MAAERHASSRATSSEDNAMFLDILHEAPLFGHRKPARTVGSIIYCF--VGYAALAIGAPWIFRPIMHLVEPLLCSCDVVLLMLTGILITSRVMLITNYFL

Query:  TSKVLENFISAIRVASVLFVIAGLEISIQFILRNQAIEIDHLSTSLIVLTYQKKLNHCLLFQQYLVYQVQKIRLQGYYSFSQKLKHIVRLPFAVTAYGTA
                                                                    FQQYLVYQV KIRLQGYYSFSQKLKHIVRLPFAV AYGTA
Subjt:  TSKVLENFISAIRVASVLFVIAGLEISIQFILRNQAIEIDHLSTSLIVLTYQKKLNHCLLFQQYLVYQVQKIRLQGYYSFSQKLKHIVRLPFAVTAYGTA

Query:  ALLLVMVWEPQISALSIPIILRMIMLIEAVCAGSFMIIYISYVQKYNSLNSQPDVLKSLYSPLQQSSSLEDLRYHDVGRLSDQQMALLQYQRENLHFLNE
        ALLLVMVWEPQISALSIPIILR+IMLIEAVCAGSFMIIYISYVQKYNSLNSQPDVLKSLYSPLQQSSSLEDLRYHDVGRLSDQQMALLQYQRENLHFLNE
Subjt:  ALLLVMVWEPQISALSIPIILRMIMLIEAVCAGSFMIIYISYVQKYNSLNSQPDVLKSLYSPLQQSSSLEDLRYHDVGRLSDQQMALLQYQRENLHFLNE

Query:  EILRLQECLSKYERSSDGSTPQVDLAHMLAARDQELRTLSAEMNQVTSELRLARSVIAERDTEIQKLLTTNKQYVEENERLRAILGEWSTRAAKLERALE
        EILRLQECLSKYERSSDGSTPQVDLAHMLAARDQELRTLSAEMNQVTSELRLARSVIAERDTEIQKLLTTNKQYVEENERLRAILGEWSTRAAKLERALE
Subjt:  EILRLQECLSKYERSSDGSTPQVDLAHMLAARDQELRTLSAEMNQVTSELRLARSVIAERDTEIQKLLTTNKQYVEENERLRAILGEWSTRAAKLERALE

Query:  AERISNLELQKRIATLKKLSHASETSEQQGS
        AERISNLELQKRI+TLKK  H SETSE+ GS
Subjt:  AERISNLELQKRIATLKKLSHASETSEQQGS

TrEMBL top hits (e value, % identity, alignment)
A0A0A0M261 Uncharacterized protein; e value: 3.90e-213; % identity: 79.12
Query:  MAAERHASSRATSSEDNAMFLDILHEAPLFGHRKPARTVGSIIYCFV--GYAALAIGAPWIFRPIMHLVEPLLCSCDVVLLMLTGILITSRVMLITNYFL
        MAAERHASSRATSSEDNAMFLDILHEAPLFGHRKPARTVGSIIYCFV  GYAALAIGAPWIF PI H VEPLLCSC VVLLMLTGI              
Subjt:  MAAERHASSRATSSEDNAMFLDILHEAPLFGHRKPARTVGSIIYCFV--GYAALAIGAPWIFRPIMHLVEPLLCSCDVVLLMLTGILITSRVMLITNYFL

Query:  TSKVLENFISAIRVASVLFVIAGLEISIQFILRNQAIEIDHLSTSLIVLTYQKKLNHCLLFQQYLVYQVQKIRLQGYYSFSQKLKHIVRLPFAVTAYGTA
                                                                    FQQYLVYQV KIRLQGYYSFSQKLKHIVRLPFAVTAYGTA
Subjt:  TSKVLENFISAIRVASVLFVIAGLEISIQFILRNQAIEIDHLSTSLIVLTYQKKLNHCLLFQQYLVYQVQKIRLQGYYSFSQKLKHIVRLPFAVTAYGTA

Query:  ALLLVMVWEPQISALSIPIILRMIMLIEAVCAGSFMIIYISYVQKYNSLNSQPDVLKSLYSPLQQSSSLEDLRYHDVGRLSDQQMALLQYQRENLHFLNE
        ALLLVM WEPQISALSIPIILR+IMLIEAVCAGSFMIIYISYVQKYNSLNSQPDVLKSLYSPLQQSSSLEDLRYHDVGRLSDQQMALLQYQRENLHFLNE
Subjt:  ALLLVMVWEPQISALSIPIILRMIMLIEAVCAGSFMIIYISYVQKYNSLNSQPDVLKSLYSPLQQSSSLEDLRYHDVGRLSDQQMALLQYQRENLHFLNE

Query:  EILRLQECLSKYERSSDGSTPQVDLAHMLAARDQELRTLSAEMNQVTSELRLARSVIAERDTEIQKLLTTNKQYVEENERLRAILGEWSTRAAKLERALE
        EILRLQECLSKYERSSDGSTPQVDLAHMLAARDQELRTLSAEMNQVTSELRLARSVIAERDTEIQKLLTTNKQYVEENERLRAILGEWSTRAAKLERALE
Subjt:  EILRLQECLSKYERSSDGSTPQVDLAHMLAARDQELRTLSAEMNQVTSELRLARSVIAERDTEIQKLLTTNKQYVEENERLRAILGEWSTRAAKLERALE

Query:  AERISNLELQKRIATLKKLSHASETSEQQGS
        AER+SN+ELQK+I+TLKK  HASETSE QGS
Subjt:  AERISNLELQKRIATLKKLSHASETSEQQGS

A0A1S3BRE8 uncharacterized protein LOC103492369; e value: 1.36e-213; % identity: 79.58
Query:  MAAERHASSRATSSEDNAMFLDILHEAPLFGHRKPARTVGSIIYCFV--GYAALAIGAPWIFRPIMHLVEPLLCSCDVVLLMLTGILITSRVMLITNYFL
        MAAERHASSRATSSEDNAMFLDILHEAPLFGHRKPARTVGSIIYCFV  GYAALAIGAPWIF PI HLVEPLLCSC VVLLMLTGI              
Subjt:  MAAERHASSRATSSEDNAMFLDILHEAPLFGHRKPARTVGSIIYCFV--GYAALAIGAPWIFRPIMHLVEPLLCSCDVVLLMLTGILITSRVMLITNYFL

Query:  TSKVLENFISAIRVASVLFVIAGLEISIQFILRNQAIEIDHLSTSLIVLTYQKKLNHCLLFQQYLVYQVQKIRLQGYYSFSQKLKHIVRLPFAVTAYGTA
                                                                    FQQYLVYQV KIRLQGYYSFSQKLKHIVRLPFAVTAYGTA
Subjt:  TSKVLENFISAIRVASVLFVIAGLEISIQFILRNQAIEIDHLSTSLIVLTYQKKLNHCLLFQQYLVYQVQKIRLQGYYSFSQKLKHIVRLPFAVTAYGTA

Query:  ALLLVMVWEPQISALSIPIILRMIMLIEAVCAGSFMIIYISYVQKYNSLNSQPDVLKSLYSPLQQSSSLEDLRYHDVGRLSDQQMALLQYQRENLHFLNE
        ALLLVMVWEPQISALSIPIILR+IMLIEAVCAGSFMIIYISYVQKYNSLNSQPDVLKSLYSPLQQSSSLEDLRYHDVGRLSDQQMALLQYQRENLHFLNE
Subjt:  ALLLVMVWEPQISALSIPIILRMIMLIEAVCAGSFMIIYISYVQKYNSLNSQPDVLKSLYSPLQQSSSLEDLRYHDVGRLSDQQMALLQYQRENLHFLNE

Query:  EILRLQECLSKYERSSDGSTPQVDLAHMLAARDQELRTLSAEMNQVTSELRLARSVIAERDTEIQKLLTTNKQYVEENERLRAILGEWSTRAAKLERALE
        EILRLQECLSKYERSSDGSTPQVDLAHMLAARDQELRTLSAEMNQVTSELRLARSVIAERDTEIQKLLTTNKQYVEENERLRAILGEWSTRAAKLERALE
Subjt:  EILRLQECLSKYERSSDGSTPQVDLAHMLAARDQELRTLSAEMNQVTSELRLARSVIAERDTEIQKLLTTNKQYVEENERLRAILGEWSTRAAKLERALE

Query:  AERISNLELQKRIATLKKLSHASETSEQQGS
        AER+SN+ELQKRI+TLKK  HASETSE Q S
Subjt:  AERISNLELQKRIATLKKLSHASETSEQQGS

A0A5A7UKR5 Uncharacterized protein; e value: 1.12e-212; % identity: 79.35
Query:  MAAERHASSRATSSEDNAMFLDILHEAPLFGHRKPARTVGSIIYCFV--GYAALAIGAPWIFRPIMHLVEPLLCSCDVVLLMLTGILITSRVMLITNYFL
        MAAERHASSRATSSEDNAMFLDILHEAPLFGHRKPARTVGSIIYCFV   YAALAIGAPWIF PI HLVEPLLCSC VVLLMLTGI              
Subjt:  MAAERHASSRATSSEDNAMFLDILHEAPLFGHRKPARTVGSIIYCFV--GYAALAIGAPWIFRPIMHLVEPLLCSCDVVLLMLTGILITSRVMLITNYFL

Query:  TSKVLENFISAIRVASVLFVIAGLEISIQFILRNQAIEIDHLSTSLIVLTYQKKLNHCLLFQQYLVYQVQKIRLQGYYSFSQKLKHIVRLPFAVTAYGTA
                                                                    FQQYLVYQV KIRLQGYYSFSQKLKHIVRLPFAVTAYGTA
Subjt:  TSKVLENFISAIRVASVLFVIAGLEISIQFILRNQAIEIDHLSTSLIVLTYQKKLNHCLLFQQYLVYQVQKIRLQGYYSFSQKLKHIVRLPFAVTAYGTA

Query:  ALLLVMVWEPQISALSIPIILRMIMLIEAVCAGSFMIIYISYVQKYNSLNSQPDVLKSLYSPLQQSSSLEDLRYHDVGRLSDQQMALLQYQRENLHFLNE
        ALLLVMVWEPQISALSIPIILR+IMLIEAVCAGSFMIIYISYVQKYNSLNSQPDVLKSLYSPLQQSSSLEDLRYHDVGRLSDQQMALLQYQRENLHFLNE
Subjt:  ALLLVMVWEPQISALSIPIILRMIMLIEAVCAGSFMIIYISYVQKYNSLNSQPDVLKSLYSPLQQSSSLEDLRYHDVGRLSDQQMALLQYQRENLHFLNE

Query:  EILRLQECLSKYERSSDGSTPQVDLAHMLAARDQELRTLSAEMNQVTSELRLARSVIAERDTEIQKLLTTNKQYVEENERLRAILGEWSTRAAKLERALE
        EILRLQECLSKYERSSDGSTPQVDLAHMLAARDQELRTLSAEMNQVTSELRLARSVIAERDTEIQKLLTTNKQYVEENERLRAILGEWSTRAAKLERALE
Subjt:  EILRLQECLSKYERSSDGSTPQVDLAHMLAARDQELRTLSAEMNQVTSELRLARSVIAERDTEIQKLLTTNKQYVEENERLRAILGEWSTRAAKLERALE

Query:  AERISNLELQKRIATLKKLSHASETSEQQGS
        AER+SN+ELQKRI+TLKK  HASETSE Q S
Subjt:  AERISNLELQKRIATLKKLSHASETSEQQGS

A0A6J1HBP9 uncharacterized protein LOC111461818 isoform X1; e value: 4.80e-221; % identity: 81.9
Query:  MAAERHASSRATSSEDNAMFLDILHEAPLFGHRKPARTVGSIIYCFV--GYAALAIGAPWIFRPIMHLVEPLLCSCDVVLLMLTGILITSRVMLITNYFL
        MAAERHASSRATSSEDNAMFLDILHEAPLFGHRKPARTVGSIIYCFV  GYAALAIGAPWIF PIMHLVEPLLCSCDVVLLMLTGI              
Subjt:  MAAERHASSRATSSEDNAMFLDILHEAPLFGHRKPARTVGSIIYCFV--GYAALAIGAPWIFRPIMHLVEPLLCSCDVVLLMLTGILITSRVMLITNYFL

Query:  TSKVLENFISAIRVASVLFVIAGLEISIQFILRNQAIEIDHLSTSLIVLTYQKKLNHCLLFQQYLVYQVQKIRLQGYYSFSQKLKHIVRLPFAVTAYGTA
                                                                    FQQYLVYQVQKIRLQGYYSFSQKLKHIVRLPFAVTAYGTA
Subjt:  TSKVLENFISAIRVASVLFVIAGLEISIQFILRNQAIEIDHLSTSLIVLTYQKKLNHCLLFQQYLVYQVQKIRLQGYYSFSQKLKHIVRLPFAVTAYGTA

Query:  ALLLVMVWEPQISALSIPIILRMIMLIEAVCAGSFMIIYISYVQKYNSLNSQPDVLKSLYSPLQQSSSLEDLRYHDVGRLSDQQMALLQYQRENLHFLNE
        ALLLVMVWEPQISALSIPIILRMIMLIEAVCAGSFMIIYISYVQKYNSLNSQPDVLKSLYSPLQQSSSLEDLRYHDVGRLSDQQMALLQYQRENLHFLNE
Subjt:  ALLLVMVWEPQISALSIPIILRMIMLIEAVCAGSFMIIYISYVQKYNSLNSQPDVLKSLYSPLQQSSSLEDLRYHDVGRLSDQQMALLQYQRENLHFLNE

Query:  EILRLQECLSKYERSSDGSTPQVDLAHMLAARDQELRTLSAEMNQVTSELRLARSVIAERDTEIQKLLTTNKQYVEENERLRAILGEWSTRAAKLERALE
        EILRLQECLSKYERSSDGSTPQVDLAHMLAARDQELRTLSAEMNQVTSELRLARSVIAERDTEIQKLLTTNKQYVEENERLRAILGEWSTRAAKLERALE
Subjt:  EILRLQECLSKYERSSDGSTPQVDLAHMLAARDQELRTLSAEMNQVTSELRLARSVIAERDTEIQKLLTTNKQYVEENERLRAILGEWSTRAAKLERALE

Query:  AERISNLELQKRIATLKKLSHASETSEQQGS
        AERISNLELQKRI+TLKKLSHASETSEQQGS
Subjt:  AERISNLELQKRIATLKKLSHASETSEQQGS

A0A6J1JJ60 uncharacterized protein LOC111485584 isoform X1; e value: 3.38e-221; % identity: 81.9
Query:  MAAERHASSRATSSEDNAMFLDILHEAPLFGHRKPARTVGSIIYCFV--GYAALAIGAPWIFRPIMHLVEPLLCSCDVVLLMLTGILITSRVMLITNYFL
        MAAERHASSRATSSEDNAMFLDILHEAPLFGHRKPARTVGSIIYCFV  GYAALAIGAPWIFRP MHLVEPLLCSCDVVLLMLTGI              
Subjt:  MAAERHASSRATSSEDNAMFLDILHEAPLFGHRKPARTVGSIIYCFV--GYAALAIGAPWIFRPIMHLVEPLLCSCDVVLLMLTGILITSRVMLITNYFL

Query:  TSKVLENFISAIRVASVLFVIAGLEISIQFILRNQAIEIDHLSTSLIVLTYQKKLNHCLLFQQYLVYQVQKIRLQGYYSFSQKLKHIVRLPFAVTAYGTA
                                                                    FQQYLVYQVQKIRLQGYYSFSQKLKHIVRLPFAVTAYGTA
Subjt:  TSKVLENFISAIRVASVLFVIAGLEISIQFILRNQAIEIDHLSTSLIVLTYQKKLNHCLLFQQYLVYQVQKIRLQGYYSFSQKLKHIVRLPFAVTAYGTA

Query:  ALLLVMVWEPQISALSIPIILRMIMLIEAVCAGSFMIIYISYVQKYNSLNSQPDVLKSLYSPLQQSSSLEDLRYHDVGRLSDQQMALLQYQRENLHFLNE
        ALLLVMVWEPQISALSIPIILRMIMLIEAVCAGSFMIIYISYVQKYNSLNSQPDVLKSLYSPLQQSSSLEDLRYHDVGRLSDQQMALLQYQRENLHFLNE
Subjt:  ALLLVMVWEPQISALSIPIILRMIMLIEAVCAGSFMIIYISYVQKYNSLNSQPDVLKSLYSPLQQSSSLEDLRYHDVGRLSDQQMALLQYQRENLHFLNE

Query:  EILRLQECLSKYERSSDGSTPQVDLAHMLAARDQELRTLSAEMNQVTSELRLARSVIAERDTEIQKLLTTNKQYVEENERLRAILGEWSTRAAKLERALE
        EILRLQECLSKYERSSDGSTPQVDLAHMLAARDQELRTLSAEMNQVTSELRLARSVIAERDTEIQKLLTTNKQYVEENERLRAILGEWSTRAAKLERALE
Subjt:  EILRLQECLSKYERSSDGSTPQVDLAHMLAARDQELRTLSAEMNQVTSELRLARSVIAERDTEIQKLLTTNKQYVEENERLRAILGEWSTRAAKLERALE

Query:  AERISNLELQKRIATLKKLSHASETSEQQGS
        AERISNLELQKRI+TLKKLSHASETSEQQGS
Subjt:  AERISNLELQKRIATLKKLSHASETSEQQGS

SwissProt top hits (e value, % identity, alignment)
Q8S8K9 Protein FIP1; e value: 5.2e-119; % identity: 58.28
Query:  MAAERHASSRATSSEDNAMFLDILHEAPLFGHRKPARTVGSIIYCFV--GYAALAIGAPWIFRPIMHLVEPLLCSCDVVLLMLTGILITSRVMLITNYFL
        M+ ER ASS   ++E+NAMFLDILHEAPLFGHRK    VGS +Y  +  GYA LA GAPW+F  +  L   LLC CDV LL++TG+              
Subjt:  MAAERHASSRATSSEDNAMFLDILHEAPLFGHRKPARTVGSIIYCFV--GYAALAIGAPWIFRPIMHLVEPLLCSCDVVLLMLTGILITSRVMLITNYFL

Query:  TSKVLENFISAIRVASVLFVIAGLEISIQFILRNQAIEIDHLSTSLIVLTYQKKLNHCLLFQQYLVYQVQKIRLQGYYSFSQKLKHIVRLPFAVTAYGTA
                                                                    FQQY VYQVQKIRLQGYYSFSQKLKH+VRLPFA+ AYGTA
Subjt:  TSKVLENFISAIRVASVLFVIAGLEISIQFILRNQAIEIDHLSTSLIVLTYQKKLNHCLLFQQYLVYQVQKIRLQGYYSFSQKLKHIVRLPFAVTAYGTA

Query:  ALLLVMVWEPQISALSIPIILRMIMLIEAVCAGSFMIIYISYVQKYNSLNSQPDVLKSLYSPLQQSSSLEDLRYHDVGRLSDQQMALLQYQRENLHFLNE
        A+LLV+VW PQI  LSI  + R+IML+EAV AG FM +YI YV +YNS+NS+PDVLKSLYSPLQ SSS+E LRY++ GRLSDQQ ALLQYQRENLHFL+E
Subjt:  ALLLVMVWEPQISALSIPIILRMIMLIEAVCAGSFMIIYISYVQKYNSLNSQPDVLKSLYSPLQQSSSLEDLRYHDVGRLSDQQMALLQYQRENLHFLNE

Query:  EILRLQECLSKYERSSDGSTPQVDLAHMLAARDQELRTLSAEMNQVTSELRLARSVIAERDTEIQKLLTTNKQYVEENERLRAILGEWSTRAAKLERALE
        EIL LQE LSKYE+S DGSTPQVDLAH+LAARDQELRTLSAEMNQ+ SELRLARS+IAERD E+Q++ +TN QY+EENERLRAIL EWS RAA LERALE
Subjt:  EILRLQECLSKYERSSDGSTPQVDLAHMLAARDQELRTLSAEMNQVTSELRLARSVIAERDTEIQKLLTTNKQYVEENERLRAILGEWSTRAAKLERALE

Query:  AERISNLELQKRIATLKKLSHASETSEQQ
         ER+SN ELQK +A+ ++      T+ +Q
Subjt:  AERISNLELQKRIATLKKLSHASETSEQQ

Q9CXT7 Transmembrane protein 192; e value: 8.5e-05; % identity: 26.14
Query:  LLFQQYLVYQVQKIRLQGYYSFSQKLKHIVRLPFAVTAYGTAALLLVMVWE---PQISALSIPIILRMIMLIEAVCAGSFMIIYISYVQKYNSLNSQPDV
        LLF++Y+ Y  +K+R +GY    +  +H+  L   + + G  ALLL++  +   P+ S L + +IL  ++ +E +C+ S +I+YI  ++++N     PDV
Subjt:  LLFQQYLVYQVQKIRLQGYYSFSQKLKHIVRLPFAVTAYGTAALLLVMVWE---PQISALSIPIILRMIMLIEAVCAGSFMIIYISYVQKYNSLNSQPDV

Query:  LKSLYSPLQQSSSLEDLRYHDVGRLS---DQQMALLQYQRENLHFLNEEILRL
        L+        S++  +  +  V  L    ++Q  ++ Y + +   L++ +L L
Subjt:  LKSLYSPLQQSSSLEDLRYHDVGRLS---DQQMALLQYQRENLHFLNEEILRL

Arabidopsis top hits (e value, % identity, alignment)
AT2G06005.1 FRIGIDA interacting protein 1; e value: 3.7e-120; % identity: 58.28
Query:  MAAERHASSRATSSEDNAMFLDILHEAPLFGHRKPARTVGSIIYCFV--GYAALAIGAPWIFRPIMHLVEPLLCSCDVVLLMLTGILITSRVMLITNYFL
        M+ ER ASS   ++E+NAMFLDILHEAPLFGHRK    VGS +Y  +  GYA LA GAPW+F  +  L   LLC CDV LL++TG+              
Subjt:  MAAERHASSRATSSEDNAMFLDILHEAPLFGHRKPARTVGSIIYCFV--GYAALAIGAPWIFRPIMHLVEPLLCSCDVVLLMLTGILITSRVMLITNYFL

Query:  TSKVLENFISAIRVASVLFVIAGLEISIQFILRNQAIEIDHLSTSLIVLTYQKKLNHCLLFQQYLVYQVQKIRLQGYYSFSQKLKHIVRLPFAVTAYGTA
                                                                    FQQY VYQVQKIRLQGYYSFSQKLKH+VRLPFA+ AYGTA
Subjt:  TSKVLENFISAIRVASVLFVIAGLEISIQFILRNQAIEIDHLSTSLIVLTYQKKLNHCLLFQQYLVYQVQKIRLQGYYSFSQKLKHIVRLPFAVTAYGTA

Query:  ALLLVMVWEPQISALSIPIILRMIMLIEAVCAGSFMIIYISYVQKYNSLNSQPDVLKSLYSPLQQSSSLEDLRYHDVGRLSDQQMALLQYQRENLHFLNE
        A+LLV+VW PQI  LSI  + R+IML+EAV AG FM +YI YV +YNS+NS+PDVLKSLYSPLQ SSS+E LRY++ GRLSDQQ ALLQYQRENLHFL+E
Subjt:  ALLLVMVWEPQISALSIPIILRMIMLIEAVCAGSFMIIYISYVQKYNSLNSQPDVLKSLYSPLQQSSSLEDLRYHDVGRLSDQQMALLQYQRENLHFLNE

Query:  EILRLQECLSKYERSSDGSTPQVDLAHMLAARDQELRTLSAEMNQVTSELRLARSVIAERDTEIQKLLTTNKQYVEENERLRAILGEWSTRAAKLERALE
        EIL LQE LSKYE+S DGSTPQVDLAH+LAARDQELRTLSAEMNQ+ SELRLARS+IAERD E+Q++ +TN QY+EENERLRAIL EWS RAA LERALE
Subjt:  EILRLQECLSKYERSSDGSTPQVDLAHMLAARDQELRTLSAEMNQVTSELRLARSVIAERDTEIQKLLTTNKQYVEENERLRAILGEWSTRAAKLERALE

Query:  AERISNLELQKRIATLKKLSHASETSEQQ
         ER+SN ELQK +A+ ++      T+ +Q
Subjt:  AERISNLELQKRIATLKKLSHASETSEQQ

AT2G06005.2 FRIGIDA interacting protein 1; e value: 8.5e-93; % identity: 71.11
Query:  LFQQYLVYQVQKIRLQGYYSFSQKLKHIVRLPFAVTAYGTAALLLVMVWEPQISALSIPIILRMIMLIEAVCAGSFMIIYISYVQKYNSLNSQPDVLKSL
        +FQQY VYQVQKIRLQGYYSFSQKLKH+VRLPFA+ AYGTAA+LLV+VW PQI  LSI  + R+IML+EAV AG FM +YI             DVLKSL
Subjt:  LFQQYLVYQVQKIRLQGYYSFSQKLKHIVRLPFAVTAYGTAALLLVMVWEPQISALSIPIILRMIMLIEAVCAGSFMIIYISYVQKYNSLNSQPDVLKSL

Query:  YSPLQQSSSLEDLRYHDVGRLSDQQMALLQYQRENLHFLNEEILRLQECLSKYERSSDGSTPQVDLAHMLAARDQELRTLSAEMNQVTSELRLARSVIAE
        YSPLQ SSS+E LRY++ GRLSDQQ ALLQYQRENLHFL+EEIL LQE LSKYE+S DGSTPQVDLAH+LAARDQELRTLSAEMNQ+ SELRLARS+IAE
Subjt:  YSPLQQSSSLEDLRYHDVGRLSDQQMALLQYQRENLHFLNEEILRLQECLSKYERSSDGSTPQVDLAHMLAARDQELRTLSAEMNQVTSELRLARSVIAE

Query:  RDTEIQKLLTTNKQYVEENERLRAILGEWSTRAAKLERALEAERISNLELQKRIATLKKLSHASETSEQQ
        RD E+Q++ +TN QY+EENERLRAIL EWS RAA LERALE ER+SN ELQK +A+ ++      T+ +Q
Subjt:  RDTEIQKLLTTNKQYVEENERLRAILGEWSTRAAKLERALEAERISNLELQKRIATLKKLSHASETSEQQ

AT5G20580.1 BEST Arabidopsis thaliana protein match is: FRIGIDA interacting protein 1 (TAIR:AT2G06005.1); e value: 3.1e-111; % identity: 56.7
Query:  MAAERHASSRATSSE-DNAMFLDILHEAPLFGHRKPARTVGSIIYCFV--GYAALAIGAPWIFRPIMHLVEPLLCSCDVVLLMLTGILITSRVMLITNYF
        MA +R ASS   S+E DNAMFLDILHEAPLFGHR+    VGS IY  +   YA LA GAPWI + + +L+  LLCSC+V LLMLTG              
Subjt:  MAAERHASSRATSSE-DNAMFLDILHEAPLFGHRKPARTVGSIIYCFV--GYAALAIGAPWIFRPIMHLVEPLLCSCDVVLLMLTGILITSRVMLITNYF

Query:  LTSKVLENFISAIRVASVLFVIAGLEISIQFILRNQAIEIDHLSTSLIVLTYQKKLNHCLLFQQYLVYQVQKIRLQGYYSFSQKLKHIVRLPFAVTAYGT
                                                                    +FQQY V QVQKIRLQGYYSFSQKLKH+VRLPFA+ AYGT
Subjt:  LTSKVLENFISAIRVASVLFVIAGLEISIQFILRNQAIEIDHLSTSLIVLTYQKKLNHCLLFQQYLVYQVQKIRLQGYYSFSQKLKHIVRLPFAVTAYGT

Query:  AALLLVMVWEPQISALSIPIILRMIMLIEAVCAGSFMIIYISYVQKYNSLNSQPDVLKSLYSPLQQSSSLEDLRYHDVGRLSDQQMALLQYQRENLHFLN
        A++LL M W P +S L I  + R IM +EA+ A SFMI+++ YV++YNS+NSQPDVL SLYSPL Q ++LE LRYH+ GRLSDQQMALLQYQRENLH+L+
Subjt:  AALLLVMVWEPQISALSIPIILRMIMLIEAVCAGSFMIIYISYVQKYNSLNSQPDVLKSLYSPLQQSSSLEDLRYHDVGRLSDQQMALLQYQRENLHFLN

Query:  EEILRLQECLSKYERSSDGSTPQVDLAHMLAARDQELRTLSAEMNQVTSELRLARSVIAERDTEIQKLLTTNKQYVEENERLRAILGEWSTRAAKLERAL
        EEILRLQE LSKYE ++  STPQVDLAH++A RDQELRTLSAE++Q+ SEL LARS+I+ERD EIQ +  TN QYV ENERLRAILGEWS RAAKLERAL
Subjt:  EEILRLQECLSKYERSSDGSTPQVDLAHMLAARDQELRTLSAEMNQVTSELRLARSVIAERDTEIQKLLTTNKQYVEENERLRAILGEWSTRAAKLERAL

Query:  EAERISNLELQKRIATLK
        E ERISN+EL+K+++ L+
Subjt:  EAERISNLELQKRIATLK

AT5G20580.2 BEST Arabidopsis thaliana protein match is: FRIGIDA interacting protein 1 (TAIR:AT2G06005.1); e value: 2.8e-112; % identity: 56.94
Query:  MAAERHASSRATSSE-DNAMFLDILHEAPLFGHRKPARTVGSIIYCFV--GYAALAIGAPWIFRPIMHLVEPLLCSCDVVLLMLTGILITSRVMLITNYF
        MA +R ASS   S+E DNAMFLDILHEAPLFGHR+    VGS IY  +  GYA LA GAPWI + + +L+  LLCSC+V LLMLTG              
Subjt:  MAAERHASSRATSSE-DNAMFLDILHEAPLFGHRKPARTVGSIIYCFV--GYAALAIGAPWIFRPIMHLVEPLLCSCDVVLLMLTGILITSRVMLITNYF

Query:  LTSKVLENFISAIRVASVLFVIAGLEISIQFILRNQAIEIDHLSTSLIVLTYQKKLNHCLLFQQYLVYQVQKIRLQGYYSFSQKLKHIVRLPFAVTAYGT
                                                                    +FQQY V QVQKIRLQGYYSFSQKLKH+VRLPFA+ AYGT
Subjt:  LTSKVLENFISAIRVASVLFVIAGLEISIQFILRNQAIEIDHLSTSLIVLTYQKKLNHCLLFQQYLVYQVQKIRLQGYYSFSQKLKHIVRLPFAVTAYGT

Query:  AALLLVMVWEPQISALSIPIILRMIMLIEAVCAGSFMIIYISYVQKYNSLNSQPDVLKSLYSPLQQSSSLEDLRYHDVGRLSDQQMALLQYQRENLHFLN
        A++LL M W P +S L I  + R IM +EA+ A SFMI+++ YV++YNS+NSQPDVL SLYSPL Q ++LE LRYH+ GRLSDQQMALLQYQRENLH+L+
Subjt:  AALLLVMVWEPQISALSIPIILRMIMLIEAVCAGSFMIIYISYVQKYNSLNSQPDVLKSLYSPLQQSSSLEDLRYHDVGRLSDQQMALLQYQRENLHFLN

Query:  EEILRLQECLSKYERSSDGSTPQVDLAHMLAARDQELRTLSAEMNQVTSELRLARSVIAERDTEIQKLLTTNKQYVEENERLRAILGEWSTRAAKLERAL
        EEILRLQE LSKYE ++  STPQVDLAH++A RDQELRTLSAE++Q+ SEL LARS+I+ERD EIQ +  TN QYV ENERLRAILGEWS RAAKLERAL
Subjt:  EEILRLQECLSKYERSSDGSTPQVDLAHMLAARDQELRTLSAEMNQVTSELRLARSVIAERDTEIQKLLTTNKQYVEENERLRAILGEWSTRAAKLERAL

Query:  EAERISNLELQKRIATLK
        E ERISN+EL+K+++ L+
Subjt:  EAERISNLELQKRIATLK
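In the alignment blocks above, the unlabeled middle line appears to follow the usual BLAST convention: a letter marks an identical residue, `+` a conservative substitution, and a space a mismatch or gap. As a rough sketch under that assumption, the identity of a single block could be estimated from its midline. `midline_identity` is a hypothetical helper, and the result is a per-block figure, not the whole-alignment % identity reported for each hit.

```python
def midline_identity(query: str, midline: str) -> float:
    """Percent of alignment columns whose midline shows an identical residue.

    The query line sets the column count, because trailing spaces on the
    midline may have been lost when the page was copied.
    """
    columns = len(query)
    if columns == 0:
        raise ValueError("empty alignment block")
    identical = sum(1 for ch in midline[:columns] if ch.isalpha())
    return 100.0 * identical / columns


# Final alignment block of the first GenBank hit (KAG6590036.1).
query = "AERISNLELQKRIATLKKLSHASETSEQQGS"
midline = "AERISNLELQKRI+TLKKLSHASETSEQQGS"
print(round(midline_identity(query, midline), 2))
```

Summing identical columns and total columns across all blocks of one hit, rather than averaging per-block percentages, would be the natural way to extend this to a full alignment.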


Sequences
CDS sequence
ATGTTATTACTCATCATTCAAAACTCGTATGCTGAAATTAGTTGCAGAAGAATCAGTCTCTCTCTCTCCGGAGAAATCACCGATCGTCGGAATTGGAAAAGCCGAACTGA
CGGTTTCGAAAGCCTTCAGCACCGCAATTCAGCCTCCAAATTCTTGTTTCATCTCCGTTTAAGCCTCTACATTCCAAACATGGCAGCGGAGAGGCACGCCTCTTCGCGCG
CAACATCATCTGAAGACAACGCGATGTTTCTTGATATACTGCATGAGGCCCCGTTATTTGGTCATCGGAAGCCTGCACGAACAGTTGGGAGCATAATTTATTGTTTCGTA
GGCTATGCTGCTCTGGCTATTGGAGCGCCATGGATTTTTCGTCCTATAATGCACTTGGTTGAACCATTGCTCTGCAGTTGCGATGTTGTTCTCTTGATGCTCACAGGTAT
TCTAATAACGTCCAGAGTTATGCTTATTACCAACTACTTTCTGACCAGCAAGGTACTGGAGAATTTTATTTCTGCTATTCGTGTAGCCAGTGTACTTTTTGTCATTGCAG
GATTAGAAATTTCAATTCAGTTCATTCTTAGGAATCAAGCTATTGAAATCGATCATTTAAGCACATCTTTAATCGTCTTGACATATCAAAAGAAGTTGAACCACTGTTTA
TTATTTCAGCAATATCTAGTATACCAAGTCCAGAAAATTCGATTGCAGGGTTATTATAGCTTTAGCCAGAAGTTAAAGCATATTGTTCGTCTACCTTTTGCAGTTACTGC
ATATGGAACTGCTGCCCTTTTACTTGTCATGGTATGGGAACCTCAGATCAGTGCACTCTCGATCCCCATAATTTTAAGGATGATTATGTTAATTGAAGCGGTATGTGCTG
GATCATTTATGATTATATATATCAGTTATGTTCAAAAGTACAATTCATTAAATTCTCAGCCTGATGTTTTGAAGTCATTGTATTCTCCACTTCAGCAATCAAGTTCTTTG
GAAGATCTAAGGTATCATGATGTTGGTCGACTTTCTGATCAGCAAATGGCTTTGTTGCAATATCAGCGAGAGAACCTTCATTTTCTGAATGAGGAGATTCTTCGGTTGCA
AGAGTGTTTGAGTAAATATGAACGGTCCAGTGATGGAAGCACGCCTCAGGTTGATCTTGCCCATATGCTAGCTGCTCGTGATCAGGAATTGAGGACACTTTCAGCTGAGA
TGAATCAGGTGACATCAGAACTTAGGCTTGCTCGATCTGTGATAGCCGAGAGGGATACCGAGATTCAGAAATTACTCACCACCAACAAGCAGTATGTAGAAGAAAATGAA
AGACTGAGAGCTATTCTAGGAGAATGGAGTACACGGGCTGCAAAGCTCGAGAGGGCGCTTGAAGCCGAGCGTATATCAAATCTTGAATTGCAAAAGAGGATTGCAACACT
AAAAAAGCTATCACATGCATCTGAAACATCAGAGCAGCAGGGGAGTTGA
mRNA sequence
AAATTAAGTTAAAGCGATTATATATAATTAAGTTATGGAATTCAAAAAAAAAAAAAAAAAAAATGTTATTACTCATCATTCAAAACTCGTATGCTGAAATTAGTTGCAGA
AGAATCAGTCTCTCTCTCTCCGGAGAAATCACCGATCGTCGGAATTGGAAAAGCCGAACTGACGGTTTCGAAAGCCTTCAGCACCGCAATTCAGCCTCCAAATTCTTGTT
TCATCTCCGTTTAAGCCTCTACATTCCAAACATGGCAGCGGAGAGGCACGCCTCTTCGCGCGCAACATCATCTGAAGACAACGCGATGTTTCTTGATATACTGCATGAGG
CCCCGTTATTTGGTCATCGGAAGCCTGCACGAACAGTTGGGAGCATAATTTATTGTTTCGTAGGCTATGCTGCTCTGGCTATTGGAGCGCCATGGATTTTTCGTCCTATA
ATGCACTTGGTTGAACCATTGCTCTGCAGTTGCGATGTTGTTCTCTTGATGCTCACAGGTATTCTAATAACGTCCAGAGTTATGCTTATTACCAACTACTTTCTGACCAG
CAAGGTACTGGAGAATTTTATTTCTGCTATTCGTGTAGCCAGTGTACTTTTTGTCATTGCAGGATTAGAAATTTCAATTCAGTTCATTCTTAGGAATCAAGCTATTGAAA
TCGATCATTTAAGCACATCTTTAATCGTCTTGACATATCAAAAGAAGTTGAACCACTGTTTATTATTTCAGCAATATCTAGTATACCAAGTCCAGAAAATTCGATTGCAG
GGTTATTATAGCTTTAGCCAGAAGTTAAAGCATATTGTTCGTCTACCTTTTGCAGTTACTGCATATGGAACTGCTGCCCTTTTACTTGTCATGGTATGGGAACCTCAGAT
CAGTGCACTCTCGATCCCCATAATTTTAAGGATGATTATGTTAATTGAAGCGGTATGTGCTGGATCATTTATGATTATATATATCAGTTATGTTCAAAAGTACAATTCAT
TAAATTCTCAGCCTGATGTTTTGAAGTCATTGTATTCTCCACTTCAGCAATCAAGTTCTTTGGAAGATCTAAGGTATCATGATGTTGGTCGACTTTCTGATCAGCAAATG
GCTTTGTTGCAATATCAGCGAGAGAACCTTCATTTTCTGAATGAGGAGATTCTTCGGTTGCAAGAGTGTTTGAGTAAATATGAACGGTCCAGTGATGGAAGCACGCCTCA
GGTTGATCTTGCCCATATGCTAGCTGCTCGTGATCAGGAATTGAGGACACTTTCAGCTGAGATGAATCAGGTGACATCAGAACTTAGGCTTGCTCGATCTGTGATAGCCG
AGAGGGATACCGAGATTCAGAAATTACTCACCACCAACAAGCAGTATGTAGAAGAAAATGAAAGACTGAGAGCTATTCTAGGAGAATGGAGTACACGGGCTGCAAAGCTC
GAGAGGGCGCTTGAAGCCGAGCGTATATCAAATCTTGAATTGCAAAAGAGGATTGCAACACTAAAAAAGCTATCACATGCATCTGAAACATCAGAGCAGCAGGGGAGTTG
AGTTGATGCCTGTAGTAAACCATACATAAAGCATCATCAGTTTAATCTCAAGGTTGGAACGGAAGTGCCGCCCTCAGCATAGGCTACATTTATATTCTCTTGAATATGAG
CAGCCGCAGATGAGACCCGTTTTCTCTCGTGCATTCTGGCCTCCAGTATCGACCAAATACCGGCAGTAGCTGTGTAAACAAATGAAGGTTCGTTTCTTTTTTTCCTTTTT
GATCTTGGGACTAAAATTTGAATTTCTTTGATTGTACATCTACTTTTTTTCTTTTGAGGTTCTCTTTTCTTTTAATGCATTTTGATTTTAGTGCACTTCGGAGGGAGATG
ATAATACACTCCTAAATGACACCAAAAATTTCATTAGGGGCCTCATCTTTGAGGCATGGTGCATTTAGGTCTCTGAAGTCGACACACACTCATAGTTGTCTGTTTTCTTG
CGTACTAGCACAATGAGAACCAACATTTGCGATTCTCATATGATAGAGGGGTTGAAGTACACATAAATGGTCGTCAATTTTTATCAAAATATGCATTAGCTCTTGGTTTG
ATTACCATC
Protein sequence
MLLLIIQNSYAEISCRRISLSLSGEITDRRNWKSRTDGFESLQHRNSASKFLFHLRLSLYIPNMAAERHASSRATSSEDNAMFLDILHEAPLFGHRKPARTVGSIIYCFV
GYAALAIGAPWIFRPIMHLVEPLLCSCDVVLLMLTGILITSRVMLITNYFLTSKVLENFISAIRVASVLFVIAGLEISIQFILRNQAIEIDHLSTSLIVLTYQKKLNHCL
LFQQYLVYQVQKIRLQGYYSFSQKLKHIVRLPFAVTAYGTAALLLVMVWEPQISALSIPIILRMIMLIEAVCAGSFMIIYISYVQKYNSLNSQPDVLKSLYSPLQQSSSL
EDLRYHDVGRLSDQQMALLQYQRENLHFLNEEILRLQECLSKYERSSDGSTPQVDLAHMLAARDQELRTLSAEMNQVTSELRLARSVIAERDTEIQKLLTTNKQYVEENE
RLRAILGEWSTRAAKLERALEAERISNLELQKRIATLKKLSHASETSEQQGS
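
The CDS and protein sequences above can be cross-checked by translating the CDS. The sketch below assumes the standard nuclear genetic code and uses only the Python standard library. `translate` is a hypothetical helper, and the two inputs are the first 54 and last 15 nucleotides of the CDS listed above.

```python
from itertools import product

# Standard genetic code in TCAG order; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein = []
    for i in range(0, len(cds), 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


cds_start = "ATGTTATTACTCATCATTCAAAACTCGTATGCTGAAATTAGTTGCAGAAGAATC"
cds_end = "CAGCAGGGGAGTTGA"
print(translate(cds_start))
print(translate(cds_end))
```

Passing the full CDS, with its line breaks removed, to the same function allows a comparison against the complete protein sequence.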